=head1 PRIMARY AUTHORS AND MAJOR CONTRIBUTORS TO BIOPERL

=head2 Releases co-ordinated and submitted by bioperl core devs.

=over

=item * Sendu Bala

=item * Chris Dagdigian

=item * Christopher Fields

=item * Mark Jensen

=item * Hilmar Lapp

=item * Heikki Lehväslaiho

=item * Aaron Mackey

=item * Brian Osborne

=item * Francisco J. Ossandon

=item * Jason Stajich

=item * Lincoln Stein

=back

=head2 Previous Bioperl Coordinators:

=over

=item * Ewan Birney

=item * Steven Brenner

=item * Georg Fuellen

=item * Steve Chervitz

=back

=head2 Major Contributors

(Feel free to add descriptions of which modules you are responsible for if you see fit)

=over

=item * Richard Adams

=item * Shuly Avraham - Bio::Graphics::Glyph

=item * Peter Blaiklock

=item * Benjamin Berman

=item * Matthew Betts

=item * David Block

=item * Kris Boulez

=item * Tim Bunce - code optimizations

=item * Scott Cain - Bio::Graphics::Glyph, Bio::DB::GFF::Adaptor::dbi::pg, GFF related tools and scripts

=item * Yee Man Chan - Bio::Tools::dpAlign

=item * Brad Chapman

=item * Roy R.
Chaudhuri - Bio::SeqUtils Bio::Align::Utilities

=item * Michele Clamp

=item * Malcolm Cook

=item * Tony Cox

=item * James Cuff

=item * Andrew Dalke

=item * Allen Day

=item * Jared Fox - Bio::SeqIO::interpro

=item * Brian O'Connor - Bio::TreeIO::svggraph

=item * James Diggans

=item * Peter Dimitrov - Bio::Ontology

=item * Rich Dobson - Bio::PopGen::IO::hapmap,phase

=item * Paul Edlefsen

=item * Rob Edwards - Bio::Restriction

=item * Arne Elofsson

=item * David Evans

=item * Mark Fiers

=item * The Fugu Team

=item * Luc Gauthier

=item * James Gilbert

=item * Nat Goodman

=item * Ed Green

=item * Matthew Hahn

=item * Roger Hall

=item * Todd Harris - SVG support in Bio::Graphics

=item * Mauricio Herrera Cuadra

=item * Ian Holmes

=item * Shawn Hoon

=item * Robert Hubley

=item * Joseph Insana - Bio::LiveSeq

=item * Donald Jackson - SiRNA

=item * Keith James - Bio::Tools::Geneid

=item * Mark A. Jensen - Bio::DB::HIVQuery, Bio::Search::Tiling

=item * Nicolas Joly

=item * Ian Korf

=item * Dan Kortschak

=item * Arek Kasprzyk

=item * Andreas Kähäri

=item * Charles C.
Kim

=item * Stefan Kirov - Bio::Matrix::PSM

=item * Balamurugan Kumarasamy

=item * Josh Lauricha - Bio::SeqIO::tigr

=item * Eckhard Lehmann

=item * Catherine Letondal

=item * Philip Lijnzaad

=item * Brad Marshall

=item * Chad Matsalla

=item * Andrew Macgregor

=item * Sheldon McKay

=item * Dave Messina - Deobfuscator, judicious meddling

=item * Chase Miller - Bio::Nexml and related IO modules

=item * Juha Muilu

=item * Chris Mungall

=item * Giri Narasimhan

=item * Xiaokang Pan - Bio::Graphics::Glyph

=item * Jong Park

=item * Matthew Pocock

=item * Lorenz Pollack - BPlite porting

=item * Richard Resnick - original Bio::Seq

=item * Todd Richmond

=item * Peter Schattner

=item * Torsten Seemann - Bio::Tools::Run::StandaloneBlast

=item * Martin Senger - Bio::Biblio

=item * Nigam Shah

=item * Shengqiang Shu - Bio::Graphics::Glyph

=item * Allen Smith - Bio::Matrix and Bio::SimpleAlign fixes

=item * Marc Sohrmann

=item * Robson Francisco de Souza - Bio::Assembly

=item * Mark Southern

=item * Will Spooner

=item * Arne Stabenau

=item * Elia Stupka

=item * Gert Thijs

=item * James Thompson - Bio::Matrix::PSM protein-related modules.

=item * Charles Tilford

=item * Anthony Underwood

=item * Paul-Christophe Varoutas

=item * Andrew G. Walsh

=item * Kai Wang

=item * Gary Williams

=item * Mark Wilkinson

=item * Helge Weissig

=item * Juguang Xiao

=item * Alex Zelensky - Bioperl-DB

=item * Peili Zhang

=item * Christian M.
Zmasek - Bio::Phenotype & Bio::Ontology

=back

Summary of important user-visible changes for BioPerl
-----------------------------------------------------

1.7.8     2021-02-02 23:02:18-06:00 America/Chicago
    * Bio::SeqIO::interpro has been moved to a separate repository to deal with issues with XML::DOM::XPath bitrot [#347]
    * Pull requests:
      * Adjust Swiss-Prot FT A..B lines [#348] from @smoe
      * Update %FTQUAL_NO_QUOTE: List of qualifiers without quote [#339] from @hdevillers

1.7.7     2019-12-07 13:41:36-06:00 America/Chicago
    * The program bp_chaos_plot has been removed.
    * GD is now no longer a dependency, suggestion or requirement.
    * #321 - GenBank format fix for un-quoted features, text wrapping
    * Bio::DB::Query::WebQuery now includes methods for delay(), delay_policy(), and a 'private' _sleep() function that mirror those from Bio::DB::WebDBSeqI, primarily for compliance with potential website restrictions for the number and frequency of queries (e.g. NCBI eUtils).
    * Fix bug #329, related to Bio::Tree::Statistics::transfer_bootstrap_expectation in last release.

1.7.6     2019-08-28 12:37:01+01:00 Europe/London
    * The program bp_classify_hits_kingdom has been removed and is now part of the examples documentation instead.
    * GD is now listed as a suggestion instead of a requirement. The bp_chaos_plot program will now work with the GD module.
    * New method Bio::Tree::Statistics::transfer_bootstrap_expectation to compute Transfer Bootstrap Expectation (TBE) for internal nodes based on the methods outlined in Lemoine et al, Nature, 2018.
    * New method Bio::SeqIO::fasta::next_seq_fast to retrieve next sequence in the stream faster but not perfect.

1.7.5     2019-02-11 14:57:45+00:00 Europe/London
    * The following modules have been removed from the BioPerl distribution to be part of a separate distribution with independent development:
          Bio::Symbol::*
    * The Bio::Seq::SeqWithQuality module, which had been deprecated since 2001, has finally been removed.
    * The deprecated() method has been deprecated. It is recommended to use Carp::carp to warn.
    * The following methods have been deprecated for a long while and have now been removed:
          Bio::Align::AlignI->no_residues
          Bio::Align::AlignI->no_sequences
          Bio::LocatableSeq->no_gap
          Bio::LocatableSeq->no_sequences
          Bio::SeqFeature::Generic->slurp_gff_file
          Bio::SimpleAlign->no_residues
          Bio::SimpleAlign->no_sequences

1.7.4     2019-02-05 16:23:53+00:00 Europe/London
    * Fix Bio::Root::Test, and the test suite, to properly check for internet connection and the NO_NETWORK_TESTING environment variable. Previously, tests that required internet connection were not being skipped, causing tests to fail.

1.7.3     2019-01-30 13:30:34+00:00 Europe/London
    * The following modules have been removed from the BioPerl distribution to be part of a separate distribution.
      They have been integrated into other module distributions for independent development:
          Bio::Align::Graphics
          Bio::AlignIO::nexml
          Bio::AlignIO::stockholm
          Bio::Assembly::*
          Bio::Cluster::*
          Bio::ClusterI::*
          Bio::ClusterIO::*
          Bio::DB::Ace
          Bio::DB::BioFetch
          Bio::DB::CUTG
          Bio::DB::EMBL
          Bio::DB::EntrezGene
          Bio::DB::Expression::*
          Bio::DB::GFF
          Bio::DB::GFF::Adaptor::*
          Bio::DB::GFF::Aggregator::*
          Bio::DB::GFF::Featname
          Bio::DB::GFF::Feature
          Bio::DB::GFF::Homol
          Bio::DB::GFF::RelSegment
          Bio::DB::GFF::Segment
          Bio::DB::GFF::Typename
          Bio::DB::GenBank
          Bio::DB::GenPept
          Bio::DB::HIV::*
          Bio::DB::MeSH
          Bio::DB::NCBIHelper
          Bio::DB::Query::GenBank
          Bio::DB::Query::HIVQuery
          Bio::DB::RefSeq
          Bio::DB::SeqFeature::*
          Bio::DB::SeqVersion::*
          Bio::DB::SwissProt
          Bio::DB::TFBS::*
          Bio::DB::Taxonomy::entrez
          Bio::DB::Taxonomy::sqlite
          Bio::DB::Universal
          Bio::Draw::Pictogram
          Bio::Factory::MapFactoryI
          Bio::Index::Hmmer
          Bio::Index::Stockholm
          Bio::LiveSeq::*
          Bio::Map::*
          Bio::MapIO::*
          Bio::MolEvol::CodonModel
          Bio::Nexml::Factory
          Bio::NexmlIO
          Bio::Perl
          Bio::Phenotype::*
          Bio::PhyloNetwork::*
          Bio::PopGen::*
          Bio::Restriction::*
          Bio::Root::Build
          Bio::Search::HSP::HMMERHSP
          Bio::Search::HSP::HmmpfamHSP
          Bio::Search::Hit::HMMERHit
          Bio::Search::Hit::HmmpfamHit
          Bio::Search::Hit::hmmer3Hit
          Bio::Search::Result::HMMERResult
          Bio::Search::Result::HmmpfamResult
          Bio::Search::Result::hmmer3Result
          Bio::SearchDist
          Bio::SearchIO::hmmer
          Bio::SearchIO::hmmer2
          Bio::SearchIO::hmmer3
          Bio::SearchIO::hmmer_pull
          Bio::SeqEvolution::*
          Bio::SeqFeature::SiRNA::*
          Bio::SeqIO::abi
          Bio::SeqIO::agave
          Bio::SeqIO::alf
          Bio::SeqIO::chadoxml
          Bio::SeqIO::chaos
          Bio::SeqIO::chaosxml
          Bio::SeqIO::ctf
          Bio::SeqIO::entrezgene
          Bio::SeqIO::excel
          Bio::SeqIO::exp
          Bio::SeqIO::flybase_chadoxml
          Bio::SeqIO::lasergene
          Bio::SeqIO::nexml
          Bio::SeqIO::pln
          Bio::SeqIO::strider
          Bio::SeqIO::ztr
          Bio::Structure::*
          Bio::Taxonomy::*
          Bio::Tools::AlignFactory
          Bio::Tools::Analysis::* (except SimpleAnalysisBase)
          Bio::Tools::Gel
          Bio::Tools::HMMER::*
          Bio::Tools::Hmmpfam
          Bio::Tools::Phylo::Gumby
          Bio::Tools::Protparam
          Bio::Tools::Run::RemoteBlast
          Bio::Tools::SiRNA::*
          Bio::Tools::dpAlign
          Bio::Tools::pSW
          Bio::Tree::AlleleNode
          Bio::Tree::Draw::Cladogram
          Bio::TreeIO::nexml
          Bio::TreeIO::svggraph
          Bio::Variation::*
    * The following modules are new in the BioPerl distribution. They have been previously released in the BioPerl-Run distribution. This will enable smaller distributions that provide a Bio::Tools::Run interface to depend only on the BioPerl distribution instead of the whole (very large) BioPerl-Run:
          Bio::Tools::Run::Analysis
          Bio::Tools::Run::AnalysisFactory
          Bio::Tools::Run::Phylo::PhyloBase
          Bio::Tools::Run::WrapperBase
          Bio::Tools::Run::WrapperBase::CommandExts
    * The following programs have been removed:
          bp_biofetch_genbank_proxy
          bp_blast2tree
          bp_bulk_load_gff
          bp_composite_LD
          bp_das_server
          bp_download_query_genbank
          bp_fast_load_gff
          bp_flanks
          bp_genbank2gff
          bp_generate_histogram
          bp_heterogeneity_test
          bp_hivq
          bp_hmmer_to_table
          bp_load_gff
          bp_meta_gff
          bp_netinstall
          bp_parse_hmmsearch
          bp_process_wormbase
          bp_query_entrez_taxa
          bp_remote_blast
          bp_seqfeature_delete
          bp_seqfeature_gff3
          bp_seqfeature_load
    * Because of the move of so many modules and programs into separate distributions, the following modules are no longer prerequisites:
          Ace
          Ace::Sequence::Homol
          Algorithm::Munkres
          Apache::DBI
          Archive::Tar
          Array::Compare
          Bio::ASN1::EntrezGene
          Bio::Expression::Contact
          Bio::Expression::DataSet
          Bio::Expression::Platform
          Bio::Expression::Sample
          Bio::Ext::Align
          Bio::GMOD::CMap::Utils
          Bio::Phylo::Factory
          Bio::Phylo::Forest::Tree
          Bio::Phylo::IO
          Bio::Phylo::Matrices
          Bio::Phylo::Matrices::Datum
          Bio::Phylo::Matrices::Matrix
          Bio::SeqFeature::Annotated
          Bio::SeqIO::staden::read
          Bio::Tools::Run::Alignment::Clustalw
          Bio::Tools::Run::Ensembl
          Bio::Tools::Run::Phylo::Molphy::ProtML
          Bio::Tools::Run::Phylo::Phylip::Neighbor
          Bio::Tools::Run::Phylo::Phylip::ProtDist
          Bio::Tools::Run::Phylo::Phylip::ProtPars
          Bio::Tools::Run::Samtools
          CGI
          CPAN
          Cache::FileCache
          Config
          Convert::Binary::C
          DBD::Pg
          DBD::SQLite
          Data::Stag::XMLWriter
          Encode
          English
          ExtUtils::Install
          ExtUtils::Manifest
          File::Glob
          GD::Simple
          Getopt::Std
          Graph::Undirected
          GraphViz
          HTML::HeadParser
          HTML::TableExtract
          LWP
          LWP::Simple
          MIME::Base64
          Memoize
          PostScript::TextBlock
          SVG
          SVG::Graph
          SVG::Graph::Data
          SVG::Graph::Data::Node
          SVG::Graph::Data::Tree
          Sort::Naturally
          Spreadsheet::ParseExcel
          Term::ReadLine
          Text::NSP::Measures::2D::Fisher2::twotailed
          Text::ParseWords
          Time::Local
          Tree::DAG_Node
          URI::Escape
          WWW::Mechanize
          XML::Simple
    * The following is a new prerequisite:
          Test::RequiresInternet
    * The deobfuscator has been removed.
    * The emacs bioperl minor mode is no longer distributed as part of the perl module distributions. See https://github.com/bioperl/emacs-bioperl-mode

1.7.2 - "Entebbe"

    [Bugs]
    * #247 - Omit unnecessary parent_id attribute added by GFF3Loader [nathanweeks]
    * #245 - Code coverage fixes [zmughal,cjfields]
    * #237 - Fix warning in Bio::DB::IndexedBase [willmclaren,bosborne]
    * #238 - Use a Travis cron job for network tests [zmughal,cjfields]
    * #218 - Bio::DB::Flat::BinarySearch should use _fh() instead of fh() as fh() does not take arguments in [thibauthourlier,bosborne]
    * #227 - Bio::SeqIO Ignores first line of sequence [VAR121,bosborne]
    * #223 - Use Travis Perl helper script and enable coverage [zmughal,cjfields]
    * #222 - Fix test RemoteDB/Taxonomy.t: requires networking [zmughal,cjfields]
    * #216 - Apply carsonhh's patch (Inline::C fixes) [carsonh,bosborne]
    * #213 - Support FTS5 in Bio::DB::SeqFeature::Store::DBI::SQLite [nathanweeks,bosborne]
    * #210 - Sort qualifiers when writing EMBL files [hdevillers,cjfields]
    * #209 - Fixed bug in _toDsspKey() [jvolkening,hlapp]

    [Code changes]
    * PAML-related code from bioperl and bioperl-run is now in a separate distribution on CPAN [carandraug]

1.7.1 - "Election"

    [Bugs]
    * Minor release to incorporate fix for CPAN indexing, which prevented proper updates [cjfields]
    * Fix problem in managing Target attribute
      for gff3 [Jukes34]
    * Minor bug fixes related to NCBI HTTPS support [cjfields]

1.7.0 - "Disney"

    [New site]
    * We have migrated to GitHub Pages. This was actually planned, but the recent OBF server compromise forced our hand. Brian Osborne [bosborne] took this under his wing to move docs and has done a tremendous amount of work formatting the site and working out some of the idiosyncrasies with the new Jekyll-based design. Mark Jensen, Paul Cantalupo and Francisco Ossandon also helped. Kudos!!
    * Similarly, the official issue tracker is now GitHub Issues. This has been updated in the relevant documentation bits (we hope!)

    [Code changes]
    * Previously deprecated modules removed
      * Bio::Tools::Infernal, Bio::Tools::ERPIN, Bio::Tools::RNAMotif
    * Bio::DB::SeqHound has been removed due to the service no longer being available
    * Bio::Tools::Analysis::Protein::Mitoprot has been removed for security reasons due to the server no longer having a valid cert
    * Bio::EUtilities, Bio::Biblio are now separate releases on CPAN
    * Bio::Coordinate, Bio::SearchIO::blastxml, Bio::SearchIO::Writer::BSMLResultWriter are now separate releases to be added on CPAN

    [New features]
    * Docker instances of tagged releases are available!
      [hlapp]
    * NCBI HTTPS support [mjohnson and others]
    * Bio::SearchIO::infernal
      - Issue #131: added CMSEARCH parsing support for Infernal 1.1 [pcantalupo]
    * Bio::Search::HSP::ModelHSP
      - Added a 'noncanonical_string' method to retrieve the NC line from CMSEARCH reports [pcantalupo]
    * Bio::Search::Result::INFERNALResult
      - Added new module to represent features of Infernal reports [pcantalupo]
    * Bio::DB::Taxonomy SQLite option [cjfields]
    * WrapperBase quoted option values [majensen]
    * Various documentation fixes and updates [bosborne]

    [Bug Fixes]
    * Fixes in Bio::Root::Build to deal with META.json/yml for CPAN indexing [cjfields]
    * Bio::SeqFeature::Generic spliced_seq() bug fix [Eric Snyder, via bosborne]
    * NeXML parser fixes [fjossandon]
    * Bug fix for Bio::DB::SeqFeature memory adapter [lstein]
    * RT 103272: SeqFeature database deletion skipped features with a decimal - Joshua Fortriede (Xenbase)
    * RT 98374: AlignIO issues with sequence names not correctly parsing - Xiaoyu Zhuo
    * Issue #70: CONTIG parsing in GenBank output fixed [fjossandon]
    * Issue #76: Circular genome fixes with Bio::Location::Split [fjossandon]
    * Issue #80: Fix lack of caching issue with Bio::DB::Taxonomy [fjossandon]
    * Issue #81: Small updates to make sure possible memory leaks are detected [cjfields]
    * Issue #84: EMBL format wrapping problem [nyamned]
    * Issue #90: Missing entries for translation tables 24 and 25 [fjossandon]
    * Issue #95: Speed up of Bio::DB::Fasta::subseq by using a compiled regex or compiled C code (when Inline::C is installed) [rocky]
    * Fix various Bio::Tools::Analysis remote server config problems [cjfields]
    * Added several missing 'Data::Stag' and 'LWP::UserAgent' requirements [fjossandon]
    * Added a workaround in Bio::DB::Registry to get Username in Windows [fjossandon]
    * For HMMer report parsing, changed "$hsp->bits" to return 0 instead of undef to be consistent with "$hit->bits" behaviour [fjossandon]
    * Fixed a bug in HMMer3 parsing, where a homology line ending in CS or RF
      amino acids made "next_seq" confused and broke the parser [fjossandon]
    * Adjusted FTLocationFactory.pm to comply with current GenBank Feature Table Definition, so now "join(complement(C..D),complement(A..B))" is equivalent to "complement(join(A..B,C..D))" [fjossandon]
    * For the many many many fixes that weren't mentioned - blame the release guy!

1.6.924

    [Significant changes]
    * Bug/feature issue tracking has moved to GitHub Issues: https://github.com/bioperl/bioperl-live/issues
    * DB_File has been demoted from "required" to "recommended", which should make it easier for Windows users to install BioPerl if they don't need that module.

    [New features]
    * Bio::Search::HSP::GenericHSP
      - Bug #3370, added a "posterior_string" method to retrieve the posterior probability lines (PP) from HMMER3 reports [fjossandon]
      - Added a "consensus_string" method to retrieve the consensus structure lines (CS|RF) from HMMER2 and HMMER3 reports when available [fjossandon]
    * Bio::SearchIO::hmmer2
      - The numbers of identical and conserved residues are now calculated directly from the homology line [fjossandon]
      - Now the Query Length and Hit Length are reported when the alignment runs until the end of the sequence/model ('.]' or '[]') [fjossandon]
      - Implemented the capture of the consensus structure lines [fjossandon]
    * Bio::SearchIO::hmmer3
      - The numbers of identical and conserved residues are now calculated directly from the homology line [fjossandon]
      - Now the Hit Length is reported when the alignment runs until the end of the sequence/model ('.]' or '[]') [fjossandon]
      - Implemented the capture of the consensus structure lines [fjossandon]
      - Implemented the capture of the posterior probability lines [fjossandon]
      - Completed the development of NHMMER parsing, including alignments [fjossandon]
    * Bio::SearchIO::SearchResultEventBuilder & Bio::SearchIO::IteratedSearchResultEventBuilder
      - Feature #2615, moved "_init_parse_params", "max_significance", "signif", "min_score", "min_bits", and "hit_filter"
        methods from 'IteratedSearchResultEventBuilder' to parent 'SearchResultEventBuilder'. This means that the Bio::SearchIO->new() parameters '-signif', '-score', '-bits' and '-hit_filter' will now work with other Bio::SearchIO formats besides Blast, instead of being ignored. Added tests for all moved methods using HMMER outputs and ran the full test suite; everything passes [fjossandon]
    * Bio::SeqIO::MultiFile
      - Autodetection of file format [fangly]
    * Bio::Tools::GuessSeqFormat:
      - Format detection from non-seekable filehandles such as STDIN [fangly]

    [Bug fixes]
    * Fix problems when using Storable as backend for cloning [v1.6.x branch, tsibley]
    * Fix potential problems with Storable in Bio::DB::SeqFeature::Store [tsibley]
    * SeqFeature::Lite: Fixed wrong strand when using "+", "-", or "." [nathanweeks]
    * Abstract: Fixed ActivePerl incapability of removing temporary files because of problems closing tied filehandles [fjossandon]
    * IndexedBase: For Windows' ActivePerl, several LocalDB tests were failing because ActivePerl was producing ".index.pag" and ".index.dir" files instead of a single ".index" file (like Strawberry Perl). Now those temporary files are correctly considered and deleted. [fjossandon]
    * Test files: Added missing module requirements (DB_File and Data::Stag) to several test files that were failing because those modules were not present. Now those test files are correctly skipped instead. [fjossandon]
    * Blast: Added support for changes in bl2seq from BLAST+ output, which now uses "Subject=" instead of ">" to start hit lines [yschensandiego]
    * Phylip: Return undef in "next_aln" at file end to avoid an infinite loop [yschensandiego]
    * HMMER3: When a hit description is too long, it is truncated in the Scores table. In those cases, the more complete description from the Annotation line (>>) will be used [fjossandon]
    * GenericHSP: Added '.'
      to gap symbols in "_pre_gaps" (except for ERPIN), since it is now used by HMMER3 format in alignments [fjossandon]
    * GenericHit: Changed "frac_aligned_query" and "frac_aligned_hit" to return undef if the query/hit length is unknown (like in some HMMER outputs), to avoid division by 0 crashes. Also "query_length" is now set to 0 if it is undefined, to be consistent with hit "length" [fjossandon]
    * HMMER: fixed many bugs in the parsing of Hmmer2 and Hmmer3 outputs, added support for multi-query reports, reduced code redundancy, and eliminated the automatic removal of hits below "inclusion threshold" [fjossandon]
    * [3369] - Fixed reported bugs in parsing of HMMSEARCH3 reports [fjossandon]
    * [3446] - Fixed wrong marker position in Bio::Map::Physical [fjossandon]
    * [3455] - Fixed wrong print of DBLink in Genbank file [bosborne]
    * Fixed some Bio::Root::Utilities subroutines [fjossandon]
    * Double-quotes on paths are needed in some places [fjossandon]
    * [3453] - Allow multiple homologies and products in Entrezgene [fjossandon]
    * Use "NUL" instead of "/dev/null" when running in Windows [fjossandon]
    * Updated all files from Bio-Root, Bio-Coordinate and Bio-SearchIO-blastxml with the latest changes made in their own repositories [fjossandon]
    * General synching of files with the master branch [fjossandon]
    * Fixed tests failing in Windows because of using Linux commands [fjossandon]
    * Closed many open filehandles that prevented temporary file deletion [fjossandon]
    * Fixed broken MeSH parser [fjossandon]
    * Fixed missing detection of format in SeqIO when given a -string [fangly]

1.6.923

    * Major Windows support updates!
      [fjossandon]
    * MAKER update to allow for stricter standard codon table [cjfields]
    * Better support for circular sequences [fjossandon]
    * Fixes for some complex location types [fjossandon]
    * Address CONTIG bug in GenBank format, bug #3448 [cjfields]
    * Fix bug #2978 related to BLAST report type [fjossandon]
    * Deobfuscator fixes [DaveMessina]

1.6.922

    * Address CPAN test failures [cjfields]
    * Add BIOPROJECT support for Genbank files [hyphaltip]
    * Better regex support for HMMER3 output [bosborne]

1.6.921

    * Minor update to address CPAN test failures

1.6.920

    * Remove Bio::Biblio and related files [carandraug]
      - this caused version clashes with an independently-released version of Bio::Biblio

1.6.910

    [New features]
    * Hash randomization fixes for perl 5.18.x
      - Note: at least one module (Bio::Map::Physical) still has a failing test; this is documented in bug #3446 and has been TODO'd; we will be pulling Bio::Map and similar modules out of core into separate distributions in the 1.7.x release series [cjfields]

    [New features]
    * Bio::Seq::SimulatedRead
      - New module to represent reads taken from other sequences [fangly]
    * Bio::Root::Root
      - Support of Clone::Fast as a faster cloning alternative [fangly]
    * Bio::Root::IO
      - Moved the format() and variant() methods from Bio::*IO modules to Bio::Root::IO [fangly]
      - Can now use format() to get the type of IO format in use [fangly]
    * Bio::Tools::IUPAC
      - New regexp() method to create regular expressions from IUPAC sequences [fangly]
    * Bio::SeqFeature::Primer and Bio::Seq::PrimedSeq:
      - Code refresh [fangly]
    * Bio::DB::Taxonomy
      - Added support for the Greengenes and Silva taxonomies [fangly]
    * Bio::Tree::TreeFunctionsI
      - get_lineage_string() represents a lineage as a string [fangly]
      - add_trait() returns instead of reporting an error when the column number is exceeded in add_trait() [fangly]
      - Option to support tree leaves without trait [fangly]
      - Allow ID of 0 in trait files [fangly]
    * Bio::DB::Taxonomy::list
      - Misc optimizations [fangly]
      - Option -names of get_taxon() to help with ambiguous taxa [fangly]
    * Bio::DB::Taxonomy::*
      - get_num_taxa() returns the number of taxa in the database [fangly]
    * Bio::DB::Fasta and Bio::DB::Qual
      - support indexing an arbitrary list of files [fangly]
      - user can supply an arbitrary index file name [fangly]
      - new option to remove index file at the end [fangly]
    * Bio::DB::Fasta
      - now handles IUPAC degenerate residues [fangly]
    * Bio::PrimarySeq and Bio::PrimarySeqI
      - speed improvements for large sequences [Ben Woodcroft, fangly]
    * Bio::PrimaryQual
      - tightened and optimized quality string validation [fangly]
    * Bio::SeqIO::fasta
      - new method and option 'block', to create FASTA output with space-separated blocks (similar to genbank or EMBL), has been implemented.
      - package variables $WIDTH and $DEFAULT_SEQ_ID_TYPE have been removed in favour of the methods 'width' and 'preferred_id_type' respectively.
    * Bio::FeatureIO::*
      - moved from bioperl-live into the separate distribution Bio-FeatureIO
    * Bio::SeqFeature::Annotated
      - moved from bioperl-live into the separate distribution Bio-FeatureIO
    * Bio::Cluster::SequenceFamily
      - improved performance when using get_members with overlapping multiple criteria
    * Bio::SearchIO::hmmer3
      - now supports nhmmer [bosborne]

    [Bug fixes]
    * [3302] Fixes bug in Bio::SearchIO::hmmer2.pm to correctly parse multi-query hmmer output [Francisco J. Ossandon, Paul Cantalupo]
    * [3421] Fixes bug in Bio::SearchIO::hmmer2.pm to correctly parse an HSP with a line full of dashes [Francisco J.
      Ossandon, Paul Cantalupo]
    * [3298] Fix bug in Bio::SearchIO::blast.pm where algorithm version information was lost in a multi-result blast file [Paul Cantalupo]
    * [3343] Fix bug in Bio::SearchIO::blasttable.pm to correctly calculate total gaps [Paul Cantalupo]
    * [3375] Fix DBLINK parsing bug in Bio::SeqIO::genbank.pm [Paul Cantalupo]
    * [3376] Fix bug in Bio::SearchIO::hmmer2.pm to correctly handle case when end of domain indicator is split across lines [Paul Cantalupo]
    * [3240] Bio::AlignIO::stockholm now parses simple sequences [Bernd Web, cjfields]
    * [3237] Bio::DB::Fasta now allows blank lines between sequences, catches instances where blank lines are within sequences [cjfields]
    * Bio::DB::Fasta reports correct alphabet for files with multiple sequence types [fangly]
    * Bio::DB::Fasta rev-comps sequences other than DNA properly [fangly]
    * [3238] Fixes for Bio::DB::SeqFeature::Store::DBI::Pg [Thomas Burkhard, cjfields]
    * Various fixes for Stockholm file indexing and processing [bosborne]
    * Fix edge case in FASTQ parsing where sequence of length 1 and qual of 0 breaks parsing [cjfields]
    * Fix case where Bio::Seq::Meta* objects with no meta information could not be reverse-complemented [fangly]
    * Fix bug for fields without aliases in Bio::DB::Query::HIVQuery [fangly]
    * Fix Bio::PopGen::IO::phase: sort values lexically instead of numerically when unsure that values will be numerical [fangly]
    * Fix undef warnings in Bio::SeqIO::embl [fangly]
    * Fix undef warnings in Bio::DB::Fasta and Bio::DB::Qual [fangly]
    * Fix: Bio::Tools::IUPAC should accept any sequence object [fangly]
    * Fix for 'Inappropriate ioctl' in Bio::DB::Store::berkeleydb3 [Olivier Sallou]
    * Bio::SeqFeature::Generic SeqFeatureI compliance: methods primary_tag, source_tag and display_name must return a string, not undef [fangly]
    * Bio::SimpleAlign and Bio::Seq compliance with Bio::FeatureHolderI: add_SeqFeature takes a single argument [fangly]
    * Use cross-platform filenames and temporary directory in
      Bio::DB::Taxonomy::flatfile [fangly]
    * Fix bug in Bio::DB::Taxonomy::list where taxa with no ancestors were not properly identified as existing taxa in the database [fangly]
    * Fix issue where a Bio::DB::Taxonomy::list object could not be created without also passing a lineage to store [fangly]
    * Prevent passing a directory to the gi2taxid option (-g) of bp_classify_hits_kingdom.pl and remove an 'earlier declaration' warning [fangly]
    * Fixed bp_genbank2gff3.pl crash when missing source feature date [fangly]
    * Bio::PrimarySeq constructor -direct works for -seq or -ref_to_seq [fangly]
    * Bio::Cluster::SequenceFamily
      - checks if the sequence has a Bio::Species object before trying to access, and no longer returns repeated sequences.

1.6.901 May 18, 2011

    [Notes]
    * Use of AcePerl is deprecated; Ace.pm isn't actively maintained, and modules using Ace will also be deprecated [lds, cjfields]
    * Minor bug fix release
    * Bio::SeqIO::gbxml tests require XML::SAX [hartzell]
    * Address Build.PL issues when DBI is not present [hartzell]
    * Skip gbxml.t and Interpro tests when modules not installed [cjfields]
    * Remove deprecated code for perl 5.14.0 compat [cjfields]
    * Due to schema changes and lack of support for older versions, support for NeXML 0.9 is only (very) partially implemented. See: https://redmine.open-bio.org/issues/3207

    [Bug fixes]
    * [3205] - small fix to Bio::Perl blast_sequence() to make compliant with docs [genehack, cjfields]
    * $VERSION for CPAN/cpanm-based installs was broken; force setting of module version from dist_version (probably not the best way to do this, but it seems to work) [rbuels, cjfields]

1.6.900 April 14, 2011

    [Notes]
    * This will probably be the last release to add significant features to core modules; subsequent releases will be for bug fixes alone. We are planning on a restructuring of core for summer 2011, potentially as part of the Google Summer of Code. This may become BioPerl 2.0.
    * Version bump represents 'just prior to v 1.7'.
      We may have point releases to deal with bugs, with increments of 1.6.901, 1.6.902, etc. This code essentially is what is on the github master branch.

    [New features]
    * Core code updated for perl 5.12.x [cjfields, Charles Tilford]
    * Bio::Tree refactor
      - major overhaul of Bio::Tree code by Greg Jordan, fixes several bugs
      - removal of Scalar::Util::weaken code, which was causing odd headaches with premature GC, memory leaks with perl 5.10.0, etc [cjfields]
    * Bio::DB::SeqFeature bug fixes for GBrowse2 compatibility [lds, scottcain, many others]
    * Bio::SeqIO::msout, Bio::SeqIO::mbsout - parsers for ms and mbs [Warren Kretzschmar]
    * Bio::SeqIO::gbxml
      - bug 2515 - new contribution [Ryan Golhar, jhannah]
    * Bio::Assembly::IO
      - support for reading Maq, Sam and Bowtie files [maj]
      - support for reading 454 GS Assembler (Newbler) ACE files [fangly]
      - bug 2483: support for writing ACE files [Joshua Udall, fangly]
      - bug 2599: support DBLINK annotation in GenBank files [cjfields]
      - bug 2726: reading/writing granularity: whole scaffold or one contig at a time [Joshua Udall, fangly]
    * Bio::OntologyIO
      - Added parsing of xrefs to OBO files, which are stored as secondary dbxrefs of the cvterm [Naama Menda]
      - General Interpro-related code refactors [dukeleto, rbuels, cjfields]
    * PAML code updated to work with PAML 4.4d [DaveMessina]

    [Bug fixes]
    * [3198] - sort tabular BLAST hits by score [DaveMessina]
    * [3196] - fix invalid metadata produced by latest Module::Build [cjfields]
    * [3190] - RemoteBlast GAPCOSTS regex fix [Ali Walsh, cjfields]
    * [3185] - Bio::Tools::SeqStats->get_mol_wt now gives correct MW [cjfields]
    * [3178] - fix tr/// issue in Bio::Range [Andrew Conley, cjfields]
    * [3172] - Bio::DB::Fasta - catch possibly bad FASTA files [cjfields]
    * [3164] - TreeFunctionsI syntax bug [gjuggler]
    * [3163] - AssemblyIO speedup [fangly]
    * [3160] - Bio::SearchIO::Writer::TextResultWriter output [Paul Cantalupo, hyphaltip]
    * [3159] - add SwissPfam support to bp_index.PLS [hyphaltip]
    * [3158] -
      fix EMBL file mis-parsing [cjfields]
    * [3157] - Bio::Restriction::Analysis 'sizes' method fixed [Marc Perry, cjfields]
    * [3153] - fix SeqIO::swiss TagTree issues [Charles Tilford, cjfields]
    * [3148] - URL change for UniProt [cjfields]
    * [3145] - AXT off-by-1 error [Aaron Goodman, cjfields]
    * [3136] - HMMer3 parser fixes [kblin]
    * [3126] - catch description [Toshihiko Akiba]
    * [3122] - Catch instances where non-seekable filehandles were being seek'd w/o checking for status [Stefan Kirov, Roy Chaudhuri]
    * [3121] - Bio::OntologyIO cannot parse the full InterPro XML file [dukeleto, rbuels, cjfields]
    * [3120] - bp_seqfeature_gff3.pl round-trip fixes [genehack, David Breimann, jhannah]
    * [3116,3117] - perl 5.12.x warnings fixed [cjfields, Charles Tilford]
    * [3110] - Better 'namespace' support for bp_seqfeature_load.PLS [dbolser, cjfields]
    * [3107] - BLAST alignment column_from_residue_number() [cjfields]
    * [3104] - Bio::Species single node hierarchies [Charles Tilford, cjfields]
    * [3092, 3090] - parsing of BLAST HSP stats [Razi Khaja, cjfields]
    * [3089] - HSPTableWriter missing methods [Robson de Souza, cjfields]
    * [3086] - EMBL misparsing long tags [kblin, cjfields]
    * [3085] - CommandExts and array of files [maj, hyphaltip]
    * [3077] - Bio::SimpleAlign slice() now correctly computes seq coordinates for alignment slices [Ha X. Dang, cjfields]
    * [3076] - XMFA alignment strand wrong [Ha X., cjfields]
    * [3073] - fix parsing of GenBank files from RDP [cjfields]
    * [3068] - FASTQ parse failure with trailing 0 [cjfields]
    * [3064] - All-gap midline BLAST report issues [cjfields]
    * [3063] - BLAST report RID [Razi Khaja, cjfields]
    * [3058] - SearchIO::fasta parsing [DaveMessina, cjfields]
    * [3053] - LOCUS line formatting [M.
      Wayne, cjfields]
    * [3039] - correct Newick output root node branch length [gjuggler, DaveMessina]
    * [3038] - SELEX alignment error [Bernd, cjfields]
    * [3033] - PrimarySeq ID setting [Bernd, maj]
    * [3032] - Fgenesh errors [Wes Barris, hyphaltip]
    * [3034] - AlignIO::clustal output [Bernd, DaveMessina]
    * [3031] - Parse algorithm ref for BLAST [Razi Khaja, cjfields]
    * [3028] - Bio::TreeIO::nexus and FigTree compat [Kevin Balbi, cjfields]
    * [3025] - Bio::SeqIO::embl infinite loop [Adam Sjøgren, cjfields]
    * [3040, 3023, 2974, 2921, 2753, 2636, 2482] - PAML parser fixed, works with PAML 4.4d [DaveMessina]
    * [3015, 3022] - Bio::Restriction withrefm regexp [Emmanuel Quevillon, DaveMessina]
    * [3020] - GFF3Loader alias attribute [Nathan Weeks, cjfields]
    * [3018, 3019, 3021] - gmap_f9 parsing [Kiran Mukhyala, cjfields]
    * [3017] - using threads with Bio::DB::GenBank [cjfields]
    * [3012] - Bio::Root::HTTPget fixes [maj, cjfields]
    * [3011] - namespace support for SF::Store::DBI::Pg [Adam Witney, cjfields]
    * [3002] - Bio::DB::EUtilities NCBI policy updates [cjfields]
    * [3001] - seq identifier '0' dropped with FASTA [Michael Kuhn, maj]
    * [2984] - let LocatableSeq decide on length of phylip aln [Adam Witney, cjfields]
    * [2983] - fix score/percent ID mixup [Alexie Papanicolaou]
    * [2977] - TreeIO issues [DaveMessina]
    * [2959] - Bio::SeqUtils->revcom_with_features [Roy Chaudhuri, maj]
    * [2944] - Bio::Tools::GFF score [cjfields]
    * [2942] - correct MapTiling output [maj]
    * [2939] - PDB residue insertion codes [John May, maj]
    * [2930] - PrimarySeqI term symbol [Adam Sjøgren, maj]
    * [2928] - GuessSeqFormat raw [maj]
    * [2926] - Bio::Tools::TandemRepeatsFinder seq_id [takadonet, cjfields]
    * [2922] - open() directive issue [cjfields]
    * [2915] - GenBank parser infinite loop [Francisco Ossandon, cjfields]
    * [2901] - DNAStatistics div by zero error [Janet Young, cjfields]
    * [2899] - SeqFeature::Store host issues [lstein, dbolser]
    * [2897] - Add a "mask_below_threshold" method to Seq::Quality
      [dbolser, cjfields]
    * [2881] - .scf files don't roundtrip [Adam Sjøgren, cjfields]
    * [2876] - CDD search with RemoteBlast [Malcolm Cook]
    * [2863] - Root::IO::_initialize_io causes crash [rbuels, maj, DaveMessina]
    * [2845] - Bio::Seq::Quality gives seq with no ID [Tristan Lefebure, cjfields]
    * [2843] - FeatureIO BED to GFF fails w/ no phase [cassjm, cjfields]
    * [2773] - Bio::Tree::Node premature GC [Morgan Price, cjfields]
    * [2764] - add ID Tracker helper for SwissProt [heikki, cjfields]
    * [2758] - Bio::AssemblyIO ace problems [fangly]
    * [2744] - Bio::LocatableSeq::end [Bernd, cjfields]
    * [2726] - ace file IO [Josh, fangly]
    * [2700] - Refactor Build.PL [cjfields]
    * [2673] - addition of simple Root-based clone() method [cjfields]
    * [2648] - Bio::Assembly::Scaffold->get_all_seq_ids [dbolser, fangly]
    * [2599] - support for DBLINK annotation in GenBank files [cjfields]
    * [2594] - Bio::Species memory leak [cjfields]
    * [2515] - GenBank XML parser [jhannah]
    * [2499] - Method "pi" in package Bio::PopGen::Statistics [hyphaltip]
    * [2483] - Bio::Assembly::IO::ace write_assembly implemented [fangly]
    * [2350] - ID consistency btwn Bio::SeqI, Bio::Align::AlignI [fangly, cjfields]
    * [1572] - no docs Bio::Location::Simple/Atomic::trunc [hyphaltip]

    [Deprecated]

    * Bio::Expression modules - these were originally designed to go with the bioperl-microarray suite of tools, however they have never been completed and so have been removed from the distribution. The original code has been moved into the inactive bioperl-microarray suite. [cjfields]

    [Other]

    * Repository moved from Subversion (SVN) to http://github.com/bioperl/bioperl-live [cjfields]
    * Bug database has moved to Redmine (https://redmine.open-bio.org)
    * Bio::Microarray - the tools developed for ReSeq chip analysis by Marian Thieme have been moved to their own distribution (Bio-Microarray). [cjfields]

1.6.1 Sept. 29, 2009 (point release)

    * No change from last alpha except VERSION and doc updates [cjfields]

1.6.0_6 Sept.
27, 2009 (sixth 1.6.1 alpha)

    * Fix for silent OBDA bug related to FASTA validation [cjfields]

1.6.0_5 Sept. 27, 2009 (fifth 1.6.1 alpha)

    * Possible fix for RT 49950 (Strawberry Perl installation) [cjfields]
    * [RT 50048] - removed redundant VERSION, which was borking CPANPLUS [cjfields]
    * BioPerl.pod -> BioPerl.pm (Perl Best Practices) [cjfields]

1.6.0_4 Sept. 25, 2009 (fourth 1.6.1 alpha)

    * WinXP test fixes [cjfields, maj]
    * BioPerl.pod added for descriptive information, fixes CPAN indexing [cjfields]
    * Minor doc fixes [cjfields]

1.6.0_3 Sept. 22, 2009 (third 1.6.1 alpha)

    * Fix tests failing due to merging issues [cjfields]
    * More documentation updates for POD parsing [cjfields]

1.6.0_2 Sept. 22, 2009 (second 1.6.1 alpha)

    * Bio::Root::Build
        - fix YAML meta data generation [cjfields]

1.6.0_1 Sept. 15, 2009 (first 1.6.1 alpha)

    * Bio::Align::DNAStatistics
        - fix divide by zero problem [jason]
    * Bio::AlignIO::*
        - bug 2813 - fix faulty logic to detect end-of-stream [cjfields]
    * Bio::AlignIO::stockholm
        - bug 2796 - fix faulty logic to detect end-of-stream [cjfields]
    * Bio::Assembly::Tools::ContigSpectrum
        - function to score contig spectrum [fangly]
    * Bio::DB::EUtilities
        - small updates [cjfields]
    * Bio::DB::Fasta
        - berkeleydb database now autoindexes wig files and locks correctly [lstein]
    * Bio::DB::HIV
        - various small updates for stability; tracking changes to LANL database interface [maj]
    * Bio::DB::SeqFeature (lots of updates and changes)
        - add Pg, SQLite, and faster BerkeleyDB implementations [lstein, scain]
        - bug 2835 - patch [Dan Bolser]
        - bug RT 44535 - patch FeatureFileLoader [Cathy Gresham]
    * Bio::DB::SwissProt
        - bug 2764 - idtracker() method [cjfields, courtesy Neil Saunders]
    * Bio::Factory::FTLocationFactory
        - mailing list bug fix [cjfields]
    * Bio::LocatableSeq
        - performance work on column_from_residue_number [hartzell]
    * Bio::Matrix::IO::phylip
        - bug 2800 - patch to fix phylip parsing [Wei Zou]
    * Bio::Nexml
        - Google Summer of Code project from Chase Miller -
          parsers for Nexml file format [maj, chmille4]
    * Bio::PopGen
        - Make Individual, Population, Marker objects AnnotatableI [maj]
        - simplify LD code [jason]
    * Bio::RangeI
        - deal with empty intersection [jason]
    * Bio::Restriction
        - significant overhaul of Bio::Restriction system: complete support for external and non-palindromic cutters. [maj]
    * Bio::Root::Build
        - CPANPLUS support, no automatic installation [sendu]
    * Bio::Root::IO
        - allow IO::String (regression fix) [cjfields]
        - catch unintentional undef values [cjfields]
        - throw if non-fh is passed to -fh [maj]
    * Bio::Root::Root/RootI
        - small debugging and core fixes [cjfields]
    * Bio::Root::Test
        - bug RT 48813 - fix for Strawberry Perl bug [kmx]
    * Bio::Root::Utilities
        - bug 2737 - better warnings [cjfields]
    * Bio::Search
        - tiling completely refactored, HOWTO added [maj]
          NOTE: Bio::Search::Hit::* classes do not use this code directly; we will deprecate usage of the older tiling code in the next BioPerl release
        - small fixes [cjfields]
    * Bio::SearchIO
        - Infernal 1.0 output now parsed [cjfields]
        - new parser for gmap -f9 output [hartzell]
        - bug 2852 - fix infinite loop in some output [cjfields]
        - blastxml output now passes all TODO tests [cjfields]
        - bug 2346, 2850 - psl and exonerate parsing fixes [rbuels, jhannah, bvecchi, YAPC hackathon]
        - RT 44782 - GbrowseGFF writer now catches evalues [Allen Day]
        - bug 2575 - add two columns of additional output to HSPTableWriter [cjfields]
    * Bio::Seq::LargePrimarySeq
        - delete tempdirs [cjfields]
        - bug fixes [rbuels, jhannah, bvecchi, YAPC hackathon]
    * Bio::Seq::Quality
        - extract regions based on quality threshold value [Dan Bolser, heikki]
        - bug 2847 - resolve threshold issue [rbuels, jhannah, bvecchi]
    * Bio::SeqFeature::Lite
        - various Bio::DB::SeqFeature-related fixes [lstein]
    * Bio::SeqFeature::Tools::TypeMapper
        - additional terms for GenBank to SO map [scain]
    * Bio::SeqIO::chadoxml
        - bug 2785 - patch to get this working for bp_seqconvert [cjfields]
    * Bio::SeqIO::embl
        - support
          for CDS records [dave_messina, Sylvia]
    * Bio::SeqIO::fastq
        - complete refactoring to handle all FASTQ variants, perform validation, write output. API now conforms with other Bio* parsers and the rest of Bio::SeqIO (e.g. write_seq() creates fastq output, not fasta output). [cjfields]
    * Bio::SeqIO::genbank
        - bug 2784 - fix DBSOURCE issue [Phillip Garland]
        - bug RT 44536 - support for UniProt/UniProtKB tests [cjfields]
    * Bio::SeqIO::largefasta
        - parser returns a Bio::Seq::LargePrimarySeq [jhannah]
    * Bio::SeqIO::raw
        - add option for 'single' and 'multiple'
    * Bio::SeqIO::scf
        - bug 2881 - fix scf round-tripping [Adam Sjøgren]
    * Bio::SeqUtils
        - bug 2766, 2810 - copy over tags from features, doc fixes [David Jackson]
    * Bio::SimpleAlign
        - bug 2793 - patch for add_seq index issue [jhannah, maj]
        - bug 2801 - throw if args are required [cjfields]
        - bug 2805 - uniq_seq returns SimpleAlign and hash ref of sequence types [Tristan Lefebure, maj]
        - bug fixes from YAPC hackathon [rbuels, jhannah, bvecchi]
        - fix POD and add get_SeqFeatures filter [maj]
    * Bio::Tools::dpAlign
        - add support for LocatableSeq [ymc]
        - to be moved to a separate distribution [cjfields, rbuels]
    * Bio::Tools::EUtilities
        - fix for two bugs from mail list [Adam Witney, cjfields]
        - add generic ItemContainerI interface for containing same methods [cjfields]
    * Bio::Tools::HMM
        - fix up code, add more warnings [cjfields]
        - to be moved to a separate distribution [cjfields, rbuels]
    * Bio::Tools::Primer3
        - bug 2862 - fenceposting issue fixed [maj]
    * Bio::Tools::Run::RemoteBlast
        - tests for remote RPS-BLAST [mcook]
    * Bio::Tools::SeqPattern
        - bug 2844 - backtranslate method [rbuels, jhannah, bvecchi]
    * Bio::Tools::tRNAscanSE
        - use 'gene' and 'exon' for proper SO, ensure ID is unique [jason]
    * Bio::Tree::*
        - bug 2456 - fix reroot_tree(), added create_node_on_branch() [maj]
    * Bio::Tree::Statistics
        - several methods for calculating Fitch-based score, internal trait values, statratio(), sum of leaf distances [heikki]
    *
      Bio::Tree::Tree
        - bug 2869 - add docs indicating edge case where nodes can be prematurely garbage-collected [cjfields]
        - add as_text() function to create Tree as a string in specified format [maj]
    * Bio::Tree::TreeFunctionsI
        - bug 2877 - fix bug where bootstrap assigned to the wrong node [Tristan Lefebure, maj]
    * Bio::TreeIO::newick
        - fix small semicolon issue [cjfields]
    * scripts
        - update to bp_seqfeature_load for SQLite [lstein]
        - hivq.pl - command-line interface to Bio::DB::HIV [maj]
        - fastam9_to_table - fix for MPI output [jason]
        - gccalc - total stats [jason]
    * General Stuff
        - POD cleanup re: FEEDBACK section [maj, cjfields]
        - cleanup or fix dead links [cjfields]
        - Use of no_* methods (indicating 'number of something') is deprecated in favor of num_* [cjfields]
        - lots of new tests for the above bugs and refactors [everyone!]
        - new template for Komodo text editor [cjfields]

1.6.0 Winter 2009

    * Feature/Annotation rollback
        - Problematic changes introduced prior to the 1.5 release have been rolled back. These changes led to subtle bugs involving operator overloading and interface methods.
        - Behavior is very similar to that for BioPerl 1.4, with tag values being stored generically as simple scalars. Results in a modest speedup.
    * Bio::Graphics
        - Split into a separate distribution on CPAN, primarily so development isn't reliant on a complete BioPerl release.
        - Bio::Graphics::Pictogram has been renamed to Bio::Draw::Pictogram but is only available via Subversion (via bioperl-live main trunk)
    * Bio::Root::Test
        - Common test bed for all BioPerl modules
    * Bio::Root::Build
        - Common Module::Build-based subclass for all BioPerl modules
    * Bio::DB::EUtilities
        - Complete refactoring to split up parsing (Bio::Tools::EUtilities), parameter handling (Bio::Tools::EUtilities::EUtilParameters), and user agent request posting and retrieval
    * Test implementation and reorganization
        - Tests have been reorganized into groups based on classes or use cases.
        - Automated test coverage is now online: http://www.bioperl.org/wiki/Test_Coverage
        - After this release, untested modules will be moved into a separate developer distribution until tests can be derived. Also, new modules to be added are expected to have a test suite and adequate test coverage.

1.5.2 Developer release

    Full details of changes since 1.5.1 are available online at:
    http://www.bioperl.org/wiki/Change_log
    The following represents a brief overview of the most important changes.

    o Bio::Map
        - Overhaul. Brand new system fully allows markers to have multiple positions on multiple maps, and to have relative positions. Should be backward compatible.
    o Bio::Taxonomy
        - This module and all the modules in the Taxonomy directory are now deprecated in favour of Bio::Taxon and Bio::Tree::Tree
    o Bio::DB::Taxonomy
        - Taxonomy.pm
            * get_Taxonomy_Node() eventually to be deprecated, renamed get_taxon().
            * New methods ancestor(), each_Descendent() and _handle_internal_id().
            * Allows for different database modules to create Bio::Taxon objects with the same internal id when the same taxon is requested from each.
        - flatfile.pm
            * get_Children_Taxids() is deprecated, superseded by each_Descendent().
            * No longer includes the fake root node 'root'; there are multiple roots now (10239, 12884, 12908, 29384 and 131567). Consistent with entrez.pm
        - entrez.pm
            * get_node() has new option -full
            * Caches data retrieved from website
    o Bio::Species
        - Now a Bio::Taxon. Carries out the species name -> specific name munging that Bio::DB::Taxonomy modules and SeqIO modules used to do, for backward compatibility in species() method.
    o Bio::Search and Bio::SearchIO
        - Overhaul. The existing system has been sped up via some minor changes (mostly gain-of-function to the API). Bio::PullParserI is introduced as a potential eventual replacement for the existing system, though as yet only a Hmmpfam parser exists written using it.
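
As an illustration of the ancestor() method listed for Bio::DB::Taxonomy in the 1.5.2 notes, the sketch below walks from a taxon up to its root. It is not code from the distribution: lineage() is a hypothetical helper, and it assumes only that the database object provides ancestor($taxon) and that the call returns nothing once a root has been reached.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of BioPerl): collect a taxon and all of its
# ancestors, root first. $db is any object with an ancestor($taxon) method;
# in real use this would be a Bio::DB::Taxonomy database handle.
sub lineage {
    my ($db, $taxon) = @_;
    my @lineage = ($taxon);

    # Assumption: ancestor() returns undef (or an empty list) at a root.
    while (defined(my $parent = $db->ancestor($lineage[-1]))) {
        push @lineage, $parent;
    }
    return reverse @lineage;    # call in list context
}

# Possible use, assuming BioPerl is installed and NCBI is reachable:
#   use Bio::DB::Taxonomy;
#   my $db    = Bio::DB::Taxonomy->new(-source => 'entrez');
#   my $human = $db->get_taxon(-taxonid => 9606);
#   print join(' > ', map { $_->scientific_name } lineage($db, $human)), "\n";
```

Because the helper relies on nothing beyond ancestor(), it should behave the same with the entrez and flatfile back ends, which matters now that the flatfile database has several roots instead of a single fake 'root' node.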
1.5.1 Developer release

    o Major problem with how Annotations were written out with Bio::Seq is fixed by reverting to old behavior for Bio::Annotation objects.
    o Bio::SeqIO
        - genbank.pm
            * bug #1871; REFLOOP parsing loop, I changed the pattern to expect at least 9 spaces at the beginning of a line to indicate line wrapping.
            * Treat multi-line SOURCE sections correctly, this defect broke both common_name() and classification()
            * parse swissprot fields in genpept file
            * parse WGS genbank records
        - embl.pm
            * Changed regexp for ID line. The capturing parentheses are the same, the difference is an optional repeated-not-semicolon expression following the captured \S+. This means the regexp works when the division looks like /PRO;/ or when the division looks like /ANG ;/ - the latter is from EMBL repbase
            * fix ID line parsing: the molecule string can have spaces in it. Like: "genomic DNA"
        - swiss.pm: bugs #1727, #1734
        - entrezgene.pm
            * Added parser for entrezgene ASN1 (text format) files. Uses Bio::ASN1::EntrezGene as a low level parser (get it from CPAN)
    o Bio::AlignIO
        - maf.pm coordinate problem fixed
    o Bio::Taxonomy and Bio::DB::Taxonomy
        - Parse NCBI XML now so that nearly all the taxonomy up-and-down can be done via Web without downloading all the sequence.
    o Bio::Tools::Run::RemoteBlast supports more options and complies with changes to the NCBI interface. It is recommended that you retrieve the data in XML instead of plain-text BLAST report to ensure proper parsing and retrieval of all information as NCBI fully expects to change things in the future.
    o Bio::Tree and Bio::TreeIO
        - Fixes so that re-rooting a tree works properly
        - Writing out nhx format from a newick/nexus file will properly output bootstrap information. The user must move the internal node labels over to bootstraps.
            for my $node ( grep { ! $_->is_Leaf } $tree->get_nodes ) {
                $node->bootstrap($node->id);
                $node->id('');
            }
        - Nexus parsing is much more flexible now, does not care about LF.
        - Cladogram drawing module in Bio::Tree::Draw
        - Node height and depth now properly calculated
        - fix tree pruning algorithm so that node with 1 child gets merged
    o Graphics tweaks. Glyph::xyplot improved. Many other small-medium sized bugs and improvements were added, see Gbrowse mailing list for most of these.
    o Bio::DB::GFF partially supports GFF3. See information about gff3_munge flag in scripts/Bio-DB-GFF/bulk_load_gff.pl.
    o Better location parsing in Bio::Factory::FTLocationFactory - this is part of the engine for parsing EMBL/GenBank feature table locations. Nested join/order-by/complement are allowed now
    o Bio::PrimarySeqI->translate now takes named parameters
    o Bio::Tools::Phylo::PAML - parsing RST (ancestral sequence reconstruction) is now supported. Parsing different models and branch specific parameters are now supported.
    o Bio::Factory::FTLocationFactory - parse hierarchical locations (joins of joins)
    o Bio::Matrix::DistanceMatrix returns arrayrefs instead of arrays for getter/setter functions
    o Bio::SearchIO
        - blast bug #1739; match scientific notation in score and possible e+ values
        - blast.pm reads more WU-BLAST parameters and matches a full database pathname
        - Handle NCBI WEB and newer BLAST formats specifically (Query|Sbjct:) match in alignment blocks can now be (Query|Sbjct).
        - psl off-by-one error fixed
        - exonerate parsing much improved, CIGAR and VULGAR can be parsed and HSPs can be constructed from them.
        - HSPs query/hit now have a seqdesc field filled out (this was always available via $hit->description and $result->query_description)
        - hmmer.pm can parse -A0 hmmpfam files
        - Writer::GbrowseGFF more customizable.
    o Bio::Tools::Hmmpfam
        - make e-value the default score displayed in gff, rather than raw score
        - allow parse of multiple records

1.5 Developer release

    o Bio::Align::DNAStatistics and Bio::Align::ProteinStatistics provide Jukes-Cantor and Kimura pairwise distance methods, respectively.
    o Bio::AlignIO support for "po" format of POA, and "maf"; Bio::AlignIO::largemultifasta is a new alternative to Bio::AlignIO::fasta for temporary file-based manipulation of particularly large multiple sequence alignments.
    o Bio::Assembly::Singlet allows orphan, unassembled sequences to be treated similarly to an assembled contig.
    o Bio::CodonUsage provides new rare_codon() and probable_codons() methods for identifying particular codons that encode a given amino acid.
    o Bio::Coordinate::Utils provides new from_align() method to build a Bio::Coordinate pair directly from a Bio::Align::AlignI-conforming object.
    o Bio::DB::Biblio::eutils is a class for querying NCBI's Eutils. Send a Pubmed, Pubmed Central, Entrez, or other query to NCBI's web service using standard Pubmed query syntax, and retrieve results as XML.
    o Bio::DB::GFF has various sundry bug fixes.
    o Bio::FeatureIO is a new SeqIO-style subsystem for writing/reading genomic features to/from files. I/O classes exist for BED, GTF (aka GFF v2.5), and GFF v3. Bio::FeatureIO classes only read/write Bio::SeqFeature::Annotated objects. Notably, the GFF v3 class requires features to be typed into the Sequence Ontology.
    o Bio::Graph namespace contains new modules for manipulation and analysis of protein interaction graphs.
    o Bio::Graphics has many bug fixes and shiny new glyphs.
    o Bio::Index::Hmmer and Bio::Index::Qual provide multiple-file indexing for HMMER reports and FASTA qual files, respectively.
    o Bio::Map::Clone, Bio::Map::Contig, and Bio::Map::FPCMarker are new objects that can be placed within a Bio::Map::MapI-compliant genetic/physical map; Bio::Map::Physical provides a new physical map type; Bio::MapIO::fpc provides finger-printed clone mapping import.
    o Bio::Matrix::PSM provides new support for position-specific (scoring) matrices (e.g. profiles, or "possums").
    o Bio::Ontology::Ontology and Bio::Ontology::Term objects can now be instantiated without explicitly using Bio::OntologyIO.
      This is possible through changes to Bio::Ontology::OntologyStore to download ontology files from the web as necessary. Locations of ontology files are hard-coded into Bio::Ontology::DocumentRegistry.
    o Bio::PopGen includes many new methods and data types for population genetics analyses.
    o New constructor to Bio::Range, unions(). Given a list of ranges, returns another list of "flattened" ranges -- overlapping ranges are merged into a single range with the minimum and maximum coordinates of the entire overlapping group.
    o Bio::Root::IO now supports -url, in addition to -file and -fh. The new -url argument allows one to specify the network address of a file for input. -url currently only works for GET requests, and thus is read-only.
    o Bio::SearchIO::hmmer now returns individual Hit objects for each domain alignment (thus containing only one HSP); previously separate alignments would be merged into one hit if the domain involved in the alignments was the same, but this only worked when the repeated domain occurred without interruption by any other domain, leading to a confusing mixture of Hit and HSP objects.
    o Bio::Search::Result::ResultI-compliant report objects now implement the "get_statistics" method to access Bio::Search::StatisticsI objects that encapsulate any statistical parameters associated with the search (e.g. Karlin's lambda for BLAST/FASTA).
    o Bio::Seq::LargeLocatableSeq combines the functionality already found in Bio::Seq::LargeSeq and Bio::LocatableSeq.
    o Bio::SeqFeature::Annotated is a replacement for Bio::SeqFeature::Generic. It breaks compliance with the Bio::SeqFeatureI interface because the author was sick of dealing with untyped annotation tags. All Bio::SeqFeature::Annotated annotations are Bio::AnnotationI compliant, and accessible through Bio::Annotation::Collection.
    o Bio::SeqFeature::Primer implements a Tm() method for primer melting point predictions.
    o Bio::SeqIO now supports AGAVE, BSML (via SAX), CHAOS-XML, InterProScan-XML, TIGR-XML, and NCBI TinySeq formats.
    o Bio::Taxonomy::Node now implements the methods necessary for Bio::Species interoperability.
    o Bio::Tools::CodonTable has new reverse_translate_all() and make_iupac_string() methods.
    o Bio::Tools::dpAlign now provides sequence profile alignments.
    o Bio::Tools::GFF now parses GFF version 2.5 (a.k.a. GTF).
    o Bio::Tools::Fgenesh, Bio::Tools::tRNAscanSE are new report parsers.
    o Bio::Tools::SiRNA includes two new rulesets (Saigo and Tuschl) for designing small inhibitory RNA.
    o Bio::Tree::DistanceFactory provides NJ and UPGMA tree-building methods based on a distance matrix.
    o Bio::Tree::Statistics provides an assess_bootstrap() method to calculate bootstrap support values on a guide tree topology, based on provided bootstrap tree topologies.
    o Bio::TreeIO now supports the Pagel (PAG) tree format.

1.4 branch

1.4.1

    o Improvements to Bio::AlignIO::nexus for parsing TreeBase nexus files
    o Bio::Graphics will work with gd1 or gd2
    o Bio::SearchIO
        - hmmer.pm Better hmmpfam parsing, fix bug for small number of alignment outputs (RF lines alone)
        - blast.pm Parse multi-line query fields properly
        - small speed improvements to blasttable.pm and others
    o Bio::DB::Taxonomy has better support for hierarchy traversal so that Bio::Taxonomy::Node can be as simple as Bio::Species object while still supporting more complex queries

1.4. Stable major release

    Since initial 1.2.0, 3000 separate changes have been made to make this release.

    o installable scripts
    o global module version from Bio::Root::Version
    o Bio::Graphics - major improvements; SVG support
    o Bio::PopGen - population genetics
        - support several population genetics types of questions.
        - Tests for statistical neutrality of mutations (Fu and Li's D/F, Tajima's D) are in Bio::PopGen::Statistics. Tests of population structure (Wright's F-statistic: Fst) are in Bio::PopGen::PopStats.
          Calculating composite linkage disequilibrium (LD) is available in Bio::PopGen::Statistics as well.
        - Bio::PopGen::IO for reading in prettybase (SeattleSNPs) and csv (comma delimited formatted) data.
        - a directory for implementing population simulations has been added, Bio::PopGen::Simulation, and 2 simulations - a Coalescent and a simple single-locus multi-allele genetic drift simulation - have been provided. This replaces the code in Bio::Tree::RandomTree which has been deprecated until proper methods for generating random phylogenetic trees are implemented.
    o Bio::Restriction - new restriction analysis modules
    o Bio::Tools::Analysis - web based DNA and Protein analysis framework and several implementations
    o Bio::Seq::Meta - per residue annotatable sequences
    o Bio::Matrix
        - Bio::Matrix::PSM - Position Scoring Matrix
        - Bio::Matrix::IO has been added for generalized parsing of matrix data. Matrix::IO::scoring and Matrix::IO::phylip are initial implementations for parsing BLOSUM/PAM and Phylip Distance matrices respectively. A generic matrix implementation for general use was added in Bio::Matrix::Generic.
    o Bio::Ontology - major changes
    o Bio::Tree
    o Bio::Tools::SiRNA, Bio::SeqFeature::SiRNA - small inhibitory RNA
    o Bio::SeqFeature::Tools - seqFeature mapping tools
        - Bio::SeqFeature::Tools::Unflattener.pm -- deal with mapping GenBank feature collections into Chado/GFF3 processable feature sets (with SO term mappings)
    o Bio::Tools::dpAlign - pure perl dynamic programming sequence alignment
        - needs Bioperl-ext
    o new Bio::SearchIO formats
        - axt and psl: UCSC formats.
        - blasttable: NCBI -m 8 or -m 9 format from blastall
    o new Bio::SeqIO formats
        - chado, tab, kegg, tigr, game
        - important fixes for old modules
    o Bio::AlignIO: maf
    o improved Bio::Tools::Genewise
    o Bio::SeqIO now can recognize sequence formats automatically from stream
    o new parsers in Bio::Tools: Blat, Geneid, Lagan, Mdust, Promoterwise, PrositeScan
    o Bio::DB::Registry bugs fixed
        - BerkeleyDB-indexed flat files can be used by the OBDA system
        - Multiple seqdatabase.ini locations in OBDA_SEARCH_PATH are all used by the OBDA system
    o several new HOWTOs
        - SimpleWebAnalysis, Trees, Feature Annotation, OBDA Access, Flat Databases
    o hundreds of new and improved files
    o Bio::Tree::AlleleNode has been updated to be a container of a Bio::PopGen::Individual object for use in the Coalescent simulations.

1.2 Branch

1.2.3 Stable release update

    o Bug #1475 - Fix and add speedup to spliced_seq for remote location handling.
    o Bug #1477 - Sel --> Sec abbreviation fixed
    o Fix bug #1487 where parsing in-between locations when end < start caused the FTLocationFactory logic to fail.
    o Fix bug #1489 which was not dealing with keywords as an arrayref properly (this is fixed on the main trunk because keywords returns a string and the array is accessible via get_keywords).
    o Bio::Tree::Tree memory leak (bug #1480) fixed. Added a new initialization option -nodelete which won't try to clean up the containing nodes if this is true.
    o Bug with parsing labeled nodes with Bio::TreeIO::newick fixed; this was only present on the branch for the 1.2.1 and 1.2.2 series
        - Also merged main trunk changes to the branch which make newick -> nhx round tripping more effective (storing branch length and bootstrap values in the same location for NodeNHX and Node implementations.) Fixes to TreeIO parsing for labeled internal nodes also required small changes to TreeIO::nhx. Improved tests for this module as well.
    o Bio::SearchIO
        - Fixed bugs in BLAST parsing which couldn't parse NCBI gapped blast properly (was losing hit significance values due to the extra unexpected column).
        - Parsing of blastcl3 (netblast from NCBI) now can handle case of integer overflow (# of letters in nt seq dbs is > MAX_INT) although doesn't try to correct it - will get the negative number for you. Added a test for this as well.
        - Fixed HMMER parsing bug which prevented parsing when a hmmpfam report has no top-level family classification scores but does have scores and alignments for individual domains.
        - Parsing FASTA reports where ungapped percent ID is < 10 and the regular expression to match the line was missing the possibility of an extra space. This is rare, which is why we probably did not catch it before.
        - BLAST parsing picks up more of the statistics/parameter fields at the bottom of reports. Still not fully complete.
        - SearchIO::Writer::HTMLResultWriter and TextResultWriter were fixed to include many improvements and added flexibility in outputting the files. Bug #1495 was also fixed in the process.
    o Bio::DB::GFF
        - Update for GFF3 compatibility.
        - Added scripts for importing from UCSC and GenBank.
        - Added a 1.2003 version number.
    o Bio::Graphics
        - Updated tutorial.
        - Added a 1.2003 version number.
    o SeqIO::swiss Bug #1504 fixed with swiss writing which was not properly writing keywords out.
    o Bio::SeqIO::genbank
        - Fixed bug/enhancement #1513 where dates of the form D-MMM-YYYY were not parsed. Even though this is invalid format we can handle it - and also cleanup the date string so it is properly formatted.
        - Bug/enhancement #1517 fixed so that SEGMENT line can be parsed and written with Genbank format. Similarly bug #1515 is fixed to parse in the ORIGIN text.
    o Bio::SeqIO::fasta, a new method called preferred_id_type allows you to specify the ID type, one of (accession accession.version display primary). See Bio::SeqIO::preferred_id_type method documentation for more information.
    o Unigene parsing updated to handle file format changes by NCBI

1.2.2 Stable release update

    o A series of bug fixes of the Bio::OntologyIO dagflat-related parsers:
        - auto-discover ontology name
        - bug in parsing relationships when certain characters are in the term
        - fixed hard-coded prefix for term identifiers
        - various smaller issues
    o Fixed bug in Bio::Annotation::OntologyTerm of not implementing all of Bio::Ontology::TermI
    o brought the OBDA Registry code up to latest specs
    o Bio::DB::GenBank
        - eutils URL change
        - accession number retrieval fixed
    o Bio::SearchIO::blast - fix bug #1443 (missing last hits), parse megablast
    o Bio::SearchIO::Writer::(HTML|Text)ResultWriter fix bugs #1458, #1459 which now properly report alignment start/end info for translated BLAST/FASTA searches.
    o Bio::TreeIO::newick can parse labeled internal nodes
    o Bio::Tools::BPbl2seq can properly report strand info for HSPs for BLASTX if you provide -report_type => 'BLASTX' when initializing a BPbl2seq object. Bioperl 1.3 will have better support for bl2seq in the SearchIO system.
    o Bio::Root::IO supports a -noclose boolean flag which will not close a filehandle upon object cleanup - useful when sharing a filehandle among objects. Additionally, code was added so that STDOUT/STDIN/STDERR will never be closed by Root::IO cleanup.
    o Bio::Tools::Genemark bug #1435 fixed which was missing last prediction
    o Bio::SeqIO::genbank
        - bug #1456 fixed which generated extra sequence lines
        - write moltype correctly for genpept

1.2.1 Stable release update

    o Inclusion of WrapperBase, a needed component for StandAloneBlast
    o Addition from main trunk of Ontology objects, principally to allow BioSQL releases against 1.2.1
    o Fixes and cleanup of Bio::Coordinate modules
    o A fix to Bio::Index::EMBL allowing retrieval of entries using the primary accession number
    o Other bug fixes, including bpindex GenBank fix
    o Bio::SeqIO::genbank bug #1389 fixed

1.2 Stable major release

    o More functionality added to Bio::Perl, the newbie module
    o Bug fixes in Bio::TreeIO::newick fix a bug introduced in 1.0.2. Support for New Hampshire Extended (NHX) format parsing.
    o Bio::Tools added support for parsing Genomewise, Pseudowise, Est2Genome, Tmhmm, SignalP, Seg, RepeatMasker, FootPrinter, and a lightweight Hmmpfam parser.
    o New ontology parsing Bio::Ontology
    o Bug fixes in Bio::SearchIO for HMMer parsing, support for multi-report (mlib) fasta reports, support for waba and exonerate.
    o Bio::ClusterIO for parsing Unigene clusters
    o Bio::Assembly added for representing phrap and ace assembly clusters.
    o Rudimentary support for writing Chado XML (see GMOD project: www.gmod.org for more information)
    o Bio::Coordinate for mapping between different coordinate systems such as protein -> cDNA -> Exon -> DNA and back. Useful for mapping features into different coordinate systems.
    o Bio::DB::GenBank/Bio::DB::GenPept now support Entrez queries with the get_Stream_by_query method and support the latest NCBI eutils interface.
    o Bugs fixed in Bio::SeqFeature::Collection, an in-memory fast object for extracting subsets of features: currently only supports extraction by location.

1.1.1 Developer release

    o Deprecated modules are now listed in the DEPRECATED file
    o New HowTo documents located in doc/howto describing a domain of Bioperl.
o Note that bugs are now stored at redmine.open-bio.org/projects/bioperl/ and all old bugs are searchable through the bugzilla interface.
o Several reported bugs in Bio::Tools::Sigcleave and Bio::SimpleAlign have been addressed.
o Support for Genewise parsing in Bio::Tools::Genewise
o Start of Ontology framework with Bio::Ontology
o Speedup to the Bio::Root::Root object method _rearrange. A global _load_module method was implemented to simplify the dynamic loading of modules ala Bio::SeqIO::genbank. This method is now used by all the XXIO (AlignIO, TreeIO, SearchIO, SeqIO, etc).
o Several performance improvements to sequence parsing in Bio::SeqIO. Attempt to speed up by reducing object creation overhead.
o Bio::DB::GenBank and Bio::DB::GenPept use the NCBI's approved method for sequence retrieval with their E-utils CGI scripts. More work to support Entrez queries to their fullest is planned before 1.2 release.
o Numerous fixes to Bio::SearchIO and sequence parsing (swissprot)

1.1 Developer release

o Bio::Tools::Run has been broken off into a new pkg bioperl-run; this separation removes some of the complexity in our test suite and separates the core modules in bioperl from those that need external programs to run.
o With latest ExtUtils::MakeMaker module installed SGI/IRIX should not run into trouble running the makefile
o Bio::Location and Bio::SeqIO::FTHelper are fixed to properly read, create, and write locations for grouped/split locations (like mRNA features on genomic sequence).
o Bio::Tools::Phylo added with wrappers for parsing Molphy (protml) and PAML (codeml, aaml, etc).
o Bio::Tree:: objects expanded to handle testing monophyly, paraphyly, least common ancestor, etc.
o Bio::Coordinate for mapping locations from different coordinate spaces
o Bio::SearchIO::waba added for parsing WABA, Bio::SearchIO::hmmer added for parsing hmmpfam and hmmsearch output.
o Bio::SearchIO::Writer::TextResultWriter for outputting a pseudo-blast textfile format

1.0.2 Bug fix release

o Note: The modules Bio::DB::GenBank and Bio::DB::GenPept provided in this release will not work after December 2002 when NCBI shuts off the old Entrez cgi scripts. We have already fixed this on our main development branch and the functionality will be available in the next stable bioperl release (1.2) slated for Fall 2002.
o Numerous parsing bugs in Bio::SearchIO::fasta found through testset by Robin Emig. These were fixed, as was the get_aln method in Bio::Search::HSP::GenericHSP, to handle the extra context sequence that is provided with a FastA alignment.
o Migrating differences between Bio::Search::XX::BlastXX to Bio::Search::XX::GenericXX objects. This included a mechanism to retrieve the whole list of HSPs from Hits and the whole list of Hits from Results in addition to the current next_XX iterator methods that are available. Added seq_inds() method to GenericHSP which identifies indexes in the query or hit sequences where conserved, identical, gap, or mismatch residues are located (adapted from Steve Chervitz's implementation in BlastHSP).
o Bio::DB::GFF bugs fixed; these fixes are necessary for the latest GBrowse release. Bio::DB::GFF::RelSegment is now Bio::SeqI compliant.
o Bio::Graphics glyph set improved and extended for GBrowse release
o Bio::Tree::Tree get_nodes implementation improved thanks to Howard Ross, who noticed a performance problem when writing out unbalanced trees.
o Bio::Location::Fuzzy::new named parameter -loc_type became -location_type, Bio::Location::Simple::new named parameter -seqid became -seq_id.
o Fixed major Bio::AlignIO::emboss parsing bug on needle output; it was mis-detecting that gaps should be placed at the beginning of the alignment when the best alignment starts internally in the sequence.

1.0.1 Bug fix release

o Minor bug fixes to Bio::DB::GFF. Glyph sets improved.
o Parser fixes in SearchIO blast, fasta for more complete WU BLAST and mixed (3.3 - 3.4) versions of FASTA.
o Small API change to add methods for completeness across implementations of Bio::Search objects. These new methods in the interface are implemented by the GenericXX object as well as the BlastXX objects.
  * Bio::Search::Result::ResultI - hits() method returns list of all Hits (next_hit is an iterator method)
  * Bio::Search::Hit::HitI - hsps() method returns list of all HSPs (next_hsp is an iterator method)
o The Bio::SearchIO::Writer classes have been fixed to handle results created from either psiblast (Search::BlastXX objects) or blast|fasta|blastxml objects (Search::GenericXX objects). More work has to be done here to make it work properly and will need major API changes.
o Bugs in Bio::Tools::HMMER fixed, including
  * #1178 - Root::IO destructor wasn't being called
  * #1034 - filter_on_cutoff now behaves properly
o Bio::SeqFeature::Computation initialization args fixed and tests added.
o Tests are somewhat cleaner; flat.t now properly cleans up after itself.
o Updated FAQ with more example based answers to typical questions
o Bug #1202 was fixed which would improperly join together qual values parsed by Bio::SeqIO::qual when a trailing space was not present before the newline.

1.0.0 Major Stable Release

This represents a major release of bioperl with significant improvements over the 0.7.x series of releases.

o Bio::Tools::Blast is officially deprecated. Please see Bio::SearchIO for BLAST and FastA parsing.
o The methods trunc() and subseq() in Bio::PrimarySeqI now accept Bio::LocationI objects as well as start/end.
o Bio::Biblio contains modules for Bibliographic data. Bio::DB::Biblio contains the query modules. Additionally one can parse medlinexml from the ebi bibliographic query service (BQS) system and Pubmed xml from NCBI. See Martin Senger's documentation in Bio::Biblio for more information.
o Bio::DB::Registry is a sequence database registry that is part of Open Bioinformatics Database Access. See http://obda.open-bio.org for more information.
o File-based and In-Memory Sequence caching is provided by Bio::DB::InMemoryCache and Bio::DB::FileCache which act like a local database.
o Bio::Graphics for rendering sequences as PNG, JPG, or GIFs has been added by Lincoln Stein.
o XEMBL SOAP service access is provided in Bio::DB::XEMBL.
o A FAQ has been started and is included in the release to provide a starting point for frequent questions and issues.

0.9.3 Developer's release

o Event based parsing system improved (SearchIO), with parsers for XML Blast (blastxml), Text Blast (blast), and FASTA results (fasta). Additionally a lazy parsing system for text and html blast reports was added and is called psiblast (name subject to change in future releases).
o Bio::Search objects improved and standardized with associated Interfaces written. The concept of a search "Hit" was standardized to be called "hit" consistently and the use of "subject" was deprecated in all active modules.
o Bio::Structure added (since 0.9.1) for Protein structure objects and PDB parser to retrieve and write these structures from data files.
o Several important Bio::DB::GFF bug fixes for handling features that are mapped to multiple reference points. Updated mysql adaptor so as to be able to store large (>100 megabase) chunks of DNA into Bio::DB::GFF databases.

0.9.2 Developer's release

o Bio::Search and Bio::SearchIO system introduced for event based parsing of Blast and Fasta reports. Bio::SearchIO supports NCBI BLAST in text and XML and FASTA reports in standard output format.
o Bio::Tree and Bio::TreeIO for phylogenetic trees. A Random tree generator is included in Bio::TreeIO::RandomTrees and a statistics module for evaluating.
o Bio::DB::GFF, Lincoln Stein's GFF database suitable as a DB server for DAS servers.
o Bio::Tools::BPlite provides more robust parsing of BLAST files.
  The entire BPlite system migrated to using Bio::Root::IO for the data stream.
o Bio::Tools::Alignment for Consed and sequence Trimming functionality.
o Bio::Structure for Protein structure information and parsing
o Bio::DB::GenBank/Bio::DB::GenPept updated to new NCBI Entrez cgi-bin entry point which should be more reliable.
o Bio::Map and Bio::MapIO for biological map navigation and a framework for parsing them in. Only preliminary work here.
o Interface for executing EMBOSS programs locally in Bio::Factory::EMBOSS. Future work will integrate Pise and allow submission of analysis on remote servers.
o Bio::AnnotationCollectionI and Bio::Annotation::Collection introduced as new objects for handling Sequence Annotation information (dblinks, references, etc); this is more robust than the previous system.
o Bio::Tools::FASTAParser introduced.
o Scripts from the bioperl script submission project and new scripts from bioperl authors are included in "scripts" directory.
o Factory objects and interfaces are being introduced and are more strictly enforced.
o Bio::Root::Root introduced as the base object while Bio::Root::RootI is now simply an interface.
o Bio::DB::RefSeq provides database access to a copy of the NCBI RefSeq database using the EBI dbfetch script.

0.9.0 Developer's release

o perl version at least 5.005 is now required instead of perl 5.004
o Bio::Tools::Run::RemoteBlast is available for running remote blast jobs at NCBI.
o Bio::Tools::BPbl2seq was fixed to handle multiple HSPs.
o Bio::SeqFeature::GeneStructure migrated to Bio::SeqFeature::Gene. Also added are related modules UTR3, UTR5, Exon, Intron, Promoter, PolyA and Transcript.
o Speedup of translate method in PrimarySeq
o Bio::SimpleAlign has new methods: location_from_column(), slice(), select(), dot(), get_seq_by_pos(), column_from_residue_number()
o Various fixes to Variation toolkit
o Bio::DB::EMBL provides database access to EMBL sequence data.
  Bio::DB::Universal provides a central way to point to indexes and dbs in a single interface.
o Bio::DB::GFF - a database suitable for running DAS servers locally.
o Bio::Factory::EMBOSS is still in design phase as is Bio::Factory::ApplicationFactoryI
o Dia models for bioperl design are provided in the models/ directory

0.7.2 Bug fix release

o documentation fixes in many modules - SYNOPSIS code verified to be runnable in many (but not all) modules
o corrected MANIFEST file from 0.7.1 release
o Bug fix in Bio::SeqIO::FTHelper to properly handle split locations
o Bio::SeqIO::genbank
  * Correct parsing and writing of genbank format with protein data
  * moltype and molecule separation
o Bio::SeqIO::largefasta fix to avoid infinite loops
o Bio::SimpleAlign fixed to correctly handle consensus sequence calculation
o Bio::Tools::HMMER supports hmmer 2.2g
o Bio::Tools::BPlite to support report type specific parsing. Most major changes are not on the 0.7 branch.
o Bio::Tools::Run::StandAloneBlast exists_blast() fixed and works with File::Spec
o Bio::Variation::AAChange/RNAChange corrected labels and mutated alleles in several types of mutations:
  1.) AA level: deletion, complex
  2.) AA level: complex, inframe
  3.) RNA level: silent
o BPbl2seq parsing of empty reports will not die, but will return a valid, empty, Bio::SeqFeature::SimilarityFeature for $report->query() and $report->subject() methods. So an easy way to test if a report was empty is to see if $report->query->seqname is undefined.

0.7.1 Bug fix release

o Better parsing of genbank/EMBL files, especially fixing bugs related to Feature table parsing and locations on remote sequences. Additionally, species name parsing was improved.
o Bio::SeqIO::genbank can now parse NCBI produced genbank databases, which include a number of header lines.
o More strict genbank and EMBL format writing (corrected number of spaces where appropriate).
o Bio::Tools::BPlite can better parse BLASTX reports - see BUGS for related BPlite BUGS that are unresolved in this release.
o Bio::DB::GenBank, Bio::DB::GenPept have fewer problems downloading sequences from NCBI via HTTP. Bio::DB::SwissProt can use expasy mirrors or EBI dbfetch cgi-script.
o A moderate number of documentation improvements were made as well to provide a better code synopsis in each module.

0.7 Large number of changes, including refactoring of the Object system, new parsers, new functionality and all round better system. Highlights are:

o Refactored root of inheritance: moved to a lightweight Bio::Root::RootI; Bio::Root::IO for I/O and file/handle capabilities.
o Imported BPlite modules from Ian Korf for BLAST parsing. This is considered the supported BLAST parser; Bio::Tools::Blast.pm will eventually phase out due to lack of support.
o Improved Sequence Feature model. Added complete location modelling (with fuzzy and compound locations). See Bio::LocationI and the modules under Bio/Location. Added support in Genbank/EMBL format parsing to completely parse feature tables for complex locations.
o Moved special support for databanks etc to specialized modules under Bio/Seq/. One of these supports very large sequences through a temporary file as a backend.
o Explicit Gene, Transcript and Exon SeqFeature objects, supporting CDS retrieval and exon shuffling.
o More parsers: Sim4, Genscan, MZEF, ESTScan, BPbl2seq, GFF
o Refactored Bio/DB/GenBank+GenPept. There is now also DB/SwissProt and DB/GDB (the latter has platform-specific limitations).
o New analysis parser framework for HT sequence annotation (see Bio::SeqAnalysisParserI and Bio::Factory::SeqAnalysisParserFactory)
o New Alignment IO framework
o New Index modules (Swissprot)
o New modules for running Blast within perl (Bio::Tools::Run::StandAloneBlast). Added modules for running Multiple Sequence Alignment tools ClustalW and TCoffee (Bio::Tools::Run::Alignment).
o New Cookbook-style tutorial (see bptutorial.pl). Improved documentation across the package.
o Much improved cross platform support. Many known incompatibilities have been fixed; however, NT and Mac do not work across the entire setup (see PLATFORMS).
o Many bug fixes, code restructuring, etc. Overall stability and maintainability benefit a lot.
o A total of 957 automatic tests

0.6.2

There are very few functionality changes but a large number of software improvements/bug fixes across the package.

o The EMBL/GenBank parsing is improved.
o The Swissprot reading is improved. Swissprot writing is disabled as it doesn't work at all. This needs to wait for 0.7 release
o BLAST reports with no hits are correctly parsed.
o Several other bugs of the BLAST parser (regular expressions, ...) fixed.
o Old syntax calls have been replaced with more modern syntax
o Modules that did not work at all, in particular the Sim4 set, have been removed
o Bio::SeqFeature::Generic and Bio::SeqFeature::FeaturePair have improved compliance with interface specs and documentation
o Mailing list documentation updated throughout the distribution
o Most minor bug fixes have happened.
o The scripts in /examples now work and have the modern syntax rather than the deprecated syntax

0.6.1 Sun April 2 2000

o Sequences can have Sequence Features attached to them
  - The sequence features can be read from or written to EMBL and GenBank style flat files
o Objects for Annotation, including References (but not full medline abstracts), Database links and Comments are provided
o A Species object to represent nodes on a taxonomy tree is provided
o The ability to parse HMMER and Sim4 output has been added
o The Blast parsing has been improved, with better PSI-BLAST support and better overall behaviour.
o Flat file indexed databases provide both random access and sequential access to their component sequences.
o A CodonTable object has been written with all known CodonTables accessible.
o A number of new lightweight analysis tools have been added, such as molecular weight determination.

The 0.6 release also has improved software engineering

o The sequence objects have been rewritten, providing more maintainable and easier to implement objects. These objects are backwardly compatible with the 0.05.1 objects
o Many objects are defined in terms of interfaces and then a Perl implementation has been provided. The interfaces are found in the 'I' files (module names ending in 'I'). This means that it is possible to wrap C/CORBA/SQL access as true "bioperl" objects, compatible with the rest of bioperl.
o The SeqIO system has been overhauled to provide better processing and perl-like automatic interpretation of <> over arguments.
o Many more tests have been added (a total of 172 automatic tests are now run before release).

0.05.1 Tue Jun 29 05:30:44 1999

- Central distribution now requires Perl 5.004. This was done to get around 5.003-based problems in Bio/Index/* and SimpleAlign.
- Various bug fixes in the Bio::Tools::Blast modules including better exception handling and PSI-Blast support. See Bio/Tools/Blast/CHANGES for more.
- Fixed the Parse mechanism in Seq.pm to use readseq. Follow the instructions in README for how to install it (basically, you have to edit Parse.pm).
- Improved documentation of Seq.pm, indicating where objects are returned and where strings are returned.
- Fixed uninitialized warnings in Bio::Root::Object.pm and Bio::Tools::SeqPattern.pm.
- Bug fixes for PR#s: 30,31,33-35,41,42,44,45,47-50,52.

0.05 Sun Apr 25 01:14:11 1999

- Bio::Tools::Blast modules have fewer memory problems and faster parsing. Webblast uses LWP and supports more functionality. See Bio/Tools/Blast/CHANGES for more.
- The Bio::SeqIO system has been started, moving the sequence reformatting code out of the sequence object
- The Bio::Index:: system has been started, providing generic index capabilities and specifically works for Fasta formatted databases and EMBL .dat formatted databases
- The Bio::DB:: system started, providing access to databases, both via flat file + index (see above) and via http to NCBI
- The scripts/ directory, where industrial strength scripts are put has been started.
- Many changes - a better distribution all round.

0.04.4 Wed Feb 17 02:20:13 1999

- Bug fixes in the Bio::Tools::Blast modules and postclient.pl (see Bio::Tools::Blast::CHANGES).
- Fixed a bug in Bio::Tools::Fasta::num_seqs().
- Beefed up the t/Fasta.t test script.
- Small fix in Bio::Seq::type() (now always returns a string).
- Changed Bio::Root::Utilities::get_newline_char() to get_newline() since it could return more than one char.
- Added $NEWLINE and $TIMEOUT_SECS to Bio::Root::Global.
- Changed default timeout to 20 seconds (was 3).
- Moved lengthy modification notes to the bottom of some files.
- Fixed SimpleAlign write_fasta bug.
- Beefed up SimpleAlign.t test

0.04.3 Thu Feb 4 07:48:53 1999

- Bio::Root::Object.pm and Global.pm now detect when script is run as a CGI and suppress output that is only appropriate when running interactively.
- Bio::Root::Err::_set_context() adds name of script ($0).
- Added comments in Bio::Tools::WWW.pm and Bio::Root::Utilities.pm regarding the use of the static objects via the qw(:obj) tag.
- Fixed the ambiguous reverse calls in Seq.pm and UnivAln.pm to CORE::reverse, avoiding Perl warnings.
- Bug fixes in Bio::Tools::Blast modules (version 0.074) and example scripts (see Bio::Tools::Blast::CHANGES).
- examples/seq/seqtools.pl no longer always warns about using -prot or -nucl command-line arguments; only when using the -debug argument.
- Methods added to Bio::Root::Utilities: create_filehandle(), get_newline_char(), and taste_file() to generalize filehandle creation and autodetect newline characters in files/streams (see bug report #19).
- Bio::Root::IOManager::read() now handles timeouts and uses Utilities::create_filehandle().
- Bio::Tools::Fasta.pm uses Utilities::get_newline_char() instead of hardwiring in "\n".
- Bug fixes in Bio::SimpleAlign and Bio::Tools::pSW

0.04.2 Wed Dec 30 02:27:36 1998

- Bug fixes in Bio::Tools::Blast modules, version 0.073 (see Bio::Tools::Blast::CHANGES).
- Changed reverse calls in Bio/Seq.pm and Bio/UnivAln.pm to CORE::reverse (prevents ambiguous warnings with 5.005).
- Appending '.tmp.bioperl' to temporary files created by Bio::Root::Utilities::compress() or uncompress() to make it easy to identify & cleanup these files as needed.
- Developers: Created CVS branch release-0-04-bug from release-0-04-1. Before making bug fixes to the 0.04.1 release, be sure to cvs checkout this branch into a clean area.

0.04.1 Wed Dec 16 05:39:15 1998

- Bug fixes in Bio::Tools::Blast modules, version 0.072 (see Bio::Tools::Blast::CHANGES).
- Compile/SW/Makefile.PL now removes *.o and *.a files with make clean.

0.04 Tue Dec 8 07:49:19 1998

- Lots of new modules added including:
  * Ewan Birney's Bio::SimpleAlign.pm, Bio::Tools::AlignFactory.pm, and Bio/Compile directory containing XS-linked C code for creating Smith-Waterman sequence alignments from within Perl.
  * Steve Chervitz's Blast distribution has been incorporated.
  * Georg Fuellen's Bio::UnivAln.pm for multiple alignment objects.
- Bio/examples directory for demo scripts for all included modules.
- Bio/t directory containing test suite for all included modules.
- For changes specific to the Blast-related modules prior to incorporation in this central distribution, see the CHANGES file in the Bio/Tools/Blast directory.
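
Note on the CORE::reverse change listed under 0.04.2 and 0.04.3: Perl warns ("Ambiguous call resolved as CORE::reverse()") when a package declares its own sub named reverse and then calls reverse without qualification. The sketch below shows the general pattern only; ToySeq is a hypothetical stand-in class and is not the actual Bio::Seq or Bio::UnivAln code.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical toy class for illustration; not part of BioPerl.
package ToySeq;

sub new {
    my ($class, $str) = @_;
    return bless { str => $str }, $class;
}

# A method that shares its name with the built-in reverse().
sub reverse {
    my ($self) = @_;
    # Qualifying the built-in as CORE::reverse makes it explicit that
    # the Perl built-in is meant, not ToySeq::reverse defined above.
    # In scalar context the built-in reverses the characters of a string.
    return scalar CORE::reverse($self->{str});
}

package main;

my $seq = ToySeq->new('GATTACA');
print $seq->reverse, "\n";    # prints ACATTAG
```

Calling the method as $seq->reverse is never ambiguous; only the unqualified call to the built-in inside the package needs the CORE:: prefix.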
0.01 Tue Sep 8 14:23:22 1998

- original version from central CVS tree; created by h2xs 1.18

This software is copyright (c) 2021 by See individual modules. This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself. Terms of the Perl programming language system itself a) the GNU General Public License as published by the Free Software Foundation; either version 1, or (at your option) any later version, or b) the "Artistic License" --- The GNU General Public License, Version 1, February 1989 --- This software is Copyright (c) 2021 by See individual modules. This is free software, licensed under: The GNU General Public License, Version 1, February 1989 GNU GENERAL PUBLIC LICENSE Version 1, February 1989 Copyright (C) 1989 Free Software Foundation, Inc. 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. Preamble The license agreements of most software companies try to keep users at the mercy of those companies. By contrast, our General Public License is intended to guarantee your freedom to share and change free software--to make sure the software is free for all its users. The General Public License applies to the Free Software Foundation's software and to any other program whose authors commit to using it. You can use it for your programs, too. When we speak of free software, we are referring to freedom, not price. Specifically, the General Public License is designed to make sure that you have the freedom to give away or sell copies of free software, that you receive source code or can get it if you want it, that you can change the software or use pieces of it in new free programs; and that you know you can do these things.
To protect your rights, we need to make restrictions that forbid anyone to deny you these rights or to ask you to surrender the rights. These restrictions translate to certain responsibilities for you if you distribute copies of the software, or if you modify it. For example, if you distribute copies of a such a program, whether gratis or for a fee, you must give the recipients all the rights that you have. You must make sure that they, too, receive or can get the source code. And you must tell them their rights. We protect your rights with two steps: (1) copyright the software, and (2) offer you this license which gives you legal permission to copy, distribute and/or modify the software. Also, for each author's protection and ours, we want to make certain that everyone understands that there is no warranty for this free software. If the software is modified by someone else and passed on, we want its recipients to know that what they have is not the original, so that any problems introduced by others will not reflect on the original authors' reputations. The precise terms and conditions for copying, distribution and modification follow. GNU GENERAL PUBLIC LICENSE TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION 0. This License Agreement applies to any program or other work which contains a notice placed by the copyright holder saying it may be distributed under the terms of this General Public License. The "Program", below, refers to any such program or work, and a "work based on the Program" means either the Program or any work containing the Program or a portion of it, either verbatim or with modifications. Each licensee is addressed as "you". 1. 
You may copy and distribute verbatim copies of the Program's source code as you receive it, in any medium, provided that you conspicuously and appropriately publish on each copy an appropriate copyright notice and disclaimer of warranty; keep intact all the notices that refer to this General Public License and to the absence of any warranty; and give any other recipients of the Program a copy of this General Public License along with the Program. You may charge a fee for the physical act of transferring a copy. 2. You may modify your copy or copies of the Program or any portion of it, and copy and distribute such modifications under the terms of Paragraph 1 above, provided that you also do the following: a) cause the modified files to carry prominent notices stating that you changed the files and the date of any change; and b) cause the whole of any work that you distribute or publish, that in whole or in part contains the Program or any part thereof, either with or without modifications, to be licensed at no charge to all third parties under the terms of this General Public License (except that you may choose to grant warranty protection to some or all third parties, at your option). c) If the modified program normally reads commands interactively when run, you must cause it, when started running for such interactive use in the simplest and most usual way, to print or display an announcement including an appropriate copyright notice and a notice that there is no warranty (or else, saying that you provide a warranty) and that users may redistribute the program under these conditions, and telling the user how to view a copy of this General Public License. d) You may charge a fee for the physical act of transferring a copy, and you may at your option offer warranty protection in exchange for a fee. 
Mere aggregation of another independent work with the Program (or its derivative) on a volume of a storage or distribution medium does not bring the other work under the scope of these terms. 3. You may copy and distribute the Program (or a portion or derivative of it, under Paragraph 2) in object code or executable form under the terms of Paragraphs 1 and 2 above provided that you also do one of the following: a) accompany it with the complete corresponding machine-readable source code, which must be distributed under the terms of Paragraphs 1 and 2 above; or, b) accompany it with a written offer, valid for at least three years, to give any third party free (except for a nominal charge for the cost of distribution) a complete machine-readable copy of the corresponding source code, to be distributed under the terms of Paragraphs 1 and 2 above; or, c) accompany it with the information you received as to where the corresponding source code may be obtained. (This alternative is allowed only for noncommercial distribution and only if you received the program in object code or executable form alone.) Source code for a work means the preferred form of the work for making modifications to it. For an executable file, complete source code means all the source code for all modules it contains; but, as a special exception, it need not include source code for modules which are standard libraries that accompany the operating system on which the executable file runs, or for standard header files or definitions files that accompany that operating system. 4. You may not copy, modify, sublicense, distribute or transfer the Program except as expressly provided under this General Public License. Any attempt otherwise to copy, modify, sublicense, distribute or transfer the Program is void, and will automatically terminate your rights to use the Program under this License. 
However, parties who have received copies, or rights to use copies, from you under this General Public License will not have their licenses terminated so long as such parties remain in full compliance. 5. By copying, distributing or modifying the Program (or any work based on the Program) you indicate your acceptance of this license to do so, and all its terms and conditions. 6. Each time you redistribute the Program (or any work based on the Program), the recipient automatically receives a license from the original licensor to copy, distribute or modify the Program subject to these terms and conditions. You may not impose any further restrictions on the recipients' exercise of the rights granted herein. 7. The Free Software Foundation may publish revised and/or new versions of the General Public License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. Each version is given a distinguishing version number. If the Program specifies a version number of the license which applies to it and "any later version", you have the option of following the terms and conditions either of that version or of any later version published by the Free Software Foundation. If the Program does not specify a version number of the license, you may choose any version ever published by the Free Software Foundation. 8. If you wish to incorporate parts of the Program into other free programs whose distribution conditions are different, write to the author to ask for permission. For software which is copyrighted by the Free Software Foundation, write to the Free Software Foundation; we sometimes make exceptions for this. Our decision will be guided by the two goals of preserving the free status of all derivatives of our free software and of promoting the sharing and reuse of software generally. NO WARRANTY 9. 
BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW. EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU. SHOULD THE PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION. 10. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR REDISTRIBUTE THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. END OF TERMS AND CONDITIONS Appendix: How to Apply These Terms to Your New Programs If you develop a new program, and you want it to be of the greatest possible use to humanity, the best way to achieve this is to make it free software which everyone can redistribute and change under these terms. To do so, attach the following notices to the program. It is safest to attach them to the start of each source file to most effectively convey the exclusion of warranty; and each file should have at least the "copyright" line and a pointer to where the full notice is found. 
Copyright (C) 19yy This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 1, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston MA 02110-1301 USA Also add information on how to contact you by electronic and paper mail. If the program is interactive, make it output a short notice like this when it starts in an interactive mode: Gnomovision version 69, Copyright (C) 19xx name of author Gnomovision comes with ABSOLUTELY NO WARRANTY; for details type `show w'. This is free software, and you are welcome to redistribute it under certain conditions; type `show c' for details. The hypothetical commands `show w' and `show c' should show the appropriate parts of the General Public License. Of course, the commands you use may be called something other than `show w' and `show c'; they could even be mouse-clicks or menu items--whatever suits your program. You should also get your employer (if you work as a programmer) or your school, if any, to sign a "copyright disclaimer" for the program, if necessary. Here a sample; alter the names: Yoyodyne, Inc., hereby disclaims all copyright interest in the program `Gnomovision' (a program to direct compilers to make passes at assemblers) written by James Hacker. , 1 April 1989 Ty Coon, President of Vice That's all there is to it! --- The Artistic License 1.0 --- This software is Copyright (c) 2021 by See individual modules. 
This is free software, licensed under: The Artistic License 1.0 The Artistic License Preamble The intent of this document is to state the conditions under which a Package may be copied, such that the Copyright Holder maintains some semblance of artistic control over the development of the package, while giving the users of the package the right to use and distribute the Package in a more-or-less customary fashion, plus the right to make reasonable modifications. Definitions: - "Package" refers to the collection of files distributed by the Copyright Holder, and derivatives of that collection of files created through textual modification. - "Standard Version" refers to such a Package if it has not been modified, or has been modified in accordance with the wishes of the Copyright Holder. - "Copyright Holder" is whoever is named in the copyright or copyrights for the package. - "You" is you, if you're thinking about copying or distributing this Package. - "Reasonable copying fee" is whatever you can justify on the basis of media cost, duplication charges, time of people involved, and so on. (You will not be required to justify it to the Copyright Holder, but only to the computing community at large as a market that must bear the fee.) - "Freely Available" means that no fee is charged for the item itself, though there may be fees involved in handling the item. It also means that recipients of the item may redistribute it under the same conditions they received it. 1. You may make and give away verbatim copies of the source form of the Standard Version of this Package without restriction, provided that you duplicate all of the original copyright notices and associated disclaimers. 2. You may apply bug fixes, portability fixes and other modifications derived from the Public Domain or from the Copyright Holder. A Package modified in such a way shall still be considered the Standard Version. 3. 
You may otherwise modify your copy of this Package in any way, provided that you insert a prominent notice in each changed file stating how and when you changed that file, and provided that you do at least ONE of the following: a) place your modifications in the Public Domain or otherwise make them Freely Available, such as by posting said modifications to Usenet or an equivalent medium, or placing the modifications on a major archive site such as ftp.uu.net, or by allowing the Copyright Holder to include your modifications in the Standard Version of the Package. b) use the modified Package only within your corporation or organization. c) rename any non-standard executables so the names do not conflict with standard executables, which must also be provided, and provide a separate manual page for each non-standard executable that clearly documents how it differs from the Standard Version. d) make other distribution arrangements with the Copyright Holder. 4. You may distribute the programs of this Package in object code or executable form, provided that you do at least ONE of the following: a) distribute a Standard Version of the executables and library files, together with instructions (in the manual page or equivalent) on where to get the Standard Version. b) accompany the distribution with the machine-readable source of the Package with your modifications. c) accompany any non-standard executables with their corresponding Standard Version executables, giving the non-standard executables non-standard names, and clearly documenting the differences in manual pages (or equivalent), together with instructions on where to get the Standard Version. d) make other distribution arrangements with the Copyright Holder. 5. You may charge a reasonable copying fee for any distribution of this Package. You may charge any fee you choose for support of this Package. You may not charge a fee for this Package itself. 
However, you may distribute this Package in aggregate with other (possibly commercial) programs as part of a larger (possibly commercial) software distribution provided that you do not advertise this Package as a product of your own. 6. The scripts and library files supplied as input to or produced as output from the programs of this Package do not automatically fall under the copyright of this Package, but belong to whomever generated them, and may be sold commercially, and may be aggregated with this Package. 7. C or perl subroutines supplied by you and linked into this Package shall not be considered part of this Package. 8. The name of the Copyright Holder may not be used to endorse or promote products derived from this software without specific prior written permission. 9. THIS PACKAGE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE. The End dist.ini100644000766000024 340714006427353 14670 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8name = BioPerl version = 1.7.8 author = See individual modules copyright_holder = See individual modules license = Perl_5 ;; Modules should be fixed so that these don't have to be removed. [@BioPerl] -remove = PodCoverageTests -remove = PodWeaver -remove = Test::EOL -remove = Test::NoTabs ;; [AutoMetaResources] is not suitable for us because our repo name ;; is different from the distribution name. -remove = AutoMetaResources ;; This is a dependency for Windows only AutoPrereqs.skips[0] = ^Win32$ ;; The program bp_fetch can make use of these Bio::DB modules that are ;; not part of the distribution. 
AutoPrereqs.skips[1] = ^Bio::DB::Ace$ AutoPrereqs.skips[2] = ^Bio::DB::EMBL$ AutoPrereqs.skips[3] = ^Bio::DB::GenBank$ AutoPrereqs.skips[4] = ^Bio::DB::GenPept$ AutoPrereqs.skips[5] = ^Bio::DB::SwissProt$ [OSPrereqs / MSWin32] Win32 = 0 [Prereqs / bin_bp_fetch_RuntimeSuggests] -relationship = suggests Bio::DB::Ace = 0 Bio::DB::EMBL = 0 Bio::DB::GenBank = 0 Bio::DB::GenPept = 0 Bio::DB::SwissProt = 0 ;; PodWeaver has some minimum requirements on the files before it can ;; be used. To allow a migration over time, we can list the files that ;; are ready here. [FileFinder::ByName / PodWeaver-Ready] file = lib/BioPerl.pm [PodWeaver] config_plugin = @BioPerl finder = PodWeaver-Ready ;; The distribution name is different from the repository name so we ;; can't use [AutoMetaResources] that comes from [@BioPerl]. [MetaResources] homepage = https://metacpan.org/release/BioPerl bugtracker.web = https://github.com/bioperl/bioperl-live/issues bugtracker.mailto = bioperl-l@bioperl.org repository.url = git://github.com/bioperl/bioperl-live.git repository.web = https://github.com/bioperl/bioperl-live repository.type = git [PruneFiles] filename = .travis.yml match = ^travis_scripts META.yml100644000766000024 2766414006427353 14530 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8--- abstract: 'Perl modules for biology' author: - 'See individual modules' build_requires: File::Spec: '0' FindBin: '0' IO::Handle: '0' IPC::Open3: '0' Test::Memory::Cycle: '0' Test::More: '0' Test::Weaken: '0' lib: '0' perl: '5.006' configure_requires: ExtUtils::MakeMaker: '0' dynamic_config: 1 generated_by: 'Dist::Zilla version 6.017, CPAN::Meta::Converter version 2.150010' license: perl meta-spec: url: http://module-build.sourceforge.net/META-spec-v1.4.html version: '1.4' name: BioPerl requires: AnyDBM_File: '0' Carp: '0' Cwd: '0' DB_File: '0' Data::Dumper: '0' Data::Stag: '0' Digest::MD5: '0' Dumpvalue: '0' Error: '0' Exporter: '0' Fcntl: '0' File::Basename: '0' File::Copy: '0' File::Path: '0' File::Spec: 
'0' File::Spec::Functions: '0' File::Temp: '0' FileHandle: '0' Getopt::Long: '0' Graph::Directed: '0' HTTP::Request::Common: '0' HTTP::Response: '0' IO::File: '0' IO::Handle: '0' IO::Pipe: '0' IO::Scalar: '0' IO::Socket: '0' IO::String: '0' IPC::Run: '0' LWP::UserAgent: '0' List::MoreUtils: '0' List::Util: '0' Math::BigFloat: '0' Module::Build: '0' POSIX: '0' Pod::Usage: '0' Scalar::Util: '0' Set::Scalar: '0' Storable: '0' Symbol: '0' Test::Builder: '0' Test::Builder::Module: '0' Test::Most: '0' Test::RequiresInternet: '0' Text::Balanced: '0' Text::Wrap: '0' Tie::Handle: '0' Tie::RefHash: '0' Time::HiRes: '0' UNIVERSAL: '0' URI: '0' XML::DOM: '0' XML::LibXML: '0' XML::LibXML::Reader: '0' XML::Parser::PerlSAX: '0' XML::SAX: '0' XML::SAX::Base: '0' XML::SAX::Writer: '0' XML::Twig: '0' XML::Writer: '0' YAML: '0' base: '0' constant: '0' integer: '0' overload: '0' parent: '0' strict: '0' utf8: '0' vars: '0' version: '0' warnings: '0' resources: bugtracker: https://github.com/bioperl/bioperl-live/issues homepage: https://metacpan.org/release/BioPerl repository: git://github.com/bioperl/bioperl-live.git version: 1.7.8 x_Dist_Zilla: perl: version: '5.033001' plugins: - class: Dist::Zilla::Plugin::GatherDir config: Dist::Zilla::Plugin::GatherDir: exclude_filename: [] exclude_match: [] follow_symlinks: 0 include_dotfiles: 0 prefix: '' prune_directory: [] root: . 
name: '@BioPerl/@Filter/GatherDir' version: '6.017' - class: Dist::Zilla::Plugin::PruneCruft name: '@BioPerl/@Filter/PruneCruft' version: '6.017' - class: Dist::Zilla::Plugin::ManifestSkip name: '@BioPerl/@Filter/ManifestSkip' version: '6.017' - class: Dist::Zilla::Plugin::MetaYAML name: '@BioPerl/@Filter/MetaYAML' version: '6.017' - class: Dist::Zilla::Plugin::License name: '@BioPerl/@Filter/License' version: '6.017' - class: Dist::Zilla::Plugin::ExtraTests name: '@BioPerl/@Filter/ExtraTests' version: '6.017' - class: Dist::Zilla::Plugin::ExecDir name: '@BioPerl/@Filter/ExecDir' version: '6.017' - class: Dist::Zilla::Plugin::ShareDir name: '@BioPerl/@Filter/ShareDir' version: '6.017' - class: Dist::Zilla::Plugin::MakeMaker config: Dist::Zilla::Role::TestRunner: default_jobs: 1 name: '@BioPerl/@Filter/MakeMaker' version: '6.017' - class: Dist::Zilla::Plugin::Manifest name: '@BioPerl/@Filter/Manifest' version: '6.017' - class: Dist::Zilla::Plugin::TestRelease name: '@BioPerl/@Filter/TestRelease' version: '6.017' - class: Dist::Zilla::Plugin::ConfirmRelease name: '@BioPerl/@Filter/ConfirmRelease' version: '6.017' - class: Dist::Zilla::Plugin::UploadToCPAN name: '@BioPerl/@Filter/UploadToCPAN' version: '6.017' - class: Dist::Zilla::Plugin::MetaConfig name: '@BioPerl/MetaConfig' version: '6.017' - class: Dist::Zilla::Plugin::MetaJSON name: '@BioPerl/MetaJSON' version: '6.017' - class: Dist::Zilla::Plugin::PkgVersion name: '@BioPerl/PkgVersion' version: '6.017' - class: Dist::Zilla::Plugin::PodSyntaxTests name: '@BioPerl/PodSyntaxTests' version: '6.017' - class: Dist::Zilla::Plugin::Test::Compile config: Dist::Zilla::Plugin::Test::Compile: bail_out_on_fail: '0' fail_on_warning: author fake_home: 0 filename: t/00-compile.t module_finder: - ':InstallModules' needs_display: 0 phase: test script_finder: - ':PerlExecFiles' skips: [] switch: [] name: '@BioPerl/Test::Compile' version: '2.058' - class: Dist::Zilla::Plugin::MojibakeTests name: '@BioPerl/MojibakeTests' version: 
'0.8' - class: Dist::Zilla::Plugin::AutoPrereqs name: '@BioPerl/AutoPrereqs' version: '6.017' - class: Dist::Zilla::Plugin::MetaResources name: '@BioPerl/MetaResources' version: '6.017' - class: Dist::Zilla::Plugin::Encoding name: '@BioPerl/Encoding' version: '6.017' - class: Dist::Zilla::Plugin::NextRelease name: '@BioPerl/NextRelease' version: '6.017' - class: Dist::Zilla::Plugin::Git::Check config: Dist::Zilla::Plugin::Git::Check: untracked_files: die Dist::Zilla::Role::Git::DirtyFiles: allow_dirty: - Changes - dist.ini allow_dirty_match: [] changelog: Changes Dist::Zilla::Role::Git::Repo: git_version: 2.28.0 repo_root: . name: '@BioPerl/Git::Check' version: '2.047' - class: Dist::Zilla::Plugin::Git::Commit config: Dist::Zilla::Plugin::Git::Commit: add_files_in: [] commit_msg: v%V%n%n%c signoff: '0' Dist::Zilla::Role::Git::DirtyFiles: allow_dirty: - Changes - dist.ini allow_dirty_match: [] changelog: Changes Dist::Zilla::Role::Git::Repo: git_version: 2.28.0 repo_root: . Dist::Zilla::Role::Git::StringFormatter: time_zone: local name: '@BioPerl/Git::Commit' version: '2.047' - class: Dist::Zilla::Plugin::Git::Tag config: Dist::Zilla::Plugin::Git::Tag: branch: ~ changelog: Changes signed: 0 tag: BioPerl-v1.7.8 tag_format: '%N-v%v' tag_message: '%N-v%v' Dist::Zilla::Role::Git::Repo: git_version: 2.28.0 repo_root: . 
Dist::Zilla::Role::Git::StringFormatter: time_zone: local name: '@BioPerl/Git::Tag' version: '2.047' - class: Dist::Zilla::Plugin::OSPrereqs config: Dist::Zilla::Plugin::OSPrereqs: os: MSWin32 name: MSWin32 version: '0.011' - class: Dist::Zilla::Plugin::Prereqs config: Dist::Zilla::Plugin::Prereqs: phase: runtime type: suggests name: bin_bp_fetch_RuntimeSuggests version: '6.017' - class: Dist::Zilla::Plugin::FileFinder::ByName name: PodWeaver-Ready version: '6.017' - class: Dist::Zilla::Plugin::PodWeaver config: Dist::Zilla::Plugin::PodWeaver: config_plugins: - '@BioPerl' finder: - PodWeaver-Ready plugins: - class: Pod::Weaver::Plugin::EnsurePod5 name: '@CorePrep/EnsurePod5' version: '4.015' - class: Pod::Weaver::Plugin::H1Nester name: '@CorePrep/H1Nester' version: '4.015' - class: Pod::Weaver::Section::Name name: '@BioPerl/Name' version: '4.015' - class: Pod::Weaver::Section::Version name: '@BioPerl/Version' version: '4.015' - class: Pod::Weaver::Section::Region name: '@BioPerl/prelude' version: '4.015' - class: Pod::Weaver::Section::Generic name: SYNOPSIS version: '4.015' - class: Pod::Weaver::Section::Generic name: DESCRIPTION version: '4.015' - class: Pod::Weaver::Section::Generic name: OVERVIEW version: '4.015' - class: Pod::Weaver::Section::Collect name: ATTRIBUTES version: '4.015' - class: Pod::Weaver::Section::Collect name: METHODS version: '4.015' - class: Pod::Weaver::Section::Collect name: FUNCTIONS version: '4.015' - class: Pod::Weaver::Section::Collect name: 'INTERNAL METHODS' version: '4.015' - class: Pod::Weaver::Section::Leftovers name: '@BioPerl/Leftovers' version: '4.015' - class: Pod::Weaver::Section::Region name: '@BioPerl/postlude' version: '4.015' - class: Pod::Weaver::Section::GenerateSection name: FEEDBACK version: '1.06' - class: Pod::Weaver::Section::GenerateSection name: 'Mailing lists' version: '1.06' - class: Pod::Weaver::Section::GenerateSection name: Support version: '1.06' - class: Pod::Weaver::Section::GenerateSection name: 
'Reporting bugs' version: '1.06' - class: Pod::Weaver::Section::Legal::Complicated name: '@BioPerl/Legal' version: '1.23' - class: Pod::Weaver::Section::Contributors name: '@BioPerl/Contributors' version: '0.009' - class: Pod::Weaver::Plugin::SingleEncoding name: SingleEncoding version: '4.015' - class: Pod::Weaver::Plugin::Transformer name: '@BioPerl/List' version: '4.015' - class: Pod::Weaver::Plugin::EnsureUniqueSections name: EnsureUniqueSections version: '0.163250' name: PodWeaver version: '4.008' - class: Dist::Zilla::Plugin::MetaResources name: MetaResources version: '6.017' - class: Dist::Zilla::Plugin::PruneFiles name: PruneFiles version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':InstallModules' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':IncModules' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':TestFiles' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':ExtraTestFiles' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':ExecFiles' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':PerlExecFiles' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':ShareFiles' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':MainModule' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':AllFiles' version: '6.017' - class: Dist::Zilla::Plugin::FinderCode name: ':NoFiles' version: '6.017' zilla: class: Dist::Zilla::Dist::Builder config: is_trial: '0' version: '6.017' x_generated_by_perl: v5.33.1 x_serialization_backend: 'YAML::Tiny version 1.73' x_spdx_expression: 'Artistic-1.0-Perl OR GPL-1.0-or-later' MANIFEST100644000766000024 10702114006427353 14412 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8# This file was automatically generated by Dist::Zilla::Plugin::Manifest v6.017. 
AUTHORS Changes HACKING.md LICENSE MANIFEST META.json META.yml Makefile.PL README.md bin/bp_aacomp bin/bp_bioflat_index bin/bp_biogetseq bin/bp_dbsplit bin/bp_extract_feature_seq bin/bp_fastam9_to_table bin/bp_fetch bin/bp_filter_search bin/bp_find-blast-matches bin/bp_gccalc bin/bp_genbank2gff3 bin/bp_index bin/bp_local_taxonomydb_query bin/bp_make_mrna_protein bin/bp_mask_by_search bin/bp_mrtrans bin/bp_mutate bin/bp_nexus2nh bin/bp_nrdb bin/bp_oligo_count bin/bp_process_gadfly bin/bp_process_sgd bin/bp_revtrans-motif bin/bp_search2alnblocks bin/bp_search2gff bin/bp_search2table bin/bp_search2tribe bin/bp_seq_length bin/bp_seqconvert bin/bp_seqcut bin/bp_seqpart bin/bp_seqret bin/bp_seqretsplit bin/bp_split_seq bin/bp_sreformat bin/bp_taxid4species bin/bp_taxonomy2tree bin/bp_translate_seq bin/bp_tree2pag bin/bp_unflatten_seq dist.ini examples/Bio-DB-GFF/load_ucsc.pl examples/align/FastAlign.pl examples/align/align_on_codons.pl examples/align/aligntutorial.pl examples/align/simplealign.pl examples/classify_hits_kingdom examples/contributed/nmrpdb_parse.pl examples/contributed/prosite2perl.pl examples/contributed/rebase2list.pl examples/db/est_tissue_query.pl examples/db/get_seqs.pl examples/db/rfetch.pl examples/db/use_registry.pl examples/generate_random_seq.pl examples/longorf.pl examples/make_primers.pl examples/quality/svgtrace.pl examples/rev_and_trans.pl examples/revcom_dir.pl examples/root/README examples/root/exceptions1.pl examples/root/exceptions2.pl examples/root/exceptions3.pl examples/root/exceptions4.pl examples/searchio/blast_example.pl examples/searchio/custom_writer.pl examples/searchio/hitwriter.pl examples/searchio/hspwriter.pl examples/searchio/htmlwriter.pl examples/searchio/psiblast_features.pl examples/searchio/psiblast_iterations.pl examples/searchio/rawwriter.pl examples/searchio/resultwriter.pl examples/searchio/waba2gff.pl examples/searchio/waba2gff3.pl examples/tk/gsequence.pl examples/tk/hitdisplay.pl examples/tools/extract_genes.pl 
examples/tools/gb_to_gff.pl examples/tools/gff2ps.pl examples/tools/parse_codeml.pl examples/tools/reverse-translate.pl examples/tools/run_genscan.pl examples/tools/run_primer3.pl examples/tools/seq_pattern.pl examples/tools/standaloneblast.pl examples/tree/paup2phylip.pl lib/Bio/Align/AlignI.pm lib/Bio/Align/DNAStatistics.pm lib/Bio/Align/PairwiseStatistics.pm lib/Bio/Align/ProteinStatistics.pm lib/Bio/Align/StatisticsI.pm lib/Bio/Align/Utilities.pm lib/Bio/AlignIO.pm lib/Bio/AlignIO/Handler/GenericAlignHandler.pm lib/Bio/AlignIO/arp.pm lib/Bio/AlignIO/bl2seq.pm lib/Bio/AlignIO/clustalw.pm lib/Bio/AlignIO/emboss.pm lib/Bio/AlignIO/fasta.pm lib/Bio/AlignIO/largemultifasta.pm lib/Bio/AlignIO/maf.pm lib/Bio/AlignIO/mase.pm lib/Bio/AlignIO/mega.pm lib/Bio/AlignIO/meme.pm lib/Bio/AlignIO/metafasta.pm lib/Bio/AlignIO/msf.pm lib/Bio/AlignIO/nexus.pm lib/Bio/AlignIO/pfam.pm lib/Bio/AlignIO/phylip.pm lib/Bio/AlignIO/po.pm lib/Bio/AlignIO/proda.pm lib/Bio/AlignIO/prodom.pm lib/Bio/AlignIO/psi.pm lib/Bio/AlignIO/selex.pm lib/Bio/AlignIO/xmfa.pm lib/Bio/AnalysisI.pm lib/Bio/AnalysisParserI.pm lib/Bio/AnalysisResultI.pm lib/Bio/AnnotatableI.pm lib/Bio/Annotation/AnnotationFactory.pm lib/Bio/Annotation/Collection.pm lib/Bio/Annotation/Comment.pm lib/Bio/Annotation/DBLink.pm lib/Bio/Annotation/OntologyTerm.pm lib/Bio/Annotation/Reference.pm lib/Bio/Annotation/Relation.pm lib/Bio/Annotation/SimpleValue.pm lib/Bio/Annotation/StructuredValue.pm lib/Bio/Annotation/TagTree.pm lib/Bio/Annotation/Target.pm lib/Bio/Annotation/Tree.pm lib/Bio/Annotation/TypeManager.pm lib/Bio/AnnotationCollectionI.pm lib/Bio/AnnotationI.pm lib/Bio/CodonUsage/IO.pm lib/Bio/CodonUsage/Table.pm lib/Bio/DB/DBFetch.pm lib/Bio/DB/Failover.pm lib/Bio/DB/Fasta.pm lib/Bio/DB/FileCache.pm lib/Bio/DB/Flat.pm lib/Bio/DB/Flat/BDB.pm lib/Bio/DB/Flat/BDB/embl.pm lib/Bio/DB/Flat/BDB/fasta.pm lib/Bio/DB/Flat/BDB/genbank.pm lib/Bio/DB/Flat/BDB/swiss.pm lib/Bio/DB/Flat/BinarySearch.pm lib/Bio/DB/GFF/Util/Binning.pm 
lib/Bio/DB/GFF/Util/Rearrange.pm lib/Bio/DB/GenericWebAgent.pm lib/Bio/DB/InMemoryCache.pm lib/Bio/DB/IndexedBase.pm lib/Bio/DB/LocationI.pm lib/Bio/DB/Qual.pm lib/Bio/DB/Query/WebQuery.pm lib/Bio/DB/QueryI.pm lib/Bio/DB/RandomAccessI.pm lib/Bio/DB/ReferenceI.pm lib/Bio/DB/Registry.pm lib/Bio/DB/SeqI.pm lib/Bio/DB/Taxonomy.pm lib/Bio/DB/Taxonomy/flatfile.pm lib/Bio/DB/Taxonomy/greengenes.pm lib/Bio/DB/Taxonomy/list.pm lib/Bio/DB/Taxonomy/silva.pm lib/Bio/DB/UpdateableSeqI.pm lib/Bio/DB/WebDBSeqI.pm lib/Bio/DBLinkContainerI.pm lib/Bio/Das/FeatureTypeI.pm lib/Bio/Das/SegmentI.pm lib/Bio/DasI.pm lib/Bio/DescribableI.pm lib/Bio/Event/EventGeneratorI.pm lib/Bio/Event/EventHandlerI.pm lib/Bio/Factory/AnalysisI.pm lib/Bio/Factory/ApplicationFactoryI.pm lib/Bio/Factory/DriverFactory.pm lib/Bio/Factory/FTLocationFactory.pm lib/Bio/Factory/LocationFactoryI.pm lib/Bio/Factory/ObjectBuilderI.pm lib/Bio/Factory/ObjectFactory.pm lib/Bio/Factory/ObjectFactoryI.pm lib/Bio/Factory/SeqAnalysisParserFactory.pm lib/Bio/Factory/SeqAnalysisParserFactoryI.pm lib/Bio/Factory/SequenceFactoryI.pm lib/Bio/Factory/SequenceProcessorI.pm lib/Bio/Factory/SequenceStreamI.pm lib/Bio/Factory/TreeFactoryI.pm lib/Bio/FeatureHolderI.pm lib/Bio/HandlerBaseI.pm lib/Bio/IdCollectionI.pm lib/Bio/IdentifiableI.pm lib/Bio/Index/Abstract.pm lib/Bio/Index/AbstractSeq.pm lib/Bio/Index/Blast.pm lib/Bio/Index/BlastTable.pm lib/Bio/Index/EMBL.pm lib/Bio/Index/Fasta.pm lib/Bio/Index/Fastq.pm lib/Bio/Index/GenBank.pm lib/Bio/Index/Qual.pm lib/Bio/Index/SwissPfam.pm lib/Bio/Index/Swissprot.pm lib/Bio/LocatableSeq.pm lib/Bio/Location/Atomic.pm lib/Bio/Location/AvWithinCoordPolicy.pm lib/Bio/Location/CoordinatePolicyI.pm lib/Bio/Location/Fuzzy.pm lib/Bio/Location/FuzzyLocationI.pm lib/Bio/Location/NarrowestCoordPolicy.pm lib/Bio/Location/Simple.pm lib/Bio/Location/Split.pm lib/Bio/Location/SplitLocationI.pm lib/Bio/Location/WidestCoordPolicy.pm lib/Bio/LocationI.pm lib/Bio/Matrix/Generic.pm lib/Bio/Matrix/IO.pm 
lib/Bio/Matrix/IO/mlagan.pm lib/Bio/Matrix/IO/phylip.pm lib/Bio/Matrix/IO/scoring.pm lib/Bio/Matrix/MatrixI.pm lib/Bio/Matrix/Mlagan.pm lib/Bio/Matrix/PSM/IO.pm lib/Bio/Matrix/PSM/IO/mast.pm lib/Bio/Matrix/PSM/IO/masta.pm lib/Bio/Matrix/PSM/IO/meme.pm lib/Bio/Matrix/PSM/IO/psiblast.pm lib/Bio/Matrix/PSM/IO/transfac.pm lib/Bio/Matrix/PSM/InstanceSite.pm lib/Bio/Matrix/PSM/InstanceSiteI.pm lib/Bio/Matrix/PSM/ProtMatrix.pm lib/Bio/Matrix/PSM/ProtPsm.pm lib/Bio/Matrix/PSM/Psm.pm lib/Bio/Matrix/PSM/PsmHeader.pm lib/Bio/Matrix/PSM/PsmHeaderI.pm lib/Bio/Matrix/PSM/PsmI.pm lib/Bio/Matrix/PSM/SiteMatrix.pm lib/Bio/Matrix/PSM/SiteMatrixI.pm lib/Bio/Matrix/PhylipDist.pm lib/Bio/Matrix/Scoring.pm lib/Bio/Ontology/DocumentRegistry.pm lib/Bio/Ontology/GOterm.pm lib/Bio/Ontology/InterProTerm.pm lib/Bio/Ontology/OBOEngine.pm lib/Bio/Ontology/OBOterm.pm lib/Bio/Ontology/Ontology.pm lib/Bio/Ontology/OntologyEngineI.pm lib/Bio/Ontology/OntologyI.pm lib/Bio/Ontology/OntologyStore.pm lib/Bio/Ontology/Path.pm lib/Bio/Ontology/PathI.pm lib/Bio/Ontology/Relationship.pm lib/Bio/Ontology/RelationshipFactory.pm lib/Bio/Ontology/RelationshipI.pm lib/Bio/Ontology/RelationshipType.pm lib/Bio/Ontology/SimpleGOEngine/GraphAdaptor.pm lib/Bio/Ontology/SimpleOntologyEngine.pm lib/Bio/Ontology/Term.pm lib/Bio/Ontology/TermFactory.pm lib/Bio/Ontology/TermI.pm lib/Bio/OntologyIO.pm lib/Bio/OntologyIO/Handlers/BaseSAXHandler.pm lib/Bio/OntologyIO/Handlers/InterProHandler.pm lib/Bio/OntologyIO/Handlers/InterPro_BioSQL_Handler.pm lib/Bio/OntologyIO/InterProParser.pm lib/Bio/OntologyIO/dagflat.pm lib/Bio/OntologyIO/goflat.pm lib/Bio/OntologyIO/obo.pm lib/Bio/OntologyIO/simplehierarchy.pm lib/Bio/OntologyIO/soflat.pm lib/Bio/ParameterBaseI.pm lib/Bio/PrimarySeq.pm lib/Bio/PrimarySeqI.pm lib/Bio/PullParserI.pm lib/Bio/Range.pm lib/Bio/RangeI.pm lib/Bio/Root/Exception.pm lib/Bio/Root/HTTPget.pm lib/Bio/Root/IO.pm lib/Bio/Root/Root.pm lib/Bio/Root/RootI.pm lib/Bio/Root/Storable.pm lib/Bio/Root/Test.pm 
lib/Bio/Root/TestObject.pm lib/Bio/Root/Utilities.pm lib/Bio/Root/Version.pm lib/Bio/Search/BlastStatistics.pm lib/Bio/Search/BlastUtils.pm lib/Bio/Search/DatabaseI.pm lib/Bio/Search/GenericDatabase.pm lib/Bio/Search/GenericStatistics.pm lib/Bio/Search/HSP/BlastHSP.pm lib/Bio/Search/HSP/BlastPullHSP.pm lib/Bio/Search/HSP/FastaHSP.pm lib/Bio/Search/HSP/GenericHSP.pm lib/Bio/Search/HSP/HSPFactory.pm lib/Bio/Search/HSP/HSPI.pm lib/Bio/Search/HSP/ModelHSP.pm lib/Bio/Search/HSP/PSLHSP.pm lib/Bio/Search/HSP/PsiBlastHSP.pm lib/Bio/Search/HSP/PullHSPI.pm lib/Bio/Search/HSP/WABAHSP.pm lib/Bio/Search/Hit/BlastHit.pm lib/Bio/Search/Hit/BlastPullHit.pm lib/Bio/Search/Hit/Fasta.pm lib/Bio/Search/Hit/GenericHit.pm lib/Bio/Search/Hit/HitFactory.pm lib/Bio/Search/Hit/HitI.pm lib/Bio/Search/Hit/ModelHit.pm lib/Bio/Search/Hit/PsiBlastHit.pm lib/Bio/Search/Hit/PullHitI.pm lib/Bio/Search/Iteration/GenericIteration.pm lib/Bio/Search/Iteration/IterationI.pm lib/Bio/Search/Processor.pm lib/Bio/Search/Result/BlastPullResult.pm lib/Bio/Search/Result/BlastResult.pm lib/Bio/Search/Result/CrossMatchResult.pm lib/Bio/Search/Result/GenericResult.pm lib/Bio/Search/Result/INFERNALResult.pm lib/Bio/Search/Result/PullResultI.pm lib/Bio/Search/Result/ResultFactory.pm lib/Bio/Search/Result/ResultI.pm lib/Bio/Search/Result/WABAResult.pm lib/Bio/Search/SearchUtils.pm lib/Bio/Search/StatisticsI.pm lib/Bio/Search/Tiling/MapTileUtils.pm lib/Bio/Search/Tiling/MapTiling.pm lib/Bio/Search/Tiling/TilingI.pm lib/Bio/SearchIO.pm lib/Bio/SearchIO/EventHandlerI.pm lib/Bio/SearchIO/FastHitEventBuilder.pm lib/Bio/SearchIO/IteratedSearchResultEventBuilder.pm lib/Bio/SearchIO/SearchResultEventBuilder.pm lib/Bio/SearchIO/SearchWriterI.pm lib/Bio/SearchIO/Writer/GbrowseGFF.pm lib/Bio/SearchIO/Writer/HSPTableWriter.pm lib/Bio/SearchIO/Writer/HTMLResultWriter.pm lib/Bio/SearchIO/Writer/HitTableWriter.pm lib/Bio/SearchIO/Writer/ResultTableWriter.pm lib/Bio/SearchIO/Writer/TextResultWriter.pm lib/Bio/SearchIO/axt.pm 
lib/Bio/SearchIO/blast.pm lib/Bio/SearchIO/blast_pull.pm lib/Bio/SearchIO/blasttable.pm lib/Bio/SearchIO/cross_match.pm lib/Bio/SearchIO/erpin.pm lib/Bio/SearchIO/exonerate.pm lib/Bio/SearchIO/fasta.pm lib/Bio/SearchIO/gmap_f9.pm lib/Bio/SearchIO/infernal.pm lib/Bio/SearchIO/megablast.pm lib/Bio/SearchIO/psl.pm lib/Bio/SearchIO/rnamotif.pm lib/Bio/SearchIO/sim4.pm lib/Bio/SearchIO/waba.pm lib/Bio/SearchIO/wise.pm lib/Bio/Seq.pm lib/Bio/Seq/BaseSeqProcessor.pm lib/Bio/Seq/EncodedSeq.pm lib/Bio/Seq/LargeLocatableSeq.pm lib/Bio/Seq/LargePrimarySeq.pm lib/Bio/Seq/LargeSeq.pm lib/Bio/Seq/LargeSeqI.pm lib/Bio/Seq/Meta.pm lib/Bio/Seq/Meta/Array.pm lib/Bio/Seq/MetaI.pm lib/Bio/Seq/PrimaryQual.pm lib/Bio/Seq/PrimedSeq.pm lib/Bio/Seq/QualI.pm lib/Bio/Seq/Quality.pm lib/Bio/Seq/RichSeq.pm lib/Bio/Seq/RichSeqI.pm lib/Bio/Seq/SeqBuilder.pm lib/Bio/Seq/SeqFactory.pm lib/Bio/Seq/SeqFastaSpeedFactory.pm lib/Bio/Seq/SequenceTrace.pm lib/Bio/Seq/SimulatedRead.pm lib/Bio/Seq/TraceI.pm lib/Bio/SeqAnalysisParserI.pm lib/Bio/SeqFeature/Amplicon.pm lib/Bio/SeqFeature/AnnotationAdaptor.pm lib/Bio/SeqFeature/Collection.pm lib/Bio/SeqFeature/CollectionI.pm lib/Bio/SeqFeature/Computation.pm lib/Bio/SeqFeature/FeaturePair.pm lib/Bio/SeqFeature/Gene/Exon.pm lib/Bio/SeqFeature/Gene/ExonI.pm lib/Bio/SeqFeature/Gene/GeneStructure.pm lib/Bio/SeqFeature/Gene/GeneStructureI.pm lib/Bio/SeqFeature/Gene/Intron.pm lib/Bio/SeqFeature/Gene/NC_Feature.pm lib/Bio/SeqFeature/Gene/Poly_A_site.pm lib/Bio/SeqFeature/Gene/Promoter.pm lib/Bio/SeqFeature/Gene/Transcript.pm lib/Bio/SeqFeature/Gene/TranscriptI.pm lib/Bio/SeqFeature/Gene/UTR.pm lib/Bio/SeqFeature/Generic.pm lib/Bio/SeqFeature/Lite.pm lib/Bio/SeqFeature/PositionProxy.pm lib/Bio/SeqFeature/Primer.pm lib/Bio/SeqFeature/Similarity.pm lib/Bio/SeqFeature/SimilarityPair.pm lib/Bio/SeqFeature/SubSeq.pm lib/Bio/SeqFeature/Tools/FeatureNamer.pm lib/Bio/SeqFeature/Tools/IDHandler.pm lib/Bio/SeqFeature/Tools/TypeMapper.pm lib/Bio/SeqFeature/Tools/Unflattener.pm 
lib/Bio/SeqFeature/TypedSeqFeatureI.pm lib/Bio/SeqFeatureI.pm lib/Bio/SeqI.pm lib/Bio/SeqIO.pm lib/Bio/SeqIO/FTHelper.pm lib/Bio/SeqIO/Handler/GenericRichSeqHandler.pm lib/Bio/SeqIO/MultiFile.pm lib/Bio/SeqIO/ace.pm lib/Bio/SeqIO/asciitree.pm lib/Bio/SeqIO/bsml.pm lib/Bio/SeqIO/bsml_sax.pm lib/Bio/SeqIO/embl.pm lib/Bio/SeqIO/embldriver.pm lib/Bio/SeqIO/fasta.pm lib/Bio/SeqIO/fastq.pm lib/Bio/SeqIO/game.pm lib/Bio/SeqIO/game/featHandler.pm lib/Bio/SeqIO/game/gameHandler.pm lib/Bio/SeqIO/game/gameSubs.pm lib/Bio/SeqIO/game/gameWriter.pm lib/Bio/SeqIO/game/seqHandler.pm lib/Bio/SeqIO/gbdriver.pm lib/Bio/SeqIO/gbxml.pm lib/Bio/SeqIO/gcg.pm lib/Bio/SeqIO/genbank.pm lib/Bio/SeqIO/kegg.pm lib/Bio/SeqIO/largefasta.pm lib/Bio/SeqIO/locuslink.pm lib/Bio/SeqIO/mbsout.pm lib/Bio/SeqIO/metafasta.pm lib/Bio/SeqIO/msout.pm lib/Bio/SeqIO/phd.pm lib/Bio/SeqIO/pir.pm lib/Bio/SeqIO/qual.pm lib/Bio/SeqIO/raw.pm lib/Bio/SeqIO/scf.pm lib/Bio/SeqIO/seqxml.pm lib/Bio/SeqIO/swiss.pm lib/Bio/SeqIO/swissdriver.pm lib/Bio/SeqIO/tab.pm lib/Bio/SeqIO/table.pm lib/Bio/SeqIO/tigr.pm lib/Bio/SeqIO/tigrxml.pm lib/Bio/SeqIO/tinyseq.pm lib/Bio/SeqIO/tinyseq/tinyseqHandler.pm lib/Bio/SeqUtils.pm lib/Bio/SimpleAlign.pm lib/Bio/SimpleAnalysisI.pm lib/Bio/Species.pm lib/Bio/Taxon.pm lib/Bio/Tools/Alignment/Consed.pm lib/Bio/Tools/Alignment/Trim.pm lib/Bio/Tools/AmpliconSearch.pm lib/Bio/Tools/Analysis/SimpleAnalysisBase.pm lib/Bio/Tools/AnalysisResult.pm lib/Bio/Tools/Blat.pm lib/Bio/Tools/CodonTable.pm lib/Bio/Tools/Coil.pm lib/Bio/Tools/ECnumber.pm lib/Bio/Tools/EMBOSS/Palindrome.pm lib/Bio/Tools/EPCR.pm lib/Bio/Tools/ESTScan.pm lib/Bio/Tools/Eponine.pm lib/Bio/Tools/Est2Genome.pm lib/Bio/Tools/Fgenesh.pm lib/Bio/Tools/FootPrinter.pm lib/Bio/Tools/GFF.pm lib/Bio/Tools/Geneid.pm lib/Bio/Tools/Genemark.pm lib/Bio/Tools/Genewise.pm lib/Bio/Tools/Genomewise.pm lib/Bio/Tools/Genscan.pm lib/Bio/Tools/Glimmer.pm lib/Bio/Tools/Grail.pm lib/Bio/Tools/GuessSeqFormat.pm lib/Bio/Tools/IUPAC.pm 
lib/Bio/Tools/Lucy.pm lib/Bio/Tools/MZEF.pm lib/Bio/Tools/Match.pm lib/Bio/Tools/OddCodes.pm lib/Bio/Tools/Phylo/Gerp.pm lib/Bio/Tools/Phylo/Molphy.pm lib/Bio/Tools/Phylo/Molphy/Result.pm lib/Bio/Tools/Phylo/Phylip/ProtDist.pm lib/Bio/Tools/Prediction/Exon.pm lib/Bio/Tools/Prediction/Gene.pm lib/Bio/Tools/Primer/Assessor/Base.pm lib/Bio/Tools/Primer/AssessorI.pm lib/Bio/Tools/Primer/Feature.pm lib/Bio/Tools/Primer/Pair.pm lib/Bio/Tools/Primer3.pm lib/Bio/Tools/Prints.pm lib/Bio/Tools/Profile.pm lib/Bio/Tools/Promoterwise.pm lib/Bio/Tools/PrositeScan.pm lib/Bio/Tools/Pseudowise.pm lib/Bio/Tools/QRNA.pm lib/Bio/Tools/RandomDistFunctions.pm lib/Bio/Tools/RepeatMasker.pm lib/Bio/Tools/Run/Analysis.pm lib/Bio/Tools/Run/AnalysisFactory.pm lib/Bio/Tools/Run/GenericParameters.pm lib/Bio/Tools/Run/ParametersI.pm lib/Bio/Tools/Run/Phylo/PhyloBase.pm lib/Bio/Tools/Run/WrapperBase.pm lib/Bio/Tools/Run/WrapperBase/CommandExts.pm lib/Bio/Tools/Seg.pm lib/Bio/Tools/SeqPattern.pm lib/Bio/Tools/SeqPattern/Backtranslate.pm lib/Bio/Tools/SeqStats.pm lib/Bio/Tools/SeqWords.pm lib/Bio/Tools/Sigcleave.pm lib/Bio/Tools/Signalp.pm lib/Bio/Tools/Signalp/ExtendedSignalp.pm lib/Bio/Tools/Sim4/Exon.pm lib/Bio/Tools/Sim4/Results.pm lib/Bio/Tools/Spidey/Exon.pm lib/Bio/Tools/Spidey/Results.pm lib/Bio/Tools/TandemRepeatsFinder.pm lib/Bio/Tools/TargetP.pm lib/Bio/Tools/Tmhmm.pm lib/Bio/Tools/ipcress.pm lib/Bio/Tools/isPcr.pm lib/Bio/Tools/pICalculator.pm lib/Bio/Tools/tRNAscanSE.pm lib/Bio/Tree/AnnotatableNode.pm lib/Bio/Tree/Compatible.pm lib/Bio/Tree/DistanceFactory.pm lib/Bio/Tree/Node.pm lib/Bio/Tree/NodeI.pm lib/Bio/Tree/NodeNHX.pm lib/Bio/Tree/RandomFactory.pm lib/Bio/Tree/Statistics.pm lib/Bio/Tree/Tree.pm lib/Bio/Tree/TreeFunctionsI.pm lib/Bio/Tree/TreeI.pm lib/Bio/TreeIO.pm lib/Bio/TreeIO/NewickParser.pm lib/Bio/TreeIO/TreeEventBuilder.pm lib/Bio/TreeIO/cluster.pm lib/Bio/TreeIO/lintree.pm lib/Bio/TreeIO/newick.pm lib/Bio/TreeIO/nexus.pm lib/Bio/TreeIO/nhx.pm lib/Bio/TreeIO/pag.pm 
lib/Bio/TreeIO/phyloxml.pm lib/Bio/TreeIO/tabtree.pm lib/Bio/UpdateableSeqI.pm lib/Bio/WebAgent.pm lib/BioPerl.pm t/00-compile.t t/Align/AlignStats.t t/Align/AlignUtil.t t/Align/SimpleAlign.t t/Align/TreeBuild.t t/Align/Utilities.t t/AlignIO/AlignIO.t t/AlignIO/arp.t t/AlignIO/bl2seq.t t/AlignIO/clustalw.t t/AlignIO/emboss.t t/AlignIO/fasta.t t/AlignIO/largemultifasta.t t/AlignIO/maf.t t/AlignIO/mase.t t/AlignIO/mega.t t/AlignIO/meme.t t/AlignIO/metafasta.t t/AlignIO/msf.t t/AlignIO/nexus.t t/AlignIO/pfam.t t/AlignIO/phylip.t t/AlignIO/po.t t/AlignIO/prodom.t t/AlignIO/psi.t t/AlignIO/selex.t t/AlignIO/xmfa.t t/Annotation/Annotation.t t/Annotation/AnnotationAdaptor.t t/LocalDB/Fasta.t t/LocalDB/Flat.t t/LocalDB/Index/Blast.t t/LocalDB/Index/BlastTable.t t/LocalDB/Index/Index.t t/LocalDB/Qual.t t/LocalDB/Registry.t t/LocalDB/Taxonomy/greengenes.t t/LocalDB/Taxonomy/silva.t t/Matrix/IO/masta.t t/Matrix/IO/psm.t t/Matrix/InstanceSite.t t/Matrix/Matrix.t t/Matrix/ProtMatrix.t t/Matrix/ProtPsm.t t/Matrix/SiteMatrix.t t/Ontology/GOterm.t t/Ontology/GraphAdaptor.t t/Ontology/IO/go.t t/Ontology/IO/interpro.t t/Ontology/IO/obo.t t/Ontology/Ontology.t t/Ontology/OntologyEngine.t t/Ontology/OntologyStore.t t/Ontology/Relationship.t t/Ontology/RelationshipType.t t/Ontology/Term.t t/RemoteDB/SeqRead_fail.t t/RemoteDB/Taxonomy.t t/Root/Exception.t t/Root/HTTPget.t t/Root/IO.t t/Root/RootI.t t/Root/RootIO.t t/Root/Storable.t t/Root/Utilities.t t/SearchIO/CigarString.t t/SearchIO/SearchIO.t t/SearchIO/SimilarityPair.t t/SearchIO/Tiling.t t/SearchIO/Writer/GbrowseGFF.t t/SearchIO/Writer/HSPTableWriter.t t/SearchIO/Writer/HTMLWriter.t t/SearchIO/Writer/HitTableWriter.t t/SearchIO/Writer/TextWriter.t t/SearchIO/axt.t t/SearchIO/blast.t t/SearchIO/blast_pull.t t/SearchIO/blasttable.t t/SearchIO/cross_match.t t/SearchIO/erpin.t t/SearchIO/exonerate.t t/SearchIO/fasta.t t/SearchIO/gmap_f9.t t/SearchIO/infernal.t t/SearchIO/megablast.t t/SearchIO/psl.t t/SearchIO/rnamotif.t 
t/SearchIO/sim4.t t/SearchIO/waba.t t/SearchIO/wise.t t/Seq/DBLink.t t/Seq/EncodedSeq.t t/Seq/LargeLocatableSeq.t t/Seq/LargePSeq.t t/Seq/LocatableSeq.t t/Seq/MetaSeq.t t/Seq/PrimaryQual.t t/Seq/PrimarySeq.t t/Seq/PrimedSeq.t t/Seq/Quality.t t/Seq/Seq.t t/Seq/SimulatedRead.t t/SeqFeature/Amplicon.t t/SeqFeature/Clone.t t/SeqFeature/Collection.t t/SeqFeature/Computation.t t/SeqFeature/FeaturePair.t t/SeqFeature/Gene.t t/SeqFeature/Generic.t t/SeqFeature/Location.t t/SeqFeature/LocationFactory.t t/SeqFeature/Primer.t t/SeqFeature/Range.t t/SeqFeature/RangeI.t t/SeqFeature/SeqAnalysisParser.t t/SeqFeature/SubSeq.t t/SeqFeature/Unflattener.t t/SeqIO/Handler.t t/SeqIO/MultiFile.t t/SeqIO/Multiple_fasta.t t/SeqIO/SeqBuilder.t t/SeqIO/SeqIO.t t/SeqIO/Splicedseq.t t/SeqIO/ace.t t/SeqIO/asciitree.t t/SeqIO/bsml.t t/SeqIO/bsml_sax.t t/SeqIO/embl.t t/SeqIO/fasta.t t/SeqIO/fastq.t t/SeqIO/game.t t/SeqIO/gbxml.t t/SeqIO/gcg.t t/SeqIO/genbank.t t/SeqIO/kegg.t t/SeqIO/largefasta.t t/SeqIO/locuslink.t t/SeqIO/mbsout.t t/SeqIO/metafasta.t t/SeqIO/msout.t t/SeqIO/phd.t t/SeqIO/pir.t t/SeqIO/qual.t t/SeqIO/raw.t t/SeqIO/scf.t t/SeqIO/seqxml.t t/SeqIO/swiss.t t/SeqIO/tab.t t/SeqIO/table.t t/SeqIO/tigr.t t/SeqIO/tigrxml.t t/SeqIO/tinyseq.t t/SeqTools/Backtranslate.t t/SeqTools/CodonTable.t t/SeqTools/ECnumber.t t/SeqTools/GuessSeqFormat.t t/SeqTools/OddCodes.t t/SeqTools/SeqPattern.t t/SeqTools/SeqStats.t t/SeqTools/SeqUtils.t t/SeqTools/SeqWords.t t/Species.t t/Tools/Alignment/Consed.t t/Tools/AmpliconSearch.t t/Tools/EMBOSS/Palindrome.t t/Tools/Est2Genome.t t/Tools/FootPrinter.t t/Tools/GFF.t t/Tools/Geneid.t t/Tools/Genewise.t t/Tools/Genomewise.t t/Tools/Genpred.t t/Tools/IUPAC.t t/Tools/Lucy.t t/Tools/Match.t t/Tools/Phylo/Gerp.t t/Tools/Phylo/Molphy.t t/Tools/Phylo/Phylip/ProtDist.t t/Tools/Primer3.t t/Tools/Promoterwise.t t/Tools/PrositeScan.t t/Tools/Pseudowise.t t/Tools/QRNA.t t/Tools/RandDistFunctions.t t/Tools/RepeatMasker.t t/Tools/Seg.t t/Tools/Sigcleave.t 
t/Tools/Signalp.t t/Tools/Signalp/ExtendedSignalp.t t/Tools/Sim4.t t/Tools/Spidey/Spidey.t t/Tools/TandemRepeatsFinder.t t/Tools/TargetP.t t/Tools/Tmhmm.t t/Tools/ePCR.t t/Tools/pICalculator.t t/Tools/tRNAscanSE.t t/Tree/Compatible.t t/Tree/Node.t t/Tree/RandomTreeFactory.t t/Tree/Tree.t t/Tree/TreeIO.t t/Tree/TreeIO/lintree.t t/Tree/TreeIO/newick.t t/Tree/TreeIO/nexus.t t/Tree/TreeIO/nhx.t t/Tree/TreeIO/phyloxml.t t/Tree/TreeIO/tabtree.t t/Tree/TreeStatistics.t t/author-mojibake.t t/author-pod-syntax.t t/data/13-pilE-F.scf t/data/1ZZ19XR301R-Alignment.tblastn t/data/2008.blasttable t/data/503384.MEGABLAST.0 t/data/503384.MEGABLAST.2 t/data/5X_1895.FASTXY t/data/AAC12660.fa t/data/AB077698.gb t/data/AE003528_ecoli.bls t/data/AE003644_Adh-genomic.gb t/data/AF032047.gbk t/data/AF165282.gb t/data/AF222649-rc.gbk t/data/AF305198.gb t/data/AHCYL1.kegg t/data/ATF14F8.gbk t/data/AY095303S1.gbk t/data/AY763288.gb t/data/AnnIX-v003.gbk t/data/BAB68554.gb t/data/BC000007.gbk t/data/BEL16-LTR_AG.embl t/data/BK000016-tpa.gbk t/data/BLOSUM50 t/data/BN000066-tpa.embl t/data/BOSS_DROME.FASTP_v35_04 t/data/Bird_Ovomucoids.nex t/data/D10483.gbk t/data/D12555.gbk t/data/DQ018368.gb t/data/ECAPAH02.embl t/data/EG352462.gbxml t/data/ENr111.mfa.example.elems t/data/GO.defs.test t/data/GO.defs.test2 t/data/Genscan.FastA t/data/Glimmer2.out t/data/Glimmer3.detail t/data/Glimmer3.predict t/data/GlimmerHMM.out t/data/GlimmerM.out t/data/HM138502.gbk t/data/HUMBETGLOA.FASTA t/data/HUMBETGLOA.gff t/data/HUMBETGLOA.grail t/data/HUMBETGLOA.grailexp t/data/HUMBETGLOA.mzef t/data/HUMBETGLOA.tblastx t/data/KF527485.gbk t/data/Kingdoms_DNA.nex t/data/LOAD_Ccd1.dnd t/data/M12730.gb t/data/MSGEFTUA.gb t/data/Mcjanrna_rdbII.gbk t/data/MmCT t/data/NC_000007-ribosomal-slippage.gb t/data/NC_001284.gbk t/data/NC_002058_multDBLINK_bug3375.gb t/data/NC_006346.gb t/data/NC_006511-short.gbk t/data/NC_008536.gb t/data/NT_021877.gbk t/data/ORTHOMCL2345.cluster.aa.fa.aln.aa.phy.txt t/data/O_sat.wgs 
t/data/P33897 t/data/P35527.gb t/data/P39765.gb t/data/PAM250 t/data/PX1CG.gb t/data/Primate_mtDNA.nex t/data/Q8GBD3.swiss t/data/Rab1.chaos-xml t/data/SPAN_Family4nl.nex t/data/SPAN_Family7n.nex t/data/SPAN_Family8a.nex t/data/SwissProt.dat t/data/Treebase-chlamy-dna.nex t/data/U58726.gb t/data/U71225.gb t/data/U71225.gb.mac t/data/U71225.gb.unix t/data/U71225.gb.win t/data/U83300.bsml t/data/UnaSmithHIV-both.nex t/data/X98338_Adh-mRNA.gb t/data/YP_007988852.gp t/data/ZABJ4EA7014.CH878695.1.blast.txt t/data/a_thaliana.blastn t/data/acefile.ace.1 t/data/acefile.singlets t/data/adh.mb_tree.nexus t/data/alleles.fas t/data/alnfile.fasta t/data/amino.fa t/data/amphora.newick t/data/atp1.matrix t/data/ay007676.gb t/data/ay116458.gb t/data/ay149291.gb t/data/bad_dbfa/bug3172.fa t/data/bad_dbfa/shotdb.fa t/data/badfasta.fa t/data/barns-combined.nex t/data/basic-bush.nex t/data/basic-ladder.nex t/data/biorecipe.nhx t/data/bl2seq+.blastn t/data/bl2seq.blastn t/data/bl2seq.blastn.rev t/data/bl2seq.blastx.out t/data/bl2seq.bug940.out t/data/bl2seq.out t/data/bl2seq.tblastn.out t/data/bl2seq.tblastx.out t/data/blast.report t/data/blast_no_hit_desc.txt t/data/blast_plus.blastp t/data/blastp2215.blast t/data/blat.psLayout3 t/data/blosum62.bla t/data/bootstrap.tre t/data/brassica_ATH.WUBLASTN t/data/bug1986.blast2 t/data/bug1986.blastp t/data/bug2120.phd t/data/bug2246.blast t/data/bug2391.megablast t/data/bug2399.tblastn t/data/bug2453.maf t/data/bug2473.fasta t/data/bug2862.pmr t/data/bug2869.tree t/data/bug2901.fa t/data/bug2937.fasta t/data/bug2942.blastx t/data/bug2982.embl t/data/bug2982.gb t/data/bug3021.gmap t/data/bug3086.embl t/data/c200-vs-yeast.BLASTN t/data/c200-vs-yeast.BLASTN.m8 t/data/calm.swiss t/data/catalase-webblast.BLASTP t/data/cds-266.fas t/data/cds_sample.embl t/data/chad100.scf t/data/char-interleave.nex t/data/char-matrix-spaces.nex t/data/cmsearch.multi.out t/data/cmsearch.nohit.out t/data/cmsearch_output.txt t/data/codeml45b.mlc 
t/data/component.ontology.test t/data/component.ontology.test2 t/data/consed_project/edit_dir/test_project.contigs t/data/consed_project/edit_dir/test_project.fasta t/data/consed_project/edit_dir/test_project.fasta.log t/data/consed_project/edit_dir/test_project.fasta.screen t/data/consed_project/edit_dir/test_project.fasta.screen.ace.1 t/data/consed_project/edit_dir/test_project.fasta.screen.contigs t/data/consed_project/edit_dir/test_project.fasta.screen.contigs.qual t/data/consed_project/edit_dir/test_project.fasta.screen.log t/data/consed_project/edit_dir/test_project.fasta.screen.problems t/data/consed_project/edit_dir/test_project.fasta.screen.problems.qual t/data/consed_project/edit_dir/test_project.fasta.screen.qual t/data/consed_project/edit_dir/test_project.fasta.screen.singlets t/data/consed_project/edit_dir/test_project.fasta.screen.view t/data/consed_project/edit_dir/test_project.newtags t/data/consed_project/edit_dir/test_project.screen.out t/data/consed_project/edit_dir/test_projectNewChromats.fof t/data/consed_project/edit_dir/test_project_to_alu.cross t/data/consed_project/phd_dir/ML4922R.phd.1 t/data/consed_project/phd_dir/ML4924F.phd.1 t/data/consed_project/phd_dir/ML4924R.phd.1 t/data/consed_project/phd_dir/ML4947F.phd.1 t/data/contig-by-hand.wublastp t/data/crab.dat.cn t/data/crab.nj t/data/crab.njb t/data/crypto.sim4-0 t/data/crypto.sim4-3 t/data/crypto.sim4-4 t/data/cys1_dicdi.water t/data/cysprot.fa t/data/cysprot.msf t/data/cysprot.needle t/data/cysprot.tblastn t/data/cysprot.water t/data/cysprot1.FASTA t/data/cysprot1.fa t/data/cysprot1a.fa t/data/cysprot1a.msf t/data/cysprot1b.fa t/data/cysprot1b.msf t/data/cysprot1b.newick t/data/cysprot_vs_gadfly.FASTA t/data/dbfa/1.fa t/data/dbfa/2.fa t/data/dbfa/3.fa t/data/dbfa/4.fa t/data/dbfa/5.fa t/data/dbfa/6.fa t/data/dbfa/7.fa t/data/dbfa/mixed_alphabet.fasta t/data/dbqual/1.qual t/data/dbqual/2.qual t/data/dbqual/3.qual t/data/dcr1_sp.WUBLASTP t/data/dmel_2Lchunk.gb t/data/dna2.fa 
t/data/dnaE-bsub-prot.fa t/data/dnaE-bsub.fa t/data/dnaEbsub_ecoli.wublastx t/data/dnaEbsub_ecoli.wutblastn t/data/dnaEbsub_ecoli.wutblastx t/data/dq519393.gb t/data/echofilter.wublastn t/data/ecoli-trna-qrna.out t/data/ecoli_domains.rpsblast t/data/ecolitst.bls t/data/ecolitst.noseqs.wublastp t/data/ecolitst.wublastp t/data/empty.bl2seq t/data/ex1.nucl.nhx t/data/example.vcf t/data/exonerate.output.dontwork t/data/exonerate.output.negativescore.works t/data/exonerate.output.works t/data/exonerate.whitespace_before_query.works t/data/expected.blast.out t/data/exsignalp.out t/data/fastq/RT98876.fastq t/data/fastq/bug2335.fastq t/data/fastq/error_diff_ids.fastq t/data/fastq/error_double_qual.fastq t/data/fastq/error_double_seq.fastq t/data/fastq/error_long_qual.fastq t/data/fastq/error_no_qual.fastq t/data/fastq/error_qual_del.fastq t/data/fastq/error_qual_escape.fastq t/data/fastq/error_qual_null.fastq t/data/fastq/error_qual_space.fastq t/data/fastq/error_qual_tab.fastq t/data/fastq/error_qual_unit_sep.fastq t/data/fastq/error_qual_vtab.fastq t/data/fastq/error_short_qual.fastq t/data/fastq/error_spaces.fastq t/data/fastq/error_tabs.fastq t/data/fastq/error_trunc_at_plus.fastq t/data/fastq/error_trunc_at_qual.fastq t/data/fastq/error_trunc_at_seq.fastq t/data/fastq/error_trunc_in_plus.fastq t/data/fastq/error_trunc_in_qual.fastq t/data/fastq/error_trunc_in_seq.fastq t/data/fastq/error_trunc_in_title.fastq t/data/fastq/evil_wrapping.fastq t/data/fastq/example.fasta t/data/fastq/example.fastq t/data/fastq/example.qual t/data/fastq/illumina_faked.fastq t/data/fastq/sanger_93.fastq t/data/fastq/sanger_faked.fastq t/data/fastq/solexa_example.fastq t/data/fastq/solexa_faked.fastq t/data/fastq/test1_sanger.fastq t/data/fastq/test2_solexa.fastq t/data/fastq/test3_illumina.fastq t/data/fastq/tricky.fastq t/data/fastq/wrapping_issues.fastq t/data/fastq/zero_qual.fastq t/data/fgenesh.out t/data/footprinter.out t/data/forward_primer.fa t/data/forward_reverse_primers.fa 
t/data/frac_problems.blast t/data/frac_problems2.blast t/data/frac_problems3.blast t/data/geneid_1.0.out t/data/genemark-fragment.out t/data/genemark.out t/data/genewise.out t/data/genewise_output.paracel_btk t/data/genomewise.out t/data/genomic-seq.epcr t/data/genomic-seq.fasta t/data/genomic-seq.genscan t/data/genomic-seq.mzef t/data/gf-s71.needle t/data/glimmer3-fragment.detail t/data/glimmer3-fragment.predict t/data/gmap_f9-multiple_results.txt t/data/gmap_f9-reverse-strand.txt t/data/gmap_f9.txt t/data/headerless.psl t/data/hg16_chroms.gff t/data/hmmpfam.out t/data/hs_est.est2genome t/data/hs_fugu.newick t/data/hs_owlmonkey.aln t/data/hs_owlmonkey.fas t/data/hs_owlmonkey.fasta t/data/hsinsulin.blastcl3.blastn t/data/humor.maf t/data/humts1.pal t/data/hybrid2.gff3 t/data/ids-with-spaces.phy t/data/in.fasta t/data/insulin.water t/data/interpro.xml t/data/interpro_relationship.xml t/data/interpro_sample.xml t/data/interpro_short.xml t/data/intrablock-comment.nex t/data/issue255_ssearch.fasta t/data/little.largemultifasta t/data/long-names.nex t/data/longnames.aln t/data/longnames.dnd t/data/lucy.info t/data/lucy.qual t/data/lucy.seq t/data/lucy.stderr t/data/lysozyme6.protml t/data/lysozyme6.simple.protml t/data/map_hem/HEM1-HEM12.fa t/data/map_hem/HEM1-HEM12.fa.revcom t/data/map_hem/HEM1-HEM12.meme.txt t/data/map_hem/HEM1-HEM13.fa t/data/map_hem/HEM1-HEM13.meme.txt t/data/map_hem/HEM1-HEM14.fa t/data/map_hem/HEM1-HEM14.meme.txt t/data/map_hem/HEM1-HEM15.fa t/data/map_hem/HEM1-HEM15.meme.txt t/data/map_hem/HEM1-HEM2.fa t/data/map_hem/HEM1-HEM2.fa.revcom t/data/map_hem/HEM1-HEM2.meme.txt t/data/map_hem/HEM1-HEM3.fa t/data/map_hem/HEM1-HEM3.meme.txt t/data/map_hem/HEM1-HEM4.fa t/data/map_hem/HEM1-HEM4.meme.txt t/data/map_hem/HEM1.ups.fa_ t/data/map_hem/HEM1.ups.fa_.revcom t/data/map_hem/HEM12-HEM13.fa t/data/map_hem/HEM12-HEM13.meme.txt t/data/map_hem/HEM12-HEM14.fa t/data/map_hem/HEM12-HEM14.meme.txt t/data/map_hem/HEM12-HEM15.fa 
t/data/map_hem/HEM12-HEM15.meme.txt t/data/map_hem/HEM12.ups.fa_ t/data/map_hem/HEM12.ups.fa_.revcom t/data/map_hem/HEM13-HEM14.fa t/data/map_hem/HEM13-HEM14.meme.txt t/data/map_hem/HEM13-HEM15.fa t/data/map_hem/HEM13-HEM15.meme.txt t/data/map_hem/HEM13.ups.fa_ t/data/map_hem/HEM13.ups.fa_.revcom t/data/map_hem/HEM14-HEM15.fa t/data/map_hem/HEM14-HEM15.meme.txt t/data/map_hem/HEM14.ups.fa_ t/data/map_hem/HEM14.ups.fa_.revcom t/data/map_hem/HEM15.ups.fa_ t/data/map_hem/HEM15.ups.fa_.revcom t/data/map_hem/HEM2-HEM12.fa t/data/map_hem/HEM2-HEM12.meme.txt t/data/map_hem/HEM2-HEM13.fa t/data/map_hem/HEM2-HEM13.meme.txt t/data/map_hem/HEM2-HEM14.fa t/data/map_hem/HEM2-HEM14.meme.txt t/data/map_hem/HEM2-HEM15.fa t/data/map_hem/HEM2-HEM15.meme.txt t/data/map_hem/HEM2-HEM3.fa t/data/map_hem/HEM2-HEM3.meme.txt t/data/map_hem/HEM2-HEM4.fa t/data/map_hem/HEM2-HEM4.meme.txt t/data/map_hem/HEM2.ups.fa_ t/data/map_hem/HEM2.ups.fa_.revcom t/data/map_hem/HEM3-HEM12.fa t/data/map_hem/HEM3-HEM12.meme.txt t/data/map_hem/HEM3-HEM13.fa t/data/map_hem/HEM3-HEM13.meme.txt t/data/map_hem/HEM3-HEM14.fa t/data/map_hem/HEM3-HEM14.meme.txt t/data/map_hem/HEM3-HEM15.fa t/data/map_hem/HEM3-HEM15.meme.txt t/data/map_hem/HEM3-HEM4.fa t/data/map_hem/HEM3-HEM4.meme.txt t/data/map_hem/HEM3.ups.fa_ t/data/map_hem/HEM3.ups.fa_.revcom t/data/map_hem/HEM4-HEM12.fa t/data/map_hem/HEM4-HEM12.meme.txt t/data/map_hem/HEM4-HEM13.fa t/data/map_hem/HEM4-HEM13.meme.txt t/data/map_hem/HEM4-HEM14.fa t/data/map_hem/HEM4-HEM14.meme.txt t/data/map_hem/HEM4-HEM15.fa t/data/map_hem/HEM4-HEM15.meme.txt t/data/map_hem/HEM4.ups.fa_ t/data/map_hem/HEM4.ups.fa_.revcom t/data/map_hem/yeast.nc.1.freq t/data/mast.dat t/data/masta.dat t/data/match.output t/data/mbsout/mbsout_infile1 t/data/mbsout/mbsout_infile2 t/data/mbsout/mbsout_infile3 t/data/megablast_output.paracel_btk t/data/meme.dat t/data/mini-AE001405.gb t/data/mini-align.aln t/data/mixedmast.dat t/data/mpath.ontology.test t/data/msout/bad_msout_infile1 
t/data/msout/bad_msout_infile2 t/data/msout/msout_infile1 t/data/msout/msout_infile2 t/data/msout/msout_infile3 t/data/msout/msout_infile4 t/data/multi.blast.m8 t/data/multi.blast.m9 t/data/multi.phd t/data/multi_1.fa t/data/multi_2.fa t/data/multi_blast.bls t/data/multifa.seq t/data/multifa.seq.qual t/data/multiline-intrablock-comment.nex t/data/multiresult_blastn+.bls t/data/multiseq.bls t/data/multiseq_tags.phd t/data/myco_sites.gff t/data/nei_gojobori_test.aln t/data/neighbor.dist t/data/new_blastn.txt t/data/nhx-bacteria.nhx t/data/no-genes.genscan t/data/no_FH.embl t/data/no_cds_example.gb t/data/no_hsps.blastp t/data/no_semicolon.newick t/data/noninterleaved.phy t/data/nucmatrix.txt t/data/pep-266.aln t/data/phi.out t/data/phipsi.out t/data/phylipdist-36.out t/data/phylipdist.out t/data/phyloxml_examples.xml t/data/pre_rel9.swiss t/data/primedseq.fa t/data/primer3_infile.txt t/data/primer3_outfile.txt t/data/primer3_output.txt t/data/prints.out t/data/promoterwise.out t/data/protpars.phy t/data/protpars_longid.phy t/data/ps_scan/out.PrositeScan t/data/pseudowise.out t/data/psi_xml.dat t/data/psiblastreport.out t/data/purine_v081.infernal t/data/puzzle.tre t/data/qrna-relloc.out t/data/qualfile.qual t/data/quoted-strings1.nex t/data/quoted-strings2.nex t/data/radical-whitespace.nex t/data/radical-whitespace_02.nex t/data/registry/bdb/seqdatabase.ini t/data/registry/flat/seqdatabase.ini t/data/regulation_test.obo t/data/rel9.swiss t/data/repeatmasker.fa.out t/data/revcomp_mrna.gb t/data/ribosome-slippage.gb t/data/roa1.dat t/data/roa1.gbxml t/data/roa1.genbank t/data/roa1.swiss t/data/roa1_v2.dat t/data/rpsblast.bls t/data/rpsblast_no_hits.bls t/data/sbay_c127.fas t/data/sbay_c545-yeast.BLASTZ.PSL t/data/seg.out t/data/semicolon.newick t/data/seqdatabase.ini t/data/seqfeaturedb/test.gff3 t/data/seqfile-no-desc.pir t/data/seqfile.pir t/data/seqs.fas t/data/seqxml.xml t/data/short.blx t/data/signalp.hmm.short t/data/signalp.hmm.summary 
t/data/signalp.negative.out t/data/signalp.nn.short t/data/signalp.nn.summary t/data/signalp.positive.out t/data/signalp.short t/data/signalp.summary t/data/sim4.for.for t/data/sim4.for.rev t/data/sim4.rev t/data/singlescore.gbk t/data/so.obo t/data/sofa.ontology t/data/sp_subset.obo t/data/spaced_fasta.fa t/data/spaces.nex t/data/sparsealn.needle t/data/spidey.noalignment t/data/spidey.test1 t/data/sprintf.rnamotif t/data/sv40_small.xml t/data/swiss.dat t/data/swisspfam.data t/data/tab1part.mif t/data/tab2part.mif t/data/tab3part.mif t/data/tandem_repeats_finder.dat t/data/tandem_repeats_finder.noresults t/data/tandem_repeats_finder_no_desc.dat t/data/targetp.out t/data/taxdump/names.dmp t/data/taxdump/nodes.dmp t/data/taxonomy/greengenes_taxonomy_16S_candiv_gg_2011_1.txt t/data/taxonomy/silva_SSURef_108_tax_silva_trunc.fasta t/data/tblastn.out 't/data/test 2.txt' t/data/test-1.tab t/data/test-1.tab.gb t/data/test-3.0-1.meme t/data/test-3.0-2.meme t/data/test-4.9.meme t/data/test.ace t/data/test.embl t/data/test.embl2sq t/data/test.fasta t/data/test.fastq t/data/test.game t/data/test.gcg t/data/test.gcgblast t/data/test.gcgfasta t/data/test.genbank t/data/test.genbank.noseq t/data/test.infernal t/data/test.locuslink t/data/test.metafasta t/data/test.nh t/data/test.nhx t/data/test.phd t/data/test.pir t/data/test.raw t/data/test.swiss t/data/test.tab t/data/test.tigrxml t/data/test.tseq t/data/test.tsv t/data/test.txt t/data/test.waba t/data/test1.blasttab3 t/data/test1.wublastp t/data/test2.infernal t/data/test2.raw t/data/test_badlf.gcg t/data/test_clear_range.fastq t/data/test_data.axt t/data/test_space.embl t/data/testaln.arp t/data/testaln.clustalw t/data/testaln.fasta t/data/testaln.fastq t/data/testaln.list t/data/testaln.mase t/data/testaln.mega t/data/testaln.metafasta t/data/testaln.msf t/data/testaln.nexus t/data/testaln.pfam t/data/testaln.phylip t/data/testaln.po t/data/testaln.prodom t/data/testaln.psi t/data/testaln.selex t/data/testaln.stockholm 
t/data/testaln.xmfa t/data/testaln2.arp t/data/testaln2.fasta t/data/testdat.exonerate t/data/testdata.crossmatch t/data/testdbaccnums.out t/data/testfile.erpin t/data/testfuzzy.genbank t/data/tmhmm.out t/data/tmp.fst t/data/tol-2010-02-18.nhx t/data/traits.tab t/data/traittree.nexus t/data/transfac.dat t/data/tree_nonewline.nexus t/data/tricky.wublast t/data/trna.strict.rnamotif t/data/urease.tre.nexus t/data/version2.scf t/data/version3.scf t/data/wellcome_tol.nhx t/data/worm_fam_2785.cdna t/data/yeast.tRNAscanSE
README.md100644000766000024 510114006427353 14474 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8
[![DOI](https://zenodo.org/badge/doi/10.5281/zenodo.16344.svg)](http://dx.doi.org/10.5281/zenodo.16344)
[![Build Status](https://travis-ci.org/bioperl/bioperl-live.svg?branch=master)](https://travis-ci.org/bioperl/bioperl-live)
[![Coverage Status](https://coveralls.io/repos/bioperl/bioperl-live/badge.svg?branch=master)](https://coveralls.io/r/bioperl/bioperl-live?branch=master)
[![Documentation Status](https://readthedocs.org/projects/bioperl/badge/?version=latest)](https://readthedocs.org/projects/bioperl/?badge=latest)

# About BioPerl

BioPerl is a project for the development of free and open source Perl tools for computational molecular biology. For example, it includes classes for biological sequences, readers of multiple formats, sequence alignments, database searching objects, and interfaces to multiple programs such as EMBOSS, ClustalW, and BLAST.

The BioPerl project has developed multiple module distributions for different purposes. The one named BioPerl (named after the project) provides the foundation for all other distributions.

This is the repository for the BioPerl distribution only. Other distributions have [their own repositories](https://github.com/bioperl/).

# Installation

The BioPerl distribution has the same name as the BioPerl project. However, the BioPerl distribution only includes a subset of the project modules.
Because of this, the meaning of "installing BioPerl" is rarely clear. Instead of "install BioPerl", the aim must be "install module X".

[CPAN.org](https://www.cpan.org/modules/INSTALL.html) provides an overview on how to install and manage Perl modules but the bottom line is:

1. find the module you need, for example `Bio::DB::EUtilities`
2. install it with `cpanm`, for example `cpanm Bio::DB::EUtilities`

Alternatively, some Linux distributions have packaged BioPerl and have it available through their package manager.

# Documentation and Support

Documentation for individual modules is in POD and can also be read online at [metacpan](https://metacpan.org/pod/BioPerl). Useful documentation in the form of example code can also be found in the `examples/` and `bin/` directories.

Additional resources and information about the project are available on the [project website](https://bioperl.org), with discussion happening on the [bioperl-l@bioperl.org](mailto:bioperl-l@bioperl.org) mailing list, and on the `#bioperl` channel of the freenode IRC server.

Bug reports are handled on the distribution's GitHub page.

# Development

See the [`HACKING.md`](HACKING.md) file for details on the project structure, such as building from source and running the test suite.
META.json100644000766000024 4635614006427353 14677 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8{ "abstract" : "Perl modules for biology", "author" : [ "See individual modules" ], "dynamic_config" : 1, "generated_by" : "Dist::Zilla version 6.017, CPAN::Meta::Converter version 2.150010", "license" : [ "perl_5" ], "meta-spec" : { "url" : "http://search.cpan.org/perldoc?CPAN::Meta::Spec", "version" : 2 }, "name" : "BioPerl", "prereqs" : { "configure" : { "requires" : { "ExtUtils::MakeMaker" : "0" } }, "develop" : { "requires" : { "Test::Mojibake" : "0", "Test::Pod" : "1.41" } }, "runtime" : { "requires" : { "AnyDBM_File" : "0", "Carp" : "0", "Cwd" : "0", "DB_File" : "0", "Data::Dumper" : "0", "Data::Stag" : "0", "Digest::MD5" : "0", "Dumpvalue" : "0", "Error" : "0", "Exporter" : "0", "Fcntl" : "0", "File::Basename" : "0", "File::Copy" : "0", "File::Path" : "0", "File::Spec" : "0", "File::Spec::Functions" : "0", "File::Temp" : "0", "FileHandle" : "0", "Getopt::Long" : "0", "Graph::Directed" : "0", "HTTP::Request::Common" : "0", "HTTP::Response" : "0", "IO::File" : "0", "IO::Handle" : "0", "IO::Pipe" : "0", "IO::Scalar" : "0", "IO::Socket" : "0", "IO::String" : "0", "IPC::Run" : "0", "LWP::UserAgent" : "0", "List::MoreUtils" : "0", "List::Util" : "0", "Math::BigFloat" : "0", "Module::Build" : "0", "POSIX" : "0", "Pod::Usage" : "0", "Scalar::Util" : "0", "Set::Scalar" : "0", "Storable" : "0", "Symbol" : "0", "Test::Builder" : "0", "Test::Builder::Module" : "0", "Test::Most" : "0", "Test::RequiresInternet" : "0", "Text::Balanced" : "0", "Text::Wrap" : "0", "Tie::Handle" : "0", "Tie::RefHash" : "0", "Time::HiRes" : "0", "UNIVERSAL" : "0", "URI" : "0", "XML::DOM" : "0", "XML::LibXML" : "0", "XML::LibXML::Reader" : "0", "XML::Parser::PerlSAX" : "0", "XML::SAX" : "0", "XML::SAX::Base" : "0", "XML::SAX::Writer" : "0", "XML::Twig" : "0", "XML::Writer" : "0", "YAML" : "0", "base" : "0", "constant" : "0", "integer" : "0", "overload" : "0", "parent" : "0", "strict" : "0", "utf8" 
: "0", "vars" : "0", "version" : "0", "warnings" : "0" }, "suggests" : { "Bio::DB::Ace" : "0", "Bio::DB::EMBL" : "0", "Bio::DB::GenBank" : "0", "Bio::DB::GenPept" : "0", "Bio::DB::SwissProt" : "0" } }, "test" : { "requires" : { "File::Spec" : "0", "FindBin" : "0", "IO::Handle" : "0", "IPC::Open3" : "0", "Test::Memory::Cycle" : "0", "Test::More" : "0", "Test::Weaken" : "0", "lib" : "0", "perl" : "5.006" } } }, "release_status" : "stable", "resources" : { "bugtracker" : { "mailto" : "bioperl-l@bioperl.org", "web" : "https://github.com/bioperl/bioperl-live/issues" }, "homepage" : "https://metacpan.org/release/BioPerl", "repository" : { "type" : "git", "url" : "git://github.com/bioperl/bioperl-live.git", "web" : "https://github.com/bioperl/bioperl-live" } }, "version" : "1.7.8", "x_Dist_Zilla" : { "perl" : { "version" : "5.033001" }, "plugins" : [ { "class" : "Dist::Zilla::Plugin::GatherDir", "config" : { "Dist::Zilla::Plugin::GatherDir" : { "exclude_filename" : [], "exclude_match" : [], "follow_symlinks" : 0, "include_dotfiles" : 0, "prefix" : "", "prune_directory" : [], "root" : "." 
} }, "name" : "@BioPerl/@Filter/GatherDir", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::PruneCruft", "name" : "@BioPerl/@Filter/PruneCruft", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::ManifestSkip", "name" : "@BioPerl/@Filter/ManifestSkip", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::MetaYAML", "name" : "@BioPerl/@Filter/MetaYAML", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::License", "name" : "@BioPerl/@Filter/License", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::ExtraTests", "name" : "@BioPerl/@Filter/ExtraTests", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::ExecDir", "name" : "@BioPerl/@Filter/ExecDir", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::ShareDir", "name" : "@BioPerl/@Filter/ShareDir", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::MakeMaker", "config" : { "Dist::Zilla::Role::TestRunner" : { "default_jobs" : 1 } }, "name" : "@BioPerl/@Filter/MakeMaker", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::Manifest", "name" : "@BioPerl/@Filter/Manifest", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::TestRelease", "name" : "@BioPerl/@Filter/TestRelease", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::ConfirmRelease", "name" : "@BioPerl/@Filter/ConfirmRelease", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::UploadToCPAN", "name" : "@BioPerl/@Filter/UploadToCPAN", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::MetaConfig", "name" : "@BioPerl/MetaConfig", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::MetaJSON", "name" : "@BioPerl/MetaJSON", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::PkgVersion", "name" : "@BioPerl/PkgVersion", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::PodSyntaxTests", "name" : "@BioPerl/PodSyntaxTests", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::Test::Compile", "config" : { "Dist::Zilla::Plugin::Test::Compile" : { "bail_out_on_fail" : 0, 
"fail_on_warning" : "author", "fake_home" : 0, "filename" : "t/00-compile.t", "module_finder" : [ ":InstallModules" ], "needs_display" : 0, "phase" : "test", "script_finder" : [ ":PerlExecFiles" ], "skips" : [], "switch" : [] } }, "name" : "@BioPerl/Test::Compile", "version" : "2.058" }, { "class" : "Dist::Zilla::Plugin::MojibakeTests", "name" : "@BioPerl/MojibakeTests", "version" : "0.8" }, { "class" : "Dist::Zilla::Plugin::AutoPrereqs", "name" : "@BioPerl/AutoPrereqs", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::MetaResources", "name" : "@BioPerl/MetaResources", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::Encoding", "name" : "@BioPerl/Encoding", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::NextRelease", "name" : "@BioPerl/NextRelease", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::Git::Check", "config" : { "Dist::Zilla::Plugin::Git::Check" : { "untracked_files" : "die" }, "Dist::Zilla::Role::Git::DirtyFiles" : { "allow_dirty" : [ "Changes", "dist.ini" ], "allow_dirty_match" : [], "changelog" : "Changes" }, "Dist::Zilla::Role::Git::Repo" : { "git_version" : "2.28.0", "repo_root" : "." } }, "name" : "@BioPerl/Git::Check", "version" : "2.047" }, { "class" : "Dist::Zilla::Plugin::Git::Commit", "config" : { "Dist::Zilla::Plugin::Git::Commit" : { "add_files_in" : [], "commit_msg" : "v%V%n%n%c", "signoff" : 0 }, "Dist::Zilla::Role::Git::DirtyFiles" : { "allow_dirty" : [ "Changes", "dist.ini" ], "allow_dirty_match" : [], "changelog" : "Changes" }, "Dist::Zilla::Role::Git::Repo" : { "git_version" : "2.28.0", "repo_root" : "." 
}, "Dist::Zilla::Role::Git::StringFormatter" : { "time_zone" : "local" } }, "name" : "@BioPerl/Git::Commit", "version" : "2.047" }, { "class" : "Dist::Zilla::Plugin::Git::Tag", "config" : { "Dist::Zilla::Plugin::Git::Tag" : { "branch" : null, "changelog" : "Changes", "signed" : 0, "tag" : "BioPerl-v1.7.8", "tag_format" : "%N-v%v", "tag_message" : "%N-v%v" }, "Dist::Zilla::Role::Git::Repo" : { "git_version" : "2.28.0", "repo_root" : "." }, "Dist::Zilla::Role::Git::StringFormatter" : { "time_zone" : "local" } }, "name" : "@BioPerl/Git::Tag", "version" : "2.047" }, { "class" : "Dist::Zilla::Plugin::OSPrereqs", "config" : { "Dist::Zilla::Plugin::OSPrereqs" : { "os" : "MSWin32" } }, "name" : "MSWin32", "version" : "0.011" }, { "class" : "Dist::Zilla::Plugin::Prereqs", "config" : { "Dist::Zilla::Plugin::Prereqs" : { "phase" : "runtime", "type" : "suggests" } }, "name" : "bin_bp_fetch_RuntimeSuggests", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FileFinder::ByName", "name" : "PodWeaver-Ready", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::PodWeaver", "config" : { "Dist::Zilla::Plugin::PodWeaver" : { "config_plugins" : [ "@BioPerl" ], "finder" : [ "PodWeaver-Ready" ], "plugins" : [ { "class" : "Pod::Weaver::Plugin::EnsurePod5", "name" : "@CorePrep/EnsurePod5", "version" : "4.015" }, { "class" : "Pod::Weaver::Plugin::H1Nester", "name" : "@CorePrep/H1Nester", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Name", "name" : "@BioPerl/Name", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Version", "name" : "@BioPerl/Version", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Region", "name" : "@BioPerl/prelude", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Generic", "name" : "SYNOPSIS", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Generic", "name" : "DESCRIPTION", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Generic", "name" : "OVERVIEW", "version" : "4.015" }, { "class" : 
"Pod::Weaver::Section::Collect", "name" : "ATTRIBUTES", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Collect", "name" : "METHODS", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Collect", "name" : "FUNCTIONS", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Collect", "name" : "INTERNAL METHODS", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Leftovers", "name" : "@BioPerl/Leftovers", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::Region", "name" : "@BioPerl/postlude", "version" : "4.015" }, { "class" : "Pod::Weaver::Section::GenerateSection", "name" : "FEEDBACK", "version" : "1.06" }, { "class" : "Pod::Weaver::Section::GenerateSection", "name" : "Mailing lists", "version" : "1.06" }, { "class" : "Pod::Weaver::Section::GenerateSection", "name" : "Support", "version" : "1.06" }, { "class" : "Pod::Weaver::Section::GenerateSection", "name" : "Reporting bugs", "version" : "1.06" }, { "class" : "Pod::Weaver::Section::Legal::Complicated", "name" : "@BioPerl/Legal", "version" : "1.23" }, { "class" : "Pod::Weaver::Section::Contributors", "name" : "@BioPerl/Contributors", "version" : "0.009" }, { "class" : "Pod::Weaver::Plugin::SingleEncoding", "name" : "SingleEncoding", "version" : "4.015" }, { "class" : "Pod::Weaver::Plugin::Transformer", "name" : "@BioPerl/List", "version" : "4.015" }, { "class" : "Pod::Weaver::Plugin::EnsureUniqueSections", "name" : "EnsureUniqueSections", "version" : "0.163250" } ] } }, "name" : "PodWeaver", "version" : "4.008" }, { "class" : "Dist::Zilla::Plugin::MetaResources", "name" : "MetaResources", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::PruneFiles", "name" : "PruneFiles", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":InstallModules", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":IncModules", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":TestFiles", "version" : "6.017" }, 
{ "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":ExtraTestFiles", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":ExecFiles", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":PerlExecFiles", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":ShareFiles", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":MainModule", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":AllFiles", "version" : "6.017" }, { "class" : "Dist::Zilla::Plugin::FinderCode", "name" : ":NoFiles", "version" : "6.017" } ], "zilla" : { "class" : "Dist::Zilla::Dist::Builder", "config" : { "is_trial" : 0 }, "version" : "6.017" } }, "x_generated_by_perl" : "v5.33.1", "x_serialization_backend" : "Cpanel::JSON::XS version 4.25", "x_spdx_expression" : "Artistic-1.0-Perl OR GPL-1.0-or-later" } HACKING.md100644000766000024 1573614006427353 14642 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8# Working with development sources BioPerl uses [Dist::Zilla](http://dzil.org/) to author releases. You will also need the `Dist::Zilla::PluginBundle::BioPerl` installed as well as its dependencies. Then, you can run the following commands: dzil test dzil install # The Directory Structure The bioperl-live repository structure is organized as follows: * `lib/` - BioPerl modules * `examples/` - Scripts demonstrating the many uses of BioPerl * `scripts/` - Useful production-quality scripts with POD documentation * `t/` - Perl built-in tests, tests are divided into subdirectories based on the specific classes being tested * `t/data/` - Data files used for the tests, provides good example data * `travis_scripts/` - script to customize Travis ## `Bio::` namespace summary The BioPerl project is split over multiple Perl module distributions. The BioPerl distribution is the BioPerl core distribution, including a selection of modules and namespaces but not all. 
For example, the entire Bio::Biblio namespace is not included in the BioPerl distribution. Similarly, while many Bio::SearchIO modules are in the BioPerl distribution, there are also Bio::SearchIO modules in other distributions such as Bio-SearchIO-blastxml. This section describes most of the Bio:: namespaces developed by the BioPerl project, including those which are not part of the BioPerl distribution. For example, Bio::Biblio and Bio::Assembly are documented here but are not part of the BioPerl distribution. * `Bio::Seq` is for *Sequences* (protein and DNA). * `Bio::PrimarySeq` is a plain sequence (sequence data + identifiers) * `Bio::Seq` is a fancier `PrimarySeq`, in that it has annotation (via `Bio::Annotation::Collection`) and sequence features (via `Bio::SeqFeatureI` objects, attached via `Bio::FeatureHolderI`). * `Bio::Seq::RichSeq` is all of the above, plus it has slots for extra information specific to GenBank/EMBL/SwissProt files. * `Bio::Seq::LargeSeq` is for sequences which are too big to fit into memory. * `Bio::SeqIO` is for *reading and writing Sequences*. It is a front end module for separate driver modules supporting the different sequence formats. * `Bio::SeqFeature` represents start/stop/strand-based localised annotations (features) of sequences * `Bio::SeqFeature::Generic` is a basic catchall * `Bio::SeqFeature::Similarity` is a similarity sequence feature * `Bio::SeqFeature::FeaturePair` is a sequence feature which is pairwise, such as query/hit pairs * `Bio::SearchIO` is for reading and writing pairwise alignment reports, like BLAST or FASTA. * `Bio::Search` is where the alignment objects for `SearchIO` are defined * `Bio::Search::Result::GenericResult` is the result object (a blast query is a `Result` object) * `Bio::Search::Hit::GenericHit` is the `Hit` object (a query will have 0 to many hits in a database) * `Bio::Search::HSP::GenericHSP` is the High-scoring Segment Pair object defining the alignment(s) of the query and hit. 
* `Bio::SimpleAlign` is for multiple sequence alignments * `Bio::AlignIO` is for reading and writing multiple sequence alignment formats * `Bio::Assembly` provides the start of an infrastructure for assemblies and `Bio::Assembly::IO` *IO converters* for them * `Bio::DB` is the namespace for database query classes * `Bio::DB::GenBank/GenPept` are two modules which query NCBI Entrez for sequences. * `Bio::DB::SwissProt/EMBL` query various EMBL and SwissProt repositories for sequences. * `Bio::DB::GFF` is Lincoln Stein's fast, lightweight feature and sequence database which is the backend to his [GBrowse](http://www.gmod.org) system. * `Bio::DB::Flat` is a fast implementation of the OBDA flat-file indexing system (cross-language and cross-platform, supported by O|B|F projects; see http://obda.open-bio.org). * `Bio::DB::BioFetch/DBFetch` are for OBDA, Web (HTTP) access to remote databases. * `Bio::DB::InMemoryCache/FileCache` provide fast local caching of sequences from remote dbs to speed up your access. * `Bio::DB::Registry` is an interface to the OBDA specification for remote data sources. * `Bio::DB::Biblio` is for access to remote bibliographic databases. * `Bio::DB::EUtilities` is the initial set of modules used for generic queries using NCBI's eUtils. 
* `Bio::Annotation` is a collection of annotation objects (comments, DBlinks, References, and misc key/value pairs) * `Bio::Coordinate` is a system for mapping between different coordinate systems such as DNA to protein or between assemblies * `Bio::Index` is for locally indexed flatfiles with BerkeleyDB * `Bio::Tools` contains many *miscellaneous parsers and functions* for different bioinformatics needs such as: * Gene prediction parsers (Genscan, MZEF, Grail, Genemark) * Annotation format (GFF) * Enumerate codon tables and valid sequence symbols (CodonTable, IUPAC) * Phylogenetic program parsing (PAML, Molphy, Phylip) * `Bio::Map` represents genetic and physical maps * `Bio::Structure` parses and represents protein structure data * `Bio::TreeIO` is for reading and writing Tree formats * `Bio::Tree` is the namespace for all associated Tree classes * `Bio::Tree::Tree` is the basic tree object * `Bio::Tree::Node` are the nodes which make up the tree * `Bio::Tree::Statistics` is for computing statistics for a tree * `Bio::Tree::TreeFunctionsI` is where specific tree functions are implemented (like `is_monophyletic` and `lca`) * `Bio::Biblio` is where bibliographic data and database access objects are kept * `Bio::Variation` represents sequences with mutations and variations applied so one can compare and represent wild-type and mutation versions of a sequence. * `Bio::Root` contains basic objects for the internals of BioPerl # Releases BioPerl currently uses a [semantic versioning](https://semver.org/) scheme for version numbers. Basically, a version has three numbers in the form `MAJOR.MINOR.PATCH`, each of which is incremented for: 1. `MAJOR` --- incompatible API changes, 2. `MINOR` --- new functionality in a backwards-compatible manner, 3. `PATCH` --- backwards-compatible bug fixes. ## 1.7 releases Before the 1.7 release, the BioPerl project had a single distribution with all of the BioPerl modules. 
During the 1.7 release series, subsets of the modules were extracted into separate distributions. ## Pre 1.7 releases From version 1.0 until 1.6, even numbers (e.g. version 1.4) indicated stable releases. Stable releases were well tested and recommended for most uses. Odd numbers (e.g. version 1.3) were development releases which one would only use if interested in the latest features. The final number (e.g. in `1.2.1`) is the point or patch release. The higher the number, the more bug fixes have been incorporated. In theory you can upgrade from one point or patch release to the next with no changes to your own code (for production cases, obviously check things out carefully before you switch over). bin000755000766000024 014006427353 13630 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8bp_nrdb100644000766000024 655414006427353 15333 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl # Author Jason Stajich # # Make a non-redundant database based on sequence (not on ID!) # This script is still in progress but is intended to mimic what # Warren Gish's nrdb does # It requires that Digest::MD5 is installed (for now) =head1 NAME bp_nrdb.PLS - a script to emulate Warren Gish's nrdb, make a unique sequence database from a set of input databases =head1 SYNOPSIS Usage: bp_nrdb.PLS [options] file1 file2 file3 Alternative usage bp_nrdb.PLS -p [options] file1 id1 file2 id2 file3 id3 =head1 DESCRIPTION This script will create a unique database of sequences (quasi-nonredundant). 
The options are: -o filename - the filename the db is written to (STDOUT by default) -a filename - the filename to append the db to -l# - minimum required sequence length -i - do not check for duplicates -n# - max number of descriptions to report per seq -d# - delimiter to use between consecutive descriptions -p - use database id prefixes from command line =head1 AUTHOR Jason Stajich, jason-at-bioperl-dot-org =cut use strict; use warnings; use Bio::SeqIO; use Getopt::Long; use Digest::MD5 qw(md5_hex); my ($output,$append,$min_len, $no_duplicate_check,$desc_count, $delimiter, $expect_prefixes,$help); $delimiter = ';'; GetOptions( 'o|output:s' => \$output, 'a|append:s' => \$append, 'n:s' => \$desc_count, 'l:s' => \$min_len, 'd:s' => \$delimiter, 'p' => \$expect_prefixes, 'i' => \$no_duplicate_check, 'h' => \$help, ); die("must supply a positive integer for -n") if ( defined $desc_count && ( $desc_count !~ /^\d+$/ || $desc_count < 1) ); die("must supply a positive integer for -l") if ( defined $min_len && ( $min_len !~ /^\d+$/ || $min_len < 1) ); my @files; if( $help || ! @ARGV ) { exec('perldoc',$0); exit(0); } while( @ARGV ) { my ($file, $id) = (undef,''); if( $expect_prefixes ) { ($file,$id) = (shift @ARGV, shift @ARGV); if( ! 
$id ) { die("Must provide 'name id' pairing of dbfile and id"); } } else { $file = shift @ARGV; } push @files, [ $file,$id]; } my $out; if( $append ) { $out = Bio::SeqIO->new(-file => ">>$append"); } elsif( $output ) { $out = Bio::SeqIO->new(-file => ">$output"); } else { $out = Bio::SeqIO->new(); # use STDOUT } my %unique; my %seqcount; my $counter = 0; foreach my $pair ( @files ) { my ($file,$id) = @$pair; my $in = Bio::SeqIO->new(-file => $file); while( my $seq = $in->next_seq ) { next if defined $min_len && $seq->length < $min_len; if( $id ) { $seq->display_id("$id:".$seq->display_id); } my $s = lc($seq->seq()); my $md5sum = md5_hex($s); if( $no_duplicate_check ) { $md5sum = $counter++; } if( defined $unique{$md5sum} ) { $seqcount{$md5sum}++; next if defined $desc_count && $seqcount{$md5sum} > $desc_count; my $desc = $unique{$md5sum}->description; my $id2 = sprintf("%s %s:%s %s",$delimiter, $id,$seq->display_id,$seq->description); $unique{$md5sum}->desc($desc . $id2); } else { $unique{$md5sum} = $seq; } } } foreach my $seq ( values %unique ) { $out->write_seq($seq); } __END__ t000755000766000024 014006427353 13323 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8Species.t100644000766000024 715714006427353 15255 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t# -*-Perl-*- Test Harness script for Bioperl use strict; my $CYCLE; my $WEAKEN; BEGIN { use Bio::Root::Test; eval { require Test::Memory::Cycle; 1; }; $CYCLE = $@ ? 0 : 1; eval { require Test::Weaken; 1; }; $WEAKEN = $@ ? 
0 : 1; test_begin(-tests => 27); use_ok('Bio::Species'); use_ok('Bio::DB::Taxonomy'); } ok my $sps = Bio::Species->new(); $sps->classification(qw( sapiens Homo Hominidae Catarrhini Primates Eutheria Mammalia Vertebrata Chordata Metazoa Eukaryota)); is $sps->binomial, 'Homo sapiens'; ok $sps->sub_species('sapiensis'); is $sps->binomial, 'Homo sapiens'; is $sps->binomial('FULL'), 'Homo sapiens sapiensis'; is $sps->sub_species, 'sapiensis'; $sps->classification(qw( sapiens Homo Hominidae Catarrhini Primates Eutheria Mammalia Vertebrata Chordata Metazoa Eukaryota)); is $sps->binomial, 'Homo sapiens'; # test cmd line initialization ok my $species = Bio::Species->new( -classification => [ qw( sapiens Homo Hominidae Catarrhini Primates Eutheria Mammalia Vertebrata Chordata Metazoa Eukaryota) ], -common_name => 'human'); is $species->binomial, 'Homo sapiens'; is $species->species, 'sapiens'; is $species->genus, 'Homo'; # test -common_name parameter, bug 2549 is $species->common_name, 'human'; # A Bio::Species isa Bio::Taxon, so test some things from there briefly is $species->scientific_name, 'sapiens'; is $species->rank, 'species'; # We can make a species object from just an id and a db handle SKIP: { test_skip(-tests => 5, -requires_module => 'Bio::DB::Taxonomy::entrez', -requires_networking => 1); $species = Bio::Species->new(-id => 51351); my $taxdb = Bio::DB::Taxonomy->new(-source => 'entrez'); eval {$species->db_handle($taxdb);}; skip "Unable to connect to entrez database; no network or server busy?", 5 if $@; is $species->binomial, 'Brassica rapa subsp.'; is $species->binomial('FULL'), 'Brassica rapa subsp. pekinensis'; is $species->genus, 'Brassica'; is $species->species, 'rapa subsp.'; is $species->sub_species, 'pekinensis'; } SKIP: { skip("Test::Memory::Cycle not installed, skipping", 3) if !$CYCLE; # this sub leaks, should return true my ($a, $b); $a = \$b; $b = \$a; Test::Memory::Cycle::memory_cycle_exists($a); # this sub shouldn't leak (no circ. 
refs) $species = Bio::Species->new( -classification => [ qw( sapiens Homo Hominidae Catarrhini Primates Eutheria Mammalia Vertebrata Chordata Metazoa Eukaryota) ], -common_name => 'human'); Test::Memory::Cycle::memory_cycle_exists($species); # Github issue #81 Test::Memory::Cycle::memory_cycle_exists(Bio::Species->new(-classification => ['A'])); } SKIP: { skip("Test::Weaken not installed, skipping", 3) if !$WEAKEN; # this sub leaks, should return true ok(Test::Weaken::leaks({ constructor => sub { my ($a, $b); $a = \$b; $b = \$a} })); # this sub shouldn't leak (no circ. refs) ok(!Test::Weaken::leaks({ constructor => sub{ Bio::Species->new( -classification => [ qw( sapiens Homo Hominidae Catarrhini Primates Eutheria Mammalia Vertebrata Chordata Metazoa Eukaryota) ], -common_name => 'human') }, } )); # Github issue #81 ok(!Test::Weaken::leaks({ constructor => sub{ Bio::Species->new( -classification => ['A']) }, } )); } Seq000755000766000024 014006427353 14053 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tSeq.t100644000766000024 1715514006427353 15161 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 76); use_ok('Bio::Seq'); use_ok('Bio::Seq::RichSeq'); use_ok('Bio::SeqFeature::Generic'); use_ok('Bio::Species'); use_ok('Bio::Annotation::SimpleValue'); } ok my $seq = Bio::Seq->new(-seq=>'ACTGTGGCGTCAACT', -desc=>'Sample Bio::Seq object', -alphabet => 'dna', -is_circular => 1 ); isa_ok($seq,"Bio::AnnotatableI"); ok $seq->is_circular; ok not $seq->is_circular(0); ok not $seq->is_circular; my $trunc = $seq->trunc(1,4); is $trunc->length, 4, 'truncated sequence length'; is $trunc->seq, 'ACTG', 'truncated sequence string'; # test ability to get str function is $seq->seq(), 'ACTGTGGCGTCAACT' ; ok $seq = Bio::Seq->new(-seq=>'actgtggcgtcaact', -desc=>'Sample Bio::Seq object', -display_id => 'something', -accession_number => 'accnum', -alphabet => 'dna' ); is uc 
$seq->alphabet, 'DNA' , 'alphabet'; # basic methods is $seq->id(), 'something', "id"; is $seq->accession_number, 'accnum', "accession number"; is $seq->subseq(5, 9), 'tggcg', "subseq"; # check IdentifiableI and DescribableI interfaces isa_ok $seq, 'Bio::IdentifiableI'; isa_ok $seq, 'Bio::DescribableI'; # make sure all methods are implemented is $seq->authority("bioperl.org"), "bioperl.org"; is $seq->namespace("t"), "t"; is $seq->version(0), 0; is $seq->lsid_string(), "bioperl.org:t:accnum"; is $seq->namespace_string(), "t:accnum.0"; is $seq->description(), 'Sample Bio::Seq object'; is $seq->display_name(), "something"; # check that feature accession works regardless of lazy things going on is scalar($seq->top_SeqFeatures()), 0; is scalar($seq->flush_SeqFeatures()), 0; my $newfeat = Bio::SeqFeature::Generic->new( -start => 10, -end => 12, -primary => 'silly', -source => 'stuff'); $seq->add_SeqFeature($newfeat); is $seq->feature_count, 1; my $species = Bio::Species->new (-verbose => 1, -classification => [ qw( sapiens Homo Hominidae Catarrhini Primates Eutheria Mammalia Vertebrata Chordata Metazoa Eukaryota )]); $seq->species($species); is $seq->species->binomial, 'Homo sapiens'; $seq->annotation->add_Annotation('description', Bio::Annotation::SimpleValue->new(-value => 'desc-here')); my ($descr) = $seq->annotation->get_Annotations('description'); is $descr->value(), 'desc-here'; is $descr->tagname(), 'description'; # # translation tests # my $trans = $seq->translate(); is $trans->seq(), 'TVAST' , 'translated sequence'; # unambiguous two character codons like 'ACN' and 'GTN' should give out an amino # acid ...with the addendum that there should be no assumption by the method # to complete the codon unless specified, using the -complete_codons flag. 
$seq->seq('ACTGTGGCGTCAACN'); $trans = $seq->translate(); is $trans->seq(), 'TVAST', 'translated sequence with explicit unambiguous codons'; $seq->seq('ACTGTGGCGTCAAC'); $trans = $seq->translate(); is $trans->seq(), 'TVAS', 'translated sequence with unknown unambiguous codons'; $seq->seq('ACTGTGGCGTCAAC'); $trans = $seq->translate(-complete_codons => 1); is $trans->seq(), 'TVAST', 'translated sequence with unknown unambiguous codons, completed'; $seq->seq('ACTGTGGCGTCAACA'); $trans = $seq->translate(); is $trans->seq(), 'TVAST', 'translated sequence with unambiguous codons'; $seq->seq('ACTGTGGCGTCAACAG'); $trans = $seq->translate(); is $trans->seq(), 'TVAST', 'translated sequence with unambiguous codons'; $seq->seq('ACTGTGGCGTCAACAGT'); $trans = $seq->translate(-complete_codons => 1); is $trans->seq(), 'TVASTV', 'translated sequence with unknown unambiguous codons, completed'; $seq->seq('ACTGTGGCGTCAACAGTA'); $trans = $seq->translate(); is $trans->seq(), 'TVASTV', 'translated sequence with unambiguous codons'; $seq->seq('AC'); is $seq->translate(-complete_codons => 1)->seq , 'T', 'translated sequence with unknown unambiguous codons, completed'; #difference between the default and full CDS translation $seq->seq('atgtggtaa'); $trans = $seq->translate(); is $trans->seq(), 'MW*' , 'translated sequence with stop'; $seq->seq('atgtggtaa'); $trans = $seq->translate(undef,undef,undef,undef,1); is $trans->seq(), 'MW', 'translated sequence'; #frame my $string; my @frames = (0, 1, 2); foreach my $frame (@frames) { $string .= $seq->translate(undef, undef, $frame)->seq; $string .= $seq->revcom->translate(undef, undef, $frame)->seq; } is $string, 'MW*LPHCGYHVVTT'; #Translating with all codon tables using method defaults $string = ''; my @codontables = qw(0 1 2 3 4 5 6 9 10 11 12 13 14 16 21 22 23 24 25 26 27 28 29 30 31); foreach my $ct (@codontables) { $string .= $seq->translate(undef, undef, undef, $ct)->seq; } is $string, 
'MW*MW*MW*MW*MW*MW*MWQMW*MW*MW*MW*MW*MWYMW*MW*MW*MW*MW*MW*MW*MWQMWQMWYMWEMWE'; # CDS translation set to throw an exception for internal stop codons $seq->seq('atgtggtaataa'); eval { $seq->translate(undef, undef, undef, undef, 'CDS' , 'throw'); }; like ($@, qr/EX/); $seq->seq('atgtggtaataa'); is( $seq->translate('J', '-',)->seq, 'MWJJ'); # tests for RichSeq ok my $richseq = Bio::Seq::RichSeq->new( -seq => 'atgtggtaataa', -accession_number => 'AC123', -alphabet => 'rna', -molecule => 'mRNA', -id => 'id1', -dates => [ '2001/1/1' ], -pid => '887821', -keywords => 'JUNK1;JUNK2', -division => 'Fungi', -secondary_accessions => 'AC1152' ); is ($richseq->seq, 'atgtggtaataa'); is ($richseq->display_id, 'id1'); is (($richseq->get_dates)[0], '2001/1/1'); is (($richseq->get_secondary_accessions)[0], 'AC1152'); is ($richseq->accession_number, 'AC123'); is ($richseq->alphabet, 'rna'); is ($richseq->molecule, 'mRNA'); is ($richseq->pid, 887821); is ($richseq->division, 'Fungi'); is ($richseq->keywords, 'JUNK1; JUNK2'); $richseq->seq_version('2'); is ($richseq->seq_version, 2); # Test adding a feature to a RichSeq type, then # trunc() and see if the feature vanishes (we shouldn't # be using clone() for RichSeq types) $richseq->add_SeqFeature($newfeat); is $richseq->feature_count, 1; my $newrichseq = $richseq->trunc(1,5); is $newrichseq->feature_count, 0, "Don't use clone for trunc of Bio::Seq::RichSeq"; is $newrichseq->length, 5; # tests for subtle misbehaviors $seq = Bio::Seq->new(-primary_id => 'blah', -accession_number => 'foo'); is ($seq->accession_number, $seq->primary_seq->accession_number); is ($seq->primary_id, $seq->primary_seq->primary_id); $seq->accession_number('blurb'); $seq->primary_id('bar'); is ($seq->accession_number, $seq->primary_seq->accession_number); is ($seq->primary_id, $seq->primary_seq->primary_id); # Bug #2864: $seq = Bio::Seq->new(-display_id => 0, -seq => 'GATC'); is $seq->display_id, 0, "Bug #2864"; # transcribe/rev_transcribe $seq = Bio::Seq->new( -id 
=> 'seq1', -alphabet=>'dna', -seq=> 'attTcgcatgT' ); ok my $xseq = $seq->transcribe; is $xseq->alphabet, 'rna'; ok !($xseq->seq =~ /[tT]/); is $xseq->seq, 'auuUcgcaugU'; ok !$xseq->transcribe; ok $seq = $xseq->rev_transcribe; is $seq->seq, 'attTcgcatgT'; is $seq->alphabet, 'dna'; Root000755000766000024 014006427353 14246 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tIO.t100644000766000024 3313114006427353 15123 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Root# -*-Perl-*- Test Harness script for Bioperl use strict; use warnings; BEGIN { use Bio::Root::Test; test_begin(-tests => 154); use_ok 'Bio::Root::IO'; } ok my $obj = Bio::Root::IO->new(); isa_ok $obj, 'Bio::Root::IO'; ############################################# # tests for exceptions/debugging/verbosity ############################################# throws_ok { $obj->throw('Testing throw') } qr/Testing throw/, 'Throw'; $obj->verbose(-1); throws_ok { $obj->throw('Testing throw') } qr/Testing throw/; eval { $obj->warn('Testing warn') }; ok !$@, 'Warn'; $obj->verbose(1); throws_ok { $obj->throw('Testing throw') } qr/Testing throw/; ok my @stack = $obj->stack_trace(), 'Stack trace'; is scalar @stack, 2; ok my $verbobj = Bio::Root::IO->new( -verbose => 1, -strict => 1 ), 'Verbosity'; is $verbobj->verbose(), 1; ok $obj->verbose(-1); ############################################# # tests for finding executables ############################################# ok my $io = Bio::Root::IO->new(); # An executable file my $out_file = 'test_file.txt'; my $out_fh; open $out_fh, '>', $out_file or die "Could not write file '$out_file': $!\n"; print $out_fh 'test'; close $out_fh; # -X test file will fail in Windows regardless of chmod, # because it looks for the executable suffix (like ".exe") if ($^O =~ m/mswin/i) { # An executable file my $exec_file = 'test_exec.exe'; open my $exe_fh, '>', $exec_file or die "Could not write file '$exec_file': $!\n"; close $exe_fh; ok $obj->exists_exe($exec_file), 'executable file'; unlink 
$exec_file or die "Could not delete file '$exec_file': $!\n"; # A not executable file ok (! $obj->exists_exe($out_file), 'non-executable file'); unlink $out_file or die "Could not delete file '$out_file': $!\n"; } else { # An executable file chmod 0777, $out_file or die "Could not change permission of file '$out_file': $!\n"; ok $obj->exists_exe($out_file), 'executable file'; # A not executable file chmod 0444, $out_file or die "Could not change permission of file '$out_file': $!\n"; ok (! $obj->exists_exe($out_file), 'non-executable file'); unlink $out_file or die "Could not delete file '$out_file': $!\n"; } # An executable dir my $out_dir = 'test_dir'; mkdir $out_dir or die "Could not write dir '$out_dir': $!\n"; chmod 0777, $out_dir or die "Could not change permission of dir '$out_dir': $!\n"; ok (! $obj->exists_exe($out_dir), 'executable dir'); rmdir $out_dir or die "Could not delete dir '$out_dir': $!\n"; ############################################# # tests for handle read and write abilities ############################################# # Test catfile ok my $in_file = Bio::Root::IO->catfile(qw(t data test.waba)); is $in_file, test_input_file('test.waba'); ok my $in_file_2 = Bio::Root::IO->catfile(qw(t data test.txt)); $out_file = test_output_file(); # Test with files ok my $rio = Bio::Root::IO->new( -input => $in_file ), 'Read from file'; is $rio->file, $in_file; is_deeply [$rio->cleanfile], [undef, $in_file]; is $rio->mode, 'r'; ok $rio->close; ok $rio = Bio::Root::IO->new( -file => '<'.$in_file ); is $rio->file, '<'.$in_file; is_deeply [$rio->cleanfile], ['<', $in_file]; 1 while $rio->_readline; # read entire file content is $rio->mode, 'r'; ok $rio->close; ok my $wio = Bio::Root::IO->new( -file => ">$out_file" ), 'Write to file'; is $wio->file, ">$out_file"; is_deeply [$wio->cleanfile], ['>', $out_file]; is $wio->mode, 'w'; ok $wio->close; ok $rio = Bio::Root::IO->new( -file => "+>$out_file" ), 'Read+write to file'; is $rio->file, "+>$out_file"; is_deeply 
[$rio->cleanfile], ['+>', $out_file]; is $rio->mode, 'rw'; ok $rio->close; # Test with handles my $in_fh; open $in_fh , '<', $in_file or die "Could not read file '$in_file': $!\n", 'Read from GLOB handle'; ok $rio = Bio::Root::IO->new( -fh => $in_fh ); is $rio->_fh, $in_fh; is $rio->mode, 'r'; close $in_fh; open $out_fh, '>', $out_file or die "Could not write file '$out_file': $!\n", 'Write to GLOB handle'; ok $wio = Bio::Root::IO->new( -fh => $out_fh ); is $wio->_fh, $out_fh; is $wio->mode, 'w'; close $out_fh; SKIP: { eval { require File::Temp; } or skip 'could not create File::Temp object, maybe your File::Temp is 10 years old', 4; $out_fh = File::Temp->new; ok $wio = Bio::Root::IO->new( -fh => $out_fh ), 'Read from File::Temp handle'; isa_ok $wio, 'Bio::Root::IO'; is $wio->mode, 'rw', 'is a write handle'; warnings_like sub { $wio->close }, '', 'no warnings in ->close()'; ok $wio->close; } # Exclusive arguments open $in_fh , '<', $in_file or die "Could not read file '$in_file': $!\n", 'Read from GLOB handle'; throws_ok {$rio = Bio::Root::IO->new( -input => $in_file, -fh => $in_fh )} qr/Providing both a file and a filehandle for reading/, 'Exclusive arguments'; throws_ok {$rio = Bio::Root::IO->new( -input => $in_file, -file => $in_file_2 )} qr/Input file given twice/; throws_ok {$rio = Bio::Root::IO->new( -input => $in_file, -string => 'abcedf' )} qr/File or filehandle provided with -string/; throws_ok {$rio = Bio::Root::IO->new( -fh => $in_fh , -file => $in_file )} qr/Providing both a file and a filehandle for reading/; throws_ok {$rio = Bio::Root::IO->new( -fh => $in_fh , -string => 'abcedf' )} qr/File or filehandle provided with -string/; throws_ok {$rio = Bio::Root::IO->new( -file => $in_file, -string => 'abcedf' )} qr/File or filehandle provided with -string/; close $in_fh; lives_ok {$rio = Bio::Root::IO->new( -input => $in_file, -file => $in_file )} 'Same file'; ############################################## # tests _pushback for multi-line buffering 
############################################## ok $rio = Bio::Root::IO->new( -file => $in_file ), 'Pushback'; ok my $line1 = $rio->_readline; ok my $line2 = $rio->_readline; ok $rio->_pushback($line2); ok $rio->_pushback($line1); ok my $line3 = $rio->_readline; ok my $line4 = $rio->_readline; ok my $line5 = $rio->_readline; is $line1, $line3; is $line2, $line4; isnt $line5, $line4; ok $rio->close; ############################################## # test _print and _insert ############################################## ok my $fio = Bio::Root::IO->new( -file => ">$out_file" ); ok $fio->_print("line 1\n"), '_print'; ok $fio->_print("line 2\n"); ok $fio->_insert("insertion at line 2\n",2), '_insert at middle of file'; ok $fio->_print("line 3\n"); ok $fio->_print("line 4\n"); ok $fio->close; open my $checkio, '<', $out_file or die "Could not read file '$out_file': $!\n"; my @content = <$checkio>; close $checkio; is_deeply \@content, ["line 1\n","insertion at line 2\n","line 2\n","line 3\n","line 4\n"]; ok $fio = Bio::Root::IO->new(-file=>">$out_file"); ok $fio->_insert("insertion at line 1\n",1), '_insert in empty file'; ok $fio->close; open $checkio, '<', $out_file or die "Could not read file '$out_file': $!\n"; @content = <$checkio>; close $checkio; is_deeply \@content, ["insertion at line 1\n"]; ############################################## # test Win vs UNIX line ending ############################################## { ok my $unix_rio = Bio::Root::IO->new(-file => test_input_file('U71225.gb.unix')); ok my $win_rio = Bio::Root::IO->new(-file => test_input_file('U71225.gb.win' )); ok my $mac_rio = Bio::Root::IO->new(-file => test_input_file('U71225.gb.mac' )); my $expected = "LOCUS U71225 1164 bp DNA linear VRT 27-NOV-2001\n"; is $unix_rio->_readline, $expected; is $win_rio->_readline , $expected; like $mac_rio->_readline, qr#^LOCUS.*//\n$#ms; # line spans entire file because lines end with "\r" but $/ is "\n" $expected = "DEFINITION Desmognathus quadramaculatus 12S 
ribosomal RNA gene, partial\n"; is $unix_rio->_readline, $expected; is $win_rio->_readline , $expected; is $mac_rio->_readline , undef; $expected = " sequence; tRNA-Val gene, complete sequence; and 16S ribosomal RNA\n"; is $unix_rio->_readline, $expected; is $win_rio->_readline , $expected; is $mac_rio->_readline , undef; $expected = " gene, partial sequence, mitochondrial genes for mitochondrial RNAs.\n"; is $unix_rio->_readline, $expected; is $win_rio->_readline , $expected; is $mac_rio->_readline , undef; $expected = "ACCESSION U71225\n"; is $unix_rio->_readline, $expected; is $win_rio->_readline , $expected; is $mac_rio->_readline , undef; # In Windows the "-raw" parameter has no effect, because Perl already discards # the '\r' from the line when reading in text mode from the filehandle # ($line = <$fh>), and put it back automatically when printing if ($^O =~ m/mswin/i) { is $win_rio->_readline( -raw => 1) , "VERSION U71225.1 GI:2804359\n"; } else { is $win_rio->_readline( -raw => 1) , "VERSION U71225.1 GI:2804359\r\n"; } is $win_rio->_readline( -raw => 0) , "KEYWORDS .\n"; } ############################################## # test Win vs UNIX line ending using PerlIO::eol ############################################## SKIP: { test_skip(-tests => 20, -requires_module => 'PerlIO::eol'); local $Bio::Root::IO::HAS_EOL = 1; ok my $unix_rio = Bio::Root::IO->new(-file => test_input_file('U71225.gb.unix')); ok my $win_rio = Bio::Root::IO->new(-file => test_input_file('U71225.gb.win' )); ok my $mac_rio = Bio::Root::IO->new(-file => test_input_file('U71225.gb.mac' )); my $expected = "LOCUS U71225 1164 bp DNA linear VRT 27-NOV-2001\n"; is $unix_rio->_readline, $expected; is $win_rio->_readline , $expected; is $mac_rio->_readline , $expected; $expected = "DEFINITION Desmognathus quadramaculatus 12S ribosomal RNA gene, partial\n"; is $unix_rio->_readline, $expected; TODO: { local $TODO = "Sporadic test failures when running using PerlIO::eol on Linux w/". 
"Windows line endings: #"; is $win_rio->_readline , $expected; }; is $mac_rio->_readline , $expected; $expected = " sequence; tRNA-Val gene, complete sequence; and 16S ribosomal RNA\n"; is $unix_rio->_readline, $expected; TODO: { local $TODO = "Sporadic test failures when running using PerlIO::eol on Linux w/". "Windows line endings: #"; is $win_rio->_readline , $expected; }; is $mac_rio->_readline , $expected; $expected = " gene, partial sequence, mitochondrial genes for mitochondrial RNAs.\n"; is $unix_rio->_readline, $expected; TODO: { local $TODO = "Sporadic test failures when running using PerlIO::eol on Linux w/". "Windows line endings: #"; is $win_rio->_readline , $expected; }; is $mac_rio->_readline , $expected; $expected = "ACCESSION U71225\n"; is $unix_rio->_readline, $expected; TODO: { local $TODO = "Sporadic test failures when running using PerlIO::eol on Linux w/". "Windows line endings: #"; is $win_rio->_readline , $expected; }; is $mac_rio->_readline , $expected; # $HAS_EOL ignores -raw is $win_rio->_readline( -raw => 1) , "VERSION U71225.1 GI:2804359\n"; is $win_rio->_readline( -raw => 0) , "KEYWORDS .\n"; } ############################################## # test Path::Class support ############################################## SKIP: { test_skip(-tests => 2, -requires_module => 'Path::Class'); my $f = sub { Bio::Root::IO->new( -file => Path::Class::file(test_input_file('U71225.gb.unix') ) ) }; lives_ok(sub { $f->() } , 'Bio::Root::IO->new can handle a Path::Class object'); isa_ok($f->(), 'Bio::Root::IO'); } ############################################## # test -string ############################################## my $teststring = "Foo\nBar\nBaz"; ok $rio = Bio::Root::IO->new(-string => $teststring), 'Read string'; is $rio->mode, 'r'; ok $line1 = $rio->_readline; is $line1, "Foo\n"; ok $line2 = $rio->_readline; is $line2, "Bar\n"; ok $rio->_pushback($line2); ok $line3 = $rio->_readline; is $line3, "Bar\n"; ok $line3 = $rio->_readline; is $line3, 
'Baz'; ############################################## # test tempfile() ############################################## { ok my $obj = Bio::Root::IO->new(-verbose => 0); isa_ok $obj, 'Bio::Root::IO'; my $TEST_STRING = "Bioperl rocks!\n"; my ($tfh,$tfile); eval { ($tfh, $tfile) = $obj->tempfile(); isa_ok $tfh, 'GLOB'; print $tfh $TEST_STRING; close $tfh; open my $IN, '<', $tfile or die "Could not read file '$tfile': $!\n"; my $val = join '', <$IN>; is $val, $TEST_STRING; close $IN; ok -e $tfile; undef $obj; }; undef $obj; if ( $@ ) { ok 0; } else { ok ! -e $tfile, 'auto UNLINK => 1'; } $obj = Bio::Root::IO->new(); eval { my $tdir = $obj->tempdir(CLEANUP=>1); ok -d $tdir; ($tfh, $tfile) = $obj->tempfile(dir => $tdir); close $tfh; ok -e $tfile; undef $obj; # see Bio::Root::IO::_io_cleanup }; if ( $@ ) { ok 0; } else { ok ! -e $tfile, 'tempfile deleted'; } eval { $obj = Bio::Root::IO->new(-verbose => 0); ($tfh, $tfile) = $obj->tempfile(UNLINK => 0); isa_ok $tfh, 'GLOB'; close $tfh; ok -e $tfile; undef $obj; # see Bio::Root::IO::_io_cleanup }; if ( $@ ) { ok 0; } else { ok -e $tfile, 'UNLINK => 0'; } ok unlink( $tfile) == 1 ; ok $obj = Bio::Root::IO->new; # check suffix is applied my ($fh1, $fn1) = $obj->tempfile(SUFFIX => '.bioperl'); isa_ok $fh1, 'GLOB'; like $fn1, qr/\.bioperl$/, 'tempfile suffix'; ok close $fh1; # check single return value mode of File::Temp my $fh2 = $obj->tempfile; isa_ok $fh2, 'GLOB'; ok $fh2, 'tempfile() in scalar context'; ok close $fh2; } data000755000766000024 014006427353 14234 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tMmCT100644000766000024 754414006427353 15131 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataCodon usage table Pan troglodytes [gbpri] 325 CDS's AmAcid Codon Number /1000 Fraction Pro CCC 2351.00 22.55 0.41 Pro CCT 1234.00 11.84 0.22 Pro CCA 1158.00 11.11 0.20 Pro CCG 923.00 8.85 0.16 Leu CTC 2553.00 24.49 0.24 Leu CTT 985.00 9.45 0.09 Leu CTA 802.00 7.69 0.08 Leu CTG 4997.00 47.93 0.48 His CAC 1745.00 16.74 0.63 His 
CAT 1041.00 9.99 0.37 Gln CAA 925.00 8.87 0.17 Gln CAG 4576.00 43.90 0.83 Arg CGC 1580.00 15.16 0.24 Arg CGT 397.00 3.81 0.06 Arg CGA 592.00 5.68 0.09 Arg CGG 1208.00 11.59 0.19 Ser TCC 2158.00 20.70 0.27 Ser TCT 1651.00 15.84 0.21 Ser TCA 923.00 8.85 0.12 Ser TCG 417.00 4.00 0.05 Phe TTC 2447.00 23.47 0.70 Phe TTT 1025.00 9.83 0.30 Leu TTA 423.00 4.06 0.04 Leu TTG 733.00 7.03 0.07 Tyr TAC 2585.00 24.80 0.69 Tyr TAT 1153.00 11.06 0.31 Ter TAA 68.00 0.65 0.17 Ter TAG 69.00 0.66 0.17 Cys TGC 1448.00 13.89 0.63 Cys TGT 851.00 8.16 0.37 Ter TGA 275.00 2.64 0.67 Trp TGG 2178.00 20.89 1.00 Thr ACC 3341.00 32.05 0.50 Thr ACT 1121.00 10.75 0.17 Thr ACA 1452.00 13.93 0.22 Thr ACG 821.00 7.88 0.12 Ile ATC 2742.00 26.30 0.62 Ile ATT 1065.00 10.22 0.24 Ile ATA 643.00 6.17 0.14 Met ATG 2152.00 20.64 1.00 Asn AAC 1941.00 18.62 0.67 Asn AAT 949.00 9.10 0.33 Lys AAA 1276.00 12.24 0.32 Lys AAG 2707.00 25.97 0.68 Ser AGC 1872.00 17.96 0.24 Ser AGT 828.00 7.94 0.11 Arg AGA 1250.00 11.99 0.19 Arg AGG 1467.00 14.07 0.23 Ala GCC 3337.00 32.01 0.39 Ala GCT 2211.00 21.21 0.26 Ala GCA 1374.00 13.18 0.16 Ala GCG 1671.00 16.03 0.19 Val GTC 1610.00 15.44 0.24 Val GTT 774.00 7.42 0.12 Val GTA 397.00 3.81 0.06 Val GTG 3873.00 37.15 0.58 Asp GAC 3294.00 31.60 0.70 Asp GAT 1427.00 13.69 0.30 Glu GAA 1326.00 12.72 0.22 Glu GAG 4666.00 44.76 0.78 Gly GGC 2639.00 25.31 0.37 Gly GGT 782.00 7.50 0.11 Gly GGA 1562.00 14.98 0.22 Gly GGG 2206.00 21.16 0.31 Coding GC 57.49% 1st letter GC 57.76% 2nd letter GC 45.35% 3rd letter GC 45.35% Genetic code 1 Makefile.PL100644000766000024 1356114006427353 15220 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8# This file was automatically generated by Dist::Zilla::Plugin::MakeMaker v6.017. 
use strict; use warnings; use 5.006; use ExtUtils::MakeMaker; my %WriteMakefileArgs = ( "ABSTRACT" => "Perl modules for biology", "AUTHOR" => "See individual modules", "CONFIGURE_REQUIRES" => { "ExtUtils::MakeMaker" => 0 }, "DISTNAME" => "BioPerl", "EXE_FILES" => [ "bin/bp_aacomp", "bin/bp_bioflat_index", "bin/bp_biogetseq", "bin/bp_dbsplit", "bin/bp_extract_feature_seq", "bin/bp_fastam9_to_table", "bin/bp_fetch", "bin/bp_filter_search", "bin/bp_find-blast-matches", "bin/bp_gccalc", "bin/bp_genbank2gff3", "bin/bp_index", "bin/bp_local_taxonomydb_query", "bin/bp_make_mrna_protein", "bin/bp_mask_by_search", "bin/bp_mrtrans", "bin/bp_mutate", "bin/bp_nexus2nh", "bin/bp_nrdb", "bin/bp_oligo_count", "bin/bp_process_gadfly", "bin/bp_process_sgd", "bin/bp_revtrans-motif", "bin/bp_search2alnblocks", "bin/bp_search2gff", "bin/bp_search2table", "bin/bp_search2tribe", "bin/bp_seq_length", "bin/bp_seqconvert", "bin/bp_seqcut", "bin/bp_seqpart", "bin/bp_seqret", "bin/bp_seqretsplit", "bin/bp_split_seq", "bin/bp_sreformat", "bin/bp_taxid4species", "bin/bp_taxonomy2tree", "bin/bp_translate_seq", "bin/bp_tree2pag", "bin/bp_unflatten_seq" ], "LICENSE" => "perl", "MIN_PERL_VERSION" => "5.006", "NAME" => "BioPerl", "PREREQ_PM" => { "AnyDBM_File" => 0, "Carp" => 0, "Cwd" => 0, "DB_File" => 0, "Data::Dumper" => 0, "Data::Stag" => 0, "Digest::MD5" => 0, "Dumpvalue" => 0, "Error" => 0, "Exporter" => 0, "Fcntl" => 0, "File::Basename" => 0, "File::Copy" => 0, "File::Path" => 0, "File::Spec" => 0, "File::Spec::Functions" => 0, "File::Temp" => 0, "FileHandle" => 0, "Getopt::Long" => 0, "Graph::Directed" => 0, "HTTP::Request::Common" => 0, "HTTP::Response" => 0, "IO::File" => 0, "IO::Handle" => 0, "IO::Pipe" => 0, "IO::Scalar" => 0, "IO::Socket" => 0, "IO::String" => 0, "IPC::Run" => 0, "LWP::UserAgent" => 0, "List::MoreUtils" => 0, "List::Util" => 0, "Math::BigFloat" => 0, "Module::Build" => 0, "POSIX" => 0, "Pod::Usage" => 0, "Scalar::Util" => 0, "Set::Scalar" => 0, "Storable" => 0, 
"Symbol" => 0, "Test::Builder" => 0, "Test::Builder::Module" => 0, "Test::Most" => 0, "Test::RequiresInternet" => 0, "Text::Balanced" => 0, "Text::Wrap" => 0, "Tie::Handle" => 0, "Tie::RefHash" => 0, "Time::HiRes" => 0, "UNIVERSAL" => 0, "URI" => 0, "XML::DOM" => 0, "XML::LibXML" => 0, "XML::LibXML::Reader" => 0, "XML::Parser::PerlSAX" => 0, "XML::SAX" => 0, "XML::SAX::Base" => 0, "XML::SAX::Writer" => 0, "XML::Twig" => 0, "XML::Writer" => 0, "YAML" => 0, "base" => 0, "constant" => 0, "integer" => 0, "overload" => 0, "parent" => 0, "strict" => 0, "utf8" => 0, "vars" => 0, "version" => 0, "warnings" => 0 }, "TEST_REQUIRES" => { "File::Spec" => 0, "FindBin" => 0, "IO::Handle" => 0, "IPC::Open3" => 0, "Test::Memory::Cycle" => 0, "Test::More" => 0, "Test::Weaken" => 0, "lib" => 0 }, "VERSION" => "1.7.8", "test" => { "TESTS" => "t/*.t t/Align/*.t t/AlignIO/*.t t/Annotation/*.t t/LocalDB/*.t t/LocalDB/Index/*.t t/LocalDB/Taxonomy/*.t t/Matrix/*.t t/Matrix/IO/*.t t/Ontology/*.t t/Ontology/IO/*.t t/RemoteDB/*.t t/Root/*.t t/SearchIO/*.t t/SearchIO/Writer/*.t t/Seq/*.t t/SeqFeature/*.t t/SeqIO/*.t t/SeqTools/*.t t/Tools/*.t t/Tools/Alignment/*.t t/Tools/EMBOSS/*.t t/Tools/Phylo/*.t t/Tools/Phylo/Phylip/*.t t/Tools/Signalp/*.t t/Tools/Spidey/*.t t/Tree/*.t t/Tree/TreeIO/*.t" } ); my %FallbackPrereqs = ( "AnyDBM_File" => 0, "Carp" => 0, "Cwd" => 0, "DB_File" => 0, "Data::Dumper" => 0, "Data::Stag" => 0, "Digest::MD5" => 0, "Dumpvalue" => 0, "Error" => 0, "Exporter" => 0, "Fcntl" => 0, "File::Basename" => 0, "File::Copy" => 0, "File::Path" => 0, "File::Spec" => 0, "File::Spec::Functions" => 0, "File::Temp" => 0, "FileHandle" => 0, "FindBin" => 0, "Getopt::Long" => 0, "Graph::Directed" => 0, "HTTP::Request::Common" => 0, "HTTP::Response" => 0, "IO::File" => 0, "IO::Handle" => 0, "IO::Pipe" => 0, "IO::Scalar" => 0, "IO::Socket" => 0, "IO::String" => 0, "IPC::Open3" => 0, "IPC::Run" => 0, "LWP::UserAgent" => 0, "List::MoreUtils" => 0, "List::Util" => 0, "Math::BigFloat" => 0, 
"Module::Build" => 0, "POSIX" => 0, "Pod::Usage" => 0, "Scalar::Util" => 0, "Set::Scalar" => 0, "Storable" => 0, "Symbol" => 0, "Test::Builder" => 0, "Test::Builder::Module" => 0, "Test::Memory::Cycle" => 0, "Test::More" => 0, "Test::Most" => 0, "Test::RequiresInternet" => 0, "Test::Weaken" => 0, "Text::Balanced" => 0, "Text::Wrap" => 0, "Tie::Handle" => 0, "Tie::RefHash" => 0, "Time::HiRes" => 0, "UNIVERSAL" => 0, "URI" => 0, "XML::DOM" => 0, "XML::LibXML" => 0, "XML::LibXML::Reader" => 0, "XML::Parser::PerlSAX" => 0, "XML::SAX" => 0, "XML::SAX::Base" => 0, "XML::SAX::Writer" => 0, "XML::Twig" => 0, "XML::Writer" => 0, "YAML" => 0, "base" => 0, "constant" => 0, "integer" => 0, "lib" => 0, "overload" => 0, "parent" => 0, "strict" => 0, "utf8" => 0, "vars" => 0, "version" => 0, "warnings" => 0 ); unless ( eval { ExtUtils::MakeMaker->VERSION(6.63_03) } ) { delete $WriteMakefileArgs{TEST_REQUIRES}; delete $WriteMakefileArgs{BUILD_REQUIRES}; $WriteMakefileArgs{PREREQ_PM} = \%FallbackPrereqs; } delete $WriteMakefileArgs{CONFIGURE_REQUIRES} unless eval { ExtUtils::MakeMaker->VERSION(6.52) }; if ( $^O eq 'MSWin32' ) { $WriteMakefileArgs{PREREQ_PM}{'Win32'} = $FallbackPrereqs{'Win32'} = '0'; } WriteMakefile(%WriteMakefileArgs); bp_index100644000766000024 1065014006427353 15525 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_index.pl - indexes files for use by bp_fetch.pl =head1 SYNOPSIS bp_index.pl index_name file1 file2 etc. =head1 DESCRIPTION bp_index.pl builds a bioperl index for the sequence files given in the argument list, under the index name. For example bp_index.pl nrdb /data/nrdb/nrdb.fasta would build an index called 'nrdb' as the index name for the file nrdb.fasta, and bp_index.pl -fmt EMBL swiss /data/swiss/*.dat would build an index called swiss for all the files in /data/swiss which end in .dat which are in EMBL format. 
The indexes are built using the Bio/Index/* modules, in particular, Bio::Index::EMBL and the Bio::Index::Fasta modules. Any script which uses these modules can use the index. A good example script is bp_fetch which fetches sequences and pipes them to STDOUT, for example bp_fetch swiss:ROA1_HUMAN gets the ROA1_HUMAN sequence from the swiss index and writes it as fasta format on STDOUT. =head1 OPTIONS -fmt - Fasta (default), swiss or EMBL -dir - directory where the index files are found (overrides BIOPERL_INDEX environment variable) Options for expert use -type - DBM_file type. (overrides BIOPERL_INDEX_TYPE environment variable) -v - report every index addition (debugging) =head1 ENVIRONMENT bp_index and bp_fetch coordinate where the databases lie using the environment variable BIOPERL_INDEX. This can be overridden using the -dir option. There is no default value, so you must use the -dir option or set BIOPERL_INDEX. The DB type is coordinated with BIOPERL_INDEX_TYPE which if it is not there, defaults to whatever the bioperl modules have installed, which itself defaults to SDBM_File. =head1 USING IT YOURSELF bp_index.pl is a script that drives the Index modules. If you want to use this script heavily in your work, if it is Perl based, it is almost certainly better to look at the code in this script and copy it across (probably you will be more likely to want to use the bp_fetch code). =head1 EXTENDING IT bp_index is just a wrapper around James Gilbert's excellent Index modules found in bioperl =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. 
Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Ewan Birney Ewan Birney birney@ebi.ac.uk =cut #' use strict; use warnings; # # Doofus catcher for people who are trying this script without # installing bioperl # use Bio::Index::Fasta; use Bio::Index::EMBL; use Bio::Index::Swissprot; use Bio::Index::GenBank; use Bio::Index::SwissPfam; my $dir = $ENV{'BIOPERL_INDEX'}; my $type = $ENV{'BIOPERL_INDEX_TYPE'}; my $fmt = 'Fasta'; my $verbose = 0; use Getopt::Long; &GetOptions("f|fmt=s" => \$fmt, "d|dir=s" => \$dir, "t|type=s" => \$type, "v!" => \$verbose); exec('perldoc',$0) unless @ARGV; my $name = shift; if( !$dir ) { print STDERR "\nNo directory specified for index\nDirectory must be specified by the environment variable BIOPERL_INDEX or -dir option\ngo bp_index with no arguments for more help\n\n"; exit(1); } # # Reset the type if needed # if( $type ) { $Bio::Index::Abstract::USE_DBM_TYPE = $type; } # # Rock and roll... # my $index; $_ = $fmt; SWITCH : { /Fasta/i && do { $index = Bio::Index::Fasta->new("$dir/$name", 'WRITE'); last; }; /EMBL/i && do { $index = Bio::Index::EMBL->new("$dir/$name", 'WRITE'); last; }; /swisspfam|pfam/i && do { $index = Bio::Index::SwissPfam->new("$dir/$name", 'WRITE'); last; }; /swiss/i && do { $index = Bio::Index::Swissprot->new("$dir/$name", 'WRITE'); last; }; /GenBank/i && do { $index = Bio::Index::GenBank->new("$dir/$name", 'WRITE'); last; }; die("No index format called $fmt"); } if( $verbose != 0 ) { $index->verbose(1); } $index->make_index(@ARGV); # finished. Neat eh.
# # if you are using this in a script, # to force deallocation + closing of the index, go # $index = undef; # bp_fetch100644000766000024 1617014006427353 15512 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_fetch.pl - fetches sequences from bioperl indexed databases =head1 SYNOPSIS bp_fetch.pl swiss:ROA1_HUMAN bp_fetch.pl net::genbank:JX295726 bp_fetch.pl net::genpept:ROA1_HUMAN bp_fetch.pl ace::myserver.somewhere.edu,21000:X56676 bp_fetch.pl -fmt GCG swiss:ROA1_HUMAN =head1 DESCRIPTION Fetches sequences using the DB access systems in Bioperl. The most common use of this is to bp_fetch sequences from bioperl indices built using bp_index.pl, or to fetch sequences from the NCBI website. The format for retrieving sequences is deliberately like the GCG/EMBOSS format, as in the following: db:name with the potential of putting in a 'meta' database type, being meta::db:name The meta information can be one of three types local - local indexed flat file database net - networked http: based database ace - ACeDB database This information defaults to 'local' for database names with no meta db information =head1 OPTIONS -fmt - Output format Fasta (default), EMBL, Raw, swiss or GCG -acc - string is an accession number, not an id. options only for expert use -dir - directory to find the index files (overrides BIOPERL_INDEX environment variable) -type - type of DBM file to open (overrides BIOPERL_INDEX_TYPE environment variable) =head1 ENVIRONMENT bp_index and bp_fetch coordinate where the databases lie using the environment variable BIOPERL_INDEX. This can be overridden using the -dir option. The index type (SDBM or DB_File or another index file) is controlled by the BIOPERL_INDEX_TYPE variable. This defaults to SDBM_File =head1 USING IT YOURSELF bp_fetch is a wrapper around the bioperl modules which support the Bio::DB::BioSeqI abstract interface.
These include: Author Code James Gilbert - Fasta indexer, Abstract indexer Aaron Mackey - GenBank and GenPept DB access Ewan Birney - EMBL .dat indexer Many people - SeqIO code These modules can be used directly, which is far better than using this script as a system call or a pipe to read from. Read the source code for bp_fetch to see how it is used. =head1 EXTENDING IT bp_fetch uses a number of different modules to provide access to databases. Any module which subscribes to the Bio::DB::BioSeqI interface can be used here. For flat file indexers, this is best done by extending Bio::Index::Abstract, as is done in Bio::Index::EMBL and Bio::Index::Fasta. For access to other databases you will need to roll your own interface. For new output formats, you need to add a new SeqIO module. The easiest thing is to look at Bio::SeqIO::fasta and figure out how to hack it for your own format (call it something different obviously). =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR Ewan Birney birney@ebi.ac.uk =cut use strict; use warnings; use Getopt::Long; use Bio::Index::EMBL; use Bio::Index::Fasta; use Bio::Index::GenBank; use Bio::Index::SwissPfam; use Bio::Index::Swissprot; use Bio::SeqIO; # # Start processing the command line # my $dir = $ENV{'BIOPERL_INDEX'}; my $type = $ENV{'BIOPERL_INDEX_TYPE'}; my $fmt = 'Fasta'; my $useacc = 0; my $ret = GetOptions('d|dir=s' => \$dir, 'f|fmt=s' => \$fmt , 't|type=s' => \$type , 'acc!'
=> \$useacc); # # print pod documentation if we have no arguments # exec('perldoc',$0) unless @ARGV; my ($isnet,$db,$dbobj,$id,$seq,$seqio,$out,$meta); # # Reset the type if needed # if( $type ) { $Bio::Index::Abstract::USE_DBM_TYPE = $type; } # # Build at run time the SeqIO output # if ( $fmt !~ /swisspfam|pfam/ ) { $out = Bio::SeqIO->new(-fh => \*STDOUT , -format => $fmt); } # # Main loop over remaining arguments # for my $arg ( @ARGV ) { $_= $arg; # strip out meta:: if there if ( /^(\w+)::/ ) { $meta = $1; s/^(\w+):://; } else { $meta = 'local'; } # parse to db:id /^(\S+)\:(\S+)$/ || do { warn "$_ is not parsed as db:name\n"; next; }; ($db,$id) = split/:/,$_,2; # # the eval block catches exceptions if they occur # in the code in the block. The exception goes in $@ # eval { SWITCH : { $_ = $meta; /^net$/ && do { if ( $db =~ /genbank/i ) { require Bio::DB::GenBank; $dbobj = Bio::DB::GenBank->new(-format => $fmt); } elsif ( $db =~ /genpept/i ) { require Bio::DB::GenPept; $dbobj = Bio::DB::GenPept->new(); } elsif ( $db =~ /embl/i ) { require Bio::DB::EMBL; $dbobj = Bio::DB::EMBL->new(); } else { die "Net database $db not available"; } last SWITCH; }; /^ace$/ && do { # yank in Bio::DB::Ace at runtime eval { require Bio::DB::Ace; }; if ( $@ ) { die "Unable to load Bio::DB::Ace for ace::$db\n\n$@\n"; } # db is server,port my ($server,$port); $db =~ /(\S+)\,(\d+)/ || die "$db is not server.name,port for acedb database"; $server = $1; $port = $2; # print STDERR "Connecting to $server,$port\n"; $dbobj = Bio::DB::Ace->new(-host => $server, -port => $port); last SWITCH; }; /^local$/ && do { if ( !$dir ) { die "\nNo directory specified for index\nDirectory must be specified by the environment variable BIOPERL_INDEX or --dir option\ngo bp_index with no arguments for more help\n\n"; } # # $db gets re-blessed to the correct index when # it is made from the abstract class. Cute eh? 
# $dbobj = Bio::Index::Abstract->new("$dir/$db"); last SWITCH; }; die "Meta database $meta is not valid"; } }; # end of eval to get db if ( $@ ) { warn("Database $db in $arg is not loadable. Skipping\n\nError $@"); next; } # # We expect the databases to adhere to the BioSeqI # the sequence index databases and the GenBank/GenPept do already # if ( $dbobj->isa("Bio::Index::SwissPfam") ) { my $seq = $dbobj->fetch($id); if ( $seq ) { my $started; while ( <$seq> ) { last if ( /^\s+$/ ); print; } } else { warn("Cannot find $id\n"); } next; } if ( ! $dbobj->isa('Bio::DB::RandomAccessI') ) { warn("$db in $arg does not inherit from Bio::DB::RandomAccessI, so is not expected to work under the DB guidelines. Going to try it anyway"); } eval { if ( $useacc == 0 ) { $seq = $dbobj->get_Seq_by_id($id); } else { $seq = $dbobj->get_Seq_by_acc($id); } }; if ( $@ ) { warn("Sequence $id in Database $db in $arg is not loadable. Skipping.\n\nError $@"); next; } elsif ( !defined $seq ) { warn("Sequence $id in Database $db is not present\n"); next; } $out->write_seq($seq); } bp_aacomp100644000766000024 531414006427353 15637 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl use strict; use warnings; use Carp; use Bio::SeqIO; use Getopt::Long; use Bio::SeqUtils; use Bio::Tools::IUPAC; my $table = new Bio::SeqUtils; my @BASES = $table->valid_aa(0); my %all = $table->valid_aa(2); my ($file,$format,$help) = ( undef, 'fasta'); GetOptions( 'i|in:s' => \$file, 'f|format:s' => \$format, 'h|help|?'
=> \$help, ); my $USAGE = "usage: aacomp [-f format] filename\n\tdefault format is fasta\n"; $file = shift unless $file; die $USAGE if $help; my $seqin; if( defined $file ) { print "Could not open file [$file]\n$USAGE" and exit unless -e $file; $seqin = new Bio::SeqIO(-format => $format, -file => $file); } else { $seqin = new Bio::SeqIO(-format => $format, -fh => \*STDIN); } my %composition; my $total; foreach my $base ( @BASES ) { $composition{$base} = 0; } while ( my $seq = $seqin->next_seq ) { if( $seq->alphabet ne 'protein' ) { carp("Must only provide amino acid sequences to aacomp...skipping this seq"); next; } foreach my $base ( split(//,$seq->seq()) ) { $composition{uc $base}++; $total++; } } printf("%d aa\n",$total); printf("%5s %4s\n", 'aa', '#' ); my $ct = 0; foreach my $base ( @BASES ) { printf(" %s %s %3d\n", $base, $all{$base}, $composition{$base} ); $ct += $composition{$base}; } printf( "%6s %s\n", '','-'x5); printf( "%6s %3d\n", '',$ct); __END__ =head1 NAME bp_aacomp - amino acid composition of protein sequences =head1 SYNOPSIS bp_aacomp [-f/--format FORMAT] [-h/--help] filename or bp_aacomp [-f/--format FORMAT] < filename or bp_aacomp [-f/--format FORMAT] -i filename =head1 DESCRIPTION This script prints out the count of amino acids over all protein sequences from the input file. =head1 OPTIONS The default sequence format is fasta. The sequence input can be provided using any of the three methods: =over 3 =item unnamed argument bp_aacomp filename =item named argument bp_aacomp -i filename =item standard input bp_aacomp < filename =back =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated.
bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Jason Stajich Email jason@bioperl.org =head1 HISTORY Based on aacomp.c from an old version of EMBOSS =cut bp_seqcut100644000766000024 723014006427353 15702 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_seqcut.pl - cut FASTA sequences with a given range =head1 USAGE bp_seqcut.pl [options -h,-s,-e,-f,-w] ... bp_seqcut.pl [options -h,-f,-w] s-e ... -h this help message -s which residue to start cutting on -e which residue to finish cutting on -f format of the files, defaults to FASTA but you can specify anything supported by SeqIO from BioPerl -w hard wrap width, this might not be supported depending on which format you are using =head1 Description A script to cut FASTA sequences with a given range `fastacut -s 1 -e 10 *.fasta` or `fastacut 1-10 *.fasta`. This is just a convenience wrapper around the Bio::SeqIO module. Useful if you wish to trim out a section of an alignment to build a profile of a specific region of sequence. =head1 AUTHOR B - I =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via email or the web: https://github.com/bioperl/bioperl-live/issues =head1 EDIT HISTORY 2010-11-22 - Matt Oates First features added. 
=head1 DEPENDENCY B Used to parse command line options. B Used for usage and help output. B Used to cut up sequences and parse FASTA. =cut use strict; use warnings; use Getopt::Long; #Deal with command line options use Pod::Usage; #Print a usage man page from the POD comments after __END__ use Bio::SeqIO; # Command Line Options my $help; #Same again but this time should we output the POD man page defined after __END__ my $format = "Fasta"; my $start; my $end; my $width = 72; #Default for Jalview output my $outfile = '/dev/stdout'; #Set command line flags and parameters. GetOptions("help|h!" => \$help, "start|s=s" => \$start, "format|f=s" => \$format, "end|e=s" => \$end, "width|w=s" => \$width, "outfile|o=s" => \$outfile, ) or die "Fatal Error: Problem parsing command-line ".$!; #Get other command line arguments that weren't optional flags. ($start,$end) = split (/-/, shift) unless ($start and $end); my @files = @ARGV; #Print out some help if it was asked for or if no arguments were given. 
pod2usage(-exitstatus => 0, -verbose => 2) if $help; pod2usage(-exitstatus => 0, -verbose => 1, -msg => 'Please specify the sequence files you wish to cut.') unless scalar @files; pod2usage(-exitstatus => 0, -verbose => 1, -msg => 'Please specify the region you wish to cut -s 1 -e 10 or 1-10.') unless defined $end; my $out = Bio::SeqIO->newFh(-file => ">$outfile", -format => $format) or die "Couldn't open selected output sequence file."; #Open and iterate over all sequence in all files foreach my $file (@files) { my $in = Bio::SeqIO->new(-file => $file, -format => $format); while ( my $seq = $in->next_seq() ) { #Alter the ID to be postfixed with '/s-e' $seq->display_id($seq->display_id."/$start-$end"); #Edit the sequence we have cut out my $sequence = $seq->subseq($start,$end); $sequence =~ s/([^\n]{0,$width})/$1\n/gi; chomp $sequence; $seq->seq($sequence); #Print the sequence back out print $out $seq; } } 1; __END__ bp_seqret100644000766000024 430214006427353 15676 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl # -*-Perl-*- mode (for emacs) =head1 NAME bp_seqret - bioperl implementation of sequence fetch from local db (like EMBOSS seqret) =head1 USAGE bp_seqret [-f/--format outputformat] [-o/--out/--outfile outfile] [-d/--db dbname] [-i/--id/-s/--seqname seqname1] Example usage: bp_seqret -f fasta -db db.fa -i seq1 -i seq2 > output.fas bp_seqret db.fa:seq1 output.fas bp_seqret db.fa:seq1 -o output.fas bp_seqret -db db.fa -o output.fas seq1 seq2 seq3 bp_seqret -db db.fa seq1 seq2 seq3 output.fas bp_seqret -db db.fa seq1 seq2 seq3 - > output.fas The DB is expected to be a Fasta formatted sequence file with multiple sequences. Output format is Fasta by default. If no output filename is provided then output is written to STDOUT. Providing '-' as the output filename will accomplish the same thing. 
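
The C<dbname:seqname> shorthand in the usage examples above can be illustrated with a minimal sketch. It assumes plain core Perl only; the helper name split_db_arg is hypothetical and is not part of bp_seqret, it merely mirrors the idea of treating everything before the first colon as the database name.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of bp_seqret): split "db.fa:seq1" into
# the database file name and the sequence name.
sub split_db_arg {
    my ($arg) = @_;

    # Everything up to the first ':' is taken as the database name,
    # the remainder as the sequence name.
    if ( $arg =~ /^([^:]+):(.+)$/ ) {
        return ( $1, $2 );
    }

    # No ':' present, so the argument is only a database name.
    return ( $arg, undef );
}

my ( $db, $name ) = split_db_arg('db.fa:seq1');
print "db=$db name=$name\n";    # prints "db=db.fa name=seq1"
```

Because the split happens at the first colon, a database path that itself contains a colon would need the explicit -d/--db option instead.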
=head1 AUTHOR Jason Stajich jason_AT_bioperl-dot-org =cut use strict; use warnings; use Bio::DB::Fasta; use Bio::SeqIO; use Getopt::Long; my $dbname; my @names; my $format = 'fasta'; my $outfile; my ($start,$end); GetOptions( 'f|format:s' => \$format, 'o|out|outfile:s' => \$outfile, 's|sbegin|begin|start:s' => \$start, 'e|send|end|stop:s' => \$end, 'd|db|dbname:s' => \$dbname, 'i|id|seqname:s' => \@names); if( ! $dbname ) { die "need a dbname\n" unless @ARGV; $dbname = shift @ARGV; if( $dbname =~ s/^([^:]+):// ) { push @names, $dbname; $dbname = $1; } } my $db = Bio::DB::Fasta->new($dbname, -glob => "*.{fa,fas,fsa,fasta,pep,aa,seq,cds,peps}"); if( ! $outfile ) { $outfile = pop @ARGV; } my $out; if( $outfile ) { $out = Bio::SeqIO->new(-format => $format, -file => ">$outfile"); } else { $out = Bio::SeqIO->new(-format => $format); } for my $nm ( @names ) { my $seq; if( $start || $end ) { $seq = $db->seq($nm, $start => $end); } else { $seq = $db->seq($nm); } if( $seq ) { my ($id,$desc) = split(/\s+/,$db->header($nm),2); if( $start && $end ) { $id = sprintf("%s_%d-%d",$id,$start || 0,$end || 0); } $out->write_seq(Bio::PrimarySeq->new(-display_id => $id, -description => $desc, -seq => $seq)); } else { warn("$nm not found\n"); } } bp_gccalc100644000766000024 640214006427353 15612 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl use strict; use warnings; use Bio::SeqIO; use Bio::Tools::SeqStats; use Getopt::Long; my $format = 'fasta'; my $file; my $help =0; my $aggregate; GetOptions( 'f|format:s' => \$format, 'i|in:s' => \$file, 'h|help|?' 
=> \$help, 'a|aggregate!'=> \$aggregate, ); my $USAGE = "usage: gccalc.pl -f format -i filename\n"; if( $help ) { die $USAGE; } $file = shift unless $file; my $seqin; if( defined $file ) { print "Could not open file [$file]\n$USAGE" and exit unless -e $file; $seqin = new Bio::SeqIO(-format => $format, -file => $file); } else { $seqin = new Bio::SeqIO(-format => $format, -fh => \*STDIN); } my ($total_base, $total_gc); while( my $seq = $seqin->next_seq ) { next if( $seq->length == 0 ); if( $seq->alphabet eq 'protein' ) { warn("gccalc does not work on amino acid sequences ...skipping this seq"); next; } my $seq_stats = Bio::Tools::SeqStats->new('-seq'=>$seq); my $hash_ref = $seq_stats->count_monomers(); # for DNA sequence print "Seq: ", $seq->display_id, " "; print $seq->desc if $seq->desc; print " Len:", $seq->length, "\n"; $total_base += $seq->length; $total_gc += $hash_ref->{'G'} + $hash_ref->{'C'}; printf "GC content is %.4f\n", ($hash_ref->{'G'} + $hash_ref->{'C'}) / $seq->length(); foreach my $base (sort keys %{$hash_ref}) { print "Number of bases of type ", $base, "= ", $hash_ref->{$base},"\n"; } print "--\n"; } if( $aggregate ) { printf "Total GC content is %.4f out of %d bases\n", $total_gc / $total_base, $total_base; } # alternatively one could use code submitted by # cckim@stanford.edu sub calcgc { my $seq = $_[0]; my @seqarray = split('',$seq); my $count = 0; foreach my $base (@seqarray) { $count++ if $base =~ /[GC]/i; } my $len = $#seqarray+1; return $count / $len; } __END__ =head1 NAME bp_gccalc - GC content of nucleotide sequences =head1 SYNOPSIS bp_gccalc [-f/--format FORMAT] [-h/--help] filename or bp_gccalc [-f/--format FORMAT] < filename or bp_gccalc [-f/--format FORMAT] -i filename =head1 DESCRIPTION This script prints out the GC content for every nucleotide sequence from the input file. =head1 OPTIONS The default sequence format is fasta.
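
The GC content reported by the script is the number of G and C residues divided by the sequence length. A minimal sketch of that arithmetic, assuming a plain sequence string and core Perl only (the helper name gc_fraction is hypothetical and does not appear in bp_gccalc, which uses Bio::Tools::SeqStats instead), might look like this:

```perl
use strict;
use warnings;

# Hypothetical helper: fraction of G and C residues in a sequence string.
sub gc_fraction {
    my ($seq) = @_;
    my $len = length $seq;
    return 0 unless $len;    # avoid division by zero on empty input

    # tr/// with an empty replacement list counts matching characters
    # without modifying the string.
    my $gc = ( $seq =~ tr/GCgc// );
    return $gc / $len;
}

printf "GC content is %.4f\n", gc_fraction('ATGCGC');    # prints "GC content is 0.6667"
```

Counting with tr/// avoids splitting the sequence into an array of single characters, which matters for long sequences.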
The sequence input can be provided using any of the three methods: =over 3 =item unnamed argument bp_gccalc filename =item named argument bp_gccalc -i filename =item standard input bp_gccalc < filename =back =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Jason Stajich Email jason@bioperl.org =head1 HISTORY Based on script code (see bottom) submitted by cckim@stanford.edu Submitted as part of bioperl script project 2001/08/06 =cut bp_mutate100644000766000024 1014314006427353 15712 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_mutate.pl - randomly mutagenize a single protein or DNA sequence =head1 SYNOPSIS ./bp_mutate.pl -p 25 -i test.fa -n 5 -f swiss -o muts.swiss #or ./bp_mutate.pl --percent=25 --input=test.fa --number=5 -output=x.fa =head1 DESCRIPTION Randomly mutagenize a single protein or DNA sequence one or more times. Specify percentage mutated and number of resulting mutant sequences. Print mutagenized sequences to STDOUT or write to an output file. -h|--help Help -p|--percent Percent mutagenized -n|--number Number of mutant sequences created -o|--output Output file (optional) -f|--format Output format (default: fasta) -i|--input Input file =head1 FEEDBACK User feedback is an integral part of the evolution of this and other Bioperl scripts. Send your comments and suggestions to the Bioperl mailing list. Your participation is much appreciated. 
bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR Brian Osborne, bosborne at alum.mit.edu =cut use strict; use warnings; use Getopt::Long; use Bio::SeqIO; my ( $help, $in_file, $percent, $out_file, $number ); my $format = "fasta"; my @dna = qw(a g c t); my @amino = qw(a c d e f g h i k l m n p q r s t v w y); GetOptions( "h|help" => \$help, "p|percent:i" => \$percent, "n|number:i" => \$number, "o|output:s" => \$out_file, "f|format:s" => \$format, "i|input:s" => \$in_file ); usage() if ( $help || !$percent || !$in_file || !$number || $percent > 100 ); # Seed the random number generator. "time|$$" combines the # current time with the current process id srand( time | $$ ); my $seqio = Bio::SeqIO->new( -file => $in_file ); my $seqobj = $seqio->next_seq; my $num_mut = percent_to_num($percent); my @seq_arr = (); # don't keep a mutant that's already been made while ( $number > $#seq_arr + 1 ) { my $mut_seq = mutate_all( $seqobj, $num_mut ); push @seq_arr, $mut_seq unless ( grep /$mut_seq/, @seq_arr ); } foreach my $mut_seq (@seq_arr) { my $name = $seqobj->display_id . 
"-${percent}_percent-$number"; my $outseq = Bio::Seq->new( -seq => $mut_seq, -display_id => $name, -desc => $seqobj->desc ); my %args = ( -format => $format ); $args{-file} = ">>$out_file" if $out_file; my $seqio = Bio::SeqIO->new(%args); $seqio->write_seq($outseq); $number--; } # mutagenize the sequence, one-by-one sub mutate_all { my ( $seq_obj, $num ) = @_; my $type = $seq_obj->alphabet; my $str = $seq_obj->seq; # store the mutagenized positions in $positions my $positions = ""; for ( my $i = 0 ; $i < $num_mut ; ++$i ) { ( $str, $positions ) = mutate_one( $str, $type, $positions ); } $str; } # mutagenize one position sub mutate_one { my ( $str, $type, $positions ) = @_; my ( $position, $new_char ); # pick a random position in the sequence, checking # that the position isn't already mutagenized do { $position = random_position($str); } until ( $positions !~ /\b$position\b/ ); $positions .= "$position "; my $current_char = substr( $str, $position, 1 ); # pick a random char that's not the existing char do { $new_char = random_char($type); } until ( $new_char ne $current_char ); substr( $str, $position, 1, $new_char ); ( $str, $positions ); } # randomly select a position in the sequence sub random_position { my $string = shift; int( rand( length($string) ) ); } # randomly select one of the chars depending on alphabet sub random_char { my $type = shift; $type eq "protein" ? 
return $amino[ rand @amino ] : return $dna[ rand @dna ]; } sub percent_to_num { my $percent = shift; int( $percent / 100 * length( $seqobj->seq ) ); } sub usage { exec( 'perldoc', $0 ); exit(0); } __END__ Tree000755000766000024 014006427353 14222 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tNode.t100644000766000024 507114006427353 15437 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tree# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; use File::Temp qw(tempfile); test_begin( -tests => 33 ); use_ok('Bio::Tree::Node'); use_ok('Bio::TreeIO'); } my $node1 = Bio::Tree::Node->new(); my $node2 = Bio::Tree::Node->new(); ok( $node1->is_Leaf() ); is( $node1->ancestor, undef ); # tests for tags ok !$node1->has_tag('test'); is $node1->add_tag_value( 'test', 'a' ), 1; ok $node1->has_tag('test'); is $node1->add_tag_value( 'test', 'b' ), 2; my @tags = $node1->get_tag_values('test'); is scalar @tags, 2; is scalar $node1->get_tag_values('test'), 'a', 'retrieve the first value'; is $node1->remove_tag('test2'), 0; is $node1->remove_tag('test'), 1; ok !$node1->has_tag('test'); is $node1->set_tag_value( 'test', ( 'a', 'b', 'c' ) ), 3; is $node1->remove_all_tags(), undef; ok !$node1->has_tag('test'); my $pnode = Bio::Tree::Node->new(); $pnode->add_Descendent($node1); is( $node1->ancestor, $pnode ); $pnode->add_Descendent($node2); is( $node2->ancestor, $pnode ); ok( !$pnode->is_Leaf ); my $phylo_node = Bio::Tree::Node->new( -bootstrap => 0.25, -id => 'ADH_BOV', -desc => 'Taxon 1' ); $node1->add_Descendent($phylo_node); ok( !$node1->is_Leaf ); is( $phylo_node->ancestor, $node1 ); is( $phylo_node->id, 'ADH_BOV' ); is( $phylo_node->bootstrap, 0.25 ); is( $phylo_node->description, 'Taxon 1' ); is $phylo_node->ancestor($node2), $node2; ok $node1->is_Leaf; is my @descs = $node2->each_Descendent, 1; is $descs[0], $phylo_node; # bug 2877 my $str = "(A:52,(B:46,C:50):11,D:70)68"; my $in = Bio::TreeIO->new( -internal_node_id => 'bootstrap', -format 
=> 'nhx', -string => $str, ); my $t = $in->next_tree; my $s; my $old_root = $t->get_root_node(); my ($b) = $t->find_node( -id => "B" ); my $b_anc = $b->ancestor; my $r = $b->create_node_on_branch( -FRACTION => 0.5 ); $r->id('fake'); # before reroot is( $t->as_text('newick',$in->params), "(A:52,(C:50,(B:23)fake:23):11,D:70)68;", 'with fake node' ); # after reroot $t->reroot($r); is( $t->as_text('newick',$in->params), "(B:23,(C:50,(A:52,D:70)68:11):23)fake;", "after reroot on fake node" ); $t->reroot($b); is( $t->as_text('newick',$in->params), "(((C:50,(A:52,D:70)68:11):23)fake:23)B;", "reroot on B" ); $t->reroot($b_anc); $t->splice( -remove_id => 'fake' ); is( $t->as_text('newick',$in->params), "(B:23,C:50,(A:52,D:70)68:11);", "remove fake node, reroot on former B anc" ); $t->reroot($old_root); is( $t->as_text('newick',$in->params), "(A:52,(B:23,C:50):11,D:70)68;", "roundtrip" ); Tree.t100644000766000024 3133214006427353 15470 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tree# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 66); use_ok('Bio::TreeIO'); } my $verbose = test_debug(); my $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'nhx', -file => test_input_file('test.nhx')); my $tree = $treeio->next_tree; # tests for tags ok ! $tree->has_tag('test'); is $tree->add_tag_value('test','a'), 1; ok $tree->has_tag('test'); is $tree->add_tag_value('test','b'), 2; my @tags = $tree->get_tag_values('test'); is scalar @tags, 2; is scalar $tree->get_tag_values('test'), 'a', 'retrieve the first value'; is $tree->remove_tag('test2'), 0; is $tree->remove_tag('test'), 1; ok ! $tree->has_tag('test'); is $tree->set_tag_value('test',('a','b','c')), 3; is $tree->remove_all_tags(), undef; ok ! 
$tree->has_tag('test'); my @nodes = $tree->find_node('ADH2'); is(@nodes, 2,'Number of nodes that have ADH2 as name'); if( $verbose ) { $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'nhx', ); $treeio->write_tree($tree); print "nodes are: \n", join(", ", map { $_->id . ":". (defined $_->branch_length ? $_->branch_length : '' ) } @nodes), "\n"; } $treeio = Bio::TreeIO->new(-format => 'newick', -file => test_input_file('test.nh')); $tree = $treeio->next_tree; if( $verbose ) { my $out = Bio::TreeIO->new(-format => 'tabtree'); $out->write_tree($tree); } my @hADH = ( $tree->find_node('hADH1'), $tree->find_node('hADH2') ); my ($n4) = $tree->find_node('yADH4'); is($tree->is_monophyletic(-nodes => \@hADH, -outgroup => $n4),1,'Test Monophyly'); my @mixgroup = ( $tree->find_node('hADH1'), $tree->find_node('yADH2'), $tree->find_node('yADH3'), ); my ($iADHX) = $tree->find_node('iADHX'); # test height is($iADHX->height, 0,'Height'); is($iADHX->depth,0.22,'Depth'); isnt( $tree->is_monophyletic(-nodes => \@mixgroup, -outgroup=> $iADHX),1, 'non-monophyletic group'); # binary tree? 
is $tree->is_binary, 0, 'not a binary tree'; is scalar $tree->get_nodes, 12, '12 nodes'; $tree->verbose(-1); $tree->force_binary; is $tree->is_binary, 1, 'after force_binary() it is'; is scalar $tree->get_nodes, 17, 'and there are more nodes (17)'; my $in = Bio::TreeIO->new(-format => 'newick', -fh => \*DATA); $tree = $in->next_tree; my ($a,$b,$c,$d) = ( $tree->find_node('A'), $tree->find_node('B'), $tree->find_node('C'), $tree->find_node('D')); is($tree->is_monophyletic(-nodes => [$b,$c], -outgroup => $d),1, 'B,C are Monophyletic'); is($tree->is_monophyletic(-nodes => [$b,$a], -outgroup => $d),1,'A,B are Monophyletic'); $tree = $in->next_tree; my ($e,$f,$i); ($a,$b,$c,$d,$e,$f,$i) = ( $tree->find_node('A'), $tree->find_node('B'), $tree->find_node('C'), $tree->find_node('D'), $tree->find_node('E'), $tree->find_node('F'), $tree->find_node('I'), ); isnt( $tree->is_monophyletic(-nodes => [$b,$f], -outgroup => $d),1,'B,F are not Monophyletic' ); is($tree->is_monophyletic(-nodes => [$b,$a], -outgroup => $f),1, 'A,B are Monophyletic'); # test for paraphyly isnt( $tree->is_paraphyletic(-nodes => [$a,$b,$c], -outgroup => $d), 1,'A,B,C are not Monophyletic w D as outgroup'); is( $tree->is_paraphyletic(-nodes => [$a,$f,$e], -outgroup => $i), 1, 'A,F,E are monophyletic with I as outgroup'); # test for rerooting the tree my $out = Bio::TreeIO->new(-format => 'newick', -fh => \*STDERR, -noclose => 1); $tree = $in->next_tree; $tree->verbose( -1 ) unless $verbose; my $node_cnt_orig = scalar($tree->get_nodes); # reroot on an internal node: should work fine $a = $tree->find_node('A'); # removing node_count checks because re-rooting can change the # number of internal nodes (if it is done correctly) my $total_length_orig = $tree->total_branch_length; is $tree->total_branch_length, $tree->subtree_length, "subtree_length() without attributes is an alias to total_branch_lenght()"; cmp_ok($total_length_orig, '>',$tree->subtree_length($a->ancestor), 'Length of the tree is larger that 
lenght of a subtree'); $out->write_tree($tree) if $verbose; is($tree->reroot($a),1, 'Can re-root with A as outgroup'); $out->write_tree($tree) if $verbose; is($node_cnt_orig, scalar($tree->get_nodes), 'Count the number of nodes'); my $total_length_new = $tree->total_branch_length; my $eps = 0.001 * $total_length_new; # tolerance for checking length warn("orig total len ", $total_length_orig, "\n") if $verbose; warn("new total len ", $tree->total_branch_length,"\n") if $verbose; # according to retree in phylip these branch lengths actually get larger # go figure... # this should be fixed now/maj ok(($total_length_orig >= $tree->total_branch_length - $eps) && ($total_length_orig <= $tree->total_branch_length + $eps),'same length'); # prob with below: rerooted tree on node A at line 146; so $a IS root #/maj is($tree->get_root_node, $a->ancestor, "Root node is A's ancestor"); is($tree->get_root_node, $a, "Root node is A"); # former test expected the old behavior of reroot; here is the new # test/maj my $desc = ($a->each_Descendent)[0]; my $newroot = $desc->create_node_on_branch(-FRACTION=>0.5, -ANNOT=>{-id=>'newroot'}); $tree->reroot($newroot); is($tree->get_root_node, $a->ancestor, "Root node is A's ancestor"); # try to reroot on an internal, will result in there being 1 less node # Rerooting should be an invariant operation with respect to node number!/maj # the test show that it now is, because the secret removal of nodes # no longer occurs $a = $tree->find_node('C')->ancestor; $out->write_tree($tree) if $verbose; is($tree->reroot($a),1, "Can reroot with C's ancsestor"); $out->write_tree($tree) if $verbose; #/maj is($node_cnt_orig, scalar($tree->get_nodes), 'Check to see that node count is correct after an internal node was removed after this re-rooting'); # but we did add a new node at line 166, so is($node_cnt_orig+1, scalar($tree->get_nodes), 'Node count correct'); warn("orig total len ", $total_length_orig, "\n") if $verbose; warn("new total len ", 
$tree->total_branch_length,"\n") if $verbose; cmp_ok($total_length_orig, '>=', $tree->total_branch_length - $eps, 'Total original branch length is what it is supposed to be'); # branch length should also be invariant w/r to rerooting... cmp_ok($total_length_orig, '<=',$tree->total_branch_length + $eps, 'Updated total branch length after the reroot'); # again, we rerooted ON THE NODE, so $a IS the root./maj is($tree->get_root_node, $a, 'Make sure root is really what we asked for'); # try to reroot on new root: should fail #/maj $a = $tree->get_root_node; isnt( $tree->reroot($a),1, 'Testing for failed re-rerooting'); # try a more realistic tree $tree = $in->next_tree; $a = $tree->find_node('VV'); $node_cnt_orig = scalar($tree->get_nodes); $total_length_orig = $tree->total_branch_length; $out->write_tree($tree) if $verbose; is($tree->reroot($a),1, 'Test that rooting succeeded'); #mod /maj $out->write_tree($tree) if $verbose; # node number should be invariant after reroot/maj is($node_cnt_orig, scalar($tree->get_nodes), 'Test that re-rooted tree has proper number of nodes after re-rooting'); #mod /maj $total_length_new = $tree->total_branch_length; $eps = 0.001 * $total_length_new; # tolerance for checking length cmp_ok($total_length_orig, '>=', $tree->total_branch_length - $eps, 'Branch length before rerooting'); cmp_ok($total_length_orig, '<=', $tree->total_branch_length + $eps, 'Branch length after rerooting'); is($tree->get_root_node, $a,'Root is really the ancestor we asked for'); #mod /maj # BFS and DFS search testing $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'newick', -file => test_input_file('test.nh')); $tree = $treeio->next_tree; my ($ct,$n) = (0); my $let = ord('A'); for $n ( $tree->get_leaf_nodes ) { $n->id(chr($let++)); } for $n ( grep {! 
$_->is_Leaf } $tree->get_nodes ) {
    $n->id($ct++);
}
# enable for debugging
Bio::TreeIO->new(-format => 'newick')->write_tree($tree) if( $verbose );

my $BFSorder = join(",", map { $_->id } ( $tree->get_nodes(-order => 'b')));
is($BFSorder, '0,1,3,2,C,D,E,F,G,H,A,B', 'BFS traversal order');
my $DFSorder = join(",", map { $_->id } ( $tree->get_nodes(-order => 'd')));
is($DFSorder, '0,1,2,A,B,C,D,3,E,F,G,H', 'DFS traversal order');

# test some Bio::Tree::TreeFunctionI methods
#find_node tested extensively already
$tree->remove_Node('H');
$DFSorder = join(",", map { $_->id } ( $tree->get_nodes(-order => 'd')));
is($DFSorder, '0,1,2,A,B,C,D,3,E,F,G', 'DFS traversal after removing H');
$tree->splice(-remove_id => 'G');
$DFSorder = join(",", map { $_->id } ( $tree->get_nodes(-order => 'd')));
is($DFSorder, '0,1,2,A,B,C,D,3,E,F', 'DFS traversal after removing G');
$tree->splice(-remove_id => [('E', 'F')], -keep_id => 'F');
$DFSorder = join(",", map { $_->id } ( $tree->get_nodes(-order => 'd')));
# the node '3' is not explicitly removed, so it should still be there
# I suspect that it disappeared before because of the previously
# automatic removal of internal degree 2 nodes../maj
is($DFSorder, '0,1,2,A,B,C,D,3,F', 'DFS traversal after removing E');
$tree->splice(-keep_id => [qw(0 1 2 A B C D)]);
$DFSorder = join(",", map { $_->id } ( $tree->get_nodes(-order => 'd')));
is($DFSorder, '0,1,2,A,B,C,D', 'DFS after removing all but 0,1,2,A,B,C,D');
#get_lineage_nodes, get_lineage_string, get_lca, merge_lineage, contract_linear_paths tested in Taxonomy.t

# try out the id to bootstrap copy method
$treeio = Bio::TreeIO->new(-format => 'newick',
                           -file   => test_input_file('bootstrap.tre'));
$tree = $treeio->next_tree;
my ($test_node) = $tree->find_node(-id => 'A');
is($test_node->ancestor->id, 90,'Testing bootstrap copy');
is($test_node->ancestor->ancestor->id, '25','Testing bootstrap copy');
is($test_node->ancestor->ancestor->ancestor->id, '0','Testing bootstrap copy');
$tree->move_id_to_bootstrap;
is($test_node->ancestor->id, '','Testing bootstrap copy');
is($test_node->ancestor->bootstrap, '90', 'Testing bootstrap copy');
is($test_node->ancestor->ancestor->id, '', 'Testing bootstrap copy');
is($test_node->ancestor->ancestor->bootstrap, '25', 'Testing bootstrap copy');
is($test_node->ancestor->ancestor->ancestor->bootstrap, '0','Testing bootstrap copy');

# change TreeIO to parse
$treeio = Bio::TreeIO->new(-format => 'newick',
                           -file   => test_input_file('bootstrap.tre'),
                           -internal_node_id => 'bootstrap');
$tree = $treeio->next_tree;
($test_node) = $tree->find_node(-id => 'A');
is($test_node->ancestor->id, '','Testing auto-bootstrap copy during parse');
is($test_node->ancestor->ancestor->id, '', 'Testing auto-bootstrap copy during parse');
is($test_node->ancestor->bootstrap, '90', 'Testing auto-bootstrap copy during parse');
is($test_node->ancestor->ancestor->bootstrap, '25', 'Testing auto-bootstrap copy during parse');

# return an empty array when no nodes are found
ok $tree = Bio::Tree::Tree->new();
@nodes = $tree->get_nodes;
is scalar @nodes, 0;

__DATA__
(D,(C,(A,B)));
(I,((D,(C,(A,B)x)y),(E,(F,G))));
(((A:0.3,B:2.1):0.45,C:0.7),D:4);
(A:0.031162,((((((B:0.022910,C:0.002796):0.010713,(D:0.015277,E:0.020484):0.005336):0.005588,((F:0.013293,(G:0.018374,H:0.003108):0.005318):0.006047,I:0.014607):0.001677):0.004196,(((((J:0.003307,K:0.001523):0.011884,L:0.006960):0.006514,((M:0.001683,N:0.000100):0.002226,O:0.007085):0.014649):0.008004,P:0.037422):0.005201,(Q:0.000805,R:0.000100):0.015280):0.005736):0.004612,S:0.042283):0.017979,(T:0.006883,U:0.016655):0.040226):0.014239,((((((V:0.000726,W:0.000100):0.028490,((((X:0.011182,Y:0.001407):0.005293,Z:0.011175):0.004701,AA:0.007825):0.016256,BB:0.029618):0.008146):0.004279,CC:0.035012):0.060215,((((((DD:0.014933,(EE:0.008148,FF:0.000100):0.015458):0.003891,GG:0.010996):0.001489,(HH:0.000100,II:0.000100):0.054265):0.003253,JJ:0.019722):0.013796,((KK:0.001960,LL:0.004924):0.013034,MM:0.010071):0.043273):0.011912,(NN:0.031543,OO:0.018307):0.059182):0.026517):0.011087,((PP:0.000100,QQ:0.002916):0.067214,(RR:0.064486,SS:0.013444):0.011613):0.050846):0.015644,((TT:0.000100,UU:0.009287):0.072710,(VV:0.009242,WW:0.009690):0.035346):0.042993):0.060365); Tools000755000766000024 014006427353 14423 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tSeg.t100644000766000024 215414006427353 15470 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 15); use_ok('Bio::Tools::Seg'); } my ($infile, $parser) ; $infile = test_input_file('seg.out'); ok ($parser = Bio::Tools::Seg->new(-file=>$infile), 'parser defined') ; my @feat; while ( my $feat = $parser->next_result ) { push @feat, $feat; } is scalar(@feat), 3; # seq 0 #>LBL_0012(32-46) complexity=2.47 (12/2.20/2.50) #gdggwtfegwggppe # seq 1 #>LBL_0012(66-80) complexity=2.31 (12/2.20/2.50) #kfssrasakavakks # seq 2 #>LBL_0012(123-138) complexity=2.31 (12/2.20/2.50) #svivsqsqgvvkgvgv my $raa_testdata = [ [ 'LBL_0012', 32, 46, 2.47 ], [ 'LBL_0012', 66, 80, 2.31 ], [ 'LBL_0012', 123, 138, 2.31 ], ] ; for (0..( 
scalar(@feat)-1 )) { is ( $feat[$_]->seq_id, $raa_testdata->[$_]->[0], "seq id for seq $_ identified" ) ; is ( $feat[$_]->start, $raa_testdata->[$_]->[1], "start for seq $_ identified" ) ; is ( $feat[$_]->end, $raa_testdata->[$_]->[2], "end for seq $_ identified" ) ; is ( $feat[$_]->score, $raa_testdata->[$_]->[3], "score for seq $_ identified" ) ; } GFF.t100644000766000024 611214006427353 15352 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 34); use_ok('Bio::Tools::GFF'); use_ok('Bio::SeqFeature::Generic'); } my $feat = Bio::SeqFeature::Generic->new( -start => 10, -end => 100, -strand => -1, -primary => 'repeat', -source => 'repeatmasker', -score => 1000, -tag => { new => 1, author => 'someone', sillytag => 'this is silly!;breakfast' } ); ok($feat); my ($out1, $out2) = (test_output_file(), test_output_file()); my $gff1out = Bio::Tools::GFF->new(-gff_version => 1, -file => ">$out1"); ok($gff1out); my $gff2out = Bio::Tools::GFF->new(-gff_version => 2, -file => ">$out2"); ok($gff2out); $gff1out->write_feature($feat); $gff2out->write_feature($feat); $gff1out->close(); $gff2out->close(); my $gff1in = Bio::Tools::GFF->new(-gff_version => 1, -file => "$out1"); ok($gff1in); my $gff2in = Bio::Tools::GFF->new(-gff_version => 2, -file => "$out2"); ok($gff2in); my $feat1 = $gff1in->next_feature(); ok($feat1); is($feat1->start, $feat->start); is($feat1->end, $feat->end); is($feat1->primary_tag, $feat->primary_tag); is($feat1->score, $feat->score); my $feat2 = $gff2in->next_feature(); ok($feat2); is($feat2->start, $feat->start); is($feat2->end, $feat->end); is($feat2->primary_tag, $feat->primary_tag); is($feat2->score, $feat->score); is(($feat2->get_tag_values('sillytag'))[0], 'this is silly!;breakfast'); #test sequence-region parsing $gff2in = Bio::Tools::GFF->new(-gff_version => 2, -file => test_input_file('hg16_chroms.gff')); 
is($gff2in->next_feature(),undef); my $seq = $gff2in->next_segment; is($seq->display_id, 'chr1'); is($seq->end, 246127941); is($seq->start, 1); # GFF3 SKIP: { test_skip(-tests => 12, -requires_module => 'IO::String'); my $str = IO::String->new; my $gffout = Bio::Tools::GFF->new(-fh => $str, -gff_version => 3); my $feat_test = Bio::SeqFeature::Generic->new (-primary_tag => 'tag', -source_tag => 'exon', -seq_id => 'testseq', -score => undef, -start => 10, -end => 120, -strand => 1, -tag => { 'bungle' => 'jungle;mumble', 'lion' => 'snake=tree' }); $feat_test->add_tag_value('giant_squid', 'lakeshore manor'); $gffout->write_feature($feat_test); seek($str,0,0); my $in = Bio::Tools::GFF->new(-fh => $str, -gff_version => 3); my $f_recon = $in->next_feature; is($f_recon->primary_tag, $feat_test->primary_tag); is($f_recon->source_tag, $feat_test->source_tag); is($f_recon->score, $feat_test->score); is($f_recon->start, $feat_test->start); is($f_recon->end, $feat_test->end); is($f_recon->strand, $feat_test->strand); for my $tag ( $feat_test->get_all_tags ) { ok($f_recon->has_tag($tag)); if( $f_recon->has_tag($tag) ) { my @v = $feat_test->get_tag_values($tag); my @g = $f_recon->get_tag_values($tag); while( @v && @g ) { is(shift @v, shift @g); } } } } SeqIO000755000766000024 014006427353 14303 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tscf.t100644000766000024 2024514006427353 15426 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 80); use_ok('Bio::SeqIO::scf'); use_ok('Bio::Seq::SequenceTrace'); } my $verbose = test_debug(); ok my $in_scf = Bio::SeqIO->new(-file => test_input_file('chad100.scf'), -format => 'scf', -verbose => $verbose); my $swq = $in_scf->next_seq(); isa_ok($swq,"Bio::Seq::SequenceTrace"); cmp_ok (length($swq->seq()), '>', 10); my $qualities = join(' ',@{$swq->qual()}); cmp_ok (length($qualities), '>', 10); my $id = $swq->id(); is 
($swq->id(), "ML4942R"); my $a_channel = $swq->trace("a"); cmp_ok (scalar(@$a_channel), '>', 10); my $c_channel = $swq->trace("c"); cmp_ok (scalar(@$c_channel), '>', 10); my $g_channel = $swq->trace("g"); cmp_ok (scalar(@$g_channel), '>', 10); my $t_channel = $swq->trace("t"); cmp_ok (scalar(@$t_channel), '>', 10); my $ref = $swq->peak_indices(); my @indices = @$ref; my $indexcount = 761; is (scalar(@indices), $indexcount); #use Data::Dumper; #---------------------------------------- isa_ok $swq->seq_obj, 'Bio::Seq::Quality'; isa_ok $swq->qual_obj, 'Bio::Seq::Quality'; is $swq->alphabet, 'dna', 'alphabet'; is $swq->display_id, 'ML4942R', 'display_id'; like $swq->primary_id, qr/HASH/, 'primary_id is the stringified memory position'; is $swq->primary_id('ABC'), 'ABC', 'set primary_id'; is $swq->accession_number, 'unknown', 'accession_number'; is $swq->desc, undef, 'desc'; is $swq->desc('test'), 'test', 'desc'; is $swq->id, 'ML4942R', 'id'; is $swq->id('test'), 'test', 'id'; is length($swq->seq), $indexcount, 'seq'; my $len = 7; my $start = $swq->length-$len+1; my $end = $swq->length; is $swq->subseq($start,$end), 'cctcaag', 'subseq'; is $swq->baseat($start), 'c', 'baseat'; is $swq->qualat($start), '18', 'qualat'; is $swq->trace_value_at('a',$start), '482', 'trace_value_at'; TODO: { local $TODO = 'documentation and code for accuracies() do not match' if 1; is $swq->accuracies('a',$start), '482', 'accuracies'; } my $qualstring = join(' ',@{$swq->subqual($start,$end)}); is ($qualstring, '18 18 21 15 8 8 8'); my $refs = $swq->sub_peak_index($start,$end); is @$refs, $len, 'sub_peak_index'; is $swq->peak_index_at($start), 8819, 'peak_index_at'; my $indices_at_end = join(' ',@{$swq->sub_peak_index($start,$end)}); is($indices_at_end, '8819 8831 8843 8853 8862 8873 8891'); my $swq_end = $swq->trace_length(); my $swq_start = $swq_end - $len +1; my $subtrace_a = join(' ',@{$swq->sub_trace('a',$swq_start,$swq_end)}); is $subtrace_a, '13 3 0 0 75 274 431'; my $swq2 = 
$swq->sub_trace_object(1,5); #$traces2->verbose(-1); isa_ok($swq2, 'Bio::Seq::SequenceTrace'); $swq2->_synthesize_traces(), 1; # this should not be a private method! Heikki $swq2->set_accuracies(), 1; is $swq->accuracy_at('a',1), '755', 'accuracy_at'; #---------------------------------------- warn("Now checking version3...\n") if $verbose; my $in_scf_v3 = Bio::SeqIO->new(-file => test_input_file('version3.scf'), -format => 'scf', -verbose => $verbose); my $v3 = $in_scf_v3->next_seq(); isa_ok($v3, 'Bio::Seq::SequenceTrace'); my $ind = $v3->peak_indices(); my @ff = @$ind; @indices = @{$v3->peak_indices()}; is (scalar(@indices), 1106); my %header = %{$in_scf_v3->get_header()}; is $header{bases}, 1106; is $header{samples}, 14107; # is the Bio::Seq::SequenceTrace AnnotatableI? my $ac = $v3->annotation(); isa_ok($ac,"Bio::Annotation::Collection"); my @name_comments = grep {$_->tagname() eq 'NAME'} $ac->get_Annotations('comment'); is $name_comments[0]->as_text(), 'Comment: IIABP1D4373'; # also get comments this way... $ac = $in_scf_v3->get_comments(); isa_ok($ac,"Bio::Annotation::Collection"); @name_comments = grep {$_->tagname() eq 'NAME'} $ac->get_Annotations('comment'); is $name_comments[0]->as_text(), 'Comment: IIABP1D4373'; my @conv_comments = grep {$_->tagname() eq 'CONV'} $ac->get_Annotations('comment'); is $conv_comments[0]->as_text(), 'Comment: phred version=0.990722.h'; # is the SequenceTrace object annotated? 
my $st_ac = $swq->annotation(); isa_ok ($st_ac, "Bio::Annotation::Collection"); my @ann = $st_ac->get_Annotations(); is $ann[0]->tagname, 'SIGN'; is $ann[2]->text, 'SRC3700'; is $ann[5]->tagname, 'LANE'; is $ann[5]->text, 89; is $ann[6]->text, 'phred version=0.980904.e'; is $ann[8]->text, 'ABI 373A or 377'; my $outfile = test_output_file(); my $out_scf = Bio::SeqIO->new(-file => ">$outfile", -format => 'scf', -verbose => $verbose); # Bug 2196 - commentless scf my $in = Bio::SeqIO->new(-file => test_input_file('13-pilE-F.scf'), -format => 'scf', -verbose => $verbose); my $seq = $in->next_seq; ok ($seq); isa_ok($seq, 'Bio::Seq::SequenceTrace'); $ac = $seq->annotation; isa_ok($ac, 'Bio::Annotation::Collection'); @name_comments = grep {$_->tagname() eq 'NAME'} $ac->get_Annotations('comment'); is $name_comments[0], undef; @conv_comments = grep {$_->tagname() eq 'CONV'} $ac->get_Annotations('comment'); is $conv_comments[0], undef; # the new way warn("Now testing the _writing_ of scfs\n") if $verbose; $out_scf->write_seq(-target => $v3, -MACH => 'CSM sequence-o-matic 5000', -TPSW => 'trace processing software', -BCSW => 'basecalling software', -DATF => 'AM_Version=2.00', -DATN => 'a22c.alf', -CONV => 'Bioperl-scf.pm'); ok( -s $outfile && ! -z "$outfile" ); # TODO? 
tests below don't do much $out_scf = Bio::SeqIO->new(-verbose => 1, -file => ">$outfile", -format => 'scf'); $swq = Bio::Seq::Quality->new(-seq =>'ATCGATCGAA', -qual =>"10 20 30 40 50 20 10 30 40 50", -alphabet =>'dna'); my $trace = Bio::Seq::SequenceTrace->new(-swq => $swq); $out_scf->write_seq( -target => $trace, -MACH => 'CSM sequence-o-matic 5000', -TPSW => 'trace processing software', -BCSW => 'basecalling software', -DATF => 'AM_Version=2.00', -DATN => 'a22c.alf', -CONV => 'Bioperl-scf.pm' ); warn("Trying to write an scf with a subset of a real scf...\n") if $verbose; $out_scf = Bio::SeqIO->new(-verbose => 1, -file => ">$outfile", -format => 'scf'); $in_scf_v3 = Bio::SeqIO->new(-file => test_input_file('version3.scf'), -format => 'scf', -verbose => $verbose); $v3 = $in_scf_v3->next_seq(); my $sub_v3 = $v3->sub_trace_object(5,50); #warn("The subtrace object is this:\n") if $DEBUG; $out_scf->write_seq(-target => $sub_v3 ); my $in_scf_v2 = Bio::SeqIO->new(-file => test_input_file('version2.scf'), -format => 'scf', -verbose => $verbose); $v3 = $in_scf_v2->next_seq(); ok($v3); $out_scf = Bio::SeqIO->new(-file => ">$outfile", -format => "scf"); $out_scf->write_seq( -target => $v3, -version => 2 ); # simple round trip tests (bug 2881) my %file_map = ( # filename # write_seq args 'chad100.scf' => 1, '13-pilE-F.scf' => 1, 'version2.scf' => 1, 'version3.scf' => 1 ); for my $f (sort keys %file_map) { my $outfile = test_output_file(); my $in = Bio::SeqIO->new(-file => test_input_file($f), -format => 'scf'); my $out = Bio::SeqIO->new(-file => ">$outfile", -format => 'scf'); my $seq1 = $in->next_seq(); isa_ok($seq1, 'Bio::Seq::SequenceTrace'); ok($out->write_seq(-target => $seq1)); my $in2 = Bio::SeqIO->new(-file => "<$outfile", -format => 'scf'); my $seq2 = $in2->next_seq(); isa_ok($seq2, 'Bio::Seq::SequenceTrace'); if ($seq1->display_id) { TODO: { local $TODO = "display_id doesn't round trip yet"; is($seq1->display_id, $seq2->display_id, 'display_id matches'); } } 
    is_deeply($seq1->qual, $seq2->qual, 'qual scores match');
}

# synthesizing traces roundtrip (bug 2881):

my @sequences=('A',
               'ATGGAGCTCATCAAAGAATCGACTCATATATCCATCCCTGAACGGCTGACTCACATTAATGGTTGA');
foreach my $sequence (@sequences) {
    my $qualstr=join ' ', map { 65 } (1 .. length($sequence));
    my $seq_qual=Bio::Seq::Quality->new(-seq=>$sequence, -qual=>$qualstr);

    my $outfile=test_output_file();
    my $out=Bio::SeqIO->new(-file=>">$outfile", -format=>'scf');
    $out->write_seq(-target=>$seq_qual);

    my $in=Bio::SeqIO->new(-file=>$outfile, -format=>'scf');
    my $in_seq=$in->next_seq();

    is_deeply($seq_qual, $in_seq->{swq}, 'Bio::Seq::Quality matches');
}
ace.t100644000766000024 271314006427353 15363 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 7);

    use_ok 'Bio::SeqIO';
}

my $verbose = test_debug();

my $t_file = test_input_file('test.ace');
my $before;
{
    local $/ = undef;
    open my $BEFORE, '<', $t_file or die "Could not read file '$t_file': $!\n";
    $before = <$BEFORE>;
    close $BEFORE;
}

my $a_in = Bio::SeqIO->new( -FILE    => $t_file,
                            -verbose => $verbose,
                            -FORMAT  => 'ace' );
my @a_seq;
while (my $a = $a_in->next_seq) {
    push @a_seq, $a;
}

is @a_seq, 3, 'number of sequence objects';

my $esc_name = $a_seq[1]->display_id;
is $esc_name, 'Name; 4% strewn with \ various / escaped characters',
    "unescaping of characters, $esc_name";

is $a_seq[0]->alphabet, 'protein', 'alphabets detected';
is $a_seq[1]->alphabet, 'dna', 'alphabets detected';

my $o_file = test_output_file();
my $a_out = Bio::SeqIO->new( -FILE    => ">$o_file",
                             -verbose => $verbose,
                             -FORMAT  => 'ace' );
my $a_out_ok = 1;
for my $a (@a_seq) {
    $a_out->write_seq($a) or $a_out_ok = 0;
}
undef($a_out);  # Flush to disk
is $a_out_ok,1,'writing sequence';

my $after;
{
    local $/ = undef;
    open my $AFTER, '<', $o_file or die "Could not read file '$o_file': $!\n";
    $after = <$AFTER>;
    close $AFTER;
}

is( ($before and $after and
($before eq $after)), 1, 'test output'); raw.t100644000766000024 1114314006427353 15441 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 25, -requires_modules => [], -requires_networking => 0, ); use_ok('Bio::SeqIO::raw'); } my $verbose = test_debug(); my $format = 'raw'; my $seqio_obj = Bio::SeqIO->new(-file => test_input_file("test.$format"), -format => $format); isa_ok($seqio_obj, 'Bio::SeqIO'); is $seqio_obj->variant, 'multiple'; my @methods = qw(next_seq write_seq); foreach my $method (@methods) { can_ok($seqio_obj, $method) || diag "$method method not implemented for $format"; } # checking the first sequence object my $seq_obj = $seqio_obj->next_seq(); isa_ok($seq_obj, 'Bio::Seq'); my %expected = ('seq' => 'MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGL' . 'DYRTTDENLKAHEKWGNIVDVVVMKDPRTKRSRGFGFI' . 'TYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPN' . 'AGATVKKLFVGALKDDHDEQSIRDYFQHFGNIVDNIVI' . 'DKETGKKRGFAFVEFDDYDPVDKVVLQKQHQLNGKMVD' . 'VKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGN' . 'QNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGY' . 'GGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGY' . 'QQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGN' . 'YGNNQGFNNGGNNRRY', 'length' => '358', ); is ($seq_obj->seq(), $expected{'seq'}, 'sequence'); is ($seq_obj->length(), $expected{'length'}, 'length'); # checking the second sequence object my $seq_obj2 = $seqio_obj->next_seq(); isa_ok($seq_obj2, 'Bio::Seq'); my %expected2 = ('seq' => 'MNKQRGTYSEVSLAQDPKRQQRKLKGNKISISGTKQEI' . 'FQVELNLQNASSDHQGNDKTYHCKGLLPPPEKLTAEVL' . 'GIICIVLMATVLKTIVLIPCIGVLEQNNFSLNRRMQKA' . 'RHCGHCPEEWITYSNSCYYIGKERRTWEERVCWPVLRR' . 
'TLICFL', 'length' => '158', ); is ($seq_obj2->seq(), $expected2{'seq'}, 'sequence'); is ($seq_obj2->length(), $expected2{'length'}, 'length'); # from testformats.pl SKIP: { test_skip(-tests => 2, -requires_modules => [qw(Algorithm::Diff IO::ScalarArray IO::String)]); use_ok('Algorithm::Diff', qw(diff LCS)); my ($file, $type) = ("test.$format", $format); my $filename = test_input_file($file); print "processing file $filename\n" if $verbose; open my $FILE, '<', $filename or die "Could not read file '$filename': $!\n"; my @datain = <$FILE>; close $FILE; my $in = IO::String->new(join('', @datain)); my $seqin = Bio::SeqIO->new( -fh => $in, -format => $type); my $out = IO::String->new; my $seqout = Bio::SeqIO->new( -fh => $out, -format => $type); my $seq; while( defined($seq = $seqin->next_seq) ) { $seqout->write_seq($seq); } $seqout->close(); $seqin->close(); my $strref = $out->string_ref; my @dataout = map { $_."\n"} split(/\n/, $$strref ); my @diffs = &diff( \@datain, \@dataout); is(@diffs, 0, "$format format can round-trip"); if(@diffs && $verbose) { use Data::Dumper; # should be safe for 5.005 and greater foreach my $d ( @diffs ) { print STDERR Dumper $d; foreach my $diff ( @$d ) { chomp($diff->[2]); print $diff->[0], $diff->[1], "\n>", $diff->[2], "\n"; } } print "in is \n", join('', @datain), "\n"; print "out is \n", join('',@dataout), "\n"; } } # test raw variants my @seq = qw(MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVV VMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVK KLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQK QHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGN NWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGND FGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY); $seqio_obj = Bio::SeqIO->new(-file => test_input_file("test2.raw"), -format => 'raw'); is($seqio_obj->variant, 'multiple'); my $ct = 0; while (my $seq = $seqio_obj->next_seq) { is($seq->seq, $seq[$ct]); $ct++; } is($ct, 6); 
$seqio_obj = Bio::SeqIO->new(-file => test_input_file("test2.raw"), -format => 'raw-single'); is($seqio_obj->variant, 'single'); my $seq = $seqio_obj->next_seq; is($seq->seq, join('', @seq)); $seqio_obj = Bio::SeqIO->new(-file => test_input_file("test2.raw"), -format => 'raw', -variant => 'single'); is($seqio_obj->variant, 'single'); $seq = $seqio_obj->next_seq; is($seq->seq, join('', @seq)); gcg.t100644000766000024 714014006427353 15372 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 17, -requires_modules => [], -requires_networking => 0, ); use_ok('Bio::SeqIO::gcg'); } my $verbose = test_debug(); my $format = 'gcg'; my $seqio_obj = Bio::SeqIO->new(-file => test_input_file("test.$format"), -format => $format); isa_ok($seqio_obj, 'Bio::SeqIO'); my @methods = qw(next_seq write_seq); foreach my $method (@methods) { can_ok($seqio_obj, $method) || diag "$method method not implemented for $format"; } # checking the first sequence object my $seq_obj = $seqio_obj->next_seq(); isa_ok($seq_obj, 'Bio::Seq'); isa_ok($seq_obj, 'Bio::Seq::RichSeq'); my %expected = ('seq' => 'MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGL' . 'DYRTTDENLKAHEKWGNIVDVVVMKDPRTKRSRGFGFI' . 'TYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPN' . 'AGATVKKLFVGALKDDHDEQSIRDYFQHFGNIVDNIVI' . 'DKETGKKRGFAFVEFDDYDPVDKVVLQKQHQLNGKMVD' . 'VKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGN' . 'QNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGY' . 'GGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGY' . 'QQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGN' . 
'YGNNQGFNNGGNNRRY', 'length' => '358', 'primary_id' => 'roa1_drome', 'description' => qr(Rea guano receptor type III), ); is ($seq_obj->seq(), $expected{'seq'}, 'sequence'); is ($seq_obj->length(), $expected{'length'}, 'length'); TODO: { local $TODO = 'possible bug: RichSeq not setting primary_id?'; is ($seq_obj->primary_id(), $expected{'primary_id'}, 'primary_id'); } like ($seq_obj->description(), $expected{'description'}, 'description'); # test DOS linefeeds in gcg parser my $str = Bio::SeqIO->new(-file => test_input_file('test_badlf.gcg'), -verbose => $verbose, -format => 'GCG'); ok($str); my $seq = $str->next_seq(); isa_ok ($seq, 'Bio::SeqI'); is(length($seq->seq), $seq->length); print "Sequence 1 of 1 from GCG stream:\n", $seq->seq, "\n" if( $verbose); # from testformats.pl SKIP: { test_skip(-tests => 4, -requires_modules => [qw(Algorithm::Diff IO::ScalarArray IO::String)]); use_ok('Algorithm::Diff'); eval "use Algorithm::Diff qw(diff LCS);"; use_ok('IO::ScalarArray'); use_ok('IO::String'); my ($file, $type) = ("test.$format", $format); my $filename = test_input_file($file); print "processing file $filename\n" if $verbose; open my $FILE, '<', $filename or die "Could not read file '$filename': $!\n"; my @datain = <$FILE>; close $FILE; my $in = IO::String->new(join('', @datain)); my $seqin = Bio::SeqIO->new( -fh => $in, -format => $type); my $out = IO::String->new; my $seqout = Bio::SeqIO->new( -fh => $out, -format => $type); my $seq; while( defined($seq = $seqin->next_seq) ) { $seqout->write_seq($seq); } $seqout->close(); $seqin->close(); my $strref = $out->string_ref; my @dataout = map { $_."\n"} split(/\n/, $$strref ); my @diffs = &diff( \@datain, \@dataout); is(@diffs, 0, "$format format can round-trip"); if(@diffs && $verbose) { foreach my $d ( @diffs ) { foreach my $diff ( @$d ) { chomp($diff->[2]); print $diff->[0], $diff->[1], "\n>", $diff->[2], "\n"; } } print "in is \n", join('', @datain), "\n"; print "out is \n", join('',@dataout), "\n"; } } 
phd.t100644000766000024 1341114006427353 15423 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 21); use_ok('Bio::SeqIO::phd'); } my $DEBUG = test_debug(); print("Checking to see if Bio::Seq::Quality objects can be created from a file...\n") if ($DEBUG); my $in_phd = Bio::SeqIO->new('-file' => test_input_file('test.phd'), '-format' => 'phd', '-verbose' => $DEBUG); isa_ok($in_phd,'Bio::SeqIO::phd'); my $phd = $in_phd->next_seq(); is($phd->quality_levels,'99',"Did you get the 'QUALITY_LEVELS' comment?"); isa_ok($phd,"Bio::Seq::Quality"); if( $DEBUG ) { my $position = 6; print("I saw these in phredfile.phd:\n\n"); print $_->tagname,": ",$_->display_text || 0," \n" for ($phd->annotation->get_Annotations('header')); print("What is the base at position $position (using subseq)?\n"); print($phd->subseq($position,$position)."\n"); print("What is the base at position $position (using baseat)?\n"); print($phd->baseat($position)."\n"); print("What is the quality at $position? (using subqual)\n"); my @qualsretr = @{$phd->subqual($position,$position)}; print($qualsretr[0]."\n"); print("What is the quality at $position? (using qualat)\n"); print($phd->qualat($position)."\n"); print("What is the trace at $position? (using trace_index_at)\n"); print($phd->trace_index_at($position)."\n"); print("What is the trace at $position? (using subtrace)\n"); my @tracesretr = @{$phd->subtrace($position,$position)}; print($tracesretr[0]."\n"); } print("OK. 
Now testing write_phd...\n") if($DEBUG);

my $outfile = test_output_file();
my $out_phd = Bio::SeqIO->new(-file => ">$outfile",
                              '-format' => 'phd');
isa_ok($out_phd,"Bio::SeqIO::phd");

$out_phd->write_seq($phd);

ok( -s $outfile);

# Bug 2120

my @qual = q(9 9 12 12 8 8 9 8 8 8 9);
my @trace = q(113 121 130 145 153 169 177 203 210 218 234);

$in_phd = Bio::SeqIO->new('-file' => test_input_file('bug2120.phd'),
                          '-format' => 'phd',
                          '-verbose' => $DEBUG);

my $seq = $in_phd->next_seq;
is($seq->subseq(10,20),'gggggccttat','$seq->subseq()');
my @seq_qual =$seq->subqual_text(10,20);
is_deeply(\@seq_qual,\@qual,'$seq->subqual_text()');
my @seq_trace = $seq->subtrace_text(10,20);
is_deeply(\@seq_trace,\@trace,'$seq->subtrace_text()');

if($DEBUG) {
    print "\nDefault header ... \n\n";
    use Bio::Seq::Quality;
    my $seq = Bio::Seq::Quality->new('-seq' => 'GAATTC');
    $out_phd->_fh(\*STDOUT);
    $out_phd->write_header($seq);
    print "Complete output\n\n";
    $out_phd->write_seq($seq);
}

print("Testing the header manipulation\n") if($DEBUG);
is($phd->chromat_file(),'ML4924R','$phd->chromat_file()');
$phd->chromat_file('ML4924R.esd');
is($phd->chromat_file(), 'ML4924R.esd','$phd->chromat_file()');
$phd->touch();

# Commented out 1/17/09.
# This isn't exactly a stable regression test as the comparison tests
# localtime() called from two different timepoints. They can differ if the calls
# occurred before and after a change in seconds, for example.
#my $localtime = localtime();
#is($phd->time, "$localtime", $phd->time.':'.$localtime);

if ($DEBUG){
    print "Testing the sequence ...\n";
    print ">",$phd->id," ",$phd->desc,"\n",$phd->seq,"\n";
    my $revcom = $phd->revcom;
    print ">revcom\n",$revcom->seq,"\n";
    print ">revcom_qual at 6\n",$revcom->qualat(6),"\n";
    print ">revcom_trace at 6 !!\n",$revcom->trace_index_at(6),"\n";
    my $trunc = $phd->trunc(10,20);
    print ">TRUNC 10,20\n",$trunc->seq,"\n>qual\n@{$trunc->qual}\n>trace\n@{$trunc->trace}\n";
}

# Multiple seqs in one file

$in_phd = Bio::SeqIO->new('-file' => test_input_file('multi.phd'),
                          '-format' => 'phd',
                          '-verbose' => $DEBUG);

@qual = qq(9 9 15 17 17 22 22 25 25 22 22);
@trace = qq(98 105 119 128 143 148 162 173 185 197 202);

$seq = $in_phd->next_seq;
is($seq->id, 'ML4924F');
is($seq->subseq(10,20),'tctcgagggta','$seq->subseq()');
@seq_qual =$seq->subqual_text(10,20);
is_deeply(\@seq_qual,\@qual,'$seq->subqual_text()');
@seq_trace = $seq->subtrace_text(10,20);
is_deeply(\@seq_trace,\@trace,'$seq->subtrace_text()');

@qual = qq(11 9 6 6 9 19 20 32 34 34 39);
@trace = qq(98 104 122 128 140 147 159 167 178 190 200);

$seq = $in_phd->next_seq;
is($seq->id, 'ML4924R');
is($seq->subseq(10,20),'gcctgcaggta','$seq->subseq()');
@seq_qual =$seq->subqual_text(10,20);
is_deeply(\@seq_qual,\@qual,'$seq->subqual_text()');
@seq_trace = $seq->subtrace_text(10,20);
is_deeply(\@seq_trace,\@trace,'$seq->subtrace_text()');

#if($DEBUG) {
#    print "\nDefault header ...
\n\n"; # use Bio::Seq::Quality; # my $seq = Bio::Seq::Quality->new('-seq' => 'GAATTC'); # $out_phd->_fh(\*STDOUT); # $out_phd->write_header($seq); # print "Complete output\n\n"; # $out_phd->write_seq($seq); #} ##print("Testing the header manipulation\n") if($DEBUG); #is($phd->chromat_file(),'ML4924R','$phd->chromat_file()'); #$phd->chromat_file('ML4924R.esd'); #is($phd->chromat_file(), 'ML4924R.esd','$phd->chromat_file()'); #$phd->touch(); #my $localtime = localtime(); #is($phd->time, "$localtime"); #if ($DEBUG){ # print "Testing the sequence ...\n"; # print ">",$phd->id," ",$phd->desc,"\n",$phd->seq,"\n"; # my $revcom = $phd->revcom; # print ">revcom\n",$revcom->seq,"\n"; # print ">revcom_qual at 6\n",$revcom->qualat(6),"\n"; # print ">revcom_trace at 6 !!\n",$revcom->trace_index_at(6),"\n"; # my $trunc = $phd->trunc(10,20); # print ">TRUNC 10,20\n",$trunc->seq,"\n>qual\n@{$trunc->qual}\n>trace\n@{$trunc->trace}\n"; #} # # Whole-read tags in the file $in_phd = Bio::SeqIO->new('-file' => test_input_file('multiseq_tags.phd'), '-format' => 'phd', '-verbose' => $DEBUG); isa_ok($in_phd,'Bio::SeqIO::phd'); my @seqs = (); while (my $seq = $in_phd->next_seq){ push @seqs, $seq; } is( scalar @seqs, 2 ); pir.t100644000766000024 170114006427353 15421 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { use Bio::Root::Test; test_begin( -tests => 12 ); use_ok('Bio::SeqIO::pir'); } my $verbose = test_debug(); my $in = Bio::SeqIO->new( -file => test_input_file('seqfile.pir'), -verbose => $verbose, -format => 'pir' ); ok( defined $in, 'new instance is defined ' ); isa_ok( $in, 'Bio::SeqIO' ); my $out = Bio::SeqIO->new( -format => 'pir', -fh => \*STDOUT ); while ( my $seq = $in->next_seq() ) { ok( $seq->length > 1, 'checked length' ); $out->write_seq($seq) if $verbose > 0; } # Empty description line $in = Bio::SeqIO->new( -file => test_input_file('seqfile-no-desc.pir'), -verbose => $verbose, -format => 'pir' ); my 
$seq = $in->next_seq(); ok( $seq->seq =~ /^MGD/, 'Correct start' ); $seq = $in->next_seq(); ok( $seq->seq =~ /^GDV/, 'Correct start' ); $seq = $in->next_seq(); ok( $seq->seq =~ /^GDV/, 'Correct start' ); tab.t100644000766000024 103714006427353 15377 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 8); use_ok('Bio::SeqIO::tab'); } my $verbose = test_debug(); my $io = Bio::SeqIO->new(-format => 'tab', -verbose => $verbose, -file => test_input_file('test.tab')); isa_ok($io, 'Bio::SeqIO'); while (my $seq = $io->next_seq) { ok ( $seq && defined $seq, 'seq is defined' ) ; is ( $seq->length, 358, 'check seq length' ) ; like ($seq->display_id, qr/^roa\d_drome$/, 'check matching' ); } PAM250100644000766000024 402414006427353 15163 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data# # This matrix was produced by "pam" Version 1.0.6 [28-Jul-93] # # PAM 250 substitution matrix, scale = ln(2)/3 = 0.231049 # # Expected score = -0.844, Entropy = 0.354 bits # # Lowest score = -8, Highest score = 17 # A R N D C Q E G H I L K M F P S T W Y V B Z X * A 2 -2 0 0 -2 0 0 1 -1 -1 -2 -1 -1 -3 1 1 1 -6 -3 0 0 0 0 -8 R -2 6 0 -1 -4 1 -1 -3 2 -2 -3 3 0 -4 0 0 -1 2 -4 -2 -1 0 -1 -8 N 0 0 2 2 -4 1 1 0 2 -2 -3 1 -2 -3 0 1 0 -4 -2 -2 2 1 0 -8 D 0 -1 2 4 -5 2 3 1 1 -2 -4 0 -3 -6 -1 0 0 -7 -4 -2 3 3 -1 -8 C -2 -4 -4 -5 12 -5 -5 -3 -3 -2 -6 -5 -5 -4 -3 0 -2 -8 0 -2 -4 -5 -3 -8 Q 0 1 1 2 -5 4 2 -1 3 -2 -2 1 -1 -5 0 -1 -1 -5 -4 -2 1 3 -1 -8 E 0 -1 1 3 -5 2 4 0 1 -2 -3 0 -2 -5 -1 0 0 -7 -4 -2 3 3 -1 -8 G 1 -3 0 1 -3 -1 0 5 -2 -3 -4 -2 -3 -5 0 1 0 -7 -5 -1 0 0 -1 -8 H -1 2 2 1 -3 3 1 -2 6 -2 -2 0 -2 -2 0 -1 -1 -3 0 -2 1 2 -1 -8 I -1 -2 -2 -2 -2 -2 -2 -3 -2 5 2 -2 2 1 -2 -1 0 -5 -1 4 -2 -2 -1 -8 L -2 -3 -3 -4 -6 -2 -3 -4 -2 2 6 -3 4 2 -3 -3 -2 -2 -1 2 -3 -3 -1 -8 K -1 3 1 0 -5 1 0 -2 0 -2 -3 5 0 -5 -1 0 0 -3 -4 -2 1 0 -1 -8 M -1 0 -2 -3 -5 -1 -2 -3 -2 2 4 0 6 0 -2 -2 -1 -4 -2 2 -2 -2 
-1 -8 F -3 -4 -3 -6 -4 -5 -5 -5 -2 1 2 -5 0 9 -5 -3 -3 0 7 -1 -4 -5 -2 -8 P 1 0 0 -1 -3 0 -1 0 0 -2 -3 -1 -2 -5 6 1 0 -6 -5 -1 -1 0 -1 -8 S 1 0 1 0 0 -1 0 1 -1 -1 -3 0 -2 -3 1 2 1 -2 -3 -1 0 0 0 -8 T 1 -1 0 0 -2 -1 0 0 -1 0 -2 0 -1 -3 0 1 3 -5 -3 0 0 -1 0 -8 W -6 2 -4 -7 -8 -5 -7 -7 -3 -5 -2 -3 -4 0 -6 -2 -5 17 0 -6 -5 -6 -4 -8 Y -3 -4 -2 -4 0 -4 -4 -5 0 -1 -1 -4 -2 7 -5 -3 -3 0 10 -2 -3 -4 -2 -8 V 0 -2 -2 -2 -2 -2 -2 -1 -2 4 2 -2 2 -1 -1 -1 0 -6 -2 4 -2 -2 -1 -8 B 0 -1 2 3 -4 1 3 0 1 -2 -3 1 -2 -4 -1 0 0 -5 -3 -2 3 2 -1 -8 Z 0 0 1 3 -5 3 3 0 2 -2 -3 0 -2 -5 0 0 -1 -6 -4 -2 2 3 -1 -8 X 0 -1 0 -1 -3 -1 -1 -1 -1 -1 -1 -1 -1 -2 -1 0 0 -4 -2 -1 -1 -1 -1 -8 * -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 -8 1 P33897100644000766000024 7033214006427353 15161 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID ALD_HUMAN STANDARD; PRT; 745 AA. AC P33897; DT 01-FEB-1994 (Rel. 28, Created) DT 01-FEB-1994 (Rel. 28, Last sequence update) DT 15-JUN-2004 (Rel. 44, Last annotation update) DE Adrenoleukodystrophy protein (ALDP). GN ABCD1 OR ALD. OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=93180910; PubMed=8441467; RA Mosser J., Douar A.-M., Sarde C.-O., Kioschis P., Feil R., Moser H., RA Poustka A.-M., Mandel J.-L., Aubourg P.; RT "Putative X-linked adrenoleukodystrophy gene shares unexpected RT homology with ABC transporters."; RL Nature 361:726-730(1993). RN [2] RP SEQUENCE FROM N.A. RA Platzer M., Bauer D., Brenner V., Drescher B., Nyakatura G., RA Reichwald K., Sandoval N., Coy J., Kioschis P., Korn B., RA Poustka A.-M., Rosenthal A.; RL Submitted (MAY-1996) to the EMBL/GenBank/DDBJ databases. RN [3] RP SUBUNITS, AND CHARACTERIZATION OF VARIANTS X-ALD HIS-389; GLN-401; RP ARG-484 AND GLN-591. 
RX MEDLINE=20020240; PubMed=10551832; RA Liu L.X., Janvier K., Berteaux-Lecellier V., Cartier N., Benarous R., RA Aubourg P.; RT "Homo- and heterodimerization of peroxisomal ATP-binding cassette RT half-transporters."; RL J. Biol. Chem. 274:32738-32743(1999). RN [4] RP FUNCTION, AND CHARACTERIZATION OF VARIANTS X-ALD SER-512 AND LEU-606. RX MEDLINE=21145507; PubMed=11248239; RA Roerig P., Mayerhofer P., Holzinger A., Gaertner J.; RT "Characterization and functional analysis of the nucleotide binding RT fold in human peroxisomal ATP binding cassette transporters."; RL FEBS Lett. 492:66-72(2001). RN [5] RP REVIEW. RX MEDLINE=93283453; PubMed=8507690; RA Aubourg P., Mosser J., Douar A.-M., Sarde C.-O., Lopez J., RA Mandel J.-L.; RT "Adrenoleukodystrophy gene: unexpected homology to a protein involved RT in peroxisome biogenesis."; RL Biochimie 75:293-302(1993). RN [6] RP REVIEW ON VARIANTS. RX MEDLINE=97338663; PubMed=9195223; RA Dodd A., Rowland S.A., Hawkes S.L.J., Kennedy M.A., Love D.R.; RT "Mutations in the adrenoleukodystrophy gene."; RL Hum. Mutat. 9:500-511(1997). RN [7] RP REVIEW ON VARIANTS. RX MEDLINE=21614879; PubMed=11748843; RA Kemp S., Pujol A., Waterham H.R., van Geel B.M., Boehm C.D., RA Raymond G.V., Cutting G.R., Wanders R.J.A., Moser H.W.; RT "ABCD1 mutations and the X-linked adrenoleukodystrophy mutation RT database: role in diagnosis and clinical correlations."; RL Hum. Mutat. 18:499-515(2001). RN [8] RP VARIANT X-ALD LYS-291. RX MEDLINE=94108454; PubMed=7904210; RA Cartier N., Sarde C.-O., Douar A.-M., Mosser J., Mandel J.-L., RA Aubourg P.; RT "Abnormal messenger RNA expression and a missense mutation in RT patients with X-linked adrenoleukodystrophy."; RL Hum. Mol. Genet. 2:1949-1951(1993). RN [9] RP VARIANTS X-ALD SER-148; ASP-174; ARG-266; GLN-401; TRP-418 AND RP PHE-515. 
RX MEDLINE=95152524; PubMed=7849723; RA Fuchs S., Sarde C.-O., Wedemann H., Schwinger E., Mandel J.-L., RA Gal A.; RT "Missense mutations are frequent in the gene for X-chromosomal RT adrenoleukodystrophy (ALD)."; RL Hum. Mol. Genet. 3:1903-1905(1994). RN [10] RP VARIANTS X-ALD TRP-518; LEU-606; CYS-617 AND HIS-617. RX MEDLINE=94314951; PubMed=8040304; RA Fanen P., Guidoux S., Sarde C.-O., Mandel J.-L., Goossens M., RA Aubourg P.; RT "Identification of mutations in the putative ATP-binding domain of RT the adrenoleukodystrophy gene."; RL J. Clin. Invest. 94:516-520(1994). RN [11] RP VARIANTS X-ALD. RX MEDLINE=95126139; PubMed=7825602; RA Ligtenberg M.J.L., Kemp S., Sarde C.-O., van Geel B.M., Kleijer W.J., RA Barth P.G., Mandel J.-L., van Oost B.A., Bolhuis P.A.; RT "Spectrum of mutations in the gene encoding the adrenoleukodystrophy RT protein."; RL Am. J. Hum. Genet. 56:44-50(1995). RN [12] RP VARIANTS X-ALD HIS-104; GLU-178; LEU-560 AND GLY-528 DEL. RX MEDLINE=95233433; PubMed=7717396; RA Braun A., Ambach H., Kammerer S., Rolinski B., Stoeckler S., Rabl W., RA Gaertner J., Zierz S., Roscher A.A.; RT "Mutations in the gene for X-linked adrenoleukodystrophy in patients RT with different clinical phenotypes."; RL Am. J. Hum. Genet. 56:854-861(1995). RN [13] RP VARIANTS X-ALD. RX MEDLINE=96047143; PubMed=7581394; RA Kok F., Neumann S., Sarde C.-O., Zheng S., Wu K.-H., Wei H.-M., RA Bergin J., Watkins P.A., Gould S., Sack G., Moser H., Mandel J.-L., RA Smith K.D.; RT "Mutational analysis of patients with X-linked adrenoleukodystrophy."; RL Hum. Mutat. 6:104-115(1995). RN [14] RP VARIANTS X-ALD. RX MEDLINE=96213748; PubMed=8651290; RA Feigenbaum V., Lombard-Platet G., Guidoux S., Sarde C.-O., RA Mandel J.-L., Aubourg P.; RT "Mutational and protein analysis of patients and heterozygous women RT with X-linked adrenoleukodystrophy."; RL Am. J. Hum. Genet. 58:1135-1144(1996). RN [15] RP VARIANTS X-ALD. 
RX MEDLINE=96163493; PubMed=8566952; RA Krasemann E.W., Meier V., Korenke G.C., Hunneman D.H., Hanefeld F.; RT "Identification of mutations in the ALD-gene of 20 families with RT adrenoleukodystrophy/adrenomyeloneuropathy."; RL Hum. Genet. 97:194-197(1996). RN [16] RP VARIANT X-ALD ARG-679. RX MEDLINE=98112466; PubMed=9452087; RA Korenke G.C., Krasemann E., Meier V., Beuche W., Hunneman D.H., RA Hanefeld F.; RT "First missense mutation (W679R) in exon 10 of the RT adrenoleukodystrophy gene in siblings with adrenomyeloneuropathy."; RL Hum. Mutat. Suppl. 1:S204-S206(1998). RN [17] RP VARIANTS X-ALD. RX MEDLINE=99408241; PubMed=10480364; RA Wichers M., Kohler W., Brennemann W., Boese V., Sokolowski P., RA Bidlingmaier F., Ludwig M.; RT "X-linked adrenomyeloneuropathy associated with 14 novel ALD-gene RT mutations: no correlation between type of mutation and age of onset."; RL Hum. Genet. 105:116-119(1999). RN [18] RP VARIANTS X-ALD LEU-108 AND SER-143. RX MEDLINE=99299442; PubMed=10369742; RA Perusi C., Gomez-Lira M., Mottes M., Pignatti P.F., Bertini E., RA Cappa M., Vigliani M.C., Schiffer D., Rizzuto N., Salviati A.; RT "Two novel missense mutations causing adrenoleukodystrophy in Italian RT patients."; RL Mol. Cell. Probes 13:179-182(1999). RN [19] RP VARIANTS X-ALD. RX MEDLINE=20202141; PubMed=10737980; RA Lachtermacher M.B., Seuanez H.N., Moser A.B., Moser H.W., Smith K.D.; RT "Determination of 30 X-linked adrenoleukodystrophy mutations, RT including 15 not previously described."; RL Hum. Mutat. 15:348-353(2000). RN [20] RP VARIANTS X-ALD GLN-401; TRP-418; LEU-543 AND ARG-556. RX MEDLINE=20438355; PubMed=10980539; RA Lira M.G., Mottes M., Pignatti P.F., Medica I., Uziel G., Cappa M., RA Bertini E., Rizzuto N., Salviati A.; RT "Detection of mutations in the ALD gene (ABCD1) in seven Italian RT families: description of four novel mutations."; RL Hum. Mutat. 16:271-271(2000). 
RN [21] RP VARIANTS X-ALD LEU-98; ASP-99; GLU-217; GLN-518; ASP-608; ILE-633 AND RP PRO-660, AND VARIANT THR-13. RX MEDLINE=21331689; PubMed=11438993; RA Dvorakova L., Storkanova G., Unterrainer G., Hujova J., Kmoch S., RA Zeman J., Hrebicek M., Berger J.; RT "Eight novel ABCD1 gene mutations and three polymorphisms in patients RT with X-linked adrenoleukodystrophy: the first polymorphism causing an RT amino acid exchange."; RL Hum. Mutat. 18:52-60(2001). RN [22] RP VARIANT X-ALD VAL-GLY-GLN-300 INS. RX MEDLINE=21668186; PubMed=11810273; DOI=10.1007/s00439-001-0632-z; RA Guimaraes C.P., Lemos M., Menezes I., Coelho T., Sa-Miranda C., RA Azevedo J.E.; RT "Characterisation of two mutations in the ABCD1 gene leading to low RT levels of normal ALDP."; RL Hum. Genet. 109:616-622(2001). RN [23] RP INVOLVEMENT IN CONTIGUOUS ABCD1/DXS1375E DELETION SYNDROME. RX PubMed=11992258; RA Corzo D., Gibson W., Johnson K., Mitchell G., LePage G., Cox G.F., RA Casey R., Zeiss C., Tyson H., Cutting G.R., Raymond G.V., Smith K.D., RA Watkins P.A., Moser A.B., Moser H.W., Steinberg S.J.; RT "Contiguous deletion of the X-linked adrenoleukodystrophy gene (ABCD1) RT and DXS1357E: a novel neonatal phenotype similar to peroxisomal RT biogenesis disorders."; RL Am. J. Hum. Genet. 70:1520-1531(2002). CC -!- FUNCTION: Probable transporter. The nucleotide-binding fold acts CC as an ATP-binding subunit with ATPase activity. CC -!- SUBUNIT: Can form homo- and heterodimers with ABCD2/ALDR and CC ABCD3/PMP70. Dimerization is necessary to form an active CC transporter. CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Peroxisomal. CC -!- DISEASE: Defects in ABCD1 are the cause of recessive X-linked CC adrenoleukodystrophy (X-ALD) [MIM:300100]. X-ALD is a rare CC peroxisomal metabolic disorder that occurs in boys and is CC characterized by progressive multifocal demyelination of the CC central nervous system and by adrenocortical insufficiency. 
It CC produces mental deterioration, corticospinal tract dysfunction, CC and cortical blindness. There is laboratory evidence of adrenal CC cortical dysfunction. Different clinical manifestations exist CC like: cerebral childhood ALD (CALD), adult cerebral ALD (ACALD), CC adrenomyeloneuropathy (AMN) and "Addison disease only" (ADO) CC phenotype. CC -!- DISEASE: Microdeletions in ABCD1 are involved in the contiguous CC ABCD1/DXS1375E deletion syndrome (CADDS) [MIM:300475]. Patients CC manifest profound neonatal hypotonia, subsequent failure to CC thrive, and cholestatic liver disease. CC -!- SIMILARITY: Belongs to the ABC transporter family. ALD subfamily. CC -!- DATABASE: NAME=X-ALD gene mutation database; CC WWW="http://www.x-ald.nl/". CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; Z21876; CAA79922.1; -. DR EMBL; Z31348; CAA83230.1; -. DR EMBL; Z31006; CAA83230.1; JOINED. DR EMBL; Z31007; CAA83230.1; JOINED. DR EMBL; Z31008; CAA83230.1; JOINED. DR EMBL; Z31009; CAA83230.1; JOINED. DR EMBL; Z31010; CAA83230.1; JOINED. DR EMBL; U52111; -; NOT_ANNOTATED_CDS. DR PIR; G02500; G02500. DR Genew; HGNC:61; ABCD1. DR MIM; 300371; -. DR MIM; 300100; -. DR MIM; 300475; -. DR GO; GO:0005779; C:integral to peroxisomal membrane; NAS. DR GO; GO:0004009; F:ATP-binding cassette (ABC) transporter acti...; NAS. DR GO; GO:0005215; F:transporter activity; NAS. 
DR GO; GO:0015919; P:peroxisomal membrane transport; NAS. DR GO; GO:0007031; P:peroxisome organization and biogenesis; NAS. DR InterPro; IPR003593; AAA_ATPase. DR InterPro; IPR003439; ABC_transporter. DR InterPro; IPR005283; FA_transporter. DR Pfam; PF00005; ABC_tran; 1. DR Pfam; PF06472; Ald_N; 1. DR ProDom; PD000006; ABC_transporter; 1. DR TIGRFAMs; TIGR00954; 3a01203; 1. DR PROSITE; PS50929; ABC_TM1F; 1. DR PROSITE; PS00211; ABC_TRANSPORTER_1; 1. DR PROSITE; PS50893; ABC_TRANSPORTER_2; 1. KW ATP-binding; Glycoprotein; Transmembrane; Transport; Peroxisome; KW Disease mutation; Polymorphism. FT TRANSMEM 92 112 Potential. FT TRANSMEM 131 151 Potential. FT TRANSMEM 238 258 Potential. FT TRANSMEM 333 353 Potential. FT TRANSMEM 473 493 Potential. FT NP_BIND 507 514 ATP (By similarity). FT VARIANT 13 13 N -> T (very rare polymorphism; does not FT affect ALDP function). FT /FTId=VAR_013340. FT VARIANT 90 90 E -> K (in X-ALD). FT /FTId=VAR_009349. FT VARIANT 98 98 S -> L (in X-ALD; CALD type). FT /FTId=VAR_000024. FT VARIANT 99 99 A -> D (in X-ALD; AMN-type). FT /FTId=VAR_013341. FT VARIANT 103 103 S -> R (in X-ALD). FT /FTId=VAR_009350. FT VARIANT 104 104 R -> C (in X-ALD). FT /FTId=VAR_000025. FT VARIANT 104 104 R -> H (in X-ALD; ADO-type). FT /FTId=VAR_000026. FT VARIANT 105 105 T -> I (in X-ALD; ADO-type). FT /FTId=VAR_000027. FT VARIANT 105 105 T -> P (in X-ALD). FT /FTId=VAR_009351. FT VARIANT 107 107 L -> P (in X-ALD; ALD/AMN/ADO-types and FT asymptomatic). FT /FTId=VAR_000028. FT VARIANT 108 108 S -> L (in X-ALD). FT /FTId=VAR_009352. FT VARIANT 108 108 S -> W (in X-ALD; CALD and AMN-types). FT /FTId=VAR_000029. FT VARIANT 113 113 R -> C (in X-ALD). FT /FTId=VAR_009353. FT VARIANT 113 113 R -> P (in X-ALD). FT /FTId=VAR_013342. FT VARIANT 116 116 G -> R (in X-ALD; CALD-type). FT /FTId=VAR_000030. FT VARIANT 123 123 A -> V. FT /FTId=VAR_000031. FT VARIANT 138 141 Missing (in X-ALD; ALD-type). FT /FTId=VAR_000032. FT VARIANT 141 141 A -> T (in X-ALD). 
FT /FTId=VAR_000033. FT VARIANT 143 143 P -> S (in X-ALD). FT /FTId=VAR_009354. FT VARIANT 148 148 N -> S (in X-ALD; ADO-type). FT /FTId=VAR_000034. FT VARIANT 149 149 S -> N (in X-ALD). FT /FTId=VAR_000035. FT VARIANT 152 152 R -> C (in X-ALD; ADO-type). FT /FTId=VAR_000036. FT VARIANT 152 152 R -> L (in X-ALD). FT /FTId=VAR_009355. FT VARIANT 152 152 R -> P (in X-ALD). FT /FTId=VAR_000037. FT VARIANT 152 152 R -> S (in X-ALD). FT /FTId=VAR_009356. FT VARIANT 161 161 S -> P (in X-ALD). FT /FTId=VAR_009357. FT VARIANT 163 163 R -> H (in X-ALD). FT /FTId=VAR_000038. FT VARIANT 163 163 R -> P (in X-ALD). FT /FTId=VAR_009358. FT VARIANT 174 174 Y -> C (in X-ALD). FT /FTId=VAR_009359. FT VARIANT 174 174 Y -> D (in X-ALD; ALD-type). FT /FTId=VAR_000039. FT VARIANT 174 174 Y -> S (in X-ALD; CALD-type). FT /FTId=VAR_000040. FT VARIANT 178 178 Q -> E (in X-ALD; AMN-type). FT /FTId=VAR_000041. FT VARIANT 181 181 Y -> C (in X-ALD; ALMD-type). FT /FTId=VAR_000042. FT VARIANT 182 182 R -> P (in X-ALD). FT /FTId=VAR_000043. FT VARIANT 189 189 R -> W (in X-ALD). FT /FTId=VAR_009360. FT VARIANT 190 190 L -> P (in X-ALD). FT /FTId=VAR_009361. FT VARIANT 194 194 D -> H (in X-ALD). FT /FTId=VAR_000044. FT VARIANT 198 198 T -> K (in X-ALD). FT /FTId=VAR_009362. FT VARIANT 200 200 D -> N (in X-ALD). FT /FTId=VAR_009363. FT VARIANT 200 200 D -> V (in X-ALD; CALD-type). FT /FTId=VAR_000045. FT VARIANT 207 207 S -> SAAS (in X-ALD). FT /FTId=VAR_013343. FT VARIANT 211 211 L -> P (in X-ALD). FT /FTId=VAR_000046. FT VARIANT 213 213 S -> C (in X-ALD). FT /FTId=VAR_009364. FT VARIANT 214 214 N -> D (in X-ALD). FT /FTId=VAR_009365. FT VARIANT 217 217 K -> E (in X-ALD). FT /FTId=VAR_013344. FT VARIANT 218 218 P -> T (in X-ALD). FT /FTId=VAR_009366. FT VARIANT 220 220 L -> P (in X-ALD). FT /FTId=VAR_000047. FT VARIANT 221 221 D -> G (in X-ALD; CALD and AMN-types). FT /FTId=VAR_000048. FT VARIANT 224 224 V -> E (in X-ALD). FT /FTId=VAR_013345. FT VARIANT 229 229 L -> P (in X-ALD). 
FT /FTId=VAR_009367. FT VARIANT 254 254 T -> M (in X-ALD; AMN-type). FT /FTId=VAR_000049. FT VARIANT 254 254 T -> P (in X-ALD; AMN-type). FT /FTId=VAR_000050. FT VARIANT 263 263 P -> L (in X-ALD; CALD, AMN and AD- FT typeS). FT /FTId=VAR_000051. FT VARIANT 266 266 G -> R (in X-ALD). FT /FTId=VAR_000052. FT VARIANT 271 271 E -> K (in X-ALD). FT /FTId=VAR_009368. FT VARIANT 274 274 R -> W (in X-ALD). FT /FTId=VAR_013346. FT VARIANT 276 276 K -> E (in X-ALD; CALD-type). FT /FTId=VAR_000053. FT VARIANT 277 277 G -> R (in X-ALD; AMN-type). FT /FTId=VAR_000054. FT VARIANT 277 277 G -> GN (in X-ALD; ADO-type). FT /FTId=VAR_000055. FT VARIANT 277 277 G -> W (in X-ALD). FT /FTId=VAR_000056. FT VARIANT 280 280 R -> C (in X-ALD). FT /FTId=VAR_013347. FT VARIANT 285 285 R -> P (in X-ALD). FT /FTId=VAR_009369. FT VARIANT 291 291 E -> D (in X-ALD; ACALD and CALD-types). FT /FTId=VAR_000057. FT VARIANT 291 291 E -> K (in X-ALD). FT /FTId=VAR_000058. FT VARIANT 291 291 Missing (in X-ALD; ALD-type). FT /FTId=VAR_000059. FT VARIANT 294 294 A -> T (in X-ALD; AMN-type). FT /FTId=VAR_000060. FT VARIANT 296 296 Y -> C (in X-ALD). FT /FTId=VAR_009370. FT VARIANT 298 298 G -> D (in X-ALD). FT /FTId=VAR_009371. FT VARIANT 300 300 E -> EVGQ (in X-ALD). FT /FTId=VAR_013348. FT VARIANT 302 302 E -> K (in X-ALD). FT /FTId=VAR_009372. FT VARIANT 322 322 L -> P (in X-ALD). FT /FTId=VAR_009373. FT VARIANT 336 336 K -> M (in X-ALD). FT /FTId=VAR_009374. FT VARIANT 339 339 W -> R (in X-ALD). FT /FTId=VAR_013349. FT VARIANT 342 342 S -> P (in X-ALD; AMN-type). FT /FTId=VAR_000061. FT VARIANT 343 343 G -> D (in X-ALD). FT /FTId=VAR_013350. FT VARIANT 389 389 R -> G (in X-ALD; AMN-type). FT /FTId=VAR_000062. FT VARIANT 389 389 R -> H (in X-ALD; does not affect protein FT stability, homo- and heterodimerization FT with ALDR and PMP70). FT /FTId=VAR_000063. FT VARIANT 401 401 R -> Q (in X-ALD; ALD and AMN-types; does FT not affect protein stability, homo- and FT heterodimerization with ALDR and PMP70). 
FT /FTId=VAR_000064. FT VARIANT 401 401 R -> W (in X-ALD). FT /FTId=VAR_009375. FT VARIANT 418 418 R -> W (in X-ALD; AMN-type). FT /FTId=VAR_000065. FT VARIANT 427 427 Missing (in X-ALD). FT /FTId=VAR_013351. FT VARIANT 484 484 P -> R (in X-ALD; CALD, AMN and ADO- FT types; significantly decreases FT homodimerization and abolishes FT heterodimerization with ALDR and PMP70). FT /FTId=VAR_000066. FT VARIANT 507 507 G -> V (in X-ALD; CALD-types). FT /FTId=VAR_000067. FT VARIANT 512 512 G -> S (in X-ALD; CALD and AS-types; FT reduced ATPase activity). FT /FTId=VAR_000068. FT VARIANT 515 515 S -> F (in X-ALD). FT /FTId=VAR_000069. FT VARIANT 518 518 R -> Q (in X-ALD; CALD-type). FT /FTId=VAR_000070. FT VARIANT 518 518 R -> W (in X-ALD; CALD-type). FT /FTId=VAR_000071. FT VARIANT 522 522 G -> W (in X-ALD; AD-type). FT /FTId=VAR_000072. FT VARIANT 528 528 Missing (in X-ALD; CALD-type). FT /FTId=VAR_000073. FT VARIANT 529 529 G -> S (in X-ALD). FT /FTId=VAR_009376. FT VARIANT 534 534 P -> L (in X-ALD; CALD-type). FT /FTId=VAR_000074. FT VARIANT 540 540 F -> S (in X-ALD). FT /FTId=VAR_009377. FT VARIANT 543 543 P -> L (in X-ALD). FT /FTId=VAR_009378. FT VARIANT 544 544 Q -> R (in X-ALD). FT /FTId=VAR_009379. FT VARIANT 552 552 S -> P (in X-ALD). FT /FTId=VAR_009380. FT VARIANT 554 554 R -> H (in X-ALD). FT /FTId=VAR_009381. FT VARIANT 556 556 Q -> R (in X-ALD; ACALD type). FT /FTId=VAR_013352. FT VARIANT 560 560 P -> L (in X-ALD; CALD-type). FT /FTId=VAR_000075. FT VARIANT 560 560 P -> R (in X-ALD; AMN and ALMD-types). FT /FTId=VAR_000076. FT VARIANT 560 560 P -> S (in X-ALD). FT /FTId=VAR_013353. FT VARIANT 566 566 M -> K (in X-ALD). FT /FTId=VAR_000077. FT VARIANT 591 591 R -> P (in X-ALD). FT /FTId=VAR_013354. FT VARIANT 591 591 R -> Q (in X-ALD; AMN-type; significantly FT decreases homodimerization and abolishes FT heterodimerization with ALDR and PMP70). FT /FTId=VAR_000078. FT VARIANT 591 591 R -> W (in X-ALD). FT /FTId=VAR_009382. 
FT VARIANT 606 606 S -> L (in X-ALD; decreased ATP-binding FT affinity). FT /FTId=VAR_000079. FT VARIANT 606 606 S -> P (in X-ALD; CALD, AMN and ALMD- FT types). FT /FTId=VAR_000080. FT VARIANT 608 608 G -> D (in X-ALD; CALD-type). FT /FTId=VAR_013355. FT VARIANT 609 609 E -> G (in X-ALD). FT /FTId=VAR_000081. FT VARIANT 609 609 E -> K (in X-ALD; AMN-type). FT /FTId=VAR_000082. FT VARIANT 616 616 A -> V (in X-ALD). FT /FTId=VAR_009383. FT VARIANT 617 617 R -> C (in X-ALD; ALD-type and FT asymptomatic). FT /FTId=VAR_000083. FT VARIANT 617 617 R -> G (in X-ALD; ADO and AMN-types with FT cerebral involvement). FT /FTId=VAR_000084. FT VARIANT 617 617 R -> H (in X-ALD). FT /FTId=VAR_000085. FT VARIANT 626 626 A -> D (in X-ALD). FT /FTId=VAR_013356. FT VARIANT 626 626 A -> T (in X-ALD; CALD and AMN-types). FT /FTId=VAR_000086. FT VARIANT 629 629 D -> H (in X-ALD). FT /FTId=VAR_000087. FT VARIANT 630 630 E -> G (in X-ALD). FT /FTId=VAR_009384. FT VARIANT 631 631 C -> Y (in X-ALD). FT /FTId=VAR_009385. FT VARIANT 632 632 T -> I (in X-ALD). FT /FTId=VAR_013357. FT VARIANT 633 633 S -> I (in X-ALD; asymptomatic). FT /FTId=VAR_013358. FT VARIANT 633 633 S -> R (in X-ALD). FT /FTId=VAR_009386. FT VARIANT 635 635 V -> M (in X-ALD). FT /FTId=VAR_013359. FT VARIANT 636 636 S -> I (in X-ALD). FT /FTId=VAR_009387. FT VARIANT 638 638 D -> Y (in X-ALD). FT /FTId=VAR_009388. FT VARIANT 646 646 A -> P (in X-ALD). FT /FTId=VAR_009389. FT VARIANT 654 654 L -> P (in X-ALD). FT /FTId=VAR_009390. FT VARIANT 657 657 Missing (in X-ALD; CALD-type). FT /FTId=VAR_000088. FT VARIANT 660 660 R -> P (in X-ALD; CALD-type). FT /FTId=VAR_013360. FT VARIANT 660 660 R -> W (in X-ALD; CALD, ALMD and AS- FT types). FT /FTId=VAR_000089. FT VARIANT 667 667 H -> D (in X-ALD). FT /FTId=VAR_009391. FT VARIANT 668 668 T -> I (in X-ALD). FT /FTId=VAR_009392. FT VARIANT 679 679 W -> R (in X-ALD; AMN-type). FT /FTId=VAR_000090. FT VARIANT 693 693 T -> M (in X-ALD). FT /FTId=VAR_009393. 
SQ SEQUENCE 745 AA; 82908 MW; 13A8EFDE9EB1E7FA CRC64; MPVLSRPRPW RGNTLKRTAV LLALAAYGAH KVYPLVRQCL APARGLQAPA GEPTQEASGV AAAKAGMNRV FLQRLLWLLR LLFPRVLCRE TGLLALHSAA LVSRTFLSVY VARLDGRLAR CIARKDPRAF GWQLLQWLLI ALPATFVNSA IRYLEGQLAL SFRSRLVAHA YRLYFSQQTY YRVSNMDGRL RNPDQSLTED VVAFAASVAH LYSNLTKPLL DVAVTSYTLL RAARSRGAGT AWPSAIAGLV VFLTANVLRA FSPKFGELVA EEARRKGELR YMHSRVVANS EEIAFYGGHE VELALLQRSY QDLASQINLI LLERLWYVML EQFLMKYVWS ASGLLMVAVP IITATGYSES DAEAVKKAAL EKKEEELVSE RTEAFTIARN LLTAAADAIE RIMSSYKEVT ELAGYTARVH EMFQVFEDVQ RCHFKRPREL EDAQAGSGTI GRSGVRVEGP LKIRGQVVDV EQGIICENIP IVTPSGEVVV ASLNIRVEEG MHLLITGPNG CGKSSLFRIL GGLWPTYGGV LYKPPPQRMF YIPQRPYMSV GSLRDQVIYP DSVEDMQRKG YSEQDLEAIL DVVHLHHILQ REGGWEAMCD WKDVLSGGEK QRIGMARMFY HRPKYALLDE CTSAVSIDVE GKIFQAAKDA GIALLSITHR PSLWKYHTHL LQFDGEGGWK FEKLDSAARL SLTEEKQRLE QQLAGIPKMQ RRLQELCQIL GEAVAPAHVP APSPQGPGGL QGAST // so.obo100644000766000024 65633414006427353 15577 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataformat-version: 1.2 date: 13:02:2006 15:53 saved-by: kareneilbeck auto-generated-by: OBO-Edit 1.000-beta15 subsetdef: SOFA "SO feature annotation" default-namespace: sequence remark: autogenerated-by\: DAG-Edit version 1.417\nsaved-by\: eilbeck\ndate\: Tue May 11 15\:18\:44 PDT 2004\nversion\: $Revision\: 1.45 $ [Term] id: SO:0000000 name: Sequence_Ontology subset: SOFA [Term] id: SO:0000001 name: region def: "Continuous sequence." [SO:ke] subset: SOFA synonym: "sequence" RELATED [] is_a: SO:0000110 ! located_sequence_feature [Term] id: SO:0000002 name: sequence_secondary_structure def: "A folded sequence." [SO:ke] is_a: SO:0000001 ! region [Term] id: SO:0000003 name: G_quartet def: "G-quartets are unusual nucelic acid structures consisting of a planar arrangement where each guanine is hydrogen bonded by hoogsteen pairing to another guanine in the quartet." 
[http://www.library.csi.cuny.edu/ ~ davis/molbiol/lecture_notes/post-transcriptional_processes/RNACapping.pdf] synonym: "G-quartet" RELATED [] is_a: SO:0000002 ! sequence_secondary_structure [Term] id: SO:0000004 name: interior_coding_exon is_a: SO:0000195 ! coding_exon [Term] id: SO:0000005 name: satellite_DNA def: "The many tandem repeats (identical or related) of a short basic repeating unit; many have a base composition or other property different from the genome average that allows them to be separated from the bulk (main band) genomic DNA." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] is_a: SO:0000657 ! repeat_region [Term] id: SO:0000006 name: PCR_product def: "A region amplified by a PCR reaction." [SO:ke] subset: SOFA synonym: "amplicon" RELATED [] is_a: SO:0000695 ! reagent [Term] id: SO:0000007 name: read_pair def: "A pair of sequencing reads in which the two members of the pair are related by originating at either end of a clone insert." [SO:ls] subset: SOFA is_a: SO:0000143 ! assembly_component relationship: part_of SO:0000149 ! contig [Term] id: SO:0000008 name: gene_sensu_your_favorite_organism is_a: SO:0000009 ! gene_class [Term] id: SO:0000009 name: gene_class is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000010 name: protein_coding_gene is_a: SO:0000009 ! gene_class [Term] id: SO:0000011 name: non_protein_coding_gene is_a: SO:0000009 ! gene_class [Term] id: SO:0000012 name: scRNA_primary_transcript def: "The primary transcript of any one of several small cytoplasmic RNA moleculespresent in the cytoplasm and sometimes nucleus of a eukaryote." [http:www.ebi.ac.uk/embl/WebFeat/align/scRNA_s.html] synonym: "small_cytoplasmic_RNA" RELATED [] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000013 name: scRNA def: "Any one of several small cytoplasmic RNA moleculespresent in the cytoplasm and sometimes nucleus of a eukaryote." 
[http:www.ebi.ac.uk/embl/WebFeat/align/scRNA_s.html] subset: SOFA is_a: SO:0000655 ! ncRNA [Term] id: SO:0000014 name: INR_motif def: "A sequence element characteristic of some RNA polymerase II promoters required for the correct positioning of the polymerase for the start of transcription. Overlaps the TSS. The mammalian consensus sequence is YYAN(T|A)YY; the Drosophila consensus sequence is TCA(G|T)t(T|C). In each the A is at position +1 with respect to the TSS. Functionally similar to the TATA box element." [PMID:12651739] relationship: part_of SO:0000170 ! RNApol_II_promoter [Term] id: SO:0000015 name: DPE_motif def: "A sequence element characteristic of some RNA polymerase II promoters; always found with the INR_motif. Positioned from +28 to +32 with respect to the TSS (+1). Consensus sequence (A|G)G(A|T)(C|T)(G|A|C). Required for TFIID binding to TATA-less promoters." [PMID:12651739] relationship: part_of SO:0000170 ! RNApol_II_promoter [Term] id: SO:0000016 name: BRE_motif def: "A sequence element characteristic of some RNA polymerase II promoters, located immediately upstream of some TATA box elements at -37 to -32 with respect to the TSS (+1). Consensus sequence is (G|C)(G|C)(G|A)CGCC. Binds TFIIB." [PMID:12651739] relationship: part_of SO:0000170 ! RNApol_II_promoter [Term] id: SO:0000017 name: PSE_motif def: "A sequence element characteristic of the promoters of snRNA genes transcribed by RNA polymerase II or by RNA polymerase III. Located between -45 and -60 relative to the TSS. The human PSE_motif consensus sequence is TCACCNTNA(C|G)TNAAAAG(T|G)." [PMID:12651739] relationship: part_of SO:0000170 ! RNApol_II_promoter [Term] id: SO:0000018 name: linkage_group def: "A group of loci that can be grouped in a linear order representing the different degrees of linkage among the genes concerned." [ISBN:038752046] is_a: SO:0000001 ! region [Term] id: SO:0000019 name: RNA_hairpin_loop is_a: SO:0000715 ! 
RNA_motif [Term] id: SO:0000020 name: RNA_internal_loop is_a: SO:0000715 ! RNA_motif [Term] id: SO:0000021 name: asymmetric_RNA_internal_loop is_a: SO:0000020 ! RNA_internal_loop [Term] id: SO:0000022 name: A_minor_RNA_motif is_a: SO:0000021 ! asymmetric_RNA_internal_loop [Term] id: SO:0000023 name: K_turn_RNA_motif is_a: SO:0000021 ! asymmetric_RNA_internal_loop [Term] id: SO:0000024 name: Sarcin_like_RNA_motif is_a: SO:0000021 ! asymmetric_RNA_internal_loop [Term] id: SO:0000025 name: symmetric_RNA_internal_loop is_a: SO:0000020 ! RNA_internal_loop [Term] id: SO:0000026 name: RNA_junction_loop is_a: SO:0000715 ! RNA_motif [Term] id: SO:0000027 name: RNA_hook_turn is_a: SO:0000026 ! RNA_junction_loop [Term] id: SO:0000028 name: base_pair is_a: SO:0000002 ! sequence_secondary_structure [Term] id: SO:0000029 name: WC_base_pair def: "The canonical base pair, where two bases interact via WC edges, with glycosidic bonds oriented cis relative to the axis of orientation." [PMID:12177293] synonym: "Watson_Crick_based_pair" RELATED [] is_a: SO:0000028 ! base_pair [Term] id: SO:0000030 name: sugar_edge_base_pair def: "A type of non-canonical base-pairing." [PMID:12177293] is_a: SO:0000028 ! base_pair [Term] id: SO:0000031 name: aptamer def: "DNA or RNA molecules that have been selected from random pools based on their ability to bind other molecules." [http:http://aptamer.icmb.utexas.edu] is_a: SO:0000351 ! synthetic_sequence is_a: SO:0000696 ! oligo [Term] id: SO:0000032 name: DNA_aptamer def: "DNA molecules that have been selected from random pools based on their ability to bind other molecules." [http:aptamer.icmb.utexas.edu] is_a: SO:0000031 ! aptamer [Term] id: SO:0000033 name: RNA_aptamer def: "RNA molecules that have been selected from random pools based on their ability to bind other molecules." [http:aptamer.icmb.utexas.edu] is_a: SO:0000031 ! 
aptamer [Term] id: SO:0000034 name: morpholino def: "Morpholino oligos are synthesized from four different Morpholino subunits, each of which contains one of the four genetic bases (A, C, G, T) linked to a 6-membered morpholine ring. Eighteen to 25 subunits of these four subunit types are joined in a specific order by non-ionic phosphorodiamidate intersubunit linkages to give a Morpholino." [http:www.gene-tools.com/Morpholinos/morpholinos.HTML] is_a: SO:0000351 ! synthetic_sequence is_a: SO:0000696 ! oligo [Term] id: SO:0000035 name: riboswitch def: "Riboswitches are mRNAs that can act as direct sensors of small molecules to control their own expression. A riboswitch contains a cis element within mRNA, that can act as a direct sensor of metabolites without a protein intermediate." [PMID:2820954] is_a: SO:0000234 ! mRNA [Term] id: SO:0000036 name: matrix_attachment_site def: "A DNA region that is required for the binding of chromatin to the nuclear matrix." [SO:ma] synonym: "MAR" RELATED [] synonym: "SMAR" RELATED [] synonym: "scaffold_attachment_site" RELATED [] is_a: SO:0000626 ! chromosomal_regulatory_element [Term] id: SO:0000037 name: locus_control_region def: "A DNA region that includes DNAse hypersensitive sites located 5' to a gene that confers the high-level, position-independent, and copy number-dependent expression to that gene." [SO:ma] is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000038 name: match_set def: "A collection of match parts" [SO:ke] subset: SOFA is_a: SO:0000343 ! match [Term] id: SO:0000039 name: match_part def: "A part of a match, for example an hsp from blast isa match_part." [SO:ke] subset: SOFA is_a: SO:0000343 ! match relationship: part_of SO:0000038 ! match_set [Term] id: SO:0000040 name: genomic_clone def: "A clone of a DNA region of a genome." [SO:ma] is_a: SO:0000151 ! clone [Term] id: SO:0000041 name: variation_operation def: "An operation that can be applied to a sequence, that results in a chnage." 
[SO:ke] is_a: SO:0000000 ! Sequence_Ontology [Term] id: SO:0000042 name: pseudogene_attribute def: "An attribute of a pseudogene (SO:0000336)." [SO:ma] is_a: SO:0000009 ! gene_class [Term] id: SO:0000043 name: processed_pseudogene synonym: "pseudogene_by_reverse_transcription" RELATED [] is_a: SO:0000042 ! pseudogene_attribute [Term] id: SO:0000044 name: pseudogene_by_unequal_crossing_over is_a: SO:0000042 ! pseudogene_attribute [Term] id: SO:0000045 name: delete def: "To remove a subsection of sequence." [SO:ke] is_a: SO:0000041 ! variation_operation [Term] id: SO:0000046 name: insert def: "To insert a subsection of sequence." [SO:ke] is_a: SO:0000041 ! variation_operation [Term] id: SO:0000047 name: invert def: "To invert a subsection of sequence." [SO:ke] is_a: SO:0000041 ! variation_operation [Term] id: SO:0000048 name: substitute def: "To substitute a subsection of sequence for another." [SO:ke] is_a: SO:0000041 ! variation_operation [Term] id: SO:0000049 name: translocate def: "To translocate a subsection of sequence." [SO:ke] is_a: SO:0000041 ! variation_operation [Term] id: SO:0000050 name: gene_part def: "A part of a gene, that has no other route in the ontology back to region. This concept is necessary for logical inference as these parts must have the properties of region. It also allows us to associate all the parts of genes with a gene." [SO:ke] subset: SOFA is_obsolete: true [Term] id: SO:0000051 name: probe def: "A DNA sequence used experimentally to detect the presence or absence of a complementary nucleic acid." [SO:ma] is_a: SO:0000695 ! reagent [Term] id: SO:0000052 name: assortment_derived_deficiency synonym: "assortment-derived_deficiency" RELATED [] is_obsolete: true [Term] id: SO:0000053 name: mutation_affecting_regulatory_region is_a: SO:1000132 ! consequences_of_mutation [Term] id: SO:0000054 name: aneuploid is_a: SO:1000182 ! chromosome_number_variation [Term] id: SO:0000055 name: hyperploid is_a: SO:0000054 ! 
aneuploid [Term] id: SO:0000056 name: hypoploid is_a: SO:0000054 ! aneuploid [Term] id: SO:0000057 name: operator def: "A regulatory element of an operon to which activators or repressors bind hereby effecting translation of genes in that operon." [SO:ma] subset: SOFA is_a: SO:0000752 ! gene_group_regulatory_region [Term] id: SO:0000058 name: assortment_derived_aneuploid synonym: "assortment-derived_aneuploid" RELATED [] is_obsolete: true [Term] id: SO:0000059 name: nuclease_binding_site is_a: SO:0000410 ! protein_binding_site [Term] id: SO:0000060 name: compound_chromosome_arm is_a: SO:1000042 ! compound_chromosome [Term] id: SO:0000061 name: restriction_enzyme_binding_site is_a: SO:0000059 ! nuclease_binding_site [Term] id: SO:0000062 name: deficient_intrachromosomal_transposition is_a: SO:1000041 ! intrachromosomal_transposition [Term] id: SO:0000063 name: deficient_interchromosomal_transposition is_a: SO:1000155 ! interchromosomal_transposition [Term] id: SO:0000064 name: gene_by_transcript_attribute comment: This classes of attributes was added by MA to allow the broad description of genes based on qualities of the transcript(s). A product of SO meeting 2004. is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000065 name: free_chromosome_arm is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:0000066 name: gene_by_polyadenylation_attribute is_a: SO:0000064 ! gene_by_transcript_attribute [Term] id: SO:0000067 name: gene_to_gene_feature is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000068 name: overlapping_gene is_a: SO:0000067 ! gene_to_gene_feature [Term] id: SO:0000069 name: gene_included_within_intron is_a: SO:0000068 ! overlapping_gene [Term] id: SO:0000070 name: gene_included_within_intron_antiparallel is_a: SO:0000069 ! gene_included_within_intron [Term] id: SO:0000071 name: gene_included_within_intron_parallel is_a: SO:0000069 ! gene_included_within_intron [Term] id: SO:0000072 name: end_overlapping_gene is_a: SO:0000068 ! 
overlapping_gene [Term] id: SO:0000073 name: end_overlapping_gene_five_primethree_prime_overlap is_a: SO:0000072 ! end_overlapping_gene [Term] id: SO:0000074 name: end_overlapping_gene_five_primefive_prime_overlap is_a: SO:0000072 ! end_overlapping_gene [Term] id: SO:0000075 name: end_overlapping_gene_three_primethree_prime_overlap is_a: SO:0000072 ! end_overlapping_gene [Term] id: SO:0000076 name: end_overlapping_gene_three_primefive_prime_overlap is_a: SO:0000072 ! end_overlapping_gene [Term] id: SO:0000077 name: antisense_gene is_a: SO:0000068 ! overlapping_gene [Term] id: SO:0000078 name: polycistronic_transcript is_a: SO:0000115 ! transcript_feature [Term] id: SO:0000079 name: dicistronic_transcript is_a: SO:0000078 ! polycistronic_transcript [Term] id: SO:0000080 name: member_of_operon is_a: SO:0000081 ! member_gene_array [Term] id: SO:0000081 name: member_gene_array is_a: SO:0000067 ! gene_to_gene_feature [Term] id: SO:0000082 name: processed_transcript_attribute is_a: SO:0000237 ! transcript_attribute [Term] id: SO:0000083 name: macronuclear_sequence_feature is_a: SO:0000735 ! sequence_location [Term] id: SO:0000084 name: micronuclear_sequence_feature is_a: SO:0000735 ! sequence_location [Term] id: SO:0000085 name: gene_by_genome_location is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000086 name: gene_by_organelle_of_genome is_a: SO:0000085 ! gene_by_genome_location [Term] id: SO:0000087 name: nuclear_gene is_a: SO:0000086 ! gene_by_organelle_of_genome [Term] id: SO:0000088 name: mt_gene synonym: "mitochondrial_gene" RELATED [] is_a: SO:0000086 ! gene_by_organelle_of_genome [Term] id: SO:0000089 name: kinetoplast_gene is_a: SO:0000088 ! mt_gene [Term] id: SO:0000090 name: plastid_gene is_a: SO:0000086 ! gene_by_organelle_of_genome [Term] id: SO:0000091 name: apicoplast_gene is_a: SO:0000090 ! plastid_gene [Term] id: SO:0000092 name: ct_gene synonym: "chloroplast_gene" RELATED [] is_a: SO:0000090 ! 
plastid_gene [Term] id: SO:0000093 name: chromoplast_gene is_a: SO:0000090 ! plastid_gene [Term] id: SO:0000094 name: cyanelle_gene is_a: SO:0000090 ! plastid_gene [Term] id: SO:0000095 name: leucoplast_gene is_a: SO:0000090 ! plastid_gene [Term] id: SO:0000096 name: proplastid_gene is_a: SO:0000090 ! plastid_gene [Term] id: SO:0000097 name: nucleomorph_gene is_a: SO:0000086 ! gene_by_organelle_of_genome [Term] id: SO:0000098 name: plasmid_gene is_a: SO:0000085 ! gene_by_genome_location [Term] id: SO:0000099 name: proviral_gene is_a: SO:0000085 ! gene_by_genome_location [Term] id: SO:0000100 name: endogenous_retroviral_gene is_a: SO:0000099 ! proviral_gene [Term] id: SO:0000101 name: transposable_element def: "A transposon or insertion sequence. An element that can insert in a variety of DNA sequences." [http://www.sci.sdsu.edu/ ~ smaloy/Glossary/T.html] subset: SOFA is_a: SO:0000187 ! repeat_family is_a: SO:1000028 ! intrachromosomal_mutation [Term] id: SO:0000102 name: expressed_sequence_match def: "A match to an EST or cDNA sequence." [SO:ke] subset: SOFA is_a: SO:0000347 ! nucleotide_match [Term] id: SO:0000103 name: clone_insert_end def: "The end of the clone insert." [SO:ke] subset: SOFA is_a: SO:0000699 ! junction relationship: part_of SO:0000753 ! clone_insert [Term] id: SO:0000104 name: polypeptide def: "A sequence of amino acids linked by peptide bonds which may lack appreciable tertiary structure and may not be liable to irreversable denaturation." [SO:ma] subset: SOFA relationship: derives_from SO:0000316 ! CDS [Term] id: SO:0000105 name: chromosome_arm def: "A region of the chromosome between the centromere and the telomere. Human chromosomes have two arms, the p arm (short) and the q arm (long) which are separated from each other by the centromere." [http://www.exactsciences.com/cic/glossary/_index.htm] relationship: part_of SO:0000340 ! chromosome [Term] id: SO:0000106 name: non_capped_primary_transcript is_a: SO:0000146 ! 
primary_transcript_by_cap_class [Term] id: SO:0000107 name: sequencing_primer is_a: SO:0000112 ! primer [Term] id: SO:0000108 name: mRNA_with_frameshift is_a: SO:0000082 ! processed_transcript_attribute [Term] id: SO:0000109 name: sequence_variant def: "A region of sequence where variation has been observed." [SO:ke] subset: SOFA synonym: "mutation" RELATED [] is_a: SO:0000110 ! located_sequence_feature [Term] id: SO:0000110 name: located_sequence_feature def: "A biological feature that can be attributed to a region of biological sequence." [SO:ke] subset: SOFA is_a: SO:0000000 ! Sequence_Ontology [Term] id: SO:0000111 name: transposable_element_gene def: "A gene encoded within a transposable element. For example gag, int, env and pol are the transposable element genes of the TY element in yeast." [SO:ke] is_a: SO:0000009 ! gene_class [Term] id: SO:0000112 name: primer def: "A short preexisting polynucleotide chain to which new deoxyribonucleotides can be added by DNA polymerase." [http://www.ornl.gov/TechResources/Human_Genome/publicat/primer2001/glossary.html] subset: SOFA is_a: SO:0000696 ! oligo [Term] id: SO:0000113 name: integrated_virus def: "A viral sequence which has integrated into the host genome." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000114 name: methylated_C def: "A methylated deoxy-cytosine." [SO:ke] subset: SOFA is_a: SO:0000306 ! methylated_base_feature [Term] id: SO:0000115 name: transcript_feature is_a: SO:0000237 ! transcript_attribute [Term] id: SO:0000116 name: edited_transcript def: "A gene whose transcript is edited." [http://www.rna.ucla.edu/] is_a: SO:0000115 ! transcript_feature [Term] id: SO:0000117 name: transcript_with_readthrough_stop_codon is_obsolete: true [Term] id: SO:0000118 name: transcript_with_translational_frameshift is_a: SO:1001261 ! recoded_mRNA [Term] id: SO:0000119 name: gene_by_class_of_regulation is_a: SO:0000401 ! 
gene_attribute [Term] id: SO:0000120 name: protein_coding_primary_transcript def: "A primary transcript that, at least in part, encodes one or more proteins." [SO:ke] comment: May contain introns subset: SOFA synonym: "pre-mRNA" RELATED [] is_a: SO:0000185 ! primary_transcript [Term] id: SO:0000121 name: forward_primer def: "A single stranded oligo used for polymerase chain reaction." [http://mged.sourceforge.net/ontologies/MGEDontology.php] is_a: SO:0000112 ! primer [Term] id: SO:0000122 name: RNA_sequence_secondary_structure def: "A folded RNA sequence." [SO:ke] is_a: SO:0000002 ! sequence_secondary_structure [Term] id: SO:0000123 name: transcriptionally_regulated def: "." [SO:ma] comment: by\: is_a: SO:0000119 ! gene_by_class_of_regulation [Term] id: SO:0000124 name: transcriptionally_constitutive is_a: SO:0000123 ! transcriptionally_regulated [Term] id: SO:0000125 name: transcriptionally_induced is_a: SO:0000123 ! transcriptionally_regulated [Term] id: SO:0000126 name: transcriptionally_repressed is_a: SO:0000123 ! transcriptionally_regulated [Term] id: SO:0000127 name: silenced_gene is_a: SO:0000126 ! transcriptionally_repressed [Term] id: SO:0000128 name: gene_silenced_by_DNA_modification is_a: SO:0000127 ! silenced_gene [Term] id: SO:0000129 name: gene_silenced_by_DNA_methylation is_a: SO:0000128 ! gene_silenced_by_DNA_modification [Term] id: SO:0000130 name: post_translationally_regulated synonym: "post-translationally_regulated" RELATED [] is_a: SO:0000119 ! gene_by_class_of_regulation [Term] id: SO:0000131 name: translationally_regulated is_a: SO:0000119 ! gene_by_class_of_regulation [Term] id: SO:0000132 name: reverse_primer def: "A single stranded oligo used for polymerase chain reaction." [http://mged.sourceforge.net/ontologies/MGEDontology.php] is_a: SO:0000112 ! primer [Term] id: SO:0000133 name: gene_by_epigenetic_modification is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000134 name: imprinted is_a: SO:0000119 ! 
gene_by_class_of_regulation is_a: SO:0000133 ! gene_by_epigenetic_modification [Term] id: SO:0000135 name: maternally_imprinted is_a: SO:0000134 ! imprinted [Term] id: SO:0000136 name: paternally_imprinted is_a: SO:0000134 ! imprinted [Term] id: SO:0000137 name: allelically_excluded is_a: SO:0000133 ! gene_by_epigenetic_modification [Term] id: SO:0000138 name: gene_rearranged_at_DNA_level is_a: SO:0000133 ! gene_by_epigenetic_modification [Term] id: SO:0000139 name: ribosome_entry_site def: "Region in mRNA where ribosome assembles." [SO:ke] comment: gene\: subset: SOFA relationship: part_of SO:0000203 ! UTR [Term] id: SO:0000140 name: attenuator def: "A sequence segment located between the promoter and a structural gene that causes partial termination of transcription." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA is_a: SO:0000752 ! gene_group_regulatory_region is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000141 name: terminator def: "The sequence of DNA located either at the end of the transcript that causes RNA polymerase to terminate transcription." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA is_a: SO:0000752 ! gene_group_regulatory_region is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000142 name: DNA_sequence_secondary_structure def: "A folded DNA sequence." [SO:ke] is_a: SO:0000002 ! sequence_secondary_structure [Term] id: SO:0000143 name: assembly_component def: "A region of sequence which may be used to manufacture a longer assembled, sequence." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000144 name: primary_transcript_attribute is_a: SO:0000237 ! transcript_attribute [Term] id: SO:0000145 name: recoded_codon is_a: SO:0000360 ! codon [Term] id: SO:0000146 name: primary_transcript_by_cap_class is_a: SO:0000144 ! 
primary_transcript_attribute [Term] id: SO:0000147 name: exon def: "A region of the genome that codes for portion of spliced messenger RNA (SO:0000234); may contain 5'-untranslated region (SO:0000204), all open reading frames (SO:0000236) and 3'-untranslated region (SO:0000205)." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA relationship: part_of SO:0000673 ! transcript [Term] id: SO:0000148 name: supercontig def: "One or more contigs that have been ordered and oriented using end-read information. Contains gaps that are filled with N's." [SO:ls] subset: SOFA synonym: "scaffold" RELATED [] is_a: SO:0000353 ! assembly relationship: part_of SO:0000719 ! ultracontig [Term] id: SO:0000149 name: contig def: "A contiguous sequence derived from sequence assembly. Has no gaps, but may contain N's from unavailable bases." [SO:ls] subset: SOFA is_a: SO:0000143 ! assembly_component is_a: SO:0000353 ! assembly relationship: part_of SO:0000148 ! supercontig [Term] id: SO:0000150 name: read def: "A sequence obtained from a single sequencing experiment. Typically a read is produced when a base calling program interprets information from a chromatogram trace file produced from a sequencing machine." [SO:rd] subset: SOFA is_a: SO:0000143 ! assembly_component relationship: part_of SO:0000149 ! contig [Term] id: SO:0000151 name: clone def: "A piece of DNA that has been inserted in a vector so that it can be propagated in E. coli or some other organism." [http://www.geospiza.com/community/support/glossary/] subset: SOFA is_a: SO:0000695 ! reagent [Term] id: SO:0000152 name: YAC def: "Yeast Artificial Chromosome, a vector constructed from the telomeric, centromeric, and replication origin sequences needed for replication in yeast cells." [SO:ma] is_a: SO:0000440 ! vector relationship: part_of SO:0000760 ! 
YAC_clone [Term] id: SO:0000153 name: BAC def: "Bacterial Artificial Chromosome, a cloning vector that can be propagated as mini-chromosomes in a bacterial host." [SO:ma] is_a: SO:0000440 ! vector relationship: part_of SO:0000764 ! BAC_clone [Term] id: SO:0000154 name: PAC def: "P1 Artificial Chromosome. These vectors can hold large inserts, typically 80-200 kb, and propagate in E. coli as a single copy episome." [https://www.ncbi.nlm.nih.gov/genome/guide/mouse/glossary.htm] synonym: "P1" RELATED [] is_a: SO:0000440 ! vector relationship: part_of SO:0000762 ! PAC_clone [Term] id: SO:0000155 name: plasmid def: "A self-replicating circular DNA molecule that is distinct from a chromosome in the organism." [SO:ma] is_a: SO:0000695 ! reagent [Term] id: SO:0000156 name: cosmid def: "A cloning vector that is a hybrid of lambda phages and a plasmid that can be propagated as aplasmids or packaged as a phage,since they retain the lambda cos sites." [SO:ma] comment: vans GA et al. High efficiency vectors for cosmid microcloning and genomic analysis. Gene 1989; 79(1)\:9-20. is_a: SO:0000440 ! vector relationship: part_of SO:0000765 ! cosmid_clone [Term] id: SO:0000157 name: phagemid def: "A plasmid which carries within its sequence a bacteriophage replication origin. When the host bacterium is infected with \"helper\" phage, a phagemid is replicated along with the phage DNA and packaged into phage capsids." [SO:ma] is_a: SO:0000440 ! vector relationship: part_of SO:0000761 ! phagemid_clone [Term] id: SO:0000158 name: fosmid def: "A cloning vector that utilises the E. coli F factor." [SO:ma] comment: Birren BW et al. A human chromosome 22 fosmid resource\: mapping and analysis of 96 clones. Genomics 1996; is_a: SO:0000440 ! vector relationship: part_of SO:0000763 ! fosmid_clone [Term] id: SO:0000159 name: deletion def: "The sequence that is deleted." [SO:ke] subset: SOFA is_a: SO:0000001 ! region is_a: SO:0000109 ! sequence_variant relationship: sequence_of SO:0000045 ! 
delete [Term] id: SO:0000160 name: lambda_clone def: "A linear clone derived from lambda bacteriophage. The genes involved in the lysogenic pathway are removed from the viral DNA. Up to 25 kb of foreign DNA can then be inserted into the lambda genome." [ISBN:0-1767-2380-8] is_a: SO:0000151 ! clone [Term] id: SO:0000161 name: methylated_A def: "A methylated adenine." [SO:ke] subset: SOFA is_a: SO:0000250 ! modified_RNA_base_feature is_a: SO:0000306 ! methylated_base_feature [Term] id: SO:0000162 name: splice_site def: "The position where intron is excised." [SO:ke] subset: SOFA is_a: SO:0000699 ! junction relationship: part_of SO:0000185 ! primary_transcript [Term] id: SO:0000163 name: splice_donor_site def: "The junction between the 3 prime end of an exon and the following intron." [http://www.ucl.ac.uk/ ~ ucbhjow/b241/glossary.html] subset: SOFA synonym: "donor" RELATED [] synonym: "donor_splice_site" RELATED [] is_a: SO:0000162 ! splice_site [Term] id: SO:0000164 name: splice_acceptor_site def: "The junction between the 3 prime end of an intron and the following exon." [http://www.ucl.ac.uk/ ~ ucbhjow/b241/glossary.html] subset: SOFA synonym: "acceptor" RELATED [] synonym: "acceptor_splice_site" RELATED [] is_a: SO:0000162 ! splice_site [Term] id: SO:0000165 name: enhancer def: "A cis-acting sequence that increases the utilization of (some) eukaryotic promoters, and can function in either orientation and in any location (upstream or downstream) relative to the promoter." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000166 name: enhancer_by_bound_factor is_a: SO:0000402 ! enhancer_attribute [Term] id: SO:0000167 name: promoter def: "The region on a DNA molecule involved in RNA polymerase binding to initiate transcription." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA is_a: SO:0005836 ! 
regulatory_region [Term] id: SO:0000168 name: restriction_enzyme_cut_site def: "A specific nucleotide sequence of DNA at or near which a particular restriction enzyme cuts the DNA." [SO:ma] is_obsolete: true [Term] id: SO:0000169 name: RNApol_I_promoter def: "A DNA sequence sequence in eukaryotic DNA to which RNA polymerase I binds, to begin transcription." [SO:ke] synonym: "RNA_polymerase_A_promoter" RELATED [] is_a: SO:0000167 ! promoter [Term] id: SO:0000170 name: RNApol_II_promoter def: "A DNA sequence in eukaryotic DNA to which RNA polymerase II binds, to begin transcription." [SO:ke] synonym: "RNA_polymerase_B_promoter" RELATED [] is_a: SO:0000167 ! promoter [Term] id: SO:0000171 name: RNApol_III_promoter def: "A DNA sequence in eukaryotic DNA to which RNA polymerase III binds, to begin transcription." [SO:ke] synonym: "RNA_polymerase_C_promoter" RELATED [] is_a: SO:0000167 ! promoter [Term] id: SO:0000172 name: CAAT_signal def: "Part of a conserved sequence located about 75-bp upstream of the start point of eukaryotic transcription units which may be involved in RNA polymerase binding; consensus=GG(C|T)CAATCT." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] relationship: part_of SO:0000170 ! RNApol_II_promoter [Term] id: SO:0000173 name: GC_rich_region def: "A conserved GC-rich region located upstream of the start point of eukaryotic transcription units which may occur in multiple copies or in either orientation; consensus=GGGCGG." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] synonym: "GC-rich_region" RELATED [] relationship: part_of SO:0000170 ! RNApol_II_promoter [Term] id: SO:0000174 name: TATA_box def: "A conserved AT-rich septamer found about 25-bp before the start point of many eukaryotic RNA polymerase II transcript units; may be involved in positioning the enzyme for correct initiation; consensus=TATA(A|T)A(A|T)." 
[http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] relationship: part_of SO:0000170 ! RNApol_II_promoter relationship: part_of SO:0000171 ! RNApol_III_promoter [Term] id: SO:0000175 name: minus_10_signal def: "A conserved region about 10-bp upstream of the start point of bacterial transcription units which may be involved in binding RNA polymerase; consensus=TAtAaT." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] synonym: "-10_signal" RELATED [] synonym: "Pribnow_box" RELATED [] relationship: part_of SO:0000613 ! bacterial_RNApol_promoter [Term] id: SO:0000176 name: minus_35_signal def: "A conserved hexamer about 35-bp upstream of the start point of bacterial transcription units; consensus=TTGACa or TGTTGACA." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] synonym: "-35_signal" RELATED [] relationship: part_of SO:0000613 ! bacterial_RNApol_promoter [Term] id: SO:0000177 name: cross_genome_match def: "A nucleotide match against a sequence from another organism." [SO:ma] subset: SOFA is_a: SO:0000347 ! nucleotide_match [Term] id: SO:0000178 name: operon def: "A group of contiguous genes transcribed as a single (polycistronic) mRNA from a single regulatory region." [SO:ma] subset: SOFA is_a: SO:0005855 ! gene_group [Term] id: SO:0000179 name: clone_insert_start def: "The start of the clone insert." [SO:ke] subset: SOFA is_a: SO:0000699 ! junction relationship: part_of SO:0000753 ! clone_insert [Term] id: SO:0000180 name: retrotransposon def: "A transposable element that is incorporated into a chromosome by a mechanism that requires reverse transcriptase." [http://www.genpromag.com/scripts/glossary.asp?LETTER=R] is_a: SO:0000101 ! transposable_element [Term] id: SO:0000181 name: translated_nucleotide_match def: "A match against a translated sequence." [SO:ke] subset: SOFA is_a: SO:0000347 ! 
nucleotide_match [Term] id: SO:0000182 name: DNA_transposon def: "A transposon where the mechanism of transposition is via a DNA intermediate." [SO:ke] is_a: SO:0000101 ! transposable_element [Term] id: SO:0000183 name: non_transcribed_region def: "A region of the gene which is not transcribed." [SO:ke] subset: SOFA is_a: SO:0000001 ! region relationship: part_of SO:0000704 ! gene [Term] id: SO:0000184 name: U2_intron def: "A major type of spliceosomal intron spliced by the U2 spliceosome, that includes U1, U2, U4/U6 and U5 snRNAs." [PMID:9428511] comment: May have either GT-AG or AT-AG 5' and 3' boundaries. is_a: SO:0000662 ! spliceosomal_intron [Term] id: SO:0000185 name: primary_transcript def: "The primary (initial, unprocessed) transcript; includes five_prime_clip (SO:0000555), five_prime_untranslated_region (SO:0000204), open reading frames (SO:0000236), introns (SO:0000188) and three_prime_ untranslated_region (three_prime_UTR), and three_prime_clip (SO:0000557)." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA synonym: "precursor_RNA" RELATED [] is_a: SO:0000673 ! transcript [Term] id: SO:0000186 name: LTR_retrotransposon def: "A retrotransposon flanked by long terminal repeat sequences." [SO:ke] is_a: SO:0000180 ! retrotransposon [Term] id: SO:0000187 name: repeat_family def: "A group of characterized repeat sequences." [SO:ke] subset: SOFA is_a: SO:0000657 ! repeat_region [Term] id: SO:0000188 name: intron def: "A segment of DNA that is transcribed, but removed from within the transcript by splicing together the sequences (exons) on either side of it." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA relationship: part_of SO:0000185 ! primary_transcript [Term] id: SO:0000189 name: non_LTR_retrotransposon def: "A retrotransposon without long terminal repeat sequences." [SO:ke] is_a: SO:0000180 ! 
retrotransposon [Term] id: SO:0000190 name: five_prime_intron is_a: SO:0000188 ! intron [Term] id: SO:0000191 name: interior_intron is_a: SO:0000188 ! intron [Term] id: SO:0000192 name: three_prime_intron is_a: SO:0000188 ! intron [Term] id: SO:0000193 name: RFLP_fragment def: "A polymorphism detectable by the size differences in DNA fragments generated by a restriction enzyme." [PMID:6247908] subset: SOFA synonym: "restriction_fragment_length_polymorphism" RELATED [] is_a: SO:0000412 ! restriction_fragment [Term] id: SO:0000194 name: LINE_element def: "A dispersed repeat family with many copies, each from 1 to 6 kb long. New elements are generated by retroposition of a transcribed copy. Typically the LINE contains 2 ORF's one of which is reverse transcriptase, and 3'and 5' direct repeats." [http:www.ucl.ac.uk/~ucbhjow/b241/glossary.html] synonym: "Long interspersed element" RELATED [] synonym: "Long interspersed nuclear element" RELATED [] is_a: SO:0000189 ! non_LTR_retrotransposon [Term] id: SO:0000195 name: coding_exon def: "An exon whereby at least one base is part of a codon, including the stop_codon." [SO:ke] is_a: SO:0000147 ! exon [Term] id: SO:0000196 name: five_prime_exon_coding_region def: "The sequence of the 5' exon that encodes for protein." [SO:ke] is_a: SO:0000195 ! coding_exon relationship: part_of SO:0000200 ! five_prime_coding_exon [Term] id: SO:0000197 name: three_prime_exon_coding_region def: "The sequence of the 3' exon that encodes for protein." [SO:ke] is_a: SO:0000195 ! coding_exon relationship: part_of SO:0000202 ! three_prime_coding_exon [Term] id: SO:0000198 name: noncoding_exon def: "An exon that does not contain any codons." [SO:ke] synonym: "noncoding_exon" RELATED [] is_a: SO:0000147 ! exon [Term] id: SO:0000199 name: translocation def: "A region of nucleotide sequence that has translocated to a new position." [SO:ke] relationship: sequence_of SO:0000049 ! 
translocate [Term] id: SO:0000200 name: five_prime_coding_exon def: "The 5' most coding exon." [SO:ke] is_a: SO:0000147 ! exon [Term] id: SO:0000201 name: interior_exon is_a: SO:0000147 ! exon [Term] id: SO:0000202 name: three_prime_coding_exon def: "The exon that is most 3-prime on a given transcript." [SO:ma] is_a: SO:0000147 ! exon [Term] id: SO:0000203 name: UTR def: "Messenger RNA sequences that are untranslated and lie five prime and three prime to sequences which are translated." [SO:ke] subset: SOFA synonym: "untranslated_region" RELATED [] relationship: part_of SO:0000234 ! mRNA [Term] id: SO:0000204 name: five_prime_UTR def: "A region at the 5' end of a mature transcript (preceding the initiation codon) that is not translated into a protein." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA synonym: "five_prime_untranslated_region" RELATED [] is_a: SO:0000203 ! UTR [Term] id: SO:0000205 name: three_prime_UTR def: "A region at the 3' end of a mature transcript (following the stop codon) that is not translated into a protein." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA synonym: "three_prime_untranslated_region" RELATED [] is_a: SO:0000203 ! UTR [Term] id: SO:0000206 name: SINE_element def: "A repetitive element, a few hundred base pairs long, that is dispersed throughout the genome. A common human SINE is the Alu element." [SO:ke] synonym: "Short interspersed element" RELATED [] synonym: "Short interspersed nuclear element" RELATED [] is_a: SO:0000189 ! non_LTR_retrotransposon [Term] id: SO:0000207 name: simple_sequence_length_polymorphism is_a: SO:0000248 ! sequence_length_variation [Term] id: SO:0000208 name: terminal_inverted_repeat_element def: "A DNA transposable element defined as having termini with perfect, or nearly perfect short inverted repeats, generally 10 - 40 nucleotides long." 
[http:www.genetics.org/cgi/reprint/156/4/1983.pdf] is_a: SO:0000182 ! DNA_transposon [Term] id: SO:0000209 name: rRNA_primary_transcript def: "A primary transcript encoding a ribosomal RNA." [SO:ke] synonym: "ribosomal_RNA_primary_transcript" RELATED [] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000210 name: tRNA_primary_transcript def: "A primary transcript encoding a transfer RNA (SO:0000253)." [SO:ke] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000211 name: alanine_tRNA_primary_transcript is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000212 name: arginine_tRNA_primary_transcript def: "A primary transcript encoding arginyl tRNA (SO:0000255)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000213 name: asparagine_tRNA_primary_transcript def: "A primary transcript encoding asparaginyl tRNA (SO:0000256)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000214 name: aspartic_acid_tRNA_primary_transcript def: "A primary transcript encoding aspartyl tRNA (SO:0000257)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000215 name: cysteine_tRNA_primary_transcript def: "A primary transcript encoding cysteinyl tRNA (SO:0000258)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000216 name: glutamic_acid_tRNA_primary_transcript def: "A primary transcript encoding glutamyl tRNA (SO:0000260)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000217 name: glutamine_tRNA_primary_transcript def: "A primary transcript encoding glutaminyl tRNA (SO:0000259)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000218 name: glycine_tRNA_primary_transcript def: "A primary transcript encoding glycyl tRNA (SO:0000261)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000219 name: histidine_tRNA_primary_transcript def: "A primary transcript encoding histidyl tRNA (SO:0000262)." [SO:ke] is_a: SO:0000210 ! 
tRNA_primary_transcript [Term] id: SO:0000220 name: isoleucine_tRNA_primary_transcript def: "A primary transcript encoding isoleucyl tRNA (SO:0000263)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000221 name: leucine_tRNA_primary_transcript def: "A primary transcript encoding leucyl tRNA (SO:0000264)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000222 name: lysine_tRNA_primary_transcript def: "A primary transcript encoding lysyl tRNA (SO:0000265)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000223 name: methionine_tRNA_primary_transcript def: "A primary transcript encoding methionyl tRNA (SO:0000266)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000224 name: phenylalanine_tRNA_primary_transcript def: "A primary transcript encoding phenylalanyl tRNA (SO:0000267)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000225 name: proline_tRNA_primary_transcript def: "A primary transcript encoding prolyl tRNA (SO:0000268)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000226 name: serine_tRNA_primary_transcript def: "A primary transcript encoding seryl tRNA (SO:0000269)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000227 name: threonine_tRNA_primary_transcript def: "A primary transcript encoding threonyl tRNA (SO:0000270)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000228 name: tryptophan_tRNA_primary_transcript def: "A primary transcript encoding tryptophanyl tRNA (SO:0000271)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000229 name: tyrosine_tRNA_primary_transcript def: "A primary transcript encoding tyrosyl tRNA (SO:0000272)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0000230 name: valine_tRNA_primary_transcript def: "A primary transcript encoding valyl tRNA (SO:0000273)." [SO:ke] is_a: SO:0000210 ! 
tRNA_primary_transcript [Term] id: SO:0000231 name: snRNA_primary_transcript def: "A primary transcript encoding a small nuclear RNA (SO:0000274)." [SO:ke] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000232 name: snoRNA_primary_transcript def: "A primary transcript encoding a small nucleolar RNA (SO:0000275)." [SO:ke] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000233 name: processed_transcript def: "A transcript which has undergone processing to remove parts such as introns and transcribed_spacer_regions." [SO:ke] comment: A processed transcript cannot contain introns. subset: SOFA is_a: SO:0000673 ! transcript relationship: derives_from SO:0000185 ! primary_transcript [Term] id: SO:0000234 name: mRNA def: "Messenger RNA is the intermediate molecule between DNA and protein. It includes UTR and coding sequences. It does not contain introns." [SO:ma] comment: mRNA does not contain introns as it is a processed_transcript. The equivalent kind of primary_transcript is protein_coding_primary_transcript (SO:0000120) which may contain introns. subset: SOFA synonym: "messenger_RNA" RELATED [] is_a: SO:0000233 ! processed_transcript [Term] id: SO:0000235 name: TF_binding_site def: "A region of a molecule that binds to a transcription factor." [SO:ke] subset: SOFA synonym: "transcription_factor_binding_site" RELATED [] is_a: SO:0000410 ! protein_binding_site is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000236 name: ORF def: "The inframe interval between the stop codons of a reading frame which when read as sequential triplets, has the potential of encoding a sequential string of amino acids. TER(NNN)nTER" [SO:ma, SO:rb] comment: The definition was modified by Rama. This term is now basically the same as a CDS. This must be revised. subset: SOFA synonym: "open_reading_frame" RELATED [] is_a: SO:0000717 ! reading_frame [Term] id: SO:0000237 name: transcript_attribute is_a: SO:0000400 ! 
sequence_attribute [Term] id: SO:0000238 name: foldback_element def: "A transposable element with extensive secondary structure, characterised by large modular imperfect long inverted repeats" [http:www.genetics.org/cgi/reprint/156/4/1983.pdf] synonym: "LVR element" RELATED [] synonym: "long inverted repeat element" RELATED [] is_a: SO:0000182 ! DNA_transposon [Term] id: SO:0000239 name: flanking_region def: "The DNA sequences extending on either side of a specific locus." [http://biotech.icmb.utexas.edu/search/dict-search.mhtml] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000240 name: chromosome_variation is_a: SO:0000000 ! Sequence_Ontology [Term] id: SO:0000241 name: internal_UTR is_a: SO:0000203 ! UTR [Term] id: SO:0000242 name: untranslated_region_polyicistronic_mRNA def: "The untranslated sequence separating the 'cistrons' of multicistronic mRNA." [SO:ke] is_a: SO:0000203 ! UTR [Term] id: SO:0000243 name: internal_ribosome_entry_site def: "Sequence element that recruits a ribosomal subunit to internal mRNA for translation initiation." [SO:ke] synonym: "IRES" RELATED [] is_a: SO:0000139 ! ribosome_entry_site [Term] id: SO:0000244 name: four_cutter_restriction_site synonym: "4-cutter_restriction_site" RELATED [] synonym: "four-cutter_restriction_sit" RELATED [] is_obsolete: true [Term] id: SO:0000245 name: mRNA_by_polyadenylation_status is_a: SO:0000082 ! processed_transcript_attribute [Term] id: SO:0000246 name: mRNA_polyadenylated is_a: SO:0000245 ! mRNA_by_polyadenylation_status [Term] id: SO:0000247 name: mRNA_not_polyadenylated is_a: SO:0000245 ! mRNA_by_polyadenylation_status [Term] id: SO:0000248 name: sequence_length_variation is_a: SO:1000002 ! substitution [Term] id: SO:0000249 name: six_cutter_restriction_site synonym: "6-cutter_restriction_site" RELATED [] synonym: "six-cutter_restriction_site" RELATED [] is_obsolete: true [Term] id: SO:0000250 name: modified_RNA_base_feature def: "A post_transcriptionally modified base." 
[SO:ke] relationship: part_of SO:0000673 ! transcript [Term] id: SO:0000251 name: eight_cutter_restriction_site synonym: "8-cutter_restriction_site" RELATED [] synonym: "eight-cutter_restriction_site" RELATED [] is_obsolete: true [Term] id: SO:0000252 name: rRNA def: "RNA that comprises part of a ribosome, and that can provide both structural scaffolding and catalytic activity." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types, ISBN:0198506732] subset: SOFA synonym: "ribsomal_RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000253 name: tRNA def: "Transfer RNA (tRNA) molecules are approximately 80 nucleotides in length. Their secondary structure includes four short double-helical elements and three loops (D, anti-codon, and T loops). Further hydrogen bonds mediate the characteristic L-shaped molecular structure. tRNAs have two regions of fundamental functional importance: the anti-codon, which is responsible for specific mRNA codon recognition, and the 3' end, to which the tRNA's corresponding amino acid is attached (by aminoacyl-tRNA synthetases). tRNAs cope with the degeneracy of the genetic code in two manners: having more than one tRNA (with a specific anti-codon) for a particular amino acid; and 'wobble' base-pairing, i.e. permitting non-standard base-pairing at the 3rd anti-codon position." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00005, ISBN:0198506732] subset: SOFA synonym: "transfer_RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000254 name: alanyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000255 name: rRNA_small_subunit_primary_transcript def: "A primary transcript encoding a small ribosomal subunit RNA." [SO:ke] is_a: SO:0000209 ! rRNA_primary_transcript [Term] id: SO:0000256 name: asparaginyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000257 name: aspartyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000258 name: cysteinyl_tRNA is_a: SO:0000253 ! 
tRNA [Term] id: SO:0000259 name: glutaminyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000260 name: glutamyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000261 name: glycyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000262 name: histidyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000263 name: isoleucyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000264 name: leucyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000265 name: lysyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000266 name: methionyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000267 name: phenylalanyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000268 name: prolyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000269 name: seryl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000270 name: threonyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000271 name: tryptophanyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000272 name: tyrosyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000273 name: valyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000274 name: snRNA def: "Small non-coding RNA in the nucleoplasm. A small nuclear RNA molecule involved in pre-mRNA splicing and processing" [ems:WB, http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types, PMID:11733745] subset: SOFA synonym: "small_nuclear_RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000275 name: snoRNA def: "Small nucleolar RNAs (snoRNAs) are involved in the processing and modification of rRNA in the nucleolus. There are two main classes of snoRNAs: the box C/D class, and the box H/ACA class. U3 snoRNA is a member of the box C/D class. Indeed, the box C/D element is a subset of the six short sequence elements found in all U3 snoRNAs, namely boxes A, A', B, C, C', and D. The U3 snoRNA secondary structure is characterised by a small 5' domain (with boxes A and A'), and a larger 3' domain (with boxes B, C, C', and D), the two domains being linked by a single-stranded hinge. 
Boxes B and C form the B/C motif, which appears to be exclusive to U3 snoRNAs, and boxes C' and D form the C'/D motif. The latter is functionally similar to the C/D motifs found in other snoRNAs. The 5' domain and the hinge region act as a pre-rRNA-binding domain. The 3' domain has conserved protein-binding sites. Both the box B/C and box C'/D motifs are sufficient for nuclear retention of U3 snoRNA. The box C'/D motif is also necessary for nucleolar localization, stability and hypermethylation of U3 snoRNA. Both box B/C and C'/D motifs are involved in specific protein interactions and are necessary for the rRNA processing functions of U3 snoRNA." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00012] subset: SOFA synonym: "small_nucleolar_RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000276 name: miRNA def: "Small, ~22-nt, RNA molecule that is the endogenous transcript of a miRNA gene. miRNAs are produced from precursor molecules (SO:0000647) that can form local hairpin structures, which ordinarily are processed (via the Dicer pathway) such that a single miRNA molecule accumulates from one arm of a hairpin precursor molecule. miRNAs may trigger the cleavage of their target molecules or act as translational repressors." [PMID:12592000] subset: SOFA synonym: "micro_RNA" RELATED [] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000277 name: transcript_by_bound_factor is_a: SO:0000237 ! transcript_attribute [Term] id: SO:0000278 name: transcript_by_bound_nucleic_acid is_a: SO:0000277 ! transcript_by_bound_factor [Term] id: SO:0000279 name: transcript_by_bound_protein is_a: SO:0000277 ! transcript_by_bound_factor [Term] id: SO:0000280 name: engineered_gene is_a: SO:0000009 ! gene_class [Term] id: SO:0000281 name: engineered_foreign_gene is_a: SO:0000280 ! engineered_gene is_a: SO:0000285 ! foreign_gene [Term] id: SO:0000282 name: mRNA_with_minus_1_frameshift is_a: SO:0000108 ! 
mRNA_with_frameshift [Term] id: SO:0000283 name: engineered_foreign_transposable_element_gene is_a: SO:0000111 ! transposable_element_gene is_a: SO:0000280 ! engineered_gene [Term] id: SO:0000284 name: type_I_enzyme_restriction_site def: "The recognition site is bipartite and interrupted." [http://www.promega.com] is_obsolete: true [Term] id: SO:0000285 name: foreign_gene is_a: SO:0000452 ! transgene [Term] id: SO:0000286 name: long_terminal_repeat def: "A sequence directly repeated at both ends of a defined sequence, of the sort typically found in retroviruses." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] synonym: "LTR" RELATED [] synonym: "direct_terminal _repeat" RELATED [] is_a: SO:0000657 ! repeat_region relationship: part_of SO:0000186 ! LTR_retrotransposon [Term] id: SO:0000287 name: fusion_gene is_a: SO:0000009 ! gene_class [Term] id: SO:0000288 name: engineered_fusion_gene is_a: SO:0000280 ! engineered_gene is_a: SO:0000287 ! fusion_gene [Term] id: SO:0000289 name: microsatellite def: "A very short unit sequence of DNA (2 to 4 bp) that is repeated multiple times in tandem." [http://www.informatics.jax.org/silver/glossary.shtml] subset: SOFA is_a: SO:0000705 ! tandem_repeat [Term] id: SO:0000290 name: dinucleotide_repeat_microsatellite_feature is_a: SO:0000289 ! microsatellite [Term] id: SO:0000291 name: trinucleotide_repeat_microsatellite_feature is_a: SO:0000289 ! microsatellite [Term] id: SO:0000292 name: repetitive_element is_a: SO:0000400 ! sequence_attribute [Term] id: SO:0000293 name: engineered_foreign_repetitive_element is_a: SO:0000280 ! engineered_gene is_a: SO:0000292 ! repetitive_element [Term] id: SO:0000294 name: inverted_repeat def: "The sequence is complementarily repeated on the opposite strand. Example: GCTGA-----TCAGC." [SO:ke] subset: SOFA is_a: SO:0000657 ! 
repeat_region [Term] id: SO:0000295 name: U12_intron def: "A type of spliceosomal intron spliced by the U12 spliceosome, that includes U11, U12, U4atac/U6atac and U5 snRNAs." [PMID:9428511] comment: May have either GT-AC or AT-AC 5' and 3' boundaries. is_a: SO:0000662 ! spliceosomal_intron [Term] id: SO:0000296 name: origin_of_replication def: "The origin of replication; starting site for duplication of a nucleic acid molecule to give two identical copies." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000297 name: D_loop def: "Displacement loop; a region within mitochondrial DNA in which a short stretch of RNA is paired with one strand of DNA, displacing the original partner DNA strand in this region; also used to describe the displacement of a region of one strand of duplex DNA by a single stranded invader in the reaction catalyzed by RecA protein." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] is_a: SO:0000296 ! origin_of_replication [Term] id: SO:0000298 name: recombination_feature is_a: SO:0000001 ! region [Term] id: SO:0000299 name: specific_recombination_site is_a: SO:0000669 ! sequence_rearrangement_feature [Term] id: SO:0000300 name: recombination_feature_of_rearranged_gene is_a: SO:0000299 ! specific_recombination_site [Term] id: SO:0000301 name: recombination_feature_of_vertebrate_immune_system_gene is_a: SO:0000300 ! recombination_feature_of_rearranged_gene [Term] id: SO:0000302 name: J_gene_recombination_feature def: "Recombination signal including J-heptamer, J-spacer and J-nonamer in 5' of J-region of a J-gene or J-sequence." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#J-RS] synonym: "J-RS" RELATED [] is_a: SO:0000301 ! recombination_feature_of_vertebrate_immune_system_gene [Term] id: SO:0000303 name: clip def: "Part of the primary transcript that is clipped off during processing." 
[SO:ke] subset: SOFA relationship: part_of SO:0000185 ! primary_transcript [Term] id: SO:0000304 name: type_II_enzyme_restriction_site def: "The recognition site is either palindromic, partially palindromic or an interrupted palindrome. Cleavage occurs within the recognition site." [http://www.promega.com] is_obsolete: true [Term] id: SO:0000305 name: modified_base_site def: "A modified nucleotide, i.e. a nucleotide other than A, T, C, G or (in RNA) U." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] comment: modified base\: subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000306 name: methylated_base_feature def: "A nucleotide modified by methylation." [SO:ke] subset: SOFA is_a: SO:0000305 ! modified_base_site [Term] id: SO:0000307 name: CpG_island def: "Regions of a few hundred to a few thousand bases in vertebrate genomes that are relatively GC and CpG rich; they are typically unmethylated and often found near the 5' ends of genes." [SO:rd] subset: SOFA synonym: "CG_island" RELATED [] is_a: SO:0000001 ! region [Term] id: SO:0000308 name: sequence_feature_locating_method is_a: SO:0000400 ! sequence_attribute [Term] id: SO:0000309 name: computed_feature is_a: SO:0000308 ! sequence_feature_locating_method [Term] id: SO:0000310 name: predicted_ab_initio_computation is_a: SO:0000309 ! computed_feature [Term] id: SO:0000311 name: computed_feature_by_similarity def: "." [SO:ma] comment: similar to\: is_a: SO:0000309 ! computed_feature [Term] id: SO:0000312 name: experimentally_determined_feature is_a: SO:0000308 ! sequence_feature_locating_method [Term] id: SO:0000313 name: stem_loop def: "A double-helical region of nucleic acid formed by base-pairing between adjacent (inverted) complementary sequences." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] is_a: SO:0000002 ! 
sequence_secondary_structure [Term] id: SO:0000314 name: direct_repeat def: "A repeat where the same sequence is repeated in the same direction. Example: GCTGA-----GCTGA." [SO:ke] subset: SOFA is_a: SO:0000657 ! repeat_region [Term] id: SO:0000315 name: transcription_start_site def: "The site where transcription begins." [SO:ke] subset: SOFA synonym: "TSS" RELATED [] is_a: SO:0000699 ! junction relationship: part_of SO:0000185 ! primary_transcript [Term] id: SO:0000316 name: CDS def: "A contiguous sequence which begins with, and includes, a start codon and ends with, and includes, a stop codon." [SO:ma] subset: SOFA synonym: "coding_sequence" RELATED [] relationship: part_of SO:0000234 ! mRNA [Term] id: SO:0000317 name: cDNA_clone def: "Complementary DNA; A piece of DNA copied from an mRNA and spliced into a vector for propagation in a suitable host." [http://seqcore.brcf.med.umich.edu/doc/educ/dnapr/mbglossary/mbgloss.html] is_a: SO:0000151 ! clone [Term] id: SO:0000318 name: start_codon def: "First codon to be translated by a ribosome." [SO:ke] synonym: "initiation codon" RELATED [] is_a: SO:0000360 ! codon [Term] id: SO:0000319 name: stop_codon def: "In mRNA, a set of three nucleotides that indicates the end of information for protein synthesis." [SO:ke] is_a: SO:0000360 ! codon [Term] id: SO:0000320 name: intronic_splice_enhancer def: "Sequences within the intron that modulate splice site selection for some introns." [SO:ke] is_a: SO:0000344 ! splice_enhancer relationship: part_of SO:0000662 ! spliceosomal_intron [Term] id: SO:0000321 name: mRNA_with_plus_1_frameshift is_a: SO:0000108 ! mRNA_with_frameshift [Term] id: SO:0000322 name: nuclease_hypersensitive_site is_a: SO:0000684 ! nuclease_sensitive_site [Term] id: SO:0000323 name: coding_start def: "The first base to be translated into protein." [SO:ke] synonym: "translation_start" RELATED [] relationship: part_of SO:0000316 ! 
CDS [Term] id: SO:0000324 name: tag def: "A nucleotide sequence that may be used to identify a larger sequence." [SO:ke] subset: SOFA is_a: SO:0000695 ! reagent [Term] id: SO:0000325 name: rRNA_large_subunit_primary_transcript def: "A primary transcript encoding a large ribosomal subunit RNA." [SO:ke] is_a: SO:0000209 ! rRNA_primary_transcript [Term] id: SO:0000326 name: SAGE_tag def: "A short diagnostic sequence tag, serial analysis of gene expression (SAGE), that allows the quantitative and simultaneous analysis of a large number of transcripts." [https://www.ncbi.nlm.nih.gov/pubmed/7570003] subset: SOFA is_a: SO:0000324 ! tag [Term] id: SO:0000327 name: coding_end def: "The last base to be translated into protein. It does not include the stop codon." [SO:ke] synonym: "translation_end" RELATED [] relationship: part_of SO:0000316 ! CDS [Term] id: SO:0000328 name: microarray_oligo synonym: "microarray_oligonucleotide" RELATED [] is_a: SO:0000051 ! probe is_a: SO:0000324 ! tag is_a: SO:0000696 ! oligo [Term] id: SO:0000329 name: mRNA_with_plus_2_frameshift is_a: SO:0000108 ! mRNA_with_frameshift [Term] id: SO:0000330 name: conserved_region def: "Region of sequence similarity by descent from a common ancestor." [SO:ke] is_a: SO:0000001 ! region [Term] id: SO:0000331 name: STS def: "Short (typically a few hundred base pairs) DNA sequence that has a single occurrence in a genome and whose location and base sequence are known." [http://www.biospace.com] subset: SOFA synonym: "sequence_tag_site" RELATED [] is_a: SO:0000324 ! tag [Term] id: SO:0000332 name: coding_conserved_region def: "Coding region of sequence similarity by descent from a common ancestor." [SO:ke] is_a: SO:0000330 ! conserved_region [Term] id: SO:0000333 name: exon_junction def: "The boundary between two exons in a processed transcript." [SO:ke] subset: SOFA is_a: SO:0000699 ! junction relationship: part_of SO:0000233 ! 
processed_transcript [Term] id: SO:0000334 name: nc_conserved_region def: "Non-coding region of sequence similarity by descent from a common ancestor." [SO:ke] synonym: "noncoding_conserved_region" RELATED [] is_a: SO:0000330 ! conserved_region [Term] id: SO:0000335 name: mRNA_with_minus_2_frameshift is_a: SO:0000108 ! mRNA_with_frameshift [Term] id: SO:0000336 name: pseudogene def: "A sequence that closely resembles a known functional gene, at another locus within a genome, that is non-functional as a consequence of (usually several) mutations that prevent either its transcription or translation (or both). In general, pseudogenes result from either reverse transcription of a transcript of their \"normal\" paralog (SO:0000043) (in which case the pseudogene typically lacks introns and includes a poly(A) tail) or from recombination (SO:0000044) (in which case the pseudogene is typically a tandem duplication of its \"normal\" paralog)." [http://www.ucl.ac.uk/ ~ ucbhjow/b241/glossary.html] subset: SOFA is_a: SO:0000462 ! pseudogenic_region relationship: non_functional_homolog_of SO:0000704 ! gene [Term] id: SO:0000337 name: RNAi_reagent def: "A double stranded RNA duplex, at least 20bp long, used experimentally to inhibit gene function by RNA interference." [SO:rd] subset: SOFA is_a: SO:0000695 ! reagent is_a: SO:0000696 ! oligo [Term] id: SO:0000338 name: MITE def: "A highly repetitive and short (100-500 base pair) transposable element with terminal inverted repeats (TIR) and target site duplication (TSD). MITES do not encode proteins." [http:www.pnas.org/cgi/content/full/97/18/10083] is_a: SO:0000208 ! terminal_inverted_repeat_element [Term] id: SO:0000339 name: recombination_hotspot def: "A region in a genome which promotes recombination." [SO:rd] is_a: SO:0000298 ! recombination_feature [Term] id: SO:0000340 name: chromosome def: "Structural unit composed of long DNA molecule." 
[http://biotech.icmb.utexas.edu/search/dict-search.mhtml] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000341 name: chromosome_band def: "A cytologically distinguishable feature of a chromosome, often made visible by staining, and usually alternating light and dark." [SO:ma] synonym: "cytological_band" RELATED [] relationship: part_of SO:0000340 ! chromosome [Term] id: SO:0000342 name: site_specific_recombination_target_region is_a: SO:0000299 ! specific_recombination_site [Term] id: SO:0000343 name: match def: "A region of sequence, aligned to another sequence with some statistical significance, using an algorithm such as BLAST or SIM4." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000344 name: splice_enhancer def: "Region of a transcript that regulates splicing." [SO:ke] subset: SOFA is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000345 name: EST def: "Expressed Sequence Tag: The sequence of a single sequencing read from a cDNA clone or PCR product; typically a few hundred base pairs long." [http://genomics.phrma.org/lexicon/e.html] subset: SOFA synonym: "expressed_sequence_tag" RELATED [] is_a: SO:0000695 ! reagent relationship: derives_from SO:0000234 ! mRNA [Term] id: SO:0000346 name: Cre_recombination_target_region synonym: "lox_site" RELATED [] is_a: SO:0000342 ! site_specific_recombination_target_region [Term] id: SO:0000347 name: nucleotide_match def: "A match against a nucleotide sequence." [SO:ke] subset: SOFA is_a: SO:0000343 ! match [Term] id: SO:0000348 name: nucleic_acid is_a: SO:0000443 ! polymer_type [Term] id: SO:0000349 name: protein_match def: "A match against a protein sequence." [SO:ke] subset: SOFA is_a: SO:0000343 ! match [Term] id: SO:0000350 name: FLP_recombination_target_region synonym: "FRT_site" RELATED [] is_a: SO:0000342 ! 
site_specific_recombination_target_region [Term] id: SO:0000351 name: synthetic_sequence def: "A sequence of nucleotides or amino acids that has been designed by an experimentor and which may, or may not, correspond with any natural sequence." [SO:ma] is_a: SO:0000443 ! polymer_type [Term] id: SO:0000352 name: DNA is_a: SO:0000348 ! nucleic_acid [Term] id: SO:0000353 name: assembly def: "A sequence of nucleotides that has been algorithmically derived from an alignment of two or more different sequences." [SO:ma] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000354 name: group_1_intron_homing_endonuclease_target_region is_a: SO:0000684 ! nuclease_sensitive_site [Term] id: SO:0000355 name: haplotype_block def: "A region of the genome which in which markers are co-inherited as the result of the lack of historic recombination between them due to their close proximity." [SO:ma] is_a: SO:0000298 ! recombination_feature [Term] id: SO:0000356 name: RNA is_a: SO:0000348 ! nucleic_acid [Term] id: SO:0000357 name: sequence_by_flanking_target_attribute is_a: SO:0000400 ! sequence_attribute [Term] id: SO:0000358 name: protein def: "One or more polypeptides which may, or may not, be covalently bonded, and which assume a native secondary and tertiary structure." [SO:ma] comment: This definition no longer matches the meaning of the concept! Term should probably be proteinacious or something... KEn is_a: SO:0000443 ! polymer_type [Term] id: SO:0000359 name: floxed_sequence is_a: SO:0000357 ! sequence_by_flanking_target_attribute is_a: SO:0000452 ! transgene [Term] id: SO:0000360 name: codon def: "A set of (usually) three nucleotide bases in a DNA or RNA sequence, which together signify a unique amino acid or the termination of translation." [http://genomics.phrma.org/lexicon/c.html] subset: SOFA relationship: part_of SO:0000234 ! mRNA [Term] id: SO:0000361 name: FRT_flanked_sequence is_a: SO:0000357 ! 
sequence_by_flanking_target_attribute [Term] id: SO:0000362 name: chimeric_cDNA_clone def: "A cDNA clone constructed from more than one mRNA. Usually an experimental artifact." [SO:ma] is_a: SO:0000317 ! cDNA_clone [Term] id: SO:0000363 name: floxed_gene is_a: SO:0000359 ! floxed_sequence [Term] id: SO:0000364 name: transposable_element_flanking_region def: "The region of sequence surrounding a transposable element." [SO:ke] is_a: SO:0000239 ! flanking_region [Term] id: SO:0000365 name: integron def: "DNA elements capable of mobilizing individual gene cassettes into bacterial chromosomes by site-specific recombination." [http://www.genomicglossaries.com/content/DNA.asp] is_a: SO:0000669 ! sequence_rearrangement_feature [Term] id: SO:0000366 name: insertion_site def: "The junction where an insertion occurred." [SO:ke] subset: SOFA is_a: SO:0000109 ! sequence_variant is_a: SO:0000699 ! junction relationship: position_of SO:0000046 ! insert [Term] id: SO:0000367 name: attI_site relationship: part_of SO:0000365 ! integron [Term] id: SO:0000368 name: transposable_element_insertion_site def: "The junction in a genome where a transposable_element has inserted." [SO:ke] subset: SOFA is_a: SO:0000366 ! insertion_site [Term] id: SO:0000369 name: integrase_coding_region relationship: part_of SO:0000365 ! integron [Term] id: SO:0000370 name: small_regulatory_ncRNA def: "A non-coding RNA, usually with a specific secondary structure, that acts to regulate gene expression." [SO:ma] subset: SOFA is_a: SO:0000655 ! ncRNA [Term] id: SO:0000371 name: conjugative_transposon def: "A transposon that encodes functions required for conjugation." [http://www.sci.sdsu.edu/ ~ smaloy/Glossary/C.html] is_a: SO:0000182 ! DNA_transposon [Term] id: SO:0000372 name: enzymatic_RNA def: "A non-coding RNA, usually with a specific secondary structure, that acts to regulate gene expression." [SO:ma] subset: SOFA is_a: SO:0000655 !
ncRNA [Term] id: SO:0000373 name: recombinationally_inverted is_a: SO:0000456 ! recombinationally_rearranged_gene [Term] id: SO:0000374 name: ribozyme def: "An RNA with catalytic activity." [SO:ma] subset: SOFA is_a: SO:0000372 ! enzymatic_RNA [Term] id: SO:0000375 name: rRNA_5.8S def: "5.8S ribosomal RNA (5.8S rRNA) is a component of the large subunit of the eukaryotic ribosome. It is transcribed by RNA polymerase I as part of the 45S precursor that also contains 18S and 28S rRNA. Functionally, it is thought that 5.8S rRNA may be involved in ribosome translocation. It is also known to form covalent linkage to the p53 tumour suppressor protein. 5.8S rRNA is also found in archaea." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00002] subset: SOFA synonym: "5.8S_rRNA" RELATED [] is_a: SO:0000252 ! rRNA [Term] id: SO:0000376 name: RNA_6S def: "A small (184-nt in E. coli) RNA that forms a hairpin type structure. 6S RNA associates with RNA polymerase in a highly specific manner. 6S RNA represses expression from a sigma70-dependent promoter during stationary phase." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00013] synonym: "6S_RNA" RELATED [] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000377 name: CsrB_RsmB_RNA def: "An enterobacterial RNA that binds the CsrA protein. The CsrB RNAs contain a conserved motif CAGGXXG that is found in up to 18 copies and has been suggested to bind CsrA. The Csr regulatory system has a strong negative regulatory effect on glycogen biosynthesis, glyconeogenesis and glycogen catabolism and a positive regulatory effect on glycolysis. In other bacteria such as Erwinia caratovara the RsmA protein has been shown to regulate the production of virulence determinants, such extracellular enzymes. RsmA binds to RsmB regulatory RNA which is also a member of this family." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00018] synonym: "CsrB-RsmB_RNA" RELATED [] is_a: SO:0000370 ! 
small_regulatory_ncRNA [Term] id: SO:0000378 name: DsrA_RNA def: "DsrA RNA regulates both transcription, by overcoming transcriptional silencing by the nucleoid-associated H-NS protein, and translation, by promoting efficient translation of the stress sigma factor, RpoS. These two activities of DsrA can be separated by mutation: the first of three stem-loops of the 85 nucleotide RNA is necessary for RpoS translation but not for anti-H-NS action, while the second stem-loop is essential for antisilencing and less critical for RpoS translation. The third stem-loop, which behaves as a transcription terminator, can be substituted by the trp transcription terminator without loss of either DsrA function. The sequence of the first stem-loop of DsrA is complementary with the upstream leader portion of RpoS messenger RNA, suggesting that pairing of DsrA with the RpoS message might be important for translational regulation." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00014] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000379 name: GcvB_RNA def: "A small untranslated RNA involved in expression of the dipeptide and oligopeptide transport systems in Escherichia coli." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00022] is_a: SO:0000378 ! DsrA_RNA [Term] id: SO:0000380 name: hammerhead_ribozyme def: "A small catalytic RNA motif that catalyzes self-cleavage reaction. Its name comes from its secondary structure which resembles a carpenter's hammer. The hammerhead ribozyme is involved in the replication of some viroid and some satellite RNAs." [http:rnaworld.bio.ukans.edu/class/RNA/RNA00/RNA_World_3.html] subset: SOFA is_a: SO:0000374 ! ribozyme [Term] id: SO:0000381 name: group_IIA_intron is_a: SO:0000603 ! group_II_intron [Term] id: SO:0000382 name: group_IIB_intron is_a: SO:0000603 ! 
group_II_intron [Term] id: SO:0000383 name: MicF_RNA def: "A non-translated 93 nt antisense RNA that binds its target ompF mRNA and regulates ompF expression by inhibiting translation and inducing degradation of the message." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00033] is_a: SO:0000644 ! antisense_RNA [Term] id: SO:0000384 name: OxyS_RNA def: "A small untranslated RNA which is induced in response to oxidative stress in Escherichia coli. Acts as a global regulator to activate or repress the expression of as many as 40 genes, including the fhlA-encoded transcriptional activator and the rpoS-encoded sigma(s) subunit of RNA polymerase. OxyS is bound by the Hfq protein, that increases the OxyS RNA interaction with its target messages." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00035] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000385 name: RNase_MRP_RNA def: "The RNA molecule essential for the catalytic activity of RNase MRP, an enzymatically active ribonucleoprotein with two distinct roles in eukaryotes. In mitochondria it plays a direct role in the initiation of mitochondrial DNA replication. In the nucleus it is involved in precursor rRNA processing, where it cleaves the internal transcribed spacer 1 between 18S and 5.8S rRNAs." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00030] subset: SOFA is_a: SO:0000372 ! enzymatic_RNA [Term] id: SO:0000386 name: RNase_P_RNA def: "The RNA component of Ribonuclease P (RNase P), a ubiquitous endoribonuclease, found in archaea, bacteria and eukarya as well as chloroplasts and mitochondria. Its best characterised activity is the generation of mature 5 prime ends of tRNAs by cleaving the 5 prime leader elements of precursor-tRNAs. Cellular RNase Ps are ribonucleoproteins. RNA from bacterial RNase Ps retains its catalytic activity in the absence of the protein subunit, i.e. it is a ribozyme. 
Isolated eukaryotic and archaeal RNase P RNA has not been shown to retain its catalytic function, but is still essential for the catalytic activity of the holoenzyme. Although the archaeal and eukaryotic holoenzymes have a much greater protein content than the bacterial ones, the RNA cores from all the three lineages are homologous. Helices corresponding to P1, P2, P3, P4, and P10/11 are common to all cellular RNase P RNAs. Yet, there is considerable sequence variation, particularly among the eukaryotic RNAs." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00010] subset: SOFA is_a: SO:0000374 ! ribozyme [Term] id: SO:0000387 name: RprA_RNA def: "Translational regulation of the stationary phase sigma factor RpoS is mediated by the formation of a double-stranded RNA stem-loop structure in the upstream region of the rpoS messenger RNA, occluding the translation initiation site. Clones carrying rprA (RpoS regulator RNA) increased the translation of RpoS. The rprA gene encodes a 106 nucleotide regulatory RNA. As with DsrA Rfam:RF00014, RprA is predicted to form three stem-loops. Thus, at least two small RNAs, DsrA and RprA, participate in the positive regulation of RpoS translation. Unlike DsrA, RprA does not have an extensive region of complementarity to the RpoS leader, leaving its mechanism of action unclear. RprA is non-essential." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00034] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000388 name: RRE_RNA def: "The Rev response element (RRE) is encoded within the HIV-env gene. Rev is an essential regulatory protein of HIV that binds an internal loop of the RRE leading, encouraging further Rev-RRE binding. This RNP complex is critical for mRNA export and hence for expression of the HIV structural proteins." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00036] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000389 name: spot_42_RNA def: "A 109-nucleotide RNA of E. 
coli that seems to have a regulatory role on the galactose operon. Changes in Spot 42 levels are implicated in affecting DNA polymerase I levels." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00021] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000390 name: telomerase_RNA def: "The RNA component of telomerase, a reverse transcriptase that synthesises telomeric DNA." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00025] subset: SOFA is_a: SO:0000372 ! enzymatic_RNA [Term] id: SO:0000391 name: U1_snRNA def: "U1 is a small nuclear RNA (snRNA) component of the spliceosome (involved in pre-mRNA splicing). Its 5' end forms complementary base pairs with the 5' splice junction, thus defining the 5' donor site of an intron. There are significant differences in sequence and secondary structure between metazoan and yeast U1 snRNAs, the latter being much longer (568 nucleotides as compared to 164 nucleotides in human). Nevertheless, secondary structure predictions suggest that all U1 snRNAs share a 'common core' consisting of helices I, II, the proximal region of III, and IV." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00003] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000392 name: U2_snRNA def: "U2 is a small nuclear RNA (snRNA) component of the spliceosome (involved in pre-mRNA splicing). Complementary binding between U2 snRNA (in an area lying towards the 5' end but 3' to hairpin I) and the branchpoint sequence (BPS) of the intron results in the bulging out of an unpaired adenine, on the BPS, which initiates a nucleophilic attack at the intronic 5' splice site, thus starting the first of two transesterification reactions that mediate splicing." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00004] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000393 name: U4_snRNA def: "U4 small nuclear RNA (U4 snRNA) is a component of the major U2-dependent spliceosome. 
It forms a duplex with U6, and with each splicing round, it is displaced from U6 (and the spliceosome) in an ATP-dependent manner, allowing U6 to refold and create the active site for splicing catalysis. A recycling process involving protein Prp24 re-anneals U4 and U6." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00015] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000394 name: U4atac_snRNA def: "An snRNA required for the splicing of the minor U12-dependent class of eukaryotic nuclear introns. It forms a base paired complex with U6atac_snRNA (SO:0000397)." [PMID:=12409455] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000395 name: U5_snRNA def: "U5 RNA is a component of both types of known spliceosome. The precise function of this molecule is unknown, though it is known that the 5' loop is required for splice site selection and p220 binding, and that both the 3' stem-loop and the Sm site are important for Sm protein binding and cap methylation." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00020] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000396 name: U6_snRNA def: "U6 snRNA is a component of the spliceosome which is involved in splicing pre-mRNA. The putative secondary structure consensus base pairing is confined to a short 5' stem loop, but U6 snRNA is thought to form extensive base-pair interactions with U4 snRNA." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00015] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000397 name: U6atac_snRNA def: "U6atac_snRNA -An snRNA required for the splicing of the minor U12-dependent class of eukaryotic nuclear introns. It forms a base paired complex with U4atac_snRNA (SO:0000394)." [https://www.ncbi.nlm.nih.gov/pubmed/12409455] subset: SOFA is_a: SO:0000274 ! 
snRNA [Term] id: SO:0000398 name: U11_snRNA def: "U11 snRNA plays a role in splicing of the minor U12-dependent class of eukaryotic nuclear introns, similar to U1 snRNA in the major class spliceosome it base pairs to the conserved 5' splice site sequence." [PMID:9622129] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000399 name: U12_snRNA def: "The U12 small nuclear (snRNA), together with U4atac/U6atac, U5, and U11 snRNAs and associated proteins, forms a spliceosome that cleaves a divergent class of low-abundance pre-mRNA introns." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00007] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000400 name: sequence_attribute is_a: SO:0000000 ! Sequence_Ontology [Term] id: SO:0000401 name: gene_attribute is_a: SO:0000400 ! sequence_attribute [Term] id: SO:0000402 name: enhancer_attribute is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000403 name: U14_snRNA def: "U14 small nucleolar RNA (U14 snoRNA) is required for early cleavages of eukaryotic precursor rRNAs. In yeasts, this molecule possess a stem-loop region (known as the Y-domain) which is essential for function. A similar structure, but with a different consensus sequence, is found in plants, but is absent in vertebrates." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00016] subset: SOFA is_a: SO:0000274 ! snRNA [Term] id: SO:0000404 name: vault_RNA def: "A family of RNAs are found as part of the enigmatic vault ribonuceoprotein complex. The complex consists of a major vault protein (MVP), two minor vault proteins (VPARP and TEP1), and several small untranslated RNA molecules. It has been suggested that the vault complex is involved in drug resistance." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00006] subset: SOFA is_a: SO:0000655 ! ncRNA [Term] id: SO:0000405 name: Y_RNA def: "Y RNAs are components of the Ro ribonucleoprotein particle (Ro RNP), in association with Ro60 and La proteins. 
The Y RNAs and Ro60 and La proteins are well conserved, but the function of the Ro RNP is not known. In humans the RNA component can be one of four small RNAs: hY1, hY3, hY4 and hY5. These small RNAs are predicted to fold into a conserved secondary structure containing three stem structures. The largest of the four, hY1, contains an additional hairpin." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00019] subset: SOFA is_a: SO:0000655 ! ncRNA [Term] id: SO:0000406 name: twintron def: "An intron within an intron." [PMID:1899376] is_a: SO:0000188 ! intron [Term] id: SO:0000407 name: rRNA_18S def: "18S_rRNA -A large polynucleotide which functions as a part of the small subunit of the ribosome" [SO:ke] subset: SOFA synonym: "16S_rRNA" RELATED [] is_a: SO:0000252 ! rRNA [Term] id: SO:0000408 name: site def: "The interbase position where something (eg an aberration) occurred." [SO:ke] is_obsolete: true [Term] id: SO:0000409 name: binding_site def: "A region on the surface of a molecule that may interact with another molecule." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000410 name: protein_binding_site def: "A region of a molecule that binds to a protein." [SO:ke] is_a: SO:0000409 ! binding_site [Term] id: SO:0000411 name: rescue_fragment is_a: SO:0000695 ! reagent [Term] id: SO:0000412 name: restriction_fragment def: "Any of the individual polynucleotide sequences produced by digestion of DNA with a restriction endonuclease." [http://www.agron.missouri.edu/cgi-bin/sybgw_mdb/mdb3/Term/119] subset: SOFA is_a: SO:0000695 ! reagent [Term] id: SO:0000413 name: sequence_difference def: "A region where the sequences differs from that of a specified sequence." [SO:ke] subset: SOFA is_a: SO:0000700 ! remark [Term] id: SO:0000414 name: genomically_contaminated_cDNA_clone is_a: SO:0000317 ! cDNA_clone [Term] id: SO:0000415 name: genomic_polyA_primed_cDNA_clone is_a: SO:0000317 ! 
cDNA_clone [Term] id: SO:0000416 name: partially_unprocessed_cDNA_clone is_a: SO:0000317 ! cDNA_clone [Term] id: SO:0000417 name: polypeptide_domain def: "A region of a single polypeptide chain that folds into an independent unit and exhibits biological activity. A polypeptide chain may have multiple domains." [http:www.molbiol.bbsrc.ac.uk/new_protein/domains.html] relationship: part_of SO:0000104 ! polypeptide [Term] id: SO:0000418 name: signal_peptide def: "The sequence for an N-terminal domain of a secreted protein; this domain is involved in attaching nascent polypeptide to the membrane leader sequence." [http:www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html] subset: SOFA synonym: "signal peptide coding sequence" RELATED [] relationship: part_of SO:0000104 ! polypeptide [Term] id: SO:0000419 name: mature_peptide def: "The coding sequence for the mature or final peptide or protein product following post-translational modification." [http:www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html] subset: SOFA relationship: part_of SO:0000104 ! polypeptide [Term] id: SO:0000420 name: five_prime_terminal_inverted_repeat is_a: SO:0000481 ! terminal_inverted_repeat [Term] id: SO:0000421 name: three_prime_terminal_inverted_repeat is_a: SO:0000481 ! terminal_inverted_repeat [Term] id: SO:0000422 name: U5_LTR_region relationship: part_of SO:0000286 ! long_terminal_repeat [Term] id: SO:0000423 name: R_LTR_region relationship: part_of SO:0000286 ! long_terminal_repeat [Term] id: SO:0000424 name: U3_LTR_region relationship: part_of SO:0000286 ! long_terminal_repeat [Term] id: SO:0000425 name: five_prime_LTR is_a: SO:0000286 ! long_terminal_repeat [Term] id: SO:0000426 name: three_prime_LTR is_a: SO:0000286 ! long_terminal_repeat [Term] id: SO:0000427 name: R_five_prime_LTR_region is_a: SO:0000423 ! R_LTR_region relationship: part_of SO:0000425 ! five_prime_LTR [Term] id: SO:0000428 name: U5_five_prime_LTR_region is_a: SO:0000422 ! 
U5_LTR_region relationship: part_of SO:0000425 ! five_prime_LTR [Term] id: SO:0000429 name: U3_five_prime_LTR_region is_a: SO:0000424 ! U3_LTR_region relationship: part_of SO:0000425 ! five_prime_LTR [Term] id: SO:0000430 name: R_three_prime_LTR_region relationship: part_of SO:0000426 ! three_prime_LTR [Term] id: SO:0000431 name: U3_three_prime_LTR_region relationship: part_of SO:0000426 ! three_prime_LTR [Term] id: SO:0000432 name: U5_three_prime_LTR_region relationship: part_of SO:0000426 ! three_prime_LTR [Term] id: SO:0000433 name: non_LTR_retrotransposon_polymeric_tract def: "A polymeric tract, such as poly(dA), within a non_LTR_retrotransposon." [SO:ke] is_a: SO:0000657 ! repeat_region relationship: part_of SO:0000189 ! non_LTR_retrotransposon [Term] id: SO:0000434 name: transposable_element_target_site_duplication def: "A sequence of DNA that is duplicated when a transposable element inserts; usually found at each end the insertion." [http:www.koko.gov.my/CocoaBioTech/Glossaryt.html] is_a: SO:0000657 ! repeat_region [Term] id: SO:0000435 name: RR_tract def: "A polypurine tract within an LTR_retrotransposon." [SO:ke] synonym: "LTR_retrotransposon_poly_purine_tract" RELATED [] is_a: SO:0000186 ! LTR_retrotransposon [Term] id: SO:0000436 name: ARS def: "A sequence that can autonomously replicate, as a plasmid, when transformed into a bacterial host." [SO:ma] subset: SOFA synonym: "autonomously_replicating_sequence" RELATED [] is_a: SO:0000001 ! region [Term] id: SO:0000437 name: assortment_derived_duplication is_obsolete: true [Term] id: SO:0000438 name: gene_not_polyadenylated is_a: SO:0000066 ! gene_by_polyadenylation_attribute [Term] id: SO:0000439 name: inverted_ring_chromosome is_a: SO:1000030 ! chromosomal_inversion is_a: SO:1000045 ! ring_chromosome [Term] id: SO:0000440 name: vector def: "A DNA molecule that can be used to transfer DNA molecules between organisms." [SO:ma] is_a: SO:0000695 ! reagent relationship: part_of SO:0000151 ! 
clone [Term] id: SO:0000441 name: ss_oligo def: "A single stranded oligonucleotide." [SO:ke] synonym: "single stranded oligonucleotide.new synonym" RELATED [] synonym: "ss_oligonucleotide" RELATED [] is_a: SO:0000696 ! oligo [Term] id: SO:0000442 name: ds_oligo def: "A double stranded oligonucleotide." [SO:ke] synonym: "double stranded oligonucleotide" RELATED [] synonym: "ds_oligonucleotide" RELATED [] is_a: SO:0000696 ! oligo [Term] id: SO:0000443 name: polymer_type is_a: SO:0000400 ! sequence_attribute [Term] id: SO:0000444 name: three_prime_noncoding_exon def: "Non-coding exon in the 3' UTR." [SO:ke] is_a: SO:0000198 ! noncoding_exon [Term] id: SO:0000445 name: five_prime_noncoding_exon def: "Non-coding exon in the 5' UTR." [SO:ke] synonym: "five_prime_noncoding_exon" RELATED [] is_a: SO:0000198 ! noncoding_exon [Term] id: SO:0000446 name: UTR_intron def: "Intron located in the untranslated region." [SO:ke] is_a: SO:0000188 ! intron [Term] id: SO:0000447 name: five_prime_UTR_intron def: "An intron located in the 5' UTR." [SO:ke] is_a: SO:0000446 ! UTR_intron [Term] id: SO:0000448 name: three_prime_UTR_intron def: "An intron located in the 3' UTR." [SO:ke] is_a: SO:0000446 ! UTR_intron [Term] id: SO:0000449 name: random_sequence def: "A sequence of nucleotides or amino acids which, by design, has a \"random\" order of components, given a predetermined input frequency of these components." [SO:ma] is_a: SO:0000351 ! synthetic_sequence [Term] id: SO:0000450 name: interband def: "A light region between two darkly staining bands in a polytene chromosome." [SO:ma] is_a: SO:0000341 ! chromosome_band [Term] id: SO:0000451 name: gene_polyadenylated is_a: SO:0000066 ! gene_by_polyadenylation_attribute [Term] id: SO:0000452 name: transgene is_a: SO:0000009 ! gene_class [Term] id: SO:0000453 name: transposition is_a: SO:1000183 !
chromosome_structure_variation [Term] id: SO:0000454 name: rasiRNA def: "A small, 17-28-nt, small interfering RNA derived from transcripts of repetitive elements." [http://www.developmentalcell.com/content/article/abstract?uid=PIIS1534580703002284] subset: SOFA synonym: "repeat associated small interfering RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000455 name: gene_with_mRNA_with_frameshift is_a: SO:0000064 ! gene_by_transcript_attribute [Term] id: SO:0000456 name: recombinationally_rearranged_gene is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000457 name: interchromosomal_duplication is_a: SO:1000037 ! chromosomal_duplication [Term] id: SO:0000458 name: D_gene def: "Germline genomic DNA including D-region with 5' UTR and 3' UTR, also designated as D-segment." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-GENE] synonym: "D-GENE" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene relationship: part_of SO:0000504 ! D_DJ_C_cluster relationship: part_of SO:0000505 ! D_DJ_cluster relationship: part_of SO:0000506 ! D_DJ_J_C_cluster relationship: part_of SO:0000508 ! D_DJ_J_cluster relationship: part_of SO:0000509 ! D_J_C_cluster relationship: part_of SO:0000527 ! V_D_DJ_C_cluster relationship: part_of SO:0000528 ! V_D_DJ_cluster relationship: part_of SO:0000529 ! V_D_DJ_J_C_cluster relationship: part_of SO:0000530 ! V_D_DJ_J_cluster relationship: part_of SO:0000531 ! V_D_J_C_cluster relationship: part_of SO:0000532 ! V_D_J_cluster relationship: part_of SO:0000559 ! D_cluster relationship: part_of SO:0000560 ! D_J_cluster [Term] id: SO:0000459 name: gene_with_trans_spliced_transcript is_a: SO:0000064 ! gene_by_transcript_attribute [Term] id: SO:0000460 name: vertebrate_immunoglobulin_T_cell_receptor_gene synonym: "vertebrate_immunoglobulin/T-cell_receptor_gene" RELATED [] is_a: SO:0000456 !
recombinationally_rearranged_gene [Term] id: SO:0000461 name: inversion_derived_bipartite_deficiency def: "A chromosome generated by recombination between two inversions; has a deficiency at each end of the inversion." [FB:km] is_a: SO:1000029 ! chromosomal_deletion [Term] id: SO:0000462 name: pseudogenic_region def: "A non-functional descendent of a functional entitity." [SO:cjm] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000463 name: gene_with_alternately_spliced_transcript is_a: SO:0000064 ! gene_by_transcript_attribute [Term] id: SO:0000464 name: decayed_exon def: "A non-functional descendent of an exon." [SO:ke] subset: SOFA is_a: SO:0000462 ! pseudogenic_region relationship: non_functional_homolog_of SO:0000147 ! exon [Term] id: SO:0000465 name: inversion_derived_deficiency_plus_duplication def: "A chromosome generated by recombination between two inversions; there is a deficiency at one end of the inversion and a duplication at the other end of the inversion." [FB:km] is_a: SO:1000029 ! chromosomal_deletion is_a: SO:1000038 ! intrachromosomal_duplication [Term] id: SO:0000466 name: V_gene def: "Germline genomic DNA including L-part1, V-intron and V-exon, with the 5' UTR and 3' UTR." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-GENE] synonym: "V_GENE" RELATED [] synonym: "variable_gene" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene relationship: part_of SO:0000518 ! V_DJ_cluster relationship: part_of SO:0000519 ! V_DJ_J_cluster relationship: part_of SO:0000520 ! V_VDJ_C_cluster relationship: part_of SO:0000521 ! V_VDJ_cluster relationship: part_of SO:0000522 ! V_VDJ_J_cluster relationship: part_of SO:0000523 ! V_VJ_C_cluster relationship: part_of SO:0000524 ! V_VJ_cluster relationship: part_of SO:0000525 ! V_VJ_J_cluster relationship: part_of SO:0000526 ! V_cluster relationship: part_of SO:0000527 ! V_D_DJ_C_cluster relationship: part_of SO:0000528 ! V_D_DJ_cluster relationship: part_of SO:0000529 ! 
V_D_DJ_J_C_cluster relationship: part_of SO:0000530 ! V_D_DJ_J_cluster relationship: part_of SO:0000531 ! V_D_J_C_cluster relationship: part_of SO:0000532 ! V_D_J_cluster relationship: part_of SO:0000534 ! V_J_cluster relationship: part_of SO:0000535 ! V_J_C_cluster relationship: part_of SO:0000542 ! V_DJ_C_cluster relationship: part_of SO:0000564 ! V_DJ_J_C_cluster relationship: part_of SO:0000565 ! V_VDJ_J_C_cluster relationship: part_of SO:0000566 ! V_VJ_J_C_cluster [Term] id: SO:0000467 name: post_translationally_regulated_by_protein_stability synonym: "post-translationally_regulated_by_protein_stability" RELATED [] is_a: SO:0000130 ! post_translationally_regulated [Term] id: SO:0000468 name: golden_path_fragment def: "One of the pieces of sequence that make up a golden path." [SO:rd] subset: SOFA is_a: SO:0000143 ! assembly_component relationship: part_of SO:0000688 ! golden_path [Term] id: SO:0000469 name: post_translationally_regulated_by_protein_modification synonym: "post-translationally_regulated_by_protein_modification" RELATED [] is_a: SO:0000130 ! post_translationally_regulated [Term] id: SO:0000470 name: J_gene def: "Germline genomic DNA of an immunoglobulin/T-cell receptor gene including J-region with 5' UTR (SO:0000204) and 3' UTR (SO:0000205), also designated as J-segment." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#J-GENE] synonym: "J-GENE" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene relationship: part_of SO:0000485 ! DJ_J_cluster relationship: part_of SO:0000487 ! VDJ_J_C_cluster relationship: part_of SO:0000488 ! VDJ_J_cluster relationship: part_of SO:0000490 ! VJ_J_C_cluster relationship: part_of SO:0000491 ! VJ_J_cluster relationship: part_of SO:0000506 ! D_DJ_J_C_cluster relationship: part_of SO:0000508 ! D_DJ_J_cluster relationship: part_of SO:0000509 ! D_J_C_cluster relationship: part_of SO:0000511 ! J_C_cluster relationship: part_of SO:0000513 ! J_cluster relationship: part_of SO:0000519 ! 
V_DJ_J_cluster relationship: part_of SO:0000522 ! V_VDJ_J_cluster relationship: part_of SO:0000525 ! V_VJ_J_cluster relationship: part_of SO:0000529 ! V_D_DJ_J_C_cluster relationship: part_of SO:0000530 ! V_D_DJ_J_cluster relationship: part_of SO:0000531 ! V_D_J_C_cluster relationship: part_of SO:0000532 ! V_D_J_cluster relationship: part_of SO:0000534 ! V_J_cluster relationship: part_of SO:0000535 ! V_J_C_cluster relationship: part_of SO:0000540 ! DJ_J_C_cluster relationship: part_of SO:0000560 ! D_J_cluster relationship: part_of SO:0000564 ! V_DJ_J_C_cluster relationship: part_of SO:0000565 ! V_VDJ_J_C_cluster relationship: part_of SO:0000566 ! V_VJ_J_C_cluster [Term] id: SO:0000471 name: autoregulated is_a: SO:0000123 ! transcriptionally_regulated [Term] id: SO:0000472 name: tiling_path def: "A set of regions which overlap with minimal polymorphism to form a linear sequence." [CJM:SO] subset: SOFA is_a: SO:0000353 ! assembly [Term] id: SO:0000473 name: negatively_autoregulated is_a: SO:0000126 ! transcriptionally_repressed is_a: SO:0000471 ! autoregulated [Term] id: SO:0000474 name: tiling_path_fragment def: "A piece of sequence that makes up a tiling_path.SO:0000472." [SO:ke] subset: SOFA is_a: SO:0000143 ! assembly_component relationship: part_of SO:0000472 ! tiling_path [Term] id: SO:0000475 name: positively_autoregulated is_a: SO:0000125 ! transcriptionally_induced is_a: SO:0000471 ! autoregulated [Term] id: SO:0000476 name: contig_read def: "A DNA sequencer read which is part of a contig." [SO:ke] is_a: SO:0000150 ! read [Term] id: SO:0000477 name: polycistronic_gene is_a: SO:0000081 ! member_gene_array [Term] id: SO:0000478 name: C_gene def: "Genomic DNA of immunoglobulin/T-cell receptor gene including C-region (and introns if present) with 5' UTR (SO:0000204) and 3' UTR (SO:0000205)." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#C-GENE] synonym: "C_GENE" RELATED [] synonym: "constant_gene" RELATED [] is_a: SO:0000460 ! 
vertebrate_immunoglobulin_T_cell_receptor_gene relationship: part_of SO:0000487 ! VDJ_J_C_cluster relationship: part_of SO:0000489 ! VJ_C_cluster relationship: part_of SO:0000490 ! VJ_J_C_cluster relationship: part_of SO:0000504 ! D_DJ_C_cluster relationship: part_of SO:0000506 ! D_DJ_J_C_cluster relationship: part_of SO:0000509 ! D_J_C_cluster relationship: part_of SO:0000511 ! J_C_cluster relationship: part_of SO:0000520 ! V_VDJ_C_cluster relationship: part_of SO:0000523 ! V_VJ_C_cluster relationship: part_of SO:0000527 ! V_D_DJ_C_cluster relationship: part_of SO:0000529 ! V_D_DJ_J_C_cluster relationship: part_of SO:0000531 ! V_D_J_C_cluster relationship: part_of SO:0000535 ! V_J_C_cluster relationship: part_of SO:0000539 ! DJ_C_cluster relationship: part_of SO:0000540 ! DJ_J_C_cluster relationship: part_of SO:0000541 ! VDJ_C_cluster relationship: part_of SO:0000542 ! V_DJ_C_cluster relationship: part_of SO:0000558 ! C_cluster relationship: part_of SO:0000564 ! V_DJ_J_C_cluster relationship: part_of SO:0000565 ! V_VDJ_J_C_cluster relationship: part_of SO:0000566 ! V_VJ_J_C_cluster [Term] id: SO:0000479 name: trans_spliced_transcript synonym: "trans-spliced_transcript" RELATED [] is_a: SO:0000082 ! processed_transcript_attribute [Term] id: SO:0000480 name: tiling_path_clone def: "A clone which is part of a tiling path. A tiling path is a set of sequencing substrates, typically clones, which have been selected in order to efficiently cover a region of the genome in preparation for sequencing and assembly.A minimal_tiling path is a set of sequencing substrates, typically clones, which have been selected in order to efficiently cover a region of the genome in preparation for sequencing and assembly attempting to minimize the overlap between adjacent clones. (LS)" [SO:ke] is_a: SO:0000151 ! clone is_a: SO:0000474 ! 
tiling_path_fragment [Term] id: SO:0000481 name: terminal_inverted_repeat def: "An inverted repeat (SO:0000294) occurring at the termini of a DNA transposon." [SO:ke] synonym: "TIR" RELATED [] is_a: SO:0000657 ! repeat_region relationship: part_of SO:0000208 ! terminal_inverted_repeat_element [Term] id: SO:0000482 name: vertebrate_immunoglobulin_T_cell_receptor_gene_cluster synonym: "vertebrate_immunoglobulin/T-cell_receptor_gene-cluster" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene [Term] id: SO:0000483 name: nc_primary_transcript def: "A primary transcript that is never translated into a protein." [SO:ke] subset: SOFA synonym: "noncoding_primary_transcript" RELATED [] is_a: SO:0000185 ! primary_transcript [Term] id: SO:0000484 name: three_prime_exon_noncoding_region def: "The sequence of the 3' exon that is not coding." [SO:ke] synonym: "three_prime_exon_noncoding_region" RELATED [] relationship: part_of SO:0000202 ! three_prime_coding_exon [Term] id: SO:0000485 name: DJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one DJ-gene, and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(DJ)-J-CLUSTER] synonym: "(DJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000486 name: five_prime_exon_noncoding_region def: "The sequence of the 5' exon preceding the start codon." [SO:ke] synonym: "five_prime_exon_noncoding_region" RELATED [] relationship: part_of SO:0000200 ! five_prime_coding_exon [Term] id: SO:0000487 name: VDJ_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one VDJ-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(VDJ)-J-C-CLUSTER] synonym: "(VDJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 !
vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000488 name: VDJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one VDJ-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(VDJ)-J-CLUSTER] synonym: "(VDJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000489 name: VJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one VJ-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(VJ)-C-CLUSTER] synonym: "(VJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000490 name: VJ_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one VJ-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(VJ)-J-C-CLUSTER] synonym: "(VJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000491 name: VJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one VJ-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(VJ)-J-CLUSTER] synonym: "(VJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000492 name: D_gene_recombination_feature is_a: SO:0000301 ! recombination_feature_of_vertebrate_immune_system_gene [Term] id: SO:0000493 name: three_prime_D_heptamer def: "7 nucleotide recombination site like CACAGTG, part of a 3' D-recombination signal sequence of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#3'D-HEPTAMER] synonym: "3'D-HEPTAMER" RELATED [] is_a: SO:0000561 ! 
heptamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000570 ! three_prime_D_recombination_signal_sequence [Term] id: SO:0000494 name: three_prime_D_nonamer def: "A 9 nucleotide recombination site (e.g. ACAAAAACC), part of a 3' D-recombination signal sequence of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#3'D-NONAMER] synonym: "3'D-NOMAMER" RELATED [] is_a: SO:0000562 ! nonamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000570 ! three_prime_D_recombination_signal_sequence [Term] id: SO:0000495 name: three_prime_D_spacer def: "A 12 or 23 nucleotide spacer between the 3'D-HEPTAMER and 3'D-NONAMER of a 3'D-RS." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#3'D-SPACER] synonym: "3'D-SPACER" RELATED [] is_a: SO:0000563 ! spacer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000570 ! three_prime_D_recombination_signal_sequence [Term] id: SO:0000496 name: five_prime_D_heptamer def: "7 nucleotide recombination site (e.g. CACTGTG), part of a 5' D-recombination signal sequence (SO:0000556) of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#5'D-HEPTAMER] synonym: "5'D-HEPTAMER" RELATED [] is_a: SO:0000561 ! heptamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000556 ! five_prime_D_recombination_signal_sequence [Term] id: SO:0000497 name: five_prime_D_nonamer def: "9 nucleotide recombination site (e.g. GGTTTTTGT), part of a five_prime_D-recombination signal sequence (SO:0000556) of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#5'D-NONAMER] synonym: "5'D-NONAMER" RELATED [] is_a: SO:0000562 ! nonamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000556 ! 
five_prime_D_recombination_signal_sequence [Term] id: SO:0000498 name: five_prime_D_spacer def: "12 or 23 nucleotide spacer between the 5' D-heptamer (SO:0000496) and 5' D-nonamer (SO:0000497) of a 5' D-recombination signal sequence (SO:0000556) of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#5'D-SPACER] synonym: "5'-SPACER" RELATED [] synonym: "five_prime_D-spacer" RELATED [] is_a: SO:0000563 ! spacer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000556 ! five_prime_D_recombination_signal_sequence [Term] id: SO:0000499 name: virtual_sequence def: "A continuous piece of sequence similar to the 'virtual contig' concept of ensembl." [SO:ke] subset: SOFA is_a: SO:0000353 ! assembly [Term] id: SO:0000500 name: Hoogsteen_base_pair def: "A type of non-canonical base-pairing. This is less energetically favourable than Watson-Crick base pairing. Hoogsteen GC base pairs only have two hydrogen bonds." [PMID:12177293] is_a: SO:0000028 ! base_pair [Term] id: SO:0000501 name: reverse_Hoogsteen_base_pair def: "A type of non-canonical base-pairing." [SO:ke] is_a: SO:0000028 ! base_pair [Term] id: SO:0000502 name: transcribed_region def: "A region of sequence that is transcribed. This region may cover the transcript of a gene, it may encompass the sequence covered by all of the transcripts of an alternately spliced gene, or it may cover the region transcribed by a polycistronic transcript. A gene may have 1 or more transcribed regions and a transcribed_region may belong to one or more genes." [SO:ke] comment: This concept came about as a direct result of the SO meeting August 2004. The exact nature of the relationship between transcribed_region and gene is still up for discussion. We are going with 'associated_with' for the time being.
subset: SOFA is_obsolete: true [Term] id: SO:0000503 name: alternately_spliced_gene_encodeing_one_transcript is_obsolete: true [Term] id: SO:0000504 name: D_DJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one D-gene, one DJ-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-(DJ)-C-CLUSTER] synonym: "D-(DJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000505 name: D_DJ_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one D-gene and one DJ-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-(DJ)-CLUSTER] synonym: "D-(DJ)-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000506 name: D_DJ_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one D-gene, one DJ-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-(DJ)-J-C-CLUSTER] synonym: "D-(DJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000507 name: pseudogenic_exon is_a: SO:0000462 ! pseudogenic_region relationship: non_functional_homolog_of SO:0000147 ! exon relationship: part_of SO:0000516 ! pseudogenic_transcript [Term] id: SO:0000508 name: D_DJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one D-gene, one DJ-gene, and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-(DJ)-J-CLUSTER] synonym: "D-(DJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000509 name: D_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including at least one D-gene, one J-gene and one C-gene." 
[http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-J-C-CLUSTER] synonym: "D-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000510 name: VD_gene def: "Genomic DNA of immunoglobulin/T-cell receptor gene in partially rearranged genomic DNA including L-part1, V-intron and V-D-exon, with the 5' UTR (SO:0000204) and 3' UTR (SO:0000205)." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-GENE] synonym: "V_D_GENE" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene [Term] id: SO:0000511 name: J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including at least one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#J-C-CLUSTER] synonym: "J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000512 name: inversion_derived_deficiency_plus_aneuploid def: "A chromosome generated by recombination between two inversions; has a deficiency at one end and presumed to have a deficiency or duplication at the other end of the inversion." [FB:km] is_a: SO:1000029 ! chromosomal_deletion [Term] id: SO:0000513 name: J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including more than one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#J-CLUSTER] synonym: "J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000514 name: J_nonamer def: "9 nucleotide recombination site (e.g. GGTTTTTGT), part of a J-gene recombination feature of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#J-NONAMER] synonym: "J-NONAMER" RELATED [] is_a: SO:0000562 ! nonamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000302 ! 
J_gene_recombination_feature [Term] id: SO:0000515 name: J_heptamer def: "7 nucleotide recombination site (e.g. CACAGTG), part of a J-gene recombination feature of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#J-HEPTAMER] synonym: "J-HEPTAMER" RELATED [] is_a: SO:0000561 ! heptamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000302 ! J_gene_recombination_feature [Term] id: SO:0000516 name: pseudogenic_transcript is_a: SO:0000462 ! pseudogenic_region relationship: non_functional_homolog_of SO:0000673 ! transcript relationship: part_of SO:0000336 ! pseudogene [Term] id: SO:0000517 name: J_spacer def: "12 or 23 nucleotide spacer between the J-nonamer and the J-heptamer of a J-gene recombination feature of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#J-SPACER] synonym: "J-SPACER" RELATED [] is_a: SO:0000563 ! spacer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000302 ! J_gene_recombination_feature [Term] id: SO:0000518 name: V_DJ_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene and one DJ-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(DJ)-CLUSTER] synonym: "V-(DJ)-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000519 name: V_DJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one DJ-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(DJ)-J-CLUSTER] synonym: "V-(DJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000520 name: V_VDJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one VDJ-gene and one C-gene." 
[http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VDJ)-C-CLUSTER] synonym: "V-(VDJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000521 name: V_VDJ_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene and one VDJ-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VDJ)-CLUSTER] synonym: "V-(VDJ)-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000522 name: V_VDJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one VDJ-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VDJ)-J-CLUSTER] synonym: "V-(VDJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000523 name: V_VJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one VJ-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VJ)-C-CLUSTER] synonym: "V-(VJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000524 name: V_VJ_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene and one VJ-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VJ)-CLUSTER] synonym: "V-(VJ)-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000525 name: V_VJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one VJ-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VJ)-J-CLUSTER] synonym: "V-(VJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! 
vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000526 name: V_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including more than one V-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-CLUSTER] synonym: "V-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000527 name: V_D_DJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one D-gene, one DJ-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-(DJ)-C-CLUSTER] synonym: "V-D-(DJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000528 name: V_D_DJ_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one D-gene, one DJ-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-(DJ)-CLUSTER] synonym: "V-D-(DJ)-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000529 name: V_D_DJ_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one D-gene, one DJ-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-(DJ)-J-C-CLUSTER] synonym: "V-D-(DJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000530 name: V_D_DJ_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one D-gene, one DJ-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-(DJ)-J-CLUSTER] synonym: "V-D-(DJ)-J-CLUSTER" RELATED [] is_a: SO:0000482 ! 
vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000531 name: V_D_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including at least one V-gene, one D-gene and one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-J-C-CLUSTER] synonym: "V-D-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000532 name: V_D_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including at least one V-gene, one D-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-J-CLUSTER] synonym: "V-D-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000533 name: V_heptamer def: "7 nucleotide recombination site (e.g. CACAGTG), part of V-gene recombination feature of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-HEPTAMER] synonym: "V-HEPTAMER" RELATED [] is_a: SO:0000561 ! heptamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000538 ! V_gene_recombination_feature [Term] id: SO:0000534 name: V_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including at least one V-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-J-CLUSTER] synonym: "V-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000535 name: V_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including at least one V-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-J-C-CLUSTER] synonym: "V-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000536 name: V_nonamer def: "9 nucleotide recombination site (e.g. 
ACAAAAACC), part of V-gene recombination feature of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-NONAMER] synonym: "V-NONAMER" RELATED [] is_a: SO:0000562 ! nonamer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000538 ! V_gene_recombination_feature [Term] id: SO:0000537 name: V_spacer def: "12 or 23 nucleotide spacer between the V-heptamer and the V-nonamer of a V-gene recombination feature of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-SPACER] synonym: "V-SPACER" RELATED [] is_a: SO:0000563 ! spacer_of_recombination_feature_of_vertebrate_immune_system_gene relationship: part_of SO:0000538 ! V_gene_recombination_feature [Term] id: SO:0000538 name: V_gene_recombination_feature def: "Recombination signal including V-heptamer, V-spacer and V-nonamer in 3' of V-region of a V-gene or V-sequence of an immunoglobulin/T-cell receptor gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-RS] synonym: "V-RS" RELATED [] is_a: SO:0000301 ! recombination_feature_of_vertebrate_immune_system_gene [Term] id: SO:0000539 name: DJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one DJ-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(DJ)-C-CLUSTER] synonym: "(DJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000540 name: DJ_J_C_cluster def: "Genomic DNA in rearranged configuration including at least one D-J-GENE, one J-GENE and one C-GENE." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(DJ)-J-C-CLUSTER] synonym: "(DJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000541 name: VDJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one VDJ-gene and one C-gene." 
[http://imgt.cines.fr/ligmdb/LIGMlect?query=7#(VDJ)-C-CLUSTER] synonym: "(VDJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000542 name: V_DJ_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one DJ-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(DJ)-C-CLUSTER] synonym: "V-(DJ)-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000543 name: alternately_spliced_gene_encoding_greater_than_one_transcript is_obsolete: true [Term] id: SO:0000544 name: helitron def: "A rolling circle transposon. Autonomous Helitrons encode a 5'-to-3' DNA helicase and nuclease/ligase similar to those encoded by known rolling-circle replicons." [http://www.pnas.org/cgi/content/full/100/11/6569] is_a: SO:0000101 ! transposable_element [Term] id: SO:0000545 name: recoding_pseudoknot def: "The pseudoknots involved in recoding are unique in that, as they play their role as a structure, they are immediately unfolded and their now linear sequence serves as a template for decoding." [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=33937] is_a: SO:0000591 ! pseudoknot relationship: part_of SO:1001268 ! recoding_stimulatory_region [Term] id: SO:0000546 name: designed_sequence is_a: SO:0000351 ! synthetic_sequence [Term] id: SO:0000547 name: inversion_derived_bipartite_duplication def: "A chromosome generated by recombination between two inversions; there is a duplication at each end of the inversion." [FB:km] is_a: SO:1000038 ! intrachromosomal_duplication [Term] id: SO:0000548 name: gene_with_edited_transcript is_a: SO:0000064 ! 
gene_by_transcript_attribute [Term] id: SO:0000549 name: inversion_derived_duplication_plus_aneuploid def: "A chromosome generated by recombination between two inversions; has a duplication at one end and presumed to have a deficiency or duplication at the other end of the inversion." [FB:km] is_a: SO:1000038 ! intrachromosomal_duplication [Term] id: SO:0000550 name: aneuploid_chromosome is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:0000551 name: polyA_signal_sequence def: "The recognition sequence necessary for endonuclease cleavage of an RNA transcript that is followed by polyadenylation; consensus=AATAAA." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000552 name: Shine_Dalgarno_sequence def: "Region in 5' UTR where ribosome assembles on mRNA." [SO:ke] synonym: "RBS" RELATED [] synonym: "Shine-Dalgarno_sequence" RELATED [] synonym: "five_prime_ribosome_binding_site" RELATED [] is_a: SO:0000139 ! ribosome_entry_site [Term] id: SO:0000553 name: polyA_site def: "The site on an RNA transcript to which will be added adenine residues by post-transcriptional polyadenylation." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA is_a: SO:0000699 ! junction relationship: part_of SO:0000233 ! processed_transcript [Term] id: SO:0000554 name: assortment_derived_deficiency_plus_duplication is_obsolete: true [Term] id: SO:0000555 name: five_prime_clip def: "5' most region of a precursor transcript that is clipped off during processing." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] synonym: "five_prime_-clip" RELATED [] is_a: SO:0000303 ! 
clip [Term] id: SO:0000556 name: five_prime_D_recombination_signal_sequence def: "Recombination signal of an immunoglobulin/T-cell receptor gene, including the 5' D-nonamer (SO:0000497), 5' D-spacer (SO:0000498), and 5' D-heptamer (SO:0000496) in 5' of the D-region of a D-gene, or in 5' of the D-region of DJ-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#5'D-RS] synonym: "5'RS" RELATED [] synonym: "five_prime_D-recombination_signal_sequence" RELATED [] is_a: SO:0000492 ! D_gene_recombination_feature [Term] id: SO:0000557 name: three_prime_clip def: "3'-most region of a precursor transcript that is clipped off during processing." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] synonym: "3'-clip" RELATED [] is_a: SO:0000303 ! clip [Term] id: SO:0000558 name: C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene including more than one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#C-CLUSTER] synonym: "C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000559 name: D_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including more than one D-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-CLUSTER] synonym: "D-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000560 name: D_J_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in germline configuration including at least one D-gene and one J-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-J-CLUSTER] synonym: "D-J-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000561 name: heptamer_of_recombination_feature_of_vertebrate_immune_system_gene def: " 7 nucleotide recombination site (e.g.
CACAGTG), part of V-gene, D-gene or J-gene recombination feature of an immunoglobulin/T-cell receptor gene" [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#HEPTAMER] synonym: "HEPTAMER" RELATED [] is_a: SO:0000301 ! recombination_feature_of_vertebrate_immune_system_gene [Term] id: SO:0000562 name: nonamer_of_recombination_feature_of_vertebrate_immune_system_gene is_a: SO:0000301 ! recombination_feature_of_vertebrate_immune_system_gene [Term] id: SO:0000563 name: spacer_of_recombination_feature_of_vertebrate_immune_system_gene is_a: SO:0000301 ! recombination_feature_of_vertebrate_immune_system_gene [Term] id: SO:0000564 name: V_DJ_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one DJ-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(DJ)-J-C-CLUSTER] synonym: "V-(DJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000565 name: V_VDJ_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one VDJ-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VDJ)-J-C-CLUSTER] synonym: "V-(VDJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000566 name: V_VJ_J_C_cluster def: "Genomic DNA of immunoglobulin/T-cell receptor gene in rearranged configuration including at least one V-gene, one VJ-gene, one J-gene and one C-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-(VJ)-J-C-CLUSTER] synonym: "V-(VJ)-J-C-CLUSTER" RELATED [] is_a: SO:0000482 ! vertebrate_immunoglobulin_T_cell_receptor_gene_cluster [Term] id: SO:0000567 name: inversion_derived_aneuploid_chromosome def: "A chromosome may be generated by recombination between two inversions; presumed to have a deficiency or duplication at each end of the inversion."
[FB:km] is_a: SO:0000550 ! aneuploid_chromosome [Term] id: SO:0000568 name: bidirectional_promotor is_a: SO:0000167 ! promoter [Term] id: SO:0000569 name: retrotransposed_protein_coding_gene alt_id: SO:0100042 synonym: "captured_pseudogene" RELATED [] is_a: SO:0000010 ! protein_coding_gene is_a: SO:0000042 ! pseudogene_attribute [Term] id: SO:0000570 name: three_prime_D_recombination_signal_sequence def: "Recombination signal of an immunoglobulin/T-cell receptor gene, including the 3' D-heptamer (SO:0000493), 3' D-spacer, and 3' D-nonamer (SO:0000494) in 3' of the D-region of a D-gene." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#3'D-RS] synonym: "3'D-RS" RELATED [] synonym: "three_prime_D-recombination_signal_sequence" RELATED [] is_a: SO:0000492 ! D_gene_recombination_feature [Term] id: SO:0000571 name: miRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000572 name: DJ_gene def: "Genomic DNA of immunoglobulin/T-cell receptor gene in partially rearranged genomic DNA including D-J-region with 5' UTR and 3' UTR, also designated as D-J-segment." [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#D-J-GENE] synonym: "D_J_GENE" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene relationship: part_of SO:0000485 ! DJ_J_cluster relationship: part_of SO:0000504 ! D_DJ_C_cluster relationship: part_of SO:0000505 ! D_DJ_cluster relationship: part_of SO:0000506 ! D_DJ_J_C_cluster relationship: part_of SO:0000508 ! D_DJ_J_cluster relationship: part_of SO:0000518 ! V_DJ_cluster relationship: part_of SO:0000519 ! V_DJ_J_cluster relationship: part_of SO:0000527 ! V_D_DJ_C_cluster relationship: part_of SO:0000528 ! V_D_DJ_cluster relationship: part_of SO:0000529 ! V_D_DJ_J_C_cluster relationship: part_of SO:0000530 ! V_D_DJ_J_cluster relationship: part_of SO:0000539 ! DJ_C_cluster relationship: part_of SO:0000540 ! DJ_J_C_cluster relationship: part_of SO:0000542 ! V_DJ_C_cluster relationship: part_of SO:0000564 ! 
V_DJ_J_C_cluster [Term] id: SO:0000573 name: rRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000574 name: VDJ_gene def: "Rearranged genomic DNA of immunoglobulin/T-cell receptor gene including L-part1, V-intron and V-D-J-exon, with the 5'UTR (SO:0000204) and 3'UTR (SO:0000205)" [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-D-J-GENE] synonym: "V-D-J-GENE" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene relationship: part_of SO:0000487 ! VDJ_J_C_cluster relationship: part_of SO:0000488 ! VDJ_J_cluster relationship: part_of SO:0000520 ! V_VDJ_C_cluster relationship: part_of SO:0000521 ! V_VDJ_cluster relationship: part_of SO:0000522 ! V_VDJ_J_cluster relationship: part_of SO:0000541 ! VDJ_C_cluster relationship: part_of SO:0000565 ! V_VDJ_J_C_cluster [Term] id: SO:0000575 name: scRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000576 name: VJ_gene def: "Rearranged genomic DNA of immunoglobulin/T-cell receptor gene including L-part1, V-intron and V-J-exon, with the 5'UTR (SO:0000204) and 3'UTR (SO:0000205)" [http://imgt.cines.fr/ligmdb/LIGMlect?query=7#V-J-GENE] synonym: "V-J-GENE" RELATED [] is_a: SO:0000460 ! vertebrate_immunoglobulin_T_cell_receptor_gene relationship: part_of SO:0000489 ! VJ_C_cluster relationship: part_of SO:0000490 ! VJ_J_C_cluster relationship: part_of SO:0000491 ! VJ_J_cluster relationship: part_of SO:0000523 ! V_VJ_C_cluster relationship: part_of SO:0000524 ! V_VJ_cluster relationship: part_of SO:0000525 ! V_VJ_J_cluster relationship: part_of SO:0000566 ! V_VJ_J_C_cluster [Term] id: SO:0000577 name: centromere def: "A region of chromosome where the spindle fibers attach during mitosis and meiosis." [SO:ke] subset: SOFA is_a: SO:0000628 ! chromosomal_structural_element [Term] id: SO:0000578 name: snoRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000579 name: edited_transcript_feature def: "A locatable feature on a transcript that is edited."
[SO:ma] relationship: part_of SO:0000673 ! transcript [Term] id: SO:0000580 name: methylation_guide_snoRNA_primary_transcript def: "A primary transcript encoding a methylation guide small nucleolar RNA." [SO:ke] is_a: SO:0000232 ! snoRNA_primary_transcript [Term] id: SO:0000581 name: cap def: "A structure consisting of a 7-methylguanosine in 5'-5' triphosphate linkage with the first nucleotide of an mRNA. It is added post-transcriptionally, and is not encoded in the DNA." [http://seqcore.brcf.med.umich.edu/doc/educ/dnapr/mbglossary/mbgloss.html] subset: SOFA relationship: adjacent_to SO:0000234 ! mRNA [Term] id: SO:0000582 name: rRNA_cleavage_snoRNA_primary_transcript def: "A primary transcript encoding an rRNA cleavage snoRNA." [SO:ke] is_a: SO:0000232 ! snoRNA_primary_transcript [Term] id: SO:0000583 name: pre_edited_region def: "The region of a transcript that will be edited." [http://www.rna.ucla.edu] synonym: "pre-edited_region" RELATED [] is_a: SO:0000579 ! edited_transcript_feature [Term] id: SO:0000584 name: tmRNA def: "tmRNA liberates a mRNA from a stalled ribosome. To accomplish this, part of the tmRNA is used as a reading frame that ends in a translation stop signal. The broken mRNA is replaced in the ribosome by the tmRNA and translation of the tmRNA leads to addition of a proteolysis tag to the incomplete protein enabling recognition by a protease. Recently a number of permuted tmRNA genes have been found encoded in two parts. tmRNAs have been identified in eubacteria and some chloroplasts but are absent from archaeal and eukaryote nuclear genomes." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00023] synonym: "10Sa_RNA" RELATED [] synonym: "ssrA" RELATED [] is_a: SO:0000370 ! small_regulatory_ncRNA [Term] id: SO:0000585 name: C_D_box_snoRNA_gene is_a: SO:0000578 ! snoRNA_gene [Term] id: SO:0000586 name: tmRNA_primary_transcript def: "A primary transcript encoding a tmRNA (SO:0000584)."
[SO:ke] synonym: "10Sa_RNA_primary_transcript" RELATED [] synonym: "ssrA_RNA_primary_transcript" RELATED [] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000587 name: group_I_intron def: "Group I catalytic introns are large self-splicing ribozymes. They catalyse their own excision from mRNA, tRNA and rRNA precursors in a wide range of organisms. The core secondary structure consists of 9 paired regions (P1-P9). These fold to essentially two domains, the P4-P6 domain (formed from the stacking of P5, P4, P6 and P6a helices) and the P3-P9 domain (formed from the P8, P3, P7 and P9 helices). Group I catalytic introns often have long ORFs inserted in loop regions." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00028] subset: SOFA is_a: SO:0000188 ! intron [Term] id: SO:0000588 name: autocatalytically_spliced_intron def: "A self spliced intron." [SO:ke] subset: SOFA is_a: SO:0000188 ! intron is_a: SO:0000374 ! ribozyme [Term] id: SO:0000589 name: SRP_RNA_primary_transcript def: "A primary transcript encoding a signal recognition particle RNA." [SO:ke] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000590 name: SRP_RNA def: "The signal recognition particle (SRP) is a universally conserved ribonucleoprotein. It is involved in the co-translational targeting of proteins to membranes. The eukaryotic SRP consists of a 300-nucleotide 7S RNA and six proteins: SRPs 72, 68, 54, 19, 14, and 9. Archaeal SRP consists of a 7S RNA and homologues of the eukaryotic SRP19 and SRP54 proteins. In most eubacteria, the SRP consists of a 4.5S RNA and the Ffh protein (a homologue of the eukaryotic SRP54 protein). Eukaryotic and archaeal 7S RNAs have very similar secondary structures, with eight helical elements. These fold into the Alu and S domains, separated by a long linker region. Eubacterial SRP is generally a simpler structure, with the M domain of Ffh bound to a region of the 4.5S RNA that corresponds to helix 8 of the eukaryotic and archaeal SRP S domain. 
Some Gram-positive bacteria (e.g. Bacillus subtilis), however, have a larger SRP RNA that also has an Alu domain. The Alu domain is thought to mediate the peptide chain elongation retardation function of the SRP. The universally conserved helix which interacts with the SRP54/Ffh M domain mediates signal sequence recognition. In eukaryotes and archaea, the SRP19-helix 6 complex is thought to be involved in SRP assembly and stabilizes helix 8 for SRP54 binding." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00017] subset: SOFA synonym: "7S RNA" RELATED [] synonym: "signal_recognition_particle_RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000591 name: pseudoknot def: "A stem-loop RNA structure where nucleotides in the loop participate in complementary interactions with a region of RNA downstream of the stem-loop." [https://www.ncbi.nlm.nih.gov/pubmed/12519954] is_a: SO:0000002 ! sequence_secondary_structure [Term] id: SO:0000592 name: H_pseudoknot def: "A pseudoknot which contains two stems and at least two loops." [https://www.ncbi.nlm.nih.gov/pubmed/10334330] synonym: "H-pseudoknot" RELATED [] is_a: SO:0000591 ! pseudoknot [Term] id: SO:0000593 name: C_D_box_snoRNA def: "Most box C/D snoRNAs also contain long (>10 nt) sequences complementary to rRNA. Boxes C and D, as well as boxes C' and D', are usually located in close proximity, and form a structure known as the box C/D motif. This motif is important for snoRNA stability, processing, nucleolar targeting and function. A small number of box C/D snoRNAs are involved in rRNA processing; most, however, are known or predicted to serve as guide RNAs in ribose methylation of rRNA. Targeting involves direct base pairing of the snoRNA at the rRNA site to be modified and selection of a rRNA nucleotide a fixed distance from box D or D'." [http://www.bio.umass.edu/biochem/rna-sequence/Yeast_snoRNA_Database/snoRNA_DataBase.html] synonym: "C/D_box_snoRNA" RELATED [] is_a: SO:0000275 ! 
snoRNA [Term] id: SO:0000594 name: H_ACA_box_snoRNA def: "Members of the box H/ACA family contain an ACA triplet, exactly 3 nt upstream from the 3' end and an H-box in a hinge region that links two structurally similar functional domains of the molecule. Both boxes are important for snoRNA biosynthesis and function. A few box H/ACA snoRNAs are involved in rRNA processing; most others are known or predicted to participate in selection of uridine nucleosides in rRNA to be converted to pseudouridines. Site selection is mediated by direct base pairing of the snoRNA with rRNA through one or both targeting domains." [http://www.bio.umass.edu/biochem/rna-sequence/Yeast_snoRNA_Database/snoRNA_DataBase.html] synonym: "H/ACA_box_snoRNA" RELATED [] is_a: SO:0000275 ! snoRNA [Term] id: SO:0000595 name: C_D_box_snoRNA_primary_transcript def: "A primary transcript encoding a small nucleolar RNA of the box C/D family." [SO:ke] is_a: SO:0000232 ! snoRNA_primary_transcript [Term] id: SO:0000596 name: H_ACA_box_snoRNA_primary_transcript def: "A primary transcript encoding a small nucleolar RNA of the box H/ACA family." [SO:ke] is_a: SO:0000232 ! snoRNA_primary_transcript [Term] id: SO:0000597 name: transcript_edited_by_U_insertion/deletion def: "The insertion and deletion of uridine (U) residues, usually within coding regions of mRNA transcripts of cryptogenes in the mitochondrial genome of kinetoplastid protozoa." [http://www.rna.ucla.edu/index.html] is_a: SO:0000116 ! edited_transcript [Term] id: SO:0000598 name: transcript_edited_by_C_insertion_and_dinucleotide_insertion def: "The type of RNA editing found in the mitochondria of Myxomycota, characterized by the insertion of mono- and dinucleotides in RNAs relative to their mtDNA template and in addition, C to U base conversion. The most common mononucleotide insertion is cytidine, although a number of uridine mononucleotides are inserted at specific sites. Adenine and guanine have not been observed in mononucleotide insertions. 
Five different dinucleotide insertions have been observed, GC, GU, CU, AU and AA. Both mono- and dinucleotide insertions create open reading frames in mRNA and contribute to highly conserved structural features of rRNAs and tRNAs." [http://nsm1.utdallas.edu/bio/miller/physarum/overview.htm] synonym: "transcript_edited_by_C-insertion_and_dinucleotide_insertion" RELATED [] is_a: SO:0000116 ! edited_transcript [Term] id: SO:0000599 name: transcript_edited_by_C_to_U_substitution is_a: SO:0000116 ! edited_transcript [Term] id: SO:0000600 name: transcript_edited_by_A_to_I_substitution is_a: SO:0000116 ! edited_transcript [Term] id: SO:0000601 name: transcript_edited_by_G_addition is_a: SO:0000116 ! edited_transcript [Term] id: SO:0000602 name: guide_RNA def: "A short 3'-uridylated RNA that can form a perfect duplex (except for the oligoU tail (SO:0000609)) with a stretch of mature edited mRNA." [http://www.rna.ucla.edu/index.html] subset: SOFA synonym: "gRNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000603 name: group_II_intron def: "Group II introns are found in rRNA, tRNA and mRNA of organelles in fungi, plants and protists, and also in mRNA in bacteria. They are large self-splicing ribozymes and have 6 structural domains (usually designated dI to dVI). A subset of group II introns also encode essential splicing proteins in intronic ORFs. The length of these introns can therefore be up to 3kb. Splicing occurs in almost identical fashion to nuclear pre-mRNA splicing with two transesterification steps. The 2' hydroxyl of a bulged adenosine in domain VI attacks the 5' splice site, followed by nucleophilic attack on the 3' splice site by the 3' OH of the upstream exon. Protein machinery is required for splicing in vivo, and long range intron-intron and intron-exon interactions are important for splice site positioning. 
Group II introns are further sub-classified into groups IIA and IIB which differ in splice site consensus, distance of bulged A from 3' splice site, some tertiary interactions, and intronic ORF phylogeny." [http://www.sanger.ac.uk/Software/Rfam/browse/index.shtml] subset: SOFA is_a: SO:0000188 ! intron [Term] id: SO:0000604 name: editing_block def: "Edited mRNA sequence mediated by a single guide RNA (SO:0000602)." [http://www.rna.ucla/] is_a: SO:0000579 ! edited_transcript_feature [Term] id: SO:0000605 name: intergenic_region def: "The region between two known genes." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000606 name: editing_domain def: "Edited mRNA sequence mediated by two or more overlapping guide RNAs (SO:0000602)." [http://www.rna.ucla/] is_a: SO:0000579 ! edited_transcript_feature [Term] id: SO:0000607 name: unedited_region def: "The region of an edited transcript that will not be edited." [http://www.rna.ucla.edu/] is_a: SO:0000579 ! edited_transcript_feature [Term] id: SO:0000608 name: H_ACA_box_snoRNA_gene is_a: SO:0000578 ! snoRNA_gene [Term] id: SO:0000609 name: oligo_U_tail def: "The string of non-encoded U's at the 3' end of a guide RNA (SO:0000602)." [http://www.rna.ucla.edu/] relationship: adjacent_to SO:0000602 ! guide_RNA [Term] id: SO:0000610 name: polyA_sequence def: "Sequence of about 100 nucleotides of A added to the 3' end of most eukaryotic mRNAs." [SO:ke] subset: SOFA relationship: adjacent_to SO:0000234 ! mRNA [Term] id: SO:0000611 name: branch_site def: "A pyrimidine rich sequence near the 3' end of an intron to which the 5'end becomes covalently bound during nuclear splicing. The resulting structure resembles a lariat." [SO:ke] subset: SOFA synonym: "branch_point" RELATED [] relationship: part_of SO:0000662 ! spliceosomal_intron [Term] id: SO:0000612 name: polypyrimidine_tract def: "The polypyrimidine tract is one of the cis-acting sequence elements directing intron removal in pre-mRNA splicing." 
[http://nar.oupjournals.org/cgi/content/full/25/4/888] subset: SOFA relationship: part_of SO:0000662 ! spliceosomal_intron [Term] id: SO:0000613 name: bacterial_RNApol_promoter def: "A DNA sequence to which bacterial RNA polymerase binds, to begin transcription." [SO:ke] is_a: SO:0000167 ! promoter is_a: SO:0000752 ! gene_group_regulatory_region [Term] id: SO:0000614 name: bacterial_terminator def: "A terminator signal for bacterial transcription." [SO:ke] is_a: SO:0000141 ! terminator [Term] id: SO:0000615 name: terminator_of_type_2_RNApol_III_promoter def: "A terminator signal for RNA polymerase III transcription." [SO:ke] is_a: SO:0000141 ! terminator [Term] id: SO:0000616 name: transcription_end_site def: "The site where transcription ends." [SO:ke] subset: SOFA is_a: SO:0000699 ! junction relationship: part_of SO:0000185 ! primary_transcript [Term] id: SO:0000617 name: RNApol_III_promoter_type_1 is_a: SO:0000171 ! RNApol_III_promoter [Term] id: SO:0000618 name: RNApol_III_promoter_type_2 synonym: "tRNA_promoter" RELATED [] is_a: SO:0000171 ! RNApol_III_promoter [Term] id: SO:0000619 name: A_box relationship: part_of SO:0000618 ! RNApol_III_promoter_type_2 [Term] id: SO:0000620 name: B_box relationship: part_of SO:0000618 ! RNApol_III_promoter_type_2 [Term] id: SO:0000621 name: RNApol_III_promoter_type_3 is_a: SO:0000171 ! RNApol_III_promoter [Term] id: SO:0000622 name: C_box relationship: part_of SO:0000617 ! RNApol_III_promoter_type_1 [Term] id: SO:0000623 name: snRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000624 name: telomere def: "A specific structure at the end of a linear chromosome, required for the integrity and maintenance of the end." [SO:ma] subset: SOFA is_a: SO:0000628 ! chromosomal_structural_element [Term] id: SO:0000625 name: silencer def: "Combination of short DNA sequence elements which suppress the transcription of an adjacent gene or genes."
[http://www.brunel.ac.uk/depts/bio/project/old_hmg/gloss3.htm#s] subset: SOFA is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000626 name: chromosomal_regulatory_element relationship: part_of SO:0000340 ! chromosome [Term] id: SO:0000627 name: insulator subset: SOFA synonym: "insulator_element" RELATED [] is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000628 name: chromosomal_structural_element subset: SOFA relationship: part_of SO:0000340 ! chromosome [Term] id: SO:0000629 name: five_prime_open_reading_frame relationship: part_of SO:0000204 ! five_prime_UTR [Term] id: SO:0000630 name: upstream_AUG_codon relationship: part_of SO:0000203 ! UTR [Term] id: SO:0000631 name: polycistronic_primary_transcript def: "A primary transcript encoding for more than one protein product." [SO:ke] is_a: SO:0000078 ! polycistronic_transcript [Term] id: SO:0000632 name: monocistronic_primary_transcript def: "A primary transcript encoding for one protein product." [SO:ke] is_a: SO:0000665 ! monocistronic_transcript [Term] id: SO:0000633 name: monocistronic_mRNA def: "An mRNA with either a single protein product, or for which the regions encoding all its protein products overlap." [SO:rd] synonym: "monocistronic_processed_transcript" RELATED [] is_a: SO:0000665 ! monocistronic_transcript [Term] id: SO:0000634 name: polycistronic_mRNA def: "An mRNA that encodes multiple proteins from at least two non-overlapping regions." [SO:rd] synonym: "polycistronic_processed_transcript" RELATED [] is_a: SO:0000078 ! polycistronic_transcript [Term] id: SO:0000635 name: mini_exon_donor_RNA def: "The 3' site of a mini-exon which is trans-spliced on to the 5'end of a mature mRNA." [SO:ke] synonym: "mini-exon_donor_RNA" RELATED [] is_a: SO:0000185 ! primary_transcript [Term] id: SO:0000636 name: spliced_leader_RNA synonym: "mini-exon" RELATED [] relationship: part_of SO:0000635 !
mini_exon_donor_RNA [Term] id: SO:0000637 name: engineered_plasmid synonym: "engineered_plasmid_gene" RELATED [] is_a: SO:0000098 ! plasmid_gene is_a: SO:0000280 ! engineered_gene [Term] id: SO:0000638 name: transcribed_spacer_region def: "Part of an rRNA transcription unit that is transcribed but discarded during maturation, not giving rise to any part of rRNA." [http://oregonstate.edu/instruction/bb492/general/glossary.html] relationship: part_of SO:0000209 ! rRNA_primary_transcript [Term] id: SO:0000639 name: internal_transcribed_spacer_region def: "Non-coding regions of DNA sequence that separate genes coding for the 28S, 5.8S, and 18S ribosomal RNAs." [SO:ke] is_a: SO:0000638 ! transcribed_spacer_region [Term] id: SO:0000640 name: external_transcribed_spacer_region def: "Non-coding regions of DNA that precede the sequence that codes for the ribosomal RNA." [SO:ke] is_a: SO:0000638 ! transcribed_spacer_region [Term] id: SO:0000641 name: tetranucleotide_repeat_microsatellite_feature is_a: SO:0000289 ! microsatellite [Term] id: SO:0000642 name: SRP_RNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000643 name: minisatellite def: "A repetitive sequence spanning 500 to 20,000 base pairs (a repeat unit is 5 - 30 base pairs)." [http://www.rerf.or.jp/eigo/glossary/minisate.htm] subset: SOFA is_a: SO:0000705 ! tandem_repeat [Term] id: SO:0000644 name: antisense_RNA def: "Antisense RNA is RNA that is transcribed from the coding, rather than the template, strand of DNA. It is therefore complementary to mRNA." [SO:ke] subset: SOFA is_a: SO:0000655 ! ncRNA [Term] id: SO:0000645 name: antisense_primary_transcript def: "The reverse complement of the primary transcript." [SO:ke] subset: SOFA is_a: SO:0000185 ! 
primary_transcript [Term] id: SO:0000646 name: siRNA def: "Small RNA molecule that is the product of a longer exogenous or endogenous dsRNA, which is either a bimolecular duplex or very long hairpin, processed (via the Dicer pathway) such that numerous siRNAs accumulate from both strands of the dsRNA. siRNAs trigger the cleavage of their target molecules." [PMID:12592000] subset: SOFA synonym: "small_interfering_RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000647 name: miRNA_primary_transcript def: "A primary transcript encoding a micro RNA." [SO:ke] synonym: "micro_RNA_primary_transcript" RELATED [] is_a: SO:0000483 ! nc_primary_transcript [Term] id: SO:0000648 name: stRNA_primary_transcript def: "A primary transcript encoding a small temporal RNA (SO:0000649)." [SO:ke] synonym: "small_temporal_RNA_primary_transcript" RELATED [] is_a: SO:0000647 ! miRNA_primary_transcript [Term] id: SO:0000649 name: stRNA def: "Non-coding RNAs of about 21 nucleotides in length that regulate temporal development; first discovered in C. elegans." [PMID:11081512] subset: SOFA synonym: "small_temporal_RNA" RELATED [] is_a: SO:0000655 ! ncRNA [Term] id: SO:0000650 name: small_subunit_rRNA is_a: SO:0000252 ! rRNA [Term] id: SO:0000651 name: large_subunit_rRNA is_a: SO:0000252 ! rRNA [Term] id: SO:0000652 name: rRNA_5S def: "5S ribosomal RNA (5S rRNA) is a component of the large ribosomal subunit in both prokaryotes and eukaryotes. In eukaryotes, it is synthesised by RNA polymerase III (the other eukaryotic rRNAs are cleaved from a 45S precursor synthesised by RNA polymerase I). In Xenopus oocytes, it has been shown that fingers 4-7 of the nine-zinc finger transcription factor TFIIIA can bind to the central region of 5S RNA. Thus, in addition to positively regulating 5S rRNA transcription, TFIIIA also stabilises 5S rRNA until it is required for transcription." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00001] subset: SOFA is_a: SO:0000252 !
rRNA [Term] id: SO:0000653 name: rRNA_28S def: "A component of the large ribosomal subunit." [SO:ke] subset: SOFA synonym: "23S_rRNA" RELATED [] synonym: "28S_rRNA" RELATED [] is_a: SO:0000252 ! rRNA [Term] id: SO:0000654 name: maxi_circle_gene synonym: "maxi-circle" RELATED [] is_a: SO:0000088 ! mt_gene [Term] id: SO:0000655 name: ncRNA def: "An mRNA sequence that does not encode for a protein rather the RNA molecule is the gene product." [SO:ke] comment: ncRNA is a processed_transcript so it may not contain parts such as transcribed_spacer_regions that are removed in the act of processing. For the corresponding primary_transcripts, please see term SO:0000483 nc_primary_transcript. subset: SOFA synonym: "noncoding_RNA" RELATED [] is_a: SO:0000233 ! processed_transcript [Term] id: SO:0000656 name: stRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000657 name: repeat_region def: "A region of sequence containing one or more repeat units." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000658 name: dispersed_repeat def: "A repeat that is located at dispersed sites in the genome." [SO:ke] subset: SOFA synonym: "interspersed_repeat" RELATED [] is_a: SO:0000657 ! repeat_region [Term] id: SO:0000659 name: tmRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000660 name: DNA_invertase_target_sequence is_a: SO:0000342 ! site_specific_recombination_target_region [Term] id: SO:0000661 name: intron_attribute is_a: SO:0000401 ! gene_attribute [Term] id: SO:0000662 name: spliceosomal_intron def: "An intron which is spliced by the spliceosome." [SO:ke] subset: SOFA is_a: SO:0000188 ! intron [Term] id: SO:0000663 name: tRNA_gene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:0000664 name: introgressed_chromosome_region relationship: part_of SO:0000340 ! chromosome [Term] id: SO:0000665 name: monocistronic_transcript is_a: SO:0000115 ! transcript_feature [Term] id: SO:0000666 name: mobile_intron is_a: SO:0000661 ! 
intron_attribute [Term] id: SO:0000667 name: insertion def: "A region of sequence identified as having been inserted." [SO:ke] subset: SOFA is_a: SO:0000001 ! region is_a: SO:0000109 ! sequence_variant relationship: sequence_of SO:0000046 ! insert [Term] id: SO:0000668 name: EST_match def: "A match against an EST sequence." [SO:ke] subset: SOFA is_a: SO:0000102 ! expressed_sequence_match [Term] id: SO:0000669 name: sequence_rearrangement_feature is_a: SO:0000298 ! recombination_feature [Term] id: SO:0000670 name: chromosome_breakage_sequence def: "A sequence within the micronuclear DNA of ciliates at which chromosome breakage and telomere addition occurs during nuclear differentiation." [SO:ma] is_a: SO:0000669 ! sequence_rearrangement_feature [Term] id: SO:0000671 name: internal_eliminated_sequence def: "A sequence eliminated from the genome of ciliates during nuclear differentiation." [SO:ma] is_a: SO:0000669 ! sequence_rearrangement_feature [Term] id: SO:0000672 name: macronucleus_destined_segment def: "A sequence that is conserved, although rearranged relative to the micronucleus, in the macronucleus of a ciliate genome." [SO:ma] is_a: SO:0000669 ! sequence_rearrangement_feature [Term] id: SO:0000673 name: transcript def: "An RNA synthesized on a DNA or RNA template by an RNA polymerase." [SO:ma] subset: SOFA relationship: member_of SO:0000704 ! gene [Term] id: SO:0000674 name: non_canonical_splice_site def: "A splice site where the donor and acceptor sites differ from the canonical form." [SO:ke] synonym: "non-canonical_splice_site" RELATED [] is_a: SO:0000162 ! splice_site [Term] id: SO:0000675 name: canonical_splice_site def: "The major class of splice site with dinucleotides GT and AG for donor and acceptor sites, respectively." [SO:ke] is_a: SO:0000162 ! splice_site [Term] id: SO:0000676 name: canonical_three_prime_splice_site def: "The canonical 3' splice site has the sequence \"AG\"." [SO:ke] is_a: SO:0000164 ! splice_acceptor_site is_a: SO:0000675 ! 
canonical_splice_site [Term] id: SO:0000677 name: canonical_five_prime_splice_site def: "The canonical 5' splice site has the sequence \"GT\"." [SO:ke] is_a: SO:0000163 ! splice_donor_site is_a: SO:0000675 ! canonical_splice_site [Term] id: SO:0000678 name: non_canonical_three_prime_splice_site def: "A 3' splice site that does not have the sequence \"AG\"." [SO:ke] synonym: "non-canonical_three_prime_splice_site" RELATED [] is_a: SO:0000164 ! splice_acceptor_site is_a: SO:0000674 ! non_canonical_splice_site [Term] id: SO:0000679 name: non_canonical_five_prime_splice_site def: "A 5' splice site which does not have the sequence \"GT\"." [SO:ke] synonym: "non-canonical-five_prime_splice_site" RELATED [] is_a: SO:0000163 ! splice_donor_site is_a: SO:0000674 ! non_canonical_splice_site [Term] id: SO:0000680 name: non_canonical_start_codon def: "A start codon that is not the usual AUG sequence." [SO:ke] synonym: "non-canonical_start_codon" RELATED [] synonym: "non_ATG_start_codon" RELATED [] is_a: SO:0000318 ! start_codon [Term] id: SO:0000681 name: aberrant_processed_transcript def: "A transcript that has been processed \"incorrectly\", for example by the failure of splicing of one or more exons." [SO:ke] is_a: SO:0000233 ! processed_transcript [Term] id: SO:0000682 name: splicing_feature is_obsolete: true [Term] id: SO:0000683 name: exonic_splice_enhancer def: "Exonic splicing enhancers (ESEs) facilitate exon definition by assisting in the recruitment of splicing factors to the adjacent intron." [https://www.ncbi.nlm.nih.gov/pubmed/12403462] is_a: SO:0000344 ! splice_enhancer [Term] id: SO:0000684 name: nuclease_sensitive_site def: "A region of nucleotide sequence targeting by a nuclease enzyme." [SO:ma] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000685 name: DNAaseI_hypersensitive_site is_a: SO:0000322 ! 
nuclease_hypersensitive_site [Term] id: SO:0000686 name: translocation_element def: "For some translocations, particularly but not exclusively, reciprocal translocations, the chromosomes carrying non-homologous centromeres may be recovered independently. These chromosomes are described as translocation elements." [SO:ma] relationship: part_of SO:1000044 ! chromosomal_translocation [Term] id: SO:0000687 name: deletion_junction def: "The space between two bases in a sequence which marks the position where a deletion has occurred." [SO:ke] subset: SOFA is_a: SO:0000109 ! sequence_variant is_a: SO:0000699 ! junction relationship: position_of SO:0000045 ! delete [Term] id: SO:0000688 name: golden_path def: "A set of subregions selected from sequence contigs which when concatenated form a nonredundant linear sequence." [SO:ls] subset: SOFA is_a: SO:0000353 ! assembly [Term] id: SO:0000689 name: cDNA_match def: "A match against cDNA sequence." [SO:ke] subset: SOFA is_a: SO:0000102 ! expressed_sequence_match [Term] id: SO:0000690 name: gene_with_polycistronic_transcript is_a: SO:0000064 ! gene_by_transcript_attribute [Term] id: SO:0000691 name: translocation_site def: "The space between two bases in a sequence which marks the position where a translocation has occurred." [SO:ke] relationship: position_of SO:0000049 ! translocate [Term] id: SO:0000692 name: gene_with_dicistronic_transcript is_a: SO:0000690 ! gene_with_polycistronic_transcript [Term] id: SO:0000693 name: gene_with_recoded_mRNA is_a: SO:0000064 ! gene_by_transcript_attribute [Term] id: SO:0000694 name: SNP def: "SNPs are single base pair positions in genomic DNA at which different sequence alternatives (alleles) exist in normal individuals in some population(s), wherein the least frequent allele has an abundance of 1% or greater." [http://www.cgr.ki.se/cgb/groups/brookes/Articles/essence_of_snps_article.pdf] subset: SOFA synonym: "single_nucleotide_polymorphism" RELATED [] is_a: SO:1000008 !
point_mutation [Term] id: SO:0000695 name: reagent def: "A sequence used in an experiment." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000696 name: oligo def: "A short oligonucleotide sequence, of length on the order of 10's of bases; either single or double stranded." [SO:ma] subset: SOFA synonym: "oligonucleotide" RELATED [] is_a: SO:0000695 ! reagent [Term] id: SO:0000697 name: gene_with_stop_codon_read_through is_a: SO:0000693 ! gene_with_recoded_mRNA [Term] id: SO:0000698 name: gene_with_stop_codon_redefined_as_pyrrolysine is_a: SO:0000697 ! gene_with_stop_codon_read_through [Term] id: SO:0000699 name: junction def: "A junction refers to an interbase location of zero in a sequence." [SO:ke] subset: SOFA synonym: "boundary" RELATED [] is_a: SO:0000110 ! located_sequence_feature [Term] id: SO:0000700 name: remark def: "A comment about the sequence." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000701 name: possible_base_call_error def: "A region of sequence where the validity of the base calling is questionable." [SO:ke] subset: SOFA is_a: SO:0000413 ! sequence_difference [Term] id: SO:0000702 name: possible_assembly_error def: "A region of sequence where there may have been an error in the assembly." [SO:ke] subset: SOFA is_a: SO:0000413 ! sequence_difference [Term] id: SO:0000703 name: experimental_result_region def: "A region of sequence implicated in an experimental result." [SO:ke] subset: SOFA is_a: SO:0000700 ! remark [Term] id: SO:0000704 name: gene def: "A locatable region of genomic sequence, corresponding to a unit of inheritance, which is associated with regulatory regions, transcribed regions and/or other functional sequence regions" [SO:rd] subset: SOFA is_a: SO:0000001 ! region relationship: member_of SO:0005855 ! gene_group [Term] id: SO:0000705 name: tandem_repeat def: "Two or more adjacent copies of a DNA sequence." [http://www.sci.sdsu.edu/~smaloy/Glossary/T.html] subset: SOFA is_a: SO:0000657 !
repeat_region relationship: part_of SO:0000005 ! satellite_DNA [Term] id: SO:0000706 name: trans_splice_acceptor_site def: "The process that produces mature transcripts by combining exons of independent pre-mRNA molecules. The acceptor site lies on the 3' of these molecules." [SO:ke] subset: SOFA is_a: SO:0000164 ! splice_acceptor_site [Term] id: SO:0000707 name: trans_splice_donor_site def: "The site at which trans-splicing occurs." [SO:ke] synonym: "trans-splice_donor_site" RELATED [] is_a: SO:0000163 ! splice_donor_site [Term] id: SO:0000708 name: SL1_acceptor_site is_a: SO:0000706 ! trans_splice_acceptor_site [Term] id: SO:0000709 name: SL2_acceptor_site is_a: SO:0000706 ! trans_splice_acceptor_site [Term] id: SO:0000710 name: gene_with_stop_codon_redefined_as_selenocysteine is_a: SO:0000697 ! gene_with_stop_codon_read_through [Term] id: SO:0000711 name: gene_with_mRNA_recoded_by_translational_bypass is_a: SO:0000693 ! gene_with_recoded_mRNA [Term] id: SO:0000712 name: gene_with_transcript_with_translational_frameshift is_a: SO:0000693 ! gene_with_recoded_mRNA [Term] id: SO:0000713 name: DNA_motif is_a: SO:0000714 ! nucleotide_motif [Term] id: SO:0000714 name: nucleotide_motif def: "A region of nucleotide sequence corresponding to a known motif." [SO:ke] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0000715 name: RNA_motif is_a: SO:0000714 ! nucleotide_motif [Term] id: SO:0000716 name: dicistronic_mRNA synonym: "dicistronic_processed_transcript" RELATED [] is_a: SO:0000079 ! dicistronic_transcript [Term] id: SO:0000717 name: reading_frame def: "A nucleic acid sequence that when read as sequential triplets, has the potential of encoding a sequential string of amino acids. It does not contain the start or stop codon." [SO:rb] comment: This term was added after a request by SGD.\nAugust 2004. Modified after SO meeting in Cambridge to not include start or stop. subset: SOFA is_a: SO:0000001 !
region [Term] id: SO:0000718 name: blocked_reading_frame def: "A reading_frame that is interrupted by one or more stop codons; usually identified through intergenomic sequence comparisons." [SO:rb] comment: Term requested by Rama from SGD is_a: SO:0000717 ! reading_frame [Term] id: SO:0000719 name: ultracontig def: "An ordered and oriented set of scaffolds based on somewhat weaker sets of inferential evidence such as one set of mate pair reads together with supporting evidence from ESTs or location of markers from SNP or microsatellite maps, or cytogenetic localization of contained markers." [FB:WG] subset: SOFA is_a: SO:0000353 ! assembly [Term] id: SO:0000720 name: foreign_transposable_element comment: requested by Michael on 19 Nov 2004 is_a: SO:0000101 ! transposable_element [Term] id: SO:0000721 name: gene_with_dicistronic_primary_transcript comment: Requested by Michael, 19 nov 2004 is_a: SO:0000692 ! gene_with_dicistronic_transcript [Term] id: SO:0000722 name: gene_with_dicistronic_mRNA comment: Requested by MA nov 19 2004 synonym: "gene_with_dicistronic_processed_transcript" RELATED [] is_a: SO:0000692 ! gene_with_dicistronic_transcript [Term] id: SO:0000723 name: iDNA def: "Genomic sequence removed from the genome, as a normal event, by a process of recombination." [SO:ma] synonym: "intervening DNA" RELATED [] is_a: SO:0000298 ! recombination_feature [Term] id: SO:0000724 name: origin_of_transfer def: "A region of a DNA molecule where transfer is initiated during the process of conjugation or mobilization." [http:http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] subset: SOFA synonym: "oriT" RELATED [] is_a: SO:0000001 ! region [Term] id: SO:0000725 name: transit_peptide def: "The coding sequence for an N-terminal domain of a nuclear-encoded organellar protein: this domain is involved in post translational import of the protein into the organelle."
[http:http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] comment: Added to bring SO in line with the embl ddbj genbank feature table. subset: SOFA relationship: part_of SO:0000104 ! polypeptide [Term] id: SO:0000726 name: repeat_unit def: "A single repeat element." [http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html#line_types] comment: Added to comply with the feature table. is_a: SO:0000657 ! repeat_region [Term] id: SO:0000727 name: TF_module def: "A regulatory_region where more than 1 TF_binding_site together are regulatorily active" [SO:SG] comment: Requested by Stepen Grossmann Dec 2004. synonym: "CRM" RELATED [] synonym: "cis_regulatory_module" RELATED [] is_a: SO:0005836 ! regulatory_region [Term] id: SO:0000728 name: intein relationship: part_of SO:0000104 ! polypeptide [Term] id: SO:0000729 name: intein_containing_protein_coding_gene is_a: SO:0000010 ! protein_coding_gene [Term] id: SO:0000730 name: gap def: "A gap in the sequence of known length. The unknown bases are filled in with N's." [SO:ke] subset: SOFA is_a: SO:0000143 ! assembly_component relationship: part_of SO:0000353 ! assembly [Term] id: SO:0000731 name: fragment comment: added because of request by MO people. is_a: SO:0000733 ! feature_attribute [Term] id: SO:0000732 name: predicted is_a: SO:0000733 ! feature_attribute [Term] id: SO:0000733 name: feature_attribute is_a: SO:0000400 ! sequence_attribute [Term] id: SO:0000734 name: exemplar_mRNA def: "An exemplar is a representative cDNA sequence for each gene. The exemplar approach is a method that usually involves some initial clustering into gene groups and the subsequent selection of a representative from each gene group." [http:mged.sourceforge.net/ontologies/MGEDontology.php#exemplar_mRNA] comment: Added for the MO people. is_a: SO:0000082 ! processed_transcript_attribute [Term] id: SO:0000735 name: sequence_location is_a: SO:0000400 !
sequence_attribute [Term] id: SO:0000736 name: organelle_location is_a: SO:0000735 ! sequence_location [Term] id: SO:0000737 name: mitochondrial_sequence is_a: SO:0000736 ! organelle_location [Term] id: SO:0000738 name: nuclear_sequence is_a: SO:0000736 ! organelle_location [Term] id: SO:0000739 name: nucleomorphic_sequence is_a: SO:0000736 ! organelle_location [Term] id: SO:0000740 name: plastid_sequence is_a: SO:0000736 ! organelle_location [Term] id: SO:0000741 name: kinetoplast_sequence is_a: SO:0000737 ! mitochondrial_sequence [Term] id: SO:0000742 name: maxicircle_sequence is_a: SO:0000737 ! mitochondrial_sequence [Term] id: SO:0000743 name: apicoplast_sequence is_a: SO:0000740 ! plastid_sequence [Term] id: SO:0000744 name: chromoplast_sequence is_a: SO:0000740 ! plastid_sequence [Term] id: SO:0000745 name: chloroplast_sequence is_a: SO:0000740 ! plastid_sequence [Term] id: SO:0000746 name: cyanelle_sequence is_a: SO:0000740 ! plastid_sequence [Term] id: SO:0000747 name: leucoplast_sequence is_a: SO:0000740 ! plastid_sequence [Term] id: SO:0000748 name: proplastid_sequence is_a: SO:0000740 ! plastid_sequence [Term] id: SO:0000749 name: plasmid_sequence is_a: SO:0000735 ! sequence_location [Term] id: SO:0000750 name: amplification_origin def: "An origin_of_replication that is used for the amplification of a chromosomal nucleic acid sequence." [SO:ma] is_a: SO:0000296 ! origin_of_replication [Term] id: SO:0000751 name: proviral_sequence is_a: SO:0000735 ! sequence_location [Term] id: SO:0000752 name: gene_group_regulatory_region is_a: SO:0005836 ! regulatory_region relationship: member_of SO:0005855 ! gene_group [Term] id: SO:0000753 name: clone_insert relationship: part_of SO:0000151 ! clone [Term] id: SO:0000754 name: lambda_vector is_a: SO:0000440 ! vector relationship: part_of SO:0000160 ! lambda_clone [Term] id: SO:0000755 name: plasmid_vector is_a: SO:0000440 ! vector relationship: part_of SO:0000759 ! 
plasmid_clone [Term] id: SO:0000756 name: cDNA def: "DNA synthesized by reverse transcriptase using RNA as a template" [SO:ma] is_a: SO:0000695 ! reagent relationship: part_of SO:0000317 ! cDNA_clone [Term] id: SO:0000757 name: single_stranded_cDNA is_a: SO:0000756 ! cDNA [Term] id: SO:0000758 name: double_stranded_cDNA is_a: SO:0000756 ! cDNA [Term] id: SO:0000759 name: plasmid_clone is_a: SO:0000151 ! clone [Term] id: SO:0000760 name: YAC_clone is_a: SO:0000151 ! clone [Term] id: SO:0000761 name: phagemid_clone is_a: SO:0000151 ! clone [Term] id: SO:0000762 name: PAC_clone synonym: "P1_clone" RELATED [] is_a: SO:0000151 ! clone [Term] id: SO:0000763 name: fosmid_clone is_a: SO:0000151 ! clone [Term] id: SO:0000764 name: BAC_clone is_a: SO:0000151 ! clone [Term] id: SO:0000765 name: cosmid_clone is_a: SO:0000151 ! clone [Term] id: SO:0000766 name: pyrrolysyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0000767 name: clone_insert_start is_obsolete: true [Term] id: SO:0000768 name: episome def: "A plasmid that may integrate with a chromosome. " [SO:ma] is_a: SO:0000155 ! plasmid [Term] id: SO:0000769 name: tmRNA_coding_piece def: "The region of a two-piece tmRNA that bears the reading frame encoding the proteolysis tag. The tmRNA gene undergoes circular permutation in some groups of bacteria; processing of the transcripts from such a gene leaves the mature tmRNA in two pieces, base-paired together." [Indiana:kw] comment: Added in response to comment from Kelly Williams from Indiana.\nhttp://nar.oxfordjournals.org/cgi/content/full/32/15/4531\n10 Nov, 2005 relationship: part_of SO:0000584 ! tmRNA [Term] id: SO:0000770 name: tmRNA_acceptor_piece def: "The acceptor region of a two-piece tmRNA that when mature is charged at its 3' end with alanine. The tmRNA gene undergoes circular permutation in some groups of bacteria; processing of the transcripts from such a gene leaves the mature tmRNA in two pieces, base-paired together."
[Indiana:kw] comment: Added in response to Kelly Williams from Indiana\nhttp://nar.oxfordjournals.org/cgi/content/full/32/15/4531\n10 nov 2005 relationship: part_of SO:0000584 ! tmRNA [Term] id: SO:0000771 name: QTL def: "Quantitative Trait Locus (QTL) is a polymorphic locus which contains alleles that differentially affect the expression of a continuously distributed phenotypic trait. Usually it is a marker described by statistical association to quantitative variation in the particular phenotypic trait that is thought to be controlled by the cumulative action of alleles at multiple loci." [http:rgd.cbi.pku.edu.cn/tu/qtls/] comment: Added in response to request by Simon Twigger November 14th 2005 is_a: SO:0000001 ! region [Term] id: SO:0000772 name: genomic_island comment: Genomic islands are transmissible elements characterized by large size (>10kb). is_a: SO:0000001 ! region [Term] id: SO:0000773 name: pathogenic_island def: "Mobile genetic elements that contribute to rapid changes in virulence potential. They are present on the genomes of pathogenic strains but absent from the genomes of non pathogenic members of the same or related species." [SO:ke] comment: Nature Reviews Microbiology 2, 414-424 (2004); doi:10.1038/nrmicro884 \nGENOMIC ISLANDS IN PATHOGENIC AND ENVIRONMENTAL MICROORGANISMS\nUlrich Dobrindt, Bianca Hochhut, Ute Hentschel & Jorg Hacker is_a: SO:0000772 ! genomic_island [Term] id: SO:0000774 name: metabolic_island def: "A transmissible_element containing genes involved in metabolism, analogous to the pathogenicity islands of gram negative bacteria." [SO:ke] comment: genes for phenolic compound degradation in Pseudomonas putida are found on metabolic islands is_a: SO:0000772 !
genomic_island [Term] id: SO:0000775 name: adaptive_island comment: The iron-uptake ability of many pathogens is conveyed by adaptive islands.\nNature Reviews Microbiology 2, 414-424 (2004); doi:10.1038/nrmicro884 \nGENOMIC ISLANDS IN PATHOGENIC AND ENVIRONMENTAL MICROORGANISMS\nUlrich Dobrindt, Bianca Hochhut, Ute Hentschel & Jorg Hacker is_a: SO:0000772 ! genomic_island [Term] id: SO:0000776 name: symbiosis_island def: "A transmissible_element containing genes involved in symbiosis, analogous to the pathogenicity islands of gram negative bacteria." [SO:ke] comment: Nitrogen fixation in Rhizobiaceae species is encoded by symbiosis islands.\n\nEvolution of rhizobia by acquisition of a 500-kb symbiosis island that integrates into a phe-tRNA gene\nJohn T. Sullivan and Clive W. Ronson\nPNAS 1998 Apr 28 95 (9) 5145-5149\n is_a: SO:0000772 ! genomic_island [Term] id: SO:0000777 name: pseudogenic_rRNA comment: Added Jan 2006 to allow the annotation of the pseudogenic rRNA by flybase. subset: SOFA is_a: SO:0000462 ! pseudogenic_region [Term] id: SO:0000778 name: pseudogenic_tRNA comment: Added Jan 2006 to allow the annotation of the pseudogenic tRNA by flybase. subset: SOFA is_a: SO:0000462 ! pseudogenic_region [Term] id: SO:0001044 name: nuclear_mt_pseudogene synonym: "NUMT" RELATED [] synonym: "nuclear_mitochondrial_pseudogene" RELATED [] is_a: SO:0000042 ! pseudogene_attribute [Term] id: SO:0005836 name: regulatory_region def: "A DNA sequence that controls the expression of a gene." [http://www.genpromag.com/scripts/glossary.asp?LETTER=R] subset: SOFA is_a: SO:0000001 ! region relationship: member_of SO:0000704 ! gene [Term] id: SO:0005837 name: snRNA_4.5S_primary_transcript def: "A primary transcript encoding a 4.5S snRNA." [SO:ke] synonym: "4.5S_snRNA_primary_transcript" RELATED [] is_a: SO:0000231 ! snRNA_primary_transcript [Term] id: SO:0005839 name: snRNA_4.5S synonym: "4.5S_snRNA" RELATED [] is_a: SO:0000274 !
snRNA [Term] id: SO:0005841 name: methylation_guide_snoRNA is_a: SO:0000275 ! snoRNA [Term] id: SO:0005843 name: rRNA_cleavage_snoRNA is_a: SO:0000275 ! snoRNA [Term] id: SO:0005845 name: single_exon is_a: SO:0000147 ! exon [Term] id: SO:0005847 name: member_of_gene_cassette_array is_a: SO:0005848 ! member_of_gene_cassette [Term] id: SO:0005848 name: member_of_gene_cassette is_a: SO:0000081 ! member_gene_array [Term] id: SO:0005849 name: member_of_gene_subarray is_a: SO:0000081 ! member_gene_array [Term] id: SO:0005850 name: primer_binding_site def: "Non-covalent primer binding site for initiation of replication, transcription, or reverse transcription." [http:www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html] relationship: part_of SO:0000186 ! LTR_retrotransposon [Term] id: SO:0005851 name: gene_array def: "An array includes two or more genes, or two or more gene subarrays, contiguously arranged where the individual genes, or subarrays, are either identical in sequence, or essentially so." [SO:ma] comment: This would include\, for example\, a cluster of genes each encoding the major ribosomal RNAs and a cluster of histone gene subarrays. is_a: SO:0005855 ! gene_group [Term] id: SO:0005852 name: gene_subarray def: "A subarray is, by definition, a member of a gene array (SO:0005851); the members of a subarray may differ substantially in sequence, but are closely related in function." [SO:ma] comment: This would include\, for example\, a cluster of genes encoding different histones. is_a: SO:0005851 ! gene_array [Term] id: SO:0005853 name: gene_cassette def: "A non-functional gene that, when captured by recombination forms a functional gene." [SO:ma] comment: This would include\, for example\, the mating type gene cassettes of S. cerevisiae. is_a: SO:0005855 ! gene_group [Term] id: SO:0005854 name: gene_cassette_array def: "An array of non-functional genes whose members, when captured by recombination form functional genes."
[SO:ma] comment: This would include\, for example\, the arrays of non-functional VSG genes of Trypanosomes. is_a: SO:0005853 ! gene_cassette [Term] id: SO:0005855 name: gene_group def: "A collection of related genes." [SO:ma] subset: SOFA is_a: SO:0000001 ! region [Term] id: SO:0005856 name: selenocysteine_tRNA_primary_transcript def: "A primary transcript encoding seryl tRNA (SO:000269)." [SO:ke] is_a: SO:0000210 ! tRNA_primary_transcript [Term] id: SO:0005857 name: selenocysteinyl_tRNA is_a: SO:0000253 ! tRNA [Term] id: SO:0005858 name: syntenic_region def: "A region in which two or more pairs of homologous markers occur on the same chromosome in two or more species." [http://tbase.jax.org/docs/glossary.html] is_a: SO:0000330 ! conserved_region [Term] id: SO:1000002 name: substitution def: "Any change in genomic DNA caused by a single event." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] subset: SOFA is_a: SO:0000001 ! region is_a: SO:0000109 ! sequence_variant relationship: sequence_of SO:0000048 ! substitute [Term] id: SO:1000004 name: partially_characterised_change_in_DNA_sequence def: "The nature of the mutation event is only partially characterised." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000007 ! uncharacterised_change_in_nucleotide_sequence [Term] id: SO:1000005 name: complex_substitution def: "When no simple or well defined DNA mutation event describes the observed DNA change, the keyword \"complex\" should be used. Usually there are multiple equally plausible explanations for the change." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] subset: SOFA is_a: SO:1000002 ! substitution [Term] id: SO:1000007 name: uncharacterised_change_in_nucleotide_sequence def: "The nature of the mutation event is either uncharacterised or only partially characterised." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000002 ! 
substitution [Term] id: SO:1000008 name: point_mutation def: "A mutation event where a single DNA nucleotide changes into another nucleotide." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] subset: SOFA is_a: SO:1000002 ! substitution [Term] id: SO:1000009 name: transition def: "Change of a pyrimidine nucleotide, C or T, into an other pyrimidine nucleotide, or change of a purine nucleotide, A or G, into an other purine nucleotide." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000008 ! point_mutation [Term] id: SO:1000010 name: pyrimidine_transition def: "A substitution of a pyrimidine, C or T, for another pyrimidine." [SO:ke] is_a: SO:1000009 ! transition [Term] id: SO:1000011 name: C_to_T_transition def: "A transition of a cytidine to a thymine." [SO:ke] is_a: SO:1000010 ! pyrimidine_transition [Term] id: SO:1000012 name: C_to_T_transition_at_pCpG_site def: "The transition of cytidine to thymine occurring at a pCpG site as a consequence of the spontaneous deamination of 5'-methylcytidine." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000011 ! C_to_T_transition [Term] id: SO:1000013 name: T_to_C_transition is_a: SO:1000010 ! pyrimidine_transition [Term] id: SO:1000014 name: purine_transition def: "A substitution of a purine, A or G, for another purine." [SO:ke] is_a: SO:1000009 ! transition [Term] id: SO:1000015 name: A_to_G_transition def: "A transition of an adenine to a guanine." [SO:ke] is_a: SO:1000014 ! purine_transition [Term] id: SO:1000016 name: G_to_A_transition def: "A transition of a guanine to an adenine." [SO:ke] is_a: SO:1000014 ! purine_transition [Term] id: SO:1000017 name: transversion def: "Change of a pyrimidine nucleotide, C or T, into a purine nucleotide, A or G, or vice versa." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000008 ! 
point_mutation [Term] id: SO:1000018 name: pyrimidine_to_purine_transversion def: "Change of a pyrimidine nucleotide, C or T, into a purine nucleotide, A or G." [SO:ke] is_a: SO:1000017 ! transversion [Term] id: SO:1000019 name: C_to_A_transversion def: "A transversion from cytidine to adenine." [SO:ke] is_a: SO:1000018 ! pyrimidine_to_purine_transversion [Term] id: SO:1000020 name: C_to_G_transversion is_a: SO:1000018 ! pyrimidine_to_purine_transversion [Term] id: SO:1000021 name: T_to_A_transversion def: "A transversion from T to A." [SO:ke] is_a: SO:1000018 ! pyrimidine_to_purine_transversion [Term] id: SO:1000022 name: T_to_G_transversion def: "A transversion from T to G." [SO:ke] is_a: SO:1000018 ! pyrimidine_to_purine_transversion [Term] id: SO:1000023 name: purine_to_pyrimidine_transversion def: "Change of a purine nucleotide, A or G , into a pyrimidine nucleotide C or T." [SO:ke] is_a: SO:1000017 ! transversion [Term] id: SO:1000024 name: A_to_C_transversion def: "A transversion from adenine to cytidine." [SO:ke] is_a: SO:1000023 ! purine_to_pyrimidine_transversion [Term] id: SO:1000025 name: A_to_T_transversion def: "A transversion from adenine to thymine." [SO:ke] is_a: SO:1000023 ! purine_to_pyrimidine_transversion [Term] id: SO:1000026 name: G_to_C_transversion def: "A transversion from guanine to cytidine." [SO:ke] is_a: SO:1000023 ! purine_to_pyrimidine_transversion [Term] id: SO:1000027 name: G_to_T_transversion def: "A transversion from guanine to thymine." [SO:ke] is_a: SO:1000023 ! purine_to_pyrimidine_transversion [Term] id: SO:1000028 name: intrachromosomal_mutation is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:1000029 name: chromosomal_deletion synonym: "(Drosophila)Df" RELATED [] synonym: "(bacteria)&Dgr;" RELATED [] synonym: "(fungi)D" RELATED [] is_a: SO:0000550 ! aneuploid_chromosome is_a: SO:1000028 ! 
intrachromosomal_mutation [Term] id: SO:1000030 name: chromosomal_inversion synonym: "(Drosophila)In" RELATED [] synonym: "(bacteria)IN" RELATED [] synonym: "(fungi)In" RELATED [] is_a: SO:1000028 ! intrachromosomal_mutation [Term] id: SO:1000031 name: interchromosomal_mutation is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:1000032 name: indel def: "A hybrid term (insertion/deletion) to describe sequence length change when the direction of the change is unspecified." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:0000109 ! sequence_variant [Term] id: SO:1000033 name: nucleotide_deletion def: "One or more continuous nucleotides are excised from the sequence." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000032 ! indel [Term] id: SO:1000034 name: nucleotide_insertion def: "One or more nucleotides are added between two adjacent nucleotides in the sequence." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000032 ! indel [Term] id: SO:1000035 name: nucleotide_duplication def: "One or more nucleotides are added between two adjacent nucleotides in the sequence; the inserted sequence derives from, or is identical in sequence to, nucleotides adjacent to insertion point." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000034 ! nucleotide_insertion [Term] id: SO:1000036 name: inversion def: "A continuous nucleotide sequence is inverted in the same position." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] subset: SOFA is_a: SO:0000001 ! region is_a: SO:0000109 ! sequence_variant relationship: sequence_of SO:0000047 ! invert [Term] id: SO:1000037 name: chromosomal_duplication synonym: "(Drosophila)Dp" RELATED [] synonym: "(fungi)Dp" RELATED [] is_a: SO:0000550 ! aneuploid_chromosome is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:1000038 name: intrachromosomal_duplication is_a: SO:1000028 ! 
intrachromosomal_mutation is_a: SO:1000037 ! chromosomal_duplication [Term] id: SO:1000039 name: direct_tandem_duplication is_a: SO:1000173 ! tandem_duplication [Term] id: SO:1000040 name: inverted_tandem_duplication is_a: SO:1000173 ! tandem_duplication [Term] id: SO:1000041 name: intrachromosomal_transposition synonym: "(Drosophila)Tp" RELATED [] is_a: SO:0000453 ! transposition is_a: SO:1000038 ! intrachromosomal_duplication [Term] id: SO:1000042 name: compound_chromosome is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:1000043 name: Robertsonian_fusion is_a: SO:1000044 ! chromosomal_translocation [Term] id: SO:1000044 name: chromosomal_translocation synonym: "(Drosophila)T" RELATED [] synonym: "(fungi)T" RELATED [] is_a: SO:1000031 ! interchromosomal_mutation [Term] id: SO:1000045 name: ring_chromosome synonym: "(Drosophila)R" RELATED [] synonym: "(fungi)C" RELATED [] is_a: SO:1000028 ! intrachromosomal_mutation [Term] id: SO:1000046 name: pericentric_inversion is_a: SO:1000030 ! chromosomal_inversion [Term] id: SO:1000047 name: paracentric_inversion is_a: SO:1000030 ! chromosomal_inversion [Term] id: SO:1000048 name: reciprocal_chromosomal_translocation is_a: SO:1000044 ! chromosomal_translocation [Term] id: SO:1000049 name: mutation_affecting_transcript def: "Any change in mature, spliced and processed, RNA that results from a change in the corresponding DNA sequence." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000132 ! consequences_of_mutation [Term] id: SO:1000050 name: no_change_in_transcript def: "No effect on the state of the RNA." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000049 ! mutation_affecting_transcript [Term] id: SO:1000052 name: complex_change_in_transcript is_a: SO:1000049 ! mutation_affecting_transcript [Term] id: SO:1000054 name: mutation_affecting_coding_sequence def: "Any of the amino acid coding triplets of a gene are affected by the DNA mutation." 
[http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000079 ! mutation_affecting_transcript_sequence [Term] id: SO:1000055 name: initiator_codon_change_in_transcript def: "The DNA mutation changes, usually destroys, the first coding triplet of a gene. Usually prevents translation although another initiator codon may be used." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000056 ! amino_acid_coding_codon_change_in_transcript [Term] id: SO:1000056 name: amino_acid_coding_codon_change_in_transcript def: "The DNA mutation affects the amino acid coding sequence of a gene; this region includes both the initiator and terminator codons." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000054 ! mutation_affecting_coding_sequence [Term] id: SO:1000057 name: synonymous_codon_change_in_transcript def: "The changed codon has the same translation product as the original codon." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000056 ! amino_acid_coding_codon_change_in_transcript [Term] id: SO:1000058 name: non_synonymous_codon_change_in_transcript def: "A DNA point mutation that causes a substitution of an amino acid by an other." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] synonym: "non-synonymous_codon_change_in_transcript" RELATED [] is_a: SO:1000056 ! amino_acid_coding_codon_change_in_transcript [Term] id: SO:1000059 name: missense_codon_change_in_transcript def: "The nucleotide change in the codon leads to a new codon coding for a new amino acid." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000058 ! non_synonymous_codon_change_in_transcript [Term] id: SO:1000060 name: conservative_missense_codon_change_in_transcript def: "The amino acid change following from the codon change does not change the gross properties (size, charge, hydrophobicity) of the amino acid at that position." 
[http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] comment: The exact rules need to be stated\, a common set of rules can be derived from e.g. BLOSUM62 amino acid distance matrix. is_a: SO:1000059 ! missense_codon_change_in_transcript [Term] id: SO:1000061 name: nonconservative_missense_codon_change_in_transcript def: "The amino acid change following from the codon change changes the gross properties (size, charge, hydrophobicity) of the amino acid in that position." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] comment: The exact rules need to be stated\, a common set of rules can be derived from e.g. BLOSUM62 amino acid distance matrix. is_a: SO:1000059 ! missense_codon_change_in_transcript [Term] id: SO:1000062 name: nonsense_codon_change_in_transcript def: "The nucleotide change in the codon triplet creates a terminator codon." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000056 ! amino_acid_coding_codon_change_in_transcript [Term] id: SO:1000063 name: terminator_codon_change_in_transcript is_a: SO:1000054 ! mutation_affecting_coding_sequence [Term] id: SO:1000064 name: mutation_affecting_reading_frame def: "An umbrella term for terms describing an effect of a mutation on the frame of translation." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000054 ! mutation_affecting_coding_sequence [Term] id: SO:1000065 name: frameshift_mutation def: "." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000064 ! mutation_affecting_reading_frame [Term] id: SO:1000066 name: plus_1_frameshift_mutation is_a: SO:1000065 ! frameshift_mutation [Term] id: SO:1000067 name: minus_1_frameshift_mutation is_a: SO:1000065 ! frameshift_mutation [Term] id: SO:1000068 name: plus_2_frameshift_mutation is_a: SO:1000065 ! frameshift_mutation [Term] id: SO:1000069 name: minus_2_frameshift_mutation is_a: SO:1000065 ! 
frameshift_mutation [Term] id: SO:1000070 name: mutation_affecting_transcript_processing def: "Mutation affects the way in which the primary transcriptional product is processed to form the mature transcript." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000079 ! mutation_affecting_transcript_sequence [Term] id: SO:1000071 name: mutation_affecting_splicing def: "Mutation affects the way in which the primary transcriptional product is processed to form the mature transcript, specifically by the removal (splicing) of intron sequences." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000132 ! consequences_of_mutation [Term] id: SO:1000072 name: splice_donor_mutation is_a: SO:1000071 ! mutation_affecting_splicing is_a: SO:1000074 ! cryptic_splice_activator_mutation [Term] id: SO:1000073 name: splice_acceptor_mutation is_a: SO:1000071 ! mutation_affecting_splicing [Term] id: SO:1000074 name: cryptic_splice_activator_mutation def: "Mutation creates a new (functional) splice site." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000071 ! mutation_affecting_splicing [Term] id: SO:1000075 name: mutation_affecting_editing def: "Mutation affects the editing of the transcript." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000070 ! mutation_affecting_transcript_processing [Term] id: SO:1000076 name: mutation_affecting_transcription def: "Mutation affects the process of transcription, its initiation, progression or termination." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000049 ! mutation_affecting_transcript [Term] id: SO:1000078 name: mutation_decreasing_rate_of_transcription is_a: SO:1000081 ! mutation_affecting_rate_of_transcription [Term] id: SO:1000079 name: mutation_affecting_transcript_sequence is_a: SO:1000049 ! mutation_affecting_transcript [Term] id: SO:1000080 name: mutation_increasing_rate_of_transcription is_a: SO:1000081 ! 
mutation_affecting_rate_of_transcription [Term] id: SO:1000081 name: mutation_affecting_rate_of_transcription is_a: SO:1000076 ! mutation_affecting_transcription [Term] id: SO:1000082 name: mutation_affecting_transcript_stability def: "Mutation affects the stability of the transcript." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000079 ! mutation_affecting_transcript_sequence [Term] id: SO:1000083 name: mutation_increasing_transcript_stability def: "Mutation increases the stability (half-life) of the transcript." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000082 ! mutation_affecting_transcript_stability [Term] id: SO:1000084 name: mutation_decreasing_transcript_stability def: "Mutation decreases the stability (half-life) of the transcript." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000082 ! mutation_affecting_transcript_stability [Term] id: SO:1000085 name: mutation_affecting_level_of_transcript is_a: SO:1000049 ! mutation_affecting_transcript [Term] id: SO:1000086 name: mutation_decreasing_level_of_transcript is_a: SO:1000085 ! mutation_affecting_level_of_transcript [Term] id: SO:1000087 name: mutation_increasing_level_of_transcript is_a: SO:1000085 ! mutation_affecting_level_of_transcript [Term] id: SO:1000088 name: mutation_affecting_translational_product def: "Mutation causes a change in primary translation product of a transcript." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000132 ! consequences_of_mutation [Term] id: SO:1000089 name: no_change_of_translational_product def: "The change at RNA level does not lead to any change in polypeptide." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000088 ! mutation_affecting_translational_product [Term] id: SO:1000090 name: uncharacterised_change_of_translational_product def: "The nature of the mutation event is either uncharacterised or only partially characterised." 
[http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000088 ! mutation_affecting_translational_product [Term] id: SO:1000091 name: partially_characterised_change_of_translational_product def: "The nature of the mutation event is only partially characterised." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000090 ! uncharacterised_change_of_translational_product [Term] id: SO:1000092 name: complex_change_of_translational_product def: "Any mutation effect that is known at nucleotide level but cannot be explained by using other key terms." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000088 ! mutation_affecting_translational_product [Term] id: SO:1000093 name: amino_acid_substitution def: "The replacement of a single amino acid by another." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000105 ! mutation_affecting_polypeptide_amino_acid_sequence [Term] id: SO:1000094 name: conservative_amino_acid_substitution is_a: SO:1000093 ! amino_acid_substitution [Term] id: SO:1000095 name: nonconservative_amino_acid_substitution is_a: SO:1000093 ! amino_acid_substitution [Term] id: SO:1000096 name: amino_acid_insertion def: "The insertion of one or more amino acids into the polypeptide, without affecting the surrounding sequence." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000105 ! mutation_affecting_polypeptide_amino_acid_sequence [Term] id: SO:1000097 name: amino_acid_deletion def: "The deletion of one or more amino acids from the polypeptide, without affecting the surrounding sequence." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000105 ! mutation_affecting_polypeptide_amino_acid_sequence [Term] id: SO:1000098 name: polypeptide_truncation def: "The translational product is truncated at its C-terminus, usually a result of a nonsense codon change in transcript (SO:1000062)."
[http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000105 ! mutation_affecting_polypeptide_amino_acid_sequence [Term] id: SO:1000099 name: polypeptide_elongation def: "The extension of the translational product at either (or both) the N-terminus and/or the C-terminus." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000105 ! mutation_affecting_polypeptide_amino_acid_sequence [Term] id: SO:1000100 name: polypeptide_N_terminal_elongation def: "." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] synonym: "polypeptide_N-terminal_elongation" RELATED [] is_a: SO:1000099 ! polypeptide_elongation [Term] id: SO:1000101 name: polypeptide_C_terminal_elongation def: "." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] synonym: "polypeptide_C-terminal_elongation" RELATED [] is_a: SO:1000099 ! polypeptide_elongation [Term] id: SO:1000102 name: mutation_affecting_level_of_translational_product is_a: SO:1000088 ! mutation_affecting_translational_product [Term] id: SO:1000103 name: mutation_decreasing_level_of_translation_product is_a: SO:1000102 ! mutation_affecting_level_of_translational_product [Term] id: SO:1000104 name: mutation_increasing_level_of_translation_product is_a: SO:1000102 ! mutation_affecting_level_of_translational_product [Term] id: SO:1000105 name: mutation_affecting_polypeptide_amino_acid_sequence is_a: SO:1000088 ! mutation_affecting_translational_product [Term] id: SO:1000106 name: inframe_polypeptide_N_terminal_elongation synonym: "inframe_polypeptide_N-terminal_elongation" RELATED [] is_a: SO:1000100 ! polypeptide_N_terminal_elongation [Term] id: SO:1000107 name: out_of_frame_polypeptide_N_terminal_elongation synonym: "out_of_frame_polypeptide_N-terminal_elongation" RELATED [] is_a: SO:1000100 ! polypeptide_N_terminal_elongation [Term] id: SO:1000108 name: inframe_polypeptide_C_terminal_elongation synonym: "inframe_polypeptide_C-terminal_elongation" RELATED [] is_a: SO:1000101 ! 
polypeptide_C_terminal_elongation [Term] id: SO:1000109 name: out_of_frame_polypeptide_C_terminal_elongation synonym: "out_of_frame_polypeptide_C-terminal_elongation" RELATED [] is_a: SO:1000101 ! polypeptide_C_terminal_elongation [Term] id: SO:1000110 name: frame_restoring_mutation is_a: SO:1000065 ! frameshift_mutation [Term] id: SO:1000111 name: mutation_affecting_3D_structure_of_polypeptide synonym: "mutation_affecting_3D-structure_of_polypeptide" RELATED [] is_a: SO:1000088 ! mutation_affecting_translational_product [Term] id: SO:1000112 name: no_3D_structural_change is_a: SO:1000111 ! mutation_affecting_3D_structure_of_polypeptide [Term] id: SO:1000113 name: uncharacterised_3D_structural_change is_a: SO:1000111 ! mutation_affecting_3D_structure_of_polypeptide [Term] id: SO:1000114 name: partially_characterised_3D_structural_change is_a: SO:1000113 ! uncharacterised_3D_structural_change [Term] id: SO:1000115 name: complex_3D_structural_change is_a: SO:1000111 ! mutation_affecting_3D_structure_of_polypeptide [Term] id: SO:1000116 name: conformational_change is_a: SO:1000111 ! mutation_affecting_3D_structure_of_polypeptide [Term] id: SO:1000117 name: mutation_affecting_polypeptide_function is_a: SO:1000088 ! mutation_affecting_translational_product [Term] id: SO:1000118 name: loss_of_function_of_polypeptide synonym: "loss-of-function_of_polypeptide" RELATED [] is_a: SO:1000117 ! mutation_affecting_polypeptide_function [Term] id: SO:1000119 name: inactive_ligand_binding_site is_a: SO:1000118 ! loss_of_function_of_polypeptide [Term] id: SO:1000120 name: inactive_catalytic_site is_a: SO:1000119 ! inactive_ligand_binding_site [Term] id: SO:1000121 name: polypeptide_localization_affected is_a: SO:1000117 ! mutation_affecting_polypeptide_function [Term] id: SO:1000122 name: polypeptide_post_translational_processing_affected synonym: "polypeptide_post-translational_processing_affected" RELATED [] is_a: SO:1000117 ! 
mutation_affecting_polypeptide_function is_a: SO:1000118 ! loss_of_function_of_polypeptide [Term] id: SO:1000123 name: polypeptide_post_translational_processing_affected synonym: "polypeptide_post-translational_processing_affected" RELATED [] is_obsolete: true [Term] id: SO:1000124 name: partial_loss_of_function_of_polypeptide synonym: "partial_loss-of-function_of_polypeptide" RELATED [] is_a: SO:1000118 ! loss_of_function_of_polypeptide [Term] id: SO:1000125 name: gain_of_function_of_polypeptide synonym: "gain-of-function_of_polypeptide" RELATED [] is_a: SO:1000117 ! mutation_affecting_polypeptide_function [Term] id: SO:1000126 name: mutation_affecting_transcript_secondary_structure is_a: SO:1000079 ! mutation_affecting_transcript_sequence [Term] id: SO:1000127 name: compensatory_transcript_secondary_structure_mutation is_a: SO:1000126 ! mutation_affecting_transcript_secondary_structure [Term] id: SO:1000132 name: consequences_of_mutation is_a: SO:0000000 ! Sequence_Ontology [Term] id: SO:1000134 name: polypeptide_fusion is_a: SO:1000105 ! mutation_affecting_polypeptide_amino_acid_sequence [Term] id: SO:1000136 name: autosynaptic_chromosome synonym: "(Drosophila)A" RELATED [] is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:1000138 name: homo_compound_chromosome synonym: "homo-compound_chromosome" RELATED [] is_a: SO:1000042 ! compound_chromosome [Term] id: SO:1000140 name: hetero_compound_chromosome synonym: "hetero-compound_chromosome" RELATED [] is_a: SO:1000042 ! compound_chromosome [Term] id: SO:1000141 name: chromosome_fission is_a: SO:1000028 ! intrachromosomal_mutation [Term] id: SO:1000142 name: dexstrosynaptic_chromosome is_a: SO:1000136 ! autosynaptic_chromosome [Term] id: SO:1000143 name: laevosynaptic_chromosome is_a: SO:1000136 ! autosynaptic_chromosome [Term] id: SO:1000144 name: free_duplication is_a: SO:1000037 ! 
chromosomal_duplication [Term] id: SO:1000145 name: free_ring_duplication synonym: "(Drosophila)R" RELATED [] is_a: SO:1000045 ! ring_chromosome is_a: SO:1000144 ! free_duplication [Term] id: SO:1000146 name: complex_chromosomal_mutation is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:1000147 name: deficient_translocation def: "A translocation in which one of the four broken ends loses a segment before re-joining." [fb:reference_manual] synonym: "(Drosophila)Df" RELATED [] synonym: "(Drosophila)DfT" RELATED [] is_a: SO:1000029 ! chromosomal_deletion is_a: SO:1000044 ! chromosomal_translocation [Term] id: SO:1000148 name: inversion_cum_translocation def: "The first two breaks are in the same chromosome, and the region between them is rejoined in inverted order to the other side of the first break, such that both sides of break one are present on the same chromosome. The remaining free ends are joined as a translocation with those resulting from the third break." [fb:reference_manual] synonym: "(Drosophila)InT" RELATED [] synonym: "(Drosophila)T" RELATED [] synonym: "inversion-cum-translocation" RELATED [] is_a: SO:1000030 ! chromosomal_inversion is_a: SO:1000044 ! chromosomal_translocation [Term] id: SO:1000149 name: bipartite_duplication def: "The (large) region between the first two breaks listed is lost, and the two flanking segments (one of them centric) are joined as a translocation to the free ends resulting from the third break." [fb:reference_manual] synonym: "(Drosophila)bDp" RELATED [] is_a: SO:1000031 ! interchromosomal_mutation [Term] id: SO:1000150 name: cyclic_translocation def: "Three breaks in three different chromosomes. The centric segment resulting from the first break listed is joined to the acentric segment resulting from the second, rather than the third." [fb:reference_manual] is_a: SO:1000044 ! 
chromosomal_translocation [Term] id: SO:1000151 name: bipartite_inversion def: "Three breaks in the same chromosome; both central segments are inverted in place (i.e., they are not transposed)." [fb:reference_manual] synonym: "(Drosophila)bIn" RELATED [] is_a: SO:1000030 ! chromosomal_inversion [Term] id: SO:1000152 name: uninverted_insertional_duplication def: "A copy of the segment between the first two breaks listed is inserted at the third break; the insertion is in cytologically the same orientation as its flanking segments." [fb:reference_manual] synonym: "(Drosophila)eDp" RELATED [] is_a: SO:1000154 ! insertional_duplication [Term] id: SO:1000153 name: inverted_insertional_duplication def: "A copy of the segment between the first two breaks listed is inserted at the third break; the insertion is in cytologically inverted orientation with respect to its flanking segments." [fb:reference_manual] synonym: "(Drosophila)iDp" RELATED [] is_a: SO:1000154 ! insertional_duplication [Term] id: SO:1000154 name: insertional_duplication synonym: "(Drosophila)Dpp" RELATED [] is_a: SO:1000037 ! chromosomal_duplication [Term] id: SO:1000155 name: interchromosomal_transposition synonym: "(Drosophila)Tp" RELATED [] is_a: SO:0000453 ! transposition is_a: SO:1000031 ! interchromosomal_mutation [Term] id: SO:1000156 name: inverted_interchromosomal_transposition synonym: "(Drosophila)iTp" RELATED [] is_a: SO:1000155 ! interchromosomal_transposition [Term] id: SO:1000157 name: uninverted_interchromosomal_transposition synonym: "(Drosophila)eTp" RELATED [] is_a: SO:1000155 ! interchromosomal_transposition [Term] id: SO:1000158 name: inverted_intrachromosomal_transposition def: "The segment between the first two breaks listed is removed and inserted at the third break; the insertion is in cytologically inverted orientation with respect to its flanking segments." [fb:reference_manual] synonym: "(Drosophila)iTp" RELATED [] is_a: SO:1000030 ! chromosomal_inversion is_a: SO:1000041 ! 
intrachromosomal_transposition [Term] id: SO:1000159 name: uninverted_intrachromosomal_transposition def: "The segment between the first two breaks listed is removed and inserted at the third break; the insertion is in cytologically the same orientation as its flanking segments." [fb:reference_manual] synonym: "(Drosophila)eTp" RELATED [] is_a: SO:1000041 ! intrachromosomal_transposition [Term] id: SO:1000160 name: unoriented_insertional_duplication def: "A copy of the segment between the first two breaks listed is inserted at the third break; the orientation of the insertion with respect to its flanking segments is not recorded." [fb:reference_manual] synonym: "(Drosophila)uDp" RELATED [] is_a: SO:1000154 ! insertional_duplication [Term] id: SO:1000161 name: unorientated_interchromosomal_transposition synonym: "(Drosophila)uTp" RELATED [] is_a: SO:1000155 ! interchromosomal_transposition [Term] id: SO:1000162 name: unorientated_intrachromosomal_transposition def: "The segment between the first two breaks listed is removed and inserted at the third break; the orientation of the insertion with respect to its flanking segments is not recorded." [fb:reference_manual] synonym: "(Drosophila)uTp" RELATED [] is_a: SO:1000041 ! intrachromosomal_transposition [Term] id: SO:1000170 name: uncharacterised_chromosomal_mutation is_a: SO:1000183 ! chromosome_structure_variation [Term] id: SO:1000171 name: deficient_inversion def: "Three breaks in the same chromosome; one central region lost, the other inverted." [fb:reference_manual] synonym: "(Drosophila)Df" RELATED [] synonym: "(Drosophila)DfIn" RELATED [] is_a: SO:1000029 ! chromosomal_deletion is_a: SO:1000030 ! chromosomal_inversion [Term] id: SO:1000173 name: tandem_duplication is_a: SO:1000038 ! intrachromosomal_duplication [Term] id: SO:1000175 name: partially_characterised_chromosomal_mutation is_a: SO:1000170 ! 
uncharacterised_chromosomal_mutation [Term] id: SO:1000177 name: uncharacterised_change_in_transcript def: "The nature of the mutation event is either uncharacterised or only partially characterised." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000049 ! mutation_affecting_transcript [Term] id: SO:1000179 name: partially_characterised_change_in_transcript def: "The nature of the mutation event is only partially characterised." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000177 ! uncharacterised_change_in_transcript [Term] id: SO:1000180 name: mutation_affecting_gene_structure is_a: SO:1000132 ! consequences_of_mutation [Term] id: SO:1000181 name: gene_fusion is_a: SO:1000180 ! mutation_affecting_gene_structure [Term] id: SO:1000182 name: chromosome_number_variation is_a: SO:0000240 ! chromosome_variation [Term] id: SO:1000183 name: chromosome_structure_variation is_a: SO:0000240 ! chromosome_variation [Term] id: SO:1000184 name: mutation_causes_exon_loss is_a: SO:1000071 ! mutation_affecting_splicing [Term] id: SO:1000185 name: mutation_causes_intron_gain def: "Mutation causes an intron to be gained by the processed transcript; usually a result of a donor acceptor mutation (SO:1000072)." [http://www.ebi.ac.uk/mutations/recommendations/mutevent.html] is_a: SO:1000071 ! mutation_affecting_splicing [Term] id: SO:1000186 name: cryptic_splice_donor_activation is_a: SO:1000074 ! cryptic_splice_activator_mutation [Term] id: SO:1001186 name: cryptic_splice_acceptor_activation is_a: SO:1000074 ! cryptic_splice_activator_mutation [Term] id: SO:1001187 name: alternatively_spliced_transcript is_a: SO:0000115 ! transcript_feature [Term] id: SO:1001188 name: alternatively_spliced_transcript_encoding_1_polypeptide is_a: SO:1001187 ! alternatively_spliced_transcript [Term] id: SO:1001189 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide is_a: SO:1001187 ! 
alternatively_spliced_transcript [Term] id: SO:1001190 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_same_start_codon_different_stop_codon is_a: SO:1001189 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide is_a: SO:1001194 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_coding_regions_overlapping [Term] id: SO:1001191 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_different_start_codon_same_stop_codon is_a: SO:1001189 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide is_a: SO:1001194 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_coding_regions_overlapping [Term] id: SO:1001192 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_different_start_codon_different_stop_codon is_a: SO:1001189 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide [Term] id: SO:1001193 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_different_start_codon_different_stop_codon_coding_regions_overlapping is_a: SO:1001192 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_different_start_codon_different_stop_codon is_a: SO:1001194 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_coding_regions_overlapping [Term] id: SO:1001194 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_coding_regions_overlapping is_a: SO:1001189 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide [Term] id: SO:1001195 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_coding_regions_not_overlapping is_a: SO:1001189 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide [Term] id: SO:1001196 name: cryptogene is_a: SO:0000011 ! non_protein_coding_gene [Term] id: SO:1001197 name: dicistronic_primary_transcript is_a: SO:0000079 ! 
dicistronic_transcript [Term] id: SO:1001217 name: member_of_regulon is_a: SO:0000081 ! member_gene_array [Term] id: SO:1001244 name: alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_different_start_codon_different_stop_codon_coding_regions_non_overlapping synonym: "alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_different_start_codon_different_stop_codon_coding_regions_non-overlapping" RELATED [] is_a: SO:1001192 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_different_start_codon_different_stop_codon is_a: SO:1001195 ! alternatively_spliced_transcript_encoding_greater_than_1_polypeptide_coding_regions_not_overlapping [Term] id: SO:1001246 name: CDS_independently_known is_a: SO:1001255 ! status_of_coding_sequence [Term] id: SO:1001247 name: orphan_CDS def: "A CDS whose predicted amino acid sequence is unsupported by any experimental evidence or by any match with any other known sequence." [MA:SO] is_a: SO:1001254 ! CDS_predicted [Term] id: SO:1001249 name: CDS_supported_by_domain_match_data is_a: SO:1001254 ! CDS_predicted [Term] id: SO:1001251 name: CDS_supported_by_sequence_similarity_data is_a: SO:1001254 ! CDS_predicted [Term] id: SO:1001254 name: CDS_predicted is_a: SO:1001255 ! status_of_coding_sequence [Term] id: SO:1001255 name: status_of_coding_sequence is_a: SO:0000400 ! sequence_attribute [Term] id: SO:1001259 name: CDS_supported_by_EST_or_cDNA_data is_a: SO:1001254 ! CDS_predicted [Term] id: SO:1001260 name: internal_Shine_Dalgarno_sequence def: "A Shine-Dalgarno sequence that is upstream of a non-5' CDS in a polycistronic mRNA." [SO:ke] is_a: SO:0000243 ! internal_ribosome_entry_site is_a: SO:1001268 ! recoding_stimulatory_region [Term] id: SO:1001261 name: recoded_mRNA def: "A gene coding an mRNA which is recoded before translation, usually by special cis-acting signals." [https://www.ncbi.nlm.nih.gov/pubmed/8811194] is_a: SO:0000115 !
transcript_feature [Term] id: SO:1001262 name: minus_1_translational_frameshift is_a: SO:0000118 ! transcript_with_translational_frameshift [Term] id: SO:1001263 name: plus_1_translational_frameshift is_a: SO:0000118 ! transcript_with_translational_frameshift [Term] id: SO:1001264 name: mRNA_recoded_by_translational_bypass def: "A gene whose mRNA is translated by ribosomes that suspend translation at a particular codon and resume translation at a particular non-overlapping downstream codon." [https://www.ncbi.nlm.nih.gov/pubmed/8811194] is_a: SO:1001261 ! recoded_mRNA [Term] id: SO:1001265 name: mRNA_recoded_by_codon_redefinition def: "A gene whose mRNA is recoded by an alteration of codon meaning." [SO:ma] is_a: SO:1001261 ! recoded_mRNA [Term] id: SO:1001266 name: stop_codon_redefinition_as_selenocysteine is_a: SO:1001267 ! stop_codon_readthrough [Term] id: SO:1001267 name: stop_codon_readthrough is_a: SO:1001265 ! mRNA_recoded_by_codon_redefinition [Term] id: SO:1001268 name: recoding_stimulatory_region def: "A site in an mRNA sequence that stimulates the recoding of the same mRNA." [https://www.ncbi.nlm.nih.gov/pubmed/12519954] synonym: "recoding_stimulatory_signal" RELATED [] relationship: part_of SO:0000234 ! mRNA [Term] id: SO:1001269 name: four_bp_start_codon def: "A non-canonical start codon with 4 base pairs." [SO:ke] synonym: "4bp_start_codon" RELATED [] is_a: SO:0000680 ! non_canonical_start_codon [Term] id: SO:1001270 name: stop_codon_redefinition_as_pyrrolysine is_a: SO:1001267 ! stop_codon_readthrough [Term] id: SO:1001271 name: archeal_intron def: "Intron characteristic of tRNA genes; splices by an endonuclease-ligase mediated mechanism." [SO:ma] is_a: SO:0000661 ! intron_attribute [Term] id: SO:1001272 name: tRNA_intron is_a: SO:0000661 ! intron_attribute [Term] id: SO:1001273 name: CTG_start_codon def: "A non-canonical start codon of sequence CTG." [SO:ke] is_a: SO:0000680 !
non_canonical_start_codon [Term] id: SO:1001274 name: SECIS_element def: "The incorporation of selenocysteine into a protein sequence is directed by an in-frame UGA codon (usually a stop codon) within the coding region of the mRNA. Selenoprotein mRNAs contain a conserved secondary structure in the 3' UTR that is required for the distinction of UGA stop from UGA selenocysteine. The selenocysteine insertion sequence (SECIS) is around 60 nt in length and adopts a hairpin structure which is sufficiently well-defined and conserved to act as a computational screen for selenoprotein genes." [http://www.sanger.ac.uk/cgi-bin/Rfam/getacc?RF00031] is_a: SO:1001268 ! recoding_stimulatory_region [Term] id: SO:1001275 name: retron def: "Sequence coding for a short, single-stranded, DNA sequence via a retrotransposed RNA intermediate; characteristic of some microbial genomes." [SO:ma] is_a: SO:0000009 ! gene_class [Term] id: SO:1001277 name: three_prime_recoding_site is_a: SO:1001268 ! recoding_stimulatory_region [Term] id: SO:1001279 name: three_prime_stem_loop_structure def: "The stem-loop secondary structural element downstream of the redefined region." [SO:ke] is_a: SO:1001277 ! three_prime_recoding_site [Term] id: SO:1001280 name: five_prime_recoding_site def: "The recoding signal found 5' of the redefined codon." [SO:ke] is_a: SO:1001268 ! recoding_stimulatory_region [Term] id: SO:1001281 name: flanking_three_prime_quadruplet_recoding_signal def: "Four base pair sequence immediately downstream of the redefined region. The redefined region is a frameshift site. The quadruplet is 2 overlapping codons." [SO:ke] is_a: SO:1001277 ! three_prime_recoding_site [Term] id: SO:1001282 name: UAG_stop_codon_signal is_a: SO:1001288 ! stop_codon_signal [Term] id: SO:1001283 name: UAA_stop_codon_signal is_a: SO:1001288 ! stop_codon_signal [Term] id: SO:1001284 name: regulon def: "A group of genes, whether linked as a cluster or not, that respond to a common regulatory signal." 
[ISBN:0198506732] subset: SOFA is_a: SO:0005855 ! gene_group [Term] id: SO:1001285 name: UGA_stop_codon_signal is_a: SO:1001288 ! stop_codon_signal [Term] id: SO:1001286 name: three_prime_repeat_recoding_signal def: "It is a downstream sequence important for recoding that contains repetitive elements." [SO:ke] is_a: SO:1001277 ! three_prime_recoding_site [Term] id: SO:1001287 name: distant_three_prime_recoding_signal def: "A recoding signal that is found many hundreds of nucleotides 3' of a redefined stop codon." [https://www.ncbi.nlm.nih.gov/pubmed/8709208] is_a: SO:1001277 ! three_prime_recoding_site [Term] id: SO:1001288 name: stop_codon_signal is_a: SO:1001268 ! recoding_stimulatory_region [Term] id: SO:2000061 name: databank_entry def: "The sequence referred to by an entry in a databank such as Genbank or SwissProt." [SO:ke] subset: SOFA synonym: "accession" RELATED [] is_a: SO:0000695 ! reagent [Typedef] id: adjacent_to name: adjacent_to def: "A geometric operator, specified in Egenhofer 1989. Two features meet if they share a junction on the sequence." [SO:ke] subset: SOFA domain: SO:0000110 ! located_sequence_feature range: SO:0000110 ! located_sequence_feature is_symmetric: true [Typedef] id: associated_with name: associated_with comment: This relationship is vague and up for discussion. is_symmetric: true [Typedef] id: derives_from name: derives_from subset: SOFA is_transitive: true [Typedef] id: genome_of name: genome_of [Typedef] id: has_genome_location name: has_genome_location domain: SO:0000085 ! gene_by_genome_location range: SO:0000704 ! gene is_obsolete: true [Typedef] id: homologous_to name: homologous_to is_symmetric: true is_a: similar_to ! similar_to [Typedef] id: member_of name: member_of comment: A subtype of part_of.\ninverse is collection_of.\nWinston, M, Chaffin, R, Herrmann: A taxonomy of part-whole relations. Cognitive Science 1987, 11:417-444. subset: SOFA is_a: part_of !
part_of [Typedef] id: non_functional_homolog_of name: non_functional_homolog_of def: "A relationship between a pseudogenic feature and its functional ancestor." [SO:ke] is_a: homologous_to ! homologous_to [Typedef] id: orthologous_to name: orthologous_to is_symmetric: true is_a: homologous_to ! homologous_to [Typedef] id: paralogous_to name: paralogous_to is_symmetric: true is_a: homologous_to ! homologous_to [Typedef] id: part_of name: part_of subset: SOFA is_transitive: true [Typedef] id: position_of name: position_of [Typedef] id: regulated_by name: regulated_by is_obsolete: true [Typedef] id: sequence_of name: sequence_of [Typedef] id: similar_to name: similar_to is_symmetric: true
bp_dbsplit100644000766000024 522214006427353 16036 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
#-*-Perl-*-

=head1 NAME

bp_dbsplit - script to split an input set of database(s) into smaller pieces

=head1 SYNOPSIS

  bp_dbsplit.PLS --size 50 [-i inputfile] [-if inputformat] [-of outputformat]
                 [--prefix outputprefix] [ < file1 file2 OR file1 file2]

=head1 DESCRIPTION

This script takes a sequence database as input, given as a list of
filenames, as a single file, or on STDIN, and splits it into separate
files of X sequences each. You specify X with the C<--size/-s>
parameter. The input and output sequence format is any that is
supported by bioperl (fasta, embl, genbank, gcg, swissprot, etc).

You can specify the input data either as a single file with -i
filename, or as one or more files given as arguments, like

  % bp_dbsplit file1 file2

or as sequence data piped in on STDIN with

  % cat file1 file2 file3 | bp_dbsplit

Use C<--prefix> to set the output filename prefix. Output files are
named C<prefix.1>, C<prefix.2>, and so on; without C<--prefix> the
prefix defaults to the input filename, or to C<db> when reading from
STDIN.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list. Your participation is much appreciated.
  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR

Jason Stajich, jason-at-bioperl-dot-org

=cut

use strict;
use warnings;
use Bio::SeqIO;
use Bio::SeqIO::MultiFile;
use Getopt::Long;

my $dbsize = 100;    # sequences per output file
my $prefix;
my ($informat,$outformat,$infile) = ( 'fasta', 'fasta');

GetOptions (
    's|size:s'   => \$dbsize,
    'if:s'       => \$informat,
    'of:s'       => \$outformat,
    'i:s'        => \$infile,
    'p|prefix:s' => \$prefix,
);

# A single remaining argument is treated as the input file
if( @ARGV == 1 ) {
    $infile = shift @ARGV;
}
$prefix ||= $infile || $ARGV[0] || 'db';

# Read from several files, one file, or STDIN
my $in;
if( @ARGV ) {
    $in = Bio::SeqIO::MultiFile->new(-files  => [@ARGV],
                                     -format => $informat || 'fasta');
} elsif( $infile ) {
    $in = Bio::SeqIO->new(-file   => $infile,
                          -format => $informat);
} else {
    $in = Bio::SeqIO->new(-format => $informat);
}

my $count = 1;
my $out = Bio::SeqIO->new(-format => $outformat,
                          -file   => ">$prefix.$count");
my $scount = 0;
while( my $seq = $in->next_seq ) {
    # Start a new output file once the current one holds $dbsize sequences
    if( ++$scount > $dbsize && $count ) {
        $out->close();
        undef($out);
        $count++;
        $out = Bio::SeqIO->new(-format => $outformat,
                               -file   => ">$prefix.$count");
        $scount = 1;
    }
    $out->write_seq($seq);
}

__END__
bp_seqpart100644000766000024 1062214006427353 16074 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
use strict;
use warnings;

=head1 NAME

I - Takes one or more sequence files and splits them into a number of load balanced files.

=head1 USAGE

bp_seqpart.pl -n [-h, -p , -f , -o ]

    -n number of files to create through partitioning
    -h this help message
    -p prefix for all FASTA file names output, files are of the form /#.
    -f format of the files, defaults to FASTA but you can specify anything supported by SeqIO from BioPerl
    -o output directory where to dump the split sequence files

=head1 DESCRIPTION

Script wrapping SeqIO that partitions multiple sequence files into
near equal sized parts for later parallel processing. Sequences are
balanced by total sequence length, regardless of the number of input
files: 10 input files split into 10 output files will still be
rebalanced so that each output file contains a similar total length
of sequence. Sequence IDs are ignored when deciding how to balance
each sequence.

=head1 AUTHOR

B - I

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via
email or the web:

  https://github.com/bioperl/bioperl-live/issues

=head1 EDIT HISTORY

2012-04-03 - Matt Oates
    First features added.

=cut

=head1 DEPENDENCIES

B Used to parse command line options.

B Used for usage and help output.

B Used to cut up sequences and parse FASTA.

=cut

use Getopt::Long; #Deal with command line options
use Pod::Usage;   #Print a usage man page from the POD in this file
use Bio::SeqIO;   #Deal with sequence parsing, format and file IO

# Command Line Options
my $help;             #Whether to print the full POD man page
my $prefix = 'part';  #Name each part
my $format = 'fasta'; #Sequence format we are using, default to fasta
my $outdir = '.';     #Use the current directory as default
my $num_splits;       #Number of files to split into
my @partitions;       #Details of each partition for the split

#Set command line flags and parameters.
GetOptions("help|h!"
                         => \$help,
           "prefix|p=s"  => \$prefix,
           "format|f=s"  => \$format,
           "num-splits|n=i" => \$num_splits,
           "outdir|o=s"  => \$outdir,
) or die "Fatal Error: Problem parsing command-line ".$!;

#Print out some help if it was asked for or if the number of parts was not given.
pod2usage(-exitstatus => 0, -verbose => 2) if $help;
pod2usage(-exitstatus => 0, -verbose => 1, -msg => 'Please specify the number of split parts with -n ') unless defined $num_splits;

#Setup a bunch of empty partitions including some SeqIO file handles to write to
@partitions = map {
    my $name = "$outdir/$prefix$_.$format";
    +{
        name   => $name, #Output file name, kept for the final report
        length => 0,     #Number of sequences written so far
        size   => 0,     #Total sequence length written so far
        file   => Bio::SeqIO->new(
            -file   => ">$name",
            -format => $format,
        ),
    }
} 1..$num_splits;

#Get sequences from all the files specified.
foreach my $file (@ARGV) {
    #Open each input file in turn for reading
    my $in = Bio::SeqIO->new(
        -file   => "<$file",
        -format => $format
    );
    #While there are still sequences to consume
    while ( my $seq = $in->next_seq() ) {
        #Sort the partitions on how full they are
        @partitions = sort {$a->{size} <=> $b->{size}} @partitions;
        #Add the length of the current seq to the smallest partition size
        my $length = $seq->length;
        $partitions[0]{size} += $length;
        #Increase the sequence count of the partition
        $partitions[0]{length}++;
        #Write this sequence to the partition's file
        $partitions[0]{file}->write_seq($seq);
    }
}

#Report some basic statistics after the job. The partitions have been
#re-sorted by size, so use the stored file name rather than the position.
foreach my $partition (sort {$a->{name} cmp $b->{name}} @partitions) {
    print STDERR "$partition->{name}\n";
    print STDERR "\tSequence count = $partition->{length}\n";
    print STDERR "\tSequence characters = $partition->{size}\n";
}

1;

__END__
bp_mrtrans100644000766000024 560014006427353 16063 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
use strict;

# Author:      Jason Stajich
# Description: Perl implementation of Bill Pearson's mrtrans
#              to project protein alignment back into cDNA coordinates
#

=head1 NAME

bp_mrtrans - implement a transformer of alignments from protein to mRNA coordinates

=head1 SYNOPSIS

Usage:
bp_mrtrans -i inputfile -o outputfile [-if input format] [-of output format] [-s cDNA sequence database] [-sf cDNA sequence format] [-h] =head1 DESCRIPTION This script will convert a protein alignment back into a cDNA. Loosely based on Bill Pearson's mrtrans. The options are: -o filename - the output filename [default STDOUT] -of format - output sequence format (multiple sequence alignment) [default phylip] -i filename - the input filename [required] -if format - input sequence format (multiple sequence alignment) [ default clustalw] -s --seqdb filename - the cDNA sequence database file -sf --seqformat - the cDNA seq db format (flatfile sequence format) -h - this help menu =head1 AUTHOR Jason Stajich, jason-at-bioperl-dot-org =cut use strict; use warnings; use Bio::Align::Utilities qw(aa_to_dna_aln); use Bio::AlignIO; use Bio::SeqIO; use Getopt::Long; # TODO - finish documentation, # - add support for extra options in output alignment formats # such as idnewline in phylip out to support Molphy input files my ($iformat,$seqformat,$oformat,$seqdb,$input,$output) = ('clustalw','fasta', 'phylip'); my ($help,$usage); $usage = "usage: bp_mrtrans.pl -i prot_alignment -if align_format -o out_dna_align -of output_format -s cDNA_seqdb -sf fasta\n". "defaults: -if clustalw -of phylip -sf fasta\n"; GetOptions( 'if|iformat:s' => \$iformat, 'i|input:s' => \$input, 'o|output:s' => \$output, 'of|outformat:s'=> \$oformat, 's|seqdb|db:s' => \$seqdb, 'sf|seqformat:s'=> \$seqformat, 'h|help' => sub{ exec('perldoc',$0); exit(0) }, ); $input ||= shift; $seqdb ||= shift; $output ||= shift; if( ! defined $seqdb ) { die("cannot proceed without a valid seqdb\n$usage"); } if( ! 
defined $input ) { die("cannot proceed without an input file\n$usage"); } my $indb = new Bio::SeqIO(-file => $seqdb, -format=>$seqformat); my %seqs; while( my $seq = $indb->next_seq ) { $seqs{$seq->id} = $seq; } my $in = new Bio::AlignIO(-format => $iformat, -file => $input); my $out = new Bio::AlignIO(-format => $oformat, -idlength => 22, -interleaved => 0, defined $output ? (-file => ">$output") : () ); while( my $aln = $in->next_aln ) { my $dnaaln = aa_to_dna_aln($aln,\%seqs); $dnaaln->set_displayname_flat(1); $out->write_aln($dnaaln); } __END__ Lucy.t100644000766000024 234214006427353 15665 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 22); use_ok('Bio::Tools::Lucy'); } my @params = (adv_stderr => 1, seqfile => test_input_file('lucy.seq'), rev_desig => 'R'); # Bio::Tools::Lucy will find .qual, .info, and .stderr files in this folder my $lucyObj = Bio::Tools::Lucy->new(@params); isa_ok $lucyObj,'Bio::Tools::Lucy'; ok $lucyObj->seqfile(); $lucyObj->adv_stderr(1); my $stderr = $lucyObj->adv_stderr(); ok $stderr; my $names =$lucyObj->get_sequence_names(); ok $names; my $seq = shift @$names; is $seq, 'TaLr1010B10R'; is $lucyObj->length_raw("$seq"), 1060; is $lucyObj->length_clear("$seq"), 420; is $lucyObj->start_clear("$seq"), 86; is $lucyObj->end_clear("$seq"), 505; ok $lucyObj->avg_quality("$seq"); ok $lucyObj->full_length("$seq"); ok $lucyObj->polyA("$seq"); is $lucyObj->direction("$seq"), 'R'; ok $lucyObj->per_GC("$seq"); ok $lucyObj->sequence("$seq"); ok $lucyObj->quality("$seq"); my $seqObj = $lucyObj->get_Seq_Obj("$seq"); ok $seqObj; my $seqObjs = $lucyObj->get_Seq_Objs(); ok $seqObjs; my $rejects = $lucyObj->get_rejects(); ok $rejects; my ($key) = (sort keys %$rejects); is $key, 'TaLr1011A07R'; is $rejects->{$key}, 'Q'; Sim4.t100644000766000024 306714006427353 15572 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test 
Harness script for Bioperl # $Id: Sim4.t 11525 2007-06-27 10:16:38Z sendu $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 27); use_ok('Bio::Tools::Sim4::Results'); } my $sim4 = Bio::Tools::Sim4::Results->new(-file=> test_input_file('sim4.rev'), -estisfirst=>0); ok ( $sim4, 'new Sim4 results instance') ; my $exonset = $sim4->next_exonset; my @exons = $exonset->sub_SeqFeature(); is (scalar(@exons), 10); my $exon = 1; is $exons[$exon]->est_hit()->seq_id(), 'HSHNCPA1'; like($exons[$exon]->seq_id(), qr/human/); is $exons[$exon]->strand(), -1; is $exons[$exon]->start(), 1048; is $exons[$exon]->end(), 1117; is $exons[$exon]->score, 93; is $exons[$exon]->est_hit()->seqlength(), 1198; $sim4 = Bio::Tools::Sim4::Results->new(-file=> test_input_file('sim4.for.for'), -estisfirst=>0); ok ( $sim4, 'new Sim4 results instance') ; $exonset = $sim4->next_exonset; @exons = $exonset->sub_SeqFeature(); is (scalar(@exons), 4); $exon = 1; is $exons[$exon]->est_hit()->seq_id(), 'hs_est'; is $exons[$exon]->seq_id(), 'human'; is $exons[$exon]->strand(), 1; is $exons[$exon]->start(), 1377; is $exons[$exon]->end(), 1500; is $exons[$exon]->score, 99; is $exons[$exon]->est_hit()->seqlength(), 479; ok($sim4->next_exonset); @exons = $exonset->sub_SeqFeature(); is $exons[$exon]->est_hit()->seq_id(), 'hs_est'; is $exons[$exon]->seq_id(), 'human'; is $exons[$exon]->strand(), 1; is $exons[$exon]->est_hit->start(), 120; is $exons[$exon]->est_hit->end(), 243; is $exons[$exon]->score, 99; is $exons[$exon]->est_hit()->seqlength(), 479; QRNA.t100644000766000024 343414006427353 15515 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 30); use_ok('Bio::Tools::QRNA'); } my $inputfilename= test_input_file('ecoli-trna-qrna.out'); ok my $parser = Bio::Tools::QRNA->new(-file => $inputfilename); my $rnacount = 0; while( my $f = $parser->next_feature ) { if( $f->primary_tag eq 'RNA' ) 
{ # winning model is primary tag if( ! $rnacount ) { # 1st time through let's test is($f->feature1->start,4); is($f->feature1->end, 70); is($f->score, 22.147); is($f->feature1->seq_id,'DA0780-1-'); is($f->feature2->start, 4); is($f->feature2->end, 70); is($f->feature2->seq_id, 'ECOLI-3979754-'); is(($f->get_tag_values('alignment_len'))[0], 70); is(($f->get_tag_values('alignment_pid'))[0], '72.86'); is(($f->get_tag_values('COD_score'))[0], '16.954'); is(($f->get_tag_values('COD_logoddspost'))[0], '-4.365'); is(($f->get_tag_values('OTH_score'))[0], '21.319'); is(($f->get_tag_values('OTH_logoddspost'))[0], '0.000'); } $rnacount++; } } is($rnacount, 21); $inputfilename= test_input_file('qrna-relloc.out'); $parser = Bio::Tools::QRNA->new(-file => $inputfilename); my $qrna = $parser->next_feature; is($qrna->primary_tag, 'COD'); is($qrna->source_tag, 'qrna'); is($qrna->feature1->seq_id, 'Contig1'); is($qrna->feature2->seq_id, 'chr5.pseudo'); is($qrna->feature1->start, 24732); is($qrna->feature1->end, 24881); is($qrna->feature2->start, 527251); is($qrna->feature2->end, 527400); is($parser->seq_file,'tst.out'); is($parser->RNA_model, '/mix_tied_linux.cfg'); is($parser->PAM_model, 'BLOSUM62 scaled by 1.000'); is($parser->program_name, 'qrna'); is($parser->program_version, '1.2b'); is($parser->program_date, 'Tue Dec 18 15:04:38 CST 2001'); ePCR.t100644000766000024 153214006427353 15542 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ BEGIN { use Bio::Root::Test; test_begin(-tests => 27); use_ok('Bio::Tools::EPCR'); use_ok('Bio::SeqIO'); } my $DEBUG = test_debug(); my $seqio = Bio::SeqIO->new('-format' => 'fasta', '-file' => test_input_file('genomic-seq.fasta')); my $seq = $seqio->next_seq; ok($seq); my $epcr = Bio::Tools::EPCR->new( '-file' => test_input_file('genomic-seq.epcr')); ok ($epcr); my %strand; while( defined(my $feature = $epcr->next_feature) ) { ok($feature); ok($feature->start); ok($feature->end); 
$seq->add_SeqFeature($feature); $strand{$feature->strand} ++; } is ($strand{1}, 3, 'got 3 forward strand ePCR hits'); is ($strand{-1}, 3, 'got 3 reverse strand ePCR hits'); if( $DEBUG ) { $seqio = Bio::SeqIO->new('-format' => 'genbank' ); $seqio->write_seq($seq); } bsml.t100644000766000024 226014006427353 15565 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 16, -requires_modules => ['XML::DOM'] ); use_ok('XML::DOM'); use_ok('Bio::SeqIO::bsml'); } my $verbose = test_debug(); my $str = Bio::SeqIO->new(-format => 'bsml', -verbose => $verbose, -file => test_input_file('U83300.bsml')); my $seq = $str->next_seq; isa_ok($seq, 'Bio::Seq::RichSeqI'); my @refs = $seq->annotation->get_Annotations('reference'); is(@refs, 2, 'got correct number of refs'); is($seq->display_id, 'MIVN83300', 'display_id'); is($seq->molecule, 'DNA', 'molecule'); ok(! $seq->is_circular, 'is_circular'); is($seq->get_dates, 2, 'dates'); is($seq->accession_number, 'U83300', 'accession_number'); is($seq->seq_version, 1, 'seq_version'); my @feats = $seq->get_SeqFeatures; is(@feats, 2, 'got correct number of SeqFeatures'); is($feats[1]->start, 1, 'feature start'); is($feats[1]->end, 946, 'feature end'); is($feats[1]->get_tag_values('db_xref'), 3, 'get_tag_values db_xref'); is($seq->annotation->get_Annotations('reference'), 2, 'get_Annotations reference'); is($seq->annotation->get_Annotations('dblink'), 2, 'get_Annotations dblink'); kegg.t100644000766000024 707114006427353 15552 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 16); use_ok('Bio::SeqIO::kegg'); } my $verbose = test_debug(); my $io = Bio::SeqIO->new(-format => 'kegg', -verbose => $verbose, -file => test_input_file('AHCYL1.kegg')); isa_ok($io, 'Bio::SeqIO'); my $kegg = $io->next_seq(); ok($kegg); 
isa_ok($kegg, 'Bio::Seq::RichSeq'); is($kegg->accession, '10768'); is($kegg->display_id, 'AHCYL1'); is($kegg->alphabet, 'dna'); is($kegg->seq,'atgtcgatgcctgacgcgatgccgctgcccggggtcggggaggagctgaagcaggccaaggagatcgaggacgccgagaagtactccttcatggccaccgtcaccaaggcgcccaagaagcaaatccagtttgctgatgacatgcaggagttcaccaaattccccaccaaaactggccgaagatctttgtctcgctcgatctcacagtcctccactgacagctacagttcagctgcatcctacacagatagctctgatgatgaggtttctccccgagagaagcagcaaaccaactccaagggcagcagcaatttctgtgtgaagaacatcaagcaggcagaatttggacgccgggagattgagattgcagagcaagacatgtctgctctgatttcactcaggaaacgtgctcagggggagaagcccttggctggtgctaaaatagtgggctgtacacacatcacagcccagacagcggtgttgattgagacactctgtgccctgggggctcagtgccgctggtctgcttgtaacatctactcaactcagaatgaagtagctgcagcactggctgaggctggagttgcagtgttcgcttggaagggcgagtcagaagatgacttctggtggtgtattgaccgctgtgtgaacatggatgggtggcaggccaacatgatcctggatgatgggggagacttaacccactgggtttataagaagtatccaaacgtgtttaagaagatccgaggcattgtggaagagagcgtgactggtgttcacaggctgtatcagctctccaaagctgggaagctctgtgttccggccatgaacgtcaatgattctgttaccaaacagaagtttgataacttgtactgctgccgagaatccattttggatggcctgaagaggaccacagatgtgatgtttggtgggaaacaagtggtggtgtgtggctatggtgaggtaggcaagggctgctgtgctgctctcaaagctcttggagcaattgtctacattaccgaaatcgaccccatctgtgctctgcaggcctgcatggatgggttcagggtggtaaagctaaatgaagtcatccggcaagtcgatgtcgtaataacttgcacaggaaataagaatgtagtgacacgggagcacttggatcgcatgaaaaacagttgtatcgtatgcaatatgggccactccaacacagaaatcgatgtgaccagcctccgcactccggagctgacgtgggagcgagtacgttctcaggtggaccatgtcatctggccagatggcaaacgagttgtcctcctggcagagggtcgtctactcaatttgagctgctccacagttcccacctttgttctgtccatcacagccacaacacaggctttggcactgatagaactctataatgcacccgaggggcgatacaagcaggatgtgtacttgcttcctaagaaaatggatgaatacgttgccagcttgcatctgccatcatttgatgcccaccttacagagctgacagatgaccaagcaaaatatctgggactcaacaaaaatgggccattcaaacctaattattacagatactaa'); ok($kegg->translate->seq); is(($kegg->annotation->get_Annotations('description'))[0]->text, 'S-adenosylhomocysteine hydrolase-like 1 [EC:3.3.1.1]'); is(($kegg->annotation->get_Annotations('pathway'))[0]->text, 'Metabolism; Amino Acid Metabolism; Methionine 
metabolism'); is( (grep {$_->database eq 'KO'} $kegg->annotation->get_Annotations('dblink'))[0]->comment, 'adenosylhomocysteinase' ); is( (grep {$_->database eq 'PATH'} $kegg->annotation->get_Annotations('dblink'))[0]->primary_id, 'hsa00271' ); is( ($kegg->annotation->get_Annotations('aa_seq'))[0]->text, 'MSMPDAMPLPGVGEELKQAKEIEDAEKYSFMATVTKAPKKQIQFADDMQEFTKFPTKTGRRSLSRSISQSSTDSYSSAASYTDSSDDEVSPREKQQTNSKGSSNFCVKNIKQAEFGRREIEIAEQDMSALISLRKRAQGEKPLAGAKIVGCTHITAQTAVLIETLCALGAQCRWSACNIYSTQNEVAAALAEAGVAVFAWKGESEDDFWWCIDRCVNMDGWQANMILDDGGDLTHWVYKKYPNVFKKIRGIVEESVTGVHRLYQLSKAGKLCVPAMNVNDSVTKQKFDNLYCCRESILDGLKRTTDVMFGGKQVVVCGYGEVGKGCCAALKALGAIVYITEIDPICALQACMDGFRVVKLNEVIRQVDVVITCTGNKNVVTREHLDRMKNSCIVCNMGHSNTEIDVTSLRTPELTWERVRSQVDHVIWPDGKRVVLLAEGRLLNLSCSTVPTFVLSITATTQALALIELYNAPEGRYKQDVYLLPKKMDEYVASLHLPSFDAHLTELTDDQAKYLGLNKNGPFKPNYYRY'); is( ($kegg->annotation->get_Annotations('position'))[0]->text, '1:join(26813..26932,50794..50905,52974..53117,54122..54222,54657.. 54759,56523..56617,57185..57291,58104..58220,58427..58490,59255.. 59343,59706..59776,60133..60227,60312..60410,60811..60879,61308.. 
61386,62491..62611,63434..63440)'); is($kegg->species->common_name , 'H.sapiens');embl.t100644000766000024 2760314006427353 15577 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 100); use_ok('Bio::SeqIO::embl'); } my $verbose = test_debug(); my $ast = Bio::SeqIO->new( -format => 'embl', -verbose => $verbose, -file => test_input_file('roa1.dat')); $ast->verbose($verbose); my $as = $ast->next_seq(); ok defined $as->seq; is($as->display_id, 'HSHNCPA1'); is($as->accession_number, 'X79536'); is($as->seq_version, 1); is($as->version, 1); is($as->desc, 'H.sapiens mRNA for hnRNPcore protein A1'); is($as->molecule, 'RNA'); is($as->alphabet, 'rna'); is(scalar $as->all_SeqFeatures(), 4); is($as->length, 1198); is($as->species->binomial(), 'Homo sapiens'); # EMBL Release 87 changes (8-17-06) $ast = Bio::SeqIO->new( -format => 'embl', -verbose => $verbose, -file => test_input_file('roa1_v2.dat')); $ast->verbose($verbose); $as = $ast->next_seq(); ok defined $as->seq; # accession # same as display name now is($as->display_id, 'X79536'); is($as->accession_number, 'X79536'); is($as->seq_version, 1); is($as->version, 1); is($as->desc, 'H.sapiens mRNA for hnRNPcore protein A1'); # mRNA instead of RNA is($as->molecule, 'mRNA'); is($as->alphabet, 'rna'); is(scalar $as->all_SeqFeatures(), 4); is($as->length, 1198); is($as->species->binomial(), 'Homo sapiens'); my $ent = Bio::SeqIO->new( -file => test_input_file('test.embl'), -format => 'embl'); my $seq = $ent->next_seq(); is(defined $seq->seq(), 1, 'success reading Embl with ^ location and badly split double quotes'); is(scalar $seq->annotation->get_Annotations('reference'), 3); my $out_file = test_output_file(); my $out = Bio::SeqIO->new(-file=> ">$out_file", -format => 'embl'); is($out->write_seq($seq),1, 'success writing Embl format with ^ < and > locations'); # embl with no FT $ent = Bio::SeqIO->new( -file => 
test_input_file('test.embl'), -format => 'embl'); $seq = $ent->next_seq(); ok($seq); is(lc($seq->subseq(1,10)),'gatcagtaga'); is($seq->length, 4870); # embl with no FH my $noFH = Bio::SeqIO->new( -file => test_input_file('no_FH.embl'), -format => 'embl'); is(scalar($noFH->next_seq->get_SeqFeatures), 4); # bug 1571 $ent = Bio::SeqIO->new( -format => 'embl', -file => test_input_file('test.embl2sq')); is($ent->next_seq->length,4877); # embl repbase $ent = Bio::SeqIO->new(-file => test_input_file('BEL16-LTR_AG.embl'), -format => 'embl'); $seq = $ent->next_seq; is($seq->display_id,'BEL16-LTR_AG'); # test secondary accessions in EMBL (bug #1332) my $seqio = Bio::SeqIO->new( -format => 'embl', -file => test_input_file('ECAPAH02.embl')); $seq = $seqio->next_seq; is($seq->accession_number, 'D10483'); is($seq->seq_version, 2); my @accs = $seq->get_secondary_accessions(); is($accs[0], 'J01597'); is($accs[-1], 'X56742'); ### TPA TESTS - Thanks to Richard Adams ### # test Third Party Annotation entries in EMBL/Gb format # to ensure compatibility with parsers. 
my $str = Bio::SeqIO->new( -format =>'embl', -file => test_input_file('BN000066-tpa.embl')); $seq = $str->next_seq; ok(defined $seq); is($seq->accession_number, 'BN000066'); is($seq->alphabet, 'dna'); is($seq->display_id, 'AGA000066'); is($seq->length, 5195); is($seq->division, 'INV'); is($seq->get_dates, 2); is($seq->keywords, 'acetylcholinesterase; achE1 gene; Third Party Annotation; TPA.'); is($seq->seq_version, 1); is($seq->feature_count, 15); my $spec_obj = $seq->species; is ($spec_obj->common_name, 'African malaria mosquito'); is ($spec_obj->species, 'gambiae'); is ($spec_obj->genus, 'Anopheles'); is ($spec_obj->binomial, 'Anopheles gambiae'); my $ac = $seq->annotation; my $reference = ($ac->get_Annotations('reference') )[1]; is ($reference->title,'"A novel acetylcholinesterase gene in mosquitoes codes for the insecticide target and is non-homologous to the ace gene in Drosophila"'); is ($reference->authors,'Weill M., Fort P., Berthomi eu A., Dubois M.P., Pasteur N., Raymond M.'); my $cmmnt = ($ac->get_Annotations('comment') )[0]; is($cmmnt->text, 'see also AJ488492 for achE-1 from Kisumu strain Third Party Annotation Database: This TPA record uses Anopheles gambiae trace archive data (http://trace.ensembl.org) '); $ent = Bio::SeqIO->new( -file => test_input_file('test.embl'), -format => 'embl'); $ent->verbose($verbose); $seq = $ent->next_seq(); my $species = $seq->species(); my @cl = $species->classification(); is( $cl[3] ne $species->genus(), 1, 'genus duplication test'); $ent->close(); # ## read-write - test embl writing of a PrimarySeq # my $primaryseq = Bio::PrimarySeq->new( -seq => 'AGAGAGAGATA', -id => 'myid', -desc => 'mydescr', -alphabet => 'DNA', -accession_number => 'myaccession'); $verbose = -1 unless $ENV{'BIOPERLDEBUG'}; # silence warnings unless we are debugging my $embl = Bio::SeqIO->new(-format => 'embl', -verbose => $verbose, -file => ">$out_file"); ok($embl->write_seq($primaryseq)); # this should generate a warning my $scalar = "test"; eval 
{ $embl->write_seq($scalar); }; ok ($@); # CDS records # (which have nonstandard 'PA' and 'OX' tags) # see http://bioperl.org/pipermail/bioperl-l/2009-February/029252.html # and the rest of that thread my $cds_file = Bio::SeqIO->new(-format =>'embl', -file => test_input_file('cds_sample.embl')); my $cds_seq = $cds_file->next_seq; ok(defined $cds_seq); is($cds_seq->display_id, 'EAL24309'); is($cds_seq->accession_number, 'CH236947.1', 'CDS - accession on PA line'); is($cds_seq->alphabet, 'dna'); is($cds_seq->length, 192); is($cds_seq->species->binomial(), 'Homo sapiens'); is($cds_seq->seq_version, 1); is($cds_seq->feature_count, 2); my $cds_annot = $cds_seq->annotation; ok(defined $cds_annot); my $cds_dblink = ($cds_annot->get_Annotations('dblink'))[0]; ok(defined $cds_dblink); is($cds_dblink->tagname, 'dblink', 'CDS - OX tagname'); is($cds_dblink->database, 'NCBI_TaxID', 'CDS - OX database'); is($cds_dblink->primary_id, '9606', 'CDS - OX primary_id'); #bug 2982 - parsing contig descriptions sans sequence data ok( $embl = Bio::SeqIO->new( -file => test_input_file('bug2982.embl'), -format => 'embl') ); my $i; for ($i=0; my $seq = $embl->next_seq; $i++) { ok !$seq->seq; ok ( my $ann = ($seq->annotation->get_Annotations('contig'))[0] ); like $ann->value, qr/join\(/; } is $i, 4; # bug 3086 - parsing long lines correctly ok( $embl = Bio::SeqIO->new(-file => test_input_file('bug3086.embl'), -format => 'embl', -verbose => $verbose) ); $seq = $embl->next_seq; foreach my $feature ($seq->top_SeqFeatures) { if ($feature->has_tag('product')) { my ($product) = $feature->get_tag_values('product'); is($product, 'bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase', 'Check if product was parsed correctly'); } } # long labels handled { # Create sequence with feature with a long label qualifier my $seq=Bio::Seq->new(-seq=>'actg'); my $feature=Bio::SeqFeature::Generic->new(-primary=>'CDS', -start=>1, -end=>4); $feature->add_tag_value(label=>'1 2 
3 4 5 6 7 8 9 a b c d e f g h i j k l m n o p q r'); $seq->add_SeqFeature($feature); # Write EMBL my $string; open my $str_fh, '>', \$string or skip("Could not write string, skipping", 2); my $out=Bio::SeqIO->new(-format=>'embl', -fh => $str_fh); $out->write_seq($seq); # Read EMBL my $in=Bio::SeqIO->new(-format=>'embl', -string => $string); my $ret=eval { my $embl=$in->next_seq }; my $error; $error=$@ if (!$ret); ok($ret, 'Parse long qualifier'); is($error, undef); } # NCBI TaxIDs should roundtrip { my $seq=Bio::Seq->new(-seq=>'actg'); my $species = Bio::Species->new(-ncbi_taxid => 7165, -classification=> [ 'Anopheles gambiae', 'Anopheles', 'Culicoidea', 'Nematocera', 'Diptera', 'Endopterygota', 'Neoptera', 'Pterygota', 'Insecta', 'Hexapoda', 'Arthropoda', 'Metazoa', 'Eukaryota' ]); $seq->species($species); is($seq->species->ncbi_taxid, 7165, 'TaxID set correctly'); # Write EMBL my $string; open my $str_fh, '>', \$string or skip("Could not write string, skipping", 2); my $out=Bio::SeqIO->new(-format=>'embl', -fh => $str_fh); $out->write_seq($seq); # Read EMBL my $in=Bio::SeqIO->new(-format=>'embl', -string => $string); my $embl_seq; my $ret=eval { $embl_seq=$in->next_seq }; my $error; $error=$@ if (!$ret); # Check that TaxID has roundtripped my $embl_species = $embl_seq->species; ok(defined $embl_species, "The read sequence has a species object"); is($embl_species->ncbi_taxid, 7165, "NCBI TaxID has roundtripped"); is($embl_species->binomial(), 'Anopheles gambiae', "Name has roundtripped"); } # a taxon db_xref on a source feature should override an OX line { my $seq=Bio::Seq->new(-seq=>'actg'); my $species = Bio::Species->new(-ncbi_taxid => 7165, -classification=> [ 'Anopheles gambiae', 'Anopheles', 'Culicoidea', 'Nematocera', 'Diptera', 'Endopterygota', 'Neoptera', 'Pterygota', 'Insecta', 'Hexapoda', 'Arthropoda', 'Metazoa', 'Eukaryota' ]); $seq->species($species); is($seq->species->ncbi_taxid, 7165, 'TaxID set correctly'); my $seq_feature = 
Bio::SeqFeature::Generic->new(-primary=>'source', -start => 1, -end=> length($seq->seq)); $seq_feature->add_tag_value('db_xref', 'taxon:9606'); $seq->add_SeqFeature($seq_feature); # Write EMBL my $string; open my $str_fh, '>', \$string or skip("Could not write string, skipping", 2); my $out=Bio::SeqIO->new(-format=>'embl', -fh => $str_fh); $out->write_seq($seq); # Read EMBL my $in=Bio::SeqIO->new(-format=>'embl', -string => $string); my $embl_seq; my $ret=eval { $embl_seq=$in->next_seq }; my $error; $error=$@ if (!$ret); # Check that TaxID has roundtripped my $embl_species = $embl_seq->species; ok(defined $embl_species, "The read sequence has a species object"); is($embl_species->ncbi_taxid, 9606, "The taxid of the source feature overrides that of the OX line"); is($embl_species->binomial(), 'Anopheles gambiae', "Name has roundtripped"); } # Handle Seq objects that only define an ID, not an accession number { my $seq = Bio::Seq->new(-seq=>'actg', -id=>'test_id'); my $string; open my $str_fh, '>', \$string or skip("Could not write string, skipping", 1); my $out = Bio::SeqIO->new(-format=>'embl', -fh=>$str_fh); $out->write_seq($seq); ok($string =~ m/ID test_id;/, "The ID field was written correctly"); } # Test lenient handling of space after '=' sign in qualifiers: { my $ent = Bio::SeqIO->new( -file => test_input_file('test_space.embl'), -format => 'embl'); my $seq; eval { $seq = $ent->next_seq(); }; my $error=$@; is($error, '', 'EMBL format with space after equal sign parses'); my ($feature)=$seq->all_SeqFeatures; is($feature->primary_tag, 'CDS', 'CDS read'); ok($feature->has_tag('product'), '/product found'); my ($value)=$feature->get_tag_values('product'); is($value, 'somewordandt extthatisquite lon gandthereforewraps', 'Qualifier /product value matches'); } game.t100644000766000024 361114006427353 15542 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; 
test_begin(-tests => 24, -requires_modules => [qw(XML::Parser::PerlSAX XML::Writer)]); use_ok('Bio::SeqIO::game'); } my $verbose = test_debug() || -1; my $str = Bio::SeqIO->new('-file'=> test_input_file('test.game'), '-format' => 'game', '-verbose' => $verbose); isa_ok ($str, 'Bio::SeqIO'); my $seq = $str->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeq'); # exercise game parsing $str = Bio::SeqIO->new( -format =>'game', -file => test_input_file('test.game') ); $seq = $str->next_seq; ok(defined $seq); ok(defined $seq->seq); is($seq->alphabet, 'dna'); is($seq->display_id, 'L16622'); is($seq->length, 28735); is($seq->species->binomial, 'Caenorhabditis elegans'); my @feats = $seq->get_SeqFeatures; is(scalar(@feats), 7); my $source = grep { $_->primary_tag eq 'source' } @feats; ok($source); my @genes = grep { $_->primary_tag eq 'gene' } @feats; is(scalar(@genes), 3); ok($genes[0]->has_tag('gene')); my $gname; if ( $genes[0]->has_tag('gene') ) { ($gname) = $genes[0]->get_tag_values('gene'); } is($gname, 'C02D5.3'); is($genes[0]->strand, 1); my $cds = grep { $_->primary_tag eq 'CDS' } @feats; is($cds, 3); # make sure we can read what we write # test XML-writing my $testfile = test_output_file(); # map argument is required to write a element my $out = Bio::SeqIO->new(-format => 'game', -file => ">$testfile", -map => 1); $out->write_seq($seq); $out->close(); $str = Bio::SeqIO->new(-format =>'game', -file => $testfile); $seq = $str->next_seq; ok(defined $seq); ok(defined $seq->seq); is($seq->alphabet, 'dna'); is($seq->display_id, 'L16622'); is($seq->length, 28735); is($seq->species->binomial, 'Caenorhabditis elegans'); my $genes = grep { $_->primary_tag eq 'gene' } @feats; $cds = grep { $_->primary_tag eq 'CDS' } @feats; is($genes, 3); is($cds, 3); tigr.t100644000766000024 234014006427353 15574 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 8, -requires_module 
=> 'Error', -requires_networking => 0, ); use_ok('Bio::SeqIO::tigr'); } my $verbose = test_debug(); TODO: { my $format = 'tigr'; todo_skip "No tests for $format format -- no sample file to test against", 7, if 1; my $seqio_obj = Bio::SeqIO->new(-file => test_input_file("test.$format"), -format => $format); isa_ok($seqio_obj, 'Bio::SeqIO'); my @methods = qw(next_seq write_seq); foreach my $method (@methods) { can_ok($seqio_obj, $method) || diag "$method method not implemented for $format"; } # checking the first sequence object my $seq_obj = $seqio_obj->next_seq(); isa_ok($seq_obj, 'Bio::Seq'); my %expected = ('seq' => '', 'length' => '', 'primary_id' => '', 'description' => qr(), ); is ($seq_obj->seq(), $expected{'seq'}, 'sequence'); is ($seq_obj->length(), $expected{'length'}, 'length'); is ($seq_obj->primary_id(), $expected{'primary_id'}, 'primary_id'); like ($seq_obj->description(), $expected{'description'}, 'description'); }qual.t100644000766000024 256714006427353 15604 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 18); use_ok('Bio::SeqIO'); use_ok('Bio::Seq::PrimaryQual'); } my $DEBUG = test_debug(); my $in_qual = Bio::SeqIO->new('-file' => test_input_file('qualfile.qual'), '-format' => 'qual'); ok($in_qual); my @quals; my $first = 1; while ( my $qual = $in_qual->next_seq() ) { # ::dumpValue($qual); isa_ok($qual, 'Bio::Seq::PrimaryQual'); @quals = @{$qual->qual()}; if( $DEBUG ) { warn($qual->id()."\n"); warn("(".scalar(@quals).") quality values.\n"); } if( $first ) { is(@quals, 484); } $first = 0; } # in October 2004, Carlos Mauricio La Rota posted a problem with descriptions # this routine is to test that @quals = 10..20; # this one has a forced header my $seq = Bio::Seq::PrimaryQual->new( -qual => \@quals, -header => "Hank is a good cat. 
I gave him a bath yesterday."); my $out = Bio::SeqIO->new(-file => ">".test_output_file(), -format => 'qual'); # yes, that works is $seq->header, 'Hank is a good cat. I gave him a bath yesterday.'; @quals = @{$seq->qual()}; is scalar(@quals), 11; ok $out->write_seq($seq); $seq->header(''); is $seq->header, ''; $seq->id('Hank1'); is $seq->id, 'Hank1'; # yes, that works ok $out->write_seq($seq); DBLink.t100644000766000024 240014006427353 15477 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 131); use_ok('Bio::SeqIO'); } my @genbank_files = qw/BK000016-tpa.gbk ay116458.gb ay149291.gb NC_006346.gb ay007676.gb dq519393.gb P35527.gb/; # bug 2152 #------------------------------------ my $verbose = test_debug(); for my $in ( @genbank_files ) { my $infile = test_input_file($in); my $seqio = Bio::SeqIO->new( -format =>'genbank', -verbose => $verbose, -file => $infile, ); my $seq = $seqio->next_seq; my @values = $seq->annotation->get_Annotations('dblink'); foreach my $value (@values) { my $output = $value->display_text; ok(defined $output, '"'.$output . 
'"'); # check value is not empty ok(index($output,'::') < 0 , 'no double colon' ); # these tests seem silly ok( substr($output,-1) ne ':' , 'no trailing colon'); # but all have been known to occur ok(index($output,' ') < 0 , 'no double space' ); # my @parts = split(/:/,$output, 2); ok( scalar(@parts) == 2, 'dblink value is splittable'); } } #------------------------------------ AlignIO000755000766000024 014006427353 14605 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tpo.t100644000766000024 301614006427353 15550 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: po.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 11); use_ok('Bio::AlignIO::po'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # POA # just skip on perl 5.6.0 and earlier as it causes a crash on # default perl with OS X 10.2 # fink perl 5.6.0 does not seem to have the problem # can't figure out what it is so just skip for now SKIP: { skip("skipping due to bug in perl 5.6.0 that comes with OS X 10.2", 10) unless ($^O ne 'darwin' || $] > 5.006); $str = Bio::AlignIO->new( -file => test_input_file('testaln.po'), -format => 'po', ); isa_ok($str, 'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->num_sequences, 6; # output is? i.e. does conversion from clustalw to po give the same alignment? $str = Bio::AlignIO->new( '-file' => test_input_file('testaln.clustalw'), '-format' => 'clustalw'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $strout = Bio::AlignIO->new( '-file' => ">" . 
test_output_file(), '-format' => 'po'); $status = $strout->write_aln($aln); is $status, 1, "po output test"; $str = Bio::AlignIO->new( '-file' => test_input_file('testaln.po'), '-format' => 'po'); isa_ok($str,'Bio::AlignIO'); my $aln2 = $str->next_aln(); isa_ok($aln2,'Bio::Align::AlignI'); is $aln2->num_sequences, $aln->num_sequences; is $aln2->length, $aln->length; } RootI.t100644000766000024 1272714006427353 15660 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Root# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 43); use_ok 'Bio::Root::Root'; } ok my $obj = Bio::Root::Root->new(); isa_ok $obj, 'Bio::Root::RootI'; throws_ok { $obj->throw('Testing throw') } qr/Testing throw/;# 'throw failed'; # test throw_not_implemented() throws_ok { $obj->throw_not_implemented() } qr/EXCEPTION: Bio::Root::NotImplemented/; { package Bio::FooI; use base qw(Bio::Root::RootI); sub new { my $class = shift; my $self = {}; bless $self, ref($class) || $class; return $self; }; } $obj = Bio::FooI->new(); throws_ok { $obj->throw_not_implemented() } qr/EXCEPTION /; $obj = Bio::Root::Root->new(); # doesn't work in perl 5.00405 #my $val; #eval { # my ($tfh,$tfile) = $obj->tempfile(); # local * STDERR = $tfh; # $obj->warn('Testing warn'); # close $tfh; # open(IN, $tfile) or die("cannot open $tfile"); # $val = join("", ) ; # close IN; # unlink $tfile; #}; #ok $val =~ /Testing warn/; #'verbose(0) warn did not work properly' . $val; $obj->verbose(-1); throws_ok { $obj->throw('Testing throw') } qr/Testing throw/;# 'verbose(-1) throw did not work properly' . $@; lives_ok { $obj->warn('Testing warn') }; $obj->verbose(1); throws_ok { $obj->throw('Testing throw') } qr/Testing throw/;# 'verbose(1) throw did not work properly' . 
$@; # doesn't work in perl 5.00405 #undef $val; #eval { # my ($tfh,$tfile) = $obj->tempfile(); # local * STDERR = $tfh; # $obj->warn('Testing warn'); # close $tfh; # open(IN, $tfile) or die("cannot open $tfile"); # $val = join("", ); # close IN; # unlink $tfile; #}; #ok $val =~ /Testing warn/;# 'verbose(1) warn did not work properly' . $val; my @stack = $obj->stack_trace(); is scalar @stack, 2; my $verbobj = Bio::Root::Root->new(-verbose=>1,-strict=>1); is $verbobj->verbose(), 1; $Bio::Root::Root::DEBUG = 1; my $seq = Bio::Root::Root->new(); is $seq->verbose, 1; # test for bug #1343 my @vals = Bio::Root::RootI->_rearrange([qw(apples pears)], -apples => 'up the', -pears => 'stairs'); is shift @vals, 'up the'; is shift @vals, 'stairs'; # tests for _set_from_args() # Let's not pollute Bio::Root::Root namespace if possible # Create temp classes instead which inherit Bio::Root::Root, then test { package Bio::Foo1; use base qw(Bio::Root::Root); our $VERSION = '2.00'; sub new { my $class = shift; my $self = {}; bless $self, ref($class) || $class; $self->_set_from_args(\@_); return $self; }; } $obj = Bio::Foo1->new(-verbose => 1, t1 => 1, '--Test-2' => 2); #ok ! $obj->can('t1'), 'arg not callable'; { package Bio::Foo2; use base qw(Bio::Root::Root); sub new { my $class = shift; my $self = {}; bless $self, ref($class) || $class; $self->_set_from_args(\@_, -create => 1); return $self; }; } $obj = Bio::Foo2->new(-verbose => 1, t3 => 1, '--Test-4' => 2); ok $obj->can('t3'), 'arg callable since method was created'; ok $obj->can('test_4'), 'mal-formed arg callable since method was created with good name'; for my $m (qw(t3 test_4)) { can_ok('Bio::Foo2',$m); ok ! 
Bio::Root::Root->can($m), "Methods don't pollute original Bio::Root::Root namespace"; } { package Bio::Foo3; use base qw(Bio::Root::Root); sub new { my $class = shift; my $self = {}; bless $self, ref($class) || $class; $self->_set_from_args(\@_, -methods => ['verbose', 't5'], -create => 1); return $self; }; } $obj = Bio::Foo3->new(-verbose => 1, t5 => 1, '--Test-6' => 2); can_ok $obj, 't5'; ok ! $obj->can('test_6'), 'arg not in method list not created'; can_ok ('Bio::Foo3','t5'); ok ! UNIVERSAL::can('Bio::Root::Root','t5'), "Methods don't pollute original Bio::Root::Root namespace"; { package Bio::Foo4; use base qw(Bio::Root::Root); sub new { my $class = shift; my $self = {}; bless $self, ref($class) || $class; my %args = @_; $self->_set_from_args(\%args, -methods => {(verbose => 'v', test7 => 't7', test_8 => 't8')}, -create => 1); return $self; }; } # with synonyms $obj = Bio::Foo4->new(-verbose => 1, t7 => 1, '--Test-8' => 2); is $obj->verbose, 1, 'verbose was set correctly'; is $obj->t7, 1, 'synonym was set correctly'; is $obj->test7, 1, 'real method of synonym was set correctly'; is $obj->test_8, 2, 'mal-formed arg correctly resolved to created method'; is $obj->t8, 2, 'synonym of set method was set correctly'; for my $m (qw(t7 test7 test_8 t8)) { can_ok 'Bio::Foo4', $m; ok ! 
UNIVERSAL::can('Bio::Root::Root','t7'), "Methods don't pollute original Bio::Root::Root namespace"; } # test basic Root::clone() my $clone = $obj->clone; is $clone->t7, $obj->t7, 'clone'; is $clone->test7, $obj->test7, 'clone'; is $clone->test_8, $obj->test_8, 'clone'; $clone->test_8('xyz'); isnt $clone->test_8, $obj->test_8, 'clone changed, original didn\'t'; # test Root::clone() with parameter passing, only works for methods # (introspection via can()) my $clone2 = $obj->clone(-t7 => 'foo'); is $clone2->t7, 'foo', 'parameters passed to clone() modify object'; is $obj->t7, 1, 'original is not modified'; test.nh100644000766000024 15314006427353 15661 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data(((hADH2:0.1,hADH1:0.11):0.05,nADHY:0.1,iADHX:0.12):0.1,(yADH4:0.09,yADH3:0.13,yADH2:0.12,yADH1:0.11):0.1);phi.out100644000766000024 3363214006427353 15734 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBLASTP 2.0.14 [Jun-29-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= CYS1_DICDI (351 letters) Database: /home/peter/blast/data/swissprot 88,780 sequences; 31,984,247 total letters Searching...................................................................................................................................................... 3 occurrence(s) of pattern in query CYS1_DICDI; PATTERN. pattern P-E-E-Q at position 23 of query sequence effective database length=3.2e+07 pattern probability=8.9e-06 lengthXprobability=2.8e+02 Number of occurrences of pattern in the database is 349 CYS1_DICDI; PATTERN. pattern P-E-E-Q at position 120 of query sequence effective database length=3.2e+07 pattern probability=8.9e-06 lengthXprobability=2.8e+02 Number of occurrences of pattern in the database is 349 CYS1_DICDI; PATTERN. 
pattern P-E-E-Q at position 237 of query sequence effective database length=3.2e+07 pattern probability=8.9e-06 lengthXprobability=2.8e+02 Number of occurrences of pattern in the database is 349 done Score E (bits) Value Significant matches for pattern occurrence 1 at position 23 sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 688 0.0 sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE 8 4.8 sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST... 7 6.0 sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4 7 7.6 sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7... 7 9.6 Significant matches for pattern occurrence 2 at position 120 sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT 13 0.13 sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT... 11 0.43 sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN) 11 0.55 sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNI... 10 1.1 sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 I... 8 3.0 sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURS... 7 6.0 sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1 7 7.6 sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN] 7 7.6 Significant matches for pattern occurrence 3 at position 237 sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, ... 9 1.4 sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, ... 9 1.4 sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 8 4.8 sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROT... 7 6.0 sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 
7 9.6 Significant alignments for pattern occurrence 1 at position 23 >sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR Length = 343 Score = 688 bits (1789), Expect = 0.0 Identities = 343/351 (97%), Positives = 343/351 (97%), Gaps = 8/351 (2%) Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 pattern 23 **** MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPP 120 pattern 120 * ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP- 119 Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 pattern 121 *** TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE Sbjct: 120 ---TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 pattern 237 **** CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG---- 232 Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 300 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG Sbjct: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 292 Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII Sbjct: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 >sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE Length = 4969 Score = 7.8 bits (25), Expect = 4.8 Identities = 14/39 (35%), Positives = 19/39 (47%) Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 pattern 23 **** PEEQ +F E + K +K EE E + G+ EE Sbjct: 4414 PEEQEKFQEQKTKEEEKEEKEETKSEPEKAEGEDGEKEE 4452 >sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST CLASS-ALPHA) Length = 221 Score = 7.4 bits (24), Expect = 6.0 Identities = 
19/67 (28%), Positives = 35/67 (51%), Gaps = 12/67 (17%) Query: 21 IPPEEQ-SQFLEFQDKFNKKY---------SH-EEYLERFEIFKSNLGKIEEL-NLIAIN 68 pattern 23 **** +PPEEQ ++ + +DK +Y SH ++YL ++ K+++ +E L N+ +N Sbjct: 112 LPPEEQEAKLAQIKDKAKNRYFPAFEKVLKSHGQDYLVGNKLSKADILLVELLYNVEELN 171 Query: 69 HKADTKF 75 A F Sbjct: 172 PGATASF 178 >sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4 Length = 356 Score = 7.1 bits (23), Expect = 7.6 Identities = 14/67 (20%), Positives = 32/67 (46%), Gaps = 5/67 (7%) Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGK---IEELNLIAINHKADTKFGVNK 79 pattern 23 **** PEEQ++ ++D+ N + ++Y + + L K +LN + ++A ++ + Sbjct: 75 PEEQAK--TYKDEGNDYFKEKDYKKAVISYTEGLKKKCADPDLNAVLYTNRAAAQYYLGN 132 Query: 80 FADLSSD 86 F +D Sbjct: 133 FRSALND 139 >sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7 INTERGENIC REGION Length = 725 Score = 6.8 bits (22), Expect = 9.6 Identities = 21/99 (21%), Positives = 43/99 (43%), Gaps = 21/99 (21%) Query: 21 IPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN 78 pattern 23 **** + PEEQ L+F ++ H ER + +++G +N + + G+ Sbjct: 213 LTPEEQKDKDLLQFAEQI-----HSMRTER--LSGAHIGNSPAIN------RLRGELGLQ 259 Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 DL +E ++ + +DD+ ++ DEF++S Sbjct: 260 AMEDLPEEEITDH------KVLSDDIDLSQATIDEFVHS 292 Significant alignments for pattern occurrence 2 at position 120 >sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT Length = 555 Score = 13.0 bits (40), Expect = 0.13 Identities = 16/28 (57%), Positives = 18/28 (64%), Gaps = 3/28 (10%) Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123 pattern 120 **** IFT D +AD LDD F IN + PEEQ Sbjct: 170 IFTGDDELADELDDRFVIDINKLFPEEQ 197 >sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT (MCR I ALPHA) Length = 553 Score = 11.2 bits (35), Expect = 0.43 Identities = 14/28 (50%), Positives = 18/28 (64%), Gaps = 3/28 (10%) Query: 99 IFTDDLPVADYLDDEFINSIP---PEEQ 123 pattern 120 **** I T DL +AD +DD+F+ I PEEQ Sbjct: 168 IITGDLELADEIDDKFLIDIEKLFPEEQ 195 
>sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN) Length = 101 Score = 10.9 bits (34), Expect = 0.55 Identities = 12/23 (52%), Positives = 16/23 (69%), Gaps = 1/23 (4%) Query: 114 FINSIPPEEQTAF-DWRTRGAVT 135 pattern 120 **** F S+ PEEQ AF +W+TR +T Sbjct: 78 FGKSLTPEEQRAFEEWKTRYGIT 100 >sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNIT (MCR II ALPHA) Length = 553 Score = 9.8 bits (31), Expect = 1.1 Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 3/28 (10%) Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123 pattern 120 **** IFT D +AD +D F IN + PEEQ Sbjct: 168 IFTGDDELADEIDKRFLIDINKLFPEEQ 195 >sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 INTERGENIC REGION Length = 462 Score = 8.5 bits (27), Expect = 3.0 Identities = 13/39 (33%), Positives = 21/39 (53%), Gaps = 9/39 (23%) Query: 112 DEFINSIP-------PEEQT--AFDWRTRGAVTPVKNQG 141 pattern 120 **** DEF+N+ P PEEQ+ A++W + + + N G Sbjct: 308 DEFLNTSPSPEVFTLPEEQSGMAWEWHDKDWMLDLTNDG 346 >sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURSOR (MAGP) (MAGP-1) Length = 183 Score = 7.4 bits (24), Expect = 6.0 Identities = 11/37 (29%), Positives = 18/37 (47%) Query: 100 FTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTP 136 pattern 120 **** + D + ADY D + ++ PEEQ + + V P Sbjct: 37 YGDQIDNADYYDYQEVSPRTPEEQFQSQQQVQQEVIP 73 >sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1 Length = 199 Score = 7.1 bits (23), Expect = 7.6 Identities = 11/27 (40%), Positives = 15/27 (54%), Gaps = 1/27 (3%) Query: 105 PVADYLDDE-FINSIPPEEQTAFDWRT 130 pattern 120 **** PV+ Y DE + + PEEQ D+ T Sbjct: 171 PVSSYSSDEGSYDPLSPEEQELLDFTT 197 >sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN] Length = 812 Score = 7.1 bits (23), Expect = 7.6 Identities = 8/13 (61%), Positives = 11/13 (84%) Query: 112 DEFINSIPPEEQT 124 pattern 120 **** D+ +S+PPEEQT Sbjct: 359 DQSDSSVPPEEQT 371 Significant alignments for pattern occurrence 3 at position 237 >sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, MITOCHONDRIAL 
PRECURSOR (GLYCINE DECARBOXYLASE B) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN B) Length = 1034 Score = 9.5 bits (30), Expect = 1.4 Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + + Sbjct: 80 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 134 MQD-------LASKNKIFK 145 >sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, MITOCHONDRIAL PRECURSOR (GLYCINE DECARBOXYLASE A) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN A) Length = 1037 Score = 9.5 bits (30), Expect = 1.4 Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + + Sbjct: 83 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 136 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 137 MQD-------LASKNKIFK 148 >sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR (GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN) Length = 1034 Score = 7.8 bits (25), Expect = 4.8 Identities = 21/79 (26%), Positives = 38/79 (47%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ K++ F +++ I +T P AI D++++ + G+ + + Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKAIRLDSMKYSKFDEGLTESQMIAH 133 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 134 MQD-------LASKNKIFK 145 >sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROTEIN 6 Length = 1081 Score = 7.4 bits (24), Expect = 6.0 Identities = 25/93 (26%), Positives = 37/93 (38%), Gaps = 17/93 (18%) Query: 159 HFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI-IKNGGIQTESS 217 +F S+N+ +S L E M + E C L P ++I N I +S+ Sbjct: 642 NFTSKNEQEKISNDKL-----EVMVIKTVSTLCETCREELTPYLMHFISFLNTVIMPDSN 696 
Query: 218 YPYTAETG--------TQCNFNSANIGPEEQAK 242 pattern 237 **** + T QC ++ GPEEQAK Sbjct: 697 VSHFTRTKLVRSIGYVVQCQVSN---GPEEQAK 726 >sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR (GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN) Length = 1034 Score = 6.8 bits (22), Expect = 9.6 Identities = 20/79 (25%), Positives = 38/79 (47%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ K++ F +++ I +T P +I D++++ + G+ + + Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 134 MQD-------LASKNKIFK 145 Database: /home/peter/blast/data/swissprot Posted date: Oct 10, 2000 10:43 AM Number of letters in database: 31,984,247 Number of sequences in database: 88,780 Lambda K H C 0.270 0.0470 0.230 0.500 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1047 Number of Sequences: 88780 Number of extensions: 1047 Number of successful extensions: 36 Number of sequences better than 10.0: 36 Number of HSP's better than 10.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 351 length of database: 31,984,247 effective HSP length: 50 effective length of query: 301 effective length of database: 27,545,247 effective search space: 8291119347 effective search space used: 8291119347 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.6 bits) S2: 65 (29.7 bits) crab.nj100644000766000024 161414006427353 15636 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data 13 sequences 1 A-salina 2 C-vittat 3 C-sp. 
4 L-aequit 5 P-camtsc 6 E-tenuim 7 L-splend 8 P-bernha 9 P-acadia 10 P-p(NE) 11 P-p(GU) 12 P-l(NE) 13 P-l(GU) 14 and 2 0.087619 14 and 3 0.108092 15 and 1 0.155362 15 and 14 0.020241 16 and 10 0.011208 16 and 11 0.004878 17 and 12 0.002136 17 and 13 0.000545 18 and 16 0.016086 18 and 17 0.011394 19 and 4 0.006423 19 and 5 0.006982 20 and 15 0.056669 20 and 18 0.028117 21 and 8 0.003267 21 and 9 0.002095 22 and 6 0.011003 22 and 21 0.010445 23 and 20 0.018557 23 and 19 0.020149 24 and 22 0.012944 24 and 7 0.024589 24 and 23 0.007331 file:crab.dat NJ tree p-distance was used. Number of nucleotide sites compared 373 (nsite=421) seed=1850 dna2.fa100644000766000024 110514006427353 15525 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>Test1 AGCTTTTCATTCTGACTGCAACGGGCAATATGTCTCTGTGTGGATTAAAAAAAGAGTGTCTGATAGCAGC TTCTGAACTGGTTACCTGCCGTGAGTAAATTAAAATTTTATTGACTTAGGTCACTAAATACTTTAACCAA TATAGGCATAGCGCACAGACAGATAAAAATTACAGAGTACACAACATCCATGAAACGCATTAGCACCACC ATTACCACCACCATCACCATTACCACAGGTAACGGTGCGGGCTGACGCGTACAGGAAACACAGAAAAAAG >Test2 CCCGCACCTGACAGTGCGGGCTTTTTTTTTCGACCAAAGGTAACGAGGTAACAACCATGCGAGTGTTGAA GTTCGGCGGTACATCAGTGGCAAATGCAGAACGTTTTCTGCGTGTTGCCGATATTCTGGAAAGCAATGCC AGGCAGGGGCAGGTGGCCACCGTCCTCTCTGCCCCCGCCAAAATCACCAACCACCTGGTGGCGATGATTG AAAAAACCATTAGCGGCCAGGATGCTTTACCCAATATCAGCGATGCCGAACGTATTTTTGCCGAACTTTTtmp.fst100644000766000024 113014006427353 15705 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>TEST00001 EOL aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa >TEST00002 EOL ccccccccccccccccccccccccccccccccccccccccccccccccc ccccccccccccccccccccccccccccccccccccccccccccccccc >TEST00003 EOL ggggggggggggggggggggggggggggggggggggggggggggggggg ggggggggggggggggggggggggggggggggggggggggggggggggg >TEST00004 EOL ttttttttttttttttttttttttttttttttttttttttttttttttt ttttttttttttttttttttttttttttttttttttttttttttttttt seg.out100644000766000024 30614006427353 15662 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>LBL_0012(32-46) complexity=2.47 (12/2.20/2.50) gdggwtfegwggppe >LBL_0012(66-80) complexity=2.31 (12/2.20/2.50) kfssrasakavakks >LBL_0012(123-138) complexity=2.31 (12/2.20/2.50) svivsqsqgvvkgvgv lib000755000766000024 014006427353 13626 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8BioPerl.pm100644000766000024 1612414006427353 15704 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/libpackage BioPerl; $BioPerl::VERSION = '1.7.8'; use utf8; use strict; use warnings; # ABSTRACT: Perl modules for biology # AUTHOR: See the individual modules for their authors. # OWNER: many people (see the individual modules for their copyright holders) # LICENSE: Perl_5 1; __END__ =pod =encoding UTF-8 =head1 NAME BioPerl - Perl modules for biology =head1 VERSION version 1.7.8 =head1 SYNOPSIS If you're new to BioPerl, you should start reading the BioPerl HOWTO's: L =head2 References for Individual Modules For ease of maintenance and coordination amongst contributors, BioPerl code is maintained in a modular form, as is the documentation. Refer to the documentation for individual modules by using perldoc, i.e. C to get documentation for the Bio::Seq object. =head1 DESCRIPTION BioPerl is the product of a community effort to produce Perl code which is useful in biology. Examples include Sequence objects, Alignment objects and database searching objects. These objects not only do what they are advertised to do in the documentation, but they also interact - Alignment objects are made from the Sequence objects, Sequence objects have access to Annotation and SeqFeature objects and databases, Blast objects can be converted to Alignment objects, and so on. This means that the objects provide a coordinated and extensible framework to do computational biology. BioPerl development focuses on Perl classes, or code that is used to create objects representing biological entities. 
There are scripts provided in the scripts/ and examples/ directories but scripts are not the main focus of the BioPerl developers. Of course, as the objects do most of the hard work for you, all you have to do is combine a number of objects together sensibly to make useful scripts. The intent of the BioPerl development effort is to make reusable tools that aid people in creating their own sites or job-specific applications. The BioPerl website at L also attempts to maintain links and archives of standalone bio-related Perl tools that are not affiliated or related to the core BioPerl effort. Check the site for useful code ideas and contribute your own if possible. =head1 INSTALLATION The BioPerl modules are distributed as a tar file that expands into a standard perl CPAN distribution. Detailed installation directions can be found in the distribution INSTALL file. Installing on windows using ActiveState Perl is covered in the INSTALL.WIN file. We highly suggest reading the installation instructions on the BioPerl website: L Note that only the following are supported at this time with the current API: =over 3 =item BioPerl-db =item BioPerl-network =item BioPerl-run =item BioPerl-pedigree =item Bio::Graphics =back =head1 GETTING STARTED The distribution I directory has working scripts for use with BioPerl, check the self-described I directory as well. You are more than welcome to contribute your script! If you have installed BioPerl in the standard way, as detailed in the INSTALL in the distribution, these scripts should work by just running them. =head1 GETTING INVOLVED BioPerl is a completely open community of developers. We are not funded and we don't have a mission statement. We encourage collaborative code, in particular in Perl. You can help us in many different ways, from just a simple statement about how you have used BioPerl to doing something interesting to contributing a whole new object hierarchy. See L for more information. 
Here are some ways of helping us: =head2 Asking questions and telling us you used it We are very interested to hear how you experienced using BioPerl. Did it install cleanly? Did you understand the documentation? Could you get the objects to do what you wanted them to do? If BioPerl was useless we want to know why, and if it was great - that too. Post a message to B, the BioPerl mailing list, where all the developers are. Only by getting people's feedback do we know whether we are providing anything useful. =head2 Writing a script that uses it By writing a good script that uses BioPerl you both show that BioPerl is useful and probably save someone elsewhere writing it. If you contribute it to the 'script central' at L then other people can view and use it. Don't be nervous if you've never done this sort of work, advice is freely given and all are welcome! =head2 Find bugs! We know that there are bugs in this code. If you find something which you are pretty sure is a problem, post a bug report using our Bugzilla tracking system: L Please read the main bug tracking (L) for an overview of what we expect in a bug report. Specifically, having a code and data example where appropriate helps tremendously. We gladly accept all patches after a quick code review. =head2 Suggest new functionality You can suggest areas where the objects are not ideally written and could be done better. The best way is to find the main developer of the module (each module was written principally by one person, except for Seq.pm). Talk to him or her and suggest changes. =head2 Make your own objects If you can make a useful object we will happily include it into the core. Probably you will want to read a lot of the documentation in L and talk to people on the BioPerl mailing list, B. =head2 Writing documentation We appreciate good documentation. It's what tells the world what's in BioPerl, it's what instructs the user, it's what describes the rationale and inner workings of the package. 
Feel free to contribute.

=head1 ACKNOWLEDGEMENTS

For a more detailed history of the BioPerl project, we recommend the
History of BioPerl: L

=head1 FEEDBACK

=head2 Mailing lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org              - General discussion
  https://bioperl.org/Support.html   - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:
I
rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR

See the individual modules for their authors.

=head1 COPYRIGHT

This software is copyright (c) by many people (see the individual modules for their copyright holders).

This software is available under the same terms as the perl 5 programming language system itself.
=cut

Bio000755000766000024 014006427353 14337 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/libSeq.pm100644000766000024 12020414006427353 15624 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio
#
# BioPerl module for Bio::Seq
#
# Please direct questions and support issues to
#
# Cared for by Ewan Birney
#
# Copyright Ewan Birney
#
# You may distribute this module under the same terms as perl itself

# POD documentation - main docs before the code

=head1 NAME

Bio::Seq - Sequence object, with features

=head1 SYNOPSIS

    # This is the main sequence object in Bioperl

    # gets a sequence from a file
    $seqio  = Bio::SeqIO->new( '-format' => 'embl' , -file => 'myfile.dat');
    $seqobj = $seqio->next_seq();

    # SeqIO can both read and write sequences; see Bio::SeqIO
    # for more information and examples

    # get from database
    $db = Bio::DB::GenBank->new();
    $seqobj = $db->get_Seq_by_acc('X78121');

    # make from strings in script
    $seqobj = Bio::Seq->new( -display_id => 'my_id',
                             -seq => $sequence_as_string);

    # gets sequence as a string from sequence object
    $seqstr   = $seqobj->seq(); # actual sequence as a string
    $seqstr   = $seqobj->subseq(10,50); # slice in biological coordinates

    # retrieves information from the sequence
    # features must implement Bio::SeqFeatureI interface

    @features = $seqobj->get_SeqFeatures(); # just top level
    foreach my $feat ( @features ) {
        print "Feature ",$feat->primary_tag," starts ",$feat->start," ends ",
            $feat->end," strand ",$feat->strand,"\n";

        # features retain link to underlying sequence object
        print "Feature sequence is ",$feat->seq->seq(),"\n"
    }

    # sequences may have a species

    if( defined $seqobj->species ) {
        my $species = $seqobj->species;
        print "Sequence is from ",$species->binomial," [",$species->common_name,"]\n";
    }

    # annotation objects are Bio::AnnotationCollectionI's
    $ann      = $seqobj->annotation(); # annotation object

    # references is one type of annotations to get. Also get
    # comment and dblink.
    # Look at Bio::Annotation::Collection for
    # more information

    foreach my $ref ( $ann->get_Annotations('reference') ) {
        print "Reference ",$ref->title,"\n";
    }

    # you can get truncations, translations and reverse complements, these
    # all give back Bio::Seq objects themselves, though currently with no
    # features transferred

    my $trunc = $seqobj->trunc(100,200);
    my $rev   = $seqobj->revcom();

    # there are many options to translate - check out the docs
    my $trans = $seqobj->translate();

    # these functions can be chained together

    my $trans_trunc_rev = $seqobj->trunc(100,200)->revcom->translate();

=head1 DESCRIPTION

A Seq object is a sequence with sequence features placed on it. The
Seq object contains a PrimarySeq object for the actual sequence and
also implements its interface.

In Bioperl there are three main classes that people use frequently:

  Bio::PrimarySeq  - just the sequence and its names, nothing else.
  Bio::SeqFeatureI - a feature on a sequence, potentially with a sequence
                     and a location and annotation.
  Bio::Seq         - A sequence and a collection of sequence features
                     (an aggregate) with its own annotation.

Although Bioperl is not tied heavily to file formats, these distinctions do
map to file formats sensibly, and for some bioinformaticians this might help:

  Bio::PrimarySeq  - Fasta file of a sequence
  Bio::SeqFeatureI - A single entry in an EMBL/GenBank/DDBJ feature table
  Bio::Seq         - A single EMBL/GenBank/DDBJ entry

By having this split we avoid a lot of nasty circular references
(sequence features can hold a reference to a sequence without the sequence
holding a reference to the sequence feature). See L and
L for more information.

Ian Korf really helped in the design of the Seq and SeqFeature system.
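
The division of labour described above can be seen in a small in-memory sketch. It assumes BioPerl is installed; the identifier C<example1>, the residues and the exon coordinates are made up for illustration and do not come from any real entry.

```perl
use strict;
use warnings;
use Bio::Seq;
use Bio::SeqFeature::Generic;

# Hypothetical 24-residue DNA sequence, built from a string.
my $seqobj = Bio::Seq->new(
    -display_id => 'example1',
    -alphabet   => 'dna',
    -seq        => 'ATGGCCATTGTAATGGGCCGCTGA',
);

# The raw residues live in a contained PrimarySeq object.
my $pseq = $seqobj->primary_seq;
print ref($pseq), " holds ", $pseq->length, " residues\n";

# A feature is a separate object carrying a location and a tag.
my $exon = Bio::SeqFeature::Generic->new(
    -start       => 4,
    -end         => 9,
    -strand      => 1,
    -primary_tag => 'exon',
);

# Adding the feature to the Seq also attaches the sequence to the feature,
# so the feature can report its own sub-sequence.
$seqobj->add_SeqFeature($exon);

for my $feat ( $seqobj->get_SeqFeatures ) {
    print $feat->primary_tag, " ", $feat->start, "..", $feat->end,
          " ", $feat->seq->seq, "\n";
}
```

Bio::SeqFeature::Generic is used here only because it is a ready-made implementation of Bio::SeqFeatureI; any other class implementing that interface should behave the same way in this loop.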

=head2 Examples

A simple and fundamental block of code:

    use Bio::SeqIO;

    my $seqIOobj = Bio::SeqIO->new(-file=>"1.fa"); # create a SeqIO object
    my $seqobj = $seqIOobj->next_seq;              # get a Seq object

With the Seq object in hand one has access to a powerful set of Bioperl
methods and related Bioperl objects. This next script will take a file of sequences
in EMBL format and create a file of the reverse-complemented sequences
in Fasta format using Seq objects. It also prints out details about the
exons it finds as sequence features in General Feature Format (GFF).

    use Bio::Seq;
    use Bio::SeqIO;

    $seqin = Bio::SeqIO->new( -format => 'EMBL' , -file => 'myfile.dat');
    $seqout= Bio::SeqIO->new( -format => 'Fasta', -file => '>output.fa');

    while((my $seqobj = $seqin->next_seq())) {
        print "Seen sequence ",$seqobj->display_id,", start of seq ",
              substr($seqobj->seq,0,10),"\n";
        if( $seqobj->alphabet eq 'dna') {
            $rev = $seqobj->revcom;
            $id  = $seqobj->display_id();
            $id  = "$id.rev";
            $rev->display_id($id);
            $seqout->write_seq($rev);
        }

        foreach $feat ( $seqobj->get_SeqFeatures() ) {
            if( $feat->primary_tag eq 'exon' ) {
                print STDOUT "Location ",$feat->start,":",
                      $feat->end," GFF[",$feat->gff_string,"]\n";
            }
        }
    }

Let's examine the script. The lines below import the Bioperl modules.
Seq is the main Bioperl sequence object and SeqIO is the Bioperl support
for reading sequences from files and writing them to files:

    use Bio::Seq;
    use Bio::SeqIO;

These two lines create two SeqIO streams: one for reading in sequences
and one for outputting sequences:

    $seqin = Bio::SeqIO->new( -format => 'EMBL' , -file => 'myfile.dat');
    $seqout= Bio::SeqIO->new( -format => 'Fasta', -file => '>output.fa');

Notice that in the "$seqout" case there is a greater-than sign,
indicating the file is being opened for writing.

Using the '-argument' => value syntax is common in Bioperl. The file
argument is like an argument to open().
You can also pass in filehandles or
FileHandle objects by using the -fh argument (see L
documentation for details). Many formats in Bioperl are handled,
including Fasta, EMBL, GenBank, Swissprot (swiss), PIR, and GCG.

    $seqin = Bio::SeqIO->new( -format => 'EMBL' , -file => 'myfile.dat');
    $seqout= Bio::SeqIO->new( -format => 'Fasta', -file => '>output.fa');

This is the main loop, which steps through the sequences in the input
file; each call to $seqin->next_seq() provides a new Seq
object from the file:

    while((my $seqobj = $seqin->next_seq())) {

The print line below accesses fields in the Seq object directly. The
$seqobj->display_id method is the way to access the display_id attribute
of the Seq object. The $seqobj->seq method returns the actual sequence
as a string. You can then manipulate this string if you want to
(there are, however, easier ways of doing truncation, reverse-complement
and translation). Note that substr() counts offsets from 0, so an offset
of 0 and a length of 10 give the first ten residues.

        print "Seen sequence ",$seqobj->display_id,", start of seq ",
              substr($seqobj->seq,0,10),"\n";

Bioperl has to guess the alphabet of the sequence, being either 'dna',
'rna', or 'protein'. The alphabet attribute is one of these three
possibilities.

        if( $seqobj->alphabet eq 'dna') {

The $seqobj->revcom method provides the reverse complement of the Seq
object as another Seq object. Thus, the $rev variable is a reference to
another Seq object. For example, one could repeat the above print line
for this Seq object (putting $rev in place of $seqobj). In this
case we are going to output the object into the file stream we built
earlier on.

            $rev = $seqobj->revcom;

When we output it, we want the id of the outputted object
to be changed to "$id.rev", i.e., with .rev on the end of the name. The
following lines retrieve the id of the sequence object, add .rev
to this and then set the display_id of the rev sequence object to
this. Notice that to set the display_id attribute you just need to
call the same method, display_id(), with the new value as an argument.
Getting and setting values with the same method is common in Bioperl.

    $id = $seqobj->display_id();
    $id = "$id.rev";
    $rev->display_id($id);

The write_seq method on the SeqIO output object, $seqout, writes the
$rev object to the filestream we built at the top of the script. The
filestream knows that it is outputting in fasta format, and so it
provides fasta output.

    $seqout->write_seq($rev);

This block of code loops over sequence features in the sequence object,
trying to find ones that have been tagged as 'exon'. Features have start
and end attributes and can be output in General Feature Format, GFF, a
standardized format for sequence features.

    foreach $feat ( $seqobj->get_SeqFeatures() ) {
        if( $feat->primary_tag eq 'exon' ) {
            print STDOUT "Location ",$feat->start,":",
                  $feat->end," GFF[",$feat->gff_string,"]\n";
        }
    }

The code above shows how a few Bio::Seq methods suffice to read, parse,
reformat and analyze sequences from a file. A full list of methods
available to Bio::Seq objects is shown below. Bear in mind that some of
these methods come from PrimarySeq objects, which are simpler than Seq
objects, stripped of features (see L<Bio::PrimarySeq> for more
information).

    # these methods return strings, and accept strings in some cases:

    $seqobj->seq();              # string of sequence
    $seqobj->subseq(5,10);       # part of the sequence as a string
    $seqobj->accession_number(); # when there, the accession number
    $seqobj->alphabet();         # one of 'dna','rna',or 'protein'
    $seqobj->version()           # when there, the version
    $seqobj->keywords();         # when there, the Keywords line
    $seqobj->length()            # length
    $seqobj->desc();             # description
    $seqobj->primary_id();       # a unique id for this sequence regardless
                                 # of its display_id or accession number
    $seqobj->display_id();       # the human readable id of the sequence

Some of these values map to fields in common formats.
For example, the display_id() method returns the LOCUS name of a Genbank
entry, the (\S+) following the E<gt> character in a Fasta file, the ID
from a SwissProt file, and so on. The desc() method will return the
DEFINITION line of a Genbank file, the description following the
display_id in a Fasta file, and the DE field in a SwissProt file.

    # the following methods return new Seq objects, but
    # do not transfer features across to the new object:

    $seqobj->trunc(5,10)  # truncation from 5 to 10 as new object
    $seqobj->revcom       # reverse complements sequence
    $seqobj->translate    # translation of the sequence

    # if new() can be called this method returns 1, else 0

    $seqobj->can_call_new

    # the following method determines if the given string will be accepted
    # by the seq() method - if the string is acceptable then validate_seq()
    # returns 1, or 0 if not

    $seqobj->validate_seq($string)

    # the following method returns or accepts a Species object:

    $seqobj->species();

Please see L<Bio::Species> for more information on this object.

    # the following method returns or accepts an Annotation object
    # which in turn allows access to Annotation::Reference
    # and Annotation::Comment objects:

    $seqobj->annotation();

These annotations typically refer to entire sequences, unlike features.
See L<Bio::AnnotationCollectionI>, L<Bio::Annotation::Collection>,
L<Bio::Annotation::Reference>, and L<Bio::Annotation::Comment> for
details.

It is also important to be able to describe defined portions of a
sequence. The combination of some description and the corresponding
sub-sequence is called a feature. An exon and its coordinates within a
gene is an example of a feature, as is a domain within a protein.
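As a rough illustration of the idea of a feature, the following sketch
attaches a hand-made exon feature to a short sequence and reads back the
sub-sequence it covers. It assumes Bio::SeqFeature::Generic as the
concrete feature class; the id and the coordinates are invented for the
example and do not come from any real entry.

```perl
use strict;
use warnings;
use Bio::Seq;
use Bio::SeqFeature::Generic;

# A short DNA sequence; the id 'example_seq' is made up for this sketch.
my $seq = Bio::Seq->new(
    -seq      => 'ATGGGGGTGGTGGTACCCT',
    -id       => 'example_seq',
    -alphabet => 'dna',
);

# A feature is a description (primary_tag) plus coordinates on the sequence.
# The coordinates 4..9 are arbitrary illustration values.
my $exon = Bio::SeqFeature::Generic->new(
    -start       => 4,
    -end         => 9,
    -strand      => 1,
    -primary_tag => 'exon',
);

# Attaching the feature also attaches the sequence to the feature,
# so the feature can return the sub-sequence it spans.
$seq->add_SeqFeature($exon);

print $exon->primary_tag, " ", $exon->start, "..", $exon->end,
      " ", $exon->seq->seq, "\n";
```

The feature only stores coordinates and a description; the bases are
looked up on the attached sequence when $exon-E<gt>seq is called.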
    # the following methods return an array of SeqFeatureI objects:

    $seqobj->get_SeqFeatures     # The 'top level' sequence features
    $seqobj->get_all_SeqFeatures # All sequence features, including sub-seq
                                 # features, such as features in an exon

    # to find out the number of features use:

    $seqobj->feature_count

Here are just some of the methods available to SeqFeatureI objects:

    # these methods return numbers:

    $feat->start          # start position (1 is the first base)
    $feat->end            # end position (2 is the second base)
    $feat->strand         # 1 means forward, -1 reverse, 0 not relevant

    # these methods return or accept strings:

    $feat->primary_tag    # the name of the sequence feature, eg
                          # 'exon', 'glycosylation site', 'TM domain'
    $feat->source_tag     # where the feature comes from, eg, 'EMBL_GenBank',
                          # or 'BLAST'

    # this method returns the more austere PrimarySeq object, not a
    # Seq object - the main difference is that PrimarySeq objects do not
    # themselves contain sequence features

    $feat->seq            # the sequence between start,end on the
                          # correct strand of the sequence

See L<Bio::PrimarySeq> for more details on PrimarySeq objects.

    # useful methods for feature comparisons, for start/end points

    $feat->overlaps($other)  # do $feat and $other overlap?
    $feat->contains($other)  # is $other completely within $feat?
    $feat->equals($other)    # do $feat and $other completely agree?

    # one can also add features

    $seqobj->add_SeqFeature($feat)     # returns 1 if successful

    # sub features. For complex join() statements, the feature
    # is one sequence feature with many sub SeqFeatures

    $feat->sub_SeqFeature  # returns array of sub seq features

Please see L<Bio::SeqFeatureI> and L<Bio::SeqFeature::Generic> for more
information on sequence features.
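The comparison and retrieval methods listed above can be tried out on
hand-made features. This is a minimal sketch, assuming
Bio::SeqFeature::Generic as the feature class; the sequence id, tags and
coordinates are invented for illustration.

```perl
use strict;
use warnings;
use Bio::Seq;
use Bio::SeqFeature::Generic;

# Hypothetical sequence and features, used only to exercise the methods.
my $seq = Bio::Seq->new( -seq => 'ATGGGGGTGGTGGTACCCT', -id => 'example_seq' );

my $gene = Bio::SeqFeature::Generic->new(
    -start => 1, -end => 15, -strand => 1, -primary_tag => 'gene' );
my $exon = Bio::SeqFeature::Generic->new(
    -start => 4, -end => 9,  -strand => 1, -primary_tag => 'exon' );

# Features are added one at a time.
$seq->add_SeqFeature($gene);
$seq->add_SeqFeature($exon);

print "Number of features: ", $seq->feature_count, "\n";

# Range comparisons between two features.
print "gene overlaps exon\n" if $gene->overlaps($exon);
print "gene contains exon\n" if $gene->contains($exon);
print "gene equals exon\n"   if $gene->equals($exon);

# get_SeqFeatures() accepts an optional primary tag to filter on.
foreach my $feat ( $seq->get_SeqFeatures('exon') ) {
    print $feat->primary_tag, " at ", $feat->start, "..", $feat->end, "\n";
}
```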
It is worth mentioning that one can also retrieve the start and end
positions of a feature using a Bio::LocationI object:

    $location = $feat->location # $location is a Bio::LocationI object
    $location->start;           # start position
    $location->end;             # end position

This is useful because one needs a Bio::Location::SplitLocationI object
in order to retrieve the coordinates inside the Genbank or EMBL join()
statements (e.g. "CDS    join(51..142,273..495,1346..1474)"):

    if ( $feat->location->isa('Bio::Location::SplitLocationI') &&
         $feat->primary_tag eq 'CDS' ) {
        foreach $loc ( $feat->location->sub_Location ) {
            print $loc->start . ".." . $loc->end . "\n";
        }
    }

See L<Bio::LocationI> and L<Bio::Location::SplitLocationI> for more
information.

=head1 Implemented Interfaces

This class implements the following interfaces.

=over 4

=item Bio::SeqI

Note that this includes implementing Bio::PrimarySeqI.

=item Bio::IdentifiableI

=item Bio::DescribableI

=item Bio::AnnotatableI

=item Bio::FeatureHolderI

=back

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Ewan Birney, inspired by Ian Korf objects

Email birney@ebi.ac.uk

=head1 CONTRIBUTORS

Jason Stajich E<lt>jason@bioperl.orgE<gt>
Mark A.
Jensen maj -at- fortinbras -dot- us

=head1 APPENDIX

The rest of the documentation details each of the object methods.
Internal methods are usually preceded with a "_".

=cut

#'
# Let the code begin...

package Bio::Seq;
$Bio::Seq::VERSION = '1.7.8';
use strict;
use Carp;

use Bio::Annotation::Collection;
use Bio::PrimarySeq;

use base qw(Bio::Root::Root Bio::SeqI Bio::IdentifiableI
            Bio::DescribableI Bio::AnnotatableI Bio::FeatureHolderI
            Bio::AnnotationCollectionI);

=head2 new

 Title   : new
 Usage   : $seq = Bio::Seq->new( -seq => 'ATGGGGGTGGTGGTACCCT',
                                 -id  => 'human_id',
                                 -accession_number => 'AL000012',
                               );

 Function: Returns a new Seq object from
           basic constructors, being a string for the sequence
           and strings for id and accession_number
 Returns : a new Bio::Seq object

=cut

sub new {
    my($caller,@args) = @_;

    if( $caller ne 'Bio::Seq') {
        $caller = ref($caller) if ref($caller);
    }

    # we know our inheritance hierarchy
    my $self = Bio::Root::Root->new(@args);
    bless $self,$caller;

    # this is way too sneaky probably. We delegate the construction of
    # the Seq object onto PrimarySeq and then pop primary_seq into
    # our primary_seq slot

    my $pseq = Bio::PrimarySeq->new(@args);

    # as we have just made this, we know it is ok to set hash directly
    # rather than going through the method

    $self->{'primary_seq'} = $pseq;

    # setting this array is now delayed until the final
    # moment, again speed ups for non feature containing things
    # $self->{'_as_feat'} = [];

    my ($ann, $pid,$feat,$species) =
        &Bio::Root::RootI::_rearrange($self,[qw(ANNOTATION PRIMARY_ID FEATURES SPECIES)], @args);

    # for a number of cases - reading fasta files - these are never set. This
    # gives a quick optimisation around testing things later on

    if( defined $ann || defined $pid || defined $feat || defined $species ) {
        $pid     && $self->primary_id($pid);
        $species && $self->species($species);
        $ann     && $self->annotation($ann);

        if( defined $feat ) {
            if( ref($feat) !~ /ARRAY/i ) {
                if( ref($feat) && $feat->isa('Bio::SeqFeatureI') ) {
                    $self->add_SeqFeature($feat);
                } else {
                    $self->warn("Must specify a valid Bio::SeqFeatureI or ArrayRef of Bio::SeqFeatureI's with the -features init parameter for ".ref($self));
                }
            } else {
                foreach my $feature ( @$feat ) {
                    $self->add_SeqFeature($feature);
                }
            }
        }
    }

    return $self;
}

=head1 PrimarySeq interface

The PrimarySeq interface provides the basic sequence getting
and setting methods for all sequences.

These methods implement the Bio::PrimarySeq interface by delegating
to the primary_seq inside the object. This means that you
can use a Seq object wherever there is a PrimarySeq, and
of course, you are free to use these functions anyway.

=cut

=head2 seq

 Title   : seq
 Usage   : $string = $obj->seq()
 Function: Get/Set the sequence as a string of letters. The
           case of the letters is left up to the implementer.
           Suggested cases are upper case for proteins and lower case for
           DNA sequence (IUPAC standard),
           but implementations are suggested to keep an open mind about
           case (some users... want mixed case!)
 Returns : A scalar
 Args    : Optionally on set the new value (a string). An optional second
           argument presets the alphabet (otherwise it will be guessed).
           Both parameters may also be given in named parameter style
           with -seq and -alphabet being the names.

=cut

sub seq {
    return shift->primary_seq()->seq(@_);
}

=head2 validate_seq

 Title   : validate_seq
 Usage   : if(! $seqobj->validate_seq($seq_str) ) {
                print "sequence $seq_str is not valid for an object of
                alphabet ",$seqobj->alphabet, "\n";
           }
 Function: Test that the given sequence is valid, i.e. contains only valid
           characters.
           The allowed characters are all letters (A-Z) and '-','.',
           '*','?','=' and '~'. Spaces are not valid. Note that this
           implementation does not take alphabet() into account.
 Returns : 1 if the supplied sequence string is valid, 0 otherwise.
 Args    : - Sequence string to be validated
           - Boolean to throw an error if the sequence is invalid

=cut

sub validate_seq {
    return shift->primary_seq()->validate_seq(@_);
}

=head2 length

 Title   : length
 Usage   : $len = $seq->length()
 Function:
 Example :
 Returns : Integer representing the length of the sequence.
 Args    : None

=cut

sub length {
    return shift->primary_seq()->length(@_);
}

=head1 Methods from the Bio::PrimarySeqI interface

=head2 subseq

 Title   : subseq
 Usage   : $substring = $obj->subseq(10,40);
 Function: Returns the subseq from start to end, where the first base
           is 1 and the number is inclusive, ie 1-2 are the first two
           bases of the sequence

           Start cannot be larger than end but can be equal

 Returns : A string
 Args    : 2 integers

=cut

sub subseq {
    return shift->primary_seq()->subseq(@_);
}

=head2 display_id

 Title   : display_id
 Usage   : $id = $obj->display_id or $obj->display_id($newid);
 Function: Gets or sets the display id, also known as the common name of
           the Seq object.

           The semantics of this is that it is the most likely string
           to be used as an identifier of the sequence, and likely to
           have "human" readability. The id is equivalent to the LOCUS
           field of the GenBank/EMBL databanks and the ID field of the
           Swissprot/sptrembl database. In fasta format, the >(\S+) is
           presumed to be the id, though some people overload the id
           to embed other information. Bioperl does not use any
           embedded information in the ID field, and people are
           encouraged to use other mechanisms (accession field for
           example, or extending the sequence object) to solve this.

           Notice that $seq->id() maps to this function, mainly for
           legacy/convenience issues.
 Returns : A string
 Args    : None or a new id

=cut

sub display_id {
    return shift->primary_seq->display_id(@_);
}

=head2 accession_number

 Title   : accession_number
 Usage   : $unique_biological_key = $obj->accession_number;
 Function: Returns the unique biological id for a sequence, commonly
           called the accession_number. For sequences from established
           databases, the implementors should try to use the correct
           accession number. Notice that primary_id() provides the
           unique id for the implementation, allowing multiple objects
           to have the same accession number in a particular implementation.

           For sequences with no accession number, this method should return
           "unknown".

           Can also be used to set the accession number.
 Example : $key = $seq->accession_number or $seq->accession_number($key)
 Returns : A string
 Args    : None or an accession number

=cut

sub accession_number {
    return shift->primary_seq->accession_number(@_);
}

=head2 desc

 Title   : desc
 Usage   : $seqobj->desc($string) or $seqobj->desc()
 Function: Sets or gets the description of the sequence
 Example :
 Returns : The description
 Args    : The description or none

=cut

sub desc {
    return shift->primary_seq->desc(@_);
}

=head2 primary_id

 Title   : primary_id
 Usage   : $unique_implementation_key = $obj->primary_id;
 Function: Returns the unique id for this object in this
           implementation. This allows implementations to manage
           their own object ids in a way the implementation can control;
           clients can expect one id to map to one object.

           For sequences with no natural id, this method should return
           a stringified memory location.

           Can also be used to set the primary_id (or unset to undef).

           [Note this method name is likely to change in 1.3]

 Example : $id = $seq->primary_id or $seq->primary_id($id)
 Returns : A string
 Args    : None or an id, or undef to unset the primary id.

=cut

sub primary_id {
    # Note: this used to not delegate to the primary seq. This is
    # really bad in very subtle ways. E.g., if you created the object
    # with a primary id given to the constructor and then later you
    # change the primary id, if this method wouldn't delegate you'd
    # have different values for primary id in the PrimarySeq object
    # compared to this instance. Not good.

    # I can't remember why not delegating was ever deemed
    # advantageous, but I hereby claim that its problems far outweigh
    # its advantages, if there are any. Convince me otherwise if you
    # disagree. HL 2004/08/05

    return shift->primary_seq->primary_id(@_);
}

=head2 can_call_new

 Title   : can_call_new
 Usage   : if ( $obj->can_call_new ) {
               $newobj = $obj->new( %param );
           }
 Function: can_call_new returns 1 or 0 depending
           on whether an implementation allows new
           constructor to be called. If a new constructor
           is allowed, then it should take the following hashed
           constructor list.

           $myobject->new( -seq => $sequence_as_string,
                           -display_id  => $id,
                           -accession_number => $accession,
                           -alphabet => 'dna',
                         );
 Example :
 Returns : 1 or 0
 Args    : None

=cut

sub can_call_new {
    return 1;
}

=head2 alphabet

 Title   : alphabet
 Usage   : if ( $obj->alphabet eq 'dna' ) { /Do Something/ }
 Function: Get/Set the type of sequence being one of
           'dna', 'rna' or 'protein'. This is case sensitive.

           This is not called 'type' because this would cause
           upgrade problems from the 0.5 and earlier Seq objects.

 Returns : A string either 'dna','rna','protein'. NB - the object must
           make a call of the type - if there is no type specified it
           has to guess.
 Args    : optional string to set : 'dna' | 'rna' | 'protein'

=cut

sub alphabet {
    my $self = shift;
    return $self->primary_seq->alphabet(@_) if @_ && defined $_[0];
    return $self->primary_seq->alphabet();
}

=head2 is_circular

 Title   : is_circular
 Usage   : if( $obj->is_circular) { /Do Something/ }
 Function: Returns true if the molecule is circular
 Returns : Boolean value
 Args    : none

=cut

sub is_circular {
    return shift->primary_seq()->is_circular(@_);
}

=head1 Methods for Bio::IdentifiableI compliance

=head2 object_id

 Title   : object_id
 Usage   : $string = $obj->object_id()
 Function: a string which represents the stable primary identifier
           in this namespace of this object. For DNA sequences this
           is its accession_number, similarly for protein sequences

           This is aliased to accession_number().
 Returns : A scalar

=cut

sub object_id {
    return shift->accession_number(@_);
}

=head2 version

 Title   : version
 Usage   : $version = $obj->version()
 Function: a number which differentiates between versions of
           the same object.
           Higher numbers are considered to be later and more
           relevant, but a single object described by the same
           identifier should represent the same concept
 Returns : A number

=cut

sub version{
    return shift->primary_seq->version(@_);
}

=head2 authority

 Title   : authority
 Usage   : $authority = $obj->authority()
 Function: a string which represents the organisation which
           granted the namespace, written as the DNS name for
           organisation (eg, wormbase.org)
 Returns : A scalar

=cut

sub authority {
    return shift->primary_seq()->authority(@_);
}

=head2 namespace

 Title   : namespace
 Usage   : $string = $obj->namespace()
 Function: A string representing the name space this identifier
           is valid in, often the database name or the name
           describing the collection
 Returns : A scalar

=cut

sub namespace{
    return shift->primary_seq()->namespace(@_);
}

=head1 Methods for Bio::DescribableI compliance

=head2 display_name

 Title   : display_name
 Usage   : $string = $obj->display_name()
 Function: A string which is what should be displayed to the user.
           The string should have no spaces (ideally, though a cautious
           user of this interface would not assume this) and should be
           less than thirty characters (though again, double checking
           this is a good idea)

           This is aliased to display_id().
 Returns : A scalar

=cut

sub display_name {
    return shift->display_id(@_);
}

=head2 description

 Title   : description
 Usage   : $string = $obj->description()
 Function: A text string suitable for displaying to the user a
           description. This string is likely to have spaces, but
           should not have any newlines or formatting - just plain
           text. The string should not be greater than 255 characters
           and clients can feel justified at truncating strings at 255
           characters for the purposes of display

           This is aliased to desc().
 Returns : A scalar

=cut

sub description {
    return shift->desc(@_);
}

=head1 Methods for implementing Bio::AnnotatableI

=head2 annotation

 Title   : annotation
 Usage   : $ann = $seq->annotation or
           $seq->annotation($ann)
 Function: Gets or sets the annotation
 Returns : Bio::AnnotationCollectionI object
 Args    : None or Bio::AnnotationCollectionI object

See L<Bio::AnnotationCollectionI> and L<Bio::Annotation::Collection>
for more information

=cut

sub annotation {
    my ($obj,$value) = @_;
    if( defined $value ) {
        $obj->throw("object of class ".ref($value)." does not implement ".
                    "Bio::AnnotationCollectionI. Too bad.")
            unless $value->isa("Bio::AnnotationCollectionI");
        $obj->{'_annotation'} = $value;
    } elsif( ! defined $obj->{'_annotation'}) {
        $obj->{'_annotation'} = Bio::Annotation::Collection->new();
    }
    return $obj->{'_annotation'};
}

=head1 Methods for delegating Bio::AnnotationCollectionI

=head2 get_Annotations()

 Usage   : my @annotations = $seq->get_Annotations('key')
 Function: Retrieves all the Bio::AnnotationI objects for a specific key
           for this object
 Returns : list of Bio::AnnotationI - empty if no objects stored for a key
 Args    : string which is key for annotations

=cut

sub get_Annotations { shift->annotation->get_Annotations(@_); }

=head2 add_Annotation()

 Usage   : $seq->add_Annotation('reference',$object);
           $seq->add_Annotation($object,'Bio::MyInterface::DiseaseI');
           $seq->add_Annotation($object);
           $seq->add_Annotation('disease',$object,'Bio::MyInterface::DiseaseI');
 Function: Adds an annotation for a specific key for this sequence object.

           If the key is omitted, the object to be added must provide a value
           via its tagname().

           If the archetype is provided, this and future objects added under
           that tag have to comply with the archetype and will be rejected
           otherwise.

 Returns : none
 Args    : annotation key ('disease', 'dblink', ...)
           object to store (must be Bio::AnnotationI compliant)
           [optional] object archetype to map future storage of object
           of these types to

=cut

sub add_Annotation { shift->annotation->add_Annotation(@_) }

=head2 remove_Annotations()

 Usage   : $seq->remove_Annotations()
 Function: Remove the annotations for the specified key from this sequence
           object
 Returns : a list of Bio::AnnotationI compliant objects which were stored
           under the given key(s) for this sequence object
 Args    : the key(s) (tag name(s), one or more strings) for which to
           remove annotations (optional; if none given, flushes all
           annotations)

=cut

sub remove_Annotations { shift->annotation->remove_Annotations(@_) }

=head2 get_num_of_annotations()

 Usage   : my $count = $seq->get_num_of_annotations()
 Alias   : num_Annotations
 Function: Returns the count of all annotations stored for this sequence
           object
 Returns : integer
 Args    : none

=cut

sub get_num_of_annotations { shift->annotation->get_num_of_annotations(@_) }
sub num_Annotations { shift->get_num_of_annotations }; #DWYM

=head1 Methods to implement Bio::FeatureHolderI

This includes methods for retrieving, adding, and removing features.

=cut

=head2 get_SeqFeatures

 Title   : get_SeqFeatures
 Usage   :
 Function: Get the feature objects held by this feature holder.

           Features which are not top-level are subfeatures of one or
           more of the returned feature objects, which means that you
           must traverse the subfeature arrays of each top-level
           feature object in order to traverse all features associated
           with this sequence.

           Specific features can be obtained by primary tag, specified in
           the argument.

           Use get_all_SeqFeatures() if you want the feature tree
           flattened into one single array.
 Example : my @feats = $seq->get_SeqFeatures or
           my @genefeats = $seq->get_SeqFeatures('gene')
 Returns : an array of Bio::SeqFeatureI implementing objects
 Args    : [optional] string (feature tag)

=cut

sub get_SeqFeatures{
    my $self = shift;
    my $tag = shift;

    if( !defined $self->{'_as_feat'} ) {
        $self->{'_as_feat'} = [];
    }
    if ($tag) {
        return map { $_->primary_tag eq $tag ? $_ : () } @{$self->{'_as_feat'}};
    }
    else {
        return @{$self->{'_as_feat'}};
    }
}

=head2 get_all_SeqFeatures

 Title   : get_all_SeqFeatures
 Usage   : @feat_ary = $seq->get_all_SeqFeatures();
 Function: Returns the tree of feature objects attached to this
           sequence object flattened into one single array. Top-level
           features will still contain their subfeature-arrays, which
           means that you will encounter subfeatures twice if you
           traverse the subfeature tree of the returned objects.

           Use get_SeqFeatures() if you want the array to contain only
           the top-level features.

 Returns : An array of Bio::SeqFeatureI implementing objects.
 Args    : None

=cut

# this implementation is inherited from FeatureHolderI

=head2 feature_count

 Title   : feature_count
 Usage   : $seq->feature_count()
 Function: Return the number of SeqFeatures attached to a sequence
 Returns : integer representing the number of SeqFeatures
 Args    : None

=cut

sub feature_count {
    my ($self) = @_;
    if (defined($self->{'_as_feat'})) {
        return ($#{$self->{'_as_feat'}} + 1);
    } else {
        return 0;
    }
}

=head2 add_SeqFeature

 Title   : add_SeqFeature
 Usage   : $seq->add_SeqFeature($feat);
 Function: Adds the given feature object to the feature array of this
           sequence. The object passed is required to implement the
           Bio::SeqFeatureI interface.
           The 'EXPAND' qualifier (see L<Bio::FeatureHolderI>) is
           supported, but has no effect.
 Returns : 1 on success
 Args    : A Bio::SeqFeatureI implementing object.

=cut

sub add_SeqFeature {
    my ($self, @feat) = @_;

    $self->{'_as_feat'} = [] unless $self->{'_as_feat'};

    if (scalar @feat > 1) {
        Carp::carp('Providing an array of features to Bio::Seq add_SeqFeature()' .
                   ' is deprecated and will be removed in a future version.' .
                   ' Add a single feature at a time instead.');
    }

    for my $feat ( @feat ) {

        next if $feat eq 'EXPAND'; # Need to support it for FeatureHolderI compliance

        if( !$feat->isa("Bio::SeqFeatureI") ) {
            $self->throw("Expected a Bio::SeqFeatureI object, but got a $feat.");
        }

        # make sure we attach ourselves to the feature if the feature wants it
        my $aseq = $self->primary_seq;
        $feat->attach_seq($aseq) if $aseq;

        push(@{$self->{'_as_feat'}},$feat);
    }
    return 1;
}

=head2 remove_SeqFeatures

 Title   : remove_SeqFeatures
 Usage   : $seq->remove_SeqFeatures();
 Function: Removes all attached SeqFeatureI objects or those with the
           specified primary tag
 Example : my @gene_feats = $seq->remove_SeqFeatures('gene') or
           my @feats = $seq->remove_SeqFeatures()
 Returns : The array of Bio::SeqFeatureI objects removed from the sequence
 Args    : None, or a feature primary tag

=cut

sub remove_SeqFeatures {
    my ( $self, $tag ) = @_;
    return () unless $self->{'_as_feat'};

    if ( $tag ) {
        my @selected_feats   = grep { $_->primary_tag eq $tag } @{ $self->{'_as_feat'} };
        my @unselected_feats = grep { $_->primary_tag ne $tag } @{ $self->{'_as_feat'} };
        $self->{'_as_feat'} = \@unselected_feats;
        return @selected_feats;
    }
    else {
        my @all_feats = @{ $self->{'_as_feat'} };
        $self->{'_as_feat'} = [];
        return @all_feats;
    }
}

=head1 Methods provided in the Bio::PrimarySeqI interface

These methods are inherited from the PrimarySeq interface
and work as one expects, building new Bio::Seq objects
or other information as expected. See L<Bio::PrimarySeq>
for more information.

Sequence Features are B<not> transferred to the new objects.
To reverse complement and include the features use
L<Bio::SeqUtils/revcom_with_features>.

=head2 revcom

 Title   : revcom
 Usage   : $rev = $seq->revcom()
 Function: Produces a new Bio::Seq object which
           is the reversed complement of the sequence. For protein
           sequences this throws an exception of "Sequence is a protein.
           Cannot revcom"

           The id is the same id as the original sequence, and the
           accession number is also identical. If someone wants to
           track that this sequence has been reversed, it needs to
           define its own extensions

           To do an in-place edit of an object you can go:

           $seq = $seq->revcom();

           This of course, causes Perl to handle the garbage collection
           of the old object, but it is roughly speaking as efficient as
           an in-place edit.

 Returns : A new (fresh) Bio::Seq object
 Args    : None

=head2 trunc

 Title   : trunc
 Usage   : $subseq = $myseq->trunc(10,100);
 Function: Provides a truncation of a sequence
 Example :
 Returns : A fresh Seq object
 Args    : Two integers, the first and last base of the sub-sequence

=head2 id

 Title   : id
 Usage   : $id = $seq->id()
 Function: This is mapped on display_id
 Returns : value of display_id()
 Args    : [optional] value to update display_id

=cut

sub id {
    return shift->display_id(@_);
}

=head1 Seq only methods

These methods are specific to the Bio::Seq object, and not
found on the Bio::PrimarySeq object

=head2 primary_seq

 Title   : primary_seq
 Usage   : $seq->primary_seq or $seq->primary_seq($newval)
 Function: Get or set a PrimarySeq object
 Example :
 Returns : PrimarySeq object
 Args    : None or PrimarySeq object

=cut

sub primary_seq {
    my ($obj,$value) = @_;

    if( defined $value) {
        if( ! ref $value || ! $value->isa('Bio::PrimarySeqI') ) {
            $obj->throw("$value is not a Bio::PrimarySeq compliant object");
        }

        $obj->{'primary_seq'} = $value;
        # descend down over all seqfeature objects, seeing whether they
        # want an attached seq.

        foreach my $sf ( $obj->get_SeqFeatures() ) {
            $sf->attach_seq($value);
        }
    }
    return $obj->{'primary_seq'};
}

=head2 species

 Title   : species
 Usage   : $species = $seq->species() or $seq->species($species)
 Function: Gets or sets the species
 Returns : L<Bio::Species> object
 Args    : None or L<Bio::Species> object

See L<Bio::Species> for more information

=cut

sub species {
    my ($self, $species) = @_;
    if ($species) {
        $self->{'species'} = $species;
    } else {
        return $self->{'species'};
    }
}

# Internal methods follow...
# keep AUTOLOAD happy
sub DESTROY { }

############################################################################
# aliases due to name changes or to compensate for our lack of consistency #
############################################################################

# in all other modules we use the object in the singular --
# lack of consistency sucks
*flush_SeqFeature  = \&remove_SeqFeatures;
*flush_SeqFeatures = \&remove_SeqFeatures;

# this is now get_SeqFeatures() (from FeatureHolderI)
*top_SeqFeatures = \&get_SeqFeatures;

# this is now get_all_SeqFeatures() in FeatureHolderI
sub all_SeqFeatures{
    return shift->get_all_SeqFeatures(@_);
}

sub accession {
    my $self = shift;
    $self->warn(ref($self)."::accession is deprecated, ".
                "use accession_number() instead");
    return $self->accession_number(@_);
}

1;

# BioPerl-1.7.8/t/00-compile.t
use 5.006;
use strict;
use warnings;

# this test was generated with Dist::Zilla::Plugin::Test::Compile 2.058

use Test::More;

plan tests => 511 + ($ENV{AUTHOR_TESTING} ?
1 : 0); my @module_files = ( 'Bio/Align/AlignI.pm', 'Bio/Align/DNAStatistics.pm', 'Bio/Align/PairwiseStatistics.pm', 'Bio/Align/ProteinStatistics.pm', 'Bio/Align/StatisticsI.pm', 'Bio/Align/Utilities.pm', 'Bio/AlignIO.pm', 'Bio/AlignIO/Handler/GenericAlignHandler.pm', 'Bio/AlignIO/arp.pm', 'Bio/AlignIO/bl2seq.pm', 'Bio/AlignIO/clustalw.pm', 'Bio/AlignIO/emboss.pm', 'Bio/AlignIO/fasta.pm', 'Bio/AlignIO/largemultifasta.pm', 'Bio/AlignIO/maf.pm', 'Bio/AlignIO/mase.pm', 'Bio/AlignIO/mega.pm', 'Bio/AlignIO/meme.pm', 'Bio/AlignIO/metafasta.pm', 'Bio/AlignIO/msf.pm', 'Bio/AlignIO/nexus.pm', 'Bio/AlignIO/pfam.pm', 'Bio/AlignIO/phylip.pm', 'Bio/AlignIO/po.pm', 'Bio/AlignIO/proda.pm', 'Bio/AlignIO/prodom.pm', 'Bio/AlignIO/psi.pm', 'Bio/AlignIO/selex.pm', 'Bio/AlignIO/xmfa.pm', 'Bio/AnalysisI.pm', 'Bio/AnalysisParserI.pm', 'Bio/AnalysisResultI.pm', 'Bio/AnnotatableI.pm', 'Bio/Annotation/AnnotationFactory.pm', 'Bio/Annotation/Collection.pm', 'Bio/Annotation/Comment.pm', 'Bio/Annotation/DBLink.pm', 'Bio/Annotation/OntologyTerm.pm', 'Bio/Annotation/Reference.pm', 'Bio/Annotation/Relation.pm', 'Bio/Annotation/SimpleValue.pm', 'Bio/Annotation/StructuredValue.pm', 'Bio/Annotation/TagTree.pm', 'Bio/Annotation/Target.pm', 'Bio/Annotation/Tree.pm', 'Bio/Annotation/TypeManager.pm', 'Bio/AnnotationCollectionI.pm', 'Bio/AnnotationI.pm', 'Bio/CodonUsage/IO.pm', 'Bio/CodonUsage/Table.pm', 'Bio/DB/DBFetch.pm', 'Bio/DB/Failover.pm', 'Bio/DB/Fasta.pm', 'Bio/DB/FileCache.pm', 'Bio/DB/Flat.pm', 'Bio/DB/Flat/BDB.pm', 'Bio/DB/Flat/BDB/embl.pm', 'Bio/DB/Flat/BDB/fasta.pm', 'Bio/DB/Flat/BDB/genbank.pm', 'Bio/DB/Flat/BDB/swiss.pm', 'Bio/DB/Flat/BinarySearch.pm', 'Bio/DB/GFF/Util/Binning.pm', 'Bio/DB/GFF/Util/Rearrange.pm', 'Bio/DB/GenericWebAgent.pm', 'Bio/DB/InMemoryCache.pm', 'Bio/DB/IndexedBase.pm', 'Bio/DB/LocationI.pm', 'Bio/DB/Qual.pm', 'Bio/DB/Query/WebQuery.pm', 'Bio/DB/QueryI.pm', 'Bio/DB/RandomAccessI.pm', 'Bio/DB/ReferenceI.pm', 'Bio/DB/Registry.pm', 'Bio/DB/SeqI.pm', 
'Bio/DB/Taxonomy.pm', 'Bio/DB/Taxonomy/flatfile.pm', 'Bio/DB/Taxonomy/greengenes.pm', 'Bio/DB/Taxonomy/list.pm', 'Bio/DB/Taxonomy/silva.pm', 'Bio/DB/UpdateableSeqI.pm', 'Bio/DB/WebDBSeqI.pm', 'Bio/DBLinkContainerI.pm', 'Bio/Das/FeatureTypeI.pm', 'Bio/Das/SegmentI.pm', 'Bio/DasI.pm', 'Bio/DescribableI.pm', 'Bio/Event/EventGeneratorI.pm', 'Bio/Event/EventHandlerI.pm', 'Bio/Factory/AnalysisI.pm', 'Bio/Factory/ApplicationFactoryI.pm', 'Bio/Factory/DriverFactory.pm', 'Bio/Factory/FTLocationFactory.pm', 'Bio/Factory/LocationFactoryI.pm', 'Bio/Factory/ObjectBuilderI.pm', 'Bio/Factory/ObjectFactory.pm', 'Bio/Factory/ObjectFactoryI.pm', 'Bio/Factory/SeqAnalysisParserFactory.pm', 'Bio/Factory/SeqAnalysisParserFactoryI.pm', 'Bio/Factory/SequenceFactoryI.pm', 'Bio/Factory/SequenceProcessorI.pm', 'Bio/Factory/SequenceStreamI.pm', 'Bio/Factory/TreeFactoryI.pm', 'Bio/FeatureHolderI.pm', 'Bio/HandlerBaseI.pm', 'Bio/IdCollectionI.pm', 'Bio/IdentifiableI.pm', 'Bio/Index/Abstract.pm', 'Bio/Index/AbstractSeq.pm', 'Bio/Index/Blast.pm', 'Bio/Index/BlastTable.pm', 'Bio/Index/EMBL.pm', 'Bio/Index/Fasta.pm', 'Bio/Index/Fastq.pm', 'Bio/Index/GenBank.pm', 'Bio/Index/Qual.pm', 'Bio/Index/SwissPfam.pm', 'Bio/Index/Swissprot.pm', 'Bio/LocatableSeq.pm', 'Bio/Location/Atomic.pm', 'Bio/Location/AvWithinCoordPolicy.pm', 'Bio/Location/CoordinatePolicyI.pm', 'Bio/Location/Fuzzy.pm', 'Bio/Location/FuzzyLocationI.pm', 'Bio/Location/NarrowestCoordPolicy.pm', 'Bio/Location/Simple.pm', 'Bio/Location/Split.pm', 'Bio/Location/SplitLocationI.pm', 'Bio/Location/WidestCoordPolicy.pm', 'Bio/LocationI.pm', 'Bio/Matrix/Generic.pm', 'Bio/Matrix/IO.pm', 'Bio/Matrix/IO/mlagan.pm', 'Bio/Matrix/IO/phylip.pm', 'Bio/Matrix/IO/scoring.pm', 'Bio/Matrix/MatrixI.pm', 'Bio/Matrix/Mlagan.pm', 'Bio/Matrix/PSM/IO.pm', 'Bio/Matrix/PSM/IO/mast.pm', 'Bio/Matrix/PSM/IO/masta.pm', 'Bio/Matrix/PSM/IO/meme.pm', 'Bio/Matrix/PSM/IO/psiblast.pm', 'Bio/Matrix/PSM/IO/transfac.pm', 'Bio/Matrix/PSM/InstanceSite.pm', 
'Bio/Matrix/PSM/InstanceSiteI.pm', 'Bio/Matrix/PSM/ProtMatrix.pm', 'Bio/Matrix/PSM/ProtPsm.pm', 'Bio/Matrix/PSM/Psm.pm', 'Bio/Matrix/PSM/PsmHeader.pm', 'Bio/Matrix/PSM/PsmHeaderI.pm', 'Bio/Matrix/PSM/PsmI.pm', 'Bio/Matrix/PSM/SiteMatrix.pm', 'Bio/Matrix/PSM/SiteMatrixI.pm', 'Bio/Matrix/PhylipDist.pm', 'Bio/Matrix/Scoring.pm', 'Bio/Ontology/DocumentRegistry.pm', 'Bio/Ontology/GOterm.pm', 'Bio/Ontology/InterProTerm.pm', 'Bio/Ontology/OBOEngine.pm', 'Bio/Ontology/OBOterm.pm', 'Bio/Ontology/Ontology.pm', 'Bio/Ontology/OntologyEngineI.pm', 'Bio/Ontology/OntologyI.pm', 'Bio/Ontology/OntologyStore.pm', 'Bio/Ontology/Path.pm', 'Bio/Ontology/PathI.pm', 'Bio/Ontology/Relationship.pm', 'Bio/Ontology/RelationshipFactory.pm', 'Bio/Ontology/RelationshipI.pm', 'Bio/Ontology/RelationshipType.pm', 'Bio/Ontology/SimpleGOEngine/GraphAdaptor.pm', 'Bio/Ontology/SimpleOntologyEngine.pm', 'Bio/Ontology/Term.pm', 'Bio/Ontology/TermFactory.pm', 'Bio/Ontology/TermI.pm', 'Bio/OntologyIO.pm', 'Bio/OntologyIO/Handlers/BaseSAXHandler.pm', 'Bio/OntologyIO/Handlers/InterProHandler.pm', 'Bio/OntologyIO/Handlers/InterPro_BioSQL_Handler.pm', 'Bio/OntologyIO/InterProParser.pm', 'Bio/OntologyIO/dagflat.pm', 'Bio/OntologyIO/goflat.pm', 'Bio/OntologyIO/obo.pm', 'Bio/OntologyIO/simplehierarchy.pm', 'Bio/OntologyIO/soflat.pm', 'Bio/ParameterBaseI.pm', 'Bio/PrimarySeq.pm', 'Bio/PrimarySeqI.pm', 'Bio/PullParserI.pm', 'Bio/Range.pm', 'Bio/RangeI.pm', 'Bio/Root/Exception.pm', 'Bio/Root/HTTPget.pm', 'Bio/Root/IO.pm', 'Bio/Root/Root.pm', 'Bio/Root/RootI.pm', 'Bio/Root/Storable.pm', 'Bio/Root/Test.pm', 'Bio/Root/TestObject.pm', 'Bio/Root/Utilities.pm', 'Bio/Root/Version.pm', 'Bio/Search/BlastStatistics.pm', 'Bio/Search/BlastUtils.pm', 'Bio/Search/DatabaseI.pm', 'Bio/Search/GenericDatabase.pm', 'Bio/Search/GenericStatistics.pm', 'Bio/Search/HSP/BlastHSP.pm', 'Bio/Search/HSP/BlastPullHSP.pm', 'Bio/Search/HSP/FastaHSP.pm', 'Bio/Search/HSP/GenericHSP.pm', 'Bio/Search/HSP/HSPFactory.pm', 'Bio/Search/HSP/HSPI.pm', 
'Bio/Search/HSP/ModelHSP.pm', 'Bio/Search/HSP/PSLHSP.pm', 'Bio/Search/HSP/PsiBlastHSP.pm', 'Bio/Search/HSP/PullHSPI.pm', 'Bio/Search/HSP/WABAHSP.pm', 'Bio/Search/Hit/BlastHit.pm', 'Bio/Search/Hit/BlastPullHit.pm', 'Bio/Search/Hit/Fasta.pm', 'Bio/Search/Hit/GenericHit.pm', 'Bio/Search/Hit/HitFactory.pm', 'Bio/Search/Hit/HitI.pm', 'Bio/Search/Hit/ModelHit.pm', 'Bio/Search/Hit/PsiBlastHit.pm', 'Bio/Search/Hit/PullHitI.pm', 'Bio/Search/Iteration/GenericIteration.pm', 'Bio/Search/Iteration/IterationI.pm', 'Bio/Search/Processor.pm', 'Bio/Search/Result/BlastPullResult.pm', 'Bio/Search/Result/BlastResult.pm', 'Bio/Search/Result/CrossMatchResult.pm', 'Bio/Search/Result/GenericResult.pm', 'Bio/Search/Result/INFERNALResult.pm', 'Bio/Search/Result/PullResultI.pm', 'Bio/Search/Result/ResultFactory.pm', 'Bio/Search/Result/ResultI.pm', 'Bio/Search/Result/WABAResult.pm', 'Bio/Search/SearchUtils.pm', 'Bio/Search/StatisticsI.pm', 'Bio/Search/Tiling/MapTileUtils.pm', 'Bio/Search/Tiling/MapTiling.pm', 'Bio/Search/Tiling/TilingI.pm', 'Bio/SearchIO.pm', 'Bio/SearchIO/EventHandlerI.pm', 'Bio/SearchIO/FastHitEventBuilder.pm', 'Bio/SearchIO/IteratedSearchResultEventBuilder.pm', 'Bio/SearchIO/SearchResultEventBuilder.pm', 'Bio/SearchIO/SearchWriterI.pm', 'Bio/SearchIO/Writer/GbrowseGFF.pm', 'Bio/SearchIO/Writer/HSPTableWriter.pm', 'Bio/SearchIO/Writer/HTMLResultWriter.pm', 'Bio/SearchIO/Writer/HitTableWriter.pm', 'Bio/SearchIO/Writer/ResultTableWriter.pm', 'Bio/SearchIO/Writer/TextResultWriter.pm', 'Bio/SearchIO/axt.pm', 'Bio/SearchIO/blast.pm', 'Bio/SearchIO/blast_pull.pm', 'Bio/SearchIO/blasttable.pm', 'Bio/SearchIO/cross_match.pm', 'Bio/SearchIO/erpin.pm', 'Bio/SearchIO/exonerate.pm', 'Bio/SearchIO/fasta.pm', 'Bio/SearchIO/gmap_f9.pm', 'Bio/SearchIO/infernal.pm', 'Bio/SearchIO/megablast.pm', 'Bio/SearchIO/psl.pm', 'Bio/SearchIO/rnamotif.pm', 'Bio/SearchIO/sim4.pm', 'Bio/SearchIO/waba.pm', 'Bio/SearchIO/wise.pm', 'Bio/Seq.pm', 'Bio/Seq/BaseSeqProcessor.pm', 'Bio/Seq/EncodedSeq.pm', 
'Bio/Seq/LargeLocatableSeq.pm', 'Bio/Seq/LargePrimarySeq.pm', 'Bio/Seq/LargeSeq.pm', 'Bio/Seq/LargeSeqI.pm', 'Bio/Seq/Meta.pm', 'Bio/Seq/Meta/Array.pm', 'Bio/Seq/MetaI.pm', 'Bio/Seq/PrimaryQual.pm', 'Bio/Seq/PrimedSeq.pm', 'Bio/Seq/QualI.pm', 'Bio/Seq/Quality.pm', 'Bio/Seq/RichSeq.pm', 'Bio/Seq/RichSeqI.pm', 'Bio/Seq/SeqBuilder.pm', 'Bio/Seq/SeqFactory.pm', 'Bio/Seq/SeqFastaSpeedFactory.pm', 'Bio/Seq/SequenceTrace.pm', 'Bio/Seq/SimulatedRead.pm', 'Bio/Seq/TraceI.pm', 'Bio/SeqAnalysisParserI.pm', 'Bio/SeqFeature/Amplicon.pm', 'Bio/SeqFeature/AnnotationAdaptor.pm', 'Bio/SeqFeature/Collection.pm', 'Bio/SeqFeature/CollectionI.pm', 'Bio/SeqFeature/Computation.pm', 'Bio/SeqFeature/FeaturePair.pm', 'Bio/SeqFeature/Gene/Exon.pm', 'Bio/SeqFeature/Gene/ExonI.pm', 'Bio/SeqFeature/Gene/GeneStructure.pm', 'Bio/SeqFeature/Gene/GeneStructureI.pm', 'Bio/SeqFeature/Gene/Intron.pm', 'Bio/SeqFeature/Gene/NC_Feature.pm', 'Bio/SeqFeature/Gene/Poly_A_site.pm', 'Bio/SeqFeature/Gene/Promoter.pm', 'Bio/SeqFeature/Gene/Transcript.pm', 'Bio/SeqFeature/Gene/TranscriptI.pm', 'Bio/SeqFeature/Gene/UTR.pm', 'Bio/SeqFeature/Generic.pm', 'Bio/SeqFeature/Lite.pm', 'Bio/SeqFeature/PositionProxy.pm', 'Bio/SeqFeature/Primer.pm', 'Bio/SeqFeature/Similarity.pm', 'Bio/SeqFeature/SimilarityPair.pm', 'Bio/SeqFeature/SubSeq.pm', 'Bio/SeqFeature/Tools/FeatureNamer.pm', 'Bio/SeqFeature/Tools/IDHandler.pm', 'Bio/SeqFeature/Tools/TypeMapper.pm', 'Bio/SeqFeature/Tools/Unflattener.pm', 'Bio/SeqFeature/TypedSeqFeatureI.pm', 'Bio/SeqFeatureI.pm', 'Bio/SeqI.pm', 'Bio/SeqIO.pm', 'Bio/SeqIO/FTHelper.pm', 'Bio/SeqIO/Handler/GenericRichSeqHandler.pm', 'Bio/SeqIO/MultiFile.pm', 'Bio/SeqIO/ace.pm', 'Bio/SeqIO/asciitree.pm', 'Bio/SeqIO/bsml.pm', 'Bio/SeqIO/bsml_sax.pm', 'Bio/SeqIO/embl.pm', 'Bio/SeqIO/embldriver.pm', 'Bio/SeqIO/fasta.pm', 'Bio/SeqIO/fastq.pm', 'Bio/SeqIO/game.pm', 'Bio/SeqIO/game/featHandler.pm', 'Bio/SeqIO/game/gameHandler.pm', 'Bio/SeqIO/game/gameSubs.pm', 'Bio/SeqIO/game/gameWriter.pm', 
'Bio/SeqIO/game/seqHandler.pm', 'Bio/SeqIO/gbdriver.pm', 'Bio/SeqIO/gbxml.pm', 'Bio/SeqIO/gcg.pm', 'Bio/SeqIO/genbank.pm', 'Bio/SeqIO/kegg.pm', 'Bio/SeqIO/largefasta.pm', 'Bio/SeqIO/locuslink.pm', 'Bio/SeqIO/mbsout.pm', 'Bio/SeqIO/metafasta.pm', 'Bio/SeqIO/msout.pm', 'Bio/SeqIO/phd.pm', 'Bio/SeqIO/pir.pm', 'Bio/SeqIO/qual.pm', 'Bio/SeqIO/raw.pm', 'Bio/SeqIO/scf.pm', 'Bio/SeqIO/seqxml.pm', 'Bio/SeqIO/swiss.pm', 'Bio/SeqIO/swissdriver.pm', 'Bio/SeqIO/tab.pm', 'Bio/SeqIO/table.pm', 'Bio/SeqIO/tigr.pm', 'Bio/SeqIO/tigrxml.pm', 'Bio/SeqIO/tinyseq.pm', 'Bio/SeqIO/tinyseq/tinyseqHandler.pm', 'Bio/SeqUtils.pm', 'Bio/SimpleAlign.pm', 'Bio/SimpleAnalysisI.pm', 'Bio/Species.pm', 'Bio/Taxon.pm', 'Bio/Tools/Alignment/Consed.pm', 'Bio/Tools/Alignment/Trim.pm', 'Bio/Tools/AmpliconSearch.pm', 'Bio/Tools/Analysis/SimpleAnalysisBase.pm', 'Bio/Tools/AnalysisResult.pm', 'Bio/Tools/Blat.pm', 'Bio/Tools/CodonTable.pm', 'Bio/Tools/Coil.pm', 'Bio/Tools/ECnumber.pm', 'Bio/Tools/EMBOSS/Palindrome.pm', 'Bio/Tools/EPCR.pm', 'Bio/Tools/ESTScan.pm', 'Bio/Tools/Eponine.pm', 'Bio/Tools/Est2Genome.pm', 'Bio/Tools/Fgenesh.pm', 'Bio/Tools/FootPrinter.pm', 'Bio/Tools/GFF.pm', 'Bio/Tools/Geneid.pm', 'Bio/Tools/Genemark.pm', 'Bio/Tools/Genewise.pm', 'Bio/Tools/Genomewise.pm', 'Bio/Tools/Genscan.pm', 'Bio/Tools/Glimmer.pm', 'Bio/Tools/Grail.pm', 'Bio/Tools/GuessSeqFormat.pm', 'Bio/Tools/IUPAC.pm', 'Bio/Tools/Lucy.pm', 'Bio/Tools/MZEF.pm', 'Bio/Tools/Match.pm', 'Bio/Tools/OddCodes.pm', 'Bio/Tools/Phylo/Gerp.pm', 'Bio/Tools/Phylo/Molphy.pm', 'Bio/Tools/Phylo/Molphy/Result.pm', 'Bio/Tools/Phylo/Phylip/ProtDist.pm', 'Bio/Tools/Prediction/Exon.pm', 'Bio/Tools/Prediction/Gene.pm', 'Bio/Tools/Primer/Assessor/Base.pm', 'Bio/Tools/Primer/AssessorI.pm', 'Bio/Tools/Primer/Feature.pm', 'Bio/Tools/Primer/Pair.pm', 'Bio/Tools/Primer3.pm', 'Bio/Tools/Prints.pm', 'Bio/Tools/Profile.pm', 'Bio/Tools/Promoterwise.pm', 'Bio/Tools/PrositeScan.pm', 'Bio/Tools/Pseudowise.pm', 'Bio/Tools/QRNA.pm', 
'Bio/Tools/RandomDistFunctions.pm', 'Bio/Tools/RepeatMasker.pm', 'Bio/Tools/Run/Analysis.pm', 'Bio/Tools/Run/AnalysisFactory.pm', 'Bio/Tools/Run/GenericParameters.pm', 'Bio/Tools/Run/ParametersI.pm', 'Bio/Tools/Run/Phylo/PhyloBase.pm', 'Bio/Tools/Run/WrapperBase.pm', 'Bio/Tools/Run/WrapperBase/CommandExts.pm', 'Bio/Tools/Seg.pm', 'Bio/Tools/SeqPattern.pm', 'Bio/Tools/SeqPattern/Backtranslate.pm', 'Bio/Tools/SeqStats.pm', 'Bio/Tools/SeqWords.pm', 'Bio/Tools/Sigcleave.pm', 'Bio/Tools/Signalp.pm', 'Bio/Tools/Signalp/ExtendedSignalp.pm', 'Bio/Tools/Sim4/Exon.pm', 'Bio/Tools/Sim4/Results.pm', 'Bio/Tools/Spidey/Exon.pm', 'Bio/Tools/Spidey/Results.pm', 'Bio/Tools/TandemRepeatsFinder.pm', 'Bio/Tools/TargetP.pm', 'Bio/Tools/Tmhmm.pm', 'Bio/Tools/ipcress.pm', 'Bio/Tools/isPcr.pm', 'Bio/Tools/pICalculator.pm', 'Bio/Tools/tRNAscanSE.pm', 'Bio/Tree/AnnotatableNode.pm', 'Bio/Tree/Compatible.pm', 'Bio/Tree/DistanceFactory.pm', 'Bio/Tree/Node.pm', 'Bio/Tree/NodeI.pm', 'Bio/Tree/NodeNHX.pm', 'Bio/Tree/RandomFactory.pm', 'Bio/Tree/Statistics.pm', 'Bio/Tree/Tree.pm', 'Bio/Tree/TreeFunctionsI.pm', 'Bio/Tree/TreeI.pm', 'Bio/TreeIO.pm', 'Bio/TreeIO/NewickParser.pm', 'Bio/TreeIO/TreeEventBuilder.pm', 'Bio/TreeIO/cluster.pm', 'Bio/TreeIO/lintree.pm', 'Bio/TreeIO/newick.pm', 'Bio/TreeIO/nexus.pm', 'Bio/TreeIO/nhx.pm', 'Bio/TreeIO/pag.pm', 'Bio/TreeIO/phyloxml.pm', 'Bio/TreeIO/tabtree.pm', 'Bio/UpdateableSeqI.pm', 'Bio/WebAgent.pm', 'BioPerl.pm' ); my @scripts = ( 'bin/bp_aacomp', 'bin/bp_bioflat_index', 'bin/bp_biogetseq', 'bin/bp_dbsplit', 'bin/bp_extract_feature_seq', 'bin/bp_fastam9_to_table', 'bin/bp_fetch', 'bin/bp_filter_search', 'bin/bp_find-blast-matches', 'bin/bp_gccalc', 'bin/bp_genbank2gff3', 'bin/bp_index', 'bin/bp_local_taxonomydb_query', 'bin/bp_make_mrna_protein', 'bin/bp_mask_by_search', 'bin/bp_mrtrans', 'bin/bp_mutate', 'bin/bp_nexus2nh', 'bin/bp_nrdb', 'bin/bp_oligo_count', 'bin/bp_process_gadfly', 'bin/bp_process_sgd', 'bin/bp_revtrans-motif', 'bin/bp_search2alnblocks', 
'bin/bp_search2gff', 'bin/bp_search2table', 'bin/bp_search2tribe', 'bin/bp_seq_length', 'bin/bp_seqconvert', 'bin/bp_seqcut', 'bin/bp_seqpart', 'bin/bp_seqret', 'bin/bp_seqretsplit', 'bin/bp_split_seq', 'bin/bp_sreformat', 'bin/bp_taxid4species', 'bin/bp_taxonomy2tree', 'bin/bp_translate_seq', 'bin/bp_tree2pag', 'bin/bp_unflatten_seq' ); # no fake home requested my @switches = ( -d 'blib' ? '-Mblib' : '-Ilib', ); use File::Spec; use IPC::Open3; use IO::Handle; open my $stdin, '<', File::Spec->devnull or die "can't open devnull: $!"; my @warnings; for my $lib (@module_files) { # see L my $stderr = IO::Handle->new; diag('Running: ', join(', ', map { my $str = $_; $str =~ s/'/\\'/g; q{'} . $str . q{'} } $^X, @switches, '-e', "require q[$lib]")) if $ENV{PERL_COMPILE_TEST_DEBUG}; my $pid = open3($stdin, '>&STDERR', $stderr, $^X, @switches, '-e', "require q[$lib]"); binmode $stderr, ':crlf' if $^O eq 'MSWin32'; my @_warnings = <$stderr>; waitpid($pid, 0); is($?, 0, "$lib loaded ok"); shift @_warnings if @_warnings and $_warnings[0] =~ /^Using .*\bblib/ and not eval { +require blib; blib->VERSION('1.01') }; if (@_warnings) { warn @_warnings; push @warnings, @_warnings; } } foreach my $file (@scripts) { SKIP: { open my $fh, '<', $file or warn("Unable to open $file: $!"), next; my $line = <$fh>; close $fh and skip("$file isn't perl", 1) unless $line =~ /^#!\s*(?:\S*perl\S*)((?:\s+-\w*)*)(?:\s*#.*)?$/; @switches = (@switches, split(' ', $1)) if $1; close $fh and skip("$file uses -T; not testable with PERL5LIB", 1) if grep { $_ eq '-T' } @switches and $ENV{PERL5LIB}; my $stderr = IO::Handle->new; diag('Running: ', join(', ', map { my $str = $_; $str =~ s/'/\\'/g; q{'} . $str . 
q{'} } $^X, @switches, '-c', $file)) if $ENV{PERL_COMPILE_TEST_DEBUG}; my $pid = open3($stdin, '>&STDERR', $stderr, $^X, @switches, '-c', $file); binmode $stderr, ':crlf' if $^O eq 'MSWin32'; my @_warnings = <$stderr>; waitpid($pid, 0); is($?, 0, "$file compiled ok"); shift @_warnings if @_warnings and $_warnings[0] =~ /^Using .*\bblib/ and not eval { +require blib; blib->VERSION('1.01') }; # in older perls, -c output is simply the file portion of the path being tested if (@_warnings = grep { !/\bsyntax OK$/ } grep { chomp; $_ ne (File::Spec->splitpath($file))[2] } @_warnings) { warn @_warnings; push @warnings, @_warnings; } } } is(scalar(@warnings), 0, 'no warnings found') or diag 'got warnings: ', ( Test::More->can('explain') ? Test::More::explain(\@warnings) : join("\n", '', @warnings) ) if $ENV{AUTHOR_TESTING}; bp_tree2pag100644000766000024 255714006427353 16116 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_tree2pag - convert Bio::TreeIO parseable format trees to Pagel format =head1 SYNOPSIS bp_tree2pag -if nexus -i file.nexus > file.pag # OR pipe in through STDIN, and use newick format instead cat file.newick | bp_tree2pag -if newick > file.pag # OR specify an output and input bp_tree2pag -o file.pag -i file.newick =head1 DESCRIPTION Convert TreeIO parseable files into Pagel format tree files. Be warned that Pagel format only really supports a single tree per file. Also, Pagel parsing is not yet available in BioPerl. =cut use strict; use warnings; use Bio::TreeIO; use Getopt::Long; my ($iformat,$oformat) = ('newick', 'pag'); my ($outfile,$infile); GetOptions( 'if|informat:s' => \$iformat, 'of|outformat:s' => \$oformat, 'i|in:s' => \$infile, 'o|out:s' => \$outfile, 'h|help' => sub { exec('perldoc', $0); exit(0); }, ); my $in; if( ! 
$infile ) { $in = Bio::TreeIO->new(-format => $iformat, -fh => \*ARGV); } else { $in = Bio::TreeIO->new(-format => $iformat, -file => $infile); } my $out; if( $outfile) { $out = Bio::TreeIO->new(-format => $oformat, -file => ">$outfile"); } else { $out = Bio::TreeIO->new(-format => $oformat); # print to STDOUT instead } while( my $t = $in->next_tree ) { $out->write_tree($t); } bp_nexus2nh100644000766000024 147614006427353 16156 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_nexus2nh - convert nexus format trees (from PAUP* and MrBayes) to New Hampshire =head1 SYNOPSIS bp_nexus2nh file.nexus > file.nh # OR pipe in through STDIN cat file.nexus | bp_nexus2nh > file.nh # OR specify an output bp_nexus2nh -o file.nh file.nexus =head1 DESCRIPTION Convert Nexus Tree files into Newick/New Hampshire format tree files. =cut use strict; use warnings; use Bio::TreeIO; use Getopt::Long; my $outfile; GetOptions('o|out|outfile:s' => \$outfile); my $in = Bio::TreeIO->new(-format => 'nexus', -fh => \*ARGV); my $out; if( $outfile ) { $out= Bio::TreeIO->new(-format => 'newick', -file => ">$outfile"); } else { # write to STDOUT $out= Bio::TreeIO->new(-format => 'newick'); } while( my $t = $in->next_tree ) { $out->write_tree($t); } TreeIO.t100644000766000024 2114114006427353 15715 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tree# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 76); use_ok('Bio::TreeIO'); } my $verbose = test_debug(); ok my $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'newick', -file => test_input_file('cysprot1b.newick')); my $tree = $treeio->next_tree; isa_ok($tree, 'Bio::Tree::TreeI'); my @nodes = $tree->get_nodes; is(@nodes, 6); my ($rat) = $tree->find_node('CATL_RAT'); ok($rat); is($rat->branch_length, '0.12788'); # move the id to the bootstrap is($rat->ancestor->bootstrap($rat->ancestor->id), '95'); $rat->ancestor->id(''); # maybe this can be 
auto-detected, but then can't distinguish # between internal node labels and bootstraps... is($rat->ancestor->bootstrap, '95'); is($rat->ancestor->branch_length, '0.18794'); is($rat->ancestor->id, ''); if ($verbose) { foreach my $node ( $tree->get_root_node()->each_Descendent() ) { print "node: ", $node->to_string(), "\n"; my @ch = $node->each_Descendent(); if( @ch ) { print "\tchildren are: \n"; foreach my $node ( $node->each_Descendent() ) { print "\t\t ", $node->to_string(), "\n"; } } } } my $FILE1 = test_output_file(); $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'newick', -file => ">$FILE1"); $treeio->write_tree($tree); undef $treeio; ok( -s $FILE1 ); $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'newick', -file => test_input_file('LOAD_Ccd1.dnd')); ok($treeio); $tree = $treeio->next_tree; isa_ok($tree,'Bio::Tree::TreeI'); @nodes = $tree->get_nodes; is(@nodes, 52); if( $verbose ) { foreach my $node ( @nodes ) { print "node: ", $node->to_string(), "\n"; my @ch = $node->each_Descendent(); if( @ch ) { print "\tchildren are: \n"; foreach my $node ( $node->each_Descendent() ) { print "\t\t ", $node->to_string(), "\n"; } } } } is($tree->total_branch_length, 7.12148); my $FILE2 = test_output_file(); $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'newick', -file => ">$FILE2"); $treeio->write_tree($tree); undef $treeio; ok(-s $FILE2); $treeio = Bio::TreeIO->new(-verbose => $verbose, -format => 'newick', -file => test_input_file('hs_fugu.newick')); $tree = $treeio->next_tree(); @nodes = $tree->get_nodes(); is(@nodes, 5); # no reliable order for the bottom nodes because they have no branch length my @vals = qw(SINFRUP0000006110); my $saw = 0; foreach my $node ( $tree->get_root_node()->each_Descendent() ) { foreach my $v ( @vals ) { if( defined $node->id && $node->id eq $v ){ $saw = 1; last; } } last if $saw; } is($saw, 1, "Saw $vals[0] as expected"); if( $verbose ) { foreach my $node ( @nodes ) { print "\t", $node->id, "\n" if 
$node->id; } } $treeio = Bio::TreeIO->new(-format => 'newick', -fh => \*DATA); my $treeout = Bio::TreeIO->new(-format => 'tabtree'); my $treeout2 = Bio::TreeIO->new(-format => 'newick'); $tree = $treeio->next_tree; if( $verbose > 0 ) { $treeout->write_tree($tree); $treeout2->write_tree($tree); } $treeio = Bio::TreeIO->new(-verbose => $verbose, -file => test_input_file('test.nhx')); ok($treeio); $tree = $treeio->next_tree; isa_ok($tree, 'Bio::Tree::TreeI'); @nodes = $tree->get_nodes; is(@nodes, 12, "Total Nodes"); my $adhy = $tree->find_node('ADHY'); is($adhy->branch_length, 0.1); is(($adhy->get_tag_values('S'))[0], 'nematode'); is(($adhy->get_tag_values('E'))[0], '1.1.1.1'); # try lintree parsing $treeio = Bio::TreeIO->new(-format => 'lintree', -file => test_input_file('crab.njb')); my (@leaves, $node); while( $tree = $treeio->next_tree ) { isa_ok($tree, 'Bio::Tree::TreeI'); @nodes = $tree->get_nodes; @leaves = $tree->get_leaf_nodes; is(@leaves, 13); #/maj is(@nodes, 25); is(@nodes, 24); # this is clear from the datafile and counting \maj ($node) = $tree->find_node(-id => '18'); ok($node); is($node->id, '18'); is($node->branch_length, '0.030579'); is($node->bootstrap, 998); } $treeio = Bio::TreeIO->new(-format => 'lintree', -file => test_input_file('crab.nj')); $tree = $treeio->next_tree; isa_ok($tree, 'Bio::Tree::TreeI'); @nodes = $tree->get_nodes; @leaves = $tree->get_leaf_nodes; is(@leaves, 13); #/maj is(@nodes, 25); is(@nodes, 24); #/maj ($node) = $tree->find_node('18'); is($node->id, '18'); is($node->branch_length, '0.028117'); ($node) = $tree->find_node(-id => 'C-vittat'); is($node->id, 'C-vittat'); is($node->branch_length, '0.087619'); is($node->ancestor->id, '14'); $treeio = Bio::TreeIO->new(-format => 'lintree', -file => test_input_file('crab.dat.cn')); $tree = $treeio->next_tree; isa_ok($tree, 'Bio::Tree::TreeI'); @nodes = $tree->get_nodes; @leaves = $tree->get_leaf_nodes; is(@leaves, 13, "Leaf nodes"); #/maj is(@nodes, 25, "All nodes"); is(@nodes, 24, 
"All nodes"); ($node) = $tree->find_node('18'); is($node->id, '18'); is($node->branch_length, '0.029044'); ($node) = $tree->find_node(-id => 'C-vittat'); is($node->id, 'C-vittat'); is($node->branch_length, '0.097855'); is($node->ancestor->id, '14'); SKIP: { test_skip(-tests => 8, -requires_module => 'IO::String'); # test nexus tree parsing $treeio = Bio::TreeIO->new(-format => 'nexus', -verbose => $verbose, -file => test_input_file('urease.tre.nexus')); $tree = $treeio->next_tree; ok($tree); is($tree->id, 'PAUP_1'); is($tree->get_leaf_nodes, 6); ($node) = $tree->find_node(-id => 'Spombe'); is($node->branch_length,0.221404); # test nexus MrBayes tree parsing $treeio = Bio::TreeIO->new(-format => 'nexus', -file => test_input_file('adh.mb_tree.nexus')); $tree = $treeio->next_tree; my $ct = 1; ok($tree); is($tree->id, 'rep.1'); is($tree->get_leaf_nodes, 54); ($node) = $tree->find_node(-id => 'd.madeirensis'); is($node->branch_length,0.039223); while ($tree = $treeio->next_tree) { $ct++; } is($ct,13,'bug 2356'); } # bug #1854 # process no-newlined tree $treeio = Bio::TreeIO->new(-format => 'nexus', -verbose => $verbose, -file => test_input_file('tree_nonewline.nexus')); $tree = $treeio->next_tree; ok($tree); ok($tree->find_node('TRXHomo')); # parse trees with scores $treeio = Bio::TreeIO->new(-format => 'newick', -file => test_input_file('puzzle.tre')); $tree = $treeio->next_tree; ok($tree); is($tree->score, '-2673.059726'); # bug #2205 # process trees with node IDs containing spaces $treeio = Bio::TreeIO->new(-format => 'nexus', -verbose => $verbose, -file => test_input_file('spaces.nex')); $tree = $treeio->next_tree; my @nodeids = ("'Allium drummondii'", "'Allium cernuum'",'A.cyaneum'); ok($tree); for my $node ($tree->get_leaf_nodes) { is($node->id, shift @nodeids); } # bug #2221 # process tree with names containing quoted commas $tree = $treeio->next_tree; @nodeids = ("'Allium drummondii, USA'", "'Allium drummondii, Russia'",'A.cyaneum'); ok($tree); for my $node 
($tree->get_leaf_nodes) { is($node->id, shift @nodeids); } # bug #2221 # process tree with names containing quoted commas on one line $tree = $treeio->next_tree; @nodeids = ("'Allium drummondii, Russia'", "'Allium drummondii, USA'",'A.cyaneum'); ok($tree); for my $node ($tree->get_leaf_nodes) { is($node->id, shift @nodeids); } # bug #2869 # # proper way (Tree isn't GC'd) $tree = Bio::TreeIO->new(-format => 'newick', -verbose => $verbose, -file => test_input_file('bug2869.tree'))->next_tree; my $root = $tree->get_root_node; isa_ok($root, 'Bio::Tree::NodeI'); my $total1 = 0; for my $child ($root->get_Descendents) { $total1++; } is($total1, 198); undef $tree; # GC $root = Bio::TreeIO->new(-format => 'newick', -verbose => $verbose, -file => test_input_file('bug2869.tree'), -no_cleanup => 1)->next_tree->get_root_node; isa_ok($root, 'Bio::Tree::NodeI'); TODO: { local $TODO = 'The nodes are garbage-collected away b/c Tree isn\'t retained in memory'; my $total2 = 0; for my $child ($root->get_Descendents) { $total2++; } is($total2, $total1); } __DATA__ (((A:1,B:1):1,(C:1,D:1):1):1,((E:1,F:1):1,(G:1,H:1):1):1); Match.t100644000766000024 265114006427353 16010 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id: Match.t,v 1.15 2007/06/27 10:16:38 sendu Exp $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 38, -requires_module => 'URI::Escape'); use_ok('Bio::Tools::Match'); } ok my $parser = Bio::Tools::Match->new(-file => test_input_file('match.output')); my $count = 0; my @expected = ([qw(4338 4349 1.000 0.945 V$MYOD_01)], [qw(7390 7401 1.000 0.932 V$MYOD_01)], [qw(8503 8514 1.000 0.941 V$MYOD_01)], [qw(8767 8778 1.000 0.937 V$MYOD_01)], [qw(33 47 0.693 0.779 V$E47_01)]); while (my $feat = $parser->next_result) { $count++; my @exp = @{shift(@expected)}; isa_ok $feat, 'Bio::SeqFeature::Generic'; is $feat->source_tag, 'transfac_match', 'correct source'; is $feat->start, shift(@exp), 'feature start correct'; is 
$feat->end, shift(@exp), 'feature end correct'; my $core_score = $feat->score; my $matrix_score = ($feat->annotation->get_Annotations('matrix_score'))[0]->value; my $matrix_id = ($feat->annotation->get_Annotations('matrix_id'))[0]->value; is $core_score, shift(@exp), 'feature core score correct'; is $matrix_score, shift(@exp), 'feature matrix score correct'; is $matrix_id, shift(@exp), 'feature matrix id correct'; last if $count == 5; } is $count, 5, "correct number of results managed to get tested"; IUPAC.t100644000766000024 350514006427353 15614 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 46); use_ok('Bio::Tools::IUPAC'); use_ok('Bio::Seq'); use_ok('Bio::PrimarySeq'); } # IUPAC sequences and regular expressions my $ambiseq = Bio::Seq->new( -seq => 'ARTCGTTGN', -alphabet => 'dna', ); my $ambiprimaryseq = Bio::Seq->new( -seq => 'ARTCGTTGN', -alphabet => 'dna', ); ok my $iupac = Bio::Tools::IUPAC->new( -seq => $ambiprimaryseq ); ok $iupac = Bio::Tools::IUPAC->new( -seq => $ambiseq ); ok my $regexp = $iupac->regexp, 'Regexp'; is $regexp, 'A[AGR]TCGTTG[ACGTBDHKMNRSVWY]'; $regexp = $iupac->regexp(1); is $regexp, 'A[AGR][TU]CG[TU][TU]G[ACGTUBDHKMNRSVWY]', 'Regexp'; is $iupac->count(), 8, 'Count'; my @seqs; while (my $uniqueseq = $iupac->next_seq()) { push @seqs, $uniqueseq->seq; is $uniqueseq->isa('Bio::PrimarySeqI'), 1; like $uniqueseq->seq, qr/$regexp/i; } @seqs = sort @seqs; is_deeply \@seqs, [ 'AATCGTTGA', 'AATCGTTGC', 'AATCGTTGG', 'AATCGTTGT', 'AGTCGTTGA', 'AGTCGTTGC', 'AGTCGTTGG', 'AGTCGTTGT' ]; like $ambiseq->seq, qr/$regexp/i, 'Regexp matches ambiguous sequences'; like 'ARTCGTTGW', qr/$regexp/i; # IUPAC code methods my %iupac; ok %iupac = $iupac->iupac_iub(), 'Nucleic IUPAC'; ok exists $iupac{'A'}; ok not exists $iupac{'Z'}; ok %iupac = $iupac->iupac_iub_amb(); ok exists $iupac{'N'}; ok not exists $iupac{'A'}; ok %iupac = 
$iupac->iupac_rev_iub(); ok %iupac = $iupac->iupac_iup(), 'Proteic IUPAC'; ok exists $iupac{'A'}; ok exists $iupac{'Z'}; ok %iupac = $iupac->iupac_iup_amb(); ok exists $iupac{'B'}; ok not exists $iupac{'A'}; ok %iupac = $iupac->iupac(); ok not(exists $iupac{'Z'}); ok %iupac = $iupac->iupac_amb(); ok not(exists $iupac{'A'}); ok %iupac = Bio::Tools::IUPAC->new->iupac_iup; Tmhmm.t100644000766000024 130314006427353 16027 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 12); use_ok('Bio::Tools::Tmhmm'); } my $infile = test_input_file('tmhmm.out'); ok my $parser = Bio::Tools::Tmhmm->new(-file=>$infile), 'new()'; my @feat; while ( my $feat = $parser->next_result ) { push @feat, $feat; } is @feat, 3, 'got 3 feat'; is $feat[0]->seq_id, 'my_sequence_id'; is $feat[0]->source_tag, 'TMHMM2.0'; is $feat[0]->primary_tag, 'transmembrane'; my $raa_test_data = [ [ 54, 76], [116, 138], [151, 173], ]; for (0..(scalar(@feat)-1)) { is $feat[$_]->start, $raa_test_data->[$_]->[0]; is $feat[$_]->end, $raa_test_data->[$_]->[1]; } gbxml.t100644000766000024 335714006427353 15751 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIOuse strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 14, -requires_modules => [ qw(XML::SAX) ] ); use_ok('Bio::SeqIO::gbxml'); } my $verbose = test_debug(); my $in = Bio::SeqIO->new(-format => 'gbxml', -verbose => $verbose, # This class can't parse dbEST data yet... 
# -file => test_input_file('roa1.gbxml')); # So let's try a file: -file => test_input_file('EG352462.gbxml')); isa_ok($in, 'Bio::SeqIO'); my $seq = $in->next_seq(); is($seq->molecule, 'mRNA', 'molecule'); is($seq->alphabet, 'dna', 'alphabet'); is($seq->primary_id, 116038450, 'primary_id'); is($seq->display_id, 'EG352462', 'display_id'); is($seq->version, 1, 'version'); is($seq->is_circular, 0, 'is_circular'); is(substr($seq->description, 0, 10), 'SAAH-aad23', 'description'); is(substr($seq->seq, 0, 10), 'aataaaatta', 'sequence'); my @class = $seq->species->classification; is($class[$#class],'Eukaryota', 'classification'); my ($feat) = $seq->get_SeqFeatures; is_deeply([ $feat->get_tag_values('clone_lib') ], [ 'Agen 0058' ], 'feat - clone_lib'); is_deeply([ $feat->get_tag_values('db_xref') ], [ 'taxon:79327' ], 'feat - db_xref'); is_deeply([ $feat->get_tag_values('lab_host') ], [ 'DH10B cells' ], 'feat - lab_host'); fasta.t100644000766000024 1214514006427353 15751 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 22, -requires_modules => [], -requires_networking => 0, ); use_ok('Bio::SeqIO::fasta'); } my $verbose = test_debug(); my $format = 'fasta'; my $seqio_obj = Bio::SeqIO->new(-file => test_input_file("test.$format"), -format => $format); isa_ok($seqio_obj, 'Bio::SeqIO'); my @methods = qw(next_seq write_seq); foreach my $method (@methods) { can_ok($seqio_obj, $method) || diag "$method method not implemented for $format"; } # checking the first sequence object my $seq_obj = $seqio_obj->next_seq(); isa_ok($seq_obj, 'Bio::Seq'); my %expected = ('seq' => 'MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGL' . 'DYRTTDENLKAHEKWGNIVDVVVMKDPRTKRSRGFGFI' . 'TYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPN' . 'AGATVKKLFVGALKDDHDEQSIRDYFQHFGNIVDNIVI' . 'DKETGKKRGFAFVEFDDYDPVDKVVLQKQHQLNGKMVD' . 'VKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGN' . 'QNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGY' . 
'GGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGY' . 'QQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGN' . 'YGNNQGFNNGGNNRRY', 'length' => '358', 'primary_id' => 'roa1_drome', 'description' => qr(Rea guano receptor type III), ); is ($seq_obj->seq(), $expected{'seq'}, 'sequence'); is ($seq_obj->length(), $expected{'length'}, 'length'); is ($seq_obj->primary_id(), $expected{'primary_id'}, 'primary_id'); like ($seq_obj->description(), $expected{'description'}, 'description'); # checking the second sequence object my $seq_obj2 = $seqio_obj->next_seq(); isa_ok($seq_obj2, 'Bio::Seq'); my %expected2 = ('seq' => 'MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGL' . 'DYRTTDENLKAHEKWGNIVDVVVMKDPTSTSTSTSTST' . 'STSTSTMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPN' . 'AGATVKKLFVGALKDDHDEQSIRDYFQHLLLLLLLDLL' . 'LLDLLLLDLLLFVEFDDYDPVDKVVLQKQHQLNGKMVD' . 'VKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGN' . 'QNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGY' . 'GGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGY' . 'QQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGN' . 'YGNNQGFNNGGNNRRY', 'length' => '358', 'primary_id' => 'roa2_drome', 'description' => qr(Rea guano ligand), ); is ($seq_obj2->seq(), $expected2{'seq'}, 'sequence'); is ($seq_obj2->length(), $expected2{'length'}, 'length'); is ($seq_obj2->primary_id(), $expected2{'primary_id'}, 'primary_id'); like ($seq_obj2->description(), $expected2{'description'}, 'description'); # from testformats.pl SKIP: { test_skip(-tests => 4, -requires_modules => [qw(Algorithm::Diff IO::ScalarArray IO::String)]); use_ok('Algorithm::Diff'); eval "use Algorithm::Diff qw(diff LCS);"; use_ok('IO::ScalarArray'); use_ok('IO::String'); my ($file, $type) = ("test.$format", $format); my $filename = test_input_file($file); print "processing file $filename\n" if $verbose; open my $FILE, '<', $filename or die "Could not read file '$filename': $!\n"; my @datain = <$FILE>; close $FILE; my $in = IO::String->new(join('', @datain)); my $seqin = Bio::SeqIO->new( -fh => $in, -format => $type); my $out = IO::String->new; my $seqout = Bio::SeqIO->new( 
-fh => $out, -format => $type); my $seq; while( defined($seq = $seqin->next_seq) ) { $seqout->write_seq($seq); } $seqout->close(); $seqin->close(); my $strref = $out->string_ref; my @dataout = map { $_."\n"} split(/\n/, $$strref ); my @diffs = &diff( \@datain, \@dataout); is(@diffs, 0, "$format format can round-trip"); if(@diffs && $verbose) { foreach my $d ( @diffs ) { foreach my $diff ( @$d ) { chomp($diff->[2]); print $diff->[0], $diff->[1], "\n>", $diff->[2], "\n"; } } print "in is \n", join('', @datain), "\n"; print "out is \n", join('',@dataout), "\n"; } } # bug 1508 # test genbank, gcg, ace against fasta (should throw an exception on each) for my $file (qw(roa1.genbank test.gcg test.ace test.raw)) { my $in = Bio::SeqIO->new(-file => test_input_file($file), -format => 'fasta'); throws_ok {$in->next_seq} qr/The sequence does not appear to be FASTA format/, "dies with $file"; } swiss.t100644000766000024 3705514006427353 16032 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 247, -requires_module => 'Data::Stag'); use_ok('Bio::SeqIO::swiss'); } use Bio::Annotation::SimpleValue; my $verbose = test_debug(); my $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => test_input_file('test.swiss')); isa_ok($seqio, 'Bio::SeqIO'); my $seq = $seqio->next_seq; my @gns = $seq->annotation->get_Annotations('gene_name'); my $outfile = test_output_file(); $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => ">$outfile"); $seqio->write_seq($seq); # reads it in once again $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => $outfile); $seq = $seqio->next_seq; isa_ok($seq->species, 'Bio::Species'); is($seq->species->ncbi_taxid, 6239); # version, seq_update, dates (5 tests) is($seq->version, 40); my ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 35,'operator 
overloading in AnnotationI is deprecated'); my @dates = $seq->get_dates; my @date_check = qw(01-NOV-1997 01-NOV-1997 16-OCT-2001); for my $date (@dates) { my $expdate = shift @date_check; if ($expdate) { is($date, $expdate,'dates'); } else { is($date, $expdate); } } my @gns2 = $seq->annotation->get_Annotations('gene_name'); # check gene name is preserved (was losing suffix in worm gene names) ok($#gns2 == 0 && $gns[0]->value eq $gns2[0]->value); # test swissprot multiple RP lines my $str = Bio::SeqIO->new(-file => test_input_file('P33897')); $seq = $str->next_seq; isa_ok($seq, 'Bio::Seq::RichSeqI'); my @refs = $seq->annotation->get_Annotations('reference'); is( @refs, 23); is($refs[20]->rp, 'VARIANTS X-ALD LEU-98; ASP-99; GLU-217; GLN-518; ASP-608; ILE-633 AND PRO-660, AND VARIANT THR-13.'); # version, seq_update, dates (5 tests) is($seq->version, 44); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 28,'operator overloading in AnnotationI is deprecated'); @dates = $seq->get_dates; @date_check = qw(01-FEB-1994 01-FEB-1994 15-JUN-2004); for my $date (@dates) { is($date, shift @date_check); } my $ast = Bio::SeqIO->new(-verbose => $verbose, -format => 'swiss' , -file => test_input_file('roa1.swiss')); my $as = $ast->next_seq(); ok defined $as->seq; is($as->id, 'ROA1_HUMAN', "id is ".$as->id); like($as->primary_id, qr(Bio::PrimarySeq)); is($as->length, 371); is($as->alphabet, 'protein'); is($as->division, 'HUMAN'); is(scalar $as->all_SeqFeatures(), 16); is(scalar $as->annotation->get_Annotations('reference'), 11); # version, seq_update, dates (6 tests) is($as->version, 35); ($ann) = $as->annotation->get_Annotations('seq_update'); is($ann->display_text, 15,'operator overloading in AnnotationI is deprecated'); @dates = $as->get_dates; @date_check = qw(01-MAR-1989 01-AUG-1990 01-NOV-1997); for my $date (@dates) { is($date, shift @date_check); } ($ann) = $as->annotation->get_Annotations('evidence'); is($ann->value,"1: Evidence at protein 
level"); my ($ent,$out) = undef; ($as,$seq) = undef; $seqio = Bio::SeqIO->new(-format => 'swiss' , -verbose => $verbose, -file => test_input_file('swiss.dat')); $seq = $seqio->next_seq; isa_ok($seq, 'Bio::Seq::RichSeqI'); # more tests to verify we are actually parsing correctly like($seq->primary_id, qr(Bio::PrimarySeq)); is($seq->display_id, 'MA32_HUMAN'); is($seq->length, 282); is($seq->division, 'HUMAN'); is($seq->alphabet, 'protein'); my @f = $seq->all_SeqFeatures(); is(@f, 2); is($f[1]->primary_tag, 'CHAIN'); is(($f[1]->get_tag_values('description'))[0], 'COMPLEMENT COMPONENT 1, Q SUBCOMPONENT BINDING PROTEIN'); # version, seq_update, dates (5 tests) is($seq->version, 40); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 31,'operator overloading in AnnotationI is deprecated'); @dates = $seq->get_dates; @date_check = qw(01-FEB-1995 01-FEB-1995 01-OCT-2000); for my $date (@dates) { is($date, shift @date_check); } my @genenames = qw(GC1QBP HABP1 SF2P32 C1QBP); ($ann) = $seq->annotation->get_Annotations('gene_name'); # use Data::Stag findval and element name to get values/nodes foreach my $gn ( $ann->findval('Name') ) { ok ($gn, shift(@genenames)); } foreach my $gn ( $ann->findval('Synonyms') ) { ok ($gn, shift(@genenames)); } like($ann->value, qr/Name: GC1QBP/); # test for feature locations like ?..N $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); is($seq->display_id, 'ACON_CAEEL'); is($seq->length, 788); is($seq->division, 'CAEEL'); is($seq->alphabet, 'protein'); is(scalar $seq->all_SeqFeatures(), 5); foreach my $gn ( $seq->annotation->get_Annotations('gene_name') ) { ok ($gn->value, 'F54H12.1'); } # test species in swissprot -- this can be a n:n nightmare $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); my @sec_acc = $seq->get_secondary_accessions(); is($sec_acc[0], 'P29360'); is($sec_acc[1], 'Q63631'); 
is($seq->accession_number, 'P42655'); my @kw = $seq->get_keywords; is( $kw[0], 'Brain'); is( $kw[1], 'Neurone'); is($kw[3], 'Multigene family'); is($seq->display_id, '143E_HUMAN'); is($seq->species->binomial, "Homo sapiens"); is($seq->species->common_name, "Human"); is($seq->species->ncbi_taxid, 9606); $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); is($seq->species->binomial, "Bos taurus"); is($seq->species->common_name, "Bovine"); is($seq->species->ncbi_taxid, 9913); # multiple genes in swissprot $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); ($ann) = $seq->annotation->get_Annotations("gene_name"); @genenames = qw(CALM1 CAM1 CALM CAM CALM2 CAM2 CAMB CALM3 CAM3 CAMC); my $flatnames = "(CALM1 OR CAM1 OR CALM OR CAM) AND (CALM2 OR CAM2 OR CAMB) AND (CALM3 OR CAM3 OR CAMC)"; my @names = @genenames; # copy array my @ann_names = $ann->get_all_values(); is(scalar(@ann_names), scalar(@names)); # do this in a layered way (nested tags) for my $node ($ann->findnode('gene_name')) { for my $name ($node->findval('Name')) { is($name, shift(@names)); } for my $name ($node->findval('Synonyms')) { is($name, shift(@names)); } } is(scalar(@names),0); # same entry as before, but with the new gene names format $seqio = Bio::SeqIO->new(-format => 'swiss', -verbose => $verbose, -file => test_input_file('calm.swiss')); $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); ($ann) = $seq->annotation->get_Annotations("gene_name"); @names = @genenames; # copy array my @ann_names2 = $ann->get_all_values(); #emulate StructuredValue's flattened array is(scalar(@ann_names2), scalar(@names)); for my $node ($ann->findnode('gene_name')) { for my $name ($node->findval('Name')) { is($name, shift(@names)); } for my $name ($node->findval('Synonyms')) { is($name, shift(@names)); } } is(scalar(@names),0); # test proper parsing of 
references my @litrefs = $seq->annotation->get_Annotations('reference'); is(scalar(@litrefs), 17); my @titles = ( '"Complete amino acid sequence of human brain calmodulin."', '"Multiple divergent mRNAs code for a single human calmodulin."', '"Molecular analysis of human and rat calmodulin complementary DNA clones. Evidence for additional active genes in these species."', '"Isolation and nucleotide sequence of a cDNA encoding human calmodulin."', '"Structure of the human CALM1 calmodulin gene and identification of two CALM1-related pseudogenes CALM1P1 and CALM1P2."', undef, '"Characterization of the human CALM2 calmodulin gene and comparison of the transcriptional activity of CALM1, CALM2 and CALM3."', '"Cloning of human full-length CDSs in BD Creator(TM) system donor vector."', '"The DNA sequence and analysis of human chromosome 14."', '"Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences."', '"Alpha-helix nucleation by a calcium-binding peptide loop."', '"Solution structure of Ca(2+)-calmodulin reveals flexible hand-like properties of its domains."', '"Calmodulin structure refined at 1.7 A resolution."', '"Drug binding by calmodulin: crystal structure of a calmodulin-trifluoperazine complex."', '"Structural basis for the activation of anthrax adenylyl cyclase exotoxin by calmodulin."', '"Physiological calcium concentrations regulate calmodulin binding and catalysis of adenylyl cyclase exotoxins."', '"Crystal structure of a MARCKS peptide containing the calmodulin-binding domain in complex with Ca2+-calmodulin."', ); my @locs = ( "Biochemistry 21:2565-2569(1982).", "J. Biol. Chem. 263:17055-17062(1988).", "J. Biol. Chem. 262:16663-16670(1987).", "Biochem. Int. 9:177-185(1984).", "Eur. J. Biochem. 225:71-82(1994).", "Submitted (FEB-1995) to the EMBL/GenBank/DDBJ databases.", "Cell Calcium 23:323-338(1998).", "Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases.", "Nature 421:601-607(2003).", "Proc. Natl. Acad. Sci. 
U.S.A. 99:16899-16903(2002).", "Proc. Natl. Acad. Sci. U.S.A. 96:903-908(1999).", "Nat. Struct. Biol. 8:990-997(2001).", "J. Mol. Biol. 228:1177-1192(1992).", "Biochemistry 33:15259-15265(1994).", "Nature 415:396-402(2002).", "EMBO J. 21:6721-6732(2002).", "Nat. Struct. Biol. 10:226-231(2003).", ); my @positions = ( undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, 94, 103, 1, 76, undef, undef, undef, undef, 5, 148, 1, 148, undef, undef, ); foreach my $litref (@litrefs) { is($litref->title, shift(@titles)); is($litref->location, shift(@locs)); is($litref->start, shift(@positions)); is($litref->end, shift(@positions)); } # format parsing changes (pre-rel 9.0) $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => test_input_file('pre_rel9.swiss')); ok($seqio); $seq = $seqio->next_seq; isa_ok($seq->species, 'Bio::Species'); is($seq->species->ncbi_taxid, "6239"); # version, seq_update, dates (5 tests) is($seq->version, 44); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 1,'operator overloading in AnnotationI is deprecated'); @dates = $seq->get_dates; @date_check = qw(01-NOV-1997 01-NOV-1996 30-MAY-2006 ); for my $date (@dates) { is($date, shift @date_check); } my @idcheck = qw(Z66513 T22647 Cel.30446 Q06319 Q20772 F54D5.7 WBGene00010052 F54D5.7 GO:0005515 IPR006089 IPR006091 IPR006090 IPR006092 IPR009075 IPR009100 IPR013764 PF00441 PF02770 PF02771 PS00072 PS00073); for my $dblink ( $seq->annotation->get_Annotations('dblink') ) { is($dblink->primary_id, shift @idcheck); } $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => test_input_file('pre_rel9.swiss')); my @namespaces = qw(Swiss-Prot TrEMBL TrEMBL); while (my $seq = $seqio->next_seq) { is($seq->namespace, shift @namespaces); } # format parsing changes (rel 9.0, Oct 2006) $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => 
test_input_file('rel9.swiss')); ok($seqio); $seq = $seqio->next_seq; isa_ok($seq->species, 'Bio::Species'); is($seq->species->ncbi_taxid, 6239); is($seq->version, 47); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 1,'operator overloading in AnnotationI is deprecated'); @dates = $seq->get_dates; @date_check = qw(01-NOV-1997 01-NOV-1996 31-OCT-2006 ); for my $date (@dates) { is($date, shift @date_check); } @idcheck = qw(Z66513 T22647 Cel.30446 Q06319 Q20772 F54D5.7 cel:F54D5.7 WBGene00010052 F54D5.7 GO:0005515 IPR006089 IPR006091 IPR006090 IPR006092 IPR009075 IPR013786 IPR009100 IPR013764 PF00441 PF02770 PF02771 PS00072 PS00073 ); for my $dblink ( $seq->annotation->get_Annotations('dblink') ) { is($dblink->primary_id, shift @idcheck); } $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => test_input_file('rel9.swiss')); @namespaces = qw(Swiss-Prot TrEMBL TrEMBL); while (my $seq = $seqio->next_seq) { is($seq->namespace, shift @namespaces); } # bug 2288 # Q8GBD3.swiss $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => test_input_file('Q8GBD3.swiss')); while (my $seq = $seqio->next_seq) { my $lineage = join(';', $seq->species->classification); is ($lineage, 'Acetobacter aceti;Acetobacter subgen. Acetobacter;'. 'Acetobacter;Acetobacteraceae;Rhodospirillales;Alphaproteobacteria;'. 'Proteobacteria;Bacteria'); } # Test for roundtrippability swiss->fasta->swiss # 1. Swiss -> Fasta $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => test_input_file('test.swiss'), ); my $fasta_output = test_output_file(); my $seqio_out = Bio::SeqIO->new( -verbose => $verbose, -format => 'fasta', -file => ">$fasta_output", ); my $seq_first = $seqio->next_seq(); $seqio_out->write_seq( $seq_first ); # 2. 
Fasta -> Swiss my $swiss_output = test_output_file(); $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'fasta', -file => $fasta_output, ); $seqio_out = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => ">$swiss_output", ); my $seq_second = $seqio->next_seq(); is( $seq_second->id, $seq_first->id, 'Converting to fasta seqids match'); is( $seq_second->seq, $seq_first->seq, 'Converting to fasta sequences match'); $seqio_out->write_seq( $seq_second ); # 3. Check that we can open and read the resulting swiss-prot file $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => $swiss_output, ); my $seq_third; SKIP: { skip "Can't parse generated swissprot file", 1 unless lives_ok( sub {$seq_third = $seqio->next_seq()}, 'Can parse generated swiss' ); is( $seq_third->id, $seq_first->id, 'Roundtrip, seqids match'); is( $seq_third->seq, $seq_first->seq, 'Roundtrip, sequences match'); }; # bug 3153 # the default type for gene_name is Bio::Annotation::TagTree, but we need to # allow Bio::Annotation::SimpleValue as well for output (even though we will not # support parsing it) $seqio = Bio::SeqIO->new(-format => 'swiss', -file => test_input_file('test.swiss')); $seq = $seqio->next_seq; $seq->annotation->remove_Annotations('gene_name'); $seq->add_Annotation('gene_name', Bio::Annotation::SimpleValue->new(-name => 'foo', -value => 'bar')); $outfile = test_output_file(); my $seqout = Bio::SeqIO->new(-format => 'swiss', -file => ">$outfile"); lives_ok {$seqout->write_seq($seq)}; $seqout->close; open my $swissfh, '<', $outfile or die "Could not read file '$outfile': $!\n"; my $seen_gn; while (<$swissfh>) { if (/^GN\s+(\S+)/) { $seen_gn = $1; last } } close $swissfh; is $seen_gn, 'bar'; SeqIO.t100644000766000024 1227614006427353 15640 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 58); use_ok 'Bio::SeqIO'; } my $verbose = 
test_debug();

my @formats = qw(gcg fasta raw pir tab ace );
# The following files or formats are failing: swiss genbank interpro embl

for my $format (@formats) {
    print "======== $format ========\n" if $verbose;
    my $seq;
    my $str = Bio::SeqIO->new(
        -file => test_input_file("test.$format"),
        -format => $format
    );
    is $str->format(), $format;
    ok $seq = $str->next_seq();
    print "Sequence 1 of 2 from $format stream:\n", $seq->seq, "\n\n" if $verbose;
    unless ($format eq 'raw') {
        is $seq->id, 'roa1_drome',"ID for format $format";
        is $seq->length, 358;
    }

    unless ($format eq 'gcg') { # GCG file can contain only one sequence
        ok $seq = $str->next_seq();
        print "Sequence 2 of 2 from $format stream:\n", $seq->seq, $seq->seq, "\n" if $verbose;
    }

    my $outfile = test_output_file();
    my $out = Bio::SeqIO->new(
        -file => ">$outfile",
        -format => $format
    );
    ok $out->write_seq($seq);
    if ($format eq 'fasta') {
        my $id_type;
        ok($id_type = $out->preferred_id_type('accession.version'),
            'accession.version');
    }

    ok -s $outfile;
}

# from testformats.pl
SKIP: {
    test_skip(-tests => 6, -requires_modules => [qw(Algorithm::Diff
                                                    IO::ScalarArray
                                                    IO::String)]);
    use_ok 'Algorithm::Diff';
    eval "use Algorithm::Diff qw(diff LCS);";
    use_ok 'IO::ScalarArray';
    use_ok 'IO::String';

    my %files = (
        #'test.embl' => 'embl',
        #'test.ace' => 'ace',
        'test.fasta' => 'fasta',
        #'test.game' => 'game',
        'test.gcg' => 'gcg',
        #'test.genbank' => 'genbank',
        'test.raw' => 'raw',
        #'test_badlf.gcg' => 'gcg'
    );

    while( my ($file, $type) = each %files ) {
        my $filename = test_input_file($file);
        print "processing file $filename\n" if $verbose;
        open my $FILE, '<', $filename or die "Could not read file '$filename': $!\n";
        my @datain = <$FILE>;
        close $FILE;

        my $in = IO::String->new( join('', @datain) );
        my $seqin = Bio::SeqIO->new( -fh => $in, -format => $type );
        my $out = IO::String->new();
        my $seqout = Bio::SeqIO->new( -fh => $out, -format => $type );
        my $seq;
        while( defined($seq = $seqin->next_seq) ) {
            $seqout->write_seq($seq);
        }
        $seqout->close();
        $seqin->close();
        my $strref = $out->string_ref;
        my @dataout = map { $_."\n"} split(/\n/, $$strref );
        my @diffs = &diff( \@datain, \@dataout);
        is @diffs, 0;

        if(@diffs && $verbose) {
            foreach my $d ( @diffs ) {
                foreach my $diff ( @$d ) {
                    chomp($diff->[2]);
                    print $diff->[0], $diff->[1], "\n>", $diff->[2], "\n";
                }
            }
            print "in is \n", join('', @datain), "\n";
            print "out is \n", join('',@dataout), "\n";
        }
    }
}

# simple tests specific to Bio::SeqIO interface (applicable to all SeqIO
# modules)

##############################################
# test format() and variant() in Bio::Root::IO
##############################################

my $in = Bio::SeqIO->new(
    -file => test_input_file('bug2901.fa'),
    -format => "fasta",
);
is $in->format, 'fasta';
is $in->variant, undef;

$in = Bio::SeqIO->new(
    -file => test_input_file('fastq', 'illumina_faked.fastq'),
    -format => "fastq",
    -variant => 'illumina',
);
is $in->format, 'fastq';
is $in->variant, 'illumina';

######################################################
# test format detection from different inputs
######################################################

$in = Bio::SeqIO->new( -file => test_input_file('test.fastq') );
is $in->format, 'fastq';

open my $fh, '<', test_input_file('test.genbank') or die "Could not read file 'test.genbank': $!\n";
$in = Bio::SeqIO->new( -fh => $fh );
is $in->format, 'genbank';
close $fh;

my $string = ">seq\nACGATCG\n";
$in = Bio::SeqIO->new( -string => $string );
is $in->format, 'fasta';

############ EXCEPTION HANDLING ############

TODO: {
    local $TODO = 'file/fh-based tests should be in Bio::Root::IO, see issue #3204';
    throws_ok {
        Bio::SeqIO->new();
    } qr/No file, fh, or string argument provided/, 'Must pass a file or file handle';
}

throws_ok {
    Bio::SeqIO->new(-fh => undef);
} qr/fh argument provided, but with an undefined value/,
    'Must pass a file or file handle';

throws_ok {
    Bio::SeqIO->new(-file => undef);
} qr/file argument provided, but with an undefined value/,
    'Must pass a file
or file handle'; throws_ok { Bio::SeqIO->new(-file => 'foo.bar'); } qr/Could not read file 'foo.bar':/, 'Must pass a real file'; msout.t100644000766000024 5065414006427353 16031 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO#!/usr/bin/perl use version; our $API_VERSION = $Bio::SeqIO::msout::API_VERSION; use strict; use File::Path qw(mkpath rmtree); BEGIN { use Bio::Root::Test; test_begin( -tests => 165, -requires_modules => [q(Bio::SeqIO::msout)], -requires_networking => 0 ); use_ok('Bio::SeqIO::msout'); } # skip tests if the msout.pm module is too old. my $api_version = $Bio::SeqIO::msout::API_VERSION; cmp_ok( $api_version, '>=', qv('1.1.5'), "Bio::SeqIO::msout is at least api version 1.1.5" ); test_file_1( 0, "msout/msout_infile1" ); # 23 tests test_file_2( 0, "msout/msout_infile2" ); # 22 tests test_file_3( 0, "msout/msout_infile3" ); # 17 tests # tests to run for api versions >= 1.1.6 SKIP: { skip q($Bio::SeqIO::msout::API_VERSION < 1.1.6), 22 unless $api_version >= qv('1.1.6'); test_file_1( 0, q(msout/msout_infile4) ); } # tests to run for api versions >= 1.1.7 SKIP: { skip q($Bio::SeqIO::msout::API_VERSION < 1.1.7), 4 unless $api_version >= qv('1.1.7'); bad_test_file_1( 0, q(msout/bad_msout_infile1) ); # 2 tests bad_test_file_2( 0, q(msout/bad_msout_infile2) ); # 2 tests } # tests to run for api version >= 1.1.8 SKIP: { skip q($Bio::SeqIO::msout::API_VERSION < 1.1.8), 75 unless $api_version >= qv('1.1.8'); test_file_1( 0, "msout/msout_infile1", 100 ); test_file_2( 0, "msout/msout_infile2", 10 ); test_file_1( 0, q(msout/msout_infile4), 100 ); bad_test_file_1( 0, q(msout/bad_msout_infile1), 1000 ); bad_test_file_2( 0, q(msout/bad_msout_infile2), 1000 ); bad_n_sites( 0, q(msout/msout_infile1) ); # 2 tests } sub create_dir { my $dir = shift; $dir = Bio::Root::Test::test_input_file($dir); unless ( -d $dir ) { mkpath($dir); } } sub remove_dir { my $dir = shift; $dir = Bio::Root::Test::test_input_file($dir); if ( -d $dir ) { rmtree($dir); } else { warn 
"Tried to remove $dir, but it does not exist" } } sub test_file_1 { ############################################################################## ## Test file 1 ############################################################################## my $gzip = shift; my $infile = shift; my $n_sites = shift; $infile = Bio::Root::Test::test_input_file($infile); # the files are now part of the git repo and don't have to be printed # print_file1( $infile, $gzip ); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gzip -dc <$file_sequence |"; } my $msout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'msout', -n_sites => $n_sites, ); isa_ok( $msout, 'Bio::SeqIO::msout' ); my $rh_base_conversion_table = $msout->get_base_conversion_table; my %attributes = ( RUNS => 3, SEGSITES => 7, N_SITES => $n_sites, SEEDS => [qw(1 1 1)], MS_INFO_LINE => 'ms 6 3 -s 7 -I 3 3 2 1', TOT_RUN_HAPS => 6, POPS => [qw(3 2 1)], NEXT_RUN_NUM => 1, LAST_READ_HAP_NUM => 0, POSITIONS => [qw(0.01 0.25 0.31 0.35 0.68 0.76 0.85)], CURRENT_RUN_SEGSITES => 7 ); foreach my $attribute ( keys %attributes ) { my $func = lc($attribute); if ( $attribute =~ m/POPS|SEEDS|POSITIONS/ ) { $func = ucfirst($func); } $func = 'get_' . 
$func; my @returns = $msout->$func(); my ( $return, $got ); # If there were more than one return value, then compare references to # arrays instead of scalars unless ( @returns > 1 ) { $got = shift @returns; } else { $got = \@returns } my $expected = $attributes{$attribute}; if ( defined $got && defined $expected ) { is_deeply( $got, $expected, "Get $attribute" ); } else { is_deeply( $got, $expected, "Get $attribute" ) } } # Testing next_hap at beginning of run my @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_seq ); my @data_expected; if ( !defined($n_sites) ) { @data_expected = qw(1111111); } else { @data_expected = qw(1000000000000000000000001000001000100000000000000000000000000000000100000001000000001000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_hap at beginning of run" ); # Testing next_hap after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_seq ); if ( !defined($n_sites) ) { @data_expected = qw(5555555); } else { @data_expected = qw(5000000000000000000000005000005000500000000000000000000000000000000500000005000000005000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_hap after beginning of run" ); # Surprise test! 
testing msout::outgroup my $outgroup = $msout->outgroup; is( $outgroup, 1, "Testing msout::outgroup" ); # Testing next_pop after beginning of pop @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_pop ); if ( !defined($n_sites) ) { @data_expected = qw(4444444); } else { @data_expected = qw(4000000000000000000000004000004000400000000000000000000000000000000400000004000000004000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_pop after beginning of pop" ); # Testing next_pop at beginning of pop @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_pop ); if ( !defined($n_sites) ) { @data_expected = qw(4444444 5555555); } else { @data_expected = qw(4000000000000000000000004000004000400000000000000000000000000000000400000004000000004000000000000000 5000000000000000000000005000005000500000000000000000000000000000000500000005000000005000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_pop at beginning of pop" ); # Testing next_run after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_run ); if ( !defined($n_sites) ) { @data_expected = qw(4444444); } else { @data_expected = qw(4000000000000000000000004000004000400000000000000000000000000000000400000004000000004000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_run after beginning of run" ); # Testing next_pop at beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_pop ); if ( !defined($n_sites) ) { @data_expected = qw(5555555 5555555 5555555); } else { @data_expected = qw(5000000000000000000000005000005000500000000000000000000000000000000500000005000000005000000000000000 5000000000000000000000005000005000500000000000000000000000000000000500000005000000005000000000000000 5000000000000000000000005000005000500000000000000000000000000000000500000005000000005000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_pop 
at beginning of run" ); # Testing next_hap after pop @data_got = $msout->get_next_hap; @data_expected = qw(1010101); is_deeply( \@data_got, \@data_expected, "Get next_hap after pop" ); # Testing next_run after pop and hap @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_run ); if ( !defined($n_sites) ) { @data_expected = qw(1111111 1515151); } else { @data_expected = qw(1000000000000000000000001000001000100000000000000000000000000000000100000001000000001000000000000000 1000000000000000000000005000001000500000000000000000000000000000000100000005000000001000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_run after pop and hap" ); # Testing next_run at beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_run ); if ( !defined($n_sites) ) { @data_expected = qw(1414141 1414141 1515151 1414141 1515151 1515151); } else { @data_expected = qw(1000000000000000000000004000001000400000000000000000000000000000000100000004000000001000000000000000 1000000000000000000000004000001000400000000000000000000000000000000100000004000000001000000000000000 1000000000000000000000005000001000500000000000000000000000000000000100000005000000001000000000000000 1000000000000000000000004000001000400000000000000000000000000000000100000004000000001000000000000000 1000000000000000000000005000001000500000000000000000000000000000000100000005000000001000000000000000 1000000000000000000000005000001000500000000000000000000000000000000100000005000000001000000000000000); } is_deeply( \@data_got, \@data_expected, "Get next_run at beginning of run" ); is( $msout->get_next_run_num, undef, 'have all lines been read?' 
); } sub test_file_2 { ############################################################################## ## Test file 2 ############################################################################## my $gzip = shift; my $infile = shift; my $n_sites = shift; $infile = Bio::Root::Test::test_input_file($infile); # the files are now part of the git repo and don't have to be printed # print_file2( $infile, $gzip ); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gzip -dc <$file_sequence |"; } my $msout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'msout', -n_sites => $n_sites, ); isa_ok( $msout, 'Bio::SeqIO::msout' ); my %attributes = ( RUNS => 3, SEGSITES => 7, N_SITES => $n_sites, SEEDS => [qw(1 1 1)], MS_INFO_LINE => 'ms 6 3', TOT_RUN_HAPS => 6, POPS => 6, NEXT_RUN_NUM => 1, LAST_READ_HAP_NUM => 0, POSITIONS => [qw(0.01 0.25 0.31 0.35 0.68 0.76 0.85)], CURRENT_RUN_SEGSITES => 7 ); foreach my $attribute ( keys %attributes ) { my $func = lc($attribute); if ( $attribute =~ m/POPS|SEEDS|POSITIONS/ ) { $func = ucfirst($func); } $func = 'get_' . 
$func; my @returns = $msout->$func(); my ( $return, $got ); # If there were more than one return value, then compare references to # arrays instead of scalars unless ( @returns > 1 ) { $got = shift @returns; } else { $got = \@returns } my $expected = $attributes{$attribute}; if ( defined $got && defined $expected ) { is_deeply( $got, $expected, "Get $attribute" ); } else { is_deeply( $got, $expected, "Get $attribute" ) } } my $rh_base_conversion_table = $msout->get_base_conversion_table; # Testing next_hap at beginning of run my @data_got = $msout->get_next_hap; my @data_expected = '1111111'; is_deeply( \@data_got, \@data_expected, "Get next_hap at beginning of run" ); # Testing next_hap after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_seq ); if ( !defined($n_sites) ) { @data_expected = '5555555'; } else { @data_expected = '5555055500'; } is_deeply( \@data_got, \@data_expected, "Get next_hap after beginning of run" ); # Surprise test! 
testing msout::outgroup my $outgroup = $msout->outgroup; is( $outgroup, 0, "Testing msout::outgroup" ); # Testing next_pop after beginning of pop @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_pop ); if ( !defined($n_sites) ) { @data_expected = qw(4444444 4444444 5555555 4444444); } else { @data_expected = qw(4444044400 4444044400 5555055500 4444044400); } is_deeply( \@data_got, \@data_expected, "Get next_pop after beginning of pop" ); # Testing next_pop at beginning of pop/run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_pop ); if ( !defined($n_sites) ) { @data_expected = qw(5555555 5555555 5555555 1010101 1111111 1515151); } else { @data_expected = qw(5555055500 5555055500 5555055500 1010010100 1111011100 1515015100); } is_deeply( \@data_got, \@data_expected, "Get next_pop at beginning of pop/run" ); # Testing next_run at beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_run ); if ( !defined($n_sites) ) { @data_expected = qw(1414141 1414141 1515151 1414141 1515151 1515151); } else { @data_expected = qw(1414014100 1414014100 1515015100 1414014100 1515015100 1515015100); } is_deeply( \@data_got, \@data_expected, "Get next_run at beginning of run" ); # Testing next_hap at beginning of run 2 @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_seq ); if ( !defined($n_sites) ) { @data_expected = '1515151'; } else { @data_expected = '1515015100'; } is_deeply( \@data_got, \@data_expected, "Get next_hap at beginning of run 2" ); # Testing next_run after hap @data_got = convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_run ); if ( !defined($n_sites) ) { @data_expected = qw(5050505 5151515 5555555 5454545 5454545); } else { @data_expected = qw(5050050500 5151051500 5555055500 5454054500 5454054500); } is_deeply( \@data_got, \@data_expected, "Get next_run after hap" ); is( $msout->get_next_run_num, 5, 'next run should be 
5.' ); # getting the last hap of the file via next hap # Testing next_run after hap @data_got = $msout->get_next_hap; @data_expected = qw( 5555555 ); is_deeply( \@data_got, \@data_expected, "Get last hap through next_hap" ); } sub test_file_3 { ############################################################################## ## Test file 3 ############################################################################## my $gzip = shift; my $infile = shift; $infile = Bio::Root::Test::test_input_file($infile); # the files are now part of the git repo and don't have to be printed # print_file3( $infile, $gzip ); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gzip -dc <$file_sequence |"; } my $msout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'msout', ); isa_ok( $msout, 'Bio::SeqIO::msout' ); my $rh_base_conversion_table = $msout->get_base_conversion_table; my %attributes = ( RUNS => 1, SEGSITES => 7, SEEDS => [qw(1 1 1)], MS_INFO_LINE => 'ms 3 1', TOT_RUN_HAPS => 3, POPS => 3, NEXT_RUN_NUM => 1, LAST_READ_HAP_NUM => 0, POSITIONS => [qw(0.01 0.25 0.31 0.35 0.68 0.76 0.85)], CURRENT_RUN_SEGSITES => 7 ); foreach my $attribute ( keys %attributes ) { my $func = lc($attribute); if ( $attribute =~ m/POPS|SEEDS|POSITIONS/ ) { $func = ucfirst($func); } $func = 'get_' . 
          $func;
        my @returns = $msout->$func();
        my ( $return, $got );

        # If there was more than one return value, then compare references to
        # arrays instead of scalars
        unless ( @returns > 1 ) {
            $got = shift @returns;
        }
        else { $got = \@returns }

        my $expected = $attributes{$attribute};

        if ( defined $got && defined $expected ) {
            is_deeply( $got, $expected, "Get $attribute" );
        }
        else { is_deeply( $got, $expected, "Get $attribute" ) }
    }

    # Testing next_pop at end of run
    my @data_got =
      convert_bases_to_nums( $rh_base_conversion_table, $msout->get_next_pop );
    my @data_expected = qw(1111111 5555555 4444444);
    is_deeply( \@data_got, \@data_expected, "Get next_pop at end of run" );

    is( $msout->get_next_run_num, undef, 'have all lines been read?' );

    # Testing what happens when we read from empty stream
    @data_got = $msout->get_next_pop;
    @data_expected = ();
    is_deeply( \@data_got, \@data_expected, "Get next_pop at eof" );

    # Testing what happens when we read from empty stream
    @data_got = $msout->get_next_run;
    @data_expected = ();
    is_deeply( \@data_got, \@data_expected, "Get next_run at eof" );

    # Testing what happens when we read from empty stream
    @data_got = $msout->get_next_hap;
    @data_expected = undef;
    is_deeply( \@data_got, \@data_expected, "Get next_hap at eof" );

    # Testing what happens when we read from empty stream
    @data_got = $msout->get_next_seq;
    @data_expected = ();
    is_deeply( \@data_got, \@data_expected, "Get next_seq at eof" );

}

sub print_to_file {
    my ( $ra_in, $out ) = @_;
    open my $OUT, '>', $out or die "\nCould not write outfile '$out': $!\n";
    print $OUT ("@$ra_in");
    close $OUT;
}

sub convert_bases_to_nums {
    my ( $rh_base_conversion_table, @seqs ) = @_;

    my @out_seqstrings;
    foreach my $seq (@seqs) {
        my $seqstring = $seq->seq;
        foreach my $base ( keys %{$rh_base_conversion_table} ) {
            $seqstring =~ s/($base)/$rh_base_conversion_table->{$base}/g;
        }
        push @out_seqstrings, $seqstring;
    }

    return @out_seqstrings;
}

sub bad_test_file_1 {
############################################################################## ## Bad Test file 1 ############################################################################## # This sub tests to see if msout.pm will catch if the msinfo line's # advertized haps are less than are actually in the file my $gzip = shift; my $infile = shift; my $n_sites = shift; $infile = test_input_file($infile); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gunzip -c <$file_sequence |"; } my $msout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'msout', -n_sites => $n_sites, ); isa_ok( $msout, 'Bio::SeqIO::msout' ); throws_ok { $msout->get_next_run } qr/msout file has only 2 hap\(s\), which is less than indicated in msinfo line \( 9 \)/, q(Caught error in bad msout file 1); } sub bad_test_file_2 { ############################################################################## ## Bad Test file 2 ############################################################################## # This sub tests to see if msout.pm will catch if the msinfo line's # advertized haps are more than are actually in the file my $gzip = shift; my $infile = shift; my $n_sites = shift; $infile = test_input_file($infile); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gunzip -c <$file_sequence |"; } my $msout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'msout', -n_sites => $n_sites, ); isa_ok( $msout, 'Bio::SeqIO::msout' ); throws_ok { $msout->get_next_run } qr/\'\/\/\' not encountered when expected. 
There are more haplos in one of the msOUT runs than advertised in the msinfo line/,
      q(Caught error in bad msout file 2);
}

sub bad_n_sites {
##############################################################################
## Bad n_sites
##############################################################################

    # this sub tests if msout.pm dies when n_sites is smaller than segsites

    my $gzip   = shift;
    my $infile = shift;
    $infile = Bio::Root::Test::test_input_file($infile);

    my $file_sequence = $infile;
    if ($gzip) {
        $file_sequence = "gzip -dc <$file_sequence |";
    }
    my $msout = Bio::SeqIO->new(
        -file   => "$file_sequence",
        -format => 'msout',
    );

    # test nsites -1
    throws_ok { $msout->set_n_sites(-1) }
    qr|first argument needs to be a positive integer. argument supplied: -1|;

    # test nsites smaller than next hap
    $msout->set_n_sites(1);
    throws_ok { $msout->get_next_seq }
    qr/n_sites needs to be at least the number of segsites of every run/,
      'too few n_sites failed OK';
}

# BioPerl-1.7.8/t/SeqIO/fastq.t
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin( -tests => 149 );

    use_ok('Bio::SeqIO::fastq');
    use_ok('Bio::Seq::Quality');
}

my $DEBUG = test_debug();

# simple parsing, data conversion of fastq example files

my %example_files = (
    bug2335 => {
        'variant'    => 'sanger',
        'seq'        => 'TTGGAATGTTGCAAATGGGAGGCAGTTTGAAATACTGAATAGGCCTCATC'.
                        'GAGAATGTGAAGTTTCAGTAAAGACTTGAGGAAGTTGAATGAGCTGATGA'.
                        'ATGGATATATG',
        'qual'       => '31 23 32 23 31 22 27 28 32 24 25 23 30 25 2 21 33 '.
                        '29 9 17 33 27 27 27 25 33 29 9 28 32 27 7 27 21 '.
                        '26 21 27 27 17 26 23 31 23 32 24 27 27 28 27 28 '.
                        '28 27 27 31 23 23 28 27 27 32 23 27 35 30 12 28 '.
                        '27 27 25 33 29 10 27 28 28 33 25 27 27 31 23 34 '.
                        '27 27 32 24 27 30 22 24 28 24 27 28 27 26 28 27 '.
'28 32 24 28 33 25 23 27 27 28 27 28 26', 'display_id' => 'DS6BPQV01A2G0A', 'desc' => undef, 'count' => 1 }, RT98876 => { 'variant' => 'sanger', 'seq' => 'CCGCCATTTCTTCAAATCTTTTCTTTTCTTTAGGAGTCATCAATTTCCAT'. 'TTCTCTGCACATTTCTTTGAAAATTA', 'qual' => '31 34 34 34 34 34 34 34 34 33 34 34 34 34 34 34 '. '34 34 34 34 34 32 32 34 34 34 34 34 34 34 34 34 '. '30 34 34 34 34 34 34 34 34 34 34 34 34 34 32 32 '. '34 34 34 34 34 34 34 34 34 34 34 34 34 34 34 34 '. '34 34 34 34 33 30 30 27 33 34 29 2', 'display_id' => 'Illumina_SRR125365.38', 'desc' => 's_5_1_0001_qseq_37 length=76', 'count' => 1 }, test1_sanger => { 'variant' => 'sanger', 'seq' => 'TATTGACAATTGTAAGACCACTAAGGATTTTTGGGCGGCAGCGACTTGGA'. 'GCTCTTGTAAAAGCGCACTGCGTTCCTTTTCTTTATTCTTTTGATCTTGA'. 'GAATCTTCTAAAAATGCCGAAAAGAAATGTTGGGAAGAGAGCGTAATCAG'. 'TTTAGAAATGCTCTTGATGGTAGCTTTATGTTGATCCATTCTTCTGCCTC'. 'CTTTACGAATAAAATAGAATAAAACTCAAATGACTAATTACCTGTATTTT'. 'ACCTAATTTTGTGATAAAATTCAAGAAAATATGTTCGCCTTCAATAATTA'. 'TG', 'qual' => '37 37 37 37 37 37 37 37 37 37 37 40 38 40 40 37 '. '37 37 39 39 40 39 39 39 39 39 37 33 33 33 33 33 '. '39 39 34 29 28 28 38 39 39 39 39 39 39 37 37 37 '. '37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 '. '37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 38 '. '38 29 29 29 34 38 37 37 33 33 33 33 37 37 37 37 '. '37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 '. '37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 '. '37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 '. '37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 '. '37 37 37 37 37 37 37 37 37 37 37 37 37 37 38 38 '. '38 37 37 37 37 37 37 37 37 37 37 37 37 37 37 37 '. '37 37 37 37 37 37 37 37 37 34 34 34 38 38 37 37 '. '37 37 37 37 37 37 37 37 40 40 40 40 38 38 38 38 '. '40 40 40 38 38 38 40 40 40 40 40 40 40 40 40 40 '. '38 38 38 38 38 32 25 25 25 25 30 30 31 32 32 31 '. '31 31 31 31 31 31 31 31 19 19 19 19 19 22 22 31 '. '31 31 31 31 31 31 31 32 32 31 32 31 31 31 31 31 '. 
'31 25 25 25 28 28 30 30 30 30 30 31 31 32', 'display_id' => 'SRR005406.250', 'desc' => 'FB9GE3J10F6I2T length=302', 'count' => 250 }, test2_solexa => { 'variant' => 'solexa', 'seq' => 'GTATTATTTAATGGCATACACTCAA', 'qual' => '25 25 25 25 25 25 25 25 25 25 23 25 25 25 25 23 '. '25 23 23 21 23 23 23 17 17', 'display_id' => 'SLXA-B3_649_FC8437_R1_1_1_183_714', 'desc' => undef, 'count' => 5 }, test3_illumina => { 'variant' => 'illumina', 'seq' => 'CCAAATCTTGAATTGTAGCTCCCCT', 'qual' => '15 19 24 15 17 24 24 24 24 24 19 24 24 21 24 24 '. '20 24 24 24 24 20 18 13 19', 'display_id' => 'FC12044_91407_8_200_285_136', 'desc' => undef, 'count' => 25 }, example => { 'variant' => 'sanger', # TODO: guessing on the format here... 'seq' => 'GTTGCTTCTGGCGTGGGTGGGGGGG', 'qual' => '26 26 26 26 26 26 26 26 26 26 26 24 26 22 26 26 '. '13 22 26 18 24 18 18 18 18', 'display_id' => 'EAS54_6_R1_2_1_443_348', 'desc' => undef, 'count' => 3 }, illumina_faked => { 'variant' => 'illumina', 'seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTN', 'qual' => '40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 '. '21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0', 'display_id' => 'Test', 'desc' => 'PHRED qualities from 40 to 0 inclusive', 'count' => 1 }, sanger_93 => { 'variant' => 'sanger', 'seq' => 'ACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGAC'. 'TGACTGACTGACTGACTGACTGACTGACTGACTGAN', 'qual' => '93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 '. '74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 '. '55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 '. '36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 '. '17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0', 'display_id' => 'Test', 'desc' => 'PHRED qualities from 93 to 0 inclusive', 'count' => 1 }, sanger_faked => { 'variant' => 'sanger', 'seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTN', 'qual' => '40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 '. 
'21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0', 'display_id' => 'Test', 'desc' => 'PHRED qualities from 40 to 0 inclusive', 'count' => 1 }, solexa_example => { 'variant' => 'solexa', 'seq' => 'GTATTATTTAATGGCATACACTCAA', 'qual' => '25 25 25 25 25 25 25 25 25 25 23 25 25 25 25 23 '. '25 23 23 21 23 23 23 17 17', 'display_id' => 'SLXA-B3_649_FC8437_R1_1_1_183_714', 'desc' => undef, 'count' => 5 }, solexa_faked => { 'variant' => 'solexa', 'seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTNNNNNN', 'qual' => '40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 '. '24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 10 9 '. '8 7 6 5 5 4 4 3 3 2 2 1 1', 'display_id' => 'slxa_0001_1_0001_01', 'desc' => undef, 'count' => 1 }, tricky => { 'variant' => 'sanger', # TODO: guessing on the format here... 'seq' => 'TGGGAGGTTTTATGTGGAAAGCAGCAATGTACAAGA', 'qual' => '40 40 40 40 40 40 40 13 40 40 40 40 40 40 16 31 '. '19 19 31 12 22 13 4 27 5 10 14 3 14 4 19 7 10 10 '. '7 4', 'display_id' => '071113_EAS56_0053:1:3:990:501', 'desc' => undef, 'count' => 4 }, evil_wrapping => { 'variant' => 'sanger', # TODO: guessing on the format here... 'seq' => 'AACCCGTCCCATCAAAGATTTTGGTTGGAACCCGAAAGGGTTTTGAATTC'. 'AAACCCCTTTCGGTTCCAACTATTCAATTGTTTAACTTTTTTTAAATTGA'. 'TGGTCTGTTGGACCATTTGTAATAATCCCCATCGGAATTTCTTT', 'qual' => '32 26 31 26 4 22 20 30 25 2 27 27 24 36 32 16 '. '26 28 36 32 18 4 33 26 33 26 32 26 33 26 31 26 '. '4 24 36 32 16 36 32 16 36 32 18 4 27 33 26 32 26 '. '23 36 32 15 35 31 18 3 36 32 16 28 33 26 32 26 33 '. '26 33 26 25 28 25 33 26 25 33 25 32 24 25 36 32 '. '15 32 24 27 37 32 23 16 10 5 1 35 30 12 33 26 19 '. '27 25 25 14 27 26 28 25 32 24 23 12 20 30 21 28 '. '34 29 10 23 27 27 18 26 28 19 25 35 32 18 4 27 26 '. 
'28 23 12 24 13 32 28 8 25 33 28 9', 'display_id' => 'SRR014849.203935', 'desc' => 'EIXKN4201B4HU6 length=144', 'count' => 3 }, ); for my $example (sort keys %example_files) { my $file = test_input_file('fastq', "$example.fastq"); my $variant = $example_files{$example}->{variant}; my $in = Bio::SeqIO->new(-format => "fastq-$variant", -file => $file, -verbose => 2); #strictest level my $ct = 0; my $sample_seq; eval { while (my $seq = $in->next_seq) { $ct++; $sample_seq = $seq; # always grab the last seq } }; ok(!$@, "$example parses"); is($ct, $example_files{$example}->{count}, "correct num. seqs in $example"); ok(defined($sample_seq), 'sample sequence obtained'); if ($sample_seq) { isa_ok($sample_seq, 'Bio::Seq::Quality'); for my $method (qw(seq desc display_id)) { is($sample_seq->$method, $example_files{$example}->{$method}, "$method() matches $example"); } is(join(' ', map {sprintf("%.0f", $_)} @{$sample_seq->qual}), $example_files{$example}->{qual}, "qual() matches $example"); my $truncated = $sample_seq->trunc(1,10); is(scalar(@{$truncated->meta}), $truncated->length); } } # test round-trip and conversions (single file of each type) my @variants = qw(sanger illumina solexa); my %conversion = ( # check conversions, particularly solexa sanger_93 => { 'variant' => 'sanger', 'to_solexa' => { '-seq' => 'ACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGAN', '-qual' => [ (map {62} 0..31),(reverse(1..61)),1 ], '-raw_quality' => '~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~}|{zyxwvutsrqponmlkjihgfedcba`_^]\[ZYXWVUTSRQPONMLKJHGFECB@>;;', '-id' => 'Test', '-desc' => 'PHRED qualities from 93 to 0 inclusive', '-descriptor' => 'Test PHRED qualities from 93 to 0 inclusive' }, 'to_illumina' => { '-seq' => 'ACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGAN', '-qual' => [ (map {62} 0..31),(reverse(0..61)) ], '-raw_quality' => 
'~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~}|{zyxwvutsrqponmlkjihgfedcba`_^]\[ZYXWVUTSRQPONMLKJIHGFEDCBA@', '-id' => 'Test', '-desc' => 'PHRED qualities from 93 to 0 inclusive', '-descriptor' => 'Test PHRED qualities from 93 to 0 inclusive' }, 'to_sanger' => { '-seq' => 'ACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGAN', '-qual' => [reverse(0..93)], '-raw_quality' => '~}|{zyxwvutsrqponmlkjihgfedcba`_^]\[ZYXWVUTSRQPONMLKJIHGFEDCBA@?>=<;:9876543210/.-,+*)(\'&%$#"!', '-id' => 'Test', '-desc' => 'PHRED qualities from 93 to 0 inclusive', '-descriptor' => 'Test PHRED qualities from 93 to 0 inclusive' }, }, solexa_faked => { 'variant' => 'solexa', 'to_solexa' => {'-seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTNNNNNN', '-qual' => [qw(40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 10 9 8 7 6 5 5 4 4 3 3 2 2 1 1)], '-raw_quality' => 'hgfedcba`_^]\[ZYXWVUTSRQPONMLKJIHGFEDCBA@?>=<;', '-id' => 'slxa_0001_1_0001_01', '-desc' => '', '-descriptor' => 'slxa_0001_1_0001_01' }, 'to_illumina' => { '-seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTNNNNNN', '-qual' => [qw(40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 10 9 8 7 6 5 5 4 4 3 3 2 2 1 1)], '-namespace' => 'solexa', '-raw_quality' => 'hgfedcba`_^]\\[ZYXWVUTSRQPONMLKJJIHGFEEDDCCBBAA', '-id' => 'slxa_0001_1_0001_01', '-desc' => '', '-descriptor' => 'slxa_0001_1_0001_01' }, 'to_sanger' => { '-seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTNNNNNN', '-qual' => [qw(40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 10 9 8 7 6 5 5 4 4 3 3 2 2 1 1)], '-namespace' => 'solexa', '-raw_quality' => 'IHGFEDCBA@?>=<;:9876543210/.-,++*)(\'&&%%$$##""', '-id' => 'slxa_0001_1_0001_01', '-desc' => '', '-descriptor' => 'slxa_0001_1_0001_01' }, }, illumina_faked => { 'variant' => 'illumina', 'to_solexa' => { '-seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTN', '-qual' => 
[reverse(1..40), 1], # round trip from solexa is lossy '-namespace' => 'illumina', '-raw_quality' => 'hgfedcba`_^]\[ZYXWVUTSRQPONMLKJHGFECB@>;;', '-id' => 'Test', '-desc' => 'PHRED qualities from 40 to 0 inclusive', '-descriptor' => 'Test PHRED qualities from 40 to 0 inclusive' }, 'to_illumina' => { '-seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTN', '-qual' => [reverse(0..40)], '-raw_quality' => 'hgfedcba`_^]\\[ZYXWVUTSRQPONMLKJIHGFEDCBA@', '-id' => 'Test', '-desc' => 'PHRED qualities from 40 to 0 inclusive', '-descriptor' => 'Test PHRED qualities from 40 to 0 inclusive' }, 'to_sanger' => { '-seq' => 'ACGTACGTACGTACGTACGTACGTACGTACGTACGTACGTN', '-qual' => [reverse(0..40)], '-raw_quality' => 'IHGFEDCBA@?>=<;:9876543210/.-,+*)(\'&%$#"!', '-id' => 'Test', '-desc' => 'PHRED qualities from 40 to 0 inclusive', '-descriptor' => 'Test PHRED qualities from 40 to 0 inclusive' } }, ); for my $example (sort keys %conversion) { my $file = test_input_file('fastq', "$example.fastq"); my $variant = $conversion{$example}->{variant}; my $in = Bio::SeqIO->new(-format => "fastq-$variant", -file => $file, -verbose => 2); #strictest level # this both tests the next_dataset method and helps check roundtripping my $seq = $in->next_seq; for my $newvar (@variants) { next unless exists $conversion{$example}->{"to_$newvar"}; my $outfile = test_output_file(); Bio::SeqIO->new(-format => "fastq-$newvar", -file => ">$outfile", -verbose => -1)->write_seq($seq); my $newdata = Bio::SeqIO->new(-format => "fastq-$newvar", -file => $outfile)->next_dataset; # round for simple comparison, get around floating pt comparison probs if ($newvar eq 'solexa') { $newdata->{-qual} = [map {sprintf("%.0f",$_)} @{$newdata->{-qual}}]; } #print Dumper($newdata) if $variant eq 'sanger' && $newvar eq 'illumina'; $conversion{$example}->{"to_$newvar"}->{'-namespace'} = $newvar; is_deeply($newdata, $conversion{$example}->{"to_$newvar"}, "Conversion from $variant to $newvar"); } } # test fastq exception handling my 
%error = ( # file name error_diff_ids => { variant => 'sanger', exception => qr/doesn't\smatch\sseq\sdescriptor/xms, }, error_long_qual => { variant => 'sanger', exception => qr/doesn't\smatch\slength\sof\ssequence/xms, }, error_no_qual => { variant => 'sanger', exception => qr/Missing\ssequence\sand\/or\squality\sdata/xms, }, error_qual_del => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_qual_escape => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_qual_null => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_qual_space => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_qual_tab => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_qual_unit_sep => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_qual_vtab => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_short_qual => { variant => 'sanger', exception => qr/doesn't\smatch\slength\sof\ssequence/, }, error_spaces => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_tabs => { variant => 'sanger', exception => qr/Unknown\ssymbol\swith\sASCII\svalue/xms, }, error_trunc_at_plus => { variant => 'sanger', exception => qr/Missing\ssequence\sand\/or\squality\sdata/xms, }, error_trunc_at_qual => { variant => 'sanger', exception => qr/Missing\ssequence\sand\/or\squality\sdata/xms, }, error_trunc_at_seq => { variant => 'sanger', exception => qr/Missing\ssequence\sand\/or\squality\sdata/xms, }, error_trunc_in_title => { variant => 'sanger', exception => qr/Missing\ssequence\sand\/or\squality\sdata/xms, }, error_trunc_in_seq => { variant => 'sanger', exception => qr/Missing\ssequence\sand\/or\squality\sdata/xms, }, error_trunc_in_plus => { variant => 'sanger', exception => qr/doesn't\smatch\sseq\s 
descriptor/xms,
    },
    error_trunc_in_qual => {
        variant   => 'sanger',
        exception => qr/doesn't\smatch\slength\sof\ssequence/xms,
    },
);

for my $example (sort keys %error) {
    my $file = test_input_file('fastq', "$example.fastq");
    my $variant = $error{$example}->{variant};
    my $in = Bio::SeqIO->new(-format => "fastq-$variant",
                             -file => $file,
                             -verbose => 2); #strictest level
    my $ct = 0;
    throws_ok { while (my $seq = $in->next_seq) {
        $ct++;
    } } $error{$example}->{exception}, "Exception caught for $example";
}

# fastq

my $in = Bio::SeqIO->new(-format => 'fastq',
                         -file => test_input_file('fastq', 'zero_qual.fastq'),
                         -verbose => 2); # strictest level

lives_and {my $seq = $in->next_seq; is($seq->seq, 'G');} 'edge case; single 0 in quality fails';

# BioPerl-1.7.8/t/SeqIO/table.t
# -*-Perl-*- Test Harness script for Bioperl

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 351,
               -requires_module => 'IO::Scalar');

    use_ok('Bio::Tools::CodonTable');
    use_ok('Bio::SeqIO::table');
}

my @names = qw(A6 A6r A6ps1 A6ps2 CaMK2d CaMKK2 AMPKa1 AMPKa2 MARK3 MARK2);
my @accs = qw(SK001 SK512 SK752 SK766 SK703 SK482 SK032 SK033 SK096 SK120);
my @num_anns = (5, 5, 5, 5, 6, 7, 7, 7, 7, 7);
my @psg = (0, 0, 1, 1, 0, 0, 0, 0, 0, 0);
my @rs = (0, 0, 0, 0, 1, 1, 1, 1, 1, 1);

ok my $seqin = Bio::SeqIO->new(-file => test_input_file("test.tsv"),
                               -format => 'table',
                               -species => "Homo sapiens",
                               -delim => "\t",
                               -header => 1,
                               -display_id => 1,
                               -accession_number => 2,
                               -seq => 7,
                               -annotation => 1,
                               -trim => 1);
run_tests([@names],[@accs],[@num_anns],[@psg],[@rs]);

$seqin->close();

ok $seqin = Bio::SeqIO->new(-file => test_input_file("test.tsv"),
                            -format => 'table',
                            -species => "Homo sapiens",
                            -delim => "\t",
                            -header => 1,
                            -display_id => 1,
                            -accession_number => 2,
                            -seq => 7,
                            -colnames => "[Family,Subfamily,Pseudogene?,Protein,Novelty]",
                            -trim => 1);
run_tests([@names],[@accs],[4,4,4,4,4,5,5,5,5,5],[@psg],[@rs]);

$seqin->close();

ok
$seqin = Bio::SeqIO->new(-file => test_input_file("test.tsv"), -format => 'table', -species => "Homo sapiens", -delim => "\t", -header => 1, -display_id => 1, -accession_number => 2, -seq => 7, -annotation => "[4,5,6,8,10]", -trim => 1); run_tests([@names],[@accs],[4,4,4,4,4,5,5,5,5,5],[@psg],[@rs]); # Tests to check that 'description' is read from 'table' format ok $seqin = Bio::SeqIO->new( -file => test_input_file("test-1.tab"), -format => 'table', -header => 1, -display_id => 1, -accession_number => 1, -seq => 3, -desc => 2 ); ok($seqin); my $seq = $seqin->next_seq; ok($seq); is( $seq->desc, 'd1'); is( $seq->display_id, 'n1'); is( $seq->seq, 'aaaa'); $seq = $seqin->next_seq; ok($seq); is( $seq->desc, 'd2'); is( $seq->display_id, 'n2'); is( $seq->seq, 'tttt'); $seqin->close(); # Tests to check that we can _not_ write to 'table' format ok $seqin = Bio::SeqIO->new( -file => test_input_file("test-1.tab.gb"), -format => 'genbank' ); ok($seqin); $seq = $seqin->next_seq; ok($seq); my $tmpfile = test_output_file(); my $seqout = Bio::SeqIO->new( -format => 'table', -file => ">$tmpfile" ); # dies_ok not available # dies_ok { $seqout->write_seq($seq) } "write_seq() not implemented"; sub run_tests { my ($names_,$accs_,$num_anns_,$psg_,$rs_) = @_; my @names = @$names_; my @accs = @$accs_; my @num_anns = @$num_anns_; my @psg = @$psg_; my @rs = @$rs_; my $n = 0; my $translator = Bio::Tools::CodonTable->new(-id => 1); while (my $seq = $seqin->next_seq()) { $n++; is ($seq->display_id, shift(@names)); is ($seq->accession_number, shift(@accs)); ok ($seq->species); is ($seq->species->binomial, "Homo sapiens"); my @anns = $seq->annotation->get_Annotations(); is (scalar(@anns), shift(@num_anns)); @anns = grep { $_->value eq "Y"; } $seq->annotation->get_Annotations("Pseudogene?"); is (scalar(@anns), shift(@psg)); # check sequences and that they translate to what we expect if (($n >= 5) && ($seq->display_id ne "MARK3")) { my $dna = $seq->seq; my $protein = ""; my $frame = 0; while 
($frame <= 2) {
                my $inframe = substr($dna,$frame);
                # translate to protein
                my $protseq = $translator->translate($inframe);
                # chop off everything after the stop and before the first Met
                while ($protseq =~ /(M[^\*]+)/g) {
                    $protein = $1 if length($1) > length($protein);
                }
                $frame++;
            }
            # retrieve expected result from annotation and compare
            my ($protann) = $seq->annotation->get_Annotations("Protein");
            ok (defined $protann);
            is ($protein, $protann->value);
        }

        @anns = grep { $_->value eq "Known - Refseq";
                   } $seq->annotation->get_Annotations("Novelty");
        is (scalar(@anns), shift(@rs));
        @anns = $seq->annotation->get_Annotations("Subfamily");
        is (scalar(@anns), ($n <= 5) ? 0 : 1);
        @anns = $seq->annotation->get_Annotations("Family");
        is (scalar(@anns), 1);
        is (substr($anns[0]->value,0,4), ($n <= 4) ? "A6" : "CAMK");
    }

    is ($n, 10);
}

# BioPerl-1.7.8/t/Seq/MetaSeq.t
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 132);

    use_ok('Bio::Seq::Meta');
    use_ok('Bio::Seq::Meta::Array');
    use_ok('Bio::SeqIO');
    use_ok('Bio::AlignIO');
    use_ok('Bio::Seq::Quality');
}

my $DEBUG = test_debug();

ok my $seq = Bio::Seq::Meta->new( -seq => "AT-CGATCGA");
is $seq->is_flush, 1;
is $seq->revcom->seq, 'TCGATCG-AT';
is $seq->meta, "";
ok $seq->force_flush(1);
is $seq->meta, " ";
$seq->seq("AT-CGATCGATT");
is $seq->meta, " ";
ok not $seq->force_flush(0);

ok $seq = Bio::Seq::Meta::Array->new( -seq => "AT-CGATCGA");
is $seq->is_flush, 1;
is $seq->revcom->seq, 'TCGATCG-AT';
is $seq->meta_text, "";
ok $seq->force_flush(1);
$seq->seq("AT-CGATCGATT");
is $seq->meta_text, "0 0 0 0 0 0 0 0 0 0 0 0";
ok not $seq->force_flush(0);

ok $seq = Bio::Seq::Quality->new( -seq => "AT-CGATCGA");
is $seq->meta_text, "";
ok $seq->force_flush(1);
is $seq->meta_text, "0 0 0 0 0 0 0 0 0 0";
$seq->seq("AT-CGATCGATT");
is $seq->meta_text, "0 0 0 0 0 0 0 0 0 0 0 0";
ok not $seq->force_flush(0);

ok
$seq = Bio::Seq::Meta->new ( -seq => "", -meta => "", -alphabet => 'dna', -id => 'myid' ); # create a sequence object ok $seq = Bio::Seq::Meta->new( -seq => "AT-CGATCGA", -id => 'test', -verbose => 2, -force_flush => 1 ); is $seq->meta, " "; is $seq->meta_length, 10; # Create some random meta values, but gap in the wrong place my $metastring = "a-abb bb "; $seq->meta($metastring); $seq->verbose(1); # create some random meta values, but not for the last residue $metastring = "aa-bb bb"; ok $seq->meta($metastring), $metastring. " "; # truncate the sequence by assignment $seq->force_flush(1); $seq->seq('AT-CGA'); $seq->alphabet('dna'); is $seq->meta, 'aa-bb '; is $seq->start, 1; is $seq->end, 5; $seq->force_flush(0); # truncate the sequence with trunc() is $seq->strand(-1), -1; ok $seq = $seq->trunc(1,5); is $seq->start, 2; is $seq->end, 5; is $seq->seq, 'AT-CG'; is $seq->meta, 'aa-bb'; is $seq->strand, -1; # revcom ok $seq = $seq->revcom; is $seq->seq, 'CG-AT'; is $seq->meta, 'bb-aa'; is $seq->strand, 1; # submeta is $seq->subseq(2,4), 'G-A'; is $seq->submeta(2,4), 'b-a'; is $seq->submeta(2,undef, 'c-c'), 'c-ca'; is $seq->submeta(2,4), 'c-c'; is $seq->meta, 'bc-ca'; is $seq->meta(''), ' '; is $seq->submeta(2,undef, 'c-c'), 'c-c '; is $seq->meta, ' c-c '; # add named meta annotations my $first = '11-22'; is $seq->named_meta('first', $first), $first; is $seq->named_meta('first'), $first; my $second = '[[-]]'; ok $seq->named_meta('second', $second); # undefined range arguments is $seq->named_submeta('second', 3, 4), '-]'; is $seq->named_submeta('second', 3), '-]]'; is $seq->named_submeta('second'), '[[-]]'; my @names = $seq->meta_names; is @names, 3; is $names[0], 'DEFAULT'; # # IO tests # sub diff { my ($infile, $outfile) = @_; my ($in, $out); open my $FH_IN, '<', $infile or die "Could not read file '$infile': $!\n"; $in .= $_ while (<$FH_IN>); close $FH_IN; open my $FH_OUT, '<', $outfile or die "Could not read file '$outfile': $!\n"; $out .= $_ while (<$FH_OUT>); 
close $FH_OUT; print "|$in||$out|\n" if $DEBUG; is $in, $out; } # SeqIO my $str = Bio::SeqIO->new ( '-file'=> test_input_file('test.metafasta'), '-format' => 'metafasta'); ok $seq = $str->next_seq; my $outfile = test_output_file(); my $strout = Bio::SeqIO->new ('-file'=> ">". $outfile, '-format' => 'metafasta'); ok $strout->write_seq($seq); diff (test_input_file('test.metafasta'), $outfile ); # AlignIO $str = Bio::AlignIO->new ( '-file'=> test_input_file('testaln.metafasta'), '-format' => 'metafasta'); ok my $aln = $str->next_aln; $outfile = test_output_file(); $strout = Bio::AlignIO->new ('-file'=> ">". $outfile, '-format' => 'metafasta'); ok $strout->write_aln($aln); diff (test_input_file('testaln.metafasta'), $outfile ); # ## ### tests for Meta::Array ## # ok $seq = Bio::Seq::Meta::Array->new ( -seq => "", -meta => "", -alphabet => 'dna', -id => 'myid' ); # create a sequence object ok $seq = Bio::Seq::Meta::Array->new( -seq => "AT-CGATCGA", -id => 'test', -force_flush => 1, -verbose => 2 ); is $seq->is_flush, 1; #is $seq->meta_text, " "; is $seq->meta_text, '0 0 0 0 0 0 0 0 0 0'; # create some random meta values, but not for the last residue $metastring = "a a - b b 0 b b 0"; is join (' ', @{$seq->meta($metastring)}), $metastring. ' 0'; is $seq->meta_text, $metastring. 
' 0'; # truncate the sequence by assignment $seq->seq('AT-CGA'); $seq->alphabet('dna'); is $seq->meta_text, 'a a - b b 0'; # truncate the sequence with trunc() is $seq->strand(-1), -1; ok $seq = $seq->trunc(1,5); is $seq->seq, 'AT-CG'; is $seq->meta_text, 'a a - b b'; is $seq->strand, -1; #is $seq->length, 5; #is $seq->meta_length, 6; #ok $seq->force_flush(1); #is $seq->meta_length, 5; # revcom ok $seq = $seq->revcom; is $seq->seq, 'CG-AT'; is $seq->meta_text, 'b b - a a'; is $seq->strand, 1; # submeta is $seq->subseq(2,4), 'G-A'; is $seq->submeta_text(2,4), 'b - a'; is $seq->submeta_text(2,undef, 'c - c'), 'c - c'; is $seq->submeta_text(2,4), 'c - c'; is $seq->meta_text, 'b c - c a'; is $seq->meta_text(''), '0 0 0 0 0'; is $seq->submeta_text(2,undef, 'c - c'), 'c - c'; is $seq->meta_text, '0 c - c 0'; # add named meta annotations $first = '1 10 - 222 23'; is $seq->named_meta_text('first', $first), $first; is $seq->named_meta_text('first'), $first; $second = '[ [ - ] ]'; ok $seq->named_meta_text('second', $second); # undefined range arguments is $seq->named_submeta_text('second', 3, 4), '- ]'; is $seq->named_submeta_text('second', 3), '- ] ]'; is $seq->named_submeta_text('second'), '[ [ - ] ]'; @names = $seq->meta_names; is @names, 3; is $names[0], 'DEFAULT'; # # testing the forcing of flushed meta values # ok $seq = Bio::Seq::Meta->new( -seq => "AT-CGATCGA", -id => 'test', -verbose => 2 ); is $seq->submeta(4, 6, '456'), '456'; is $seq->meta_length, 6; is $seq->length, 10; is $seq->meta, " 456"; ok $seq->force_flush(1); is $seq->meta, " 456 "; ok $seq->seq('aaatttc'); is $seq->meta, " 456 "; ok $seq = Bio::Seq::Meta::Array->new( -seq => "AT-CGATCGA", -id => 'test', -verbose => 2 ); is join (' ', @{$seq->submeta(4, 6, '4 5 6')}), '4 5 6'; is $seq->meta_length, 6; is $seq->length, 10; is $seq->meta_text, "0 0 0 4 5 6"; ok $seq->force_flush(1); is $seq->meta_text, "0 0 0 4 5 6 0 0 0 0"; ok $seq->seq('aaatttc'); is $seq->meta_text, "0 0 0 4 5 6 0"; is 
$seq->meta_length, 7;

ok $seq = Bio::Seq::Quality->new( -seq => "AT-CGATCGA",
                                  -id => 'test',
                                  -verbose => 2
                                );
is join (' ', @{$seq->submeta(4, 6, '4 5 6')}), '4 5 6';
is $seq->meta_length, 6;
is $seq->length, 10;

is $seq->meta_text, "0 0 0 4 5 6";
ok $seq->force_flush(1);
is $seq->meta_text, "0 0 0 4 5 6 0 0 0 0";
ok $seq->seq('aaatttc');
is $seq->meta_text, "0 0 0 4 5 6 0";
is $seq->meta_length, 7;
is $seq->trace_length, 7;
#is $seq->quality_length, 7;

is $seq->is_flush, 1;
is $seq->trace_is_flush, 1;
is $seq->quality_is_flush, 1;

# quality: trace_lengths, trace_is_flush, quality_is_flush

# BioPerl-1.7.8/t/Seq/Quality.t
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 85);

    use_ok('Bio::Seq::Quality');
}

use Bio::SeqIO;

my $DEBUG = test_debug();

# create some random sequence object with no id
my $seqobj_broken = Bio::Seq::Quality->
    new( -seq => "ATCGATCGA",
       );

my $seqobj;
lives_ok {
    $seqobj = Bio::Seq::Quality->
        new( -seq => "ATCGATCGA",
             -id => 'QualityFragment-12',
             -accession_number => 'X78121',
           );
};

# create some random quality object with the same number of qualities
# and the same identifiers
my $string_quals = "10 20 30 40 50 40 30 20 10";
my $qualobj;
lives_ok {
    $qualobj = Bio::Seq::Quality->
        new( -qual => $string_quals,
             -id => 'QualityFragment-12',
             -accession_number => 'X78121',
           );
};

# check to see what happens when you construct the Quality object
ok my $swq1 = Bio::Seq::Quality->
    new( -seq => "ATCGATCGA",
         -id => 'QualityFragment-12',
         -accession_number => 'X78121',
         -qual => $string_quals);

print("Testing various weird constructors...\n") if $DEBUG;
print("\ta) No ids, Sequence object, no quality...\n") if $DEBUG;

# w for weird
my $wswq1;
lives_ok {
    $wswq1 = Bio::Seq::Quality->
        new( -seq => "ATCGATCGA",
             -qual => "");
};
print $@ if $DEBUG;

print("\tb) No ids, no sequence, quality object...\n") if $DEBUG;

# note that you
# must provide an alphabet for this one.
$wswq1 = Bio::Seq::Quality->
    new( -seq => "",
         -qual => $string_quals,
         -alphabet => 'dna'
       );

print("\tc) Absolutely nothing. (HAHAHAHA)...\n") if $DEBUG;
lives_ok {
    $wswq1 = Bio::Seq::Quality->new( -seq => "",
                                     -qual => "",
                                     -alphabet => 'dna'
                                   );
};

print("\td) Absolutely nothing but an ID\n") if $DEBUG;
lives_ok {
    $wswq1 = Bio::Seq::Quality->
        new( -seq => "",
             -qual => "",
             -alphabet => 'dna',
             -id => 'an object with no sequence and no quality but with an id'
           );
};

print("\te) No sequence, no quality, no ID...\n") if $DEBUG;
warnings_like {
    $wswq1 = Bio::Seq::Quality->
        new( -seq => "",
             -qual => "",
             -verbose => 0);
} qr/not guess alphabet/i;

print("Testing various methods and behaviors...\n") if $DEBUG;
print("1. Testing the seq() method...\n") if $DEBUG;
print("\t1a) get\n") if $DEBUG;
my $original_seq = $swq1->seq();
is ($original_seq, "ATCGATCGA");

print("\t1b) set\n") if $DEBUG;
ok ($swq1->seq("AAAAAAAAAAAA"));

print("\t1c) get (again, to make sure the set was done.)\n") if $DEBUG;
is($swq1->seq(), "AAAAAAAAAAAA");

print("\tSetting the sequence back to the original value...\n") if $DEBUG;
$swq1->seq($original_seq);

print("2. Testing the qual() method...\n") if $DEBUG;
print("\t2a) get\n") if $DEBUG;
my @qual = @{$swq1->qual()};
my $str_qual = join(' ',@qual);
is $str_qual, "10 20 30 40 50 40 30 20 10";

print("\t2b) set\n") if $DEBUG;
ok $swq1->qual("10 10 10 10 10");

print("\t2c) get (again, to make sure the set was done.)\n") if $DEBUG;
my @qual2 = @{$swq1->qual()};
my $str_qual2 = join(' ',@qual2);
is($str_qual2, "10 10 10 10 10 0 0 0 0"); ###!

print("\tSetting the quality back to the original value...\n") if $DEBUG;
$swq1->qual($str_qual);

print("3. Testing the length() method...\n") if $DEBUG;
print("\t3a) When lengths are equal...\n") if $DEBUG;
is($swq1->length(), 9);

print("\t3b) When lengths are different\n") if $DEBUG;
$swq1->qual("10 10 10 10 10");
isnt ($swq1->length(), "DIFFERENT");

print("6.
Testing the subqual() method...\n") if $DEBUG; my $t_subqual = "10 20 30 40 50 60 70 80 90"; $swq1->qual($t_subqual); print("\t6d) Testing the subqual at the start (border condition)\n") if $DEBUG; ok ('10 20 30' eq join(' ',@{$swq1->subqual(1,3)})); print("\t6d) Testing the subqual at the end (border condition)\n") if $DEBUG; ok ('70 80 90' eq join(' ',@{$swq1->subqual(7,9)})); print("\t6d) Testing the subqual in the middle\n") if $DEBUG; ok ('40 50 60' eq join(' ',@{$swq1->subqual(4,6)})); print("7. Testing cases where quality is zero...\n") if $DEBUG; $swq1 = Bio::Seq::Quality->new(-seq => 'G', -qual => '0', ); my $swq2 = Bio::Seq::Quality->new(-seq => 'G', -qual => '65', ); is $swq1->length, $swq2->length; $swq1 = Bio::Seq::Quality->new(-seq => 'GC', -qual => '0 0', ); $swq2 = Bio::Seq::Quality->new(-seq => 'GT', -qual => '65 0', ); my $swq3 = Bio::Seq::Quality->new(-seq => 'AG', -qual => '0 60', ); is $swq1->length, $swq2->length; is $swq1->length, $swq3->length; # # end of test inherited from seqwithquality.t # ################################################################# # # testing new functionality # my $qual = '0 1 2 3 4 5 6 7 8 9 11 12 13'; my $trace = '0 5 10 15 20 25 30 35 40 45 50 55 60'; ok my $seq = Bio::Seq::Quality->new ( -qual => $qual, -trace_indices => $trace, -seq => 'atcgatcgatcgt', -id => 'human_id', -accession_number => 'S000012', -verbose => $DEBUG >= 0 ? $DEBUG : 0 ); print("2. Testing the trace() method...\n") if $DEBUG; print("\t2a) get\n") if $DEBUG; my @trace = @{$seq->trace()}; my $str_trace = join(' ',@trace); is $str_trace, $trace; print("\t2b) set\n") if $DEBUG; ok $seq->trace("10 10 10 10 10"); print("\t2c) get (again, to make sure the set was done.)\n") if $DEBUG; my @trace2 = @{$seq->trace()}; my $str_trace2 = join(' ',@trace2); is($str_trace2, "10 10 10 10 10 0 0 0 0 0 0 0 0"); ###! 
print("\tSetting the trace back to the original value...\n") if $DEBUG; $seq->trace($trace); is_deeply $seq->qual, [split / /, $qual]; is_deeply $seq->trace, [split / /, $trace]; is_deeply $seq->trace_indices, [split / /, $trace]; #deprecated is $seq->qual_text, $qual; is $seq->trace_text, $trace; is join (' ', @{$seq->subqual(2, 3)}), '1 2'; is $seq->subqual_text(2, 3), '1 2'; is join (' ', @{$seq->subqual(2, 3, "9 9")}), '9 9'; is $seq->subqual_text(2, 3, "8 8"), '8 8'; is join (' ', @{$seq->subtrace(2, 3)}), '5 10'; is $seq->subtrace_text(2, 3), '5 10'; is join (' ', @{$seq->subtrace(2, 3, "9 9")}), '9 9'; is $seq->subtrace_text(2, 3, "8 8"), '8 8'; is $seq->trace_index_at(5), 20; is join(' ', @{$seq->sub_trace_index(5,6)}), "20 25"; is $seq->baseat(2), 't'; is $seq->baseat(3), 'c'; is $seq->baseat(4), 'g'; is $seq->baseat(5), 'a'; ############################################# # # same tests using Seq::Meta::Array methods follow ... # my $meta = '0 1 2 3 4 5 6 7 8 9 11 12'; $trace = '0 5 10 15 20 25 30 35 40 45 50 55'; my @trace_array = qw(0 5 10 15 20 25 30 35 40 45 50 55); ok $seq = Bio::Seq::Quality->new ( -meta => $meta, -seq => 'atcgatcgatcg', -id => 'human_id', -accession_number => 'S000012', -verbose => $DEBUG >= 0 ? 
$DEBUG : 0 ); $seq->named_meta('trace', \@trace_array); is_deeply $seq->meta, [split / /, $meta]; is_deeply $seq->named_meta('trace'), [split / /, $trace]; is $seq->meta_text, $meta; is $seq->named_meta_text('trace'), $trace; is join (' ', @{$seq->submeta(2, 3)}), '1 2'; is $seq->submeta_text(2, 3), '1 2'; is join (' ', @{$seq->submeta(2, 3, "9 9")}), '9 9'; is $seq->submeta_text(2, 3, "8 8"), '8 8'; is join (' ', @{$seq->named_submeta('trace', 2, 3)}), '5 10'; is $seq->named_submeta_text('trace', 2, 3), '5 10'; is join (' ', @{$seq->named_submeta('trace', 2, 3, "9 9")}), '9 9'; is $seq->named_submeta_text('trace', 2, 3, "8 8"), '8 8'; ok $seq = Bio::Seq::Quality->new( -seq => "ATGGGGGTGGTGGTACCCTATGGGGGTGGTGGTACCCT", -qual => "10 59 12 75 63 76 84 36 42 10 35 97 81 50 81 53 93 13 38 10 59 12 75 63 76 84 36 42 10 35 97 81 50 81 53 93 13 38", -trace_indices => "1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38" ); my $rev; ok $rev = $seq->revcom; is $rev->seq, 'AGGGTACCACCACCCCCATAGGGTACCACCACCCCCAT'; is $rev->qual_text, "38 13 93 53 81 50 81 97 35 10 42 36 84 76 63 75 12 59 10 38 13 93 53 81 50 81 97 35 10 42 36 84 76 63 75 12 59 10"; # selecting ranges based on quality # test seq with three high quality regions (13, 12 and 3), one very short (3) ok $seq = Bio::Seq::Quality->new( -seq => "ATGGGGGTGGTGGTACCCTATGGGGGTGGTGGTACCCT", -qual => "0 5 10 20 30 40 40 50 50 50 50 50 40 10 10 10 5 5 20 20 30 40 50 44 44 50 50 50 50 50 5 5 40 40 40 40 50 50" ); is $seq->threshold, undef; is $seq->threshold(10), 10; is $seq->threshold(13), 13; is $seq->count_clear_ranges, 3; my $newseq = $seq->get_clear_range; is $newseq->length, 12; my @ranges = $seq->get_all_clean_ranges; is scalar @ranges, 3; my $min_length = 10; @ranges = $seq->get_all_clean_ranges($min_length); is scalar @ranges, 2; my $seqio = Bio::SeqIO->new( -file => test_input_file('test_clear_range.fastq'), -format => 'fastq' ); while ( my $seq = $seqio->next_seq() 
) { my $newqualobj; lives_ok { $newqualobj = $seq->get_clear_range(0) }; if ($newqualobj) { is($newqualobj->id, $seq->id, 'Bug 2845'); } else { ok(0, "No object returned via get_clear_range()"); } } ############################################# # # try testing some 'metamorphic relations' # ## belief; As the threshold is increased, the number of clear ranges ## (ncr) should not decrease. ## belief; As the threshold is increased, the length of the clear ## ranges (lcr) should not decrease. ## belief; As the threshold is increased, the clear range length (clr) ## should not increase. Sorry for the terrible var names. ## belief; The number of clear ranges should vary between zero and ## half the sequence length. ## belief; The length of the clear ranges should vary between zero and ## the sequence length. ## belief; The length of the clear range should vary between zero and ## the sequence length. ## belief; The length of the clear range should not be larger than the ## length of the clear ranges.
my @bases = qw (A T C G a t c g); my @qualities = 0..65; ## See beliefs above: my $ncr_thresh_sanity = 0; my $lcr_thresh_sanity = 0; my $clr_thresh_sanity = 0; my $ncr_range_sanity = 0; my $lcr_range_sanity = 0; my $clr_range_sanity = 0; my $final_loss_of_sanity = 0; ## Go time: for (1..100){ $seq = join("", map {$bases[rand(@bases)]} 1..1000 ); $qual = join(" ", map {$qualities[rand(@qualities)]} 1..1000 ); $seq = Bio::Seq::Quality-> new( -seq => $seq, -qual => $qual, ); $seq->threshold(10); my $a_ncr = $seq->count_clear_ranges; my $a_lcr = $seq->clear_ranges_length; my $a_clr = scalar(@{$seq->get_clear_range->qual}); $ncr_range_sanity ++ if $a_ncr >= 0 && $a_ncr <= 500; $lcr_range_sanity ++ if $a_lcr >= 0 && $a_lcr <= 1000; $clr_range_sanity ++ if $a_clr >= 0 && $a_clr <= 1000; $final_loss_of_sanity ++ if $a_lcr >= $a_clr; $seq->threshold(20); my $b_ncr = $seq->count_clear_ranges; my $b_lcr = $seq->clear_ranges_length; my $b_clr = scalar(@{$seq->get_clear_range->qual}); $ncr_range_sanity ++ if $b_ncr >= 0 && $b_ncr <= 500; $lcr_range_sanity ++ if $b_lcr >= 0 && $b_lcr <= 1000; $clr_range_sanity ++ if $b_clr >= 0 && $b_clr <= 1000; $final_loss_of_sanity ++ if $b_lcr >= $b_clr; $seq->threshold(30); my $c_ncr = $seq->count_clear_ranges; my $c_lcr = $seq->clear_ranges_length; my $c_clr = scalar(@{$seq->get_clear_range->qual}); $ncr_range_sanity ++ if $c_ncr >= 0 && $c_ncr <= 500; $lcr_range_sanity ++ if $c_lcr >= 0 && $c_lcr <= 1000; $clr_range_sanity ++ if $c_clr >= 0 && $c_clr <= 1000; $final_loss_of_sanity ++ if $c_lcr >= $c_clr; $ncr_thresh_sanity ++ if $a_ncr <= $b_ncr && $b_ncr <= $c_ncr; $lcr_thresh_sanity ++ if $a_ncr <= $b_ncr && $b_ncr <= $c_ncr; $clr_thresh_sanity ++ if $a_clr >= $b_clr && $b_clr >= $c_clr; } is $ncr_thresh_sanity, 100; is $lcr_thresh_sanity, 100; is $clr_thresh_sanity, 100; is $ncr_range_sanity, 300; is $lcr_range_sanity, 300; is $clr_range_sanity, 300; is $final_loss_of_sanity, 300; ## Test the mask sequence function ... 
## Ideally we'd at least test each function with each permutation of constructors. my $x = Bio::Seq::Quality-> new( -seq => "aaaattttccccgggg", -qual =>"1 1 1 1 2 2 2 2 1 1 1 1 3 3 3 3"); $x->threshold(1); is $x->mask_below_threshold, "aaaattttccccgggg"; $x->threshold(2); is $x->mask_below_threshold, "XXXXttttXXXXgggg"; $x->threshold(3); is $x->mask_below_threshold, "XXXXXXXXXXXXgggg"; $x->threshold(4); is $x->mask_below_threshold, "XXXXXXXXXXXXXXXX"; maf.t100644000766000024 211614006427353 15675 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: maf.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 11); use_ok('Bio::AlignIO::maf'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # MAF $str = Bio::AlignIO->new( '-file' => test_input_file("humor.maf")); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'NM_006987/5000-1', "maf input test"; is $aln->get_seq_by_pos(1)->strand, -1; # MAF - bug 2453 $str = Bio::AlignIO->new( '-file' => test_input_file("bug2453.maf")); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'hg16.chr7/27578829-27578866', "maf input test"; is $aln->get_seq_by_pos(1)->strand, 1; $aln = $str->next_aln(); is $aln->get_seq_by_pos(1)->get_nse, 'hg16.chr7/27699740-27699745', "maf input test"; is $aln->get_seq_by_pos(1)->strand, 1; $aln = $str->next_aln(); is $aln->get_seq_by_pos(1)->get_nse, 'hg16.chr7/27707222-27707234', "maf input test"; is $aln->get_seq_by_pos(1)->strand, 1; msf.t100644000766000024 121714006427353 15720 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: msf.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 4); use_ok('Bio::AlignIO::msf'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # MSF $str = 
Bio::AlignIO->new( '-file' => test_input_file("testaln.msf")); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, '1433_LYCES/9-246', "msf input test"; $strout = Bio::AlignIO->new( '-file' => ">".test_output_file(), '-format' => 'msf'); $status = $strout->write_aln($aln); is $status, 1, "msf output test"; psi.t100644000766000024 105514006427353 15726 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: psi.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 5); use_ok('Bio::AlignIO::psi'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # PSI format $str = Bio::AlignIO->new( '-file' => test_input_file("testaln.psi"), '-format' => 'psi'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(1)->get_nse, 'QUERY/1-798'); is($aln->num_sequences, 56); arp.t100644000766000024 622114006427353 15715 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: arp.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 48, -requires_module => 'Data::Stag'); use_ok('Bio::AlignIO::arp'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # ARP format $str = Bio::AlignIO ->new( '-file' => test_input_file("testaln.arp"), -verbose => 1, '-format' => 'arp'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(1)->get_nse, '01/1-399','ARP get_nse()'); is($aln->get_seq_by_pos(1)->length, '407'); is($aln->num_sequences, 60,'ARP num_sequences()'); is($aln->id, 'Mandenka', 'ARP id()'); is($aln->description, 'mtDNA sequences in the Senegalese Mandenka (hypervariable region 1)', 'ARP description()'); my $coll = $aln->annotation; isa_ok($coll, 'Bio::AnnotationCollectionI'); my ($ann) = 
$coll->get_Annotations('Samples'); isa_ok($ann, 'Bio::AnnotationI'); my %nodes = $ann->pairs; is(keys %nodes, 60); is($nodes{'03'}, 10); is(($coll->get_Annotations('DataType'))[0]->value,'DNA'); is(($coll->get_Annotations('MissingData'))[0]->value,'?'); $str = Bio::AlignIO->new( '-file' => test_input_file("testaln2.arp"), '-format' => 'arp'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(1)->get_nse, '000/1-29','ARP get_nse()'); is($aln->num_sequences, 3,'ARP num_sequences()'); is($aln->id, 'Population 1', 'ARP id()'); is($aln->description, 'An example of DNA sequence data', 'ARP description()'); $coll = $aln->annotation; isa_ok($coll, 'Bio::AnnotationCollectionI'); ($ann) = $coll->get_Annotations('Samples'); isa_ok($ann, 'Bio::AnnotationI'); %nodes = $ann->pairs; is(keys %nodes, 3); is($nodes{'001'}, 1); is(($coll->get_Annotations('DataType'))[0]->value, 'DNA'); is(($coll->get_Annotations('SampleSize'))[0]->value, 6); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(2)->get_nse, '001/1-29','ARP get_nse()'); is($aln->num_sequences, 8,'ARP num_sequences()'); is($aln->id, 'Population 2', 'ARP id()'); is($aln->description, 'An example of DNA sequence data', 'ARP description()'); $coll = $aln->annotation; isa_ok($coll, 'Bio::AnnotationCollectionI'); ($ann) = $coll->get_Annotations('Samples'); isa_ok($ann, 'Bio::AnnotationI'); %nodes = $ann->pairs; is(keys %nodes, 8); is($nodes{'001'}, 1); is(($coll->get_Annotations('DataType'))[0]->value, 'DNA'); is(($coll->get_Annotations('SampleSize'))[0]->value, 8); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(2)->get_nse, '024/1-29','ARP get_nse()'); is($aln->num_sequences, 6,'ARP num_sequences()'); is($aln->id, 'Population 3', 'ARP id()'); is($aln->description, 'An example of DNA sequence data', 'ARP description()'); $coll = $aln->annotation; isa_ok($coll, 'Bio::AnnotationCollectionI'); 
($ann) = $coll->get_Annotations('Samples'); isa_ok($ann, 'Bio::AnnotationI'); %nodes = $ann->pairs; is(keys %nodes, 6); is($nodes{'024'}, 1); is(($coll->get_Annotations('DataType'))[0]->value, 'DNA'); is(($coll->get_Annotations('SampleSize'))[0]->value, 6); RootIO.t100644000766000024 71114006427353 15725 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Root############################################## # tests http retrieval ############################################## use strict; use warnings; BEGIN { use Bio::Root::Test; test_begin(-tests => 3, -requires_networking => 1); use_ok 'Bio::Root::IO'; } my $TESTURL = 'http://www.google.com/index.html'; my $rio; ok $rio = Bio::Root::IO->new(-url=>$TESTURL), 'default -url method'; lives_ok {$rio = Bio::Root::IO->new(-url=>$TESTURL)}; seqs.fas100644000766000024 161714006427353 16047 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>gi|239758|bbs|68379 glucocorticoid receptor, GR [human, Peptide Partial, 394 aa] MDSKESLTPGREENPSSVLAQERGDVMDFYKTLRGGATVKVSASSPSLAVASQSDSKQRRLLVDFPKGSV >gi|239752|bbs|68871 PML-3=putative zinc finger protein [human, Peptide, 802 aa] MPPPETPSEGRQPSPSPSPTERAPASEEEFQFLRCQQCQAEAKCPKLLPCLHTLCSGCLEASGMQCPICQ >gi|238775|bbs|65126 putative tyrosine kinase receptor=UFO [human, NIH3T3, Peptide, 894 aa] MAWRCPRMGRVPLAWCLALCGWACMAPRGTQAEESPFVGNPGNITGARGLTGTLRCQLQVQGEPPEVHWL >gi|239006|bbs|65162 alpha(1,3)-fucosyltransferase, ELFT [human, Peptide, 400 aa] MGAPWGSPTAAAGGRRGWRRGRGLPWTVCVLAAAGLTCTALITYACWGQLPPLPWASPTPSRPVGVLLWW >gi|237597|bbs|60089 putative adhesion molecule=ADMLX [human, Peptide, 679 aa] MVPGVPGAVLTLCLWLAASSGSWRPAPARLCAAAGRVAVCRERPARSCASRCLSLQITRISAFFQHFQNN >gi|237995|bbs|62046 NK-1 receptor [human, lung, Peptide, 407 aa] MDNVLPVDSDLSPNISTNTSEPNQFVQPAWEIVLWAAAYTVIVVTSVVGNVVVMWIILAHKRMRTVTNYF mast.dat100644000766000024 50640714006427353 16106 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data******************************************************************************** MAST - Motif 
Alignment and Search Tool ******************************************************************************** MAST version 3.0 (Release date: 2002/04/02 00:11:59) For further information on how to interpret these results or to get a copy of the MAST software please access http://meme.sdsc.edu. ******************************************************************************** ******************************************************************************** REFERENCE ******************************************************************************** If you use this program in your research, please cite: Timothy L. Bailey and Michael Gribskov, "Combining evidence using p-values: application to sequence homology searches", Bioinformatics, 14(48-54), 1998. ******************************************************************************** ******************************************************************************** DATABASE AND MOTIFS ******************************************************************************** DATABASE /home/sao/applications/mask_db/upstream.masked (nucleotide) Last updated on Mon Jul 28 14:24:33 2003 Database contains 35386 sequences, 70745108 residues Reverse complement strands are not scored. MOTIFS acro1.set.fasta.conserved.25.meme (nucleotide) MOTIF WIDTH BEST POSSIBLE MATCH ----- ----- ------------------- 2 21 ATGAAAAGCAGTGATTCACAG 3 21 AGTGCCCTGGGCCTTTAAGAG 4 23 AGGAGAGAGCTCACACCTCTCCC 8 21 AACTTCAAAATGTCTCTTTGA PAIRWISE MOTIF CORRELATIONS: MOTIF 2 3 4 ----- ----- ----- ----- 3 0.20 4 0.32 0.17 8 0.20 0.23 0.23 No overly similar pairs (correlation > 0.60) found. 
Random model letter frequencies (from /home/sao/proj_motif/model.bckg): A 0.264 C 0.241 G 0.237 T 0.258 ******************************************************************************** ******************************************************************************** SECTION I: HIGH-SCORING SEQUENCES ******************************************************************************** SEQUENCE NAME DESCRIPTION E-VALUE LENGTH ------------- ----------- -------- ------ NM_148937 ENSMUSG00000026173 1e-19 2000 NM_080688 ENSRNOG00000016361 9.5e-18 2000 NM_032726 ENSG00000115556 7.2e-17 2000 NM_006422 ENSG00000111254 1.5e-06 2000 NM_011775 ENSMUSG00000030911 7.7e-06 2000 NM_013455 ENSMUSG00000022622 0.0015 2000 NM_009581 ENSMUSG00000042554 0.0045 2000 NM_022915 ENSG00000135900 0.034 2000 NM_012490 ENSRNOG00000013584 0.044 2000 NM_146726 ENSMUSG00000047041 0.075 2000 NM_003460 ENSG00000103310 0.11 2000 NM_029236 ENSMUSG00000037525 0.14 2000 NM_012069 ENSG00000101892 0.2 2000 NM_007868 ENSMUSG00000025065 0.2 2000 NM_006805 ENSG00000177733 0.21 2000 NM_012038 ENSMUSG00000019005 0.27 2000 NM_176924 ENSMUSG00000047730 0.27 2000 NM_053967 ENSRNOG00000007231 0.28 2000 NM_022484 ENSG00000146802 0.31 2000 NM_016106 ENSG00000092108 0.4 2000 NM_016163 ENSG00000092108 0.4 2000 NM_021475 ENSMUSG00000022057 0.57 2000 NM_013462 ENSMUSG00000031489 0.74 2000 NM_002227 ENSG00000162434 0.83 2000 NM_000717 ENSG00000167434 0.97 2000 NM_147080 ENSMUSG00000044506 1 2000 NM_001097 ENSG00000100312 1.1 2000 NM_023662 ENSMUSG00000031592 1.1 2000 NM_026576 ENSMUSG00000016984 1.4 2000 NM_021757 ENSRNOG00000006655 1.6 2000 NM_019794 ENSMUSG00000031701 1.7 2000 NM_019343 ENSRNOG00000003618 1.7 2000 NM_144715 ENSG00000163576 1.9 2000 NM_010510 ENSMUSG00000048806 2 2000 NM_009241 ENSMUSG00000029682 2.1 2000 NM_014783 ENSG00000175306 2.1 2000 NM_006056 ENSG00000171596 2.5 2000 NM_175619 ENSG00000182223 2.7 2000 NM_000994 ENSG00000144713 2.7 2000 NM_031922 ENSG00000135597 2.7 2000 NM_007260 ENSG00000011009 2.8 
2000 NM_011839 ENSMUSG00000046212 2.9 2000 NM_009531 ENSMUSG00000030094 3.5 2000 NM_011717 ENSMUSG00000024050 3.5 2000 NM_011291 ENSMUSG00000025922 3.5 2000 NM_000324 ENSG00000112077 3.6 2000 NM_026408 ENSMUSG00000024534 3.7 2000 NM_026065 ENSMUSG00000009550 3.7 2000 NM_010324 ENSMUSG00000025190 3.7 2000 NM_028381 ENSMUSG00000003208 3.8 2000 NM_033030 ENSG00000152430 3.8 2000 NM_003215 ENSG00000135605 3.9 2000 NM_172893 ENSMUSG00000038507 4 2000 NM_133867 ENSMUSG00000040600 4.1 2000 NM_027877 ENSMUSG00000024906 4.1 2000 NM_011937 ENSMUSG00000024437 4.2 2000 NM_024993 ENSG00000176204 4.2 2000 NM_002578 ENSG00000077264 4.3 2000 NM_021763 ENSRNOG00000010533 4.3 2000 NM_016957 ENSMUSG00000051601 4.4 2000 NM_172499 ENSMUSG00000041945 4.4 2000 NM_021470 ENSMUSG00000029130 4.9 2000 NM_177001 ENSMUSG00000047433 4.9 2000 NM_021836 ENSRNOG00000003678 4.9 2000 NM_011167 ENSMUSG00000021345 5.1 2000 NM_020610 ENSMUSG00000031019 5.2 2000 NM_026838 ENSMUSG00000031253 5.3 2000 NM_153270 ENSG00000185915 5.3 2000 NM_013255 ENSG00000128585 5.3 2000 NM_031817 ENSRNOG00000015326 5.4 2000 NM_172801 ENSMUSG00000050201 5.5 2000 NM_033072 ENSMUSG00000025409 5.6 2000 NM_177229 ENSMUSG00000047113 5.7 2000 NM_026776 ENSMUSG00000035120 5.7 2000 NM_027279 ENSMUSG00000041396 5.8 2000 NM_052828 ENSG00000137394 6.1 2000 NM_153189 ENSG00000106304 6.2 2000 NM_003117 ENSG00000106304 6.2 2000 NM_004490 ENSG00000115290 6.2 2000 NM_153784 ENSMUSG00000043782 6.3 2000 NM_011741 ENSMUSG00000037327 6.4 2000 NM_002192 ENSG00000122641 6.6 2000 NM_031249 ENSMUSG00000050614 6.6 2000 NM_016704 ENSMUSG00000022181 6.6 2000 NM_146140 ENSMUSG00000044528 6.7 2000 NM_026856 ENSMUSG00000049606 6.8 2000 NM_153572 ENSMUSG00000041298 6.9 2000 NM_024251 ENSMUSG00000030051 7.6 2000 NM_176831 ENSMUSG00000009768 7.6 2000 NM_172599 ENSMUSG00000037526 7.9 2000 NM_028539 ENSMUSG00000048409 8.1 2000 NM_025781 ENSMUSG00000031953 8.3 2000 NM_174924 ENSG00000169340 8.4 2000 NM_007003 ENSG00000101951 8.4 2000 NM_028604 
ENSMUSG00000019792 8.5 2000 NM_013166 ENSRNOG00000012460 8.5 2000 NM_153399 ENSMUSG00000040566 8.5 2000 NM_173084 ENSG00000175447 8.6 2000 NM_009412 ENSMUSG00000027506 8.6 2000 NM_005565 ENSG00000043462 9.3 2000 NM_145514 ENSMUSG00000038733 9.4 2000 NM_000319 ENSG00000139197 9.5 2000 NM_139063 ENSMUSG00000038982 9.6 2000 NM_009980 ENSMUSG00000030970 9.8 2000 ******************************************************************************** ******************************************************************************** SECTION II: MOTIF DIAGRAMS ******************************************************************************** SEQUENCE NAME E-VALUE MOTIF DIAGRAM ------------- -------- ------------- NM_148937 1e-19 46_[2]_195_<3>_17_[4]_967_[4]_687 NM_080688 9.5e-18 681_[3]_998_[2]_195_[3]_17_[4]_23 NM_032726 7.2e-17 73_[4]_1628_[2]_163_[3]_15_[4]_33 NM_006422 1.5e-06 94_[4]_71_[4]_105_[8]_1062_[2]_ 361_[8]_198 NM_011775 7.7e-06 624_[8]_512_[3]_141_[4]_86_[2]_ 74_[2]_456 NM_013455 0.0015 993_[2]_532_[8]_433 NM_009581 0.0045 117_[4]_262_[2]_1577 NM_022915 0.034 1378_[4]_250_[8]_71_[2]_236 NM_012490 0.044 147_[2]_694_[3]_618_[2]_40_[8]_417 NM_146726 0.075 1067_[3]_139_[2]_267_[8]_98_[4]_ 41_[2]_281 NM_003460 0.11 592_[3]_571_[8]_71_[4]_701 NM_029236 0.14 263_[8]_1004_[2]_661_[4]_7 NM_012069 0.2 141_[8]_153_[8]_732_[4]_243_[2]_ 117_[2]_108_[2]_378 NM_007868 0.2 493_[4]_136_[8]_522_[8]_89_[3]_ 4_[8]_362_[4]_39_[8]_45_[8]_138 NM_006805 0.21 281_[8]_1075_[2]_78_[3]_3_[8]_ 431_[4]_25 NM_012038 0.27 740_[8]_146_[2]_221_[8]_418_[3]_ 50_[8]_239_[8]_60 NM_176924 0.27 396_[3]_1539_[4]_21 NM_053967 0.28 148_[8]_19_[4]_799_[8]_969 NM_022484 0.31 111_[8]_178_[4]_108_[8]_1293_ [8]_224 NM_016106 0.4 673_[2]_899_[8]_47_[3]_275_[4]_20 NM_016163 0.4 673_[2]_899_[8]_47_[3]_275_[4]_20 NM_021475 0.57 634_[8]_527_[8]_364_[2]_102_[8]_289 NM_013462 0.74 333_[3]_255_[3]_593_[8]_146_[2]_589 NM_002227 0.83 582_[8]_356_[8]_911_[4]_86 NM_000717 0.97 167_[3]_131_[8]_772_[2]_251_[4]_593 NM_147080 1 
355_[2]_292_[3]_909_[3]_5_[8]_ 243_[4]_89 NM_001097 1.1 1490_[8]_489 NM_023662 1.1 705_[8]_239_[3]_630_[8]_47_[8]_ 124_[4]_148 NM_026576 1.4 435_[4]_299_[2]_565_[8]_367_[8]_248 NM_021757 1.6 436_[3]_117_[2]_304_[4]_1041_[8]_16 NM_019794 1.7 682_[8]_57_[3]_237_[8]_102_[4]_836 NM_019343 1.7 228_[8]_40_[3]_58_[2]_655_[8]_ 70_[4]_241_[3]_580 NM_144715 1.9 5_[3]_464_[4]_57_[3]_606_[8]_782 NM_010510 2 86_[3]_1326_[2]_153_[2]_277_[8]_74 NM_009241 2.1 208_[2]_1224_[3]_294_[4]_209 NM_014783 2.1 419_[3]_585_[8]_774_[2]_159 NM_006056 2.5 724_[2]_641_[8]_593 NM_175619 2.7 563_[2]_38_[4]_167_[8]_36_[8]_1110 NM_000994 2.7 327_[4]_87_[8]_302_[8]_944_[8]_254 NM_031922 2.7 396_[4]_58_[8]_92_[8]_253_[8]_ 418_[2]_676 NM_007260 2.8 711_[8]_915_[2]_332 NM_011839 2.9 111_[8]_61_[8]_414_[8]_491_[4]_ 211_[8]_213_[3]_64_[8]_286 NM_009531 3.5 90_[8]_729_[2]_1002_[3]_87_[2]_8 NM_011717 3.5 146_[8]_277_[8]_589_[4]_152_[2]_750 NM_011291 3.5 678_[3]_299_[2]_435_[8]_161_[4]_ 317_[4]_1 NM_000324 3.6 1220_[8]_55_[3]_105_[2]_557 NM_026408 3.7 28_[2]_151_[4]_27_[4]_227_[4]_ 311_[8]_220_[2]_904 NM_026065 3.7 107_[8]_176_[3]_286_[4]_735_[8]_610 NM_010324 3.7 605_[2]_42_[2]_1105_[3]_185 NM_028381 3.8 34_[8]_1159_[3]_6_[2]_433_[2]_284 NM_033030 3.8 189_[3]_218_[8]_1213_[8]_53_[2]_ 3_[4]_170_[3]_26 NM_003215 3.9 97_[8]_88_[3]_758_[4]_369_[2]_602 NM_172893 4 44_[3]_340_[2]_74_[3]_791_[3]_ 133_[3]_130_[8]_362 NM_133867 4.1 769_[3]_282_[2]_761_[4]_123 NM_027877 4.1 114_[8]_370_[2]_57_[8]_1295_[4]_78 NM_011937 4.2 418_[4]_623_[4]_913 NM_024993 4.2 98_[3]_552_[8]_192_[3]_111_[4]_961 NM_002578 4.3 434_[2]_359_[2]_704_[3]_440 NM_021763 4.3 788_[2]_130_[2]_361_[2]_173_[4]_462 NM_016957 4.4 421_[8]_591_[3]_430_[8]_473_[8]_1 NM_172499 4.4 686_[3]_176_[3]_34_[8]_806_[4]_212 NM_021470 4.9 621_[4]_380_[8]_256_[2]_244_[4]_411 NM_177001 4.9 548_[2]_1093_[3]_317 NM_021836 4.9 1433_[4]_61_[8]_462 NM_011167 5.1 175_[2]_314_[3]_577_[4]_584_[3]_ 160_[3]_83 NM_020610 5.2 1245_[4]_285_[2]_29_[3]_376 NM_026838 5.3 
8_[2]_55_[3]_1012_[3]_538_[2]_ 45_[2]_106_[8]_110 NM_153270 5.3 38_[8]_66_[4]_175_[3]_1026_[8]_609 NM_013255 5.3 141_[3]_111_[8]_1249_[4]_122_ [8]_81_[4]_187 NM_031817 5.4 72_[8]_1324_[8]_97_[2]_444 NM_172801 5.5 499_[3]_72_[2]_367_[8]_486_[2]_492 NM_033072 5.6 460_[2]_458_[3]_190_[8]_399_[3]_409 NM_177229 5.7 412_[2]_472_[8]_74_[4]_977 NM_026776 5.7 519_[4]_464_[2]_158_[3]_794 NM_027279 5.8 60_[2]_237_[8]_1023_[4]_615 NM_052828 6.1 814_[2]_215_[8]_137_[4]_737_[2]_11 NM_153189 6.2 352_[3]_1004_[8]_602 NM_003117 6.2 352_[3]_1004_[8]_602 NM_004490 6.2 677_[3]_153_[3]_257_[2]_576_[8]_253 NM_153784 6.3 120_[8]_1007_[2]_521_[8]_134_ [3]_134 NM_011741 6.4 1899_[2]_80 NM_002192 6.6 28_[4]_183_[3]_96_[8]_1046_[4]_ 202_[8]_126_[8]_147_[2]_21 NM_031249 6.6 502_[8]_158_[4]_247_[2]_97_[3]_ 473_[8]_72_[4]_321 NM_016704 6.6 587_[8]_93_[3]_219_[2]_432_[4]_583 NM_146140 6.7 443_[2]_103_[3]_254_[8]_99_[3]_ 23_[8]_386_[4]_98_[4]_443 NM_026856 6.8 397_[3]_368_[8]_269_[2]_170_[4]_ 199_[2]_457_[2]_12 NM_153572 6.9 97_[2]_458_[4]_196_[3]_861_[8]_ 102_[4]_156_[8] NM_024251 7.6 73_[2]_64_[4]_508_[8]_658_[4]_609 NM_176831 7.6 44_[8]_733_[3]_368_[2]_99_[8]_672 NM_172599 7.9 1166_[2]_279_[8]_391_[8]_101 NM_028539 8.1 226_[4]_1039_[2]_60_[8]_182_[8]_407 NM_025781 8.3 55_[2]_61_[8]_123_[8]_23_[4]_ 823_[2]_624_[3]_163 NM_174924 8.4 779_[3]_64_[4]_957_[2]_135 NM_007003 8.4 311_[4]_68_[2]_1483_[4]_71 NM_028604 8.5 577_[8]_511_[3]_310_[4]_356_[4]_158 NM_013166 8.5 605_[8]_749_[2]_604 NM_153399 8.5 39_[2]_895_[4]_1022 NM_173084 8.6 201_[2]_643_[8]_634_[3]_459 NM_009412 8.6 125_[8]_526_[2]_87_[2]_483_[8]_ 83_[4]_70_[8]_498 NM_005565 9.3 878_[8]_563_[2]_409_[8]_87 NM_145514 9.4 314_[3]_436_[8]_635_[8]_521_[8]_10 NM_000319 9.5 118_[2]_227_[3]_160_[8]_1250_ [4]_159 NM_139063 9.6 622_[4]_876_[4]_203_[2]_232 NM_009980 9.8 437_[3]_1109_[8]_62_[2]_308_[8] ******************************************************************************** 
******************************************************************************** SECTION III: ANNOTATED SEQUENCES ******************************************************************************** NM_148937 ENSMUSG00000026173 LENGTH = 2000 COMBINED P-VALUE = 2.87e-24 E-VALUE = 1e-19 DIAGRAM: 46_[2]_195_[3]_17_[4]_967_[4]_687 [2] 2.9e-13 ATGAAAAGCAGTGATTCACAG +++++++++++++++++++++ 1 GAATTGAGGGCAAAAACAGCGGTATTCCACACAGACTCTGAGCAGAATGAAAAGCAGTGATTCACAGCACACTGC [3] 1.9e-13 AGTGCCCTGGGCCTTTAAGAG +++++++++++++++++++++ 226 TTCTGCAACCCCACCCCCACAATTCCTCCTGTTGACCAGTGCCCTGGGCCTTTAAGAGTTGGATAGGCTGGACCT [4] 2.6e-13 AGGAGAGAGCTCACACCTCTCCC +++++++++++++++++++++++ 301 AGGAGAGCGCTCACACCTCTCCCCACCCCACTCCTCCTCTCTGGCTGTAACTCTGCCAGTTTACAGCAGCAAACT [4] 3.6e-05 AGGAGAGAGCTCACACCTCTCCC +++ +++ +++++ ++++++++ 1276 CACTTTATATGAAACAGGTTAGCCATAACCCTTCTCAGAAGTGAGAACATAGACAAAATAGCTCTCTTCCTTCCT NM_080688 ENSRNOG00000016361 LENGTH = 2000 COMBINED P-VALUE = 2.67e-22 E-VALUE = 9.5e-18 DIAGRAM: 681_[3]_998_[2]_195_[3]_17_[4]_23 [3] 9.1e-05 AGTGCCCTGGGCCTTTAAGAG ++ + + + +++++++++++ 676 TGTTACAAAGTTGTAATTCATTAAAAGATGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN [2] 2.9e-13 ATGAAAAGCAGTGATTCACAG +++++++++++++++++++++ 1651 AGATGAAATGAGGGCAAAAACAGTGGTATTCCACACAGACTCCGACCAGAATGAAAAGCAGTGATTCACAGAGCA [3] 4.9e-12 AGTGCCCTGGGCCTTTAAGAG +++++++++++++++++++++ 1876 CCTTTTCTGTCACTCCACCCCCACAACTCCTCCTGTTGACCAGTGCTCTGGGCCTTTAAGAGTTAGATAGGCTGG [4] 7.7e-13 AGGAGAGAGCTCACACCTCTCCC +++++++++++++++++++++++ 1951 ACCTAGGAAAGCGCTCACACCTCTCCCCACTCCACTCCTCTCTGGCTGTA NM_032726 ENSG00000115556 LENGTH = 2000 COMBINED P-VALUE = 2.04e-21 E-VALUE = 7.2e-17 DIAGRAM: 73_[4]_1628_[2]_163_[3]_15_[4]_33 [4 6. 
AG + 1 GAAGAATACTGTTGTATTCATGGGAACATTGAAACAGAATATTTGAGATCTAGACTCTCCCAGAAAATCCACTAT ] 7e-06 GAGAGAGCTCACACCTCTCCC +++ ++ +++++ ++++++++ 76 GAGTGATAACACCCCTCACAGATTACCCAAGAGGGAAGGAGGTTGGGCAGCATTAAATTGATGCAAATAGTGACT [ 2 A + 1651 GTGGAAACAGACACTATGGAAAGGAGCCAGATTAAATAAGGGCAAAAACAGCCGTATTACATGCAGGAACCAGAA 2] .9e-13 TGAAAAGCAGTGATTCACAG ++++++++++++++++++++ 1726 TGAAAAGCAGTGATTCACAGAGCACTGCAACTGAGGACACTCCCTTCACTTTGAGAGCTACTGCGGGATTGGGGT [3] [4] 4.2e-11 2.1e-1 AGTGCCCTGGGCCTTTAAGAG AGGAGA ++++ ++++++++++++++++ +++++ 1876 CCACCCCACCCCCACTACTCCCTCACATTGACCAGTGTCCTGGGCCTTTAAGAGTTGGATGGGCTGGACTGGAGA 2 GAGCTCACACCTCTCCC +++++++++++++++++ 1951 GAGCTCACACCTCTCCCCTTCTTACTGCTTCCCTCCGGCTATAACTTGCC NM_006422 ENSG00000111254 LENGTH = 2000 COMBINED P-VALUE = 4.16e-11 E-VALUE = 1.5e-06 DIAGRAM: 94_[4]_71_[4]_105_[8]_1062_[2]_361_[8]_198 [4] 3.6e-05 AGGAGAGAGCTCACACCTCTCCC ++ ++ +++++ +++++ ++++ 76 GATCAGCAGGACTGTCTCCAGCACGGAGCTGACACTGTTCAGCTGCCTCAACTGGTAGACGCTCCCAAGAAGTCA [4] 1.0e-11 AGGAGAGAGCTCACACCTCTCCC +++++++++++++++++++++++ 151 GAGGCAGCTGTCGGGGCGGAGGTGTCCATGACCTCCCCAGGACAGAGCAAAAACTTCTCCCTCAAGAACACAAAC [8] 2.3e-09 AACTTCAAAATGTCTCTTTGA ++++++ +++++++++++++ 301 CACTTATATTGTACAATACTTCAGACTGTTTTTTTTAAATACATAAAACTTTATGTTAAAAAACTCTATTAACAT [2] 3.7e-05 ATGAAAAGCAGTGATTCACAG + ++++ ++++ +++ ++ + 1351 CCAGGAAAGACAAATATATAAGGAGTTGCCAGAGTTGAGGAGGTCCTCCAAGAGACCCAGAGATACAGAAAGAAA [8] 4.9e-06 AACTTCAAAATGTCTCTTT ++ + ++++++ ++ ++ 1726 TCAGGATGATTTTGATTGAATAATTCAAAAGTATCTTCTGATTTTCATGGTTATAAAAGGTTAAAATGAATTCTT GA ++ 1801 GATAAACTGTGGCTTATTTTAAGAGAATAGTTTAAATAAGGAAGCTGCAAATTGTGACACAGACAGGGGACAGGC NM_011775 ENSMUSG00000030911 LENGTH = 2000 COMBINED P-VALUE = 2.18e-10 E-VALUE = 7.7e-06 DIAGRAM: 624_[8]_512_[3]_141_[4]_86_[2]_74_[2]_456 [8] 3.2e-09 AACTTCAAAATGTCTCTTTGA +++++ +++++++++++++++ 601 ATTTGAACCTCTTCCCAATTACAGAGATTAAAAATGTTTTTTGGAGGGGGGAGATCATATTTTCTAGTCAAAGGT [3] 4.0e-05 AGTGCCCTGGGCCTTTAAGAG +++++ +++ + +++++++ 1126 
AAAAATACACATGTCAAGCTTTGGCATCGAGAGGTGCTGTGGACATTTAAGAAGTTATTAGGCCATGGAGGATGA [4] 2.9e-05 AGGAGAGAGCTCACACCTCTCCC +++++ + +++ ++ +++ ++ 1276 ATGATGGGACCATCCAGGCTTGGACTTGAGCATCAACAGGAAGGTGGAAAACAATCCCAAATCATCCCCAAAGCC [2] 4.0e-05 ATGAAAAGCAGTGATTCACAG + +++ ++++++++++ ++ 1426 TGTGTCCAAGTAAGGGACTCAAACACAGAGGACTCTTAAGTGTACTGTCAGGTGGGCTGCAGCTAAACCCTATTG [2] 7.5e-10 ATGAAAAGCAGTGATTCACAG ++++++ +++++++++++++ 1501 AACATACATGACATGTGTCGGGTATAAGACTAAGGGATTCACAGAATCTCAGATAGGAGATTTACCTTCTCTTGT NM_013455 ENSMUSG00000022622 LENGTH = 2000 COMBINED P-VALUE = 4.27e-08 E-VALUE = 0.0015 DIAGRAM: 993_[2]_532_[8]_433 [2] 1.4e-05 ATGAAAAGCAGTGATTCACAG + +++++ ++++ ++++ +++ 976 CGGGCCTTGATGTGGAAAAAAAAAAACAGTTATTCTCACATATGGAGAAAGGAAGTCTGCTTTCTGATGCTTAGG [8] 1.7e-12 AACTTCAAAATGTCTCTTTGA +++++++++++++++++++++ 1501 ACTCAGCAGTGGGCAAGAAGGTAGGCTTTGAAGTCATAAGTCAAGCAACTTCAAAATGGCTCTTTGAAAGTGTTC NM_009581 ENSMUSG00000042554 LENGTH = 2000 COMBINED P-VALUE = 1.26e-07 E-VALUE = 0.0045 DIAGRAM: 117_[4]_262_[2]_1577 [4] 2.6e-09 AGGAGAGAGCTCACACCTCTCCC ++++++++ ++ ++++++++ ++ 76 CACACTTTCGTTCTGAATTTACGTTTATAGGGATGAAACTTCAGGAAAGAAAAGACACTTCATAGTGTAGAAACA [2] 5.9e-09 ATGAAAAGCAGTGATTCACAG + ++++++++++++++++ ++ 376 NNNNNNNNNNNNNTCCCAGAAAGCATAAAAGGAGCAAGTGATTCATAGACTGCTTGCTTTCTGCAGGTGCAAACA NM_022915 ENSG00000135900 LENGTH = 2000 COMBINED P-VALUE = 9.57e-07 E-VALUE = 0.034 DIAGRAM: 1378_[4]_250_[8]_71_[2]_236 [4] 5.2e-06 AGGAGAGAGCTCACACCTCTCCC ++++++++ +++ ++ + ++ + 1351 AAAAATTGTACTGTATTAACAGTTTGGTAGGAAAGCAATAGAATTACATTGTTAAAGAGTGTTAACATAAAGAAG [8] 1.1e-08 AACTTCAAAATGTCTCTTTGA +++++ ++++++++++ +++ 1651 TAAATTAAAAATGTTTTCTTTTTTTGAGGATTTAAATGTTTCCTTAACCTGAAAGCGAAACAGATTTTTGTTTGT [2] 5.3e-06 ATGAAAAGCAGTGATTCACAG +++++++++ ++ + ++++ 1726 TTTGCTGTAACGAGAGAGGAAGGAACAAGAGAGTGACAGGGTGAAGCAACTTCTCCCAGAAACTTTTGGGCGCAA NM_012490 ENSRNOG00000013584 LENGTH = 2000 COMBINED P-VALUE = 1.25e-06 E-VALUE = 0.044 DIAGRAM: 147_[2]_694_[3]_618_[2]_40_[8]_417 [2] 9.4 ATG ++ 76 
TTACTGGAGATATGCTTATAGCCCAAGAGGGAGGATGAGACCTTCTAGTCACTTACAGAGGAAAGACAAAACCTA e-05 AAAAGCAGTGATTCACAG ++++++++++ +++ ++ 151 GAAAGCAGTGCTTCTCAACCTTCCTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN [3] 4.2e-05 AGTGCCCTGGGCCTTTAAGAG ++ +++ ++++++++ + + + 826 AGAACGTGTGACAAGGTGCCACGCAGAAGAAAGAAATAAGACCTTGGGCCCTGATGTGGAAAAGAACAGTTCTCA [2] [8] 6.0e-05 7.2e-11 ATGAAAAGCAGTGATTCACAG AACTTCAAAATGT +++++ + +++++ + +++ +++++++++++++ 1501 TCAAGAAGTATGTGATACTCAGAAATGGGCAAGAAGGTAGGCTTTGAAATCATAAGTCAAGAAACTTCAAAATGG CTCTTTGA +++ ++++ 1576 CTCCTTGAAAGTGTTCTGGTAAACCCCCCCCCCCCCACGCCCCTGTACTGATGATCCCATCAGTCAGCAGAACCC NM_146726 ENSMUSG00000047041 LENGTH = 2000 COMBINED P-VALUE = 2.12e-06 E-VALUE = 0.075 DIAGRAM: 1067_[3]_139_[2]_267_[8]_98_[4]_41_[2]_281 [3] 5.6e-05 AGTGCCCTGGGCCTTTAAGAG + + ++++++ +++++++ 1051 AGTTGTCAACAATTACTATTTAGATGGGCTCTTAAAATAATCTTTTATGGAACTCATGAGTTGGTGCCAAACAAC [2] 8.0e-05 ATGAAAAGCAGTGATTCACAG + ++++++++ + + ++++ 1201 TGGGACTAGTGAGATCATGGACCTTAGAAAAGAAGCAATTTCCGACACTTTGCTAGGCCAGCATTAATTCCTTAC [8] 5.0e-05 AACTTCAAAATGTCTCTTTGA + ++++++++ + +++ ++ 1501 AGGCTTTTGTAATCATGTTTCAAAATTTATTTCTTTAATGGTTATTCAAATTTTTGAACAATGGGCTTAGGTAGA [4] 2.3e-07 AGGAGAGAGCTCACAC ++++++++++ ++ 1576 AGCATAGGGACTGAGCAGCTGCTTAATTAGAGAATAGCAGTGACTTGAACTCTCAGAAATGGACAGAGAAGCAAG [2] 1.4e-07 CTCTCCC ATGAAAAGCAGTGATTCACAG + +++++ + +++++++ +++++ ++ ++ 1651 CACACACACAATATAAAACTGAACAGAATGGTAAGTGGCAGAACAGAGAGAAAAAGAGGTGATACATAGAAAGAC NM_003460 ENSG00000103310 LENGTH = 2000 COMBINED P-VALUE = 3.14e-06 E-VALUE = 0.11 DIAGRAM: 592_[3]_571_[8]_71_[4]_701 [3] 1.5e-09 AGTGCCCT ++++++ + 526 TTAGTTATGATATACAATGCAACTTGATAAAGTCACACAGAATTTTGAATCTAAAACTATTTAGCAGAATGCCTT GGGCCTTTAAGAG ++ ++++++++++ 601 GGATCCTTAAAAGAAGGTCCAAAGAAGGCTTTTGATGAGACAATTTGAGCTTTTGACCCTATTACTCTAGAAATG [8] 7.4e-06 AACTTCAAAATGTCTC +++++ ++++ ++++ 1126 AGCTGACATTTAAATCCAGCCACTAGAAGCTTTCATCACTTATTCTTCCTGCCATACGTAGATTTGAAATATTTC TTTGA +++ 1201 CTTGCAAAAATTTTAGACTCAAGTTTTCTAGAATACAAGTGAATTTCCATATATCTTGAAGGTCTCTAAGAGCTG [4] 6.3e-05 
AGGAGAGAGCTCACACCTCTCCC ++ + + ++++ +++++ +++ 1276 CAGTATAAAGCAGACACTAGTCATGTTGATGACATAATTACTTAGAATAAGACTGGCTACCCTCAGGACGAAGGC NM_029236 ENSMUSG00000037525 LENGTH = 2000 COMBINED P-VALUE = 3.92e-06 E-VALUE = 0.14 DIAGRAM: 263_[8]_1004_[2]_661_[4]_7 [8] 7.8e-06 AACTTCAAAATGTCTCTTTGA ++++ +++++++++ +++ 226 TCTTTTCCCATCCTTCCATTCCCACTGTCTCCTAACCCTGATTTAAAATGTTTGGTGGCCCTCAAACTGAACATG [2] 1.0e-05 ATGAAAAGCAGTGATTCACAG + ++ +++++ ++ + ++ ++ 1276 AAATATATAAACTAAGACAGCCAAGGCTACATAGATACCTTGTCTTAAAAGACAAACAAATAAACAAAAAGTCCC [4] 6.6e-09 AGGAGAGAGCTCACACCTCTCCC ++++++++ ++ ++++ +++++ 1951 AGAAACCAGATCAGATTTTAAGGAGAGCAAAGAAACGACACAGCCTTTTT NM_012069 ENSG00000101892 LENGTH = 2000 COMBINED P-VALUE = 5.75e-06 E-VALUE = 0.2 DIAGRAM: 141_[8]_153_[8]_732_[4]_243_[2]_117_[2]_108_[2]_378 [8] 3.3e-06 AACTTCAAA + +++++++ 76 ACCTGTGCCCAAGAAATGAGCATGTAGTTTTCTAGACTTTAATAATGTGCCCCCGAATCTGCTGTTATATTCAAA ATGTCTCTTTGA + ++++ +++ 151 GTCTCTCCTTTCTCCCTCCCCTTGTCCCTTTGTAGCCACTGCTTCATTTGCATCCGCATTAGCCACTTGTGGCTC [8] 7.5e-07 AACTTCAAAATGTCTCTTTGA ++++++++++ + +++ +++ 301 TAGTGTTTTGAGTGTAAATTCATACAGATTTCTGTCTTCAACATCCTATGGGATTTGGTTATTTTACCCCACCCA [4] 7.1e-07 AGGAGAGAGCTCACACCTCTCCC +++++ ++ ++++++++ ++ ++ 1051 ATGGGGTTTGATTTGTGAAGGAAGGATCACAAACTCCATCCAGGCCTTTTAGCCACCCAGGCTCCAGTAGGGTGG [2] 3.7e-06 ATGAAAAGCAGTGATT +++++ ++++++++++ 1276 GGATAAATATAGATCTCAACCACTACAGAAATTTGGCCAGGATTTGGAGGGGTGTGTGTATGAGGGGCAGTGACT CACAG + + 1351 GATGGGGCCACTCTGACATGTGGGCTAGAAAAAGGCTTCCTTATGTCCAAGACCCTCTTTGGGGCAGGGAGGAAT [2] 7.7e-06 ATGAAAAGCAGTGATTCACAG + +++ ++ +++++ ++++ 1426 GAAGTAGGAGCCTGGCTTCTAAAATCAGAAAGAATAATCCACATTTTAAGAAGCTCATGGATTGACACAAATAAG [2] 3.7e-05 ATGAAAAGCAGTGATTCACAG + ++++++++++ ++++ + 1576 TCCTTTTGCATGAGATCAGCCTTATGAAAAAAGCAAGTTATTCTTGGCCATCTCCCCTCACCCCCAACTCCCACC NM_007868 ENSMUSG00000025065 LENGTH = 2000 COMBINED P-VALUE = 5.78e-06 E-VALUE = 0.2 DIAGRAM: 493_[4]_136_[8]_522_[8]_89_[3]_4_[8]_362_[4]_39_[8]_45_[8]_138 [4] 2.2e-05 AGGAGAGAGCTCACACCTCTCCC +++++ + + +++++++ + 451 
TTCTACCTTGCACACAAACTTCTGCATTTAAAGTGACTGTGGCTGGAAACATGAGCCACTTCTTCAAGATTACTA [8] 1.0e-07 AACTTCAAAATGTCTCTTTGA +++++++ ++++ +++++ + 601 AGGGGAGGGAAGGAATTTTTTTTCAGTCACACAGATCAATTCAATTTGTCGGAGCTTCAGAATGAATTTTTAAAT [8] 5.5e- AACTT +++ 1126 CTATGCAACAGGATCAGTGCTGTAGTCCCCGGTTCAAGCTGAAAATGTTGCACAGGAAGACATATCATGTAAAGG 05 CAAAATGTCTCTTTGA ++ +++ ++++++++ 1201 TATTATGATTTTTTTATCTGTGTCTATACAAACAGAACTTAATGTCATAACTCTTGTCATAACCCTTACCAAGGT [3] [8] 1.1e-06 6.3e-05 AGTGCCCTGGGCCTTTAAGAG AACTTCAAAATGTCTCTTTG ++++ +++++ ++++++ + ++++ ++++ ++ ++++ 1276 CCTATGGACTCTTTACCTGTCAAGTCTGAAAATATGATGGTATTTTAAATGGGAATGCTTAGTAATCATTGTTTG A + 1351 ATGTTGATGTATCTATGATGCCATAGTACCAAGGTGAACTGCTGTCCGTTTTCAAGAAGATGCTCTTGTCTCATT [4] 1.9e-05 AGGAGAGAGCTC ++++++ + +++ 1651 AAATTATAAAATGTGCTATCTGTATATTAATTCTATTATATAATTATAATAACTTGGCAAGGCAGGACACCAAAA [8] 5.6e-05 ACACCTCTCCC AACTTCAAAATGTCTCTTTGA +++ + ++++ ++ + +++++ ++++++++ 1726 ACATTATTCAGGACATATTTTCTTGTTAGTTGCCAAGATAGAATTTCTTTGAACTAATAATTTTTTTTTTTTTTT [8] 3.9e-05 AACTTCAAAATGTCTCTTTGA +++++ +++++ ++++ + 1801 AGTTCTTCAAATTCACATTCCCAGCAAGTCTCTTTGTTTGGTAATTCTGAATGTGTTTTATTGCTTTCTGGTAGG NM_006805 ENSG00000177733 LENGTH = 2000 COMBINED P-VALUE = 5.92e-06 E-VALUE = 0.21 DIAGRAM: 281_[8]_1075_[2]_78_[3]_3_[8]_431_[4]_25 [8] 2.5e-05 AACTTCAAAATGTCTCTTT ++++++ ++ + + ++++ 226 TAAAAAATGGTTCNNNNNNNNNNNNNNNNNNNNNNNNNNNGCCCAATCTTTCAGAAAGCTTCTTAGTATAGCTTT GA ++ 301 GAGCCTTCAACGTCAACATGTAAGTCAAATATTTAAGTTAAATATTAACATCAAGTTAAGCATACTAGATTAAAA [2] 6.2e-05 ATGAAAAGCAGTGATTCACAG +++++ ++ + +++ ++++ 1351 GACCTCTTAAGTTTCTCTAAGTACACATCAAGAATAAAGAATTAACACGTAAAGTCTTAAACAAATATCCCTTAA [3] 2.0e-06 AGTGCCCTGGGCCTTTAAGAG ++ ++ +++ ++ +++++++ 1426 AAAACGGTACGGAATGGATCCTAGAAAAAAAATGTTAGACATGTACGGTCAAACACAATGATTTATTAAAAATAA [8] 1.6e-07 AACTTCAAAATGTCTCTTTGA +++ + ++++++ ++++++++ 1501 AACGTAAAAATGATTTTTGTACATATGCTTCCAAATTTCAGGCATGGGATCCAAGTAGATTTCATAGAAAACGCT [4] 1.5e-05 AGGAGAGAGCTCACACCTCTCCC +++++++++++ + ++ + ++ 1951 GTTGGAAAGAGCTACCCCTATAGCCACTCCCAGGCATTTTAAATTTTCTT NM_012038 
ENSMUSG00000019005 LENGTH = 2000 COMBINED P-VALUE = 7.63e-06 E-VALUE = 0.27 DIAGRAM: 740_[8]_146_[2]_221_[8]_418_[3]_50_[8]_239_[8]_60 [8] 3.4e-06 AACTTCAAAA +++ +++++ 676 GAGGACACACATGGCTTTTCTTCAGTGCTGAGCGGGACAGTCTACACTAATAAAAAAGAGGACTAAAAAGCAAAC TGTCTCTTTGA +++ ++++++ 751 TGGATTTTTGTTCACACAGCATCTACATCTTCTAAAAAGAACGAGTGGCATGGAGGAGCCGAGCAGTGGTGGCAC [2] 1.0e-07 ATGAAAAGCAGTGATTCACAG + ++ ++++++++++ +++++ 901 GGGTTCCAGGATAGCCAGGGATACACAGAAAATCCTGTCTCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN [8] 6.7e-05 AACTTCAAAATGTCTCTTTGA + ++ + ++++++++++++ 1126 CTATTCCCTGCTCTTATCTTGCTGTGTTTTCTTCTGTTTCTTGGAGTTTGACATTCATGAGTGTGTCGTCTGTCT [3] 1.0e-05 AGTGCCCTGGGCCTTTAAGAG ++++++ +++ +++ ++++ 1576 GCACTCCATTAAAAGTGCTTTGGATCAAGGAAAGAATGCTGCAGTGTTTGATAAAAGGAAGCATGGGCTCTACAC [8] 1.3e-05 AACTTCAAAATGTCTCTTTGA ++++ +++ + ++++ ++ 1651 AAGTTAGTTTAATTAAAAGGGATTTTGTGTTTTTGTTTTGTGGTCACAGCGCCTGCCCCTGAGAGGCTTAATAGC [8] 6.1e-05 AACTTCAAAATGTCTCTTTGA ++++ ++ + +++++++ 1876 GTCAGTTTTATTCCATCCTGAAAAATAAAAAGTAGCATGCCTTTTACTTTGAATTTAATCTTGGATGACCCCACT NM_176924 ENSMUSG00000047730 LENGTH = 2000 COMBINED P-VALUE = 7.76e-06 E-VALUE = 0.27 DIAGRAM: 396_[3]_1539_[4]_21 [3] 1.2e-05 AGTGCCCTGGGCCTTTAAGAG ++++ +++++ + +++ +++ 376 GAAATTTTATATTTAATTTTTAATAGCATGGATTTTTTAAAAATGTCTGTTGCAACTAGTTCAGAGACTTCAAGC [4] 2.0e-09 AGGAGAGAGCTCACACCTCTCCC ++++++++ +++++++ ++ + 1951 GTCTGAAGGAGAGATCACACACGAGTCTGACCACTGTCACCTCTCTGCAG NM_053967 ENSRNOG00000007231 LENGTH = 2000 COMBINED P-VALUE = 7.90e-06 E-VALUE = 0.28 DIAGRAM: 148_[8]_19_[4]_799_[8]_969 [8 2. 
AA ++ 76 NNNGTTCTTTTTACATAGTGTATTCTGCTGTCACTTCCCCCTCCAACTCCTCCCAGATCTTTCTCACATACCTAA ] [4] 7e-10 5.6e-05 CTTCAAAATGTCTCTTTGA AGGAGAGAGCTCACACCTCTCCC ++++++++++++++++++ ++ ++++ ++++++ +++++ ++ 151 CTTCATACTGTCTCTTTTTCTTCCTCTCTTTAAAAAACAGAAGAGTGAAAAACCTTCAAACTNNNNNNNNNNNNN [8] 5.3e-05 AACTTCAAAATGTCTCTTTGA +++ ++++++ +++ +++ 976 TATAGAAGATGGTAGGGGCAAGGGACACTCTTACTAACAATATAATGATTTATTGTTTTATTAGATTTATTATTG NM_022484 ENSG00000146802 LENGTH = 2000 COMBINED P-VALUE = 8.65e-06 E-VALUE = 0.31 DIAGRAM: 111_[8]_178_[4]_108_[8]_1293_[8]_224 [8] 1.4e-05 AACTTCAAAATGTCTCTTTGA ++++ + +++ + +++++ 76 AATCTACCCACTAATGCATTAAATATAAGTGCATTTTAATTAAGACTTTAATTTTTTTAAAGCATAACTGGTCAT [4] 1.7e-05 AGGAGAGAGCTCACACCTCTCCC ++ ++++++++ +++++ + + 301 TGAAAAGTCATGGGGAGAGAAAGAACTTGGCTCCTTATTTATTTTGCAACAGTGTGTATGCTATTNNNNNNNNNN [8] 6.6e-10 AACTTCAAA +++++++++ 376 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTAAAATAAATAAATAATGGTTAAAAAAAGAAAATAAATTCATA ATGTCTCTTTGA ++++++++++++ 451 CTGTTTTTTTTATTCTATTGAAAAAAAAGATGATGTTTTTAGAGCTAGTGGTTTTGTGGTTTTAGCAAGGTATCA [8] 5.0e-05 AACTTCAAAATGTCTCTTTGA ++++ + + + +++++++ 1726 CAAATAGCTGAGAAATTTAAAAGGAGATTTTACTTAACATTAACTTTTTTTAAGCTAAAAGCTCATAATTTTAAC NM_016106 ENSG00000092108 LENGTH = 2000 COMBINED P-VALUE = 1.12e-05 E-VALUE = 0.4 DIAGRAM: 673_[2]_899_[8]_47_[3]_275_[4]_20 [2 7. 
AT + 601 NNNNNNNNNNNNNNAGAGATTATGTAATAAGATAACCTGTTTTAAATGTCACAAAAATAAATTTTTATTAATGAA ] 3e-08 GAAAAGCAGTGATTCACAG +++++++++++ +++++++ 676 AAAAGCCAGTGTTTCACACTATAAAACAATATATAATCGTTAGGCTAACCATTTCATTTATATAATTGTGAAAGT [8] 3.2e-05 AACTTCAAAATGTCTCTTTGA +++ + ++++++++ + +++ 1576 CACTGAGAAACGAGAGCTAACGTTAAAATGTTGCCTGTGTTTCAGGAGCTTCATTTACTTCAGTCCTGTGGTCCT [3] 1.9e-05 AGTGCCCTGGGCCTTTAAGAG ++ + +++ ++++++++ + 1651 TATACTCCTAGAGCATTATCACCCTTTAAGTGCTCTGTAAGTCACCAAACCAACAGTTCAGTAATTAGGTTCTTA [4] 1.4e-05 AGGAGAGAGCTCACACCTCTCCC ++ +++ +++ +++ +++ ++ 1951 TCCGGGGCGGTAAGGGCAGCCACGTCATCCCCCCGCTCCGCTCCCCAGCC NM_016163 ENSG00000092108 LENGTH = 2000 COMBINED P-VALUE = 1.12e-05 E-VALUE = 0.4 DIAGRAM: 673_[2]_899_[8]_47_[3]_275_[4]_20 [2 7. AT + 601 NNNNNNNNNNNNNNAGAGATTATGTAATAAGATAACCTGTTTTAAATGTCACAAAAATAAATTTTTATTAATGAA ] 3e-08 GAAAAGCAGTGATTCACAG +++++++++++ +++++++ 676 AAAAGCCAGTGTTTCACACTATAAAACAATATATAATCGTTAGGCTAACCATTTCATTTATATAATTGTGAAAGT [8] 3.2e-05 AACTTCAAAATGTCTCTTTGA +++ + ++++++++ + +++ 1576 CACTGAGAAACGAGAGCTAACGTTAAAATGTTGCCTGTGTTTCAGGAGCTTCATTTACTTCAGTCCTGTGGTCCT [3] 1.9e-05 AGTGCCCTGGGCCTTTAAGAG ++ + +++ ++++++++ + 1651 TATACTCCTAGAGCATTATCACCCTTTAAGTGCTCTGTAAGTCACCAAACCAACAGTTCAGTAATTAGGTTCTTA [4] 1.4e-05 AGGAGAGAGCTCACACCTCTCCC ++ +++ +++ +++ +++ ++ 1951 TCCGGGGCGGTAAGGGCAGCCACGTCATCCCCCCGCTCCGCTCCCCAGCC NM_021475 ENSMUSG00000022057 LENGTH = 2000 COMBINED P-VALUE = 1.62e-05 E-VALUE = 0.57 DIAGRAM: 634_[8]_527_[8]_364_[2]_102_[8]_289 [8] 3.1e-08 AACTTCAAAATGTCTCTTTGA +++++ +++ +++ ++ +++ 601 TCAGTGTGTGTGTGTAATCTATCCAATAATTCATAAATTAATAGTGGATTCTTGCCTTTCCGGTTATGCACATCC [8] 1.9e-05 AACTTCAAAATGTCTCTT +++ + +++++++++ 1126 ACTGATAGATCAAGAACGAATTGAGGAACAAGACCTTAGTATCAGACCCACCCTCTGTACTCAACAATGGCTCTC TGA +++ 1201 TGAACTATTTTATGACTCTAGTAATAACTACCTACCCTAAATAATTTCATAGAAGACATCACATTGTTGATTTGG [2] 2.0e-06 ATGAAAAG +++++++ 1501 GGACTCAACAGATTATATATAGTAATGTATAAGCATAAAAATGTACATATGTGCGCAATAATAATTGATAAAAGA CAGTGATTCACAG ++ +++ +++++ 1576 
AAAGGAGACACAGTTTTGAAGAAGAGTGGGGAAGGATATATAGGAAAGTTTGAAGGGAGAGAAGGGAAGGTAGAG [8] 3.1e-05 AACTTCAAAATGTCTCTTTGA +++ +++++ ++ +++++ 1651 ATATTGTAATTAAAATATAATTTCAAAAATAAGCAACAACAACATCAAAGTGAGCCTTGTCATGATAAAATCTAT NM_013462 ENSMUSG00000031489 LENGTH = 2000 COMBINED P-VALUE = 2.08e-05 E-VALUE = 0.74 DIAGRAM: 333_[3]_255_[3]_593_[8]_146_[2]_589 [3] 8.3e-05 AGTGCCCTGGGCCTTTAAGAG ++++++++++++ +++ + 301 TCTTTGCTCTTGGAGTTAAAAAAGAGCCAGTGCAATACTCTGGTCACTTCCATAGAACCTCCCCACACCCCACCC [3] 5.6e-07 AGTGCCCTGGGCCTTTAAGAG +++++++++ ++++++ + 601 TCTCAAACCAGTGCCCTGAACCTTTAGATACACAGGCACACAAAACTGAAGCAAAGTGTTTGCTGATGTCAATGA [8] 1.4e-06 AACTTCAAAATGTCTCTTTGA +++ +++++++ ++++++++ 1201 CCATAGTGAGGAGACAGGGATGAAACCTCAAACTTTTTTTTTTTCCTGGATTCCTGCTCTCAAACTAACCAGAGA [2] 2.7e-05 ATGAAAAGCAGTGATTCACAG + + +++++++++++ +++ 1351 GGAGGGAGGGGTGCTCTCTTTTCCCTAAGCATACAGCACAAACATAGGCAGGGACTGGCAGGCCTCCAAGGCCAT NM_002227 ENSG00000162434 LENGTH = 2000 COMBINED P-VALUE = 2.34e-05 E-VALUE = 0.83 DIAGRAM: 582_[8]_356_[8]_911_[4]_86 [8] 7.0e-05 AACTTCAAAATGTCTCTT +++ ++++++++ +++ 526 AGTGGTGTTTGTTTGTTTGTTAGTAGTAAGGACCGTAACCCACCCCTTCCTCCCCCCAAAACTATACTGTTATTT TGA ++ 601 TTCTATGTAAGCAAATTGTTTGAAATGCTTTGAAAATGAAATCTTAATATTAAAAGGCATATGCATGTAAAGTCT [8] 1.5e-05 AACTTCAAAATGTCTC ++++ +++++++++ 901 TAATCGTTCACATTGAATCAATGACTAAACATTTTTGATTACCCAGCTACCTCCAAGCAAACTGAAAACTGTCTA TTTGA ++++ 976 GTGGATCCTGAAGTCCATAGTGCCTCTAGCCGGGTCTTTCAAGTGTTGCACCACAGGGTGATGATTGATGGTAAA [4] 2.3e-09 AGGAGAGAGCTCACACCTCTCCC +++++++ +++++++++++ + 1876 GAGTTCAGTGACTTTTTGGACAGAACACAAACTTCTTTCATTAGAAATTTTTAAAAAATGACTTGGGCATTTGTT NM_000717 ENSG00000167434 LENGTH = 2000 COMBINED P-VALUE = 2.75e-05 E-VALUE = 0.97 DIAGRAM: 167_[3]_131_[8]_772_[2]_251_[4]_593 [3] 1.1e-05 AGTGCCCTGGGCCTTTAAGAG + ++ +++ +++ ++++++ 151 GCTCTGTGGTGCCCATAATTAAGATGCACCCATAAGAGAGTGTATAGCTTTACCCCTCCCGCATTCCTCAGAGGG [8] 1.0e-07 AACTTCAAAATGTCTCTTTGA +++++ +++ +++ ++ ++++ 301 GGGGGGCCTGAATGCAATGAACTTTAAAGTGGATCATTGATCAACAGTCCCTTAACCCCACCAAAGTTTCCCAAG [2] 8.0e-05 ATGAAAAGCAGTG 
+++++++++++ 1051 CCAAATGTTGGAGCCGCCCTTAAGGGAACTTCCCCTGGACTCCTCTGGAGGTCTTGCCAGATGGGAGAACCAGGG ATTCACAG + ++ + 1126 GTCCATTCAGACCCCAGTCTCTTCTCTTCCCATCTCATCTCTCCACCTGGGGGGCTCAAAGAACCTAGCTCCTCC [4] 2.2e-05 AGGAGAGAGCTCACACCTCTCCC ++++++ ++++ + + ++++ 1351 GTCTGCTTAGAAATTAAGTTTGGAGCAGCCTGGGAGGAAATAGAAGAGATGAGACCCAGAGAATCAATGGCCACT NM_147080 ENSMUSG00000044506 LENGTH = 2000 COMBINED P-VALUE = 2.87e-05 E-VALUE = 1 DIAGRAM: 355_[2]_292_[3]_909_[3]_5_[8]_243_[4]_89 [2] 6.4e-06 ATGAAAAGCAGTGATTCACA ++++++++ +++++ + + 301 ATGAAAATAAGTATTTAAAAGCATCCAGAATCAAAGAATAAAAATAATTAGTAATGGAAGAGGAATTGATTGAAA G + 376 CTGAAACTAGTCATTGCCCTCTAATGTCCATTGAACAAAAGACATTTATAAGTATAGAAGAATAATTCATTAAAA [3] 5.6e-07 AGTGCCC +++++++ 601 TGTAACCTTCTAATTCTTTATCAATAATTATTAGATATTATTATGGAATTAATGGAATTAACATAAAGAATGCTA TGGGCCTTTAAGAG ++ ++ ++++++ + 676 TGATTTTTTAAAGGTTTTATTTATTTATTGTATATAAATATACTGTAGCTGTCTTCAGACACTCCAGAAGAGGGC [3] [8] 2.7e-05 2.9e-05 AGTGCCCTGGGCCTTTAAGAG AACTTCAAAATGTCTCTTTGA + ++ + + ++ +++++++ ++++ + ++++++++++++ 1576 ATGTTTGCTTGTTTGTTTGTTTGATTGTTTTAATCTATTAAGACTTTGAAACTGTCTGATGTTTTTTGTATGCAC [4] 1.8e-05 AGGAGAGAGCTCACACCTCTCCC +++ ++ + ++++++ ++ + ++ 1876 CACTATCCTTCACAGGTAATATCTAACATTTGTTAGAACATCCTGAATGTGACATTTAACGTTTGCTTCTTTCCA NM_001097 ENSG00000100312 LENGTH = 2000 COMBINED P-VALUE = 2.98e-05 E-VALUE = 1.1 DIAGRAM: 1490_[8]_489 [8] 3.0e-10 AACTTCAAAA ++++++++++ 1426 TAATAGGAGACCTTGTGATATTCAGCAACGGACAGGAAGGTGGGCTTTGCAGTTGTAAATTAGGAAAATTCAAAA TGTCTCTTTGA ++ ++++++++ 1501 TGACTCTTGGAAAAGTGTGTTGATAGCATTCACTTGGAAGAGGAAAAGAAAACTTCCCCAACAACAATTAAGGAT NM_023662 ENSMUSG00000031592 LENGTH = 2000 COMBINED P-VALUE = 3.09e-05 E-VALUE = 1.1 DIAGRAM: 705_[8]_239_[3]_630_[8]_47_[8]_124_[4]_148 [8] 3.0e-05 AACTTCAAAATGTCTCTTTGA ++ ++ ++ +++ ++++ +++ 676 GTTGAATTTTATGTCACTATTTTTAGATAAAATTTTAATATGATTCTAGTAAAATTATCTGCCTGTTAAAGTTTA [3] 1.0e-05 AGTGCCCTGG ++++++++ 901 CGGCCTCTGAGCCTCTGCTTCCGTGTCAGCTTCTATTTCTCCTCTCTGATCCAGGCATCATATTTAGTACCCTTA GCCTTTAAGAG ++ ++++ + + 976 
GTTTTTATATGTATATGTACTCCATGGGGTGTATTGCTAGGGTTGTCCGCACTTCTAATTCTAGTGCTTGGGGCT [8] 1.2e-05 AACTTCAAAATGTCTCTTTGA + ++ +++ +++ ++ +++ 1576 TATAATGTGTATATCATGTAGATCAATTTATCTGTGATACGTGTTTGATAGTGTATTCTTTTATATTTTTGGTTG [8] 5.0e-06 AACTTCAAAATGTCTCTTTGA ++++ +++ +++ ++ ++++ 1651 TGAGCCTAGCCTTTAACAGCTGAGCCATCTCTCCAGCTCGATAGTGTATTCTTTAAGATAAGTGTTTGAAAGATT [4] 1.9e-07 AGGAGAGAGCTCACACCTCTCCC ++++++++ ++ ++ ++ + ++ 1801 AACCTGATGACCTCAGATCACACAGTGGCAGGAGAGAACTGACTCCAGATAGTTGCTCTGACCTCTGCACACATG NM_026576 ENSMUSG00000016984 LENGTH = 2000 COMBINED P-VALUE = 3.89e-05 E-VALUE = 1.4 DIAGRAM: 435_[4]_299_[2]_565_[8]_367_[8]_248 [4] 2.0e-05 AGGAGAGAGCTCACA ++++++ ++++++ 376 AGAGGGGTAGCCTGCTGGACCAGCACATCTCCACTATCCAGAAGTGTTTTTATGTTTCCTTAGAAAGATAACACA CCTCTCCC ++++ 451 AGTCACTTAGTATCAATTTCAGTGAGGATGGTCAGGTATAAATTTCAAAAAAAAAAACCAAAAAACAAAAAAACA [2] 2.4e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 751 GAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCGAAAAACCNNNNNNNNNNNNNNNNNNNNNNNNN [8] 3.2e-07 AACTTCA ++++ + 1276 TGGAAAGATTAGTTAGTTCTTTAAGATAAAGAATGTATGTAAGTACTGCTAACAAGCAGCCTTTAATGTACTTTA AAATGTCTCTTTGA ++ +++++++ ++ 1351 AAGTGTTTCTGTGCCTTCCAGTTATTTTAAGTACGAACAGACTGAAGCAAGTCATGCAACAGTTAAGATTGTCTC [8] 6.1e-05 AACTTCAAAATGTCTCTTTGA ++ + ++++ + +++++++ 1726 ATTACTTACATAATAAAGCTTTTTTTTTCTTTTTGCTAAACTAGACTTAATGCCTAGTCTGTGGGTTTCCTAACA NM_021757 ENSRNOG00000006655 LENGTH = 2000 COMBINED P-VALUE = 4.60e-05 E-VALUE = 1.6 DIAGRAM: 436_[3]_117_[2]_304_[4]_1041_[8]_16 [3] 2.4e-05 AGTGCCCTGGGCCT ++++ ++ ++++ 376 ATTTGGTAACCTTGCCCTAGTAATCTAGTGTTTCCTTTCAGAGAGTACTAGGAAATGTTGAAATGTCACAATCCA TTAAGAG +++++ 451 TTAAATAAATGAGCCCTGTATGGACAGGACATTTGGGTTAGTTTTCAGGATTAAAGAGTAAATATGCCGTGCACT [2] 2.0e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++ ++++++++++ 526 GTGACTGTATGTGTCAGGTTACTCCCGGAAAGCTGTTTCCTAGAAATGTAGCAGGACAAAGGATTCACAGAGCTT [ 6 A + 826 CTTTGTGACAAGTTCCAACAGCACTAATCCTATCAGTTGTCTTCTCTGCTTCTTTACTAGTATCTAACTATTGCA 4] .6e-05 GGAGAGAGCTCACACCTCTCCC + +++ + + +++++ +++ + 901 
GAACAAAACCCACACAACACTCAGCCTTCCCTGAAGACGTATCAGCATCTGCTGGAGATTCCTGGAAAACACATC [8] 1.1e-06 AACTTCAAAATGTCTCTTTGA ++++ +++++++++ +++++ 1951 ATTTTACTACTAAAACTATAAACTGTTTATTTTAGAAATCAAACACAGGT NM_019794 ENSMUSG00000031701 LENGTH = 2000 COMBINED P-VALUE = 4.85e-05 E-VALUE = 1.7 DIAGRAM: 682_[8]_57_[3]_237_[8]_102_[4]_836 [8] 4.2e-06 AACTTCAAAATGTCTCTTTGA ++++++ ++ + ++ ++++ 676 AAGTATATACTTCAGAAGGAATTGTTGAAATGAAGGGTAAGAAACTGGAATCAGGGCTGGTGCATTTCTAGGCTG [3] 7.5e-07 AGTGCCCTGGGCCTTTAAGAG + +++++++ + +++ ++++ 751 AGCTAATTGAATTGCCCTGAATTCTTCAAAGGCAGCCTAGAATACAACAGCAAATCTTTTGTCTCAAAACAAGTA [8] 7.5e-05 AACTTCAAAATGTCTCTTTGA ++++ ++++++ +++ + + 976 AACATATGAAAGGATACATGAACATTAGTCATCAGGGAAACACAAATCAAAACTGCTTTCTATCCATGGGAGAGG [4] 1.0e-05 AGGAGAGAGCTCACACCTCTCCC ++ ++++++++ + ++ +++++ 1126 AGAGAGATGGTTCAGTAGTGAAGAGCACCAGCTACTCACCCAAAGGACAGGAGTTCAAATCCCAGCACCCACATG NM_019343 ENSRNOG00000003618 LENGTH = 2000 COMBINED P-VALUE = 4.94e-05 E-VALUE = 1.7 DIAGRAM: 228_[8]_40_[3]_58_[2]_655_[8]_70_[4]_241_[3]_580 [8] [3] 1.3e-05 3.3e-05 AACTTCAAAATGTCTCTTTGA AGTGCCCTGGG ++++ ++++++ ++ +++++ + ++ + + + 226 CTTGGCTTTAAAATGCTTGTTTGATTATTTTCTCCTTTCAAAGGTTGAAAAGCACAGAACTTTCATTGTCTTAAT [2] 7.8e-05 CCTTTAAGAG ATGAAAA + ++++++ + +++++ 301 CTTTTAAACATAAGAAAGAGTTGAGGGTAAGGGCAAAAAAAATGCATGCAACTATTGTTAATAAGGAAAAGGAAA GCAGTGATTCACAG ++ ++ ++ ++ ++ 376 CATGGAACACAGACACAAAGAGGAACTTATCCTTCATTGTCTACACCATGCTATACACTTCTTTCCAAACCTTAG [8] 7.0e-0 AACTTC ++++ 976 CAAGAGAAGCAGAAATGTTGTAAACGCGGCAAGCTTTCGTGCTGACACTTGCAAAGCCTAAGTAGCTGAAAATCT 5 AAAATGTCTCTTTGA + + +++++ ++++ 1051 AGCAGGGTTCCTTGAGAGTTGATTCCTAATCTTCTCTAATATCATCGATTTGATAAAGAGTTGCTGCGGGTTAGA [4] 1.1e-07 AGGAGAGAGCTCACACCTCTCCC ++++++++ +++ +++++ + + 1126 CCTTGCTCAAAGGACAGAAATCTCACTTGATTCTATCTCTTGTTAAAGCCTGTAGGGGTGGTGAGTGAGCCCTTG [3] 6.2e-05 AGTGCCCTGGGCCTTTAAGAG + +++ +++++ + ++++ + 1351 AAAAAATAATGAATGAGGTTGATTTTTCTCTACAGATTTTCTATTTGCTATTACATTGGTCACATAAGCGATTTA NM_144715 ENSG00000163576 LENGTH = 2000 COMBINED P-VALUE = 5.47e-05 E-VALUE = 1.9 DIAGRAM: 
5_[3]_464_[4]_57_[3]_606_[8]_782 [3] 3.6e-06 AGTGCCCTGGGCCTTTAAGAG ++++ + ++ ++++ ++++++ 1 TTTCAAATGTCGTGCTCCAATAAGAGAAGACAGTATACATAAAGTAGATTTTTAGGCCAGATATAGTAAATGCAT [4] 3.9e-05 AGGAGAGAGCTCACACCTCTCCC ++++ ++++++++++++++ 451 CTATGGTCTTCCCAGTGAGAAGCAGCTCAAGACGCAGCTGCTTAAAGGAAACACACCTCTCCCCAGGTCTTCCAA [3] 2.1e-05 AGTGCCCTGGGCCTTTAAGAG + + ++ +++ + +++++ 526 ACACCTTGGCTGAAACTAGGCAGAAGCAGAAAAAAGGGAGAGAGAATTTCCTTGGATTTTTAACTCACTGGACAC [8] 3.6 AAC +++ 1126 TGATGACTTAAGCCAAACGTGTTAAGAAAGCATGAGTTTAGAGATCATTTGGAAAAGATTGCTTATTCCCCAAAC e-07 TTCAAAATGTCTCTTTGA + ++++++++++ +++ 1201 CTTATAATGTTTTCTTTTGTCTTGTTTTTGCTGTCTTAAAAGCTTTCAATTTGTGGTTTTTAATATATGGAAAGA NM_010510 ENSMUSG00000048806 LENGTH = 2000 COMBINED P-VALUE = 5.54e-05 E-VALUE = 2 DIAGRAM: 86_[3]_1326_[2]_153_[2]_277_[8]_74 [3] 3.5e-07 AGTGCCCTGGGCCTTTAAGAG ++++ +++++++++ ++ ++ 76 ATGAAGACACAAATGTGCTGGTTCTTAAACAGCACCACACAGATGCCTTTCTGTCTTTTATGGTGCTGTACTTAA [2] 4.3e-07 ATGAAAAGCAGTGATTCACAG + + +++++++++++ ++++ 1426 CAGATGTGAAGTGAAGAAGGGACAGACACCCTCAGCTTCCCCAGGGGATCATGAGGAGCAAGATAAAGGCATTTA [2] 4.4e-05 ATGAAAAGCAGTGATTCACAG + ++++++ +++++ ++ + 1576 ATCATTTCCCTGATGGCAAAGGTTTTGGGGTAAATAAAAGAGGTGATACATTCCTCTGTCTGGCCACCCTAAATG [8] 8.8e-05 AACTTCAAAATGTCTCTTTGA ++++ + ++ + + ++++ 1876 CCAGAAACTATACTACTTTTTTTAAAGGAGTACTTTACAAAGAACCCTTTATAAGAAGTGTCCAGCAATTATTAA NM_009241 ENSMUSG00000029682 LENGTH = 2000 COMBINED P-VALUE = 5.89e-05 E-VALUE = 2.1 DIAGRAM: 208_[2]_1224_[3]_294_[4]_209 [2] 2.5e-05 ATGAAAAGCAGTGATTC + ++++ ++++ ++++ 151 CATAGTTATTATCAATGCTTCCACCATGCCTATGCAATAAAGTCACTATAAAACCTTGAAAGAATTAAGGAATTC ACAG +++ 226 TCAGGTCATAGATCTTGTGAAGATTTCTGGATCGAGGTTCACCTGGGGAAGGTTTGAAGCTTTAACTAGCTTCTC [3] 2.1e-08 AGTGCCCTGGGCCTTTAAGAG +++++++++ ++++ ++++ + 1426 AAGTTCCAGGACACCCAGGGAATTTAACAATACCATGATCCAATAAATGAGTCAACGAACAATCAAACAAACAAA [4] 4.3e-05 AGGAGAGAGCTCACACCTCTCCC + + ++++ +++ ++++ ++ + 1726 TCTAAAATACTGATAGATACAGGACCCTATATATCTGGGAATGAAGTAAGAAAAATAACTACATTCAGTTTTCTG NM_014783 ENSG00000175306 LENGTH = 2000 COMBINED P-VALUE = 
5.94e-05 E-VALUE = 2.1 DIAGRAM: 419_[3]_585_[8]_774_[2]_159 [3] 9.5e-09 AGTGCCCTGGGCCTTTAAGAG + ++ +++++++++++++++ 376 CTTTAAAAAAGGATCACGACGTAGAAAAAGACACAACACTGAAGATTGTCATGGGTCTTTAAGAAAAGGAAATTT [8] 3.9e-05 AACTTCAAAATGTCTCTTTGA +++ +++++++ ++++ ++ 976 CTTTTCTTTGGATTGGGAAGGGTAAAATATCCGAAGATTTGAACTCCAAAAGAAACAAAATGATTCTATGCAAAC [2] 6.5e-05 ATGAAAAGCAGTGATTCACAG ++++++++++++ + ++ 1801 CCGCCCCAAAAACTGGCCTTGAAGGAACAAGTGAAACTCATCCTGCTTTTCATGTTTGCTGGGTTTGCCCGTTAC NM_006056 ENSG00000171596 LENGTH = 2000 COMBINED P-VALUE = 7.19e-05 E-VALUE = 2.5 DIAGRAM: 724_[2]_641_[8]_593 [2] 1.8e-09 ATGAAAAGCAGTGATTCACAG ++++++++++++++++ ++++ 676 CATCCCTGTTTCTCAAGGGGTGAGTTCAGATCTGCTATTCCCGTTTTCCATGGGAACCAGTGACTGACAGGACCT [8] 6.3e-05 AACTTCAAAATGTCTCTTTGA +++++ ++ +++++ +++ 1351 CTCAGGGGCAGCACTCAGGTGCTCACGGCTCTGCACAGCTTTTCACAGGCTCCTTTCTCTGCAGAATCTCCCTGT NM_175619 ENSG00000182223 LENGTH = 2000 COMBINED P-VALUE = 7.51e-05 E-VALUE = 2.7 DIAGRAM: 563_[2]_38_[4]_167_[8]_36_[8]_1110 [2] 1.0e-06 ATGAAAAGCAGTGATTCACAG + ++++++++ ++ + ++++ 526 GAAGATGAAGATACAGATATTTCTTATAAAAAACTAAAAGAAGAGGAAATGGCAGACACTTCCTATGGCACAGTG [4] 3.3e-07 AGGAGAGAGCTCACACCTCTCCC +++++ ++++++ +++++++ + 601 AAAGCAGAAAATATAATAATGATGGAAACCGCTCAGACTTCTCTCTAAATGTGGAGATACACAGGAGCTTCTATC [8] 9.7e-05 AACTTCAAAATGT +++++ ++++ 751 TCATACATGTAACAATTCTGATCTTTTTAAGGTTCACTGGTGTATTAACCAAACGTTGTCACAAATTACAAATCA [8] 6.5e-05 CTCTTTGA AACTTCAAAATGTCTCTTTGA + ++++ ++++ ++ ++++++++ + 826 ATGCTGTAATATAATTTGCACCTGGAATGGCTAACGTGAAGCCTGAATTAAATGTGGTTTTTAGTTTTTACCATC NM_000994 ENSG00000144713 LENGTH = 2000 COMBINED P-VALUE = 7.57e-05 E-VALUE = 2.7 DIAGRAM: 327_[4]_87_[8]_302_[8]_944_[8]_254 [4] 4.8e-07 AGGAGAGAGCTCACACCTCTCCC +++++ ++++ +++++ +++ + 301 AGGCTGTGCTGTCAGCTGTAAAAGATCAGGAGGCAGCAGACACCACTCTGGTTTCTTCACTGCATTCAGCAATGC [8] 1.7e-06 AACTTCAAAATGT ++++++++++ ++ 376 CTGAAGTTAGTGCTCAGGCCGGGCATCTCAAAAGAAAAGATACTTGAGTTATTCACATTTTAAAATTCAAAACGG CTCTTTGA +++ +++ 451 TTCATTTTTAAGTGGCAGTGATGAATCAGAAATTTGGAAGATGATACGGGTTTCTTTTTTCCAGGGAGGAGGAAT [8] 
1.0e-05 AACTTCAAAATGTCTCTTTGA ++++++ ++ + +++ +++ 751 AATAAAGATTAGATTCTAAGTTACTTCTTTCCTCTGCACGACTCGTCTCCAATTGTTAATCNNNNNNNNNNNNNN [8] 8.4e-05 AACTTCAAAATGTCTCTTTGA ++ + + ++++ ++ +++ 1726 AGGCTAAGAATGACTACTTGTGGCTTGGGAGCCACAAGCTTCTTCAAGTGTCTCAGAACCTACCTGGTGTGAGGG NM_031922 ENSG00000135597 LENGTH = 2000 COMBINED P-VALUE = 7.61e-05 E-VALUE = 2.7 DIAGRAM: 396_[4]_58_[8]_92_[8]_253_[8]_418_[2]_676 [4] 3.0e-05 AGGAGAGAGCTCACACCTCTCCC + +++++ +++ ++++ ++ ++ 376 GAATGTACTCCTCTAGAGAGCATTAAAGATATACCACTACAAAGTACTAAAAAGTTTTCTATAATTTTTAAGTGT [8] 2.5e-05 AACTTCAAAATGTCTCTTTGA +++++ ++ + + +++++++ 451 TATTTATTGGAAGGCTAAATTCTAAGAAGCTTTAACAAATATTTTTGACATTGTTAAATTAGAAGGCTATTTTTA [8] 2.0e-07 AACTTCAAAA ++ + ++++ 526 AAAAGCATATTTTAAAAAACAAAACTAAATGGAAACAGCAAAACTTGCTTGTAATGAAATACAGCTGAGTAAAAA TGTCTCTTTGA +++++++++++ 601 TGTTTCTTTGAAAACAAAGACACTCAGAAAAACAGGTGTTGAGTAGTTTCCTGGGCCTTTAATGCTTTAAATTTA [8] 9.3e-05 AACTTCAAAATGTCTCTTTGA + ++ ++ ++ ++++ +++ 826 CAAAGTCTATCGGTGCAGTTTAGGACTGTGAATCTATAGTATTTAAACATTTTTTCTTTCAGAAATGAAAATACA [2] 8.7e-06 ATGAAAAGCAGTGATTCACAG + +++++++ + +++++ ++ 1276 AAGTTTTAAAACAATACTTAAAAAAATTAAGAAAGGCCATTACTCATACCTATTTCATAAGTTTACATGACTGCT NM_007260 ENSG00000011009 LENGTH = 2000 COMBINED P-VALUE = 7.96e-05 E-VALUE = 2.8 DIAGRAM: 711_[8]_915_[2]_332 [8] 4.6e-07 AACTTCAAAATGTCTCTTTGA +++++ +++++++++ +++ 676 CTCTGGGCCTGGGGGAAAGTGCTTTGTTGAGCTTCCAACTTTGTACTGTTTTCTTTTTCTNNNNNNNNNNNNNNN [2] 3.5 ATG +++ 1576 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGTTCAGAGTGTGTCATTCCATTCATGATCCTGGAATAGGGCATG e-07 AAAAGCAGTGATTCACAG ++++++++++ + ++++ 1651 AAAGCCAGTGGCAGACACCCGAGGCCTTAAGAGGTGTGTCCCATGAGGATGGGAAGGTGGTAAAGTTTATGCAAA NM_011839 ENSMUSG00000046212 LENGTH = 2000 COMBINED P-VALUE = 8.15e-05 E-VALUE = 2.9 DIAGRAM: 111_[8]_61_[8]_414_[8]_491_[4]_211_[8]_213_[3]_64_[8]_286 [8] 7.8e-06 AACTTCAAAATGTCTCTTTGA +++++ ++ + ++++++++ 76 TTCTTTGAAGCTATTTGTTCTTTTATAGGAAGCTGTAGCTTTTAAGTTTTTTTTGTCTTTCTGTAGCATAATTAC [8] 9.3e-05 AACTTCAAAATGTCTCTTTGA + +++ ++ +++ +++++ 151 
ATTAGTTTTCTTTCTCTGTTGCTTGTAAAACATAAGATATTTTACCTTAATCATGAAATTTGTCTTGTACTGAAA [8] 2.4e-05 AACTTCAAAATGTCTCTTTGA +++++++ ++ ++++ ++ + 601 CAAACATACTCCTTTGAAAATTTTCTTAAAATTCAGACCTTCTCCTTAAGAGGATAAATTGTTTTATTGTTCAAA [4] 4.7e-07 AGGAGAGAGCTCACACCTCTCCC ++ ++++ ++++++++ + + 1126 TCAAATCTTGTCCTTTGGCCAGAAAACAAACCAGTTTCTTTAGGATATTTTCAGTTTTCCTAATTATTTTTTATT [8] 2.3e-06 AACTTCAAAATGTCTCTTTGA +++++++++++ ++++++ + 1351 GTTTTAAGACCTTTATATATAGGTAGATTCAAAATAATTTTTGAAATTTAATTAAAATGATACTGTTTATTGCAT [3] 6.5e-05 AGTGCCCTGGGCCTTTAAGAG +++ + + +++++++++++ 1576 AACTTAAAAAGAATAGTTGAAGAAAGAGCATGACATGACTTCATTCTTTAAAAGCAATTTTAAAAACATTTAGAA [8] 1.4e-05 AACTTCAAAATGTCTCTTTGA ++ ++ +++ ++++++++++ 1651 CTAGATTTTAAAAAACCTTACAGCATTAAAAAAATCCTGATATAATTTTAAATAGGCTTTTTTATTATTTTAAAT NM_009531 ENSMUSG00000030094 LENGTH = 2000 COMBINED P-VALUE = 9.75e-05 E-VALUE = 3.5 DIAGRAM: 90_[8]_729_[2]_1002_[3]_87_[2]_8 [8] 5.5e-07 AACTTCAAAATGTCTCTTTGA +++ ++ + ++++++++++ 76 CCGGCCTCAAGCAGTTGCTGCAGAGTGGTTCTTTTTGCATTTAATGCCCATCACTATCCTCATCCCAAATAAAAT [2] 7.6e-05 ATGAAAAGCAGTGATTCACAG + ++++ ++ +++++++ + 826 CTACTTTGTCTTGGCTTTAAAGTAAATGACTCATGCTTCATGAACATTGAGAATGTGTCAGAAGCATGATGCACA [3] 1.9e-05 AGTGCCCTGGGC ++++++++++++ 1801 CTCTGACAAAAGAGCAGCAGGCAGTACTGGAAACCTGACTCAGGAAGGCAGGCCTGAAAAGCAAATGCCCTGGGT CTTTAAGAG ++ + +++ 1876 CTGAAGGAGAGGAAAGCAAGCGTTGTGAACGGCACTAAAGGAAGATTTGTCTCAGTGCTCCAGTACCCATAATTT [2] 2.7e-06 ATGAAAAGCAGTGATTCACAG + +++++ ++ ++++ + + 1951 CCCAGTGAGTTCTATTTAAAAACAGAAATAAAGGACAGATAAGGCATTGT NM_011717 ENSMUSG00000024050 LENGTH = 2000 COMBINED P-VALUE = 9.76e-05 E-VALUE = 3.5 DIAGRAM: 146_[8]_277_[8]_589_[4]_152_[2]_750 [8] 9.5e AACT +++ 76 ATAGCCAAAACTGTTCTTCACAATTTAGTTGGGGCAGGGGAGGTAGAGAAGTATAGATTAGGCTGTTGCTGTGAT -05 TCAAAATGTCTCTTTGA + +++ + ++ ++++ 151 TAAAAGGGCTTGTTTTTACCATTTGTCCAGATACAATGTCCTGACGATAATTTCTCCCCACTTTTGCAAAAGAAA [8] 1.1e-0 AACTTC ++ ++ 376 GCACTTGGTGTGATTTCACCCAAGGTCTTGTTTAATTTCTGATGTCCTCACAGGTTGAGCTTAGCCTAAAAGTTA 6 AAAATGTCTCTTTGA + + +++++++++++ 451 
AGATTGTTTCTTTGAGACCATGGAGTGGTTTAAGCAAAGACTTAAGCTATGCTCAGGAGTTTTAGGAAAAGGTCT [4] 1.4e-05 AGGAGAGAGCTCACACCTCTCCC ++++++ + ++ ++ +++ + ++ 1051 AAGTAGGAGATAACAGAATCTTATTACCTTAGAAATGTTCAGTAAACATTAGCTGAGCAAGAAAATAGCATTGTT [2] 2.4e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 1201 TTTGAGGCCAGCCTAGTTTATAGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTTGAAAAGCAAAA NM_011291 ENSMUSG00000025922 LENGTH = 2000 COMBINED P-VALUE = 9.78e-05 E-VALUE = 3.5 DIAGRAM: 678_[3]_299_[2]_435_[8]_161_[4]_317_[4]_1 [3] 2.5e-05 AGTGCCCTGGGCCTTTAAGAG + ++ + ++ ++++ ++ + 676 GATATTATCTAGGACCATCAACAAGAGAAATTACTTCATAACCGGACAAGAACTTTCCACCAACAGTACCCAAGG [2] 2.4e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 976 AGCCTGGTCTATAAGTAAGTTCCAGGACAGCCAGGGTTACACAGAGAAACCCCCAAAAGAAAAACAAAAACCCCA [8] 3.8e-06 AACTTCAAAATGTCTCTTTGA ++++ +++++++ +++++ 1426 GAAAAGCCTTTAACAAGTTGTCTTTCAAAAACTATAAACTGGAACTTTGCACATCATAGGCTGTTGATCTGACTG [4] 3.7e-05 AGGAGAGAGCTCAC + + ++ + +++++ 1576 TGCCAAGCTGTGACTGTTTATTAACTGACCAGATTACAAAAATACAATGGGTGACACCTGAAAGGGAAAAAACAC ACCTCTCCC ++ + ++ 1651 ACGAGTTAGAGACTGTCTTCCAAGCCGCACATCAGCACCCCCCCGTAAAGGAGCACACTTCCATTAAAGACACAT [4] 9.6e-05 AGGAGAGAGCTCACACCTCTCCC + +++ + +++ +++++++ ++ 1951 CGATTGCCAGAAATAGATTCTGTCTCAAAAAACAACACTAACTTCTTACC NM_000324 ENSG00000112077 LENGTH = 2000 COMBINED P-VALUE = 1.01e-04 E-VALUE = 3.6 DIAGRAM: 1220_[8]_55_[3]_105_[2]_557 [8] 1.6e-07 AACTTCAAAATGTCTCTTTGA +++ ++++++ ++++++++ 1201 NNNNNNNNNNNNNTTTGATTTTCTTAAAAATGATTTTTTGACGATGTCAGTTATCTTTTCTAAAATTTAACTTTT [3] 2.0e-06 AGTGCCCTGGGCCTTTAAGAG + ++ + ++ ++ +++++++ 1276 TACAACATTCTATAATGAGACATTAACTTGATTTTTTAAAAATAGCAAATACATGATAAATATTTAAATTGAGTG [2] 6.3 ATG + + 1351 TGGACTAAATATGAAGTGTGTGTGTGTGCACCTGTGTGCTTGTTACTGACATGAATTCCAACTACCAAAGAAAGG e-05 AAAAGCAGTGATTCACAG ++++ ++ ++++ + 1426 AGAGTCAAAGACTGATGNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGACATTTTATGCAAAATAGACTTTGATCA NM_026408 ENSMUSG00000024534 LENGTH = 2000 COMBINED P-VALUE = 1.05e-04 E-VALUE = 3.7 DIAGRAM: 28_[2]_151_[4]_27_[4]_227_[4]_311_[8]_220_[2]_904 [2] 3.6e-05 
ATGAAAAGCAGTGATTCACAG + +++++++ ++ +++++ 1 TAGGAGAAGCAAACCAGCAAACAGTCCAAACAAAAGCAAACACACACAGTCTATGTGGGAAAGTGTATACAAGCC [4] 5.6e-06 AGGAGAGAGCTCACACCTCTCCC ++++++ + ++++++ ++++ + 151 GAAAAAAAAAGTAACAAAAGGACTATAAAAAATAGTAGGGGAAATAACTGAGGAAACATCACAAAGATCTCTGCC [4] 2.0e-06 AGGAGAGAGCTCACACCTCTCCC ++ ++++ +++ + ++++++++ 226 TTACACTGTGGCTTGCTGGTTGCTGTGGTCAGAAAAACCTCTTCACCCCCAGGCAATCTATTTTCTGCACATACA [4] 9.1e-05 AGGAGAGAGCTCACACCTCTCCC + + ++++++++++ ++ + ++ 451 CAATGAAGTCAGGAGGGCAGGAACTGACATTTCTACTCTTGTCTAGCTCCACGTCAGAGAAAACGCCAATTCCAT [8] 5.8e-07 AACTTCAAAATGTCTCTTTGA ++++ ++++++++++ +++ 826 TAGGGAGGAGGATTAAAACTGTCTTCTGGTGACATTCAAGGCTGTCATTTCTAGGGGTGCCATTTACTTATGGTA [2] 7.3e-05 ATGAAAAGCAGTGATTCACAG +++ +++++++ ++ ++ + 1051 CTTTCTTACTGGCAGTGTATTTTCTGTGACAGGCAGGCATACATCCCTCTGCCAGCACCCAGGTAGTATTGGATT NM_026065 ENSMUSG00000009550 LENGTH = 2000 COMBINED P-VALUE = 1.05e-04 E-VALUE = 3.7 DIAGRAM: 107_[8]_176_[3]_286_[4]_735_[8]_610 [8] 1.7e-06 AACTTCAAAATGTCTCTTTGA +++ ++++++++++++ ++ 76 GAATGGGGGATGAGCCAAAAGACACAGACAAGTGCTATAAAATGTCTTTTAGACTTTAGTGACCAATTCCACTCA [3] 8.7e-05 AGTGCCCTGGGCCTTTAAGAG + + + +++++ + ++++ 301 TTTTGGCATTTTGGTTTAAAAAAAAAAACCCAGAGATATTAGAATACTATTTTCATGTTTTGGTTTTACTCTCAG [4] 3.0e-07 AGGAGAGAGCTCACACCTCTCCC ++++++++++++++++ ++++ 601 ATTAAGACTTCCTGACAGAGATCAAACCTGTCCCAAAGAACTGGACACCCCAAGCAGCAAGACCTAGTCTACCCA [8] 5.2e-05 AACTTCAAAATGTCTCTTTGA + +++++++++ +++ +++ 1351 ACTGATGATGTAAAAATAGTTAATCATACTGGATTTATGACAATAAGTAACTGGGCTGTGCAGGGTGGATCACTC NM_010324 ENSMUSG00000025190 LENGTH = 2000 COMBINED P-VALUE = 1.06e-04 E-VALUE = 3.7 DIAGRAM: 605_[2]_42_[2]_1105_[3]_185 [2] [2] 8.1e-05 1.6e-07 ATGAAAAGCAGTGATTCACAG ATGAAAA + +++++++++ + + +++ + ++ ++ 601 CCAGCACGGGAGGCAGAGGCAGGCAGATTTCTGAGTTTAAGGCCAGCCTGGTTTACAGAGTGAGTTCCAGGACAG GCAGTGATTCACAG +++++++ +++++ 676 TCAGGGATACACAGAGAGACCCTGTCTCAAAAAAAAAAAAAAAAAAAAAAAAGGTGGGGGGTGGTTACTGGGTAG [3] 2.4e-0 AGTGCC + + + 1726 ATACACTAGGGACATAGCAGCTCCAGCGGTGACGGAAAGAGCTGGTATTCTAGAATCCCTCTAAATGATATCATC 6 CTGGGCCTTTAAGAG ++ +++ 
+++++++ 1801 CTAGTTTCTTAAGAAAATGGCTGGTGTCTGAACGATAAACCTACAAAAGCAACATGGACTTACAGCCTGGATCCT NM_028381 ENSMUSG00000003208 LENGTH = 2000 COMBINED P-VALUE = 1.06e-04 E-VALUE = 3.8 DIAGRAM: 34_[8]_1159_[3]_6_[2]_433_[2]_284 [8] 2.5e-06 AACTTCAAAATGTCTCTTTGA +++++ + +++ +++++++ 1 GACGTGCACCACCACCCAATAGGAATTACTATTAAAATTAAGAATTCCTCTTTGCCCATTTTAAATTTTTAAATA [3] [2] 8.3e-05 4.6e-05 AGTGCCCTGGGCCTTTAAGAG ATGAAAAGCAGTGATTCACAG + + +++++++ ++ +++ + + ++ +++++++ + +++++ 1201 TTTCTGAGTTCGAGACCAGCCTGGTCTATAAAGTGAGTTCCAGAACAGCCAGGACCACACAGAGAAACCCTGTCT [2] 1.7e-07 ATGAAAAGCAGTGATTCACAG + ++ ++++++++++ +++++ 1651 GATCTCCTTTAGTTTGTGACCACCCTGAGCTACATAGTGAGTTACAGGACAGCCAGGGACACACAGTGAGACTCT NM_033030 ENSG00000152430 LENGTH = 2000 COMBINED P-VALUE = 1.07e-04 E-VALUE = 3.8 DIAGRAM: 189_[3]_218_[8]_1213_[8]_53_[2]_3_[4]_170_[3]_26 [3] 6.2e-05 AGTGCCCTGGGCCTTTAAGAG + ++ ++ ++++++ ++++ 151 NNNNNNNNNNNNNNNNNNNNNNNGTAACATGTTTGGGCTATTAAGCTAGGCCATGAAGAAGGCTGTATGATATCC [8] 8.8e-05 AACTTCAAAATGTCTCTTTGA +++ ++++++ +++++ 376 GTTGTGGAAGAAACACACTTGGGTATCAATTATCATCAAAATTCCTTATCCAATAATACATAATTAATTTTTATT [8] 5.4e-07 AACTTCAAAATGTCTCTTTGA ++++ ++++++ +++++ + 1651 TTAAAAGTATTATAATTAAAAATGATTCTTATCCTAAATATGAAAATAAATATCAAATAATATGAAACATAACAT [2] [4] 3.1e-05 1.3e-05 ATGAAAAGCAGTGATTCACAG AGGAGAGAGCTCACACCTCTCCC +++++++ ++ + ++ ++ ++ ++ +++ + ++ +++++++++ 1726 CTAATTGTTTGATAAGAAAAAATAACACAAAGCAGAGAAAATAAAAGACACCTCACTATTCCCAATGTGGTTATT [3] 4.3e-05 AGTGCCCTGGGCCTTTAAGAG ++++ ++ + +++++ 1951 CTGAGTATGGTGAGGTATTAACTAACACTAAGTTTCACAACGGGCAGCTT NM_003215 ENSG00000135605 LENGTH = 2000 COMBINED P-VALUE = 1.11e-04 E-VALUE = 3.9 DIAGRAM: 97_[8]_88_[3]_758_[4]_369_[2]_602 [8] 9.6e-06 AACTTCAAAATGTCTCTTTGA +++++++++++ ++ ++ 76 ACAGTCTCATTCTAAAAACAAGTACTTCATACTGATTGCATTTCTACACACACATATGCAGATTTCTGGACACTC [3] 1.3e-06 AGTGCCCTGGGCCTTTAAG + +++ ++++++++++++ 151 TTTCTGTTTTTCTTTCATTTATGTTCTCAATGCTGAACCTTTATTAGAATTACATCATCACTGTGGTTCTTTAAA AG 226 TAAGAAACAAAACTCTGTCACAACATCTGAGGAAATTATCTTTTAATGTCCCCAAGCTCTATGCCTTGGAAAAAA [4] 3.5e-05 
AGGAGAGAGCTCACACCTCTCCC +++++++ ++ ++ + +++ 976 TTCATCCCACTGGAAAGCATAACCATGTGTCAACTAATTAACAGAGCAACTTTAATGAAGACACAGCATATTCTA [2] 8.8e-05 ATGAAAAGCAGTGATTCACAG +++ ++++ + ++ ++++ 1351 TGGGATGGGAGGAGGGAAGGTCAGGAAGGAAGGAGAAAAGGCTGACAGTGATAGTAAGGTAGAGTGTGCCAGGAC NM_172893 ENSMUSG00000038507 LENGTH = 2000 COMBINED P-VALUE = 1.14e-04 E-VALUE = 4 DIAGRAM: 44_[3]_340_[2]_74_[3]_791_[3]_133_[3]_130_[8]_362 [3] 4.0e-07 AGTGCCCTGGGCCTTTAAGAG +++++ +++++ ++++++++ 1 ATGTATATAGACACTCAGCCTCCAAAGTAGCTCCAAAGCCTCATTATACTTTGGTTTTTTAAAAGGACAGAGAAA [2] 9.7e-05 ATGAAAAGCAGTGATTCACAG +++++ +++++++ ++ 376 CAAAGAACAGAGGGGGTTGTAAGTGCAAGAGGGGAAATTGAGGATTCAAACTGCCATCAAGGGACAGGAGTCCAG [3] 7.3e-06 AGTGCCCTGGGCCTTTAAGAG +++++ ++++++ ++++ + 451 TGATTCTAATGCAGTCCTGTCCCTATTCACAACAGACAAGGCAAGGCAGGTATGCCTGGGGTCTATAAATGAAGA [3] 1.5e-05 AGTGCCCTGGGCCTTTAAGAG ++ +++++ +++ + ++++ + 1276 ATTGGGTCCAAAACAGGAATCCCTCATCAGTGAGATTAACACCCTTGTTTAATAAAGGGTCTGGGATAGTCTCCT [3] 6.1e-05 AGTGCCCTGGGCCTTTAAGAG ++++++ + ++++ + ++++ 1426 CTTGATTTTGAACTTCCTAGTCTCCAAAACCTTGAACAATAAATACCTAGTGTCCATTAAAGCCACTCCGTTTAT [8] 1.4e-06 AACTTCAAAATGTCTCTTTGA ++++++ + +++++++++ 1576 TCTTAAAAACAGACATCAAAAAAATAAACTCCAGTCTTCATCTACTTCAGATTGTCTTTTTATTTCCATACTCTC NM_133867 ENSMUSG00000040600 LENGTH = 2000 COMBINED P-VALUE = 1.16e-04 E-VALUE = 4.1 DIAGRAM: 769_[3]_282_[2]_761_[4]_123 [3] 4.0e-07 AGTGCCCTGGGCCTTTAAGAG ++++ ++ +++++++++++ 751 AAACCCAGCCTGAGAATGGAATATGATTGGCCTTTAAGAATAAATAAACAAATTAAGGCTATTCAAACCTGAAAG [2] 4.4e-06 ATGAAAAGCAGTGATTCACAG ++++++++ ++++ ++ + 1051 ACATCATCTCTCACTTGGTCCTGAGGAAGCCAAGGACACATTCCTGACTGCAGGCTATAGTTATATGATCTGTGG [4] 1.3e-05 AGGAGAGAGCTCACACCTCTC + +++++ ++ +++++ ++ 1801 GCCACACCTGAGGTCAGAGGCCAACAAGGAAGAAGCTGGAGAGCAGGGGTCTCGGGTAAAGCTCTGGCACCTGTC CC ++ 1876 AGCACAGACACTTCCTCAGGTCCACTACCGTGCTGTGCGGTTGGTGGCCTGAAAGGCCTCTCCACCCTCTAGAGT NM_027877 ENSMUSG00000024906 LENGTH = 2000 COMBINED P-VALUE = 1.17e-04 E-VALUE = 4.1 DIAGRAM: 114_[8]_370_[2]_57_[8]_1295_[4]_78 [8] 9.1e-05 AACTTCAAAATGTCTCTTTGA +++ ++ ++++++++++ 
76 AAAAATGACCTGAATATAATTCTCTACTTTCTTTATCCCAGAAATTAAGTGGTTTTTTGTTCTTTATGGTTCCTG [2] 2.4e-06 ATGAAAAGCAGTGATTCACA + ++ ++++++++ + ++++ 451 CAGAGGCAGGCGGATTTCTGAGTGCGAGGCAAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGGCTACACA [8] 7.9e-07 G AACTTCAAAATGTCTCT + ++++++ + ++++++++ 526 GAGAAACCCTGTCTTGAAAAATCAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAACTTCTTTATGTTTCT TTGA ++++ 601 TTTACGATACAGGAAAAGGAATACTTCAATTTAATTCCAGAGTTGATAGGTTTATGATCATTTTAGCACAAAGAC [4] 4.9e-05 AGGAGAGAGCTCACACCTCTCCC ++ +++ + ++++++ ++ + 1876 GCGCTGTCCTCCTTTTTAACAGTCAGTACACAAAAAAAAAGAGTCTGCTCTCGAATTTTTGGGTGAGAGGAAAGG NM_011937 ENSMUSG00000024437 LENGTH = 2000 COMBINED P-VALUE = 1.19e-04 E-VALUE = 4.2 DIAGRAM: 418_[4]_623_[4]_913 [4] 8.3e-05 AGGAGAGAGCTCACACCTCTCCC ++ ++++++++ + +++ +++ 376 CATGATGGGCACTATGGTGACATGCAGCCTGGCACTGGGCAGTAGTTGAGAGCTACAGCCTGACCTGCNNNNNNN [4] 1.1e-09 AGGAGAGAGCTCACACCTCTCCC ++++++++++++ ++++++++++ 1051 AAGGTAGGTGAGAAAGGACAGAGATACAACTTCACAGTGTACCCCAACATATGGGGCCATTCTTAATCAATGGGA NM_024993 ENSG00000176204 LENGTH = 2000 COMBINED P-VALUE = 1.19e-04 E-VALUE = 4.2 DIAGRAM: 98_[3]_552_[8]_192_[3]_111_[4]_961 [3] 6.4e-06 AGTGCCCTGGGCCTTTAAGAG + + +++++++++++++++ 76 AAAAGTCTTAGAACCCCAGTTCTCAAATCCTGGTTCTTTAAGAATGTTTCTTTCTTCTTGGTGATTCTTAAACAT [8] 3.1e AACT ++ 601 ATATGCTATCAAAGAGCAATTATATTTTCCAACTGTTAACTAGCATTAGAAAGGGCAACATCTTTAATTACTCAT -06 TCAAAATGTCTCTTTGA + +++++++++++++ + 676 TAAAACTGTCTTTTGAAAGAGTGATCATCTAAAATCAGATACTAAAACATTATATTGCCAGAATTTAACCTCTCT [3] 5.3e-05 AGTGCCCTGGGCCTTT + ++ ++++ ++++ + 826 CAATTTACATCCAGCATTACTTATTGACAAGTTTGAAATTTGTGACTATTTGTCTGCTCACTAACATGATCCTGT AAGAG +++ + 901 AAGTGATCTTTTAAATTAAATAATCAGAAACCGGGTCCAAATTTATTCTTATAAAAACAAAAATAATATAACATG [4] 1.1e-05 AGGAGAGAGCTCACACCTCTCCC ++++ ++++++++++++ + 976 GAAGGTACACCTTTATTTGAGGCTGTTACAGAAAGGGAGGATACCCAGAACTCAAACCTCTCTCCTTTTATTTAT NM_002578 ENSG00000077264 LENGTH = 2000 COMBINED P-VALUE = 1.20e-04 E-VALUE = 4.3 DIAGRAM: 434_[2]_359_[2]_704_[3]_440 [2] 7.8e-05 ATGAAAAGCAGTGATT ++++++++ ++++ 376 
CATGATGTAACTTAGAGTGCATTACTGAGAGTAATAAACATACTATAACCACTAAAGGGCCGGAAAGCATTGATA CACAG + + 451 GATAATTTTTCAGCACAACAGTTAATGAGTAGACCCCTGTTTTTGTTTTTATTGTTCTTAGGTTTGTTCATAGTA [2] 2.5e-05 ATGAAAAGCAG ++++++++++ 751 TGGAGACAGAAACATTCCCTTGGGGAGTTCATGCTTTTCCAGTTTACTTTATCCTCCCAGGGGAATGAAAACCAA TGATTCACAG ++ + +++ 826 ATATACTCAGATGTAGACACAAGATACTAAAGAATAATGAACTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN [3] 1.1e-08 AGTGCCCTGGGCCTTTAAGAG + ++ +++++++++++++++ 1501 ACACCAAGGACATCTCGATGTTAAAGCAGCTAACAAGTCATTGTTATGGGCCATTAAAAATCAACAGCCAAACCT NM_021763 ENSRNOG00000010533 LENGTH = 2000 COMBINED P-VALUE = 1.21e-04 E-VALUE = 4.3 DIAGRAM: 788_[2]_130_[2]_361_[2]_173_[4]_462 [2] 2.5e-05 ATGAAAAGCAGTGATTCACAG + +++++ ++ +++++ + ++ 751 GAGACTGCATGTGTTCAGCGCCTGAGAGATTCAGACTGAGAAGAAACAATGACTTAGAGGATGGCAGAGTAAGAG [2] 5.9e-08 ATGAAAAGCAGTGATTCACAG +++++++++ ++++++++++ 901 TTGAGGCCAGGGCAAGTGGATACTGCAGCAGCCACTTTAATAAGAAGACAGGATTCACACAGAGGCACACTCGGA [2] 1.5e-05 ATGAAAAGCAGTGATTCACAG ++++++ ++++++ + ++++ 1276 AAGATTACATCACTGAGGAACCACAGTAGACACGCCAACCACATACATAAGACCAAGTGCTGCACAAGTAACCCA [4] 8.0e-06 AGGAGAGAGCTCACACCTCTCCC ++++++ + ++++++ + +++++ 1501 AGGTGTGGTTTTATGAGGACAAAAAAAAAATCCCTCCGTTACAGAGGAATTGCGTCATCTGTAGGAGTTTCTTCA NM_016957 ENSMUSG00000051601 LENGTH = 2000 COMBINED P-VALUE = 1.23e-04 E-VALUE = 4.4 DIAGRAM: 421_[8]_591_[3]_430_[8]_473_[8]_1 [8] 2.4e-06 AACTTCAAAATGTCTCTTTGA ++++++++ ++++ ++ +++ 376 TATCTCACTTAGGGTGATATTCTCAAGATCCATCCATTTGCCTGCAAAATTCATGATGTATTGTTGTTTTGTTTT [3] 3.1e-08 AGTGCCCTGGGCCTTTA ++++++ ++ ++++++ 976 TGACCACAGTGTAAGACCTATCTCAGAGTTGTTTTGATTTGCTTTTCCCTGATGACTAAGTACTTTGAATCTTTA AGAG ++ + 1051 AGTGCTTCTTGGCCATTCAAGATTCCTATCTTGTGAATTCTCTGTTTAGCTCTCTGCCCCATTTTTAAAATTGGG [8] 3.8e-05 AACTTCAAAATGTCTC ++++ + ++++ +++ 1426 ATTTATTAATTTGAGACCTCTCAAATTTTTGGATGTAGACACTTAACCATTAATCTTGCCACTTAAGACTGCCTT TTTGA +++ 1501 CATTATGTCCCATAGATTTTGGTATGTTCTGGTTTTATTTTCATTCATTTCTAGGACTTTTAACATTTCATTCTT [8] 1.3e-05 AACTTCAAAATGTCTCTTTGA ++++ + + ++++++++ 1951 
TATGGCTGATTTTGGAGAAAAATGTGTGTGCTTTAGTGTTTTTTTTTTTT NM_172499 ENSMUSG00000041945 LENGTH = 2000 COMBINED P-VALUE = 1.24e-04 E-VALUE = 4.4 DIAGRAM: 686_[3]_176_[3]_34_[8]_806_[4]_212 [3] 3.3e-05 AGTGCCCTGGGCCTTTAAGAG ++++++++++++ + ++ 676 TTAAAAAATAAAGTGCTATGGTTTTAAGAATACAAACATGGCTTTGATAAGCATTTGAAAGAAAAGTCCACTGTC [3] 7.7e-08 AGTGCCCTGGGCCTTTA +++++ ++++++++++ 826 CTCTCTGAGGCTCATCTCCGTGACATTCAGATGCTGTGTGAGGACATGGCACTGGTGCCATGCTTTGGTCCTTTA [8] 8.6e-06 AGAG AACTTCAAAATGTCTCTTTGA ++++ +++++ +++ ++ ++ ++ 901 AGAGTTCTTAGCGCCTGTGTTAAATTCACAGTGGAAACAACTTAAAAGTGAATTGGTTTTCTCACAGCCAGAGAG [4] 6.9e-05 AGGAGAGAGCTCACACCTCTCCC +++ + ++ + ++ ++++++++ 1726 CTGCCTTTCCCTAAGAGCAAATGTTAACTCTACTGTGCCCAGGTGGGAAGAGAAGCCTCTCAGGCAAGCTTCTGA NM_021470 ENSMUSG00000029130 LENGTH = 2000 COMBINED P-VALUE = 1.38e-04 E-VALUE = 4.9 DIAGRAM: 621_[4]_380_[8]_256_[2]_244_[4]_411 [4] 4.2e-05 AGGAGAGAGCTCACACCTCTCCC ++++++ ++ +++++ + + 601 AAAATAAGAAGCCCATAACTCAGGACAATAAAGCAACTTGTTAAAATTCAGACATCCAAAGAGTTAAATGTAGGT [8] 2.5e-07 AACTTCAAAATGTCTCTTTGA +++++ ++++++ +++++++ 976 TTAAGTGGAGTTTAAAATTATATTGTTCCAGAGACTTATCATGTGTACCAGATTTATAATGACTTTTGGGACTTC [2] 5.3e-06 ATGAAAAGCAGTGATTCACAG +++++++++ ++ ++++ ++ 1276 AGCGCGGGGATGCAGACTTAAGAGGGGTAGGAGGCAATGTCTCAGAGTGCAGAAGGCCAGGAGCCGCAAGGACAC [4] 5.5e-05 AGGAGAGAG +++ ++ ++ 1501 AGGGCGGATGGCGACGCCACACAGCTCGCAAAACAGTCGAGGGGCCGTTATCAAGTCTTTCGACTGAGGGGACAG CTCACACCTCTCCC ++++++ + +++ + 1576 CAAAAAGCGCACTCCAAAGAGCCGCACGGGTCAGCGCATCCTCGGGACGCATCACGCAGCCTGGCTGGAGGGCAG NM_177001 ENSMUSG00000047433 LENGTH = 2000 COMBINED P-VALUE = 1.39e-04 E-VALUE = 4.9 DIAGRAM: 548_[2]_1093_[3]_317 [2] 7.3e-08 ATGAAAAGCAGTGATTCACAG + ++++++++ +++ ++++++ 526 CACAGATGAAGAAACTGAGAGCCAGAAAAAGAAATGAGTCACACAACTAGAGACAAACCCAGTCACTTAGAAACT [3] 4.7e-06 AGTGCCCTGGGCCTTTAAGAG ++ +++++++++++++ + + 1651 AAAAAAACTAACAACACTCTGGTCCCTTCATTGAAGCCAGGGATAACTGCAGAAACTCTCATCTTGGAACACCCA NM_021836 ENSRNOG00000003678 LENGTH = 2000 COMBINED P-VALUE = 1.39e-04 E-VALUE = 4.9 DIAGRAM: 1433_[4]_61_[8]_462 [4] 9.4e-07 
AGGAGAGAGCTCACACCTCTCCC ++++++++ +++ ++ ++ +++ 1426 TACCCGAAAGGAGAGAACTACAAACTGCCCCGGGCTTGGGGCAGCCCAATCTTCTCCCTCCATCCGCCCAGCTTG [8] 3.6e-07 AACTTCAAAATGTCTCTTTGA +++++ ++++ + +++++++ 1501 GAAGAAAGAGACCACAGAAATTAATAAAGACTTTTTTTTTACTTAAATAGATTCAATAAAAAGTCAAACACACAC NM_011167 ENSMUSG00000021345 LENGTH = 2000 COMBINED P-VALUE = 1.43e-04 E-VALUE = 5.1 DIAGRAM: 175_[2]_314_[3]_577_[4]_584_[3]_160_[3]_83 [2] 4.1e-05 ATGAAAAGCAGTGATTCACAG +++++ ++++++ ++++ 151 AAGGGACAGCACATGCTGGTAAGGATGGGAAATAAGGGAAACACATCTCCATTAATGGTGGAAGTACAAACCTGT [3] 1.5e-05 AGTGCCCTGGGCCTT + ++++++ ++++ 451 ATTCTAATATTAAAAACAGACATCCTGAATTTTGCAGGCAAATGGATGGAACTTGATAATATCACCCTGAGTCAG TAAGAG + ++ + 526 TTAGTGCAGACACAAAAATAAATGCATTGTATGCAATCACTTTTAATTGGACATTATACATAACGTGTAGGATAA [4] 4.8e-06 AGGAGAGAGCTCACACC + +++++++ ++++++ 1051 ACAAAAGATTGAGGGAATATCCAACCAATGCTTTGCCCAACTTGAAACCTATCCTATTGGAAAAGAGCCAACACC TCTCCC + ++++ 1126 TGACACTATTAATGATATTCTGCTATGCTTTCAGACAGGAGCCTGAGAAACTGCATCCAGCAGATGACAGAAATA [3] 2.9e-05 AGTGCCCTGG ++++ + + 1651 AAGGTTATAGTTGGACTTATGACACATACTGAAGAGATCATCACTAACTGATATTTGCATCCTAAAATAATTTTA GCCTTTAAGAG ++ +++++++ 1726 TTTTTTAAAAATCAAGTTTTGGGGTATCTCAGACATGAATACAGTGTATTTTTATCATCTCTAGTCTTTGCCTCC [3] 5.4e-07 AGTGCCCTGGGCCTTTAAGAG ++++ +++++ ++++++++ 1876 TGGTTTCCAAAGGACAGCCTGAATGTTATGGAGTATTAAAAGTTTTATAATTCTGATTAGGATACATGTATCAGC NM_020610 ENSMUSG00000031019 LENGTH = 2000 COMBINED P-VALUE = 1.48e-04 E-VALUE = 5.2 DIAGRAM: 1245_[4]_285_[2]_29_[3]_376 [4] 9.6e-05 AGGAGAGAGCTCACACCTCTCCC +++ ++ + ++++ ++++ +++ 1201 TGGATTGATTAATAAAAAGCAAATTCTTGGAAGAAGCAGAGAAATAGGCCACAAATAAGACCTGGCAGTGTCAAA [2] 7.3e-07 ATGAAAAGCAGTGATTCACAG + ++++++++ + +++ + ++ 1501 CACATGCGCACACGCTTCAGGCTGCAATTATGCTTCTGAAGTGCTGAAATGAGAAAGAAACAAATAATTGATAGC [3] 1.2e-05 AGTGCCCTGGGCCTTTAAGAG + ++ + +++++ ++ +++ + 1576 AAATTCAAAGGAGTTAATCTGAAGGCAAATTAACTTGGGTTTTAAAATGTCTCATGTATACATCTATAATCCTAG NM_026838 ENSMUSG00000031253 LENGTH = 2000 COMBINED P-VALUE = 1.49e-04 E-VALUE = 5.3 DIAGRAM: 
8_[2]_55_[3]_1012_[3]_538_[2]_45_[2]_106_[8]_110 [2] 2.5e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 1 TGAGTTCCAGGACAGCAAGGGCTACACAGAGAAACCGTGCCTTGGAAAAAACAAACAAACAAACAAGCAGCAACA [3] 1.0e-05 AGTGCCCTGGGCCTTTAAGAG ++ ++++++ ++++++ + + 76 ACAACAATAAAAACCCTGATTCTTTCACAATTGTATAGAGATCTTAAAATTTTTCCTGTTACACACATATATTAT [3] 4.1e-05 AGTGCCCT + ++ + 1051 TGAATAGCCCTCCATCCAAGACAGCTGAAAAACAGCTTAGCAATAAATATCACCTTGCCATGGATAGATTATTTA GGGCCTTTAAGAG + ++ +++ ++++ 1126 GATTTATTTAAAGAACCTTTCAGAAAAGCCTCCCTAGGCATGCTTTCCTTCCATTCATAGTATGGGAAATTAATC [2] 7.2e-06 ATGAAAAGCAGTGATTCACAG +++++++++++++ + + +++ 1651 TTTCAATCCCGATGTCATGCTGAGTCATAAAAAGCAGTGTTCCTCAGAAGTCCTCTCTGGAGGTCCTATGCTAGT [2] 5.9e-05 ATGAAAAGCAGTGATTCACAG + ++++++++++++ +++ 1726 CATCACTCCACCCAGAGCTTGAAACCAGTGACAGTCACACTTCCCCTCTTCTGCAGCAGACAGCACTAGCTCCTC [8] 5.2e-0 AACTTC +++++ 1801 TAATCCTCTTGCTTCCCCCTCCCCCAACCATTTCTTGGGGAATAACAAATATAGCTTTGGGGATAATATAGCTTT 6 AAAATGTCTCTTTGA ++ + + +++++++ 1876 AAGACGACTTTTGGCAAATGTAAATGTCCTAACATCTGGGCAGTGTTACCAGAATCCCGGAGGCCCTGACAGACC NM_153270 ENSG00000185915 LENGTH = 2000 COMBINED P-VALUE = 1.50e-04 E-VALUE = 5.3 DIAGRAM: 38_[8]_66_[4]_175_[3]_1026_[8]_609 [8] 4.2e-07 AACTTCAAAATGTCTCTTTGA +++++ ++++++++ + ++++ 1 TCCTTTACTATTCTATATCCTAAAATCTACTTCTAATCAGCTTTATACTGTTGCCTGTACAGCTCAGTGAATGTA [4] 4.6e-06 AGGAGAGAGCTCACACCTCTCCC ++++++ ++ +++++++ +++++ 76 CTTTCATCTTTAAGAGTTCAGATATATGCCAGTGAATATTTTTGCTGTAGAGGAGAAAGTAAAAACTCCACAGCG [3] 2.1e-05 AGTGCCCTGGGCCTTTAAGAG ++++ + + ++ +++++++ 301 GTATGGAATTTTGGGTCCAGTGTAATATTTTTATCATTTAAAAAGAACTCTATTTGTAAAAACATTTATTTACTG [8] 7.7e-05 AACTTCAAAATGTCTCTTTGA ++++ + ++ ++++++++ 1351 CTTTCTAGATTAAAAAAAAGTGCTTAGAGATTTTTTTTTTCCTTCTCAGTTCCATATTTATATCAAACCATCAGA NM_013255 ENSG00000128585 LENGTH = 2000 COMBINED P-VALUE = 1.51e-04 E-VALUE = 5.3 DIAGRAM: 141_[3]_111_[8]_1249_[4]_122_[8]_81_[4]_187 [3] 1.0e-05 AGTGCCCTG + ++ ++++ 76 GCCTCTGTTAAACAGTTTAGAAGGGTATTGAAAGTCACCAAAAGGATTTAAGCTTAAGAAGAACTAATTAACATG GGCCTTTAAGAG ++ +++++++ 151 
TGTTTTTAAAAAAGATTGCTCAGGAACTATGTCTGGAAAAGAACAAGGACTTGACTTACATTAAGGTAAGAAGTG [8] 2.0e-05 AACTTCAAAATGTCTCTTTGA +++ +++ +++++ ++++ 226 GAATTATAAAATCTGGGGCCCAAAGGAATTGGAGTGAGTTAATTCCTCCCCTTTAAAGTGTCTATTTTCTTCAAA [4] 7.3e-05 AGGAGAGAGCTCACACCTCTCCC ++++++++ ++ + ++ ++ ++ 1501 TGGAAGGCGGGGACAAGACATTTAAAACCAGGACCACTCCAGGAGGACAGCACATTAGCTACTGCCCTCCAGCTC [8] 2.1e-05 AACTTCAAAATGTCTCTTTGA ++++ +++ +++++++ ++ 1651 TTCCAACTTGGTCTTCTTATTTTCCCTTCCTGCTCCATAACTACAACGTGTTTCTGTTCGTAATCATCTCGGCTT [4] 3.1e-07 AGGAGAGAGC +++++++++ 1726 TTCCTTACCTCTGCTTGTAAGAAAAGTCCTTTGCGCCATTCCCCTCCAAGCCCCAGGGAAGGGCCTGGAAAGCGC TCACACCTCTCCC + ++++++++ ++ 1801 TTACACCTCAGACCCAGTGTCGGTCAGCCGTCAAGGAGCGAGCGACGTGGGAAACCCTGAGATAGGACATTGGCC NM_031817 ENSRNOG00000015326 LENGTH = 2000 COMBINED P-VALUE = 1.52e-04 E-VALUE = 5.4 DIAGRAM: 72_[8]_1324_[8]_97_[2]_444 [8] 9.0 AAC +++ 1 TAATACCAATTTTTTTCTTTTTTTTCAGTTGCCAGATGTATACTTTGGGTTTTTGTTAAGGATTATTTTTTAAAA e-09 TTCAAAATGTCTCTTTGA ++ +++ +++ +++++++ 76 TTAAAAGTGTATTTTGTAATTTAGAAGGAATTTAACAATTTTTAGACTTCAAAATTGAGAATTTTCACTTTTTAG [8] 7.9e-05 AACTTCAA +++++ + 1351 CATCTGCTAATTGTACCTTGTCTGAAACTTTACTGTGTGTTTTTGGATTCCTTGTACTGGCATACACAGATTTAC AATGTCTCTTTGA + + ++++ +++ 1426 ATTTGTTCCTTTTTTTAATCACGAGTCCTGTGAAGCATAAGTAAAAGAATTCACACAACAGTTTGAATATTCACC [2] 3.1e-05 ATGAAAAGCAGTGATTCACAG + +++ + +++++++ ++ + 1501 TTAAAGCTTACTTAAAATACTTTTAAAATGAACTGAAGAACATAAGTGATACAGATGGGAGGAGGCTACAGCTCA NM_172801 ENSMUSG00000050201 LENGTH = 2000 COMBINED P-VALUE = 1.56e-04 E-VALUE = 5.5 DIAGRAM: 499_[3]_72_[2]_367_[8]_486_[2]_492 [3] 4.6e-05 AGTGCCCTGGGCCTTTAAGAG +++ ++++++++++ ++++ 451 AGTTGGAGATGATAATGTTTAAGTAAACATGCTCAAGGCAGCCCTTCAAGGTGGCATGGGTCATGAAAAAACAAC [2] 1.0e-05 ATGAAAAG + ++ +++ 526 ACTCAGGGAGTAGGGACAGGCAGGTCTCTGTGAGTTCAAGACCAGTCTGGTCTACAGTATGAGTTCCAGGATAGC CAGTGATTCACAG ++ ++ + ++ ++ 601 CAAGGCTACATAGTGAGATTCTGTCTAGTAAGTAGATGAGAATAAAATAATGTACCATAATAGAGAGAGAAAAGA [8] 9.3e-07 AACTTCAAAATGTCTCTTTGA ++++ ++++++ + +++ 976 
NNNNGTACTTTAAAATGAAATCTTTCAAAATGAAAGAAAGAAAACTGCATTTCCTGTCTTTCCCAATAGCCAGCC [2] 2.4e-06 ATGAAAAGCAGTG + ++ ++++++++ 1426 CTCGGGCGACAGGGGTGGGCAGATCTGTGTTCGAGGCCAGCCTGGCCTACCAAGCAAGTTCTAGGACAGCCAGGG ATTCACAG + +++++ 1501 CTACACAGAGAAACCCTGTCTCAAAAAAACAATAACAAAAGTCAGATGATGGAGAAAATGTCCTCTAAGTAAGTC NM_033072 ENSMUSG00000025409 LENGTH = 2000 COMBINED P-VALUE = 1.58e-04 E-VALUE = 5.6 DIAGRAM: 460_[2]_458_[3]_190_[8]_399_[3]_409 [2] 2.4e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 451 ACAGAGTTCCAGGACAGCCAGGGCTACACAGATAAAGTCTCGAAAAAAGAGAGAGAGTTTCTTCTGGGTGTTCCT [3] 8.9e-05 AGTGCCCTGGGCCTTTAAGAG + ++ + + ++++++++ + 901 CCTCTAACGTCTGTCTCTGGCCGAGTCTAACACTGTACAACTGTCTCTGACCATTAAATGCTGTTGTACCGTGGA [8] 6.3e-06 AACTTCAAAATGTCTCTTTGA +++++ + + ++++++ ++ 1126 GAGTTTCAAAATATTCTCATCTGTTAAATTAAGAGTGTCTCCCATAGAAAAGCAGTGGAGGCCCCACAGGGCAAG [3] 7.3e- AGTGC + +++ 1501 TCTGTGCTTTGAGCAACCTGAGCCAGAGGCAGAGGGGTGCTAGTGGGAATCGGGGAGATGCATGATGCTTATTGC 06 CCTGGGCCTTTAAGAG ++++ +++++ + + 1576 TCTGTACCTTTCACTGGGAAGGAGGGCAGCAGCCAACAGTAGCTCACAGGTTTGTAAACTGAGCCTGCTAGCTTT NM_177229 ENSMUSG00000047113 LENGTH = 2000 COMBINED P-VALUE = 1.60e-04 E-VALUE = 5.7 DIAGRAM: 412_[2]_472_[8]_74_[4]_977 [2] 2.5e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 376 TGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTTCCAGAACAGCCAGGGCTACACAGAGAAACCCTGTCTCGAA [8] 8.9e-07 AACTTCAAAATGTCTCTTTGA ++++ +++++++ ++ ++ 901 ACGTCTGCTTTAAAATGTATTCAGGTAAGAAAATTTCTTAAGCCTCTTATCAAAGATATTAGCAATTAATACCTT [4] 3.0e-05 AGGAGAGAGCTCACACCTCTCCC ++++++++++++ + ++ + 976 TATGGTAACTAAATTTCAGACTAACTTTACAGAGAACAAATCAGTCTCAGCATTGTCCTCTGAGGATTCTGGCAG NM_026776 ENSMUSG00000035120 LENGTH = 2000 COMBINED P-VALUE = 1.61e-04 E-VALUE = 5.7 DIAGRAM: 519_[4]_464_[2]_158_[3]_794 [4] 4.5e-0 AGGAGA +++++ 451 GTGTGTCTGAAGACAACTACAGTGTACTCACATATCTAAAATAGATATATCTTTAAAAAAAACAGACCATGGACA 5 GAGCTCACACCTCTCCC +++++++++ + + + 526 CAGCACACACATGTAATCTCAGTTCTCAGAAGGTTGAGGCAGTGGGATTAAAGGCTGAGCTCTGGGATAAAGGTG [2] 4.5e-06 ATGAAAAGCAGTGATTCACAG ++++ +++++++++ +++ 976 
TTTGGAGCTATGAATTAAACATTGTGACAGAGAGAGATTAAGGGATTCTCACAATAATTCTTCAGGGAACAGAAA [3] 2.7e-07 AGTGCCCTGGGCCTT ++++++++ +++++ 1126 TAAAGGTGAGCATCTCCTAGCAGCCCAAGGTCAGCCTTCTTTTCAATACACCTTTTGTTTAGTGCTCTTATCCCT TAAGAG ++++ + 1201 TAAATGGGTTCATATTGCATGTCAGGTCAGAAGACTCCCAAACTTGTATTTCTAGATCCAACAAGGCTCCCCTGA NM_027279 ENSMUSG00000041396 LENGTH = 2000 COMBINED P-VALUE = 1.63e-04 E-VALUE = 5.8 DIAGRAM: 60_[2]_237_[8]_1023_[4]_615 [2] 2.4e-06 ATGAAAAGCAGTGAT + ++ ++++++++ + 1 GGAGTCAGAGGCAGGCGGATTTCTGAGTTCGAGGGCAGCCTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCT TCACAG +++++ 76 ACACAGAGAAACCCTGTCTTGAAAAAACAAAACAAACAAACAAACAAAAAAAGATTGATAAAAAGGTTCCTGTAA [8] 1.1e-05 AACTTCAAAATGTCTCTTTGA ++ ++++ +++++ +++++ + 301 ATAGGGAACCTTGGAGACAAGTTCAGACTGTGTTTTTAAAATTAAGCCACATATAAAAGAGGGAGGTTTGGAGTT [4] 3.2e-06 AGGAGAGAGCTCACACCTCTCCC + + ++++ +++ ++++ ++ ++ 1351 AGATGTGGTACCACGGAAGAACTACAACTACATCCTTCAGCTCTCACGCTTCATCCAGATACTGAGCACCAGCGC NM_052828 ENSG00000137394 LENGTH = 2000 COMBINED P-VALUE = 1.72e-04 E-VALUE = 6.1 DIAGRAM: 814_[2]_215_[8]_137_[4]_737_[2]_11 [2] 6.5e-05 ATGAAAAGCAG ++ ++++ + 751 AAAGTGACAGCTAGCAATTTCTGCATCCCAGATGGAGTTAATGTCACCAAGAGAACTTGTACTAGGAGTAGGAGG TGATTCACAG ++++ ++++ 826 AGACTGACAGCCCCCAGGGTCTCTCCTCAGGAGAGAATTCAGTTATACTGAAGATGCCTTCCAGGCCCCCCTTGG [8] 5.9e-06 AACTTCAAAATGTCTCTTTGA +++++++ ++ ++ ++ +++ 1051 AGATTCAGAAGGGATCCATGATGAATGTGTCATTAGTTGGCAAGAAGAGCAGACACAGAGAGAATCAGAGATGCA [4] 1.8e-05 AGGAGAGAGCTCACACCTCTCCC ++++ +++ + +++++++ ++ 1201 CGCTTCTTTGGACTTCGAGCCCACTTCTTACCGGTAGGTCCTGGGCATACAACATACCACTNNNNNNNNNNNNNN [2] 3.0e-07 ATGAAAAGCAGTGATTCACAG ++++++++++ +++ ++++ 1951 CAAGTAAAATTACCCTAGGGGAGAAGCAGGTACTGACAGACCAACATGAG NM_153189 ENSG00000106304 LENGTH = 2000 COMBINED P-VALUE = 1.74e-04 E-VALUE = 6.2 DIAGRAM: 352_[3]_1004_[8]_602 [3] 5.3e-09 AGTGCCCTGGGCCTTTAAGAG + +++ ++++++ ++++++++ 301 AAAATACTTGTAAATATTTTTGCTTTAGGAGAAAATATGTGCTTTCAGGGATATTACGCTGGTCTCTTAAAAGTT [8] 4.0e-05 AACTTCAAAATGTCTCTTTGA + +++ +++ +++ +++ ++ 1351 
AAGTTTGTTCTGGCAAACTATTTTCTAACATTAAAATTGTATTTGTTTCCACTTTTTATTTGAGAAACAGTATGA NM_003117 ENSG00000106304 LENGTH = 2000 COMBINED P-VALUE = 1.74e-04 E-VALUE = 6.2 DIAGRAM: 352_[3]_1004_[8]_602 [3] 5.3e-09 AGTGCCCTGGGCCTTTAAGAG + +++ ++++++ ++++++++ 301 AAAATACTTGTAAATATTTTTGCTTTAGGAGAAAATATGTGCTTTCAGGGATATTACGCTGGTCTCTTAAAAGTT [8] 4.0e-05 AACTTCAAAATGTCTCTTTGA + +++ +++ +++ +++ ++ 1351 AAGTTTGTTCTGGCAAACTATTTTCTAACATTAAAATTGTATTTGTTTCCACTTTTTATTTGAGAAACAGTATGA NM_004490 ENSG00000115290 LENGTH = 2000 COMBINED P-VALUE = 1.74e-04 E-VALUE = 6.2 DIAGRAM: 677_[3]_153_[3]_257_[2]_576_[8]_253 [3] 2.3e-05 AGTGCCCTGGGCCTTTAAGAG +++++++ ++++++++ +++ 676 GAAATACTCAGGGCCATTCTGAGAGGTTGAAAAAGGCAAGTGTGTTTACTCATGCTGTTCTACTTAGAAGTCCTG [3] 2.4e-05 AGTGCCCTGGGCCTTTAAGAG ++++ + +++++++++ 826 TAGCAATTAGGCTTGCATTAGCTAAAAGTAAGTTAAATCTTTAAAAATTCAATAAATAATTCTTTTTCAGTTATT [2] 1.3e-06 ATGAAAAGCAGTGATTCACAG ++++++++++ ++ ++++++ 1126 AAAAATGGGAACAAAAGAATCACAGAATCTAAAAACTATTCCCAGATGGTCTCGAAGAAGTGATAAGGAAGGTTA [8] 3.6e-06 AACTTCAAAATGTCTCTTTGA ++++++++++ ++++++ + 1726 ATAATTCAAAATAGTTTTTAGTTTCATTAACAGGTTGTGGATAGTTTCTTTTAATTGATAAGTTATTATCATATT NM_153784 ENSMUSG00000043782 LENGTH = 2000 COMBINED P-VALUE = 1.78e-04 E-VALUE = 6.3 DIAGRAM: 120_[8]_1007_[2]_521_[8]_134_[3]_134 [8] 3.0e-05 AACTTCAAAATGTCTCTTTGA +++++ ++++ +++++++ 76 CTCCCAAATGCTGGGATTAAAGGTGTACGCTGCCACCACCTGGCCTACTTCCTACTTCCTTTTTTTTTTTTTTTT [2] 5.0e-06 ATGAAAAGCAGTGATTCACAG ++++++ ++ ++ ++ +++++ 1126 AATTTGAGGCCATCCTGTGACACATGAGACCCTGTCACACACACATCCCAAAAATGGCACCGCACACATTCACTG [8] 2.9e-05 AACTTCAAAATGTCTCTTTGA +++ ++++ ++++++ +++ 1651 TAGATGCTGCTTCCTTAGGGTGTTACAAAAGAAAAAAAATTACTATAAAAGGGCTTTCTTAGTGGAGGGCTTACC [3] 4.6e-07 AGTGCCCTGGGCCTTTAAGAG ++ + ++++ ++++++++++ 1801 GTAGATCTGCACATAGTAGGTGTTCAATTAATGTTCCCAGTAAGAAGCAAGATGGATCTTTAAGAGGATGGATGT NM_011741 ENSMUSG00000037327 LENGTH = 2000 COMBINED P-VALUE = 1.81e-04 E-VALUE = 6.4 DIAGRAM: 1899_[2]_80 [2] 9.6e-10 ATGAAAAGCAGTGATTCACAG + +++++++++++++++++++ 1876 
GAGTTCAATCCCCACGTCCCACAAAGGAAAGGAAGTGACTCACACAAACTGTCTTCTCAGGACACGCAAGTGGCA NM_002192 ENSG00000122641 LENGTH = 2000 COMBINED P-VALUE = 1.86e-04 E-VALUE = 6.6 DIAGRAM: 28_[4]_183_[3]_96_[8]_1046_[4]_202_[8]_126_[8]_147_[2]_21 [4] 1.4e-05 AGGAGAGAGCTCACACCTCTCCC ++++++++ +++ +++++ + 1 ATAGTATACCAACTTAGTATATTTTTCAAGGAGAGCTAAACCACCTTTTGTAATGTTTGGTTTCTCACTGTTATC [3] 4.7e-06 AGTGCCCTGGGCCTTTAAGAG ++ +++++++ + ++ +++++ 226 TGTAAAAATAGGGCTCTGGATTTTCAAAAGCACATACATGAATAATTTATTAGCTATTCCAGGCAAGCTAAGTAC [8] 7.0e-05 AACTTCAAAATGTCTCTTTGA +++ ++ ++++++++++ 301 TAGAATAAACTAGATAAAAACTTGGCTTTAAGCATGTACTTTGATATTTATAAAACAAAGGTGTTTTTTTTTCAT [4] 5.2e-05 AGGAGAG ++ +++ 1351 NNNNNNNNNNNNNNNNNNNNNNNNNNGCTATTTTGTGTGCTGTAGCAGTTCTTTTATAGCTCACATTAAGTGCAG AGCTCACACCTCTCCC + ++ +++++ +++++ 1426 CTCTTAAACCCCACCCCCACCCCCAACCAAAGAAAATACTTGTTAAATAAGGATTAGACAGGTCAAACACCATTG [8] 6.1e-05 AACTTCA ++ + 1576 ATGGACAAACATATTCCTTTCAAGTATCTCTCCTTGAAGAAAATAAAAATTAATCAGATTACTTCCAATACAAAA AAATGTCTCTTTGA ++ + ++++++++ 1651 AAGTCTCTCTTTTTGTTCTCTCTCAGGTAAACAGTTTCAAACCTATTAGGTTGCATAGTTCTAAGATCATAAGCA [8] 4.9e-05 AACTTCAAAA ++ ++ +++ 1726 CCTTAACGAAATGTAACTTGGTATTCTTTTTTCCTTGATCTTTCATTCTTTCCATACTTGTTCTCAATTTTTTAA TGTCTCTTTGA ++++ +++++ 1801 TGTTGTTTGTTTTGTTTTGTTGCAAAAGTGGTACAGAAAAAGGAAGTGTGACCACCTGCACACGATTGTTCTTTT [2] 5.7e-06 ATGAAAAGCAGTGATTCACAG + +++ + +++++ + +++++ 1951 TGATATACACAGAGATAAGTGTCACACAGACATACCTTATGACCTGGGTA NM_031249 ENSMUSG00000050614 LENGTH = 2000 COMBINED P-VALUE = 1.86e-04 E-VALUE = 6.6 DIAGRAM: 502_[8]_158_[4]_247_[2]_97_[3]_473_[8]_72_[4]_321 [8] 1.2e-05 AACTTCAAAATGTCTCTTTGA ++++ ++ ++ + +++++ 451 GGATGTCCTGGAAGTATCCCAAGTGCTAGGATTAAAGATGAGCCGCCATGCCTAATTTATGGTGAATGTTTTATT [4] 1.8e-05 AGGAGAGAGCTCACACCTCTCCC + +++ ++++ +++++ ++ ++ 676 CAAATTTGAAGACAGCTTAAACTACATACACAGTGAATTTTTCAGGCCAACCTGGGCTGTAGAAGGGCCCTGTAT [2] 3.6e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 901 GTAGGTGGATCTCTTGAGTTCAACGCCAGCCTCATCTACAGACTAAGTTCTAGGGCAGCCAGGGCTACACAGACA [3] 2.4e-05 AGTGCCCTGGGCCTTTAAGAG + ++ + +++++++ 
+++ 1051 CCAAGTTAAAACAGTGAAAATTATGTTTGGTCATTTAAAATAAAATGGCAGTATTTAAAGTACAAAACTGGCAAG [8] 4.2e-05 AACTTCAAAATG ++++ ++++ 1501 TTTAATGTTGTTAAGCTATTTTAAGGGATAAATAACAATATTTTTACCTCATAATATTGGATATAATTTTAAATA TCTCTTTGA +++++ +++ 1576 TTTCTATTATCCTACACTAAACGAAAGCTGTGAAAACACATTAAGCCTTAAATCAGCTTCCGGCTTGCGGTTTCC [4] 4.4e-05 AGGAGAGAGCTCACACCTCTCCC ++++ ++ ++ +++ + +++ 1651 CGCCGGTGGAACGCAATGAAAACAGCCCCGAGTGTGTGGAAACACAGACGCTGCCGGTTACCCCATGGAAGCCCG NM_016704 ENSMUSG00000022181 LENGTH = 2000 COMBINED P-VALUE = 1.87e-04 E-VALUE = 6.6 DIAGRAM: 587_[8]_93_[3]_219_[2]_432_[4]_583 [8] 8.2e-05 AACTTCAAAATGT +++++++ + +++ 526 AGATTCTGCTGAAAGAATACCCTGATATAGCTGTCTCTTGTGAGGCTATGCCAGTTCCTGGCAAATTCAGAGTGG CTCTTTGA + + 601 ATGCTCACAGTCGTCTATAGGATGGAACACAGGGTCCCCAATGGAGGAGCTAGAGAAAGCACCCAAGGAGCTGAA [3] 1.0e-05 AGTGCCCTGGGCCTTTAAGAG ++ ++++++ + + + ++ 676 GGGGTCTGCAACCCTATAGGTGAAACAACACTATGAACTAATCAGTACCCCCAGAGCTCGTGTCTCTAGCTGCAT [2] 3.4e-07 ATGAAAAGCAGTGATTCACAG + ++++++++ ++ + +++++ 901 TTGGAATAGCATTTGAAATGTAAATGAAGAAAATATCTAATAAAAAAGGAAATGGCACACAGGCTGATAGTGTCA [4] 6.9e-05 AGGAGAGAGCTCACACCTCTCCC ++++++++ ++++++ +++ 1351 GGATTTTCTTGGCAGAATCACTTGGTTTTGAATCTATTGTCACTAGGAGAGAAAAAAAATGGTTCCTAACTGAAT NM_146140 ENSMUSG00000044528 LENGTH = 2000 COMBINED P-VALUE = 1.88e-04 E-VALUE = 6.7 DIAGRAM: 443_[2]_103_[3]_254_[8]_99_[3]_23_[8]_386_[4]_98_[4]_443 [2] 1.3e-05 ATGAAAA + + +++ 376 AACCCCCATACCAGGCTGGAGGAATCACCCCAGTAAGAGGAGGAAAAAAGGCCCCAGAACTGAACATCAGATGAG GCAGTGATTCACAG ++ ++++ +++ + 451 TCAATGACACACCCACTCCCATTGTTAGGAGTCCCACAAAAACCCCAAGCCAAACCACCATAGCAAGTATGGGGA [3] 9.4e-05 AGTGCCCTGGGCCTTTAAGAG +++++ +++++++ +++ 526 TGAGGGAATGCAGATCCATGTATTCCCTGTGATTGGCGCTCATTCTCTCTGAGTCCTTAGGAGCCCAGCTTAGTT [8] 2.3e-05 AACTTCAAAATGTCTCTTTGA ++++++++ +++ +++ + + 826 CCGTATTTTCCTCTGGAAAATTCAAGATGCCTTCTATTGATACAATCTCCAAGCTAAATAAACAAGGTTCTATTT [3] 1.3e-05 AGTGCCCTGGGCC ++ +++ ++ +++ 901 ATACTGCGCTATGCTATGTAGCGAACTGAAGGATGACGATAAATCTCAGCACTCTAACGATAAACGCTTTGTTTC [8] 4.9e-05 TTTAAGAG AACTTCAAAATGTCTCTTTGA +++ +++ ++++ 
++++++ + + +++ 976 TTTTAAAAGGGCCTGTAGCGTTATTCGTCTGAACTGCATACTTTATACTTTCACCCATCATTCCTCTCCGTAAGG [4] 9.1e-05 AGGAGAGAGCTC + ++++++++ 1351 TTCTGAAACTGCTTCTTTGCATTGGTGTTGCCAGGTCTGACTGATTTAACATAGAGACAGCACAATACAGCGCTG ACACCTCTCCC + + + +++ + 1426 AGAATACTCTGGTTCCACTCAGGATTTAAATCGTGGCTACTGATATTTATTAGACCTGTGGTGATAATAATGGCA [4] 4.4e-06 AGGAGAGAGCTCACACCTCTCCC ++++++++++++ + ++++ 1501 GGCATGGTGTGTCACAAAGAGGTGGGTGGGGAAAAGGAGAGAGAAACAGAGAGACAGAGGCAAAGACAAAAAACA NM_026856 ENSMUSG00000049606 LENGTH = 2000 COMBINED P-VALUE = 1.93e-04 E-VALUE = 6.8 DIAGRAM: 397_[3]_368_[8]_269_[2]_170_[4]_199_[2]_457_[2]_12 [3] 6.5e-05 AGTGCCCTGGGCCTTTAAGAG ++++ +++++++++ ++ 376 GTTTAGACATAGTACTACCCAGAGTGGGCTGGGCCCTCTAATATTAATCAATAAGACGCCCCCAGACCAATCTGA [8] 4.9e-07 AACTTCAAAATGTCTCTTTGA +++ +++ + ++++++ +++ 751 TCAACACAAGCCTATACCCAACATTATAAATACTGAAAACTCAGAGTGTTTCCTTTTTGAACCAGGAGCGAGGCA [2] 4.0e-05 ATGAAAAGCAGTGATTCACAG ++++++++++ +++ ++ + 1051 AAGAGATGGTTCCTAAAAAGTAAATAATGGAAGGAAAGGAGATACGGAGGAAAAGAGGGGAGGAGAGGTGGGGGT [4] 2.9e-05 AGGAGAGA ++++++ + 1201 TTAATTCTGAACTATGTGGCTATACTAATTCTTCCTGCATTCCTATACATTGATAGCTGTCATGATTAGGAAAAA GCTCACACCTCTCCC ++++++ +++ + 1276 AAAAAAAAGTCATAATTAAATCTAAGAGGCTCTAAAAATTCCAAGTCTTCCAACTAGGAATCCTGTCAATTTTTT [2] 5.7e-05 ATGAAAAGCAG + +++++ +++ 1426 GCACACTGGTGTAGTCAGTCTCAGAAGGGCTTGAAATGGAGGATTCTGAGTTCAAAGCCAGCCCAGGGAAATCAG TGATTCACAG +++ + + + 1501 TGAGACCCTCCCCAAACAAACAGAGCGCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCA [2] 2.2e-05 ATGAAAAGCAGTGATTCACAG + ++ +++++ ++ ++++++ 1951 TTTCCCATGAAATAAAAAAAATAACAAATGTTTCACATTAGAGGCACTGT NM_153572 ENSMUSG00000041298 LENGTH = 2000 COMBINED P-VALUE = 1.94e-04 E-VALUE = 6.9 DIAGRAM: 97_[2]_458_[4]_196_[3]_861_[8]_102_[4]_156_[8] [2] 1.4e-05 ATGAAAAGCAGTGATTCACAG +++++++++++ + +++ +++ 76 ATAAATAAAAGCAAGTTTGTTCATGGAAGGAAGAGTCTCTCAGTCTAATTTTAAAATTAGTCCCATTACAAATGT [4] 9.6e-05 AGGAGAGAGCTCACACCTCTCCC ++ ++ +++ +++++++ ++++ 526 AGACCTTGGTAGTGAGCTTCAGGATCAAGCTAACAGCTCAGTGGGGGAGAGGGGTAAACGACAAAACCTGTCAGT [3] 9.7e-06 
AGTGCCCTGGGCCTTTAAGAG ++++++ + ++ ++++++++ 751 ATTTATAGGTTTTAGTAAGAAAAAGGGGTCAAATATCTTAAGCAACATACTAGGAGTTTTTAAAAGTGCCTCTAA [8] 7.2e-05 AACTTCAAAATGTCTCTTTGA +++++ ++++ + ++ +++ 1651 ACTCTAACAGGAAGTATTAGATTCTAGAGATTTAAAAATTATTCCTGAAGAACTTGTGATCATTCCACTCAAAGT [4] 8.2e-06 AGGAGAGAGCTCACACCTCTCCC + +++++++++ + +++++ ++ 1801 TGCACAGAGCTACCCCTTCAGCCCGGGCCACTGGCTCTCTTATTCTGTACACAAGTATGAGGTTTGGTAGCCTGG [8] 1.7e-05 AACTTCAAAATGTCTCTTTGA ++++++ + ++ ++++++++ 1951 GGCCTTCAAGTGAAGATTCCTCTGATTAAAACTTCCAGATTTTTTTTTTT NM_024251 ENSMUSG00000030051 LENGTH = 2000 COMBINED P-VALUE = 2.14e-04 E-VALUE = 7.6 DIAGRAM: 73_[2]_64_[4]_508_[8]_658_[4]_609 [2 2. AT 1 TTTCAACATCCTTGAATTTATACTGAGTGGAAGGGTTAAAGGACCCCTTAAAGATTGAGTCAGTGCAGGATGGGA ] 8e-08 GAAAAGCAGTGATTCACAG +++++ ++ ++++++++++ 76 AGAAGTCAAGGACTCACAGGTTCACCAACCCCTCCACAATAGAAGGTGTGAGTGTTAATCTTGCTTTGTCACTTG [4] 6.9e-05 AGGAGAGAGCTCACACCTCTCCC ++ + +++++++++++ + ++ 151 ACTGGACCTGGGGTTAGCTAAAACCTGTGCCTCTGGGCAGGGCTGTGAGGGTGTTTCCCTGAAGGACGGACTTGG [8] 4.6e-05 AACTTCAAAATGTCTCTTTGA + +++ ++ ++++++++ ++ 676 CTGATAATAGAGAAATATTTATTCTGTCTTTCTTCAGTTTAAGTAAATTTAACCCTGTGCAATTACATGGAGAGA [4] 3.7e-05 AGGAGAGAGCTCACACCTCTCCC + ++++++++ ++++++++++ 1351 GAGGCCAGAACCCTATCCACAGCAGAGAACGCACCTCTCAGGTGACCCTCTGAGAAGACAGTGGCTTGAAGCTTC NM_176831 ENSMUSG00000009768 LENGTH = 2000 COMBINED P-VALUE = 2.15e-04 E-VALUE = 7.6 DIAGRAM: 44_[8]_733_[3]_368_[2]_99_[8]_672 [8] 7.9e-07 AACTTCAAAATGTCTCTTTGA ++++++++++++ ++++ +++ 1 CTCTATGCAAAAAAACCCTCGCTTGAGAGCTGCAAAATACACTCAGATTCAAACTGCCTCTGTGATTTGTTTCTG [3] 3.9e-05 AGTGCCCTGGGCCTTTAAGAG +++++++ +++++++++++ 751 CACTTACGTGCAATGTTGCTCTCTGTTACCTGGTCAACCGTAGCCTGAAATACCAAAGTTCATTAAAACGATCCA [2] 2.3e-06 ATGAAAAGCAGTG + ++ ++++++++ 1126 TCGAGAAGCAGAGACGGGAGGCTCTCCGAGTTCCAAGCTACTCTGGTATAAAGAGGAAGTTCAGGACAGCCAGGG ATTCACAG ++ ++ ++ 1201 ATACAAAGAGAAACCTTGTCTCGAGACAACAAAACAAAACAACAGAAAGTCTAAGTTCTTGACTTGATACAATTT [8] 7.7e-05 AACTTCAAAATGTCTCTTTGA ++++ ++++++ +++ ++ 1276 
TAAAGTTGTATGTTGAGTTCACACAAGTTTTTAAATATTTACTGTATCTGTTTTGTATGTGTGTACACATGCAGA NM_172599 ENSMUSG00000037526 LENGTH = 2000 COMBINED P-VALUE = 2.22e-04 E-VALUE = 7.9 DIAGRAM: 1166_[2]_279_[8]_391_[8]_101 [2] 2.4e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 1126 TTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCT [8] 2.9e-05 AACTTCAAAATGTCTCTTTGA ++ + +++++ ++ ++ ++++ 1426 TCCATATCACCTGGAAGCTACTGCTGCCTGTCGGAGAGATTAGTTACAAAAAGGATTCTTGAGAGCTTGCCGACT [8] 2.7e-07 AACTTCAAAATGTCTCTTTGA +++ +++++++ ++ ++++ 1876 ACATTCTTTAAAATGTATCCTTTAATTTCTATGAAGTACTCCAGGTTAGCTTGCACTTTTGCTGCTACCATCAAT NM_028539 ENSMUSG00000048409 LENGTH = 2000 COMBINED P-VALUE = 2.30e-04 E-VALUE = 8.1 DIAGRAM: 226_[4]_1039_[2]_60_[8]_182_[8]_407 [4] 5.2e-06 AGGAGAGAGCTCACACCTCTCCC +++++ + +++ + ++ ++ + 226 GTGGAGAAAACTACAGCTAGTCTCGAGTATGACCAGGGGATGTGTGATTCTCTGACTTTAACATGAATTTGGAAA [2] 2.4e-06 ATGAAAAGCAGTGATTCACAG + ++ ++++++++ + +++++ 1276 CTCTATGAGTTTCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTTGACCCCCCATCTTCCCCCCCAAAAAGG [8] 7.9e-05 AACTTCAAAATGTCTCTTTGA + + +++++++++ ++++ 1351 TGAAACTAAAAATTTCATCTGTCTTCAAATGGTTCCTTTAAAACTGCACTCAGGAAACAGGCATCAGGGCATATG [8] 7.0 AAC ++ 1501 CTCTGAAAACAGATAAAGGTGAGTCTGTGTGCTTGGAATGTAGTTCAGTTGGTAGACTGCTTGCCTAGCTTATAA e-06 TTCAAAATGTCTCTTTGA ++ ++ ++ +++++++++ 1576 TTAATCCTAGCTCTTGGAAAGCAGGAGCAGGAGGACCAGAAGTTTGACAGTATCTTCAGCCAGTGGGAGATCTAG NM_025781 ENSMUSG00000031953 LENGTH = 2000 COMBINED P-VALUE = 2.35e-04 E-VALUE = 8.3 DIAGRAM: 55_[2]_61_[8]_123_[8]_23_[4]_823_[2]_624_[3]_163 [2] 9.4e-06 ATGAAAAGCAGTGATTCACA + + +++++ ++++ ++ + 1 GAGAAAGGAAGACTGCCACAAATTCAAGGCCAGCCTGGGCTTCATGAAGAGATGGAGGCTAGCCAAGGATACATA [8] 7.0e-05 G AACTTCAAAATGT + ++++ +++ + 76 GCAAGACATTGCATCAAATGAAATCTAAAAACAAACAAACAAATCAAGGTTCCCAACGCAAAGAATTTAAAGTAA CTCTTTGA ++++++ 151 TTCTTTATTGATAATAAACATTTATACTAGCATTAGAAATGTCAGTAGAAAAATAATATTTAAATAAATCTCTAA [8] 1.0e-05 AACTTCAAAATGTCTCTTT +++ + ++++++ ++++ + 226 TGTACAAAGATTTGATCATTTTGACTATAGCAGTTAAGTGCAATAACCTAATATAAAAAGTTAAACTGATTTTGG [4] 
2.0e-05 GA AGGAGAGAGCTCACACCTCTCCC + + ++++++ +++ ++ ++++++ 301 TCATAACCAGCCCTACACAGTACACAAGAAAGAAATACAAGTTCACAAAACCTTCTGCGCGTGGACCATTAATGG [2] 4.3e-05 ATGAAAAGCAGTGATTCACAG + ++++++++ ++ +++++ 1126 CAGAGAAACCTGTCTCCNNNNNNNNNNNNNNNNNNNNNNNNNNNCCAGAAAAACCACACACACACACCCCCAAAA [3] 1.2e-05 AGTGCCCTGGGCCTTTAAGAG ++ + + +++ + +++++++ 1801 GTGCAAGTGCCCACAGAGAGTCTTGGATATTTAAAAAGAAATTTTATTTATGTGTGACGTGAACAGCATGTAGAG NM_174924 ENSG00000169340 LENGTH = 2000 COMBINED P-VALUE = 2.36e-04 E-VALUE = 8.4 DIAGRAM: 779_[3]_64_[4]_957_[2]_135 [3] 1.4e-07 AGTGCCCTGGGCCTTTAAGAG ++ +++++++++ ++++++++ 751 AATTAGACAAGGTGATGTTGCAAGTGCTTAAGGCTATGGGTTTTTAAAAGATAGCCTCAGTGATTGCATGGAATA [4] 4.1e-05 AGGAGAGAGCTCACACCTCTCCC ++ ++++ ++++ +++ + ++ 826 TGGAGGGGTTTGGGATGGGGGCTGGGAGGGAGGCACAGAGGGGCAGCAAAAAGACCAGTTAGGAGACTTCAGCTG [2] 8.3e-06 ATGAAAAGCAGTGATTCACAG + +++++++ ++++ + +++ 1801 TTCTTCTAAGACTGCACCAACCAGGAAAAAGAGGTCATGAGTAAAACAGAGCCAAGGACACTCAGAGGCTTTATT NM_007003 ENSG00000101951 LENGTH = 2000 COMBINED P-VALUE = 2.37e-04 E-VALUE = 8.4 DIAGRAM: 311_[4]_68_[2]_1483_[4]_71 [4] 6.9e-05 AGGAGAGAGCTCACACCTCTCCC ++++++ + ++ ++ +++ +++ 301 GCACCAGGTGCAGGAGACCTATGACCCCTAGCACTGCCTTTTATCTGCTGGGTTCCCCTTGGAGAACCATATATC [2] 1.9e-05 ATGAAAAGCAGTGATTCACAG + ++++++++ + ++ + ++ 376 TTTGGAGCCCCAGAACATGCTGGTTGCACAAGAACAAAGCACACTTACTGTCTGAGAGGAAGGCTGGTGGTGTGG [4] 1.4e-07 AGGAGAGAGCTCACACCTCTCCC +++++++ +++ +++ + ++++ 1876 GGCACAGAGCCTCTAGGAGGCGGGAGAGGGCTGGAAAGACAACCCACGTGACACACCCCCGAGGTTTGGGATTGG NM_028604 ENSMUSG00000019792 LENGTH = 2000 COMBINED P-VALUE = 2.39e-04 E-VALUE = 8.5 DIAGRAM: 577_[8]_511_[3]_310_[4]_356_[4]_158 [8] 1.1e-06 AACTTCAAAATGTCTCTTTGA +++ ++++++++++++ +++ 526 GAGTTTTCTCAAAGCGGAAGAAGTAGAAGTCCTGAGTTACCTCCCAGAGCTGAACCTCAAACTGTTTTATTGCTC [3] 1.8e-06 AGTGCCCTGGGCCTTT + + + +++++ +++ 1051 GCTCTGTAACTTCTGTAGGAAGTAAAGGCTCGTTTTGTAGGTAACAAAGTTTGAGGGCAATCATTTTGGGTTTTT AAGAG ++++ 1126 GAGAGTGGGTTTCTCTTCAGGTAGCTCTGTCTGTACTGGATATCTCTCTGTAGAGCAGGCTGGCTTCAAAGTCAA [4] 9.6e-05 AGGAGAGAGCTCACACCTCTCCC + ++++ ++ 
+++++++++++ 1426 GAACTTACTAATAAAAAATAAGAAAATAAACCTCACAGCTGATGCTCAAAAGATTTCAATTAATAATAATAAAAG [4] 8.3e-05 AGGAGAGAGCTCACACCTCTCCC ++++++++++++ + + + + 1801 AAATCAGGAGCACATGACAAGGAGAGCGCAACTAAATGCCTGACCTCTGCACAGCATTGGTCATTTGTGGCTCCT NM_013166 ENSRNOG00000012460 LENGTH = 2000 COMBINED P-VALUE = 2.40e-04 E-VALUE = 8.5 DIAGRAM: 605_[8]_749_[2]_604 [8] 2.0e-07 AACTTCAAAATGTCTCTTTGA +++ + +++++++++++++ 601 TGAGATTCTTAAGAATGTTTCTTGGATCAATCTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN [2] 7.0e-06 ATGAAAAGCAGTGATTCACAG +++++++++ ++++ + ++ 1351 AAGGTCTCTTGGGTTTTAAAGATAAGTAAAAACAAATGATACTGACAAATTCTGGATTCACAGTGTACAGGCCTC NM_153399 ENSMUSG00000040566 LENGTH = 2000 COMBINED P-VALUE = 2.41e-04 E-VALUE = 8.5 DIAGRAM: 39_[2]_895_[4]_1022 [2] 1.5e-06 ATGAAAAGCAGTGATTCACAG + ++++++++ ++++ + + 1 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGAGGGAAAGAAAGGATAGATTCTCTACCTCCCCCTTC [4] 7.6e-07 AGGAGAGAGCTCACACCTCT ++++++++ +++++++++ + 901 CCAGGTGAAAGGGTTTGAGGATATTTGATTCCTTCCAAGCGTGAACCATTGGAGCAGGAAAGCACACACACTTGA CCC + 976 GAAGAGTTGTCTATCTGACCCCAGCCCCCCTGTCTCTTCTGGGAATGTGCTCAGGGGAAGGCAGTGCAAGTGCTA NM_173084 ENSG00000175447 LENGTH = 2000 COMBINED P-VALUE = 2.43e-04 E-VALUE = 8.6 DIAGRAM: 201_[2]_643_[8]_634_[3]_459 [2] 1.7e-06 ATGAAAAGCAGTGATTCACAG ++++++++++++ +++++ + 151 CATACATCAACTTTTGCAATCTTCCTGCTGGTAAAAATACAAAACAGCAGCATAAGAACCAGTTACTCATATACT [8] 2.7e-06 AACTTCAAAATGTCTCTTTGA ++ ++ +++++++++++++ 826 GTTTATGTGCAATCTGTAGGGCCAGGAGCATAGTCTAATCTGACCCAGAATGTCTTTTTGATTTGATATAGGTAA [3] 3.2e-05 AGTGCCCTGGGCCTTTAAGAG + + ++++ ++ + +++++ 1501 TTGCAGACCAAGTTACCAACATGATTCTGTTCTAATAAGAATGAGTTTTTAATCCAAGAACTGATTTGACTGACG NM_009412 ENSMUSG00000027506 LENGTH = 2000 COMBINED P-VALUE = 2.44e-04 E-VALUE = 8.6 DIAGRAM: 125_[8]_526_[2]_87_[2]_483_[8]_83_[4]_70_[8]_498 [8] 9.7e-05 AACTTCAAAATGTCTCTTTGA +++++ ++ ++ ++++ ++++ 76 TTCTCCTTCTCTCCACTTAATCATTTTCCTCATATGAAATCTCTATACAAAACTTGAAGATTTCTTATTTAAAAA [2] 9.8 ATG + + 601 CTCTGAAAAAGGCTTTAGGTCACTCCAAGCTTGGCAGTTAACATTTGGCATGGACACTGGTAAAACCACAATAGA e-07 
AAAAGCAGTGATTCACAG +++++++++ ++ + ++ 676 GAAAGAAGTAACAGATACAACCTCATAGGCCATAAACATACACAGCAAAGGAAGCAGGAGTGAGTCAGGAGTGAC [2] 1.3e-05 ATGAAAAGCAGTGATTCACAG ++++++++ ++++++++++ 751 CACAGTTAACAGAGGGGCTTAGCAGAGCCACTGGGAAGCTTTGACTCACAGCCCCGACAGTTCTTAGAGTGGGAA [8] 5.6e-05 AACTTCAAAATGTCTCTTTGA +++++ ++ + +++++++ 1276 ATTTACCTGAAATTAAATTTTAATCTTGTAAAAAAATGTGTGAGAATGGTATAATAAATAACTGATAAGCATATT [4] 1.4e-05 AGGAGAGAGCTCACACCTCTCCC +++ +++++++ + ++ +++ + 1351 TGAGCTCAAATAGATGTAGCTACAAATGACAGTTACCAAGGCAAGAGCAGGACCCACACTCTCCCCACTGGAAGT [8] 1.2e-05 AACTTCAAAATGTCTCTTT ++++ + +++++ + ++ 1426 TTGAGTGTCTGCATGTTTTTAATTTACAGAACTATAGAACTGCATACACACCTGTCTACTTTAGTCTGGTGTCTT GA ++ 1501 TAGAAACCCAAGATATGGTCCTCACAGAACTGTTTATCCGAGAAAAAAAATGTAGCTGGTAGTTTCTCCTTTGAA NM_005565 ENSG00000043462 LENGTH = 2000 COMBINED P-VALUE = 2.62e-04 E-VALUE = 9.3 DIAGRAM: 878_[8]_563_[2]_409_[8]_87 [8] 7.2e-05 AACTTCAAAATGTCTCTTTGA +++ + ++++++ +++++ 826 CAGTCATAGCCATGGAACAGAACATCTTAATCCTGATGGCACTTTCCACTAATAGACTTAAAATGAGCTTTGGTT [2] 3.9e-05 ATGAAAAGCAGTGATTCACAG ++++++++ +++++ +++ 1426 TTGTGCAATAACAACCAAGGTCCCAAAGGGAAAAACTTGAAGAACAACTGATTGTCACTATTTGGCTGAATACCA [8] 1.1e-08 AACTTCAAAATGTCTCTTTGA +++++ ++++++++++ +++ 1876 ACCAAAGTCTCATCCCTAAATTTAAAATGTTTTCTTTTGGTATTTGTAGTTTAAATTTTCTTTAAGACAAAAATA NM_145514 ENSMUSG00000038733 LENGTH = 2000 COMBINED P-VALUE = 2.67e-04 E-VALUE = 9.4 DIAGRAM: 314_[3]_436_[8]_635_[8]_521_[8]_10 [3] 1.9e-05 AGTGCCCTGGGCCTTTAAGAG ++ + ++++++ ++ ++++ 301 TTCTCTGATTTTCTAAAGTGATGGTCTCTGAAAAATGAAGCTTAAAGGAATCTTTTCCATTACCCATTTGAATTA [8] 8.6e-05 AACTTCAAAATGTCTCTTTGA ++++ ++ + +++++++ 751 ATGTTAGATATAGGGTCTTCAAAATGACAAGTCACTTTTTTTCCATAAAGGAAACATTCCCGTAAACTAAAAGGG [8] 2.1e-05 AACTTCAAAATGTCTCTTTGA +++++ ++++ + +++++ + 1426 AATACTTCTTAATTTATTTTTAAAAAATTGTGCTGTTAACCCTTTTACGGGGCAACAACTATGTGAAAAGTACAA [8] 5.6e-08 AACTTCAAAATGTCTCTTTGA +++++++++++++++++++ 1951 AAATGTTTGGCCCCAATCGGGCTTCAAAATGGTTCTTTTTTTTTTTTTTT NM_000319 ENSG00000139197 LENGTH = 2000 COMBINED P-VALUE = 2.68e-04 E-VALUE = 9.5 DIAGRAM: 
118_[2]_227_[3]_160_[8]_1250_[4]_159 [2] 2.2e-05 ATGAAAAGCAGTGATTCACAG + +++ ++++ ++ + +++++ 76 GGGTGTAGAAGTGGGGCCTCTGAAAGAGGGCGAAAAGGCAAGGAAAAGGAGAAATGTTGCACAGTCTTATGGTCT [3] 3.9e-05 AGTGCCCTG + ++++++ 301 CCTGCTGCCTGCCTGCCTCCGCTCATGCACCCTGGGCTGGGATGGTACTTCTGTTCGTCTGGCATTATTGCCCTT GGCCTTTAAGAG +++++++ + + 376 GGTCATTTACCGGCAGCCCTGGGCCCCTCCTTGCTCTTCTCCATGACACTAGGACTCCCTTGGTCTTGAAGCTGG [8] 1.0e-05 AACTTCAAAATGTCTCTTTGA ++++ +++ +++++++++ 526 CTTGAGGCTTAGTCTTATTCCGTGATTAAAAGCATTTCTTGTATGTTTGTCCTTTCCCAGCCTGGAGTGATGGGA [4] 3.2e-06 AGGAGAGAGCTCACACCTCTCCC +++++++++ ++ +++++ + + 1801 GCCTTTGAGGGGGGCGGCAGGAGAGAGTACCGACCTCCCTCGAACTCCTGGCAGAGGTGGGGGTCGCAGCAAAAG NM_139063 ENSMUSG00000038982 LENGTH = 2000 COMBINED P-VALUE = 2.71e-04 E-VALUE = 9.6 DIAGRAM: 622_[4]_876_[4]_203_[2]_232 [4] 2.6e-08 AGGAGAGAGCTCACACCTCTCCC ++++++++ ++ ++ + +++++ 601 GTCCCCAGAGTCCACATGGCAGAGGAGAGAACTGACTCGACTCCCACAAATTATCCTCTGATCTCCACACGTGTG [4] 1.9e-06 AGGAGAGAGCTCACACCTCTCCC +++++++ + +++++++ ++ 1501 ACAAAACTGAGGACATTAACATGGAAAGCTTAGAAACTTCGTCCTGGTCCACACCCTTAGAAAGGCGTATGCTGT [2] 3.8e-05 ATGAAAAGCAGTGATTCACAG + +++ +++ +++++ ++ ++ 1726 AGTGGTCACTAACGACATTCCAAAAAGTGCCTGTGATACAAAGGGGATTTTCACTTGCGGCTCCCTCCCCTCCCT NM_009980 ENSMUSG00000030970 LENGTH = 2000 COMBINED P-VALUE = 2.78e-04 E-VALUE = 9.8 DIAGRAM: 437_[3]_1109_[8]_62_[2]_308_[8] [3] 9.4e-05 AGTGCCCTGGGCC ++ + +++ + 376 CTGACCACACATAATTTGATAGGTACCCATTCTTCCCTGTGGGAAAGAATTAAAGAAGGCTCCATTTCTTGGACT TTTAAGAG +++++++ 451 CTTAAGATAGGAATGGGGCTTTCATTAGTTTCTCTAGCAGACTGCTGGCTGTGAAATGGCCAACCCCTTTGGGAA [8] 4.2e-05 AACTTCAA + ++ ++ 1501 AAATACAGTAACGATAACTCACACTAAAACAAAACATTTCTGATAGCCATTATTTTTCTGTTTGGGACAGTTTAA AATGTCTCTTTGA + + ++++++++ 1576 AGTTTTTTTTTTCTTTTGTGTCACAAAAACAGGAATGTACCTATACAAAGGCTCAAAATAGGCCATCTTTAAAAA [2] 1.6e-06 ATGAAAAGCAGTGATTCACAG ++++++++ +++++++++ 1651 CAAAAAGGCAATGATTCACAAAAGACTATGAATAGAACATGTAACTAGCTGATACAAATCTAATAGGATTTGTTA [8] 4.9e-07 AACTTCAAAATGTCTCTTTGA +++++ +++++ ++++ +++ 1951 AAATAGATTTGAGTACAAACAGCTTGTGAAACTTAATACTTTTTTCTTTT 
******************************************************************************** CPU: crick Time 67.670000 secs. mast acro1.set.fasta.conserved.25.meme -m 2 -m 3 -m 4 -m 8 -text -brief -remcorr -norc -stdout -d /home/sao/applications/mask_db/upstream.masked -bfile /home/sao/proj_motif/model.bckg test.gcg100644000766000024 112114006427353 16030 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataRea guano receptor type III >> 0.1 roa1_drome Length: 358 Thu Apr 17 12:09:52 2003 Type: P Check: 8903 .. 1 MVNSNQNQNG NSNGHDDDFP QDSITEPEHM RKLFIGGLDY RTTDENLKAH 51 EKWGNIVDVV VMKDPRTKRS RGFGFITYSH SSMIDEAQKS RPHKIDGRVE 101 PKRAVPRQDI DSPNAGATVK KLFVGALKDD HDEQSIRDYF QHFGNIVDNI 151 VIDKETGKKR GFAFVEFDDY DPVDKVVLQK QHQLNGKMVD VKKALPKNDQ 201 QGGGGGRGGP GGRAGGNRGN MGGGNYGNQN GGGNWNNGGN NWGNNRGNDN 251 WGNNSFGGGG GGGGGYGGGN NSWGNNNPWD NGNGGGNFGG GGNNWNGGND 301 FGGYQQNYGG GPQRGGGNFN NNRMQPYQGG GGFKAGGGNQ GNYGNNQGFN 351 NGGNNRRY in.fasta100644000766000024 401014006427353 16015 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataFASTA searches a protein or DNA sequence data bank version 35.03 Feb. 18, 2008 Please cite: W.R. Pearson & D.J. 
Lipman PNAS (1988) 85:2444-2448 Query: fake.fa 1>>>TM1000020|total:2746_L:1626_-3:439_0:279_+3:402 32 nt - 32 nt Library: /home/va473/databases/contaminants/contaminants.fasta 1055411 residues in 2012 sequences 1055411 residues in 2016 sequences Statistics: Expectation_n fit: rho(ln(x))= 0.9735+/-0.00348; mu= 31.8193+/- 0.161 mean_var=45.4281+/-12.600, 0's: 0 Z-trim: 6 B-trim: 532 in 2/40 Lambda= 0.190288 Algorithm: FASTA (3.5 Sept 2006) [optimized] Parameters: +5/-4 matrix (5:-4) ktup: 2 join: 65, opt: 50, open/ext: -100/-4, width: 16 Scan time: 0.230 The best scores are: opt bits E(2016) DQ345959.1 | Symbols: | chloroplast | Gossypi (160301) [r] 142 50.2 2.7e-08 >>DQ345959.1 | Symbols: | chloroplast | Gossypium hirsu (160301 nt) rev-comp initn: 141 init1: 141 opt: 142 Z-score: 196.2 bits: 50.2 E(): 2.7e-08 banded Smith-Waterman score: 142; 93.8% identity (93.8% similar) in 32 nt overlap (32-1:31483-31514) 30 20 10 TM100- GCCGGTGCTCTGACCAATTGAACTAGAATC : ::::::::::::::::::::::: :::: DQ3459 GGCTCGAACCCGCAGCTTCCGCCTTGACAGGGCGGTGCTCTGACCAATTGAACTACAATC 31460 31470 31480 31490 31500 31510 TM100- CC :: DQ3459 CCAGGGAAATAAAGAAAAGTGTACAACAGAGATAGTCTTATGATTTCATTCATTTTCTAT 31520 31530 31540 31550 31560 31570 32 residues in 1 query sequences 1055411 residues in 2012 library sequences Tcomplib [35.03] (4 proc) start: Sun Jul 6 00:14:21 2008 done: Sun Jul 6 00:14:21 2008 Total Scan time: 0.230 Total Display time: 0.010 Function used was FASTA [version 35.03 Feb. 
18, 2008] test.ace100644000766000024 216514006427353 16031 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data Protein : "roa1_drome" Peptide "roa1_drome" Peptide : "roa1_drome" MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVV VMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVK KLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQK QHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGN NWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGND FGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY Sequence : "Name\; 4\% strewn with \\ various \/ escaped characters" DNA "Name\; 4\% strewn with \\ various \/ escaped characters" DNA : "Name\; 4\% strewn with \\ various \/ escaped characters" gctcggactctatctagcagaaacctcgttcagctagtcttgcttcatggaggtttgatc tagactgcaaacgtcggtgctaaaagaccatacttccgtatgtgcctatcgggagcagtc gctgagaagtgcggaatgatccttcaatgaccgccgttaaagcctgggagtccgcgccac aatcattccatatacagcaacacgcgctacgcggacctctcggtgggtgacgattctatt gaggcgttgaagcgagaaagatattccgattcttttcgagtctatagttaaatcggactg catcatccattttagggcata Sequence : "Last" DNA "Last" DNA : "Last" ggccgggatggccggacctgttctgaacatcttatatccacccgaacaagttataaacaa tttaaatctgggcggccatctataagcgtgtcttcagtatgagagtcttcggatatcacg acccattaggaaag test.raw100644000766000024 100614006427353 16063 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataMVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVVVMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVKKLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQKQHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY MNKQRGTYSEVSLAQDPKRQQRKLKGNKISISGTKQEIFQVELNLQNASSDHQGNDKTYHCKGLLPPPEKLTAEVLGIICIVLMATVLKTIVLIPCIGVLEQNNFSLNRRMQKARHCGHCPEEWITYSNSCYYIGKERRTWEERVCWPVLRRTLICFL test.nhx100644000766000024 63014006427353 16051 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data(((ADH2[&&NHX:S=human:E=1.1.1.1]:0.1,ADH1:0.11[&&NHX:S=human:E=1.1.1.1]):0.05[&&NHX:S=Primates:E=1.1.1.1:D=Y:B=100],ADHY:0.1[&&NHX:S=nematode:E=1.1.1.1],ADHX:0.12[&&NHX:S=insect:E=1.1.1.1]):0.1[&&NHX:S=Metazoa:E=1.1.1.1:D=N],(ADH4:0.09[&&NHX:S=yeast:E=1.1.1.1],ADH3:0.13[&&NHX:S=yeast:E=1.1.1.1],ADH2:0.12[&&NHX:S=yeast:E=1.1.1.1],ADH1:0.11[&&NHX:S=yeast:E=1.1.1.1]):0.1[&&NHX:S=Fungi])[&&NHX:E=1.1.1.1:D=N];sim4.rev100644000766000024 64714006427353 15755 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data seq1 = /nfs/disk21/birney/prog/wise2/example/human.rev, 5368 bp seq2 = temp.cdna (>HSHNCPA1), 1198 bp (complement) 486-503 (10-27) 83% == 1048-1117 (194-265) 93% <- 1408-1563 (266-421) 100% <- 2501-2575 (422-496) 100% <- 2710-2802 (497-589) 100% <- 2889-2981 (590-682) 100% <- 3075-3285 (683-893) 100% <- 3434-3580 (894-1040) 100% <- 3876-3992 (1041-1157) 100% <- 4556-4596 (1158-1198) 100% amino.fa100644000766000024 67614006427353 16000 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>CYS1_DICDI fragment SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE >ALEU_HORVU MAHARVLLLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGK SYESAAEVRRRFRIFSESLEEVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRD AAALPETKDWREDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNG GLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAF QVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCA IATCASYPVVAA meme.dat100644000766000024 5334614006427353 16044 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data******************************************************************************** MEME - Motif discovery tool ******************************************************************************** MEME version 3.0 (Release date: 2001/03/05 14:24:28) For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu. 
This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu. ******************************************************************************** ******************************************************************************** REFERENCE ******************************************************************************** If you use this program in your research, please cite: Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994. ******************************************************************************** ******************************************************************************** TRAINING SET ******************************************************************************** DATAFILE= D10Mit194.set.genbank.fasta.nref ALPHABET= ACGT Sequence name Weight Length Sequence name Weight Length ------------- ------ ------ ------------- ------ ------ 20218 1.0000 2000 10657 1.0000 2000 83796 1.0000 2000 6603 1.0000 2000 ******************************************************************************** ******************************************************************************** COMMAND LINE SUMMARY ******************************************************************************** This information can also be useful in the event you wish to report a problem with the MEME software. 
command: meme D10Mit194.set.genbank.fasta.nref -dna -print_fasta -nmotifs 3 -maxw 25 model: mod= zoops nmotifs= 3 evt= inf object function= E-value of product of p-values width: minw= 8 maxw= 25 minic= 0.00 width: wg= 11 ws= 1 endgaps= yes nsites: minsites= 2 maxsites= 4 wnsites= 0.8 theta: prob= 1 spmap= uni spfuzz= 0.5 em: prior= dirichlet b= 0.01 maxiter= 50 distance= 1e-05 data: n= 8000 N= 4 strands: + sample: seed= 0 seqfrac= 1 Letter frequencies in dataset: A 0.255 C 0.236 G 0.252 T 0.257 Background letter frequencies (from dataset with add-one prior applied): A 0.255 C 0.236 G 0.252 T 0.257 ******************************************************************************** ******************************************************************************** MOTIF 1 width = 25 sites = 4 llr = 106 E-value = 1.2e-002 ******************************************************************************** -------------------------------------------------------------------------------- Motif 1 Description -------------------------------------------------------------------------------- Simplified A :8:88aaa:53a8:::3:a::3::3 pos.-specific C a3::3:::::3:::5a8a:8a5aa8 probability G ::8:::::3:5::::::::3::::: matrix T ::33::::85::3a5::::::3::: bits 2.1 * * * * ** 1.9 * *** * * * ** * ** 1.7 * *** * * * ** * ** 1.5 * *** * * * ** * ** Information 1.2 ********* *** ****** *** content 1.0 ********** ********** *** (38.1 bits) 0.8 ********** ********** *** 0.6 ********** ************** 0.4 ************************* 0.2 ************************* 0.0 ------------------------- Multilevel CAGAAAAATAGAATCCCCACCCCCC consensus CTTC GTA T T A G A A sequence C T -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- 
--------- ------------------------- 6603 1311 2.59e-15 GGCGCATTGA CAGAAAAATTGAATTCCCACCCCCC AATGAGGAGG 83796 1284 2.59e-15 GGAGGATTGA CAGAAAAATTGAATTCCCACCCCCC AACGAGGAGG 20218 938 6.34e-12 TTTTTGGTAA CCTTAAAATAAAATCCCCACCACCA CTTTTAAAAA 10657 1685 8.70e-12 GGCCCGCGCG CAGACAAAGACATTCCACAGCTCCC GCCCCCTCCA -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- 6603 2.6e-15 1310_[1]_665 83796 2.6e-15 1283_[1]_692 20218 6.3e-12 937_[1]_1038 10657 8.7e-12 1684_[1]_291 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 in FASTA format -------------------------------------------------------------------------------- >6603 pos 1311 CAGAAAAATTGAATTCCCACCCCCC >83796 pos 1284 CAGAAAAATTGAATTCCCACCCCCC >20218 pos 938 CCTTAAAATAAAATCCCCACCACCA >10657 pos 1685 CAGACAAAGACATTCCACAGCTCCC -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 4 w= 25 n= 7904 bayes= 11.6849 E= 1.2e-002 -865 208 -865 -865 156 8 -865 -865 -865 -865 157 -4 156 -865 -865 -4 156 8 -865 -865 197 -865 -865 -865 197 -865 -865 -865 197 -865 -865 -865 -865 -865 -1 154 97 -865 -865 96 -3 8 99 -865 197 -865 -865 -865 156 -865 -865 -4 -865 -865 -865 196 -865 108 -865 96 -865 208 -865 -865 -3 166 -865 -865 -865 208 -865 -865 197 -865 -865 -865 -865 166 -1 -865 -865 208 -865 -865 -3 108 -865 -4 -865 208 -865 -865 -865 208 -865 -865 -3 166 -865 -865 
-------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 4 w= 25 n= 7904 E= 1.2e-002 0.000635 0.998096 0.000629 0.000640 0.748765 0.249966 0.000629 0.000640 0.000635 0.000589 0.748759 0.250017 0.748765 0.000589 0.000629 0.250017 0.748765 0.249966 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.000635 0.000589 0.250006 0.748770 0.499388 0.000589 0.000629 0.499393 0.250012 0.249966 0.499382 0.000640 0.998141 0.000589 0.000629 0.000640 0.748765 0.000589 0.000629 0.250017 0.000635 0.000589 0.000629 0.998147 0.000635 0.499343 0.000629 0.499393 0.000635 0.998096 0.000629 0.000640 0.250012 0.748719 0.000629 0.000640 0.000635 0.998096 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.000635 0.748719 0.250006 0.000640 0.000635 0.998096 0.000629 0.000640 0.250012 0.499343 0.000629 0.250017 0.000635 0.998096 0.000629 0.000640 0.000635 0.998096 0.000629 0.000640 0.250012 0.748719 0.000629 0.000640 -------------------------------------------------------------------------------- Time 38.44 secs. 
******************************************************************************** ******************************************************************************** MOTIF 2 width = 25 sites = 4 llr = 101 E-value = 4.5e-001 ******************************************************************************** -------------------------------------------------------------------------------- Motif 2 Description -------------------------------------------------------------------------------- Simplified A ::::a::383:5:3:::a33:83:a pos.-specific C a3a8:3a33:a::5::8::::383: probability G ::::::::::::::::3:38a::8: matrix T :8:3:8:5:8:5a3aa::5:::::: bits 2.1 * * * * * 1.9 * * * * * * ** * * * 1.7 * * * * * * ** * * * 1.5 * * * * * * ** * * * Information 1.2 ******* *** * **** ****** content 1.0 ******* ***** **** ****** (36.6 bits) 0.8 ******* ***** **** ****** 0.6 ******* ********** ****** 0.4 ************************* 0.2 ************************* 0.0 ------------------------- Multilevel CTCCATCTATCATCTTCATGGACGA consensus C T C ACA T A G AA CAC sequence C T G -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- ------------------------- 6603 1000 1.62e-15 CGGGAACATG CTCCATCTATCATCTTCATGGACGA AATCGACTCC 83796 978 4.69e-15 CGAGAACATG CTCCATCCATCATCTTCATGGACGA GATTGACTCT 20218 1545 1.69e-11 TAGCTTCTCT CCCCATCAATCTTATTCAGAGCCCA CCCCTCCCCC 10657 1075 3.40e-11 AGGATCTGGT CTCTACCTCACTTTTTGAAGGAAGA AACACTTAAT -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE 
MOTIF DIAGRAM ------------- ---------------- ------------- 6603 1.6e-15 999_[2]_976 83796 4.7e-15 977_[2]_998 20218 1.7e-11 1544_[2]_431 10657 3.4e-11 1074_[2]_901 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 in FASTA format -------------------------------------------------------------------------------- >6603 pos 1000 CTCCATCTATCATCTTCATGGACGA >83796 pos 978 CTCCATCCATCATCTTCATGGACGA >20218 pos 1545 CCCCATCAATCTTATTCAGAGCCCA >10657 pos 1075 CTCTACCTCACTTTTTGAAGGAAGA -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 4 w= 25 n= 7904 bayes= 10.9476 E= 4.5e-001 -865 208 -865 -865 -865 8 -865 154 -865 208 -865 -865 -865 166 -865 -4 197 -865 -865 -865 -865 8 -865 154 -865 208 -865 -865 -3 8 -865 96 156 8 -865 -865 -3 -865 -865 154 -865 208 -865 -865 97 -865 -865 96 -865 -865 -865 196 -3 108 -865 -4 -865 -865 -865 196 -865 -865 -865 196 -865 166 -1 -865 197 -865 -865 -865 -3 -865 -1 96 -3 -865 157 -865 -865 -865 198 -865 156 8 -865 -865 -3 166 -865 -865 -865 8 157 -865 197 -865 -865 -865 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 4 w= 25 n= 7904 E= 4.5e-001 0.000635 0.998096 0.000629 0.000640 0.000635 0.249966 0.000629 0.748770 0.000635 0.998096 0.000629 0.000640 0.000635 0.748719 0.000629 0.250017 0.998141 0.000589 0.000629 0.000640 0.000635 0.249966 0.000629 0.748770 0.000635 0.998096 0.000629 0.000640 0.250012 
0.249966 0.000629 0.499393 0.748765 0.249966 0.000629 0.000640 0.250012 0.000589 0.000629 0.748770 0.000635 0.998096 0.000629 0.000640 0.499388 0.000589 0.000629 0.499393 0.000635 0.000589 0.000629 0.998147 0.250012 0.499343 0.000629 0.250017 0.000635 0.000589 0.000629 0.998147 0.000635 0.000589 0.000629 0.998147 0.000635 0.748719 0.250006 0.000640 0.998141 0.000589 0.000629 0.000640 0.250012 0.000589 0.250006 0.499393 0.250012 0.000589 0.748759 0.000640 0.000635 0.000589 0.998135 0.000640 0.748765 0.249966 0.000629 0.000640 0.250012 0.748719 0.000629 0.000640 0.000635 0.249966 0.748759 0.000640 0.998141 0.000589 0.000629 0.000640 -------------------------------------------------------------------------------- Time 78.29 secs. ******************************************************************************** ******************************************************************************** MOTIF 3 width = 21 sites = 4 llr = 88 E-value = 4.8e-001 ******************************************************************************** -------------------------------------------------------------------------------- Motif 3 Description -------------------------------------------------------------------------------- Simplified A :::58:::::3::::33:::: pos.-specific C 8:833::8:8:a:8a8:a5a8 probability G 3::::::3:::::3::::::: matrix T :a33:aa:a38:a:::8:5:3 bits 2.1 * * * * 1.9 * ** * ** * * * 1.7 * ** * ** * * * 1.5 * ** * ** * * * Information 1.2 *** ************** ** content 1.0 *** ***************** (31.8 bits) 0.8 *** ***************** 0.6 *** ***************** 0.4 ********************* 0.2 ********************* 0.0 --------------------- Multilevel CTCAATTCTCTCTCCCTCCCC consensus G TCC G TA G AA T T sequence T -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 sites sorted by position p-value 
-------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------------- 10657 1511 1.45e-13 CCCAGGCGGT CTCAATTCTCTCTCCCTCCCC TTTCCGTGAC 83796 1801 7.40e-12 TGTATATGCA CTCTCTTCTCTCTCCCTCTCC AGGTCATGCA 6603 1811 1.22e-10 GTAACTTAAT GTTCATTCTCTCTCCCACCCC TAGGTCATGC 20218 606 7.57e-10 CCCAGGCCAG CTCAATTGTTACTGCATCTCT AGGATTGGAA -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- 10657 1.5e-13 1510_[3]_469 83796 7.4e-12 1800_[3]_179 6603 1.2e-10 1810_[3]_169 20218 7.6e-10 605_[3]_1374 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 in FASTA format -------------------------------------------------------------------------------- >10657 pos 1511 CTCAATTCTCTCTCCCTCCCC >83796 pos 1801 CTCTCTTCTCTCTCCCTCTCC >6603 pos 1811 GTTCATTCTCTCTCCCACCCC >20218 pos 606 CTCAATTGTTACTGCATCTCT -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 4 w= 21 n= 7920 bayes= 10.9506 E= 4.8e-001 -865 166 -1 -865 -865 -865 -865 196 -865 166 -865 -4 97 8 -865 -4 156 8 -865 -865 -865 -865 -865 196 -865 -865 -865 196 -865 166 -1 -865 -865 -865 -865 196 -865 166 -865 -4 -3 -865 -865 154 -865 208 -865 -865 -865 -865 -865 196 -865 166 -1 -865 -865 208 -865 -865 -3 166 -865 -865 -3 -865 -865 154 -865 208 -865 -865 -865 108 -865 96 -865 208 
-865 -865 -865 166 -865 -4 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 4 w= 21 n= 7920 E= 4.8e-001 0.000635 0.748719 0.250006 0.000640 0.000635 0.000589 0.000629 0.998147 0.000635 0.748719 0.000629 0.250017 0.499388 0.249966 0.000629 0.250017 0.748765 0.249966 0.000629 0.000640 0.000635 0.000589 0.000629 0.998147 0.000635 0.000589 0.000629 0.998147 0.000635 0.748719 0.250006 0.000640 0.000635 0.000589 0.000629 0.998147 0.000635 0.748719 0.000629 0.250017 0.250012 0.000589 0.000629 0.748770 0.000635 0.998096 0.000629 0.000640 0.000635 0.000589 0.000629 0.998147 0.000635 0.748719 0.250006 0.000640 0.000635 0.998096 0.000629 0.000640 0.250012 0.748719 0.000629 0.000640 0.250012 0.000589 0.000629 0.748770 0.000635 0.998096 0.000629 0.000640 0.000635 0.499343 0.000629 0.499393 0.000635 0.998096 0.000629 0.000640 0.000635 0.748719 0.000629 0.250017 -------------------------------------------------------------------------------- Time 117.82 secs. 
******************************************************************************** ******************************************************************************** SUMMARY OF MOTIFS ******************************************************************************** -------------------------------------------------------------------------------- Combined block diagrams: non-overlapping sites with p-value < 0.0001 -------------------------------------------------------------------------------- SEQUENCE NAME COMBINED P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- 20218 7.77e-19 605_[3(7.57e-10)]_311_[1(6.34e-12)]_582_[2(1.69e-11)]_431 10657 5.46e-22 1_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_7_[1(6.00e-08)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_5_[1(1.18e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_4_[1(1.29e-07)]_332_[2(3.40e-11)]_383_[3(7.75e-07)]_7_[3(1.45e-13)]_56_[3(3.47e-05)]_76_[1(8.70e-12)]_291 83796 1.73e-27 977_[2(4.69e-15)]_281_[1(2.59e-15)]_492_[3(7.40e-12)]_179 6603 9.32e-27 597_[1(1.75e-07)]_[1(1.75e-07)]_[1(1.75e-07)]_327_[2(1.62e-15)]_286_[1(2.59e-15)]_475_[3(1.22e-10)]_169 -------------------------------------------------------------------------------- ******************************************************************************** ******************************************************************************** Stopped because nmotifs = 3 reached. 
******************************************************************************** CPU: hydra-1.lsd.ornl.gov ******************************************************************************** lucy.seq100644000766000024 212514006427353 16062 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>TaLr1010B10R 0 0 0 86 505 ATACGCCAAGCTCGAAATTAACCCTCACTAAAGGGAACAAAAGCTGGAGC TCGCGCGCCTGCAGGTCGACACTAGTGGATCCAAAGAATTCGGCACGAGA CCTACCCGTTCGTCAAGGAGGGTGTGTCCAACGGAACCCTCAAGCTCGTT GGCGGCCACTACGACTTCGTCTCCGGCAAGTTCGAGACATGGGAGCAGTA AATCTTCCCCACCGGTTAACTCCTATATATACATGCGAATACAAACGCGT ACATATATCAAGATATCGTCCGACCGTCCGACCGACCGATGTGAATCCAA TGCCATGGAGTGCGTACTCGTTATTTTCCAGTACTGGATGCCGGGATGGC CCGATGTGAATTCGCCATAAGCAATAGAACCTTTTTTCTTTTACCATTTT CTGACGAGGAATTGTACTACCATGTGATGCATAATTTGATCGTCCTGTGA TCAAAAGACATCATATATAAGTTTAATAATATTTTCATGAACAGTTTACC CTTTTAAAAAAAAAAAAAAAAAACTCGAGAGTACTTTTAGAGCGGCCGCG GGCCCATCGATTTTCCACCCGGGTGGGGTACCAGGTAAGTGTACCCAATT CGCCCTATAGTGAGTCGTATTACAATTCACTGGCCGGCGTTTTACAACGG CGGGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTTGAGCACA TTCCCCTTTTGCCAGGTTGGCGTAATAACAAAAGAGAGCCCGAACCGATC GCCCTTTCCAACAAGTTGGGCAACTTGAATGGGTGAATGGAGAATCCAAA TTTTTAAGTGGTGTAAAGGGTGTTAAACTCACTGGGATCCTTAAAGGGTT GGGGTGATTATAAAAAACACAAACGCCCAAAGCGTTATATTCCAGGCCCC CTCAGGTCCCTTAACAGGGCTGTGTGATGGGTCATTAAACCGCCCCCTAC CCACCATTTCGTAGAAGAGGGTTTAACTTGCGTTTTAAAAAGAAAACTCT CACACCCCTCTCCTCCTCGGACCCTGGAGAACAATAAATAGAAAAGGCGA CGTTGTTTCG roa1.dat100644000766000024 5471014006427353 15757 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID HSHNCPA1 standard; RNA; HUM; 1198 BP. XX AC X79536; XX SV X79536.1 XX DT 03-JUN-1994 (Rel. 39, Created) DT 03-JUN-1994 (Rel. 39, Last updated, Version 1) XX DE H.sapiens mRNA for hnRNPcore protein A1 XX KW core protein. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; OC Primates; Catarrhini; Hominidae; Homo. XX RN [2] RP 1-1198 RA Leffers H.; RT ; RL Submitted (01-JUN-1994) to the EMBL/GenBank/DDBJ databases. RL H. Leffers, Inst. 
of Medical Research Biochemistry & Danish Centre for RL Human Genome Research, Ole Worms Alle 170, Aarhus Univ., 8000 Aarhus C, RL DENMARK XX RN [3] RA Knudsen S.M., Leffers H.; RT "Cloning and sequencing of a splice variant of human hnRNP core protein RT A1"; RL Unpublished. XX DR SWISS-PROT; P09651; ROA1_HUMAN. XX FH Key Location/Qualifiers FH FT source 1..1198 FT /db_xref="taxon:9606" FT /organism="Homo sapiens" FT /tissue_type="lung" FT /cell_type="fibroblast" FT /cell_line="MRC5" FT /clone_lib="lambda ZapII" FT /clone="HDP1" FT CDS 27..989 FT /db_xref="SWISS-PROT:P09651" FT /product="hnRNPcore protein A1" FT /protein_id="CAA56072.1" FT /translation="MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVV FT MRDPNTKRSRGFGFVTYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTV FT KKIFVGGIKEDTEEHHLRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIV FT IQKYHTVNGHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNF FT SGRGGFGGSRGGGGYGGSGDGYNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNF FT GGRSSGPYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF" FT misc_feature 995 FT /note="alternative splice site" FT polyA_signal 1180..1185 XX SQ Sequence 1198 BP; 334 A; 216 C; 336 G; 312 T; 0 other; ttaaagtctc tcttcaccct gccgtcatgt ctaagtcaga gtctcctaaa gagcccgaac 60 agctgaggaa gctcttcatt ggagggttga gctttgaaac aactgatgag agcctgagga 120 gccattttga gcaatgggga acgctcacgg actgtgtggt aatgagagat ccaaacacca 180 agcgctctag gggctttggg tttgtcacat atgccactgt ggaggaggtg gatgcagcta 240 tgaatgcaag gccacacaag gtggatggaa gagttgtgga accaaagaga gctgtctcca 300 gagaagattc tcaaagacca ggtgcccact taactgtgaa aaagatattt gttggtggca 360 ttaaagaaga cactgaagaa catcacctaa gagattattt tgaacagtat ggaaaaattg 420 aagtgattga aatcatgact gaccgaggca gtggcaagaa aaggggcttt gcctttgtaa 480 cctttgacga ccatgactcc gtggataaga ttgtcattca gaaataccat actgtgaatg 540 gccacaactg tgaagttaga aaagccctgt caaagcaaga gatggctagt gcttcatcca 600 gccaaagagg tcgaagtggt tctggaaact ttggtggtgg tcgtggaggt ggtttcggtg 660 ggaatgacaa cttcggtcgt ggaggaaact tcagtggtcg tggtggcttt ggtggcagcc 720 gtggtggtgg 
tggatatggt ggcagtgggg atggctataa tggatttggc aatgatggaa 780 gcaattttgg aggtggtgga agctacaatg attttgggaa ttacaacaat cagtcttcaa 840 attttggacc catgaaggga ggaaattttg gaggcagaag ctctggcccc tatggcggtg 900 gaggccaata ctttgcaaaa ccacgaaacc aaggtggcta tggcggttcc agcagcagca 960 gtagctatgg cagtggcaga agattttaat tagggaggag tctgctacta gtcttatcag 1020 ctcttaaaaa cagaaactca tctgtccaag ttcgtggcag aaaggaacgt ccttgtgaag 1080 acctttatct gagccactgt acttcgttat cacgccatgc agtttacatg agctgttctg 1140 cagctcgaaa ttccattttg tgaatgggtt ttttttttta ataaactgta tttaactt 1198 // ID HSHNRNPA standard; DNA; HUM; 5368 BP. XX AC X12671; XX SV X12671.1 XX DT 23-NOV-1989 (Rel. 21, Created) DT 24-APR-1993 (Rel. 35, Last updated, Version 3) XX DE Human gene for heterogeneous nuclear ribonucleoprotein (hnRNP) core DE protein A1 XX KW hnRNP A1 proten; ribonucleoprotein; RNA binding protein. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; OC Primates; Catarrhini; Hominidae; Homo. XX RN [1] RP 1-5368 RA Riva S.; RT ; RL Submitted (23-AUG-1988) to the EMBL/GenBank/DDBJ databases. RL Riva S., Consiglio Nazionale Delle Ricerche, Istituto Di Genetica RL Biochimica Ed Evoluzionistica CNR, Via Abbiategrasso 2D7, 27100 Pavia, RL Italy. XX RN [2] RP 1-5368 RA Biamonti G., Buvoli M., Bassi M.T., Morandi C., Cobianchi F., Riva S.; RT "Isolation of an active gene encoding human hnRNP protein A1"; RL J. Mol. Biol. 207:491-503(1988). XX DR SWISS-PROT; P09651; ROA1_HUMAN. XX FH Key Location/Qualifiers FH FT source 1..5368 FT /db_xref="taxon:9606" FT /organism="Homo sapiens" FT /clone="pES5" FT /tissue_type="liver" FT /clone_lib="lambdaCh4A." 
FT mRNA join(695..813,1377..1493,1789..1935,2084..2294,2388..2480, FT 2567..2659,2794..2868,3806..3961,4252..4311,4543..5240) FT CDS join(799..813,1377..1493,1789..1935,2084..2294,2388..2480, FT 2567..2659,2794..2868,3806..3961,4252..4307) FT /db_xref="SWISS-PROT:P09651" FT /product="hnrnp a1 protein" FT /protein_id="CAA31191.1" FT /translation="MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVV FT MRDPNTKRSRGFGFVTYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTV FT KKIFVGGIKEDTEEHHLRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIV FT IQKYHTVNGHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNF FT SGRGGFGGSRGGGGYGGSGDGYNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNF FT GGRSSGPYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF" FT exon 695..813 FT /number=1 FT misc_feature 695..695 FT /note="mRNA initiation site" FT misc_feature 715..715 FT /note="mRNA initiation site" FT misc_feature 735..735 FT /note="mRNA initiation site" FT intron 814..1376 FT /number=1 FT exon 1377..1493 FT /number=2 FT intron 1494..1788 FT /number=2 FT exon 1789..1935 FT /number=3 FT intron 1936..2083 FT /number=3 FT exon 2084..2294 FT /number=4 FT intron 2295..2387 FT /number=4 FT exon 2388..2480 FT /number=5 FT intron 2481..2566 FT /number=5 FT exon 2567..2659 FT /number=6 FT intron 2660..2793 FT /number=6 FT exon 2794..2868 FT /number=7 FT intron 2869..3805 FT /number=7 FT exon 3806..3961 FT /number=8 FT intron 3962..4251 FT /number=8 FT exon 4252..4311 FT /number=9 FT intron 4312..4542 FT /number=9 FT exon 4543..5240 FT /number=10 XX SQ Sequence 5368 BP; 1476 A; 1052 C; 1270 G; 1570 T; 0 other; gggattgaga gtgatcactc acgctaacgt ctgccctgtt cctgtatggt gaggccgcac 60 cacaagccac caccgccgcc gccttctgcg caacgccaac cgcccgccaa aacggatcct 120 tccctgcgcc tgcgcaacca atcttgggac cggacctttt ttctccgccc actacgcatg 180 cgcaaagcta ggacaaactc ccgccaacac gcaggcgccg taggttcact gcctactcct 240 gcccgccatt tcacgtgttc tcagaggcag gtggaacttc ttaatgcgcc tgcgcaaaac 300 tcgccatttt actacacgtg cggtcaacaa gagttcattg caaaaaaatt gttacctcct 360 agctgcttgt ctaatacata 
gtgttaatca tgctttgcca agcgacttga ctgtaatatt 420 tgcgcgtgga agattaaaaa gatgttaaac acccaaggta gattcaaatg tgaatgattg 480 gtcggttggc caatcagact ggttaacaat aacattactc gggaaccaat ggactccaag 540 gggtggagac ggcgtagaac gaccgaagga atgacgttac acagcaatgt ggcaccacag 600 gccaatagca gggggaagcg atttcaagta tccaatcaga gctgttctag ggcggagtct 660 accaatgccg aaagcgagga ggcggggtaa aaaagagagg gcgaaggtag gctggcagat 720 acgttcgtca gcttgctcct ttctgcccgt ggacgccgcc gaagaagcat cgttaaagtc 780 tctcttcacc ctgccgtcat gtctaagtca gaggtgagtt aggcgcgctt tcccacttga 840 attttttcct ctccctttcc tgaatcggta agatgctgct gggtttcgtt ccttgcacca 900 gcccattcta cagttccttc ggtcgctgcc acggcctacc cctcccaaag ttcaagtcgc 960 cattttgtcc tcttgatcgc catgaggccg ctctccgcca accatgtgtt atcatgcggg 1020 actcgttact cgtagcaaaa ttcttaggca cacaggatct ttgtcttttt ttaaaccttg 1080 ccttggtgag cgagttttct aaagagcgat tagtcccatt gtggagatgc acccctaccg 1140 cccaagcctt tgttgcgcgt gcgtcggaag gcgactaggg acgcatgcgc ttgcgatttc 1200 ctagcactcc caactccagc atacggcctc ccttgatagg cagaagcacg tgtcttgttg 1260 cgacctgaac gaacaataag tgctaggtac acagttggtg tctagttttt cttttcctcg 1320 atggaaattg tttcgtgttg tagcccattt aacacttccc cctcccccca ctctagtctc 1380 ctaaagagcc cgaacagctg aggaagctct tcattggagg gttgagcttt gaaacaactg 1440 atgagagcct gaggagccat tttgagcaat ggggaacgct cacggactgt gtggtaagat 1500 ttggaaggga caaagcagta aaacagccga tttccttggc ttatcttggt gcagtcttct 1560 ccgaatgctt atgaaagtag ttaatagcat tatagttaga gctttgttgg caaaggaacg 1620 tcctgctttg attttaaaag ctaacctctt aaatctaagg gtagtgggaa actggacgaa 1680 ctttttataa aaggctggtg taaagtttcc tattgcccta ttcaaagtta aaataacaaa 1740 agcttttgcg gtcagacttt gtgttacata aattaacact gttctcaggt aatgagagat 1800 ccaaacacca agcgctctag gggctttggg tttgtcacat atgccactgt ggaggaggtg 1860 gatgcagcta tgaatgcaag gccacacaag gtggatggaa gagttgtgga accaaagaga 1920 gctgtctcca gagaagtgag tgggtttttt ttcttcttct tcttaaactt acttggatat 1980 gtgctgctat gaacttaaga ttcgggagtt ttctaaactt accaaaattt tttattcgag 2040 tataggcttt gctaatctaa acctatggtt tttctcctat 
taggattctc aaagaccagg 2100 tgcccactta actgtgaaaa agatatttgt tggtggcatt aaagaagaca ctgaagaaca 2160 tcacctaaga gattattttg aacagtatgg aaaaattgaa gtgattgaaa tcatgactga 2220 ccgaggcagt ggcaagaaaa ggggctttgc ctttgtaacc tttgacgacc atgactccgt 2280 ggataagatt gtcagtaagt atcagatagt ggcatttagt aagggttcca caatctgtat 2340 ggcattctaa accctgatac catgttgtat ctatgttttt tttttagttc agaaatacca 2400 tactgtgaat ggccacaact gtgaagttag aaaagccctg tcaaagcaag agatggctag 2460 tgcttcatcc agccaaagag gtatgcttgt tgcttaatta aaccttaaag gtaactttga 2520 gttactccag tatgaatgat ttaatgctta aacttcatgt cttaaggtcg aagtggttct 2580 ggaaactttg gtggtggtcg tggaggtggt ttcggtggga atgacaactt cggtcgtgga 2640 ggaaacttca gtggtcgtgg tatgtatggt ttatctacat gtagttctga cttctcacca 2700 tctttgctat gaagatttta cagtacggga actgcattca gaatgtcact ttaagtccaa 2760 gtcatactta aaacttgaaa ctttttctta caggtggctt tggtggcagc cgtggtggtg 2820 gtggatatgg tggcagtggg gatggctata atggatttgg caatgatggt aagtttttta 2880 ggaataagta gagaaaaatt cctggcaacc tggatcttta gaataggtta gtagagacta 2940 aaattctggt gcatgtcaaa ctcaactttg cccataacac gcatgctgtg agcaggcctt 3000 cagccgttac acttgcacaa gttttcattg tcaaatactt ttgtcttatt gagaagaatt 3060 gtattcttgt aggtggttat ggaggaggcg gccctggtta ctctggagga agcagaggct 3120 atggaagtgg tggacagggt tatggaaacc agggcagtgg ctatggcggg agtggcagct 3180 atgacagcta taacaacgga ggcggaggcg gctttggcgg tggtagtggt aggtatccag 3240 tgatccaagt acttggtgtg acagctagat tagcctttta gagcttgggt tctggtgctg 3300 ttgaagcatt gtgtggtaca ctgcatggta tattaaaaac aaatgggctt gctatgctac 3360 ctcctcctag ctttaagctg gggccgcctc actcccaaat agtagagata agtggatagt 3420 gttgtctttg agttagatta gtatcataga aggatttagt attttaactc ctttgggacc 3480 ttaggcgctt agttgatgta tccaagatac ttctgcttgc tgtggccctg gatccgtgaa 3540 ggccttcaag gctgaagggt atgcttgtgc cactctgaaa atctctttat tttatgtcat 3600 ggtgagttag gccagttttc tttgtattac tggattattc aactgaatgc ctttcccaga 3660 gaatgaaatg caaagattgg agtcaccata gtttgggaga aaggaaggct gataactcaa 3720 ccttatttta ttctgactgc taaacagaat tggaaactaa catcatcctc 
aggtaacaga 3780 taaaggccct ctttcccatt cataggaagc aattttggag gtggtggaag ctacaatgat 3840 tttgggaatt acaacaatca gtcttcaaat tttggaccca tgaagggagg aaattttgga 3900 ggcagaagct ctggccccta tggcggtgga ggccaatact ttgcaaaacc acgaaaccaa 3960 ggtatggtat ctatgtaatt ttggataatg tcaaaagagt gtctgtagct actgctggga 4020 agaaagccct ttaactgcta tgtctgggca gcaaaacgtt tatagtttag aaccttcaga 4080 aagtgataat ttgatcacaa attagaaaaa tcatgggacc tctttaccac ctcccttgta 4140 gtagggccat ttttaaatgg ccagacactt gaatttaact tttattatcc caaatatgaa 4200 aacattactg ttggcacttt gaaactttaa aagaaaaatt gtacttttca ggtggctatg 4260 gcggttccag cagcagcagt agctatggca gtggcagaag attttaatta ggtaagtaag 4320 cacctttttg tgtgttgaca taatttttta aattgctgat gaacccaata accctaatgt 4380 agctgagcag tgcaacatag ttaacattat aattgcagta attgtggata taaagttaat 4440 attcagatca gcaaaatttg tgggaaacaa acttgatatt ggattgtagc cttgagtctt 4500 aatatgttta gattaacaac tctattccat attgttcaac aggaaacaaa gcttagcagg 4560 agaggagagc cagagaagtg acagggaagc tacaggttac aacagatttg tgaactcagc 4620 caagcacagt ggtggcaggg cctagctgct acaaagaaga catgttttag acaaatactc 4680 atgtgtatgg gcaaaaaact cgaggactgt atttgtgact aattgtataa caggttattt 4740 tagtttctgt tctgtggaaa gtgtaaagca ttccaacaaa gggttttaat gtagattttt 4800 ttttttgcac cccatgctgt tgattgctaa atgtaacagt ctgatcgtga cgctgaataa 4860 atgtcttttt tttaatgtgc tgtgtaaagt tagtctactc ttaagccatc ttggtaaatt 4920 tccccaacag tgtgaagtta gaattccttc agggtgatgc caggttctat ttggaattta 4980 tatacaacct gcttgggtgg agaagccatt gtcttcggaa accttggtgt agttgaactg 5040 atagttactg ttgtgacctg aagttcacca ttaaaaggga ttacccaagc aaaatcatgg 5100 aatggttata aaagtgattg ttggcacatc ctatgcaata tatctaaatt gaataatggt 5160 accagataaa attatagatg ggaatgaagc ttgtgtatcc attatcatgt gtaatcaata 5220 aacgatttaa ttctcttgaa tgaaatgaca actgtatgga tttgggactg gcagagattt 5280 ggactttccc tacccactcc ccctgataat aatgttgaat gcttctatca caattcaagt 5340 tcaaagctct gctagggaat agaaacta 5368 // ID HSRNPA1 standard; RNA; HUM; 1747 BP. XX AC X06747; XX SV X06747.1 XX DT 08-JUN-1988 (Rel. 
16, Created) DT 12-SEP-1993 (Rel. 36, Last updated, Version 2) XX DE Human hnRNP core protein A1 XX KW hnRNP A1 protein; ribonucleoprotein. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; OC Primates; Catarrhini; Hominidae; Homo. XX RN [1] RP 1-1747 RA Riva S.; RT ; RL Submitted (26-JAN-1988) to the EMBL/GenBank/DDBJ databases. RL Riva S., Istituto Di Genetica Biochimica, Ed Evoluzionistica - CNR, Via RL Abbiategrasso 207, 27100 Pavia, Italy. XX RN [2] RP 1-1747 RX MEDLINE; 88233978. RA Buvoli M., Biamonti G., Ghetti A., Riva S., Bassi M.T., Horandi C.; RT "cDNA cloning of human hnRNP protein A1 reveals the existence of multiple RT mRNA isoforms."; RL Nucleic Acids Res. 16:3751-3770(1988). XX DR SWISS-PROT; P09651; ROA1_HUMAN. XX CC pRP15 sequence encodes the protein A1 type alpha isoform, CC pRP12 (truncated at the 5' end, respectively) the beta isoform XX FH Key Location/Qualifiers FH FT source 1..1747 FT /db_xref="taxon:9606" FT /organism="Homo sapiens" FT /tissue_type="fibroblast" FT /clone_lib="lambdagt11" FT /clone="pRP15" FT CDS 86..1048 FT /db_xref="SWISS-PROT:P09651" FT /note="protein A1-alpha (AA 1-320)" FT /protein_id="CAA29922.1" FT /translation="MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVV FT MRDPNTKRSRGFGFVTYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTV FT KKIFVGGIKEDTEEHHLRDYFEQFGKIEVIEIMTDRGSGKKKGFAFVTFDDHDSVDKIV FT IQKYHTVNGHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNF FT SGRGGFGGSRGGGGYGGSGDGYNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNF FT GGRSSGPYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF" FT misc_feature 209..209 FT /note="5' end of clone pRP12" FT variation 467..469 FT /note="uau (Tyr) is uuu (Phe) in pRP12 (protein A1-beta)" FT variation 521..523 FT /note="agg (Arg) is aag (Lys) in pRP12 (protein A1-beta)" FT misc_feature 1365..1370 FT /note="polyA signal" FT misc_feature 1726..1731 FT /note="polyA signal" FT polyA_site 1747..1747 FT /note="polyA site" XX SQ Sequence 1747 BP; 512 A; 300 C; 466 G; 469 T; 0 
other; cgaaaggtta caaaggcaaa gccccttttc tgcccgtgga cgccgccgaa gaagcatcgt 60 taaagtctct cttcaccctg ccgtcatgtc taagtcagag tctcctaaag agcccgaaca 120 gctgaggaag ctcttcattg gagggttgag ctttgaaaca actgatgaga gcctgaggag 180 ccattttgag caatggggaa cgctcacgga ctgtgtggta atgagagatc caaacaccaa 240 gcgctctagg ggctttgggt ttgtcacata tgccactgtg gaggaggtgg atgcagctat 300 gaatgcaagg ccacacaagg tggatggaag agttgtggaa ccaaagagag ctgtctccag 360 agaagattct caaagaccag gtgcccactt aactgtgaaa aagatatttg ttggtggcat 420 taaagaagac actgaagaac atcacctaag agattatttt gaacagtttg gaaaaattga 480 agtgattgaa atcatgactg accgaggcag tggcaagaaa aagggctttg cctttgtaac 540 ctttgacgac catgactccg tggataagat tgtcattcag aaataccata ctgtgaatgg 600 ccacaactgt gaagttagaa aagccctgtc aaagcaagag atggctagtg cttcatccag 660 ccaaagaggt cgaagtggtt ctggaaactt tggtggtggt cgtggaggtg gtttcggtgg 720 gaatgacaac ttcggtcgtg gaggaaactt cagtggtcgt ggtggctttg gtggcagccg 780 tggtggtggt ggatatggtg gcagtgggga tggctataat ggatttggca atgatggaag 840 caattttgga ggtggtggaa gctacaatga ttttgggaat tacaacaatc agtcttcaaa 900 ttttggaccc atgaagggag gaaattttgg aggcagaagc tctggcccct atggcggtgg 960 aggccaatac tttgcaaaac cacgaaacca aggtggctat ggcggttcca gcagcagcag 1020 tagctatggc agtggcagaa gattttaatt aggaaacaaa gcttagcagg agaggagagc 1080 cagagaagtg acagggaagc tacaggttac aacagatttg tgaactcagc caagcacagt 1140 ggtggcaggg cctagctgct acaaagaaga catgttttag acaaatactc atgtgtatgg 1200 gcaaaaaact cgaggactgt atttgtgact aattgtataa caggttattt tagtttctgt 1260 tctgtggaaa gtgtaaagca ttccaacaaa gggttttaat gtagattttt tttttgcacc 1320 ccatgctgtt gattgctaaa tgtaacagtc tgatcgtgac gctgaataaa tgtctttttt 1380 ttaatgtgct gtgtaaagtt agtctactct taagccatct tggtaaattt ccccaacagt 1440 gtgaagttag aattccttca gggtgatgcc aggttctatt tggaatttat atacaacctg 1500 cttgggtgga gaagccattg tcttcggaaa ccttggtgta gttgaactga tagttactgt 1560 tgtgacctga agttcaccat taaaagggat tacccaagca aaatcatgga atggttataa 1620 aagtgattgt tggcacatcc tatgcaatat atctaaattg aataatggta ccagataaaa 1680 ttatagatgg gaatgaagct 
tgtgtatcca ttatcatgtg taatcaataa acgatttaat 1740 tctcttg 1747 // ID HSUPIR1 standard; RNA; HUM; 924 BP. XX AC X04347; XX SV X04347.1 XX DT 18-NOV-1986 (Rel. 10, Created) DT 12-JUL-1995 (Rel. 44, Last updated, Version 3) XX DE Human liver mRNA fragment DNA binding protein UPI homologue DE (C-terminus) XX KW DNA binding protein. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; OC Primates; Catarrhini; Hominidae; Homo. XX RN [1] RP 1-924 RX MEDLINE; 87053868. RA Riva S., Morandi C., Tsoulfas P., Pandolfo M., Biamonti G., Merrill B., RA Williams K.R., Multhaup G., Beyreuther K., Werr H., Heinrich B., RA Schaefer K.P.; RT "Mammalian single-stranded DNA binding protein UP I is derived from the RT hnRNP cor protein A1"; RL EMBO J. 5:2267-2273(1986). XX DR SWISS-PROT; P09651; ROA1_HUMAN. XX CC Pos. 1-216 is homologous to UPI (aa 126-195); pos. 184-273 is CC homologous to V8 protease peptide AI/II (aa 185-214) XX CC Data kindly reviewed (08-NOV-1986) by K. Beyreuther XX FH Key Location/Qualifiers FH FT source 1..924 FT /db_xref="taxon:9606" FT /organism="Homo sapiens" FT CDS <1..591 FT /codon_start=1 FT /db_xref="SWISS-PROT:P09651" FT /note="ORF protein; C-terminal (aa 125-319; 196aa)" FT /protein_id="CAA27874.1" FT /translation="FEQYGKIEVIEIMTDPGSGKKRGFAFVTFDDHDSVDKIVIQKYHT FT VNGHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNFSGRGGF FT GGSRGGGGYGGSGDGYNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNFGGRSSG FT PYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF" FT misc_feature 208..210 FT /note="dimethylated arginin" FT polyA_signal 908..913 FT /note="pot. 
polyA signal" FT polyA_site 924..924 FT /note="polyA site" XX SQ Sequence 924 BP; 262 A; 148 C; 266 G; 248 T; 0 other; tttgaacagt atggaaaaat tgaagtgatt gaaatcatga ctgacccagg cagtggcaag 60 aaaaggggct ttgcctttgt aacctttgac gaccatgact ccgtggataa gattgtcatt 120 cagaaatacc atactgtgaa tggccacaac tgtgaagtta gaaaagccct gtcaaagcaa 180 gagatggcta gtgcttcatc cagccaaaga ggtcgaagtg gttctggaaa ctttggtggt 240 ggtcgtggag gtggtttcgg tgggaatgac aacttcggtc gtggaggaaa cttcagtggt 300 cgtggtggct ttggtggcag ccgtggtggt ggtggatatg gtggcagtgg ggatggctat 360 aatggatttg gcaatgatgg aagcaatttt ggaggtggtg gaagctacaa tgattttggg 420 aattacaaca atcagtcttc aaattttgga cccatgaagg gaggaaattt tggaggcaga 480 agctctggcc cctatggcgg tggaggccaa tactttgcaa aaccacgaaa ccaaggtggc 540 tatggcggtt ccagcagcag cagtagctat ggcagtggca gaagatttta attaggaaac 600 aagcttggca ggagaggaga gccagagaag tgacagggaa gctacaggtt acaacagatt 660 tgtgaactca gccaagcaca gtggtggcag ggcctagctg ctacaaagaa gacatgtttt 720 agacaaatac tcatgtgtat gggcaaaaaa ctcgaggact gtatttgtga ctaattgtat 780 aacaggttat tttagtttct gttctgtgga aagtgaaagc attccaacaa agggttttaa 840 tgtagatttt ttttttttgc accccatgct gttgattgct aaatgtaaca gtctgatcgt 900 gacgctgaat aaatgtcttt tttt 924 // test.tab100644000766000024 134414006427353 16045 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataroa1_drome MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVVVMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVKKLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQKQHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY roa2_drome 
MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVVVMKDPTSTSTSTSTSTSTSTSTMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVKKLFVGALKDDHDEQSIRDYFQHLLLLLLLDLLLLDLLLLDLLLFVEFDDYDPVDKVVLQKQHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY test.phd100755000766000024 1717314006427353 16104 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBEGIN_SEQUENCE ML4924R BEGIN_COMMENT CHROMAT_FILE: ML4924R ABI_THUMBPRINT: 0 PHRED_VERSION: 0.980904.e CALL_METHOD: phred QUALITY_LEVELS: 99 TIME: Thu Jun 21 11:27:03 2001 TRACE_ARRAY_MIN_INDEX: 0 TRACE_ARRAY_MAX_INDEX: 8792 CHEM: term DYE: big END_COMMENT BEGIN_DNA a 6 1 c 6 20 t 6 17 t 8 26 t 8 35 g 12 48 g 18 58 t 16 66 c 14 83 g 11 98 c 9 104 c 6 122 t 6 128 g 9 140 c 19 147 a 20 159 g 32 167 g 34 178 t 34 190 a 39 200 c 39 208 c 39 221 g 29 233 g 40 243 t 28 255 c 21 267 c 13 277 g 4 288 n 4 297 g 4 298 a 8 314 t 8 328 t 23 339 c 29 349 c 39 359 c 40 369 g 40 381 g 40 390 g 40 401 t 46 413 c 34 425 g 33 436 a 29 447 c 29 453 c 32 466 c 33 476 a 34 485 c 34 492 g 34 505 c 34 514 g 40 527 t 40 537 c 34 548 c 34 559 g 34 572 c 34 579 t 34 592 t 34 602 t 39 613 g 40 625 c 40 633 t 40 645 c 40 656 a 40 664 t 40 673 c 40 685 t 46 697 c 51 707 c 51 718 c 56 728 t 51 738 c 51 749 t 35 760 g 35 772 a 35 783 c 35 790 t 35 803 t 35 813 t 39 824 g 40 836 c 40 844 a 40 855 t 40 865 c 56 876 c 42 887 c 51 898 t 46 909 t 46 921 a 46 931 a 46 940 t 46 951 c 46 962 c 46 974 t 46 985 c 40 995 a 40 1004 g 40 1015 a 40 1028 a 40 1037 t 40 1048 g 35 1060 g 35 1071 c 35 1080 g 35 1096 t 35 1106 c 35 1116 a 40 1125 g 40 1138 a 40 1150 g 51 1160 a 51 1172 a 51 1182 g 51 1193 a 40 1204 a 35 1214 g 35 1226 c 39 1235 t 39 1250 c 39 1259 t 39 1272 c 40 1282 g 51 1295 a 51 1305 a 51 1315 c 51 1325 c 51 1337 c 51 1348 a 51 1358 a 51 1368 t 51 1381 g 51 1393 a 51 1403 g 51 1415 g 51 1427 g 39 1440 a 39 1450 t 39 1462 a 39 1470 t 39 1483 
c 35 1494 a 40 1503 a 40 1514 g 40 1528 g 40 1540 t 40 1552 t 39 1565 c 39 1574 a 39 1583 g 39 1598 a 38 1609 a 40 1619 g 39 1632 c 39 1642 t 39 1657 c 39 1666 g 39 1680 t 40 1691 t 45 1703 c 51 1713 t 51 1726 c 51 1736 a 51 1746 a 51 1757 c 51 1767 a 45 1778 t 45 1791 c 45 1802 t 45 1816 c 45 1826 c 45 1838 g 45 1852 t 51 1863 t 51 1875 g 56 1887 g 43 1899 t 43 1911 g 43 1924 a 43 1933 g 43 1946 a 43 1958 g 56 1970 t 56 1982 g 56 1994 g 56 2006 t 56 2017 g 51 2030 a 43 2041 t 43 2053 c 43 2063 g 43 2079 t 43 2089 c 43 2100 t 43 2114 c 43 2124 a 43 2135 c 43 2144 c 43 2157 c 43 2169 g 43 2184 t 43 2195 g 43 2207 c 43 2216 c 43 2229 t 43 2244 c 43 2253 c 43 2265 a 46 2277 a 56 2288 g 56 2299 g 56 2311 t 56 2324 t 56 2336 t 56 2348 t 56 2360 g 56 2373 g 51 2385 a 51 2397 a 51 2408 c 51 2418 a 43 2430 g 43 2443 c 45 2454 t 45 2469 c 45 2480 a 45 2491 g 56 2503 t 56 2516 g 56 2528 g 51 2540 t 51 2552 c 51 2564 a 51 2574 a 51 2585 a 51 2598 c 45 2608 a 45 2620 c 45 2630 c 45 2644 t 45 2658 g 51 2671 t 51 2682 c 51 2693 t 51 2707 t 45 2719 c 45 2730 t 51 2743 c 51 2753 c 51 2765 a 51 2775 a 51 2787 g 43 2798 g 43 2810 c 43 2822 g 43 2837 a 43 2847 g 43 2859 g 51 2872 t 51 2884 a 51 2894 c 43 2904 a 43 2917 c 43 2927 t 43 2943 g 43 2956 t 43 2967 g 51 2980 a 51 2992 g 45 3003 g 45 3017 t 45 3029 c 45 3039 t 45 3053 t 45 3065 t 51 3077 c 51 3088 g 51 3102 g 51 3113 t 51 3125 a 43 3136 t 43 3148 t 43 3161 a 43 3170 g 43 3183 a 43 3196 c 43 3205 g 43 3220 t 43 3231 a 45 3243 a 56 3254 t 56 3266 g 56 3279 a 56 3290 a 56 3302 a 56 3314 a 51 3325 g 51 3337 a 51 3350 t 43 3360 t 43 3373 g 43 3385 c 43 3394 g 43 3410 t 43 3421 g 43 3434 c 43 3443 t 43 3458 a 43 3468 c 43 3477 g 43 3493 t 43 3505 c 43 3515 a 43 3527 c 43 3536 c 43 3550 g 43 3565 t 43 3576 g 43 3588 a 45 3601 g 45 3611 a 45 3625 g 45 3635 g 45 3648 a 43 3660 g 43 3672 a 38 3684 c 38 3692 a 38 3705 a 43 3717 g 56 3729 g 56 3741 c 56 3752 a 56 3765 a 51 3776 t 51 3789 g 51 3801 c 51 3811 a 43 3824 g 43 3836 c 43 3846 
t 43 3862 t 43 3874 c 43 3884 t 51 3897 t 51 3909 g 56 3922 a 56 3933 g 56 3944 a 56 3957 g 56 3968 t 43 3980 g 43 3992 g 43 4004 c 43 4013 t 43 4029 t 43 4041 g 43 4053 a 56 4064 a 56 4075 a 51 4087 g 51 4098 t 51 4111 c 51 4121 a 51 4132 a 51 4144 g 56 4155 g 56 4168 a 51 4181 g 51 4192 t 43 4205 a 43 4215 c 43 4225 g 43 4240 a 43 4250 g 43 4263 c 43 4274 t 45 4289 g 45 4301 t 45 4312 t 56 4324 g 56 4336 a 56 4347 g 56 4358 g 56 4371 a 56 4383 g 56 4394 g 51 4407 a 43 4419 a 43 4430 c 43 4440 t 43 4455 t 43 4467 c 43 4477 a 51 4488 g 51 4500 t 51 4513 g 51 4525 a 51 4536 c 45 4546 a 45 4558 c 45 4569 t 51 4583 g 45 4595 g 45 4607 c 45 4618 t 45 4633 g 45 4645 t 45 4656 t 56 4668 t 56 4680 t 56 4692 g 56 4704 g 56 4716 a 56 4727 t 56 4738 t 56 4751 c 56 4762 g 56 4775 g 56 4787 t 51 4798 a 51 4809 t 51 4821 c 51 4831 c 51 4844 a 51 4854 g 51 4866 g 51 4879 a 56 4892 g 51 4903 c 51 4914 a 51 4926 c 51 4936 a 51 4948 t 51 4960 t 56 4973 g 56 4985 a 56 4997 t 56 5008 c 56 5020 t 56 5033 t 56 5045 g 56 5057 g 56 5069 a 56 5080 a 40 5091 t 40 5104 c 40 5115 a 40 5124 a 40 5136 g 40 5149 t 40 5162 a 40 5172 c 40 5182 g 40 5197 a 40 5207 t 40 5220 c 51 5231 c 56 5243 t 56 5256 t 51 5268 c 51 5279 t 51 5292 a 51 5302 c 51 5312 g 51 5325 g 51 5338 g 51 5350 t 51 5362 a 45 5372 t 45 5384 c 45 5395 t 45 5409 a 45 5419 t 45 5431 g 51 5444 g 56 5456 a 56 5468 a 56 5479 t 56 5492 g 40 5503 g 39 5516 a 39 5528 c 39 5537 t 39 5552 t 39 5564 c 39 5575 t 40 5588 a 40 5597 t 40 5609 g 40 5622 t 56 5634 t 56 5646 g 51 5658 t 51 5669 t 51 5682 c 51 5692 t 51 5705 t 51 5718 g 51 5730 a 45 5741 a 40 5752 c 40 5762 g 37 5776 c 37 5785 c 40 5798 c 40 5810 g 56 5823 g 56 5834 g 51 5846 a 51 5859 t 40 5870 a 40 5880 c 40 5890 c 40 5904 g 40 5918 t 51 5929 g 51 5941 t 37 5953 t 37 5966 g 37 5978 c 37 5987 a 37 5999 c 40 6009 g 40 6023 t 34 6035 c 34 6047 g 29 6060 c 29 6068 c 29 6083 g 29 6096 t 32 6107 a 29 6118 g 25 6128 g 19 6142 t 27 6154 g 27 6167 c 32 6176 a 32 6187 a 34 6199 g 40 6212 
a 34 6224 c 40 6234 t 34 6248 c 35 6258 g 35 6272 c 35 6281 g 35 6296 t 39 6307 t 40 6320 g 56 6331 g 56 6342 g 56 6355 a 51 6367 t 51 6378 t 46 6391 c 46 6401 a 42 6411 g 42 6424 c 42 6435 a 46 6447 t 40 6459 a 37 6469 g 37 6482 a 40 6494 g 29 6506 t 29 6518 t 29 6531 a 29 6539 c 29 6549 c 27 6563 a 32 6573 a 29 6585 g 29 6596 g 34 6609 a 48 6622 t 48 6633 g 48 6645 a 32 6656 t 34 6668 g 34 6681 c 34 6690 c 31 6704 a 31 6714 t 37 6726 g 46 6739 a 48 6750 a 48 6762 g 40 6774 t 40 6786 g 40 6797 g 27 6810 t 25 6821 t 21 6834 c 16 6842 c 16 6856 a 18 6865 a 19 6877 g 23 6889 t 27 6901 t 25 6915 a 29 6923 a 24 6936 g 29 6947 t 32 6960 a 29 6970 c 29 6980 g 25 6995 a 25 7004 a 21 7018 g 15 7026 g 15 7041 t 23 7053 g 27 7065 t 27 7075 t 27 7089 a 27 7098 t 25 7110 c 22 7120 c 24 7133 t 29 7147 c 32 7157 a 32 7168 a 40 7179 c 29 7189 a 29 7200 a 29 7211 g 48 7224 t 48 7237 c 40 7248 t 40 7261 c 40 7272 a 29 7282 g 29 7294 a 25 7306 a 29 7317 c 29 7327 a 40 7338 t 40 7350 c 32 7362 a 32 7372 c 32 7383 t 32 7397 g 32 7409 g 25 7422 t 25 7432 t 25 7445 g 29 7457 a 40 7467 a 36 7479 g 40 7491 a 36 7503 g 36 7514 t 37 7525 t 33 7537 t 46 7549 g 24 7561 t 24 7571 t 24 7584 t 32 7596 c 32 7606 t 40 7619 t 34 7630 t 22 7642 t 25 7654 c 25 7665 a 27 7674 t 25 7685 t 22 7698 t 29 7709 t 29 7721 g 25 7734 t 21 7744 t 21 7757 t 22 7767 t 29 7779 g 24 7791 t 24 7802 g 24 7814 g 31 7827 t 31 7838 g 28 7850 t 28 7861 t 24 7873 g 19 7884 g 19 7897 c 23 7906 t 27 7921 c 25 7931 t 22 7943 t 29 7954 t 34 7965 t 32 7976 t 32 7988 t 32 8000 a 25 8009 t 18 8020 a 18 8031 a 18 8039 t 18 8053 t 23 8066 t 32 8077 g 26 8089 c 25 8099 g 19 8112 g 15 8124 t 15 8134 t 15 8148 c 23 8157 t 23 8170 g 22 8182 g 34 8193 a 37 8205 t 40 8216 g 32 8227 a 32 8238 t 29 8250 a 22 8259 a 18 8269 g 18 8281 g 13 8294 c 22 8305 t 16 8318 t 20 8328 t 25 8341 g 29 8353 t 24 8364 t 24 8376 g 25 8388 t 19 8398 t 19 8412 g 25 8422 t 24 8433 g 21 8444 g 21 8455 t 27 8467 g 21 8479 g 20 8491 t 26 8502 a 25 8512 g 18 8524 
a 15 8536 a 15 8545 t 10 8558 c 13 8566 c 13 8581 t 17 8593 c 21 8603 a 22 8613 a 25 8624 g 29 8635 t 25 8647 a 25 8657 c 25 8666 c 24 8678 c 19 8691 t 10 8703 c 10 8712 g 11 8726 a 17 8736 g 17 8747 a 12 8759 t 19 8768 a 16 8778 t 11 8787 END_DNA END_SEQUENCE test.tsv100644000766000024 7304014006427353 16135 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataName Accession Group Family Subfamily Pseudogene? cDNA Protein Kinase Domain Novelty A6 SK001 Atypical A6 N CCGCCGGCCGGGGCCGCTGGCTGCACTCAGCGCCGGAGCCGGGAGCTAGCGGCCGCCGCCATGTCCCACCAGACCGGCATCCAAGCAAGTGAAGATGTTAAAGAGATCTTTGCCAGAGCCAGAAATGGAAAGTACAGACTTCTGAAAATATCTATTGAAAATGAGCAACTTGTGATTGGATCATATAGTCAGCCTTCAGATTCCTGGGATAAGGATTATGATTCCTTTGTTTTACCCCTGTTGGAGGACAAACAACCATGCTATATATTATTCAGGTTAGATTCTCAGAATGCCCAGGGATATGAATGGATATTCATTGCATGGTCTCCAGATCATTCTCATGTTCGTCAAAAAATGTTGTATGCAGCAACAAGAGCAACTCTGAAGAAGGAATTTGGAGGTGGCCACATTAAAGATGAAGTATTTGGAACAGTAAAGGAAGATGTATCATTACATGGATATAAAAAATACTTGCTGTCACAATCTTCCCCTGCCCCACTGACTGCAGCTGAGGAAGAACTACGACAGATTAAAATCAATGAGGTACAGACTGACGTGGGTGTGGACACTAAGCATCAAACACTACAAGGAGTAGCATTTCCCATTTCTCGAGAAGCCTTTCAGGCTTTGGAAAAATTGAATAATAGACAGCTCAACTATGTGCAGTTGGAAATAGATATAAAAAATGAAATTATAATTTTGGCCAACACAACAAATACAGAACTGAAAGATTTGCCAAAGAGGATTCCCAAGGATTCAGCTCGTTACCATTTCTTTCTGTATAAACATTCCCATGAAGGAGACTATTTAGAGTCCATAGTTTTTATTTATTCAATGCCTGGATACACATGCAGTATAAGAGAGCGGATGCTGTATTCTAGCTGCAAGAGCCGTCTGCTAGAAATTGTAGAAAGACAACTACAAATGGATGTAATTAGAAAGATCGAGATAGACAATGGGGATGAGTTGACTGCAGACTTCCTTTATGAAGAAGTACATCCCAAGCAGCATGCACACAAGCAAAGTTTTGCAAAACCAAAAGGTCCTGCAGGAAAAAGAGGAATTCGAAGACTAATTAGGGGCCCAGCGGAAACTGAAGCTACTACTGATTAAAGTCATCACATTAAACATTGTAATACTAGTTTTTTAAAAGTCCAGCTTTTAGTACAGGAGAACTGAAATCATTCCATGTTGATATAAAGTAGGGAAAAAAATTGTACTTTTTGGAAAATAGCACTTTTCACTTCTGTGTGTTTTTAAAATTAATGTTATAGAAGACTCATGATTTCTATTTTTGAGTTAAAGCTAGAAAAGGGTTCAACATAATGTTTAATTTTGTCACACTGTTTTCATAGCGTTGATTCCACACTTCAAATACTTCTTAAAATTTTATACAGTTGGGCCAGTTCTAGAAAGTCTGATGTCTCAAAGGGTAAACTTACTACTTTCTTGTGGGACAGAAAGACCTTAAAATATTCATATTACTTAATGAATATGTTAAGGACCAGGCTAGAGTATTTTCTAAGCT
GGAAACTTAGTGTGCCTTGGAAAAGGCCGCAAGTTGCTTACTCCGAGTAGCTGTGCTAGCTCTGTCAGACTGTAGGATCATGTCTGCAACTTTTAGAAATAGTGCTTTATATTGCAGCAGTCTTTTATATTTGACTTTTTTTTAATAGCATTAAAATTGCAGATCAGCTCACTCTGAAACTTTAAGGGTACCAGATATTTTCTATACTGCAGGATTTCTGATGACATTGAAAGACTTTAAACAGCCTTAGTAAATTATCTTTCTAATGCTCTGTGAGGCCAAACATTTATGTTCAGATTGAAATTTAAATTAATATCATTCAAAAGGAAACAAAAAATGTTGAGTTTTAAAAATCAGGATTGACTTTTTTCTCCAAAACCATACATTTATGGGCAAATTGTGTTCTTTATCACTTCCGAGCAAATACTCAGATTTAAAATTACTTTAAAGTCCTGGTACTTAACAGGCTAACGTAGATAAACACCTTAATAATCTCAGTTAATACTGTATTTCAAAACACATTTAACTGTTTTCTAATGCTTTGCATTATCAGTTACAACCTAGAGAGATTTTGAGCCTCATATTTCTTTGATACTTGAAATAGAGGGAGCTAGAACACTTAATGTTTAATCTGTTAAACCTGCTGCAAGAGCCATAACTTTGAGGCATTTTCTAAATGAACTGTGGGGATCCAGGATTTGTAATTTCTTGATCTAAACTTTATGCTGCATAAATCACTTATCGGAAATGCACATTTCATAGTGTGAAGCACTCATTTCTAAACCTTATTATCTAAGGTAATATATGCACCTTTCAGAAATTTGTGTTCGAGTAAGTAAAGCATATTAGAATAATTGTGGGTTGACAGATTTTTAAAATAGAATTTAGAGTATTTGGGGTTTTGTTTGTTTACAAATAATCAGACTATAATATTTAAACATGCAAAATAACTGACAATAATGTTGCACTTGTTTACTAAAGATATAAGTTGTTCCATGGGTGTACACGTAGACAGACACACATACACCCAAATTATTGCATTAAGAATCCTGGAGCAGACCATAGCTGAAGCTGTTATTTTCAGTCAGGAAGACTACCTGTCATGAAGGTATAAAATAATTTAGAAGTGAATGTTTTTCTGTACCATCTATGTGCAATTATACTCTAAATTCCACTACACTACATTAAAGTAAATGGACATTCCAGAATATAGATGTGATTATAGTCTTAAACTAATTATTATTAAACCAATGATTGCTGAAAATCAGTGATGCATTTGTTATAGAGTATAACTCATCGTTTACAGTATGTTTTAGTTGGCAGTATCATACCTAGATGGTGAATAACATATTCCCAGTAAATTTATATAGCAGTGAAGAATTACATGCCTTCTGGTGGACATTTTATAAGTGCATTTTATATCACAATAAAAATTTTTTCTCTTTAAAAAAAAAAAACAAGAAAAAAAAAAAA MSHQTGIQASEDVKEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPLLEDKQPCYILFRLDSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDEVFGTVKEDVSLHGYKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAFPISREAFQALEKLNNRQLNYVQLEIDIKNEIIILANTTNTELKDLPKRIPKDSARYHFFLYKHSHEGDYLESIVFIYSMPGYTCSIRERMLYSSCKSRLLEIVERQLQMDVIRKIEIDNGDELTADFLYEEVHPKQHAHKQSFAKPKGPAGKRGIRRLIRGPAETEATTD Known - Literature A6r SK512 Atypical A6 N 
GACAGCCTCCGCCACATCCTCCACCTCTCTTGGTCCAGCGAGCGTTGCCGGGCCAGGGTCAAGCGGAGGGCTCCGACGGCGCGGACGGAGCGAAGCGCCGAGCCATGGCGCACCAAACGGGCATCCACGCCACGGAAGAGCTGAAGGAATTCTTTGCCAAGGCACGGGCTGGCTCTGTGCGGCTCATCAAGGTTGTGATTGAGGACGAGCAGCTCGTGCTGGGTGCCTCGCAGGAGCCAGTAGGCCGCTGGGATCAGGACTATGACAGGGCCGTGCTGCCACTGCTGGACGCCCAGCAGCCCTGCTACCTGCTCTACCGCCTCGACTCACAGAATGCTCAGGGCTTCGAATGGCTCTTCCTCGCCTGGTCGCCTGATAACTCCCCCGTGCGGCTGAAGATGCTGTACGCGGCCACGCGGGCCACAGTGAAAAAGGAGTTTGGAGGTGGCCACATCAAGGATGAGCTCTTCGGGACTGTGAAGGATGACCTCTCTTTTGCTGGGTACCAGAAACACCTGTCGTCCTGTGCGGCACCTGCCCCGCTGACCTCGGCTGAGAGAGAGCTCCAGCAGATCCGCATTAACGAGGTGAAGACAGAGATCAGTGTGGAAAGCAAGCACCAGACCCTGCAGGGCCTCGCCTTCCCCCTGCAGCCTGAGGCCCAGCGGGCACTCCAGCAGCTCAAGCAGAAAATGGTCAACTACATCCAGATGAAGCTGGACCTAGAGCGGGAAACCATTGAGCTGGTGCACACAGAGCCCACGGATGTGGCCCAGCTGCCCTCCCGGGTGCCCCGAGATGCTGCCCGCTACCACTTCTTCCTCTACAAGCACACCCATGAGGGCGACCCCCTTGAGTCTGTAGTGTTCATCTACTCCATGCCGGGGTACAAGTGCAGCATCAAGGAGCGAATGCTCTACTCCAGCTGCAAGAGCCGCCTCCTCGACTCCGTGGAGCAGGACTTCCATCTGGAGATCGCCAAGAAAATTGAGATTGGCGATGGGGCAGAGCTGACGGCAGAGTTCCTCTACGACGAGGTGCACCCCAAGCAACACGCCTTCAAGCAGGCCTTCGCCAAGCCCAAGGGCCCAGGGGGCAAGCGGGGCCATAAGCGCCTCATCCGCGGCCCGGGTGAAAATGGGGATGACAGCTAGGAGGCTGGAGCAGGGCCGGCCACGTGTGGACTGTGGGGCTGCCCACCTTCCGCTCCCTGCCACCATCCTCCTTCCTGGGCTCCAGGAAAGTGTTTCTGGGAGGTCAGGAGGGCTGGCAGCTGAACGCACTTGCAGCGTCCGAGGGCCACCGGGCTGGCATTTTGTGACCCTTCCCTGTTGCTGTCCCTGCATCTCGTCTGTGTGCCCAGGGTGTCCGGGGACCCTGCCTGGCTGGCTTAAGGGGGCTGGGTCAGGGGCCTGGCATGAACCTGGCCTCCCGGGGAGCTGAGACTAGGGTCCCAGCACAGCCCAGAAACCTTTGGCCACAAGAAGTGGGGTCAGTCAGGGCTGGGGCAGGGGTCACTGCAGTTTGGGATGGTTGAATGCTGTATTTTCTAAAGAATAAAATATTTTTAAATC MAHQTGIHATEELKEFFAKARAGSVRLIKVVIEDEQLVLGASQEPVGRWDQDYDRAVLPLLDAQQPCYLLYRLDSQNAQGFEWLFLAWSPDNSPVRLKMLYAATRATVKKEFGGGHIKDELFGTVKDDLSFAGYQKHLSSCAAPAPLTSAERELQQIRINEVKTEISVESKHQTLQGLAFPLQPEAQRALQQLKQKMVNYIQMKLDLERETIELVHTEPTDVAQLPSRVPRDAARYHFFLYKHTHEGDPLESVVFIYSMPGYKCSIKERMLYSSCKSRLLDSVEQDFHLEIAKKIEIGDGAELTAEFLYDEVHPKQHAFKQAFAKPKGPGGKRGHKRLIRGPGENGDDS Known - Literature A6ps1 SK752 Atypical A6 Y 
CCGCCGGCCGGGGCCGCTGGCTGCACTCAGCGCCGGAGCCGGGAGCTAGCGGCCGCCGCCATGTCCCACCAGACCGGCATCCAAGCAAGTGAAGATGTTAAAGAGATCTTTGCCAGAGCCAGAAATGGAAAGTACAGACTTCTGAAAATATCTATTGAAAATGAGCAACTTGTGATTGGATCATATAGTCAGCCTTCAGATTCCTGGGATAAGGATTATGATTCCTTTGTTTTACCCCTGTTGGAGGACAAACAACCATGCTATATATTATTCAGGTTAGATTCTCAGAATGCCCAGGGATATGAATGGATATTCATTGCATGGTCTCCAGATCATTCTCATGTTCGTCAAAAAATGTTGTATGCAGCAACAAGAGCAACTCTGAAGAAGGAATTTGGAGGTGGCCACATTAAAGATGAAATATTTGGAACAGTAAAGGAAGATGTATCATTACATGGATATAAAAAATACTTGCTGTCACAATCTTCCCCTGCCCCACTGACTGCAGCTGAGGAAGAATTACGACAGATTAAAATCAATGAGGTACAGACTGACGTGGGTGTGGACACTAAGCATCAAACACTACAAGGAGTAGCATTTCCCATTTCTCGAGAAGCCTTTCAGGCTTTGGAAAAATTGAATAACAGACAGCTCAACTATGTGCAGTTGGAAATAGATATAAAAAATGAAATTATAATTTTGGCCAACACAACAAATACAGAACTGAAAGATTTGCCAAAGAGGATTCCCAAGGATTCAGCTCGTTACCATTTCTTTCTGTATAAACATTCCCATGAAGGAGACTATTTAGAGTCCATAGTTNTTATTTATTCAATGCCTGGATACACATGCAGTATAAGAGAGCGGATGCTGTATTCTAGCTGCAAGAGCCCTCTGCTAGAAATTGTAGAAAGACAACTACAAATGGATGTAATTAGAAAGATCGAGATAGACAATGGGGATGAGTTGACTGCAGACTTCCTTTATGAAGAAGTACATCCCAAGCAGCAGGCACACAAGCAAAGTTTTGCAAAACCAAAAGGTCCTGCAGGAAAAAGAGGAATTCGAAGACTAATTAGGGGCCCAGCGGAAACTGAAGCTACTACTGATTAAAGTCGTCACATTAAACATTGCAATACTAGTTTTTTAAAAGTCCAGCTTTTAGTACAGGAGAACTGAAATCATTCCATGTTGATATAAAGTAGGGAAAAACATTGTACTTTTTGGAAAATAGCACTTTTCACTTCTGTGTGTTTTTAAAATTAATGTTATAGAAGACTCATGATTTCTATTTTTGAGTTAAAGCTAGAAAAGGGTTCAACATAATGTTTAATTTTGTCACACTGTTTTCATAGTGTTGATTCCACACTTCAAATACTTCTTAAAATTTTATACAGTTGGGCCAGCTCTAGAAAGTCTGATGTCTCAAAGGGTAAACTTACTACTTTCTTGTGGGACAGAAGGACCTTAAAATATTCATATTACTTAATGAATATGTTAAGGACCAGGCTAGAGTATTTTCTAAGCTGGAAACTTAGTGTGCCTCGGAAAAGGCCGCAAGTTGCTTACTCCGAGTAGCTGTGCTAGCTCTGTCAGACTGTAGGATCATGTCTGCAACTTTTAGAAATAGTGCTTTATATTGCAGCAGTCTTTTATATTTGACTTTTTTTTTAATAGCATTAAAATTGCAGATCAGCTCACTCTGAAACTTTAAGGGTACCAGATATTTTCTATACTGCAGGATTTCTAATGACATTGAAAGACTTTTAAACAGCCTTAGTAAATTATCTTTCTAATGCTCTGTGAGGCCAAACATTTATGTTCAGATTGAAATTTAAATTAATATCATTCAAAAGGAAACAAAAAATGTTGAGTTTTAAAAATCAGGATTGACTTTTTTCTCCAAAACCATACATTTATAGGCAAATTGTGTTCTTTATCACTTCTGAGCAAATACTCAGATTTAAAATTACTTTAAAGTCCTGGTACTTAACAGGCTA
ACGTAGATAAACACCTTAATAATCTCAGTTAATACTGTATTTCAAAACACATTTAACTGTTTTCTAATGCTTTGCATTATCAGTTACAACCTAGAGAGATTTTGAGCCTCATATTTCTTTGATACTTGAAATAGAGGGAGCTAGAACACTTAATGTTTAATCTGTTAAACCTGCTGCAAGAGCCATAACTTTGAGGCATTTTCTAAATGAACTGTGGGGATCCAGGATTTGTAATTTCTTGATCTAAACTTTATGCTGCATAAATCACTTATCGGAAATGCACATTTCATAGTGTGAAGCACTCATTTCTAAACCTTATTATCTAAGGTAATATATGCACCTTTCAGAAATTTGTGTTCGAGTAAGTAAAGCATATTAGAATAATTATGGGTTGACAGATTTTTAAAATAGAATTTAGAGTATTTGTGTGGGGTTTTGTTTGTTTACAAATAATCAGACTATAATATTTAAACATGCAAAATAACTGAGAATAATGTTGCACTTGTTTACTAAAGATATAAGTTGTTCCATGGGTGTACACGTAGACAGACACACATACACCCAAATTATTGCATTAAGAATCCTGGAGCAGACCATAGCTGAAGCTGTTATTTTCAGTCAGGAAGACTACCTGTCATGAAGGTATACAATAATTTAGAAGTGAATGTTTTTCTGTACCATCTATGTGCAATTATACTCTAAATTCCACTACACTACATTAAAGTAAATGGACATTCCAGAATATAGATGTGATTATAGTCTTAAACTAATTATTAAACCAATGATTGCTGAAAATCAGTGATGCATTTGTTATAGAGTATAACTCATCGTTTACAGTATGTTTTAGTTGGCAGTATCATACCTAGATGGTGAATAACATATTCCCAGTAAATTTATATAGCAGTGAAGAATTACATGCCTTCTGGTGGACATTTTATAAGTGCATTTTATATCACAATAAAATTTTTTCTCTTTAA MSHQTGIQASEDVKEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPLLEDKQPCYILFRLDSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDEIFGTVKEDVSLHGYKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAFPISREAFQALEKLNNRQLNYVQLEIDIKNEIIILANTTNTELKDLPKRIPKDSARYHFFLYKHSHEGDYLESIVXIYSMPGYTCSIRERMLYSSCKSPLLEIVERQLQMDVIRKIEIDNGDELTADFLYEEVHPKQQAHKQSFAKPKGPAGKRGIRRLIRGPAETEATTD* Pseudogene A6ps2 SK766 Atypical A6 Y 
TCCCACCAGACCTGCATCCAAGCAAGTGAAGATGTTAAAGAGATCNNNGCCAGAGCCAGAAATGGAAAGTACAGACCTCTGAAAATATCTATTGAAAATGGGCAACTTATGATTGGATCATATNNNCAGCCTTCAGATTCCTGGGATAACGATTATGATTCCTTTGTTTTACCCCTGTTGGAGGACAAACAACTGTGCTATATATTATTCAGGTTAGATTCTCAGAATGCCCAGGGATATGAATGGATATTCATTGCATGGTTTCCAGATCATTCTCATGTCCGTCAAAAAAGGTTATATGCAGCAACAAGAGCAACTCTGGAAAAGGAATCTGGAGGTGGCCACGTTAAAGATGAAGTATTTGGAACAGTAAAGGAAGATGTATCATTACATGGATATAAAAAATGTTTGCTCTCACAATCTTCCCCTGCCCCACTGACTGCAGCTGAGGAAGAATTANNNGACATTAAAATCAATGAGGTACAGACTGACGTGGGTGTGGACGCTAAGCATCAAACACTACAAGGAGTAGCATTTCCTATTTCTCGAGAAGCTTTTCAGGCTTTGGAAAAAATAAATAACNNNCTGAACTATGTGCAGTTGGAAATAAACATAAAAAATGAAATTATAATTTTGGCCAACACAACAAATACAGAACTAAAAGATTTGCCAAAGAGGATTCCCAAGGATTCAGCTCGTTACCATTTCTTTCTGTATAAACATTCCCATGAAGGAGACTATTTAGAGTCCATAGTTTTTATCTATTCAATGCCCAGATACACATGCAGTATAAGAGAACGGATGCTGTATTCTAGCTGCAAGAGCCCTCTGCTAGAAATTGTAGAAAGACAACTANNNGTTGTAATGGATGTAATTAGAAAGATTGAGATAGACAATGAGGATTAGTTGACTTCAGACTTCCTTTGTGAAGAAGAAGTACATCCCAAGCAGCATGCAGGAAAAAGAAGAATTCGAAGACTAATTAGGGGCCCAGCGGAAAATGAAGCTACTACTGAT SHQTCIQASEDVKEIXARARNGKYRPLKISIENGQLMIGSYXQPSDSWDNDYDSFVLPLLEDKQLCYILFRLDSQNAQGYEWIFIAWFPDHSHVRQKRLYAATRATLEKESGGGHVKDEVFGTVKEDVSLHGYKKCLLSQSSPAPLTAAEEELXDIKINEVQTDVGVDAKHQTLQGVAFPISREAFQALEKINNXLNYVQLEINIKNEIIILANTTNTELKDLPKRIPKDSARYHFFLYKHSHEGDYLESIVFIYSMPRYTCSIRERMLYSSCKSPLLEIVERQLXVVMDVIRKIEIDNED*LTSDFLCEEEVHPKQHAGKRRIRRLIRGPAENEATTD Pseudogene CaMK2d SK703 CAMK CAMK2 N 
ATGGCTTCGACCACCACCTGCACCAGGTTCACGGACGAGTATCAGCTTTTCGAGGAGCTTGGAAAGGGGGCATTCTCAGTGGTGAGAAGATGTATGAAAATTCCTACTGGACAAGGATATGCTGCCAAAATTATCAACACCAAAAAGCTTTCTGCTAGGGATCATCAGAAACTAGAAAGAGAAGCTAGAATCTGCCGTCTTTTGAAGCACCCTAATATTGTGCGACTTCATGATAGCATATCAGAAGAGGGCTTTCACTACTTGGTGTTTGATTTAGTTACTGGAGGTGAACTGTTTGAAGACATAGTGGCAAGAGAATACTACAGTGAAGCTGATGCCAGTCATTGTATACAGCAGATTCTAGAAAGTGTTAATCATTGTCACCTAAATGGCATAGTTCACAGGGACCTGAAGCCTGAGAATTTGCTTTTAGCTAGCAAATCCAAGGGAGCAGCTGTGAAATTGGCAGACTTTGGCTTAGCCATAGAAGTTCAAGGGGACCAGCAGGCGTGGTTTGGTTTTGCTGGCACACCTGGATATCTTTCTCCAGAAGTTTTACGTAAAGATCCTTATGGAAAGCCAGTGGATATGTGGGCATGTGGTGTCATTCTCTATATTCTACTTGTGGGGTATCCACCCTTCTGGGATGAAGACCAACACAGACTCTATCAGCAGATCAAGGCTGGAGCTTATGATTTTCCATCACCAGAATGGGACACGGTGACTCCTGAAGCCAAAGACCTCATCAATAAAATGCTTACTATCAACCCTGCCAAACGCATCACAGCCTCAGAGGCACTGAAGCACCCATGGATCTGTCAACGTTCTACTGTTGCTTCCATGATGCACAGACAGGAGACTGTAGACTGCTTGAAGAAATTTAATGCTAGAAGAAAACTAAAGGGTGCCATCTTGACAACTATGCTGGCTACAAGGAATTTCTCAGCAGCCAAGAGTTTGTTGAAGAAACCAGATGGAGTAAAGGAGTCAACTGAGAGTTCAAATACAACAATTGAGGATGAAGATGTGAAAGCACGAAAGCAAGAGATTATCAAAGTCACTGAACAACTGATCGAAGCTATCAACAATGGGGACTTTGAAGCCTACACAAAAATCTGTGACCCAGGCCTTACTGCTTTTGAACCTGAAGCTTTGGGTAATTTAGTGGAAGGGATGGATTTTCACCGATTCTACTTTGAAAATGCTTTGTCCAAAAGCAATAAACCAATCCACACTATTATTCTAAACCCTCATGTACATCTGGTAGGGGATGATGCCGCCTGCATAGCATATATTAGGCTCACACAGTACATGGATGGCAGTGGAATGCCAAAGACAATGCAGTCAGAAGAGACTCGTGTGTGGCACCGCCGGGATGGAAAGTGGCAGAATGTTCATTTTCATCGCTCGGGGTCACCAACAGTACCCATCAAGCCACCCTGTATTCCAAATGGGAAAGAAAACTTCTCAGGAGGCACCTCTTTGTGGCAAAACATCTGA 
MASTTTCTRFTDEYQLFEELGKGAFSVVRRCMKIPTGQGYAAKIINTKKLSARDHQKLEREARICRLLKHPNIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQILESVNHCHLNGIVHRDLKPENLLLASKSKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGYLSPEVLRKDPYGKPVDMWACGVILYILLVGYPPFWDEDQHRLYQQIKAGAYDFPSPEWDTVTPEAKDLINKMLTINPAKRITASEALKHPWICQRSTVASMMHRQETVDCLKKFNARRKLKGAILTTMLATRNFSAAKSLLKKPDGVKESTESSNTTIEDEDVKARKQEIIKVTEQLIEAINNGDFEAYTKICDPGLTAFEPEALGNLVEGMDFHRFYFENALSKSNKPIHTIILNPHVHLVGDDAACIAYIRLTQYMDGSGMPKTMQSEETRVWHRRDGKWQNVHFHRSGSPTVPIKPPCIPNGKENFSGGTSLWQNI YQLFEELGKGAFSVVRRCMKIPTGQGYAAKIINTKKLSARDHQKLEREARICRLLKHPNIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQILESVNHCHLNGIVHRDLKPENLLLASKSKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGYLSPEVLRKDPYGKPVDMWACGVILYILLVGYPPFWDEDQHRLYQQIKAGAYDFPSPEWDTVTPEAKDLINKMLTINPAKRITASEALKHPWI Known - Refseq CaMKK2 SK482 Other CAMKK Meta N GAGCCGAGCCGAGCTGGGGGCGCAGACGCGGAGGCGGCGGCGGCGCGGAGCCCAGGTGGCTCCGTGCCGGATGGGAGTGCCCCAGTGTGCTGGATGAAGCTGGCGCATGCACCATGTCATCATGTGTCTCTAGCCAGCCCAGCAGCAACCGGGCCGCCCCCCAGGATGAGCTGGGGGGCAGGGGCAGCAGCAGCAGCGAAAGCCAGAAGCCCTGTGAGGCCCTGCGGGGCCTCTCATCCTTGAGCATCCACCTGGGCATGGAGTCCTTCATTGTGGTCACCGAGTGTGAGCCGGGCTGTGCTGTGGACCTCGGCTTGGCGCGGGACCGGCCCCTGGAGGCCGATGGCCAAGAGGTCCCCCTTGACACCTCCGGGTCCCAGGCCCGGCCCCACCTCTCCGGTCGCAAGCTGTCTCTGCAAGAGCGGTCCCAGGGTGGGCTGGCAGCCGGTGGCAGCCTGGACATGAACGGACGCTGCATCTGCCCGTCCCTGCCCTACTCACCCGTCAGCTCCCCGCAGTCCTCGCCTCGGCTGCCCCGGCGGCCGACAGTGGAGTCTCACCACGTCTCCATCACGGGTATGCAGGACTGTGTGCAGCTGAATCAGTATACCCTGAAGGATGAAATTGGAAAGGGCTCCTATGGTGTCGTCAAGTTGGCCTACAATGAAAATGACAATACCTACTATGCAATGAAGGTGCTGTCCAAAAAGAAGCTGATCCGGCAGGCCGCTTTTCCACGTCGCCCTCCACCCCGAGGCACCCGGCCAGCTCCTGGAGGCTGCATCCAGCCCAGGGGCCCCATTGAGCAGGTGTACCAGGAAATTGCCATCCTCAAGAAGCTGGACCACCCCAATGTGGTGAAGCTGGTGGAGGTCCTGGATGACCCCAATGAGGACCATCTGTACATGGTGTTCGAACTGGTCAACCAAGGGCCCGTGATGGAAGTGCCCACCCTCAAACCACTCTCTGAAGACCAGGCCCGTTTCTACTTCCAGGATCTGATCAAAGGCATCGAGTACTTACACTACCAGAAGATCATCCACCGTGACATCAAACCTTCCAACCTCCTGGTCGGAGAAGATGGGCACATCAAGATCGCTGACTTTGGTGTGAGCAATGAATTCAAGGGCAGTGACGCGCTCCTCTCCAACTACGTGGGCACGCCCGCCTTCATGGCTCCCGAGTCGCTCTCT
GAGACCCGCAAGATCTTCTCTGGGAAGGCCAAGGATGTTTGGGCCATGGGTGTGACACTATACTGCTTTGTCTTTGGCCAGTGCCCATTCATGGACGAGCGGATCATGTGTTTACACAGTAAGATCAAGAGTCAGGCCCTGGAATTTCCAGACCAGCCCGACATAGCTGAGGACTTGAAGGACCTGATCACCCGTATGCTGGACAAGAACCCCGAGTCGAGGATCGTGGTGCCGGAAATCAAGCTGCACCCCTGGGTCACGAGGCATGGGGCGGAGCCGTTGCCGTCGGAGGATGAGAACTGCACGCTGGTCGAAGTGACTGAAGAGGAGGTCGAGAACTCAGTCAAACACATTCCCAGCTTGGCAACCGTGATCCTGGTGAAGACCATGATACGTAAACGCTCCTTTGGGAACCCATTCGAGGGCAGCCGGCGGGAGGAACGCTCACTGTCAGCGCCTGGAAACTTGCTCACCAAAAAACCAACCAGGGAATGTGAGTCCCTGTCTGAGCTCAAGGAAGCAAGGCAGCGAAGACAACCTCCAGGGCACCGACCCGCCCCCCGTGGGGGAGGAGGAAGTGCTCTTGTGAGAGGCAGTCCCTGCGTGGAAAGTTGCTGGGCCCCCGCCCCCGGCTCCCCCGCACGCATGCATCCACTGCGGCCGGAGGAGGCCATGGAGCCCGAGTAGCTGCCTGGATCGCTCGACCTCGCATGCGCGCCGCGTCGCCTCTGGGGGGCTGCTGCACCGCGTTTCCATAGCAGCATGTCCTACGGAAACCCAGCACGTGTGTTTCCATAGCAGCATGTCTGCAACCCAGACGCGGTGTGTAGAGCCTCGATCGTCATCTCTGGTTATTTGTTTTTTCCTTTGTTGTTTTAAAGGGACAAAAAAAAAAAAGGACTGACTCCATAGCTCGACCGTGGCCGCTGGCTGGCTGGACAGGCGGGTGTGAGGAGTTGCAGACCCAAACCCACGTGCATTTTGGGACAATTGCTTTTTAAAACGTTTTTATGCCAAAAATCCTTCATTGTGATTTTCAGAACCACGTCAGATATACCAAGTGACTGTGTGTGGGGTTTGACAACTGTGGAAAGGCGAGCAGAAAACTCCGGCGGTCTGAGGCCATGGAGGTGGTTGCTGCATTTGAGAGGGAGTAGGGGGCTAGATGTGGCTCCTAGTGCAAACCGGAAACCATGGCACCTTCCAGAGCCGTGGTCTCAAGGAGTCAGAGCAGGGAGCTTTGATGCAACTTATTTGTAAGAAGGATTTTTAAATTTTTTATGGGTAGAATTGTAGTCAGGAAAACAGAAAGGGCTTGAAATTTAATAAGTGCTGCTGGAGGGGATTTTCCAAGCCTGGAAGGGTATTCAGCAGCTGTGGTGGGGAAAGATTTCTCCTGAAAGACTGAACGTGTTTCTTCATGACAGCTGCTCAAAGCAGGTTTCTGAGATAGCTGACCGAGCTCTGGTAAATCTCTTTGTCAAATTACGAAAACTTCAGGGCGAAATCCTATGCTTCCATGTACATTACATGGCTTAAGTTAAACAAAAACATTTTTCAAGTCTCTAACTAGAGTGAA 
MSSCVSSQPSSNRAAPQDELGGRGSSSSESQKPCEALRGLSSLSIHLGMESFIVVTECEPGCAVDLGLARDRPLEADGQEVPLDTSGSQARPHLSGRKLSLQERSQGGLAAGGSLDMNGRCICPSLPYSPVSSPQSSPRLPRRPTVESHHVSITGMQDCVQLNQYTLKDEIGKGSYGVVKLAYNENDNTYYAMKVLSKKKLIRQAAFPRRPPPRGTRPAPGGCIQPRGPIEQVYQEIAILKKLDHPNVVKLVEVLDDPNEDHLYMVFELVNQGPVMEVPTLKPLSEDQARFYFQDLIKGIEYLHYQKIIHRDIKPSNLLVGEDGHIKIADFGVSNEFKGSDALLSNYVGTPAFMAPESLSETRKIFSGKAKDVWAMGVTLYCFVFGQCPFMDERIMCLHSKIKSQALEFPDQPDIAEDLKDLITRMLDKNPESRIVVPEIKLHPWVTRHGAEPLPSEDENCTLVEVTEEEVENSVKHIPSLATVILVKTMIRKRSFGNPFEGSRREERSLSAPGNLLTKKPTRECESLSELKEARQRRQPPGHRPAPRGGGGSALVRGSPCVESCWAPAPGSPARMHPLRPEEAMEPE YTLKDEIGKGSYGVVKLAYNENDNTYYAMKVLSKKKLIRQAAFPRRPPPRGTRPAPGGCIQPRGPIEQVYQEIAILKKLDHPNVVKLVEVLDDPNEDHLYMVFELVNQGPVMEVPTLKPLSEDQARFYFQDLIKGIEYLHYQKIIHRDIKPSNLLVGEDGHIKIADFGVSNEFKGSDALLSNYVGTPAFMAPESLSETRKIFSGKAKDVWAMGVTLYCFVFGQCPFMDERIMCLHSKIKSQALEFPDQPDIAEDLKDLITRMLDKNPESRIVVPEIKLHPWV Known - Refseq AMPKa1 SK032 CAMK CAMKL AMPK N GCAGACTCAGTTCCTGGAGAAAGATGGCGACAGCCGAGAAGCAGAAACACGACGGGCGGGTGAAGATCGGCCACTACATTCTGGGTGACACGCTGGGGGTCGGCACCTTCGGCAAAGTGAAGGTTGGCAAACATGAATTGACTGGGCATAAAGTAGCTGTGAAGATACTCAATCGACAGAAGATTCGGAGCCTTGATGTGGTAGGAAAAATCCGCAGAGAAATTCAGAACCTCAAGCTTTTCAGGCATCCTCATATAATTAAACTGTACCAGGTCATCAGTACACCATCTGATATTTTCATGGTGATGGAATATGTCTCAGGAGGAGAGCTATTTGATTATATCTGTAAGAATGGAAGGCTGGATGAAAAAGAAAGTCGGCGTCTGTTCCAACAGATCCTTTCTGGTGTGGATTATTGTCACAGGCATATGGTGGTCCATAGAGATTTGAAACCTGAAAATGTCCTGCTTGATGCACACATGAATGCAAAGATAGCTGATTTTGGTCTTTCAAACATGATGTCAGATGGTGAATTTTTAAGAACAAGTTGTGGCTCACCCAACTATGCTGCACCAGAAGTAATTTCAGGAAGATTGTATGCAGGCCCAGAGGTAGATATATGGAGCAGTGGGGTTATTCTCTATGCTTTATTATGTGGAACCCTTCCATTTGATGATGACCATGTGCCAACTCTTTTTAAGAAGATATGTGATGGGATCTTCTATACCCCTCAATATTTAAATCCTTCTGTGATTAGCCTTTTGAAACATATGCTGCAGGTGGATCCCATGAAGAGGGCCTCAATCAAAGATATCAGGGAACATGAATGGTTTAAACAGGACCTTCCAAAATATCTCTTTCCTGAGGATCCATCATATAGTTCAACCATGATTGATGATGAAGCCTTAAAAGAAGTATGTGAAAAGTTTGAGTGCTCAGAAGAGGAAGTTCTCAGCTGTCTTTACAACAGAAATCACCAGGATCCTTTGGCAGTTGCCTACCATCTCATAATAGATAACAGGAGAATAATGAATGAAGCCAAAGATTTCTATTTGGCGACAAGCCCACCTGATTCTTTTCTT
GATGATCATCACCTGACTCGGCCCCATCCTGAAAGAGTACCATTCTTGGTTGCTGAAACACCAAGGGCACGCCATACCCTTGATGAATTAAATCCACAGAAATCCAAACACCAAGGTGTAAGGAAAGCAAAATGGCATTTAGGAATTAGAAGTCAAAGTCGACCAAATGATATTATGGCAGAAGTATGTAGAGCAATCAAACAATTGGATTATGAATGGAAGGTTGTAAACCCATATTATTTGCGTGTACGAAGGAAGAATCCTGTGACAAGCACTTACTCCAAAATGAGTCTACAGTTATACCAAGTGGATAGTAGAACTTATCTACTGGATTTCCGTAGTATTGATGATGAAATTACAGAAGCCAAATCAGGGACTGCTACTCCACAGAGATCGGGATCAGTTAGCAACTATCGATCTTGCCAAAGGAGTGATTCAGATGCTGAGGCTCAAGGAAAATCCTCAGAAGTTTCTCTTACCTCATCTGTGACCTCACTTGACTCTTCTCCTGTTGACCTAACTCCAAGACCTGGAAGTCACACAATAGAATTTTTTGAGATGTGTGCAAATCTAATTAAAATTCTTGCACAATAAACAGAAAACTTTGCTTATTTCTTTTGCAGCAATAAGCATGCATAATAAGTCACAGCCAAATGCTTCCATTTGTAATCAAGTTATACATAATTATAACCGAGGGCTGGCGTTTTGGAATCGAATTTCGACAGGGATTGGAACATGATTTATAGTTAAAAGCCTAATATCGAGAAATGAATTAAGATCA MATAEKQKHDGRVKIGHYILGDTLGVGTFGKVKVGKHELTGHKVAVKILNRQKIRSLDVVGKIRREIQNLKLFRHPHIIKLYQVISTPSDIFMVMEYVSGGELFDYICKNGRLDEKESRRLFQQILSGVDYCHRHMVVHRDLKPENVLLDAHMNAKIADFGLSNMMSDGEFLRTSCGSPNYAAPEVISGRLYAGPEVDIWSSGVILYALLCGTLPFDDDHVPTLFKKICDGIFYTPQYLNPSVISLLKHMLQVDPMKRASIKDIREHEWFKQDLPKYLFPEDPSYSSTMIDDEALKEVCEKFECSEEEVLSCLYNRNHQDPLAVAYHLIIDNRRIMNEAKDFYLATSPPDSFLDDHHLTRPHPERVPFLVAETPRARHTLDELNPQKSKHQGVRKAKWHLGIRSQSRPNDIMAEVCRAIKQLDYEWKVVNPYYLRVRRKNPVTSTYSKMSLQLYQVDSRTYLLDFRSIDDEITEAKSGTATPQRSGSVSNYRSCQRSDSDAEAQGKSSEVSLTSSVTSLDSSPVDLTPRPGSHTIEFFEMCANLIKILAQ YILGDTLGVGTFGKVKVGKHELTGHKVAVKILNRQKIRSLDVVGKIRREIQNLKLFRHPHIIKLYQVISTPSDIFMVMEYVSGGELFDYICKNGRLDEKESRRLFQQILSGVDYCHRHMVVHRDLKPENVLLDAHMNAKIADFGLSNMMSDGEFLRTSCGSPNYAAPEVISGRLYAGPEVDIWSSGVILYALLCGTLPFDDDHVPTLFKKICDGIFYTPQYLNPSVISLLKHMLQVDPMKRASIKDIREHEWF Known - Refseq AMPKa2 SK033 CAMK CAMKL AMPK N 
GGTAGCGGCGGCGGCGGCGGCTAGCGGAGCGGCAGGCGGTGGAGCGAGGCCGCGCGCGCCGAAGATGGCTGAGAAGCAGAAGCACGACGGGCGGGTGAAGATCGGACACTACGTGCTGGGCGACACGCTGGGCGTCGGCACCTTCGGCAAAGTGAAGATTGGAGAACATCAATTAACAGGCCATAAAGTGGCAGTTAAAATCTTAAATAGACAGAAGATTCGCAGTTTAGATGTTGTTGGAAAAATAAAACGAGAAATTCAAAATCTAAAACTCTTTCGTCATCCTCATATTATCAAACTATACCAGGTGATCAGCACTCCAACAGATTTTTTTATGGTAATGGAATATGTGTCTGGAGGTGAATTATTTGACTACATCTGTAAGCATGGACGGGTTGAAGAGATGGAAGCCAGGCGGCTCTTTCAGCAGATTCTGTCTGCTGTGGATTACTGTCATAGGCATATGGTTGTTCATCGAGACCTGAAACCAGAGAATGTCCTGTTGGATGCACACATGAATGCCAAGATAGCCGATTTCGGATTATCTAATATGATGTCAGATGGTGAATTTCTGAGAACTAGTTGCGGATCTCCAAATTATACAGCACCTGAAGTCATCTCAGGCAGATTGTATGCAGGTCCTGAAGTTGATATCTGGAGCTGTGGTGTTATCTTGTATGCTCTTCTTTGTGGCACCCTCCCATTTGATGATGAGCATGTACCTACGTTATTTAAGAAGATCCGAGGGGGTGTCTTTTATATCCCAGAATATCTCAATCGTTCTGTCGCCACTCTCCTGATGCATATGCTGCAGGTTGACCCACTGAAACGAGCAACTATCAAAGACATAAGAGAGCATGAATGGTTTAAACAAGGTTTGCCCAGTTACTTATTTCCTGAAGACCCTTCCTATGATGCTAACGTCATTGATGATGAGGCTGTGAAAGAAGTGTGTGAAAAATTTGAATGTACAGAATCAGAAGTAATGAACAGTTTATATAGTGGTGACCCTCAAGACCAGCTTGCAGTGGCTTATCATCTTATCATTGACAATCGGAGAATAATGAACCAAGCCAGTGAGTTCTACCTCGCCTCTAGTCCTCCATCTGGTTCTTTTATGGATGATAGTGCCATGCATATTCCCCCAGGCCTGAAACCTCATCCAGAAAGGATGCCACCTCTTATAGCAGACAGCCCCAAAGCAAGATGTCCATTGGATGCACTGAATACGACTAAGCCCAAATCTTTAGCTGTGAAAAAAGCCAAGTGGCGTCAAGGAATCCGAAGTCAGAGCAAACCGTATGACATTATGGCTGAAGTTTACCGAGCTATGAAGCAGCTGGATTTTGAATGGAAGGTAGTGAATGCATACCATCTTCGTGTAAGAAGAAAAAATCCAGTGACTGGCAATTACGTGAAAATGAGCTTACAACTTTACCTGGTTGATAACAGGAGCTATCTTTTGGACTTTAAAAGCATTGATGATGAAGTAGTGGAGCAGAGATCTGGTTCCTCAACACCTCAGCGTTCCTGTTCTGCTGCTGGCTTACACAGACCAAGATCAAGTTTTGATTCCACAACTGCAGAGAGCCATTCACTTTCTGGCTCTCTCACTGGCTCTTTGACCGGAAGCACATTGTCTTCAGTTTCACCTCGCCTGGGCAGTCACACCATGGATTTTTTTGAAATGTGTGCCAGTCTGATTACTACTTTAGCCCGTTGATCTGTCTCTAGTTTCTTTCTGTTATTGCACTATGAAAATCAGTTATATTCTTTAAATTTTTATCTTACTTTTGGATAATATCCACTGCAATACTAATTGAGAAACATGAATTATTTCCAGGGGCACACAATGCTATTGAAATTACTGAAAACAAAATATCTGACATCTTATTTACTTGTAGAAATCTGTAATTCTATTGTGCCTATGATAAATTCACATAGGCAATATCTTTAATAGGTTAATATCAATGAAGATTTTTAATTACAATAATGAGTTC
ACTACAGACGATTAACACACCACACTGGCGAACCATCTCAATGTAAGGGTGGTTTGGCAACACCTCCTTGCTTTGCTGTTTGGTGTAGTAAATCTAGTTTACTTCCTAAATTTCAGTAGGCTTTATGCTGTGTTTATCGCCCAATTTATTTTAACAAAAGAAGATTAAAAAGTAAAGAACCACGAGTAAGATATTATTTAAATGTTGAAATCTTAAAACCTGCCTCCAAGATTTCAGAAGCCAAGTTTTTCTAACAGTATTTGTACAAATACTGCCTAGTGTATTCAACAGAAGACTGTGGTCATGTAACAGGTAACCACAATTTTCAGGTTTCTTAAAAACAGCTGTAACTAACTCAGGA MAEKQKHDGRVKIGHYVLGDTLGVGTFGKVKIGEHQLTGHKVAVKILNRQKIRSLDVVGKIKREIQNLKLFRHPHIIKLYQVISTPTDFFMVMEYVSGGELFDYICKHGRVEEMEARRLFQQILSAVDYCHRHMVVHRDLKPENVLLDAHMNAKIADFGLSNMMSDGEFLRTSCGSPNYTAPEVISGRLYAGPEVDIWSCGVILYALLCGTLPFDDEHVPTLFKKIRGGVFYIPEYLNRSVATLLMHMLQVDPLKRATIKDIREHEWFKQGLPSYLFPEDPSYDANVIDDEAVKEVCEKFECTESEVMNSLYSGDPQDQLAVAYHLIIDNRRIMNQASEFYLASSPPSGSFMDDSAMHIPPGLKPHPERMPPLIADSPKARCPLDALNTTKPKSLAVKKAKWRQGIRSQSKPYDIMAEVYRAMKQLDFEWKVVNAYHLRVRRKNPVTGNYVKMSLQLYLVDNRSYLLDFKSIDDEVVEQRSGSSTPQRSCSAAGLHRPRSSFDSTTAESHSLSGSLTGSLTGSTLSSVSPRLGSHTMDFFEMCASLITTLAR YVLGDTLGVGTFGKVKIGEHQLTGHKVAVKILNRQKIRSLDVVGKIKREIQNLKLFRHPHIIKLYQVISTPTDFFMVMEYVSGGELFDYICKHGRVEEMEARRLFQQILSAVDYCHRHMVVHRDLKPENVLLDAHMNAKIADFGLSNMMSDGEFLRTSCGSPNYTAPEVISGRLYAGPEVDIWSCGVILYALLCGTLPFDDEHVPTLFKKIRGGVFYIPEYLNRSVATLLMHMLQVDPLKRATIKDIREHEWF Known - Refseq MARK3 SK096 CAMK CAMKL MARK N 
GACGGCCCGGGCCAGGCCCGGGATCTAGAACGGCCGTAGGGGGAAGGGAGCCGCCCTCCCCACGGCGCCTTTTCGGAACTGCCGTGGACTCGAGGACGCTGGTCGCCGGCCTCCTAGGGCTGTGCTGTTTTGTTTTGACCCTCGCATTGTGCAGAATTAAAGTGCAGTAAAATGTCCACTAGGACCCCATTGCCAACGGTGAATGAACGAGACACTGAAAACCACACGTCACATGGAGATGGGCGTCAAGAAGTTACCTCTCGTACCAGCCGCTCAGGAGCTCGGTGTAGAAACTCTATAGCCTCCTGTGCAGATGAACAACCTCACATCGGAAACTACAGACTGTTGAAAACAATCGGCAAGGGGAATTTTGCAAAAGTAAAATTGGCAAGACATATCCTTACAGGCAGAGAGGTTGCAATAAAAATAATTGACAAAACTCAGTTGAATCCAACAAGTCTACAAAAGCTCTTCAGAGAAGTAAGAATAATGAAGATTTTAAATCATCCCAATATAGTGAAGTTATTCGAAGTCATTGAAACTCAAAAAACACTCTACCTAATCATGGAATATGCAAGTGGAGGTAAAGTATTTGACTATTTGGTTGCACATGGCAGGATGAAGGAAAAAGAAGCAAGATCTAAATTTAGACAGATTGTGTCTGCAGTTCAATACTGCCATCAGAAACGGATCGTACATCGAGACCTCAAGGCTGAAAATCTATTGTTAGATGCCGATATGAACATTAAAATAGCAGATTTCGGTTTTAGCAATGAATTTACTGTTGGCGGTAAACTCGACACGTTTTGTGGCAGTCCTCCATACGCAGCACCTGAGCTCTTCCAGGGCAAGAAATATGACGGGCCAGAAGTGGATGTGTGGAGTCTGGGGGTCATTTTATACACACTAGTCAGTGGCTCACTTCCCTTTGATGGGCAAAACCTAAAGGAACTGAGAGAGAGAGTATTAAGAGGGAAATACAGAATTCCCTTCTACATGTCTACAGACTGTGAAAACCTTCTCAAACGTTTCCTGGTGCTAAATCCAATTAAACGCGGCACTCTAGAGCAAATCATGAAGGACAGGTGGATCAATGCAGGGCATGAAGAAGATGAACTCAAACCATTTGTTGAACCAGAGCTAGACATCTCAGACCAAAAAAGAATAGATATTATGGTGGGAATGGGATATTCACAAGAAGAAATTCAAGAATCTCTTAGTAAGATGAAATACGATGAAATCACAGCTACATATTTGTTATTGGGGAGAAAATCTTCAGAGGTTAGGCCGAGCAGTGATCTCAACAACAGTACTGGCCAGTCTCCTCACCACAAAGTGCAGAGAAGTGTTTCTTCAAGCCAAAAGCAAAGACGCTACAGTGACCATGCTGGACCAGGTATTCCTTCTGTTGTGGCGTATCCGAAAAGGAGTCAGACCAGCACTGCAGATAGTGACCTCAAAGAAGATGGAATTTCCTCCCGGAAATCAACTGGCAGTGCTGTTGGAGGAAAGGGAATTGCTCCAGCCAGTCCCATGCTTGGGAATGCAAGTAATCCTAATAAGGCGGATATTCCTGAACGCAAGAAAAGCTCCACTGTCCCTAGTAGTAACACAGCATCTGGTGGAATGACACGACGAAATACTTATGTTTGCAGTGAGAGAACTACAGATGATAGACACTCAGTGATTCAGAATGGCAAAGAAAACAGCACTATTCCTGATCAGAGAACTCCAGTTGCTTCAACACACAGTATCAGTAGTGCAGCCACCCCAGATCGAATCCGCTTCCCAAGAGGCACTGCCAGTCGTAGCACTTTCCACGGCCAGCCCCGGGAACGGCGAACCGCAACATATAATGGCCCTCCTGCCTCTCCCAGCCTGTCCCATGAAGCCACACCATTGTCCCAGACTCGAAGCCGAGGCTCCACTACTCTCTTTAGTAAATTAACTTCAAAACTCACAAGGAGTCGCAATGTATCTGCTAAGCAAAAAGATGAAAA
CAAAGAAGCAAAGCCTCGATCCCTACGCTTCACCTGGAGCATGAAAACCACTAGTTCAATGGATCCCGGGGACATGATGCGGGAAATCCGCAAAGTGTTGGACGCCAATAACTGCGACTATGAGCAGAGGGAGCGCTTCTTGCTCTTCTGCGTCCACGGAGATGGGCACGCGGAGAACCTCGTGCAGTGGGAAATGGAAGTGTGCAAGCTGCCAAGACTGTCTCTGAACGGGGTCCGGTTTAAGCGGATATCGGGGACATCCATAGCCTTCAAAAATATTGCTTCCAAAATTGCCAATGAGCTAAAGCTGTAACCCAGTGATTATGATGTAAATTAAGTAGCAAGTAAAGTGTTTTCCTGAACACTGATGGAAATGTATAGAATAATATTTAGGCAATAACGTCTGCATCTTCTAAATCATGAAATTAAAGTCTGAGGACGAGAGCACGCCTGGGAGCGAAAGCTGGCCTTTTTTCTACGAATGCACTACATTAAAGATGTGCAACCTATGCGCCCCCTGCCCTACTTCCGTTACCCTGAGAGTCGGCGTGTGGCCCCATCTCCATGTGCCTCCCGTCTGGGTGGGTGTGAGAGTGGACGGTATGTGTGTGAAGTGGTGTATATGGAAGCATCTCCCTACACTGGCAGCCAGTCATTACTAGTACCTCTGCGGGAGATCATCCGGTGCTAAAACATTACAGTTGCCAAGGAGGAAAATACTGAATGACTGCTAAGAATTAACCTTAAGACCAGTTCATAGTTAATACAGGTTTACAGTTCATGCCTGTGGTTTTGTGTTTGTTGTTTTGTGTTTTTTTAGTGCAAAAGGTTTAAATTTATAGTTGTGAACATTGCTTGTGTGTGTTTTTCTAAGTAGATTCACAAGATAATTAAAAATTCACTTTTTCTCAGGT MSTRTPLPTVNERDTENHTSHGDGRQEVTSRTSRSGARCRNSIASCADEQPHIGNYRLLKTIGKGNFAKVKLARHILTGREVAIKIIDKTQLNPTSLQKLFREVRIMKILNHPNIVKLFEVIETEKTLYLIMEYASGGEVFDYLVAHGRMKEKEARSKFRQIVSAVQYCHQKRIVHRDLKAENLLLDADMNIKIADFGFSNEFTVGGKLDTFCGSPPYAAPELFQGKKYDGPEVDVWSLGVILYTLVSGSLPFDGQNLKELRERVLRGKYRIPFYMSTDCENLLKRFLVLNPIKRGTLEQIMKDRWINAGHEEDELKPFVEPELDISDQKRIDIMVGMGYSQEEIQESLSKMKYDEITATYLLLGRKSSELDASDSSSSSNLSLAKVRPSSDLNNSTGQSPHHKVQRSVSSSQKQRRYSDHAGPAIPSVVAYPKRSQTSTADGDLKEDGISSRKSSGSAVGGKGIAPASPMLGNASNPNKADIPERKKSSTVPSSNTASGGMTRRNTYVCSERTTADRHSVIQNGKENSTIPDQRTPVASTHSISSAATPDRIRFPRGTASRSTFHGQPRERRTATYNGPPASPSLSHEATPLSQTRSRGSTNLFSKLTSKLTRSRNVSAEQKDENKEAKPRSLRFTWSMKTTSSMDPGDMMREIRKVLDANNCDYEQRERFLLFCVHGDGHAENLVQWEMEVCKLPRLSLNGVRFKRISGTSIAFKNIASKIANELKL YRLLKTIGKGNFAKVKLARHILTGREVAIKIIDKTQLNPTSLQKLFREVRIMKILNHPNIVKLFEVIETQKTLYLIMEYASGGKVFDYLVAHGRMKEKEARSKFRQIVSAVQYCHQKRIVHRDLKAENLLLDADMNIKIADFGFSNEFTVGGKLDTFCGSPPYAAPELFQGKKYDGPEVDVWSLGVILYTLVSGSLPFDGQNLKELRERVLRGKYRIPFYMSTDCENLLKRFLVLNPIKRGTLEQIMKDRWI Known - Refseq MARK2 SK120 CAMK CAMKL MARK N 
CGGCGGTGGTGGCGGCCATGTTGGGAGCAGCAGGTCCGGCGGCGGCTGCCTGTGTGCCGGGCGCGGAGCAGTGCCGCTGAGGGCAGGGGAGGAGCGAGGCAGGCGGCCGGCTGCGGCGGCAGAGAGTAGGCGGAGCGGCGCGGCCCGGCCGAAAGGCGGCACAGCCCAGCCGGGGGTCGGGGGGGTGCGGTCCGGAGCCGCTCGGAGCCGGCGCGGCCTAGCCCGAGCGGCGCATCCCCGGGCTGGCGTGAGCGGCTGCCCGGCCTCCCCGCACCCCCGGCCGGGGCCCATGCGGCGGGTGCTCCTGCTGTGAGAAGCCCCGCCCGGCCGGGCTCCGCGCCTTCCCTTCCCTCCCTTCCTCCAAGCTTCTCGGTTCCCTCCCCCGAGATACCGGCGCCATGTCCAGCGCTCGGACCCCCCTACCCACGCTGAACGAGAGGGACACGGAGCAGCCCACCTTGGGACACCTTGACTCCAAGCCCAGCAGTAAGTCCAACATGATTCGGGGCCGCAACTCAGCCACCTCTGCTGATGAGCAGCCCCACATTGGAAACTACCGGCTCCTCAAGACCATTGGCAAGGGTAATTTTGCCAAGGTGAAGTTGGCCCGACACATCCTGACTGGGAAAGAGGTAGCTGTGAAGATCATTGACAAGACTCAACTGAACTCCTCCAGCCTCCAGAAACTATTCCGCGAAGTAAGAATAATGAAGGTTTTGAATCATCCCAACATAGTTAAATTATTTGAAGTGATTGAGACTGAGAAAACGCTCTACCTTGTCATGGAGTACGCTAGTGGCGGAGAGGTATTTGATTACCTAGTGGCTCATGGCAGGATGAAAGAAAAAGAGGCTCGAGCCAAATTCCGCCAGATAGTGTCTGCTGTGCAGTACTGTCACCAGAAGTTTATTGTCCATAGAGACTTAAAGGCAGAAAACCTGCTCTTGGATGCTGATATGAACATCAAGATTGCAGACTTTGGCTTCAGCAATGAATTCACCTTTGGGAACAAGCTGGACACCTTCTGTGGCAGTCCCCCTTATGCTGCCCCAGAACTCTTCCAGGGCAAAAAATATGATGGACCCGAGGTGGATGTGTGGAGCCTAGGAGTTATCCTCTATACACTGGTCAGCGGATCCCTGCCTTTTGATGGACAGAACCTCAAGGAGCTGCGGGAACGGGTACTGAGGGGAAAATACCGTATTCCATTCTACATGTCCACGGACTGTGAAAACCTGCTTAAGAAATTTCTCATTCTTAATCCCAGCAAGAGAGGCACTTTAGAGCAAATCATGAAAGATCGATGGATGAATGTGGGTCACGAAGATGATGAACTAAAGCCTTACGTGGAGCCACTCCCTGACTACAAGGACCCCCGGCGGACAGAGCTGATGGTGTCCATGGGTTATACACGGGAAGAGATCCAGGACTCGCTGGTGGGCCAGAGATACAACGAGGTGATGGCCACCTATCTGCTCCTGGGCTACAAGAGCTCCGAGCTGGAAGGCGACACCATCACCCTGAAACCCCGGCCTTCAGCTGATCTAACCAATAGCAGCGCCCAATTCCCATCCCACAAGGTACAGCGAAGCGTGTCGGCCAATCCCAAGCAGCGGCGCTTCAGCGACCAGGCTGGTCCTGCCATTCCCACCTCTAATTCTTACTCTAAGAAGACTCAGAGTAACAACGCAGAAAATAAGCGGCCTGAGGAGGACCGGGAGTCAGGGCGGAAAGCCAGCAGCACAGCCAAGGTGCCTGCCAGCCCCCTGCCCGGTCTGGAGAGGAAGAAGACCACCCCAACCCCCTCCACGAACAGCGTCCTCTCCACCAGCACAAATCGAAGCAGGAATTCCCCACTTTTGGAGCGGGCCAGCCTCGGCCAGGCCTCCATCCAGAATGGCAAAGACAGCCTAACCATGCCAGGGTCCCGGGCCTCCACGGCTTCTGCTTCTGCCGCAGTCTCTGCGGCCCGGCCCCGCCAGCACCAGAAATCCATGTCG
GCCTCCGTGCACCCCAACAAGGCCTCTGGGCTGCCCCCCACGGAGAGTAACTGTGAGGTGCCGCGGCCCAGCACAGCCCCCCAGCGTGTCCCTGTTGCCTCCCCATCCGCCCACAACATCAGCAGCAGTGGTGGAGCCCCAGACCGAACTAACTTCCCCCGGGGTGTGTCCAGCCGAAGCACCTTCCATGCTGGGCAGCTCCGACAGGTGCGGGACCAGCAGAATTTGCCCTACGGTGTGACCCCAGCCTCTCCCTCTGGCCACAGCCAGGGCCGGCGGGGGGCCTCTGGGAGCATCTTCAGCAAGTTCACCTCCAAGTTTGTACGCAGAAATCTGTCTTTCAGGTTTGCCAGAAGGAACCTGAATGAACCTGAAAGCAAAGACCGAGTGGAGACGCTCAGACCTCACGTGGTGGGCAGTGGCGGCAACGACAAAGAAAAGGAAGAATTTCGGGAGGCCAAGCCCCGCTCCCTCCGCTTCACGTGGAGTATGAAGACCACGAGCTCCATGGAGCCCAACGAGATGATGCGGGAGATCCGCAAGGTGCTGGACGCGAACAGCTGCCAGAGCGAGCTGCATGAGAAGTACATGCTGCTGTGCATGCACGGCACGCCGGGCCACGAGGACTTCGTGCAGTGGGAGATGGAGGTGTGCAAACTGCCGCGGCTCTCTCTCAACGGGGTTCGATTTAAGCGGATATCGGGCACCTCCATGGCCTTCAAAAACATTGCCTCCAAAATAGCCAACGAGCTGAAGCTTTAACAGGCTGCCAGGAGCGGGGGCGGCGGGGGCGGGCCAGCTGGACGGGCTGCCGGCCGTGCGCCGCCCCACCTGGGCGAGACTGCAGCGATGGATTGGTGTGTCTCCCTGCTGGCACTTCTCCCCTCCCTGGCCCTTCTCAGTTTTCTCCCACATTCACCCCTGCCCAGAGATTCCCCCTTCTCCTCTCCCCTACTGGAGGCAAAGGAAGGGGAGGGTGGATGGGGGGGCAGGGCTCCCCCTCGGTACTGCGGTTGCACAGAGTATTTCGCCTAAACCAAGAAATTTTTTATTACCAAAAAGA MSSARTPLPTLNERDTEQPTLGHLDSKPSSKSNMIRGRNSATSADEQPHIGNYRLLKTIGKGNFAKVKLARHILTGKEVAVKIIDKTQLNSSSLQKLFREVRIMKVLNHPNIVKLFEVIETEKTLYLVMEYASGGEVFDYLVAHGRMKEKEARAKFRQIVSAVQYCHQKFIVHRDLKAENLLLDADMNIKIADFGFSNEFTFGNKLDTFCGSPPYAAPELFQGKKYDGPEVDVWSLGVILYTLVSGSLPFDGQNLKELRERVLRGKYRIPFYMSTDCENLLKKFLILNPSKRGTLEQIMKDRWMNVGHEDDELKPYVEPLPDYKDPRRTELMVSMGYTREEIQDSLVGQRYNEVMATYLLLGYKSSELEGDTITLKPRPSADLTNSSAQFPSHKVQRSVSANPKQRRFSDQAGPAIPTSNSYSKKTQSNNAENKRPEEDRESGRKASSTAKVPASPLPGLERKKTTPTPSTNSVLSTSTNRSRNSPLLERASLGQASIQNGKDSLTMPGSRASTASASAAVSAARPRQHQKSMSASVHPNKASGLPPTESNCEVPRPSTAPQRVPVASPSAHNISSSGGAPDRTNFPRGVSSRSTFHAGQLRQVRDQQNLPYGVTPASPSGHSQGRRGASGSIFSKFTSKFVRRNLSFRFARRNLNEPESKDRVETLRPHVVGSGGNDKEKEEFREAKPRSLRFTWSMKTTSSMEPNEMMREIRKVLDANSCQSELHEKYMLLCMHGTPGHEDFVQWEMEVCKLPRLSLNGVRFKRISGTSMAFKNIASKIANELKL 
YRLLKTIGKGNFAKVKLARHILTGKEVAVKIIDKTQLNSSSLQKLFREVRIMKVLNHPNIVKLFEVIETEKTLYLVMEYASGGEVFDYLVAHGRMKEKEARAKFRQIVSAVQYCHQKFIVHRDLKAENLLLDADMNIKIADFGFSNEFTFGNKLDTFCGSPPYAAPELFQGKKYDGPEVDVWSLGVILYTLVSGSLPFDGQNLKELRERVLRGKYRIPFYMSTDCENLLKKFLILNPSKRGTLEQIMKDRWM Known - RefseqPX1CG.gb100644000766000024 5534014006427353 15557 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS PX1CG 5386 bp ss-DNA circular PHG 16-JUN-2004 DEFINITION Coliphage phi-X174, complete genome. ACCESSION J02482 M10348 M10379 M10714 M10749 M10750 M10866 M10867 M24859 V01128 VERSION J02482.1 GI:216019 KEYWORDS . SOURCE Enterobacteria phage phiX174 ORGANISM Enterobacteria phage phiX174 Viruses; ssDNA viruses; Microviridae; Microvirus. REFERENCE 1 (bases 1047 to 1094) AUTHORS Ziff,E.B., Sedat,J.W. and Galibert,F. TITLE Determination of the nucleotide sequence of a fragment of bacteriophage phiX 174 DNA JOURNAL Nature New Biol. 241 (106), 34-37 (1973) PUBMED 4349156 REFERENCE 2 (bases 2370 to 2421) AUTHORS Robertson,H.D., Barrell,B.G., Weith,H.L. and Donelson,J.E. TITLE Isolation and sequence analysis of a ribosome-protected fragment from bacteriophage phiX 174 DNA JOURNAL Nature New Biol. 241 (106), 38-40 (1973) PUBMED 4572838 REFERENCE 3 (bases 2370 to 2420) AUTHORS Barrell,B.G., Weith,H.L., Donelson,J.E. and Robertson,H.D. TITLE Sequence analysis of the ribosome-protected bacteriophase phiX174 DNA fragment containing the gene G initiation site JOURNAL J. Mol. Biol. 92 (3), 377-393 (1975) PUBMED 1095758 REFERENCE 4 (bases 2365 to 2591) AUTHORS Air,G.M., Blackburn,E.H., Sanger,F. and Coulson,A.R. TITLE The nucleotide and amino acid sequences of the N (5') terminal region of gene G of bacteriophage phiphiX 174 JOURNAL J. Mol. Biol. 96 (4), 703-719 (1975) PUBMED 1081600 REFERENCE 5 (bases 2263 to 2421) AUTHORS Fiddes,J.C. TITLE Nucleotide sequence of the intercistronic region between genes G and F in bacteriophage phiX174 DNA JOURNAL J. Mol. Biol. 
107 (1), 1-24 (1976) PUBMED 826639 REFERENCE 6 (bases 1017 to 1081) AUTHORS Sedat,J., Ziff,E. and Galibert,F. TITLE Direct determination of DNA nucleotide sequences. Structure of large specific fragments of bacteriophage phiX174 DNA JOURNAL J. Mol. Biol. 107 (4), 391-416 (1976) PUBMED 1003475 REFERENCE 7 (bases 730 to 903) AUTHORS Blackburn,E.H. TITLE Transcription and sequence analysis of a fragment of bacteriophage phiX174 DNA JOURNAL J. Mol. Biol. 107 (4), 417-431 (1976) PUBMED 826641 REFERENCE 8 (bases 1017 to 1762) AUTHORS Air,G.M., Blackburn,E.H., Coulson,A.R., Galibert,F., Sanger,F., Sedat,J.W. and Ziff,E.B. TITLE Gene F of bacteriophage phiX174. Correlation of nucleotide sequences from the DNA and amino acid sequences from the gene product JOURNAL J. Mol. Biol. 107 (4), 445-458 (1976) PUBMED 1088826 REFERENCE 9 (bases 2395 to 2922) AUTHORS Air,G.M., Sanger,F. and Coulson,A.R. TITLE Nucleotide and amino acid sequences of gene G of omegaX174 JOURNAL J. Mol. Biol. 108 (3), 519-533 (1976) PUBMED 1088827 REFERENCE 10 (bases 4137 to 4207) AUTHORS Mansfeld,A.D., Vereijken,J.M. and Jansz,H.S. TITLE The nucleotide sequence of a DNA fragment, 71 base pairs in length, near the origin of DNA replication of bacteriophage 0X174 JOURNAL Nucleic Acids Res. 3 (10), 2827-2844 (1976) PUBMED 995652 REFERENCE 11 (bases 4505 to 5374) AUTHORS Brown,N.L. and Smith,M. TITLE The sequence of a region of bacteriophage phiX174 DNA coding for parts of genes A and B JOURNAL J. Mol. Biol. 116 (1), 1-28 (1977) PUBMED 592379 REFERENCE 12 (bases 1 to 5375) AUTHORS Sanger,F., Air,G.M., Barrell,B.G., Brown,N.L., Coulson,A.R., Fiddes,C.A., Hutchison,C.A., Slocombe,P.M. and Smith,M. TITLE Nucleotide sequence of bacteriophage phi X174 DNA JOURNAL Nature 265 (5596), 687-695 (1977) PUBMED 870828 REFERENCE 13 (bases 5022 to 5132) AUTHORS Brown,N.L. and Smith,M. 
TITLE DNA sequence of a region of the phi X174 genome coding for a ribosome binding site JOURNAL Nature 265 (5596), 695-698 (1977) PUBMED 859573 REFERENCE 14 (bases 5346 to 5386; 1 to 159) AUTHORS Smith,M., Brown,N.L., Air,G.M., Barrell,B.G., Coulson,A.R., Hutchison,C.A. III and Sanger,F. TITLE DNA sequence at the C termini of the overlapping genes A and B in bacteriophage phi X174 JOURNAL Nature 265 (5596), 702-705 (1977) PUBMED 859575 REFERENCE 15 (sites) AUTHORS Fiddes,J.C. TITLE The nucleotide sequence of a viral DNA JOURNAL Sci. Am. 237 (6), 54-67 (1977) PUBMED 929160 REFERENCE 16 (bases 1 to 5386) AUTHORS Sanger,F., Coulson,A.R., Friedmann,T., Air,G.M., Barrell,B.G., Brown,N.L., Fiddes,J.C., Hutchison,C.A. III, Slocombe,P.M. and Smith,M. TITLE The nucleotide sequence of bacteriophage phiX174 JOURNAL J. Mol. Biol. 125 (2), 225-246 (1978) PUBMED 731693 REFERENCE 17 (bases 1290 to 1302; 1340 to 1430; 1510 to 1570; 1600 to 1750) AUTHORS Air,G.M., Coulson,A.R., Fiddes,J.C., Friedmann,T., Hutchison,C.A. III, Sanger,F., Slocombe,P.M. and Smith,A.J. TITLE Nucleotide sequence of the F protein coding region of bacteriophage phiX174 and the amino acid sequence of its product JOURNAL J. Mol. Biol. 125 (2), 247-254 (1978) PUBMED 731694 REFERENCE 18 (bases 4256 to 4317) AUTHORS Langeveld,S.A., van Mansfeld,A.D., de Winter,J.M. and Weisbeek,P.J. TITLE Cleavage of single-stranded DNA by the A and A* proteins of bacteriophage phi X174 JOURNAL Nucleic Acids Res. 7 (8), 2177-2188 (1979) PUBMED 160544 REFERENCE 19 (bases 4248 to 4332) AUTHORS Heidekamp,F., Langeveld,S.A., Baas,P.D. and Jansz,H.S. TITLE Studies of the recognition sequence of phi X174 gene A protein. Cleavage site of phi X gene A protein in St-1 RFI DNA JOURNAL Nucleic Acids Res. 8 (9), 2009-2021 (1980) PUBMED 6253953 REFERENCE 20 (bases 436 to 490; 630 to 669; 930 to 979) AUTHORS Takeshita,M., Kappen,L.S., Grollman,A.P., Eisenberg,M. and Goldberg,I.H. 
TITLE Strand scission of deoxyribonucleic acid by neocarzinostatin, auromomycin, and bleomycin: studies on base release and nucleotide sequence specificity JOURNAL Biochemistry 20 (26), 7599-7606 (1981) PUBMED 6173064 REFERENCE 22 (bases 1064 to 1757) AUTHORS Merville,M.P., Piette,J., Lopez,M., Decuyper,J. and van de Vorst,A. TITLE Termination sites of the in vitro DNA synthesis on single-stranded DNA photosensitized by promazines JOURNAL J. Biol. Chem. 259 (24), 15069-15077 (1984) PUBMED 6239864 REFERENCE 23 (bases 2380 to 2512; 2593 to 2786; 2788 to 2947) AUTHORS Air,G.M., Els,M.C., Brown,L.E., Laver,W.G. and Webster,R.G. TITLE Location of antigenic sites on the three-dimensional structure of the influenza N2 virus neuraminidase JOURNAL Virology 145 (2), 237-248 (1985) PUBMED 2411049 REFERENCE 24 (bases 449 to 482; 504 to 598; 1047 to 1111) AUTHORS Ueda,K., Morita,J. and Komano,T. TITLE Sequence specificity of heat-labile sites in DNA induced by mitomycin C JOURNAL Biochemistry 23 (8), 1634-1640 (1984) PUBMED 6232949 COMMENT On Apr 28, 2004 this sequence version replaced gi:15535. [8] intermittent sequences. [15] review; discussion of complete genome. Double checked with sumex tape. Single-stranded circular DNA which codes for eleven proteins. Replicative form is duplex, icosahedron, related to s13 & g4. [21] indicates that mitomycin C reduced with sodium borohydride induced heat-labile sites in DNA most preferentially at dinucleotide sequence 'gt' (especially 'Pu-g-t'). Bacteriophage phi-X174 single stranded DNA molecules were irradiated with near UV light in the presence of promazine derivatives, after priming with restriction fragments or synthetic primers [22]. The resulting DNA fragments were used as templates for in vitro complementary chain synthesis by E.coli DNA polymerase I [22]. More than 90% of the observed chain terminations were mapped one nucleotide before a guanine residue [22]. 
Photoreaction occurred more predominantly with guanine residues localized in single-stranded parts of the genome [22]. These same guanine residues could also be damaged when the reaction was performed in the dark, in the presence of promazine cation radicals [22]. FEATURES Location/Qualifiers source 1..5386 /organism="Enterobacteria phage phiX174" /mol_type="genomic DNA" /db_xref="taxon:10847" CDS join(3981..5386,1..136) /function="viral strand synthesis" /note="rf replication" /codon_start=1 /transl_table=11 /product="A" /protein_id="AAA32570.1" /db_xref="GI:216020" /translation="MVRSYYPSECHADYFDFERIEALKPAIEACGISTLSQSPMLGFH KQMDNRIKLLEEILSFRMQGVEFDNGDMYVDGHKAASDVRDEFVSVTEKLMDELAQCY NVLPQLDINNTIDHRPEGDEKWFLENEKTVTQFCRKLAAERPLKDIRDEYNYPKKKGI KDECSRLLEASTMKSRRGFAIQRLMNAMRQAHADGWFIVFDTLTLADDRLEAFYDNPN ALRDYFRDIGRMVLAAEGRKANDSHADCYQYFCVPEYGTANGRLHFHAVHFMRTLPTG SVDPNFGRRVRNRRQLNSLQNTWPYGYSMPIAVRYTQDAFSRSGWLWPVDAKGEPLKA TSYMAVGFYVAKYVNKKSDMDLAAKGLGAKEWNNSLKTKLSLLPKKLFRIRMSRNFGM KMLTMTNLSTECLIQLTKLGYDATPFNQILKQNAKREMRLRLGKVTVADVLAAQPVTT NLLKFMRASIKMIGVSNLQSFIASMTQKLTLSDISDESKNYLDKAGITTACLRIKSKW TAGGK" CDS join(4497..5386,1..136) /function="shut off host DNA synthesis" /codon_start=1 /transl_table=11 /product="A*" /protein_id="AAA32571.1" /db_xref="GI:216021" /translation="MKSRRGFAIQRLMNAMRQAHADGWFIVFDTLTLADDRLEAFYDN PNALRDYFRDIGRMVLAAEGRKANDSHADCYQYFCVPEYGTANGRLHFHAVHFMRTLP TGSVDPNFGRRVRNRRQLNSLQNTWPYGYSMPIAVRYTQDAFSRSGWLWPVDAKGEPL KATSYMAVGFYVAKYVNKKSDMDLAAKGLGAKEWNNSLKTKLSLLPKKLFRIRMSRNF GMKMLTMTNLSTECLIQLTKLGYDATPFNQILKQNAKREMRLRLGKVTVADVLAAQPV TTNLLKFMRASIKMIGVSNLQSFIASMTQKLTLSDISDESKNYLDKAGITTACLRIKS KWTAGGK" CDS join(5075..5386,1..51) /function="capsid morphogenesis" /codon_start=1 /transl_table=11 /product="B" /protein_id="AAA32572.1" /db_xref="GI:216022" /translation="MEQLTKNQAVATSQEAVQNQNEPQLRDENAHNDKSVHGVLNPTY QAGLRRDAVQPDIEAERKKRDEIEAGKSYCSRRFGGATCDDKSAQIYARFDKNDWRIQ PAEFYRFHDAEVNTFGYF" variation 23 /note="in am18 and am35 [14]" /replace="t" variation 25 
/note="ts116 [14]" /replace="c" CDS 51..221 /codon_start=1 /transl_table=11 /product="K" /protein_id="AAA32573.1" /db_xref="GI:216023" /translation="MSRKIILIKQELLLLVYELNRSGLLAENEKIRPILAQLEKLLLC DLSPSTNDSVKN" variation 57 /note="am6 [14]" /replace="c" variation 117 /note="am6 [14]" /replace="a" CDS 133..393 /note="DNA maturation" /codon_start=1 /transl_table=11 /product="C" /protein_id="AAA32574.1" /db_xref="GI:216024" /translation="MRKFDLSLRSSRSSYFATFRHQLTILSKTDALDEEKWLNMLGTF VKDWFRYESHFVHGRDSLVDILKERGLLSESDAVQPLIGKKS" mRNA 358..3975 /product="major transcript" mRNA 358..991 /product="minor transcript" CDS 390..848 /function="capsid morphogenesis" /codon_start=1 /transl_table=11 /product="D" /protein_id="AAA32575.1" /db_xref="GI:216025" /translation="MSQVTEQSVRFQTALASIKLIQASAVLDLTEDDFDFLTSNKVWI ATDRSRARRCVEACVYGTLDFVGYPRFPAPVEFIAAVIAYYVHPVNIQTACLIMEGAE FTENIINGVERPVKAAELFAFTLRVRAGNTDVLTDAEENVRQKLRAEGVM" CDS 568..843 /function="cell lysis" /codon_start=1 /transl_table=11 /product="E" /protein_id="AAA32576.1" /db_xref="GI:216026" /translation="MVRWTLWDTLAFLLLLSLLLPSLLIMFIPSTFKRPVSSWKALNL RKTLLMASSVRLKPLNCSRLPCVYAQETLTFLLTQKKTCVKNYVRKE" CDS 848..964 /note="core protein; DNA condensation" /codon_start=1 /transl_table=11 /product="J" /protein_id="AAA32577.1" /db_xref="GI:216027" /translation="MSKGKKRSGARPGRPQPLRGTKGKRKGARLWYVGGQQF" CDS 1001..2284 /note="major coat protein" /codon_start=1 /transl_table=11 /product="F" /protein_id="AAA32578.1" /db_xref="GI:216028" /translation="MSNIQTGAERMPHDLSHLGFLAGQIGRLITISTTPVIAGDSFEM DAVGALRLSPLRRGLAIDSTVDIFTFYVPHRHVYGEQWIKFMKDGVNATPLPTVNTTG YIDHAAFLGTINPDTNKIPKHLFQGYLNIYNNYFKAPWMPDRTEANPNELNQDDARYG FRCCHLKNIWTAPLPPETELSRQMTTSTTSIDIMGLQAAYANLHTDQERDYFMQRYHD VISSFGGKTSYDADNRPLLVMRSNLWASGYDVDGTDQTSLGQFSGRVQQTYKHSVPRF FVPEHGTMFTLALVRFPPTATKEIQYLNAKGALTYTDIAGDPVLYGNLPPREISMKDV FRSGDSSKKFKIAEGQWYRYAPSYVSPAYHLLEGFPFIQEPPSGDLQERVLIRHHDYD QCFQSVQLLQWNSQVKFNVTVYRNLPTTRDSIMTS" CDS 2395..2922 /note="major spike protein" /codon_start=1 
/transl_table=11 /product="G" /protein_id="AAA32579.1" /db_xref="GI:216029" /translation="MFQTFISRHNSNFFSDKLVLTSVTPASSAPVLQTPKATSSTLYF DSLTVNAGNGGFLHCIQMDTSVNAANQVVSVGADIAFDADPKFFACLVRFESSSVPTT LPTAYDVYPLNGRHDGGYYTVKDCVTIDVLPRTPGNNVYVGFMVWSNFTATKCRGLVS LNQVIKEIICLQPLK" CDS 2931..3917 /function="adsorption" /note="minor spike protein" /codon_start=1 /transl_table=11 /product="H" /protein_id="AAA32580.1" /db_xref="GI:216030" /translation="MFGAIAGGIASALAGGAMSKLFGGGQKAASGGIQGDVLATDNNT VGMGDAGIKSAIQGSNVPNPDEAAPSFVSGAMAKAGKGLLEGTLQAGTSAVSDKLLDL VGLGGKSAADKGKDTRDYLAAAFPELNAWERAGADASSAGMVDAGFENQKELTKMQLD NQKEIAEMQNETQKEIAGIQSATSRQNTKDQVYAQNEMLAYQQKESTARVASIMENTN LSKQQQVSEIMRQMLTQAQTAGQYFTNDQIKEMTRKVSAEVDLVHQQTQNQRYGSSHI GATAKDISNVVTDAASGVVDIFHGIDKAVADTWNNFWKDGKADGIGSNLSRK" misc_feature 3962 /note="transcription start site" rep_origin 4306 /note="origin of viral strand synthesis" misc_feature 4899 /note="transcription start site" ORIGIN 1 gagttttatc gcttccatga cgcagaagtt aacactttcg gatatttctg atgagtcgaa 61 aaattatctt gataaagcag gaattactac tgcttgttta cgaattaaat cgaagtggac 121 tgctggcgga aaatgagaaa attcgaccta tccttgcgca gctcgagaag ctcttacttt 181 gcgacctttc gccatcaact aacgattctg tcaaaaactg acgcgttgga tgaggagaag 241 tggcttaata tgcttggcac gttcgtcaag gactggttta gatatgagtc acattttgtt 301 catggtagag attctcttgt tgacatttta aaagagcgtg gattactatc tgagtccgat 361 gctgttcaac cactaatagg taagaaatca tgagtcaagt tactgaacaa tccgtacgtt 421 tccagaccgc tttggcctct attaagctca ttcaggcttc tgccgttttg gatttaaccg 481 aagatgattt cgattttctg acgagtaaca aagtttggat tgctactgac cgctctcgtg 541 ctcgtcgctg cgttgaggct tgcgtttatg gtacgctgga ctttgtggga taccctcgct 601 ttcctgctcc tgttgagttt attgctgccg tcattgctta ttatgttcat cccgtcaaca 661 ttcaaacggc ctgtctcatc atggaaggcg ctgaatttac ggaaaacatt attaatggcg 721 tcgagcgtcc ggttaaagcc gctgaattgt tcgcgtttac cttgcgtgta cgcgcaggaa 781 acactgacgt tcttactgac gcagaagaaa acgtgcgtca aaaattacgt gcggaaggag 841 tgatgtaatg tctaaaggta aaaaacgttc tggcgctcgc cctggtcgtc cgcagccgtt 901 
gcgaggtact aaaggcaagc gtaaaggcgc tcgtctttgg tatgtaggtg gtcaacaatt 961 ttaattgcag gggcttcggc cccttacttg aggataaatt atgtctaata ttcaaactgg 1021 cgccgagcgt atgccgcatg acctttccca tcttggcttc cttgctggtc agattggtcg 1081 tcttattacc atttcaacta ctccggttat cgctggcgac tccttcgaga tggacgccgt 1141 tggcgctctc cgtctttctc cattgcgtcg tggccttgct attgactcta ctgtagacat 1201 ttttactttt tatgtccctc atcgtcacgt ttatggtgaa cagtggatta agttcatgaa 1261 ggatggtgtt aatgccactc ctctcccgac tgttaacact actggttata ttgaccatgc 1321 cgcttttctt ggcacgatta accctgatac caataaaatc cctaagcatt tgtttcaggg 1381 ttatttgaat atctataaca actattttaa agcgccgtgg atgcctgacc gtaccgaggc 1441 taaccctaat gagcttaatc aagatgatgc tcgttatggt ttccgttgct gccatctcaa 1501 aaacatttgg actgctccgc ttcctcctga gactgagctt tctcgccaaa tgacgacttc 1561 taccacatct attgacatta tgggtctgca agctgcttat gctaatttgc atactgacca 1621 agaacgtgat tacttcatgc agcgttacca tgatgttatt tcttcatttg gaggtaaaac 1681 ctcttatgac gctgacaacc gtcctttact tgtcatgcgc tctaatctct gggcatctgg 1741 ctatgatgtt gatggaactg accaaacgtc gttaggccag ttttctggtc gtgttcaaca 1801 gacctataaa cattctgtgc cgcgtttctt tgttcctgag catggcacta tgtttactct 1861 tgcgcttgtt cgttttccgc ctactgcgac taaagagatt cagtacctta acgctaaagg 1921 tgctttgact tataccgata ttgctggcga ccctgttttg tatggcaact tgccgccgcg 1981 tgaaatttct atgaaggatg ttttccgttc tggtgattcg tctaagaagt ttaagattgc 2041 tgagggtcag tggtatcgtt atgcgccttc gtatgtttct cctgcttatc accttcttga 2101 aggcttccca ttcattcagg aaccgccttc tggtgatttg caagaacgcg tacttattcg 2161 ccaccatgat tatgaccagt gtttccagtc cgttcagttg ttgcagtgga atagtcaggt 2221 taaatttaat gtgaccgttt atcgcaatct gccgaccact cgcgattcaa tcatgacttc 2281 gtgataaaag attgagtgtg aggttataac gccgaagcgg taaaaatttt aatttttgcc 2341 gctgaggggt tgaccaagcg aagcgcggta ggttttctgc ttaggagttt aatcatgttt 2401 cagactttta tttctcgcca taattcaaac tttttttctg ataagctggt tctcacttct 2461 gttactccag cttcttcggc acctgtttta cagacaccta aagctacatc gtcaacgtta 2521 tattttgata gtttgacggt taatgctggt aatggtggtt ttcttcattg cattcagatg 2581 gatacatctg 
tcaacgccgc taatcaggtt gtttctgttg gtgctgatat tgcttttgat 2641 gccgacccta aattttttgc ctgtttggtt cgctttgagt cttcttcggt tccgactacc 2701 ctcccgactg cctatgatgt ttatcctttg aatggtcgcc atgatggtgg ttattatacc 2761 gtcaaggact gtgtgactat tgacgtcctt ccccgtacgc cgggcaataa cgtttatgtt 2821 ggtttcatgg tttggtctaa ctttaccgct actaaatgcc gcggattggt ttcgctgaat 2881 caggttatta aagagattat ttgtctccag ccacttaagt gaggtgattt atgtttggtg 2941 ctattgctgg cggtattgct tctgctcttg ctggtggcgc catgtctaaa ttgtttggag 3001 gcggtcaaaa agccgcctcc ggtggcattc aaggtgatgt gcttgctacc gataacaata 3061 ctgtaggcat gggtgatgct ggtattaaat ctgccattca aggctctaat gttcctaacc 3121 ctgatgaggc cgcccctagt tttgtttctg gtgctatggc taaagctggt aaaggacttc 3181 ttgaaggtac gttgcaggct ggcacttctg ccgtttctga taagttgctt gatttggttg 3241 gacttggtgg caagtctgcc gctgataaag gaaaggatac tcgtgattat cttgctgctg 3301 catttcctga gcttaatgct tgggagcgtg ctggtgctga tgcttcctct gctggtatgg 3361 ttgacgccgg atttgagaat caaaaagagc ttactaaaat gcaactggac aatcagaaag 3421 agattgccga gatgcaaaat gagactcaaa aagagattgc tggcattcag tcggcgactt 3481 cacgccagaa tacgaaagac caggtatatg cacaaaatga gatgcttgct tatcaacaga 3541 aggagtctac tgctcgcgtt gcgtctatta tggaaaacac caatctttcc aagcaacagc 3601 aggtttccga gattatgcgc caaatgctta ctcaagctca aacggctggt cagtatttta 3661 ccaatgacca aatcaaagaa atgactcgca aggttagtgc tgaggttgac ttagttcatc 3721 agcaaacgca gaatcagcgg tatggctctt ctcatattgg cgctactgca aaggatattt 3781 ctaatgtcgt cactgatgct gcttctggtg tggttgatat ttttcatggt attgataaag 3841 ctgttgccga tacttggaac aatttctgga aagacggtaa agctgatggt attggctcta 3901 atttgtctag gaaataaccg tcaggattga caccctccca attgtatgtt ttcatgcctc 3961 caaatcttgg aggctttttt atggttcgtt cttattaccc ttctgaatgt cacgctgatt 4021 attttgactt tgagcgtatc gaggctctta aacctgctat tgaggcttgt ggcatttcta 4081 ctctttctca atccccaatg cttggcttcc ataagcagat ggataaccgc atcaagctct 4141 tggaagagat tctgtctttt cgtatgcagg gcgttgagtt cgataatggt gatatgtatg 4201 ttgacggcca taaggctgct tctgacgttc gtgatgagtt tgtatctgtt actgagaagt 4261 taatggatga attggcacaa 
tgctacaatg tgctccccca acttgatatt aataacacta 4321 tagaccaccg ccccgaaggg gacgaaaaat ggtttttaga gaacgagaag acggttacgc 4381 agttttgccg caagctggct gctgaacgcc ctcttaagga tattcgcgat gagtataatt 4441 accccaaaaa gaaaggtatt aaggatgagt gttcaagatt gctggaggcc tccactatga 4501 aatcgcgtag aggctttgct attcagcgtt tgatgaatgc aatgcgacag gctcatgctg 4561 atggttggtt tatcgttttt gacactctca cgttggctga cgaccgatta gaggcgtttt 4621 atgataatcc caatgctttg cgtgactatt ttcgtgatat tggtcgtatg gttcttgctg 4681 ccgagggtcg caaggctaat gattcacacg ccgactgcta tcagtatttt tgtgtgcctg 4741 agtatggtac agctaatggc cgtcttcatt tccatgcggt gcactttatg cggacacttc 4801 ctacaggtag cgttgaccct aattttggtc gtcgggtacg caatcgccgc cagttaaata 4861 gcttgcaaaa tacgtggcct tatggttaca gtatgcccat cgcagttcgc tacacgcagg 4921 acgctttttc acgttctggt tggttgtggc ctgttgatgc taaaggtgag ccgcttaaag 4981 ctaccagtta tatggctgtt ggtttctatg tggctaaata cgttaacaaa aagtcagata 5041 tggaccttgc tgctaaaggt ctaggagcta aagaatggaa caactcacta aaaaccaagc 5101 tgtcgctact tcccaagaag ctgttcagaa tcagaatgag ccgcaacttc gggatgaaaa 5161 tgctcacaat gacaaatctg tccacggagt gcttaatcca acttaccaag ctgggttacg 5221 acgcgacgcc gttcaaccag atattgaagc agaacgcaaa aagagagatg agattgaggc 5281 tgggaaaagt tactgtagcc gacgttttgg cggcgcaacc tgtgacgaca aatctgctca 5341 aatttatgcg cgcttcgata aaaatgattg gcgtatccaa cctgca // test.txt100644000766000024 107514006427353 16117 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>Test AGCTTTTCATTCTGACTGCAACGGGCAATATGTCTCTGTGTGGATTAAAAAAAGAGTGTCTGATAGCAGC TTCTGAACTGGTTACCTGCCGTGAGTAAATTAAAATTTTATTGACTTAGGTCACTAAATACTTTAACCAA TATAGGCATAGCGCACAGACAGATAAAAATTACAGAGTACACAACATCCATGAAACGCATTAGCACCACC ATTACCACCACCATCACCATTACCACAGGTAACGGTGCGGGCTGACGCGTACAGGAAACACAGAAAAAAG CCCGCACCTGACAGTGCGGGCTTTTTTTTTCGACCAAAGGTAACGAGGTAACAACCATGCGAGTGTTGAA GTTCGGCGGTACATCAGTGGCAAATGCAGAACGTTTTCTGCGTGTTGCCGATATTCTGGAAAGCAATGCC AGGCAGGGGCAGGTGGCCACCGTCCTCTCTGCCCCCGCCAAAATCACCAACCACCTGGTGGCGATGATTG 
AAAAAACCATTAGCGGCCAGGATGCTTTACCCAATATCAGCGATGCCGAACGTATTTTTGCCGAACTTTTBLOSUM50100644000766000024 411214006427353 15463 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data# Matrix made by matblas from blosum50.iij # * column uses minimum score # BLOSUM Clustered Scoring Matrix in 1/3 Bit Units # Blocks Database = /data/blocks_5.0/blocks.dat # Cluster Percentage: >= 50 # Entropy = 0.4808, Expected = -0.3573 A R N D C Q E G H I L K M F P S T W Y V B Z X * A 5 -2 -1 -2 -1 -1 -1 0 -2 -1 -2 -1 -1 -3 -1 1 0 -3 -2 0 -2 -1 -1 -5 R -2 7 -1 -2 -4 1 0 -3 0 -4 -3 3 -2 -3 -3 -1 -1 -3 -1 -3 -1 0 -1 -5 N -1 -1 7 2 -2 0 0 0 1 -3 -4 0 -2 -4 -2 1 0 -4 -2 -3 4 0 -1 -5 D -2 -2 2 8 -4 0 2 -1 -1 -4 -4 -1 -4 -5 -1 0 -1 -5 -3 -4 5 1 -1 -5 C -1 -4 -2 -4 13 -3 -3 -3 -3 -2 -2 -3 -2 -2 -4 -1 -1 -5 -3 -1 -3 -3 -2 -5 Q -1 1 0 0 -3 7 2 -2 1 -3 -2 2 0 -4 -1 0 -1 -1 -1 -3 0 4 -1 -5 E -1 0 0 2 -3 2 6 -3 0 -4 -3 1 -2 -3 -1 -1 -1 -3 -2 -3 1 5 -1 -5 G 0 -3 0 -1 -3 -2 -3 8 -2 -4 -4 -2 -3 -4 -2 0 -2 -3 -3 -4 -1 -2 -2 -5 H -2 0 1 -1 -3 1 0 -2 10 -4 -3 0 -1 -1 -2 -1 -2 -3 2 -4 0 0 -1 -5 I -1 -4 -3 -4 -2 -3 -4 -4 -4 5 2 -3 2 0 -3 -3 -1 -3 -1 4 -4 -3 -1 -5 L -2 -3 -4 -4 -2 -2 -3 -4 -3 2 5 -3 3 1 -4 -3 -1 -2 -1 1 -4 -3 -1 -5 K -1 3 0 -1 -3 2 1 -2 0 -3 -3 6 -2 -4 -1 0 -1 -3 -2 -3 0 1 -1 -5 M -1 -2 -2 -4 -2 0 -2 -3 -1 2 3 -2 7 0 -3 -2 -1 -1 0 1 -3 -1 -1 -5 F -3 -3 -4 -5 -2 -4 -3 -4 -1 0 1 -4 0 8 -4 -3 -2 1 4 -1 -4 -4 -2 -5 P -1 -3 -2 -1 -4 -1 -1 -2 -2 -3 -4 -1 -3 -4 10 -1 -1 -4 -3 -3 -2 -1 -2 -5 S 1 -1 1 0 -1 0 -1 0 -1 -3 -3 0 -2 -3 -1 5 2 -4 -2 -2 0 0 -1 -5 T 0 -1 0 -1 -1 -1 -1 -2 -2 -1 -1 -1 -1 -2 -1 2 5 -3 -2 0 0 -1 0 -5 W -3 -3 -4 -5 -5 -1 -3 -3 -3 -3 -2 -3 -1 1 -4 -4 -3 15 2 -3 -5 -2 -3 -5 Y -2 -1 -2 -3 -3 -1 -2 -3 2 -1 -1 -2 0 4 -3 -2 -2 2 8 -1 -3 -2 -1 -5 V 0 -3 -3 -4 -1 -3 -3 -4 -4 4 1 -3 1 -1 -3 -2 0 -3 -1 5 -4 -3 -1 -5 B -2 -1 4 5 -3 0 1 -1 0 -4 -4 0 -3 -4 -2 0 0 -5 -3 -4 5 2 -1 -5 Z -1 0 0 1 -3 4 5 -2 0 -3 -3 1 -1 -4 -1 0 -1 -2 -2 -3 2 5 -1 -5 X -1 -1 -1 -1 -2 -1 -1 -2 -1 -1 -1 -1 -1 -2 -2 -1 0 -3 
-1 -1 -1 -1 -1 -5 * -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 -5 1 test.pir100644000766000024 144214006427353 16070 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>P1;roa1_drome Rea guano receptor type III >> 0.1 MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVVVMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVKKLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQKQHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY* >P1;roa2_drome Rea guano ligand MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVVVMKDPTSTSTSTSTSTSTSTSTMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVKKLFVGALKDDHDEQSIRDYFQHLLLLLLLDLLLLDLLLLDLLLFVEFDDYDPVDKVVLQKQHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY* crab.njb100644000766000024 217214006427353 16000 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data 13 sequences 1000 bootstraping 1 A-salina 2 C-vittat 3 C-sp. 
4 L-aequit 5 P-camtsc 6 E-tenuim 7 L-splend 8 P-bernha 9 P-acadia 10 P-p(NE) 11 P-p(GU) 12 P-l(NE) 13 P-l(GU) 14 and 2 0.098857 1000 14 and 3 0.127932 1000 15 and 1 0.197471 1000 15 and 14 0.029273 874 16 and 10 0.011732 1000 16 and 11 0.004529 1000 17 and 12 0.002258 1000 17 and 13 0.000428 1000 18 and 16 0.017512 1000 18 and 17 0.010824 998 19 and 4 0.006534 1000 19 and 5 0.006992 1000 20 and 15 0.070461 1000 20 and 18 0.030579 998 21 and 8 0.003339 1000 21 and 9 0.002042 1000 22 and 6 0.011142 1000 22 and 21 0.010693 983 23 and 20 0.020714 996 23 and 19 0.020350 1000 24 and 23 0.008665 826 24 and 22 0.013457 972 24 and 7 0.025598 1000 JC distance was used Number of nucleotide sites compared 373 (nsite=421) seed=27165 ninap=0 SeqI.pm100644000766000024 1466014006427353 15725 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio# # BioPerl module for Bio::SeqI # # Please direct questions and support issues to # # Cared for by Ewan Birney # # Copyright Ewan Birney # # You may distribute this module under the same terms as perl itself # POD documentation - main docs before the code =head1 NAME Bio::SeqI - [Developers] Abstract Interface of Sequence (with features) =head1 SYNOPSIS # Bio::SeqI is the interface class for sequences. # If you are a newcomer to bioperl, you should # start with Bio::Seq documentation. This # documentation is mainly for developers using # Bioperl. # Bio::SeqI implements Bio::PrimarySeqI $seq = $seqobj->seq(); # actual sequence as a string $seqstr = $seqobj->subseq(10,50); # Bio::SeqI has annotationcollections $ann = $seqobj->annotation(); # annotation object # Bio::SeqI has sequence features # features must implement Bio::SeqFeatureI @features = $seqobj->get_SeqFeatures(); # just top level @features = $seqobj->get_all_SeqFeatures(); # descend into sub features =head1 DESCRIPTION Bio::SeqI is the abstract interface of annotated Sequences. These methods are those which you can be guaranteed to get for any Bio::SeqI. 
For most users of the package the documentation (and methods) in this
class are not at all useful - this is a developers-only class which
defines what methods have to be implemented by other Perl objects to
comply with the Bio::SeqI interface. Go "perldoc Bio::Seq" or "man
Bio::Seq" for more information.

There aren't many methods here, because too many complicated functions
here would prevent implementations which are just wrappers around a
database or similar delayed mechanisms. Most of the clever stuff happens
inside the SeqFeatureI system.

A good reference implementation is Bio::Seq, which is a pure Perl
implementation of this class with a lot of extra pieces for extra
manipulation. However, if you want to be able to use any sequence
object in your analysis and you can do it using just these methods,
then you know you will be future proof and compatible with other
implementations of Seq.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list,
bioperl-l@bioperl.org, rather than to the module maintainer directly.
Many experienced and responsive experts will be able to look at the
problem and quickly address it. Please include a thorough description
of the problem with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Ewan Birney

Email birney@ebi.ac.uk

=head1 APPENDIX

The rest of the documentation details each of the object methods.
Internal methods are usually preceded with a _ =cut #' # Let the code begin... package Bio::SeqI; $Bio::SeqI::VERSION = '1.7.8'; use strict; # Object preamble - inherits from Bio::PrimarySeqI use base qw(Bio::PrimarySeqI Bio::AnnotatableI Bio::FeatureHolderI); =head2 get_SeqFeatures Title : get_SeqFeatures Usage : my @feats = $seq->get_SeqFeatures(); Function: retrieve just the toplevel sequence features attached to this seq Returns : array of Bio::SeqFeatureI objects Args : none This method comes through extension of Bio::FeatureHolderI. See L and L for more information. =head2 get_all_SeqFeatures Title : get_all_SeqFeatures Usage : my @feats = $seq->get_all_SeqFeatures(); Function: returns all SeqFeatures, including sub SeqFeatures Returns : an array of Bio::SeqFeatureI objects Args : none This method comes through extension of Bio::FeatureHolderI. See L and L for more information. =head2 feature_count Title : feature_count Usage : my $count = $seq->feature_count(); Function: Return the number of SeqFeatures attached to a sequence Returns : integer representing the number of SeqFeatures Args : none This method comes through extension of Bio::FeatureHolderI. See L for more information. 
=head2 seq Title : seq Usage : my $string = $seq->seq(); Function: Retrieves the sequence string for the sequence object Returns : string Args : none =cut sub seq { my ($self) = @_; $self->throw_not_implemented(); } =head2 write_GFF Title : write_GFF Usage : $seq->write_GFF(\*FILEHANDLE); Function: Convenience method to write out all the sequence features in GFF format to the provided filehandle (STDOUT by default) Returns : none Args : [optional] filehandle to write to (default is STDOUT) =cut sub write_GFF { my ($self,$fh) = @_; $fh || do { $fh = \*STDOUT; }; foreach my $sf ( $self->get_all_SeqFeatures() ) { print $fh $sf->gff_string, "\n"; } } =head2 annotation Title : annotation Usage : my $ann = $seq->annotation($seq_obj); Function: retrieve the attached annotation object Returns : Bio::AnnotationCollectionI or none; See L and L for more information. This method comes through extension from L. =head2 species Title : species Usage : Function: Gets or sets the species Example : my $species = $seq->species(); Returns : Bio::Species object Args : Bio::Species object or none; See L for more information =cut sub species { my ($self) = @_; $self->throw_not_implemented(); } =head2 primary_seq Title : primary_seq Usage : my $primaryseq = $seq->primary_seq($newval) Function: Retrieve the underlying Bio::PrimarySeqI object if available. This is in the event one has a sequence with lots of features but want to be able to narrow the object to just one with the basics of a sequence (no features or annotations). 
Returns : Bio::PrimarySeqI Args : Bio::PrimarySeqI or none; See L for more information =cut sub primary_seq { my ($self) = @_; $self->throw_not_implemented; } 1; DasI.pm100644000766000024 3276714006427353 15714 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio# # BioPerl module for Bio::DasI # # Please direct questions and support issues to # # Cared for by Lincoln Stein # # Copyright Lincoln Stein # # You may distribute this module under the same terms as perl itself # POD documentation - main docs before the code =head1 NAME Bio::DasI - DAS-style access to a feature database =head1 SYNOPSIS # Open up a feature database somehow... $db = Bio::DasI->new(@args); @segments = $db->segment(-name => 'NT_29921.4', -start => 1, -end => 1000000); # segments are Bio::Das::SegmentI - compliant objects # fetch a list of features @features = $db->features(-type=>['type1','type2','type3']); # invoke a callback over features $db->features(-type=>['type1','type2','type3'], -callback => sub { ... } ); $stream = $db->get_seq_stream(-type=>['type1','type2','type3']); while (my $feature = $stream->next_seq) { # each feature is a Bio::SeqFeatureI-compliant object } # get all feature types @types = $db->types; # count types %types = $db->types(-enumerate=>1); @feature = $db->get_feature_by_name($class=>$name); @feature = $db->get_feature_by_target($target_name); @feature = $db->get_feature_by_attribute($att1=>$value1,$att2=>$value2); $feature = $db->get_feature_by_id($id); $error = $db->error; =head1 DESCRIPTION Bio::DasI is a simplified alternative interface to sequence annotation databases used by the distributed annotation system (see L). In this scheme, the genome is represented as a series of features, a subset of which are named. Named features can be used as reference points for retrieving "segments" (see L), and these can, in turn, be used as the basis for exploring the genome further. 
In addition to a name, each feature has a "class", which is
essentially a namespace qualifier, and a "type", which describes what
type of feature it is. Das uses the GO consortium's ontology of
feature types, and so the type is actually an object of class
Bio::Das::FeatureTypeI (see L<Bio::Das::FeatureTypeI>). Bio::DasI
provides methods for interrogating the database for the types it
contains and the counts of each type.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org

=head2 Support

Please direct usage questions or support issues to the mailing list,
bioperl-l@bioperl.org, rather than to the module maintainer directly.
Many experienced and responsive experts will be able to look at the
problem and quickly address it. Please include a thorough description
of the problem with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Lincoln Stein

Email lstein@cshl.org

=head1 APPENDIX

The rest of the documentation details each of the object
methods. Internal methods are usually preceded with a _

=cut

#'
# Let the code begin...

package Bio::DasI;
$Bio::DasI::VERSION = '1.7.8';
use strict;

use Bio::Das::SegmentI;
# Object preamble - inherits from Bio::Root::RootI and Bio::SeqFeature::CollectionI
use base qw(Bio::Root::RootI Bio::SeqFeature::CollectionI);

=head2 new

 Title   : new
 Usage   : Bio::DasI->new(@args)
 Function: Create new Bio::DasI object
 Returns : a Bio::DasI object
 Args    : see below

The new() method creates a new object.
The argument list is either a single argument consisting of a
connection string, or the following list of -name=E<gt>value
arguments:

   Argument        Description
   --------        -----------

   -dsn            Connection string for database

   -adaptor        Name of an adaptor class to use when connecting

   -aggregator     Array ref containing list of aggregators
                   "semantic mappers" to apply to database

   -user           Authentication username

   -pass           Authentication password

Implementors of DasI may add other arguments.

=cut

sub new {shift->throw_not_implemented}

=head2 types

 Title   : types
 Usage   : $db->types(@args)
 Function: return list of feature types in database
 Returns : a list of Bio::Das::FeatureTypeI objects
 Args    : see below

This routine returns a list of feature types known to the database. It
is also possible to find out how many times each feature occurs.

Arguments are -option=E<gt>value pairs as follows:

  -enumerate  if true, count the features

The returned value will be a list of Bio::Das::FeatureTypeI objects
(see L<Bio::Das::FeatureTypeI>).

If -enumerate is true, then the function returns a hash (not a hash
reference) in which the keys are the stringified versions of
Bio::Das::FeatureTypeI and the values are the number of times each
feature appears in the database.

=cut

sub types {  shift->throw_not_implemented; }

=head2 parse_types

 Title   : parse_types
 Usage   : $db->parse_types(@args)
 Function: parses list of types
 Returns : an array ref containing ['method','source'] pairs
 Args    : a list of types in 'method:source' form
 Status  : internal

This method takes an array of type names in the format "method:source"
and returns an array reference of ['method','source'] pairs. It will
also accept a single argument consisting of an array reference with
the list of type names.

=cut

# turn feature types in the format "method:source" into a list of [method,source] refs
sub parse_types {
  my $self  = shift;
  return []   if !@_ or !defined($_[0]);
  return $_[0] if ref $_[0] eq 'ARRAY' && ref $_[0][0];
  my @types = ref($_[0]) ?
@{$_[0]} : @_; my @type_list = map { [split(':',$_,2)] } @types; return \@type_list; } =head2 segment Title : segment Usage : $db->segment(@args); Function: create a segment object Returns : segment object(s) Args : see below This method generates a Bio::Das::SegmentI object (see L). The segment can be used to find overlapping features and the raw sequence. When making the segment() call, you specify the ID of a sequence landmark (e.g. an accession number, a clone or contig), and a positional range relative to the landmark. If no range is specified, then the entire region spanned by the landmark is used to generate the segment. Arguments are -option=Evalue pairs as follows: -name ID of the landmark sequence. -class A namespace qualifier. It is not necessary for the database to honor namespace qualifiers, but if it does, this is where the qualifier is indicated. -version Version number of the landmark. It is not necessary for the database to honor versions, but if it does, this is where the version is indicated. -start Start of the segment relative to landmark. Positions follow standard 1-based sequence rules. If not specified, defaults to the beginning of the landmark. -end End of the segment relative to the landmark. If not specified, defaults to the end of the landmark. The return value is a list of Bio::Das::SegmentI objects. If the method is called in a scalar context and there are no more than one segments that satisfy the request, then it is allowed to return the segment. Otherwise, the method must throw a "multiple segment exception". =cut #' sub segment { shift->throw_not_implemented } =head2 features Title : features Usage : $db->features(@args) Function: get all features, possibly filtered by type Returns : a list of Bio::SeqFeatureI objects Args : see below Status : public This routine will retrieve features in the database regardless of position. 
It can be used to return all features, or a subset based on their type Arguments are -option=Evalue pairs as follows: -types List of feature types to return. Argument is an array of Bio::Das::FeatureTypeI objects or a set of strings that can be converted into FeatureTypeI objects. -callback A callback to invoke on each feature. The subroutine will be passed each Bio::SeqFeatureI object in turn. -attributes A hash reference containing attributes to match. The -attributes argument is a hashref containing one or more attributes to match against: -attributes => { Gene => 'abc-1', Note => 'confirmed' } Attribute matching is simple exact string matching, and multiple attributes are ANDed together. See L for a more sophisticated take on this. If one provides a callback, it will be invoked on each feature in turn. If the callback returns a false value, iteration will be interrupted. When a callback is provided, the method returns undef. =cut sub features { shift->throw_not_implemented } =head2 get_feature_by_name Title : get_feature_by_name Usage : $db->get_feature_by_name(-class=>$class,-name=>$name) Function: fetch features by their name Returns : a list of Bio::SeqFeatureI objects Args : the class and name of the desired feature Status : public This method can be used to fetch named feature(s) from the database. The -class and -name arguments have the same meaning as in segment(), and the method also accepts the following short-cut forms: 1) one argument: the argument is treated as the feature name 2) two arguments: the arguments are treated as the class and name (note: this uses _rearrange() so the first argument must not begin with a hyphen or it will be interpreted as a named argument). This method may return zero, one, or several Bio::SeqFeatureI objects. The implementor may allow the name to contain wildcards, in which case standard C-shell glob semantics are expected. 
=cut sub get_feature_by_name { shift->throw_not_implemented(); } =head2 get_feature_by_target Title : get_feature_by_target Usage : $db->get_feature_by_target($class => $name) Function: fetch features by their similarity target Returns : a list of Bio::SeqFeatureI objects Args : the class and name of the desired feature Status : public This method can be used to fetch a named feature from the database based on its similarity hit. The arguments are the same as get_feature_by_name(). If this is not implemented, the interface defaults to using get_feature_by_name(). =cut sub get_feature_by_target { shift->get_feature_by_name(@_); } =head2 get_feature_by_id Title : get_feature_by_id Usage : $db->get_feature_by_target($id) Function: fetch a feature by its ID Returns : a Bio::SeqFeatureI objects Args : the ID of the feature Status : public If the database provides unique feature IDs, this can be used to retrieve a single feature from the database. If not overridden, this interface calls get_feature_by_name() and returns the first element. =cut sub get_feature_by_id { (shift->get_feature_by_name(@_))[0]; } =head2 get_feature_by_attribute Title : get_feature_by_attribute Usage : $db->get_feature_by_attribute(attribute1=>value1,attribute2=>value2) Function: fetch features by combinations of attribute values Returns : a list of Bio::SeqFeatureI objects Args : the class and name of the desired feature Status : public This method can be used to fetch a set of features from the database. Attributes are a list of name=Evalue pairs. They will be logically ANDed together. If an attribute value is an array reference, the list of values in the array is treated as an alternative set of values to be ORed together. 
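
The AND/OR semantics described above can be illustrated without a
database. The following is only a sketch: matches_attributes() is a
hypothetical helper that is not part of Bio::DasI, and features are
modelled as plain hash references with one value per attribute rather
than as Bio::SeqFeatureI objects.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Bio::DasI): true if the attribute hash
# of one feature satisfies every name => value pair that was asked for.
sub matches_attributes {
    my ($feature_attrs, %wanted) = @_;
    for my $name (keys %wanted) {
        my $have = $feature_attrs->{$name};
        return 0 unless defined $have;
        # An array reference lists alternative values (ORed together);
        # a plain scalar is a single required value.
        my @alternatives = ref($wanted{$name}) eq 'ARRAY'
                         ? @{ $wanted{$name} }
                         : ($wanted{$name});
        return 0 unless grep { $_ eq $have } @alternatives;
    }
    return 1;    # every requested attribute matched (ANDed together)
}

# Toy data standing in for features stored in a database.
my @features = (
    { Gene => 'abc-1', Note => 'confirmed' },
    { Gene => 'abc-2', Note => 'predicted' },
    { Gene => 'xyz-9', Note => 'confirmed' },
);

# Gene is 'abc-1' OR 'abc-2', AND Note is 'confirmed'.
my @hits = grep {
    matches_attributes($_, Gene => ['abc-1', 'abc-2'], Note => 'confirmed')
} @features;

print scalar(@hits), " feature(s) matched\n";
```

Only the first toy feature passes both conditions. A real implementation
would most likely push this filtering into the database query instead of
looping over features in Perl.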
=cut sub get_feature_by_attribute { shift->throw_not_implemented(); } =head2 search_notes Title : search_notes Usage : $db->search_notes($search_term,$max_results) Function: full-text search on features, ENSEMBL-style Returns : an array of [$name,$description,$score] Args : see below Status : public This routine performs a full-text search on feature attributes (which attributes depend on implementation) and returns a list of [$name,$description,$score], where $name is the feature ID, $description is a human-readable description such as a locus line, and $score is the match strength. Since this is a decidedly non-standard thing to do (but the generic genome browser uses it), the default method returns an empty list. You do not have to implement it. =cut sub search_notes { return } =head2 get_seq_stream Title : get_seq_stream Usage : $seqio = $db->get_seq_stream(@args) Function: Performs a query and returns an iterator over it Returns : a Bio::SeqIO stream capable of returning Bio::SeqFeatureI objects Args : As in features() Status : public This routine takes the same arguments as features(), but returns a Bio::SeqIO::Stream-compliant object. Use it like this: $stream = $db->get_seq_stream('exon'); while (my $exon = $stream->next_seq) { print $exon,"\n"; } NOTE: In the interface this method is aliased to get_feature_stream(), as the name is more descriptive. =cut sub get_seq_stream { shift->throw_not_implemented } sub get_feature_stream {shift->get_seq_stream(@_) } =head2 refclass Title : refclass Usage : $class = $db->refclass Function: returns the default class to use for segment() calls Returns : a string Args : none Status : public For data sources which use namespaces to distinguish reference sequence accessions, this returns the default namespace (or "class") to use. This interface defines a default of "Accession". 
=cut

sub refclass { "Accession" }

1;
bp_split_seq100644000766000024 1015014006427353 16414 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
use strict;
use warnings;

=head1 NAME

bp_split_seq - splits a sequence into equal sized chunks with an optional
               overlapping range

=head1 SYNOPSIS

bp_split_seq -c 10000 [-o 1000] [-i] -f seq.in

=head1 DESCRIPTION

The script will split sequences into chunks.

Mandatory Options:

  -c  Desired length of the resulting sequences.
  -f  Input file (must be FASTA format).

Special Options:

  -o  Overlapping range between the resulting sequences.
  -i  Create an index file with the resulting sequence files. This is
      useful if you want to pass this list as input arguments to
      other programs (e.g. CLUSTAL, HMMER).

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list.  Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHORS

Ewan Birney E<lt>birney-at-ebi.ac.ukE<gt>

Mauricio Herrera Cuadra E<lt>mauricio at intelligenomes.comE<gt> (some enhancements)

=cut

# Modules, pragmas and variables to use
use Bio::Seq;
use Bio::SeqIO;
use Getopt::Long;
use vars qw($opt_c $opt_o $opt_i $opt_f $index_file);

# Gets options from the command line
GetOptions qw(-c=i -o:i -i -f=s);

# If no mandatory options are given prints an error and exits
if (!$opt_c) {
    print "ERROR: No chunk size has been specified.\n" and exit();
} elsif (!$opt_f) {
    print "ERROR: No FASTA file has been specified.\n" and exit();
}

# Declares offset size
my $offset = $opt_o ?
$opt_o : "0"; # Opens the FASTA file my $in = Bio::SeqIO->new( -file => "$opt_f", -format => "Fasta", ); print "==> Opening FASTA file:\t\t\t\t$opt_f\n"; # Reads the next sequence object while (my $seq = $in->next_seq()) { # Reads the ID for the sequence and prints it my $id = $seq->id(); print "--> The ID for this sequence is:\t\t$id\n"; # Reads the description for the sequence and prints it my $desc = $seq->desc(); print "--> The description for this sequence is:\t$desc\n"; # Gets sequence length and prints it my $seq_length = $seq->length(); print "--> The length of this sequence is:\t\t$seq_length\n"; # If the chunk size is bigger than the sequence length prints the error and exits (print "ERROR: Specified chunk size is bigger than sequence length.\n" and exit()) if ($opt_c > $seq_length); # Creates a directory for writing the resulting files mkdir("split", 0755) unless -e "split" and -d "split"; # Creates the INDEX file if the option was given my $FH; if ($opt_i) { $index_file = "$id.c$opt_c.o$offset.INDEX"; open $FH, '>', $index_file or die "Could not write file '$index_file': $!\n"; } # Loops through the sequence for (my $i = 1; $i < $seq_length; $i += $opt_c) { my $end = (($i + $opt_c) > $seq_length) ? ($seq_length + 1) : ($i + $opt_c); my $seq_range = (($i + $opt_c) > $seq_length) ? 
"$i-".($end - 1) : "$i-$end"; my $id = $seq->id(); $id .= "_$seq_range"; # Stores chunk into its corresponding FASTA file my $out = Bio::SeqIO->new( -file => ">split/$id.faa", -format => "Fasta", ); my $trunc_seq = $seq->trunc($i, $end - 1); $trunc_seq->id($id); $out->write_seq($trunc_seq); print "==> Sequence chunk:\t$seq_range\tstored in file:\tsplit/$id.faa\n"; # Prints the current file name into the INDEX file if the option was given print $FH "split/$id.faa\n" if $opt_i; # Decreases the $i value with the offset value $i -= $offset; } # Closes the INDEX file if the option was given if ($opt_i) { print "==> INDEX stored in file:\t\t\t$index_file\n"; close $FH; } } __END__ bp_sreformat100644000766000024 1326714006427353 16427 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl # Author: Jason Stajich # Purpose: Bioperl implementation of Sean Eddy's sreformat # We're not as clever as Sean's squid library though so # you have to specify the input format rather than letting # the application guess. use strict; use warnings; use Bio::SeqIO; use Bio::AlignIO; use Getopt::Long; my $USAGE = "bp_sreformat -if INFORMAT -of OUTFORMAT -i FILENAME -o output.FORMAT -h/--help Print this help -if/--informat Specify the input format -of/--outformat Specify the output format -i/--input Specify the input file name (to pass in data on STDIN use minus sign as filename) -o/--output Specify the output file name (to pass data out on STDOUT use minus sign as filename) --msa Specify this is multiple sequence alignment data --special=specialparams Specify special params supported by some formats Comma or space separated please. 
These include: nointerleaved -- for phylip,non-interleaved format idlinebreak -- for phylip, makes it molphy format percentages -- for clustalw, show % id per line flat -- don't show start-end in seqid linelength -- line length for clustalw mrbayes -- for MrBayes proper NEXUS output "; my ($input,$output,$informat,$outformat,$msa,$special); GetOptions( 'h|help' => sub { print STDERR ($USAGE); exit(0) }, 'i|input:s' => \$input, 'o|output:s' => \$output, 'if|informat:s' => \$informat, 'of|outformat:s' => \$outformat, 'msa' => \$msa, 's|special:s' => \$special, ); unless( defined $informat && defined $outformat ) { die(sprintf("Cannot proceed without a defined input and output you gave (%s,%s)\n", defined $informat ? $informat : "''" , defined $outformat ? $outformat : "''")); } my ($in,$out); my @extra; if( $special ) { @extra = map { my @rc; if( /nointerleaved/) { @rc = ('-interleaved' => '0'); } elsif( /mrbayes/ ) { @rc = ('-show_symbols' => 0, '-show_endblock' => 0); } elsif( /(\S+)\=(\S+)/ ) { @rc = ( "-$1" => $2) } else{ @rc = ("-$_" => 1) } @rc; } split(/[\s,]/,$special); } # guess we're talking about MSA if any of the standard MSA names are used if( $informat =~ /nexus|phylip|clustal|maf|stockholm|bl2seq|msf/ || $outformat =~ /nexus|phylip|clustal|maf|stockholm|bl2seq|msf/ ) { $msa = 1; } if( $msa ) { eval { if( defined $input ) { $in = new Bio::AlignIO(-format => $informat, -file => $input); } else { $in = new Bio::AlignIO(-format => $informat, -fh => \*ARGV); } }; if( $@ ) { die("Unknown MSA format to bioperl $informat\n"); } eval { if( $output ) { $out = new Bio::AlignIO(-format => $outformat, -file => ">$output", @extra); } else { # default to STDOUT for output $out = new Bio::AlignIO(-format => $outformat,@extra); } }; if( $@ ) { die("Unknown MSA format to bioperl $outformat\n"); } while( my $aln = $in->next_aln) { if( $special =~ /flat/ ) {$aln->set_displayname_flat(1); } $out->write_aln($aln) } } else { eval { if( defined $input ) { $in = new 
            Bio::SeqIO(-format => $informat, -file => $input);
        } else {
            $in = new Bio::SeqIO(-format => $informat, -fh => \*ARGV);
        }
    };
    if( $@ ) {
        if( $@ =~ /Could not open/ ) {
            die("Could not open input file: $input\n");
        } else {
            die("Unknown sequence format to bioperl $informat\n");
        }
    }
    eval {
        if( $output ) {
            $out = new Bio::SeqIO(-format => $outformat, -file => ">$output");
        } else {
            # default to STDOUT for output
            $out = new Bio::SeqIO(-format => $outformat);
        }
    };
    if( $@ ) {
        if( $@ =~ /Could not open/ ) {
            die("Could not open output file: $output\n");
        } else {
            die("Unknown sequence format to bioperl $outformat: $@\n");
        }
    }
    while( my $seq = $in->next_seq ) {
        $out->write_seq($seq);
    }
}

=head1 NAME

bp_sreformat - convert sequence formats

=head1 DESCRIPTION

This script uses the SeqIO system (and AlignIO for alignments) to
convert between formats, for either plain sequence data or multiple
sequence alignment data. The name comes from Sean Eddy's program
sreformat (part of the HMMER package), which already does this. Sean's
program tries to guess the input format, whereas this script currently
requires you to specify the input and output formats and whether the
data is a multiple sequence alignment or plain sequence files.

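The --special option takes a comma- or space-separated list that the script turns into extra constructor arguments for the output object. A minimal standalone sketch of that mapping follows. It mirrors the branches in the script above, but the function name special_to_args is hypothetical, and unlike the script it collapses runs of separators so that no empty tokens are produced.

```perl
use strict;
use warnings;

# Hypothetical helper: convert a --special string into a flat list of
# "-name => value" arguments, following the same rules as the script.
sub special_to_args {
    my ($special) = @_;
    return map {
        my @rc;
        if    (/nointerleaved/) { @rc = ('-interleaved' => 0) }
        elsif (/mrbayes/)       { @rc = ('-show_symbols' => 0,
                                         '-show_endblock' => 0) }
        elsif (/(\S+)=(\S+)/)   { @rc = ("-$1" => $2) }   # name=value form
        else                    { @rc = ("-$_" => 1) }    # bare flag
        @rc;
    } grep { length } split /[\s,]+/, $special;
}

my @args = special_to_args('nointerleaved,linelength=60 percentages');
print join(' ', @args), "\n";
# prints: -interleaved 0 -linelength 60 -percentages 1
```

Bare words become boolean flags, name=value pairs are passed through, and the two named shortcuts expand to the writer options that the script sets for them.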
Usage:

 bp_sreformat -if INFORMAT -of OUTFORMAT -i FILENAME -o output.FORMAT

 -h/--help        Print this help
 -if/--informat   Specify the input format
 -of/--outformat  Specify the output format
 -i/--input       Specify the input file name
                  (to pass in data on STDIN use minus sign as filename)
 -o/--output      Specify the output file name
                  (to pass data out on STDOUT use minus sign as filename)
 --msa            Specify this is multiple sequence alignment data
 --special        Will pass on special parameters to the AlignIO/SeqIO
                  object -- most of these are for Bio::AlignIO objects
                  Comma separated list of the following
                  nointerleaved -- for phylip,non-interleaved format
                  idlinebreak   -- for phylip, makes it molphy format
                  percentages   -- for clustalw, show % id per line

=cut
bp_biogetseq100644000766000024 274414006427353 16365 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
#
# OBDA Registry compliant sequence retrieval script
#
# Copyright Heikki Lehvaslaiho
# You may distribute this program under the same terms as perl itself

use Bio::DB::Registry;
use Bio::SeqIO;
use Getopt::Long;
use strict;
use warnings;

my ($help, $format, $namespace, $dbname) = ('', 'embl', 'acc', 'embl');
GetOptions ("help"        => \$help,
            "format=s"    => \$format,
            "namespace=s" => \$namespace,
            "dbname=s"    => \$dbname
           );

if ($help || !@ARGV) {
    system("perldoc $0");
    exit 0;
}

my $get_function = 'get_Seq_by_'. $namespace;

my $registry = new Bio::DB::Registry();
while (my $id = shift) {
    my $db = $registry->get_database($dbname);
    my $seq = $db->$get_function($id);

    if ($seq) {
        my $out = new Bio::SeqIO('-format' => $format);
        $out->write_seq($seq);
    } else {
        print STDERR "Could not find sequence with identifier [$id]\n";
    }
}

=head1 NAME

bp_biogetseq - sequence retrieval using OBDA registry

=head1 DESCRIPTION

This script retrieves sequences from the source defined by the user's
registry setup. The current alternatives are a local indexed file, an
SQL database, or retrieval over the web.

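Each identifier is looked up by calling a method whose name is built from the chosen namespace ('get_Seq_by_' followed by the namespace). The sketch below shows that dispatch pattern in isolation, with an added check that the method exists before it is called. MockDB and its contents are hypothetical stand-ins for a real registry database, and the check is a suggestion rather than something the script does.

```perl
use strict;
use warnings;

package MockDB;    # hypothetical stand-in for a registry database

sub new { return bless { seqs => { X123 => 'ACGT' } }, shift }

sub get_Seq_by_acc {
    my ($self, $id) = @_;
    return $self->{seqs}{$id};
}

package main;

my $namespace = 'acc';
my $method    = "get_Seq_by_$namespace";
my $db        = MockDB->new;

# Fail early with a clear message if the namespace has no lookup method.
die "unsupported namespace '$namespace'\n" unless $db->can($method);

# Method names held in a string can be called directly on an object.
my $seq = $db->$method('X123');
print((defined $seq ? $seq : 'not found'), "\n");
```

Checking with can() turns a mistyped --namespace into an immediate, readable error instead of a failure on the first lookup.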
=head1 USAGE Usage: bp_biogetseq --dbname embl --format embl --namespace acc id [ ids... ] * dbname defaults to embl * format defaults to embl * namespace defaults to 'acc' ['id', 'acc', 'version'] * unnamed arguments are ids in the given namespace =cut Geneid.t100644000766000024 337214006427353 16150 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 26); use_ok('Bio::Tools::Geneid'); } my $inputfilename = test_input_file('geneid_1.0.out'); my $parser = Bio::Tools::Geneid->new(-file => $inputfilename); my @genes; while (my $gene= $parser->next_prediction) { push(@genes, $gene); } my @transcripts = $genes[0]->transcripts; my @exons = $transcripts[0]->exons; is($transcripts[0]->seq_id, '10'); is($exons[0]->seq_id, '10'); is($transcripts[0]->source_tag, 'geneid'); is($exons[0]->source_tag, 'geneid'); is($transcripts[0]->primary_tag, 'transcript'); is($exons[0]->primary_tag, 'Initial'); is(scalar($transcripts[0]->exons), 2); is($transcripts[0]->start, 6090); is($transcripts[0]->end, 7276); is($transcripts[0]->score, 36.87); is($transcripts[0]->strand, 1); is($exons[0]->start, 6090); is($exons[0]->end, 6155); is($exons[0]->score, '1.40'); is($exons[0]->strand, 1); my ($type) = $exons[0]->get_tag_values('Type'); is($type, 'Initial'); my ($phase) = $exons[0]->get_tag_values('phase'); is($phase, 0); my ($end_phase) = $exons[0]->get_tag_values('end_phase'); is($end_phase, 0); my ($start_signal_score) = $exons[0]->get_tag_values('start_signal_score'); is($start_signal_score, 2.15); my ($end_signal_score) = $exons[0]->get_tag_values('end_signal_score'); is($end_signal_score, 3.63); my ($coding_potential_score) = $exons[0]->get_tag_values('coding_potential_score'); is($coding_potential_score, 12.34); my ($homology_score) = $exons[0]->get_tag_values('homology_score'); is($homology_score, '0.00'); is(scalar(@genes), 3); @transcripts = $genes[1]->transcripts; 
is(scalar($transcripts[0]->exons), 5); @transcripts = $genes[2]->transcripts; is(scalar($transcripts[0]->exons), 1); mbsout.t100644000766000024 3644214006427353 16172 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO#!/usr/bin/perl use version; our $API_VERSION = qv('1.1.3'); use strict; use File::Path qw(mkpath rmtree); use Carp; BEGIN { use Bio::Root::Test; test_begin( -tests => 74, -requires_modules => [q(Bio::SeqIO::mbsout)], -requires_networking => 0 ); use_ok('Bio::SeqIO::mbsout'); } # skip tests if the msout.pm module is too old. cmp_ok( $Bio::SeqIO::mbsout::API_VERSION, '>=', qv('1.1.3'), "Bio::SeqIO::mbsout is at least api version 1.1.3" ); create_dir("mbsout"); test_file_1( 0, "mbsout/mbsout_infile1" ); test_file_2( 0, "mbsout/mbsout_infile2" ); test_file_3( 0, "mbsout/mbsout_infile3" ); sub create_dir { my $dir = shift; $dir = test_input_file($dir); unless ( -d $dir ) { mkpath($dir); } } sub test_file_1 { ############################################################################## ## Test file 1 ############################################################################## my $gzip = shift; my $infile = shift; $infile = test_input_file($infile); #print_file1( $infile, $gzip ); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gunzip -c <$file_sequence |"; } my $mbsout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'mbsout', ); isa_ok( $mbsout, 'Bio::SeqIO::mbsout' ); my $rh_base_conversion_table = $mbsout->get_base_conversion_table; isa_ok( $mbsout, 'Bio::SeqIO::mbsout' ); my %attributes = ( RUNS => 3, SEGSITES => 7, MBS_INFO_LINE => 'command: mbs 6 -t 0.001 -r 0.00025 -s 5000 2500 -f 3 1 traj ', TOT_RUN_HAPS => 6, NEXT_RUN_NUM => 1, LAST_READ_HAP_NUM => 0, POSITIONS => [qw(79.1001 80.1001 81.101 82.101 83.10001 84.801 85)], CURRENT_RUN_SEGSITES => 7, POP_MUT_PARAM_PER_SITE => 0.001, POP_RECOMB_PARAM_PER_SITE => 0.00025, NSITES => 5000, SELPOS => 2500, NFILES => 3, NREPS => 1, TRAJ_FILENAME => 'traj' ); foreach my $attribute ( 
keys %attributes ) { my $func = lc($attribute); if ( $attribute =~ m/POPS|SEEDS|POSITIONS/ ) { $func = ucfirst($func); } $func = 'get_' . $func; my @returns = $mbsout->$func(); my ( $return, $got ); # If there were more than one return value, then compare references to # arrays instead of scalars unless ( @returns > 1 ) { $got = shift @returns; } else { $got = \@returns } my $expected = $attributes{$attribute}; if ( defined $got && defined $expected ) { is_deeply( $got, $expected, "Get $attribute" ); } else { is_deeply( $got, $expected, "Get $attribute" ) } } # Testing next_hap at beginning of run my @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_seq ); my @data_expected = qw(1111111); is_deeply( \@data_got, \@data_expected, "Get next_hap at beginning of run" ); # Testing next_hap after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_seq ); @data_expected = qw(5555555); is_deeply( \@data_got, \@data_expected, "Get next_hap after beginning of run" ); # Testing next_pop after beginning of pop @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_seq ); @data_expected = qw(4444444); is_deeply( \@data_got, \@data_expected, "Get next_pop after beginning of pop" ); # Testing next_hap @data_got = $mbsout->get_next_hap; @data_expected = qw(4444444); is_deeply( \@data_got, \@data_expected, "Get next_hap" ); # Testing next_hap @data_got = $mbsout->get_next_hap; @data_expected = qw(5555555); is_deeply( \@data_got, \@data_expected, "Get next_hap" ); # Testing next_run after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); @data_expected = qw(4444444); is_deeply( \@data_got, \@data_expected, "Get next_run after beginning of run" ); # Testing next_run at beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); @data_expected = qw(5555555 5555555 5555555 1010101 1111111 
1515151); is_deeply( \@data_got, \@data_expected, "Get next_run at beginning of run" ); # Testing next_run at beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); @data_expected = qw( 1414141 1414141 1515151 1414141 1515151 1515151); is_deeply( \@data_got, \@data_expected, "Get next_run at beginning of run" ); is( $mbsout->get_next_run_num, undef, 'have all lines been read?' ); } sub test_file_2 { ############################################################################## ## Test file 2 ############################################################################## my $gzip = shift; my $infile = shift; $infile = test_input_file($infile); #print_file2( $infile, $gzip ); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gunzip -c <$file_sequence |"; } my $mbsout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'mbsout', ); isa_ok( $mbsout, 'Bio::SeqIO::mbsout' ); my %attributes = ( RUNS => 5, SEGSITES => 7, MBS_INFO_LINE => 'command: mbs 6 -t 0.001 -r 0.00025 -s 5000 2500 -f 5 1 traj ', TOT_RUN_HAPS => 6, NEXT_RUN_NUM => 1, LAST_READ_HAP_NUM => 0, POSITIONS => [qw(79.1001 80.1001 81.101 82.101 83.10001 84.801 85)], CURRENT_RUN_SEGSITES => 7, POP_MUT_PARAM_PER_SITE => 0.001, POP_RECOMB_PARAM_PER_SITE => 0.00025, NSITES => 5000, SELPOS => 2500, NFILES => 5, NREPS => 1, TRAJ_FILENAME => 'traj' ); foreach my $attribute ( keys %attributes ) { my $func = lc($attribute); if ( $attribute =~ m/POSITIONS/ ) { $func = ucfirst($func); } elsif ( $attribute =~ m/\_file/ ) { $func = q(infile); } $func = 'get_' . 
$func; my @returns = $mbsout->$func(); my ( $return, $got ); # If there were more than one return value, then compare references to # arrays instead of scalars unless ( @returns > 1 ) { $got = shift @returns; } else { $got = \@returns } my $expected = $attributes{$attribute}; if ( defined $got && defined $expected ) { is_deeply( $got, $expected, "Get $attribute" ); } else { is_deeply( $got, $expected, "Get $attribute" ) } } my $rh_base_conversion_table = $mbsout->get_base_conversion_table; # Testing next_hap at beginning of run my @data_got = $mbsout->get_next_hap; my @data_expected = qw(1111111); is_deeply( \@data_got, \@data_expected, "Get next_hap at beginning of run" ); # Testing next_hap after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_seq ); @data_expected = '5555555'; is_deeply( \@data_got, \@data_expected, "Get next_hap after beginning of run" ); # Surprise test! testing mbsout::outgroup my $outgroup = $mbsout->outgroup; is( $outgroup, 0, "Testing mbsout::outgroup" ); # Testing next_run after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); @data_expected = qw( 4444444 4444444 5555555 4444444); is_deeply( \@data_got, \@data_expected, "Get next_run after beginning of run" ); # Testing next_run after beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); @data_expected = qw( 5555555 5555555 5555555 1010101 1111111 1515151); is_deeply( \@data_got, \@data_expected, "Get next_run after beginning of run" ); # Testing next_run at beginning of run @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); @data_expected = qw( 1414141 1414141 1515151 1414141 1515151 1515151); is_deeply( \@data_got, \@data_expected, "Get next_run at beginning of run" ); # Testing next_hap at beginning of run 2 @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_seq ); 
@data_expected = '1515151'; is_deeply( \@data_got, \@data_expected, "Get next_hap at beginning of run 2" ); # Testing next_run after hap @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); @data_expected = qw( 5050505 5151515 5555555 5454545 5454545); is_deeply( \@data_got, \@data_expected, "Get next_run after hap" ); is( $mbsout->get_next_run_num, 5, 'next run should be 5.' ); } sub test_file_3 { ############################################################################## ## Test file 3 ############################################################################## my $gzip = shift; my $infile = shift; $infile = test_input_file($infile); #print_file3( $infile, $gzip ); my $file_sequence = $infile; if ($gzip) { $file_sequence = "gunzip -c <$file_sequence |"; } my $mbsout = Bio::SeqIO->new( -file => "$file_sequence", -format => 'mbsout', ); isa_ok( $mbsout, 'Bio::SeqIO::mbsout' ); my $rh_base_conversion_table = $mbsout->get_base_conversion_table; my %attributes = ( RUNS => 1, SEGSITES => 7, MBS_INFO_LINE => 'command: mbs 3 -t 0.001 -r 0.00025 -s 5000 2500 -f 1 1 traj ', TOT_RUN_HAPS => 3, NEXT_RUN_NUM => 1, LAST_READ_HAP_NUM => 0, POSITIONS => [qw(79.1001 80.1001 81.101 82.101 83.10001 84.801 85)], CURRENT_RUN_SEGSITES => 7, POP_MUT_PARAM_PER_SITE => 0.001, POP_RECOMB_PARAM_PER_SITE => 0.00025, NSITES => 5000, SELPOS => 2500, NFILES => 1, NREPS => 1, TRAJ_FILENAME => 'traj' ); foreach my $attribute ( keys %attributes ) { my $func = lc($attribute); if ( $attribute =~ m/POPS|SEEDS|POSITIONS/ ) { $func = ucfirst($func); } $func = 'get_' . 
$func; my @returns = $mbsout->$func(); my ( $return, $got ); # If there were more than one return value, then compare references to # arrays instead of scalars unless ( @returns > 1 ) { $got = shift @returns; } else { $got = \@returns } my $expected = $attributes{$attribute}; if ( defined $got && defined $expected ) { is_deeply( $got, $expected, "Get $attribute" ); } else { is_deeply( $got, $expected, "Get $attribute" ) } } # Testing next_run at beginning of run my @data_got = convert_bases_to_nums( $rh_base_conversion_table, $mbsout->get_next_run ); my @data_expected = qw(1111111 5555555 4444444); is_deeply( \@data_got, \@data_expected, "Get next_run at end/beginning of run" ); is( $mbsout->get_next_run_num, undef, 'have all lines been read?' ); # Testing what happens when we read from empty stream @data_got = $mbsout->get_next_run; @data_expected = (); is_deeply( \@data_got, \@data_expected, "Get next_run at eof" ); # Testing what happens when we read from empty stream @data_got = $mbsout->get_next_hap; @data_expected = undef; is_deeply( \@data_got, \@data_expected, "Get next_hap at eof" ); # Testing what happens when we read from empty stream @data_got = $mbsout->get_next_seq; @data_expected = (); is_deeply( \@data_got, \@data_expected, "Get next_seq at eof" ); } sub print_file1 { my $destination = shift; my $gzip = shift; my $out = <$destination" or croak "Could not write file '$destination': $!\n"; print $OUT $out; close $OUT; } sub print_file2 { my $destination = shift; my $gzip = shift; my $out = <$destination" or croak "Could not write file '$destination': $!\n"; print $OUT $out; close $OUT; } sub print_file3 { my $destination = shift; my $gzip = shift; my $out = <$destination" or croak "Could not write file '$destination': $!\n"; print $OUT $out; close $OUT; } sub print_to_file { my ( $ra_in, $out ) = @_; open my $OUT, '>', $out or croak "\nCould not write outfile '$out': $!\n"; print $OUT ("@$ra_in"); close $OUT; } sub convert_bases_to_nums { my ( 
$rh_base_conversion_table, @seqs ) = @_; my @out_seqstrings; foreach my $seq (@seqs) { my $seqstring = $seq->seq; foreach my $base ( keys %{$rh_base_conversion_table} ) { $seqstring =~ s/($base)/$rh_base_conversion_table->{$base}/g; } push @out_seqstrings, $seqstring; } return @out_seqstrings; } seqxml.t100644000766000024 2107514006427353 16166 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO#-*-perl-*- # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin( -tests => 61, -requires_modules => [qw(XML::LibXML XML::LibXML::Reader XML::Writer)] ); use_ok('Bio::SeqIO'); } use_ok('Bio::PrimarySeq'); my $verbose = test_debug(); SKIP: { # XML library version checks if ( 1000 * $] < 5008 ) { skip( "Reader interface only supported in Perl >= 5.8", 96 ); } elsif ( XML::LibXML::LIBXML_VERSION() <= 20620 ) { skip( "Reader not supported for libxml2 <= 2.6.20", 96 ); } if ($verbose) { diag( "libxml version: ", XML::LibXML::LIBXML_VERSION() ); } # checks that your module is there and loads ok use_ok('Bio::SeqIO::seqxml'); # read data ok( my $seq_stream = Bio::SeqIO->new( -file => test_input_file("seqxml.xml"), -format => 'seqxml', -verbose => $verbose, ), 'stream ok', ); # check metadata is( $seq_stream->seqXMLversion, '0.3', 'seqXML version' ); is( $seq_stream->source, 'Ensembl', 'source' ); is( $seq_stream->sourceVersion, '56', 'source version' ); # now get and check the sequence entry itself my $seq_obj = $seq_stream->next_seq; isa_ok( $seq_obj, 'Bio::Seq' ); is( $seq_obj->display_id, 'ENST00000308775', 'display id' ); is( $seq_obj->primary_id, 'ENST00000308775', 'primary id' ); is( $seq_obj->desc, 'dystroglycan 1', 'description' ); is( $seq_obj->seq, 'AAGGC----UGAUGUC.....ACAU', 'sequence' ); is( $seq_obj->length, 25, 'length' ); my ($source) = $seq_obj->get_Annotations('source'); if ($source) { is($source->value, 'Ensembl', 'entry source'); } # species isa_ok( $seq_obj->species, 'Bio::Species', 'species' ); is( $seq_obj->species->node_name, 'Homo sapiens', 'species 
name' ); is( $seq_obj->species->ncbi_taxid, '9606', 'NCBI tax id' ); # alternative IDs my @dblinks = $seq_obj->get_Annotations('dblink'); my $dblink = shift @dblinks; isa_ok( $dblink, 'Bio::Annotation::DBLink' ); is( $dblink->database, 'RefSeq', 'dblink source' ); is( $dblink->primary_id, 'NM_004393', 'dblink ID' ); # properties my @annotations = $seq_obj->get_Annotations(); foreach my $annot_obj (@annotations) { next if ( $annot_obj->tagname eq 'dblink' ); next if ( $annot_obj->tagname eq 'source' ); isa_ok( $annot_obj, 'Bio::Annotation::SimpleValue' ); if ( $annot_obj->tagname eq 'has_splice_variants' ) { is( $annot_obj->value, undef, 'boolean property' ); } elsif ( $annot_obj->tagname eq 'prediction_method' ) { is( $annot_obj->value, 'manual curation', 'property with value' ); } } # write data my $outfile = test_output_file(); ok( my $seq_writer = Bio::SeqIO->new( -file => ">$outfile", -format => 'seqxml', -verbose => $verbose, -source => 'Ensembl', -sourceVersion => '56', -seqXMLversion => '0.3', ), 'writer ok', ); $seq_writer->flush; # to make sure output is written to file ok( -s $outfile, 'outfile is created' ); # check metadata is( $seq_writer->seqXMLversion, '0.3', 'seqXML version' ); is( $seq_writer->source, 'Ensembl', 'source' ); is( $seq_writer->sourceVersion, '56', 'source version' ); is( $seq_writer->schemaLocation, 'http://www.seqxml.org/0.3/seqxml.xsd', 'schemaLocation' ); # write one sequence entry to file $seq_writer->write_seq($seq_obj); $seq_writer->close; if ( $verbose > 0 ) { diag("writing first seqXML outfile"); diag(`cat $outfile`); } # verify written data by roundtripping it { my $new_in = Bio::SeqIO->new( -file => $outfile, -format => 'seqxml' ); my $new_seqobj = $new_in->next_seq; isa_ok( $new_seqobj, 'Bio::Seq' ); is( $new_seqobj->display_id, 'ENST00000308775', 'display id' ); is( $new_seqobj->primary_id, 'ENST00000308775', 'primary id' ); is( $new_seqobj->desc, 'dystroglycan 1', 'description' ); is( $new_seqobj->seq, 
'AAGGC----UGAUGUC.....ACAU', 'sequence' ); is( $new_seqobj->length, 25, 'length' ); my ($new_source) = $new_seqobj->get_Annotations('source'); if ($new_source) { is($new_source->value, 'Ensembl', 'entry source'); } # species isa_ok( $new_seqobj->species, 'Bio::Species', 'species' ); is( $new_seqobj->species->node_name, 'Homo sapiens', 'species name' ); is( $new_seqobj->species->ncbi_taxid, '9606', 'NCBI tax id' ); # alternative IDs my @dblinks = $new_seqobj->get_Annotations('dblink'); my $dblink = shift @dblinks; isa_ok( $dblink, 'Bio::Annotation::DBLink' ); is( $dblink->database, 'RefSeq', 'dblink source' ); is( $dblink->primary_id, 'NM_004393', 'dblink ID' ); # properties my @annotations = $new_seqobj->get_Annotations(); foreach my $annot_obj (@annotations) { next if ( $annot_obj->tagname eq 'dblink' ); next if ( $annot_obj->tagname eq 'source' ); isa_ok( $annot_obj, 'Bio::Annotation::SimpleValue' ); if ( $annot_obj->tagname eq 'has_splice_variants' ) { is( $annot_obj->value, undef, 'boolean property' ); } elsif ( $annot_obj->tagname eq 'prediction_method' ) { is( $annot_obj->value, 'manual curation', 'property with value' ); } } } # write data from a Seq object created from a fasta file { # forcing a Bio::Seq to be created # due to SeqIO::fasta creating a PrimarySeq by default # as of r16838 my $factory = Bio::Seq::SeqFactory->new(-type => 'Bio::Seq'); my $seq_stream = Bio::SeqIO->new( -file => test_input_file("test.fasta"), -format => 'fasta', -seqfactory => $factory, ); my $outfile = test_output_file(); my $writer = Bio::SeqIO->new( -file => ">$outfile", -format => 'seqxml' ); $writer->flush; ok( -s $outfile, 'outfile is created' ); while ( my $seq_obj = $seq_stream->next_seq ) { $writer->write_seq($seq_obj); } $writer->close; if ( $verbose > 0 ) { diag(`cat $outfile`); } # now read that newly made seqxml back in my $in = Bio::SeqIO->new( -file => $outfile, -format => 'seqxml' ); # check header is( $in->seqXMLversion, '0.3', 'seqXML version' ); is( 
$in->source, undef, 'source' ); is( $in->sourceVersion, undef, 'source version' ); # check first sequence entry my $seqxml_obj = $in->next_seq; is( $seqxml_obj->display_id, 'roa1_drome', 'display id' ); is( $seqxml_obj->primary_id, 'roa1_drome', 'primary id' ); is( $seqxml_obj->desc, 'Rea guano receptor type III >> 0.1', 'description' ); is( $seqxml_obj->seq, 'MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVVVMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVKKLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQKQHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY', 'sequence' ); is( $seqxml_obj->length, 358, 'length' ); # check second sequence entry my $seqxml_obj2 = $in->next_seq; is( $seqxml_obj2->display_id, 'roa2_drome', 'display id' ); is( $seqxml_obj2->primary_id, 'roa2_drome', 'primary id' ); is( $seqxml_obj2->desc, 'Rea guano ligand', 'description' ); is( $seqxml_obj2->seq, 'MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVVVMKDPTSTSTSTSTSTSTSTSTMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVKKLFVGALKDDHDEQSIRDYFQHLLLLLLLDLLLLDLLLLDLLLFVEFDDYDPVDKVVLQKQHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGNNWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGNDFGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY', 'sequence' ); is( $seqxml_obj2->length, 358, 'length' ); } } LocalDB000755000766000024 014006427353 14563 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tFlat.t100644000766000024 724114006427353 16002 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/LocalDB# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 27, -requires_module => 'DB_File'); use_ok('Bio::DB::Flat'); } my $verbose = test_debug(); # First of all we need to create an flat db my $tmpdir = test_output_dir(); my $db = Bio::DB::Flat->new(-directory => $tmpdir, 
-index => 'bdb', -dbname => 'mydb', -format => 'fasta', -verbose => $verbose, -write_flag => 1 ); ok($db); my $dir = test_input_file('AAC12660.fa'); my $result = $db->build_index(glob($dir)); ok($result); # Now let's get the sequence out again my $seq = $db->get_Seq_by_id('AAC12660'); ok($seq); is($seq->length,504); undef $db; $db = Bio::DB::Flat->new(-directory => $tmpdir, -index => 'bdb', -format => 'embl', -dbname => 'myembl', -verbose => $verbose, -write_flag => 1 ); $dir= test_input_file('cds_sample.embl'); $result = $db->build_index(glob($dir)); is ($db->get_all_primary_ids, 1); #is ($db->get_all_accs, 1); ok($result); $seq = $db->get_Seq_by_id('EAL24309'); ok($seq); is($seq->length,192); # deal with wantarray conditions $seq = $db->get_Seq_by_acc('CH236947.1'); ok($seq && ref($seq)); is($seq->length,192); undef $db; $db = Bio::DB::Flat->new(-directory => $tmpdir, -index => 'binarysearch', -format => 'fasta', -dbname => 'mybinfa', -verbose => $verbose, -write_flag => 1 ); $dir = test_input_file('dbfa', '1.fa'); $result = $db->build_index($dir); ok($result); $seq = $db->get_Seq_by_id('AW057119'); ok($seq); is($seq->length, 808); $seq = $db->get_Seq_by_id('AW057118'); ok($seq); is($seq->length, 299); undef $db; SKIP: { test_skip(-tests => 4, -requires_module => 'Data::Stag'); $db = Bio::DB::Flat->new(-directory => $tmpdir, -index => 'binarysearch', -format => 'swiss', -dbname => 'mybinswiss', -verbose => $verbose, -write_flag => 1 ); $dir= test_input_file('swiss.dat'); $result = $db->build_index($dir); ok($result); $seq = $db->get_Seq_by_id('ACON_CAEEL'); ok($seq); is($seq->length,788); $seq = $db->get_Seq_by_id('ACON_CAEEL'); ok($seq && ref($seq)); undef $db; } $db = Bio::DB::Flat->new(-directory => $tmpdir, -index => 'binarysearch', -format => 'fasta', -dbname => 'myfasta', -verbose => $verbose, -write_flag => 1 ); $dir = test_input_file('tmp.fst'); $result = $db->build_index(glob($dir)); ok($result); $seq = $db->get_Seq_by_id('TEST00004'); 
is($seq->length,98); undef $db; $db = Bio::DB::Flat->new(-directory => $tmpdir, -index => 'bdb', -format => 'fasta', -dbname => 'mybfasta', -verbose => $verbose, -write_flag => 1 ); $dir = test_input_file('tmp.fst'); $result = $db->build_index(glob($dir)); ok($result); for my $id ( qw(TEST00001 TEST00002 TEST00003 TEST00004) ) { $seq = $db->get_Seq_by_id($id); is($seq->length,98); } Qual.t100644000766000024 716114006427353 16017 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/LocalDBBEGIN { use Bio::Root::Test; test_begin( -tests => 56, -requires_module => 'Bio::DB::Qual'); use_ok('Bio::Root::IO'); use_ok('File::Copy'); } my $DEBUG = test_debug(); { my $test_dbdir = setup_temp_dir('dbqual'); # now use this temporary dir for the db file ok my $db = Bio::DB::Qual->new($test_dbdir, -reindex => 1); is $db->glob, '*.{qual,QUAL,qa,QA}'; isa_ok $db, 'Bio::DB::Qual'; ok my @ids = $db->ids; is scalar(@ids), 15; @ids = sort {$a <=> $b} @ids; is $ids[0] , '17601976'; is $ids[14], '17601991'; my $seqid = '17601979'; # direct indexed qual file database access is ref($db->qual($seqid)), 'ARRAY'; is_deeply $db->qual($seqid), [23, 32, 24, 27, 26, 27, 27, 27, 28, 23, 28, 31, 23, 27]; is $db->length($seqid), 14; is $db->length($seqid, -1000, 1000), 14; # length() ignores start and stop is $db->header($seqid), '17601979'; is_deeply $db->qual($seqid, 2, 11), [32, 24, 27, 26, 27, 27, 27, 28, 23, 28]; is_deeply $db->qual($seqid, 2, 11, 1), [32, 24, 27, 26, 27, 27, 27, 28, 23, 28]; is_deeply $db->qual($seqid, 11, 2), [28, 23, 28, 27, 27, 27, 26, 27, 24, 32]; is_deeply $db->qual($seqid, 2, 11, -1), [28, 23, 28, 27, 27, 27, 26, 27, 24, 32]; is_deeply $db->qual($seqid, 11, 2, -1), [32, 24, 27, 26, 27, 27, 27, 28, 23, 28]; # the bioperl way is $db->get_Qual_by_id('foobarbaz'), undef; ok my $obj = $db->get_Qual_by_id($seqid); isa_ok $obj, 'Bio::Seq::PrimaryQual::Qual'; isa_ok $obj, 'Bio::Seq::QualI'; is ref($obj->qual($seqid)), 'ARRAY'; is $obj->length, 14; is $obj->id, '17601979'; is 
$obj->display_id, '17601979'; is $obj->accession_number, 'unknown'; like $obj->primary_id, qr/^Bio::Seq::PrimaryQual::Qual=HASH/; is $obj->validate_qual( join(' ', @{$obj->qual($seqid)}) ), 1; is $obj->translate, 0; is $obj->qualat(12), 31; is_deeply $obj->subqual(2, 11), [32, 24, 27, 26, 27, 27, 27, 28, 23, 28]; is $obj->header, undef; is $obj->desc, undef; ok my $truncobj = $obj->trunc(1,3); isa_ok $truncobj, 'Bio::Seq::PrimaryQual::Qual'; isa_ok $truncobj, 'Bio::Seq::QualI'; is ref($truncobj->qual($seqid)), 'ARRAY'; is $truncobj->length, 3; ok my $revobj = $obj->revcom; isa_ok $revobj, 'Bio::Seq::PrimaryQual::Qual'; isa_ok $revobj, 'Bio::Seq::PrimaryQual'; is ref($revobj->qual), 'ARRAY'; is $revobj->length, 14; undef $obj; undef $truncobj; undef $revobj; # using get_PrimarySeq_stream streaming ok my $stream = $db->get_PrimaryQual_stream; ok $stream = $db->get_PrimarySeq_stream; isa_ok $stream, 'Bio::DB::Indexed::Stream'; ok my $streamqual = $stream->next_seq; isa_ok $streamqual, 'Bio::Seq::PrimaryQual'; # using newFh streaming ok my $fh = Bio::DB::Qual->newFh($test_dbdir); my $fhqual = <$fh>; isa_ok $fhqual, 'Bio::Seq::PrimaryQual'; undef $fh; # tied-hash access my (%h,$dna1,$dna2); ok tie(%h,'Bio::DB::Qual',$test_dbdir); ok $h{$seqid}; ok $dna1 = $h{"$seqid:1,10"}; ok $dna2 = $h{"$seqid:10,1"}; } sub setup_temp_dir { # this obfuscation is to deal with lockfiles by GDBM_File which can # only be created on local filesystems apparently so will cause test # to block and then fail when the testdir is on an NFS mounted system my $data_dir = shift; my $io = Bio::Root::IO->new(); my $tempdir = test_output_dir(); my $test_dbdir = $io->catfile($tempdir, $data_dir); mkdir($test_dbdir); # make the directory my $indir = test_input_file($data_dir); opendir(my $INDIR,$indir) || die("cannot open dir $indir"); # effectively do a cp -r but only copy the files that are in there, no subdirs for my $file ( map { $io->catfile($indir,$_) } readdir($INDIR) ) { next unless (-f $file ); 
copy($file, $test_dbdir); } closedir($INDIR); return $test_dbdir } mase.t100644000766000024 101014006427353 16047 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: mase.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 3); use_ok('Bio::AlignIO::mase'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # MASE $str = Bio::AlignIO->new( '-file' => test_input_file("testaln.mase"), '-format' => 'mase'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'AK1H_ECOLI/1-318', "mase input test "; xmfa.t100644000766000024 371614006427353 16074 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: xmfa.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 30); use_ok('Bio::AlignIO::xmfa'); } my $DEBUG = test_debug(); # foo my ($str,$aln,$strout,$status); # XMFA $str = Bio::AlignIO->new( -file => test_input_file("testaln.xmfa"), -format => 'xmfa'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); # test seqs my @test_data = ( # 1:1-598 + chrY [ 'chrY/1-598', 1, 598, 1, 'chrY', undef], # 2:5000-5534 - chr17 [ 'chr17/5534-5000', 5000, 5534, -1, 'chr17', undef], # 3:19000-19537 - chr7 [ 'chr7/19537-19000', 19000, 19537, -1, 'chr7', undef], ); for my $pos (1..3) { my $seq = $aln->get_seq_by_pos($pos); my @seq_data = @{shift @test_data}; is $seq->get_nse, shift @seq_data, "xmfa input test "; is $seq->start, shift @seq_data, "xmfa input test for start"; is $seq->end, shift @seq_data, "xmfa input test for end"; is $seq->strand, shift @seq_data, "xmfa strand test"; is $seq->display_id, shift @seq_data, "xmfa input test for id"; is $seq->description, shift @seq_data, "xmfa input test for description"; } # test aln is $aln->score, 111, 'xmfa alignment score'; $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is 
$aln->get_seq_by_pos(1)->get_nse, 'chrY/1000-1059', "xmfa input test "; is $aln->get_seq_by_pos(1)->strand, 1, "xmfa strand"; is ($aln->get_seq_by_pos(2)->description, undef, "xmfa input test for description"); is ($aln->get_seq_by_pos(3)->display_id, 'chr12', "xmfa input test for id"); is ($aln->get_seq_by_pos(2)->start, 6000, "xmfa input test for start"); is ($aln->get_seq_by_pos(1)->end, 1059, "xmfa input test for end"); is ($aln->score, 11, 'xmfa alignment score'); $strout = Bio::AlignIO->new( '-file' => ">".test_output_file(), '-format' => 'xmfa'); $status = $strout->write_aln($aln); is $status, 1,"xmfa output test"; pfam.t100644000766000024 127114006427353 16056 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: pfam.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 5); use_ok('Bio::AlignIO::pfam'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # PFAM format (no annotation) $str = Bio::AlignIO->new( '-file' => test_input_file("testaln.pfam")); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(1)->get_nse, '1433_LYCES/9-246'); $strout = Bio::AlignIO->new( '-file' => ">".test_output_file(), '-format' => 'pfam'); $status = $strout->write_aln($aln); is($status, 1, " pfam output test"); mega.t100644000766000024 141614006427353 16045 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: mega.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 6); use_ok('Bio::AlignIO::mega'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # MEGA $str = Bio::AlignIO->new('-format' => 'mega', '-file' => test_input_file("testaln.mega")); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(1)->get_nse,'Human/1-141'); 
is($aln->get_seq_by_pos(2)->get_nse,'Horse/1-144'); $aln->unmatch(); is($aln->get_seq_by_pos(3)->subseq(1,10), 'V-LSAADKGN'); $strout = Bio::AlignIO->new('-format' => 'mega', '-file' => ">" .test_output_file()); $status = $strout->write_aln($aln); is $status, 1, "mega output test"; meme.t100644000766000024 250714006427353 16061 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 20); use_ok('Bio::AlignIO::meme'); } my $DEBUG = test_debug(); # MEME # this file has no Strand column, and it's version 3.0 my $str = Bio::AlignIO->new( -file => test_input_file('test-3.0-1.meme'), -format => 'meme'); isa_ok($str,'Bio::AlignIO'); my $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI');is $aln->length,25; is $aln->num_sequences,4; is $aln->get_seq_by_pos(3)->seq(),"CCTTAAAATAAAATCCCCACCACCA"; is $aln->get_seq_by_pos(3)->strand,"1"; # this file has a Strand column, also version 3.0 $str = Bio::AlignIO->new( -file => test_input_file('test-3.0-2.meme'), -format => 'meme'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI');is $aln->length,20; is $aln->num_sequences,8; is $aln->get_seq_by_pos(8)->seq(),"CCAGTCTCCCCTGAATACCC"; is $aln->get_seq_by_pos(7)->strand,"-1"; is $aln->get_seq_by_pos(6)->strand,"1"; # version 4.9 $str = Bio::AlignIO->new( -file => test_input_file('test-4.9.meme'), -format => 'meme'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI');is $aln->length,21; is $aln->num_sequences,47; is $aln->get_seq_by_pos(3)->seq(),"AGAGAAACAAGAGGCCTCTTT"; is $aln->get_seq_by_pos(3)->strand,"1"; SearchIO000755000766000024 014006427353 14760 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tpsl.t100644000766000024 537014006427353 16110 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO# -*-Perl-*- Test Harness script for Bioperl # $Id: UCSCParsers.t 11525 2007-06-27 10:16:38Z sendu 
$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 53); use_ok('Bio::SearchIO'); } my $pslparser = Bio::SearchIO->new(-format => 'psl', -file => test_input_file('sbay_c545-yeast.BLASTZ.PSL')); my $result = $pslparser->next_result; is($result->query_name, 'I'); is($result->query_length, 230203); my $hit = $result->next_hit; is($hit->name, 'sbay_c545'); is($hit->length, 28791); my $hsp = $hit->next_hsp; is($hsp->query->start,139871); is($hsp->query->end,141472); is($hsp->query->length, 1602); is($hsp->query->strand, 1); is($hsp->hit->strand, 1); my $q_gapblocks = $hsp->gap_blocks('query'); is(scalar @$q_gapblocks, 24); is($q_gapblocks->[0]->[1],45); is($q_gapblocks->[1]->[1],10); is($q_gapblocks->[1]->[0],139921); $hsp = $hit->next_hsp; $hsp = $hit->next_hsp; is($hsp->hit->start,27302); is($hsp->hit->end,27468); is($hsp->hit->length,167); is($hsp->query->start, 123814); is($hsp->query->end, 123972); is($hsp->query->length, 159); is($hsp->query->strand,-1); $q_gapblocks = $hsp->gap_blocks('query'); is(scalar @$q_gapblocks, 4); is($q_gapblocks->[0]->[1],116); is($q_gapblocks->[1]->[1],4); is($q_gapblocks->[1]->[0],123856); #----------------------------------- $pslparser = Bio::SearchIO->new(-format => 'psl', -file => test_input_file('blat.psLayout3')); $result = $pslparser->next_result; is($result->query_name, 'sequence_10'); is($result->query_length, 1775); $hit = $result->next_hit; is($hit->name, 'sequence_10'); is($hit->length, 1775); $hsp = $hit->next_hsp; is($hsp->query->start,1); is($hsp->query->end,1775); is($hsp->query->length,1775); is($hsp->query->strand,1); is($hsp->hit->strand,1); $q_gapblocks = $hsp->gap_blocks('query'); is(scalar @$q_gapblocks, 1); is($q_gapblocks->[0]->[1],1775); is($q_gapblocks->[1]->[1],undef); is($q_gapblocks->[1]->[0],undef); $hsp = $hit->next_hsp; is($hsp->hit->start,841); is($hsp->hit->end,1244); is($hsp->query->start, 841); is($hsp->query->end, 1244); is($hsp->query->length, 404); is($hsp->query->strand,-1); 
is($hsp->hit->strand,1); $q_gapblocks = $hsp->gap_blocks('query'); is(scalar @$q_gapblocks, 4); is($q_gapblocks->[0]->[1],14); is($q_gapblocks->[1]->[1],21); is($q_gapblocks->[1]->[0],1152); is( $hit->next_hsp, undef, 'next_hsp should be undef'); is( $result->next_hit, undef, 'next_hit should be undef'); TODO: { local $TODO = "next_result should really return undef, not empty string"; is( $pslparser->next_result, undef, 'next_result should be undef'); } # bug 2850 my $searchio = Bio::SearchIO->new( -format => 'psl', -file => test_input_file('headerless.psl'), ); lives_ok { my $result = $searchio->next_result }; axt.t100644000766000024 201614006427353 16100 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 19); use_ok('Bio::SearchIO'); } my $axtparser = Bio::SearchIO->new(-format => 'axt', -file => test_input_file('test_data.axt')); my $result = $axtparser->next_result; is($result->query_name, 'chr19'); my $hit = $result->next_hit; is($hit->name, 'chr11'); my $hsp = $hit->next_hsp; is($hsp->query->start,3001012); is($hsp->query->end,3001075); is($hsp->query->length, 64); is($hsp->query->strand, 1); is($hsp->hit->start,70568380); is($hsp->hit->end,70568443); is($hsp->hit->length, 64); is($hsp->hit->strand, -1); # next HSP returns next alignment, but shouldn't this be the next_hit instead???? # what happens if the hit name changes? 
$hsp = $hit->next_hsp; is($hsp->query->start,3008279); is($hsp->query->end,3008357); is($hsp->query->length, 79); is($hsp->query->strand, 1); is($hsp->hit->start,70573976); is($hsp->hit->end,70574054); is($hsp->hit->length, 79); is($hsp->hit->strand, -1); HTTPget.t100644000766000024 550014006427353 16052 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Root# -*-Perl-*- Test Harness script for Bioperl use strict; use warnings; BEGIN { use Bio::Root::Test; test_begin(-tests => 29, -requires_networking => 1); use_ok('Bio::Root::HTTPget'); } my $TESTURL = 'http://www.google.com/index.html'; my $TEST_PROXY = 'http://myproxy'; my @TEST_AUTHENTICATION = qw(foo bar); my ($fh, $proxy); my @auth; =head1 Bio::Root::HTTPget comments This module is a bit schizophrenic in that it is called in three different ways: as an instance method, as a class method, or as an explicit subroutine. These tests check for all call types. They are by no means complete. =cut # test object method calls my $obj = Bio::Root::HTTPget->new(); ok defined($obj) && $obj->isa('Bio::Root::Root'); lives_ok {$obj->get($TESTURL)}; lives_ok {$fh = $obj->getFH($TESTURL)}; isa_ok($fh, 'IO::Socket::INET'); undef($fh); is ($obj->proxy(), undef); is_deeply([$obj->authentication], []); $obj->proxy('http', $TEST_PROXY); $obj->authentication(@TEST_AUTHENTICATION); is ($obj->proxy(), $TEST_PROXY); is_deeply([$obj->authentication], \@TEST_AUTHENTICATION); # test class method calls; note that mixing class and sub calls pollutes the # class attributes lives_ok {Bio::Root::HTTPget->get($TESTURL)}; lives_ok {$fh = Bio::Root::HTTPget->getFH($TESTURL)}; isa_ok($fh, 'IO::Socket::INET'); undef($fh); is (Bio::Root::HTTPget->proxy(), undef); is_deeply([Bio::Root::HTTPget->authentication], []); Bio::Root::HTTPget->proxy('http', $TEST_PROXY); Bio::Root::HTTPget->authentication(@TEST_AUTHENTICATION); is (Bio::Root::HTTPget->proxy('http'), $TEST_PROXY); is_deeply([Bio::Root::HTTPget->authentication], \@TEST_AUTHENTICATION); # test sub 
calls (not called as method) lives_ok {Bio::Root::HTTPget::get($TESTURL)}; lives_ok {$fh = Bio::Root::HTTPget::getFH($TESTURL)}; isa_ok($fh, 'IO::Socket::INET'); undef($fh); # note that mixing class and sub calls pollutes the class attributes, have to # manually reset Bio::Root::HTTPget->authentication(undef, undef); my $old = Bio::Root::HTTPget->clear_proxy('http'); is (Bio::Root::HTTPget::proxy(), undef); is ($old, $TEST_PROXY); is_deeply([Bio::Root::HTTPget->authentication], [undef, undef]); Bio::Root::HTTPget::proxy('http', $TEST_PROXY); Bio::Root::HTTPget::authentication(@TEST_AUTHENTICATION); is (Bio::Root::HTTPget::proxy('http'), $TEST_PROXY); is_deeply([Bio::Root::HTTPget->authentication], \@TEST_AUTHENTICATION); # check to make sure new instance attributes are not polluted by class attributes # from previous tests my $newobj = Bio::Root::HTTPget->new(); ok defined($newobj) && $newobj->isa('Bio::Root::Root'); is ($newobj->proxy(), undef); is_deeply([$newobj->authentication], []); $newobj->proxy('http', $TEST_PROXY); $newobj->authentication(@TEST_AUTHENTICATION); is ($newobj->proxy(), $TEST_PROXY); is_deeply([$newobj->authentication], \@TEST_AUTHENTICATION); test.waba100644000766000024 15116314006427353 16256 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataU57623.c1 align 40.9% of 3833 pair1_hs.fa U57623:33-2971 + pair1_mm.fa.U02884:242-3687 + 
agctatgatggcgtcacagtactccagcctgggagacacagcga-gagactttgtctctaaaa------------------------------------aataataataaaataaaaagttcaatgaaacaatacacccaaagccctcagcatgcaataaatagcaagacaaggcaggtcttattt-ttactgaaagtgcttagta-aactatacagtg-acaaaccaccgcacaacaggctctcgaaaggaggcagcaaattacccaaaagtg-caggcggcttgctagt-gtgcacaggccaaagaaagggcggcaggtggggaaggcagccat------------------gggccttgaagagctgaccgaattggcagaatttctgcaggaggggagctgggaacgacctgagctaaagctcggagctgtgcgaagaaaccggaaaagcccagagcacttgcaggggcgggtggggagctagatggtggggtggggtggggacggaggagggc-ca-gcaggagacattccgcagggaggggcaagcacgtgtgaggcgggc--ggggcgcgaagggtcaggcttttgctcaaaacaggcagaggacaaggtcagctcagccgcagaccgagccgctggtgactgtctccgccaccaggcagtgagagtgaagggagagcgcgacctctgaagcccgctagactaagcttgcaatctgagctccattcaccccctcctatttcttgagaccttgtcagttcccctgtgagcctcggactcacttgta-aa-acgaggacagatgcccgt----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------gccagaa-g-tcaaccagagctttccccggcgtgggcaccagcccaagggcg-ttttgttttctagtctcatctctgctctgacgctaagctcaaagagggactgggggacgggaagatatccaccatggatgcgccctagatctcgggctggtgtcggctgttccttctcagattccagagtgtctagaggccaggaaagggagaaggtcctaccagcctggggtagggactcgggggccagg-cactg-gcgctgacgcagg-ctagcagggcgccactggctggtc-cccacccacctcggtgggt-tgggggatgggcgcaccagcccctcctgggtgagccctagcctggggcttcctatttcgggagccgggggcgtgggccacgtctcctcatgtgatgcgagggctatttaaagcggcagcccgggcagggagccgccg-tcggagcccttgcacgcctgctctcttgtagcttctctcagcctagcccagcatcactatggtggacgctttcctgggcacctggaagctagtggacagcaagaatttcgatgactacatgaagtcactcggtgagcaagccgcggggctcaggatgttgg-----------------------cttggggactggctggtggcgtgcctagccccacgcagcactcctgccgcatccctcctggttaagactggggaataggggagcgcggagatggcagcctggcctagagcaggt--------------------ggggcctgttcagagggggctttggtggtccaaatctggttagagaccacggtagggaggtggtggaaggaggcagctgtg-----------------tgggaggctctttccaggaag-agggatatgtgatttggaggtaggaggagggtttggataaagaacactgatcacaggaaagggagtgtagccaggggagaaaaagaacaggggcatgg
gtagtttagaaattggaggagactgaacccagaaagggaatggggcagccagggagtgtacaatgatgtaaacaagtaggaaatacctaggaggaaaaagattagtggggaaaaaactgtggatcagtgaatcagatatgagaaggacgtaagacaggaacctgcagtaagcagcaatccccatctctgc---tt-ggttagggaagagaattcttgctggagaatgccctttctcaccagccagtctgaccttgtcctgcagtctatgtatccaggccttcatcactgtctgtgagcctcgtggtagggtggggcaagaggcccatgatcagctgggcctttcctgcaac-ccaa-ggctcacc-tatctgtgcgaggggtaggcag-agaaagccattggacttctgatgtgcagtagagggtccca-aggcaaggtcaagacctgggagggaggatcactggtttaggaggatgtggagaactcctgtggtgttgggatggagaagaatcaggattcaaagaatctcac-aggtgaggaacttggagattccca-tacca-tctagttcaacagggaaactgaaaccaggagagtagaaatgtattataacaattccacagcagagccaatatgaaaatctaaggtttctagatctgta-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------acccagagctcttcccactaccctacaggccctgcgagtgggaagaaaagtagaaactgcttagctaatgattgacctcagcccttcttctactgctttgggcttagatggagag---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------gtcaaagctctcaacggcctctaccctatcttg--------------ggcgctatgcccagtaattctaggcaggcagtcattct-tagaggagcagcccccagcccccacgaacacagcccagcagctattgggaagttggaatgcccagatttagttcctccttccaaagctgggccagagctgagtct-tgaattgagctgcaacaactttacc-attctt-gttcccttattctgccccg-agttggg-tcagcgggctg-gtctccctgaagtcctgttatctttcagcagcttatgttaaggcagccagcattctcatcgtaggaatggaaagcctgggaaaataccctcctcagctctcagtaagtagtgctggcttcatttctaagtagaacccagatctccctgagtctcctaaattctgtcagctcaatattcttagtttctcttggttcagaccctcactcatcccgcagtggtttccttttcaaacactccatacctctgggtagatcctaagtgaacag 
agccagagcagcggcacacagctgtaacccaggagacagaggcaggagaatctttagttcaaggtcaacccctgctacacattgagatgctgacttggtaataatta-aacagaaactgctgaactaaggaataggctccactgaggttccct-tactcacctgtaaaaaggggatgataccacctaccaacgaaaaagttgagtgtgaccatgccctgaagtaggctacaaccatcaatagtcgggtcttatttaataacgtactttaaggtgacaagcagtctagtggcagaagtcaggggaaaaaactgacttcagcagagggtcgcggctttccgggagttaaggtggccgaggccggaagaaccctctgaatagaca-aa-ttgt-c-ttcgcggagtgaagaacgaccctggcacaagctcagaggtcagtaaataaagc-ctgaagcgctttcaggcagcggcgacgggtgggactg-----------------------cggagaaaggcgcaggcgggagacattccgcagggaggggctagcacgtgtggggctagcatgagggaagcaaggtcacg-ttctccgc-cagcagg-tgaggcgctgggcagctcagccatccgcggtgtccaaggcaactcttttc---------cact-tgtctggtaggagcaagagggctc-aaaggccactagaccatgctctctgtccaggctcca---attcttttttacttacggcgaccgcgtca-ttcctctccgagcctctgagcctcttctacaagaagaggacataggaccgttgagatgggtttttgggtaaaggcccttgctgtcaagccttgacaaccccagtttgatacgtgggacccacacggtggaagcagagaagggactcccgcgagttacaacgaacgccccagtctcccaccccttccccataagtacgcctacacgagcatacacaatataagaataaaaccacagcgaattaaaaaacaaggcggcagaaggatcaagcggcg-tttctccagcgt-ggcaccagctcaagggcgagtttcctttc-agt-------------------------------------------------------------------------------------------------------------------------------atggccgggggatgctctacttgggttgcgggaagcgccccgcagccaggccagggatgggttagatggcaccaacaggaccgcgggcgccgctgacgtaggcgacgggagggctgtgggggatgggc-c-ccagccctt--tgcgggagtgcaagcc-ccggcttcctatttcgggagcgaggggtgtgggccac-tttcatcatgtgatgcgagggctatttaaagaggctgtcc-agccgggagctgcggttctcagtgcctgctcgcctcctcactcatcg-----------------------caccatggcggacgcctttgtcggtacctggaagctagtggacagcaagaattttgatgactacatgaagtcactcggtgagcga-acgaacggcgcaggatctagggtcaggagggccggcaaggcggtcttggcgctgagctcccagggggagtgcccccatg-tgc-ctcccgcaagctcctagccagtccagac-agggaatactgaggtgcggag--ggtggcctgggctgaagccactccactccaccccaccccaccccggcctcctgggaggggggtgtcgcggtccaagcttggcgagcctcgtagct--ggaggggaagggtagaggcagctgtggccgcagaggtccgggatgggagg--ctttctaggaagcagtg-taggtgatccggaggt-gga--------------aaggggaggga-aagaagggcgggaggctggccgcaggagaaggcaaagaggagcat-g
gtggtccagaa----------attgaattccgaaagggaatagagcagcta-ggagtgtacag-----------------------------------agcct-ggaggaagactaaagaaaatcagtgaattccatctgggaagaggtgaagatacagccaggcagtcagcaacaagccctacccctccatgttggggtagtgaagaggcctctctctggaagatgccctggttc-tca-ccagcctgaccttcacctacagtgtgtgcagcca-cccctgggatcagtcggagacgctgctgctagagcagggcaagacgaccacta-cacatagg-cttcccggccgcaccagtcggccaccggatcagtgctggggatagggtgaagaaagc----------ctgggatcgagcagagggtgtcagaagaaaggtgaagagctatgaagggaga-gtgtggctt-gg-ggctgggga-aa-ttgtgtggtgtgggcggtgacacaacgcctttaaccagcactctgggaagcagaggcaggtgaatctcccgagttctaggcctggtctatacagaga-gaattccaggacag-ccaggactacacagagaaaccatgtcttgaaaaaaaaaagaaaggaagagtcccatgatttacttaataggaagacagcttgggacacatgagctcatcgcctcataggaaagcccaggatttctttttgaagactgaactagagccttgtgcatgccccctactgctgagttatactccccactcacacacacacacacacacaccctctttttactctgtgtaacaggttctcactaacttccccaggttggctttgatcttgtaacctgccatctcggccttccaaatagttgagaacccagaactacctagagttcttcccatt------------tcaacagtggggaatgtcacatgaaccacttatccaa-ga-cggccccagcccttcctttcttgccttgagcttagataaagacctctacctgcggagtccctggctatatcatcctggtctaggaggctggggcagggaaaacaggactgtgtcatgcctgagctagcttccactccgtcttccccgggaaggagggctggaatcggacatgttgagggatgtgtgtagttgcctctcacctacttccagctcttctctgaaacaggcccacaaagcaatttgtccttttggtttggggaatggaacccaaggccctttgcccgtgctaagcaagcactgcgactgctgaaccacatctccagacagggctcccaccggcacctaccctaccctgaggctctccaggaaggcagctggtcttgtcttttaagacagggttttactgtgtatccctggctctcctggaactccctaa-ggagaccaggtcctgtt-gtctctgcctctcccgcactggggttagaggtatgagcccacacccagctagttggctgtcaagttagggagtactagattacctgggcttagttctgtttcactgagctgccgctgccttcctagacttctttttgcctcagggcatgttgtc-ttcaggccattgt---------------------------------------------------------------tctgtgggtgctgaacccagctttagtcag-gggactgaaattctatttagcctaaaaatatcgacaggctgaaggccagta--aagtctagatgcaccccagcttccagcagtaactggcttcactggcaccctacacctacctgtaggtgggtcctgggaaaacag 
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTT312312312312312312312312312HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHQQQQQQQQQQQQQQQQQHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
LLLLLLLLLLLTTTTTTTTTT231231231231231231231231231231123123123123TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ1231231231231231231TTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL231231231231231231231231231231231231231231QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ231231231231231231231231231231231QQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL U57623.c2 align 43.7% of 4211 pair1_hs.fa U57623:3021-6913 + pair1_mm.fa.U02884:3704-6847 + 
gcctgactcatcctgataccaaggggcaatgccaagttcctcactggccaagcaagggtgggctgacagcataacagcagaggcagcccctgcccctcctgctgtagacctagggctctcaaggggcaaagaggtcccgtctagtaccagtgaccacaggcacaactgctggcct---ggattgagtatgtgctggacagaatcgcccagtgaaaatagtcaacagttttggagccgaggttcaaatctatgtcagtagtttattctctttgaattt-----------------------------------------------------------------------------------------------------------------------------------------------------cgacaagacact-tcgcactcttcattgtaaactg--gggat-aatctacgcttcgaggctgttacaagcattaagtaaaacaacccatgtagggcatgtgcagagtacctagcttccagcaagcactatgtagccaggtacatttggagactttacac-acaccacctcactacactgggctgcctcctgcctcacctttg-------------------------------------------ccttggaaga-cagttcaatgttaagctgctggggggagagggggcagtcatgattagttctttgttctttacttggttgcaggacacttaggactttgcccagtacccaaggaagccatgcttgggtcaggaagagagtctctgtaaagccttagactgggagtcaggagacgggtttgagtctaactca-ttgttgctaccgctttaggctcctcctgaatctgcacaataggacaaatacttcctttgta-c-ctaactcctagatcatagataacaggctttgaaaatgatgggttgccatgtataagggacaagagcactaacacttcttag-tttcagggtaaaaacttccaaagttggaaaactcctatgcctaaggctttggaagggaaagtctatgtttctcttctttcctcagccttattcctaaggctttgagagcttttcaggtgccctggaaggcagccttatgctccagccttgggaggtagtatagctgagcacttaagcaagctctggactcagacaattctgggcttcaatc-tcagatttgtgaccctgggctttacctctgtttttgtatctgtaacgtggaaacagtcttcagaagaacaggaagaactaaatgagataacatgtacagttcttactacacaaaaagctcatagtacttaatagtagctttttttttttttttttgagatggaatctcactctgttggctaggctggagtgcagtggcacaatctcgactcactgcaacctccacctcctaggttcaagcaattctcagcctcagcctcctgagtagctgggattacaggcacataccaccacatctggctaatttttt-gtatttttagtagagacgggtttcaccatattggccag-gctggtcttaaactcctggcctcatgtgatccgcctgccttggcctcccaaagtgtgattacaggcgtgagccaccacacttggcccaatagtagcttattctaatcccagctctgccactgacttgctatggcactgctgttccttaagtatctctcatctaatgggatcagttatctgtgttcaccaaacagaactaagcgcaagactgaattttaaaattcccatgca-aaggctttgaaagatacagtcctccacttccccatacccaggcctgagagttattcattgagtttcttgtacactgcttctctaccccagctcatatactcataac-ct-tccccctaccctcaggtgtgggttttgctaccaggcaggtggccagcatgaccaagcctaccacaatcatcgaaaagaatggggacat
tctcaccctaaaaacacacagcaccttcaagaacacagagatcagctttaagttgggggtggagttcgatgagacaacagcagatgacaggaaggtcaaggtaagtcagggaaacaggggtggggaatggagagtgctgagactctaaaagagaataggctggtagtcttggctccctgg-tattgcaccctgaggggcagactatcatggggaatttacatgaaacaagattcataaagcctgtgtagtgctggaatgccactgatgctaaatacatg-tcagttctgtcctcttgttttcttccctcccttcttgggattcatctattgtctgcctc-ggaatgggcagcacagagccaggatgttcttctgacctcagtatctactccagctccagctgggtgaccctgt---gcaaggtatgcagtagctct-aggtttctttccccttccatagatggagagttatgtggccatggctgtgacctgaagtgctttaggaatgatgcccagaagtcagggccctccactgagtgaggtcattgtgacctccagcagcaaaaaaggcagccaggaactagaagc-acctactcagatgccgcttca-act-tctaactcccagacatggccaatgaccctgacaaactatttccagtgttgccagctgacaggca-------ggaaagagctatgttccgtgatagggcattcaccttgtcatgaatgtgtttgcagtgtctcccaccaagccttagcccctcctcccagggttctatcaccctgcagtggctgtcttggcagcttgc-ctcagccttccaggccaggcatgggagcgagagaacttaagggctttgacctctatagggtgtccctatagcagtgttctat-catgacactatcattcagccccatcagctgtttcctcttcctcatagctgtccccagaaagaacagg---------------------------------------------------atcacacaggtggctggcagcagagctggggatggtgcccaaagatggcagtctaccttggataaaggtggctgccccaccacctgctcatacctccttggacttgcctactttctcaaggggcaagaaccccaattaaacacaata--gc-cctgtggaatgcctagggcaaaaatatctactctgagtaggcaaaaaaaactaggggaat-gagaacaaggagtaaggtaaggataaaaaagagcacactaaga-gacaggcctcataccccttatcacctaaacaatacacagaaccttctcagattctcctactgaaccaccttgctcatcaggatcccttagcctggccttgtggcccccaaactcct--aggaaagagagctggaagagctgccaaatgagaaccagctgatgtatgtatgctggcagcacccagagctgaggaaccacttcaagggcatcca------------------------------gtcacaggactttgtggttgctgccctcttgttggctaa-agaggtcacatgatgtggaccaagaaaaggtgtaggaatacagggcaggaagtctaattatccaatacttcctatcactaagggtcttttagacattatgtggactaaccacaaggctggataaagattctcaggactactcctcctcctcagtcagtctttcccagggatagactagtaaatcccacctgtatctgaggggaccaggctacgggaatcacctagagtacagataagtgtctgtcttgaaggcttgtggtacttctcagagccaggctctctggctccaccatactgcctgcctctccctccttgcctaatatctgaaggcctcttccccagaaaggcagtagtggagcagaggctggaggtgaactagatgtcttgcagggatagctgggaggcggattgcctgagctcttgtcctcacaccatcactagtttgggt
caaaggctgtgtcctctgtggcccagtgtccagaccccaccctgcccctcaattcctgactaagatcacagctcaggcctctaccctctttccacagtccattgtgacactggatggagggaaacttgttcacctgcagaaatgggacgggcaagagaccacacttgtgcgggagctaattgatggaaaactcatcctggtaagatgggca gcccaactcacacagataccagtaggcaaagccaagcctctcact------------------------tcttgctgtgtagccaaagctcctgacttatcctatagaaccaaaggt-tcttaggacaaagcagcccagcctagtttaagtgacttcaagcacagatggtggcttcaagggtagagtatgttattccaagaatgatatagtgagactaaaagagagtttgggaatctatg-tcactaaactcggatta-tttatttacttagcctttttgagacagggtttcactatagcccatactctggaagctacgcagcccaagctggccttgaattctcggcatttcccctgctccagtcccctgcctcttgagtgagattctaggggagtcaccatgcttggcccgtttgactttggctaagacagcaccgtgtctcagcctcgtttgtaaatggaaactataaagcttaggtttagggctgtcccaagga-t-act-tggcca-ccacttagagcttgtgcagtgtgcacagcttgcagcag------------------------------atgcacaataccatagccttatattgggctgcctcctgccacatcgctgaggatggctcagagtgtgctggggccagacgacaggtagtcaaccatggaagattccaggaaagctactaacccaaagcaccaaaggcttgac-ccaaggggtctgtgaactttacctgcttgagggacacctgggaccttgcctaggactcag--------------------------------atccaatgattatgtcaggagtctccccagggacttccaagtcatgcagttgtcgctacttttttcagc-cct-ctacgtctg--tggta-gac-aagac-tcctttgtatctctaac--------------caggaggctttgaaactga--cgctgccatacagacggca-gagagcact-gc-tgtctcagttttctgggtggaaa---------tgggagacgacccttgtccaggggactctagaaggcagttgacg-atctcttggttcttcagtcctgtt-ctgtgtgttcaggag------------------------------------------------actagaagccagcgggt-acccagctctggagcgacacagtgcttagcagcttccatctgaattgtgaccctg-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ttcacatctacaaaatcttccaggggcaggactgcgtgaggggctggctatgtggctcacaccccatggtaccttccttgccgctgttgccgaccctgccct-gcatccacatgtctcatatttgacatgag-------------------------------------------------------------------------------------------------------tttaacagctgccgagctaagggaaagagcta-agggaatgaga
aaatggccccacacaaagacgtgggccactgag-gatcgccttccccatgcctaggccacaaactt-ctctttgggtct-atata-ac-acgtctctaccctacctaatgtgtcctcaactatctgcccctgcccttaggtgtgggctttgccaccaggcaggtggctagcatgaccaagcctactaccatcatcgagaagaacggggatactatcaccataaagacacaaagtaccttcaagaacacagagatcaactttcagctgggaatagagttcgacgaggtgacagcagatgaccggaaggtcaaggtgagtcagag-aa-agggga-tggagggcactg-gatggaacaccacagggtaagaggctggccctcttagctccttggctttttaaccccaaggggcagg-----------------------------ttcataaagcctgctcagtgctgcgatggcccagggactaagtataagctctggcc--------tgttttcttctc-accttcctgggaaggatctatcagctgtcactggagtgggcag--cagagccaagatattcttccgacct-tgt-gc--ctccagctccagctgggtgaccttgtacaccaaggtacccagtggctctgagg-------------------------acatcagcagccatggctgtgacctgaagtgttttaggaatgatgcccagaagtcagtg--ctctac--agcgaggtcatcctg-gctctggcagcagaggaggcagctgggaacaacaagctgagttctatccacaggtttcctgctctgtagctcccaggctagaccactgaccggaacaaactgtttccagtgccaccagttgacaggcacctgcgtttaccaacaggagtgctgttgggtgtccatagtctcttcaagtatttattt-aagggtatgggagggagtgtctacctccagtgatgctcagttatctcatggtggttatcttcttagcagcttgctcaaaatctcccagcactgg---gccacgcacagaactaagtaacatctgtatctataggttgtccttata-cagtatcctctacatcacaacccta-aaattcatctcttcagctctttcctcctcacagttgcccccagaaagaacagggtaggtaccaaccagtcttgcagttacagaggcgctaaagcccagcccaggaccacacagcaagctggaagtgaagatccccaaggcgccctgctgcaccctgctccaccacat-------------gtcccacctac-c-tacatctttgaacttgccatcttccttgagagaccagatttcacattaaatataacagtggtccactgggatgtct---------------------------------------ggacaatggggaatgaagattccagaaggtggaatatagacaggaaagagatgagtgacagccatcacctt-tctagactcctaccgacaaaccctgcttag-ctctccttctga---------------------------------cctgcaactcccaaattcctaaagcagatagatttggggcagctgcccaacacgaactagctga-g-a--taggctggcagcaccaaggaccgaatggtcactggagctagagctagagaacactccaaggatgcctgggtccttggtccaaggaccttgcaattgccgctcttttgccagttaagggaggtcacatggtagaaacaa---------------------------aagtctaatttgccaata-------------------------------------------------------------------------------------------------------------------------------------------------------------------------
------------------------------------------------------tatcaaggctgagctgcctgccatgcccactgccagagggcc-ccgcccctgcagagcagga---------------------------------agcttgggcagctgggctggaggtgggctgtctgctgatcccagcaccatcactggttaggttcctggtttg--------------------------------------------tcctggctctggtcacagcttagg-ctct-gtctctttccacagtcactggtgacgctggacggaggcaaactcatccatgtgcagaagtggaacgggcaggagacaacactaactagggagctagttgacgggaaactcatcctggtaagatgggca 231231231231231231231231231231231231231231231TTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL23123123123123123123123123TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLL2312312312312312312312312QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL23123123123123123123TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL12312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312312HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL31231231231231231231231231231QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTT
TTTTTTTTTTTTTTTTTTTTTTTTTHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT12312312312312312312312331231123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123 U57623.c3 align 50.3% of 2218 pair1_hs.fa U57623:7003-9170 + pair1_mm.fa.U02884:6891-8711 + gctgggacaagagagtggttgtgggtcagggtggtatcaggtgggaatttttctgtgtagtggctttggactcacacaggccggaactcaaatcttaccttataggctacatgactgtgggcaaatcaccttttccaagtgcaactgtaaaacgggtattaataataccaaccttgtagggctgctgggaagcctgtaagagacagtgtatgcacagcacaaagca-tcact-gat-tgaggaacacagcaggtgctccatgtcctttgtttgctcttcctgtgtttctaccttgcctcacctcaggaagaagtagaaaacagggccaaatctgatcccaggccctctaggaggggctcccattgcctatctcagcattccctttcctctcctccctaggactgcattgtcacttgcagggacaggctcgtgactggtggggacactgaatga-cagtacagtcctttcttccccattctagtcctac-cccattttcatgctttctatgtctggcctactgaaactacttgactactgcttgggtaggaagtaccacagccaggctggcagatctgttcaagcttggggacttcacttggagaatctagccttgactgaattccccccagacccagggagagcagccaactgtggattctgcctaaccacagggcctcaggttttcacctaggcatcttcactgcacaccttcttgggtcagcataacctgttaactgcattcttgtactcatgtgggacaggggtccccttgaagtttggaatgaggtgcctagctttggtggggatgtgatatgcaggaccaaattctcagtggcagctgaactatggtgaggccatgggtctggctctatgatgccagaccggatagtgggaggtacagggctctggccctggcactactctaagttagggaaggattggagttagtacccaaacacagtcctttcctgagtctctggatatttttcctatttgtc-aac-tatatgccaggcaccatcttagacactaaggatgaagaagccaaatggtataagggaaggaaaaacactcaggtcttgaccaaattacttcctctctaaaggctc-gttttt--tccaaatctctaaaataagaattacaatgcctgtcttaaggatttgctgtgcatatcagaaaaaaaaaattatgtatgtatacacacacacacacacacacacacacatacatacttgccggcactggtaggtctcagtgaca-attatcagg
aggaagggagggtagaatgctcgcaatg-gtgttcctggctc-ccac-cccccatctc-actctgtctttccttccagacactcacccacggcactgcagtttgcactcgcacttacgagaaagaggcatgacctgactgcactgttgctgac----tactactctgccaatcggctacccctcgactcagcaccacattgcctcat--ttct-tcctctg-cattttgtac-aa-atccacgaattcttctggggtcaggtgccactgaccgggatcc-agttccagttcccatggtgtatgtggttttttttttttttttttaactgcactcatagggtgctctgaggtcaataaagcagagccaaggccacccagttgccttttggcctttggtaacataactctgggagtcttggtttatcctgtgtgtcagagagtg-ggcagaaataacggcctgaaggttactgaggaagaagcactggatgggagactgaaatggacagtctcggagc-ctgttaatcagctgatcaccttacacatttaataataaaagagctgtacctacacgttgcctttacactgcccc-----------------ccctccatggtcaaatgacctagttcagtcagtgatggggcttccccaggtttggctattgaactgtcacttcaggcccatcctacactga-aa-gctc-ttgggt-ctggctgttctctgtgaaatgctgtagtctctccctttccagaattcaggttcagggca-cagaacccaggcttgtaccatggtggtgggaga-aaatgaccactggccaagaggactgctgacctgtgcaccaggctagtacttatgactacaaattcttactgcttctctaatcaactctgagggaagagggcatctgatcattacaaaagggagggcttataagtgat actaggacaagagag-----------------------------gcaggggt-ggggtcctggctgtggatttacacaggtcttggttcaagcatcagtctaaaggctatctgac-aacacataaccttcaagggccactgaaatgg-------------------------------gggctgctgggaggcc----------agtgtattcagagtccaaagcactggccaaatgggaagaacgcagtaggcacccacaaacacttcctcctgcttgctgt----------------tccctgggaagaggcagaaagcaggacc-agt-agatcccaggctggagaggagcagctgccat-----------------cctctcccctcctgagtagaaatgca-ggagtc-tgcaggaccaagtgtgtgcctggtgggcactgccagggagcagccccctccttactcacaattttgtctactgctcctggcttcctg-gaaatatctattgtcta------------------------gctaggaggtgatacaaccaggcctgcagatctgct---------------------------------ttgggacaagtttgaaagcatccctggaatagtccctacatcctcgtaggactgtac-ctgggccaggagcatggaccacaatgtcagcatcacagacctgggtcgctcagtggaacccactgattgca------tattcacatgagactggggttcccactgagttagaagtgaagggcctagctctggaagggaggtaacaaagaggatcatgttcac-taggtagctagagtaaggtgaggccatggct-gggctctacagtgccaga------------------------------------atactgcatgttaggggagg-gagga-aaagttggc-agcttagcagtttctcaaggctctgttcttatcacccgtttgtctgccacatacaccaggcacccccttaggca-----ggtgctgaaatgaacaca
aaggaggaacaggaatgtact-ggatcttgaccagtttacctcctctctcaagggcctatttttcccccaaatctctaaaatgctaattataa--catct-taaaagatttg-------tatcagaaaaaaaa-----------------------------------------------------------gtaaagtgcctggcacacagtaggtgctcaagtgctggtcaggatgagggtggggagcactccctcctctgctctgccccatctgaaacctgtctttctttctagactctcactcatggcagtgtggtgagcactcggacttatgagaaggaggcgtgacctggctgctccgtcactgaccgcccgctcctctgccaactggccacccctcagctcagcaccatgctgcctcatggttttcccctctgacattttgtataaacattcttgggttgggat-ttttctggagatacggggcatcagcctggacccagttcctactatgtatgtggtttatttttt------aaaactgtatccaaagggtgctccaaggtcaataaagcagaaccaaggccacccagttgtct-------------------------------gtctttggtcctcctttcctgtgtgtcaggttgaaatgaaggcctataggtcacctgggaagcagcactgtcaaggag-ccg-agtggacaggctcaaggctcagttagg-----------------------------gaacagtagcacctatgtaatacccttacactgacctgccaaggctcagagaagctagctgtcattctagcatctatgcaagcccttacactggcctgcccatggcagagcagctggctgtcactgtgtggctatttcacattcatcctgcacagacattcctggatttgctgtatggtgtgctgtggtcaccctctctctagagtacaggctcaggacatcaaggtccaggtgtgaacaactgtggtgggaggtgactgctaagagtcgcccactcatgcccagcaagtccccagggttacaaatacaagggaaagcggtc---------------------------------------atcactatggaagagaaggtttatgagtaat 
312312312312312TTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTHHHHHHHHHHHHHHHHTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTT231231231231231231231231231231231231TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL312312312312312312312312TTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTHHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL3123123123123123123123123123123123123123123123123123123123123123123123123123123123123123123QQQQLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHTTTTTTLLLLLLLLLLLLLL312312312312312312312312312312312312312312312312TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTT3123123123123123123123123123123123123QQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT2312312312312312312312312312312 X57152.c1 align 46.8% of 7572 pair9_hs.fa X57152:3-5844 + pair9_mm.fa.X80685:631-7367 + ctgtcggttggggtcctacttttacataacgcccccacaatgcccttcgccttcctcaacgtggcccccgctccaagcccattttctggagccaggaatccactctgtgggttaggaaaggccctcaggaggc-ggagggaaacctgtggaatgccgagaagccgtgtaatgaaataacggtcacggcctggcccctcaccattactctgaccagggttcgaag-----------------------gtcacacttagagcctaaggggaaatggagaagtgcaaagggacgag------cagaatggctggcaccacctcaggttagcgcactgggacgttcca-gttctcacaccgcccaccccaccccacccaagtcctacgca-cggagccaag-ccg-cacctctcccctcatgaggcaggagccccggaggaaacagtacgcccgtcaagggtctctggcgggactgattcgcactaggggcccaacaggcaataaggacccagcggattggccgaggataggccagtcccctgggcagcagcgccgcgccgggactagaggggaacgtgaggaga-gctgcggaaagagatccagcctggctc-cctcctttccccgccctaagtcagcctcttcacccagtgagcacaaaactgtattgcccagactcccgggccccg-a-acgccatacctggcttccgcttccggtggcttctcgttgtgccccgcccgcaagcgccctcctccgggccttcgtgacagccag-gtcgtgcgcgggtcatcctgggattggtagttcg-ctttctctcatttagccagtttctttctctaccggggactccgtgtcccggcatccaccgcggcacctgac-c--cttggcgcttgcgtgttgccctcttccccaccctccctaatttccact------------ccccccaccccacttcgcctgccgcggtcgggtccgcggcctgcgctgtagcggtcgccgccgttccctggaagtagcaacttccctaccccaccccagtcctggtccccgtccagccggtgagtctgaagtcgtcgctgctccgagtcccttgtcgctgggagcggcacatggggtctccggactttgatgt-ggggcgggggaggaagcgaccaggtccg-gcacgaaggagggagaggtggcctgaggagcggaggggggatgtgtggattccggtgaaagggacctgacaat-c-gcc--c-ccaaccc-gtgagaaaaggaggagcccagttcttgcttgagaatgataaacttggaaacccttgggaaaggcgtgggggtcatgcagagacttgtattggtagggagcctgagtcgaggtccctgccggagttgacacagaggagagagggccctggccttcgggagctccagggatgtgggtcgggctggtgggtcaaagtatctg-ttggcttctttcaagtggtgg-gaccccaaagaatgtttaacttcaaagaaaaggggctgagatgtaaattagaggagctggagaggagtgcttcagagtttgggttgctttaagaaagggtggttccgaattctcccgtggttggagggccgaatgtgggaggagggaggataccagaggcagggaaggagaacttgagctttactgac--actgttctt-tttc-tagctgac
gtgaagatgagcagctcagaggaggtgtcctggatttcctggttctgtgggctccgtggcaatgaattcttctgtgaagtgagttctcttcaacc-tccctacttgccagcttcacatatcttcccaccagacgttccttcacatattccacttctacactgttctcttacatgctatttgaaaacttcctatcagcaaagagtcccccctataaaccccgacgaacctgtgctaaagtggcaaaactggggcccaagtcctgagtctgccaccgtccagcaatataacgttgggctagtcaatttgtgtctttttcttttttttgagactgggtctcactctgtcaccgaggctggagggtagtggtgcgatctcggcttactgccacctctgcctcccaggttcaagcgattctcctgctccagcctcccaagtagctgggattacaagtgcctgccaccatgcctggctaatttttgtatttttagtagagacagggtttcactatgttggcaaggctggtctcgaactccagacctcagggtgatctgcctgcctcgggcctcccaaagtgctgggattacaggcgtgagcattgcgcccggcctgtatcttttgttactaaagtggcactgctagtacttgtctcaggtggcctttaggaaaactgaaatgctacacattgaaatgttttg-------------------------------------------ttcagaaaccatgctgttcagcttccacc-ttccttagccagctgagaggacaaaactggttcctagagacgggatacaggagtggagtagggacaaagatcttggaaaagaatgtctaa-gaaaaag-a-ttgctgtatctacttatccttagaaaagaaaagccaaagcttttatgggagagagtgtaggtgaactagggagagacacaagtacttctgctgagttgggagtgagaaacaagcacaacagatgcagttgtgttgatgataaggcatcacttagagcattttgcccaggtcaaagatgaggattttgata-tgggttc--cctcttggcttccatgt-cctgacaggtggatgaagactacatccaggacaaatttaatcttactggactcaatgagcaggtccctcactatcgacaagctctagacatgatcttggacctggagcctggtgaggcaccctcagggttgtt---------------------------------------------ttgtgtgtgtgcgtgcactatttttctcttc-aaat-ctctattcacttgcctgaattttgccaaatttcctttggttctctgatttctttaaccccaaattcatgctttattttgatcctccacctgactcttgtctagt-tttgtgacgtatatcacttgttctcatgttttctaaatccgcaattcagacctattccaaaatgcgtttcctcag-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
----------------------------------------------------------------------ggtctggtttgttgtctgtttctcctgctttgcaccttccagtctagagtttcatcttctgcattgacattgttgcagttatgtattgaggagggagttgggagggagagcaaggagcagaggctgaaaaggtgtgaggggaaggcagagctgtcttcgtttgatgcaagggtcagaagcccaggtttctgggtcccatgcccagatgttggatggggtaaggcccaaaagtaggtgctaggcaaactgaatagcccgcagcccctggatatgggcagggcacctaggaaagctgaaaaacaagtagttgcatttggccgggctgtggttcagatgaagaactggaagacaaccccaaccagagtgacctgattgagcaggcagccgagatgctttatggattgatccacgcccgctacatccttaccaaccgtggcatcgcccagatggtgaggcctctctgctcctacctgcctccttctgagcagtaagagacacaggttcctgcagcaagaagtcatgtttaagccctgtttaaggaagctagctgagaagaggggaagaaccccagaacttgggcctgggaattgaattctgattgggggtcatcctgaagggattgttttcagggagggagac-agaccttgaatcagagagttgtgatagactgcctcttcctcaaggaacaaacaacaaatggctctgatggtttgtagccctgccctaatttggaagaaaggcaacacagaagtttgagagcccatctagtccagagaagggggcctctggacagagttggaaggagtgccgacagagttggtatgggttgggctgcgaagggagttgc-ctcttctttacat-ct-acctgccaaccccttccattgtattcacctcagttggaaaagtaccagcaaggagactttggttactgtcctcgtgtgtactgtgagaaccagccaatgcttcccattggtgagtgttgaagaagggaaaggaaagcaccgtgtggcagtcttatgggaaggagttggggctcaacacattggagcctgagtcctgaggggaggttaggtaggaatagggggatacctggcctgctgagtctggctg----tctcccaggcctttcagacatcccaggtgaagccatggtgaagctctactgccccaagtgcatggatgtgtacacacccaagtcatcaagacaccatcacacggatggcgcctacttcggcactggtttccctcacatgctcttcatggtgcatcccgagtaccggcccaagagacctgccaaccagtttgtgcccaggtagggagcagggagagtcattaagggtcaaaggaaaggcccaagatcccccagagaggggaggacagggcatggccctttcttgaggtctgcttctcccagaatcagggcatctccctgctgagtgactgtgggaaagttatttgattatctgtgc-ttgagttacc---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ttattgtagaatgttcttgagctgagaagttgggaaccacgaggctttagctctgagcaggtccat-----------------------------------------------------------------------------------------------------------------------------agaggagctcaggtgg-ggaggtgggaatgcaggtgactggcagggcctggatggggctcatgctgctgcctctctgacctctgccctggcctaggctctacggtttcaagatccatccgatggcctaccagctgcagctccaagccgccagcaacttcaagagcccagtcaagacgattcgctgattccctcccccacctgtcctgcagtctttgtcttttccttt-cttttttgccaccctttcaggaaccctgtatggtttttagtttaaattaaaggagtcgttatcgtggtgggaatatgaaataaagtagaagaaaaggccatgagctagtctgctggtgcttgctgttggggaagggaaggtgatggtgtgttggactccaggggccctcatggcccagcccaccctccccagattgaaaaccaggacagatttgtgctcagtggat-tgggtggtgtttttagtatggagcagaacagaattcctaggactgcgtgtgatgaaatgcaaggtcaaaaggaaaagacaaagcatatttcaaagatgagaaatatttgtttggatatctatgactgtctgtttatactgtaaggggcttaatcagcagctccatcttttagttttagttctaaaggaaaagtag-cctaaagtcagtataactaaagggtggaacgaggtgggacaaggtccggaattgctgctcagtgatgtgtgtgtgcctgccgctggtggagctgagactgctcactctcagaaggatggggatgcttgatttcctggccaggttgtcccagcacagtggggattggccctgttgtatgacgaagacagcacatggtggcagagatagatactaacccatggactttccaagggagggaataggtctttggagggtatgcaagacaaaggtagacactggataaagaacccggtagtgcccaggtattaccccatctgggccattactcccacactcaggaaccagacgttgtgggtgaggacatgctgtccctcctgccaagta-ataacttccttccca-gccaggatcctgccccaagtaggaatatagctctgcatttacagcagctcctgctcagac--cttgtcaaaaccaccctgcagcttaggattaaggagcatggtcacaggaaggtggggtttcagggcatcccctcaggaactgcccatctccccagaattccaaaatgaaggtccatatgcttgtaggtgtgctggtcatggtgggctca-cagtaggaaagggtaagtggggcccaggggcagggaggg 
ctgtcggttggggtcctgcttttacataacgcccccacaatgcccttcgcc--------------------------------------------------accctccgcgttcagaaagactctgaggacacagtggggagacctgtgggatgccgaggagctgtgagatgaagtaac-atcacgg--ttacccctcaccactgctctgagcagggtttgaagaacagtcagggtcatcagcacaactctctaccccag-ctctgaggagatggagaagtacaaagggacaagcggcggcagaatggctggccccacctcaggttagctcgcaggg-cggtccaggttctcacgcagcgtgcacagctcagcgcatcccaagcacacccgacctaagtccctcaccacaactctgcgcatgccggagcccgagcgcactctgt----------------------------------------------------------------cgcgggcctggcga-gaga-accaggatcccgtgggttggcctggcacagggaagagggagaaaaaaaagcaaaggcaactgagggagcacc-gcagggatgacaatcagcccgaggacctacccggcagctt-agcgag-cctaacaagactatatttcccaggcttcctggtcgggaagaccccctgctccgcttccgcttccggtggcgcctcggttcgccccgccc-ccag-gccttcctctagggcctcgcggcttagagagtcatgcgcgggtcctcttgggattggtagtttgtcttccccccatttcgtcagtttctctctgccgacgggactacatctcccggcgtccacggaggtagggggcggttcttagcgcctgc-cgttgccct-tttctcaccc-aaatcctcaccagtttccagtggccccccccctccctacttcgtctgctgcggccgggtcggctttttgcgctgtagtggtctctgcggttccttggaagca-cagctccccttccccgccccagtcccagtccccgtccggccggtgagtgtggggttgtccccgcgtcgagtgcc-cgtccctgggcgcggcac-tctggttggagctcctcgatgtgggggcgacgaaggaagtgaccgggtcggtagccgaaggaggtcgggggccctgggcgagaggaggcggcttttggggacgctcgtgagacccacagaagtgtggtgccttcgccaaccctgcgagtgcagga-gaggccggctc-tg-ttggggaaaatgaacttcggaagccttgggacaggcgccaggaacatgcaggcacctgcggtggtggggagccggagccgagggccctgtcgcggctgacatggaggggagaggggcctggctttgaggagcgccagggacgagggtc-------------acactaacagatcaggacctgtcattttctggaaagtagcgaaacttattgccagcacactgcaagggggctgaggtgaaggagagaagcttg-ga-ga-tgcttcagaggccgggatccttcaagcctggctggactcagattcttccgttgtt-gtggagaatgtgtgggaggagggaagagacaaga----------aacagcttggtctccacttacttaacgctcttctttcacagcggacataaagatgagtagctctgaggaggtgtcctggatttcctggttctgtgggctccgtggtaatgaattcttctgtgaggtgagtttaggtcggccaactccacatgcc------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ctcgatg
ttttattttatgttgtttgagagcagatttc----------------------------gacggaactcactatgcagcctagactggccttgaactcctggcagtcctacttcctcagtctcctgagt-gctaggatttcaaggttttgtcaatatgcatggctaacagaggtacctct-------------------------------------------------------------------------------------------------------------------------------------------------------------------gggtgaccactaggataattaaaatatttcatgttaaagttgggtaatgtgcaaagcccggtggcttcagtagaaaattataaatgcctttcagaaactaaactgtttagccttcatgattcccaagtcttcaactcagagagtttgctatccccaaga-agatggcaagtgcggtttctaaaataagcttgtgtcgaag-ctgaataatgtctatgtctttgctggagatggttagcctta-aagaaagggcctgcagtttttatgagacttc--------------------------agtacctctgcagaattggga--------ataggacacgagaggtgacttggctgaggataagacatctcttgg--------gcccaggtggaagatgggccttcctatattattttcaaccccttggcttccctgtgcccg-taggtggatgaagactacatccaggacaaatttaatcttactggactcaatgagcaggtgcctcactatcgacaagctctggacatgatcttagacctggaacctggtaatgtgcccttagggttgttgctgactgtggggttggtgataaagtgtgcactcccctccataggtgctttgttttcattttcttctggtctctgctcactggcctcttgaggtggttg-ttgtgggaggtggaaactgggttttcttttttcttt-ttcatatatatatattttatgtatataagcacactaaagctgtacaggtggttgtga-gcct-tcatgtggttgctgggaattgaatttaggaactctgctctcgagtcaaacccgctcactcagtccctgctcactctggcccaaagatttatttattattataaataaatacactgtaactgttttcagacacaccagaagagggcatcagatcccattacagatggttgtgagccaccatgtggttgctgggatttgaactcaggaccttaggaagagcagtcagtgctcttaaccgctgagccatctctctggcccgagacaggttttcttgggtagccttgtctgttctagaactcattctgtagaccagcctgtccttgaactcacagagatctgcttgtctctgcctcctgagtactggatcacagactggcttgcttcttgaatttctaaatttccttttaataatccagctctctctcctatccccagatcccttgctcctgtcttttctcccccacctcaaggcccaagcattcataacccaaactggactttaccccttggtctttctgccctgctttcgatggctgagtcgtggcacacacagcagtgcttggcttcttcactgttttgattccttatgactcttgctttgtctccaagctctctgaaagtagctcagttctggttcgtaacgatagtttcctctcgggtttggtgtgctgctgcctcttcccctctgctgctctagag-ggcctcctctgcgtttgtgtgtttacatttacttactgtgaaaagag-aggcaggcagagactaaag-------aggaacacagagaaggccctgccgagcagtcttgctttg-tagaagaatcaaaagcccaggtttctgggtcagatgtatagatgatgggtagggcaaggctcagaaacaggta
ctagagaaactgga-ggcgc-caccttgcaggtaaggtggggttaccctggaaaactaaacaagcggccaccccacccacgtt--------ttcagatgaagagctggaagacaaccccaaccagagcgacttgatcgaacaggcagctgagatgctttatgggttgatccacgcccgctacatcctcaccaaccgaggcatcgcacaaatggtgaggc-------------tctg-ctccttct--gcaggaagggaaacaggtgcatg-ggtccg-aggcctggttcctacctg-gcaaggagcgtggcagaggagagagggagagacccagagc-tggg-ctgaggagtgaatcctcactcagggtcccattgaa-gg--tgttttaaggaagggagatgaaaccggaaaacgaaggagtttg-----------------cagggaacaaatagcaaatggcccctgggctctgcag-actgccc-cacaaag--gaaaggc-ac-ttg-agtttgaggacctatctcg-acagag-agggggcttctagtcagaagtgggagacatgggg-------------gggggggaccatgaaatgagttgagattgcctcttcatgctcacccaccagtcccttctgttttacacaccttagttggaaaagtaccagcagggagactttggctactgtcctcgtgtatactgtgagaaccagccaatgcttcctatcggtgagtgttgaa----------------------------------agccagaagctgccttactgtac-tcagtgcccgagtcct-ttgggaggttggggagcgcta--gggatacctggtctgc--agtctggctgtctatctcccaggcctttcagacatcccaggcgaggccatggtgaaactctactgccccaagtgcatggacgtgtacacacccaagtcctccagacaccaccacacggacggcgcatacttcggcactggtttccctcacatgctcttcatggtgcatccagagtaccggcccaagcgacctgccaaccagtttgtacccaggtagggagcaaagacagtcacaaaggatcaaag-----agccaaggt-ctagagagaattgaggaca-agcatagccctttcttgaggcttgcctatcccaaagtcagggctttttcctgctgagtggctg-ggg-aagttacttgcttgtgagtccattttcttatctatgaagcagggatgccaacagtacctgcctagcaacagttttaaatgtatggcacttgactatttaaacagtgactgttgctgttcccattttggatgcttagctttgtgatttagtgagacttatcatcctggtatctttttttttactgtataccaggggagaaagattctattttgaacaggtactaaatcttcttttttttttttttttttttttttccgagacagggtttctctgtgtagctctggctgtcctggaactcactttgtagatcaggctggcctcaaactcagaaatccgcctgcctctgcctcccaagtgctgggattaaaggcatgtgccaccatgcccggcagtactaaatcttaataaatttgaaatttcctgtgtgggttggaagtttcccctggcctcatgcctgccaaaccatcctctctctctgagctatactttcactgtgagtttttggttggttggttggtttggcacaggtgttgtgagaggcacagggaatgtttttttttttaagatttatttatttatttatttatttatttatttatttatttatttattatatgtaagtacactgtagctgtcctcagacactccagaagagggagtcagatcttgttacggatggttgtgagccaccatgtggttgctgggatttgaacttccgacctttggaagagcagtcgccatctcaccagcccccacagggaatgtttgacagggtcgttgttggct
aagttgatctcaagctataaataagactataaatgaggcacgcattacctggcctggaattttaattttagtggatgtttatggta-catttttttaagctgggga-tggttacacaaggggaggcagaggtaagcagatctctttattttgagccaaggccctgttagacacagaagccctgtcttgaaaaactagcccctccccaaataagagaaagagagagacatgtttcatgtattcatgtgcaaggagattggggtcctgtggagtcgatgtgaggaggctgggatggggttggccg-cactggcagagctagcagagagcttaccgtgctgcctctctga-c-ct-ccttcacccaggctctatggtttcaagatccatccaatggcttaccagctgcagctccaagccgccagcaacttcaagagcccagtcaagactattcgctgattgcc-cacccacctctccctctgtctgtgacaccaccattcctctgctgccaccctttcaggaagtc--tatggtttttagtttaaattaaaggaattgttactgtggtgggaatatgaaataaag--gaagaaaaggccatgagct-gtctcctactgtgtgttgctggggtcgggaagg-ggtgggctactgcactccagggaccctcttgctctagcccatcttccctaggttggaagc-agcccagattgcggcttcatgaatctgtaccatgtacagatagtgtgtcagagcagcact-ctagg-ttgggtgagcgcaagtgcaggattgaagggaaaggtttcagaggttggagaacatctgctctcttggggctg-tggctgtggctg-ctg--tagacttcatgggaagt--------------cccctcagtcttgctgttaaagggaatgtaaccctacagtaaataaaactgga---------gaggaggca-gagagccccaactgttgctcagtggtgtctgtgtgccccctgctgggagacctgaga--gc-c-gtctggaaaaggttggaaggcttg-tct-ctggccaacctctcccagcataatgggggccaagcctactgtgtg-ggtggacagcaggtagtggcagagacaggt--------------ccctgaagtgggggtacaggccttt-caaggcatg-tcta-ggatgaagacacctgataaagagccagctggtgctcttctgtggcct---------------------acacaggaaccagacaatgtgggtgaggacaggc-cttcttcctgccaa-tagatgacttcctcctctggtcaggatcttgcctcaggcaggagtgcacctctgcacccacagcagtttctgctcagactgcctgtcaaacccattctgcctctgaagatca--------------agaaaggtggggctt-ggggta-ctttgtgggagttgctcctctcccccaaattccataatgagggcccgcgtgcttgtaggtatgctgaccatggtgggctttgcgatggggaagggtaagt-aagaccagggtctgggagga 
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL2312312312312312312312312312312312312312HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLL312312312312312312312312312QQQQQQHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL231231231231231231231231231231LLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL231231231231231231231231231231231231231231231231231231231231231231231231231231231231231TTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL3123123123123123123123123TTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ3123123123123123123123123123LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTTTTTT123123123123123123123TTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH12312312312312312312312312312312312312312312312312QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLL3123123123123123123123123123123123
1231LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTT31231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231231TTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTT1231231231231231231231LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH3123123123123123123123TTTTTLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHH12312312312312312312312312312312312312312312HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL3123123123123123123123123123123123123123123123LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL231231231231231TTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTTTTTTTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLTTTTTTTTTTTTTTLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL2312312312312312312312312312312312312312312312312312312312312311231231231231231231123123123123123123123 multi.phd100644000766000024 3562714006427353 16260 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBEGIN_SEQUENCE ML4924F BEGIN_COMMENT CHROMAT_FILE: ML4924F ABI_THUMBPRINT: 0 PHRED_VERSION: 0.980904.e CALL_METHOD: phred QUALITY_LEVELS: 99 TIME: Thu Jun 21 11:27:03 2001 TRACE_ARRAY_MIN_INDEX: 0 TRACE_ARRAY_MAX_INDEX: 8444 CHEM: term DYE: big END_COMMENT BEGIN_DNA t 10 3 t 16 14 g 19 26 t 19 36 t 8 47 c 6 55 t 6 74 t 6 
77 t 6 95 t 9 98 c 9 105 t 15 119 c 17 128 g 17 143 a 22 148 g 22 162 g 25 173 g 25 185 t 22 197 a 22 202 c 22 213 t 25 227 t 34 239 g 34 250 a 42 260 g 35 270 g 35 282 a 38 294 t 40 303 t 40 314 c 35 323 t 34 335 a 31 344 c 31 351 c 33 364 a 34 373 c 35 381 c 33 394 a 29 404 c 35 412 a 29 424 a 29 434 c 32 443 a 34 454 a 34 463 c 35 473 a 35 484 a 42 494 a 42 506 g 42 516 c 42 526 c 35 539 t 35 551 t 37 562 a 35 571 t 35 581 c 35 592 a 35 600 t 35 612 c 31 623 c 33 635 a 24 644 g 24 655 a 24 667 a 29 677 c 26 686 c 29 698 g 29 712 c 29 719 a 29 731 a 29 740 a 29 751 t 35 763 t 35 775 a 35 782 t 35 794 a 35 803 a 35 813 a 42 824 a 42 835 a 47 846 a 47 857 g 40 869 a 35 880 g 35 891 c 35 900 c 32 913 a 32 921 a 32 932 c 32 942 a 42 953 c 42 963 c 42 976 a 37 986 c 37 996 a 30 1007 a 30 1018 a 35 1030 a 35 1041 c 35 1050 a 35 1061 a 35 1071 a 37 1083 a 32 1095 t 32 1106 g 35 1119 a 35 1128 a 37 1139 a 37 1150 a 42 1161 g 42 1174 a 37 1184 a 37 1194 a 40 1206 c 37 1215 a 37 1226 a 35 1236 a 35 1249 c 35 1259 t 37 1274 c 35 1284 t 35 1298 t 37 1310 c 37 1319 a 37 1328 a 37 1339 c 35 1349 c 32 1362 a 29 1371 g 31 1385 t 31 1400 g 29 1410 a 35 1419 t 35 1433 g 35 1445 t 35 1457 t 35 1469 c 35 1478 t 35 1492 g 35 1505 a 35 1514 g 32 1526 a 32 1538 c 35 1547 t 32 1562 t 35 1573 g 35 1586 t 32 1596 t 32 1608 g 32 1622 a 32 1630 g 32 1644 g 35 1656 a 35 1666 t 35 1679 a 35 1687 a 35 1698 c 35 1709 a 37 1721 c 37 1731 c 35 1745 t 35 1760 t 32 1773 c 32 1782 g 32 1796 t 32 1808 a 32 1817 c 30 1826 t 35 1842 t 35 1854 a 35 1862 a 35 1874 c 35 1884 t 35 1900 t 32 1913 g 35 1924 g 42 1936 a 42 1948 a 42 1958 c 42 1968 c 35 1981 a 35 1992 c 38 2002 t 38 2018 t 38 2030 c 38 2040 a 42 2052 t 40 2064 g 40 2077 g 38 2089 c 35 2098 a 35 2112 t 35 2124 c 42 2135 a 50 2147 t 56 2159 c 56 2171 c 50 2183 t 56 2196 t 56 2209 g 42 2222 g 42 2234 t 42 2246 a 38 2256 a 38 2267 c 38 2277 t 38 2293 c 38 2303 t 38 2318 a 40 2328 t 38 2339 g 38 2353 c 38 2363 t 40 2378 g 40 2390 a 40 2401 a 44 
2413 t 42 2425 c 42 2435 c 42 2448 c 38 2460 a 36 2471 a 36 2483 c 32 2492 g 32 2507 c 32 2517 g 35 2532 a 35 2543 g 38 2555 t 41 2569 c 41 2579 t 41 2593 t 41 2605 g 41 2617 c 42 2627 a 41 2638 c 41 2649 c 41 2662 t 41 2676 a 41 2686 c 41 2697 g 41 2711 g 41 2723 c 41 2735 g 35 2749 a 40 2759 c 35 2769 g 32 2784 t 32 2796 g 36 2809 c 38 2818 a 35 2830 a 35 2842 c 37 2852 a 38 2865 c 35 2876 g 35 2891 g 35 2903 t 35 2915 a 37 2926 t 37 2938 c 41 2950 c 41 2962 c 41 2974 g 41 2989 g 41 3000 g 41 3012 c 41 3023 g 35 3038 t 35 3050 t 35 3063 c 32 3071 a 35 3083 a 31 3094 g 33 3107 a 32 3120 a 35 3131 c 35 3140 a 37 3153 a 29 3164 c 33 3174 a 16 3187 t 16 3198 a 9 3210 g 20 3223 a 13 3236 a 23 3247 g 23 3259 t 33 3272 c 35 3281 c 37 3294 a 35 3305 t 40 3317 t 40 3331 c 45 3341 c 40 3353 a 40 3364 t 50 3375 a 30 3388 g 30 3400 a 33 3413 t 40 3424 a 33 3434 c 37 3444 c 29 3458 c 29 3470 g 21 3485 t 25 3497 a 21 3508 g 23 3519 a 23 3532 a 50 3543 g 50 3554 g 37 3567 a 35 3579 t 35 3591 c 35 3600 g 32 3616 t 32 3627 a 32 3637 c 37 3647 t 37 3662 t 37 3675 g 40 3686 a 37 3698 t 37 3709 t 37 3722 c 37 3731 c 35 3744 a 35 3755 a 44 3766 g 44 3778 a 44 3791 t 44 3802 c 44 3813 a 56 3824 a 44 3836 t 56 3849 g 37 3861 t 35 3873 g 35 3885 c 35 3894 t 35 3910 c 35 3918 c 26 3932 t 32 3945 g 26 3958 g 26 3969 a 26 3982 t 32 3992 a 32 4003 c 35 4012 c 35 4025 g 35 4040 a 35 4051 a 35 4063 t 42 4075 c 42 4085 c 35 4097 a 42 4108 a 42 4119 a 42 4131 a 42 4143 c 35 4153 a 33 4165 g 29 4178 c 33 4189 c 29 4202 a 30 4212 g 28 4225 t 35 4236 g 33 4250 t 35 4261 c 35 4273 a 35 4283 c 35 4293 t 37 4308 g 37 4321 a 45 4332 a 44 4344 g 44 4355 t 44 4367 t 44 4378 c 44 4391 c 44 4402 t 44 4415 c 39 4427 c 39 4438 t 42 4450 c 42 4461 a 42 4472 a 42 4483 c 42 4493 a 35 4505 g 28 4517 c 27 4529 t 24 4543 c 20 4555 g 20 4567 t 22 4576 a 29 4589 c 29 4599 t 40 4613 c 40 4624 c 42 4637 t 42 4648 t 35 4662 g 35 4673 a 35 4685 c 37 4694 t 35 4708 t 35 4720 t 33 4733 c 33 4743 a 33 4752 a 35 4764 g 35 
4776 c 34 4787 c 34 4801 a 33 4811 c 33 4821 t 33 4835 c 29 4847 t 28 4860 c 28 4872 a 27 4881 a 21 4894 c 21 4908 a 15 4917 a 15 4928 g 21 4939 c 21 4949 t 25 4964 g 29 4976 c 29 4984 a 29 4998 t 29 5010 t 33 5023 g 33 5034 c 25 5044 c 27 5058 t 23 5071 t 23 5083 g 11 5094 c 10 5104 c 10 5117 t 24 5130 c 24 5139 c 29 5152 t 29 5165 c 27 5177 t 29 5189 c 29 5202 a 29 5209 c 29 5220 g 32 5233 g 36 5245 t 36 5258 g 35 5269 a 45 5281 c 40 5291 g 38 5305 t 38 5317 a 28 5328 g 28 5338 c 26 5351 a 32 5363 c 32 5372 g 35 5387 c 35 5397 a 35 5410 a 42 5421 t 37 5432 c 34 5445 t 34 5456 t 42 5468 t 29 5480 t 29 5491 c 29 5505 a 35 5513 t 35 5526 t 26 5538 a 24 5548 c 17 5557 g 14 5574 t 12 5585 c 12 5598 t 25 5609 a 27 5618 a 35 5630 t 37 5643 a 38 5653 c 40 5663 c 34 5677 g 34 5690 a 34 5700 a 35 5711 a 32 5725 g 33 5735 a 32 5747 c 29 5756 c 29 5770 t 21 5782 c 21 5794 a 27 5805 c 33 5815 a 26 5827 g 16 5838 t 16 5851 g 10 5863 c 10 5873 a 10 5885 c 20 5894 c 12 5908 t 6 5921 g 6 5931 g 8 5946 c 9 5950 c 9 5970 t 11 5981 t 13 5995 g 18 6004 g 15 6017 a 15 6030 c 17 6042 a 11 6050 a 8 6064 c 11 6077 a 9 6086 c 13 6093 a 9 6106 g 19 6119 g 21 6132 t 24 6146 g 24 6156 t 18 6168 t 15 6181 t 15 6194 g 21 6205 a 25 6215 c 29 6225 c 29 6239 a 25 6248 c 25 6260 t 14 6274 g 14 6285 a 9 6298 g 9 6309 c 9 6322 t 24 6333 g 14 6345 t 11 6355 t 8 6369 a 8 6383 c 8 6391 a 15 6400 a 20 6410 a 27 6422 a 40 6433 c 40 6444 c 40 6457 t 26 6470 t 24 6481 g 12 6493 g 12 6503 a 15 6517 g 13 6528 g 10 6540 c 17 6549 a 18 6565 c 15 6574 g 12 6586 g 15 6597 g 16 6609 t 19 6621 g 23 6635 a 20 6645 g 24 6658 a 21 6670 c 21 6679 g 21 6693 a 21 6703 t 24 6712 c 16 6725 a 20 6736 c 18 6746 c 20 6763 a 16 6771 c 16 6781 t 14 6796 c 12 6808 t 10 6819 c 10 6829 a 12 6841 c 17 6850 c 17 6864 a 29 6873 a 29 6885 c 29 6893 g 19 6908 g 16 6919 a 9 6934 g 9 6941 a 9 6956 t 16 6965 g 13 6978 t 11 6990 c 9 6998 g 11 7014 a 9 7025 g 10 7037 a 9 7049 a 19 7059 c 14 7069 g 14 7081 a 10 7093 g 10 7104 c 10 7118 t 14 
7129 t 14 7142 c 25 7154 t 26 7166 g 28 7176 a 32 7187 a 32 7198 c 32 7208 c 33 7222 t 33 7234 t 28 7244 g 21 7256 a 18 7268 t 12 7279 a 11 7288 t 13 7300 t 9 7314 c 9 7322 c 12 7336 t 10 7346 c 11 7359 a 10 7368 t 11 7378 t 12 7394 g 11 7402 g 11 7413 g 10 7423 c 10 7439 t 10 7454 c 10 7462 c 8 7477 a 8 7484 c 10 7497 a 9 7506 g 9 7518 c 9 7527 t 6 7544 c 6 7546 t 8 7557 t 8 7574 t 11 7590 c 9 7600 t 9 7611 c 10 7621 t 10 7632 g 14 7644 a 14 7655 c 10 7666 g 8 7678 c 8 7690 c 17 7703 a 10 7711 t 10 7719 t 9 7729 c 12 7746 t 9 7755 g 11 7770 a 7 7781 g 7 7787 g 10 7802 a 12 7813 t 9 7821 c 6 7837 t 6 7840 a 6 7852 g 6 7864 g 8 7880 g 8 7893 a 11 7903 t 12 7915 g 10 7925 c 9 7938 a 9 7947 c 8 7961 a 8 7968 c 12 7978 c 14 7991 c 20 8004 a 16 8013 c 17 8026 a 10 8036 a 10 8045 g 12 8059 g 17 8071 a 17 8084 c 24 8096 a 18 8107 t 14 8116 g 14 8128 a 11 8140 c 14 8149 c 12 8162 a 15 8174 a 15 8183 a 10 8193 a 9 8205 t 11 8214 c 9 8228 g 9 8237 a 10 8252 c 8 8259 a 6 8275 t 6 8279 c 8 8286 t 8 8305 g 7 8318 g 7 8327 g 9 8341 c 9 8349 c 7 8365 c 6 8372 a 10 8387 c 6 8400 t 6 8400 c 6 8415 g 6 8437 END_DNA END_SEQUENCE BEGIN_SEQUENCE ML4924R BEGIN_COMMENT CHROMAT_FILE: ML4924R ABI_THUMBPRINT: 0 PHRED_VERSION: 0.980904.e CALL_METHOD: phred QUALITY_LEVELS: 99 TIME: Thu Jun 21 11:27:03 2001 TRACE_ARRAY_MIN_INDEX: 0 TRACE_ARRAY_MAX_INDEX: 8792 CHEM: term DYE: big END_COMMENT BEGIN_DNA a 6 1 c 6 20 t 6 17 t 8 26 t 8 35 g 12 48 g 18 58 t 16 66 c 14 83 g 11 98 c 9 104 c 6 122 t 6 128 g 9 140 c 19 147 a 20 159 g 32 167 g 34 178 t 34 190 a 39 200 c 39 208 c 39 221 g 29 233 g 40 243 t 28 255 c 21 267 c 13 277 g 4 288 n 4 297 g 4 298 a 8 314 t 8 328 t 23 339 c 29 349 c 39 359 c 40 369 g 40 381 g 40 390 g 40 401 t 46 413 c 34 425 g 33 436 a 29 447 c 29 453 c 32 466 c 33 476 a 34 485 c 34 492 g 34 505 c 34 514 g 40 527 t 40 537 c 34 548 c 34 559 g 34 572 c 34 579 t 34 592 t 34 602 t 39 613 g 40 625 c 40 633 t 40 645 c 40 656 a 40 664 t 40 673 c 40 685 t 46 697 c 51 707 c 51 718 c 56 728 
t 51 738 c 51 749 t 35 760 g 35 772 a 35 783 c 35 790 t 35 803 t 35 813 t 39 824 g 40 836 c 40 844 a 40 855 t 40 865 c 56 876 c 42 887 c 51 898 t 46 909 t 46 921 a 46 931 a 46 940 t 46 951 c 46 962 c 46 974 t 46 985 c 40 995 a 40 1004 g 40 1015 a 40 1028 a 40 1037 t 40 1048 g 35 1060 g 35 1071 c 35 1080 g 35 1096 t 35 1106 c 35 1116 a 40 1125 g 40 1138 a 40 1150 g 51 1160 a 51 1172 a 51 1182 g 51 1193 a 40 1204 a 35 1214 g 35 1226 c 39 1235 t 39 1250 c 39 1259 t 39 1272 c 40 1282 g 51 1295 a 51 1305 a 51 1315 c 51 1325 c 51 1337 c 51 1348 a 51 1358 a 51 1368 t 51 1381 g 51 1393 a 51 1403 g 51 1415 g 51 1427 g 39 1440 a 39 1450 t 39 1462 a 39 1470 t 39 1483 c 35 1494 a 40 1503 a 40 1514 g 40 1528 g 40 1540 t 40 1552 t 39 1565 c 39 1574 a 39 1583 g 39 1598 a 38 1609 a 40 1619 g 39 1632 c 39 1642 t 39 1657 c 39 1666 g 39 1680 t 40 1691 t 45 1703 c 51 1713 t 51 1726 c 51 1736 a 51 1746 a 51 1757 c 51 1767 a 45 1778 t 45 1791 c 45 1802 t 45 1816 c 45 1826 c 45 1838 g 45 1852 t 51 1863 t 51 1875 g 56 1887 g 43 1899 t 43 1911 g 43 1924 a 43 1933 g 43 1946 a 43 1958 g 56 1970 t 56 1982 g 56 1994 g 56 2006 t 56 2017 g 51 2030 a 43 2041 t 43 2053 c 43 2063 g 43 2079 t 43 2089 c 43 2100 t 43 2114 c 43 2124 a 43 2135 c 43 2144 c 43 2157 c 43 2169 g 43 2184 t 43 2195 g 43 2207 c 43 2216 c 43 2229 t 43 2244 c 43 2253 c 43 2265 a 46 2277 a 56 2288 g 56 2299 g 56 2311 t 56 2324 t 56 2336 t 56 2348 t 56 2360 g 56 2373 g 51 2385 a 51 2397 a 51 2408 c 51 2418 a 43 2430 g 43 2443 c 45 2454 t 45 2469 c 45 2480 a 45 2491 g 56 2503 t 56 2516 g 56 2528 g 51 2540 t 51 2552 c 51 2564 a 51 2574 a 51 2585 a 51 2598 c 45 2608 a 45 2620 c 45 2630 c 45 2644 t 45 2658 g 51 2671 t 51 2682 c 51 2693 t 51 2707 t 45 2719 c 45 2730 t 51 2743 c 51 2753 c 51 2765 a 51 2775 a 51 2787 g 43 2798 g 43 2810 c 43 2822 g 43 2837 a 43 2847 g 43 2859 g 51 2872 t 51 2884 a 51 2894 c 43 2904 a 43 2917 c 43 2927 t 43 2943 g 43 2956 t 43 2967 g 51 2980 a 51 2992 g 45 3003 g 45 3017 t 45 3029 c 45 3039 t 45 3053 t 45 
3065 t 51 3077 c 51 3088 g 51 3102 g 51 3113 t 51 3125 a 43 3136 t 43 3148 t 43 3161 a 43 3170 g 43 3183 a 43 3196 c 43 3205 g 43 3220 t 43 3231 a 45 3243 a 56 3254 t 56 3266 g 56 3279 a 56 3290 a 56 3302 a 56 3314 a 51 3325 g 51 3337 a 51 3350 t 43 3360 t 43 3373 g 43 3385 c 43 3394 g 43 3410 t 43 3421 g 43 3434 c 43 3443 t 43 3458 a 43 3468 c 43 3477 g 43 3493 t 43 3505 c 43 3515 a 43 3527 c 43 3536 c 43 3550 g 43 3565 t 43 3576 g 43 3588 a 45 3601 g 45 3611 a 45 3625 g 45 3635 g 45 3648 a 43 3660 g 43 3672 a 38 3684 c 38 3692 a 38 3705 a 43 3717 g 56 3729 g 56 3741 c 56 3752 a 56 3765 a 51 3776 t 51 3789 g 51 3801 c 51 3811 a 43 3824 g 43 3836 c 43 3846 t 43 3862 t 43 3874 c 43 3884 t 51 3897 t 51 3909 g 56 3922 a 56 3933 g 56 3944 a 56 3957 g 56 3968 t 43 3980 g 43 3992 g 43 4004 c 43 4013 t 43 4029 t 43 4041 g 43 4053 a 56 4064 a 56 4075 a 51 4087 g 51 4098 t 51 4111 c 51 4121 a 51 4132 a 51 4144 g 56 4155 g 56 4168 a 51 4181 g 51 4192 t 43 4205 a 43 4215 c 43 4225 g 43 4240 a 43 4250 g 43 4263 c 43 4274 t 45 4289 g 45 4301 t 45 4312 t 56 4324 g 56 4336 a 56 4347 g 56 4358 g 56 4371 a 56 4383 g 56 4394 g 51 4407 a 43 4419 a 43 4430 c 43 4440 t 43 4455 t 43 4467 c 43 4477 a 51 4488 g 51 4500 t 51 4513 g 51 4525 a 51 4536 c 45 4546 a 45 4558 c 45 4569 t 51 4583 g 45 4595 g 45 4607 c 45 4618 t 45 4633 g 45 4645 t 45 4656 t 56 4668 t 56 4680 t 56 4692 g 56 4704 g 56 4716 a 56 4727 t 56 4738 t 56 4751 c 56 4762 g 56 4775 g 56 4787 t 51 4798 a 51 4809 t 51 4821 c 51 4831 c 51 4844 a 51 4854 g 51 4866 g 51 4879 a 56 4892 g 51 4903 c 51 4914 a 51 4926 c 51 4936 a 51 4948 t 51 4960 t 56 4973 g 56 4985 a 56 4997 t 56 5008 c 56 5020 t 56 5033 t 56 5045 g 56 5057 g 56 5069 a 56 5080 a 40 5091 t 40 5104 c 40 5115 a 40 5124 a 40 5136 g 40 5149 t 40 5162 a 40 5172 c 40 5182 g 40 5197 a 40 5207 t 40 5220 c 51 5231 c 56 5243 t 56 5256 t 51 5268 c 51 5279 t 51 5292 a 51 5302 c 51 5312 g 51 5325 g 51 5338 g 51 5350 t 51 5362 a 45 5372 t 45 5384 c 45 5395 t 45 5409 a 45 5419 t 45 
5431 g 51 5444 g 56 5456 a 56 5468 a 56 5479 t 56 5492 g 40 5503 g 39 5516 a 39 5528 c 39 5537 t 39 5552 t 39 5564 c 39 5575 t 40 5588 a 40 5597 t 40 5609 g 40 5622 t 56 5634 t 56 5646 g 51 5658 t 51 5669 t 51 5682 c 51 5692 t 51 5705 t 51 5718 g 51 5730 a 45 5741 a 40 5752 c 40 5762 g 37 5776 c 37 5785 c 40 5798 c 40 5810 g 56 5823 g 56 5834 g 51 5846 a 51 5859 t 40 5870 a 40 5880 c 40 5890 c 40 5904 g 40 5918 t 51 5929 g 51 5941 t 37 5953 t 37 5966 g 37 5978 c 37 5987 a 37 5999 c 40 6009 g 40 6023 t 34 6035 c 34 6047 g 29 6060 c 29 6068 c 29 6083 g 29 6096 t 32 6107 a 29 6118 g 25 6128 g 19 6142 t 27 6154 g 27 6167 c 32 6176 a 32 6187 a 34 6199 g 40 6212 a 34 6224 c 40 6234 t 34 6248 c 35 6258 g 35 6272 c 35 6281 g 35 6296 t 39 6307 t 40 6320 g 56 6331 g 56 6342 g 56 6355 a 51 6367 t 51 6378 t 46 6391 c 46 6401 a 42 6411 g 42 6424 c 42 6435 a 46 6447 t 40 6459 a 37 6469 g 37 6482 a 40 6494 g 29 6506 t 29 6518 t 29 6531 a 29 6539 c 29 6549 c 27 6563 a 32 6573 a 29 6585 g 29 6596 g 34 6609 a 48 6622 t 48 6633 g 48 6645 a 32 6656 t 34 6668 g 34 6681 c 34 6690 c 31 6704 a 31 6714 t 37 6726 g 46 6739 a 48 6750 a 48 6762 g 40 6774 t 40 6786 g 40 6797 g 27 6810 t 25 6821 t 21 6834 c 16 6842 c 16 6856 a 18 6865 a 19 6877 g 23 6889 t 27 6901 t 25 6915 a 29 6923 a 24 6936 g 29 6947 t 32 6960 a 29 6970 c 29 6980 g 25 6995 a 25 7004 a 21 7018 g 15 7026 g 15 7041 t 23 7053 g 27 7065 t 27 7075 t 27 7089 a 27 7098 t 25 7110 c 22 7120 c 24 7133 t 29 7147 c 32 7157 a 32 7168 a 40 7179 c 29 7189 a 29 7200 a 29 7211 g 48 7224 t 48 7237 c 40 7248 t 40 7261 c 40 7272 a 29 7282 g 29 7294 a 25 7306 a 29 7317 c 29 7327 a 40 7338 t 40 7350 c 32 7362 a 32 7372 c 32 7383 t 32 7397 g 32 7409 g 25 7422 t 25 7432 t 25 7445 g 29 7457 a 40 7467 a 36 7479 g 40 7491 a 36 7503 g 36 7514 t 37 7525 t 33 7537 t 46 7549 g 24 7561 t 24 7571 t 24 7584 t 32 7596 c 32 7606 t 40 7619 t 34 7630 t 22 7642 t 25 7654 c 25 7665 a 27 7674 t 25 7685 t 22 7698 t 29 7709 t 29 7721 g 25 7734 t 21 7744 t 21 7757 t 22 
7767 t 29 7779 g 24 7791 t 24 7802 g 24 7814 g 31 7827 t 31 7838 g 28 7850 t 28 7861 t 24 7873 g 19 7884 g 19 7897 c 23 7906 t 27 7921 c 25 7931 t 22 7943 t 29 7954 t 34 7965 t 32 7976 t 32 7988 t 32 8000 a 25 8009 t 18 8020 a 18 8031 a 18 8039 t 18 8053 t 23 8066 t 32 8077 g 26 8089 c 25 8099 g 19 8112 g 15 8124 t 15 8134 t 15 8148 c 23 8157 t 23 8170 g 22 8182 g 34 8193 a 37 8205 t 40 8216 g 32 8227 a 32 8238 t 29 8250 a 22 8259 a 18 8269 g 18 8281 g 13 8294 c 22 8305 t 16 8318 t 20 8328 t 25 8341 g 29 8353 t 24 8364 t 24 8376 g 25 8388 t 19 8398 t 19 8412 g 25 8422 t 24 8433 g 21 8444 g 21 8455 t 27 8467 g 21 8479 g 20 8491 t 26 8502 a 25 8512 g 18 8524 a 15 8536 a 15 8545 t 10 8558 c 13 8566 c 13 8581 t 17 8593 c 21 8603 a 22 8613 a 25 8624 g 29 8635 t 25 8647 a 25 8657 c 25 8666 c 24 8678 c 19 8691 t 10 8703 c 10 8712 g 11 8726 a 17 8736 g 17 8747 a 12 8759 t 19 8768 a 16 8778 t 11 8787 END_DNA END_SEQUENCE test.tseq100644000766000024 1433414006427353 16276 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data 11321596 ref|NM_002253.1| 9606 Homo sapiens Homo sapiens kinase insert domain receptor (a type III receptor tyrosine kinase) (KDR), mRNA 5830 
ACTGAGTCCCGGGACCCCGGGAGAGCGGTCAGTGTGTGGTCGCTGCGTTTCCTCTGCCTGCGCCGGGCATCACTTGCGCGCCGCAGAAAGTCCGTCTGGCAGCCTGGATATCCTCTCCTACCGGCACCCGCAGACGCCCCTGCAGCCGCCGGTCGGCGCCCGGGCTCCCTAGCCCTGTGCGCTCAACTGTCCTGCGCTGCGGGGTGCCGCGAGTTCCACCTCCGCGCCTCCTTCTCTAGACAGGCGCTGGGAGAAAGAACCGGCTCCCGAGTTCTGGGCATTTCGCCCGGCTCGAGGTGCAGGATGCAGAGCAAGGTGCTGCTGGCCGTCGCCCTGTGGCTCTGCGTGGAGACCCGGGCCGCCTCTGTGGGTTTGCCTAGTGTTTCTCTTGATCTGCCCAGGCTCAGCATACAAAAAGACATACTTACAATTAAGGCTAATACAACTCTTCAAATTACTTGCAGGGGACAGAGGGACTTGGACTGGCTTTGGCCCAATAATCAGAGTGGCAGTGAGCAAAGGGTGGAGGTGACTGAGTGCAGCGATGGCCTCTTCTGTAAGACACTCACAATTCCAAAAGTGATCGGAAATGACACTGGAGCCTACAAGTGCTTCTACCGGGAAACTGACTTGGCCTCGGTCATTTATGTCTATGTTCAAGATTACAGATCTCCATTTATTGCTTCTGTTAGTGACCAACATGGAGTCGTGTACATTACTGAGAACAAAAACAAAACTGTGGTGATTCCATGTCTCGGGTCCATTTCAAATCTCAACGTGTCACTTTGTGCAAGATACCCAGAAAAGAGATTTGTTCCTGATGGTAACAGAATTTCCTGGGACAGCAAGAAGGGCTTTACTATTCCCAGCTACATGATCAGCTATGCTGGCATGGTCTTCTGTGAAGCAAAAATTAATGATGAAAGTTACCAGTCTATTATGTACATAGTTGTCGTTGTAGGGTATAGGATTTATGATGTGGTTCTGAGTCCGTCTCATGGAATTGAACTATCTGTTGGAGAAAAGCTTGTCTTAAATTGTACAGCAAGAACTGAACTAAATGTGGGGATTGACTTCAACTGGGAATACCCTTCTTCGAAGCATCAGCATAAGAAACTTGTAAACCGAGACCTAAAAACCCAGTCTGGGAGTGAGATGAAGAAATTTTTGAGCACCTTAACTATAGATGGTGTAACCCGGAGTGACCAAGGATTGTACACCTGTGCAGCATCCAGTGGGCTGATGACCAAGAAGAACAGCACATTTGTCAGGGTCCATGAAAAACCTTTTGTTGCTTTTGGAAGTGGCATGGAATCTCTGGTGGAAGCCACGGTGGGGGAGCGTGTCAGAATCCCTGCGAAGTACCTTGGTTACCCACCCCCAGAAATAAAATGGTATAAAAATGGAATACCCCTTGAGTCCAATCACACAATTAAAGCGGGGCATGTACTGACGATTATGGAAGTGAGTGAAAGAGACACAGGAAATTACACTGTCATCCTTACCAATCCCATTTCAAAGGAGAAGCAGAGCCATGTGGTCTCTCTGGTTGTGTATGTCCCACCCCAGATTGGTGAGAAATCTCTAATCTCTCCTGTGGATTCCTACCAGTACGGCACCACTCAAACGCTGACATGTACGGTCTATGCCATTCCTCCCCCGCATCACATCCACTGGTATTGGCAGTTGGAGGAAGAGTGCGCCAACGAGCCCAGCCAAGCTGTCTCAGTGACAAACCCATACCCTTGTGAAGAATGGAGAAGTGTGGAGGACTTCCAGGGAGGAAATAAAATTGAAGTTAATAAAAATCAATTTGCTCTAATTGAAGGAAAAAACAAAACTGTAAGTACCCTTGTTATCCAAGCGGCAAATGTGTCAGCTTTGTACAAATGTGAAGCGGTCAACAAAGTCGGGAGAGGAGAGAGGGTGATCTCCTTCCACGTGACCAGGGGTCCTGAAATTACTTTGCAACCTGACATGCAGCCCACTGAGCAGGAGAG
CGTGTCTTTGTGGTGCACTGCAGACAGATCTACGTTTGAGAACCTCACATGGTACAAGCTTGGCCCACAGCCTCTGCCAATCCATGTGGGAGAGTTGCCCACACCTGTTTGCAAGAACTTGGATACTCTTTGGAAATTGAATGCCACCATGTTCTCTAATAGCACAAATGACATTTTGATCATGGAGCTTAAGAATGCATCCTTGCAGGACCAAGGAGACTATGTCTGCCTTGCTCAAGACAGGAAGACCAAGAAAAGACATTGCGTGGTCAGGCAGCTCACAGTCCTAGAGCGTGTGGCACCCACGATCACAGGAAACCTGGAGAATCAGACGACAAGTATTGGGGAAAGCATCGAAGTCTCATGCACGGCATCTGGGAATCCCCCTCCACAGATCATGTGGTTTAAAGATAATGAGACCCTTGTAGAAGACTCAGGCATTGTATTGAAGGATGGGAACCGGAACCTCACTATCCGCAGAGTGAGGAAGGAGGACGAAGGCCTCTACACCTGCCAGGCATGCAGTGTTCTTGGCTGTGCAAAAGTGGAGGCATTTTTCATAATAGAAGGTGCCCAGGAAAAGACGAACTTGGAAATCATTATTCTAGTAGGCACGGCGGTGATTGCCATGTTCTTCTGGCTACTTCTTGTCATCATCCTACGGACCGTTAAGCGGGCCAATGGAGGGGAACTGAAGACAGGCTACTTGTCCATCGTCATGGATCCAGATGAACTCCCATTGGATGAACATTGTGAACGACTGCCTTATGATGCCAGCAAATGGGAATTCCCCAGAGACCGGCTGAAGCTAGGTAAGCCTCTTGGCCGTGGTGCCTTTGGCCAAGTGATTGAAGCAGATGCCTTTGGAATTGACAAGACAGCAACTTGCAGGACAGTAGCAGTCAAAATGTTGAAAGAAGGAGCAACACACAGTGAGCATCGAGCTCTCATGTCTGAACTCAAGATCCTCATTCATATTGGTCACCATCTCAATGTGGTCAACCTTCTAGGTGCCTGTACCAAGCCAGGAGGGCCACTCATGGTGATTGTGGAATTCTGCAAATTTGGAAACCTGTCCACTTACCTGAGGAGCAAGAGAAATGAATTTGTCCCCTACAAGACCAAAGGGGCACGATTCCGTCAAGGGAAAGACTACGTTGGAGCAATCCCTGTGGATCTGAAACGGCGCTTGGACAGCATCACCAGTAGCCAGAGCTCAGCCAGCTCTGGATTTGTGGAGGAGAAGTCCCTCAGTGATGTAGAAGAAGAGGAAGCTCCTGAAGATCTGTATAAGGACTTCCTGACCTTGGAGCATCTCATCTGTTACAGCTTCCAAGTGGCTAAGGGCATGGAGTTCTTGGCATCGCGAAAGTGTATCCACAGGGACCTGGCGGCACGAAATATCCTCTTATCGGAGAAGAACGTGGTTAAAATCTGTGACTTTGGCTTGGCCCGGGATATTTATAAAGATCCAGATTATGTCAGAAAAGGAGATGCTCGCCTCCCTTTGAAATGGATGGCCCCAGAAACAATTTTTGACAGAGTGTACACAATCCAGAGTGACGTCTGGTCTTTTGGTGTTTTGCTGTGGGAAATATTTTCCTTAGGTGCTTCTCCATATCCTGGGGTAAAGATTGATGAAGAATTTTGTAGGCGATTGAAAGAAGGAACTAGAATGAGGGCCCCTGATTATACTACACCAGAAATGTACCAGACCATGCTGGACTGCTGGCACGGGGAGCCCAGTCAGAGACCCACGTTTTCAGAGTTGGTGGAACATTTGGGAAATCTCTTGCAAGCTAATGCTCAGCAGGATGGCAAAGACTACATTGTTCTTCCGATATCAGAGACTTTGAGCATGGAAGAGGATTCTGGACTCTCTCTGCCTACCTCACCTGTTTCCTGTATGGAGGAGGAGGAAGTATGTGACCCCAAATTCCATTATGACAACACAGCAGGAATCAGTCAGTATCTGCAGAACAGTAAGCGAAAGAGCCGGC
CTGTGAGTGTAAAAACATTTGAAGATATCCCGTTAGAAGAACCAGAAGTAAAAGTAATCCCAGATGACAACCAGACGGACAGTGGTATGGTTCTTGCCTCAGAAGAGCTGAAAACTTTGGAAGACAGAACCAAATTATCTCCATCTTTTGGTGGAATGGTGCCCAGCAAAAGCAGGGAGTCTGTGGCATCTGAAGGCTCAAACCAGACAAGCGGCTACCAGTCCGGATATCACTCCGATGACACAGACACCACCGTGTACTCCAGTGAGGAAGCAGAACTTTTAAAGCTGATAGAGATTGGAGTGCAAACCGGTAGCACAGCCCAGATTCTCCAGCCTGACTCGGGGACCACACTGAGCTCTCCTCCTGTTTAAAAGGAAGCATCCACACCCCAACTCCCGGACATCACATGAGAGGTCTGCTCAGATTTTGAAGTGTTGTTCTTTCCACCAGCAGGAAGTAGCCGCATTTGATTTTCATTTCGACAACAGAAAAAGGACCTCGGACTGCAGGGAGCCAGTCTTCTAGGCATATCCTGGAAGAGGCTTGTGACCCAAGAATGTGTCTGTGTCTTCTCCCAGTGTTGACCTGATCCTCTTTTTTCATTCATTTAAAAAGCATTATCATGCCCCTGCTGCGGGTCTCACCATGGGTTTAGAACAAAGAGCTTCAAGCAATGGCCCCATCCTCAAAGAAGTAGCAGTACCTGGGGAGCTGACACTTCTGTAAAACTAGAAGATAAACCAGGCAACGTAAGTGTTCGAGGTGTTGAAGATGGGAAGGATTTGCAGGGCTGAGTCTATCCAAGAGGCTTTGTTTAGGACGTGGGTCCCAAGCCAAGCCTTAAGTGTGGAATTCGGATTGATAGAAAGGAAGACTAACGTTACCTTGCTTTGGAGAGTACTGGAGCCTGCAAATGCATTGTGTTTGCTCTGGTGGAGGTGGGCATGGGGTCTGTTCTGAAATGTAAAGGGTTCAGACGGGGTTTCTGGTTTTAGAAGGTTGCGTGTTCTTCGAGTTGGGCTAAAGTAGAGTTCGTTGTGCTGTTTCTGACTCCTAATGAGAGTTCCTTCCAGACCGTTAGCTGTCTCCTTGCCAAGCCCCAGGAAGAAAATGATGCAGCTCTGGCTCCTTGTCTCCCAGGCTGATCCTTTATTCAGAATACCACAAAGAAAGGACATTCAGCTCAAGGCTCCCTGCCGTGTTGAAGAGTTCTGACTGCACAAACCAGCTTCTGGTTTCTTCTGGAATGAATACCCTCATATCTGTCCTGATGTGATATGTCTGAGACTGAATGCGGGAGGTTCAATGTGAAGCTGTGTGTGGTGTCAAAGTTTCAGGAAGGATTTTACCCTTTTGTTCTTCCCCCTGTCCCCAACCCACTCTCACCCCGCAACCCATCAGTATTTTAGTTATTTGGCCTCTACTCCAGTAAACCTGATTGGGTTTGTTCACTCTCTGAATGATTATTAGCCAGACTTCAAAATTATTTTATAGCCCAAATTATAACATCTATTGTATTATTTAGACTTTTAACATATAGAGCTATTTCTACTGATTTTTGCCCTTGTTCTGTCCTTTTTTTCAAAAAAGAAAATGTGTTTTTTGTTTGGTACCATAGTGTGAAATGCTGGGAACAATGACTATAAGACATGCTATGGCACATATATTTATAGTCTGTTTATGTAGAAACAAATGTAATATATTAAAGCCTTATATATAATGAACTTTGTACTATTCACATTTTGTATCAGTATTATGTAGCATAACAAAGGTCATAATGCTTTCAGCAATTGATGTCATTTTATTAAAGAACATTGAAAAACTTGA short.blx100644000766000024 3630614006427353 16272 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBLASTX 2.0.13 [May-26-2000] Reference: Altschul, Stephen F., 
Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= 18.ctg12393 (6959 letters) Database: h_nrNov10_2000 57,234 sequences; 18,303,128 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gi|728837|sp|P39194|ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAM... 161 8e-39 gi|728836|sp|P39193|ALU6_HUMAN ALU SUBFAMILY SP SEQUENCE CONTAM... 156 3e-37 >gi|728837|sp|P39194|ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION WARNING ENTRY Length = 593 Score = 161 bits (404), Expect = 8e-39 Identities = 73/97 (75%), Positives = 76/97 (78%) Frame = -1 Query: 2996 FFLRQSFTLVTQAGVQWHDLSSLQPLPPRFKGFSSLSLPISWDYRRLPPCLANFCIFHKD 2817 FFLR+SF LV QAGVQW DL SLQP PP FK FS LSLP SWDYRR PP ANFCIF +D Sbjct: 299 FFLRRSFALVAQAGVQWRDLGSLQPPPPGFKRFSCLSLPSSWDYRRPPPRPANFCIFSRD 358 Query: 2816 GVLPCWPGWS*TPDLR*YAHFGIPKCWDYRREPPCPA 2706 GV PCWPGWS TPDLR G+PKCWDYRREPP PA Sbjct: 359 GVSPCWPGWSRTPDLRXSTRLGLPKCWDYRREPPRPA 395 Score = 156 bits (391), Expect = 3e-37 Identities = 76/97 (78%), Positives = 83/97 (85%) Frame = +3 Query: 2706 GRARWLTPVIPALWDAKVGISSEVRSSRPAWPTWQNSVFMKNTKISQAWWQAPVIPANWE 2885 GRARWLTPVIPALW+A+ G S EVRSSRPAWPTW N V KNTKIS+AWW+APVIPA E Sbjct: 1 GRARWLTPVIPALWEAEAGGSPEVRSSRPAWPTWXNPVSTKNTKISRAWWRAPVIPATRE 60 Query: 2886 AEAGESLESRRQRLQ*AEIVPLHSSLGDKSKTLSQKK 2996 AEAGESLE R+RLQ AEI PLHSSLG+KS+T SQKK Sbjct: 61 AEAGESLEPGRRRLQXAEIAPLHSSLGNKSETPSQKK 97 Score = 140 bits (349), Expect = 2e-32 Identities = 74/101 (73%), Positives = 79/101 (77%) Frame = -2 Query: 2995 FF*DRVLLLSPRLECNGTISAHCNLCLLDSRDSPASASQLAGITGACHHAWLIFVFFIKT 2816 FF D V LL PRLEC+G ISAHCNL L S DSPASAS++AGITGA HHA LIFVF ++T Sbjct: 399 FFXDGVSLLLPRLECSGAISAHCNLRLPGSSDSPASASRVAGITGARHHARLIFVFLVET 458 Query: 2815 EFCHVGQAGLELLTSDDMPTLASQSAGITGVSHRARPGIMF 2693 F 
HVGQAGLELLTS D P ASQSAGITGVSHRAR F Sbjct: 459 GFHHVGQAGLELLTSGDPPASASQSAGITGVSHRARXXXFF 499 Score = 134 bits (334), Expect = 1e-30 Identities = 67/89 (75%), Positives = 71/89 (79%) Frame = +1 Query: 2707 AGHGGSRL*SQHFGMPKWAYHLRSGVQDQPGQHGKTPSL*KIQKLARHGGRRL*SQLIGR 2886 AG GGSRL SQHFG P+ HLRSGV+DQPGQHG+TPSL KIQKLA GG RL SQL+GR Sbjct: 101 AGRGGSRLXSQHFGRPRRVDHLRSGVRDQPGQHGETPSLLKIQKLAGRGGGRLXSQLLGR 160 Query: 2887 LRLENPLNLGGRGCSELRSCHCTPAWVTR 2973 LR EN LN GG GCSE RS HCTPAW TR Sbjct: 161 LRQENRLNPGGGGCSEPRSRHCTPAWATR 189 Score = 129 bits (321), Expect = 4e-29 Identities = 72/101 (71%), Positives = 75/101 (73%) Frame = +2 Query: 2693 KHYTWPGTVAHACNPSTLGCQSGHII*GQEFKTSLANMAKLRLYEKYKN*PGMVAGACNP 2872 K PG VAHACNPSTLG + G I GQEF+TSLANM K RLY KYKN PG+VAGACNP Sbjct: 195 KKXXXPGAVAHACNPSTLGGRGGWITXGQEFETSLANMVKPRLYXKYKNXPGVVAGACNP 254 Query: 2873 S*LGG*GWRIP*I*EAEVAVS*DRAIALQPG*QE*NSVSKK 2995 S GG G RI EAEVAVS DRA ALQPG QE NSVSKK Sbjct: 255 SYSGGXGRRIAXTREAEVAVSRDRATALQPGQQERNSVSKK 295 Score = 119 bits (295), Expect = 5e-26 Identities = 74/101 (73%), Positives = 79/101 (77%), Gaps = 54/101 (53%) Frame = +1 Query: 5179 FFLRWSFALSPRLECSGAISAHCNLRLPGSSNSPASASRVAGTTGVRHHTRLIFVFFVKT 5358 FF L PRLECSGAISAHCNLRLPGSS+SPASASRVAG TG RHH RLIFVF V+T Sbjct: 399 FFXDGVSLLLPRLECSGAISAHCNLRLPGSSDSPASASRVAGITGARHHARLIFVFLVET 458 Query: 5359 GFHHVGQ------------------AGLN------------------------------- 5391 GFHHVGQ AG+ Sbjct: 459 GFHHVGQAGLELLTSGDPPASASQSAGITGVSHRARXXXFFETEFRSCCPGWSAVARSRL 518 Query: 5392 -----SRPQVIHLSQPPKVLGLQV*ATVPGYFMDF 5481 SR Q I L QPP+ LGLQ AT PG F+ F Sbjct: 519 TATSASRVQAILLPQPPEXLGLQAPATTPGXFLYF 553 Score = 113 bits (281), Expect = 2e-24 Identities = 65/94 (69%), Positives = 73/94 (77%), Gaps = 55/94 (58%) Frame = +3 Query: 5205 VTQAGMQWCNLSSLQPPPPRFKQFSCLSLPSSWDYRCAPPHQANFCIFCKDRFSPRWPGW 5384 V QAG+QW +L SLQPPPP FK+FSCLSLPSSWDYR PP ANFCIF +D SP WPGW Sbjct: 308 VAQAGVQWRDLGSLQPPPPGFKRFSCLSLPSSWDYRRPPPRPANFCIFSRDGVSPCWPGW 367 Query: 5385 
S-----------------------------------------ELQTSG------------ 5405 S L+ SG Sbjct: 368 SRTPDLRXSTRLGLPKCWDYRREPPRPAXXXFFXDGVSLLLPRLECSGAISAHCNLRLPG 427 Query: 5406 --DSPVSASQSAGITGVSHRARLLHGFLI 5486 DSP SAS+ AGITG H ARL+ FL+ Sbjct: 428 SSDSPASASRVAGITGARHHARLIFVFLV 456 Score = 111 bits (276), Expect(2) = 1e-32 Identities = 51/63 (80%), Positives = 55/63 (86%) Frame = -1 Query: 5387 RPAWPTW*KPVFTKNTKISLVWWRTPVVPATREAEAGELLEPGRRRLQ*AEIAPLHSSLG 5208 RPAWPTW PV TKNTKIS WWR PV+PATREAEAGE LEPGRRRLQ AEIAPLHSSLG Sbjct: 28 RPAWPTWXNPVSTKNTKISRAWWRAPVIPATREAEAGESLEPGRRRLQXAEIAPLHSSLG 87 Query: 5207 DRA 5199 +++ Sbjct: 88 NKS 90 Score = 89.7 bits (219), Expect(2) = 6e-24 Identities = 45/61 (73%), Positives = 45/61 (73%) Frame = -2 Query: 5386 DQPGQRGENLSLQKIQKLAWCGGAHL*SQLLGRLRQENCLNLGGGGCSELRLHHCIPAWV 5207 DQPGQ GE SL KIQKLA GG L SQLLGRLRQEN LN GGGGCSE R HC PAW Sbjct: 128 DQPGQHGETPSLLKIQKLAGRGGGRLXSQLLGRLRQENRLNPGGGGCSEPRSRHCTPAWA 187 Query: 5206 T 5204 T Sbjct: 188 T 188 Score = 87.8 bits (214), Expect = 1e-16 Identities = 53/89 (59%), Positives = 56/89 (62%) Frame = -3 Query: 2973 SCHPGWSAMARSQLTATSASXXXXXXXXXXXXXXXXXXXATMPG*FLYFS*RRSFAMLAR 2794 SC PGWSA+ARS+LTATSAS AT PG FLYF RR F MLAR Sbjct: 505 SCCPGWSAVARSRLTATSASRVQAILLPQPPEXLGLQAPATTPGXFLYFXXRRGFTMLAR 564 Query: 2793 LVLNS*PQMICPLWHPKVLGLQA*ATVPG 2707 LV NS PQ+I P PKVLGLQA AT PG Sbjct: 565 LVSNSXPQVIHPPRPPKVLGLQAXATAPG 593 Score = 83.1 bits (202), Expect = 4e-15 Identities = 45/62 (72%), Positives = 48/62 (76%) Frame = -3 Query: 5388 QTSLANVVKTCLYKKYKN*PGVVAHTCSPSYSGG*GRRIA*TWEAEVAVS*DCTTAFQPG 5209 +TSLAN+VK LY KYKN PGVVA C+PSYSGG GRRIA T EAEVAVS D TA QPG Sbjct: 226 ETSLANMVKPRLYXKYKNXPGVVAGACNPSYSGGXGRRIAXTREAEVAVSRDRATALQPG 285 Query: 5208 *Q 5203 Q Sbjct: 286 QQ 287 Score = 62.8 bits (150), Expect = 5e-09 Identities = 36/61 (59%), Positives = 41/61 (67%) Frame = +2 Query: 5204 CHPGWNAVVQSQLTATSASQVQAIXXXXXXXXXXXXVCATTPG*FLYFL*RQVFTTLARL 5383 C PGW+AV +S+LTATSAS+VQAI ATTPG FLYF R+ FT LARL Sbjct: 506 
CCPGWSAVARSRLTATSASRVQAILLPQPPEXLGLQAPATTPGXFLYFXXRRGFTMLARL 565 Query: 5384 V 5386 V Sbjct: 566 V 566 Score = 50.4 bits (118), Expect(2) = 1e-32 Identities = 23/26 (88%), Positives = 23/26 (88%) Frame = -2 Query: 5464 RARWLTPVIPALWEAETGESPEVWSS 5387 RARWLTPVIPALWEAE G SPEV SS Sbjct: 2 RARWLTPVIPALWEAEAGGSPEVRSS 27 Score = 47.6 bits (111), Expect = 2e-04 Identities = 23/38 (60%), Positives = 24/38 (62%) Frame = +2 Query: 5390 TPDLR*FTCLSLPKCWDYRCEPPCPATSWIFDYRLCLL 5503 TPDLR T L LPKCWDYR EPP PA F + LL Sbjct: 370 TPDLRXSTRLGLPKCWDYRREPPRPAXXXFFXDGVSLL 407 Score = 43.4 bits (100), Expect(2) = 6e-24 Identities = 20/27 (74%), Positives = 23/27 (85%) Frame = -3 Query: 5466 AGHGGSHL*SQHFGRLRQVNHLRSGVQ 5386 AG GGS L SQHFGR R+V+HLRSGV+ Sbjct: 101 AGRGGSRLXSQHFGRPRRVDHLRSGVR 127 >gi|728836|sp|P39193|ALU6_HUMAN ALU SUBFAMILY SP SEQUENCE CONTAMINATION WARNING ENTRY Length = 593 Score = 156 bits (390), Expect = 3e-37 Identities = 71/97 (73%), Positives = 74/97 (76%) Frame = -1 Query: 2996 FFLRQSFTLVTQAGVQWHDLSSLQPLPPRFKGFSSLSLPISWDYRRLPPCLANFCIFHKD 2817 FFLR+SF LV QAGVQW DL S QP PP FK FS LSLP SWDYR PP ANFCIF +D Sbjct: 299 FFLRRSFALVAQAGVQWRDLGSPQPPPPGFKRFSCLSLPSSWDYRHAPPRPANFCIFSRD 358 Query: 2816 GVLPCWPGWS*TPDLR*YAHFGIPKCWDYRREPPCPA 2706 GV PCW GWS TPDLR A G+PKCWDYRREPP PA Sbjct: 359 GVSPCWSGWSRTPDLRXSARLGLPKCWDYRREPPRPA 395 Score = 146 bits (364), Expect = 4e-34 Identities = 72/97 (74%), Positives = 81/97 (83%) Frame = +3 Query: 2706 GRARWLTPVIPALWDAKVGISSEVRSSRPAWPTWQNSVFMKNTKISQAWWQAPVIPANWE 2885 GRARWLTPVIPALW+A+ G S EV SSRPA PTW+N V KNTKIS+AWW+ PVIPA E Sbjct: 1 GRARWLTPVIPALWEAEAGGSPEVGSSRPAXPTWRNPVSTKNTKISRAWWRMPVIPATRE 60 Query: 2886 AEAGESLESRRQRLQ*AEIVPLHSSLGDKSKTLSQKK 2996 AEAGESLE R+RL+ AEI PLHSSLG+KS+T SQKK Sbjct: 61 AEAGESLEPGRRRLRXAEIAPLHSSLGNKSETPSQKK 97 Score = 133 bits (332), Expect = 2e-30 Identities = 72/101 (71%), Positives = 76/101 (74%) Frame = -2 Query: 2995 FF*DRVLLLSPRLECNGTISAHCNLCLLDSRDSPASASQLAGITGACHHAWLIFVFFIKT 2816 FF D V LL PRLECNG ISAH NL L S 
DSPASAS++AGITG HHA LIFVF ++T Sbjct: 399 FFXDGVSLLLPRLECNGAISAHRNLRLPGSSDSPASASRVAGITGMRHHARLIFVFLVET 458 Query: 2815 EFCHVGQAGLELLTSDDMPTLASQSAGITGVSHRARPGIMF 2693 F HVGQAGLEL TS D P ASQSAGITGVSHRAR F Sbjct: 459 GFLHVGQAGLELPTSGDPPASASQSAGITGVSHRARXXXFF 499 Score = 129 bits (320), Expect = 5e-29 Identities = 65/89 (73%), Positives = 69/89 (77%) Frame = +1 Query: 2707 AGHGGSRL*SQHFGMPKWAYHLRSGVQDQPGQHGKTPSL*KIQKLARHGGRRL*SQLIGR 2886 AG GGSRL SQHFG P+ A HLRSGV+DQP QHG+TPSL KIQKLA GG L SQL+GR Sbjct: 101 AGRGGSRLXSQHFGRPRRADHLRSGVRDQPDQHGETPSLLKIQKLAGRGGACLXSQLLGR 160 Query: 2887 LRLENPLNLGGRGCSELRSCHCTPAWVTR 2973 LR EN LN GG GC E RS HCTPAW TR Sbjct: 161 LRQENRLNPGGGGCGEPRSRHCTPAWATR 189 Score = 125 bits (311), Expect = 6e-28 Identities = 70/101 (69%), Positives = 74/101 (72%) Frame = +2 Query: 2693 KHYTWPGTVAHACNPSTLGCQSGHII*GQEFKTSLANMAKLRLYEKYKN*PGMVAGACNP 2872 K PG VAHACNPSTLG + G I G+EF+TSL NM K RLY KYKN PG+VA ACNP Sbjct: 195 KKXXXPGAVAHACNPSTLGGRGGRITXGREFETSLTNMEKPRLYXKYKNXPGVVAHACNP 254 Query: 2873 S*LGG*GWRIP*I*EAEVAVS*DRAIALQPG*QE*NSVSKK 2995 S GG G RI EAEVAVS DRAIALQPG QE NSVSKK Sbjct: 255 SYSGGXGRRIAXTREAEVAVSRDRAIALQPGQQERNSVSKK 295 Score = 109 bits (270), Expect = 4e-23 Identities = 71/101 (70%), Positives = 78/101 (76%), Gaps = 54/101 (53%) Frame = +1 Query: 5179 FFLRWSFALSPRLECSGAISAHCNLRLPGSSNSPASASRVAGTTGVRHHTRLIFVFFVKT 5358 FF L PRLEC+GAISAH NLRLPGSS+SPASASRVAG TG+RHH RLIFVF V+T Sbjct: 399 FFXDGVSLLLPRLECNGAISAHRNLRLPGSSDSPASASRVAGITGMRHHARLIFVFLVET 458 Query: 5359 GFHHVGQ------------------AGLN------------------------------- 5391 GF HVGQ AG+ Sbjct: 459 GFLHVGQAGLELPTSGDPPASASQSAGITGVSHRARXXXFFETEFRSCCPGWSAMARSRL 518 Query: 5392 -----SRPQVIHLSQPPKVLGLQV*ATVPGYFMDF 5481 SR Q I L QPP+ LGLQ AT PG F+ F Sbjct: 519 TATSASRVQAILLPQPPEXLGLQACATTPGXFLYF 553 Score = 105 bits (260), Expect = 6e-22 Identities = 62/94 (65%), Positives = 71/94 (74%), Gaps = 55/94 (58%) Frame = +3 Query: 5205 VTQAGMQWCNLSSLQPPPPRFKQFSCLSLPSSWDYRCAPPHQANFCIFCKDRFSPRWPGW 5384 V QAG+QW 
+L S QPPPP FK+FSCLSLPSSWDYR APP ANFCIF +D SP W GW Sbjct: 308 VAQAGVQWRDLGSPQPPPPGFKRFSCLSLPSSWDYRHAPPRPANFCIFSRDGVSPCWSGW 367 Query: 5385 S-----------------------------------------------------ELQTSG 5405 S L+ G Sbjct: 368 SRTPDLRXSARLGLPKCWDYRREPPRPAXXXFFXDGVSLLLPRLECNGAISAHRNLRLPG 427 Query: 5406 DS--PVSASQSAGITGVSHRARLLHGFLI 5486 S P SAS+ AGITG+ H ARL+ FL+ Sbjct: 428 SSDSPASASRVAGITGMRHHARLIFVFLV 456 Score = 104 bits (258), Expect(2) = 1e-30 Identities = 49/63 (77%), Positives = 54/63 (84%) Frame = -1 Query: 5387 RPAWPTW*KPVFTKNTKISLVWWRTPVVPATREAEAGELLEPGRRRLQ*AEIAPLHSSLG 5208 RPA PTW PV TKNTKIS WWR PV+PATREAEAGE LEPGRRRL+ AEIAPLHSSLG Sbjct: 28 RPAXPTWRNPVSTKNTKISRAWWRMPVIPATREAEAGESLEPGRRRLRXAEIAPLHSSLG 87 Query: 5207 DRA 5199 +++ Sbjct: 88 NKS 90 Score = 88.2 bits (215), Expect = 1e-16 Identities = 53/89 (59%), Positives = 56/89 (62%) Frame = -3 Query: 2973 SCHPGWSAMARSQLTATSASXXXXXXXXXXXXXXXXXXXATMPG*FLYFS*RRSFAMLAR 2794 SC PGWSAMARS+LTATSAS AT PG FLYF RR F+ML R Sbjct: 505 SCCPGWSAMARSRLTATSASRVQAILLPQPPEXLGLQACATTPGXFLYFXXRRGFSMLVR 564 Query: 2793 LVLNS*PQMICPLWHPKVLGLQA*ATVPG 2707 LV NS PQ+I P PKVLGLQA AT PG Sbjct: 565 LVSNSRPQVIRPPRPPKVLGLQAXATAPG 593 Score = 85.8 bits (209), Expect(2) = 2e-22 Identities = 44/61 (72%), Positives = 44/61 (72%) Frame = -2 Query: 5386 DQPGQRGENLSLQKIQKLAWCGGAHL*SQLLGRLRQENCLNLGGGGCSELRLHHCIPAWV 5207 DQP Q GE SL KIQKLA GGA L SQLLGRLRQEN LN GGGGC E R HC PAW Sbjct: 128 DQPDQHGETPSLLKIQKLAGRGGACLXSQLLGRLRQENRLNPGGGGCGEPRSRHCTPAWA 187 Query: 5206 T 5204 T Sbjct: 188 T 188 Score = 80.8 bits (196), Expect = 2e-14 Identities = 43/62 (69%), Positives = 46/62 (73%) Frame = -3 Query: 5388 QTSLANVVKTCLYKKYKN*PGVVAHTCSPSYSGG*GRRIA*TWEAEVAVS*DCTTAFQPG 5209 +TSL N+ K LY KYKN PGVVAH C+PSYSGG GRRIA T EAEVAVS D A QPG Sbjct: 226 ETSLTNMEKPRLYXKYKNXPGVVAHACNPSYSGGXGRRIAXTREAEVAVSRDRAIALQPG 285 Query: 5208 *Q 5203 Q Sbjct: 286 QQ 287 Score = 63.2 bits (151), Expect = 4e-09 Identities = 34/61 (55%), Positives = 41/61 (66%) Frame = +2 Query: 5204 
CHPGWNAVVQSQLTATSASQVQAIXXXXXXXXXXXXVCATTPG*FLYFL*RQVFTTLARL 5383 C PGW+A+ +S+LTATSAS+VQAI CATTPG FLYF R+ F+ L RL Sbjct: 506 CCPGWSAMARSRLTATSASRVQAILLPQPPEXLGLQACATTPGXFLYFXXRRGFSMLVRL 565 Query: 5384 V 5386 V Sbjct: 566 V 566 Score = 50.8 bits (119), Expect(2) = 1e-30 Identities = 23/26 (88%), Positives = 23/26 (88%) Frame = -2 Query: 5464 RARWLTPVIPALWEAETGESPEVWSS 5387 RARWLTPVIPALWEAE G SPEV SS Sbjct: 2 RARWLTPVIPALWEAEAGGSPEVGSS 27 Score = 45.7 bits (106), Expect = 7e-04 Identities = 22/38 (57%), Positives = 23/38 (59%) Frame = +2 Query: 5390 TPDLR*FTCLSLPKCWDYRCEPPCPATSWIFDYRLCLL 5503 TPDLR L LPKCWDYR EPP PA F + LL Sbjct: 370 TPDLRXSARLGLPKCWDYRREPPRPAXXXFFXDGVSLL 407 Score = 41.8 bits (96), Expect(2) = 2e-22 Identities = 19/27 (70%), Positives = 22/27 (81%) Frame = -3 Query: 5466 AGHGGSHL*SQHFGRLRQVNHLRSGVQ 5386 AG GGS L SQHFGR R+ +HLRSGV+ Sbjct: 101 AGRGGSRLXSQHFGRPRRADHLRSGVR 127 Database: h_nrNov10_2000 Posted date: Nov 10, 2000 9:40 AM Number of letters in database: 18,303,128 Number of sequences in database: 57,234 Lambda K H 0.318 0.135 0.401 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 262249022 Number of Sequences: 57234 Number of extensions: 6465920 Number of successful extensions: 16162 Number of sequences better than 1.0e-03: 302 Number of HSP's better than 0.0 without gapping: 147 Number of HSP's successfully gapped in prelim test: 4 Number of HSP's that attempted gapping in prelim test: 14866 Number of HSP's gapped (non-prelim): 1104 length of query: 2319 length of database: 18,303,128 effective HSP length: 55 effective length of query: 2264 effective length of database: 15,155,258 effective search space: 34311504112 effective search space used: 34311504112 frameshift window, decay const: 50, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.7 bits) S2: 105 (45.3 bits) test.embl100644000766000024 3241014006427353 16234 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID SC10H5 standard; DNA; PRO; 4870 BP. XX AC AL031232; XX DE Streptomyces coelicolor cosmid 10H5. XX KW integral membrane protein. XX OS Streptomyces coelicolor OC Eubacteria; Firmicutes; Actinomycetes; Streptomycetes; OC Streptomycetaceae; Streptomyces. XX RN [1] RP 1-4870 RA Oliver K., Harris D.; RT ; RL Unpublished. XX RN [2] RP 1-4870 RA Parkhill J., Barrell B.G., Rajandream M.A.; RT ; RL Submitted (10-AUG-1998) to the EMBL/GenBank/DDBJ databases. RL Streptomyces coelicolor sequencing project, RL Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA RL E-mail: barrell@sanger.ac.uk RL Cosmids supplied by Prof. David A. Hopwood, [3] RL John Innes Centre, Norwich Research Park, Colney, RL Norwich, Norfolk NR4 7UH, UK. XX RN [3] RP 1-4870 RA Redenbach M., Kieser H.M., Denapaite D., Eichner A., RA Cullum J., Kinashi H., Hopwood D.A.; RT "A set of ordered cosmids and a detailed genetic and physical RT map for the 8 Mb Streptomyces coelicolor A3(2) chromosome."; RL Mol. Microbiol. 21(1):77-96(1996). XX CC Notes: CC CC Streptomyces coelicolor sequencing at The Sanger Centre is funded CC by the BBSRC. CC CC Details of S. coelicolor sequencing at the Sanger Centre CC are available on the World Wide Web. CC (URL; http://www.sanger.ac.uk/Projects/S_coelicolor/) CC CC CDS are numbered using the following system eg SC7B7.01c. CC SC (S. coelicolor), 7B7 (cosmid name), .01 (first CDS), CC c (complementary strand). CC CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC CC The length in codons is given for each CDS. CC CC Usually the highest scoring match found by fasta -o is given for CC CDS which show significant similarity to other CDS in the database. CC The position of possible ribosome binding site sequences are CC given where these have been used to deduce the initiation codon. 
CC CC Gene prediction is based on positional base preference in codons CC using a specially developed Hidden Markov Model (Krogh et al., CC Nucleic Acids Research, 22(22):4768-4778(1994)) and the FramePlot CC program of Bibb et al., Gene 30:157-66(1984) as implemented at CC http://www.nih.go.jp/~jun/cgi-bin/frameplot.pl. CAUTION: We may CC not have predicted the correct initiation codon. Where possible CC we choose an initiation codon (atg, gtg, ttg or (att)) which is CC preceded by an upstream ribosome binding site sequence (optimally CC 5-13bp before the initiation codon). If this cannot be identified CC we choose the most upstream initiation codon. CC CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC CC Cosmid 10H5 lies to the right of 3A7 on the AseI-B genomic restriction CC fragment. XX FH Key Location/Qualifiers FH FT source 1..4870 FT /organism="Streptomyces coelicolor" FT /strain="A3(2)" FT /clone="cosmid 10H5" FT CDS complement(<1..327) FT /note="SC10H5.01c, unknown, partial CDS, len >109 aa; FT possible integral membrane protein" FT /gene="SC10H5.01c" FT /product="hypothetical protein SC10H5.01c" FT CDS complement(350..805) FT /note="SC10H5.02c, probable integral membrane protein, len: FT 151 aa; similar to S. coelicolor hypothetical protein FT TR:O54194 (EMBL:AL021411) SC7H1.35 (155 aa), fasta scores; FT opt: 431 z-score: 749.8 E(): 0, 53.5% identity in 114 aa FT overlap." 
FT /product="putative integral membrane protein" FT /gene="SC10H5.02c" FT RBS complement(812..815) FT /note="possible RBS upstream of SC10H5.02c" FT CDS complement(837..1301) FT /note="SC10H5.03c, probable integral membrane protein, len: FT 154 aa" FT /product="putative integral membrane protein" FT /gene="SC10H5.03c" FT RBS complement(1308..1312) FT /note="possible RBS upstream of SC10H5.03c" FT CDS complement(1427..1735) FT /note="SC10H5.04c, unknown, len: 103 aa; possible membrane" FT /gene="SC10H5.04c" FT /product="hypothetical protein SC10H5.04c" FT RBS complement(1738..1741) FT /note="possible RBS upstream of SC10H5.05c" FT misc_feature 1800^1801 FT /note="Zero-length feature added to test Bioperl parsing" FT CDS 1933..2022 FT /note="SC10H5.05, questionable ORF, len: 29 aa" FT /gene="SC10H5.05" FT /product="hypothetical protein SC10H5.05" FT CDS 2019..2642 FT /note="SC10H5.06, probable membrane protein, len: 207 aa; FT similar to S. coelicolor TR:O54192 SC7H1.33c (191 aa), FT fasta scores; opt: 312 z-score: 355.2 E(): 1.6e-12, 36.8% FT identity in 182 aa overlap" FT /product="putative membrane protein" FT /gene="SC10H5.06" FT RBS 2627..2631 FT /note="possible RBS upstream of SC10H5.07" FT CDS 2639..4048 FT /note="SC10H5.07, unknown, len: 469 aa" FT /gene="SC10H5.07" FT /product="hypothetical protein SC10H5.07" FT CDS complement(4100..4297) FT /note="SC10H5.08c, unknown, len: 65 aa" FT /gene="SC10H5.08c" FT /product="hypothetical protein SC10H5.08c" FT RBS complement(4314..4319) FT /note="possible RBS upstream of SC10H5.08c" FT CDS complement(4439..>4870) FT /note="SC10H5.09c, probable integral membrane protein, FT partial CDS len: >143 aa; some similarity in C-terminus to FT S. coelicolor hypothetical protein TR:O54106 FT (EMBL:AL021529) SC10A5.15 (114 aa), fasta scores; opt: 145 FT z-score: 233.8 E(): 9.2e-06, 33.3% identity in 81 aa FT overlap. 
Overlaps and extends SC3A7.01c" FT /product="putative integral membrane protein" FT /gene="SC10H5.09c" FT misc_feature 4769..4870 FT /note="overlap with cosmid 3A7 from 1 to 102" XX SQ Sequence 4870 BP; 769 A; 1717 C; 1693 G; 691 T; 0 other; gatcagtaga cccagcgaca gcagggcggg gcccagcagg ccggccgtgg cgtagagcgc 60 gaggacggcg accggcgtgg ccaccgacag gatggctgcg gcgacgcgga cgacaccgga 120 gtgtgccagg gcccaccaca cgccgatggc cgcgagcgcg agtcccgcgc tgccgaacag 180 ggcccacagc acactgcgca gaccggcggc cacgagtggc gccaggacgg tgcccagcag 240 gagcagcagg gtgacgtggg cgcgcgctgc actgtggccg ccccgtccgc ccgacgcgcg 300 cggctcgtca tctcgcggtc ccaccaccgg tcggccccat tactcgtcct caaccctgtg 360 gcgactgacg ttccccggac aggtcgtacc gattgccgcc acgccccacc acgcacaggg 420 cccagacgac gaagcctgac atggtgatca tgacgacgga ccacaccggg tagtacggca 480 gcgagaggaa gttggcgatg atcaccagcc cggcgatggc gaccccggtg acacgtgccc 540 acatcgccgt tttgagcagc ccggcgctga cgaccatggc gagcgcgccg agcgcgagat 600 ggatccaccc ccacccggtg agatcgaact ggaaaacgta gttgggcgtg gtgacgaaga 660 cgtcgtcctc ggcgatggcc atgatgcccc ggaagaggct gagcagcccg gcgaggaaga 720 gcatcaccgc cgcgaaggcg gtaaggcccg tcgcccattc ctgcctcgcg gtgtgtgccg 780 ggtggtgggt atgtgacgtg gtcatctcgg acctcgtttc gtggaatgcg gatgcttcag 840 cgagcggagg cgccggtgcc cgccgcgccc gtgtgccctg ccgggccgtg accggacagg 900 accaattcct tcgccttgcg gaactcctcg tccgtgatgg caccccggtc tcggatctcg 960 gagagccggg ccagctcgtc gacgctgctg gacccgccgc ccacggtctt cctgatgtag 1020 gcgtcgaact cctcctgctg agcccgtgcc cgcgttgtct cccggctgcc catgttcttg 1080 ccgcgagcga tcacgtagac gaaaacgccc aggaagggca ggaggatgca gaacaccaac 1140 cagccggcct tcgcccagcc actcagtccg tcgtcccgga agatgtcggt gacgacgcgg 1200 aagagcagga cgaaccacat gatccacagg aagatcatca gcatcgtcca gaaggcaccc 1260 agcagtgggt agtcgtacgc caggtaggtc tgtgcactca tgtccgtcct ccgtcctccg 1320 gggcgcggcc cggcggccct cgttccgtac tgacatcagg gtggtcacgg gtcccaccgg 1380 tcggcatcac ccggcacggg tgagtggggc gccgaggccg tcgtggtcag gcccgggaca 1440 ccggtgtgac cctggtggaa ggacgcgtcc cgtggggcac gcaccgccgg ccgagggcga 1500 
ccaccgcctc ggtcagtccg agcaggccca gccacaggcc gagaagtcgg gtcagggcac 1560 gggccgactc ggcgggcagc gcgaggacga cgattccggc gacgtcgacg gccagcgggt 1620 tgcgcaggcc cagcactccg gccggggcgc ccggcaccag cgtggcgagg gccgatgcca 1680 tgagccaggt ccaggaaccc ccaagcctgg cgaggacgtg cgccggatcg ctcaatgctc 1740 cggtgaccgc cccgcccgac ccgtctccct tgtcggcagg ttccgccgca tcacgcggaa 1800 cggagatggc tcccctgtgg atcgggcggc cgctgcgggg ccgcccggtt ggtcggtcgg 1860 tgagcgccgg actccccctt cagctcttcc agggtcgggg tcgacaccga ggtcctggat 1920 cacccgtcag gggtgatccg ggcatgccgt cgtggcggtg aggtgggata cgggaacgat 1980 cggcccacgg gggaccggac gagacgaaga gacgtgagat gagcgatacg aactcgggcg 2040 gcgggcgcca ggccgcttcc ggaccggccc cacgtggccg actccctttc cgccggcgcg 2100 tggccctggt cgctgtcgca cgtcccctga tcgtcacggt cggtctcgtc accgcctact 2160 acctgcttcc cctggacgag agactcagcg ccggcaccct ggtgtcgctg gtgtgcggac 2220 tgctcgcagt ccttctggtg ttctgctggg aggtgcgggc catcacgcgc tccccgcatc 2280 cgcgtctgag agcgatcgag ggcctggccg ccacgctggt gctgttcctg gtcctcttcg 2340 ccggctccta ctacctgctg ggtcgctccg cgcccggctc cttcagcgag ccgctgaaca 2400 ggacggacgc gctgtacttc actctgacca cgttcgccac cgtcggcttc ggggacatca 2460 ccgcacgctc cgagaccggg cggatcctca cgatggcgca gatgacggga gggctactgc 2520 tcgtcggagt cgccgcccgg gtgctggcga gcgcagtgca ggcggggctg caccgacagg 2580 gccggggacc ggcggcatcg ccacgctccg gtgctgcgga ggagccggag gccggaccat 2640 gaccgtaccc ggtggcttca ccgcctccct gccgccggcc gagcgagccg cgtacggcag 2700 gaaggcccgt aaaagggcct cacgttcgtg ccacggctgg tacgagccgg ggcagcggcg 2760 gcctgacccc gtcgacctgc tggagcgcca gtccggcgag cgtgtcccgg cactcgtgcc 2820 catccgctac ggtcgcatgc tggagtcgcc gttccgcttc taccgcggtg cggcagcgat 2880 catggcggcg gacctggcac ccctgcccag cagcggactc caggtgcaat tgtgcgggga 2940 cgcgcacccg ttgaacttcc ggctcctggc ctcaccggag cgccggctgg tcttcgacat 3000 caacgacttc gacgagacgc tgcccggccc cttcgagtgg gacgtcaaac ggctggcggc 3060 cggattcgtg atcgcggccc ggtcgaacgg cttctcgtcc aaggaacaga accgcaccgt 3120 tcgggcctgt gtgcgggcct accgggagcg catgagggag ttcgccgtca tgccgaccct 3180 ggacatctgg 
tacgcccagg acgacgccga ccacgtacgg caactgctgg ctacggaggc 3240 cagaggagaa gctgagcagc ggctcaggga cgcggctgcg aaggcccgca cacgcaccca 3300 catgagggcg ttcgcgaagc tcacccgcgt cacggccgag ggccggcgca tcacccccga 3360 cccgccgctg atcaccccac tcggcgatct gctcaccgac ccggccgaag ccggccggga 3420 ggaggaactg cggtccgtcg tgaacggcta cgcacggtcc ctgccgcccg agcgccggca 3480 cctgctgcgt cactaccggc ttgtggacat ggcgcgcaag gtggtcggcg tcggcagtgt 3540 cggcacccgc tgctgggtac tgcttctgct cggcagggac gacgacgatc ctctgctgct 3600 ccaggccaag gaagcctcgg aatcggtgct ggcggcccac acgggcggcg aacgctacga 3660 ccatcagggc cgcagggtcg tggccggcca gcgtctgatc cagaccaccg gtgacatctt 3720 tctcggctgg gcgcgcgtca ccggcttcga cggaaaggcc cgggacttct acgtgcgtca 3780 actgtgggac tggaagggcg tcgcgcggcc ggaaaccatg gggcccgacc tgctctccct 3840 cttcgcccgg ctgtgcggtg cctgcctggc gagggcccac gcccgttccg gtgaccccgt 3900 cgcgctcgcc gcgtacctgg gcggcagcga ccgcttcgac ggcgcgctca ccgagttcgc 3960 ccagtcctac gccgatcaga atgaacgcga ccacgaagct ctgctggcgg cctgccgctc 4020 cggcagggtc acggccgccc gtttgtgagg ccgacccggg aacggccggc gggctggcac 4080 acaccgccgc cggtcggcgt cattccggaa gctgccgcat ctccaggacg cgcaggccca 4140 gcgactggca gcgggtgagc aacccgtaca gatgggcctc gtcgatcacc gtgccgaaca 4200 gcacggtctg gccggacatg acgacgtgct ccagctccgg gaacgcgttg gccagcgtcc 4260 gtgacaggtg tccctcgacg cggatctcgt agcgcacgag cggtcctttc accgtaggag 4320 ctcgggacac cgcccggggc tccgggtcgg acggtgctct tggtgacgag cctgcgcctc 4380 gtcgccctcc ggtgccctca cccagcacag gtgactccaa ccgcagtgtc agtgcctttc 4440 agtgcgtcac tgtgatcttg acgacgacga tcaccaggcc gagcagtacg ttgaccgtcg 4500 cggtgacggc caccagtcgt cgcgaggcgc ccgcgcggtg cgccgcggcg acggaccagc 4560 ccacctgacc ggcgacggcg acggacagcg ccagccacag ggtgcccggg acgtccagcc 4620 ccagtacggg gctgacggcg atggccgcgg ccggaggcac ggcggccttg acgatcggcc 4680 actcctcgcg gcacacacgc agaatcaccc gccggtccgg agtgtgccgc gcgagacgcg 4740 ctccgaacag ttcggcgtgg acgtgagcga tccagaacac caagctggtg agcaacagca 4800 gaagaaccag ttcggcgcgg gggaacgagc ccagggtgcc ggcgccgatc acgacggagg 4860 ctgcgagcat 4870 // 
U58726.gb100644000766000024 20014014006427353 15563 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS U58726 31931 bp DNA linear INV 29-APR-2011 DEFINITION Caenorhabditis elegans cosmid T01C8, complete sequence. ACCESSION U58726 VERSION U58726.1 GI:1326250 KEYWORDS HTG. SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 31931) CONSRTM C. elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282 (5396), 2012-2018 (1998) PUBMED 9851916 REMARK Erratum:[Science 1999 Jan 1;283(5398):35] REFERENCE 2 (bases 1 to 31931) AUTHORS Wohldmann,P. and Hawkins,J. TITLE The sequence of C. elegans cosmid T01C8 JOURNAL Unpublished REFERENCE 3 (bases 1 to 31931) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (19-MAY-1996) Robert Waterston REFERENCE 4 (bases 1 to 31931) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (25-JUL-2001) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 5 (bases 1 to 31931) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (23-MAY-2002) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 6 (bases 1 to 31931) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (29-JUN-2002) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 7 (bases 1 to 31931) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (06-NOV-2002) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 8 (bases 1 to 31931) AUTHORS Waterston,R. 
TITLE Direct Submission JOURNAL Submitted (21-NOV-2002) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 9 (bases 1 to 31931) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (19-APR-2003) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 10 (bases 1 to 31931) CONSRTM WormBase Consortium TITLE Direct Submission JOURNAL Submitted (21-SEP-2004) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 11 (bases 1 to 31931) CONSRTM WormBase Consortium TITLE Direct Submission JOURNAL Submitted (10-SEP-2005) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 12 (bases 1 to 31931) CONSRTM WormBase Consortium TITLE Direct Submission JOURNAL Submitted (06-MAY-2006) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA REFERENCE 13 (bases 1 to 31931) CONSRTM WormBase Consortium TITLE Direct Submission JOURNAL Submitted (29-APR-2011) Department of Genetics, Washington University, Genome Sequencing Center, 4444 Forest Park Avenue, St. Louis, MO 63110, USA COMMENT [WARNING] On Oct 13, 2011 this sequence was replaced by gi:351061076. Submitted by: Genome Sequencing Center Department of Genetics, Washington University St. Louis , MO 63110, USA, and Sanger Centre, Hinxton Hall Cambridge CB10 IRQ, England email: stlouis@wormbase.org and sanger@wormbase.org NOTICE: This sequence may not be the entire insert of this clone. It may be shorter because we only sequence overlapping sections once, or longer because we provide a small overlap between neighboring submissions. 
This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistry or covered by high quality data (i.e., phred quality >= 30); an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. For a graphical representation of this clone sequence and its analysis see: http://www.wormbase.org/db/seq/sequence?name=T01C8 NEIGHBORING CLONE INFORMATION The 5' clone is EGAP8, 200 bp overlap; the 3' clone is F41G4, 2000 bp overlap. Actual start of this clone is at base position 1 of T01C8; actual end is at 10488 of F41G4. NOTES: Coding seqences below are the result of integration and manual review of the following data : computer analysis using the program Genefinder (P. Green and L. Hillier, personal communication), the large scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html) and The C. elegans ORFeome cloning project (http://worfdb.dfci.harvard.edu/), similarity to other proteins from BlastX analyses (http://blast.wustl.edu/), sequence conservation with C. briggsae using Jim Kent's WABA alignment program (Genome Research 10:1115-1125, 2000), individual C. elegans GenBank submissions, and personal communications with C. elegans researchers. tRNAs are predicted using the program tRNAscan-SE (Lowe, T.M. and Eddy, S.R., 1997, Nucl. Acids. Res., 25, 955-964). 
FEATURES Location/Qualifiers source 1..31931 /organism="Caenorhabditis elegans" /mol_type="genomic DNA" /strain="Bristol N2" /db_xref="taxon:6239" /chromosome="X" /clone="T01C8" gene 1698..1819 /locus_tag="T01C8.8" /db_xref="WormBase:WBGene00196318" ncRNA 1698..1819 /locus_tag="T01C8.8" /ncRNA_class="other" /product="T01C8.8" /db_xref="WormBase:T01C8.8" /db_xref="WormBase:WBGene00196318" gene complement(5778..7678) /locus_tag="T01C8.3" /db_xref="WormBase:WBGene00020144" CDS complement(join(5778..5951,5996..6076,6123..6219, 6268..6317,6359..6509,6555..6631,7562..7678)) /locus_tag="T01C8.3" /standard_name="T01C8.3" /note="contains similarity to Paramecium tetraurelia Chromosome undetermined scaffold_129, whole genome shotgun sequence.; TR:A0BUB6; coded for by the following C. elegans cDNAs: FM864997" /codon_start=1 /product="Hypothetical protein T01C8.3" /protein_id="AAB00575.3" /db_xref="GI:331028906" /db_xref="WormBase:T01C8.3" /db_xref="WormBase:WBGene00020144" /translation="MQLLENCNFRCTNCTGDYCYAVNYKHLNPSLKNEQSYYQGCFTS PTDLPLGCSTNSRGSIFCICNSTDYCNEMTNVKEEKNITYLICEYAKDSMFRGADCVQ PWCVKTASSYMDEMVECGEGTYEMEMYDIGFVYSGMLLPINSCYAVADDSRYDKSQIC TYKVNKTTPYKLKVPGSTKCFAPGEVMTRMKNSTCIGQFCYSASAVFGCISQFNREGA ILKVTIFHFEILNKNNNICLTIRNFCKKKL" misc_feature 11227..11228 /note="SL1 trans-splice acceptor; see yk1591b07.5" misc_feature 11227..11228 /note="SL2 defined by RNASeq short reads (Hillier et al.); see RNASeq_Hillier.dauer_daf-2" gene 11238..12262 /locus_tag="T01C8.2" /db_xref="WormBase:WBGene00020143" CDS join(11238..11384,12167..12262) /locus_tag="T01C8.2" /standard_name="T01C8.2" /note="contains similarity to Escherichia coli Type 4 prepilin-like proteins leader peptide-processing enzymes[Includes: Leader peptidase (EC 3.4.23.43) (Prepilin peptidase); N-smethyltransferase (EC 2.1.1.-)].; SW:P25960; coded for by the following C. 
elegans cDNAs: OSTF201H6_1, OSTR201H6_1, yk725a4.3, yk1428d07.3, yk1428d07.5, yk1591b07.3, yk1591b07.5, MM454_FPK17YK01E1TJO" /codon_start=1 /product="Hypothetical protein T01C8.2" /protein_id="AAB00576.1" /db_xref="GI:1326252" /db_xref="WormBase:T01C8.2" /db_xref="WormBase:WBGene00020143" /translation="MPKQEFNPLDYTGPLIVGAIFCVFLFVISFFVINFFCITKYDDI TKFELMGGKYGWRLGPHPLIVVKKGGFVAEEEVDDA" gene complement(12416..13336) /locus_tag="T01C8.4" /db_xref="WormBase:WBGene00020145" CDS complement(join(12416..12555,12785..13093,13138..13336)) /locus_tag="T01C8.4" /standard_name="T01C8.4" /note="contains similarity to Pfam domain PF00155 (Aminotransferase class I and II); coded for by the following C. elegans cDNAs: FN888767, FN888768, FN888770, FM864682" /codon_start=1 /product="Hypothetical protein T01C8.4" /protein_id="AAB00577.2" /db_xref="GI:331028907" /db_xref="WormBase:T01C8.4" /db_xref="WormBase:WBGene00020145" /translation="MDPTREQWIQMAQVIKQKNLFTFFHIADQGLASGDADADAWAVR FFVEQGLEMIVSQSFSKNFGLYNDRVGSLTVIVNKPSHIANLKSQLTLVNVSNFSNPP AYGARIVHEILKSPKYREQWQNSIKMMAFRIKKTRQELIRELNMLQTSGKWDRITQQS GLFSYTGLTPCQVDHLIAHHKIYLLSDGRINICGLNMSNLDYVARAIDDTVRTIH" gene complement(16309..17896) /locus_tag="T01C8.5" /db_xref="WormBase:WBGene00020146" CDS complement(join(16309..16457,16507..16815,16863..17233, 17499..17896)) /locus_tag="T01C8.5" /standard_name="T01C8.5" /note="contains similarity to Pfam domain PF00155 (Aminotransferase class I and II); coded for by the following C. 
elegans cDNAs: OSTR040A9_1, OSTR040A9_2, yk204c10.5, yk327b7.3, yk327b7.5, yk468h3.5, yk503d6.3, yk503d6.5, yk512b9.3, yk512b9.5, yk597a11.3, yk597a11.5, yk829h10.3, yk829h10.5, yk899e07.3, yk1020c11.5, yk1023a07.3, yk1023a07.5, yk1070e06.3, yk1070e06.5, yk1318d04.3, yk1318d04.5, cm20g8.5, yk1400g10.3, yk1400g10.5, yk1444g09.3, yk1444g09.5, yk1452e06.3, yk1452e06.5, yk1471h11.3, yk1471h11.5, yk1500a04.3, yk1546b06.3, yk1546b06.5, yk1560g10.5, yk1611e03.3, yk1611e03.5, yk1613g08.3, yk1630a11.5, yk1672d12.3, yk1672d12.5, yk1675b02.3, yk1675b02.5, yk1755e02.3, yk1755e02.5, yk899b12.3, yk899b12.5, EC036254, EC029162, MM454_contig05811, FN906088, FN906090, FN906089, FN906091" /codon_start=1 /product="Hypothetical protein T01C8.5" /protein_id="AAB00578.1" /db_xref="GI:1326254" /db_xref="WormBase:T01C8.5" /db_xref="WormBase:WBGene00020146" /translation="MSFFDGIPVAPPIEVFHKNKMYLDETAPVKVNLTIGAYRTEEGQ PWVLPVVHETEVEIANDTSLNHEYLPVLGHEGFRKAATELVLGAESPAIKEERSFGVQ CLSGTGALRAGAEFLASVCNMKTVYVSNPTWGNHKLVFKKAGFTTVADYTFWDYDNKR VHIEKFLSDLESAPEKSVIILHGCAHNPTGMDPTQEQWKLVAEVIKRKNLFTFFDIAY QGFASGDPAADAWAIRYFVDQGMEMVVSQSFAKNFGLYNERVGNLTVVVNNPAVIAGF QSQMSLVIRANWSNPPAHGARIVHKVLTTPARREQWNQSIQAMSSRIKQMRAALLRHL MDLGTPGTWDHIIQQIGMFSYTGLTSAQVDHLIANHKVFLLRDGRINICGLNTKNVEY VAKAIDETVRAVKSNI" gene 16755..16901 /locus_tag="T01C8.10" /db_xref="WormBase:WBGene00201366" ncRNA 16755..16901 /locus_tag="T01C8.10" /ncRNA_class="other" /product="T01C8.10" /db_xref="WormBase:T01C8.10" /db_xref="WormBase:WBGene00201366" gene 17202..17310 /locus_tag="T01C8.11" /db_xref="WormBase:WBGene00202374" ncRNA 17202..17310 /locus_tag="T01C8.11" /ncRNA_class="other" /product="T01C8.11" /db_xref="WormBase:T01C8.11" /db_xref="WormBase:WBGene00202374" misc_feature complement(17910..17911) /note="SL1 trans-splice acceptor; see cm20g8.5" misc_feature complement(17910..17911) /note="SL2 defined by RNASeq short reads (Hillier et al.); see RNASeq_Hillier.L1_larva" misc_feature 20870..20871 /note="SL1 trans-splice acceptor; see 
yk1146c08.5" misc_feature 20870..20871 /note="SL2 defined by RNASeq short reads (Hillier et al.); see RNASeq_Hillier.dauer_entry_daf-2" gene 20873..27634 /gene="aak-2" /locus_tag="T01C8.1" /db_xref="WormBase:WBGene00020142" CDS join(20873..21015,24491..24654,24830..25298,25473..25591, 25817..25922,26012..26117,26167..26374,26476..26787, 27198..27349,27533..27634) /gene="aak-2" /locus_tag="T01C8.1" /standard_name="T01C8.1b" /note="contains similarity to Pfam domains PF07714 (Protein tyrosine kinase), PF00069 (Protein kinase domain); coded for by the following C. elegans cDNAs: OSTF004G3_1, OSTR004G3_1, yk114d1.5, yk133b5.5, yk222d7.5, yk273e10.5, yk298g7.5, yk390b1.5, yk443g12.5, yk507b7.5, yk607a4.5, yk652d12.5, yk677a2.5, yk848d05.3, yk1005g04.3, yk1005g04.5, yk1146c08.3, yk1146c08.5, yk1235g04.3, yk1235g04.5, yk1251g04.3, yk1251g04.5, yk848d05.5, yk1552h06.3, yk1625b07.3, yk237e11.5, MM454_contig01620" /codon_start=1 /product="Amp-activated kinase protein 2, isoform b, confirmed by transcript evidence" /protein_id="AAM69096.1" /db_xref="GI:21629470" /db_xref="WormBase:T01C8.1b" /db_xref="WormBase:WBGene00020142" /translation="MFSHQDRDRDRKEDGGGDGTEMKSKSRSQPSGLNRVKNLSRKLS AKSRKERKDRDSTDNSSKMSSPGGETSTKQQQELKAQIKIGHYILKETLGVGTFGKVK VGIHETTQYKVAVKILNRQKIKSLDVVGKIRREIQNLSLFRHPHIIRLYQVISTPSDI FMIMEHVSGGELFDYIVKHGRLKTAEARRFFQQIISGVDYCHRHMVVHRDLKPENLLL DEQNNVKIADFGLSNIMTDGDFLRTSCGSPNYAAPEVISGKLYAGPEVDVWSCGVILY ALLCGTLPFDDEHVPSLFRKIKSGVFPTPDFLERPIVNLLHHMLCVDPMKRATIKDVI AHEWFQKDLPNYLFPPINESEASIVDIEAVREVTEFQRYHVAEEEVTSALLGDDPHHH LSIAYNLIVDNKRIADETAKLSIEEFYQVTPNKGPGPVHRHPERIAASVSSKITPTLD NTEASGANRNKRAKWHLGIRSQSRPEDIMFEVFRAMKQLDMEWKVLNPYHVIVRRKPD APAADPPKMSLQLYQVDQRSYLLDFKSLADEESGSASASSSRHASMSMPQKPAGIRGT RTSSMPQAMSMEASIEKMEVHDFSDMSCDVTPPPSPGGAKLSQTMQFFEICAALIGTL AR" CDS join(20873..21015,24491..24654,24830..25298,25473..25591, 25817..25922,26012..26117,26173..26374,26476..26787, 27198..27349,27533..27634) /gene="aak-2" /locus_tag="T01C8.1" /standard_name="T01C8.1a" 
/note="contains similarity to Pfam domains PF07714 (Protein tyrosine kinase), PF00069 (Protein kinase domain); coded for by the following C. elegans cDNAs: OSTF004G3_1, OSTR004G3_1, yk114d1.5, yk133b5.5, yk222d7.5, yk273e10.5, yk298g7.5, yk390b1.5, yk402f12.5, yk443g12.5, yk507b7.5, yk607a4.5, yk617g6.5, yk652d12.5, yk677a2.5, yk848d05.3, yk1005g04.3, yk1005g04.5, yk1146c08.3, yk1146c08.5, yk1235g04.3, yk1235g04.5, yk1251g04.3, yk1251g04.5, yk848d05.5, yk1552h06.3, yk1625b07.3, yk114d1.3, yk133b5.3, yk222d7.3, yk273e10.3, yk390b1.3, yk402f12.3, yk607a4.3, yk617g6.3, yk652d12.3, yk677a2.3, yk1552h06.5, yk1625b07.5, AY347273, MM454_contig01620" /codon_start=1 /product="Amp-activated kinase protein 2, isoform a, confirmed by transcript evidence" /protein_id="AAM69095.1" /db_xref="GI:21629469" /db_xref="WormBase:T01C8.1a" /db_xref="WormBase:WBGene00020142" /translation="MFSHQDRDRDRKEDGGGDGTEMKSKSRSQPSGLNRVKNLSRKLS AKSRKERKDRDSTDNSSKMSSPGGETSTKQQQELKAQIKIGHYILKETLGVGTFGKVK VGIHETTQYKVAVKILNRQKIKSLDVVGKIRREIQNLSLFRHPHIIRLYQVISTPSDI FMIMEHVSGGELFDYIVKHGRLKTAEARRFFQQIISGVDYCHRHMVVHRDLKPENLLL DEQNNVKIADFGLSNIMTDGDFLRTSCGSPNYAAPEVISGKLYAGPEVDVWSCGVILY ALLCGTLPFDDEHVPSLFRKIKSGVFPTPDFLERPIVNLLHHMLCVDPMKRATIKDVI AHEWFQKDLPNYLFPPINESEASIVDIEAVREVTERYHVAEEEVTSALLGDDPHHHLS IAYNLIVDNKRIADETAKLSIEEFYQVTPNKGPGPVHRHPERIAASVSSKITPTLDNT EASGANRNKRAKWHLGIRSQSRPEDIMFEVFRAMKQLDMEWKVLNPYHVIVRRKPDAP AADPPKMSLQLYQVDQRSYLLDFKSLADEESGSASASSSRHASMSMPQKPAGIRGTRT SSMPQAMSMEASIEKMEVHDFSDMSCDVTPPPSPGGAKLSQTMQFFEICAALIGTLAR " gene complement(21899..21958) /locus_tag="T01C8.9" /db_xref="WormBase:WBGene00199326" ncRNA complement(21899..21958) /locus_tag="T01C8.9" /ncRNA_class="other" /product="T01C8.9" /db_xref="WormBase:T01C8.9" /db_xref="WormBase:WBGene00199326" misc_feature 24490..24491 /gene="aak-2" /locus_tag="T01C8.1" /note="SL1 trans-splice acceptor; see yk848d05.5" misc_feature 24490..24491 /gene="aak-2" /locus_tag="T01C8.1" /note="SL2 defined by RNASeq short reads (Hillier et al.); see 
RNASeq_Hillier.L1_larva" CDS join(24534..24654,24830..25298,25473..25591,25817..25922, 26012..26117,26173..26374,26476..26787,27198..27349, 27533..27634) /gene="aak-2" /locus_tag="T01C8.1" /standard_name="T01C8.1c" /note="contains similarity to Pfam domains PF07714 (Protein tyrosine kinase), PF00069 (Protein kinase domain); coded for by the following C. elegans cDNAs: OSTF004G3_1, OSTR004G3_1, yk114d1.5, yk133b5.5, yk222d7.5, yk273e10.5, yk298g7.5, yk390b1.5, yk402f12.5, yk443g12.5, yk507b7.5, yk607a4.5, yk617g6.5, yk652d12.5, yk677a2.5, yk848d05.3, yk848d05.5, yk1005g04.3, yk1005g04.5, yk1146c08.3, yk1146c08.5, yk1235g04.3, yk1235g04.5, yk1251g04.3, yk1251g04.5, yk1552h06.3, yk1552h06.5, yk1625b07.3, yk1625b07.5, FN880253, MM454_contig01620" /codon_start=1 /product="Amp-activated kinase protein 2, isoform c, confirmed by transcript evidence" /protein_id="AAP13770.1" /db_xref="GI:30025106" /db_xref="WormBase:T01C8.1c" /db_xref="WormBase:WBGene00020142" /translation="MSSPGGETSTKQQQELKAQIKIGHYILKETLGVGTFGKVKVGIH ETTQYKVAVKILNRQKIKSLDVVGKIRREIQNLSLFRHPHIIRLYQVISTPSDIFMIM EHVSGGELFDYIVKHGRLKTAEARRFFQQIISGVDYCHRHMVVHRDLKPENLLLDEQN NVKIADFGLSNIMTDGDFLRTSCGSPNYAAPEVISGKLYAGPEVDVWSCGVILYALLC GTLPFDDEHVPSLFRKIKSGVFPTPDFLERPIVNLLHHMLCVDPMKRATIKDVIAHEW FQKDLPNYLFPPINESEASIVDIEAVREVTERYHVAEEEVTSALLGDDPHHHLSIAYN LIVDNKRIADETAKLSIEEFYQVTPNKGPGPVHRHPERIAASVSSKITPTLDNTEASG ANRNKRAKWHLGIRSQSRPEDIMFEVFRAMKQLDMEWKVLNPYHVIVRRKPDAPAADP PKMSLQLYQVDQRSYLLDFKSLADEESGSASASSSRHASMSMPQKPAGIRGTRTSSMP QAMSMEASIEKMEVHDFSDMSCDVTPPPSPGGAKLSQTMQFFEICAALIGTLAR" gene complement(28176..31898) /gene="mec-4" /locus_tag="T01C8.7" /db_xref="WormBase:WBGene00003168" CDS complement(join(28176..28361,28501..28576,28625..28753, 28811..28871,28916..29030,29154..29249,29301..29473, 29567..29655,29698..29855,29903..30051,30149..30309, 30526..30637,30690..31052,31100..31355,31716..31898)) /gene="mec-4" /locus_tag="T01C8.7" /standard_name="T01C8.7" /note="contains similarity to Pfam domain PF00858 
(Amiloride-sensitive sodium channel); coded for by the following C. elegans cDNAs: yk411c2.3, yk411c2.5, yk1105a08.3, yk1105a08.5, EC002941, RST5_376858, FM864873" /codon_start=1 /product="Mechanosensory abnormality protein 4, partially confirmed by transcript evidence" /protein_id="AAB00580.3" /db_xref="GI:74356167" /db_xref="WormBase:T01C8.7" /db_xref="WormBase:WBGene00003168" /translation="MSWMQNLKNYQHLRDPSEYMSQVYGDPLAYLQETTKFVTEREYY EDFGYGECFNSTESEVQCELITGEFDPKLLPYDKRLAWHFKEFCYKTSAHGIPMIGEA PNVYYRAVWVVLFLGCMIMLYLNAQSVLDKYNRNEKIVDIQLKFDTAPFPAITLCNLN PYKASLATSVDLVKRTLSAFDGAMGKAGGNKDHEEEREVVTEPPTTPAPTTKPARRRG KRDLSGAFFEPGFARCLCGSQGSSEQEDKDEEKEEELLETTTKKVFNINDADEEWDGM EEYDNEHYENYDVEATTGMNMMEECQSERTKFDEPTGFDDRCICAFDRSTHDAWPCFL NGTWETTECDTCNEHAFCTKDNKTAKGHRSPCICAPSRFCVAYNGKTPPIEIWTYLQG GTPTEDPNFLEAMGFQGMTDEVAIVTKAKENIMFAMATLSMQDRERLSTTKRELVHKC SFNGKACDIEADFLTHIDPAFGSCFTFNHNRTVNLTSIRAGPMYGLRMLVYVNASDYM PTTEATGVRLTIHDKEDFPFPDTFGYSAPTGYVSSFGLRLRKMSRLPAPYGDCVPDGK TSDYIYSNYEYSVEGCYRSCFQQLVLKECRCGDPRFPVPENARHCDAADPIARKCLDA RMNDLGGLHGSFRCRCQQPCRQSIYSVTYSPAKWPSLSLQIQLGSCNGTAVECNKHYK ENGAMVEVFYEQLNFEMLTESEAYGFVNLLADFGGQLGLWCGISFLTCCEFVFLFLET AYMSAEHNYSLYKKKKAEKAKKIASGSF" ORIGIN 1 gatcaaaacc caaaaaaaaa tttcattcaa aaatttggtt ctttcttaca agcaaaatgc 61 gaggaatagg tgagaaggaa ctgagactga aactaaaaat ttgtgaatag aaactaaaaa 121 aaaactttat caaggtgttg caattttggt ttctgcaaaa ctgacttgac agtgataagc 181 tgatagcagc cagaaaccga tagcggtttc aatgacaatt cggggcaagt tggagatcaa 241 ggactcgttc cacgtggagc gatgtaaagg tctgtttgtt ttgtattgaa tcaaaatcag 301 aaaaaaaaag aatcagaaac agcaatagat gaaacagaag aaagaaaaat atgataaaat 361 tgataatgaa attgaaaacc acgccgacgc tgttcgttga ttgattgagg tgggaaacgt 421 gagcgagaaa aaagtcaaaa gatgataagt ctatgtgaac aagtcacgtt gcttgaatgg 481 acccctcgag gcgtgagtgt gtcctccgtt atcaagtgat catagtgggt aggtgggtgt 541 atgatttgaa attgtgaaca ccgacgaata tacataactt acgccgaaca gttgttcgct 601 cgcaagacaa ggtacttacc aaactctcaa cgttttccat ttctcactct gccgaccccc 661 catcacttta ccgaagtgtt taggtgtgtt 
ggtgtgttgg tactcaaaac catagttgag 721 ttgacatttc atgagtcagc gtagaacgat atacaaagat cataagtgaa atgtatgtag 781 gttagtggaa aaaagatttg ttagtatgta agtacattaa attttgaaaa atcagacttt 841 taatatatat ggctgtgaat tcaaaattta aaaaaatata ggttaaaaat gacctaattt 901 tagttgaaat cttcaatacg acaggatgga agaagcttaa aggtggtgta gtcgaatttg 961 agactatgct taaaaacatt ctctatgctc gtacaagtcg atatgataaa gcgaaacatc 1021 tcaaaaattt ctctaacgga agttatgagc cttcaaagtg ccgaaaaaaa tctctttctt 1081 cactgtcaaa attttttttg ttgaaaattt agcaatttac caaaacttcc actacaactt 1141 ttgaaaggaa ggatcaatgc aacaaagttc aggcgtagta ggcaataaat gcatttttcg 1201 aataatttta gctattttcg aattagcatt taagttgcac ttttgaaaat gatatttttc 1261 agtttcaaga tgttttatct tttcagtaat attccaaact aggaaagtca tatggaaaaa 1321 atgacaagga acatgatttc gattcaaagt tacactatat gttatattaa aaagggagca 1381 atactttttc agagtagcgc ggcaagcgca atattactac ggaggctatg ataatagaag 1441 gagtacggta aaactagagt tcaaatagtt caaggagaac aagtcttaga catatgacaa 1501 tgatcacaat tctacgagaa tcttgctcat tttaatgcta tttaaggaaa ggaatcactt 1561 gtcataaatt gtaaagcaaa taaacaaagt tcaagtaagc gggggatgac cgttcttcct 1621 aattgtttat cttttcttat cctcttttca caatcgagca gaaggatgca tcacggaatg 1681 agaatacgaa tacacagaaa aacatctgca caccattatt gtttacaagc cttgcatatt 1741 tcaaggtctt caaggaactg atcaaaaagt gacggacaac acattgtaca ggtcagttga 1801 tgaacgagag aaagagtttc aaaaaagtgt tatatattga gtatcgggtt acaaggaagg 1861 aaacggcaga gttaaaacgt gtcgggttac aaaacaacac aattgttgat tggattaatc 1921 cagaggaatt gaacaaagaa tggaaagaga aaaacaaaag cgaagtcaac attcagaaac 1981 taatgcgctt gggaactttc gatcaaatgg agaaaaaatt attgcattgg agggctagag 2041 caaagagttg ctatactttt tcagtttggc acaatattgt actctgttaa tttctccaat 2101 tagttactta agttgattaa ttaattggtt tatttttcag ttgaggtaaa aaaaaacaaa 2161 aaataccgcg tcttgttatt ttattatcgc atgcaaaatt atcgcatgtt attttattat 2221 cgcatgttca tatttagtgc aatactgaaa ttggagacaa ttttcacgtt tttttggtca 2281 cataatttga gttttacaaa aaaggcattt tttacgtcaa tttgaagcag cttatcaatt 2341 gtttttggtg actataatag agcaatagaa aaaaattcgc 
tactctagca ggaaatcaat 2401 aaacttaggt atgcagtgct ttaaggaacg ctcggggata ctaatagata aatttcaatt 2461 ttatagtact tcataaatat caatcaccaa tgttcagagc gttttgaaac tgagaaattg 2521 tttacaaatg atgcgatcgt ttgtaatgtg caattaaacg tttgaaaaca tttcaaaaat 2581 cttccaggaa caatgtgagt ttctgaccga tgattgtatc atttcacatt tggatatctt 2641 tatcattttg aaaactttac atacatttaa tttttacaac actaatacaa gtacaataga 2701 cgacaatttc taaatttagc tttacatgac ataactcatt gaaccatgtt ttaattgttc 2761 cctaacctca ttaactcaga ttttacactt gatagaaaaa ccgaaattca tagaccaaat 2821 acgtgaaaat aaaataaaat aggaaaaagc tggttgggtt gtttttgttt tccttattgt 2881 atactactca catttctcaa caaaaacact tgaagaatta gtttttttat cgcatttcca 2941 gatcttttgc agcagaaatg tagcaaaaaa aattgaaaaa aaaaagaaaa aagcttgaag 3001 aaaaaagaaa gatgaactac aaaggaagtt gagggaaaca ctaaacagaa gctagaacat 3061 tttagatctg caaaaaacat ttaaaaaagt attggctatg ttggggttga agaaacatta 3121 ttggaaactt tgaaacaact tgtaaatact caacctttaa aaatttgtta tattttcttg 3181 tgacaacaat ttattttctt aatctcatat tttttttgta tgccgtttca tttttcatgc 3241 tcaacaactt aaaaacaata atcattctaa aacagcaaag tcaatacaga caagaacaca 3301 aaaattcaat tcttaacatt ccaaaatgtt tcttaaaagc tttaagcttg acgctttcta 3361 aattgaaatt taatgtatgc aaactagtgg ggagaaaagc agaccaacga ggtggtcatt 3421 cgtagaaaga gcggcagacc cccccccccc ctttgtgttg ttgctcatac tcctccaaat 3481 gtgatgcaat ctctctgtct gcctcacacc gcttcaaaag cctcccgcgc tccgtgtccg 3541 tcagaatatt cgaatgcatt tcattttcat ctgccgttta tgagatgtga gggctgctga 3601 aatagaaatt gaaaaatgtc tggaaagcat tgtcgtcgaa tccaccctaa aaaggctaca 3661 atcgacgacc ttgcagcgct ataacatagt tctcattgga atatcccact attcttaaac 3721 tatgttaaca ctgccagatt ctcctaatta ctctcttgtt aattcctttt atttcattca 3781 aaaccgttaa gtataagccg agatacactc aacttacata atcaccaaac gagctagttg 3841 cattcaaact acgcgcccga tcgccaccgt acctctcttc ccctcgagac ccacgcggca 3901 ccactacttg gcaccgcgcc aagaaagggg gagaggtggc ggcgcgcgct atgtttgttc 3961 gtacacgaag atatgcgtgc cacccaacgg ttaattctac tcgaaaatac cttttaatcg 4021 agcaagaata ttattatttg ttagaattaa ttaaatatta aagtttgaac 
caaattgata 4081 gtctaaagat caaaatcgta ttccatttcg ccatgaggat ttcggaccac aagaacagtt 4141 ttttataaca cctacttttc tgttctttaa tgtattaccc actttttact tgttattttc 4201 ttatttattg atggcctgcg cgaccctagg gcgtggttgt acaataaata agtttttgag 4261 ttatcttcgt ttgtgttatt cgtttctttc tctctattgc aggttttagg tgtattccgt 4321 cgtgaagggg gtgtttggga gtgggaaggg ttgtccacga tgccgactac gtacacttta 4381 ttggatttcg ccacaagcat gttggattca tatatttcaa tttttaaccg gttcaattca 4441 ttaacttggt ggtttcattt atataaggtt taatgcaatt atgattctct cagaagcacc 4501 gagaaaattg cggtttccac tgaacacaat ataaagaaat ttgatttaaa taggcgcaat 4561 aaatttaaaa aaacaactta ttgtaatttc cttcccgatg actaagtatg taaatttgga 4621 atatcactga atttttaatt tttcaagctt gaaacggctt caaaaaaatg tatttccttc 4681 gctcagttct ttttaaaaac aaaattttta cctataatta tttttctaca tacatggata 4741 gagccaacta gacacatttt tattttgtag atattcagga acagaaaaaa aaacaaattt 4801 ttccaaaaaa taaaaacaaa gtgtttagaa aatttaaaaa ttctagcttt tcttatcatt 4861 aataaatttt tcttatcact tgtctgtgca aaaaacgaaa tcaaataacg gatgacatct 4921 ttattggaaa atccaaatat tttagagaaa caagcaaaag aaatatggga aagaatgaga 4981 aaatgatctt tgcaacgacg gcggccaaga aaagggcatg ttctgaaagt gcttttcctt 5041 gaaggcaaac tctctgagca actgagcaac tgggaattag acatacacca accacaccaa 5101 ttgtatcaaa atgagcacta cgaagcattg cttctttgca gaactccaca cataagttgt 5161 cgggtaagaa tatcttcctt cgacatcatc ttccaactga aatagttgtt cttcgaaccg 5221 gggtgaaagg tgtgaatgtc aatgataaga aacagacact agtacatacc gtttgattgt 5281 tgcaaaagtc gctatcacag atttggaagg gtgttaaaaa cggtgtaaga tggtataagc 5341 cggcctgaac acgtcttgat tgcacttttt cgtttgctcg gtttgggggt acttttgttc 5401 ttgctcgaaa tatttggact tgagaatatg aaagaggggt caagtgaata cgaaactgtc 5461 aacaacaaaa aaattccaga ccttgtcttc gtgacccaaa ttaaggtttt cttcgtgacc 5521 caaattaagg cttgtaaaat aatattaaaa aaacaaaaat cttaaaatat ttttcttcgg 5581 caagtttagg tatggtatag tttcatcaaa cttgaatttg atagaaactt tcagaaagtc 5641 ttttcagatt tcaacatgac tttcgcggta actcaaagta tgaaatgaag aaatacctga 5701 atttagtcgt atgtatttct acaatttgtt aattttccca gaaaaaatgt gacgtcactc 
5761 gtgtttttaa taacgtttta cagttttttt ttgcagaaat ttctgatggt caaacaaatg 5821 ttgttgtttt tgtttaaaat ttcaaaatga aaaatggtta ctttcaatat ggctccttcc 5881 ctattgaatt gagaaatgca tccaaaaaca gcgctggctg agtaacagaa ctgcccaata 5941 cacgttgaat tctaaatttt cattttctcg ttgttttgtt ttttttcatt ctcactttca 6001 tccttgtcat gacttctcct ggagcaaagc atttggtact tccgggcact ttcagtttgt 6061 atggagttgt tttgttctga actttaacta tgtaatattt gcggtcttca atcataactt 6121 accactttat atgtacaaat ttgacttttg tcataacgtg agtcatctgc aacggcatag 6181 catgagttga taggtagtaa cataccactg tagacgaatc tataaaaata ttctttatta 6241 gtttcaagtc atacagtttt cctttaccca atatcataca tctccatttc atatgtccct 6301 tctccgcatt caaccatcta aattattaac atagttatat atataatcac agccttacct 6361 catccatata actagatgct gtcttcacac accacggttg cacacaatcg gctcctctga 6421 acatcgaatc tttagcgtat tcacaaatta aatacgtgat atttttctcc tcttttacgt 6481 tggtcatctc attacagtaa tccgtactgc tggaaaaaaa aatgtagagt tttaactttc 6541 tataggaatt ttacttgcaa atacagaaaa tgcttcctcg gctgtttgtg gaacatccca 6601 atggcaagtc ggtaggagat gtgaagcacc cctaaaattt ttttaaaaat tgtctttttg 6661 aaattttgcg gtagtttaga attttaccct cttttcgttt aatttgaaat taaggaatac 6721 tatttattgc cggaacggta taaaatttta aagaataatg ctactaaata tttattttcc 6781 acgtgtgttg ggcagaaaac tataattttc tgaaactgca gtccattttt ttctgaaact 6841 gctgtctttt tccaaacttc tggcaaactg tggttgagca ttttttttta tggaagtttt 6901 tgaatgtcta ttttcaaact acaaaaattg aaggaacttg ttcgaggatt aagaaaaaaa 6961 aatttcaaat gttttcaaaa aaggttttct ttaaaaatat taaaaaagaa ccgaaaactt 7021 cgaaaacatt tacatagttt tttagttttg gttataaaaa aggtaaacga aaaatttccg 7081 agtacaaaaa atttggcaaa tcggcaaact gccctttctc agattttttc ctgatatcat 7141 acccgatacc tgaatcaaga ttcaaagaga ctgaaattta ataatggtgt tatacgcgtg 7201 gtgtcagagt gtctcatttc ggcttgatct acgtagatct acaaaaaaat gcgagaaaag 7261 agacgcagag ttctgaactg atttcgtatg agagaacgtg ctgacgtcac atatttttcg 7321 gcacaacatt tccgcaattt ttgtagatca aaacgtaatg ggacagcctg gcaccacgtg 7381 ttttaatata ttcaaattag ctattgcaaa aagtcaaaac attttatgag agaatcaaaa 7441 
ttcataactt ttcatttcgt tattcaaaaa aatcagcttt cagttctcta ttaattcgtt 7501 ttgggaagtt ttttacattt tcagaatcaa cattagcatt tatacattga gtaaaacaaa 7561 cctgataata agactgctcg tttttcagag acgggttcag gtgtttataa tttacggcgt 7621 aacagtaatc tccggtgcag tttgtacatc tgaagttaca gttttcaagc agttgcattt 7681 acaggaaaaa ttacgtttta tctaccgaaa taacccttcc atcacatttg acacattcaa 7741 gttcgcaatc acaaaaagtc gataaaaata gtaaaaacca taaaatccga agcaacattt 7801 gactgaaagc tacttgacag cgccggctta cttgctccgt ttcaaattac gttgttctta 7861 tcaatttctt ggttttcatt ttcagctgtc gcgcccccaa atcaccattt taggggactc 7921 gtttctatag caatttgtat catgtgtaat tggaaaaaaa accaattttc gtgtgataaa 7981 tgacagaaag aaaaacgaaa aagaaggaag agattttccc tttgattaaa atgatagaca 8041 gttggtcagt gcgggaagaa gaagagattt tggaaaaaaa gtgcagaggg ataccggaag 8101 aattgggaaa agtgtgattt tgaaacaatc agagctcatt catttgtact tggttttttg 8161 ctcgggttca ttgcatcccg agaatctgtc tgtgttgggc aactgtattg gcaaaatttt 8221 aaacatcctg aatgaaattt cagattcact tttttttgtt cgaagttcta ttcagaagct 8281 atttttgtga caatattttt ataatgtgtt ctatcataca acacagattt tgtttggaat 8341 ttcgagattt ccgggttttc cgaagttttt ctttattaac attaattacg actaaatatt 8401 aaataacttg aaaaaataaa aattattgaa tttctcaatc aaaacgtttt tatcttgttc 8461 cttgatggct taaattgtat tctcagaggt tcccttccat attctcacat ttcaagtgtc 8521 atttagaatt cttgtcagat gagtaatatt aggaaggcgt ctagactaaa taataaaagt 8581 aatacatagt acggaatttt gacagcgctt ttactaattt actaaatttt tgtacttcac 8641 atttaataag aaaacaattc atttccgttt ttccgatttg cgatcgaatc aattgaaaat 8701 aaaacaggtt ttggtttgtg tccgttttta tttataatat ttttgatatt caaaattaaa 8761 caaaaatttt gcataatatt caaaatcata aaaaatattg gttcttttat tggctcaatg 8821 aatcttcttc acaattcctc ccaactaacc ttttttcaaa gttcccctat gaatcaatct 8881 cccgccccag cccatgtttc cttccactaa tttttctgtt tttcatttta attcttctct 8941 attgccacgc atcgattcca ccccagcaaa aattgaaaaa gtaacgaagg tcaccgagag 9001 atcgaccatt ttcatctacg tcgattcttt ttaaaagatg cgcgcacctc cccccgaaaa 9061 tagatcaatc tgtctcagtt ttgctggttt ttattagtga gtttttatct gacaattgtg 9121 agaaggtgat 
actagagata atcaaagaat tcatttgatt caatgatcca gaagtctttt 9181 tttttatatt ttcaaaaaag tgaaactagc aatattaaaa gaatcccaat cctacaaaat 9241 tttcgaagta aattcaatta aaattaaaat taaaattaga aaaaaaactt taatttatca 9301 agattgatgc gatatcatcg tcaacaattg ctactcaatt attagattta ttttatttat 9361 ttcatttaga cataaatatg tacgtgagaa acaattgttt tctgttacac tgaataaact 9421 aaactcattt tatttttttt gagcttacga taattgctat gaattattca taacattttc 9481 taatgtcgag tgcacttttg cagatatgac tcactttttt ttgttttgtg agttgagcag 9541 cctaggactt tgtatgaatt aatttgcaaa aaactaggta aatttaatca tttacgtttt 9601 ttttcaaatc aattgcaaat ataaattctt gtctgaaatg tatattaccg tatttctttt 9661 attagtatat atgtaactac attttttaac aagtttttca aaagtctccc aataacgcaa 9721 aatcataaaa aattctacac ataggtacat attatttaga gttttctcaa gacctatatc 9781 acctcatttt caatttttcc ttatcaatca atctcgtttt cttcaaaaat tgaatcatct 9841 tttgatacaa cagacgtatt ctattcatag cctcatattg gcattggcta tcaatcaaga 9901 tgccaatatt ttacaatgca tttttcattt ccggtgaaaa tctagaatct agaattgaga 9961 atcgagaatc tagaatccgg cccgcttcaa ctattttata tatatacaaa tatataaatt 10021 tctgcccact ctttcttgat ttatttgcga tttggttgca tgacgcaggc gtgttgctct 10081 tactacccaa acaaggaata aagatcttta aatgcggatt tcatatatta atttttttct 10141 tctttcaatt ttattttgtt tgccaagaaa aaaaaaacgt gtcctttact acccttcaat 10201 atttttgttt caactgaaaa tataggtttt ttaaattaaa ggattaaaaa tgaattaaaa 10261 ctttttttta aagtaagaaa gtctgtttta gatttgtttt tattttaaac ttgaaaccat 10321 acctacaaaa ataccattaa aaattttaac aaaatctttg taacatttga agtcaacttt 10381 aaatacttca aataagcatt tgattatgag aagtgtgcat atgtatattt attactatta 10441 ttttttgtag aattttttaa aacgtatgta aaatttacat cttatgaaac atgtatgggg 10501 aaattttcag atgaatggta agagggttag taagtttatt aggtttatct attcaatcac 10561 aatgtaattt tttgactgaa atgcgcctgg ttttaaaatt tgagagcatg tacctttaat 10621 ttttgtccta catttttact ttatgggctg tttgttataa attttgtata tttacagaaa 10681 gatttcaact cttcattttt tatgaatctt ttttattgta aagagcatta gtcaatgctc 10741 accaaatgat ttctcaatta aaaatcaccg atctgagagc tcaaaatgct ccgttctcgt 10801 
aacgccaaca tttttatctg tttgtgtctt cacattagtc taccaccacc aacactcaaa 10861 atatcaaatt tttctcaatt ttcattcacc gatcattgtc tccactcttc gcctcaatct 10921 ccgtcgacct gtaactaacc agttgtcgaa tcgcattgga tcgtcgcgta tcgctctctc 10981 atttcctcat ggtcttcccc ctcagggcac tactcctctt cttttcccaa aatgcccatg 11041 attttcactt ttcattctgt gttgttgttg acacgacgac tgtctcgttt cgtcctttcc 11101 tcttgtttca tccctcccac ggttacctct tcaagtcttt tctcattgat ttttgaaagt 11161 ttttcaattc ttttgtaccg taatgatagt gatattgata agttgaacta ttttagtaca 11221 gtttcaggtg actagagatg ccaaagcaag agttcaaccc acttgattac actggaccac 11281 tgatagttgg agcaattttc tgcgtctttt tgtttgtgat ttcgttcttt gttatcaact 11341 tcttctgtat taccaaatac gatgatatta caaagtttga attggtgagt ttagtacttt 11401 gtagttttat ctttaaatta tttacaaata gaaggaaacc aatagcattg tacgttctaa 11461 aggttgggtt tgctagagag cctgctttat gttaattaaa aagttgagta tctatacaac 11521 atacacattt gatttttggc tttgctgcgt accattatta ctcaagaatt ggtatgtatt 11581 caaataatgc aataacgttg tgaaattttt tctctagaaa tcgttcataa tttctaagta 11641 gaaaagtcaa ttatttctga caatttctaa tttttttata aataaaaatg cacattaaga 11701 acttttgata atgttgcaag aaagttggga aagttatttg tctggtcact tactttcgaa 11761 atgtattaat cgatccttga acttttttgt tccgcagagg ctggcggagt ttacaagcgt 11821 acgacgtggt tcaattttca tttaaagctt taaaaatgga acatagatga acatttcgaa 11881 tgctaaatgc aaaatcaaat aatttaattt tcacgactct ataaagtttg ccagcttcct 11941 ataactgaca caaataaaaa ggaatccttt aatgagaacg agtataattt ctggaaaccc 12001 ccgcccctgc ataatactca aattctcaaa aaatttccaa ttgtctttac tgcagacgtt 12061 caatgaactt ttcccccacc atatcaatga taagtgtttt actacctaaa catgatttat 12121 agtttgaacc aatcagtctc tttcctttat atgcattttt tttcagatgg gaggtaaata 12181 tggatggcgc ctaggaccac atccattgat cgttgtcaag aagggtggat ttgtcgccga 12241 ggaggaggtt gatgatgctt aaatacttag ttccattcca acaacaaatc tatattttgc 12301 ttttaatggt tttgtttttg aaatatgcag tacgtttatt gcatttttaa taattccgca 12361 actttttgct gcaaactttt attcgaataa aattttaatt tgaagtgaat actttttaat 12421 gaattgttcg tacagtgtca tcaattgctc tagccacata atccaagtta 
ctcatgttaa 12481 gcccgcagat attaatacga ccgtcactga gaaggtagat cttgtgatgg gctatgagat 12541 ggtcgacttg acaagctgaa aaccatttca tttatttttt aggtttttca gaaaatgtct 12601 aatctaaaat tgctttttat taattttatt ccccacgacc atacggactt aaagctggat 12661 gtaacttttg tgcaaaatgg cacaattccg tccggagctt tggggtgttt tacgagaata 12721 ggtaaaataa cctaatgctt tttcactatg ctaaatagtt ttttgaaaaa tattgcttac 12781 ttacgcgtca atccggtata actgaacaac cccgattgtt gagtaatacg atcccatttt 12841 ccagaagttt gaagcatatt caactctcga atcagttctt gtcttgtttt cttaatacga 12901 aaagccatca ttttaatgga attttgccat tgttctctgt acttcggcga cttgagaatc 12961 tcatgcacaa tacgtgctcc gtaagctgga gggttagaaa aattagaaac attaacaagt 13021 gtcagttggg attttaggtt cgcaatatga cttggtttgt tcacgatcac agtaaggctt 13081 ccaacacgat catctgaaaa ttttaatcct aactaataat attcaattca aacttactgt 13141 aaagcccaaa attctttgag aaagattgag agacaatcat ttccaaaccc tgttcgacaa 13201 agaaccgaac agcccaagca tctgcatcag cgtcgccaga agctaatccc tgatcagcaa 13261 tatgaaaaaa agtgaaaaga ttcttctgtt tgatgacttg cgccatttgt atccattgct 13321 ctcgagttgg gtccattcct gttggattat agcacaggct tgaagaagaa cagctgatct 13381 ttccggagct tggcggaggt cgtcaagtaa ctgtgtgatg ttaatttctc tgttatgcaa 13441 gtcccagtac gtatattccc taattgaatt gaaaccactc ttttgaaatg tacgctcgta 13501 gtttcccctg gaaaaagaaa catcacacaa ttggtcgttt taataatcgg ttgaaaaaat 13561 tttgattttc tcattaaaaa atacccccaa aaagcattta cttaccagca tggattgctg 13621 acataaatcg tcttcatact aagaacttga gcaagaaact ctgctccaac acatatggca 13681 cctgtaccag agatgcactg aacactaaat gatcttccct ctttgatagc caaagaatca 13741 tttccgagta gcaaagcggt ggctgatttg cagaatccat catgtccaag aatcggtagg 13801 tactcatgat tgtgatgagg ctcgtgaggg aatttaagtt cgatttctcg aaccactggc 13861 aaaacccacg gctctccatc ttccgtgcga taagcttcta ttgcaagatt tattttcaca 13921 gggcaaattt ctttctggaa gagttccgat gtatggagct ctttgattgg cgaagcaaca 13981 tgaattccat caaagaaaga tagccgcata ttttaactga aatttatttg ctgtttaagc 14041 ttagataaca ttaaccctta gcctaagatg gacaaaaaag tcacgtaaaa ctatgtataa 14101 gaatggccac catttctggg tcagttgaac 
tacacaagta ccaaaagttg aatagttcca 14161 tgcataacac gaaatagaaa aatgggagtt ttttggagca ggtggcgata agaaaacgaa 14221 attttaattt atagctgacg ttgaaaaaaa ctcttgaact cagaaagtgc aagaaacaaa 14281 atttctggga ctgcaaattc agtacatttt atactttgac tataaaacat ttttgtagct 14341 cactgtcttc tgaaaaaaag aactgggagt actttttttc cttagcgcaa tcctgtttat 14401 atacggttgc aaaaatcgtt atagcgttcg caacatggta attttgttca agaatcaaaa 14461 aactgcggta ctaagttttt ttttttttta gtttattttt ttttagtttt tttatttttt 14521 taaatttatt tagtttaact atgtaaagtt tttctgttcg tttgaaaatg aatttgtgtt 14581 cctgattact tcctggcacg tattcacatt ttcatggtga atcgagtctg gtgaagatgg 14641 gcgtacccac ggttcgacat gttgctggaa ttctcgtgct ttttgaaaaa tgaatatgcc 14701 gttctattaa ttttgtaacc ccgtaacaaa tgaaaaccgt tttgtaacgg ggtttcttag 14761 taataatatt gggtataagt acaacaataa caaccataat aacaatataa taataataat 14821 aataacaata acaataataa gtacaaccgc ttgataattg agtttgccat ttttagagca 14881 atggaaaata ttacattgtc attttctatt tattaattgt tatgatcaaa taaaaccttt 14941 ttcatccttc ccatattcat tggtaatgat tactctattt ttttccataa aacaactact 15001 tgatgaacga tgttacggcg aaacacgaag gttcgaaaaa aactaagaca gcttaaagtg 15061 cattgcaata aaaaatgatt tatcacttga ttgacgttga taagaaaata gaagcctacc 15121 tcaactttga atactccggg aaatgaagtt cgtgtagatg agaatgcgaa agtttgacgt 15181 ggtcagttat agaattaaat cagtttttca aactgtagta acaaatatta acagctgctc 15241 gaaaaaaagt ttttaaacat ggtgccaggt ggtcagtaat acactgtatt ttgaagcata 15301 tgacagaagt tgaaacactt ttttttcaat taactacaag aaaattacaa aaataaaata 15361 aagcctagtg atgattagta tttgcaccat atatcataat tgcttgaaca gaaaacattg 15421 ttcataaatc cccgcaattt tttaactgtc tttcacactc cgggtcatcg aagtttgcaa 15481 gacgtaagct tttcaggtaa tggaacactg atgtatcctt acctgtacac gtgatttcag 15541 aaagttggag agccgttttt gaattggtgc tccgttcaaa ttgactctaa aagttattca 15601 cgaaatccac atatagttga atgttttttt agtattgtgg aataatttta aaacaattat 15661 tctaaatact agagaaccat ttttagtaca aaactgatta ttttatccca tcccgttgta 15721 caacgtaaca gaaaagcata atttgaaagt ttctcggagc acactttcaa tccaatgaac 15781 tgtaaagaaa 
attgttgtaa caagattttt ctcaaatttt gccacatgac ttcgaggaga 15841 ctgaaaaatt ttaaccaaat cgacaggtgt ctaaacaata gatagatggc aaatattctt 15901 aatctaaact tggggtgacc agaattacaa actttgtaac aaaatgtcgt agacttttcg 15961 cacaacagtt ttagttttat ttttgttccg gcaattctga tgttcctctt gcttccatat 16021 gccaaaaatc cgctgcttca tgcattatgg cagtgtatcc ataattgtcg cggttaggag 16081 cgaatatttt atgaaatagt tccccaagta aacgtccaca tcacacggat aataatagca 16141 atgattcatt ttccattcaa gaaaaaaaag taatcgggtt aaaataaata aaagtacaca 16201 cgggaaatta aaactaagtg catagaaata ctcaatctac acttaatgct caaacttgaa 16261 gagcacaaaa actcaatttt tcacataaaa ttcataaatt tcaaagcttt agatgttgct 16321 cttcacggca cgcacagtct catcaatcgc cttggcgaca tattccacgt ttttcgtgtt 16381 aagaccacaa atgttgatgc ggccgtctct gagcaggaat actttgtgat tggcaatcag 16441 gtgatccact tgggcagctg aaaattaaga ttgtcctcat cagagtgtta aaaaaagcaa 16501 actcacaagt aagtccggtg tagctgaaca ttccaatttg ttgaatgatg tgatcccaag 16561 ttccaggagt gccaagatcc atcaaatggc gcaataaggc ggctctcatc tgcttgatac 16621 gagaagacat cgcttgaata gactggttcc attgttcacg gcgagctggg gtggtgagca 16681 ccttgtgcac aattcttgct ccatgtgctg gtggattgga ccagttggca cggataacca 16741 acgacatttg agactggaat ccggcaatga cagctggatt gttgacgacg acagtaaggt 16801 ttccgacacg ctcgtctgaa attttgaagt attttacttt tcatcttcta ttcaaaactt 16861 actgtaaagt ccaaagttct tggcaaatga ttgggacaca accatctcca ttccctggtc 16921 tacaaagtag cggatcgccc aggcgtccgc agctggatca ccagatgcga atccttggta 16981 agcaatgtca aagaaggtga agaggttctt tctcttgatc acctcggcaa ccaacttcca 17041 ttgctcctga gttgggtcca ttccagttgg gttgtgagca catccgtgaa gaatgatgac 17101 agacttctct ggggctgact caagatcgga gaggaacttc tcaatgtgca cacgtttgtt 17161 atcatagtcc cagaaggtgt agtcggcgac ggtggtgaaa ccagccttct tgaagacaag 17221 cttgtgattt cccctgaaaa tgataaaaat tcatgatttg atgaatttcg cccagcatta 17281 tcagaggttc aggaaaagta ccttacacct ctttgaaaga gccttgctga aactcatttt 17341 ttacctagat tacctaggtt tatggtaata ctgccaagaa ttgggcggct attttaagaa 17401 gttgaagttc tggttgactg ctactcactc tggccaatac aagacttgtt atattttatc 
17461 agataagtat gtttggctcc gatttttcga ccacttacca tgttgggttg ctcacgtaga 17521 cggtcttcat gttacacact gaagcaagaa actcggctcc agcacgaaga gctccggttc 17581 cagagagaca ctgtactcca aacgatcttt cctccttgat agctggcgat tcggctccga 17641 ggacaagctc ggtggcagcc ttgcggaaac cctcgtgacc gagaactgga agatactcgt 17701 gattgagcga ggtgtcgttg gcgatttcga cttcagtctc atgaacgact ggaagcaccc 17761 atggctgtcc ctcctctgtg cggtaggctc caattgtcag gttaactttg acgggagccg 17821 tctcatcgag gtacatcttg ttcttgtgga acacctcgat tggtggagca actgggattc 17881 cgtcaaagaa ggacatgttg agtgttgttt ctggaattaa aaaaaatcaa taagtttagt 17941 tatagaaatt gaagagatga aaagagtggc ggtatgccgc aaaacgcgat aatgaggtgt 18001 atttatatag cgaaagagat caaaatcgaa aaatgaccac gttaaaagga ggtcacctca 18061 tcgacattcg agaggattta cttggaaaca aaaaacgttt taggtgtttc caataaaaat 18121 cgatttgatg ataaaaatta ttatacgata agaaacgtaa aactttctac tttcttattt 18181 tagcactcaa gaaatcaaca gaaacgtgtg ttgataagac tacgggccca gtttcaagac 18241 tgaactagat ggttactttc tattgcacca aagtcgacag ttcacaacaa aagttacagg 18301 aaattttgcc tagttttcgg acaaattgag gaaagggggt ctcaaaaacg gactgcctct 18361 tctcgacaac tctttctcgt cggcccgccc tccaataagc acacactatt tccgagaaaa 18421 caacacatgc gggtagaaaa cgagaggcag cgttgttgtt tgccagagac gtttttcgag 18481 acgtgaccat tttgaaaatt tttatttcga ggaaaacgtg atggcaaggt ggagagaacg 18541 ggcaaaacgc agcgcaatga gatcgaaatg tgtcatttgt gagggtttca aagtgttctt 18601 tgataaggca aattactttt ttgaaaattg aaaaaaaaga ttggatattt atctaggtta 18661 atgtaataag atcctaggaa taaatgaaat tatatcaaaa ttccctcacc aaaaatgaaa 18721 aactattttt ttttttgaaa ttttctagtg gaagttttct tctgtttttg cctatagaaa 18781 aaccttgaaa gagctgattt tttcaaaatc cctttacaaa aggaatcaag gttaacctat 18841 acaactacct cgtgaacttc aaagattcaa aaaccaaacg accacttcaa aaattattgg 18901 caaaatgacc ttttgaaatg agtgatggtg cgaaaccgcc tagataaata aatataacaa 18961 tctcacgtgg ccaggcggaa aaacatgttt atgtaaattt tcgaaacgaa aaataatcgg 19021 ttttcttttt ggggtgatga tggcgggaca tgatgaaaag aaatggaaca gtatacacga 19081 gaaaatccaa gttgctgggt atttagaaat atggatgaga 
actaaacaac attccgcccg 19141 gtaaacgcct ttgaatgaat gagaagtaaa aatgaaaaaa aatcacagaa aaatattcac 19201 cgtaatgaga atttgcatac aagatttccg gtagaaagaa gggtgagaga agaggtaaag 19261 aaaaaggcga taagaggcgc ctccgggccg ccttaaagcc tcaattctac gtggcgaagt 19321 tctcaacaaa gttttgtaaa atttacgaca actaaaaatt ttttcagcac attctgattt 19381 attattttag tcaccgactt taatgggtgg cctttaatga ttttgaagtt aatttgaaaa 19441 tttttaaaat caataaagat gaacatggac atggattttt cacaaaacta acaattaaga 19501 ggcaacaaaa acaaacataa tgatagttgt tagaaaaatg gaaattgaac aaaaatttca 19561 aaaacaaaga aagcaagtca acaagtgtct aacgaagaag taaacttttc tcatcacttg 19621 gacgccgagt gtgctcaaaa ccattgtgct ccgaaatatg acacttttca actcgtccca 19681 tcgttcaaag ttattattcc ctctcatttt ttttcccgcg ctctcgttgg ctctcgtatc 19741 atgagctctc gttgacaagt aacttttaag tttcaaaatt tatctatagt tgttaaagta 19801 taatgttttc ctagaagctg taagcttggc aggtgacttt ccctgcggga attgcgaaaa 19861 aaatagagac tgaataaaac gcatatactg tttcatattg tctggcacgt gccttgtggc 19921 aggaggtata ccaatacact gccccgttct gatccgatta ctttgacaaa agtcgccgag 19981 tctgatactc acaaacagta gcgtcaagtg aggagggctc actttaccag aatttattat 20041 gaatgtttct ggtatcatca cattgtttca actttccacg cgctgttaca ttgcgttttg 20101 aatcgatttt ttggtattct gccaaatata gcttcagtgc gttgcactcg agcatctgaa 20161 catatttgct ttttgtttgg cgagaggaag gaaaaagcac tgaaaagtgc ttaaaaatcg 20221 ttttgaaata aggattagta actggattag agctagagca ttaagtgata ttaggcaatt 20281 tttgttattt ctaaaaatat ggaactatgc gggtggaatt ttttgttcac atgaacgtgt 20341 caaaaaaatt gttaatgttc ataccctcat ttacatttcc acttttatgg cattttggaa 20401 aaagatggct atattcgccc tgttagtttt agaaaacctt ttccactaga tattgaaatt 20461 tcgaagtgaa gttttcgctt tcatcaaaca ttcttcgaga gctagcttca attgttctcc 20521 ttcatttttt ctattgcttt atgttgcatt gaatagctaa aaaaaaattc aaaaagctat 20581 cccttcaaaa aaagaaaatt cgctgcttct cgacgtgaaa aacaatgaaa aaagtgttcg 20641 gtgagtactt gttctattta cacagcaaac ctttttagca gccgagctgt tagaaaaacc 20701 aaaaaacgcc accaaaatat gcttattcac attttccgtc atcaccatcg ccgccgattt 20761 ctgcgcatga caaagtggct 
tttctctctt tttttggatt tcaaggtcta acattgccgt 20821 tctaatgccc gttttttagg ttacatttca aaataaccga ctattttcag aaatgttttc 20881 tcatcaagat cgagaccgcg ataggaagga ggacggtggt ggagacggta ctgagatgaa 20941 atccaagagt cggagtcagc caagtggact taatcgcgtg aaaaatcttt caagaaagct 21001 atcggcaaag tcaaggttag tttttgaggg gttttcgggg ggtcaaggga agtcaaaaac 21061 cgcaattcta aaattttgcg atgggtagcc acagaaggga tgttcttttc atttgaaaat 21121 gtgaaaaaga ttaatttttt ttaaaaccac actgatgagt ttagtgaaaa aattaaacat 21181 gcttctagcg cctcagctaa tcaactcaga ttttccgtac aaattacttt agtattacga 21241 aatattgaaa aattcatcgg ccctcagcaa attagattcc aattcgggaa aatactgaat 21301 tttccctgaa ttccgtgttt tgtctgtgtg aattaaagat tgttcacatt taaaataacg 21361 taccaaggtt tttttttgaa aactttctag atcactagtt tttgaaagac taaatattaa 21421 ttttcagcaa ctttttgtaa acgcccatta cgattagcaa aacagaaaaa gataacagca 21481 aaaatgttca atgcaaaaac aaaagaaaag agactggagc cttaatgcaa aacgaacaat 21541 gtgggggccc ttcccgaata agtgagccgc gcggaacgct gccgccctcc actaaacaac 21601 gcgcggtacg atactccgtt catcggacca acttcccccc tctttaccta gttggcaaca 21661 ttttcttcgt attctcacgg attctcctca ttcattccgc tttcaaaccc aatttccgtg 21721 catttagcat tggaaaaaca cgtgccgcaa cgtatccgaa aatagtgtga gacgcagacg 21781 gctggcgccc cgcgtgaggc gaccaaaagg cttttcttct ttttgtcctc cgttgaaaat 21841 catggtaaat tgagtgatct cgtttgcgta tttgtgtgta tgagagagtg tgagtgtgtg 21901 tgtgtgtgtg tgtggacacg ccaaagcttg atggagtcac atgtatatca tgaaaaacac 21961 aaacagatac cacaccaacg aatcttgaac cttgagttct tccgccagtg ttcattttta 22021 acatctcttc atcgttctgc aaggagacct gtcaaaattt ccaacaaaaa aaagagtttc 22081 ccataaatta ggaaaattct gataggttcg gaagttttcc gtgaagtaac tgaatatatt 22141 ttcaatgatc acgagaaggc aattgaataa atgttagttg gaaaattttt attgagcttc 22201 tagattttgt tgtctaatat ttgagcctca aacttttaat ataggttatt attttttatt 22261 gcagccgaca attttttgtg gcaatttata tacaaaagca taaatatctg agatgcttag 22321 tttcatagaa attcagttca agaggcttca ccgtataata taatatattc ttgctgaaac 22381 tacaggaaaa tttagtggga acttgtgaaa tttgaaagct cccagattcg attttgtcat 22441 
tcttttatgc aatgcaaaaa ttatttttaa aaaactgttt tcgacatttt ctcatgattt 22501 gtgaaaaagt ttttttctca aatagaagtt aagctcattt ttctaaatag tttttctaaa 22561 ccctggattg tctcaaatcc gctgtagaaa agttgaagcg actgacatct tatgagctaa 22621 atttttcctg agtttttgaa atagactaca tcttgataat tgaacattac agtttgacaa 22681 aatttataaa aatattgatc agttggattt tattattttt ctgttgtttt gttctagctc 22741 cagtagggtg gatatttttc aaaaatctca aatttttgtg caacttctat tagtacaaat 22801 ataaaaatat ttcaaacgtg cactttttga acatggacat acgttaaatt tccaatataa 22861 agggcaaact ttcagctgag ttaggagaat atgagcagct gttaatcaca taaatctaac 22921 aaaaaaacca catgttccaa cttaaaattg atttcaaaat tttgtggcca aaatgaaacc 22981 ctacgatgtc agccgagttt actttttcaa agtgcagtta gaaaaattgt ctcaattttt 23041 tatcagtgga aacgtttttg gtagtctcac taattttaaa gtttaactcg ttttagtaga 23101 agtcatttaa aaaattcaaa aattgcggtg taacagttct tttaatactt tttcaaatgc 23161 cataaatcgt ctgtaattaa ttttctacca acttgacaat ttcaattagt aaaatcacag 23221 ttttaatgca taactttttg atgctagtcg tatcaaattg tgattattgt tgttatcttc 23281 accacttata ttacgagtgt atcaaatttc gatgacacaa tacggctatt attcattgag 23341 cgcttgtgtt ttctctatcg tctccactgt ctgaccttcg atgatgtttt tctacatgtt 23401 tttttgttct ctcaaaatag atgtttcagt tcctccaacc ttttgaaaat ctgtaattgg 23461 tgttgatttt ggaacctttc ctattgtctc ttctaccatt catatccggc acatttatgt 23521 caaaacactg actgcacctg ctgtttttga aatttagtgt ctggtgtctg ggcggcagtc 23581 ataagacctg atgggcatac ggttttccta gagactccta ctgcaatggt cattctgtga 23641 taagaagttg cttttttgtt tatccattca aaattaccat ccaaaaacca caccagttat 23701 caatttttaa gtctttgaga cccgtaaaat caatcaatca tgtccacatt gcacttttgc 23761 ctgtagaaaa acgcgatttt cggttttcaa tcttatctca tttctggttt tcgttcccat 23821 ttgactactt gttgtcaatt ttttaggtta ggtctaaata aatattaaag attttttaaa 23881 aatttaaatg aaagttcaat ttcttctttc tatttttttg ttcctaccta ccaacgctgt 23941 ttgcgccgga ggacgataga cataaggaaa cgaacacacg ggatgccttt cccttctatt 24001 tttctcctct ttttctctat ctctcgccat tgctttgtgc tgtatggtca actctcgttg 24061 tcttcgctgt gcgtgtgtgt ttgtctgttg ttagaaaacg aggaaaattg 
tgagaaagag 24121 tgagcgagga agatcgagag agaagagtgt gcgtgtgaga ggtgtagggg tcgcgagagt 24181 gtgcgggagg ttggcggacc ttttaccaca ttttcggcca aaggaaaaaa ggcatcgatc 24241 tttgttggtt gtagttctag acattttttg tttctgctat gttctcaaaa caccaaaact 24301 tgttcttaga tttctgatct ttgaaaaata catgctcttt atacacttgt ttttctcgtt 24361 gttttgctat ttgcaggagg ttcctcgtga ctgtatttat tgatttttta actgaacaat 24421 tattctgagg aacttgtgaa ttcgagattt tacttacgat catttaccta ccaacaacgt 24481 acgttttcag aaaggaacgc aaggaccgcg atagcacaga caacagttcg aaaatgtcgt 24541 ctccaggagg agaaacgtcg acgaagcagc aacaagagct caaagctcaa atcaaaatcg 24601 gacattacat tctcaaggag acactcggag ttggtacttt tggtaaagta aaaggtaagc 24661 tacaacacct taaaattcaa ccaaaccgga aagttagaaa ctttttcatt tgctgcaact 24721 tcctggcaac accataagct gccaaattgt tgcaaaaatt ttgtcccaat ctgccaaata 24781 ctgacatatt tatctttctt ttctgttcat ctagttttca cattttcagt tggaatccat 24841 gagacaactc aatacaaagt ggctgtcaag attctgaacc gtcagaagat caagtcactg 24901 gatgtcgttg gaaagattcg ccgcgaaatc caaaacctct cgctcttccg ccatccgcat 24961 atcatccgcc tctaccaagt catcagtaca ccttctgaca ttttcatgat tatggagcac 25021 gtttccggcg gggagctctt tgactacatt gttaagcacg gacggctgaa gaccgcagaa 25081 gctcgtcgct tctttcaaca aatcatttcc ggcgttgact actgtcatcg tcatatggtt 25141 gtccatagag atttgaagcc agagaatttg ttgctcgatg agcagaacaa tgtgaagatt 25201 gcggactttg gactttcaaa tattatgacg gatggtgact tcttacgcac cagctgcgga 25261 tcgccaaatt atgctgcccc tgaggttatt agcggaaagt gagtgattat ggtggtctcc 25321 tgatcatggt tcaatgtaaa ataaaaacca aattagaact gaataacaaa acatgaaaca 25381 tgtgttcgca aacaccaaga cgacattttg cactagtaga aatccaatta aagtaattga 25441 acaagattag acacaacaac tttgttttgc aggttgtacg caggtcccga agttgatgta 25501 tggtcgtgtg gagtcatttt gtatgcactt ctttgtggaa ccctgccatt tgatgatgag 25561 cacgtgccaa gtcttttcag aaaaattaaa tgtacgttag tgtgacatga gaatcaaaac 25621 acaaaaatgg caaaggatta cgacaacata ccaatttcat ttagacatat gaaagcgaaa 25681 tgggatttga aaattgaaat atccctatta caaaaggcac tattactatg ttttggcgca 25741 ctgcccgggt gataaatgtt tgaaatttaa 
attggaatca ttaaaaatta taacgttcac 25801 gaaatatgtt ttttagctgg cgtattccca actccagact ttctggagcg cccaattgta 25861 aatctgcttc accatatgct ctgcgtagac ccgatgaaga gggccaccat caaggacgtc 25921 atgtaagtgg tcttctaaaa agaaaaaaaa aacaaacatc gttgctatga gacgccaagg 25981 aaacaagaca ataactgctt tatattttca gtgctcacga gtggttccag aaggatttgc 26041 cgaactactt gttcccacca atcaacgaga gtgaggcttc cattgttgat attgaggctg 26101 tccgagaggt cactgaggta ttaatcgtgt tcgataggag cagaagtttc taatcacaat 26161 gaaaagtttc agcgctatca tgtcgccgaa gaggaagtca cctcagcatt gctgggagat 26221 gatccacatc accatttgtc gattgcatat aacttgattg ttgataacaa gagaattgcc 26281 gatgagactg ccaagttgtc aattgaggag ttttatcaag tgacgccgaa taagggacct 26341 ggaccagttc atcgccatcc agagcgcatt gcaggtagga gagggtgcat agaaaaactc 26401 ttggtggcaa agaataataa aacttttgca tcgaacatga aattcgtaaa acaatatcta 26461 aatcttcaat ttcagcgtca gtcagcagca agatcacacc aactctcgac aacacggaag 26521 ccagtggcgc gaaccgcaac aaacgtgcca agtggcatct gggtatccga tcccaaagtc 26581 gtccggaaga catcatgttc gaggtgttcc gtgcgatgaa gcagctggac atggagtgga 26641 aggtgctcaa cccatatcat gtgattgttc gtcgcaagcc tgatgcaccc gctgccgacc 26701 cgccaaagat gtcgttgcag ttgtaccaag tggatcagag aagttacttg ttggatttca 26761 agagtttggc cgacgaggag tctggatgta agggttttct ttgtttagat tgaacttttt 26821 gaaagtggaa cctgtttaaa atatttagtt aatcgttcta gaaaatacta gaaaatacaa 26881 gttagtgcat ttttttaaac taaaatcaca atttgaattt gaaaagccgt agaataatat 26941 tttgcaatgg tctctgaaca tgcccctcta taacctaagt ttctgcatgt agtctagctt 27001 aacgtgtcac aataattttg ttcagaaatg tttaaaacga agtttgcttt gcagttggaa 27061 aaataaaaag gtcaatggaa attctttttc aatagcgtca agtcttctca agacgatttt 27121 ttttgtatgc attttttcct aatgggtcaa atgaatatat aatttccacc tcaaccaata 27181 ttcattaaaa atttcagccg ccagtgcatc ttcatcccga cacgcatcaa tgtctatgcc 27241 gcagaagccg gccggaattc gtggaactag aacgtcaagc atgccacagg cgatgagtat 27301 ggaggcgagt attgagaaaa tggaagtgca tgatttttcg gacatgtcgg tgagaattaa 27361 tatttttctt ttatttcaat atttttattg tgaagtatta caaataactg gaattgtatt 27421 tggagtggtt 
gatatagtta taggtttttt aaaaatcttc gagtgacaag aagttgatgc 27481 cactaccttt aaataattaa cgcactaatt ttcaatgacc gagtattttc agtgcgatgt 27541 gacaccacca ccatctcctg gaggagctaa gctttcccag actatgcaat tctttgagat 27601 ttgcgccgca ttgattggaa cactggctcg ttaagcactc cttgatgcca acaatgctct 27661 gtgtaaaatt caacttttct gtctcaaaat ttatttcttc cattccttgt attaagcttg 27721 aaacccccgt ttcttatatt ggttcacttt gttgcagctt acagtatctt tgtatttttt 27781 ttcttgactc tgtttggttt tgcccctccc cccttcacct ccgtaaatgc cacgtatttt 27841 aattgatgct attctagtgc cgctttcatt ttcagccaat tttgacaagt aaccatacaa 27901 ataattattt aaagttatga cttgacttgt cacccccttt gtttctgtga atagattttc 27961 cttcagcgag tgattgtttc tgttttttat atgtttctct tcagtagttt ttttttctgt 28021 cttttaaact ccccttctcc tacttctttg tgtctattgt aagtaaaata ttaattgaat 28081 aatttcctac ccaaaattat tcaattattc tcatgtaaat ttttatttta agacacaaca 28141 ttgcaatggt aactttaaaa caagaaaaaa caaattcaga aagatccaga cgcaattttc 28201 tttgccttct cagccttctt ctttttgtac agagagtagt tatgttcggc actcatgtag 28261 gcagtttcca agaaaaggaa cacaaattcg caacaggtaa ggaaggatat tccgcaccaa 28321 agaccgagtt gtccaccaaa atcggctagc aagttgacaa actagaaata gttgagataa 28381 attttttgga ataaaaataa attttttttt gcctgctcgc tagtagtaat tcggcatttt 28441 ttcgaatttt tttgatacat ttctttgata cacgttcttt gcctaaattt ttcaactcac 28501 cccataagcc tctgattcag tgagcatttc aaaattcaac tgctcgtaga acacttccac 28561 cattgctccg ttctctctca aataggccca attttaaagt gtgttttttc ttgaattaac 28621 ttacttataa tgcttattac actctaccgc tgtaccatta cacgatccta gttgaatttg 28681 caaagataac gacggccact ttgccggcga gtatgtaacg gagtagattg actggcggca 28741 tggttgttgg catcttaaaa ttgaaattgc agttttttat ttaaaaagta tataacaatt 28801 caaaagttac ctgcaacgga aagatccgtg taggcctccc aagtcattca ttctggcgtc 28861 aagacatttt cctgcagaaa aaaatattta tggagatgtc aaatttgaaa ctaacttgca 28921 atagggtctg ctgcatcgca atgccgtgca ttttcaggga ctgggaaacg tggatctcca 28981 catctgcact ctttcagcac gagttgttgg aagcaagaac ggtagcagcc ctggaatagt 29041 tccctctgtt atcatattaa gttctatatt taattgctat tttaacaaga atcacaaagt 
29101 ttctatgacc catttttcct actaaacttg cttaaattct gtttactatt cacctctacc 29161 gaatattcat aattgctgta aatatagtcc gatgttttgc catctggcac acaatctcca 29221 taaggtgctg gcaaacgtga catctttcgc tgaaaacaac aacaccatta ctctataaaa 29281 agagtttgaa tttctaatac caatcgtaat ccaaatgagg atacatatcc agttggagca 29341 gaataaccga acgtatcagg aaatgggaaa tcttctttgt catgaatagt caaacgaacg 29401 cctgtggctt ccgtggttgg catatagtca gacgcgttta cataaaccag catacgtaat 29461 ccgtacatgg gacctggaat cggaacacgt tttttggttt atacataacg gacttcaaaa 29521 attattttag aaaaaaatat gtggaaaaaa ttttgaggaa actaacctgc tcgaatacta 29581 gtcaagttta ctgttcgatt atgattgaag gtaaagcacg aaccaaacgc agggtcaata 29641 tgagtcagaa aatctctgaa taattgaatt tatatcatat ctgaaagcct aactaacgct 29701 tcgatatcac acgcttttcc gttaaacgag cacttgtgga caagttccct ttttgtagta 29761 cttagccgtt ccctatcttg cattgacaag gtagccattg caaacatgat gttttccttg 29821 gctttagtga caattgcaac ttcatctgtc attccctgaa acttttcagt ttccacttga 29881 tttatatatt tgtgattgtt acctgaaatc ccatagcttc aaggaagttt ggatcttcag 29941 ttggagttcc tccttgaaga tatgtccaaa tttcaattgg tggcgtcttt ccgttgtatg 30001 ctacacagaa tctagatgga gcacaaatac atggggatct atggcccttc gctgaaaaat 30061 tccgatttag aaaaaaacaa caataaaaat taagaagagt aaaatcaaac aagtgaagct 30121 agtttggcaa agtgattgaa aaacctacca gttttgttat ctttggtgca gaaagcatgt 30181 tcattgcaag tatcacattc ggtggtttcc caggttccgt tcagaaaaca gggccacgca 30241 tcatgagttg atctatcgaa agcgcaaata caccgatcgt caaatcccgt cggctcgtcg 30301 aattttgttc tgaacgagaa aaaagatttg tggagaacat ttcgaaacgt ctcattatga 30361 acttctgtat tttggaacca tatgaatttc aaatttatgg aaggcaaaat tgagccttct 30421 caaaaattgg acttatcaaa atggacaaac tttaccgtac ctcaaaattg taagactaga 30481 ctttttcaaa cacttttttg tcaaagttgt tagactagac tttacctctc tgattgacat 30541 tcttccatca tattcattcc agtagttgct tccacatcgt aattctcata atgctcattg 30601 tcatattctt ccattccatc ccattcctca tcggcatctg tgttcaaaag aatgttttac 30661 gcgatgctca tttgacatgc aacactcacc attaatatta aatacctttt tggtagttgt 30721 ttcaagtaac tcttcctcct tctcctcatc cttatcctct 
tgctcactag acccttggct 30781 tccacaaagg catcttgcaa atcctggctc aaaaaatgct ccagataaat cacgttttcc 30841 tcgacgacgt gccggttttg tggtgggtgc aggggtggtg ggtggttcgg tgacaacttc 30901 gcgttcctct tcgtgatctt tgtttcctcc ggcttttccc attgctccat caaatgctga 30961 caacgttcgc tttactaaat ccacgcttgt tgctaaactt gctttgtaag gattcaaatt 31021 acaaagcgta attgctggaa aaggtgcagt gtctggaaat aaaaaagctc agtaaggagc 31081 acatgatttt tcaaactacc aaatttcaat tgaatatcga caattttctc attcctgttg 31141 tatttgtcaa gcacagattg agcattaaga tagagcatta tcatacatcc aagaaaaagt 31201 acgacccaaa ctgctcgata atatacgttt ggagcttcac caatcatggg aattccgtga 31261 gcagatgttt tgtagcaaaa ctctttgaaa tgccaggcta gacgtttgtc atagggtagc 31321 aattttggat cgaattctcc cgtaataagt tcacactgaa attcggaaat gttagattac 31381 tcagctgcga ctttttgcaa gttttatttt ttttaccgtt tattgaacat tttaagcgtt 31441 ttgattatat tgttccaaaa caaaaaatct tggtttttaa ttttggtaca gttttgcaga 31501 taagctatca aagtaaaata aacaagttga ttcagcaaaa acttagtagt tataaagaca 31561 aaatacttat cgaagaatta agaaaatacg ggccttcaaa tttacagttg aaaaacttca 31621 atgtatataa cgaatatttc aaaaactggg aacaaaatac aattgcatac aaaaataaaa 31681 tttaccgttg caatttgaaa attacgtaaa ctcacttgta cttctgattc tgtagagttg 31741 aaacattcgc cataaccaaa atcttcataa tattctcttt ctgtcacaaa tttagtcgtc 31801 tcttgtaagt acgctaacgg gtctccataa acctgggaca tgtactcgga tgggtcccga 31861 aggtgttggt agtttttcag gttttgcatc catgacattc tataacttga tagcgataaa 31921 aaaaatagca t // lucy.info100644000766000024 10114006427353 16175 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataTaLr1010B10R CLR 86 505 CLB 11 577 CLN 11 577 CLZ 0 0 CLV 86 523 P39765.gb100644000766000024 4635214006427353 15555 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS P39765 181 aa linear BCT 03-MAR-2009 DEFINITION RecName: Full=Bifunctional protein pyrR; Includes: RecName: Full=Pyrimidine operon regulatory protein; Includes: RecName: Full=Uracil phosphoribosyltransferase; Short=UPRTase. 
ACCESSION P39765 VERSION P39765.2 GI:20141743 DBSOURCE UniProtKB: locus PYRR_BACSU, accession P39765; class: standard. extra accessions:P25982,Q45483 created: Feb 1, 1995. sequence updated: Jan 31, 2002. annotation updated: Mar 3, 2009. xrefs: M59757.2, AAA21265.2, Z99112.2, CAB13421.1, U48870.1, AAB57770.1, B57986, NP_389430.1, 1A3C_A, 1A4X_A, 1A4X_B xrefs (non-sequence databases): PDBsum:1A3C, PDBsum:1A4X, GeneID:938030, GenomeReviews:AL009126_GR, KEGG:bsu:BSU15470, NMPDR:fig|224308.1.peg.1549, SubtiList:BG10712, HOGENOM:P39765, BioCyc:BSUB224308:BSU1549-MON, BRENDA:2.4.2.9, GO:0003723, GO:0004845, GO:0009116, GO:0006355, GO:0006353, HAMAP:MF_01219, InterPro:IPR000836, Pfam:PF00156, PROSITE:PS00103 KEYWORDS 3D-structure; Complete proteome; Glycosyltransferase; RNA-binding; Transcription; Transcription regulation; Transcription termination; Transferase. SOURCE Bacillus subtilis ORGANISM Bacillus subtilis Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. REFERENCE 1 (residues 1 to 181) AUTHORS Quinn,C.L., Stephenson,B.T. and Switzer,R.L. TITLE Functional organization and nucleotide sequence of the Bacillus subtilis pyrimidine biosynthetic operon JOURNAL J. Biol. Chem. 266 (14), 9113-9127 (1991) PUBMED 1709162 REMARK NUCLEOTIDE SEQUENCE [GENOMIC DNA]. STRAIN=1A610, and JH861 REFERENCE 2 (residues 1 to 181) AUTHORS Turner,R.J., Lu,Y. and Switzer,R.L. TITLE Regulation of the Bacillus subtilis pyrimidine biosynthetic (pyr) gene cluster by an autogenous transcriptional attenuation mechanism JOURNAL J. Bacteriol. 176 (12), 3708-3722 (1994) PUBMED 8206849 REMARK SEQUENCE REVISION, AND CHARACTERIZATION. REFERENCE 3 (residues 1 to 181) AUTHORS Switzer,R.L. TITLE Direct Submission JOURNAL Submitted (??-MAY-1999) REMARK SEQUENCE REVISION TO 34 AND 53. 
REFERENCE 4 (residues 1 to 181) AUTHORS Kunst,F., Ogasawara,N., Moszer,I., Albertini,A.M., Alloni,G., Azevedo,V., Bertero,M.G., Bessieres,P., Bolotin,A., Borchert,S., Borriss,R., Boursier,L., Brans,A., Braun,M., Brignell,S.C., Bron,S., Brouillet,S., Bruschi,C.V., Caldwell,B., Capuano,V., Carter,N.M., Choi,S.-K., Codani,J.-J., Connerton,I.F., Cummings,N.J., Daniel,R.A., Denizot,F., Devine,K.M., Duesterhoeft,A., Ehrlich,S.D., Emmerson,P.T., Entian,K.-D., Errington,J., Fabret,C., Ferrari,E., Foulger,D., Fritz,C., Fujita,M., Fujita,Y., Fuma,S., Galizzi,A., Galleron,N., Ghim,S.-Y., Glaser,P., Goffeau,A., Golightly,E.J., Grandi,G., Guiseppi,G., Guy,B.J., Haga,K., Haiech,J., Harwood,C.R., Henaut,A., Hilbert,H., Holsappel,S., Hosono,S., Hullo,M.-F., Itaya,M., Jones,L.-M., Joris,B., Karamata,D., Kasahara,Y., Klaerr-Blanchard,M., Klein,C., Kobayashi,Y., Koetter,P., Koningstein,G., Krogh,S., Kumano,M., Kurita,K., Lapidus,A., Lardinois,S., Lauber,J., Lazarevic,V., Lee,S.-M., Levine,A., Liu,H., Masuda,S., Mauel,C., Medigue,C., Medina,N., Mellado,R.P., Mizuno,M., Moestl,D., Nakai,S., Noback,M., Noone,D., O'Reilly,M., Ogawa,K., Ogiwara,A., Oudega,B., Park,S.-H., Parro,V., Pohl,T.M., Portetelle,D., Porwollik,S., Prescott,A.M., Presecan,E., Pujic,P., Purnelle,B., Rapoport,G., Rey,M., Reynolds,S., Rieger,M., Rivolta,C., Rocha,E., Roche,B., Rose,M., Sadaie,Y., Sato,T., Scanlan,E., Schleich,S., Schroeter,R., Scoffone,F., Sekiguchi,J., Sekowska,A., Seror,S.J., Serror,P., Shin,B.-S., Soldo,B., Sorokin,A., Tacconi,E., Takagi,T., Takahashi,H., Takemaru,K., Takeuchi,M., Tamakoshi,A., Tanaka,T., Terpstra,P., Tognoni,A., Tosato,V., Uchiyama,S., Vandenbol,M., Vannier,F., Vassarotti,A., Viari,A., Wambutt,R., Wedler,E., Wedler,H., Weitzenegger,T., Winters,P., Wipat,A., Yamamoto,H., Yamane,K., Yasumoto,K., Yata,K., Yoshida,K., Yoshikawa,H.-F., Zumstein,E., Yoshikawa,H. and Danchin,A. 
TITLE The complete genome sequence of the gram-positive bacterium Bacillus subtilis JOURNAL Nature 390 (6657), 249-256 (1997) PUBMED 9384377 REMARK NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. STRAIN=168 REFERENCE 5 (residues 1 to 181) AUTHORS Pragai,Z., Tjalsma,H., Bolhuis,A., van Dijl,J.M., Venema,G. and Bron,S. TITLE The signal peptidase II (Isp) gene of Bacillus subtilis JOURNAL Microbiology (Reading, Engl.) 143 (PT 4), 1327-1333 (1997) PUBMED 9141696 REMARK NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-173. STRAIN=168 REFERENCE 6 (residues 1 to 181) AUTHORS Martinussen,J., Glaser,P., Andersen,P.S. and Saxild,H.H. TITLE Two genes encoding uracil phosphoribosyltransferase are present in Bacillus subtilis JOURNAL J. Bacteriol. 177 (1), 271-274 (1995) PUBMED 7798145 REMARK ENZYME ACTIVITY. REFERENCE 7 (residues 1 to 181) AUTHORS Turner,R.J., Bonner,E.R., Grabner,G.K. and Switzer,R.L. TITLE Purification and characterization of Bacillus subtilis PyrR, a bifunctional pyr mRNA-binding attenuation protein/uracil phosphoribosyltransferase JOURNAL J. Biol. Chem. 273 (10), 5932-5938 (1998) PUBMED 9488732 REMARK CHARACTERIZATION, AND MASS SPECTROMETRY. REFERENCE 8 (residues 1 to 181) AUTHORS Savacool,H.K. and Switzer,R.L. TITLE Characterization of the interaction of Bacillus subtilis PyrR with pyr mRNA by site-directed mutagenesis of the protein JOURNAL J. Bacteriol. 184 (9), 2521-2528 (2002) PUBMED 11948166 REMARK MUTAGENESIS OF ARG-15; THR-18; ARG-19; HIS-22; ARG-27; THR-41; HIS-140; ARG-141; ARG-146 AND LYS-152. REFERENCE 9 (residues 1 to 181) AUTHORS Tomchick,D.R., Turner,R.J., Switzer,R.L. and Smith,J.L. TITLE Adaptation of an enzyme to regulatory function: structure of Bacillus subtilis PyrR, a pyr RNA-binding attenuation protein and uracil phosphoribosyltransferase JOURNAL Structure 6 (3), 337-350 (1998) PUBMED 9551555 REMARK X-RAY CRYSTALLOGRAPHY (1.6 ANGSTROMS). COMMENT On or before Sep 27, 2005 this sequence version replaced gi:11376215, gi:730440. 
[FUNCTION] Regulates transcriptional attenuation of the pyrimidine nucleotide (pyr) operon by binding in a uridine-dependent manner to specific sites on pyr mRNA. This disrupts an antiterminator hairpin in the RNA and favors formation of a downstream transcription terminator, leading to a reduced expression of downstream genes. [FUNCTION] Also displays a weak uracil phosphoribosyltransferase activity which is not physiologically significant. [CATALYTIC ACTIVITY] UMP + diphosphate = uracil + 5-phospho-alpha-D-ribose 1-diphosphate. [BIOPHYSICOCHEMICAL PROPERTIES] pH dependence: Optimum pH is 8.2 for UPRTase activity. [SUBUNIT] Homodimer and homohexamer; in equilibrium. [MASS SPECTROMETRY] Mass=20263; Mass_error=2; Method=Electrospray; Range=1-181; Source=PubMed:9488732. [MISCELLANEOUS] Mutagenesis studies identified four amino acid residues that seem to be involved directly in binding of the protein to pyr mRNA: Thr-18, His-22, Arg-141 and Arg-146. Arg-27 and Lys-152 were also likely to be involved in RNA-binding, but mutations may have altered their subunit-subunit interactions. Arg-19 was implicated in pyr regulation, but a specific role in RNA-binding could not be demonstrated. [MISCELLANEOUS] UMP and UTP incresase the affinity of pyrR for RNA. [SIMILARITY] Belongs to the purine/pyrimidine phosphoribosyltransferase family. PyrR subfamily. FEATURES Location/Qualifiers source 1..181 /organism="Bacillus subtilis" /db_xref="taxon:1423" gene 1..181 /gene="pyrR" /locus_tag="BSU15470" Protein 1..181 /gene="pyrR" /locus_tag="BSU15470" /product="Bifunctional protein pyrR" /EC_number="2.4.2.9" Region 1..181 /gene="pyrR" /locus_tag="BSU15470" /region_name="Mature chain" /experiment="experimental evidence, no additional details recorded" /note="Bifunctional protein pyrR. /FTId=PRO_0000183030." 
Region 3..179 /gene="pyrR" /locus_tag="BSU15470" /region_name="Pribosyltran" /note="Phosphoribosyl transferase domain; cl00309" /db_xref="CDD:119714" Region 4..8 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Region 10..28 /gene="pyrR" /locus_tag="BSU15470" /region_name="Helical region" /experiment="experimental evidence, no additional details recorded" Site 15 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="R->Q: No effect on ability to regulate the pyr operon; no effect on uprtase activity." Site 18 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="T->A: No effect on ability to regulate the pyr operon only in presence of excess pyrimidines; reduced affinity for RNA; no effect on UPRTase activity." Site 19 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="R->Q: Loss of ability to regulate the pyr operon; no effect on UPRTase activity." Site 22 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="H->A: Loss of ability to regulate the pyr operon and to bind to RNA; no effect on UPRTase activity." Site 27 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="R->Q: No effect on ability to regulate the pyr operon only in presence of excess pyrimidines; reduced affinity for RNA; no effect on UPRTase activity." 
Region 35..40 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Region 41..58 /gene="pyrR" /locus_tag="BSU15470" /region_name="Helical region" /experiment="experimental evidence, no additional details recorded" Region 41..42 /gene="pyrR" /locus_tag="BSU15470" /region_name="Region of interest in the sequence" /inference="non-experimental evidence, no additional details recorded" /note="Substrate-binding (By similarity)." Site 41 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="T->I: Reduced ability to regulate the pyr operon; reduced affinity for RNA; loss of UPRTase activity." Region 63..70 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Region 86..91 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Region 99..111 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Region 101..113 /gene="pyrR" /locus_tag="BSU15470" /region_name="Short sequence motif of biological interest" /inference="non-experimental evidence, no additional details recorded" /note="PRPP-binding (By similarity)." Region 105..113 /gene="pyrR" /locus_tag="BSU15470" /region_name="Region of interest in the sequence" /inference="non-experimental evidence, no additional details recorded" /note="Substrate binding (By similarity)." 
Region 112..124 /gene="pyrR" /locus_tag="BSU15470" /region_name="Helical region" /experiment="experimental evidence, no additional details recorded" Region 128..137 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Site 138 /gene="pyrR" /locus_tag="BSU15470" /site_type="binding" /inference="non-experimental evidence, no additional details recorded" /note="Substrate (By similarity)." Site 140 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="H->A: Reduced ability to regulate the pyr operon; decreased UPRTase activity." Site 141 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="R->Q: Loss of ability to regulate the pyr operon; highly reduced affinity for RNA; no effect on UPRTase activity." Region 142..144 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Site 146 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="R->Q: Reduced ability to regulate the pyr operon, and loss of ability to bind to RNA; no effect on UPRTase activity." Region 148..153 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Site 152 /gene="pyrR" /locus_tag="BSU15470" /site_type="mutagenized" /experiment="experimental evidence, no additional details recorded" /note="K->Q: No effect on ability to regulate the pyr operon only in presence of excess pyrimidines; reduced affinity for RNA; no effect on UPRTase activity." 
Region 161..165 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" Site 162 /gene="pyrR" /locus_tag="BSU15470" /site_type="binding" /inference="non-experimental evidence, no additional details recorded" /note="Substrate (By similarity)." Region 167..170 /gene="pyrR" /locus_tag="BSU15470" /region_name="Helical region" /experiment="experimental evidence, no additional details recorded" Region 174..178 /gene="pyrR" /locus_tag="BSU15470" /region_name="Beta-strand region" /experiment="experimental evidence, no additional details recorded" ORIGIN 1 mnqkavilde qairraltri ahemiernkg mnncilvgik trgiylakrl aerieqiegn 61 pvtvgeidit lyrddlskkt sndeplvkga dipvditdqk vilvddvlyt grtvragmda 121 lvdvgrpssi qlavlvdrgh relpiradyi gkniptskse kvmvqldevd qndlvaiyen 181 e // test.game100644000766000024 13032414006427353 16251 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data Caenorhabditis elegans L16622 taxon:6239 GTGAACAGAAAAAAACGTTGAAAGGGAGGAAGAAGTCGTCACACAGATTCTAAAGTGACT TTGAGAGACGAAAATGTGTTCGTTAATCTTCTAGAGAAAACGAGAAAAAAAAACTGATGC AGTAAAAACTAAAAGAAGAAACTAATTCCTTTGCTCTTCGTAATGAGATTGTTTGGGTGG GCTTCGCTGTTTCAGAATTCAGTGGAACTCTTGAAATTTAAGGAAATGAAACTACTGTAG TGTGCATATTCATTTAATTTAGATTTAAAACAAATCTGGACAATATGCTTTCTTATCAGA ATATTCACTAGCAACTGTTATAAAAATAAAATTAGTACAATTTTTTATTTATTTTCAATT GAAAGTTCCGAATAAGGGATTGAATTCTAGGATTCTTCAAACTCAGATATAGTGAAGAAA TGACATTGTCAAGTTGATAGATTGATTGGTAAGCTTGATGCCATTGTAATTTGAGACTAG CTCCTAAAATTAATAAATTTCATATTAAATCAGAAAACAGAAAGTTGTCAAATCTAGAAG CATACTCATCGAATAGACAAGATTTGTTATTATTCTAAAATTTAAAAACTTGTGACTATG ATTAAAGAAGAAAAAGTCACAAGGTAGTCAAGTCTCATGGGTGTTCAGTTTTTGTTCCTA GAAACGGAATCAATGAAACACTCTTGGCTCACTACTTCCAACCGGTAGATTTAAATTATA CCGATTCCCCGGTGTGTCCGCGCCTCCAACTCCGGAAACCGTCTCTAAAAATGGAGTTAG GGTACATGAACATTCTACAGAGCGTTCATTAGACATTCTGCATATCTTTGAACTTTTTTT TCCTTTTTTAATACTTATTAAACGGAAGAGTAAAGTTTCTAAAGTATCAATAAAACCTAA CCAATTTAGAACATAAAACTTCAAGAGAGACCTAAAAATGTAAAATGTGCATTCTTAACC 
AATTACTTTCCATATTTTCTGTTTTTTCTATCCAATTTTGCTTCTTTCGGAAATGGCAAA AAAGACGTCGTTCTGTCTGGCAATGTTGAGCTCTATCAATCAGGCTGCCGTTTGGATCAG AAAAAGGGGAAAATTGAAGGATGAAAAGAATGGGTACTTCTTGAACTTTTTTCATCAATT CTTCTTTCTAACTTCATTCGTGCATTTTTCTGTTTTGGCTCTCCAGTGGCTTCTGCCGTT TCGTCAGGAAAATCTCATTAGATGAGCTATTGTCGTGAACTTTTTTCTGCCGGTTTTAAA TTTCTTTAGGGGCACGTGTATTAGATAAATATGGGTGGGAAGGAAAAAAAAACAAACACG AAAAAAATGAAAATGAGACAAGCGAGATGGAAGATTTTGGAAGAACTTTGATAGTTCAAA GAAAACAAGTAAAACAATAAACTATGTAAACGCAATTAATAAAATTTCGTAGTACAAAAA ACCTTGAGTTAAAATCCAGGTAAAGTCGAACGTCAGGGATTCTTTTTAAAATAAATTCCG CAAAATTTTTTTGGAGATATAAAACAAAAACTCGATAAATAGAGGAAAAAACTAACTGGC TAACATAATAAGATACTAAAGTTTGTCTTTAATCATACATACAATCAAACCTGTCTGCCT ACCTTTCAATCTGTATTTTACCATGTTTACAATTATTGGTAATAGGCACACCTGTAGGCA CAAGGCAGGTAAGCATGTCATACACGGGAAAATTGGAATTTTAAGCAACTGCATACTGCA ATATAATTCATTTACCCAATTTGAATATTAGAAGCTCAAAAATGTTTTGAAGTAAAAAAG CTTTTAGACTTTAGATTTAAAAAAATATATGTCCATTCAAATATTCCGTCAATATTCCGA AAACTTAAAAAATTAAATTAAATAATCAGAAGTTCTAATACCCTTTTCGATTCCCATGAC ACTGAATTACAATGAAAAATTTTACGTTTTCAAAAAAATGAGCGCAACATTTTTAGCATG TAGACCAAAAAATGAAGTCGGGGGAAATTGAAAATTGTCAGTCAACAACAAATAAGCATT TCTAATAAATATTTATGCATTGGGCTTTTGAAAACCTGCATAACACTAACATTTCACGGT TTGATTCATGACGATTTCCTTTTGCTTAAAATTTTAATTTTTTCATGAAAAGCTCATCAA TTGTAAAACAGAATGAATTTTCAAAAGACGTTTTTGAATATAAAAAATGAGTTTTCATTT TTCAACGAAAATTTACGTTGCTCAGTGACGTTTTATTACAGAAGAGGGTAAGAAGCAGTG ATGCCTTTTTTTTTGCAAAGTTATGGATGTGCATTTTGACTTTTTCTCAAATTTTCGAAG TGTATTCAATTTTTTAAATAGGAAAAAGTTATGACTTAATACCGGGACAATTTTTCCTCC TATCTGCAGAACTTTCTCATAGTTCTGAAACTCGGTGTGATTTCCGACAGAGTTATAAAT TCACCATGAACTTTTGCAAAATTTGATTAGATTAGAAAACTTGCACCAAAAAGACAAACT TATCCAACATTTGAAGATATTAATTTCATACAAAAAATTTCCTACAGTGAAGGCTTAACT CGGTTTGCAGCTAAATCAAATTTTCATTTATGGATCTTATTTTATCATCGTTGTAATAAA ATTAAACATTTTTCATAAGCTTTCAATTTTCCAAAAAAAATGAACATTAATTAAAAATCC AACAGAAAAAAAAACTTGAATCACTATTTGCAATGTGAAAATGGGGGGAAATCCAGTACA CATTTTACATTTATAGACAAAGTATTTTTTCTACGCTGAAAACATTCTGAAGAGTCTCTC GAAAGTTGGTTTGAGAACTAGACATACTTCGCAATAATCTGATAACATTTTCTGGTAGAT 
GTACTGCTAGAACACTATTTTTTGAAAATAAAATTTCTTATTTTTCTGAAAACGTTTTGT AAAATGAAAAATAAATTCTCTAAAAAATCCATAATTACATAAATGATAAATGGTAAACTA TGCACTTGAAAAGTGGCATTCTTCTGACTTGAAATTGTCTAGACAGTTTGAACAATTCCC AAGTCTGATGATTCTAAAAATAAAAAACAAATTTTGAAAGTATCAAAAATGTTTGAACAA AAAAAGGTTCTAGCTAAAGTTCCAAGGAACTTGATTTTGCGAAAAATCTTCTCCAACTCT CTGGTGACACCCTTCGATTATCTCTCCAACTCACAAGTTTTTCAGTCAACTCGCCGTTTA TTTCCGAAAAAAATAGAAAGAGTGGAAGAGATGAACGAAAATCTATTCGATAGTGTTCTC TTGTTTTCATTTAAAATTCTCTTTCAGATGTTTGAATTTTTCCAGTTGTTTTCAAGAAGC TCGGCAAACTTATCTCGCATTCATTTAAAGTTTGAATTTATGTTCTTACTTGATTAGACT GGAAAAGAAAAAGAAAAACACATGTTTAATTGAAACTTCAATAACTTGTTCTAGCTTCTG TGACCTTTCCAATCAGAACAATTATCACGAATGATTACTTGTGCTTACTCAATTTCATGT ACTTTTGCTTAGAATTGATGTAAATGTGTTCAGAACCATAGAAAAATTCCGAAGTTTTCG AAATGCAGTTACCGTCCACTTCCATTTCAGAATTTCATGCTTCATATTATCAGCATTTGA AGGCCTACTCCCATTATCCAGTCTGATGTGTAACCCACACTATGTGACGAATATTCTGAT GAAATTTAGTAAGCACAAGTTATCATTTTTAACAGTTATTTTTATCGTGACTGTCACAAA CTCCAATTTTCTTTTCGAACAATACCGACCTGGTGCATTTTTCTTGAAAGATATTAATCT CTATATTGCAATACAATATTGTAAGATAGTTCAAAGTCTTAGAATTGGTCTACATAACTC CGTAATTGGAATTAAAGTAAAGGCACCGGACTACCTACCGACCACTCATTGTCAGCATTT GGCACTAGCTTCTTGCCAAAGTTGGCCAAACGTTTATAGTTTTCTGCAATTTTTTGTTAC TTTTTAGTTGAGTGGTATAGTTGTTGGCAGTAGGTGCGCTTGTCGACACGAGGCAGGTAT TTCTGTGCCTACAGGCCTTCTACTTCAGACATGATTTCTGCAACAATTTAGATGACTAAA ATCTGATGAAATTGGGAAATCGCCAGACATCATTTTCTAGAAACATTTTGATCGCAATGA TCTTGGCAGAAAGTCTATTTCATTATCATCTCATCTTTAAAAGAACATAATTTTTAAGAA AAATCTATTGGCAAACTGTGTTTCTACTTTACTTCACAGGAAGAGCAACGTTTGTGTGTG TCTGTATTTTGTGCCAATATGTGGTTTGTAGCACAGCTATGCTACAAAGTTCAAATTCGT TTTGAAGGGGGAGAGAGGAACCCCGTGATTTCTGTGTTGTCTTATTCTCATCATTAAACT CTGAACATTCAGCAATTTTGATCCATAATACTCAAACTTCTAGTTTCATCATCCTCATCT TTTCTGGTGCCCCATTATCCTCTGATTCTAAAATATAGTTTTCCACCGTGAACTTCTCAA TTTTTTCCAAATTCTTTAGCGTACTTTACTTCTGAAAAACTGTGCAATCTCATTTTCAGA ATCTCCACTCAATTTTCATTCATTCAGTATGCAAACTATAAATTCGAGGTAATTTAAGGA TCAAAAAGTCCTTGGGAAGGTCAATAACCAATGACATGCCGCGAAGAAAAACGCAGGCAA TCCTTGAAAAGAGACGATACGCAGACTATTAAACAGAGATGTTTGTCATCATTTCTGTCT 
TCTCGTTCTTCTCTATTTCACTCTCCTCCATAAAAAGACTATTTATACCAAACTATTCTT CACTTTGAGCATCACTTTCTGAAGACTAGACTTAATTTTTGCCTTCTTTAAAAATCTGAA GTATTGAAGATTTTATTTCATTTAAATTTGATTTAGAACTCTTGTCTTTTACTATTCCAT CAAAATATAAATTAAATTTTCAGACTTATTCGCCGATTCATTTCTCAAAGAAACATGACA GTGCTTGCTGGAGTAAACAGTAAAATCGTCAAAAATGGTAAGGCTTCATTTCCAATTTTA GTTCAAGAATTCCCAAAAAAAGATAAAAAATTAATCACAGAGTAATATAATAATTGTATG TCCATATTGCATATGCAATACTATTTTTCTCAATCAATAACTTCAAGTGCAACTCTGATA GGGTTGCAATTCTTTTTCAGGGAAAATATGGTATTTAATTTGCTCAAATTGTAAAGAAGT GTTCCACTTTATCAAAAAATAGTTGGATATGATTAAAGTTCAAGTTTAATATTATTTATT AGAAAATCCTATCTTAGTTTTGCGGAAATTTAATATTATTTTTTCAGGTGACCCAGCGCC AGCTCCTCCGGCTGCTGGAACCATTCGTATCTACAACATGCGATTCTGCCCGTGGCTCAA CGTGCTCTAATCTATGCATCTGTTAAAAATATACCGAGCGATGTTATCAATGTTCACTTG CAAGAAAAACCCGACTGGTACTTCTCAAAACATTACAAAGGACAAGTTCCGACATTGGAG CACGACGAAGGAAAGAAGCATGTTATTGAATCAGCTGTGATTCCAGAATATTTGGATGAT ATCTATCCAGAGACTCGTATTCTTCCAACTGATCCTTACGAGAAGGTTCAGCAAAAGTTA TTGTTGGATAGAATCAGTGGACAAGTGTCACCTGCATTCTACGGAGTCGTTCAAGCTGTC AAAAATCCAGATCTCCGCGAAGAAAAATTTGCGGATATCAAGAAGGCGTATGATAATGCT GAGCAGTTATTGACTGGAGATTTCTATTCAGGTATTTTTTAAGAATTAAGATTGAGAATG CGAAACTTATAAAGAACAGAAACTAAGTTGTAGTTTTAGTCTTGTGTATTTTAAAATTAT TTGAATTAATTTCAGGAACCTCAAAACCTGGATTTGTTGACTATCTTCTCTACCCGAACA TTCAACGCGCTTACTGGGCAGCACACATAGTTCCAGATTTCCCATTGGAAGCTGAATCGT TCCCAGGACCAAACTATCCAAGACTATCCAAATGGTACAAGGCTCTGGAATCGATTCCAG AAGTTGCCGCTGCCAGTCAGCCAACAGAGAATGGGGTTGGATTCTTCAAGGATTACCTCG GTGGATCTCCAAACTATGACTATGGATTGTAAACATTTGTTATTATATTTTTAAACTTTG TGTTGTGGATGTGAATATGTGGAATTTAATAAAACATTTCTCGATATAATAATGATTTTG TTGAATTAGAAAAATTAGAAAAGTGGACGATTCTAAAAACAAAAGTTACAACGAAAATCA TCGAAGGAAAAAACAACTGAATTCCAAAATAGTTTTCAGAGGTGATCACAAAATGTTCTC AAACGATATATATTCTACCATCAATAATTTTATTGGCACTATATCACAGTCCATAATTCC TGTGCTTTAATTATACTTTTCAGTATAGAACAATATGCTATATTATCAAGTTATGCGTCC AATAAACACAATTTATTTTTCAGACTGAATTTAAGCCATATTGAGAATAGCGAAATAAAA ACGTAGAGGAAATTTGTGATCGCCATTCACAATTAATTCTTAGATCGCAATGATAACAAA CTTCGATTCAAAAGTCATCATGCAAATTCACCGTTCTCGTGTGTGTGTGTTTTTGGAGGA 
AATAACACAATTTTGTGACTGATTTTTTTACAACATGTGGTTTGTAGCATAGTTCAAAGT CATTCTAGAGGGGGCTCAGAGGGAGTTCTTTCGCTATGTCATCGTTTGTTTTTGCACACC AAGAAAAATGAAAATAAATGCTCTAGGATGTCATGGATCGTTTCCATTCTTAATAAGTAG AAGCTAGGATTTCCTATACAAAAATAAGTAATCTTCGTTTCTACGTCTATCAACTTAAAT TTTTGTATACAATCCACTTTGGTAATATTCAAGGCCTTCCTGTAAAATGTTTTATGATCA ATCCGTTACACCAAGAAAACAAGTGCAATTTGTCATCATGTAGGCTTCCGCCTGTGTTTA CTTCCTTCCCCCAGCACAACACTGACTATTTATACCAAATTAATAATGCAGCATTCCTCA TGTGATAACTCGTTTGACTTTTATATCTTTCTACGTGCATCTTTCAAGCTCGAAAATTAA TTTTAAAAATTTACATTGCAGAACAATTGCGGAACGAAGAAGCATGTCAGTCCTATCAGG ATTAAACACTAAAGTTGTCAAAAATGGTACGTTATTGAGATTGTTTTTGTCTGACAACTG AATTACCAAGATTCTTTCAGGTGATCCAGCTCCAGCTCCACCAGCTTCAGGAACCATTCG TATCTACAATATGCGATATTGCCCATGGGCTCAACGTGCTCTAATCTTTGCGTCTCTCAA AAAGATCCCGACCGAAGTAATCAACATCCATTTGGATCAAAAACCAGATTGGTTTTTCAC GAAACATTACAAAGGACAAGTTCCAGCACTAGAGCACGACGAGGGAAAGAAAATTGTGAT TGAGTCAGCTGTGATTCCAGAATATTTGGATGATATCTATCCAGAACCTCGTATTATTCC AACTGACCATTACGAGAAAGTTCAGCAAAAGTTATTGTTGGATAGAATCAGTGGACAACT TTCTTCTGCATTCTACGGTGTTGTTCAAGCGGCCAAAATTTCTGATCTTCTCAAGGAAAA ACTAGTTGAACTTGCAAAAGCATATGATACAGCAGAGGAACTCTTGACTGGAGATTTTTA TTCAGGCACGAAATATTTTAATATTGAAACTAATCAGTTTAATTAAAATTAATTTAATAT TTAAATCTTCAGGAACATCAAAACCAGGATTTGTCGATTATCTCATATATCCGAACATTC AACGTGCTTTCTGGACTTCCCATATCATCAAGGATTTCCCATTAAAAGTAGAATCGTTCC CAGGACCAAACTATCCGAAACTATCTAAATGGTATAAAAGATTGGATTCAATTCCAGAAG TTATTGCTACCAGCCAACCAACTGAAACAGCAGTTGAGTTCTTCAAAAGCTGGATTATTG GAGCACCAAACTTTGACTATGGATTGTAAATATCTTCTCTTCGTTACAAATTGCATATGA GTATTTGAATGTAAATAAACTTTTTGGAGAGATTGAAGCAGACTTTTTTGTTTATCCAGA AAGTAAGACAAAGAGGGAGATGCTATTAGTGACAGATGATGGTTGAGAATAAGAGAAAAA GGATGAAAACGACGAGATCTGATTGTTGTCTCTCGTCCAGTCACTTTCCATTTCATTTTT ATAGAAATACTTCGGCGACGCGGATGCGTGTCTGCTTCTTGCAAACTATTCTTCCTCTTA TAAAGTTGGCTTTGAGGATTTATTTTTTTTTGAAAATTGTTTTCTTTTAACTGACACCGG AGTTTTATTTCTATAGGTTTTTATGAAAAGTTTTATCATGAAGTGAACTTGTCACCTCTG TTCGGTTTCTTGTGTTTTTGATATTAAGTCTAACTGTCCCTTTTTATGAAATGTAATTTT TTTAATTTTGTATAAGCTTCATACAGTACGGACGTTTTGAACATTTGATGAGTCAGAGTT 
AAAGAGAAAACTGATAATTTTTTTTCCATCTTTCTCCTCACTTGTGAATAAACTAAACGC ATTTCTGTGGACGTTCCAAGTGTAATATGAGAGTTGTTTTCATTTGGAAATGCGGGAATA TATTGAATCTTCCATTAGATGTTCAGGAATATATAAATACGTTGTCTGCTCTGAAAATTC ACACGGAAAATCTAAAAATTGTCAAATTATAGATTTCATTCTCAAATGACTATATAACAT TTTATTTTTGCAATTTCTTTTCAATTAGGAAACATTTCAAAAAGCTACGTTGTTTTTCAC ATTCAAAATGATTACTGTCGGTGCGTTCATTTTCCGAGTTTTTCCAATTTCACGCTTGCT CTTCTTCGTAAAAAACTCGTAATTTAGAAATTGTGTCTAGATCAAAAAAAAAATTTTCTG AGCAATCCTGAATCAGGCATGCTCTCTAAACAACTCTCAGATATCTGAGATATGGGAAGC AAATTTTGAGACCTTACTAGTTATAAAAATCATTAAAAATCAACGCCGACAGTTTCTCAC AGAAACTTAAACCGAAAAATCCCAACGAAGACTTCAGCTCTTTTTTCTTTGAAATTTGAG ACAAAGGCCCGTTCTATTGTCTTTCCGACTCACATCGTTCATTAATAAATCGTTCTTTCT TCTACTTCATTCATCAATTTCCTCTTGACCAGAGAGAGTCCCTACTCTTGAAGCTCCTCT TCTTTACTCTTTTCTTACTTACGCACAAAAAGTCTCTCTATCACTGCGTCTCTCTATCCA TCTCTTCTACATGTCACTTGTCGTCTCTGCGCCTCTATAACACGTAACAATCTCTACCTT CAAGTTCTCTAGTCACCTGTCTTCGTCTATACCTTTTGCCACGAAAATTACTACGTAGAA GCTGTCCTATTGTAAAGATGAAACAGTTTGAAGAGAAATTGGATGATTGTGATCTATTGG TCTCAGAATTTGATGGATTTCTTTTCCATGTTTTCAATTTTAACGTCTATATTCTTACCT AGGTACTCATAATTTTAACTTTGTTTATATTTTTATAAACTTATAAGTTACAATTTTTAA ATCAGTTAACAACTTCCTATAATCAAATTGTATTCTATTTTTTTTGGCACAAACACATAT AAATGTCCAAATATTTGCGCACGAGTCACCCCTCTCCACTCATTTGCCGCCCAATTTTGA CGTTTTCTTCCTTGCACATTTTGACAGCATTTCTAATTTCAGGAAATTCTTCATATATCA ATTGGTCAGTCACAATTATCCTCCTCATTCTTGGACTTTACGCCTGTCATCGATTTTTGA ACATGAAGAAGTTGACTCGAGATGCACAGGAACCACTTTTACGTGAGTTTTTAAAGATTT TTTTTTTGAAAATTGATGTCTTGCATTTTATTTAGCTCACTGGATAGTAGAAAAAATATT TTTTTTATCTATTTGAAAATCAAATGTGTTAAAAAAATATTTTTGGAGAAAAATAACTGA AAGCTCCTTTCTGAATTATTGTTTTATTATTAAACATTTGTTTTCTTCTAACTTTATGTT TTTTAATGTTTTTTTTTTACTTTTTAAATCCTGAATTATTTTGTGAAAATTCAAACAGTT TCATTTTTAAAATTTCAAACCCTGATAAAAAGTTCAATATTTTTCACTGAACTTTAATTT TTTTAAAAGTTTATGAAAATTTCCTATGAAATTAAGTTCAGAAGTTTTTTAGCTCATATC CGCCCCTCCACAAGGAATAAAATTCGAAAATATATTTATGGAACTATTTTTATTTTATCA ATTTTTCTCCTTTATCGATCACTGAACAGTCCAGACACATCAAAACACGGAATTGGCAGA AATGGAGATGTAAGTTTTGAGATTTATTGCAACAAATAATTTACAAAATAATTTCAGTTT 
ATTGAATATGAGCCAAAAGCAGGACCCACGATAAAAGAGCCTGTAGAGAATATAGTTAAA TTGGACGTTTATATGGAAGCACAGTGTCCGGATACATCTAGGTGAGCAGTTAGTAATTAA ATTAATTTAATATTTGATTTATTTTAAGATTTTTCCGTCAACAACTTAAAAAAGCGTGGG ATATTCTAGGAAGGCTAAATCGAATCGAATTGAATGTAATTCCATTTGGAAAAGCGAGGT GTACAGAGAAAGGAAACGATTTCGAGTGAGTTTTTTTTGTTAATTGATTTTAAATCTGAT CATAAAATATTGCAGATGTCAATGTCAGCATGGTCCGACAGAATGTCAGATTAATCAATT AATGAATTGTGTCATTGATCGATTTGGGTTTCCACATAGATATTTGCCAGGTGTTTTGTG TATGCAGGGAAAATATTCATTAGATGAGGCAATGAAATGTGTTACTGAGAATTATCCATC TGAATATGAAAGGTATGTATTTTGTGCCGTAAATGCATAGTTAGACCAACGAATACTTTT TAAAATCATACGAAATATATTTTCATATATTATCACTGAATATAATAGTTAATGAAAGAG TAATGCTCATTTTTCAGTTCAACTTTATTTTTCAAAGAATATTGAATTTTAGAATGCGTG AATGTGCATCAGGAACTCGAGGTCGCCGCCTTCTTGCTCTTTCCGGACAGAAAACTGCAT CACTAACTCCAGCAATTGACTTCATTCCCTGGATTGTTATTAATGGTTCACGTAACTCGG ATGCTCTTTATGATCTAACACAGAATGTCTGTGAAGCAATGCAACCAATGCCATCTGCAT GCAAAGATTACTTACGTTCATTACAATAATCACATCTTTTACGGGTTGACTTTTCGTCTT ATAGTTTTTTTTAAAATACAATTGGTGTCTATCTATGAGTGCCTTTCACAACTCGGCGGG TCCTAAAATTGTTTATTATATTTATTTAAATTTTTGTTGTAGTTTGTGTTAGTGTGACTA ACTTATTGTGTTAATTTTCTTAAAAAGAACGTTTTTTATTAAAATAAAAAGTTGCAAATT GTAAAAGTTTGTGTTTATCACATTATGATATTTTGGGCAATTGTGAGGATCTATTAAAAA TTTATAAATCTCTTTGACAGTGTGTGGGAAAAATAAGTTATTTTTAGCTTCTGATATTTT CTAGGATTAACAGAAAAAACAGCAAATTTCAGGTATACCCGCTTGCCAGTTCGTGATCAA CTCCAGTGTTTTCCAAAAAAACAAATCTACCCTTCCCCAGCTTCAGATGTTACAAACTCG ATAAAATTTGTTTCAGAAACATCTCTTCAGTGTGACCACAAACTAGTCTTTCGCTTCCTT TTAACAACAAAAAATGGAAAAAGAAGGAGGGATTTACAAGAGGCTACGACGATACGAATG AATGAAAACGATTTGATGCAATCAGCTGCTGCTTCTGCATTTGCCATTCAATTTGTCACC TTTCTGCCAAATTTACACGATCTGTTTTGAGTGTGGACTTTTTGAAAGTTTAAACCACTT TTCGTCAATTTTTAAATGATGTTTTTACTTCAGTTTTTATTATTTTGTTTTGCAAAAAAT ATTTCAGTATGCCTGCATTTTTTAAATATTTAAAGTTTGATTTTTTTTAACATCCAAGTA GAAATGATAGCTCACCTACTCCAACTAAATTTTGACCAACAACTGTCACTTCTATATTTG AAGACATAATTAACATAAATCTTGAATTTTTGAAGTAATTTTAATGTCTGAACATCTTGT TTTGAATCTTGTTTTTTTGCCGAAAAATTTGAAGAAAAAAGAAACTGAAATATTGCAAAC ATCGCCAGAATGCAGACGGTAGGGTTGAATAAGATAGAGGGCATTGAACCCTTTCTAATT 
TTCTGTTTTGCAAATTATTTTACAGTAGGTCTGAACTTCACAGTTTCATGGTACGCCCAA TTTTTAACTTCTTTTTTGAATTCAAATTTTCTAAACTACATTATCGATTTCCATGAAAAC AGTTGCATTAACTTCCTCTGACCATTCCAAGAATTTCTGGCTTACCAACCGACATCACTC TTGCCCCCTCGTCATTAAGCCGTAATTGATAGCGACAAAAAAAAAGAAAAGCCGGCTATT TTAATCGAATCTTCTTCATTTGAGAATGGAGGGTGCTACTTGAATGGGTGACAATTGACT CGTGAAATTCTTCTTTATCTTTTCTCCTTATTTTTCTCAGAATTTCTTCATCATCCACTT TTTTGGAGTTTCAAATGTTAATTGCAATCTGTCTCATTTTGGTAGTCATTTGGAAAACAC GGGGGAGGCGATAACAGGAAGCTTAAGGGATAGACATACACTTGCAATTGTCGAAAAAGC GATATCTTTAACGATTATTACGATTCTTTCAGTGTGACGTAATCCTAATCAGTTTATTTT TATTTTTTCTGAAAGCTTCTTTTACGAATTGCGCAATTAATAGTGTCAGTAGAAAAGGCA TAATTTTTGAAGAATATGCCAAAATATGTAAACCCTCTCCGTTAATAGCAGTAGCTAGTG ATCTAGACTATATGCAATACACACTAGTTGTCCAATTGAAACAGGTATCCACAATATTCA CGATTTTTGAAGTGTGATGTATTAGATAATCCTATCATTTTTTCCTCATCGGCCAGTACT TTTTTTGTTGTTATTTTTGCAATATCCTCCGCTTTTTATTGTTTTCCTATTCACACCTGT ATTTGATTCTGGTTTCCCAAAAAGAACAGGCATAGTTTTTGCGTTGGGAACTGGTTTTAT TTCAGCATATCTTCTCATTTCTCAACCAGAATTAGAAACATTTTTAGAACAATCACATTT ATAGCCTAAATTTTTACTAAAAATATCTGAAAAACATGATATACACTTTGTAGAATTTTT GAAAATAATATCCGCCTATCCATGATTTAACCTTATTATTCGAAATCTGTGAGATTCCTC AAAGTAGAAACATAAAATTTCAGGCACAACACAAAAGTCGGAACTCAATTAAAATCGAAT ACCCTGTTTGAGATGGCGTTTCTGGCTCGTAAAACGTCTTCTCTCCTACCAGCCACCACA TCCTCTACAGTCAAGCATATGATCTACGATGAACCACATTTTGCAATGCAGAACAGTTTG GCAAAACTTATCAAAGAGAAAATAAACCCAAATGTTGCACAATGGGAAAAGAGTGGAAGA TATCCAGCACATTTTGTGTTCAAAATGCTTGGACAACTTGGAGTATTTGCGGTGAATAAG CCTGTAGGTGAGGATACTTATTTTAAAGAAAAAATTTTGGAAGTTGAAAATTATTGAAGA CTATGGTGGGACTGGTCGAGATTTTGCAATGTCAATAGCAATAGCTGAACAAATTGGAGC AGTTGATTGTGGATCGATTCCAATGTCAGTCATGGTTCAAAGTGACATGAGTACTCCTGC TCTTGCACAATTTGGTGAGTTCTATAAAACTTATACTGTAACTTAATTGATATATCAGGC TCCGATTCACTCCGCAATCGCTTTCTTCGTCCTTCAATCAATGGTGATCTAGTTAGTTCA ATTGCAGTCTCCGAACCACATGCAGGATCAGATGTATCCGCAATTCGCACACATGCCCGT CGGTACGGCAGCGACTTGATAATAAATGGCTCAAAAATGTGGATAACAAATGGAGATCAG GCAGATTGGGCATGTGTTCTAGTAAATACTTCAAATGCGAAAAATTTGCACAAAAATAAG TCGCTGGTGTGTATTCCACTGGACTCAATTGGTGTACATCGATCAACTCCGTTGGATAAA 
TTAGGAATGAGAAGCTCCGATACAGTTCAACTATTTTTTGAAGATGTTAGGGTGAGTTTC TTAAAATGATCTACGGCCCCTTTAACCAATTTTAATAAATAATTCAATGTTCATTTCAAT CGAATCATTTTTCAGGTTCCCTCGTCATACATAATAGGCGAAGAAGGACGTGGATTTGCA TATCAAATGAATCAATTCAATGATGAGCGCCTTGTAACAGTTGCTGTTGGGCTTCTCCCA CTTCAAAAATGTATAAATGAGACGATTGAGTATGCAAGAGAACGATTAATATTTGGAAAG ACACTTCTCGATCAGCAATATGTTCAATTTCGGTTAGCCGAGTTGGAGGCTGAACTGGAG GCAACCCGTTCTTTGCTCTATCGAACAGTGCTGGCACGTTGCCAAGGCGAGGATGTGAGC ATGTTGACTGCGATGGCGAAATTGAAAATTGGAAGACTGGCAAGAAAAGTTACTGATCAG TGTCTACAGGTGAGGCGTTTTTGTTCTAAAATATACAAAAAATTCTCAAAATATGTATAT AAATCACTTGTAATATTCTCCATATTAGACTTGAATATTCCTTGCTCTTCTTTGTCAGAT TATATCTCGGTTGTATTTGTTTTTATGAAAACAAAATTGCCAACTAACAAAATTTGTGCA AAATAATTTGCTTTATTTTGGATGTTGAACTTTTTTTGATGAAATTAAGACAACCGAGAT ATAAACAGTCAAAGTATAGCAATGCAAGGATAATTCGGTATATGTTTTTGTGATCCCTCC AGTGGCAGTTTTTCATAACTTGATGGTTTTTTTATAGAAATGAATTGGAATAACGCTAAA GCTTCATTATTAATATTCTCTTAATTTCAGATCTGGGGAGGTGCTGGATATCTGAATGAC AATGGAATATCGAGAGCCTTCAGAGATTTCCGTATATTTTCGATTGGCGCTGGTTGCGAT GAAGTTATGATGCAGATTATTCATAAAACACAGTCCAAAAGGCAACAGAAAAGAATTTGA GAACATTTTTAAATGTTATATTTGTAAATACGAAAATAAAATGCAATTGTACTGAAAACG ATAAAAATAAAACAGCGAAAAAGTCATATTGTATAGAATTTGGCACGTATATCTACAACC AGTTTCTAGTGACCCAGGTATCTTGAAGTAAGTATTCAATGAATCAATTCAAGTTATTAT ATTTATATTTGTCCGCATCGGAAGGAAAGCGCAAAGAAGTTTCTCTCTCCGCCTCATCAA ATTTTTTGTGTTTGCATTTCAAAAATGACTGCAATGAAACGCGAATTACTGCGAGTAAGT AAAGTTAGTTTTGATAGAAACTACTGTATGAGAAAACCGGTTGAAAAGTAAAGATGAGCA GCAGTATTTCATGGAAAAAAGAGGGAGACAACAAGAGACGGAGTATATAAGGTGTCATGG ATGCTCCGAGAGTGTTTACTTCTTTGTTTCAATTTTCACACTTTTCATTCTTTTCATTCT TTTTGTTTTTCACAATTATTTAGCAGATCGGTAACTTTTTGCTTTGATAATTTCATAGAT ACTTTCGAATCGAAATTAATTTTCAAATTAGCCTACAGTAATTTTGCTCTCATCTCTGAG TTCTAGATCATGTTTCAATTTACCGAAAGTGTTTACACAAGTTACCAAGAAAACAAAAAA TTCAAGTTTCCGAAAATTATCAAATGTTTATCAAAAAGGTCCTATGATGTTTAAAACAAT TTTTCAAACTTCCAGAAAAATTTTAACTTACTGTTTCTTGAGCGTTTACAGTAACTCCGG TGTTTCCAGTAGGCATAGCTTACCTTGAAAGCAGGCAGGCGAAAATTTCTCTAGACCAAC CAGAATAACTTACTTTATTGCTAAGTTGAATCAAACAATTTTGTAAAAAAAACGAATTTT 
GGAATCATGATCCCTATTCAAGCTTCTAGTTGCTGGTCAGCTAGGTTTTGGGTTTTTTTT TGGAAAAATATTCAAAAAACATTTATATAATAGTTAGAATTAACATTTTTTGATAAAACC TCGACATTTTTGTTTTGTCTGAAAAAATAGGAAAATCTTACGTTTTTCGAAAAAACCCGT GCTCGTGAAAAGTATGTCCTCTGAGAGAAGTAATGTTTCATCTGACCAGTTGCAACTTTC TGTGTGCACATTCTTTTGATAAAATGGTATCACAGATCTATTCTAAAAGCCAACATCTAA ATTCTTTGCTCTATCTTTATCAGTTGATACGGATCTTCTCATCTCATTCGCCCACAATCT TCCCATACTAATTCATCAAACCCACTTGTAAATATACGCGCGGTTGATCAAAATTTGTGT GTGTTATGGCACATTGTGCAAATAGTTTTACCACACTTACATACTTCAACTCAAACCTTT GAGGAGTTTGACAGAGAGATGGAAAGATAGTCTGCAAACGGCAGATTTTTGAAGTTTCAC CGCTGTCCATCTAATTTTAGGTATTTTTCGGAATCTTTTGCAGGACGTTATCATCTATCT TTCCCGTTATCAATTAGTCATAATTATCCAATTAGTGGCAGTTGTTAGAAGAAATAGGTA ATATGCATAATAGTGTCATTTGCCATTGGCCACCTCCACCAAACTTTCGATTATGCCGTT TTCCGTTTTCTGTGTGTTTCTTCGTCCTTCCTCATCATTTCTCATTCGCTTTTTTTTCTT CCCATCTTTTCCAACATGTCGCACTAAGAGTGACCAAAAAACCTTTCAAATTTTGCGTGT TCTTTCGGTCTTTCCGGAAGGGACAAAAATCAAAACGACACTGGAATTATGAACTCATCC ATTTTCCACTTTAAAAGTTGAAAAAAGTAAACAGCGGGGTTATTGTGGTTTGATCTCTTT TAAAAATCAGTTAAATATAGGAGTCAAGACCTCAATGAGCACTCTTCAAGATATGGTTCT ACTAGACTCAACTTGAAGATTTTCAAGAGTTCTGGAGACTTTTTCAAGGCTACTGCTTTC AAGCTTCAGAATTTTAAACATTTTGGAAATAATCTTAAACTGGAGTTCAATAGCCAATTG AGCAATTTGTTATAACGTTTTTTTCTTAATTTTTTAAATTAGAATCAGTGTAAATTTATA AGTTTCAAAATTAGTTTTGCACTTATCTTTGGGCGTTACTGAATTTTTTACGTGGTGAAC CTTGAGAAAAAATTCTAAGGCTTCTAATTGAGAAAACTAATTTAAATTCCGCTCCCAGGA GTTACCAATTTTAATACGTTTCCAAAAAATTAAATATTCTTCGAATCTCATTTTTAAAGT TTCCATTTGGCACAAACCACAATAATTTAAGTAAGACGTTTGATCTATGCCGTAGTTTGT GTACTTCAACGTTTATCCTTAAGTACCTAGGCCCGTGTTTTTACAGCTCTGCTCTTTATC GGTACATACTGTTCTCTGTCTTTATTGATAGAAATTTTGAAAAATGCAACAATATGGTAT CTATCAGGTCGTCCCATAAGTTTTTGTACTTTTTTAAAACTTTTTGAACAAGTTCTAAAC TGACAGAACAAAATCGAATCTTTTATAAATGCGCATGTATAGTATGTACTACTTGTCAAA ATTTTTATGCGTTATTTCAATATCCTCCTGATAACAATCACGGAAACCAGAGCCACAAAT AGCGACATACCCAAATAATGGGAGGTGTTTTCCTTCGTCCTGCTATTCACAGGGAATTTA TCAATGAACATGAAAACATAGTATTAGTAAAGATAATGATTCAAAATACATGTTCAGTAT GGTTAAAATTATCATTAGCACTTATTAGCCGTTTTGGACGTGGACTATTTGGCTCATGTT 
TATCAAGCACTGAGTGAACATCTTCATGGAATAATTTCTCACTAAAAGTGATGGGATTAT TTTGATTGTTGTTTCTAATTTTATATAACAATACTTGCATAGTACAAATACAAACTTCGT TTTACTTGCTGATTTCTCAATCATAAATTAGAAGCCCAACACTATAAATGTCGGGTATCA CATGAGGTTGGCCATGTAGATTGTTTGAACGAAGAGGCCACCAGTAAAGTTTGTTAATTT ATTTATGATACATATATCCACTTCTAAATAACACTAGACTTAATTATCTATCTTTCATTC CGAGGACTAAATGGACCAATATATGCTTCAATCACTCCTATAGGCAATTGTTAAAAGTAC AAAATAGTGTGGTTACAATGTTCTCAATTATAACATCTCCCCATGACTGAAAAAATTAAA TTTTTTTAAAATTTTGACTGCACATGATGTGCACTTATCGTAAACATACACGATGCACCC GTTCCATTCCCAGCGGCTTCACAGGAATCAAAAACTCGGGCGCCATATTTAATTGGCCTC AACAATTGTGTTTAGCTACAGTAGTTTTTCCGGAATAGTTATACTAAATTTAAAATTATT TAAAACAAGAGTGTGGAAGCATCTACTTGACAGTATATATTAACCATTACTTTAAGCTCT GGGTGGTGTAGAACAAACTCCAGAAGGAATGGTGTAAAAAGCTGATTCTATAGTTACTCG TTTTTCTAAACAACCGCGGGGGCCTGGGATGCCAGAGTTATGTTGCAATAAGGTGACAAG TTGGTGACATGCTACCACTAATATAAAATCTTAGAATTGTCCGAAAAAGTTTTGGGAATA ATTCGAAAAAAAGTACAAAAACTTATGGGACGACCTGATAGGATATATGTTAAAAACTAT TTTTGAAAAAATATTTTATTTTGAACAATGAAATAAGGTTCCTGCCTCAAGGTTTCTTTT TGACGCGAACTCCGATGACATTTTAATTATCAAACGGTCTAAGTGAAAATTTATTGGACA ACTCTTTAGTTGAAGTGCACTTTAGGAGCAGGCATACATGAAGGCGTGAGGCAGGCGTAG GTCGCTTACGAGGCAGACAATTTTTAAAAAAATCACCATCCTTTTGTACTAATAAACACT CTCTAAAAGTTTGCAATGTTGTCTCCCAACACGAAAAGTTCAATCAACTTCTGCACTCAA TTTTTTTGCAAGATGACCCATTTGATTCAAGGGGGTTACCAGTAGACTTACCTGCAAAAA AACAGTATTCGTGCATAAATCCATCAAAATGAAGTGTGCGTCTTCTTCTTAGTTTCCGTC TCCCGTTGTTTCTTAATGTATACAGAAGATGTACGGGGCAGCAGCAGCAGAAAAAAGATT TGCGTACACCAAACACATCAAAACGATATGCGTGAAATGAGCGAATCGTCCGCATTCTCC CCTTTTTTCTTTCAATTTTCAAGGAGAGAGAAAACTCTGTGAGACAGTGAAGAAGTGGGG TTTTGACTGGAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACTGATTCTTA TCTGAGTTCCGACGACGATTCCACCGTTTTTTGGTCTGGTCTTCTTTCCTCCGCTTCTTC TTCTTCTACTTCTCTTTTCACGTCTTTCTCATATTTGGTTGTTTTTCAAGTTTTGAACTC TTTCTACTACATACTTTTCACATGTACCTTTAAAAAACTCATAATTCATTTTCCAATGTG TTGAAAACTACTGTAACTGCTTAAAAGTCAGAAACAGTAACGAAACTATTTTCATGATAA AATCAAAAATTGTTTCGATTCGAAAATGTTTTTATATACTCGACATGTGTGTACATGTGT AAACCAGTCGTTTCAAAAATTTTACAAAAAAATGTAAAGAAACTGTTCAGTGATCAGTAT 
GCTCCAGCTTCTTAGTTTCTTAGTTTCTAGGACTTCACACACTGCCTGCCTTCAAACTAC CGCCTATTAACATTTATTCCGGTCGCTCTTTTGTATTTATTGAGGAAATCAACTACTGTA GTTTTTTAAAAATTAATTTATTGATTTGGCAATTTTTCTTTTTTTTTCAAGATTCAAAAA TAAGAAATTGTATTTTACTCACCATTATTCAAAAAACTTGATGAAATGTTTAAATTTTAT GGTAAATGATCAAAACTAATTTTTTGAAAAATCGACGCCTTCTCTTTTGTTTCTGAATAT AGGCACTCCTAGCCGTGCAGGCAGGCATTTCTGTGCCTACACGAAAGCCCTAGAATACAT ACTTTCAAAATGTTCACTGTTTGTTCTTTTTCCCTGAACATAGAACATCCGGAAATTGCT TACATTGGAATTTCATTCAAGTACATTTGTAAAACTTACAAACACTTGAGAAAACAAAAC AAATCTGAAAACGTGTTGTCTGAATGCATTGGGGAGACAAGTTGGGTTCTTCCTTCCATT TACCTTGTTGTTGTTTTTTTTTCTGTTCAAAAATTTATTCGTTCTCTGGGTCAAATTGTT CAATTATAGTTGTTGTTTGCATTCGGTTGTTCCATTTTTCACAGTTTTCTCTCTTGCAAA AATTTTATAAAAGACTACAAGTAAATATAAAACCATGAACTCACATTTGGCACGGGCTAG AATTCTGACTTCTGCCCTTCATTTTCTGTTGAAAATTGTCAAATTCTTTTCAAGGTCTTT TATTTATTTTGTTCCTCGTGGAACATAACATTTTTGATATCTTTCAATATCAAACTTTAG TCTACTCATAAACTGTATGTACCTACCTACAGTAACCTCTCAGAAACATTATGATCCTAA GTATTTCTGATATTTGCTCCGAGGTAAAAGTAGATCCTACTTAATTTATAAATTAAGTGC GTCAGTATCTATGTTGATTTAGTTTTTGTGATCTTCCAACTAATCGTATCTATCAGAACA TCAGTAGCAAAAATCTAAAATCATGAGAGAGCATTTTCTAAAAAACACATAATTTTCAGA AGGCTTCATGGCGAAGGCGCATGCCGAATTTTTGCTTTTTGACTCGACATTATACAATTT TCCTTTAAAAAAATTAAGTTTTATTATGATATTTGGTGATTTAAAAATTATAAGATTAAT TTTTAATCATTGCTCTACGGGCCGCACTACGCCTTTAAGCCTCAGTTCCAATATAATCAG TTTGAATATTTAGTCACCGTTTTGACATTTACAGCTTAATCCAATTACTTTCAACTCAAT TTATCAAACTGTCAAAATGCCCACCTAATTGTTCTTTGCCAGACTTTTGAAACCAAAAGC CACACCCTCAATAGGTCATCTAAAAGTAGCTGTGGTCTCTCCAATGACCTCTATTGTAAA TTGGGCGGCAATTCTTCTTCATCATCAATTTCAGATGCATCACAAGCATCAAAAGCATCA TATATGAAGAACAGTAGAAAGAGGAAACTCATACTTTTCTAACCATTTTTTCTGATATAG ACACTGGCATAGACCAATTCAAAAGTGTTGTAATATTGTACCGGCTACTCTCTTCTTTGT GTTCTGCATTTTTCTGTCATTTTTCTTGAAGAAAAAGGCTTCTGACCTCTCCTCCAATGT CACCTTTTTAGTGTTTCACAATAGGATTATTCCCATGCTTTTCACACAGTCTCTGAATTT TTTGTTGTTCGAACCAAAAAAAAAAGAGAGTGTTTAGCTTTCTGTTTAGTTGGTTTTCGA CTTTTTTATTGACTTTTTTATTAATATTTTCGTCATCTCATCAATCATACATAATTAAGC GGACAAGGCATTAGGACCGGTGGGTGAGATGTGTTGTTGCGTTAGACGAATTCTAGATTA 
CGTGATTTTAGTCGAGATTGTGTCGTCTGTGAATATCGAAATAGCGATTTAATTAGTCAT GGGAAAAGTTGTCGTGGTAGAGATTCAAGTGACCTTTCTTTGCCGTTTTTATGGCTTAAT CTATAAAAGGCTTAAAGGCTTGTAGTCTTCTAACCGGACGATCTATTACATAATATAATG AAGGGGATATTTAACTAATTTTTTTAAAATTAAAATCCATAAGACTAGCTGAAATTTACC TTAGGAATGCTATAGAAATATCAAATTTCGTCAGTGTTCTATTTTTTCTGATCACCAAAA ATTATCAGTCGACAAAAAAAGTATATTCCATTTTTTATGCATCTACTGTGCCATGGGCAA TGGATTTAATTTATATTCTTGGTGAGCCCTCCCATGTAGGCTCAAAACTGCCAAAAATGT GCAGCAGACAGAAGCTCAGAGAGGCAGGCAGGTATGAAACAGACAAATATACCTAGATAC GTGCCTGCCTACGTTTTCAAAATGCATCGATTTTGTCTTGGAAAAGCTGCTAAAATCTTA CAGCTCCTGTGCCCTTCACATGCGCTGGAGTTCAACTCAAAATGGCAAAAATGTGTGGTA GGCTAATTTTTTGCCCGCCTACGTTGTTGCATTGAGACAACTTCTGAACGGATTTCGTGA ACAAATTTTCTTTCCAGATTAATTTTTTAAAACTTTGCTCCCAAAATTGCATCATTATCG GTAAAGTTCGGGTATCCCCTCTTCATCTCTTAGACTTACTGTGTAACAATTTGCATAAAT ATACACTGGTCTCTCTCAATCTTCTTTCATTTGATCACTTATACTTTTACACTTGGCACG CACACATAGGCGAACCGCCAGGCTTCTCCTTTGCCCTTCTGGCGGCCAACCCCCCCCCCC CCCCCCCCTCCCCCCGTTTTTTAAGTTTTCTTGTCATCTTGAAATGGTCGAAAAAATTAG TGTTGTATTCTTGCACTGACGATGATGATGATGAAGTGGAAGAAAGAACCTGTTCTCTCG GCCCACAAAAAATTCCAGAGGTGTTGGTCAATTTGACCTTCTAGACCATCACATTCCAGC GCGGCTTCATAAAGAAATCGTCTCGAGAAAATTGAATTCAAGAATAGAAAAAATTATATT TCAAATGAACTCGATTTTTGAATGCTGAATTCTTCTTCCTCACAATGACAGTCTCATTGA ATTATAACTCAATTTTTAGAATTTTTGATTCCGAGTGAACCACATTTCAGAAAGGAAAAA AGTTTTGATGATGTGTGTTCAATGTATAGGTTACTGCCACACCACACTACCAATCAACCG GTTCTAAAAAGACAAGCTTCTCTGGAATTCAGTTTCAGAACACTGGGGCGGGGCTTCAGG TTATGTTGGGAGTGGGAACCAAGAAGACACACGACTTCTTATTACTTTTTTAGTTTTACA AAAATTTCATTTTCTTTGAAACTGTTTTACTCAACTCAAAAAAACGAGGCCAGGAGAGAG TTCAATATTTATCGACTTTTATAAATTTTATATAATTTCACCATAAGTTTTATACTATTT TATGCAAGTACATAGAGTGTGCTCTTGGTAAAGCGTCTAATTTTTCCGAAAGTTCACTCT CTTTAATATTTTGGAATTTTATTTGTTTTAAAAGTTCGTTTTGAGGATAAACCTTGAAGA TTGGAAAACGGTAATCTTTGCAGCGATTTTTATAACTGGAAACATGCTATCAGTAACTTA ATTTTATTTGTTCTATAAAGTTAAAAAAACGTGAGTATAAAAATTTCAAAAAAATCCAAA AAACAGACTCCGCCCAAATCTTGGCTTCTTTAAATTTTTGTATAAAATGTTCGTGTGCTT ATAATTCAACATTTTGACTAAAATGGAATACTGTAAGAAAATACCCCCGAACTTGATATC 
AACCCATTATAAACTCTTGTGATTTTATAATTTACAAAAAATGATAGTTGACAATTATTT GAAAATTATGGAACCAATGTCCTAATCTACAGCCACATACAGTTATTCTTACAATTATAT CAAAAAATCATATTCCCGTTCCGTTTTCCCTCTTTCAAAAGTCGTTCATATCTGCTTATG GCACCTGTTGCGCCACCCCACGCCACCCAGATACAAGATTCTGAAAAAAAAACGAGTGAG AGTAAATGTAAGACGAAGAGATTTAGAGAAAGTGTCAAAATCCCACTGGTTTTTTATTCG ACATCCCCCTTTTTCTCACTCACTCGTTCGTCTCGCCATCGCCGTCGCAAAAAGTATCAT AAAGTTGCCCTCACTGAGAGAACGACTTGCCTTCCGCTGAGGATAGATGGTTCCCGCGCG GAAGGAGAAGGAATCCCCTCTCTTTCTCCGACACTTTCACCGCTTCTCATATGATGCCAT TCTCGGGAAATACATTCGTAATTGGATCATCTGGGACTGGCAAAAATGAGAAAGAGAAGG GGGAACGTTTTCTTTTCAACTGGAAAATAGTTACTGACCTTGAGCCAACTGGAAGCAGAA AATGTTTAATTAAGGGGAAGAAACTAAATAAGGAAAATTAGTGTAACTACATTTACTTTT ATGGTAGGAGATCACTTTGTAACTTACTAAGTTGATTTTGCTAAAAATAAAACTGATGGC AGACCATTATCATTAAGTTTTGTAAAGCATCAATTTGAACAGCAAAGTTGAAGCATAAGC CTATGCGGAAGCTTGAATCTAAGTCTATGCCTAAGCCTAAGACTAAGCCTACGCATAATT GTAAGCACCTTTTTTTCACATTTTGGAACATTTGGAAATTTACCAAAAACTTCAATCGAA ATCTTTGAAAACTCAAAGAAGTTGTGTTCTATTATGTTATTCGATCATTTTGGACTAATT TAAATATATTCAGACAAAACCCCTCAGTGACTTTTGAATAGGTCCAAACATTTTTTAATT TTACAGAAACGAATCCATCAATATTTTGTTATTTTGCTTTCATAGCCGAGGGCACAAGTG TCTTTAGATGAGATGACATCCTCAATTCTAGACACGGATTCTAGACATGGGCTGTTGCAA TTAGCATAGTTCTTGGTACTTTACGCATGTCACATTGAAAACTAAAGTGAGAATAGTTTC CCGTTCCAATTAATACGTCGCGCCCTCTGTTTCTCCCAAAACGATACCAATGGCATCACG ATTGAAAGCGGCAAAATATAGAGAGATTGAGAGAGAAACGGTTTCTATATACAGTGTAAA TTTGCCCCCTGTTGCCATCATCATCACATAGACATACACACTCCATAATAGCTAAGGTTA GCTTCCTGTCGCCCCCATCTCGGCAATAAAAATCGCTTTGATTTCAATCTCATTTGTTTT TTAACCACCAGTTTGCTTATTTTTACTTTATAAATTGAGTTCTCTTGTTTCCAAATTTTT TTGCAAATTACATACGTTGAGGACATTTAGTCATAATAACCATTCTTTGAATTTTTTAAA AGTTTAGTGGCTAAAAGTCTTTCGTGCTTTTTTTCTAGACATTTTTGAAAAACAGCAACA ACTTTCGAAAGTTTAATCAAATACTTTTAAATTGTAAAATGCCTATTTGTTTATGTATTT CTGTTATGAACAATAGAGTTCCAGTTTGTGTATCTATGATCTTTGATGACTACTAAACAG ACTTTGTTTTGATAGTTCCCGGTTTTTAGCAGTTTCAATTTGAGAAAGAGTCTAGGTATT TCCTTATCACATTTGATAAAGTAGCTAATGGTGGCCGTGAACTGTGATTACTCAATGATG ATGTCGAGGATTTTAGGGATTTGAACGGTTTTTCCAAATTTTAGTTAATTCAGTTTTTAT 
AGCTGAGCTAACTATTATAACTTAGCACTAATTCCATTTATTTTGACTCTCCAATTGTCA TCTGGCACAGTTAAGTATTTCTTAAAGTTTTCGGTTTCTACATTGCCTATAATTTGCTCA ATTTTTGACCATGCCGATATTTGATACTTTTCACCTTACTTATGTAAAAAGTCAACTACC TTTTTTTGCCGCATGGCTTTATTAGGCAAATAGCAAAAGTCCCCCTGAATCTTTTTGAAA ACTCTTATTTTTAGTCCCTTGAACATTTAATTGAACTGCAAAATTGATGCTCGTTAATCT ACATATATCCAGTAAATAGGCCGTAGGCAGGCAGGTGCAATATTTGATTATTTGTACCCA TCTAACATGCTGGTCTGCTTCTGCAGCCACTAGATCTAGTTTTCATTGTTCAGCAAAAAA TCCTAGGCAAGATTTACTAAAGTTGTTGCCCTTCCTGACCTAAATTACATGAATTTCGTT ACTCTAAAATCTTTCAACGACTTTTGTAACGCATTAATCTGTTTTTTTTCACTTCCTTGA TTATCTAATAGTTCTTCTTTTTCATCTTCAACATCCTTTTTCCCCTTTACTCTTTCCCTA CATCTTACCATCTCATCAACAATCTTTCATAAATCTCTCTTTTAACTTTCGATTCTTTCA GTATTCGGATTCATCTCAACATGCGTGTCACCCTTCATTACTTCCTTTTTATTACCATTT CTGTTTAGTAGTTGAACATTTGTACATGTGTGTTCTTCCTGTCTACAGTTTTGAATGGGG GCTCGAAACCTTTGATGATTCACGGTATTTTTTCTCTTGTTCTCAAGTTTTAGAAATATA GAATAACACCACATATTTCTGATAAGATCATTATATATAGTTATGATGTATGCTCTAACC AAATAAACCTTATCACTATGTTGTTTTTCCAAGTTTTCGGAAACCACAGCGTTCACTTCT ATTTCCTCCAGCTGTATCATCAACCGTTACCGCAATGAACATTGCGCAACACTCATTTTC CATCGGTTTCCTAGTTCTTCTTCATCTTCTGTCTCTCTTATATTTTATAGAAACATTGAA AGAGACTAGGAGATATAGAGAGAAAAAGAGACAATTGGCATTCGTTGAAAGAGAGAAGGC CTGTTGCGCAGAACCCTCTCTCTTGCATCCCTGCTTTTCCATCAGCTTCTAGCATCAGTA ATCGATCCGATTAGGTTAAGCTCATTTTAAGGATCATCTCACTCTTAGTCTTTAAATTTC ATTGAATACTGTAGATAGAACTCTAGACTTTGAACTGACGCTTTTCTCCAGGCTTTTAGG ATTTTGGGCAGAGCCGATAGTCATGGTTTGTGACAACTGCCGGTATAAATTTCACGTTTT TTCTGAAGAAAAAAAGCGTCCAGAATACCTCTGAGATTCAAATATCTCACATTAAATTTA AAAAAAAACTATTTAAAAATTTTCGCGGCGTTCGACAACTTCGGCAATTGCCGGTGTCCG AAAATATTTGCCTACGGCAACTAGGTTGCTGGTAGCCAAAAACATTTGGCGCAGTTCGGT TGCAAGATTTTTAGAAGACAAATTGATATTTTGAGGAAAAAATACGACTCCTAAGGTAAT TAAAAAATTATAGGCTCCAAAAATTTTGGATTGAAAAAAAGAATTAGAAAAATGTAATCA CACTTCGAGCAAAATTTGAAAATTACAATTTCGAAATCCGATAGATTTCCACTATAAACG ATTTCTAATACTTGTCTGATTTTTTTGAACCCAACCAGAACAAAAAACTTGAACCAGAAC AAACCAGAACAAAAAAAAACAAAAGACCTAAAACCCACAGCAAAGGGCAAAAACTACAAA AGGCGGAAACCATTTTTTCTAGACATTTTTCTTTATAGGACGTCAAAAAATGTAAGAAAA 
GACATAGGGTAGAGTGAGAGAATAAGAAATGGATGCAAAAACCGCAAAAACTGTTTATAT CTGACTATCGCTCACCCAAAATGCAATGTATCTGTCGTCTCTTCGTCTCTTTCTCTTTCA TTTTCCCTCTCTCTCTCTCTCTTCCTTGATAATATCTTGTGAAGATAAATGAATGAATTG GATGAGACGTTTGCTTTTTGGGTTCCCCTATTTCCTGAAAATCAGGAGGTTATTGGGTTA CTGGATAACTTCCTTGTGTCCTTTTTGTCGGTGTCCTAGAATACTGTAGACATTTTCAAG GTTATGATGTACCCATCTTTTTTCCCCATTTTGCATGTTTTACCTCTCCCTTTCATCTTT ACTATCTTGAGGGTCAATCTGAACACTCTTTTTTTATTTTGGTTTGATGTCGTGTACTAT TACCGCTTTCATCTTTCTTGCATGTGTAAGACTTTCAGTTAGCCTATAAACTAGAAACGG CAGCCGACACGTCCCAGGTTGTTATAGAACCCGTGATTTGTCGGCTGCTCATCCCTTTGT AGTTTGCATTTCGAGTAATTAATTAAAAATAAATGAACTTTTGCTAGCGATGTTTCCAAA TTTGTTTTTTTTATTTGTTTCAAAAATTTGCCCTGTCTACCAAAAAAGTTTTAAAAGCTC CGATGACTTTAGTTTGGAATTTTTCTAATTCAAATTCCAAGGCTATCCATACCTTTTTTT ACTAAAATTCCCATGTTTTTTTGGGTGTTTTTACCCGAGAGACAAAACGGCGACAAATTT CTCGAATTCCCCGCTCCTCTCCTAATGAGTTATGTGGTCGTTTGGAAAAAGGAAAACCGT TTGGCCAGCAGTGGCCACTGCTAAAAATGAAAAAAGACGATGATGGGTGATGATC 1 28735 source source chromosome III clone C02D5 db_xref taxon:6239 mol_type genomic DNA organism Caenorhabditis elegans strain Bristol N2 source source chromosome III clone C02D5 db_xref taxon:6239 mol_type genomic DNA organism Caenorhabditis elegans strain Bristol N2 source source 1 28735 C02D5.3 gene gene C02D5.3 C02D5.3 C02D5.3 transcript db_xref GI:32453032 db_xref WormBase:C02D5.3 gene C02D5.3 note contains similarity to Pfam domain PF02798 (Glutathione S-transferase, N-terminal domain) product Hypothetical protein C02D5.3 protein_id AAO12454.2 standard_name C02D5.3 C02D5.3 start_codon 5035 5037 C02D5.3:1 exon 5035 5077 C02D5.3:2 exon 5411 5851 C02D5.3:3 exon 5956 6208 C02D5.3:4 exon 7984 8008 translation from_gene[C02D5.3] cds_boundaries:(L16622:5035..8008) transcript_info:[C02D5.3] MTVLAGVNSKIVKNGCWNHSYLQHAILPVAQRALIYASVKNIPSDVINVHLQEKPDWYFS KHYKGQVPTLEHDEGKKHVIESAVIPEYLDDIYPETRILPTDPYEKVQQKLLLDRISGQV SPAFYGVVQAVKNPDLREEKFADIKKAYDNAEQLLTGDFYSGTSKPGFVDYLLYPNIQRA YWAAHIVPDFPLEAESFPGPNYPRLSKWYKALESIPEVAAASQPTENGVGFFKDYLGGSP NYDYGLTKLSETI C02D5.2 gene gene C02D5.2 
C02D5.2 C02D5.2 transcript db_xref GI:289605 db_xref WormBase:C02D5.2 gene C02D5.2 note contains similarity to Pfam domain PF03227 (Gamma interferon inducible lysosomal thiol reductase (GILT)); coded for by the following C. elegans cDNAs: CEESG21F, CEESG21R, CEESG21 product Hypothetical protein C02D5.2 protein_id AAA27914.1 standard_name C02D5.2 C02D5.2 start_codon 9783 9785 C02D5.2:1 exon 9783 10002 C02D5.2:2 exon 10373 10509 C02D5.2:3 exon 10558 10661 C02D5.2:4 exon 10709 10825 C02D5.2:5 exon 10876 11052 C02D5.2:6 exon 11213 11429 translation from_gene[C02D5.2] cds_boundaries:(L16622:9783..11429) transcript_info:[C02D5.2] MSKYLRTSHPSPLICRPILTFSSLHILTAFLISGNSSYINWSVTIILLILGLYACHRFLN MKKLTRDAQEPLLPHIRPSTRNKIRKYIYGTIFILSIFLLYRSLNSPDTSKHGIGRNGDF IEYEPKAGPTIKEPVENIVKLDVYMEAQCPDTSRFFRQQLKKAWDILGRLNRIELNVIPF GKARCTEKGNDFECQCQHGPTECQINQLMNCVIDRFGFPHRYLPGVLCMQGKYSLDEAMK CVTENYPSEYERMRECASGTRGRRLLALSGQKTASLTPAIDFIPWIVINGSRNSDALYDL TQNVCEAMQPMPSACKDYLRSLQ C02D5.1 gene gene C02D5.1 C02D5.1 C02D5.1 transcript db_xref GI:15145275 db_xref WormBase:C02D5.1 gene C02D5.1 note contains similarity to Pfam domains PF02770 (Acyl-CoA dehydrogenase, middle domain), PF02771 (Acyl-CoA dehydrogenase, N-terminal domain), PF00441 (Acyl-CoA dehydrogenases); coded for by the following C. 
elegans cDNAs: yk122b9.5 product Hypothetical protein C02D5.1 protein_id AAA27913.2 standard_name C02D5.1 C02D5.1 start_codon 13633 13635 C02D5.1:1 exon 13633 13867 C02D5.1:2 exon 13920 14054 C02D5.1:3 exon 14099 14451 C02D5.1:4 exon 14536 14889 C02D5.1:5 exon 15271 15420 translation from_gene[C02D5.1] cds_boundaries:(L16622:13633..15420) transcript_info:[C02D5.1] MAFLARKTSSLLPATTSSTVKHMIYDEPHFAMQNSLAKLIKEKINPNVAQWEKSGRYPAH FVFKMLGQLGVFAVNKPVDYGGTGRDFAMSIAIAEQIGAVDCGSIPMSVMVQSDMSTPAL AQFGSDSLRNRFLRPSINGDLVSSIAVSEPHAGSDVSAIRTHARRYGSDLIINGSKMWIT NGDQADWACVLVNTSNAKNLHKNKSLVCIPLDSIGVHRSTPLDKLGMRSSDTVQLFFEDV RVPSSYIIGEEGRGFAYQMNQFNDERLVTVAVGLLPLQKCINETIEYARERLIFGKTLLD QQYVQFRLAELEAELEATRSLLYRTVLARCQGEDVSMLTAMAKLKIGRLARKVTDQCLQI WGGAGYLNDNGISRAFRDFRIFSIGAGCDEVMMQIIHKTQSKRQQKRI tmhmm.out100644000766000024 104514006427353 16247 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data# my_sequence_id Length: 178 # my_sequence_id Number of predicted TMHs: 3 # my_sequence_id Exp number of AAs in TMHs: 69.58306 # my_sequence_id Exp number, first 60 AAs: 8.42188 # my_sequence_id Total prob of N-in: 0.07306 my_sequence_id TMHMM2.0 outside 1 53 my_sequence_id TMHMM2.0 TMhelix 54 76 my_sequence_id TMHMM2.0 inside 77 115 my_sequence_id TMHMM2.0 TMhelix 116 138 my_sequence_id TMHMM2.0 outside 139 150 my_sequence_id TMHMM2.0 TMhelix 151 173 my_sequence_id TMHMM2.0 inside 174 178 swiss.dat100644000766000024 11176414006427353 16310 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID MA32_HUMAN STANDARD; PRT; 282 AA. AC Q07021; DT 01-FEB-1995 (Rel. 31, Created) DT 01-FEB-1995 (Rel. 31, Last sequence update) DT 01-OCT-2000 (Rel. 40, Last annotation update) DE COMPLEMENT COMPONENT 1, Q SUBCOMPONENT BINDING PROTEIN, MITOCHONDRIAL DE PRECURSOR (GLYCOPROTEIN GC1QBP) (GC1Q-R PROTEIN) (HYALURONAN-BINDING DE PROTEIN 1) (PRE-MRNA SPLICING FACTOR SF2, P32 SUBUNIT) (P33). GN GC1QBP OR HABP1 OR SF2P32 OR C1QBP. OS Homo sapiens (Human). 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP SEQUENCE FROM N.A., AND SEQUENCE OF 74; 76-93 AND 208-216. RC TISSUE=FIBROBLAST; RX MEDLINE=94085792; PubMed=8262387; RA Honore B., Madsen P., Rasmussen H.H., Vandekerckhove J., Celis J.E., RA Leffers H.; RT "Cloning and expression of a cDNA covering the complete coding region RT of the P32 subunit of human pre-mRNA splicing factor SF2."; RL Gene 134:283-287(1993). RN [2] RP SEQUENCE OF 5-282 FROM N.A., AND SEQUENCE OF 74-114. RX MEDLINE=91309150; PubMed=1830244; RA Krainer A.R., Mayeda A., Kozak D., Binns G.; RT "Functional expression of cloned human splicing factor SF2: homology RT to RNA-binding proteins, U1 70K, and Drosophila splicing regulators."; RL Cell 66:383-394(1991). RN [3] RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE. RX MEDLINE=94253723; PubMed=8195709; RA Ghebrehiwet B., Lim B.L., Peerschke E.I., Willis A.C., Reid K.B.; RT "Isolation, cDNA cloning, and overexpression of a 33-kD cell surface RT glycoprotein that binds to the globular 'heads' of C1q."; RL J. Exp. Med. 179:1809-1821(1994). RN [4] RP X-RAY CRYSTALLOGRAPHY (2.25 ANGSTROMS). RX MEDLINE=99199225; PubMed=10097078; RA Jiang J., Zhang Y., Krainer A.R., Xu R.-M.; RT "Crystal structure of human p32, a doughnut-shaped acidic RT mitochondrial matrix protein."; RL Proc. Natl. Acad. Sci. U.S.A. 96:3572-3577(1999). CC -!- FUNCTION: NOT KNOWN. BINDS TO THE GLOBULAR "HEADS" OF C1Q THUS CC INHIBITING C1 ACTIVATION. CC -!- SUBCELLULAR LOCATION: MITOCHONDRIAL MATRIX. CC -!- SIMILARITY: BELONGS TO THE MAM33 FAMILY. CC -!- CAUTION: WAS ORIGINALLY (REF.1 AND REF.2) THOUGHT TO BE A PRE-MRNA CC SPLICING FACTOR THAT PLAYS A ROLE IN PREVENTING EXON SKIPPING, CC ENSURING THE ACCURACY OF SPLICING AND REGULATING ALTERNATIVE CC SPLICING. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. 
It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; L04636; AAA16315.1; -. DR EMBL; M69039; AAA73055.1; -. DR EMBL; X75913; CAA53512.1; -. DR PIR; JT0762; JT0762. DR PIR; S44104; S44104. DR PDB; 1P32; 06-APR-99. DR MIM; 601269; -. KW Mitochondrion; Transit peptide; 3D-structure. FT TRANSIT 1 73 MITOCHONDRION. FT CHAIN 74 282 COMPLEMENT COMPONENT 1, Q SUBCOMPONENT FT BINDING PROTEIN. SQ SEQUENCE 282 AA; 31362 MW; 2F747FA73BB1314B CRC64; MLPLLRCVPR VLGSSVAGLR AAAPASPFRQ LLQPAPRLCT RPFGLLSVRA GSERRPGLLR PRGPCACGCG CGSLHTDGDK AFVDFLSDEI KEERKIQKHK TLPKMSGGWE LELNGTEAKL VRKVAGEKIT VTFNINNSIP PTFDGEEEPS QGQKVEEQEP ELTSTPNFVV EVIKNDDGKK ALVLDCHYPE DEVGQEDEAE SDIFSIREVS FQSTGESEWK DTNYTLNTDS LDWALYDHLM DFLADRGVDN TFADELVELS TALEHQEYIT FLEDLKSFVK SQ // ID ACON_CAEEL STANDARD; PRT; 788 AA. AC P34455; DT 01-FEB-1994 (Rel. 28, Created) DT 01-FEB-1994 (Rel. 28, Last sequence update) DT 15-JUL-1999 (Rel. 38, Last annotation update) DE Probable aconitate hydratase, mitochondrial precursor (EC 4.2.1.3) DE (Citrate hydro-lyase) (Aconitase). GN F54H12.1. OS Caenorhabditis elegans. OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; OC Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6239; RN [1] RP SEQUENCE FROM N.A. 
RC STRAIN=BRISTOL N2; RX MEDLINE=94150718; PubMed=7906398; RA Wilson R., Ainscough R., Anderson K., Baynes C., Berks M., RA Bonfield J., Burton J., Connell M., Copsey T., Cooper J., Coulson A., RA Craxton M., Dear S., Du Z., Durbin R., Favello A., Fraser A., RA Fulton L., Gardner A., Green P., Hawkins T., Hillier L., Jier M., RA Johnston L., Jones M., Kershaw J., Kirsten J., Laisster N., RA Latreille P., Lightning J., Lloyd C., Mortimore B., O'Callaghan M., RA Parsons J., Percy C., Rifken L., Roopra A., Saunders D., Shownkeen R., RA Sims M., Smaldon N., Smith A., Smith M., Sonnhammer E., Staden R., RA Sulston J., Thierry-Mieg J., Thomas K., Vaudin M., Vaughan K., RA Waterson R., Watson A., Weinstock L., Wilkinson-Sproat J., RA Wohldman P.; RT "2.2 Mb of contiguous nucleotide sequence from chromosome III of C. RT elegans."; RL Nature 368:32-38(1994). CC -!- CATALYTIC ACTIVITY: Citrate = cis-aconitate + H(2)O. CC -!- COFACTOR: ACONITASE HAS AN ACTIVE (4FE-4S) AND AN INACTIVE (3FE- CC 4S) FORMS. THE ACTIVE (4FE-4S) CLUSTER IS PART OF THE CATALYTIC CC SITE THAT INTERCONVERTS CITRATE, CIS-ACONITASE, AND ISOCITRATE (BY CC SIMILARITY). CC -!- PATHWAY: TRICARBOXYLIC ACID CYCLE. CC -!- SUBUNIT: MONOMER (BY SIMILARITY). CC -!- SUBCELLULAR LOCATION: Mitochondrial (By similarity). CC -!- SIMILARITY: BELONGS TO THE ACONITASE/IPM ISOMERASE FAMILY. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). 
CC -------------------------------------------------------------------------- DR EMBL; L25599; AAA28050.1; -. DR PIR; S44831; S44831. DR HSSP; P20004; 1AMJ. DR WormPep; F54H12.1; CE00516. DR InterPro; IPR001030; Aconitase. DR InterPro; IPR000573; Aconitase_C. DR Pfam; PF00330; aconitase; 1. DR Pfam; PF00694; Aconitase_C; 1. DR PRINTS; PR00415; ACONITASE. DR ProDom; PD000511; Aconitase; 1. DR PROSITE; PS00450; ACONITASE_1; 1. DR PROSITE; PS01244; ACONITASE_2; 1. KW Hypothetical protein; Lyase; Tricarboxylic acid cycle; Iron-sulfur; KW Mitochondrion; Transit peptide; 4Fe-4S. FT TRANSIT 1 ? MITOCHONDRION (POTENTIAL). FT CHAIN ? 788 PROBABLE ACONITATE HYDRATASE. FT METAL 393 393 IRON-SULFUR (4FE-4S) (BY SIMILARITY). FT METAL 456 456 IRON-SULFUR (4FE-4S) (BY SIMILARITY). FT METAL 459 459 IRON-SULFUR (4FE-4S) (BY SIMILARITY). SQ SEQUENCE 788 AA; 85712 MW; 8861E6FC198B70D9 CRC64; MRYHFLFGSL RNHLFSFRGV IYCREKLFNC SKLSFRPSKV AISKFEPKSY LPYEKLSQTV KIVKDRLKRP LTLSEKILYG HLDQPKTQDI ERGVSYLRLR PDRVAMQDAT AQMAMLQFIS SGLPKTAVPS TIHCDHLIEA QKGGAQDLAR AKDLNKEVFN FLATAGSKYG VGFWKPGSGI IHQIILENYA FPGLLLIGTD SHTPNGGGLG GLCIGVGGAD AVDVMADIPW ELKCPKVIGI KLTGKLNGWT SAKDVILKVA DILTVKGGTG AIVEYFGPGV DSISATGMGT ICNMGAEIGA TTSVFPYNES MYKYLEATGR KEIAEEARKY KDLLTADDGA NYDQIIEINL DTLTPHVNGP FTPDLASSID KLGENAKKNG WPLDVKVSLI GSCTNSSYED MTRAASIAKQ ALDKGLKAKT IFTITPGSEQ VRATIERDGL SKIFADFGGM VLANACGPCI GQWDRQDVKK GEKNTIVTSY NRNFTGRNDA NPATHGFVTS PDITTAMAIS GRLDFNPLTD ELTAADGSKF KLQAPTGLDL PPKGYDPGED TFQAPSGSGQ VDVSPSSDRL QLLSPFDKWD GKDLEDMKIL IKVTGKCTTD HISAAGPWLK YRGHLDNISN NLFLTAINAD NGEMNKVKNQ VTGEYGAVPA TARKYKADGV RWVAIGDENY GEGSSREHAA LEPRHLGGRA IIVKSFARIH ETNLKKQGML PLTFANPADY DKIDPSDNVS IVGLSSFAPG KPLTAIFKKT NGSKVEVTLN HTFNEQQIEW FKAGSALNRM KEVFAKSK // ID 143E_HUMAN STANDARD; PRT; 255 AA. AC P42655; P29360; Q63631; DT 01-NOV-1995 (Rel. 32, Created) DT 01-NOV-1995 (Rel. 32, Last sequence update) DT 15-JUL-1999 (Rel. 
38, Last annotation update) DE 14-3-3 protein epsilon (Mitochondrial import stimulation factor L DE subunit) (Protein kinase C inhibitor protein-1) (KCIP-1) (14-3-3E). GN YWHAE. OS Homo sapiens (Human), OS Mus musculus (Mouse), OS Rattus norvegicus (Rat), OS Bos taurus (Bovine), and OS Ovis aries (Sheep). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606, 10090, 10116, 9913, 9940; RN [1] RP SEQUENCE FROM N.A. RC SPECIES=Human; RX MEDLINE=95372385; PubMed=7644510; RA Conklin D.S., Galaktionov K., Beach D.; RT "14-3-3 proteins associate with cdc25 phosphatases."; RL Proc. Natl. Acad. Sci. U.S.A. 92:7892-7896(1995). RN [2] RP SEQUENCE FROM N.A. RC SPECIES=Human; TISSUE=Heart; RA Luk S.C.W., Lee C.Y., Waye M.M.Y.; RL Submitted (JUN-1995) to the EMBL/GenBank/DDBJ databases. RN [3] RP SEQUENCE FROM N.A. RC SPECIES=Human; RX MEDLINE=96300316; PubMed=8684458; RA Jin D.-Y., Lyu M.S., Kozak C.A., Jeang K.-T.; RT "Function of 14-3-3 proteins."; RL Nature 382:308-308(1996). RN [4] RP SEQUENCE FROM N.A. RC SPECIES=Human; TISSUE=Liver; RX MEDLINE=97011338; PubMed=8858348; RA Chong S.S., Tanigami A., Roschke A.V., Ledbetter D.H.; RT "14-3-3 epsilon has no homology to LIS1 and lies telomeric to it on RT chromosome 17p13.3 outside the Miller-Dieker syndrome chromosome RT region."; RL Genome Res. 6:735-741(1996). RN [5] RP SEQUENCE FROM N.A. RC SPECIES=Human; RA Tanigami A., Chong S.S., Ledbetter D.H.; RT "14-3-3 epsilon genomic sequence."; RL Submitted (AUG-1998) to the EMBL/GenBank/DDBJ databases. RN [6] RP SEQUENCE FROM N.A. RC SPECIES=Human; TISSUE=Placenta; RA Strausberg R.; RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases. RN [7] RP SEQUENCE FROM N.A. 
RC SPECIES=Rat, and Sheep; TISSUE=Pineal gland; RX MEDLINE=94296566; PubMed=8024705; RA Roseboom P.H., Weller J.L., Babila T., Aitken A., Sellers L.A., RA Moffet J.R., Namboodiri M.A., Klein D.C.; RT "Cloning and characterization of the epsilon and zeta isoforms of the RT 14-3-3 proteins."; RL DNA Cell Biol. 13:629-640(1994). RN [8] RP SEQUENCE FROM N.A. RC SPECIES=Rat; TISSUE=Liver; RX MEDLINE=95122474; PubMed=7822263; RA Alam R., Hachiya N., Sakaguchi M., Shun-Ichiro K., Iwanaga S., RA Kitajima M., Mihara K., Omura T.; RT "cDNA cloning and characterization of mitochondrial import RT stimulation factor (MSF) purified from rat liver cytosol."; RL J. Biochem. 116:416-425(1994). RN [9] RP SEQUENCE FROM N.A. RC SPECIES=Rat; TISSUE=Brain; RX MEDLINE=96280718; PubMed=8694795; RA Gao L., Gu X.B., Yu D.S., Yu R.K., Zeng G.; RT "Association of a 14-3-3 protein with CMP-NeuAc:GM1 alpha 2,3- RT sialyltransferase."; RL Biochem. Biophys. Res. Commun. 224:103-107(1996). RN [10] RP SEQUENCE FROM N.A. RC SPECIES=Mouse; STRAIN=SWISS; TISSUE=Kidney; RX MEDLINE=95269876; PubMed=7750640; RA McConnell J.E., Armstrong J.F., Bard J.B.; RT "The mouse 14-3-3 epsilon isoform, a kinase regulator whose RT expression pattern is modulated in mesenchyme and neuronal RT differentiation."; RL Dev. Biol. 169:218-228(1995). RN [11] RP SEQUENCE FROM N.A. RC SPECIES=Mouse; STRAIN=129/SV; RA Takihara Y., Irie K., Nomura M., Motaleb M., Matsumoto K., RA Shimada K.; RL Submitted (SEP-1996) to the EMBL/GenBank/DDBJ databases. RN [12] RP SEQUENCE FROM N.A. RC SPECIES=Bovine; RA Jones J.M., Niikura T., Pinke R.M., Guo W., Molday L., Leykam J., RA McConnell D.G.; RT "Expression of 14-3-3 proteins in bovine retinal photoreceptors."; RL Submitted (JAN-1998) to the EMBL/GenBank/DDBJ databases. RN [13] RP SEQUENCE OF 1-152; 165-184 AND 216-255. 
RC SPECIES=Sheep; TISSUE=Brain; RX MEDLINE=92283271; PubMed=1317796; RA Toker A., Sellers L.A., Amess B., Patel Y., Harris A., Aitken A.; RT "Multiple isoforms of a protein kinase C inhibitor (KCIP-1/14-3-3) RT from sheep brain. Amino acid sequence of phosphorylated forms."; RL Eur. J. Biochem. 206:453-461(1992). RN [14] RP SEQUENCE OF 1-23 AND 125-140. RC SPECIES=Sheep; TISSUE=Brain; RX MEDLINE=90345949; PubMed=2143472; RA Toker A., Ellis C.A., Sellers L.A., Aitken A.; RT "Protein kinase C inhibitor proteins. Purification from sheep brain RT and sequence similarity to lipocortins and 14-3-3 protein."; RL Eur. J. Biochem. 191:421-429(1990). CC -!- FUNCTION: ACTIVATES TYROSINE AND TRYPTOPHAN HYDROXYLASES IN THE CC PRESENCE OF CA(2+)/CALMODULIN-DEPENDENT PROTEIN KINASE II, AND CC STRONGLY ACTIVATES PROTEIN KINASE C. IS PROBABLY A MULTIFUNCTIONAL CC REGULATOR OF THE CELL SIGNALING PROCESSES MEDIATED BY BOTH CC KINASES. CC -!- SUBUNIT: HOMODIMER. CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC. CC -!- TISSUE SPECIFICITY: 14-3-3 PROTEINS ARE LOCALIZED IN NEURONS, AND CC ARE AXONALLY TRANSPORTED TO THE NERVE TERMINALS. THEY MAY BE ALSO CC PRESENT, AT LOWER LEVELS, IN VARIOUS OTHER EUKARYOTIC TISSUES. CC -!- SIMILARITY: BELONGS TO THE 14-3-3 FAMILY. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; U28936; AAA75301.1; -. DR EMBL; U20972; AAC50175.1; -. 
DR EMBL; U43399; AAC50625.1; -. DR EMBL; U43430; AAD00026.1; -. DR EMBL; U54778; AAC50710.1; -. DR EMBL; AB017103; BAA32538.1; -. DR EMBL; AB017098; BAA32538.1; JOINED. DR EMBL; AB017099; BAA32538.1; JOINED. DR EMBL; AB017100; BAA32538.1; JOINED. DR EMBL; AB017101; BAA32538.1; JOINED. DR EMBL; AB017102; BAA32538.1; JOINED. DR EMBL; BC000179; AAH00179.1; -. DR EMBL; BC001440; AAH01440.1; -. DR EMBL; M84416; AAC37659.1; -. DR EMBL; D30739; BAA06401.1; -. DR EMBL; Z19599; CAA79659.1; -. DR EMBL; U53882; AAC52676.1; -. DR EMBL; L07914; AAC37321.1; -. DR EMBL; D87663; BAA13424.1; -. DR EMBL; AF043735; AAC61927.1; -. DR PIR; S10806; S10806. DR PIR; S10807; S10807. DR HSSP; P29312; 1A38. DR MIM; 605066; -. DR MGD; MGI:894689; Ywhae. DR InterPro; IPR000308; 14-3-3. DR Pfam; PF00244; 14-3-3; 1. DR PRINTS; PR00305; 1433ZETA. DR ProDom; PD000600; 14-3-3; 1. DR SMART; SM00101; 14_3_3; 1. DR PROSITE; PS00796; 1433_1; 1. DR PROSITE; PS00797; 1433_2; 1. KW Brain; Neurone; Acetylation; Multigene family. FT MOD_RES 1 1 ACETYLATION. FT CONFLICT 73 73 K -> T (IN REF. 9). FT CONFLICT 120 120 F -> S (IN REF. 9). FT CONFLICT 123 123 K -> Y (IN REF. 9). FT CONFLICT 129 129 H -> Y (IN REF. 14). SQ SEQUENCE 255 AA; 29174 MW; 07817CCBD1F75B26 CRC64; MDDREDLVYQ AKLAEQAERY DEMVESMKKV AGMDVELTVE ERNLLSVAYK NVIGARRASW RIISSIEQKE ENKGGEDKLK MIREYRQMVE TELKLICCDI LDVLDKHLIP AANTGESKVF YYKMKGDYHR YLAEFATGND RKEAAENSLV AYKAASDIAM TELPPTHPIR LGLALNFSVF YYEILNSPDR ACRLAKAAFD DAIAELDTLS EESYKDSTLI MQLLRDNLTL WTSDMQGDGE EQNKEALQDV EDENQ // ID 143B_BOVIN STANDARD; PRT; 245 AA. AC P29358; DT 01-DEC-1992 (Rel. 24, Created) DT 01-FEB-1996 (Rel. 33, Last sequence update) DT 16-OCT-2001 (Rel. 40, Last annotation update) DE 14-3-3 protein beta/alpha (Protein kinase C inhibitor protein-1) DE (KCIP-1). GN YWHAB. OS Bos taurus (Bovine), and OS Ovis aries (Sheep). 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; OC Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913, 9940; RN [1] RP SEQUENCE. RC SPECIES=Bovine; RX MEDLINE=91108808; PubMed=1671102; RA Isobe T., Ichimura T., Sunaya T., Okuyama T., Takahashi N., Kuwano R., RA Takahashi Y.; RT "Distinct forms of the protein kinase-dependent activator of tyrosine RT and tryptophan hydroxylases."; RL J. Mol. Biol. 217:125-132(1991). RN [2] RP SEQUENCE OF 2-145 FROM N.A. RC SPECIES=Bovine; TISSUE=Retina; RA Jones J.M., Niikura T., Pinke R.M., Guo W., Molday L., Leykam J., RA McConnell D.G.; RT "Expression of 14-3-3 proteins in bovine retinal photoreceptors."; RL Submitted (JAN-1998) to the EMBL/GenBank/DDBJ databases. RN [3] RP SEQUENCE OF 2-83; 121-186 AND 199-241. RC SPECIES=Sheep; TISSUE=Brain; RX MEDLINE=92283271; PubMed=1317796; RA Toker A., Sellers L.A., Amess B., Patel Y., Harris A., Aitken A.; RT "Multiple isoforms of a protein kinase C inhibitor (KCIP-1/14-3-3) RT from sheep brain. Amino acid sequence of phosphorylated forms."; RL Eur. J. Biochem. 206:453-461(1992). RN [4] RP SEQUENCE OF 2-23. RC SPECIES=Sheep; TISSUE=Brain; RX MEDLINE=90345949; PubMed=2143472; RA Toker A., Ellis C.A., Sellers L.A., Aitken A.; RT "Protein kinase C inhibitor proteins. Purification from sheep brain RT and sequence similarity to lipocortins and 14-3-3 protein."; RL Eur. J. Biochem. 191:421-429(1990). RN [5] RP PHOSPHORYLATION. RC SPECIES=Sheep; RX MEDLINE=95197587; PubMed=7890696; RA Aitken A., Howell S., Jones D., Madrazo J., Patel Y.; RT "14-3-3 alpha and delta are the phosphorylated forms of RT raf-activating 14-3-3 beta and zeta. In vivo stoichiometric RT phosphorylation in brain at a Ser-Pro-Glu-Lys motif."; RL J. Biol. Chem. 270:5706-5709(1995). RN [6] RP POST-TRANSLATIONAL MODIFICATIONS. 
RC SPECIES=Sheep; RA Aitken A., Patel Y., Martin H., Jones D., Robinson K., Madrazo J., RA Howell S.; RT "Electrospray mass spectroscopy analysis with online trapping of RT posttranslationally modified mammalian and avian brain 14-3-3 RT isoforms."; RL J. Protein Chem. 13:463-465(1994). CC -!- FUNCTION: ACTIVATES TYROSINE AND TRYPTOPHAN HYDROXYLASES IN THE CC PRESENCE OF CA(2+)/CALMODULIN-DEPENDENT PROTEIN KINASE II, AND CC STRONGLY ACTIVATES PROTEIN KINASE C. IS PROBABLY A MULTIFUNCTIONAL CC REGULATOR OF THE CELL SIGNALING PROCESSES MEDIATED BY BOTH CC KINASES. CC -!- SUBUNIT: HOMODIMER. CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC. CC -!- ALTERNATIVE PRODUCTS: TWO FORMS ARE PRODUCED BY ALTERNATIVE CC INITIATION. CC -!- TISSUE SPECIFICITY: 14-3-3 PROTEINS ARE LOCALIZED IN NEURONS, AND CC ARE AXONALLY TRANSPORTED TO THE NERVE TERMINALS. THEY MAY BE ALSO CC PRESENT, AT LOWER LEVELS, IN VARIOUS OTHER EUKARYOTIC TISSUES. CC -!- PTM: ISOFORM ALPHA DIFFERS FROM ISOFORM BETA IN BEING CC PHOSPHORYLATED. CC -!- SIMILARITY: BELONGS TO THE 14-3-3 FAMILY. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; AF043736; AAC02090.1; -. DR PIR; S13467; S13467. DR PIR; S10804; S10804. DR PIR; S23179; S23179. DR HSSP; P29312; 1A38. DR InterPro; IPR000308; 14-3-3. DR Pfam; PF00244; 14-3-3; 1. DR PRINTS; PR00305; 1433ZETA. DR ProDom; PD000600; 14-3-3; 1. DR SMART; SM00101; 14_3_3; 1. 
DR PROSITE; PS00796; 1433_1; 1. DR PROSITE; PS00797; 1433_2; 1. KW Brain; Neurone; Phosphorylation; Acetylation; Multigene family; KW Alternative initiation. FT INIT_MET 0 0 FT CHAIN 1 245 14-3-3 PROTEIN BETA/ALPHA, LONG ISOFORM. FT CHAIN 2 245 14-3-3 PROTEIN BETA/ALPHA, SHORT ISOFORM. FT INIT_MET 2 2 FOR SHORT ISOFORM. FT MOD_RES 1 1 ACETYLATION. FT MOD_RES 2 2 ACETYLATION (IN SHORT ISOFORM). FT MOD_RES 185 185 PHOSPHORYLATION. SQ SEQUENCE 245 AA; 27950 MW; AA91C2314D99549F CRC64; TMDKSELVQK AKLAEQAERY DDMAAAMKAV TEQGHELSNE ERNLLSVAYK NVVGARRSSW RVISSIEQKT ERNEKKQQMG KEYREKIEAE LQDICNDVLQ LLDKYLIPNA TQPESKVFYL KMKGDYFRYL SEVASGDNKQ TTVSNSQQAY QEAFEISKKE MQPTHPIRLG LALNFSVFYY EILNSPEKAC SLAKTAFDEA IAELDTLNEE SYKDSTLIMQ LLRDNLTLWT SENQGDEGDA GEGEN // ID CALM_HUMAN STANDARD; PRT; 148 AA. AC P02593; P99014; P70667; Q61379; Q61380; DT 21-JUL-1986 (Rel. 01, Created) DT 21-JUL-1986 (Rel. 01, Last sequence update) DT 16-OCT-2001 (Rel. 40, Last annotation update) DE Calmodulin. GN (CALM1 OR CAM1 OR CALM OR CAM) AND (CALM2 OR CAM2 OR CAMB) AND GN (CALM3 OR CAM3 OR CAMC). OS Homo sapiens (Human), OS Mus musculus (Mouse), OS Rattus norvegicus (Rat), OS Oryctolagus cuniculus (Rabbit), OS Bos taurus (Bovine), OS Gallus gallus (Chicken), OS Anas platyrhynchos (Domestic duck), OS Xenopus laevis (African clawed frog), OS Arbacia punctulata (Punctuate sea urchin), OS Oncorhynchus sp. (Salmon), and OS Oryzias latipes (Medaka fish). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606, 10090, 10116, 9986, 9913, 9031, 8839, 8355, 7641, OX 8025, 8090; RN [1] RP SEQUENCE FROM N.A. RC SPECIES=Human; RX MEDLINE=89034207; PubMed=3182832; RA Fischer R., Koller M., Flura M., Mathews S., Strehler-Page M.A., RA Krebs J., Penniston J.T., Carafoli E., Strehler E.E.; RT "Multiple divergent mRNAs code for a single human calmodulin."; RL J. Biol. Chem. 263:17055-17062(1988). RN [2] RP SEQUENCE FROM N.A. 
RC SPECIES=Human; RX MEDLINE=88059053; PubMed=2445749; RA Sengupta B., Friedberg F., Detera-Wadleigh S.D.; RT "Molecular analysis of human and rat calmodulin complementary DNA RT clones. Evidence for additional active genes in these species."; RL J. Biol. Chem. 262:16663-16670(1987). RN [3] RP SEQUENCE FROM N.A. RC SPECIES=Human; RX MEDLINE=85022688; PubMed=6385987; RA Wawrzynczak E.J., Perham R.N.; RT "Isolation and nucleotide sequence of a cDNA encoding human RT calmodulin."; RL Biochem. Int. 9:177-185(1984). RN [4] RP SEQUENCE FROM N.A. RC SPECIES=Human; TISSUE=Blood; RX MEDLINE=95010144; PubMed=7925473; RA Rhyner J.A., Ottiger M., Wicki R., Greenwood T.M., Strehler E.E.; RT "Structure of the human CALM1 calmodulin gene and identification of RT two CALM1-related pseudogenes CALM1P1 and CALM1P2."; RL Eur. J. Biochem. 225:71-82(1994). RN [5] RP SEQUENCE FROM N.A. RC SPECIES=Human; TISSUE=Lymphoma; RA Kato S.; RL Submitted (FEB-1995) to the EMBL/GenBank/DDBJ databases. RN [6] RP SEQUENCE. RC SPECIES=Human; TISSUE=Brain; RX MEDLINE=82231946; PubMed=7093203; RA Sasagawa T., Ericsson L.H., Walsh K.A., Schreiber W.E., Fischer E.H., RA Titani K.; RT "Complete amino acid sequence of human brain calmodulin."; RL Biochemistry 21:2565-2569(1982). RN [7] RP SEQUENCE. RC SPECIES=Rabbit; TISSUE=Skeletal muscle; RX MEDLINE=81138220; PubMed=7202416; RA Grand R.J.A., Shenolikar S., Cohen P.; RT "The amino acid sequence of the delta subunit (calmodulin) of rabbit RT skeletal muscle phosphorylase kinase."; RL Eur. J. Biochem. 113:359-367(1981). RN [8] RP SEQUENCE. RC SPECIES=Bovine; TISSUE=Brain; RA Kasai H., Kato Y., Isobe T., Kawasaki H., Okuyama T.; RT "Determination of the complete amino acid sequence of calmodulin RT (phenylalanine-rich acidic protein II) from bovine brain."; RL Biomed. Res. 1:248-264(1980). RN [9] RP SEQUENCE. 
RC SPECIES=Bovine; TISSUE=Brain; RX MEDLINE=80094551; PubMed=7356670; RA Watterson D.M., Sharief F., Vanaman T.C.; RT "The complete amino acid sequence of the Ca2+-dependent modulator RT protein (calmodulin) of bovine brain."; RL J. Biol. Chem. 255:962-975(1980). RN [10] RP SEQUENCE. RC SPECIES=Bovine; TISSUE=Uterus; RA Grand R.J.A., Perry S.V.; RT "The amino acid sequence of the troponin C-like protein (modulator RT protein) from bovine uterus."; RL FEBS Lett. 92:137-142(1978). RN [11] RP SEQUENCE OF 38-60. RC SPECIES=Bovine; RX MEDLINE=89064822; PubMed=3058479; RA Pribilla I., Krueger H., Buchner K., Otto H., Schiebler W., RA Tripier D., Hucho F.; RT "Heat-resistant inhibitors of protein kinase C from bovine brain."; RL Eur. J. Biochem. 177:657-664(1988). RN [12] RP SEQUENCE FROM N.A. RC SPECIES=Mouse; RX MEDLINE=88257100; PubMed=3384819; RA Bender P.K., Dedman J.R., Emerson C.P.; RT "The abundance of calmodulin mRNAs is regulated in phosphorylase RT kinase-deficient skeletal muscle."; RL J. Biol. Chem. 263:9733-9737(1988). RN [13] RP SEQUENCE FROM N.A. RC SPECIES=Mouse; RX MEDLINE=90006775; PubMed=2551780; RA Danchin A., Sezer O., Glaser P., Chalon P., Caput D.; RT "Cloning and expression of mouse-brain calmodulin as an activator of RT Bordetella pertussis adenylate cyclase in Escherichia coli."; RL Gene 80:145-149(1989). RN [14] RP SEQUENCE FROM N.A. RC SPECIES=Mouse; STRAIN=BALB/C; TISSUE=Brain; RA Kato K.; RT "A collection of cDNA clones with specific expression patterns in RT mouse brain."; RL Eur. J. Neurosci. 2:704-711(1991). RN [15] RP SEQUENCE. RC SPECIES=Rat; TISSUE=Testis; RX MEDLINE=78066877; PubMed=201628; RA Dedman J.R., Jackson R.L., Schreiber W.E., Means A.R.; RT "Sequence homology of the Ca2+-dependent regulator of cyclic RT nucleotide phosphodiesterase from rat testis with other Ca2+-binding RT proteins."; RL J. Biol. Chem. 253:343-346(1978). RN [16] RP SEQUENCE FROM N.A. 
RC SPECIES=Rat; TISSUE=Brain; RX MEDLINE=87246077; PubMed=2885164; RA Sherbany A.A., Parent A.S., Brosius J.; RT "Rat calmodulin cDNA."; RL DNA 6:267-272(1987). RN [17] RP SEQUENCE FROM N.A. RC SPECIES=Rat; TISSUE=Brain; RX MEDLINE=87226204; PubMed=3035194; RA Nojima H., Hirofumi S.; RT "Structure of a gene for rat calmodulin."; RL J. Mol. Biol. 193:439-445(1987). RN [18] RP SEQUENCE FROM N.A. RC SPECIES=Rat; RX MEDLINE=87257889; PubMed=3037336; RA Nojima H., Kishi K., Sokabe H.; RT "Multiple calmodulin mRNA species are derived from two distinct RT genes."; RL Mol. Cell. Biol. 7:1873-1880(1987). RN [19] RP SEQUENCE FROM N.A. RC SPECIES=Rat; STRAIN=SHR; RX MEDLINE=89362474; PubMed=2527998; RA Nojima H.; RT "Structural organization of multiple rat calmodulin genes."; RL J. Mol. Biol. 208:269-282(1989). RN [20] RP SEQUENCE FROM N.A. RC SPECIES=Chicken; RX MEDLINE=84008199; PubMed=6137485; RA Putkey J.A., Ts'Ui K.F., Tanaka T., Lagace L., Stein J.P., Lai E.C., RA Means A.R.; RT "Chicken calmodulin genes. A species comparison of cDNA sequences and RT isolation of a genomic clone."; RL J. Biol. Chem. 258:11864-11870(1983). RN [21] RP SEQUENCE FROM N.A. RC SPECIES=Chicken; RX MEDLINE=85104969; PubMed=2981850; RA Simmen R.C.M., Tanaka T., Ts'Ui K.F., Putkey J.A., Scott M.J., RA Lai E.C., Means A.R.; RT "The structural organization of the chicken calmodulin gene."; RL J. Biol. Chem. 260:907-912(1985). RN [22] RP ERRATUM. RC SPECIES=Chicken; RA Simmen R.C.M., Tanaka T., Ts'Ui K.F., Putkey J.A., Scott M.J., RA Lai E.C., Means A.R.; RL J. Biol. Chem. 262:4928-4929(1987). RN [23] RP SEQUENCE FROM N.A. RC SPECIES=Chicken; RA Iida Y.; RT "cDNA sequences and molecular evolution of calmodulin genes of RT chicken and eel."; RL Bull. Chem. Soc. Jpn. 57:2667-2668(1984). RN [24] RP SEQUENCE FROM N.A. 
RC SPECIES=A.platyrhynchos; RX MEDLINE=93287810; PubMed=8389959; RA Kimura N., Kurosawa N., Kondo K., Tsukada Y.; RT "Molecular cloning of the kainate-binding protein and calmodulin RT genes which are induced by an imprinting stimulus in ducklings."; RL Brain Res. Mol. Brain Res. 17:351-355(1993). RN [25] RP SEQUENCE FROM N.A. RC SPECIES=X.laevis; RX MEDLINE=84191128; PubMed=6325880; RA Chien Y.-H., Dawid I.B.; RT "Isolation and characterization of calmodulin genes from Xenopus RT laevis."; RL Mol. Cell. Biol. 4:507-513(1984). RN [26] RP SEQUENCE OF 1-141 FROM N.A. RC SPECIES=A.punctulata; RX MEDLINE=88172463; PubMed=3351921; RA Hardy D.O., Bender P.K., Kretsinger R.H.; RT "Two calmodulin genes are expressed in Arbacia punctulata. An ancient RT gene duplication is indicated."; RL J. Mol. Biol. 199:223-227(1988). RN [27] RP SEQUENCE. RC SPECIES=Salmon; RA Yazawa M., Toda H., Yagi Y.; RT "Amino acid sequence of salmon calmodulin."; RL Seikagaku 57:1037-1037(1985). RN [28] RP SEQUENCE FROM N.A. RC SPECIES=O.latipes; RX MEDLINE=93012998; PubMed=1398109; RA Matsuo K., Sato K., Ikeshima H., Shimoda K., Takano T.; RT "Four synonymous genes encode calmodulin in the teleost fish, medaka RT (Oryzias latipes): conservation of the multigene one-protein RT principle."; RL Gene 119:279-281(1992). RN [29] RP SEQUENCE OF 1-27, AND UBIQUITYLATION OF LYS-21. RC SPECIES=Bovine; RX MEDLINE=98380241; PubMed=9716384; RA Laub M., Steppuhn J.A., Blueggel M., Immler D., Meyer H.E., RA Jennissen H.P.; RT "Modulation of calmodulin function by ubiquitin-calmodulin ligase and RT identification of the responsible ubiquitylation site in vertebrate RT calmodulin."; RL Eur. J. Biochem. 255:422-431(1998). RN [30] RP X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS). RC SPECIES=Rat; RX MEDLINE=85188323; PubMed=3990807; RA Babu Y.S., Sack J.S., Greenhough T.J., Bugg C.E., Means A.R., RA Cook W.J.; RT "Three-dimensional structure of calmodulin."; RL Nature 315:37-40(1985). 
RN [31] RP X-RAY CRYSTALLOGRAPHY (2.2 ANGSTROMS). RC SPECIES=Rat; RX MEDLINE=89110997; PubMed=3145979; RA Babu Y.S., Bugg C.E., Cook W.J.; RT "Structure of calmodulin refined at 2.2-A resolution."; RL J. Mol. Biol. 204:191-204(1988). RN [32] RP X-RAY CRYSTALLOGRAPHY (2 ANGSTROMS). RC SPECIES=Bovine; RX MEDLINE=98104088; PubMed=9438860; RA Wall M.E., Clarage J.B., Phillips G.N.; RT "Motions of calmodulin characterized using both Bragg and diffuse RT X-ray scattering."; RL Structure 5:1599-1612(1997). RN [33] RP STRUCTURE BY NMR OF 76-148. RX MEDLINE=94085641; PubMed=8262263; RA Finn B.E., Drakenberg T., Forsen S.; RT "The structure of apo-calmodulin. A 1H NMR examination of the RT carboxy-terminal domain."; RL FEBS Lett. 336:368-374(1993). RN [34] RP STRUCTURE BY NMR OF 76-148. RX MEDLINE=96018615; PubMed=7552749; RA Finn B.E., Evenas J., Drakenberg T., Waltho J.P., Thulin E., RA Forsen S.; RT "Calcium-induced structural changes and domain autonomy in RT calmodulin."; RL Nat. Struct. Biol. 2:777-783(1995). RN [35] RP STRUCTURE BY NMR. RX MEDLINE=96018613; PubMed=7552747; RA Zhang M., Tanaka T., Ikura M.; RT "Calcium-induced conformational transition revealed by the solution RT structure of apo calmodulin."; RL Nat. Struct. Biol. 2:758-767(1995). RN [36] RP STRUCTURE BY NMR. RX MEDLINE=96018614; PubMed=7552748; RA Kuboniwa H., Tjandra N., Grzesiek S., Ren H., Klee C.B., Bax A.; RT "Solution structure of calcium-free calmodulin."; RL Nat. Struct. Biol. 2:768-776(1995). RN [37] RP STRUCTURE BY NMR. RX MEDLINE=98179557; PubMed=9514729; RA Osawa M., Swindells M.B., Tanikawa J., Tanaka T., Mase T., Furuya T., RA Ikura M.; RT "Solution structure of calmodulin-W-7 complex: the basis of diversity RT in molecular recognition."; RL J. Mol. Biol. 276:165-176(1998). RN [38] RP STRUCTURE BY NMR. 
RX MEDLINE=99425120; PubMed=10493800; RA Elshorst B., Hennig M., Foersterling H., Diener A., Maurer M., RA Schulte P., Schwalbe H., Griesinger C., Krebs J., Schmid H., RA Vorherr T., Carafoli E.; RT "NMR solution structure of a complex of calmodulin with a binding RT peptide of the Ca(2+) pump."; RL Biochemistry 38:12320-12332(1999). CC -!- FUNCTION: CALMODULIN MEDIATES THE CONTROL OF A LARGE NUMBER OF CC ENZYMES BY CA(++). AMONG THE ENZYMES TO BE STIMULATED BY THE CC CALMODULIN-CA(++) COMPLEX ARE A NUMBER OF PROTEIN KINASES AND CC PHOSPHATASES. CC -!- PTM: UBIQUITYLATION STRONGLY DECREASES THE ACTIVITY. CC -!- MISCELLANEOUS: THIS PROTEIN HAS FOUR FUNCTIONAL CALCIUM-BINDING CC SITES. CC -!- SIMILARITY: TO OTHER EF-HAND CALCIUM BINDING PROTEINS. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; L00101; AAA48653.1; -. DR EMBL; L00096; AAA48653.1; JOINED. DR EMBL; L00097; AAA48653.1; JOINED. DR EMBL; L00098; AAA48653.1; JOINED. DR EMBL; L00099; AAA48653.1; JOINED. DR EMBL; L00100; AAA48653.1; JOINED. DR EMBL; M16659; AAA40864.1; -. DR EMBL; M27319; AAA35635.1; -. DR EMBL; U12022; AAB60644.1; -. DR EMBL; U11886; AAB60644.1; JOINED. DR EMBL; D45887; BAA08302.1; -. DR EMBL; X13817; CAA32050.1; -. DR EMBL; J04046; AAA51918.1; -. DR EMBL; M19311; AAA35641.1; -. DR EMBL; M19312; AAA40862.1; -. DR EMBL; M17069; AAA40863.1; -. DR EMBL; X13933; CAA32120.1; -. 
DR EMBL; X13931; CAA32119.1; -. DR EMBL; X13932; CAA32119.1; JOINED. DR EMBL; X05117; CAA32119.1; JOINED. DR EMBL; X13833; CAA32062.1; -. DR EMBL; X13834; CAA32062.1; JOINED. DR EMBL; X13835; CAA32062.1; JOINED. DR EMBL; X14265; CAA32478.1; -. DR EMBL; D83350; BAA11896.1; -. DR EMBL; M36167; AAA48650.1; -. DR EMBL; K01944; AAA49668.1; -. DR EMBL; K01945; AAA49669.1; -. DR EMBL; D10363; BAA01195.1; -. DR EMBL; M19380; AAA66181.1; -. DR EMBL; M19381; AAA66182.1; -. DR EMBL; L31642; AAA65934.1; -. DR EMBL; M27844; AAA37365.1; -. DR EMBL; X61432; CAA43674.1; -. DR PIR; S13159; MCHU. DR PIR; JK0013; MCON. DR PIR; A90719; MCBO. DR PIR; A91104; MCRB. DR PIR; S03206; MCRT. DR PIR; A92394; MCCH. DR PIR; S02690; S02690. DR PIR; A60781; A60781. DR PIR; JC1305; JC1305. DR PDB; 2CLN; 15-OCT-94. DR PDB; 3CLN; 09-JAN-89. DR PDB; 1TRC; 15-OCT-91. DR PDB; 1AK8; 17-SEP-97. DR PDB; 1CDL; 31-AUG-94. DR PDB; 1CDM; 31-AUG-94. DR PDB; 1CFC; 07-DEC-95. DR PDB; 1CFD; 07-DEC-95. DR PDB; 1CLL; 31-OCT-93. DR PDB; 1CM1; 04-MAR-98. DR PDB; 1CM4; 04-MAR-98. DR PDB; 1CMF; 07-DEC-95. DR PDB; 1CMG; 07-DEC-95. DR PDB; 1CTR; 20-DEC-94. DR PDB; 1DEG; 31-MAY-94. DR PDB; 1DMO; 01-AUG-96. DR PDB; 1LIN; 08-MAR-96. DR PDB; 1AJI; 17-SEP-97. DR PDB; 1A29; 16-SEP-98. DR PDB; 1MUX; 25-NOV-98. DR PDB; 1CFF; 24-SEP-91. DR SWISS-2DPAGE; P99014; MOUSE. DR Aarhus/Ghent-2DPAGE; 9048; IEF. DR MIM; 114180; -. DR MIM; 114182; -. DR MIM; 114183; -. DR MGD; MGI:88251; Calm. DR MGD; MGI:103250; Calm2. DR MGD; MGI:103249; Calm3. DR InterPro; IPR002048; EF-hand. DR Pfam; PF00036; efhand; 4. DR SMART; SM00054; EFh; 4. DR PROSITE; PS00018; EF_HAND; 4. KW Calcium-binding; Duplication; Methylation; Acetylation; KW 3D-structure. FT INIT_MET 0 0 FT MOD_RES 1 1 ACETYLATION. FT MOD_RES 115 115 METHYLATION (TRI-) (IN CHICKEN). FT CA_BIND 20 31 EF-HAND 1. FT CA_BIND 56 67 EF-HAND 2. FT CA_BIND 93 104 EF-HAND 3. FT CA_BIND 129 140 EF-HAND 4. FT BINDING 21 21 UBIQUITIN (MULTI-). FT CONFLICT 25 25 G -> N (IN REF. 12; AAA66182). 
FT HELIX 5 19 FT TURN 21 22 FT STRAND 26 27 FT HELIX 29 37 FT TURN 38 40 FT HELIX 45 55 FT TURN 57 58 FT STRAND 63 64 FT HELIX 65 92 FT TURN 94 95 FT STRAND 100 100 FT HELIX 102 111 FT TURN 112 113 FT HELIX 118 128 FT STRAND 136 136 FT HELIX 138 146 SQ SEQUENCE 148 AA; 16706 MW; 464B8A287475A1CA CRC64; ADQLTEEQIA EFKEAFSLFD KDGDGTITTK ELGTVMRSLG QNPTEAELQD MINEVDADGN GTIDFPEFLT MMARKMKDTD SEEEIREAFR VFDKDGNGYI SAAELRHVMT NLGEKLTDEE VDEMIREADI DGDGQVNYEE FVQMMTAK // P35527.gb100644000766000024 3401414006427353 15535 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS P35527 623 aa linear PRI 19-SEP-2006 DEFINITION Keratin, type I cytoskeletal 9 (Cytokeratin-9) (CK-9) (Keratin-9) (K9). ACCESSION P35527 VERSION P35527 GI:81175178 DBSOURCE swissprot: locus K1C9_HUMAN, accession P35527; class: standard. extra accessions:O00109,Q14665 created: Jun 1, 1994. sequence updated: Nov 8, 2005. annotation updated: Sep 19, 2006. xrefs: Z29074.1, CAA82315.1, S69510.1, AAC60619.1, X75015.1, CAA52924.1, AB001594.1, BAA19418.1, I37984 xrefs (non-sequence databases): HSSP:P08670, IntAct:P35527, Ensembl:ENSG00000171403, KEGG:hsa:3857, HGNC:6447, MIM:144200, MIM:607606, ArrayExpress:P35527, GO:0005200, GO:0008544, InterPro:IPR011000, InterPro:IPR001664, InterPro:IPR002957, Pfam:PF00038, PRINTS:PR01248, PROSITE:PS00226 KEYWORDS Coiled coil; Direct protein sequencing; Disease mutation; Intermediate filament; Keratin. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (residues 1 to 623) AUTHORS Langbein,L., Heid,H.W., Moll,I. and Franke,W.W. 
TITLE Molecular characterization of the body site-specific human epidermal cytokeratin 9: cDNA cloning, amino acid sequence, and tissue specificity of gene expression JOURNAL Differentiation 55 (1), 57-71 (1993) PUBMED 7507869 REMARK NUCLEOTIDE SEQUENCE [MRNA], AND PARTIAL PROTEIN SEQUENCE. TISSUE=Foot sole tissue Erratum:[Differentiation. 1994 Jan;55(2):164. PMID: 7511549] REFERENCE 2 (residues 1 to 623) AUTHORS Reis,A., Hennies,H.-C., Langbein,L., Digweed,M., Mischke,D., Dreschler,M., Schroek,E., Royer-Pokora,B., Franke,W.W., Sperling,K. and Kuester,W. TITLE Keratin 9 gene mutations in epidermolytic palmoplantar keratoderma (EPPK) JOURNAL Nat. Genet. 6 (2), 174-179 (1994) PUBMED 7512862 REMARK NUCLEOTIDE SEQUENCE [GENOMIC DNA], AND VARIANTS EPPK LYS-161; GLN-163 AND TRP-163. REFERENCE 3 (residues 1 to 623) AUTHORS Bienvenut,W.V. JOURNAL Unpublished REMARK PROTEIN SEQUENCE OF 14-29, AND MASS SPECTROMETRY. TISSUE=Cervix carcinoma REFERENCE 4 (residues 1 to 623) AUTHORS Kobayashi,S., Tanaka,T., Matsuyoshi,N. and Imamura,S. TITLE Keratin 9 point mutation in the pedigree of epidermolytic hereditary palmoplantar keratoderma perturbs keratin intermediate filament network formation JOURNAL FEBS Lett. 386 (2-3), 149-155 (1996) PUBMED 8647270 REMARK NUCLEOTIDE SEQUENCE [MRNA] OF 147-372, AND VARIANT EPPK GLN-163. REFERENCE 5 (residues 1 to 623) AUTHORS Rosen,E.M., Meromsky,L., Romero,R., Setter,E. and Goldberg,I. TITLE Human placenta contains an epithelial scatter protein JOURNAL Biochem. Biophys. Res. Commun. 168 (3), 1082-1088 (1990) PUBMED 2140676 REMARK PROTEIN SEQUENCE OF 450-466. REFERENCE 6 (residues 1 to 623) AUTHORS Hennies,H.C., Zehender,D., Kunze,J., Kuster,W. and Reis,A. TITLE Keratin 9 gene mutational heterogeneity in patients with epidermolytic palmoplantar keratoderma JOURNAL Hum. Genet. 93 (6), 649-654 (1994) PUBMED 7516304 REMARK VARIANTS EPPK VAL-157 AND PRO-172. 
REFERENCE 7 (residues 1 to 623) AUTHORS Bonifas,J.M., Matsumura,K., Chen,M.A., Berth-Jones,J., Hutchison,P.E., Zloczower,M., Fritsch,P.O. and Epstein,E.H. Jr. TITLE Mutations of keratin 9 in two families with palmoplantar epidermolytic hyperkeratosis JOURNAL J. Invest. Dermatol. 103 (4), 474-477 (1994) PUBMED 7523529 REMARK VARIANT EPPK SER-161. REFERENCE 8 (residues 1 to 623) AUTHORS Torchard,D., Blanchet-Bardon,C., Serova,O., Langbein,L., Narod,S., Janin,N., Goguel,A.F., Bernheim,A., Franke,W.W., Lenoir,G.M. and Feunteun,J. TITLE Epidermolytic palmoplantar keratoderma cosegregates with a keratin 9 mutation in a pedigree with breast and ovarian cancer JOURNAL Nat. Genet. 6 (1), 106-110 (1994) PUBMED 7511021 REMARK VARIANT EPPK TYR-161. REFERENCE 9 (residues 1 to 623) AUTHORS Rothnagel,J.A., Wojcik,S., Liefer,K.M., Dominey,A.M., Huber,M., Hohl,D. and Roop,D.R. TITLE Mutations in the 1A domain of keratin 9 in patients with epidermolytic palmoplantar keratoderma JOURNAL J. Invest. Dermatol. 104 (3), 430-433 (1995) PUBMED 7532199 REMARK VARIANTS EPPK TRP-163 AND SER-168. REFERENCE 10 (residues 1 to 623) AUTHORS Endo,H., Hatamochi,A. and Shinkai,H. TITLE A novel mutation of a leucine residue in coil 1A of keratin 9 in epidermolytic palmoplantar keratoderma JOURNAL J. Invest. Dermatol. 109 (1), 113-115 (1997) PUBMED 9204965 REMARK VARIANT EPPK VAL-160. REFERENCE 11 (residues 1 to 623) AUTHORS Covello,S.P., Irvine,A.D., McKenna,K.E., Munro,C.S., Nevin,N.C., Smith,F.J., Uitto,J. and McLean,W.H. TITLE Mutations in keratin K9 in kindreds with epidermolytic palmoplantar keratoderma and epidemiology in Northern Ireland JOURNAL J. Invest. Dermatol. 111 (6), 1207-1209 (1998) PUBMED 9856842 REMARK VARIANTS EPPK THR-157; VAL-157 AND GLN-163. COMMENT On Nov 8, 2005 this sequence version replaced gi:547748. [FUNCTION] May serve an important special function either in the mature palmar and plantar skin tissue or in the morphogenic program of the formation of these tissues. 
[SUBUNIT] Heterotetramer of two type I and two type II keratins. [TISSUE SPECIFICITY] Expressed in the terminally differentiated epidermis of palms and soles. [DISEASE] Defects in KRT9 are a cause of epidermolytic palmoplantar keratoderma (EPPK) [MIM:144200]; also abbreviated EHPPK. EPPK is an autosomal dominant disease characterized by diffuse thickening of the epidermis on the entire surface of palms and soles sharply bordered with erythematous margins. [MISCELLANEOUS] There are two types of cytoskeletal and microfibrillar keratin, I (acidic) and II (neutral to basic) (40-55 and 56-70 kDa, respectively). [SIMILARITY] Belongs to the intermediate filament family. [CAUTION] Was originally (Ref.4) thought to be a 60 kDa chain of placental scatter protein. [WEB RESOURCE] NAME=Human Intermediate Filament Mutation Database; URL='http://www.interfil.org'. [WEB RESOURCE] NAME=GeneReviews; URL='http://www.genetests.org/query?gene=KRT9'. FEATURES Location/Qualifiers source 1..623 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..623 /gene="KRT9" Protein 1..623 /gene="KRT9" /product="Keratin, type I cytoskeletal 9" Region 1..623 /gene="KRT9" /region_name="Mature chain" /experiment="experimental evidence, no additional details recorded" /note="Keratin, type I cytoskeletal 9. /FTId=PRO_0000063640." Region 1..152 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Head." Region 12..13 /gene="KRT9" /region_name="Conflict" /experiment="experimental evidence, no additional details recorded" /note="SR -> T (in Ref. 1; AAC60619 and 2)." Region 15..26 /gene="KRT9" /region_name="Compositionally biased region" /experiment="experimental evidence, no additional details recorded" /note="Poly-Gly." 
Region 152..463 /gene="KRT9" /region_name="Filament" /note="Intermediate filament protein; pfam00038" /db_xref="CDD:40140" Region 153..461 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Rod." Region 153..188 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Coil 1A." Region 157 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="M -> T (in EPPK). /FTId=VAR_010499." Region 157 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="M -> V (in EPPK). /FTId=VAR_010500." Region 160 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="L -> V (in EPPK). /FTId=VAR_010501." Region 161 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="N -> K (in EPPK). /FTId=VAR_003822." Region 161 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="N -> S (in EPPK). /FTId=VAR_010502." Region 161 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="N -> Y (in EPPK). /FTId=VAR_010503." Region 163 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="R -> Q (in EPPK). /FTId=VAR_003823." Region 163 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="R -> W (in EPPK). /FTId=VAR_003824." Region 168 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="L -> S (in EPPK). /FTId=VAR_003825." 
Region 172 /gene="KRT9" /region_name="Variant" /experiment="experimental evidence, no additional details recorded" /note="Q -> P (in EPPK). /FTId=VAR_010504." Region 189..207 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Linker 1." Region 208..299 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Coil 1B." Region 300..322 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Linker 12." Region 323..461 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Coil 2." Region 462..623 /gene="KRT9" /region_name="Region of interest in the sequence" /experiment="experimental evidence, no additional details recorded" /note="Tail." ORIGIN 1 mscrqfsssy lsrsgggggg glgsggsirs sysrfsssgg rggggrfsss sgygggssrv 61 cgrggggsfg ysygggsggg fsasslgggf gggsrgfgga sgggysssgg fgggfgggsg 121 ggfgggygsg fgglggfggg agggdggilt anekstmqel nsrlasyldk vqaleeannd 181 lenkiqdwyd kkgpaaiqkn yspyyntidd lkdqivdltv gnnktlldid ntrmtlddfr 241 ikfemeqnlr qgvdadingl rqvldnltme ksdlemqyet lqeelmalkk nhkeemsqlt 301 gqnsgdvnve invapgkdlt ktlndmrqey eqliaknrkd ienqyetqit qiehevsssg 361 qevqssakev tqlrhgvqel eielqsqlsk kaaleksled tknrycgqlq miqeqisnle 421 aqitdvrqei ecqnqeysll lsikmrleke ietyhnlleg gqedfessga gkiglggrgg 481 sggsygrgsr ggsggsyggg gsgggygggs gsrggsggsy gggsgsgggs gggygggsgg 541 ghsggsgggh sggsggnygg gsgsgggsgg gygggsgsrg gsggshgggs gfggesggsy 601 gggeeasgsg ggygggsgks shs // U71225.gb100644000766000024 616514006427353 15523 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS U71225 1164 bp DNA linear VRT 27-NOV-2001 DEFINITION Desmognathus quadramaculatus 12S ribosomal RNA gene, partial sequence; tRNA-Val gene, complete sequence; and 16S ribosomal 
RNA gene, partial sequence, mitochondrial genes for mitochondrial RNAs. ACCESSION U71225 VERSION U71225.1 GI:2804359 KEYWORDS . SOURCE mitochondrion Desmognathus quadramaculatus (black-bellied salamander) ORGANISM Desmognathus quadramaculatus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia; Batrachia; Caudata; Salamandroidea; Plethodontidae; Desmognathinae; Desmognathus. REFERENCE 1 (bases 1 to 1164) AUTHORS Titus,T.A. and Larson,A. TITLE Molecular phylogenetics of Desmognathine salamanders (Caudata: Plethodontidae): A reevaluation of evolution in ecology, life history, and morphology JOURNAL Syst. Biol. 45, 451-472 (1996) REFERENCE 2 (bases 1 to 1164) AUTHORS Titus,T.A. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) Biology, University of Oregon, Eugene, OR 97403, USA FEATURES Location/Qualifiers source 1..1164 /organism="Desmognathus quadramaculatus" /organelle="mitochondrion" /mol_type="genomic DNA" /db_xref="taxon:52105" rRNA <1..638 /product="12S ribosomal RNA" tRNA 639..706 /product="tRNA-Val" rRNA 707..>1164 /product="16S ribosomal RNA" ORIGIN 1 ggcccaaagg gtagttttag gtgaaataaa atagaattta aaatttatct agtagttata 61 tataaacata aaatgtaaaa tcaaaaacga aagtcatact atataacctt gaatctacta 121 cagctgagaa acaaactagg attagatacc ctactatgct caactttaaa atggaccttc 181 ccgccagagc actacgagcc acagcttaaa actcaaagga cttggcggtg ctctacaccc 241 acctagagga gcctgttcta taatcgacac tccccgataa acctcaccac ctcttgctaa 301 tacagcctat ataccaccgc cctcagttca cccttcaaaa gaataatagt gaacaaaata 361 atttaaaata aaaaagtcag gtcaaggtgc agcaaatgaa gtggaaagaa atgggctaca 421 ttttttatag taaaaaatac ggaatattct atgaaataaa atataaagga ggatttagaa 481 gtaaaaagaa aaaagagtgt tctttttaaa ttggcaatag agcacgcaca caccgcccgt 541 caccctcttc aaaattaaat aaactaaata aatatataaa tttataagaa aaggtaagtc 601 gtaacatggt aagtctaccg gaaggtggcc ttggatatcg aagtatagct taaataaagc 661 attttgctta caccaaaaaa atatttgtta acccaaatta ccttaaattt taaatctatg 721 ctaaatataa aatactactt cctaatacac aaaacattat tatatgatag 
tacgggcgac 781 agaaaactta ttagcgcaat agaaaaagta ctgtaaagga aagatgaaat aaaattgaaa 841 taaaataaaa atataaaaga gcaaagatta taacttttac ctttagcata atggtctagc 901 cagtctatat taacataaag aattttagtt atataccccg aaaccaggcg agctacccta 961 aaacagcaat atatgagcga actcttctct gtggcaaaag agtgagaaga atttttggta 1021 gaggcgaaaa accaaacgag cccggatata gctggttact tgagaatgaa ttttagttca 1081 attaaaagca taaatattat aaaaacataa cgcttttatt ataattaatt gaggtacagc 1141 ccaattaata aaggaaacaa ccta // O_sat.wgs100644000766000024 1405714006427353 16212 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS AAAA02000000 50231 rc DNA linear PLN 11-MAR-2005 DEFINITION Oryza sativa (indica cultivar-group) whole genome shotgun sequencing project. ACCESSION AAAA00000000 VERSION AAAA00000000.2 GI:54362548 KEYWORDS WGS. SOURCE Oryza sativa (indica cultivar-group) ORGANISM Oryza sativa (indica cultivar-group) Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BEP clade; Ehrhartoideae; Oryzeae; Oryza. 
REFERENCE 1 (bases 1 to 50231) AUTHORS Yu,J., Wang,J., Lin,W., Li,S., Li,H., Zhou,J., Ni,P., Dong,W., Hu,S., Zeng,C., Zhang,J., Zhang,Y., Li,R., Xu,Z., Li,S., Li,X., Zheng,H., Cong,L., Lin,L., Yin,J., Geng,J., Li,G., Shi,J., Liu,J., Lv,H., Li,J., Wang,J., Deng,Y., Ran,L., Shi,X., Wang,X., Wu,Q., Li,C., Ren,X., Wang,J., Wang,X., Li,D., Liu,D., Zhang,X., Ji,Z., Zhao,W., Sun,Y., Zhang,Z., Bao,J., Han,Y., Dong,L., Ji,J., Chen,P., Wu,S., Liu,J., Xiao,Y., Bu,D., Tan,J., Yang,L., Ye,C., Zhang,J., Xu,J., Zhou,Y., Yu,Y., Zhang,B., Zhuang,S., Wei,H., Liu,B., Lei,M., Yu,H., Li,Y., Xu,H., Wei,S., He,X., Fang,L., Zhang,Z., Zhang,Y., Huang,X., Su,Z., Tong,W., Li,J., Tong,Z., Li,S., Ye,J., Wang,L., Fang,L., Lei,T., Chen,C., Chen,H., Xu,Z., Li,H., Huang,H., Zhang,F., Xu,H., Li,N., Zhao,C., Li,S., Dong,L., Huang,Y., Li,L., Xi,Y., Qi,Q., Li,W., Zhang,B., Hu,W., Zhang,Y., Tian,X., Jiao,Y., Liang,X., Jin,J., Gao,L., Zheng,W., Hao,B., Liu,S., Wang,W., Yuan,L., Cao,M., McDermott,J., Samudrala,R., Wang,J., Wong,G.K. and Yang,H. TITLE The Genomes of Oryza sativa: A History of Duplications JOURNAL PLoS Biol. 3 (2), E38 (2005) PUBMED 15685292 REFERENCE 2 (bases 1 to 50231) AUTHORS Yu,J., Hu,S., Wang,J., Li,S., Wong,K.-S.G., Liu,B., Deng,Y., Dai,L., Zhou,Y., Zhang,X., Cao,M., Liu,J., Sun,J., Tang,J., Chen,Y., Huang,X., Lin,W., Ye,C., Tong,W., Cong,L., Geng,J., Han,Y., Li,L., Li,W., Hu,G., Huang,X., Li,W., Li,J., Liu,Z., Li,L., Liu,J., Qi,Q., Liu,J., Li,L., Wang,X., Lu,H., Wu,T., Zhu,M., Ni,P., Han,H., Dong,W., Ren,X., Feng,X., Cui,P., Li,X., Wang,H., Xu,X., Zhai,W., Xu,Z., Zhang,J., He,S., Zhang,J., Xu,J., Zhang,K., Zheng,X., Dong,J., Zeng,W., Tao,L., Chen,X., He,J., Liu,D., Tian,W., Tian,C., Xia,H., Li,G., Gao,H., Li,P., Chen,W., Wang,X., Zhang,Y., Hu,J., Wang,J., Liu,S., Yang,J., Zhang,G., Bao,Q., Xiong,Y., Li,Z., Mao,L., Zhou,C., Chen,R., Zhu,Z., Hao,B., Zheng,W., Chen,S., Guo,W., Li,G., Liu,S., Huang,G., Tao,M., Wang,J., Zhu,L., Yuan,L. and Yang,H. 
TITLE Direct Submission JOURNAL Submitted (04-JAN-2002) Beijing Genomics Institute/Center of Genomics & Bioinformatics, Institute of Genomics, Chinese Academy of Sciences, Beijing Airport Industrial Zone B6, Beijing, Beijing 101300, P.R.China REFERENCE 3 (bases 1 to 50231) AUTHORS Yu,J., Wang,J., Lin,W., Li,S., Li,H., Zhou,J., Ni,P., Dong,W., Hu,S., Zeng,C., Zhang,J., Zhang,Y., Li,R., Xu,Z., Li,S., Li,X., Zheng,H., Cong,L., Lin,L., Yin,J., Geng,J., Li,G., Shi,J., Liu,J., Lv,H., Li,J., Wang,J., Deng,Y., Ran,L., Shi,X., Wang,X., Wu,Q., Li,C., Ren,X., Wang,J., Wang,X., Li,D., Liu,D., Zhang,X., Ji,Z., Zhao,W., Sun,Y., Zhang,Z., Bao,J., Han,Y., Dong,L., Ji,J., Chen,P., Wu,S., Liu,J., Xiao,Y., Bu,D., Tan,J., Yang,L., Ye,C., Zhang,J., Xu,J., Zhou,Y., Yu,Y., Zhang,B., Zhuang,S., Wei,H., Liu,B., Lei,M., Yu,H., Li,Y., Xu,H., Wei,S., He,X., Fang,L., Zhang,Z., Zhang,Y., Huang,X., Su,Z., Tong,W., Li,J., Tong,Z., Li,S., Ye,J., Wang,L., Fang,L., Lei,T., Chen,C., Chen,H., Xu,Z., Li,H., Huang,H., Zhang,F., Xu,H., Li,N., Zhao,C., Li,S., Dong,L., Huang,Y., Li,L., Xi,Y., Qi,Q., Li,W., Zhang,B., Hu,W., Zhang,Y., Zheng,W., Hao,B., Liu,S., Wang,W., Yuan,L., Cao,M.L., McDermott,J., Samudrala,R., Wang,J., Wong,G.K.-S. and Yang,H. TITLE Direct Submission JOURNAL Submitted (12-SEP-2003) Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing Airport Industrial Zone B6, Beijing, Beijing 101300, P.R.China COMMENT On Oct 21, 2004 this sequence version replaced gi:19924305. The Oryza sativa (indica cultivar-group) whole genome shotgun (WGS) project has the project accession AAAA00000000. This version of the project (02) has the accession number AAAA02000000, and consists of sequences AAAA02000001-AAAA02050231. The improved whole-genome shotgun (WGS) sequences for the genomes of indica and japonica rice, AAAA02000000 and AACV01000000, respectively, have multi-megabase contiguity and are nearly 1000-fold improved over the drafts of 2002. 
Tested against a non-redundant collection of 19,079 full-length cDNAs, 98.1% of the genes are aligned without fragmentation to the mapped super-scaffolds of one or the other genome. Despite having only a small variation in gene content, there is an enormous variation in intergenic regions. At least a quarter of the two sequences could not be aligned, and in the aligned region single-nucleotide-polymorphism (SNP) rates varied from as little as 3.0 SNP/kb in the coding regions, to 27.6 SNP/kb in the transposable elements. FEATURES Location/Qualifiers source 1..50231 /organism="Oryza sativa (indica cultivar-group)" /mol_type="genomic DNA" /cultivar="93-11" /db_xref="taxon:39946" WGS AAAA02000001-AAAA02050231 WGS_SCAFLD CM000126-CM000137 WGS_SCAFLD CH398081-CH401163 // humor.maf100644000766000024 10217114006427353 16255 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data##maf version=1 scoring=zero a s NM_006987 0 5000 - 5000 gcacagcctttactccctgactgcgtttatattctgcctctcccgtgcctgttggtccttggggacGGCCCTCCGGCAGGTTCTGGCCTCAGGA--------CCCCGCACCAGCCCCGGCC-----GTGCCAGCTGCCCTCCCACTGGCCTGGCCCAGCTCTCCAGAGAGGCTTCATGCAGCCAGTTCCCCAGGGAACACCGTTGCCCACACGTTGCTAATGT-TAAAACATGAATTTATTGCATCTGTAGCATCTTACCTTGAGCACGGCTTCTTTCAATGCTTTTATTTTGCAATTATAG----GTTTATTTTCTTCCTT-----TCCTCTCTCTTGAGCTTTTTAAGGTGTTTTCTGTCACTCAGCTGGAGACTTTAAATTTCCTATTTATATTAATTATCAAATTA------TTTCATATATGA---GTCATTCAGCTCAGCCCTTTTTGGCTCCTTCTTGCAACTCTTACTTGTGGATTTATTGGGCAGTTCAGCGCCTACTTTCATG------------------------GGCAGA---CTCAACTGGCGA----GATTTTAACTGGAGGGTGAGAGATGTCCT------------GGCCACGGGGCCCTGTTG-----------------------CTCACAGTC--------------------------------------CCTGGATCAGAGGA--TGGTGCC---------------------GGATGGGCAGGTGCTGAGGATGCACATACCGCCCTCGGGTTA--------GCACCGAAGGTTCTTGTGTCAGACGTGAGGCTTCCTTCCTGGGTTCTTTCTGGCTGCGCTAGTCCAGAAGACCAGCAAACCCGAGATGGTCTGAGGTGGACGATGGAGCCTGAATGGAGAGCCTGGGGCGTGAGCCAG-GGTCTGGGATACCCTGGGACAGAGTTAGATGCCCCTGCAGACGTGGATGAGCGGCTGAAGACTAAGGGAGCAGGTCACACGTGGTGACAGACAGGAGAGGCTGCTGTGCCATCCAGGGGCTGGGGAAGGAGCCCCCG-
TGGAGAGGCTCCATTTCGGCCACGTGGCTGCTGCAGACAACCGGGAGTCAGCGTCGGCACAAACACGGGTGCCTCGAAAGAGAGCAGTGCCGGCC-TGGTGTCTCCGGGCCCAGCTGCCACTGGCACAGGCCTCCGAAGGGGCAGGAGCAGGAGCAGGAGCAGAAAGCGCACA-TCAGGGCTcatgctgtgcctgtgaagatgtcgggcacgtccgttagtgtgtgtggtgcgcggtcgcctgtgaagacgactgttagtgtgtgtggtgcgtggtcgcctgtgaagacgactgttagtatgtgtggtctacggtcgcctgtgaagacgactgttagtatctgtggtgcgcggtcgcctgtgaagacgactgttagtatgtgtggtctatggtcgcctgtgaagacgactgttagtatctgtggtgcgcggtcgcctgtgaagacgactgttagtaagtgtggtgcgcggtcgcctgtgaagacgactgttagtaagtgtggtgcgcggtcgcctgtgaagacgtctgttagtaagtgtggtgcgcggtcgcctgtgaagacgactgttagtAAGTGTG-GTGCGCGGTCGCCTGTGAAGACGACTGTTAGTAAGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGTGTG----GTGCGCGGTCGCCTGTGAAGACGACTGTTAGTAAGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGTAAGTGTGGTGCGCGGTCGCCTGTGAAGACGTCTGTTAGTAAGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGTATGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGTATGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGCAAGTGTGGTGCGCGGTCGTCTGTGAAGACGACTGTTAGCAAGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGCAAGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGCAAGTGTGGTGCGCGGTCGCCTGTGAAGACGACTGTTAGTATGTGTGGTGCACggtcgcctgtgaagatgtcttttagtaagtgtggtgcacggtcgcctgtgaagacgactgttagtatgtgtggtgcacggtcgcctgtgaagacgactgttagtaagtgtggtgcgcggtcgcctgtgaagacgactgttagtaagtgtggtgcgcggtcgcctgtgaagatgactgttagtaagtgtggtgcacggtcacaCCTTCCCACACAGTGCAGGTGACttactcagggctctggatggcaaatgacaa----ggacccaattcaaactagcataagcaaaaaggcagctctgtgggcctggca-cctggggatcaaaacagtcgttcaggcacggctggatccagccattgaccatgggtatgaggtgactcaccttcatctctcagctctgtttagctgcaccggcagacaggccgtttc-aagggggagccaggtggctactggccatcgtgagctagtgccg-------tgcttaccgtcccag---------------------------------------------acagtgggtt---tattgca--------------caggaactctggcaag--accccaggaggactcagtatggctgggtctgggtcacatgcccagccctgagccaattaccctgccttgggggtgg-----------ggctattctaattggtcaTCCTCATCACATTCACCCAGGAAGAAGGGGAAGG-AGGGGCTGTTTCTCTTGCCCAGGAAGACTGGGCGTGACTACTTTGACCTGAAGAGTTATTCTCAAAAAGGAAAGGAATGCTGGGTAGACAAAATATATGCCCATTGCAGAAGGCCAGCAGGGGCCCCTGGCCTTCGCTTTGCTAAAGGTGGC-----------------AGGTGCACA--GCTGTCCGGACCCTGGAGCTCAGTGTGAGGAGTGT
AAAGGTGTAGAAGGTGCCGTCTGTGCAGCAGTGTGGCGGCGTTCTGCCTTTCTCAGCTGAATGGAAGACAAACACTCAGGCCTCTTCACACCTTCGTGACTGGTCCCCCCTGGACCTTTGCGCAGAGCTGGCCTTTC--------CATGAAGCAAACTCTGGGCACATGTGGAA-----ACCAGCCTCTTGCTGCCATGCCTGCCCCAGGGACTGACCAGGGTCGGCTCCAGTCACTGACATAGTTTATCACATCTCTTTGGCCTCTGGTCTCTGGATCCACACCTCTGGCTCTGACGTCTGCATCGAGCCCCTGTCCCTGCTGGGTTCA-AGTCTCTTTTCCAGAAGCTTCACTTGGCTCGAGGCTCCACTTGAT-TTGGTG-TCATTTGC-GTTTCTGAGCTTGGTCCTCCCGACCTGTGGTCCTGCCAGGGGAAGAGCAGCGTGGCCTGGCTGGACCCTGGCAAAGGAGGCTTCTCTTCTGCATTTCTTCTACACGGAGTGCATGAGCAGGACAAAGACCAGCTGATTCCTGCAGGCCTGGGGCTGTGGTGTGCGGGTCTCCCACAGGCCTTGGtatgtgtgtggtgtgtggtcccgcacactcagagatggcttccagcacattccacactttctttctttctttcttttttttttgagtcggagtctcgctctgttgcccaggctggagtgcagtggcgcgatctcggctcactgcaagctccgccttccgggttcaagcgattctcctgcctcagcctcccgagtagttgggacgacagttgcccacgaccatgtctggccaagcctcagccttcccaagtgctagggttacaggcgtgagccactatgcctggcctacccatgaattttattttattttttaaattttaaggcgaggtctcactctgtcacctaggctggagtacggtggctagatcatggctcacagcagcctcaaactcctagactccagccaccctcccacctcagcctcctgaatggctggggctacagacatttgccaccacacctggctaatttttgtattttcagtagagatgggtttttactatgttggccaggctggtctcgaactcctgaccttaggtgatccacccgcctcggcctctggagtagctggggttacaggagtgagccaccgtgccagtcctccttccaccctttcatataaccagcagcgcatatcagcacttctcaacatctttactgcaggacCTTTTCTTCAAATGAAGTCTTACCTGGAAGCCTGAcagatgacgaggaaaaagctgagttgctctgggttgggctgtaataacccccttgttcccagccccctgcttctgcttgcacggtcctgaagaggggctccacacccctggctccttggaacccagtttgttgggttataataaccggccttgttcccagccccctgcttcgcctgtacagtcctgaagagggtctccacaccccTGGCTCCTTGGAACCCAGTTTGAG-------AGCCTCTTGGCAATTATATCCATCTGTCTGTCTCTCTTGTGGTAGCACCTGCTGCTGCTCCCCATGGGGAAAGGTTGCTGATGGTGTTTATTTTTTTTTTAAGCATGAAAACATTTTCTTTTTTCTATCAGTAGCTTGTTTGCACTATGAAAAGGTCAACAGAGAGATCCTTGTCATCTTCCttctccctgcaggagggtgtcagggtgtaagtgctccctcgctgtgcaggggttcatttcattcatttcatTACCCTTGCCCTCCTCGAGGTACCTCCGGGAAGCTGTTCCATTTACACATCTGTCAAGTTCTCTGTGCGTCAATTTGCCTTGCTCCTGAAGAGCCACACCCAAAAGGGGCCCCACTCCAGGCAGCGGGGCTTCAGGAAGCGATGAGATGCT---GACGCAGGCCCCGTGCACCACCACTGCTG----CCTGTAAGGGCTGTTTTGGATACAGAAAATGTGCCCTTTCTAACC--
------CAAAAAA------------------------------------------------------------------------------------------------------------------------------TGCTTGAAATGTGTAAAAGTGGCCAGACT---AACAGTCCCAAAGAGGGCTGC---------CCTCTAAGAGGAAGCGTCCCAAATCTGT---TCAGTTTTAGAGACTACGTGACTGGG------GTACG-------TGGTGGGGCCTTACCAGACATCCACG---------AGGAGAATCCAGGCCTTGGTTTGGCTCCAGCTGGGCCTGCCTGGTGGCTGCCACTTATTGACTTAAGTCCCAGTGATTCAGCTCCTCATCTGGAACACCTCGGGTCACCCCCG s mm3 0 3424 + 3424 ..................................................................GACCCCCCTGTACCCACTGGCCTAAGGAGCAACCCCACCCACAGCAGCTCTGGCCCTGTCATGCCAACCACTGTCCCACTGACCCAGACTACCCCTG-AGAAGGACATTGTACAGCTAAGTCACAGAGTAACACTGC----------------ATGTGTAACACAAGCGTGTGTT-----TGTAACAT---GCCTTCACCAAGGTTTCTTCTAATGCTTTTGTTTTGCAGTTCTAGCTGTTTTCATTTCCCTCCTTGGATCTTCTCTCTCTTGAACTTTTTAAAGTGTCATA-GTCTATAAGTTAGCAACTTCA--CTCCTTACGTGTATTAATTGTCAAATTACTTATTTTTTCTATGTGAATTGCTGTTTA------TCATCTTTGCCTCCTTTTTGCAAACTTTACGTATTCGGGGATTAAGCATTTTAGCATTTGCTCTTCTATTTAGAGACTGAGCAAGCAGTGATGGCGGAGCCCTGTGCTGGTGA-TGCATTTTGGGCTGGGAAGAGAATTCTACCTTTAGCAAAGC-AAGGCCA-TGGCCTCTGCTGTCAGTATATGATGGACGTTCATTCTGACTGTCTAGCTTCCTTCTTCAGGTGTAAGTAATCATGTCGTCCTCCTGGAGAAGATAAATTGGTCCCTGCGCCTCCCTCCTTAGGAAAGGACTGGCAGGTGCTTAAG------------CCAGTGTGTTAAGCCAGGAGCACTCAGAATCCT---GTCAGCAACCCTGCTTCTTTGTCATGATCATTCTAGGCGCACTT-TCTGGAAGATGAG--------GGGTGGTCAGAAGTGGATGACAGAGTCTCCACGTACAACCTTGGAGATAGGCCAG-GGTGTGGGATGAGCCTGGGATCTCTTGGGGATCATTGGAGACATAGGTGTA-ATTTGAAGA------GAGCAGGTCATAT-----------------------------TATCCAGG---------------------AGCTGGAGCTCTCTTTGGGCCCCATAGCTACAGCAGAAGCGCAA-AGTTAGCCAAAGAATGAACACTGGAGGTTGAAAGGAAGGTGATCTCTGCCATGGCATCTCGGAGTC----------------AGGCCTCAGAGG----ACAGTTAGCTGCAGGAGCAGACAGCTCTCC-CCAAAGCT.......................................................................................................................................................................................................................................................................................................................
...............................................................................AAGTGAGCGTGTGCAGTGCATTCAGAAGAACAGAATGAGCGTGTGCAGTGTGCAGTGCATTCAGAAGAACAGAGTGAGCGTGTGCAGTGTGCACTGCATTCAGAAGAACAGAGTGAGCGTGTGCAGTGTGCACTGCATTCAGAAGAACAGAGTGAGC--GTGTGCAGCGCATTCA-----GAAGAACAGAGTGAGCGTGTGCAGTGTGCAGCGCATTCAGAAGAACAGAGTGAGCGTGTGCAGTGCAT-------TCAGAAGAACAGAGTGAGCGTGTGCAGTGTGCACTGCATTCAGAAGAACAGAGTGAGCGTGTGCAGTGTGCAGTGCATTCAGGAGAACAGAGTGAGCGTGTGCAGTGTGCACTGCATTCAGAAGAACAGAGTGAGCATGTGCAGTGTGCAGTGCATTCAGAAGAACAGAGTGAGCGTGTGCAGTGTGCAGTGCATTCAGAAGAACAGAGTGAGCGTGTGCAGTGTGC..........................................................................................................................................................................................................................AGGTGATTTTCTCAGAGTTC---ATGATCAATCACAAATACGACTccagctcaaactacctgaagcaaaaatgggactctgcgagtctggcaaactgggaaccataaaagtgggtcaggcttggctgcacccagtgactgacaatatcttcaggcgtattcgcctccttccgtcgacttctctaagcatggtttc-agcctggcagtttctgagaggcagcaagatggccgcAGGATTTCCCACACCAGCACCATCAACACTGTTTGCTTCCCACACTGCCACCTATCCACTGCTCACTTCCCACGTGGACACTACACACCAAGATGCTGTCTGTACCGCAGCTCACTGTCACAAAAGGTACACTTTCCAGTAACCCAGGGAAGACACACCTTGGC-----CTAGGTCACATATACAGCCTTGAGCCAGCCACTCT--CCAGGAAGTGGAGGAGGGTGATGGACATCTTAATAGGTCCCTC-AGTCATGTCCACCCCCAAGGAGGAGGAAGCAGGGAACAGTTTCTTTTCTCCAGGAGGTC--GACATGGTTAGTTAAACCTAAAGA-------------AGGAA------GCTGGGCAGGCAAGATGGGTCTCCATCACCTAGAGCTAGCAGAGGCCTCTGGCCTTTGCTTTGCTAAAG-TAGAGTTGTCTTTCCAGCCTAAGATGCACACAGTTGTCTGGATCTTGGTGCT........................AAGGCTGTCATCTGCTTAGCATTGCCACAG--TTCTGGGGTATACAGT----------GTGAACCCT-AGCAATCTTCAAGCCTTCTTGTGAGGGGCATCGTAGGCCTTCTCATGGAGG----CTTCCCACCACCCTGTGTGGGCAATCC----CTCACATGGAACCTTCACAAGCAGTAAGCAGCTATGCTTG-----GGG--------GAAATAGCTTT--TCCCTAACATGCTT------GTGTCCCAGGATGCTGGCCCCTAG-----CACCACTTGCTCT-----TTGC-------ACCTCTCCTTGCCAGGCTCATGGTCTCTTTG--AGATGCTCATCTTGGCTCAGATCACAACCAGATGTTGGTG-TTACCTATAGTTTGTGGACA-GGTCCATCTGGTAGGTGGAGCTGC-AGGGAATGGACAGAGCAGCCAGGCTGGGCCCCGACAAAGGAGGC...................
.............................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................CTCTTCTTCAAGTGGAGTCTTATCTGGAAAGCTGA.............................................................................................................................................................................................................aGGCTCCTTGGAACCCCGATTGAGGATTGAGAAGTGTTGGGTGGCTATGCTCATTGGTCTCTC-----------AGAATCGTTTTTTACCTCCCTGGGGATAAGGTTGCTG--------------------------GAAAACATTTTCTTCCTTC----ACTAGATTGTCTGT--TATGAAAAGGTCACTTGTGATCTCCTTGTCATCCTCC.......................................................................TGCCCCTGTGCTGGGCTAGCTGCCTTTGGGGATTTGTTCTTCTACTGCACTGCTCAAGTGGTCAGGGCCTCAGTTTCCCTTCTTCCCAAGAAGCCAGAGCCA---GAGGCTCCACTGCAGGC-----GACTCTGGTAAGCTCTGTGATGCTCACAACTTGGGCCCTG----CCAGGACGGATAATAACTTTTGAGAAATGTCT-------AAAAGCTAAGTCCTTTCTAACAACAACAGACAAACAAACCCTTCAGTGCTGGCAGGTCCCTcatgtttataattccaatcactgtgaaggatgaagcagtaagatcctgggttctaggccagcctgggctactcagcaagatcctgtctcaagaaatgacaaaTGTTTCCAATTTGTAT--GTGTCCAGACTGAAAACACATCCGCGAGGGGATGTAAGCTTCTGCCTTGGGTGGGAAATACCCCACAATTGTCTGTCAGCTGCTGCCACTGTGGGACTTGGTCTCCAGTACAATAGTGCTGGGGTGGCTTGGCCCGGCTTCCCTGCACTAGGGAGAGAGAATCCAGGCCTGGGTTTGGCTCCAACTGGGCCTTGCAGG--GCTGTTACTTATTGAGTTAAGTCAGAGTGGCTCAAATGTTGATATGCAGCACCT-GGGTCACCCTTG s rn3 0 1951 + 1951 
..................................................................................................................................................................................................................................................................................................................................................GAGCTTTTTAAAGTGTCAGA-GC----AAGTTATCAACTTCA--TTCCCTATGAGTATTAATTATCAAATTACTTACGTTTCCTATATAAATTGCCATTTA------TCATCTTTGCCTCTTTCTTGCAAACTTTGCGTGTTCGTGGATTAAGCAATTTAGCGTTTGCTTTTCTGTTTAGAGAGTGAGCAAGCAGTGATGGCGGAGCCCCGTGCTGGTGCCTGCAGTGTGGGCTGGGAGGAGAAGGCTAACTTTAGCAAAGCGAAGGTCA-GGGCCTCTGCTGTCAGTATGTGATGGACAGCCATTCTACCTGTCCAGCTTCCTTCTTCAGGTATAAGTAATCACGTCATCCTCCTGGAGAAGATAA-CTGGCCCCTGCCCCTTCGTCCTTAGGAAAGGACTGACTGGTGCTTAAG------------CCAGTGTGTTAAGCCAGGGGCACTCAGGGTCCT---GTCAGCAAGGCTGTTTCTTTGCAGTGATCATTCTGGGTGCAGTC-TCTGGAAGACAAG--------GGGTGGTCAGAAGTGGATGGCACAGTCTCAGTGTACAACCTGGGAGATGGGCCCGAGGTCTGGGAT---------CTCTCTTGGGGATCATTGGAGACATAGGTGTG-ATTTGAAGA------GA-CAGGTCAGAC-----------------------------CATCCCGGGACCACAGAGAAAGCACATGGAGCTGGATCTCTCTTTGGGCCCCATGGCTACAGCAGAAGAGCAA-AG----------AATGAACACTGGAGATTGGAGGGA------TCTCTGCCATGGTGTCTCTCAGCC----------------AGACCTCAGAGG----AGAGTTAGCTGCAGGAACAGACAGCTCCCCACCAGAGCT....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
..............................................................................................................................................................................................................................................................................................................................CTTCTCCATTCTGCCTAGGTGATTTTCTCAGAGTTC---GTGATCAATCACAAATAAGAGTCCAGCTCAAA--------AGCAGAAATGGGACTCTGTGAGTCTGGCAACCTGGGAACTGTAAAAGTGGGTCAGGCTTGGCTGCACCCAGTGACTGACAATATCTTCAGGCATATTTGTCTTCTTCTGTTGGCTTCCCTAAGCATCATTTCCAGACTGGCTGCTTCTGAGAGGGGGCAAGATGGCCACAGGATTTCCCATGCCAGCACCACCAGCACTGTTTGCTCCCCACG---------------------------------------------ACAACACTGT---TACCACAGCTCACCGTCATGAAAGGTATACTTCTCAGTAACCCAGAGAAGACAAACCCTGGC-----CTAGGTCACCTG--CAGCCTTGGGCCAGCCACTCT--CCAGGAAGTGGAAGAGGGTGATGGATATCTTAATAGGTTCTCC-AGTCATGTCCAACCCTAAGGAGGAAGAAGC--------------------CAGGAGGAC--AACCTGGTTACTTAATCCTGAAGA-------------AGGAA------GCTGGGCAGGCAAGATGGTTGTCCTTCACCTAGAGCTAGCAGAAGCCCCTGGCCTTTGCTTTGCTAAAGGTAGAATTGTCTTCCCAGCCCAAGATGCACA--GCTGTCTGGACCTTGGTGC------TGCAGGGTTGAA-----TTAAGGCTATGATCTGCTTAGCATTGCCACAG--TTCTGGGGTACACAGT----------GTGAACGCT-AGTAATCTTCATGCTTTTGCGTAAAGGGCATCTTAGGCCTTCTCATGGAGG----CTTCC--------TGTGTGGGAAATCC----CTCACACGGCACCTTCACATGTGATAAGCTGCTATGCTTG-----GGG--------AAAATAGCTTT--TCCCTAACATGCTT------GTGTCCCAGGATGCTGGTCC-TAG-----CACCCCTTGGCCT-----CTGC------TACCTCTCCTTGCCAGGGTCACAGTCTCTTTG--GGATGCTACTCTCAGCTCAGATTTCAACCGGATGTTGGGGGTCACCTCTAGTTTGTGAACA-AGTCTATTTTGTGGGTGGACATGC-AGGGACGGGACAGAGCTGCCAGGCTGAGCCCTGACAAAGGAGGCACC----CGGTATGGCTTCGGAGCTCTCTGCACGCGCACAGCAA----CTTCCTAGTCTTGCTGCTCTGTGGCAGCTGCA--------CCCCACAAGCCCTGG.........................................................................................................................................................................................................................................................................................................................................................................
................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................AAGTTCCTTGGAACCCCATTTGAG-------AAGCTCTGGGTGATTGTGTCCATTGGTCTCTC............................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... 
a s NM_018289 0 5000 + 5000 tttgtccatgttggtcaggctggtctcgaactccccacctcaggtgatccacccacctcggcctcccagagtgctgggattacaggcgtgagccaccgtgcccagcctggaagttgttttttttGAGCCCAGCTCATTTGTGAAAATATCACGTGCTCCTTATTTTCATACCCAAATTTCTCATATCCATAAACTTCCACATTTTATTTCTGTCAGAAGTATACAATCTCTTTCTAGATTTCCACCTCCTGTCAAAGGTATTTCCTTCTTTTGTTTGACATAAATGAATCAATCTAGTCTGTTATTACTTA----AAGAAGTT--------TTTTAAAACCCA----------------------------------------------------------------GTGTGT----------------------------------------------------------------CAACCTATCTCTC--------------------------------------------------------------------------------------------------------------------------------------------------------------TGCTCTTTTAACTTCCTTTACTTCTTTGTACTGCATAACTCTCCTAAACTCAAACTTATATTATTAGTAGCTT----ATCTGCAAAGAGGGAGAAGGGGTATCATGGAGTTGGGAGCTCCAGAGGAGGCAAAGGTTTTGGGTCACGGGAGAGTGCCATTGAGAACGGACAATGGATGCTGAGAAGGACAATGAGAATCAGTGGAAATAAGGCAACAGGAATTTGTGCTGAACATGGTCACCCTGGTTTCATGATTCTCCACAACAATATGGGAAGCTCAGAGGCAAAGAAAAAAACAAAAAACAAAAAACAGAGAGACCAGGGCTGATGGCAGGCATGGtaaacct-ggtcatgcccctcctctgcttacagtccgtcagaggcttcccattccttcaagaagaccttccagatccttcaccgtccggcagtgcccatttctccagcttcctcgtgtatcactctcccttgggctttcaaacttccatcaccctggccgactcttttcatactcatggcttttgcacattctgttccttttatctatgtgttatcttcacttccaacccttggcttcagcttaaaaa--tcacttcctcagaaaagcctttcctggccgggcttgatggctcacacccgtaatcccagcagttagggaggtcaaggcaggcggatcacctgaggtcaggagttcaagaccagcatggccaacatggagaaaccccgtctctactaaaaatacaaaaattagccgggtgtggtggcgtatgcctgtaatcccagctactcaggaggctgaggcaggcgaatcacttgaatccaggaggcggaggttgcggtgagccgaggtcgcgccattgcactccagcctgggcaacaagagagaaactctgtctcaaaaaaaaaaaaaaaaaagaaaaaaagaaaaagaaaaagaaagaaaaggctttcctgacccctaatccaatcagaactctaaccatatagtctctcaacatgtgctttacttctcttttatcggatctatcacaattatatgtttatttgtacaaatgcccttttcatacatagaaatagctctgtagaggcaagcatggggtctattttgttcactgctgaattcccggcacctagcttgagtacctgtcgctaacaggaacaggtgttcaataaatgtattgactagcttaatgaaCGGCAGGAAGACTtaggtatacggccttaggcaaatttttattttgttgcttgttttaactagatcttca-tctccta-tctgttagatgggtatagtaataaaaactaacaacctc--agaattgctg
tgtgaattaaaagagacaatgcatgtaacatgcttcgaatggtacctgtcatatagtaagggctaggaaattttccttcttattATCATCATTAAAAATAAGCTCTCTgagcaaacagaactctcatacattgctggtagcaatgtgaaagtgatacagccactttggaaaacagtttaccagtttcttagaaagttaaacacagacatatcatataacacagcaatcccacatctacatatttacccaagtgacatgaaaacttatgttcacacaaaaatctgtacatgaatgtttatagcagctttattcataattgccaaaaactggaaa-aacccatatgtccttcaa-ccaggga-ctagataaacaaatagtggtgtattcatctagtggaataccagcaagaagacatgaa-ccacagatacatgcaacaacatggatgaattttagatactttatgacagatgaaagaagcca-gactgaaaaggtgtcatactgtatgattccatc-----------tatatgatattctggaaatatcaaaactatagggacaggccaggcgtggtggctcatgcctgtaaccccagcactttgggagcctgatgctggcagatcacctgaggtcaggagtttgagaccatcctgggcaacatgatgaaaccctgtctctactaaaaatacaaaaattagccaggtgtggtggcacgtgcctgtcatcccagctactcaggaggctgaggcaggagaatcgcttgaacccgggaggcggaggttgcagtgagctgagattgcaccactgcactccagcctgggtgatagagcgagactccgtctcaaaaaaacaaaaacaaaaacaaaaactatagggacagaaatcaagtcagtggttgccagtggatggggtgaggcaagagggaatttggggga--atgaaattgtcctataactagattgtggtagtagttacatggttatatacatttatttattttgttttattattgttttcttattatttgtttgtttgttttgagatggagtcttgctctgtcacccaggctggagtgcaatggtgcaatcttggctcactgcaacctccacctctcgggttcaagtgattctgtgcaccaccacggccagctacgttttgtatttttagtagatagagacgggttttaccatgttggccaggctggtctcgaacccctgacctcaagtgatctggcagcctcggcctccccacgtgctgggattataggcgtgagtcaccatgcccagtcgtttatacacatttatcaaaactca----ctgcatcataaattgcaag-agtaaattttactgtatgtaaatatcttaatTTTTTAAAGGAAAAAAAAAGTCCAAAATAAGTAGCTATAATTACCTTTTCCTTTTAGAAAAAGGCATTCATATTTTCTGAGCATGAGAGAACTACAGG---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------CTCATTGTAGACATTTGGGAAAATGTACAAAGGATGTACATTCATCACGTGCAGAAAAGTAATATTTCCTTGCTGTGAGCTTT--ATTGGCACAG-------------------------------------------------------------------------------------------AATACACacaaagctgggaccatgctatgcattttttaaatgaaattatacattttgctttctgaaatatattgtgcatat-ttttcatgtcattaa----ttttcaaaatcatgattttatgaatcaataatattctatgttta
cttaagagacagagtctccctttgtggcccaggctggagtgcagtggcgcaaaggaatcattcctcactgcagcctcaaactcctgggctcaagcgatcctcagtggtgggatgagatctagacccctgatattcaatctcagaaagtcaataattccctgcctcagcctcctgagtagctggggctgcaggcctgtgccactgcaccaggttcaatgtatattttattttgtctttTTTTTCTTTTTTTAATGTATATTTTTTAGCA---TTCCTTTGTTGTTAGAAATTACATTGtttatata--------------------------------------------------attattatttgtcaattaaaaa----------taaaagaaaaaatCATACTGCTCCTATTTTTTACTTTGAGCAACTCTGATAACA-------------ACAACAAAAAAAAGTTCAGTGGGGCCAACGCGTTTCCCTCCTGGAGTTTCCCG--GTTCACCTGGTGGAGCGGGTCCTCCTTGGGCTGCAATTCCCTCCTTCTTTTCAGTGTCTGCAGGAAAGGTGGTTTCACAGGGTGATCGGTGATCTGGCTTATTTGTTCCTTTGTGGGGGTAAGGGCTGGGCACCTCACTATTTCCTGTATTACCAGCTTGAATTTACATCGCGCTGTTAATGTTTAATCCTCTCCACATCGA----------CGCGGGGTag-------taatttgttccaagtcacacaacttatcaatggtggactattcaggacttagatcccggatattcaacctcag----tAAACGTCTCCCCAGCTCCAGGGCGCTGCGTCATCTGGCTTTAAAAAAACGTTTCTggccgggcgcagtagctcacgcctgtcatcccagcactttgggaggccgaggcgggcggatcacgaggtcaagagttcgagaccagcctggccaacatggtgaaacctcgtctccactaaaaatacaaaaattagctgggcgtggtggcgggcgcctgtaatcccagctactcgggaggctgagacaggagaatcgcttgaacccagcaggcggggttgcagtgagccgagatcgcgccgttgcactccagcctgggcaacaagtaagactccgtctcaaaaaaaaaaaaaaaaaaaaaaaagaCGTTTCTCCCCA------TAAGGAGGCTGCAGTCACCTCGAAAAGCTTCTGCGCAGAGCCCGGCGCACGGGGAATTTCTATCGAGTGG----GCACTGTCCCGTCCTACGAATGTCCTTGGCGACATCGCCACTCGCCAGCTGGGATCACGTCGCCCCCG-TGGCCGGGG-ACACCCGCTGCCACTGACCGAGGGGGCCCCGGGAGCTGAGAGGTCGCCG-CTGCGCTGCCCGCGGAGGCTGTGGGCTGCGGTTGGCGACGCCG-CAGGGCTG-ACTTTCATCCGGGCGTCCT-GCTGGA-GGCCAGACCCTACCCCAACCTCGGGGTCCTCAGTGCGGGGCGCGCCTTGGG-GAGGGAAGAGGCCCCGCCCATCCTAAGGACGCGCTGAGGCCACGCGGGGCCGCCGGGATACTGCGTTGCCCGGGAGACACCT---GACGTGACAGGAACGCC-GCCAGTCTCAGTCCGCCGTGATCCCACAGTTCCCCGGT-CCCGGAAGTGCCCTGGCCTGCCGGAAGTGCcggagcggctgacagagcggctgacggagcCGGGCTCACCAGGTCGCTGCCGCGAGGGAGTTGCTGTGCTGGGGCCTGGGT s mm3 0 3181 + 3181 
............................................................................................................................AAGCCCAGTTCATTTGTGAGACTATCACACGCTC---ATTTTCATAGACAAATTATTCTTCTCTATAAACTTCCCCCTTATATTTATCTTTAAAACATAACATCTGTTTTT----------CTCCAGCCAAAGTTATTTCCTG-TTTTGCTTATTATAAA-----CAGCTTGGCCTATTGTTGCTTATCCTAATGCATTAAAAAACTTTTAAAATTAca..........................................................................................................................................................................................................................................................................................................................................................................................ATTTGCAATTAAAGACAATATAATA-TATAATGGAGTTGGGAGCTTCAGAAGGAAGAATGGTTTCCAGTTGTAGCAGATA-----TAACCACTGAAAATGGATACACAAGAGGACAAC------CAGCAAGAACCAGACAGTGTGAATGCAGCCTGGACATAGTAGCCTTGGTTTCA-GACTCTCTGTAAGAGTGCAGGAAGCTCAGACACAGAGAAAAAAACC------------CAGAGGGCC--------ATGACAG---CAGTGAGCCTAGGTCA--CCACTCCTC-GCCTCCAGTCTTTCAAAGGCTATCCATTTCCCTAGGACAGCTCTC---AACACTAAGGAGCTGGCTCCACCCAGTTCTTTAGCCTCTACTCCAACC--TCTGCTCTTGACTTTCAAAACTCTTT--------TCAAGTCTTTTCTTCCCTATGGCTCT-GCACATATTGTTACTTGGGTTT--GTCATGCCTTCTCTTCCGAA---TTgcttcagcttaaaaa--tcactctcttggagaaagcttcccc...........................................................................................................................................................................................................................................................................................................................................gcattccgcaccc-------------------------acactctctcagtgtgcactgtcctgacctagcttgggacttatcacgattacatctttatt-gtatgaatgccatctccttacatgaagttagatgtga--tgacaggcgtttgatgtatctggttctttgctgaat--tcagcaccgagcataggtgcctgtcatcagcaggaacaggtgttgaatacatttattaattgagtgaaGGAGCAACAGAA-------GCCAGAGGGTCTAGGG--------TGCTTGGTT---TGTTTGAACCAGCTctccattctctca-tctgttaaattgggattatggtaa----ttaaactct---ggaactgctttgtggattaa--gagtcaaggc-tgtag
tatgcttggtatg-tacatgacatatagcaagcattaggaaacattagct---attatCACCATTAAAAGGAAACTCT--gagtatctgaaactcttagacactgtcagtaggaatgtgaaaataacac-----cctt-------------------ccttataattgtaaacacagacagaccataccacttagcaatcttactcctaggcatacacttaagcgaatgaaaaactta------cagaaaactttagacaggaacgtttatagcagctttattcacaacgtctcagaatttaaaacaactcggatgttttccaaatcaccta-gtagatagacaaat---------------caacgga------gcaagaa---acaaa-ctgatggcacaaataataacacagatgaatcacagatccacgatgctaagtgagagaagacaagagtcaaaaggtgacatgctgtaggactccattcctggagaatttatatgacattctggaaa.........................................................................................................................................................................................................................................................................................................................................................GCCAGTGATTTGCAgacaatagagtg-ggtgtgaagga-----ggggatgatggaagtgtctcc--gccggactgtgct-gtgctgacacgatgaca....................................................................................................................................................................................................................................................................................................................acatgcatttatcaaagctcatgtacagcactggaaaccggaaacggctaatctcactgcatgcaaat-tctatctaatttaaaaTAGAAAAATAAGGCACGATAA--AACTGCACATTCCTGTTCCTTCAAGAAgACAGCACCTATGGTTTCTGATAATAAAAGTACCATATGGAGAAAAAAAAAAAAAAAagctgggcagtggtggcgcatgcctttagtcccagcacttgggaggcagaggcaggcagatttctgagttcgaggccagcctggtctacagagtgagttccaggacagccaaggctacacagagaaaccctgtcttgaaaaacaaacaaacaaacaaacaaacaCAAAAAAAAGTACCATATGTTGATTGTAGATGTGTGGGAAAATAAAGAAAAAATATA--------------------------TTTCCCTGTCATTTGCTTTTGGTTGAGACAG-------------------------------------------------------------------------------------------..............................................TAAATATTCTGCTTTCTGAAATACACTGTGCATATATCGCTATGTCATTAAATAATTGTCAAAATAATACTTTTATGAGTCAATA..........................................
.................................................................................................................................................................................................................----------------------------------TGTTGCTGTTGCTAAAACTTATACTACTCACACATTTTTGGGGGGCAAACTAAAACAACAATAAAATTCCATTCCTAAAACTATAATGTTTTTTAAAAACTAAAAA----------AAAAAAAGCAAATTATATTATTCCTGA-----------AGCAATTTTTATAAAACTATATGTACGGAAAAAAAAAAAAAAGTTCAGCAGGGCCAGATACTCCTCTGTCGAGGTTTTCACAATGTTGAACTGGTAAGGCAACCCTTCCCTTGGCTGCAATTCCATCATT-GTTCCAGTGCCTGCTGGAAAAGCGGTTCCACAGGGCTATC-------TGGCTCCTCT----TTTTGTAATGTACGCGGCTAAACACCCTAACATTTCCTGTACTACCTGCCTAAGTTTGCATAATACT-TCCATATTTAATG-TCACCCCATCTAACGTAGTGTAAGCAGGGCAGATGGGAATACTTTGTTCAAGATCATA-AGGTTATCAAGGGTGTA--AATAAGGATGTACAGCCCGGACATTCGGTCTCAAAAACCAAACACTTCCCA---------GCGTTGGGTCGCT----------------TTTCT................................................................................................................................................................................................................................................................................................................CCTTACTCCCCACCCTAGCGAGGAGGCTTCAGACAA----------TTCTGCCCAG--------------TAGATCCTTATCTAAGGAGCAGAAGCTCTCCCAAACCTCAAATGTCTTTGGCAACACTGCCACTTACCAGCTGGGGTCATTTCACTCCCGATGGCAGGGGGACACAGACTGCCCTAGACCAAGGGGATCCAGCGAGCGGGTTGCACGTTGGCTTCATGGCCAGCGGGGCTTATTGCCCATCCATTGGGACACTGCCGTGGCTA-CCGCTCTCTGGGGCGCCCG--------GGC--GACACTAC------------GGCGCTGGTGGGGGATGCGCCCTGGGCGAAGAAGAGAGGGCTTTCAATCTTAAGAACCCGCCGCGTGCGAACAGGGAACTTGAGGTTAAGGGTTGTCTGGAGATTTTCCAGGGACTTGACAGCCATGCCTGCGGAAGTCAGTCCTCCGCTATCCCAGAGTCCCCCGGTTCCCGGAAGTGCCCAGTTCT-CCGGAAGTGCCAGGCTAGCCGAGTAAGTAGC-----------GGCTCCCTCGAGCCCCGTACCTGGAGGCTTGCTCTTCTGGGGTCCTGGT s rn3 0 3320 + 3320 
............................................................................................................................AAGCCCAGTTCATTTGTGAGACTATCACATGCTC---ATTTTCATATACAAATTATTCTTATCTGTAAATATCCACATTATATTTATCTTAAAAACATACCATCTGTTTTT----------CTCCAGCCAAAGGTATTTCCTG-TTTTGCTTATTATGAA-----CAGCTTGGCCTATTGTTACTTATCCTAATGCATT--------TCTAAAATTACATTTATTTATTTACTTATATTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTGCTCCCTGAACTGCATATCAAGACCCTATTCAGTGGACTCGCAGGCAGTTAGGGAACTGTGTGCAGCCTGTCTCTGAAGAAATATAAGAGTGAGGTGGAGGCAAATTATGATATTTGGTTCTCTCCTTTTACCATGTGGGTTCAGGGACCATACTCAGGTCACTAGGCTTGGTGGCAAGTGTCTTTCCCACTGAGCCATTGGCCACAACAGTTTTTAAATCTAGGGTCTACCTGTGTTCTTTTAAAACAGCTTACTTTTT---ATTTTACAAACCCCCTACATT-GAACTTGCATT--TGATA--TT----ATTGTTAAAGACAATCTAATG-------------GAGAACTTCAGAGGGAAGAGTGGTTTCGAGTTGTAGCAGATAG----TGACCGCTAAAAATGGATACGCAAAAGGACAAC------CAGCAAGAACTAGAC--TGTGAATGTGGGTTGGGCATGGTGGCCTTCATTCCA-GGCTCTCTGTAACAGTGCAGGAAGCTCAGAAACAGAGAAAGAAAAA------------CAGAGGACCCTGAGCTGATGACAG---CAGTGAACCTGGGTCA--CCACTCCTC-ACCT--AGTCCTTCAGAAGCTATCCATTTCCCTATGACAACTC-----AACACTAAGCATCCGGCTCCACCCAATTCTTTAGCCTCTGCTTGTACC--TCTACTCTTGACTTTCAAAACTCCAT-GTCCTAGCTGTGTCTTTTCTTCCTCATGTCTCT-GCTCACATTGCTACTTTGGTTT--GTCATGCCTTCTCTTCCAAA---TTGCTTCAGCTTAAAAAAATCATTTCCTTTGAGAAAGCTTCCCC........................................................................................................................................................................................................................................................................................................................AAAG---------------GCATTCCACGCCC-------------------------GCACTCTCTCAGTGTGCACTGTCCTTACCTGGCTTGGGACCTATCACAATCACATCTCTATT-GTGTGACTACTATCTCCTTATGTGAAGCTAGCTTTGAGATGGCAGGCATATGATGTATCCAGCTCTTTGCTGAAT--CCAGCAGCCAGCATAGGTGCCTGTCATCAN---------GNGTTCAGTATATTAATTANNTGGGTGAATGAGCGACANNN-------GCCATANGATCTNGNGC-------TGTTTTGTT---TGTCTGAACCAGCTCTCCACTCNCNNNNTNTNNTAAGTTGNGANNATGGTNA----NNNNAANCTNNNGNNNCTACTTTGTGGATTAA--GAGTCAAGGC-TGTAG
TATGCCTGGTATG-TGCATGACATATAGTAAGCATTAGGAAACATCCGCT---ATTATGGCCATTAAAAGGAAACTCTCTGAGTATCTGAAACTCTTAGACATTGTCAGTAGGAATGTGAAAATAATAC-----CCT--------------------CCTTATAATTGTAAACACAGACAGACCTTACCACTTAGCAGTCTTACTTCTAGGCATACACTTCAGCAAACAAAAAACTTA------CACAAAACTGTAGACAGGAATGTTTATAGCAGCTTTATTCACAACTTCTCAGAATTTAAAACAATG-------TTTCCAAATCACCTAAGTAGATAAACAAAT---------------CAAAGGA------GCAAGAA---ACAAAACTGATGACACAAATAATAATGTGGATGGATCACAGATCCATTACGCTAAGTGAGAGAAGACA-GAGTCGAAAGGTGACATGCTATAGGTCTCCATTCATGTAGAACTTATATGACATTCCGGAAA.........................................................................................................................................................................................................................................................................................................................................................ACCAGAGATTTCCAGACAATAGAGTG-GGTGTGAAGAAAAGTAGGGGATGATGGAAATATCTCA--ATTGGACTGTGGT-GTGCTGAGATGACAACATGCATTTAAT.................................................................................................................................................................................................................................................................................................................................................................................................................................................GACGGCACCTATAGTTTCTGATAATAAAACTACCGTATG---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------TTCATTATAGACATGTGGGAAAATAAAGAAAAAATAT----------------------------TTCCTTGTCATTTGCTTTTGGTTGAGAAAGTAAATTCCATATCCACCTACCCTCCATCTTCCAAGGGTTGGGATTAAATATAGACTTATACCATCACACCCAGCCAAATTTGCTCTCAAAAAATAAATA--------------------------------------------TTCTGCTTTCTGAAATACACTGTGTATACATTCCCATGTTATTAAATAATTTTCAAAACAATG---TTATGAGTCAATG..........................................
.................................................................................................................................................................................................................CTCTGCACTTCTGCAATTATATTTTTTAAGCAGTTTCTTTGGTTGCTAAAAATTATACTACTTACACATTTTTTTGG------------------CAAA---------CCAAAACAAAATTCCATTCCTAATACTGCAATTTTTTTTAATGAAAAGAGCAAATTATATTATTCCTGC-----------AGAAATTTCGATAAAACTATATGTCAG--AAAAAAAAAAAAAGTTCAGCAGGGCCAGATACTCCTTGGTCGAGGGTTTCACAATGTCAAACTGGTAAGGCAACCCTTCCCTTGGCTGCAATTCCCTCATT-GTTCCAGTACCTGCTGGAAAAGCCGTTCCACAGGGCGATC-------TGGCTCCTCT----TTTTGTGATGTAGGAGGCTAAACACCCTAACATTTCCTGTACTACCTGCCTAAGTTTGCATAATACT-TCCATATTTAATGCTCACCCCATCTAACGTAGTGTACGCAGGGCAGATGGGAGTACTTTGTTCAAGGTCGTA-AGGTTATCAAGGGTGTA--AACAAGGATGTACAGCTTGGACGTCCAGTCTCAAAAATCAAACACTTTCCT---------GCGTTGAGTCATTTTCCC................................................................................................................................................................................................................................................................................................................................CCCTACTCCCCACCCTAGTGAGGAGGCTTCAGACAA----------TTTTGCGCAG--------------GGGATCCTTATCTAAG------ACACTCCCCCAAACTTCATATGCCCTTGGCAACACCGCCACTTACCAGC--GAGTCATTTCACCCGCTATGGTAGGGGGTCACAGACTGTTCCAGACCAAGGGGATCCAGCGAGTGGGTTGTATGTTGGCTTCGTTGCCAGAGGGGCTTACTGCCCATCCATTGAGACACTGACGGGGCCAACCGCTCTCTGGGGCGCCCCCGCTCTCCAGC--GACACTAC------------GGCGCCGGTGGGAGACGCGCCCTGGGCGAGGTGGGGAGGGTTCCCAATCTTAAGAACCCGCCGCGTGCGGCCAGGGAACCTGTGGTTAAGGGTTATCTGGAGATTTTCCAGGGATTTGACTGCCATG---GCTGAAGTCAGTCCTCCGCTATCCCAGAGTCCCCCGGTTCCCGGAAGTGCCCGACTCT-CCGGAAGTGCCGGGCTAGACGAGCGAGTAGC-----------GGCTTTCTCGAGCCCTGTACCTGGAGGATTGCAATTCTGGGGTCCTGGT masta.dat100755000766000024 373314006427353 16204 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>m1seq pos 1311 CAGAAAAATTGAATTCCCACCCCCC CAGAAAAATTGAATTCCCACCCCCC CCTTAAAATAAAATCCCCACCACCA CAGACAAAGACATTCCACAGCTCCC 
CAGAAAAATTGAATTCCCACCCCCC CAGAAAAATTGAATTCCCACCCCCC CCTTAAAATAAAATCCCCACCACCA CAGACAAAGACATTCCACAGCTCCC >m1logs -865 208 -865 -865 156 8 -865 -865 -865 -865 157 -4 156 -865 -865 -4 156 8 -865 -865 197 -865 -865 -865 197 -865 -865 -865 197 -865 -865 -865 -865 -865 -1 154 97 -865 -865 96 -3 8 99 -865 197 -865 -865 -865 156 -865 -865 -4 -865 -865 -865 196 -865 108 -865 96 -865 208 -865 -865 -3 166 -865 -865 -865 208 -865 -865 197 -865 -865 -865 -865 166 -1 -865 -865 208 -865 -865 -3 108 -865 -4 -865 208 -865 -865 -865 208 -865 -865 -3 166 -865 -865 >m1freq 0.000635 0.998096 0.000629 0.000640 0.748765 0.249966 0.000629 0.000640 0.000635 0.000589 0.748759 0.250017 0.748765 0.000589 0.000629 0.250017 0.748765 0.249966 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.000635 0.000589 0.250006 0.748770 0.499388 0.000589 0.000629 0.499393 0.250012 0.249966 0.499382 0.000640 0.998141 0.000589 0.000629 0.000640 0.748765 0.000589 0.000629 0.250017 0.000635 0.000589 0.000629 0.998147 0.000635 0.499343 0.000629 0.499393 0.000635 0.998096 0.000629 0.000640 0.250012 0.748719 0.000629 0.000640 0.000635 0.998096 0.000629 0.000640 0.998141 0.000589 0.000629 0.000640 0.000635 0.748719 0.250006 0.000640 0.000635 0.998096 0.000629 0.000640 0.250012 0.499343 0.000629 0.250017 0.000635 0.998096 0.000629 0.000640 0.000635 0.998096 0.000629 0.000640 0.250012 0.748719 0.000629 0.000640 lucy.qual100644000766000024 530014006427353 16232 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>TaLr1010B10R 0 0 0 0 0 0 0 0 0 0 15 16 25 32 39 39 34 34 34 32 32 29 34 26 26 29 29 32 34 39 35 35 35 39 39 35 35 35 35 35 35 39 40 40 40 40 40 45 45 45 40 40 51 51 51 45 51 51 51 40 40 42 46 45 45 56 45 45 40 40 40 40 40 40 40 40 40 40 40 40 40 40 40 45 45 51 51 51 51 42 43 43 56 46 51 40 35 35 35 35 35 35 35 37 40 40 40 42 56 44 56 50 56 56 56 46 46 42 42 42 46 37 37 34 30 30 42 50 42 40 40 45 45 45 45 40 40 40 40 42 50 42 43 43 
43 42 42 42 56 50 43 56 40 40 37 37 37 43 40 40 40 41 41 41 41 41 41 41 50 56 56 50 50 50 56 50 50 50 56 56 56 42 46 46 42 42 42 43 43 56 56 56 43 43 41 41 41 41 41 41 51 51 56 56 56 50 50 50 46 45 45 45 41 41 41 41 44 45 41 43 43 46 41 56 56 56 51 45 45 45 45 41 41 42 41 41 41 41 41 43 43 42 42 56 51 43 43 41 38 38 38 42 43 43 56 56 46 42 42 42 42 56 50 50 42 42 42 50 50 41 41 42 42 43 42 42 41 37 35 38 40 40 38 38 38 40 40 40 41 42 42 42 42 42 42 41 41 41 41 41 43 38 40 38 40 40 43 50 50 35 35 30 33 32 35 41 41 35 35 32 35 40 41 44 45 45 41 40 37 37 35 37 41 42 42 42 42 41 40 40 40 40 40 42 43 56 42 40 35 35 35 37 40 40 40 42 42 42 42 42 42 36 35 35 35 36 39 42 42 42 42 37 35 35 35 32 32 32 40 40 40 40 42 56 50 56 50 50 50 56 43 40 40 40 40 36 42 37 48 33 33 33 35 35 32 32 32 32 37 42 34 31 31 35 33 33 33 39 35 37 35 32 32 33 33 37 35 35 36 35 35 37 44 42 48 48 44 44 42 33 31 24 24 30 24 31 28 29 31 31 36 30 33 33 42 30 34 34 36 36 34 34 34 30 36 30 30 28 25 25 29 29 29 35 32 35 35 35 35 35 33 40 40 32 29 25 23 23 23 19 19 19 20 17 17 17 25 25 33 33 35 40 33 35 35 37 37 35 42 42 42 40 40 56 56 56 56 47 47 42 40 40 30 30 21 22 22 27 27 28 28 26 26 24 16 12 9 14 13 14 16 22 22 19 12 10 12 9 9 11 13 12 21 25 25 29 24 25 28 28 28 27 24 11 11 10 17 20 29 26 25 25 18 18 18 24 19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 test2.raw100644000766000024 55514006427353 16135 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataMVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVV VMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVK KLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQK QHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGN NWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGND FGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY M12730.gb100644000766000024 1472614006427353 15531 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS PNECG 2355 bp DNA circular BCT 21-APR-1996 DEFINITION Plasmid pNE131 (from S.epidermidis), complete genome. ACCESSION M12730 VERSION M12730.1 GI:150829 KEYWORDS MLS antibiotics resistance; complete genome; macrolide-lincosamide-streptogramin B resistance; plasmid; replication and maintenance protein. SOURCE Plasmid pNE131 ORGANISM Plasmid pNE131 other sequences; plasmids; Plasmid. REFERENCE 1 (bases 696 to 1586) AUTHORS Lampson,B.C. and Parisi,J.T. TITLE Naturally occurring Staphylococcus epidermidis plasmid expressing constitutive macrolide-lincosamide-streptogramin B resistance contains a deleted attenuator JOURNAL J. Bacteriol. 166 (2), 479-483 (1986) PUBMED 3084450 REFERENCE 2 (bases 1 to 2355) AUTHORS Lampson,B.C. and Parisi,J.T. TITLE Nucleotide sequence of the constitutive macrolide-lincosamide-streptogramin B resistance plasmid pNE131 from Staphylococcus epidermidis and homologies with Staphylococcus aureus plasmids pE194 and pSN2 JOURNAL J. Bacteriol. 167 (3), 888-892 (1986) PUBMED 3091582 COMMENT Original source text: Plasmid pNE131 (clone: pNE131.) DNA. 
Draft entry and clean copy sequence for [2] kindly provided by J.T.Parisi, 04-AUG-1986. Ribosome binding sites are located at positions 754-759, 769-773, and 1944-1948; a -35 region at 1889-1899, and -10 regions at 733-738, and 1927-1932. FEATURES Location/Qualifiers source 1..2355 /db_xref="taxon:2608" /mol_type="genomic DNA" /clone="pNE131." /specific_host="Staphylococcus epidermidis" /plasmid="Plasmid pNE131" /organism="Plasmid pNE131" CDS join(1959..2355,1..92) /db_xref="GI:150830" /codon_start=1 /protein_id="AAA98295.1" /translation="MKERYGTVYKGSQRLIDEESGEVIEVDKLYRKQTSGNFVKAYIV QLISMLDMIGGKKLKIVNYILDNVHLSNNTMIATTREIAKATGTSLQTVITTLKILEE GNIIKRKTGVLMLNPELLMRGDDQKQKYLLLEFGNFEQEANEKQENALYLIIILSRTS IT" /product="replication and maintenance protein" /transl_table=11 gene 706..1550 /gene="ermM" -35_signal 706..715 /gene="ermM" -10_signal 733..738 /gene="ermM" mRNA 746..1550 /gene="ermM" CDS 780..1514 /protein_id="AAA98296.1" /gene="ermM" /transl_table=11 /db_xref="GI:150831" /codon_start=1 /product="macrolide-lincosamide-streptogramin B-resistance protein" /translation="MNEKNIKHSQNFITSKHNIDKIMTNIRLNEHDNIFEIGSGKGHF TLELVQRCNFVTAIEIDHKLCKTTENKLVDHDNFQVLNKDILQFKFPKNQSYKIFGNI PYNISTDIIRKIVFDSIADEIYLIVEYGFAKRLLNTKRSFALFLMAEVDISILSMVPR EYFHPKPKVNSSLIRLNRKKSRISHKDKQKYNYFVMKWVNKEYKKIFTKNQFNNSLKH AGIDDLNNISFEQFLSLFNSYKLFNK" terminator 1513..1546 /gene="ermM" /note="putative" BASE COUNT 911 a 359 c 371 g 714 t ORIGIN 1 bp upstream of TaqI site. 
1 tcgaatttgg gaactttgag caagaggcaa atgaaaaaca agaaaatgca ctttatctga 61 ttattattct ttcaaggact agtataacat aaatcgtcta caaatagaca aaaaacctgc 121 acgcttaatg tagatcaaaa gcttaacgca aatgaaatag attgacctcc caataacacc 181 acgttagtta ttgggagtca atctatgaaa tgcgattaag ctttttctaa ttcacataag 241 cgtgcaggtt taaagtacat aaaaaatata atgaaaaaaa gcatcattat actaacgtta 301 taccaacatt atactaattg cttattccaa tttcctattg gttggaacca acaggcgtta 361 gtgtgttgtt gagttggtac tttcatggga ttaatcccat gaaaccccca accaactcgc 421 caaagctttg gctaacacac acgccattcc aaccaatagt tttctcggca ttaaagccat 481 gctctgacgc ttaaatgcac taatgcctta aaaaaacatt aaagtctaac acactagact 541 tattttcatt cgtaattaag tcgttaaacc gtgtgctcta cgaccaaaag tataaaacct 601 ttaagaactt tcttttttct tgtaaaaaaa gaaactagat aaatctctca tatcttttat 661 tcaataatcg catcagattg cagtataaat ttaacgatca ctcatcatgt tcatatttat 721 cagagctcgt gctataatta tactaatttt ataaggagga aaaaataaag agggttataa 781 tgaacgagaa aaatataaaa cacagtcaaa actttattac ttcaaaacat aatatagata 841 aaataatgac aaatataaga ttaaatgaac atgataatat ctttgaaatc ggctcaggaa 901 aagggcattt tacccttgaa ttagtacaga ggtgtaattt cgtaactgcc attgaaatag 961 accataaatt atgcaaaact acagaaaata aacttgttga tcacgataat ttccaagttt 1021 taaacaagga tatattgcag tttaaatttc ctaaaaacca atcctataaa atatttggta 1081 atatacctta taacataagt acggatataa tacgcaaaat tgtttttgat agtatagctg 1141 atgagattta tttaatcgtg gaatacgggt ttgctaaaag attattaaat acaaaacgct 1201 cattcgcatt atttttaatg gcagaagttg atatttctat attaagtatg gttccaagag 1261 aatattttca tcctaaacct aaagtgaata gctcacttat cagattaaat agaaaaaaat 1321 caagaatatc acacaaagat aaacagaagt ataattattt cgttatgaaa tgggttaaca 1381 aagaatacaa gaaaatattt acaaaaaatc aatttaacaa ttccttaaaa catgcaggaa 1441 ttgacgattt aaacaatatt agctttgaac aattcttatc tcttttcaat agctataaat 1501 tatttaataa gtaagttaag ggatgcataa actgcatcct taacttgttt ttcgtgtacc 1561 tattttttgt gaatcgatta tgtcttttgc gcattcactt cttttctata taaatatgag 1621 cgacagataa gcgtcggaaa agcagcaaaa agtttccttt ttgctgttgg agcatggggg 1681 ttcagggggt gcagtatctg 
acgtcaatgc cgagcgaaag cgagccgaag ggtagcattt 1741 acgttagata accccctgat atgctccgac gctttatata gaaaagaaga ttcaactagg 1801 taaaatctta atataggttg agatgataag gtttataagg aatttgtttg ttctaatttt 1861 tcactcattt tgttctaatt tcttttaaca aatgttcttt tttttttaga acagttatga 1921 tatagttaga atagtttaaa ataaggagtg agaaaaagat gaaagaaaga tatggaacag 1981 tctataaagg ctctcagagg ctcatagacg aagaaagtgg agaagtcata gaggtagaca 2041 agttataccg taaacaaacg tctggtaact tcgtaaaggc atatatagtg caattaataa 2101 gtatgttaga tatgattggc ggaaaaaaac ttaaaatcgt taactatatc ctagataatg 2161 tccacttaag taacaataca atgatagcta caacaagaga aatagcaaaa gctacaggaa 2221 caagtctaca aacagtaata acaacactta aaatcttaga agaaggaaat attataaaaa 2281 gaaaaactgg agtattaatg ttaaaccctg aactactaat gagaggcgac gaccaaaaac 2341 aaaaatacct cttac // dbfa000755000766000024 014006427353 15130 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data4.fa100644000766000024 14022514006427353 16007 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data/dbfa>AW057442 tgctgcatctaatggcaccactgatatggagcaggagcattatctgcagt tggagttgaaccagtgtcttcagttgatccttggaccacaaagacggcga gaa >AW057443 tgagaaggatgaaccgtcagacaactgtcttttctcctcatcgtctcgat ccttgccgtttttgtccaccatggatttgctgctgctgaagaagagaaga atacagcttcagtcgtcagccctgctccggactctgaagcagcccaacct gctggaaacggaaccgaaacaccaaaagatgaggtgaaggatgaggcacc aaaagaaggtagtgaaactgaagcttcaccagaagccaagacaaaaggat ctatggtattccatgctcttggagccatttccacaagtggttctggccgg cattatgtgaagaagtctgccgaa >AW057444 aggcgatgtctacaagaagccagtgcagttttacacgaatgtcacagtgc cagtgttcgctccagtggtatcacctctcgaggtctacacaaacaccacc aaggcaactgcttttgctccagctcagaacatcaaagtggctgctattct tgaagaagatgctgatgcaattcatgttaagtcaatgagaatcgctggat tcattgcacaatccatcctatttttgtttgtctacacaattgtcactatg gatgttgagaattccgaagaatctgaagcagaggttcccgtcttcaagct ctacagagcccgtgagattcagacatgcccactgccagctcgacaaaatg t >AW057446 tacagtgcagcttctggtgctcttcttcaagttgccttcaagaacttgac tgctcagaatagaatccacatgtatcagattctcttggtctcttcgttcc tcttctcgaaggccctcgtatacggattgttcggatctgctgaacc >AW057447 
atcaatcaactacggatgatatggtgtgtacttatccgtcgagcttgtat ccagccttggccttacggcagagaacaatggtgttatcagcgcagttctt ggcagttgcagaatgctcctttccttggtatccataaccatacttctcgt tacggacatccaccttggcaagaatctgctttcctccagcaacgttaaca actccaacggagatatctcccttgtggttgttgacatgaactggatgcca agctttgtctccaaactgagcagcctccgggaatgggatccatccatagt caaaaccacgaacactgtccggaagatagatgagaagctggatagatcca acattcttgcaatactcgttgttgaaccacgagaagttggcagccacctt tcctccttcattccagacacgacccataacaggttcaccttgcatgtacc agagagcaacatattggtctggattgactcctggaagggtgtccagagtc ttgtcaagagccttgacaagttggcgagtcggccatggatctccctcacg aatgtccatccattcgtctctctcgacacgtggcactggcttntggtgtg gtggagcagctccgcacacagcacatgggcaatgtggtgtcctccgatca gtttggcgtgatgatgtacatctcgttgagctgtgcccgagctgcttatc acctcccattgacgagaccatgcttctcagtctgtgt >AW057448 aattccgaattccttcgtctcaactgtccttctcctctctgttacaattg ctttggtgtctggatatccatcccaactccaaacaacttgtgtcacaaaa gctaaaagttgcaccatgttctttttaaatggagtatattgcaccgagtg cacgtattctggaactcttgaactgaaaattggctcaacatgtacttttt ccatttacgagaaaaaagtggcgagccagccaaatgaaaattcacaaaat gaagtagctcaatgcaaacagtcatcatgctactcaaatcaattttgtac cagttgactgtgcggctgcttttggaaatgaatatat >AW057449 ataacctctcccaacaacacctcaagatgaatgccatctacactgccgtc cttgttgcttcaactctcgcctacactgcaatggcttggattggactcag cattgaagccgccaacgaggatatgatctgaagtggcgccc >AW057450 tcgatcaaatcaattcgaaaaaatcatgccgtcggaaaaggaggatgatg taatgaaaaatgtaactttcgctgaaggcaaaaaatttggtgactggaaa atcggctaaacgatcgatgaaggaggatttgggaaggtttacattgcaac atcaatcagcgatccaaagaaagtggctgctttgaaagccgaatcaaatg aaatcgaaggaggatctgcaatcaaattggaggcaatgatcctaaacaaa ctgaatgccaatggacccgttccccacattccaatcgtccacttatgcgc aaaaacgaagctctactgctacatggtgatgacgttgttggggagaaatc tacgaaaactgaaatccacaaatctcgtagtcaacaatggattctcccgt ggaacgtggagccgaatcggaattcaatgcctgtatgcattgaaatatgt gcatgacaatggatttattcatcgagatgtgaagccacaaaacttcttgc taggaaatgagacggatagtgaaagagcaagaattgttcatatcttggac tttggtcttgcgagacctttcgctgttttcatgcccgagagaataagtgg gatcgacgtagagctcgtggaactgcagaagttcgtggaactctccgttc acgtctccgatgtcatctccgaaggacaaggacgggtgacatgatggtcc 
tgcttatgtcatcatgagctcaacgtggaaagctctccatggcaacccga c >AW057451 aaaaaattaagtagttcacggacaccatctccaacacaccactggccagt tctatcaatcagttttcggttcgtttaaactcgaagaacagtcttgacca acacgagatgtattcacttttgacgctcctcttcgtcctcttcttctctg gaagcactctgctcgttcaatgtggtggaaagaaaaagggagcaacttct gccgaaggaaaatcttcgacgatgggcccggctcctggaggagctcctgc tgctgcttccgctcaaggagaacctgaagagaaggagtaatg >AW057452 ttttctcgtggtgatcccaagcttctagtatatgacgttcaggctctggc tcctcggtcatttgctactgttccacgtgttattgataaaatccataagg cagtcatgaagcaagttcaggataaaccactcaagaaaatgattttgaat gcggcaatagcctataaactataccattataagatgacaggcaaagctac tcgtaaaacatgggtagataagtatgttttgcataaaatccagatgcttc tcggtcctaacatcagacaattgattcttggagctgcaaaatcagacgtt tccgcaatgaggtttgctcgtggggcttttggagttgaggttctggaagg atacggacaaactgaaacatctggaccaacaacacttcaattggttggag atacacgtattggatgtgttggaccaccgatggcttgtgcgatgataaaa ttaattgatgttcctgaacttggatattctgttgttaaaaatggtggaga agtacttgtgaagggtcataacgtcacttcaggatattacaagaatccag aagcaactgcatcatctttcactgaagatggatacatgacaactggagat attggaagattcacttgctgaggatctcttcacattattgatcgacgcan acacgttntcaacatgccacaaggacagttngtggctcagatctcacaga atccctctacactttctcgagtttgttcaacagattacgttcatggcgat actgacaaccgtggcttgtagcaatcgttggtccagatccagag >AW057454 cgtttatactgattgcctttgttcttcagtttggagaaggatcaattgca gttcaagaagttaaggatggcgaaaaagtgcaaattgaacttttcaaagg agccaaggcaatccagagatccgttgacgctggcgaacagattttccatt tcgaaggagaaaacaaaggagtgtttgtggatgctaatggaaaagctatt gactcgtcaaattatgaagagaataacggacatttggtcatcaaaaagct tacaaaggctgttgtttgatcatattctgaatattcaacgaaaattatca aaacgaaaacggatcatggattttctggagttgccgcgccagttctcaaa ctttctcttc >AW057455 agttactcaagatgctgaaagttcttttagcaatcggtctactgtgtttg atagatgtttcagctcagttccgtgctgaatgtgagcatccgcttcattt tggagttcagcaatgcaccaacacttcagttgtcagatatcacttcgaca tggaaagcaagaaatgccttgctttcaaatacactggatgtggaggaaac gagaacaatttcaaggattactcagcttgctcaaacttctgtattccaat ggactatttcacatgcccaggtggcagtgatagtgtcgctggaaaggaac gaaagagccactgtggaggaatggaacaattgaagtgcgatggcccgaat actttctgcttgaatggcccattcactggaatctgttgcgacacgagaat 
cagagacaaaatcgatgccgactacgccaaggagtgtggaccaggaaagc tgaagcatcaaattgatattggaggtgtcaagatcccaatgttcggaaaa acttgcgattctacattctgcccagcctatacaaagtgtcatcaaggaaa ttatcntgcttactgttgtgcttag >AW057456 tttgagatgtctaaatacgcaattctctgccttgttctggtcggcaccgt tgcctctttggactttatcggtcgtacacaatctgctgctataaagggaa gattagtatgcgagggaaagccagcttcaggagttaaagtcaagttgatg gagtccgataacagttttggacctggattccttgacagcgatgataagat ggcatcaggaaaggctgactcgcatggagaattcaatttgagtggatcta ccaaggaaatcaccggaatcgagccctatttggtagtggttcatgattgc aaggacggaattacaccatgccaacgcgtgttccgtgtcaacgttccaaa atcgtacaccaacagcggaagctctgccaagaaaacctatgatgctggag tcatcgagcttgccggaaagtatccaggagagaccagaagttgcctcaac >AW057457 tttctcggaacaactccaagcgaaaaaaattgttgacaagtcggcgtaca tgggtgctggtggctatggatccggatacatgggatccaacgcctcatcg tcgggatatgcccgcgaagattatgcacaaggaggaaatggaggcggaca acaacaaaaccagggaaacggaggaaacaccaacccaggaggacaggtct tcaaggcccgtaccgatcaatcgtgctaccttgggccataagtagctgct cgaataatgtgaagactcagccag >AW057459 aattccttggaaggttctagctatgaacgtcaccagtgtcacttcagagg atggtgttaaagaattcgaaaagattgttgtggaacctgaagatatcgaa tatgttgagattccggccgatgccaaaaacgttgacttgacgcgtcaccg tatcaaagaaatcggtgattattcgtggctcactcacgtcgaacacttct cgtttcgttggaatctgatcaaaaagattgaaaatctggattgtttgaca acgttgactcatctcgagttttacgataatcaaattacaaaagttgaaaa cttggatagcctcgtcaatttggagtcactcgacctgtcattcaatcgta tcaccaaaattgaaaatttggagaagttgacaaaactgaagactctcttt tttgttcataacaaaatcactaaaatcgagggtttggatacgttgactga gctggaatatctcgaattgggtgacaatagaattgcgaaaatcgagaatc tcgacaacaatctgaaactcgatagattgttccttggcgctaatcagatt cgtcttattgaaaatgttgatcatttgaagaagctcacagttctcagtct ttcagccaatgcgattactgtagttgataacatttcgggacttcacaact tgaaagagatttatctggctcanaatggcatcaagtacgtttgtggaatc gatgagcatcttnctcttgaaattctggatttcaatcagaatcgtcttga gaggtcgagatatccattcattgagacactacagacttttggcaagagga aatagnggattactgagcattatgg >AW057460 gcttcttgacatgataaccaagacggaagatcacttcgctcagagaattg tgtccagtgtcgcggtaaaccgaagcttcacgtgaaaacttcagaaattc attttcgtgaagtgggacaaccattgcaacttcgccaatcaatgcttgaa 
cttcatttgccgtttcgttgaaagcatagatggctttacgatacgtttcg ccttttgctttcacctctttcgtggtttttgaagacttcacatggtgacg agaatcatcgattgcatggaccaagttattcaaattttctacgttcgaat tcaatatctcactttcctctccagtgtacaaacgcatctttntaatcaga tgagaacgagttcttttctgcacctggatctccttttgagcaattcttcc acaggcgctactgtatttttccagcatcttagaatgttgcccatcaaact gatattttctcacataattcaaccatcctccaagaacttcccatggattc tcgccgggagcacacaaaactgctttcttgccattgccttggcatgacat gacttgaagtcgatacattcacattcgtgatta >AW057461 tcatacacatcctcctatggcttttctttaaattcaccggctactttctt tcctccgaagaaaatggcaatcgcgggcaaggcggtgatggcgagaacga aggaagcacttatcacatttg >AW057462 ttttttcactaacaaactttttattctttacatttctacaatacgagaac gcatttgtggatgctatttaaatgttctggctagatttctcgtgtggaaa ctgaataaacttaaagcttagacaacatctgcttcgcatcatcggcaaac tcttgttcagcgtcggaatacggttcgatgtcacacacttcggtaaggta cttgcgagcgttgttcttgtctttaattgctacgtagcatttggacacat aaaatgtgtnctccatccattttggattgaccttataagcagcaaggaaa tcctcgttggcctcttcataggaatgtgatggtggttgctgatagaacgt ggcagcaagcttcttctcaagccatgtcagagatgcaaccgagtacttgt aacggccacgcaaatggaggagagccgtatcctttggctccttagcaaga gctttgtcgagcaattccttgaacttcttactgcattccaacttttcttt cgttgccatatattcggttgcttgtccagtcaacacagcattccacttca gcgcctcgaaatggtttggatccttctggacagcttcttcagcatacttg agtccttcngtgacacttgccttacgttgagctnttggaacacatgcaga cttctcatgaattacttgagcaagtctncacaagacttccaccgaacgat cacatgtcgagacgcgctttgagcaaatcatatccttgatcgcgntcttg ggtccgaagtcttgtcattcatcaaactcat >AW057463 ttttactcaaaactatctatccaagttaatcagtagtgttagttctagtt aagttattaaggcgcacggtctgtctccttgcttcttctctttgtatccc ctttctcctttttcaaaacttcactttcatcaataattggttctttagaa tacagttttccaatttccacgtactctcttctcttccgatccttgtcaaa ctttttcttcgggagctcatcttctggaactactttcacatttttcgatg gaaccaaacgggaacgagttggcttttccaccaaaagattagcgtactcc gaactgtatttccccttctttttcttttcaagaggaacattttctcgttg agtatcatcgtcctccaaactttgttgagtagtcatggactgggtccgag agaattcaacggtaggcatggaacctttgctcttgtcgtcgtttgccttt ggtgcctttcccttttgaa >AW057464 tacaaaccctgcatgcctctgcaagattcttgctgatgaactggtactcc acacgatcctggtaagctgttccgtgtaatgtatccataactttactgag 
ataatacgtgtgtctgcgccccgcgaacatcgaggaccgctaacttctac ggcatttctctcttctccttgagcacctcatgcttgtcagagagctcacg ccaaggtataccgctagggactgattagcatctacacgcacttagtacca gctctcatagacatacttcggtcccatatcgctgttacgaccgtaggcga tcggatggaagcctaccgctctcttctatgcagctggttctgctagtgtc tccacctcattcttcgttttcacgcacttg >AW057465 tcatgtctcctgcagcaagatcctcgtccactgaatccttcttctctcct ttggttgcttctttctttcctccacagaagattgccacggccggtaagac ggtaacagcaagggca >AW057466 cgcggaccgaccgactatcagacgggtttgctaccactgtatgttcatga agattacgaatatgggattctgcttgaataagtgtttggtcggctttttg aacaattgcattgaatgcatgtgtggcgcctaagtattgtgccat >AW057467 gtggaaaaatccccttcatttctaaaaccgttacaatagtatctgtttaa cacatcaagtataggattcattgcccgacaagctccagtttctaataatg ttcgatgcacgaggccaagccattgagcaccggcagctgatactccgact ccattgacatctggttgaccggcatgaaggctccgtagctgttctgagca actggggcgatgcgctcgacaacaattggagcagaggcttgaactggagt ttgagcattcttaataagccctctgaatcttgaagtaacagctggcgagg ctggaacttgagtgatgtcctcac >AW057469 tttgtacgcaccggatgaagcacatagaatactttactgttcgccacatg cgtgatacagtatcacgcagtaccgacggatctgcgtactgttcgctatg cgccga >AW057470 ttttggtaaaaactaaaagatttattactgaaatccatgggggtacattt atatcaggccgcttgcacctctcattgcctcgaagtaggcgatccaccac atggagttgcttcaagacacacattcacctgactcttcttgttcttctta cgagactgtgaacggcaactcgacgatccagcagactttgcacttttctt cgaagaacgacttgaacgacgggatgctcctctaatagatatcttctctt tatgctcctcttcgccaccgcaaatgtagttggacaaccagcatccacct ttgttagtgttgcgacgtttgtattcctcgacaaatcccatcagatttcc aattggagcagcggtcagggcgtctttctcaatgttaactgcagtgatcg tgtgcagctcttcaattccaacatggtgatccgattgtttctccacacct ggtacttcctggcattgtggctacaacatgtgaacgctattactgacatg tggtcttgactggaagtgaaaatcgcttctgatgctgaaacctgctgaga ctttgatcgat >AW057471 taacctgctccttcttcttgaaaaccaacgtagcacgacgagtacggtcc ggagagccaaacagtgagcgcttcttcttcaaatggtccggaatcttttt gcccatcagctcaatcgtgtactttccacttggcggcttctttcctccaa attcttgaacctcataaaggtacccaatgacttgaccaatagtgtacgga tcgaaaatgttctcctcgctttttgtctgcaccggatgaagcacatagaa tcctttactgtttgccacaatcttgctccagtctcccgtaattccgtcgg atacggtgtactggtcgctttgagccgaaagtcgttcatttgcgttgggg 
aggatagggtccgcatgcatatattttccaccttttccaggcttgtatgg atagttcttgggctg >AW057472 ttcactccggaaatgatttattggataaagggtggctagtgttatttctt tgagcttttactcctcttcaaagtgcgatgatcactgattgtcgatccat tatccttttctgcagttgccttttccattattttagtatccagaaacatt ccctgacaatcatcctctctttttgtctcttcaaatgattttacatctcc gggtcccagaataagtcccatcagaataccacgatttctacggcggaaat ttccaaaagtgtcttctggtccatcttcaaatgatttgtgctttggatac gagaagaaaacacggaaccagttgattccattctttggcttttcggcttc agatttccattcattgaattttgcagtccatgtttttggaagatattggg gtgacaccaagaactctttcatcattgctgctttcagcttatccctcttc ttcataacagcctttgtctcctcatcaaccgcacaattattcttttgcaa ataatcaataatcttatccaatgcagaatccangacagccaatacatttg aatccagcttctgctgtcctgagaagaaacttgcangtacttcgttctcc tctggtgtcagacatatctcgattgacgcgtgctatttgctactataaga gacgtggcgcaanctcagatctcctatcgtcacatctgacgcatctcctc aactcatcatcatccagctatcaaatgctcagcttgatctcagccagtgc gtagtggaaagttccttgatacagaactgatggatcgagtagacacatgt catgggaccatctctcgtttattttatcgtctgtttcttggc >AW057473 aagtattgaggatcgcttgcttggcttgtgcaactgcgttcatgtcgagg atctcttaccatggaacacgtccaacagccagcttaacttgcatgtatag ccagacctccgcgtgattttttcttgcaagctgctgactcttatggcaag cgattggcgcatagcgaacaggtccacaggatccggcggcagctattggt ttgcggatcactccatagctgcgcgtgaacttgcgaccatggcgagagcg cgatataaacttacgaagtcattca >AW057474 gatttggtgttaaccattgccttttcaaggagctcgtagatggtcgtgta ctccggagcatcgaagaactttcctttgtcaagaatcgggaaaacttcag tgaattcacgcggacaacctccaaacagacacctaagacgagtcgtttta cattccttcttgaacactcccacatcatcactttcagtcagattacgcca gggaagacgtccacatgtcatttcaacgaccatgtagagccaactttcaa tgtcatccttgcgacactgctcacgttgaatatggcaagccaatggagca tacttcaccgttccacggaatccagcacgagcacgtggattgcgaagagt tccgtcttcacgggcaaacttgcgtgccattccgaaatc >AW057475 ttcataattatttattaaacatttaataagagctacaatgtttcccgtat ccaagaaacttctcgaaccagttgtagtcttcgcctttaacagtatccat ttcgaaaactccaatgtcaaaaatatgaacatccggtcgacatggattga cataccattccggaatgggaatctttacaagattcttggtacatctctcc ccattcatcacttctgttccgcaaaaattattgatccatagatacggttt gaatccaaaattcagcaatggaatgtcgactagagtctgaaaagaaatta 
tgtaatttctgtgttttcccacgctgtaactattcgaaattaatgggaat ttgtagaaataaaatttggtaccacatcattcccgttagtcgccatctcg aatctcccatcgactcctgttttttgagcggaccacatttggtcgagagg tttgggctcccaaatctgaaccatctccgcaacatatggtcttnctctgc acattagctggccttttacgtgaacagaagcacacgctgcataacagtag tttggagccagaagacacagaagaattagaaagagtgtcttcat >AW057476 ctcatccttctcccaatcgtactttgaactcggtgttatcttctcgttct ccatcacttgccagaaacatttgaagatcatgtgataatccggtctttga tagtaatcgagagaggccaaatgaggcatcactttatccatacaggctgg catatttgacatgacaaccttcgccggcaagttcagcttcatttgctcca cacgttcacgttgagaatcggtttgccatggaagagcttttccaccgttg agctcaatgatgacatatagcagggaccatacatcgtcaacccg >AW057478 aattttaaaattaaaaagtcgtttttttacttgaaaagcaacaattgaag aacaacaatatcaacaaacaaacaacttggagaataaattatcatgactt taagaaaagtcttttcggaggaaggtcatctcccattccttcagtttcac gtactccggctccgtaggtgataatcgcgataaatccctgattatcaggt gctgttctaataaaatctggtgtagttttcaggcgatcgttcatctcaac aagagttccacgaacacctgctctcacgacatgctctgaaccatctttgc atttgaatgtacacaattttgagtcgggttgtagaataagagctcctttt tttccttttccagaaacttgatttgttgaacgatctgcgccgtgcttctt cgtcgttccaaaatccactt >AW057479 tttctccttacagaaacacaccatactcttctttcaccttattcatataa tcgggctcagtttcaacagacaatcacttcttccatataccaactccatc cattccgctagtgacgacaatgagcaaggtcttgttgccacccaatgaag catcagaataatcatctctagaagacttgcattgtgtgatcgtgatgaaa ccagcggctgacgcaggtggtgcatgctcgaattgatatgacgatacatc ggtgaagtccttcagaccaatctcagaaggcattccttcgattttctcaa gaacatgaaactcttgtaaagtgactccatgctcgcgcgctgcacgaaca gtgtctaacaacaccgagtgagcttgaatgatagcagtctgcctctatca ccagtgcagttcatgcgacttcactgcttg >AW057480 tcttcagatctcgtgatggattgaactggcattggatgaacgcaagaggt agctttccacagatttggcagtttttccaaaatcaggattctgatcgaaa tccaccgggattggttggaatttgtccaactgatgaaggcattcaattga atcaatgatcttcgcattcgaaaaattgctgcactccaatccaatcactt tgctatccggaatcccgagctcggcttgaaggcgctctacgtaggcggtg gtgtcgaatgttggatgagctgattttgcagtgtgctctgacgccaaatc ctgagcaacgttcttgagaacatcagccattgtgacgttggtctccgagg caagggcagtgtactccgagaagcttgcaacattcattgtaacatttgct tctgatggtggcattgagtagacagagtagtcaagaacgttcatcaagtc 
tttgcaccgctcgaacttctcattcatggtcactggngtctcggattttg gagcagagtaaacggacaaattgaattgattgatatcttgacatggctgg aacccttcgctagctgacacatttgtttctgattttgggatggntagatc gaagaatcagcttgatcattctgcgtctcattgacagcgatatttctttt tgcagcacctcattgatggagcaattgtctccgatgctgagcgagtacac gagacattccatctgtccatt >AW057482 tctagtccaccactgattttctctgtccgcctgcgcctcgtattctctcc tctctctggcattcgttggctccagcagaactacagtttcatcgctgaaa actgctcttggggtgcagaagatttcagagacaaagcttggctcatcgca gttggctttgtgttgttgctcggcgagcacttgctccttaggcttcagga cgagctccttctcgagtttctggcgaagagaccggacggagacatgggga agtggtatttgctcgatggcagtgggcttcacctcgtcgacttcatcatc ctcatcatagtcataagatgctgacgatgtagacgacatctcttcaaaga ttccagaatcctccaccgccgtcatcatcgcgacattaaggcttccatca tcgcggngctctgtagacggcaagcactttccggcggctctcataatcgc gcacgtccacttgtgaacatcattatggccgacaaagaggtgaacattgc cacgttgcatgcggattttgatgtgacagcggaaatgacgctgctttttc gacgatcagagtgcacagacattcctggcgatcgcaatcatcaccaatcg tgtcgtcatctcgtaatgacatcanacgcgtgacgtacgtatcatcgctc cttggctcgagat >AW057483 tttacctccaaactttattaaataaaccgaatgaattacgaaataacaca ttcatatttctctttcaaatacttcttatgcatgtccttgattggtgcaa atccatcatcacgtgccgattgattcttcccaatttccttcctcattcta tccatttccattttatcaatctcttcatatttcttttctccttcaagttt cacaccaaactcttctttcaccttcttcatctcatcggtttcagtttcaa ccgacaatttcttcttccagaatccaacttcttccattccgtcagtgatg acgatgagcaagttcttgttgggaccccatgaagcatcggaatcatcatc tctagaagacttgtagtgtgtgaacgtgataaaaccagcggcagacgcaa gttgcgcatgctcgaaattatcgtacgaaacattggtcaaatccttcaga ccaatctcaaaagtcatttcttcgttntttccaagaacatgaaactcttg agaagtgactccatgcttacgtcttgcaccaacaatttcgaagaaaaccg aatgaactcgaatgatcacatcctgtttctcatcagcagtgcattttcaa gtgtactccaatggcttgtcgctgcgaatctgaatttcggtgaagagatt tcgaactcaaactgtttgcacgtgaaaccaggtttcttggcagtttgttg ctttccttctgtcaaacacat >AW057485 ccgtttcgacgattcatcactagaatttgtgcgggatcttgcaaaagact tcgaccgatccatcgtcgcggtgaggttcaccagtcactggatcatacaa gaatccgcccatctcaagtccttcaaatccattattatcagttggcatga cacatgtgaacaagtcatcatcaacatcatatgtcccaatgcattttctg tagttggatccagccttgtatgtcatgagcacacgttctccgtatcgaac 
cacgtaattgccggcgtcaggcgtcttgaatgtgttttccggtcccgtgc aaacattatcaagagtttgagatttcttgaatacggttttttccaagttg acaaagtcggcattcaacaggaaatactccatattgcacttcttcacgac attgattcctttgaacggcatacaaaagtttgacccgcttttttcccaag ttggaatgtacgtttcctcgattccggaaaatgtgaagacgccgtcgaga aggataacgccaatcattgcgtatntataactatccctggtgcctgtacg canattgtccaccanaatcatagtgttgtggaaatcaccttcggccgncg gaacaaccatcggttttttcaattcgaagcccactttgtacatnncggtg ctgtgctttggctcntggtncgcgagtctgctgcaacgagactggcacag ntggagcatcataagactgcaagaaatgtgacgtc >AW057486 actagtaaagttcataatcctcaatactcggcacatctgggctttctcgg tgcgtaccatgtcttgtttctttccatccaggcctcgtgcttagtattaa cggagtccgtgtcctgccacgaatagaactcctttggcggaatgttttct ttccagttctgaatgacgcgttcgagataaaacgcgactacattggcgtt cgtattcgtcgcagacagacagtactgtagaatctttgccatcctctcat tagtatctgggtccggaaacgtgcacgtggccagatagttgtacttcatt gttgccgccactatgtgatcctcctcatcgtgccattcgaaggtttccag cgtgataaaccgtgcacaaatgtaaaaccacgactcgtaatcgtcggcag gtcctcgtgccaatcggaaatgctgacgcctcgacaaatacgtgttgatg acgttcttcgacagtggctctcgctccgatttgttacgggcatgtcggga ctgttgaacgtaaagaggctgcagcccattgtgcttcgtgtacgaattta cgactgatgtttgaccgtttcgagttgagaagtattgacacacgactgct tgtgttctgagagagagacgagnaggacgatgcgagatgtgagc >AW057488 tttttaacaatagcatttcattccataaatatttaaggggtggttattat atcagttctcttaccagtcttctccatcttcgaatgcattatcacgatct gcagtggatgccgttccatttagcttggcagttatcggatccgggatatc agttgaattggcaacaatagcatcagtgacagcaacagtgacaacagatt gggatgaactttcagcgagcgtagaagtttgaagcg >AW057490 tatttcaatgagtacaatttttcgaaagaaaaacagattgaggataaaac ttgagtgatgagataaccgtaatatggagaattatatcagtgtcaagaag gcacattgttcagtttcatatttacagatgtttgggattaaatgaagatt cggtatgcatcgacgatcagaacaatgaacgagtgagttgaaagaccttt tgttgaatgatatcagttcttggagaagaccatctttcaattgcttcaaa ttcacgacgtcttttcaccagttgtaagaaggcggtcaacgacccgtcaa cgtttgcgtagtgtgtgcggcaggagaagagacaaattccatcatacgag agtccttggtgcagaatttcgtgaaccaaatcaaattctgcttcttgccg gtttagaagatctggtgccaaaatgatgtcgaatttttttccgccaagga acttcattgcctcttcaattgtaccacatgaaaccttggtcttgatcatt gnaatattatttctcttaagtgttggacgacagtaaagctccaaactagt 
cttatncattgtgtgcattgcatttcttctgcttcattttcgaagcatat actgaatgggagtcagttacgaatccgattncaaaacagatttncattcg anaagtcagttcatctcacattgacatatatt >AW057491 ttttttagaataactttttatttcgaatgtaatctcagagcaagctttta gaatctttttggcagtccgtacgagtcaacagtggattggtaggaagaac catcagaagacgacgtatccgattccgaggatttcgcagagttcatctca cggtccgataggtcagcaggagtcttgggaagattcgatttgctgcatgc ttgcaaatcttccccgtcttgtttgaactttctggcacggcagatctgct tgttgatgaccttattgtagttggccttttccttccggaagtttccacgt ggttccgaccagcaatagaagttcgaacatccataggatccatccggatt ctttacaatgccgaagcaggccttacacaacatgtgatcacactttccca atgcaatcaaatgattggttggcaacttgtttccacatggaccatcacac ttcacaaggacaacgcccgcttgctcggtgcactgaaatgctccgacagt ctgcacacacaccgtgtacgtgtattctttacgtcgagtcaccggattca ccaccgacgtcagattacaaccgagaacttcgcacgtggcanaaatgagg gtccagaagatttgcggatcat >AW057492 caccatacaaactttctgccattgcattcaaggccttcattatatcacag cgagcggccttatcactttgggatgcaaagattccatagcttgccatcct gaacatacgcttaatattcgattccatactaagtggaactggatattaat acacccaacgacaaacacggcacttggctttcaagtattgtgtcatgaag aggctgagagacggataggcaccacttttacgcacatccttctgatcatg atcgctgtgagcttattgtgactacttctgggcagtgtcttgactctcga cgcgaggagaagaaacatccactgtcggatcttgcgtaagatgtgtacaa tcaccagatggttgaggaagggtcgctggatcatcgaccatttggtcagc tgtaccagttgaggtataggttgttggcgctgtggaagacaccgatctag tgtgatacagcgcctcaaaccgaggagacgg >AW057493 gattaagttcagaatgatttggaaccgataattgaagaggaacatcggtc cagtagaatggcttcaccaagtaaagactcttagtgagtggcaatattgg ggaatccataatttgtgcctctcctttcttgagcgaaataaagttcattt ggcaggtgggcttcttgatgcattgcaaaatatcatcgacagtctccaca tgctcatttcgtgtcaagtcaaccatcacattatcagaaatcacgaccac ttgtttttgctcaatttttgcaatatttttcttgaaaaatgcttgaatgc gcaagttgtagtcttcgatgttctctgcacgaataaatccttgcgatgga agatattggatattgattggataacccatattgaaaaacgcttttggaga gagcaacatttcttgaaaattagtcggcaacatttgacgatagctgagaa gagctggatcaatgttgaaggtggtgtaggaaagaccaagtcctttaatc aagaaagctgagtttgaactgatntcatgtcangagagcaaaatacttca natggacgaattnncgcattcaacagaccacgcccataagttcagacaca atttgnccacgtcagtgagccggagatcgaattttgtagtgtaaagtgtc 
tanngaactatttnggaagcatatcatatcatatggatgatattcattgg accacgacgatacagacacgaacccattttggaagatcaa >AW057494 ctattagcaccgaagagatctagtccaccactgattttctctgtccgcct gcgcctcgtattctctcctctctctggcattcgttggctccagcagaact acagtttcatcgctgaaaactgctcttggggtgcagaagatttcagagac aaagcttggctcatcgcagttggctttgtgttgttgctcggcgagcactt gctccttaggcttcaggacgagctctttctcgagtttctggcgaagagac cggacggagacatggggaagtggtatttgctcgatggcagtgggcttcac ctcgtcgacttcatcatcctcatcatagtcataagatgctgacgatgtag acgacatctcttcaaagattccagaatcctccaccgccgtcatcatcgcg acattaaggctgtcatcatcgcggcgctctgtagacggcaagcactctcc ggcggctctcataatcgcgcacgtccacttgtgaacatcattatggccga caaagaggtgaacattgccacgttgcatgcggatcctgatgtgacagcng aaatgacgctgcttttcgaacgatcagcagtgcacagaacatttctggcc gatcgcaaatcaatcaccaatccgtggtcgtcatgctcgtaaatgatcat cataccgcgtgacgtcacggtatcaatcgctcttttgactcgcagatatg atcgagcaggaaggctgctctggaatcggattgagttgagcacctgacca cgtnggtacccatnccttggcattcactcgggacgctgagccgttatgca ggagtgatgtcactgatcactcgctcgcggatgccgagtcacggtgaacg tatatgttgtagacgcaa >AW057495 atcttgctgaattcttatcatcacttgaccttgcgaagagcttttccttc atgatcatgtccgatctcgtgagtttgagtttgctcgatgatagaaacac ggcgagcatttggaccggtagacattgcactcttcgctggaggcttcagt cctggaagagcttgacgatattgcgaagcaatgcctttcacatcttcgtt catttggacctgatggtggttgcatgagattggtagccaacgcctgtttc gcccattgcttggcatcgatcgaatcagggcgctcgacgacaaaagtgac ttttcctttctcttggatattcttgacgagaagatctctggcaacatcct tgtcactgactggaattccatcaacatcacacaaatgatctccaagaaca agacacttctcggctagtgatccaggatcaacacgtgaaacaagcacacg gttctggaaatgcttgattcccaaaccaagttttggtccattntgaaccc agancagggttgccaactcataaacgtatccctcacggcgttgaatgatc tttgcacgatcctcangaatatgaacacgagcttctaactcttcagcctt tttttcgtcacggttcaccgtgatctttgcacatggagcagcatagcgaa gcgcacggaaaaaagtcgtaacatcccttgcattttggtccgtcactttt tttacntggtcaccatatctcatttttccttacgaatggttccagcctga ttctanggataccgagctgtcattnngagttgcacaaagcgttcccctcc ctcatgcnataccaatctcgacatcttcttggacccat >AW057496 tttattcactcgatttggtttccccatgtgccaacgtctcgacctttgtc ttcggattgcacaatgccttgaacaatggaatcatgaagcagggatcttc 
attctttgcttcgattggagcttttttgctggatttcagagaatccggtt ttgcaactcttgggaggcatttcagcacggatttcgattcgaattgagca gcgcaggccgtgtcattttccagaaaagcacgaagaagtggtttactgtt gttcattttgattgcatatcccgtcgactcaatcattgtagatctcccgg gcaggtcgatgacaatcgttttcgccgaatactcggcatcgtcggcgaga aaattgagtgtctggacgagatgatgtggcttatacgacacgtgtccgtc aatccagtgacgaatgttttttcgaatgtacgggtaaatggccctagcgt agatatcgaccggcttgtttttcagcagagcagctttggatggtgtttca aggattgcgacttcagtatacttattggatagaaccaagcattcgtaacg gtcaagttttgggctttcgttgggatttacaatcttcactgctacgtcat anatcgaagtgtttcggagcacgattcgatattcagatttcacattttca cgaactaccacagaaagagtcat >AW057497 tatgtggacaacaagactgaggaagcatggttctcgttcaatgggaaggt gattaagcagctcgggccacagctcaacgagatgtacatcatcacgcgca actgcatcggaggaccaccacattgcccatgtgctgtgtgcggagctgct ccaccaccaccaaagccagtgccacgtgtcgagagagacgaatggatgga cattcgtgagggagatccatggccgactcgccaacttgtcaaggctcttg acaagactctggacacccttccaggagtcaatccagaccaatatgttgct ctctggtacatgcaaggtgaacctgttatgggtcgtgtctggaatgaagg aggaaaggtggctgccaacttctcgtggttcaacaacgagtattgcaaga atgttggatctatccagcttctcatctatcttccggacagtgttcgtggt tttgactatggatggatcccattcccggaggctgctcagtttggagacaa agcttggcatccagttcatgtcaacaaaccacagggagatatctncgttg gagttgntaacgttgctggaggaaagcagattcttgccaaggtggatgtn ncgtacgagaagtatgttatggataccanngaaaggagcattctgcnnac tgccagaactgcgctgantaacacatttgtctcttgcgtnagggcaaggc tggataccagctcgac >AW057498 acagtgataaacatgaaattcctaaattcctaaacagtaccccttcatat cttggccttttatcttcttctgtgagttctacaaattgtcgcagcacatc atctgcgagaatcggaatgttgcacaattgttcataggttttcccattga tgttgtgtttggttagcgcatatgaagcaagatactctggatctgggaca acgattgctacaagccacggtttttccatatcgccatgaacgtaaatctg ttggacaaaactcgaagaagtgtagagggattctgtgagatctggagcca caaactttccttgtggcatgttgaaaacgtttttgcgtcgatcaataatt tgaagagatccttcagcagtgaatcttccaatatctccagttttcatgta tccatcttcagtgaaagatgatgcagttgcttctggattcttgtaatatc ctgaagtgacgttatgacccttcacaagtacttctccaccatttttaaca acagaatatccaagttcaggaacatcaattaattttatcatcgcacaaag ccatcgtggtccaacacatnncatacgtgtatctccaaccaattgaagtg ttgtggtccagatgtttcagtttgtccgtattcttncagaaaacctcact 
ncaatagccccacgagcanacctcattgcngaaacgtctgattttgcagc ttccagaatcaattgtctgagggtaggaccgagaagcatctggatttatg caaacatacctatctaccatgtttacgatagcttgcctgcatcttata >AW057500 gttagatactttattgtttaaaaaatcgagtttttttaaaattcaaatga ccgtaatttcagaaggcactatgcaaaaaatatatcccatttaatttaaa aacactaagcacaacagtaagcaaaataatttccttgatgacactttgta taggctgggcagaatgtagaatcgcaagtttttccgaacattgggatctt gacacctccaatatcaatttgatgcttcagctttcctggtccacactcct tggcgtagtcggcatcgattttgtctctgattctcgtgtcgcaacagatt ccagtgaatgggccattcaagcagaaagtattcgggccatcgcacttcaa ttgttccattcctccacagtggctctttccgttctttccagcgacactat cactgccacctgggcatgtgaaatagtccattggaatacagaagtttgag caagctgagtaatccttgaaattgttctcgtttcctccacatccagtgta tttgaaagcaaggcatttcttgctttccatgtcgaagtgatatctgacaa ctgaagtgttggtgcattgctgaactccaaaatgaagcggatgctcacat tcagcacggaactgagctgaaacatctatcaaacacagtagaccgattgc taaaagaactttcagcat >AW057502 tcaagtattagacggactcagtgggatgatcatcagacggattatcaagt acaacacgcgaaagctctgtctcattcttctcttcttttggtggacggag catcgaatgctcacggcactgagttccatcaagacgagtaatttgttgca agagatgaataacctttccacgataagtatccactgatgcaattggattg aaatcgaggtaaacactattgagttgtggaagttcaaccaattcatccat aatgctccagttatccaacttatttcctcttgcccaaaagtctgtaagtg tcttcaattgatggatattctcgaccttctcaagacgattctgattgaaa tccagaatttcaagaggaagatgctcatcgattccacaaacgtacttgat gccattttgagccagataaatctctttcaagttgtgaagtcccgaaatgt tatcaactacagtaatcgcattggctggaagactgagaactgtgagcttc ttcanatgatcaacattntcaataagacgaatctgattagcgccaaggaa caatctatcgagtntcagaatggttgcgagaatctcgattttcgcaatnc tattgtcacccaaatcgagatatttcagctcagtcaacgatccaaaccct ncgatttagtgaatttgtatgaacaaaaaagaagtctcagtttgtcaact ctccaaattttcaatttgtgatacgat >AW057503 taactttatacttgtatttcacaacttttcccaatttgcaccaaatcgga agtgactgactgtcgaccatcggatgaaactctggagcctgtcctttgaa gtgagagatggatacgctcatcatattgtcacacgtaggtttgcccttga tcttattttcttcgaccacaaagtttgccgttttcttgccacctagcaca acctcatagtgaggattacttggggcttggcaataccggttggttctgta tctctcaacactgttcgagactagtggagaacgcataagtgaatacgcct ctggttcaagtttagcgagctgttccatgctatccgggtctgggaagaat 
tgcgactccttcgagtacatgtaaggcttcaacactttcttgcaaacatc cgattggttactacttctggtgtatgaggaagccgatctaacaataatcc catccttgaccggcactgttggatagtagatntccgaatatctacggccc atatagttaatctcggcgggggtgattgcaggcttgtattngcagagagc ttngcactncacttttncattaaatctgatcatcgcattttgtgggactg agaagtcaggtgtacatcgtagaaggacagagcangatatcaccatcnac tttgtcgtcttgatacatactcatcccacgttcngggctcgaatncagat ttgtggagtcttggagtagcttctccagcgtcagatgatctgtatgatgc cggcattctcctgaaactggcccac >AW057504 tttaacctccacgttttatttctgaatcagaattaaggcatgtatagttg agcgtgagttgggctgtagactttggaaaaatcgaaattttcaggcacac aatcagtgtagaccaggttacagacgtgcgaacgagttggttagtgtgtc aggaatggggtgagccagcaaaaactatccttattcagctgcccggatcg aacttcaaaactgacgaccgtcttctcggatgccttaatatcttttaaaa tctccgaaaacattgcctgagcgacacgtgtattttcagcataaataggc cctaaattacaatcttttctgttctcatagatcacagcttgcccgattcc aatcacatttccctcctcatcatacgccactttcccgaatccatcccgat cgtacatatgagagatgatgtacttttcgcggtgatacggatgaattgtc tgatcataatttatgatatctctcttcggaacttctcgagcatttttcac tgtaattgcagacaaatgcaacgtctcagggagcacgacgtcgtgtgcac tgtagaatgtcttgtagccaacgtcgccggctttgtgtccgagatccaag aattctttccgtttggacagctcgtgccagatatctgtgggccgttgggt tatctgtgagacaatattattatcttcgctgttcatttcactcgacacat atcattctgaaagtttgccgttccgggagatctgaattcaggatcaatcc atgcgcaaactcatgacattatctgatgattgattganggattcatgggg acgaatgtgcacaaatgtgctggtggatgactctcggcgtccttttaagc tatcactt >CEES071R ggcacgagtcaaccttcaccacaagcgtccgtgcaaccttctcatcattc cattttgatgaaaatcagcgcaaaaaggaatttggaaaagaagaagctgt gaagaagattcaaaagaaagcagcgaaggttgctcgtgntgattcaatgt tcagttctgaagaatttttccctgacattatcaagtgcatgtnncaccgt caaacgnagaatcgagcttcacacgaatgctttgacatacaatcacaagt taatgnagagatgcgaacaattctcatcgnttggttcagcgatgtcgtga aagagtacaattt >CEESA12F gaaagcaattagaagaagtactgcagagtgtgatcaagtcactttagaca atttaacgagtgtagaagaaaatcaaccaaactcaacantgtcgcctctt gcaaagataattgctaacattgaaaaaaggaatgaaaaagtgaaaatgac taagaaattcaaaaagtttggagttccgcttccaatgtnctcatcgaacc ttgaaagtggatataagcaatgtcgaatggacatcacatgtncttctggg tacagttgtgagaacantagtaaaacgagatgctgtatggaagcaaatca 
ttcaccggaaattgagcgaaaaactgaaganttcaagncatgcccgtttc aacttcaaatggcatacttctgccag >CEESA13F aaagtgatttatagatttacgaagcggactttgtttgggaattgtaaagt taaatacaaaggaaatagtgggaagaaattttnttttcgggacgaaaatt gaaatttaaaaaaagggttctcggggaatcacatgaggntagagactgga aaagagaaaatacaataaattaaaatcggttgaaaatgaacattggacag gagaaacaaacggaacggggaatcgttttttttatagaggcgacaaaaaa gagcaaaagcagcatcagaagcgtcaaatcagngtactcaaaaganggga atggatattccaattattcctgatcattcaacggcaagtgagt >CEESA14F acgtacgccagangagattcattcgaaaaaatgatgcaaacgtaaacaat aaagcaattttacacacaaaatagaaattattcccgagggttcagcgtct actttgaatcagctcgcgaagattcgtttcagcctccaccaatcctttct ccaaatagtctttctgcttctcgatagcttcaatttnctcttttgactgt ttagcttcagcttcatgtctggaaatttcagctggtttgtcggtcagcag gaaccatacggcccaacagatcgattagtacttgaggg >CEESA15F gaaaacgagtatttattgaggatttgtgagcaatggggatttgatgtgag gtaaaaaaaaataaaaacaaaaggtacaagantaaaatatacatatagga cccgcagaaattgagatttaaaaaaaaattcaaaaaaaagaaggaaattc aattaattgtgcatactattggtcatttctagcttaaaaggntcactgaa aagtgagggactttgtcggaaattataattatncgatgttgaagaagaag aagctccattaaccagcgtcggggaatcccaacttccaggttccattggg ttcttcacatccccttgatgattccgtattggntcctacatcatgatccg cttcaactgcggatagctctt >CEESA17F cgaagagtgaccatttatgcacattgatcggaaataaattaggaatccac gcacttcgaggatgaatcaataaaaacaggagaggggtgggaatagaata cgaaagatgaaagtaagaaaaaatcgcgtgagaaaattcgggagcgattc aaagggaacacagtgacaaccgggagtgaaggnttaancgtagtanttga gggacgcctttttctttgcctggncctccagantagcatccataaagtnt tcatgaagaatctcagtagcatctcgtcgaagagcaatcattccagcttc aacacaaaccgctttgcattggtgctccgttgaagtcatcggttacagcg gg >CEESA17R aggnaacaatattccgatatcggaggatgtgacaagcaaattcaggagct gattgaagctgttgtgcttccaatgactcacaaggntcgatttgttaatt tgggtattcancctccaaagggtgtgctcatgtatggaccaccaggaact ggtaaaacgatgatggcccgtgcggttgctgcccaaactaaatcaacatt cttgaagctcgcaggcccacaacttgttcagatgttcattggagatggag ccaagcttngttcgtgatgcttttgctcttgctaaggaaaagggttccag ctatttatttttcattgnatgagtttggatgcccattggtacgnagcgat ttcggattcagaggaaagcttggagg >CEESA18F ataacaaacagtttataaacaagaaatcacgcaacaatctcgaaacggag 
tgtggcgagaagttctggctcattgtnctgggtggtgacggtgtgagctc ctctaggggcgactggaagtctcttcaaagctgggactggttgtccagct gacttagcggccttcctactcgttacgacctggcgctagacaacgcccat gtcgcaacggcggagtataggtctctcgcttaagcgccatccatttncag ggctagttgattcggcaggtgagttgttacacactccttagcggataccg acttccatggccaccgtcctgctgtcaatatcaaccaaca >CEESA23F atagcacagaaaactatattnaatttaatattataatagcgattatnaag tcagctgctcactggaatganttccagagagggagagagaaatagacagt aaaacgagtctttgaagtaaagancaactcantacagaggcggggatgag tggtaaaaagattgcataatgtatttccaatattgaaagtagttatgtaa ttccgagacgacgggcttntccctcataanttaaaagtcgccacattt >CEESA24F aaaatttttttncattcttaaatattcgcagatgtcgtgggacagagttg caaaatgtcaagagtggtgtgtgtcggtatgggaaaataagaagcatgnc aatagtgtctgataagttaaaagaggggaagagggagggagagagagaga gtgtgcattttggnccagtagatgaagatgngtcgtaatnntgtaggaac acaattatatttatcagagaaaacgggataaaacaacaaactcgattcga gttgatcataaatctgtgttatcacaagaattcgacgnaacaggagttgg tggacgattgacgnggatattcgaccccgatggcaagggaaaaagtatt >CEESA49F ggaatgtgcatatatttatatatataantttaacaggaataacatggaaa acgtttcaaaaaactagcgagaaaacagagggttcgtattggaattatca caaaaggcacacaaggcgtagttggaaacatagtgatagattaggagtat agtgcaattaaaatacaatttnctttggtgaccataaaaaccctaattat gagaaaaggttagaaatttttaaaagcagattangagacggttacatagt taaaantgcatggcattataaagntcacaaattgggaaaggtntttttnc gagattcttcttctggtgggangtatagagcaagacggtcacgtcccata a >CEESA50F gataaatttcttatttagttgcacatgataaagtataaatgaaaataatt aaaattaaaaaaagagcaaaataatgtcacgtgaggtttaaaaaggagan taaaaagcccaaaaagtgaattgaacgnaggnaagatgtagangaganga gcattttgaaaaaataacgctaactatgctttaaaacagannganggtaa canaaaatgttgagaaccggtagag >CEESA51F gatcgaaatggtcagaccgttgtgaagcttgtggatcgtgccacattgct ccgagagcaagagcagaaggacaccgagaagaagcggaaggataaggaaa aggcggacaaggagcaaaaggctcgggagaaggctgataaggaggcggca gcgaagaagatcaagccggaagagctgttcaagcagggagagcacgtcgg gaaatactcgaagtttgatgaacgaggtgtaccgnctcatttggctgatg gaacggagatcacgaagagtcagatcaaaaagctggagaaggtgtacgga gctcaaaagnaaaagtatcagcaataaatattagtgcctaatataa >CEESA52F aaattttacaatgtttattgaagacgttgaacgtcaaattatcaaatttg 
atgaatgagataaataattataccagcatgaagattgtaagancacggag aacttacagggaagaaattggaaacaacataggacactagcgtagttcta tgtgtcattggggattgggaatgaggagatccgataagttagatgataga ngacagaggtaggancatattagaaggggaaaaggcagattatttaggcc ttggcggctggcttggcggccttcttggcagctggcttggcgatcttctt tggagcggccttcttggctggagactttngcgacccttctttggcaagct ggcttggcaaccttctttggggctcttgngccttccttgaccttcttaat ctccggtggccggccct >CEESA53F aacaataacaatttatttgaataaacaaatttaagccttagcttcggcct cggcaaggaaatctctcttgagttttcccatgaaagcacgcttctcggct gtggtttngaagcgaccgtgtccggtcttngaactggtgtcgatccactt gaggttgatcttctcgtgggcgactctcttggtttgggtgatgagcgact tncggagggtgataagacgcttctttggtccgagaacggntccacgaagc atgatgtagtcctcgttgacgataccgtatcttggggaatcctcccattt >CEESA54F attcttgaaaagttttttaaaagaaaagctgaaaattttacaataacgca gatgaaaatnccaaattttttggggatatttcgccaaaaaaatcattatg tgaatttntggtgatgaagatgatgacaatcgctttgaaaaaaaaaataa caaaaaatacaaaattcgggggaaaaaaatgaaaanttaacaataaaatt ggncattttagtttgagccaatttttggcagacagagagagngagagtaa gagtaaagagaggtagcagagannacagncaaaatttattcggggg >CEESA55R aaaagcttcgcaatgagttcaagttgtcagatttnnaacttctttacgat tacaagggaaataacttgagaagcgccatagttctnctnaaatatccaga tgcaatcaatctctgcaattctattcgttccaatccaactgtattcggaa aggaatggcacccgagagtcttcgaagtgctcgacgtagctgttcagccg ccaatcgataaataanctttttngtattttaatgcctgancctgtttttt ttgatgcttacatgaaaatttgt >CEESA56F actagttctctctnctttttttttttagaacaaacaatactttagtaaca actatgtgaaaaatgaaggcaaatgagagttaattcatcaattaatgtaa gagtgatatgacgattttaagcattgatttcagtttcccatttgtgcacc aatgttttgaacttccattcgtctgggtgtccgacgtggttgaccttgtc tttaaagtcaaactcagaccacttgaaagatggaaccttgccccatgttg gccctccttgtgcgacaaattgcaacttcttcatcaactcaacatttgna cccttgtaatcaagagctccgtggttaacgtgtccttgtcccggggaact cgtatgttccattagctggatt >CEESA56R gaggtttgctccacagcacattcgagatgaatgtntttattggcttactg ctagccactgttgtagcttctcaaagctcggaaggacgcgatgagagcta cacttacaagcaactttgcatagtggacgataagcctcaagttcttgatg gattcgactgccgcaaccaagttgctntcgccagatggcaaaacgctgtg aacacaactggctggactttcctggnagtcgnaaccaaggagaactactn nccacaaattcaaagcctactctgctgggatatctttaag >CEESA57F 
tccgatcaaatctacatggatatgcagaagttcggacgtgtccgtcgtca agccggaggatacggtggatatggtggatacggtagcggaccatctggac catccggaccatctggaccacacggtggattcccaggaggcccacaagga cacttcccaggaaatactggntcatcgaacaccccaactcttccaggagt tattggagttccaccatcagttactggacatccaggaggaagcccaatca acccagatggntccccatctgctggaccaggagacaagtgcaattgcaac accgaaaactcatgcccagctggnccagccggaccaaagggaactccagg acatgatggaccagatggaatttcaggagtt >CEESA58F atgagtgaggtgctttatttgaaaaatctttttggaattaaatttcaagt ttttttacagaaaaaaaaacaagttcagaaaggagcaaaaatacagaaac aaatttntggatgaaggggtacatgataattttnagggaggaaacatttt aagantagaattaagacaagatgcatcctggaaaattttgatcggcttcg gcgatttgcgacgcagtgaattgaagcatgaatagttggntccataaaat cctgataattcgattcagaaaccagagattctttatcagaagcaaaatcc atatctctatccaaataggaaatatcacttcccattgaacttcccgagtc gtcggcgatttgagaattcaatagcagt >CEESA59F gatcggcgccacagaattttttggagtaagactcgtcaatcgtgtatctg atatcttttataattggtctctcgatctcttgttcagctccactatggaa gacaggtagttgagattcttcaaaacctggagcataaccaagtcgtttga ttctagatagcatatccttgcaccgatttgcgacaaatttcgtttgaaaa tcatgaaaatcattttcagattgcttcttgatcagtttacaatgtttcgt gtaaatatcctcaaaggtcgattcgggattttcctcaatcattgtcttgg tgatctgttgatagtctgggttttccagaatatccttatgctgatcaatg ggntaaattggnttcatcacttcacaattgtttcgtcagctcattcccga ttttcc >CEESA60F ggtacagttcatttttncatttcagaataagcaacaaaaggtgttcatat gaatgtnctgtaggttagtaaaacactatgttaggttactctttgatgca aaaaggaatagttaaaaagttctcagaagctttcgaataaattataataa atacgttgcagaagtaactgggaaaggaatgatgatcgtnatttaaaaaa gatggaagagcttcaaaacaggggcttagaaaatncagttctataatant aaattgggaagagctcaaaanttcacaaaaaatggttaatgaatacgcat acaatgtcaccgcttgttgaccagcacactgagcgggtaaatttccgagt aagagatcgncgatatttcagcggatggagcacggtaggggaaagtagga ta >CEESA61F ataacaaacagtttataaacaagaaatcacgcaacaatctcgaaacggag tgtggcgagaagttctggctcattgtnctgggtggtgacggtgtgagctc ctctaggggcgactggaagtctctncaaagctgggactggttgtccagct gacttagcggccttnctctnctcgtcattggacttgacacggttcaagaa gtcggttctgcacttggatggcttgatgtgctcgatacggatgttgattc tctttggaagaatgtttccgcggactctcttgttgacgatgattccgacg gctcctctggtgacgttgaaagattctcccggttcttccgtggtaagcct 
tgaatggcata >CEESA64F aagagtttgaaacttttattagctgtttttttagttcaaagtgagaaaag atgaaagaaaaaacaataaacagtattatgttcagaagtgtattgaagag agatgggggcgaaactaatcctcaatgaatctaacttgaattatgttttt nctcatggaaaatcgcgataaaaggattactgtgtcttctacagtaaccc gaaacgtaagtttntgggtgttgggggtgggggagggttgattcgtgagc aggatttcggggcatttacacgaaacttttcctcatttttctcgtgccga attcctgcagcccgggggatccactagttctagagcggccgccaccgggt ggagctccagct >CEESA65F aataagtaacactttcatcacataaaacatcagtttagtgaaattgaccg gaaattgaagtaaaaataaacgcgggaaaggatggtgtgacttgactagg ttctaggcggcacgancagcaaatnttggttttnaagttattaaatgcaa aacgtttgatttttgantgttgggaaattgcacaatttagagggcattgc gagtntctgagaaaganatgaaatgttaattgttttgggcgctgaaatga aagatgaccagtggcaaagtacggatgagttaaggtgagtaaaaganata aatgcaaaggggtatgggtgggtaatgcgactagaaacactaagcnagta tatccgtaatggttggaaaattg >CEESA66F gaagcacacgaaactttatttttttttgttggagttcaacatgaaattca gcaattnacgaataaaataagacataaagaacggagagaaaagtggtgat gagatcggcggttcgntcgcaaaaatcaattttcgggatggaaaaatacg aggattatggtacaagttggnttaaatgaatattaaaagtgcttcgagaa ttggtgatggagcttaagcacgttctccgcggatgcgtctggcgagttgc atgtcctttggcatgatggtgacgcgcttggcgtggatggcgcacaggtt ggtgtcctcgaagagtccgacgaggtatgcttcaaatgcctcctggagag ctcccgattggcagccgnctggaaagccggnggtccagtcttt >CEESA67F acatagctaacatttatnagccatttgaggatcgggaataaatttgtata caaaacaagtataacaacgaacactaatgggggcggaaaaaaaggtgaca gagcaagtatttttttaaagagattcattgaaatcgatcaacagtaacaa gaaaaatgggatatactaatgcggatgctatccgtaccgttcatctcaca aaactcgcgaattggtcgatgaaagggtagtgatttattgctcatcggct ttcgtctcgttctttgaggcggcgtcgaagtctccaacaagttctggcac atcttcgtcctctcccttttccatctggtccgagctttggtaaccgttnt tggcaagttttcttgagggtggggtgaggggactctgggacc >CEESA68F aaaaaataaagaaattattcacgtaatcaaaaacagacagaaaaaaaaag taagctcgaataagcttatacatataccgagggtgaataagtgaattgta aatgtgagagttaggctttgaaacgttcgggagaagcgggaaagattgac aacttaaacgtgtaaaaccatganattccgtataatctagtttggggtgt gagcnttgaaatgtgcaggataacaacaacaaaanggtgggttgaaagan atctggngnaattaaacagttattagccgtgacgaacagaagcnccgggg ggctct >CEESA69F aaaaaaattcaattaaatttattatcaatgctccaaaactcatgccaaga 
agagatctgaaaacaggtgggtgtgtctgtgcaagtaaaaaattcaagaa aggacaagctggttggaaagaaaaatacaaaaaagtcgatggtctaacag aataaccagaacgagattgancgggaatncgnttgagangaaagcaagct tgtntgatggtagatgggatgnttggttgagatttcaatattaccaactg gctgagtattattnatgatttttnatcagcattgtcca >CEESA71F atcaattttttttattggaattcaacggtaaaacgagcgagggtggactg tattaattgaactacccaattgaggtctttncttgagaacacacacaaat taacaccaacgtatacaatattctncgatcggttttnttcggaggagatt tataaaaacactgccagagaactcatctttcaaaaaagaagacatcgggn ttgaaggacaacttgaaacaaatganggaaatgataatcacactaaaacc gagcatggtgcactaattanttataaaaaattaagagtgagagtaggacc gagagaaaagag >CEESA72R tcacaagtgattcaattgtttcgtaaaaatcaatagttttncttaattct gcttaaaaattggcctaaaatcttgaaaattaacaaagttatgaatttnc gaaaattttcaaaaaccaacaaaaaatttgattttttaaaatttaaaatc aataatctacaataaacttacaattaggcagatgaaaattccaattttng caaattttgaagctataacgctgaaaactcgtacagctaaaaactncgnc cattttggggtcccaccgcggncaacccaaaagtggggtgggaggcctag acgtnttagggggtcatttttcaaaaggtcttcggtg >CEESA73F agcagccaagtcctcacgagcctcggtgaactctccttcctccatacctt ctccgacgtaccantggacgaaggcacgcttggcgtacatcaagtcgaac ttgtagtcgagacgagaccaggcctcagcnatggcggtagtgttggagag catgcagacggcgcgtggcaccttggcaagatcacctcctggcacaacan ttggtggctggtagttgattccgaccttnaatccggttgggcaccaatcg acgaattggatggttctnttggtcttgattgcagcgatggcggtgttaac gtcctttngaacgacgtctactctgtacaagaggcacacagccatggtac tttccgtgacgtggatcaaaacttgaccatctggttaanccggcttcgaa agcangctattnggtgatg >CEESA74F gtgtgtgtgtgtgtgtcgaatcgttcgagaaaataggaaaatatgcgaaa aaaatgaaaaaaaatgaataagggagaaaaaagtacaagaaacagaaaaa ttagaagatatttttttattcaatcatcaccgggatgttcggggcagcaa ttcttccgtttgaaccaatcatcgatacattgtttatggtagatacatag acatggtaatcgtgctattttgtgtccagcttcgaggtcttccagacaaa tcgaacattctcctttgtcgtcttttagcacgncatcattataggnaatt ttgggtctcgtcaggcacatgnccagntgaatntccgcgtcgtccgatgg ttacaactttt >CEESA75R tatcaatgaatgtatttncccacctttcctatcaaattagcccttccagt caattcccccgccacctccctttccaatcatcagcacttgaccgatacag tcaacgcatctnagttgactccaatattttnccccgtctgatgttcttct tgtgttagtgaccttctcaatcatttctccccaaaattttttctctatca 
atgtgtactaacattgccaattctacggcggacttgtctccgttttagtg gtatganttatatacatatatataanntntnaatttaaaattgcatccta tttcgggtaatagg >CEESA76F accaaatgaggcatctcgattcattcgtagtattatggtacattcgacac aaaatgagcgacaaaaaaaaaagaaaaatgacaaaacaaaacaaagggaa aaaaaatggaatttgagttgggcataaattatatatatatntnnntatat aaancttgangaacttttttttgtgtttaanaagnggtgtggaacatttt tttaaggggaaaaggcattgaaacgtaagtagtcgngagggttttggctc gtgccgaattcctgcagcccgggggatccnctagttctagagcggccc >CEESA77F aagaatttctactttttattgatttnccgcataatgtaaagtaattttaa gaattacaagantaaaataattgaatgagaggncgtgggtgtgtttnctt aaaaacaacaacgagtgaagggggaattacagacaaaaagaaataaaana tgggatagagatgggggtgataggtggagatgaggggatatgaaaggtag aagancctggtaaaatggtctgncggggctcaangggaaatggggctcaa aaccaaaaacgaa >CEESA79FB gaaggggtgatttcattagatatttttaaaaattattccaaatttcacgc ataacagaaagaaaacaaaacaagaaggaatatcacaaaatgtttgatgg aaaccggtaagaagtgaggataataggcacgtnctgagtagctgatctat atagataaaatgtgaaaacaagttgaactaatctggcgtacgagaaaaga aaaggtaaatcgataaatatntatgtacaacgggtatagtggatcgtgag aaaagtgcatcgggacaagngacttagagntaaaaaacgtnaggcagagt tcactcaatanacancaaattttncgaaaaaaacatctatggattattca taaatgggncccttccgagtgt >CEESA79R attttagaaaagtatatcatcataatcaccacttcaaaaactttgacgtc ggcttcggaatttngacttctctttgggattatgttttccacacactcgg aatgggcccattataataatacatagatgtttttncggaaaatttgttgt ttattgagtgaactctgcctcacgtttttaatctctaagtctcttgtccc gattcacttttctcacgatccactatacccgtngtacatatatatttatc gatttaccttttcttttctcgtacgncagattagttcaacttggttttna acatttttnatctatataggtccagcttac >CEESA80F attgattcgaaataatttatcgtatacaacacaagcgatgagcatagaaa ttggaactcttttcattcaaaatttagaaaaaaataaaaagaagcgaatt aagcagaaattgatgcgagtncagtattatgcagattggagcaggcggca cgagtttaatactcttctccttcctcctcgtttcctccctcgttcgagtc agctccgacctcttcgtagtccttttcgagagcagccaagtcttcacgag cctcggtgaactctccctcctccattccttctccgacgtaccagtgaacg aaggcgcgcttggggtacataagatcgaacttgtagtcccaagcg >CEESA81F aaaatcgataaattcttcatcataattaatcaggaaatgtttgtnattga aaaaaaaacaagaaaaatggggcgtgtcgatgagaaattggggcaaaaaa aaaatcgataaatcgataaatcaagaggntctttgggcggaaaaatgaga 
ttttcagagagaaaaatggtgaaaaactaagaggtcagcgaaccgggaac acaagaaaaaantcaaaaaaaaantcgataaaatcgaatcatcgtccatt cggcatttncggcggcttttnctgggcctgggcctgagcctgagcctggg cctgtttgagcctgctggacttgggccaggnagcaaatttggctgtagac cgagcagtt >CEESA82F ggtgttgttaacagatttattgaaaacaaataacaagatctttagtcgaa gagaccgaagcccatgtcgtcatcggattcctcctttggctcctccttct tcttggtctcggcagctggggcggctcctccagcagctggagcagcagcg gcggcggctggagctggtccagatccggctccggaagagacagaagtgat gaggttcttcacatcaactccctcgagagccttggcgaagagtcctggcc agtatggctcgaactcgacgttggcggccttgagaagggtagcgatcttn tcgccggtgatggcgacctcgtcatcttgaaggatgagagcagcntagac gcaagccagtttcttggttcgaagccattnttcaatcgtaaagttncgcg gttttgcctta >CEESA83F gaatatttattttaaattgtgaaatgcaaattggtttcgttgaacttttc aagtgaaaatccatgcaataagagcgcaaaatcatacataatacagtgac gagaagcaatcgaaatatcacagaaaaagttaataagcgagatttttaga ttgggaatgagaaagtncttaatgggcttgcttcttggcgaggttctcag ccaatttntcgaggaattcaaaggtgttgaggtagtcggtacgagtgaca gctgaagcatttcctcccttcacgcaaatggcaagatccttggtgaggaa tccagcctccattgtctcaatgcacacagcttccaaattattggcgaatg tctcaagggcagagttcttgtcgagagtgggcacggtgagccaatccacg ggacccaggcgaagattggagggcaattgggattcgtgggaggttccttg tccccttttgggtgcattcctgtaaat >CEESA84F aaggaaaatagacagtttatattcggaatttataaaacaaatgtgataag aactgccggcggatacgnaaataccgaaaaaagtaatcaccgacgacacc gaaacggatggaaaatcgaaaaaaaataaataattgggggaaagaataca cacatcgacagangaccgcaatttagtgagtgatatccatggactcgacg acgtcgtacttgtaggtcttgtgtttcagcaccttggtggctccggcctg gaatttntagacgaggtcgttgggctctggtcgcttgcagaattctggca caattggccccttgaagacggtctcgcttggctcgatgatcacaagattc aaactgtttcctgaggcgttgtcaccgtgcataccggcttccagggcgcg ttggacaagcttctcggcttcaactttcgtcatatcaaccttaaaatcac gttccaaaatagtgatagcntgcata >CEESA85F gtattgtttgcaatttatttgaagacaaaaattggaaagaataattgggt agaattaaagggagaaaagagggaagaaaatagcttcacaagttttaaag tacaccagctaaccgaagaaagaaaanttgantgaaatatggttccttta tgaaaatctcttcgaaaaggaagatagttaacacaaagatggtccatcta gacaaaaaagcaggntcagcgactaanatgaaaancanctagaatattaa gaggttggtagagantgagaaagcnataaantaagggaataa >CEESA86R 
ggaccttnttgtnattaagccaaaggtgttcccagacgagcntggtttct tttccgagagttacaacaaaactgantgggcagagaaaatcggctacact gaggntcttcaacaggataaccactcgttctcccattatggcgttctccg tggtcttcatacccaaccacacatgggaaanttagttactgtggttagcg gcgagatcttcgatgtggctgttgacattcgcaaggncagtccaacgtac ggaaaatggcatggngtggtttctcaaccggggataataagcacgnnttc tgggatttccagcccggg >CEESA87F gcatggtaattgtcgaattttatttccaaacactcatgaaaaaagaaaaa taaataatacagtcaaatttttttncntgtagttggaaactnttaacgat aacgaaaattggtaaatgagaccacaagaaacagaggtcgataatttagg ggaaacaataggcattttactccatcagcacaatgccgggtaaaacgaag gtggaactcatatttagttgantgcataggagggtacatggaatcatttg gttcggccttcgatgtaggtagagacatcgatcttttctgggagctcagt gatgctgatatcgaagcggtcttgaacgctgttcaaggtcttggcgtcat tctcatcagaaacgaaagtgatggnccaaacccttggttcccaaaaccga ccagcacgtgcaactcttttgaaggntacgagtcgggaatcttcttggca tg >CEESA88F aacaataacaatttatttgaataaacaaatttaagccttagcttcggcct cggcaaggaaatctctcttgagttttcccatgaaagcacgcttctnggct gtggtttggaagcnaccgtntccggtcttggaactggtgtcgatccactt naggttgatcttctcgtgggcgactctnttggtttgggtgatgagcgact tgcggagggtgataagacgcttctttggtccgagaacggctccacgaagc atgatgtagtcctggttgacgataccgtatcttgggaatcctcccattgg ggtgatggtcttctgggtaagatcgaactcggtgg >CEESA89FB aatggtacattctatattgaaagaaatgctaaatagtttgtatgtacagg aaagtagccaaatacccataagcagagaaacagaagtggggaaaggaaaa cagacaagaaaaacagctagaaaggaaagtaagagatattaatcacaatg aaacgcggataacattgataagtgataatgttgataaactctgtgatgat gataaagcctacatacacaaacacacggatgaaaatactattcaaatgct caatgagagtgaccagaagctagaattgcggggacgacggctcctccaat ttattaacagaagctcttttgatcgtgtaacttngctcccagatgaatga ggaatttccctatttgaaggatggtgcagttgcatccacggcaagggga >CEESA89R tggaaatatccacgcacaaataggatattcactagttttggnatttgtac tcatgcttcttgttgatcaaattggaagtgtcacggtggcaagaaatgat agagcagggagaagccgaattggaatctctgccacaattgggctcgtggt acagctgcagcggacggtgtcgcattaggaagtgcttcagtcatcaacaa atccgatgttcaaataattgtttttggttgctataatgcttnacaaagca cccncttgcat >CEESA90F tgccgctcgtgccggcaaaaaaaaatcaatgggaaaatgtcatcataggt aatacaaaaaaataattttttgggagttttccagaaaaacgggcggcttt 
gaacaatgagaatttggagcaagaaattggtggaaaaaatcggcggtaaa aatcggaaaaatcaataatttatcgttaaaatcaataaaaatcaacgtcg actatgccgatccctgtccccccgatccctccgacgttccgaacttctcg accgttttcgacggcgatcccgttccgaagatcggcttctcgatcggtga ttatgtcgtgatgaagattgctctttttcacgacttttcgatcgtttttt gctctttttactcgattttntctccttctcatcgtctctcatctctngtt ctctcttnttttccgacgacgatttttccgattttttattaatcgccgnt ttttgaatctggaacggggtaaat >CEESA90R aatgaacaaagaggcccggaaaaaactgccggcctggntcctggaaggcc ttgaaaaggcganancgggagaagcaaaagcagtnggaaaaagaggaaaa gctgaagaaagcggaagaagaaaaagcccggcgaagagccgaagctggaa agagcaaatttgactcttcgtcggatgaagagagcccggaaaatgagaaa tttcctgttggtaatggaaaatctgaatatcaggaagatgataatgattc ggaagacgatttggaggagagacgagagcaatttatacgctgtgtgaaaa ctctaattgatgaacgtgctcctcgngtcttcgaatgacgtcatcatgcg tataatacaag >CEESA92F aaacacagacacgggatggaatattgaacttnttggtaaagtctcaatga gaggtaaaagtgcaagtttacatgtaagaaatcgttcatcattatccaga agcttctgggttagagtcgcagttnttggatatttntncttctgaggttg atagaagcntcgcacataaaatccgctcaagagaatgatttgggaaattt tgaagtgtatcactcgtggtggacggcccgacaaatgtttcaggttcttc aaatgaaaaaccagttggcattgatttacataatctnttgatggacgcga aaacttccgaagagttgtgcggtaaacgtccaaagttcgagtcttggtaa attaaaaaagcaatcctctcgcagagaccgagttttcacatcatatcgtc tttcgttttcgatgataatcaattatctctgcttgtgac >CEESA93R gaaagtcgaaaggtggtactcgtggtgaacaattcatttatgcggctgaa gcattcgattcgactaacaatgttccgataaaagtcggcgatctcacatc aaccaatactcatattattaaaaaaggaacagttgttgatgcgaaattcg cactggccgatcgttgtcatgtattcaaaaatgagattnatggaagtntc tntcaggcgacactntcgtttactgatcttacacagnataaggattcgta ttataagnttcaactgtt >CEESA94F gaaacagtaaatttatttcatagaaaantgaaaaaatgaataaaactata aacaantaaaancgacaaagtgggaaatatctatttagactttaccaatc gtgtacaattnctgagtgtagctggtccattccagtatcctttaatcaaa ccattccaatacagngctggcatgaaataccttttcatcaaatatgccca atatgtgggtttcgattgnncaantggagttgtctccatggctcctcgag gtccaaattctgcgaggttcacacgatttgtgctcacaaccagtgggcat gacgcgtancgatcggtactgcatacatggccgatttaccctgcatgact tgttgtcaaaatttctt >CEESA95F aagtggtcctaacaagtttattcatgttaagaaaccataaatataaagaa aatgaagtatctcatggaatttttntcaagtaaaatcgtgtttcgcctag 
aaatggaaggcacatnttgaaaaaaaaaatttgaaaaaaantcagcttag aagcactttcggtgaacaatcgatgggccggactcgtcgtactcctgctt ggagatccacatttnctggaaggttgagagggaagcaagantagntcctc cgnnccanacggagtacttgcgctctggnggagcaataatcttgatcttc attgtgcttggggccaaatnttgantctcctttt >CEESA96F atattggcattaaacatttatttttgacacactgaaatttaaacaaatgt atagtcaaaaaaaaacatgcaaataaattatgcttatacagatcagttgt gatttgaattcacaaccttcacagtggccaaccgatatatatatatatat ngnnccagatcagactagaatttggaatagaagcacctccaccttgtttt tatgtttgantttttctttttcatgatcatgttcatctgatcagaagttc atctttagccgctgttgagtagcaaagttcagcaatcgttctcaggtttg tagttgttgggccaggtagaatcggcttgatttgagtgatagttttaaca gcgtctaccgcccgtcccagtgtacaactngaatcaacgcttttgtgttg gangtattgcgga >CEESB01F aagtgcaatgtttgtagccattgaagctccttgtgtgagctcgttcactc ttgcaagaatgaatggggtgacactttgtgaagcgatgttttgttcaatg gcttcttgaacagcttggtcaattgctttctggatcgcatctccatcagc tgcatattttncaggaataggacaagccagaatagttccntaaggtaatc cgngagntttcnaggtttttaacaggtgaacaacttcttccaatgattca gtgcagaattgnagcttttctaactga 5.fa100644000766000024 10441614006427353 16012 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data/dbfa>CEESB04FB tttttttgttagagtattttatatatttattattattacagcttacagaa ctttgatttgttttacagaaaaaaggtgcaaccgcttagacaaattcaat ggattatcattatttgaaactttttgcagttccttattttcaaaaaaatc ttggtttttggtttgatcagggtgagaaaggatttcgggggtcgaaagct agaaaattatcaattttttgtgattttcgattgttttnatgtttttcatt tatacgagatagcgaagaaagaaagtttggnggcaggatggtagcattga atattggtcgaatcgtttaggcggcttcttcatcgacagtttccttgggc ggtgagtgggcttgagctcccggnttccggtgagtaccgacgnacnttct cctcagcg >CEESB04R gccaagaccagaagtnaacgaattcattggagtcaacgtcggattcggaa tcgccatcgttttcggtgttgctgtctctgccaaaatntncggaggacat attaaccctgccgtctcctttnccttcctttccgtcggacaaancaccat cgtccaattnattg >CEESB05F atgaaacgactttggaaaaaggttgggtctggtgtgaagaatgcnaatta tctcaaaaagtctgtagaagaagtagcgaaaattgaaaaaggggaaacaa aagtcgtcccaccaaaatatccaactgaaaaatcacgagaagttagcgan gaaatcaaaaaagaattagcgntgaaaaatgaaagtttngtgggaaatat gactaa >CEESB06F ctagcattccatganctgtgccaagcattctatcaacaaagattgaaaac 
tattcgatctagaaatattccgaataactcgccggcgttagtcgtacgac aacaattcaaattggcgtttctgtcaganctcagacaggatactcataca gcattgagaaattatcgnttggcttatgatcaatgtcgggatacagttga ncaatgggatggagtcgatgtttttangtggagaagtgttgttggattgc ttanttataaggnacagcagggnattncttgaaaacccacgaattccgtt attagtactnaggggtgtcccnttgccgggaaatttttgaaaa >CEESB07F gaagaagcaacaaactttatagtaaaaaaatgacaaaagagtatcattgg agaaaagaaaacggataactgggagagcaagcanttaancaattttnaaa aattgaaaattaagaaaaggtaatagcaaatgggtggagaaacagtgggg aagagatttcnttagaaagaancaaaatttgtaaatcggaaggaaancan gntgaaaacggnggtcg >CEESB09F aaagtgtataaatttattacaaaagcttttncaaattacaacagaagaga gactggaatgcatgatgattggtggtaaaaggggaaaaaggntcttatga aaaattagaacaaatcacagaaaatttcaggttcggacagacagacaggt ggataagcnctcgtgatatatatttccagngngtaaatcagaaatagaca agatatgatgcaaaggagctctacggtcaagacaatagangtnccnntag ggtgcttacnaaacacttcgaattttgt >CEESB10R agaagccgcgtgttgttaaggaagaagttatcgagccaggttcacaatct gaaactcaaaaagaatctccggagaaagttcgagttgttgtaccgaaagt tgaagttgaaagatcaccgtcgccaaaatcttctcgtgatcgtaagaagg ntcganagaaatctcgtgagaaagatcgtgaaagagatcgtgacagaaga gaaggttcaaaacatcgtgatagttatcatggncatcggnaacggcagca gtttctttccagtgtacgacggtat >CEESB11F actttgtctatttattggtttatnctaataaaataaangcagtaacgntt caaaaacgctcaaaacagaaattaaacgaagcanatntttaagtgctgag attcataaagttgagatggaatgtnttgagtgtcgattgaggaattctta attaaccgacgtcgtatccnatcctatcgncatttncaggagcatctcat tgggtgtttggttgtccgagcgtgtggagcattgatttcncggcagagat ccacgagttgaattcattttttnagctttcgg >CEESB11FB actttgtctatttattggtttattctaataaaataaatgcagtaacgttt caaaaacgctcaaaacagaaattaaacgaagcagatttttgagtgctgag attcataaagttgagatggaatgtgttgagtgtcgattgaggaattctta attaaccgacgtcgtatccaatcctatcgccattttcaggagcatctcat tgggtgtttggttgttcgagcgtntggagcattgatttctcggcagagat ccacgagttgaattcattttttgagctttcggttcttcttccgtttttct acgagcacttctatccaatttccaaatcttctcgcagttcccacgngtna ccatttgaaatcattaggcacttctacggattctactatggtcggcccat catacttttcactcg >CEESB11R aaaaatgccactgtccaagagacgagcggcaggtagaatcaatggactga aaggcccacccgcaaaaaaaagaaaggaaattgaacgacatgaagctctt 
ctcatgcatccgttagcgtatactgcagaacagtacgaacaagttgcaga ggagtcgaagttctacaaaanttgcttcgaaactactgccgccgagaacg ttgaacttctaaaatccaacggtcaactcaaaaaagagcttggaattttt gaaaaaccaacanaggggnatcagttgtaaaggtccgattattggaagtt tcaaaattccccaaattcg >CEESB13F caggagtcaattgatttatttacagaaatcatttagcaaaaaaagtaaaa ttggaagaaagaaaaatgagagaataaatcatttagaagagtctaaactt gagattgtgaagaattcctgcagtgattgttttacaagtccttgattggg tcattctccttcttgatggagtaagcgaggtaagcgtagaaagcggtcca tacagcgaatggaacgacaaggaggccagccttcttgtcaatcttgtaga atgcaaaanatgcagctgcagcagtcaaactgacaacagtggtattcttc cacaaacatcccaagctccttcttcttgacgatcggaaattgtttgcaaa cagcgnagagttgacactngctncatacaaaccaaggagccaaacttttg tg >CEESB14F acctttagtttgactttattggtagaacctgagaacgagagaaaaaaaat aaaatatataagtnaagctcattggctagagatgaaatgggaaaacaaat aataatttatgcggttncttcagtgcttttcgaaagaggttgcatttcat gcgagctgcggtacgaatctccctcctcgatctcttcaatactctccggc agtggttttcccatagtttcaggcaaaaagagcaacgtcatcactgcggc gagcacagccatacacccgaccggaataatcatgaaaaccttcccaaact tgcttcaagcgatccacatggagatngttacgaagcgggcgatcg >CEESB15F gaaaaaagtncgttattcgactttatttncagaatttcagacaaatacaa aataaaacccgcaaggaaaaaaagataccagattagtcaaaaattgtaca aattgttgtgttagttgtncagaaagtnccggatatattgttgatgtgct cgacaatcacatcgtagaatgtgtagatacccngtgcaatcatcacaatg antattgtgattatcancagtgttattatcagcactctgtgtgactgtcc tcgttcgattctctgcatttcttttatggggaagtgacagaggacgcatg tcgaaacaacgagnaatcttgagganttttgaa >CEESB16F aatttaaaagtttatttatcggaaatgttgataaggaagcacgaattaaa attgaaagagggggcggttgaggggggatacaattacaccgngtatcttg tcaatgaaggttttcatgtcattagaatgagacttgataaacgataaaaa atgcatctgaataactatgggcaatatgtgtgagatgggtaattaacaat gaaacatgggngataaaacgancaacatcctaataaaananctcttaaat acccncttgaaaacatcgnncaaggcgactgantactngctaaatcgaat ccnatgggcaatcaagagtggatttgttttaccccgtcttggggtccgac >CEESB17F gaagaagcaacaaactttatagtaaaaaaatgacaaaagagtatcattgg agaaaagaaaacggataactgggagagcaagcaattaancaattttnaaa aattgaaaattaagaaaaggtaatagcaaatgggtggagaaacagtgggg aagagatttcgatagaaaganacaaaanttgtaaatcggaaggaaaacaa gatgaaaaaggaggtcgntagactttgngagatggttatctntcgtaacc 
tncgtgtctgatggtttncttaatatcccnttctttcaatcgtcagggaa cacgcacatcgtcg >CEESB18F gaacaggtcgtctatttatnctgtnaaaaaccgtgctgttgtaccgtttg anactgtaaaacagcataggtcagaagaaaacaacatcactatacaaaat aattttggaaaacgggttgagctaattnatttattggttggcctttctna gttgatactcgacgacgatgaatggngcccagaatccgnttg >CEESB19F ctnattaaaagctttattatgaatgtggctcaaataatgagcatgattca nagaaaaaatggtttaaaatgtcaattttgtaatgagaaaatgggggtca tcggcagtaatagggtacaacaacaaaagtgattgcnttaaacctcaact tcaaaccaaagatacacagagaacctagttatacatgcctagattactac cggantagtattgaccaaatacaagagangttaccantgaagatttgggt gagaatgggaagcataatgcagtcggctagagaagttngaactattacta gcaatgtacacggagagggtgaaggaaaaa >CEESB20R aatggtgaaggaaacaggntattatgacgttntcggagtaaagccggatg cttcagacaatgagttnaagaaggcctaccgtaaaatggcgctcaaattc catccagacaagaatccagacggagctnagcaattnaagcagatctncca gggatacgaagtattat >CEESB21FB cgggtaaaaatgctcctagataaaaaagaactgtcacacagttaggggat ttaaaacatgatcagacaagtaaaagcgtgagtaggcgtgataaatataa actttgaaatatgaaaaaggaaaatgcatttgtggatcattaaaattcta taaagttatgaaggaaaacgcgacaaaaatagtgataagtacggtattgg ttactcggaatgtacatcggcaaaaaatgcgacagtgacaagntccaatg ggaaaaaaggtaaaaaccaaaatganaccatacattggggcccaatattg gggaaaaattttgctaccaa >CEESB21R gttactttccaaaacggacacctcatcagcgaattctggaagctcaccga ttctctgcaaaaatgttccnggnttaacgcctcaacagaagcgaatgtnt catgagaatccgaatatcatcaagtatctgatttctggacttcgaagtgc tcttcacacttgcgagtacacatttnaacgagaagcatngaactntactt taacacttccttggggttggaacttcacctcttcaaattgcctctcgtga atcagcttatgtggtacgcgatttttggcggnccngtgntgttctcactc gttggctcgtg >CEESB22F aancttcaataaatttatcagaaaacaaaaataaaacaaccgcgtaaaaa aaagagaaaacacaaaaaggctccaattattcctaaaatnccaagattgt taatcancggtccattcttcgncttctccatcttctactacctcgttttg aactttgcctttngcaggtaccttctgatccttagcaacaatgcaattgg tgacgncagcattctctccgatttcnactccatcacaaatnaatagggtt gcttatgggggnaccatttccaatgacaacgcctttggcaatgatactct cttttagctttgttttttca >CEESB25F agaatgaaatctatattatttggaaaaaagtttttaaaaaatccagattg tgaccgaaaaaaatnattcgaaaagaaaaaaacacaaaanttgaggnaaa acatgangaaagantggcaaaaagttttttgactcgcaaagaatcaccta 
aacatttcaaatttcgtatgaaagtttgtncgctttaatgataactttta aaattcacattagggcgcactttctgggggaaaagtcgaaggaagaagaa aaacgtgattatcacagaaaaatcaatgaaatgaaatggnatttgaagag gattgttggaaggaaaccttgtggattttcaacaaaaaaa >CEESB26F gttgtcaagtatactttcagatattgttgagttaatggctcagtgggtgc tctcattgaaatcgggtgctctacgagtncggaaactgtctgtaagtatc tttccatgtcgttttccgcagggatttcgtcgtagttaaccttatccaac accaacatatcccaccactcaatatccggtaccccattctccattttcgc cgtgcccgttggtgtaaccatcgcaagtttcacagccgatgagattcccg tcgattgtgcagccgatgagacctcattctgaagcctttcgagctttgcc attgctctttgcttattcgcaagtttctcgaattctcccttctcatggaa attgaacccccggcgacgtcgatctngcagttctcggctgaatccgtggg atccagatactcggatgagtttctccggttccttttggattcttttg >CEESB27F aattttttaatctagtagttctaaaataaatctcagactgataactgtga ccaaatacataattactcatacaatactcaggtcttctattagaataata atctctaattactaaacgataactaaagaaagactctaataatacataaa taaataagnntagtcctgcagttctaataataganccataagnggcaata atatttcataccgnataaacatcagggtaatctaaatatttacgtgggaa cccgtgtagtcctgcaaaatgtagcgggaaaaatgttaaatttaccccaa taaataataaaataaatactgcagntatcataaggttatctaacacatac cctggtaataaatcttcatccatag >CEESB29F cgttcacatttnctttctctgtcggtcttaactcaaccgctcccccgtta caaaaatctcagtcaaaaaaaattaattttgacacgncatatttgtngtn cttgaggnccccctgtgactttncaaaacgntattttaattgtnctattt tgtgtcaactactgatgaaagtcattgaaaatgaactcgtaaatttg >CEESB32F atagaaaaattgtttattaaaaataaattncnctctatattgatactacc tttaaattttttacaaaaaaaacatgtctgtgtgtaggcggatacaggga gggggntaccatcagtaattggngcttaaaattccggaaaaaattgggaa agaaagaaaccngtaacatttcggatgtntgggcgggcgggtgattgtgg taagtgagaattggatattctgagaaattttgagccatgggaggnaaaaa agnncaaaaactgggtgaggntcgggggacattttttttgttgcaaaata gtccccctacacgtactcaaaaattnggaagtcaaaaaaaaaattg >CEESB33F ggacaaatgttcattatcatttacaacttcaatagggcaaatagaaattc aataataaataaataaaatcagaaaattgacagcttgcnctgaattagac acttctnctgactgagattctggtgataaaaagaggcgggatcctaaggt gctcatcgacccgcaggagagaaactaaggtcgattagtgaggaaataca ttttaaaagataaatcaactaataaagaggaaagatgaattgatgatgtt tggtgatttntttgaggattgagattggagatcgcaattattaatgncac gcatcggtttggaacgaatccgtcanttctccatcacaatttcgggagtt 
tggagaactgcaaaacagcccantntccaaatcctgctccctaaatacca ccgcagcagtttaagataagtgttggtaaggtcatccc >CEESB34F atntttatgattttattttaacgtgaataaacatcacaaaagtnagctta ctcaaggggtggggtgtggggcggctaaaccaaccactaacaagtaacaa aaagaagggtgacagtaagaaaaaaaaacaggagatgggtatgcttagca actgggggaacgtgctaagagcacttggcaatgaacttattgcttctnag cggaaacgagaaccgatgcagcttcgtcgaccttcgagcggaacaattca ctatcttgaagcatcatgatcaactcggaattgtcgatttcgagcatcat tcctgtgaattttccctgcggtccttgtgtcccgggtagagtttctcaat gagnagccgtagatacgntcaccaagaagttg >CEESB34R tctgagagacacgctgaattgaaaaagaagcacgagcaacacaaggctga gcgtatgcagaagtatcaaggagtcaatctctacgtcaaaaacctcgacg agactgtcgatgatgatggcttgaagaagcaattcgagtcttacggaaac atcacgagtgctaaggtcatgactgacgaaaatgggagatcaaagggatt cggttttgtctgcttcgaaaagccggaggaagctacaagtgcagtnactg agatgaactcaaaaatggtgtgctcaaagccattntatgtngctattgct caacgtaaagaagatcgtcgtgcacagcttgcttctcaattacatgcaac gtcttgcagcatgngaatgcacggcaaactttccagg >CEESB35F aatcaacaatttattaccactttcgttcaagaggttcagaggttgggatg ggggataatagctgcaaccagcattcacatataatatttnagatgcgggg aagaggattattggaaaaaggagtgaacgaaagtttcagttgaatacatt atttcgagttacagaacaatgaaaagacaacgaaaatggggggaatgatg attgatttagttggtctggcttgggcagtcgcgggagatgtgtccggtct cctggcactggtagcaacgcttctcctcggccgagccactntcggtgcac tcgngggaaatgtgcctcgtgccgaattcctgcagcccgggggatccact agttctagagcggccgccaccgnggttggangctcccagctttttgtncc cttt >CEESB36F cgtaaaaatttgtctttattttgagttcgtcctgatttgcagagctctta tactaaatgaattgaattgcaaacaattgcgaatgatgatatccacagaa aaaaggatagtgaaatggagattctttcaagtgggggtgggatgagaagg agatggtggtgggggaagggtaacaattttaatgataaactggaataaaa cttgactattggtgttggcatctnacgccttgtccgtnatgtcctccgtg ttgtncttcgtccgaatcttcattgtaagcgtctccgccacgtcctcggg ctgtacttcttctcgtcatattcccataaggggaaaccttctgcggcagc tggggggtgacaattancaacctttttgaggtaggggaaggcangctttt tcagaaacgggcataa >CEESB37F gcgaagaaaaatataaatttnattcaaaaatgattcagaatagaaaaatt tgaaaagtgtcaaaaataaatgtggattcgacaaaaaccccagaaatttc cagataaaaattaatttagaaacataatggtaattatagaaaattaacaa taattaaaagttattaggantaaaacaaattatgaaagantaaagttaag 
agaagtcagtgctagagctggatgcagatgctccaaaattgtcaagaaac tcccgagaaagacccgntgaaggagcaaaccatgaagaaaactggggcat cagtggattttgagctggctcgtgccgaattcctgcagcccgggggntcc nctaggttctagagcggccgccaccgcggtgggagctccagcttttgttc cctttagtgagggntaatttcgagctttggcgtaatcatgggtcatagct gttttcc >CEESB38R attcactcctttttgcttatacccccttcttgtgagtacatcccaccctg tagatgtgctccttgcttgataaaccaggtccgcagtccgatttaggggc tcgtgttctggaacagttaaggaaaccatgctcttgttgttgttgttagc tgttttcgctgctactagcactccttttttgtcctgagttgatcgtgttt gaagcggatttccgatctaaatttttataaattaaaaagaacctttttcc aacaaaaaaaatccaaaggaaaagagtttggaaattcttnggactctttc ttcggacttttaaactccaattttttcactcgactttcttaggaataatt tattctaggaaaaaagtacggtttttcccaacttttccc >CEESB39FB atctggacatcttttatttcttgttatatatacaataagacaagacaata caagactactgtgacagatcaatgggaatcgaggacaagcaagcacgacc aattcaaattatgtacaattcctttattattaaacaataaattattcgaa ggaagagaaaacattaaagtacttgtggtggctaacctctacggtatcct gggcatcagaagagtgagccttaaacggntcagctacatcagaagctttt tccttaacggagtccataggctccagaaattttatcaccgattggtctca gtgatagatttggacttgtccttcgccgagtnagcaagcgtcttccagca gaattnccttgggcatcggagggccttggtcccttgggtgggtattccca aggcattcagaangccttgtcccttgggttgag >CEESB39R caaccgggacagcaaccattgagatcgcttagcttgacgtctgcggccaa gtcacctgaggaggttgagccggagcaggattcgaagaaaggagagccac gtgccaaggctgcaggattcggaggaggtggaggtggtggaaagagactc ccagaggcaagatctncgccaatggtcattccgaaccaggtagcagcgat gccggttcagatgactggcttcgtncaacttgtngncaacancattctca gcgncattttcccataccaactctgttnatggggttgtccaatgnatcaa ancccttcgagatctttta >CEESB40F gcaatagaataaatttataagcaataagcagagcaagacatgaacagcaa atgacaaccggcaacattctttaaataatttttacagagagaaaatacaa tataancagacattttcctttagattttacgtttagtagcagtgaaacgt tcttgataagcatcatatcctggtaaatcttgaaaatctttcaattggaa tcgatttccgtcgtatgacatcacggaaattgttgaagattctngtaaaa tcgcgagtgaactccgacttctccaaacagttttcatcgagacgaagagt cctcagtcgtgtagttttggcaagatttgntggatttanggagctcagtc gattttaattcagacttnaactcgataactttcaaat >CEESB41F aaatatgatttgatttattatagttatatttgtgatgaaaaaagacatca tggtgagatgagattgataataaatatacgaaaaagttacaagcgaaaaa 
ncgaaatgtnctgtagaagttgattaattagatcatgaaagtnccaatga gagagtgttagggacatgaataacgggtaaaatgctgtttaanatcaatt atagtaagttttttgataaagagtagaaatatataatgtaattccncaaa atgaaaagaaaaganaaaccacaacanctcattaaatanttgcaaacgac gactcatctacattgtccacaattgcgggattttcgattggaatttggtt tttccgacg >CEESB42R cggcacgagatgatgggacctataaaagaacagttggttcattggactcg gctatcgcaggcagattagaagctgaaggatctttgaatctggaaactct tgttcttcctccaaccaagcctcccggtgatgatgttacaactgggtctc gagggtttntcactccaacacattnaacgnctgcactcgatatgtcattt ttcactccacctcgncaaatgcgaactttggctgatgctgttcgtgaagn tgcacctgttggaagtgatccccgaatttttttgaacctgat >CEESB43F aagatactcacattttatactcaaaaaaagggtggaaatgtgactataag anggtgataaaagaagggtgggtaacagggaaagaaagancacaacacga ggaaaaggagaacatggaatagatggntggngatgaataataaagggggg aaatcngtgtgtacactaantattttncaatattatttatcaacnctgat aagttacaa >CEESB44F gtaagtaattagatttattatttgaagattattagaatatttagaactat tattaaatctgaatgttgttagtaatagtaacgggctgatcgancagcgc gttgagtagctctgaatggaagacgagagacagttggtccctggtatact gtaggagaagatccataggntcccatggccagtggaatcaccattgggta tccagaagtttttgggtacattggatatccattgagtagtaatggttgac cacctcccatcatcatatcagatgcagcgaaacgaggcttatacatgtta gatccatacgtatccattcccatcatactgctaggaaagtcccatacggg catcttgttgcaattnccttngcatngttatgggatactctgaaatt >CEESB47F aacctaccacaacttcattagagcacgagaaaattacgagagacaagttg tgcggaatgggatggtggtggaacttgaagtttaaataaataaatgtttg gttggataacgggtagattaaaaatgagcagaacatttgaaacacaaata cgggggaaaacgggatgcgtatatatttaattagaccctggaagatgttg agctttgtggagtaccagatggagttctgcggcttgaagggccaacgaat cggtttgagctcactgaagtagctgcagatgctcttgnattganttgatg atctggatccaatggaacttccagctnggcgtagttctggccgatgttgc tctgctccgaattcgcgctcanttgcaatcaacggggggacagattggct ntcggagcataacttccctgg >CEESB49F gaaaattaaaaaattattatgcacaaagaatatacaaaatgcttaattgg aaaattagatcaattgaaattncagcaaaaaatacagaaaaaaaaatgca atggtttcagtaacaatatctacatatgcncacacggnttcantagaaat tttaaaaaaagatataaatctacaagccagtnctctccataatagtttgc aagtctctctgaagaattatatttttngaaagtgtctcttcaagcattct ttgcagctttttgttcatttcacggacatctgaatctcgggaatcttggc 
cangtgtattgagcagttttcggaatgacgcttcangtttcggttgacca gangaaggagaagaagnaccccaggattattnctgttggttgaaatccgg tgcccaggagggtttttcgcggg >CEESB50F atcaaagcgcgcttaaatgcgaactccaaattttattcgccaaaaatgct tgcctcagagcgctgtgtgattagtgaaataaaataactaaattatatga ttattataatgtgtataaaataccaacaagttcaacaaaaaagtgatcaa aaaatgagggcagatgagaaaaggaaaacaaagaaaatcaacaattggta aaaaaaaaggntgaaaacaattggaacatacagttttttagagaagaaac aattnttcgaatttngttctcttattatnctgtcctccgaaacttccacc atcgtatgancgtttgaggnctccacgtccgcgncctccgaagtctcctc cacggcca >CEESB51F aaaactctgaattgatttnttgaaagctgaaagactttggatttgtgtgc accgagagaaaagaaaactgaatacaaaaatatacacatagagatgaaaa gatagagaaaaatttnatgttttgattaactcctaaaantttnccaaaaa ancgggaaagagtgaattatggaaaggagaaaaaatgatagangataaca aagggacactgggagaaactttgttttcagaaagngaagaagacccgtag ttttancttgagtaataantancgttnaagcgtattancggcgccaccat tagaataagtcgcttcgatgctgaaacaggctgctctt >CEESB51R cgatttcgaatcatcttcgcaagacgctctggcgacaccaaccaaaaagt tctcttcccaatgggaaaaagacgtcgatgacgttgaaggaaccgccaat gagcttgttcgtattgacgaacgtatatcggatattacagcacaagccga tgttattcaagacaagatccgtgaaacagaagttggaagttcagaagaag aaatgttgactgcatcatatcttgagttgacaaatgaacggaacactctt gtacatcgacaggaatactataatatcattgagacaattcgtcaggttac ttcggaattgaccattggggaaacaatcatgaagt >CEESB52F gagcacagcactgacgagatgaagaaacttgttgaaagtttgagtgaggc gtgcaaaaaagcagccgatgagttcgacagtaacgagaaaaatggtgatg ccggtgcagcggaaagtgaaaagaaggacatcgaaagaaaattcaaattt catacatgtgacgttaatctgaagcaaatcgaacgaagtcatgctgagct gaaaccattacacgaaatactcaagtcagaagaaacgaaaacttcattca aaccaccagcaaatgctaaattacaaaanggttgggatgttgattggagt cgacctgatgactcggcattgctcctgggtgtctggaagtacggttacgg tagttgggaagcgataaaaaatggatcctactctttggattggcctcgtg ccgaattccctgcagcccggggggatcc >CEESB53R gctgtacagtctaaatccaacccgtggtgtgcgtttccagactaatggca agtttgtcatgccagccagagtaaagtcggtgacgattatcaactacgac aaggaatttaatagaaacgtcgntatgtttnccgaagggcttgccaagca ttgctccgaacaaggaatgaagtttgatagccgcccgaacagctggaaaa aagttaatctcggctcatcagaccgacgaggtacaaaagtggagattaga agaagccatttcgaaacggcgtttacggtttgttttttttggaatttatt t >CEESB54F 
acgaaggacttcatcgtgatttcgcctgtnttctctactcaaagcttcaa aagaaactcactcaacaacgcatctacgatatcatcaaggacgctgtggc catcgaacaagaattcctgactgaggcacttccagttgacatgattggca tganctgtcgtcttatgtcacagtacatcgagtttntcgccgatcatttg ctcgtcgagctcggttgtgacaagctttacaagtcgaagaatccattcga cttcatggagaatatctcgatcgacggaaagactaacttcttcgagaagc gggtttccgagtntcaacgtcctggagtaatggtgaatnnaagncgcgag acagtttcgatctttaaggctgancttctaaaggaaaatatttccaaaat ttctaatttctaaaccc >CEESB55F agaatttacccaaaatttattgatacaagtattattaaaatttggnggca aaatagaatcacgngaatgaaaaattgtgtcagagtacagtcaatgcaca gtcaattatacagaaaaggtaaaaatttgaggcgaccnattcagaaatct tcatcatcctcaaaatcgatatcaatagcattaacagagttctgaagctc gtcgagcccggtgacttcttcgagacgaccnggcgagttcataacgtcgn gaacaaacttccagccattttnctgaaacttgtgcagcgataggngctcc ncaagcaacacattgntccgantcgtttttacagaaaggg >CEESB56F gtttttataaaatattattcatcaacaccctcaaataaattaaactgttg cgatgaagtggaccagccatcgattgcnctccgactagttcacagtggtg gtttcggagagtttgaccaaaaaagacggccaaatatcacataaattagg acagggctcgggctagaacgagcagccaacgccgtcgggtagcataggaa acgacacccggcaacgntcacaactaagcgnccagtcaccaagcttgtnt ccaagcaacatcaactgtntccggttccactccagcgatttcacgttctc ttgaactctctcttcaaagtcctttgcaactttccctcacgggacttgtt cgctatcgcantcgagtcgattatttagcctnagatgaagtttatcaccc actttaggg >CEESB57F atatatatttcatttatttagatatattatggttatttacgggacgtcat ttaaagaacaacatttaaaagttaaagaaaaccaaaaaaaagaagaaaaa aatagatcattaattgtagagggagagatttgtttttcctattccttgga ttcttccacaacttctgcgtctttatcctcagactcatcaataacaattg gntctncttctgtagatggctcagcagcctgctcagcactcgctggaact tcttnagaaggttcttcagntgattttncagcggatttctcggatacttt gtcanccttctnatcggttttctccntttcg >CEESB58F aatgtgaacaattttttaaatgaaaaccaacaacaataacaatagagaaa tcataacaacgaaaaaacaaatcgngtaatttatagaatggtcttgttca aattgctgtatcctacaccaacagcggtaaccatagctttatcggcactt cctgttttgagatcagcttcctcttccttcaatgtttttttaaagtcttc cattttaaactcgatctcaaaaaactgtttgagatgcctgagtgcgtgta cactatacattggaagaggaccgtagaggaagtgtgaaacatctttagga cagagtgtcataaatgtgcttgcaagtatttgagcactactgtcaagggc acctcccatgtaaatttgttgaaggaggagcatgaccggcttcaattcca atgtcttctggga 
>CEESB60R gctcaagctcctcgacgagttcctcattgtcaaggctggagctgctgagt cgaaggtcttctatctcaagntgaaaggagattactacagntacctcgcc gaggtcgcttnagaggatcgcgctgccgttgttgagaagtcccagaaggc ttaccaggaggctcttnatatcgctaaggacaagatgcagccaacccacc caattcgccttggacttgccctcaanttctctgtcttctactacggggat ctttgaacacttcag >CEESB61F ttcatcttgaaaattttttttaaaaatgcacaaaaatttgantttttggc aaaatttgtgttttcacatataaataaaataattccgaaaatcgtaataa aatgcaacaaaagttattgaataagagattaaaagcaggaggcacaacat agtagtcatgaatcctctcaacacgtgcataatcagtagaaaaagaagaa gaagaagacgtgaaaagagtatatgtatgtnggagagacgcagagaagca cacaaaacgaattggaattatgatgatgatgaagaaggaagcgacgatga ttcatttgggaagagtgtagagcaatcttattgagcgcttggcggatgtt caacttggttcgattgcagcaaatcgttgatattcttctccaaatcatcg attctcgttgtcatatcatccgattcgccgaataatctgatnccgccatg tgcttggaatcgatcccgtgtctgt >CEESB62F tccgggtagagcttgtttattcacaggtgtacaataagaataacggctaa aataaatagcaaaaaaaatggttctgtgtgctttttgggaacaaattgag attatgagatgattttttgtagatttttgtgtgatttatgaacagaaaat gtaaatttgaaaatcgctattactggttacgggaaacaacgggaaaaatt ctaagaagaatgatggagtctcgggatatgaagagaaaatattacacaat aaattattaggaaacatgtttcataaacatcttgatctataagtggccnc nttggaaggacattttggttgggaggaaacgnatcggaattggtttgaca agaacccgtaaaagtgcaccancaatctgaggtg >CEESB64F ggggttttttgtaagattatttgaagaaagtacaataggaaatgggaagg taaaaaaattggatgagaaattatgaaatgcagaatgaatactgctttca gtaccaaaaagtatagccaacaatttttncnctgaatatcagagaatatt acgaccttggcatgcaagatgaagttagagagcttagtttttagcagttg gagcagaagtagaagcagcagcactctgagcattggtgagcactccaaag gcttccncgaccttncttctgaacaactcagcgtcttggagcatcatgat caactcggcgttgtcaatctccagcatcattccagtgattttcccggcct cccttgtaggttccggggaacatcttctcggtaangagcataggata >CEESB64R ctcgtncaatctcggcagntgtncatttaagancttggntcgtgtcattg acaacaagtccgtctacgacactttctcgctttttggaaacattttgtct tgcaaagttgtcaccgacgatgaaggaaacagcaaaggatacggatttnt tcacttcgagactnagcactctgcgcaaactgccattnagaaagtcaatg gaatgcttctttctgataagaaagtctatgtcggaaaattccaacctcgc g >CEESB65F aaaanttttggtaaaaatttattagatgacccagtatgttttgacacgaa tgcaatgagagaagngacttgcattgcttacacaacacaaggggaaatac 
aaaaaagcaagngatagcaaaataagtagcacagggcagantaccntctt aactgacaacatcgtaaacaaaaccgntaatcgccttattagtaaagatg agatgatatgctgccaagagccgttttgagaaagggggaaagagangtaa aaatgaataagttaacggtgtttgacatttaacaacaggtccntggaacc ccncntgcgttgaact >CEESB66F gaatgntttgtaaaaactttatcatgttttaaaaataagancatctgaaa ttggaattgaataaaaatacaactaacttataggaagactnctgattatg aaacgaaaaattctacacaagaaagatagcagagggagcagagcacaggn ncttgtncattttattaatgagcatttaaaagtgaaggaagtgggancat ggagcaaaggtaagaaacatttggcaggagtatttcnnttttaaaatgta agtaaacgtcngggnaaaatgagc >CEESB67R gttgattcggaaatcagtgaggaggaagaagaagaagaaatatataataa acaaacaaaaactaatgcctcactcgactccattcatagaaggaagtcta aaccccaaccaatgctggagttcgacgcggagacgcaaaaaatgtttgat gatgcgtttcaaagtgacaaaaaatctacaaaagaaaagtatccgttcta atttctgaaaaaacatcagcacactgttctattggttccactttcctttt tattccatctttattgtggttaattatcccatctacttctctatatttcc ccttgataattaaaattggttttaatggttt >CEESB68F aattactattttacatttttattcttaacacgcatgactgcctgaaaatc tcagtttcaacaatggaaaacatattttacggttacaaaaacaaataaat gttatagagcnctattctaatttnncatttttaaacattttaccngcaac acaattaaaaaagtgggtatcaacagttagttggctaccncagangtatc acacaaggntcgggacg >CEESB69F aaatatacaacctttattgagaagagaccatttatatacttgtaagcttc taggaaaatttnagatactaaagagaagcatagattttaagacaagcagt taactaggtgaaagtaggatgagacagcttaggccttggtggcgatgtac gagangagatcaacaactctnttggagtatccganctcgttatcgtacca cgagacgagcttgacgaagtgtgggttgagtgagatggatgctccggcat cgaagatggaagagttggtatcggtcgacaaagtcggtggagacaacttg atccncagtgtaagcnaggaattncctcccattcggtcccntcagngggc agnctttcattaactttctttg >CEESB70R aaacaactcgaaattnaggctagctctctccgccgtgtggctcttgttgg agttgccgtctccttcaccgccacattggtgtgcgtcattgccgccccaa tgctctacaactacatgcaacacatgcaatccgttatgcaatccgaagtt gatttctgccgttcccgatctggaaanatctggagagaggtttcccgcac tcaagttctcgccaaggtttctnggaggagcccttcgttcccgncgtcaa gctnggatacggagagncgcccgggagttttgaaaggntctttcaagttt gggacaaccaaggg >CEESB71R acgctttcaatcagaggtaacaacatcagatacatcattttnccggatcc actcgctcttgacacccttttaatcgacgatgaaccaagaaagaaggccc gtgccgctcgcgccggagcttcacgtggacgtggtcgtggtggaatgcgc 
ggaggaagaggaggtcgtggtcgcggacgcggaggaccacgcggaggcgg tccacgtcgttaagctatcatcgggtcatagcaaatttgagtatcgaacg tcctatacttttgtatcacgttcctcaagtttaattcacattttgttncc cgttaaaagtttgctgacttttgttttaaanctttttgaaattaatttan ttacggg >CEESB73F aaacttcaatttttatnctaattaatcgtcaatattcaaatncgacgaaa attttcagattaccaaatatttggaaatttggagaggattttntgtggga gagggatggggatagagtataaganttncgagacgcagggtgaatatttc ctttaaaacaagaaatgggggaaaaaaaggataatgtaaaantaacatgg atttncaggtagtnctagatgggggtgggtttaaggcagatttcacggta gcncagggtttgtccggntatttnagaggaggttcttgagaatgagaata tc >CEESB74R caaaatatagaagtcactgcatctnattctgcagcagaacatgaagtgtt tgaaggaatatcatcaaatattgctgggaaaggagaaaagttagaagaag aaatagataacattggtattgtaatgcaaccagagccacgagttgtccat gaagcttccgaagtatcagacaacattgaacttaatatcaaagacgacct aaacttgaaaagtagactggacaacttcacaagagctaaattcaggcaat caaccaccgtaacacctaatattgttgctgtggagccttctattganggt gttgaagacggttttagatcat >CEESB75F gtatcgtctcatcaactttatttttaggcatgtacaatcacgtaaaggac acactgatgtncattggaggaaatgtgtgagaatctcacactgcataatt tttnccggtcggtgatttataggtataaatatagantggcggttaaaatt caaaagatgcatttnaattaggtgaaaaaaggaagaagtttttgggctgc cagagatgatgtaaaaaatagacagagaccatatcaggacaatgtgtgaa gtgtgaaggggaaggaagcgacatgtcgtttagaaatgtaatagagaaat aggcaaactgaagggtaatttantcctcgcaccagggcattctctttgcg g >CEESB76F gacgctcttcagtaatttattcaggattctcagaccgccagatgaaataa acgaagaacgaatgtntggtggtttggaagaaatcctttttcaatttctc gagaaaaatcagaggaagggaggagggagatttggagacaaaatagttta aaggggatgagagagaattgaggggatcaatctacaccgttcgatttact tttgaattataactcgtgccgaattcctgcagcccgggggatccactagt tctagagcggccgccaccgcggtggagctccagcttttntnccctttagt gagggttaatttcgagcttggcgtaatcatggtcatagctgtttcctgtg tgaaattgttatccgctcacaattccacacaacatacgagccggaagcat aaagtgtaaaagcctgggg >CEESB77F caagaaaacgtttnttttatctatcaagtgtagcagaaggaaaaaagaaa cagagtatacgngtagtacaacaataaaacggaaaacagtaacaatagaa attgaaaacaaaaantttggcaatttagtcggtatggaagtnagtgctca tctncggtgtgcttctttcttttncttggaattnctncttcatctgcctc cccaccttcagctccttcatttnccggttgtngctgctcgcggagtcgct tcaggncgggagnanaaagttaagcccggggggcccaangggggggaacn 
nnaannannggg >CEESB78F attaaaattttacttccatttcagaaagaaagaaaccaagacgattaaaa agcactgtncaggaaagaagagccacatgagatagaagggacaaattttt naaaggttcgcaagtnttatggaaggatgtatggggacagaggtacagaa cagttccaccaatttgaactagccaaatttcaagtagggggcataacana tgtgattcgattgaggggcaagatacgnttgcaaaanacatcnagnaaga aagcaacntgggaagtgngaaag >CEESB79FB ggantaaaatactttcatcgaaaatataattgaaatgaaaagtaatctta aagaggntaatttacattgcaaaatacaaaacagtcgaacgagcacctag aacgaaaatggtccnggnaaagctcaagctcttctcggaaggancatcaa ggcattttggtaataaattgtggaaaagccatagcacatacactttggag cnaggagaaattgtgggaaaagagattttaataattctagaaaaaaagtt tacactcgagaaaaggcaatgaaaaaaggttttaaaagnagaaacaagct gggattatggtagggtaaatttacaaaactcgcaaataaagc >CEESB79R caaatnccgcttttaagtcataccgtnatttatacttcgagggaggcgtc tcatcagtttacttctgggacttggataatggaggtttcgccggaattnt nctcatcaagaaagagggagacggagccaaaaatataacaggatgttggg attctnttcacgttattgagatcacggaaagagcacgccaggctcactac aagctcacctccactatcatgctgtggctgcagacaaacaaaagctccag cggtgtgatgaacctcggtggctccctttacaagacagcatgagatgggt gcaccaatcaacggtcagaacac >CEESB80F agatcacaacgtatttattacttctcctncttttcaagaaaaatagtcca gacaagcctaatgagagcctgaaagcctggaaattttgtcttgatagtga tctattcgatttcagtgaaaaaaatcgcaatccgtacagatggtagaggt ggaaatncgcaaaaaacgccaaaatctcagtggaaagttggcaaatttnc gggaaatcgcaaatttcgcgagagaaatttgntactttnccccnaaaata gccnagnaacagggnaaaagatgg >CEESB80R ggaaaaacgggttttctttcggcgaaaattttgaattatttagtgaaaaa atagccaaaattctcaaattttgcacggttttttcaatttttttgcaaat tttggtgcattttccgtaaaaatcgacaaaaaattgcgaaacacattttt ccgagttttttttttgccgatttcgtgcaaaaaacgtgggaaaatctgca aaaatgcagaaaacaggnaaaaattgattttctgccactattttgcaatt ttcggtcgattttagtgcatttttagccgattttgactgaaaatttgcaa aanattgattgaaattaaattttatcggttttttcccgatattttgg >CEESB81F cttaaatcgtttattattaaaattaaaaattgttataataacgnaaagcc attaaatgaaataaatattaaataaagatagaccagaaccacaaatgatt gtaccaccgtatcaggaaaagagcactcaagaaaaaagtgaatttttggt tgaaatataattttaaacaacaaaaaaagaaacatttttgaaatgtaata ntaaattatttagaaactttccaaacangtttctggcatctgatgtattc tgcgagatgaagtgtcagttggtncganttcatacgttttgcattggaaa >CEESB82F 
aatgtgtaaatgtgttcatgtncttcatgaataaaaaatagagtgataaa cgattacatgagatgacagagtgntaacaggaaaaatgtgggtatttttn aaaaccgtaaaagtctaagggtcaagaaaatgaaanttgaaaatcctatc tgtncacgngtgcaaaaatatgtnccacg >CEESB84F aaaaatatgatttactaaatgattagaaagcttgcaataccgaaatacaa aatattggcgaaaatggaaatcccgagcgatcggtactttcaagaaaagg aataaattaagangaaatacataaagtcatcacaatagaagagaactaga ctgaaatatgaaaagaaatagagacaggaagcaagantttagaagaaaat aagatgaattttaaaaatgcgagangaagaaaattcagattctggtcacc gaaaggnaaatggaacanttttagagaaaggagatggctgggggatgagg ggaaactctatgcacaaacacaagaagaaaaaagcaccaacacgncacaa tattcaaatagaagtatatatctncttaggaaattaaat >CEESB84R ggcacgagctcattcgaaaaaatccttgctgaagagcgtgaagctgagga gaatctctaagatcacctcggccacttcaaacagtgtgacatcgacgttc gacaaatctttaattatttatttctagtagatatatacttctatttgaat attgtgtcgtgttgtgcttttttcttcttgggtttgtgcatagagtttcc cctcatcccccagccatctcctttctctaaaattgttccattttcctttc ggtgnccagaatctgaattttcttcttctcgcatttttaaaatttcatc >CEESB85F ttttttaaatatgtatattcattttcaataaagcccatttaatgagaacg caaaagtacaagaaatacagaagtcagtgcaatgagatcgcatccacgtt gagaacgtcacttagttggtcgaaatcgctgagatccttggcgcactcag cacatcctttagtccattgcatgcagtacacaggatgatagttcagatgc acaatctgaccatcggcttttcctacccaaactaaaactcgacagccttc cgtctcacaatatctcccgtatctcgcgnatttnttgaacgccttcgtgt cgaaacgncttttactgttcaaggggaacggttcggctttctgggaactc aaatcgcagtagagtntgctcggttttgaagttttggcgaccccactttg cc >CEESB86F catcacttcttcttgctgagccttaacggcatcttcaggacgagaacgga gtgcagaagtcaaaacgctgggagattcacactcgatttgcacaatagta gctgctgatttggcttgttgtcggattaaatcaacttctccttcagtatg agcaacttcgcacagagcttccaaatacgtcatcgcctgtncagtttggt tgcagaaacgcgtgagtttcgaaatttgagcgttgatttctcgaatcttc ttgtcaaattctacagaatttcggaggaactcatcaatttccataaccgc atcggccttttctttacacatttgctggcatcttcccattcgagtactac aaagattcgctaaaacgacccccttcggcttccatttttcccctcgtggc cggaa >CEESB87F aagacataatagtgctttgttataagcaattcatcgaaaatttagtgctc ggcaacagcttttcctttctcctcgtacaatttcgccaattccttcggac aagtatgcatgtatcccttgttagcagcgcaaatctccttcattctagtg tacggntcgtagtttgcgaaaaattcttcatatttgttgtgacgtggcca 
gacgtaaaatgcgttgaaggcggcagtcgagacgacagcgacagccagtg aaacgnnaactcgtgccgaattcctgcagcccgggggatccactagttct agagcggccgccaccgcggtggagctccagcttttgttccctttagtgag ggttaattttcgagctttggcgtaatcatg >CEESB88F atattagtaagatcatcaataaacaaacacaaataaaaataatcacacta catcaacaaaatgtcaatataaaatagcaaattccaaccccagatgatga ttataattaaaatgattttttagaagacgtaaaaaattaaatgctaagan caaaccaccacacaaggcatganttccgtgaaatcccgtagataaataaa aaatccttccaaatactccgtctgcaatagaaaatctagcttctatatac tctattaattganttcctgtaaaataagcngccaataaacatgttaaaat tntactattagtacatcttttatttctaggtnatctgtggatgtgctcaa gtttactgtaacaccactcctttaattaaa >CEESB89F actcaaaaattgttcattcgaatcctataaaacggcgggaaaaagtgatt ggagtgtgatgaaataacggaaaaaacagaaaaacaataaaaattactag ctattcaaaaaaaaactacaaaaaaccggaaaaacattaaaaaaccagag gaaaattaaaagaaaattatttagagacgactccaacttgagcaggccga attggacgttctttcaggctgtagccgatctttgtacacacttcgatatg tccgacaggttgtttggcattagcagatggantctggaaaacggcttcgt ggaggattngggtcgaacttttccattggttggatccacggtcacaaagg nccatgctttgcgaatggtcttttgcccatgaccagtacggggtcatttg aaaactccttcc >CEESB91F caaagaaataaaatttattttaggcactgttgagcaaccngagttgtgga ataaaaataaaaattggaaaattaaaattncaaaaaaaaaatcgaaattt ttttaatttgcaaaaaacccgaaagtggngaaaagaaatgngaaattnta actggaatggttttttncgttgaattgttgactaggatgacacgtggata cacatatcagangctgataaggttaacggancaggtgaagacttntggag accncggcggtgagccatttcgggt >CEESB92F cgaaaaatttcaaatttatatgatgaacttgtttgggtgtgaaaagaata ggaaggaaagcagggaaatgggatggagancaaacaaaaaagtagttttt tttgaagaataagaagnaacatttggagaagaagttgaagcaaattatgc acaggtatcatgtaatttncgnaagnaaaaaaaacacggngnaaaatgat ttagagacgntcccaagagatttcagcctct >CEESB94F gagagagaacgtcaagcttcaaagcttccgggcgtaaaatctccgataca aaggcatgagggtgacagtaccatagcttccattgacactccgaagttga tgaggttggcaaaggcatggggctttaatactaaatattccacatcacca atgtgttctccctctggataatttggaatgactggatgtgcatcaccatc accaccaccgnctgctcgtttgacacgtttctgaggtttgttccagaacg gccactgcggccgtccttccgttgccagttttcgaattgcatcttctatc ctagtatctctttcttctctggtcatattggcgatgggcaagtagcttgt tggggtatcagcatagacaagctttgactt >CEESC01F 
gattataatccttcactggaaataaattcttccaattgataatgactgag gaagtgagcccaatcgacgtcttttgctatttgcagtttggaaacattac tctgagtgcagaatgcatcggttttgtagtgacaagttttatggcatacc attttgcagtctctgcactgataagcttgcttggaaaaactgcttcgaat cctttgctggcacacattgcacgtggcaccgcctttnactttgacggcaa caaatgtatgancgttgtagatgtgaagtttctttcctttcctcattaat cgagatgcagctggctcattgagcattgctggagggatgaggaagtattt tt >CEESC02F agacgaaggacttcatcgtgatttcgcctgtnttctctactcaaagcttc aaaagaaactcactcaacaacgcatctacgatatcatcaaggacgctgtg gccatcgaacaagaattcctgactgaggcacttccagttgacatgattgg catgaactgtcgtcttatgtcacagtacatcgagtttgtcgccgatcatt tgctcgtcgagctcggttgtgacaagccttacaagtcgaagaatccattc gacttcatggagaatatctcgatcgacgggaaagactaacttcttcgaga agcgggtttccgagtatcaacgtcctgggagtgatggtgaatgaagccga gagacagttcgatcttgaggctgacttctaaag >CEESC02R gcacgagaaaatatttccaaatttctttagaagtcagcctcaagatcgaa ctgtctctcggcttcattcaccatcactccaggacgttgatactcggaaa cccgcttctcgaagaagttagtctttccgtcgatcgagatattctccatg aagtcgaatggattcttcgacttgtaaggcttgtcacaaccgagctcgac gagcaaatgatcggcgacaaactcgatgtactgtgacataagacgacagt tcatgccaatcatgtcaactggaagtgcctcagtcaggaattct >CEESC03F aattcaaaaatttatacagaaaacagaatgcaaagaaatctgtacgtgag cttttcataaaagcgcattcaacaacaataagttctacagatataaataa atatcgaaatctcttgaggggttggaaagggagaaaatgaaatgagggga tattgtaattacacgtcattgatttggcggaggggtttcatttgaaagga cattattaaagctctaattaaaagttttnctttaaaaaaaagtgatgatg agctgcagaaaaagggacttcccgtgagttttcagatgtcaaaaagttaa ggtcagaggagttcagaaaaatgcaattgggagggcccgaagtgagatgc atttttcactagggagtttcagggaaattacgg >CEESC04F gatcatcgcctggttgaagaagaagaccggaccagtcgccaagccactcg ctgacgccgatgccgttaaggagcttcaagagtctgccgatgttgttgtc attgaaggattcacaaagttcctcgagaccaacggaaaggagggagctgg agcttccgaggaggagaaggccgaggaggaggctgatgaggagggacaca ccgagctctaaatccacattccaatacagttcaacgcatcggggttccat ggacctgttgttaaatgtcaaacatcgttaacttattcatttttacttct ctttccccctttctcaaaacggctcttggcagcatatcatctcatcttta ctaataaggcgattatcggttttggtttacggatgttgtcagtttaagga tgggtattcttgccctgtggct >CEESC05F aacttgcttcttgtatattcagagtccgaagatgatccaggaactctgaa 
gatcacagatttcggattggcaactaaataccgaaaggatggagaggaga tcatgttgagcgaggattgcgggtctaagccctatgcagcgccagaagtc tgcacagggaacgattatcgggggccacccgtcgatatctggtcggctgg agtcgttttgatgacgatgcttgttggggagcactcttggaaagttgcaa ataaagaaaaagacgcggcctacagcaactggntcaatgcaaaagacgaa aaggcgaacctgtggaatgtgatctccggaccaacgacggcgcttcttcg caaacttctccatgcgaaccnccgaaaaaaggggcaacaatggcg >CEESC05R nngaaaaaagggcaacaatggcgaaaattgtaccggaaccatggntccgc ttcaattttttngccattgttgcccttttttcggggttcgcatggagaag tttgcgaagangcgccgtcgttggtccggagatcacattccacaggttcg ccttttcgtcttttgcattgatccagttgctgtagg >CEESC06F atcaatagctgtttattgataacatagtgaacagtctgaacagtttctgg ggagagatatttcacgaaaacataaatttttaagggaaaaatggggagaa aatgtgtagaaaaaataggaacgacacaatgcagagatcaacgntcatga gatgaaaacatacaaatagatggaccaaaatagctgaaaatttaaaaaaa agaggnaaaattacataattgcgcaactatttctgattgattcaattatt gaagacttttgatttttaatagccggtggtgctgattcggtctcccacca cggacgtggacgctctccttttgacttttnatatcaatgatgtcggtcct catcatcgnccctcangtggacccatgttggaatggctccagaggaacgc ttgttggg >CEESC08F aatatgaaacagattttattttagtttcaaatgcaatatattgcaattac aaccacaaaaggggaaaggaaccgtaaagtgttcgagaagtactgagact gagaagtggggggagaaacaacattaaataaatagaaaacaacacaagtt atcttatcttatcacaatatcatcgagtgcataaagctaatggaatgggg agatgttttacattgtttagagctcagagtgctcctcctcaatcgttgtt tctncagcggcttctggctctggttcagcctcttcaatatgctgttcggg ttcaacttngagcatcttggggnnacatcgagccgattgggcgaagg >CEESC09F agtcgctgttcttttttattcagaaaaaaaaagcttcgaacaaaaatggc aaacaaaagctggatgggatagataacaacaacaacaaaaagaaactaac aacaagagtaaatatgagaataataaaaaaatatggaaaaagagaactgg ttgataaaaaacaagatttgaaaaataaaatcaacaatttaaatctaaag ggttctagttttagaatgtattctggcgttttccaacgtttacgtttcca ggaagttttttcagtccttttgtgaccagagcattcgggcgtcggctcac ttctttncgttcagctgcatgagctgcttgtttcgtctttgccagcttct gcacgaaggatccttatcacggtttcgaaatcgtga >CEESC10F gaagttggtgttacaactttattatgtgtattcaaaagcttgggggtgtt tcgaatgtcgagatttgaaagggggaaaagccgtaacaaaacggaacatt gaattgtatgggtagagacgggaagttttgatgagtgggctcagcacaga aatttgaataactnctgcatttggttgatgggaaaaggggagtgattgat 
gaatttatagaaaaaaatggaaagaaaaacgatggtttaactagtcaatg gtaataaataaatcatatgacaaatagtattatttatcattttcaagatc naggtgaggggtgtgtggcggtaacgtgtcggtntgaaggagtttaggtt g >CEESC11F ccctgcgaagggactttgcaactnattctttaccagttgttcaaccatct attgagacgcaattcaatatgcgaaaactatatgggagcactaaagtgcc tttntagccttttgttgaccacaccatcaccgcgaacacgcaggagggac gtattcatacacggtggactctctttcgacattttccaataatcttttgc aagaaggttgtgatgttcgattgtcgaagttcatcactgctgtgtcagct tggcgaaatgtgcgatgagagaattgaaggccccaaatgaatgaccattc gttgttctacatttgagtttaccaccatttcaacaatggcggatgtgagg tccgttttcgctcgtctttgcggattcccggagcttttgaaatccgtcct ccgtaggcccatgcaactgccgtccctctttaaatgacagtaaacttag 1.fa100644000766000024 16645214006427353 16016 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data/dbfa>AW057118 gcaatgtcacgttgtcaactcattgcaaacaaagactggaccaacattga acggagtgattggaagacaatccggacaagttgctggatttgactactcg gctgccaacaagaataaaggagttgtatgggacagacaaacacttttcga ctatttggctgatccgaaaaagtacatccccggaactaagatggtcttcg ctggtttgaagaaagctgaccaaacgagctgatctcatcaaatttattga agtggaagctgccaagaaaccatcggcataagcctctactaaataagaa >AW057119 test description tcatgttggcttctcggggtttttatggattaatacattttccaaacgat tctttgcgccttctgtggtgccgccttctccgaaggaactgacgaaaaat gacgtggatttgctgacaaatccaggcgaggaatatttggacggattgat gaaatggcacggcgacgagcgacccgtgttcaaaagagaggacatttatc gttggtcggatagttttccagaatatcggctaagaatgatttgtctgaaa gacacgacaagggtcattgcagtcggtcaatattgttactttgatgctct gaaagaaaggagagcagccattgttcttcttaggattgggatggacggat cctgaatatcgtaatcgggcagttatggagcttcaagcttcgatggcgct ggaggagagggatcggtatccgactgccaacgcggcatcgcatccaaata agttcatgaaacgattttggcacatattcaacggcctcaaagagcacgag gacaaaggtcacaaggctgccgctgtttcatacaagagcttctacgacct canagacatgatcattcctgaaaatctggatgtcagtggtattactgtaa atgatgcacgaaaggtgccacaaagagatataatcaactacgatcaaaca tttcatccatatcatcgagaaatggttataatttctcacatgtatgacaa tgatgggtttggaaaagtgcgtatgatgaggatggaaatgtacttggaat tgtctagcgatgtctttanaccaacaagactgcacattagtcaattatgc agatagcc >AW057120 aatctgtacatcttcaattgtggttcacttcttctatcgtcttgttcgag 
aaaaccacggagaaaaggagcaagaccgtggattgaaagacaccaaagaa accgccaaggatgtgctgggttttgtaaaaatgcttggaataatcctagc tatggttgtaggctttgccttgttggggtttgtcacgttttatctctatc agtatgcgag >AW057121 atgggcgctggtggctatggattcggatatatgggatccaacgcctcatc gtcgggatttgcccgcgaagattatgcccaaggaggaaatggtggaggac agcagcaaaaccagggatctggaggaaacaccaacccgggaggacaagtc ttcaaggcccgtacggatcaatcttgttacctcggaccataagaggcaag aactcagccaa >AW057122 gacaacttccatctctatcatagcattttgatgattagaacatgtcactc acaaaagatggttccgtagctgtgaccattctccaagcaatcatcttcat tcaattcggcttgtgcgttgcgatcacaattctgacaaccgtcggaatct catttgggtatccggtggcttctcattaccttatggctcttcttcaggca ttggttgcaattccgggaattgtgtacattgtgacacagacaaacatctg ggtggcagtctacatcagcttccaagtggtaaccgccgcgtgtgaagtct actggctcgtctacttgatcttcgacaatcaacccgctggatcttggatc gcacttgcaattatcacagccgtcaacattttggcagctgttgtcggtgt gtggttccgcaaaactgctttgaaccttccatgccttgataagaaaacga aaaaggctggtgatgcgaagaaggagaagcccanaatgaaggccccatca acttcaatgagtgatattgagaaaagtaaatccagagctg >AW057123 catcggatgaaacggatgatgtacttgcaaatgaagacgaaatgttcaaa acaaaaaaggacaaatacaaagtgatcgcattactcggtaaaggcggata tggagccgtatactctgtactccgcctcagcgatatggagaagtttgcga taaagtgtgagaaggcgactgctggaaagaaggttcttctgatggattgc aatgtgatgaaagttgcaactcagatcaaaagtagacacttttgtactgt actggatcgagctaacgtcaaggatcgtttaatttcattgtgatgaagct tatcgggaagaatctttgggacctgagacaggatcgtggcgatggaaaat tcacaatgggcacctcgttgaaagcagcgtcacagtgtcttgtatccatt gaacacttgcacagtttcggctaccttcaccgtgacatcaagccggtgaa ctttgccgacggacgaaaggaatccaacgagcatcacgtcatcttcatgc ttgactttggtctctgcagggagtacgtgaagcgagctgagggaaagatc ttcgagcagcccgtacaactgcaccattccgtggtactactcgatatgct ccattgacttcgatgctccagcaagatcaatcacgacaagatgacattga atcctggctctacatggntgtagaatggacttactgagattgacgcggcg catgtngaagctcacgatcgagagaggctctgcattacaacagtactacg tcaacaccg >AW057124 ttggatcaacagaaagtttaactcctcttctgacaactcaagagccataa agtccttatcttttccaacaaagaacaaaagtccatgaggcagctctgaa acgccacgtggctccgactgaaggattggctgttgctcagaatgacatac acaatttcctgttcgaatcacacggacaaatcttcgatatggcgatgatg 
tacgccgaatcggccggtttcaaacgagatcacctctcgtacgccgcgtt cggtttgatcgcattcttcttggttttcggatctgtggcacgtcttttgt gcaat >AW057125 aacaactcgactaaccgtctccactcttcacttgcacaaatcttcatgca accaatcaacgtcatgctcgctgttcttctcgccttggcttcatttgctc aaggaggcagatctgttgctccggctggtgcagtcactgaaccaacagtt actcaagctgttccagaaggatcaggacttagttcagatgtcactgatcg tccaaacatcgactccactgatgttgtatcaaatgcaacttcggtggaag atttgcttggaagttcaacaaatgcaaacaacactggtacattcaactct taggacctttgtaattgctccaatgatgattcttgctttggt >AW057126 gtgctcgacatcattgctcttctggcaattgtcgtttttcttgtcttttt gccggctgatagatgtctgaattttgaagcaagtgaacaaactccacgca acgtcgacaaaaagtgtcatcttcatccagtgcatcatccgaacttcagt ttcaccgaaaacttgaaaactaacgatcatttgattgatccaagtcatca agtcaccatttgtcaccgtttgaaatggtctcactattcattctgggata gcatgtatttgaatcttttgccggctctcacttgtgtttttctgattttc tacacagttcatgagatttgtgaatttgacgtgtttgttggacaggtaca aactactggcttggttattgtatcagcaatatcggtaatttactctatcg cagtaatcactcacgaaaagaatcgaattgagacggaatggccattgctc gctcatgcttatgtcctggagcaaaagaaggagaaacaaccgttaatgta ccaaaaacgtggcaatatagtattgctatgtgcattctgtcagcccttct caagatcggcagggttntggttcaacatttcattggtgataagacgttct tcttgatgacgatagccaaagggacgatcaaaatggtgaacctgtgctgt tggctggaagtgaagaagtgctgtaccactaccattctgaatgttccaga catccagaacttgaactgat >AW057127 aattccttttaatcgattcaagcacatcgtacatacgcgacaaaacgacg gaaatatgatgagcccgttggactatctgcatgtcactttgctgctcatt ggcccacttgccattctcatcggatgtggaggaaagaaaaaagcaccacc accaaagtcggcatccaagatgactgctccacctgctcctgcatcagcgc ctcccgctgctccagacgcagcccctgctgctcctgatgctgcagctgca ccagctgatgg >AW057128 gacgacgcttaccagtttttggctccgataactcgttcctccaccaccag tccaacaaccgccagcagttgctccaggagcatcacgtcccgcacctgct ccgtcgaatgagcttccacccgcgtcggaggcaaagaaattccaagtgca accggctaaaaaagcatcaaaatcgaagagcaaatcgaaagactcatcgc caagcaacggaaaggagaagaagaagcgcacaaagcgttcgggtggctcc aaatcgaaagaatcttccgagccacctccatcattgtnccgcctttaaat gccaaaaggcattcagtgcatgtcttgtccttatttcagatgctatcact ttgatgg >AW057129 gagatcgttggaaggttctagctatgaacgtactagtgtcttcagaggat ggtgttaaagaattcgaaaagattgttgtggaacctgaagatatcgaata 
tgttgagattccggccgatgccaaaaacgttgacttgacgcgtcaccgta tcaaagaaatcggtgattattcgtggctcactcacgtcgaacacttctcg tttcgttggaatctgatcaaaaagattgaaaatctggattgtttgacaac gttgactcatctcgagttttacgataatcaaattaccaaaggtgaaaact ttggttagcctcgtcaatttggagtcactcgacctgtcattcaatcgtat caccaaaattgaaaatttggagaagttgacaaaactgaagactctctttt ttgttcataacaaaatcactaaaatcgagggtttggatacgttgactgag ctggaatatctcgaattgggtgacaatagaattgcgaaaatcgagaatct cgacaacaatctganactcgatagattgttccttggcgctaatcagattc gtcttattgaaaatgttgatcatctgaagaagctcacagttctcagtctt ccagccaatgcgattactgtagttgataacattgcgggacttcacaactt ngaagagaattatctggctcaaaatggcatcaagtacgtctgtggaatcg atgagcatcttccttcttgaaatctggatttcaatcagaatcgtcttgag aggtcgagatatncattcaatgagacactacagacttttgggcagagaaa tagttgatacctg >AW057130 caaaagggctcggatagaaaaatgcccaacacttatcgaacttcggatga aattgtggctcagcgattggaggatggcagagatatgaatctgagtcttc aaaatatgaatctattcgcatctggagccttttcaaatgtctatcgtgga attgcacgcacagaatccaaccaccaaatggaaattgtcatcaaaaagac atggccacgtcataaaggatgtccattggaagtgaagattctcggaaaac ttggaaagttgaagcaccaagaacattgtccgccttctctttagttacca gaaacaacatgaaggtcgtatctgccttggtctaatcttcgagtacatcc caatgaatctccatcagtttctgaaggataacaatcgacgtgttgacatt atcgaggttaaactgattgtttggcagttgttccgtggacaagcacattt ggagaagtctgaaatttgtcatcgtgacatcaaaccacagaatttactgt acaatgctgaccactgtcttctgaagatttctgattatggatcatctgcg attgaatcagtgaagacaccacaacaaagctaccatgtcacaagatatta tagacccgcccgagttgcttctacgctccaaanactatggatgccagatt gtcactttgtcgtgtcgatgtgtctttggtgaatgctttaaggtggaatc tactggcaggcaatacagccagaatcagcagaagtgatttgtatggtcga gctccactgc >AW057131 tatctatattctcgactttggatttgctcatcagtacatgattaaggatg gaacactgaaacctccgtcagctcatccatggaaatacgtgggaagtctt cgtcatatgccacgtgccgcatattcgaaagtggaattctcaagaatgga agacttggaaatgtggttctatatgagtgttgagctggttaagggatgtc ttccgtgggctcatttgaagaaaccaaaagaagtgcatgactatcaaaag ttgtgccgaaatggccttcaaatgcgtgaaatgctccgaggtctttcacc agaattcgtcgacattatgcanataggtgacaaactttcattcaccgaca ctccgaactacacagaaatctacggacttctcaccaacgcgattctcttc agtggcaaaaatgaattcccgtacgattgggaggaggctgagatcaacga 
gttcaagaatccgcagaagccaagtgtggagcaggcaacctaatagctct ctttctatggaaataaattga >AW057132 ttcgataacttgctttattgtcgacaagttttatcgtaacaatgacagca ccgccaccgccactcgtggaacttcctccgggatctatggttgaaagatg gtcgattacaaagaagctgggagaaggaggctgtggagccgtctatttgt gcacggatgcaactggaaagtatgcactgaaagtcgagggaatctctgag gcaatgcaggtgctcaaaatggaagtgttggtgctcggagagctgacaaa gagaggaagtcgtcacttttgcaagattgaagacaaaggaagatatggcc aagttcaactatgttgtaatgacgcttgtcggaaagtctttgcaggatct tcgcaagggaaccgctcaacaatgtttgagtctggcctgttctctcagtg tcggaatccaatccttggaagctcttgaggatctccacaacattggatac ctgcaccgtgacgtgaagcccggaaactatacgattggccgtgccgagtt gaatgaacttcgtaaagtttatatcctcgacttcggaatggctcgcaagt tcacggacaacaatggagtgatccgcanaccaagagctgccgccggattc cgtggaactgttcgctatgcgccaatcgcttgccataagaatcangagct ttgaaganaagatgacgtgagttttgctctacatgcagttgagctgactg tttgacgtgtcccatggaagagatcacgacatgaacgcagttgacaagcc aagcnagcgatccgcatactccagaaagatgtt >AW057133 tcgctgtctttaaaggagaaatatcgattctataaaaatggaattcggag cgtcgtttgcttatagaagaattgacgcccacgttgaagcaactcttcaa gaagctgctcaagttctcgaaaagttggtaattgaaagagatgccgagtc ggccataacggtcgcatcccaaggaactccagatgcaataagtatcactc aagctgctcatgaaaccatttcaattgattcaattcttcgatcttcgaag caaccaaaggctaccattcgtactccacgtgactctaacaagaatcggaa attaactttcccaccagcccgaattgggttcaaatccgagcaattggaaa cttgtgattctggaataactgatagcactattgaccaagatccaccgacc ccggactccttgttcccaagtgccatctacattccggcaaagcagaagcc tcaaatgaccgtttcaataagtgcaacaactgcttcttcttgttccaaca aatcgctctaccgtaagcacattgaaaaactagcaattgagccattggag gacatcaaacacttgaagtgccgtggactgangaagtcanagccagatga ccttcttctaaagccacttacgatgcgggaactggnaaaccctaaatggg gttgtctccgagtcaatggtgtcggtcacctttatatgcatctacactgc taattcaatagcttgctatcacccaatg >AW057134 cttaccctcggaaaggcagaaagtcatatggacttactggaaatgtgaag aacgtggaaaagggtgtcaatgtgtggattgagttgttggaaaaaatgga ttttcgaaagacgatggaaccggaattcaaaccgattgacacaatgaaag cagatcttgagaagtgtcaagctttcaagaagaatattgatttttctcaa tcggagaatgttgaattgtacgatacaaatcgggtcaaaggaggtggaga agcggatttcttttatcacggaagcactctgactattccatcggtaacca 
acaaatcttacaatccttgtccaacttgcaattgtcgactcaccgcaatc tcttgaatcattctggataatgggtgcctcacaaaagatccaaagactct tcattcttcttggcgaagaagaactcgataaacctactctgagcgagtac tttccagaagacttcaaagagttcaagacgattcgtgtgaacaatcggaa gactgtgccgaagactgaagagcaagcaaacactcaattgtactatgaag ttgtgccaaaggattgtgcagaagctccatttgcaatgattgagatctgc gattcttggcctgatgcgaagattccaacgatgggttataacagaaattg ctgcgactgctgcagcgtgtctgaatctgatattgattgtgatgcttctt gtgctattgtgagcaactacngagccgctagagcacgatcatctacttgt gcagttctggctactgatgaagtgcacgcacgagaagccccaatattaaa gaaaatcgtctttagtacgttcgcacgcctgctgtattcgacattgctca tacatgtt >AW057135 tctcatcatgatcacggaagctgagatttgcttcaatcggatatggcgtt gacaattctggttccatggaactttcagctcaagcccccaattctcgatt ggggaattaatttacctggtatttattgattaactacanaatatcccgat tattttccagtagttgtactttgcttggggttttaactttattttattgt taaaaagggaaaagctggagcaaaatgcccattcactg >AW057136 atcgaacatcaaagcagtgaccataagtggaaagtgttgcgaaatatcta ctcgggaccgttttcagatgtttatgtcgttgcggatacagtaacgaacg aaaagtatgcgatgaaatgtgagagacaagaaggaaactctcgtccagtc ctgaagctagacgtgatggttctgatggcgacgaagggtcttcgaggatt tccgaactttgtggcggcaggcagaactgatgtttataggtactgcataa tgcaacctcgtccggaccggatcttggtcggctgcgtcgaacacgtncgg aacgcaagttctcccttccaacagctctccaaattctcggacaaactctc cgacgcctcgaggatcttcacaattgcggatggctttgtcgagaagtgaa ggcgccgaatttctgcattggtgtcggcgagaacgagtcgaccgtctata ttctggattttggattcgccagganatttgtagacaaggagggcaaaatc at >AW057137 gaaacttgtgaagaaggtctgcaatggctgcctcaaagaaattgatgcca aagcgaatgaagaagaagatgcaggtggagagaatggctcatgccagagc ttgcaaggttgcaaagagagaggctcgcgtcgctgaggaagcatctggaa aatcaactggtggatctactcgcggagccaagtgatagccgagccacaac acatga >AW057138 tcgaatcaatcgccttttgaatcggcaaagtagcaaagaagaaaaggaac agaagaatgggttccgagaagatgatcgagatttgtatcgacatggagga aggagaaccacttggtgcaactccaaatgacaagctcgttatcactaaga ttcaggctggaaccatttctgaaggaaaattgagaattggtgaccaagtt aaaaaagtgaacggacaaaattgcaaggattgtaacgactttttccgtgc gcttcgctttgctgctccatgtgcaaaaatcacggtgaaccgtgaccgaa aaaaaggctgaagagttagaagctcgtgttcatattcctgaggatcgtgc anagatcattcaacgccgtgagggatacgtttatgagttggcaacccttg 
tctgggttcaaaatggaccaaaacttggtttgggaatcaagcatttccag aaccgtgtgcttgtttcacgtgttgatcctggatcactagccgagaagtg tcttgttcttggagatcatttgtgtgatgttgatggaattccagtcagtg acaaggatgttgccagagatcttctcgtcaagaatatncaagagaaagga naagtcacttttgtcgtcgagcgccctgattcgatcgatgccaagcaatg ggcgatacaggcgttgctaccaatctcatgctaccaccatcagtccatat gaaccgagatgtgaaagcattgcttcgcatatcgtcaagctcttttcatg actgagcctncagcgaagagtgcattgtctacctgtccanaatgcttgcc gtgttctatcatcgagcaaactcaaaacttacgagatccggcatgatcat gacggaaagcttttcgcaaggtcagtga >AW057140 caattgagaacgctgtgcagacatgtcgatgaccaagaatcctgtgaaat caaaaccagccgatggaacaacgtccgctaaggattttgagaaccttcag agtgacttcttctccttcttgtacgctgatcatggaccattctacaaaga gaatgtgaaaaagttggaagacgcaactggtctgaaacgtgaaatgctcg catatgggctcatcgggctcaattgtgtctacatgattattggaagtggc gcccgagttgggggcaatttgattggagttgcctatccggcttatgtttc tgttaaggcgatccgaactgaaggaactgacgatgatacaatgtggttga tctattggactgttttcggtgccttttcaattatcgacttcttcgccgca atcattatgtcatatttcccaatttattgngttgccaaagcagcatttct cttgtacctctacttgccagaaactcacggatcccacgtcatttaccacc aactgattgatccatttgttgctcatatggagaagagcatgtccagaaag cttncagcanacgctggaactgttccaaaatgatcaggatcagccttgga tgcgaantaaca >AW057142 ttttcatgttcgccaaagcacagagaaaagaatagatggcagacaaatca gcatttgtgccagtcgatgcaatcggaaatcacaaaaacaccgatcttga cgttgatattgatgacgaattgttcggaaagaaaccaccaaaagccagtg caagtgcaaccaaagcagccgctccaccggctccggctccagccccagcc ccaccaaaacctgctgcagcacctgccccagccgctggaaagtaccaata caagaagtcgtcgacctatcagaagacctat >AW057143 gactggtgctcctccaacagcgaaataagcagactgggctccgccacctc cacctccacctccggctcccatgtagactgatgttcctggtggtggtgca gcaccacctccaaccttcgaggctccaccgcctcctcctgctggtcctcc tccaactccaaagtaagccgattgtgcaccagtattgctcc >AW057144 cgaaatgggagtcgagttatcgttggatccttctgtctgcccaatccaag ccaatggtggtgtttctacgcacaagatcattaatcactgcgacaaaatg ttggcctataagataaaatcttccaataactccaactacagtgtcaacac tatctttggacttattcagattgggtataccgctgatctgatcatcacta gaaagccaggaaagccgcaggccgacaagcttgttatccaattcgctgcc gttgagcagacttgccgtgatggctgtatttgccaaggctgtatttgcca attggactgccaactggagagtgctgtggagagacgatcatcaagctgtc 
agctgctgaatagaagaattatcacaataagatttgtgattatgaaaac >AW057145 gcgaaaacaaattcaccgcccgttccgttgtcctgtcttggtgaccggcg tcgtcgtcggcggagccgctcttctggcgattgccgcctactactactgg agccagaaaaagaaaagctctgatacttcatctgccacgtcatcggagtc caacgatgttgtcatgatgtcatcatcggagcccagagccgatggaggag ccgattcgaaggcaaagttcaatattgaggatgaaaatgtgagaagagtc tgcgagaagctgttcatggagcagatggatttgggggaagcttattttgg aggatgaagaaaccgaggagctcggcgcaatccacatggccaacgcaatc gtgctcaccggagagactgctcagctgctcaaagtgctccgcggctcgat ttcaccggctcactttgccaatattcaaaagtacctcccatcggctgact tgcgtgttcaccagcttctccaagacgagctcgccattgagactattgcc cagcatttcgactaagctcaacttctttntttttttt >AW057146 aatgtgtacagcggtggagtcactggagaaactcctgcctacttcaatga cttgcgaaaatgtggatgctggacagctccatctgtcaagcagttcgcgc aagatgagacagtctgcggtatcactgacaacagagatgttcatctagct ggaaatgtgctcaaggctgctgaagaagacgggaagatttatgctggacg attggtgcaaggaagcctgcaaatt >AW057147 aattccggcttggacacgctacggtactagttgatttggaaggtgtaaag tttgtgacggacccagtttgggctgatcgagcgtcgtttacgagttttgc tggaccgaagagatacaggccaccaccgatgaagttggaggatctgccgg atttggattttgcagtggtgtcgcatgatcattacgatcatttggatgct gacgcggtgaagaagatcacagatcgcaacccccaaatcaagtggttcgt tccgctgggaatgaagaaatggatggaaggccagggcatcggagtcgacg ggatcttcaccgctgtcaccgagctcaactggggagagagctcagaattt gtgaagaacgganagacctacaccatctggtgtctgcctgctcaacactg nggacagcgtggacttttcgaccggaaccacagattatggtcaggctgng cggtgatcggcgagaatcggcgattctattattccggagatactggtcac tgtgacggagagtttaagaagtttggcgagaagcttggaccttttgatct ggcagctattccaattggagcatacgagcccagatggttcatgatatccc agcatatcaatccggaagaggcgattgaggttcataaactcattcgggct aagaacagtattggaatacactgggtaacgtaccatatgggctctactga gtactacctggaaccacgtgacaagctcaaagagcttattgatgctccgg gagatcttangaacacgagttttgcacaattggaaatgggtcgatttggg aggcgtngatca >AW057148 agcagcaaagaagaaaaggaacagaagaatgggttccgagaagatgatcg agatttgtatcgacatggaggaaggagaaccacttggtgcaactccaaat gacaagctcgttatcactaagattcaggctggaaccatttctgaaggaaa attgagaattggtgaccaagttaaaaaagtgaacggacaaaattgcaagg attgtaacgactttttccgtgcgcttcgctttgctgctccatgtgcaaaa 
atcacggtgaaccgggnccaaaaaaaaggctgaagagttagaagctcgtg ttcatattcctgaggatcgtgcaaagatcattcaacgccgtgagggatac gtttatgagttggcaacccttgtctgggttcaaaatggaccaaaacttgg tttgggaatcaagcatttccagaaccgtgtgcttgtttcacgtgttgatc ctggatcactagccgagaagtgtcttgttcttggagatcatttgtgtgat gttgatggaattccagtcagtgacaaggatgttgccagagatcttctcgt caagaatatccaagagaaaaggaaagtcacttttgtcgtcgagcgccctg attcgatcgatgccaagcaatgggcgaaacagcgtggctaccaatctcat gcaccacctcagtccanatgacggagatggaaggcattgctcgcaatatc g >AW057149 ttatgagaaagtattcacatccacatattgttcggattattggtaagata atcgtcaaacatcttccaaaagttggtttggcagtagatgctcatccact aatgattgtaatggagatgtgcccacatggatcacttctttcatttcttc gtaagaacaaagggaaaacgacactttccgaacgtcttcgtttttgtatt gaatcagccgatggtcttgcatatcttgagaaaaaacaatgtcttcatcg tgatattgcagccagaaattgtttactttcgatcaccgatcaaattaaaa tttcggaattttggtctttcggatgacaaacgaactgaaatgcatgatga cacactcgataaggtaccagtgaaatggttggctccggaagttatgcagg ataaattgtattcattgaaaagtgatgtttgggcatttggagtgctcatg tgggagatatatgcagatggagctgatccatatccgggaatgacaaactt agttacgagagccaagatcttctgcgatgattacagaatgactnttcctg agactaccgcaccaaccatttctgaaatcgctttgaaacaaatgctggcg aaacttcccatcnatcgtgccacgatgaaaactgtgcatcataagctaaa gacctctcaantgccatggtcgatgtaggctcagtatgaaaattgaacgt >AW057150 tttcactcaaaacgttcaacaagaggcaaattcgacgaaaaatggacaca atggagcagcatcaggagcttgagcaagaagccatcggaccggctcttcc accgccgtcagctgctcaaaaatcggagctcttcgaggagcacaacgtcg agtacgagctgatcaacgggatcccgtgctaccagccggatcatgtggtc gacgggcaagttcagatcttcgaacgaatcggctacgacgacaaggtcgg cggaacgttcctcgggctcagcgccgacggaaaagagctcgtcgtccggn gtgcaccgatcgacgcgctgactcatgtggttcgtgccgaggcggcattc ctgtgcaaggtggaagccgagcttcaggactggcggctcttctcgcaagt tcacaagatctttctgaccgacgacgcgtggcacatgtcgctgtacttcc gtggtggaccgacgttggagcagtgctttgcgatgcggaacaagttcac >AW057151 tatccgctggtttgttgtactcattttcctcatcatttccaataaaaaga aacggtcgatatcatgtttggtaatgaaaagagcgaagaatccggaagtt ctgggtttggtttagctgaagttaaaaaggttttccagtggattctcgga tgcacttatgcgaagaaaacattcaaaactagaatttctgaaatgtttca ttttgctgatgctccgcacattgtcgtgtatgagcgaagtgaggagcgac 
catggtattcgatggttggaataataattggagttcttcattgtagttat gagtctctattacacacttgaagctggataccgttcaattcgaaaactca tgctttcaatgctccaagatccgcaacagagtactgtatccactccaaga tgtctatcaccaacaaacatgggatccaagaaatcttcgcctcaaactcc aaaaactccggatgtgattcgtcaaaaagttccgatgaatgagccagtca attgtgtgttcattcgaccagttattccaaanactattccagagggtact gtagccgtgccgaatttgaattcggaagaagatattctattggatagaga tcatcgtattggagagacagggacaaagatatatggaagttagaagagac acatagtgatagaagtggatagaaaaaataaacagaaactaaaagagtc >AW057152 gcaagaagaagtctgatacggcatccgttgttgctattccagaaggagac aatgagaaaggaaagaagatcttca >AW057153 aagaacagtcttgaccaacacgagatgtaaactcgattgacgctcctctt cgacctcttgttgtctggaagcactctgctcgttcaatgtggtggaaaga aaaagggagcaacttctgccgaaggaaaatcttcgacgatgggcccggct cctggaggagctcctgctgctgcttccgctcaaggagaacctgaagagaa ggagtaa >AW057154 atctacggaatgatgcgtcaagcttatggtgcatgtggatccaacgagaa tgctccgtacgattgggaaaatggaggtccagctgcctacttgctccact agaagaaatagacttga >AW057155 gacaagtgttctccgatgctccttcgtgccanatttctatattttcatca gcgcaccatgggtgttgtccgttactgttaagcggaatatgaccgagtat gagcagaaaattcatatcaacctactcaatgggatccgtcagaagaatgc gattgatgagcaagtggctaatatgcatgagctggtctacgatcccgctc tggaatccttatcatatccagaatgcgaaatctccaacgatgatattacc gtaaggaacaatgatggcgtatccacatattataatgcattcccaccaac cggacagatatttgggtgtactatgcgcaacacattgattccctgcatac gtctgccgcatgccatcatctcacctgcaaatccggcagttccaataata acagtgtcagcggatgtatttttggacctgttcgtaaattcagcagctcg gaagtggtgaaaggagagcctggttcacagtgcccaaaaggaagatcttc actggattt >AW057156 gtcgactgcaaagaagactcctgccaaaatacttagccgcagcagaagcc agcagaaggtgaaacgctcgatgagccgcaaaggttgaacggattg >AW057157 aaattccaaaccactgaaaccagttagtancaatgactgcacaaatcatg ttgccaatgagtgcaattttcgtgctacttggatccatagcaaattgcgg tggcaagaagaaaggagcagctggagccggagccggagttgcagccaaca gcccaaaaaaggcagatggaccgtcgaagaaggaaacgaagaaagaagga gatgacggaaactacgaggaactcgccgtcccacaatgatccccctgcat ccctgtc >AW057158 gctgggcggctgagctggatccacgtggctagcagaacatcactgactcg aatgttatcaatcgcgaggaatacgaggacgacgagtacggatcgccagt ataattttcataactcgttctacttctcgatagctcatcatagccaattt 
cgtgctataatcgtcttttttggcgcgggtttttcactcgtcttttcttt cttttttacttttactctgttctatactaatcgcggatatatat >AW057159 aacggtactcgatgagactgctctggatactaaagatatgttgatgaccg cattggcactcatgtcactcattggactctgctcaactcgtcgtgcattt ggtgtcttcactcttctcttcatgatgcatgcatttgtattctcagcttt ccatcttgcacatacanttgcactcttcattaaatcattcgattctccat gtcaatatctgaaaactccctcaactggaacacttaattcggatatctgt catgctgttaatggagttactctggtgtgtgcagtgatttcaatgattgc tactgctcttgccagtatggccgtcttcattcgtctcactacagtcgtcg tcaaaatttcgga >AW057160 tgaacaattgtcaaagagttcggatcccaacatctcctcgatgtacgttt tccatcaaggaattcaagtaaagcaggaaccaatcgatgatgaccaagag gaagagcaacaagtacaaaagcagcttgtattcaaaatcgagggctccga agacgaagaagctgtgaagaaggagt >AW057161 caactttatttatacacacaatacataattttcagagaagttttcataat cacaaatcttattgtgataattcttctattcagcagctgacagcttgatg atcgtctctccacagcactctccagttggcagtccattggcaaatacagc cttggcaaatacagccatcacggcaagtctgctcaacggcagcgaattgg ataacaagctaggcggcctgcggctttcctggctttctagtgatgatcag atcagcggtattcccatcttaataaggccacagaagcggtgacactgcag ttggagttattggacgattttatcttataggccaacattttgtcgcagtg attaatgatcttgtgcgtagaaacaccacc >AW057162 gaagagtgcagtagtccgattgaaaaggatcaacgtcgatccgccaactg gaaactatccggcaactggaggcaattcgacgcacaacatcacttccgaa tcggattcccgtcttgcattcaaggtgaagtcgtcgaacaacgagcacta ccgcgtccgtccagtttatgggttcgtggatgcaaagggaaagagcaaat tggacatcaatcgtcttccagggccaccaaaagaagacaaaattgtgatt caatacgcggaagtgccagccgaagagactgatccaatggcaccggtcaa ggcttgagctcaacaaggagaaatcattgtgaagctcatcgctgcttgaa tggaatgcaataactgaag >AW057163 atcacgaggctcaaaaagcgtttcgctcaattgaacacggtttcattatg aagttcgatgtctcttcgttgtgccaatttgttgttcaactcctccaatt ctgcaatgtttttattgttcttgcagtgatgaccatcacctcatgcggaa agaaaaaagcatcaaatagcaaggaaaattgcaaaaagagtttgcaaacc ggccctggagcagccaccgaagccggagctgcttcttcgttagctccggt tgacgcgaccaagcttgccacacctgtaccagcggcgccaaaaaaggaag aagctccacctccagaagagcccaagaaagaggagaagccaaaggagaaa tcgaagaagtcggcgaaatcgaagaaatccagcaagtccaagaaagacaa gaaggatggagaggaagagaatggatatgaaaactgccaggatatgactc cggatcagttgaagaagattgc >AW057164 
tcatcagctcattcctcattcggtacatcaaatgactcgttttcttcagt ccaaagtgtgattctgccaccagttggaccatttggtcagaaaaagagaa cgtttgagtacattggtgtcacacttgttgtcgataagttgaaattggct gaatggatgaacgggatcgggaaattgtttggatccgcagagctccgtga taatgttaccaatcttcatatgcaattggtcccggtgattgatacgttta aaaa >AW057165 tgagaaatatgagctagctatgagaaatacgcactgaagatgaaggagga atgttatctggtgttataatggatacttctgatcattatgagcgtgatta tacaatggatcatgatgttggaccttcttcaatgaaaatgtctcctatac caccacctccgatcaaagaagaatcacctccaccaccgccac >AW057166 gaaagagtaaaaatacgaagaaagacggcgttgacaagaagaaaacttcg aaaacgaagaagaagtcaaatttgtcgatttccaaatcgcaaacttcggt ggatatgaatgaaaaggacaaatcaaaggaggcaaaggaagcaaaggagt tgaaagagaagaaggcaaaagaggaagccgagaagaaggttgttgccaca ccaaaaaaagatgcttcgaaggatcaggcaaagaaggaggaagatcctta ttagcaaaacgaaccaagtggaatggatgttcttgtgaaggaggacggca agaagacaaaaatggacgatggctacgaggatttcggtccaggcgccggt gctgctcaatgagcaaatggtcgagaaac >AW057167 ggggaatctggagaactgattacagtacaaacatcataagtcgccacaac aaccactgcacgacccattaagaatattgctgcaacaactaccgtcgcac ctctccaattgattgcggatgcttcacttgttgacaatgacctacaatcg aatcttgaagcaactggagtgtatgtcgatggaaagtggtggtggtgggc aatctacctgggatttgtcttgggcactctccttactttggctatcgggg gtggaatatgttacgtgttgcgacgaactgtttatggatattggtaccgc ggcatgtacagacgatatggatgtgatgtctctgcgacaaccgctggtct cactggagttggattcggagcaactacgaccgcaatgcagacgatttctc ctggaaagacgggtgcgacaacattgggaagtacttcaagtaccactgga attactgaaactactggaactactggatccacggcaaccactggaact >AW057168 tcccaacaacacctcaagatgaatgccatcttcactgccgtccttgttgc ttcaactctcgcctacactgcaatggcttggattggactcagcattgaag ccgccaacgaggatatgttctgaagtggtacc >AW057169 aattccttctactaaccctttcgactacaattaacatggacatggacaag cgatcatcggatttggaagctgctcttcgaattgtgctccagcagacttt gaacatcgttttgcaagcgcaggagaagctccccgaggcaaatgtggtac cctcaactccgcccacctcaccgagcactgatatcggcgaacaaatggca tcgttctggaatattccatcacccaaccctcctgcaacct >AW057170 gacaatttctcaagatggtcgaactgcaaatggagcaggaaatggagaaa atgaatgagattgagactgataaacttccgattgatcatcaattgagtga ctatcagaataacatcgaatcgggaaatgatcgtcaagttcaatcatgcc cagttgatgtgtctattccaaaagaagtcatgaagtgtgcaagctgtcct 
ttgctatgcttcaattgctcagttcaaatgcctgtctcacccgttccaaa caacaatcgaatcccgtaagatcaacgagactactcacttgatgatggaa ttttggaagattgttgcttcaaagtctgaagaggaaaagctcccatcact attcgaaaatgttgagggactgttttctgtcccattttcaactnttggaa cgtgggatgatgacaccctgtctggtgtcacatcgcttaattttgaaaag tctgatgaacaactctccgagcaagatgatgacaaaaccactgtttggag ttctaatttcccatcggctcatgttttaacagtctatgagaattctgaac agaagacggatganatggccgatgatgatatgtccgacacaacttcatct tttcttctactctttcacaacatgagtgctcaagtgccgcgctcatcttc tcagagagtcacttgcaagatcagt >AW057171 gtcgccagactttcagcccaatgactttgttattgattggaggtttgaaa aatggttgttctgagaatgaaaataaggaagaggggaagtttgagaaaat tgacaaagttttctttcctcccgagactgccaacaatactaatccagtcg gacgcctcattggtccacgtggaatgacaattcgccaactcgaaaaagat ctcggatgcaagctgttcattcgcggaaaaggatgtacaaaagatgatgc caaagaagaacgtcttcgtgaacgtgttggctgggagcatctcaaagaaa ccgattcacggtgatgatttcagtccgcttcgattcggaagaggctgcat ccgagaaactgtcatctatcaagaaaatgcttcaagaatttttggaacat actgactcggaactcaaacgctctcagcttgcaactagctgttatt >AW057172 aatgatcaacgtcgacccaccaactgttaatgttcctgcatctggaggta attcggttcacaacatcgtctcggagtcggattctcgtttggcattcaag gtgaaatcgtcgaacaatgagcactaccgtgttcgtcccgtctacggatt cattgatccgaagggaaaaaccaagttggatatcaatcgtattgctggac caccaaaagttgacaagcttgtcattcaatatgctgaagtgccagccgat gagactgatcctcaagcaccattcaaagctggagcacagcaaggtgaaag tcatcgtgaagatcactgccgaatgagaaaat >AW057173 gctcgtacaagaatgctcgtcccgctgactgcaattgtgactacgtcgtt gccgatggttgccgccatcgctttttgtgccaagaatcgtaagacggtcc atgctaaaaacaagaataagaacaagagcagcaaatctgccaaatcgtcg aaatctactcgtggagcgtcgaagagtgggaaatctcgccgttcatcgaa agctaagcactccaagagatcgtcgaagtctagtaagaagggaacgtctg taaaatcttggaatgggaagccgcaagcgtggagggaaatcatcaaagtc ttcgaaatcgaagaacgtcaagactgctaccacctctggttctcaagttt caactgtttccgctgctactggtgtttctgataagcaatctaactcatcg aaatcttctcgtaagagctcaaagagttcgaagagccgtaagaatcgtcg acttgattcggatgcccagaagaaaatggagaaatcgggacagagcggca aagttgctcttattccagaaacgcaacacacaactggaagccaagctgcg catagccttgctgaagaagtcaattcgatcaagcactccaaggaaatgaa tgtggcttctgctaaactgctataccagacactttggcgagtcaatcana 
ttgtattgaaggatacttcatatgaacgtaagcttataagatcagtgctc ggatactccctttatcggtcaatccgctttatgatttg >AW057174 tccaaatgacaatcgacatgaaggtggtggcctgggaaatgctcaaaggc tcggatcagccggtggacttggtccccaagaaggtcgtggttgccggaca ggaaatcacagttgattccaagtcgaagaacgaaa >AW057175 atcaactaataagatgccctgtcaaaagaagtcaaacccaacggaattgc acatctccactggccgcgagatcgttcaacggaactttgtgttccgcaac accactggcaaggacttcctgctgaagttgcatgctacgaatgaagccgt caccttcccaacggaagttttccgttttccaccattggctcatcgtgcca tccagttccgtgtgaactcatcaaagctctcccaatgggacaagatgaat cttttgatccaaaggatcgtgttgccgatctatgcgaagagcctcaagca gttcattgatcagaagaaaactgcaggaactaaggagcaagaggcattct cattgtctgtcaagttcacggatcagttctcggctccccagacagtcatc aacttgccaggatatgccacgtgtatcgagtcgactgattatccggttga cgtggaagaattggacactacaactgcagtcaacatcgaaagagatgtct ccactgctgttccaattggttcaatgatgggatttgttgaggagtacaaa cgtcgtcaattgaacaaaggatgctgtctttcaactacatctttggaact gaaagcaaccggagagcagtcaatgagatctctcgtgatcagccgtcgtc gatcatctgcaagagctcaaggt >AW057176 ataactctcccaacaacacctcaagatgaatgctttctacactgccgtcc ttgttgcttcaactctcgcctacactgcaatggcttggattggactcagc attgaagccgccaacgaggatatgatctgaagtggcgc >AW057177 acggattgctcgcgagcatccggaacgtgcggtgactttgctcaaggcgc ttttcgctactgtgtcgacatttgatcaagaaggttatgtttgtgtggag gataagaagttcactgagaaacagtccaaataacttt >AW057178 tctcgatccttgccgtttttgtccaccatggatttgctgctgctgaagaa gagaagaatacagcttcagtcgtcagccctgctccggactctgaagcagc ccaacctgctggaaacggaaccgaaacaccaaaagatgaggtgaaggatg aggcaccaaaagaaggtagtgaaactgaagcttcaccagaagccaagaca aaaggatctatggtattccatgctcttggagccatttccacagttgttct cgccggcattatgtgaagaagtctgccgaaag >AW057179 atcaaccaccatgagttttgatgaaattgacatgaccttcggaaccaaga accgcgatcaaggatatgatttgctcaaagcgcgtctcgacaaaggtgat cgttcggtggaagtcttgtggagacttgctcaagtaattcatgagaagtc tgcatgtgttccaaaagctcaacgtaaggcaagtgtcaccgaaggactca agtttgctgaagaagctgtccagaaggatccaaaccatttcaaggcgctg aagtggaatgctgtgttgactggacaagcaaccgaatatatggcaaccaa aggaaaagttggattgcagtaagaagttcaaggaattgctcgacaaagct cttgctaaggagccaaaggatacggctctcctccatttgcgtggccgtta 
caagtactcggtggcatctctgacatggcttgagaagaagcttgctgcca cgttctatcagcaaccaccatcacattcctatgaagaggccaacgaggat ttccttgctgcttataaggtcaatccaaaatggatggagaacacattnta tgtgtccaaatgctacgtagcaattaaagacaagaacaacgctcgcaagt cccttaccgaagtgtgtgacatcgaaccgtattccgacgctgaacaagag tttgccgatgatgcgaagcagatgttgtctaagctttaa >AW057180 gaaaaaatgatttcggcgttcacttcattcgctgtatcctacttcgtctt ggctatttcgttttacattgaaacaactgtcagcttgttccacctcgctt atttctcgtacagaaatccggcagtttcgaaggatctcatcaaaactgca ttccatcttttgaagacttcctacgacaacaaattgctgacatttgccga aatcatcgagactacacaaaacagtatgatcaagccaatggctcatcaga ataaacaacactttttggaggaaaaccagcgtaccgcacagttgcagacg atgaaaacatcaactgcttttcgcgttaa >AW057181 tacaatggatcaaatcccaccatacgagttcaacaagtacgtgctctacg tccgtggtgccgtcatcgtctgtgcttcatttgagctcctgttggtccta ttcggttccctcgaagattgcaacttccttgctaagctcttctacttcat cttccttggcggagcagttgccatctcagctcacaacattggtctcaacg tggatggtcgcgaggagctcaacaaggttctttcatcgtcagagaatgaa gttcgtggaaaggtctgctgccttgattctggtgccagcgctctccggtg tcctcgtcttcttatgtgtctctggacatgcattcttctctggtgctgca ccgtcagcacaggatccagctgctgctccaccagcccaatagac >AW057183 tatacgtgcatgtggaggatgaaaatccgaatgagaatgagccggcacgt ttgagagctggattcgagtgggctgctgaaccggatgagattctcattgc aggtgttcccaccaagttcattatgtttggtttgtcgtgtttccttgtaa ttctaacactcagtctatggattgccagcacccattattcttactttatc tggctcgtgtttgccacgttgtacatgggtcttatcgtgttccttcccga atatttctcgaatataatcagtttggctctgaactttatttactggatcg cttactgtattttcacttttattggaattattcttgatgttgttaggaga ggggacagttgtagttccggtatgagcaaggaagtttgtgatgctaatcg tcacggatacatgttggctatatgcttcggatgtgctgatctgctaattg ctggagttatcatgattctgatgttccgcatgctcaattactattacatc aatcgct >AW057184 gacatgtcggttttcataagtttgtgttctgcttggccttattgcaatga tggcagcccaattcggactaaactcgggacttgggcttggagttggaccg gcgagagctaatgctaacctaaacggaggttttcaacgtggctacggcgg caatggctacggtaaccgaggtggatacggtcagcagggcggctattatg gccagcaaggaggttacggtcaacagggtggatatggtggcaatcagggc tactatgggcaacaaggaggcggntattgaggtggcc >AW057185 attttttcaaccgatcgtccaacaagagacaatttcctctgaaaacgcaa tgccgaacccgccaccgaaggaagacacctgggcgtttcaaccaattgga 
gccccattcccaccgagtcctgtgaaatgtatgggagaacagaatatgta tgttgctctttggtacaagcacggtaaaccaatccacggtcgctcatgga acaatggaggagttgttgaatgttcattcccatataagcaagctgaattg acaaccaagcaacaactggaaggacagatccaggttcttcaatacgtang agaccataacaatcaaggtttctggtacgaatggattaagtacaaggatc gtattgagaagattgacgataaacatcaacttgtgcgttgtggtgattca ttcccaatcttctggaagcgtgccgaaggaaatcttcttggttatgtcga caacaagactgaggaggcttggttctcgttcaatggaaaagtgctgaaac aagttggaccacaactcaatgacatgtacatcatcacccgtaactgcatt ggtgggccaccatcttgtgattgtgccaactggtgaagtggaccanaggt tcgtgtcgagagagatgaatggatggacattcgtgaggtgatgcatggca actcgtcacttgtcaagctcttgatagactcttgatacttgccagtgtca atcagatcatacgttgcactttgacatgccagagaacctgtcatggtcgg atggaat >AW057186 tcccaacaacacctcaagatgaatgccatcttctttgccgtccttgttgc ttcaactctcgcctacactgcaatggcttggattggactcagcattgaag ctgccaacgaggatctcatctagatcgatcggagaaaaccgccacaagaa attttg >AW057187 actaactaactacaatcaactctactatacttatggtcaagaagattact gtctacactgcttttggacaattcctcgagatgatcgagcgtcaagccga acagagaagggaaacagttccagtcctttgtccgatcgttgaaaaggctc agccaaggacagctttgaacaaggttcaatcttgcccagttgttccaacg actgcaagagttacagaagagatcaagaagagcattagctgcccattgtt ggctctccac >AW057188 tttctcggaacaactccaagcgaaaaaaatggcggacaagtcggcgtaca tgggtgctggtggctatggatccggatacatgggatccaacgcctcatcg tcgggatatgcccgcgaagattatgcacaaggaggaaatggaggcggaca acaacaaaaccagggaaacggaggaaacaccaacccaggaggacaggtct tcaaggcccgtaccgatcaatcgtgctaccttgggccataagtagctgct cgaataatgtgaagactcagccag >AW057189 tctttcgttctcgccatcaccgccttgcccgcgattgccattttctgcgg aggaaagaaaggagccggtgaatctaaagaaaagccaaaggaggatgtgt atgaggatttggcaccaggagataagaagtaatacttgtgaactgacaga atgcacaatcgagcaacttc >AW057190 ttttgaacaacaatactcgatgcccaagttaaatagaagaatcgttcgtg caagggattcaaaaggacgcttcctacccggaaagaaggccaagtctgtt gccagcaagtctcgttcgagatccagaagccgcagtgttgtgagccgtat gacgacccgtaccaattcgttgactcgtcgtcgttcatcgacgaaagctc cttcgtctgcccgtcaatccagatctcgctcaagatctcgctcaagatct cgctccaaatcccgctccacgtcttcccgccgttcccgctctcgctcagc ttgttgtgtctcgttcaagcgtggacgtcctgcttcgtttgctatgaaga 
gtcctgaagagaagacggccgcaaagaagacggccgcaaagataatcctt agagtagattagccacctggaatgagatacagaat >AW057191 ccaacaacacctcaagatgaatgccatttactttgccgtccttgttggtt caactgtcggctacactgcaatggcttggattggactcagcattgaagcc gccaacgaggatatgatctggagtggc >AW057192 gcacctggttgtgatattgaaatggatacgcgtactttgttgtggaatga ctacaatgctgcagttccaactaatatgtgggaaatcggaaaatgtacat tcaacttc >AW057194 tcgntctactgagaaggatgaaccgncagtcaacttcttctcctcctcat cgactcgatccttgccgtttttgtccaccatggattggctgctgctgaag aagagaagaatacagcttcagtcgtcagccctgctccggactctgaagca gcccaacctgctggaaacggaaccgaaacaccaaaagatgaggtgaagga tgaggcaccaaaagaaggtagtgaaactgaagcttcaccagaagccaaga caaaaggatctatgggatttcatgctcttggagccatttccacagttgtt ctcgccggcattatgtgaagaagtctgccgaaa >AW057195 gtagatgaacttgcaatcgttctacattttaatgtactcggatggcccac tgactcccttcgagtagtagttgttgaaaaccggagcttcatagttgacc ggaggagccgagtaggtcggtatcggagtggtgactggagcggccacgtg gatctccggaacttcctcctcttgctccacatcctttttacgagcgaagc agtacccagactcttcatcgtcatacatcttgtgaagcatcttcggagca acttttggcttttgctgaacttgttgaactggctcgacgactggaacatc tggtgccggctgctgctgctgctggatctgaagagcctgcagctgagcca tcaactgttgctctgtgaagtccggagctggagacgcaggtgcctgctgn gtcgtggtgtacttaacacggtaggacaacggaatatccgatggtccacc atagtagtccgagtgcacaatcggctgttccggagcaactggagccgctg cttgacgaaccggtgatgatgnggcagacgcagacaaggagagcaccgaa cggtagagacggatggcatcctggattttggatgagtcagagatcctcgc gattgggtgtagcgngaaggcagcgtggcagagaggagcttgctcgctgg ctct >AW057196 agagattagtagatggatgtgataactttcgcatatcatcatcgtcatca ttgtcttcatttttgtcgtcttcgtgtattgttgttccacgattaataat tttcaacgccttctgagttcccttttctaacgaaatacttgtattcttgt taccacgatacgtagagttgttcatagatgaggactttttcagtctggaa ctaggtctcgcttgcat >AW057198 ggataagaagattgtagtaattggatacattatcggaacgactgcagcgg tagacttggcagcttcaaatccggatagactagttggagttgtgctgatt gctccgttgaccagtgcactgaggatgttctgcaacaatcctgacaagga aacgacttgcattgacaaaatctgccacatcaacacccgagtgctcatct gccatggagaccatgatcaacgcatcccaatgactcatggaatggctctc tatgaaaatctcaaaaaccccagtgccaccactgatcgtccatggcgcca atcatcattcaatcattagcggagagtatattgaagtctttactagaatt 
gcaagcttcatgcggaacgagacccttctgagctgccgagccaatcaaat cgagtcgtcctcgtcgaagaaattcaaacatgaatgaatagta >AW057200 cnaatnccttcaaaagtcgccggttgctcaatcagcataatatgagaaac aaagggcgtgataaaatgatggctgatgacaacgtctcattcacggatgc aggtgatccaacaccaggcgctaaacctcaaggaggatcgtcggcaatgc tggatctacttggaacactgaacaagaaggaagacaaaaagaagaaggat aagaagggcaaaaaaggaaagaagtcgaagggaaagtcgaagaccaagaa agtcagaaagacagacaagtttgagtcgcaaaacttcttggttcgcatcg agggaaccatattttgtgctggaatcgttgtcggattggtggtgctgctc gtcttcgttgcagttgcaatcttcttcagcgtgaagtctggaggaaacat ggtgcactacatccatccatggtggggaggacttgaagaatcgtcttcta attgagagacgaatcgaaagaaatgaaaaagtgacg >AW057201 aattggagtttttcgacggataaagaagtttcagctggaatcgaaagatc taaacgatgatctcactggtttctgtgattgccaccgcctcggctactag caccgtgcttgccatgtgctccagcaaggatcgtcaagcggaccgtaaaa agactaagaagggttccggatccaagtcttcgcgtgtttcttcgaagtcc catagatcctcaaaaaccaacaagagatctggaaaatctggaaaatctgg aaaaactggaaagttctgaaaagtctggaaaatcaagccgtggaaaatca tcaaaatctaagaagccttccaaatcaaaggccggagttcttccaatccc cggagccgcaggaggtcccaagtctgaatcgaagaagctcagacgtgata gcacggacaagtcaaagtcacagagatccaaaagatcttcgaaatctaag aaatgcgacaagtcctccaagaagtgcgacttgaacaaggctaagaacct ctgcccaacggttaaccaggcagatgtttccgacgtgtccatcanatcgg actccggtgagaaatccgannagtcgaaagctctgaagttgttnccaacc agtcgaagacacattcccggagggacaggttctgctcagtgaagagtcga ctcttctgcatcgnccacaagctccattcgca >AW057202 tctgctgcttttcatagctcaatgtgcatttggcggaccgacgaaactta ctgaggacgaacagtttgaatttttaaaacgagcaaataatttgatgcaa agcaaagctaaactcgacgcttatacttggatgaatttcgacgccgaact cgaggctgaaatattgaaaatgtcgtgcgatgaattggaaaaacagcgag gaattcgatttttcgagtcaaagtttggttaatcatgagattacaagtag aacaggagaaactggaattgcgtgcatcaagagtgattgtatgtacacag atctcataccagctgtgccgatcgctcatatgtgcctttacaatcgccca gtggtcacaacaaccgtcccaccatcaacgacttctggaactgacaatat tgcgagttttttctttggaattctcatttttggagttttgaacttgcctt g >AW057203 aattcccaggaacatcaaagagtcagattagaaaaaagaagagaaggaaa gcaaggagaccgaagttttcatcctcaagctcgaaaacacaacggtaaca ttcacaaaccccaagcttggcgatgaagtgaccattgcatcactaaacct gacaaatccgaccaaggatcggtatgcattcaagatcaagtgcacttcaa 
accagcttttcaagatcaagtcaccagttggttatatcaatccagaggag agcttaacaattccggtctaccactacccagcaactgtcattccggagaa taataagcactacttcgtggtctactacatcaaggctgcgaacacagtaa aggaaaagattccagttcgtgatttgtggaaagcggcagcatcatcagaa ggaacccgacgtgtcttcatagacttcaaaaa >AW057204 aacgaggcagcgaaaatttcccatcctctgccttcaaaaacttggtgact catccaaactaccgggattcgtctttcactccgttcgtcttgtttatttc cgatgacgttcctaacattcatgagtgcctcaaattcgaagagcgtatga gtgacattccaacgcagcacgtacttctcaaaaatgtcaaaaagatgcgt gacaacatcgaaaagaagtctcaaggtggaagaagagcatatgatttgac tcttgacaatat >AW057205 ttacatgtcacctgaaagaattctagaattcgggtataatttcaaatccg atttatggtcgactggatgtctgttatacgaaatggcagctcttcagtca ccattctacggtgacaagatgaacctgtactcactttgcaagaaaatcga aaactgcgaatatccacctctgcctgctgatatttattcaacgcaactcc gcgacttggtatcccgttgtatacttccagaagcttcaaaacgaccagag accagtgaaaggtctacagggttgccgaacacatgaacaattacttctcg ccttccggggaccaatcaacaactccttcaacgcaattctaaaaaaagct ataacatttcaatttcaaacattttctttaaaacgtagtgttcttgtatt ttcaaaaggtggaaacattcgtcaatgaccacgtgaatccgtgatgtgct aaattttac >AW057206 cagctttccatcttgcacatacaattgcactcttcattaaatcattcgat tctccatgtcaatatctgaaaactccctcaactgtaacacttaattcgga tatctgtcatgctgttaatggagttactctggtgtgtgcagtgatttcaa tgattgctactgctcttgccagtatggccgtcttcattcgtctcactaca gtcgtcgtcaaaatttcggataagagagttatggtcaccaagtcgttcat ttgacaactcaatgccacaattaatcagtagaaagtcaattgaatcggag aaagaagaagattgtttggagacaccaagaagaaaaatgggaca >AW057207 atgtatacttctatcagtttgactatcattactctgcaggattcggagtg ttccggtggcttcttccgtttttgggctccacacattgcacggaaatgag atatgtgctcggcaaaggaataatctcgaaattccgaccaaatgataatg ataagaagatgcttcacgttatgacaacttattttacaaattttgcaaaa tatggaaaccctaatggagaaaaccaggagactggagaatggcaaaagca cgactcggcacaccccgtccgccatttcaagattgatctggacgattctg aaatggttgaggactatcaggaacggagagccgagctatgggataaactg agagcattaaatgttagcagggctcagatgtgaaattgct >AW057208 aaatttcgaatcacatcaagacatgccgattgaagttacaatgtctacaa gcaatgaatccgcaaatcttctggtaccctcatttttgccacgtggcgaa gttctcggttggaatctgacgtcggtggtgaatccggtgactcgacgtaa agaatacacgtacacggtgtgtgtgcagactgtcggagcatttcagtgca 
ccgagcaagcgggcgttgtccttgtgaagtgtgatggtccatgtggaaac aagttgccaaccaatcatttgattgcattgggaaagtgtgatcacatgtt gtgtaaggcctgcttcggcattgtaaagaatccggatggatcctatggat gttcgaacttctattgctggtcggaaccacgtggaaacttccggaaggaa aaggccaactacaataaggtcatcaacaagcagatctgccgtgccagaaa gttcaaacaagacggggaagatttgcaagcatgcagcanatcgaatcttc ccaagactcctgctgacctatcggaccgtgagatgaactctgcgaaatcc tcggaatcggatacgtcgtcttnctgatggttcttcctaccaatccactg ttgactcgtactgactgccaanaagattctaa >AW057209 ttgtgtattcgaaagaaacatggcagaaaatagtgtaacacttctacaac ttgcccatctcggttacagcatgctcgctccaatggtgttcaccggatac gtcattgacaatgtggaggaaagaagaagactggtggatcatcgggtggt ggtggagctaaaactggaggaggaggaggagatggtgcaaccagtgcaaa gagtgataagaaagaaaaatcatcgagtgctccaaacccagagggaccaa aggctccaagtgataagaatgctgtcgcagggacacatgatccaaattat caaactcttgctggagttgatggaaatgtgttccaagagaaaggaaaagc ttctcctgttgctgctgctggtggagcttctcctgctgctggtgctccaa aacctggaggtcctggaatggctgccacccacgaccccaactaccaaaca cttgctggaattggaaacgattgtttcgacaagaaagaaggtgcaaaacc agcttgtggtggtgcggctccaggtgctccaaaagcctgtggtcctggaa tggctgctactcatgatccgaactatcagacacttgctggaattggaagc gattgcttncagaagaagtgattgtgtcaactcgtgcanaatcgggtaca tnnacgaaatatgtga >AW057210 aaaaatgaacgatcttgttattaatcagaagattcttccagatatttcga aatcgaaatgggatctcgacacgtattcaggtcgtgtaaaacactacttt gcatctgctaacccgatgacactcttcacctcatccaatacccaggaaat gtgcaggaaaatagttgtagactataaaaaggggataataaatccggaat tgacgatggatgagctatggagtgcaaagatcctctatgattcagtatat catcctgataccggcgaaaaagatgtctgtctcgggagaatgagcgctca gaccccagcaaatatggttatcactggaatgcttctcagctgctatcgta cctgtcctggtattatattctcccattggatcaatcagtcgttcaatgca attgtcaactataccaatcgaagcggaaattgcagaactaccaatcagca gctactctattcgtatttctgtgctactggagcggctacaacggcggctc tcggtctgaatatgatggtgaagaatagtcatggattggctggaagattg ttccatttgtggctgttgagttgcaaatgccattaatattccaatg >AW057211 gaataaattcatttttacactactcaacaattgatttgctgcttctttct ccacaatctacaacttctgaaaaatgatgaaaccaccgatttgccgccgg atgagcctgccaccagttctcgcaaagaaggaaggtgaacgaattcaagt tccagaaggtggagaattggtatgctaccgaaaccaagaagatcaacatc 
ttttcttaaaaatagaatactcgcgcgaaacgatgattaaactttcatcg tctgttttctgttatgcacgaccaactggcctcaaagaagtttccgatga gttgccggaactacttggactaccccattcgattccactgacagattctc cgccgaattttgaatgtcttcctgaaccattggcacatgtttcgggatgc aattcacatttcagtatgaaccatcctcctccacttctgagccctctacg tggccctgtaactgaagctgatgaggcgatgcgcacactcagtagtcatc gagaacttcagaagcggcttcaaaacatttcaatgcgaggagagttgagc ccttcaggatgcctggttgacagttgtgacacacctgcaattcgtcagat tcgcaatcctcagttcacttga >AW057212 tcgaagttactcggacttctcatcttcatcgctttttaatcaacatcatc atgtttggacgtttgaagcagaaagttaaggaaaagactggacgtgccaa ggcgacaactcttcccgcagaagtggacgatgcgatgggctacttcaaaa atctgacgccacgtgtcaaggaccttcacaagagcatgacaaacttggaa gatattagcaagtggcagaagaaggccagtttctctggcacccttgagaa ttactcgcgtctcggtgacaagatcaatgtgaaaccatttatggatgctg ttgatgctagaatgggtgccgaagctgatgccgtgaaaggggtcctcgcg atttgtgaaaaatacaagtcattctaccaaaacgagggaaaacttcacgc ggacagtatcgccaatttgaataggactcggctcgacatggacagtgcgg cggataaatatgcgaacaacgagactgaagttaacaagactcgtttggat acagtaccacggaatttgaagtggcttgtgagagaatgcgagaactggcg aacggaatcaagacaattgaatcgaaccattcttcctggcaagacgtctt atgaggagaataaagtgcngtgcgtanataa >AW057213 gataggtgtcacccccaaccgattgtccttcaaaaaattcgcattttcaa tcatcgagatgatgaggtgaaaatcgataatttttcggctcgtcgagctc caaaactccgcgacgatgatagtgacgagctcaacgtggatgtaccagct gaagatgatgatgatgtggaactcgatgacgtcatcgttgctcaaaatcc ggcgttttacggcacaattgaggcggagaagtgtgcggaacgagttgctg cacatctttcgatggcctgtgagaacatggaacgtctgcaatttgtgagc gaggccgtgtatccacagagtgctgatcatttgaagaaacttcaagaaat cgatgatgacgtcaaggatttcaattggcagatgagagagcgtcgtgtca aggcttcaaatccagcaggaacagccacaaaagttgcacacttcatt >AW057214 tcgaaacttgtgaagaaggtctgcaatggttgccgcaaagaaattgatgc caaagcgaatgaagaagaagatgcaggtggagagaatggctcatgccaga gcttgcaaggttgcaaagagagaggctcgcgtcgctgaggaagcatctgg aaaatctactggtggatctactcgcggagccaagtgatagccgagccaca acacatga >AW057215 gaaacttgtgaagaaggtctgcaatggctgccgcaaagaaattgatgcca aagcgaatgaagaagaagatgcaggtggagagaatggctcatgccagagc ttgcaaggttgcaaagagagaggctcgcgtcgctgaggaagcatctggaa aatcaactggtggatctactcgcggagccaagtgatagccgagccacaac acatg 
>AW057216 aattccgaattccattcgactaaccatctttttgcaaacttgcaccaaat caaccgacatgcaatcaatcaacatcttgttcgccatgctcctcatcttg gctccaattgtcaatggagacgatactgccgttgctgtgactgcaactga agtcactgaagatgcaactgaagtcactgaagatgcaactgtggctcaca ttgaaacaacagccgaagccgttgcagaagcagaaccagctacagaacca gttcaaaccacccgagctgttgaagaaactacacaagctgttgttgtaga atccactcaagagactgtaaatgctgtaaccaatactccagttgatactc cagctaccaacaacgtcgaagcaactactgaagcggcttctcgtccatcg ctttcatcgactgttgcatcaaatatgacttctgctgatgacttgctcgg agaaacttcaaccaatgccacaaaagctgcttacaacactggaaccttca ttgttgtcccaatggtcgttctcgctttgattcaatga >AW057217 gatgaaccgtcagacaactgctctcctcctcatcgtctcgatccttgccg tttttgtccaccatggatttgctgctgctgaagaagagaagaatacagct tcagtcgtcagccctgctccggactctgaagcagcccaacctgctggaaa cggaaccgaaacaccaaaagatgaggtgaaggatgaggcaccaaaagaag gtagtgaaactgaagcttcaccagaagccaagacaaaaggatctatggta ttccatgctcttggagccattnccacagttgttctcgccggcattatgtg aagaagtctgccgaa >AW057218 tctggaaagaggagtttcttctgaagacatantgattccttctgttcgtc ggggtgtcattccagtcaacactcttcgttaccaaatagaaaagcatctc gagatgtgtactccagcttctgaacaattgtcaaagagttcggatcccaa catctcctcgatgtacgttttccatcaaggaattcaagtaaagcaggaac caatcgatgatgaccaagaggaagagcaacaagtacaaaagcagcttgta ttcaaaatccgaggcttcgaaaacgaagaagctgtgaagaaggagtg >AW057219 tcccaacaacacctcaagatgaatgccatttacactgccgtccttgttgc ttcaactctcgcctacactgcaatggcttggattggactcagcattgaag ccgccaacgaggatatgatctgaagtggcgcc >AW057220 ctcttttctcttcgtcgccgnggatcaaaacgttgcccgacagggatgag ctcgaataagacctctcgctcaacttcgtcatcgtctgtcacatcatctt caggacatggtgcatcgagcttctccgaggattcgtctgttcgctctgtc accaacagtgttagaagtactagaagcgctggatctatcatgtcaatggc tagtgccgaggcaagtgtcgttgctccagatctgacaatctaccatggag atcgttagcaatcctaccagctcgctgacaaggggaaaatggtcgttatc aaccggaaaaatggggtgattgtctacatgcttcgttgtgtcgacggccg tcgtgtctacattgaganatcttccgaaggagccagtcttattctgacta atcaacgtggaaaagtgatcaaggcattggccgngcactactag >AW057221 atcaactttgtctcctcaaaaataagtctacaaatgatcaacatcgatcc accatccggcgactacccagcttctggtggttcatcaactcactacattg tctccgaatcggaatctcgtttggcattcaaagtcaagtcgtccaacaat 
gaatcgtatcgtgttcgcccagtctatggattcgttgatgcgaagggaaa ggctaagctcgaagtgaatcgtttggctggaccagcgaaggaggacaagc ttgtcattcaatacgccgaaagttcagctgatgagaccgatccgaaggct ccgtttgcggctggtgctcaacaaggagaagtcgttgtcaagatggttgc ta >AW057222 aacattcgactaaccatcatctttcaaacttgcaccaaatcaatcgacat gcaatcaatcaacatcttgttcgccatgctcctcatcttggctccaattg tcaatggagacgatactgccgttgctgtgactgcaactaaagtcactgaa gatgcaactgtggctctcatcgaaacaacagccaaagccgttgcagaaac agaaccagctacagaaccagttcaaaccaccgaagctgntgaagaaacta cagaagctgntgntgtagaaggcactcaagagactaaagatgctgaaacc aatgctncaagtgatactccagctgncaataacatcgaagctaccactga aagggcttctcgtccatcggtttcatcgactgttgcaccaaatgtaactc tggtgatgacctgttcgggaacctcaacaatgcacacaa >AW057223 tcgattctcgaaatggctccaacacgtacttcacgacgcagttcagcgaa cttttcattcgacgatgtaaatgttgaggaacagaggcaagcgtatctcc gctacgaacaggaattgaaggatctcgcacttgctcgaaaccttgagaat gaattgaactgggggccaaatccagccaacccagcgcctcctcaaaatcc tccacaacctgaaactgttcatatccaagtgaatcgagataatcctcaag ctcaacaacaaaatctgacaggaaccatagctccagctggaagagaaggt actggagtacaggtggctgctgtggccccggatcccaccagcgcagcgac tgggtcacaaggacaaccaactccccagaatgcacaaaatcagccaacaa tcagagccgcagctggaagagaaggttatgganggcacgtggacgttnta accgccgttcttgcaagaaacttgccgtcacaaggaatcaatgaaaatgc cataaaccctggatgatgacgaatctgaagtcaattgtcagactcttttg aagctgctccatcgaatcagccaagcacatcacaaggcacctcaaatcat gcaggctgagtggcttcacgatcgcgccacgggngaagtgaccgctgttt gggactcagtgaccaaacaagg >AW057224 gaatggatcgtagcattgatcgtcgtctttgtgggcactgtcaccaatcg attggaagtgaggcacttgtcgccatgaatcgtctctggcatccggacca cttcacgtgctcatcgtgcaaacgtccgatcaagcagacgttccaggctg ccgacaaccacgcctactgtgtccagtgctttgctcaaaagtacaacccg aaatgtgctggatgcatggaaactctcgtcgacacgtgccttcttgcttt ggaccgacactggcattcacgttgcttacgtgctcctcgtgcaatcgccc attgccaaatggagaattctatctggttgatgataagccgtacgacttgg attgtcactgggcaaagcgtctcgagaagagagagcacatggaacgtggt gaacgttaagaagaacgccgttaatttgtcgaacttccccactgtttttt tccttgtattcttgtgat >AW057225 caactcaagaagaagaagaagatttgctgtcaaatcggcatacatgtccg ctggaggatacggatctggatacatgggagcaaatgcttcatcatctgga 
tatgctcgggaggactacgctcagggtggcgctggaggatccggaaatca gaatcagggaggatccggtggaaataccaatccaggaggacaagtgttca aggcacgtaccgatcaatcgtgttacttgggaccataagtctcaactgta ttcgaccggcaatt >AW057226 aaaaaacttcatcaagatggctgcgaatatttatttatggatgtcgaaac gcaacgtcgaattgaatacacacgacagaaagtgatgagaatcgagcaaa tgaatgagcaacttcggaagttgcaagtcgaataagggtcgcgaggagaa gttggacagacttttgaaacgaaaggaatccttggaactggacgttgcaa gattgactgacgcgtcgatgagagctgagccagaggtcggagcggagctt cttcattctatcgaagagcccatggaagttgatatgatctacggtgaagc attccacgcaaagacttgtcaactgaaagtccttctcaacgagataattg ttcgaacaagcttcaacgagaaagcaatgtgtaaagagattggacatcag gaagctgaattcgagaatcgattgaaggaggtgatcagtggaagagctca attgacgctgaaatctgaagaagctcagaagaaatgtgaattgttgatga gagagcattcgaatgtttatgaagatgttcgagagatggaggataatatt gagaagtttgatgcatcgagatggttgttgaatgtggagaaaagacgagt ctctgatattctgatcgcaccaaagagagccgannatgggaatgagtgct gccagccatacattggtcttcngaagcttctcactttgagacttgcacat caaactgagtctcactcacgatcattt >AW057228 acgaaagagatggcggacaagtcggcatacatgggtgctggtggatatgg atcgggctacatggggtccaacgcctcatcgtcgggatatgcccgcgagg attatgcgcaaggaggaaatggcggaggacaacagcagaatcaaggatct ggaggaaacaccaacccaggagggcaggccttcaatgcccgcactgatca atcgtgctaccttgggccataagtggcggttcgaataatacaagagcaag tagtcaaccacccc >AW057229 atccgtccggaaaagctgcgaatgctgcgagagcttatggaagcgggatg aaaaactctgcagcctctttcaatcactcttgagagcagttcgataagct tttcgaagcttgtaacacattcaaggatcaaatcaatggaaacgtccttg ccaaattgatcagtttcaaagatgtggaatgcaaggatgtcgacactcaa atgacccagttgaagaaggctcagaaggattatgcgaataagaaatcaaa gatagttccagatcaagttcaggttgatgcagctgaagcaaacttaagaa gtt >AW057230 aagatcactctgatcgtcgtcgcaccaacaccttttcttcttcagccgat gaggatggagttccaaatgaggtcgccgactacctggtctacttttcccg catggttgacgaacaaaatgtgccggaaattttgactctctacgatcaag cttttccagatctcaccgagagattcttccgtgatcgcatgtggcccgat gagaatgttgtcgaaagaattattatacagtattg >AW057231 tctctggaagcactctgctcgttcaatgtggtggaaagaaaaagggagca acttctgccgaaggaaaatcttcgacgatgggcccggctcctggaggagc tcctgctgctgcttccgctcaaggagaacctgaagagaaggagtaa >AW057232 gaattcgagcagcaaatctgcctaatcgttgaaatctactcgtggagcgt 
cgaagagtgggaaatctcgccgttcatcgaaagctaagcactccaagaga tcgtcgaagtctagtaagaagggaacgtctggaaaatctggaaagggaag cagcaagcgtggagggaaatcatcaaagtcttcgaaatcgaagaaagtca agactgctaccacctctggttctcaagtttcaactgtttccgctgctact ggtgtttctgataagcaatctaactcatcgaaatcttctcgtaagagctc aaagagttcgaagagccgtaagaatcgtcgacttgattcggatgcccaga agaaaatggagaaatcgggaaagagcggcaaagttgctcttattccaaaa acgcaacaaacaactggaagccaagttgggtatagccttgctgaagaagt caattcgatcaagcactccaaggaaatgaatgtggctcctgctaaacttc aataccagacacttggcggagtcaatcaaattgaattgaagaatacttca aatgaacgtaaagcttataagatcaagtgctcggataactccctctacgt gtcaatccgtttatgatttgctgagcacgttctccgttaagatgatggtg agat >AW057233 ccggatccacagggaaagtactactgtattgtaggagctgatcgtgcgtt cgggagagaagtcgtcgagacacattaccgggcttgtcttcacgccggac tcaacatttttggaacaaatgccgaagtgactccaggacaatgggaattc caaattggaacctgcgaaggaatcgatatgggagatcagttgtggatgtc gagatacattctgcacagagttgctgaacaattcggtgtctgcgtatccc ttgatcccaaaccaaagggtcaccatgggagactggaacggagccggatg ccacaccaacttctcgactgccgaaatgcgtgctccaggtggaattgctg cgattgaagccgccatgacaggactcaagcggacacatttggaggcgatg aaggtgtacgatccacatggtggagaagacaatcttcgtcgtttgacagg acgtcatgagacaagttcggctgacaaattctcatggggagtcgccaatc gtggatgctcaatccgtattccgagacaggtggctgcggagagaaaagga tatnctggaggatcgtcgtccgtcatcanactgggatccttatcaagtga ctgcgatgattgcacagagcattctcttctag >AW057234 gaaaattcaatcaataatcactatatcaatgttcattatcatcgtcgtct cctataatgagctgacagctgaggaaaatgataagaggctcgagacatgt ggaaatgagcatattgggaaaccatcgaaaaactcgataatctctcctgt ctcctggcttaccaaattgacaagatccgaaatttctgctccggcagtta taatatctcctcgacacttgatcacttcttcacggcttttcctcacaaaa tcagcttggaaaaatagcggagattcgattgattgtgatgacagcataaa gcacttggaagttccggtaaacaagctctcggatgtgattgagccgtgcc tttctcaaaaggagaattgctccccgaaagtgatgaattttgccagagca tatattctgaacttttgcaaatcaacattggtgcaaaaaagagtgtactc cttcccaatgattntggagcttgatgagaatttggaaggcaactcaagtt atccatgtctagctgatgaatcaatcanacttgccaaaggagatgccatt gancgcttatgatgacannacaattcgaatggagcatcgaanagtgatgt cggcgccgtgtagtccgatatctataccgactgt >AW057235 agttactcaagctgttccagaaggatcagtacttagttcagatgtcactg 
atcgtccaaacatcgactccactgatgttgtatcaaatgcaacttcggtg gaagatttgcttggaagttcaacaaatgcaaacaacactggtacattcaa ctctaggacctttgtaattgctccaatgatgattcttgctttggt >AW057236 ctcccaacaacacctcaagatgaatgccatttacattgccgtccttgttg cttcaactctcgcctacactgcaatggcttggattggactcagcattgaa gccgccaacgaggatatgatctgaagtggcgcccat >AW057237 ttttgttttggtttgatttgtccgtaaaagttttcacaaaatcatcaatt ttctgttgtttttcttgctccttggcttgtttcttttccttaactgactc gacgaattctccaacttgttgagaaacctccttccatttctcgatcgctg cttcggcttcctttttcttggcttcagctttttcctgtttctccttttcc aattcatcggctctctgcttaactgcatcagatacctttaaagcgtttcg ttcagtttcttctgagtctctttcccgtatttgtctcttgctgctctcgc attntcaaaattaactctcagttcttcttccaatgccttttccgtatcaa ttactgactggaaggtgtcagcaagggagtagtagcgaacggatgaaacg ctcgacgcaaaaaa >AW057238 gcaatgtcacgttgtcaactcattgcaaacaaagactggaccaacattga acggagtgattggaagacaatccggacaagttgctggatttgactactcg gctgccaacaagaataaaggagttgtatgggacagacaaacacttttcga ctatttggctgatccgaaaaagtacattccccggaactaagatggtcttc gctggtttgaagaaagctgacgaacgagctgatctcatcaaatttattga agtggaagctgccaagaaaccatcggcataagcctctactaaata >AW057239 tcgtcagccctgctccggactctgaagcagcccaacctgctggaaacgga accgaaacaccaaaagatgaggtgaaggatgaggcaccaaaagaaggtag tgaaactgaagcttcaccagaagccaagacaaaaggatctatggtattcc atgctcttggagccattttcacagttgttctcgccggcattatgtgaaga agtctgccg >AW057241 atgagccaaagtgtgtagatgttgttgaaggaaaggaaagttctggagtg tgcaagacgaaaggcggagtctgtcgctttggtcattgctgcccatcact taccctgacaattgcaccatctggaaatggtactgagtcagcgacgccta ccttgggcccatatccatacttgactaaattatccgtgtgatgctaacaa acctatcccatctcaattcagcacctatgcattttgcgatcctgacacta atcgcgttggtattttgggcaaaaggcacttaactggagaagaacgtact gaggtgaagggatcggcatgctcttctaacaaagactgcaagtcgggaac tgtttgcgtgtatgttaatatcaataaacacgtctgctactaccatccgc tgaagaaaatcgcccgtgatgtcagtcaaccatggctctatgtgctcatt agcttcctcatttgcggttntatttntgtcattntggcagtcatgagctt cgtctgctaccgttcgaagtctgtgtttgacaagtaccagccaaagaaga atgcaggaacacatggtagcagacagtgatgggaagaaaggaaagatagt gggaagaacgagacgatacttanagtcaacgagctcccagtcaagaccag agacagagctgatcgacagcgga >AW057242 
ttccacctggtaccactgtcgacactggaatcgtctctcccgaaggattt gatttctacctatgctctcactatggagtacagggaacttctcgccctgc gagatatcacgttcttctggatgaatgcaagttcactgctgatgaaattc aaaacatcacttatggaatgtgccatacatatggtcgttgtactcgttcg gtctccatcccaactccagtttactatgctgatttggttgctactcgtgc ccgctgtcacatcaagagaaagctcggtcttgccgacaacaatgactgtg acaccaactcgctctcttcatcacttgcttctttgctcaacgtgagaact ggaagtggaaaaggaaagaagtcacatgctccaagcgtcgatgatgaatc gtattctcttcctgacgctgcatctgatcaaatccttcaggactgcgtct cggttgcagctgactntangagtcgtatgtacttcatttgaagactcttc atgcagacggagccagagaaa >AW057243 accgccttcgtcccaaatgacggttgcntgaactttgttgaagaaaacga cgaagtgctcgtatctggtttcggacgttccggtcacgccgtcggagata ttcccggagttcgtttcaagatcgtgaaggtcgccaacacctccctcatc gccctgttcaagggaaagaaggagagaccacg >AW057244 ccgnctccactcttcacttgcactattcttatgcaaccaagcaacgncat gctcgctgttcttctcgccttggcttcatttgctcaaggaggcagatctg ttgctccggctggtgcagtcactgaaccaacagttactcaagctgttcca gaaggatcaggacttagttcagatgtcactgatcgtccaaacatcgactc cactgatgttgtatcaaatgcaacttcggtggaagatttgcttggaagtt caacaaatgcaaacaacactggtacattcaactctaggacctttgtaatt gctccaatgatgattcttgctttggtgcag >AW057245 cgaaaatcgacaacgagacgcagtcacagtgagaaaagttcatcgagagg ttccatgtcatcaccgccaaccagattctatccatctgaagattcagagt cgatttactcgactcgaaaatgctccaaaaggactacgacgactgctact acggatgaggagaagccgaacaacagctactacattgacgatatttatga ttcgactgaagaatatcaagtgacattcccgacggttgagctgaaattgc cacgtcagagaaagcattgccgcaagcgatcgaagagacaggatcaggca cagggagagcatgtgacaatcacgaaatgtgttgatagaagacaagtcta cggagagcccgataataagaacaccatatccgagcactctacgtacacct actctacccatccggaacgttgctctcaggccggccgtacttctcgttcg aacagctattctgacgccacagatgccacatatcggactgg >AW057246 gcttggttatccagcagcactactgcaattatgtttggaggtggagattc aaagcctatcgataagaagaaggaggacaagaaaggtttcgatactcgaa aattcttgattgatctggcctcgggaggaactgctgccgctgtttccaag actgctgttgctccgattgaacgtgtcaagcttctgttgcaagtacagga tgcttccctcaccattgctgccgataaacgttacaaaggaatcgtcgatg tcctcgttcgtgtccccaaagaccaaggatatgctgctctctggagagga aacttggctaacgtcatccgatacttcccgacccaggcgctgaacttcgc 
tttcaaggatacttacaaaaacattttccaaaagggattggataagaaga aggatttctggaagttcttcgccggaaatctagcttctggtggagcagct ggagccacttcgctctgttttgtctacccattggattttgctcgtacccg tttggctgctgatgtcggaaaagctaatgaacgtgaattcaaaggcctgg ccgattgtctcgtcaagatcgcanagtcggatggnaccaatcgactctac agaggtttctttgtctcggtacaaggtatcatcatctaccgcgccgctta ctttcggaatgtcgacacttgccaagatgtgttcactgctgatggcanga aactcaacttcttcgcttgcctggcttattgcctcagtngntacntnngt gatctgnatnctctnctatccatggnatcctgntcgtcgcgcatgatgat caagctggtcgcaagatgtctctaccagatactttgattggccg >AW057247 tttctctgaataaccccaaacgaaagacattcgatgttggacaagtcggc ttacatgggcgctggaggctatggatcaggctacatgggatccaatgcct cgtcgtcgggatatgcccgcgaagactatgcacaaggaggaaatggtggt ggacaacaacagcagaaccagggttccggaggaaacaccaacccaggtgg acaggtcttcaaggcccgcactgaccaatcgtgctaccttggaccataag atgatcgacactaggagagccagtagccaaca >AW057248 ttaaaagcaagcttcctcgtagctcgtcgtctggattctcgggaacagga tctgaaagatctatttcaagcaaaagatcgaaaattggaaacacttctgg gaaccgtaacaagatttcgcaaagtgctagaagaagacttttgcaagctg aaattgagagtcgacgcaatagatccaacaaatcaccaggttccagtcgt aaaagcattctgaagaaatctccgatgaaaaaaagaagatctctttcgag aaaatctttaccaaagaaagagcatattcctccagttcaaaagtttgcaa ttatcaagaatccagctgctcgtaatcaagttcgtggatttgttgctgaa tacgcacaagatgcagaatttgaagcttttgagcttttggttgacggaat tccatttgttgctctctctctgatgaatgctcatccagatcttcgtggaa gattcaaacctgttccaccaccaacaccgatgaagaagcatanctcaggt atncaatagcccacgaaacgntcctttgcgcgtgaatctatttccttcgt taannctcganaagtgttcgtttcactgantggaggatctctttntcgaa nagagctagaaaatcttcatccgggaatgcgattcatcggatcgtgagct tggagnagacagtatcgnatgccatgatntgacacttgtcgcagtacaat ggatttgtgggaagatgtgnaccatagtgtgaaaa >AW057249 tgactgtgaatgattctctgctacaccattttattatcatcatctcggta tccaattccatattcttatttcgagttatccagaaatggcgatccattac aatatgtggagt >AW057250 aattccttctacgcgaaaacaaatgcacctcccgttccgttgcccagtct tggtgaccggcgtcgtcgtcggcggagccgctcttctggcgattgccgcc tactactactggagccagaaaaagaaaagctctgatacttcatctgccac gtcatcggagtccaacgatgttgtcatgatgtcatcatcggagcccagag ccgatggaggagccgattcgaaggcaaagttcaatattgaggatgaaaat 
gtgagaagagtctgcgagaagctgttcatggagcagatggatttggggga agcttatttggaggatgaagaaaccgaggagctcggcgcaatccacatgg ctaacgcaatcgtgctcaccggagagactgctcagctgctcaaagtgctc ctcggctcgatttcaccggctcactttgccaatattcaaaagtacctccc atcggctgactngcgtgttcaccagcttctccaagacgagctcgccattg agactattgcccagcatttcgac >AW057252 accattaccgtcgttttgaagaatgatcatcgtcgacccaccaactggaa acttccctgcatctggaggtaattcggttcacaacatcgtctcggagtcg gattctcgtttggcattcaaggtgaaatcgtcgaacaatgagcactaccg tgttcgtcccgtctacggattcattgatccgaagggaaaaaccaagttgg atatcaatcgtattgctggaccaccaaaagttgacaagcttgtcattcaa tatgctgaagtgccagccgatgagactgatcctcaagcaccattcaaagc tggagcacagcaaggtgaagtcatcgtgaagatcactgccgaatgagaaa atagatc >AW057253 caaaagatagtacaaaaagtacctatgaatgtccaattcctcagggtgga ttctatggattagctgatcatccaaatcatgggcttattgcatcgatttt gaagaggaaacgccgcaaaatcaagatgacagaagacgtaaagaagatgt gcgggaaggttgaagcttacaagacatgctccgacaagctccatcaggca ctattgttcatgctcgtcgagagtccagaaatttcaaaggatttggtgac tcacttcaaaactgaaccaaaattatcgtatgctggaaaatatctcaaaa cttatgaggcaattgcaaataagggacgggataagacaaagtatgagagc ttggagccagcgattagtacgctttctctgttggatgcagaacgcgaatc tcgtgttcgcaagcagttggacaatttgaagccgctgacaaaattcattg gagaagaactattggagtacgcacggttgaggagagtgtactgggatggc ctggaagcctatgacgatgcgctgacccaacagagcnaagatcgcactga ggaagccgagcgaatcactgtcaacgctcaaaaatggagaaatgatgtcg ccagaaactgatggattcatcaaaaacgggattntcgacagagaccaaag cattgtgatgctattctgaaatccggatgaagcatcctctcncatcgcgc catgtcgaccacacaatcagccgaggntctgagaactgtggaaccccta >AW057254 cagcactgggtatttcaggcccgggtgtttctgcccaaaataccgcaact ggtggtaaagttggtgaaactagtgaggtcacaacacaagtgtttcaagc ttcgacatatggtgctgtcaaagcaccgaagattgtggctgatgcccagc aaggaaccaatagaagttcagaaacacttgaaaataaaatggtacacgat ttcatgtggtggattgggtggagtcgttggagctattcttttctctggaa tattcttctttttcggtgtagcatctatgcgtgataaagtctactttttg gaaattcctgacattgtacaaatgtttggtataccctcgactgtccagaa tgcatcg >AW057255 ttttatcaaagtgatacgctcaaacacgtgacaatttgaaaaaatgctta ttgaactcctcatatccatcaccgtgctcgcgtcgattctcgtctcgtgc aagaaagagaagaacgaagcggcaaaactgaagccacgtggcttcgatcg 
aaaatccagtggagcaccggctccggctggagccaacttggttaacaagt ctggcggttccggcggaggtagtggaggatctggagatgctaatagactt ctcaaaagagaaatcaaagaagtaaagatggaggagcgttcggctgacga taacgagacaatcaacgatgtaaagtccaactggggaactgtttgaaat >AW057256 aaaatcgatttttcgtcatgattcacgacacgtacaatccgtacactgct gttcagcgaagcccatcacaatggttcatctttgtgccatcttctgcacc attggataatactacaatcattacattgtcccatgatgctgtcaatttgg ctagtattccaccaacggactctgcggaaagtcttcaaagtcgttatgta gtcccggaagtaattccattcaaaagctcatctgaagttgacactcgttc ggctggaaatccatatttccatcatccaaacttcaccactccatctcaat actttgaacatattatggattctgaaggagaacaacacaagatggagaat aaaaacgaagaaaatttgctgactattatggatatcaagaacttngagga tccatttggaaattatgatatggatggagaacaggcaccaatggttagca tcaatccaccaaatgatgtgactgttaacaagaacgccttctcttctgac tcaattgctgatattcagaacatgaacatctcattttcgatcaaagcttt cactgctccagctatccccatcagtctcctgagggacacttantaagaaa tgtgagcncgaaatatgttcgagtgagcagactgatggaga >AW057257 aggtgatgtctacaagaaggcagtgcagttctactcaaacatcacggctc caagatcaacatctgtgcttgctccagtgatgtcgtcccttgaagtctac attaacaccaccacgacatctgcttttgcttcagctcagagcatcaaagt ggctgatattcttgaagaagatgcagatgcaatccgtgtgaagtcaatta gaatggctggattcattgcccaatgcatcatctttttatttgtctacacc atcgtcacgatggatgttgagatttggaaaattaatatggactggttgaa aattcaatattttcagcatttcgaagactccgctgctgaagttccggtct >AW057258 aattccgaagaacactcctcgattgcaaattgtaataaccctaaaaatga ctcatttgaacttcgagactcgcatgcctcttggaacagccgtcatcgat caattcctgggacttcgcccgcatcccacaaagatccaggcgacctatgt gtggatcgatggaactggggagaatctgcgctcaaagacacgaacattcg ataaacttccgaagagaatcgaggattatccaatctggaattatgatgga agctcgacgggacagggcanagggacgtgactcggttcgttacttgcgtc cagtcgccgcttacccggatccatttttgggcggtgggaataagttggtg atgtgtgacacgttggatcatcaaatgcagccaacatcaaccagccatcg tcaagcctgcgccgagatcatgcatgagatccgtgacacccgcccgtggt tcggaatggagcaggaatatctgatcgtcgacagagatgagcacccactc ggatggccgaaacacggattcccggatccacagggaaagtactactgtag cgtangagctgatcggcgttcgggagagagtcgtcgagacacattaccgn gcttgtcttcacgccggactcaacattgttggaacaaatgccgaagtgac tccaggacaatgggaattncaaattggaacctgcgaaggaatcganatgg 
gagatcagttgtggatgcgagatacattctgcacagagttgctgaacaaa attcgtgtctgcgtatcncctgatccccaaaccaaagtcaccatgggaga cttgaacnggagccgatgccacaccaactttctcgacttgcngnaaatgc tgct >AW057259 taaaatggatgtcaagcataatccaaaaaaagaagtgatcgaacgaagtc tgaatgcgactgaaaaagcgatcgaatcgattcgtgaaggtgacctagtc cgtcgtcacactgttttggagctccgtgacatgcaaatttcgatgagtga ggatttcgatcaaatccgtggaatgatgcatgaattggatcaccaaattg acaaggaacgcgctgaaaattcgaaatggatgaattggaaaattgagaag gccaaggcaactgctgatcaggcgttggcctctacactgatggtcaagga cgtccagctgttggaaaagaaggtcaatattctgaaggattctgtgattc aagttaacaaggcattctacaagtatgagaaggatgttgacatgaaagat ttgatggatcaagttactgacatggtgcatcgtacggaaaagaaggagca agatgcgttggagccccatgcgactgatgagcaagctatcgagaaagcct tccgtggagcaattgaaggcctttatggcctgaaatcttccaatccaaag gtcatggaggaagctaagctgttggctggagaaatgcgtgttttcagaga tgctngctgctacaagaacttncactcgatgatctcaaaagttgcgcctg gtaaatcggagtttccttgatacagcagctctgagactcatttcgacttc tgatcctctcgctgccaa >AW057260 ctttacggcggaaaaagtgaaattttcggtagtttctcgaaaaatatttt tcaattcaagcaagatgccagacgatgtgtgcgacgaaacattgaaaatt ggagtggttgttgggaaaaagtaccgagttatccaacagctgggtcaagg cggctgtggatctgtgtacaaggtggaggacatcgaagacaagacaaagc agtacgcgatgaaagtcgagttcaactcgaatgccaacgctggaaatgtt ttgaaaatggagggtcagatcctcacccatcttgtgtctaagaaccacgt ggcaaaatgcatggcaagcggaaaaaaggatcgatactcgtacgtggtga tgacccttctcggcgaaagccttgaatcacttatgaagaaacacggacca tttttcaacgtgtctacccaaatgcgcatcgggatttgccttttgttcgg catcaagcagattcatgatattgggtttatccaccgtgacttgaagcctg ccaacgtggcattgngaaataaaggctcccctgacgaacgctacttcatc gtgctggactttggcttggcacgccagtacatcacggataaggaggacgg aaaaaaagagcgtcgcccacgtgagaaagctcntcttcgtggcancctcc cgttattgctcgtagctatggcacatcgtttcgagccagggagagtgatt acctgtgggccttgtctacatngctcgcggangtggatgccagctggnct tgtctgatttggatgataaggcgaaatcnnggaaatgaancnaacgtngc cgacccagatctcttggcaaaagcccaatcaaa >AW057261 gccgagaatgtgctgggagcccctggagctggattcaaggtcgccatgga agcctttgacatgacgagaccaggagttgccgccggagcacttggactct cgtggagatgtttggacgagtcggccaagtatgctctggagagaaaggct ttcggaaccgtaattgccaatcaccaagccgtccagttcatgcttgctga 
catggctgtcaacctcgagctcgctcgtcttatcacctacaaatccgcca acgatgtcgacaacaaggtgcgctcttcatacaatgcttcgatcgccaag tgctgtgccgccgacactgccaaccaagctgccaccaacgctgttcagat ctttggaggaaatggcttcaactctgagtatccagtggagaaactgatgc gtgatgccaagatctatcagatctacgagggaacctcgcaaattcagcgc attgtcatctcgcgcatgcttctcggacatttcgcgcaaaatggaactag cagaatttaggatgtgccgtttttgagcaa 6.fa100644000766000024 3174014006427353 15772 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data/dbfa>CEESC12R gcacgagtccatctccatatgccaccacaacantggtcctgtcgaaccaa caaccagcttggctcaatgacaaaatgcttcgcgcgccanaatgccaaca aatcccgtgccaccagagccaccggcgcgatatgcagatcataccgctgg aagacgatctcgatcgagccgtgcatccgatgggagaggaactctgaatg gcggactccatcaccggactagcggaagtcaacggtcggatagtccacct cacacagatgtgagctatgttcagcttcactcatccgatggaactggtag tagtaaggaaagaantngggagcggagaacaccaccgaataaa >CEESC13F cttgcttgaaaaatttatataaatatttaagagaagaaaaataaataatc gcatctaatgacgtctgtccttgtatccctggtttccattgactggtgca ctttcctgtctttgaggacatggacaatattcggcatcagttcctggctc tccctcctctcctggtgctccagcagaaccgttctctccattatctccct tgtctccacgtggtccacgctctcctggtgctcctggaataccttgagct ccctcgtgccgaattcctgcagcccgggggatccactagttctagagcgg ccgccaccgcggtgggagctccagcttttgttncctttagtgagggttaa tttcgagcttggcgtaatcatggtcatagctgtttcctg >CEESC13R gcacgagggagctcaaggtattccaggagcaccaggagagcgtggaccac gtggagacaagggagataatggagagaacggttctgctggagcaccagga gaggagggagagccaggaactgatgccgaatattgtccatgtcctcaaag acaggaaagtgcaccagtcaatggaaaccagggatacaagaacagacgtc attagatgcgattatttatttttcttctcttaaatatttatataaatttt tcaagcaag >CEESC14F aaaaaatgcgaagntccaacagttccatgctatcgttatctggatatttc aaatggattattccttggaggccgtcctggaacttcgaagcaaatcgaga aggccttctctggatgtatttctgatttgtcagtggataaagaagatgtc gattttncaacgatcaaagaaatgcacaaagttggacaagttcatgaagg atgcaagcatcgtaaagatttttnctcaacttcggatggacaatgctcgg ctacctcgaagtgtgtcaatcgttggggaggcagaatttgtagctgtccg caatcggttcattcgactggtgaatgtgttggagcacttggaactcaaga tttacgtgggcattctctatttgaagaggaatcatttgtttttgtaccag ccaagccaagtatctgtaccgtttgaagtttcatttgaattccggacatc tcgagctgatatgcaagt >CEESC14R 
gcacgagtttttttttttcactgaaatcgatcttccatcaaccaaaacat tccgaatgcatccttcaaaacgtgatggatgtccagttccaggtgcaatt ccaaagtataagctctctagattcatatcagaaatggaagctttagcttc tgcactgtagattccatttatactagttgcaactgaatctgcttcaaact tgatgacaacattcatccaatgctttgatgttacttcaggagctggcaac tcaacttcagaatctccaatgttatatttcaaagttccattcatcaactt ccaagttgtaatgaacacttctttgg >CEESC15F cnctttgtaataaataatttattatgccncgaaaataattnccnccaaaa tcaatctttcagcgggtgggtgtaatcattgggaacngggaagtcactag gaaataaggaaatagngaaatacaataaataaaataataataataatagg cgactatgattagttagaaaacacagctctgggaattgtttggaagtgtt gagagaaattnttgattttttacaaatggggaatatgattgaccgttgga ataagtnaaaatattantaaaaatagcgctgantgaaaacttaataagtg acagtgaaaaggatttgaaaagntaattaanccaactacg >CEESC15R gcacgagggcggacaacctcaaggcgctacaccgggacaacccgatcaga actttgactacatgttcaagctcctgataatcggaaattcatcagttgga aaaacatcattcctcttccgttactgtgatgattcattcacttctgcctt cgtctctactgtcggaatcgatttcaaagtgaaaactgtgttccgtggag acaaacgagtcaaacttcaaatctgggataccgccggacaggagaggtac cgtaccatcaccaccgcctactatcgtggagcaatgggatttcattctga tgtatgacatcacttaatgaagggtcttttaatagtgttc >CEESC15RB ggcggacaacctcaaggcgctacaccgggacaacccgatcagaactttga ctacatgttcaagctcctgataatcggaaattcatcagttggaaaaacat cattcctcttccgttactgtgatgattcattcacttctgccttcgtctct actgtcggaatcgatttcaaagtgaaaactgtgttccgtggagacaaacg agtcaaacttcaaatctggggataccgccggacaggagaggtaccgtacc atcaccaccggcttactattcgtggagcaatggggattcatttctgatgt attgacatcactaattgaaggagtctttttaatagtgttccagggttggt gcactcaaatcaagncatactca >CEESC16F cccattttacaaatttatccagaggaatggattttcaattaaaatcttga aaaaaactaaaaagtagagaaaattggaaactttggtgggtttaaacgtt aaaagagattaaatttaaaaaaaaagggagatcgantcgaataatttggg tggatgggatcattgtacaatataaatagaaaaaaaggaagagttcaatt gggatagaaaaaaaaagtgaatttttttttttgataaggtagntagtgtg ggtggtggcggga >CEESC17F tttcctgaaaccgtcagtcttacttctcgacgaaccgaccaatcatttgg atttggaagcttgtgtgtggcttgaggaggaactcgctcagtataaaaga acctngttggtggtttctcactctcaagatttcatgaacggagtttgcac caacatcattcatttgttccaaaagcaattggnttactatggaggaaact acgaccagtttgtgaagacacgtcttgaattgctcgaaaatcaacaaaaa 
cgntacaactgggaacagtctcaactncaacacatgaaagattacgtcgc gaggttcggtcacggttctgccaaactcgctcgtcaagctcaatncaaag tgaaaa >CEESC17RB gaaaattcacacaaaacactacacatttagtgatgtgacaacaacaaagg agggtaattggaaaaaaagggtagaaacaggaaccggaccaacaattgga ggaaaaccgacaaaaattgggtcaaagagagtaaaagatgaatgaaaaca agagaaaatataatcaaaatcacaggaaaatgnaattgaaatatcctana ttgaanatggggggnaaggtgaataatgngagaaaaatctcgggaaatca gttcgattctaatattagaattggcagattttcgatgttttcggggggaa atagg >CEESC18F aagatcaatgatatggaatggtggaatcgattccttgattccgatcctcc aatcaatactaaggaagtgaagccagagaactcgaaattgagcgacttgg atggagagacacgtgccatggtcgaaaagatgatgcatgaactgttgcag catatcatgcttttccttctcatcagacgttgcggcacgagctcgtgcna aattcctgcagcccgggggatccactagttctagagcggccgccaccgcg gtggagctccagcttttgttccctttagtgagggttaatttcgagcttgg cgtaatcatgggtcatagctgtttcctgtgtgaaattgttatccgctcac aattccacacaacatacgagccggaagcataaagtgtaaagcctgggggt gcctaatgaagtgagctaactcacattaattgcggttgcg >CEESC18R ctcgtgccnaacgtctgatgagaagaaaaagcatgatatgctgcaacagt tcatgcatcatcttttcgaccatggcacgtgtctctccatccaagtcgct caatttcgagttctctggcttcacttccttagtattgattggaggatcgg aatcaaggaatcgattccaccattccatatcattgatcttctcgaggggg ggcccggtacccaattcgccctatagtgagtcgtattacaattcactggc cgtcgttttacaacgtcgtgactgggaaaaccctggggttacccaacttt aatcgccttgcagcacatccccctttcg >CEESC19F gctctcgactccatcattccaccacaacgcccaactgaccgaccactccg tctcccactccaggatgtgtacaagatcggaggaatcggaactgttccag tcggacgngttnagaccggaatcatcaagccaggaatggtcgttaccttc gntccacaaaacgtcaccactgaagtcaagtccgttgagatgcatcacga ntctnttccagaggccgtcccaggagacaacgttggattcaacgtcaaga acgtctccgtcaaggatattcgtcgtggatccgtctgctccgactccaag caagacccantcaaggaggcttcgnaccttccacgnccaggtcatcatca tgaaccatccagggcagatctccaacggantacactccantt >CEESC19R tggatccccngggctgcaggnaaaaatcaacaagataaactcaacaatga agatttacttcttctttggtgcagccttttgggcggacttggtgaccttt ccagaggatccatcagacttctcaacggacttgataactccgacagcgac ggtttgtctcatgtcacgaacggcgaaacgtccgagtngagcgtagtcgg tgaaggattcaacacaaagtggcttggttgggatgagctcgacgattcca gcatctncagacttgnggaactttgggaagtcctcaaccttgttaccggt acgacggtcaaccttctacttaagctcgttgaacttt 
>CEESC20F aacaacaaaattgattttaattgaaggagaggtagagaatgaaaacttgt gaattgaaaagaatagaaacaaaaaaattaaacagttatttagggcttat ggcgaggggtaacgaatgagaaaagccctttaactgtgagataaagtaaa aagaagaatgaaaaatagaaacaaaaaatatttaacagttatttagggct gaggctaaagaatgaaaatccattaactgtgagagaaattgaatggaaga atagaaaaaaaacaagttcaacagttattggttggcccaaaatcaacctc gatgagcgcttttcggatggcttctgtgcgcacttcgagaaccttctcgt tttcgacgttttcgagagcctctttctacg >CEESC21F gaagttgaggccaacgttccaaatgttatcaatgagcagctgagccggag tattccgaagacatgcagtttgatcagtgatctttgtctgattgcagccg acaagatcggctagtgctgttgatttttgttttgctcgtttcggagaatc cattgaccacttgttgtcgagggatcctgattggagaattccgttttgga ataatcccttcgacgccggtgcaattaagtgagctacaatcgaagctgct ccagccgattctccaaccaacgatatcctcgccggatttcctccaaacga gaaaatatgatcccttatccaataaagtgcgagttgttgatcaagcattc ccatatttccaggnacatcttcatgatctagg >CEESC21R tcgtnccgaattcggcagagntncntatgtgaatatttgggctccggcgg atgcttacaatcttactgtacttgtatggctgtttggtggtggcttctgg tatggntccccatnactgttactttacgncggaaaagaactagcaacacg tggaaatgtgatagtagtgaacatcaactatcgagttggaccatttggat acctgtttctagatcatgaagatgttcctggaaatatgggaatgcttgat caacaactcgcactttattggataagggatcatattt >CEESC22F aatcacaatgattttatagtcgaaaagaaataaaaaaacgcattatgctg agggcttcgacatacaaagtggaaagggttgggggaaatacatggaaatt nccttttttttttcggaaaaacaaatttttgttagtatttacaattacat tttgttaccagtcagacaagttttttgagggaaaaaaatccaataaaaat gagcatttttcagaaggacgtataatgtacacgaaggtggtngtgtnaaa aggagacaacaaaagggaaaaattgcgggttaaaaatggccgggaaaacc >CEESC23F tttttttgttagagtattttatatatttattattattacagcttacagaa ctttgatttgttttacagaaaaaaggtgcaaccgnttagacaaattcaat ggattatcattatttgaaactttttgcagttccttattttcaaaaaaatc ttggtttttggtttgatcagggtgagaaaggatttcgggggtcgaaagct agaaaattatcaattttttgtgattttcgattgtg >CEESC24F gtcttattaaaagctttattatgaatgtggctcaaataatgagcatgatt cagagaaaaaatggtttaaaatgtcaatttngtaatgagaaaatgggggt catcggcagtaatagggtacaacaacaaaagtgattgcattaaacctcaa cttcaaaccaaagttacacagngnacctagttatacatgcctagattact accggantagtattgancaaatacaagagaagttaccaatgaagatttgg gtgagantgggaagcataatgcagtcggctagagaagttgg >CEESC25F 
taaaccaatatgatttattattaaaattttaaaagaacaaaaacatgctt tagaattccaaaaatgattttaaacaagtgaatgaaagtatcacaaatac gaaaagagaacccgaagaagagaaaagaagaaattataaaaaaaatattt tagagctccgacttttgaaggntcgaataccgtttatcagatggcttaag ctctttgaacactgatggaggtggtgttgtgtcaattggacgagtagatg gagcttgagcttcatgatcatcagtgattccacgtgcagcttttgccttg gcgagctcgatcattcgttggatcaaggttctcgtggaagtccttgtgaa gctttccagagtgaagatccataacaaactctctaagtttacctgggata ttca >CEESC26F ctactcggcgaccagctccaccaagggaaacctcttcttctacctcaacg cgttgatcatctccatcgccccgctctacctnttctacggagttcaccag atggagatccaagactcgcttgtcgtgtggggactctntgccgtcggcac tgcctacctcctgtccctggcctgcaaaaaccagaagtgccttctcaagc atcaaatcgtgatgaagcgcggntcagctgtggaacgcgagatcagcgga caatatgctgctgacaagaaaatgactgttaaggagaaggaggagcgcgc gcttttccgcaaaaacgaggtcgncgacancgaatncacctacttgtcgg tcttctacacaaantcgctctacttgaccat >CEESC27F tcgaaagtttctccagaatttcgatcaattcacagtcgattagactattc actgaagactatgtcattctctcaggattacggagcaatcagtgaagaag gaccaatagaagtaggatcaggaattttgaaggtggagtcaattgaatat atttttgaatacgatgagaacatggntcaagtgaaaatcaaatgtttgtt ggccccggaacttgtgagattctcgaatgataagtcaataatcagaagat attttcattattacctggccgcttcacaacgaattgttcagcatgtcaaa ctcagggaaagtgacagtccatttcgaaaccttcaaagcctaaaanttcc atccatttcttgggactctaaacggagctagaggatatcctcaatttcat caattt >CEESC28F gttgaaaaacatttcattgaaagatcgatttttggtaaagcagatcagtc aaatttgaatgcagtgaaatgatgatctgtggggctggagagagcgattt agtggcaacaattgaaacgnggtaacagggtgaaactttggtttgtgtca aaaaattaattagttaaagcaaaaaaatgtggaaatgtcgggggaacaat aaacatgttaacangantaaaaaaccgtggatttatggaatggctcttct aacaatgttgttgcggaagaattcttgcattctgtgaaagtttccatcaa cattgccaactatggctcgaatattcgcctgcactgggataataattgat tcc >CEESC29F caagaagacaatttgttttgattggaaatggacgaatgatgaaaggaaat aaacatcttttaaaactctacaagtatgggttttcttgaatatttctgga actaatgaatacatatttncagacaccttcaatcggaaaaagtcatcttc ggaccgtcaattctcgaaaccttcaacgatcttgaggaagctgctctacc aagatcggataaattttgatttttggttgagcttaggtttttagatgata gtattcagtttctaacggatattcacttcatgtaactattattgatntca tattttnatgttt >CEESC30F agaaaatcaacaacaatttcatttgaatgaggaagagagtaacataacca 
acaatgatgaataaaaaacaaagaaatgaacaatttttggggaggggcgc ggggaaacgaacaataatggaaatagaagaaaagagcaaagcctacgtgc agaatagagtgaaagcgggaaatatttctcttctgcgtctctttctgttt gtgtgtgatttagaattccatactatccgtctttcggccttcgaccacaa ctgaagtgataacatgtccgtcttcggtttccttggattcgagctcacgg cgaatttcaggcgagatgagatagtgtccatggaatctgatcagctcata aatagctgccttctgctcggcattcaaatccgttcttgtatcgctggcac ag >CEESC32F gacggggaatggggagcaaaacaaagaacaatttgatatactataaacca ggggactgggaaattgaaagcagagaaagttgggatcacagattatttta tcagtttaatggtacttgcaaacagatggcactgtgcatccgtgcttctt gatgatttcttgagcagcataagatccacaggtgactgaagcttcgactc cctttcctttctctcgtgccgaattcctgcagcccgggggatccactagt tctagagcggccgccaccgcggtggagctccagcttttgttccctttagt gagggttaatttcgagcttggcgtaatcatggtcatagctgtttcctgtg tgaaattgttatccgctcacaattncacacaacatacgagccggaagcat aaagtgtaaagcctgggggtgcctaatgagtgag >CEESC33F aataaataaaattattttattaaagtattctcaaagtcaaaatggcaaat aaagcttganccaaaattttgttcactattattattacaacttccttgct aatttaatgtctccgccggttcttgaatagaactgatttggagcattata tttttnagttccattggaagatgttgagaagtaggcagtgacatctggaa tgacttgaggagtttgagaagcttttggcattgatacacccattcgacgg ccacggtattcacagcttttnccgaaatatggacgttcacagcggcaggc aaagttatagatagctggatttgttttcggtcgtggtaaa >CEESC33R tcggtctaccacaacaagactaccaataactactggtagtacacaaactg gtgaaccgtttcggactatggcaatcaactgaaatggggtgaagtttggt aggaaaacgaaaaaaagaagagaaagaagaagattgcactgccaacataa gatggtcattgtgtggtaccatcaactactgctagtacttctgatattta taggagctgccaaaggagctcagtacataggttcgggagcctcccaaccc aaccgaacggatgttgtgtggatggttccctcttggacatgtaaaaatga ntattcaattgatgtagaaaagtatgggattctgcaaaattgaggtncag ca >CEESC34F ccgtagttntttttttttttttttanttttatgattttattttaacgtga ataaacatcacaaaagtgagcttactcaaggggtggggtntggggcggct aaaccaaccactaacaagtaacaaaaagaagggtgacagtaagananaaa aacaggngatnggtatgcttagcaactnggggaacgtgctaagagcactt ggcaatgaacttattgcttctgagcggaaacgngaaccgatgcagcttcg tcgaccttcgagcggaacaattcactatcttgaagcatcatgatcaactc ggaattt >CEESC35R ggagacaagggacgtgagcgtgacaacatcaaggaggatcagaccctcta ctacaccgtccagctcgtcgatctgttccgcgctgtgccaggagagaagt 
ggaccaccgatgagggaatcgttattgagcagacacacaagattgatgag gataagtgcaagaagtcgaagagtggagacaccattcaccaacagtacgt gcttcatcttnaggacggaactttcgtcgattcgtctttttntcgcaacg ntccattcatcttcaagtttgaataattaatgaagtcattcaagggaatn ggacattncccattgactggaattgttgcga >CEESC36F aagattttaataaaactttattgaaatttgctcaatatcagangtaaata aatcagtatcaggataaatngtgaacagttatatttgcttctgtaaacag ttgggatttgaattcagatgtaaatttacataactnctcgttgctgaact tnatactccaaaatccatgtatccnctttatgactgangacaatanccgn gaagttgtttatatgaatggncagttg >CEESC37F agtttgaatgtttttatttttttactttaaaaaaaaatttaatttcaaaa ataaaaaaaaacagtttgtggcaaggaaaagggggaaaaattttntgagt gggcgtggggaacaacttgagttttttgaaagagtcattttggcgggaaa agcggaaattnttgcgaaatatctacccgttactcgcgtggctttttgta gnctaaaactttagtagaggaggagaaagaaaaanctggggaaaaaaatt tgggtcacagnaaaaaaatgcaattgattagcaancgcaagaaggtgggt agagcgtgtgaaa >CEESC39F agaattttagtgttttattgaattgttgaatacaaatcaataaaaaaata acatatgancggttattatgactttctttcatatatatatcccatatatg ggtttaccaaaatgtgcacgaaatgaatacaaataaattatttaatcagt gtccatcttcgcctcgaaacggcttcccaacccggtttcacttccagcga ctcctctattcacactatccagaacagagttccagtcgacgccgaatcga tcggttctttcttggattcgaagcacaacttggagagcatcagtggcgac agtcttcttctgttgagcacacagctcgtcggcctgctccaaaatgttgt caatcagcttcccagcggccactgg >CEESC40R cagagttacaaataaaagcgggcaacaatgtcagaaaaatcattgcaatc gaaaatnctttctactgtatttttncttgcaactctaattgcatttnctt ctgccgatggatatacctgtnccggaaatacgctgataaatccatttntn aatctttcggagccctactactatccagggncatggcgagaaaacatgga accagctgantatgctccagntcaaaagtgtaactggaagatca >CEESC41F aagaagaggatatatttatccaaactgcaacaacaaaaaacacaacaata taacttgaaaaataaaatacgctcataaaaaaacaattttaaaaattaaa aattattccttgtgctcggncaattgaaatcctgccttcgtcgagttcac tgatccntttgctttctggcgagggccccaattcttccctttattgcatt ggnaccgtacaaggntctctttcttggnctgttcgatggctccngggtga agagtgaagttgactgtgttggtgatgggttntcccagaatgtgatatcc ngctgattctttggccagacg 2.fa100644000766000024 6050414006427353 15766 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data/dbfa>AW057262 gacgagggtccgtctcggcgtggtgcaatttgctttagagagcgatcagt 
ttcaagaaacggtgacaatgaaattgatttggataaaagtttgagcttga agaaaccatctgcatcgaagaatttggccagtttgttggaaaagaaggaa gaggcaagataaactgatgtaacatcttccatgatcagttcaaaacctcc aacttctcctggcacctccgtctacatgtacaacactggatcagccaatt cgacttttatgtctgctaaagatttgcatanagaacgtgttgcagcttct actggtcccagatctgcttcaaaatctccaaaaggaacacttcgcgtgaa ggaaactaaagtcattcgtgaagttcgccaagaagatggaaagccacctg agattagtgagaagaaagaagaaactgtgaaagaggaaaaagtgaatctg agtgagagattgagagccagatcgagagcttcatctcctgcaactccaac tttgaagagaacattcaatcagacggatgagtcgaatattgtgactttga gtgcagtcaaagaaacccatcaaactttggaaatgacaccaatcatagtg aattctgaaacagttccttcaacttcttatggacaacgtgcttttccctg agagtgttcaattggatatgctcattgacaagaaacgacttgt >AW057264 aggctccgatcccaaagcctgaggaagatctcaagggctccacggatcaa agtaccactgagccaacgaagctcgcctgatcccaaagcccatcggaacg ccaggaaatccaacaaatgccgaattcattccgcgagttgcaggactcaa agttcaagccggctccaagcccaaagtctgaaaagggccccgcggagcaa agcttgtctgagccgtcgccggtccccgggaataaccgcaggaactctga gacatcacaagttgacacaatttccccggtgccaaccaagctcgttggaa cccaaagtccatcggaacgccaggaaattgagcaaatgcccaactcgttc cgcgagctgcaagactcgaagttcaagccggcgcaggctccaaacccaaa gcctgagcgcgttgagaggggctccgcggagcaaagcatgtcggagccgt tgtcgatttctagagttgcatttggctcgccgatcgctccgaaaccacgg ccatcgccactccaagctccgcttcttgagacgttggctactccaccgac aatcgacgctcctaccgctgcaatcgagacggcaatcgagagaagcgcgg aantttcgtcatctcactcggaggatccttccaactcactttttcaagtg tgcagnatgccgtaaggaagaatcgagtggtc >AW057265 ttcgctaatcattcccttgttctactgatcgttggaaggttctagctatg aacgtcaccagtgtcacttcagaggatggtgttaaagaattcgaaaagat tgttgtggaacctgaagatatcgaatatgttgagattccggccgatgcca aaaacgttgacttgacgcgtcaccgtatcaaagaaatcggtgattattcg tggctcactcacgtcgaacacttctcgtttcgttggaatctgatcaaaaa gattgaaaatctggattgtttgacaacgttgactcatctcgagttttacg ataatcaaattacaaaagttgaaaacttggatagcctcgtcaatttggag tcactcgacctgtcattcaatcgtatcaccaaaattgaaaatttggagaa gttgacaaaactgaagactctcttttttgttcataacaaaatcactaaaa tcgagggtttggatacgttgactgagctggaatatctcgaattgggtgac aatagaattgcgaaaatcgagaatctcgacaacaatctgaaactcgatag attgttccttggcgctaatcagattcgtcttattgaaaatgttgatcatt 
tgaagaagctcacagttctcagtcttccagccaatgcgattactgtagtt gataacatttcgggacttcacaacttgaaagagattt >AW057266 gaaagtctttggggagatgacgagcctgtacggaggaaggagaagatcag cactcggttgttgggggtccaattgcgttcagaatactcttgttttcagt cattttgtgaacagccatactgacggtattccacgtattactgattcatt ccatgacgacaggctgccatatgtttgcacgatggatgtcacgatgattc cgatcctt >AW057268 ctactacaagctcggtgttggaatgaatgagtggaagaaccctgagcacc ttgccgagcacatcaatggagctgcttactccaactttgacattgcttac tatccatcggagaacgagcggttcactttgtacactccagaggaattctt gctgtatgttaagagattg >AW057269 aaactcgaagaacagtcttgaccaacacgagatgtattcacttttgacgc tcctcttcgtcctcttcttctctggaagcactctgctcgttcaatgtggt ggaaagaaaaagggagcaacttctgccgaaggaaaatcttcgacgatggg cccggctcctggaggagctcctgctgctgcttccgctcaaggagaacctg aagagaaggagtaatgaaca >AW057270 tcgctcgcttgcgtctcttgctcgccgcccgtgcacttgaatgcacagcc cgtcttcagaatgttactgttaagggagttgccgtgcgcaataagaagag attggcaaatgttgaagttcaactctatgagaaggacacccttgacccag atgatcttttggacaccaagaaatctgatgctgaaggagaattcagcgtt tacggagaagaagatgagactcatgctattgccccataccttttgattac ccatagctgcaacccatctaacccaattgtgtccgcatcgcaagtacttg gtgccagaggacaagatcggaggaacctacgacatgacctacgtcaccct cgacatcaaggttcacggagagaaggaaaaatgccagtaaaaagtgcaaa cttcctggattttattgactatctaaatatatattttttctatatga >AW057271 aagcgatcatcggatttggaagctgctcttcgaattgtgctccagcagac tttgaacatcgttttgcaagcgcaggagaagctccccgaggcaaatgtgg taccctcaactccgcccacctcaccgagcactgatatcggcgaacaaatg gcatcgttctggaatattccatcacccaaccctcctgcaacc >AW057273 gttgatgatctataccactggaaacaacaactcaagtgagcttgtggatc caatgagcattactctctgtgtactctaatgtgcccaccaacatgagaat tgccaaacaccaccatgcagagttgactggcatctgctcattttgtacct gcttgccacggccacggatactcaaactcacgtcaatgcatactagctaa ctcttgctaccaggactcaatctgtttgatgccgaatgaacaactagtgt tcactccaggaatgttt >AW057274 tctcccaacaacacctcaagatgaatgccatctacactgccgtccttgtt gcttcaactctcgcctacactgcaatggcttggattggactcagcattga agccgccaacgaggatatgatctgaagtggcgccc >AW057276 tcgactaaccgtctccactcttcacttgctcaaatcttcatgcaaccaat caacgtcatgctcgctgttcttctcgccttggcttcatttgctcaaggag gcagatctgttgctccggctggtgcagtcactgaaccaacagttactcaa 
gctgttccagaaggatcaggacttagttcagatgtcactgatcgtccaaa catcgactccactgatgttgtatcaaatgcaacttcggtggaagatttgc ttggaagttcaacaaatgcaaacaacacttgtacatccaactctaggacc tttgaattgctccaatgatgattcttgctttggtg >AW057277 tcaaagttcataaacggatcaatacttgcaaatgatggcaaaatactttg gcgccacagatgcattcaatgcaattgttcaaaaagtcgacgaaacactt attcaagcagaatcccatcttcgtaatcttcatgaagatacagtgggagc aaagccgtctgatagtttgccggaccgcactatcgttccgtccccatctt ctcaatcggaacgttcatgctccccggagcctcgtattgttgctcctcaa ttgtctgcatactctggatcatccgctgcgtcttcttcttccgtgaatca tattgatgtgaagagcaagtcgtatttggcattggataagaagaaagcac tgatcatgacttcgctcaagtcaaagagagttatgaacgatagtgatgtg acaaaagttcagaaattgatcgatgacttgttcggaaaacaaacttcttc gtcctcatcttccatgtccatccttc >AW057278 aataacctctcccaacaacacctcaagatgtttgccatctacactgccgt ccttgttgcttgctctcgcctacactgcaatggcttggattggactcagc attgaagccgccaacgaggatatgatctgaagtggcgcccatc >AW057279 tcgaagtgatggattctccaacatcaccattgacttcttcaaatagtgga cttatcactgttctggaaagaggagtttcttctgaagacacattgattcc ttctgttcgtcgtggtgtcattccagtcaacactcttcgttaccaaatag aaaagcatctcgagatgtgtactccagcttctgaacaattgtcaaagagt tcggatcccaacatctcctcgatgtacgttttccatcaaggaattcaagt aaagcaggaaccaatcgatgatgaccaagaggaagagcaacaagtacaaa agcagcttgtattcaaaatcgagggctccgaagacgaagaagctgtgaag aatgagt >AW057280 aaccctttcgactacaactaacatggacatggactagcgatcatcggatt tggaagctgctcttcgaattgtgctccagcagactttgaacatcgttttg caagcgcaggagaagctccccgaggcaaatgtggtaccctcaactccgcc cacctcaccgagcactgatatcggcgaacaaatggcatcgttctggaata ttccatcacccaaccctcctgcaacc >AW057281 ggacaacaagactgaggaagcatggttctcgttcaatgggaaggtgatta agcagctcgggccacagctcaacgagatgtacatcatcacgcgcaactgc atcggaggaccaccacattgcccatgtgctgtgtgcggagctgctccacc accaccaaagccagtgccacgtgtcgagagagacgaatggatggacattc gtgagggagatccatggncgactcgnccaacttgtcaaggctcttgacaa gactctggacacccttccaggagtcaatccagaccaatatgttgctctct ggtacatgcaaggtgaacctgttatgggtcgtgtctggaatgaaggagga aaggtggctgccaacttctcgtggttcaacaacgagtattgcaagaatgt tggatctatccagcttctcatctatcttccggacagtgttcgtggttntg actatggatggatcccattcccggaggctgctcagtttggagacaaagct 
tggcatccagttcatgtcaacaaccacaagggagatatctncgttggagt tgttaacgttgctggaggaaagcagattcttgccaggggtgattgtccgt aacgagaagtatggttatggataccaaagaaaggagcattcttgcaa >AW057282 agtacgctctttaacccatgaattgttgtgtccgggatgtcgtcaatgtc tgagtcggatagttatcaatcgagccaattgacaagcgagcccgatcttg tggcgctggatgccaagataatggcggtaatggatggaactgaagagctg gagagggagattggcaagatgatggctctgcagcaagcgattagtgatta caagaagagccatcaacatagaactcaaagaaccaaggagaagcttacgg atatgtctaaagacccttacaatcaccgggaagaacattgttaagtcgtg tgaagagcggctcaagcaaatctcggatgtcaccgagccatacatcaaca gcggaatctcatcggtggaagatcacacctcctcggttgtggagcgctgc attcaaatgctcggagcaatttctggacttggtggggcaatgaagaaaac ggagggattgtcttcgaaagtgctcgagcatcacagaaagctcgcgatca tggagcacaagcaagccgatgcaatttctcgctacgagaaagccgtcgga gttcagaagctcgtccgcgagcatccggctggagacccactgcacgagc >AW057283 atcgatcatagtgcaaaatggagatttaatctaattgcattcaaaatgaa gagcaaaagaaggaaaaggaacttattgctgaaaaagcagcaaaggctgg atttgtcaaccgactttatgtgaacatcggacaaaaagttggagttgtcg agctgactaagttagagccacgttttgagaggaacatcgataagctcacc tcctaccacaacttaatctacaaaattgtgaatgtaatcgaacttcaagt tcaattcatgcccaaggcaatggcaaagaaagcagttttgtgtgctcccg gcgagattccatgggaagttcttggaggatggttgaattatttgggaaaa tatcagtttgatgggcaacattctaaaatgctggaaaaatacagtagcgc ctgtggaagaattgctcaaaaggagatccaggtgcagaaaagaactcgtt ctcatctgattaaaaagatgcgtttgtacactggagaggaaagtgagata ttgaattcgaacgtagaaaatttgaataacttgctccatgcaatcgatga ttctcgtcaccatgtgaagtcttcacaaaccacgaaagaggtgaaagcaa aaggcgaaacgtatcgtaaagccatcaatgctttcaacgaaacggcaaat gaagttcaagcattgattgacgaagttgcaatggtttgggtgccc >AW057285 aaaaactacaacttcggagtaatggaaaatgagaaaagtaagacggagag tttgaagaaggacgaaatcgatgaggcaaactcagaatcttcaaaagtgc cactaacaattgatccagaggaagccaaactcccaaatgccggcggaaaa tcggagcatatggtggtcaacttcacttcaaaacgcatggcgatcaaagt gagatgtggcaatgcactatttcgtgttgagccaactcacatgatcatcg agccgaacaagtgccgccaactgacaatcaatcggatgcccggaccaatt caaaaggataaagcgatcgttcaatacctccaaattgaaaatgatgtgca agatccgaaggctgcgttcaaagcagcggacagtgctggaactaagattc cacacttgaagatcaagctggtggccggagcaagtggaggtcgtcagatg tcgagagaggtggtggatgagtagtttgggaaaaaaac 
>AW057287 gaaaagaacatgaaaataattctcnggctgccgctagaattcagcaagtc attgcgaatgcagccggaattccatcatgtgaatatggaggaagtctctg gtataaacaagaagaaaggaataaactgaaggaaaccctcaaaactcagc atgatatttctggaagcagaaataatagtgatagtgggatatctggtgga ggaggaagcagtgataatttgagcatcgacgatttcgaatctgtttcgga gaaccagtgtgaggaaaatgtgataccggctatgaactgaatttgcacat tgtaatttttgttattaaatcatattgaaaatt >AW057288 caaagttgctcaaaattttcattgccggattgaccttttagctgcattgc tgacaagatcaaaagaatggtcaaaggaaaaggttcaaaaagatcaacac caagtttgcgcgccaagaagaaaactggcacggatagacagaagccgtct gtaaaacaaaatgcatctcaaaactcaaagaagagtagcagacagaaaaa gacccccagtgttggaaaagaacgggaacaagcaacggataagaaacgag aaattgagaaaaaaccacaggaaaagactgctttggatgagcagcaaagg aaagctcaaacggagactatcagcaacttggaaatccttccggacaagaa tcctgctaaaatggatgacggttatgaagatttcggtcctggtgcagctg ctcgctaagt >AW057290 tcgccatcaccgccttgctccgcgattgccattttctgcggaggaaagaa aggagccggagaatctaaagaaaagccataggaggatgtgtatgaggatc tggcaccaggagataagaag >AW057291 cgatcaaccaccatgagttttgatgaaattacttgaccttcggaaccaag aaccgcgatcaaggatatgatttgctcaaagcgcgtctcgacatgtgatc gttcggtggaagtcttgtggagacttgctcaagtaattcatgagaagtct gcatgtgttccaaaagctcaacgtaaggcaagtgtcaccgaaggactcaa gtttgctgaagaagctgtccagaaggatccaaaccatttcaaggcgctga agtggaatgctgtgttgactggacaagcaaccgaatatatggcaacgaaa gaaaagttggaatgcagtaagaagttcaaggaattgctcgacaaagctct tgctaaggagccaaaggatacggctctcctccatttgcgtggccgttaca agtactcggttgcatctctgacatggcttgagaagaagcttgctgccacg ttctatcagcaaccaccatcacattcctatgaagaggccaacgaggattt ccttgctgcttataaggtcaatccaaaatggatggagaacacattttatg tgtccaaatgctacgtagcaattaaagacaagaacaacgctcgcaagtac ccttaccgaagtgtgtgacattcgaaccgtaattccgacgcttgaaacaa gagttttgccgatgatgccgaagcagatgttgtcttaagctttaa >AW057292 tttgcgcattttgttgctgctcttctagcttacgtttaatctctctctgc ctttctgcctccttctttgcctcaatctcgagtcgttctttctcttcaga ttctcttctttttctctcgatttcagccaactcatcgcttttctgctgtt ttttgagatcagcaatcattgaatcaactcgtttcacataatgctcataa gcatttt >AW057293 aatttcaaagtaataatgcaaaaagtttgtgaaaaacggatcgatagcgt cgtggcgtcggctgtcgagggttctggaatgttctacgcggtgagattcg 
agccgccgccgtcgaaaacttgctttgacgtgactctaaagacgtacgga ccgccctacaccgagtacatcgtaaccgtcgcaattccgccgaaattccc gttctcgccgccggcgatcacttgcaaaactgataaaaatatgaaattcc tgtttttggaggaaaatcaatggaaaccgtctaccggaattgttgcagtc cttatcgaagcctgcagtgtgatttcgcgtcgagacctggtcccccgtgc accggttcttccacgcatccgtccaccacaagcacgtacgccaactagtg cttcgcctgcaaagtcgccaca >AW057294 tgagagatggtaaaaccggcaaatacttcttcttcacttatcactaaaag agttttgactctcggtaataatgtcacaattgatatatacgatcatcatt attatccaatgtggttttggattgtaatttctgttggatttgtcttctgt actctgagctgtgctgtttggtttatgtgtgctatgtggagattgaaaaa gggtaaagaatgtaaccatccatcgtttgaagcgcgtaccgttgtgacaa aagatggagaagagaaaccggatccccaaatggctcaaaaatcagaaaag acttgtaaaaaattaggtgcattgggtgaagctgaatcgttggccaagag cttcaaaagtatcagatcgaaaaagtcgatgaagtctacaaaatccaaga aatctgaaaaagatgtaggacatgatgatcataagaaggaagatgttcat ggtgatcaaaaggatgataataaagatcgaaacgatggaggacgagattc tcatgttgttcagatggaacataattctgaagaggaacacgagccgagtg gatttaaaaagctgggcaagtctttcttcaacttcaaaaagtagcaaact ctcttcaccttcacccaaagaatagc >AW057295 atcaactgacaccaacacatcctgcttaatcgacatctatcatgtccagg aataacactcaaatgcacatcacaagttctcaattagaagacggatttcc atcgattaccaacaattttctgacagtgacagtgaacttcaactacgatc catcgaatccttctgagccaccaacaaaagtcctggagaagatgagtgat ctgattggccaacagattgcgaatcttcaaaagggaaaggcaccaaaggc aaacgacgacaagagcaaaggttccatgcctaccgttgaattctctcgga cccagtccatgactactcaacaaagtttggaggacgatgatactcaacga gaaaatgttcctcttgaaaagaaaaagaaggggaaatacagttcggagta cgctaatcttttggtggaaaagccaactcgttaccgtttggttccatcga aaaatgtgaaagtagttccagaagatgagctcccgaagaaaaagtttgac aaggatcggaagagaagagagtacgtggaaattggaaaactgtattctaa agaaccaattattgatgaaagtgaagttttgaaaaaggagaaagggaata caaagagaagaagca >AW057296 tccaaatgacaatcgacatgaaggtggtgtttcttttaaatgctcaaagg ctcggatcagccggtggacttggtccccaagaaggtcgtggttgccggac aggaaatcacagttgattccaagtcgaagaacgaa >AW057297 ttaaagcactgtcaactgatacgatttcattcaatacctggaaaagcgat ttccttccatatgaagaacgttgtctgcttgagttatgaccgtaatcatc ctattcaatcaattcgaaagttccggaagaaaatgagcagcaagggagta atggagcaatcaactcagccgaaggacagcacaaaggaactt >AW057298 
taaccctttcgactacaactaacatggacttgttcaagcgatcatcggat ttggaagctgctcttcgaattgtgctccagcagactttgaacatcgtttt gcaagcgcaggagaagctccccgaggcaaatgtggtaccctcaactccgc ccacctcaccgagcactgatatcggcgaacaaatggcatcgttctggaat attccatcacccaaccctcctgcaacct >AW057299 cagaatccccatccaactaccaccggctggcttatcaatggacgagctgg aagtattggttcaacaagcagtcgctggtcagaatatggtcatcactctt ccggttccagcccacaagaagttgattgtcgagcagatcgttgtgaagtg cgatgaacatgttatcagcctgccagcactcattgtcaaacatcgttagg ttcttcagtagccgaactcttgtcgaactctctttctctttt >AW057300 gtagaatctaggggaacggttcgttctgctccaatcgcctgccaccgtat caccgaaatgggaccgaaggatgactgagagagctggttctacttacttc tcgatctaatcgtacctagcggtctaccttggcgtaagctctctgacaag catgaggtgctcaaggagaaggaggaatgccgaaaagataagcgttcctc gatgttcgccgggctcacacagactgattatcctagtacagtttgtgact acattgacggaagagcttaccaggatcgtgtggactaccagttcatctac aagaatcttgcagaggcttgcaaggtttgtaatctt >AW057302 tcgttcctcttctcgaaggccctcgtatacggattgttcggatctgctga gc >AW057303 ccctgtcaaaagaagtcaaacccatttgatttgcacatctccactggccg cgaaatcgttcaacgcaactttgtgttccgcaacaccactggcaaagact tcctgctgaaattgcatgctacgaatggagctgtcacattcccaacggaa gtttttcgttctccaccactatcgcatcgcatcatccagttccgtgtgaa ctcatcaaagctctctcaatgggacaagatgaatctttcgatcagaggat acgtgttgccgatctatgcgaagagtctgaagcagttcattgatcagaaa aaaactgcaggaactaatgagcaagaggcattctcattgtctgtcaagtt cacggatcagttctcggctccgcagacagtcatcaacttgccaggatatg ccacgtgtatcgagtcgactgatcatccggttgacgtggaagaattggac actacaactgcagtcaacatcgaaagagatgtctccactgctgctccaat tggttcaatgatgggatttgttgatgagtacaaacgtcgtcaattgaaca aaggatgctggttgtccaactacatctgtggaactgaaaagcaaccggag aagcagtcaatgagatcttctcgtagatcaagccgttcgtcgaatcgttc tgcaaagagctcaaaggcttgccgtgttcaagcctaat >AW057304 tcggtggcgagtgcgtgagttggtcaataaagttcagtcgactaaagcaa aaagtgaaagaaaagtacgggcagaagcaaggcgacagtgcttcccgccg aaatcgacgaggcgttgatctacttcaaatctctgaagccccgcgtccag gatctctacaagcacatggcaaatctgaacgacgtggcaaattggcaagt gaaagccaacttttcaggcccactggagaactatgctctgctcggagatc gtatcaacgtgcaaccgttcatcaattgtgttgacacgcggatggaggca gatgtcgagtcgatggataaggggctggcgatttgtgagcggtacaaggc 
gttcacacagaacgagagtaagcttcacacaaatacaattgccaatttga ataagacacgtctcgatatggatagtgctgcgaacaagtacgccagcaac gacactgacgtcaatcggactcggtttgatgatgccactcgtgagtttga ggtggcttgtgatcggatgcgtgagttggcgataagtattcagacaattg aggagactcattccatgtggcaagatgaattgatgcgggagataaaggcc ggaatgaggaaaccgaattga >AW057305 tactgagaaggatgaaccgtcagacttctctctcctcctcatcgtctcga tccttgccgtttttgtccaccatggatttgctgctgctgaagaagagaag aatacagcttcagtcgtcagccctgctccggactctgaagcagcccaacc tgctggaaacggaaccgaaacaccaaaagatgaggtgaaggatgaggcac caaaagaaggtagtgaaactgaagcttcaccagaagccaagacaaaagga tctatggtattccatgcctcttggagcccattccacagttggtctcgccg gcattatgtgaagaagtctgccg >AW057306 ttttttgagattaatttaatttattccacagtaaaagttactcaaagagt ttcatagccgatggtcttgaaatcgcattcttcatgactggggatggctt cttctcacgttgctcccgcttcaaatcaccgatgattccccacatatcga aaaccattccatccgg >AW057307 ttgtccattacgagatgtattcacttttgacgctcctcttcgtcctcttc ttctctggaagcactctgctcgttcaatgtggtggaaagaaaaagggagc aacttctgccgaaggaaaatcttcgacgatgggcccggctcctggaggag ctcctgctgctgcttccgctcaaggagaacctgaagagaaggagtaa >AW057308 ttggaaaagaaggagacaaagaagaaaggttagtataacaagaagagcaa gaagaaggcgaagaagggaaagaccaagaaggttcgaaaagcagacaagt acgagtctcaaaactttctgtttcgagtggaaggagccatgttctgtgcg ggaattatcgttgctatgattatgctgttcgtcatcattatctacggaat aatcacttcaagtcaaactggaggacagttcaacagatacatggccccac tattctgattggatcaggacagagaaatgtcgcaagagacaa >AW057309 acggctacgagttctttgcctccaagaagatggtcaccattttctcggca cctcactactgcggacagtttgacaattcggctgcaacaatgaaggtcga tgagaacatggtctgcactttcgtcatgtacaagccaactccgaagtcca tgcgtcgaggataagctctgcaaactgtcaccaccatccaaccaaccaac caa >AW057310 taccgtcttaccggccgtggctatcttctgtggaggaaagaaaggaggaa cgaaaggagagaagaaggattcagtgtacgaggatcttgcttgcagagac aagaagtagttggaacttcatcgacaccaccaatcaacaaaagacgtctt caatgctttttcatcgtcttcttcta >AW057311 gaattccaagtttgagaaaatgaactgcttattctttccgctttcctcct tgttgccatctttgtcatatccgatgctgccgttgctcaacagcaggtta aggacggagaaaaagttgaaatcgatgctttcaagggagccaaggcaatc aagagaaccgttgccggtggagatcaaatcttccaccttgacggagataa caagggatcatttgttgatgctaagggaaagaagattgagtcaaccaatt 
atgaagctaataacggaatccttatcattaagaagttcaccaaggccgat gttggaacctactccgagcacccagctaaaaacaccgaaaccaagcacgc tgatggatccatctccgctgttccaggactcactcttgatatctccctgc aataaacaaa >AW057312 gcaagtatatccagcggtttgactctgtcagaccttatggctacttagag aaatcatcgagcgtttgccactctaagagcctacgatgcgatgaggatcg ttgttcgacatgattgtgctcctagagctccttatgaacacgctcctcga caaattggctatgatgcacctgtttatggatctcacatgcacgcagcttc tgtcgattacctactaactcgacctgttgccggtgccaaagctcttgacg ttggctcaagaagtggatatttgacagtcagtatggcaatg >AW057313 tcgaagaacgttgagacgatgactgcaattgattcagtcgaagtgatgga ttctccaacatcaccagtgacttcttcaaatagtggacttatcactgttc tggaaagaggagtttcttctgaagacacattgattccttctgttcgtcgt ggtgtcattccagtcaacactcttcgttaccaaatagaaaagcatctcga gatgtgtactccagcttctgaacaattgtcaaagagttcggatcccaaca tctcctcgatgtacgttttccatcaaggaattcaaagtaaagcaggaacc aatcgatgatgacaaagagcgaagccaccactccttcttcacagcttctt cgt >AW057314 cgtctcctcaagtccgccgtcgcccaatcgaacaatgtcgtatgcacaga gtttctacgccgatcagaagaaagtcgagaagccagcggagcaagcttcc tctcctgccacggctgccttccccgctaccaccccaatcgctgaggatcc tctgactccatcccaaatccaggatgccatccgtctctaccgttcggtgc tctccttgtctgcgtctgccccatcatcaccggttcgtcaagcagcggct ccagttgctccggaacagccgattgtgcactcggactactatggtggacc atcggatattccgttgtcctaccgtgttaagtacaccacgacccagcagg cacctgcgtctccagctccggacttcacagagcaacagttgatggctcag ctgcaggctcttcagatccagcagcagcagcagccggcaccagatgttcc agtcgtcgagccagttcaacaagttcagcaaaagccaaaagttgctccga agatgcttcacaagatgtatgacgatgaagagtctgggtactgcttcgct cgtaaaaaggatgtggagcaagaaggagaagttccggagaatccacgtgg nccgctccagttcaccactttccgataccgacctactcgggctcctccgg tcaactatgaagccttccggggtttttttccc >AW057316 catcatgttccttcgcaccctcgttgcccaattcacaagtttctgccatc agctccatcaccttgcaactctgtcaagagttctgtgctggtgtcaatgg tggtgaatcttacgcattctgctctccatggatcagttttgccactcaca gaaacaagacttgctacaatctctgtgttcataactgtgctgctgtctat gatggttcctgcacaactgataaagacttcagatgctgcttgaaaactac tccagccaagaaacaagaattcaagatgagtggttgcaacaagccttaca acaatctttaaatgagttctctggtt >AW057317 tctatgcaaaggattgttttaacattggatgggcagcagataaatcaaga 
cgactggacatgatgtgcgtatttaacgatgctctagacacgagtggatg ttgttatagagacacctcaaacttctgttcagaggggatgtcagtgttgc catctcaacgatgtgatacccttgatgactgtaatatgcgaacaaatcaa actgcgcaaagatggtgtgatcccgtttcaaaatanttgctgtccgattt gaaaaaggaacaagcacttttatgcccggacaacagtacagctttaatga atgaacatcattgtattaactacgacgaaaaagatatttggagtggaaag tgtaagacaccgaacggaatttgcaaatatggacactgttgcccatcaaa taaaactgaaaaattgttacctggaactccatatcgcactcatcaaaagt gcactaacaaaacaattattcgtgatgatcaacgttttggatactgtgat cctaaaaccggaagggtattcataatgagtgaactcaattttcacgggca gagaaacaaggaactctcgtcatactgtaatactgcaagagattgcggtc ggtcgtttggaatggataacgtatgtgttcgaatgaataaagaacgctca atgctttcttcaa >AW057318 gcgactgcgttggatcaaaccgacatggttcaaattcccaacactccaac attggttgccgaggaaaatttgaaccacaaacgctctaaagcaaatctcg tggtggctcaagagtctgtcgcaatggagcacatcgctgctcatcagctt ccagctcccgagccacgtcatcgtggaccggcgattaaggataagccgga gagaaaggatcgtcttccgacggttggagaatattttgaaaatgataaag gagatcgtttcattttgcgtcagaagctgggcgatggtgcaatgggacat gtttttctgagcatttttggtggcagaagtgttgcaatcaaagccgaaaa gtattcaacagggatgcttccaatggaaattaaggttttgttgagtatca gacgccacaatggagttcatttctgtgatatcattgattatggaaccatc cgtcgtgaatacaactacatgataatcagtattcttggaaaagatctcta ccgtcttcgtgccgaacaaccgactcgttcattcactctcaatacgacta caaagattgctcttgaaactattgaagctattgaagagcttcacaatatt ggatacctgagccgtgatgtcaagccaagcaactntgctccaggacaacg cgacaatggacagcataagacaattttcatgtttgaactttttggg >AW057319 tcagttctcatagtttcttgagtcattcctgggtgtgatttattaaaaac atcctcgcatcgtggaataattaaacatggagagcaagccagttgcgaca aaccaaaatacggaattggagaaggcaaagctgctgaaaaaaaaaactcc cgaagaattggcagcactggcaagcaagaaggtattctcaacggaatcag ttgaagaaccagttccagttactcgtcgtccaagtgaattgtcaatgggt tcattgaataatcaattgaaagaagttcaaatgggtaaaagcattcaatt cggctgagaatcatgcatataaaactgccaatgacggagagaaaaaggac >AW057320 aattccaatttttcgaaacgatggcngtcacatatgatacacttcgtgca gaaattgaagagaagaaagaagactcagtttccgaaaggacaactcgaaa tggaaaaatactgaatgcaaaggatgaaccagagtttggaatgaatatta ctccaaccacgttgttcttcaagtacccaattgggggaattggttattca ttctttacagtcaccaacacaacatcggagaaataagcattcaaggtgaa 
atccagtgacaacacttggttccgttcaaaaacccagcggtcggttcatt aagagtggtgaaaaagttcatgtgagagtcacattcaatagtccagacgc aggaaaagaaagacccgaaagaggctcagacaacaaaaatcatgttgcaa tcttccacgtggcagctggtgacgcgaagacgtacaaagaagcgtttgcg aagaaagcggcggacggagttcatcatttctttgcaagac >AW057321 ctggaggagactatacacaaagagcaggaaatgattcaaacaccggtgat atctgcctcatgacgttccaagcggtggcaactgatatggatgctgctcg tgacttctgtaacatcaaagctccgtggcggcttagagaagcaaagattg ataaatcacaagacagtattccggtgattatctgcgacgttgaagctaca ttcacttgcaatgccggatggattcaaatgttcgggtattgcttcaagat gagtgaggtccatgatcgctacacacgtgaaaaagctgagcaatactgta aagatcaagctgggccaagttttcaaggagaaattgccggcattcatcac agatacatcttgacgccttggagaagct >AW057322 tgccaaaaaatcagcgaagaagttaacaaatgatgacgtggatgttttga cgaatcccagcgacgaatacgtggattctttcatgaaatatcacggaaac ggtcgagcagtattcaagcgtgaagatttggctcagtggcgcgatagctt cccggactacaaattcaaagtgatcagcttaaaaggagcgccgagagtca tcgcaacggcacatttgtgcacatttcgcctcattgatccattcatccat taaatccgataatggtcattgggttccgtatggatagatcctggatttag atctcccggaccggcaaaacttcagaatgatatgtgtcgagtggaaatgg acagggaagatgataatattgtcttacagattaaccaaccggtcaaaaat ctctggcacatgctgtccaagcggtaagaattcttgtatctcggacacaa agccggcgacgttggctacaagacattctacagtgcacacgacgtcgtgc tccctgagaacttgcatttgtctggaattacagtgatagatgctcgagat gctccgaagagagatatcataaattatgatcagacactccatccgtatca ccgagacaagtacatcatctatcatatgtacgatcggtatggatctcgtg aagtgtcgtatgatgatgtagtgaatcgtgtatcgtcatctggcctagtc tggtatctatgatcatcagatcagatgtaatttatggcctaattatgctg tatatacacgtgtcgctacagcaatcgtt >AW057323 gtcggatccggctgcaaccagcccctgttcatccaatctccaggttgtgc aggctgtggtggcacccagaagttcccagtgatgaccttcaatggtgtgc tcgtcggagaaattgtccgcctctacccgggattcatgcaggagatgttc accgatgcggacacctatattgttcatttcccaatggacatgccgccaat cttgaagcttctgcttgtcacctcggttttcctgatcgacttcacctact tcgaggatcgtaaccaggatcaacatcgtaacggcggaatgccttaccga acgcatagcagcttctaaacttataatg >AW057324 ggaattgacaatcaggttcaatcctcacgcgatgatcgtctggtgtggct gacgttctgcccaactaaccttggttcaactgtacgagcatctgttcaca ttgctcttccaaaactcagcgctcgttaagactccaagcgattctgcgat aag >AW057325 
gtgaatggttccaaggagacggtatggttcgtcgcaagatcctccctatc gag >AW057326 ctggatctatcatgtcaatggctagtgtttatgcaagtgtcgttgctcca gatctgacaatctaccatggagatcgtaagcaatcctaccattttcgctg acaaggggaaaatggtcgttatcaaccggaaaaatggggtgattgtctac atgcttcgttgtgtcgacggccgacgtgtctacattgagaaatcttccga aggagccagtcttattctgactaatcaacgtggaaaagtgat >AW057327 tttttacgatacaaaaatcactttactcatttacaaccaaaacagaaatg atattctcgacgagaccagactttcccttgaaacctaatcctcactgaca ttgcttgtcttgaaggtgatggaagtggtttcacggctatccaaaggcat tgacactgacttgctgttcttgttctttctgattttcagactaccctttg aagatctatcagtgttttccgacttttcagttggagacttttgaacgatt cgatggaagaagacgagttggatttggaggcagctctggatttacttttc tcaatatcactcattgaagttgatggggccttcattttgggcttctcctt cttcgcatcaccagcctttttcgttttcttatcaaggcatggaaggttca aagcagttttgcggaaccacacaccgacaacagctgccaaaatgttgacg gctgtgataattgcaagtgcgatccaagatccagcgggttgattgtcgaa gatcaagtagacgagccagtagacttcacacgcg >AW057328 taacgtgtaatattttagacgcaaattaaccacttccaattctggaatat ctggaatcattcagaatggtagttgttccagcaacttcttcacttccagc caacagcacaggttcaccattttgatcgtccctttggctatcgtcatcaa agaagaacttcttatcaccaatgaaatgttgaaccaaaacccttccgatc ttgaaaaggggttgacagaatgcacatagcaatactatattgccacgttt ttggtacattaacggatgtttctccttcttttgctccaggacataagcat gagcgagcaatggccattccgtctcaattcgattcttttcgtgagtgatt actgcgatagagtaaattaccgatattgctgatacaataaccaagccagt agtttgtacctgtccaacaaacacgtcaaattcacaaatctcatgaactg tgtagaaaatcagacaaacacaagtgagagccggcaagaggatcaaatac atgctatcccagattgaatagtgagaccatttcaaacggtgacaaatggt gactagatgacttgtatcaatcagatgatcgttagatctcaagttctcag tgatactgaagttcga >AW057329 atgctaaagttcaagtattagacggactcagtgggatgatcatcagacgg attatcaagtacaacacgcgaaagctctgtctcattcttctcttcttttg gtggacggagcatcgaatgctcacggcactgagttccatcaagacgagta atttgttgcaagagatgaataacctttccacgataagtatccactgatgc aattggattgaaatcgaggtaaacactattgagttgtggaagttcaacca attcatccataatgctccagttatccaacttatttcctcttgcccaaaag tctgtaagtgtcttcaattgatggatattctcgaccttctcaagacgatt ctgattgaaatccagaatttcaagaggaagatgctcatcgattccacaaa cgtacttgatgccattttgagccagataaatctctttcaagtt >AW057330 
tgtatctcgagctgctcacattcatcgatgaaacatcttctgcagttgga actgcgaacatatcaaatacatcttctgcctgattcttggctgattccac tgccaggaagattgcacctcttagcaattcaccatagacacatacacacg accatgtgacaatcttgcatccatagtctatggagcctagaagcaactcg ggcgctctactatatcttgcgacatgatagctgtgatgtggtgtctccac tgatgcagtcgagatgacgcatatcacgagtctcagacgacagcgcctgc attgtcagcagatctgtgctcgatgcacg >AW057332 tatttaatttcaataagactgccacttctcaaattgaataggactgccca ttaatgacgagttttgccaaattcttcttcttcatggacccatttaagat tccagacaaatcaataacatgatcaccgattgacaaattggtagaaaact ttgtgcgattctcaatgttcaattctggtactcgattgcaattgttttgg aggaattggagcatcatcaatacatcgttcacgttttttggacttttcac tattttttttcttttgcttttcaagaggctgaagccacagcttgacgcgt ccatcagaggaaacaaactccattggttgataggcaagactattggattt agcagttgtagatgcatatgaaaggtcgaccgacaccattgaactcggag acaacgccattttaggttttccagttcccgcatcgtaagttgcttttgga acaaggtcatctggctttgacttcttcagtccacggcacttcaagtgttt gatgtcctccaatggctcaattgctagtttttcaatgtgcttacngtaga gcgatttgttggaacaagatgaagcagttgctgcacttatcgaaacgtca tttgagcctctgcttgccggatgtagatgcacttgggaacaggagtccgg ggtcgtgatcttgtcatagtgctatagtattcagaatacagtttcaatgc tcgattggaaccttctggctgctggaa 3.fa100644000766000024 12052714006427353 16011 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data/dbfa>AW057334 tttcatccatcaaaacatttttattcaatctagacagtgctattcatgac atcttgctgaattcttatcatcacttgaccttgcgaagagcttttccttc atgatcatgtccgatctcgtgagtttgagtttgctcgatgatagaaacac ggcgagcatttggaccggtagacattgcactcttcgctggaggcttcagt cctggaagagcttgacgatattgcgaagcaatgcctttcacatctttcgt tcatttggactgatggtggttgcatgagattggtagccaacgcctgtttc gcccattgcttggcatcgatcgaatcagggcgctcgacgacaaaagtgac ttttcctttctcttggatattcttgacgagaagatctctggcaacatcct tgtcactgactggaattccatcaacatcacacaaatgatctccaagaaca agacacttctcggctagtgatccaggatcaacacgtgaaacaagcacacg gttctggaaatgcttgattcccaaaccaagttntggtccattttgaaccc agacaagggttgccaactcataaacgtatccctcacggcgttgaatgatc tttgcacgatcctcangaatatgaacacgagcttctaactcttcagcctg tgtatcgtcacggctcaccgtgattattgcacatggagcagcacagcgaa gcgcacgngaaaagtcgttacaatncttgcaattntgtccgtgcactttn 
ttacttggcaccagtgctcatcttgctcagaaatgcttcagcctggatct agtgatacgagcttg >AW057335 ggcggtggttgatcgggcttaatcacattgttattcgcatccaaggctga tccctgatcatttggaacagttccagcgtttgctggaagctttctggaca tgctcttctccatatgagcaacaaatggatcaatcagttggtggtaaatg acgtgggatccgtgagtttctggcaagtagaggtacaggaaaaatgctgc tttgcaacccaataaattgggaaatatgacataatgattgcggcgaagaa gtcgataattgaaaaggcaccgaaaacagtccaatagatcaaccacattg tatcatcgtcagttccttcagttcggatcgccttaacagaaacataagcc ggataggcaactccaatcaaattgcacacaaactcggcgccacttccaat aatcatgtagacacaattgagcccgatgagcccatatgcgagcatttcac gtttcagaaccagtgcgtcttccaactttttcacattctctttgtagaat ggtccatgatcagcgtacaagaaggagaagaagtcactctgagggtctca aaatccttagcgacgttgtccatcggctggtttgattccacagattcttg tcatcgacat >AW057336 ttttttccataaaatacttcattcttaataaaaattcaaattcctcgtag tttatcactgatccgacgcctcccaaattcgacccatttcaattgtgaca aaactcgtgttcttaagatcttcccgagcatccataagctctttgagctt gtcacgtggctccaggtagtactcagtagagcccatatggtacgttcccc agtgtattccaatactgttcttagcccgaatgagtttatgaaccctcaat cgcctcttccggattgatatgctgggatttcatgaaccatctgggctcgt atgctccaattggaatagctgccagatcaaaaggtccaagcttctcgcca aacttcttaaactctccgtcacagtgaccagtatctccggaataatagaa tcgccgattctcgccgatcaccgcccagcctgaccataatctgtggttcc ggtcgaaaagtccacgctgtccccagtgttgagcaggcagacaccagatg gtgtaggtctttccgttcttcacaaattctgagctctctccccagttgag ctcggtgacagcggtggaggatccgtcgactccgatgccctggccttcca tccatttcttcattccagcggaacgaaccacttgattgggggttgcgatc tgtgatctcttcacgcgtcagcanccaatgatcgtatgatcatgcgaacc actgcaaaatc >AW057337 ttgctgaattcttatcatcacttgaccttgcgaagagcttttccttcatg atcatgtccgatctcgtgagtttgagtttgctcgatgatagaaacacggc gagcatttggaccggtagacattgcactcttcgctggaggcttcagtcct ggaagagcttgacgatattgcgaagcaatgcctttcacatcttcgttcat ttggactgatggtggttgcatgagattggtagccaacgcctgtttcgccc attgcttggcatcgatcgaatcagggcgctcgacgacaaaagtgactttt cctttctcttggatattcttgacgagaagatctctggcaacatccttgtc actgactggaattccatcaacatcacacaaatgatctccaagaacaagac acttctcggctagtgatccaggatcaacacgtgaaacaagcacacggttc tggaaatgcttgattcccaanaccaagtttggtccattntgaacccagan 
cagggttgccaactcatanacgtatccctcacggcgttgaatgatctttg cacgatcctcangaatatgaacacgagcttctaactcttcagcctttttt cgtcacgggtcaccgtgatttttgcacatggagcagcanagcgaagcgca cggaaaaagtcgtaccatccttngcaatttgtccgtcacttttttaactt gtcaccaatctcaattttctttagaatggttcagcctgaatctta >AW057339 cttcctcaaccgccggctcctcttcctccgcgatcgtcggaatctcatcg gcacggcacgccacaaaacgtggaagctcatccggcgtcggctcccgctc gaaatccatattctcctcgtgatccgtcacacctccgagctccttgtaga tttgctccgtgagccctgccaacttgacatattcctccttctcgacaacc gagtattccttggcgataaccacttcggcaatcttgtggaacacctctgg aagctccgcgaattccttgctcttctggaactcggccttcttgactccag cctcttcgcggctcagacttccccatgggagctcgccgagcaccaggtga accagctggtagaaccatgtctcgagatcctgccgggcgccaacaagacc atcgtcactgcacggagcataatccaagcagccagcgtagctagcaatcg gagctccatcatcaccagaaatgttcttgaccaaagacgagatgtccgcc atgaagaggtgacggcttgccgcatcgtagtggaagctgntgagatccat gttgcggacgaggtagccatgcttgtgagcacatcngaacacattgagca catcctcgggccagcgttcagccgncccaagcgtgacttgntccgcatcg caaagcactggctcaacgtcgtccacccacgaagacagcnacatgtgcca cgcgtcgtcgtccagaagatctggaactgcgagagagccgc >AW057340 cactgcattcattttcagatgcagccaataaatgtgaatcatcaataaca ctatcagcatcttgctcatttccaacagtctgctttgctcctaggcttgg ctcacagtgagatgatgcttccgaggcttcttgaacatcagtctcactaa acgatggctcaatttcagctaaactagacatgagactttctgtaaccatt gacattaccgaaagattctcatccgaagtggctgccaaaatattagaatc tccaagttgggcttggataatatcttgaatgttcccaaaaaaatcattcg aactaccgtgaagatccatgccagagtcttgctaagaaatggcatcacat gcacgagcatccgacctatcttcatcacgattcctgcatccagcactctg ttcacaacccgtcattattggatcacattcgagtgtaagagtttgcacat gtgcatattcgaacatcaga >AW057341 caatacaaagaaaggttgtcagttatatttcaagtctatttcttctagtg gagcaagtaggcagctggacctccattttcccaatcgtacggagcattct cgttggatccacatgcaccataagcttgacgcatcattccgtagattgct tgatagttgggtgcatcatagggaattcggaatt >AW057342 tgccttcttcaacggtccatatgccttttttgggatgacggctgcaactc cggctccgactccagctgctcctttcttcttgccaccgcaattggctatg gatccaaggatctcgaaaattgcaatattagcaacatgattcgtgcaatc attgatactaactgaattgagaggcttggaattgt >AW057343 atttattaaatcaattaaaatcacacaaagatatcagcttgtcccatttt 
tcttcttggtgtctccaaacaatcttcttctttctccgattcaattgact ttctactgattaattgtggcattgagttgtcaaatgaacgacttgttgcc ataactctcttatccgaaattttgacgacgactgtagtgagacgaatgaa gacggccatactggcaagagccgtagcatcaattgaattactgcacacac cagagtaactccattaacagcatgacagatatccgaattaagtgttccag ttgagggagttttcagatattgacatggagaatcgaatgatttaatgaag agtgcaattgtatgtgcaagatggaaagctgagaatacaaatgcatgcat catgaagagaagagtgaagacaccaaatgcacgacgagttgagcagagtc caatgagtgacatgagtgccaatgcggtcatcaacatatctttagtatcc agagcagtctcatcgagtaccgctcatgatcctgcaactttgtatgcagt tgagtagagaccactgactaggaagatgacagtgatgacagcacacagtg gtgcttcaatgacgataagcataccat >AW057344 atacataaacctgattgacacggtaaagggaggtatccgagcacttgatc ttataagctttacgttcatttaaagtattctctaattcaattggattgac tccgccatgtgcctggtattgaagtttaacaggagccacattcatttcct tgaagtgctggatcgaattgactgcttcagcaaggctatgcccaacttgg ctatcagtcgtttgttgagattctggaataagagcaacttggccgatctc tcacgactcttcattctattctggcatccgaataagtcgacgattcttcc gctcttgaactctatagctctacgagagacttcgatgagt >AW057345 cacagttcaatctaagcttgaatagttgatccaccacacggagttgcttc aaggcacacattgacatcacttttgttctttttattagcctgaatacggc aaacctttgagctctttgcagaatgattcgacgaacggcttgatctacga gaagatctcattgactgcttctccggttgcttttcagttccaaaagatgt agttgggaaaaccagcatcctttgttcaattgacgacgtttgtactcctc aacaaatcccatcattgaaccaattggaacagcagtggagacatctcttt cgatgttgactgcagttgtagtgtccaattcttccacgtcaaccggataa tcagtcgactcgatacacgtggcatatcctggcaagttgatgactgtctg gggagccgagaactgatccgtgaacttgacagacaatgagaatgcctctt gctccttagttcctgcagttttcttctgatcaatgaactgcttgaggctc ttcgcatagatcgggcacacgtatcctttgatcaaaagattcatcttgtc ccattgggagagctctgatgagttcacacggaactggatggcacgatgag ccaatggtgggaaacggaaaacttncgttgngaagtgacggctncattcg tagcatgcaacttcagcnagaaagtccttgccagtggtgtgcggaacaca aagttncgtntgacgatctcgcggccagtggaatgtgcaattccgtgggt tgacttctttgacaggcat >AW057346 ctagaacgcatttgtggatgctatttaaatgttctggctagatttctcgg gggaaactgaataaactaaagcttatcaacatctgcttcgcatcatcggc aaactcttgttcagcgtcggaatacggttcgatgtcacacacttcggtaa ggtacttgcgagcgttgctcttgtctttaattgctaccgagcatttggac 
acataaaatgtggtctccatccattgtggattgaccttataagcagcaag gaaatcctcgttggcctcttcataggaatgtgatggtggttgctgataga acgtggcagcaagctttttctcaagccatgtcagagatgccaccgagtac ttgtaacggccacgcaaatggaggagagccgtatcctttggctccttagc aagagctttgtcgagcaattccttgaacttcttactgcattccaactgtt ctttcgttgccatatattcggttgcttgtccagtcaacacagcattccac ttcagcgccttgaaatggtttggatccttctggacagcttcatcagcaca cttgagtccctcggagacacttgccatacgttgagctcttgaacacatgc agacttctcatgaaatacttgagcangtctgcacatgactttcaccggac gatcacctttgtcgagacgcgctttgagcaaatcatatacttgatcgcgg ntcttggntcccgaagtntngtcattttcatcatacatcat >AW057348 tagattgttcagatgatccataacacgagttattcagcttttggctcatc atctttgcaggtgagaatagttccctgtgacatcttatatgatccatcct cttgcttcttgaaatccataataagctcatcattctcttcgaaaacgaat ccgtttgctttaacatgcattgaaaggttatcctcacgagtgtatgcttg aatcaccttcttatgtcagatcagttgggaatttctctttttcaataaga gcatgcatgttgaatgcaacaaattgacccttgtacaaacgattaccgca catgttgaagatcatgtgtggcagaagccagttggtgagatcatggatat cttcactggttctcacgtgcattgtatcggcggccaactttgtggcgact gtctttgccaattccaagtctgcagcatcacgagcagcgtcacgtggctc cgcggtcaccgggatcagattactggtatcagcctccctttacacgaatg acataatgccactgagggcatgaacacgaaagtccagaaaagctgcat >AW057349 ctcttctagcgattgatgtaatagtaattgagcatgcggaacatcagaat catgataactccagcaattagcagatcagcacatccgaagcatatagcca acatgtatccgtgacgattagcatcacaaacttccttgctcataccggaa ctacaactgtcccctctcctaacaacatcaagaataatttccaataaaag tgaaaatacagtaagcgatccagtaaataaagttcagagcaaaactgatt atattcgagaaatattcgggaaggaacacgataagacccatgtacaacgt ggcaaacacgagccagataaagtaagaataatgggtgctggcaatccata gactgagtgttagaattacaaggaaacacgacaaaccaaacataatgaac ttggtgggaacacctgcaatgagaatctcatccggttcagcagcccactc gaatccagctctcaaacgtgccggctcattctcattcggattttcatcct ccacatgcacgtattccaaatccacggatgcacaactgtctggattcat >AW057351 ttttttttactagaaaatatctttttatcaataaacacatattcaataca gttcggaatggacagaattaaccatccaacttgcatcctggtttagcctt acggcacaagacagtagtattgtcagcacaagattttgcagaagcagtat gctcttttccttgataaccatatccgtacttttcattacgaacatcgacc tttgccaaaatttgctttcctccagaaacattgacaacttccactgagat 
gtcacccttgtggttgtcacatgaactggatgccattctttgtctccgaa tttggcagcttctgggaacgaaatccatccataatcgaaaccacgaaggt tatcaggcagatgaacgagaagttggatagatccaacattcttggtgtat tcgttattgaaccacgagaagttggcagccacctttccaccctcattcca tacacgacccatgacaggttctgcctgcatgtaccaaagtgcaacgtatt gatctggattgacacctggcaaagtatcaagagtcttatcaagagccttg acaagtngacgagtttggcatgcatcaccctcacgaatgtccatgcatgc atctctctcgacacgaaccttttgtccacttncacagttggcacaatcac aagatggtggcccaccaatgcagctacaggtgatgatgtacatgtcattg agtagtggtccaacttgttcagcacttttncattttgacgagaccaagcc tcctcagtnctgtngtcgacataaccaagaagaatttctttcggacgctt ccagagaatggggaatgatccacacnacgcaccaagtgatggtnatcg >AW057353 tcgaacaatgtcgtatgcacagagtttctacgccgatcagaagaaagtcg agaagccagcggagcaagcttcctctcctgccacggctgccttccccgct accaccccaatcgccgaggatcctctgactccatcccaaatccaggatgc catccgtctctaccgttcggtgctctccttgtctgcgtctgccccatcat caccggttcgtcaagcagcggctccagtttgctccggaacagccgattgt gcactcggactactatggtggaccatcggatattccgttgtcctaccgtg ttaagtacaccacgacccagcaggcacctgcgtctccagctccggacttc acagagcaacagttgatggctcagctgcaggctcttcagatccagcagca gcagcagccggcaccagatgttccagtcgtcgagccagttcaacaagttc agcaaaagccaaaagttgctccgaagatgcttcacaagatgtatgacgat gaagagtctgngtactgcttcgctcgtaaaaaagatgtggagcaagaggg agaagttnncgagatccacgtggccgctccagtcaccactccgataccga cctactcngctcctcggtcaactatgaagctccggtntcaacaactacta ctcgaaaggagtcagtgggccattcgagtacattgaatgtcgaacgattg cagttcatctacg >AW057354 ttttttttcactacaccaaaatttttattcaaatcaatgcatcatacttt tcacacacgatcaaaaaatctttttcgagaaactttttcttttggagtca gatgatggaggtctaggcacaatctagttggcaaccaaaggaatgacggt gagctcgcgtggtggatcactctcaaagactcccctggcacaattctgct cagcagaagcattggatgtgagaacaatcaagtgatcagtcttctcgact ccattatgacggagcacatcgatcgaaagcttgtccccgggctcaacaaa tccgaagactgggttgacgcggtagagaaggttgtcggaagttttgacct taaatgcctttctcgacttggtgttgttggcaatgctgactgtctgaact cctccggtcgttgcgaatggaagcttgttgggcgatgcacgaagagtcga cgtcttcactgaagcagaaccctgtccctccgggaatgtgtacttcgact ggttgggaaccaacttcagagctttcgacttttcggatttctcaccggag tccgatttgatggacacgtcggaaacatctgcctggttaaccgttgggca 
gaggttcttagccttgttcaagtcgcacttcttggaggacttgtcgcatt tcttagatttcgaagatcttttggatctctgtgactttgactttgccgtg ctatcacgtctgagcttcttcgattcagacttgggacctcctgccgctcc cgngatgggagaaactcngnctttgnattgaaggctnctagattt >AW057355 tcacataatttcgttattgcaccgattttgcacgagttgacacaaatcac ttcttctggaagcaatcgcttccaattccagccagtgtctgatagttcgg atcatgagtagcagccattccaggaccaccggcttttggagcacctggag ccgcaccaccaccagctggttttgcaccttctttcttgtcgaaacaatcg tttccaattccagcaagtgtttggtaggtggggtcgtgggtggcagccat tccaggacctccaggttttggagcaccagcagcaggagaagctccaccag cagcagcaacaggagaagcttttcctttctcttggaacacatttccatca actccagcaagagtttgataatttggatcatgtgtccctgcgacagcatt cttatcacttggagcctttggtccctctgggtttggagcactcgatgatt tttctttcttatcactctntgcactggttgcaccatctcctcctcctcct ccagttntagctccaccaccacccgatgatccaccagtcttcttctttcc tccacattgtcaatgacgtatccggtgaacaccattggagcgagcatgct gtaaccgagatggggcagttgtagaagtgttcactattttctgc >AW057356 tttcctctccaatatctgattctgcacgtcaacctccaactttctcacat gaattgcacttctctgcgggaaaagagcacatcccaacggtgtcatgaag taaagaccaatccccgccaaaaatgtctgaatcggcatctcggagtactt ttggactagtggacgagctttgtagtaggcggttcgcgtgaatcgattca tgatcaccgggctcagaaccatatcaggcatagccatgagaattcggctc agtgtgacttgagcaattgcgagagccgccagctgcttggattttgcgac gagatggtcgtcctcatcacatagctcaattccctcagaaagttcgcatt ggaatattaatggcatttgcaactgcaacagccacaaatggaaccaatct tccagccaatccatgactattcttcaccatcatattcagaccgagagccg ccgttgtagccgctccagtagcacagaaatacgaatagagtagctgctga ttggtagttctgcaatttccgcttcgattggtatagttgacaattgcatt gaacgactgattgatccaatgggagaatataataccangacaggtacgat agcagctgagaagcattccagtgataaccatatttgctgggtctgagcgc tcattctnccgagacagaaacatctttttcgcggtatcaggatgatatac ttgaaatcatagagatcttttgcactcatagctcaatcatcgtcaaatcc ggatt >AW057357 atgtcagtatttagaatgttttcgaatcaagtgaactgaggattgcgaat ctgacgaattgcaggtgtgtcacaactgtcaaccaggcatcctgaagggc tcaactctcctcgcattgaaatgttttgaagccgcttctgaagttctcga tgactactgagtgtgcgcatcgcctcatcagcttcagttacagggccacg tagagggctcagaagtggaggaggatggttcatactgaaatgtgaattgc atnccgaaacatgtgccaatggttcaggaagacattcaaaattcggcgga 
gaatctgtcagtggaatcgaatggggtagtccaagtagttccggcaactc atcggaaacttctttgaggccagttggtcgtgcataacagaaaacagacg atgaaagtttaatcatcgtttcgcgcgagtattctatttttaagaaaaga tgttgatcttcttggtttcggtagcataccaattctccaccttctggaac ttgaattcgttcaccttccttctttgcgagaactggtggcaggctcatcc ggcggcaaatcggggtttcat >AW057358 cgcaacgccactttaatctccttcataagaccgtcttgccaggaagaatg gttcgattcaattgtcttgattccgttcgccagttctcgcattctctcac aagccacttcaaattccgtggtactgttatccaaacgagtcttgttaact tcagtctcgttgttcgcatatttatccgccgcactgtccatgtcgagccg agtcctattcaaattggcgatactgtccgcgtgaagntttcccctcgttt tggtagaatgacttgtatttttcacaaatcgcgaggacccctttcacggc atcagcttcggcacccattctagcatcaacagcatccataaatggtttca cattgatcttgtcaccgagacgcgagtaattctcaagggtgccagagaaa ctggccttcttctgccacttgctaatatcttccaagtttgtcatgctctt gtgaaggtccttgacacgtggcgtcagatttttgaagtagcccatcgcat cgtccacttctgcgggaagagttgtcgccttggcacgtccagtcttttcc ttaactntctgcttcanacgtccaaacat >AW057359 aaaattaaatcaatatggttgacgagaatgagaacgacgagttgctcgta tctgacggaggcgaaatgattgacgtcgagacttgatgatttccatgggt gatacgtcaggtgtggcgatccagcgaaggcgacatagtggacacgaagg ctgggtagaggaacggtgccacctttgaacacatgttttgcatccaataa cgttcatgcatcgattacagccttgtgggtcaactggagtctcaaaacag acggtacactctccccgtgcgccgatcggtgagccaactcgagctgcatg atttgaggtgccttgtgatgtgcttggctgattcgatggagcagcttcaa aagagtctgacaatttgacttcagattcgtcatcatcaagggttatggca ttttcattgattccttgtgacggcaagtttcttgcaagaacggccggtaa aacgtccacgtggcctccataaccttctcttccagctgcggctctgattg ttggctgattntgtgcattcctgggagttgttgtccttgtgacccagtcg ctgcgctggtggatcncgggccacagcagccacctgtactccagtacctt ctcttccagctggagctatggntcctgtcagatnntgtgntgagcttgag gatatctcgatcacttgatatgacagtntcaggtgtggaggattttgaga agcgctggttggctggatttgcccccagtc >AW057360 taaattaaagagatgaaagctctttctcaatagccttcgagatcaaatcc atcaagttaattccactaatcggaagctgattcgcatgaaccttgcacgg ctgtttaaccgaaatcaacgagtttgaatgctcaatcgggcccaaattct tcttcttgttcttcgggcatccttccttatgaaccttcaccacttttcca atctcttcaacatcatcaaaaactacagtaggaatcggagctgtgccaag tttgaattgatgttccggagtaaaatgatcgtgagtgagacttccagttg atgtgacaaggtcttcaagtgaagaagcttccgaagacacaatgtatggc 
ttgcggcactcaattcccattttcggctctcttttggtgcgatccagaat atcagagactcgtcttttctccacattcaacaaccatctcgatgcatcaa acttctcaatattatcctccatctctcgaacatcttcataaacattcgaa tgctctctcatcaacaattcacatttcttctgagcttcttcagatttcag cgtcaattgagctcttccactgatcacctccttcaatcgattctcgaatt cagcttcctgatgtccaatctctttacacattgncttctcgntgaagctt ggtcgaacaaattatctcgtgagaanggacttcagttgacaagttcttgc gtggaatgcttcacccgtgatcatatcaactcccatgggctcttcgatag aa >AW057361 gacttcaagattgatgtcgaatcgattgtttaggcagccaccaatgggac aaccatcatctcacgatgctccgcttggttagcaaatgcctcatgaggat tagtagagccgttggcattggcagtcaagagcacaagtttgtcagttttc tgctctccattcaatctcaacacatcaatcttaacggaagaacgtggctc agcaaatccataaaccggattgacacggtaaagggagttatccgagcact tgatcttataagctttacgttcatttgaagtattcttcaattcaatttga ttgactccgccaagtgtctggtattgaagtttagcaggagccacattcat ttccttggagtgcttgatcgaattgacttcttcagcaaggctatacccaa cttggcttccagttgtttgttgcgtttttggaataagagcaactttgccg ctctttcccgatttctccattttcttctgggcatccgaatcaagtcgacg attcttacggctcttcgaactctttgagctcttacgagaagatntcgatg agttagattgcttatcagaaacaccagtagcagcgggaacagttganact tgagaacccagagtggtagcagtcttgactttcttcgatttcgaagactt tgatgattnccctccacgcttgctgctttcctttncagattttnccgacg ttcccttcttactagacttcnacgatctcctggagtgcctagcttcga >AW057362 caaatgggggtttttctttgttctggatatctcaaccgacatgattctag aagagaatgctctgtgcaatcatcgcagtcacttgataaggatcgcagtt tgatgacggacgacgatcctccagatatccttttctctccgcagccacct gtctcggaatacggattgagcatccacgattggcgactccccatgagaat ttgtcagccgaacttgtctcatgacgtcctgtcaaacgacgaagattgtc ttctccaccatgtggatcgtacaccttcatcgcctccaaatgtgtccgct tgagtcctgtcatggcggcttcaatcgcagcaattccacctggagcacgc atttcggcagtcgagaagttggtgtggcatccggctccgttccagtctcc catggtgacctttggtttgggatcaagggatacgcagacaccgaa >AW057363 atttgaatctcttctggctgctccgttgtttcagcgccgcttggagatgt acttggaggctctggtgaatccgttggctctgcagtggacgctgttgatg gttgaacggtagtagtcgtggaagcttcagaacaaattccagccaattcg cacaacgctggatacattgctgagctcattttgtagtaccacgggtattt ccggcttccaatagttgctgctaaatcgaatccaaccatccaaactctgc cggaatcattttttgtgagaggtcccagatcgaatcccttttcggtattt 
ggaacagttccgtatagaatatccggactaacaccggcgccgacattcac ttttcgatgcttccattccgaattgtttgtacatccataagcgtcaatgg catctcctttggcaagtttgattgattcatcagctagacatggataactt gagttgccttccaaattctcatcaagctccaaaatcattgggaaggagta cactctnttttgcaccaatgttgatttgcaaaagttcagaatatatgctc tggcaaaattcatcactttcggngagcanttctccttttgagaaaggcac ggctcaatcacatccgagagcttgtttaccgaactttccagtgctttatg ctgtcatcacaatcaatcgaatctncgctattnttccaagctgatttgtg gagaaaagccgnngagagtgatcagtgtcgagagatatataa >AW057364 aatatctagttcatctgagagctggtggctccttcaccagtagttccgct tgtcgaatcagcctctgtcttcttggtctttgacttggagctctttgact ttgaagtatcgtctcccttctttccacctttctttcctttctttccatca ctgncttttcccttttttcctcccttcttctttggcttgtacttgtcaaa cacagacttcgaacggtagcagacgaagctcatgactgccaaatgacaaa aataaaaccgcanatgaggaagctaatgagcacatagagccatggttgac tgacatcacgggcgattttcttcagcggatggtagtagcagacgtgttta ttgatattaacatacacgcaaacagttcccgacttgcagtctttgttaga agagcatgccgatcccttcacctcagtacgttcttctccagttaagtgcc ttttgcccaaaataccaacgcgattagtgtcaggatcgcaaaatgcatag gtgctgaattgagatgggataggtttgttagcatcacacggataattagt caagtatggatatggggcccaaggtaggcgtcgctgactcagtaccattt ccagatggtgcaattgtcagggtaagtgatgggcagcantgaccaaagcg acagactccggccctcgtcttgcacactccagaacttttcttttcttcaa cacatctacacactttggctcattataccatgtgtttgatagtcc >AW057365 agagagaaaaagagaaaaaagtccagaattcgagaagttgagagaaaaat cgaagcccatctagttttgtggaatcggtgtcaaattggggcgagcttgt gagagcacgtggtccacgaacgagcaggtgacgtcacacattagacggcg cgaacggcagtatctggcagcaaactcacagtaggaggcaaaaatgatcg catccattctagtgaacatatttggcaattgacataaagaattgctcaac aacgacggcccagtagcgagcccacggatagctgagcctaacgacgagag caatgacacgtggaatcagaatttccaagttgcaaaccaccaagtagcac agaatgatgccgaacacgagcttggtgaatctcggattgttgcggaaaaa gttgccaatcgacgtggcacagttgccgagagtttgaagacgattct >AW057366 gatcattaaatgaacttttggatttcatcgtagatggcaagaacgagtgc tccaccagttccacggaatacatttgacagggctcccttgaacatggcgg acattccttcgtttttgatgatcttgacggcacaatccaaagtattcttg tagagaacatctttgcgaccagactgcatcatcatgcgacgacgaacagt atcccatggataggagaggattccagatccaacagtaaccacttgagcaa 
tagcccaggcagcgaagaagttgagtttcttgccatcagcagtgaacacc atcttggcagtgtcgaacattccgaagtaagcggcgcggtagatgatgat accttgtaccgagacaaagaaacctctgtagagtccgattggtccatccg actttgcgatcttgacgagacaatcggccaggcctttgaattcacgttca ttagcttttccgacatcagcagccaaacgggtacgagcaaaatccaatgg gtagacaaaacagagcgaagtggctccagctgctccaccagaagctagat ttccggcgaagaacttccagaaatccttcttcttatccaatcccttttgg aaaatgntnttgtagtatccttgaaagcgaagtcagcgcctgggtcggaa gtatcggatgacgtagcaagtttctctcagagag >AW057367 acagagatcaatcatttttcaacactatgtgcacgatccttccacaaatc attgtactggcgaccagtgtcaatcatggcatccgatactgtctctccaa gctcacgaatcgatgaatcgacatctccggatgaagattttctagctctt ttcgaaaaagaagatccctcatcagtgaaacgaacacttttcgagttaac ggaagaaatagattcacgacgcaaaggacgttttcgtgggctattgatac ctggagtatgcttcttcatcggtgttggtggtggaacaggtttgaatctt ccacgaagatctggatgagcattcatcagagagagagcaacaaatggaat tccgtcaaccaaaagctcaaaagcttcaaattctgcatcttgtgcgtatt cagcaacaaatccacgaacttgattacgagcagctggattcttgataatt gcaaacttttgaactggaggaatatgctccttctttggtaaagattttct cgaaagagatcttctttntttcatcggagatttcttcagaatgctnttac gactggaacctgntgattttgtggatctattgcgtcgactctcaatttca gcttngcaaagtcttcttctagcactttgcgaaatcttgntacggttccc cagaagtggttccaaattttcgatctttttgcttgaaatagatctttcag atccctgtcccgagaatccagacgacgagctacgaggaagcttgctttta >AW057368 ttttaggggtttcacagcagtcttcagagcctcggctgattgttgtgggt cggacatggcgcgatgggagaggatggcttcatcacggaatttcagaata gcatcacaatgctttggtctctgatcgaaaatcccgtttttgatgaaatc catcagtttctggcgacaatcatttctccaattttgagcgttgacagtga ttcgctcggcttcctcagtgcgatctttgctctgttgggtcagcgcatcg tcataggctttcaggccatcccagtacactctcctcaaccgtgcgtactc ccaatagtcttctccaatgaattttgtcagcggcttcaaattgtccaact gcttgcgaacacgagattcgcgttctgcatccaacagagaaagcgtacta atcgctggctccaagctctcatactttgtcttatcccgtcccttatttgc aattgcctcataagttttgagatattttccagcataacgataatttggtt cagttttgaagtgagtcaccaaatcctttgaaatttctggactctcgacg agcatgaacaatagtgcctgatggagcttgtcggagcatgtctttgtagc ttcaaccttcccgcacatcttctttacgtcttctgtcatcttgatttttg cgcgtttcctcttcanaatcgatgcnnatagcccatgatttggatgatca gc >AW057369 
tatatctgtttgtccatttgctttcaatctctcggctaaatccctccgta cgcaacattttcagtccattcaaattgttgacgctcgtacggtgacgggg aaattccgattccagtggccacatctttcgaggatcccgagttgaaatca gtgtattgagacaagttgcgcatgttgcgaagttctgcgagagattcggg cgagtagtcacggaggacaagtgagtttgtaagcatctccttgagctcga attctccatcagtctgctccaactcgtacattgattcggngctcaacatt ttcttattaagtgttccgtcaggagacgtggatgggatagctggagcagt gaaagctttgatcggaaatgagatgttcatgttctgaatatcagcaattg agtcagaagagaaggcgttcttgttaacagtcacatcatttggtggattg atgctaaccattggtgcctgttctccatccatatcataatttccaaatgg atccttcaagttcttgatatccataatagtcagcaaattttcttcgnttt attctccatcttgtgtgttctccttcagaatccataatatgttcaaagta tttgagatggagtggggaagtttggatgatggngaatatnatgttccagg cngaacgagtgtcaactttagatgaagcctttgaatgcaataaccttccg gnactacataaccgactttgagacttttccgaaagtccgttgctggatac tagcca >AW057370 tcgattccatttccataaattgtctcacgatcatcacattttggcgtgct ggcagtgggcaagtctgaatctcacgggccatgtagagcttgaagaccgg aacttcagcagcggagtcttcgaaatgctgaaaatattgaattttcaacc agtccatattaattttccaaatctcaacatccatcgtgacgatggtgtag acaaataaaaagatgatgcattgggcaatgaatccagccattctaattga cttcacacggattgcatctgcatcttcttcaagaatatcagccactttga tgctctgagctggagcaaaagcagatgtcgtggtggtgttaatgtagact tcaagggacgacatcactggagcaagcacagatgttgatcttggagccgt gatgtttgagtagaactgcactgcctt >AW057371 ataagctttctgcttgcaaaggatttttcttctcgcgagcctcggatttc gctctctacgactgcagcgagaagtacggtgtccagatagctgatccatt tgatgaagatgctctccgactattcaacgatcttccaccaaaacaaataa ttgcgccagactcgaaattgttccgttcaagtcccacaaaagccggaaaa tgttccgaccacgcgatctcgttctgccaattatccgaaaaggatatgcg agttcttcaattcggaatgtcgctgaaagtattcggacgcggaggtcttg acacttcttcacatgagactaccgatttttgatcttctccagcagcttct ccagaatctatccaatagtatttctatcatgggacgtgtatttctcttct tgccgggtgtaaatttaattatttttgaaattaattatttttccataaaa ta >AW057372 ttttcaaataaaattttattccgttcaattcaaatgggggtatttttttg ttctggatatcttaaccgacatgattctagaagagaatgctctgtgcaat catcgcagtcacttgataaggatcgcagtttgatgacggacgacgatcct ccagatatccttttctctccgcagccacctgtctcggaatacggattgag catccacgattggcgactccccatgagaatttgtcagccgaacttgtctc 
atgacgtcctgtcaaacgacgaagattgtcttctcaccatgtggatcgta caccttcatcgcctccaaatgtgtccgcttgagtcctgtcatggcggctt caatcgcagcaattccacctggagcacgcatttcggcagtcgagaagttg gtgtggcatccggctccgttccagtctcccatggtgacctttggtttggg atcaagggatacgcagacaccgaattgttcagcaactctgtgcagaatgt atctcgacatccacaactgatctcccatatcgattccttcgcaggttcca atttggaattcccattgtcctggagtcacttcggcatttgttccaaaaat gttgagtccggcgtgaagacaagcccgggtatgtgtctcgacgacttctc tcccgaacgcacgatcagctcctacgctacagtagtacttntccctgtgg atcccgggaatccgtgtctcggccatccgagtggntgctcatctctgtcg acgatcagatattcctggctcatttcgaacacggggcgttgtcacgg >AW057373 agacagttcaatcaatggggaaatacatgtagaaattaatgcgatccaga tatagacgaagcagcggaagaagacgactttccattcggtttggccagcg aggaggatgcagaagtcgaaaatgagctctcagagctgctggaatccaag gaaactcccgatttaccaggcgcaacttttgagatcatcgagtggaagtt cttgttagcagcagcatctctgaaaacacgcattttctcagccaacagct tagcttcctccatgacctttggattggaagatttcaggccataaaggcct tcaattgctccacggaaggctttctcgatagcttgctcatcagtcgcatg gggctccaacgcatcttgctccttcttttccgtacgatgcaccatgtcag taacttgatccatcaaatctttcatgtcaacatccttctcatacttgtag aatgccttgttaacttgaatcacagaatccttcagaatattgaccttctt ttccaacagctggacgtccttgaccatcagtgtagaggccaacgcctgat cagcagttgccttggccttctccattntccgattcatccatttcgaattn tcagcgcgttccttgtcaatntggtgatccaattcatgcatcattccacg gnantgatcgaaatcctcactcatcngaatttgcatgtcacggagctcca aaacagtgtgacgaccgactaggtcaccttcacgaaatcgatcgatcgt >AW057374 ccgagcgggtttgtgttgaagtcatcgacggtgaaaaaatcaacttcctc atttgatggtggtgtgacgccttttctggggacatttccaactgctggaa cacttggcttcttctttctcttttcgggctcccaatgatatgggtcggac cacttgtacttcgcgcttttcatggcatcattcaagagattgaaaagctt ttcgtagtcggggcggtgatagaattgcgtggcccgaacgatcttgacaa actccagcatttgaattgggctttttgcaaagagattctggtcggcgacg tgtcgcttcatttccccgatttcgaccttatcatccaaatcagaccaagc cagctggcatctcaactccgcgagcatgtagatcaaagcccacaggtcat cgactctcccctgctcgaaacgatcgtgcatagctaccgagcaataacgg gaggtgccacggaagagagccttctcacgtgggcgacgcatcttttttcc gtcctccttatccgtgatgtactggcgtgccaagccaaagtccagcacga tgaagtagcgttcgtcaggggagcctttatttcccaaatgccacgttgca 
gggctcaaagtacggtggataaacccaatatcatgaatctgcttgatgcc cgacaaaagggcaatcccgatgcgcaaattggtaagacacgttgaaaaat tggtcggttctttcatagtgattcaggctttcgccggagaggtca >AW057375 tggacaataagacggttgaaggatggttctcgttcaatgtcattgtgatc aaacaagtcggcccacaggggtatgaatggtacatcatcactcgcaactg tatcggaggatcaccacactgcgaatgcgagaactgt >AW057376 ttgtccatttgctttcaatctctcggctaaatccctccgtacgcaacatt ttcagtccattcaaattgttgacgctcgtacggtgacggggaaattccga ttccagtggccacatctttcgaggatcccgagttgaaatcagtgtattga gacaagttgcgcatgttgcgaagttctgcgagagattcgggcgagtagtc acggaggacaagtgagtttgtaagcatctccttgagctcgaattctccat cagtctgctccaactcgtacattgattcggggctcaacattttcttatta agtgttccgtcaggagacgtggatgggatagctggagcagtgaaagcttt gatcggaaatgagatgttcatgttctgaatatcagcaattgagtcagaag agaaggcgttcttgttaacagtcacatcatttggtggattgatgctaacc attggtgcctgttctccatccatatcataatttccaaatggatccttgca gttcttgatatccataatagtcagcaaattttcttcgttttattctccat cttgtgttgtctcttcagaaatccatatatgttcaaagtattgagatgga gtggtgaggttcgatgatgggaatatatgttcagccgacgagtgtcactt >AW057377 ttcagctcaagtatctagtaatagtccagttatttttgctcctcgtcttt tttctcttcatctcccaaaatgtcactggcttgacgagcaattgttttag tatgactagtagtatttgcacttgtgagattccattctgagctcttttcc cacaacttttgtcctttggatgtgaaaccaagaatttcaacttctggatg tctctcagcaagaagcatgaaattaacattaagcacaagtcgtttcttgt caatgagcatatccaattgaacactctcagggaaagcacgttgtccataa gaagttgaaggaactgtttcagaattcactttgattggtgtcatttccaa agtttgatgggtttctttgactgcactcaaagtcacaatattcgactcat ccgtctgattgaatgttctcttcaaagttggagttgcaggagatgaagct ctcgatctggctctcaatctctcactcagattcacttcttcctctttcac agtctcttctttcttctcactaatctcangtggctctccatcttcttggc gaactttacgaatgactatagtctccttcacgcgaagtgttccttttcga gattctgagcagatctgtaccagtagagctgcacacgttcttatgcaatc tctacagacataaaggcgaat >AW057378 tttatagtcacatttttattcaaaaaattcatagggctcaacatgatgat tgatgagacacgcgtgctgatgagccgcattcattgctccgagcattgga ttcattccggtcaagtaggtgtctggactattttcgagtggcgtgaagaa ggtttcggggagctccaagatttcatagctccctccagtgacgggatcca gcaaattcggcgggttatacttattgaagttgaccggcaacaccttcgtc gccgggatgtttgccttgatagatggatcatagacagcagcatccgaatc 
aatcaatagaccacctcgatccttcttacggcatctgcacactttgaaaa gtgagttggaaggatccatcgagtgagatgacgaaaattccgcgcttctc tcgattgccgtctcgattgcagcggtaggagcgtcgattgtcggtggagt agccaacgtctcaagaagcggagcttggagtggcgatggccgtggtttcg gagcgatcggcgagccatatgcaactctagacatcgacaacggctccgac atgctatgctccgcggagcccctctcaacgcgctcaggcttatggtctgg agcctgcgccggcttgaacttcgagtcttgcagctcgcgacacgagttgg gcattgctcaattcgctggcgtatcgatggactttggtttcaacgagctt ggttgcacagggaaattggtcaactgcgatgtctacagttctgcg >AW057379 aacctaaaaatcttcatctagttgaataagggcgaaagtttatagtatta gacggactcaccgggatgaccatcagacggattatcaagtacaacacgcg aaagctctgtctcattcttctcttcttttggtggacggagcatcgaatgc tcacggcactgagttccatcaagacgagtaatttgttgcaagagatgaat aacctttccacgataagtatccactgatgcaattggattgaaatcgaggt aaacactattgggttgtggaagttcaaccaattcatccataatgctccag ttatccaacttatttcctcttgcccaaaagtctgtaagtgtcttcaattg atggatattctcgaccttctcaagacgattctgattgaaatccagaattt caagaggaagatgctcatcgattccacaaacgtacttgatgccattctga gccagataaatctctttcaagttgtgaagtcccgaaatgttatcaactac agtaatcgcattggctggaagactgagaactgtgagcttcttcaaatgat caacattatcaataagacgaatctgattagcgccaaggcacaatctatcg agtttcagattgttgtcgagattctcgatggtcgcaattctattgtcacc caattcgagatattgcagctcagtcaacgtatccaaacactcgattttag tgatatgttatgaacgacaagagagcctcagcttgtcactcgtcaaat >AW057381 taccaaaaaactcaaaacaccgtcagcgacttgttgcatgcagaaataaa tcagaaatgattgccacttatccctttgcgttgagatgttgtggcgcctt gtaagccaactcgtgtacttctacattgtacagcttgacaacgattccgg aacgagcagccgtgagccaaatagctgaagctgagagtccgatttcgcaa ccaatgacaacggtgtttcagactttgattcccttggcggcgaaggctcg atcacaatcagccttcttcttgaatcccttggcagcacaaacttctacga ttgtgaagagcaattgatcctt >AW057382 cagcttgtcccatttttcttcttggtgtctccaaacaatcttcttctttc tccgattcaattgactttctactgattaattgtggcattgagttgtcaaa tgaacgacttgttgccataactctcttatccgaaattttgacgacgactg tagtgagacgaatgaagacggccatactggcaagagcagtagcaatcatt gaaatcactgcacacaccagagtaactccattaacagcatgacagatatc cgaattaagtgttccagttgagggagttttcagatattgacatggagaat cgaatgatttaatgaagagtgcaattgtatgtgcaagatggaaagctgag aatacaaatgcatgcatcatgaagagaagagtgaagacaccaaatgcacg 
acgagttgagcagagtccaatgagtgacatgagtgccaatgcggtcatca acatatctctagtatccagagcagtctcatcgagtaccgtttatgatccg gcaactctgtatgcagttgagtagagaccactgactaggaagatgacagt gatgacagcacatagtggtgcttcacttgacgactagcatcccat >AW057383 tatcatccgtagttgattgatatggtgtgtacttatccgtcgagcttgta tccagccttggccttacggcagagaacaatggtgttatcagcgcagttct tggcagttgcagaatgctcctttccttggtatccataaccatacttctcg ttacggacatccaccttggcaagaatctgctttcctccagcaacgttaac aactccaacggagatatctcccttgtggttgttgacatgaactggatgcc aagctttgtctccaaactgagcagcctccgggaatgggatccatccatag tcaaaaccacgaacactgtccggaagatagatgagaagctggatagatcc aacattcttgcaatactcgttgttgaaccacgagaagttggcagccacct ttcctccttcattccagacacgacccataacaggttcaccttgcatgtac cagagagcaacatattggtctggattgactcctggaaggtgtccagagtc ttgtcaagagccttgacangttggcgagtcggcccatgatctccctcacg aatgtccatccattcgtctctctccgacacgtgcaactgccttttgtggt gtggagcagctccgcacacagcacatggggcaaatggtgtgtcctcccga tgcagnttgcgcgttgatgatgtaccatctcgtttgagctgtgggcccga gcttgcttaatcacccttc >AW057384 catgaactctatctggacatctctcnttgagcaattctcgctgtttccgt tattgccctcaacgagaatgaaaacaattatctgcaagagttgctcgatg ctggaatttcccaggaaaccgctaacaagcttgtagacatcacagccagc cacaacaacgatggagaaatttctgagaaatcaggaaaaactattttcca agaaatcatttctgagactgatgcagctatcaaacaagcaccagctaatg atcagcaagcctacaaggccttcgttgaaagcaaggcagctgaattcggt caaccagacgagatttccattcaagttgaatctgattccgaataattttc taaaactcaagtatcgtctgtattacaa >AW057385 gagatggaattgcctttcgccgaccatcggcggagcctctgcgctttgtt atgaaggacagagccaaccagctagacgcactcaacaaccgatgggccag aacccccaggcaggtgcaagttacccacccaacatgaaccctgtgacaaa cggaccacgagcttggacactgaagcctggtggatacatacaatgggccc aggatcccggtagcatcaaatcacctaatccaccacgaggactcgtctac tatcagcctgagaactatacatacaagcctggaaaaggtggaaaatatat gcatgcgtgccctatcctccccaacgcaaacgaacgactctcgggtcaaa gcgaccagtacaccgtatccgacagaatcacggtacactggagcaagact gcggcatacagtctaagagtctatgtgctctatccggtgcagacagagag cgaggaggacatcttcctatccggcactattgctcaagtcatcgcgaacc tttatgacctcaaggaattacgaggaaacaagccgccaggtgtgagagca cacagaccgatgctgatgccgcaatagcgtagngcggatccacttagaag taggaagcgc >AW057386 
ggataaacttgacatccattttgttcctctccagctgatgctactgatgc tcaagccgctttcgctgcagtgactcctgctggaactgtcaccattccaa tgtcggccaccgcctaa >AW057387 ttgaatgtgaataacgatattatctctgattattccaccagtacttatga gccaactagcttcactccaacttctggctccttcacccacacttcaacta gcggtgaaaatcttcatacaatgccaaatgagttgcagcctttgccactg tgtgatatttcatcggaatacgagtcgaaaaccgagccggacactggaaa acaccaggttgacgtgattgccgaggcaaatgggatggagatgaagctca atatgactggaatgcttaaagggcttaanttgaact >AW057388 aattccaccgcccgttttttggtctctcattgcggagagcagaattcaaa gagccgtacaaaagaagacacaccgacgccaacaattctgatatcatgag ttggaattgccttttgccgattttcgtcggattctttgtgcttttttatg aaggacagtgccaaccagctagatggactcaacaaccgggtggccagaac ccccagccaggtgcaagtttcccacccaacatgaaccctgtcacaaacgg accaggatcttggacacaaaagcctggaggatacaacaatgggccaggat ccggaagtttcaaatcaactaagccaccacgaggactcgtctactatcag cccaagaactatccatacaagcctggaaaaggtggaaaatatatgcatgc ggaccctatcctccccaacgcaaatgaacgactttcggctcaaagcgacc agtacaccgtatccgacggaattacgggagactggagcaagattgtggca aacagtaaaggattctatgtgcttcatccggtgcagacaaaaagcgagga ggacattttcgatccgtacactattggtcaagtcattggggtacctttat gaggttncaagaatttggaggaaaagannagccggcaagtggaaaagtac accgattgagctgaatgggcaataaagattccggaccatttgaagaaaga aaagccg >AW057389 aattccgaacaactcagatgctgccatgatagaagacgataaaagtaaac gagagaatgttccattgacagttgtctactcgaatccatcaatttctgta tcaaagaacttttccactacgccacttgctgagaatcaagctggagcatt tgataagctggatgatgatgaatttgagaagatgacgtcagatgttgacg aaaaggagattctgaagttggcgccacgtcttcttaaagtagccaaaaag cacgcggcaatcgagaaatgtctgacacccagagagaacgaagtacttgc caagttcttctcaggaaagcagaagctggattcaaatgtattggctgtct tggattctgcattggataagattattgattatttgcaaaagaataattgt gcggttgatgaggagacaaaggctgttatgaagaagagggataagctgaa agcagcaatgatgaaagagttcttggtgtcaccccaatatcttccaaaaa catggactgcaaaattcaatgaatggaaatctgaagccgaaaagcaaaag aatggaatcaactggttccgtgttttcttctcgtatccaaagcacaaatc atttgaagatggaccagaagacacttttggaaaatttccgccgtagnaaa tcgtggtattctgantgggacttattctgggacccggagattgtaaaa >AW057390 acgaactgatctctgtgatcatatcataataatggctacaatttgtgagc ttgtccagttgccagttggaagtgaatgtggaaaatggacaattttgaag 
aaactcggagaaggcgcatttggtgcagtctatcttgtcagccaaaaaga aaaacccaaggtggaatacgcgttgaaagttgaagcagagtcggatccat tgggcttgctgaaaatggaagtggctgtgcttttggaagtgaaaaagcag aaaatcgttggacgccactttttggagttggctgacagaggaaacctgcc acaaaagttcaattacatggtgatgacgttggttggaaaaagtttgcagg atctccgcaaaactgctccattcaacaaattctcaatgggaaccgccatt tctgtagccagacaatcattggaagctgttgagga >AW057391 aattccctctgaacaacctcaaacgaaagacaacgatggcggacaagtcg gcttacatgggcgctggtggctatggatcgggctacatgggatccaatgc ttcttcgtcaggttatgcccgcgaagactatgcacaaggaggcaatggtg gtggtggacagcagcagaaccaaggatccggaggaaacaccaacccaggt gggcaggtcttcaaggcccgtactgaccagtcctgttaccttggatcata agaaaatcgacacaagaagagccagtcgccc >AW057392 tcaccgccgaacagctcaccgatcctccacaaatcccgacggccttgtcc aactcggtcaacactgcgattggtggaactccatccgactttgagtcgaa ctccgggctatctgacacctcggcaggatcgggccgcgccaactcggccg tttccgatacgaccacagcaatgtcggcgaacgtctccggagattattat gaatgatcttcgaaaaggaagcctgtttggccgacagatagggagtctgg ctaccacggatggagggccccagagtgttactatcattactttctcgaac actactaacatgtcaactgcttcccagcccaccagcctcgatgacaaatc gcaaaagtcgcagaaaactggatcaatgaagactggaattccgatgagat cgcctggatcttccatggctggcacaggtgcgatgtctcgtaaaaagtcg tcgcaaaagcagatggatgctctgaagatagagcaagtgccggctgctcc cgatctctcaatataattcaatacatctaagatatcgaagagtcgtaaag gc >AW057393 tcgatcaaatcaattcgaaaaaatcatgccgtctttaaaaggaggatgat gtaatgaaaaatgtaactttcgctgaaggcaaaaaatttggtgactggaa aatcggcaaaacgatcgatgaaggaggatttgggaaggtttacattgcaa catcaatcagcgatccaaagaaagtggctgctttgaaagccgaatcaaat gaaatcgaaggaggatctgcaatcaaattggaggcaatgatcctaaacaa actgaatgccaatggacccgttccccacattccagtcgtccactaatgcg caaaacgaaagctctactgctacatggtgatgacgttgttggggagaaat ctacgaaaactgaaatccacaaatctcgtagtcaacaatggattctcccg tggaacgtggagccgaatcggaattcaatgcctgtatgcattgaaatatg tgcatgacaatggatttattcatcgagatgtgaagccacaaaacttcttg ctaggaaatgagacggatagtgaaagagcaagaattgttcatatcttgga ctttggtcttgcgagacctttcgctgtttttcatgcccgagagaataagt ggatcgcacgtagagctcgtggaactgcagagnttcgtggaactctccgt tacacgtctccgaatgttcatctncgaaagtaacaaggacgggttgacga tgtatggtccctgctatatgtcatcattgagctcaacggtgataagctct 
tccatggcaaaccgattctcaacgtcgacgtgtggagcaaatgaagctga acttgccggcgaaggtngtctgtcaatatgccagcctgtttgataagtga tgcct >AW057394 agacaagtacgagtctcaaaactttctgtttcgagtggaaggagccatgt tctgcgcgggaattatcgttgctatgattatgctgttcgtcatcattatc tacggaataatcacttcaagtcaaactggaggacagctcaacagatacat ggccccactattc >AW057395 tggcctgtgctaaacccgcttccaaccacctattcaattcctgtacgatc atccagactcctcatttcgaatgatggaattctagaagattgttgcttta aagtctgaagaggaaaagctcccatcactattcgaaaatgttgagggact gttgtctgtcccatctttaacttttggtacgtgggatgatgacaccctgt ctggtgtcacatctgttaatctagataagtctgatgaacaactctgcgag cgtgatgatgactacaccactgattggagtgctaat >AW057396 ttgtctgcaatggctgccgcccagaatacgctgccaaagcgaatgaagaa gaagatgcaggtgtagagaatggctcatgccacagcttgcaaggttgcaa agagagaggctcgcgtcgctgaggaagcatctggaaaatcaactggtgga tctactcgcggagccaagtgatagccgagccacaacacatg >AW057397 agattcaaacgccagagttggaagttccgtcaatcaacttggattcaagg gacatgggggaggacttggtccaagaatcattatggctggaactttgact gctcttcaatggttcatctatgattcgatcaaggttgcaatgaatcttcc tcgtccaccaccaccacaaatgccagaatctttgaagaagaagcttggaa ttccgggaaccactgaagttgctccagtcgctgaaaaagttgctgctcca gagaaaaactcaaaatgtgagaaacccagaaag >AW057399 tctatcgaatttattcaagaaaaagttcggaactttatgataatattgca tatcttcatcatccaagtggtgtcaccgttgtcgttcttcgaaatattcc tgaaagtgaagttgtcgaagtggattttggaacgacgaagaagcacggcg cagatcgttcaacaaatcaagtttctggaaaaggaaaaaaaggagctctt attctacaacccgactcaaaattgtgtacattcaaatgcaaagatggttc agagcctgtcctgagagcaggtcgtcgtggaactcttgttgagatgaacg atcgcctgaaaactacaccagattttattagaacagcacctgataatcag ggatttatcgcgattatcacctacggagccggagtacgtgaaactgaagg aatgggagatgaccttcctccgaaaagacttttctta >AW057401 taaatcatgttcaagtacgttctcctcgtcaatancttcatcgccctcat cgagatggcttcagccgatttttcgtgctatttctcggattccatctgca aatccatcacctgcagaaactgcaaagtcgccacctgtatcaccggagac tgcgtctgcaccctttgctaattttttgaataatttttttatcttttga >AW057402 tctgaagcncttcatcagatcgtctaaataactgccagttgaacaattag ttgatgactccgggatgctcatttcgcgttgcgtcgaga >AW057403 agctatggcttcctcgttcgacaaccaaatggatcaggatggaatgtgct ccgtgtactctgctcagccatcggagacaaattgctccatcaatgaggtg 
cttgccaaagaaattatcgctgtcaatgagacgccagatgatcaagctga ttcttcgatctacccaatcccaaaatcagaaacaaatgtgtcagctagcg aagggttccagccatgtcaagatatcaatcaattcaatttgtccggttac tctgctccaaaatccgagaccccagtgaccatgaatgagaagttcgagcg gtgcanagacttgatgaacgttcttgactactctgtctactcaatgccac catcagaagcaaatgttacaatgaatgttgcaagcttctcggagtacact gcccttgcctcggagaccaacgtcacaatggctgatgttctcaagaacgt tgctcaggatttggcgtcagagcacactgcaaaatcagctcatccaacat tcgacaccaccgcctacgtagagcgccttcaagccgagnctcggattccg gatagcaaagtgattggattggagtgcagcaatttttcgaatgcgaagat cattgattcaattgaatgccttcatcagttggacanattcaaaccaatcc ngtggattcgatcagaatnctgattttggaaaaactgcccaatctgtgga aagctacctnnctgcgtcatncaatgcagntcatncatcacagatctga >AW057404 agcttccggtgttcgtcaggtctccaanctactcgattaaaatttatgga gcaattcaacatgacttcggcatgtctgaagatgaatcagtccaatatcc ttctccgatttctccatcctcggattcttcaagtgaaacaatcacccatg tctcccctggatatctccttcaggaagtcttcttgcagagagcagtcatt gaagaacaaataagattattgatggttttcatgaaattcagaagctcaa >AW057405 ctatccgacagcgtcttttgatggagcagagtctgtcgatatttctgtgg actaaggaaaaagtccagtgagcacacacttgtcgttggcatcaaatcca actgaattcaagttcatggatgaaattgcacttttgaagcgtggccgcat ctacaaagacgctccaaagcatccgtacaatcgtcgtggccagcaaccaa tgatgaagaaaggtatattgtgtgacttgggtaatttcatcagcttcttg cctgctagaagtccatctatcatgacgttgatgggcggagttccagaggc cgaaaaagag >AW057407 aaaaataagtctacaaatgatcaacatttatccaccatccggcgactacc cagcttctggtggttcatcaactcactacattgtctccgaatcggaatct cgtttggcattcaaagtcaagtcgtccaacaatgaatcgtatcgtgttcg cccagtctatggattcgttgatgcgaagggaaaggctaagctcgaagtga atcgtttggctggaccagcgaaggaggacaagcttgtcattcaatacgcc gaagttccagctgatgagaccgatccgaangctcccgttgggggcttgtg ctcaacaaggagaagtcgttgtcaagatggttgctagctaagaa >AW057408 ctgccgaagtactgtctacaatacgaagattgattactctgatatgaaag attccgttgagctacttcgtcaggtgcttgtctacaatccgagcagacga ctctgtggaatagaatttctcacgaatccattcttcaccgtgttgttcaa cgagaagactgtgcgtttcaataaaaagaagatccaatgcgtgtcagctg tcgatctacaagctgtgaaatcgggagacgtcacactgacaaatgagtct gtagagcactccgacctaatc >AW057409 aggtgatgtctacaagaaggcagtgcagttctactcaaacatcacggctc 
caagatcaacatctgtgcttgctccagtgatgtcgtcccttgaagtctac attaacaccaccacgacatctgcttttgctccagctcagagcattcaagn gggctgatattcttgaagaagatgcagatgcaatccgtgtgaagtcaatt agaatggctggattcattgcccaatgcatcatctttttatttgtctacac catcgtcacgatggatgttgagatttggaaaattaatatggactggttga aaattcaatattttcagcatttcgaagactccgctgctgaagttccggtc ttcaa >AW057410 gactaccgtggcaaaagtcggatctaaaggttctaggatctacaaagaag acgattcaagctgtccgaccaacggtccgcacccaaggagcggttactcg ctctcaggctgctcttcgtggacatatggggatcactgactcttcgactt cgaccagttcatctcgaattccgaaggagaagttgaaaaagaaggcatca tctcgtagccgttcaagatcccgttcgaaatctactcgccgctcacgttc aaagtctactcgctcacgttccagatcgagatctcgaagccggagtagca ctcgtggaaagaagcgtgctccaaagaaggcagttaccacgaaagccgct cgatctatctctcccgtcaaagtgaaaaagacagaagccatcaaatcgcg tggaagcagcaaaaccgcccgtcgtgtgtctgcggctcacaagtaaataa tcgtcgcttcttgatggc >AW057411 tcgactaaccgtctccacttttcacttgcacaaatcttcatgcaaccaat caacgtcatgctcgctgttcttctcgccttggcttcatttgctcaaggag gcagatctgttgctccggctggtgcagtcactgaaccaacagttactcaa gctgttccagaaggatcaggacttagttcagatgtcactgatcgtccaaa catcgactccactgatgttgtatcaaatgcaacttcggtggaagatttgc ttggaagttcaacaaatgcaaacaacactggtacattcaaactcttaaga cctttgtatttgctccaatgatgattcttgctttggtgc >AW057412 ttcgaagaagcccaacaactaaccccaacaataatgtcttgcgttcaaca acaacgacgctccaccggactccgcatcgccgagcgattgaacaagtaca tcactcttccaaatcgccacagcttcacagtggattccaaggatgtgttc caacgtggtcaggtgctcagctacatccgatccaaagcacccttcctgct cgaacatatctccgaggcaaaggagcgattgataaccgtgacgtcacgcg gtttgatgatcatttacgagaatgacgaccacggatttggtgattgattt gcgatcggccaggaatgttctgtgcactgctgatcgttcgaaaaagcagc gtcatttccgctgtcacatcaaaatccgcatgcaacgtggcaatgttcac ctctttgtcggccataatgatgttcacaagtggacgtgcgcgattatgag agccgccggaaagtgcttgccgtctacagagccccgcgatgatggaagcc ttaatgtcgcgatgatgacggcggtggaggattctggaatctttgaagag atgtcgtctacatcgtcagcatcttatgactatgatgaggatgatgaagt cgacgaggtgaagcccactgccatcgagcaaataccacttccccatgtct ccgtnctgtctcttcgncagaaactcgagaaggagctcgtnctgaagcct aatgagcaagtgctcgccgagcaacaacacacagccaactgcgatgagcc aagctntgtctcttgaatcttctgcaccccaagagcagttntcagcgatg 
aaactgtagtctgctggagcccacgaatgccagatagaggagagaatacg aggcnca >AW057413 ttatcgaaacttgtgaagaaggtctgcacattctgccgcaaagaaattga tgccaaagcgaatgaagaagaagatgcaggtggagagaatggctcatgcc agagcttgcaaggttgcaaagagagaggctcgcgtcgctgaggaagcatc tggaaaatcaactggtggatctactcgcggagccaagtgatagccgagcc acaacacat >AW057414 tcccttctcgccatgtttgtggctcaggaagtcgccgaggaagctcttac ggatccagaagcagctgaagcggataatgccaaaaacaatgcaacggatg ctcccgctgatgctacaccgggatctggatcagatgctccagctgctcca gaaggatctggcgccgaagccgaagccaccacagcgaagagttctactgc tgcagtgaccatcattggagcaatcgccgtttttggagttgcccatctcc tctgagcattcttatcacttc >AW057415 ctattccctacctcgatcaaccatgggtttggacaaaggaaagcaacaaa ctgccaagaaacctggtttcaactgcaaaccgtttgagttcgaaatctct tcaacgaaatttcaaattcccaacgacaagccattgaagtacactttgaa atgcactgctgatgagaaacaggatgttatcattcaagttcattcggttt tcttcgaaattgttggtgcaagacgtaagcatggagtcacttctcaagag tttcatgttcttgggaaaaaccgacgaattgacctttcgaaattggtctg aaggatttgaccaatgtttcgtacgataatctcgagcatgcgcaacttgc gtctgccgctggttttatcacgttcacacactacaagtcttctagagatg atgattccgatgctttatggggtcccaacaagaacttgctcatcgtcatc actgacggaatgggagaagtaggattctggaagaagaattgtcgttgaaa ctgaaccgatgagagaagatgtgaagaagatttgttgtgaacttgaagag a 7.fa100644000766000024 22520014006427353 16007 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data/dbfa>Contig1 gcctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaagc ctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaagcct aagcctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaa gcctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaagc ctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaagcct aagcctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaa gcctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaagc ctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaagcct aagcctaagcctaagcctaagcctaagcctaaaaaattgagataagaaaa cattttactttttcaaaattgttttcatgctaaattcaaaacgttttttt tttagtgaagcttctagatatttggcgggtacctctaattttgcctgcct gccaacctatatgctcctgtgtttaggcctaatactaagcctaagcctaa gcctaatactaagcctaagcctaagactaagcctaatactaagcctaagc ctaagactaagcctaagactaagcctaagactaagcctaatactaagcct 
aagcctaagactaagcctaagcctaatactaagcctaagcctaagactaa gcctaatactaagcctaagcctaagactaagcctaagactaagcctaaga ctaagcctaatactaagcctaagcctaagactaagcctaagcctaaaaga atatggtagctacagaaacggtagtacactcttctgaaaatacaaaaaat ttgcaatttttatagctagggcactttttgtctgcccaaatataggcaac caaaaataattgccaagtttttaatgatttgttgcatattgaaaaaaaca tttttcgggttttttgaaatgaatatcgtagctacagaaacggttgtgca ctcatctgaaagtttgtttttcttgttttcttgcactttgtgcagaattc ttgattcttgattcttgcagaaatttgcaagaaaattcgcaagaaatttg tattaaaaactgttcaaaatttttggaaattagtttaaaaatctcacatt ttttttagaaaaattatttttaagaatttttcattttaggaatattgtta tttcagaaaatagctaaatgtgatttctgtaattttgcctgccaaattcg tgaaatgcaataaaaatctaatatccctcatcagtgcgatttccgaatca gtatatttttacgtaatagcttctttgacatcaataagtatttgcctata tgactttagacttgaaattggctattaatgccaatttcatgatatctagc cactttagtataattgtttttagtttttggcaaaactattgtctaaacag atattcgtgttttcaagaaatttttcatggtttttcttggtcttttcttg gtatttttttgacaaaaatttttgtttcttgattcttgcaaaaatttttc cgtttgacggccttgatgtgcactaccttcgcttaaatactacattttct gaaaatgttataatagtgttcattgtttcatacaaatacttatttaatag tatttctggttatataatttgtataaaaagtggttgacataacaaggctg acgaaactttgtgatggctgaaaatattttcctagctttattgattttta tttatacgtgtttgaataacttggccaaatcgccgagaaggaatagaata ctggacgacattgtacatattttccaaaaaatcagaaagtagatgacggg accaattctttctgtcaggttttacaaccgcccagtgcgtctacgtcaca tgttgtataaatggttgtaaacaatatgcggaaacaatcaaatgcattcc cataaggcataatatagaggctacaggcaatgagtatcgctctttgcttt gtttaaagggggagtagagtttgtggggaaatatatgtttctgactctaa ttttgcccctgataccgaatatcgatgtgaaaaaatttaaaaaaatttcc ctgattttatattaatttttaaaatccgaaaatccattggatgcctatat gtgagtttttaaacgcaaaattttcccggcagagacgccccgcccacgaa accgtgccgcacgtgtgggtttacgagctgaatattttccttctattttt atttgattttataccgattttcgtcgatttttctcattttttctcttttt tttggtgttttttattgaaaattttgtgattttcgtaaatttattcctat ttattaataaaaacaaaaacaattccattaaatatcccattttcagcgca aaatcgactggagactaggaaaatcgtctggagatagaacggatcaacaa gattattattatatcattaataatatttatcaattttcttctgagagtct cattgagactcttatttacgccaagaaataaatttaacattaaaattgtt catttttgaaaaaaaaataattaaaaaaacacattttttggaaaaaaaaa 
taaataaaaaaaattgtcctcgaggatcctccggagcgcgtcgaatcaat gtttccggaactctgaaaattaaatgtttgtatgattgtagaaccctttc gctattgagatttgataacttttaagtaataaaattttcgcagtaagaca ttaaaacatttcacaattaagctggttctgaactgtgtgaagtatattga aaaaaactaactgatacaaaaatataattttatgatagttttctggatgt cccaatataaacgatgtcaattctgcgacatgctacagtcatccacgaaa gtaacccgaataccgacaaaagaagaggaacgccaactttggatagacgc tctaggggctgattttggtcggaaaatagtcgggaaaaaatagaggacat tacagatgaggatgaggatgaagatagaaatttgccgacaacttcgtcat gccgctgatttttttgatgttctacgcttaaattttcagcgaacgaacta ttttttatattttgattgtttttaaataatatttgccataagaaattctc acttttccaggaaacgtcgtttcgccgcgattttcctcgtctccagtcga ttttgcgctgaaaatgggatatttaatggaattgtttttgtttttattaa taaataggaataaatttacgaaaatcacaaaattttcaataaaaaacacc aaaaaaaaagagaaaaaatgagaaaaatcgacgaaaatcggtataaaatc aaataaaaatagaaggaaaatattcagctcgtaaacccgcaagtgcggca cggtttcgtgggcggggcgtctctggcgggaaaattttgcgtttgaaaac tcacatataggcatccaatggattttcggattttcaaaattaatataaaa tcagggaaatttttttaaattttgtcacatcgatattcggtatcaggggc aaaattagagtcagaaacatatatttccccacaaactctactcccccttt aacaaccacccgaggatatattcgacaaacgatctatctactaggaataa ctcgattattgacatattatagacttcttttagtatttgtaaaatagagg atcagacccaaaattcagcccgcgaaggcatgacgtcagcgcgaggcagt agtttccagaagaactctgtcgtctaccttaatgcctcaaatgcgaaccc gcttcggccatccttctcgctcagagaatggattagagttctcatcaact cctctgtctaattttcaactgcggcggttggcgaccggtattaccgcggc gaccgacacctcccgggttccgtcgatcgctgtctgttgtgtgcgccgcg actccgcccaccggtggtaactttttgtgggggaatctttgtttttggtc atttttcagcgcttttcagcgattattgaccaattttgaataaaattttc aacagaatatcatctaaaatattgcttaacatttatttaacagaaataac gtgagcacgcatgtaaaacatgaaattttcgggaaaattgcaattaaacg aataaaaatcgatatttaaatcaattattggtgaatccggtgtgttgagg cttcaatgcatacatttttactggataaatctcctttgggaatccggttt gcagtgctttcgagaccatgtccagttgagaatcggcgaacgctttaaga agctcgggctgaataatgaattgttttaaaaaatgtttagtaaaaaattg ttttcgtgcaaattgtcttcgatattatccaaacgtgacgttttgcgatt ttcgcgctaaaattacagtaagtggggtctcgacacgacaatttttgtga aatacaaacgggcgtgtgtctttaagaagtactgtagtttaaaaacttca tttctgtggaattttcatatatttttcatagtttttctctttaaataaat 
cacttttcaacaaaaaactatgagacaatagtttgaaattacagtattct ttaaaggtgcacgcctgctcgaatttcgcaaaaacgtgtcgtgtcgagac cccaattacagtatttttgacccgaatatcgcgaaatttcgagtctgggt gaaaacattgaaatttttggcaaaataaaagaaatatgtcctttttcaga atatattttctaaatttcgagacgaaacaacaattttaaatgaattttaa ttttaaatattaaatatttcggaatttggcgttttttatgcatgtcgatt cactaacgattttgtactacacgtgggcaagtttatacagtttttggcta aaatttgtgaatttgaaccgtttttcggcgaatatttgaaaaattggcaa aactggttcaaaaacaaaaattttttaaactgtacaaactgtccaaaaat tcgtcgtaaatcgacacacccttctcattttttcaaaattttaattgttt tcgaatgttttttttgcagaataatttgtaaaatgagccttttgtgaatt ttttttaatttcaaagtttttattattttttctcaaaccagcacctctgt tctcgtccaactatgatcatcatcgtcgaataaccgtttctcgtgatttg tcacattatccttgagcacaatacatccaccaggtttcagtcctttctga aaatgaaaattaattttaaaaaaattgaattattttaaatgaaacagttt tcagagatttctcaacttttgagtccaccaccaggcctgcacgtttttcg ggttttatcttttaaaaaactgaaaaatcgaaaaatttcaatttctgttt tgtggtcaaaattgtaattacaggtaagcaaatagtttaattttaaaatt gaaaattagggaaatgaccggacataagtttaaaaacccgattttttcaa taaaaaggaaaattgaaaatttaataaaacaggttgtaaatcaaggagat cgtattgattgaaaaaaaatccgaatgttccggatttttcagtggttttt tttgaaagaaaatcgaaaaagtaaatgtttttaatttttaaatttaaatt tttaatcggaaaaaatgtacgaaattgactttttaatgtgaaaaattgtt gttttaaaaaaaaattttaaccgatacagattttctagactcagtttttt cggttgaatattgttttttactattttttcattacagaaagaatccaatt ttatttcgcttaaaaaataaccggagcatcgaaaatatttttttttctgt tttactcaaagcatttcaattacctagaattttgtttaaaaactacatgc tttatttatgaacgtaataaataagaccccctcttatttataaactttca acatattttcagttttcagtgctatctagtgcttaccgcacatcttttaa agaaatcaaccaaatcctcatcaaccaaatgccctgaaacccattgaatc catatcaaatcataacgtcgttcgggcggtgcaaacgtctgcagtccttc gacgaatttatctccaattcgtggatgttttccaatatattgatcacttt tcgtgatcaactcctcgacgacgtcttccatatcaactttcgagaagaat ggcattaagagatgctttgtaacacgtccgatacccgctccgcagtccag tgcatagtcaaagtagccgaatagattctggaaaatatttataaaattca aagttggcccaggggtgaccggcaatttcaagcaaatcggcaaattgtca attttctgaatttgccgaaaatttgacaaaaacgacaatttgccggttcg ccgaatttaccttttttaaatttaattttcaattcaggcaaactgacgat tttccgtttgccggatatcaatttgcaggaatttctcaaaggaattttta 
ttaagacggaaacacagtgcttttttgaattttttttcccgttttcttca gatatttttatagaatttactgacttttcagaatagatgtaggacaattt tgttgttttaaaaattgaaattctgaaatttccaacaaaaaaacatgtgc aaacccacaagttggcaaaaatattttgcatttgccgtttttcccgtttg ccgaaaagtctaatttcggtaattgggccatttttcgaaattttgagcca cataaaaaactttgaaccatttttgagaagtattattacgacattcgttt atttgagcacaatttgggcctatactttcaaaatcggggtttgaaaaccc ctatatgttcgaccgaatgttaatctcataaaaatttgatgaaaataaaa ttttctacggctcataaacgtatagcccccgtcagtctcaaaatttatac gatagacactttttggcgtttatcgcctatattccgtcaaaaaccattat tcatcattctttcaatgttgttttttttaaggctaaaaaactttcatgca aatttgttagccgtgtcgtggtttatacgaaaatttcagaatttataaaa taaaggaaaacgaaaatgtttctatataccctatttatgttctctgattc cgaataccaatgtgaaaaattcaaaaaaaattccctgattttatataaat ttttgtaagcgacaaaaattgtcgtttgaatttcacacttggttacaaaa atttatgaaaatgaggaaaatttgttttaattttttcacattgatattcg gaatcaggaaaataaatagggtctatagaaaaattccgaaccttcactcc ttctctgagtataataaatttaaaataaatacagaaaatttcagttcaga cctcattaaatttgggtatatttctaggatccgagtttttacaccagatt tacaaacttttagcctttcaccgcctttttatgcgcatttcccatcagtc aactccaaaaaaatcgcaacttttgcctcatatttcaagaatattcccct ttctctccccattgaaagtcattttcgaaacaagcggaagattcgtcata tgtggtaatgtgtggcgtgcgttggcaaacaacaagaaagaatcattctc tgaaaacaaaaagcgttttgggtgccaaagtaatattgaaaatctgccgt gttttctcattttccatcaaaagaaaatgagaaaaaagtttcggcgtttt atttgatttccgggaaagaagactcggaaaaagatttaattgaatttttc atagcaaacctatattgcaacaactttctaaaaggtcagaaattgccgcg tagcctagaaaattggaaaactcttccagctggtattatttcagacatgg tgcatcgaaaattcgaaaattacagaaattaacattttggagcatctgcc agaaaattgagattacagtaccccacttctgccgagaaattcgaggtgga agaggtcttacaaaattttcggtcacgtgaaaatgggaagcgttcaggct ccacacgacggaattcacctagttttcaggtgagaagatatcgtacgagg agaattgacctccaaatcctgatcgtgactacaggtcgtcgttcggagct gtggaagagttttgaaaatcttcgaccatgagagaaatagacaggacgac caaacattttcagtggaagagcttttcctaggccatcaggatgctatttc gacagagctgagttatcctcaagtcgttacgaggtgtggaaaagttttcc aaaatccccgaccaggtagaaaatgagcacaccgattaagtttctccagt ggtagagttttttctaggccatcattatgctatctagaaaaaagcttcgg ccatggggtttttaggccgtctatttatttctcataactttctcagaaat 
tcgtctatttctcagaaccccccaatgataagttttttgcaaaaaaagtt ctgctttgctcatcagccgtaatcaggtgacctcattaggcctacccaaa cacagatttgtcattatttttcagacaaaaaacacgaaaaaaatcttcac gcatggggtgataacctgattttaaatcttactgtgccggctggcgcggc gagcttcgatcactgagccgaaagattatgaaaactatgggaatgacggc gtagcctagaaatcgtccaggcagagattctgtctaattttcgagcatat atctcccagttttgttattaatttaagtaaactcaaacctagaaacaagt aaaacgggagggggggggggaatatcagaaaattaaatcttgcgacactt ttccattgatactttcaaggtaatgcccagaggtgtgcggcaaattttga aacttgcgcatgccgcctttttttttttctagaaaacagtcagaattttt tgtcgaatttgttgaaaattcgctaatatactgtgagtttagaaaaaata acgaaaaaactcggaaaaggaggaagagatctgaaatatgtagatttttt tagaaaagaccagaaattactgaaaaattggcatttttcgtcgaaacccc aatatactaaattattcggatttttagaaaattttcaaattcaccataca gtgcattttttcctacttctacgactttaaaggggggagcatttatgcgg aagggtcttgccgcgcatttagtcatcatttttagcagtttctgtgtaaa attcgcgtagatcacatgaagatcacaaaatatttatcccatatttcgta tttctgttgctttttcacaaattaattgtgatctacgcgtgatctccgcg aattttgagcagactttgttaaaaatgatgactatgtgcacggcaagacc cttccgcataaatgcgcccccctttaaagtcgtagaagtggaaaaaaaat gcactgtagcaaaaaatcgaacatttctgttcgatttttgaatttctcga aattttttaaaataatttttaaaataacatttttattttatttcgaaaac taccgattttagaaaaattctaaaatttcgattttttttgttgatttttc gatttttaaaataaaatttcataattttttaaaccgatctttcttgcttt tcctgaaaaatcgatgatttctatacctttttcttcagtccttcaataaa tcgtttcgacgccgatatgtcgggcgcgtgaagcgcttcgaatccgccga gcattccgttgacgtcctggctcgcgcggctccagtattcctccgcctga aagagaatagttgaaaacattgttttgagacttaaaaattttttttttag tttttttcaaaaattcttacatgttatagagtttttttttcaaattttca gcttttttcagaaaaacttagtattttcgataattttaaataaaaaagtt ttttttcaaaaaatgtttcggttttttttttaatttttggtctaaaattc tccgcaaaagatttgcgtgctggccgaactttttgattttgtaccttttc ataaacatcttcaccattgtgaattctagaagatgatgaagagctcattt ttgatgttgtgacagctgctccgagcaatctggagacttttgtgacgaaa agacgagaggtcacggatatgatgatgatactggaaatgagatatttata tttactagttcatcgggaaaattattacgagaaagataaacagacatgtg cgtttttttaatggaagagaaacacaagaaaaatctggaaaactaggcca cggctatcagtgtcgatttacggcatacggtctcgacacgactatttttg ttaaatgtgaaggtatgcacctttaaagagtactgtagtttgtaactctc 
attgctgcaacatatttgacgctcagcgaaaactacagcaattcttcaaa agactactgtagcctttgtgttgacttacgggctcgattctcgaaacgaa tttctgctcgaattgtgacagccatattcaatttggtatagtcttttcgt attttttgccatttttctgttttcttctaatatttaatctattattaaat tatgtccgtaactccctccaaaattagaactgcgaccgaacagagattcg ttccgccccatattccggccaatcagatcgagtaggcggagttcgaagtc gctgattggtttgaaaagtcgcggaaatttgcaagttttaaggtagcgaa aactgatgactattgtagcgcgcttgtgtcgatttacggaatctcgattt tcaggaatgaatttttaattacattttttcgctcaattaatattctaaat aaataaataaatgatttgaattaatttaatttcattcgagcccgtagatc gacacatgtgctacagtaatcattagttttcgctacgagatattttgcgc gtaaaatattttcccgtaataactctactccgacaaacattacgacctcc atggaggcctccaggtataggtgagactcttgtatttccaattcagagac aatgcgtcactggaagagaaaacgaagcggaaaaaaaaacacggaaaccc aaaaatagtgtttgccccgctctattcttctccaataatttctgtgtcta attttgaaagactccacctgtgtatgccttctcgacataaaccccccccc ccccccctatcttacatggtactgataacactttcagtctttcacacttt tggcgcgcaacgccgctcttttttcgcggcgagctgatgacgtcatcaat ttttcatcgcttttgattatcttcaatgttctagaagggcacataggtca tccttattttttccttctctttctcgtgacggcccttgttgcgcatgccc gccccctagagcagggcgtggcctgaacggcggctccgagagctactcat tcttgccgcgtcaccctccagcgccacccaaacttcttcggttctagaga tcgagaagaacgtatgattttttaaaattataattgtttctttcgaaaaa aaaaatttcatttacagtaagccaaacatacacaatcaacatgaaactcg taattctgctatcttttgttgcgacagttgcggtttttggtgagtttatg ctttagataatacttttccgccaaaaatacagttgccggtctcggtatgg caatatttttgttaaattcgaaaagcagtgagtaatgtagtttcgaattt tcgtttctgcttaattttcatcaattcatcgtttttctcacgacttcttc tttatgaaaaatcaatgaaaattctgactaggtcagcttaggggtgaggt acctagagacgccacatatgccaaacggaagctgagatcattggctacaa gaatatgctttcaaattctgcaacggacctctgggagtctggaaattctt gtctgaaattatgcttttgaatgctcgaaagtggtaagaatttagaattt attacagaaaaacgtttaattaataaaattagttttatacttgaaacaag tactgtatgcactgtatcaaaacacattttcatcttttctaggtattcaa cttcacgtttttctgtaataaattctaaattcttaccactttcgagcatt caaaagcataatttcagacaagaatttccagactcccagaggtccgttgc agaatttgaaagcatattcttgtagccaatgatctcagcttccgtttggc atatgtggcgtctctaggtacctcacccctaagctgaccattccctagtg agcaaacaaaattttgaaattacagtactatttaaaggcacattgatttt 
ttgggtcaagcaaaaatttgtcgtgtcgagaccggctacggtattttcgc gaaaaatcgcaaaatcttgcggctgggatatacttgtgcgaaatactttt tgcattaattttgagcaaaattattttttttagactttttgaaatccaaa ttttttggattgcgaaaaaaacctgtgtccggttgtttcattaggccaac aaagttcctggaacactgatgaaaaccatgatagaggcggagcataatat cgatttttcgtactttcctgtatttcttcttctatatggccgagtagaac aggattaggggtaaagtcaaaatttttctcatatggatatcatatggata tcaaaatttttctcatatggatatggagaaaatttttctcatatggactt tgaaagttgaatcacttgacatctgggaaattagtattccaggcgtaagt cggatctgttagaaacggaatacttataggcttcgtgaattaggtagact ttcaattaatctgatccatgggagtcagacgcggtttccaggcctgacgc ctgcctccaacttgcccgcctcacgccggtctctcgcctcatttctgcac tgtgacgagacagacgaaggtcgccttctggcgcccgcatggaaatccta cgaatatgtcagcttctgatgggactccgtaaatcgacacacaggggtac ctcagacatttccctcccccttacaaattgttaggacaaggagggggaat tcatctccactcgagacacacatatgttgtcgtcagtgaagtgtaaagat ctaaacgattgcgtgtatgaaaaagcactctatgatcacctttttcatct tcctacaccctttttaggtgtggtgcccatcgagcactcacgccaggcag ggagagcaccggtccctgactaatgggattcgaatgttttagaccggaaa taggagcgatgaaagagcatagaaatgatcatttggaaatcacgtttaat taggttacggcgaaaatttgcaaaaaagagcaggaaacttggctcaaatc cttcgaaatataacaactaggacttccatgtaggcgttaaagcgccctgt ctctcaccccaatccgtaccttaagctgaaacaaacgtgaacttttttca tttcttaaaggagtatcgtcaatgggaaaattgttttaaaatgtagtatt tgtacttcaacttccaattattgcaaaagaaaaacggaaaaaatccgtta acattcagcattttaagtcgaagaaatctttaaaatttaactagagaaat cctaggccacgacgctcattcgaattttaatttgttttgatattgtattt tgaaaaaaaaacttaatacaattccttcttcccagttttctataactttt tgagaaaaaaacgaattaaattccgaaaaaactacatttaaatcaatatt ttgtttacgaatatggcctagaaatcgcgtggtggcctaggattcatttg cgcgcgaaattcaaattccgtcactttcgtcgatttcaacggctaaatgc tgaatgtcaacggatttttcccgtttttcttttgcaataattagaagttt gagtacaaatactacattttaaaacaattttatttttggtattttgacga aaaattgatttattggtttttttggttgtttgggaccaaaaaatccaaaa aaaatgtttggcgtgtctagtttcgactcgagactattctgtattaaaaa tacattaaaacatgtattttaacacagttgtgacgtcataaatgtatttt gatacattttgcaacattacttaaataaccccattaaaaattaacctaag catcaaaaattttttggtttttttggtttttcgaaaatttcaattttttt tgttttttggttttttttggtttttcaaaaacttcaattttttgtttttt 
ggtccaacatttttttttggtctcagctctgctgcctaccctagaagaac taatagcgcttcaaaaactgatgaaaacgttcaaatttgtcgaaatatta cgaaaatttgaaaagttggctcaaatctagattgaatcggccgattttcc acaagtttccaagtttccacaagtcgccacatatcccgagaaaaatcgat tcaaattgtttgaaaattggaatactgcgaattttgaaccaaatttccct ggcttctctgttgaaatacttgaaaataccgcgaagcaaacaaaaaatct aattattacgtgaacacaaaattctgaaaatgcgtatatattgcgcaaca tatttgacgcgcaaaatatctcgtagcgaaaactacattaattctttaaa tgacacgctgtatgtggtgatttacgggctcaaaaaattattttcgaaaa tcaagcccgtaaatccacacgtagtaattatataaagaattactgtagtt ttcgctacgagatattttgcgcgccaagtatgttgcgcaatacgcaaccc catatgttgatatatactgatgtgaggataaaaaacaacacaactttcag cggctccatcggctccggcaggtctcgaggagaagctgcgtgctcttcag gagcaactgtacagtctggagaaagagaacggagttgatgtgaagcaaaa ggagcaaccagcagcagccgacacattccttggatttgttccacagaaga gaatggtcgcgtggcagccgatgaagcggtcgatgatcaatgaggattct agagctccatgtaagttagtggtggtggccggaaaagagaaaactcggcc aagctgctcggagtttttgaatttttgataatccgaaataaaaattgatt gctcgaaaaggaacaatcttttggaaaaaaacgaattttgtcattttttt cagcaaaaattgattttcgaatttttccaataaaaaatcgataatttctc cccgtgcagtggaaaacaaacaatatttttttgttgatcgttctcttcca aacccggaataggtacacacattcctgcgtcatcccattctcttatcaca cttttttttcgaaaataaaagtgtagagacggaaaagtgagaaaggagtc aattttatgcgaaattttgcatgataatacactcaaattaaaaaaactgc gtggcgtgcactgcagaaaacctcatatttaggccccgcctttttctcgt ccactcacggagaaaaggcaaaaatttggggaccaaccaatatcaggccg ccgacatcctacgggttccgcgcgccgctatgtttaactcgctgtgggtg tggcgagctgtctccgcccgctgcgagttaaacatagcggcgcgcggaac ccgtaggaagtcggcggcctgatattgttggtccccaaattttttccttt tctccgtgagtggacgagaaaaaggcggggcctaattatgaggttttctg cagtacacgccacgcagtttttttattttgagtgtataggtctcgattct cgaaagtatgacagttatttaaatgatgaactcgtgatgactgttaaatt tttggaaatttcgggggaattatatcgatttttcgataaatttacaggaa aaaagtccaaaatctaggtattccatggtaggcaggcgcgatttcttgac gcctgcctggaatctgtccgcctcacaccaaaaaatgtcaatcattttgc tgaaaaccaaattaagaaatgaaaaagtgcacttagagatgatgacggag gtcgccttaaggtcagacaggttaaaaaaccgattttagttgagttttcc cgaaattttctgaacaaccgaattagaaatatgctgcttgtcatttttga gtaaaaattaacgaaaacttcgaccaaaaccacgaaaaaaatgaagaaaa 
taaagatttttcgagaaaataacaacaaaatccagcaaatagtgaaaaat agttttatccgagaaaaagtagtttagacgctatgaactctcgaaaatca gattttttcaatctaaaagccataaaattatcgattttttaaaaattctc actgaaaaccggcgaatttcagtgctccacgcaatcgaagcccgcttggc cgaagtgttgagagccggagaacgcctcggagtcaacccggaggaagttt tggcggatcttcgtgctcgtaatcaattccaataaatattctttgcccta aatactttaaattatccatctgacaactaaaatttcggttcttcttggct tcttctatttgtgaaatggtttattttcccccgaactctcaaaaggttta aatattgttcgattacccctttttatcaattattttcttcaatttcttat ttatcattatttttctaaacgaagacggatgtgattttaaattatgttaa tggactattttacaaactgaataaattcagcatgttggcaggttttttca gtagtttttgagtgaaaatagaggtaaaaagacagaaaatcaataaaaaa tgaaaacaaaactatgaaaaatggttgaaaatcgagcaaaaatcgttcaa aaaaaaataaattcaaaaaataattgcgtcgagaaacgcgtcagtagccg ctctctgcgtctctcacccttcagcacgcggagagagccacgagaaatgc gcaaaggctaaattcggcgcggaaaatcatttttcaaaataaattcgacg agaaaatcaatacttaagtaattatcgattttcagctcgttcaaaaaatt ttcagaaacgttttagtcgtttaaaggtttttttaaaattaaaatcgtcg gaagtaaaaaaatagcgcggatggaaatctacggagtgcggagcgaacaa acgcgcggtaattcaaatgggtagaatagtcaaaattgaaaattagccag catcgaccgatttttttaaaacttaatggattttttcgtttttcttttgt ggtatttcggcatttaggattagatagcacattttaaagtaaaattccca tccaagctactccaccttctccagactgtacagttaaaccaatttgaaaa gtgtattgtatcccgtttttttttctgaacaattttgaaaatttttcgtt tatccaggatacgataatcatgattcaaattcgttaacaaaaaatgaata tatgagagcgattaaagcatttgtgtcggaaaatatgggttaaatgggga gaagggggcggacatttggatggggtacaaaaaaatatgcaaaaaatggg ctaaaaacaatattttcaaattatgcccgacaaaggttcaaaagtcaata tatagaaatgagaacatgagtattatgccacgtggcgggaaaaatatgtg gaatgtaatacgatgagatccttgtgaatacaaagcttgtgacgacgtgg ccgagaagaactttttaagccaacgagaaaaaaggggttcaaggccgaaa ttttttttgggccacctattaagttaaattgaaaatttaaaaaaaacaca gcggatccaattatttgccgagttttgacttgagctcggcgcgatacgtg tcgattgactgaaaatattgtttttttttatttccgaataaaaaatggtg agtacctccaaaattagcttttcattgtccatatagaactttttgatttg ttccacagtttttgtggccatcaactcggcgatcaactcgaaattgtcct tgtaccagtggaaacctgaaggaatttcggatgtttttgcttaatcataa tcataataatcttaatcataagacttggaaaatgcgaaatttttcgagaa tattcaatttatcttcagattttattgcaacaaatcgattttcaacataa 
aattaatttttccaactttttttcccaatttatgagagtttaaagattgt tttaaagcaaaccgccaactttacataaaaaattaaaatattgtgaaaaa aatgatgaaatttagcagattttctgataaaaaattgaatttttttggat tcgcgcttcaatttcacattgttcttttagaaaagtcgaaattttatatt tccaattttcagatttaaaaaaatttaaaaaggaatgaacttttccaaag aaaaactgaatataaccagaaattgtgatttttcagcatttttttttagg tttgaatttttttttcatgattaatcacgtgaaaagtcaattttaccgca aaacatttaaaaaatcaagatttttcaattttctctgaattcctgcagat ttttcgatgaaaaattgaattttccttggaatttatatttttcgggtatt taaagtttcggatattaaaaaaaattttcaattttctctgaagttatcga taaaaattattttctgcaaaaaatctactttttttcgttgaatattccgg aaaaaaaatcagaatttcaaggcacatttccttttctaatctaattcgaa taattcaatattcttttaaaaattcggggtagaaaaggaattgtaccaat ttttatttttaaaagttaatttttctaattttcaaaattttcttgaattt tcgaattacagattttcaaaaaaattttttttgtttttttttctcgaaaa tttgaaatccatacatctaatagcattcttcttttcctcaggactccaac cataatttatcctgacttttccagatcgattgccatttgttgcagtagta tctagttcaggagtaaatctctcgaatcttcccttcaacgccatcatatc tttcttccaatttgcaatttctccttttggtacacggctgtatgtcattg ttgcacggaacatttgttgacgggcttcttcattcagaattctggaaaaa ttgatgttgtgcgattttttttggttaaaaaaaacaattttcgtaagttt aattaactaatattttaaaaaatctctcattttctgaggcaccacggatt caagatctggtgggattccggatctggcaccgtgccaacgcattaaatgc aatttttctgaaaaaagggcaacgaagatccgatttaaaaaaatttttca attatttttcaaaattttcactaactataagaaattagagatttttcaca aaaattccagttttctgttagaatttgaaaaaaaaattgaatttttccta aaaaatttgtaattttccgatatttcaagctgtcaaaacctaaaatctga aaactgaatttttaaaggaaaaattttgagcattcttatcaaaaaattgt ttcaactttttctcaaaatgtttcaacctttttctttctaaattctgaaa agcatatctcagcttttgctaaactatttttttcctcaatttttgagaaa attaaaatataatatataatatagtaaatattgcttattttctaataatt tttggtatttctattctttcgttttttttttcaaaaattccaaatagttt taaatgttcatattattttttttgacgaaaataaattttaattttaaacc ggaaaattgtttcgtaactttttttttcaaaaaatttgaattttcgacat gaaagatgtaaagtgtaatttaaaaataatagtgcaggtattttcagttt acagcaaaagtcagtttaaaaaatttcgactggttttcaaaatgagtttc cttattttttacacgtagaactttttttattttccgattttttttgttgc gcagaaattttttttccgcaaaatcaggaaaaattcagaaaaagacagtc aaaaaattgtagatacaattttttgactgtctttttctgaatttttcctg 
attttgcggaaaaaaaaatttattttttcatgaataaaaatcgaataccc atccaattccacaaacttactcgttctcctccatacatttcgtttgttta actctccaaacaagtggaacacacatatgatgttttctcttgatattatc aattaatgccagtgcagccggtgtatcgaagcaccgtgtcattctgcacg tattctcatcgattggatcagcttcaatcgattgctccacaatgtagggg cctgatggtttacggagaaggcagtcgtctggagaaaaatagaatagaat aatgatttttaggttattttacgtttaaaaatctaatttttaagacgcgt aaacgttgagctcatttataaaaattcggcaaaccggcaatttgccgaaa aatttcggaaaattgtcggtttgcacattttttcttgaaatttcagaact tcgatttcaaacggcaaaattgtatacatcctatcaaaacatcaatcttg aaaagccagtaaactctatgaaaatgtctaaagaaaagaaaacggtaaaa aaatacagttttaaatgtttccgtcttattaataacaaaattcgacaatt tgccggaattgaaatttttttttctccaatttccgaaaaaaacccaccga ccaccataatatcatcgtcttcttctttttcttttccaattccaagccgt ttgatcgcttttccgttggctggctccatgagctcaagatatccgtatac ataaattttcatgtctgaaagaaaattcaaatttcttctggaatcagtta ttcgaaactaacattctggacataaaactcgttgccgtcgttttgtcagt gcacggaggcttgccggacgtggaacacgcatcaaacggaaataaaggat acacggtttacattcgtgacgcgacattacacgatttagcttaaaattgt gaaattaattttttttaatagctctttatttttttgaaaatttctcccat gctttttccattttttcaacgagtttccttattttttgtccatttactgt aagttttttttgagaatttttttttgttaatttaacattttattagctca aaacatttattagcaaaaattttattagcaaaaaaattttttaatttttt taaattagctcaaaattctcgaaattttaaatttttagggtaaacaatat aaaacttagggagttttgagctataaaatgataaattgattttaaaaagg atgaaaaacttattttaaaaaaccgacaaaaatcgacaaaaatgaaggga acaggcagcagcttagccccatgcttagccagcagccccgtagcaaccca gtatcaataatatcccgtgccaattttcataaaactgaatataaattggg ttgatgttgctaaagggctgcgaaaaactgacctgggatgaagctgggct gcaaggggctgcgaagtgctgcgagggcaaagcgctacagtgctaaaagg gggctgagcccagaccctcaggaaaaaactcatactcgcagcccttcgca gcccacatttgcgctctgatcgcgtgctatccgcgcgcacagaatttcga aagtattttccaaattcggaatgcgcgcggagcagacgcaattagagcgc ggatctggcacgtaaggaagaagtgtgactggagcacgaaccagtaatct agtcgcgccccgtccgcgctccaggaggagcgatttgccgagcagttcag cccttcgcagccctttagcaacaaccaaatttatacagttttatgaaaat tggaacgggatattattgatacgcctaagcagccctattaaatagtgatg agggcgtaaatgaaattcgccatttccagctaaaatataaattttttgaa ttttttaacattgatattcggaatggattcagcagaaaatttgaagtcat 
ttgaaaatattttccagatttcggtactccacttttaaaattgaataaaa ctgtagtctttattcaatgtttcttcaaaatttaaaaagtagaatataac tgtgagaaaatttccaaaattgtcaaaatttcaaatagctgaaatatttc acggcccggcggggggtacatggatgagaattctctaccgtattccaatt tggctgactgcgtgctcaacgttgaatactcagtgtaaactttcgtacac cgttgcgtactgcacagcgcgcattttaattgacgacatttagcaaaaat tgaacataagatttttcggaattatgaagctcaattttcacaaaaataat gagttttttgtagaatttatgaaaaaacgtgaatatatagattttttgtt catgatattcaagaaaaagcgatttttagttcttcacagaggaatcctct cgcatttcacttgctcatgatgttttttgctccactttaggacgataaaa atgcgaattgttgataaaatgaatgaataatataaaaagtgcaaatatga cttcagcaagtgttaaatcccaaatttttcctgcgattttctgctagatt cctggttttgagtaaacagtctgatatattcatgattataatgataacaa taacgaacataataataaaaatggagagcacagagaaacaacaaattgca aaaacagcaactgatatcagaattaacgacgaccacggaaaccgcctcgg tctccacctcgcccaccacggaagccaccacctctgtcgcgtcctctgaa tcctcctcgatctccaccgaatccacctctaaatcctccatcgcggtctt ctgatctaccacggaagcctccacctccaccaggatctgttgaaagtcct ctgaagcctcctcgatcgccacctccacggaagccaccacgatccgcgga ttttcctctatagccttcgaggcttcagttgtaccccattcttcgttggc acgcttcagatctctacaaaaaaaacaaattagaagcattcaattatcga aatgtgtacctatcccgatttatcgcaatctgtctattcttctccttctg attctcaacttctttaacttgtccagtagcggcagcttgcttacgagcag cattttcccgaatcgccttcacctctgcctcctcagcatcctgttgctcc ttgacaatcgtaagtcttcgaatgacacgttgctcactctcctgctcacg acgctttttcatctgcttcttcttgtttatagtcaccgcattatgcttgt gatagagaacctctccctcatcgatttcttcttcaattttgacgagttcc agggtcagtcgggtccgatctcacgaagacggacgttgctattctggcca attccgcagtcacgtccttcataaatgtcttgtggaagttcttcttgctg agggggctgctgaaaccaatgtcggcatgatgagagttccggtcttctga atccatttcctgcgtgggctgtggcgacgagctgcacgtctgaaaatcaa gtttttgtaatttttgggcgcatgatatggagctgaatcattcgatttta gaatcagcatgcttttattcatattttaggatctttttaaaaaatctgga ccaacagttttcgaaaaaatttaatttttgttcagaaatgtgaatattca ctaaatcgaaaaaaataattgcaaaatccgtcagctgaacattcaaaact tatcaatttgaaatcagcatatttcagtgtataattaaaaaagtttcaaa aattctgagaccaatttttattgagaaaaataatttttcgctcgaattat tgaattttcactaaatgcaaaaaacagtaaacttgggcccatgctacaag cctgaatctttcaaattaagaaccagcatgattttttcaatattctagga 
cgtttaaaaaaaatctggaccaacagtttttgaggaacgtaattttttat acaaaaatgttctgatttttcactaaactcaaaaaaatagtcaagttggg cccatgctgtacacctaaatcattaaaattcagaaccgccatgtattttt tcttaccaaaggctctttaaaaaaaatctggaccaacagtttttgagata tttagaaaaacaactcacttttcgacgtttttcgccttttcgtggctcac ccggttgatttttgcggcgatttgtggtctttcgctgaaaatattatttt tatttcaattattaacgaagaaaacaagaaaaaacgacgagaaaacatca aaaaaacgcgaaaaaacatcgaaaaaccaccgcaacctcatgaacaaaaa aaaagcattgcagccgcgggactagttttcgcaactttctaggccatgtc ccgttcgccgtgccgtgtatttgtttaattccctttttggaaaaagtcaa catatttttctaacaaatcgtttttctattaatttttttctaaaactcac aatcaacagatcactttttgcattgcaattctcacaatatcccgacggaa ccctctccaaatgattgacctctttgaatagttcatcataagtgtcggtt tcattcaaatgcacattaatcattgttttatagttttgcacttttttcgt gttgtaatagtattggataatggaagaaagcgagcgttggggcatctgca aaaaataatgaaatttattttctttttatgattaaattaaattttcaaaa attccctttttttgacatatgcacttacagccgcatgaatcttcttgaac cgttttccgaaatgaaagaagcaagtggagaaaagactaatttcttctgc cgtccaatcatcatgaatttcttttcttctcatcgcttgaaccatcgcag cgtcgaaatcatttgactgtttgttcagaatgaacagagcctgtaaaagc agttagtttttttttcaaattcaaagtacatttccgaaaaataaaaaaaa ggcttgattttttaaaatctcgaatttttattatggtcaattgttatttt ttccagagaaaaactcattttctcccaattttcagacgtttctctctaaa tttggtgtttttccaatcgtaccctatctataggtaattgatatcgtcca gtagcttctgaaatgtattctgtaagccgattctcgttcatttcgtctgg aaacgcccaaatttgttgatctctgcacggttctttttccaattgctctg cagttggctgtataatcgcctgatattcggttcccacgtggattagattg tcgacgttggaaagtggatttgctggaagaaattgggaatttttcaaggt tttaagtggattttcaagctatttataaaagcatgaaaaagctcagaaat gactataaaacctttttttacgtcgtatttttttcaatgaaattacctac ttttaattaattgttcggcttaaaaccagaaaattgtttcatatcgattt tcccggtgaaaatcgaaggaatcgtcgcattctcaaagttttttcaccga tttgtttcaattttagcacaactaaatggaaaaatcacaaaaattccatt acagccgattttcgtgaattttcctacatttcgaactaaaaattgtcctt tcttctgtttaaaccggaaattctcttttgaaaaaccaatgaaaatttga attttctgggcttttcttcggaaaattattctcgaaatttatcaatcgat ccttgggctttttttgttccgcagaggctggcggagtttacaagcgtacg aagtggttcaacttttatataaagctttataaatgggacatagatgaata tttcgaatgctaaatgcaaaaagaatcagtaaaaaagcgcgcagccccgt 
ccttctctgacgaaaaacgccgtttaaggatcgattgctaaattttggca gtagttagaagtgtcaaaatttctgccggagagtcgtcaaatttcactga aacgtaacccggtaatttccacaattaatggtcgatttttcgcaaaaagt ggtatgtttgtcaggatttattagaaattgtggctgtccagattttaaag agtatttttgggcaaaaatgtcgaattttctctgaaaaagttcgattttt atcgaaaattcagattttttagatagttttcatcgattttcccagttttc agcctgagaactttactaacagaaagatgtgtcatgagcaccactttcat gatgctcacgagcttcagcttcttcatcttcgtcctcttcatcctccaaa tcttcatcctcatcgcccattgattccccagacgttgtttcgcgttttct catggatcttataggacgagccatctgaagtttcaattttagcttttaaa ttcaattttaccgcttaaaaatcgataattctcccgtactctgctggttt cttcttcttgttccgcctgctcctctggatcatcttcctccattggctcc ggcgatgcattcaacatattcaagccttcgtctgaaatatctggccaatt tatagaaaaaccgacaaaataataagcctcactttcttttcgagaggcgt cttcgtcagatgacgtgtacgaatccattttctggaatttgaggattttt gaatgtttttaaacaaactttatagagaaaacattcgaaacactagaagt tatgttgaaacacgagaaaattttttaaaaatccatgagaaaaacagttt tgaaaaatctgtttttggaggctctccggattttgaggaatcgtcacccc ggagacgcagattctccggtaatttttcattcatatttgagtttaagaac aaaacagtttaaaaaaatgtttttagtatttgaatgaaacttataatgta ttttttcttccattaaaacttaaaaaaaactacaaaattattatgaatca aatttgaaaccgtgaatcaatctccgcggaagggcgagtctatactgctg caagcgcactctatcgcaaatgtacaattggcggtttttcaaacaggaat taatcggattctcgtagtttattttggatttcttttttcgggaacatatt ggtgtttttgcgttcaatattcaaatttagaggaaaactgcttcaaatat ttaggtaaactcttgaaaccgctgaaaataggcaaaaataattatttttg tattttttaggctactttctatacttttgcgtaaatactatagtttttct ataaaacacccattaaaattatttttataaaatgatttttccaataaaaa taaaatgcgcaaaatgattcttttccagaatcctatatgcgcctttaaaa tctctcggattactgtagtttcaaagaaattatcctttatatttttaatt ttaaattttttcctgaatgtcaaatattaggggaaaaattataataatat gtgctttattcatatgagtgtagaattagtgaaaaagaaaaaaaacatgt atggactgtaaaattggaattttagcgagaaaataaaaataatatgcaga aaaaattaaaattttcaggaaaaaagtcagtaaagccatcaaaaactact cgattttgaaggaaatcagcaagaaaaattagaaaaaagtatttttaagt tggaaaacccctgcttgaatttgtacactaaattgggcataaaagcgtac aaattcgcaaaaaccggtaaaaatctggggatcgtgatggatggagtgtt ttgtgaaaaaatgcagcgaaaaattgagtagacaatttcaaaaatgtcga tttttgaaatttgtgacgaaaaaattgaacaaaaactgtttttttttgga 
attttcaacaagaagttttataaatttttttgtttaaaattttgaatatt atatgagtttggtttcacttaacagaacaattcgaacaaaagtattctag aaaggaaatgtgcgctccagcacactatttgcccgtggagcgcacttgtg tgcacgaacgctagcgagaatgtgtggtagaaagggagggaataggaaat attaacaaaattgggcaaaatatgtaagattcggagaaagaattggagaa aaatatgtatttcgagctccgcgagctgatcaatccaaaggctttctcca tccttttttcgagaggcacattgcattatagttacacacagcacgtgtat aatggaacattgaagcctggaaacgagccatcgctaccatcattaccacg tggatctgaaaaaattaaagtttgatgattcgaaaattttctggaaaagt tatgattgtgagataaattgaattctttgaaaaatcaaaattcaaaagct tgtagaaaattttatatatttttttaagcgtattttttccgtatacattt ccaaatttttttgttacccaattttaaagattttcttgaattttaaaatt tctttcagtaaaaactttttttcaactttttgattttttttccgcatttt ttaaaattttattcagaattattagattcttttgaatttaacgaattttt ttcgctaaaaaattgttcgatttttcccgaattaagaaaaatattatttg gtttttgaattattttcctgatttttttcgattaataaatttgtaaaaac aattttttttctaatttttggttttgatgattgtgttttttttctgaact ttacagttttcaaagtttacaccgaacttccacattaaaaaattctgata caaaaaagtattcacatgatttttaaaatttaaatatttttcaaaaaaaa taatatttaaactgtgtttttttcggaattttttttcgattttttccgag ttttttttggaattttttcctttctgctccaaaaatattcaaattcaatg ttgtgtagaaattttattcaaaaaaagtgttcaacttctgagtctaaacc ttttccgaatccttaaatcctggcagagctctcgtgaattcagttgtcaa tttatgtggatagcaagctgccagtttaatgaaagttttagttcctttgt caagtactcgattaattttcgaataatcataatcatcgactcgaacacca tataatccttgagtatagttccaaattgcttcacggaatgcagcagtgtc aatttcattctgattcacggcggctggtggttctccgtctccagatgcat gggatgagccggatggcctgaaaaattaattttttggaattattatattt ttctgtttttgaaatttcatgcatctcgaatattttaacaaaattaccaa attcaactagatttcttacaactttcactgtgtcgatttacgggttcgtt atacgaattgaatttgtttatcgatagaatattaaaatttagctaaaatt gagaagaatataagaagaaattaatttttttaatttcaaaaatcgagcca gtaaatcgacacgagcgatcgacacagtagtcatttaaagaccagtttcc gccacgaaatatttcgcgcttcaaacatgttgcgtagtacgtattctcaa aattgtgcgttcacgtataatatttatgcgaatttttggtctactttgtt agagaaatcatcactaacatattgccagtaagagtccgaatatgatcgaa cattcgatcaagccgtgacgtcagtgtatccgtatactcattcatcgtat tataaacatgatcccatccaaattcttcaactcggaatggcggaatatcc ttttcaggtcgctttctaaaatcaatatatccaaatgttcgatgatgcga 
gtaaattggataattacacggcggctctttttccataatatcttctccat tttcatcgatatttgcaagaagaagtacaggcgagtagtttttccgattg gaactatatgttgctgcaggagcactaattaatgattcaatagtttcagt agtcattgcacacatcttcgctggtggcctagtttgtcctttttccgtct ttttcagctcactgatcaaatattcgacttcagttggccgacgatctggg acttttcggaaataggccgacattctcgcctcccaatagtcgagatcatc gatattaaggaaatcaatctcatcttgtgtcaaatcaacacgacgttcca atccaatacagcatataactgtgcacattgcgtgagtcattgacattatt ccgacggcgtggtggagagagcaaaccgagaaaaacgcaggaccaccgtc tggcgtgcggcgagcgaagagcacctggaaattttcaaattcttgagaaa aacctaacatcgttgttatacgttcgttctcttggcattggagttggcag aatttgttttgaaaaaacgttgttttttttttgaaagaacatttttttat tacgggaccatgagatcatgagaattcctatttactggcgcgaaaatatt ggcaggccacggcaacgagagagcatatggcaaagagagacgcatcttat tttgtcttgtaatttttttttaaaataatttacaatcccttttcaactat cgtgattgtaaaatattacaaatttcagaatttcgctaccaaattattac tggaaaactaaactctgagaatgcgcattgagcaacatatttgacgcgca aagcatctcgtagcgaaaactacagttattctttaaatgactactgtagc gcttgtgtcgatttacgggttcggtttttgaaataattttcttttcgaga agtgacagtgatattccattttccttcttttcttcctattattttatcat tatttgcttaattttaatattcaattcataactaaattactttaattcat ttcgagtagacattcaaagaattccggtagttttcgcttcgagatatttt gcgcgtgaaatatgttgtgaaatacgcattcttagaatatggtgttcccg taatattcagaaaagaaaagatttccaagaactttctgaagatttcaata tttgcaaaatcagaaaccagttctgaatattctttatttttagaaatttt tcaaggttttctaaataacttttctaaataacctaccgtatttcttctat taatatggctgcaatactatttttcgatggtcttcccgcttgcaatacta ttagggagtgcaagtctaatagggagtgccatactattcttcagaaaatt tttctgtgttggggcttactagattctacttgaaaaaactccaattttat ttggaagtatagaaaatttgattgaaattgcaacaaaaaggtacaataac ttcaatctctaaaaattttgttataaactgttgcaaaataggcaaaaaat gttattaaaattttaaaattagtaaggagtgtttgcaacaaaaaaaagta ggtgcaagactattagggagtgcaacactaatagggagtgcaatactaat tttcggaaggtctccgaggggcaatactaatagggagtgcaaatctaata gggaggccatattaatagaagatatacggtatatatagctttgaaaaatc ggaaaatgcctaatttttactttttgaggtttgaaaatctctaaaaattc aataaaatttcaaattaccgctagatttttccaatgaatcatccatggtc tatgacagagcattcgattcaaataatccaattttcgaaatttcatgtat gaccaatcaatgcccaacaaccacatttgttgtccacccttttccagaaa 
tttgcgacgatgatgatccataagtgataggcatctgtgacgtgatgcag ccattagtgcaagataatgacgagccgaagctggtagatcacttatatca acgaacatatggccataacttcctgtcatatgaacatgtagagttgggtg tttacatgtgaaacggaataatctggaaacgtgagggaaattagttcgag acggggaggggcaggttggcggtgccaaccgacagccgaacattggggtt tctcagctggtagcgccagccgacagtctactgcagtactgcagataaat tttcgtcggctgtcggctggtgaaaattttcatgaaaatcaataatttta aagaaattgttgcaaatttttcccaaacttgaccaaatttgttggctggc tgtaccagccgacacccgaaatttagaacattgattagaggctgcttggc agaaataattttaaattcagaaattcaattcgttttcaaaaaatattttt taaaactttaccgatcaacttctggaatcggatcaaaattgagccaatcc atggcttttcgtcttttagttgtagtgtgcattgtgtagatctttttata ttgctgcgaggtgagtaaatgaagaattttcgcgacccgtttctgaaaaa actcagttttctaaggaaattttgaaaataaattcgagaaaaagaaactg agtcagcaaaagaaaattggaaatgtctgtctggaaatattcgaatatta tattcaaaagttttcaaaaaaacaacgaaattacaagcaattgtgatcag aaaccgcggaaggaactggacgaaaaaaattatctttgagacgaatctct ttgcatctttgtgatctaaaagattaataaaggttgtcatcacatttttc gagatttgggaatgtgataagggtgaaaaatggagattaattgtggtaaa atgaggaaaaacctaatttttggtgagaaaattgtggaaaaactataaaa gaatctttatggagtttaaaactcaagtttttcacgcttttccgcactgt gcggaacgttttttgagagaatttggccgaattcggtgattaaaaaaata atttcaaaactttgcgcctcaattgtgatgtattaccgtactctgttgcc attccaccaaaatttccttcattgttttgccatttttctgcataataact gttctgggtttttttgcttcatgtgcccaaatgtacgaatttccctaaaa attatacctattttttcaaaatttttaatcgctagaatttttttttctgc attttctttaaaaaaagagatttctcgcaagtagaaggagaaaaaatgtg tggctatacttcttcttaaagaatgcacgactagccatagctcaagcccc ctctggaacgttccatcttcctcccattttcccacgttcaagaatcatca gcttcttctccctcagcttctcttcttctaaaaccacaactagacaaatg ttcttgttttccaccctatttttcacataaaaccgccgagaaacccgcta tcacagactcaatgcgcaccggaggggctctttgtgtgtgtgtactgatc tctgcgttatattcgaacaccggcgcacactcggattgaaccagaggggg ggggggaggggggggggggggtgaaaaaagagaaatactctgaaattcca taaaatctagaagaagaaagaaaacaaaggaaaaattggacattccgaag tcaggctaaaaaatctcataaaacaaaatctattcgatttgtgaccattt tcatctatctctctcaaaacccgaataaacaaagcctcccgtccccaaag tgtgctctcatgctcttctggagccttctagactgtctgtagagcctaga gacagcggaattgcactgaagtgatggagagacgtagagaaaacgcctga 
agaaaaaaacgaacactttggtggaggaggagatggcttccctccaaata aacaacaatttctatcgtttctctgtgattgtgttctcttctatgtatac tgttacgatattgaacaggaaattaaattgagcactctgaatacataata cacaataaataaatacaaaaactatagtttcagcacaaaaaattcgaaaa aaaaacgattttttttgtccgagaggagtatatggcctagaaaaagaaaa ctcggccactctgatgcaataaatttaaaaaattatggccgaattttaga tttctcaggccaatttgatacgtttctcgaaaagccataaattagtcggt ttttcacgggcttcttgccttcctcattgcatttttcgcgctccattggc aatctcctgctggacaacgcgtgggaaatcgtgtgccccacacgggcaaa tacattttgttttacaaagaaaaccgtgccgcgacgcgacacgcaacgag ccgtaaatctaccccagatatggccgagctcaaatggcctaacctgtcaa aatcttccacttcaaaatatgagggaagccagaagcgcgtgttgtttctg aaaaaaaaacccgcctaaagttgatttaaattatcgtttttttggaaata ataaaatcgatgaatttgtagattttgataaatttccgataaaaaaaaaa ttttaaaagaggaaaaaaaatgtttcttcgccctttagtaccaaaaatac gcccaactaaccaaatcgttctttcaatcttttttaaatgtttgtgcgtc tataattgtcgcttcagaaaactacacaaaacacacacacacacaaggag aagaaaagaaaaaacgtgttccatgacctgccactgggatcgatctgtaa aagaattggggaaaattgaggtaaactggttttttatcgggaagattttt tcggaaggattgagatgaaagttcgaaaggtaattggcaaagttgaaaat tgaaaaattcgaaaaaaatctcaattctctgctgtaacccccaattttgc gtcatggcctagagtatgcagcgtggcctagaaattcctaacgtggccta aaagatcacggcggtacctatgattttctagcgtgacctagaatatacca gacctagaatttgatagcgtagaatttcccagtatatcctagcagtctta agtgacagtttctcagtacgtccaagaattcgtcagcatgacctaggatg ttaaagcgtggcctacaaattttcagagtcttctaggatattccagtcta aaaattttcagtgaggcctgaaatcatcgcgtgtcctagaatgtctaata attgcaaaaaaaagatttgaaaactagtatttaccctaaaattgcatttt gagcattatttttaatctagttttaaggaaaaaatcagaaaaaataaaca ttttttgattaaatcttccgatctacagatagaaagtgtgcaagaaagaa tgcaacattgtgctcggtggagcaagaagataaaagaaagagaaagaagg tcccccacccctccagtggtcgaaacaatgataaattggacaaacggagg accaaggggccgggcagacacaagagagagagtacgtgaactgaggaggg tgtgcagggaaaaatgggatgggggcaaatctagttcaaagatgagacac ttttcaggatctttgattctgagaaaaattttgaacaaaaagaatacttc aataatttaatggcacatagaaatattttcagattgttcttcaaaagaaa aatatttttatgcccggaaaatttatttattgcatttcttccaaaacagt ggccggtctcgacacgacaaatttttgttaaatgcgaagaggtgtgcgcc tttaaagagtactgtaatttcaaactttcgttttaatatttacttgtggg 
aaaacattaatgcttaacgaaaaattacagtactctttaaaagcgcacat cttttcgcatgtgacaaacattttcgcgtctcggtgacaacttttaagtt aaaggcacatagaacttttctgaagaattttatttatttttctgaaagtt aattgctacagtatcctttttcaagtcgcaccgagagccaaactgtagca aatcatcaaaaaaaagtcgacaaaacgtgccgaaatcagtaaacttgaga gctttaaaactctattatcagttcttcgccaacaaaaaaaaagagtaccg tatcaaaaacgaacttcgacttttttggctctcctgcatacggacatgat tctgattgacagttttcatgtttttttttgggagttttatttattgtgca tttaaaaaatcgtatagtttgatgcgtggcctagaatttgccagtgtgag cattaactctccacggtagccaagaaattttctacggtggcctaaaaact gccagtgtagcctaaaatattttattgtggcctaaattttccaatggtct gttttttttatagttgcctagaatttcttttcgtgacctagaagcgtaca gagtggtggcctagaaaacgattcatggcagagttttgaaaaaaaaacga aatttcgagaaacaagcgaacaaaaatcgtctgtcgaaagagtatttcga atgctggggatgcaaatcagcaaatcattcaaaaaaaacttttgtgataa gaaatcaaactgataagccagtgtcaaagtctcgaggattaaaaatagca tttcaggtcggggtacggtagggtttttgtagaaattaatgcaaaatttc agtgggaaacgagttcgtggcctagaaaaatcatgtctgaaaaattgcaa atgcgctcccccgaaatggttaaaaattttcaattgatagcctatttgaa gtggcggcctagaatatcaaataatggcctagaactcaaattggcggcct agaaatcaaactaatgacctagattagggcatcttgtaggcagcttagat cacctattataggcaggtgtaggtaaaattgtagacaaatgtaagtttct ttgaagataggcgtaggttcctttgcaggcatacatagatcatttattag gcagatgtaggcctgattgtaggtacagtgccggccaaaaatatatccta tttttgacttttgataaatttacaaattttccaaacgagcacaactttaa aactagaaatgttatcgaaaaaagttcaactcatgtatgtattgcccata attacgtctactcgtattcaattgtttgttgtttactagtgtcacgacaa caaatacagcggccgacatctcgtaagcccgtttttgacaacgtttactg attcggccgtatctcgaaaactaatttttttctgaaaatgttgttaaagt gaaatagttttcatgttatttgttatcatttgtgtttattcactttgttc tgaaaaatccagtaaaaaagttatgggagtgcaaacttgtcgctcactgc cactcacccgctacaatcaaaaatcaggttacttatagttagttctaatt ttttttttgtagagcattttttagaaataacacatgtaaaatcacaatga >Contig2 cctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaagcc taagcctaagcctaagcctaagcctaagcctaagcctaagcctaagccta agcctaagcctaagcctaagcctaagcctaagcctaagcctaagcctaag cctaagcctaagcctaagcctaaaatagtgactctggcagttctctaaaa taagtgactctggcagttcaccaaaaattgtgactctgaccgttcaccaa aaatagtgactctgaccgttcaccaaaaatagtgactctgaccgttcaca 
aaaaatagtgactctgaccgttcaccaaatatagtgactctgaccgttca ccaaaaattgtgacaatgaccgttcaccaaaaattgtgactctgaccgtc actatttttattgaactgccagagtcactatttttagtgaacttccagag tcacaatttttagtgaactgccagagtcactatttttagtgaactgccag agtcacttattttggtgcactggggtgggtcacgcccccagttctcagtt atgggtactctgatccactcgggacccactttatcgtgttccccgtgcct catttaccctagagcttcctcctttacctctcctctcgctatctctaaca ttccaatggaaactcctatttgaattaccgccaccgatgtgcccgacgcg acttactgttagcccttgttttgcacaaatctgttggcttccatatttaa aagttaattaatgacccaatgttctttttttctctaaatctccacaagat gttctgttttccctactggacactatcgttcactgcgtctcaccaattca cattgtctctactttaccttttttgtcatagtacacgttcgccaacggtg tcgacggccaaatgctttgggcagcgtttgctttttttataattagtttt attttattaaaacaatagctctaaagtttacaagtcatttgttataggct aaatgagttatgtctaataagtaatttgaactagatacttccgtgtaagt gacaatgtatcggaaaagtcctcaaagtgcgatgtagaagttcacatgta ctttgtttggcatgttagtaaaagagccagtatgctgattcattttatat tctatatactcatgtaatatgcccatgtaaggtttaattccaaaaatatg agcgtgttctattttataatattttactaaaatacctttcagttaattgc actcaaatttgttgttcttcattctctcgttatgatttaatcttattgcg tcaaggtcattattttaggtccattagttatcgatctgaaacatgttgtt gtatttttctattcttgtgagctcaggacacctcatacaactccagagaa aatgtgtctcattattcttgtcttttttcaagatctaatcaattttctac attaacgacgtttttgtcgttctgcttctttttttcgttcgtttgtctcg tccatcagctgtccactcatttctctcccactcactaggcagtgctttgt ttggttccgattggcagctggctgcagggcctgcatctcttctatgtctc tcatttacttgcattcttttcttcgttaatttttgttatgatatttaaac gggaagaagagtttgtggttcttctttttataatcactaaaacttttgga taagtaacaattttctgataaaaatattttcacggcgaagaaaaaagaaa aagaagagtagtttttgcacgttttcatataattattttcgttgatcaaa tgttcttctggagttttctaataaatttcttatcgactttttttcagaaa tttttctcaacttgtcatgtcaatggtaagaaatgtatcaaatcagagcg aaaaattggaagtaagttctttataatttcatttatatactataagtttt ctcgatcacaggagaaacaaaaacaacagacaacacaaaaaacaataaaa caatattgctctagtaatcaatagtgttgtaaagagggaagaaaattgtt atctgtgtagcagtcaacgttgattgagatgttgtgtttgactatagagt tgaaaataataacttcaaacttgcaagtcatgacttatcaaacactgccg gaacttattctggatcaaaggaaagttgtccaactgtagagtcatgtttt tcaaaagaaaacacaatttttaagtataaatattttgaaaaagtatgttt 
tagaagtatgtcaaattaaaaaaaaaatccttggttaaaaaatgattttt ttggatatatgtgtatttttaactaaaaatatatactttacatatatatt ttggcgcagttatttgatctataaatcaaactttttgatagacatttttt tatatttacaacaactagggttgttatgaaaacgcctattattctacaaa ctaaattattttaatcatacattccccactatctaaaaactaatgcaatt ttcagattttgtcatgtaaatgggtaggatgtctcaaatcaacagaagtg ttcaaaacggttgaaaagttattagatcatgttacggctgatcatattcc agaagttattgtaaacgatgacgggtcggaggaagtcgtttgtcagtggg attgctgcgaaatgggtgccagtcgtggaaatcttcaaaaaaaggtattt ttaatttaatgtgcattttataatataaattcttcagaaagagtggatgg agaatcacttcaaaacacgtcatgttcgcaaagcaaaaatattcaaatgc ttaattgaggattgccctgtggtaaagtcaagtagtcaggaaattgaaac ccatctcagaataagtcatccaataaatccgaaaaaaggtattcacaatt tgcatgatattgttataatctaattttcagagagactgaaagagtttaaa agttctaccgaccacatcgaacctactcaagctaatagagtatggacaat tgtgaacggagaggttcaatggaagactccaccgcggtaagtgtgtttct ttaaaaattacttccttttttcaattgtttgaaattaacaagaaacctgt tggagcgtatttctgaacttttaaatcgaaaatatcatttgcaaaaaaac ttgaaaattgagaaacttttttaaaagtggagtagcgtctgcgggttttt ttgccctaaatgacagaatacatacccaatataccgaatataaccgtaat aaaattatgcgatttttatttttatttttcatgaatgttaggggcaaaaa acccacatgcgctactccgccttaagaagaatcagctgtgagcactatcc actatacattggaaatttacaaataaaatagagattaagtaatataattt ttaagggttaaaaaaaagactgtgatatactatgatgatgggccgaggta tgtatttccaacgggatgtgcgagatgcaactatgatagtgacgaatcag aactggaatcagatgagttttggtcagccacagagatgtcagataatgaa gagtacgttgttttgcaaattgattaaaagtggagtagcgtcagttaaaa actctaacatgtcttaggtttttcaaaagtttggtcaaagttttggcaaa ctgccaacttcttgaaaacttcgttaaaaaaattcttgaaatgatttgaa aatttgtattatgttattctcttatttctgcactattctatatggcgcta ctatacttttaattgatttcttgaaagcagttcaataataattaatttta gagtatatgtgaacttccgtggaatgaactgtatctcaacaggaaagtcg gccagtatggtcccgagcaaacgaagaaattggccaaaaagagtgaagaa aaggctatcgacacaaagaaacaatcagaaaactattcgaccaccagagc tgaataaaaataatatagagataaaagatatgaagtaagtcgaaattgac aaacagtggtttttgtttaagtttattgcgaaatattcaaaattagacat gttaaaattttgcgagataatctaaagattaggtatacagattttttcat gtaaagttacattcatcaaaatttttgtgttcaccaaattagacaaaaaa tgttagttacacagtatatttattttttatatcaataaaacctttttcag 
ctcaaataaccttgaagaacgcaacagagaagaatgcattcagcctgttt ctgttgaaaagaacatcctgcattttgaaaaattcaaatcaaatcaaatt tgcattgttcgggaaaacaataaatttagagaaggaacgagaagacgcag aaagaattctggtgaatcggaagacttgaaaattcatgaaaactttactg aaaaacgaagacccattcgatcatgcaaacaaaatataagtttctatgaa atggacggggatatagaagaatttgaagtgtttttcgatactcccacaaa aagcaaaaaagtacttctggatatctacagtgcgaagaaaatgccaaaaa ttgaggttgaagattcattagttaataagtttcattcaaaacgtccatca agagcatgtcgagttcttggaagtatggaagaagtaccatttgatgtgga aataggatattgattttataacgtgtaattgagttttggccaaaaaggta tggaaaggtggctgtttagttatatatttttctattatttatttgaaaca tgcaaaattgaagtgaacaataagtgatgttcatggaaatttaaactgtt ttatgatacttttttgagaaattgaaaaatctgttcattttagaaacaat gtccacatggttctaagagctaaaatttttattttcatccatttagagta ctttctcttttagagtacggccccagagcgatgttagaaacctgagatcg gtcaacacagaccgttaattttgggaagttgagaaattcgctagtttctg acacgaatttcagctaataccaaataatgtgcaattgcattttgcatgtc agcattcagcattcatacaaaaatttcaaagagccaacttttcatacgtt tatggtcaacactgtatgtgttacattgaacttttttaaattgtattatt tcatatttgaatcatttccatgccattttcaaatctttttttaacaaaaa tttagtttcaaagttttaaatttaggtgaaaacttgctacaaaataacac attctttagtcgtttcaattgctatctatccgcagactgcaattttgttt tcccacaaccattcacacaataaataagggtataaagttttgttcatata acacatttcaatactaacatttcaattttgaacaatttttctaaacttat ttcccttcgcccaaacgtcattcaacattctttgtacaaaacattaccat tatagaaaatctcatttttccactatttcatttatttttattgttccgcg ataaatataaataaacatttacgtgttccgagttcaaagttttaccacgt ttcagaataaaggaatcggaggggggggggggtgaaaaaatcatttcaac aaatcagaatttctcaaaatgtgagttttatcattttcattgttagaatc acgattagcttatattgaacaataacaataatttaatcccattcattcca tatcttctcatgatgaaaaaataagcattattcgttttcttttgacgcgt ttgatagggttctgccagcgccgaccaatattcttctcaattagattttc cagaactgcactaactgaaaattgttttagtaaatagaacaaaactgact attagttcaatattataattttacttccctattttctgacagcattttgc aaatactctttacagtcattcttgtattttgacaacaaaattcgaattta aaatttcatttttcttttaaaatatcagtacactcctggtagaacaaata ttttatttcaatgtttatgtatgagtagaaaagttaaacagaactattat ttggcatcaatttctgttttttttttcatctaaagtatggcatttattag caaaaaatacatgcttgaacaggaataactattaatttcctatagcgacc 
caaaaacaaccaaaaattgttttaaataatttttttttggtcgacttcca tagttatgagtggcaaaaactgagtaattgtcaccttttgacagtaaata aagaaattttcaaaaaatttttgaaaagttttattatgctattcgatcat tttggcaccatgtaggctttaacaccccactggcgctactccgcatttaa aagtgatttataaaaagtgtaaggaatactttgctccaattttcgtctag tgcatggcgtacccatattttgttcatgttgtgtctcttagcctctcctc ggaggcgagtttccatagctacccacaaagaatttatttttctatgtatg cttcgttggctccaccttcattttctcagtcgtttctcagttttctcatg aattctctttttgttcgtttttagaatagctatcgttttaccacgaatgc gctttttcagttattaagaaaacagttttttgtagtacatttttcatagt ctgaactttcagcatggaaatagtaaaaacaatcattccacacaaccgtt catacattgaaccgcccataccatctgccaccgagtggcttgcaagtaag ttgttttgtctaaagacataatttctgaaagaaatctatccgatgagcgg aaaataaacagaagttgtttttaagaagactttgatgcaattacaaaaat atatggctatgtgcgtaaaaattagacgtgactcgtcataaagaccatat aggtctacaaaggcgtttcgagtcttataaaagttcgttcaaactttttc ggggtcttgttaaaagcacaccaacaatacgttaaaggatttataaaatt gcaagtattcgtctcaattgcaatactttggaagccgaattttggcaaga tattggtaaaacaggtacattacaatagctatttttggtaaaatgtacta ggtatcttgtaatgagttgcgcttgcctcataggcatacatctaccgtat attctttactagtgctgcaggcagcactaatttctaggcccttttttaat gcagtactattagagactgcagtactactggagatgcagcgctaatagag aatatacggtatgtagatgcctagaacgctaaggttgttttattcaaata aatttcttagaaaggctaatattatttttaaaatcaagatttaagttaat taacacgacctcgtattctatatttcagatatgtcagttatgcatgtgtc atgtttaaccattgcttgtgtatttgttgctatcacatttttgtcatcat ttttccatttattttttgtattgaaatatgtgtctaatgagaggatacgc aatgatatgtacgcattgatatttatgtttcccgtgagtttataaatgga attatgaagagtttggaaaatatattttagataaccacatttgcaagttt agtaggaatgttcataccaagagcggctattttcctttatgcagtttccc ttgtgtaagtgtaccttaacaccacaaacagaaaaatgaactttattgat ttgttccagatatttcatgtttaccctcttcataatggtcactcttttat tcaacatttttggaggtcggcaagaaatgagtgcctacttgctccaaaga aatattcgagttaatttcactgttccaccactctgttttttcaagttcct gccaactgtagaaagtacagagtgagctcgaaaatcttatttttgtcttt ctaatatattttccagtcaaaacctacgtcgaatcgagtggttggttttt caaactcctattatacgaactcttttagagttagtctcagtcgttgtgtc tatggaacaagaaggaagacgagaaagcgtgtaagaccatgaaatgtttt ttgtgtgcgcaacgtagtgcgaaaagttagttatttggagaggaagttat 
ttgaaaagaatctttagaagatattgatattttatattctgtatgtaggc gctgtggcaaattctcgacttcacgggagaagcgcaaattttgtgctggt ttccataatgcgctcgaaaaatgttgtatatttacattattcaaatctca ccgattaaagttgaatttcagttggttcgtattctctcaactcatggcac tactctcgatgtgcattgccttttatggttgctacgttatggttcctctt ggaagagaaaagcatgctccataccgattcgattttctatttcgtacatg tgacattgctcagtgtatttacacaattcaaaagtttgtctttgagtttg ctgctgcagttggcttgatcacttcggatcgataccttccagctgctgcg aaggcattgtgtaagttcagaagtttaaatggagataggaatgtttcgat tttcgtttataaaaacacctgttaatattaaattgttgacgtcatagctg accattcatgcttctttgtctgattgtaattatataattgtaaaaataca aacaagttaaccgaaaaagttctctgttacgtttgcggtaaattagaact ttatgttgaaagatatcagcttaaagttaaaattttttgaaaattaaatt ttggctagaaattcaatttttctgataaaaagaaagtttggtgggaaatt taatttcattgatagaaaaactgaagtttgtgttttctgagaattaaaaa acagcgggcaatgaagattatgtctttgtaaatttcaatttaaaaatatt ccagggtgggcatcgtttatgtgcacctgggagatgatgctcctctctgc tctttgctcatattgtttgcgtccagccaagtgtaaattttttgatcttt accctggaaacgatatgccagctttatcggcaagagatggttcaaactcc cgtgttccgtcattttctcgacgtctatctattgaatatgagccacgaat tgctggagtgatgttggagccgccgtcaagaagttcactgagcattactc ctagggataaaatagaagacccaacaaccgtatcgtattttgctgataat tttgattctttatcgcaaatccaaggacaataattgaacttcctgccatt atttcattgctcaatttacactattcttaactttttgacatgaattaaat ctttttaaactgagtctaagcgtattattgtatcgtattttccccttctg atgtattcatttacttgtatttttgaaacccatcccagtgatacaactac ccaattttcctgtgccatgtttcttgaaacaaatcaaatgtgataaatag tttgaatgcctttatgtataataaaatcaatttttcaagccggatctccg tgtttgccattttgattacccagcgagcctgaaagatttgaaaagttata tgagcagcatgaaaagcctgttctttggtgtttaaaggtagcacacaaac caaaatttgttagcgcaaatttaaatttctatttcagttaccatgagtta gtctaacaaacatttttcaagattatcagacaactgataattttaactca ataagcatgattttgaacaatttcctaactggcgttacttcaccttgaat aagattgaagtggttattggttcttatgtacagtgcttatatatttagtt aataatactaactgaatatataagcactcgctaaacttgcgcacaaactt gcgcttagatttctcggcgttttcctgtgcagtctttcttcttcttcttc gattactggcatttcttcaacgaaccattttacattgttggcagacgggg aaatctgagaaaacattaattatttttgagaagatttttcaaaattaccg ccgattttccttcagcagaagcagtaattttcagaggttcttccggaatt 
tctatttttactgcctgttgttttgctattcgaatacgttcttgaacaag tgcaggcacatttatcacaacatcatctgtgttttctattggctcaattt cacgtaaatctgttgatgggcgacgatggttaaggttcaaagctgctacg cctaaaaagtttactttctcgatataatgttttccaaaatattttgtatt tactcacatacactagctctcctcggttcagcttctgatttataaaattt tgcgatagctgcagaaattttgttacgcggttgttgccacatatgtcgtg gcaacatcgaagcgttactttgcgttcttgtattttcagttgaaaaagac gatgcatcatcatcaccaaacgatctcgatcggcgtcgaacatacgaaat caaaccattgtccaagtcagcttcgttcctgaaaatcgtccacatttcaa atttcgaatcagatcatattaattcacatatttatgcaacttccttgcat atgagaatgagctttccgtgttgaagttggcgtgtccattgtttgtggtg tttcatcttcccaaaacggatctcttttaagtgtcggataccgcccgtag gcttgatctacgactgcataccctacctgcaaaacttctaatagttttca atcttattcgaggtgaagaataaatatctctttaacctggagattccgct caatgatccagtttgtttcaaaatcttcatcgtcttctccaagaggattt aacattacttctgatacttttagccagccgactataaatacaatttgtag agatgtcattattggaaaataaatgtctatcttccactttgcaccaggaa tgttattgttgttttcaagaaactgtcttccaaacaatgccaatacaaaa taagtccgaacagccaggttaacaacttgagtataaaccagaggaatcgg caccatatcgaaaatgaccagatttagaatttttgtacggaattctctca ttttctgaaatttggttgacattaatcattgtcagtactttactaacatc aatcaaatcaacgtagagataactgtctgcaataagcccttcatccttag caaccgttaccaatgagaacaaccattgaattggttgccagtatttgctc tgcgggctcgtgatggcatcaaattcagtcagctcatcttcagttaataa tccagcaccaattaggtgttttatggtaggaaatcgtcgtctaattgcag gtgaaacatcacgaaacaccatcacttgtgcaactatcatgtaccgcaca cagtttcttcgaattaatcgggctttttccgatgtaccacgaatatattg agcaattgtcaaagcggatctacaaaattattaagaatgcaggaatattt ttgaacagaaacaaacgtatcaatccatccaacgttatcgaacactttgg tccatcgattataaacaattgatacataaaaacctaacatgaatgttacc ggtataaatactgaaaatgtatcaaaaaaagtgcacagctgttcgaaaac tctgaaaaaagcaagttaagctcaataagaaattaattgcaacaaactct ctctgtgctttgttaagcaatattctataaatcactgataaaatagaata acacaaaagccaaattagtaactctgaccatattgatttccaaaccgatc ctttccatctcaataacacttttatttgagtgaacagacctgacgtggca acatctagagaataagcgacagtcatcagtcaaaaaagtatgtgtagcaa caaactgaataaaaaatattatttgtatcagtcggtgggtgagcgattag tatcaagtagcacaaacgctaccgtttcagatttgcatattttattgtta taggggttattcaggcataggtcggttgaattccgactttttattcacat 
ttttccagaaacaaatcgattctcctaattttatttttatgctttatctt tttgaaaatctggcatcactgtttgcggaaaaaaatataaacaagaggga atacagtttgtgggtattttgcttacgttactgatattatcgccttttaa tctatattttagtaatttatcttgcgtaaataccaaaatatggattaaaa ggagataatatcagagaagttaaattacagtagctgcgacaaagaaaagt ggccaaaatttctgattttagccaaatttggctttttttcgaaattttga cccgccataaaaaatttagaataattttataatttttttacagttatgct tggtacattgagactttattctatcattcaaaacaaaaaaataccacaaa tgcttctccaactttgagaattgtaaaattttcaataagccaaaagtcag ttactggtacctttgcacctatcagtacttgccatcaaaagaaatttccg agaatgttcgcatttcggagtgccgtaaaacttgttcctgagagatatat atcgtctcatcaattcggtatcagtcaacctcccatattgtgtcatccga tattcaatctacaacaaacgtcatgttgtttgttttcaaacaaagtgtat taacattggactttcagatagggttttctgattctttaaccctctaaaaa accatttccctcatttccataatatttattctattttatgcttaacaaat ttacacgagtttcaaactatttgattgttcatcaaaaaaaatcccaaaaa ctgttttgtttttatatattgaactcaacaacataatataaaaactttca aatcgtaaatcatctaagaaaagatcacatgaagtgagtagatgatagag aaccagttcttatttttatgtttccgttacttttttgttactaccactaa taacttggcatttttcaatcaatattttttacagaatgactgtaacttat tcactcgatgttgcttcttcttcttttttctgcttatacaaactactatt tcgatggaaggtgtgctaaacattgcagatattttgtttcattaaatttt taggggtcaatctggaaatcggtgtgggctgagcttgtagtttggctttg tctttatgcagtgcttagtgttatttatcgatgccttttaacaatgaagc aaagagcgtaagtactgttttcaaaaataaaccgggagtctgactttcag aacgttcgaagatctttgtatattttttgatacttattccaatttcattc caattacattcatgcttggattttatgtctctgctgttttcacacgatgg tggcagattttcgacaacatagggtggattgacacgtaaatgacttagtc gtattacgattataatatctaaataattctaggccttgtctttggataac tcaatatatcaaaggggaaacagagcgagcaaagtgtgtgagaagaaatt gtataagatactcaattcttacacaggctatggtaaatctgtgtgtaagc ctaactaataacaatagtctttttgaaggtgtaccgtgacgttgcagcaa gcgttcgcaaacgtttccccactttcaatcatttagttactgctggcttg atgacagaaaaagaaatggccgagttcgagtctatccctagtccacacgc aaaatattggcagccaatgcattggttgttttcgatgatcactttagcgc gagacgaaggaatgatttcaagtgatatcatatatgtagacttgatggag gtacataattcagaagatttttaaagagtaataaataataagtttcagaa aatgcgccaatttcgtgtcaacattctttcattaacattatttgattggg ttcctgtccctcttgtttatacacaagttgtccatcttgcagtacgatcg 
tatttcctgatagcgttgtttggtagacagtatctccatccggagagcaa ccgtttaaatgactttaagcaaactattgatttatatgtaccaattatgt cacttctccaatttatatttttcattgggtggatgaaagttgccgaagtg cttctcaatcctttgggagaggatgatgacgattttgaatgcaattggat acttgacagaaatttacaggtaaacgattaacataatcaagatttattat tattatttaatacgtttattgaaaagtgaaaatgatagaaaaatttgatt atttaattcaattttaagttagaaaaatatcctacacattttctgaagaa gtgtcgtaaatggggaaactttttaaacatgtacgttccaaacatgtgcg ttccaaagttccgaaaaaaaatttgtgtagtaccaaatattaaagaattt tcttcattcttaaaatagtcgtcttgatatacttctgatatgatagtaag tattgaaacattaactacactttttcagttattttttattcgcgatacca tccatttaataaaataagggagttcatctacacctgtgcccttctatcaa acttgattgaagtatattatttttaggtgggattgatggttgttgatact gcatataaccgttatccaactcttgaaaaagatcagttctgggaggacgc aattgcggagcctctttacactgcagagagtgcgatgagacctctgaatc cacaagtcggatcttgtgcggatatgtaattgagtgaatttgttccaaac aattgattttcatgttcaggccaaccgaagaagagcctttcatggttcgt ccacgaagacggacgctgtccagaatgtcacactgggatggcgacatgga agatactgatgttgttccggttgtgggtctgaaacacacgcgtgataata gtaattatgcttctggcgaatctctagcattttcgaatagctttgccaat ggtggtaggaaactgagtgagatgtttcgaagaatgagagctgggagcag aattggtgataggtataggaaacgcaactcgtcagcacaagactttgaaa atggaatggcaaagtttgtttcatgaaaatatgaatgttatttattattt cggctatttacagaaaaaacagtattgatgaaaatgcagatattcacagt aataggctcgatcaagcatccggtacaccaaaatcaggaaggctttggag ttcgatgcctcaaacacaattggaagaaatgcttaaggtttgtttattca ctggatttattcactggcgctttcacgtaatttcgagactgcaaatttga acttttaaaacaatattcggagaagaccattttcgagctaatctagcgtt agttttaaaaaatgttttaagtttttttttcaatcacaatggtgattgaa aatctcagaaaagtagagtacttgcgttcatacttaacgcatttctcata tttttatagaattacacgccgataaacaaatagttaagacatttcagaat aaaaactttaactctcctgtcaaatacaacactgatgggatgaaagaccg agagcttcaaaatccaacaccaatcactgatcacattgatttgcctttgc atgtggcgagtagtcaatcatggtttaacgaaagtttaccagtaatcaaa gaggaggaagaagctaaaagaaaatccaacacggatacaggtagacatag tcaattttgtcaaaaaaaattaaatgagtttttcagagtctccaaagtct agtaagcattcaagtatgtcaatcagaagatcggaattgagaagatcatc atcttcaggtagtgatctaggcaagtctggaaagcgggagagaaagaaga gcgagtgatttttgaacagtatgataaaatattttttgtttctcttttca 
ctctaaactgaagatccctttcatttcatttttacatatttattatattt taaatttcaaattgcttaattaattttctattttttaataaacaattgtg taaatatatatattttttaatacagtgtgggaaagttctataggaccccc cctaatttgaaggtttgaggaacttccgaaaatttttttgaaaaactgct aatgccattcgtttttaaattgaaaaaaacctatatacatttttttccag aagtttatctcaaaaactgaggtcgcgctggaaaaaacgtcaaaatccag tgtgaaacttctataggaccccccgttttttttcacgatttttactaaaa tcaacagattttggaatttttgacaaagctcaaatcaagtttgagttaga aatgagttcatataagcagttttgactttaaaaattaatacgaaatgttc tcgtgggatctccagactggttctgattcttccgatctttgatgttcaag tctgtttcaagcttcctggtgctctcggtaatgccaaaacttgataaact ctctttaacaagttcctactaaaattcctagcacacacaccataaacatt tttacgccatccccaagaaaccagtcagaaacagcgtattaacaagttgc agttatttttgatcaacaacagaacattcatatactaaaatcaagaaagg atcaatagttaatcgggtttccttgtgtgcggatgatctcaaacagtctg tcctccattgatctgaccaaacttttcagctggttgtccggaatagactt ccaagcgtcgagaattccttgcttcaacgatgcaactgttgggtaagtct tgttctgagcatacacgatacggacaagaatcccccacaaattttcgatt ggattgagatcaggacttcgagctggccaatcaagaaggttgatcttctt gagcttgaaatagtcgcgggttgagttgctcacatggattgtcgcattat cctgctgaaatctaaagttttttctggagtagtgacgaagatatttggag agctccagttccaagacgttctgatagtcagtgctgttcatcttgctact gacgaactgtatctcaagcttcttcttctccgtgaacgctccccaaacca tcaccgttcctcctccaaaattacgtctcgaaaaaaccattggttccttg cgcaaatcgcgccaatagtagcggcaaccgtcaggcccatcgagattgaa tttcttttcatcggagaagacaacctaaaacaatgatcctaattattcac tcttgcttttttaaattctcactttactccaattcgttcccatattgttc ttagcaaattccaatcgcttgagtttatggtctgcagagagtaacggagc agggcgaagtttctgacgaacgattacaccagatcgtttgatgacattga ggatggtcctttttgaagcagacaattgaagctcattgcgaatatctctt gccgtcttacaggagttggaggcagcacgaatcacatttcgttcgtcacg cacggagagagctttgcgacgaggagctcttttagatgtaccgtagctca ccggatccttcagatacacgcgaatacagtgtcgagaacgggaaattttc ctactcatttcatgcagggacacattgagcaatttcataacatccagctg agcgcgttcagtgtccgaaagggcagatcctcgaggcattgcaagttaga ctgctttcgaagtaagctttccagcctctatatgtgtgccacaacacatg ccacaattccacatttaataattcacgcaaaaaatagtaaataacatctg tgagggacaatttaacttgaaatattggtcccatggaaccttgtaatcaa agaaaaacgatttgattcctgataagccttccattgtttcctgctgcata 
ttttgccaaatcagcttgactacacagtcgaaacatctaaagtgcgtgct aggaattttagtaggaacttgttaaagagagtttatcaagttttggcatt accgagagcaccaggaagcttgaaacagacttgaacatcaaagatcggaa gaatcagaaccagtctggagatcccacgagaacatttcgtattaattttt aaagtcaaaactgcttatatgaactcatttctaactcaaacttgatttga gctttgtcaaaaattccaaaatctgttgattttagtaaaaatcgtgaaaa aaaacggggggtcctatagaagtttcacactggattttgacgttttttcc agcgcgacctcagtttttgagataaacttctggaaaaaaatgtatatagg tttttttcaatttaaaaacgaacggcattagcagtttttcgaaaaaattt tcggaagttcctcaaaccttcaaattagggggggtcctatagaactttcc cacactgtatattgcaaatacatgacataaatttagatgcagggcaaaaa ctatagatcaaaattttctattgcactttttatgtataatcaataaaaat tgaaaaaataaaaaactttgtaagttgatgccgaaacatttcagtttcta ccaaaatcgttcgattttatactgatcagttttgatcactttctggtaaa tttcgaaattcgcgtttttttcagttttagagttagaatattagttacta acaagtttagcaattttgaaggatttcttcaaaaaaaactgctcaaagga ctttgctgcaaacatactcaaatttgcagcaaagtcctttgagcagtttt ttttttgaagaaatccttgatttttccttattttctccttattttctaat tttattttctaataaatccttattttctaatttttcgtaaaaaattatta aaatttcaaatttttggaaacaattgtttttttttcagtaattgaccata tttttgaccttcttgtacgtgaatgctttcctttcctctattagggtgtg tgactgcgtgtgtgtgtgagagtgtgtgtatgtgtgtacgtgcgtgtgtt ccctggcgcggtggtggtgttggccacacggccctgcgacccccataaaa actcggttcgatagagagacacacgggaatgtgagagagtatgacgattc gagagacgcagacgcacgaggagaaacacacgtcacgcgaaacacgttcg cgtcgcgtcgatgagcgcgcgcacacgtccacatcgttgcctggatgagt gggtttttggtccgcacacacgaactgtttttttttaattcttgtcttcc ctagtagtgaagagttttccaaatttccaagtatgtagttttaagtttct gattaagaaaaatattattcatgtgttttgaaagtttgtcagaaaaatca atatataatatttttagacgccatgattttcaaccaaataataagtttaa ttttttttgtgaatctaacttattgatttctgtgtaatattttcaatcgg tgtgttttttactacattgatatctacattgatatatctacattgataca ttgatatttttcgtaaaaaatttttttactccattttactacattttact acaatttactacattgatataaatgatgatttttcagatgaaaagaatac tctcagatggagtcaatgagccaaaactatgcaaattcataaaagaagaa tcaccacataaagttaaacaggaaccatatgatgatgaagaccttgtaca tttgggatccgaatcaattccatcaccaacttcatccacttcgcctccat ttcctacagaacctgcggttcaaacaattaaacttcccaaatatatggag gtaaccatacacttttcatatatgtgtaacatggggcggaaatgtgaact 
gttcatcgagaccgaaaaaataatattttcaatgattacttttggtaaac ttttcaaaacaaaattggcaattttttcttacaacttcaaattgttattt atttccgattcatctttataacttcaataatttatttgagaattctattc aatgctattaaagtcaaaaatttgcgaaacgtggttttgcccgagttctc tgaaactttctgaatttggaggagtatagaaaatggttcgtaattttttg caaaaaagtttcaaaaggctgattaggccacgccctttttagagagttac tcgtcttctaaaaagtgtcactggttttcttgattcgttttctctaatgt taagtacataatgacataaatcaaacaaaaaacaatacagtgttcttggt aaacgagaaactgagtgcattttttaaaaaaatgtgaaaaagtattggta aattgctaaaattttgaaaaatataagattttgaggaaattcaaagcaat gtcgcatggtccgacctcaacccctacattggtccgacccctatacgagt aattaaaataaaattaaagtataaaaaatgtaggaaaaaaaaattttttt ggtcgaattccaaacttatgagtggaaaaaactgagaaaaaaatgcggat ggtgctctcttttgttgaaactttcaacgaaagcctttaaaaaaccgctg aaagcgtccaaggaatgtctaaaaattagaatgccgctgtgaaatttagt aagcgatcaaaaaagttaagcaatttactcaaattatttcagttaaaatg tggtgccctcgttgctcgacttcacactgaattgtttatttgtcctggaa ttcgagaaaaatgcatcgaagtactaggtcggtcagagagtattacataa gtgtcattgattattcacagactgtcccggagagttattgacacctgttg agtttacaatcaaggctgaaaagagcaagcaaaaagactggaaaggagcg ataaagcataatggaaggatgttaaggtatgcttcttgtagttttcaact cttaaaaacagaacaatttaacagaacactcatggaattcaaacaattgg atttttataatcatcatgagatgtgttcattcaagtgtcattctcgaaac tatatcacaaaaaacggtggatctgttccaaaacttccaccaaaaaatgt tcaacgtcgtcactcttctgcatcaacaacatcaaacgtttcacaaacag cgattaatcaattacttcaaggagagctgattaaaaatccaaattttcta gctgcgttcgctgctcattgtactgctgaaaatcagaaacgacaagaaga agctgagagaaagttgcaagaaaaacaaaacgccatcaagtgtctgatgg aaaccgactcggtcacgttctggaatcaaacgatacaatcaaaaacgtct actgtcgttttggatcgaatttctatggagctcggttcgttggctcagaa tctgatttctggtcgtgattttgcggcgagttcgtctaaaatcatccagg tacttcaagttctcggcttgtcggacaccgtttctcgggaaatgtgtggt caattcattcttccatcgtccgtgtcaactaatgttgatggtaaggaatt cgatccttttaattctatctgaagattagttaaaagtggagtaccgtaat ctcctttttaagcccaacatgacccaacactactgaatttcgcaataaaa ctttttggaaatttctcagaaaaaagttatggcgattcaaagttctgcaa aaaaaagactcaatttcagctaaaatcacaacttttaccattttctcctt gtcgcagcttctcgaatttaataatataatctttcagggcaaagctcatt agacgcacaactaccggttcaacatttgccgtcaaaagaaatgaaagccg 
ttgacccaatcgaaaaatcaccaaatgataataacaatgaaactctcagc tcttctgagaaactcgaactcatgatcagaaacgcgctctgatcgaacat tcatactctaacacattcctcttcacatctccagatgattattcatgtcc atcattttatcattaaatatctcattctatactctcttctcgctttattg tttctcgctcactccctacccaccattacataacatctctgaaatttcaa agttttgacattcttggctgtgccttttcctctcaatgatattttccaaa ttccattatttttcccccgcctttgattgcttttattggttacttgttta ttggtatacttctcggtattctttttaatgttctgcatgtttcatatggt ataaattgcatattacgttagacacaaattactgcaaactaaactcagtt tgtctcaatggaaattcgtcgaaacacatggtgtcaagctgtcccattac ggtttgatttacaaaaaatgtagatcaaaacaaaatcggacagcccgaaa ctaggtgtaaatatacttataagaattcaaaaagaccgaataacataata aaacattcctaagaattttagattttctaaaatttccagtcatagttttg gcaacttgccgaatttttaaaaagtatgagcttttgagaggatgcagaat gtttttacacaaataattaaaaaaaaaaggaaagcataaaaattttagaa tttttttttcggtagacttccaaagttatgagttacaaaaaatgagtaat tgtcgctttttgacagtgcatttaaaaacatgcaatttaaaaaaaaactg tcagctaaggtgccgactgtcagtgacagtctgtcacttgttggtaattt tttaatagattctagcttacattggtattattctgttctaaattatttgc tcaaatgaatcatcattctcgctgtgttgtcattttatggtacttgtatt attatcattatttagataatgaatatacacatattggatagaacattttc caaaatcagaacaaagcctcattacaaacttcaaatttcatttttcaaaa ctttgaatagaaaataaatttgggtaaatagtcagtaatcacccaatccc ttaacataatatccacattatcgagctagtgaagctgtttctctggcagt gtccaacccacttcttcgtcttcacctcattccttttcaactccgcccct taaaggaagtactcgtccatagcgcataggaacaggcaagcgataatgtc tgtgtctctatattttcacgcactgtctagtgccgcatccgtatcctcta ggacaccggtgccgtggcgtctataaaagagagtacgggtgtcttacgca gttcgtatccgattttcagtccagtgtccaaggaagacaagccgaatgtc ccatataaaccgcattcttatctattcacccattccaccattatgttctt gctttgttcccatttctcgtttttcgttttttttctcttaaactttagat attactatgttaataacccattattttaggcagtcacaacctaaaaatga tggagacttcggagcacaaagagctccgacgtgtggcgtttttcgccatt gttgtatctactgtagctgttattgcagctattgtaattcttccaatgct ctattcatatgttgctggtttccagagccatcttatcattgaagctgatt tctgtaaggttagtataaaacaaacatacgtattctattatacaagcaac gcttttttagactcggtctcgtgacatgtgggcccaaatccatgacatag atggaccacacctattccatcgtcagaagcgtcaatactcttcaccaaac ccaccagctgccggtggatatggagctccagttacgaactccgagccagc 
tccaacttgctgctcttgccaacaaggaccagccggaccaccaggaccac ctggagatgacggaaatggcgggcaagatggtgttcgtggaaacgatgga actgacggaaaggaaggaagccttttggaaagtgctattgtaaatgaacc atgcattatctgccctccaggaccaccaggaccacaaggaatggcaggag ctaaggggccacaaggaccaaagggaggaaatggagataatggaccagat ggaaaggctggagccaacggaatgcaaggaccaccaggaatgatgggccc accaggaagacaaggagtaagtggaccaaagggagctccaggacgtatca atcaaatcaatggaccagctggaccagctggacataagggagtccgcgga ccaccaggaccacgcggagaagctggacttgatggtggaaactctgaagg accacaaggaccacaaggagatgctggaagaccaggaccagttggagagc aaggaccacaaggaccagaggtatatttatttttattcgatatattcaag gctttacatttattaatttcagggaccacaaggaccaccaggagaaccag gaggctgtgagcattgcccaattccaagaacaccaccaggatattgaacc tgtactttttctcattaatttcgaattcatccgcccaaataattgggtgt ttacaatacaatgaattttttcattttaattcacagattataaattgcaa aatttttcagtatttgtcttattattactggtacagagagtgtagatagt tagagagtgccaggcatccgggacccaatggggcacatcaaaggctccca tcgatcgatatgcctaacatgttgaaaaccgattaaaacctcacgtttga atcccctctaaaaactgaatgtgtgccaacacagcgtcattgacgcattt acggtgtcttgacgcgatacgcgttttcaatacgaggcaaactcaaattt attattttcattttcaaaatatcaatttgttgaaaactagcaactactac tcatctcttcactcgtcattatggttaaattgcgcgatgaacagaagaca gagcttataaaacaggaaaaaaagcgccgacgcattgctcggttacgaca ggtatcgatttttctttcagttctcaaaaatattaattattattgacttc aggttcgccagcaaagtgcagcaaatgcaaagattacaagagacgtggtg aatcaacggaagcaagagttcattcaagaaattaggttgactttttcaaa tttaaaattataattgaaaatttatattttcagggaagaattgcacgagc aggtggatgcgctaatgacagaagtaactgaaaaatcgttaaaattacct gtctccgcaaaaaggaaaacttctacgcctcgctcacttgtaagtgttcg ttaaaatgatatttgcaaaaaaaaaccacgaattttcagaaaaagtcttc tcgttgtcgtgaaatgacggaaagcgatctggagttggcgaaaaaaagaa atgcagatgcaatgaagcatcttcgagaagcgaagaaaaagaaggaaaag gagcaagaggaaaagttggcaaagaagaaagaagccgcacgaaaagctaa tgcaattatgagaggagctcagatattgtaataataaacttttttttcat ttatgagtatatattgagatcaaacaaagaaagtgacaagaaatcgatat ttttaaacacaaaaaaaattaaaattgaattcctaccgatcacaaatggg cgaaagttagatgaaattagttttcacaagtgtatcggttgcggccccca tagtttattcttcgtggtggtcgttcacaaacgtcaaacgtcaatttcaa gcaagaaatttcattttcataaagaagccatggcagatagtcgtctctga 
aatattttacaaatttttgaatttcttttccattgaaaagttgttatttt ccgctgaaaaattcaatttttaaaaaaaaaacaacattatttgcaaaaat taaacttttttatttgaaataaatttttttctgaaaatttgaaaaatgca aaaaattcaaaaatttaaaatttgaaacaaattttttttcgaaaatttca aatttccgtgtaaaattaaatgaaacatgttatttttcattgaaaaatta tattttgcgcgtagagcatgttgaattggagcacacttaagtgtgctcca aatttgctatttttttcttctagatgcaccggtgcaccatgttaaaaatg cacttttttggcaaaggggaactaatcgatacattttttaaaattagata ctgtgctaaaattaaactttttatttgaataaatttgacattacaaaaaa aaaattttaaaattttaaaattgaaaaaaaagtttttgaaattttttttt gaaaattttaaattttcgcgtaaaattatatgaaaaatatgtttttttct attaaaaaaaaccgttttgtgcgcgaagtatggcgaattggggcacactt tattattgtcacgatgtaccatgtttaaaatgtaaaaatcgatttgcata ataaaggtggagtagagtcttttaagaattttgattttaataaattaggc tgtagggactgaatataactataaacaatttaatacaaaatttctgaatt tttatgatttttccaatttcgcaaaaattcaaaaaactagtcttactatt tttgaattcccacgcaaattaatgatcattgttggtttttcttgtttttt tttctttaatattcaatttgatgtttcagttcaccaaagtttcaagacat ttctagctaattagcaatattaaagaaaaaacaagacaaaccaataatca ttaatttgcgtgggaattcaaaaataatttaatatgaaaatgactcactt atgccacaaatttttcctatattctatgtaccactggctaaacttgtcaa attggccattattaacatcccaggtacgacgacgctcctccagctccttg gctacggtctcttcaatttgttcttcttttagtactcgttttcttaaaac tgatccagttgcgttgaaggctaatcggattttggcaattactgagccct aaacgatttctaatttaattttcccgaatttttagaactgacgtatccaa cttgttgaagacttcctagggcactcttataatctcctctatcaccttga aggaaatccatctcacaatcaccatccgcatcaaatgctgaccaatcatt gaaacagaactcctaaattttcaagtcaccattttaataagtaatttttg aaaaagacagttggagaaattcaaaatctcacaattatggattttttttc gaaagcttgcagtttgcagaaaaattgctggacaatttcttgatcatttt caagaaatttattatcttaaaattcacatcaaatctctttgaattaagag gtgggcggcaaacgatttttccggcaaatcggcaaattgtcgaaattgaa atttccggcaaactgtcggaattggaatttcggtcaaaatcgatttgccg aatttgccgaaaattatcggaaaattgtgattttgtacttttttcttgga aatttcagaatttcaattttaatcggcaaaattgtacacatcctataaat gttgctacatctattctgaacagtaagcaaattatatgatattattaaag aaaacgtgaaaaaattttcaaaaaagcacagttttaagtttttccgtctt ttaaaaaatccctcgaaacatttccgacacatggcaaatcgacaatttgc caaaaatgaaaatgaacggcaaaacgaacggcaattgccgcccacccctg 
cttcaattttttaagtgtatacctttctgaaagtataaaacccaggtcta tgcggacaattgatcccacgcaattgcccagccaaactattagattggtc ttcttcattctgctccaaattggtcaaagattcacatacatcacaaaaat aacaattgtttccatagccaccacatccattattactcgcgttttgacac ttcatgttggaaaagtcctttacattccaccactgatattccgctttcgc aaaaaaactattgtcaagcgcttctggtagatctttaactttgacattct taatttccaaagtaaagcatcctgggaatcgaacaacgtctggctttata tgaacttggctcgggttaatatccaatattttttttgctccgggtcgtgc ttaaaaatagcttctttgtgaatcaaacaattatggataacaaaaaatgg aaaagtacatactgcaacttgcgtcagcccttccgggaaccagccaaatg tcctgatagttgcatttgtggttgagcttcttcctagcatcggtgtgtct gactccattcctatactttgcacgacatgcaggatccaatatatcgcagt caagttgggcaaaaatcgataatataaatgagaaaataatcagaatggat atgttgaaggtgtaattcgaagatcgcatactaaatgtgaatatgtgtat gtttaaaatggaattttcaatacaattcgaataaaaggaaagtaaaatat ttgtagaagcaaatcaatttctaatgataattgcgtcacagtgtgtcctc taaagtgtcttctttcttgtctgcattttctcttatgcctctctttgtaa caagattggccgtacatgcccctcttccgacgcacctaacctgttgacca tggagaccttggaggtcatcattttttttgtgcgaaatttggcatttagc aagagaagggatcccttatgggagagaaataaagagtaaagataggcaat tatgctttgtaccacgagaaaaaaaaactcatttgacacatgaaaatctt ttgaaaatggaacaatcttatactattcagtacatgtgctccatgtcgta aagcggttttttacagtttcttgaatgaaatctcacgtggtgtcaggctg tcccttcgctctactgcacatgaaaacttatgacgtcacagcgaacttcg aataattgtttttcaatttaagagccgattttcgtgaaattgttttatca ttttttgaagcaaaatgcaataaaaacacaattttattttaaacattata tttaaaaattatgaaaatcgaagcttaaattgaaaatcaattattcgaaa tgcgatgtgacgtcacaattattgcaaaaaaacatttttcccatctttcg tgtgcagtcaagcgatgggacaacttgacaccacgtgactatacgagttg gtgattagaatttcaaatacaaaaacaatttaggaaaatactctgaaatt aggaactttagcaaaaagaaacattttaaaactattgctcgaggagtaca cgagctgtggaaatcgacatattttccaatttattattacggcaacaaaa aattctgatacttaatgcatattgcacatcatatttgacgcgcaaaatat ctgtgtagcgaaaactacagtgactatttaaatgactactggagatcttg cgtcgattttcaaaagaatttctcagtgacagcgatattacattttcctt cgttttttttttgtattactgtctcatttaaattaaataatatattgctt tcaattcattaacagaaaatcgaacccgtaaatcgacacaagagctacag tagtcatttaaagagatactgtaattttcgttacgagatatttagcgcgt caaatatgttgtggaatacgcattctcaaaattttgtgtatatcccgtaa 
taattgcaaaaatacacttcaattttaagaaaatttgaaagttgttcata aattggcggcattttttttttgagaatcctacagacaaaaaaaacgtgaa attttaaagaagcggtgtcggtgaatagagacgaagagggattaggagat aaaaataattgatcgaagcagcagaaaaaccgattttcttcattttcttg cccatctctctctctgtctctttttatttcgaaaaaagacagtcagactg gatttgagtggaatagaggggaaagggagggacagttctacgaagcgaaa cgaaaacgattatccatctctgttggcagtctcatatggaaaaggttgaa aattgaaaatattcgggaacaagaagaacacaataatatttatttcgaca tacaaaaggattatatttcttttttcaacgaagaaaaagaagttgaagac aagaagagagagagttagtactctgacacgaaaagggtaaaacttacctg aaaaggaaattggaaaataggggggggggggggggggaataaaataatga ataaataaaaaacatttaatgctccatagatctatcgattttcgactgat gtttatgcttttgattgaatgattcgatgaactgtgaagatgattgtccg tttgactcggccttatatttttccggtaccgggacattgaggatttcagg taacatccatggtttaagccatccgactagaggtgattgtccaggttggt atacaaccgaattatgatagaagaggagtccatcgagacggaaactggaa caaatattaaaggtgtagtagggaaatttgactttttgagggaatttcag ggaatttgagtgcatttgcagacccaaaacggtccaaaactaccaagtta aattaaacgttgtgaaaatttctcaaaaaaatgttacagtatttcttcta tattctattcgaaaattaaattttttgaaattttgaaactttttgaaaca aagcaaaaagcgagaatttgaataaaccgatttcgcaatttttggttttt ttatttattattacctgatctcgtttttcataagctccgccatctgatct tgggagcacggacagctgggaatcggtgagaaaatattcctgaagccagg cgtagatttcgctagttcaggagcctcttcaagtttagactttagcataa attggcggaaatcgtatggactctcaacatattcgtgtgcattccatgaa agaagatcgagcacatagtatgtctgattcgagtagatgcagtcaagaat tgtccaagcctgatctataaagtgagcacattgaaatcgattttttattt atttctaaaaacacacatacttttagcacgagtatttccaccaggaaggc gagactgaaatcgcgatacttctcttccgcctttgttatatgcaactgtg aagccctgaaatacaagattatcccgggtttttaataattaaatcattaa aaacaaactcgtgatgcaacgacaagtgtccgtttgcccactggagccat aaccatcgtccagtcggaagaaagtgattccggaatatccacaagccact cactcaacatcattttgtcggcatatctttaaaaattgaaatatagtttt tttttagtttaagaattcgaacttttaggaaattacttactttttaaatt ttccagtgtgtttttcagcagtagttgtctcaattttttgttcatcgtct tcgtcagaagtaacgtcatcaaaagctaaatttcgtagtttcataaatgt gtcaaagcggccatttttttgccgctcaagcgtttcttcacgacgctttg cttgttgttctgcagcttttgtaagatttttgtactgcgaatacctcgga tgttcacctgaaaaaatcaatgtttactttttgtgtttaaaaaaggagtc 
tcaccaagagctagaggatcgacctgaaatccactgctaagctggtctgt caacgaatctaaatcgtcccccatttgtgatactggaaaaaataatttta aattagtttaatcaaacttttaactaaatgagtataagaaaaagctttta caacaattaaaaaactgtaaaagtcgaaaaagttgctctgttgatttacg ataaacatctctcgttgcgggatagtggagtgtcgttgggggaaaatatt ttttcgatttcgaaatcgatttcaacacattcacagcgagaaaaaacaat tgcatataatcgattttttatattatggcctgtgtagacatatcaccact atcatatgaatctgtaaaatccgttgtcaaatatttgagcctgaaaaaaa ggtatactatttaatattgatatttaaaaaaatactggttcattaaccgc acggttaattaaccagtatttattacgggaacacaaaattctgagaatac gtattgcgcaacatgtttatatctcgtagcgaaaaccacagtaattttgt aaatgactactgaacgcgtaaatcgccaccagcgctacagtagtcattta aagaattactgtgattttcgctacgagatattttgcgcgtcaaatatgtt gtgcaatacgcattctcagaattttgtgttcatgtaatttgaacacattg aaaaacaaaacattttcagggagcaaattaatttatcgattccggcatat cataaaacagccaaacttgcacctttcaaactagaaaacgttgggttatc agatttgtacattgaagtcaatgatagaacgtggactttgagtgaaactg acatcattggttgcactcttgttttgtatcgctcaaacggctacaacttc cagcacattatttcaataactcctgaagttgcgttccagaagcttatatc cgaatatttgagaaatggaacacgtattaaaaaatttcacttgtatgccc ttccaagtctgctcaaaaacgttgagcttcatgtgtcagagctgaaaatt gacggatgcgactatgaaacatttgcaaagttccgtaaattcattgatca taaccaaaaaatattaaaggagattgggtttgtggtgcgtcagaatactt tatggatgtttgatgaacaattggtacagaacacttcaagctttccgcct tcaacttttgataaatttcaggtgaaaaacagtaaggagatttatctcca gtttccgtattataatccggtcccgttggacgatatgcttcttcggctga aaaacaatcatgtacacattatctatctgggcttcccgattgaaaaacta caagaattggctatggtgagaaattttgtaatattatactttggccaagt ttggtgcggatctttttgattttccactcagaaatacggtacccggtcct gtcacgaacagaacaataaggtctgcgcttttaagtacgtggtgtcagag tgccccatattgatttgatctacgcagatctatgagaatcgcgggaattt agatgcagatttcttaactggttttgaatggttaagaacgtgctaacgtg aaattttttggaaaaaaattcccgcattttttgtcgatcaaaccgcaatg agacagccggacaccatgtgtttaagtaaagcacagtatttaacccacta ctagttaaaaaaaaaatataattcagaactggatcgacactaagaagcca attggtacctcactcttcctggttgttgagcattatacatatgttatcga cgtgtttgagcatttaaccaaatactctaaagctattccttcaagactca ctgatcttgggtaaggtcttagaactctataatatacaaataagcttgac tcattcagaacttccttcttttcacattgtgtaacaatgcatattgatga 
aagttctgaattggtgctatttggaggtccaatggtaattgaatggaaac cgttcaaatggacgttgagaatgacgatcatgaaacgcggatcgacaatt tcaaaatgatttggaatcctccaaaattctgattttcctcaataaacatt ttaatattttatgttgcaatgtgaatacaattttccttctctgtcaggat actccagaaggaatgtatggaatcttgatgttcagaacatcagcatccct tgatcaaacatcgaatctgttattacactaaagtatgtgttcagtgaaag tgctgtaaaagatgattaacgtcctcaagaagcaaatgctggccgatgac agtttttttgaagattcagctaatttcattagatattctagttgattcaa tatcttgaagaaattttttcgcgagatgatgatgttctttcaagttaaaa ttttgataagtttacgttgtcaaaaataacaacagaaaactacagctggg agtaacattgcatataaataacataagtattaatttataattacactata aaatagttttaaaattgcctcagaacatgtaaacagtctaaataattact ttttagattattgtagataacaagaaaacaagtacaaaatcagtaagtat catgtcattacacatttgctagaaccagtattttgaacacacttgcggac acaggtcaacacacttgttcttagcacccgcctacagcttagatgttccc tttaaaaggaagtacttacgtcgctacaagggcagatttctccctacttg gctatatctctttgttcaatgattaaacgtcttctctgctttctcttttt tggcttaactttttcagctccggtgccattacccctagaagaagatttta atgttttccatacaaaacaggaaaaccatggtacgttttctcatcttttt tgaaatttgtatcaattaacagagcaaagattccttcgtgttgatcacaa tagagacaaagtagtctcattcgacgaatttcttcacatggaacttgcat atgttgatgcaaagaaagaagagtttgatacgttagataagaaccgtaag tgatgtttaaagcgttgagaaattagcaaactactttagatgacgggaaa gtgagccttgccgagtacgaagagcatttccacgaagcctcaagcaagaa tgaaaaatctcggaccgcatactttgccaaagttttcgaagatttcgatg aagatttcaatatggcattgagtcgtgaagaacttgaacgtgtgcttgct gaacgattcttagtcaaaccaagagaaaactttcccaaattgtttttcaa gtttgacgtagacaagtccggaggacttgatctgactggtgagctaagtg ctcgttttatacatctcgaagtttttttttacagaatacatgaaatttga tgctgaattcccattcgatcaaaccgatccagttggtggaggaccatcca aatctaacaatcaccacgatcaaatgcacacagaagtccctcaagatgct gacgctgctgcaattgcagccgtgttggcacaagcttcaccaacactgaa taagagtccagtcggtgttgctccaccccaaacgtcacctggattgcatc cagtggcaccagggttattccagacggcggtgccaatcaaaaaagtttga ttctaacagtaacttttacaattgtaatttttaattttctctaatactgg tatcacaaatcactgtcattcaccatatcaattgtttttgtaaataaagc tttatagttaaaatattatatacaaaaatgcattgcacaaaaggctgaaa tcagtgaactttagtatcagaggttctcgacataatctcattatgtacat caatcttactcaatttttgaccttctttgctgtgacggataccataacaa 
atgtagattacaatgccaatcgccaaccatacaaaaagacgaatccaagt cattgagtttagatacaccatcatgaagacattgatgagtagaccaaggc atggaatgaatggaacaagtggcaccttaaacttttgaacttattatttt ttgtaaaatgtttctgttcaccttatacgtagatgttgacttattctgtt catggcccaaaataaaaacgaatgataacagtgaaaaggcagcaccgact gtcaataaaataataccaccagcattgctaaatattcctgttctgaatgg aattgccaggcaaatataaccaaaaattaatcctgcgacagctacgcgaa ttgaaatcccctctgagaaattttcccaaacaccttgaaatggtacccac gacttcaggcatccgccttaaaatttttattttgttaattttgctaaaca cgtgtgtgttgctttataaaatcttaccattatcataatcagttgcacta ccatctacaagatgggattgatgccgcaaaataataacacagattgagac cattgaataagccaacagagttccaattgacaaaaagtcgacaagcgcct gcagatcgaagaccagtgccaaaattgcgtttattatagtaaacacaata gttgcattgagaggggtttttgtttttgagtttatcaccccaaaccaccc gaagataagaccatcatcagccattgcatatacagctcgtggaagagcga atgatccagtaactagattgttcaacatacccgctagtgcaccaacactc attataatcttggcaacagttgctcctttcatttcgaatgcagcagcaaa cgcggcatcaggatcaactaagtcgtatggtatcatcagcgtcaatgagg cgcccataagtacatatatgacacttataattgccaaagaagtgaaagta gccagtgggattgttctatgtgggtttttagcttcttctccagcagtggc taaagcttcaaatccgataaatgcgaaaaagcatgtggaagcgccagaaa tagctccttgtattccatacgggaaaaattttgaacgaccatcctggtat gttcctgaccagagtgaaaagtctgcataggttagtccacaaataatcac aaacgccaaaacctgaatgaacattttgtaataggttttgacactttaac ataccgccagatttaaaaatacaaaactagtattcacgttggcagagaac tttgatcccattgcaaccgccactgctacaagaaacaataggaagaacgc taaaaagtcagggtacagtgcaaaaaatccttttccctgtaagttgaaac gcgtattagcatatctgattcaaaaatggcacgaacatcactcaatcgtc caacagtatccaatgtccagttggacacacttttagatacaagattgtcg aaataagcagaccacgaacgagcgacagcggcgtttccaatcatgtactc tagtggtactgtccatccgacaatgaatgcccaaatttcacccattccta cataactgtatgtgtatgcacttccagctcggggaaacctggaagtttat cagtttaaacatgtaaagttaaaaaataataaaataaaccttgctccaaa ttcagcatagctgaaagctgacaaaagtgcagcaaagccagagaaaataa aagatagtataattgctggtcctgcttgatttcgaacaacggatcccgtg agtacgtatattccagctccaatcatgtgtccaatcgcaatgaacattac gtcaagaattgttaaacatcgtttcatttgtgattcaagatgtgatcctc cgtcaaatgtctttttacgaaacaagacatcggctatttgatgtaccttc atggttgttttttctgaaattatatatttctgaaggtgaaaaaactattc 
atacaagaaagattaaaaatatgaaaaattcggtatacttttatatggta atattgttttagttttaagataaggcttctggaattgtgaatcaaaaatg
SeqIO.pm100644000766000024 6077314006427353 16052 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio
# BioPerl module for Bio::SeqIO
#
# Please direct questions and support issues to <bioperl-l@bioperl.org>
#
# Cared for by Ewan Birney <birney@ebi.ac.uk>
#       and Lincoln Stein  <lstein@cshl.org>
#
# Copyright Ewan Birney
#
# You may distribute this module under the same terms as perl itself
#
# _history
# October 18, 1999  Largely rewritten by Lincoln Stein

# POD documentation - main docs before the code

=head1 NAME

Bio::SeqIO - Handler for SeqIO Formats

=head1 SYNOPSIS

    use Bio::SeqIO;

    $in  = Bio::SeqIO->new(-file   => "inputfilename",
                           -format => 'Fasta');
    $out = Bio::SeqIO->new(-file   => ">outputfilename",
                           -format => 'EMBL');

    while ( my $seq = $in->next_seq() ) {
        $out->write_seq($seq);
    }

    # Now, to actually get at the sequence object, use the standard Bio::Seq
    # methods (look at Bio::Seq if you don't know what they are)

    use Bio::SeqIO;

    $in = Bio::SeqIO->new(-file   => "inputfilename",
                          -format => 'genbank');

    while ( my $seq = $in->next_seq() ) {
        print "Sequence ", $seq->id, " first 10 bases ",
              $seq->subseq(1,10), "\n";
    }

    # The SeqIO system does have a filehandle binding. Most people find this
    # a little confusing, but it does mean you can write the world's
    # smallest reformatter

    use Bio::SeqIO;

    $in  = Bio::SeqIO->newFh(-file   => "inputfilename",
                             -format => 'Fasta');
    $out = Bio::SeqIO->newFh(-format => 'EMBL');

    # World's shortest Fasta<->EMBL format converter:
    print $out $_ while <$in>;

=head1 DESCRIPTION

Bio::SeqIO is a handler module for the formats in the SeqIO set (eg,
Bio::SeqIO::fasta). It is the officially sanctioned way of getting at
the format objects, which most people should use.

The Bio::SeqIO system can be thought of like biological file handles.
They are attached to filehandles with smart formatting rules (eg,
genbank format, or EMBL format, or binary trace file format) and
can either read or write sequence objects (Bio::Seq objects, or
more correctly, Bio::SeqI implementing objects, of which Bio::Seq is
one such object). If you want to know what to do with a Bio::Seq
object, read L<Bio::Seq>.

The idea is that you request a stream object for a particular format.
All the stream objects have a notion of an internal file that is read
from or written to. A particular SeqIO object instance is configured
for either input or output. A specific example of a stream object is
the Bio::SeqIO::fasta object.

Each stream object has functions

    $stream->next_seq();

and

    $stream->write_seq($seq);

As an added bonus, you can recover a filehandle that is tied to the
SeqIO object, allowing you to use the standard E<lt>E<gt> and print
operations to read and write sequence objects:

    use Bio::SeqIO;

    $stream = Bio::SeqIO->newFh(-format => 'Fasta',
                                -fh     => \*ARGV);
    # read from standard input or the input filenames

    while ( $seq = <$stream> ) {
        # do something with $seq
    }

and

    print $stream $seq; # when stream is in output mode

This makes the simplest ever reformatter

    #!/usr/bin/perl
    use strict;
    my $format1 = shift;
    my $format2 = shift || die
        "Usage: reformat format1 format2 < input > output";

    use Bio::SeqIO;

    my $in  = Bio::SeqIO->newFh(-format => $format1, -fh => \*ARGV );
    my $out = Bio::SeqIO->newFh(-format => $format2 );
    # Note: you might want to quote -format to keep older
    # perls from complaining.

    print $out $_ while <$in>;

=head1 CONSTRUCTORS

=head2 Bio::SeqIO-E<gt>new()

    $seqIO = Bio::SeqIO->new(-file   => 'seqs.fasta', -format => $format);
    $seqIO = Bio::SeqIO->new(-fh     => \*FILEHANDLE, -format => $format);
    $seqIO = Bio::SeqIO->new(-string => $string     , -format => $format);
    $seqIO = Bio::SeqIO->new(-format => $format);

The new() class method constructs a new Bio::SeqIO object. The
returned object can be used to retrieve or print Seq objects.
new() accepts the following parameters:

=over 5

=item -file

A file path to be opened for reading or writing. The usual Perl
conventions apply:

    'file'       # open file for reading
    '>file'      # open file for writing
    '>>file'     # open file for appending
    '+<file'     # open file read/write

=item -fh

You may use new() with an opened filehandle, provided as a glob
reference. For example, to read from STDIN:

    my $seqIO = Bio::SeqIO->new(-fh => \*STDIN);

A string filehandle is handy if you want to modify the output in
memory before printing it out. The following program reads in EMBL
formatted entries from a file and prints them out in fasta format
with some HTML tags:

    use Bio::SeqIO;
    use IO::String;
    my $in = Bio::SeqIO->new(-file   => "emblfile",
                             -format => 'EMBL');
    while ( my $seq = $in->next_seq() ) {
        # start with an empty buffer and a fresh output handle
        # for every entry
        my $string   = '';
        my $stringio = IO::String->new($string);
        my $out = Bio::SeqIO->new(-fh     => $stringio,
                                  -format => 'fasta');
        # output goes into $string
        $out->write_seq($seq);
        # modify $string
        $string =~ s|(>)(\w+)|$1<font color="Red">$2</font>|g;
        # print into STDOUT
        print $string;
    }

Filehandles can also be used to read from or write to a piped command:

    use Bio::SeqIO;
    # convert .fastq.gz to .fasta
    open my $zcat, 'zcat seq.fastq.gz |' or die $!;
    my $in  = Bio::SeqIO->new(-fh   => $zcat,
                              -format => 'fastq');
    my $out = Bio::SeqIO->new(-file => '>seq.fasta',
                              -format => 'fasta');
    while (my $seq = $in->next_seq) {
        $out->write_seq($seq)
    }

=item -string

A string to read the sequences from. For example:

    my $string = ">seq1\nACGCTAGCTAGC\n";
    my $seqIO  = Bio::SeqIO->new(-string => $string);

=item -format

Specify the format of the file. Supported formats include fasta,
genbank, embl, swiss (SwissProt), Entrez Gene and tracefile formats
such as abi (ABI) and scf. There are many more; for a complete
listing see the SeqIO HOWTO.

If no format is specified and a filename is given then the module will
attempt to deduce the format from the filename suffix. If there is no
suffix that Bioperl understands then it will attempt to guess the
format based on file content. If this is unsuccessful then SeqIO will
throw a fatal error.

The format name is case-insensitive: 'FASTA', 'Fasta' and 'fasta' are
all valid.

=item -alphabet

Sets the alphabet ('dna', 'rna', or 'protein'). When the alphabet is
set then Bioperl will not attempt to guess what the alphabet is. This
may be important because Bioperl does not always guess correctly.

=item -flush

By default, all files (or filehandles) opened for writing sequences
will be flushed after each write_seq() (making the file immediately
usable). If you do not need this facility and would like to marginally
improve the efficiency of writing multiple sequences to the same file
(or filehandle), pass the -flush option '0' or any other value that
evaluates as defined but false:

    my $gb = Bio::SeqIO->new(-file   => "<gball.gbank",
                             -format => "gb");
    my $fa = Bio::SeqIO->new(-file   => ">gball.fa",
                             -format => "fasta",
                             -flush  => 0); # go as fast as we can!
    while($seq = $gb->next_seq) { $fa->write_seq($seq) }

=item -seqfactory

Provide a Bio::Factory::SequenceFactoryI object. See the sequence_factory() method.

=item -locfactory

Provide a Bio::Factory::LocationFactoryI object. See the location_factory() method.

=item -objbuilder

Provide a Bio::Factory::ObjectBuilderI object. See the object_builder() method.

=back

=head2 Bio::SeqIO-E<gt>newFh()

    $fh = Bio::SeqIO->newFh(-fh => \*FILEHANDLE, -format => $format);
    $fh = Bio::SeqIO->newFh(-format => $format);
    # etc.

This constructor behaves like new(), but returns a tied filehandle
rather than a Bio::SeqIO object. You can read sequences from this
object using the familiar E<lt>E<gt> operator, and write to it using
print(). The usual array and $_ semantics work. For example, you can
read all sequence objects into an array like this:

    @sequences = <$fh>;

Other operations, such as read(), sysread(), write(), close(), and
printf() are not supported.

=head1 OBJECT METHODS

See below for more detailed summaries. The main methods are:

=head2 $sequence = $seqIO-E<gt>next_seq()

Fetch the next sequence from the stream, or nothing if there are no
more sequences.
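
As a minimal sketch of the read loop described above, the following reads every record from an in-memory FASTA string and collects each identifier with its sequence length. The helper name fasta_lengths() is hypothetical and not part of BioPerl; the sketch assumes BioPerl is installed and uses only the -string and -format parameters documented above together with the standard display_id() and length() sequence methods.

```perl
use strict;
use warnings;
use Bio::SeqIO;

# Hypothetical helper (not part of BioPerl): read every record from a
# FASTA-formatted string and return a list of [id, length] pairs.
sub fasta_lengths {
    my ($fasta) = @_;

    # -string reads from memory instead of from a file or filehandle
    my $in = Bio::SeqIO->new(-string => $fasta, -format => 'fasta');

    my @records;
    # next_seq() returns a false value once the stream is exhausted,
    # which ends the loop
    while ( my $seq = $in->next_seq ) {
        push @records, [ $seq->display_id, $seq->length ];
    }
    return @records;
}

# Example input: two short records
my $fasta = ">seq1\nACGCTAGCTAGC\n>seq2\nGGCC\n";
printf "%s\t%d\n", @$_ for fasta_lengths($fasta);
```

Testing the return value of next_seq() in the loop condition is the usual idiom, because the loop then stops cleanly at the end of the stream without a separate end-of-file check.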
=head2 $seqIO-E<gt>write_seq($sequence [,$another_sequence,...])

Write the specified sequence(s) to the stream.

=head2 TIEHANDLE(), READLINE(), PRINT()

These provide the tie interface. See L<perltie> for more details.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

bioperl-l@bioperl.org

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Ewan Birney, Lincoln Stein

Email birney@ebi.ac.uk
      lstein@cshl.org

=head1 APPENDIX

The rest of the documentation details each of the object
methods. Internal methods are usually preceded with a _

=cut

#' Let the code begin...
package Bio::SeqIO; $Bio::SeqIO::VERSION = '1.7.8'; use strict; use warnings; use Bio::Factory::FTLocationFactory; use Bio::Seq::SeqBuilder; use Bio::Tools::GuessSeqFormat; use Symbol; use parent qw(Bio::Root::Root Bio::Root::IO Bio::Factory::SequenceStreamI); my %valid_alphabet_cache; =head2 new Title : new Usage : $stream = Bio::SeqIO->new(-file => 'sequences.fasta', -format => 'fasta'); Function: Returns a new sequence stream Returns : A Bio::SeqIO stream initialised with the appropriate format Args : Named parameters indicating where to read the sequences from or to write them to: -file => filename, OR -fh => filehandle to attach to, OR -string => string Additional arguments, all with reasonable defaults: -format => format of the sequences, usually auto-detected -alphabet => 'dna', 'rna', or 'protein' -flush => 0 or 1 (default: flush filehandles after each write) -seqfactory => sequence factory -locfactory => location factory -objbuilder => object builder See L =cut my $entry = 0; sub new { my ($caller, @args) = @_; my $class = ref($caller) || $caller; # or do we want to call SUPER on an object if $caller is an # object? if( $class =~ /Bio::SeqIO::(\S+)/ ) { my ($self) = $class->SUPER::new(@args); $self->_initialize(@args); return $self; } else { my %params = @args; @params{ map { lc $_ } keys %params } = values %params; # lowercase keys unless( defined $params{-file} || defined $params{-fh} || defined $params{-string} ) { $class->throw("file argument provided, but with an undefined value") if exists $params{'-file'}; $class->throw("fh argument provided, but with an undefined value") if exists $params{'-fh'}; $class->throw("string argument provided, but with an undefined value") if exists($params{'-string'}); # $class->throw("No file, fh, or string argument provided"); # neither defined } # Determine or guess sequence format and variant my $format = $params{'-format'}; if (! 
$format ) { if ($params{-file}) { # Guess from filename extension, and then from file content $format = $class->_guess_format( $params{-file} ) || Bio::Tools::GuessSeqFormat->new(-file => $params{-file} )->guess; } elsif ($params{-fh}) { # Guess from filehandle content $format = Bio::Tools::GuessSeqFormat->new(-fh => $params{-fh} )->guess; } elsif ($params{-string}) { # Guess from string content $format = Bio::Tools::GuessSeqFormat->new(-text => $params{-string})->guess; } } # changed 1-3-11; no need to print out an empty string (only way this # exception is triggered) - cjfields $class->throw("Could not guess format from file, filehandle or string") if not $format; $format = "\L$format"; # normalize capitalization to lower case if ($format =~ /-/) { ($format, my $variant) = split('-', $format, 2); $params{-variant} = $variant; } return unless( $class->_load_format_module($format) ); return "Bio::SeqIO::$format"->new(%params); } } =head2 newFh Title : newFh Usage : $fh = Bio::SeqIO->newFh(-file=>$filename,-format=>'Format') Function: Does a new() followed by an fh() Example : $fh = Bio::SeqIO->newFh(-file=>$filename,-format=>'Format') $sequence = <$fh>; # read a sequence object print $fh $sequence; # write a sequence object Returns : filehandle tied to the Bio::SeqIO::Fh class Args : See L =cut sub newFh { my $class = shift; return unless my $self = $class->new(@_); return $self->fh; } =head2 fh Title : fh Usage : $obj->fh Function: Get or set the IO filehandle Example : $fh = $obj->fh; # make a tied filehandle $sequence = <$fh>; # read a sequence object print $fh $sequence; # write a sequence object Returns : filehandle tied to Bio::SeqIO class Args : none =cut sub fh { my $self = shift; my $class = ref($self) || $self; my $s = Symbol::gensym; tie $$s,$class,$self; return $s; } # _initialize is chained for all SeqIO classes sub _initialize { my($self, @args) = @_; # flush is initialized by the Root::IO init my ($seqfact,$locfact,$objbuilder, $alphabet) = 
$self->_rearrange([qw(SEQFACTORY LOCFACTORY OBJBUILDER ALPHABET) ], @args); $locfact = Bio::Factory::FTLocationFactory->new(-verbose => $self->verbose) if ! $locfact; $objbuilder = Bio::Seq::SeqBuilder->new(-verbose => $self->verbose) unless $objbuilder; $self->sequence_builder($objbuilder); $self->location_factory($locfact); # note that this should come last because it propagates the sequence # factory to the sequence builder $seqfact && $self->sequence_factory($seqfact); #bug 2160 $alphabet && $self->alphabet($alphabet); # initialize the IO part $self->_initialize_io(@args); } =head2 next_seq Title : next_seq Usage : $seq = stream->next_seq Function: Reads the next sequence object from the stream and returns it. Certain driver modules may encounter entries in the stream that are either misformatted or that use syntax not yet understood by the driver. If such an incident is recoverable, e.g., by dismissing a feature of a feature table or some other non-mandatory part of an entry, the driver will issue a warning. In the case of a non-recoverable situation an exception will be thrown. Do not assume that you can resume parsing the same stream after catching the exception. Note that you can always turn recoverable errors into exceptions by calling $stream->verbose(2). Returns : a Bio::Seq sequence object, or nothing if no more sequences are available Args : none See L, L, L =cut sub next_seq { my ($self, $seq) = @_; $self->throw("Sorry, you cannot read from a generic Bio::SeqIO object."); } =head2 write_seq Title : write_seq Usage : $stream->write_seq($seq) Function: writes the $seq object into the stream Returns : 1 for success and 0 for error Args : Bio::Seq object =cut sub write_seq { my ($self, $seq) = @_; $self->throw("Sorry, you cannot write to a generic Bio::SeqIO object."); } =head2 format Title : format Usage : $format = $stream->format() Function: Get the sequence format Returns : sequence format, e.g. 
fasta, fastq Args : none =cut # format() method inherited from Bio::Root::IO =head2 alphabet Title : alphabet Usage : $self->alphabet($newval) Function: Set/get the molecule type for the Seq objects to be created. Example : $seqio->alphabet('protein') Returns : value of alphabet: 'dna', 'rna', or 'protein' Args : newvalue (optional) Throws : Exception if the argument is not one of 'dna', 'rna', or 'protein' =cut sub alphabet { my ($self, $value) = @_; if ( defined $value) { $value = lc $value; unless ($valid_alphabet_cache{$value}) { # instead of hard-coding the allowed values once more, we check by # creating a dummy sequence object eval { require Bio::PrimarySeq; my $seq = Bio::PrimarySeq->new( -verbose => $self->verbose, -alphabet => $value ); }; if ($@) { $self->throw("Invalid alphabet: $value\n. See Bio::PrimarySeq for allowed values."); } $valid_alphabet_cache{$value} = 1; } $self->{'alphabet'} = $value; } return $self->{'alphabet'}; } =head2 _load_format_module Title : _load_format_module Usage : *INTERNAL SeqIO stuff* Function: Loads up (like use) a module at run time on demand Example : Returns : Args : =cut sub _load_format_module { my ($self, $format) = @_; my $module = "Bio::SeqIO::" . $format; my $ok; eval { $ok = $self->_load_module($module); }; if ( $@ ) { print STDERR <_filehandle($newval) Function: This method is deprecated. Call _fh() instead. 
Example : Returns : value of _filehandle Args : newvalue (optional) =cut sub _filehandle { my ($self,@args) = @_; return $self->_fh(@args); } =head2 _guess_format Title : _guess_format Usage : $obj->_guess_format($filename) Function: guess format based on file suffix Example : Returns : guessed format of filename (lower case) Args : Notes : formats that _filehandle() will guess include fasta, genbank, scf, pir, embl, raw, gcg, ace, bsml, swissprot, fastq and phd/phred =cut sub _guess_format { my $class = shift; return unless $_ = shift; return 'abi' if /\.ab[i1]$/i; return 'ace' if /\.ace$/i; return 'alf' if /\.alf$/i; return 'bsml' if /\.(bsm|bsml)$/i; return 'ctf' if /\.ctf$/i; return 'embl' if /\.(embl|ebl|emb|dat)$/i; return 'entrezgene' if /\.asn$/i; return 'exp' if /\.exp$/i; return 'fasta' if /\.(fasta|fast|fas|seq|fa|fsa|nt|aa|fna|faa)$/i; return 'fastq' if /\.fastq$/i; return 'gcg' if /\.gcg$/i; return 'genbank' if /\.(gb|gbank|genbank|gbk|gbs)$/i; return 'phd' if /\.(phd|phred)$/i; return 'pir' if /\.pir$/i; return 'pln' if /\.pln$/i; return 'qual' if /\.qual$/i; return 'raw' if /\.txt$/i; return 'scf' if /\.scf$/i; # from Strider 1.4 Release Notes: The file name extensions used by # Strider 1.4 are ".xdna", ".xdgn", ".xrna" and ".xprt" for DNA, # DNA Degenerate, RNA and Protein Sequence Files, respectively return 'strider' if /\.(xdna|xdgn|xrna|xprt)$/i; return 'swiss' if /\.(swiss|sp)$/i; return 'ztr' if /\.ztr$/i; } sub DESTROY { my $self = shift; $self->close(); } sub TIEHANDLE { my ($class,$val) = @_; return bless {'seqio' => $val}, $class; } sub READLINE { my $self = shift; return $self->{'seqio'}->next_seq() || undef unless wantarray; my (@list, $obj); push @list, $obj while $obj = $self->{'seqio'}->next_seq(); return @list; } sub PRINT { my $self = shift; $self->{'seqio'}->write_seq(@_); } =head2 sequence_factory Title : sequence_factory Usage : $seqio->sequence_factory($seqfactory) Function: Get/Set the Bio::Factory::SequenceFactoryI Returns : 
Bio::Factory::SequenceFactoryI
 Args    : [optional] Bio::Factory::SequenceFactoryI

=cut

sub sequence_factory {
    my ($self, $obj) = @_;
    if( defined $obj ) {
        if( ! ref($obj) || ! $obj->isa('Bio::Factory::SequenceFactoryI') ) {
            $self->throw("Must provide a valid Bio::Factory::SequenceFactoryI object to ".ref($self)."::sequence_factory()");
        }
        $self->{'_seqio_seqfactory'} = $obj;
        my $builder = $self->sequence_builder();
        if($builder && $builder->can('sequence_factory') &&
           (! $builder->sequence_factory())) {
            $builder->sequence_factory($obj);
        }
    }
    $self->{'_seqio_seqfactory'};
}

=head2 object_factory

 Title   : object_factory
 Usage   : $obj->object_factory($newval)
 Function: This is an alias to sequence_factory with a more generic name.
 Example :
 Returns : value of object_factory (a scalar)
 Args    : on set, new value (a scalar or undef, optional)

=cut

sub object_factory{
    return shift->sequence_factory(@_);
}

=head2 sequence_builder

 Title   : sequence_builder
 Usage   : $seqio->sequence_builder($objbuilder)
 Function: Get/Set the Bio::Factory::ObjectBuilderI used to build sequence
           objects. This applies to rich sequence formats only, e.g. genbank
           but not fasta.

           If you do not set the sequence object builder yourself, it
           will in fact be an instance of L<Bio::Seq::SeqBuilder>, and
           you may use all methods documented there to configure it.

 Returns : a Bio::Factory::ObjectBuilderI compliant object
 Args    : [optional] a Bio::Factory::ObjectBuilderI compliant object

=cut

sub sequence_builder {
    my ($self, $obj) = @_;
    if( defined $obj ) {
        if( ! ref($obj) || !
$obj->isa('Bio::Factory::ObjectBuilderI') ) {
            $self->throw("Must provide a valid Bio::Factory::ObjectBuilderI object to ".ref($self)."::sequence_builder()");
        }
        $self->{'_object_builder'} = $obj;
    }
    $self->{'_object_builder'};
}

=head2 location_factory

 Title   : location_factory
 Usage   : $seqio->location_factory($locfactory)
 Function: Get/Set the Bio::Factory::LocationFactoryI object to be used for
           location string parsing
 Returns : a Bio::Factory::LocationFactoryI implementing object
 Args    : [optional] on set, a Bio::Factory::LocationFactoryI implementing
           object.

=cut

sub location_factory {
    my ($self,$obj) = @_;
    if( defined $obj ) {
        if( ! ref($obj) || ! $obj->isa('Bio::Factory::LocationFactoryI') ) {
            $self->throw("Must provide a valid Bio::Factory::LocationFactoryI" .
                         " object to ".ref($self)."->location_factory()");
        }
        $self->{'_seqio_locfactory'} = $obj;
    }
    $self->{'_seqio_locfactory'};
}

1;

#
# BioPerl module for Bio::Range
#
# Please direct questions and support issues to <bioperl-l@bioperl.org>
#
# Cared for by Heikki Lehvaslaiho
#
# Copyright Matthew Pocock
#
# You may distribute this module under the same terms as perl itself
#
# POD documentation - main docs before the code
#

=head1 NAME

Bio::Range - Pure perl RangeI implementation

=head1 SYNOPSIS

  $range = Bio::Range->new(-start=>10, -end=>30, -strand=>+1);
  $r2 = Bio::Range->new(-start=>15, -end=>200, -strand=>+1);

  print join(', ', $range->union($r2)), "\n";
  print join(', ', $range->intersection($r2)), "\n";

  print $range->overlaps($r2), "\n";
  print $range->contains($r2), "\n";

=head1 DESCRIPTION

This provides a pure perl implementation of the BioPerl range
interface.

Ranges are modeled as having (start, end, length, strand). They use
Bio-coordinates - all points E<gt>= start and E<lt>= end are within the
range. End is always greater than or equal to start, and length is
greater than or equal to 1.
The behaviour of a range is undefined if ranges with
negative numbers or zero are used.

So, in summary:

  length = end - start + 1
  end >= start
  strand = (-1 | 0 | +1)

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Heikki Lehvaslaiho

Email heikki-at-bioperl-dot-org

=head1 APPENDIX

The rest of the documentation details each of the object
methods.
Internal methods are usually preceded with a _

=cut

package Bio::Range;
$Bio::Range::VERSION = '1.7.8';
use strict;
use Carp;
use integer;

use base qw(Bio::Root::Root Bio::RangeI);

=head1 Constructors

=head2 new

  Title   : new
  Usage   : $range = Bio::Range->new(-start => 100, -end=> 200, -strand => +1);
  Function: generates a new Bio::Range
  Returns : a new range
  Args    : -strand (defaults to 0) and any two of (-start, -end, -length),
            the third will be calculated

=cut

sub new {
  my ($caller, @args) = @_;
  my $self = $caller->SUPER::new(@args);
  my ($strand, $start, $end, $length) =
      $self->_rearrange([qw(STRAND
                            START
                            END
                            LENGTH
                            )],@args);
  $self->strand($strand || 0);

  if(defined $start ) {
      $self->start($start);
      if(defined $end) {
          $self->end($end);
      } elsif(defined $length) {
          $self->end($self->start()+ $length - 1);
      }
  } elsif(defined $end && defined $length ) {
      $self->end($end);
      $self->start($self->end() - $length + 1);
  }
  return $self;
}

=head2 unions

 Title   : unions
 Usage   : @unions = Bio::Range->unions(@ranges);
 Function: generate a list of non-intersecting Bio::Range objects
           from a list of Bio::Range objects which may intersect
 Returns : a list of Bio::Range objects
 Args    : a list of Bio::Range objects

=cut

sub unions {
  my ($class,@i) = @_;

  my $i = 0;
  my %i = map { $i++ => $_ } @i;

  my $lastsize = scalar(keys %i);

  do {

    foreach my $j (sort { $i{$a}->start <=> $i{$b}->start } keys %i){
      foreach my $k (sort { $i{$a}->start <=> $i{$b}->start } keys %i){

        #it may have been replaced by a union under the key of
        #the overlapping range, we are altering the hash in-place
        next unless $i{$j};

        next if $i{$k}->end   < $i{$j}->start;
        last if $i{$k}->start > $i{$j}->end;

        if($i{$j}->overlaps($i{$k})){
          my($start,$end,$strand) = $i{$j}->union($i{$k});
          delete($i{$k});
          $i{$j} = Bio::Range->new( -start => $start , -end => $end , -strand => $strand );
        }
      }
    }

    goto DONE if scalar(keys %i) == $lastsize;
    $lastsize = scalar(keys %i);

    #warn $lastsize;

  } while(1);

  DONE:

  return values %i;
}

=head1 Member variable
access These methods let you get at and set the member variables =head2 start Title : start Function : return or set the start co-ordinate Example : $s = $range->start(); $range->start(7); Returns : the value of the start co-ordinate Args : optionally, the new start co-ordinate Overrides: Bio::RangeI::start =cut sub start { my ($self,$value) = @_; if( defined $value) { $self->throw("'$value' is not an integer.\n") unless $value =~ /^[-+]?\d+$/; $self->{'start'} = $value; } return $self->{'start'}; } =head2 end Title : end Function : return or set the end co-ordinate Example : $e = $range->end(); $range->end(2000); Returns : the value of the end co-ordinate Args : optionally, the new end co-ordinate Overrides: Bio::RangeI::end =cut sub end { my ($self,$value) = @_; if( defined $value) { $self->throw("'$value' is not an integer.\n") unless $value =~ /^[-+]?\d+$/; $self->{'end'} = $value; } return $self->{'end'}; } =head2 strand Title : strand Function : return or set the strandedness Example : $st = $range->strand(); $range->strand(-1); Returns : the value of the strandedness (-1, 0 or 1) Args : optionally, the new strand - (-1, 0, 1) or (-, ., +). Overrides: Bio::RangeI::strand =cut { my %VALID_STRAND = ( -1 => -1, 0 => 0, 1 => 1, '+' => 1, '-' => -1, '.' 
=> 0 );

    sub strand {
        my $self = shift;
        if(@_) {
            my $val = shift;
            if (exists $VALID_STRAND{$val}) {
                $self->{'strand'} = $VALID_STRAND{$val};
            } else {
                $self->throw("Invalid strand: $val");
            }
        }
        return $self->{'strand'};
    }
}

=head2 length

  Title    : length
  Function : returns the length of this range
  Example  : $length = $range->length();
  Returns  : the length of this range, equal to end - start + 1
  Args     : if you attempt to set the length an exception will be thrown

  Overrides: Bio::RangeI::length

=cut

sub length {
  my $self = shift;
  if(@_) {
    confess ref($self), "->length() is read-only";
  }
  return $self->end() - $self->start() + 1;
}

=head2 toString

  Title   : toString
  Function: stringifies this range
  Example : print $range->toString(), "\n";
  Returns : a string representation of this range

=cut

sub toString {
  my $self = shift;
  return "(${\$self->start}, ${\$self->end}) strand=${\$self->strand}";
}

=head1 Boolean Methods

These methods return true or false.

 $range->overlaps($otherRange) && print "Ranges overlap\n";

=head2 overlaps

  Title    : overlaps
  Usage    : if($r1->overlaps($r2)) { do stuff }
  Function : tests if $r2 overlaps $r1
  Args     : a range to test for overlap with
  Returns  : true if the ranges overlap, false otherwise
  Inherited: Bio::RangeI

=head2 contains

  Title    : contains
  Usage    : if($r1->contains($r2)) { do stuff }
  Function : tests whether $r1 totally contains $r2
  Args     : a range to test for being contained
  Returns  : true if the argument is totally contained within this range
  Inherited: Bio::RangeI

=head2 equals

  Title    : equals
  Usage    : if($r1->equals($r2))
  Function : test whether $r1 has the same start, end, length as $r2
  Args     : a range to test for equality
  Returns  : true if they are describing the same range
  Inherited: Bio::RangeI

=head1 Geometrical methods

These methods do things to the geometry of ranges, and return
triplets (start, end, strand) from which new ranges could be built.
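
As a sketch of building a new range from such a triplet (the
coordinates below are arbitrary illustration values), union() can be
called in list context and its result passed straight back to new():

  use Bio::Range;

  my $r1 = Bio::Range->new(-start => 10, -end => 30,  -strand => +1);
  my $r2 = Bio::Range->new(-start => 15, -end => 200, -strand => +1);

  # list context gives the (start, end, strand) triplet
  my ($start, $end, $strand) = $r1->union($r2);

  # build a new range object from the triplet
  my $merged = Bio::Range->new(-start  => $start,
                               -end    => $end,
                               -strand => $strand);
  print $merged->toString, "\n";

The same pattern applies to intersection(), except that it returns
nothing when the ranges do not overlap, so check the result before
constructing a range from it.
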
=head2 intersection

  Title    : intersection
  Usage    : ($start, $stop, $strand) = $r1->intersection($r2)
  Function : gives the range that is contained by both ranges
  Args     : a range to compare this one to
  Returns  : nothing if they do not overlap, or the range that they do overlap
  Inherited: Bio::RangeI::intersection

=cut

=head2 union

  Title    : union
  Usage    : ($start, $stop, $strand) = $r1->union($r2);
           : ($start, $stop, $strand) = Bio::Range->union(@ranges);
  Function : finds the minimal range that contains all of the ranges
  Args     : a range or list of ranges
  Returns  : the range containing all of the ranges
  Inherited: Bio::RangeI::union

=cut

1;

#
# BioPerl module for Bio::Taxon
#
# Please direct questions and support issues to <bioperl-l@bioperl.org>
#
# Cared for by Sendu Bala
#
# Copyright Sendu Bala, based heavily on a module by Jason Stajich
#
# You may distribute this module under the same terms as perl itself

# POD documentation - main docs before the code

=head1 NAME

Bio::Taxon - A node in a represented taxonomy

=head1 SYNOPSIS

  use Bio::Taxon;

  # Typically you will get a Taxon from a Bio::DB::Taxonomy object
  # but here is how you initialize one
  my $taxon = Bio::Taxon->new(-name      => $name,
                              -id        => $id,
                              -rank      => $rank,
                              -division  => $div);

  # Get one from a database
  my $dbh = Bio::DB::Taxonomy->new(-source   => 'flatfile',
                                   -directory=> '/tmp',
                                   -nodesfile=> '/path/to/nodes.dmp',
                                   -namesfile=> '/path/to/names.dmp');
  my $human = $dbh->get_taxon(-name => 'Homo sapiens');
  $human = $dbh->get_taxon(-taxonid => '9606');

  print "id is ", $human->id, "\n"; # 9606
  print "rank is ", $human->rank, "\n"; # species
  print "scientific name is ", $human->scientific_name, "\n"; # Homo sapiens
  print "division is ", $human->division, "\n"; # Primates

  my $mouse = $dbh->get_taxon(-name => 'Mus musculus');

  # You can quickly make your own lineages with the list database
  my @ranks = qw(superkingdom class genus species);
  my
@h_lineage = ('Eukaryota', 'Mammalia', 'Homo', 'Homo sapiens'); my $list_dbh = Bio::DB::Taxonomy->new(-source => 'list', -names => \@h_lineage, -ranks => \@ranks); $human = $list_dbh->get_taxon(-name => 'Homo sapiens'); my @names = $human->common_names; # @names is empty $human->common_names('woman'); @names = $human->common_names; # @names contains woman # You can switch to another database when you need more information my $entrez_dbh = Bio::DB::Taxonomy->new(-source => 'entrez'); $human->db_handle($entrez_dbh); @names = $human->common_names; # @names contains woman, human, man # Since Bio::Taxon implements Bio::Tree::NodeI, we have access to those # methods (and can manually create our own taxa and taxonomy without the use # of any database) my $homo = $human->ancestor; # Though be careful with each_Descendent - unless you add_Descendent() # yourself, you won't get an answer because unlike for ancestor(), Bio::Taxon # does not ask the database for the answer. You can ask the database yourself # using the same method: ($human) = $homo->db_handle->each_Descendent($homo); # We can also take advantage of Bio::Tree::Tree* methods: # a) some methods are available with just an empty tree object use Bio::Tree::Tree; my $tree_functions = Bio::Tree::Tree->new(); my @lineage = $tree_functions->get_lineage_nodes($human); my $lineage = $tree_functions->get_lineage_string($human); my $lca = $tree_functions->get_lca($human, $mouse); # b) for other methods, create a tree using your Taxon object my $tree = Bio::Tree::Tree->new(-node => $human); my @taxa = $tree->get_nodes; $homo = $tree->find_node(-rank => 'genus'); # Normally you can't get the lca of a list-database derived Taxon and an # entrez or flatfile-derived one because the two different databases might # have different roots and different numbers of ranks between the root and the # taxa of interest. 
To solve this, make a tree of the Taxon with the more
  # detailed lineage and splice out all the taxa that won't be in the lineage of
  # your other Taxon:
  my $entrez_mouse = $entrez_dbh->get_taxon(-name => 'Mus musculus');
  my $list_human = $list_dbh->get_taxon(-name => 'Homo sapiens');
  my $mouse_tree = Bio::Tree::Tree->new(-node => $entrez_mouse);
  $mouse_tree->splice(-keep_rank => \@ranks);
  $lca = $mouse_tree->get_lca($entrez_mouse, $list_human);

=head1 DESCRIPTION

This is the next generation (for Bioperl) of representing Taxonomy
information. Previously all information was managed by a single
object called Bio::Species. This new implementation allows
representation of the intermediate nodes, not just the species nodes,
and can relate their connections.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via
the web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Sendu Bala

Email bix@sendu.me.uk

=head1 CONTRIBUTORS

Jason Stajich,    jason-at-bioperl-dot-org (original Bio::Taxonomy::Node)
Juguang Xiao,     juguang@tll.org.sg
Gabriel Valiente, valiente@lsi.upc.edu

=head1 APPENDIX

The rest of the documentation details each of the object
methods.
Internal methods are usually preceded with a _ =cut package Bio::Taxon; $Bio::Taxon::VERSION = '1.7.8'; use strict; use Scalar::Util qw(blessed); use Bio::DB::Taxonomy; use base qw(Bio::Tree::Node Bio::IdentifiableI); =head2 new Title : new Usage : my $obj = Bio::Taxonomy::Node->new(); Function: Builds a new Bio::Taxonomy::Node object Returns : an instance of Bio::Taxonomy::Node Args : -dbh => a reference to a Bio::DB::Taxonomy object [no default] -name => a string representing the taxon name (scientific name) -id => human readable id - typically NCBI taxid -ncbi_taxid => same as -id, but explicitly say that it is an NCBI taxid -rank => node rank (one of 'species', 'genus', etc) -common_names => array ref of all common names -division => 'Primates', 'Rodents', etc -genetic_code => genetic code table number -mito_genetic_code => mitochondrial genetic code table number -create_date => date created in database -update_date => date last updated in database -pub_date => date published in database =cut sub new { my ($class, @args) = @_; my $self = $class->SUPER::new(@args); my ($name, $id, $objid, $rank, $div, $dbh, $ncbitaxid, $commonname, $commonnames, $gcode, $mitocode, $createdate, $updatedate, $pubdate, $parent_id) = $self->_rearrange([qw(NAME ID OBJECT_ID RANK DIVISION DBH NCBI_TAXID COMMON_NAME COMMON_NAMES GENETIC_CODE MITO_GENETIC_CODE CREATE_DATE UPDATE_DATE PUB_DATE PARENT_ID)], @args); if (defined $id && (defined $ncbitaxid && $ncbitaxid ne $id || defined $objid && $objid ne $id)) { $self->warn("Only provide one of -id, -object_id or -ncbi_taxid, using $id\n"); } elsif(!defined $id) { $id = $objid || $ncbitaxid; } defined $id && $self->id($id); $self->{_ncbi_tax_id_provided} = 1 if $ncbitaxid; defined $rank && $self->rank($rank); defined $name && $self->node_name($name); my @common_names; if ($commonnames) { $self->throw("-common_names takes only an array reference") unless $commonnames && ref($commonnames) eq 'ARRAY'; @common_names = @{$commonnames}; } if 
($commonname) { my %c_names = map { $_ => 1 } @common_names; unless (exists $c_names{$commonname}) { unshift(@common_names, $commonname); } } @common_names > 0 && $self->common_names(@common_names); defined $gcode && $self->genetic_code($gcode); defined $mitocode && $self->mitochondrial_genetic_code($mitocode); defined $createdate && $self->create_date($createdate); defined $updatedate && $self->update_date($updatedate); defined $pubdate && $self->pub_date($pubdate); defined $div && $self->division($div); defined $dbh && $self->db_handle($dbh); # Making an administrative decision to override this behavior, particularly # for optimization reasons (if it works to cache it up front, why not? # Please trust your implementations to get it right) # Original note: # deprecated and will issue a warning when method called, # eventually to be removed completely as option defined $parent_id && $self->parent_id($parent_id); # some things want to freeze/thaw Bio::Species objects, but # _root_cleanup_methods contains a CODE ref, delete it. 
delete $self->{_root_cleanup_methods}; return $self; } =head1 Bio::IdentifiableI interface Also see L =head2 version Title : version Usage : $taxon->version($newval) Returns : value of version (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub version { my $self = shift; return $self->{'version'} = shift if @_; return $self->{'version'}; } =head2 authority Title : authority Usage : $taxon->authority($newval) Returns : value of authority (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub authority { my $self = shift; return $self->{'authority'} = shift if @_; return $self->{'authority'}; } =head2 namespace Title : namespace Usage : $taxon->namespace($newval) Returns : value of namespace (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub namespace { my $self = shift; return $self->{'namespace'} = shift if @_; return $self->{'namespace'}; } =head1 Bio::Taxonomy::Node implementation =head2 db_handle Title : db_handle Usage : $taxon->db_handle($newval) Function: Get/Set Bio::DB::Taxonomy Handle Returns : value of db_handle (a scalar) (Bio::DB::Taxonomy object) Args : on set, new value (a scalar, optional) Bio::DB::Taxonomy object Also see L =cut sub db_handle { my $self = shift; if (@_) { my $db = shift; if (! ref($db) || ! $db->isa('Bio::DB::Taxonomy')) { $self->throw("Must provide a valid Bio::DB::Taxonomy object to db_handle()"); } if (!$self->{'db_handle'} || ($self->{'db_handle'} && $self->{'db_handle'} ne $db)) { my $new_self = $self->_get_similar_taxon_from_db($self, $db); $self->_merge_taxa($new_self) if $new_self; } # NB: The Bio::DB::Taxonomy modules access this data member directly # to avoid calling this method and going infinite $self->{'db_handle'} = $db; } return $self->{'db_handle'}; } =head2 rank Title : rank Usage : $taxon->rank($newval) Function: Get/set rank of this Taxon, 'species', 'genus', 'order', etc... 
Returns : value of rank (a scalar)
 Args    : on set, new value (a scalar or undef, optional)

=cut

sub rank {
    my $self = shift;
    return $self->{'rank'} = shift if @_;
    return $self->{'rank'};
}

=head2 id

 Title   : id
 Usage   : $taxon->id($newval)
 Function: Get/Set id (NCBI Taxonomy ID in most cases); object_id() is a
           synonym of this method. ncbi_taxid() sets the same id but only
           returns it when the id was explicitly given as an NCBI taxid.
 Returns : id (a scalar)
 Args    : none to get, OR scalar to set

=cut

sub id {
    my $self = shift;

    return $self->SUPER::id(@_);
}

*object_id = \&id;

=head2 ncbi_taxid

 Title   : ncbi_taxid
 Usage   : $taxon->ncbi_taxid($newval)
 Function: Get/Set the NCBI Taxonomy ID; this actually sets the id() but only
           returns an id when ncbi_taxid has been explicitly set with this
           method.
 Returns : id (a scalar)
 Args    : none to get, OR scalar to set

=cut

sub ncbi_taxid {
    my ($self, $id) = @_;

    if ($id) {
        $self->{_ncbi_tax_id_provided} = 1;
        return $self->SUPER::id($id);
    }

    if ($self->{_ncbi_tax_id_provided}) {
        return $self->SUPER::id;
    }
    return;
}

=head2 parent_id

 Title   : parent_id
 Usage   : $taxon->parent_id()
 Function: Get/Set the parent ID (NCBI Taxonomy ID in most cases);
           parent_taxon_id() is a synonym of this method. If no parent ID
           has been set, the id of ancestor() is returned.
 Returns : value of parent_id (a scalar)
 Args    : none to get, OR scalar to set

=cut

sub parent_id {
    my $self = shift;
    if (@_) {
        $self->{parent_id} = shift;
    }
    if (defined $self->{parent_id}) {
        return $self->{parent_id}
    }
    my $ancestor = $self->ancestor() || return;
    return $ancestor->id;
}

*parent_taxon_id = \&parent_id;

=head2 trusted_parent_id

 Title   : trusted_parent_id
 Usage   : $taxon->trusted_parent_id()
 Function: If the parent_id is explicitly set, trust it
 Returns : simple boolean value (whether or not it has been set)
 Args    : none
 Notes   : Previously, the parent_id method was to be deprecated in favor of
           using ancestor(). However this removes one key optimization point,
           namely when an implementation has direct access to the taxon's
           parent ID when retrieving the information for the taxon ID.
This method is in place so implementations can choose to (1) check whether the parent_id is set and (2) trust that the implementation (whether it is self or another implementation) set the parent_id correctly. =cut sub trusted_parent_id { return defined $_[0]->{parent_id}; } =head2 genetic_code Title : genetic_code Usage : $taxon->genetic_code($newval) Function: Get/set genetic code table Returns : value of genetic_code (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub genetic_code { my $self = shift; return $self->{'genetic_code'} = shift if @_; return $self->{'genetic_code'}; } =head2 mitochondrial_genetic_code Title : mitochondrial_genetic_code Usage : $taxon->mitochondrial_genetic_code($newval) Function: Get/set mitochondrial genetic code table Returns : value of mitochondrial_genetic_code (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub mitochondrial_genetic_code { my $self = shift; return $self->{'mitochondrial_genetic_code'} = shift if @_; return $self->{'mitochondrial_genetic_code'}; } =head2 create_date Title : create_date Usage : $taxon->create_date($newval) Function: Get/Set Date this node was created (in the database) Returns : value of create_date (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub create_date { my $self = shift; return $self->{'create_date'} = shift if @_; return $self->{'create_date'}; } =head2 update_date Title : update_date Usage : $taxon->update_date($newval) Function: Get/Set Date this node was updated (in the database) Returns : value of update_date (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub update_date { my $self = shift; return $self->{'update_date'} = shift if @_; return $self->{'update_date'}; } =head2 pub_date Title : pub_date Usage : $taxon->pub_date($newval) Function: Get/Set Date this node was published (in the database) Returns : value of pub_date (a scalar) Args : on set, new value (a scalar or undef, optional) 
=cut sub pub_date { my $self = shift; return $self->{'pub_date'} = shift if @_; return $self->{'pub_date'}; } =head2 ancestor Title : ancestor Usage : my $ancestor_taxon = $taxon->ancestor() Function: Retrieve the ancestor taxon. Normally the database is asked what the ancestor is. If you manually set the ancestor (or you make a Bio::Tree::Tree with this object as an argument to new()), the database (if any) will not be used for the purposes of this method. To restore normal database behaviour, call ancestor(undef) (which would remove this object from the tree), or request this taxon again as a new Taxon object from the database. Returns : Bio::Taxon Args : none =cut sub ancestor { my $self = shift; my $ancestor = $self->SUPER::ancestor(@_); if ($ancestor) { return $ancestor; } my $dbh = $self->db_handle; #*** could avoid the db lookup if we knew our current id was definitely # information from the db... my $definitely_from_dbh = $self->_get_similar_taxon_from_db($self); return $dbh->ancestor($definitely_from_dbh); } =head2 get_Parent_Node Title : get_Parent_Node Function: Synonym of ancestor() Status : deprecated =cut sub get_Parent_Node { my $self = shift; $self->warn("get_Parent_Node is deprecated, use ancestor() instead"); return $self->ancestor(@_); } =head2 each_Descendent Title : each_Descendent Usage : my @taxa = $taxon->each_Descendent(); Function: Get all the descendents for this Taxon (but not their descendents, ie. not a recursive fetchall). get_Children_Nodes() is a synonym of this method. Note that this method never asks the database for the descendents; it will only return objects you have manually set with add_Descendent(), or where this was done for you by making a Bio::Tree::Tree with this object as an argument to new(). To get the database descendents use $taxon->db_handle->each_Descendent($taxon). 
 Returns : Array of Bio::Taxon objects
 Args    : optionally, when you have set your own descendents, the string
           "height", "creation", "alpha", "revalpha", or coderef to be used to
           sort the order of children nodes.

=cut

# implemented by Bio::Tree::Node

=head2 get_Children_Nodes

 Title   : get_Children_Nodes
 Function: Synonym of each_Descendent()
 Status  : deprecated

=cut

sub get_Children_Nodes {
    my $self = shift;
    $self->warn("get_Children_Nodes is deprecated, use each_Descendent() instead");
    return $self->each_Descendent(@_);
}

=head2 name

  Title:    name
  Usage:    $taxon->name('scientific', 'Homo sapiens');
            $taxon->name('common', 'human', 'man');
            my @names = @{$taxon->name('common')};
  Function: Get/set the names. node_name(), scientific_name() and
            common_names() are shorthands to name('scientific'),
            name('scientific') and name('common') respectively.
  Returns:  names (an array reference)
  Args:     Arg1 => the name_class. You can assign any text, but the words
                'scientific' and 'common' have special meaning, as
                scientific name and common name, respectively. 'scientific'
                and 'division' are treated specially, allowing only the
                first value in the Arg2 list to be set.
            Arg2 ... => list of names

=cut

sub name {
    my ($self, $name_class, @names) = @_;
    $self->throw('No name class specified') unless defined $name_class;

    if (@names) {
        if ($name_class =~ /scientific|division/i) {
            delete $self->{'_names_hash'}->{$name_class};
            @names = (shift(@names));
        }
        push @{$self->{'_names_hash'}->{$name_class}}, @names;
    }
    return $self->{'_names_hash'}->{$name_class} || return;
}

=head2 node_name

 Title   : node_name
 Usage   : $taxon->node_name($newval)
 Function: Get/set the name of this taxon (node), typically the scientific name
           of the taxon, eg. 'Primate' or 'Homo'; scientific_name() is a synonym
           of this method.
Returns : value of node_name (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub node_name { my $self = shift; my @v = @{$self->name('scientific', @_) || []}; return pop @v; } *scientific_name = \&node_name; =head2 common_names Title : common_names Usage : $taxon->common_names($newval) Function: Get/add the other names of this taxon, typically the genbank common name and others, eg. 'Human' and 'man'. common_name() is a synonym of this method. Returns : array of names in list context, one of those names in scalar context Args : on add, new list of names (scalars, optional) =cut sub common_names { my $self = shift; my @v = @{$self->name('common', @_) || []}; return ( wantarray ) ? @v : pop @v; } *common_name = \&common_names; =head2 division Title : division Usage : $taxon->division($newval) Function: Get/set the division this taxon belongs to, eg. 'Primates' or 'Bacteria'. Returns : value of division (a scalar) Args : on set, new value (a scalar or undef, optional) =cut sub division { my $self = shift; my @v = @{$self->name('division',@_) || []}; return pop @v; } # get a node from the database that is like the supplied node sub _get_similar_taxon_from_db { #*** not really happy with this having to be called so much; there must be # a better way... 
    my ($self, $taxon, $db) = @_;
    $self->throw("Must supply a Bio::Taxon") unless ref($taxon) && $taxon->isa("Bio::Taxon");
    ($self->id || $self->node_name) || return;
    $db ||= $self->db_handle || return;
    if (!blessed($db) || !$db->isa('Bio::DB::Taxonomy')) {
        $self->throw("DB handle is not a Bio::DB::Taxonomy: got $db in node ".$self->node_name)
    }

    # declare first, then assign conditionally: the behaviour of
    # "my $x = ... if COND" is undefined in Perl
    my $db_taxon;
    $db_taxon = $db->get_taxon(-taxonid => $taxon->id) if $taxon->id;

    unless ($db_taxon) {
        my @try_ids;
        @try_ids = $db->get_taxonids($taxon->node_name) if $taxon->node_name;

        my $own_rank = $taxon->rank || 'no rank';
        foreach my $try_id (@try_ids) {
            my $try = $db->get_taxon(-taxonid => $try_id);
            my $try_rank = $try->rank || 'no rank';
            # accept the first candidate whose rank is unknown or matches ours
            if ($own_rank eq 'no rank' || $try_rank eq 'no rank' || $own_rank eq $try_rank) {
                $db_taxon = $try;
                last;
            }
        }
    }

    return $db_taxon;
}

# merge data from supplied Taxon into self
sub _merge_taxa {
    my ($self, $taxon) = @_;
    $self->throw("Must supply a Bio::Taxon object") unless ref($taxon) && $taxon->isa('Bio::Taxon');
    return if ($taxon eq $self);

    foreach my $attrib (qw(scientific_name version authority namespace genetic_code mitochondrial_genetic_code create_date update_date pub_date division id)) {
        my $own = $self->$attrib();
        my $his = $taxon->$attrib();
        if (!$own && $his) {
            $self->$attrib($his);
        }
    }

    my $own = $self->rank || 'no rank';
    my $his = $taxon->rank || 'no rank';
    if ($own eq 'no rank' && $his ne 'no rank') {
        $self->rank($his);
    }

    my %own_cnames = map { $_ => 1 } $self->common_names;
    my %his_cnames = map { $_ => 1 } $taxon->common_names;
    foreach (keys %his_cnames) {
        unless (exists $own_cnames{$_}) {
            $self->common_names($_);
        }
    }

    #*** haven't merged the other things in names() hash, could do above much easier with direct access to object data
}

=head2 remove_Descendent

 Title   : remove_Descendent
 Usage   : $node->remove_Descendent($node_foo);
 Function: Removes a specific node from being a Descendent of this node
 Returns : number of nodes removed
 Args    : An array of Bio::Node::NodeI objects which have been previously passed to the
add_Descendent call of this object. =cut sub remove_Descendent { # need to override this method from Bio::Tree::Node since it casually # throws away nodes if they don't branch my ($self,@nodes) = @_; my $c= 0; foreach my $n ( @nodes ) { if ($self->{'_desc'}->{$n->internal_id}) { $self->{_removing_descendent} = 1; $n->ancestor(undef); $self->{_removing_descendent} = 0; $self->{'_desc'}->{$n->internal_id}->ancestor(undef); delete $self->{'_desc'}->{$n->internal_id}; $c++; } } return $c; } 1; bp_search2gff100644000766000024 3020614006427353 16427 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_search2gff - turn a SearchIO report into GFF =head1 SYNOPSIS Usage: bp_search2gff [-o outputfile] [-f reportformat] [-i inputfilename] OR file1 file2 .. =head1 DESCRIPTION This script will turn a SearchIO report (BLAST, FASTP, SSEARCH, AXT, WABA) into GFF. The options are: -i infilename - (optional) inputfilename, will read either ARGV files or from STDIN -o filename - the output filename [default STDOUT] -f format - search result format (blast, fasta,waba,axt) (ssearch is fasta format). default is blast. 
-t/--type seqtype - if you want to see query or hit information in the GFF report -s/--source - specify the source (will be algorithm name otherwise like BLASTN) --method - the method tag (primary_tag) of the features (default is similarity) --scorefunc - a string or a file that when parsed evaluates to a closure which will be passed a feature object and that returns the score to be printed --locfunc - a string or a file that when parsed evaluates to a closure which will be passed two features, query and hit, and returns the location (Bio::LocationI compliant) for the GFF3 feature created for each HSP; the closure may use the clone_loc() and create_loc() functions for convenience, see their PODs --onehsp - only print the first HSP feature for each hit -p/--parent - the parent to which HSP features should refer if not the name of the hit or query (depending on --type) --target/--notarget - whether to always add the Target tag or not -h - this help menu --version - GFF version to use (put a 3 here to use gff 3) --component - generate GFF component fields (chromosome) -m/--match - generate a 'match' line which is a container of all the similarity HSPs --addid - add ID tag in the absence of --match -c/--cutoff - specify an evalue cutoff Additionally specify the filenames you want to process on the command-line. If no files are specified then STDIN input is assumed. 
You specify this by doing: bp_search2gff E<lt> file1 file2 file3

=head1 AUTHOR

Jason Stajich, jason-at-bioperl-dot-org

=head1 Contributors

Hilmar Lapp, hlapp-at-gmx-dot-net

=cut

use strict;
use warnings;
use Bio::Tools::GFF;
use Getopt::Long;
use Bio::SearchIO;
use Bio::SeqFeature::Generic;     # features are constructed directly below
use Bio::Location::Simple;        # pre-declare to simplify $locfunc implementations
use Bio::Location::Atomic;        # pre-declare to simplify $locfunc implementations
use Storable qw(dclone);          # for cloning location objects
use Bio::Factory::FTLocationFactory;

my (
    $output,       # output file (if not stdout)
    $input,        # name of the input file
    $format,       # format of the input file, default is blast
    $type,         # 'query' or 'hit'
    $cutoff,       # cut-off value for e-value filter
    $sourcetag,    # explicit source tag (will be taken from program
                   # otherwise)
    $methodtag,    # primary tag (a.k.a. method), default 'similarity'
    $gffver,       # GFF version (dialect) to write
    $scorefunc,    # closure returning the score for a passed feature
    $locfunc,      # closure returning a location object for a passed
                   # query and hit feature
    $addid,        # flag: whether to always add the ID for $match == 0
    $parent,       # the name of the parent to use; if set and $match == 0
                   # will always add the target
    $comp,         # flag: whether to print a component feature
    $addtarget,    # flag: whether to always add the Target tag, default
                   # is true
    $match,        # flag: whether to print match lines as containers
    $onehsp,       # flag: whether to consider only the first HSP for a hit
    $quiet,        # flag: run quietly
    $help          # flag: show help screen
);

# set defaults:
$format    = 'blast';
$type      = 'query';
$gffver    = 2;
$methodtag = "similarity";
$addtarget = 1;

GetOptions(
    'i|input:s'   => \$input,
    'component'   => \$comp,
    'm|match'     => \$match,
    'o|output:s'  => \$output,
    'f|format:s'  => \$format,
    's|source:s'  => \$sourcetag,
    'method=s'    => \$methodtag,
    'addid'       => \$addid,
    'scorefunc=s' => \$scorefunc,
    'locfunc=s'   => \$locfunc,
    'p|parent=s'  => \$parent,
    'target!'
=> \$addtarget, 'onehsp' => \$onehsp, 't|type:s' => \$type, 'c|cutoff:s' => \$cutoff, 'v|version:i' => \$gffver, 'q|quiet' => \$quiet, 'h|help' => sub { exec( 'perldoc', $0 ); exit(0); }, ); $type = lc($type); if ( $type =~ /target/ ) { $type = 'hit' } elsif ( $type ne 'query' && $type ne 'hit' ) { die("seqtype must be either 'query' or 'hit'"); } # custom or default function returning the score $scorefunc = defined($scorefunc) ? parse_code($scorefunc) : sub { shift->score }; # custom or default function returning the location $locfunc = defined($locfunc) ? parse_code($locfunc) : sub { shift->location }; # if --match is given then $addid needs to be disabled $addid = undef if $addid && $match; # if no input is provided STDIN will be used my $parser = new Bio::SearchIO( -format => $format, -verbose => $quiet ? -1 : 0, -file => $input ); my $out; if ( defined $output ) { $out = new Bio::Tools::GFF( -gff_version => $gffver, -file => ">$output" ); } else { $out = new Bio::Tools::GFF( -gff_version => $gffver ); # STDOUT } my ( %seen_hit, %seen ); my $other = $type eq 'query' ? 'hit' : 'query'; while ( my $result = $parser->next_result ) { my $qname = $result->query_name; if ( $comp && $type eq 'query' && $result->query_length ) { $out->write_feature( Bio::SeqFeature::Generic->new( -start => 1, -end => $result->query_length, -seq_id => $qname, -source_tag => 'chromosome', -primary_tag => 'Component', -tag => { 'Sequence' => $qname } ) ); } while ( my $hit = $result->next_hit ) { next if ( defined $cutoff && $hit->significance > $cutoff ); my $acc = $qname; if ( $seen{ $qname . "-" . $hit->name }++ ) { $acc = $qname . "-" . $seen{ $qname . '-' . 
$hit->name }; } if ( $comp && $type eq 'hit' && $hit->length && !$seen_hit{ $hit->name }++ ) { $out->write_feature( Bio::SeqFeature::Generic->new( -start => 1, -end => $hit->length, -seq_id => $hit->name, -source_tag => 'chromosome', -primary_tag => 'Component', -tag => { 'Sequence' => $hit->name } ) ); } my ( %min, %max, $seqid, $name, $st ); while ( my $hsp = $hit->next_hsp ) { my $feature = new Bio::SeqFeature::Generic; my ( $proxyfor, $otherf ); if ( $type eq 'query' ) { ( $proxyfor, $otherf ) = ( $hsp->query, $hsp->hit ); $name ||= $hit->name; } else { ( $otherf, $proxyfor ) = ( $hsp->query, $hsp->hit ); $name ||= $acc; } $proxyfor->score( $hit->bits ) unless ( $proxyfor->score ); if ( ( $gffver == 3 ) && ( $match || $parent ) ) { $feature->add_tag_value( 'Parent', $parent || $name ); } $min{$type} = $proxyfor->start unless defined $min{$type} && $min{$type} < $proxyfor->start; $max{$type} = $proxyfor->end unless defined $max{$type} && $max{$type} > $proxyfor->end; $min{$other} = $otherf->start unless defined $min{$other} && $min{$other} < $otherf->start; $max{$other} = $otherf->end unless defined $max{$other} && $max{$other} > $otherf->end; if ( $addtarget || $match ) { $feature->add_tag_value( 'Target', 'Sequence:' . 
$name ); $feature->add_tag_value( 'Target', $otherf->start ); $feature->add_tag_value( 'Target', $otherf->end ); } if ($addid) { $feature->add_tag_value( 'ID', $name ); } $feature->location( &$locfunc( $proxyfor, $otherf ) ); # strand for feature is always going to be product of # query & hit strands so that target can always be just # '+' $feature->strand( $proxyfor->strand * $otherf->strand ); if ($sourcetag) { $feature->source_tag($sourcetag); } else { $feature->source_tag( $proxyfor->source_tag ); } $feature->score( &$scorefunc($proxyfor) ); $feature->frame( $proxyfor->frame ); $feature->seq_id( $proxyfor->seq_id ); $feature->primary_tag($methodtag); # add annotation if encoded in the query description my $desc = $result->query_description; while ( $desc =~ /\/([^=]+)=(\S+)/g ) { $feature->add_tag_value( $1, $2 ); } $seqid ||= $proxyfor->seq_id; $out->write_feature($feature); $st ||= $sourcetag || $proxyfor->source_tag; last if $onehsp; } if ($match) { my $matchf = Bio::SeqFeature::Generic->new( -start => $min{$type}, -end => $max{$type}, -strand => $hit->strand($type) * $hit->strand($other), -primary_tag => 'match', -source_tag => $st, -score => $hit->bits, -seq_id => $seqid ); if ( $gffver == 3 ) { $matchf->add_tag_value( 'ID', $name ); } $matchf->add_tag_value( 'Target', "Sequence:$name" ); $out->write_feature($matchf); } } } sub parse_code { my $src = shift; my $code; # file or subroutine? if ( -r $src ) { if ( !( ( $code = do $src ) && ( ref($code) eq "CODE" ) ) ) { die "error in parsing code block $src: $@" if $@; die "unable to read file $src: $!" if $!; die "failed to run $src, or it failed to return a closure"; } } else { $code = eval $src; die "error in parsing code block \"$src\": $@" if $@; die "\"$src\" fails to return a closure" unless ref($code) eq "CODE"; } return $code; } =head2 clone_loc Title : clone_loc Usage : my $l = clone_loc($feature->location); Function: Helper function to simplify the task of cloning locations for --locfunc closures. 
Presently simply implemented using Storable::dclone(). Example : Returns : A L object of the same type and with the same properties as the argument, but physically different. All structured properties will be cloned as well. Args : A L compliant object =cut sub clone_loc { return dclone(shift); } =head2 create_loc Title : create_loc Usage : my $l = create_loc("10..12"); Function: Helper function to simplify the task of creating locations for --locfunc closures. Creates a location from a feature- table formatted string. Example : Returns : A L object representing the location given as formatted string. Args : A GenBank feature-table formatted string. =cut sub create_loc { return Bio::Factory::FTLocationFactory->from_string(shift); } bp_seqconvert100644000766000024 422014006427353 16563 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl use strict; use warnings; use Getopt::Long; use Bio::SeqIO; my $help; my $from=undef; my $to=undef; ### please add to this list (see the modules under Bio/SeqIO): my @known_formats= qw(gcg fasta ace raw fastq phd pir scf swiss genbank locuslink embl game qual bsml tab raw abi chado alf ctf exp ztr pln chaosxml chadoxml yaml tigr tigrxml agave chaos kegg interpro lasergene strider gbdriver embldriver swissdriver); my $script=substr($0, 1+rindex($0,'/')); my $usage="Usage: $script --from in-format --to out-format < file.in-format > file.out-format Known formats:\n " . join(' ', @known_formats) . 
"\n\n"; die $usage unless &GetOptions( 'from:s' => \$from, 'to:s' => \$to, 'h|help' => \$help ) && !$help && $from && $to && grep($from eq $_, @known_formats) && grep($to eq $_, @known_formats); my $in = Bio::SeqIO->newFh(-fh => \*STDIN , '-format' => $from); my $out = Bio::SeqIO->newFh(-fh=> \*STDOUT, '-format' => $to); print $out $_ while <$in>; __END__ =head1 NAME bp_seqconvert - generic BioPerl sequence format converter =head1 SYNOPSIS bp_seqconvert --from in-format --to out-format < file.in-format > file.out-format # or bp_seqconvert -f in-format -t out-format < file.in-format > file.out-format =head1 DESCRIPTION This script gives command line interface to BioPerl Bio::SeqIO. =head1 SEE ALSO L L for similar functionality which also supports AlignIO. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Philip Lijnzaad Email Ep.lijnzaad-at-med.uu.nlE =cut __END__ bp_seq_length100644000766000024 322214006427353 16524 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_seq_length.pl - lists the number of bases and number of sequences in specified sequence database files =head1 SYNOPSIS bp_seq_length.pl *.fa =head1 DESCRIPTION bp_seq_length.pl will report the total number of residues and total number of individual sequences contained within a specified sequence database file. =head1 OPTIONS -f/--format - Specify the database format ('fasta' is default). 
This script uses SeqIO, so the supported formats are limited to those
which the SeqIO system supports.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list.  Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Jason Stajich

Jason Stajich E<lt>jason@bioperl.orgE<gt>

=cut

use strict;
use warnings;
use Bio::SeqIO;
use Getopt::Long;

my $format = 'fasta';
GetOptions('f|format:s' => \$format);

# with no input files, show the documentation instead
exec('perldoc',$0) unless @ARGV;

foreach my $f ( @ARGV ) {
    my $in = Bio::SeqIO->new(-file   => $f,
                             -format => $format);
    my $len   = 0;    # total residues in this file
    my $count = 0;    # total sequences in this file
    while( my $seq = $in->next_seq ) {
        $len += $seq->length();
        $count++;
    }

    printf "%-10s %d bp %d sequences\n", $f, $len, $count;
}
Signalp.t100644000766000024 212214006427353 16342 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 11);

    use_ok('Bio::Tools::Signalp');
}

# global setting
my $verbose = test_debug();

# shared variables
my $infile;
my $parser;
my @feat;

# negative example without "YES" features
ok $infile = test_input_file('signalp.negative.out');
ok $parser = Bio::Tools::Signalp->new(-file=>$infile, -verbose=>$verbose);

while ( my $feat = $parser->next_result ) {
    push @feat, $feat;
}
is @feat, 0;
is $parser->_seqname, 'my_fasta_id';
is $parser->_fact1,   'NO';

# positive example with "YES" features
ok $infile = test_input_file('signalp.positive.out');
ok $parser = Bio::Tools::Signalp->new(-file=>$infile, -verbose=>$verbose);

#
# The current module does NOT parse stuff properly
# It
is probably from version 2 but version 3 is used today # This has to be investigated!!!! --Torsten # FIXME / TODO? / BUG / *** # while ( my $feat = $parser->next_result ) { push @feat, $feat; } is @feat , 1; is $parser->_seqname, 'my_fasta_id'; is $parser->_fact1, 'YES'; TargetP.t100644000766000024 1133614006427353 16342 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 124); use_ok('Bio::Tools::TargetP'); } my $targetp = Bio::Tools::TargetP->new( -file => test_input_file('targetp.out') ); ok($targetp); my $items = { '1' => { 'id' => 'BC1G_00001.1', 'len' => '173', 'mTP' => '0.393', 'SP' => '0.024', 'other' => '0.683', 'loc' => '_', 'RC' => '4', 'tplen' => undef, }, '2' => { 'id' => 'BC1G_00002.1', 'len' => '120', 'mTP' => '0.619', 'SP' => '0.040', 'other' => '0.458', 'loc' => 'M', 'RC' => '5', 'tplen' => '97', }, '3' => { 'id' => 'BC1G_00003.1', 'len' => '323', 'mTP' => '0.094', 'SP' => '0.895', 'other' => '0.027', 'loc' => 'S', 'RC' => '1', 'tplen' => '21', }, '4' => { 'id' => 'BC1G_00004.1', 'len' => '361', 'mTP' => '0.402', 'SP' => '0.072', 'other' => '0.479', 'loc' => '_', 'RC' => '5', 'tplen' => undef, }, '5' => { 'id' => 'BC1G_00005.1', 'len' => '244', 'mTP' => '0.526', 'SP' => '0.035', 'other' => '0.548', 'loc' => '_', 'RC' => '5', 'tplen' => undef, }, '6' => { 'id' => 'BC1G_00006.1', 'len' => '35', 'mTP' => '0.234', 'SP' => '0.036', 'other' => '0.819', 'loc' => '_', 'RC' => '3', 'tplen' => undef, }, '7' => { 'id' => 'BC1G_00007.1', 'len' => '73', 'mTP' => '0.292', 'SP' => '0.127', 'other' => '0.431', 'loc' => '_', 'RC' => '5', 'tplen' => undef, }, '8' => { 'id' => 'BC1G_00008.1', 'len' => '349', 'mTP' => '0.088', 'SP' => '0.958', 'other' => '0.078', 'loc' => 'S', 'RC' => '1', 'tplen' => '82', }, '9' => { 'id' => 'BC1G_00009.1', 'len' => '514', 'mTP' => '0.183', 'SP' => '0.102', 'other' => '0.735', 'loc' => '_', 'RC' => '3', 'tplen' => 
undef, }, '10' => { 'id' => 'BC1G_00010.1', 'len' => '440', 'mTP' => '0.114', 'SP' => '0.088', 'other' => '0.865', 'loc' => '_', 'RC' => '2', 'tplen' => undef, }, '11' => { 'id' => 'BC1G_04501.1', 'len' => '215', 'mTP' => '0.185', 'SP' => '0.038', 'other' => '0.843', 'loc' => '_', 'RC' => '2', 'tplen' => undef, }, '12' => { 'id' => 'BC1G_04502.1', 'len' => '395', 'mTP' => '0.118', 'SP' => '0.164', 'other' => '0.825', 'loc' => '_', 'RC' => '2', 'tplen' => undef, }, '13' => { 'id' => 'BC1G_04503.1', 'len' => '199', 'mTP' => '0.515', 'SP' => '0.062', 'other' => '0.436', 'loc' => 'M', 'RC' => '5', 'tplen' => '20', }, '14' => { 'id' => 'BC1G_04504.1', 'len' => '220', 'mTP' => '0.440', 'SP' => '0.030', 'other' => '0.707', 'loc' => '_', 'RC' => '4', 'tplen' => undef, }, '15' => { 'id' => 'BC1G_04505.1', 'len' => '67', 'mTP' => '0.382', 'SP' => '0.049', 'other' => '0.610', 'loc' => '_', 'RC' => '4', 'tplen' => undef, }, }; my $i = 1; $targetp->_parse_results(); is($targetp->network(), 'NON-PLANT'); is($targetp->analysis_method_version(), "v1.1"); while(my $feat = $targetp->next_prediction()){ is($feat->seq_id(), $items->{$i}->{id}, "good SeqID"); is($feat->length(), $items->{$i}->{len}, "good Seqlength"); is(($feat->get_tag_values('mitochondrionCutOff'))[0], $items->{$i}->{mTP}, "correct Mitochondrion cutoff"); is(($feat->get_tag_values('signalPeptideCutOff'))[0], $items->{$i}->{SP}, "correct signalpPeptide cutoff"); is(($feat->get_tag_values('otherCutOff'))[0], $items->{$i}->{other}, "correct other cutoff"); is(($feat->get_tag_values('location'))[0], $targetp->_toString_location($items->{$i}->{loc}), "correct location"); is(($feat->get_tag_values('reliabilityClass'))[0], $items->{$i}->{RC}, "correct Reliability class score"); if ($feat->has_tag('signalPeptideLength')) { is(($feat->get_tag_values('signalPeptideLength'))[0], $items->{$i}->{tplen}, "correct peptide signal length") } else { is($feat->has_tag('signalPeptideLength'), '', "No peptide signal length reported") } 
$i++; } Genpred.t100644000766000024 2302614006427353 16357 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 185); use_ok('Bio::Tools::Fgenesh'); use_ok('Bio::Tools::Genscan'); use_ok('Bio::Tools::Genemark'); use_ok('Bio::Tools::Glimmer'); use_ok('Bio::Tools::MZEF'); use_ok('Bio::SeqIO'); } # Genscan report my $genscan = Bio::Tools::Genscan->new('-file' => test_input_file('genomic-seq.genscan')); ok $genscan; # original sequence my $seqin = Bio::SeqIO->new('-file' => test_input_file('genomic-seq.fasta'), '-format' => "fasta"); ok $seqin; my $seq = $seqin->next_seq(); $seqin->close(); ok $seq; # scan through the report my $fea; my $pred_num = 0; my ($prtseq, $cds, $tr_cds); while(my $gene = $genscan->next_prediction()) { $gene->attach_seq($seq) if $seq; $pred_num++; if($pred_num == 1) { $fea = ($gene->exons())[0]; is $fea->strand(), -1, "strand match (".$fea->strand()." and -1)"; $fea = ($gene->poly_A_site()); is $fea->score(), 1.05, "score match (".$fea->score()." and 1.05)"; } if($pred_num == 2) { $fea = ($gene->exons("Initial"))[0]; is $fea->strand(), 1, "strand match (".$fea->strand()." and 1)"; is $fea->score(), 4.46, "score match (".$fea->score()." 
and 4.46)"; } if($pred_num == 3) { my @exons = $gene->exons("Initial"); is scalar(@exons), 0, "initial exons ".scalar(@exons); $fea = ($gene->exons())[0]; is $fea->score(), 1.74, "score match ".$fea->score(); } if($seq) { $prtseq = $gene->predicted_protein()->seq(); $cds = $gene->cds(); ok($cds); $tr_cds = $cds->translate()->seq(); $tr_cds =~ s/\*$//; is( lc($prtseq), lc($tr_cds), "predicted and extracted protein seqs match"); } } # Genscan report with no genes predicted my $null_genscan = Bio::Tools::Genscan->new('-file' => test_input_file('no-genes.genscan')); ok $null_genscan; my $no_gene = $null_genscan->next_prediction; my @exons = $no_gene->exons; is($#exons,-1); # MZEF report my $mzef = Bio::Tools::MZEF->new('-file' => test_input_file('genomic-seq.mzef')); ok $mzef; my $exon_num = 0; my $gene = $mzef->next_prediction(); is($gene->exons, 23); # Genemark testing my $genemark = Bio::Tools::Genemark->new('-file' => test_input_file('genemark.out')); my $gmgene = $genemark->next_prediction(); is $gmgene->seq_id(), "Hvrn.contig8"; is $genemark->analysis_date(), "Thu Mar 22 10:25:00 2001"; my $i = 0; my @num_exons = (1,5,2,1,9,5,3,2,3,2,1,2,7); while($gmgene = $genemark->next_prediction()) { $i++; my @gmexons = $gmgene->exons(); is scalar(@gmexons), $num_exons[$i]; if($i == 5) { my $gmstart = $gmexons[0]->start(); is $gmstart, 23000; my $gmend = $gmexons[0]->end(); is $gmend, 23061; } } # Genemark testing (prokaryotic gene fragment) $genemark = Bio::Tools::Genemark->new('-file' => test_input_file('genemark-fragment.out'), '-seqname' => 'AAVN02000021.1'); $gmgene = $genemark->next_prediction(); is $gmgene->seq_id(), 'AAVN02000021.1','Genemark tests'; is $gmgene->start(), 2; is $gmgene->end(), 214; is $gmgene->strand(), '1'; my ($gmexon) = $gmgene->exons(); isa_ok $gmexon->location(), 'Bio::Location::Fuzzy'; is $gmexon->location->start_pos_type(), 'BEFORE'; is $gmexon->location->max_start(), 2; is $gmexon->location->end_pos_type(), 'EXACT'; is 
$gmexon->location->end(), 214; $gmgene = $genemark->next_prediction(); is $gmgene->seq_id(), 'AAVN02000021.1'; is $gmgene->start, 459; is $gmgene->end, 596; is $gmgene->strand(), '1'; ($gmexon) = $gmgene->exons(); isa_ok $gmexon->location, 'Bio::Location::Fuzzy'; is $gmexon->location->start_pos_type(), 'EXACT'; is $gmexon->location->start(), 459; is $gmexon->location->end_pos_type(), 'AFTER'; is $gmexon->location->min_end(), 596; # Glimmer testing (GlimmerM) my $glimmer_m = Bio::Tools::Glimmer->new('-file' => test_input_file('GlimmerM.out')); $gmgene = $glimmer_m->next_prediction; ok($gmgene); is($gmgene->seq_id, 'gi|23613028|ref|NC_004326.1|'); is($gmgene->source_tag, 'GlimmerM_3.0'); is($gmgene->primary_tag, 'transcript'); is(($gmgene->get_tag_values('Group'))[0], 'GenePrediction1'); my @glim_exons = $gmgene->exons; is(scalar (@glim_exons), 1); is($glim_exons[0]->start, 461); is($glim_exons[0]->end, 523); is($glim_exons[0]->strand, -1); is(($glim_exons[0]->get_tag_values('Group'))[0], 'GenePrediction1'); @num_exons = (0,1,3,1,4,2,5,2,8,3,5); $i = 1; while($gmgene = $glimmer_m->next_prediction()) { $i++; is(($gmgene->get_tag_values('Group'))[0],"GenePrediction$i"); @glim_exons = $gmgene->exons(); is scalar(@glim_exons), $num_exons[$i]; if($i == 5) { is $glim_exons[1]->start, 23910; is $glim_exons[1]->end, 23956; is $glim_exons[1]->strand, 1; } } # Glimmer testing (GlimmerHMM) my $glimmer_hmm = Bio::Tools::Glimmer->new('-file' => test_input_file('GlimmerHMM.out')); my $ghmmgene = $glimmer_hmm->next_prediction; ok($ghmmgene); is($ghmmgene->seq_id, 'gi|23613028|ref|NC_004326.1|'); is($ghmmgene->source_tag, 'GlimmerHMM'); is($ghmmgene->primary_tag, 'transcript'); is($ghmmgene->exons, 1); @num_exons = qw(0 1 2 4 2 2 1 1 1 2 2 2 10 4 1 1); # only first few tested $i = 1; while ($ghmmgene = $glimmer_hmm->next_prediction) { $i++; my @ghmm_exons = $ghmmgene->exons; is(scalar(@ghmm_exons), $num_exons[$i]) if $i <= $#num_exons; if ($i == 9) { is( $ghmm_exons[1]->start, 5538 
); is( $ghmm_exons[1]->end, 5647 ); cmp_ok( $ghmm_exons[1]->strand, '>', 0 ); } } is($i, 44); # Glimmer testing (Glimmer 2.X) my $glimmer_2 = Bio::Tools::Glimmer->new('-file' => test_input_file('Glimmer2.out'), '-seqname' => 'BCTDNA', '-seqlength' => 29940,); my $g2gene = $glimmer_2->next_prediction; ok($g2gene); is($g2gene->seq_id, 'BCTDNA'); is($g2gene->source_tag, 'Glimmer_2.X'); is($g2gene->primary_tag, 'gene'); is($g2gene->start, 292); is($g2gene->end, 1623); is($g2gene->frame, 0); is($g2gene->strand, 1); $i = 1; while ($g2gene = $glimmer_2->next_prediction) { $i++; if ($i == 2) { is($g2gene->start, 2230); is($g2gene->end, 2349); is($g2gene->strand, -1); is($g2gene->frame, 0); } elsif ($i == 25) { isa_ok($g2gene->location, 'Bio::Location::SplitLocationI'); my @sublocations = $g2gene->location->sub_Location(); is(scalar (@sublocations), 2); is($sublocations[0]->start, 29263); is($sublocations[0]->end, 29940); is($sublocations[1]->start, 1); is($sublocations[1]->end, 9); is($g2gene->strand, 1); is($g2gene->frame, 0); } } is($i, 25); # Glimmer testing (Glimmer 3.X) my $glimmer_3 = Bio::Tools::Glimmer->new('-file' => test_input_file('Glimmer3.predict'), '-detail' => test_input_file('Glimmer3.detail')); my $g3gene = $glimmer_3->next_prediction; ok($g3gene); is($g3gene->seq_id, 'BCTDNA'); is($g3gene->source_tag, 'Glimmer_3.X'); is($g3gene->primary_tag, 'gene'); is($g3gene->score, '9.60'); isa_ok($g3gene->location, 'Bio::Location::SplitLocationI'); my @sublocations = $g3gene->location->sub_Location(); is(scalar (@sublocations), 2); is($sublocations[0]->start, 29263); is($sublocations[0]->end, 29940); is($sublocations[1]->start, 1); is($sublocations[1]->end, 9); is($g3gene->frame, 0); $i = 1; while ($g3gene = $glimmer_3->next_prediction) { $i++; if ($i == 13) { is($g3gene->start, 13804); is($g3gene->end, 14781); is($g3gene->strand, -1); is($g3gene->frame, 0); is($g3gene->score, '5.51'); my ($orfid) = $g3gene->has_tag('Group') ? 
            $g3gene->get_tag_values('Group') : undef;
        is($orfid, 'GenePrediction_00015');
    }
}
is($i, 27);

# Glimmer 3.X (prokaryotic gene fragment)
my $glimmer_3a = Bio::Tools::Glimmer->new(
    '-file'   => test_input_file('glimmer3-fragment.predict'),
    '-detail' => test_input_file('glimmer3-fragment.detail'),
);
my $g3gene_a = $glimmer_3a->next_prediction;

ok($g3gene_a);

isa_ok $g3gene_a->location(), 'Bio::Location::Fuzzy';
is $g3gene_a->location->start_pos_type(), 'BEFORE';
is $g3gene_a->location->max_start(), 1;
is $g3gene_a->location->end_pos_type(), 'EXACT';
is $g3gene_a->location->end(), 674;
is $g3gene_a->frame(), 2;

for (1..3) {
    $g3gene_a = $glimmer_3a->next_prediction;
}

isa_ok $g3gene_a->location(), 'Bio::Location::Fuzzy';
is $g3gene_a->location->start_pos_type(), 'EXACT';
is $g3gene_a->location->start(), 2677;
is $g3gene_a->frame(), 0;
is $g3gene_a->location->end_pos_type(), 'AFTER';
is $g3gene_a->location->min_end(), 2932;
is $g3gene_a->score, '5.63';

# Fgenesh
my $fgh = Bio::Tools::Fgenesh->new(
    '-file' => test_input_file('fgenesh.out'),
);
my $fghgene = $fgh->next_prediction();

ok($fghgene);
is($fghgene->seq_id, 'gi|1914348|emb|Z81551.1|');
is($fghgene->source_tag, 'Fgenesh');
is($fghgene->start(), 29);
is($fghgene->end(), 1869);
cmp_ok($fghgene->strand(), '<', 0);

$i = 0;
@num_exons = (2,5,4,8);
while ($fghgene = $fgh->next_prediction()) {
    $i++;
    my @fghexons = $fghgene->exons();
    is(scalar(@fghexons), $num_exons[$i]);

    if ($i == 2) {
        cmp_ok($fghexons[0]->strand(), '>', 0);
        is($fghexons[0]->primary_tag(), 'InitialExon');
        is($fghexons[0]->start(), 14778);
        is($fghexons[0]->end(), 15104);
        cmp_ok($fghexons[3]->strand(), '>', 0);
        is($fghexons[3]->primary_tag(), 'TerminalExon');
        is($fghexons[3]->start(), 16988);
        is($fghexons[3]->end(), 17212);
    }
}
is($i, 3);

# File: BioPerl-1.7.8/t/Tools/Primer3.t
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 14);
    use_ok('Bio::Tools::Primer3');
}

my ($p3, $num, $primer);

ok $p3 = Bio::Tools::Primer3->new(-file => test_input_file('primer3_output.txt'));
ok $num = $p3->number_of_results;
is $num, 5 or diag "Got $num";

ok $num = $p3->all_results;
is defined $num, 1 or diag "Can't get all results";

ok $num = $p3->primer_results(1);
is defined $num, 1 or diag "Can't get results for 1";

ok $primer = $p3->next_primer;
isa_ok $primer, "Bio::Seq::PrimedSeq" or diag "reference for primer stream is not right";

# get the left primer
my $left_primer = $primer->get_primer('left');

# get the sequence for that primer. This is a test to verify behavior
# on the bioperl list in or about 050315
my $seqobj = $left_primer->seq();
my $seq = $seqobj->seq();

my $other_left_primer = $primer->get_primer();

# a different way to access the primers in the stream
my $alt = $p3->primer_results(0,'PRIMER_LEFT_INPUT');

# next one
ok $primer = $p3->next_primer;
# get the left primer
my $left_primer_seq = $primer->get_primer('left')->seq;
is $left_primer_seq->seq, "GAGGGTAACACGCTGGTCAT";

# bug 2862
ok $p3 = Bio::Tools::Primer3->new(-file => test_input_file('bug2862.pmr'));
$num = 0;
while ($p3->next_primer) { $num++ };
is $p3->number_of_results, $num, 'bug 2862';

# File: BioPerl-1.7.8/t/SeqIO/genbank.t
# -*-Perl-*- Test Harness script for Bioperl

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 306);

    use_ok('Bio::SeqIO::genbank');
}

my $verbose = test_debug;

my $ast = Bio::SeqIO->new(-format => 'genbank' ,
                          -verbose => $verbose,
                          -file => test_input_file('roa1.genbank'));
isa_ok($ast, 'Bio::SeqIO');
$ast->verbose($verbose);
my $as = $ast->next_seq;
is $as->molecule, 'mRNA',$as->accession_number;
is $as->alphabet, 'dna';
is $as->division, 'EST';
is join(',',$as->get_dates), '27-OCT-1998';
is($as->primary_id, 3598416);
my @class = $as->species->classification;
is $class[$#class],'Eukaryota';

$ast = Bio::SeqIO->new(-format => 'genbank',
-verbose => $verbose, -file => test_input_file('NT_021877.gbk')); $ast->verbose($verbose); $as = $ast->next_seq; is $as->molecule, 'DNA',$as->accession_number; is $as->alphabet, 'dna'; is $as->division, 'CON'; is join(',',$as->get_dates), '17-OCT-2003'; is($as->primary_id, 37539616); is($as->accession_number, 'NT_021877'); my ($cds) = grep { $_->primary_tag eq 'CDS' } $as->get_SeqFeatures; is(($cds->get_tag_values('transl_except'))[1], '(pos:complement(4224..4226),aa:OTHER)'); # test for a DBSOURCE line $ast = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('BAB68554.gb')); $ast->verbose($verbose); $as = $ast->next_seq; is $as->molecule, 'PRT',$as->accession_number; is $as->alphabet, 'protein'; is $as->division, 'VRT'; is join(',',$as->get_dates), '11-APR-2002'; # Though older GenBank releases indicate SOURCE contains only the common name, # this is no longer true. In general, this line will contain an abbreviated # form of the full organism name (but may contain the full length name), # as well as the optional common name and organelle. There is no get/set # for the abbreviated name but it is accessible via name() ok defined($as->species->name('abbreviated')->[0]); is $as->species->name('abbreviated')->[0], 'Aldabra giant tortoise'; is($as->primary_id, 15824047); my $ac = $as->annotation; ok defined $ac; my @dblinks = $ac->get_Annotations('dblink'); is(scalar @dblinks,1); is($dblinks[0]->database, 'GenBank'); is($dblinks[0]->primary_id, 'AB072353'); is($dblinks[0]->version, '1'); is($dblinks[0]->display_text, 'GenBank:AB072353.1','operator overloading in AnnotationI is deprecated'); # test for multi-line SOURCE $ast = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('NC_006346.gb')); $as = $ast->next_seq; is $as->species->binomial('FULL'), 'Bolitoglossa n. sp. 
RLM-2004',$as->accession_number;; @class = $as->species->classification; is($class[$#class],'Eukaryota'); is($as->species->common_name,'mushroomtongue salamander'); $ast = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('U71225.gb')); $as = $ast->next_seq; @class = $as->species->classification; is($class[$#class],'Eukaryota',$as->accession_number); is $as->species->common_name,'black-bellied salamander'; # test for unusual common name $ast = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('AB077698.gb')); $as = $ast->next_seq; # again, this is not a common name but is in name('abbreviated') ok defined($as->species->name('abbreviated')->[0]),$as->accession_number; is $as->species->name('abbreviated')->[0],'Homo sapiens cDNA to mRNA'; # test for common name with parentheses $ast = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('DQ018368.gb')); $as = $ast->next_seq; is $as->species->scientific_name,'(Populus tomentosa x P. bolleana) x P. tomentosa var. truncata', $as->accession_number;; # test secondary accessions my $seqio = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('D10483.gbk')); my $seq = $seqio->next_seq; my @kw = $seq->get_keywords; is(scalar @kw, 118, $seq->accession_number); is($kw[-1], 'yabO'); my @sec_acc = $seq->get_secondary_accessions; is(scalar @sec_acc,14); is($sec_acc[-1], 'X56742'); # bug #1487 my $str = Bio::SeqIO->new(-verbose => $verbose, -file => test_input_file('D12555.gbk')); eval { $seq = $str->next_seq; }; ok(! 
$@, 'bug 1487'); # bug 1647 rpt_unit sub-feature with multiple parens $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('mini-AE001405.gb')); ok($seq = $str->next_seq); my @rpts = grep { $_->primary_tag eq 'repeat_region' } $seq->get_SeqFeatures; is $#rpts, 2, 'bug 1647'; my @rpt_units = grep {$_->has_tag('rpt_unit')} @rpts; is $#rpt_units, 0; is(($rpt_units[0]->get_tag_values('rpt_unit'))[0],'(TG)10;A;(TG)7'); # test bug #1673 , RDB-II genbank files $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('Mcjanrna_rdbII.gbk') ); ok($seq = $str->next_seq, 'bug 1673'); my @refs = $seq->annotation->get_Annotations('reference'); is(@refs, 1); is($seq->display_id,'Mc.janrrnA'); is($seq->molecule ,'RNA'); is $as->division, 'PLN'; is join(',',$as->get_dates), '23-MAY-2005'; $str = Bio::SeqIO->new(-format => 'genbank', -file => test_input_file('AF165282.gb'), -verbose => $verbose); $seq = $str->next_seq; my @features = $seq->all_SeqFeatures; is(@features, 5, $seq->accession_number); is($features[0]->start, 1); is($features[0]->end, 226); my $location = $features[1]->location; ok($location->isa('Bio::Location::SplitLocationI')); my @sublocs = $location->sub_Location; is(@sublocs, 29); # version and primary ID - believe it or not, this wasn't working is ($seq->version, 1); is ($seq->seq_version, 1); is ($seq->primary_id, "5734104"); # streaming and Bio::RichSeq creation my $stream = Bio::SeqIO->new(-file => test_input_file('test.genbank'), -verbose => $verbose, -format => 'genbank'); $stream->verbose($verbose); my $seqnum = 0; my $species; my @cl; my $lasts; my @ids = qw(DDU63596 DDU63595 HUMBDNF); my @tids = (44689, 44689, 9606); my @tnames = ("Dictyostelium discoideum", "Dictyostelium discoideum", "Homo sapiens"); while($seq = $stream->next_seq) { if($seqnum < 3) { is $seq->display_id, $ids[$seqnum]; $species = $seq->species; @cl = $species->classification; is( $species->binomial, 
$tnames[$seqnum], 'species parsing incorrect for genbank'); is( $cl[3] ne $species->genus, 1, 'genus duplicated in genbank parsing'); is( $species->ncbi_taxid, $tids[$seqnum] ); } $seqnum++; $lasts = $seq; } is($seqnum, 5,'streaming'); is $lasts->display_id, "HUMBETGLOA"; my ($ref) = $lasts->annotation->get_Annotations('reference'); is($ref->medline, 94173918); $stream->close; $stream = Bio::SeqIO->new(-file => test_input_file('test.genbank.noseq'), -verbose => $verbose, -format => 'genbank' ); $seqnum = 0; while($seq = $stream->next_seq) { if($seqnum < 3) { is $seq->display_id, $ids[$seqnum]; } elsif( $seq->display_id eq 'M37762') { is( ($seq->get_keywords)[0], 'neurotrophic factor'); } $seqnum++; } is $seqnum, 5, "Total number of sequences in test file"; # fuzzy $seq = Bio::SeqIO->new( -file => test_input_file('testfuzzy.genbank'), -format => 'genbank', -verbose => $verbose ); ok(defined($as = $seq->next_seq)); @features = $as->all_SeqFeatures; is(@features,21,'Fuzzy in'); my $lastfeature = pop @features; # this is a split location; the root doesn't have strand is($lastfeature->strand, undef); $location = $lastfeature->location; #$location->verbose(-1); # silence the warning of undef seq_id() # see above; splitlocs roots do not have a strand really is($location->strand, undef); is($location->start, 83202); is($location->end, 84996); @sublocs = $location->sub_Location; is(@sublocs, 2); my $loc = shift @sublocs; is($loc->start, 83202); is($loc->end, 83329); is($loc->strand, -1); $loc = shift @sublocs; is($loc->start, 84248); is($loc->end, 84996); is($loc->strand,1); $seq = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => ">" .test_output_file); $seq->verbose($verbose); ok($seq->write_seq($as),'Fuzzy out'); ## now genbank ## $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('BK000016-tpa.gbk')); $seq = $str->next_seq; ok(defined $seq, $seq->accession_number); ok(defined $seq->seq); 
is($seq->accession_number, 'BK000016',$seq->accession_number); is($seq->alphabet, 'dna'); is($seq->display_id, 'BK000016'); is($seq->length, 1162); is($seq->division, 'ROD'); is($seq->get_dates, 1); is($seq->keywords, 'Third Party Annotation; TPA'); is($seq->desc, 'TPA: Mus musculus pantothenate kinase 4 mRNA, partial cds.'); is($seq->seq_version, 1); is($seq->feature_count, 2); my $spec_obj = $seq->species; is ($spec_obj->common_name, 'house mouse'); is ($spec_obj->species, 'musculus'); is ($spec_obj->genus, 'Mus'); is ($spec_obj->binomial, 'Mus musculus'); $ac = $seq->annotation; my $reference = ($ac->get_Annotations('reference') )[0]; is ($reference->pubmed, '11479594'); is ($reference->medline, '21372465',$seq->accession_number); # validate that what is written is what is read my $testfile = test_output_file; my $out = Bio::SeqIO->new(-file => ">$testfile", -format => 'genbank'); $out->write_seq($seq); $out->close; $str = Bio::SeqIO->new(-format => 'genbank', -file => $testfile); $seq = $str->next_seq; ok(defined $seq,'roundtrip'); ok(defined $seq->seq); is($seq->accession_number, 'BK000016'); is($seq->alphabet, 'dna'); is($seq->display_id, 'BK000016'); is($seq->length, 1162); is($seq->division, 'ROD'); is($seq->get_dates, 1); is($seq->keywords, 'Third Party Annotation; TPA'); is($seq->desc, 'TPA: Mus musculus pantothenate kinase 4 mRNA, partial cds.'); is($seq->seq_version, 1); is($seq->feature_count, 2); $spec_obj = $seq->species; is ($spec_obj->common_name, 'house mouse'); is ($spec_obj->species, 'musculus'); is ($spec_obj->genus, 'Mus'); is ($spec_obj->binomial, 'Mus musculus'); $ac = $seq->annotation; $reference = ($ac->get_Annotations('reference') )[0]; is ($reference->pubmed, '11479594'); is ($reference->medline, '21372465'); # write revcomp split location my $gb = Bio::SeqIO->new(-format => 'genbank', # This sequence has an odd LOCUS line which sets off a warning, setting # verbose to -1. # The newest Ensembl seq lacks this. Maybe update? 
cjfields 6-5-07 -verbose => $verbose ? $verbose : -1, -file => test_input_file('revcomp_mrna.gb')); $seq = $gb->next_seq; $gb = Bio::SeqIO->new(-format => 'genbank', -file => ">$testfile"); $gb->write_seq($seq); undef $gb; ok(! -z $testfile, 'revcomp split location'); # bug 1925, continuation of long ORGANISM line ends up in @classification: # ORGANISM Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC # 9150 # Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; # Enterobacteriaceae; Salmonella. $gb = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('NC_006511-short.gbk')); $seq = $gb->next_seq; is $seq->species->common_name, undef, "Bug 1925"; is $seq->species->scientific_name, "Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150"; @class = $seq->species->classification; is $class[$#class], "Bacteria"; # WGS tests $gb = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('O_sat.wgs')); $seq = $gb->next_seq; my @tests = ('wgs' => 'AAAA02000001-AAAA02050231', 'wgs_scafld' => 'CM000126-CM000137', 'wgs_scafld' => 'CH398081-CH401163'); my @wgs = map {$seq->annotation->get_Annotations(lc($_))} qw(WGS WGS_SCAFLD); my $ct=0; for my $wgs (@wgs) { my ($tagname, $value) = (shift @tests, shift @tests); is($wgs->tagname, $tagname, $tagname); is($wgs->value, $value); $ct++; } is ($ct, 3); # make sure we can retrieve a feature with a primary tag of 'misc_difference' $gb = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('BC000007.gbk')); $seq = $gb->next_seq; ($cds) = grep { $_->primary_tag eq 'misc_difference' } $seq->get_SeqFeatures; my @vals = $cds->get_tag_values('gene'); is $vals[0], 'PX19', $seq->accession_number; # Check that the source,organism section is identical between input and output. 
# - test an easy one where organism is species, then two different formats of # subspecies, then a species with a format that used to be mistaken for # subspecies, then a bacteria with no genus, and finally a virus with a genus. # These tests are now somewhat out-of-date since we are moving to a Bio::Taxon- # based system for verifying taxonomic information. Right now they just verify # changes so are really useless; I will change them to verify common name, # organelle, scientific name, etc. my $outfile = test_output_file; # output always adds a period (GenBank std), but two of these files do not use them. foreach my $in ('BK000016-tpa.gbk', 'ay116458.gb', 'ay149291.gb', 'NC_006346.gb', 'ay007676.gb', 'dq519393.gb') { my $infile = test_input_file($in); $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => $infile); $seq = $str->next_seq; $out = Bio::SeqIO->new(-file => ">$outfile", -format => 'genbank'); $out->write_seq($seq); $out->close; open my $IN, '<', $infile or die "Could not read file '$infile': $!\n"; my @in = <$IN>; close $IN; open my $RESULT, '<', $outfile or die "Could not read file '$outfile': $!\n"; my $line = 0; my $check = 0; my $is = 1; FILECHECK: while (my $result = <$RESULT>) { if ($result =~ /^KEYWORDS/) { $check = 1; next; } if ($result =~ /^REFERENCE/) { last FILECHECK; } if ($check) { # end periods don't count (not all input files have them) $result =~ s{\.$}{}; $in[$line] =~ s{\.$}{}; if ($result ne $in[$line]) { $is = 0; last; } } } continue { $line++; } close $RESULT; ok $is, $in; } # NB: there should probably be full testing on all lines to ensure that output # matches input. 
# 20061117: problem with *double* colon in some annotation-dblink values $ct = 0; foreach my $in ('P35527.gb') { my $infile = test_input_file($in); $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => $infile); $seq = $str->next_seq; my $ac = $seq->annotation; # Bio::AnnotationCollection foreach my $key ($ac->get_all_annotation_keys ) { my @values = $ac->get_Annotations($key); foreach my $ann (@values) { my $value = $ann->display_text; $ct++; if ($key eq 'dblink') { ok (index($value,'::') < 0); # this should never be true ok ($value, $value); # check value is not empty # print " ann/", sprintf('%12s ',$key), '>>>', $value , '<<<', "\n"; # print " index double colon: ",index($value ,'::'), "\n"; # check db name: my @parts = split(/:/,$value); if ( $parts[0] =~ /^(?: # not an exhaustive list of databases; # just the db's referenced in P35527.gb: swissprot | GenBank | GenPept | HSSP | IntAct | Ensembl | KEGG | HGNC | MIM | ArrayExpress | GO | InterPro | Pfam | PRINTS | PROSITE )$/x ) { ok 1; } else { ok 0; } ok ( $parts[1], "$parts[0]" ); } } } } is($ct, 46); # bug 2195 $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('AF305198.gb') ); $species = $str->next_seq->species; is($species->scientific_name, 'Virginia creeper phytoplasma', 'Bug 2195'); is(join(', ',$species->classification), 'Virginia creeper phytoplasma, 16SrV (Elm yellows group), ' . 'Candidatus Phytoplasma, Acholeplasmataceae, Acholeplasmatales, ' . 
'Mollicutes, Firmicutes, Bacteria', 'Bug 2195'); # bug 2569, PROJECT line support, read and write, round-tripping $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('NC_008536.gb')); $seq = $str->next_seq; my $project = ($seq->annotation->get_Annotations('project'))[0]; isa_ok($project, 'Bio::Annotation::SimpleValue'); if ($project) { is($project->value, 'GenomeProject:12638'); } else { ok(0, "PROJECT not parsed"); } $outfile = test_output_file; $gb = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => ">$outfile"); $gb->write_seq($seq); $str = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => $outfile); $seq = $str->next_seq; $project = ($seq->annotation->get_Annotations('project'))[0]; isa_ok($project, 'Bio::Annotation::SimpleValue'); if ($project) { is($project->value, 'GenomeProject:12638'); } else { ok(0, "Roundtrip test failed"); } # test for swissprot/UniProt/UniProtKB DBSOURCE line (Bug : RT 44536) $ast = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('P39765.gb')); $as = $ast->next_seq; is $as->molecule, 'PRT',$as->accession_number; is $as->division, 'BCT',$as->accession_number; is join(',',$as->get_dates), '03-MAR-2009',$as->accession_number; is $as->alphabet, 'protein'; # Though older GenBank releases indicate SOURCE contains only the common name, # this is no longer true. In general, this line will contain an abbreviated # form of the full organism name (but may contain the full length name), # as well as the optional common name and organelle. 
There is no get/set # for the abbreviated name but it is accessible via name() ok defined($as->species->name('abbreviated')->[0]); is $as->species->name('abbreviated')->[0], 'Bacillus subtilis'; is($as->primary_id, 20141743); $ac = $as->annotation; ok defined $ac; @dblinks = $ac->get_Annotations('dblink'); is(scalar @dblinks,31); is($dblinks[0]->database, 'UniProtKB'); is($dblinks[0]->primary_id, 'PYRR_BACSU'); is($dblinks[0]->version, undef); is($dblinks[0]->display_text, 'UniProtKB:PYRR_BACSU','operator overloading in AnnotationI is deprecated'); #bug 2982 embl/genbank contig handling $ast = Bio::SeqIO->new( -file => test_input_file('bug2982.gb'), -format => 'genbank' ); $seq = $ast->next_seq; ok my @ctg = $seq->annotation->get_Annotations('contig'); like $ctg[0]->value, qr/join\(.*?gap.*?complement/; # write_seq() and FTHelper duplicate specific tags, need to check a round-trip $ast = Bio::SeqIO->new(-format => 'genbank' , -verbose => $verbose, -file => test_input_file('singlescore.gbk')); $as = $ast->next_seq; ($cds) = grep { $_->primary_tag eq 'CDS' } $as->get_SeqFeatures; my @notes = $cds->get_tag_values('note'); is(scalar @notes, 2); $testfile = test_output_file; $out = Bio::SeqIO->new(-file => ">$testfile", -format => 'genbank'); $out->write_seq($as); $out->close; $ast = Bio::SeqIO->new(-format => 'genbank' , -verbose => $verbose, -file => $testfile ); $as = $ast->next_seq; ($cds) = grep { $_->primary_tag eq 'CDS' } $as->get_SeqFeatures; @notes = $cds->get_tag_values('note'); is(scalar @notes, 2); #bug 3375 my $in = Bio::SeqIO->new(-format => 'genbank', -file => test_input_file('NC_002058_multDBLINK_bug3375.gb')); $seq = $in->next_seq; # should not throw a warning now @dblinks = $seq->annotation->get_Annotations('dblink'); # contains 5 dblink references # testing DBLINK BioProject: PRJNA15288 is($dblinks[0]->database, 'BioProject', 'bug3375 database is BioProject'); is($dblinks[0]->primary_id, 'PRJNA15288', 'bug3375 primary_id is PRJNA15288'); # testing 
DBLINK Project:100,200,300 is($dblinks[3]->database, 'Project'); is($dblinks[3]->primary_id, '300'); # testing DBLINK NC_002058.3 is($dblinks[4]->database, 'GenBank'); is($dblinks[4]->primary_id, 'NC_002058'); is($dblinks[4]->version, '3'); # long labels handled { # Create sequence with feature with a long label qualifier my $seq=Bio::Seq->new(-seq => 'actg', -id => 'abacab'); my $feature=Bio::SeqFeature::Generic->new(-primary=>'CDS', -start=>1, -end=>4); my $label='1 2 3 4 5 6 7 8 9 a b c d e f g h i j k l m n o p q r'; $feature->add_tag_value(note => $label); $seq->add_SeqFeature($feature); # Write genbank my $string; open my $str_fh, '>', \$string or skip("Could not write string, skipping", 2); my $out = Bio::SeqIO->new(-format => 'genbank', -fh => $str_fh); $out->write_seq($seq); # Read genbank my $in = Bio::SeqIO->new(-format => 'genbank', -string => $string); my $genbank = $in->next_seq; my ($read_feature) = $genbank->get_SeqFeatures; my ($read_label) = $read_feature->get_tag_values('note'); is($read_label, $label, 'Label is the same'); } # bug 3448 (redmine) $in = Bio::SeqIO->new(-format => 'genbank', -file => test_input_file('YP_007988852.gp'), -verbose => $verbose); $seq = $in->next_seq; # should not throw a warning now is($seq->length, 205); my @anns = $seq->annotation->get_Annotations('contig'); is(@anns, 1); isa_ok($anns[0], 'Bio::Annotation::SimpleValue'); is($anns[0]->value, 'join(WP_015639704.1:1..205)'); is($seq->seq, 'MENRKFGYIRVSSKDQNEGRQLEAMRKIGITERDIYLDKQSGKNFERANYQLLKRIIRKGDI' . 'LYIHSLDRFGRNKEEILQEWNDLTKNIEADIVVLDMPLLDTTQYKDSMGTFIADLVLQILSWMAEEERERIRK' . 
'RQREGIDLALQNGIQFGRSPVVVSDEFKEVYRKWKAKELTAVEAMQEAGVKKTSFYKLVKAHENSIKVNS'); # Genbank files with CONTIG and sequence should print the sequence with write_seq() $testfile = test_output_file; $out = Bio::SeqIO->new(-file => ">$testfile", -format => 'genbank'); $out->write_seq($seq); $out->close; $in = Bio::SeqIO->new(-file => $testfile, -format => 'genbank', -verbose => $verbose); $seq = $in->next_seq; is($seq->length, 205); @anns = $seq->annotation->get_Annotations('contig'); is(@anns, 1); isa_ok($anns[0], 'Bio::Annotation::SimpleValue'); is($anns[0]->value, 'join(WP_015639704.1:1..205)'); is($seq->seq, 'MENRKFGYIRVSSKDQNEGRQLEAMRKIGITERDIYLDKQSGKNFERANYQLLKRIIRKGDI' . 'LYIHSLDRFGRNKEEILQEWNDLTKNIEADIVVLDMPLLDTTQYKDSMGTFIADLVLQILSWMAEEERERIRK' . 'RQREGIDLALQNGIQFGRSPVVVSDEFKEVYRKWKAKELTAVEAMQEAGVKKTSFYKLVKAHENSIKVNS'); $seq = Bio::SeqIO->new(-format => 'genbank', -file => test_input_file('YP_007988852.gp') )->next_seq; @features = $seq->remove_SeqFeatures; is $#features, 10, 'Got 11 features'; $seq = Bio::SeqIO->new(-format => 'genbank', -file => test_input_file('YP_007988852.gp') )->next_seq; @features = $seq->remove_SeqFeatures('CDS'); is $#features, 0, 'Got 1 feature'; is $features[0]->primary_tag, 'CDS', 'Correct primary tag for feature'; @features = $seq->remove_SeqFeatures; is $#features, 9, 'Got 10 features'; # Handle Structured Comments in COMMENT section $seq = Bio::SeqIO->new(-format => 'genbank', -file => test_input_file('KF527485.gbk') )->next_seq; my $comment = ($seq->get_Annotations('comment') )[0]; like($comment->as_text, qr/^Comment:\s+##Assembly-Data-START##/m, "Got correct Structured Comment"); like($comment->as_text, qr/^Assembly Method :: Lasergene v. 
10/m, "Got correct Structured Comment");
like($comment->as_text, qr/^Sequencing Technology :: ABI37XL; Sanger dideoxy sequencing/m, "Got correct Structured Comment");
like($comment->as_text, qr/^##Assembly-Data-END##/m, "Got correct Structured Comment");

$seq = Bio::SeqIO->new(-format => 'genbank',
                       -file => test_input_file('HM138502.gbk') )->next_seq;
$comment = ($seq->get_Annotations('comment') )[0];
ok( $comment->as_text =~ /^Comment: Swine influenza A \(H1N1\) virus isolated during human swine flu outbreak of 2009/, "Got correct Structured Comment" );
like( $comment->as_text, qr/^##GISAID_EpiFlu\(TM\)Data-START##/m, "Got correct Structured Comment" );
like( $comment->as_text, qr/^Subtype :: H1N1/m, "Got correct Structured Comment");
like( $comment->as_text, qr/^##GISAID_EpiFlu\(TM\)Data-END##/m, "Got correct Structured Comment" );

# Issue #321 (github)
{
    # Create blank sequence
    my $seq=Bio::Seq->new(-seq => 'N' x 1200000, -id => 'abacab');
    my $feature= Bio::SeqFeature::Generic->new(-primary=>'tRNA', -start=>1123552, -end=>1123554);
    my $text='(pos:complement(1123552..1123554),aa:Leu,seq:caa)';
    $feature->add_tag_value(anticodon => $text);
    $seq->add_SeqFeature($feature);

    # Write genbank
    my $string;
    open my $str_fh, '>', \$string or skip("Could not write string, skipping", 2);
    my $out = Bio::SeqIO->new(-format => 'genbank', -fh => $str_fh, -verbose => -1);
    $out->write_seq($seq);
    like($string, qr/,$/m, 'unquoted labels wrap at comma');

    # Read genbank
    my $in = Bio::SeqIO->new(-format => 'genbank', -string => $string, -verbose => -1);
    my $genbank = $in->next_seq;
    my ($read_feature) = $genbank->get_SeqFeatures;
    my ($read_text) = $read_feature->get_tag_values('anticodon');
    is($read_text, $text, 'Label is the same');
}

# File: BioPerl-1.7.8/t/SeqIO/tinyseq.t
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 16,
               -requires_modules =>
               [qw(XML::Parser::PerlSAX XML::Writer)]);

    use_ok('Bio::SeqIO::tinyseq');
}

my $file = test_input_file('test.tseq');
my $outfile = test_output_file();

my $instream = Bio::SeqIO->new( -file => $file, -format => 'tinyseq' );
my $outstream = Bio::SeqIO->new( -file => ">$outfile", -format => 'tinyseq' );

my $seq = $instream->next_seq;
ok(defined $seq);
ok(defined $seq->seq);
is($seq->length, 5830);
is($seq->accession_number,'NM_002253');
ok($seq->species);
is($seq->species->binomial, 'Homo sapiens');
is($seq->species->ncbi_taxid, 9606);
$outstream->write_seq($seq);
undef $outstream;
ok(-s $outfile);

my $reread = Bio::SeqIO->new( -file => $outfile, -format => 'tinyseq' );
my $seq2 = $reread->next_seq;
ok($seq2);
ok($seq2->seq);
is($seq2->length, 5830);
is($seq2->accession_number, 'NM_002253');
ok($seq2->species);
is($seq2->species->binomial, 'Homo sapiens');
is($seq2->species->ncbi_taxid, 9606);

# File: BioPerl-1.7.8/t/SeqIO/Handler.t
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 561,
               -requires_module => 'Data::Stag');

    use_ok('Bio::SeqIO');
}

my $verbose = test_debug();

################################## GenBank ##################################

my $ast = Bio::SeqIO->new(-format => 'gbdriver' ,
                          -verbose => $verbose,
                          -file => test_input_file("roa1.genbank"));
$ast->verbose($verbose);
my $as = $ast->next_seq();
is $as->molecule, 'mRNA',$as->accession_number;
is $as->alphabet, 'dna';
is($as->primary_id, 3598416);
my @class = $as->species->classification;
is $class[$#class],'Eukaryota';

$ast = Bio::SeqIO->new(-format => 'gbdriver',
                       -verbose => $verbose,
                       -file => test_input_file("NT_021877.gbk"));
$ast->verbose($verbose);
$as = $ast->next_seq();
is $as->molecule, 'DNA',$as->accession_number;
is $as->alphabet, 'dna';
is($as->primary_id, 37539616);
is($as->accession_number, 'NT_021877');

my ($cds) = grep { $_->primary_tag eq 'CDS' }
$as->get_SeqFeatures(); is(($cds->get_tag_values('transl_except'))[1], '(pos:complement(4224..4226),aa:OTHER)'); # test for a DBSOURCE line $ast = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file("BAB68554.gb")); $ast->verbose($verbose); $as = $ast->next_seq(); is $as->molecule, 'linear',$as->accession_number;; is $as->alphabet, 'protein'; # Though older GenBank releases indicate SOURCE contains only the common name, # this is no longer true. In general, this line will contain an abbreviated # form of the full organism name (but may contain the full length name), # as well as the optional common name and organelle. There is no get/set # for the abbreviated name but it is accessible via name() ok defined($as->species->name('abbreviated')->[0]); is $as->species->name('abbreviated')->[0], 'Aldabra giant tortoise'; is($as->primary_id, 15824047); my $ac = $as->annotation; ok defined $ac; my @dblinks = $ac->get_Annotations('dblink'); is(scalar @dblinks,1); is($dblinks[0]->database, 'GenBank'); is($dblinks[0]->primary_id, 'AB072353'); is($dblinks[0]->version, '1'); is($dblinks[0]->display_text, 'GenBank:AB072353.1','operator overloading in AnnotationI is deprecated'); # test for multi-line SOURCE $ast = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file("NC_006346.gb")); $as = $ast->next_seq; is $as->species->binomial('FULL'), 'Bolitoglossa n. sp. 
RLM-2004',$as->accession_number;; @class = $as->species->classification; is($class[$#class],'Eukaryota'); is($as->species->common_name,'mushroomtongue salamander'); $ast = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file("U71225.gb")); $as = $ast->next_seq; @class = $as->species->classification; is($class[$#class],'Eukaryota',$as->accession_number); is $as->species->common_name,'black-bellied salamander'; # test for unusual common name $ast = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file("AB077698.gb")); $as = $ast->next_seq; # again, this is not a common name but is in name('abbreviated') ok defined($as->species->name('abbreviated')->[0]),$as->accession_number; is $as->species->name('abbreviated')->[0],'Homo sapiens cDNA to mRNA'; # test for common name with parentheses $ast = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file("DQ018368.gb")); $as = $ast->next_seq; is $as->species->scientific_name,'(Populus tomentosa x P. bolleana) x P. tomentosa var. truncata', $as->accession_number;; # test secondary accessions my $seqio = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file('D10483.gbk')); my $seq = $seqio->next_seq; my @kw = $seq->get_keywords; is(scalar @kw, 118, $seq->accession_number); is($kw[-1], 'yabO'); my @sec_acc = $seq->get_secondary_accessions(); is(scalar @sec_acc,14); is($sec_acc[-1], 'X56742'); # bug #1487 my $str = Bio::SeqIO->new(-verbose => $verbose, -file => test_input_file('D12555.gbk')); eval { $seq = $str->next_seq; }; ok(! 
$@, 'bug 1487'); # bug 1647 rpt_unit sub-feature with multiple parens $str = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file('mini-AE001405.gb')); ok($seq = $str->next_seq); my @rpts = grep { $_->primary_tag eq 'repeat_region' } $seq->get_SeqFeatures; is $#rpts, 2, 'bug 1647'; my @rpt_units = grep {$_->has_tag('rpt_unit')} @rpts; is $#rpt_units, 0; is(($rpt_units[0]->get_tag_values('rpt_unit'))[0],'(TG)10;A;(TG)7'); # test bug #1673 , RDB-II genbank files $str = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file('Mcjanrna_rdbII.gbk') ); ok($seq = $str->next_seq, 'bug 1673'); my @refs = $seq->annotation->get_Annotations('reference'); is(@refs, 1); is($seq->display_id,'Mc.janrrnA'); is($seq->molecule ,'RNA'); $str = Bio::SeqIO->new(-format => 'gbdriver', -file => test_input_file("AF165282.gb"), -verbose => $verbose); $seq = $str->next_seq; my @features = $seq->all_SeqFeatures(); is(@features, 5, $seq->accession_number); is($features[0]->start, 1); is($features[0]->end, 226); my $location = $features[1]->location; ok($location->isa('Bio::Location::SplitLocationI')); my @sublocs = $location->sub_Location(); is(@sublocs, 29); # version and primary ID - believe it or not, this wasn't working is ($seq->version, 1); is ($seq->seq_version, 1); is ($seq->primary_id, "5734104"); # streaming and Bio::RichSeq creation my $stream = Bio::SeqIO->new(-file => test_input_file("test.genbank"), -verbose => $verbose, -format => 'gbdriver'); $stream->verbose($verbose); my $seqnum = 0; my $species; my @cl; my $lasts; my @ids = qw(DDU63596 DDU63595 HUMBDNF); my @tids = (44689, 44689, 9606); my @tnames = ("Dictyostelium discoideum","Dictyostelium discoideum", "Homo sapiens"); while($seq = $stream->next_seq()) { if($seqnum < 3) { is $seq->display_id(), $ids[$seqnum]; $species = $seq->species(); @cl = $species->classification(); is( $species->binomial(), $tnames[$seqnum], 'species parsing incorrect for genbank'); is( 
$cl[3] ne $species->genus(), 1, 'genus duplicated in genbank parsing'); is( $species->ncbi_taxid, $tids[$seqnum] ); } $seqnum++; $lasts = $seq; } is($seqnum, 5,'streaming'); is $lasts->display_id(), "HUMBETGLOA"; my ($ref) = $lasts->annotation->get_Annotations('reference'); is($ref->medline, 94173918); $stream->close(); $stream = Bio::SeqIO->new(-file => test_input_file("test.genbank.noseq"), -verbose => $verbose, -format => 'gbdriver' ); $seqnum = 0; while($seq = $stream->next_seq()) { if($seqnum < 3) { is $seq->display_id(), $ids[$seqnum]; } elsif( $seq->display_id eq 'M37762') { is( ($seq->get_keywords())[0], 'neurotrophic factor'); } $seqnum++; } is $seqnum, 5, "Total number of sequences in test file"; # fuzzy $seq = Bio::SeqIO->new( -format => 'gbdriver', -verbose => $verbose, -file =>test_input_file("testfuzzy.genbank")); $seq->verbose($verbose); ok(defined($as = $seq->next_seq())); @features = $as->all_SeqFeatures(); is(@features,21,'Fuzzy in'); my $lastfeature = pop @features; # this is a split location; the root doesn't have strand is($lastfeature->strand, undef); $location = $lastfeature->location; #$location->verbose(-1); # silence the warning of undef seq_id() # see above; splitlocs roots do not have a strand really is($location->strand, undef); is($location->start, 83202); is($location->end, 84996); @sublocs = $location->sub_Location(); is(@sublocs, 2); my $loc = shift @sublocs; is($loc->start, 83202); is($loc->end, 83329); is($loc->strand, -1); $loc = shift @sublocs; is($loc->start, 84248); is($loc->end, 84996); is($loc->strand,1); my $outfile = test_output_file(); $seq = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file=> ">$outfile"); $seq->verbose($verbose); ok($seq->write_seq($as),'Fuzzy out'); ## now genbank ## $str = Bio::SeqIO->new(-format =>'gbdriver', -verbose => $verbose, -file => test_input_file('BK000016-tpa.gbk')); $seq = $str->next_seq; ok(defined $seq, $seq->accession_number); ok(defined $seq->seq); 
is($seq->accession_number, 'BK000016',$seq->accession_number); is($seq->alphabet, 'dna'); is($seq->display_id, 'BK000016'); is($seq->length, 1162); is($seq->division, 'ROD'); is($seq->get_dates, 1); is($seq->keywords, 'Third Party Annotation; TPA'); is($seq->desc, 'TPA: Mus musculus pantothenate kinase 4 mRNA, partial cds.'); is($seq->seq_version, 1); is($seq->feature_count, 2); my $spec_obj = $seq->species; is ($spec_obj->common_name, 'house mouse'); is ($spec_obj->species, 'musculus'); is ($spec_obj->genus, 'Mus'); is ($spec_obj->binomial, 'Mus musculus'); $ac = $seq->annotation; my $reference = ($ac->get_Annotations('reference') )[0]; is ($reference->pubmed, '11479594'); is ($reference->medline, '21372465',$seq->accession_number); # validate that what is written is what is read my $testfile = test_output_file(); my $out = Bio::SeqIO->new(-file => ">$testfile", -format => 'genbank'); $out->write_seq($seq); $out->close(); $str = Bio::SeqIO->new(-format =>'gbdriver', -file => $testfile); $seq = $str->next_seq; ok(defined $seq,'roundtrip'); ok(defined $seq->seq); is($seq->accession_number, 'BK000016'); is($seq->alphabet, 'dna'); is($seq->display_id, 'BK000016'); is($seq->length, 1162); is($seq->division, 'ROD'); is($seq->get_dates, 1); is($seq->keywords, 'Third Party Annotation; TPA'); is($seq->desc, 'TPA: Mus musculus pantothenate kinase 4 mRNA, partial cds.'); is($seq->seq_version, 1); is($seq->feature_count, 2); $spec_obj = $seq->species; is ($spec_obj->common_name, 'house mouse'); is ($spec_obj->species, 'musculus'); is ($spec_obj->genus, 'Mus'); is ($spec_obj->binomial, 'Mus musculus'); $ac = $seq->annotation; $reference = ($ac->get_Annotations('reference') )[0]; is ($reference->pubmed, '11479594'); is ($reference->medline, '21372465'); # write revcomp split location my $gb = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file('revcomp_mrna.gb')); $seq = $gb->next_seq(); $gb = Bio::SeqIO->new(-format => 'genbank', -file => 
">$testfile"); $gb->write_seq($seq); undef $gb; ok(! -z $testfile, 'revcomp split location'); # bug 1925, continuation of long ORGANISM line ends up in @classification: # ORGANISM Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC # 9150 # Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; # Enterobacteriaceae; Salmonella. $gb = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file('NC_006511-short.gbk')); $seq = $gb->next_seq; is $seq->species->common_name, undef, "Bug 1925"; is $seq->species->scientific_name, "Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150"; @class = $seq->species->classification; is $class[$#class], "Bacteria"; # WGS tests $gb = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file('O_sat.wgs')); $seq = $gb->next_seq; my @tests = ('wgs' => 'AAAA02000001-AAAA02050231', 'wgs_scafld' => 'CM000126-CM000137', 'wgs_scafld' => 'CH398081-CH401163'); my @wgs = map {$seq->annotation->get_Annotations(lc($_))} (qw(WGS WGS_SCAFLD)); my $ct=0; for my $wgs (@wgs) { my ($tagname, $value) = (shift @tests, shift @tests); is($wgs->tagname, $tagname, $tagname); is($wgs->value, $value); $ct++; } is ($ct, 3); # make sure we can retrieve a feature with a primary tag of 'misc_difference' $gb = Bio::SeqIO->new(-format => 'gbdriver', -verbose => $verbose, -file => test_input_file('BC000007.gbk')); $seq = $gb->next_seq; ($cds) = grep { $_->primary_tag eq 'misc_difference' } $seq->get_SeqFeatures; my @vals = $cds->get_tag_values('gene'); is $vals[0], 'PX19', $seq->accession_number; # Check that the source,organism section is identical between input and output. # - test an easy one where organism is species, then two different formats of # subspecies, then a species with a format that used to be mistaken for # subspecies, then a bacteria with no genus, and finally a virus with a genus. 
# These tests are now somewhat out-of-date since we are moving to a Bio::Taxon- # based system for verifying taxonomic information. Right now they just verify # changes so are really useless; I will change them to verify common name, # organelle, scientific name, etc. # output always adds a period (GenBank std), but two of these files do not use them. foreach my $in ('BK000016-tpa.gbk', 'ay116458.gb', 'ay149291.gb', 'NC_006346.gb', 'ay007676.gb', 'dq519393.gb') { my $infile = test_input_file($in); $outfile = test_output_file(); $str = Bio::SeqIO->new(-format =>'genbank', -verbose => $verbose, -file => $infile); $seq = $str->next_seq; $out = Bio::SeqIO->new(-file => ">$outfile", -format => 'genbank'); $out->write_seq($seq); $out->close(); open my $IN, '<', $infile or die "Could not read file '$infile': $!\n"; my @in = <$IN>; close $IN; open my $RESULT, '<', $outfile or die "Could not read file '$outfile': $!\n"; my $line = 0; my $check = 0; my $is = 1; FILECHECK: while (my $result = <$RESULT>) { if ($result =~ /^KEYWORDS/) { $check = 1; next; } if ($result =~ /^REFERENCE/) { last FILECHECK; } if ($check) { # end periods don't count (not all input files have them) $result =~ s{\.$}{}; $in[$line] =~ s{\.$}{}; if ($result ne $in[$line]) { $is = 0; last; } } } continue { $line++ } close $RESULT; ok $is, $in; } # NB: there should probably be full testing on all lines to ensure that output # matches input. 
# 20061117: problem with *double* colon in some annotation-dblink values $ct = 0; foreach my $in ('P35527.gb') { my $infile = test_input_file($in); $str = Bio::SeqIO->new(-format =>'genbank', -verbose => $verbose, -file => $infile); $seq = $str->next_seq; my $ac = $seq->annotation(); # Bio::AnnotationCollection foreach my $key ($ac->get_all_annotation_keys() ) { my @values = $ac->get_Annotations($key); foreach my $ann (@values) { my $value = $ann->display_text; $ct++; if ($key eq 'dblink') { ok (index($value,'::') < 0); # this should never be true ok ($value, $value); # check value is not empty # print " ann/", sprintf('%12s ',$key), '>>>', $value , '<<<', "\n"; # print " index double colon: ",index($value ,'::'), "\n"; # check db name: my @parts = split(/:/,$value); if ( $parts[0] =~ /^(?: # not an exhaustive list of databases; # just the db's referenced in P35527.gb: swissprot | GenBank | GenPept | HSSP| IntAct | Ensembl | KEGG | HGNC | MIM | ArrayExpress | GO | InterPro | Pfam| PRINTS | PROSITE )$/x ) { ok 1; } else { ok 0; } ok ( $parts[1], "$parts[0]" ); } # elsif ($key eq 'reference') { } } } } is($ct, 46); # bug 2195 $str = Bio::SeqIO->new(-format =>'gbdriver', -verbose => $verbose, -file => test_input_file('AF305198.gb') ); $species = $str->next_seq->species; is($species->scientific_name, 'Virginia creeper phytoplasma', 'Bug 2195'); is(join(', ',$species->classification), 'Virginia creeper phytoplasma, '. '16SrV (Elm yellows group), Candidatus Phytoplasma, '. 'Acholeplasmataceae, Acholeplasmatales, Mollicutes, '. 
'Firmicutes, Bacteria', 'Bug 2195'); # bug 2569, PROJECT line support, read and write, round-tripping $str = Bio::SeqIO->new(-format =>'gbdriver', -verbose => $verbose, -file => test_input_file('NC_008536.gb')); $seq = $str->next_seq; my $project = ($seq->annotation->get_Annotations('project'))[0]; isa_ok($project, 'Bio::Annotation::SimpleValue'); if ($project) { is($project->value, 'GenomeProject:12638'); } else { ok(0, "PROJECT not parsed"); } $outfile = test_output_file(); $gb = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file=> ">$outfile"); $gb->write_seq($seq); $str = Bio::SeqIO->new(-format =>'gbdriver', -verbose => $verbose, -file => $outfile); $seq = $str->next_seq; $project = ($seq->annotation->get_Annotations('project'))[0]; isa_ok($project, 'Bio::Annotation::SimpleValue'); if ($project) { is($project->value, 'GenomeProject:12638'); } else { ok(0, "Roundtrip test failed"); } ################################## EMBL ################################## # Set to -1 for release version, so warnings aren't printed $ast = Bio::SeqIO->new( -format => 'embldriver', -verbose => $verbose, -file => test_input_file("roa1.dat")); $ast->verbose($verbose); $as = $ast->next_seq(); ok defined $as->seq; is($as->display_id, 'HSHNCPA1'); is($as->accession_number, 'X79536'); is($as->seq_version, 1); is($as->version, 1); is($as->desc, 'H.sapiens mRNA for hnRNPcore protein A1'); is($as->molecule, 'RNA'); is($as->alphabet, 'rna'); is(scalar $as->all_SeqFeatures(), 4); is($as->length, 1198); is($as->species->binomial(), 'Homo sapiens'); is($as->get_dates, 2); # EMBL Release 87 changes (8-17-06) $ast = Bio::SeqIO->new( -format => 'embldriver', -verbose => $verbose, -file => test_input_file("roa1_v2.dat")); $ast->verbose($verbose); $as = $ast->next_seq(); ok defined $as->seq; # accession # same as display name now is($as->display_id, 'X79536'); is($as->get_dates, 2); is($as->accession_number, 'X79536'); is($as->seq_version, 1); is($as->version, 1); is($as->desc, 
'H.sapiens mRNA for hnRNPcore protein A1'); # mRNA instead of RNA is($as->molecule, 'mRNA'); is($as->alphabet, 'rna'); is(scalar $as->all_SeqFeatures(), 4); is($as->length, 1198); is($as->species->binomial(), 'Homo sapiens'); my $ent = Bio::SeqIO->new( -file => test_input_file("test.embl"), -format => 'embldriver'); $seq = $ent->next_seq(); is(defined $seq->seq(), 1, 'success reading Embl with ^ location and badly split double quotes'); is(scalar $seq->annotation->get_Annotations('reference'), 3); is($seq->get_dates, 0); $out = Bio::SeqIO->new(-file=> ">$outfile", -format => 'embl'); is($out->write_seq($seq),1, 'success writing Embl format with ^ < and > locations'); # embl with no FT $ent = Bio::SeqIO->new( -file => test_input_file("test.embl"), -format => 'embldriver'); $seq = $ent->next_seq(); ok($seq); is(lc($seq->subseq(1,10)),'gatcagtaga'); is($seq->length, 4870); # embl with no FH my $noFH = Bio::SeqIO->new(-file => test_input_file("no_FH.embl"), -format => 'embldriver'); $seq = $noFH->next_seq; is(scalar($seq->get_SeqFeatures), 4); is($seq->display_id, 'AE000001'); is($seq->get_dates, 0); # bug 1571 $ent = Bio::SeqIO->new(-format => 'embldriver', -file => test_input_file('test.embl2sq')); $seq = $ent->next_seq; is($seq->length,4870); is($seq->get_dates, 0); # embl repbase $ent = Bio::SeqIO->new(-file => test_input_file("BEL16-LTR_AG.embl"), -format => 'embldriver'); $seq = $ent->next_seq; is($seq->display_id,'BEL16-LTR_AG'); is($seq->get_dates, 2); # test secondary accessions in EMBL (bug #1332) $seqio = Bio::SeqIO->new(-format => 'embldriver', -file => test_input_file('ECAPAH02.embl')); $seq = $seqio->next_seq; is($seq->accession_number, 'D10483'); is($seq->seq_version, 2); my @accs = $seq->get_secondary_accessions(); is($accs[0], 'J01597'); is($accs[-1], 'X56742'); is($seq->get_dates, 2); ### TPA TESTS - Thanks to Richard Adams ### # test Third Party Annotation entries in EMBL/Gb format # to ensure compatibility with parsers. 
$str = Bio::SeqIO->new(-verbose => $verbose, -format =>'embldriver', -file => test_input_file('BN000066-tpa.embl')); $seq = $str->next_seq; ok(defined $seq); is($seq->accession_number, 'BN000066'); is($seq->alphabet, 'dna'); is($seq->display_id, 'AGA000066'); is($seq->length, 5195); is($seq->division, 'INV'); is($seq->keywords, 'acetylcholinesterase; achE1 gene; Third Party Annotation; TPA'); is($seq->seq_version, 1); is($seq->feature_count, 15); is($seq->get_dates, 2); $spec_obj = $seq->species; is ($spec_obj->common_name, 'African malaria mosquito'); is ($spec_obj->species, 'gambiae'); is ($spec_obj->genus, 'Anopheles'); is ($spec_obj->binomial, 'Anopheles gambiae'); $ac = $seq->annotation; $reference = ($ac->get_Annotations('reference') )[1]; is ($reference->title,'"A novel acetylcholinesterase gene in mosquitoes codes for the insecticide target and is non-homologous to the ace gene in Drosophila"'); is ($reference->authors,'Weill M., Fort P., Berthomi eu A., Dubois M.P., Pasteur N., Raymond M.'); my $cmmnt = ($ac->get_Annotations('comment') )[0]; is($cmmnt->text, 'see also AJ488492 for achE-1 from Kisumu strain Third Party Annotation Database: This TPA record uses Anopheles gambiae trace archive data (http://trace.ensembl.org)'); $ent = Bio::SeqIO->new( -file => test_input_file("test.embl"), -format => 'embldriver'); $ent->verbose($verbose); $seq = $ent->next_seq(); $species = $seq->species(); @cl = $species->classification(); is( $cl[3] ne $species->genus(), 1, 'genus duplication test'); $ent->close(); # ## read-write - test embl writing of a PrimarySeq # my $primaryseq = Bio::PrimarySeq->new( -seq => 'AGAGAGAGATA', -id => 'myid', -desc => 'mydescr', -alphabet => 'DNA', -accession_number => 'myaccession'); $verbose = -1 unless $ENV{'BIOPERLDEBUG'}; # silence warnings unless we are debugging my $embl = Bio::SeqIO->new(-format => 'embl', -verbose => $verbose, -file => ">$outfile"); ok($embl->write_seq($primaryseq)); # this should generate a warning my $scalar = 
"test"; eval { $embl->write_seq($scalar); }; ok ($@); ############################## Swiss/UniProt ############################## $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swissdriver', -file => test_input_file('test.swiss')); isa_ok($seqio, 'Bio::SeqIO'); $seq = $seqio->next_seq; my @gns = $seq->annotation->get_Annotations('gene_name'); $outfile = test_output_file(); $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => ">$outfile"); $seqio->write_seq($seq); # reads it in once again $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swissdriver', -file => $outfile); $seq = $seqio->next_seq; isa_ok($seq->species, 'Bio::Species'); is($seq->species->ncbi_taxid, 6239); # version, seq_update, dates (5 tests) is($seq->version, 40); my ($ann) = $seq->annotation->get_Annotations('seq_update'); eval {is($ann->display_text, 35,'operator overloading in AnnotationI is deprecated')}; ok(!$@); my @dates = $seq->get_dates; my @date_check = qw(01-NOV-1997 01-NOV-1997 16-OCT-2001); for my $date (@dates) { my $expdate = shift @date_check; is($date, $expdate,'dates'); } my @gns2 = $seq->annotation->get_Annotations('gene_name'); # check gene name is preserved (was losing suffix in worm gene names) ok($#gns2 == 0 && $gns[0]->value eq $gns2[0]->value); # test swissprot multiple RP lines $str = Bio::SeqIO->new(-file => test_input_file('P33897')); $seq = $str->next_seq; isa_ok($seq, 'Bio::Seq::RichSeqI'); @refs = $seq->annotation->get_Annotations('reference'); is( @refs, 23); is($refs[20]->rp, 'VARIANTS X-ALD LEU-98; ASP-99; GLU-217; GLN-518; ASP-608; ILE-633 AND PRO-660, AND VARIANT THR-13.'); # version, seq_update, dates (5 tests) is($seq->version, 44); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 28,'operator overloading in AnnotationI is deprecated'); @dates = $seq->get_dates; @date_check = qw(01-FEB-1994 01-FEB-1994 15-JUN-2004); for my $date (@dates) { is($date, shift @date_check); } $ast = 
Bio::SeqIO->new(-verbose => $verbose, -format => 'swissdriver' , -file => test_input_file("roa1.swiss")); $as = $ast->next_seq(); ok defined $as->seq; is($as->id, 'ROA1_HUMAN', "id is ".$as->id); like($as->primary_id, qr(Bio::PrimarySeq)); is($as->length, 371); is($as->alphabet, 'protein'); is($as->division, 'HUMAN'); is(scalar $as->all_SeqFeatures(), 16); is(scalar $as->annotation->get_Annotations('reference'), 11); # version, seq_update, dates (5 tests) is($as->version, 35); ($ann) = $as->annotation->get_Annotations('seq_update'); is($ann->display_text, 15,'operator overloading in AnnotationI is deprecated'); @dates = $as->get_dates; @date_check = qw(01-MAR-1989 01-AUG-1990 01-NOV-1997); for my $date (@dates) { is($date, shift @date_check); } ($ent,$out) = undef; ($as,$seq) = undef; $seqio = Bio::SeqIO->new(-format => 'swissdriver' , -verbose => $verbose, -file => test_input_file("swiss.dat")); $seq = $seqio->next_seq; isa_ok($seq, 'Bio::Seq::RichSeqI'); # more tests to verify we are actually parsing correctly like($seq->primary_id, qr(Bio::PrimarySeq)); is($seq->display_id, 'MA32_HUMAN'); is($seq->length, 282); is($seq->division, 'HUMAN'); is($seq->alphabet, 'protein'); my @f = $seq->all_SeqFeatures(); is(@f, 2); is($f[1]->primary_tag, 'CHAIN'); is(($f[1]->get_tag_values('description'))[0], 'COMPLEMENT COMPONENT 1, Q SUBCOMPONENT BINDING PROTEIN'); # version, seq_update, dates (5 tests) is($seq->version, 40); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 31,'operator overloading in AnnotationI is deprecated'); @dates = $seq->get_dates; @date_check = qw(01-FEB-1995 01-FEB-1995 01-OCT-2000); for my $date (@dates) { is($date, shift @date_check); } my @genenames = qw(GC1QBP HABP1 SF2P32 C1QBP); ($ann) = $seq->annotation->get_Annotations('gene_name'); # use Data::Stag findval and element name to get values/nodes foreach my $gn ( $ann->findval('Name') ) { ok ($gn, shift(@genenames)); } foreach my $gn ( $ann->findval('Synonyms') ) { 
ok ($gn, shift(@genenames)); } like($ann->value, qr/Name: GC1QBP/); # test for feature locations like ?..N $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); is($seq->display_id, 'ACON_CAEEL'); is($seq->length, 788); is($seq->division, 'CAEEL'); is($seq->alphabet, 'protein'); is(scalar $seq->all_SeqFeatures(), 5); foreach my $gn ( $seq->annotation->get_Annotations('gene_name') ) { ok ($gn->value, 'F54H12.1'); } # test species in swissprot -- this can be a n:n nightmare $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); @sec_acc = $seq->get_secondary_accessions(); is($sec_acc[0], 'P29360'); is($sec_acc[1], 'Q63631'); is($seq->accession_number, 'P42655'); @kw = $seq->get_keywords; is( $kw[0], 'Brain'); is( $kw[1], 'Neurone'); is($kw[3], 'Multigene family'); is($seq->display_id, '143E_HUMAN'); # hybrid names from old sequences are no longer valid, these are chopped # off at the first organism is($seq->species->binomial, "Homo sapiens"); is($seq->species->common_name, "Human"); is($seq->species->ncbi_taxid, 9606); $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); is($seq->species->binomial, "Bos taurus"); is($seq->species->common_name, "Bovine"); is($seq->species->ncbi_taxid, 9913); # multiple genes in swissprot $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); ($ann) = $seq->annotation->get_Annotations("gene_name"); @genenames = qw(CALM1 CAM1 CALM CAM CALM2 CAM2 CAMB CALM3 CAM3 CAMC); my $flatnames = "(CALM1 OR CAM1 OR CALM OR CAM) AND (CALM2 OR CAM2 OR CAMB) AND (CALM3 OR CAM3 OR CAMC)"; my @names = @genenames; # copy array my @ann_names = $ann->get_all_values(); is(scalar(@ann_names), scalar(@names)); # do this in a layered way (nested tags) for my $node ($ann->findnode('gene_name')) { for my $name ($node->findval('Name')) { is($name, 
shift(@names)); } for my $name ($node->findval('Synonyms')) { is($name, shift(@names)); } } is(scalar(@names),0); # same entry as before, but with the new gene names format $seqio = Bio::SeqIO->new(-format => 'swissdriver', -verbose => $verbose, -file => test_input_file("calm.swiss")); $seq = $seqio->next_seq(); isa_ok($seq, 'Bio::Seq::RichSeqI'); like($seq->primary_id, qr(Bio::PrimarySeq)); ($ann) = $seq->annotation->get_Annotations("gene_name"); @names = @genenames; # copy array my @ann_names2 = $ann->get_all_values(); #emulate StructuredValue's flattened array is(scalar(@ann_names2), scalar(@names)); for my $node ($ann->findnode('gene_name')) { for my $name ($node->findval('Name')) { is($name, shift(@names)); } for my $name ($node->findval('Synonyms')) { is($name, shift(@names)); } } is(scalar(@names),0); # test proper parsing of references my @litrefs = $seq->annotation->get_Annotations('reference'); is(scalar(@litrefs), 17); my @titles = ( '"Complete amino acid sequence of human brain calmodulin."', '"Multiple divergent mRNAs code for a single human calmodulin."', '"Molecular analysis of human and rat calmodulin complementary DNA clones. 
Evidence for additional active genes in these species."', '"Isolation and nucleotide sequence of a cDNA encoding human calmodulin."', '"Structure of the human CALM1 calmodulin gene and identification of two CALM1-related pseudogenes CALM1P1 and CALM1P2."', undef, '"Characterization of the human CALM2 calmodulin gene and comparison of the transcriptional activity of CALM1, CALM2 and CALM3."', '"Cloning of human full-length CDSs in BD Creator(TM) system donor vector."', '"The DNA sequence and analysis of human chromosome 14."', '"Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences."', '"Alpha-helix nucleation by a calcium-binding peptide loop."', '"Solution structure of Ca(2+)-calmodulin reveals flexible hand-like properties of its domains."', '"Calmodulin structure refined at 1.7 A resolution."', '"Drug binding by calmodulin: crystal structure of a calmodulin-trifluoperazine complex."', '"Structural basis for the activation of anthrax adenylyl cyclase exotoxin by calmodulin."', '"Physiological calcium concentrations regulate calmodulin binding and catalysis of adenylyl cyclase exotoxins."', '"Crystal structure of a MARCKS peptide containing the calmodulin-binding domain in complex with Ca2+-calmodulin."', ); my @locs = ( "Biochemistry 21:2565-2569(1982).", "J. Biol. Chem. 263:17055-17062(1988).", "J. Biol. Chem. 262:16663-16670(1987).", "Biochem. Int. 9:177-185(1984).", "Eur. J. Biochem. 225:71-82(1994).", "Submitted (FEB-1995) to the EMBL/GenBank/DDBJ databases.", "Cell Calcium 23:323-338(1998).", "Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases.", "Nature 421:601-607(2003).", "Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).", "Proc. Natl. Acad. Sci. U.S.A. 96:903-908(1999).", "Nat. Struct. Biol. 8:990-997(2001).", "J. Mol. Biol. 228:1177-1192(1992).", "Biochemistry 33:15259-15265(1994).", "Nature 415:396-402(2002).", "EMBO J. 21:6721-6732(2002).", "Nat. Struct. Biol. 
10:226-231(2003).", ); my @positions = ( undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, undef, 94, 103, 1, 76, undef, undef, undef, undef, 5, 148, 1, 148, undef, undef, ); foreach my $litref (@litrefs) { is($litref->title, shift(@titles)); is($litref->location, shift(@locs)); is($litref->start, shift(@positions)); is($litref->end, shift(@positions)); } # format parsing changes (pre-rel 9.0) $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swissdriver', -file => test_input_file('pre_rel9.swiss')); ok($seqio); $seq = $seqio->next_seq; isa_ok($seq->species, 'Bio::Species'); is($seq->species->ncbi_taxid, "6239"); # version, seq_update, dates (5 tests) is($seq->version, 44); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 1); @dates = $seq->get_dates; @date_check = qw(01-NOV-1997 01-NOV-1996 30-MAY-2006 ); for my $date (@dates) { is($date, shift @date_check); } my @idcheck = qw(Z66513 T22647 Cel.30446 Q06319 Q20772 F54D5.7 WBGene00010052 F54D5.7 GO:0005515 IPR006089 IPR006091 IPR006090 IPR006092 IPR009075 IPR009100 IPR013764 PF00441 PF02770 PF02771 PS00072 PS00073); for my $dblink ( $seq->annotation->get_Annotations('dblink') ) { is($dblink->primary_id, shift @idcheck); } $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swissdriver', -file => test_input_file('pre_rel9.swiss')); my @namespaces = qw(Swiss-Prot TrEMBL TrEMBL); while (my $seq = $seqio->next_seq) { is($seq->namespace, shift @namespaces); } # format parsing changes (rel 9.0, Oct 2006) $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swissdriver', -file => test_input_file('rel9.swiss')); ok($seqio); $seq = $seqio->next_seq; isa_ok($seq->species, 'Bio::Species'); is($seq->species->ncbi_taxid, 6239); is($seq->version, 47); ($ann) = $seq->annotation->get_Annotations('seq_update'); is($ann->display_text, 1,'operator overloading in AnnotationI is deprecated'); 
@dates = $seq->get_dates; @date_check = qw(01-NOV-1997 01-NOV-1996 31-OCT-2006 ); for my $date (@dates) { is($date, shift @date_check); } @idcheck = qw(Z66513 T22647 Cel.30446 Q06319 Q20772 F54D5.7 cel:F54D5.7 WBGene00010052 F54D5.7 GO:0005515 IPR006089 IPR006091 IPR006090 IPR006092 IPR009075 IPR013786 IPR009100 IPR013764 PF00441 PF02770 PF02771 PS00072 PS00073 ); for my $dblink ( $seq->annotation->get_Annotations('dblink') ) { is($dblink->primary_id, shift @idcheck); } $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swissdriver', -file => test_input_file('rel9.swiss')); @namespaces = qw(Swiss-Prot TrEMBL TrEMBL); while (my $seq = $seqio->next_seq) { is($seq->namespace, shift @namespaces); } # bug 2288 # Q8GBD3.swiss $seqio = Bio::SeqIO->new( -verbose => $verbose, -format => 'swiss', -file => test_input_file('Q8GBD3.swiss')); while (my $seq = $seqio->next_seq) { my $lineage = join(';', $seq->species->classification); is ($lineage, 'Acetobacter aceti;Acetobacter subgen. Acetobacter;'. 'Acetobacter;Acetobacteraceae;Rhodospirillales;Alphaproteobacteria;'. 'Proteobacteria;Bacteria'); } # test for GenBank swissprot/UniProt/UniProtKB DBSOURCE line (Bug : RT 44536) $ast = Bio::SeqIO->new(-format => 'genbank', -verbose => $verbose, -file => test_input_file('P39765.gb')); $ast->verbose($verbose); $as = $ast->next_seq(); is $as->molecule, 'PRT',$as->accession_number;; is $as->alphabet, 'protein'; # Though older GenBank releases indicate SOURCE contains only the common name, # this is no longer true. In general, this line will contain an abbreviated # form of the full organism name (but may contain the full length name), # as well as the optional common name and organelle. 
There is no get/set # for the abbreviated name but it is accessible via name() ok defined($as->species->name('abbreviated')->[0]); is $as->species->name('abbreviated')->[0], 'Bacillus subtilis'; is($as->primary_id, 20141743); $ac = $as->annotation; ok defined $ac; @dblinks = $ac->get_Annotations('dblink'); is(scalar @dblinks,31); is($dblinks[0]->database, 'UniProtKB'); is($dblinks[0]->primary_id, 'PYRR_BACSU'); is($dblinks[0]->version, undef); is($dblinks[0]->display_text, 'UniProtKB:PYRR_BACSU','operator overloading in AnnotationI is deprecated'); tigrxml.t100644000766000024 471414006427353 16324 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 49, -requires_modules => [qw(XML::SAX XML::SAX::Writer)]); use_ok('Bio::SeqIO::tigrxml'); } my $verbose = test_debug(); my $ast = Bio::SeqIO->new(-format => 'tigrxml' , -verbose => $verbose, -file => test_input_file('test.tigrxml')); isa_ok($ast, 'Bio::SeqIO'); $ast->verbose($verbose); ok my $as = $ast->next_seq(); is($as->display_id, 'chr9'); my $first = 1; for my $f ( sort { $a->start * $a->strand <=> $b->start * $b->strand } $as->get_SeqFeatures ) { ok($f); my ($name); for my $tag ( qw(Parent ID) ) { if( $f->has_tag($tag) ) { ($name) = $f->get_tag_values($tag); last; } } if( $name eq '162.t00500' || $name eq '162.m02638' ) { if( $f->primary_tag eq 'gene' ) { is($f->start, 185408); is($f->end, 187155); # warn($f->gff_string, "\n"); } elsif( $f->primary_tag eq 'mRNA' ) { is($f->start, 185408); # the values list for COORD are start/end of CDS not whole transcript is($f->end, 187155); is($f->strand, 1); } elsif( $f->primary_tag eq "five_prime_UTR" ) { my ($id) = $f->get_tag_values('ID'); if( $id =~ /UTR1$/ ) { is($f->start, 185408); is($f->end, 185433); } elsif( $id =~ /UTR2$/ ) { is($f->start, 185487); is($f->end, 185793); } else { ok(0, 'expected only two UTRS'); } } elsif( $f->primary_tag eq 
"three_prime_UTR" ) { is($f->start, 187042); is($f->end, 187155); } elsif( $f->primary_tag eq 'CDS' ) { is($f->start, 185794); is($f->end, 187041); } } elsif ( $name eq '162.t00448' || $name eq '162.m02967' ) { if( $f->primary_tag eq 'gene' ) { is($f->start, 59343); is($f->end, 61061); } elsif( $f->primary_tag eq 'mRNA' ) { is($f->start, 59343); # the values list for COORD are start/end of CDS not whole transcript is($f->end, 61061); is($f->strand, -1); } elsif( $f->primary_tag eq "five_prime_UTR" ) { my ($id) = $f->get_tag_values('ID'); is($f->start, 60834); is($f->end, 61061); is($f->strand, -1); } elsif( $f->primary_tag eq "three_prime_UTR" ) { is($f->start, 59343); is($f->end, 59632); is($f->strand, -1); } elsif( $f->primary_tag eq 'CDS' ) { if( $first ) { is($f->start, 60801); is($f->end, 60833); is($f->strand, -1); $first = 0; } } } else { ok(0, "unexpected name '$name'\n"); } } PrimedSeq.t100644000766000024 1063514006427353 16316 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 65); use_ok('Bio::SeqIO'); use_ok('Bio::Seq::PrimedSeq'); } my ($seqio, $seq, $left, $right, $primed_seq, $left_test, $right_test, $annseq, $amplicon, $returnedseq); $seqio = Bio::SeqIO->new(-file => test_input_file('primedseq.fa')); $seq = $seqio->next_seq; my $expected_amplicon_seq = 'cttttcattctgactgcaacgGGCAATATGTCTCTGTGTGGATTAAAAA'. 'AAGAGTGTCTGATAGCAGCTTCTGAACTGGTTACCTGCCGTGAGTAAATTAAAATTTTATTGACTTAGGTCACTAAA'. 'TACTTTAACCAATATAGGCATAGCGCACAGACAGATAAAAATTACAGAGTACACAACATCCATGAAacgcattagca'. 'ccacc'; my $expected_amplicon_seq2 = 'cttttcattctgactgcaacgTGTCTCTGTGTGGATTAAAAA'. 'AAGAGTGTCTGATAGCAGCTTCTGAACTGGTTACCTGCCGTGAGTAAATTAAAATTTTATTGACTTAGGTCACTAAA'. 'TACTTTAACCAATATAGGCATAGCGCACAGACAGATAAAAATTACAGAGTACACAACATCCATGAAacgcattagca'. 
'ccacc'; # Prime with Bio::PrimarySeqI objects and have the primer positions calculated $left = Bio::PrimarySeq->new(-id => 123, -seq => 'CTTTTCATTCTGACTGCAACG'); $right = Bio::Seq->new(-seq => 'GGTGGTGCTAATGCGT'); ok $primed_seq = Bio::Seq::PrimedSeq->new( -seq => $seq, -left_primer => $left, -right_primer => $right, ), 'Priming the target with sequence objects'; is $primed_seq->isa('Bio::SeqFeature::Generic'), 1; ok $annseq = $primed_seq->annotated_sequence; # should I check that this is what I think it is, or just be happy? ok $amplicon = $primed_seq->amplicon; is $amplicon->seq, $expected_amplicon_seq; is $amplicon->id, 'Amplicon_from_Test1'; ok $returnedseq = $primed_seq->seq; is $returnedseq->seq, $seq->seq; ok $left_test = $primed_seq->get_primer('-left_primer'); isa_ok $left_test, 'Bio::SeqFeature::Primer'; is $left_test->seq->seq, 'CTTTTCATTCTGACTGCAACG'; ok $right_test = $primed_seq->get_primer('-r'); isa_ok $right_test, 'Bio::SeqFeature::Primer'; is $right_test->seq->seq, 'GGTGGTGCTAATGCGT'; ok( ($left_test, $right_test) = $primed_seq->get_primer() ); is $left_test->seq->seq, 'CTTTTCATTCTGACTGCAACG'; is $right_test->seq->seq, 'GGTGGTGCTAATGCGT'; is $left_test->strand, 1; is $left_test->start, 3; is $left_test->end, 23; is $right_test->strand, -1; is $right_test->start, 195; is $right_test->end, 210; # Prime the sequence with Bio::SeqFeature::Primer objects $left = Bio::SeqFeature::Primer->new(-id => 123, -seq => 'CTTTTCATTCTGACTGCAACG'); $right = Bio::SeqFeature::Primer->new(-seq => 'GGTGGTGCTAATGCGT'); ok $primed_seq = Bio::Seq::PrimedSeq->new( -seq => $seq, -left_primer => $left, -right_primer => $right, ), 'Priming the target with primer objects'; ok $annseq = $primed_seq->annotated_sequence; ok $amplicon = $primed_seq->amplicon; is $amplicon->seq, $expected_amplicon_seq; is $amplicon->id, 'Amplicon_from_Test1'; ok $returnedseq = $primed_seq->seq; is $returnedseq->seq, $seq->seq; ok $left_test = $primed_seq->get_primer('left'); is_deeply $left_test, 
$left;
ok $right_test = $primed_seq->get_primer('r');
is_deeply $right_test, $right;

ok( ($left_test, $right_test) = $primed_seq->get_primer('-both') );
is_deeply $left_test, $left;
is_deeply $right_test, $right;

is $left_test->strand, 1;
is $left_test->start, 3;
is $left_test->end, 23;
is $right_test->strand, -1;
is $right_test->start, 195;
is $right_test->end, 210;

# Prime the sequence with located Bio::SeqFeature::Primer objects,
# i.e. primers that already carry explicit start, end and strand
$left = Bio::SeqFeature::Primer->new(
    -id     => 123,
    -seq    => 'CTTTTCATTCTGACTGCAACG',
    -start  => 10,
    -end    => 30,
    -strand => 1,
);
$right = Bio::SeqFeature::Primer->new(
    -seq    => 'GGTGGTGCTAATGCGT',
    -start  => 195,
    -end    => 210,
    -strand => -1,
);
ok $primed_seq = Bio::Seq::PrimedSeq->new(
    -seq          => $seq,
    -left_primer  => $left,
    -right_primer => $right,
), 'Priming the target with located primer objects';

ok $annseq = $primed_seq->annotated_sequence;
ok $amplicon = $primed_seq->amplicon;
is $amplicon->seq, $expected_amplicon_seq2;
is $amplicon->id, 'Amplicon_from_Test1';
ok $returnedseq = $primed_seq->seq;
is $returnedseq->seq, $seq->seq;

ok $left_test = $primed_seq->get_primer('left');
is_deeply $left_test, $left;
ok $right_test = $primed_seq->get_primer('r');
is_deeply $right_test, $right;

ok( ($left_test, $right_test) = $primed_seq->get_primer('-both') );
is_deeply $left_test, $left;
is_deeply $right_test, $right;

is $left_test->strand, 1;
is $left_test->start, 10;
is $left_test->end, 30;
is $right_test->strand, -1;
is $right_test->start, 195;
is $right_test->end, 210;
LargePSeq.t100644000766000024 562114006427353 16227 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 30);

    use_ok('Bio::Seq::LargePrimarySeq');
    use_ok('Bio::Seq::LargeSeq');
    use_ok('Bio::Location::Simple');
    use_ok('Bio::Location::Fuzzy');
    use_ok('Bio::Location::Split');
    use_ok('Bio::SeqIO');
}

my $pseq = Bio::Seq::LargePrimarySeq->new();
ok $pseq;
$pseq->add_sequence_as_string('ATGGGGTGGGGTGAAACCCTTTGGGGGTGGGGTAAAT'); $pseq->add_sequence_as_string('GTTTGGGGTTAAACCCCTTTGGGGGGT'); is $pseq->display_id('hello'), 'hello'; is $pseq->seq, 'ATGGGGTGGGGTGAAACCCTTTGGGGGTGGGGTAAATGTTTGGGGTTAAACCCCTTTGGGGGGT' , "Sequence is " . $pseq->seq; is $pseq->subseq(3,7), 'GGGGT', "Subseq is ".$pseq->subseq(3,7); my $location = Bio::Location::Simple->new(-start => 4, -end => 8, -strand => 1); is($pseq->subseq($location), 'GGGTG'); my $splitlocation = Bio::Location::Split->new(); $splitlocation->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 15, '-strand' => 1)); $splitlocation->add_sub_Location( Bio::Location::Simple->new('-start' => 21, '-end' => 27, '-strand' => -1)); is( $pseq->subseq($splitlocation), 'ATGGGGTGGGGTGAACCCCCAA'); my $fuzzy = Bio::Location::Fuzzy->new(-start => '<10', -end => '18', -strand => 1); is( $pseq->subseq($fuzzy), 'GGTGAAACC'); is($pseq->trunc(8,15)->seq, 'GGGGTGAA', 'trunc seq was ' . $pseq->trunc(8,15)->seq); is $pseq->alphabet('dna'), 'dna'; # so translate will not complain is $pseq->translate()->seq, 'MGWGETLWGWGKCLGLNPFGG'; my $seq = Bio::Seq::LargeSeq->new(-primaryseq => $pseq ); is $seq->display_id('hello'), 'hello'; is $seq->seq, 'ATGGGGTGGGGTGAAACCCTTTGGGGGTGGGGTAAATGTTTGGGGTTAAACCCCTTTGGGGGGT' , "Sequence is " . $seq->seq; # test SeqIO::fasta (allows LargeSeqI; bug 2490) SKIP: { eval {require IO::String}; skip "SeqIO output for LargeSeq requires IO::String", 2 if $@; my $str; my $strobj = IO::String->new($str); my $out = Bio::SeqIO->new(-fh => $strobj, -format => 'fasta'); ok($out->write_seq($seq)); like($str, qr/>hello\nATGGGGTGGGGTGAAACCCTTTGGGGGTGGGGTAAATGTTTGGGGTTAAACCCCTTTGGG\nGGGT/, 'output via Bio::SeqIO::fasta'); } is $seq->subseq(3,7), 'GGGGT', "Subseq is ".$seq->subseq(3,7); is ($seq->trunc(8,15)->seq, 'GGGGTGAA', 'trunc seq was ' . 
$seq->trunc(8,15)->seq); is $seq->alphabet('dna'), 'dna'; # so translate will not complain is $seq->translate()->seq, 'MGWGETLWGWGKCLGLNPFGG'; $seq = Bio::Seq::LargeSeq->new( -display_id => 'hello'); $seq->seq('ATGGGGTGGGGT'); is $seq->display_id, 'hello'; is $seq->seq, 'ATGGGGTGGGGT' , "Sequence is " . $seq->seq; is $seq->subseq(3,7), 'GGGGT', "Subseq is ".$seq->subseq(3,7); is ($seq->trunc(8,12)->seq, 'GGGGT', 'trunc seq was ' . $seq->trunc(8,12)->seq); is $seq->alphabet('dna'), 'dna'; # so translate will not complain is $seq->translate()->seq, 'MGWG'; Fasta.t100644000766000024 3432114006427353 16171 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/LocalDBBEGIN { use Bio::Root::Test; test_begin( -tests => 109, -requires_modules => [qw(Bio::DB::Fasta Bio::SeqIO)] ); } use strict; use warnings; use Bio::Root::Root; use File::Copy; my $DEBUG = test_debug(); # Test Bio::DB::Fasta, but also the underlying module, Bio::DB::IndexedBase my $test_dir = setup_temp_dir('dbfa'); my $test_file = test_input_file('dbfa', 'mixed_alphabet.fasta'); my $test_files = [ test_input_file('dbfa', 'mixed_alphabet.fasta'), test_input_file('dbfa', '6.fa') ]; { # Test basic functionalities ok my $db = Bio::DB::Fasta->new($test_dir, -reindex => 1), 'Index a directory'; is $db->glob, '*.{fa,FA,fasta,FASTA,fast,FAST,dna,DNA,fna,FNA,faa,FAA,fsa,FSA}'; isa_ok $db, 'Bio::DB::Fasta'; is $db->length('CEESC13F'), 389; is $db->seq('CEESC13F:1,10'), 'cttgcttgaa'; is $db->seq('CEESC13F:1-10'), 'cttgcttgaa'; is $db->seq('CEESC13F:1..10'), 'cttgcttgaa'; is $db->seq('CEESC13F:1..10/1'), 'cttgcttgaa'; is $db->seq('CEESC13F:1..10/+1'), 'cttgcttgaa'; is $db->seq('CEESC13F:1..10/-1'), 'ttcaagcaag'; is $db->seq('CEESC13F/1'), 
'cttgcttgaaaaatttatataaatatttaagagaagaaaaataaataatcgcatctaatgacgtctgtccttgtatccctggtttccattgactggtgcactttcctgtctttgaggacatggacaatattcggcatcagttcctggctctccctcctctcctggtgctccagcagaaccgttctctccattatctcccttgtctccacgtggtccacgctctcctggtgctcctggaataccttgagctccctcgtgccgaattcctgcagcccgggggatccactagttctagagcggccgccaccgcggtgggagctccagcttttgttncctttagtgagggttaatttcgagcttggcgtaatcatggtcatagctgtttcctg'; is $db->seq('CEESC13F/-1'), 'caggaaacagctatgaccatgattacgccaagctcgaaattaaccctcactaaaggnaacaaaagctggagctcccaccgcggtggcggccgctctagaactagtggatcccccgggctgcaggaattcggcacgagggagctcaaggtattccaggagcaccaggagagcgtggaccacgtggagacaagggagataatggagagaacggttctgctggagcaccaggagaggagggagagccaggaactgatgccgaatattgtccatgtcctcaaagacaggaaagtgcaccagtcaatggaaaccagggatacaaggacagacgtcattagatgcgattatttatttttcttctcttaaatatttatataaatttttcaagcaag'; is $db->seq('AW057119', 1, 10), 'tcatgttggc'; is $db->seq('AW057119', 1, 10, 1), 'tcatgttggc'; is $db->seq('AW057119', 1, 10, -1), 'gccaacatga'; is $db->seq('AW057119', 10, 1), 'gccaacatga'; is $db->seq('AW057119', 10, 1, -1), 'tcatgttggc'; is $db->header('AW057119'), 'AW057119 test description'; is $db->seq('foobarbaz'), undef; is $db->get_Seq_by_id('foobarbaz'), undef; is $db->file('AW057119'), '1.fa'; is $db->file('AW057410'), '3.fa'; is $db->file('CEESC13F'), '6.fa'; # Bio::DB::RandomAccessI and Bio::DB::SeqI methods ok my $primary_seq = $db->get_Seq_by_id('AW057119'); ok $primary_seq = $db->get_Seq_by_acc('AW057119'); ok $primary_seq = $db->get_Seq_by_version('AW057119'); ok $primary_seq = $db->get_Seq_by_primary_id('AW057119'); isa_ok $primary_seq, 'Bio::PrimarySeq::Fasta'; isa_ok $primary_seq, 'Bio::PrimarySeqI'; # Bio::PrimarySeqI methods is $primary_seq->id, 'AW057119'; is $primary_seq->display_id, 'AW057119'; like $primary_seq->primary_id, qr/^Bio::PrimarySeq::Fasta=HASH/; is $primary_seq->alphabet, 'dna'; is $primary_seq->accession_number, 'unknown'; is $primary_seq->is_circular, undef; is $primary_seq->subseq(11, 20), 'ttctcggggt'; is 
$primary_seq->description, 'test description', 'bug 3126'; is $primary_seq->seq, 'tcatgttggcttctcggggtttttatggattaatacattttccaaacgattctttgcgccttctgtggtgccgccttctccgaaggaactgacgaaaaatgacgtggatttgctgacaaatccaggcgaggaatatttggacggattgatgaaatggcacggcgacgagcgacccgtgttcaaaagagaggacatttatcgttggtcggatagttttccagaatatcggctaagaatgatttgtctgaaagacacgacaagggtcattgcagtcggtcaatattgttactttgatgctctgaaagaaaggagagcagccattgttcttcttaggattgggatggacggatcctgaatatcgtaatcgggcagttatggagcttcaagcttcgatggcgctggaggagagggatcggtatccgactgccaacgcggcatcgcatccaaataagttcatgaaacgattttggcacatattcaacggcctcaaagagcacgaggacaaaggtcacaaggctgccgctgtttcatacaagagcttctacgacctcanagacatgatcattcctgaaaatctggatgtcagtggtattactgtaaatgatgcacgaaaggtgccacaaagagatataatcaactacgatcaaacatttcatccatatcatcgagaaatggttataatttctcacatgtatgacaatgatgggtttggaaaagtgcgtatgatgaggatggaaatgtacttggaattgtctagcgatgtctttanaccaacaagactgcacattagtcaattatgcagatagcc'; ok my $trunc = $primary_seq->trunc(11,20); isa_ok $trunc, 'Bio::PrimarySeq::Fasta'; isa_ok $trunc, 'Bio::PrimarySeqI'; is $trunc->length, 10; is $trunc->seq, 'ttctcggggt'; ok my $rev = $trunc->revcom; isa_ok $rev, 'Bio::PrimarySeq::Fasta'; isa_ok $rev, 'Bio::PrimarySeqI'; is $rev->seq, 'accccgagaa'; is $rev->length, 10; } { # Re-open an existing index. # Doing this test properly involves unloading and reloading Bio::DB::Fasta. 
SKIP: { test_skip(-tests => 1, -requires_modules => [qw(Class::Unload)]); use_ok('Class::Unload'); Class::Unload->unload( 'Bio::DB::Fasta' ); Class::Unload->unload( 'Bio::DB::IndexedBase' ); require Bio::DB::Fasta; } ok my $db = Bio::DB::Fasta->new($test_dir), 'Re-open an existing index'; is $db->seq('AW057119', 1, 10), 'tcatgttggc'; } { # Test tied hash access my %h; ok tie(%h, 'Bio::DB::Fasta', $test_dir), 'Tied hash access'; ok exists $h{'AW057146'}; is $h{'AW057146:1,10'} , 'aatgtgtaca'; # in file 1.fa is $h{'AW057146:10,1'} , 'tgtacacatt'; # reverse complement is $h{'AW057443:11,20'}, 'gaaccgtcag'; # in file 4.fa } { # Test writing the Bio::PrimarySeq::Fasta objects with SeqIO ok my $db = Bio::DB::Fasta->new($test_dir, -reindex => 1), 'Writing with SeqIO'; my $out = Bio::SeqIO->new( -format => 'genbank', -file => '>'.test_output_file() ); my $primary_seq = Bio::Seq->new(-primary_seq => $db->get_Seq_by_acc('AW057119')); eval { $out->write_seq($primary_seq) }; is $@, ''; $out = Bio::SeqIO->new(-format => 'embl', -file => '>'.test_output_file()); eval { $out->write_seq($primary_seq) }; is $@, ''; } { # Test alphabet and reverse-complement RNA ok my $db = Bio::DB::Fasta->new( $test_file, -reindex => 1), 'Index a single file'; is $db->alphabet('gi|352962132|ref|NG_030353.1|'), 'dna'; is $db->alphabet('gi|352962148|ref|NM_001251825.1|'), 'rna'; is $db->alphabet('gi|194473622|ref|NP_001123975.1|'), 'protein'; is $db->alphabet('gi|61679760|pdb|1Y4P|B'), 'protein'; is $db->alphabet('123'), ''; is $db->seq('gi|352962148|ref|NM_001251825.1|', 20, 29, 1), 'GUCAGCGUCC'; is $db->seq('gi|352962148|ref|NM_001251825.1|', 20, 29, -1), 'GGACGCUGAC'; # Test empty sequence is $db->seq('123'), ''; is $db->file('gi|352962132|ref|NG_030353.1|'), 'mixed_alphabet.fasta'; } { # Test stream ok my $db = Bio::DB::Fasta->new( $test_file, -reindex => 1); ok my $stream = $db->get_PrimarySeq_stream; isa_ok $stream, 'Bio::DB::Indexed::Stream'; my $count = 0; # note use of modified iterator, 
needed b/c of overloading while (defined(my $seq = $stream->next_seq)) { $count++; } is $count, 7; # bug #170 (Github) # retrieve seq with ID of 0 my $seq = $db->get_Seq_by_id(0); isa_ok $seq, 'Bio::PrimarySeq::Fasta'; is $seq->display_id, 0; # ActivePerl will not allow deletion if the tie-hash is still active $db->DESTROY; # Strawberry Perl temporary file unlink "$test_file.index" if -e "$test_file.index"; # ActivePerl temporary files unlink "$test_file.index.dir" if -e "$test_file.index.dir"; unlink "$test_file.index.pag" if -e "$test_file.index.pag"; } { # Concurrent databases (bug #3390) ok my $db1 = Bio::DB::Fasta->new( test_input_file('dbfa', '1.fa') ); ok my $db3 = Bio::DB::Fasta->new( test_input_file('dbfa', '3.fa') ); ok my $db4 = Bio::DB::Fasta->new( $test_dir ); ok my $db2 = Bio::DB::Fasta->new( test_input_file('dbfa', '2.fa') ); is $db4->file('AW057231'), '1.fa'; is $db2->file('AW057302'), '2.fa'; is $db4->file('AW057119'), '1.fa'; is $db3->file('AW057336'), '3.fa'; is $db1->file('AW057231'), '1.fa'; is $db4->file('AW057410'), '3.fa'; # ActivePerl will not allow deletion if the tie-hash is still active $db1->DESTROY; $db2->DESTROY; $db3->DESTROY; # Strawberry Perl temporary file unlink $db1->index_name if -e $db1->index_name; unlink $db2->index_name if -e $db2->index_name; unlink $db3->index_name if -e $db3->index_name; # ActivePerl temporary files unlink $db1->index_name().'.dir' if -e $db1->index_name().'.dir'; unlink $db2->index_name().'.dir' if -e $db2->index_name().'.dir'; unlink $db3->index_name().'.dir' if -e $db3->index_name().'.dir'; unlink $db1->index_name().'.pag' if -e $db1->index_name().'.pag'; unlink $db2->index_name().'.pag' if -e $db2->index_name().'.pag'; unlink $db3->index_name().'.pag' if -e $db3->index_name().'.pag'; } { # Test an arbitrary index filename and cleaning my $name = 'arbitrary.idx'; ok my $db = Bio::DB::Fasta->new( $test_file, -reindex => 1, -index_name => $name, -clean => 1, ); is $db->index_name, $name; # Tied-hash in 
Strawberry Perl produce $name, # while in ActivePerl produce "$name.dir" and "$name.pag" if (-e "$name.pag") { ok -f "$name.pag"; # ActivePerl will not allow deletion if the tie-hash is still active $db->DESTROY; unlink "$name.dir" if -e "$name.dir"; unlink "$name.pag" if -e "$name.pag"; ok ! -f "$name.pag"; } else { ok -f $name; # ActivePerl will not allow deletion if the tie-hash is still active $db->DESTROY; unlink $name if -e $name; ok ! -f $name; } undef $db; } { # Test makeid ok my $db = Bio::DB::Fasta->new( $test_file, -reindex => 1, -clean => 1, -makeid => \&extract_gi, ), 'Make single ID'; is_deeply [sort $db->get_all_primary_ids], ['', 194473622, 352962132, 352962148, 61679760]; is $db->get_Seq_by_id('gi|352962148|ref|NM_001251825.1|'), undef; isa_ok $db->get_Seq_by_id(194473622), 'Bio::PrimarySeqI'; } { # Test makeid that generates several IDs, bug #3389 ok my $db = Bio::DB::Fasta->new( $test_file, -reindex => 1, -clean => 1, -makeid => \&extract_gi_and_ref, ), 'Make multiple IDs, bug #3389'; is_deeply [sort $db->get_all_primary_ids], ['', 194473622, 352962132, 352962148, 61679760, 'NG_030353.1', 'NM_001251825.1', 'NP_001123975.1']; is $db->get_Seq_by_id('gi|352962148|ref|NM_001251825.1|'), undef; isa_ok $db->get_Seq_by_id('NG_030353.1'), 'Bio::PrimarySeqI'; } { # Test opening set of files and test IDs ok my $db = Bio::DB::Fasta->new( $test_files, -reindex => 1), 'Index a set of files'; ok $db->ids; ok $db->get_all_ids; my @ids = sort $db->get_all_primary_ids(); is_deeply \@ids, [ qw( 0 1 123 CEESC12R CEESC13F CEESC13R CEESC14F CEESC14R CEESC15F CEESC15R CEESC15RB CEESC16F CEESC17F CEESC17RB CEESC18F CEESC18R CEESC19F CEESC19R CEESC20F CEESC21F CEESC21R CEESC22F CEESC23F CEESC24F CEESC25F CEESC26F CEESC27F CEESC28F CEESC29F CEESC30F CEESC32F CEESC33F CEESC33R CEESC34F CEESC35R CEESC36F CEESC37F CEESC39F CEESC40R CEESC41F gi|194473622|ref|NP_001123975.1| gi|352962132|ref|NG_030353.1| gi|352962148|ref|NM_001251825.1| gi|61679760|pdb|1Y4P|B )]; like 
$db->index_name, qr/^fileset_.+\.index$/;

    my $index = $db->index_name;
    # ActivePerl will not allow deletion if the tie-hash is still active
    $db->DESTROY;
    # Strawberry Perl temporary file
    unlink $index if -e $index;
    # ActivePerl temporary files
    unlink "$index.dir" if -e "$index.dir";
    unlink "$index.pag" if -e "$index.pag";
}

{
    # Squash warnings locally
    local $SIG{__WARN__} = sub {};

    # Issue 3172
    my $test_dir = setup_temp_dir('bad_dbfa');
    throws_ok {my $db = Bio::DB::Fasta->new($test_dir, -reindex => 1)}
        qr/FASTA header doesn't match/;

    # Issue 3237
    # Empty lines within a sequence are bad...
    throws_ok {my $db = Bio::DB::Fasta->new(test_input_file('badfasta.fa'), -reindex => 1)}
        qr/Blank lines can only precede header lines/;
}

{
    # Issue 3237 again
    # Empty lines preceding headers are okay, but check the seqs just in case
    my $db;
    lives_ok {$db = Bio::DB::Fasta->new(test_input_file('spaced_fasta.fa'), -reindex => 1)};
    is length($db->seq('CEESC39F')), 375, 'length is correct in sequences past spaces';
    is length($db->seq('CEESC13F')), 389;

    is $db->subseq('CEESC39F', 51, 60) , 'acatatganc', 'subseq is correct';
    is $db->subseq('CEESC13F', 146, 155), 'ggctctccct', 'subseq is correct';

    # Remove temporary test file
    my $outfile = test_input_file('spaced_fasta.fa').'.index';

    # ActivePerl will not allow deletion if the tie-hash is still active
    $db->DESTROY;
    # Strawberry Perl temporary file
    unlink $outfile if -e $outfile;
    # ActivePerl temporary files
    unlink "$outfile.dir" if -e "$outfile.dir";
    unlink "$outfile.pag" if -e "$outfile.pag";
}

exit;


sub extract_gi {
    # Extract GI from RefSeq
    my $header = shift;
    my ($id) = ($header =~ /gi\|(\d+)/m);
    return $id || '';
}


sub extract_gi_and_ref {
    # Extract GI and ref from RefSeq
    my $header = shift;
    my ($gi)  = ($header =~ /gi\|(\d+)/m);
    $gi ||= '';
    my ($ref) = ($header =~ /ref\|([^|]+)/m);
    $ref ||= '';
    return $gi, $ref;
}


sub setup_temp_dir {
    # this obfuscation is to deal with lockfiles by GDBM_File which can
    # only be created on local
filesystems apparently so will cause test # to block and then fail when the testdir is on an NFS mounted system my $data_dir = shift; my $io = Bio::Root::IO->new(); my $tempdir = test_output_dir(); my $test_dir = $io->catfile($tempdir, $data_dir); mkdir($test_dir); # make the directory my $indir = test_input_file($data_dir); opendir(my $INDIR,$indir) || die("cannot open dir $indir"); # effectively do a cp -r but only copy the files that are in there, no subdirs for my $file ( map { $io->catfile($indir,$_) } readdir($INDIR) ) { next unless (-f $file ); copy($file, $test_dir); } closedir($INDIR); return $test_dir } Ontology000755000766000024 014006427353 15135 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tTerm.t100644000766000024 652414006427353 16400 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Ontology# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 54, -requires_module => 'Graph::Directed'); use_ok('Bio::Ontology::Term'); use_ok('Bio::Ontology::TermFactory'); use_ok('Bio::Annotation::DBLink'); } my $obj = Bio::Ontology::Term->new(); isa_ok $obj, "Bio::Ontology::TermI"; is( $obj->identifier( "0003947" ), "0003947" ); is( $obj->identifier(), "0003947" ); is( $obj->name( "N-acetylgalactosaminyltransferase" ), "N-acetylgalactosaminyltransferase" ); is( $obj->name(), "N-acetylgalactosaminyltransferase" ); is( $obj->definition( "Catalysis of ..." ), "Catalysis of ..." ); is( $obj->definition(), "Catalysis of ..." ); is( $obj->version( "666" ), "666" ); is( $obj->version(), "666" ); ok( $obj->ontology( "category 1 name" ) ); is( $obj->ontology()->name(), "category 1 name" ); my $ont = Bio::Ontology::Ontology->new(); ok( $ont->name( "category 2 name" ) ); ok( $obj->ontology( $ont ) ); is( $obj->ontology()->name(), "category 2 name" ); is( $obj->is_obsolete( 1 ), 1 ); is( $obj->is_obsolete(), 1 ); is( $obj->comment( "Consider the term ..." ), "Consider the term ..." 
); is( $obj->comment(), "Consider the term ..." ); is( $obj->get_synonyms(), 0 ); $obj->add_synonym( ( "AA", "AB" ) ); my @al1 = $obj->get_synonyms(); is( scalar(@al1), 2 ); is( $al1[ 0 ], "AA" ); is( $al1[ 1 ], "AB" ); my @al2 = $obj->remove_synonyms(); is( $al2[ 0 ], "AA" ); is( $al2[ 1 ], "AB" ); is( $obj->get_synonyms(), 0 ); is( $obj->remove_synonyms(), 0 ); $obj->add_synonyms( ( "AA", "AB" ) ); is( $obj->identifier(undef), undef ); is( $obj->name(undef), undef ); is( $obj->definition(undef), undef ); is( $obj->is_obsolete(0), 0 ); is( $obj->comment(undef), undef ); $obj = Bio::Ontology::Term->new( -identifier => "0016847", -name => "1-aminocyclopropane-1-carboxylate synthase", -definition => "Catalysis of ...", -is_obsolete => 0, -version => "6.6.6", -ontology => "cat", -comment => "X", -dbxrefs => [ Bio::Annotation::DBLink->new(-database => 'db1'), Bio::Annotation::DBLink->new(-database => 'db2') ], -references => [] ); is( $obj->identifier(), "0016847" ); is( $obj->name(), "1-aminocyclopropane-1-carboxylate synthase" ); is( $obj->definition(), "Catalysis of ..." 
); is( $obj->is_obsolete(), 0); is( $obj->comment(), "X" ); is( $obj->version(), "6.6.6" ); is( $obj->ontology()->name(), "cat" ); is( scalar($obj->get_dbxrefs), 2); is( scalar($obj->get_references), 0); # test object factory for terms my $fact = Bio::Ontology::TermFactory->new(); $obj = $fact->create_object(-name => "some ontology term"); isa_ok $obj, "Bio::Ontology::TermI"; is ($obj->name, "some ontology term"); $fact->type("Bio::Ontology::GOterm"); $obj = $fact->create_object(-name => "some ontology term", -identifier => "GO:987654"); isa_ok $obj, "Bio::Ontology::TermI"; isa_ok $obj, "Bio::Ontology::GOterm"; is ($obj->name, "some ontology term"); is ($obj->identifier, "GO:987654"); $fact->type("Bio::Annotation::OntologyTerm"); $obj = $fact->create_object(-name => "some ontology term", -identifier => "GO:987654", -ontology => "nonsense"); isa_ok $obj, "Bio::Ontology::TermI"; isa_ok $obj, "Bio::AnnotationI"; is ($obj->name, "some ontology term"); is ($obj->identifier, "GO:987654"); is ($obj->tagname, "nonsense"); fasta.t100644000766000024 316614006427353 16236 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 12); use_ok('Bio::AlignIO::fasta'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # FASTA $str = Bio::AlignIO->new( -file => test_input_file("testaln.fasta"), -format => 'fasta'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->display_id, 'AK1H_ECOLI/114-431', "fasta input test "; is $aln->get_seq_by_pos(1)->get_nse, 'AK1H_ECOLI/114-431/1-318', "fasta input test "; is ($aln->get_seq_by_pos(1)->description, 'DESCRIPTION HERE', "fasta input test for description"); is ($aln->get_seq_by_pos(11)->display_id, 'AK_YEAST/134-472', "fasta input test for id"); is $aln->get_seq_by_pos(11)->get_nse, 'AK_YEAST/134-472/134-472', "fasta input test, captures formatted start/end"; is 
($aln->get_seq_by_pos(2)->end, 318, "fasta input test for end"); is ($aln->get_seq_by_pos(11)->description, 'A COMMENT FOR YEAST', "fasta input test for description"); $strout = Bio::AlignIO->new( '-file' => ">".test_output_file(), '-format' => 'fasta'); $status = $strout->write_aln($aln); is $status, 1,"fasta output test"; my $in = Bio::AlignIO->newFh( '-file' => test_input_file("testaln.fasta"), '-format' => 'fasta'); my $out = Bio::AlignIO->newFh( '-file' => ">".test_output_file(), '-format' => 'pfam'); while ( $aln = <$in>) { is $aln->get_seq_by_pos(1)->get_nse, 'AK1H_ECOLI/114-431/1-318', "filehandle input test "; $status = print $out $aln; last; } is $status, 1, "filehandle output test"; nexus.t100644000766000024 1071614006427353 16321 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: nexus.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 43); use_ok('Bio::AlignIO::nexus'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # NEXUS $str = Bio::AlignIO->new( '-file' => test_input_file('testaln.nexus'), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'Homo_sapiens/1-45'; $strout = Bio::AlignIO->new('-file' => ">".test_output_file(), '-format' => 'nexus', ); $status = $strout->write_aln($aln); is $status, 1, "nexus output test"; $str = Bio::AlignIO->new( '-file' => test_input_file('Bird_Ovomucoids.nex'), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file('basic-ladder.nex'), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file('Kingdoms_DNA.nex'), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); 
isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file('char-interleave.nex'), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file('Primate_mtDNA.nex'), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("char-matrix-spaces.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("SPAN_Family4nl.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("intrablock-comment.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("SPAN_Family7n.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("long-names.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("SPAN_Family8a.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("multiline-intrablock-comment.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("Treebase-chlamy-dna.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => 
test_input_file("quoted-strings1.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("UnaSmithHIV-both.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("quoted-strings2.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("barns-combined.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("radical-whitespace.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("basic-bush.nex"), '-format' => 'nexus'); isa_ok($str,'Bio::AlignIO'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); $str = Bio::AlignIO->new( '-file' => test_input_file("radical-whitespace_02.nex"), '-format' => 'nexus'); selex.t100644000766000024 126714006427353 16260 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: selex.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 4); use_ok('Bio::AlignIO::selex'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # SELEX $str = Bio::AlignIO->new( '-file' => test_input_file("testaln.selex"), '-format' => 'selex'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'HSFAU/1-518', "selex format test "; $strout = Bio::AlignIO->new( '-file' => ">".test_output_file(), '-format' => 'selex'); $status = $strout->write_aln($aln); is $status, 1, "selex output test"; wise.t100644000766000024 200714006427353 
16253 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO
# -*-Perl-*- Test Harness script for Bioperl
# $Id: SearchIO_wise.t 11733 2007-10-26 18:22:10Z jason $

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 20);

    use_ok('Bio::SearchIO');
}

my $parser = Bio::SearchIO->new(-file     => test_input_file('genewise.out'),
                                -format   => 'wise',
                                -wisetype => 'genewise');
my $result = $parser->next_result;
my $hit = $result->next_hit;
is($result->query_name, 'SINFRUP00000067802');
is($hit->name, 'Scaffold_2042.1');

is($hit->score, 2054.68);
my $hsp = $hit->next_hsp;

is($hsp->query->start,22265);
is($hsp->query->end,22396);
is($hsp->query->strand,1);
is($hsp->query->score, 2054.68);

is($hsp->hit->start,1);
is($hsp->hit->end,44);
is($hsp->hit->strand,0);
is($hsp->hit->score, 2054.68);

$hsp = $hit->next_hsp;

is($hsp->query->start,24224);
is($hsp->query->end,24328);

is($hsp->hit->start,45);
is($hsp->hit->end,79);

$hsp = $hit->next_hsp;

is($hsp->query->start,24471);
is($hsp->query->end,24513);

is($hsp->hit->start,80);
is($hsp->hit->end,93);
waba.t100644000766000024 503414006427353 16221 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO
# -*-Perl-*- Test Harness script for Bioperl
# $Id: WABA.t 11525 2007-06-27 10:16:38Z sendu $

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 64);

    use_ok('Bio::SearchIO');
}

my $wabain = Bio::SearchIO->new(
    -format => 'waba',
    -file   => test_input_file('test.waba')
);

isa_ok($wabain, 'Bio::SearchIO') ;

# These won't look the same as the WABA file because Jim's code is 0 based
# while we (bioperl) are 1 based.
my @results = ( [ 'U57623', 'pair1_hs.fa', 'pair1_mm.fa', [ 'U02884', 3, [qw(3833 34 2972 1 243 3688 1 40.9)], [qw(4211 3022 6914 1 3705 6848 1 43.7)], [qw(2218 7004 9171 1 6892 8712 1 50.3)], ], ], [ 'X57152', 'pair9_hs.fa', 'pair9_mm.fa', [ 'X80685', 1, [qw(7572 4 5845 1 632 7368 1 46.8)], ], ] ); while ( my $wabar = $wabain->next_result ) { my @r = @{ shift @results }; is( $wabar->query_name, shift @r, 'query_name' ); is( $wabar->query_database, shift @r, 'query database' ); is( $wabar->database_name, shift @r, 'database name' ); while ( my $wabah = $wabar->next_hit ) { my (@h) = @{ shift @r }; is( $wabah->name, shift @h, 'name' ); is( $wabah->hsps(), shift @h, 'hsps' ); while ( my $wabahsp = $wabah->next_hsp ) { my (@hsp) = @{ shift @h }; is( $wabahsp->length('total'), shift @hsp , 'total length'); is( $wabahsp->query->start, shift @hsp , 'start' ); is( $wabahsp->query->end, shift @hsp , 'end' ); is( $wabahsp->strand('query'), shift @hsp , 'strand' ); is( $wabahsp->start('hit'), shift @hsp , 'start' ); is( $wabahsp->end('subject'), shift @hsp , 'end' ); is( $wabahsp->subject->strand, shift @hsp, 'strand' ); is( length( $wabahsp->query_string ), $wabahsp->length('total') , 'query string'); is( length( $wabahsp->hit_string ), $wabahsp->length('total') , 'hit_string' ); is( length( $wabahsp->hmmstate_string ), $wabahsp->length('total') , 'hmmstate string'); my $hs = $wabahsp->hit_string; is( $wabahsp->gaps('hit'), $hs =~ tr/\-// ); my $qs = $wabahsp->query_string; is( $wabahsp->gaps('query'), $qs =~ tr/\-// ); is( sprintf( "%.1f", $wabahsp->percent_identity ), shift @hsp ); } } } sim4.t100644000766000024 1071614006427353 16206 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO# -*-Perl-*- Test Harness script for Bioperl # $Id: Sim4.t 11525 2007-06-27 10:16:38Z sendu $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 102); use_ok('Bio::SearchIO'); } # parse align format 0 my $parser = Bio::SearchIO->new(-format => 'sim4', -file => 
test_input_file('crypto.sim4-0') ); my $r = $parser->next_result; is ($r->query_name, 'cn416'); is ($r->query_length, 630); my $hit = $r->next_hit; is ($hit->name, 'Contig147'); is ($hit->description, 'Contig147.fa'); is ($hit->length, 1086); my $hsp = $hit->next_hsp; is ($hsp->query->start, 36); is ($hsp->query->end, 132); is ($hsp->query->strand, 1); is ($hsp->hit->start, 191); is ($hsp->hit->end, 286); is ($hsp->hit->strand, 1); $hsp = $hit->next_hsp; is ($hsp->query->start, 133); is ($hsp->query->end, 191); is ($hsp->query->strand, 1); is ($hsp->hit->start, 343); is ($hsp->hit->end, 401); is ($hsp->hit->strand, 1); # parse align format 3 $parser = Bio::SearchIO->new(-format => 'sim4', -file => test_input_file('crypto.sim4-3') ); $r = $parser->next_result; is ($r->query_name, 'cn416'); is ($r->query_length, 630); $hit = $r->next_hit; is ($hit->name, 'Contig147'); is ($hit->description, 'Contig147.fa'); is ($hit->length, 1086); $hsp = $hit->next_hsp; is ($hsp->query->start, 36); is ($hsp->query->end, 132); is ($hsp->query->strand, 1); is ($hsp->hit->start, 191); is ($hsp->hit->end, 286); is ($hsp->hit->strand, 1); $hsp = $hit->next_hsp; is ($hsp->query->start, 133); is ($hsp->query->end, 191); is ($hsp->query->strand, 1); is ($hsp->hit->start, 343); is ($hsp->hit->end, 401); is ($hsp->hit->strand, 1); # parse align format 4 $parser = Bio::SearchIO->new(-format => 'sim4', -file => test_input_file('crypto.sim4-4') ); $r = $parser->next_result; is ($r->query_name, 'cn416'); is ($r->query_length, 630); $hit = $r->next_hit; is ($hit->name, 'Contig147'); is ($hit->length, 1086); $hsp = $hit->next_hsp; is ($hsp->query->start, 36); is ($hsp->query->end, 132); is ($hsp->query->strand, 1); is ($hsp->hit->start, 191); is ($hsp->hit->end, 286); is ($hsp->hit->strand, 1); $hsp = $hit->next_hsp; is ($hsp->query->start, 133); is ($hsp->query->end, 191); is ($hsp->query->strand, 1); is ($hsp->hit->start, 343); is ($hsp->hit->end, 401); is ($hsp->hit->strand, 1); # do the other 
sim4 files $parser = Bio::SearchIO->new(-format => 'sim4', -file => test_input_file('sim4.rev') ); $r = $parser->next_result; is ($r->query_name, '/nfs/disk21/birney/prog/wise2/example/human.rev'); is ($r->query_length, 5368); $hit = $r->next_hit; is ($hit->name, 'HSHNCPA1'); is ($hit->description, 'temp.cdna'); is ($hit->length, 1198); $hsp = $hit->next_hsp; is ($hsp->query->start, 486); is ($hsp->query->end, 503); is ($hsp->query->strand, 1); is ($hsp->hit->start, 10); is ($hsp->hit->end, 27); is ($hsp->hit->strand, -1); $hsp = $hit->next_hsp; is ($hsp->query->start, 1048); is ($hsp->query->end, 1117); is ($hsp->query->strand, 1); is ($hsp->hit->start, 194); is ($hsp->hit->end, 265); is ($hsp->hit->strand, -1); # do the other sim4 files fwd $parser = Bio::SearchIO->new(-format => 'sim4', -file => test_input_file('sim4.for.for') ); $r = $parser->next_result; is ($r->query_name, 'human.genomic'); is ($r->query_length, 5368); $hit = $r->next_hit; is ($hit->name, 'hs_est'); is ($hit->description, 'est.for'); is ($hit->length, 479); $hsp = $hit->next_hsp; is ($hsp->query->start, 695); is ($hsp->query->end, 813); is ($hsp->query->strand, 1); is ($hsp->hit->start, 1); is ($hsp->hit->end, 119); is ($hsp->hit->strand, 1); $hsp = $hit->next_hsp; is ($hsp->query->start, 1377); is ($hsp->query->end, 1500); is ($hsp->query->strand, 1); is ($hsp->hit->start, 120); is ($hsp->hit->end, 243); is ($hsp->hit->strand, 1); # do the other sim4 files fwd rev $parser = Bio::SearchIO->new(-format => 'sim4', -file => test_input_file('sim4.for.rev') ); $r = $parser->next_result; is ($r->query_name, 'human.genomic'); is ($r->query_length, 5368); $hit = $r->next_hit; is ($hit->name, 'REVCOMP'); is ($hit->description, 'hn_est.rev'); is ($hit->length, 479); $hsp = $hit->next_hsp; is ($hsp->query->start, 695); is ($hsp->query->end, 813); is ($hsp->query->strand, 1); is ($hsp->hit->start, 1); is ($hsp->hit->end, 119); is ($hsp->hit->strand, -1); $hsp = $hit->next_hsp; is ($hsp->query->start, 
1377); is ($hsp->query->end, 1500); is ($hsp->query->strand, 1); is ($hsp->hit->start, 120); is ($hsp->hit->end, 243); is ($hsp->hit->strand, -1); Storable.t100644000766000024 434414006427353 16353 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Root# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 35); use_ok('Bio::Root::Storable'); } foreach my $mode( "BINARY", "ASCII" ){ if( $mode eq "ASCII" ){ no warnings; $Bio::Root::Storable::BINARY = 0; } #------------------------------ # Test the easy bits that don't need file IO my $obj = Bio::Root::Storable->new(); ok defined($obj) && $obj->isa('Bio::Root::Storable'); eval { $obj->throw('Testing throw') }; ok $@ =~ /Testing throw/; # 'throw failed'; $obj->{_test} = "_TEST"; # Provide test attributes $obj->{__test} = "__TEST"; # my $state = $obj->serialise; ok length($state) > 0; my $clone = $obj->clone; ok defined($clone) and $clone->isa('Bio::Root::Storable'); ok $clone->{_test} eq "_TEST" && $clone->{__test} eq "__TEST"; #------------------------------ # Test standard file IO my $file = $obj->store; ok $file && -f $obj->statefile; my $retrieved; eval { $retrieved = Bio::Root::Storable->retrieve( $file ) }; ok defined($retrieved) && $retrieved->isa('Bio::Root::Storable'); ok $retrieved->{_test} eq "_TEST" && ! exists $retrieved->{__test}; my $skel = $obj->new_retrievable; ok defined($skel) && $skel->isa('Bio::Root::Storable'); ok ! exists $skel->{_test} && ! exists $skel->{__test}; ok $skel->retrievable; eval { $skel->retrieve }; ok ! $skel->retrievable; ok $skel->{_test} eq "_TEST" && ! 
exists $skel->{__test}; my $obj2 = Bio::Root::Storable->new(); $obj2->template('TEST_XXXXXX'); $obj2->suffix('.state'); my $file2 = $obj2->store; ok $file2 =~ /TEST_(\w{6})?\.state$/ and -f $file2; #------------------------------ # Test recursive file IO $obj->{_test_lazy} = $obj2; $obj->store; my $retrieved2; eval { $retrieved2 = Bio::Root::Storable->retrieve( $obj->token ) }; ok $retrieved2->{_test_lazy} && $retrieved2->{_test_lazy}->retrievable; #------------------------------ # Clean up # Should only be 2 object files; all others were clones in one way or another $obj->remove; ok ! -f $obj->statefile; $obj2->remove; ok ! -f $obj2->statefile; } cysprot.fa100755000766000024 472414006427353 16421 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>CYS1_DICDI MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHK ADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQT ESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCN PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII >ALEU_HORVU MAHARVLLLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGK SYESAAEVRRRFRIFSESLEEVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRD AAALPETKDWREDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNG GLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAF QVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCA IATCASYPVVAA >CATH_HUMAN MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHNN GNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGS CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQGK DGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVN HAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV >CATH_RAT MWTALPLLCAGAWLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRN 
HTFKMGLNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCW TFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNG QCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHA VLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASYPIPQV >CATL_HUMAN MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGK HSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWA FSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES CKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLV VGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV >CATL_RAT MTPLLLLAVLCLGTALATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGK HGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWA FSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS CKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLV VGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIVN >PAPA_CARPA MAMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFE IFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDW RQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQ YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGG IFVGPCGNKVDHAVAAVGYGPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN humts1.pal100644000766000024 375614006427353 16326 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataPalindromes of: HUMTS1 Sequence length is: 18596 Start at position: 1 End at position: 18596 Minimum length of Palindromes is: 10 Maximum length of Palindromes is: 100 Maximum gap between elements is: 100 Number of mismatches allowed in Palindrome: 0 Palindromes: 126 caaaaaaaaaaaaaaaa 142 ||||||||||||||||| 217 gtttttttttttttttt 201 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 215 tttttttttttttttt 200 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 214 tttttttttttttttt 199 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 213 tttttttttttttttt 198 127 aaaaaaaaaaaaaaaa 142 
|||||||||||||||| 212 tttttttttttttttt 197 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 211 tttttttttttttttt 196 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 210 tttttttttttttttt 195 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 209 tttttttttttttttt 194 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 208 tttttttttttttttt 193 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 207 tttttttttttttttt 192 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 206 tttttttttttttttt 191 127 aaaaaaaaaaaaaaaa 142 |||||||||||||||| 205 tttttttttttttttt 190 127 aaaaaaaaaaaaaaaagaccgccagggct 155 ||||||||||||||||||||||||||||| 204 ttttttttttttttttctggcggtcccga 176 2278 catttttaac 2287 |||||||||| 2351 gtaaaaattg 2342 3960 agcccagacc 3969 |||||||||| 4056 tcgggtctgg 4047 4367 gattgagaaa 4376 |||||||||| 4449 ctaactcttt 4440 4368 attgagaaat 4377 |||||||||| 4399 taactcttta 4390 4387 gaaatttctc 4396 |||||||||| 4411 ctttaaagag 4402 4408 tttctttgatt 4418 ||||||||||| 4456 aaagaaactaa 4446 4428 tttctttgatt 4438 ||||||||||| 4456 aaagaaactaa 4446 6809 cataggcatg 6818 |||||||||| 6839 gtatccgtac 6830 11270 atgtttccaa 11279 |||||||||| 11339 tacaaaggtt 11330 14171 ttacctgaat 14180 |||||||||| 14208 aatggactta 14199 seqxml.xml100644000766000024 225214006427353 16430 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data dystroglycan 1 AAGGC----UGAUGUC.....ACAU example description. 
AAGGCGAAA----AA*AAAAAGT.....CACJOXA another example description AAGGCGTTAAA----AAAAAAAGT.....CACTA NNNNACTG--AAATGTGCCCGGG multi_1.fa100644000766000024 161714006427353 16263 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>gi|239758|bbs|68379 glucocorticoid receptor, GR [human, Peptide Partial, 394 aa] MDSKESLTPGREENPSSVLAQERGDVMDFYKTLRGGATVKVSASSPSLAVASQSDSKQRRLLVDFPKGSV >gi|239752|bbs|68871 PML-3=putative zinc finger protein [human, Peptide, 802 aa] MPPPETPSEGRQPSPSPSPTERAPASEEEFQFLRCQQCQAEAKCPKLLPCLHTLCSGCLEASGMQCPICQ >gi|238775|bbs|65126 putative tyrosine kinase receptor=UFO [human, NIH3T3, Peptide, 894 aa] MAWRCPRMGRVPLAWCLALCGWACMAPRGTQAEESPFVGNPGNITGARGLTGTLRCQLQVQGEPPEVHWL >gi|239006|bbs|65162 alpha(1,3)-fucosyltransferase, ELFT [human, Peptide, 400 aa] MGAPWGSPTAAAGGRRGWRRGRGLPWTVCVLAAAGLTCTALITYACWGQLPPLPWASPTPSRPVGVLLWW >gi|237597|bbs|60089 putative adhesion molecule=ADMLX [human, Peptide, 679 aa] MVPGVPGAVLTLCLWLAASSGSWRPAPARLCAAAGRVAVCRERPARSCASRCLSLQITRISAFFQHFQNN >gi|237995|bbs|62046 NK-1 receptor [human, lung, Peptide, 407 aa] MDNVLPVDSDLSPNISTNTSEPNQFVQPAWEIVLWAAAYTVIVVTSVVGNVVVMWIILAHKRMRTVTNYF rel9.swiss100644000766000024 2161414006427353 16365 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID GCDH_CAEEL Reviewed; 409 AA. AC Q20772; DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot. DT 01-NOV-1996, sequence version 1. DT 31-OCT-2006, entry version 47. DE Probable glutaryl-CoA dehydrogenase, mitochondrial precursor DE (EC 1.3.99.7) (GCD). GN ORFNames=F54D5.7; OS Caenorhabditis elegans. OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; OC Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6239; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Bristol N2; RX MEDLINE=99069613; PubMed=9851916; DOI=10.1126/science.282.5396.2012; RG The C. elegans sequencing consortium; RT "Genome sequence of the nematode C. elegans: a platform for RT investigating biology."; RL Science 282:2012-2018(1998). 
CC -!- CATALYTIC ACTIVITY: Glutaryl-CoA + acceptor = crotonoyl-CoA + CC CO(2) + reduced acceptor. CC -!- COFACTOR: FAD (By similarity). CC -!- PATHWAY: Degradative pathway of L-lysine, L-hydroxylysine, and L- CC tryptophan metabolism. CC -!- INTERACTION: CC P39745:mpk-1; NbExp=1; IntAct=EBI-313068, EBI-321013; CC Q17446:pmk-1; NbExp=1; IntAct=EBI-313068, EBI-312987; CC -!- SUBCELLULAR LOCATION: Mitochondrion; mitochondrial matrix CC (Potential). CC -!- SIMILARITY: Belongs to the acyl-CoA dehydrogenase family. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; Z66513; CAA91333.1; -; Genomic_DNA. DR PIR; T22647; T22647. DR UniGene; Cel.30446; -. DR HSSP; Q06319; 1BUC. DR IntAct; Q20772; -. DR Ensembl; F54D5.7; Caenorhabditis elegans. DR KEGG; cel:F54D5.7; -. DR WormBase; WBGene00010052; F54D5.7. DR WormPep; F54D5.7; CE03411. DR GO; GO:0005515; F:protein binding; IPI:IntAct. DR InterPro; IPR006089; Acyl_CoA_DH. DR InterPro; IPR006091; Acyl_CoA_DH/ox_M. DR InterPro; IPR006090; Acyl_CoA_DH_1. DR InterPro; IPR006092; Acyl_CoA_DH_N. DR InterPro; IPR009075; AcylCo_DH/ox_C. DR InterPro; IPR013786; AcylCoA_DH/ox_N. DR InterPro; IPR009100; AcylCoA_DH/ox_NM. DR InterPro; IPR013764; AcylCoA_DH_1/2_C. DR Pfam; PF00441; Acyl-CoA_dh_1; 1. DR Pfam; PF02770; Acyl-CoA_dh_M; 1. DR Pfam; PF02771; Acyl-CoA_dh_N; 1. DR PROSITE; PS00072; ACYL_COA_DH_1; FALSE_NEG. DR PROSITE; PS00073; ACYL_COA_DH_2; 1. KW Complete proteome; FAD; Flavoprotein; Hypothetical protein; KW Mitochondrion; Oxidoreductase; Transit peptide. FT TRANSIT 1 ? Mitochondrion (Potential). FT CHAIN ? 409 Probable glutaryl-CoA dehydrogenase. FT /FTId=PRO_0000000530. FT ACT_SITE 388 388 Proton acceptor (Potential). 
SQ SEQUENCE 409 AA; 44964 MW; 4D06241FB6768069 CRC64; MLTRGFTSIG KIASRGLSST FYQDAFQLSD QLTEDERSLM LSAREYCQER LLPRVTEAYR TEKFDPSLIP EMGSMGLLGA PYQGYGCAGT STVGYGLIAR EVERVDSGYR STMSVQTSLV IGPIYNYGSE DQKQKYIPDL ASGKKIGCFG LTEPNHGSNP GGMETKATWD ETTKTYKLNG SKTWISNSPV SDVMVVWARS ARHNNKIKGF ILERGMKGLT TPKIEGKLSL RASITGQIAM DDVPVPEENL LPNAEGLQGP FGCLNNARLG IAWGALGAAE ECFHLARQYT LDRQQFGRPL AQNQLMQLKM ADMLTEISLG LQGCLRVSRL KDEGKVQSEQ ISIIKRNSCG KALEVARKAR DMLGGNGIVD EYHIMRHMVN LETVNTYEGT HDVHALILGR AITGLNGFC // ID Q41V66_FERAC Unreviewed; 607 AA. AC Q41V66; DT 27-SEP-2005, integrated into UniProtKB/TrEMBL. DT 27-SEP-2005, sequence version 1. DT 31-OCT-2006, entry version 6. DE Glycoside hydrolase, family 15. GN ORFNames=FaciDRAFT_1685; OS Ferroplasma acidarmanus Fer1. OC Archaea; Euryarchaeota; Thermoplasmata; Thermoplasmatales; OC Ferroplasmaceae; Ferroplasma. OX NCBI_TaxID=333146; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Fer1; RG US DOE Joint Genome Institute (JGI-PGF); RA Copeland A., Lucas S., Lapidus A., Barry K., Detter C., Glavina T., RA Hammon N., Israni S., Pitluck S., Richardson P.; RT "Sequencing of the draft genome and assembly of Ferroplasma RT acidarmanus fer1."; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Fer1; RG US DOE Joint Genome Institute (JGI-ORNL); RA Larimer F., Land M.; RT "Annotation of the draft genome assembly of Ferroplasma acidarmanus RT fer1."; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Fer1; RG US DOE Joint Genome Institute (JGI-PGF); RA Copeland A., Lucas S., Lapidus A., Barry K., Detter C., Glavina T., RA Hammon N., Israni S., Pitluck S., Richardson P.; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. 
CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABC04000001; EAM94575.1; -; Genomic_DNA. DR GO; GO:0004339; F:glucan 1,4-alpha-glucosidase activity; IEA:InterPro. DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW. DR GO; GO:0005976; P:polysaccharide metabolism; IEA:InterPro. DR InterPro; IPR008928; 6hp_glycosidase. DR InterPro; IPR011613; Glyco_hydro_15_rel. DR InterPro; IPR012343; Glyco_trans_sub. DR Pfam; PF00723; Glyco_hydro_15; 1. KW Hydrolase. SQ SEQUENCE 607 AA; 69495 MW; 8AC6297BA16ED500 CRC64; MGTYRGLYDL HDAYRSDYLK IANHGFIANN RTAALVGIDG TIDWACLPNF NSNPVFDSIL DARNGGYFKT SPVMESNVNQ YYEESTNILI TEFVNNNQVI LRLTDFLPTS SYSTITFPEI HRLIEAPYSD VEVSIDIKSH FNFGSGKTNI TRDRNGYIFS CTDDTLGIST NLKLKKGNGN VYSRIKVEKG SHEWIVVLSG VRQIGNVRQY ESYTRLEETR NYWSAWAGKI NYSGLYYDHV IRSALTLRGL FYDPTGMMVA APTTSLPEII GGERNWDYRY TWIRDTAYVV EALSLIGLND VATKFLYDIM SIVQKDKKVK TIYPVNGDSK LEEKKVNLSG YMDSIPVRIG NEASEQLQID QYGSIVNAVF RFHEAGGLVT TYLWDFLIEI LDTLKDIWKL PDSSIWEFRS EPKHYLYSKL ISWSAFNRAI KMGRELGYSA PYRTWHKIRE EIKNEIMEKG YNPDVKAFTQ YYGSDQMDAS VLRMPLTGII SAKDPRFVST LARVEAELKN PCGMFIRYHS DDGLKGHDNA FLLLSFWYVE DLILSGRIME AKETFENILD HSNHLMLFSE EINFNDCREM LGNFPQAITH LGVIRAAIKL DEALRGK // ID Q41US7_FERAC Unreviewed; 270 AA. AC Q41US7; DT 27-SEP-2005, integrated into UniProtKB/TrEMBL. DT 27-SEP-2005, sequence version 1. DT 31-OCT-2006, entry version 6. DE Potassium channel protein. GN ORFNames=FaciDRAFT_1443; OS Ferroplasma acidarmanus Fer1. OC Archaea; Euryarchaeota; Thermoplasmata; Thermoplasmatales; OC Ferroplasmaceae; Ferroplasma. OX NCBI_TaxID=333146; RN [1] RP NUCLEOTIDE SEQUENCE. 
RC STRAIN=Fer1; RG US DOE Joint Genome Institute (JGI-PGF); RA Copeland A., Lucas S., Lapidus A., Barry K., Detter C., Glavina T., RA Hammon N., Israni S., Pitluck S., Richardson P.; RT "Sequencing of the draft genome and assembly of Ferroplasma RT acidarmanus fer1."; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Fer1; RG US DOE Joint Genome Institute (JGI-ORNL); RA Larimer F., Land M.; RT "Annotation of the draft genome assembly of Ferroplasma acidarmanus RT fer1."; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Fer1; RG US DOE Joint Genome Institute (JGI-PGF); RA Copeland A., Lucas S., Lapidus A., Barry K., Detter C., Glavina T., RA Hammon N., Israni S., Pitluck S., Richardson P.; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABC04000002; EAM94333.1; -; Genomic_DNA. DR GO; GO:0005216; F:ion channel activity; IEA:UniProtKB-KW. DR GO; GO:0006813; P:potassium ion transport; IEA:InterPro. DR InterPro; IPR013099; Ion_trans_2. DR InterPro; IPR003148; TrkA_N. DR Pfam; PF07885; Ion_trans_2; 1. DR Pfam; PF02254; TrkA_N; 1. DR PROSITE; PS51201; RCK_N; 1. KW Ionic channel. 
SQ SEQUENCE 270 AA; 30497 MW; 528C4EA75C41DF75 CRC64; MQTITTVGYG DTPVYGLAGR ANGMLIMVIG IGSLGYLMAG LTSMLIDIRL SSKLGERMAA EKKHIVLCNY NESTKKVLDK IKYDGIDIVI LNENEVKGDN EYTYIKGSFL RENDLIRAGI KKASSVIIFS RSEDKEQMAM DAESILSAMI IRKLNPEIRI IGEILNPDSR EHASSFMDDI IIKGDVSSML IYSSIMIPGI PEFINDLLMS NSISEEDIDK KYASNTYREF ISNMEKENRI VLAFRKQDKI YLRENSDKKI DVDSYIFIKN // traits.tab100644000766000024 56114006427353 16354 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data"id" "assoc" disp "intermediate" 0 "blue" blue "red" 1 "blue" blue "red" 2 "blue" "blue" 3 "blue" blue "blue" 4 "blue" red "blue" 5 "blue" blue "red" 6 "blue" red "blue" 7 "blue" blue "blue" 8 "blue" red "blue" 9 "red" blue "red" 10 "red" red "red" 11 "red" blue "blue" 12 "red" red "blue" 13 "red" blue "red" 14 "red" red "red" 15 "red" blue "red" 16 "red" red "red" roa1.swiss100644000766000024 3041714006427353 16355 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID ROA1_HUMAN STANDARD; PRT; 371 AA. AC P09651; DT 01-MAR-1989 (Rel. 10, Created) DT 01-AUG-1990 (Rel. 15, Last sequence update) DT 01-NOV-1997 (Rel. 35, Last annotation update) DE HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A1 (HELIX-DESTABILIZING DE PROTEIN) (SINGLE-STRAND BINDING PROTEIN) (HNRNP CORE PROTEIN A1). GN HNRPA1. OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; OC Eutheria; Primates; Catarrhini; Hominidae; Homo. RN [1] RP SEQUENCE OF 1-250 AND 303-371 FROM N.A. RC TISSUE=LIVER; RX MEDLINE; 89342435. RA BIAMONTI G., BUVOLI M., BASSI M.T., MORANDI C., COBIANCHI F., RIVA S.; RT "Isolation of an active gene encoding human hnRNP protein A1. RT Evidence for alternative splicing."; RL J. Mol. Biol. 207:491-503(1989). RN [2] RP SEQUENCE OF 1-250 AND 303-371 FROM N.A. RC TISSUE=FIBROBLAST; RX MEDLINE; 88233978. RA BUVOLI M., BIAMONTI G., GHETTI A., RIVA S., BASSI M.T., HORANDI C.; RT "cDNA cloning of human hnRNP protein A1 reveals the existence of RT multiple mRNA isoforms."; RL Nucleic Acids Res. 
16:3751-3770(1988). RN [3] RP SEQUENCE OF 1-250 AND 303-371 FROM N.A. RC TISSUE=LUNG; RA KNUDSEN S.M., LEFFERS H.; RL Submitted (JUN-1994) to the EMBL/GenBank/DDBJ databases. RN [4] RP SEQUENCE OF 124-250 AND 303-371 FROM N.A. RC TISSUE=LIVER; RX MEDLINE; 87053868. RA RIVA S., MORANDI C., TSOULFAS P., PANDOLFO M., BIAMONTI G., RA MERRILL B., WILLIAMS K.R., MULTHAUP G., BEYREUTHER K., WERR H., RA HEINRICH B., SCHAEFER K.P.; RT "Mammalian single-stranded DNA binding protein UP I is derived from RT the hnRNP core protein A1."; RL EMBO J. 5:2267-2273(1986). RN [5] RP SEQUENCE OF 251-302 FROM N.A. RX MEDLINE; 90214633. RA BUVOLI M., COBIANCHI F., BESTAGNO M.G., MANGIAROTTI A., BASSI M.T., RA BIAMONTI G., RIVA S.; RT "Alternative splicing in the human gene for the core protein A1 RT generates another hnRNP protein."; RL EMBO J. 9:1229-1235(1990). RN [6] RP NUCLEAR LOCALIZATION DOMAIN. RX MEDLINE; 95247808. RA SIOMI H., DREYFUSS G.; RT "A nuclear localization domain in the hnRNP A1 protein."; RL J. Cell Biol. 129:551-560(1995). RN [7] RP NUCLEAR LOCALIZATION DOMAIN, AND NUCLEAR EXPORT. RX MEDLINE; 96067639. RA MICHAEL W.M., CHOI M., DREYFUSS G.; RT "A nuclear export signal in hnRNP A1: a signal-mediated, temperature- RT dependent nuclear protein export pathway."; RL Cell 83:415-422(1995). RN [8] RP NUCLEAR LOCALIZATION DOMAIN. RX MEDLINE; 95286702. RA WEIGHARDT F., BIAMONTI G., RIVA S.; RT "Nucleo-cytoplasmic distribution of human hnRNP proteins: a search RT for the targeting domains in hnRNP A1."; RL J. Cell Sci. 108:545-555(1995). RN [9] RP 3D-STRUCTURE MODELING OF 106-189. RX MEDLINE; 91099515. RA GHETTI A., BOLOGNESI M., COBIANCHI F., MORANDI C.; RT "Modeling by homology of RNA binding domain in A1 hnRNP protein."; RL FEBS Lett. 277:272-276(1990). RN [10] RP X-RAY CRYSTALLOGRAPHY (1.75 ANGSTROMS) OF 8-180. RX MEDLINE; 97307256. 
RA SHAMOO Y., KRUEGER U., RICE L.M., WILLIAMS K.R., STEITZ T.A.; RT "Crystal structure of the two RNA binding domains of human hnRNP A1 RT at 1.75-A resolution."; RL Nat. Struct. Biol. 4:215-222(1997). RN [11] RP X-RAY CRYSTALLOGRAPHY (1.9 ANGSTROMS) OF 6-181. RX MEDLINE; 97277240. RA XU R.M., JOKHAN L., CHENG X., MAYEDA A., KRAINER A.R.; RT "Crystal structure of human UP1, the domain of hnRNP A1 that contains RT two RNA-recognition motifs."; RL Structure 5:559-570(1997). CC -!- FUNCTION: INVOLVED IN THE PACKAGING OF PRE-MRNA INTO HNRNP CC PARTICLES, TRANSPORT OF POLY-A MRNA FROM THE NUCLEUS TO THE CC CYTOPLASM AND MAY MODULATE SPLICE SITE SELECTION. CC -!- SUBCELLULAR LOCATION: NUCLEAR. SHUTTLES CONTINUOUSLY BETWEEN THE CC NUCLEUS AND THE CYTOPLASM ALONG WITH MRNA. COMPONENT OF CC RIBONUCLEOSOMES. CC -!- ALTERNATIVE PRODUCTS: A1-A (SHOWN HERE) AND A1-B ARE OBTAINED BY CC ALTERNATIVE SPLICING OF THE SAME GENE. A1-A IS TWENTY TIMES MORE CC ABUNDANT THEN A1-B. CC -!- SIMILARITY: BELONGS TO THE A/B GROUP OF HNRNP, WHICH ARE BASIC AND CC GLY-RICH PROTEINS. CC -!- SIMILARITY: CONTAINS 2 RNA RECOGNITION MOTIFS (RNP). CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; X12671; CAA31191.1; -. DR EMBL; X06747; CAA29922.1; ALT_SEQ. DR EMBL; X04347; CAA27874.1; -. DR EMBL; X79536; CAA56072.1; -. DR PIR; S04617; S04617. DR PIR; A24894; A24894. DR PIR; S02061; S02061. 
DR PDB; 1HA1; 15-MAY-97. DR PDB; 1UP1; 17-SEP-97. DR AARHUS/GHENT-2DPAGE; 207; NEPHGE. DR AARHUS/GHENT-2DPAGE; 2114; NEPHGE. DR AARHUS/GHENT-2DPAGE; 3612; NEPHGE. DR MIM; 164017; -. DR PFAM; PF00076; rrm; 2. DR PROSITE; PS00030; RNP_1; 2. PE 1: Evidence at protein level; KW Nuclear protein; RNA-binding; Repeat; Ribonucleoprotein; KW Methylation; Alternative splicing; 3D-structure. FT INIT_MET 0 0 FT DOMAIN 3 93 GLOBULAR A DOMAIN. FT DOMAIN 94 184 GLOBULAR B DOMAIN. FT DOMAIN 194 371 GLY-RICH. FT DOMAIN 15 20 RNA-BINDING (RNP2) (BY SIMILARITY). FT DOMAIN 54 61 RNA-BINDING (RNP1). FT DOMAIN 106 111 RNA-BINDING (RNP2) (BY SIMILARITY). FT DOMAIN 145 152 RNA-BINDING (RNP1). FT DOMAIN 217 239 RNA-BINDING RGG-BOX. FT DOMAIN 319 356 NUCLEAR TARGETING SEQUENCE (M9). FT MOD_RES 193 193 METHYLATION (BY SIMILARITY). FT VARSPLIC 251 302 MISSING (IN FORM A1-A). FT MUTAGEN 325 325 G->A: NO NUCLEAR IMPORT NOR EXPORT. FT MUTAGEN 326 326 P->A: NO NUCLEAR IMPORT NOR EXPORT. FT MUTAGEN 333 334 GG->LL: NORMAL NUCLEAR IMPORT AND EXPORT. FT CONFLICT 139 139 R -> P (IN REF. 4). SQ SEQUENCE 371 AA; 38715 MW; ECBA15FB CRC32; SKSESPKEPE QLRKLFIGGL SFETTDESLR SHFEQWGTLT DCVVMRDPNT KRSRGFGFVT YATVEEVDAA MNARPHKVDG RVVEPKRAVS REDSQRPGAH LTVKKIFVGG IKEDTEEHHL RDYFEQYGKI EVIEIMTDRG SGKKRGFAFV TFDDHDSVDK IVIQKYHTVN GHNCEVRKAL SKQEMASASS SQRGRSGSGN FGGGRGGGFG GNDNFGRGGN FSGRGGFGGS RGGGGYGGSG DGYNGFGNDG GYGGGGPGYS GGSRGYGSGG QGYGNQGSGY GGSGSYDSYN NGGGRGFGGG SGSNFGGGGS YNDFGNYNNQ SSNFGPMKGG NFGGRSSGPY GGGGQYFAKP RNQGGYGGSS SSSSYGSGRR F // ID A2S3_RAT STANDARD; PRT; 913 AA. AC Q8R2H7; Q8R2H6; Q8R4G3; DT 28-FEB-2003 (Rel. 41, Created) DT 15-MAR-2004 (Rel. 43, Last sequence update) DT 15-MAR-2004 (Rel. 43, Last annotation update) DE Amyotrophic lateral sclerosis 2 chromosomal region candidate gene DE protein 3 homolog (GABA-A receptor interacting factor-1) (GRIF-1) (O- DE GlcNAc transferase-interacting protein of 98 kDa). GN ALS2CR3 OR GRIF1 OR OIP98. OS Rattus norvegicus (Rat). 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116; RN [1] RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), SUBCELLULAR LOCATION, AND RP INTERACTION WITH GABA-A RECEPTOR. RC TISSUE=Brain; RX MEDLINE=22162448; PubMed=12034717; RA Beck M., Brickley K., Wilkinson H.L., Sharma S., Smith M., RA Chazot P.L., Pollard S., Stephenson F.A.; RT "Identification, molecular cloning, and characterization of a novel RT GABAA receptor-associated protein, GRIF-1."; RL J. Biol. Chem. 277:30079-30090(2002). RN [2] RP REVISIONS TO 579 AND 595-596, AND VARIANTS VAL-609 AND PRO-820. RA Stephenson F.A.; RL Submitted (FEB-2003) to the EMBL/GenBank/DDBJ databases. RN [3] RP SEQUENCE FROM N.A. (ISOFORM 3), INTERACTION WITH O-GLCNAC TRANSFERASE, RP AND O-GLYCOSYLATION. RC STRAIN=Sprague-Dawley; TISSUE=Brain; RX MEDLINE=22464403; PubMed=12435728; RA Iyer S.P.N., Akimoto Y., Hart G.W.; RT "Identification and cloning of a novel family of coiled-coil domain RT proteins that interact with O-GlcNAc transferase."; RL J. Biol. Chem. 278:5399-5409(2003). CC -!- SUBUNIT: Interacts with GABA-A receptor and O-GlcNac transferase. CC -!- SUBCELLULAR LOCATION: Cytoplasmic. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=3; CC Name=1; Synonyms=GRIF-1a; CC IsoId=Q8R2H7-1; Sequence=Displayed; CC Name=2; Synonyms=GRIF-1b; CC IsoId=Q8R2H7-2; Sequence=VSP_003786, VSP_003787; CC Name=3; CC IsoId=Q8R2H7-3; Sequence=VSP_003788; CC -!- PTM: O-glycosylated. CC -!- SIMILARITY: TO HUMAN OIP106. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. 
There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; AJ288898; CAC81785.2; -. DR EMBL; AJ288898; CAC81786.2; -. DR EMBL; AF474163; AAL84588.1; -. DR GO; GO:0005737; C:cytoplasm; IEP. DR GO; GO:0005634; C:nucleus; IDA. DR GO; GO:0005886; C:plasma membrane; IEP. DR GO; GO:0005478; F:intracellular transporter activity; NAS. DR GO; GO:0005515; F:protein binding; IPI. DR GO; GO:0005102; F:receptor binding; IPI. DR GO; GO:0006836; P:neurotransmitter transport; NAS. DR GO; GO:0006493; P:O-linked glycosylation; IDA. DR GO; GO:0006605; P:protein targeting; IDA. DR GO; GO:0006357; P:regulation of transcription from Pol II pro...; IDA. DR InterPro; IPR006933; HAP1_N. DR Pfam; PF04849; HAP1_N; 1. KW Coiled coil; Alternative splicing; Polymorphism. FT DOMAIN 134 355 COILED COIL (POTENTIAL). FT DOMAIN 502 519 COILED COIL (POTENTIAL). FT VARSPLIC 653 672 VATSNPGKCLSFTNSTFTFT -> ALVSHHCPVEAVRAVHP FT TRL (in isoform 2). FT /FTId=VSP_003786. FT VARSPLIC 673 913 Missing (in isoform 2). FT /FTId=VSP_003787. FT VARSPLIC 620 687 VQQPLQLEQKPAPPPPVTGIFLPPMTSAGGPVSVATSNPGK FT CLSFTNSTFTFTTCRILHPSDITQVTP -> GSAASSTGAE FT ACTTPASNGYLPAAHDLSRGTSL (in isoform 3). FT /FTId=VSP_003788. FT VARIANT 609 609 E -> V. FT VARIANT 820 820 S -> P. 
SQ SEQUENCE 913 AA; 101638 MW; D0E135DBEC30C28C CRC64; MSLSQNAIFK SQTGEENLMS SNHRDSESIT DVCSNEDLPE VELVNLLEEQ LPQYKLRVDS LFLYENQDWS QSSHQQQDAS ETLSPVLAEE TFRYMILGTD RVEQMTKTYN DIDMVTHLLA ERDRDLELAA RIGQALLKRN HVLSEQNESL EEQLGQAFDQ VNQLQHELSK KEELLRIVSI ASEESETDSS CSTPLRFNES FSLSQGLLQL DMMHEKLKEL EEENMALRSK ACHIKTETFT YEEKEQKLIN DCVNELRETN AQMSRMTEEL SGKSDELLRY QEEISSLLSQ IVDLQHKLKE HVIEKEELRL HLQASKDAQR QLTMELHELQ DRNMECLGML HESQEEIKEL RNKAGPSAHL CFSQAYGVFA GESLAAEIEG TMRKKLSLDE ESVFKQKAQQ KRVFDTVKVA NDTRGRSVTF PVLLPIPGSN RSSVIMTAKP FESGVQQTED KTLPNQGSST EVPGNSHPRD PPGLPEDSDL ATALHRLSLR RQNYLSEKQF FAEEWERKLQ ILAEQEEEVS SCEALTENLA SFCTDQSETT ELGSAGCLRG FMPEKLQIVK PLEGSQTLHH WQQLAQPNLG TILDPRPGVI TKGFTQMPKD AVYHISDLEE DEEVGITFQV QQPLQLEQKP APPPPVTGIF LPPMTSAGGP VSVATSNPGK CLSFTNSTFT FTTCRILHPS DITQVTPSSG FPSLSCGSSA GSASNTAVNS PAASYRLSIG ESITNRRDST ITFSSTRSLA KLLQERGISA KVYHSPASEN PLLQLRPKAL ATPSTPPNSP SQSPCSSPVP FEPRVHVSEN FLASRPAETF LQEMYGLRPS RAPPDVGQLK MNLVDRLKRL GIARVVKTPV PRENGKSREA EMGLQKPDSA VYLNSGGSLL GGLRRNQSLP VMMGSFGAPV CTTSPKMGIL KED // test.fastq100644000766000024 524214006427353 16416 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data@FC12044_91407_8_200_406_24 GTTAGCTCCCACCTTAAGATGTTTA +FC12044_91407_8_200_406_24 SXXTXXXXXXXXXTTSUXSSXKTMQ @FC12044_91407_8_200_720_610 CTCTGTGGCACCCCATCCCTCACTT +FC12044_91407_8_200_720_610 OXXXXXXXXXXXXXXXXXTSXQTXU @FC12044_91407_8_200_345_133 GATTTTTTAACAATAAACGTACATA +FC12044_91407_8_200_345_133 OQTOOSFORTFFFIIOFFFFFFFFF @FC12044_91407_8_200_106_131 GTTGCCCAGGCTCGTCTTGAACTCC +FC12044_91407_8_200_106_131 XXXXXXXXXXXXXXSXXXXISTXQS @FC12044_91407_8_200_916_471 TGATTGAAGGTAGGGTAGCATACTG +FC12044_91407_8_200_916_471 XXXXXXXXXXXXXXXUXXUSXXTXW @FC12044_91407_8_200_57_85 GCTCCAATAGCGCAGAGGAAACCTG +FC12044_91407_8_200_57_85 XFXMXSXXSXXXOSQROOSROFQIQ @FC12044_91407_8_200_10_437 GCTGCTTGGGAGGCTGAGGCAGGAG +FC12044_91407_8_200_10_437 USXSXXXXXXUXXXSXQXXUQXXKS @FC12044_91407_8_200_154_436 AGACCTTTGGATACAATGAACGACT 
+FC12044_91407_8_200_154_436 MKKMQTSRXMSQTOMRFOOIFFFFF @FC12044_91407_8_200_336_64 AGGGAATTTTAGAGGAGGGCTGCCG +FC12044_91407_8_200_336_64 STQMOSXSXSQXQXXKXXXKFXFFK @FC12044_91407_8_200_620_233 TCTCCATGTTGGTCAGGCTGGTCTC +FC12044_91407_8_200_620_233 XXXXXXXXXXXXXXXXXXXXXSXSW @FC12044_91407_8_200_902_349 TGAACGTCGAGACGCAAGGCCCGCC +FC12044_91407_8_200_902_349 XMXSSXMXXSXQSXTSQXFKSKTOF @FC12044_91407_8_200_40_618 CTGTCCCCACGGCGGGGGGGCCTGG +FC12044_91407_8_200_40_618 TXXXXSXXXXXXXXXXXXXRKFOXS @FC12044_91407_8_200_83_511 GATGTACTCTTACACCCAGACTTTG +FC12044_91407_8_200_83_511 SOXXXXXUXXXXXXQKQKKROOQSU @FC12044_91407_8_200_76_246 TCAAGGGTGGATCTTGGCTCCCAGT +FC12044_91407_8_200_76_246 XTXTUXXXXXRXXXTXXSUXSRFXQ @FC12044_91407_8_200_303_427 TTGCGACAGAGTTTTGCTCTTGTCC +FC12044_91407_8_200_303_427 XXQROXXXXIXFQXXXOIQSSXUFF @FC12044_91407_8_200_31_299 TCTGCTCCAGCTCCAAGACGCCGCC +FC12044_91407_8_200_31_299 XRXTSXXXRXXSXQQOXQTSQSXKQ @FC12044_91407_8_200_553_135 TACGGAGCCGCGGGCGGGAAAGGCG +FC12044_91407_8_200_553_135 XSQQXXXXXXXXXXSXXMFFQXTKU @FC12044_91407_8_200_139_74 CCTCCCAGGTTCAAGCGATTATCCT +FC12044_91407_8_200_139_74 RMXUSXTXXQXXQUXXXSQISISSO @FC12044_91407_8_200_108_33 GTCATGGCGGCCCGCGCGGGGAGCG +FC12044_91407_8_200_108_33 OOOSSXXSXXOMKMOFMKFOKFFFF @FC12044_91407_8_200_980_965 ACAGTGGGTTCTTAAAGAAGAGTCG +FC12044_91407_8_200_980_965 TOSSRXXXSSMSXMOMXIRXOXFFS @FC12044_91407_8_200_981_857 AACGAGGGGCGCGACTTGACCTTGG +FC12044_91407_8_200_981_857 RXMSSXXXXSXQXQXFSXQFQKMXS @FC12044_91407_8_200_8_865 TTTCCCACCCCAGGAAGCCTTGGAC +FC12044_91407_8_200_8_865 XXXFKOROMKOORMIMRIIKKORFF @FC12044_91407_8_200_292_484 TCAGCCTCCGTGCCCAGCCCACTCC +FC12044_91407_8_200_292_484 XQXOSXXXXXUXXXXIXXXXQTOXF @FC12044_91407_8_200_675_16 CTCGGGAGGCTGAGGCAGGGGGGTT +FC12044_91407_8_200_675_16 OXTXXXSXXQXXOXXKMXXMXOKQF @FC12044_91407_8_200_285_136 CCAAATCTTGAATTGTAGCTCCCCT +FC12044_91407_8_200_285_136 OSXOQXXXXXSXXUXXTXXXXTRMS testaln.po100644000766000024 3033114006427353 16426 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataVERSION=clustalw NAME=NoName TITLE=NoName LENGTH=879 SOURCECOUNT=6 SOURCENAME=P84139 SOURCEINFO=420 0 0 -1 SOURCENAME=P814153 SOURCEINFO=420 0 0 -1 SOURCENAME=P851414 SOURCEINFO=60 154 0 -1 SOURCENAME=P841414 SOURCEINFO=60 154 0 -1 SOURCENAME=BAB68554 SOURCEINFO=141 42 0 -1 SOURCENAME=gb|443893|124775 SOURCEINFO=331 2 0 -1 M:S0S1 N:L0S0S1A2 M:S5A1 E:L1S0S1A4 R:L2S5A3 G:L3S0S1A6 F:L4S5A5 E:L5S0A8 M:L5S1A9 R:L6S5A7 H:L7L8S0S1A11 F:L9S5A10 Q:L10S0S1A13 G:L11S5A12 I:L12S0S1A15 V:L13S5A14 K:L14S0S1A17 V:L15S5A16 L:L16S0S1A19 V:L17S5A18 D:L18S0S1A21 P:L19S5A20 E:L20S0A23 V:L20S1A24 P:L21S5A22 L:L22L23S0S1A26 A:L24S5A25 F:L25S0S1A28 V:L26S5A27 E:L27S0S1A30 A:L28S5A29 K:L29S0S1A32 G:L30S5A31 L:L31S0S1A34 A:L32S5A33 L:L33S0S1A36 R:L34S5A35 R:L35S0S1A38 P:L36S5A37 A:L37S0S1A40 E:L38S5A39 R:L39S0S1A42 M:S4A43 L:L40S5A41 K:L41S0S1A45 L:L42L43S4S5A44 I:L44S0S1A47 T:L45S4A48 V:L45S5A46 F:L46S0S1A50 E:L47S4A51 V:L48S5A49 K:L49S0S1A53 D:L50S4A54 G:L51S5A52 N:L52S0S1A56 D:L53S4A57 S:L54S5A55 K:L55L56S0S1S4A59 R:L57S5A58 D:L58S0S1A61 Q:L58S4A62 P:L59S5A60 V:L60S0S1A64 L:L61S4A65 E:L62S5A63 L:L63L65S0S1S5A67 I:L64S4A66 R:L66S0S1A69 Q:L67S4A70 G:L66S5A68 H:L68L69S0S1S4 S:L71S0S1A73 V:L71S4A74 R:L70S5A72 Y:L72S0S1A76 W:L73L74S4S5A75 T:L75S0S1A78 E:L76S4S5A77 P:L77L78S0S1S5A80 K:L78S4A79 K:L79S0S1A82 V:L80S4A83 R:L79S5A81 D:L81S0S1A85 L:L82S4A86 G:L83S5A84 L:L84S0S1A88 E:L85S4A89 A:L86S5A87 P:L87S0S1A91 H:L88S4A92 V:L89S5A90 L:L90S0A94 H:L90S1A95 Q:L91S4A96 R:L92S5A93 R:L93L94S0S1A98 E:L95S4A99 L:L96S5A97 H:L97S0S1A101 D:L98S4A102 R:L99S5A100 E:L100S0S1A104 F:L101S4A105 P:L102S5A103 Q:L103S0S1A107 G:L104S4A108 A:L105S5A106 I:L106S0S1A110 A:L107S4A111 G:L108S5A109 E:L109L110S0S1S4A113 T:L111S5A112 T:L112S0S1A115 A:L112L113S4S5A114 L:L114L115S0S1S4A117 A:L115S5A116 A:L116S0S1A119 E:L116S4A120 G:L117S5A118 Q:L118S0S1A122 R:L119S4A123 D:L120S5A121 I:L121S0S1A125 M:L122S4A126 G:L123S5A124 L:L124S0S1A128 F:L125S4A129 A:L126S5A127 
V:L127S0S1A131 I:L128S4A132 L:L129S5A130 P:L130S0S1A134 V:L131S4A135 A:L132S5A133 V:L133S0S1A137 Y:L134S4A138 L:L135S5A136 L:L136S0S1A140 P:L137S4A141 Q:L138S5A139 R:L139S0S1A143 S:L140S4A144 E:L141S5A142 G:L142S0S1A146 T:L143S4A147 P:L144S5A145 E:L145S0S1A149 K:L146S4A150 G:L147S5A148 T:L148L149S0S1S4A152 L:L150S5A151 P:L151S0S1A154 M:S2S3A155 Y:L151S4A156 W:L152S5A153 S:L153S0S1A158 K:L154S2S3A159 F:L155S4A160 L:L156S5A157 N:L157S0S1A162 I:L158S2S3A163 P:L159S4A164 G:L160S5A161 I:L161S0S1A166 V:L162S2S3A167 H:L163S4A168 E:L164S5A165 F:L165L167S0S1S4A170 W:L166S2S3A171 V:L168S5A169 V:L169S0S1A173 C:L170S2S3A174 D:L169S4A175 E:L171S5A172 Y:L172S0S1A177 G:L173S2S3A178 L:L174L175S4S5A176 G:L176S0S1A180 H:L177L178S2S3S4A181 A:L178S5A179 H:L180S4 K:L179S0S1A184 A:L180L181S2S3S5A185 D:L182S4A183 T:L183S0S1A187 C:L184S2S3A188 S:L185S4A189 E:L184S5A186 G:L186S0S1A191 F:L187S2S3A192 E:L188L189S4S5A190 T:L190S0S1A194 L:L191S2S3A195 Q:L192S4A196 A:L192S5A193 G:L193S0S1A198 V:L194S2S3A199 I:L195S4A200 A:L196S5A197 K:L197S0S1A202 E:L198S2S3A203 R:L199S4A204 Q:L200S5A201 T:L201S0S1A206 D:L202L204S2S3S5A207 H:L203S4A205 V:L205S0S1A209 R:L206S2S3A210 H:L207S4A211 G:L206S5A208 T:L208S0S1A213 G:L209L210S2S3S4A214 A:L211S5A212 V:L212S0S1A216 T:L213S2S3A217 K:L213S4A218 E:L214S5A215 K:L215L216L217S0S1S2S3S4A220 P:L218S5A219 G:L220S5 F:L219S0S1A223 I:L219S2S3A224 V:L219S4A225 R:L221S5A222 V:L222L224L225S0S1S4S5A227 L:L223S2S3A226 T:L226S0S1A229 I:L227S2S3A230 G:L226S4A231 D:L226S5A228 E:L228S0S1A233 D:L229S2S3A234 A:L230S4A235 T:L231S5A232 E:L232S0S1A237 P:L233S2S3A238 L:L234S4A239 F:L235S5A236 L:L236S0S1A241 Y:L237S2S3A242 G:L238S4A243 W:L239S5A240 K:L240S0S1A245 P:L241S2S3A246 D:L242S4A247 Y:L243S5A244 R:L244S0S1A249 D:L245S2S3A250 A:L246S4A251 K:L247S5A248 I:L248S0S1A253 V:L249L250S2S3S4A254 F:L251S5A252 S:L252S0S1A256 D:L253S2S3A257 K:L253S4A258 L:L254S5A255 E:L255L256S0S1S2S3A260 H:L257S4A261 K:L258S5A259 K:L259S0S1A263 D:L259S2S3A264 I:L260S4A265 R:L261S5A262 Y:L262S0S1A267 
R:L263S2S3A268 D:L264S4A269 E:L265S5A266 N:L266L268S0S1S4A271 I:L267S2S3A272 P:L269S5A270 I:L270S0S1A274 G:L271L272S2S3S5A275 L:L270S4A273 P:L273S0S1A277 K:L274S2S3A278 S:L275S4A279 G:L274S5A276 V:L276L277S0S1S2S3A281 A:L278S4A282 E:L279S5A280 D:L280S0S1S2S3A284 T:L281S4A285 L:L282S5A283 V:L283S0S1A287 Y:L283S2S3A288 L:L284S4A289 S:L285S5A286 I:L286L287S0S1S2S3A291 S:L288S4A292 W:L289S5A290 Y:L290S0S1A294 L:L290S2S3A295 E:L291L292S4S5A293 I:L293S0S1A297 Q:L294S2A298 V:L294S3A299 L:L295S4A300 G:L295S5A296 N:L296L300S0S1S5A302 T:L297L298S2S3A303 S:L299S4A301 C:L301S0S1A305 H:L302S2S3A306 N:L303S4A307 G:L301S5A304 E:L304L305S0S1S2S3A309 L:L306S4A310 P:L307S5A308 I:L308S0S1A312 H:L308L309L310S2S3S4S5A311 V:L311S0S1A314 M:L312S2S3A315 C:L312S4A316 H:L312S5A313 D:L313L314L316S0S1S2S3S5A318 Y:L315S4A317 T:L317S0S1A320 R:L317S5A319 H:L317L319S0S1S2S3A322 N:L318S4A323 C:L320S5A321 Y:L321S0S1S2S3A325 L:L322S4A326 C:L323S5A324 R:L324L325S0S1S4A328 G:L324S2S3A329 T:L326S5A327 V:L327S0S1S4A331 K:L328S2S3A332 Y:L329S5A330 L:L330S0S1A334 T:L331S2S3A335 D:L330S4A336 N:L332S5A333 A:L333S0S1A338 P:L334L335S2S3S4A339 E:L336S5A337 N:L337L339S0S1S5A341 L:L338S2S3A342 V:L338S4A340 I:L340L341S0S1S2S3A344 N:L340L342S4S5A343 V:L343S0S1A346 A:L343S2S3A347 F:L344S4A348 L:L344S5A345 N:L345S0S1A350 K:L346L347S2S3S4A351 V:L348S5A349 Y:L349S0S1A353 L:L350S2S3S4A354 D:L351S5A352 F:L352S0S1A356 S:L353S2S3A357 L:L353S4A358 G:L354S5A355 K:L355S0S1A360 D:L356S2S3A361 S:L357S4A362 V:L358S5A359 D:L359S0S1A364 H:L361S4A365 Y:L362S5A363 E:L363S0S1A367 C:L364L365S4S5A366 T:L366S0S1A369 F:L367S4A370 L:L367S5A368 G:L368S0S1A372 Q:L369S4A373 P:L370S5A371 I:L371L373S0S1S5A375 V:L372S4A374 G:L374S0S5A377 E:L374S1A378 V:L375S4A376 V:L376L377S0S1A380 L:L378S4A379 P:L379S0S1A382 G:L380S4A381 M:L381S0S1A384 A:L382S4A383 V:L383S0S1A386 H:L376L384S4S5A385 G:L385S0S1A388 L:L386S4A389 W:L386S5A387 W:L387S0S1A391 G:L388L389S4S5A390 P:L390S0S1A393 E:L391S5A392 T:L392S0S1A395 A:L393S5A394 D:L394S0S1A397 R:L391S4A398 
T:L395S5A396 E:L396L397S0S1S4A400 G:L398S5A399 V:L399S0S1A402 Y:L399S4A403 H:L400S5A401 Y:L401S0S1A405 T:L402L403S4S5A404 A:L404S0S1A407 P:L405S4A408 N:L405S5A406 K:L406S0S1A410 Q:L407S4A411 E:L408S5A409 L:L409S0S1A413 V:L410S4A414 M:L411S5A412 K:L412L414S0S1S5A416 Q:L413S4A415 Q:L415S0S1A418 V:L416S4A419 H:L415S5A417 V:L417S0S1A421 A:L418S4A422 T:L419S5A420 I:L420S0S1A424 Y:L421S4A425 T:L422S5A423 D:L423L424L425S0S1S4S5 M:L426S0S1A428 K:L426S4A429 F:L426S5A427 K:L427S0S1A431 F:L428S4A432 Y:L429S5A430 E:L430S0S1A434 L:L431S4A435 F:L432S5A433 R:L433S0S1A437 A:L434S4A438 N:L435S5A436 F:L436S0S1A440 A:L437S4A441 I:L438S5A439 V:L439L440S0S1S4A443 A:L441S5A442 I:L442S0S1A445 S:L442S4A446 G:L443S5A444 I:L444S0S1A448 A:L445S4A449 H:L446S5A447 V:L447L448S0S1S4A451 Q:L449S5A450 L:L450S0S1S4A453 A:L451S5A452 D:L452S0S1A455 A:L452S4A456 M:L453S5A454 E:L454L455S0S1S4A458 H:L456S5A457 I:L457S0S1A460 K:L457S4A461 Y:L458S5A459 D:L459S0S1A463 Y:L460S4A464 S:L461S5A462 K:L462S0S1A466 R:L463L464S4S5A465 L:L465S0S1A468 I:L466S5A467 V:L467S0S1A470 L:L468S5A469 K:L469S0S1A472 P:L470S5A471 K:L471S0S1A474 N:L472S5A473 S:L473S0S1A476 I:L474S5A475 G:L475S0S1A478 W:L476S5A477 D:L477S0S1A480 L:L478S5A479 E:L479S0S1A482 G:L480S5A481 V:L481S0S1A484 S:L482S5A483 L:L483S0S1A486 C:L484S5A485 Y:L485S0S1A488 P:L486S5A487 S:L487S0S1A490 R:L488S5A489 L:L489S0S1A492 Q:L490S5A491 T:L491S0S1A494 V:L492S5A493 R:L493S0S1A496 E:L494S5A495 I:L495S0S1A498 H:L496S5A497 N:L497S0S1A500 V:L498S5A499 T:L499L500S0S1S5 E:L501S0S1A503 I:L501S5A502 L:L502S0S1A505 K:L503S5A504 K:L504S0S1A507 L:L505S5A506 R:L506S0S1A509 K:L507S5A508 A:L508S0S1A511 H:L509S5A510 K:L510S0S1A513 E:L511S5A512 V:L512S0S1A515 L:L513S5A514 S:L514S0S1A517 G:L515S5A516 V:L516S0S1A519 I:L517S5A518 I:L518S0S1A521 T:L519S5A520 G:L520S0S1A523 A:L521S5A522 I:L522S0S1A525 V:L523S5A524 S:L524S0S1A527 M:L525S5A526 N:L526L527S0S1S5 D:L528S0S1 L:L529S0S1A531 F:L528S5A530 K:L530S0S1A533 Q:L531S5A532 F:L532S0S1A535 T:L533S5A534 K:L534S0S1A537 E:L535S5A536 
E:L536S0S1A539 W:L537S5A538 Y:L538S0S1A541 D:L539S5A540 L:L540S0S1A543 I:L541S5A542 D:L542S0S1A545 V:L543S5A544 P:L544S0S1A547 Q:L545S5A546 R:L546S0S1A549 N:L547S5A548 V:L548S0S1A551 S:L549S5A550 L:L550S0S1A553 S:L551S5A552 S:L552S0S1A555 G:L553S5A554 S:L554S0S1A557 C:L555S5A556 L:L556S0S1A559 N:L557S5A558 S:L558S0S1A561 R:L559S5A560 E:L560S0S1A563 Y:L561S5A562 E:L562S0S1A565 P:L563S5A564 E:L564L565S0S1S5 V:L566S0S1A568 P:L566S5A567 V:L567S0S1A570 M:L568S5A569 F:L569S0S1A572 T:L570S5A571 P:L571L572S0S1S5 P:L573S0S1A575 D:L573S5A574 Y:L574S0S1A577 T:L575S5A576 D:L576S0S1A579 M:L577S5A578 A:L578S0S1A581 I:L579S5A580 N:L580S0S1A583 K:L581S5A582 Q:L582S0S1A585 L:L583S5A584 L:L584S0S1A587 Y:L585S5A586 R:L586L587S0S1S5 D:L588S0S1A590 E:L588S5A589 I:L589S0S1A592 E:L590S5A591 L:L591S0S1A594 G:L592S5A593 T:L593S0S1A596 L:L594S5A595 Q:L595S0S1A598 A:L596S5A597 R:L597S0S1A600 Y:L598S5A599 A:L599S0S1A602 I:L600S5A601 E:L601S0S1A604 W:L602S5A603 E:L603S0S1A606 M:L604S5A605 A:L605S0S1A608 P:L606S5A607 F:L607S0S1 Y:L609S0S1A611 T:L608S5A610 P:L610L611S0S1S5 G:L612S0S1A614 D:L612S5A613 V:L613S0S1A616 M:L614S5A615 L:L615S0S1A618 S:L616S5A617 D:L617S0S1A620 T:L618S5A619 E:L619L620S0S1S5 G:L621S0S1S5 V:L622S0S1A624 R:L622S5A623 I:L623S0S1A626 V:L624S5A625 P:L625S0S1A628 Q:L626S5A627 L:L627S0S1A630 M:L628S5A629 C:L629S0S1A632 L:L630S5A631 A:L631S0S1A634 P:L632S5A633 A:L633S0S1A636 Q:L634S5A635 L:L635S0S1A638 A:L636S5A637 A:L637S0S1A640 V:L638S5A639 A:L639S0S1A642 C:L640S5A641 R:L641S0S1A644 L:L642S5A643 E:L643S0S1A646 L:L644S5A645 H:L645L646S0S1S5 G:L647S0S1A649 A:L647S5A648 D:L648S0S1A651 L:L649S5A650 A:L650S0S1A653 L:L651S5A652 R:L652S0S1A655 E:L653S5A654 K:L654L655S0S1S5 A:L656S0S1A658 G:L656S5A657 L:L657S0S1A660 H:L658S5A659 D:L659S0S1A662 I:L660S5A661 L:L661S0S1A664 V:L662S5A663 L:L663S0S1A666 Y:L664S5A665 R:L665S0S1 V:L667S0S1 A:L668S0S1 G:L669S0S1 E:L670S0S1 I:L671S0S1A673 V:L666S5A672 A:L672S0S1A675 H:L673S5A674 E:L674S0S1A677 C:L675S5A676 R:L676S0S1A679 N:L677S5A678 
E:L678S0S1A681 A:L679S5A680 G:L680L681S0S1S5 A:L682S0S1A684 V:L682S5A683 S:L683S0S1A686 G:L684S5A685 K:L685S0S1A688 R:L686S5A687 V:L687S0S1A690 S:L688S5A689 T:L689L690S0S1S5 E:L691S0S1A693 A:L691S5A692 K:L692S0S1A695 A:L693S5A694 H:L694S0S1A697 V:L695S5A696 V:L696S0S1A699 C:L697S5A698 W:L698S0S1A701 G:L699S5A700 K:L700S0S1A703 W:L701S5A702 A:L702S0S1A705 L:L703S5A704 Q:L704L705S0S1S5 E:L706S0S1A708 Y:L706S5A707 K:L707S0S1A710 V:L708S5A709 I:L709S0S1A712 M:L710S5A711 E:L711S0S1A714 G:L712S5A713 Q:L713S0S1A716 W:L714S5A715 D:L715S0S1A718 N:L716S5A717 M:L717S0S1A720 L:L718S5A719 M:L719S0S1A722 R:L720S5A721 E:L721S0S1A724 K:L722S5A723 E:L723S0S1A726 V:L724S5A725 V:L725S0S1A728 Q:L726S5A727 I:L727S0S1A730 Y:L728S5A729 K:L729S0S1A732 F:L730S5A731 T:L731S0S1A734 L:L732S5A733 R:L733S0A736 L:L733S1A737 M:L734S5A735 P:L735L736S0S1A739 A:L737S5A738 L:L738S0S1A741 K:L739S5A740 Q:L740S0S1A743 R:L741S5A742 S:L742S0S1A745 P:L743S5A744 K:L744S0S1A747 A:L745S5A746 V:L746L747S0S1S5 L:L748S0S1A750 Y:L748S5A749 L:L749S0S1A752 I:L750S5A751 Y:L751S0S1A754 D:L752S5A753 A:L753S0S1A756 E:L754S5A755 I:L755S0S1A758 E:L756S5A757 V:L757S0S1A760 A:L758S5A759 L:L759L760S0S1S5 L:L761S0S1A763 A:L761S5A762 D:L762S0S1A765 R:L763S5A764 E:L764S0S1A767 A:L765S5A766 N:L766S0S1A769 Q:L767S5A768 G:L768S0S1A771 E:L769S5A770 D:L770L771S0S1S5 L:L772S0S1A774 F:L772S5A773 P:L773S0S1A776 F:L774S5A775 A:L775S0S1A778 Q:L776S5A777 N:L777S0S1A780 K:L778S5A779 T:L779S0S1A782 F:L780S5A781 G:L781L782S0S1S5 D:L783S0S1A785 K:L783S5A784 V:L784L785S0S1S5 Y:L786S0S1A788 R:L786S5A787 A:L787S0S1A790 S:L788S5A789 V:L789S0S1A792 S:L790S5A791 Y:L791S0S1A794 V:L792S5A793 R:L793S0S1A796 C:L794S5A795 E:L795S0S1A798 S:L796S5A797 L:L797L798S0S1S5 C:L799S0S1 E:L800S0S1 Y:L801S0S1 I:L802S0S1 D:L803S0S1 L:L804S0S1 E:L805S0S1 P:L806S0S1 L:L807S0S1 T:L808S0S1 Q:L809S0S1 R:L810S0S1 R:L811S0S1 I:L812S0S1 S:L813S0S1 D:L814S0S1 L:L815S0S1 I:L816S0S1 N:L817S0S1 E:L818S0S1 L:L819S0S1 D:L820S0S1 M:L821S0S1 L:L822S0S1 G:L823S0S1 I:L824S0S1 
I:L825S0S1 N:L826S0S1 A:L827S0S1 K:L828S0S1 V:L829S0S1 V:L830S0S1 S:L831S0S1 K:L832S0S1 G:L833S0S1 R:L834S0S1 Y:L835S0S1 G:L836S0S1 R:L837S0S1 T:L838S0S1 K:L839S0S1 E:L840S0S1 I:L841S0S1 R:L842S0S1 L:L843S0S1 N:L844S0A846 M:L844S1A845 V:L845L846S0S1 T:L847S0S1 S:L848S0S1 Y:L849S0S1 K:L850S0S1 I:L851S0S1 R:L852S0S1 N:L853S0S1 V:L854S0S1 L:L855S0S1 R:L856S0S1 Y:L857S0S1 D:L858S0S1 Y:L859S0S1 S:L860S0S1 I:L861S0S1 Q:L862S0S1 P:L863S0S1 L:L864S0S1 L:L865S0S1 T:L866S0S1 I:L867S0S1 S:L868S0S1 L:L869S0S1 K:L870S0S1 S:L871S0S1 E:L872S0S1 Q:L873S0S1 R:L874S0S1 R:L875S0S1 L:L876S0S1 I:L877S0S1 test.swiss100644000766000024 543514006427353 16454 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID GCDH_CAEEL STANDARD; PRT; 409 AA. AC Q20772; DT 01-NOV-1997 (Rel. 35, Created) DT 01-NOV-1997 (Rel. 35, Last sequence update) DT 16-OCT-2001 (Rel. 40, Last annotation update) DE PROBABLE GLUTARYL-COA DEHYDROGENASE, MITOCHONDRIAL PRECURSOR DE (EC 1.3.99.7) (GCD). GN F54D5.7. OS Caenorhabditis elegans. OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; OC Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6239; RN [1] RP SEQUENCE FROM N.A. RC STRAIN=BRISTOL N2; RA Coles L.; RL Submitted (OCT-1995) to the EMBL/GenBank/DDBJ databases. CC -!- CATALYTIC ACTIVITY: GLUTARYL-COA + ACCEPTOR = CROTONOYL-COA + CC CO(2) + REDUCED ACCEPTOR. CC -!- COFACTOR: FAD (BY SIMILARITY). CC -!- PATHWAY: DEGRADATIVE PATHWAY OF L-LYSINE, L-HYDROXYLYSINE, CC AND L-TRYPTOPHAN METABOLISM. CC -!- SUBCELLULAR LOCATION: MITOCHONDRIAL MATRIX (POTENTIAL). CC -!- SIMILARITY: BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. 
There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; Z66513; CAA91333.1; -. DR HSSP; Q06319; 1BUC. DR WormPep; F54D5.7; CE03411. DR InterPro; IPR001552; Acyl-CoA_dh. DR Pfam; PF00441; Acyl-CoA_dh; 1. DR Pfam; PF02770; Acyl-CoA_dh_M; 1. DR Pfam; PF02771; Acyl-CoA_dh_N; 1. DR PROSITE; PS00072; ACYL_COA_DH_1; FALSE_NEG. DR PROSITE; PS00073; ACYL_COA_DH_2; 1. KW Hypothetical protein; Oxidoreductase; Flavoprotein; FAD; KW Mitochondrion; Transit peptide. FT TRANSIT 1 ? MITOCHONDRION (POTENTIAL). FT CHAIN ? 409 PROBABLE GLUTARYL-COA DEHYDROGENASE. FT ACT_SITE 388 388 BASE (POTENTIAL). SQ SEQUENCE 409 AA; 44964 MW; 4D06241FB6768069 CRC64; MLTRGFTSIG KIASRGLSST FYQDAFQLSD QLTEDERSLM LSAREYCQER LLPRVTEAYR TEKFDPSLIP EMGSMGLLGA PYQGYGCAGT STVGYGLIAR EVERVDSGYR STMSVQTSLV IGPIYNYGSE DQKQKYIPDL ASGKKIGCFG LTEPNHGSNP GGMETKATWD ETTKTYKLNG SKTWISNSPV SDVMVVWARS ARHNNKIKGF ILERGMKGLT TPKIEGKLSL RASITGQIAM DDVPVPEENL LPNAEGLQGP FGCLNNARLG IAWGALGAAE ECFHLARQYT LDRQQFGRPL AQNQLMQLKM ADMLTEISLG LQGCLRVSRL KDEGKVQSEQ ISIIKRNSCG KALEVARKAR DMLGGNGIVD EYHIMRHMVN LETVNTYEGT HDVHALILGR AITGLNGFC // phipsi.out100644000766000024 55115114006427353 16471 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBLASTP 2.0.14 [Jun-29-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
Query= CYS1_DICDI (351 letters) Database: /home/peter/blast/data/swissprot 88,780 sequences; 31,984,247 total letters Searching...................................................................................................................................................... 3 occurrence(s) of pattern in query CYS1_DICDI; PATTERN. pattern P-E-E-Q at position 23 of query sequence effective database length=3.2e+07 pattern probability=8.9e-06 lengthXprobability=2.8e+02 Number of occurrences of pattern in the database is 349 CYS1_DICDI; PATTERN. pattern P-E-E-Q at position 120 of query sequence effective database length=3.2e+07 pattern probability=8.9e-06 lengthXprobability=2.8e+02 Number of occurrences of pattern in the database is 349 CYS1_DICDI; PATTERN. pattern P-E-E-Q at position 237 of query sequence effective database length=3.2e+07 pattern probability=8.9e-06 lengthXprobability=2.8e+02 Number of occurrences of pattern in the database is 349 done Results from round 1 Score E (bits) Value Significant matches for pattern occurrence 1 at position 23 sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 688 0.0 sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE 8 4.8 sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST... 7 6.0 sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4 7 7.6 sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7... 7 9.6 Significant matches for pattern occurrence 2 at position 120 sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT 13 0.13 sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT... 11 0.43 sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN) 11 0.55 sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNI... 10 1.1 sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 I... 8 3.0 sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURS... 
7 6.0 sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1 7 7.6 sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN] 7 7.6 Significant matches for pattern occurrence 3 at position 237 sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, ... 9 1.4 sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, ... 9 1.4 sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 8 4.8 sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROT... 7 6.0 sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 7 9.6 Significant alignments for pattern occurrence 1 at position 23 >sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR Length = 343 Score = 688 bits (1789), Expect = 0.0 Identities = 343/351 (97%), Positives = 343/351 (97%), Gaps = 8/351 (2%) Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 pattern 23 **** MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPP 120 pattern 120 * ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP- 119 Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 pattern 121 *** TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE Sbjct: 120 ---TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 pattern 237 **** CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG---- 232 Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 300 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG Sbjct: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 292 Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 
YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII Sbjct: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 >sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE Length = 4969 Score = 7.8 bits (25), Expect = 4.8 Identities = 14/39 (35%), Positives = 19/39 (47%) Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 pattern 23 **** PEEQ +F E + K +K EE E + G+ EE Sbjct: 4414 PEEQEKFQEQKTKEEEKEEKEETKSEPEKAEGEDGEKEE 4452 >sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST CLASS-ALPHA) Length = 221 Score = 7.4 bits (24), Expect = 6.0 Identities = 19/67 (28%), Positives = 35/67 (51%), Gaps = 12/67 (17%) Query: 21 IPPEEQ-SQFLEFQDKFNKKY---------SH-EEYLERFEIFKSNLGKIEEL-NLIAIN 68 pattern 23 **** +PPEEQ ++ + +DK +Y SH ++YL ++ K+++ +E L N+ +N Sbjct: 112 LPPEEQEAKLAQIKDKAKNRYFPAFEKVLKSHGQDYLVGNKLSKADILLVELLYNVEELN 171 Query: 69 HKADTKF 75 A F Sbjct: 172 PGATASF 178 >sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4 Length = 356 Score = 7.1 bits (23), Expect = 7.6 Identities = 14/67 (20%), Positives = 32/67 (46%), Gaps = 5/67 (7%) Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGK---IEELNLIAINHKADTKFGVNK 79 pattern 23 **** PEEQ++ ++D+ N + ++Y + + L K +LN + ++A ++ + Sbjct: 75 PEEQAK--TYKDEGNDYFKEKDYKKAVISYTEGLKKKCADPDLNAVLYTNRAAAQYYLGN 132 Query: 80 FADLSSD 86 F +D Sbjct: 133 FRSALND 139 >sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7 INTERGENIC REGION Length = 725 Score = 6.8 bits (22), Expect = 9.6 Identities = 21/99 (21%), Positives = 43/99 (43%), Gaps = 21/99 (21%) Query: 21 IPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN 78 pattern 23 **** + PEEQ L+F ++ H ER + +++G +N + + G+ Sbjct: 213 LTPEEQKDKDLLQFAEQI-----HSMRTER--LSGAHIGNSPAIN------RLRGELGLQ 259 Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 DL +E ++ + +DD+ ++ DEF++S Sbjct: 260 AMEDLPEEEITDH------KVLSDDIDLSQATIDEFVHS 292 Significant alignments for pattern occurrence 2 at position 120 >sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT Length = 
555 Score = 13.0 bits (40), Expect = 0.13 Identities = 16/28 (57%), Positives = 18/28 (64%), Gaps = 3/28 (10%) Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123 pattern 120 **** IFT D +AD LDD F IN + PEEQ Sbjct: 170 IFTGDDELADELDDRFVIDINKLFPEEQ 197 >sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT (MCR I ALPHA) Length = 553 Score = 11.2 bits (35), Expect = 0.43 Identities = 14/28 (50%), Positives = 18/28 (64%), Gaps = 3/28 (10%) Query: 99 IFTDDLPVADYLDDEFINSIP---PEEQ 123 pattern 120 **** I T DL +AD +DD+F+ I PEEQ Sbjct: 168 IITGDLELADEIDDKFLIDIEKLFPEEQ 195 >sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN) Length = 101 Score = 10.9 bits (34), Expect = 0.55 Identities = 12/23 (52%), Positives = 16/23 (69%), Gaps = 1/23 (4%) Query: 114 FINSIPPEEQTAF-DWRTRGAVT 135 pattern 120 **** F S+ PEEQ AF +W+TR +T Sbjct: 78 FGKSLTPEEQRAFEEWKTRYGIT 100 >sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNIT (MCR II ALPHA) Length = 553 Score = 9.8 bits (31), Expect = 1.1 Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 3/28 (10%) Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123 pattern 120 **** IFT D +AD +D F IN + PEEQ Sbjct: 168 IFTGDDELADEIDKRFLIDINKLFPEEQ 195 >sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 INTERGENIC REGION Length = 462 Score = 8.5 bits (27), Expect = 3.0 Identities = 13/39 (33%), Positives = 21/39 (53%), Gaps = 9/39 (23%) Query: 112 DEFINSIP-------PEEQT--AFDWRTRGAVTPVKNQG 141 pattern 120 **** DEF+N+ P PEEQ+ A++W + + + N G Sbjct: 308 DEFLNTSPSPEVFTLPEEQSGMAWEWHDKDWMLDLTNDG 346 >sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURSOR (MAGP) (MAGP-1) Length = 183 Score = 7.4 bits (24), Expect = 6.0 Identities = 11/37 (29%), Positives = 18/37 (47%) Query: 100 FTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTP 136 pattern 120 **** + D + ADY D + ++ PEEQ + + V P Sbjct: 37 YGDQIDNADYYDYQEVSPRTPEEQFQSQQQVQQEVIP 73 >sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1 Length = 199 Score = 7.1 bits (23), Expect = 7.6 Identities = 11/27 
(40%), Positives = 15/27 (54%), Gaps = 1/27 (3%) Query: 105 PVADYLDDE-FINSIPPEEQTAFDWRT 130 pattern 120 **** PV+ Y DE + + PEEQ D+ T Sbjct: 171 PVSSYSSDEGSYDPLSPEEQELLDFTT 197 >sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN] Length = 812 Score = 7.1 bits (23), Expect = 7.6 Identities = 8/13 (61%), Positives = 11/13 (84%) Query: 112 DEFINSIPPEEQT 124 pattern 120 **** D+ +S+PPEEQT Sbjct: 359 DQSDSSVPPEEQT 371 Significant alignments for pattern occurrence 3 at position 237 >sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, MITOCHONDRIAL PRECURSOR (GLYCINE DECARBOXYLASE B) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN B) Length = 1034 Score = 9.5 bits (30), Expect = 1.4 Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + + Sbjct: 80 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 134 MQD-------LASKNKIFK 145 >sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, MITOCHONDRIAL PRECURSOR (GLYCINE DECARBOXYLASE A) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN A) Length = 1037 Score = 9.5 bits (30), Expect = 1.4 Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + + Sbjct: 83 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 136 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 137 MQD-------LASKNKIFK 148 >sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR (GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN) Length = 1034 Score = 7.8 bits (25), Expect = 4.8 Identities = 21/79 (26%), Positives = 38/79 (47%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ 
K++ F +++ I +T P AI D++++ + G+ + + Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKAIRLDSMKYSKFDEGLTESQMIAH 133 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 134 MQD-------LASKNKIFK 145 >sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROTEIN 6 Length = 1081 Score = 7.4 bits (24), Expect = 6.0 Identities = 25/93 (26%), Positives = 37/93 (38%), Gaps = 17/93 (18%) Query: 159 HFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI-IKNGGIQTESS 217 +F S+N+ +S L E M + E C L P ++I N I +S+ Sbjct: 642 NFTSKNEQEKISNDKL-----EVMVIKTVSTLCETCREELTPYLMHFISFLNTVIMPDSN 696 Query: 218 YPYTAETG--------TQCNFNSANIGPEEQAK 242 pattern 237 **** + T QC ++ GPEEQAK Sbjct: 697 VSHFTRTKLVRSIGYVVQCQVSN---GPEEQAK 726 >sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR (GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM P-PROTEIN) Length = 1034 Score = 6.8 bits (22), Expect = 9.6 Identities = 20/79 (25%), Positives = 38/79 (47%), Gaps = 13/79 (16%) Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** NSA PEEQ K++ F +++ I +T P +I D++++ + G+ + + Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133 Query: 291 SLDHGILIVGYSAKNTIFR 309 D ++KN IF+ Sbjct: 134 MQD-------LASKNKIFK 145 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: Sequences not found previously or not previously below threshold: sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 709 0.0 sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR 273 4e-73 sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RES... 270 2e-72 sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR 266 6e-71 sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR 252 6e-67 sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK C... 
250 2e-66 sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR 238 1e-62 sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR 236 4e-62 sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) 233 3e-61 sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE... 233 3e-61 sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR 231 1e-60 sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR 221 1e-57 sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEIN... 221 2e-57 sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) 216 5e-56 sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR 215 1e-55 sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) 214 2e-55 sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR 214 2e-55 sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN... 212 7e-55 sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 212 1e-54 sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 209 8e-54 sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) 209 8e-54 sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR 208 1e-53 sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR 207 2e-53 sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE... 207 3e-53 sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR 206 4e-53 sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 206 4e-53 sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR 206 5e-53 sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) 204 3e-52 sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 
203 6e-52 sp|Q10991|CATL_SHEEP CATHEPSIN L 201 1e-51 sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR 201 2e-51 sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR 200 3e-51 sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) 199 7e-51 sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH) 196 5e-50 sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR 196 5e-50 sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR 194 2e-49 sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR 193 4e-49 sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR 193 5e-49 sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II... 192 1e-48 sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPS... 192 1e-48 sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR 190 5e-48 sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR 188 2e-47 sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA... 187 2e-47 sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR 187 2e-47 sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) 187 4e-47 sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR 186 5e-47 sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR 185 9e-47 sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEP... 185 1e-46 sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPA... 184 3e-46 sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR 183 3e-46 sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR 183 5e-46 sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) 183 6e-46 sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR 182 8e-46 sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHE... 
180 5e-45 sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR 178 2e-44 sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) 177 3e-44 sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) 176 6e-44 sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR 173 4e-43 sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) 173 7e-43 sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR 171 3e-42 sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L 167 2e-41 sp|P25326|CATS_BOVIN CATHEPSIN S 165 1e-40 sp|P80884|ANAN_ANACO ANANAIN 161 2e-39 sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR 158 1e-38 sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE... 158 2e-38 sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR 152 1e-36 sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR 150 4e-36 sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR 150 6e-36 sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR 150 6e-36 sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE P... 149 9e-36 sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR 149 9e-36 sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR 145 1e-34 sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR 145 1e-34 sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR 143 5e-34 sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR ... 141 3e-33 sp|P14518|BROM_ANACO BROMELAIN, STEM 139 6e-33 sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR... 138 1e-32 sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR 129 1e-29 sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR... 121 3e-27 sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPP... 111 3e-24 sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 109 9e-24 sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN... 108 2e-23 sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR 108 3e-23 sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 
107 3e-23 sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) 100 7e-21 sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) 95 2e-19 sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PREC... 91 4e-18 sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PREC... 90 5e-18 sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) 90 5e-18 sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR 89 2e-17 sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) 87 4e-17 sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR 87 5e-17 sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP S... 86 9e-17 sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 85 2e-16 sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) 85 2e-16 sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PREC... 85 2e-16 sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 85 3e-16 sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1) 85 3e-16 sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 80 9e-15 sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PREC... 78 2e-14 sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 78 4e-14 sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PREC... 73 7e-13 sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P1... 70 6e-12 sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) 61 4e-09 sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) 60 9e-09 sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 59 1e-08 sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) 58 3e-08 sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) 56 1e-07 sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L 52 2e-06 sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR 42 0.002 sp|P05689|CATX_BOVIN CATHEPSIN 40 0.006 sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR 39 0.019 sp|P23897|HSER_RAT HEAT-STABLE ENTEROTOXIN RECEPTOR PRECURSOR (G... 
36 0.16 sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTEC... 35 0.22 sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 I... 32 1.9 sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5) 32 1.9 sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-D... 31 3.2 sp|Q02521|SPP2_YEAST SPLICEOSOME MATURATION PROTEIN SPP2 31 4.2 sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN 31 4.2 sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDRO... 30 5.5 sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 30 5.5 sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8 30 7.2 sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C 30 7.2 sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 30 9.4 sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (... 30 9.4 sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 30 9.4 >sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR Length = 343 Score = 709 bits (1811), Expect = 0.0 Identities = 343/351 (97%), Positives = 343/351 (97%), Gaps = 8/351 (2%) Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPP 120 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP- 119 Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE Sbjct: 120 ---TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 pattern 237 **** CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG---- 232 Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 300 
AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG Sbjct: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 292 Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII Sbjct: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 >sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR Length = 313 Score = 273 bits (691), Expect = 4e-73 Identities = 149/324 (45%), Positives = 194/324 (58%), Gaps = 26/324 (8%) Query: 32 FQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLSSDE 87 F+ KF K Y S EE+ RF +FK+NL L A+ H+ + GV +F+DL+ E Sbjct: 3 FKKKFGKVYGSIEEHYYRFSVFKANL-------LRAMRHQKMDPSARHGVTQFSDLTRSE 55 Query: 88 FKNYYLNNKEAI-FTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 F+ +L K D A L + + PEE FDWR RGAVTPVKNQG CGSC Sbjct: 56 FRRKHLGVKGGFKLPKDANQAPILPTQNL----PEE---FDWRDRGAVTPVKNQGSCGSC 108 Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 WSFSTTG +EG HF++ KLVSLSEQ LVDCDHEC + E E +CD GCNGGL +A+ Y Sbjct: 109 WSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHEC-DPEEEGSCDSGCNGGLMNSAFEYT 167 Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 pattern 237 **** +K GG+ E YPYT G C + + I A +SNF+++ NE +A ++ GPL Sbjct: 168 LKTGGLMREKDYPYTGTDGGSCKLDRSKI----VASVSNFSVVSINEDQIAANLIKNGPL 223 Query: 267 AIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAK--NTIFRKNMPYWIVKNSWGAD 324 A+A +A Q YIGGV L+HG+L+VGY + + K PYWI+KNSWG Sbjct: 224 AVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGES 283 Query: 325 WGEQGYIYLRRGKNTCGVSNFVST 348 WGE G+ + +G+N CGV + VST Sbjct: 284 WGENGFYKICKGRNICGVDSLVST 307 >sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RESPONSIVE PROTEIN 15A) Length = 363 Score = 270 bits (684), Expect = 2e-72 Identities = 144/327 (44%), Positives = 201/327 (61%), Gaps = 20/327 (6%) Query: 26 QSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 + F F+ KF+K Y+ EE+ RF +FKSNL K + + N + G+ KF+DL+ Sbjct: 45 
EHHFTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAK----LHQNRDPTAEHGITKFSDLT 100 Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCG 144 + EF+ +L K+ + LP + PE+ FDWR +GAVTPVK+QG CG Sbjct: 101 ASEFRRQFLGLKKRL---RLPAHAQKAPILPTTNLPED---FDWREKGAVTPVKDQGSCG 154 Query: 145 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204 SCW+FSTTG +EG H+++ KLVSLSEQ LVDCDH C + E +CD GCNGGL NA+ Sbjct: 155 SCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVC-DPEQAGSCDSGCNGGLMNNAFE 213 Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264 pattern 237 **** Y++++GG+ E Y YT G+ C F+ + + A +SNF+++ +E +A +V G Sbjct: 214 YLLESGGVVQEKDYAYTGRDGS-CKFDKSKV----VASVSNFSVVTLDEDQIAANLVKNG 268 Query: 265 PLAIAADAVEWQFYIGGV-FDIPCNPNSLDHGILIVGY--SAKNTIFRKNMPYWIVKNSW 321 PLA+A +A Q Y+ GV C + LDHG+L+VG+ A I K PYWI+KNSW Sbjct: 269 PLAVAINAAWMQTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSW 328 Query: 322 GADWGEQGYIYLRRGKNTCGVSNFVST 348 G +WGEQGY + RG+N CGV + VST Sbjct: 329 GQNWGEQGYYKICRGRNVCGVDSMVST 355 >sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR Length = 368 Score = 266 bits (672), Expect = 6e-71 Identities = 156/367 (42%), Positives = 206/367 (55%), Gaps = 42/367 (11%) Query: 6 LFVLAVFTVFVSSR---------------GIPPE---EQSQFLEFQDKFNKKY-SHEEYL 46 +FVL+ F V VSS G P+ + F F+ KF K Y S+EE+ Sbjct: 10 VFVLSFFIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHD 69 Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTK--FGVNKFADLSSDEFKNYYLNNKEAI-FTDD 103 RF +FK+NL + + K D GV +F+DL+ EF+ +L + D Sbjct: 70 YRFSVFKANLRRARR------HQKLDPSATHGVTQFSDLTRSEFRKKHLGVRSGFKLPKD 123 Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 A L E + PE+ FDWR GAVTPVKNQG CGSCWSFS TG +EG +F++ Sbjct: 124 ANKAPILPTENL----PED---FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176 Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 KLVSLSEQ LVDCDHEC + E ++CD GCNGGL +A+ Y +K GG+ E YPYT + Sbjct: 177 GKLVSLSEQQLVDCDHEC-DPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGK 235 Query: 224 
TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVF 283 pattern 237 **** G C + + I A +SNF++I +E +A +V GPLA+A +A Q YIGGV Sbjct: 236 DGKTCKLDKSKI----VASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVS 291 Query: 284 DIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 L+HG+L+VGY A K PYWI+KNSWG WGE G+ + +G+N CG Sbjct: 292 CPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICG 351 Query: 342 VSNFVST 348 V + VST Sbjct: 352 VDSMVST 358 >sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR Length = 371 Score = 252 bits (638), Expect = 6e-67 Identities = 138/332 (41%), Positives = 190/332 (56%), Gaps = 23/332 (6%) Query: 26 QSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 +S FL F +F K Y +E+ R +FK NL + L+ + GV KF+DL+ Sbjct: 45 ESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLL----DPSAEHGVTKFSDLT 100 Query: 85 SDEFKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQG 141 EF+ YL ++ A+ + A + +P + FDWR GAV PVKNQG Sbjct: 101 PAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDD----FDWRDHGAVGPVKNQG 156 Query: 142 QCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPN 201 CGSCWSFS +G +EG H+++ KL LSEQ VDCDHEC E ++CD GCNGGL Sbjct: 157 SCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSE-PDSCDSGCNGGLMTT 215 Query: 202 AYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIV 261 pattern 237 **** A++Y+ K GG+++E YPYT G +C F+ + I A + NF+++ +E ++ ++ Sbjct: 216 AFSYLQKAGGLESEKDYPYTGSDG-KCKFDKSKI----VASVQNFSVVSVDEAQISANLI 270 Query: 262 STGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKN 319 GPLAI +A Q YIGGV LDHG+L+VGY A I K+ PYWI+KN Sbjct: 271 KHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKN 330 Query: 320 SWGADWGEQGYIYLRRG---KNTCGVSNFVST 348 SWG +WGE GY + RG +N CGV + VST Sbjct: 331 SWGENWGENGYYKICRGSNVRNKCGVDSMVST 362 >sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK CATHEPSIN) Length = 376 Score = 250 bits (633), Expect = 2e-66 Identities = 147/391 (37%), Positives = 213/391 (53%), Gaps = 63/391 (16%) Query: 1 
MKVILLFVLAVFTVFVSSRGIP-------PEEQSQFLEFQDKFNKKYSHEEYLERFEIFK 53 M++++ +L +F F + P + ++ F E+ KFN++YS E+ R+ IFK Sbjct: 1 MRLLVFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSSSEFSNRYSIFK 60 Query: 54 SNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK-EAIFTDDLPVADYLDD 112 SN+ ++ N + T G+N FAD++++E++ YL + A + + L+ Sbjct: 61 SNMDYVDNWNS---KGDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNGYDGREVLNV 117 Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172 E + + P + DWRT+ AVTP+K+QGQCGSCWSFSTTG+ EG H + KLVSLSEQ Sbjct: 118 EDLQTNPK----SIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQ 173 Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNS 232 NLVDC G E + GC+GGL NA++YIIKN GI TESSYPYTAETG+ C FN Sbjct: 174 NLVDC-------SGPEE-NFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNK 225 Query: 233 ANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNP 289 pattern 237 **** ++IG A I + I + GP+++A DA +Q Y G++ P C+P Sbjct: 226 SDIG----ATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSP 281 Query: 290 NSLDHGILIVGY--------------------------------SAKNTIFRKNMPYWIV 317 LDHG+L+VGY + +++ K YWIV Sbjct: 282 TELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKANNYWIV 341 Query: 318 KNSWGADWGEQGYIYLRRG-KNTCGVSNFVS 347 KNSWG WG +GYI + + KN CG+++ S Sbjct: 342 KNSWGTSWGIKGYILMSKDRKNNCGIASVSS 372 >sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR Length = 344 Score = 238 bits (601), Expect = 1e-62 Identities = 139/370 (37%), Positives = 201/370 (53%), Gaps = 45/370 (12%) Query: 1 MKVI-LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKI 59 MKV+ L VL V + + ++ F ++ K Y+ EE+ R+ IF +N+ + Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFTANMDYV 60 Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 ++ N + ++T G+N FAD++++E++N YL K F + + NS Sbjct: 61 QQWN----SKGSETVLGLNNFADITNEEYRNTYLGTK---FDASSLIGTQEEKVHTNSSA 113 Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 + DWR+ GAVTPVKNQGQCG CWSFSTTG+ EG HF S+ +LVSLSEQNL+DC Sbjct: 114 
ASK----DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCST 169 Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 pattern 237 *** E + GC+GGL A+ YII N GI TESSYPY AE G +C + S N G Sbjct: 170 E----------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG-KCEYKSENSG--- 215 Query: 240 QAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGI 296 pattern 240 * A +S++ + V+ P+++A DA +Q Y G++ P C+ +LDHG+ Sbjct: 216 -ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGV 274 Query: 297 LIVGY--------------SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCG 341 L VGY S+ N + YWIVKNSWG WG +GYI + R + N CG Sbjct: 275 LAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCG 334 Query: 342 VSNFVSTSII 351 +++ S ++ Sbjct: 335 IASSASFPVV 344 >sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR Length = 450 Score = 236 bits (597), Expect = 4e-62 Identities = 137/354 (38%), Positives = 193/354 (53%), Gaps = 34/354 (9%) Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEE 61 V+L + +V + S + + +F F+ K+ K Y +E RF F+ N+ E+ Sbjct: 15 VLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENM---EQ 71 Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121 + A + T FGV F+D++ +EF+ Y N A + +N Sbjct: 72 AKIQAAANPYAT-FGVTPFSDMTREEFRARYRNGASYF-----AAAQKRLRKTVNVTTGR 125 Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181 A DWR +GAVTPVK QGQCGSCW+FST GN+EGQ ++ N LVSLSEQ LV CD Sbjct: 126 APAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCD--- 182 Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETG--TQCNFNSANIGP 237 pattern 237 * D GCNGGL NA+N+I+ + G + TE+SYPY + G QC N IG Sbjct: 183 -------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIG- 234 Query: 238 EEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGIL 297 pattern 238 *** A I++ +P++E +A Y+ GPLAIA DA + Y GG+ C LDHG+L Sbjct: 235 ---AAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGIL-TSCTSKQLDHGVL 290 Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 
+VGY+ + N PYWI+KNSW WGE GYI + +G N C ++ VS++++ Sbjct: 291 LVGYNDNS-----NPPYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339 >sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) Length = 319 Score = 233 bits (589), Expect = 3e-61 Identities = 128/334 (38%), Positives = 190/334 (56%), Gaps = 30/334 (8%) Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 +P ++++F+ K+ K+Y E RF IFKSN+ K + L + + +GV + Sbjct: 12 LPGNVDEKYVQFKLKYRKQYHETEDEIRFNIFKSNILKAQ---LYQVFVRGSAIYGVTPY 68 Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140 +DL++DEF +L + + L E +N+IP FDWR +GAVT VKNQ Sbjct: 69 SDLTTDEFARTHLTASWVVPSSRSNTPTSLGKE-VNNIPKN----FDWREKGAVTEVKNQ 123 Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQP 200 G CGSCW+FSTTGNVE Q F KL+SLSEQ LVDCD D+GCNGGL Sbjct: 124 GMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCD----------GLDDGCNGGLPS 173 Query: 201 NAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYI 260 pattern 237 **** NAY IIK GG+ E +YPY A+ +C+ + + I++ + ++ET +A ++ Sbjct: 174 NAYESIIKMGGLMLEDNYPYDAK-NEKCHLKTDGVA----VYINSSVNLTQDETELAAWL 228 Query: 261 VSTGPLAIAADAVEWQFYIGGV---FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIV 317 +++ +A+ QFY G+ + I C+ LDH +L+VGY + KN P+WIV Sbjct: 229 YHNSTISVGMNALLLQFYQHGISHPWWIFCSKYLLDHAVLLVGYG----VSEKNEPFWIV 284 Query: 318 KNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 KNSWG +WGE GY + RG +CG++ ++++I Sbjct: 285 KNSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318 >sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE A-1) Length = 354 Score = 233 bits (589), Expect = 3e-61 Identities = 144/355 (40%), Positives = 192/355 (53%), Gaps = 40/355 (11%) Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 LLF + V +FV G PP + + + F+ + K + + E RF F Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112 K N+ LN + D KFADL+ EF YLN + D+ +D Sbjct: 67 
KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYA----RHLKDHKED 119 Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172 ++ P + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179 Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQCNF 230 LV CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+ Sbjct: 180 MLVSCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPC 229 Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** + E AKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + C Sbjct: 230 HDEG---EVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL-CLAW 285 Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345 SL+HG+LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+ Sbjct: 286 SLNHGVLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335 >sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR Length = 354 Score = 231 bits (584), Expect = 1e-60 Identities = 143/355 (40%), Positives = 192/355 (53%), Gaps = 40/355 (11%) Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 LLF + V +FV G PP + + + F+ + K + + E RF F Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112 K N+ LN + D KFADL+ EF YLN + ++ +D Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYA----RHLKNHKED 119 Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172 ++ P + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179 Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQCNF 230 LV CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+ Sbjct: 180 MLVSCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPC 229 Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 pattern 237 **** + E AKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + C Sbjct: 230 
HDEG---EVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL-CLAW 285 Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345 SL+HG+LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+ Sbjct: 286 SLNHGVLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335 >sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR Length = 322 Score = 221 bits (558), Expect = 1e-57 Identities = 132/349 (37%), Positives = 184/349 (51%), Gaps = 41/349 (11%) Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKI 59 MKV+ LF+ + + + EF+ KF +KY EE R +F NL I Sbjct: 1 MKVVALFLFGLALAAANP---------SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYI 51 Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 EE N + +N+F+D+++++F K+ P A F ++ Sbjct: 52 EEFNKKYERGEVTYNLAINQFSDMTNEKFNAVMKGYKKG----PRPAA-----VFTSTDA 102 Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 E T DWRT+GAVTPVK+QGQCGSCW+FSTTG +EGQHF+ +LVSLSEQ LVDC Sbjct: 103 APESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-- 160 Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 pattern 237 *** G ++GCNGG A Y+ NGG+ TESSYPY A T C FNS IG Sbjct: 161 -----AGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNT-CRFNSNTIG--- 211 Query: 240 QAKISNFTMIPK-NETVMAGYIVSTGPLAIAADAVEWQF---YIGGVFDIPCNPNSLDHG 295 pattern 240 * A + + I + +E+ + GP+++A DA F Y G ++ C+ + LDH Sbjct: 212 -ATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQLDHA 270 Query: 296 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVS 343 +L VGY ++ +W+VKNSW WGE GYI + R + N CG++ Sbjct: 271 VLAVGYGSEG-----GQDFWLVKNSWATSWGESGYIKMARNRNNNCGIA 314 >sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEINASE) (CRUZAINE) Length = 467 Score = 221 bits (557), Expect = 2e-57 Identities = 134/358 (37%), Positives = 189/358 (52%), Gaps = 38/358 (10%) Query: 3 VILLFVLAVFTVFV--SSRGIPPEEQ--SQFLEFQDKFNKKY-SHEEYLERFEIFKSNLG 57 ++L VL V V ++ + EE SQF EF+ K + Y S E R +F+ NL Sbjct: 8 
LLLAAVLVVMACLVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF 67 Query: 58 KIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 + L+ A H FGV F+DL+ +EF++ Y N + E + + Sbjct: 68 -LARLHAAANPHAT---FGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVKVEVVGA 123 Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 177 A DWR RGAVT VK+QGQCGSCW+FS GNVE Q F++ + L +LSEQ LV C Sbjct: 124 -----PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSC 178 Query: 178 DHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQ--CNFNSA 233 D D GC+GGL NA+ +I++ NG + TE SYPY + G C + Sbjct: 179 D----------KTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGH 228 Query: 234 NIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLD 293 pattern 237 **** +G A I+ +P++E +A ++ GP+A+A DA W Y GGV C LD Sbjct: 229 TVG----ATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVM-TSCVSEQLD 283 Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 HG+L+VGY+ + PYWI+KNSW WGE+GYI + +G N C V S++++ Sbjct: 284 HGVLLVGYNDSAAV-----PYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 336 >sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) Length = 323 Score = 216 bits (545), Expect = 5e-56 Identities = 131/349 (37%), Positives = 181/349 (51%), Gaps = 32/349 (9%) Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 +LF L V+ V S+ P + + F EF +FNK YS E E L RF+IF+ NL +I Sbjct: 4 ILFYLFVYAVVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI---- 59 Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQ 123 I N K+ +NKF+DLS DE Y T + LD P + Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPTQTQNFCKVILLDQP-----PGKGP 113 Query: 124 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183 FDWR VT VKNQG CG+CW+F+T G++E Q I N+L++LSEQ ++DCD Sbjct: 114 LEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDF---- 169 Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243 pattern 237 **** D GCNGGL A+ IIK GG+Q ES YPY A+ C NS + + Sbjct: 170 
------VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVK--- 219 Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 + I E + + GP+ +A DA + Y G+ C + L+H +L+VGY Sbjct: 220 DCYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQGIIKY-CFDSGLNHAVLLVGYGV 278 Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 351 +N N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I Sbjct: 279 EN-----NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 >sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR Length = 323 Score = 215 bits (541), Expect = 1e-55 Identities = 132/357 (36%), Positives = 189/357 (51%), Gaps = 40/357 (11%) Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKI 59 MKV +LF+ V S + F+ K+ ++Y EE R IF+ N I Sbjct: 1 MKVAVLFLCGVALAAASP---------SWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYI 51 Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 EE N N + +NKF D++ +EF N I PV+ + + Sbjct: 52 EEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGN---IPRRSAPVSVFYPKKETGP-- 106 Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 + T DWRT+GAVTPVK+QGQCGSCW+FSTTG++EGQHF+ L+SL+EQ LVDC Sbjct: 107 --QATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDC-- 162 Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 pattern 237 *** +GCNGG +A++YI N GI TE++YPY A G+ C F+S ++ Sbjct: 163 ------SRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGS-CRFDSNSVA--- 212 Query: 240 QAKISNFTMIPK-NETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHG 295 pattern 240 * A S T I +ET + + GP+++ DA +QFY GV+ P C+P+ LDH Sbjct: 213 -ATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHA 271 Query: 296 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 351 +L VGY ++ +W+VKNSW WG+ GYI + R + N CG++ S ++ Sbjct: 272 VLAVGYGSEG-----GQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323 >sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) Length = 324 Score = 214 bits (540), Expect = 2e-55 Identities = 130/351 (37%), Positives = 188/351 (53%), Gaps = 33/351 (9%) Query: 1 
MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 M I+L++L V ++ + + + F +F KFNK YS E E L RF+IF+ NL +I Sbjct: 1 MNKIVLYLLVYGAVQCAAYDVL-KAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI 59 Query: 60 EELNLIAINHKADT-KFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSI 118 I NH T ++ +NKFADLS DE + Y + T + LD Sbjct: 60 -----INKNHNDSTAQYEINKFADLSKDETISKYTGLSLPLQTQNFCEVVVLDRP----- 109 Query: 119 PPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 178 P + FDWR VT VKNQG CG+CW+F+T G++E Q I N+ ++LSEQ L+DCD Sbjct: 110 PDKGPLEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQQLIDCD 169 Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE 238 pattern 237 ** D GC+GGL A+ ++ GGIQ ES YPY A G C N+A + Sbjct: 170 F----------VDAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANNG-DCRANAAKFVVK 218 Query: 239 EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILI 298 pattern 239 ** + T+ E + + S GP+ +A DA + Y G+ C + L+H +L+ Sbjct: 219 VKKCYRYITVF---EEKLKDLLRSVGPIPVAIDASDIVNYKRGIMKY-CANHGLNHAVLL 274 Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349 VGY+ +N +P+WI+KN+WGADWGEQGY +++ N CG+ N + +S Sbjct: 275 VGYAVEN-----GVPFWILKNTWGADWGEQGYFRVQQNINACGIQNELPSS 320 >sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR Length = 321 Score = 214 bits (539), Expect = 2e-55 Identities = 125/326 (38%), Positives = 184/326 (56%), Gaps = 47/326 (14%) Query: 32 FQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF-- 88 F+ ++ +KY +E L R +F+ N IE+ N N + K +N+F D++++EF Sbjct: 23 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 82 Query: 89 --KNYYLNNK---EAIFTDDL-PVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQ 142 K Y ++ +A+FT + P+A DWRT+ VTPVK+Q Q Sbjct: 83 VMKGYKKGSRGEPKAVFTAEAGPMA----------------ADVDWRTKALVTPVKDQEQ 126 Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202 CGSCW+FS TG +EGQHF+ ++LVSLSEQ LVDC + ++GC GG +A Sbjct: 127 CGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDC--------STDYGNDGCGGGWMTSA 178 Query: 203 
YNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVS 262 pattern 237 **** ++YI NGGI TESSYPY AE C F++ +IG A + + E + + Sbjct: 179 FDYIKDNGGIDTESSYPYEAE-DRSCRFDANSIG----AICTGSVEVQHTEEALQEAVSG 233 Query: 263 TGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKN 319 GP+++A DA +QFY GV ++ C+P LDHG+L VGY ++T YW+VKN Sbjct: 234 VGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTEST-----KDYWLVKN 288 Query: 320 SWGADWGEQGYIYLRRGK-NTCGVSN 344 SWG+ WG+ GYI + R + N CG+++ Sbjct: 289 SWGSSWGDAGYIKMSRNRDNNCGIAS 314 >sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) (CYCLIC PROTEIN-2) (CP-2) Length = 334 Score = 212 bits (535), Expect = 7e-55 Identities = 127/359 (35%), Positives = 195/359 (53%), Gaps = 39/359 (10%) Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 ++LL VL + T + + +Q+ +++ + Y E R +++ N+ I+ Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLH 62 Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116 N N K +N F D++++EF+ + + K +F + L + Sbjct: 63 NGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML---------- 112 Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 IP DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVD Sbjct: 113 QIPK----TVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168 Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 C H+ +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + + Sbjct: 169 CSHD----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRA---- 215 Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLD 293 pattern 237 **** A + F IP+ E + + + GP+++A DA QFY G++ P C+ LD Sbjct: 216 EYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLD 275 Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVSTSII 351 HG+L+VGY + T K+ YW+VKNSWG +WG GYI + + +N CG++ S I+ Sbjct: 276 HGVLVVGYGYEGTDSNKD-KYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333 >sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) 
(MEP) Length = 334 Score = 212 bits (533), Expect = 1e-54 Identities = 126/359 (35%), Positives = 198/359 (55%), Gaps = 39/359 (10%) Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 ++LL VL + T + + +++ +++ + Y E R I++ N+ I+ Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLH 62 Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116 N N + +N F D++++EF+ + + K +F + L + Sbjct: 63 NGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLML---------- 112 Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 IP + DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVD Sbjct: 113 KIPK----SVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168 Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 C H +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + + Sbjct: 169 CSHA----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRA---- 215 Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLD 293 pattern 237 **** A + F IP+ E + + + GP+++A DA QFY G++ P C+ +LD Sbjct: 216 EFAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD 275 Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 351 HG+L+VGY + T KN YW+VKNSWG++WG +GYI + + + N CG++ S ++ Sbjct: 276 HGVLLVGYGYEGTDSNKN-KYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333 >sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE) (SULFHYDRYL-ENDOPEPTIDASE) (SH-EP) Length = 362 Score = 209 bits (526), Expect = 8e-54 Identities = 127/313 (40%), Positives = 179/313 (56%), Gaps = 35/313 (11%) Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103 +RF +FK+N+ + N + +K +NKFAD+++ EF++ Y +K +F Sbjct: 58 KRFNVFKANVMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHHKMFRGS 113 Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 + E + S+P + DWR +GAVT VK+QGQCGSCW+FST VEG + I Sbjct: 114 QHGSGTFMYEKVGSVP----ASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKT 169 Query: 164 
NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 NKLVSLSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPYTA+ Sbjct: 170 NKLVSLSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQ 220 Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281 pattern 237 **** GT C+ + N + I +P N+ V+ P+++A DA ++QFY G Sbjct: 221 EGT-CDESKVN---DLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEG 276 Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----K 337 VF CN L+HG+ IVGY T+ N YWIV+NSWG +WGEQGYI ++R + Sbjct: 277 VFTGDCN-TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEQGYIRMQRNISKKE 331 Query: 338 NTCGVSNFVSTSI 350 CG++ S I Sbjct: 332 GLCGIAMMASYPI 344 >sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) Length = 323 Score = 209 bits (526), Expect = 8e-54 Identities = 129/349 (36%), Positives = 179/349 (50%), Gaps = 32/349 (9%) Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 +LF L V+ V S+ + + F EF +FNK Y E E L RF+IF+ NL +I Sbjct: 4 ILFYLFVYGVVNSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI---- 59 Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQ 123 I N K+ +NKF+DLS DE Y I T + LD P + Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNFCKVIVLDQP-----PGKGP 113 Query: 124 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183 FDWR VT VKNQG CG+CW+F+T ++E Q I N+L++LSEQ ++DCD Sbjct: 114 LEFDWRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---- 169 Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243 pattern 237 **** D GCNGGL A+ IIK GG+Q ES YPY A+ C NS + + Sbjct: 170 ------VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVK--- 219 Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 + I E + + GP+ +A DA + Y G+ C + L+H +L+VGY Sbjct: 220 DCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIKY-CFNSGLNHAVLLVGYGV 278 Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 351 +N N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I Sbjct: 279 EN-----NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 
>sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR Length = 334 Score = 208 bits (525), Expect = 1e-53 Identities = 126/351 (35%), Positives = 184/351 (51%), Gaps = 35/351 (9%) Query: 7 FVLAVFTVFVSSRG--IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 F L V + V+S + P + + +++ + Y E R +++ N I+ N Sbjct: 5 FFLTVLCLGVASAAPKLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQ 64 Query: 65 IAINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121 K + +N F D++++EF+ N + N K + + +P Sbjct: 65 EYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHK-------KGKLFHEPLLVDVPK- 116 Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181 + DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVDC Sbjct: 117 ---SVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA- 172 Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE-EQ 240 pattern 237 ** ** +G ++GCNGGL NA+ YI NGG+ +E SYPY A CN+ PE Sbjct: 173 ---QG----NQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYK-----PECSA 220 Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGIL 297 A + F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L Sbjct: 221 ANDTGFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVL 280 Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347 +VGY + T N +WIVKNSWG +WG GY+ + + +N CG++ S Sbjct: 281 VVGYGFEGTDSNNN-KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAAS 330 >sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR Length = 356 Score = 207 bits (522), Expect = 2e-53 Identities = 129/331 (38%), Positives = 181/331 (53%), Gaps = 40/331 (12%) Query: 29 FLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 F F + K+Y S EE +RFEIF NL I N +++K G+N+F DL+ DE Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYK----LGINEFTDLTWDE 112 Query: 88 FKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCG 144 F+ + L N A +L + N + PE + DWR G V+PVK QG+CG Sbjct: 113 FRKHKLGASQNCSATTKGNLKLT--------NVVLPETK---DWRKDGIVSPVKAQGKCG 161 Query: 145 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204 SCW+FSTTG +E + + K 
+SLSEQ LVDC + GCNGGL A+ Sbjct: 162 SCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNF--------GCNGGLPSQAFE 213 Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264 pattern 237 **** YI NGG+ TE +YPYT + G C F+ ANIG + + + N T+ + E A +V Sbjct: 214 YIKFNGGLDTEEAYPYTGKNGI-CKFSQANIGVKVISSV-NITLGAEYELKYAVALVR-- 269 Query: 265 PLAIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320 P+++A + V+ ++ Y GV+ + P ++H +L VGY +N PYW++KNS Sbjct: 270 PVSVAFEVVKGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVEN-----GTPYWLIKNS 324 Query: 321 WGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 WGADWGE GY + GKN CGV+ S I+ Sbjct: 325 WGADWGEDGYFKMEMGKNMCGVATCASYPIV 355 >sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE A-2) Length = 444 Score = 207 bits (521), Expect = 3e-53 Identities = 122/327 (37%), Positives = 177/327 (53%), Gaps = 39/327 (11%) Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84 F EF+ + + Y E +R F+ NL + E H+A +FG+ KF DLS Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90 Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPPEEQTAFDWRTRGAVTPVKNQGQ 142 EF YLN A + ++++P A DWR +GAVTPVK+QG Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD----AVDWREKGAVTPVKDQGA 146 Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202 CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A Sbjct: 147 CGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQA 196 Query: 203 YNYIIK--NGGIQTESSYPYTAETG--TQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 pattern 237 **** ++++++ NG + TE SYPY + G +C+ +S + A+I +I +E MA Sbjct: 197 FDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSEEL--VVGAQIDGHVLIGSSEKAMAA 254 Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 ++ GP+AIA DA + Y GV C L+HG+L+VGY + PYW++K Sbjct: 255 WLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIK 308 Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNF 345 NSWG DWGEQGY+ + G N C +S + Sbjct: 309 NSWGGDWGEQGYVRVVMGVNACLLSEY 335 >sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B 
PRECURSOR Length = 443 Score = 206 bits (520), Expect = 4e-53 Identities = 122/327 (37%), Positives = 177/327 (53%), Gaps = 40/327 (12%) Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84 F EF+ + + Y E +R F+ NL + E H+A +FG+ KF DLS Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90 Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPPEEQTAFDWRTRGAVTPVKNQGQ 142 EF YLN A + ++++P A DWR +GAVTPVK+QG Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD----AVDWREKGAVTPVKDQGA 146 Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202 CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A Sbjct: 147 CGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQA 196 Query: 203 YNYIIK--NGGIQTESSYPYTAETG--TQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 pattern 237 **** ++++++ NG + TE SYPY + G +C+ +S + A+I +I +E MA Sbjct: 197 FDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSELV---VGAQIDGHVLIGSSEKAMAA 253 Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 ++ GP+AIA DA + Y GV C L+HG+L+VGY + PYW++K Sbjct: 254 WLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIK 307 Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNF 345 NSWG DWGEQGY+ + G N C +S + Sbjct: 308 NSWGGDWGEQGYVRVVMGVNACLLSEY 334 >sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) Length = 333 Score = 206 bits (520), Expect = 4e-53 Identities = 125/349 (35%), Positives = 187/349 (52%), Gaps = 34/349 (9%) Query: 8 VLAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65 +LA F + ++S + + ++Q+ +++ N+ Y E R +++ N+ IE N Sbjct: 6 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQE 65 Query: 66 AINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122 K +N F D++S+EF+ N + N K F + E Sbjct: 66 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-----------KGKVFQEPLFYEA 114 Query: 123 QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 182 + DWR +G VTPVKNQGQCGSCW+FS TG +EGQ F +L+SLSEQNLVDC Sbjct: 115 
PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDC----- 169 Query: 183 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAK 242 pattern 237 **** G + +EGCNGGL A+ Y+ NGG+ +E SYPY A T C +N A Sbjct: 170 --SGPQG-NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEA-TEESCKYNP----KYSVAN 221 Query: 243 ISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIV 299 + F IPK E + + + GP+++A DA + FY G+ F+ C+ +DHG+L+V Sbjct: 222 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVV 281 Query: 300 GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVS 347 GY ++T N YW+VKNSWG +WG GY+ + + +N CG+++ S Sbjct: 282 GYGFEST-ESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAAS 329 >sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR Length = 334 Score = 206 bits (519), Expect = 5e-53 Identities = 121/316 (38%), Positives = 167/316 (52%), Gaps = 33/316 (10%) Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK---NYYLNNK 96 Y E R +++ N+ IE N K +N F D++++EF+ N + N K Sbjct: 40 YGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQK 99 Query: 97 EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVE 156 F S+ E + DWR +G VT VKNQGQCGSCW+FS TG +E Sbjct: 100 HK-----------KGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALE 148 Query: 157 GQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTES 216 GQ F KLVSLSEQNLVDC +G ++GCNGGL NA+ Y+ NGG+ TE Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSRP----QG----NQGCNGGLMDNAFQYVKDNGGLDTEE 200 Query: 217 SYPYTAETGTQCNFNSANIGPE-EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--V 273 pattern 237 ** ** SYPY C + PE A + F IP+ E + + + GP+++A DA Sbjct: 201 SYPYLGRETNSCTYK-----PECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHS 255 Query: 274 EWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 332 +QFY G+ +D C+ LDHG+L+VGY + T + +WIVKNSWG +WG GY+ Sbjct: 256 SFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGT-DSNSSKFWIVKNSWGPEWGWNGYVK 314 Query: 333 LRRGKNT-CGVSNFVS 347 + + +N CG+S S Sbjct: 315 MAKDQNNHCGISTAAS 330 >sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) Length = 380 Score = 204 bits (513), Expect = 3e-52 
Identities = 124/334 (37%), Positives = 178/334 (53%), Gaps = 41/334 (12%) Query: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADT----KFGVN 78 E ++ + + K+ K Y S E+ RFEIFK L I+E H ADT K G+N Sbjct: 37 EVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDE-------HNADTNRSYKVGLN 89 Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVK 138 +FADL+ +EF++ YL ++ V++ + F +P + DWR+ GAV +K Sbjct: 90 QFADLTDEEFRSTYLGFTSG--SNKTKVSNRYEPRFGQVLP----SYVDWRSAGAVVDIK 143 Query: 139 NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 +QG+CG CW+FS VEG + I L+SLSEQ L+DC G GCNGG Sbjct: 144 SQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC--------GRTQNTRGCNGGY 195 Query: 199 QPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 pattern 237 **** + + +II NGGI TE +YPYTA+ G +CN + N E+ I + +P N Sbjct: 196 ITDGFQFIINNGGINTEENYPYTAQDG-ECNLDLQN---EKYVTIDTYENVPYNNEWALQ 251 Query: 259 YIVSTGPLAIAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWI 316 V+ P+++A DA ++ Y G+F PC ++DH + IVGY + I YWI Sbjct: 252 TAVTYQPVSVALDAAGDAFKHYSSGIFTGPCG-TAIDHAVTIVGYGTEGGI-----DYWI 305 Query: 317 VKNSWGADWGEQGYIYLRR---GKNTCGVSNFVS 347 VKNSW WGE+GY+ + R G TCG++ S Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPS 339 >sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE EP-C1) Length = 362 Score = 203 bits (510), Expect = 6e-52 Identities = 125/313 (39%), Positives = 177/313 (55%), Gaps = 35/313 (11%) Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103 +RF +FK+NL + N + +K +NKFAD+++ EF++ Y +K +F Sbjct: 58 KRFNVFKANLMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHPRMFRGT 113 Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 E + S+PP + DWR +GAVT VK+QGQCGSCW+FST VEG + I Sbjct: 114 PHENGAFMYEKVVSVPP----SVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKT 169 Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 NKLV+LSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPY A+ Sbjct: 170 
NKLVALSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQ 220 Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281 pattern 237 **** GT C+ + N + I +P N+ V+ P+++A DA ++QFY G Sbjct: 221 EGT-CDASKVN---DLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEG 276 Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----K 337 VF C+ L+HG+ IVGY T+ N YWIV+NSWG +WGE GYI ++R + Sbjct: 277 VFTGDCS-TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEHGYIRMQRNISKKE 331 Query: 338 NTCGVSNFVSTSI 350 CG++ S I Sbjct: 332 GLCGIAMLPSYPI 344 >sp|Q10991|CATL_SHEEP CATHEPSIN L Length = 217 Score = 201 bits (507), Expect = 1e-51 Identities = 105/226 (46%), Positives = 139/226 (61%), Gaps = 23/226 (10%) Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186 DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVD Sbjct: 6 DWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD--------SS 57 Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE-EQAKISN 245 pattern 237 ** ** ++GCNGGL NA+ YI +NGG+ +E SYPY A T T CN+ PE AK + Sbjct: 58 RPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEA-TDTSCNYK-----PEYSAAKDTG 111 Query: 246 FTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYS 302 F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY Sbjct: 112 FVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYG 171 Query: 303 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347 + T N +WIVKNSWG +WG +GY+ + + +N CG++ S Sbjct: 172 FEGT----NNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAAS 213 >sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR Length = 360 Score = 201 bits (506), Expect = 2e-51 Identities = 121/307 (39%), Positives = 161/307 (52%), Gaps = 28/307 (9%) Query: 43 EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTD 102 +E RF +FK N+ I E N A K +NKF D+++ EF++ Y +K Sbjct: 54 DEKNRRFNVFKENVKFIHEFNQ---KKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRS 110 Query: 103 DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 162 + ++ + DWR +GAVT VK+QGQCGSCW+FST +VEG + I Sbjct: 111 
QRGIQKNTGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIK 170 Query: 163 QNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA 222 +LVSLSEQ LVDCD + +EGCNGGL A+ +I KN GI TE SYPY Sbjct: 171 TGELVSLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAE 220 Query: 223 ETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIG 280 pattern 237 **** + GT C N N I +P N V+ P++++ +A +QFY Sbjct: 221 QDGT-CASNLLN---SPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSE 276 Query: 281 GVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG---- 336 GVF C LDHG+ IVGY A R YWIVKNSWG +WGE GYI ++RG Sbjct: 277 GVFTGRCG-TELDHGVAIVGYGAT----RDGTKYWIVKNSWGEEWGESGYIRMQRGISDK 331 Query: 337 KNTCGVS 343 + CG++ Sbjct: 332 RGKCGIA 338 >sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR Length = 442 Score = 200 bits (504), Expect = 3e-51 Identities = 117/308 (37%), Positives = 169/308 (53%), Gaps = 32/308 (10%) Query: 4 ILLFVLAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 +L F+ + + S++ E Q + F + + YS EE+ R++IFKSN+ + + Sbjct: 3 VLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQ 62 Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121 N + +T G+N FAD+++ E++ YL F + ++E I S P Sbjct: 63 WN----SKGGETVLGLNVFADITNQEYRTTYLGTP---FDGSALIGT--EEEKIFSTPAP 113 Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI---SQNKLVSLSEQNLVDCD 178 DWR +GAVTP+KNQGQCG CWSFSTTG+ EG HFI ++ LVSLSEQNL+DC Sbjct: 114 ---TVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDC- 169 Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE 238 pattern 237 ** + + GC GGL + YII N GI TESSYPYTAE G +C F ++NIG Sbjct: 170 -------SKSYGNNGCEGGLMTLGFEYIINNKGIDTESSYPYTAEDGKECKFKTSNIG-- 220 Query: 239 EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHG 295 pattern 239 ** A+I ++ + + P+++A DA +Q Y G++ P C P LDHG Sbjct: 221 --AQIVSYQNVTSGSEASLQSASNNAPVSVAIDASNESFQLYESGIYYEPACTPTQLDHG 278 Query: 296 ILIVGYSA 303 +L+VGY + Sbjct: 279 VLVVGYGS 286 Score = 48.8 bits (114), Expect = 
2e-05 Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 1/35 (2%) Query: 314 YWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 YWIVKNSWG WG GYI++ + + N CG++ S Sbjct: 401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMAS 435 >sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) Length = 334 Score = 199 bits (501), Expect = 7e-51 Identities = 127/357 (35%), Positives = 191/357 (52%), Gaps = 43/357 (12%) Query: 5 LLFVLAVFTVFVSSRGIPPEEQS---QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 L VLA F + ++S +P +Q+ ++ +++ + Y E R +++ N+ IE Sbjct: 3 LSLVLAAFCLGIAS-AVPKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61 Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNY---YLNNK---EAIFTDDLPVADYLDDEFI 115 N K +N F D++++EF+ + N K +F + L +LD Sbjct: 62 HNGEYSQGKHGFTMAMNAFPDMTNEEFRQMMGCFRNQKFRKGKVFREPL----FLD---- 113 Query: 116 NSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLV 175 +P + DWR +G VTPVKNQ QCGSCW+FS TG +EGQ F KLVSLSEQNLV Sbjct: 114 --LPK----SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167 Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANI 235 DC +G ++GCNGG A+ Y+ +NGG+ +E SYPY A C + N Sbjct: 168 DCSRP----QG----NQGCNGGFMARAFQYVKENGGLDSEESYPYVA-VDEICKYRPEN- 217 Query: 236 GPEEQAKISNFTMI-PKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNS 291 pattern 237 **** A + FT++ P E + + + GP+++A DA +QFY G+ F+ C+ + Sbjct: 218 ---SVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKN 274 Query: 292 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347 LDHG+L+VGY + N YW+VKNSWG +WG GY+ + + KN CG++ S Sbjct: 275 LDHGVLVVGYGFEGA-NSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAAS 330 >sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH) Length = 324 Score = 196 bits (494), Expect = 5e-50 Identities = 116/322 (36%), Positives = 168/322 (52%), Gaps = 30/322 (9%) Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 F +F KFNK YS E E L RF+IF+ NL +I N + + ++ +NKF+DLS +E Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKN----QNDSTAQYEINKFSDLSKEE 83 Query: 88 
FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147 + Y T + LD P FDWR VT VKNQG CG+CW Sbjct: 84 AISKYTGLSLPHQTQNFCEVVILDRP-----PDRGPLEFDWRQFNKVTSVKNQGVCGACW 138 Query: 148 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 207 +F+T G++E Q I N+L++LSEQ +DCD + GC+GGL A+ + Sbjct: 139 AFATLGSLESQFAIKYNRLINLSEQQFIDCDR----------VNAGCDGGLLHTAFESAM 188 Query: 208 KNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 pattern 237 **** + GG+Q ES YPY G QC N ++ M E + + + GP+ Sbjct: 189 EMGGVQMESDYPYETANG-QCRINPNRFVVGVRSCRRYIVMF---EEKLKDLLRAVGPIP 244 Query: 268 IAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327 +A DA + Y G+ C + L+H +L+VGY+ +N N+PYWI+KN+WG DWGE Sbjct: 245 VAIDASDIVNYRRGIMR-QCANHGLNHAVLLVGYAVEN-----NIPYWILKNTWGTDWGE 298 Query: 328 QGYIYLRRGKNTCGVSNFVSTS 349 GY +++ N CG+ N + +S Sbjct: 299 DGYFRVQQNINACGIRNELVSS 320 >sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR Length = 471 Score = 196 bits (494), Expect = 5e-50 Identities = 115/310 (37%), Positives = 166/310 (53%), Gaps = 31/310 (10%) Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDD 103 E+ RF +F NL ++ N A + + G+N+FADL+++EF+ +L K A Sbjct: 69 EHERRFLVFWDNLKFVDAHNARA-DEGGGFRLGMNRFADLTNEEFRATFLGAKVA--ERS 125 Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 + + + +P + DWR +GAV PVKNQGQCGSCW+FS VE + + Sbjct: 126 RAAGERYRHDGVEELPE----SVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVT 181 Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 ++++LSEQ LV+C + GCNGGL +A+++IIKNGGI TE YPY A Sbjct: 182 GEMITLSEQELVEC--------STNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAV 233 Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281 pattern 237 **** G +C+ N N + I F +P+N+ V+ P+++A +A E+Q Y G Sbjct: 234 DG-KCDINREN---AKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289 Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-- 339 VF C SLDHG++ VGY N YWIV+NSWG WGE GY+ + R N Sbjct: 290 
VFSGRCG-TSLDHGVVAVGYGTDN-----GKDYWIVRNSWGPKWGESGYVRMERNINVTT 343 Query: 340 --CGVSNFVS 347 CG++ S Sbjct: 344 GKCGIAMMAS 353 >sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR Length = 458 Score = 194 bits (488), Expect = 2e-49 Identities = 124/355 (34%), Positives = 183/355 (50%), Gaps = 43/355 (12%) Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQ--FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 ++LL LA + + S G EE+++ + E++ + K Y+ E R+ F+ NL I Sbjct: 12 LLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYI 71 Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-----NKEAIFTDDLPVADYLDDEF 114 +E N A + G+N+FADL+++E+++ YL +E +D AD Sbjct: 72 DEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAAD------ 125 Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174 N PE + DWRT+GAV +K+QG CGSCW+FS VE + I L+SLSEQ L Sbjct: 126 -NEALPE---SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQEL 181 Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234 VDCD + +EGCNGGL A+++II NGGI TE YPY + +C+ N N Sbjct: 182 VDCD---------TSYNEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGK-DERCDVNRKN 231 Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSL 292 pattern 237 **** + I ++ + N V P+++A +A +Q Y G+F C +L Sbjct: 232 ---AKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCG-TAL 287 Query: 293 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 343 DHG+ VGY +N YWIV+NSWG WGE GY+ + R CG++ Sbjct: 288 DHGVAAVGYGTEN-----GKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIA 337 >sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR Length = 462 Score = 193 bits (486), Expect = 4e-49 Identities = 122/321 (38%), Positives = 168/321 (52%), Gaps = 43/321 (13%) Query: 35 KFNKKYSHEEYLE---RFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNY 91 K K S +E RFEIFK NL ++E N ++++ G+ +FADL++DE+++ Sbjct: 56 KHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYR----LGLTRFADLTNDEYRSK 111 Query: 92 YLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWS 148 YL K+ L + DE SI DWR +GAV VK+QG CGSCW+ Sbjct: 112 
YLGAKMEKKGERRTSLRYEARVGDELPESI--------DWRKKGAVAEVKDQGGCGSCWA 163 Query: 149 FSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 208 FST G VEG + I L++LSEQ LVDCD + +EGCNGGL A+ +IIK Sbjct: 164 FSTIGAVEGINQIVTGDLITLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIIK 214 Query: 209 NGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAI 268 pattern 237 **** NGGI T+ YPY GT C+ N + I ++ +P V+ P++I Sbjct: 215 NGGIDTDKDYPYKGVDGT-CDQIRKN---AKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270 Query: 269 AADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326 A +A +Q Y G+FD C LDHG++ VGY +N YWIV+NSWG WG Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCG-TQLDHGVVAVGYGTEN-----GKDYWIVRNSWGKSWG 324 Query: 327 EQGYIYLRR----GKNTCGVS 343 E GY+ + R CG++ Sbjct: 325 ESGYLRMARNIASSSGKCGIA 345 >sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR Length = 360 Score = 193 bits (485), Expect = 5e-49 Identities = 115/329 (34%), Positives = 172/329 (51%), Gaps = 32/329 (9%) Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 +F F ++ K Y S E +RF IF +L + N ++++ G+N+FAD+S + Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYR----LGINRFADMSWE 113 Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 EF+ L + A + + + DWR G V+PVKNQG CGSC Sbjct: 114 EFRATRLGAAQNCS------ATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSC 167 Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 W+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI Sbjct: 168 WTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNF--------GCNGGLPSQAFEYI 219 Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 pattern 237 **** NGG+ TE SYPY G C F + N+G + + N T+ ++E A +V P+ Sbjct: 220 KYNGGLDTEESYPYQGVNGI-CKFKNENVGVKVLDSV-NITLGAEDELKDAVGLVR--PV 275 Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322 ++A + + ++ Y GV+ P ++H +L VGY ++ +PYW++KNSWG Sbjct: 276 SVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVED-----GVPYWLIKNSWG 330 Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 ADWG++GY + GKN CGV+ S I+ 
Sbjct: 331 ADWGDEGYFKMEMGKNMCGVATCASYPIV 359 >sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II) (PPII) Length = 352 Score = 192 bits (482), Expect = 1e-48 Identities = 128/319 (40%), Positives = 169/319 (52%), Gaps = 43/319 (13%) Query: 35 KFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91 K NK Y S +E + RFEIF+ NL I+E N K + + G+N FADLS+DEFK Sbjct: 54 KHNKIYESIDEKIYRFEIFRDNLMYIDETN------KKNNSYWLGLNGFADLSNDEFKKK 107 Query: 92 YLNNKEAIFTDDLPVADYLDDE-FINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFS 150 Y+ +D ++ D+E F + DWR +GAVTPVKNQG CGSCW+FS Sbjct: 108 YVG----FVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFS 163 Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 210 T VEG + I L+ LSEQ LVDCD GC GG Q + Y + N Sbjct: 164 TIATVEGINKIVTGNLLELSEQELVDCDKH----------SYGCKGGYQTTSLQY-VANN 212 Query: 211 GIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKN-ETVMAGYIVSTGPLAIA 269 pattern 237 **** G+ T YPY A+ +C A P + KI+ + +P N ET G + + PL++ Sbjct: 213 GVHTSKVYPYQAKQ-YKCR---ATDKPGPKVKITGYKRVPSNCETSFLGALANQ-PLSVL 267 Query: 270 ADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327 +A +Q Y GVFD PC LDH + VGY + KN Y I+KNSWG +WGE Sbjct: 268 VEAGGKPFQLYKSGVFDGPCG-TKLDHAVTAVGYGTSD---GKN--YIIIKNSWGPNWGE 321 Query: 328 QGYIYLRR----GKNTCGV 342 +GY+ L+R + TCGV Sbjct: 322 KGYMRLKRQSGNSQGTCGV 340 >sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) Length = 333 Score = 192 bits (482), Expect = 1e-48 Identities = 121/333 (36%), Positives = 173/333 (51%), Gaps = 38/333 (11%) Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 E+ F + + K YS EY R ++F +N KI+ N NH K G+N+F+D+S Sbjct: 29 EKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHN--QRNHTF--KMGLNQFSDMS 84 Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRG-AVTPVKNQGQC 143 E K+ YL ++ ++ P ++ DWR +G V+PVKNQG C Sbjct: 85 FAEIKHKYLWSEPQN-------CSATKSNYLRGTGPYP-SSMDWRKKGNVVSPVKNQGAC 136 Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 GSCW+FSTTG 
+E I+ K+++L+EQ LVDC + + GC GGL A+ Sbjct: 137 GSCWTFSTTGALESAVAIASGKMMTLAEQQLVDC--------AQNFNNHGCQGGLPSQAF 188 Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ-AKISNFTMIPKN-ETVMAGYIV 261 pattern 237 **** YI+ N GI E SYPY + G QC FN PE+ A + N I N E M + Sbjct: 189 EYILYNKGIMGEDSYPYIGKNG-QCKFN-----PEKAVAFVKNVVNITLNDEAAMVEAVA 242 Query: 262 STGPLAIAADAVE-WQFYIGGVFDI-PCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIV 317 P++ A + E + Y GV+ C+ P+ ++H +L VGY +N + YWIV Sbjct: 243 LYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIV 297 Query: 318 KNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 KNSWG++WG GY + RGKN CG++ S I Sbjct: 298 KNSWGSNWGNNGYFLIERGKNMCGLAACASYPI 330 >sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR Length = 328 Score = 190 bits (477), Expect = 5e-48 Identities = 114/304 (37%), Positives = 164/304 (53%), Gaps = 29/304 (9%) Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPV 106 ERF IFK NL I+ N N A K G+ FA+L++DE+++ YL + + Sbjct: 27 ERFNIFKDNLRFIDLHN--ENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRR-ITK 83 Query: 107 ADYLDDEFINSIPPEE-QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK 165 A ++ ++ ++ +E DWR +GAV +K+QG CGSCW+FST VEG + I + Sbjct: 84 AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143 Query: 166 LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETG 225 LVSLSEQ LVDCD ++ ++GCNGGL A+ +I+KNGG+ TE YPY G Sbjct: 144 LVSLSEQELVDCD---------KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNG 194 Query: 226 TQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVF 283 pattern 237 **** +CN N I + +P + VS P+++A DA +Q Y G+F Sbjct: 195 -KCNSLLKN---SRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIF 250 Query: 284 DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNT 339 C N +DH ++ VGY ++N + YWIV+NSWG WGE GYI + R Sbjct: 251 TGKCGTN-MDHAVVAVGYGSEN-----GVDYWIVRNSWGTRWGEDGYIRMERNVASKSGK 304 Query: 340 CGVS 343 CG++ Sbjct: 305 CGIA 308 >sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR Length = 335 Score = 188 bits (472), Expect = 2e-47 Identities = 123/332 (37%), 
Positives = 170/332 (51%), Gaps = 36/332 (10%) Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 E+ F + K K YS EEY R + F SN KI N N K +N+F+D+S Sbjct: 31 EKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHN----NGNHTFKMALNQFSDMS 86 Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGA-VTPVKNQGQC 143 E K+ YL ++ ++YL PP + DWR +G V+PVKNQG C Sbjct: 87 FAEIKHKYLWSEPQ--NCSATKSNYLRGT--GPYPP----SVDWRKKGNFVSPVKNQGAC 138 Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 GSCW+FSTTG +E I+ K++SL+EQ LVDC + Y GC GGL A+ Sbjct: 139 GSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNY--------GCQGGLPSQAF 190 Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSAN-IGPEEQAKISNFTMIPKNETVMAGYIVS 262 pattern 237 **** YI+ N GI E +YPY + G C F IG + ++N T+ +E M + Sbjct: 191 EYILYNKGIMGEDTYPYQGKDG-YCKFQPGKAIGFVKD--VANITIY--DEEAMVEAVAL 245 Query: 263 TGPLAIAADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 P++ A + ++ Y G++ C+ P+ ++H +L VGY KN I PYWIVK Sbjct: 246 YNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGI-----PYWIVK 300 Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 NSWG WG GY + RGKN CG++ S I Sbjct: 301 NSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332 >sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA) (PAPAYA PROTEINASE III) (PPIII) (PAPAYA PEPTIDASE A) Length = 348 Score = 187 bits (471), Expect = 2e-47 Identities = 121/319 (37%), Positives = 161/319 (49%), Gaps = 38/319 (11%) Query: 37 NKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNYYL 93 NK Y + +E L RFEIFK NL I+E N K + + G+N+FADLS+DEF Y+ Sbjct: 56 NKFYENVDEKLYRFEIFKDNLNYIDETN------KKNNSYWLGLNEFADLSNDEFNEKYV 109 Query: 94 NNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 153 + D + D+EFIN DWR +GAVTPV++QG CGSCW+FS Sbjct: 110 GS-----LIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVA 164 Query: 154 NVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213 VEG + I KLV LSEQ LVDC+ GC GG P A Y+ KN GI Sbjct: 165 TVEGINKIRTGKLVELSEQELVDCERR----------SHGCKGGYPPYALEYVAKN-GIH 213 Query: 214 
TESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV 273 pattern 237 **** S YPY A+ GT C GP K S + N ++ P+++ ++ Sbjct: 214 LRSKYPYKAKQGT-CRAKQVG-GP--IVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESK 269 Query: 274 --EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331 +Q Y GG+F+ PC +DH + VGY Y ++KNSWG WGE+GYI Sbjct: 270 GRPFQLYKGGIFEGPCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGTAWGEKGYI 323 Query: 332 YLRRGK-NTCGVSNFVSTS 349 ++R N+ GV +S Sbjct: 324 RIKRAPGNSPGVCGLYKSS 342 >sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR Length = 362 Score = 187 bits (471), Expect = 2e-47 Identities = 112/329 (34%), Positives = 170/329 (51%), Gaps = 33/329 (10%) Query: 28 QFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 +F F + K+Y E RF IF +L + N + ++ G+N+FAD+S + Sbjct: 61 RFARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRGLPYR----LGINRFADMSWE 116 Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 EF+ L + A + + P +T DWR G V+PVK+QG CGSC Sbjct: 117 EFQASRLGAAQNCS------ATLAGNHRMRDAPALPETK-DWREDGIVSPVKDQGHCGSC 169 Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 W FSTTG++E ++ + VSLSEQ L DC + GC+GGL A+ YI Sbjct: 170 WPFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNF--------GCSGGLPSQAFEYI 221 Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 pattern 237 **** NGG+ TE +YPYT G C++ N G + + N T++ ++E A +V P+ Sbjct: 222 KYNGGLDTEEAYPYTGVNGI-CHYKPENAGVKVLDSV-NITLVAEDELKNAVGLVR--PV 277 Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322 ++A + ++ Y GV+ +P ++H +L VGY +N +PYW++KNSWG Sbjct: 278 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVEN-----GVPYWLIKNSWG 332 Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 ADWG+ GY + GKN CG++ S I+ Sbjct: 333 ADWGDNGYFTMEMGKNMCGIATCASYPIV 361 >sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) Length = 333 Score = 187 bits (469), Expect = 4e-47 Identities = 115/356 (32%), Positives = 184/356 (51%), Gaps = 30/356 (8%) Query: 3 
VILLFVLAVFTVFVSSRGIPPEEQS--QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 +I + LA+ + V S P+ ++ E++ K K Y+ E + +++ N IE Sbjct: 1 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTYNMNEERLKRAVWEKNFKMIE 60 Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIP 119 N + + D +N F DL++ EF ++ I + + D +F+ +P Sbjct: 61 LHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTGFQRQKIKKTHI----FQDHQFLY-VP 115 Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 DWR G VTPVKNQG C S W+FS TG++EGQ F +L+ LSEQNL+DC Sbjct: 116 KR----VDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMG 171 Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 pattern 237 *** + + GC+GG A+ Y+ NGG+ TE SYPY + G +C +++ N Sbjct: 172 SNVTH--------GCSGGFMQYAFQYVKDNGGLATEESYPYRGQ-GRECRYHAEN----S 218 Query: 240 QAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGI 296 pattern 240 * A + +F IP +E + + GP+++A DA +QFY G++ P C L+H + Sbjct: 219 AANVRDFVQIPGSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAV 278 Query: 297 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 351 L+VGY + N +W+VKNSWG +WG +GY+ L + N CG++ + + I+ Sbjct: 279 LVVGYGFEGEESDGN-SFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333 >sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR Length = 335 Score = 186 bits (468), Expect = 5e-47 Identities = 124/343 (36%), Positives = 176/343 (51%), Gaps = 42/343 (12%) Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFG 76 S+ + E+ F + + KKYS EEY R ++F SN KI N A NH K G Sbjct: 23 SNLAVSSFEKLHFKSWMVQHQKKYSLEEYHHRLQVFVSNWRKINAHN--AGNHTF--KLG 78 Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGA-VT 135 +N+F+D+S DE ++ YL ++ +YL PP + DWR +G V+ Sbjct: 79 LNQFSDMSFDEIRHKYLWSEPQ--NCSATKGNYLRGT--GPYPP----SMDWRKKGNFVS 130 Query: 136 PVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCN 195 PVKNQG CGSCW+FSTTG +E I+ K++SL+EQ LVDC + + GC Sbjct: 131 PVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDC--------AQNFNNHGCQ 182 Query: 196 
GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ----AKISNFTMIPK 251 pattern 237 **** GGL A+ YI N GI E +YPY + C F P++ ++N TM Sbjct: 183 GGLPSQAFEYIRYNKGIMGEDTYPYKGQ-DDHCKFQ-----PDKAIAFVKDVANITM--N 234 Query: 252 NETVMAGYIVSTGPLAIAADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTI 307 +E M + P++ A + ++ Y G++ C+ P+ ++H +L VGY +N I Sbjct: 235 DEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGI 294 Query: 308 FRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 PYWIVKNSWG WG GY + RGKN CG++ S I Sbjct: 295 -----PYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332 >sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR Length = 362 Score = 185 bits (466), Expect = 9e-47 Identities = 111/329 (33%), Positives = 169/329 (50%), Gaps = 33/329 (10%) Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 +F F ++ K Y S E RF IF +L ++ N + ++ G+N+F+D+S + Sbjct: 60 RFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYR----LGINRFSDMSWE 115 Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 EF+ L + A + + +T DWR G V+PVKNQ CGSC Sbjct: 116 EFQATRLGAAQTCS------ATLAGNHLMRDAAALPETK-DWREDGIVSPVKNQAHCGSC 168 Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 W+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI Sbjct: 169 WTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNF--------GCNGGLPSQAFEYI 220 Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 pattern 237 **** NGGI TE SYPY G C++ + N + + N T+ ++E A +V P+ Sbjct: 221 KYNGGIDTEESYPYKGVNGV-CHYKAENAAVQVLDSV-NITLNAEDELKNAVGLVR--PV 276 Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322 ++A ++ ++ Y GV+ P+ ++H +L VGY +N +PYW++KNSWG Sbjct: 277 SVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVEN-----GVPYWLIKNSWG 331 Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 ADWG+ GY + GKN C ++ S ++ Sbjct: 332 ADWGDNGYFKMEMGKNMCAIATCASYPVV 360 >sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEPSIN X) (CATHEPSIN O2) Length = 329 Score = 185 bits (465), Expect = 1e-46 Identities = 123/350 (35%), 
Positives = 185/350 (52%), Gaps = 39/350 (11%) Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 L V + V S + PEE + + ++ K+Y+++ + + R I++ NL I NL Sbjct: 4 LKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLE 63 Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTA 125 A + +N D++S+E K +P++ ++ + IP E A Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLK-------VPLSHSRSNDTLY-IPEWEGRA 115 Query: 126 ---FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 182 D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E Sbjct: 116 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-- 173 Query: 183 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAK 242 pattern 237 **** ++GC GG NA+ Y+ KN GI +E +YPY + C +N + AK Sbjct: 174 --------NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE-ESCMYNPTG----KAAK 220 Query: 243 ISNFTMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILI 298 + IP+ NE + + GP+++A DA +QFY GV +D CN ++L+H +L Sbjct: 221 CRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLA 280 Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 VGY +K +WI+KNSWG +WG +GYI + R K N CG++N S Sbjct: 281 VGYG-----IQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 325 >sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPAYA PEPTIDASE B) (GLYCYL ENDOPEPTIDASE) Length = 348 Score = 184 bits (462), Expect = 3e-46 Identities = 116/315 (36%), Positives = 162/315 (50%), Gaps = 37/315 (11%) Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYL 93 K NK Y + +E L RFEIFK NL I+E N + + G+N+F+DLS+DEFK Y+ Sbjct: 54 KHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYW----LGLNEFSDLSNDEFKEKYV 109 Query: 94 NNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 153 + +T+ D+EF+N + + DWR +GAVTPVK+QG C SCW+FST Sbjct: 110 GSLPEDYTNQP-----YDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVA 164 Query: 154 NVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213 VEG + I LV LSEQ LVDCD + GCN G Q + Y+ +N GI Sbjct: 165 
TVEGINKIKTGNLVELSEQELVDCDKQ----------SYGCNRGYQSTSLQYVAQN-GIH 213 Query: 214 TESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV 273 pattern 237 **** + YPY A+ T C N GP + K + + N ++ P+++ ++ Sbjct: 214 LRAKYPYIAKQQT-CRANQVG-GP--KVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESA 269 Query: 274 --EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331 ++Q Y GG+F+ C +DH + VGY Y ++KNSWG WGE GYI Sbjct: 270 GRDFQNYKGGIFEGSCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGPGWGENGYI 323 Query: 332 YLRRGK----NTCGV 342 +RR CGV Sbjct: 324 RIRRASGNSPGVCGV 338 >sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR Length = 373 Score = 183 bits (461), Expect = 3e-46 Identities = 125/349 (35%), Positives = 171/349 (48%), Gaps = 40/349 (11%) Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58 VLAV V + S IP E++ E +Q + H E RF FKSN Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75 Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLP-VADYLDDEF- 114 I + N + D + + N+F D+ EF+ ++ + P V ++ Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRRDTPSKPPSVPGFMYAALN 130 Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174 ++ +PP + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLSEQ L Sbjct: 131 VSDLPP----SVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQEL 186 Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234 +DCD A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN A Sbjct: 187 IDCD---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNVARAA 236 Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSL 292 pattern 237 **** I +P N V+ P+++A +A + FY GVF C L Sbjct: 237 QNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECG-TEL 295 Query: 293 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 DHG+ +VGY + YW VKNSWG WGEQGYI + + G Sbjct: 296 DHGVAVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340 >sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR Length = 371 Score = 183 bits (460), Expect = 5e-46 Identities = 126/353 
(35%), Positives = 170/353 (47%), Gaps = 48/353 (13%) Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58 VLAV V + S IP E++ E +Q + H E RF FKSN Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75 Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF-- 114 I + N + D + + N+F D+ EF+ ++ + D P F Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRR----DTPAKPPSVPGFMY 126 Query: 115 ----INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 170 ++ +PP + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLS Sbjct: 127 AALNVSDLPP----SVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLS 182 Query: 171 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF 230 EQ L+DCD A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN Sbjct: 183 EQELIDCD---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNV 232 Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCN 288 pattern 237 **** A I +P N V+ P+++A +A + FY GVF C Sbjct: 233 ARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCG 292 Query: 289 PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 LDHG+ +VGY + YW VKNSWG WGEQGYI + + G Sbjct: 293 -TELDHGVAVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340 >sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) Length = 329 Score = 183 bits (459), Expect = 6e-46 Identities = 119/348 (34%), Positives = 181/348 (51%), Gaps = 35/348 (10%) Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 L V + V S + PEE +Q+ ++ ++K+Y+ + + + R I++ NL I NL Sbjct: 4 LKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63 Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDE-FINSIPPEEQT 124 A + +N D++S+E K P + +D +I Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVP------PSRSHSNDTLYIPDWEGRTPD 117 Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 + D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E Sbjct: 118 SIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---- 173 Query: 185 
EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 pattern 237 **** + GC GG NA+ Y+ +N GI +E +YPY + C +N + AK Sbjct: 174 ------NYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQ-DESCMYNPTG----KAAKCR 222 Query: 245 NFTMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVG 300 + IP+ NE + + GP+++A DA +QFY GV +D C+ ++++H +L VG Sbjct: 223 GYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVG 282 Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 Y +K +WI+KNSWG WG +GYI + R K N CG++N S Sbjct: 283 YG-----IQKGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLAS 325 >sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR Length = 379 Score = 182 bits (458), Expect = 8e-46 Identities = 110/322 (34%), Positives = 173/322 (53%), Gaps = 38/322 (11%) Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLNNKE 97 ++HEE +R EIFK+N I ++N N K+ + G+NKFAD++ EF YL + Sbjct: 56 HNHEEEAKRLEIFKNNSNYIRDMNA---NRKSPHSHRLGLNKFADITPQEFSKKYLQAPK 112 Query: 98 AIFTDDLPVAD--YLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155 + + + +A+ +++ PP ++DWR +G +T VK QG CG W+FS TG + Sbjct: 113 DV-SQQIKMANKKMKKEQYSCDHPP---ASWDWRKKGVITQVKYQGGCGRGWAFSATGAI 168 Query: 156 EGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTE 215 E H I+ LVSLSEQ LVDC E EG G Q ++ +++++GGI T+ Sbjct: 169 EAAHAIATGDLVSLSEQELVDCVEE----------SEGSYNGWQYQSFEWVLEHGGIATD 218 Query: 216 SSYPYTAETGTQCNFN----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 271 pattern 237 **** YPY A+ G +C N I E +S+ + + E I+ P++++ D Sbjct: 219 DDYPYRAKEG-RCKANKIQDKVTIDGYETLIMSDESTESETEQAFLSAILEQ-PISVSID 276 Query: 272 AVEWQFYIGGVFDIP--CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 329 A ++ Y GG++D +P ++H +L+VGY + + + YWI KNSWG DWGE G Sbjct: 277 AKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSAD-----GVDYWIAKNSWGFDWGEDG 331 Query: 330 YIYLRRGK----NTCGVSNFVS 347 YI+++R CG++ F S Sbjct: 332 YIWIQRNTGNLLGVCGMNYFAS 353 >sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) Length = 333 Score = 180 bits (451), Expect = 5e-45 Identities = 115/332 (34%), 
Positives = 166/332 (49%), Gaps = 36/332 (10%) Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 E+ F + + K YS EY R ++F +N KI+ N NH K +N+F+D+S Sbjct: 29 EKFHFKSWMKQHQKTYSSVEYNHRLQMFANNWRKIQAHN--QRNHTF--KMALNQFSDMS 84 Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRG-AVTPVKNQGQC 143 E K+ +L ++ ++ P ++ DWR +G V+PVKNQG C Sbjct: 85 FAEIKHKFLWSEPQN-------CSATKSNYLRGTGPYP-SSMDWRKKGNVVSPVKNQGAC 136 Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 SCW+FSTTG +E I+ K++SL+EQ LVDC + + GC GGL A+ Sbjct: 137 ASCWTFSTTGALESAVAIASGKMLSLAEQQLVDC--------AQAFNNHGCKGGLPSQAF 188 Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKN-ETVMAGYIVS 262 pattern 237 **** YI+ N GI E SYPY + + C FN + A + N I N E M + Sbjct: 189 EYILYNKGIMEEDSYPYIGK-DSSCRFNP----QKAVAFVKNVVNITLNDEAAMVEAVAL 243 Query: 263 TGPLAIAADAVE-WQFYIGGVFDIPC---NPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 P++ A + E + Y GV+ P+ ++H +L VGY +N + YWIVK Sbjct: 244 YNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIVK 298 Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 NSWG+ WGE GY + RGKN CG++ S I Sbjct: 299 NSWGSQWGENGYFLIERGKNMCGLAACASYPI 330 >sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR Length = 329 Score = 178 bits (447), Expect = 2e-44 Identities = 117/352 (33%), Positives = 182/352 (51%), Gaps = 43/352 (12%) Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 L V + + S + PEE +Q+ ++ K+Y+ + + + R I++ NL +I NL Sbjct: 4 LKVLLLPMVSFALSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLE 63 Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD-----EFINSIPP 120 A + +N D++S+E + P Y +D E+ +P Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIP------PSRSYSNDTLYTPEWEGRVPD 117 Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 + D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E Sbjct: 118 ----SIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE 173 Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 pattern 237 **** + 
GC GG A+ Y+ +NGGI +E ++PY + C +N+ + Sbjct: 174 ----------NYGCGGGYMTTAFQYVQQNGGIDSEDAFPYVGQ-DESCMYNAT----AKA 218 Query: 241 AKISNFTMIP-KNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGI 296 AK + IP NE + + GP++++ DA +QFY GV +D C+ ++++H + Sbjct: 219 AKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAV 278 Query: 297 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 L+VGY +K +WI+KNSWG WG +GY L R K N CG++N S Sbjct: 279 LVVGYGT-----QKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMAS 325 >sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) Length = 376 Score = 177 bits (445), Expect = 3e-44 Identities = 112/351 (31%), Positives = 171/351 (47%), Gaps = 47/351 (13%) Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 P E + F FQ +FN+ Y S EE+ R +IF NL + + L + +FGV F Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLG---TAEFGVTPF 91 Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAF--DWR-TRGAVTPV 137 +DL+ +EF Y + A + I S PEE F DWR GA++P+ Sbjct: 92 SDLTEEEFGQLYGYRRAAGGVPSM-------GREIRSEEPEESVPFSCDWRKVAGAISPI 144 Query: 138 KNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGG 197 K+Q C CW+ + GN+E IS V +S L+DC C +GC+GG Sbjct: 145 KDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVHELLDCGR----------CGDGCHGG 194 Query: 198 LQPNAYNYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGPEEQAKISNFTMIPKNETVM 256 pattern 237 **** +A+ ++ N G+ +E YP+ + +C+ ++ A I +F M+ NE + Sbjct: 195 FVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKY----QKVAWIQDFIMLQNNEHRI 250 Query: 257 AGYIVSTGPLAIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSA--------KN 305 A Y+ + GP+ + + Q Y GV C+P +DH +L+VG+ + Sbjct: 251 AQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAE 310 Query: 306 TIFRKNMP-------YWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349 T+ ++ P YWI+KNSWGA WGE+GY L RG NTCG++ F T+ Sbjct: 311 TVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTA 361 >sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) Length = 371 Score = 176 bits (442), Expect = 6e-44 Identities = 110/346 (31%), Positives = 166/346 (47%), Gaps = 
40/346 (11%) Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 P E + F FQ +FN+ Y + EY R IF NL + + L + +FG F Sbjct: 33 PLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLG---TAEFGETPF 89 Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWR-TRGAVTPVKN 139 +DL+ +EF Y + T ++ + + S+P DWR + ++ VKN Sbjct: 90 SDLTEEEFGQLYGQERSPERTPNM-TKKVESNTWGESVP----RTCDWRKAKNIISSVKN 144 Query: 140 QGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQ 199 QG C CW+ + N++ I + V +S Q L+DC E C GCNGG Sbjct: 145 QGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQELLDC----------ERCGNGCNGGFV 194 Query: 200 PNAYNYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 pattern 237 **** +AY ++ N G+ +E YP+ + +C ++ A I +FTM+ NE +A Sbjct: 195 WDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKY----KKVAWIQDFTMLSNNEQAIAH 250 Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSAKN------TIF- 308 Y+ GP+ + + Q Y GV C+P +DH +L+VG+ K T+ Sbjct: 251 YLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKKKEGMQTGTVLS 310 Query: 309 -----RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349 R + PYWI+KNSWGA WGE+GY L RG NTCGV+ + T+ Sbjct: 311 HSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTA 356 >sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR Length = 321 Score = 173 bits (435), Expect = 4e-43 Identities = 100/304 (32%), Positives = 152/304 (49%), Gaps = 30/304 (9%) Query: 52 FKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLD 111 F+ +L + LN + + + +G+N+F+ L +EFK YL +K + F Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPR-------YS 96 Query: 112 DEFINSIPPEE-QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 170 E SIP FDWR + VT V+NQ CG CW+FS G VE + I L LS Sbjct: 97 AEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLS 156 Query: 171 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK-NGGIQTESSYPYTAETGTQCN 229 Q ++DC + + GCNGG NA N++ K + +S YP+ A+ G C+ Sbjct: 157 VQQVIDCSYN----------NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGL-CH 205 Query: 230 FNSANIGPEEQAKISNFTM--IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPC 287 pattern 237 **** + 
S G I ++ E MA +++ GPL + DAV WQ Y+GG+ C Sbjct: 206 YFS---GSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC 262 Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVS 347 + +H +LI G+ + PYWIV+NSWG+ WG GY +++ G N CG+++ VS Sbjct: 263 SSGEANHAVLITGFDKTG-----STPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVS 317 Query: 348 TSII 351 + + Sbjct: 318 SIFV 321 >sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) Length = 345 Score = 173 bits (433), Expect = 7e-43 Identities = 119/322 (36%), Positives = 163/322 (49%), Gaps = 43/322 (13%) Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91 K NK Y + +E + RFEIFK NL I+E N K + + G+N FAD+S+DEFK Sbjct: 54 KHNKIYKNIDEKIYRFEIFKDNLKYIDETN------KKNNSYWLGLNVFADMSNDEFKEK 107 Query: 92 YLNNKEAIFTD-DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFS 150 Y + +T +L + L+D +N PE DWR +GAVTPVKNQG CGSCW+FS Sbjct: 108 YTGSIAGNYTTTELSYEEVLNDGDVNI--PEY---VDWRQKGAVTPVKNQGSCGSCWAFS 162 Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 210 +EG I L SEQ L+DCD GCNGG +A ++ Sbjct: 163 AVVTIEGIIKIRTGNLNEYSEQELLDCDRR----------SYGCNGGYPWSALQ-LVAQY 211 Query: 211 GIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAA 270 pattern 237 **** GI ++YPY G Q S GP + P NE + Y ++ P+++ Sbjct: 212 GIHYRNTYPY---EGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALL-YSIANQPVSVVL 267 Query: 271 DAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQ 328 +A ++Q Y GG+F PC N +DH + VGY Y ++KNSWG WGE Sbjct: 268 EAAGKDFQLYRGGIFVGPCG-NKVDHAVAAVGYGPN---------YILIKNSWGTGWGEN 317 Query: 329 GYIYLRRGK-NTCGVSNFVSTS 349 GYI ++RG N+ GV ++S Sbjct: 318 GYIRIKRGTGNSYGVCGLYTSS 339 >sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR Length = 331 Score = 171 bits (428), Expect = 3e-42 Identities = 116/351 (33%), Positives = 175/351 (49%), Gaps = 35/351 (9%) Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYS--HEEYLERFEIFKSNLGKIEEL 62 L+ VL V + V+ P + ++ + K+Y +EE + R I++ NL + Sbjct: 4 LVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRL-IWEKNLKFVMLH 62 Query: 63 
NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122 NL G+N D++S+E + T L V P Sbjct: 63 NLEHSMGMHSYDLGMNHLGDMTSEEVMS---------LTSSLRVPSQWQRNITYKSNPNR 113 Query: 123 --QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 + DWR +G VT VK QG CG+CW+FS G +E Q + KLV+LS QNLVDC Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDC--- 170 Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 pattern 237 **** E+ ++GCNGG A+ YII N GI +++SYPY A +C ++S Sbjct: 171 ----STEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKA-MDQKCQYDS----KYRA 221 Query: 241 AKISNFTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGIL 297 A S +T +P E V+ + + GP+++ DA F++ GV+ P +++HG+L Sbjct: 222 ATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVL 281 Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 +VGY N YW+VKNSWG ++GE+GYI + R K N CG+++F S Sbjct: 282 VVGYGDLN-----GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPS 327 >sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L Length = 176 Score = 167 bits (420), Expect = 2e-41 Identities = 87/179 (48%), Positives = 115/179 (63%), Gaps = 16/179 (8%) Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186 DWR +G VTPVK+QGQCGSCW+FSTTG +EGQHF ++ KLVSLSEQNLVDC EG Sbjct: 6 DWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRP----EG 61 Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNF 246 pattern 237 **** ++GCNGGL A+ Y+ NGGI +E SYPYTA+ C + + A + F Sbjct: 62 ----NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKA----EYNAANDTGF 113 Query: 247 TMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGY 301 IP+ +E + + S GP+++A DA +QFY G++ P C+ LDHG+L+VGY Sbjct: 114 VDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGY 172 >sp|P25326|CATS_BOVIN CATHEPSIN S Length = 217 Score = 165 bits (413), Expect = 1e-40 Identities = 90/227 (39%), Positives = 129/227 (56%), Gaps = 21/227 (9%) Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 + DWR +G VT VK QG CGSCW+FS G 
+E Q + KLVSLS QNLVDC Sbjct: 4 SMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDC------- 56 Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 pattern 237 **** + ++GCNGG A+ YII N GI +E+SYPY A G +C ++ N A S Sbjct: 57 STAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG-KCQYDVKN----RAATCS 111 Query: 245 NFTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGILIVGY 301 + +P +E + + + GP+++ DA F++ GV+ P +++HG+L+VGY Sbjct: 112 RYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGY 171 Query: 302 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 + YW+VKNSWG +G+QGYI + R N CG++N+ S Sbjct: 172 GNLD-----GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPS 213 >sp|P80884|ANAN_ANACO ANANAIN Length = 216 Score = 161 bits (403), Expect = 2e-39 Identities = 93/224 (41%), Positives = 123/224 (54%), Gaps = 26/224 (11%) Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 + DWR GAVT VKNQG+CGSCW+F++ VE + I + LVSLSEQ ++DC Sbjct: 4 SIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDC------- 56 Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 pattern 237 **** A GC GG AY++II N G+ + + YPY A GT C N G A I+ Sbjct: 57 ----AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGT-CKTN----GVPNSAYIT 107 Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 +T + +N Y VS P+A A DA +Q Y GVF PC L+H I+I+GY Sbjct: 108 RYTYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCG-TRLNHAIVIIGYGQ 166 Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 343 + +WIV+NSWGA WGE GYI L R ++ CG++ Sbjct: 167 DSA----GKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGICGIA 206 >sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR Length = 330 Score = 158 bits (396), Expect = 1e-38 Identities = 89/226 (39%), Positives = 128/226 (56%), Gaps = 22/226 (9%) Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186 DWR +G VT VK QG CGSCW+FS G +EGQ + KLVSLS QNLVDC E Sbjct: 118 DWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTE------ 171 Query: 187 
EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNF 246 pattern 237 **** E+ ++GC GG A+ YII + I +E+SYPY A +C ++ N A S + Sbjct: 172 EKYGNKGCGGGFMTEAFQYII-DTSIDSEASYPYKA-MDEKCLYDPKN----RAATCSRY 225 Query: 247 TMIP-KNETVMAGYIVSTGPLAIAADAV---EWQFYIGGVFDIPCNPNSLDHGILIVGYS 302 +P +E + + + GP+++ D + Y GV+D P +++HG+L+VGY Sbjct: 226 IELPFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYG 285 Query: 303 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYL-RRGKNTCGVSNFVS 347 + YW+VKNSWG +G+QGYI + R KN CG++++ S Sbjct: 286 TLD-----GKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCS 326 >sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE PRECURSOR Length = 346 Score = 158 bits (395), Expect = 2e-38 Identities = 87/238 (36%), Positives = 130/238 (54%), Gaps = 25/238 (10%) Query: 112 DEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSE 171 D ++ + + DWR +G + VK+QG CGSCW+FS +E + I L+SLSE Sbjct: 8 DRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSE 67 Query: 172 QNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFN 231 Q LVDCD + +EGC+GGL A+ ++IKNGGI TE YPY G C+ Sbjct: 68 QELVDCD---------RSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGV-CDQY 117 Query: 232 SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNP 289 pattern 237 **** N + KI ++ +P N V+ P++IA +A ++Q Y G+F C Sbjct: 118 RKN---AKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCG- 173 Query: 290 NSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 343 ++DHG++I GY +N M YWIV+NSWGA+ E GY+ ++R ++ CG++ Sbjct: 174 TAVDHGVVIAGYGTEN-----GMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLA 226 >sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR Length = 308 Score = 152 bits (379), Expect = 1e-36 Identities = 105/320 (32%), Positives = 151/320 (46%), Gaps = 48/320 (15%) Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 F ++ NK +++ EYL RF +F N +E A+ +N FAD++ +E Sbjct: 18 FKQWAATHNKVFANRAEYLYRFAVFLDNKKFVE----------ANANTELNVFADMTHEE 67 Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 
147 F +L T ++P + + P + DWR+ + P K+QGQCGSCW Sbjct: 68 FIQTHLG-----MTYEVPETTSNVKAAVKAAPE----SVDWRS--IMNPAKDQGQCGSCW 116 Query: 148 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 207 +F TT +EG+ KL S SEQ LVDCD A D GC GG N+ +I Sbjct: 117 TFCTTAVLEGRVNKDLGKLYSFSEQQLVDCD----------ASDNGCEGGHPSNSLKFIQ 166 Query: 208 KNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 pattern 237 **** +N G+ ES YPY A GT C N+ ++ + +ET + I GP+A Sbjct: 167 ENNGLGLESDYPYKAVAGT-CK-KVKNVATVTGSR----RVTDGSETGLQTIIAENGPVA 220 Query: 268 IAADA--VEWQFYIGGVF--DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGA 323 + DA +Q Y G D C ++H + VGY + + N YWI++NSWG Sbjct: 221 VGMDASRPSFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNS-----NGKYWIIRNSWGT 275 Query: 324 DWGEQGYIYLRR-GKNTCGV 342 WG+ GY L R N CG+ Sbjct: 276 SWGDAGYFLLARDSNNMCGI 295 >sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR Length = 315 Score = 150 bits (375), Expect = 4e-36 Identities = 103/317 (32%), Positives = 163/317 (50%), Gaps = 47/317 (14%) Query: 37 NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADT-KFGVN-KFADLSSDEFKNYYLN 94 NK ++ E L R IF N ++A N++ +T K V+ FA ++++E+ + Sbjct: 24 NKHFTAVESLRRRAIFNMNA------RIVAENNRKETFKLSVDGPFAAMTNEEYNSLLKL 77 Query: 95 NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGN 154 + ++ ++N P+ A DWR +G VTP+++QG CGSC++F + Sbjct: 78 KRSGEEKGEV--------RYLNIQAPK---AVDWRKKGKVTPIRDQGNCGSCYTFGSIAA 126 Query: 155 VEGQHFISQ---NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 211 +EG+ I + ++ + LSE+++V C E +G + GCNGGL N YNYI++N G Sbjct: 127 LEGRLLIEKGGDSETLDLSEEHMVQCTRE----DG----NNGCNGGLGSNVYNYIMEN-G 177 Query: 212 IQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 271 pattern 237 **** I ES YPYT T + AKI ++ + +N V +S G + ++ D Sbjct: 178 IAKESDYPYTGSDST------CRSDVKAFAKIKSYNRVARNNEVELKAAISQGLVDVSID 231 Query: 272 A--VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326 A V++Q Y G + D C N +L+H + VGY + WIV+NSWG WG Sbjct: 232 ASSVQFQLYKSGAYTDTQCKNNYFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWG 286 Query: 327 
EQGYIYLRRGKNTCGVS 343 E+GYI + NTCGV+ Sbjct: 287 EKGYINMVIEGNTCGVA 303 >sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR Length = 395 Score = 150 bits (374), Expect = 6e-36 Identities = 101/331 (30%), Positives = 157/331 (46%), Gaps = 29/331 (8%) Query: 26 QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 ++++ ++ K Y +E R IF+SN E +N +N ADL+ Sbjct: 88 ETEWKDYVTALGKHYDQKENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTD 147 Query: 86 DEF--KNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQC 143 +EF +N + +++ + +P + DWRT+GAVTPV+NQG+C Sbjct: 148 EEFMVRNGLRLPNQTDLRGKRQTSEFYRYDKSERLPDQ----VDWRTKGAVTPVRNQGEC 203 Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 GSC++F+T +E H +L+ LS QN+VDC + GC+GG P A+ Sbjct: 204 GSCYAFATAAALEAYHKQMTGRLLDLSPQNIVDCT--------RNLGNNGCSGGYMPTAF 255 Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMI-PKNETVMAGYIVS 262 pattern 237 **** Y + GI ES YPY T +C + + + + F I P +E + + Sbjct: 256 QYASRY-GIAMESRYPYVG-TEQRCRWQQSIAVVTD----NGFNEIQPGDELALKHAVAK 309 Query: 263 TGP--LAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320 GP + I+ ++FY GV+ N DH +L VGY + YWIVKNS Sbjct: 310 RGPVVVGISGSKRSFRFYKDGVYS-EGNCGRPDHAVLAVGYGTHPSY----GDYWIVKNS 364 Query: 321 WGADWGEQGYIYLRRGK-NTCGVSNFVSTSI 350 WG DWG+ GY+Y+ R + N C +++ S I Sbjct: 365 WGTDWGKDGYVYMARNRGNMCHIASAASFPI 395 >sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR Length = 506 Score = 150 bits (374), Expect = 6e-36 Identities = 116/363 (31%), Positives = 180/363 (48%), Gaps = 64/363 (17%) Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 S+F ++ + NKKY + +E L+RFE FK K ++ N + + VN+++D S Sbjct: 160 SKFFKYMKENNKKYENMDEQLQRFENFKIRYMKTQKHNEMVGKNGLTYVQKVNQYSDFSK 219 Query: 86 DEFKNYYLNNKEAIFTDDL------PVADYLDDEFINSIPPEEQT---AFDWRTRGAVTP 136 +EF NY+ K DL P+ +L + + S+ + + + D+R++ P Sbjct: 220 EEFDNYF--KKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLP 277 Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKL-VSLSEQNLVDCDHECMEYEGEEACDEGCN 195 K+QG CGSCW+F+ GN E 
+ +++++ +S SEQ +VDC E + GC+ Sbjct: 278 PKDQGNCGSCWAFAAIGNFEYLYVHTRHEMPISFSEQQMVDCSTE----------NYGCD 327 Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQC-NFNSANIGPEEQAKISNFTMIPKNET 254 pattern 237 **** GG A+ Y+I NG + YPY C N+ + +G ++ + NE Sbjct: 328 GGNPFYAFLYMINNG-VCLGDEYPYKGHEDFFCLNYRCSLLG-----RVHFIGDVKPNEL 381 Query: 255 VMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSA---------- 303 +MA V GP+ IA A E + Y GGVFD CNP L+H +L+VGY Sbjct: 382 IMALNYV--GPVTIAVGASEDFVLYSGGVFDGECNPE-LNHSVLLVGYGQVKKSLAFEDS 438 Query: 304 -----KNTI--FRKNMP---------YWIVKNSWGADWGEQGYIYLRRGK----NTCGVS 343 N I +++N+ YWIV+NSWG +WGE GYI ++R K CGV Sbjct: 439 HSNVDSNLIKKYKENIKGDDDDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVG 498 Query: 344 NFV 346 + V Sbjct: 499 SDV 501 >sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE PROTEINASE ACP3) Length = 308 Score = 149 bits (372), Expect = 9e-36 Identities = 103/316 (32%), Positives = 159/316 (49%), Gaps = 45/316 (14%) Query: 37 NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDEFKNYYLNN 95 NK ++ E L R IF N + E N K K V+ FA ++++E++ L + Sbjct: 17 NKHFTAVEALRRRAIFNMNARFVAEFN-----KKGSFKLSVDGPFAAMTNEEYRTL-LKS 70 Query: 96 KEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155 K + + ++N PE + DWR +G VTP+++Q QCGSC++F + + Sbjct: 71 KRTVEENGKVT-------YLNIQAPE---SVDWRAQGKVTPIRDQAQCGSCYTFGSLAAL 120 Query: 156 EGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI 212 EG+ I + + LSE++LV C + + GCNGGL N Y+YII+N G+ Sbjct: 121 EGRLLIEKGGNANTLDLSEEHLVQCT--------RDNGNNGCNGGLGSNVYDYIIQN-GV 171 Query: 213 QTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 272 pattern 237 **** ES YPYT T + C N + AKI+ + +P+N +S G + ++ DA Sbjct: 172 AKESDYPYTG-TDSTCKTN-----VKAFAKITGYNKVPRNNEAELKAALSQGLVDVSIDA 225 Query: 273 --VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327 ++Q Y G + D C N +L+H + VGY + WIV+NSWG WG+ Sbjct: 226 SSAKFQLYKSGAYSDTKCKNNFFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGD 280 Query: 328 QGYIYLRRGKNTCGVS 343 +GYI + NTCGV+ Sbjct: 281 
KGYINMVIEGNTCGVA 296 >sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR Length = 315 Score = 149 bits (372), Expect = 9e-36 Identities = 102/324 (31%), Positives = 161/324 (49%), Gaps = 45/324 (13%) Query: 29 FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDE 87 F + K NK ++ E L R IF N ++ N I K V+ FA ++++E Sbjct: 16 FNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDGPFAAMTNEE 70 Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147 ++ + + T++ YL+ + S+ DWR G VTP+++Q QCGSC+ Sbjct: 71 YRTLLKSKRT---TEENGQVKYLNIQAPESV--------DWRKEGKVTPIRDQAQCGSCY 119 Query: 148 SFSTTGNVEGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204 +F + +EG+ I + + LSE+++V C + + GCNGGL N Y+ Sbjct: 120 TFGSLAALEGRLLIEKGGDANTLDLSEEHMVQCT--------RDNGNNGCNGGLGSNVYD 171 Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264 pattern 237 **** YII++ G+ ES YPYT T C N + AKI+ +T +P+N +S G Sbjct: 172 YIIEH-GVAKESDYPYTGSDST-CKTNVKSF-----AKITGYTKVPRNNEAELKAALSQG 224 Query: 265 PLAIAADA--VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKN 319 + ++ DA ++Q Y G + D C N +L+H + VGY + WIV+N Sbjct: 225 LVDVSIDASSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKECWIVRN 279 Query: 320 SWGADWGEQGYIYLRRGKNTCGVS 343 SWG WG++GYI + NTCGV+ Sbjct: 280 SWGTGWGDKGYINMVIEGNTCGVA 303 >sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR Length = 310 Score = 145 bits (363), Expect = 1e-34 Identities = 102/330 (30%), Positives = 160/330 (47%), Gaps = 40/330 (12%) Query: 20 GIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN- 78 GI F + K NK ++ E L R IF N ++ N I K V+ Sbjct: 3 GIRIASAIDFNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDG 57 Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVK 138 FA ++++E++ + + T++ YL+ + S+ DWR G VTP++ Sbjct: 58 PFAAMTNEEYRTLLKSKRT---TEENGQVKYLNIQAPESV--------DWRKEGKVTPLR 106 Query: 139 NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 +Q QCGSC++F + +EG+ I + + N +D E M+ + + GCNGGL Sbjct: 107 
DQAQCGSCYTFGSLAALEGRLLIEKG-----GDANTLDLSEEHMQCTRDNG-NNGCNGGL 160 Query: 199 QPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 pattern 237 **** N Y+YII++G + ES YPYT T C N + KI+ +T +P+N Sbjct: 161 GSNVYDYIIEHG-VAKESDYPYTGSDST-CKTNVKSF-----RKITGYTKVPRNNEAELK 213 Query: 259 YIVSTGPLAIAAD--AVEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMP 313 +S G L ++ D + ++Q Y G + D C N +L+H + VGY + Sbjct: 214 AALSQGLLDVSIDVSSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKE 268 Query: 314 YWIVKNSWGADWGEQGYIYLRRGKNTCGVS 343 WIV+NSWG WG++GYI + NTCGV+ Sbjct: 269 CWIVRNSWGTSWGDKGYINMVIEGNTCGVA 298 >sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR Length = 441 Score = 145 bits (362), Expect = 1e-34 Identities = 107/345 (31%), Positives = 165/345 (47%), Gaps = 58/345 (16%) Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--NKFADLS 84 +F F +K+ K + S ++ ++RF F+ N ++ HK + + NKF+DLS Sbjct: 119 EFDAFVEKYKKVHRSFDQRVQRFLTFRKNYHIVK-------THKPTEPYSLDLNKFSDLS 171 Query: 85 SDEFKNYY--------------------LNNKEAIFTDDLPVADYLDDEFINSIPPEEQT 124 +EFK Y +++K I+ L A +++ S+ E Sbjct: 172 DEEFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEEIKDLSLITGEN- 230 Query: 125 AFDWRTRGAVTPVKNQGQ-CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183 +W AV+P K+QG CGSCW+FS+ +VE + + +NK LSEQ LV+CD M Sbjct: 231 -LNWARTDAVSPTKDQGDHCGSCWAFSSIASVESLYRLYKNKSYFLSEQELVNCDKSSM- 288 Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243 pattern 237 **** GC GGL A Y I + G+ ES PYT + C + N + I Sbjct: 289 ---------GCAGGLPITALEY-IHSKGVSFESEVPYTGIV-SPCKPSIKN-----KVFI 332 Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 + +++ N+ V ++S + IA E + Y GG+F C L+H +L+VG Sbjct: 333 DSISILKGNDVVNKSLVISPTVVGIAV-TKELKLYSGGIFTGKCG-GELNHAVLLVGEGV 390 Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGVSNF 345 + M YWI+KNSWG DWGE G++ L+R G + CG+ F Sbjct: 391 DH---ETGMRYWIIKNSWGEDWGENGFLRLQRTKKGLDKCGILTF 432 >sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR Length = 439 Score = 143 bits (357), Expect = 
5e-34 Identities = 105/351 (29%), Positives = 163/351 (45%), Gaps = 72/351 (20%) Query: 24 EEQSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKF 80 E +F EF K+N++++ +E L R F+SN +++E K D + G+N+F Sbjct: 119 EVYREFEEFNSKYNRRHATQQERLNRLVTFRSNYLEVKE-------QKGDEPYVKGINRF 171 Query: 81 ADLSSDEF--------------------------KNYYLNNKEAIFTDDLPVADYLDDEF 114 +DL+ EF K Y N K+A+ TD+ D Sbjct: 172 SDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDE--------DVD 223 Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174 + + E DWR +VT VK+Q CG CW+FST G+VEG + +K LS Q L Sbjct: 224 LAKLTGEN---LDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQEL 280 Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234 +DCD + GC GGL +AY Y+ K G+ + P+ + +C+ A Sbjct: 281 LDCD----------SFSNGCQGGLLESAYEYVRKY-GLVSAKDLPF-VDKARRCSVPKA- 327 Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDH 294 pattern 237 **** ++ + ++ + K + VM + S+ + + E Y GVF C SL+H Sbjct: 328 ----KKVSVPSYHVF-KGKEVMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECG-KSLNH 381 Query: 295 GILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGV 342 +++VG ++ YW+V+NSWG DWGE GY+ L R G + CGV Sbjct: 382 AVVLVGEGYDEVTKKR---YWVVQNSWGTDWGENGYMRLERTNMGTDKCGV 429 >sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR (TCP) Length = 569 Score = 141 bits (351), Expect = 3e-33 Identities = 107/367 (29%), Positives = 169/367 (45%), Gaps = 62/367 (16%) Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 S+F +F + NK Y + +E + +FEIFK N I+ N +N A K VN+F+D S Sbjct: 223 SKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN--KLNKNAMYKKKVNQFSDYSE 280 Query: 86 DEFKNYYLN----NKEAIFTDDLPVADYLDD-----EFINSIPPEEQTAF-------DWR 129 +E K Y+ I P ++L D EF + E+ F D+R Sbjct: 281 EELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 340 Query: 130 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 189 +G V K+QG CGSCW+F++ GN+E ++S SEQ +VDC + Sbjct: 341 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--------- 391 
Query: 190 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMI 249 pattern 237 **** + GC+GG ++ Y+++N + Y Y A+ C N + + +S+ + Sbjct: 392 -NFGCDGGHPFYSFLYVLQN-ELCLGDEYKYKAKDDMFC----LNYRCKRKVSLSSIGAV 445 Query: 250 PKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGY------- 301 +N+ ++A + GPL++ ++ Y GV++ C+ L+H +L+VGY Sbjct: 446 KENQLILA--LNEVGPLSVNVGVNNDFVAYSEGVYNGTCS-EELNHSVLLVGYGQVEKTK 502 Query: 302 -------SAKNTIFRKNMP------YWIVKNSWGADWGEQGYIYLRRGKN----TCGVSN 344 NT N P YWI+KNSW WGE G++ L R KN CG+ Sbjct: 503 LNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGE 562 Query: 345 FVSTSII 351 V I+ Sbjct: 563 EVFYPIL 569 >sp|P14518|BROM_ANACO BROMELAIN, STEM Length = 212 Score = 139 bits (348), Expect = 6e-33 Identities = 81/224 (36%), Positives = 113/224 (50%), Gaps = 31/224 (13%) Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 + DWR GAVT VKNQ CG+CW+F+ VE + I + L LSEQ ++DC Sbjct: 5 SIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC------- 57 Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 pattern 237 **** A GC GG + A+ +II N G+ + + YPY A GT C + G A I+ Sbjct: 58 ----AKGYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGT-CKTD----GVPNSAYIT 108 Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 + +P+N Y VS P+ +A DA +Q+Y GVF+ PC SL+H + +GY Sbjct: 109 GYARVPRNNESSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCG-TSLNHAVTAIGYGQ 167 Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 343 + I+ K WGA WGE GYI + R CG++ Sbjct: 168 DSIIYPK---------KWGAKWGEAGYIRMARDVSSSSGICGIA 202 >sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR (DER F I) Length = 321 Score = 138 bits (345), Expect = 1e-32 Identities = 115/352 (32%), Positives = 157/352 (43%), Gaps = 52/352 (14%) Query: 7 FVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65 FVLA+ ++ V S P F EF+ FNK Y+ +E E+ + N +E L + Sbjct: 3 FVLAIASLLVLSTVYARPASIKTFEEFKKAFNKNYAT---VEEEEVARKNF--LESLKYV 57 Query: 66 
AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----INSIPPE 121 N K +N +DLS DEFKN YL + EA + L L+ E INS+ Sbjct: 58 EAN-----KGAINHLSDLSLDEFKNRYLMSAEAF--EQLKTQFDLNAETSACRINSVNVP 110 Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181 + D R+ VTP++ QG CGSCW+FS E + +N + LSEQ LVDC Sbjct: 111 SE--LDLRSLRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTSLDLSEQELVDC---- 164 Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQA 241 pattern 237 **** A GC+G P YI +NG ++ E SYPY A NS + G Sbjct: 165 -------ASQHGCHGDTIPRGIEYIQQNGVVE-ERSYPYVAREQRCRRPNSQHYG----- 211 Query: 242 KISNFTMIPKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPCNPNSLD 293 ISN+ I + ++ AIA D +Q Y G D PN Sbjct: 212 -ISNYCQIYPPDVKQIREALTQTHTAIAVIIGIKDLRAFQHYDGRTIIQHDNGYQPNY-- 268 Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345 H + IVGY + + YWIV+NSW WG+ GY Y + G N + + Sbjct: 269 HAVNIVGYGS-----TQGDDYWIVRNSWDTTWGDSGYGYFQAGNNLMMIEQY 315 >sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR Length = 583 Score = 129 bits (320), Expect = 1e-29 Identities = 100/370 (27%), Positives = 166/370 (44%), Gaps = 84/370 (22%) Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 S+F F +K+ + Y E +E+++ FK N KI++ N K VN+F+D S Sbjct: 235 SKFFNFMNKYKRSYKDINEQMEKYKNFKMNYLKIKKHN----ETNQMYKMKVNQFSDYSK 290 Query: 86 DEFKNYYLNNKEAIFTDDLPVADYLDDEFI--------------------NSIPPEEQTA 125 +F++Y F +P+ D+L +++ ++ + Sbjct: 291 KDFESY--------FRKLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEI 342 Query: 126 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK-LVSLSEQNLVDCDHECMEY 184 D+R +G V K+QG CGSCW+F++ GNVE + NK +++LSEQ +VDC Sbjct: 343 LDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDC------- 395 Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 pattern 237 **** + GC+GG ++ Y I+N GI Y Y A C N + + +S Sbjct: 396 ---SKLNFGCDGGHPFYSFIYAIEN-GICMGDDYKYKAMDNLFC----LNYRCKNKVTLS 447 Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYS- 302 + + +NE + A + GP+++ ++ FY 
GG+F+ C L+H +L+VGY Sbjct: 448 SVGGVKENELIRA--LNEVGPVSVNVGVTDDFSFYGGGIFNGTCT-EELNHSVLLVGYGQ 504 Query: 303 -AKNTIFRKN-------------------------MPYWIVKNSWGADWGEQGYIYLRRG 336 + IF++ YWI+KNSW WGE G++ + R Sbjct: 505 VQSSKIFQEKNAYDDASGVTKKGALSYPSKADDGIQYYWIIKNSWSKFWGENGFMRISRN 564 Query: 337 KN----TCGV 342 K CG+ Sbjct: 565 KEGDNVFCGI 574 >sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR (DER P I) Length = 320 Score = 121 bits (300), Expect = 3e-27 Identities = 111/345 (32%), Positives = 151/345 (43%), Gaps = 57/345 (16%) Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 MK++L + V +R P F E++ FNK Y+ E E + N +E Sbjct: 1 MKIVLAIASLLALSAVYAR---PSSIKTFEEYKKAFNKSYAT---FEDEEAARKNF--LE 52 Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----IN 116 + + N A +N +DLS DEFKN +L + EA + L L+ E IN Sbjct: 53 SVKYVQSNGGA-----INHLSDLSLDEFKNRFLMSAEAF--EHLKTQFDLNAETNACSIN 105 Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 P E D R VTP++ QG CGSCW+FS E + +N+ + L+EQ LVD Sbjct: 106 GNAPAE---IDLRQMRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNQSLDLAEQELVD 162 Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 C A GC+G P YI NG +Q ES Y Y A + N+ G Sbjct: 163 C-----------ASQHGCHGDTIPRGIEYIQHNGVVQ-ESYYRYVAREQSCRRPNAQRFG 210 Query: 237 PEEQAKISNFTMI-PKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPC 287 pattern 237 **** ISN+ I P N + + T AIA D ++ Y G D Sbjct: 211 ------ISNYCQIYPPNVNKIREALAQTHS-AIAVIIGIKDLDAFRHYDGRTIIQRDNGY 263 Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 332 PN H + IVGYS + + YWIV+NSW +WG+ GY Y Sbjct: 264 QPNY--HAVNIVGYSN-----AQGVDYWIVRNSWDTNWGDNGYGY 301 >sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) Length = 462 Score = 111 bits (274), Expect = 3e-24 Identities = 83/260 (31%), Positives = 128/260 (48%), Gaps = 34/260 (13%) Query: 105 PVADYLDDEFINSIPPEEQTAFDWRT-RGA--VTPVKNQGQCGSCWSFSTTGNVEGQHFI 161 P+ D + + + S+P 
++DWR RG V+PV+NQ CGSC+SF++ G +E + I Sbjct: 218 PITDEIQQQIL-SLPE----SWDWRNVRGINFVSPVRNQESCGSCYSFASIGMLEARIRI 272 Query: 162 SQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYP 219 N + LS Q +V C +GC+GG ++ G+ E+ +P Sbjct: 273 LTNNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIAGKYAQDFGVVEENCFP 322 Query: 220 YTAETGTQCN--FNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQ 276 pattern 237 **** YTA T C N E + F NE +M +V GP+A+A + + + Sbjct: 323 YTA-TDAPCKPKENCLRYYSSEYYYVGGFYG-GCNEALMKLELVKHGPMAVAFEVHDDFL 380 Query: 277 FYIGGVF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGY 330 Y G++ P NP L +H +L+VGY K+ + + YWIVKNSWG+ WGE GY Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYG-KDPV--TGLDYWIVKNSWGSQWGESGY 437 Query: 331 IYLRRGKNTCGVSNFVSTSI 350 +RRG + C + + +I Sbjct: 438 FRIRRGTDECAIESIAMAAI 457 >sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) Length = 462 Score = 109 bits (270), Expect = 9e-24 Identities = 91/335 (27%), Positives = 155/335 (46%), Gaps = 42/335 (12%) Query: 34 DKFNKKYSH-----EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF 88 +K N +H E Y ER ++ N ++ +N + K+ T ++ +S + Sbjct: 147 EKVNMNAAHLGGLQERYSER--LYTHNHNFVKAINTV---QKSWTATAYKEYEKMSLRDL 201 Query: 89 KNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRT-RGA--VTPVKNQGQCGS 145 +++ P+ D + + +N PE ++DWR +G V+PV+NQ CGS Sbjct: 202 IRRSGHSQRIPRPKPAPMTDEIQQQILNL--PE---SWDWRNVQGVNYVSPVRNQESCGS 256 Query: 146 CWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 C+SF++ G +E + I N + LS Q +V C +GC+GG Sbjct: 257 CYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIA 306 Query: 204 NYIIKNGGIQTESSYPYTA-ETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVS 262 pattern 237 **** ++ G+ ES +PYTA ++ + N + + F NE +M +V Sbjct: 307 GKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYSSDYYYVGGFYG-GCNEALMKLELVK 365 Query: 263 TGPLAIAADAVE-WQFYIGGVF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYW 315 GP+A+A + + + Y G++ P NP L +H +L+VGY + YW Sbjct: 366 
HGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVT---GIEYW 422 Query: 316 IVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 I+KNSWG++WGE GY +RRG + C + + +I Sbjct: 423 IIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAI 457 >sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN) (PDP) Length = 139 Score = 108 bits (267), Expect = 2e-23 Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 9/145 (6%) Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETV 255 pattern 237 **** GGL +A+ Y+ NGG+ +E SYPY A+ G C + N A ++++ IP E Sbjct: 1 GGLIDDAFQYVKDNGGLDSEESYPYHAQ-GDSCKYRPEN----SVANVTDYWDIPSKENE 55 Query: 256 MAGYIVSTGPLAIAADAV--EWQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFRKNM 312 + + + GP++ A DA ++FY G++ D C+ +DHG+L+VGY A T +N Sbjct: 56 LMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTE-TENK 114 Query: 313 PYWIVKNSWGADWGEQGYIYLRRGK 337 YWI+KNSWG DWG GYI + + + Sbjct: 115 KYWIIKNSWGTDWGMDGYIKMAKDR 139 >sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR Length = 454 Score = 108 bits (266), Expect = 3e-23 Identities = 75/238 (31%), Positives = 109/238 (45%), Gaps = 33/238 (13%) Query: 126 FDWRT-----RGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCD 178 FDW + R VTP++NQG CGSC++ + +E + + N + LS Q +VDC Sbjct: 222 FDWTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPILSPQTVVDCS 281 Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF--NSANIG 236 EGCNGG ++ G+ + PYT E +C N Sbjct: 282 ----------PYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGEDTGKCTVSKNCTRYY 331 Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPC-------- 287 pattern 237 **** + + I + NE +M ++S GP + + E +QFY G++ Sbjct: 332 TTDYSYIGGYYGAT-NEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTDHYNF 390 Query: 288 NPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 NP L +H +L+VGY PYW VKNSWG +WGEQGY + RG + CGV + Sbjct: 391 NPFELTNHAVLLVGYGVDKL---SGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445 >sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) Length = 463 
Score = 107 bits (265), Expect = 3e-23 Identities = 75/235 (31%), Positives = 111/235 (46%), Gaps = 29/235 (12%) Query: 124 TAFDWRTRGA---VTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCD 178 T++DWR V+PV+NQ CGSC+SF++ G +E + I N + LS Q +V C Sbjct: 233 TSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS 292 Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSA--NIG 236 +GC GG ++ G+ E+ +PYT T + C Sbjct: 293 QYA----------QGCEGGFPYLIAGKYAQDFGLVEEACFPYTG-TDSPCKMKEDCFRYY 341 Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDI-----PCNPN 290 pattern 237 **** E + F NE +M +V GP+A+A + + + Y G++ P NP Sbjct: 342 SSEYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPF 400 Query: 291 SL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 L +H +L+VGY + M YWIVKNSWG WGE GY +RRG + C + + Sbjct: 401 ELTNHAVLLVGYGTDSA---SGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIES 452 >sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) Length = 211 Score = 99.8 bits (245), Expect = 7e-21 Identities = 73/228 (32%), Positives = 102/228 (44%), Gaps = 33/228 (14%) Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 S+P E D R+ VTP++ QG CGSCW+FS + E + +N + L+EQ LVD Sbjct: 10 SLPSE----LDLRSLRTVTPIRMQGGCGSCWAFSGVASTESAYLAYRNMSLDLAEQELVD 65 Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 C A GC+G P YI +NG +Q E YPY A + N+ G Sbjct: 66 C-----------ASQNGCHGDTIPRGIEYIQQNGVVQ-EHYYPYVAREQSCHRPNAQRYG 113 Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAI---AADAVEWQFYIGGVF---DIPCNPN 290 pattern 237 **** + +IS P + + + +A+ D ++ Y G D PN Sbjct: 114 LKNYCQISP----PDSNKIRQALTQTHTAVAVIIGIKDLNAFRHYDGRTIMQHDNGYQPN 169 Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN 338 H + IVGY NT + + YWIV+NSW WG+ GY Y N Sbjct: 170 Y--HAVNIVGYG--NT---QGVDYWIVRNSWDTTWGDNGYGYFAANIN 210 >sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) Length = 151 Score = 94.8 bits (232), Expect = 2e-19 Identities = 60/158 (37%), Positives = 87/158 
(54%), Gaps = 15/158 (9%) Query: 41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIF 100 +H+E++ R+E FK N+ + N + + T G+N+ ADLS++E++ YL + I Sbjct: 1 THKEFMPRYEEFKKNMDYVHNWN----SKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIK 56 Query: 101 TDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHF 160 + + +N ++ DWR + AVTPVK+QGQCGSC STTG+VEG Sbjct: 57 LNGYHKRNL--GLRLNRPHFKQPLNVDWREKDAVTPVKDQGQCGSC-IISTTGSVEGVTA 113 Query: 161 ISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 I KLVSLSEQN++ +EGCNGGL Sbjct: 114 IKTGKLVSLSEQNILRL--------SSSFGNEGCNGGL 143 >sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PRECURSOR Length = 344 Score = 90.9 bits (222), Expect = 4e-18 Identities = 69/272 (25%), Positives = 111/272 (40%), Gaps = 47/272 (17%) Query: 108 DYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 167 D + E ++IP W ++ +++Q CGSCW+F+ + + I+ N V Sbjct: 72 DIVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAV 131 Query: 168 S--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSY------- 218 + LS ++L+ C G +C GC GG A+ + +K+G + T SY Sbjct: 132 NTLLSSEDLLSC------CTGMFSCGNGCEGGYPIQAWKWWVKHG-LVTGGSYETQFGCK 184 Query: 219 PY-----------------------TAETGTQCNFNSANIGPEEQAKISNFTM--IPKNE 253 pattern 237 **** PY T + C + P Q K T + K Sbjct: 185 PYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKV 244 Query: 254 TVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNM 312 + I++ GP+ +A E + Y GV+ + H + I+G+ N Sbjct: 245 EQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDN-----GT 299 Query: 313 PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 PYW+V NSW WGE+GY + RG N CG+ + Sbjct: 300 PYWLVANSWNVAWGEKGYFRIIRGLNECGIEH 331 >sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PRECURSOR Length = 335 Score = 90.5 bits (221), Expect = 5e-18 Identities = 73/299 (24%), Positives = 124/299 (41%), Gaps = 50/299 (16%) Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140 D++ ++ K + + A T D+ V + +E ++IP W ++ +++Q Sbjct: 46 
DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE--DTIPATFDARTQWPNCMSINNIRDQ 103 Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 CGSCW+F+ + I+ N V+ LS ++++ C C C GC GG Sbjct: 104 SDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSC---CSN------CGYGCEGGY 154 Query: 199 QPNAYNYIIKNG---GIQTESSYPYTAETGTQCNFNSANI--------GPEEQAKISNFT 247 pattern 237 **** NA+ Y++K+G G E+ + + C N+ G + A ++ T Sbjct: 155 PINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCT 214 Query: 248 -------------------MIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPC 287 + K + + I++ GP+ A E + Y GV+ Sbjct: 215 NKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTT 274 Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 346 H I I+G+ N PYW+V NSW +WGE GY + RG N CG+ + V Sbjct: 275 GQELGGHAIRILGWGTDN-----GTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAV 328 >sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) Length = 96 Score = 90.5 bits (221), Expect = 5e-18 Identities = 43/87 (49%), Positives = 55/87 (62%), Gaps = 2/87 (2%) Query: 264 GPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSW 321 GPLA+A +A Q YIGGV L+HG+L+VGY + I K PYW++KNSW Sbjct: 1 GPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGYAPIRLKEKPYWVIKNSW 60 Query: 322 GADWGEQGYIYLRRGKNTCGVSNFVST 348 G +WGE GY + RG+N CGV + VST Sbjct: 61 GENWGENGYYKICRGRNICGVDSMVST 87 >sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR Length = 335 Score = 88.5 bits (216), Expect = 2e-17 Identities = 65/259 (25%), Positives = 105/259 (40%), Gaps = 47/259 (18%) Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL---SEQNL 174 +P W + +++QG CGSCW+F + + I N V++ +E L Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139 Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ--------------------- 213 C EC +GCNGG A+N+ K G + Sbjct: 140 TCCGGEC---------GDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHH 190 Query: 214 -TESSYPYTAETGT-QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 pattern 237 **** S P T E T +CN S + ++ S++++ + +MA I GP+ Sbjct: 191 
VNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAE-IYKNGPV 249 Query: 267 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 A ++ Y GV+ H I I+G+ +N PYW+V NSW DW Sbjct: 250 EGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVEN-----GTPYWLVGNSWNTDW 304 Query: 326 GEQGYIYLRRGKNTCGVSN 344 G+ G+ + RG++ CG+ + Sbjct: 305 GDNGFFKILRGQDHCGIES 323 >sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) Length = 339 Score = 87.4 bits (213), Expect = 4e-17 Identities = 66/265 (24%), Positives = 113/265 (41%), Gaps = 45/265 (16%) Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNL 174 ++P W + +++QG CGSCW+F + + I N V++ S ++L Sbjct: 79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138 Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK----NGGIQTE--------------- 215 + C C C +GCNGG A+N+ + +GG+ Sbjct: 139 LTC---C-----GIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHH 190 Query: 216 ---SSYPYTAETGT-QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 pattern 237 **** S P T E T +CN S + ++ +++++ + +MA I GP+ Sbjct: 191 VNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAE-IYKNGPV 249 Query: 267 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 A ++ Y GV+ H I I+G+ +N + PYW+V NSW DW Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGV-----PYWLVANSWNVDW 304 Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSI 350 G+ G+ + RG+N CG+ + + I Sbjct: 305 GDNGFFKILRGENHCGIESEIVAGI 329 >sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR Length = 329 Score = 87.0 bits (212), Expect = 5e-17 Identities = 66/288 (22%), Positives = 117/288 (39%), Gaps = 38/288 (13%) Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140 +++ +E K ++ K A +D++ + + + S+P + W ++ +++Q Sbjct: 50 EITEEEMKFKLMDGKYAAAHSDEIRATE--QEVVLASVPATFDSRTQWSECKSIKLIRDQ 107 Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 CGSCW+F + + I +S +L+ C C +C GC GG Sbjct: 108 ATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSC---C-----GSSCGNGCEGGY 159 Query: 199 
QPNAYNY-----IIKNGGIQTESSYPYTAETGTQ----------CNFNSANIGPEEQAKI 243 pattern 237 **** A + ++ G PY T C+ + + AK Sbjct: 160 PIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKD 219 Query: 244 SNFTM----IPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILI 298 +F + +PKN + I + GP+ A E + Y GV+ H I I Sbjct: 220 KHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKI 279 Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 346 +G+ ++ PYW+V NSWG +WGE G+ + RG + CG+ + V Sbjct: 280 IGWGTES-----GSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAV 322 >sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP SECRETASE) Length = 339 Score = 86.2 bits (210), Expect = 9e-17 Identities = 68/285 (23%), Positives = 110/285 (37%), Gaps = 55/285 (19%) Query: 96 KEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155 + +FT+DL +P W + +++QG CGSCW+F + Sbjct: 70 QRVMFTEDL------------KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAI 117 Query: 156 EGQHFISQNKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213 + I N VS+ S ++L+ C C C +GCNGG A+N+ + G + Sbjct: 118 SDRICIHTNAHVSVEVSAEDLLTC---C-----GSMCGDGCNGGYPAEAWNFWTRKGLVS 169 Query: 214 ----------------------TESSYPYTAETGTQ-----CNFNSANIGPEEQAKISNF 246 pattern 237 **** S P T E T C + +++ N Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229 Query: 247 TMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN 305 + +E + I GP+ A ++ Y GV+ H I I+G+ +N Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVEN 289 Query: 306 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 PYW+V NSW DWG+ G+ + RG++ CG+ + V I Sbjct: 290 -----GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 329 >sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SJ31) Length = 342 Score = 85.4 bits (208), Expect = 2e-16 Identities = 64/271 (23%), Positives = 109/271 (39%), Gaps = 57/271 (21%) Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ--NKLVSLSEQNLV 175 IP + + W +++ +++Q +CGSCW+F + + I + LS +L+ Sbjct: 90 
IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLI 149 Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI---------------------QT 214 C C + C +GC GG A++Y +K G + T Sbjct: 150 SC---CKD------CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHT 200 Query: 215 ESSYP-------------YTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIV 261 pattern 237 **** + YP T + G + + +E + N NE V+ I+ Sbjct: 201 KGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN------NEKVIQRDIM 254 Query: 262 STGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320 GP+ A D E + Y G++ H I I+G+ + K PYW++ NS Sbjct: 255 MYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE-----KRTPYWLIANS 309 Query: 321 WGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 W DWGE+G + RG++ C + + V +I Sbjct: 310 WNEDWGEKGLFRMVRGRDECSIESDVVAGLI 340 >sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) Length = 340 Score = 85.4 bits (208), Expect = 2e-16 Identities = 66/265 (24%), Positives = 111/265 (40%), Gaps = 46/265 (17%) Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLV 175 +P T W ++ +++QG CGSCW+F + + + N VS+ S ++L+ Sbjct: 80 LPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139 Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------- 213 C C G E C GCNGG A+ Y + G + Sbjct: 140 SC---C----GFE-CGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHV 191 Query: 214 TESSYPYTAETGT--QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 pattern 237 **** S P T E G +C+ + S + ++ I+++ +P++E + I GP+ Sbjct: 192 NGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYG-VPRSEKEIMAEIYKNGPV 250 Query: 267 AIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 A E + Y GV+ H I I+G+ +N PYW+ NSW DW Sbjct: 251 EGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVEN-----GTPYWLAANSWNTDW 305 Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSI 350 G G+ + RG++ CG+ + + + Sbjct: 306 GITGFFKILRGEDHCGIESEIVAGV 330 >sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PRECURSOR Length = 379 Score = 85.0 bits (207), Expect = 2e-16 Identities = 71/265 (26%), Positives = 116/265 (42%), 
Gaps = 53/265 (20%) Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK--LVSLSEQNLV 175 IP + +W ++ +++Q CGSCW+F + + I+ + V+LS +L+ Sbjct: 105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164 Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ------CN 229 C C ++C GCNGG A+ Y +K+G I T S+Y TA G + C Sbjct: 165 SC---C------KSCGFGCNGGDPLAAWRYWVKDG-IVTGSNY--TANNGCKPYPFPPCE 212 Query: 230 FNSANIGPE------------EQAKISNFTMIPKNETVMAGY---------------IVS 262 pattern 237 ** ** +S + E+ +S++T +E G +++ Sbjct: 213 HHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMT 272 Query: 263 TGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSW 321 GPL IA + E + Y GGV+ H + ++G+ + I PYW V NSW Sbjct: 273 HGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGI-----PYWTVANSW 327 Query: 322 GADWGEQGYIYLRRGKNTCGVSNFV 346 DWGE G+ + RG + CG+ + V Sbjct: 328 NTDWGEDGFFRILRGVDECGIESGV 352 >sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SM31) Length = 340 Score = 84.6 bits (206), Expect = 3e-16 Identities = 64/260 (24%), Positives = 107/260 (40%), Gaps = 45/260 (17%) Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175 IP + W ++ +++Q +CGSCWSF + + I + V LS +L+ Sbjct: 89 IPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148 Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA---ETGTQCNFNS 232 C C E+C GC GG+ A++Y +K G + S +T +C ++ Sbjct: 149 TC---C------ESCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHT 199 Query: 233 ANIGPEEQAKISN---------------FTM----------IPKNETVMAGYIVSTGPLA 267 pattern 237 **** P +KI N +T + +E + I+ GP+ Sbjct: 200 KGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVE 259 Query: 268 IAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326 + E + Y G++ H I I+G+ +N PYW++ NSW DWG Sbjct: 260 ASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVEN-----KTPYWLIANSWNEDWG 314 Query: 327 EQGYIYLRRGKNTCGVSNFV 346 E GY + RG++ C + + V Sbjct: 315 ENGYFRIVRGRDECSIESEV 334 >sp|P10605|CATB_MOUSE 
CATHEPSIN B PRECURSOR (CATHEPSIN B1) Length = 339 Score = 84.6 bits (206), Expect = 3e-16 Identities = 66/253 (26%), Positives = 108/253 (42%), Gaps = 43/253 (16%) Query: 128 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLVDCDHECMEYE 185 W + +++QG CGSCW+F + + I N V++ S ++L+ C C Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTC---C---- 142 Query: 186 GEEACDEGCNGGLQPNAYNYIIK----NGGIQTE------------------SSYPYTAE 223 C +GCNGG A+++ K +GG+ S P T E Sbjct: 143 -GIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGE 201 Query: 224 TGT-QCNFN-SANIGPE-EQAKISNFTMIPKNETV--MAGYIVSTGPLAIAADAV-EWQF 277 pattern 237 ** ** T +CN + A P ++ K +T + +V + I GP+ A ++ Sbjct: 202 GDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLT 261 Query: 278 YIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK 337 Y GV+ H I I+G+ +N + PYW+ NSW DWG+ G+ + RG+ Sbjct: 262 YKSGVYKHEAGDMMGGHAIRILGWGVENGV-----PYWLAANSWNLDWGDNGFFKILRGE 316 Query: 338 NTCGVSNFVSTSI 350 N CG+ + + I Sbjct: 317 NHCGIESEIVAGI 329 >sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR Length = 341 Score = 79.6 bits (193), Expect = 9e-15 Identities = 63/270 (23%), Positives = 106/270 (38%), Gaps = 46/270 (17%) Query: 103 DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 162 D V D +E + IP W ++ + +Q CGSCW+ S+ + + I+ Sbjct: 76 DEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIA 135 Query: 163 QN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY-----IIKNGGIQTE 215 K V +S Q++V C C C +GC GG +A+ + ++ G T+ Sbjct: 136 SKGAKQVLISAQDVVSC---CTW------CGDGCEGGWPISAFRFHADEGVVTGGDYNTK 186 Query: 216 SSY-PYTAET----GTQCNFNSANIGPEEQAKISNFTMI------PKNETVMAGYIVSTG 264 pattern 237 **** S PY G + + +G + + ++ P + Y + Sbjct: 187 GSCRPYEIHPCGHHGNETYYGEC-VGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNS 245 Query: 265 PLAIAADAV-------------EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKN 311 AI D + ++ Y G++ + H + ++G+ + K Sbjct: 246 VKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEE-----KG 300 Query: 312 
MPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 PYWIV NSW DWGE G+ + RG N CG Sbjct: 301 TPYWIVANSWHDDWGENGFFRMHRGSNDCG 330 >sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PRECURSOR Length = 342 Score = 78.4 bits (190), Expect = 2e-14 Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 47/266 (17%) Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175 IPP W+ +++Q CGSCW+ ST + + I+ K V++S +++ Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145 Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ------TESSYPY--------- 220 C C C +GC GG A+ Y I +G + + PY Sbjct: 146 TC---C-----RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHG 197 Query: 221 -------------TAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 pattern 237 **** T +C + ++ + ++ ++ + I+ GP+ Sbjct: 198 NDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPV- 256 Query: 268 IAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 +A+ AV +++ Y G++ H + ++G+ +N N +W++ NSW DW Sbjct: 257 VASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDW 311 Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSII 351 GE+GY + RG N CG+ ++ I+ Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAGIV 337 >sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR Length = 342 Score = 77.6 bits (188), Expect = 4e-14 Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 47/266 (17%) Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175 IPP W+ +++Q CGSCW+ ST + + I+ K V++S +++ Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145 Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ------TESSYPY--------- 220 C C C +GC GG A+ Y I +G + + PY Sbjct: 146 TC---C-----RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHG 197 Query: 221 -------------TAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 pattern 237 **** T +C + ++ + ++ ++ + I+ GP+ Sbjct: 198 NDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPV- 256 Query: 268 
IAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 +A+ AV +++ Y G++ H + ++G+ +N N +W++ NSW DW Sbjct: 257 VASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDW 311 Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSII 351 GE+GY + RG N CG+ ++ I+ Sbjct: 312 GEKGYFRIIRGTNDCGIEGTIAAGIV 337 >sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PRECURSOR Length = 370 Score = 73.3 bits (177), Expect = 7e-13 Identities = 56/248 (22%), Positives = 98/248 (38%), Gaps = 39/248 (15%) Query: 128 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYE 185 W + ++NQ CGSCW+F + + I N +S ++++ C C Sbjct: 102 WPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSC---C---- 154 Query: 186 GEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------TESSYPYTAET 224 C GC GG A + +G + ES+ P + +T Sbjct: 155 -GTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTTP-SCKT 212 Query: 225 GTQCNFNSANIGPEEQAKISNFTMIP-KNETVMAGYIVSTGPLAIAADAVE-WQFYIGGV 282 pattern 237 **** Q ++ + ++ S + + K+ T + I GP+ + E + Y GV Sbjct: 213 TCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGV 272 Query: 283 FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGV 342 + H + I+G+ +N + YW++ NSWG +GE+G+ +RRG N C + Sbjct: 273 YHYTSGKLVGGHAVKIIGWGVENGV-----DYWLIANSWGTSFGEKGFFKIRRGTNECQI 327 Query: 343 SNFVSTSI 350 V I Sbjct: 328 EGNVVAGI 335 >sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P126) (111 KD ANTIGEN) Length = 989 Score = 70.2 bits (169), Expect = 6e-12 Identities = 63/247 (25%), Positives = 102/247 (40%), Gaps = 46/247 (18%) Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE--EACDEGC 194 V++QG C + W F++ ++E + + +S + +C Y+GE + CDEG Sbjct: 579 VEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANC------YKGEHKDRCDEGS 632 Query: 195 NGGLQPNAYNYIIKNGG-IQTESSYPYT-AETGTQC------------------NFNSAN 234 + P + II++ G + ES+YPY + G QC N N N Sbjct: 633 S----PMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPN 688 Query: 235 I----------GPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFD 284 pattern 237 
**** + F I K E + G +++ I A+ V + G Sbjct: 689 SLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAY----IKAENVMGYEFSGKKVQ 744 Query: 285 IPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 C ++ DH + IVGY + YWIV+NSWG WG++GY + T N Sbjct: 745 NLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFN 804 Query: 345 FVSTSII 351 F+ + +I Sbjct: 805 FIHSVVI 811 >sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) Length = 43 Score = 60.9 bits (145), Expect = 4e-09 Identities = 24/33 (72%), Positives = 27/33 (81%) Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 + DWR +GAVTPVKNQG CGSCW+FST VEG Sbjct: 4 SIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEG 36 >sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) Length = 43 Score = 59.7 bits (142), Expect = 9e-09 Identities = 24/33 (72%), Positives = 27/33 (81%) Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 + DWR +GAVTPVKNQG CGSCW+FST VEG Sbjct: 4 SIDWRKKGAVTPVKNQGSCGSCWAFSTIVTVEG 36 >sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 Length = 174 Score = 59.3 bits (141), Expect = 1e-08 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 15/103 (14%) Query: 249 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 308 I KN V+AG+IV ++ Y G++ + H + I+G+ + Sbjct: 87 IMKNGPVVAGFIVYE----------DFAHYKSGIYKHTAGRMTGGHAVKIIGWGKE---- 132 Query: 309 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 K PYW++ NSW DWGE+G+ + RG N C + V I+ Sbjct: 133 -KGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGIV 174 >sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) Length = 43 Score = 57.8 bits (137), Expect = 3e-08 Identities = 22/33 (66%), Positives = 27/33 (81%) Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 + DWR +GAVTPV+NQG CGSCW+FS+ VEG Sbjct: 4 SIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 36 >sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) Length = 43 Score = 56.2 bits (133), Expect = 1e-07 Identities = 22/31 (70%), Positives = 25/31 (79%) Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 DWR +GAVTPVK+Q CGSCW+FST VEG Sbjct: 6 DWRQKGAVTPVKDQNPCGSCWAFSTVATVEG 36 
>sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L Length = 42 Score = 51.9 bits (122), Expect = 2e-06 Identities = 20/39 (51%), Positives = 28/39 (71%), Gaps = 1/39 (2%) Query: 314 YWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 351 YWIVKNSWG WG++GYIY+ + KN CG++ S ++ Sbjct: 4 YWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 42 >sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR Length = 136 Score = 41.8 bits (96), Expect = 0.002 Identities = 31/101 (30%), Positives = 50/101 (48%), Gaps = 4/101 (3%) Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIA 66 L + + + S PP+ +++ E++ KF K Y+ E R +++ N KIE N Sbjct: 17 LLILCLGMMSAAPPPDPSLDNEWKEWKTKFAKAYNLNEERHRRLVWEENKKKIEAHNADY 76 Query: 67 INHKADTKFGVNKFADLSSDEFK-NYYLNN-KEAIFTDDLP 105 K G+N+F+DL+ +EFK N Y N+ DLP Sbjct: 77 EQGKTSFYMGLNQFSDLTPEEFKTNCYGNSLNRGEMAPDLP 117 >sp|P05689|CATX_BOVIN CATHEPSIN Length = 73 Score = 40.2 bits (92), Expect = 0.006 Identities = 15/40 (37%), Positives = 24/40 (59%), Gaps = 5/40 (12%) Query: 292 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331 ++H + + G+ + M YWIV+NSWG WGE G++ Sbjct: 9 INHIVSVAGWGVSD-----GMEYWIVRNSWGEPWGEHGWM 43 >sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR Length = 141 Score = 38.7 bits (88), Expect = 0.019 Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%) Query: 6 LFVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 +F+L + +S+ P P +++ E++ F K YS +E R +++ N KIE N Sbjct: 20 VFLLILCLGMMSAAPSPDPSLDNEWKEWKTTFAKAYSLDEERHRRLMWEENKKKIEAHNA 79 Query: 65 IAINHKADTKFGVNKFADLSSDEFK 89 K G+N+F+DL+ +EF+ Sbjct: 80 DYERGKTSFYMGLNQFSDLTPEEFR 104 >sp|P23897|HSER_RAT HEAT-STABLE ENTEROTOXIN RECEPTOR PRECURSOR (GC-C) (INTESTINAL GUANYLATE CYCLASE) (STA RECEPTOR) Length = 1072 Score = 35.6 bits (80), Expect = 0.16 Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 19/120 (15%) Query: 15 FVSSRGIPPEEQSQFLEFQDK----FNKKYSHEEYLERFEIFKSNL-GKIEELNLIAINH 69 +V G PE+ +L + F++ S ++ L R E F+ L G+ + N+I + Sbjct: 190 
YVYKNGSEPEDCFWYLNALEAGVSYFSEVLSFKDVLRRSEQFQEILMGRNRKSNVIVMCG 249 Query: 70 KADTKFGVN---KFAD----LSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122 +T + V K AD + D F N+Y F DD +Y+D+ + ++PPE+ Sbjct: 250 TPETFYNVKGDLKVADDTVVILVDLFSNHY-------FEDDTRAPEYMDNVLVLTLPPEK 302 >sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTECTIVE ANTIGEN) Length = 650 Score = 35.2 bits (79), Expect = 0.22 Identities = 24/81 (29%), Positives = 36/81 (43%), Gaps = 5/81 (6%) Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDC----DHECMEYEGEEACDEGCNGGLQPNAYNYI 206 TT N + KL + + + +C DHEC +++C E NG Q + + Sbjct: 533 TTCNPKEIQECQDKKLECVYKNHKAECECPDDHECYREPAKDSCSEEDNGKCQSSGQRCV 592 Query: 207 IKNG-GIQTESSYPYTAETGT 226 I+NG + E S TA T T Sbjct: 593 IENGKAVCKEKSEATTAATTT 613 >sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 INTERGENIC REGION Length = 396 Score = 32.0 bits (71), Expect = 1.9 Identities = 39/191 (20%), Positives = 77/191 (39%), Gaps = 39/191 (20%) Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE-------------- 122 VNKF D++++E + ++ + P+ADYL F + ++ Sbjct: 42 VNKFKDITNNESCTCEVGDRVWFSGKNAPLADYLSVHFRGPLKLKQFAFYTSPGFTVNNS 101 Query: 123 QTAFDW----------RTRGAVTPVKNQGQCGSCW-------SFSTTGNVEGQHFISQNK 165 +++ DW +T VT + + G+ C S + TG+ ++ Sbjct: 102 RSSSDWNRLAYYESSSKTADNVTFLNHGGEASPCLGNALSYASSNGTGSASEATVLADGT 161 Query: 166 LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETG 225 L+S ++ ++ + C + ++ C +G P Y Y GG T + + E Sbjct: 162 LISSDQEYIIYSNVSCPKSGYDKGCGVYRSG--IPAYYGY----GG--TTKMFLFEFEMP 213 Query: 226 TQCNFNSANIG 236 T+ NS++IG Sbjct: 214 TETEKNSSSIG 224 >sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5) Length = 239 Score = 32.0 bits (71), Expect = 1.9 Identities = 24/93 (25%), Positives = 36/93 (37%), Gaps = 7/93 (7%) Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNG 196 ++ G G C G V + + L + + N+V C C + ++ C G N Sbjct: 137 IRPSGGSGDC---KYAGCVSDLNAACPDMLKVMDQNNVVACKSACERFNTDQYCCRGAND 193 Query: 197 GLQ---PNAYNYIIKNGGIQTESSYPYTAETGT 226 
+ P Y+ I KN SY Y ET T Sbjct: 194 KPETCPPTDYSRIFKN-ACPDAYSYAYDDETST 225 >sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-DIRECTED RNA POLYMERASE ; THIOL PROTEASE 3C ; HELICASE (2C LIKE PROTEIN)] Length = 1699 Score = 31.3 bits (69), Expect = 3.2 Identities = 13/31 (41%), Positives = 21/31 (66%) Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLE 47 SS+G+ EE ++ +++ N KYS EEYL+ Sbjct: 893 SSKGLSDEEYDEYKRIREERNGKYSIEEYLQ 923 >sp|Q02521|SPP2_YEAST SPLICEOSOME MATURATION PROTEIN SPP2 Length = 185 Score = 30.9 bits (68), Expect = 4.2 Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 6/99 (6%) Query: 30 LEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKF---GVNKF-ADLSS 85 L+ K KK ++ ++ + K+NL ++ +++HK +K ++KF D S Sbjct: 6 LKLGSKTLKKNISKKTKKKNSLQKANLFDWDDAETASLSHKPQSKIKIQSIDKFDLDEES 65 Query: 86 DEFKNYYLNNKEAIFT--DDLPVADYLDDEFINSIPPEE 122 K + E T +D P+ +Y+ ++ N +P EE Sbjct: 66 SSKKKLVIKLSENADTKKNDAPLVEYVTEKEYNEVPVEE 104 >sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN Length = 512 Score = 30.9 bits (68), Expect = 4.2 Identities = 17/58 (29%), Positives = 29/58 (49%), Gaps = 9/58 (15%) Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 + +NLI + K+D L+ +E KN+ +E I D+PV + DE +N+ Sbjct: 237 KRVNLIPVIAKSDL---------LTKEELKNFKTQVREIIRVQDIPVCFFFGDEVLNA 285 >sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDROLASE) (BLM HYDROLASE) Length = 454 Score = 30.5 bits (67), Expect = 5.5 Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 11/66 (16%) Query: 111 DDEFINS--IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS 168 DD +N + ++ F+ TPV NQ G CW F+ T +Q +L Sbjct: 36 DDALLNKTRLQKQDNRVFNTVVSTDSTPVTNQKSSGRCWLFAAT---------NQLRLNV 86 Query: 169 LSEQNL 174 LSE NL Sbjct: 87 LSELNL 92 >sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 Length = 527 Score = 30.5 bits (67), Expect = 5.5 Identities = 21/52 (40%), Positives = 26/52 (49%), Gaps = 7/52 (13%) Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNN 95 +YL + I+K K +L L IN 
K T F LSS FKNYYL + Sbjct: 466 DYLAKNSIYKMKNLKFMDLFLNNINSKGYTLF-------LSSGMFKNYYLKS 510 >sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8 Length = 1427 Score = 30.1 bits (66), Expect = 7.2 Identities = 22/89 (24%), Positives = 44/89 (48%), Gaps = 10/89 (11%) Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--- 77 +PP + S F++ + Y EE ++ E F NLG + ++ I H+ + K+ + Sbjct: 1314 LPPFQVSSFVKETKLHSGDYGEEEDADQEESFSLNLG----IGIVEIAHENEQKWLIYDK 1369 Query: 78 --NKFADLSSDEFKNYYLNNKEAIFTDDL 104 +K+ S E ++++N +TDD+ Sbjct: 1370 KDHKYVCTFSME-PYHFISNYNTKYTDDM 1397 >sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C Length = 436 Score = 30.1 bits (66), Expect = 7.2 Identities = 11/20 (55%), Positives = 14/20 (70%) Query: 311 NMPYWIVKNSWGADWGEQGY 330 N W V+NSWG D G++GY Sbjct: 370 NSTKWKVENSWGKDAGQKGY 389 >sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) Length = 455 Score = 29.7 bits (65), Expect = 9.4 Identities = 10/17 (58%), Positives = 13/17 (75%) Query: 315 WIVKNSWGADWGEQGYI 331 W V+NSWG D G +GY+ Sbjct: 392 WRVENSWGEDHGHKGYL 408 >sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (AMINOPEPTIDASE H) Length = 455 Score = 29.7 bits (65), Expect = 9.4 Identities = 10/19 (52%), Positives = 14/19 (73%) Query: 315 WIVKNSWGADWGEQGYIYL 333 W V+NSWG D G +GY+ + Sbjct: 392 WRVENSWGEDRGNKGYLIM 410 >sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) Length = 454 Score = 29.7 bits (65), Expect = 9.4 Identities = 10/17 (58%), Positives = 13/17 (75%) Query: 315 WIVKNSWGADWGEQGYI 331 W V+NSWG D G +GY+ Sbjct: 392 WRVENSWGEDHGHKGYL 408 Database: /home/peter/blast/data/swissprot Posted date: Oct 10, 2000 10:43 AM Number of letters in database: 31,984,247 Number of sequences in database: 88,780 Lambda K H 0.317 0.136 0.414 Lambda K H 0.270 0.0477 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 23348054 Number of Sequences: 88780 Number of extensions: 1039466 Number of successful extensions: 3135 Number of sequences 
better than 10.0: 162 Number of HSP's better than 10.0 without gapping: 118 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 2557 Number of HSP's gapped (non-prelim): 148 length of query: 351 length of database: 31,984,247 effective HSP length: 50 effective length of query: 301 effective length of database: 27,545,247 effective search space: 8291119347 effective search space used: 8291119347 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.6 bits) S2: 65 (29.7 bits)
LOCUS ECO110K 111408 bp DNA linear BCT 29-MAY-2002 DEFINITION Escherichia coli genomic DNA. (0.0 - 2.4 min). ACCESSION D10483 J01597 J01683 J01706 K01298 K01990 M10420 M10611 M12544 V00259 X04711 X54847 X54945 X55034 X56742 VERSION D10483.2 GI:21321891 KEYWORDS Complete and shotgun sequencing; thrA; thrA1; thrA2; thrB; thrC; yaaA; yaaJ; talB; mog; chlG; dnaK; dnaJ; groP; gef; nhaA; ant; rpsT; yaaC; ileS; lspA; yaaD; lytB; yaaF; dapB; carA; pyrA; carB; yaaV; caiE; caiD; caiC; caiB; caiA; caiT; fixA; fixC; yaaT; yabE; yabF; kefC; trkC; folA; tmrA; apaH; apaG; pdxA; surA; imp; ostA; yabH; yabP; yabQ; hepA; polB; dinA; araD; araA; araB; araC; yabI; yabJ; yabK; tbpA; yabN; yabM; leuD; leuC; leuB; leuA; leuLP; lueO; ilvI; ilvH; brnP; shl; fruR; yabB; yabC; ftsL; pbpB; ftsI; murE; murF; mra; mraY; murX; murD; ftsW; murG; murC; ddl; ddlB; ftsQ; ftsA; divA; ftsZ; sfiB; sulB; lpxC; envA; asmB; yacA; secA; prlD; azi; pea; mutT; yacG; htgA; htpY; yaaI; nhaR; antO; caiF; fixB; yaaU; yabO. SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; Enterobacteriaceae; Escherichia. REFERENCE 1 AUTHORS Ohtsubo,H. and Ohtsubo,E. TITLE Nucleotide sequence of an insertion element, IS1 JOURNAL Proc. Natl. Acad. Sci. U.S.A.
75 (2), 615-619 (1978) MEDLINE 78137003 PUBMED 273224 REFERENCE 2 AUTHORS Smith,B.R. and Schleif,R. TITLE Nucleotide sequence of the L-arabinose regulatory region of Escherichia coli K12 JOURNAL J. Biol. Chem. 253 (19), 6931-6933 (1978) MEDLINE 79005683 PUBMED 357433 REFERENCE 3 AUTHORS Greenfield,L., Boone,T. and Wilcox,G. TITLE DNA sequence of the araBAD promoter in Escherichia coli B/r JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75 (10), 4724-4728 (1978) MEDLINE 79116194 PUBMED 368797 REFERENCE 4 AUTHORS Johnsrud,L. TITLE DNA sequence of the transposable element IS1 JOURNAL Mol. Gen. Genet. 169 (2), 213-218 (1979) MEDLINE 79177885 PUBMED 375010 REFERENCE 5 AUTHORS Smith,D.R. and Calvo,J.M. TITLE Nucleotide sequence of the E coli gene coding for dihydrofolate reductase JOURNAL Nucleic Acids Res. 8 (10), 2255-2274 (1980) MEDLINE 81053692 PUBMED 6159575 REFERENCE 6 AUTHORS Ogden,S., Haggerty,D., Stoner,C.M., Kolodrubetz,D. and Schleif,R. TITLE The Escherichia coli L-arabinose operon: binding sites of the regulatory proteins and a mechanism of positive and negative regulation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (6), 3346-3350 (1980) MEDLINE 81013881 PUBMED 6251457 REFERENCE 7 AUTHORS Katinka,M., Cossart,P., Sibilli,L., Saint-Girons,I., Chalvignac,M.A., Le Bras,G., Cohen,G.N. and Yaniv,M. TITLE Nucleotide sequence of the thrA gene of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (10), 5730-5733 (1980) MEDLINE 81077247 PUBMED 7003595 REFERENCE 8 AUTHORS Miyada,C.G., Horwitz,A.H., Cass,L.G., Timko,J. and Wilcox,G. TITLE DNA sequence of the araC regulatory gene from Escherichia coli B/r JOURNAL Nucleic Acids Res. 8 (22), 5267-5274 (1980) MEDLINE 81124262 PUBMED 7008027 REFERENCE 9 AUTHORS Cossart,P., Katinka,M. and Yaniv,M. TITLE Nucleotide sequence of the thrB gene of E. coli, and its two adjacent regions; the thrAB and thrBC junctions JOURNAL Nucleic Acids Res. 9 (2), 339-347 (1981) MEDLINE 81150470 PUBMED 6259626 REFERENCE 10 AUTHORS Lee,N.L., Gielow,W.O. 
and Wallace,R.G. TITLE Mechanism of araC autoregulation and the domains of two overlapping promoters, Pc and PBAD, in the L-arabinose regulatory region of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78 (2), 752-756 (1981) MEDLINE 81199399 PUBMED 6262769 REFERENCE 11 AUTHORS Mackie,G.A. TITLE Nucleotide sequence of the gene for ribosomal protein S20 and its flanking regions JOURNAL J. Biol. Chem. 256 (15), 8177-8182 (1981) MEDLINE 81264207 PUBMED 6267039 REFERENCE 12 AUTHORS Stoner,C.M. and Schleif,R. TITLE Is the amino acid but not the nucleotide sequence of the Escherichia coli araC gene conserved? JOURNAL J. Mol. Biol. 154 (4), 649-652 (1982) MEDLINE 82216830 PUBMED 6283093 REFERENCE 13 AUTHORS Gilson,E., Nikaido,H. and Hofnung,M. TITLE Sequence of the malK gene in E.coli K12 JOURNAL Nucleic Acids Res. 10 (22), 7449-7458 (1982) MEDLINE 83116968 PUBMED 6296778 REFERENCE 14 AUTHORS Parsot,C., Cossart,P., Saint-Girons,I. and Cohen,G.N. TITLE Nucleotide sequence of thrC and of the transcription termination region of the threonine operon in Escherichia coli K12 JOURNAL Nucleic Acids Res. 11 (21), 7331-7345 (1983) MEDLINE 84069770 PUBMED 6316258 REFERENCE 15 AUTHORS Bardwell,J.C. and Craig,E.A. TITLE Major heat shock gene of Drosophila and the Escherichia coli heat-inducible dnaK gene are homologous JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (3), 848-852 (1984) MEDLINE 84144800 PUBMED 6322174 REFERENCE 16 AUTHORS Innis,M.A., Tokunaga,M., Williams,M.E., Loranger,J.M., Chang,S.Y., Chang,S. and Wu,H.C. TITLE Nucleotide sequence of the Escherichia coli prolipoprotein signal peptidase (lsp) gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (12), 3708-3712 (1984) MEDLINE 84222028 PUBMED 6374664 REFERENCE 17 AUTHORS Bouvier,J., Patte,J.C. and Stragier,P. TITLE Multiple regulatory signals in the control region of the Escherichia coli carAB operon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
81 (13), 4139-4143 (1984) MEDLINE 84248073 PUBMED 6377309 REFERENCE 18 AUTHORS Bouvier,J., Richaud,C., Richaud,F., Patte,J.C. and Stragier,P. TITLE Nucleotide sequence and expression of the Escherichia coli dapB gene JOURNAL J. Biol. Chem. 259 (23), 14829-14834 (1984) MEDLINE 85054974 PUBMED 6094578 REFERENCE 19 AUTHORS Kamio,Y., Lin,C.K., Regue,M. and Wu,H.C. TITLE Characterization of the ileS-lsp operon in Escherichia coli. Identification of an open reading frame upstream of the ileS gene and potential promoter(s) for the ileS-lsp operon JOURNAL J. Biol. Chem. 260 (9), 5616-5620 (1985) MEDLINE 85182715 PUBMED 2985604 REFERENCE 20 AUTHORS Friedberg,D., Rosenthal,E.R., Jones,J.W. and Calvo,J.M. TITLE Characterization of the 3' end of the leucine operon of Salmonella typhimurium JOURNAL Mol. Gen. Genet. 199 (3), 486-494 (1985) MEDLINE 85295470 PUBMED 2993799 REFERENCE 21 AUTHORS Cowing,D.W., Bardwell,J.C., Craig,E.A., Woolford,C., Hendrix,R.W. and Gross,C.A. TITLE Consensus sequence for Escherichia coli heat shock gene promoters JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (9), 2679-2683 (1985) MEDLINE 85190560 PUBMED 3887408 REFERENCE 22 AUTHORS Chong,P., Hui,I., Loo,T. and Gillam,S. TITLE Structural analysis of a new GC-specific insertion element IS186 JOURNAL FEBS Lett. 192 (1), 47-52 (1985) MEDLINE 86030702 PUBMED 2996940 REFERENCE 23 AUTHORS Birnbaum,M.J., Haspel,H.C. and Rosen,O.M. TITLE Cloning and characterization of a cDNA encoding the rat brain glucose-transporter protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (16), 5784-5788 (1986) MEDLINE 86287338 PUBMED 3016720 REFERENCE 24 AUTHORS Sekiguchi,T., Ortega-Cesena,J., Nosoh,Y., Ohashi,S., Tsuda,K. and Kanaya,S. TITLE DNA and amino-acid sequences of 3-isopropylmalate dehydrogenase of Bacillus coagulans. Comparison with the enzymes of Saccharomyces cerevisiae and Thermus thermophilus JOURNAL Biochim. Biophys. Acta 867, 36-44 (1986) REFERENCE 25 AUTHORS Ohki,M., Tamura,F., Nishimura,S. and Uchida,H. 
TITLE Nucleotide sequence of the Escherichia coli dnaJ gene and purification of the gene product JOURNAL J. Biol. Chem. 261 (4), 1778-1781 (1986) MEDLINE 86111849 PUBMED 3003084 REFERENCE 26 AUTHORS Mackie,G.A. TITLE Structure of the DNA distal to the gene for ribosomal protein S20 in Escherichia coli K12: presence of a strong terminator and an IS1 element JOURNAL Nucleic Acids Res. 14 (17), 6965-6981 (1986) MEDLINE 87016337 PUBMED 2429258 REFERENCE 27 AUTHORS Blanchin-Roland,S., Blanquet,S., Schmitter,J.M. and Fayat,G. TITLE The gene for Escherichia coli diadenosine tetraphosphatase is located immediately clockwise to folA and forms an operon with ksgA JOURNAL Mol. Gen. Genet. 205 (3), 515-522 (1986) MEDLINE 87172305 PUBMED 3031429 REFERENCE 28 AUTHORS Lee,N., Gielow,W., Martin,R., Hamilton,E. and Fowler,A. TITLE The organization of the araBAD operon of Escherichia coli JOURNAL Gene 47 (2-3), 231-244 (1986) MEDLINE 87163495 PUBMED 3549454 REFERENCE 29 AUTHORS Gronger,P., Manian,S.S., Reilander,H., O'Connell,M., Priefer,U.B. and Puhler,A. TITLE Organization and partial sequence of a DNA region of the Rhizobium leguminosarum symbiotic plasmid pRL6JI containing the genes fixABC, nifA, nifB and a novel open reading frame JOURNAL Nucleic Acids Res. 15 (1), 31-49 (1987) MEDLINE 87146339 PUBMED 3029674 REFERENCE 30 AUTHORS Nonet,M.L., Marvel,C.C. and Tolan,D.R. TITLE The hisT-purF region of the Escherichia coli K-12 chromosome. Identification of additional genes of the hisT and purF operons JOURNAL J. Biol. Chem. 262 (25), 12209-12217 (1987) MEDLINE 87308226 PUBMED 3040734 REFERENCE 31 AUTHORS Karpel,R., Olami,Y., Taglicht,D., Schuldiner,S. and Padan,E. TITLE Sequencing of the gene ant which affects the Na+/H+ antiporter activity in Escherichia coli JOURNAL J. Biol. Chem. 263 (21), 10408-10414 (1988) MEDLINE 88273145 PUBMED 2839489 REFERENCE 32 AUTHORS Jaiswal,A.K., McBride,O.W., Adesnik,M. and Nebert,D.W. 
TITLE Human dioxin-inducible cytosolic NAD(P)H:menadione oxidoreductase. cDNA sequence and localization of gene to chromosome 16 JOURNAL J. Biol. Chem. 263 (27), 13572-13578 (1988) MEDLINE 88330879 PUBMED 2843525 REFERENCE 33 AUTHORS Lozoya,E., Hoffmann,H., Douglas,C., Schulz,W., Scheel,D. and Hahlbrock,K. TITLE Primary structures and catalytic properties of isoenzymes encoded by the two 4-coumarate: CoA ligase genes in parsley JOURNAL Eur. J. Biochem. 176 (3), 661-667 (1988) MEDLINE 89005119 PUBMED 3169018 REFERENCE 34 AUTHORS Sato,S., Nakada,Y. and Shiratsuchi,A. TITLE IS421, a new insertion sequence in Escherichia coli JOURNAL FEBS Lett. 249 (1), 21-26 (1989) MEDLINE 89252065 PUBMED 2542093 REFERENCE 35 AUTHORS Roa,B.B., Connolly,D.M. and Winkler,M.E. TITLE Overlap between pdxA and ksgA in the complex pdxA-ksgA-apaG-apaH operon of Escherichia coli K-12 JOURNAL J. Bacteriol. 171 (9), 4767-4777 (1989) MEDLINE 89359108 PUBMED 2670894 REFERENCE 36 AUTHORS Matsubara,Y., Indo,Y., Naito,E., Ozasa,H., Glassberg,R., Vockley,J., Ikeda,Y., Kraus,J. and Tanaka,K. TITLE Molecular cloning and nucleotide sequence of cDNAs encoding the precursors of rat long chain acyl-coenzyme A, short chain acyl-coenzyme A, and isovaleryl-coenzyme A dehydrogenases. Sequence homology of four enzymes of the acyl-CoA dehydrogenase family JOURNAL J. Biol. Chem. 264 (27), 16321-16331 (1989) MEDLINE 89380240 PUBMED 2777793 REFERENCE 37 AUTHORS Minami-Ishii,N., Taketani,S., Osumi,T. and Hashimoto,T. TITLE Molecular cloning and sequence analysis of the cDNA for rat mitochondrial enoyl-CoA hydratase. Structural and evolutionary relationships linked to the bifunctional enzyme of the peroxisomal beta-oxidation system JOURNAL Eur. J. Biochem. 185 (1), 73-78 (1989) MEDLINE 90032688 PUBMED 2806264 REFERENCE 38 AUTHORS Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R.E. TITLE Nucleotide sequence and deletion analysis of the polB gene of E. coli JOURNAL DNA Cell Biol. 
9, 613-635 (1990) REFERENCE 39 AUTHORS Angerer,A., Gaisser,S. and Braun,V. TITLE Nucleotide sequences of the sfuA, sfuB, and sfuC genes of Serratia marcescens suggest a periplasmic-binding-protein-dependent iron transport mechanism JOURNAL J. Bacteriol. 172 (2), 572-578 (1990) MEDLINE 90130288 PUBMED 2404942 REFERENCE 40 AUTHORS Ricca,E. and Calvo,J.M. TITLE The nucleotide sequence of leuA from Salmonella typhimurium JOURNAL Nucleic Acids Res. 18 (5), 1290 (1990) MEDLINE 90206796 PUBMED 2181403 REFERENCE 41 AUTHORS Schaaff,I., Hohmann,S. and Zimmermann,F.K. TITLE Molecular analysis of the structural gene for yeast transaldolase JOURNAL Eur. J. Biochem. 188 (3), 597-603 (1990) MEDLINE 90235831 PUBMED 2185015 REFERENCE 42 AUTHORS Rosenthal,E.R. and Calvo,J.M. TITLE The nucleotide sequence of leuC from Salmonella typhimurium JOURNAL Nucleic Acids Res. 18 (10), 3072 (1990) MEDLINE 90272436 PUBMED 2190189 REFERENCE 43 AUTHORS Chen,H., Lawrence,C.B., Bryan,S.K. and Moses,R.E. TITLE Aphidicolin inhibits DNA polymerase II of Escherichia coli, an alpha-like DNA polymerase JOURNAL Nucleic Acids Res. 18 (23), 7185-7186 (1990) MEDLINE 91088346 PUBMED 2124684 REFERENCE 44 AUTHORS Mallonee,D.H., White,W.B. and Hylemon,P.B. TITLE Cloning and sequencing of a bile acid-inducible operon from Eubacterium sp. strain VPI 12708 JOURNAL J. Bacteriol. 172 (12), 7011-7019 (1990) MEDLINE 91072253 PUBMED 2254270 REFERENCE 45 AUTHORS Bouvier,J. and Stragier,P. TITLE Nucleotide sequence of the lsp-dapB interval in Escherichia coli JOURNAL Nucleic Acids Res. 19 (1), 180 (1991) MEDLINE 91187617 PUBMED 2011499 REFERENCE 46 AUTHORS Umeda,M. and Ohtsubo,E. TITLE Four types of IS1 with differences in nucleotide sequence reside in the Escherichia coli K-12 chromosome JOURNAL Gene 98 (1), 1-5 (1991) MEDLINE 91192599 PUBMED 1849492 REFERENCE 47 AUTHORS Arigoni,F., Kaminski,P.A., Hennecke,H. and Elmerich,C. 
TITLE Nucleotide sequence of the fixABC region of Azorhizobium caulinodans ORS571: similarity of the fixB product with eukaryotic flavoproteins, characterization of fixX, and identification of nifW JOURNAL Mol. Gen. Genet. 225 (3), 514-520 (1991) MEDLINE 91203829 PUBMED 1850088 REFERENCE 48 AUTHORS Munro,A.W., Ritchie,G.Y., Lamb,A.J., Douglas,R.M. and Booth,I.R. TITLE The cloning and DNA sequence of the gene for the glutathione-regulated potassium-efflux system KefC of Escherichia coli JOURNAL Mol. Microbiol. 5 (3), 607-616 (1991) MEDLINE 91260444 PUBMED 2046548 REFERENCE 49 AUTHORS Poulsen,L.K., Refn,A., Molin,S. and Andersson,P. TITLE Topographic analysis of the toxic Gef protein from Escherichia coli JOURNAL Mol. Microbiol. 5 (7), 1627-1637 (1991) MEDLINE 92048480 PUBMED 1943700 REFERENCE 50 AUTHORS Poulsen,L.K., Refn,A., Molin,S. and Andersson,P. TITLE The gef gene from Escherichia coli is regulated at the level of translation JOURNAL Mol. Microbiol. 5 (7), 1639-1648 (1991) MEDLINE 92048481 PUBMED 1943701 REFERENCE 51 (bases 1 to 111401) AUTHORS Yura,T., Mori,H., Nagai,H., Nagata,T., Ishihama,A., Fujita,N., Isono,K., Mizobuchi,K. and Nakata,A. TITLE Systematic sequencing of the Escherichia coli genome: analysis of the 0-2.4 min region JOURNAL Nucleic Acids Res. 20 (13), 3305-3308 (1992) MEDLINE 92334977 PUBMED 1630901 REFERENCE 52 AUTHORS Fujita,N., Mori,H., Yura,T. and Ishihama,A. TITLE Systematic sequencing of the Escherichia coli genome: analysis of the 2.4-4.1 min (110,917-193,643 bp) region JOURNAL Nucleic Acids Res. 22 (9), 1637-1639 (1994) MEDLINE 94261430 PUBMED 8202364 REFERENCE 53 AUTHORS Stragier,P. JOURNAL Unpublished REFERENCE 54 AUTHORS Ishino,Y. JOURNAL Unpublished REFERENCE 55 AUTHORS Ayala,J.A. TITLE Regulation of transcription at 2-minute region of the genetic map of Escherichia coli JOURNAL Unpublished REFERENCE 56 (bases 1 to 111408) AUTHORS Mori,H. 
TITLE Direct Submission JOURNAL Submitted (18-FEB-1992) Hirotada Mori, Institute for Virus Research, Kyoto University, Genetics and Molecular Biology; 53 Shogoin Kawara-Machi, Sakyo-ku, Kyoto 606, Japan (E-mail:e52985@sakura.kudpc.kyoto-u.ac.jp, Tel:075-751-4042, Fax:075-761-5626) COMMENT On or before Jun 4, 2002 this sequence version replaced gi:145461, gi:147757, gi:147977, gi:145773, gi:146668, gi:145775, gi:145709, gi:145770, gi:40949, gi:40916, gi:42462, gi:41931, gi:40841, gi:41874, gi:216434. Submitted (18-FEB-1992) to DDBJ by: Hirotada Mori Institute for Virus Research Kyoto University Kawahara-cho Shogoin Sakyo-ku Kyoto 606 Phone: 075-751-4042 Fax: 075-761-5626 Sequence updated (21-DEC-1992) by: Hirotada Mori. FEATURES Location/Qualifiers source 1..111408 /organism="Escherichia coli" /strain="K-12" /db_xref="taxon:562" /map="0-2.4 min" /note="Nucleotide position 1-111408 from the initiation site of ThrA (0 min.)." gene 1..2463 /gene="thrA" /note="synonyms: thrA1, thrA2" CDS 1..2463 /gene="thrA" /note="ORF_ID:o101#1 similar to PIR Accession Number A00671" /codon_start=1 /transl_table=11 /product="ThrA bifunctional enzyme" /protein_id="BAB96579.1" /db_xref="GI:21321892" /translation="MRVLKFGGTSLANAERFLRVADILESNARQGQVATVLSAPAKIT NHLVAMIEKTISGQDALPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIK HVLHGISLLGQCPDSINAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHY LESTVDIAESTRRIAASRIPADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACL RADCCEIWTDVNGVYTCDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQF QIPCLIKNTGNPQAPGTLIGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAAR VFAAMSRARISVVLITQSSSEYSISFCVPQSDCVRAERAMLEEFYLELKEGLLEPLAV AERLAIISVVGDGLRTLRGISAKFFAALARANINIVAIAQGSSERSISVVVNNDDATT GVRVTHQMLFNTDQVIEVFVIGVGGVGGALLEQLKRQQSWLKNKHIDLRVCGVANSKA LLTNVHGLNLENWQEELAQAKEPFNLGRLIRLVKEYHLLNPVIVNCTSSQAVADQYAD FLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSRRKFLYDINVGAGLPVIENLQNLLN AGDELMKFSGILSGSLSYIFGKLDEGMSFSEATRLAREMGYTEPDPRDDLSGMDVARK LLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMANLSQLDDLFAARVAKARDEG 
KVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAFYSHYYQPLPLVLRGYGAG NDVTAAGVFADLLRTLSWKLGV" gene 2465..3394 /gene="thrB" CDS 2465..3394 /gene="thrB" /note="ORF_ID:o101#2 similar to PIR Accession Number A00658" /codon_start=1 /transl_table=11 /product="Homoserine kinase (EC 2.7.1.39)" /protein_id="BAB96580.1" /db_xref="GI:21321893" /translation="MVKVYAPASSANMSVGFDVLGAAVTPVDGALLGDVVTVEAAQTF SLNNLGRFADKLPSEPRENIVYQCWERFCQELGKQIPVAMTLEKNMPIGSGLGSSACS VVAALMAMNEHCGKPLNDTRLLALMGELEGRISGSIHYDNVAPCFLGGMQLMIEENDI ISQQVQGLMSGCGCSRIRGLKSRRQKQGYLPAQYRRQDCIAHGRHLAGFIHACYSRQP ELAAKLMKDVIAEPYRERLLPGFRQARQAVAEIGAVASGISGSGPTLFALCDKPETAQ RVADWLGKNYLQNQEGFVHICRLDTAGARVLEN" gene 3395..4681 /gene="thrC" CDS 3395..4681 /gene="thrC" /note="ORF_ID:o101#3 similar to PIR Accession Number A01157" /codon_start=1 /transl_table=11 /product="Threonine synthase (EC 4.2.99.2)" /protein_id="BAB96581.1" /db_xref="GI:21321894" /translation="MKLYNLKDHNEQVSFAQAVTQGLGKNQGLFFPHDLPEFSLTEID EMLKLDFVTRSAKILSAFIGDEIPQEILEERVRAAFAFPAPVANVESDVGCLELFHGP TLAFKDFGGRFMAQMLTHIAGDKPVTILTATSGDTGAAVAHAFYGLPNVKVVILYPRG KISPLQEKLFCTLGGNIETVAIDGDFDACQALVKQAFDDEELKVALGLNSANSINISR LLAQICYYFEAVAQLPQETRNQLVVSVPSGNFGDLTAGLLAKSLGLPVKRFIAATNVN DTVPRFLHDGQWSPKATQATLSNAMDVSQPNNWPRVEELFRRKIWQLKELGYAAVDDE TTQQTMRELKELGYTSEPHAAVAYRALRDQLNPGEYGLFLGTAHPAKFKESVEAILGE TLDLPKELAERADLPLLSHNLPADFAALRKLMMNHQ" CDS 4895..5191 /note="ORF_ID:o101#4" /codon_start=1 /transl_table=11 /protein_id="BAB96582.1" /db_xref="GI:21321895" /translation="MKKMQSIVLALSLVLVAPMAAQAAEITLVPSVKLQIGDRDNRGY YWDGGHWRDHGWWKQHYEWRGNRWHLHGPPPPPRHHKKAPHDHHGGHGPGKHHR" CDS complement(4971..5402) /note="ORF_ID:o101#5" /codon_start=1 /transl_table=11 /protein_id="BAB96583.1" /db_xref="GI:21321896" /translation="MMKIPPAMANWCLNATSSVKFAVNAGCGVNALSCLQKHADSIYC RINVGLIRRASVASGAECRIVTRHLSFSGDVCQDHDRRDDHEELSYGGAAVAAVRVGA SDCLAIHNVVSTSRGRASDLHPSNSHDYHDRLSVILLTGLT" gene complement(5344..6120) /gene="yaaA" CDS complement(5344..6120) /gene="yaaA" /note="ORF_ID:o101#6 similar to PIR Accession 
Number S40534" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96584.1" /db_xref="GI:21321897" /translation="MLILISPAKTLDYQSPLTTTRYTLPELLDNSQQLIHEARKLTPP QISTLMRISDKLAGINAARFHDWQPDFTPANARQAILAFKGDVYTGLQAETFSEDDFD FAQQHLRMLSGLYGVLRPLDLMQPYRLEMGIRLENARGKDLYQFWGDIITNKLNEALA AQGDNVVINLASDEYFKSVKPKKLNAEIIKPVFLDEKNGKFKIISFYAKKARGLMSRF IIENRLTKPEQLTGFNSEGYFFDEDSSSNGELVFKRYEQR" gene complement(6190..7620) /gene="yaaJ" CDS complement(6190..7620) /gene="yaaJ" /note="ORF_ID:o101#7 similar to SwissProt Accession Number P30143" /codon_start=1 /transl_table=11 /product="Hypothetical 51.7 kd protein in thrC-talB intergenic region (ORF8)." /protein_id="BAB96585.1" /db_xref="GI:21321898" /translation="MPDFFSFINSVLWGSVMIYLLFGAGCWFTFRTGFVQFRYIRQFG KSLKNSIHPQPGGLTSFQSLCTSLAARVGSGNLAGVALAITAGGPGAVFWMWVAAFIG MATSFAECSLAQLYKERDVNGQFRGGPAWYMARGLGMRWMGVLFAVFLLIAYGIIFSG VQANAVARALSFSFDFPPLVTGIILAVFTLLAITRGLHGVARLMQGFVPLMAIIWVLT SLVICVMNIGQLPHVIWSIFESAFGWQEAAGGAAGYTLSQAITNGFQRSMFSNEAGMG STPNAAAAAASWPPHPAAQGIVQMIGIFIDTLVICTASAMLILLAGNGTTYMPLEGIQ LIQKAMRVLMGSWGAEFVTLVVILFAFSSIVANYIYAENNLFFLRLNNPKAIWCLRIC TFATVIGGTLLSLPLMWQLADIIMACMAITNLTAILLLSPVVHTIASDYLRQRKLGVR PVFDPLRYPDIGRQLSPDAWDDVSQE" gene 7899..8852 /gene="talB" CDS 7899..8852 /gene="talB" /note="ORF_ID:o101#8 similar to PIR Accession Number S40535" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96586.1" /db_xref="GI:21321899" /translation="MTDKLTSLRQYTTVVADTGDIAAMKLYQPQDATTNPSLILNAAQ IPEYRKLIDDAVAWAKQQSNDRAQQIVDATDKLAVNIGLEILKLVPGRISTEVDARLS YDTEASIAKAKRLIKLYNDAGISNDRILIKLASTWQGIRAAEQLEKEGINCNLTLLFS FAQARACAEAGVFLISPFVGRILDWYKANTDKKEYAPAEDPGVVSVSEIYQYYKEHGY ETVVMGASFRNIGEILELAGCDRLTIAPALLKELAESEGAIERKLSYTGEVKARPARI TESEFLWQHNQDPMAVDKLAEGIRKFAIDQEKLEKMIGDLL" gene 9136..9552 /gene="chlG" /note="synonym: mog" CDS 9136..9552 /gene="chlG" /note="ORF_ID:o102#1 similar to PIR Accession Number B56688" /codon_start=1 /transl_table=11 /product="Molybdopterin biosynthesis Mog 
protein." /protein_id="BAB96587.1" /db_xref="GI:21321900" /translation="MCELVDEMSCHLVLTTGGTGPARRDVTPDATLAVADREMPGFGE QMRQISLHFVPTAILSRQVGVIRKQALILNLPGQPKSIKETLEGVKDAEGNVVVHGIF ASVPYCIQLLEGPYVETAPEVVAAFRPKSARRDVSE" CDS complement(9584..10150) /note="ORF_ID:o102#2 similar to PIR Accession Number E56688" /codon_start=1 /transl_table=11 /product="hgtA 5'-region hypothetical protein 1" /protein_id="BAB96588.1" /db_xref="GI:21321901" /translation="MGNTKLANPAPLGLMGFGMTTILLNLHNVGYFALDGIILAMGIF YGGIAQIFAGLLEYKKGNTFGLTAFTSYGSFWLTLVAILLMPKLGLTDAPNAQFLGVY LGLWGVFTVFMFFGTLKGARVLQFVFFSLTVLFALLAIGNIAGNAAIIHFAGWIGLIC GASAIYLAMGEVLNEQFGRTVLPIGESH" gene 10479..10963 /gene="htgA" /note="synonym: htpY" misc_feature 10479..10963 /gene="htgA" /product="Heat shock protein Y" /note="ORF is interrupted by frame-shifting ORF_ID:o102#3 similar to PIR Accession Number A40623" gene complement(11030..11435) /gene="yaaI" misc_feature complement(11030..11435) /gene="yaaI" /product="dnaK 5'-region hypothetical protein 1" /note="ORF is interrupted by frame-shifting ORF_ID:o102#4 similar to PIR Accession Number C56688" gene 11812..13728 /gene="dnaK" CDS 11812..13728 /gene="dnaK" /note="ORF_ID:o102#5 similar to PIR Accession Number A03311" /codon_start=1 /transl_table=11 /product="DnaK protein" /protein_id="BAB96589.1" /db_xref="GI:21321902" /translation="MGKIIGIDLGTTNSCVAIMDGTTPRVLENAEGDRTTPSIIAYTQ DGETLVGQPAKRQAVTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPFKIIAADNGDAW VEVKGQKMAPPQISAEVLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRI AGLEVKRIINEPTAAALAYGLDKGTGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEV LATNGDTHLGGEDFDSRLINYLVEEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA QQTDVNLPYITADATGPKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQDAGLSVSDI DDVILVGGQTRMPMVQKKVAEFFGKEPRKDVNPDEAVAIGAAVQGGVLTGDVKDVLLL DVTPLSLGIETMGGVMTTLIAKNTTIPTKHSQVFSTAEDNQSAVTIHVLQGERKRAAD NKSLGQFNLDGINPAPRGMPQIEVTFDIDADGILHVSAKDKNSGKEQKITIKASSGLN EDEIQKMVRDAEANAEADRKFEELVQTRNQGDHLLHSTRKQVEEAGDKLPADDKTAIE SALTALETALKGEDKAAIEAKMQELAQVSQKLMEIAQQQHAQQQTAGADASANNAKDD DVVDAEFEEVKDKK" gene 
13817..14947 /gene="dnaJ" /note="synonym: groP" CDS 13817..14947 /gene="dnaJ" /note="ORF_ID:o102#6 similar to SwissProt Accession Number P08622" /codon_start=1 /transl_table=11 /product="DnaJ protein." /protein_id="BAB96590.1" /db_xref="GI:21321903" /translation="MAKQDYYEILGVSKTAEEREIRKAYKRLAMKYHPDRNQGDKEAE AKFKEIKEAYEVLTDSQKRAAYDQYGHAAFEQGGMGGGGFGGGADFSDIFGDVFGDIF GGGRGRQRAARGADLRYNMELTLEEAVRGVTKEIRIPTLEECDVCHGSGAKPGTQPQT CPTCHGSGQVQMRQGFFAVQQTCPHCQGRGTLIKDPCNKCHGHGRVERSKTLSVKIPA GVDTGDRIRLAGEGEAGEHGAPAGDLYVQVQVKQHPIFEREGNNLYCEVPINFAMAAL GGEIEVPTLDGRVKLKVPGETQTGKLFRMRGKGVKSVRGGAQGDLLCRVVVETPVGLN ERQKQLLQELQESFGGPTGEHNSPRSKSFFDGVKKFFDDLTR" gene complement(16400..16609) /gene="gef" CDS complement(16400..16609) /gene="gef" /note="ORF_ID:o102#7 similar to PIR Accession Number S16473" /codon_start=1 /transl_table=11 /product="Gef protein" /protein_id="BAB96591.1" /db_xref="GI:21321904" /translation="MLNTCRVPLTDRKVKEKRAMKQHKAMIVALIVICITAVVAALVT RKDLCEVHIRTGQTEVAVFTAYESE" gene 17138..18304 /gene="ant" /note="synonym: nhaA" CDS 17138..18304 /gene="ant" /note="ORF_ID:o102#8 similar to SwissProt Accession Number P13738" /codon_start=1 /transl_table=11 /product="Na(+)/H(+) antiporter 1." /protein_id="BAB96592.1" /db_xref="GI:21321905" /translation="MKHLHRFFSSDASGGIILIIAAILAMIMANSGATSGWYHDFLET PVQLRVGSLEINKNMLLWINDALMAVFFLLVGLEVKRELMQGSLASLRQAAFPVIAAI GGMIVPALLYLAFNYADPITREGWAIPAATDIAFALGVLALLGSRVPLALKIFLMALA IIDDLGAIIIIALFYTNDLSMASLGVAAVAIAVLAVLNLCGARRTGVYILVGVVLWTA VLKSGVHATLAGVIVGFFIPLKEKHGRSPAKRLEHVLHPWVAYLILPLFAFANAGVSL QGVTLDGLTSILPLGIIAGLLIGKPLGISLFCWLALRLKLAHLPEGTTYQQIMVVGIL CGIGFTMSIFIASLAFGSVDPELINWAKLGILVGSISSAVIGYSWLRVRLRPSV" gene 18364..19283 /gene="antO" /note="synonym: nhaR" misc_feature 18364..19283 /gene="antO" /product="Transcriptional activator protein NhaR." 
/note="ORF is interrupted by frame-shifting ORF_ID:o103#1 similar to SwissProt Accession Number P10087" gene complement(20478..20741) /gene="rpsT" CDS complement(20478..20741) /gene="rpsT" /note="ORF_ID:o103#2 similar to PIR Accession Number A30425" /codon_start=1 /transl_table=11 /product="Ribosomal protein S20" /protein_id="BAB96593.1" /db_xref="GI:21321906" /translation="MANIKSAKKRAIQSEKARKHNASRRSMMRTFIKKVYAAIEAGDK AAAQKAFNEMQPIVDRQAAKGLIHKNKAARHKANLTAQINKLA" gene 21070..22008 /gene="yaaC" CDS 21070..22008 /gene="yaaC" /note="ORF_ID:o103#3 similar to PIR Accession Number A22609" /codon_start=1 /transl_table=11 /product="Hypothetical 35k protein (ileS-lsp operon)" /protein_id="BAB96594.1" /db_xref="GI:21321907" /translation="MKLIRGIHNLSQAPQEGCVLTIGNFDGVHRGHRALLQGLQEEGR KRNLPVMVMLFEPQPLELFATDKAPARLTRLREKLRYLAECGVDYVLCVRFDRRFAAL TAQNFVSDLLVKHLRVKFLAVGDDFPLALVVKAISCYYRKLAWNTASISPVRKLFAEV ACASAARLRQALADDNLALAESLLGHPFAISGRVVHGDELGRTIGFPTANVPPRRQVS PVKGVYAVEVLGLGEKPLPGVANIGTRPTVAGIRQQLEVHLLDVAMDLYGRHIQVVLR KKIRNEQRFASLDELKAQIARDELTAREFFGLTKPA" gene 22051..24861 /gene="ileS" CDS 22051..24861 /gene="ileS" /note="ORF_ID:o103#4 similar to PIR Accession Number S40549" /codon_start=1 /transl_table=11 /product="Isoleucine--tRNA ligase (EC 6.1.1.5)" /protein_id="BAB96595.1" /db_xref="GI:21321908" /translation="MSDYKSTLNLPETGFPMRGDLAKREPGMLARWTDDDLYGIIRAA KKGKKTFILHDGPPYANGSIHIGHSVNKILKDIIVKSKGLSGYDSPYVPGWDCHGLPI ELKVEQEYGKPGEKFTAAEFRAKCREYAATQVDGQRKDFIRLGVLGDWSHPYLTMGLQ NWKANIIRALGKIIGNGHLHKGAKPVHWCVDCRSALAKRKLSITTKLLRPSTLLSRAV DQDALKAKFAVSNVNGPISLVIWTTTPWTLPANRAISIAPDFDYALVQIDGQAVILAK DLVESVMQRIGVTDSRHGKRCGAGAAAFTHPFMGFDVPAILGDHVTLDAGTGAVHTAP GHGPDDYVIGQKYGLETANPVGPDGTYLPGTYPTLDGVNVFKANDIVAALLQEKGALL HVEKMQHSYPCCWRHKTPIIFRATPQWFVSMDQKGLRAQSLKEIKGVQWIPDWGQARI ESMVANRPDWCISRQRTWGVPMSLFVHKDTEELHPRTLELMEEVAKRVEVDGIQAWWD LDAKEILGDEADQYVKVPDTLDVWFDSGSTHSSVVDVRPEFAGHAADMYLEGSDQHRG WFMSSLMISTAMKGKAPYRQVLTHGFTVDGQGRKMSKSIGNTVSPQDVMNKLGADILR 
LWVASTDYTGEMAVSDEILKRAADTYRRIRNTARFLLANLNGFDPAKDMVKRREMVVL DRWAVVVRKRHRKTSSRRTKHTISTKWYKRLMRFCSVEMGSFYLDIIKDRQYYAKGHS VARRSCQTALYHIAEALVRWMAPILSFTADEVWGYLPGEREKYVFTGEWYEGLFGLAD SEAMNDAFWDELLKVRGEVNKVIEQARADKKVGGSLEAAVTLYAEPELSAKLTALGDE LRFVLLTSGATVADYNDAPADAQQSEVLKGLKVALSKAEGEKCPRCWHYTQDVGKVAE HAEICGRCVSNVAGDGEKRKFA" gene 24861..25355 /gene="lspA" CDS 24861..25355 /gene="lspA" /note="ORF_ID:o103#5 similar to SwissProt Accession Number P00804" /codon_start=1 /transl_table=11 /product="Lipoprotein signal peptidase (EC 3.4.23.36) (Prolipoprotein signal peptidase) (Signal peptidase II) (Spase II)." /protein_id="BAB96596.1" /db_xref="GI:21321909" /translation="MSQSICSTGLRWLWLVVVVLIIDLGSKYLILQNFALGDTVPLFP SLNLHYARNYGAAFSFLADSGGWQRWFFAGIAIGISVILAVMMYRSKATQKLNNIAYA LIIGGALGNLFDRLWHGFVVDMIDFYVGDWHFATFNLADTAICVGAALIVLEGFLPSR AKKQ" gene 25480..25929 /gene="yaaD" CDS 25480..25929 /gene="yaaD" /note="ORF_ID:o103#6 similar to PIR Accession Number JE0402" /codon_start=1 /transl_table=11 /product="Hypothetical 16.4K protein (lsp-dapB intergenic region)" /protein_id="BAB96597.1" /db_xref="GI:21321910" /translation="MSESVQSNSAVLVHFTLKLDDGTTAESTRNNGKPALFRLGDASL SEGLEQHLLGLKVGDKTTFSLEPDAAFGVPSPDLIQYFSRREFMDAGEPEIGAIMLFT AMDGSEMPGVIREINGDSITVDFNHPLAGQTVHFDIEVLEIDPALEA" gene 25931..26881 /gene="lytB" CDS 25931..26881 /gene="lytB" /note="ORF_ID:o103#7 similar to PIR Accession Number JE0403" /codon_start=1 /transl_table=11 /product="Hypothetical 34.8k protein (lsp-dapB intergenic region)" /protein_id="BAB96598.1" /db_xref="GI:21321911" /translation="MQILLANPRGFCAGVDRAISIVENALAIYGAPIYVRHEVVHNRY VVDSLRERGAIFIEQISEVPDGAILIFSAHGVSQAVRNEAKSRDLTVFDATCPLVTKV HMEVARASRRGEESILIGHAGHPEVEGTMGQYSNPEGGMYLVESPDDVWKLTVKNEEK LSFMTQTTLSVDDTSDVIDALRKRFPKIVGPRKDDICYATTNRQEAVRALAEQAEVVL VVGSKNSSNSNRLAELAQRMGKRAFLIDDAKDIQEEWVKEVKCVGVTAGASAPDILVQ NVVARLQQLGGGEAIPLEGREENIVFEVPKELRVDIREVD" gene 26947..27861 /gene="yaaF" CDS 26947..27861 /gene="yaaF" /note="ORF_ID:o103#8 similar to PIR Accession Number JE0404" 
/codon_start=1 /transl_table=11 /product="Hypothetical 32.6k protein (lsp-dapB intergenic region)" /protein_id="BAB96599.1" /db_xref="GI:21321912" /translation="MRLPIFLDTDPGIDDAVAIAAAIFAPELDLQLMTTVAGNVSVEK TTRNALQLLHFWNAEIPLAQGAAVPLVRAPRDAASVHGESGMAGYDFVEHNRKPLGIP AFLAIRDALMRAPEPVTLVAIGPLTNIALLLSQCPECKPYIRRLVIMGGSAGRGNCTP NAEFNIAADPEAAACVFRSGIEIVMCGLDVTNQAILTPDYLSTLPQLNRTGKMLHALF SHYRSGSMQSGLRMHDLCAIAWLVRPDLFTLKPCFVAVETQGEFTSGTTVVDIDGCLG KPANVQVALDLDVKGFQQWVAEVLALAS" gene 28028..28849 /gene="dapB" CDS 28028..28849 /gene="dapB" /note="ORF_ID:o103#9 similar to SwissProt Accession Number P04036" /codon_start=1 /transl_table=11 /product="Dihydrodipicolinate reductase (EC 1.3.1.26)." /protein_id="BAB96600.1" /db_xref="GI:21321913" /translation="MHDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSL LGSDAGELAGAGKTGVTVQSSLDAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIG TTGFDEAGKQAIRDAAADIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHH RHKVDAPSGTALAMGEAIAHALDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVG EHTAMFADIGERLEITHKASSRMTFANGAVRSALWLSGKESGLFDMRDVLDLNNL" gene 29305..30453 /gene="carA" /note="synonym: pyrA" CDS 29305..30453 /gene="carA" /note="ORF_ID:o103#10 similar to SwissProt Accession Number P00907" /codon_start=1 /transl_table=11 /product="Carbamoyl-phosphate synthase small chain (EC 6.3.5.5) (carbamoyl- phosphate synthetase glutamine chain)." 
/protein_id="BAB96601.1" /db_xref="GI:21321914" /translation="MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILT DPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYL KRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAK EVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTI VPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLA LASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFD GTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK" gene 30471..33692 /gene="carB" CDS 30471..33692 /gene="carB" /note="ORF_ID:o103#11 similar to PIR Accession Number A01198" /codon_start=1 /transl_table=11 /product="Carbamoyl-phosphate synthase (glutamine-hydrolyzing) (EC 6.3.5.5) large chain" /protein_id="BAB96602.1" /db_xref="GI:21321915" /translation="MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRV ILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALEL ERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAA DVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEM EVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIG VETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDE LMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQ ESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGADRIWYIADAFRAGLSVD GVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLNADFLRQLKRKGFADARLAKLAGV REAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLG GGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLE DVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAV ERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQT AVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPA YTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKA TGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRST GEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELD ATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIR RSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK" gene complement(33740..33919) /gene="yaaV" CDS complement(33740..33919) /gene="yaaV" /note="ORF_ID:o103#12 similar to SwissProt Accession Number 
P46145" /codon_start=1 /transl_table=11 /product="Hypothetical 6.8 kd protein in carB-caiE intergenic region." /protein_id="BAB96603.1" /db_xref="GI:21321916" /translation="MTRFEAIKQGHIKIVDISIVCNFTVDKCELNPAYVIKNIDSPKD LLNGQKKTVLIREPY" gene 33849..34351 /gene="caiF" misc_feature 33849..34351 /gene="caiF" /product="Transcription activator caiF" /note="ORF is interrupted by frame-shifting ORF_ID:o103#13 similar to PIR Accession Number JC6025" gene complement(34438..35049) /gene="caiE" CDS complement(34438..35049) /gene="caiE" /note="ORF_ID:o103#14 similar to SwissProt Accession Number P39206" /codon_start=1 /transl_table=11 /product="Carnitine operon protein caiE." /protein_id="BAB96604.1" /db_xref="GI:21321917" /translation="MERTLTTVSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYI GPLASLRGDYGRLIVQAGANIQDGCIMHGYCDTDTIVGENGHIGHGAILHGCLIGRDA LVGMNSVIMDGAVIGEESIVAAMSFVKAGFRGEKRQLLMGTPARAVRNVSDDELHWKR LNTKEYQDLVGRCHVSLHETQPLRQMEENRPRLQGTTDVTPKR" gene complement(35034..35927) /gene="caiD" CDS complement(35034..35927) /gene="caiD" /note="ORF_ID:o103#15 similar to PIR Accession Number S40557" /codon_start=1 /transl_table=11 /product="Hypothetical protein." /protein_id="BAB96605.1" /db_xref="GI:21321918" /translation="MKQQGTTLPANNHTLKQYAFFAGMLSSLKKQKWRKGMSESLHLT RNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDL KAAAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGAFELALAADFIVCADN ASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGAEEALRWGIVNRVVSQ AELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHS EDAIEGPLAFAEKRDPVWKGR" gene complement(35928..37496) /gene="caiC" CDS complement(35928..37496) /gene="caiC" /note="ORF_ID:o104#1 similar to PIR Accession Number S40558" /codon_start=1 /transl_table=11 /product="Hypothetical protein." 
/protein_id="BAB96606.1" /db_xref="GI:21321919" /translation="MDRGAMDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYS YLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLL CEESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQ LKNQQPATLCYAPPLSTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRD DDVYLTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATVTECIPM MIRTLMVQPPSANDQQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGII GDRPGDKRRWPSIGRVGFCYEAEIRDDHNRPLPAGEIGEICIKGIPGKTIFKEYFLNP QATAKVLEADGWLHTGDTGYRDEEDFFYFVDRRCNMIKRGGENVSCVELENIIAAHPK IQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDL PRNCSGKIIRKNLK" gene complement(37555..38772) /gene="caiB" CDS complement(37555..38772) /gene="caiB" /note="ORF_ID:o104#2 similar to PIR Accession Number S40559" /codon_start=1 /transl_table=11 /product="L-carnitine dehydratase (EC 4.-.-.-)." /protein_id="BAB96607.1" /db_xref="GI:21321920" /translation="MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIE NVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFA RRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQP MPAFPYTADYFSGLTATTAALAALHKVRETGKGESIDIAMYEVMLRMGQYFMMDYFNG GEMCPRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLGTPEIPE GTQLIHRIECPYGPLVEEKLDAWLATHTIAEVKERFAELNIACAKVLTVPELESNPQY VARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDI QELVSKGLAKVED" gene complement(38901..40043) /gene="caiA" CDS complement(38901..40043) /gene="caiA" /note="ORF_ID:o105#1 similar to PIR Accession Number S40560" /codon_start=1 /transl_table=11 /product="Hypothetical protein." 
/protein_id="BAB96608.1" /db_xref="GI:21321921" /translation="MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVK ALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGT QEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRKNGKIYLNGSKCFITSSA YTPYIVVMARDGASPDKPVYTGWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELD EKDMFGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAARYANQRVQFGEAIGRF QLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQ VLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR" gene complement(40075..41589) /gene="caiT" CDS complement(40075..41589) /gene="caiT" /note="ORF_ID:o105#2 similar to PIR Accession Number S40561" /codon_start=1 /transl_table=11 /product="Hypothetical protein." /protein_id="BAB96609.1" /db_xref="GI:21321922" /translation="MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVF SYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSA AVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYF FFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFTMGTSLGLATPLVTEC MQWLFGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIV SGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIF LARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVAR AIIETWAALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRI GWSILVGIIGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD" gene 42025..42831 /gene="fixA" CDS 42025..42831 /gene="fixA" /note="ORF_ID:o105#3 similar to PIR Accession Number S40562" /codon_start=1 /transl_table=11 /product="FixA homolog." /protein_id="BAB96610.1" /db_xref="GI:21321923" /translation="MNDVSFMISGDAMKIITCYKCVPDEQDIAVNNADGSLDFSKADA KISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRDPDELIVVIDD QFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSK IISLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVW SAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI" gene 42810..43787 /gene="fixB" misc_feature 42810..43787 /gene="fixB" /product="FixB protein." 
/note="ORF is interrupted by frame-shifting ORF_ID:o105#4 similar to SwissProt Accession Number P31574" gene 43838..45124 /gene="fixC" CDS 43838..45124 /gene="fixC" /note="ORF_ID:o105#5 similar to PIR Accession Number S40564" /codon_start=1 /transl_table=11 /product="FixC protein" /protein_id="BAB96611.1" /db_xref="GI:21321924" /translation="MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKN VTGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRS YSVLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILAD GVNSILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTD GLMGGGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEY SAHIMPEAGINMLPELVGDGVLIAGDAAGMCMNFGFTIRGMDLAIAAGEAAAKTVLSA MKSDDFSKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFT IDGSAPELMRKKILRHGKKVGFINLIKDGMKGVTVL" gene 45121..45408 /gene="yaaT" CDS 45121..45408 /gene="yaaT" /note="ORF_ID:o105#6 similar to PIR Accession Number S40565" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96612.1" /db_xref="GI:21321925" /translation="MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQVLELLVKACP AGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYS" gene 45465..46050 /gene="yaaU" misc_feature 45465..46050 /gene="yaaU" /product="Hypothetical 18.4 kd protein in fixC-kefC intergenic region (orf65)." 
/note="ORF is interrupted by frame-shifting ORF_ID:o105#7 similar to SwissProt Accession Number P31679" gene 46137..46799 /gene="yabE" CDS 46137..46799 /gene="yabE" /note="ORF_ID:o105#8 similar to PIR Accession Number S40566" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96613.1" /db_xref="GI:21321926" /translation="MKSSRSKPGFRDLFNRCHFPFVLFVAAIWTCQVIPMFAIYTFGP QIVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLSTAGRRPLLIGSFAMMTLALAV LGLIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGT IVSTWALPIFIDNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQR MG" gene 46907..47437 /gene="yabF" CDS 46907..47437 /gene="yabF" /note="ORF_ID:o105#9 similar to PIR Accession Number S40567" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96614.1" /db_xref="GI:21321927" /translation="MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNI DIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWINKVFSHGWAYGHGGTALHGKHLLW AVTTGGGESHFEIGAHPGFDGLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQA RHYKQRLLEWQEAHHG" gene 47430..49292 /gene="kefC" /note="synonym: trkC" CDS 47430..49292 /gene="kefC" /note="ORF_ID:o105#10 similar to SwissProt Accession Number P03819" /codon_start=1 /transl_table=11 /product="Glutathione-regulated potassium-efflux system protein KefC (K(+)/H(+) antiporter)." 
/protein_id="BAB96615.1" /db_xref="GI:21321928" /translation="MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPW GLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGCGALQMVICGGLLG LFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAI PLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREV FSAVALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLF FIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQ GSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREA DEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDA TRMDLLESAGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQ AGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTK ARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS" gene 49482..49961 /gene="folA" /note="synonym: tmrA" CDS 49482..49961 /gene="folA" /note="ORF_ID:o105#11 similar to SwissProt Accession Number P00379" /codon_start=1 /transl_table=11 /product="Dihydrofolate reductase type I (EC 1.5.1.3)." /protein_id="BAB96616.1" /db_xref="GI:21321929" /translation="MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGR HTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQ FLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR" gene complement(50039..50881) /gene="apaH" CDS complement(50039..50881) /gene="apaH" /note="ORF_ID:o105#12 similar to PIR Accession Number A26221" /codon_start=1 /transl_table=11 /product="Bis(5'-nucleosyl)-tetraphosphatase (symmetrical) (EC 3.6.1.41)" /protein_id="BAB96617.1" /db_xref="GI:21321930" /translation="MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPG SLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPLDRLTPLLEAPDADELLNWL RRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMP NNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVA EEYSIAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLAE AAAS" gene complement(50888..51265) /gene="apaG" CDS complement(50888..51265) /gene="apaG" /note="ORF_ID:o106#1 similar to PIR Accession Number A30273" /codon_start=1 /transl_table=11 /product="ApaG protein" /protein_id="BAB96618.1" 
/db_xref="GI:21321931" /translation="MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQ LLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMI DENGVPFSIDIPVFRLAVPTLIH" gene complement(52087..53076) /gene="pdxA" CDS complement(52087..53076) /gene="pdxA" /note="ORF_ID:o106#2 similar to PIR Accession Number JV0026" /codon_start=1 /transl_table=11 /product="PdxA protein" /protein_id="BAB96619.1" /db_xref="GI:21321932" /translation="MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLT NRAAMLGLPLTLRPYSPNSPAQPQTAGTLTLLPVALRAPVTAGQLAVENGHYVVETLA RACDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRV ALATTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHM GTEEIDTIIPVLNELRAQGMKLNGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQ GFGRGVNITLGLPFIRTSVDHGTALELAGRGKADVGSFITALNLAIKMIVNTQ" CDS complement(53076..53873) /note="ORF_ID:o106#3 similar to PIR Accession Number PV0009" /codon_start=1 /transl_table=11 /product="Hypothetical protein 98 (pdx 5' region)" /protein_id="BAB96620.1" /db_xref="GI:21321933" /translation="MGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQAR NGADFGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGF HILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAA AAKEFSQDPGSANQGGDLGWATPDIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELL DTRNVDKTDAAQKDRAYRMLMNRKFSEEAASWMQEQRASAYVKILSN" gene complement(53840..54361) /gene="surA" CDS complement(53840..54361) /gene="surA" /note="ORF_ID:o106#4 similar to PIR Accession Number S40574" /codon_start=1 /transl_table=11 /product="Survival protein SurA precursor (peptidyl-prolyl cis-trans isomerase SurA) (EC 5.2.1.8) (PPiase) (rotamase C)." 
/protein_id="BAB96621.1" /db_xref="GI:21321934" /translation="MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDG LMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMGQKMGVKISDEQLDQAIAN IAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQKSNPW RSRWVTKTTPALS" gene complement(54414..56768) /gene="imp" /note="synonym: ostA" CDS complement(54414..56768) /gene="imp" /note="ORF_ID:o106#5 similar to SwissProt Accession Number P31554" /codon_start=1 /transl_table=11 /product="Organic solvent tolerance protein precursor." /protein_id="BAB96622.1" /db_xref="GI:21321935" /translation="MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQ GDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNSRLQADEVQLHQKEAPGQPEPVR TVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGEN RYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQL PVGDKRRSGFLIPNAKYTTTNYFEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEF RYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGVMDQVWRFNVDYTKV SDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEP QLDVNYYQNDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTE AKLLATHYQQTNLDWYNSRNTTKLDESVNRVMPQFKVDGKMVFERDMEMLAPGYTQTL EPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTTGVTSRI YDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRG GIQYDTRLDNVATSNSSIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGI SQVGAVASWPIADRWSIVGAYYYDTNANKQADSMLGVQYSSCCYAIRVGYERKLNGWD NDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNTL" gene 57023..57838 /gene="yabH" CDS 57023..57838 /gene="yabH" /note="ORF_ID:o106#6 similar to SwissProt Accession Number P31680" /codon_start=1 /transl_table=11 /product="Hypothetical 30.6 kd protein in folA-hepA intergenic region (orf81)." 
/protein_id="BAB96623.1" /db_xref="GI:21321936" /translation="MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWF ANQRERQALFFATTFEVMGHLTKSKGRVTEADIHIASQLMDRMNLHGASRTAAQNAFR VGKSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEE LGISRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDA TTIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKQKAQEIQQAYELIKQQKGFK" gene 58625..58783 /gene="yabP" CDS 58625..58783 /gene="yabP" /note="ORF_ID:o106#7 similar to SwissProt Accession Number P39220" /codon_start=1 /transl_table=11 /product="Hypothetical 5.9 kd protein in surA-hepA intergenic region." /protein_id="BAB96624.1" /db_xref="GI:21321937" /translation="MRDCYLGKKTMKGSNDILYERPGWNANLGVLPRTVLPRTVLTRT VLTWTVLP" gene 58780..58938 /gene="yabQ" CDS 58780..58938 /gene="yabQ" /note="ORF_ID:o106#8 similar to SwissProt Accession Number P39221" /codon_start=1 /transl_table=11 /product="Hypothetical 5.7 kd protein in surA-hepA intergenic region." /protein_id="BAB96625.1" /db_xref="GI:21321938" /translation="MNGATSLYDEVIIINKIPPKKIDTKGVATEEVATKKVLLNKLLT TQLLNEPE" gene complement(59346..60009) /gene="yabO" misc_feature complement(59346..60009) /gene="yabO" /product="Hypothetical 24.9 kd protein in surA-hepA intergenic region." /note="ORF is interrupted by frame-shifting ORF_ID:o106#9 similar to SwissProt Accession Number P39219" CDS 59985..60518 /note="ORF_ID:o106#10" /codon_start=1 /transl_table=11 /protein_id="BAB96626.1" /db_xref="GI:21321939" /translation="MDCSFPSPFSAPLLMRYNDQTQGIQTPTCLIQAFHYLLTVALNG GQFVVTNVRVHCTQSFQTRQLGRQFFVGFVTRCINQCTGRFLDLRFTQLKDSVNILLH GVNQFAAGVTVNRVKLAVKGFKLYLRRQVVAVFIQQHTYRRRWQEAVELQLLRSLGFN HINQFHQQRTYRQRFVF" gene complement(60021..62927) /gene="hepA" CDS complement(60021..62927) /gene="hepA" /note="ORF_ID:o106#11 similar to SwissProt Accession Number P23852" /codon_start=1 /transl_table=11 /product="Probable ATP-dependent helicase HepA." 
/protein_id="BAB96627.1" /db_xref="GI:21321940" /translation="MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYA RSDSPVTRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREVFL DSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLN IAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVE MLRRFNLRFALFDDERYAEAQHDAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDL LVVDEAHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLGMESHFARLRLLDPN RFHDFAQFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSD SEDAQSARQELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTSHRSQKVLV ICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIG SEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRW YHEGLDAFEHTCPTGRTIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLE QGRDRLLEIHSNGGEKAQALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVL TPSDHMLVPDFPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGS STISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQV EFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSARALIDAARNEADEKL SAELSRLEALRAVNPNIRDDELTAIESNRQQVMESLDQAGWRLDALRLIVVTHQ" CDS complement(63092..64741) /note="ORF_ID:o107#1 dinA; polB similar to E. coli DNA polymerase II (EC 2.7.7.7) (Pol II): PIR Accession Number S15943" /codon_start=1 /transl_table=11 /protein_id="BAB96628.1" /db_xref="GI:21321941" /translation="MPSRYRLPLRLGRDNSELEWREHGPAKTASFLPQAKGRLIIDGI EALKSAFWDFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDC ELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEV PPHASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTE GFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFYGVLGTTACRF FDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEEEAAKIGRA LVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDK QRMVFKGLETVRTDWTPLAQQFQQELYLRIFRNEPYQEYVRETIDKLMAGELDARLVY RKRLRRPLSEYQRNVPPHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDY QRSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF" CDS complement(64698..65447) /note="ORF_ID:o107#2 polB similar to E. 
coli DNA-directed DNA polymerase (EC 2.7.7.7) II: PIR Accession Number S15943" /codon_start=1 /transl_table=11 /protein_id="BAB96629.1" /db_xref="GI:21321942" /translation="MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAF IPADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVT VYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPLKWVSIDIETT RHGELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVASRPQLLEKLNAWFANYDPD VIIGWNVVQFDLRMLQKHAEPLPSSAASWAR" gene complement(65522..66217) /gene="araD" CDS complement(65522..66217) /gene="araD" /note="ORF_ID:o107#3 similar to SwissProt Accession Number P08203" /codon_start=1 /transl_table=11 /product="L-ribulose-5-phosphate 4-epimerase (EC 5.1.3.4)." /protein_id="BAB96630.1" /db_xref="GI:21321943" /translation="MLEDLKRQVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPS GVDYSVMTADDMVVVSIETGEVVEGTKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHAT IWAQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDA AQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKH YLRKHGAKAYYGQ" gene complement(66502..68004) /gene="araA" CDS complement(66502..68004) /gene="araA" /note="ORF_ID:o107#4 similar to SwissProt Accession Number P08202" /codon_start=1 /transl_table=11 /product="L-arabinose isomerase (EC 5.3.1.4)." 
/protein_id="BAB96631.1" /db_xref="GI:21321944" /translation="MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKL PCKLVLKPLGTTPDEITAICRDANYDDPCAGLVVWLHTFSPAKMWINGLTMLNKPLLQ FHTQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERI GSWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVV NSISDGDVNALVDEYESCYTMTPATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFT TTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMED YTYHFEKGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKDDPARLIFNTQTGPAI VASLIDLGDRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHH TVFSHALNLNDMRQFAEMHDIEITVIDNDTRLPAFKDALRWNEVYYGFRR" gene complement(68015..69715) /gene="araB" CDS complement(68015..69715) /gene="araB" /note="ORF_ID:o108#1 similar to PIR Accession Number B29022" /codon_start=1 /transl_table=11 /product="Ribulokinase (EC 2.7.1.16)" /protein_id="BAB96632.1" /db_xref="GI:21321945" /translation="MAIAIGLDFGSDSVRALAVDCASGEEIATSVEWYPRWQKGQFCD APNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAVVGIGVDSTGSTPAPIDADGNVL ALRPEFAENPNAMFVLWKDHTAVERSEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKI LHVTRQDSAVAQSAASWIELCDWVPALLSGTTRPQDIRRGRCSAGHKSLWHESWGGLP PASFFDELDPILNRHLPSPLFTDTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCH MGAVGAGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQVDGSVVPGFIGLEAG QSAFGDIYAWFGRVLSWPLEQLAAQHPELKAQINASQKQLLPALTEAWAKNPSLDHLP VVLDWFNGRRSPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGI AVNNVMALGGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIP SAQQKMASAVEKTLQPRSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAAQAVATL " gene 70054..70932 /gene="araC" CDS 70054..70932 /gene="araC" /note="ORF_ID:o108#2 similar to PIR Accession Number A91473" /codon_start=1 /transl_table=11 /product="Arabinose operon regulatory protein" /protein_id="BAB96633.1" /db_xref="GI:21321946" /translation="MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKG YILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRA YWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLL LRRMEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQ LGISVLSWREDQRISQAKLLLSTTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEF RAGCEEKVNDVAVKLS" gene 
71066..71779 /gene="yabI" CDS 71066..71779 /gene="yabI" /note="ORF_ID:o108#3 similar to PIR Accession Number S40581" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96634.1" /db_xref="GI:21321947" /translation="MAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWL AGIIGCLMGDWISFWLGWRFKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFV GPTRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGILAGAAIDIPAGMQSGEF KWLLLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVL IRHPLMPVYIDILRKVVGV" gene complement(71894..72592) /gene="yabJ" CDS complement(71894..72592) /gene="yabJ" /note="ORF_ID:o108#4 similar to PIR Accession Number S40582" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96635.1" /db_xref="GI:21321948" /translation="MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNL IAGFLTPASGSLTIDGVDHTTMPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGLKL NAVQQGKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDELFSAL DPALRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGMTNELL SGKASASALLGITG" gene complement(72576..74186) /gene="yabK" CDS complement(72576..74186) /gene="yabK" /note="ORF_ID:o108#5 similar to PIR Accession Number S40583" /codon_start=1 /transl_table=11 /product="Hypothetical protein" /protein_id="BAB96636.1" /db_xref="GI:21321949" /translation="MATRRQPLIPGWLIPGVSATTLVVAVALAAFLALWWNAPQDDWV AVWQDSYLWHVVRFSFWQAFLSALLSVIPAIFLARALYRRRFPGRLALLRLCAMTLIL PVLVAVFGILSVYGRQGWLATLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQ ALENIPGEQRQLAAQLGMRSWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLG GGPQATTIELAIYQALSYDYDPARAAMLALLQMVCCLGLVLLSQRLSKAIAPGTTLLQ GWRDPDDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTS LRIALAAGVLCVVLTMMLLWSSRELRARQKMLAGQVLEMSGMLILAMPGIVLATGFFL LLNNTIGLPQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRL KVVELRALKRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIGSYRSQDGA VTTLILLLLCFLLFTVIEKLPGRNVKTD" gene complement(74162..75133) /gene="tbpA" CDS complement(74162..75133) /gene="tbpA" /note="ORF_ID:o108#6 similar to PIR Accession Number S40584" /codon_start=1 /transl_table=11 
/product="Hypothetical protein." /protein_id="BAB96637.1" /db_xref="GI:21321950" /translation="MSAPAVAVTAPVFAKPVLTVYTYDSFAADWGPGPVVKKAFEADC NCELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSGVAADAV NVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRTSTPG LGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYH ILEEKKDNYAAANFSEGHYLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNW MYPVANVTLPAGFEKLTKPATTLEFTPAEVAAQRQAWISEWQRAVSR" gene complement(75308..76963) /gene="yabN" CDS complement(75308..76963) /gene="yabN" /note="ORF_ID:o108#7 similar to SwissProt Accession Number P33595" /codon_start=1 /transl_table=11 /product="Hypothetical 63.9 kd protein in tbpA-leuD intergenic region (orf103)." /protein_id="BAB96638.1" /db_xref="GI:21321951" /translation="MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLL NTMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQRAEDLLEQDRIDQLVQLVGDKAT VRQMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEE NGELEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIA DIVSPTPWTLDIHLTQPDRWLPLLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRN STNQLKIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGPQGEEKEIESRLEEG CYYLLFDSRTHRGANQQVRDWASYVLSPTNLVYFAEEQYQQLWFPAYGLFPRWHHART IKSEKPAGLESLTLTFCQDHSEHRVIAGIMQQILASHQVTLKIKEIDYDQWHTGEIES DIWLNSANFTLPLDFSVFAHLCEVPLLQHCIPIDWQADAARWRNGEMNLANWCQQLVA SKAMVPLLHHWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP" gene 77285..78463 /gene="yabM" CDS 77285..78463 /gene="yabM" /note="ORF_ID:o108#8 similar to SwissProt Accession Number P31675" /codon_start=1 /transl_table=11 /product="Hypothetical 42.7 kd protein in tbpA-leuD intergenic region (orf104)." 
/protein_id="BAB96639.1" /db_xref="GI:21321952" /translation="MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREV GAQPFWIGLFYAVNAIAGIGVSLWLAKRSDSQGDRRKLIIFCCLMAIGNALLFAFNRH YLTLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPL AFMLALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVR MLFVASTLMWTCNTMYIIDMPLWISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVK RYGKRRMMVIAVAAGVLFYTGLIFFNSRMALMTLQLFNAVFIGIVAGIGMLWFQDLMP GRAGAATTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV " gene complement(78512..79117) /gene="leuD" CDS complement(78512..79117) /gene="leuD" /note="ORF_ID:o108#9 similar to PIR Accession Number S40585" /codon_start=1 /transl_table=11 /product="Isopropylmalate isomerase subunit" /protein_id="BAB96640.1" /db_xref="GI:21321953" /translation="MAEKFIKHTGLVVPLDAANVDTDAIIPKQFLQKVTRTGFGAHLF NDWRFLDEKGQQPNPDFVLNFPQYQGASILLARENFGCGSSREHAPWALTDYGFKVVI APSFADIFYGNSFNNQLLPVKLSDAEVDELFALVKANPGIHFDVDLEAQEVKAGEKTY RFTIDAFRRHCMMNGLDSIGLTLQHDDAIAAYEAKQPAFMN" gene complement(79128..80528) /gene="leuC" CDS complement(79128..80528) /gene="leuC" /note="ORF_ID:o109#1 similar to PIR Accession Number S40586" /codon_start=1 /transl_table=11 /product="3-isopropylmalate dehydratase (EC 4.2.1.33) alpha chain" /protein_id="BAB96641.1" /db_xref="GI:21321954" /translation="MAKTLYEKLFDAHVVYEAENETPLLYIDRHLVHEVTSPQAFDGL RAHGRPVRQPGKTFATMDHNVSTQTKGINACGEMARIQMQELIKNCKEFGVELYDLNH PYQGIVHVMGPEQGVTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVEHVLATQTLKQ GRAKTMKIEVQGKAAPGITAKDIVLAIIGKTGSAGGTGHVVEFCGEAIRDLSMEGRMT LCNMAIEMGAKAGLVAPDETTFNYVKGRLHAPKGKDFDDAVAYWKTLQTDEGATFDTV VTLQAEEISPQVTWGTNPGQVISVNDNIPDPASFADPVERASAEKALAYMGLKPGIPL TEVAIDKVFIGSCTNSRIEDLRAAAEIVKGRKVAPGVQALVVPGSGPVKAQAEAEGLD KIFIEAGFEWRLPGCSMCLAMNNDRLNPGERCASTSNRNFEGRQGRGGRTHLVSPAMA AAAAVTGHFADIRNIK" gene complement(80531..81622) /gene="leuB" CDS complement(80531..81622) /gene="leuB" /note="ORF_ID:o109#2 similar to SwissProt Accession Number P30125" /codon_start=1 /transl_table=11 /product="3-isopropylmalate dehydrogenase (EC 1.1.1.85) (beta-ipm 
dehydrogenase) (imdH) (3-ipm-dh)." /protein_id="BAB96642.1" /db_xref="GI:21321955" /translation="MSKNYHIAVLPGDGIGPEVMTQALKVLDAVRNRFAMRITTSHYD VGGAAIDNHGQPLPPATVEGCEQADAVLFGSVGGPKWEHLPPDQQPERGALLPLRKHF KLFSNLRPAKLYQGLEAFCPLRADIAANGFDILCVRELTGGIYFGQPKGREGSGQYEK AFDTEVYHRFEIERIARIAFESARKRRHKVTSIDKANVLQSSILWREIVNEIATEYPD VELAHMYIDNATMQLIKDPSQFDVLLCSNLFGDILSDECAMITGSMGMLPSASLNEQG FGLYEPAGGSAPDIAGKNIANPIAQILSLALLLRYSLDADDAACAIERAINRALEEGI RTGDLARGAAAVSTDEMGDIIARYVAEGV" gene complement(81622..83193) /gene="leuA" CDS complement(81622..83193) /gene="leuA" /note="ORF_ID:o109#3 similar to PIR Accession Number S40588" /codon_start=1 /transl_table=11 /product="2-Isopropylmalate synthase" /protein_id="BAB96643.1" /db_xref="GI:21321956" /translation="MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVME VGFPVSSPGDFESVQTIARQVKNSRVCALARCVEKDIDVAAESLKVAEAFRIHTFIAT SPMHIATKLRSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAIN AGATTINIPDTVGYTMPFEFAGIISGLYERVPSIGKAIISVHTHDDLGLAVGNSLAAV HAGARQVEGAMNGIGERAGNCSLEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQI CNMPIPANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIGLNQIQLNLTSRSGR AAVKHRMDEMGYKESEYNLDNLYDAFLKLADKKGQVFDYDLEALAFIGKQQEEPEHFR LDYFSVQSGSNDIATAAVKLACGEEVKAEAANGNGPVDAVYQAINRITEYNVELVKYS LTAKGHGKDALGQVDIVANYNGRRFHGVGLATDIVESSAKAMVHVLNNIWRAAEVEKE LQRKAQHNENNKETV" gene complement(83286..83372) /gene="leuLP" CDS complement(83286..83372) /gene="leuLP" /note="ORF_ID:o109#4 similar to PIR Accession Number A30376" /codon_start=1 /transl_table=11 /product="LeuABCD leader peptide." /protein_id="BAB96644.1" /db_xref="GI:21321957" /translation="MTHIVRFIGLLLLNASSLRGRRVSGIQH" gene 83855..84904 /gene="lueO" CDS 83855..84904 /gene="lueO" /note="ORF_ID:o109#5 similar to PIR Accession Number S40589" /codon_start=1 /transl_table=11 /product="LeuO protein." 
/protein_id="BAB96645.1" /db_xref="GI:21321958" /translation="MTHSTAMDSVFIRTRIFMFSEFYSFCFFLFYMHDKSYSSGLFLC IPIRERELSVTVELSMPEVQTDHPETAELSKPQLRMVDLNLLTVFDAVMQEQNITRAA HVLGMSQPAVSNAVARLKVMFNDELFVRYGRGIQPTARAFQLFGSVRQALQLVQNELP GSGFEPASSERVFHLCVCSPLDSILTSQIYNHIEQIAPNIHVMFKSSLNQNTEHQLRY QETEFVISYEDFHRPEFTSVPLFKDEMVLVASKNHPTIKGPLLKHDVYNEQHAAVSLD RFASFSQPWYDTVDKQASIAYQGMAMMSVLSVVSQTHLVAIAPRWLAEEFAESLELQV LPLPLKQNSRTCYLS" gene 85294..86994 /gene="ilvI" CDS 85294..86994 /gene="ilvI" /note="ORF_ID:o109#6 similar to PIR Accession Number S14385" /codon_start=1 /transl_table=11 /product="Acetolactate synthase (EC 4.1.3.18) III large chain." /protein_id="BAB96646.1" /db_xref="GI:21321959" /translation="MEMLSGGEMVVRSLIDQGVKQVFGYPGGAVLDIYDALHTVGGID HVLVRHEQAAVHMADGLARATGEVGVVLVTSGPGATNAITGIATAYMDSIPLVVLSGQ VATSLIGYDAFQECDMVGISRPVVKHSFLVKQTEDIPQVLKKAFWLAASGRPGPVVVD LPKDILNPANKLPYVWPESVSMRSYNPTTTGHKGQIKRALQSVVAVKKPVVYVGGGAI TAGCHQQLKETVEALNLPVVCSLMGLGAFPATHRQVLGMLGMHGTYEANMTMHNADVI FAVGVRFDDRTTNNLAKYCPNATVLHIDIDPTSISKTVTADIPIVGDARQVLEQMLEL LSQESAHQPLDEIRDWWQQIEQWRARQCLKYDTHSEKIKPQAVIETLWRLTKGDAYVT SDVGQHQMFAALYYPFDKPRRWINSGGLGSMGFGLPAALGVKMAFPEETVVCVTGDGS IQMNIQELSTALQYELPVLVVNLNNRYLGMVKQWQDMIYSGRHSQSYMQSLPDFVRRG AYGHVGIQISHPHGWKANLARRWNRCAIIAWCLLMLPSMAASTSTRCRFAGAEWMKCG " gene 87017..87508 /gene="brnP" /note="synonym: ilvH" CDS 87017..87508 /gene="brnP" /note="ORF_ID:o109#7 similar to SwissProt Accession Number P00894" /codon_start=1 /transl_table=11 /product="Acetolactate synthase isozyme III small subunit (EC 4.1.3.18) (ahas- III) (acetohydroxy-acid synthase III small subunit) (als-III)." 
/protein_id="BAB96647.1" /db_xref="GI:21321960" /translation="MRRILSVLLENESGALSRVIGLFSQRGYNIESLTVAPTDDPTLS RMTIQTVGDEKVLEQIEKQLHKLVDVLRVSELGQGAHVEREIMLVKIQASGYGRDEVK RNTEIFRGQIIDVTPSLYTVQLAGTSGKLSAFLASIRDVAKIVEVARSGVVGLSRGDK IMR" gene 87688..88692 /gene="fruR" /note="synonym: shl" CDS 87688..88692 /gene="fruR" /note="ORF_ID:o109#8 similar to PIR Accession Number JU0298" /codon_start=1 /transl_table=11 /product="Pep-fructosephosphotransferase system repressor." /protein_id="BAB96648.1" /db_xref="GI:21321961" /translation="MKLDEIARLAGVSRTTASYVINGKAKQYRVSDKTVEKVMAVVRE HNYHPNAVAAGLRAGRTRSIGLVIPDLENTSYTRIANYLERQARQRGYQLLIACSEDQ PDNEMRCIEHLLQRQVDAIIVSTSLPPEHPFYQRWANDPFPIVALDRALDREHFTSVV GADQDDAEMLAEELRKFPAETVLYLGALPELSVSFLREQGFRTAWKDDPREVHFLYAN SYEREAAAQLFEKWLETHPMPQALFTTSFALLQGVMDVTLRRDGKLPSDLAIATFGDN ELLDFLQCPVLAVAQRHRDVAERVLEIVLASLDEPRKPKPGLTRIKRNLYRRGVLSRS " gene 89294..89752 /gene="yabB" CDS 89294..89752 /gene="yabB" /note="ORF_ID:o109#9 similar to PIR Accession Number S14388" /codon_start=1 /transl_table=11 /product="Hypothetical protein C." /protein_id="BAB96649.1" /db_xref="GI:21321962" /translation="MFRGATLVNLDSKGRLSVPTRYREQLLENAAGQMVCTIDIYHPC LLLYPLPEWEIIEQKLSRLSSMNPVERRVQRLLLGHASECQMDGAGRLLIAPVLRQHA GLTKEVMLVGQFNKFELWDETTWHQQVKEDIDAEQLATGDLSERLQDLSL" gene 89754..90695 /gene="yabC" CDS 89754..90695 /gene="yabC" /note="ORF_ID:o109#10 similar to SwissProt Accession Number P18595" /codon_start=1 /transl_table=11 /product="Hypothetical 34.9 kd protein in fruR-ftsL intergenic region (orfB)." 
/protein_id="BAB96650.1" /db_xref="GI:21321963" /translation="MMENYKHTTVLLDEAVNGLNIRPDGIYIDGTFGRGGHSRLILSQ LGEEGRLLAIDRDPQAIAVAKTIDDPRFSIIHGPFSALGEYVAERDLIGKIDGILLDL GVSSPQLDDAERGFSFMRDGPLDMRMDPTRGQSAAEWLQTAEEADIAWVLKTYGEERF AKRIARAIVERNREQPMTRTKELAEVVAAATPVKDKFKHPATRTFQAVRIWVNSELEE IEQALKSSLNVLAPGGRLSIISFHSLEDRIVKRFMRENSRGPQVPAGLPMTEEQLKKL GGRQLRALGKLMPGEEEVAENPRARSSVLRIAERTNA" gene 90692..91024 /gene="ftsL" CDS 90692..91024 /gene="ftsL" /note="ORF_ID:o110#1 similar to PIR Accession Number S14387" /codon_start=1 /transl_table=11 /product="Cell division protein FtsL" /protein_id="BAB96651.1" /db_xref="GI:21321964" /translation="MISRVTEALSKVKGSMGSHERHALPGVIGDDLLRFGKLPLCLFI CIILTAVTVVTTAHHTRLLTAQREQLVLERDALDIEWRNLILEENALGDHSRVERIAT EKLQMQTC" gene 91074..92840 /gene="ftsI" /note="synonym: pbpB" CDS 91074..92840 /gene="ftsI" /note="ORF_ID:o110#2 similar to PIR Accession Number A93123" /codon_start=1 /transl_table=11 /product="Penicillin-binding protein 3 precursor." /protein_id="BAB96652.1" /db_xref="GI:21321965" /translation="MKAAAKTQKPKRQEEHANFISWRFALLCGCILLALAFLLGRVAW LQVISPDMLVKEGDMRSLRVQQVSTSRGMITDRSGRPLAVSVPVKAIWADPKEVHDAG GISVGDRWKALANALNIPLDQLSARINANPKGRFIYLARQVNPDMADYIKKLKLPGIH LREESRRYYPSGEVTAHLIGFTNVDSQGIEGVEKSFDKWLTGQPGERIVRKDRYGRVI EDISSTDSQAAHNLALSIDERLQALVYRELNNAVAFNKAESGSAVLVDVNTGEVLAMA NSPSYNPNNLSGTPKEAMRNRTITDVFEPGSTVKPMVVMTALQRGVVRENSVLNTIPY RINGHEIKDVARYSELTLTGVLQKSSNVGVSKLALAMPSSALVDTYSRFGLGKATNLG LVGERSGLYPQKQRWSDIERATFSFGYGLMVTPLQLARVYATIGSYGIYRPLSITKVD PPVPGERVFPESIVRTVVHMMESVALPGGGGVKAAIKGYRIAIKTGTAKKVGPDGRYI NKYIAYTAGVAPASQPRFALVVVINDPQAGKYYGGAVSAPVFGAIMGGVLRTMNIEPD ALTTGDKNEFVINQGEGTGGRS" gene 92827..94314 /gene="murE" CDS 92827..94314 /gene="murE" /note="ORF_ID:o110#3 similar to PIR Accession Number S40595" /codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramoylalanyl-D-glutamate--2, 6-diaminopimelate ligase (EC 6.3.2.13) murE" /protein_id="BAB96653.1" /db_xref="GI:21321966" /translation="MADRNLRDLLAPWVPDAPSRALREMTLDSRVAAAGDLFVAVVGH 
QADGRRYIPQAIAQGVAAIIAEAKDEATDGEIREMHGVPVIYLSQLNERLSALAGRFY HEPSDNLRLVGVTGTNGKTTTTQLLAQWSQLLGEISAVMGTVGNGLLGKVIPTENTTG SAVDVQHELAGLVDQGATFCAMEVSSHGLVQHRVAALKFAASVFTNLSRDHLDYHGDM EHYEAAKWLLYSEHHCGQAIINADDEVGRRWLAKLPDAVAVSMEDHINPNCHGRWLKA TEVNYHDSGATIRFSSSWGDGEIESHLMGAFNVSNLLLALATLLALGYPLADLLKTAA RLQPVCGRMEVFTAPGKPTVVVDYAHTPDALEKALQAARLHCAGKLWCVFGCGGDRDK GKRPLMGAIAEEFADVAVVTDDNPRTEEPRAIINDILAGMLDAGHAKVMEGRAEAVTC AVMQAKENDVVLVAGKGHEDYQIVGNQRLDYSDRVTVARLLGVIA" gene 94311..95669 /gene="mra" /note="synonym: murF" CDS 94311..95669 /gene="mra" /note="ORF_ID:o110#4 similar to SwissProt Accession Number P11880" /codon_start=1 /transl_table=11 /product="UDP-n-acetylmuramoylalanyl-d-glutamyl-2, 6-diaminopimelate-d-alanyl -d- alanyl ligase (EC 6.3.2.15) (UDP-murnac-pentapeptide synthetase) (d-alanyl-d-alanine-adding enzyme)." /protein_id="BAB96654.1" /db_xref="GI:21321967" /translation="MISVTLSQLTDILNGELQGADITLDAVTTDTRKLTPGCLFVALK GERFDAHDFADQAKAGAAGALLVSRPLDIDLPQLIVKDTRLAFGELAAWVRQQVPARV VALTGSSGKTSVKEMTAAILSQCGNTLYTAGNLNNDIGVPMTLLRLTPEYDYAVIELG ANHQGEIAWTVSLTRPERALVNNLAAAHLEGFGSLAGVAKAKGEIFSGLPENGIAIMN ADNNDWLNWQSVIGSRKVWRFSPNAANSDFTATNIHVTSHGTEFTLQTPTGSVDVLLP LPGRHNIANALAAAALSMSVGATLDAIKAGLANLKAVPGRLFPIQLAENQLLLDDSYN ANVGSMTAAVQVLAEMPGYRVLVVGDMAELGAESEACHVQVGEAAKAAGIDRVLSVGK QSHAISTASGVGEHFADKTALITRLKLLIAEQQVITILVKGSRSAAMEEVVRALQENG TC" gene 95663..96745 /gene="mraY" /note="synonym: murX" CDS 95663..96745 /gene="mraY" /note="ORF_ID:o110#5 similar to SwissProt Accession Number P15876" /codon_start=1 /transl_table=11 /product="Phospho-n-acetylmuramoyl-pentapeptide-transferas e (EC 2.7.8.13)." 
/protein_id="BAB96655.1" /db_xref="GI:21321968" /translation="MLVWLAEHLVKYYSGFNVFSYLTFRAIVSLLTALFISLWMGPRM IAHLQKLSFGQVVRNDGPESHFSKRGTPTMGGIMILTAIVISVLLWAYPSNPYVWCVL VVLVGYGVIGFVDDYRKVVRKDTKGLIARWKYFWMSVIALGVAFALYLAGKDTPATQL VVPFFKDVMPQLGLFYILLAYFVIVGTGNAVNLTDGLDGLAIMPTVFVAGGFALVAWA TGNMNFASYLHIPYLRHAGELVIVCTAIVGAGLGFLWFNTYPAQVFMGDVGSLALGGA LGIIAVLLRQEFLLVIMGGVFVVETLSVILQVGSFKLRGQRIFRMAPIHHHYELKGWP EPRVIVRFWIISLMLVLIGLATLKVR" gene 96748..98064 /gene="murD" CDS 96748..98064 /gene="murD" /note="ORF_ID:o110#6 similar to PIR Accession Number S08396" /codon_start=1 /transl_table=11 /product="UDP-n-acetylmuramoylalanine-d-glutamate ligase (EC 6.3.2.9)." /protein_id="BAB96656.1" /db_xref="GI:21321969" /translation="MADYQGKNVVIIGLGLTGLSCVDFFLARGVTPRVMDTRMTPPGL DKLPEAVERHTGSLNDEWLMAADLIVASPGIALAHPSLSAAADAGIEIVGDIELFCRE AQAPIVAITGSNGKSTVTTLVGEMAKAAGVNVGVGGNIGLPALMLLDDECELYVLELS SFQLETTSSLQAVAATILNVTEDHMDRYPFGLQQYRAAKLRIYENAKVCVVNADDALT MPIRGADERCVSFGVNMGDYHLNHQQGETWLRVKGEKVLNVKEMKLSGQHNYTNALAA LALADAAGLPRASSLKALTTFTGLPHRFEVVLEHNGVRWINDSKATNVGSTEAALNGL HVDGTLHLLLGGDGKSADFSPLARYLNGDNVRLYCFGRDGAQLAALRPEVAEQTETME QAMRLLAPRVQPGDMVLLSPACASLDQFKNFEQRGNEFARLAKELG" gene 98064..99308 /gene="ftsW" CDS 98064..99308 /gene="ftsW" /note="ORF_ID:o111#1 similar to PIR Accession Number A32581" /codon_start=1 /transl_table=11 /product="Cell division protein FtsW." 
/protein_id="BAB96657.1" /db_xref="GI:21321970" /translation="MRLSLPRLKMPRLPGFSILVWISTALKGWVMGSREKDTDSLIMY DRTLLWLTFGLAAIGFIMVTSASMPIGQRLTNDPFFFAKRDGVYLILAFILAIITLRL PMEFWQRYSATMLLGSIILLMIVLVVGSSVKGASRWIDLGLLRIQPAELTKLSLFCYI ANYLVRKGDEVRNNLRGFLKPMGVILVLAVLLLAQPDLGTVVVLFVTTLAMLFLAGAK LWQFIAIIGMGISAVVLLILAEPYRIRRVTAFWNPWEDPFGSGYQLTQSLMAFGRGEL WGQGLGNSVQKLEYLPEAHTDFIFAIIGEELGYVGVVLALLMVFFVAFRAMSIGRKAL EIDHRFSGFLACSIGIWFSFQALVNVGAAAGMLPTKGLTLPLISYGGSSLLIMSTAIM MLLRIDYETRLEKAQAFVRGSR" gene 99305..100372 /gene="murG" CDS 99305..100372 /gene="murG" /note="ORF_ID:o111#2 similar to PIR Accession Number JQ0544" /codon_start=1 /transl_table=11 /product="MurG protein." /protein_id="BAB96658.1" /db_xref="GI:21321971" /translation="MSGQGKRLMVMAGGTGGHVFPGLAVAHHLMAQGWQVRWLGTADR MEADLVPKHGIEIDFIRISGLRGKGIKALIAAPLRIFNAWRQARAIMKAYKPDVVLGM GGYVSGPGGLAAWSLGIPVVLHEQNGIAGLTNKWLAKIATKVMQAFPGAFPNAEVVGN PVRTDVLALPLPQQRLAGREGPVRVLVVGGSQGARILNQTMPQVAAKLGDSVTIWHQS GKGSQQSVEQAYAEAGQPQHKVTEFIDDMAAAYAWADVVVCRSGALTVSEIAAAGLPA LFVPFQHKDRQQYWNALPLEKAGAAKIIEQPQLSVDAVANTLAGWSRETLLTMAERAR AASIPDATERVANEVSRVARA" gene 100426..101901 /gene="murC" CDS 100426..101901 /gene="murC" /note="ORF_ID:o111#3 similar to PIR Accession Number JQ0545" /codon_start=1 /transl_table=11 /product="UDP-n-acetylmuramate-alanine ligase (EC 6.3.2.8)." 
/protein_id="BAB96659.1" /db_xref="GI:21321972" /translation="MNTQQLAKLRSIVPEMRRVRHIHFVGIGGAGMGGIAEVLANEGY QISGSDLAPNPVTQQLMNLGATIYFNHRPENVRDASVVVVSSAISADNPEIVAAHEAR IPVIRRAEMLAELMRFRHGIAIAGTHGKTTTTAMVSSIYAEAGLDPTFVNGGLVKAAG VHARLGHGRYLIAEADESDASFLHLQPMVAIVTNIEADHMDTYQGDFENLKQTFINFL HNLPFYGRAVMCVDDPVIRELLPRVGRQTTTYGFSEDADVRVEDYQQIGPQGHFTLLR QDKEPMRVTLNAPGRHNALNAAAAVAVATEEGIDDEAILRALESFQGTGRRFDFLGEF PLEPVNGKSGTAMLVDDYGHHPTEVDATIKAARAGWPDKNLVMLFQPHRFTRTRDLYD DFANVLTQVDTLLMLEVYPAGEAPIPGADSRSLCRTIRGRGKIDPILVPDPARVAEML APVLTGNDLILVQGAGNIGKIARSLAEIKLKPQTPEEEQHD" gene 101894..102814 /gene="ddl" /note="synonym: ddlB" CDS 101894..102814 /gene="ddl" /note="ORF_ID:o111#4 similar to PIR Accession Number A30289" /codon_start=1 /transl_table=11 /product="D-alanine-d-alanine ligase (EC 6.3.2.4) B." /protein_id="BAB96660.1" /db_xref="GI:21321973" /translation="MTDKIAVLLGGTSAEREVSLNSGAAVLAGLREGGIDAYPVDPKE VDVTQLKSMGFQKVFIALHGRGGEDGTLQGMLELMGLPYTGSGVMASALSMDKLRSKL LWQGAGLPVAPWVALTRAEFEKGLSDKQLAEISALGLPVIVKPSREGSSVGMSKVVAE NALQDALRLAFQHDEEVLIEKWLSGPEFTVAILGEEILPSIRIQPSGTFYDYEAKYLS DETQYFCPAGLEASQEANLQALVLKAWTTLGCKGWGRIDVMLDSDGQFYLLEANTSPG MTSHSLVPMAARQAGMSFSQLVVRILELAD" gene 102816..103646 /gene="ftsQ" CDS 102816..103646 /gene="ftsQ" /note="ORF_ID:o111#5 similar to PIR Accession Number S10852" /codon_start=1 /transl_table=11 /product="Cell division protein FtsQ." /protein_id="BAB96661.1" /db_xref="GI:21321974" /translation="MSQAALNTRNSEEEVSSRRNNGTRLAGILFLLTVLTTVLVSGWV VLGWMEDAQRLPLSKLVLTGERHYTRNDDIRQSILALGEPGTFMTQDVNIIQTQIEQR LPWIKQVSVRKQWPDELKIHLVEYVPIARWNDQHMVDAEGNTFSVPPERTSKQVLPML YGPEGSANEVLQGYREMGQMLAKDRFTLKEAAMTARRSWQLTLNNDIKLNLGRGDTMK RLARFVELYPVLQQQAQTDGKRISYVDLRYDSGAAVGWAPLPPEESTQQQNQAQAEQQ " gene 103643..104905 /gene="divA" /note="synonym: ftsA" CDS 103643..104905 /gene="divA" /note="ORF_ID:o111#6 similar to SwissProt Accession Number P06137" /codon_start=1 /transl_table=11 /product="Cell division protein FtsA." 
/protein_id="BAB96662.1" /db_xref="GI:21321975" /translation="MIKATDRKLVVGLEIGTAKVAALVGEVLPDGMVNIIGVGSCPSR GMDKGGVNDLESVVKCVQRAIDQAELMADCQISSVYLALSGKHISCQNEIGMVPISEE EVTQEDVENVVHTAKSVRVRDEHRVLHVIPQEYAIDYQEGIKNPVGLSGVRMQAKVHL ITCHNDMAKNIVKAVERCGLKVDQLIFAGLASSYSVLTEDERELGVCVVDIGGGTMDI AVYTGGALRHTKVIPYAGNVVTSDIAYAFGTPPSDAEAIKVRHGCALGSIVGKDESVE VPSVGGRPPRSLQRQTLAEVIEPRYTELLNLVNEEILQLQEKLRQQGVKHHLAAGIVL TGGAAQIEGLAACAQRVFHTQVRIGAPLNITGLTDYAQEPYYSTAVGLLHYGKESHLN GEAEVEKRVTASVGSWIKRLNSWLRKEF" gene 104966..106117 /gene="ftsZ" /note="synonyms: sfiB, sulB" CDS 104966..106117 /gene="ftsZ" /note="ORF_ID:o111#7 similar to SwissProt Accession Number P06138" /codon_start=1 /transl_table=11 /product="Cell division protein FtsZ." /protein_id="BAB96663.1" /db_xref="GI:21321976" /translation="MFEPMELTNDAVIKVIGVGGGGGNAVEHMVRERIEGVEFFAVNT DAQALRKTAVGQTIQIGSGITKGLGAGANPEVGRNAADEDRDALRAALEGADMVFIAA GMGGGTGTGAAPVVAEVAKDLGILTVAVVTKPFNFEGKKRMAFAEQGITELSKHVNSL ITIPNDKLLKVLGRGISLLDAFGAANDVLKGAVQGIAELITRPGLMNVDFADVRTVMS EMGHAMMGSGVASGEDRAEEAAEMAISSPLLEDIDLSGARGVLVNITAGFDLRLDEFE TVGNTIRAFASDNATVVIGTSLDPDMNDELRVTVVATGIGMDKRPEITLVTNKQVQQP VMDRYQQHGMAPLTQEQKPVAKVVNDNAPQTAKEPDYLDIPAFLRKQAD" gene 106218..107135 /gene="asmB" /note="synonyms: envA, lpxC" CDS 106218..107135 /gene="asmB" /note="ORF_ID:o111#8 similar to SwissProt Accession Number P07652" /codon_start=1 /transl_table=11 /product="Udp-3-o-[3-hydroxymyristoyl] n-acetylglucosamine deacetylase (EC 3.5.1.-) (EnvA protein)." 
/protein_id="BAB96664.1" /db_xref="GI:21321977" /translation="MIKQRTLKRIVQATGVGLHTGKKVTLTLRPAPANTGVIYRRTDL NPPVDFPADAKSVRDTMLCTCLVNEHDVRISTVEHLNAALAGLGIDNIVIEVNAPEIP IMDGSAAPFVYLLLDAGIDELNCAKKFVRIKETVRVEDGDKWAEFKPYNGFSLDFTID FNHPAIDSSNQRYAMNFSADAFMRQISRARTFGFMRDIEYLQSRGLCLGGSFDCAIVV DDYRVLNEDGLRFEDEFVRHKMLDAIGDLFMCGHNIIGAFTAYKSGHALNNKLLQAVL AKQEAWEYVTFQDDAELPLAFKAPSAVLA" gene 107435..107878 /gene="yacA" CDS 107435..107878 /gene="yacA" /note="ORF_ID:o111#9 similar to PIR Accession Number B28381" /codon_start=1 /transl_table=11 /product="Hypothetical 16k protein (eneA-secA intergenic region)." /protein_id="BAB96665.1" /db_xref="GI:21321978" /translation="MVAASLGLPALSNAAEPNAPAKATTRNHEPSAKVNFGQLALLEA NTRRPNSNYSVDYWHQHAIRTVIRHLSFAMAPQTLPVAEESLPLQAQHLALLDTLSAL LTQEGTPSEKGYRIDYAHFTPQAKFSTPVWISQAQGIRAGPQRLT" gene 107940..110645 /gene="azi" /note="synonyms: pea, prlD, secA" CDS 107940..110645 /gene="azi" /note="ORF_ID:o111#10 similar to SwissProt Accession Number P10408" /codon_start=1 /transl_table=11 /product="Preprotein translocase SecA subunit." 
/protein_id="BAB96666.1" /db_xref="GI:21321979" /translation="MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELK GKTAEFRARLEKGEVLENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEM RTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPG MPAPAKREAYAADITYGTNNEYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDE ARTPLIISGPAEDSSEMYKRVNKIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTE RGLVLIEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAHALFTRDVDYIVKDGEVI IVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQNENQTLASITFQNYFRLYEKLAGMTG TADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQ PVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAAVTIATNMAG RGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGTERHESRR IDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIEHPWVT KAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQRNELLDVSDVSETINSIRED VFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAEWLDKEPELHEETLRDGIL AQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAMDYLRQGIHLRGYAQKD PKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEELEQQRRMEAERLAQMQQ LSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCHGRLQ" gene 110705..111094 /gene="mutT" CDS 110705..111094 /gene="mutT" /note="ORF_ID:o111#11 similar to PIR Accession Number A27890" /codon_start=1 /transl_table=11 /product="Mutator MutT (AT-GC transversion)." /protein_id="BAB96667.1" /db_xref="GI:21321980" /translation="MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGE TPEQAVVRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVERWEGEPWGKEGQPG EWMSLVGLNADDFPPANEPVIAKLKRL" gene complement(111310..111408) /gene="yacG" CDS complement(111310..>111408) /gene="yacG" /note="ORF_ID:o111#12 similar to SwissProt Accession Number P36681" /codon_start=1 /transl_table=11 /product="Hypothetical 5.8 kd protein in mutT-guaC intergenic region." 
/protein_id="BAB96668.1" /db_xref="GI:21321981" /translation="LIDLGEWAAEEKRIPSSGDLSESDDWSEEPKQ" BASE COUNT 26083 a 28055 c 30531 g 26739 t ORIGIN 1 atgcgagtgt tgaagttcgg cggtacatca ttggcaaatg cagaacgttt tctgcgtgtt 61 gccgatattc tggaaagcaa tgccaggcag gggcaggtgg ccaccgtcct ctctgcccct 121 gccaaaatca ccaaccacct ggtggcgatg attgaaaaaa ccattagcgg ccaggatgct 181 ttacccaata tcagcgatgc cgaacgtatt tttgccgaac ttttgacggg actcgccgcc 241 gcccagccgg ggttcccgtt ggcgcaattg aaaacattcg tcgatcagga atttgcccaa 301 ataaaacatg tcctgcatgg cattagtttg ttggggcagt gcccggatag catcaacgct 361 gcgctgattt gccgtggcga gaagatgtcg atcgccatta tggccggcgt attagaagcg 421 cgcggtcaca acgttactgt tatcgatccg gtcgaaaaac tgctggcagt ggggcattac 481 ctcgaatcca ccgtcgatat tgccgagtcc acccgccgta tagcggcaag ccgcattccg 541 gctgatcaca tggtgctgat ggcaggtttc accgccggta atgagaaagg cgaactggtg 601 gtgcttggac gcaacggttc cgactactct gctgcggtgc tggctgcctg tttacgcgcc 661 gattgttgcg agatttggac ggacgttaac ggggtctata cctgcgaccc gcgtcaggtg 721 cccgacgcga ggttgttgaa gtcgatgtcc taccaggaag cgatggagct ttcctacttc 781 ggcgctaaag ttcttcaccc ccgcaccatt acccccatcg cccagttcca gatcccttgc 841 ctgattaaaa ataccggaaa tcctcaagca ccaggtacgc tcattggtgc aagccgtgat 901 gaagacgaat taccggtcaa gggcatttcc aatctgaata acatggcaat gttcagcgtt 961 tctggtccgg ggatgaaagg gatggtcggc atggcggcgc gcgtctttgc agcgatgtca 1021 cgcgcccgta tttccgtggt gctgattacg caatcatctt ccgaatacag catcagtttc 1081 tgcgttccac aaagcgactg tgtgcgagct gaacgggcaa tgctggaaga gttctaccta 1141 gaactgaaag aaggcttact ggagccgctg gcagtggcgg aacggctggc cattatctcg 1201 gtggtaggtg atggtttgcg caccttgcgt gggatctcgg cgaaattctt tgccgcactg 1261 gcccgcgcca atatcaacat tgtcgccatt gctcagggat cttctgaacg ctcaatctct 1321 gtcgtggtaa ataacgatga tgcgaccact ggcgtgcgcg ttactcatca gatgctgttc 1381 aataccgatc aggttatcga agtgtttgtg attggcgtcg gtggcgttgg cggtgcgctg 1441 ctggagcaac tgaagcgtca gcaaagctgg ctgaagaata aacatatcga cttacgtgtc 1501 tgcggtgttg ccaactcgaa ggctctgctc accaatgtac atggccttaa tctggaaaac 1561 tggcaggagg aactggcgca 
agccaaagag ccgtttaatc tcgggcgctt aattcgcctc 1621 gtgaaagaat atcatctgct gaacccggtc attgttaact gcacttccag ccaggcagtg 1681 gcggatcaat atgccgactt cctgcgcgaa ggtttccacg ttgtcacgcc gaacaaaaag 1741 gccaacacct cgtcgatgga ttactaccat cagttgcgtt atgcggcgga aaaatcgcgg 1801 cgtaaattcc tctatgacat caacgttggg gctggattac cggttattga gaacctgcaa 1861 aatctgctca atgcaggtga tgaattgatg aaattctccg gcattctttc tggttcgctt 1921 tcttatatct tcggcaagtt agacgaaggc atgagtttct ccgaggcgac ccggctggcg 1981 cgggaaatgg gttataccga accggacccg cgagatgatc tttctggtat ggatgtggcg 2041 cgtaaactat tgattctcgc tcgtgaaacg ggacgtgaac tggagctggc ggatattgaa 2101 attgaacctg tgctgcccgc agagtttaac gccgagggtg atgttgccgc ttttatggcg 2161 aatctgtcac aactcgacga tctctttgcc gcgcgcgtgg cgaaggcccg tgatgaagga 2221 aaagttttgc gctatgttgg caatattgat gaagatggcg tctgccgcgt gaagattgcc 2281 gaagtggatg gtaatgatcc gctgttcaaa gtgaaaaatg gcgaaaacgc cctggccttc 2341 tatagccact attatcagcc gctgccgttg gtactgcgcg gatatggtgc gggcaatgac 2401 gttacagctg ccggtgtctt tgctgatctg ctacgtaccc tctcatggaa gttaggagtc 2461 tgacatggtt aaagtttatg ccccggcttc cagtgccaat atgagcgtcg ggtttgatgt 2521 gctcggggcg gcggtgacac ctgttgatgg tgcattgctc ggagatgtag tcacggttga 2581 ggcggcacag acattcagtc tcaacaacct cggacgcttt gccgataagc tgccgtcaga 2641 accacgggaa aatatcgttt atcagtgctg ggagcgtttt tgccaggaac tgggtaagca 2701 aattccagtg gcgatgaccc tggaaaagaa tatgccgatc ggttcgggct taggctccag 2761 tgcctgttcg gtggtcgcgg cgctgatggc gatgaatgaa cactgcggca agccgcttaa 2821 tgacactcgt ttgctggctt tgatgggcga gctggaaggc cgtatctccg gcagcattca 2881 ttacgacaac gtggcaccgt gttttctcgg tggtatgcag ttgatgatcg aagaaaacga 2941 catcatcagc cagcaagtgc agggtttgat gagtggctgt gggtgctcgc gtatccgggg 3001 attaaagtct cgacggcaga agcagggcta tttaccggcg cagtatcgcc gccaggattg 3061 cattgcgcac gggcgacatc tggcaggctt cattcacgcc tgctattccc gtcagcctga 3121 gcttgccgcg aagctgatga aagatgttat cgctgaaccc taccgtgaac ggttactgcc 3181 aggcttccgg caggcgcggc aggcggtcgc ggaaatcggc gcggtagcga gcggtatctc 3241 cggctccggc ccgaccttgt tcgctctgtg 
tgacaagccg gaaaccgccc agcgcgttgc 3301 cgactggttg ggtaagaact acctgcaaaa tcaggaaggt tttgttcata tttgccggct 3361 ggatacggcg ggcgcacgag tactggaaaa ctaaatgaaa ctctacaatc tgaaagatca 3421 caacgagcag gtcagctttg cgcaagccgt aacccagggg ttgggcaaaa atcaggggct 3481 gttttttccg cacgacctgc cggaattcag cctgactgaa attgatgaga tgctgaagct 3541 ggattttgtc acccgcagtg cgaagatcct ctcggcgttt attggtgatg aaatcccaca 3601 ggaaatcctg gaagagcgcg tgcgcgcggc gtttgccttc ccggctccgg tcgccaatgt 3661 tgaaagcgat gtcggttgtc tggaattgtt ccacgggcca acgctggcat ttaaagattt 3721 cggcggtcgc tttatggcac aaatgctgac ccatattgcg ggtgataagc cagtgaccat 3781 tctgaccgcg acctccggtg ataccggagc ggcagtggct catgctttct acggtttacc 3841 gaatgtgaaa gtggttatcc tctatccacg aggcaaaatc agtccactgc aagaaaaact 3901 gttctgtaca ttgggcggca atatcgaaac tgttgccatc gacggcgatt tcgatgcctg 3961 tcaggcgctg gtgaagcagg cgtttgatga tgaagaactg aaagtggcgc tagggttaaa 4021 ctcggctaac tcgattaaca tcagccgttt gctggcgcag atttgctact actttgaagc 4081 tgttgcgcag ctgccgcagg agacgcgcaa ccagctggtt gtctcggtgc caagcggaaa 4141 cttcggcgat ttgacggcgg gtctgctggc gaagtcactc ggtctgccgg tgaaacgttt 4201 tattgctgcg accaacgtga acgataccgt gccacgtttc ctgcacgacg gtcagtggtc 4261 acccaaagcg actcaggcga cgttatccaa cgcgatggac gtgagtcagc cgaacaactg 4321 gccgcgtgtg gaagagttgt tccgccgcaa aatctggcaa ctgaaagagc tgggttatgc 4381 agccgtggat gatgaaacca cgcaacagac aatgcgtgag ttaaaagaac tgggctacac 4441 ttcggagccg cacgctgccg tagcttatcg tgcgctgcgt gatcagttga atccaggcga 4501 atatggcttg ttcctcggca ccgcgcatcc ggcgaaattt aaagagagcg tggaagcgat 4561 tctcggtgaa acgttggatc tgccaaaaga gctggcagaa cgtgctgatt tacccttgct 4621 ttcacataat ctgcccgccg attttgctgc gttgcgtaaa ttgatgatga atcatcagta 4681 aaatctattc attatctcaa tcaggccggg tttgctttta tgcagcccgg cttttttatg 4741 aagaaattat ggagaaaaat gacagggaaa aaggagaaat tctcaataaa tgcggtaact 4801 tagagattag gattgcggag aataacaacc gccgttctca tcgagtaatc tccggatatc 4861 gacccataac gggcaatgat aaaaggagta acctgtgaaa aagatgcaat ctatcgtact 4921 cgcactttcc ctggttctgg tcgctcccat ggcagcacag 
gctgcggaaa ttacgttagt 4981 cccgtcagta aaattacaga taggcgatcg tgataatcgt ggctattact gggatggagg 5041 tcactggcgc gaccacggct ggtggaaaca acattatgaa tggcgaggca atcgctggca 5101 cctacacgga ccgccgccac cgccgcgcca ccataagaaa gctcctcatg atcatcacgg 5161 cggtcatggt cctggcaaac atcaccgcta aatgacaaat gccgggtaac aatccggcat 5221 tcagcgcctg atgcgacgct ggcgcgtctt atcaggccta cgttaattct gcaatatatt 5281 gaatctgcat gcttttgtag gcaggataag gcgttcacgc cgcatccggc attgactgca 5341 aacttaacgc tgctcgtagc gtttaaacac cagttcgcca ttgctggagg aatcttcatc 5401 aaagaagtaa ccttcgctat taaaaccagt cagttgctct ggtttggtca gccgattttc 5461 aataatgaaa cgactcatca gaccgcgtgc tttcttagcg tagaagctga tgatcttaaa 5521 tttgccgttc ttctcatcga ggaacaccgg cttgataatc tcggcattca atttcttcgg 5581 cttcaccgat ttaaaatact catctgacgc cagattaatc accacattat cgccttgtgc 5641 tgcgagcgcc tcgttcagct tgttggtgat gatatctccc cagaattgat acagatcttt 5701 ccctcgggca ttctcaagac ggatccccat ttccagacga taaggctgca ttaaatcgag 5761 cgggcggagt acgccataca agccggaaag cattcgcaaa tgctgttggg caaaatcgaa 5821 atcgtcttcg ctgaaggttt cggcctgcaa gccggtgtag acatcacctt taaacgccag 5881 aatcgcctgg cgggcattcg ccggcgtgaa atctggctgc cagtcatgaa agcgagcggc 5941 gttgataccc gccagtttgt cgctgatgcg catcagcgtg ctaatctgcg gaggcgtcag 6001 tttccgcgcc tcatggatca actgctggga attgtctaac agctccggca gcgtatagcg 6061 cgtggtggtc aacgggcttt ggtaatcaag cgttttcgca ggtgaaataa gaatcagcat 6121 atccagtcct tgcaggaaat ttatgccgac tttagcaaaa aatgagaatg agttgatcga 6181 tagttgtgat tactcctgcg aaacatcatc ccacgcgtcc ggagaaagct ggcgaccgat 6241 atccggataa cgcaatggat caaacaccgg gcgcacgccg agtttacgct ggcgtagata 6301 atcactggca atggtatgaa ccacaggcga gagcagtaaa atggcggtca aattggtaat 6361 agccatgcag gccattatga tatctgccag ttgccacatc agcggaaggc ttagcaaggt 6421 gccgccgatg accgttgcga aggtgcagat ccgcaaacac cagatcgctt tagggttgtt 6481 caggcgtaaa aagaagagat tgttttcggc ataaatgtag ttggcaacga tggagctgaa 6541 ggcaaacaga ataaccacaa gggtaacaaa ctcagcaccc caggaaccca ttagcacccg 6601 catcgccttc tggataagct gaataccttc cagcggcatg taggttgtgc 
cgttacccgc 6661 cagtaatatc agcatggcgc ttgccgtaca gatgaccagg gtgtcgataa aaatgccaat 6721 catctggaca atcccttgcg ctgccggatg cggaggccag gacgccgctg ccgctgccgc 6781 gtttggcgtc gaacccattc ccgcctcatt ggaaaacata ctgcgctgaa aaccgttagt 6841 aatcgcctgg cttaaggtat atcccgccgc gccgcctgcc gcttcctgcc agccaaaagc 6901 actctcaaaa atagaccaaa tgacgtgggg aagttgcccg atattcatta cgcaaattac 6961 caggctggtc agtacccaga ttatcgccat caacgggaca aagccctgca tgagccgggc 7021 gacgccatga agaccgcgag tgattgccag cagagtaaag acagcgagaa taatgcctgt 7081 caccagcggg ggaaaatcaa aagaaaaact cagggcgcgg gcaacggcgt tcgcttgaac 7141 tccgctgaaa attatgccat aggcgatgag caaaaagacg gcgaacagaa cgcccatcca 7201 gcgcatcccc agcccgcgcg ccatatacca tgccggtccg ccacgaaact gcccattgac 7261 gtcacgttct ttataaagtt gtgccagaga acattcggca aacgaggtcg ccatgccgat 7321 aaacgcggca acccacatcc aaaagacggc tccaggtcca ccggcggtaa tagccagcgc 7381 aacgccggcc aggttgccgc tacccacgcg cgccgcaaga ctggtacaca atgactgaaa 7441 tgaggttaaa ccgcctggct gtggatgaat gctattttta agacttttgc caaactggcg 7501 gatgtagcga aactgcacaa atccggtgcg aaaagtgaac caacaacctg cgccgaagag 7561 caggtaaatc attaccgatc cccaaaggac gctgttaatg aaggagaaaa aatctggcat 7621 gcatatccct cttattgccg gtcgcgatga ctttcctgtg taaacgttac caattgttta 7681 agaagtatat acgctacgag gtacttgata acttctgcgt agcatacatg aggttttgta 7741 taaaaatggc gggcgatatc aacgcagtgt cagaaatccg aaacagtctc gcctggcgat 7801 aaccgtcttg tcggcggttg cgctgacgtt gcgtcgtgat atcatcaggg cagaccggtt 7861 acatccccct aacaagctgt ttaaagagaa atactatcat gacggacaaa ttgacctccc 7921 ttcgtcagta caccaccgta gtggccgaca ctggggacat cgcggcaatg aagctgtatc 7981 aaccgcagga tgccacaacc aacccttctc tcattcttaa cgcagcgcag attccggaat 8041 accgtaagtt gattgatgat gctgtcgcct gggcgaaaca gcagagcaac gatcgcgcgc 8101 agcagatcgt ggacgcgacc gacaaactgg cagtaaatat tggtctggaa atcctgaaac 8161 tggttccggg ccgtatctca actgaagttg atgcgcgtct ttcctatgac accgaagcgt 8221 caattgcgaa agcaaaacgc ctgatcaaac tctacaacga tgctggtatt agcaacgatc 8281 gtattctgat caaactggct tctacctggc agggtatccg tgctgcagaa cagctggaaa 
8341 aagaaggcat caactgtaac ctgaccctgc tgttctcctt cgctcaggct cgtgcttgtg 8401 cggaagcggg cgtgttcctg atctcgccgt ttgttggccg tattcttgac tggtacaaag 8461 cgaataccga taagaaagag tacgctccgg cagaagatcc gggcgtggtt tctgtatctg 8521 aaatctacca gtactacaaa gagcacggtt atgaaaccgt ggttatgggc gcaagcttcc 8581 gtaacatcgg cgaaattctg gaactggcag gctgcgaccg tctgaccatc gcaccggcac 8641 tgctgaaaga gctggcggag agcgaagggg ctatcgaacg taaactgtct tacaccggcg 8701 aagtgaaagc gcgtccggcg cgtatcactg agtccgagtt cctgtggcag cacaaccagg 8761 atccaatggc agtagataaa ctggcggaag gtatccgtaa gtttgctatt gaccaggaaa 8821 aactggaaaa aatgatcggc gatctgctgt aatcattctt agcgtgaccg ggaagtcggt 8881 cacgctacct cttctgaagc ctgtctgtca ctcccttcgc agtgtatcat tctgtttaac 8941 gagactgttt aaacggaaaa atcttgatga atactttacg tattggctta gtttccatct 9001 ctgatcgcgc atccagcggc gtttatcagg ataaaggcat ccctgcgctg gaagaatggc 9061 tgacatgcgc taaccacgcc gtttgaactg gaaacccgct taatccccga tgagcaggcg 9121 atcatcgagc aaacgttgtg tgagctggtg gatgaaatga gttgccatct ggtgctcacc 9181 acgggcggaa ctggcccggc gcgtcgtgac gtaacgcccg atgcgacgct ggcagtagcg 9241 gaccgcgaga tgcctggctt tggtgaacag atgcgccaga tcagcctgca ttttgtacca 9301 actgcgatcc tttcgcgtca ggtgggcgtg attcgcaaac aggcgctgat ccttaactta 9361 cccggtcagc cgaagtctat taaagagacg ctggaaggtg tgaaggacgc tgagggtaac 9421 gttgtggtac acggtatttt tgccagcgta ccgtactgca ttcagttgct ggaagggcca 9481 tacgttgaaa cggcaccgga agtggttgca gcattcagac cgaagagtgc aagacgcgac 9541 gttagcgaat aaaaaatacc cgagcggggg gatctcaaaa caattagtgg gattcaccaa 9601 tcggcagaac ggtgcgacca aactgctcgt tcagtacttc acccatcgcc agatagattg 9661 cgctggcacc gcagatcagc ccaatccagc cggcaaagtg gatgattgcg gcgttaccgg 9721 caatgttacc gatcgccagc agggcaaaca gcacggtcag gctaaagaaa acgaattgca 9781 gaacgcgtgc gcctttcagc gtgccgaaga acataaacac ggtaaatacg ccccacagac 9841 ccaggtagac accaaggaac tgtgcatttg gcgcatcggt cagacccagt ttcggcatca 9901 gcagaatcgc aaccagcgtc agccagaaag aaccgtaaga ggtgaatgcg gttaaaccga 9961 aagtgttgcc ttttttgtac tccagcagac cagcaaaaat ttgcgcgatg ccgccgtaga 10021 
aaatgcccat ggcaagaata ataccgtcca gagcgaaata acccacgttg tgcaggttaa 10081 gcagaatggt ggtcatgccg aagcccatca ggcccagcgg tgccggatta gccaacttag 10141 tgttgcccat aattcctcaa aaatcatcat cgaatgaatg gtgaaataat ttccctgaat 10201 aactgtagtg ttttcagggc gcggcataat aatcagccag tggggcagtg tctacgatct 10261 tttgagggga aaatgaaaat tttcccggtt tccggtatca gacctgagtg gcgctaacca 10321 tccggcgcag gcaggcgatt tgcagtacgg ctggaatcgt cacgcgatag cgtgcgtgac 10381 cgctttaacc ccatttagtg ccgcacctac aggcctccca gcccgcgccg cgcagcaaac 10441 catgcccaag tacgctcatt gctgcgtggg tgcgtaaaat gcgggtcagt tggctggaaa 10501 gcaaatgcga cacacctttt gccaataatt tgtctttcat cagcagcggc agcagctctt 10561 ccagctcatt caccctggca tcgaccgcgt gcagaaactc ctgcttatgt tcctcgtcca 10621 ttttcttcca ggtattacgc agaaattgtt ccagtaactg ttgctcaatt tcaaacgtag 10681 acatctcttt gtcggctttc agcttcaatc gctttgaaac atcgagcaaa atggcccgat 10741 acaatttacc gtgtccgcgc agtttgttgc gatactatcg ccaccaaaat gctgtaattc 10801 tccggcaatc agctgccagt tgcggcgatg ttgctcggga tgcccttcca tcgatttaaa 10861 cagttcgttg cgcatcagta cgctggagag gcgagttttg cctttttcat tatgggtgag 10921 caatcgggcg aaatttgcca actgttcctc actacaatgc tgaagaaaat ccagatctga 10981 atcattcagg taattaacat tcattttttg tggcttctat attctggcgt tagtcgtcgc 11041 cgataatttt cagcgtggcc atatccgatg agttcaccgt atgacccgaa aaggtgattt 11101 ttgagacgca gcgtttattg tcgttatcgc tcttaatgtt gatccagtca gtggtttgcc 11161 cttcttttat ttctgaagga atattcaggc tctgacctgg cgctgacggg cggctgtgaa 11221 ataaaccgat gcaccgctta actgtaaatc gccatggtcg gcagagagtt gtatgcgttt 11281 cacaatgcga caaacaggaa gtttcagcgc cagatcgttg gtttcgttac gcggcattgc 11341 aatgcgccga ggagtttatg gtcgtttgcc tgcgccgtgc agcacagcat caggctaatc 11401 gccaggctgg cggaaatcgt aaaaacggat ttcataagga ttctcttagt gggaagaggt 11461 agggggatga atacccacta gtttactgct gataaagaga agattcaggc acgtaatctt 11521 ttctttttat tacaattttt tgatgaatgc cttggctgcg attcattctt tatatgaata 11581 aaattgctgt caattttacg tcttgtcctg ccatatcgcg aaatttctgc gcaaaagcac 11641 aaaaaatttt tgcatctccc ccttgatgac gtggtttacg accccattta 
gtagtcaacc 11701 gcagtgagtg agtctgcaaa aaaatgaaat tgggcagttg aaaccagacg tttcgcccct 11761 attacagact cacaaccaca tgatgaccga atatatagtg gagacgttta gatgggtaaa 11821 ataattggta tcgacctggg tactaccaac tcttgtgtag cgattatgga tggcaccact 11881 cctcgcgtgc tggagaacgc cgaaggcgat cgcaccacgc cttctatcat tgcctatacc 11941 caggatggtg aaactctagt tggtcagccg gctaaacgtc aggcagtgac gaacccgcaa 12001 aacactctgt ttgcgattaa acgcctgatt ggtcgccgct tccaggacga agaagtacag 12061 cgtgatgttt ccatcatgcc gttcaaaatt attgctgctg ataacggcga cgcatgggtc 12121 gaagttaaag gccagaaaat ggcaccgccg cagatttctg ctgaagtgct gaaaaaaatg 12181 aagaaaaccg ctgaagatta cctgggtgaa ccggtaactg aagctgttat caccgtaccg 12241 gcatacttta acgatgctca gcgtcaggca accaaagacg caggccgtat cgctggtctg 12301 gaagtaaaac gtatcatcaa cgaaccgacc gcagctgcgc tggcttacgg tctggacaaa 12361 ggcactggca accgtactat cgcggtttat gacctgggtg gtggtacttt cgatatttct 12421 attatcgaaa tcgacgaagt tgacggcgaa aaaaccttcg aagttctggc aaccaacggt 12481 gatacccacc tggggggtga agacttcgac agccgtctga tcaactatct ggttgaagaa 12541 ttcaagaaag atcagggcat tgacctgcgc aacgatccgc tggcaatgca gcgcctgaaa 12601 gaagcggcag aaaaagcgaa aatcgaactg tcttccgctc agcagaccga cgttaacctg 12661 ccatacatca ctgcagacgc gaccggtccg aaacacatga acatcaaagt gactcgtgcg 12721 aaactggaaa gcctggttga agatctggta aaccgttcca ttgagccgct gaaagttgca 12781 ctgcaggacg ctggcctgtc cgtatctgat atcgacgacg ttatcctcgt tggtggtcag 12841 actcgtatgc caatggttca gaagaaagtt gctgagttct ttggtaaaga gccgcgtaaa 12901 gacgttaacc cggacgaagc tgtagcaatc ggtgctgctg ttcagggtgg tgttctgact 12961 ggtgacgtaa aagacgtact gctgctggac gttaccccgc tgtctctggg tatcgaaacc 13021 atgggcggtg tgatgacgac gctgatcgcg aaaaacacca ctatcccgac caagcacagc 13081 caggtgttct ctaccgctga agacaaccag tctgcggtaa ccatccatgt gctgcagggt 13141 gaacgtaaac gtgcggctga taacaaatct ctgggtcagt tcaacctaga tggtatcaac 13201 ccggcaccgc gcggcatgcc gcagatcgaa gttaccttcg atatcgatgc tgacggtatc 13261 ctgcacgttt ccgcgaaaga taaaaacagc ggtaaagagc agaagatcac catcaaggct 13321 tcttctggtc tgaacgaaga tgaaatccag 
aaaatggtac gcgacgcaga agctaacgcc 13381 gaagctgacc gtaagtttga agagctggta cagactcgca accagggcga ccatctgctg 13441 cacagcaccc gtaagcaggt tgaagaagca ggcgacaaac tgccggctga cgacaaaact 13501 gctatcgagt ctgcgctgac tgcactggaa actgctctga aaggtgaaga caaagccgct 13561 atcgaagcga aaatgcagga actggcacag gtttcccaga aactgatgga aatcgcccag 13621 cagcaacatg cccagcagca gactgccggt gctgatgctt ctgcaaacaa cgcgaaagat 13681 gacgatgttg tcgacgctga atttgaagaa gtcaaagaca aaaaataatc gccctataaa 13741 cgggtaatta tactgacacg ggcgaagggg aatttcctct ccgcccgtgc attcatctag 13801 gggcaattta aaaaagatgg ctaagcaaga ttattacgag attttaggcg tttccaaaac 13861 agcggaagag cgtgaaatca gaaaggccta caaacgcctg gccatgaaat accacccgga 13921 ccgtaaccag ggtgacaaag aggccgaggc gaaatttaaa gagatcaagg aagcttatga 13981 agttctgacc gactcgcaaa aacgtgcggc atacgatcag tatggtcatg ctgcgtttga 14041 gcaaggtggc atgggcggcg gcggttttgg cggcggcgca gacttcagcg atatttttgg 14101 tgacgttttc ggcgatattt ttggcggcgg acgtggtcgt caacgtgcgg cgcgcggtgc 14161 tgatttacgc tataacatgg agctcaccct cgaagaagct gtacgtggcg tgaccaaaga 14221 gatccgcatt ccgactctgg aagagtgtga cgtttgccac ggtagcggtg caaaaccagg 14281 tacacagccg cagacttgtc cgacctgtca tggttctggt caggtgcaga tgcgccaggg 14341 attcttcgct gtacagcaga cctgtccaca ctgtcagggc cgcggtacgc tgatcaaaga 14401 tccgtgcaac aaatgtcatg gtcatggtcg tgttgagcgc agcaaaacgc tgtccgttaa 14461 aatcccggca ggggtggaca ctggagaccg catccgtctt gcgggcgaag gtgaagcggg 14521 cgagcatggc gcaccggcag gcgatctgta cgttcaggtt caggttaaac agcacccgat 14581 tttcgagcgt gaaggcaaca acctgtattg cgaagtcccg atcaacttcg ctatggcggc 14641 gctgggtggc gaaatcgaag taccgaccct tgatggtcgc gtcaaactga aagtgcctgg 14701 cgaaacccag accggtaagc tattccgtat gcgcggtaaa ggcgtcaagt ctgtccgcgg 14761 tggcgcacag ggtgatttgc tgtgccgcgt tgtcgtcgaa acaccggtag gcctgaacga 14821 aaggcagaaa cagctgctgc aagagctgca agaaagcttc ggtggcccaa ccggcgagca 14881 caacagcccg cgctcaaaga gcttctttga tggtgtgaag aagttttttg acgacctgac 14941 ccgctaacct ccccaaaagc ctgcccgtgg gcaggcctgg gtaaaaatag ggtgcgttga 15001 agatatgcga 
gcacctgtaa agtggcgggg atcactccca taagcgctaa cttaagggtt 15061 gtggtattac gcctgatatg atttaacgtg ccgatgaatt actctcacga taactggtca 15121 gcaattctgg cccatattgg taagcccgaa gaactggata cttcggcacg taatgccggg 15181 gctctaaccc gccgccgcga aattcgtgat gctgcaactc tgctacgtct ggggctggct 15241 tacggccccg gggggatgtc attacgtgaa gtcactgcat gggctcagct ccatgacgtt 15301 gcaacattat ctgacgtggc tctcctgaag cggctgcgga atgccgccga ctggtttggc 15361 atacttgccg cacaaacact tgctgtacgc gccgcagtta cgggttgtac aagcggaaag 15421 agattgcgtc ttgtcgatgg aacagcaatc agtgcgcccg ggggcggcag cgctgaatgg 15481 cgactacata tgggatatga tcctcatacc tgtcagttca ctgattttga gctaaccgac 15541 agcagagacg ctgaacggct ggaccgattt gcgcaaacgg cagacgagat acgcattgct 15601 gaccggggat tcggttcgcg tcccgaatgt atccgctcac ttgcttttgg agaagctgat 15661 tatatcgtcc gggttcactg gcgaggattg cgctggttaa ctgcagaagg aatgcgcttt 15721 gacatgatgg gttttctgcg cgggctggat tgcggtaaga acggtgaaac cactgtaatg 15781 ataggcaatt caggtaataa aaaagccgga gctccctttc cggcacgtct cattgccgta 15841 tcacttcctc ccgaaaaagc attaatcagt aaaacccgac tgctcagcga gaatcgtcga 15901 aaaggacgag tagttcaggc ggaaacgctg gaagcagcgg gccatgtgct attgctaaca 15961 tcattaccgg aagatgaata ttcagcagag caagtggctg attgttaccg tctgcgatgg 16021 caaattgaac tggcttttaa gcggctcaaa agtttgctgc acctggatgc tttgcgtgca 16081 aaggaacctg aactcgcgaa agcgtggata tttgctaatc tactcgccgc atttttaatt 16141 gacgacataa tccagccatc gctggatttc ccccccagaa gtgccggatc cgaaaagaag 16201 aactaactcg ttgtggagaa taacaaaaat ggtcatctgg agcttacagg tggccattcg 16261 tgggacagta tccctgacag cctacaaaac gcaattgaag aacgcgaggc atcgtcttaa 16321 cgaggcaccg aggcgtcgca ttcttcagat ggttcaaccc ttaagttagc gcttatggga 16381 tcactccccg ccgttgctct tactcggatt cgtaagccgt gaaaacagca acctccgtct 16441 ggccagttcg gatgtgaacc tcacagaggt cttttctcgt taccagcgcc gccactacgg 16501 cggtgataca gatgacgatc agggcgacaa tcatcgcctt atgctgcttc attgctctct 16561 tctccttgac cttacggtca gtaagaggca ctctacatgt gttcagcata taggaggcct 16621 cgggttgatg gtaaaatatc actcggggct tttctctatc tgccgttcag ctaatgcctg 
16681 agacagacag cctcaagcac ccgccgctat tatatcgctc tctttaaccc attttgtttt 16741 atcgattcta atcctgaaga cgcctcgcat ttttgtggcg taatttttta atgatttaat 16801 tatttaactt taatttatct cttcatcgca attattgacg acaagctgga ttatttttga 16861 aatattggcc taacaagcat cgccgactga caacaaatta attattactt ttcctaatta 16921 atccctcagg aatcctcacc ttaagctatg attatctagg cttagggtca ctcgtgagcg 16981 cttacagccg tcaaaaacgc atctcaccgc tgatggcgca aattcttcaa tagctcgtaa 17041 aaaacgaatt attcctacac tataatctga ttttaacgat gattcgtgcg gggtaaaata 17101 gtaaaaacga tctattcacc tgaaagagaa ataaaaagtg aaacatctgc atcgattctt 17161 tagcagtgat gcctcgggag gcattattct tatcattgcc gctatcctgg cgatgattat 17221 ggccaacagc ggcgcaacca gtggatggta tcacgacttt ctggagacgc cggttcagct 17281 ccgggttggt tcactcgaaa tcaacaaaaa catgctgtta tggataaatg acgcgctgat 17341 ggcggtattt ttcctgttag tcggtctgga agttaaacgt gaactgatgc aaggatcgct 17401 agccagctta cgccaggccg catttccagt tatcgccgct attggtggga tgattgtgcc 17461 ggcattactc tatctggctt ttaactatgc cgatccgatt acccgcgaag ggtgggcgat 17521 cccggcggct actgacattg cttttgcact tggtgtactg gcgctgttgg gaagtcgtgt 17581 tccgttagcg ctgaagatct ttttgatggc tctggctatt atcgacgatc ttggggccat 17641 cattatcatc gcattgttct acactaatga cttatcgatg gcctctcttg gcgtcgcggc 17701 tgtagcaatt gcggtactcg cggtattgaa tctgtgtggt gcacgccgca cgggcgtcta 17761 tattcttgtt ggcgtggtgt tgtggactgc ggtgttgaaa tcgggggttc acgcaactct 17821 ggcgggggta attgtcggct tctttattcc tttgaaagag aagcatgggc gttctccagc 17881 gaagcgactg gagcatgtgt tgcacccgtg ggtggcgtat ctgattttgc cgctgtttgc 17941 atttgctaat gctggcgttt cactgcaagg cgtcacgctg gatggcttga cctccattct 18001 gccattgggg atcatcgctg gcttgctgat tggcaaaccg ctggggatta gtctgttctg 18061 ctggttggcg ctgcgtttga aactggcgca tctgcctgag ggaacgactt atcagcaaat 18121 tatggtggtg gggatcctgt gcggtatcgg ttttactatg tctatcttta ttgccagcct 18181 ggcctttggt agcgtagatc cagaactgat taactgggcg aaactcggta tcctggtcgg 18241 ttctatctct tcggcggtaa ttggatacag ctggttacgc gttcgtttgc gtccatcagt 18301 ttgacaggac ggtttaccgg ggagccataa acggctccct 
tttcattgtt atcagggaga 18361 gaaatgagca tgtctcatat caattacaac cacttgtatt acttctggca tgtctataaa 18421 gaaggttccg tggttggcgc agcggaggcg ctttatttaa ctccacaaac cattaccgga 18481 cagattcgag cgctggaaga cgccctgcaa gcgaaattat ttaaacgcaa ggggacgtgg 18541 tctcgaaccc agcgagctgg agaactggtc tatcgctatg ccgataaaat gttcacctta 18601 agccaggaaa tgctggatat tgtgaactat cgcaaagaat ccaatttatt gtttgacgtt 18661 ggcgtggctg atgcactttc caaacgcctg gtcagtagcg tacttaacgc cgcagtggta 18721 gaaggcgagc ccattcatct tcgctgcttc gaatccaccc acgaaatgct gctggagcaa 18781 ttaagtcagc ataaactgga gatgatcatt tctgactgtc cgatagactc tacgcagcag 18841 gaaggcctgt tctccgtgag aattggcgaa tgtggcgtga gtttctggtg tacaaatcca 18901 ccaccagaaa aaccgttccc ggcttgtctg gaagaacggc gacttttgat tcctgggcga 18961 cgttcaatgt tagggcgcaa attgcttaac tggtttaact cccagggatt aaacgtagaa 19021 atcctcggcg agtttgatga tgccgctttg atgaaagctt ggctgcaggt ccttttggtg 19081 gcgatgcaaa tgcaatcttc gttgccccaa cgctttatgc atatgacttt tatgccgata 19141 aaactgtcgt agaaattggt cgcgtcgaga atgtgatgga agagtaccat gctatttttg 19201 ctgagcggat gattcagcac ccggcggtac agcgaatctg caatacggat tattctgcgc 19261 tttttagtcc agcggtgcgt taatcggcag ctcccccaaa gttaaggtgg gggagataga 19321 ttagttgtac attaccacga ttttgactcg gctcattatt tgcccgcttg agacattgtt 19381 tccatatgta cgcgggcgaa taaatagagg aatctgatta cttccttcat ggggatgctg 19441 aaaagagtag taattgctgg taatgactcc aacttattga tagtgtttta tgttcagata 19501 atgcccgatg actttgtcat gcagctccac cgattttgag aacgacagcg acttccgtcc 19561 cagccgtgcc aggtgctgcc tcagattcag gttatgccgc tcaattcgct gcgtatatcg 19621 cttgctgatt acgtgcagct ttcccttcag gcgggattca tacagcggcc agccatccgt 19681 catccatatc accacgtcaa agggtgacag caggctcata agacgcccca gcgtcgccat 19741 agtgcgttca ccgaatacgt gcgcaacaac cgtcttccgg agactgtcat acgcgtaaaa 19801 cagccagcgc tggcgcgatt tagccccgac atagccccac tgttcgtcca tttccgcgca 19861 gacgatgacg tcactgcccg gctgtatgcg cgaggttacc gactgcggcc tgagtttttt 19921 aagtgacgta aaatcgtgtt gaggccaacg cccataatgc gggctgttgc ccggcatcca 19981 acgccattca tggccatatc 
aatgattttc tggtgcgtac cgggttgaga agcggtgtaa 20041 gtgaactgca gttgccatgt tttacggcag tgagagcaga gatagcgctg atgtccggcg 20101 gtgcttttgc cgttacgcac caccccgtca gtagctgaac aggagggaca gctgatagaa 20161 acagaagcca ctggagcacc tcaaaaacac catcatacac taaatcagta agttggcagc 20221 atcacctacc tcaatgtgta tcacaatatc catattcttt gtgggggagt ctggagattg 20281 agtagatatt cttgttcaga atgtatcagc cgatggttct acgattctta agccacgaag 20341 agttcagata gtacaacggc atgtctcttt tgactatctg gcaaccggca gtgtgttctc 20401 tcacgcatca caaaagcagc aggcataaaa aaacccgctt gcgcgggctt tttcacaaag 20461 cttcagcaaa ttggcgatta agccagtttg ttgatctgtg cagtcaggtt agccttatga 20521 cgtgcagctt tgtttttgtg gatcagacct ttagcagcct gacggtccac gatcggttgc 20581 atttcgttaa atgctttctg tgcagcagct ttgtcgccag cttcgatagc tgcgtatact 20641 ttcttgatga aagtacgcat catagagcga cggcttgcgt tgtgcttacg agccttttca 20701 gactgaatgg cgcgcttctt agctgatttg atattagcca aggtccaact cccaaatgtg 20761 ttctatatgg acaattcaaa ggccgaggaa tatgcccttt tagccttctt ttgtcaatgg 20821 atttgtgcaa ataagcgccg ttaatgtgcc ggcactcgtt acgtagtgat ggcgcaggat 20881 tctaccagct tgcggggtgt gaatacagct tttccgcgat aaaaattgca gcaggcggtc 20941 agtttcttcc cgtgatttgc gccatggcaa tgaaaagcca cttctttctg atttcggtac 21001 tcaatcgccg gttaaccttg accgctgtac aaggtctact cggacgattt tcactgtttt 21061 gagccagaca tgaagctgat acgcggcata cataatctca gccaggcccc gcaagaaggg 21121 tgtgtgctga ctattggtaa tttcgacggc gtgcatcgcg gtcatcgcgc gctgttacag 21181 ggcttgcagg aagaagggcg caagcgcaac ttaccggtga tggtgatgct ttttgaacct 21241 caaccactgg aactgtttgc taccgataaa gccccggcaa gactgacccg gctgcgggaa 21301 aaactgcgtt accttgcaga gtgtggcgtt gattacgtgc tgtgcgtgcg tttcgacagg 21361 cgtttcgcgg cgttaaccgc gcaaaatttc gtcagcgatc ttctggtgaa gcatttgcgc 21421 gtaaaatttc ttgccgtagg tgatgatttc cctttggcgc tggtcgtgaa ggcgatttct 21481 tgttattaca gaaagctggc atggaatacg gcttcgatat caccagtacg caaacttttt 21541 gcagaggtgg cgtgcgcatc agcagcacgg ctgcgtcagg cccttgcgga tgacaatctg 21601 gctctggcag agagtttact ggggcacccg tttgccatct ccgggcgtgt agtccacggt 21661 
gatgaattag ggcgcactat aggtttcccg acggcgaatg taccgccgcg ccgtcaggtt 21721 tccccggtga aaggggttta tgcggtagaa gtgctgggcc tcggtgaaaa gccgttaccc 21781 ggcgtggcaa acatcggaac acgcccaacg gttgccggta ttcgccagca gctggaagtg 21841 catttgttag atgttgcaat ggacctttac ggtcgccata tacaagtagt gctgcgtaaa 21901 aaaatacgca atgagcagcg atttgcgtcg ctggacgaac tgaaagcgca gattgcgcgt 21961 gatgaattaa ccgcccgcga attttttggg ctaacaaaac cggcttaagc ctgttatgta 22021 atcaaaccga aatacggaac cgagaatctg atgagtgact ataaatcaac cctgaatttg 22081 ccggaaacag ggttcccgat gcgtggcgat ctcgccaagc gcgaacccgg aatgctggcg 22141 cgttggactg atgatgatct gtacggcatc atccgtgcgg ctaaaaaagg caaaaaaacc 22201 ttcattctgc atgatggccc tccttatgcg aatggcagca ttcatattgg tcactcggtt 22261 aacaagattc tgaaagacat tatcgtgaag tccaaagggc tttccggtta tgactcgccg 22321 tatgtgcctg gctgggactg ccacggtctg ccgatcgagc tgaaagtcga gcaagaatac 22381 ggtaagccgg gtgagaaatt caccgccgcc gagttccgcg ccaagtgccg cgaatacgcg 22441 gcgacccagg ttgacggtca acgcaaagac tttatccgtc tgggcgtgct gggcgactgg 22501 tcgcacccgt acctgaccat gggacttcaa aactggaagg ccaacatcat ccgcgcgctg 22561 ggcaaaatca tcggcaacgg tcacctgcac aaaggcgcga agccagttca ctggtgcgtt 22621 gactgccgtt ctgcgctggc gaagcggaag ttgagtatta cgacaaaact tctccgtcca 22681 tcgacgttgc tttccagggc agtcgatcag gatgcactga aagcaaaatt tgccgtaagc 22741 aacgttaacg gcccaatctc gctggtaatc tggaccacca cgccgtggac tctgcctgcc 22801 aaccgcgcaa tctctattgc accagatttc gactatgcgc tggtgcagat cgacggtcag 22861 gccgtgattc tggcgaaaga tctggttgaa agcgtaatgc agcgtatcgg cgtgaccgat 22921 tctcggcacg gtaaaaggtg cggagctgga gccgctgcgt ttacccatcc gtttatgggc 22981 ttcgacgttc cggcaatcct cggcgatcac gttaccctgg atgccggtac cggtgccgtt 23041 cacaccgcgc ctggccacgg cccggacgac tatgtgatcg gtcagaaata cggcctggaa 23101 accgctaacc cggttggccc ggacggcact tatctgccgg gcacttatcc gacgctggat 23161 ggcgtgaacg tcttcaaagc gaacgacatc gtcgctgcgc tgctgcagga aaaaggcgcg 23221 ctgctgcacg ttgagaaaat gcagcacagc tatccgtgct gctggcgtca caaaacgccg 23281 atcatcttcc gcgcgacgcc gcagtggttc gtcagcatgg atcagaaagg 
tctgcgtgcg 23341 cagtcactga aagagatcaa aggcgtgcag tggatcccgg actggggcca ggcgcgtatc 23401 gagtcgatgg ttgctaaccg tcctgactgg tgtatctccc gtcagcgcac ctggggtgta 23461 ccgatgtcac tgttcgtgca caaagacacg gaagagctgc atccgcgtac ccttgaactg 23521 atggaagaag tggcaaaacg cgttgaagtc gatggcatcc aggcgtggtg ggatctcgat 23581 gcgaaagaga tcctcggcga cgaagctgat cagtacgtga aagtgccgga cacattggat 23641 gtatggtttg actccggatc tacccactct tctgttgttg acgtgcgtcc ggaatttgcc 23701 ggtcacgcag cggacatgta tctggaaggt tctgaccaac accgcggctg gttcatgtct 23761 tccctaatga tctccaccgc gatgaagggt aaagcgccgt atcgtcaggt actcacccac 23821 ggctttaccg tggatggtca gggccgcaag atgtctaaat ccatcggcaa taccgtttcg 23881 ccgcaggatg tgatgaacaa actgggcgcg gatattctgc gtctgtgggt ggcatcaacc 23941 gactacaccg gtgaaatggc cgtttctgac gagatcctga aacgtgctgc cgatacgtat 24001 cgtcgtatcc gtaacaccgc gcgcttcctg ctggcaaacc tgaacggttt tgatccagca 24061 aaagatatgg tgaaacggag agagatggtg gtactggatc gctgggccgt agttgtgcga 24121 aagcggcaca ggaagacatc ctcaaggcgt acgaagcata cgatttccac gaagtggtac 24181 aagcgtctga tgcgcttctg ctccgttgag atgggttcct tctacctcga catcatcaaa 24241 gaccgtcagt actacgccaa aggacacagt gtggcgcgtc gtagctgcca gactgcgcta 24301 tatcacatcg cagaagcgct ggtgcgctgg atggcaccaa tcctctcctt caccgctgat 24361 gaagtgtggg gctacctgcc gggcgaacgt gaaaaatacg tcttcaccgg tgagtggtac 24421 gaaggcctgt ttggcctggc agacagtgaa gcgatgaacg atgcgttctg ggacgagctg 24481 ttgaaagtgc gtggcgaagt gaacaaagtc attgagcaag cgcgtgccga caagaaagtg 24541 ggtggctcgc tggaagcggc agtaaccttg tatgcagaac cggaactgtc ggcgaaactg 24601 accgcgctgg gcgatgaatt acgatttgtc ctgttgacct ccggcgctac cgttgcagac 24661 tataacgacg cacctgctga tgctcagcag agcgaagtac tcaaagggct gaaagtcgcg 24721 ttgagtaaag ccgaaggtga gaagtgccca cgctgctggc actacaccca ggatgtcggc 24781 aaggtggcgg aacacgcaga aatctgcggc cgctgtgtca gcaacgtcgc cggtgacggt 24841 gaaaaacgta agtttgcctg atgagtcaat cgatctgttc aacagggcta cgctggctgt 24901 ggctggtggt agtcgtgctg attatcgatc tgggcagcaa atacctgatc ctccagaact 24961 ttgctctggg ggatacggtc ccgctgttcc 
cgtcgcttaa tctgcattat gcgcgtaact 25021 atggcgcggc gtttagtttc cttgccgata gcggcggctg gcagcgttgg ttctttgccg 25081 gtattgcgat tggtattagc gtgatcctgg cagtgatgat gtatcgctcg aaggccacgc 25141 agaagctaaa caatatcgct tacgcgctga ttattggcgg cgcgctgggc aacctgttcg 25201 accgcctgtg gcacggcttc gttgtcgata tgatcgactt ctacgtcggc gactggcact 25261 tcgccacctt caaccttgcc gatactgcca tctgtgtcgg tgcggcactg attgtgctgg 25321 aaggtttttt gccttctaga gcgaaaaaac aataataaac cctgccggat gcgatgctga 25381 cgcatcttat ccggcctaca gattgctgcg aaatcgtagg ccggataagg cgtttacgcc 25441 gcatccggca aaaatcctta aatataagag caaacctgca tgtctgaatc tgtacagagc 25501 aatagcgccg tcctggtgca cttcacgcta aaactcgacg atggcaccac cgccgagtct 25561 acccgcaaca acggtaaacc ggcgctgttc cgcctgggtg atgcttctct ttctgaaggg 25621 ctggagcaac acctgttggg gctgaaagtg ggcgataaaa ccaccttctc gttggagcca 25681 gatgcggcgt ttggcgtgcc gtcaccggac ctgattcagt acttctcccg ccgtgaattt 25741 atggatgcag gcgagccaga aattggcgca atcatgcttt ttaccgcaat ggatggcagt 25801 gagatgcctg gcgtgatccg cgaaattaac ggcgactcca ttaccgttga tttcaaccat 25861 ccgctggccg ggcagaccgt tcattttgat attgaagtgc tggaaatcga tccggcactg 25921 gaggcgtaac atgcagatcc tgttggccaa cccgcgtggt ttttgtgccg gggtagaccg 25981 cgctatcagc attgttgaaa acgcgctggc catttacggc gcaccgatat atgtccgtca 26041 cgaagtggta cataaccgct atgtggtcga tagcttgcgt gagcgtgggg ctatctttat 26101 tgagcagatt agcgaagtac cggacggcgc gatcctgatt ttctccgcac acggtgtttc 26161 tcaggcggta cgtaacgaag caaaaagtcg cgatttgacg gtgtttgatg ccacctgtcc 26221 gctggtgacc aaagtgcata tggaagtcgc ccgcgccagt cgccgtggcg aagaatctat 26281 tctcatcggt cacgccgggc acccggaagt ggaagggaca atgggccagt acagtaaccc 26341 ggaaggggga atgtatctgg tcgaatcgcc ggacgatgtg tggaaactga cggtcaaaaa 26401 cgaagagaag ctctccttta tgacccagac cacgctgtcg gtggatgaca cgtctgatgt 26461 gatcgacgcg ctgcgtaaac gcttcccgaa aattgtcggt ccgcgcaaag atgacatctg 26521 ctacgccacg actaaccgtc aggaagcggt acgcgccctg gcagaacagg cggaagttgt 26581 gttggtggtc ggttcgaaaa actcctccaa ctccaaccgt ctggcggagc tggcccagcg 26641 tatgggcaaa 
cgcgcgtttt tgattgacga tgcgaaagac atccaggaag agtgggtgaa 26701 agaggttaaa tgcgtcggcg tgactgcggg cgcatcggct ccggatattc tggtgcagaa 26761 tgtggtggca cgtttgcagc agctgggcgg tggtgaagcc attccgctgg aaggccgtga 26821 agaaaacatt gttttcgaag tgccgaaaga gctgcgtgtc gatattcgtg aagtcgatta 26881 agtcattagc agcctaagtt atgcgaaaat gccggtcttg ttaccggcat tttttatgga 26941 gaaaacatgc gtttacctat cttcctcgat actgaccccg gcattgacga tgccgtcgcc 27001 attgccgccg cgatttttgc acccgaactc gacctgcaac tgatgaccac cgtcgcgggt 27061 aatgtctcgg ttgagaaaac tacccgcaat gccctgcaac tgctgcattt ctggaatgcg 27121 gagattccgc tcgcccaagg ggccgctgtg ccactggtac gcgcaccgcg tgatgcggca 27181 tctgtgcacg gcgaatcggg aatggctggc tacgactttg ttgagcacaa ccgaaagccg 27241 ctcgggatac cggcgtttct ggcgattcgg gatgccctga tgcgtgcacc agagcctgtt 27301 accctggtgg ccatcggccc gttaaccaat attgcgctgt tactttcaca atgcccggaa 27361 tgcaagccgt atattcgccg tctggtgatc atgggtggtt ctgccggacg cggcaactgt 27421 acgccaaacg ccgagtttaa tattgctgcc gatccagaag ctgctgcctg tgtcttccgc 27481 agtggtattg aaatcgtcat gtgcggtttg gatgtcacca atcaggcaat attaactcct 27541 gactatctct ctacactgcc gcagttaaac cgtaccggga aaatgcttca cgccctgttt 27601 agccactacc gtagcggcag tatgcaaagc ggcttgcgaa tgcacgatct ctgcgccatc 27661 gcctggctgg tgcgcccgga cctgttcact ctcaaaccct gttttgtggc agtggaaact 27721 cagggcgaat ttacctcagg cacgacggtg gttgatatcg acggttgcct gggcaagcca 27781 gccaatgtac aggtggcatt ggatctggat gtgaaaggct tccagcagtg ggtggctgag 27841 gtgctggctc tggcgtcgta acctgtcaca tgttattggc atgcagtcat tcatcgactc 27901 atgcctttca ctgatatccc tccctgttta tcattaattt ctaattatca gcgtttttgg 27961 ctggcggcgt agcgatgcgc tggttactct gaaaacggtc tatgcaaatt aacaaaagag 28021 aatagctatg catgatgcaa acatccgcgt tgccatcgcg ggagccgggg ggcgtatggg 28081 ccgccagttg attcaggcgg cgctggcatt agagggcgtg cagttgggcg ctgcgctgga 28141 gcgtgaagga tcttctttac tgggcagcga cgccggtgag ctggccggag ccgggaaaac 28201 aggcgttacc gtgcaaagca gcctcgatgc ggtaaaagat gattttgatg tgtttatcga 28261 ttttacccgt ccggaaggta cgctgaacca tctcgctttt tgtcgccagc atggcaaagg 
28321 gatggtgatc ggcactacgg ggtttgacga agccggtaaa caagcaattc gtgacgccgc 28381 tgccgatatt gcgattgtct ttgcggccaa ttttagcgtt ggcgttaacg tcatgcttaa 28441 gctgctggag aaagcagcca aagtgatggg tgactacacc gatatcgaaa ttattgaagc 28501 acatcataga cataaagttg atgcgccgtc aggcaccgca ctggcaatgg gagaggcgat 28561 cgcccacgcc cttgataaag atctgaaaga ttgcgcggtc tacagtcgtg aaggccacac 28621 cggtgaacgt gtgcctggca ccattggttt tgccaccgtg cgtgcaggtg acatcgttgg 28681 tgaacatacc gcgatgtttg ccgatattgg cgagcgtctg gagatcaccc ataaggcgtc 28741 cagccgtatg acatttgcta acggcgcggt aagatcggct ttgtggttga gtggtaagga 28801 aagcggtctt tttgatatgc gagatgtact tgatctcaat aatttgtaac cacaaaatat 28861 ttgttatggt gcaaaaataa cacatttaat ttattgatta taaagggctt taatttttgg 28921 cccttttatt tttggtgtta tgtttttaaa ttgtctataa gtgccaaaaa ttacatgttt 28981 tgtcttctgt ttttgttgtt ttaatgtaaa ttttgaccat ttggtccact tttttctgct 29041 cgtttttatt tcatgcaatc ttcttgctgc gcaagcgttt tccagaacag gttagatgat 29101 ctttttgtcg cttaatgcct gtaaaacatg catgagccac aaaataatat aaaaaatccc 29161 gccattaagt tgacttttag cgcccatatc tccagaatgc cgccgtttgc cagaaattcg 29221 tcggtaagca gatttgcatt gatttacgtc atcattgtga attaatatgc aaataaagtg 29281 agtgaatatt ctctggaggg tgttttgatt aagtcagcgc tattggttct ggaagacgga 29341 acccagtttc acggtcgggc cataggggca acaggttcgg cggttgggga agtcgttttc 29401 aatacttcaa tgaccggtta tcaagaaatc ctcactgatc cttcctattc tcgtcaaatc 29461 gttactctta cttatcccca tattggcaat gtcggcacca atgacgccga tgaagaatct 29521 tctcaggtac atgcacaagg tctggtgatt cgcgacctgc cgctgattgc cagcaacttc 29581 cgtaataccg aagacctctc ttcttacctg aaacgccata acatcgtggc gattgccgat 29641 atcgataccc gtaagctgac gcgtttactg cgcgagaaag gcgcacagaa tggctgcatt 29701 atcgcgggcg ataacccgga tgcggcgctg gcgttagaaa aagcccgcgc gttcccaggt 29761 ctgaatggca tggatctggc aaaagaagtg accaccgcag aagcctatag ctggacacaa 29821 gggagctgga cgttgaccgg tggcctgcca gaagcgaaaa aagaagacga gctgccgttc 29881 cacgtcgtgg cttatgattt tggtgccaag cgcaacatcc tgcggatgct ggtggataga 29941 ggctgtcgcc tgaccatcgt tccggcgcaa acttctgcgg 
aagatgtgct gaaaatgaat 30001 ccagacggca tcttcctctc caacggtcct ggcgacccgg ccccgtgcga ttacgccatt 30061 accgccatcc agaaattcct cgaaaccgat attccggtat tcggcatctg tctcggtcat 30121 cagctgctgg cgctggcgag cggtgcgaag actgtcaaaa tgaaatttgg tcaccacggc 30181 ggcaaccatc cggttaaaga tgtggagaaa aacgtggtaa tgatcaccgc ccagaaccac 30241 ggttttgcgg tggacgaagc aacattacct gcaaacctgc gtgtcacgca taaatccctg 30301 ttcgacggta cgttacaggg cattcatcgc accgataaac cggcattcag cttccagggg 30361 caccctgaag ccagccctgg tccacacgac gccgcgccgt tgttcgacca ctttatcgag 30421 ttaattgagc agtaccgtaa aaccgctaag taatcaggag taaaagagcc atgccaaaac 30481 gtacagatat aaaaagtatc ctgattctgg gtgcgggccc gattgttatc ggtcaggcgt 30541 gtgagtttga ctactctggc gcgcaagcgt gtaaagccct gcgtgaagag ggttaccgcg 30601 tcattctggt gaactccaac ccggcgacca tcatgaccga cccggaaatg gctgatgcaa 30661 cctacatcga gccgattcac tgggaagttg tacgcaagat tattgaaaaa gagcgcccgg 30721 acgcggtgct gccaacgatg ggcggtcaga cggcgctgaa ctgcgcgctg gagctggaac 30781 gtcagggcgt gttggaagag ttcggtgtca ccatgattgg tgccactgcc gatgcgattg 30841 ataaagcaga agaccgccgt cgtttcgacg tagcgatgaa gaaaattggt ctggaaaccg 30901 cgcgttccgg tatcgcacac acgatggaag aagcgctggc ggttgccgct gacgtgggct 30961 tcccgtgcat tattcgccca tcctttacca tgggcggtag cggcggcggt atcgcttata 31021 accgtgaaga gtttgaagaa atttgcgccc gcggtctgga tctctctccg accaaagagt 31081 tgctgattga tgagtcgctg atcggctgga aagagtacga gatggaagtg gtgcgtgata 31141 aaaacgacaa ctgcatcatc gtctgctcta tcgaaaactt cgatgcgatg ggcatccaca 31201 ccggtgactc catcactgtc gcgccagccc aaacgctgac cgacaaagaa tatcaaatca 31261 tgcgtaacgc ctcgatggcg gtgctgcgtg aaatcggcgt tgaaaccggt ggttccaacg 31321 ttcagtttgc ggtgaacccg aaaaacggtc gtctgattgt tatcgaaatg aacccacgcg 31381 tgtcccgttc ttcggcgctg gcgtcgaaag cgaccggttt cccgattgct aaagtggcgg 31441 cgaaactggc ggtgggttac accctcgacg aactgatgaa cgacatcact ggcggacgta 31501 ctccggcctc cttcgagccg tccatcgact atgtggttac taaaattcct cgcttcaact 31561 tcgaaaaatt cgccggtgct aacgaccgtc tgaccactca gatgaaatcg gttggcgaag 31621 tgatggcgat tggtcgcacg 
cagcaggaat ccctgcaaaa agcgctgcgc ggcctggaag 31681 tcggtgcgac tggattcgac ccgaaagtga gcctggatga cccggaagcg ttaaccaaaa 31741 tccgtcgcga actgaaagac gcaggcgcag atcgtatctg gtacatcgcc gatgcgttcc 31801 gtgcgggcct gtctgtggac ggcgtcttca acctgaccaa cattgaccgc tggttcctgg 31861 tacagattga agagctggtg cgtctggaag agaaagtggc ggaagtgggc atcactggcc 31921 tgaacgctga cttcctgcgc cagctgaaac gcaaaggctt tgccgatgcg cgcttggcaa 31981 aactggcggg cgtacgcgaa gcggaaatcc gtaagctgcg tgaccagtat gacctgcacc 32041 cggtttataa gcgcgtggat acctgtgcgg cagagttcgc caccgacacc gcttacatgt 32101 actccactta tgaagaagag tgcgaagcga atccgtctac cgaccgtgaa aaaatcatgg 32161 tgcttggcgg cggcccgaac cgtatcggtc agggtatcga attcgactac tgttgcgtac 32221 acgcctcgct ggcgctgcgc gaagacggtt acgaaaccat tatggttaac tgtaacccgg 32281 aaaccgtctc caccgactac gacacttccg accgcctcta cttcgagccg gtaactctgg 32341 aagatgtgct ggaaatcgtg cgtatcgaga agccgaaagg cgttatcgtc cagtacggcg 32401 gtcagacccc gctgaaactg gcgcgcgcgc tggaagctgc tggcgtaccg gttatcggca 32461 ccagcccgga tgctatcgac cgtgcagaag accgtgaacg cttccagcat gcggttgagc 32521 gtctgaaact gaaacaaccg gcgaacgcca ccgttaccgc tattgaaatg gcggtagaga 32581 aggcgaaaga gattggctac ccgctggtgg tacgtccgtc ttacgttctc ggcggtcggg 32641 cgatggaaat cgtctatgac gaagctgacc tgcgtcgcta cttccagacg gcggtcagcg 32701 tgtctaacga tgcgccagtg ttgctggacc acttcctcga tgacgcggta gaagttgacg 32761 tggatgccat ctgcgacggc gaaatggtgc tgattggcgg catcatggag catattgagc 32821 aggcgggcgt gcactccggt gactccgcat gttctctgcc agcctacacc ttaagtcagg 32881 aaattcagga tgtgatgcgc cagcaggtgc agaaactggc cttcgaattg caggtgcgcg 32941 gcctgatgaa cgtgcagttt gcggtgaaaa acaacgaagt ctacctgatt gaagttaacc 33001 cgcgtgcggc gcgtaccgtt ccgttcgtct ccaaagccac cggcgtaccg ctggcaaaag 33061 tggcggcgcg cgtgatggct ggcaaatcgc tggctgagca gggcgtaacc aaagaagtta 33121 tcccgccgta ctactcggtg aaagaagtgg tgctgccgtt caataaattc ccgggcgttg 33181 acccgctgtt agggccagaa atgcgctcta ccggggaagt catgggcgtg ggccgcacct 33241 tcgctgaagc gtttgccaaa gcgcagctgg gcagcaactc caccatgaag aaacacggtc 33301 
gtgcgctgct ttccgtgcgc gaaggcgata aagaacgcgt ggtggacctg gcggcaaaac 33361 tgctgaaaca gggcttcgag ctggatgcga cccacggcac ggcgattgtg ctgggcgaag 33421 caggtatcaa cccgcgtctg gtaaacaagg tgcatgaagg ccgtccgcac attcaggacc 33481 gtatcaagaa tggcgaatat acctacatca tcaacaccac ctcaggccgt cgtgcgattg 33541 aagactcccg cgtgattcgt cgcagtgcgc tgcaatataa agtgcattac gacaccaccc 33601 tgaacggcgg ctttgccacc gcgatggcgc tgaatgccga tgcgactgaa aaagtaattt 33661 cggtgcagga aatgcacgca cagatcaaat aatagcgtgt catggcagat atttttcatc 33721 cgctaatttg atcgaataac taatacggtt ctctgatgag gaccgttttt ttttgcccat 33781 taagtaaatc ttttggggaa tcgatatttt tgatgacata agcaggattt agctcacact 33841 tatcgacggt gaagttgcat actatcgata tatccacaat tttaatatgg ccttgtttaa 33901 ttgcttcaaa acgagtcata gccagacttt taatttgtga aactggagtt cgtatgtgtg 33961 aaggatatgt tgaaaaacca ctctacttgt taatcgccga atggatgatg gctgaaaatc 34021 ggtgggtgat agcaatagag agatctctat tcatttcgat attgaacaca gcaaggcggt 34081 taataccctg acttatattc tgtcggaagt cacagaaata agctgcgaag ttaagatgat 34141 ccctaataag ctggaagggc ggggatgcca gtgtcagcga ctggttaaag tggtcgatat 34201 ccatgagcaa atttacgcgc gcctgcgcaa taacagtcgg gaaaaattag tcggtgtaag 34261 aaagacgccg cgtattcctg ccgttccgct cacggaactt aaccgcgagc agaagtggca 34321 gatgatgttg tcaaagagat tgcgtcgtta attttatctc gttgataccg ggcgtcctgc 34381 ttgaccagat gcgatgttgt agcatcttat ccagcaacca ggtcgcatcc ggcaagatca 34441 ccgtttaggc gtcacatccg tcgtcccctg caaacggggg cgattttcct ccatttgcct 34501 cagtggctgc gtttcatgta acgatacatg acagcgcccg acaagatcct gatactcttt 34561 ggtattcaac cgtttccagt gtaactcgtc gtcactaaca ttgcgtacag cgcgggctgg 34621 cgtacccatc aacaactggc gtttctcgcc gcgaaagccc gctttgacaa agctcatggc 34681 ggcaacaatg ctctcttcgc caatgaccgc gccatccata atcacgctgt tcatcccgac 34741 caatgcatcg cgaccaatca aacaaccatg caggatcgct ccgtgcccga tatggccgtt 34801 ttccccaacg atagtgtcag tgtcgcagta gccatgcata atgcagccat cctgaatatt 34861 ggctcccgct tgcacgatca accgcccgta gtcaccacgc agtgaggcga gtgggccgat 34921 gtagacaccg gctcccacaa tcacatcgcc aatcaagacg gcactgggat 
ggacaaacgc 34981 cgtcgggtga accaccggaa ttaacccctc aaaggcgtaa tagctcacgg ttgttaacgt 35041 cctttccaca ccggatcgcg cttctcggca aacgccagcg gcccttcaat ggcatcttcc 35101 gaatgcagaa ccgatggata gtgtttcaac acgccgctgc gaatatagcg atacgcttct 35161 tctaccggca tttcgctggt ggtgcggtag atctctttca gcgccgcaat cgccagcggg 35221 gcgctgttaa ccagctgctg agccagttcg cgggcgttat ccatcagttc cgcctggcta 35281 accacgcggt tgactatccc ccaacgcagc gcctcttctg cgcccattcg tctgccggtc 35341 atcaccattt cattgacgat ggcaggcggc aggatcttcg gcagacgcag cacaccgccg 35401 ctgtcaggaa cgatgcccag tttggcttcc ggcagggcga agctggcgtt atcggcacaa 35461 acaataaaat ctgccgccag cgccagttca aaggcgccgc caaaggcata gccgttcaca 35521 gctgcgataa ccggtttgtc gagattgaaa atttcggtta atcccgcaaa accacccgga 35581 ccaaagtcag catccggtgc ttcgccttct gctgccgctt ttaaatccca gcccgcggaa 35641 aagaacttct ctccggcacc ggtaataatg gcgacacgta attgcggatc gtcacggaaa 35701 tttagaaata cttcgcccat ttcaaagctg gtttttgcat caatagcatt cgcttttgga 35761 cgatcaaggg taatttccag aattgatcca ttgcgggtca gatgtaatga ttcactcatt 35821 ccttttctcc atttttgctt tttcagggac gacaacatcc ctgcaaaaaa tgcatattgt 35881 tttagagtgt gattattagc tggcagggta gttccctgct gtttcattta tttcagattc 35941 tttctaatta ttttccccga gcaattacgt ggcagatctt ttctgatctc cagataagag 36001 ggcactttaa atttcgccat attttgttcg cagaagcgga aaaattcctc ttcgctcaat 36061 gtttcacctt cattcagcac cacaaatgct ttgatggctt catcgcgaat cgaatcttta 36121 atacccacaa ccacgatgtc ctgaattttc gggtgcgcgg cgataatatt ttccagctcc 36181 acgcaggaga cattctcgcc gccacgttta atcatattgc agcggcgatc gacgaaataa 36241 aaaaagtcct cttcgtcgcg gtatccggta tcgccggtat gcagccagcc atcggcttcc 36301 agcactttcg cagtggcttg tgggttgaga aagtactctt tgaagatggt tttcccaggt 36361 atgcctttaa tgcagatttc accgatctca ccagccggga gcgggcgatt gtgatcgtcg 36421 cggatctccg cttcgtagca aaaccccacc cgaccaatcg acggccagcg tcgtttatcg 36481 ccaggacgat caccgataat gcccacaatg gtttccgtca tcccataaga cgtcagcaag 36541 cgaacgccga agcgttcaca aaacgcatct ttttcctgct ccgacaagtt gagataaaac 36601 atcacttccc gcaggcggtg ttgctgatcg 
ttcgctgaag gcggctgtac catcaacgta 36661 cggatcatca tcggaataca ttcggtaacg gtggcgcggt acttctgtac ctgtccccag 36721 aaggcgcggg cgctgtattt ctcgaccagc acaaaggtgg ccccggcaga aaacgccgcc 36781 atcgccgcag tacactggca atcgatatga aacgcaggca ttaccgtcag gtagacgtca 36841 tcgtcacgca gtgcacactg ccaggcggag taatatccag cgaagcgcag gttgtaatgg 36901 gtaatcacca cacctttcgg tcgggaggtg gtgccggagg tgaagagaat ttccgccgta 36961 tcgtcagtcg atagcggcgg tgcatagcac aaggtggcag gttgttgatt tttcagttga 37021 gtaaacgaac tcacgccatc atcagcggga agtgccacat ctgtcaggca aatgtgccgc 37081 aattgagtgg catcttcctg ctgaatctgt tgatacatag gatagaattg cgcactggtc 37141 accagcaggc acgcctggct attttgcagg atccacgcgc tttcctcgca caacaggcgg 37201 gcgttaatcg gcaccataat cgcgccaatt tttgccagcc cgaaccagca aaagataaat 37261 tccgggcagt tgtcgagatg tagtgcaacc ttgtcgcctt tgcgaatccc cagcgtataa 37321 aacaggtttg ccgtgcggtt aatctcctga tttaactcaa gataactata ccggttaacg 37381 actccgccgc tggattcaca aatcagcgcc gttttatgac cgtaaacgtc cgcaagatcg 37441 tcccacattt gacgtagatg ttgtccgcca atgatatcca ttgcacctct atccattttt 37501 gttcgtttgt tattgggcgg gcgctagtca ggcaagccga ctgacgccac gcgtttagtc 37561 ctcaactttg gccagacctt tgctgaccaa ctcctgaatg tcgttttcgc tgtagccgat 37621 atttttcaaa atggcagccg tgtccatgcc atgtgagggc attccgcgcc agatttgtcc 37681 ggggttattt ttgaatttcg gcatgatgtt cggccctttg caggtgcgac catccatcgt 37741 ttgccactga gtgattgatt cgcgagccac atactgtgga ttgctttcca gttccggtac 37801 ggtcagcact ttggcgcagg cgatattcag ttcagcaaag cgttctttta cttccgcgat 37861 ggtatgtgtc gccagccagg catcgagttt ctcttcaacc agtgggccgt aagggcattc 37921 gatacggtgg ataagctgag tgccttccgg gatttctggc gtgccaagca gatgtgcgag 37981 gccaatatct ttaaagcact cttcaatttg ggtaatgccc accagttcca tcacgatgta 38041 gccgtcggca catttataca gaccgcaacc ggcgtagtag ggatctttac ctttgctcat 38101 gcgcgggcac atttcgccgc cgttgaagta atccatcatg aagtactggc ccatacgcag 38161 catcacttca tacatggcga tgtcgatact ttcgccttta ccggtttcac gcactttatg 38221 cagtgctgcc agcgccgccg tggtggcggt caggccagaa aagtaatcgg cggtatacgg 38281 gaaggcaggc 
attggctggt caacatcacc gttctgaatc aggtaaccac taaaggcctg 38341 ggcgatagtg ttataggccg gaagattggt gtactcctcg gtgccgtact gaccaaaacc 38401 ggacaggtga gcgataacca gtttcgggtt gtgctgccac agtacttcat cggtaatgcc 38461 acgacgggca aaggccggac ctttactggc ttcgatgaag atatcggtgg tttccattaa 38521 tttcagaaac gcttcgcggc cttcatcttt gaaaatattt aacgacagcg cgtgcaaatt 38581 gcggcgggag agttgcgggt agttcggttg aacgcgaatg gtgtcggccc aggcgacgtt 38641 ctcgatccag ataacttccg cgccccattc tgcgaacatt tgcccggcaa acggtccggc 38701 gatttcgata ccggagaaga caacgcgcaa tccggccaac ggcccgaatt tcggcatggg 38761 tagatgatcc attatttgct cctgaaaaat ttatgtagcg catgactgcc ggatgcggcg 38821 taaacgcttt atccggccta cattcgtgct cccgtaggcc tgataagacg catcagcgcg 38881 gcatcaggca gcgcacggac ttagcggtat tgcttcagca ccgcacgacc cagcgtcagg 38941 atctgcattt cgtcagatcc cccggagacg cggtctacac gcagatcacg ccagaagcgg 39001 ctgatgcggt ggttgcccgc aatcccgaca ccgcccagca cctgcattgc gctatccaca 39061 acttcaaatg ccgcattggc gcagaagtat ttgcacatcg ctgcatcgcc agaggtgatg 39121 gtgccgttgt ctgctttcca cgctgcttca tacagcatgt ttttcatgga gtttaatttg 39181 atcgccatgt gggcgaattt ttcctgaatc aactggaaac gaccaatagc ctcgccaaac 39241 tgcacgcgct gattggcgta gcgcgccgca tcttcaaagg cgcacatcgc cgtaccgtag 39301 ttggtgaggg ctaccaggaa acgttcatgg tcgaactctt ctttgacgcg gttaaagccg 39361 ttaccttccc gaccgaacat gtctttctcg tccagttcca cgtcgtcaaa ggtgatttca 39421 cagcagctat ccatacgcag accgagcttt tcaagtttgg tcactttgat gcccggtttg 39481 ctcatatcaa caaaccatcc ggtgtagaca ggtttgtccg gagaagcccc gtcgcgcgcc 39541 atcaccacga tgtacggggt gtaggcgctg ctggtaataa aacacttact accattaaga 39601 taaatcttac catttttacg ggtataagtc gttttcaggc tacccacgtc ggagcccgcg 39661 cccggttcgg taatcgctga gttccacatc tgcttaccgg tgccgcggaa agccataatt 39721 ttgtcgatct gctcttgtgt gccttcgcgc aggaaggtgt tgaacccgcc cggcaactgg 39781 tacagcacat aggttggtgc ccccagacgt cccagctcca tccacacggc ggcgagagta 39841 acaaaccccg cgtccagacc accgtgctct tcagggatca gcagactgtc gatacccata 39901 tccgccagtg ctttgacaaa acgttccggg tagacgctgt cacggtcgca ctcggcaaaa 
39961 taggcctccc agttttcgct ggccatcagt tcgcggatac cggcgacaaa cagttcctgc 40021 tcatcattta aattaaaatc catctttcaa cctcttgata ttttgggggg ttaattaatc 40081 tttccagttc tgtttcgcgt ctttaataaa ggagagcgtc accataatgt tgacgaagaa 40141 cagcgggcat cctccggcga taatggcggt ttgaatcggt ttcaggccgc cgagcgccag 40201 cagaacaata ccgataatgc caaccagaat tgaccaaccg atacgcacca gcagaggtgg 40261 ttcttcacca tcgcgtactt cgcggcaagt ggacatcgcc agggtataag agcaggcgtt 40321 aaccagcgta acggtggcaa taaagcagag gatgaagaag ccccacatgg tggcggtgct 40381 gagtggcaga gcggcccagg tttcaatgat ggcgcgcgcc acaccgtact gttcgatcag 40441 atttggaatg ttgatgatgt ttttatctat caacagcaga gtgttactac cgagtacagt 40501 ccacaggatc caggttgacg ctgtcagccc cagcaccatg ccgaagcaca gttcacgcac 40561 agtacgacca cgggagatgc gggcgaggaa gatactcatc tggatagcat aaatcaccca 40621 ccatgcccag tagaacacgg tccagccctg cgggaagccg cctttagcga tgggatcggt 40681 atagaacaac atgcgcggca gatacatcag caacatcccc accgaatcgg tgaagtagtt 40741 catgatgaag ctggcaccgc tgacaatgaa cacccaaccc agcatcagga agctcaggta 40801 actacgcacg tcactggcga tacgtacccc tttttgcaga ccgcaagcga cgcaaatggc 40861 gttgaggata atccagcagg taatgatgat agcgtccagt tgcagggtat gcggaatgcc 40921 aaacaaccat tgcatacact cggtcaccag cggcgtggca aggcccagac tggtacccat 40981 cgtgaagatc aaggcgacga gatagaagtt gtcgacgata gtgccgaaca accctttggc 41041 gtgtttttca cctaccagcg gcaccagtgt cgagctgggg cgaatcactt ccattttgcg 41101 gacaaagaag aagtaagcga aggcgactga aaggaagctg taagtggccc acggcagagg 41161 tccccagtgg aacaagctgt aagccagccc caactctttc gcccctgtcg agttcggttc 41221 taagccaaac ggcggggtgg agatgtagta gtagatctca atcgatcccc agaacagtac 41281 ggcagcagac gtacaggagg cgaacatcat aaagatccaa ctggcggtgc taaattctgg 41341 cggttcgtta cctaaacgct ttttggcata cgggccaaac accagccaga accaaccgaa 41401 aagcatcacc accatatacc attcaaatgc ccatccccat acattggtga cgtaactgaa 41461 tacagcatta ataacgacat tcgctgcatc cagatctctg actgtaagcc aacaaagtat 41521 gccgacgatt attaacggcg gaaagaaaac cttcggttct attcccgttt ttctcttttc 41581 attcttcatg agttaattcc actgtgaaaa cgaatattta 
ttttgcgttc ccgtttgttt 41641 tatttttgtt aacatttaat ataattatta ttaacctcgt ggacgcgtta atggctaact 41701 cataatgggt attcaataag ctgtattctg tgattggtat cacatttttg tttcgggtga 41761 atagagggcg ttttttcgtt aattttgatt aataatcagt ttgttatgct ctgttgtgag 41821 taaaaaataa catctgactt tcaatattgg tgatccataa aacaatattg aaaatttctt 41881 tttgctacgc cgtgttttca atattggtga ggaacttaac aatattgaaa gttggattta 41941 tctgcgtgtg acattttcaa tattggtgat taaagtttta tttcaaaatt aaagggcgtg 42001 atatctgtaa ttaacaccac cgatatgaac gacgtttcct tcatgatttc tggagatgca 42061 atgaagatta ttacttgcta taagtgcgtg cctgatgaac aggatattgc ggtcaataat 42121 gctgatggtt cattagactt cagcaaagcc gatgccaaaa taagccaata cgatctcaac 42181 gctattgaag cggcttgcca gctaaagcaa caggcagcag aggcgcaggt gacagcctta 42241 agtgtgggcg gtaaagccct gaccaacgcc aaagggcgta aagatgtgct atcgcgcgac 42301 ccggatgaac tgattgtggt gattgatgac cagttcgagc aggcactgcc gcaacaaacg 42361 gcgagcgcac tggctgcagc cgcccagaaa gcaggctttg atctgatcct ctgtggcgat 42421 ggttcttccg acctttatgc ccagcaggtt ggtctgctgg tgggcgaaat cctcaatatt 42481 ccggcagtta acggcgtcag caaaattatc tccctgacgg cagataccct caccgttgag 42541 cgcgaactgg aagatgaaac cgaaacctta agcattccgc tgcctgcggt tgttgctgtt 42601 tccactgata tcaactcccc acaaattcct tcgatgaaag ccattctcgg cgcggcgaaa 42661 aagcccgtcc aggtatggtc ggcggcggat attggtttta acgcagaggc agcctggtca 42721 gaacaacagg ttgccgcgcc gaaacagcgc gaacgtcagc gcatcgtgat tgaaggcgac 42781 ggcgaagaac agatcgccgc atttgctgaa aatcttcgca aagtcattta accacagggg 42841 atgctacgaa cacattttct caagtctggg tattcagcga taccccttct cgtctgccgg 42901 aactgatgaa cggtgcgcag gctttagcta atcaaatcaa cacctttgtc ctcgatgatc 42961 ggcgacggtg tacaggcaat ccagctcggc gctaatcatg tctggaaatt aaacggcaaa 43021 ccggacgatc ggatgatcga agattacgcc ggtgtcatgg ctgacactat tcgccagcac 43081 ggcgcagacg gcctggtgct gctgccaaac acccgtcgcg gcaaattact ggcggcaaaa 43141 ctgggttatc gccttaaagc ggcggtgtct aacgatgcca gcaccgtcag cgtacaggac 43201 ggtaaagcga cagtgaaaca catggtttac ggtggtctgg cgattggcga agaacgcatt 43261 gccacgccgt atgcggtact 
gaccatcagc agcggcacgt tcgatgcggc tcagccagac 43321 gcgtcacgca ctgtgaaacg cacaccgtgg agtggcaggc tccggctgtg gcgattaccc 43381 gcacggcaac ccaggcgcgc cagagcaaca gcgtcgatct cgacaaagcc cgtctggtgg 43441 tcagcgtcgg tcgcggtatt ggcagcaaag agaacattgc gctggcagaa cagctttgca 43501 aggcgatagg tgcggagttg gcctgttctc gtccggtggc ggaaaacgaa aaatggatgg 43561 agcacgaacg ctatgtcggt atctccaacc tgatgctgaa acctgaactg tacctggcgg 43621 tggggatctc cgggcagatc cagcacatgg ttggcgctaa cgcgtcgcaa accattttcg 43681 ccatcaataa agataaaaat gcgccgatct tccagtacgc ggattacggc attgttggcg 43741 acgccgtgaa gatccttccg gcgctgaccg cagctttagc gcgttgatcc actctggcag 43801 ggctgcattt tggccctgcc gctgacaggg agctcttatg tccgaagata tctttgacgc 43861 catcatcgtc ggtgcagggc ttgctggttc ggttgccgca ctggtgctcg cccgcgaagg 43921 tgcgcaagtg ttagttatcg agcgtggcaa ttccgcaggt gccaagaacg tcaccggcgg 43981 gcgtctctat gcccacagtc tggaacacat tattcctggt ttcgccgact ccgcccccgt 44041 agaacgcctg atcacccatg aaaaactcgc gtttatgacg gaaaagtcag cgatgactat 44101 ggactactgc aatggtgacg aaacctcgcc atcccagcgt tcttactccg ttttgcgcag 44161 taaatttgat gcctggctga tggagcaggc cgaagaagcg ggcgcgcagt taattaccgg 44221 gatccgcgtc gataacctcg tacagcgcga tggcaaagtc gtcggtgtag aagccgatgg 44281 cgatgtgatt gaagcgaaaa cggtgatcct tgctgatggg gtgaactcca tccttgccga 44341 aaaattgggg atggcaaaac gcgtcaaacc gacggatgtg gcggttggcg tgaaggaact 44401 gatcgagtta ccgaagtcgg ttattgaaga ccgttttcag ttgcagggta atcagggggc 44461 ggcttgcctg tttgcgggat cacccaccga tggcctgatg ggcggcggct tcctttatac 44521 caatgaaaac accctgtcgc tggggctggt ttgtggtttg catcatctgc atgacgcgaa 44581 aaaatcggtg ccgcaaatgc tggaagattt caaacagcat ccggccgttg caccgctgat 44641 cgcgggcggc aagctggtgg aatattccgc tcacataatg ccggaagcag gcatcaacat 44701 gctgccggag ttggttggtg acggcgtatt gattgccggt gatgccgccg gaatgtgtat 44761 gaacttcggt tttaccattc gcggtatgga tctggcgatt gccgccgggg aagccgcagc 44821 aaaaaccgtg ctttcagcga tgaaaagcga cgatttcagt aagcaaaaac tggcggaata 44881 tcgtcagcat cttgagagtg gtccgctgcg cgatatgcgt atgtaccaga aactaccggc 44941 
gttccttgat aacccacgca tgtttagcgg ctacccggag ctggcggtgg gtgtggcgcg 45001 tgacctgttc accattgatg gcagcgcgcc ggaactgatg cgcaagaaaa tcctccgcca 45061 cggcaagaaa gtgggcttca tcaatctaat caaggatggc atgaaaggag tgaccgtttt 45121 atgacttctc ccgtcaatgt ggacgtcaaa ctgggcgtca ataaattcaa tgtcgatgaa 45181 gagcatccgc acattgttgt gaaggccgat gctgataaac aggtgctgga gctgctggtg 45241 aaagcgtgcc ccgcaggtct gtacaagaag caggatgacg gcagtgtgcg cttcgattac 45301 gccggatgtc tggagtgcgg cacctgtcgc attctggggc tggggtcggc gctggaacag 45361 tgggaatacc cgcgcggcac ctttggtgtg gagttccgtt acagctaatg ttgctttgat 45421 acgtaacgcc gcactgactc tcattgcaaa aaacatgaat agctatgcaa ccgtccagaa 45481 accttgacga tctcaaatgc ttgtcccact accgccgcat tttgctgtgg ggaagcggtg 45541 gtccgtttct gtatggttat gtactggtaa tgattggcgt ggcgctggag caactgacgc 45601 cggcgctgaa actggacgct gactggattg gcttgctggg cgcgggaacg ctcgccgggc 45661 tgttcgttgg cacatcgctg tttggttata tttccgataa agtcggacgg cgcaaaatgt 45721 tcctcattga tatcatcgcc atcggcgtga tatcggtggc gacgatgttt gtttcatccc 45781 ccgtcgaact gttggtgatg cgggtattta tcggcattgt catcggtgca gattatccca 45841 tcgccacctc aatgatcaac gagttctcca gtacccgtca gcgggcggtt ttccatcagc 45901 tttattgccg cgatgtggta tgttggcgcg acctgtgccg atctggtcgg ctactggctt 45961 tatgatgtgg aaggcggctg gcgctggatg ctgggtagcg cggcgatccc ctgtctgttg 46021 attttgattg gtcagattcg aactgcctga atctccccgc tggttattat gcaaagggcg 46081 agtaaaagag tgcgaggaga tgatgatcaa actgtttgga gaaccggtgg ctttcgatga 46141 agagcagccg cagcaaaccc ggttttcgcg atctgtttaa tcgctgccat tttccttttg 46201 ttctgtttgt tgccgccatc tggacctgcc aggtgatccc aatgttcgcc atttacacct 46261 ttggcccgca aattgttggt ttgttgggat tgggggttgg caaaaacgcg gcactaggaa 46321 atgtggtgat tagcctgttc tttatgctcg gctgtattcc gccgatgctg tggttaagca 46381 ctgccggacg gcgtccattg ttgattggca gctttgccat gatgacgctg gcgctggcgg 46441 ttttggggct aatcccggat atggggatct ggctggtagt gatggccttt gcggtgtatg 46501 cctttttctc tggcgggccg ggtaatttgc agtggctcta tcctaatgaa ctcttcccga 46561 cagatatccg cgcctctgcc gtgggcgtga ttatgtcctt aagtcgtatt 
ggcaccattg 46621 tttcgacctg ggcactaccg atctttatcg ataattacgg tatcagtaac acgatgctaa 46681 tgggggcggg tatctcgctg tttggcttgt tgatttccgt agcgtttgcc ccggagactc 46741 gagggatgtc actggcgcag accagcaata tgacgatccg cgggcagaga atggggtaaa 46801 ttgttcagat ttctctcttt tctgaatcaa tattattgac tataagccgc gtgaatatat 46861 gactacactt tgtgggaaaa caaaggcgta atcacgcggg ctacctatga ttcttataat 46921 ttatgcgcat ccgtatccgc atcattccca tgcgaataaa cggatgcttg aacaggcaag 46981 gacgctggaa ggcgtcgaaa ttcgctctct ttatcaactc tatcctgact tcaatatcga 47041 tattgccgcc gagcaggagg cgctgtctcg cgccgatctg atcgtctggc agcatccgat 47101 gcagtggtac agcattcctc cgctcctcaa actttggatc aataaagttt tctcccacgg 47161 ctgggcttac ggacatggcg gcacggcgct gcatggcaaa catttgctgt gggcggtgac 47221 gaccggcggc ggggaaagcc attttgaaat tggtgcgcat ccgggctttg atggtctgtc 47281 gcagccgcta caggcgacgg caatctactg cgggctgaac tggctgccac cgtttgccat 47341 gcactgcacc tttatttgtg acgacgaaac cctcgaaggg caggcgcgtc actataagca 47401 acgtctgctg gaatggcagg aggcccatca tggatagcca tacgctgatt caggcgctga 47461 tttatctcgg ttcggcagcg ctgattgtac ccattgcggt acgtcttggt ctgggatcgg 47521 tacttggcta cctgatcgcc ggctgcatta ttggcccgtg ggggctgcga ctggtgaccg 47581 atgccgaatc tattctgcac tttgccgaga ttggggtggt gctgatgctg tttattatcg 47641 gcctcgaact cgatccacaa aggctgtgga agctgcgtgc ggcagtgttc ggctgtggcg 47701 cattgcagat ggtgatttgc ggcggcctgc tggggctgtt ctgcatgtta cttgggctgc 47761 gctggcaggt cgcggaattg atcggcatga cgctggcgct ctcctctacg gcgattgcca 47821 tgcaggcgat gaatgaacgc aatctgatgg tgacgcaaat gggtcgcagt gcctttgcgg 47881 tgctgctgtt ccaggatatc gcggcgatcc cgctggtggc gatgattccg ctactggcaa 47941 cgagcagtgc ctcgacgacg atgggcgcat ttgctctctc ggcgttaaaa gtggcgggtg 48001 cgctggtgct ggtggtattg ctggggcgct atgtcacgcg tccggcgctg cgttttgtag 48061 cccgctctgg cttgcgggaa gtgtttagtg ccgtggcgtt attcctcgtg tttggctttg 48121 gtttgctgct ggaagaggtc ggcttgtcga tggcgatggg cgcgtttctg gcgggcgtac 48181 tgctggcaag ctcggaatac cgtcatgcgc tggagagcga tatcgaacca tttaaaggtt 48241 tgctgttggg gctgtttttc atcggtgttg 
gcatgtcgat agactttggc acgctgcttg 48301 aaaacccatt gcgcattgtc attttgctgc tcggtttcct catcatcaaa atcgccatgc 48361 tgtggctgat tgcccgaccg ttgcaagtgc caaataaaca gcgtcgttgg tttgcggtgt 48421 tgttagggca gggcagtgag tttgcctttg tggtatttgg cgcggcgcag atggcgaatg 48481 tgctggagcc ggagtgggcg aaatcgctga ccctggcggt ggcgctgtcg atggcagcaa 48541 cgccgattct gctggtgatc ctcaatcgcc ttgagcaatc ttctactgag gaagcgcgtg 48601 aagccgatga gatcgacgaa gaacagccgc gcgtgattat cgccggattc ggtcgttttg 48661 ggcagattac cggacgttta ctgctctcca gcggggtgaa aatggtggta ctcgatcacg 48721 atccggacca tatcgaaacc ttgcgtaaat ttggtatgaa agtgttttat ggcgatgcca 48781 cgcggatgga tttactggaa tctgccggag cggcgaaagc ggaagtgctg attaacgcca 48841 tcgacgatcc gcaaaccaac ctgcaactga cagagatggt gaaagaacat ttcccgcatt 48901 tgcagattat tgcccgcgcc cgcgatgtcg accactacat tcgtttgcgt caggcaggcg 48961 ttgaaaagcc ggagcgtgaa accttcgaag gtgcgctgaa aaccgggcgt ctggcactgg 49021 aaagtttagg tctggggccg tatgaagcgc gagaacgtgc cgatgtgttc cgccgcttta 49081 atattcagat ggtggaagag atggcaatgg ttgagaacga caccaaagcc cgcgcggcgg 49141 tctataaacg caccagcgcg atgttaagtg agatcattac cgaggaccgc gaacatctgt 49201 cattaattca acgacatggc tggcagggaa ccgaagaagg taaacatacc ggcaacatgg 49261 cggatgaacc ggaaacgaaa ccctcatcct aataaagagt gacgtaaatc acacttacag 49321 ctaactgttt gttttgtttc attgtaatgc ggcgagtcca gggagagagc gtggactcgc 49381 cagcagaata taaaattttc ctcaacatca tcctcgcacc agtcgacgac ggtttacgct 49441 ttacgtatag tggcgacaat tttttttatc gggaaatctc aatgatcagt ctgattgcgg 49501 cgttagcggt agatcgcgtt atcggcatgg aaaacgccat gccgtggaac ctgcctgccg 49561 atctcgcctg gtttaaacgc aacaccttaa ataaacccgt gattatgggc cgccatacct 49621 gggaatcaat cggtcgtccg ttgccaggac gcaaaaatat tatcctcagc agtcaaccgg 49681 gtacggacga tcgcgtaacg tgggtgaagt cggtggatga agccatcgcg gcgtgtggtg 49741 acgtaccaga aatcatggtg attggcggcg gtcgcgttta tgaacagttc ttgccaaaag 49801 cgcaaaaact gtatctgacg catatcgacg cagaagtgga aggcgacacc catttcccgg 49861 attacgagcc ggatgactgg gaatcggtat tcagcgaatt ccacgatgct gatgcgcaga 49921 actctcacag 
ctattgcttt gagattctgg agcggcggta attttgtata gaatttacgg 49981 ctagcgccgg atgcgacgcc ggtcgcgtct tatccggcct tcctatatca ggctgtgttt 50041 aagacgccgc cgcttcggcc aaatccttat gccggttcga cggctggaca aaatactgtt 50101 tatcttccca gcgcaggcag gttaatgtac caccccagca gcagccggta tccagcgcgt 50161 atataccttc cggcgtacct ttgccctcca gcgatgccca gtgaccaaag gcgatgctgt 50221 attcttcagc gacagggcca ggaatcgcaa accacggttt cagtggggca ggggcctctt 50281 ccggcgattc tttgctgtac atatccagtt gaccgttcgg gaagcaaaaa cgcatacggg 50341 taaaagcgtt ggtgataaaa cgcagtcttc ccagcccccg caattccggt gaccagttat 50401 ttggcatatc gccgtacatg gcatcaagaa agaagggata ggagtcactc gatagcaccg 50461 cttctacatc gcgtgcgcac tctttggcgg tctgcagatc ccactgcggc gtgatccctg 50521 cgtgggccat caccagcttt ttctcttcgt cgatttgcag cagaggctgg cgccgcagcc 50581 agttaagcag ctcgtcggca tccggcgctt ccagcagcgg tgtcaggcga tctaacggtt 50641 tattgcggct gatcccggca aataccgcca gcagatgcag atcgtgattg cccagcacca 50701 gacgtacgct gtcgcctaag gatttcacat agcgcagaac atccagcgaa cccggcccgc 50761 gcgcgaccag atcgcccgtc agccagaggg tatctttccc aggggtaaat tctactttat 50821 gcagcaatgc gatcagttca tcgtaacaac catgaacgtc gccaataagg tatgtcgcca 50881 tattctttta atgaatgagt gtgggaacgg cgagtcggaa tacgggaatg tcgatgctga 50941 aagggacgcc attttcatcg atcatttcgt agtgaccctg catggtgccc agcggggttt 51001 caatgattgc accgctggtg tactggtact cttcgccagg cgcgataagt ggctggacgc 51061 caaccactcc ttcgccctgg acttcggttt cacggccatt gccattggtg atcagccagt 51121 aacgccccaa caactgcact ggcgctcgcc ccagattgcg tatggttacg gtataagcaa 51181 aaacgtaacg ttcattatca ggtgaagatt gagcctcaat gtagacgctt tgaacctgaa 51241 tacacactcg gggcgaattg atcatcgtta actctcctgc aaaggcgcgt tctccgccca 51301 gatagttcgc catctggcaa tattgcgcga cagagatatt ttccgctcgc atcgccgggt 51361 cgatccccat tcccgttaac acctcgacgc taaacaggtt gccgaggctg ttacgaatgg 51421 ttttacgacg ctggttaaag gcttcggtgg tgatgcggct caacacacga acatctttaa 51481 ccgggtgagg catcgttgca tgaggaacca ggcgcacgac ggcggaatcc actttgggtg 51541 gtggtgtaaa ggctgacggc ggtacttcca gtaccgggat cacattgcaa tagtattgcg 
51601 ccatgacgct taatcgacca tacgctttgc tgttcggtcc tgcaaccaga cgattcacca 51661 cctctttttg caacataaag tgcatgtcgg caatggcatc agtatagcta aacagatgga 51721 acatcaacgg cgtggagatg ttataaggca ggttgccgaa aacacgcagc ggctgaccca 51781 ttttctcggc cagttcacca aagttaaagg tcatcgcatc ctgctgataa atcgtcagtt 51841 tcgggcctaa gaatggatgc gtttgcagac gtgccgccag atcgcggtca agttcgatga 51901 ccgtcagctg gtccagacgt tcgccgaccg gttcggtcaa tgccgccaga ccggggccga 51961 tttcgaccat cgcctggccc ttttgcgggt taatggcaga cacaatactg tcgatcacga 52021 actgatcgtt gagaaagttt tgcccgaagc gtttacgggc taagtggccc tggtggactc 52081 gattattcat tgggtgttaa caatcatttt gatggcgaga ttaagcgccg taataaaact 52141 gccgacatcg gctttgccac gtcccgccag ttcaagcgcg gtgccgtggt ccactgatgt 52201 gcgaataaag ggcaggccca gcgtaatgtt cacaccgcgc ccgaagccct ggtattttag 52261 cacgggaaga ccctgatcgt ggtacatcgc cagcacggcg tcggcgttat caagatattt 52321 cggctgaaac agggtatcgg caggcagcgg cccgttgagt ttcatcccct gcgcccgcag 52381 ctcattgagc accggaataa tggtgtctat ctcttccgta cccatatgac cgccttcgcc 52441 cgcgtgcgga ttcagcccgc agaccagaat gcgcggttcg gcaataccaa atttggtccg 52501 caaatcgtga tgcaaaatag caatcacttc gtgcaaaagt gcaggggtga tagcgtctgc 52561 gatatcgcgc agcggtaaat gcgtcgttgc cagcgccacg cgaagttctt cggtcgccag 52621 catcatcacc acctttttcg cctgcgaacg ctcttcgaaa aactcggtat gaccggtaaa 52681 aggaatgcca gcgtcgttaa taacgccttt atgcaccgga cctgtgatca gcgcggcaaa 52741 ttcgccgttc agacaaccat cgcacgctcg cgccagcgtt tccaccacat aatgcccatt 52801 ttcaaccgct aactgccccg cagtgacagg tgcacgtagc gcgacaggaa gtagcgttaa 52861 tgtgcccgca gtttgcggtt gtgcagggga gttgggggaa taagggcgga gggtgagcgg 52921 caaaccgagc atcgctgccc ggttggtaag gagagtggca tcggcacaaa caaccagttc 52981 gaccggccac tcacgctgtg caagctggac aactaagtcc gggccaatcc cggcgggctc 53041 gccgggagtg atcacaacac gttgggtttt aaccattagt tgctcaggat tttaacgtag 53101 gcgctggcac gttgttcctg catccagctt gctgcttctt ccgagaactt acggttcatc 53161 agcatgcggt atgcacgatc tttctgcgca gcgtcggttt tatcgacatt acgggtatcc 53221 agcagttcga ttaaatgcca gccgaatgaa gagtgaaccg 
gtgcactcat ttgacctttg 53281 ttcaggcgag tcagggcgtc acggaaggcc ggatcgaaaa tatctggtgt agcccagccg 53341 agatcgccgc cctggttagc agagcctgga tcctgagaga actctttcgc tgcggcagca 53401 aaagtcgttt taccactctt gatatcagca gcaatctgtt ccagtttcac acgggcctgt 53461 tcgtcagtca tgatcggcga cggtttcagc agaatatggc gagcatgaac ttcggtcacc 53521 gagatatttt tgctttcgcc gcgcaggtcg ttaactttca gaatatggaa gccaacgccg 53581 gaacgaatcg ggccaacaat gtcgcctttc ttcgcggtgc ttaatgcctg ggcgaagatc 53641 ccgggcaact cctgaatacg gccccagccc atctggccgc cgttcagcgc ctgctggtcg 53701 gcagaatgag caatcgccag cttaccgaaa tcagcgccgt tacgcgcctg atcgacaatg 53761 gcgcgcgcct ggctttccgc ttcgttcacc tgatcagagg tcgggttttc cggcagcggg 53821 atcaggatgt ggctcaggtt cagctcagtg ctggcgtcgt tttggttacc cacctgctgc 53881 gccagggatt cgacttctgc ggcaggatgg tgatgcgacg acgcacctcg ttgttacgca 53941 cttcagagat aatcatctct ttgcggatct ggttacgata ggtgttgtag ttcagtccat 54001 cgtaagccag acggctgcgc atctgatcca gcgtcatgtt gttctgtttc gcgatgttag 54061 caatcgcctg atccagctgc tcatcggaga ttttcactcc cattttctgc cccatctgca 54121 ggatgatttg atccatgatc aaacgttcca tgatttggtg gcgcagcgtc gcgtcatcag 54181 gaagttgctg ccttgcctga gcagcgttca gttttaccga ctgcattaat ccatcaacgt 54241 cgctttccag cacgacgccg ttattgacga cggctgcgac tttatcgact acctgggggg 54301 cagcgaaact ggtattcgcg atcatggcga taccgagaag cagcgttttc cagttcttca 54361 tactttttcc atttcaatta accgcactgc ggattacgtg gtaaatcaac aaatcacaaa 54421 gtgttttgat acggcagaat gttcgaacgc agcatctctt gcgtacccag accgtagttg 54481 gagctcaggc cgcgaagttc gatgttaaag ccgattgcgt tgtcatatac cgcatgttgt 54541 ttatcgttat cccaaccgtt cagcttccgc tcgtaaccga cgcgaattgc atagcagcag 54601 gagctgtatt gcacacctaa catagagtcg gcttgcttgt tagcattggt gtcgtagtag 54661 taggccccaa caatggacca acgatcggca attggccagc tggcgacagc acctacctgc 54721 gaaataccat tcttatattg ctcagcagtg gaatagtact taggcagcgt agcctgaata 54781 tattccgggc tggcgtaacg gtaattcagc tgtaccagac ggtcttcatc ccgacggtat 54841 tcaatgctgg agttactggt cgctacgtta tccagacgtg tatcgtactg aatcccgcca 54901 cgcaatcccc aacgctcgga 
gatacgccag taagtatcgc ctgcccacac cagtgaaccc 54961 gttttgtcgt cattctccca tgttatgttg tcatcgccag tgcgagactc cgtgaaatag 55021 tagatttgac caacggaaat attaaaacgt tcaacggcag catcatcata tatgcgagat 55081 gtgacaccgg tcgtcacctg gttagcggag gcaatacggt caagaccgcc gtaagtccgg 55141 tcccggaaca ggccagagta gtcagattgc agcagagacg agtcgtagtt atagatgtcg 55201 ctctgatcgc gatacggcac gtacaaatac tgcgcgcgcg gttccagcgt ttgggtataa 55261 cccggagcca gcatttccat atcgcgttca aagaccattt tgccgtcaac tttgaattgc 55321 ggcattacgc ggttaacgga ttcgtccagc ttggtcgtgt ttctggagtt ataccagtca 55381 agattggttt gctgataatg ggttgccagc aacttcgctt cggtattgat gctgccccag 55441 ttattagaga gcggcaaatt gatggtcggt tccaggtgaa cacgggttgc ttcaggcatg 55501 tcgtctctgg tgttaacaaa gtgcactgcc tggccgtaaa tacgcgtatc aaacggacca 55561 acatcattct ggtagtaatt aacgtctaac tgcggctctg ccgagtagct actggtgttc 55621 tgttcgctga aaacctggaa ctgcttggtt gaaacggtgg cattgaagtt ttgcaccgca 55681 tagccaacgc tgaatttttg cgttgcgtag ccgtcagtac tggaaccgta cttgttatcg 55741 aaatcattga agtagctagg atcgctgacc ttggtgtagt cgacgttgaa acgccacacc 55801 tgatccatga ccccggagtg gttccagtag aataaccaac gacgtgaact gtcatcgttc 55861 gggtgttcat cttcatagac tttatctgaa ggcagatagt ccagttccat caagccagcg 55921 cccgcctggg agaggtagcg gaattcgttc tcccacatga tgttgccacg acgatgcata 55981 taatgcggcg tgatggtggc atccatattt ggcgcgatgt tccagtaata tggcaggtag 56041 aactcaaagt agttggtggt ggtgtacttg gcgttcggga tcaagaaacc agagcgacgt 56101 ttgtcaccca ccggcaactg caaatagggg ctataaaaga tcggtaccgg acccacctta 56161 aagcgggcgt tccagatctc cgcaacttgt tcttcgcggt catgaataat ttcgctacct 56221 accacgctcc aggtgtcaga acccggcaga caggaggtaa agctaccgtt atccagaatg 56281 gtatagcggt tttcgccacg ttgtttcatc aggtccgctt taccgcgacc ctggcgaccc 56341 accatctggt aatcaccttc ccagacgttg gtatctttgg tgttcagatt cgcccagcct 56401 ttcggccctt tgaggatcac ctggttatcg tcgtaatgga cattaccgag cgcatcaacg 56461 gtacgtaccg gctccggttg tcctggtgcc tctttttgat ggagctgcac ttcgtcggcc 56521 tgcagacggc tgttaccctg catgatatcc acgctgccag taaacacggc gtcatccggg 56581 
tagtcccctt tcgcgtggtc agcattgata gtcacgggta agtcattggt atcgccctgt 56641 accagaggac ggtcatagct tggcacgccc aacatgcact gtgaggcgag gtcggctgcc 56701 agtccctgtt gactataaag ggcggtggca atcatggtgg ccaggagagt ggggatacgt 56761 tttttcatac gttgatttta ttgttccatc atcggtaacg ttgcgcgtga caaacggtca 56821 gagactaacg tactcgtcat ctctacgcta gtgttaatcc tgtccgaata gcgtcagtgg 56881 tgttaggcac ggcattgaat gacaggtatg ataatgcaaa ttataggcga tgtcccacaa 56941 ttgaccgcag ccggaaaacg gtaaaagcac ctttatattg tgggagatag ccctgatatc 57001 cgtgtgtcga tttggggaat atatgcagta ttggggaaaa atcattggcg tggccgtggc 57061 cttactgatg ggcggcggct tttggggcgt agtgttaggc ctgttaattg gccatatgtt 57121 tgataaagcc cgtagccgta aaatggcgtg gttcgccaac cagcgtgagc gtcaggcgct 57181 gttttttgcc accacttttg aagtgatggg gcatttaacc aaatccaaag gtcgcgtcac 57241 ggaggctgat attcatatcg ccagccagtt gatggaccga atgaatcttc atggcgcttc 57301 ccgtactgcg gcgcaaaatg cgttccgggt gggaaaatca gacaattacc cgctgcgcga 57361 aaagatgcgc cagtttcgca gtgtctgctt tggtcgtttt gacttaattc gtatgtttct 57421 ggagatccag attcaggcgg cgtttgctga tggttcactg cacccgaatg aacgggcggt 57481 gctgtatgtc attgcagaag aattagggat ctcccgcgct cagtttgacc agtttttgcg 57541 catgatgcag ggcggtgcac agtttggcgg cggttatcag cagcaaactg gcggtggtaa 57601 ctggcagcaa gcgcagcgtg gcccaacgct ggaagatgcc tgtaatgtgc tgggcgtgaa 57661 gccgacggat gatgcgacca ccatcaaacg tgcctaccgt aagctgatga gtgaacacca 57721 tcccgataag ctggtggcga aaggtttgcc gcctgagatg atggagatgg cgaagcagaa 57781 agcgcaggaa attcagcagg catatgagct gataaagcag cagaaagggt ttaaatgacc 57841 ctgtaaatga tgctgagtaa ctgcccacga ttaaaggtgg ccgccctggc ggtcacttct 57901 ttgagaaaag gcgtttactc agaatggtgg acaggctcaa tgcacggttt acgggagggg 57961 ttctgtaggt tttatcgcgt tgaccctgct taaggttgag agctttacga cgagcggaat 58021 tatattttta cgtcttaaaa ataaaaaaca catacctgaa tgagcgattt ttgaaagtat 58081 atttattcag aacgcgcatc atgagttttt aactcaatgc gaggctatta ccatgaaagt 58141 atcagttcca ggcatgccgg ttacactttt aaatatgagc aagaacgata tttataagat 58201 ggtgagcggg gacaagatgg acgtgaagat gaatatcttt caacgcttgt 
gggagacgtt 58261 acgccatctg ttctggtcag ataaacagac tgaggcttat aaacttctgt tcaatttcgt 58321 gaataaccag actggcaaca tcaacgcctc agaatacttt actggggcta tcaacgagaa 58381 tgagagagaa aagtttatca atagcctgga attattcaat aaacttaaaa catgcgcaaa 58441 aaatccggat gagttggtcg caaagggcaa tatgcgctgg gtcgcccaga ccttcgggga 58501 tatcgagtta agtgtcactt ttttcattga aaagaataag atatgtactc agacgttgca 58561 gctgcataag ggccaaggta acttgggcgt tgatcttaga aaggcttacc ttcccggcgt 58621 tgacatgagg gattgttacc ttggtaaaaa aacaatgaaa ggtagcaatg atatccttta 58681 tgagagacct gggtggaatg ctaacctggg cgtgctaccc cggacggtgc taccccggac 58741 ggtgctaacc cggacggtgc taacctggac ggtgctaccg tgaacggtgc tacctcctta 58801 tatgatgagg taattattat taataaaatc ccccccaaaa aaattgatac taaaggagtt 58861 gctactgaag aagttgctac taaaaaagta ctgctgaaca aattactgac aacgcaatta 58921 ttgaatgagc cagaataagc taaggttgaa ggggctggaa cgccccttca accttagcag 58981 tagcgtggga tgatttcaca attagaaaga cctgcatgat gagctagaga agaggctagt 59041 gacgcaaggc gtcgtgcagg acacggatca ccgagatggg catcgccaac cagactgcta 59101 attagcccat gaataacaat cagaaaggac cataacagac ccgttaaaat gaaatataag 59161 agacggtcaa cgggtgaaga aaaagttcaa aaattcgctg tggagcagga agggaattac 59221 cgaatggaaa gcgtagccac acgcaacaac tgaaagcagt ttggcagaaa caaaaaatcc 59281 ccggactcgg ggatttatgt acaagaggca gcccttagga tgagggtata aacgtacagg 59341 aaaggttaaa aatccgctgg cgctttaaac gtcatactat tgccatacgc cggatgggta 59401 atcgtcaaca tctctgcatg tagcaacaaa cgtggtgcca tcgctctcgc ttctggtgat 59461 gcataaaaac gatcgccgag aatcggatga cccagcgcca gcatatgcac acgcaattga 59521 tgcgaacgcc cggtaatcgg ttttaacacc actcttgccg tgttatccgc cgcatactcc 59581 accacttcat attccgtctg cgccaggttt acccgttttc gtaacagcac tttctgtttc 59641 gggcggtttg gccagtcgca aatcagcggc agatccacca gaccttctgc gggggatgga 59701 tgcccccaga cgcgggccac atactgcttt ttcggctcgc gctcgcgaga actggcgttt 59761 taactcccgc tccgcggctt tggtcagcgc cactacaatc acgccgctgg tagccatatc 59821 cagacgatgc accgattctg cctgcggata atcacgctga atgcgcgtca tcacgctgtc 59881 tttgtgctct tccagacgac ccggcactga 
caacaaaccg ctcggcttgt tgaccaccat 59941 aatatggtca tcctgataca ggataaccaa ccagggttcc tgcggtggat tgtagttttc 60001 catccccatt ttcggctccg ttactgatgc gttacaacga tcaaacgcag ggcatccaga 60061 cgccaacctg cctgatccag gctttccatt acctgctgac ggttgctctc aatggcggtc 60121 agttcgtcgt cacgaatgtt cgggttcact gcacgcagag cttccagacg agacagctcg 60181 gcagacagtt tttcgtcggc ttcgttacgc gctgcatcaa tcaatgcacg ggcagatttc 60241 tcgatctgcg cttcacccag ttgaaggata gcgtgaacat cctgctgcac ggcgttaacc 60301 agtttgctgc cggtgtgacg gttaaccgcg ttaagctggc ggttaaaggt ttcaaactct 60361 acctgcgccg ccaggttgtt gccgttttta tccagcagca tacgtaccgg cgtcggtggc 60421 aggaagcggt tgagctgcaa ctgcttcgga gcctgggctt caaccacata aatcagttcc 60481 accaacagcg tacctaccgg caacgctttg ttttttaaca gtgaaatcgt gctgctaccg 60541 gtatcgccag aaaggatcag atccagaccg ttgcggatca gcggatgctc ccaggtaata 60601 aactgtgcat cttcacgcgc cagcgccact tcacgatcaa aggtgatggt gatgccatct 60661 tccgacaggc cagggaagtc cggcaccagc atatgatcgg acggcgtcag cacgatcatg 60721 ttgtcgccgc gatcgtcctg attgataccg ataatatcga acaggttcat ggcgaaggcg 60781 atcaggttgg tatcgtcatc ctgctcttca atgctttctg ccagtgcctg ggctttttcg 60841 ccaccgttgg agtggatttc cagcaggcgg tcacgaccct gttccagctg tgctttcagc 60901 gcttcatgtt gctcgcggca gtttttgatc agatcgtcaa agccttcggt ttgatccggt 60961 gaagccagat agttaatcag atcgttgtat acgctatcgt aaatagtgcg tccggtcggg 61021 caggtgtgct caaatgcatc cagaccttcg tgataccagc gcaccagcac cgactgagcg 61081 gttttctcca gataaggcac atggatctga atatcgtgcg cctggccgat acgatccaga 61141 cgaccaatac gctgctccag tagatccggg ttgaatggca ggtcaaacat caccatgtgg 61201 ctggcgaact ggaagttacg tccttcagaa ccgatttctg agcacagcag tacctgtgcg 61261 ccggtgtctt cttcggcaaa ccaggcggca gcgcggtcac gttcgataat cgacatacct 61321 tcgtggaaca ccgcagcgcg aataccttca cgttcgcgca gtacctgctc cagttgcagc 61381 gcagtggcag ctttggcgca gatcaccagc actttctgag agcgatggct ggtcaggtag 61441 cccatcagcc actcaacgcg cggatcgaag ttccaccagg tggcgttatc accttcaaat 61501 tcctgataaa tacgctccgg gtagagcata tcgcgagcac gatcttccgc acttttacgt 61561 gcgcccataa 
tgccggagac tttaatagcc gtctgatact gcgtcggtag cggcagctta 61621 atggtgtgca gctcgcgttt cgggaatcct ttcacaccgt tacgcgtgtt acggaacagc 61681 acgcggctgg tgccgtggcg atccatcagc atcgaaacca gctcctgacg ggcgctctgg 61741 gcatcttcgc tgtcgctgtt tgctgcctgc aacagcggct cgatatcctg ctcgccgatc 61801 atctcgccga gcatgttcag ttcgtcattg ctcagtttgt tacctgccag cagcatggca 61861 acggcgtccg caaccggacg ataatttttc tgctcttcaa cgaactgcgc aaaatcgtgg 61921 aaacggttcg ggtccagcag acgcagacgg gcgaagtggc tttccatccc cagctgttcc 61981 ggggtcgcgg tcagcagcag aacgcccggc acgtgctctg ccagttgttc aatggcctga 62041 tattcacggc ttggcgcatc ttcgctccac accaggtgat gcgcttcatc gaccaccagc 62101 aggtcccatt cggcttcaca gagatgttcc aggcgctgtt tgctacgacg ggcaaaatcc 62161 agcgagcaaa tcaccagctg ttcggtgtca aacgggttgt aagcatcgtg ctgagcttcg 62221 gcataacgct catcatcaaa tagcgcaaag cgcaggttga aacggcgcag catttctacc 62281 agccactgat gctgtaaggt ttccgggacg ataattagca cacgttcagc agcgccagag 62341 agcagttgct gatgcaggat catcccggct tcaatggttt tccctaaacc cacttcgtca 62401 gccagcagga cgcgcggcgc gtggcggcga ccaacatcat gagcgatgtt gagctgatgc 62461 gggatcaggc tggtacgctg accgcgcagg ccgctgtacg gcatacggaa ctgttcgctg 62521 gaatatttac gcgcgcgata acgcagcgca aagcggtcca tacggtcaat ctgcccggca 62581 aacagacggt cctgcggttt gctgaacacc agtttgctat caaggaaaac ttcacgcagg 62641 gctacgccgg actcttcagt atccaggcga gtaccgatat aggtcagcaa gccattttct 62701 tcttttactt cttcgacttg catctgccag ccgtcatggc tggtaatggt atcaccaggg 62761 ttgaacatca cgcgggtcac gggggaatca ctgcgtgcgt acagacggtt ttcaccagta 62821 gatgggaaaa gtaaagtgac agttcgcgca tccaccgcga caacggttcc aagtcccaat 62881 tcgctttctg tatcgctgat ccagcgttga ccaagtgtaa aaggcatatg tgttcggctc 62941 tatatcttta attgcaggca ataaccaccc gctaccgtgc ttatgaggta gtggtgttat 63001 tcaggtccag gaatggaaag ggcgctatgg tactggatgg caaagcattc gtcacgcatc 63061 aaaatggtat ctggcgaact cttttttttg ctcaaaatag cccaagttgc ccggtcataa 63121 gtgtagcaaa attatcctca ataaaaggga gtattccctc cgccacgggt tgtagctggc 63181 gggtcagata gtgttcgtaa tccagtggtg aacgttggta gtccagcggc tccgggccgt 
63241 tggtggtcca tacgtactta atggtgccgc gattctgata ttgcaagggg cgaccacgct 63301 tttggttttc ttcatcggca aggcgagcgg cgcgtacatg aggcggcaca ttacgctgat 63361 actcgctcag cggacggcga aggcgtttac ggtaaaccag tcgcgcatcc agttcacccg 63421 ccatcagttt gtcgatggtt tcgcgtacat attcctgata tggctcgttg cggaagatgc 63481 gcaggtatag ctcctgctga aactgctggg ccagcggcgt ccagtcggtg cgcacggttt 63541 ccagcccttt aaacaccatc cgctgcttgt cgccctcctg aatcagtccg gcataacgct 63601 ttttactgcc ggtatcggct ccgcgaatgg ttggcatcag aaaacggcag aaatgggttt 63661 catactccag ttctaatgcg ctggtcagcc gttgtttttg cagcgtttcc gcccaccagg 63721 cgttaacgtg ctgcaccagt gcacgaccga ttttcgccgc ttcttcttcc gaatgtgcgc 63781 ctttcagcca gacaaacgtt gagtcggtat cgccgtagat aacgtcgtag ccctgtgctt 63841 caatcaacgc tttggtttgc cgcatgatct gatgaccacg catggtgatc gacgatgcca 63901 gccgcggatc gaagaagcgg caggcggtgg tgccgagcac gccataaaag gcattcatga 63961 tgattttcag cgcctgcgac agcggtttgt taccctggcg tttggcttca tcgcgcccgt 64021 gccagatgtt agtcacaatc tccggcaggc aatgtttttc tcgcgagaac caggcatcga 64081 gaaaaccttc ggtactgtgc tctggatcag gctgcgccat gccttccacc agcccgacgg 64141 gatcaatcag aaaggtgcgg atgatcgacg ggtacaggct tttatagtcc agcaccagca 64201 ctgaatcata aagccctggc cgtgaatcca tcacgtagcc gccagggctg gcgtgcggcg 64261 gcacttcgcc gagattaggc gcgacataac cagcgcgatg cattcgcgga aaatagagat 64321 gaccaaatgc cgccaccgaa ccgccgtgtc ggtccaccgg caggccgttc accgttgccc 64381 gttcgagtaa aaatggcatg atttcagttt tgtggaagat ctgcgtcacc agctcgcaat 64441 ctttcaggtt ataagttgcc agcgcaggtt tatcttcggc gaaacggcgg tcaatttcgt 64501 ccattcgatc ccacgggtta tcgatagatt ttccttcgcc taatagctcc tgagcgacag 64561 tttccagcga gaatgaagag aaatcccaga acgcggattt cagcgcctcg ataccgtcga 64621 taattagccg acctttagcc tggggcaaaa aagacgccgt ttttgcgggg ccgtgctcgc 64681 gccactccag ctcgctatta tcgcgcccaa gacgcagcgg aagacggtaa cggctcggca 64741 tgtttttgca gcattcgcag atcgaactgc accacgttcc aaccgatgat cacatcagga 64801 tcgtagttgg caaaccaggc gttgagtttt tccagcaact gcgggcggct ggcgacgtat 64861 tccagttcga aatcaagcga ggaggcgtcg ccattctccg 
gccccagcat ataaacgatg 64921 cgctgcccgc agccttccag gccgatgcag tacagctcac cgtggcgggt ggtttcaata 64981 tctatagaaa cccacttgag cggcggacga tagtcgggat gcggtttcag acgggcatta 65041 acgatagtgc cattgtgcat atcaccctcg acccacaccg gtgaggtgat aaaccgctcc 65101 atcagatagc gttctggcgg acgcacatcg gcctcgtaga cggtaacgcc accttcacgc 65161 aggcgctttt cgtaattcat caattggcga tgggcgcgac agtaaaggcc atacaccggc 65221 tggcggtgaa aatcctttaa cgccagcggt gtcaggcgaa agccttgttc accctgcaaa 65281 atatgctgag cgcggggaac ctgatcggcg ggaataaacg ccacggactc ttgcggtgca 65341 agcgtaacct gcaacggccc gttgtccgtc gccagccaga aggagacttc tgtcccttgc 65401 ggggtgtccc gccagtgtcg ggttaagata aaacctgcct gcgccacgct gaaaatccat 65461 caaaaaacca ggcttgagta tagcctggtt tcgtttgatt ggctgtggtt ttatacagtc 65521 attactgccc gtaatatgcc ttcgcgccat gcttacgcag atagtgttta tccagcagcg 65581 tttgctgcat atccggtaac tgcggcgcta actgacggca gaatatcccc atataagcga 65641 cctcttccag cacgatggcg ttatgcaccg catcttcggc atttttgccc catgcaaacg 65701 ggccgtggga atggaccaga acgccgggca tttgcgctgc atcgataccc tgtttttcaa 65761 aggtttctac gatgacgtta ccggtttccc actcatattc gccgttgatt tctgcgtcgg 65821 tcattttgcg ggtgcaggga atggtgccgt agaaatagtc ggcgtgggtg gtgccggttg 65881 ctggaatcga ctgacccgcc tgcgcccaga tggtggcgtg gcgcgagtgc gtatgcacaa 65941 tgccgccaat ggaggggaat gcctgataga gcagccggtg agttggcgtg tcggaggagg 66001 gctttttcgt accttcaacc acttcaccgg tttcgatgct aaccacgacc atatcgtcag 66061 cggtcatgac gctgtaatcg acgccggaag gtttgatcac aaagacgccg cgctcgcgat 66121 caacggcgct gacgttgccc catgtgagcg tgaccaggtt gtgttttggc agcgccaggt 66181 tggcttctaa tacctggcgt ttgagatctt ctaacatgtt gactccttcg tgccggatgc 66241 gctttgctta tccggcctac aaaatcgcag cgtgtaggcc tgataagacg cgccagcgtc 66301 gcatcaggcg ttgaatgccg gatgcgcttt gcttatccgg cctacaaaat cgcagcgcgt 66361 aggcctgata agacgcgcca gcgtcgcatc aggcgttgaa tgccggatgc gctttgctta 66421 tccggcctac aaaatcgcag cgtgtaggcc agataagacg cgtcagcgtc gcatcaggcg 66481 ttacataccg gatgcggcta cttagcgacg aaacccgtaa tacacttcgt tccagcgcag 66541 cgcgtcttta aacgctggca 
ggcgtgtgtc gttatcaatc accgtgattt caatgtcgtg 66601 catctcggcg aattggcgca tatcgttgag gttcagtgca tggctgaaga cggtatggtg 66661 cgcgccacca gcgaggatcc acgcttcgga agcagttggc agatccggtt gcgctttcca 66721 cagcgcattc gccaccggca gtttcggcag ggagtgcggt gttttcaccg tgtcgataca 66781 gttaaccagc agacggtaac gatcgccgag atcaatcagg ctggcgacaa tcgctggacc 66841 ggtttgggta ttgaagatca gtcgggcagg atcgtcctta ccaccaatac cgagatgctg 66901 aacgtcgagg atcggtttct cttctacggc aatcgacggg cagacttcca gcatatggga 66961 gccgagcacc aagtcattac ctttctcgaa gtgataggtg tagtcctcca taaaggaggt 67021 gccgccctgc agaccggttg acatcacctt catgatgcga agcagggcgg cggttttcca 67081 gtcgccttcg cccgcaaagc cgtaaccctg ctgcatcaga cgctgtacgg ccagacctgg 67141 aagctgtttc agaccgtgca aatcttcaaa ggtggtggtg aacgcgtgga agccaccttg 67201 ttccaggaaa cgcttcatcc ccagctcaat acgcgccgct tccagcacgt tctgtcgttt 67261 ttcgccgtgg atttgtgttg caggcgtcat ggtgtagcag ctttcgtact catcgaccag 67321 cgcgttaaca tcgccgtcgc tgatggagtt caccacctgc accagatcgc caaccgccca 67381 ggtattgacg gagaaaccga acttgatctg tgcggcaact ttatcaccat cggtgaccgc 67441 cacttcacgc atgttatcgc caaaacggca gactttcaga tgacgggtat cctgtttaga 67501 aaccgcctga cgcatccagg agccgatacg ctcatgggct tgtttatcct gccagtgacc 67561 ggtaacgacg gcatgttgct gacgcatacg cgcgccaatg aagccgaact cgcgaccgcc 67621 atgtgcagtc tggttcaggt tcataaagtc catatcgata ctgtcccacg gcagcgccgc 67681 gttgaactgg gtgtggaatt gcagcaacgg tttgttgagc atggtcaggc cgttgatcca 67741 cattttggcc ggggagaagg tgtgcagcca caccaccaga ccagcgcacg gatcgtcgta 67801 attcgcgtcg cggcaaatag cggtgatttc atccggcgtg gtgcccagcg gtttcaacac 67861 cagtttgcag ggcagtttcg cttccgtatt cagcgcatta acaacgtgct cggcatgttg 67921 ggtgacctga cgcagggttt ccgggccata cagatgctgg ctgccaatga caaaccacac 67981 ttcataatta tcaaaaatcg tcattatcgt gtccttatag agtcgcaacg gcctgggcag 68041 cctgtgccgg ggcggaagtt ggaagatagt gttgttcggc gctcatcgcc cattgctgat 68101 agcggcgata aagctgttca aagcgttgtg cctgttcgct gcgcggttgc agggttttct 68161 ctaccgcact ggccattttt tgctgggctg atgggatgtc tgcgtgcact ttcgcggcga 68221 
cggcagcaaa aatcgccgca ccgagcgcac agcactggtc agaggcaaca atttgcagcg 68281 ggcgattcag cacgtcgcag caggcctgca taatgacttg gtttttccgc gcgatgccgc 68341 ccagcgccat cacgttattg acggcgatcc cctgatcggt aaagcactcc atgattgcgc 68401 gtgcgccaaa ggcggtggca gcaatcaaac cgccgaacag cagcggagcg tcggtagcga 68461 ggttaagatc ggtaatcacc cctttcaggc gttggttagc gtttggcgag cgacgaccgt 68521 taaaccagtc gagcaccacc ggcaggtgat ccagagacgg atttttggcc catgcttcgg 68581 tcagcgccgg aagcagttgt ttctggctgg cgttgatttg cgctttcagt tccggatgct 68641 gggcggcaag ctgttccagc ggccagctga gtacgcgacc gaaccaggcg tagatatcac 68701 caaacgccga ttggcctgct tccagaccga taaatccagg caccacgctg ccatcaacct 68761 gaccgcaaat acctttaact gcccgctcgc caacgctctg tttgtcggca atcagaatgt 68821 cgcaggtgga agtaccgata acttttacca gtgcgttagg ctgtgcgcct gcgccaactg 68881 cgcccatatg gcagtcaaac gcgccgccgg aaatcaccac gctttcaggc aggccgagac 68941 gctgcgccca ttccgggcat aaggtgccca ccggaatatc ggcagtccag gtgtcagtga 69001 acagcgggga aggcaaatgg cgattgagga tcgggtccag ctcatcaaag aaactggctg 69061 gcggcaagcc gccccagctt tcgtgccaca gagatttatg cccggcgctg caacgtccgc 69121 gacgaatatc ctgcgggcgg gtggtaccgg aaagcagagc tggcacccag tcgcacagct 69181 caatccacga tgcggcagat tgcgccacgg cgctgtcctg gcgagtcaca tgcaggattt 69241 ttgcccagaa ccattcgctg gaataaatac cgccaatata gcgggagtag tcaacattgc 69301 ccggcgcgtg gcacaaacgg gtaatctctt cgcttctttc aaccgcagtg tggtctttcc 69361 acaatacgaa catcgcgttc gggttttcgg caaactccgg gcgcagcgcc agcacgttac 69421 cgtcggcatc aatcggtgcg ggcgtcgagc cggtactgtc aacgccaatc ccgaccacag 69481 ctgcgcgctg ttcgacgcta agctctgcaa gcacggtttt cagtgccgct tccattgact 69541 caatgtagtc acgcggatga tgacggaact ggttattcgg ggcatcacaa aattgccctt 69601 tttgccaacg gggataccac tctacgctgg tggcgatctc ttcaccgctg gcgcagtcca 69661 ccgccaaagc tcgcacagaa tcactgccaa aatcgaggcc aattgcaatc gccatcgttt 69721 cactccatcc aaaaaaacgg gtatggagaa acagtagaga gttgcgataa aaagcgtcag 69781 gtaggatccg ctaatcttat ggataaaaat gctatggcat agcaaagtgt gacgccgtgc 69841 aaataatcaa tgtggacttt tctgccgtga ttatagacac ttttgttacg 
cgtttttgtc 69901 atggctttgg tcccgctttg ttacagaatg cttttaataa gcggggttac cggttgggtt 69961 agcgagaaga gccagtaaaa gacgcagtga cggcaatgtc tgatgcaata tggacaattg 70021 gtttcttctc tgaatggtgg gagtatgaaa agtatggctg aagcgcaaaa tgatcccctg 70081 ctgccgggat actcgtttaa cgcccatctg gtggcgggtt taacgccgat tgaggccaac 70141 ggttatctcg atttttttat cgaccgaccg ctgggaatga aaggttatat tctcaatctc 70201 accattcgcg gtcagggggt ggtgaaaaat cagggacgag aatttgtctg ccgaccgggt 70261 gatattttgc tgttcccgcc aggagagatt catcactacg gtcgtcatcc ggaggctcgc 70321 gaatggtatc accagtgggt ttactttcgt ccgcgcgcct actggcatga atggcttaac 70381 tggccgtcaa tatttgccaa tacgggtttc tttcgcccgg atgaagcgca ccagccgcat 70441 ttcagcgacc tgtttgggca aatcattaac gccgggcaag gggaagggcg ctattcggag 70501 ctgctggcga taaatctgct tgagcaattg ttactgcggc gcatggaagc gattaacgag 70561 tcgctccatc cgccgatgga taatcgggta cgcgaggctt gtcagtacat cagcgatcac 70621 ctggcagaca gcaattttga tatcgccagc gtcgcacagc atgtttgctt gtcgccgtcg 70681 cgtctgtcac atcttttccg ccagcagtta gggattagcg tcttaagctg gcgcgaggac 70741 caacgtatca gccaggcgaa gctgcttttg agcactaccc ggatgcctat cgccaccgtc 70801 ggtcgcaatg ttggttttga cgatcaactc tatttctcgc gagtatttaa aaaatgcacc 70861 ggggccagcc cgagcgagtt tcgtgccggt tgtgaagaaa aagtgaatga tgtagccgtc 70921 aagttgtcat aattggtaac gaatcagaca attgacggct tgacggagta gcatagggtt 70981 tgcagaatcc ctgcttcgtc catttgacag gccacattat gcaagcattg cggaacactt 71041 tattacccaa ccaccgtgtt cattgatggc ggtggtgttg gtggcctttc tggagtcgct 71101 ggcgctggtc ggtttgattc tacccggtac ggtgctgatg gcggggctgg gagcgctgat 71161 tggcagcggc gagttaagtt tctggcacgc ctggctggca gggattattg gctgcttgat 71221 gggcgactgg atttctttct ggctgggttg gcgttttaaa aagccgttgc atcgctggtc 71281 atttctgaag aaaaacaaag cactacttga taaaactgaa catgcgttgc atcaacacag 71341 catgttcacc attctggtcg gtcgttttgt tggcccgacg cgtccgctgg tgccaatggt 71401 ggcgggaatg ctggatctgc cggtggctaa atttattacg ccgaatatta tcggctgcct 71461 gctgtggccg ccgttttact tcctgccagg gattctggcg ggcgcggcga tcgatattcc 71521 tgccggaatg cagagcggtg agtttaaatg 
gttgctgctg gcaacagcgg tgtttttgtg 71581 ggttggtggc tggctgtgct ggcggttatg gcgcagcggt aaagcgactg accgtttgag 71641 tcattatttg tcccgcggtc gtttgttgtg gctgacgccg ttgatttctg ccatcggcgt 71701 ggtggcgctg gtggtgttaa ttcgccaccc gttgatgccg gtgtatatcg atattttgcg 71761 taaagtggtt ggggtttagg agatagtctt gtgcgggttg cctgagcgcg acgcttgccg 71821 cgtcttatca ggcctacaaa acgcactacc cgtaggtcgg ataaggcgtt cacgccgcat 71881 ccgacagtgc atactaaccc gtaatcccca atagtgccga agcactcgcc ttaccgctca 71941 acaactcatt ggtcataccc tgccaggcga tgcgcccgtc ggcgactact accgagcgcg 72001 tggcgatccg cgccgcatct tccacgctgt gcgacaccat caatagcgtc attttttgct 72061 gctggcagct cgtgctcacc agcgtcaaca tctcctgacg taacgccgga tcgagcgcag 72121 agaacagttc atcgagcaat aaaatcggct gttcgcgtac cagacaacgc gctaacgcca 72181 ctcgctgtcg ctgaccgccg gaaagctcgc ccggtaaccg cgccattaaa ttatcaatcc 72241 ccatctggcg ggcgatagcg tgcattttcc cctgctgtac cgcgttcagt ttcaatcccg 72301 gatttagccc cagcccgatg ttctgtgcga ccgtcaggtg gctgaacagg ttgttctcct 72361 gaaacagcat cgacaccgga cggcgtgacg gcggcatagt tgtgtgatct acgccatcga 72421 tagtcagcga accgctggct ggcgtcagaa aaccggcgat caaattcagc agggtacttt 72481 tacccgcgcc gcttggcccg aggatcgcca cctgctcgcc gcgttccacc gttaagctaa 72541 aacgcatcgg caaatggtgg taaagccagg tgatatcagt cagttttaac atttcgcccc 72601 ggtagttttt caatcacggt aaacagcaga aaacagagca gcagcagaat taacgtggtg 72661 accgcaccgt cctggctgcg ataggagcca atttgctggt agagataaaa cggcagggtg 72721 cggaaatcat cgttaccgaa caacgccacc acgccaaaat caccaatcga cagcacgcat 72781 gcaaaggcca gcgcctgcgc cagtggacgt ttcagggcgc gcagctccac cacttttaag 72841 cgtgaccagc cttcaatccc cagcgactga cataacatgc tgtagcgggc ggtgatatcg 72901 cgcatcgggt tttccagcac tttcagcgca taagggatcg ccattaacgc attggtgaaa 72961 atcacaatgc cgtcagcaga ttgtggcagg ccgatagtgt tgttgagcag taaaaagaag 73021 ccggtagcca gcacaatccc cggcatggcg aggatcaaca tgccgctcat ctccagcacc 73081 tgacccgcca gcattttctg ccgcgcccgc agttcgcgac tgctccatag cagcatcatg 73141 gtcagcacta cgcacaatac acctgccgcc agcgcaatac gcaacgaggt ccacagcgcc 73201 tgccacagca 
ccggttgtgc cagcacttcc ggcaactggc gatttacccc atcgacgatc 73261 accgccagta acggtggcag caacagcagc agcgccagca caattaacac cgtgtcgcaa 73321 atgcggctat gcagacgatc gtccgggtcg cgccagcctt gcagcagcgt ggtgccgggc 73381 gcaatggcct tactcaatcg ctgactcaac agcaccagcc cgaggcagca caccatctgg 73441 agcagcgcca gcattgccgc gcgggcagga tcgtagtcgt aactcagcgc ctgatagatt 73501 gccagctcga tagtggtcgc ctgcggaccg ccccccagcg atagcacggt ggcgaagctg 73561 gcgaaacaga gcataaagat aagcgcagca accggcggga tttgtcgccg taaccacggc 73621 cattcgacga agcggaaaaa atgccagcta cgcatcccaa gctgggcggc aagttgacgc 73681 tgttcgccgg ggatgttttc cagtgcctgg agtaataagc ggctcgccat cggcagatta 73741 aaaaacacat gggccagcaa aataccttgc aggccgtagg gcgaaaaggt ccactccaga 73801 ccgagcgatt ggcagagtgt tgccagccag ccctggcgac catagacgct aagaatgccg 73861 aaaacagcga ccaacaccgg gaggatcaag gtcattgcac acagacgcaa cagcgccagc 73921 cgacccggaa agcgcctgcg atagagcgcg cgggcgagga atatcgcggg tatgacagag 73981 agcagtgccg agagaaacgc ctgccagaag gagaagcgca ccacatgcca cagatagctg 74041 tcctgccaga ctgccaccca gtcatcctgc ggcgcgttcc accacagggc gagaaacgcc 74101 gccagcgcaa ccgctaccac cagcgtggtg gcgcttacac ctggaattaa ccagccggga 74161 attaacggct gacggcgcgt tgccattcgc taatccatgc ctgacgttgt gccgccactt 74221 cggctggcgt gaactccaac gtggttgcgg gtttggtcaa tttttcaaaa ccggcaggca 74281 gcgtgacgtt tgccaccgga tacatccagt tgccggttgg gatcgcattc tggaaagccg 74341 gagaaaccat aaactggagg aatttttgcg ccagctccgg ctgcttgctg gcagcggtgc 74401 gggcggcgac ttccacttgc agatagtgac cttcgctgaa gttcgcggcg gcgtagttat 74461 ctttcttctc ttcgagaatg tgataagccg gagaggtggt gtaactcagt accagatcgc 74521 tttcaccttt taaaaacagg ccgtaggctt cgctccagcc tttggtgacc gtgaccgttt 74581 tcttcgccag tttctgccag gcttgtgggg cgtcatcgcc atagactttt tgcatccata 74641 gcaacagacc cagccccggt gtactggtgc gcggatcctg ataaatcacc cgccagtttt 74701 gatcgctctc aaccagttct ttcaggcttt gtggcgggtt tttcagtttg ttcttgtcat 74761 aaacgaaggc gaagtagcca taatcaaacg gtacgaaagt gtcattattc cagccgccgg 74821 gaacgttaac ggcatccgct gccacaccgc ttttggcaaa cagtccggtt ttactggcgg 
74881 cgtctaacag gttgttatcc agccccagca ccacatcggc tttactgttt ttgccttcca 74941 tccgtagacg gttgagaagc gaaacgccat cttccagcgc caccagtttc agttcgcaat 75001 tacagtcggc ttcaaaggct tttttaacca ccggaccagg cccccagtcg gcggcgaagg 75061 aatcgtaggt ataaacagtc agaacgggtt tagcgaaaac gggcgctgtc acagcaacag 75121 caggggcaga cattttttta acactttgca cctcaaaaaa gagtggcaaa ggacttgaga 75181 aggagcctca aatcccttcg ccggcgttat ccggatcagg ttcgacgggt attttctcag 75241 cgcacgcgta cgcgtggcac cccgttgaga acggcgttag tgtagtgatt ttgttatcaa 75301 ccagcaatca tggatccggt ggcgcaaacc acgctgattt aaaatcgaac cagccgaggg 75361 tattcatgcg caggccgcgc atactgcgtt gcccctgaat gatcagccag tggtgcaata 75421 atggcaccat cgctttgctg gcgaccagtt gctggcacca gttcgccaga ttcatctcgc 75481 cattgcgcca gcgagcagcg tcggcttgcc agtcaatggg aatgcaatgt tgtagcagtg 75541 gcacttcgca taaatgtgcg aaaacagaga agtccagcgg cagggtaaag ttggcgctgt 75601 ttagccagat atcactttcg atctctcctg tatgccactg atcgtagtcg atctctttga 75661 ttttcagcgt cacctggtga cttgccagaa tctgctgcat gatcccggca atcacccgat 75721 gctcactgtg atcctgacaa aaggttaggg tgaggctttc caggccagcc ggtttttcgc 75781 tctttatggt gcgggcatgg tgccaacggg ggaacagtcc ataagccggg aaccacagtt 75841 gctggtactg ttcctcagcg aaatagacca agttagttgg agaaagcaca tagcttgccc 75901 agtccctgac ttgctgattc gccccgcgat gggtgcggct gtcgaacagt aaatagtagc 75961 aaccttcctc caggcggctt tcaatctctt tttcctcgcc ctgtggacct tttagcatca 76021 gccctccggc tggctcgtcg gcaatttccg gcagaaccca gacgttaact tcgtcgatta 76081 atgcccggta accgaagaag tcatcgaatg cctgaatttt cagttgattg gtgctgttgc 76141 gaatcaccgc atacggaccg gtgccgatgg gatggctggc aaagttactg agggtttccc 76201 attcgcgcgg caggatcatc gccggaactt gccccagcag taacggtaac cagcggtccg 76261 gttgcgtgag atggatatcc agcgtccagg gcgtcggcga cacaatgtca gcaatatgcg 76321 aatagagcgg cagcgtattg attcgtttta aagaggcgat cacatcgtcc atttccagtt 76381 cacgaccatg gtgaaaatgg actcctggac gcaaaaagaa acgccagtga agcggtgaaa 76441 tttgctgcca gtggtgggcg atgtctgctt ccagttcccc attttcctca tttatgcgcg 76501 ttagcgaact gaagatttgc cgggcgatat gggtttcgga 
acggcgcaat gcgctgccag 76561 gtagcagatt acgcaacgga cgatagtaga gcacgcgcag gatgtgccgc ccctggcgga 76621 agctgcggcc cagatgagaa accagcattt gccgcacagt cgctttgtcg ccaaccaact 76681 gcaccagttg atcgatacga tcctgctcca gcaggtcttc cgcccgctgt tgctgaagcg 76741 ccagcccggt atagaggaat gtcagacgcg agcgtttacc gcgcccgact tccgcttccc 76801 acgtcagcca gccgcgatcc tgcatggtgt tgagcagggt gcgcatatga cgacgcgagc 76861 agctcaataa cgctgccagt tcgttgagcg ttgtgtcctg cgatttaccc tcgcagcatt 76921 gccacaggcg gatgaactgt tgttgcagac gagcagatgg cataaaaggg gaactcctgt 76981 gcaaaagaca gcaattttat tgtccctata ttaagtcaat aattcctaac gatgaagcaa 77041 gggggtgccc catgcgtcag ttttatcagc actattttac cgcgacagcg aagttgtgct 77101 ggttgcgttg gttaagcgtc ccacaacgat taaccatgct tgaaggactg atgcagtggg 77161 atgaccgcaa ttctgaaagt tgacttgcct gcatcatgtg tgactgagta ttggtgtaaa 77221 atcacccgcc agcagattat acctgctggt tttttttatt ctcgccgcgc taaaaaggga 77281 acgtatgatc tggataatga cgatggctcg ccgtatgaac ggtgtttacg cggcatttat 77341 gctggtcgct tttatgatgg gggtggccgg ggcgctacag gctcctacat tgagcttatt 77401 tctgagtcgt gaggttggcg cgcaaccttt ctggatcggc ctcttttatg cggtgaatgc 77461 tattgctggg atcggcgtaa gcctctggtt ggcaaaacgt tctgacagtc agggcgatcg 77521 gcgaaaactg attatatttt gctgtttgat ggctatcggc aatgcgctat tgtttgcatt 77581 taatcgtcat tatctgacgc ttatcacctg tggtgtgctt ctggcatctc tggccaatac 77641 ggcaatgcca cagttatttg ctctggcgcg ggaatatgcg gataactcgg cgcgagaagt 77701 ggtgatgttt agctcggtga tgcgtgcgca gctttctctg gcatgggtta tcggtccacc 77761 gttggccttt atgctggcgt tgaattacgg ctttacggtg atgttttcga ttgccgccgg 77821 gatattcaca ctcagtctgg tattgattgc atttatgctt ccgtctgtgg cgcgggtaga 77881 actgccgtcg gaaaatgctt tatcaatgca aggtggctgg caggatagta acgtacggat 77941 gttatttgtc gcctcgacgt taatgtggac ctgcaacacc atgtacatta ttgatatgcc 78001 gttgtggatc agtagcgagt taggattgcc agacaaactg gcgggtttcc tgatggggac 78061 ggcagctgga ctggaaatac cagcaatgat tctggctggc tactatgtca aacgttatgg 78121 taagcggcga atgatggtca tagcagtggc ggcaggagta ctgttttaca ccggattgat 78181 tttctttaat agccgtatgg 
cgttgatgac gctgcaactt tttaacgctg tatttatcgg 78241 cattgttgcg ggtattggga tgctatggtt tcaggattta atgcctggaa gagcgggggc 78301 agctaccacc ttatttacta acagtatttc taccggggta attctggctg gcgttattca 78361 gggagcaatt gcacaaagtt gggggcactt tgctgtctac tgggtaattg cggttatttc 78421 tgttgtcgca ttatttttaa ccgcaaaggt taaagacgtt tgatgacgtg gacgatagcg 78481 gaaagcccgg tcatttgacc gggcaagggg attaattcat aaacgcaggt tgttttgctt 78541 cataagcggc aatggcgtcg tcgtgctgca aggtaagccc aatactgtcc agaccgttca 78601 tcatgcagtg gcggcggaag gcatcgatgg taaagcgata ggttttctct cccgctttca 78661 cctcttgcgc ttccagatcc acgtcgaaat ggatccccgg attagctttc accagcgcaa 78721 acagttcgtc cacttctgca tcgcttaatt tcaccggcag cagctggttg ttaaagctat 78781 tgccgtagaa gatgtcagca aaactcggcg caatcaccac tttaaaaccg tagtcggtca 78841 atgcccaggg cgcgtgctca cgcgaagagc cacagccgaa gttttctcgt gccagcaaaa 78901 tggaagcgcc ctgatactgc gggaagttca gcacgaagtc cgggtttggc tgttggcctt 78961 tttcatccag aaaacgccag tcgttaaaca gatgcgcgcc aaaacccgta cgggtcactt 79021 tctgcaaaaa ctgtttcggg atgattgcat cggtatcgac attggcggca tccagcggaa 79081 ccaccaggcc tgtgtgtttg ataaatttct ctgccatggt gtgctcctta tttaatgttg 79141 cgaatgtcgg cgaaatgtcc ggtcacagca gcagcggcag ccattgccgg gctgaccaga 79201 tgcgtgcgcc cgccgcgccc ctggcggcct tcaaagttac ggttgctggt ggaggcacaa 79261 cgttcgcccg gattcagacg gtcgttgttc atcgccagac acattgagca gccaggcaag 79321 cgccattcaa aaccggcttc aataaagatt ttatccagac cttccgcttc cgcctgggct 79381 tttaccgggc cagagccggg aaccaccagt gcctgcacgc ctggcgcgac ttttcgccct 79441 ttgacgatct ccgctgccgc gcgtaaatct tcaatgcgcg agttggtaca ggaaccgata 79501 aacactttgt cgatagccac ttcggtcagc ggaatacccg gtttcagccc catataggcc 79561 agcgcttttt ctgccgacgc gcgttcaacc ggatcggcaa acgaagccgg atcgggaata 79621 ttgtcgttca cggaaatcac ctggccggga ttggtgcccc aggtgacctg cggtgaaatt 79681 tcttctgctt gcagagtgac aacggtatcg aaagttgcgc cttcgtcggt ttgcagggtt 79741 ttccagtagg caacggcgtc gtcgaaatct ttgcctttcg gcgcatgcag acggcctttg 79801 acatagttaa aggtggtttc gtccggtgca accagaccgg cttttgcgcc catttcgatt 79861 
gccatattgc acagggtcat acgaccttcc atgcttaaat cacggattgc ttcgccgcaa 79921 aactccacca catgcccggt gccgcctgcg ctaccggttt taccgataat tgccagcacg 79981 atatcttttg cggtaatgcc cggcgcggct ttgccctgga cttcaatttt catggttttt 80041 gcgcggccct gtttcagggt ttgcgttgcc agtacgtgtt caacttcgga agtgccgata 80101 ccaaaggcca gtgcgccaaa cgcgccgtgg gtggcggtat gcgagtcgcc gcagacaatg 80161 gtcatccccg gcaaggtgac gccctgttcc ggccccatta cgtggacgat cccctgatac 80221 gggtgattca ggtcatacag ttcgacgcca aattctttgc agtttttgat cagttcctgc 80281 atctggatac gcgccatttc accgcaggca ttaatgcctt tggtctgggt agagacgttg 80341 tgatccatgg tagcgaaggt tttgcccggc tgacgtaccg ggcgaccgtg ggcgcgcaga 80401 ccatcgaacg cctgcggtga ggtcacttca tgcaccaggt ggcggtcgat atataacagt 80461 ggggtttcgt tttcggcttc gtacacaacg tgagcgtcga acaatttttc gtataacgtc 80521 ttagccatga ttacacccct tctgctacat agcgggcaat gatatcgccc atttcatcgg 80581 tactaacggc ggcagcgcca cgggctaaat ccccggtgcg aatgccttct tctaatgcgc 80641 ggttaatggc gcgttcaatg gcgcaagccg catcatcggc atccaggctg taacgcagca 80701 gcagtgccag cgaaaggatt tgtgcaatcg ggttggcgat gtttttgcct gcgatatctg 80761 gtgccgagcc gcccgccggt tcatacagtc caaaaccttg ctcgttcagg ctggcggaag 80821 gcaacatccc catcgagcca gtgatcattg cgcactcgtc agacagaatg tcgccaaaca 80881 ggttggagca cagcagaacg tcaaactgtg atggatcttt aatcagctgc atggtggcgt 80941 tgtcgatgta catatgcgcc agttcgacat ccgggtattc cgtggcgatc tcgttaacga 81001 tctcccgcca taaaatagag gattgcagca cgttggcttt atcgatcgac gtcactttgt 81061 ggcgacgctt gcgagcagat tcaaacgcga tgcgggcgat acgttcgatc tcaaaacggt 81121 gatacacctc ggtatcaaag gctttttcat attgtccgct accttcgcgg ccttttggct 81181 gaccgaaata gatgccgccg gtcagttcgc gcacacacag gatgtcgaag ccgtttgcgg 81241 caatgtctgc acgcagcgga cagaatgctt ccagcccctg atacagtttt gccgggcgca 81301 ggttgctgaa taatttgaag tgcttacgca gaggcagcag cgcgccgcgt tctggttgct 81361 ggtctggtgg taaatgttcc cacttcgggc cgcctaccga gccaaacagc acggcatcgg 81421 cttgctcaca accttcaacc gtcgcaggcg gcagtggttg cccgtggtta tcaatggctg 81481 cgccgcctac atcgtaatgg ctggtggtga tgcgcatcgc aaagcggttg 
cgcacggcat 81541 ccagcacttt cagcgcctgg gtcatcactt ccggaccaat accgtccccc ggcaatacgg 81601 caatatggta attcttcgac atcacacggt ttccttgttg ttttcgttgt gttgagcttt 81661 gcgttgcaac tctttttcga cttctgcggc acgccagata ttgttcagaa cgtgcaccat 81721 ggctttggca gatgactcga caatatcggt agccaggccg acgccgtgga agcggcgacc 81781 gttgtagtta gcgacgatat ccacctgacc cagcgcatct ttaccgtggc ctttggcggt 81841 caggctgtat ttcaccagtt cgacgttata ttcagtgatg cggttaattg cctgatagac 81901 ggcatcgacc ggaccgttac cgttggcggc ttctgctttg acttcttcgc cacaggccag 81961 tttgacggcg gcggtggcga tatcgttaga gccagactgc acgctgaagt aatccagacg 82021 gaaatgctcc ggctcttctt gctgcttacc gatgaaggcc agcgcctcca gatcgtaatc 82081 aaacacctga ccttttttgt ccgccagctt caggaaagca tcgtacaaat tgtctaaatt 82141 atattcactt tctttatacc ccatctcatc catgcgatgt ttcaccgccg cacgccccga 82201 acgagaggtc agattcagct ggatttggtt cagaccaata gattctggtg tcatgatttc 82261 gtagttttcg cggtttttca gcacgccatc ctggtgtata ccggaggagt gtgcgaatgc 82321 gccgctgcca acaatggctt tgtttgccgg gatcggcata ttacaaatct ggctaactaa 82381 ctggctggtg cgccatatct cctggtgatt aatggcggtg tggacgttga gaatatcctt 82441 acgaactttg atcgccatga tgacttcttc cagggaacag tttccggcac gctcgccgat 82501 cccgttcatt gcgccttcca cctggcgtgc accggcatgt accgccgcca gtgagtttcc 82561 gaccgccagg cccaaatcgt cgtgggtatg tacggagata atggctttgc cgatgctagg 82621 cacgcgttca tacaggccgc tgatgattcc ggcgaactca aacggcatgg tgtagcccac 82681 ggtgtccgga atgttgatgg tggtggcacc ggcattaatc gccgcttcga ccactcgcgc 82741 cagatcggca atgggtgtac gcccggcatc ttcgcaagaa aattcaacat catcggtgta 82801 attacgggcg cgtttcacca tatagatagc gcgttcgatc acctcgtcca gcgtgctgcg 82861 cagcttggtg gcgatgtgca ttggcgaagt ggcaataaag gtatgaatac ggaaggcttc 82921 ggcgactttc agggattcgg ccgccacgtc gatatctttt tccacgcagc gagctaacgc 82981 acatacgcgg ctgtttttaa cctggcgggc gatggtttgc accgattcaa aatcgcccgg 83041 cgaagagacg gggaaaccga cttccatcac gtcaacaccc atacgctcaa gggccagcgc 83101 aatttgcagt ttttctttca cactcaagct tgcctgtaac gcctgttcac cgtcgcgcaa 83161 tgtggtatcg aaaataatga cttgctggct 
catggtttgg gtccttgtct cttttagagc 83221 gcctcgcttc gggcataaaa aaacccgcgc aatggcgcgg gttttttgtt tgactgcgtg 83281 ctggcttaat gctggatgcc gctcactcgt ctaccgcgca aagaagatgc gtttagtagt 83341 agtagaccga taaagcgaac gatgtgagtc attaaatcag ctccagatga atgcgatatg 83401 cttttagagt tactggatac aaaaacggat gtcaaccctg acgcaataaa aacgtcccgc 83461 cagcgtgagt tctgcatccg taaaattagc taattgtgct gcggtggtta aagtaagcga 83521 tattaatttc tgcttaacta ccgacgcttt tcatcggttg acatatttca gcataaattt 83581 ttgcatctaa tcaacgagga aaaaggggac aaaatgcacg cgttgcaaaa cctatcctga 83641 tgatttgtat tgaattatat gttttgcgat tttttttgat attgatttgg tgaatattat 83701 tgatcaatta atgttaagaa ttaatgcatt aaatatataa attaattatt aaataagcac 83761 atttaatcca ttttgtagat gattgagtat tcgcggtagt tatgattaga ttgttttcgc 83821 aacaaaaaca ttatggatta ttatgctgtg gtaaatgact cattccacgg caatggattc 83881 tgtttttatc agaacccgta tctttatgtt ttccgaattt tactcatttt gctttttctt 83941 attttatatg catgataaat catattcttc aggattattt ctctgcattc caataaggga 84001 aagggagtta agtgtgacag tggagttaag tatgccagag gtacaaacag atcatccaga 84061 gacggcggag ttaagcaaac cacagctacg catggtcgat ctcaacttat taaccgtttt 84121 cgatgccgtg atgcaggagc aaaacattac tcgtgccgct catgttctgg gaatgtcgca 84181 acctgcggtc agtaacgctg ttgcacgcct gaaggtgatg tttaatgacg agctttttgt 84241 tcgttatggc cgtggtattc aaccgactgc tcgcgcattt caactttttg gttcagttcg 84301 tcaggcattg caactagtac aaaatgaatt gcctggttca ggttttgaac ccgcgagcag 84361 tgaacgtgta tttcatcttt gtgtttgcag cccgttagac agcattctga cctcgcagat 84421 ttataatcac attgagcaga ttgcgccaaa tatacatgtt atgttcaagt cttcattaaa 84481 tcagaacact gaacatcagc tgcgttatca ggaaacggag tttgtgatta gttatgaaga 84541 cttccatcgt cctgaattta ccagcgtacc attatttaaa gatgaaatgg tgctggtagc 84601 cagcaaaaat catccaacaa ttaagggccc gttactgaaa catgatgttt ataacgaaca 84661 acatgcggcg gtttcgctcg atcgtttcgc gtcatttagt caaccttggt atgacacggt 84721 agataagcaa gccagtatcg cgtatcaggg catggcaatg atgagcgtac ttagcgtggt 84781 gtcgcaaacg catttggtcg ctattgcgcc gcgttggctg gctgaagagt tcgctgaatc 84841 cttagaatta 
caggtattac cgctgccgtt aaaacaaaac agcagaacct gttatctctc 84901 ctagcatgaa gctgccgggc gcgataaagg ccatcagtgg atggaagagc aattagtctc 84961 aatttgcaaa cgctaactga ttgcagaata ggtcagacat gaatgtctgg tttattctgc 85021 attttttatt gaatgtagaa ttttattctg aatgtgtggg ctctctattt taggattaat 85081 taaaaaaata gagaaattgc tgtaagttgt gggattcagc cgatttatta tcaatttaat 85141 cctctgtaat ggaggatttt atcgtttctt ttcacctttc ctcctgttta ttcttattac 85201 cccgtgttta tgtctctggc tgccaattgc ttaagcaaga tcggacggtt aatgtgtttt 85261 acacattttt tccgtcaaac agtgaggcag gccatggaga tgttgtctgg aggcgagatg 85321 gtcgtccgat cgcttatcga tcagggcgtt aaacaagtat tcggttatcc cggaggcgca 85381 gtccttgata tttatgatgc attgcatacc gtgggtggta ttgatcatgt attagttcgt 85441 catgagcagg cggcggtgca tatggccgat ggcttggcgc gcgcgaccgg ggaagtcggc 85501 gtcgtgctgg taacgtcggg tccaggggcg accaatgcga ttactggcat cgccaccgct 85561 tatatggatt ccattccatt agttgtcctt tccgggcagg tagcgacctc gttgataggt 85621 tacgatgcct ttcaggagtg cgacatggtg gggatttcgc gacccgtggt taaacacagt 85681 tttctggtta agcaaacgga agacattccg caggtgctga aaaaggcttt ctggctggcg 85741 gcaagcggtc gcccaggacc agtagtcgtt gatttaccga aagatattct taatccggcg 85801 aacaaattac cctatgtctg gccggagtcg gtcagtatgc gttcttacaa tcccactact 85861 accggacata aagggcaaat taagcgtgct ctgcaaagcg tggtagcggt aaaaaaaccg 85921 gttgtctacg taggcggtgg ggcaatcacg gcgggctgcc atcagcagtt gaaagaaacg 85981 gtggaggcgt tgaatctgcc cgttgtttgc tcattgatgg ggctgggggc gtttccggca 86041 acgcatcgtc aggtactggg tatgctggga atgcacggta cctacgaagc caatatgacg 86101 atgcataacg cggatgtgat tttcgccgtc ggggtacgat ttgatgaccg aacgacgaac 86161 aatctggcaa agtactgccc aaatgccact gttctgcata tcgatattga tcctacttcc 86221 atttctaaaa ccgtgactgc ggatatcccg attgtggggg atgctcgcca agtcctcgaa 86281 caaatgcttg aactcttgtc gcaagaatcc gcccatcaac cactggatga gatccgcgac 86341 tggtggcagc aaattgaaca gtggcgcgct cgtcagtgcc tgaaatatga cactcacagt 86401 gaaaagatta aaccgcaggc ggtgatcgag actctttggc ggttgacgaa gggagacgct 86461 tacgtgacgt ccgatgtcgg gcagcaccag atgtttgctg cactttatta tccattcgac 
86521 aaaccgcgtc gctggatcaa ttccggtggc ctcggcagca tgggttttgg tttacctgcg 86581 gcactgggcg tcaaaatggc gttcccagaa gaaaccgtgg tttgcgtcac tggcgacggc 86641 agtattcaga tgaacatcca ggaactgtct accgcgttgc aatacgagtt gcccgtactg 86701 gtggtgaatc tcaataaccg ctatctgggg atggtgaagc agtggcagga catgatctat 86761 tccggccgtc attcacaatc ttatatgcaa tcgctacccg atttcgtccg tcgcggagcc 86821 tatgggcatg tcgggatcca gatttctcat ccgcatggct ggaaagcaaa cttagcgagg 86881 cgctggaaca ggtgcgcaat aatcgcctgg tgtttgttga tgttaccgtc gatggcagcg 86941 agcacgtcta cccgatgcag attcgcgggg gcggaatgga tgaaatgtgg ttaagcaaaa 87001 cggagagaac ctgattatgc gccggatatt atcagtctta ctcgaaaatg aatcaggcgc 87061 gttatcccgc gtgattggcc ttttttccca gcgtggctac aacattgaaa gcctgaccgt 87121 tgcgccaacc gacgatccga cattatcgcg tatgaccatc cagaccgtgg gcgatgaaaa 87181 agtacttgag cagatcgaaa agcaattaca caaactggtc gatgtcttgc gcgtgagtga 87241 gttggggcag ggcgcgcatg ttgagcggga aatcatgctg gtgaaaattc aggccagcgg 87301 ttacgggcgt gacgaagtga aacgtaatac ggaaatattc cgtgggcaaa ttatcgatgt 87361 cacaccctcg ctttataccg ttcaattagc aggcaccagc ggtaagctta gtgcattttt 87421 agcatcgatt cgcgatgtgg cgaaaattgt ggaggttgct cgctctggtg tggtcggact 87481 ttcgcgcggc gataaaataa tgcgttgaga atgatctcaa tgcgcaattt acagcccaac 87541 atgtcacgtt gggctttttt tgcgaaatca gtgggaacct ggaataaaag cagttgccgc 87601 agttaatttt ctgcgcttag atgttaatga atttaaccca taccagtaca atggctatgg 87661 tttttacatt ttacgcaagg ggcaattgtg aaactggatg aaatcgctcg gctggcggga 87721 gtgtcgcgga ccactgcaag ctatgttatt aacggcaaag cgaagcaata ccgtgtgagc 87781 gacaaaaccg ttgaaaaagt catggctgtg gtgcgtgagc acaattacca cccgaacgcc 87841 gtggcagctg ggcttcgtgc tggacgcaca cgttctattg gtcttgtgat ccccgatctg 87901 gagaacacca gctatacccg catcgctaac tatcttgaac gccaggcgcg gcaacggggt 87961 tatcaactgc tgattgcctg ctcagaagat cagccagaca acgaaatgcg gtgcattgag 88021 caccttttac agcgtcaggt tgatgccatt attgtttcga cgtcgttgcc tcctgagcat 88081 cctttttatc aacgctgggc taacgacccg ttcccgattg tcgcgctgga ccgcgccctc 88141 gatcgtgaac acttcaccag cgtggttggt gccgatcagg 
atgatgccga aatgctggcg 88201 gaagagttac gtaagtttcc cgccgagacg gtgctttatc ttggtgcgct accggagctt 88261 tctgtcagct tcctgcgtga acaaggtttc cgtactgcct ggaaagatga tccgcgcgaa 88321 gtgcatttcc tgtatgccaa cagctatgag cgggaggcgg ctgcccagtt attcgaaaaa 88381 tggctggaaa cgcatccgat gccgcaggcg ctgttcacaa cgtcgtttgc gttgttgcaa 88441 ggagtgatgg atgtcacgct gcgtcgcgac ggcaaactgc cttctgacct ggcaattgcc 88501 acctttggcg ataacgaact gctcgacttc ttacagtgtc cggtgctggc agtggctcaa 88561 cgtcaccgcg atgtcgcaga gcgtgtgctg gagattgtcc tggcaagcct ggacgaaccg 88621 cgtaagccaa aacctggttt aacgcgcatt aaacgtaatc tctatcgccg cggcgtgctc 88681 agccgtagct aagccgcgaa caaaaatacg cgccaggtga atttccctct ggcgcgtaga 88741 gtacgggact ggacatcaat atgcttaaag taaataagac tattcctgac tattattgat 88801 aaatgctttt aaacccgccc gttaattaac tcaccagctg aaattcacaa taattaagtg 88861 atatcgacag cgcgtttttg cattattttg ttacatgcgg cgatgaattg ccgatttaac 88921 aaacactttt ctttgctttt gcgcaaaccc gctggcatca agcgccacac agacgtaaca 88981 aggactgtta accggggaag atatgtccta aaatgccgct cgcgtcgcaa actgacactt 89041 tatatttgct gtggaaaata gtgagtcatt ttaaaacggt gatgacgatg agggattttt 89101 tcttacagct attcataacg ttaatttgct tcgcacgttg gacgtaaaat aaacaacgct 89161 gatattagcc gtaaacatcg ggttttttac ctcggtatgc cttgtgactg gcttgacaag 89221 cttttcctca gctccgtaaa ctcctttcag tgggaaattg tggggcaaag tgggaataag 89281 gggtgaggct ggcatgttcc ggggagcaac gttagtcaat ctcgacagca aagggcgctt 89341 atcagtgcct acccgttatc gggaacagct gcttgagaac gctgccggtc aaatggtttg 89401 caccattgac atttatcacc cgtgcctgct gctttacccc ctgcctgaat gggaaattat 89461 cgagcaaaaa ttatcgcgtc tgtcgagcat gaacccggtt gagcgccgtg tgcagcgcct 89521 actgttaggt catgccagcg aatgtcagat ggatggcgca ggtcgattgt taatcgcgcc 89581 agtactgcgg caacatgccg ggctgacaaa agaagtgatg ctggttggac agttcaacaa 89641 gtttgagctg tgggatgaaa caacctggca tcaacaggtc aaggaagata tcgacgcaga 89701 gcagttggct accggagact tatcggagcg actgcaggac ttgtctctat aaaatgatgg 89761 aaaactataa acatactacg gtgctgctgg atgaagccgt taatggcctc aatatccgtc 89821 ctgatggcat ctacattgat 
gggacttttg gtcgcggtgg tcactcacgt ctgatcctct 89881 cgcagcttgg cgaagagggg cgtttgctgg cgatcgatcg cgacccgcag gctatcgccg 89941 ttgcgaagac tattgatgat ccgcgcttct ccatcatcca cggacctttc tccgcgctgg 90001 gcgaatacgt tgccgagcgc gatcttatcg gcaagatcga cggcattctc ctcgatcttg 90061 gcgtctcttc accgcaactt gatgatgctg aacgtggctt ttcctttatg cgcgatggtc 90121 cgctggacat gcgtatggac ccaacccgtg ggcagtcagc cgctgaatgg ctacaaaccg 90181 cagaagaagc cgatatcgcc tgggtattga aaacctatgg tgaagagcgt tttgccaaac 90241 gcattgcccg cgccattgtc gagcgtaacc gcgaacagcc gatgacccgc accaaagaac 90301 tggcggaagt cgtggctgct gcaacgccgg tgaaagataa gtttaaacat cccgcgaccc 90361 gtaccttcca ggcggtgcgc atttgggtaa acagtgaact ggaggagata gagcaggcgc 90421 taaaaagctc gctcaacgtg ctggccccgg gtgggcggct ttcgatcatc agcttccact 90481 cgctggaaga ccgtattgtg aaacgtttta tgcgtgaaaa cagccgcggt ccgcaagttc 90541 cggcagggtt accgatgact gaagagcagc tcaaaaaact gggtggccgt cagctgcgag 90601 cactaggcaa gttaatgccg ggcgaagaag aggtggctga gaaccctcgt gcccgtagtt 90661 cagttctgcg tattgcagag aggacgaatg catgatcagc agagtgacag aagctctaag 90721 caaagttaaa ggatcgatgg gaagccacga gcgccatgca ttgcctggtg ttatcggtga 90781 cgatcttttg cgatttggga agctgccact ctgcctgttc atttgcatta ttttgacggc 90841 ggtgactgtg gtaaccacgg cgcaccatac ccgtttactg accgctcagc gcgaacaact 90901 ggtgctggag cgagatgctt tagacattga atggcgcaac ctgatccttg aagagaatgc 90961 gctcggcgac catagccggg tggaaaggat cgccacggaa aagctgcaaa tgcagacatg 91021 ttgatccgtc acaagaaaat atcgtagtgc aaaaataagg ataaacgcga cgcatgaaag 91081 cagcggcgaa aacgcagaaa ccaaaacgtc aggaagaaca tgccaacttt atcagttggc 91141 gttttgcgtt gttatgcggc tgtattctcc tggcgctggc ttttctgctc ggacgcgtag 91201 cgtggttaca agttatctcc ccggatatgc tggtgaaaga gggcgacatg cgttctcttc 91261 gcgttcagca agtttccacc tcccgcggca tgattactga ccgttctggt cgcccgttag 91321 cggtgagcgt gccggtaaaa gcgatttggg ctgacccgaa agaagtgcat gacgctggcg 91381 gtatcagcgt cggtgaccgc tggaaggcgc tggctaacgc gctcaatatt ccgctggatc 91441 agctttcagc ccgcattaac gccaacccga aagggcgctt tatttatctg gcgcgtcagg 91501 
tgaaccctga catggcggac tacatcaaaa aactgaaact gccggggatt catctgcgtg 91561 aagagtctcg ccgttactat ccgtccggcg aagtgactgc tcacctcatc ggctttacta 91621 acgtcgatag tcaagggatt gagggcgttg agaagagttt cgataaatgg cttaccgggc 91681 agccgggtga gcgcattgtg cgtaaagacc gctatggtcg cgtaattgaa gatatttctt 91741 ctactgacag ccaggcagcg cacaacctgg cgctgagtat tgatgaacgc ctgcaggcgc 91801 tggtttatcg cgaactgaac aacgcggtgg cctttaacaa ggctgaatct ggtagcgccg 91861 tgctggtgga tgtcaacacc ggtgaagtgc tggcgatggc taacagcccg tcatacaacc 91921 ctaacaatct gagcggcacg ccgaaagagg cgatgcgtaa ccgtaccatc accgacgtgt 91981 ttgaaccggg ctcaacggtt aaaccgatgg tggtaatgac cgcgttgcaa cgtggcgtgg 92041 tgcgggaaaa ctcggtactc aataccattc cttatcgaat taacggccac gaaatcaaag 92101 acgtggcacg ctacagcgaa ttaaccctga ccggggtatt acagaagtcg agtaacgtcg 92161 gtgtttccaa gctggcgtta gcgatgccgt cctcagcgtt agtagatact tactcacgtt 92221 ttggactggg aaaagcgacc aatttggggt tggtcggaga acgcagtggc ttatatcctc 92281 aaaaacaacg gtggtctgac atagagaggg ccaccttctc tttcggctac gggctaatgg 92341 taacaccatt acagttagcg cgagtctacg caactatcgg cagctacggc atttatcgcc 92401 cactgtcgat taccaaagtt gaccccccgg ttcccggtga acgtgtcttc ccggaatcca 92461 ttgtccgcac tgtggtgcat atgatggaaa gcgtggcgct accaggcggc ggcggcgtga 92521 aggcggcgat taaaggctat cgtatcgcca ttaaaaccgg taccgcgaaa aaggtcgggc 92581 cggacggtcg ctacatcaat aaatatattg cttataccgc aggcgttgcg cctgcgagtc 92641 agccgcgctt cgcgctggtt gttgttatca acgatccgca ggcgggtaaa tactacggcg 92701 gcgccgtttc cgcgccggtc tttggtgcca tcatgggcgg cgtattgcgt accatgaaca 92761 tcgagccgga tgcgctgaca acgggcgata aaaatgaatt tgtgattaat caaggcgagg 92821 ggacaggtgg cagatcgtaa tttgcgcgac cttcttgctc cgtgggtgcc agacgcacct 92881 tcgcgagcac tgcgagagat gacactcgac agccgtgtgg ctgcggcggg cgatctcttt 92941 gtagctgtag taggtcatca ggcggacggg cgtcgatata tcccgcaggc gatagcgcaa 93001 ggtgtcgctg ccattattgc agaggcgaaa gatgaggcga ccgatggtga aatccgtgaa 93061 atgcacggcg taccggtcat ctatctcagc cagctcaacg agcgtttatc tgcactggcg 93121 ggccgctttt accatgaacc ctctgacaat ttacgtctcg tgggcgtaac 
gggcaccaac 93181 ggcaaaacca cgactaccca gctgttggcg cagtggagcc aactgcttgg cgaaatcagc 93241 gcggtaatgg gcaccgttgg taacggcctg ctggggaaag tgatcccgac agaaaataca 93301 accggttcgg cagtcgatgt tcagcatgag ctggcggggc tggtggatca gggcgcgacg 93361 ttttgcgcaa tggaagtttc ctcccacggg ctggtacagc accgtgtggc ggcattgaaa 93421 tttgcggcgt cggtctttac caacttaagc cgcgatcacc ttgattatca tggtgatatg 93481 gaacactacg aagccgcgaa atggctgctt tattctgagc atcattgcgg tcaggcgatt 93541 attaacgccg acgatgaagt gggccgccgc tggctggcaa aactgccgga cgcggttgcg 93601 gtatcaatgg aagatcatat taatccgaac tgtcacggac gctggttgaa agcgaccgaa 93661 gtgaactatc acgacagcgg tgcgacgatt cgctttagct caagttgggg cgatggcgaa 93721 attgaaagcc atctgatggg cgcttttaac gtcagcaacc tgctgctcgc gctggcgaca 93781 ctgttggcac tcggctatcc actggctgat ctgctgaaaa ccgccgcgcg tctgcaaccg 93841 gtttgcggac gtatggaagt gttcactgcg ccaggcaaac cgacggtggt ggtggattac 93901 gcgcatacgc cggatgcact ggaaaaagcc ttacaggcgg cgcgtctgca ctgtgcgggc 93961 aagctgtggt gtgtctttgg ctgtggtggc gatcgcgata aaggtaagcg tccactgatg 94021 ggcgcaattg ccgaagagtt tgctgacgtg gcggtggtga cggacgataa cccgcgtacc 94081 gaagaaccgc gtgccatcat caacgatatt ctggcgggaa tgttagatgc cggacatgcc 94141 aaagtgatgg aaggccgtgc tgaagcggtg acttgcgccg ttatgcaggc taaagagaat 94201 gatgtggtac tggtcgcggg caaaggccat gaagattacc agattgttgg caatcagcgt 94261 ctggactact ccgatcgcgt cacggtggcg cgtctgctgg gggtgattgc atgattagcg 94321 taacccttag ccaacttacc gacattctca acggtgaact gcaaggtgca gatatcaccc 94381 ttgatgctgt aaccactgat acccgaaaac tgacgccggg ctgcctgttt gttgccctga 94441 aaggcgaacg ttttgatgcc cacgattttg ccgaccaggc gaaagctggc gcggcaggcg 94501 cactactggt tagccgtccg ctggacatcg acctgccgca gttaatcgtc aaggatacgc 94561 gtctggcgtt tggtgaactg gctgcatggg ttcgccagca agttccggcg cgcgtggttg 94621 ctctgacggg gtcctccggc aaaacctccg ttaaagagat gacggcggcg attttaagcc 94681 agtgcggcaa cacgctttat acggcaggca atctcaacaa cgacatcggt gtaccgatga 94741 cgctgttgcg cttaacgccg gaatacgatt acgcagttat tgaacttggc gcgaaccatc 94801 agggcgaaat agcctggact gtgagtctga 
ctcgcccgga acgtgcgctg gtcaacaacc 94861 tggcagcggc gcatctggaa ggttttggct cgcttgcggg tgtcgcgaaa gcgaaaggtg 94921 aaatctttag cggcctgccg gaaaacggta tcgccattat gaacgccgac aacaacgact 94981 ggctgaactg gcagagcgta attggctcac gcaaagtgtg gcgtttctca cccaatgccg 95041 ccaacagcga tttcaccgcc accaatatcc atgtgacctc gcacggtacg gaatttaccc 95101 tacaaacccc aaccggtagc gtcgatgttc tgctgccgtt gccggggcgt cacaatattg 95161 cgaatgcgct ggcagccgct gcgctctcca tgtccgtggg cgcaacgctt gatgctatca 95221 aagcggggct ggcaaatctg aaagctgttc caggccgtct gttccccatc caactggcag 95281 aaaaccagtt gctgctcgac gactcctaca acgccaatgt cggttcaatg actgcagcag 95341 tccaggtact ggctgaaatg ccgggctacc gcgtgctggt ggtgggcgat atggcggaac 95401 tgggcgctga aagcgaagcc tgccatgtac aggtgggcga ggcggcaaaa gctgctggta 95461 ttgaccgcgt gttaagcgtg ggtaaacaaa gccatgctat cagcaccgcc agcggcgttg 95521 gcgaacattt tgctgataaa actgcgttaa ttacgcgtct taaattactg attgctgagc 95581 aacaggtaat tacgatttta gttaagggtt cacgtagtgc cgccatggaa gaggtagtac 95641 gcgctttaca ggagaatggg acatgttagt ttggctggcc gaacatttgg tcaaatatta 95701 ttccggcttt aacgtctttt cctatctgac gtttcgcgcc atcgtcagcc tgctgaccgc 95761 gctgttcatc tcattgtgga tgggcccgcg tatgattgct catttgcaaa aactttcctt 95821 tggtcaggtg gtgcgtaacg acggtcctga atcacacttc agcaagcgcg gtacgccgac 95881 catgggcggg attatgatcc tgacggcgat tgtgatctcc gtactgctgt gggcttaccc 95941 gtccaatccg tacgtctggt gcgtgttggt ggtgctggta ggttacggtg ttattggctt 96001 tgttgatgat tatcgcaaag tggtgcgtaa agacaccaaa gggttgatcg ctcgttggaa 96061 gtatttctgg atgtcggtca ttgcgctggg tgtcgccttc gccctgtacc ttgccggcaa 96121 agacacgccc gcaacgcagc tggtggtccc attctttaaa gatgtgatgc cgcagctggg 96181 gctgttctac attctgctgg cttacttcgt cattgtgggt actggcaacg cggtaaacct 96241 gaccgatggt ctcgacggcc tggcaattat gccgaccgta tttgtcgccg gtggttttgc 96301 gctggtggcg tgggcgaccg gcaatatgaa ctttgccagc tacttgcata taccgtatct 96361 gcgacacgcc ggggaactgg ttattgtctg taccgcgata gtcggggcag gactgggctt 96421 cctgtggttt aacacctatc cggcgcaggt ctttatgggc gatgtaggtt cgctggcgtt 96481 aggtggtgcg 
ttaggcatta tcgccgtact gctacgtcag gaattcctgc tggtgattat 96541 ggggggcgtg ttcgtggtag aaacgctttc tgtcatcctg caggtcggct cctttaaact 96601 gcgcggacaa cgtattttcc gcatggcacc gattcatcac cactatgaac tgaaaggctg 96661 gccggaaccg cgcgtcattg tgcgtttctg gattatttcg ctgatgctgg ttctgattgg 96721 tctggcaacg ctgaaggtac gttaatcatg gctgattatc agggtaaaaa tgtcgtcatt 96781 atcggcctgg gcctcaccgg gctttcctgc gtggactttt tcctcgctcg cggtgtgacg 96841 ccgcgcgtta tggatacgcg tatgacaccg cctggcctgg ataaattacc cgaagccgta 96901 gaacgccaca cgggcagtct gaatgatgaa tggctgatgg cggcagatct gattgtcgcc 96961 agtcccggta ttgcactggc gcatccatcc ttaagcgctg ccgctgatgc cggaatcgaa 97021 atcgttggcg atatcgagct gttctgtcgc gaagcacaag caccgattgt ggcgattacc 97081 ggttctaacg gcaaaagcac ggtcaccacg ctagtgggtg aaatggcgaa agcggcgggg 97141 gttaacgttg gtgtgggtgg caatattggc ctgcctgcgt tgatgctact ggatgatgag 97201 tgtgaactgt acgtgctgga actgtcgagc ttccagctgg aaaccacctc cagcttacag 97261 gcggtagcag cgaccattct gaacgtgact gaagatcata tggatcgcta tccgtttggt 97321 ttacaacagt atcgtgcagc aaaactgcgc atttacgaaa acgcgaaagt ttgcgtggtt 97381 aatgctgatg atgccttaac aatgccgatt cgcggtgcgg atgaacgctg cgtcagcttt 97441 ggcgtcaaca tgggtgacta tcacctgaat catcagcagg gcgaaacctg gctgcgggtt 97501 aaaggcgaga aagtgctgaa tgtgaaagag atgaaacttt ccgggcagca taactacacc 97561 aatgcgctgg cggcgctggc gctggcagat gctgcagggt taccgcgtgc cagcagcctg 97621 aaagcgttaa ccacattcac tggtctgccg catcgctttg aagttgtgct ggagcataac 97681 ggcgtacgtt ggattaacga ttcgaaagcg accaacgtcg gcagtacgga agcggcgctg 97741 aatggcctgc acgtagacgg cacactgcat ttgttgctgg gtggcgatgg taaatcggcg 97801 gactttagcc cactggcgcg ttacctgaat ggcgataacg tacgtctgta ttgtttcggt 97861 cgtgacggcg cgcagctggc ggcgctacgc ccggaagtgg cagaacaaac cgaaactatg 97921 gaacaggcga tgcgcttgct ggctccgcgt gttcagccgg gcgatatggt tctgctctcc 97981 ccagcctgtg ccagccttga tcagttcaag aactttgaac aacgaggcaa tgagtttgcc 98041 cgtctggcga aggagttagg ttgatgcgtt tatctctccc tcgcctgaaa atgccgcgcc 98101 tgccaggatt cagtatcctg gtctggatct ccacggcgct aaagggctgg gtgatgggct 
98161 cgcgggaaaa agataccgac agcctgatca tgtacgatcg caccttactg tggctgacct 98221 tcggcctcgc ggcgattggc tttatcatgg tgacctcggc gtcaatgccc atagggcaac 98281 gcttaaccaa cgatccgttc ttcttcgcga agcgtgatgg tgtctatctg attttggcgt 98341 ttattctggc gatcattacg ctgcgtctgc cgatggagtt ctggcaacgc tacagtgcca 98401 cgatgctgct cggatctatc atcctgctga tgatcgtcct ggtagtgggt agctcggtta 98461 aaggggcatc gcgttggatc gatctcggtt tgctgcgtat ccagcctgcg gagctgacaa 98521 aactgtcgct gttttgctat atcgccaact atctggtgcg taaaggcgac gaagtacgta 98581 ataacctgcg cggcttcctg aaaccgatgg gcgtgattct ggtgttggca gtgttactgc 98641 tggcacagcc agaccttggt acggtggtgg tgttgtttgt gactacgctg gcgatgttgt 98701 tcctggcggg agcgaaattg tggcagttca ttgccattat cggtatgggc atttcagcgg 98761 ttgtgttgct gatactcgcc gaaccgtacc gtatccgccg tgttaccgca ttctggaacc 98821 cgtgggaaga tccctttggc agcggctatc agttaacgca atcgctgatg gcgtttggtc 98881 gcggcgaact ttgggggcaa ggtttaggta actcggtaca aaaactggag tatctgccgg 98941 aagcgcacac tgactttatt ttcgccatta tcggcgaaga actggggtat gtcggtgtgg 99001 tgctggcact tttaatggta ttcttcgtcg cttttcgcgc gatgtcgatt ggccgtaaag 99061 cattagaaat tgaccaccgt ttttccggtt ttctcgcctg ttctattggc atctggttta 99121 gcttccaggc gctggttaac gtaggcgcgg cggcggggat gttaccgacc aaaggtctga 99181 cattgccgct gatcagttac ggtggttcga gcttactgat tatgtcgaca gccatcatga 99241 tgctgttgcg tattgattat gaaacgcgtc tggagaaagc gcaggcgttt gtacgaggtt 99301 cacgatgagt ggtcaaggaa agcgattaat ggtgatggca ggcggaaccg gtggacatgt 99361 attcccggga ctggcggttg cgcaccatct aatggctcag ggttggcaag ttcgctggct 99421 ggggactgcc gaccgtatgg aagcggactt agtgccaaaa catggcatcg aaattgattt 99481 cattcgtatc tctggtctgc gtggaaaagg tataaaagca ctgatagctg ccccgctgcg 99541 tatcttcaac gcctggcgtc aggcgcgggc gattatgaaa gcgtacaaac ctgacgtggt 99601 gctcggtatg ggaggctacg tgtcaggtcc aggtggtctg gccgcgtggt cgttaggcat 99661 tccggttgta cttcatgaac aaaacggtat tgcgggctta accaataaat ggctggcgaa 99721 gattgccacc aaagtgatgc aggcgtttcc aggtgctttc cctaatgcgg aagtagtggg 99781 taacccggtg cgtaccgatg tgttggcgct gccgttgccg 
cagcaacgtt tggctggacg 99841 tgaaggtccg gttcgtgtgc tggtagtggg tggttctcag ggcgcacgca ttcttaacca 99901 gacaatgccg caggttgctg cgaaactggg tgattcagtc actatctggc atcagagcgg 99961 caaaggttcg caacaatccg ttgaacaggc gtatgccgaa gcggggcaac cgcagcataa 100021 agtgacggaa tttattgatg atatggcggc ggcgtatgcg tgggcggatg tcgtcgtttg 100081 ccgctccggt gcgttaacgg tgagtgaaat cgccgcggca ggactaccgg cgttgtttgt 100141 gccgtttcaa cataaagacc gccagcaata ctggaatgcg ctaccgctgg aaaaagcggg 100201 cgcagccaaa attatcgagc agccacagct tagcgtggat gctgtcgcca acaccctggc 100261 cgggtggtcg cgagaaacct tattaaccat ggcagaacgc gcccgcgctg catccattcc 100321 ggatgccacc gagcgagtgg caaatgaagt gagccgggtt gcccgggcgt aattgtagcg 100381 atgccttttg catcgtatga atttaagaag ttaatggcgt aaagaatgaa tacacaacaa 100441 ttggcaaaac tgcgttccat cgtgcccgaa atgcgtcgcg ttcggcacat acattttgtc 100501 ggcattggtg gtgccggtat gggcggtatt gccgaagttc tggccaatga aggttatcag 100561 atcagtggtt ccgatttagc gccaaatccg gtcacgcagc agttaatgaa tctgggtgcg 100621 acgatttatt tcaaccatcg cccggaaaac gtacgtgatg ccagcgtggt cgttgtttcc 100681 agcgcgattt ctgccgataa cccggaaatt gtcgccgctc atgaagcgcg tattccggtg 100741 atccgtcgtg ccgaaatgct ggctgagtta atgcgttttc gtcatggcat cgccattgcc 100801 ggaacgcacg gcaaaacgac aaccaccgcg atggtttcca gcatctacgc agaagcgggg 100861 ctcgacccaa ccttcgttaa cggcgggctg gtaaaagcgg cgggggttca tgcgcgtttg 100921 gggcatggtc ggtacctgat tgccgaagca gatgagagtg atgcatcgtt cctgcatctg 100981 caaccgatgg tggcgattgt caccaatatc gaagccgacc acatggatac ctaccagggc 101041 gactttgaga atttaaaaca gacttttatt aattttctgc acaacctgcc gttttacggt 101101 cgtgcggtga tgtgtgttga tgatccggtg atccgcgaat tgttaccgcg agtggggcgt 101161 cagaccacga cttacggctt cagcgaagat gccgacgtgc gtgtagaaga ttatcagcag 101221 attggcccgc aggggcactt tacgctgctg cgccaggaca aagagccgat gcgcgtcacc 101281 ctgaatgcgc caggtcgtca taacgcgctg aacgccgcag ctgcggttgc ggttgctacg 101341 gaagagggca ttgacgacga ggctattttg cgggcgcttg aaagcttcca ggggactggt 101401 cgccgttttg atttcctcgg tgaattcccg ctggagccag tgaatggtaa aagcggtacg 101461 
gcaatgctgg tcgatgacta cggccaccac ccgacggaag tggacgccac cattaaagcg 101521 gcgcgcgcag gctggccgga taaaaacctg gtaatgctgt ttcagccgca ccgttttacc 101581 cgtacgcgcg acctgtatga tgatttcgcc aatgtgctga cgcaggttga taccctgttg 101641 atgctggaag tgtatccggc tggcgaagcg ccaattccgg gagcggacag ccgttcgctg 101701 tgtcgcacaa ttcgtggacg tgggaaaatt gatcccattc tggtgccgga tccggcgcgg 101761 gtagccgaga tgctggcacc ggtattaacc ggtaacgacc tgattctcgt tcagggggct 101821 ggtaatattg gaaaaattgc ccgttcttta gctgaaatca aactgaagcc gcaaactccg 101881 gaggaagaac aacatgactg ataaaatcgc ggtcctgttg ggtgggacct ccgctgagcg 101941 ggaagtttct ctgaattctg gcgcagcggt gttagccgga ctgcgtgaag gcggtattga 102001 cgcgtatcct gtcgacccga aagaagtcga cgtgacgcaa ctgaagtcga tgggctttca 102061 gaaagtgttt atcgcgctac acggtcgcgg cggtgaagat ggtacgctgc aggggatgct 102121 cgagctgatg ggcttgcctt ataccggaag cggagtgatg gcatctgcgc tttcaatgga 102181 taaactacgc agcaaacttc tatggcaagg tgccggttta ccggtcgcgc cgtgggtagc 102241 gttaacccgc gcagagtttg aaaaaggcct gagcgataag cagttagcag aaatttctgc 102301 tctgggtttg ccggttatcg ttaagccgag ccgcgaaggt tccagtgtgg gaatgtcaaa 102361 agtagtagca gaaaatgctc tacaagatgc attaagattg gcatttcagc acgatgaaga 102421 agtattgatt gaaaaatggc taagtgggcc ggagttcacg gttgcgatac tcggtgaaga 102481 aattttaccg tcaatacgta ttcaaccgtc cggaaccttc tatgattatg aggcgaagta 102541 tctctctgat gagacacagt atttctgccc cgcaggtctg gaagcgtcac aagaggccaa 102601 tttgcaggca ttagtgctga aagcatggac gacgttaggt tgcaaaggat ggggacgtat 102661 tgacgttatg ctggacagcg atggacagtt ttatctgctg gaagccaata cctcaccggg 102721 tatgaccagc cacagcctgg tgccgatggc ggcacgtcag gcaggtatga gcttctcgca 102781 gttggtagta cgaattctgg aactggcgga ctaatatgtc gcaggctgct ctgaacacgc 102841 gaaacagcga agaagaggtt tcttctcgcc gcaataatgg aacgcgtctg gcggggatcc 102901 ttttcctgct gaccgtttta acgacagtgt tggtgagcgg ctgggtcgtg ttgggctgga 102961 tggaagatgc gcaacgcctg ccgctctcaa agctggtgtt gaccggtgaa cgccattaca 103021 cacgtaatga cgatatccgg cagtcgatcc tggcattggg tgagccgggt acctttatga 103081 cccaggatgt caacatcatc 
cagacgcaaa tagaacaacg cctgccgtgg attaagcagg 103141 tgagcgtcag aaagcagtgg cctgatgaat tgaagattca tctggttgaa tatgtgccga 103201 ttgcgcggtg gaatgatcaa catatggtag acgcggaagg aaataccttc agcgtgccgc 103261 cagaacgcac cagcaagcag gtgcttccaa tgctgtatgg cccggaaggc agcgccaatg 103321 aagtgttgca gggctatcgc gaaatggggc agatgctggc aaaggacaga tttactctga 103381 aggaagcggc gatgaccgcg cggcgttcct ggcagttgac gctgaataac gatattaagc 103441 tcaatcttgg ccggggcgat acgatgaaac gtttggctcg ctttgtagaa ctttatccgg 103501 ttttacagca gcaggcgcaa accgatggca aacggattag ctacgttgat ttgcgttatg 103561 actctggagc ggcagtaggc tgggcgccct tgccgccaga ggaatctact cagcaacaaa 103621 atcaggcaca ggcagaacaa caatgatcaa ggcgacggac agaaaactgg tagtaggact 103681 ggagattggt accgcgaagg ttgccgcttt agtaggggaa gttctgcccg acggtatggt 103741 caatatcatt ggcgtgggca gctgcccgtc gcgtggtatg gataaaggcg gggtgaacga 103801 cctcgaatcc gtggtcaagt gcgtacaacg cgccattgac caggcagaat tgatggcaga 103861 ttgtcagatc tcttcggtat atctggcgct ttctggtaag cacatcagct gccagaatga 103921 aattggtatg gtgcctattt ctgaagaaga agtgacgcaa gaagatgtgg aaaacgtcgt 103981 ccataccgcg aaatcggtgc gtgtgcgcga tgagcatcgt gtgctgcatg tgatcccgca 104041 agagtatgcg attgactatc aggaagggat caagaatccg gtaggacttt cgggcgtgcg 104101 gatgcaggca aaagtgcacc tgatcacatg tcacaacgat atggcgaaaa acatcgtcaa 104161 agcggttgaa cgttgtgggc tgaaagttga ccaactgata tttgccggac tggcatcaag 104221 ttattcggta ttgacggaag atgaacgtga actgggtgtc tgcgtcgtcg atatcggtgg 104281 tggtacaatg gatatcgccg tttataccgg tggggcattg cgccacacta aggtaattcc 104341 ttatgctggc aatgtcgtga ccagtgatat cgcttacgcc tttggcacgc cgccaagcga 104401 cgccgaagcg attaaagttc gccacggttg tgcgctgggt tccatcgttg gaaaagatga 104461 gagcgtggaa gtgccgagcg taggtggtcg tccgccacgg agtctgcaac gtcagacact 104521 ggcagaggtg atcgagccgc gctataccga gctgctcaac ctggtcaacg aagagatatt 104581 gcagttgcag gaaaagcttc gccaacaagg ggttaaacat cacctggcgg caggcattgt 104641 attaaccggt ggcgcagcgc agatcgaagg tcttgcagcc tgtgctcagc gcgtgtttca 104701 tacgcaagtg cgtatcggcg cgccgctgaa cattaccggt 
ttaacggatt atgctcagga 104761 gccgtattat tcgacggcgg tgggattgct tcactatggg aaagagtcac atcttaacgg 104821 tgaagctgaa gtagaaaaac gtgttacagc atcagttggc tcgtggatca agcgactcaa 104881 tagttggctg cgaaaagagt tttaattttt atgaggccga cgatgattac ggcctcaggc 104941 gacaggcaca aatcggagag aaactatgtt tgaaccaatg gaacttacca atgacgcggt 105001 gattaaagtc atcggcgtcg gcggcggcgg cggtaatgct gttgaacaca tggtgcgcga 105061 gcgcattgaa ggtgttgaat tcttcgcggt aaataccgat gcacaagcgc tgcgtaaaac 105121 agcggttgga cagacgattc aaatcggtag cggtatcacc aaaggactgg gcgctggcgc 105181 taatccagaa gttggccgca atgcggctga tgaggatcgc gatgcattgc gtgcggcgct 105241 ggaaggtgca gacatggtct ttattgctgc gggtatgggt ggtggtaccg gtacaggtgc 105301 ggcaccagtc gtcgctgaag tggcaaaaga tttgggtatc ctgaccgttg ctgtcgtcac 105361 taagcctttc aactttgaag gcaagaagcg tatggcattc gcggagcagg ggatcactga 105421 actgtccaag catgtgaact ctctgatcac tatcccgaac gacaaactgc tgaaagttct 105481 gggccgcggt atctccctgc tggatgcgtt tggcgcagcg aacgatgtac tgaaaggcgc 105541 tgtgcaaggt atcgctgaac tgattactcg tccgggtttg atgaacgtgg actttgcaga 105601 cgtacgcacc gtaatgtctg agatgggcca cgcaatgatg ggttctggcg tggcgagcgg 105661 tgaagaccgt gcggaagaag ctgctgaaat ggctatctct tctccgctgc tggaagatat 105721 cgacctgtct ggcgcgcgcg gcgtgctggt taacatcacg gcgggcttcg acctgcgtct 105781 ggatgagttc gaaacggtag gtaacaccat ccgtgcattt gcttccgaca acgcgactgt 105841 ggttatcggt acttctcttg acccggatat gaatgacgag ctgcgcgtaa ccgttgttgc 105901 gacaggtatc ggcatggaca aacgtcctga aatcactctg gtgaccaata agcaggttca 105961 gcagccagtg atggatcgct accagcagca tgggatggct ccgctgaccc aagagcagaa 106021 gccggttgct aaagtcgtga atgacaatgc gccgcaaact gcgaaagagc cggattatct 106081 ggatatccca gcattcctgc gtaagcaagc tgattaagaa ttgactggaa tttgggtttc 106141 gattctcttt gtgctaaact ggcccgccga atgtatagta cacttcggtt ggataggtaa 106201 tttggcgaga taatacgatg atcaaacaaa ggacacttaa acgtatcgtt caggcgacgg 106261 gtgtcggttt acataccggc aagaaagtca ccctgacgtt acgccctgcg ccggccaaca 106321 ccggggtcat ctatcgtcgc accgacttga atccaccggt agatttcccg gccgatgcca 106381 
aatctgtgcg tgataccatg ctctgtacgt gtctggtcaa cgagcatgat gtacggattt 106441 caaccgtaga gcacctcaat gctgctctcg cgggcttggg catcgataac attgttatcg 106501 aagttaacgc gccggaaatc ccgatcatgg acggcagcgc cgctccgttt gtatacctgc 106561 tgcttgacgc cggtatcgac gagttgaact gcgccaaaaa atttgttcgc atcaaagaga 106621 ctgttcgtgt cgaagatggc gataagtggg ctgaatttaa gccgtacaat ggtttttcgc 106681 tggatttcac catcgatttt aaccatccgg ctattgattc cagcaaccag cgctatgcga 106741 tgaacttctc cgctgatgcg tttatgcgcc agatcagccg tgcgcgtacg ttcggtttca 106801 tgcgtgatat cgaatatctg cagtcccgtg gtttgtgcct gggcggcagc ttcgattgtg 106861 ccatcgttgt tgacgattat cgcgtactga acgaagacgg cctgcgtttt gaagacgaat 106921 ttgtgcgtca caaaatgctc gatgcgatcg gtgacttgtt catgtgtggt cacaatatta 106981 ttggtgcatt taccgcttat aaatccggtc atgcactgaa taacaaactg ctgcaggctg 107041 tcctggcgaa acaggaagcc tgggaatatg tgaccttcca ggacgacgca gaactgccgt 107101 tggccttcaa agcgccttca gctgtactgg cataacgaca tttatactgt cgtataaaat 107161 tcgactggca aatctggcac tctctccggc caggtgaacc agtcgttttt ttttgaattt 107221 tataagagct ataaaaaacg gtgcgaacgc tgttttctta agcacttttc cgcacaactt 107281 atcttcattc gtgctgtgga ctgcaggctt taatgataag atttgtgcgc taaatacgtt 107341 tgaatatgat cgggatggca ataacgtgag tggaatactg acgcgctggc gacagtttgg 107401 taaacgctac ttctggccgc atctcttatt agggatggtt gcggcgagtt taggtttgcc 107461 tgcgctcagc aacgccgccg aaccaaacgc gcccgcaaaa gcgacaaccc gcaaccacga 107521 gccttcagcc aaagttaact ttggtcaatt ggccttgctg gaagcgaaca cacgccgccc 107581 gaattcgaac tattccgttg attactggca tcaacatgcc attcgcacgg taatccgtca 107641 tctttctttc gcaatggcac cgcaaacact gcccgttgct gaagaatctt tgcctcttca 107701 ggcgcaacat cttgcattac tggatacgct cagcgcgctg ctgacccagg aaggcacgcc 107761 gtctgaaaag ggttatcgca ttgattatgc gcattttacc ccacaagcaa aattcagcac 107821 gcccgtctgg ataagccagg cgcaaggcat ccgtgctggc cctcaacgcc tcacctaaca 107881 acaataaacc tttacttcat tttattaact ccgcaacgcg gggcgtttga gattttatta 107941 tgctaatcaa attgttaact aaagttttcg gtagtcgtaa cgatcgcacc ctgcgccgga 108001 tgcgcaaagt ggtcaacatc 
atcaatgcca tggaaccgga gatggaaaaa ctctccgacg 108061 aagaactgaa agggaaaacc gcagagtttc gtgcacgtct ggaaaaaggc gaagtgctgg 108121 aaaatctgat cccggaagct ttcgccgtgg tacgtgaggc aagtaagcgc gtctttggta 108181 tgcgtcactt cgacgttcag ttactcggcg gtatggttct taacgaacgc tgcatcgccg 108241 aaatgcgtac cggtgaagga aaaaccctga ccgcaacgct gcctgcttac ctgaacgcac 108301 taaccggtaa aggcgtgcac gtagttaccg tcaacgacta cctggcgcaa cgtgacgccg 108361 aaaacaaccg tccgctgttt gaattccttg gcctgactgt cggtatcaac ctgccgggca 108421 tgccagcacc ggcaaagcgc gaagcttacg cagctgacat cacttacggt acgaacaacg 108481 aatacggctt tgactacctg cgcgacaaca tggcgttcag ccctgaagaa cgtgtacagc 108541 gtaaactgca ctatgcgctg gtggacgaag tggactccat cctgatcgat gaagcgcgta 108601 caccgctgat catttccggc ccggcagaag acagctcgga aatgtataaa cgcgtgaata 108661 aaattattcc gcacctgatc cgtcaggaaa aagaagactc cgaaaccttc cagggcgaag 108721 gccacttctc ggtggacgaa aaatctcgcc aggtgaacct gaccgaacgt ggtctggtgc 108781 tgattgaaga actgctggtg aaagagggca tcatggatga aggggagtct ctgtactctc 108841 cggccaacat catgctgatg caccacgtaa cggcggcgct gcgcgctcat gcgctgttta 108901 cccgtgacgt cgactacatc gttaaagatg gtgaagttat catcgttgac gaacacaccg 108961 gtcgtaccat gcagggccgt cgctggtccg atggtctgca ccaggctgtg gaagcgaaag 109021 aaggtgtgca gatccagaac gaaaaccaaa cgctggcttc gatcaccttc cagaactact 109081 tccgtctgta tgaaaaactg gcggggatga ccggtactgc tgataccgaa gctttcgaat 109141 ttagctcaat ctacaagctg gataccgtcg ttgttccgac caaccgtcca atgattcgta 109201 aagatctgcc ggacctggtc tacatgactg aagcggaaaa aattcaggcg atcattgaag 109261 atatcaaaga acgtactgcg aaaggccagc cggtgctggt gggtactatc tccatcgaaa 109321 aatcggagct ggtgtcaaac gaactgacca aagccggtat taagcacaac gtcctgaacg 109381 ccaaattcca cgccaacgaa gcggcgattg ttgctcaggc aggttatccg gctgcggtga 109441 ctatcgcgac caatatggcg ggtcgtggta cagatattgt gctcggtggt agctggcagg 109501 cagaagttgc cgcgctggaa aatccgaccg cagagcaaat tgaaaaaatt aaagccgact 109561 ggcaggtacg tcacgatgcg gtactggaag caggtggcct gcatatcatc ggtaccgagc 109621 gtcacgaatc ccgtcgtatc gataaccagt tgcgcggtcg 
ttctggtcgt cagggggatg 109681 ctggttcttc ccgtttctac ctgtcgatgg aagatgcgct gatgcgtatt tttgcttccg 109741 accgagtatc cggcatgatg cgtaaactgg gtatgaagcc aggcgaagcc attgaacacc 109801 cgtgggtgac taaagcgatt gccaacgccc agcgtaaagt tgaaagccgt aacttcgaca 109861 ttcgtaagca actgctggaa tatgatgacg tggctaacga tcagcgtcgc gccatttact 109921 cccagcgtaa cgaactgttg gatgtcagcg atgtgagcga aaccattaac agcattcgtg 109981 aagatgtgtt caaagcgacc attgatgcct acattccacc acagtcgctg gaagaaatgt 110041 gggatattcc ggggctgcag gaacgtctga agaacgattt cgacctcgat ttgccaattg 110101 ccgagtggct ggataaagaa ccagaactgc atgaagagac gctgcgtgac ggcattctgg 110161 cgcagtccat cgaagtgtat cagcgtaaag aagaagtggt tggtgctgag atgatgcgtc 110221 acttcgagaa aggcgtcatg ctgcaaacgc ttgactccct gtggaaagag cacctggcag 110281 cgatggacta tctgcgtcag ggtatccacc tgcgtggcta cgcacagaaa gatccgaagc 110341 aggaatacaa acgtgaatcg ttctccatgt ttgcagcgat gctggagtcg ttgaaatatg 110401 aagttatcag tacgctgagc aaagttcagg tacgtatgcc tgaagaggtt gaggagctgg 110461 aacaacagcg tcgtatggaa gccgagcgtt tagcgcaaat gcagcagctt agccatcagg 110521 atgacgactc tgcagccgca gctgcactgg cggcgcaaac cggagagcgc aaagtaggac 110581 gtaacgatcc ttgcccgtgc ggttctggta aaaaatacaa gcagtgccat ggccgcctgc 110641 aataaaagct aactgttgaa gtaaaaggcg caggattctg cgcctttttt ataggtttaa 110701 gacaatgaaa aagctgcaaa ttgcggtagg tattattcgc aacgagaaca atgaaatctt 110761 tataacgcgt cgcgcagcag atgcgcacat ggcgaataaa ctggagtttc ccggcggtaa 110821 aattgaaatg ggtgaaacgc cggaacaggc ggtggtgcgt gaacttcagg aagaagtcgg 110881 gattaccccc caacattttt cgctatttga aaaactggaa tatgaattcc cggacaggca 110941 tataacactg tggttttggc tggtcgaacg ctgggaaggg gagccgtggg gtaaagaagg 111001 gcaacccggt gagtggatgt cgctggtcgg tcttaatgcc gatgattttc cgccagccaa 111061 tgaaccggta attgcgaagc ttaaacgtct gtaggtcaga taaggcgttt tcgccgcatc 111121 cgacattcgc acacgatgcc tgatgcgacg ctggcgcgtc ttatcaggcc taaagggatt 111181 tctaactcat tgataaattt gtttttgtag gtcggataag gcgttcacgc cgcatccgac 111241 atttgcacaa gatgcctgat gcgacgctgt ccgcgtctta tcaggcctac gtgcggcatc 111301 
agacaaatgt cactgctttg gttcttcgct ccagtcatcg ctttcggaaa gatcgccact 111361 gctggggatt cgtttttctt cagcagccca ttctccgagg tcgatcag // puzzle.tre100644000766000024 51314006427353 16420 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data[ lh=-2673.059726 ](anid:0.11568,(histo7:0.00692,hcap_NA:0.00698,histo6:0.00352)100:0.06375, ((cpos:0.00001,cimm:0.00001)100:0.04597,uree:0.03765)100:0.05282); [ lh=-2674.559163 ](anid:0.09945,((histo7:0.00607,hcap_NA:0.00607,histo6:0.00607)100 :0.07755,((cpos:0.00001,cimm:0.00001)100:0.03918,uree:0.03919)100:0.04443 ):0.01583); test-1.tab100644000766000024 3414006427353 16136 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataN D S n1 d1 aaaa n2 d2 tttt no_FH.embl100644000766000024 3404014006427353 16247 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID AE000001; FT CDS 202..1557 FT /M1_GAS_orthologue="SPy0002" FT /MGAS10394_orthologue="M6_Spy0001" FT /MGAS315_orthologue="dnaA" FT /MGAS8232_orthologue="dnaA" FT /SSI_1_orthologue="SPs0001" FT /cds_id="subject0" FT /cds_id="subject0" FT /class="1.2.1" FT /colour=2 FT /fasta_file="fasta/SP_new.tab.seq.00001.out" FT /gene="dnaA" FT /gene="dnaH" FT /product="chromosomal replication initiator protein" FT /systematic_id="SpyM50001" FT misc_feature 535..1488 FT /colour=9 FT /domain="HMMPfam:PF00308;Bacterial dnaA FT protein;1.7e-156;codon 112-429" FT /id="SpyM50001" FT /label=HMMPfam FT /note="HMMPfam hit to PF00308, Bacterial dnaA protein, FT score 1.7e-156" FT misc_feature 655..678 FT /colour=8 FT /note="PS00017 ATP/GTP-binding site motif A (P-loop)." FT misc_feature 1429..1488 FT /colour=8 FT /note="PS01008 DnaA protein signature." 
SQ Sequence 1841271 BP; 564546 A; 354645 C; 356610 G; 565470 T; 0 other; ttgttgatat tctgtttttt cttttttagt tttccacata aaaaatagtt gaaaacaata 60 gcggtgtcac cttaaaatgg cttttccaca ggttgtggag aacccaaatt aacagtgtta 120 atttattttc cacaggttgt ggaaaactag aatagtttat ggtagaatag ttctagaatt 180 atccacaaga aggaacctag tatgactgaa aatgaacaaa ttttttggaa cagggtcttg 240 gaattagctc agagtcaatt aaaacaggca acttatgaat tttttgttca tgatgcccgt 300 ctattaaagg tcgataagca tattgcaact atttacttag atcaaatgaa agaactcttt 360 tgggaaaaaa atcttaaaga tgttattctt actgctggtt ttgaagttta taacgctcaa 420 atttctgttg actatgtttt cgaagaagac ctaatgattg agcaaaatca gaccaaaatc 480 aatcaaaaac ctaagcagca agccttaaat tctttgccta ctgttacttc agatttaaac 540 tcgaaatata gttttgaaaa ctttattcaa ggagatgaaa atcgttgggc tgttgctgct 600 tcaatagcag tagctaatac tcctggaact acctataatc ctttgtttat ttggggtggc 660 cctgggcttg gaaaaaccca tttattaaat gctattggta attctgtact attagaaaat 720 ccaaatgctc gaattaaata tatcacagct gaaaacttta ttaatgagtt tgttatccat 780 attcgccttg ataccatgga tgaattgaaa gaaaaatttc gtaatttaga tttactcctt 840 attgatgata tccaatcttt agctaaaaaa acgctctctg gaacacaaga agagttcttt 900 aatactttta atgcacttca taataataac aaacaaattg tcctaacaag cgaccgtaca 960 ccagatcatc tcaatgattt agaagatcga ttagttactc gttttaaatg gggattaaca 1020 gtcaatatca cacctcctga ttttgaaaca cgagtggcta ttttgacaaa taaaattcaa 1080 gaatataact ttatttttcc tcaagatacc attgagtatt tggctggtca atttgattct 1140 aatgtcagag atttagaagg tgccttaaaa gatattagtc tggttgctaa tttcaaacaa 1200 attgacacga ttactgttga cattgctgcc gaagctattc gcgccagaaa gcaagatgga 1260 cctaaaatga cagttattcc catcgaagaa attcaagcgc aagttggaaa attttacggt 1320 gttaccgtca aagaaattaa agctactaaa cgaacacaaa atattgtttt agcaagacaa 1380 gtagctatgt ttttagcacg tgaaatgaca gataacagtc ttcctaaaat tggaaaagaa 1440 tttggtggca gagaccattc aacagtactc catgcctata ataaaatcaa aaacatgatc 1500 agccaggacg aaagccttag gatcgaaatt gaaaccataa aaaacaaaat taaataacat 1560 gtggaaaaga atatctttta tgaaatagtt atccacaagt tgtgaacaac catttagtct 1620 tggattctct cgtttattta 
gagttatcca ctatatacac aagacctact actactactt 1680 attattatac ttattaaata aaggagttct catgattcaa ttttcaatta atcgcacatt 1740 atttattcat gctttaaatg caactaaacg tgctattagc actaaaaatg ccattcctat 1800 tctttcatca ataaaaattg aagtcactcc tacaggagta actttaacag ggtctaacgg 1860 tcaaatatca attgaaaaca ctattcctgt aagtaatgaa aatgctggtt tgctaattac 1920 ctctccagga gctattttat tagaagctag tttttttatt aatattattt caagtttgcc 1980 agatattagt ataaatgtta aagaaattga acaacaccaa gttgttttaa ccagtggtaa 2040 atcagagatt accttaaaag gaaaagatgt tgaccagtat cctcgtctac aagaagtatc 2100 aacagaaaat cctttgattt taaaaacaaa attattgaag tctattattg ctgaaacagc 2160 ttttgcagcc agtttacaag aaagtcgtcc tattttaaca ggagttcata ttgtattaag 2220 taatcataaa gattttaaag cagtagcgac tgactctcat cgtatgagcc aacgtttaat 2280 cactttggac aatacttcag cagattttga tgtggttatt ccaagtaaat ctttgagaga 2340 attttcagca gtatttacag atgatattga gaccgttgag gtatttttct caccaagcca 2400 aatcttgttc agaagtgaac acatttcttt ttatacacgc ctcttagaag gaaattatcc 2460 cgatacagac cgtttattaa tgacagaatt tgagacggag gttgttttca atacccaatc 2520 ccttcgccac gctatggaac gtgccttctt gatttctaat gctactcaaa atggtactgt 2580 taagcttgag attactcaaa atcatatttc agctcatgtt aactcacctg aggttggtaa 2640 ggtaaacgag gatttagata ttgttagtca gtctggtagt gatttaacta tcagcttcaa 2700 tccaacttac cttattgagt ctttaaaagc tattaaaagt gaaacagtaa aaattcattt 2760 cttatcacca gttcgaccat tcaccctaac accaggcgat gaggaagaaa gttttatcca 2820 attaattaca ccagtacgaa caaactaagt aagaaaaagc tcccttttag gagttttttt 2880 gttattataa atattaacga taatataagt ggagaaaagt gaatgtatca aattggatca 2940 tttgttgaaa tgaaaaaacc tcatgcttgt gtaattaaag aaactggtaa aaaggctaat 3000 caatggaagg tactgagagt aggagctgat attaaaattc agtgtactaa ctgtcagcat 3060 gtgattatga tgagtcgtta tgattttgag cgaaaattaa aaaaagtact gcaaccatga 3120 aagtcctttt aaataaataa ttctagctat atttgcaacc aatactttcc taaaaaattg 3180 ttagtatgcc gttggaaaat tagctattct aacgttatcg aaagaagaaa ggtggctatt 3240 gacaatgacg aaagttgtag cacagttact tagtttctcg gcaaactata tcaaaatgag 3300 aataatacac caaactttga taatttaaat 
gaaggtgaaa gagtggcata atgataagca 3360 aatctgaaat tttcttctta ttcataccgc tttttcttct attttttggt ataataatct 3420 tgattgaaat ttgaatggag attcgctaaa tggctttaac agcaggtatt gtaggcttac 3480 caaatgttgg taaatcaacc ttatttaacg caattacaaa agcaggagca gaagctgcta 3540 attatccttt tgcgactatt gatcccaatg ttggtatggt agaggttcca gatgagcgtc 3600 tgcaaaaatt gacagaattg attacaccta aaaaaacagt tccgacaacc tttgaattta 3660 ctgatattgc aggtatcgtt aaaggtgctt ctagagggga aggtctagga aataaatttt 3720 tagctaatat tcgtgaagtc gatgctattg ttcatgtggt acgtgctttt gatgatgaaa 3780 atgtcatgcg tgaacaaggt cgtgaggatg ctttcgttga tcctatagca gatattgaca 3840 ctattaatct tgaattaatc ttagctgatt tagagtcaat caataaacgt tatgcgcgtg 3900 ttgaaaaaat ggcacgaact caaaaagata aagaatcagt agcagagttc aatgttttac 3960 aaaagattaa acctgttttg gaagatggga aatcagctag gacaattgag tttacagaag 4020 aggaagcaaa agttgttaaa ggtctctttt tattaacaac taaacctgtt ttatatgtag 4080 ctaatgtcga tgaagataaa gttgctaatc cagatggtat tgattatgtc aaacaaattc 4140 gtgactttgc agctactgaa aatgctgaag tagttgttat ctcagcgcgt gcagaagaag 4200 aaatttcaga gcttgacgat gaggataaag aagaattttt ggaagctatc ggtcttactg 4260 aatcaggcgt tgataaatta accagagcag cttatcatct cttaggcctt ggaacctatt 4320 ttacagcagg tgaaaaagag gttcgtgctt ggacgtttaa gcgtggtata aaagctccac 4380 aagctgctgg tattatccat tcagattttg aaagaggttt tattcgtgca gtaaccatgt 4440 cttatgatga tctaatgatc tacggttcag aaaaagccgt caaagaagct ggacgcttgc 4500 gtgaagaagg aaaagaatac gttgttcaag atggggacat catggaattc agatttaatg 4560 tgtaattata ttaaaacaat atcagaaggt tggaagaaca ttccagccct tttggcattt 4620 tagaaagaga aaatatggta aaaatgattg ttggtctggg aaatccaggc tctaaatatg 4680 aaaaaacaaa gcacaatatt ggttttatgg ctattgacaa tattgtcaag aaccttgacg 4740 ttacctttac agatgataaa aattttaaag cacagatagg aagtactttt attaatcatg 4800 aaaaagttta ctttgtgaaa cctactactt ttatgaataa tagcggcata gcagtaaaag 4860 cattactaac ctactataat attgacataa cagatttaat tgttatctat gatgatttag 4920 acatggaagt cagtaaatta cgtttacgta gtaagggttc agcaggagga cataatggca 4980 ttaagtcaat cattgcccac attggaactc aggaatttaa 
ccgaatcaaa gttggtattg 5040 gacgaccttt aaaaggtatg actgttatta gccatgtgat gggccaattc aataccgaag 5100 ataatattgc tatttcgtta actcttgaca gagttgtcaa tgctgtcaag ttttatttac 5160 aagaaaatga ttttgaaaaa acaatgcaga aatttaatgg ataatcatgg atattttaga 5220 attatttagt cagaataaga aagtccaatc ctggcactct ggattaacca ccttaggaag 5280 acaactggta atggggttat cgggttcaag taaagcattg gctatagctt ccgcttattt 5340 agatgatcaa aaaaaaatag ttgtggttac atcaactcaa aatgaggttg aaaaattagc 5400 cagcgattta tctagtttac ttgatgaaga acttgttttc caattttttg cagacgatgt 5460 ggctgcagcg gaatttatct ttgcgtcaat ggataaagct ctatcaagaa tagaaaccct 5520 gcaattttta aggaatccta aatctcaggg cgttttaatt gttagtttat caagcttaag 5580 aactttattg ccaaacccag atgtttttac aaagagtcag attcaactaa cagttggaga 5640 agattatgat agtgatactc ttactaaaca actgatgaca attggctatc agaaggtctc 5700 acaggtcatt agtccgggag aatttagccg tcgaggggat attttagata tctatgagat 5760 tacacaagaa ttgccttatc gattggaatt ttttggcgat gatattgata gtattaggca 5820 attttatcca gaaactcaaa aatcttttga acaactagaa ggtattttta ttaatccagc 5880 aagtgatctt atttttgagg ctagtgattt tcaacgtggc attgagcaat tagaaaaggc 5940 tctacaaaca gcacaagatg ataaaaaatc ttatttagaa gatgtattag ctgtttcaaa 6000 aaacggtttt aaacataagg atatccgtaa atttcaatca ttattttacg aaaaagagtg 6060 gtcattatta gattatattc ctaagggaac gccaatcttt tttgatgatt ttcaaaaact 6120 agttgataaa aatgcaagat ttgatttaga gattgctaat ctcttgacag aagatttaca 6180 gcaaggaaag gctctttcca atcttaacta ctttgcagat aattatcgag agcttaggca 6240 ctataagcca gcgaccttct tttcaaattt tcataaggga cttggaaata tcaaatttga 6300 tcagatgcat cagctaactc agtatgccat gcaggaattt tttaatcaat ttcctttgtt 6360 gattgatgag attaaacggt atcaaaaaaa tcaaacaacg gttattgtac aggtagagtc 6420 tcagtatgct tacgaacgac ttgaaaaatc ttttcaagat taccaatttc gccttccttt 6480 agtgagtgct aatcaaattg tttcacgtga atcacaaatt gtaattggag ctatctccag 6540 tggtttttat tttgctgatg aaaagttagc gttaatcaca gagcatgaaa tttatcataa 6600 aaagatcaaa cgacgcgcta gacgatctaa tattagcaat gctgagcgtt tgaaagatta 6660 caatgagtta gcagtaggtg attacgtggt tcataatgtc catggtatcg 
gtcgctttct 6720 tggaattgaa acgattcagg ttcagggaat ccatcgagat tacgttacta ttcaatatca 6780 aaattcagac cgtatttctc ttccaattga ccaaattggt agcttatcaa agtacgtttc 6840 tgctgatgga aaagaaccta aaattaataa actcaatgat ggtcgttttc aaaagacaaa 6900 gcaaaaggtc gctagacaag tagaagatat tgctgatgac cttctaaaat tatatgctga 6960 aagaagtcag caaaaaggat tttcattttc accagatgat gacttgcagc gcgcttttga 7020 tgatgatttt gcttttgtag aaacagaaga tcaacttagg tctataaagg aaattaaagc 7080 tgatatggag agcatgcaac ctatggatcg tcttttagta ggcgatgtag gatttggtaa 7140 gacagaagta gctatgaggg cagcctttaa agcggtgaat gatcacaaac aagtagctgt 7200 cttagttcca accacagtct tggcccagca gcattatgaa aacttcaaag cacgctttga 7260 aaattacctt gttgaggttg acgtcttaag tcgtttccgt agtaaaaaag agcaagctga 7320 aacactagaa catgtacaaa aaggtcaaat tgacattatt atcggaaccc atcgactcct 7380 atcaaaagat gtggtctttt ctgatttagg attaattgtg attgatgagg aacaacggtt 7440 tggtgttaag cataaagaaa ctttaaagga attaaaaact aaggttgacg tcttaacctt 7500 aacagctact ccgattccta gaaccttaca catgtctatg ttaggtatcc gagatttatc 7560 ggttattgag accccaccaa ccaatcgtta tcctgttcaa acctatgttt tggaaaataa 7620 tccaggtctc gttagagaag ctatcattcg tgaaatggat cgtggaggac aaatttttta 7680 cgtttacaat aaagttgaca ctattgaaaa gaaagttgca gagctacaag aattagtccc 7740 agaagcttct atcggttttg ttcatgggca aatgagtgaa attcaacttg aaaatacctt 7800 gattgacttt ataaatggtg attatgatgt ccttgtggct acaacaatca tcgaaacagg 7860 agttgacatt tctaatgtaa acactttgtt tattgagaat gctgatcata tgggattgtc 7920 aactttatat caactgaggg ggcgcgtcgg aagaagtaat cgtattgcct atgcttacct 7980 gatgtatcgt cctgataaga tcctaacaga agtctctgaa aaacgtttag aggctattaa 8040 aggctttact gaattaggtt caggcttcaa gattgctatg cgagatttgt ctatacgagg 8100 agcaggtaat attttaggag cttctcaaag tggctttatt gattcggtcg gttttgaaat 8160 gtattctcag ttattggaac aggctattgc tagcaagcaa ggaaaaacga ctgttcgcca 8220 aaaaggtaat actgaaatca atcttcagat tgatgcttat ttaccagatg attatattgc 8280 agatgagcgc caaaaaattg acatttacaa gcgtattcga gaaattcaat caagagaaga 8340 ttatctcaat ttgcaagatg agctgataga tcgttttgga gagtatcctg atcaagttgc 
8400 ctatttgtta gagatagctc tgctaaaaca ttatatggac aatgcctttg cggaattagt 8460 tgaacgcaaa aataatcagg tcattgttcg atttgaagta acttctttaa gttatttctt 8520 gactcaggat tatttcgaag ccttatctaa aacccatctt aaagctaaaa ttagtgaaca 8580 tcaggggaaa atcgatatcg tctttgacgt tcgccatcaa aaagattata gaattttaga 8640 agaattgatg ttatttggag aaaggcttag tgagataaaa atcagaaaaa acaattcagt 8700 ttttaaataa taatttgaaa caacttttag actgaatagt attaatgaca atggaacgaa 8760 ctagctcttt atacttttgt caaaaatcta agatctttgc tgcgaaatta ggatcttttc 8820 taccatttaa aaaataaaaa tgataaaatg aagaggattg taggagaaat tatgagacta 8880 gataaatatc taaaggtatc gcgccttatt aaacgtcgtt cagtagcaaa agaagttgcg 8940 gataagggac gaattaaagt taatgggata cttgctaaaa gttcaacgaa tgtaaaacta 9000 aatgatcaca ttgaaattag ttttggaaat aaattactga cagtaagagt cattgaaata 9060 aaagatagta caaaaaaaga agatgctctt aagatgtatg agataatcag tgaaacaagg 9120 ataacattaa atgaagaagc ctagtattgt tcaattaaat aatcattata ttaagaaaga 9180 gaatctcaaa aaaaaatttg aagaagaaga atctcaaaaa agaaatcgtt ttatgggatg 9240 gatccttgta agtatgatgt ttttatttat tttgccaact tataatcttg tcaaaagtta 9300 tgttgatttt gaaaagcaaa atcaacaggt ggttaaatta aaaaaagagt ataatgaatt 9360 gtcaaagagt acaaaaaaag aaaaacaatt agcagaacga ctaaaagatg ataattttgt 9420 caaaaaatat gctagggcaa aatactattt atcgcgtgaa ggagaaatga tttatcctat 9480 tccaggacta ttaccaaaat gatgatggac aatattataa aaaaaataga agcatttctt 9540 gctttttctg ataaaaaatt agcagagctg caacaggaaa atcaaaaagt taaagaagaa 9600 roa1.gbxml100644000766000024 2157414006427353 16322 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data dbEST 1888424 AI129902 1 3598416 3 2 qc41b07.x1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:1712149 3' similar to SW:ROA1_SCHAM P21522 HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A1, A2/B1 HOMOLOG. ;contains MSR1.b2 MSR1 repetitive element ;. 1998 9 10 1998 10 27 Contact: Robert Strausberg, Ph.D.~Email: cgapbs-r@mail.nih.gov~This clone is available royalty-free through LLNL ; contact the IMAGE Consortium (info@image.llnl.gov) for further information. 
~Trace considered overall poor quality~Insert Length: 525 Std Error: 0.00~Seq primer: -40m13 fwd. ET from Amersham~High quality sequence stop: 1 Homo sapiens human taxon 9606 Homo sapiens 255 Organ: uterus; Vector: pT7T3D-PacI; Site_1: Not I; Site_2: Eco RI; 1st strand cDNA was primed with a Not I - oligo(dT) primer [5' AACTGGAAGAATTCGCGGCCGCCTTTTTTTTTTTTTTTTTT 3'], double-stranded cDNA was ligated to Eco RI adaptors (Pharmacia), digested with Not I and cloned into the Not I and Eco RI sites of the modified pT7T3 vector. Library went through one round of normalization. Library constructed by M. Fatima Bonaldo. Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo 1 2 PRI 3 IMAGE:1712149 11 Soares_pregnant_uterus_NbHPU 7 female 12 adult 16 DH10B Unpublished NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap 1997 National Cancer Institute, Cancer Genome Anatomy Project (CGAP), Tumor Gene Index 37 CTCCGCGCCAACTCCCCCCACCCCCCCCCCACACCCC bl2seq.out100644000766000024 425314006427353 16321 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>ALEU_HORVU Length = 362 Score = 191 bits (481), Expect = 2e-53 Identities = 111/322 (34%), Positives = 167/322 (51%), Gaps = 27/322 (8%) Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 +F F ++ K Y S E RF IF +L ++ N + ++ G+N+F+D+S + Sbjct: 60 RFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYR----LGINRFSDMSWE 115 Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFS 146 EF+ L A T +A ++P DWR G V+PVKNQ CGSCW+FS Sbjct: 116 EFQATRLG---AAQTCSATLAGNHLMRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFS 172 Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206 TTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI NG Sbjct: 173 TTGALEAAYTQATGKNISLSEQQLVDCAGGFNNF--------GCNGGLPSQAFEYIKYNG 224 Query: 207 GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAADAV 265 GI TE SYPY G C++ + N ++ + I N E + + P+++A + Sbjct: 225 
GIDTEESYPYKGVNGV-CHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVI 283 Query: 266 E-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 321 + ++ Y GV+ P+ ++H +L VGY +N +PYW++KNSWGADWG+ G Sbjct: 284 DGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVEN-----GVPYWLIKNSWGADWGDNG 338 Query: 322 YIYLRRGKNTCGVSNFVSTSII 343 Y + GKN C ++ S ++ Sbjct: 339 YFKMEMGKNMCAIATCASYPVV 360 Lambda K H 0.316 0.135 0.414 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 368 Number of Sequences: 0 Number of extensions: 19 Number of successful extensions: 6 Number of sequences better than 10.0: 1 Number of HSP's better than 10.0 without gapping: 1 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 1 length of query: 343 length of database: 362 effective HSP length: 23 effective length of query: 320 effective length of database: 339 effective search space: 108480 effective search space used: 108480 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 24 (13.8 bits) S2: 24 (13.8 bits) bug2982.gb100755000766000024 650214006427353 16016 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS NT_019265 1250660 bp DNA CON 16-OCT-2001 DEFINITION Homo sapiens chromosome 1 working draft sequence segment. ACCESSION NT_019265 VERSION NT_019265.6 GI:16156830 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1250660) AUTHORS NCBI Annotation Project. TITLE Direct Submission JOURNAL Submitted (11-OCT-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA COMMENT GENOME ANNOTATION REFSEQ: NCBI contigs are derived from assembled genomic sequence data. They may include both draft and finished sequence. On Oct 16, 2001 this sequence version replaced gi:15294341. 
COMPLETENESS: not full length. FEATURES Location/Qualifiers source 1..1250660 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" source 1..3290 /note="Accession AL391218 sequenced by The Sanger Centre" /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RP11-13G5" misc_feature 215902..365470 /standard_name="RP11-242F24" /note="FISH-mapped clone" variation 217508 /allele="T" /allele="C" /db_xref="dbSNP:811400" mRNA join(342430..342515,363171..363300,365741..365814, 376398..376499,390169..390297,391257..391379, 392606..392679,398230..398419,399082..399167, 399534..399650,405844..405913,406704..406761, 406868..407010,407962..408091,408508..409092) /gene="FLJ10737" /product="hypothetical protein FLJ10737" /transcript_id="XM_057697.1" /db_xref="LocusID:55735" CONTIG join(AL391218.9:105173..108462,gap(100), complement(AL512330.12:1..182490), complement(AL590128.4:9034..81287),gap(100), AL591163.7:85799..94832,gap(100),AL591163.7:94933..113245,gap(100), AL591163.7:42173..44897,complement(AL590128.4:1..6208), AL591163.7:51307..52779,gap(100),AL591163.7:52880..85698,gap(100), AL591163.7:113346..126143,complement(AL159177.12:184729..186047), AL031447.4:1..112158,complement(AL159177.12:1..72671), complement(AL591866.12:23507..86371),AL031848.11:1..142965, AL031847.17:1..166418,AL035406.25:1..161651, complement(AL356261.20:94599..98345), complement(AC026968.3:54432..54579),gap(100), complement(AC062024.2:98529..107911),gap(100), AC062024.2:7713..11594,gap(100),complement(AL356261.20:1..94498), complement(AL356693.18:19988..70853),gap(100), AL356693.18:17351..19887,gap(100), complement(AL356693.18:3037..17250),gap(100), complement(AL356693.18:1..2936),gap(100),AC026968.3:675..2393, gap(100),AC026968.3:1..574,gap(100),AL356261.20:179029..182233) // spaces.nex100644000766000024 72514006427353 16352 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data#NEXUS BEGIN TREES; TRANSLATE 1 'Allium drummondii', 2 'Allium cernuum', 3 A.cyaneum ; TREE onion = [&R] ((1,2),3); 
END; #NEXUS BEGIN TREES; TRANSLATE 1 'Allium drummondii, USA', 2 'Allium drummondii, Russia', 3 A.cyaneum ; TREE onion = [&R] ((1,2),3); END; #NEXUS BEGIN TREES; TRANSLATE 1 'Allium drummondii, Russia', 2 'Allium drummondii, USA', 3 A.cyaneum ; TREE onion = [&R] ((1,2),3); END; prints.out100644000766000024 6507614006427353 16502 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataSn; BIOPERL_TEST_SEQUENCE_ID Si; Fasta sequence 1TBS 1TBH DNAGYRASEB 2.035573e-70 1TBH TPI2FAMILY 3.830756e-70 1TBF 2TBS 2TBT FingerPrint No.Motifs SumId AveId ProfScore Ppvalue Evalue GraphScan 2TBH DNAGYRASEB 10 of 11 5.3e+02 53 4363 2.5e-75 2e-70 IIII.IIIIII 2TBH TPI2FAMILY 9 of 9 539.06 59.90 4433 4.8e-75 3.8e-70 IIIIIIIII 2TBN BCTRLSENSOR 2 of 4 64.27 32.14 391 2.4e-06 0.19 I.i. 2TBN TROPOMYOSIN 2 of 5 58.53 29.26 483 4.4e-05 3.5 I.i.. 2TBN MEROZOITESA 2 of 15 52.70 26.35 488 8e-05 6.4 .i...........i. 2TBN FADG3PDH 2 of 6 72.65 36.32 400 0.00012 9.8 I....I 2TBN PROXISOMPAGR 2 of 6 97.06 48.53 481 0.00046 37 .I..I. 2TBN CYANASE 2 of 5 60.96 30.48 322 0.00068 54 i...I 2TBN ALARACEMASE 2 of 9 49.23 24.61 344 0.00068 54 ..I...i.. 2TBN FMOXYGENASE 2 of 13 44.28 22.14 327 0.00073 58 i........i... 
2TBF 3TBS 3TBT MotifName No.Mots IdScore PfScore Pvalue Sequence Len low pos high 3TBH DNAGYRASEB 1 of 11 56.97 334 4.14e-05 YSAGQIKILEG 11 4 8 33 3TBH DNAGYRASEB 2 of 11 48.33 419 1.44e-07 DFQFDILSARFRELAF 16 178 182 207 3TBH DNAGYRASEB 3 of 11 48.81 352 4.94e-06 FLNKGLVLIVEDRR 14 193 197 222 3TBH DNAGYRASEB 4 of 11 40.58 487 2.12e-10 GGIVSFVEHINENKHPMHKVIHF 23 217 227 277 3TBH DNAGYRASEB 6 of 11 65.69 637 1.20e-11 GEDVKEGLTAVISIKIP 17 310 319 369 3TBH DNAGYRASEB 7 of 11 45.33 358 7.99e-07 FFEENPNITKKILEK 15 359 367 417 3TBH DNAGYRASEB 8 of 11 60.16 604 3.60e-13 KCILSAKAREAARKARDLTRR 21 373 381 431 3TBH DNAGYRASEB 9 of 11 60.33 332 1.39e-06 LVSALGTGIG 10 459 477 911 3TBH DNAGYRASEB 10 of 11 44.62 312 2.07e-05 PSNRVVLKVKLDD 13 588 594 1029 3TBH DNAGYRASEB 11 of 11 54.51 528 4.10e-09 AEETFNILMGDEVQPRK 17 604 610 1045 3TBB 3TBH TPI2FAMILY 1 of 9 56.63 501 7.92e-09 GLHKMVYEVVDNSVDE 16 31 37 172 3TBH TPI2FAMILY 2 of 9 62.28 483 2.55e-08 EVKDNGRGIPVDIH 14 66 72 209 3TBH TPI2FAMILY 3 of 9 73.17 547 2.56e-10 GGLHGVGVSVVNALS 15 109 115 252 3TBH TPI2FAMILY 4 of 9 50.35 371 8.26e-06 CFTNNINNNLGGTH 14 251 274 414 3TBH TPI2FAMILY 5 of 9 70.89 498 6.45e-10 LYLVEGDSAGGSAKQ 15 399 427 561 3TBH TPI2FAMILY 6 of 9 66.21 629 3.61e-10 KIRYHKIMIMTDADIDG 17 461 494 928 3TBH TPI2FAMILY 7 of 9 52.17 503 4.41e-10 HIRTLLLTFFFRYMRPVI 18 479 512 946 3TBH TPI2FAMILY 8 of 9 43.15 321 1.38e-06 GYLYVAQPPLYLI 13 499 532 966 3TBH TPI2FAMILY 9 of 9 64.20 580 7.91e-11 IQRYKGLGEMNPEQLWE 17 539 573 1008 3TBB 3TBN BCTRLSENSOR 1 of 4 40.72 272 2.48e-05 NIIEVKDNGRGIPVD 15 219 69 1911 3TBN BCTRLSENSOR 3 of 4 23.55 119 9.78e-02 GLHGVGVSVVNALSEYLEV 19 252 116 1946 3TBB 3TBN TROPOMYOSIN 1 of 5 35.76 286 9.84e-04 ELAFLNKGLVLIVEDRRR 18 9 194 357 3TBN TROPOMYOSIN 3 of 5 22.77 197 4.43e-02 ILEKCILSAKAREAARKARDLTRRKTVLE 29 70 378 560 3TBB 3TBN MEROZOITESA 2 of 15 28.21 293 1.55e-03 NGRGIPVDIHPDKKISTIEV 20 68 76 123 3TBN MEROZOITESA 14 of 15 24.49 195 5.17e-02 LIKHGKNSTYAYSDKEKEELL 21 510 543 576 3TBB 3TBN FADG3PDH 
1 of 6 32.91 209 6.51e-03 NDAYKVSGGLHGV 13 4 108 75 3TBN FADG3PDH 6 of 6 39.74 191 1.88e-02 LYLIKHGKNSTYA 13 347 541 454 3TBB 3TBN PROXISOMPAGR 2 of 6 50.00 241 1.57e-02 PDATIFTTVDFQFD 14 30 173 60 3TBN PROXISOMPAGR 5 of 6 47.06 240 2.93e-02 EDVKEGLTAVISIKIPQ 17 185 320 215 3TBB 3TBN CYANASE 1 of 5 25.89 131 4.96e-02 KHPTGLSGEDVKEGLT 16 13 312 77 3TBN CYANASE 5 of 5 35.06 191 1.37e-02 QAILPLKGKIL 11 131 448 297 3TBB 3TBN ALARACEMASE 3 of 9 32.91 191 1.08e-02 AYKVSGGLHGVGV 13 120 110 155 3TBN ALARACEMASE 7 of 9 16.32 153 6.29e-02 LAEISIQYSETYTENI 16 265 257 307 3TBB 3TBN FMOXYGENASE 1 of 13 22.06 151 7.01e-02 RKTVLEGGGLPGKLADC 17 2 401 7 3TBN FMOXYGENASE 10 of 13 22.22 176 1.04e-02 RVVLKVKLDDFVEAEETF 18 225 597 296 3TBB 3TBF 4TBT -------------------------------------------------------------------------- 4TBN DNAGYRASEB 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL YDASsIqVLEG 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL eFdFetLakRLRELAFFLNkGltItLtDeR 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL GGIvsFVeYLNrnKtplhdepiY 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL GeDiREGLTAiISVKvP fLe 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL ENPqeAKkIveKKiidAArAReAARkARElTRR 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL LITALGtGIG 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL PevRtLlqVTleD ADeiFstLMGDdVEPRR 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN TPI2FAMILY 4TBS 
MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL GLHHlvdEIvDNAaDE sVe 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL DNGRGIPVdiH GGLHGVGASVVNALS 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL SFVNnIaTteGGTH 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL LiLVEGDSAGGSAKs 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL kLRYgKIiIMTDADvDG HIktLLL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL TFFyRymPpLi GfvyiAqPPLYKv IQRYKGLGEMnakqLwE 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN BCTRLSENSOR 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL vlieVs 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL DtGiGIppe gGTGLGLaIVkriveahGG 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS 
-------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN TROPOMYOSIN 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL EVASLNRRIQLVEEELDR 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL EiQLKEAKHIAEEADRKYEEVARKLVIiE 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN MEROZOITESA 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL HGSGIRVDLGEDAEVaNtqY 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL YDKMDQAddYGKStSRkDEML 4TBL 4TBS 
DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN FADG3PDH 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL fDVlVIGGGaTGa 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL LITIAGGKlTTYR 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN PROXISOMPAGR 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL FDIKPFTTVDFSSI 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL QAEKEKLLAEISSDIDQ 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS 
DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN CYANASE 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL KaakGlTFadLadelG 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL RVVVTlDGKYL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN ALARACEMASE 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL hLKvDTGMnRLGv 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL iGvVaiGYADGypRal 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS 
DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN FMOXYGENASE 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL KRVAiIGAGVSGLASIK 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL kVkiKpsVKEfTETSAIF 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN CD44 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL DLNITCRYAGVFHVE 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL KNGRY 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS 
DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL DQFMTADETRNLQNVDMKIG 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN JANUSKINASE3 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL aETFh 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL VGLPGA 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL GLhVDGvAlnLTS 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL 4TBL 4TBS DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL 4TBL 4TBS -------------------------------------------------------------------------- 4TBT -------------------------------------------------------------------------- 4TBN KUPTAKETRKA 4TBS MSQKETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSVDEAMAGHCTEIRISILPNNIIEVK 4TBL 4TBL 4TBS DNGRGIPVDIHPDKKISTIEVVMTILHAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKIYTQKYE 4TBL 4TBL 4TBS KGIPVSPVEIKGDSSERGTIVRFKPDATIFTTVDFQFDILSARFRELAFLNKGLVLIVEDRRRGAEGENLLRNE 4TBL 4TBL 4TBS FQFSGGIVSFVEHINENKHPMHKVIHFERNKDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALT 4TBL 4TBL 4TBS RTLNDFLKKDTTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEIKGIMQTLSSEGLTLFFEE 4TBL 4TBL 4TBS NPNITKKILEKCILSAKAREAARKARDLTRRKTVLEGGGLPGKLADCSEKDPAFSELYLVEGDSAGGSAKQGRD 4TBL KIIILGAGQVGgTLA 4TBL 4TBS RNTQAILPLKGKILNVEKARLDKILSSEEIRILVSALGTGIGEDEFNINKIRYHKIMIMTDADIDGSHIRTLLL 4TBL 4TBL 4TBS TFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTYAYSDKEKEELLKNVGTEKVVIQRYKGLGEMNPEQLWETTM 4TBL TaS 4TBL 4TBS 
DPSNRVVLKVKLDDFVEAEETFNILMGDEVQPRKQFIEINAAKVANLDL 4TBL DEtNMvACQvAy 4TBL 4TBS -------------------------------------------------------------------------- multi_2.fa100644000766000024 161714006427353 16264 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>gi|239758|bbs|68379 glucocorticoid receptor, GR [human, Peptide Partial, 394 aa] MDSKESLTPGREENPSSVLAQERGDVMDFYKTLRGGATVKVSASSPSLAVASQSDSKQRRLLVDFPKGSV >gi|239752|bbs|68871 PML-3=putative zinc finger protein [human, Peptide, 802 aa] MPPPETPSEGRQPSPSPSPTERAPASEEEFQFLRCQQCQAEAKCPKLLPCLHTLCSGCLEASGMQCPICQ >gi|238775|bbs|65126 putative tyrosine kinase receptor=UFO [human, NIH3T3, Peptide, 894 aa] MAWRCPRMGRVPLAWCLALCGWACMAPRGTQAEESPFVGNPGNITGARGLTGTLRCQLQVQGEPPEVHWL >gi|239006|bbs|65162 alpha(1,3)-fucosyltransferase, ELFT [human, Peptide, 400 aa] MGAPWGSPTAAAGGRRGWRRGRGLPWTVCVLAAAGLTCTALITYACWGQLPPLPWASPTPSRPVGVLLWW >gi|237597|bbs|60089 putative adhesion molecule=ADMLX [human, Peptide, 679 aa] MVPGVPGAVLTLCLWLAASSGSWRPAPARLCAAAGRVAVCRERPARSCASRCLSLQITRISAFFQHFQNN >gi|237995|bbs|62046 NK-1 receptor [human, lung, Peptide, 407 aa] MDNVLPVDSDLSPNISTNTSEPNQFVQPAWEIVLWAAAYTVIVVTSVVGNVVVMWIILAHKRMRTVTNYF D12555.gbk100644000766000024 357614006427353 15661 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS MUSBC05 105 bp DNA linear ROD 12-APR-2002 DEFINITION Mus spretus gene for beta-casein, 3'UTR. ACCESSION D12555 VERSION D12555.1 GI:303649 KEYWORDS . SOURCE Mus spretus (western wild mouse) ORGANISM Mus spretus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 105) AUTHORS Takahashi,N. and Ko,M.S. TITLE The short 3'-end region of complementary DNAs as PCR-based polymorphic markers for an expression map of the mouse genome JOURNAL Genomics 16 (1), 161-168 (1993) MEDLINE 93252372 PUBMED 8486351 COMMENT Submitted (06-JUL-1993) to DDBJ by: Minoru S.H. 
Ko ERATO Research Development Corporation of Japan (JRDC) 5-9-6 Tohkohdai, Tsukuba Ibaraki 300-26 Japan Phone: 0298-47-5531 Fax: 0298-47-5421. FEATURES Location/Qualifiers source 1..105 /organism="Mus spretus" /mol_type="genomic DNA" /db_xref="taxon:10096" 3'UTR <1..>105 /note="beta-casein genomic DNA sequence corresponding to a part of the 3'UTR of beta-caseine gene, MMBCASE (bases 7015 - 7121)" variation 69^70 /note="A in MMBCASE" /replace="a" variation 103^102 /note="T in MMBCASE deleted in CAST/Ei (M. m. castaneus)" /replace="t" BASE COUNT 37 a 7 c 17 g 44 t ORIGIN 1 agttatatta caggaatttt ataagtgttc aatatggagt tgaaaatgca agtcaataat 61 gtatacaaat agtttgtgaa aaattggatt ttctattttt ttctt // bug2901.fa100644000766000024 35714006427353 15762 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>seq1 GGTACCAGCAGGTGGGCCGCCTACTGCGCACGCGCGGGTTTGCGGGCAGCCGC >seq2 GGTACCAGCTGGTGGGCCGCCTACTGCGCACGCGCGGGTTTGCGGGCAGCCGC >seq3 GGTACCAGCAGGTGGTCCGCCTA------------------------------ >seq4 --------------------------CGCACGCGCGTGTTTGCGGGCAGCCGCtest 2.txt100644000766000024 107514006427353 16241 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>Test AGCTTTTCATTCTGACTGCAACGGGCAATATGTCTCTGTGTGGATTAAAAAAAGAGTGTCTGATAGCAGC TTCTGAACTGGTTACCTGCCGTGAGTAAATTAAAATTTTATTGACTTAGGTCACTAAATACTTTAACCAA TATAGGCATAGCGCACAGACAGATAAAAATTACAGAGTACACAACATCCATGAAACGCATTAGCACCACC ATTACCACCACCATCACCATTACCACAGGTAACGGTGCGGGCTGACGCGTACAGGAAACACAGAAAAAAG CCCGCACCTGACAGTGCGGGCTTTTTTTTTCGACCAAAGGTAACGAGGTAACAACCATGCGAGTGTTGAA GTTCGGCGGTACATCAGTGGCAAATGCAGAACGTTTTCTGCGTGTTGCCGATATTCTGGAAAGCAATGCC AGGCAGGGGCAGGTGGCCACCGTCCTCTCTGCCCCCGCCAAAATCACCAACCACCTGGTGGCGATGATTG AAAAAACCATTAGCGGCCAGGATGCTTTACCCAATATCAGCGATGCCGAACGTATTTTTGCCGAACTTTTtest.fasta100644000766000024 144414006427353 16376 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>roa1_drome Rea guano receptor type III >> 0.1 MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVV VMKDPRTKRSRGFGFITYSHSSMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVK 
KLFVGALKDDHDEQSIRDYFQHFGNIVDNIVIDKETGKKRGFAFVEFDDYDPVDKVVLQK QHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGN NWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGND FGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY >roa2_drome Rea guano ligand MVNSNQNQNGNSNGHDDDFPQDSITEPEHMRKLFIGGLDYRTTDENLKAHEKWGNIVDVV VMKDPTSTSTSTSTSTSTSTSTMIDEAQKSRPHKIDGRVEPKRAVPRQDIDSPNAGATVK KLFVGALKDDHDEQSIRDYFQHLLLLLLLDLLLLDLLLLDLLLFVEFDDYDPVDKVVLQK QHQLNGKMVDVKKALPKNDQQGGGGGRGGPGGRAGGNRGNMGGGNYGNQNGGGNWNNGGN NWGNNRGNDNWGNNSFGGGGGGGGGYGGGNNSWGNNNPWDNGNGGGNFGGGGNNWNGGND FGGYQQNYGGGPQRGGGNFNNNRMQPYQGGGGFKAGGGNQGNYGNNQGFNNGGNNRRY calm.swiss100644000766000024 3050314006427353 16423 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID CALM_HUMAN STANDARD; PRT; 148 AA. AC P62158; P02593; P70667; P99014; Q61379; Q61380; DT 21-JUL-1986 (Rel. 01, Created) DT 21-JUL-1986 (Rel. 01, Last sequence update) DT 01-OCT-2004 (Rel. 45, Last annotation update) DE Calmodulin (CaM). GN Name=CALM1; Synonyms=CAM1, CALM, CAM; GN and GN Name=CALM2; Synonyms=CAM2, CAMB; GN and GN Name=CALM3; Synonyms=CAM3, CAMC; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP SEQUENCE, ACETYLATION SITE ALA-1, AND METHYLATION SITE LYS-115. RC TISSUE=Brain; RX MEDLINE=82231946; PubMed=7093203; RA Sasagawa T., Ericsson L.H., Walsh K.A., Schreiber W.E., Fischer E.H., RA Titani K.; RT "Complete amino acid sequence of human brain calmodulin."; RL Biochemistry 21:2565-2569(1982). RN [2] RP SEQUENCE FROM N.A. RX MEDLINE=89034207; PubMed=3182832; RA Fischer R., Koller M., Flura M., Mathews S., Strehler-Page M.A., RA Krebs J., Penniston J.T., Carafoli E., Strehler E.E.; RT "Multiple divergent mRNAs code for a single human calmodulin."; RL J. Biol. Chem. 263:17055-17062(1988). RN [3] RP SEQUENCE FROM N.A. 
RX MEDLINE=88059053; PubMed=2445749; RA Sengupta B., Friedberg F., Detera-Wadleigh S.D.; RT "Molecular analysis of human and rat calmodulin complementary DNA RT clones. Evidence for additional active genes in these species."; RL J. Biol. Chem. 262:16663-16670(1987). RN [4] RP SEQUENCE FROM N.A. RX MEDLINE=85022688; PubMed=6385987; RA Wawrzynczak E.J., Perham R.N.; RT "Isolation and nucleotide sequence of a cDNA encoding human RT calmodulin."; RL Biochem. Int. 9:177-185(1984). RN [5] RP SEQUENCE FROM N.A. (CALM1). RC TISSUE=Blood; RX MEDLINE=95010144; PubMed=7925473; RA Rhyner J.A., Ottiger M., Wicki R., Greenwood T.M., Strehler E.E.; RT "Structure of the human CALM1 calmodulin gene and identification of RT two CALM1-related pseudogenes CALM1P1 and CALM1P2."; RL Eur. J. Biochem. 225:71-82(1994). RN [6] RP SEQUENCE FROM N.A. RC TISSUE=Lymphoma; RA Kato S.; RL Submitted (FEB-1995) to the EMBL/GenBank/DDBJ databases. RN [7] RP SEQUENCE FROM N.A. (CALM2). RX MEDLINE=98346173; PubMed=9681195; RA Toutenhoofd S.L., Foletti D., Wicki R., Rhyner J.A., Garcia F., RA Tolon R., Strehler E.E.; RT "Characterization of the human CALM2 calmodulin gene and comparison of RT the transcriptional activity of CALM1, CALM2 and CALM3."; RL Cell Calcium 23:323-338(1998). RN [8] RP SEQUENCE FROM N.A. RA Kalnine N., Chen X., Rolfs A., Halleck A., Hines L., Eisenstein S., RA Koundinya M., Raphael J., Moreira D., Kelley T., LaBaer J., Lin Y., RA Phelan M., Farmer A.; RT "Cloning of human full-length CDSs in BD Creator(TM) system donor RT vector."; RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases. RN [9] RP SEQUENCE FROM N.A. (CALM1). 
RX MEDLINE=22459283; PubMed=12508121; DOI=10.1038/nature01348; RA Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., RA Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., RA Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., RA Sun H., Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., RA Bruels T., Jaillon O., Friedlander L., Samson G., Brottier P., RA Cure S., Segurens B., Aniere F., Samain S., Crespeau H., Abbasi N., RA Aiach N., Boscus D., Dickhoff R., Dors M., Dubois I., Friedman C., RA Gouyvenoux M., James R., Madan A., Mairey-Estrada B., Mangenot S., RA Martins N., Menard M., Oztas S., Ratcliffe A., Shaffer T., Trask B., RA Vacherie B., Bellemere C., Belser C., Besnard-Gonnet M., RA Bartol-Mavel D., Boutard M., Briez-Silla S., Combette S., RA Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., Muselet D., RA Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., Trybou A., RA Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., RA Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., RA Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., RA Verdier J., Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., RA Matsuda F., Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., RA Quetier F., Waterston R., Hood L., Weissenbach J.; RT "The DNA sequence and analysis of human chromosome 14."; RL Nature 421:601-607(2003). RN [10] RP SEQUENCE FROM N.A. 
RC TISSUE=Brain, Lung, Lymph, Placenta, and Urinary bladder; RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899; RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L., RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C., RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H., RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; RT "Generation and initial analysis of more than 15,000 full-length human RT and mouse cDNA sequences."; RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). RN [11] RP STRUCTURE BY NMR OF 94-103. RX PubMed=9927666; RA Siedlecka M., Goch G., Ejchart A., Sticht H., Bierzyski A.; RT "Alpha-helix nucleation by a calcium-binding peptide loop."; RL Proc. Natl. Acad. Sci. U.S.A. 96:903-908(1999). RN [12] RP STRUCTURE BY NMR OF 1-76 AND 82-148. RX PubMed=11685248; DOI=10.1038/nsb1101-990; RA Chou J.J., Li S., Klee C.B., Bax A.; RT "Solution structure of Ca(2+)-calmodulin reveals flexible hand-like RT properties of its domains."; RL Nat. Struct. Biol. 8:990-997(2001). RN [13] RP X-RAY CRYSTALLOGRAPHY (1.7 ANGSTROMS). 
RX PubMed=1474585; RA Chattopadhyaya R., Meador W.E., Means A.R., Quiocho F.A.; RT "Calmodulin structure refined at 1.7 A resolution."; RL J. Mol. Biol. 228:1177-1192(1992). RN [14] RP X-RAY CRYSTALLOGRAPHY (2.45 ANGSTROMS). RX PubMed=7803388; RA Cook W.J., Walter L.J., Walter M.R.; RT "Drug binding by calmodulin: crystal structure of a calmodulin- RT trifluoperazine complex."; RL Biochemistry 33:15259-15265(1994). RN [15] RP X-RAY CRYSTALLOGRAPHY (2.75 ANGSTROMS) OF 5-148. RX MEDLINE=21666515; PubMed=11807546; DOI=10.1038/415396a; RA Drum C.L., Yan S.-Z., Bard J., Shen Y.Q., Lu D., Soelaiman S., RA Grabarek Z., Bohm A., Tang W.-J.; RT "Structural basis for the activation of anthrax adenylyl cyclase RT exotoxin by calmodulin."; RL Nature 415:396-402(2002). RN [16] RP X-RAY CRYSTALLOGRAPHY (3.6 ANGSTROMS) OF 1-148. RX PubMed=12485993; RA Shen Y., Lee Y.-S., Soelaiman S., Bergson P., Lu D., Chen A., RA Beckingham K., Grabarek Z., Mrksich M., Tang W.-J.; RT "Physiological calcium concentrations regulate calmodulin binding and RT catalysis of adenylyl cyclase exotoxins."; RL EMBO J. 21:6721-6732(2002). RN [17] RP X-RAY CRYSTALLOGRAPHY (2.0 ANGSTROMS). RX PubMed=12577052; DOI=10.1038/nsb900; RA Yamauchi E., Nakatsu T., Matsubara M., Kato H., Taniguchi H.; RT "Crystal structure of a MARCKS peptide containing the calmodulin- RT binding domain in complex with Ca2+-calmodulin."; RL Nat. Struct. Biol. 10:226-231(2003). CC -!- FUNCTION: Calmodulin mediates the control of a large number of CC enzymes by Ca(2+). Among the enzymes to be stimulated by the CC calmodulin-Ca(2+) complex are a number of protein kinases and CC phosphatases. CC -!- PTM: Ubiquitylation results in a strongly decreased activity (By CC similarity). CC -!- PTM: Phosphorylation results in a decreased activity (By CC similarity). CC -!- MISCELLANEOUS: This protein has four functional calcium-binding CC sites. CC -!- SIMILARITY: Contains 4 EF-hand calcium-binding domains. 
CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; J04046; AAA51918.1; -. DR EMBL; M19311; AAA35641.1; -. DR EMBL; M27319; AAA35635.1; -. DR EMBL; U12022; AAB60644.1; -. DR EMBL; U11886; AAB60644.1; JOINED. DR EMBL; D45887; BAA08302.1; -. DR EMBL; U94728; AAC83174.1; -. DR EMBL; U94725; AAC83174.1; JOINED. DR EMBL; U94726; AAC83174.1; JOINED. DR EMBL; BT006818; AAP35464.1; -. DR EMBL; BT006855; AAP35501.1; -. DR EMBL; BT009916; AAP88918.1; -. DR EMBL; AC006536; AAD45181.1; -. DR EMBL; BC000454; AAH00454.1; -. DR EMBL; BC003354; AAH03354.1; -. DR EMBL; BC005137; AAH05137.1; -. DR EMBL; BC006464; AAH06464.1; -. DR EMBL; BC008597; AAH08597.1; -. DR EMBL; BC011834; AAH11834.1; -. DR EMBL; BC017385; AAH17385.1; -. DR EMBL; BC018677; AAH18677.1; -. DR EMBL; BC026065; AAH26065.1; -. DR EMBL; BC047523; AAH47523.1; -. DR PIR; S48728; MCHU. DR PDB; 1AJI; 17-SEP-97. DR PDB; 1CLL; 31-OCT-93. DR PDB; 1CTR; 20-DEC-94. DR PDB; 1IWQ; 11-MAR-03. DR PDB; 1J7O; 07-NOV-01. DR PDB; 1J7P; 07-NOV-01. DR PDB; 1K90; 23-JAN-02. DR PDB; 1K93; 23-JAN-02. DR PDB; 1LVC; 04-DEC-02. DR PDB; 1NKF; 23-MAR-99. DR SWISS-2DPAGE; P62158; HUMAN. DR Aarhus/Ghent-2DPAGE; 9048; IEF. DR OGP; P02593; -. DR Genew; HGNC:1442; CALM1. DR Genew; HGNC:1445; CALM2. DR Genew; HGNC:1449; CALM3. DR Reactome; P62158; -. DR MIM; 114180; -. DR MIM; 114182; -. DR MIM; 114183; -. DR GO; GO:0005737; C:cytoplasm; TAS. 
DR GO; GO:0005886; C:plasma membrane; TAS. DR GO; GO:0005509; F:calcium ion binding; TAS. DR GO; GO:0005515; F:protein binding; NAS. DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; TAS. DR InterPro; IPR002048; EF-hand. DR Pfam; PF00036; efhand; 4. DR PRINTS; PR00450; RECOVERIN. DR ProDom; PD000012; EF-hand; 2. DR PROSITE; PS00018; EF_HAND; 4. KW 3D-structure; Acetylation; Calcium-binding; Direct protein sequencing; KW Methylation; Phosphorylation; Repeat; Ubl conjugation. FT INIT_MET 0 0 FT MOD_RES 1 1 N-acetylalanine. FT CA_BIND 20 31 EF-hand 1. FT CA_BIND 56 67 EF-hand 2. FT CA_BIND 93 104 EF-hand 3. FT CA_BIND 129 140 EF-hand 4. FT BINDING 21 21 Ubiquitin (multi-) (By similarity). FT MOD_RES 44 44 Phosphothreonine (by CaMK4) (By FT similarity). FT MOD_RES 115 115 N6,N6,N6-trimethyllysine. FT HELIX 6 18 FT TURN 21 22 FT TURN 29 30 FT HELIX 33 36 FT TURN 37 40 FT TURN 45 46 FT HELIX 47 54 FT TURN 55 55 FT HELIX 66 72 FT TURN 73 74 FT HELIX 75 77 FT TURN 78 78 FT TURN 81 81 FT HELIX 82 92 FT TURN 94 95 FT STRAND 99 100 FT HELIX 102 111 FT TURN 112 113 FT HELIX 118 128 FT STRAND 136 137 FT HELIX 138 146 SQ SEQUENCE 148 AA; 16706 MW; 464B8A287475A1CA CRC64; ADQLTEEQIA EFKEAFSLFD KDGDGTITTK ELGTVMRSLG QNPTEAELQD MINEVDADGN GTIDFPEFLT MMARKMKDTD SEEEIREAFR VFDKDGNGYI SAAELRHVMT NLGEKLTDEE VDEMIREADI DGDGQVNYEE FVQMMTAK // Matrix000755000766000024 014006427353 14567 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tMatrix.t100644000766000024 1046514006427353 16406 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Matrix# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 77); use_ok('Bio::Matrix::Generic'); use_ok('Bio::Matrix::IO'); } my $raw = [ [ 0, 10, 20], [ 2, 17, 4], [ 3, 4, 5] ]; my $matrix = Bio::Matrix::Generic->new(-values => $raw, -matrix_id => 'fakeid00', -matrix_name=> 'matname', -rownames => [qw(A B C)], -colnames => [qw(D E F)] ); is($matrix->matrix_name, 'matname'); 
is($matrix->matrix_id, 'fakeid00'); is($matrix->entry('A','F'), $raw->[0]->[2]); my @colE = $matrix->get_column('E'); is($colE[0], $raw->[0]->[1]); is($colE[1], $raw->[1]->[1]); is($colE[2], $raw->[2]->[1]); my @rowC = $matrix->get_row('C'); is($rowC[0], $raw->[2]->[0]); is($rowC[1], $raw->[2]->[1]); is($rowC[2], $raw->[2]->[2]); is($matrix->row_num_for_name('A'),0); is($matrix->column_num_for_name('D'),0); is($matrix->row_header(1),'B'); is($matrix->column_header(0),'D'); is($matrix->add_row(1, 'b', [qw(21 13 14)]),4); is($matrix->add_column(2, 'f', [qw(71 81 14 3)]),4); is($matrix->add_row(4, 'c', [qw(22 11 17)]),5); is($matrix->remove_row(4),4); is($matrix->add_column(4, 'g', [qw(11 10 100 71)]),5); is($matrix->remove_column(4),4); is($matrix->row_num_for_name('B'),2); is($matrix->row_num_for_name('b'),1); is($matrix->column_num_for_name('D'),0); is($matrix->column_num_for_name('F'),3); is($matrix->column_num_for_name('f'),2); is($matrix->row_header(2),'B'); is($matrix->column_header(3),'F'); is($matrix->get_entry('b', 'f'), 81); # read in a scoring matrix my $io = Bio::Matrix::IO->new(-format => 'scoring', -file => test_input_file('BLOSUM50')); my $blosum_matrix = $io->next_matrix; isa_ok($blosum_matrix,'Bio::Matrix::Scoring'); is($blosum_matrix->entropy, 0.4808); is($blosum_matrix->expected_score, -0.3573); is($blosum_matrix->scale, '1/3'); is($blosum_matrix->get_entry('*','A'), -5); is($blosum_matrix->get_entry('V','Y'), -1); is($blosum_matrix->get_entry('Y','V'), -1); is($blosum_matrix->get_entry('L','I'), 2); my @diag = $blosum_matrix->get_diagonal; is($diag[2],7); my @row = $blosum_matrix->get_row('D'); is($row[5], $blosum_matrix->get_entry('D','Q')); is($blosum_matrix->num_rows,24); is($blosum_matrix->num_columns,24); $io = Bio::Matrix::IO->new(-format => 'scoring', -file => test_input_file('PAM250')); my $pam_matrix = $io->next_matrix; isa_ok($pam_matrix, 'Bio::Matrix::Scoring'); is($pam_matrix->entropy, 0.354); is($pam_matrix->expected_score, -0.844); 
is($pam_matrix->scale, 'ln(2)/3'); is($pam_matrix->num_rows,24); is($pam_matrix->get_entry('G','*'), -8); is($pam_matrix->get_entry('V','Y'), -2); is($pam_matrix->get_entry('Y','V'), -2); is($pam_matrix->get_entry('L','I'), 2); @diag = $pam_matrix->get_diagonal; is($diag[2],2); @row = $pam_matrix->get_row('D'); is($row[5], $pam_matrix->get_entry('D','Q')); # test Phylip parsing $io = Bio::Matrix::IO->new(-format => 'phylip', -program => 'phylipdist', -file => test_input_file('phylipdist.out')); my $phy = $io->next_matrix; is $phy->program, 'phylipdist'; is $phy->get_entry('Alpha','Beta'), '4.23419'; is $phy->get_entry('Gamma','Alpha'),'3.63330'; my @column = $phy->get_column('Alpha'); is $column[0], '0.00000'; is $column[1], '4.23419'; is $column[2], '3.63330'; is $column[3], '6.20865'; is $column[4], '3.45431'; @row = $phy->get_row('Gamma'); is $row[0], '3.63330'; is $row[1], '3.49289'; is $row[2], '0.00000'; is $row[3], '3.68733'; is $row[4], '5.84929'; @diag = $phy->get_diagonal; is $diag[0], '0.00000'; is $diag[1], '0.00000'; is $diag[2], '0.00000'; is $diag[3], '0.00000'; is $diag[4], '0.00000'; # test mlagan parsing $io = Bio::Matrix::IO->new(-format => 'mlagan', -file => test_input_file('nucmatrix.txt')); my $mlag = $io->next_matrix; is $mlag->get_entry('A', 'C'), -150; is $mlag->get_entry('.', 'A'), 0; is $mlag->gap_open, -300; is $mlag->gap_continue, -25; # test output round-trip $mlag->entry('A', 'C', -149); $mlag->gap_open(-150); $mlag->gap_continue(-5); my $out = test_output_file(); $io = Bio::Matrix::IO->new(-format => 'mlagan', -file => ">$out"); $io->write_matrix($mlag); $io = Bio::Matrix::IO->new(-format => 'mlagan', -file => $out); $mlag = $io->next_matrix; is $mlag->get_entry('A', 'C'), -149; is $mlag->gap_open, -150; is $mlag->gap_continue, -5; IO000755000766000024 014006427353 15076 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Matrixpsm.t100644000766000024 1217614006427353 16251 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Matrix/IO# 
-*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 63); use_ok ('Bio::Matrix::PSM::IO'); } my $mmt= "chr04q 170164 170208 strong - 0 Motif 3 occurrance in chr04q chr04q 215755 215799 strong + 0 Motif 4 occurrance in chr04q chr04q 532530 532574 strong + 2 Motif 2 occurrance in chr04q chr04q 539492 539536 strong - 1 Motif 1 occurrance in chr04q chr04q 586113 586157 strong + 2 Motif 2 occurrance in chr04q chr04q 698245 698289 strong - 0 Motif 4 occurrance in chr04q chr04q 804412 804456 strong - 0 Motif 3 occurrance in chr04q chr04q 858870 858914 strong - 2 Motif 3 occurrance in chr04q chr04q 861561 861605 strong - 2 Motif 3 occurrance in chr04q chr04q 916898 916942 strong - 1 Motif 1 occurrance in chr04q chr04q 1146916 1146960 strong - 0 Motif 1 occurrance in chr04q chr04q 1315772 1315816 strong + 1 Motif 1 occurrance in chr04q chr04q 1636119 1636163 strong + 2 Motif 3 occurrance in chr04q chr04q 1636200 1636244 strong + 2 Motif 1 occurrance in chr04q chr04q 1636437 1636481 strong + 2 Motif 4 occurrance in chr04q chr04q 1637361 1637405 strong + 2 Motif 2 occurrance in chr04q chr04q 1652447 1652491 strong + 1 Motif 4 occurrance in chr04q"; my @mmt=split(/\n/,$mmt); #Let's try meme here my $psmIO = Bio::Matrix::PSM::IO->new(-format=>'meme', -file=>test_input_file('meme.dat')); ok $psmIO; my @inputfile=grep(/datafile/i,$psmIO->unstructured); ok @inputfile; my $release=$psmIO->release; ok $release; my @ids=$psmIO->hid; is @ids,4; my %weights=$psmIO->weight; ok %weights; my %seq = $psmIO->seq; is %seq,'0';#Meme doesn't have seq is $psmIO->version,'3.0'; my $psm = $psmIO->next_psm; ok $psm; #Lets try to compress and uncompress the log odds and the frequencies, see if there is no #considerable loss of data. 
my $fA=$psm->get_compressed_freq('A');
my @check=Bio::Matrix::PSM::SiteMatrix::_uncompress_string($fA,1,1);
my @A=$psm->get_array('A');
my ($var,$max) = (0,0);
for (my $i = 0; $i<@check;$i++) {
    my $diff=abs(abs($check[$i])-abs($A[$i]));
    $var += $diff;
    $max=$diff if ($diff>$max);
}
my $avg=$var/@check;
cmp_ok $avg,'<',0.01; #Loss of data under 1 percent
#print $avg,"\n";
is $psm->sequence_match_weight('CAGAAAAATAAAATGGCCACCACCC'),2015;

my $lA=$psm->get_compressed_logs('A');
@check=Bio::Matrix::PSM::SiteMatrix::_uncompress_string($lA,1000,2);
@A=$psm->get_logs_array('A');
($var,$max) = (0,0);
for (my $i = 0;$i<@check;$i++) {
    my $diff=abs(abs($check[$i])-abs($A[$i]));
    $var += $diff;
    $max=$diff if ($diff>$max);
}
$avg=$var/@check;
cmp_ok $avg,'<',10; #Loss of data under 1 percent

my $matrix=$psm->matrix;
ok $matrix;
my $psm2=$psm;
$psm2->matrix($matrix);
is $psm,$psm2;

my %psm_header=$psm->header;
is $psm_header{IC},38.1;
is $psm_header{sites},4;
is $psm_header{width},25;
is $psm_header{e_val},'1.2e-002';

#Quick check if returned object works
my $IUPAC=$psm->IUPAC;
is $IUPAC,'CMKWMAAAKWVAWTYCMCASCHCCM';
is $IUPAC,$psm2->IUPAC;
is $IUPAC,$matrix->IUPAC;

my $instances=$psm->instances;
ok $instances;

foreach my $instance (@{$instances}) {
    my $id=$instance->primary_id;
    is $instance->strand,1;
    last if (ok $id);
}

ok $psm->header('e_val');
#Meme parser should be OK if tests passed

#Now we are going to try transfac
$psmIO = Bio::Matrix::PSM::IO->new(-format=>'transfac',
                                   -file=> test_input_file('transfac.dat'));
ok $psmIO;

my $version=$psmIO->version;
ok !$version;

is $psmIO->release, '6.4--2002-12-02';

$psm = $psmIO->next_psm;
ok $psm;

# Let's try to compress and uncompress the frequencies, see if
# there is no considerable loss of data.
$fA=$psm->get_compressed_freq('A'); @check=Bio::Matrix::PSM::SiteMatrix::_uncompress_string($fA,1,1); @A=$psm->get_array('A'); ($var,$max) = (0,0); for (my $i = 0; $i<@check;$i++) { my $diff=abs(abs($check[$i])-abs($A[$i])); $var += $diff; $max=$diff if ($diff>$max); } $avg=$var/@check; cmp_ok $avg,'<',0.01; #Loss of data under 1 percent %weights = $psmIO->weight; ok !$weights{''}; %seq = $psmIO->seq; is scalar keys %seq, 0; #Quick check if returned object works $IUPAC = $psm->IUPAC; is $IUPAC,'VVDCAKSTGBYD'; #Now we are going to try mast $psmIO = Bio::Matrix::PSM::IO->new(-format=>'mast', -file=>test_input_file('mast.dat')); ok $psmIO; @inputfile = grep(/datafile/i,$psmIO->unstructured); ok !@inputfile; is( $psmIO->release, '2002/04/02 0:11:59'); @ids = $psmIO->hid; is @ids,4; %weights = $psmIO->weight; ok !%weights; #Mast doesn't have weights ok %seq = $psmIO->seq; foreach my $id ($psmIO->hid) { ok $seq{$id}; } ok $psm=$psmIO->next_psm; my %instances=$psmIO->instances; ok %instances; is $psmIO->version, '3.0'; my $mmastIO=Bio::Matrix::PSM::IO->new(-format=>'mast',-file=>test_input_file('mixedmast.dat')); $psm = $mmastIO->next_psm; my $lastinstances = $psm->instances(); my $i=0; foreach my $hit (@$lastinstances) { $hit -> end ( $hit-> start () + length ($hit->seq) - 1 ) ; # fix an old bug in InstanceSite.pm my $d=join("\t",$hit->{accession_number},$hit -> start () , $hit-> end (),$hit -> score (), $hit -> strand == 1 ? 
                 '+' : '-' , $hit -> frame, $hit -> desc ( ));
    is $d,$mmt[$i];
    $i++;
    last if ($hit -> start == 1652447);
}
TreeIO.pm100644000766000024 2037014006427353 16206 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio#
# BioPerl module for Bio::TreeIO
#
# Please direct questions and support issues to <bioperl-l@bioperl.org>
#
# Cared for by Jason Stajich
#
# Copyright Jason Stajich
#
# You may distribute this module under the same terms as perl itself

# POD documentation - main docs before the code

=head1 NAME

Bio::TreeIO - Parser for Tree files

=head1 SYNOPSIS

  {
      use Bio::TreeIO;
      my $treeio = Bio::TreeIO->new(-format => 'newick',
                                    -file   => 'globin.dnd');
      while( my $tree = $treeio->next_tree ) {
          print "Tree is ", $tree->number_nodes, "\n";
      }
  }

=head1 DESCRIPTION

This is the driver module for Tree reading from data streams and
flatfiles.  This is intended to be able to create Bio::Tree::TreeI
objects.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list.  Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Jason Stajich

Email jason-at-bioperl-dot-org

=head1 CONTRIBUTORS

Allen Day E<lt>allenday@ucla.eduE<gt>

=head1 APPENDIX

The rest of the documentation details each of the object methods.
Internal methods are usually preceded with a _ =cut # Let the code begin... package Bio::TreeIO; $Bio::TreeIO::VERSION = '1.7.8'; use strict; # Object preamble - inherits from Bio::Root::Root use Bio::TreeIO::TreeEventBuilder; use base qw(Bio::Root::Root Bio::Root::IO Bio::Event::EventGeneratorI Bio::Factory::TreeFactoryI); =head2 new Title : new Usage : my $obj = Bio::TreeIO->new(); Function: Builds a new Bio::TreeIO object Returns : Bio::TreeIO Args : a hash. useful keys: -format : Specify the format of the file. Supported formats: newick Newick tree format nexus Nexus tree format nhx NHX tree format svggraph SVG graphical representation of tree tabtree ASCII text representation of tree lintree lintree output format =cut sub new { my($caller,@args) = @_; my $class = ref($caller) || $caller; # or do we want to call SUPER on an object if $caller is an # object?n my $obj; if( $class =~ /Bio::TreeIO::(\S+)/ ) { $obj = $class->SUPER::new(@args); $obj->_initialize(@args); } else { my %param = @args; @param{ map { lc $_ } keys %param } = values %param; # lowercase keys my $format = $param{'-format'} || $class->_guess_format( $param{'-file'} || $ARGV[0] ) || 'newick'; $format = "\L$format"; # normalize capitalization to lower case # normalize capitalization return undef unless( $class->_load_format_module($format) ); $obj = "Bio::TreeIO::$format"->new(@args); } return $obj; } =head2 format Title : format Usage : $format = $obj->format() Function: Get the tree format Returns : tree format Args : none =cut # format() method inherited from Bio::Root::IO =head2 next_tree Title : next_tree Usage : my $tree = $treeio->next_tree; Function: Gets the next tree off the stream Returns : Bio::Tree::TreeI or undef if no more trees Args : none =cut sub next_tree{ my ($self) = @_; $self->throw("Cannot call method next_tree on Bio::TreeIO object must use a subclass"); } =head2 write_tree Title : write_tree Usage : $treeio->write_tree($tree); Function: Writes a tree onto the stream 
 Returns : none
 Args    : Bio::Tree::TreeI

=cut

sub write_tree{
   my ($self,$tree) = @_;
   $self->throw("Cannot call method write_tree on Bio::TreeIO object must use a subclass");
}

=head2 attach_EventHandler

 Title   : attach_EventHandler
 Usage   : $parser->attach_EventHandler($handler)
 Function: Adds an event handler to listen for events
 Returns : none
 Args    : Bio::Event::EventHandlerI

=cut

sub attach_EventHandler{
    my ($self,$handler) = @_;
    return if( ! $handler );
    if( ! $handler->isa('Bio::Event::EventHandlerI') ) {
        $self->warn("Ignoring request to attach handler ".ref($handler).
                    ' because it is not a Bio::Event::EventHandlerI');
        # Actually ignore the request, as the warning says, instead of
        # falling through and attaching the non-conforming handler.
        return;
    }
    $self->{'_handler'} = $handler;
    return;
}

=head2 _eventHandler

 Title   : _eventHandler
 Usage   : private
 Function: Get the EventHandler
 Returns : Bio::Event::EventHandlerI
 Args    : none

=cut

sub _eventHandler{
   my ($self) = @_;
   return $self->{'_handler'};
}

sub _initialize {
    my($self, @args) = @_;
    $self->{'_handler'} = undef;

    $self->get_params; # Initialize the default parameters.

    my ($nen,$ini) = $self->_rearrange
        ([qw(NEWLINE_EACH_NODE
             INTERNAL_NODE_ID)],@args);
    $self->set_param('newline_each_node',$nen);
    $self->set_param('internal_node_id',$ini);

    $self->attach_EventHandler(Bio::TreeIO::TreeEventBuilder->new
                               (-verbose => $self->verbose(), @args));
    $self->_initialize_io(@args);
    #$self->debug_params;
}

=head2 _load_format_module

 Title   : _load_format_module
 Usage   : *INTERNAL TreeIO stuff*
 Function: Loads up (like use) a module at run time on demand
 Example :
 Returns :
 Args    :

=cut

sub _load_format_module {
  my ($self,$format) = @_;
  my $module = "Bio::TreeIO::" .
$format; my $ok; eval { $ok = $self->_load_module($module); }; if ( $@ ) { print STDERR <get_params->{$param} = $value; } return $self->get_params->{$param}; } sub set_param { my $self = shift; my $param = shift; my $value = shift; #print STDERR "[$param] -> [undef]\n" if (!defined $value); return unless (defined $value); #print STDERR "[$param] -> [$value]\n"; $self->get_params->{$param} = $value; return $self->param($param); } sub params { my $self = shift; return $self->get_params; } sub get_params { my $self = shift; if (!defined $self->{_params}) { $self->{_params} = $self->get_default_params; } return $self->{_params}; } sub set_params { my $self = shift; my $params = shift; # Apply all the passed parameters to our internal parm hashref. my $cur_params = $self->get_params; $self->{_params} = { %$cur_params, %$params }; return $self->get_params; } sub get_default_params { my $self = shift; return {}; } sub debug_params { my $self = shift; my $params = $self->get_params; print STDERR "{\n"; foreach my $param (keys %$params) { my $value = $params->{$param}; print STDERR " [$param] -> [$value]\n"; } print STDERR "}\n"; } =head2 _guess_format Title : _guess_format Usage : $obj->_guess_format($filename) Function: Example : Returns : guessed format of filename (lower case) Args : =cut sub _guess_format { my $class = shift; return unless $_ = shift; return 'newick' if /\.(dnd|newick|nh)$/i; return 'nhx' if /\.(nhx)$/i; return 'phyloxml' if /\.(xml)$/i; return 'svggraph' if /\.svg$/i; return 'lintree' if( /\.(lin|lintree)$/i ); } sub DESTROY { my $self = shift; $self->close(); } sub TIEHANDLE { my $class = shift; return bless {'treeio' => shift},$class; } sub READLINE { my $self = shift; return $self->{'treeio'}->next_tree() || undef unless wantarray; my (@list,$obj); push @list,$obj while $obj = $self->{'treeio'}->next_tree(); return @list; } sub PRINT { my $self = shift; $self->{'treeio'}->write_tree(@_); } 1; RangeI.pm100644000766000024 4670014006427353 16231 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio#
# BioPerl module for Bio::RangeI
#
# Please direct questions and support issues to <bioperl-l@bioperl.org>
#
# Cared for by Lehvaslaiho
#
# Copyright Matthew Pocock
#
# You may distribute this module under the same terms as perl itself
#
# POD documentation - main docs before the code

=head1 NAME

Bio::RangeI - Range interface

=head1 SYNOPSIS

  #Do not run this module directly

=head1 DESCRIPTION

This provides a standard BioPerl range interface that should be
implemented by any object that wants to be treated as a range. This
serves purely as an abstract base class for implementers and cannot
be instantiated.

Ranges are modeled as having (start, end, length, strand). They use
Bio-coordinates - all points E<gt>= start and E<lt>= end are within the
range. End is always greater than or equal to start, and length is
greater than or equal to 1. The behaviour of a range is undefined if
ranges with negative numbers or zero are used.

So, in summary:

  length = end - start + 1
  end >= start
  strand = (-1 | 0 | +1)

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists.  Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution.
Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Heikki Lehvaslaiho Email: heikki-at-bioperl-dot-org =head1 CONTRIBUTORS Juha Muilu (muilu@ebi.ac.uk) Sendu Bala (bix@sendu.me.uk) Malcolm Cook (mec@stowers-institute.org) Stephen Montgomery (sm8 at sanger.ac.uk) =head1 APPENDIX The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ =cut package Bio::RangeI; $Bio::RangeI::VERSION = '1.7.8'; use strict; use Carp; use integer; use vars qw(%STRAND_OPTIONS); use base qw(Bio::Root::RootI); BEGIN { # STRAND_OPTIONS contains the legal values for the strand-testing options %STRAND_OPTIONS = map { $_, '_' . $_ } ( 'strong', # ranges must have the same strand 'weak', # ranges must have the same strand or no strand 'ignore', # ignore strand information ); } # utility methods # # returns true if strands are equal and non-zero sub _strong { my ($r1, $r2) = @_; my ($s1, $s2) = ($r1->strand(), $r2->strand()); return 1 if $s1 != 0 && $s1 == $s2; } # returns true if strands are equal or either is zero sub _weak { my ($r1, $r2) = @_; my ($s1, $s2) = ($r1->strand(), $r2->strand()); return 1 if $s1 == 0 || $s2 == 0 || $s1 == $s2; } # returns true for any strandedness sub _ignore { return 1; } # works out what test to use for the strictness and returns true/false # e.g. $r1->_testStrand($r2, 'strong') sub _testStrand() { my ($r1, $r2, $comp) = @_; return 1 unless $comp; my $func = $STRAND_OPTIONS{$comp}; return $r1->$func($r2); } =head1 Abstract methods These methods must be implemented in all subclasses. 
=head2 start Title : start Usage : $start = $range->start(); Function: get/set the start of this range Returns : the start of this range Args : optionally allows the start to be set using $range->start($start) =cut sub start { shift->throw_not_implemented(); } =head2 end Title : end Usage : $end = $range->end(); Function: get/set the end of this range Returns : the end of this range Args : optionally allows the end to be set using $range->end($end) =cut sub end { shift->throw_not_implemented(); } =head2 length Title : length Usage : $length = $range->length(); Function: get/set the length of this range Returns : the length of this range Args : optionally allows the length to be set using $range->length($length) =cut sub length { shift->throw_not_implemented(); } =head2 strand Title : strand Usage : $strand = $range->strand(); Function: get/set the strand of this range Returns : the strandedness (-1, 0, +1) Args : optionally allows the strand to be set using $range->strand($strand) =cut sub strand { shift->throw_not_implemented(); } =head1 Boolean Methods These methods return true or false. They throw an error if start and end are not defined. 
  $range->overlaps($otherRange) && print "Ranges overlap\n";

=head2 overlaps

  Title   : overlaps
  Usage   : if($r1->overlaps($r2)) { do stuff }
  Function: tests if $r2 overlaps $r1
  Args    : arg #1 = a range to compare this one to (mandatory)
            arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore')
  Returns : true if the ranges overlap, false otherwise

=cut

sub overlaps {
    my ($self, $other, $so) = @_;

    $self->throw("start is undefined") unless defined $self->start;
    $self->throw("end is undefined") unless defined $self->end;
    $self->throw("not a Bio::RangeI object") unless defined $other &&
        $other->isa('Bio::RangeI');
    $other->throw("start is undefined") unless defined $other->start;
    $other->throw("end is undefined") unless defined $other->end;

    return
        ($self->_testStrand($other, $so)
         and not (
                  ($self->start() > $other->end() or
                   $self->end() < $other->start()   )
                 ));
}

=head2 contains

  Title   : contains
  Usage   : if($r1->contains($r2)) { do stuff }
  Function: tests whether $r1 totally contains $r2
  Args    : arg #1 = a range to compare this one to (mandatory)
                     alternatively, integer scalar to test
            arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore')
  Returns : true if the argument is totally contained within this range

=cut

sub contains {
    my ($self, $other, $so) = @_;
    $self->throw("start is undefined") unless defined $self->start;
    $self->throw("end is undefined") unless defined $self->end;

    if(defined $other && ref $other) { # a range object?
        $other->throw("Not a Bio::RangeI object: $other") unless $other->isa('Bio::RangeI');
        $other->throw("start is undefined") unless defined $other->start;
        $other->throw("end is undefined") unless defined $other->end;

        return ($self->_testStrand($other, $so) and
                $other->start() >= $self->start() and
                $other->end() <= $self->end());
    } else { # a scalar?
        $self->throw("'$other' is not an integer.\n") unless $other =~ /^[-+]?\d+$/;
        return ($other >= $self->start() and $other <= $self->end());
    }
}

=head2 equals

  Title   : equals
  Usage   : if($r1->equals($r2))
  Function: test whether $r1 has the same start, end, length as $r2
  Args    : arg #1 = a range to compare this one to (mandatory)
            arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore')
  Returns : true if they are describing the same range

=cut

sub equals {
    my ($self, $other, $so) = @_;

    $self->throw("start is undefined") unless defined $self->start;
    $self->throw("end is undefined") unless defined $self->end;
    $other->throw("Not a Bio::RangeI object") unless $other->isa('Bio::RangeI');
    $other->throw("start is undefined") unless defined $other->start;
    $other->throw("end is undefined") unless defined $other->end;

    return ($self->_testStrand($other, $so) and
            $self->start() == $other->start() and
            $self->end() == $other->end()       );
}

=head1 Geometrical methods

These methods do things to the geometry of ranges, and return
Bio::RangeI compliant objects or triplets (start, stop, strand) from
which new ranges could be built.

=head2 intersection

 Title   : intersection
 Usage   : ($start, $end, $strand) = $r1->intersection($r2); OR
           ($start, $end, $strand) = Bio::Range->intersection(\@ranges); OR
           my $containing_range = $r1->intersection($r2); OR
           my $containing_range = Bio::Range->intersection(\@ranges);
 Function: gives the range that is contained by all ranges
 Returns : undef if they do not overlap or if @ranges has only a
           single range, else returns the range that they do
           overlap. In scalar context, the return value is an object of
           the same class as the calling one. In array context the
           return value is a three element array.
Args : arg #1 = [REQUIRED] a Bio::RangeI to compare this one to, or an array ref of ranges arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore') =cut sub intersection { my ($self, $given, $so) = @_; $self->throw("missing arg: you need to pass in another feature") unless $given; my @ranges; if ($self eq "Bio::RangeI") { $self = "Bio::Range"; $self->warn("calling static methods of an interface is deprecated; use $self instead"); } if (ref $self) { push(@ranges, $self); } ref($given) eq 'ARRAY' ? push(@ranges, @{$given}) : push(@ranges, $given); #$self->throw("Need at least 2 ranges") unless @ranges >= 2; # Rather than the above, I think the following is more consistent return undef unless @ranges >= 2; my $intersect; while (@ranges > 0) { unless ($intersect) { $intersect = shift(@ranges); $self->throw("Not an object: $intersect") unless ref($intersect); $self->throw("Not a Bio::RangeI object: $intersect") unless $intersect->isa('Bio::RangeI'); $self->throw("start is undefined") unless defined $intersect->start; $self->throw("end is undefined") unless defined $intersect->end; } my $compare = shift(@ranges); $self->throw("Not an object: $compare") unless ref($compare); $self->throw("Not a Bio::RangeI object: $compare") unless $compare->isa('Bio::RangeI'); $self->throw("start is undefined") unless defined $compare->start; $self->throw("end is undefined") unless defined $compare->end; return unless $compare->_testStrand($intersect, $so); my @starts = sort {$a <=> $b} ($intersect->start(), $compare->start()); my @ends = sort {$a <=> $b} ($intersect->end(), $compare->end()); my $start = pop @starts; # larger of the 2 starts my $end = shift @ends; # smaller of the 2 ends my $intersect_strand; # strand for the intersection if (defined($intersect->strand) && defined($compare->strand) && $intersect->strand == $compare->strand) { $intersect_strand = $compare->strand; } else { $intersect_strand = 0; } if ($start > $end) { return; } else { $intersect = 
                $self->new(-start  => $start,
                           -end    => $end,
                           -strand => $intersect_strand);
        }
    }

    if (wantarray()) {
        return ($intersect->start, $intersect->end, $intersect->strand);
    }
    else {
        return $intersect;
    }
}

=head2 union

   Title   : union
   Usage   : ($start, $end, $strand) = $r1->union($r2);
           : ($start, $end, $strand) = Bio::Range->union(@ranges);
             my $newrange = Bio::Range->union(@ranges);
   Function: finds the minimal Range that contains all of the Ranges
   Args    : a Range or list of Range objects
   Returns : the range containing all of the ranges. In scalar context,
             the return value is an object of the same class as the
             calling one. In array context the return value is a
             three element array.

=cut

sub union {
    my $self = shift;
    my @ranges = @_;
    if ($self eq "Bio::RangeI") {
        $self = "Bio::Range";
        $self->warn("calling static methods of an interface is deprecated; use $self instead");
    }
    if(ref $self) {
        unshift @ranges, $self;
    }

    my @start = sort {$a<=>$b} map( { $_->start() } @ranges);
    my @end   = sort {$a<=>$b} map( { $_->end()   } @ranges);

    my $start = shift @start;
    while( !defined $start ) {
        $start = shift @start;
    }

    my $end = pop @end;

    my $union_strand;  # Strand for the union range object.

    foreach(@ranges) {
        if(!
defined $union_strand) { $union_strand = $_->strand; next; } else { if(not defined $_->strand or $union_strand ne $_->strand) { $union_strand = 0; last; } } } return unless $start or $end; if( wantarray() ) { return ( $start,$end,$union_strand); } else { return $self->new('-start' => $start, '-end' => $end, '-strand' => $union_strand ); } } =head2 overlap_extent Title : overlap_extent Usage : ($a_unique,$common,$b_unique) = $a->overlap_extent($b) Function: Provides actual amount of overlap between two different ranges Example : Returns : array of values containing the length unique to the calling range, the length common to both, and the length unique to the argument range Args : a range =cut sub overlap_extent{ my ($a,$b) = @_; $a->throw("start is undefined") unless defined $a->start; $a->throw("end is undefined") unless defined $a->end; $b->throw("Not a Bio::RangeI object") unless $b->isa('Bio::RangeI'); $b->throw("start is undefined") unless defined $b->start; $b->throw("end is undefined") unless defined $b->end; if( ! $a->overlaps($b) ) { return ($a->length,0,$b->length); } my ($au,$bu) = (0, 0); if( $a->start < $b->start ) { $au = $b->start - $a->start; } else { $bu = $a->start - $b->start; } if( $a->end > $b->end ) { $au += $a->end - $b->end; } else { $bu += $b->end - $a->end; } my $intersect = $a->intersection($b); if( ! 
$intersect ) { warn("no intersection\n"); return ($au, 0, $bu); } else { my $ie = $intersect->end; my $is = $intersect->start; return ($au,$ie-$is+1,$bu); } } =head2 disconnected_ranges Title : disconnected_ranges Usage : my @disc_ranges = Bio::Range->disconnected_ranges(@ranges); Function: finds the minimal set of ranges such that each input range is fully contained by at least one output range, and none of the output ranges overlap Args : a list of ranges Returns : a list of objects of the same type as the input (conforms to RangeI) =cut sub disconnected_ranges { my $self = shift; if ($self eq "Bio::RangeI") { $self = "Bio::Range"; $self->warn("calling static methods of an interface is deprecated; use $self instead"); } my @inranges = @_; if(ref $self) { unshift @inranges, $self; } my @outranges = (); # disconnected ranges # iterate through all input ranges $inrange, # adding each input range to the set of output ranges @outranges, # provided $inrange does not overlap ANY range in @outranges # - if it does overlap an outrange, then merge it foreach my $inrange (@inranges) { my $intersects = 0; my @outranges_new = (); my @intersecting_ranges = (); # iterate through all @outranges, testing if it intersects # current $inrange; if it does, merge and add to list # of @intersecting_ranges, otherwise add $outrange to # the new list of outranges that do NOT intersect for (my $i=0; $i<@outranges; $i++) { my $outrange = $outranges[$i]; my $intersection = $inrange->intersection($outrange); if ($intersection) { $intersects = 1; my $union = $inrange->union($outrange); push(@intersecting_ranges, $union); } else { push(@outranges_new, $outrange); } } @outranges = @outranges_new; # @outranges now contains a list of non-overlapping ranges # that do not intersect the current $inrange if (@intersecting_ranges) { if (@intersecting_ranges > 1) { # this sf intersected > 1 range, which means that # all the ranges it intersects should be joined # together in a new range my $merged_range 
= $self->union(@intersecting_ranges); push(@outranges, $merged_range); } else { # exactly 1 intersecting range push(@outranges, @intersecting_ranges); } } else { # no intersections found - new range push(@outranges, $self->new('-start'=>$inrange->start, '-end'=>$inrange->end, '-strand'=>$inrange->strand, )); } } return @outranges; } =head2 offsetStranded Title : offsetStranded Usage : $rnge->offsetStranded($fiveprime_offset, $threeprime_offset) Function : destructively modifies RangeI implementing object to offset its start and stop coordinates by values $fiveprime_offset and $threeprime_offset (positive values being in the strand direction). Args : two integer offsets: $fiveprime_offset and $threeprime_offset Returns : $self, offset accordingly. =cut sub offsetStranded { my ($self, $offset_fiveprime, $offset_threeprime) = @_; my ($offset_start, $offset_end) = $self->strand() eq -1 ? (- $offset_threeprime, - $offset_fiveprime) : ($offset_fiveprime, $offset_threeprime); $self->start($self->start + $offset_start); $self->end($self->end + $offset_end); return $self; }; =head2 subtract Title : subtract Usage : my @subtracted = $r1->subtract($r2) Function: Subtract range r2 from range r1 Args : arg #1 = a range to subtract from this one (mandatory) arg #2 = strand option ('strong', 'weak', 'ignore') (optional) Returns : undef if they do not overlap or r2 contains this RangeI, or an arrayref of Range objects (this is an array since some instances where the subtract range is enclosed within this range will result in the creation of two new disjoint ranges) =cut sub subtract() { my ($self, $range, $so) = @_; $self->throw("missing arg: you need to pass in another feature") unless $range; return unless $self->_testStrand($range, $so); if ($self eq "Bio::RangeI") { $self = "Bio::Range"; $self->warn("calling static methods of an interface is deprecated; use $self instead"); } $range->throw("Input a Bio::RangeI object") unless $range->isa('Bio::RangeI'); my @sub_locations; if 
($self->location->isa('Bio::Location::SplitLocationI') ) { @sub_locations = $self->location->sub_Location; } else { @sub_locations = $self; } my @outranges; foreach my $sl (@sub_locations) { if (!$sl->overlaps($range)) { push(@outranges, $self->new('-start' =>$sl->start, '-end' =>$sl->end, '-strand'=>$sl->strand, )); next; } ##Subtracts everything if ($range->contains($sl)) { next; } my ($start, $end, $strand) = $sl->intersection($range, $so); ##Subtract intersection from $self range if ($sl->start < $start) { push(@outranges, $self->new('-start' =>$sl->start, '-end' =>$start - 1, '-strand'=>$sl->strand, )); } if ($sl->end > $end) { push(@outranges, $self->new('-start' =>$end + 1, '-end' =>$sl->end, '-strand'=>$sl->strand, )); } } if (@outranges) { return \@outranges; } return; } 1; bp_seqretsplit100644000766000024 265414006427353 16762 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl # =head1 NAME bp_seqretsplit - split a sequence (or stream) into a single file per sequence =head1 SYNOPSIS bp_seqretsplit file1 file2 .. # or bp_seqretsplit < file1 =head1 DESCRIPTION The script splits all sequences from FASTA file(s) (or stdin) into individual files. The filename is the sequence ID (everything before the 1st whitespace in a FASTA header). The script does not currently check whether it is overwriting an existing file, so IDs should be unique. It is inspired by the EMBOSS seqretsplit tool. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. 
Bug reports can be submitted via email or the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR Jason Stajich E<lt>jason_AT_bioperl_DOT_orgE<gt> =cut use strict; use warnings; use Bio::SeqIO; my $in = Bio::SeqIO->new(-format => 'fasta', -fh => \*ARGV); while( my $s = $in->next_seq ) { Bio::SeqIO->new(-format => 'fasta', -file => ">".$s->id.".fa")->write_seq($s); } bp_oligo_count100644000766000024 1413214006427353 16736 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl # # oligomer_freq.pl # We use this to determine what primers are useful for frequent priming of # nucleic acid for random labeling # Input: Sequence file, oligomer length # Output: Tab-delimited text file of oligomer frequencies # Written July 2, 2001 # Charles C. Kim ########### # MODULES # ########### use Bio::Seq; use Bio::SeqIO; use Getopt::Long; ######################### # VARIABLES & FILENAMES # ######################### use strict; use warnings; my ($format, $infile, $help, $outfile, $oligomerlength) = ('fasta'); GetOptions( 'f|format:s' => \$format, 'i|in|s|sequence:s' => \$infile, 'h|help|?' => \$help, 'o|out:s' => \$outfile, 'length:i' => \$oligomerlength ); my $USAGE = "Usage:\toligo_count [-h/--help] [-l/--length OLIGOLENGTH]\n". "\t[-f/--format SEQFORMAT] [-i/--in/-s/--sequence SEQFILE]\n". "\t[-o/--out OUTFILE]\n". "\tDefault SEQFORMAT is fasta\n"; print $USAGE and exit if $help; unless ($infile ) { print 'Enter your concatenated FASTA sequence filename: '; chomp ($infile=<STDIN>); } unless (-e $infile) { die "$infile not found\n"; } if ($outfile) { if (-e $outfile) { print "$outfile already exists! Overwrite (Y/N)? "; chomp ($_ = <STDIN>); while (/[^yn]/i) { print 'Y or N, please: '; chomp ($_ = <STDIN>); } if (/n/i) { die "$outfile not overwritten.\n"; } } #} else { # print 'Enter an output filename: '; # chomp ($outfile=<STDIN>); # if (-e $outfile) { # print "$outfile already exists! Overwrite (Y/N)? 
"; # chomp ($_ = ); # while (/[^yn]/i) { # print 'Y or N, please: '; # chomp ($_ = ); # } # if (/n/i) { die "$outfile not overwritten.\n"; } # } } unless ($oligomerlength) { while () { print 'Enter an oligomer length to count: '; chomp($oligomerlength=); if ($oligomerlength !~ /\d/) { print "Value is non-numeric!\n"; } else {last;} } } ######## # MAIN # ######## if ($oligomerlength >= 9) { print "An oligomer length of $oligomerlength will generate "; print 4 ** $oligomerlength, " combinations,\nwhich could cause "; print "an out of memory error. Proceed? (y/n) "; chomp($_=); if (/y/i) { ; } else { die "Program terminated\n"; } } my @oligoseqs = &generate_all_oligos($oligomerlength); my %oligos = (); foreach (@oligoseqs) { $oligos{$_} = 0; } my $in = Bio::SeqIO->new( -file => $infile, -format => $format); my $seqnumber = 0; my $oligocounts = 0; my $exception; while (my $seq = $in->next_seq() ) { my $len = $seq->length(); my $position = 1; if ($position+$oligomerlength > $len) { $exception = 2; next; } $seq = uc $seq->seq; #string $exception = 1 if $seq =~ /[^GATC]/; while ($position + $oligomerlength-1 <= $len) { $oligos{substr $seq, $position-1, $oligomerlength}++; $position++; if ($position%250000 == 0) {print "$position\n";} } $oligocounts += $position-1; $seqnumber++; } my $OUTFILE; if ($outfile) { open $OUTFILE, '>', $outfile or die "Could not open file '$outfile': $!\n"; } else { open $OUTFILE, '>-'; # STDOUT } print $OUTFILE "$seqnumber sequences analyzed\n"; print $OUTFILE "$oligocounts total $oligomerlength-mers counted\n"; print $OUTFILE "$oligomerlength-mer\tNumber\tFrequency\n"; foreach my $key (sort keys %oligos) { print $OUTFILE "$key\t$oligos{$key}\t", $oligos{$key}/$oligocounts, "\n"; } close $OUTFILE; if ($exception) { if ($exception == 1) { print "Non-standard (non-GATC) bases were found in sequence\n"; } if ($exception == 2) { print "Oligomer length greater than sequence length\n"; } } #¬ify(); ############### # SUBROUTINES # ############### sub 
generate_all_oligos { my $oligolength = $_[0]; my $iter = 1; my @newarray = qw{A C G T}; my @bases = qw{A C G T}; while ($iter < $oligolength) { my @oldarray = @newarray; @newarray = (); foreach my $oligoseq (@oldarray) { foreach my $newbase (@bases) { push @newarray, $oligoseq . $newbase; } } $iter++; } return @newarray; } # if you wanted to be notified about status of running #my $EMAILADDRESS = undef; #die("Must change script to a valid email address for notification") # unless( defined $EMAILADDRESS ); #sub notify { # $address = $EMAILADDRESS; # $address = $_[0] if $_[0]; # open(SENDMAIL, "|/usr/lib/sendmail -oi -t") or die "Can't fork for sendmail: $!\n"; # print SENDMAIL <<"EOF"; #From: Computer #To: $address #Subject: Program Finished # #EOF # close(SENDMAIL) or warn "sendmail didn't close nicely"; #} __END__ =head1 NAME bp_oligo_count - oligo count and frequency =head1 SYNOPSIS Usage: bp_oligo_count [-h/--help] [-l/--length OLIGOLENGTH] [-f/--format SEQFORMAT] [-i/--in/-s/--sequence SEQFILE] [-o/--out OUTFILE] =head1 DESCRIPTION This script counts the occurrences and frequencies of all oligonucleotides of a given length. It can be used to determine what primers are useful for frequent priming of nucleic acid for random labeling. Note that the same analysis can be performed with the compseq program, which is part of EMBOSS. =head1 OPTIONS The default sequence format is fasta. If no outfile is given, the results will be printed to standard output. All other options can be entered interactively. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. 
Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Charles C. Kim Email cckim@stanford.edu =head1 HISTORY Written July 2, 2001 Submitted to bioperl scripts project 2001/08/06 E<gt>E<gt> 100 x speed optimization by Heikki Lehvaslaiho =cut bp_process_sgd100644000766000024 710714006427353 16714 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl # This script will convert from SGD format to GFF format # See http://db.yeastgenome.org/schema/Schema.html use strict; use warnings; # hard-coded length data that I couldn't get directly my %CHROMOSOMES = (I => 230_203, II => 813_139, III => 316_613, IV => 1_531_929, V => 576_869, VI => 270_148, VII => 1_090_937, VIII => 562_639, IX => 439_885, X => 745_444, XI => 666_445, XII => 1_078_173, XIII => 924_430, XIV => 784_328, XV => 1_091_284, XVI => 948_061, Mit => 85_779); my @ROMAN = qw(I II III IV V VI VII VIII IX X XI XII XIII XIV XV XVI Mit); if ($ARGV[0] =~ /^--?h/) { die <<USAGE This script massages the SGD sequence annotation flat files located at ftp://genome-ftp.stanford.edu/pub/yeast/data_dump/feature/chromosomal_features.tab into a version of the GFF format suitable for display by the generic genome browser. To use this script, get the SGD chromosomal_features.tab file from the FTP site listed above, and run the following command: % process_sgd.pl chromosomal_features.tab > yeast.gff The yeast.gff file can then be loaded into a Bio::DB::GFF database using the following command: % bulk_load_gff.pl -d yeast.gff USAGE ; } # first print out chromosomes # We hard coded the lengths because they are not available in the features table. for my $chrom (sort keys %CHROMOSOMES) { print join("\t",$chrom,'chromosome','Component',1,$CHROMOSOMES{$chrom},'.','.','.',qq(Sequence "$chrom")),"\n"; } # this is hard because the SGD idea of a feature doesn't really map onto the GFF idea. 
while (<>) { chomp; my($id,$gene,$aliases,$type,$chromosome,$start,$stop,$strand,$sgdid,$sgdid2,$description,$date) = split "\t"; my $ref = $ROMAN[$chromosome-1]; $description =~ s/"/\\"/g; $description =~ s/;/\\;/g; $strand = $strand eq 'W' ? '+' : '-'; ($start,$stop) = ($stop,$start) if $strand eq '-'; die "Strand logic is messed up" if $stop < $start; if ($gene) { my @aliases = split(/\|/,$aliases); my $aliases = join " ; ",map {qq(Alias "$_")} @aliases; my $group = qq(Gene "$gene" ; Note "$description"); $group .= " ; $aliases" if $aliases; print join("\t",$ref,'sgd','gene',$start,$stop,'.',$strand,'.',$group),"\n"; $description .= "\\; AKA @aliases" if @aliases; } print join("\t",$ref,'sgd',$type,$start,$stop,'.',$strand,'.',qq($type "$id" ; Note "$description")),"\n"; } __END__ =head1 NAME bp_process_sgd.pl - Massage SGD annotation flat files into a version suitable for the Generic Genome Browser =head1 SYNOPSIS % bp_process_sgd.pl chromosomal_features.tab > yeast.gff =head1 DESCRIPTION This script massages the SGD sequence annotation flat files located at ftp://genome-ftp.stanford.edu/pub/yeast/data_dump/feature/chromosomal_features.tab into a version of the GFF format suitable for display by the generic genome browser. To use this script, get the SGD chromosomal_features.tab file from the FTP site listed above, and run the following command: % bp_process_sgd.pl chromosomal_features.tab > yeast.gff The yeast.gff file can then be loaded into a Bio::DB::GFF database using the following command: % bulk_load_gff.pl -d yeast.gff =head1 SEE ALSO L, L, L =head1 AUTHOR Lincoln Stein, lstein@cshl.org Copyright (c) 2002 Cold Spring Harbor Laboratory This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See DISCLAIMER.txt for disclaimers of warranty. 
=cut Genewise.t100644000766000024 357714006427353 16532 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id: Genewise.t 11733 2007-10-26 18:22:10Z jason $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 33); use_ok('Bio::Tools::Genewise'); } my $inputfilename= test_input_file('genewise.out'); my $parser = Bio::Tools::Genewise->new(-file => $inputfilename); my @gene; while (my $gene= $parser->next_prediction){ push @gene, $gene; } my @t = $gene[0]->transcripts; my @e = $t[0]->exons; is ($t[0]->seq_id, 'Scaffold_2042.1'); is ($e[0]->seq_id, 'Scaffold_2042.1'); is ($t[0]->source_tag, 'genewise'); is ($e[0]->source_tag, 'genewise'); is ($t[0]->primary_tag, 'transcript'); is ($e[0]->primary_tag, 'exon'); is (scalar($t[0]->exons), 18); is ($t[0]->start, 22265); is ($t[0]->end, 37062); is ($e[0]->start,22265); is ($e[0]->end, 22396); my ($phase) = $e[0]->get_tag_values('phase'); is ($phase,0); my ($sf)= $e[0]->get_tag_values('supporting_feature'); is ($sf->feature1->seq_id,'Scaffold_2042.1'); is ($sf->feature1->start,22265); is ($sf->feature1->end,22396); is ($sf->feature2->seq_id,'SINFRUP00000067802'); is ($sf->feature2->start,1); is ($sf->feature2->end,44); is ($sf->feature1->end,22396); open my $FH, '<', $inputfilename or die "Could not read file '$inputfilename': $!\n"; $parser = Bio::Tools::Genewise->new(-fh => $FH); while (my $gene = $parser->next_prediction){ push @gene, $gene; } @t = $gene[0]->transcripts; @e = $t[0]->exons; is (scalar($t[0]->exons), 18); is ($t[0]->start, 22265); is ($t[0]->end, 37062); is ($e[0]->start,22265); is ($e[0]->end, 22396); ($phase) = $e[0]->get_tag_values('phase'); is ($phase,0); ($sf)= $e[0]->get_tag_values('supporting_feature'); is ($sf->feature1->seq_id,'Scaffold_2042.1'); is ($sf->feature1->start,22265); is ($sf->feature1->end,22396); is ($sf->feature2->seq_id,'SINFRUP00000067802'); is ($sf->feature2->start,1); is ($sf->feature2->end,44); is 
($sf->feature1->end,22396); bsml_sax.t100644000766000024 170314006427353 16441 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 15, -requires_modules => [qw(XML::SAX XML::SAX::Writer XML::SAX::Base)]); use_ok('Bio::SeqIO'); } my $verbose = test_debug(); my $str = Bio::SeqIO->new(-format => 'bsml_sax', -verbose => $verbose, -file => test_input_file('U83300.bsml')); my $seq = $str->next_seq; isa_ok($seq, 'Bio::Seq::RichSeqI'); my @refs = $seq->annotation->get_Annotations('reference'); is(@refs, 2); is($seq->display_id,'MIVN83300'); is($seq->molecule ,'dna'); ok(! $seq->is_circular); is($seq->get_dates,2); is($seq->accession_number, 'U83300'); is($seq->seq_version,1); my @feats = $seq->get_SeqFeatures; is(@feats, 2); is($feats[1]->start, 1); is($feats[1]->end, 946); is($feats[1]->get_tag_values('db_xref'), 3); is($seq->annotation->get_Annotations('reference'),2); is($seq->annotation->get_Annotations('dblink'),2); EncodedSeq.t100644000766000024 363514006427353 16421 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 37); use_ok('Bio::Seq::EncodedSeq'); } my ($str, $aln, $seq, $loc); ok $seq = Bio::Seq::EncodedSeq->new( -seq => '--atg---gta--', -start => 1, -end => 6, -strand => 1 ); is $seq->alphabet, 'dna'; is $seq->start, 1; is $seq->end, 6; is $seq->strand, 1; is $seq->num_gaps, 1; is $seq->column_from_residue_number(4), 9; # this should fail eval { $seq->column_from_residue_number(8); }; ok $@; ok $loc = $seq->location_from_column(4); isa_ok $loc, 'Bio::Location::Simple'; is $loc->to_FTstring, "2"; ok $loc = $seq->location_from_column(6); isa_ok $loc,'Bio::Location::Simple'; is $loc->start, 3; is $loc->location_type, 'IN-BETWEEN'; is $loc->to_FTstring, '3^4'; is $loc = $seq->location_from_column(2), undef; is $seq->encoding, 
"GGCCCGGGCCCGG"; is $seq->encoding(-explicit => 1), "GGCDEGGGCDEGG"; ok $seq = Bio::Seq::EncodedSeq->new( -seq => 'atcgta', -start => 10, -end => 15, -strand => -1, ); is $seq->encoding('CCGGG'), 'CCGGGCCCC'; is $seq->seq, 'atcg---ta'; is $seq->column_from_residue_number(14), 2; is $seq->encoding('3C2GCG'), 'CCCGGCGCC'; is $seq->seq, 'at-c--gta'; is $seq->num_gaps, 2; is $seq->location_from_column(2)->to_FTstring, 14; is $seq->location_from_column(5)->to_FTstring, "12^13"; is $seq->encoding("B", Bio::Location::Simple->new(-start => 10, -end => 11, -location_type => 'IN-BETWEEN')), 'B'; is $seq->seq, 'at-c--gt-a'; is $seq->encoding, 'CBCCGGCGCC'; is $seq->cds(-nogaps => 1)->seq, 'tacgat'; is $seq->translate->seq, 'YD'; ok $seq = $seq->trunc(4,10); # kinda testing LocatableSeq's new trunc() here as well. is $seq->seq, 'c--gt-a'; is $seq->encoding, 'CBCCGGC'; PrimarySeq.t100644000766000024 7575214006427353 16534 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; use Data::Dumper; BEGIN { use Bio::Root::Test; test_begin( -tests => 312 ); use_ok('Bio::PrimarySeq'); use_ok('Bio::Location::Simple'); use_ok('Bio::Location::Fuzzy'); use_ok('Bio::Location::Split'); } # Bare object ok my $seq = Bio::PrimarySeq->new(), 'Bare object'; isa_ok $seq, 'Bio::PrimarySeqI'; is $seq->id, undef; is $seq->seq, undef; is $seq->length, 0; is $seq->alphabet, undef; is $seq->is_circular, undef; # Empty sequence ok $seq = Bio::PrimarySeq->new( -seq => '', -nowarnonempty => 1); is $seq->seq, ''; is $seq->length, 0; is $seq->alphabet, undef; # Basic tests ok $seq = Bio::PrimarySeq->new( '-seq' => 'TTGGTGGCGTCAACT', '-display_id' => 'new-id', '-alphabet' => 'dna', '-accession_number' => 'X677667', '-desc' => 'Sample Bio::Seq object' ); ok defined $seq; is $seq->accession_number(), 'X677667'; is $seq->seq(), 'TTGGTGGCGTCAACT'; is $seq->display_id(), 'new-id'; is $seq->alphabet(), 'dna'; is $seq->is_circular(), undef; ok 
$seq->is_circular(1); is $seq->is_circular(0), 0; # check IdentifiableI and DescribableI interfaces isa_ok $seq, 'Bio::IdentifiableI'; isa_ok $seq, 'Bio::DescribableI'; # make sure all methods are implemented is $seq->authority("bioperl.org"), "bioperl.org"; is $seq->authority, "bioperl.org"; is $seq->namespace("t"), "t"; is $seq->namespace, "t"; is $seq->version(0), 0; is $seq->version, 0; is $seq->lsid_string(), "bioperl.org:t:X677667"; is $seq->namespace_string, "t:X677667.0"; is $seq->version(47), 47; is $seq->version, 47; is $seq->namespace_string, "t:X677667.47"; is $seq->description, 'Sample Bio::Seq object'; is $seq->display_name, "new-id"; # Test subseq is $seq->subseq(2, 5), 'TGGT'; is $seq->subseq( -start => 1, -end => 15), 'TTGGTGGCGTCAACT'; my $location = Bio::Location::Simple->new( '-start' => 2, '-end' => 5, '-strand' => -1 ); is $seq->subseq($location), 'ACCA'; my $splitlocation = Bio::Location::Split->new(); $splitlocation->add_sub_Location( Bio::Location::Simple->new( '-start' => 1, '-end' => 4, '-strand' => 1 ) ); $splitlocation->add_sub_Location( Bio::Location::Simple->new( '-start' => 7, '-end' => 12, '-strand' => -1 ) ); is $seq->subseq($splitlocation), 'TTGGTGACGC'; my $fuzzy = Bio::Location::Fuzzy->new( -start => '<3', -end => '8', -strand => 1 ); is $seq->subseq($fuzzy), 'GGTGGC'; { ok my $seq = Bio::PrimarySeq->new( -seq => 'TT-GTGGCGTCAACT' ); is $seq->subseq(2, 5, 'nogap'), 'TGT'; is $seq->subseq( -start => 2, -end => 5, -nogap => 1 ), 'TGT'; my $location = Bio::Location::Simple->new( '-start' => 2, '-end' => 5, '-strand' => 1 ); is $seq->subseq( $location, -nogap => 1), 'TGT'; is $seq->subseq(-start=>2, -end=>5, -replace_with=>'aa'), 'T-GT'; is $seq->seq, 'TaaGGCGTCAACT'; throws_ok { $seq->subseq(-start=>2, -end=>5, -replace_with=>'?!'); } qr/.+/; } { ok my $seq = Bio::PrimarySeq->new( -seq => 'AACCGGTT', -is_circular => 1 ); is $seq->subseq( -start => 7, -end => 10 ), 'TTAA'; } ### Test for Bug #2936 # Without strand input argument 
(case: user don't think is necessary) my $split_loc_obj1 = Bio::Location::Split->new(); $split_loc_obj1->add_sub_Location( Bio::Location::Simple->new( '-start' => 1, '-end' => 10 ) ); $split_loc_obj1->add_sub_Location( Bio::Location::Simple->new( '-start' => 20, '-end' => 30 ) ); # With strand input argument (case: user provides the argument) my $split_loc_obj2 = Bio::Location::Split->new(); $split_loc_obj2->add_sub_Location( Bio::Location::Simple->new( '-start' => 1, '-end' => 10, '-strand' => 1 ) ); $split_loc_obj2->add_sub_Location( Bio::Location::Simple->new( '-start' => 20, '-end' => 30, '-strand' => 1 ) ); is $split_loc_obj1->to_FTstring, "join(1..10,20..30)"; is $split_loc_obj2->to_FTstring, "join(1..10,20..30)"; $split_loc_obj1->flip_strand; $split_loc_obj2->flip_strand; is $split_loc_obj1->to_FTstring, "complement(join(1..10,20..30))"; is $split_loc_obj2->to_FTstring, "complement(join(1..10,20..30))"; ### # Test trunc my $trunc = $seq->trunc( 1, 4 ); isa_ok $trunc, 'Bio::PrimarySeqI'; is $trunc->seq(), 'TTGG' or diag( "Expecting TTGG. Got " . $trunc->seq() ); $trunc = $seq->trunc($splitlocation); isa_ok $trunc, 'Bio::PrimarySeqI' ; is $trunc->seq(), 'TTGGTGACGC'; $trunc = $seq->trunc($fuzzy); isa_ok $trunc, 'Bio::PrimarySeqI'; is $trunc->seq(), 'GGTGGC'; my $rev = $seq->revcom(); isa_ok $rev, 'Bio::PrimarySeqI'; is $rev->seq(), 'AGTTGACGCCACCAA' or diag( 'revcom() failed, was ' . $rev->seq() ); is $rev->display_id, 'new-id'; is $rev->display_name(), 'new-id'; is $rev->accession_number(), 'X677667'; is $rev->alphabet, 'dna'; is $rev->description, 'Sample Bio::Seq object'; is $rev->is_circular(), 0; is $rev->version, 47; is $rev->authority, 'bioperl.org'; is $rev->namespace, 't'; is $rev->namespace_string(), 't:X677667.47'; # # Translate # my $aa = $seq->translate(); # TTG GTG GCG TCA ACT is $aa->seq, 'LVAST', "Translation: " . 
$aa->seq; # tests for non-standard initiator codon coding for # M by making translate() look for an initiator codon and # terminator codon ("complete", the 5th argument below) $seq->seq('TTGGTGGCGTCAACTTAA'); # TTG GTG GCG TCA ACT TAA $aa = $seq->translate( undef, undef, undef, undef, 1 ); is $aa->seq, 'MVAST', "Translation: " . $aa->seq; # same test as previous, but using named parameter $aa = $seq->translate( -complete => 1 ); is $aa->seq, 'MVAST', "Translation: " . $aa->seq; # find ORF, ignore codons outside the ORF or CDS $seq->seq('TTTTATGGTGGCGTCAACTTAATTT'); # ATG GTG GCG TCA ACT $aa = $seq->translate( -orf => 1 ); is $aa->seq, 'MVAST*', "Translation: " . $aa->seq; # smallest possible ORF $seq->seq("ggggggatgtagcccc"); # atg tga $aa = $seq->translate( -orf => 1 ); is $aa->seq, 'M*', "Translation: " . $aa->seq; # same as previous but complete, so * is removed $aa = $seq->translate( -orf => 1, -complete => 1 ); is $aa->seq, 'M', "Translation: " . $aa->seq; # ORF without termination codon # should warn, let's change it into throw for testing $seq->verbose(2); $seq->seq("ggggggatgtggcccc"); # atg tgg ccc eval { $seq->translate( -orf => 1 ); }; like( $@, qr/\batgtggccc\b/i ); $seq->verbose(-1); $aa = $seq->translate( -orf => 1 ); is $aa->seq, 'MWP', "Translation: MWP"; $seq->verbose(0); # use non-standard codon table where terminator is read as Q $seq->seq('ATGGTGGCGTCAACTTAG'); # ATG GTG GCG TCA ACT TAG $aa = $seq->translate( -codontable_id => 6 ); is $aa->seq, 'MVASTQ' or diag( "Translation: " . $aa->seq ); # insert an odd character instead of terminating with * $aa = $seq->translate( -terminator => 'X' ); is $aa->seq, 'MVASTX' or diag( "Translation: " . $aa->seq ); # change frame from default $aa = $seq->translate( -frame => 1 ); # TGG TGG CGT CAA CTT AG is $aa->seq, 'WWRQL' or diag( "Translation: " . $aa->seq ); $aa = $seq->translate( -frame => 2 ); # GGT GGC GTC AAC TTA G is $aa->seq, 'GGVNL' or diag( "Translation: " . 
$aa->seq ); # TTG is initiator in Standard codon table? Afraid so. $seq->seq("ggggggttgtagcccc"); # ttg tag $aa = $seq->translate( -orf => 1 ); is $aa->seq, 'L*' or diag( "Translation: " . $aa->seq ); # Replace L at 1st position with M by setting complete to 1 $seq->seq("ggggggttgtagcccc"); # ttg tag $aa = $seq->translate( -orf => 1, -complete => 1 ); is $aa->seq, 'M' or diag( "Translation: " . $aa->seq ); # Ignore non-ATG initiators (e.g. TTG) in codon table $seq->seq("ggggggttgatgtagcccc"); # atg tag $aa = $seq->translate( -orf => 1, -start => "atg", -complete => 1 ); is $aa->seq, 'M' or diag( "Translation: " . $aa->seq ); # test for character '?' in the sequence string is $seq->seq('TTGGTGGCG?CAACT'), 'TTGGTGGCG?CAACT'; # issue #105 - when there are starts and stops in both frame 0 and # frame 1, frame 0 start < frame 1 start, then should return the frame # 0 ORF per the pod ('the first orf') even if frame 1 stop < frame 0 stop # Turn off warnings for a few tests my $verbosity = $seq->verbose(); $seq->verbose(-1); $seq->seq('ATGAATGTAAATAA'); $aa = $seq->translate( -orf => 1 ); my $aa0 = $seq->translate(-frame => 0); is $aa->seq, $aa0->seq, "frame 0 start, frame 1 stop < frame 0 stop"; $seq->seq('AAATGAATGTAAATAA'); $aa = $seq->translate( -orf => 1, -frame=>1 ); my $aa2 = $seq->translate(-frame => 2); is $aa->seq, $aa2->seq, "frame 1 start, frame 2 stop < frame 1 stop"; # Turn 'em back on! 
$seq->verbose($verbosity); # test for some aliases $seq = Bio::PrimarySeq->new( -id => 'aliasid', -description => 'Alias desc' ); is $seq->description, 'Alias desc'; is $seq->display_id, 'aliasid'; # Test alphabet ok $seq->seq('actgx'); is $seq->alphabet, 'protein', 'Alphabet'; ok $seq->seq('actge'); is $seq->alphabet, 'protein'; ok $seq->seq('actgf'); is $seq->alphabet, 'protein'; ok $seq->seq('actgi'); is $seq->alphabet, 'protein'; ok $seq->seq('actgj'); is $seq->alphabet, 'protein'; ok $seq->seq('actgl'); is $seq->alphabet, 'protein'; ok $seq->seq('actgo'); is $seq->alphabet, 'protein'; ok $seq->seq('actgp'); is $seq->alphabet, 'protein'; ok $seq->seq('actgq'); is $seq->alphabet, 'protein'; ok $seq->seq('actgz'); is $seq->alphabet, 'protein'; ok $seq->seq('actgn'); is $seq->alphabet, 'dna'; ok $seq->seq('acugn'); is $seq->alphabet, 'rna'; ok $seq->seq('bdhkm'); is $seq->alphabet, 'protein'; ok $seq->seq('rsvwx'); is $seq->alphabet, 'protein'; ok $seq->seq('AAACTYAAAAGAATTGRCGG'); # valid degenerate DNA PCR primer sequence (90% ACGTN) is $seq->alphabet, 'dna'; ok $seq->seq('AAACTYAAAKGAATTGRCGG'); # another primer previously detected as protein (85% ACGTN) is $seq->alphabet, 'dna'; ok $seq->seq('YWACTYAAAKGARTTGRCGG'); # 70% ACGTNWSRM. 
Everything <= 70% is considered a protein is $seq->alphabet, 'dna'; ok $seq->seq('XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX'); # Bug 2438 is $seq->alphabet, 'protein', 'Bug 2438'; ok $seq->seq('CAGTCXXXXXXXXXXXXXXXXXXXXXXXXXXXCAGCG'); is $seq->alphabet, 'protein'; ok $seq->seq('WTGGGGCTATGAAAAAAAAAWTTKMGMMAAAAAWTTWTKRWMRATC'); # showed up on MAKER list is $seq->alphabet, 'dna'; ok $seq->seq('actgn', 'protein'); # accept specified alphabet, no matter what is $seq->alphabet, 'protein'; ok $seq->seq('bdhkm', 'dna'); is $seq->alphabet, 'dna'; # Bug #2864: $seq = Bio::PrimarySeq->new( -display_id => 0, -seq => 'GATC' ); is $seq->display_id, 0, "Bug #2864"; # Test that the check for terminators inside the translated protein # works when the terminator isn't '*': $seq = Bio::PrimarySeq->new(-seq=>'ATGCTCTAAGCAGGGTAA'); # ML*AG* eval { $aa = $seq->translate(-complete=>1, -throw=>1, -terminator=>'#') }; my $error = $@; ok $error =~ /\QTerminator codon inside CDS!\E/, 'Terminator + inside sequence'; $seq = Bio::PrimarySeq->new(-seq=>'ATGCTCGCAGGGTAA'); # MLAG* $aa = $seq->translate(-complete=>1, -throw=>1, -terminator=>'#'); is $aa->seq, 'MLAG'; # Test length method ok $seq = Bio::PrimarySeq->new(), 'Length method'; is $seq->length, 0; ok $seq->length(123); is $seq->length, 123; ok $seq = Bio::PrimarySeq->new( -seq => 'ATGCTCTAAGCAGGGTAA' ); is $seq->length, 18; ok $seq->seq('ATGCTCTAAG'); is $seq->length, 10; is $seq->seq(undef), undef; is $seq->length, 0; ok $seq = Bio::PrimarySeq->new( -length => 123 ); is $seq->length, 123; ok $seq = Bio::PrimarySeq->new( -seq => 'ATGCTCTAAGCAGGGTAA' ); is $seq->length, 18; ok $seq->length( $seq->length ); # save memory by removing seq is $seq->seq( undef ), undef; # ... 
but keeping a record of length is $seq->length, 18; is $seq->seq, undef; ok $seq->seq('ACGT'); is $seq->length, 4; # manually-specified length changed when sequence is changed throws_ok { $seq->length(666); } qr/.+/; # Cannot lie about length # Sequence validation method is $seq->validate_seq( undef ), 1; is $seq->validate_seq( '' ), 1; is $seq->validate_seq( 'acgt' ), 1; is $seq->validate_seq( 'ACGT' ), 1; is $seq->validate_seq( 'XFRH' ), 1; is $seq->validate_seq( '-~' ), 1; # gap symbols is $seq->validate_seq( '-.*?=~' ), 1; # other valid symbols is $seq->validate_seq( '0' ), 0; is $seq->validate_seq( ' ' ), 0; is $seq->validate_seq( 'AAAA$' ), 0; is $seq->validate_seq( 'tt&t!' ), 0; throws_ok { $seq->validate_seq('tt&t!', 1); } qr/.+/; # Test direct option (no sequence validation) throws_ok { $seq = Bio::PrimarySeq->new(-seq => 'A\T$AGQ+T'); } qr/.+/, 'Validation'; ok $seq = Bio::PrimarySeq->new( -seq => 'A\T$AGQ+T', -direct => 1 ); is $seq->seq, 'A\T$AGQ+T'; throws_ok { $seq->seq('NT@/') } qr/.+/; # Set a sequence by reference my $string = 'AAAACCCCGGGGTTTT'; ok $seq = Bio::PrimarySeq->new( -ref_to_seq => \$string ); is $seq->seq, 'AAAACCCCGGGGTTTT'; # Test internal PrimarySeqI _find_orfs function and translate( -orf => 'longest' ) { my @tests = ( #tiny test ['TTTTATGGTGGCGTCAACTTAATTT', [[4,22,18,1]], ], #bigger test (this is a tomato unigene) ['GAAGGCTGGTTCTGAGTTGGATCTATGTTTGATGAAGGGAAGTAGACCGGAGGTCTTGCATCAGCAATATTAGTACCAAATCCAGGTGGAGGCGCATCCTGTCTCCGTTGCATTTCAACTTTCATTTCAGCAATCTGTTGCATCAGTTGCATGATCAATTCATTCTGTTCCACTACAGTGGGCTGAGCGACCACAACGTCAGTAAGACGCCCTTCGTCATTGTTGTCTCCCATAACTGTTTTTCCTTTATCTGAATTTGATCGAGGGAAGGAATCTGTAGGACCTTTCGATCTGGTGAAGTAAGGATGATCTGCCAGCTTTATTGACACAGATCAGTAAAAAGGTACCTGAAAGGTAAAAACAACTCAAAGGCAAATTTGTTAGTGCATATCCAGAGTACAAAATGCTTAATATCGCACATAAAACCGATAAACACACAAGTCGTTTTGTTTGAGGATATCTTAACCCACGAATAAGGACGGATATATATTTTGAACAAACAGGAATTTGTTTGTTTGGCGTTATCTTGGGAAATCTG', 
[[98,254,156,2],[347,476,129,2],[219,303,84,0],[16,73,57,1],[403,454,51,1],[310,358,48,1],[235,280,45,1],[491,536,45,2],[150,186,36,0],[507,537,30,0],[5,32,27,2],[511,538,27,1],[24,45,21,0],[305,326,21,2],[450,465,15,0]], ], ); foreach my $test (@tests) { my ($test_seq, $orfs) = @$test; my @orfs = Bio::PrimarySeqI::_find_orfs_nucleotide( undef, $test_seq, Bio::Tools::CodonTable->new, undef, ); # ATG GTG GCG TCA ACT is_deeply( \@orfs, $orfs, '_find_orfs 1') or diag "for $test_seq, _find_orfs returned:\n" .Dumper([map [@$_], @orfs]); is_deeply( $orfs->[0], (sort {$b->[2] <=> $a->[2]} @$orfs)[0], 'orfs are sorted by descending length' ); # make sure we get the same sequence by taking the longest orf # nucleotide from the test data and translating it, as by # calling translate with -orf => 'longest' is( Bio::PrimarySeq ->new( -seq => $test_seq, -id => 'fake_id' ) ->translate( -orf => 'longest' ) ->seq, Bio::PrimarySeq ->new( -seq => substr( $test_seq, $orfs->[0][0], $orfs->[0][2] ), -id => 'foo' ) ->translate ->seq, 'got correct -orf => "longest" seq', ); } } ##### # Extensive location and subsequence tests ok $seq = Bio::PrimarySeq->new('-seq' => 'AAAAACCCCCGGGGGTTTTT',); ok $seq->is_circular(1); # NOTE: "_no_strand" variables tests the possibility that the user didn't set # Strand for positive coordinates (or the object comes from # Bio::Factory::FTLocationFactory->from_string) # Single location # Coordinates: 1..5 => AAAAA # Revcom: complement(1..5) => TTTTT ok my $loc1_strand = Bio::Location::Simple->new('-start' => 1, '-end' => 5,'-strand' => 1); ok my $loc1_no_strand = Bio::Location::Simple->new('-start' => 1, '-end' => 5); is $seq->subseq($loc1_strand), 'AAAAA'; is $seq->subseq($loc1_no_strand), 'AAAAA'; is $loc1_strand->to_FTstring, '1..5'; is $loc1_no_strand->to_FTstring, '1..5'; $loc1_strand->flip_strand; $loc1_no_strand->flip_strand; is $seq->subseq($loc1_strand), 'TTTTT'; is $seq->subseq($loc1_no_strand), 'TTTTT'; is $loc1_strand->to_FTstring, 
'complement(1..5)'; is $loc1_no_strand->to_FTstring, 'complement(1..5)'; is $loc1_strand->length, 5; is $loc1_no_strand->length, 5; # Basic split, both locations in positive strand # Coords: join(6..10,16..20) => CCCCCTTTTT # Revcom: complement(join(6..10,16..20)) => AAAAAGGGGG ok my $loc2_strand = Bio::Location::Split->new(); ok my $loc2_no_strand = Bio::Location::Split->new(); ok $loc2_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => 1) ); ok $loc2_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20, '-strand' => 1) ); ok $loc2_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10) ); ok $loc2_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20) ); is $seq->subseq($loc2_strand), 'CCCCCTTTTT'; is $seq->subseq($loc2_no_strand), 'CCCCCTTTTT'; is $loc2_strand->to_FTstring, 'join(6..10,16..20)'; is $loc2_no_strand->to_FTstring, 'join(6..10,16..20)'; $loc2_strand->flip_strand; $loc2_no_strand->flip_strand; is $seq->subseq($loc2_strand), 'AAAAAGGGGG'; is $seq->subseq($loc2_no_strand), 'AAAAAGGGGG'; is $loc2_strand->to_FTstring, 'complement(join(6..10,16..20))'; is $loc2_no_strand->to_FTstring, 'complement(join(6..10,16..20))'; is $loc2_strand->length, 15; is $loc2_no_strand->length, 15; # Basic split, both locations in negative strand # Coords: complement(join(6..10,16..20)) => AAAAAGGGGG # Revcom: join(6..10,16..20) => CCCCCTTTTT my $loc3_strand = Bio::Location::Split->new(); $loc3_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => -1) ); $loc3_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20, '-strand' => -1) ); is $seq->subseq($loc3_strand), 'AAAAAGGGGG'; is $loc3_strand->to_FTstring, 'complement(join(6..10,16..20))'; $loc3_strand->flip_strand; is $seq->subseq($loc3_strand), 'CCCCCTTTTT'; is $loc3_strand->to_FTstring, 'join(6..10,16..20)'; is 
$loc3_strand->length, 15; ## Cut by origin-split, same strand, single sequence that passes through origin #Coords: join(16..20,1..2) => TTTTTAA #Revcom: complement(join(16..20,1..2)) => TTAAAAA my $loc4_strand = Bio::Location::Split->new(); my $loc4_no_strand = Bio::Location::Split->new(); $loc4_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20, '-strand' => 1) ); $loc4_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 2, '-strand' => 1) ); $loc4_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20) ); $loc4_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 2) ); is $seq->subseq($loc4_strand), 'TTTTTAA'; is $seq->subseq($loc4_no_strand), 'TTTTTAA'; is $loc4_strand->to_FTstring, 'join(16..20,1..2)'; is $loc4_no_strand->to_FTstring, 'join(16..20,1..2)'; $loc4_strand->flip_strand; $loc4_no_strand->flip_strand; is $seq->subseq($loc4_strand), 'TTAAAAA'; is $seq->subseq($loc4_no_strand), 'TTAAAAA'; is $loc4_strand->to_FTstring, 'complement(join(16..20,1..2))'; is $loc4_no_strand->to_FTstring, 'complement(join(16..20,1..2))'; is $loc4_strand->length, 7; is $loc4_no_strand->length, 7; ## Cut by origin-combo split, same strand, 2 sequences with 1st passing through origin #Coords: join(19..20,1..2,11..13) => TTAAGGG #Revcom: complement(join(19..20,1..2,11..13)) => CCCTTAA my $loc5_strand = Bio::Location::Split->new(); my $loc5_no_strand = Bio::Location::Split->new(); $loc5_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20, '-strand' => 1) ); $loc5_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 2, '-strand' => 1) ); $loc5_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 11, '-end' => 13, '-strand' => 1) ); $loc5_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20) ); $loc5_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, 
'-end' => 2) ); $loc5_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 11, '-end' => 13) ); is $seq->subseq($loc5_strand), 'TTAAGGG'; is $seq->subseq($loc5_no_strand), 'TTAAGGG'; is $loc5_strand->to_FTstring, 'join(19..20,1..2,11..13)'; is $loc5_no_strand->to_FTstring, 'join(19..20,1..2,11..13)'; $loc5_strand->flip_strand; $loc5_no_strand->flip_strand; is $seq->subseq($loc5_strand), 'CCCTTAA'; is $seq->subseq($loc5_no_strand), 'CCCTTAA'; is $loc5_strand->to_FTstring, 'complement(join(19..20,1..2,11..13))'; is $loc5_no_strand->to_FTstring, 'complement(join(19..20,1..2,11..13))'; is $loc5_strand->length, 15; is $loc5_no_strand->length, 15; ## Cut by origin-combo split, same strand, 2 sequences with 2nd passing through origin #Coords: join(6..10,19..20,1..4) => CCCCCTTAAAA #Revcom: complement(join(6..10,19..20,1..4)) => TTTTAAGGGGG my $loc6_strand = Bio::Location::Split->new(); my $loc6_no_strand = Bio::Location::Split->new(); $loc6_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => 1) ); $loc6_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20, '-strand' => 1) ); $loc6_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 4, '-strand' => 1) ); $loc6_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10) ); $loc6_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20) ); $loc6_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 4) ); is $seq->subseq($loc6_strand), 'CCCCCTTAAAA'; is $seq->subseq($loc6_no_strand), 'CCCCCTTAAAA'; is $loc6_strand->to_FTstring, 'join(6..10,19..20,1..4)'; is $loc6_no_strand->to_FTstring, 'join(6..10,19..20,1..4)'; $loc6_strand->flip_strand; $loc6_no_strand->flip_strand; is $seq->subseq($loc6_strand), 'TTTTAAGGGGG'; is $seq->subseq($loc6_no_strand), 'TTTTAAGGGGG'; is $loc6_strand->to_FTstring, 'complement(join(6..10,19..20,1..4))'; is 
$loc6_no_strand->to_FTstring, 'complement(join(6..10,19..20,1..4))'; is $loc6_strand->length, 19; is $loc6_no_strand->length, 19; ## Trans-splicing, 2 sequences in different strands, 2nd in complement #Coords: join(6..10,complement(16..20)) => CCCCCAAAAA #Revcom: join(16..20,complement(6..10)) => TTTTTGGGGG my $loc7_strand = Bio::Location::Split->new(); my $loc7_no_strand = Bio::Location::Split->new(); $loc7_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => 1) ); $loc7_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20, '-strand' => -1) ); $loc7_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10) ); $loc7_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20, '-strand' => -1) ); is $seq->subseq($loc7_strand), 'CCCCCAAAAA'; is $seq->subseq($loc7_no_strand), 'CCCCCAAAAA'; is $loc7_strand->to_FTstring, 'join(6..10,complement(16..20))'; is $loc7_no_strand->to_FTstring, 'join(6..10,complement(16..20))'; $loc7_strand->flip_strand; $loc7_no_strand->flip_strand; is $seq->subseq($loc7_strand), 'TTTTTGGGGG'; is $seq->subseq($loc7_no_strand), 'TTTTTGGGGG'; is $loc7_strand->to_FTstring, 'join(16..20,complement(6..10))'; is $loc7_no_strand->to_FTstring, 'join(16..20,complement(6..10))'; is $loc7_strand->length, 10; is $loc7_no_strand->length, 10; ## Trans-splicing, 2 sequences in different strands, 1st in complement #Coords: join(complement(16..20),6..10) => AAAAACCCCC #Revcom: join(complement(6..10),16..20) => GGGGGTTTTT my $loc8_strand = Bio::Location::Split->new(); my $loc8_no_strand = Bio::Location::Split->new(); $loc8_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20, '-strand' => -1) ); $loc8_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => 1) ); $loc8_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 16, '-end' => 20, '-strand' => -1) ); 
$loc8_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10) ); is $seq->subseq($loc8_strand), 'AAAAACCCCC'; is $seq->subseq($loc8_no_strand), 'AAAAACCCCC'; is $loc8_strand->to_FTstring, 'join(complement(16..20),6..10)'; is $loc8_no_strand->to_FTstring, 'join(complement(16..20),6..10)'; $loc8_strand->flip_strand; $loc8_no_strand->flip_strand; is $seq->subseq($loc8_strand), 'GGGGGTTTTT'; is $seq->subseq($loc8_no_strand), 'GGGGGTTTTT'; is $loc8_strand->to_FTstring, 'join(complement(6..10),16..20)'; is $loc8_no_strand->to_FTstring, 'join(complement(6..10),16..20)'; is $loc8_strand->length, 10; is $loc8_no_strand->length, 10; ## Trans-splicing w/cut by origin, 2 sequences with 1st passing through origin, 2nd in complement #Coords: join(19..20,1..3,complement(11..13)) => TTAAACCC #Revcom: join(11..13,complement(1..3),complement(19..20)) => GGGTTTAA my $loc9_strand = Bio::Location::Split->new(); my $loc9_no_strand = Bio::Location::Split->new(); $loc9_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20, '-strand' => 1) ); $loc9_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 3, '-strand' => 1) ); $loc9_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 11, '-end' => 13, '-strand' => -1) ); $loc9_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20) ); $loc9_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 3) ); $loc9_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 11, '-end' => 13, '-strand' => -1) ); is $seq->subseq($loc9_strand), 'TTAAACCC'; is $seq->subseq($loc9_no_strand), 'TTAAACCC'; is $loc9_strand->to_FTstring, 'join(19..20,1..3,complement(11..13))'; is $loc9_no_strand->to_FTstring, 'join(19..20,1..3,complement(11..13))'; $loc9_strand->flip_strand; $loc9_no_strand->flip_strand; is $seq->subseq($loc9_strand), 'GGGTTTAA'; is $seq->subseq($loc9_no_strand), 'GGGTTTAA'; is 
$loc9_strand->to_FTstring, 'join(11..13,complement(1..3),complement(19..20))'; is $loc9_no_strand->to_FTstring, 'join(11..13,complement(1..3),complement(19..20))'; is $loc9_strand->length, 8; is $loc9_no_strand->length, 8; ## Trans-splicing w/cut by origin, 2 sequences with 1st passing through origin, 1st in complement #Coords: join(complement(1..3),complement(19..20),11..13) => TTTAAGGG #Revcom: join(complement(11..13),19..20,1..3) => CCCTTAAA my $loc10_strand = Bio::Location::Split->new(); my $loc10_no_strand = Bio::Location::Split->new(); $loc10_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 3, '-strand' => -1) ); $loc10_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20, '-strand' => -1) ); $loc10_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 11, '-end' => 13, '-strand' => 1) ); $loc10_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 3, '-strand' => -1) ); $loc10_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 19, '-end' => 20, '-strand' => -1) ); $loc10_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 11, '-end' => 13) ); is $seq->subseq($loc10_strand), 'TTTAAGGG'; is $seq->subseq($loc10_no_strand), 'TTTAAGGG'; is $loc10_strand->to_FTstring, 'join(complement(1..3),complement(19..20),11..13)'; is $loc10_no_strand->to_FTstring, 'join(complement(1..3),complement(19..20),11..13)'; $loc10_strand->flip_strand; $loc10_no_strand->flip_strand; is $seq->subseq($loc10_strand), 'CCCTTAAA'; is $seq->subseq($loc10_no_strand), 'CCCTTAAA'; is $loc10_strand->to_FTstring, 'join(complement(11..13),19..20,1..3)'; is $loc10_no_strand->to_FTstring, 'join(complement(11..13),19..20,1..3)'; is $loc10_strand->length, 8; is $loc10_no_strand->length, 8; ## Trans-splicing w/cut by origin, 2 sequences with 2nd passing through origin, 2nd in complement #Coords: join(6..10,complement(1..2),complement(18..20)) => CCCCCTTAAA #Revcom: 
join(18..20,1..2,complement(6..10)) => TTTAAGGGGG my $loc11_strand = Bio::Location::Split->new(); my $loc11_no_strand = Bio::Location::Split->new(); $loc11_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => 1) ); $loc11_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 2, '-strand' => -1) ); $loc11_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 18, '-end' => 20, '-strand' => -1) ); $loc11_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10) ); $loc11_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 2, '-strand' => -1) ); $loc11_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 18, '-end' => 20, '-strand' => -1) ); is $seq->subseq($loc11_strand), 'CCCCCTTAAA'; is $seq->subseq($loc11_no_strand), 'CCCCCTTAAA'; is $loc11_strand->to_FTstring, 'join(6..10,complement(1..2),complement(18..20))'; is $loc11_no_strand->to_FTstring, 'join(6..10,complement(1..2),complement(18..20))'; $loc11_strand->flip_strand; $loc11_no_strand->flip_strand; is $seq->subseq($loc11_strand), 'TTTAAGGGGG'; is $seq->subseq($loc11_no_strand), 'TTTAAGGGGG'; is $loc11_strand->to_FTstring, 'join(18..20,1..2,complement(6..10))'; is $loc11_no_strand->to_FTstring, 'join(18..20,1..2,complement(6..10))'; is $loc11_strand->length, 10; is $loc11_no_strand->length, 10; ## Trans-splicing w/cut by origin, 2 sequences with 2nd passing through origin, 1st in complement #Coords: join(complement(6..10),18..20,1..2) => GGGGGTTTAA #Revcom: join(complement(1..2),complement(18..20),6..10) => TTAAACCCCC my $loc12_strand = Bio::Location::Split->new(); my $loc12_no_strand = Bio::Location::Split->new(); $loc12_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => -1) ); $loc12_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 18, '-end' => 20, '-strand' => 1) ); $loc12_strand->add_sub_Location( 
Bio::Location::Simple->new('-start' => 1, '-end' => 2, '-strand' => 1) ); $loc12_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 6, '-end' => 10, '-strand' => -1) ); $loc12_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 18, '-end' => 20) ); $loc12_no_strand->add_sub_Location( Bio::Location::Simple->new('-start' => 1, '-end' => 2) ); is $seq->subseq($loc12_strand), 'GGGGGTTTAA'; is $seq->subseq($loc12_no_strand), 'GGGGGTTTAA'; is $loc12_strand->to_FTstring, 'join(complement(6..10),18..20,1..2)'; is $loc12_no_strand->to_FTstring, 'join(complement(6..10),18..20,1..2)'; $loc12_strand->flip_strand; $loc12_no_strand->flip_strand; is $seq->subseq($loc12_strand), 'TTAAACCCCC'; is $seq->subseq($loc12_no_strand), 'TTAAACCCCC'; is $loc12_strand->to_FTstring, 'join(complement(1..2),complement(18..20),6..10)'; is $loc12_no_strand->to_FTstring, 'join(complement(1..2),complement(18..20),6..10)'; is $loc12_strand->length, 10; is $loc12_no_strand->length, 10;
# File: BioPerl-1.7.8/t/Ontology/IO/go.t
# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 102, -requires_module => 'Graph'); use_ok('Bio::OntologyIO'); } my $parser = Bio::OntologyIO->new( -format => "go", -defs_file => test_input_file('GO.defs.test'), # test using -file -file => test_input_file('component.ontology.test')); my $IS_A = Bio::Ontology::RelationshipType->get_instance( "IS_A" ); my $PART_OF = Bio::Ontology::RelationshipType->get_instance( "PART_OF" ); my @onts = (); while(my $ont = $parser->next_ontology()) { push(@onts, $ont); } is (scalar(@onts), 1); my $ont = $onts[0]; isa_ok $ont, "Bio::Ontology::OntologyI"; is ($ont->name(), "Gene Ontology"); my $engine = $ont->engine(); isa_ok $engine, "Bio::Ontology::OntologyEngineI"; my $term = 
$engine->get_terms( "GO:0018897" ); # note that all dblinks are now Bio::Annotation::DBLink instances and that all # *dblink* methods related to Bio::Ontology::Term are deprecated; this is due to # inconsistencies in those Bio::Ontology::Term methods. Use *dbxref* methods # instead my @dblinks = sort {$a->display_text cmp $b->display_text} ( $term->get_dbxrefs() ); my @synos = sort ( $term->get_synonyms() ); is( $dblinks[ 0 ]->display_text, "MetaCyc:PWY-681" ); is( $dblinks[ 1 ]->display_text, "UM-BBD_pathwayID:dbt" ); is( $synos[ 0 ], "murein sacculus" ); is( $synos[ 1 ], "peptidoglycan" ); is( $term->ontology()->name(), "Gene Ontology" ); is( $term->name(), "dibenzothiophene desulfurization" ); $term = $engine->get_terms( "GO:0004796" ); @dblinks = sort ( $term->get_dbxrefs() ); @synos = sort ( $term->get_synonyms() ); my @sec = sort ( $term->get_secondary_GO_ids() ); is( $dblinks[ 0 ]->display_text, "EC:5.3.99.5" ); is( $synos[ 0 ], "cytochrome P450 CYP5" ); is( $sec[ 0 ], "GO:0008400" ); is( $term->ontology()->name(), "Gene Ontology" ); is( $term->name(), "thromboxane-A synthase" ); my @parents = sort goid ( $ont->get_parent_terms( $term ) ); is( @parents, 2 ); is( $parents[ 0 ]->GO_id(), "GO:0015034" ); is( $parents[ 1 ]->GO_id(), "GO:0018895" ); @parents = sort goid ( $ont->get_parent_terms( $term, $PART_OF, $IS_A) ); is( @parents, 2 ); is( $parents[ 0 ]->GO_id(), "GO:0015034" ); is( $parents[ 1 ]->GO_id(), "GO:0018895" ); @parents = sort goid ( $ont->get_parent_terms( "GO:0004796", $IS_A ) ); is( @parents, 2 ); is( $parents[ 0 ]->GO_id(), "GO:0015034" ); is( $parents[ 1 ]->GO_id(), "GO:0018895" ); @parents = sort goid ( $ont->get_parent_terms( "GO:0004796", $PART_OF ) ); is( scalar(@parents), 0 ); my @anc = sort goid ( $ont->get_ancestor_terms( $term ) ); is( scalar(@anc), 3 ); is( $anc[ 0 ]->GO_id(), "GO:0003673" ); is( $anc[ 1 ]->GO_id(), "GO:0015034" ); is( $anc[ 2 ]->GO_id(), "GO:0018895" ); @anc = sort goid ( $ont->get_ancestor_terms( "GO:0004796", 
$IS_A ) ); is( scalar(@anc), 3 ); is( $anc[ 0 ]->GO_id(), "GO:0003673" ); is( $anc[ 1 ]->GO_id(), "GO:0015034" ); is( $anc[ 2 ]->GO_id(), "GO:0018895" ); @anc = sort goid ( $ont->get_ancestor_terms( "GO:0000666" ) ); is( @anc, 12 ); @anc = sort goid ( $ont->get_ancestor_terms( "GO:0000666", $IS_A ) ); is( @anc, 2 ); is( $anc[ 0 ]->GO_id(), "GO:0005811" ); is( $anc[ 1 ]->GO_id(), "GO:0030481" ); @anc = sort goid ( $ont->get_ancestor_terms( "GO:0000666", $PART_OF ) ); is( @anc, 6 ); is( $anc[ 0 ]->GO_id(), "GO:0005623" ); is( $anc[ 1 ]->GO_id(), "GO:0005625" ); is( $anc[ 2 ]->GO_id(), "GO:0005933" ); is( $anc[ 3 ]->GO_id(), "GO:0005935" ); is( $anc[ 4 ]->GO_id(), "GO:0005937" ); is( $anc[ 5 ]->GO_id(), "GO:0005938" ); my @childs = sort goid ( $ont->get_child_terms( "GO:0005625", $PART_OF ) ); is( @childs, 2 ); is( $childs[ 0 ]->GO_id(), "GO:0000666" ); is( $childs[ 0 ]->name(), "polarisomeX" ); is( $childs[ 1 ]->GO_id(), "GO:0000667" ); is( $childs[ 1 ]->name(), "polarisomeY" ); is( $childs[ 1 ]->ontology()->name(), "Gene Ontology" ); is( $engine->get_terms( "GO:0005625" )->name(), "soluble fraction" ); @childs = sort goid ( $ont->get_descendant_terms( "GO:0005624", $IS_A ) ); is( @childs, 6 ); is( $childs[ 0 ]->GO_id(), "GO:0000299" ); is( $childs[ 0 ]->name(), "integral membrane protein of membrane fraction" ); is( $childs[ 1 ]->GO_id(), "GO:0000300" ); is( $childs[ 1 ]->name(), "peripheral membrane protein of membrane fraction" ); is( $childs[ 2 ]->GO_id(), "GO:0005792" ); is( $childs[ 2 ]->name(), "microsome" ); is( $childs[ 3 ]->GO_id(), "GO:0019717" ); is( $childs[ 3 ]->name(), "synaptosome" ); is( $childs[ 4 ]->GO_id(), "GO:0019718" ); is( $childs[ 4 ]->name(), "rough microsome" ); is( $childs[ 5 ]->GO_id(), "GO:0019719" ); is( $childs[ 5 ]->name(), "smooth microsome" ); @childs = sort goid ( $ont->get_descendant_terms( "GO:0005625", $IS_A ) ); is( @childs, 0 ); @childs = sort goid ( $ont->get_descendant_terms( "GO:0005625", $PART_OF ) ); is( @childs, 2 ); my 
@rels = sort child_goid ( $ont->get_relationships( "GO:0005625" ) ); is( @rels, 3 ); is( $rels[ 0 ]->object_term()->GO_id(), "GO:0005625" ); is( $rels[ 0 ]->subject_term()->GO_id(), "GO:0000666" ); ok( $rels[ 0 ]->predicate_term()->equals( $PART_OF ) ); is( $rels[ 1 ]->object_term()->GO_id(), "GO:0005625" ); is( $rels[ 1 ]->subject_term()->GO_id(), "GO:0000667" ); ok( $rels[ 1 ]->predicate_term()->equals( $PART_OF ) ); is( $rels[ 2 ]->object_term()->GO_id(), "GO:0000267" ); is( $rels[ 2 ]->subject_term()->GO_id(), "GO:0005625" ); ok( $rels[ 2 ]->predicate_term()->equals( $IS_A ) ); # dbxrefs and synonyms are candidates for being falsely picked up by # overly promiscuous regular expressions as related terms, so we test for # that here my @terms = $engine->get_terms( "EC:5.3.99.5" ); is (scalar(@terms), 0); @terms = $engine->get_terms("MetaCyc:PWY-681","MetaCyc:PWY"); is (scalar(@terms), 0); @terms = $engine->get_terms("UM-BBD_pathwayID:dbt","BBD_pathwayID:dbt", "UM-BBD_pathwayID:dbt2","BBD_pathwayID:dbt2"); is (scalar(@terms), 0); ok( $engine->graph() ); ok( $ont->add_term( Bio::Ontology::GOterm->new(-identifier => "GO:0000000"))); ok( $engine->has_term( "GO:0000300" ) ); is( scalar $ont->get_all_terms(), 44 ); is( scalar $ont->get_relationship_types(), 3 ); ok( ! 
$ont->add_relationship( $rels[ 2 ] ) ); # this edge already exists, cannot add $rels[ 2 ]->subject_term()->GO_id( "GO:0005938" ); ok( $ont->add_relationship( $rels[ 2 ] ) ); # now it's changed, can add my @roots = $ont->get_root_terms(); is( scalar(@roots), 10 ); my @leafs = $ont->get_leaf_terms(); is( scalar(@leafs), 19 ); $parser = Bio::OntologyIO->new( -format => "go", -defs_file => test_input_file('GO.defs.test2'), # test using -files -files => test_input_file('component.ontology.test2')); $ont = $parser->next_ontology(); ok ($ont); @roots = $ont->get_root_terms(); is( scalar(@roots), 1 ); @leafs = $ont->get_leaf_terms(); is( scalar(@leafs), 4 ); $parser = Bio::OntologyIO->new( -format => "go", -file => test_input_file('mpath.ontology.test')); ok($parser); $ont = $parser->next_ontology; ok($ont); $engine = $ont->engine; ok($engine); $term = $engine->get_terms( "MPATH:30" ); is($term->identifier,"MPATH:30"); is($term->name,"cystic medial necrosis"); is($term->definition,undef); is((sort $term->get_synonyms)[0],"erdheim disease"); is($ont->get_parent_terms( $term )->name,"tissue specific degenerative process"); is(scalar($ont->get_root_terms()),2); @anc = $ont->get_ancestor_terms($term); is(scalar(@anc),4); ################################################################# # helper functions ################################################################# sub goid { snum ( $a->GO_id() ) <=> snum ( $b->GO_id() ) } sub child_goid { snum ( $a->child_term()->GO_id() ) <=> snum ( $b->child_term()->GO_id() ) } sub snum { my $x = shift( @_ ); $x =~ s/\D+//g; return $x; }
# File: BioPerl-1.7.8/t/AlignIO/bl2seq.t
# -*-Perl-*- Test Harness script for Bioperl # $Id: bl2seq.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 7); use_ok('Bio::AlignIO::bl2seq'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # BL2SEQ $str = Bio::AlignIO->new(-file => 
test_input_file("bl2seq.out"), -format => 'bl2seq', -report_type => 'blastp'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(2)->get_nse, 'ALEU_HORVU/60-360', "BLAST bl2seq format test"; # Bug 2978, test report_type guessing for TBLASTN and correct Frame assignment to HitFrame $str = Bio::AlignIO->new(-file => test_input_file("bl2seq.tblastn.out"), -format => 'bl2seq'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); foreach my $seq ( $aln->each_seq_with_id('WAN03UHTX_1') ) { is $seq->start(), 946; is $seq->end(), 990; is $seq->strand(), -1; }
# File: BioPerl-1.7.8/t/AlignIO/prodom.t
# -*-Perl-*- Test Harness script for Bioperl # $Id: prodom.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 3); use_ok('Bio::AlignIO::prodom'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # PRODOM $str = Bio::AlignIO->new( '-file' => test_input_file("testaln.prodom"), '-format' => 'prodom'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'P04777/1-33', "prodom input test ";
# File: BioPerl-1.7.8/t/AlignIO/phylip.t
# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 20); use_ok('Bio::AlignIO::phylip'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # PHYLIP sequential/non-interleaved $strout = Bio::AlignIO->new('-file' => test_input_file('noninterleaved.phy'), '-interleaved' => 0, '-format' => 'phylip'); $aln = $strout->next_aln($aln); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(2)->seq(), 'CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAA'. 'AGGTAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGACATGAATT'. 'TGCCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGGTTTATCAAAGTAAGACAGTATGATCAGA'. 
'TACCCATAGAGATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGACCCACACCTGTCAATATAATTG'. 'GAAGAAATCTGTTGACTCAGATTGGTTGCACTTTAAATTTT' ); # PHYLIP interleaved with long Ids $str = Bio::AlignIO->new( '-file' => test_input_file("protpars_longid.phy"), '-format' => 'phylip', 'longid' => 1); $aln = $str->next_aln(); #isa_ok($str,'Bio::AlignIO'); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'S I N F R U P 0 0 1 /1-84'; is $aln->get_seq_by_pos(2)->get_nse, 'SINFRUP002/1-84'; # PHYLIP interleaved, multiple segments $str = Bio::AlignIO->new( '-file' => test_input_file("protpars.phy"), '-format' => 'phylip'); $aln = $str->next_aln(); #isa_ok($str,'Bio::AlignIO'); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'SINFRUP001/1-4940'; # is $aln->get_seq_by_pos(2)->get_nse, 'SINFRUP002/1-84'; # PHYLIP interleaved $str = Bio::AlignIO->new( '-file' => test_input_file("testaln.phylip"), '-format' => 'phylip'); $aln = $str->next_aln(); #isa_ok($str,'Bio::AlignIO'); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'Homo_sapie/1-45'; $strout = Bio::AlignIO->new( '-file' => ">".test_output_file(), '-format' => 'phylip'); $status = $strout->write_aln($aln); is $status, 1, "phylip output test"; # check the LocatableSeq start/end/strand etc my $ls = $aln->get_seq_by_pos(2); is($ls->display_id, 'Pan_panisc'); is($ls->start, 1); is($ls->end,47); is($ls->length,50); # bug 2984 TODO: { local $TODO = 'problems with default strand'; # shouldn't this be 0? 
is($ls->strand,0); } # check to see that newlines between header and sequences are parsed correctly $str = Bio::AlignIO->new('-file' => test_input_file("codeml45b.mlc"), -format => 'phylip', '-longid' => 1); $aln = $str->next_aln(); $ls = $aln->get_seq_by_pos(9); ok($ls->display_id eq "Pop_trich_ch", "newline between header and sequences is parsed correctly"); # bug 123 "Problem with newer versions of AlignIO to write alignment data #123" my $io = Bio::AlignIO->new( -file => test_input_file("ORTHOMCL2345.cluster.aa.fa.aln.aa.phy.txt"), -format => 'phylip'); $aln = $io->next_aln(); isa_ok($aln,'Bio::SimpleAlign'); # Alignment file where an id has spaces in it $io = Bio::AlignIO->new( -file => test_input_file("ids-with-spaces.phy"), -format => 'phylip'); $aln = $io->next_aln(); isa_ok($aln,'Bio::SimpleAlign'); is($aln->get_seq_by_pos(1)->display_id, 'A_BAD_ID');
# File: BioPerl-1.7.8/t/AlignIO/emboss.t
# -*-Perl-*- Test Harness script for Bioperl # $Id: emboss.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 37); use_ok('Bio::AlignIO::emboss'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # EMBOSS water $str = Bio::AlignIO->new('-format' => 'emboss', '-file' => test_input_file('cysprot.water')); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->score,'501.50'); is($aln->get_seq_by_pos(1)->get_nse,'PAPA_CARPA/3-342'); is($aln->get_seq_by_pos(2)->get_nse,'CATL_HUMAN/1-331'); is(sprintf("%.1f",$aln->overall_percentage_identity),33.8); is(sprintf("%.1f",$aln->average_percentage_identity),40.1); is($aln->get_seq_by_pos(1)->start, 3); is($aln->length,364); # EMBOSS needle $str = Bio::AlignIO->new('-format' => 'emboss', '-file' => test_input_file('cysprot.needle')); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->score,'499.50'); is($aln->get_seq_by_pos(1)->get_nse,'PAPA_CARPA/1-345'); 
is($aln->get_seq_by_pos(2)->get_nse,'CATL_HUMAN/1-333'); # EMBOSS water 2.2.x $str = Bio::AlignIO->new('-format' => 'emboss', '-file' => test_input_file('cys1_dicdi.water')); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(1)->get_nse,'CYS1_DICDI/1-343'); is($aln->get_seq_by_pos(2)->get_nse,'CYS1_DICDI-1/1-343'); is($aln->score,'1841.0'); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(1)->get_nse,'CYS1_DICDI/29-343'); is($aln->get_seq_by_pos(2)->get_nse,'ALEU_HORVU/61-360'); # EMBOSS water 2.2.x sparse needle $str = Bio::AlignIO->new(-verbose => $DEBUG, '-format' => 'emboss', '-file' => test_input_file('sparsealn.needle')); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->score,'18.0'); is(sprintf("%.1f",$aln->overall_percentage_identity), 2.1); is(sprintf("%.1f",$aln->average_percentage_identity), 38.5); is($aln->get_seq_by_pos(1)->length, 238); is($aln->length,238); is($aln->get_seq_by_pos(1)->get_nse,'KV1K_HUMAN/1-108'); is($aln->get_seq_by_pos(2)->get_nse,'IF1Y_HUMAN/1-143'); is($aln->get_seq_by_pos(1)->seq(), 'DIQMTQSPSTLSVSVGDRVTITCEASQTVLSYLNWYQQK'. 'PGKAPKLLIYAASSLETGVPSRFSGQGSGTBFTFTISSVZPZBFATYYCQZYLDLPRTFGQGTKVDLKR'. '-'x130); is($aln->get_seq_by_pos(2)->seq(), ('-'x94).'PKNKGKGGK-NRRRGKNENESEKRELVFKE'. 'DGQEYAQVIKMLGNGRLEALCFDGVKRLCHIRGKLRKKVWINTSDIILVGLRDYQDNKADVILKYNADEAR'. 
'SLKAYGGLPEHAKINETDTFGPGDDDEIQFDDIGDDDEDIDDI'); is($aln->is_flush, 1); # EMBOSS needle $str = Bio::AlignIO->new('-format' => 'emboss', '-file' => test_input_file('gf-s71.needle')); $aln = $str->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is($aln->get_seq_by_pos(2)->seq(), 'MEDVTLFQFTWRKPI-RLQGEIVYKTSETQTIETNKKDVECVANFQENKEVQTDS-VDNGVGENVKKDITISKEVLNLLYDFVRDDSKVNYDRLLEFHKFDKVALETVQKYHVETRNENIILMISSSSRKTLILFGGISHETFCSHQARALLCSSSTSFSIPLPVCAISAVFYSSTQFILGDVSGNISMCSKDKIIFEKKITDGAVTCLEMCRHGLLSGSDDGNIILWQIGTSGLEKLGGTKLTVSDLSRKIRRSSTSNKPVAIVSMQVYVWPSGEEACVATETGGLYLLTLPTLDYKPLSHQTATSINKILFENQFVAVIYHTSNAAVFNSEGLVDEIPFVATLAVR----------PKLVLF--YTSVCVQDITLNCTSPFREFNNEYNPVIKFSKIRFSADLSVING-FRTSSPNSNN-----------------------------------------------'); is($aln->get_seq_by_pos(1)->seq(), 'MEDVTLHHFRWRKPVENKNGEIVYKTSETQTAEISRKDVECVANFQKSQESQTDDFMQNGVGDGIKKEIRISKEVLGHIYDFLRDDSKVNYDRLLEFHKFDKVSLETVQKYHVETRNENIILMISNSSRKTLILFGGLSHETFCSHQARAVLCSSSTTSSLPLPVCAISAVFYSSTQFLLGDISGNISMWTKEKMIFENKVTDGSVTSLELCRYGLLSGSDDGNVILWKVEESKIEKIEGIKLTVSDLSRKIRRSSTSNKPVAIVSMQV----SGDEVCVATETGGLYLLTLPTLESKPLT-QSATSIFKILYEHPYIAVVYHTSNSAIFNSEGLVDEIPFVATLAVRCGAYFIFSNQSRLIIWSMNTRSTVIDENLNCHS-ICSLSND--------------TLQVLDGDFNLNSQSENSATSESENLRISDLQNLRMLKLQNLRTSEFQNFRTSESQYFKKDNGEL'); is($aln->is_flush(), 1); is($aln->get_seq_by_pos(1)->get_nse,'gf.s71.44/1-448'); is($aln->get_seq_by_pos(2)->get_nse,'Y50C1A.2/1-406'); fasta.t100644000766000024 6765314006427353 16444 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO# -*-Perl-*- Test Harness script for Bioperl # $Id: SearchIO_fasta.t 14995 2008-11-16 06:20:00Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 301); use_ok('Bio::SearchIO'); } my ($searchio, $result, $hit, $hsp); # simple FASTA $searchio = Bio::SearchIO->new(-format => 'fasta', -file => test_input_file('HUMBETGLOA.FASTA')); $result = $searchio->next_result; like($result->database_name, qr/dros_clones.2.5/); is($result->database_letters, 112936249); 
is($result->database_entries, 657);
is($result->algorithm, 'FASTN');
is($result->algorithm_version, '3.3t08');
is($result->query_name, "HUMBETGLOA");
is($result->query_description, "Human haplotype C4 beta-globin gene, complete cds.");
is($result->query_length, 3002);
is($result->get_parameter('gapopen'), -16);
is($result->get_parameter('gapext'), -4);
is($result->get_parameter('ktup'), 6);

is($result->get_statistic('lambda'), 0.0823);
is($result->get_statistic('dbletters'), 112936249);
is($result->get_statistic('dbentries'), 657);

my @valid = ( [ 'BACR21I23', 73982, 'BACR21I23', '0.017', 44.2],
              [ 'BACR40P19', 73982, 'BACR40P19', '0.017', 44.2],
              [ 'BACR30L17', 32481, 'BACR30L17', '0.018', 44.1]);
my $count = 0;
while( my $hit = $result->next_hit ) {
    my $d = shift @valid;
    is($hit->name, shift @$d);
    is($hit->length, shift @$d);
    is($hit->accession, shift @$d);
    float_is($hit->significance, shift @$d);
    is($hit->raw_score, shift @$d );
    is($hit->rank, $count + 1);
    if( $count == 0 ) {
        my $hsps_left = 1;
        while( my $hsp = $hit->next_hsp ) {
            is($hsp->query->start, 31);
            is($hsp->query->end, 289);
            is($hsp->query->strand, -1);
            is($hsp->hit->end, 65167);
            is($hsp->hit->start, 64902);
            is($hsp->hit->strand, 1);
            is($hsp->length('total'), 267);
            float_is($hsp->evalue, 0.017);
            is($hsp->score, 134.5);
            is($hsp->bits,44.2);
            is(sprintf("%.2f",$hsp->percent_identity), '57.30');
            is(sprintf("%.4f",$hsp->frac_identical('query')), 0.5907);
            is(sprintf("%.4f",$hsp->frac_identical('hit')), 0.5752);
            # these are really UNGAPPED values not CONSERVED
            # otherwise ident and conserved would be identical for
            # nucleotide alignments
            is(sprintf("%.4f",$hsp->frac_conserved('total')), '0.5918');
            is(sprintf("%.4f",$hsp->frac_conserved('query')), '0.6100');
            is(sprintf("%.4f",$hsp->frac_conserved('hit')), '0.5940');
            is($hsp->query->frame(), 0);
            is($hsp->hit->frame(), 0);
            is($hsp->gaps('query'), 8);
            is($hsp->gaps('hit'),1);
            is($hsp->gaps('total'), 9);
            is($hsp->query_string,
'GATTAAAACCTTCTGGTAAGAAAAGAAAAAATATATATATATATATATGTGTATATGTACACACATACATATACATATATATGCATTCATTTGTTGTTGTTTTTCTTAATTTGCTCATGCATGCTA----ATAAATTATGTCTAAAAATAGAAT---AAATACAAATCAATGTGCTCTGTGCATTA-GTTACTTATTAGGTTTTGGGAAACAAGAGGTAAAAAACTAGAGACCTCTTAATGCAGTCAAAAATACAAATAAATAAAAAGTCACTTACAACCCAAAGTGTGACTATCAATGGGGTAATCAGTGGTGTCAAATAGGAGGT'); is($hsp->hit_string, 'GATGTCCTTGGTGGATTATGGTGTTAGGGTATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATAATATAATACAAAATATAATACAAAATATAATACAAAATATAATACAAAATATAATACAAAATATAATACAAAATATAATACAAAATATAATATAAAATATAATATAAAATATAATATAAAATAAAATATAAAATAAAATATAAAATAAAATATAAAATAAAATATAAAATAAAATAT-AATATAAAATATAAAATAAAATATAATATAAAATATAATATAAAATATAATATAAAATATAATATAAAATA'); is($hsp->homology_string, ' :::::::::::::::::: : ::::: :: : : ::: ::::: :::::::: :: :: : : : : : : : :: : :: :: :: : ::: ::: :::::: ::: ::::: :: ::: : : : :: ::: : :: : : : : :: : :: : : :: : : :: : : ::: ::: :: ::::: ::: : : :: :: ::: : : : ::: :: '.' 'x60); is(join(' ', $hsp->seq_inds('query', 'nomatch',1)), '33 37 39 41 43 47-49 52 55 56 58 60 64 70 71 74 78 82 84 86 87 90-96 98 100 103 105 107 110-112 114 117 119 121-123 125 127-129 132 134 135 139 140 143 145-148 150-153 155 156 160 161 164 173 180-184 188 192 196-198 201 204 206-209 212 213 215 217 219 221 223-225 227 229 232 233 236 237 246 252 256 258 260 263 269 271'); is(join(' ', $hsp->seq_inds('query', 'mismatch',1)), '33 37 41 43 47-49 52 55 56 58 60 64 70 71 74 78 82 84 86 87 90-96 98 100 103 105 107 110-112 114 117 119 121-123 125 127-129 132 134 135 139 140 143 145-148 150-153 155 156 160 161 164 173 180-184 188 192 196-198 201 204 206-209 212 213 215 217 219 221 223-225 227 229 232 233 236 237 246 252 256 258 260 263 269 271'); is(join(' ', $hsp->seq_inds('query', 'conserved',1)), '31 32 34-36 38 40 42 44-46 50 51 53 54 57 59 61-63 65-69 72 73 75-77 79-81 83 85 88 89 97 99 101 102 104 106 108 109 113 115 116 118 120 124 126 130 131 133 136-138 141 142 144 149 154 157-159 162 163 165-172 174-179 
185-187 189-191 193-195 199 200 202 203 205 210 211 214 216 218 220 222 226 228 230 231 234 235 238-245 247-251 253-255 257 259 261 262 264-268 270 272-289'); is(join(' ', $hsp->seq_inds('hit', 'nomatch',1)), '64920 64922 64928 64931 64933 64935 64939 64945 64954 64955 64958 64959 64962 64964 64966-64968 64970 64972 64974 64976 64978 64979 64982-64985 64987 64990 64993-64995 64998-65001 65003 65007 65011-65015 65022 65026-65028 65034 65037 65038 65042 65043 65045-65048 65050-65053 65055 65058-65060 65064 65065 65067 65070-65072 65074 65076-65078 65080 65082 65085 65087-65089 65092 65094 65096 65099 65101 65103-65109 65112 65113 65115 65117 65121 65125 65128 65129 65135 65139 65141 65143 65144 65147 65150-65152 65156 65158 65161 65165'); is(join(' ', $hsp->seq_inds('hit', 'mismatch',1)), '64920 64922 64928 64931 64933 64935 64939 64945 64954 64955 64958 64959 64962 64964 64966-64968 64970 64972 64974 64976 64978 64979 64982-64985 64987 64990 64993-64995 65003 65007 65011-65015 65022 65034 65037 65038 65042 65043 65045-65048 65050-65053 65055 65059 65060 65064 65065 65067 65070-65072 65074 65076-65078 65080 65082 65085 65087-65089 65092 65094 65096 65099 65101 65103-65109 65112 65113 65115 65117 65121 65125 65128 65129 65135 65139 65141 65143 65144 65147 65150-65152 65156 65158 65161 65165'); is(join(' ', $hsp->seq_inds('hit', 'conserved',1)), '64902-64919 64921 64923-64927 64929 64930 64932 64934 64936-64938 64940-64944 64946-64953 64956 64957 64960 64961 64963 64965 64969 64971 64973 64975 64977 64980 64981 64986 64988 64989 64991 64992 64996 64997 65002 65004-65006 65008-65010 65016-65021 65023-65025 65029-65033 65035 65036 65039-65041 65044 65049 65054 65056 65057 65061-65063 65066 65068 65069 65073 65075 65079 65081 65083 65084 65086 65090 65091 65093 65095 65097 65098 65100 65102 65110 65111 65114 65116 65118-65120 65122-65124 65126 65127 65130-65134 65136-65138 65140 65142 65145 65146 65148 65149 65153-65155 65157 65159 65160 65162-65164 65166 65167'); 
            is(join(' ', $hsp->seq_inds('query', 'gap',1)), '141 170 194');
            is(join(' ', $hsp->seq_inds('hit', 'frameshift')), '');
            is(join(' ', $hsp->seq_inds('query', 'frameshift')), '');
            is($hsp->ambiguous_seq_inds, '');
            # note: the reason this is not the same percent id above
            # is we are calculating average percent id
            is(sprintf("%.2f",$hsp->get_aln->percentage_identity()), '59.30');
            $hsps_left--;
        }
        is($hsps_left, 0);
    }
    last if( $count++ > @valid );
}
is(@valid, 0);

$searchio = Bio::SearchIO->new(-format => 'fasta', -file => test_input_file('cysprot1.FASTA'));
$result = $searchio->next_result;
like($result->database_name, qr/ecoli.aa/);
is($result->database_letters, 1358987);
is($result->database_entries, 4289);
is($result->algorithm, 'FASTP');
is($result->algorithm_version, '3.3t08');
is($result->query_name, 'CYS1_DICDI');
is($result->query_length, 343);
is($result->get_parameter('gapopen'), -12);
is($result->get_parameter('gapext'), -2);
is($result->get_parameter('ktup'), 2);

is($result->get_statistic('lambda'), 0.1456);
is($result->get_statistic('dbletters'), 1358987);
is($result->get_statistic('dbentries'), 4289);

@valid = ( [ 'gi|1787478|gb|AAC74309.1|', 512, 'AAC74309', 1787478, 1.2, 29.2],
           [ 'gi|1790635|gb|AAC77148.1|', 251, 'AAC77148', 1790635, 2.1, 27.4],
           [ 'gi|1786590|gb|AAC73494.1|', 94, 'AAC73494', 1786590, 2.1, 25.9]);
$count = 0;
while( my $hit = $result->next_hit ) {
    my $d = shift @valid;
    is($hit->name, shift @$d);
    is($hit->length, shift @$d);
    is($hit->accession, shift @$d);
    is($hit->ncbi_gi, shift @$d);
    float_is($hit->significance, shift @$d);
    is($hit->raw_score, shift @$d );
    if( $count == 0 ) {
        my $hsps_left = 1;
        while( my $hsp = $hit->next_hsp ) {
            is($hsp->query->start, 125);
            is($hsp->query->end, 305);
            is($hsp->query->strand, 0);
            is($hsp->hit->start, 2);
            is($hsp->hit->end, 181);
            is($hsp->hit->strand, 0);
            is($hsp->length('total'), 188);
            float_is($hsp->evalue, 1.2);
            is($hsp->score, 109.2);
            is($hsp->bits,29.2);
            is(sprintf("%.2f",$hsp->percent_identity),
23.94);
            is(sprintf("%.4f",$hsp->frac_identical('query')), 0.2486);
            is(sprintf("%.4f",$hsp->frac_identical('hit')), '0.2500');
            is(sprintf("%.4f",$hsp->frac_conserved('query')), '0.2707');
            is(sprintf("%.4f",$hsp->frac_conserved('hit')), '0.2722');
            # there is a slight rounding difference here, so the file says 26.012%
            # but with the rounding this ends up as 0.2606
            is(sprintf("%.4f",$hsp->frac_conserved('total')), '0.2606');
            is($hsp->query->frame(), 0);
            is($hsp->hit->frame(), 0);
            is($hsp->gaps('query'), 7);
            is($hsp->gaps('hit'), 8);
            is($hsp->gaps, 15);
            is($hsp->query_string, 'NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTT-GNV----EGQHFISQNKLVSLSEQNLVDCDHECME-YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGP-LAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII');
            is($hsp->hit_string, (' 'x29).'MKIRSQVGMVLNLDKCIGCHTCSVTCKNVWTSREGVEYAWFNNVETKPGQGF-PTDWENQEKYKGGWI--RKINGKLQPRMGNRAMLLGKIFANPHLPGIDDYYEPFDFDYQNLHTAPEG----SKSQPIARPRSLITGERMAKIEKGPNWEDDLGGEFDKLAKDKNFDN-IQKAMYSQFENTFMMYLPRLCEHCLNPACVATCPSGAIYKREEDGIVLIDQDKCRGWRMCITGCPYKKIYFNWKSGKSEKCIFCYPRIEAGQPTVCSETC');
            is($hsp->homology_string, ' . :. : : : .: .: . :.: :: :: .. :.. . :.. : : .: :.: . :: ::: : . : : .. : . .:. :. . . :.. . . :: .: . .:. .:: :: . ...:. : . :: .. : .: '.' 
'x60); # note: the reason this is not the same percent id above # is we are calculating average percent id is(sprintf("%.2f",$hsp->get_aln->percentage_identity()), 26.01); is(join(' ', $hsp->seq_inds('query', 'nomatch',1)), '126 129 131 132 134 136 137 140 143 145 149 154 157-159 163 165-167 171-173 175 177 183-187 189 190 193 197-199 201 202 204 205 207 209 212-214 216-218 220-224 228 229 232 234-236 238-242 246 248-252 254 257 260-263 265 269 270 274 277-279 281 287 289 290 292 295 296 299 301-303'); is(join(' ', $hsp->seq_inds('query', 'mismatch',1)), '126 129 131 132 134 136 137 140 143 145 149 154 157-159 163 165-167 172 173 175 177 183-185 189 190 193 197-199 201 202 204 205 207 209 212-214 216-218 220-224 228 229 232 234-236 242 246 248-252 254 257 260-263 265 269 270 274 277-279 281 289 290 292 295 296 299 301-303'); is(join(' ', $hsp->seq_inds('query', 'conserved',1)), '125 127 128 130 133 135 138 139 141 142 144 146-148 150-153 155 156 160-162 164 168-170 174 176 178-182 188 191 192 194-196 200 203 206 208 210 211 215 219 225-227 230 231 233 237 243-245 247 253 255 256 258 259 264 266-268 271-273 275 276 280 282-286 288 291 293 294 297 298 300 304 305'); is(join(' ', $hsp->seq_inds('hit', 'nomatch',1)), '3 6 8 9 11 13 14 17 20 22 26 27 30-33 36 39-41 45 47-49 53 54 56 58 61 65-67 69 70 73 77-79 81 82 84 85 87 89 92-94 96-98 100-104 108 109 112 114-116 118 122 124-128 130 133 134 137-140 142 146 147 151 154-156 158 165 166 168 171 172 175 177-179'); is(join(' ', $hsp->seq_inds('hit', 'mismatch',1)), '3 6 8 9 11 13 14 17 20 22 27 36 39-41 45 47-49 53 54 56 58 65-67 69 70 73 77-79 81 82 84 85 87 89 92-94 96-98 100-104 108 109 112 114-116 118 122 124-128 130 133 137-140 142 146 147 151 154-156 158 165 166 168 171 172 175 177-179'); is(join(' ', $hsp->seq_inds('hit', 'conserved',1)), '2 4 5 7 10 12 15 16 18 19 21 23-25 28 29 34 35 37 38 42-44 46 50-52 55 57 59 60 62-64 68 71 72 74-76 80 83 86 88 90 91 95 99 105-107 110 111 113 117 119-121 123 129 131 132 135 
136 141 143-145 148-150 152 153 157 159-164 167 169 170 173 174 176 180 181');
            is(join(' ', $hsp->seq_inds('query', 'gap',1)), '148 151 179 257');
            is(join(' ', $hsp->seq_inds('hit', 'frameshift')), '');
            is(join(' ', $hsp->seq_inds('query', 'frameshift')), '');
            $hsps_left--;
        }
        is($hsps_left, 0);
    }
    last if( $count++ > @valid );
}
is(@valid, 0);
is($result->hits, 8);

$searchio = Bio::SearchIO->new(-format => 'fasta', -file => test_input_file('cysprot_vs_gadfly.FASTA'));
$result = $searchio->next_result;
like($result->database_name, qr/gadflypep2/);
is($result->database_letters, 7177762);
is($result->database_entries, 14334);
is($result->algorithm, 'FASTP');
is($result->algorithm_version, '3.3t08');
is($result->query_name, 'cysprot.fa');
is($result->query_length, 2385);
is($result->get_parameter('gapopen'), -12);
is($result->get_parameter('gapext'), -2);
is($result->get_parameter('ktup'), 2);
is($result->get_parameter('matrix'), 'BL50');

is($result->get_statistic('lambda'), 0.1397);
is($result->get_statistic('dbletters'), 7177762 );
is($result->get_statistic('dbentries'), 14334);

@valid = ( [ 'Cp1|FBgn0013770|pp-CT20780|FBan0006692', 341, 'FBan0006692', '3.1e-59', 227.8],
           [ 'CG11459|FBgn0037396|pp-CT28891|FBan0011459', 336, 'FBan0011459', '6.4e-41', 166.9],
           [ 'CG4847|FBgn0034229|pp-CT15577|FBan0004847', 390, 'FBan0004847', '2.5e-40', 165.2]);
$count = 0;
while( my $hit = $result->next_hit ) {
    my $d = shift @valid;
    is($hit->name, shift @$d);
    is($hit->length, shift @$d);
    is($hit->accession, shift @$d);
    float_is($hit->significance, shift @$d);
    is($hit->raw_score, shift @$d );
    if( $count == 0 ) {
        my $hsps_left = 1;
        while( my $hsp = $hit->next_hsp ) {
            is($hsp->query->start, 1373);
            is($hsp->query->end, 1706);
            is($hsp->query->strand, 0);
            is($hsp->hit->start, 5);
            is($hsp->hit->end, 341);
            is($hsp->hit->strand, 0);
            is($hsp->length('total'), 345);
            float_is($hsp->evalue, 3.1e-59);
            is($hsp->score, 1170.6);
            is($hsp->bits,227.8);
            is(sprintf("%.2f",$hsp->percent_identity), 53.04);
is(sprintf("%.4f",$hsp->frac_identical('query')), 0.5479); is(sprintf("%.4f",$hsp->frac_identical('hit')), '0.5430'); is($hsp->query->frame(), 0); is($hsp->hit->frame(), 0); is($hsp->gaps('query'), 11); is($hsp->gaps('hit'), 8); is($hsp->gaps, 19); is($hsp->hit_string, (' 'x26).'MRTAVLLPLLAL----LAVAQA-VSFADVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGED---YWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV'); is($hsp->query_string, 'SNWGNNGYFLIERGKNMCGLAACASYPIPQVMNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ---NRKPR------KGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTVMTPLLLLAVLCLGTALATPKFDQTFNAQWHQWKSTHRRLYGTNEE'); # note: the reason this is not the same percent id above # is we are calculating average percent id is(sprintf("%.2f",$hsp->get_aln->percentage_identity()), 56.13); is(join(' ', $hsp->seq_inds('query', 'nomatch',1)), '1375 1377 1379 1381-1384 1391 1395-1397 1399 1400 1403 1404 1407 1408 1410 1412 1414 1415 1419 1422 1423 1429 1431 1433 1434 1440 1444 1447 1452 1457 1458 1473 1477 1480 1481 1483-1487 1495 1498 1501 1524 1530 1544-1546 1558 1580 1588-1592 1596 1607 1609 1630 1638 1658 1660 1662 1663 1665-1667 1677 1678 1681 1682 1688 1705'); is(join(' ', $hsp->seq_inds('query', 'mismatch',1)), '1375 1377 1379 1395-1397 1399 1400 1403 1404 1407 1408 1410 1412 1414 1415 1419 1422 1423 1429 1431 1433 1434 1440 1444 1447 1452 1457 1458 1473 1477 1480 1481 1483-1487 1495 1498 1501 1524 1530 1544-1546 1558 1580 1588-1592 1596 1607 1609 1630 1638 1658 1660 1662 1663 1677 1678 1681 1682 1688 1705'); is(join(' ', 
$hsp->seq_inds('query', 'conserved',1)), '1373 1374 1376 1378 1380 1385-1390 1392-1394 1398 1401 1402 1405 1406 1409 1411 1413 1416-1418 1420 1421 1424-1428 1430 1432 1435-1439 1441-1443 1445 1446 1448-1451 1453-1456 1459-1472 1474-1476 1478 1479 1482 1488-1494 1496 1497 1499 1500 1502-1523 1525-1529 1531-1543 1547-1557 1559-1579 1581-1587 1593-1595 1597-1606 1608 1610-1629 1631-1637 1639-1657 1659 1661 1664 1668-1676 1679 1680 1683-1687 1689-1704 1706'); is(join(' ', $hsp->seq_inds('hit', 'nomatch',1)), '7 9 11 22-24 26 27 30 31 34 35 37 39 41-43 47 50 51 57 59 61 62 68 72 75 80 85 86 98-100 104 106-111 114 117 118 120-124 132 135 138 161 167 181-183 195 217 225-229 233 241 245 247 268 276 296 298 300 301 312 313 316 317 323 340'); is(join(' ', $hsp->seq_inds('hit', 'mismatch',1)), '7 9 11 22-24 26 27 30 31 34 35 37 39 42 43 47 50 51 57 59 61 62 68 72 75 80 85 86 104 114 117 118 120-124 132 135 138 161 167 181-183 195 217 225-229 233 245 247 268 276 296 298 300 301 312 313 316 317 323 340'); is(join(' ', $hsp->seq_inds('hit', 'conserved',1)), '5 6 8 10 12-21 25 28 29 32 33 36 38 40 44-46 48 49 52-56 58 60 63-67 69-71 73 74 76-79 81-84 87-97 101-103 105 112 113 115 116 119 125-131 133 134 136 137 139-160 162-166 168-180 184-194 196-216 218-224 230-232 234-240 242-244 246 248-267 269-275 277-295 297 299 302-311 314 315 318-322 324-339 341'); is(join(' ', $hsp->seq_inds('query', 'gap',1)), '1413 1469 1474 1603'); is(join(' ', $hsp->seq_inds('hit', 'frameshift')), ''); is(join(' ', $hsp->seq_inds('query', 'frameshift')), ''); $hsps_left--; } is($hsps_left, 0); } last if( $count++ > @valid ); } is(@valid, 0); is($result->hits, 21); # test on TFASTXY $searchio = Bio::SearchIO->new(-format => 'fasta', -file => test_input_file('5X_1895.FASTXY')); $result = $searchio->next_result; like($result->database_name, qr/yeast_nrpep.fasta/); is($result->database_letters, 4215311); is($result->database_entries, 9190); is($result->algorithm, 'FASTY'); is($result->algorithm_version, 
'3.4t07'); is($result->query_name, '5X_1895.fa'); is($result->query_length, 7972); is($result->get_parameter('gapopen'), -14); is($result->get_parameter('gapext'), -2); is($result->get_parameter('ktup'), 2); is($result->get_parameter('matrix'), 'BL50'); is($result->get_statistic('lambda'), 0.1711); is($result->get_statistic('dbletters'), 4215311); is($result->get_statistic('dbentries'), 9190); @valid = ( [ 'NR_SC:SW-YNN2_YEAST', 1056, 'NR_SC:SW-YNN2_YEAST','1.6e-154', '547.0'], [ 'NR_SC:SW-MPCP_YEAST', 311, 'NR_SC:SW-MPCP_YEAST', '1.3e-25', 117.1], [ 'NR_SC:SW-YEO3_YEAST', 300, 'NR_SC:SW-YEO3_YEAST', '5.7e-05', 48.5]); $count = 0; while( my $hit = $result->next_hit ) { my $d = shift @valid; is($hit->name, shift @$d); is($hit->length, shift @$d); is($hit->accession, shift @$d); float_is($hit->significance, shift @$d); is($hit->raw_score, shift @$d ); if ( $count == 1 ) { my $hsps_left = 1; while( my $hsp = $hit->next_hsp ) { is($hsp->query->start, 6947); is($hsp->query->end, 7938); is($hsp->query->strand, -1); is($hsp->hit->start, 19); is($hsp->hit->end, 287); is($hsp->hit->strand, 0); is($hsp->length('total'), 330); float_is($hsp->evalue, 1.3e-25); is($hsp->score, 563.4); is($hsp->bits,'117.1'); is(sprintf("%.2f",$hsp->percent_identity), 54.24); is(sprintf("%.4f",$hsp->frac_identical('query')), '0.5413'); is(sprintf("%.4f",$hsp->frac_identical('hit')), '0.6654'); is($hsp->query->frame(), 0); is($hsp->hit->frame(), 0); is($hsp->gaps('query'), 0); is($hsp->gaps('hit'), 61); is($hsp->gaps, 61); is($hsp->query_string, 'RFALAGALGCAVTHGALTPVDVVKTRIQLEPEVYNRVGRFFNSS*GF*EL*GVVLMSQT\KGMVASFRQIIAKEGAGALLTGFGPTAVGYAIQGAFKFGG*VMMSLQITA*SRANLPISYEFWKKKAIDLVGVDKARENRQAIYLGASAIAEFFADIALCPLEATRIRLVSQPSFANGLSGGFLRILREEGPAAFYAGFGPILFKQVPYTMAKFAV*VDRTA*\YQTFG*YYRSYEVAVEKILKATGKSKDSLTGGQLTGLNLTSGLIAGLAAAVISQPADTLLSKINKTKGAPGQSTTSRLVQMAGQLGVSGLFTGMTTRLVMIGTLTAGQ'); is($hsp->hit_string, 
'KFALAGAIGCGSTHSSMVPIDVVKTRIQLEPTVYN-------------------------KGMVGSFKQIIAGEGAGALLTGFGPTLLGYSIQGAFKFGG-------------------YEVFKKFFIDNLGYDTASRYKNSVYMGSAAMAEFLADIALCPLEATRIRLVSQPQFANGLVGGFSRILKEEGIGSFYSGFTPILFKQIPYNIAKFLVFERASEF-YYGFAG------------------PKEKLSSTSTTLLNLLSGLTAGLAAAIVSQPADTLLSKVNKTKKAPGQSTVGLLAQLAKQLGFFGSFAGLPTRLVMVGTLTSLQ'); is($hsp->homology_string, '.::::::.::. ::....:.::::::::::: ::: ::::.::.:::: ::::::::::::: .::.::::::::: :: .:: :: .: : : . ....:.:..:.:::.:::::::::::::::::::.::::: ::: :::.::: ..::.:: ::::::.::..::: : . : :. :..:.. . : ::: ::: ::::::..::::::::::.:::: ::::::.. :.:.: ::: : :.:. :::::.::::. :'); # note: the reason this is not the same percent id above # is we are calculating average percent id is(sprintf("%.2f",$hsp->get_aln->overall_percentage_identity()), '53.92'); is(sprintf("%.2f",$hsp->get_aln->average_percentage_identity()), '67.29'); is(join(' ', $hsp->seq_inds('query', 'nomatch',1)), '6950-6952 6986-6988 7001-7003 7007-7012 7022-7024 7040-7042 7067-7069 7139-7141 7151-7153 7163-7165 7169-7171 7175-7177 7196-7255 7262-7267 7272-7277 7281-7292 7296-7298 7341-7343 7365-7367 7389-7391 7401-7403 7524-7526 7530-7532 7536-7538 7542-7544 7551-7553 7560-7565 7575-7577 7584-7640 7680-7682 7722-7724 7762-7833 7843-7845 7903-7905'); is(join(' ', $hsp->seq_inds('query', 'mismatch',1)), '6950-6952 6986-6988 7001-7003 7007-7012 7022-7024 7040-7042 7067-7069 7139-7141 7151-7153 7163-7165 7169-7171 7175-7177 7196-7198 7253-7255 7262-7267 7272-7277 7281-7292 7296-7298 7341-7343 7365-7367 7389-7391 7401-7403 7524-7526 7530-7532 7536-7538 7542-7544 7551-7553 7560-7565 7575-7577 7680-7682 7722-7724 7843-7845 7903-7905'); is(join(' ', $hsp->seq_inds('query', 'conserved',1)), '6947-6949 6953-6985 6989-7000 7004-7006 7013-7021 7025-7039 7043-7066 7070-7138 7142-7150 7154-7162 7166-7168 7172-7174 7178-7195 7256-7261 7269-7272 7278-7280 7293-7295 7299-7340 7344-7364 7368-7388 7392-7400 7404-7523 7527-7529 7533-7535 7539-7541 7545-7550 7554-7559 7566-7574 
7578-7583 7641-7679 7683-7721 7725-7757 7759-7762 7834-7842 7846-7902 7906-7938'); is(join(' ', $hsp->seq_inds('query', 'gap',1)), ''); is(join(' ', $hsp->seq_inds('hit', 'nomatch',1)), '30 50 66 80 96 100 101 104 107 109 111 113 154 158 166 174 189 191-194 196 197 199 200 203 204 211 213 215 219 223 247 256 262 266 267 269 274 286'); is(join(' ', $hsp->seq_inds('hit', 'mismatch',1)), '30 50 66 80 96 100 101 104 107 109 111 113 154 158 166 174 189 191-194 196 197 199 200 203 204 211 213 215 219 223 247 256 262 266 267 269 274 286'); is(join(' ', $hsp->seq_inds('hit', 'conserved',1)), '19-29 31-49 51-65 67-79 81-95 97-99 102 103 105 106 108 110 112 114-153 155-157 159-165 167-173 175-188 190 195 198 201 202 205-210 212 214 216-218 220-222 224-246 248-255 257-261 263-265 268 270-273 275-285 287'); is(join(' ', $hsp->seq_inds('hit', 'gap',1)), '53 93 203'); is(join(' ', $hsp->seq_inds('query', 'frameshift', 1)), '7271-7273 7761-7763'); is(join(' ', $hsp->seq_inds('hit', 'frameshift')), ''); is($hsp->ambiguous_seq_inds, 'query'); $hsps_left--; } is($hsps_left, 0); } last if( $count++ > @valid ); } is(@valid, 0); is($result->hits, 58); # test FASTA v35.04, params encoding changed # test on TFASTXY $searchio = Bio::SearchIO->new(-format => 'fasta', -file => test_input_file('BOSS_DROME.FASTP_v35_04')); $result = $searchio->next_result; like($result->database_name, qr/wormpep190/, 'TFASTXY'); is($result->database_letters, 10449259); is($result->database_entries, 23771); is($result->algorithm, 'FASTP'); is($result->algorithm_version, '35.04'); is($result->query_name, 'BOSS_DROME'); is($result->query_length, 896); is($result->get_parameter('gapopen'), -10); is($result->get_parameter('gapext'), -2); is($result->get_parameter('ktup'), 2); is($result->get_parameter('matrix'), 'BL50'); # wordsize is the same as ktup, not opt width, as we used to parse # is($result->get_parameter('wordsize'), 16); # this is the range of the scoring matrix, not a filter (which is meant # to 
# capture whether xS seg filtering used)
# is($result->get_parameter('filter'), '15:-5');

is($result->get_statistic('lambda'), 0.122629);
is($result->get_statistic('dbletters'), 10449259);
is($result->get_statistic('dbentries'), 23771);
is($result->get_statistic('effectivespace'),23771);

# bug 2473 - fasta3.4 parsing with -U option
$searchio = Bio::SearchIO->new(-format => 'fasta', -file => test_input_file('bug2473.fasta'));

while(my $res = $searchio->next_result) {
    is($res->query_name, 'total:39860_L:12096_-3:12346_0:617_+3:14801');
    is($res->query_description, '');
    is($res->query_length, 22);
    is($res->algorithm, 'FASTN');
}

# bug 2537
$searchio = Bio::SearchIO->new( -format => 'fasta', -file => test_input_file('in.fasta'), );

while ( my $result = $searchio->next_result() ) {
    while( my $hit = $result->next_hit ) {
        # process the Bio::Search::Hit::HitI object
        while( my $hsp = $hit->next_hsp ) {
            # process the Bio::Search::HSP::HSPI object
            is($hsp->num_identical, 30, "num_identical()");
            is($hsp->num_conserved, 30, "num_conserved()");
        }
    }
}

# bug 2937 and version 35.04
$searchio = Bio::SearchIO->new( -format => 'fasta', -file => test_input_file('bug2937.fasta'), );
ok($searchio, 'bug 2937 and FASTA version 3.5');

while ( my $result = $searchio->next_result() ) {
    is ($result->algorithm_version, '35.04', 'algorithm version');
    is($result->query_name, 'ILTV-miR1', 'query name');
    is($result->query_description, '', 'query description');
    is($result->query_length, 70, 'query length');
    is($result->algorithm, 'FASTN', 'algorithm');
    while( my $hit = $result->next_hit ) {
        # process the Bio::Search::Hit::HitI object
        while( my $hsp = $hit->next_hsp ) {
            # process the Bio::Search::HSP::HSPI object
            is($hsp->num_identical, 26, "num_identical()");
            is($hsp->num_conserved, 26, "num_conserved()");
            is($hsp->strand('hit'), '1', 'hsp->strand(hit)');
            is($hsp->hit->strand, '1', 'hsp->hit->strand');
            is($hsp->strand('query'), '-1', 'hsp->strand(query)');
            is($hsp->query->strand, '-1',
               'hsp->query->strand');
        }
    }
}

$searchio = Bio::SearchIO->new( -format => 'fasta', -file => test_input_file('issue255_ssearch.fasta'),);
ok($searchio, 'issue 255');

while (my $result = $searchio->next_result() ) {
    while (my $hit = $result->next_hit ) {
        is ($hit->next_hsp->score, '424.1', 'issue 255: first hsp score');
    }
}
blast.t100644000766000024 26314714006427353 16467 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO
# -*-Perl-*- Test Harness script for Bioperl
# $Id: SearchIO_blast.t 14995 2008-11-16 06:20:00Z cjfields $

use strict;
use warnings;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 1389);

    use_ok('Bio::SearchIO');
}

SKIP: {
    test_skip(-tests => 4, -requires_module => 'Path::Class');

    my $file = Path::Class::file(test_input_file('ecolitst.bls'));
    my $f = sub { my ($file) = @_; Bio::SearchIO->new( -file => $file, -format => 'blast') };

    lives_ok(sub { $f->($file) } , 'Bio::SearchIO->new can handle a Path::Class object');
    isa_ok($f->($file), 'Bio::Root::IO');

    $file = Path::Class::dir(File::Spec->catfile(qw/t data/))->file('ecolitst.bls');

    lives_ok(sub { $f->($file) } , 'Bio::SearchIO->new can handle a Path::Class object');
    isa_ok($f->($file), 'Bio::Root::IO');
}

my ( $searchio, $result, $iter, $hit, $hsp );

$searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('ecolitst.bls') );

$result = $searchio->next_result;

like($result->algorithm_reference, qr/Gapped BLAST and PSI-BLAST: a new generation of protein database search/ );

is( $result->database_name, 'ecoli.aa', 'database_name()' );
is( $result->database_entries, 4289 );
is( $result->database_letters, 1358990 );

is( $result->algorithm, 'BLASTP' );
like( $result->algorithm_version, qr/^2\.1\.3/ );
like( $result->query_name, qr/gi|1786183|gb|AAC73113.1| (AE000111) aspartokinase I,\s+homoserine dehydrogenase I [Escherichia coli]/ );
is( $result->query_accession, 'AAC73113.1' );
is( $result->query_gi, 1786183 );
is( $result->query_length, 820 );
is(
    $result->get_statistic('kappa'), '0.135' );
is( $result->get_statistic('kappa_gapped'), '0.0410' );
is( $result->get_statistic('lambda'), '0.319' );
is( $result->get_statistic('lambda_gapped'), '0.267' );
is( $result->get_statistic('entropy'), '0.383' );
is( $result->get_statistic('entropy_gapped'), '0.140' );

is( $result->get_statistic('dbletters'), 1358990 );
is( $result->get_statistic('dbentries'), 4289 );
is( $result->get_statistic('effective_hsplength'), 47 );
is( $result->get_statistic('effectivespace'), 894675611 );
is( $result->get_parameter('matrix'), 'BLOSUM62' );
is( $result->get_parameter('gapopen'), 11 );
is( $result->get_parameter('gapext'), 1 );
is( $result->get_statistic('S2'), '92' );
is( $result->get_statistic('S2_bits'), '40.0' );
float_is( $result->get_parameter('expect'), '1.0e-03' );
is( $result->get_statistic('num_extensions'), '82424' );
is( $result->get_statistic('querylength'), 773 );
is( $result->get_statistic('effectivedblength'), 1157407 );
is( $result->get_statistic('effectivespaceused'), 894675611 );

my @valid = (
    [ 'gb|AAC73113.1|', 820, 'AAC73113', '0', 1567, 4058 ],
    [ 'gb|AAC76922.1|', 810, 'AAC76922', '1e-91', 332, 850 ],
    [ 'gb|AAC76994.1|', 449, 'AAC76994', '3e-47', 184, 467 ]
);
my $count = 0;
while ( $hit = $result->next_hit ) {
    my $d = shift @valid;

    is( $hit->name, shift @$d );
    is( $hit->length, shift @$d );
    is( $hit->accession, shift @$d );
    float_is( $hit->significance, shift @$d );
    is( $hit->bits, shift @$d );
    is( $hit->raw_score, shift @$d );

    if ( $count == 0 ) {
        my $hsps_left = 1;
        while ( my $hsp = $hit->next_hsp ) {
            is( $hsp->query->start, 1 );
            is( $hsp->query->end, 820 );
            is( $hsp->hit->start, 1 );
            is( $hsp->hit->end, 820 );
            is( $hsp->length('total'), 820 );
            is( $hsp->start('hit'), $hsp->hit->start );
            is( $hsp->end('query'), $hsp->query->end );
            is( $hsp->strand('sbjct'), $hsp->subject->strand ); # alias for hit
            float_is( $hsp->evalue, 0.0 );
            is( $hsp->score, 4058 );
            is( $hsp->bits, 1567 );
            is( sprintf( "%.2f",
                $hsp->percent_identity ), 98.29 );
            is( sprintf( "%.4f", $hsp->frac_identical('query') ), 0.9829 );
            is( sprintf( "%.4f", $hsp->frac_identical('hit') ), 0.9829 );
            is( $hsp->gaps, 0 );
            is( $hsp->n, 1 );
            $hsps_left--;
        }
        is( $hsps_left, 0 );
    }
    last if ( $count++ > @valid );
}
is( @valid, 0 );

$searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('ecolitst.wublastp') );

$result = $searchio->next_result;
like($result->algorithm_reference, qr/Gish, W. \(1996-2000\)/);
is( $result->database_name, 'ecoli.aa' );
is( $result->database_letters, 1358990 );
is( $result->database_entries, 4289 );
is( $result->algorithm, 'BLASTP' );
like( $result->algorithm_version, qr/^2\.0MP\-WashU/ );
like( $result->query_name, qr/gi|1786183|gb|AAC73113.1| (AE000111) aspartokinase I,\s+homoserine dehydrogenase I [Escherichia coli]/ );
is( $result->query_accession, 'AAC73113.1' );
is( $result->query_length, 820 );
is( $result->query_gi, 1786183 );

is( $result->get_statistic('kappa'), 0.136 );
is( $result->get_statistic('lambda'), 0.319 );
is( $result->get_statistic('entropy'), 0.384 );
is( $result->get_statistic('dbletters'), 1358990 );
is( $result->get_statistic('dbentries'), 4289 );
is( $result->get_parameter('matrix'), 'BLOSUM62' );
is( $result->get_statistic('Frame+0_lambda_used'), '0.319' );
is( $result->get_statistic('Frame+0_kappa_used'), '0.136' );
is( $result->get_statistic('Frame+0_entropy_used'), '0.384' );

is( $result->get_statistic('Frame+0_lambda_computed'), '0.319' );
is( $result->get_statistic('Frame+0_kappa_computed'), '0.136' );
is( $result->get_statistic('Frame+0_entropy_computed'), '0.384' );

is( $result->get_statistic('Frame+0_lambda_gapped'), '0.244' );
is( $result->get_statistic('Frame+0_kappa_gapped'), '0.0300' );
is( $result->get_statistic('Frame+0_entropy_gapped'), '0.180' );

@valid = (
    [ 'gb|AAC73113.1|', 820, 'AAC73113', '0', 4141 ],
    [ 'gb|AAC76922.1|', 810, 'AAC76922', '3.1e-86', 844 ],
    [ 'gb|AAC76994.1|', 449, 'AAC76994', '2.8e-47', 483 ]
);
$count = 0;
while ( $hit = $result->next_hit ) {
    my $d = shift @valid;

    if ( $count == 1 ) {
        # Test HSP contig data returned by SearchUtils::tile_hsps()
        # Second hit has two hsps that overlap.

        # compare with the contig made by hand for these two contigs
        # in t/data/contig-by-hand.wublastp
        # (in this made-up file, the hsps from ecolitst.wublastp
        # were aligned and contiged, and Length, Identities, Positives
        # were counted, by a human (maj) )

        my $hand_hit = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('contig-by-hand.wublastp') )->next_result->next_hit;
        my $hand_hsp = $hand_hit->next_hsp;
        my @hand_qrng = $hand_hsp->range('query');
        my @hand_srng = $hand_hsp->range('hit');
        my @hand_matches = $hand_hit->matches;

        my ( $qcontigs, $scontigs ) = Bio::Search::SearchUtils::tile_hsps($hit);

        # Query contigs
        is( $qcontigs->[0]->{'start'}, $hand_qrng[0] );
        is( $qcontigs->[0]->{'stop'}, $hand_qrng[1] );
        is( $qcontigs->[0]->{'iden'}, $hand_matches[0] );
        is( $qcontigs->[0]->{'cons'}, $hand_matches[1] );

        # Subject contigs
        is( $scontigs->[0]->{'start'}, $hand_srng[0] );
        is( $scontigs->[0]->{'stop'}, $hand_srng[1] );
        is( $scontigs->[0]->{'iden'}, $hand_matches[0] );
        is( $scontigs->[0]->{'cons'}, $hand_matches[1] );
    }

    is( $hit->name, shift @$d );
    is( $hit->length, shift @$d );
    is( $hit->accession, shift @$d );
    float_is( $hit->significance, shift @$d );
    is( $hit->raw_score, shift @$d );

    if ( $count == 0 ) {
        my $hsps_left = 1;
        while ( my $hsp = $hit->next_hsp ) {
            is( $hsp->query->start, 1 );
            is( $hsp->query->end, 820 );
            is( $hsp->hit->start, 1 );
            is( $hsp->hit->end, 820 );
            is( $hsp->length('total'), 820 );

            float_is( $hsp->evalue, 0.0 );
            float_is( $hsp->pvalue, '0.0' );
            is( $hsp->score, 4141 );
            is( $hsp->bits, 1462.8 );
            is( $hsp->percent_identity, 100 );
            is( $hsp->frac_identical('query'), 1.00 );
            is( $hsp->frac_identical('hit'), 1.00 );
            is( $hsp->gaps, 0 );
            is( $hsp->n, 1 );
            $hsps_left--;
        }
        is( $hsps_left, 0 );
    }
    last if ( $count++ > @valid );
}
is( @valid, 0 );

#
test that add hit really works properly for BLAST objects # bug 1611 my @hits = $result->hits; $result->add_hit( $hits[0] ); is( $result->num_hits, @hits + 1 ); # test WU-BLAST -noseqs option $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('ecolitst.noseqs.wublastp') ); $result = $searchio->next_result; is( $result->algorithm_reference, 'Gish, W. (1996-2004) http://blast.wustl.edu ' ); is( $result->database_name, 'ecoli.aa' ); is( $result->database_letters, 1358990 ); is( $result->database_entries, 4289 ); is( $result->algorithm, 'BLASTP' ); like( $result->algorithm_version, qr/^2\.0MP\-WashU/ ); like( $result->query_name, qr/gi|1786183|gb|AAC73113.1| (AE000111) aspartokinase I,\s+homoserine dehydrogenase I [Escherichia coli]/ ); is( $result->query_accession, 'AAC73113.1' ); is( $result->query_gi, 1786183 ); is( $result->query_length, 820 ); is( $result->get_statistic('kappa'), 0.135 ); is( $result->get_statistic('lambda'), 0.319 ); is( $result->get_statistic('entropy'), 0.384 ); is( $result->get_statistic('dbletters'), 1358990 ); is( $result->get_statistic('dbentries'), 4289 ); is( $result->get_parameter('matrix'), 'BLOSUM62' ); is( $result->get_statistic('Frame+0_lambda_used'), '0.319' ); is( $result->get_statistic('Frame+0_kappa_used'), '0.135' ); is( $result->get_statistic('Frame+0_entropy_used'), '0.384' ); is( $result->get_statistic('Frame+0_lambda_computed'), '0.319' ); is( $result->get_statistic('Frame+0_kappa_computed'), '0.135' ); is( $result->get_statistic('Frame+0_entropy_computed'), '0.384' ); is( $result->get_statistic('Frame+0_lambda_gapped'), '0.244' ); is( $result->get_statistic('Frame+0_kappa_gapped'), '0.0300' ); is( $result->get_statistic('Frame+0_entropy_gapped'), '0.180' ); @valid = ( [ 'gb|AAC73113.1|', 820, 'AAC73113', '0', 4141 ], [ 'gb|AAC76922.1|', 810, 'AAC76922', '6.6e-93', 907 ], [ 'gb|AAC76994.1|', 449, 'AAC76994', '2.8e-47', 483 ] ); $count = 0; while ( $hit = $result->next_hit ) { my $d = shift 
@valid;
    is( $hit->name, shift @$d );
    is( $hit->length, shift @$d );
    is( $hit->accession, shift @$d );
    float_is( $hit->significance, shift @$d );
    is( $hit->raw_score, shift @$d );

    if ( $count == 0 ) {
        my $hsps_left = 1;
        while ( my $hsp = $hit->next_hsp ) {
            is( $hsp->query->start, 1 );
            is( $hsp->query->end, 820 );
            is( $hsp->hit->start, 1 );
            is( $hsp->hit->end, 820 );
            is( $hsp->length('total'), 820 );
            float_is( $hsp->evalue, 0. );
            float_is( $hsp->pvalue, '0.' );
            is( $hsp->score, 4141 );
            is( $hsp->bits, 1462.8 );
            is( $hsp->percent_identity, 100 );
            is( $hsp->frac_identical('query'), 1.00 );
            is( $hsp->frac_identical('hit'), 1.00 );
            is( $hsp->gaps, 0 );
            is( $hsp->n, 1 );
            $hsps_left--;
        }
        is( $hsps_left, 0 );
    }
    last if ( $count++ > @valid );
}
is( @valid, 0 );

# test tblastx
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('HUMBETGLOA.tblastx')
);
$result = $searchio->next_result;
like($result->algorithm_reference,qr/Gapped BLAST and PSI-BLAST/);
is( $result->database_name, 'ecoli.nt' );
is( $result->database_letters, 4662239 );
is( $result->database_entries, 400 );
is( $result->algorithm, 'TBLASTX' );
like( $result->algorithm_version, qr/^2\.1\.2/ );
is( $result->query_name, 'HUMBETGLOA' );
is( $result->query_description,
    'Human haplotype C4 beta-globin gene, complete cds.'
); is( $result->query_length, 3002 ); is( $result->get_statistic('kappa'), 0.135 ); is( $result->get_statistic('lambda'), 0.318 ); is( $result->get_statistic('entropy'), 0.401 ); is( $result->get_statistic('dbletters'), 4662239 ); is( $result->get_statistic('dbentries'), 400 ); is( $result->get_statistic('querylength'), 953 ); is( $result->get_statistic('effectivedblength'), 1535279 ); is( $result->get_statistic('effectivespace'), 1463120887 ); is( $result->get_statistic('effectivespaceused'), 1463120887 ); is( $result->get_statistic('T'), 13 ); is( $result->get_statistic('X1'), 16 ); is( $result->get_statistic('X1_bits'), 7.3 ); is( $result->get_statistic('X2'), 0 ); is( $result->get_statistic('X2_bits'), '0.0' ); is( $result->get_statistic('S1'), 41 ); is( $result->get_statistic('S1_bits'), 21.7 ); is( $result->get_statistic('S2'), 53 ); is( $result->get_statistic('S2_bits'), 27.2 ); is( $result->get_statistic('decayconst'), 0.1 ); is( $result->get_parameter('matrix'), 'BLOSUM62' ); @valid = ( [ 'gb|AE000479.1|AE000479', 10934, 'AE000479', '0.13', 33.6, 67 ], [ 'gb|AE000302.1|AE000302', 10264, 'AE000302', '0.61', 31.3, 62 ], [ 'gb|AE000277.1|AE000277', 11653, 'AE000277', '0.84', 30.8, 61 ] ); $count = 0; while ( $hit = $result->next_hit ) { my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( $hit->accession, shift @$d ); float_is( $hit->significance, shift @$d ); is( $hit->bits, shift @$d ); is( $hit->raw_score, shift @$d ); if ( $count == 0 ) { my $hsps_left = 1; while ( my $hsp = $hit->next_hsp ) { is( $hsp->query->start, 1057 ); is( $hsp->query->end, 1134 ); is( $hsp->query->strand, 1 ); is( $hsp->strand('query'), $hsp->query->strand ); is( $hsp->hit->end, 5893 ); is( $hsp->hit->start, 5816 ); is( $hsp->hit->strand, -1 ); is( $hsp->strand('sbjct'), $hsp->subject->strand ); is( $hsp->length('total'), 26 ); float_is( $hsp->evalue, 0.13 ); is( $hsp->score, 67 ); is( $hsp->bits, 33.6 ); is( sprintf( "%.2f", $hsp->percent_identity 
), 42.31 ); is( sprintf( "%.4f", $hsp->frac_identical('query') ), '0.4231' ); is( sprintf( "%.4f", $hsp->frac_identical('hit') ), '0.4231' ); is( $hsp->query->frame(), 0 ); is( $hsp->hit->frame(), 1 ); is( $hsp->gaps, 0 ); is( $hsp->query_string, 'SAYWSIFPPLGCWWSTLGPRGSLSPL' ); is( $hsp->hit_string, 'AAVWALFPPVGSQWGCLASQWRTSPL' ); is( $hsp->homology_string, '+A W++FPP+G W L + SPL' ); # changed to reflect positional ambiguities, note extra flag is( join( ' ', $hsp->seq_inds( 'query', 'nomatch', 1 ) ), '1063-1065 1090-1095 1099-1104 1108-1113 1117-1125' ); is( join( ' ', $hsp->seq_inds( 'hit', 'nomatch', 1 ) ), '5825-5833 5837-5842 5846-5851 5855-5860 5885-5887' ); is( $hsp->ambiguous_seq_inds, 'query/subject' ); is( $hsp->n, 1 ); $hsps_left--; } is( $hsps_left, 0 ); } last if ( $count++ > @valid ); } is( @valid, 0 ); # test for MarkW bug in blastN $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('a_thaliana.blastn') ); $result = $searchio->next_result; like($result->algorithm_reference,qr/Gapped BLAST and PSI-BLAST/); is( $result->rid, '1012577175-3730-28291' ); is( $result->database_name, 'All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS,or phase 0, 1 or 2 HTGS sequences) ' ); is( $result->database_letters, 4677375331 ); is( $result->database_entries, 1083200 ); is( $result->algorithm, 'BLASTN' ); like( $result->algorithm_version, qr/^2\.2\.1/ ); is( $result->query_name, '' ); is( $result->query_length, 60 ); is( $result->get_parameter('gapopen'), 5 ); is( $result->get_parameter('gapext'), 2 ); is( $result->get_parameter('ktup'), undef ); is( $result->get_statistic('querylength'), 41 ); is( $result->get_statistic('effectivedblength'), 4656794531 ); is( $result->get_statistic('effectivespace'), 190928575771 ); is( $result->get_statistic('effectivespaceused'), 190928575771 ); is( $result->get_statistic('lambda'), 1.37 ); is( $result->get_statistic('kappa'), 0.711 ); is( $result->get_statistic('entropy'), 1.31 ); is( 
$result->get_statistic('T'), 0 ); is( $result->get_statistic('A'), 30 ); is( $result->get_statistic('X1'), '6' ); is( $result->get_statistic('X1_bits'), 11.9 ); is( $result->get_statistic('X2'), 15 ); is( $result->get_statistic('X2_bits'), 29.7 ); is( $result->get_statistic('S1'), 12 ); is( $result->get_statistic('S1_bits'), 24.3 ); is( $result->get_statistic('S2'), 17 ); is( $result->get_statistic('S2_bits'), 34.2 ); is( $result->get_statistic('dbentries'), 1083200 ); @valid = ( [ 'gb|AY052359.1|', 2826, 'AY052359', '3e-18', 95.6, 48, 1, 60, '1.0000' ], [ 'gb|AC002329.2|AC002329', 76170, 'AC002329', '3e-18', 95.6, 48, 1, 60, '1.0000' ], [ 'gb|AF132318.1|AF132318', 5383, 'AF132318', '0.04', 42.1, 21, 35, 55, '0.3500' ] ); $count = 0; while ( my $hit = $result->next_hit ) { my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( $hit->accession, shift @$d ); float_is( $hit->significance, shift @$d ); is( $hit->bits, shift @$d ); is( $hit->raw_score, shift @$d ); is( $hit->start, shift @$d ); is( $hit->end, shift @$d ); is( sprintf( "%.4f", $hit->frac_aligned_query ), shift @$d ); if ( $count == 0 ) { my $hsps_left = 1; while ( my $hsp = $hit->next_hsp ) { is( $hsp->query->start, 1 ); is( $hsp->query->end, 60 ); is( $hsp->query->strand, 1 ); is( $hsp->hit->start, 154 ); is( $hsp->hit->end, 212 ); is( $hsp->hit->strand, 1 ); is( $hsp->length('total'), 60 ); float_is( $hsp->evalue, 3e-18 ); is( $hsp->score, 48 ); is( $hsp->bits, 95.6 ); is( sprintf( "%.2f", $hsp->percent_identity ), 96.67 ); is( sprintf( "%.4f", $hsp->frac_identical('query') ), 0.9667 ); is( sprintf( "%.4f", $hsp->frac_identical('hit') ), 0.9831 ); is( $hsp->query->frame(), 0 ); is( $hsp->hit->frame(), 0 ); is( $hsp->query->seq_id, undef ); is( $hsp->hit->seq_id, 'gb|AY052359.1|' ); is( $hsp->gaps('query'), 0 ); is( $hsp->gaps('hit'), 1 ); is( $hsp->gaps, 1 ); is( $hsp->query_string, 'aggaatgctgtttaattggaatcgtacaatggagaatttgacggaaatagaatcaacgat' ); is( $hsp->hit_string, 
'aggaatgctgtttaattggaatca-acaatggagaatttgacggaaatagaatcaacgat' ); is( $hsp->homology_string, '||||||||||||||||||||||| |||||||||||||||||||||||||||||||||||' ); my $aln = $hsp->get_aln; is( sprintf( "%.2f", $aln->overall_percentage_identity ), 96.67 ); is( sprintf( "%.2f", $aln->percentage_identity ), 98.31 ); is( $hsp->n, 1 ); $hsps_left--; } is( $hsps_left, 0 ); } last if ( $count++ > @valid ); } is( @valid, 0 ); #WU-BlastX test $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('dnaEbsub_ecoli.wublastx') ); $result = $searchio->next_result; is( $result->algorithm_reference, 'Gish, W. (1996-2000) http://blast.wustl.edu Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72. ' ); is( $result->database_name, 'ecoli.aa' ); is( $result->database_letters, 1358990 ); is( $result->database_entries, 4289 ); is( $result->algorithm, 'BLASTX' ); like( $result->algorithm_version, qr/^2\.0MP\-WashU/ ); is( $result->query_name, 'gi|142864|gb|M10040.1|BACDNAE' ); is( $result->query_description, 'B.subtilis dnaE gene encoding DNA primase, complete cds' ); is( $result->query_accession, 'M10040.1' ); is( $result->query_gi, 142864 ); is( $result->query_length, 2001 ); is( $result->get_parameter('matrix'), 'blosum62' ); is( $result->get_statistic('lambda'), 0.318 ); is( $result->get_statistic('kappa'), 0.135 ); is( $result->get_statistic('entropy'), 0.401 ); is( $result->get_statistic('dbentries'), 4289 ); @valid = ( [ 'gi|1789447|gb|AAC76102.1|', 581, 'AAC76102', '1.1e-74', 671 ] ); $count = 0; while ( my $hit = $result->next_hit ) { my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( $hit->accession, shift @$d ); float_is( $hit->significance, shift @$d ); is( $hit->raw_score, shift @$d ); is( sprintf( "%.4f", $hit->frac_identical('query') ), '0.3640' ); is( sprintf( "%.4f", $hit->frac_identical('hit') ), '0.3660' ); is( sprintf( "%.4f", 
$hit->frac_conserved('query') ), '0.5370' ); is( sprintf( "%.4f", $hit->frac_conserved('hit') ), '0.5400' ); is( sprintf( "%.4f", $hit->frac_aligned_query ), '0.6200' ); is( sprintf( "%.4f", $hit->frac_aligned_hit ), '0.7100' ); if ( $count == 0 ) { my $hsps_left = 1; while ( my $hsp = $hit->next_hsp ) { is( $hsp->query->start, 21 ); is( $hsp->query->end, 1265 ); is( $hsp->query->strand, 1 ); is( $hsp->hit->start, 1 ); is( $hsp->hit->end, 413 ); is( $hsp->hit->strand, 0 ); is( $hsp->length('total'), 421 ); float_is( $hsp->evalue, 1.1e-74 ); float_is( $hsp->pvalue, '1.1e-74' ); is( $hsp->score, 671 ); is( $hsp->bits, 265.8 ); is( sprintf( "%.2f", $hsp->percent_identity ), 35.87 ); is( sprintf( "%.4f", $hsp->frac_identical('query') ), 0.3639 ); is( sprintf( "%.4f", $hsp->frac_identical('hit') ), 0.3656 ); is( sprintf( "%.4f", $hsp->frac_conserved('query') ), 0.5373 ); is( sprintf( "%.2f", $hsp->frac_conserved('hit') ), 0.54 ); is( sprintf( "%.4f", $hsp->frac_identical('hsp') ), 0.3587 ); is( sprintf( "%.4f", $hsp->frac_conserved('hsp') ), 0.5297 ); is( $hsp->query->frame(), 2 ); is( $hsp->hit->frame(), 0 ); is( $hsp->gaps('query'), 6 ); is( $hsp->gaps('hit'), 8 ); is( $hsp->gaps, 14 ); is( $hsp->query_string, 'MGNRIPDEIVDQVQKSADIVEVIGDYVQLKKQGRNYFGLCPFHGESTPSFSVSPDKQIFHCFGCGAGGNVFSFLRQMEGYSFAESVSHLADKYQIDFPDDITVHSGARP---ESSGEQKMAEAHELLKKFYHHLLINTKEGQEALDYLLSRGFTKELINEFQIGYALDSWDFITKFLVKRGFSEAQMEKAGLLIRREDGSGYFDRFRNRVMFPIHDHHGAVVAFSGRALGSQQPKYMNSPETPLFHKSKLLYNFYKARLHIRKQERAVLFEGFADVYTAVSSDVKESIATMGTSLTDDHVKILRRNVEEIILCYDSDKAGYEATLKASELL---QKKGCKVRVAMIPDGLDPDDYIKKFGGEKFKNDIIDASVTVMAFKMQYFRKGKNLSDEGDRLAYIKDVLKEISTLSGSLEQEVYVKQ' ); is( $hsp->hit_string, 
'MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSFTVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE----AGSGPSQIERHQRQTLYQLMDGLNTFYQQSL-QQPVATSARQYLEKRGLSHEVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSY-DRFRERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYEAQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRATNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLVRKEGKEAFEARM-EQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLISQVPGETLR-IYLRQ' ); is( $hsp->homology_string, 'M RIP ++ + DIV++I V+LKKQG+N+ CPFH E TPSF+V+ +KQ +HCFGCGA GN FL + F E+V LA + ++ P + +G+ P E Q + + + L FY L A YL RG + E+I F IG+A WD + K + + AG+L+ + G Y DRFR RVMFPI D G V+ F GR LG+ PKY+NSPET +FHK + LY Y+A+ + R ++ EG+ DV + ++A++GTS T DH+++L R +I CYD D+AG +A +A E G ++R +PDG DPD ++K G E F+ + + ++ + AF +LS R L IS + G + +Y++Q' ); is( join( ' ', $hsp->seq_inds( 'query', 'nomatch', 1 ) ), '24-29 39-47 54-56 60-71 90-98 129-137 150-152 156-158 180-182 192-194 219-221 228-236 243-251 255-263 267-269 279-284 291-296 300-302 309-311 315-317 321-332 342-344 351-362 366-368 372-374 378-383 387-389 393-398 405-413 417-440 444-449 456-461 468-470 474-476 486-491 495-497 510-518 525-527 531-533 537-557 561-569 573-578 594-599 603-605 609-614 618-620 633-635 654-656 660-665 669-671 678-680 684-686 693-695 705-710 738-740 753-755 759-761 768-773 786-797 801-806 810-812 819-821 831-833 840-860 864-869 894-896 900-902 921-923 927-938 945-947 957-959 972-974 981-986 993-995 999-1013 1017-1019 1029-1037 1050-1052 1062-1067 1077-1079 1083-1085 1089-1091 1098-1103 1107-1109 1113-1115 1122-1124 1128-1130 1137-1163 1173-1184 1188-1208 1212-1217 1224-1226 1230-1232 1236-1244 1248-1250' ); is( join( ' ', $hsp->seq_inds( 'query', 'mismatch', 1 ) ), '24-29 39-47 54-56 60-71 90-98 129-137 150-152 156-158 180-182 192-194 219-221 228-236 243-251 255-263 267-269 279-284 291-296 300-302 309-311 315-317 342-344 351-362 366-368 372-374 378-383 387-389 393-398 405-413 420-440 444-449 456-461 468-470 474-476 486-491 495-497 510-518 525-527 531-533 537-557 
561-569 573-578 594-599 603-605 609-614 633-635 654-656 660-665 669-671 678-680 684-686 693-695 705-710 738-740 753-755 759-761 768-773 786-797 801-806 810-812 819-821 831-833 840-860 864-869 894-896 900-902 921-923 927-938 945-947 957-959 972-974 981-986 993-995 999-1013 1017-1019 1029-1037 1050-1052 1062-1067 1077-1079 1083-1085 1089-1091 1098-1103 1113-1115 1122-1124 1128-1130 1137-1163 1173-1184 1188-1208 1212-1217 1224-1226 1230-1232 1236-1244' ); is( join( ' ', $hsp->seq_inds( 'hit', 'nomatch', 1 ) ), '2 3 7-9 12 14-17 24-26 37-39 44 46 54 58 67 70-72 75-77 79-81 83 87 88 91 92 94 97 99 104 106-108 110-113 115 117 119 120 122 124 125 128-130 132-138 140 141 144 145 148 150 154 155 157 162-164 167 169 171-177 179-181 183 184 190 191 193 195 196 202 209 211 212 214 217 219 222 226 227 237 242 244 247 248 253-256 258 259 261 264 268 271-277 279 280 289 291 298 300-303 306 310 315 318 319 322 324-331 333 337-339 344 348 349 353 355 357 360 361 364 367 369 372-380 384-387 389-395 397 398 401 403 405-407' ); is( join( ' ', $hsp->seq_inds( 'hit', 'mismatch', 1 ) ), '2 3 7-9 12 14-17 24-26 37-39 44 46 54 58 67 70-72 75-77 79-81 83 87 88 91 92 94 97 99 104 110-113 115 117 119 120 122 124 125 128-130 132-138 140 141 144 145 148 150 154 155 157 162-164 167 169 171-177 179-181 183 184 190 191 193 195 196 202 209 211 212 214 217 219 222 226 227 237 242 244 247 248 253-256 258 259 261 264 268 271-277 279 280 289 291 298 300-303 306 310 315 318 319 322 324 325 329-331 333 337-339 344 348 349 353 355 357 360 361 364 367 369 372-380 384-387 389-395 397 398 401 403 405-407' ); is( join( ' ', $hsp->seq_inds( 'query', 'gaps', 1 ) ), '347 1004' ); is( join( ' ', $hsp->seq_inds( 'hit', 'gaps', 1 ) ), '100 131 197 362 408' ); is( $hsp->ambiguous_seq_inds, 'query' ); is( $hsp->n, 1 ); $hsps_left--; } is( $hsps_left, 0 ); } last if ( $count++ > @valid ); } is( @valid, 0 ); #Trickier WU-Blast $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => 
test_input_file('tricky.wublast') );
$result = $searchio->next_result;
my $hits_left = 1;
while ( my $hit = $result->next_hit ) {

    # frac_aligned_hit used to be over 1, frac_identical & frac_conserved are still too wrong
    TODO: {
        local $TODO = 'frac_identical & frac_conserved are still too wrong';
        cmp_ok sprintf( "%.3f", $hit->frac_identical ), '>', 0.9;
        cmp_ok sprintf( "%.3f", $hit->frac_conserved ), '<=', 1;
    }
    is( sprintf( "%.2f", $hit->frac_aligned_query ), '0.92' );
    is( sprintf( "%.2f", $hit->frac_aligned_hit ), '0.91' );
    $hits_left--;
}
is( $hits_left, 0 );

# More frac_ method testing, this time on ncbi blastn
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('frac_problems.blast')
);
my @expected = ( "1.000", "0.943" );
while ( my $result = $searchio->next_result ) {
    my $hit = $result->next_hit;
    is( $hit->frac_identical, shift @expected );
}
is( @expected, 0 );

# And even more: frac_aligned_query should never be over 1!
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('frac_problems2.blast')
);
$result = $searchio->next_result;
$hit = $result->next_hit;
is $hit->frac_aligned_query, 0.97;

# Also, start and end should be sane
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('frac_problems3.blast')
);
$result = $searchio->next_result;
$hit = $result->next_hit;
is $hit->start('sbjct'), 207;
is $hit->end('sbjct'), 1051;

#WU-TBlastN test
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('dnaEbsub_ecoli.wutblastn')
);
$result = $searchio->next_result;
is( $result->algorithm_reference, 'Gish, W.
(1996-2000) http://blast.wustl.edu ' ); is( $result->database_name, 'ecoli.nt' ); is( $result->database_letters, 4662239 ); is( $result->database_entries, 400 ); is( $result->algorithm, 'TBLASTN' ); like( $result->algorithm_version, qr/^2\.0MP\-WashU/ ); is( $result->query_name, 'gi|142865|gb|AAA22406.1|' ); is( $result->query_description, 'DNA primase' ); is( $result->query_accession, 'AAA22406.1' ); is( $result->query_gi, 142865 ); is( $result->query_length, 603 ); is( $result->get_parameter('matrix'), 'blosum62' ); is( $result->get_statistic('lambda'), '0.320' ); is( $result->get_statistic('kappa'), 0.136 ); is( $result->get_statistic('entropy'), 0.387 ); is( $result->get_statistic('dbentries'), 400 ); @valid = ( [ 'gi|1789441|gb|AE000388.1|AE000388', 10334, 'AE000388', '1.4e-73', 671 ] ); $count = 0; while ( my $hit = $result->next_hit ) { my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( $hit->accession, shift @$d ); float_is( $hit->significance, shift @$d ); is( $hit->raw_score, shift @$d ); if ( $count == 0 ) { my $hsps_left = 1; while ( my $hsp = $hit->next_hsp ) { is( $hsp->query->start, 1 ); is( $hsp->query->end, 415 ); is( $hsp->query->strand, 0 ); is( $hsp->hit->start, 4778 ); is( $hsp->hit->end, 6016 ); is( $hsp->hit->strand, 1 ); is( $hsp->length('total'), 421 ); float_is( $hsp->evalue, 1.4e-73 ); float_is( $hsp->pvalue, 1.4e-73 ); is( $hsp->score, 671 ); is( $hsp->bits, 265.8 ); is( sprintf( "%.2f", $hsp->percent_identity ), 35.87 ); is( sprintf( "%.4f", $hsp->frac_identical('hit') ), 0.3656 ); is( sprintf( "%.4f", $hsp->frac_identical('query') ), 0.3639 ); is( sprintf( "%.4f", $hsp->frac_conserved('hsp') ), 0.5297 ); is( $hsp->query->frame(), 0 ); is( $hsp->hit->frame(), 1 ); is( $hsp->gaps('query'), 6 ); is( $hsp->gaps('hit'), 8 ); is( $hsp->gaps, 14 ); is( $hsp->query_string, 
'MGNRIPDEIVDQVQKSADIVEVIGDYVQLKKQGRNYFGLCPFHGESTPSFSVSPDKQIFHCFGCGAGGNVFSFLRQMEGYSFAESVSHLADKYQIDFPDDITVHSGARP---ESSGEQKMAEAHELLKKFYHHLLINTKEGQEALDYLLSRGFTKELINEFQIGYALDSWDFITKFLVKRGFSEAQMEKAGLLIRREDGSGYFDRFRNRVMFPIHDHHGAVVAFSGRALGSQQPKYMNSPETPLFHKSKLLYNFYKARLHIRKQERAVLFEGFADVYTAVSSDVKESIATMGTSLTDDHVKILRRNVEEIILCYDSDKAGYEATLKASELL---QKKGCKVRVAMIPDGLDPDDYIKKFGGEKFKNDIIDASVTVMAFKMQYFRKGKNLSDEGDRLAYIKDVLKEISTLSGSLEQEVYVKQ' ); is( $hsp->hit_string, 'MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSFTVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE----AGSGPSQIERHQRQTLYQLMDGLNTFYQQSL-QQPVATSARQYLEKRGLSHEVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSY-DRFRERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYEAQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRATNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLVRKEGKEAFEARM-EQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLISQVPGETLR-IYLRQ' ); is( $hsp->homology_string, 'M RIP ++ + DIV++I V+LKKQG+N+ CPFH E TPSF+V+ +KQ +HCFGCGA GN FL + F E+V LA + ++ P + +G+ P E Q + + + L FY L A YL RG + E+I F IG+A WD + K + + AG+L+ + G Y DRFR RVMFPI D G V+ F GR LG+ PKY+NSPET +FHK + LY Y+A+ + R ++ EG+ DV + ++A++GTS T DH+++L R +I CYD D+AG +A +A E G ++R +PDG DPD ++K G E F+ + + ++ + AF +LS R L IS + G + +Y++Q' ); is( join( ' ', $hsp->seq_inds( 'query', 'nomatch', 1 ) ), '2 3 7-9 12 14-17 24-26 37-39 44 46 54 58 67 70-72 75-77 79-81 83 87 88 91 92 94 97 99 101-104 108 111-114 116 118 120 121 123 125 126 129-131 133-140 142 143 146 147 150 152 156 157 159 164-166 169 171 173-179 181-183 185 186 192 193 195 197 198 200 205 212 214 215 217 220 222 225 229 230 240 245 247 250 251 256-259 261 262 264 267 271 274-280 282 283 292 294 301 303-306 309 313 318 321 322 325 327-331 333 337-339 344 348 349 353 355 357 360 361 363 365 368 370 373-381 385-388 390-396 398 399 402 404 406-408 410' ); is( join( ' ', $hsp->seq_inds( 'hit', 'nomatch', 1 ) ), '4781-4786 4796-4804 4811-4813 4817-4828 4847-4855 4886-4894 4907-4909 4913-4915 4937-4939 4949-4951 4976-4978 4985-4993 5000-5008 
5012-5020 5024-5026 5036-5041 5048-5053 5057-5059 5066-5068 5072-5074 5087-5089 5093-5101 5105-5116 5120-5122 5126-5128 5132-5137 5141-5143 5147-5152 5159-5167 5171-5191 5195-5200 5207-5212 5219-5221 5225-5227 5237-5242 5246-5248 5261-5269 5276-5278 5282-5284 5288-5308 5312-5320 5324-5329 5345-5350 5354-5356 5360-5365 5381-5383 5402-5404 5408-5413 5417-5419 5426-5428 5432-5434 5441-5443 5453-5458 5486-5488 5501-5503 5507-5509 5516-5521 5534-5545 5549-5554 5558-5560 5567-5569 5579-5581 5588-5608 5612-5617 5642-5644 5648-5650 5669-5671 5675-5686 5693-5695 5705-5707 5720-5722 5729-5734 5741-5743 5747-5770 5774-5776 5786-5794 5807-5809 5819-5824 5834-5836 5840-5842 5846-5848 5855-5860 5867-5869 5876-5878 5882-5884 5891-5917 5927-5938 5942-5962 5966-5971 5978-5980 5984-5986 5990-5998' ); is( join( ' ', $hsp->seq_inds( 'query', 'gaps', 1 ) ), '109 328' ); is( join( ' ', $hsp->seq_inds( 'hit', 'gaps', 1 ) ), '5077 5170 5368 5863 6001' ); is( $hsp->ambiguous_seq_inds, 'subject' ); is( $hsp->n, 1 ); $hsps_left--; } is( $hsps_left, 0 ); } last if ( $count++ > @valid ); } is( $count, 1 ); # WU-BLAST TBLASTX $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('dnaEbsub_ecoli.wutblastx') ); $result = $searchio->next_result; is( $result->algorithm_reference, 'Gish, W. 
(1996-2000) http://blast.wustl.edu ' ); is( $result->database_name, 'ecoli.nt' ); is( $result->database_letters, 4662239 ); is( $result->database_entries, 400 ); is( $result->algorithm, 'TBLASTX' ); like( $result->algorithm_version, qr/^2\.0MP\-WashU/ ); is( $result->query_name, 'gi|142864|gb|M10040.1|BACDNAE' ); is( $result->query_description, 'B.subtilis dnaE gene encoding DNA primase, complete cds' ); is( $result->query_accession, 'M10040.1' ); is( $result->query_gi, 142864 ); is( $result->query_length, 2001 ); is( $result->get_parameter('matrix'), 'blosum62' ); is( $result->get_statistic('lambda'), 0.318 ); is( $result->get_statistic('kappa'), 0.135 ); is( $result->get_statistic('entropy'), 0.401 ); is( $result->get_statistic('dbentries'), 400 ); @valid = ( [ 'gi|1789441|gb|AE000388.1|AE000388', 10334, 'AE000388', '6.4e-70', 318, 148.6 ], [ 'gi|2367383|gb|AE000509.1|AE000509', 10589, 'AE000509', 1, 59, 29.9 ] ); $count = 0; while ( my $hit = $result->next_hit ) { my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( $hit->accession, shift @$d ); # using e here to deal with 0.9992 coming out right here as well float_is( $hit->significance, shift @$d ); is( $hit->raw_score, shift @$d ); is( $hit->bits, shift @$d ); if ( $count == 0 ) { my $hspcounter = 0; while ( my $hsp = $hit->next_hsp ) { $hspcounter++; if ( $hspcounter == 3 ) { # let's actually look at the 3rd HSP is( $hsp->query->start, 441 ); is( $hsp->query->end, 617 ); is( $hsp->query->strand, 1 ); is( $hsp->hit->start, 5192 ); is( $hsp->hit->end, 5368 ); is( $hsp->hit->strand, 1 ); is( $hsp->length('total'), 59 ); float_is( $hsp->evalue, 6.4e-70 ); float_is( $hsp->pvalue, 6.4e-70 ); is( $hsp->score, 85 ); is( $hsp->bits, 41.8 ); is( sprintf( "%.2f", $hsp->percent_identity ), '32.20' ); is( sprintf( "%.3f", $hsp->frac_identical('hit') ), 0.322 ); is( sprintf( "%.3f", $hsp->frac_identical('query') ), 0.322 ); is( sprintf( "%.4f", $hsp->frac_conserved('hsp') ), 0.4746 ); is( 
$hsp->query->frame(), 2 ); is( $hsp->hit->frame(), 1 ); is( $hsp->gaps('query'), 0 ); is( $hsp->gaps('hit'), 0 ); is( $hsp->gaps, 0 ); is( $hsp->n, 1 ); is( $hsp->query_string, 'ALDYLLSRGFTKELINEFQIGYALDSWDFITKFLVKRGFSEAQMEKAGLLIRREDGSGY' ); is( $hsp->hit_string, 'ARQYLEKRGLSHEVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSY' ); is( $hsp->homology_string, 'A YL RG + E+I F IG+A WD + K + + AG+L+ + G Y' ); is( join( ' ', $hsp->seq_inds( 'query', 'nomatch', 1 ) ), '444-449 456-461 468-470 474-476 486-491 495-497 510-518 525-527 531-533 537-557 561-569 573-578 594-599 603-605 609-614' ); is( join( ' ', $hsp->seq_inds( 'hit', 'nomatch', 1 ) ), '5195-5200 5207-5212 5219-5221 5225-5227 5237-5242 5246-5248 5261-5269 5276-5278 5282-5284 5288-5308 5312-5320 5324-5329 5345-5350 5354-5356 5360-5365' ); is( $hsp->ambiguous_seq_inds, 'query/subject' ); last; } } is( $hspcounter, 3 ); } elsif ( $count == 1 ) { my $hsps_to_do = 1; while ( my $hsp = $hit->next_hsp ) { is( $hsp->query->start, 587 ); is( $hsp->query->end, 706 ); is( $hsp->query->strand, -1 ); is( $hsp->hit->start, 4108 ); is( $hsp->hit->end, 4227 ); is( $hsp->hit->strand, -1 ); is( $hsp->length('total'), 40 ); float_is( $hsp->evalue, 7.1 ); float_is( $hsp->pvalue, '1.00' ); is( $hsp->score, 59 ); is( $hsp->bits, 29.9 ); is( sprintf( "%.2f", $hsp->percent_identity ), '37.50' ); is( sprintf( "%.4f", $hsp->frac_identical('hit') ), '0.3750' ); is( sprintf( "%.4f", $hsp->frac_identical('query') ), '0.3750' ); is( sprintf( "%.4f", $hsp->frac_conserved('hsp') ), '0.4750' ); is( $hsp->query->frame(), 2 ); is( $hsp->hit->frame(), 2 ); is( $hsp->gaps('query'), 0 ); is( $hsp->gaps('hit'), 0 ); is( $hsp->gaps, 0 ); is( $hsp->n, 1 ); is( $hsp->query_string, 'WLPRALPEKATTAP**SWIGNMTRFLKRSKYPLPSSRLIR' ); is( $hsp->hit_string, 'WLSRTTVGSSTVSPRTFWITRMKVKLSSSKVTLPSTKSTR' ); is( $hsp->homology_string, 'WL R +T +P WI M L SK LPS++ R' ); $hsps_to_do--; last; } is( $hsps_to_do, 0 ); } last if ( $count++ > @valid ); } is( $count, 2 ); # 
WU-BLAST -echofilter option test (Bug 2388)
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('echofilter.wublastn')
);
$result = $searchio->next_result;
is( $result->algorithm_reference, 'Gish, W. (1996-2006) http://blast.wustl.edu ' );
is( $result->database_name, 'NM_003201.fa' );
is( $result->database_letters, 1936 );
is( $result->database_entries, 1 );
is( $result->algorithm, 'BLASTN' );
like( $result->algorithm_version, qr/^2\.0MP\-WashU/ );
like( $result->query_name,
    qr/ref|NM_003201.1| Homo sapiens transcription factor A, mitochondrial \(TFAM\), mRNA/
);
is( $result->query_accession, 'NM_003201.1' );
is( $result->query_length, 1936 );
is( $result->get_statistic('lambda'), 0.192 );
is( $result->get_statistic('kappa'), 0.182 );
is( $result->get_statistic('entropy'), 0.357 );
is( $result->get_statistic('dbletters'), 1936 );
is( $result->get_statistic('dbentries'), 1 );
is( $result->get_parameter('matrix'), '+5,-4' );

@valid = ( [ 'ref|NM_003201.1|', 1936, 'NM_003201', '0', 9680 ], );
$count = 0;

while ( $hit = $result->next_hit ) {
    my $d = shift @valid;
    is( $hit->name, shift @$d );
    is( $hit->length, shift @$d );
    is( $hit->accession, shift @$d );
    float_is( $hit->significance, shift @$d );
    is( $hit->raw_score, shift @$d );

    if ( $count == 0 ) {
        my $hsps_left = 1;
        while ( my $hsp = $hit->next_hsp ) {
            is( $hsp->query->start, 1 );
            is( $hsp->query->end, 1936 );
            is( $hsp->hit->start, 1 );
            is( $hsp->hit->end, 1936 );
            is( $hsp->length('total'), 1936 );
            float_is( $hsp->evalue, 0. );
            float_is( $hsp->pvalue, '0.'
            );
            is( $hsp->score, 9680 );
            is( $hsp->bits, 1458.4 );
            is( $hsp->percent_identity, 100 );
            is( $hsp->frac_identical('query'), 1.00 );
            is( $hsp->frac_identical('hit'), 1.00 );
            is( $hsp->gaps, 0 );
            is( $hsp->n, 1 );
            $hsps_left--;
        }
        is( $hsps_left, 0 );
    }
    last if ( $count++ > @valid );
}
is( @valid, 0 );

# Do a multiblast report test
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('multi_blast.bls')
);

@expected = qw(CATH_RAT CATL_HUMAN CATL_RAT PAPA_CARPA);
my $results_left = 4;
while ( my $result = $searchio->next_result ) {
    like($result->algorithm_reference, qr/Gapped BLAST and PSI-BLAST/);
    is( $result->query_name, shift @expected, "Multiblast query test" );
    $results_left--;
}
is( $results_left, 0 );

# Test GCGBlast parsing
$searchio = Bio::SearchIO->new(
    '-format' => 'blast',
    '-file' => test_input_file('test.gcgblast')
);
$result = $searchio->next_result();
like($result->algorithm_reference,qr/Gapped BLAST and PSI-BLAST/);
is( $result->query_name, '/v0/people/staji002/test.gcg' );
is( $result->algorithm, 'BLASTP' );
is( $result->algorithm_version, '2.2.1 [Apr-13-2001]' );
is( $result->database_name, 'pir' );
is( $result->database_entries, 274514 );
is( $result->database_letters, 93460074 );
is( $result->get_statistic('querylength'), 44 );
is( $result->get_statistic('effectivedblength'), 65459646 );
is( $result->get_statistic('effectivespace'), 2880224424 );
is( $result->get_statistic('effectivespaceused'), 2880224424 );

$hit = $result->next_hit;
is( $hit->description, 'F22B7.10 protein - Caenorhabditis elegans' );
is( $hit->name, 'PIR2:S44629' );
is( $hit->length, 628 );
is( $hit->accession, 'PIR2:S44629' );
float_is( $hit->significance, 2e-08 );
is( $hit->raw_score, 136 );
is( $hit->bits, '57.0' );

$hsp = $hit->next_hsp;
float_is( $hsp->evalue, 2e-08 );
is( $hsp->bits, '57.0' );
is( $hsp->score, 136 );
is( int( $hsp->percent_identity ), 28 );
is( sprintf( "%.2f", $hsp->frac_identical('query') ), 0.29 );
is(
$hsp->frac_conserved('total'), 69 / 135 ); is( $hsp->gaps('total'), 8 ); is( $hsp->gaps('hit'), 6 ); is( $hsp->gaps('query'), 2 ); is( $hsp->hit->start, 342 ); is( $hsp->hit->end, 470 ); is( $hsp->query->start, 3 ); is( $hsp->query->end, 135 ); is( $hsp->query_string, 'CAAEFDFMEKETPLRYTKTXXXXXXXXXXXXXXRKIISDMWGVLAKQQTHVRKHQFDHGELVYHALQLLAYTALGILIMRLKLFLTPYMCVMASLICSRQLFGW--LFCKVHPGAIVFVILAAMSIQGSANLQTQ' ); is( $hsp->hit_string, 'CSAEFDFIQYSTIEKLCGTLLIPLALISLVTFVFNFVKNT-NLLWRNSEEIG----ENGEILYNVVQLCCSTVMAFLIMRLKLFMTPHLCIVAALFANSKLLGGDRISKTIRVSALVGVI-AILFYRGIPNIRQQ' ); is( $hsp->homology_string, 'C+AEFDF++ T + T + + +L + + ++GE++Y+ +QL T + LIMRLKLF+TP++C++A+L + +L G + + A+V VI A + +G N++ Q' ); #test all the database accession number formats $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('testdbaccnums.out') ); $result = $searchio->next_result; like($result->algorithm_reference,qr/Gapped BLAST and PSI-BLAST/); is( $result->rid, '1036160600-011802-21377' ); is( $result->get_statistic('querylength'), 9 ); is( $result->get_statistic('effectivedblength'), 35444647 ); is( $result->get_statistic('effectivespace'), 319001823 ); is( $result->get_statistic('effectivespaceused'), 319001823 ); @valid = ( [ 'pir||T14789', 'T14789', 'T14789', 'CAB53709', 'AAH01726' ], [ 'gb|NP_065733.1|CYT19', 'NP_065733', 'CYT19' ], [ 'emb|XP_053690.4|Cyt19', 'XP_053690' ], [ 'dbj|NP_056277.2|DKFZP586L0724', 'NP_056277' ], [ 'prf||XP_064862.2', 'XP_064862' ], [ 'pdb|BAB13968.1|1', 'BAB13968' ], [ 'sp|Q16478|GLK5_HUMAN', 'Q16478' ], [ 'pat|US|NP_002079.2', 'NP_002079' ], [ 'bbs|NP_079463.2|', 'NP_079463' ], [ 'gnl|db1|NP_002444.1', 'NP_002444' ], [ 'ref|XP_051877.1|', 'XP_051877' ], [ 'lcl|AAH16829.1|', 'AAH16829' ], [ 'gi|1|gb|NP_065733.1|CYT19', 'NP_065733' ], [ 'gi|2|emb|XP_053690.4|Cyt19', 'XP_053690' ], [ 'gi|3|dbj|NP_056277.2|DKFZP586L0724', 'NP_056277' ], [ 'gi|4|pir||T14789', 'T14789' ], [ 'gi|5|prf||XP_064862.2', 'XP_064862' ], [ 'gi|6|pdb|BAB13968.1|1', 
'BAB13968' ], [ 'gi|7|sp|Q16478|GLK5_HUMAN', 'Q16478' ], [ 'gi|8|pat|US|NP_002079.2', 'NP_002079' ], [ 'gi|9|bbs|NP_079463.2|', 'NP_079463' ], [ 'gi|10|gnl|db1|NP_002444.1', 'NP_002444' ], [ 'gi|11|ref|XP_051877.1|', 'XP_051877' ], [ 'gi|12|lcl|AAH16829.1|', 'AAH16829' ], [ 'MY_test_ID', 'MY_test_ID' ] ); $hit = $result->next_hit; my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->accession, shift @$d ); my @accnums = $hit->each_accession_number; foreach my $a (@accnums) { is( $a, shift @$d ); } $d = shift @valid; $hit = $result->next_hit; is( $hit->name, shift @$d ); is( $hit->accession, shift @$d ); is( $hit->locus, shift @$d ); $hits_left = 23; while ( $hit = $result->next_hit ) { my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->accession, shift @$d ); $hits_left--; } is( $hits_left, 0 ); # Parse MEGABLAST # parse the BLAST-like output my $infile = test_input_file('503384.MEGABLAST.2'); my $in = Bio::SearchIO->new( -file => $infile, -format => 'blast' ); # this is megablast blast-like output my $r = $in->next_result; my @dcompare = ( [ 'Contig3700', 5631, 396, 785, '0.0', 785, '0.0', 396, 639, 12, 8723, 9434, 1, 4083, 4794, -1 ], [ 'Contig3997', 12734, 335, 664, '0.0', 664, '0.0', 335, 401, 0, 1282, 1704, 1, 1546, 1968, -1 ], [ 'Contig634', 858, 245, 486, '1e-136', 486, '1e-136', 245, 304, 3, 7620, 7941, 1, 1, 321, -1 ], [ 'Contig1853', 2314, 171, 339, '1e-91', 339, '1e-91', 171, 204, 0, 6406, 6620, 1, 1691, 1905, 1 ] ); like($r->algorithm_reference,qr/A greedy algorithm for aligning DNA sequences/); is( $r->algorithm, 'MEGABLAST' ); is( $r->query_name, '503384' ); is( $r->query_description, '11337 bp 2 contigs' ); is( $r->query_length, 11337 ); is( $r->database_name, 'cneoA.nt' ); is( $r->database_letters, 17206226 ); is( $r->database_entries, 4935 ); is( $r->get_statistic('querylength'), 11318 ); is( $r->get_statistic('effectivedblength'), 17112461 ); is( $r->get_statistic('effectivespace'), 193678833598 ); is( 
$r->get_statistic('effectivespaceused'), 0 ); $hits_left = 4; while ( my $hit = $r->next_hit ) { my $d = shift @dcompare; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( $hit->raw_score, shift @$d ); is( $hit->bits, shift @$d ); float_is( $hit->significance, shift @$d ); my $hsp = $hit->next_hsp; is( $hsp->bits, shift @$d ); float_is( $hsp->evalue, shift @$d ); is( $hsp->score, shift @$d ); is( $hsp->num_identical, shift @$d ); is( $hsp->gaps('total'), shift @$d ); is( $hsp->query->start, shift @$d ); is( $hsp->query->end, shift @$d ); is( $hsp->query->strand, shift @$d ); is( $hsp->hit->start, shift @$d ); is( $hsp->hit->end, shift @$d ); is( $hsp->hit->strand, shift @$d ); is( $hsp->n, 1 ); $hits_left--; } is( $hits_left, 0 ); # Let's test RPS-BLAST my $parser = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('ecoli_domains.rpsblast') ); $r = $parser->next_result; is( $r->algorithm, 'RPS-BLAST(BLASTP)'); is( $r->algorithm_version, '2.2.4 [Aug-26-2002]'); is( $r->algorithm_reference, undef ); is( $r->query_name, 'gi|1786183|gb|AAC73113.1|' ); is( $r->query_gi, 1786183 ); is( $r->num_hits, 7 ); is( $r->get_statistic('querylength'), 438 ); is( $r->get_statistic('effectivedblength'), 31988 ); is( $r->get_statistic('effectivespace'), 14010744 ); is( $r->get_statistic('effectivespaceused'), 24054976 ); $hit = $r->next_hit; is( $hit->name, 'gnl|CDD|3919' ); float_is( $hit->significance, 0.064 ); is( $hit->bits, 28.3 ); is( $hit->raw_score, 63 ); $hsp = $hit->next_hsp; is( $hsp->query->start, 599 ); is( $hsp->query->end, 655 ); is( $hsp->hit->start, 23 ); is( $hsp->hit->end, 76 ); # Test PSI-BLAST parsing $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('psiblastreport.out') ); $result = $searchio->next_result; like($result->algorithm_reference, qr/Gapped BLAST and PSI-BLAST/); is( $result->database_name, '/home/peter/blast/data/swissprot.pr' ); is( $result->database_entries, 88780 ); is( 
$result->database_letters, 31984247 ); is( $result->algorithm, 'BLASTP' ); like( $result->algorithm_version, qr/^2\.0\.14/ ); is( $result->query_name, 'CYS1_DICDI' ); is( $result->query_length, 343 ); is( $result->get_statistic('kappa'), 0.0491 ); cmp_ok( $result->get_statistic('lambda'), '==', 0.270 ); cmp_ok( $result->get_statistic('entropy'), '==', 0.230 ); is( $result->get_statistic('dbletters'), 31984247 ); is( $result->get_statistic('dbentries'), 88780 ); is( $result->get_statistic('effective_hsplength'), 49 ); is( $result->get_statistic('querylength'), 294 ); is( $result->get_statistic('effectivedblength'), 27634027 ); is( $result->get_statistic('effectivespace'), 8124403938 ); is( $result->get_statistic('effectivespaceused'), 8124403938 ); is( $result->get_parameter('matrix'), 'BLOSUM62' ); is( $result->get_parameter('gapopen'), 11 ); is( $result->get_parameter('gapext'), 1 ); my @valid_hit_data = ( [ 'sp|P04988|CYS1_DICDI', 343, 'P04988', '0', 721 ], [ 'sp|P43295|A494_ARATH', 313, 'P43295', '1e-75', 281 ], [ 'sp|P25804|CYSP_PEA', 363, 'P25804', '1e-74', 278 ] ); my @valid_iter_data = ( [ 127, 127, 0, 109, 18, 0, 0, 0, 0 ], [ 157, 40, 117, 2, 38, 0, 109, 3, 5 ] ); my $iter_count = 0; while ( $iter = $result->next_iteration ) { $iter_count++; my $di = shift @valid_iter_data; is( $iter->number, $iter_count ); is( $iter->num_hits, shift @$di ); is( $iter->num_hits_new, shift @$di ); is( $iter->num_hits_old, shift @$di ); is( scalar( $iter->newhits_below_threshold ), shift @$di ); is( scalar( $iter->newhits_not_below_threshold ), shift @$di ); is( scalar( $iter->newhits_unclassified ), shift @$di ); is( scalar( $iter->oldhits_below_threshold ), shift @$di ); is( scalar( $iter->oldhits_newly_below_threshold ), shift @$di ); is( scalar( $iter->oldhits_not_below_threshold ), shift @$di ); my $hit_count = 0; if ( $iter_count == 1 ) { while ( $hit = $result->next_hit ) { my $d = shift @valid_hit_data; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( 
$hit->accession, shift @$d ); float_is( $hit->significance, shift @$d ); is( $hit->bits, shift @$d ); if ( $hit_count == 1 ) { my $hsps_left = 1; while ( my $hsp = $hit->next_hsp ) { is( $hsp->query->start, 32 ); is( $hsp->query->end, 340 ); is( $hsp->hit->start, 3 ); is( $hsp->hit->end, 307 ); is( $hsp->length('total'), 316 ); is( $hsp->start('hit'), $hsp->hit->start ); is( $hsp->end('query'), $hsp->query->end ); is( $hsp->strand('sbjct'), $hsp->subject->strand ) ; # alias for hit float_is( $hsp->evalue, 1e-75 ); is( $hsp->score, 712 ); is( $hsp->bits, 281 ); is( sprintf( "%.1f", $hsp->percent_identity ), 46.5 ); is( sprintf( "%.4f", $hsp->frac_identical('query') ), 0.4757 ); is( sprintf( "%.3f", $hsp->frac_identical('hit') ), 0.482 ); is( $hsp->gaps, 18 ); is( $hsp->n, 1 ); $hsps_left--; } is( $hsps_left, 0 ); } last if ( $hit_count++ > @valid_hit_data ); } } } is( @valid_hit_data, 0 ); is( @valid_iter_data, 0 ); # Test filtering $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('ecolitst.bls'), '-signif' => 1e-100 ); @valid = qw(gb|AAC73113.1|); $r = $searchio->next_result; while ( my $hit = $r->next_hit ) { is( $hit->name, shift @valid ); } $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('ecolitst.bls'), '-score' => 100 ); @valid = qw(gb|AAC73113.1| gb|AAC76922.1| gb|AAC76994.1|); $r = $searchio->next_result; while ( my $hit = $r->next_hit ) { is( $hit->name, shift @valid ); } is( @valid, 0 ); $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('ecolitst.bls'), '-bits' => 200 ); @valid = qw(gb|AAC73113.1| gb|AAC76922.1|); $r = $searchio->next_result; while ( my $hit = $r->next_hit ) { is( $hit->name, shift @valid ); } is( @valid, 0 ); my $filt_func = sub { my $hit = shift; $hit->frac_identical('query') >= 0.31; }; $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('ecolitst.bls'), '-hit_filter' => $filt_func ); @valid = qw(gb|AAC73113.1| 
gb|AAC76994.1|); $r = $searchio->next_result; while ( my $hit = $r->next_hit ) { is( $hit->name, shift @valid ); } is( @valid, 0 ); # bl2seq parsing testing # this is blastp bl2seq $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('bl2seq.out') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->query_name, '' ); is( $result->algorithm, 'BLASTP' ); is( $result->algorithm_reference, undef ); is( $result->get_statistic('querylength'), 320 ); is( $result->get_statistic('effectivedblength'), 339 ); is( $result->get_statistic('effectivespace'), 108480 ); is( $result->get_statistic('effectivespaceused'), 108480 ); $hit = $result->next_hit; is( $hit->name, 'ALEU_HORVU' ); is( $hit->length, 362 ); $hsp = $hit->next_hsp; is( $hsp->score, 481 ); is( $hsp->bits, 191 ); is( int $hsp->percent_identity, 34 ); float_is( $hsp->evalue, 2e-53 ); is( int( $hsp->frac_conserved * $hsp->length ), 167 ); is( $hsp->query->start, 28 ); is( $hsp->query->end, 343 ); is( $hsp->hit->start, 60 ); is( $hsp->hit->end, 360 ); is( $hsp->gaps, 27 ); # this is blastn bl2seq $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('bl2seq.blastn.rev') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->query_name, '' ); is( $result->algorithm, 'BLASTN' ); is( $result->algorithm_reference, undef ); is( $result->query_length, 180 ); is( $result->get_statistic('querylength'), 174 ); is( $result->get_statistic('effectivedblength'), 173 ); is( $result->get_statistic('effectivespace'), 30102 ); is( $result->get_statistic('effectivespaceused'), 30102 ); $hit = $result->next_hit; is( $hit->length, 179 ); is( $hit->name, 'human' ); $hsp = $hit->next_hsp; is( $hsp->score, 27 ); is( $hsp->bits, '54.0' ); is( int $hsp->percent_identity, 88 ); float_is( $hsp->evalue, 2e-12 ); is( int( $hsp->frac_conserved * $hsp->length ), 83 ); is( $hsp->query->start, 94 ); is( 
$hsp->query->end, 180 ); is( $hsp->query->strand, 1 ); is( $hsp->hit->strand, -1 ); is( $hsp->hit->start, 1 ); is( $hsp->hit->end, 94 ); is( $hsp->gaps, 7 ); is( $hsp->n, 1 ); # this is blastn bl2seq $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('bl2seq.blastn') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->query_name, '' ); is( $result->query_length, 180 ); is( $result->algorithm, 'BLASTN' ); is( $result->algorithm_reference, undef ); is( $result->get_statistic('querylength'), 174 ); is( $result->get_statistic('effectivedblength'), 173 ); is( $result->get_statistic('effectivespace'), 30102 ); is( $result->get_statistic('effectivespaceused'), 30102 ); $hit = $result->next_hit; is( $hit->name, 'human' ); is( $hit->length, 179 ); $hsp = $hit->next_hsp; is( $hsp->score, 27 ); is( $hsp->bits, '54.0' ); is( int $hsp->percent_identity, 88 ); float_is( $hsp->evalue, 2e-12 ); is( int( $hsp->frac_conserved * $hsp->length ), 83 ); is( $hsp->query->start, 94 ); is( $hsp->query->end, 180 ); is( $hsp->query->strand, 1 ); is( $hsp->hit->strand, 1 ); is( $hsp->hit->start, 86 ); is( $hsp->hit->end, 179 ); is( $hsp->gaps, 7 ); is( $hsp->n, 1 ); # this is blastn bl2seq+ $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('bl2seq+.blastn') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->query_name, 'gi|2695846|emb|Y13255.1|' ); is( $result->query_description, 'Acipenser baeri mRNA for immunoglobulin heavy chain, clone ScH 3.3' ); is( $result->query_length, 606 ); is( $result->algorithm, 'BLASTN' ); is( $result->algorithm_version, '2.2.29+' ); is( $result->algorithm_reference, undef ); is( $result->get_statistic('effectivespaceused'), 352836 ); is( $result->get_statistic('kappa'), 0.621 ); is( $result->get_statistic('kappa_gapped'), '0.460' ); is( $result->get_statistic('lambda'), 1.33 ); is( 
$result->get_statistic('lambda_gapped'), 1.28 ); is( $result->get_statistic('entropy'), 1.12 ); is( $result->get_statistic('entropy_gapped'), '0.850' ); $hit = $result->next_hit; is( $hit->name, 'gi|2695846|emb|Y13255.1|' ); is( $hit->description, 'Acipenser baeri mRNA for immunoglobulin heavy chain, clone ScH 3.3' ); is( $hit->length, 606 ); $hsp = $hit->next_hsp; is( $hsp->score, 606 ); is( $hsp->bits, 1120 ); is( $hsp->percent_identity, 100 ); float_is( $hsp->evalue, '0.0' ); is( $hsp->query->start, 1 ); is( $hsp->query->end, 606 ); is( $hsp->query->strand, 1 ); is( $hsp->hit->strand, 1 ); is( $hsp->hit->start, 1 ); is( $hsp->hit->end, 606 ); is( $hsp->gaps, 0 ); is( $hsp->n, 1 ); # this is blastp bl2seq $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('bl2seq.bug940.out') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->query_name, 'zinc' ); is( $result->algorithm, 'BLASTP' ); is( $result->query_description, 'finger protein 135 (clone pHZ-17) [Homo sapiens]. neo_id RS.ctg14243-000000.6.0' ); is( $result->query_length, 469 ); is( $result->get_statistic('querylength'), 446 ); is( $result->get_statistic('effectivedblength'), 446 ); is( $result->get_statistic('effectivespace'), 198916 ); is( $result->get_statistic('effectivespaceused'), 198916 ); $hit = $result->next_hit; is( $hit->name, 'gi|4507985|' ); is( $hit->ncbi_gi, 4507985 ); is( $hit->description, 'zinc finger protein 135 (clone pHZ-17) [Homo sapiens]. 
neo_id RS.ctg14243-000000.6.0' ); is( $hit->length, 469 ); $hsp = $hit->next_hsp; is( $hsp->score, 1626 ); is( $hsp->bits, 637 ); is( int $hsp->percent_identity, 66 ); float_is( $hsp->evalue, 0.0 ); is( int( $hsp->frac_conserved * $hsp->length ), 330 ); is( $hsp->query->start, 121 ); is( $hsp->query->end, 469 ); is( $hsp->hit->start, 1 ); is( $hsp->hit->end, 469 ); is( $hsp->gaps, 120 ); is( $hsp->n, 1 ); ok( $hit->next_hsp ); # there is more than one HSP here, # make sure it is parsed at least # cannot distinguish between blastx and tblastn reports # so we're only testing a blastx report for now # this is blastx bl2seq $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('bl2seq.blastx.out') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->query_name, 'AE000111.1' ); is( $result->query_description, 'Escherichia coli K-12 MG1655 section 1 of 400 of the complete genome' ); is( $result->algorithm, 'BLASTX' ); is( $result->algorithm_reference, undef ); is( $result->query_length, 720 ); is( $result->get_statistic('querylength'), undef ); is( $result->get_statistic('effectivedblength'), 787 ); is( $result->get_statistic('effectivespace'), undef ); is( $result->get_statistic('effectivespaceused'), 162122 ); $hit = $result->next_hit; is( $hit->name, 'AK1H_ECOLI' ); is( $hit->description, 'P00561 Bifunctional aspartokinase/homoserine dehydrogenase I (AKI-HDI) [Includes: Aspartokinase I ; Homoserine dehydrogenase I ]' ); is( $hit->length, 820 ); $hsp = $hit->next_hsp; is( $hsp->score, 634 ); is( $hsp->bits, 248 ); is( int $hsp->percent_identity, 100 ); float_is( $hsp->evalue, 2e-70 ); is( int( $hsp->frac_conserved * $hsp->length ), 128 ); is( $hsp->query->start, 1 ); is( $hsp->query->end, 384 ); is( $hsp->hit->start, 1 ); is( $hsp->hit->end, 128 ); is( $hsp->gaps, 0 ); is( $hsp->query->frame, 0 ); is( $hsp->hit->frame, 0 ); is( $hsp->query->strand, -1 ); is( $hsp->hit->strand, 0 ); is( $hsp->n, 1 ); 
# this is tblastx bl2seq (self against self) $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('bl2seq.tblastx.out') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->query_name, 'Escherichia' ); is( $result->algorithm, 'TBLASTX' ); is( $result->algorithm_reference, undef ); is( $result->query_description, 'coli K-12 MG1655 section 1 of 400 of the complete genome' ); is( $result->query_length, 720 ); is( $result->get_statistic('querylength'), undef ); is( $result->get_statistic('effectivedblength'), 221 ); is( $result->get_statistic('effectivespace'), undef ); is( $result->get_statistic('effectivespaceused'), 48620 ); $hit = $result->next_hit; is( $hit->name, 'gi|1786181|gb|AE000111.1|AE000111' ); is( $hit->ncbi_gi, 1786181 ); is( $hit->description, 'Escherichia coli K-12 MG1655 section 1 of 400 of the complete genome' ); is( $hit->length, 720 ); $hsp = $hit->next_hsp; is( $hsp->score, 1118 ); is( $hsp->bits, 515 ); is( int $hsp->percent_identity, 95 ); float_is( $hsp->evalue, 1e-151 ); is( int( $hsp->frac_conserved * $hsp->length ), 229 ); is( $hsp->query->start, 1 ); is( $hsp->query->end, 720 ); is( $hsp->hit->start, 1 ); is( $hsp->hit->end, 720 ); is( $hsp->gaps, 0 ); is( $hsp->query->frame, 0 ); is( $hsp->hit->frame, 0 ); is( $hsp->query->strand, 1 ); is( $hsp->hit->strand, 1 ); is( $hsp->n, 1 ); # this is NCBI tblastn $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('tblastn.out') ); $result = $searchio->next_result; isa_ok( $result, 'Bio::Search::Result::ResultI' ); is( $result->algorithm, 'TBLASTN' ); like($result->algorithm_reference,qr/Gapped BLAST and PSI-BLAST/); is( $result->get_statistic('querylength'), 102 ); is( $result->get_statistic('effectivedblength'), 4342 ); is( $result->get_statistic('effectivespace'), 442884 ); is( $result->get_statistic('effectivespaceused'), 442884 ); $hit = $result->next_hit; is( $hit->name, 
'gi|10040111|emb|AL390796.6|AL390796' );

# Test Blast parsing with B=0 (WU-BLAST)
$searchio = Bio::SearchIO->new(
    -file   => test_input_file('no_hsps.blastp'),
    -format => 'blast'
);
$result = $searchio->next_result;
like($result->algorithm_reference,qr/Gish, W. \(1996-2003\)/);
is( $result->query_name, 'mgri:MG00189.3' );
$hit = $result->next_hit;
is( $hit->name, 'mgri:MG00189.3' );
is( $hit->description, 'hypothetical protein 6892 8867 +' );
is( $hit->bits, 3098 );
float_is( $hit->significance, 0. );

$hit = $result->next_hit;
is( $hit->name, 'fgram:FG01141.1' );
is( $hit->description, 'hypothetical protein 47007 48803 -' );
is( $hit->bits, 2182 );
float_is( $hit->significance, 4.2e-226 );
is( $result->num_hits, 415 );

# Let's now test if _guess_format is doing its job correctly
my %pair = (
    'filename.blast' => 'blast',
    'filename.bls'   => 'blast',
    'f.blx'          => 'blast',
    'f.tblx'         => 'blast',
    'fast.bls'       => 'blast',
    'f.fasta'        => 'fasta',
    'f.fa'           => 'fasta',
    'f.fx'           => 'fasta',
    'f.fy'           => 'fasta',
    'f.ssearch'      => 'fasta',
    'f.SSEARCH.m9'   => 'fasta',
    'f.m9'           => 'fasta',
    'f.psearch'      => 'fasta',
    'f.osearch'      => 'fasta',
    'f.exon'         => 'exonerate',
    'f.exonerate'    => 'exonerate',
    'f.blastxml'     => 'blastxml',
    'f.xml'          => 'blastxml'
);
while ( my ( $file, $expformat ) = each %pair ) {
    is( Bio::SearchIO->_guess_format($file),
        $expformat, "$expformat for $file" );
}

# Test Wes Barris's reported bug when parsing blastcl3 output which
# has integer overflow
$searchio = Bio::SearchIO->new(
    -file   => test_input_file('hsinsulin.blastcl3.blastn'),
    -format => 'blast'
);
$result = $searchio->next_result;
is( $result->query_name, 'human' );
is( $result->database_letters(), '-24016349' );

# This is of course not the right length, but it is what blastcl3
# reports; the correct value is
is( $result->get_statistic('dbletters'), '192913178' );
is( $result->get_statistic('dbentries'), '1867771' );

# test for links and groups being parsed out of WU-BLAST properly
$searchio = Bio::SearchIO->new( -format =>
'blast', -file => test_input_file('brassica_ATH.WUBLASTN') );
ok( $result = $searchio->next_result );
ok( $hit = $result->next_hit );
ok( $hsp = $hit->next_hsp );
is( $hsp->links, '(1)-3-2' );
is( $hsp->query->strand, 1 );
is( $hsp->hit->strand, 1 );
is( $hsp->hsp_group, '1' );
is( $hsp->n, 1 );

## Web blast result parsing
$searchio = Bio::SearchIO->new(
    -format => 'blast',
    -file   => test_input_file('catalase-webblast.BLASTP')
);
ok( $result = $searchio->next_result );
is( $result->rid, '1118324516-16598-103707467515.BLASTQ1' );
ok( $hit = $result->next_hit );
is( $hit->name, 'gi|40747822|gb|EAA66978.1|', 'full hit name' );
is( $hit->accession, 'EAA66978', 'hit accession' );
is( $hit->ncbi_gi, 40747822 );
ok( $hsp = $hit->next_hsp );
is( $hsp->query->start, 1, 'query start' );
is( $hsp->query->end, 528, 'query end' );
is( $hsp->n, 1 );

# tests for new BLAST 2.2.13 output
$searchio = Bio::SearchIO->new(
    -format => 'blast',
    -file   => test_input_file('new_blastn.txt')
);
$result = $searchio->next_result;
is( $result->database_name, 'All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS,GSS,environmental samples or phase 0, 1 or 2 HTGS sequences)' );
is( $result->database_entries, 3742891 );
is( $result->database_letters, 16670205594 );
is( $result->algorithm, 'BLASTN' );
is( $result->algorithm_version, '2.2.13 [Nov-27-2005]' );
like($result->algorithm_reference, qr/Gapped BLAST and PSI-BLAST/);
is( $result->rid, '1141079027-8324-8848328247.BLASTQ4' );
is( $result->query_name, 'pyrR,' );
is( $result->query_length, 558 );
is( $result->get_statistic('kappa'), '0.711' );
is( $result->get_statistic('kappa_gapped'), '0.711' );
is( $result->get_statistic('lambda'), '1.37' );
is( $result->get_statistic('lambda_gapped'), '1.37' );
is( $result->get_statistic('entropy'), '1.31' );
is( $result->get_statistic('entropy_gapped'), '1.31' );
is( $result->get_statistic('dbletters'), '-509663586' );
is( $result->get_statistic('dbentries'), 3742891 );
is(
$result->get_statistic('effective_hsplength'), undef ); is( $result->get_statistic('effectivespace'), 8935230198384 ); is( $result->get_statistic( 'number_of_hsps_better_than_expect_value_cutoff_without_gapping'), 0 ); is( $result->get_statistic('number_of_hsps_gapped'), 1771 ); is( $result->get_statistic('number_of_hsps_successfully_gapped'), 0 ); is( $result->get_statistic('length_adjustment'), 22 ); is( $result->get_statistic('querylength'), 536 ); is( $result->get_statistic('effectivedblength'), 16670205594 ); is( $result->get_statistic('effectivespaceused'), 8891094027712 ); is( $result->get_parameter('matrix'), 'blastn matrix:1 -3' ); is( $result->get_parameter('gapopen'), 5 ); is( $result->get_parameter('gapext'), 2 ); is( $result->get_statistic('S2'), '60' ); is( $result->get_statistic('S2_bits'), '119.4' ); float_is( $result->get_parameter('expect'), '1e-23' ); is( $result->get_statistic('num_extensions'), '117843' ); @valid = ( [ 'gi|41400296|gb|AE016958.1|', 4829781, 'AE016958', 41400296, '6e-059', 119, 236 ], [ 'gi|54013472|dbj|AP006618.1|', 6021225, 'AP006618', 54013472, '4e-026', 64, 127 ], [ 'gi|57546753|dbj|BA000030.2|', 9025608, 'BA000030', 57546753, '1e-023', 60, 119 ] ); $count = 0; while ( $hit = $result->next_hit ) { my $d = shift @valid; is( $hit->name, shift @$d ); is( $hit->length, shift @$d ); is( $hit->accession, shift @$d ); is( $hit->ncbi_gi, shift @$d ); float_is( $hit->significance, shift @$d ); is( $hit->raw_score, shift @$d ); is( $hit->bits, shift @$d ); if ( $count == 0 ) { my $hsps_left = 1; while ( my $hsp = $hit->next_hsp ) { is( $hsp->query->start, 262 ); is( $hsp->query->end, 552 ); is( $hsp->hit->start, 1166897 ); is( $hsp->hit->end, 1167187 ); is( $hsp->length('total'), 291 ); is( $hsp->hit_features, 'PyrR' ); is( $hsp->start('hit'), $hsp->hit->start ); is( $hsp->end('query'), $hsp->query->end ); is( $hsp->strand('sbjct'), $hsp->subject->strand ); # alias for hit float_is( $hsp->evalue, 6e-59 ); is( $hsp->score, 119 ); is( 
$hsp->bits, 236 ); is( sprintf( "%.2f", $hsp->percent_identity ), 85.22 ); is( sprintf( "%.4f", $hsp->frac_identical('query') ), 0.8522 ); is( sprintf( "%.4f", $hsp->frac_identical('hit') ), 0.8522 ); is( $hsp->gaps, 0 ); is( $hsp->n, 1 ); $hsps_left--; } is( $hsps_left, 0 ); } last if ( $count++ > @valid ); } is( @valid, 0 ); # Bug 2189 $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('blastp2215.blast') ); $result = $searchio->next_result; is( $result->database_entries, 4460989 ); is( $result->database_letters, 1533424333 ); is( $result->algorithm, 'BLASTP' ); is( $result->algorithm_version, '2.2.15 [Oct-15-2006]' ); is( $result->rid, '1169055516-21385-22799250964.BLASTQ4' ); is( $result->query_name, 'gi|15608519|ref|NP_215895.1|' ); is( $result->query_gi, 15608519 ); is( $result->query_length, 193 ); @hits = $result->hits; is( scalar(@hits), 10 ); is( $hits[1]->accession, '1W30' ); is( $hits[4]->significance, '2e-72' ); is( $hits[7]->bits, '254' ); $result = $searchio->next_result; is( $result->database_entries, 4460989 ); is( $result->database_letters, 1533424333 ); is( $result->algorithm, 'BLASTP' ); is( $result->algorithm_version, '2.2.15 [Oct-15-2006]' ); is( $result->query_name, 'gi|15595598|ref|NP_249092.1|' ); is( $result->query_length, 423 ); @hits = $result->hits; is( scalar(@hits), 10 ); is( $hits[1]->accession, 'ZP_00972546' ); is( $hits[2]->ncbi_gi, 116054132 ); is( $hits[4]->significance, '0.0' ); is( $hits[7]->bits, 624 ); # Bug 2246 $searchio = Bio::SearchIO->new( -format => 'blast', -verbose => -1, -file => test_input_file('bug2246.blast') ); $result = $searchio->next_result; is( $result->get_statistic( 'number_of_hsps_better_than_expect_value_cutoff_without_gapping'), 0 ); is( $result->get_statistic('number_of_hsps_gapped'), 7049 ); is( $result->get_statistic('number_of_hsps_successfully_gapped'), 55 ); is( $result->get_statistic('length_adjustment'), 125 ); is( $result->get_statistic('querylength'), 68 ); is( 
$result->get_statistic('effectivedblength'), 1045382588 ); is( $result->get_statistic('effectivespace'), 71086015984 ); is( $result->get_statistic('effectivespaceused'), 71086015984 ); $hit = $result->next_hit; is $hit->name, 'UniRef50_Q9X0H5'; is $hit->length, 0; is $hit->accession, 'UniRef50_Q9X0H5'; is $hit->description, 'Cluster: Histidyl-tRNA synthetase; n=4; Thermoto...'; is $hit->bits, 23; float_is( $hit->significance, 650 ); # Bug 1986 $searchio = Bio::SearchIO->new( -format => 'blast', -verbose => -1, -file => test_input_file('bug1986.blastp') ); $result = $searchio->next_result; is( $result->get_statistic('querylength'), 335 ); is( $result->get_statistic('effectivedblength'), 18683311 ); is( $result->get_statistic('effectivespace'), 6258909185 ); is( $result->get_statistic('effectivespaceused'), 6258909185 ); $hit = $result->next_hit; is $hit->name, 'ENSP00000350182'; is $hit->length, 425; is $hit->accession, 'ENSP00000350182'; is $hit->description, 'pep:novel clone::BX322644.8:4905:15090:-1 gene:ENSG00000137397 transcript:ENST00000357569'; is $hit->raw_score, 301; is $hit->bits, 120; float_is( $hit->significance, 3e-27 ); $hit = $result->next_hit; is $hit->name, 'ENSP00000327738'; is $hit->length, 468; is $hit->accession, 'ENSP00000327738'; is $hit->description, 'pep:known-ccds chromosome:NCBI36:4:189297592:189305643:1 gene:ENSG00000184108 transcript:ENST00000332517 CCDS3851.1'; is $hit->raw_score, 289; is $hit->bits, 115; float_is( $hit->significance, 8e-26 ); # Bug 1986, pt. 
2 # handle at least the first iteration with BLAST searches using databases # containing non-unique IDs my $file = test_input_file('bug1986.blast2'); my %unique_accs; open my $IN, '<', $file or die "Could not read file '$file': $!\n"; while (<$IN>) { last if (/^Sequences/); } $count = 1; while (<$IN>) { chomp; next if m{^\s*$}; next unless ($_); last if m{^>}; my ($accession) = split(/\s+/); #print "Real Hit $count = $accession\n"; $unique_accs{$accession}++; #last if ($count == 10); ++$count; } close $IN; is( $count, 495 ); is( scalar( keys %unique_accs ), 490 ); my %search_accs; $searchio = Bio::SearchIO->new( -format => 'blast', -verbose => -1, -file => $file ); $result = $searchio->next_result; $count = 1; while ( my $hit = $result->next_hit ) { $search_accs{ $hit->accession }++; $count++; } is( $count, 495 ); is( scalar( keys %search_accs ), 490 ); is_deeply( \%unique_accs, \%search_accs ); # bug 2391 - long query names $file = test_input_file('bug2391.megablast'); $searchio = Bio::SearchIO->new( -format => 'blast', -file => $file ); $result = $searchio->next_result; # data is getting munged up with long names is( $result->query_name, 'c6_COX;c6_QBL;6|31508172;31503325;31478402|rs36223351|1|dbSNP|C/G' ); is( $result->query_description, '' ); is( $result->algorithm, 'MEGABLAST' ); is( $result->get_statistic( 'number_of_hsps_better_than_expect_value_cutoff_without_gapping'), undef ); is( $result->get_statistic('number_of_hsps_gapped'), 0 ); is( $result->get_statistic('number_of_hsps_successfully_gapped'), 0 ); is( $result->get_statistic('length_adjustment'), 16 ); is( $result->get_statistic('querylength'), 85 ); is( $result->get_statistic('effectivedblength'), 59358266 ); is( $result->get_statistic('effectivespace'), 5045452610 ); is( $result->get_statistic('effectivespaceused'), 5045452610 ); # bug 2399 - catching Expect(n) values $file = test_input_file('bug2399.tblastn'); $searchio = Bio::SearchIO->new( -format => 'blast', -file => $file ); my $total_n = 0; 
while ( my $query = $searchio->next_result ) { while ( my $subject = $query->next_hit ) { $total_n += grep { $_->n } $subject->hsps; } } is( $total_n, 80 ); # n = at least 1, so this was changed to reflect that sub cmp_evalue ($$) { my ( $tval, $aval ) = @_; is( sprintf( "%g", $tval ), sprintf( "%g", $aval ) ); } # bug 3064 - All-gap Query/Subject lines for BLAST+ do not have numbering $file = test_input_file('blast_plus.blastp'); $searchio = Bio::SearchIO->new( -format => 'blast', -file => $file ); my $total_hsps = 0; while ( my $query = $searchio->next_result ) { is( $query->get_statistic('querylength'), undef ); is( $query->get_statistic('effectivedblength'), undef ); is( $query->get_statistic('effectivespace'), undef ); is( $query->get_statistic('effectivespaceused'), 55770 ); while ( my $subject = $query->next_hit ) { while ( my $hsp = $subject->next_hsp ) { $total_hsps++; if ( $total_hsps == 1 ) { is( $hsp->start('query'), 5 ); is( $hsp->start('hit'), 3 ); is( $hsp->end('query'), 220 ); is( $hsp->end('hit'), 308 ); is( length( $hsp->query_string ), length( $hsp->hit_string ) ); } } } } is( $total_hsps, 2 ); # BLAST 2.2.20+ output file ZABJ4EA7014.CH878695.1.blast.txt # Tests SearchIO blast parsing of 'Features in/flanking this part of a subject sequence' $searchio = Bio::SearchIO->new( -format => 'blast', -file => test_input_file('ZABJ4EA7014.CH878695.1.blast.txt') ); $result = $searchio->next_result; # Parse BLAST header details is( $result->algorithm, 'BLASTN' ); is( $result->algorithm_version, '2.2.20+' ); like($result->algorithm_reference, qr/A greedy algorithm for aligning DNA\s+sequences/); is( $result->database_name, 'human build 35 genome database (reference assembly only)' ); is( $result->database_entries, 378 ); is( $result->database_letters, 2866055344 ); is( $result->query_name, 'gi|95131563|gb|CH878695.1|' ); is( $result->query_description, 'Homo sapiens 211000035829648 genomic scaffold' ); is( $result->query_length, 29324 ); # Parse BLAST footer 
details is( $result->get_statistic('posted_date'), 'Jul 26, 2007 3:20 PM' ); is( $result->get_statistic('dbletters'), -1428911948 ); is( $result->get_statistic('lambda'), '1.33' ); is( $result->get_statistic('kappa'), '0.621' ); is( $result->get_statistic('entropy'), '1.12' ); is( $result->get_statistic('lambda_gapped'), '1.28' ); is( $result->get_statistic('kappa_gapped'), '0.460' ); is( $result->get_statistic('entropy_gapped'), '0.850' ); is( $result->get_parameter('matrix'), 'blastn matrix:1 -2' ); is( $result->get_parameter('gapopen'), 0 ); is( $result->get_parameter('gapext'), 0 ); is( $result->get_statistic('num_extensions'), 216 ); is( $result->get_statistic('num_successful_extensions'), 216 ); is( $result->get_parameter('expect'), '0.01' ); is( $result->get_statistic('seqs_better_than_cutoff'), 10 ); is( $result->get_statistic( 'number_of_hsps_better_than_expect_value_cutoff_without_gapping'), 0 ); is( $result->get_statistic('number_of_hsps_gapped'), 216 ); is( $result->get_statistic('number_of_hsps_successfully_gapped'), 212 ); is( $result->get_statistic('length_adjustment'), 34 ); is( $result->get_statistic('querylength'), 29290 ); is( $result->get_statistic('effectivedblength'), 2866042492 ); is( $result->get_statistic('effectivespace'), 83946384590680 ); is( $result->get_statistic('effectivespaceused'), 83946384590680 ); is( $result->get_statistic('A'), 0 ); is( $result->get_statistic('X1'), 23 ); is( $result->get_statistic('X1_bits'), '44.2' ); is( $result->get_statistic('X2'), 32 ); is( $result->get_statistic('X2_bits'), '59.1' ); is( $result->get_statistic('X3'), 54 ); is( $result->get_statistic('X3_bits'), '99.7' ); is( $result->get_statistic('S1'), 23 ); is( $result->get_statistic('S1_bits'), '43.6' ); is( $result->get_statistic('S2'), 29 ); is( $result->get_statistic('S2_bits'), '54.7' ); # Skip the 1st hit. 
It doesn't have any 'Features in/flanking this part of subject sequence:' $hit = $result->next_hit; # The 2nd hit has hsps with 'Features flanking this part of subject sequence:' $hit = $result->next_hit; is( $hit->name, 'gi|51459264|ref|NT_077382.3|Hs1_77431' ); is( $hit->description, 'Homo sapiens chromosome 1 genomic contig' ); is( $hit->length, 237250 ); # In the 2nd hit, look at the 1st hsp $hsp = $hit->next_hsp; is( $hsp->hit_features, "16338 bp at 5' side: PRAME family member 8 11926 bp at 3' side: PRAME family member 9" ); is( $hsp->bits, 7286 ); is( $hsp->score, 3945 ); is( $hsp->expect, '0.0' ); is( $hsp->hsp_length, 6145 ); is( $hsp->num_identical, 5437 ); is( int sprintf( "%.2f", $hsp->percent_identity ), 88 ); is( $hsp->gaps, 152 ); is( $hsp->start('query'), 23225 ); is( $hsp->start('sbjct'), 86128 ); is( $hsp->end('query'), 29324 ); is( $hsp->end('sbjct'), 92165 ); # In the 2nd hit, look at the 2nd hsp $hsp = $hit->next_hsp; is( $hsp->hit_features, "25773 bp at 5' side: PRAME family member 3 3198 bp at 3' side: PRAME family member 5" ); is( $hsp->bits, 4732 ); is( $hsp->score, 2562 ); is( $hsp->expect, '0.0' ); is( $hsp->hsp_length, 4367 ); is( $hsp->num_identical, 3795 ); is( int sprintf( "%.2f", $hsp->percent_identity ), 86 ); is( $hsp->gaps, 178 ); is( $hsp->start('query'), 23894 ); is( $hsp->start('sbjct'), 37526 ); is( $hsp->end('query'), 28193 ); is( $hsp->end('sbjct'), 41781 ); # In the 2nd hit, look at the 3rd hsp $hsp = $hit->next_hsp; is( $hsp->hit_features, "16338 bp at 5' side: PRAME family member 8 14600 bp at 3' side: PRAME family member 9" ); is( $hsp->bits, 3825 ); is( $hsp->score, 2071 ); is( $hsp->expect, '0.0' ); is( $hsp->hsp_length, 3406 ); is( $hsp->num_identical, 2976 ); is( int sprintf( "%.2f", $hsp->percent_identity ), 87 ); is( $hsp->gaps, 89 ); is( $hsp->start('query'), 14528 ); is( $hsp->start('sbjct'), 86128 ); is( $hsp->end('query'), 17886 ); is( $hsp->end('sbjct'), 89491 ); # In the 2nd hit, look at the 4th hsp $hsp = 
$hit->next_hsp; is( $hsp->hit_features, "29101 bp at 5' side: PRAME family member 8 2120 bp at 3' side: PRAME family member 9" ); is( $hsp->bits, 3241 ); is( $hsp->score, 1755 ); is( $hsp->expect, '0.0' ); is( $hsp->hsp_length, 3158 ); is( $hsp->num_identical, 2711 ); is( int sprintf( "%.2f", $hsp->percent_identity ), 85 ); is( $hsp->gaps, 123 ); is( $hsp->start('query'), 23894 ); is( $hsp->start('sbjct'), 98891 ); is( $hsp->end('query'), 27005 ); is( $hsp->end('sbjct'), 101971 ); # In the 2nd hit, look at the 5th hsp $hsp = $hit->next_hsp; is( $hsp->hit_features, "PRAME family member 13" ); is( $hsp->bits, 3142 ); is( $hsp->score, 1701 ); is( $hsp->expect, '0.0' ); is( $hsp->hsp_length, 2507 ); is( $hsp->num_identical, 2249 ); is( int sprintf( "%.2f", $hsp->percent_identity ), 89 ); is( $hsp->gaps, 63 ); is( $hsp->start('query'), 3255 ); is( $hsp->start('sbjct'), 128516 ); is( $hsp->end('query'), 5720 ); is( $hsp->end('sbjct'), 131000 ); # testing for Bug #3298 $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('multiresult_blastn+.bls') ); is ($searchio->next_result->algorithm_version, '2.2.25+', "testing Bug 3298"); is ($searchio->next_result->algorithm_version, '2.2.25+', "testing Bug 3298"); is ($searchio->next_result->algorithm_version, '2.2.25+', "testing Bug 3298"); # testing for Bug #3251 $searchio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('rpsblast_no_hits.bls') ); is ($searchio->next_result->database_name, 'CDD.v.2.13', "testing Bug 3251"); is ($searchio->next_result->database_name, 'CDD.v.2.13', "testing Bug 3251"); is ($searchio->next_result->database_name, 'CDD.v.2.13', "testing Bug 3251"); erpin.t100644000766000024 1176514006427353 16454 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO# -*-Perl-*- Test Harness script for Bioperl # $Id: SearchIO_erpin.t 14672 2008-04-22 21:42:50Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 91); use_ok('Bio::SearchIO'); } 
my ($searchio, $result, $iter, $hit, $hsp, $algorithm, $meta); $searchio = Bio::SearchIO->new( -format => 'erpin', -file => test_input_file('testfile.erpin'), -model => 'stem-loop', -query_acc => 'test', -version => 5.5, -verbose => 2 ); $result = $searchio->next_result; isa_ok($result, 'Bio::Search::Result::ResultI'); $algorithm = $result->algorithm; is($result->algorithm, 'ERPIN', "Result $algorithm"); is($result->algorithm_reference, undef, "Result $algorithm reference"); is($result->algorithm_version, 5.5, "Result $algorithm version"); is($result->available_parameters, 2, "Result parameters"); is($result->available_statistics, 1, "Result statistics"); is($result->database_entries, '', "Result entries"); is($result->database_letters, '', "Result letters"); is($result->database_name, 'AE016879.fna', "Result database_name"); is($result->num_hits, 1, "Result num_hits"); is($result->program_reference, undef, "Result program_reference"); is($result->query_accession, 'test', "Result query_accession"); is($result->query_description, '40 sequences of length 43', "Result query_description"); is($result->query_name, '/home/Administrator/pyrR.epn', "Result query_name"); $hit = $result->next_hit; isa_ok($hit, 'Bio::Search::Hit::HitI'); is($hit->accession, 'AE016879.1', "Hit accession"); is($hit->ncbi_gi, '30260185', "Hit GI"); is($hit->algorithm, 'ERPIN', "Hit algorithm"); is($hit->bits, 31.64, "Hit bits"); is($hit->description, 'Bacillus anthracis str. 
Ames, complete genome', "Hit description"); # no hit descs yet is($hit->length, 0, "Hit length"); is($hit->locus, '', "Hit locus"); is($hit->n, 4, "Hit n"); is($hit->name, 'gi|30260185|gb|AE016879.1|', "Hit name"); is($hit->num_hsps, 4, "Hit num_hsps"); is($hit->overlap, 0, "Hit overlap"); is($hit->query_length, undef, "Hit query_length"); is($hit->rank, 1, "Hit rank"); is($hit->raw_score, 31.64, "Hit raw_score"); is($hit->score, 31.64, "Hit score"); float_is($hit->significance, 4.44e-06); $hsp = $hit->next_hsp; isa_ok($hsp, 'Bio::Search::HSP::HSPI'); is($hsp->algorithm, 'ERPIN', "HSP algorithm"); float_is($hsp->evalue, 1.68e-05); isa_ok($hsp->feature1, 'Bio::SeqFeature::Similarity'); isa_ok($hsp->feature2, 'Bio::SeqFeature::Similarity'); is($hsp->frame('query'), 0, "HSP frame"); is($hsp->gaps, 2, "HSP gaps"); isa_ok($hsp->hit, 'Bio::SeqFeature::Similarity', "HSP hit"); is($hsp->hit_string, 'CTTT.aacc--.CAACC.CCGTGA.GGTTG.a.GAAG', "HSP hit_string"); is($hsp->homology_string, '', "HSP homology_string"); is($hsp->hsp_group, undef, "HSP hsp_group"); is($hsp->hsp_length, 37, "HSP hsp_length"); is($hsp->length, 37, "HSP length"); is($hsp->links, undef, "HSP links"); isa_ok($hsp->query, 'Bio::SeqFeature::Similarity', "HSP query"); is($hsp->query_string, '', "HSP query_string"); is($hsp->range, 37, "HSP range"); is($hsp->rank, 1, "HSP rank"); float_is($hsp->significance, 1.68e-05); float_is($hsp->expect, '1.68e-05', "HSP expect"); isa_ok($hsp->seq, 'Bio::LocatableSeq'); is($hsp->seq_str, '', "HSP seq_str"); is($hsp->start, 1, "HSP start"); is($hsp->custom_score, undef, "HSP custom_score"); is($hsp->meta, undef, "HSP meta"); is($hsp->meta, undef); is($hsp->meta, undef); is($hsp->strand('hit'), 1, "HSP strand"); is($hsp->meta, undef); is($hsp->meta, undef); # ERPIN lacks sequence for query, will spit back a warning.. 
eval{$hsp->get_aln}; like($@, qr'Missing query string, can\'t build alignment','ERPIN get_aln warning'); $hsp = $hit->next_hsp; isa_ok($hsp, 'Bio::Search::HSP::HSPI'); is($hsp->algorithm, 'ERPIN', "HSP algorithm"); float_is($hsp->evalue, 5.61e-05); isa_ok($hsp->feature1, 'Bio::SeqFeature::Similarity'); isa_ok($hsp->feature2, 'Bio::SeqFeature::Similarity'); is($hsp->frame('query'), 0, "HSP frame"); is($hsp->gaps, 1, "HSP gaps"); isa_ok($hsp->hit, 'Bio::SeqFeature::Similarity', "HSP hit"); is($hsp->hit_string, 'CTTT.taatt-.CAGTC.CTGTGA.GACCG.g.AAAG', "HSP hit_string"); is($hsp->homology_string, '', "HSP homology_string"); is($hsp->query_string, '', "HSP query_string"); is($hsp->hsp_group, undef, "HSP hsp_group"); is($hsp->hsp_length, 37, "HSP hsp_length"); is($hsp->length, 37, "HSP length"); is($hsp->links, undef, "HSP links"); isa_ok($hsp->query, 'Bio::SeqFeature::Similarity'); is($hsp->range, 37, "HSP range"); is($hsp->rank, 2, "HSP rank"); float_is($hsp->significance, 5.61e-05); is($hsp->end, 37, "HSP end"); float_is($hsp->expect, '5.61e-05', "HSP expect"); isa_ok($hsp->seq, 'Bio::LocatableSeq'); is($hsp->seq_str, '', "HSP seq_str"); is($hsp->start, 1, "HSP start"); is($hsp->custom_score, undef, "HSP custom_score"); is($hsp->meta, undef); is($hsp->meta, undef); is($hsp->strand('hit'), -1, "HSP strand"); Utilities.t100644000766000024 612014006427353 16545 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Root# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 56); use_ok('Bio::Root::Utilities'); } # Object creation my $u = Bio::Root::Utilities->new(); isa_ok($u, 'Bio::Root::Utilities') ; # month2num() and num2month() my @month = qw(XXX Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec); for my $i (1 .. 12) { is $u->month2num($month[$i]), $i; is $u->num2month($i), $month[$i]; } # untaint() is $u->untaint(''), ''; is $u->untaint('nice string'), 'nice string'; is $u->untaint('bad *?&^$! 
string'), 'bad '; is $u->untaint( q{100% relaxed&;''\"|*?!~<>^()[]{}$}, 1 ), '100% relaxed'; # mean_stdev() my($mu,$sd); ($mu,$sd) = $u->mean_stdev(); is $mu, undef; is $sd, undef; ($mu,$sd) = $u->mean_stdev(42); is $mu, 42; is $sd, undef; ($mu,$sd) = $u->mean_stdev(-1,0,1); is $mu, 0; is $sd, 1; # file_date(), file_flavor(), date_format() my $file = test_input_file('test.txt'); my $file2 = test_input_file('test 2.txt'); my $fdate = $u->file_date($file); like $fdate , qr/\d{4}-\d{2}-\d{2}/, 'file_date()'; ok $u->file_flavor($file), 'unix (\n or 012 or ^J)'; my $date = $u->date_format(); like $date, qr/\d{4}-\d{2}-\d{2}/, 'date format'; my $date2 = $u->date_format('yyyy-mmm-dd', $date); like $date2 , qr/\d{4}-[a-z]{3}-\d{2}/i, 'date format'; my $date3 = $u->date_format('mdhms'); like $date3 , qr/[a-z]{3}\d{1,2} \d{1,2}:\d{1,2}:\d{1,2}/, 'date format'; my $date4 = $u->date_format('d-m-y', '11/22/60'); like $date4 , qr/\d{1,2}-[a-z]{3}-\d{4}/i, 'date format'; my $date5 = $u->date_format('mdy', '1/5/01'); like $date5 , qr/[a-z]{3} \d{1,2}, \d{4}/i, 'date format'; # External executable-related functions. my $exe = $u->find_exe('some-weird-thing-no-one-will-have'); ok ! defined $exe ; # compress() and uncompress() using gzip. SKIP: { my $gzip = $u->find_exe('gzip'); skip "gzip not found, skipping gzip tests", 12 unless $gzip; ok -x $gzip; # test compression/decompression of a simple file my $zfile = $u->compress($file); # In Windows, the folder separator '\' may break # the following qr{}, so change it to '/' $zfile =~ s'\\'/'g; $file =~ s'\\'/'g; like $zfile, qr/$file.gz|tmp.bioperl.gz/; ok -s $zfile; if ($zfile =~ /tmp.bioperl.gz/) { ok -e $file; } else { ok ! -e $file; } my $unzfile = $u->uncompress($zfile); ok !
-e $zfile; ok -e $file; # test compression/decompression of a filename with spaces keeping the original intact my $zfile2 = $file2.'.gz'; my $return = $u->compress(-file => $file2, -outfile => $zfile2, -tmp => 1); is $return, $zfile2; ok -e $zfile2; ok -e $file2; unlink $file2 or die "Problem deleting $file2: $!\n"; $return = $u->uncompress(-file => $zfile2, -outfile => $file2, -tmp => 1); is $return, $file2; ok -e $file2; ok -e $zfile2; unlink $zfile2 or die "Problem deleting $zfile2: $!\n"; } # send_mail() # $u->send_mail(-to=>'sac@bioperl.org', # <--- your address here! # -subj=>'Root-Utilities.t', # -msg=>'Hey, your send_mail() method works!'); Exception.t100644000766000024 273514006427353 16540 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Root# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { eval {require Error;}; use Bio::Root::Test; test_begin(-tests => 7, -requires_module => 'Error'); use_ok('Bio::Root::TestObject'); } use Error qw(:try); $Error::Debug = test_debug(); # Set up a tester object. ok my $test = Bio::Root::TestObject->new(-verbose => test_debug()); is $test->data('Eeny meeny miney moe.'), 'Eeny meeny miney moe.'; # This demonstrates what will happen if a method defined in an # interface is not implemented in the implementing object. eval { try { $test->foo(); } catch Bio::Root::NotImplemented with { my $err = shift; is ref $err, 'Bio::Root::NotImplemented'; }; }; # TestObject::bar() deliberately throws a Bio::TestException, # which is defined in TestObject.pm try { $test->bar; } catch Bio::TestException with { my $err = shift; is ref $err, 'Bio::TestException'; }; # Use the non-object-oriented syntax to throw a generic Bio::Root::Exception. try { throw Bio::Root::Exception( "A generic error", 42 ); } catch Bio::Root::Exception with { my $err = shift; is ref $err, 'Bio::Root::Exception'; is $err->value, 42; }; # Try to call a subroutine that doesn't exist.
But because it occurs # within a try block, the Error module will create a Error::Simple to # capture it. Handy eh? try { $test->foobar(); } otherwise { my $err = shift; is ref $err, 'Error::Simple'; }; cds-266.fas100644000766000024 3602114006427353 16175 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>183.m01790 |||similar to unknown protein||chr_13|chr13|183 ATGGACGACAAAGAACTCGAAATACCGGTAGAACATTCCACGGCTTTCGGTCAGCTCGTG ACGGGTCCCCCGGGAGCGGGTAAATCGACCTATTGTCATGGCTTACATCAGTTCCTTACA GCCATCGGTAGACCAGTGCATATCATCAACCTCGATCCTGCAGTCCCAAACCCTCCGTAT CCATGCTCTATAAACATCACGGAACTCATCACACTCGAAAGTGTTATGGAGGAATACAAT CTAGGACCGAATGGGGCGATGCTTTATTGTATAGAATTCTTAGAGGCCAATTTTGACTGG CTAGTGGAGAGGCTGGATGAGGTCTTGGCTGAAGAGGGGGGGAATGGATATGTGGTGTTT GATACGCCGGGTCAAGCAGAGTTATGGACGAACCATGATAGTTTGAAGAACGTGGTCGAA AAGTTGGTCAAGATGGACTATAGACTAGCGGCTGTGCATCTCAGCGACGCGCACTACATA ACAGATGCCTCAAAATTCATCTCTGTAGTTTTGCTAGCTCTTCGGGCGATGCTGCAAATG GAAATGCCGCATTTGAATGTGCTCAGCAAAATAGATTTGATATCAACTTATGGAGAGCTC CCGTTCGACTTGAGCTATTACACAGAAGTCCAAGATCTGTCATACTTACTGGGCAGTCTG GATTCAGACCCTCGAACAGCAAAGTACCACAAGTTAAATAAAGCGTTGGTAGAGCTTATA GAAGGCTTTTCATTAGTCGGATTTCAAACCCTCGCTGTTGAGGACAAAGAATCAATGCTT AATATCGTCCGTCTTGTCGATAAGATGACGGGCTACATATTTATTCCGTCTGGCGACCTC GAAGGAACCAACGCCATCAATACCCAAGCTCTGTTTGGTAGTGCCATGTCGTCGGCGAAG CTTACAGGAAGAGCAGGCGGGGACGTAAGAGATGTTCAGGAGAGATGGATGGATAACAAG GAGGCTTGGGATGAATGGGAGAAGAAAGAATGGAAGAGAGAAGCAGAGATAAGAGCCCAG ATGGGCACTGGAATACCAGAAGGGATGAAAGGCGGTGAAGATGCGGAAAGTACAGGTATA >AAL117C location=AgChr1:complement(140329..141372) ATGGCGTATGGACAGATTGTGATAGGTCCACCGGGGTCTGGGAAGTCGACATACTGTAAT GGGTGCAGCCAGTTCTTTAATGCCATCGGCAGACACGCTCGGATCGTGAACATGGACCCT GCAAACGACTCGCTGCCCTACCAATGCGATGTAGACATTCGAGACTTTATTACTCTGGAG GAAATCATGAACGAGCAGCACCTGGGGCCCAACGGAGGGCTGGTGTATGCGTTTGAGTCG GTGGAGCACTCACTGTCGCTGTTTGCGCTGCAGATCAAGACGCTGGTCAAGGATGAGAAC GCATATCTCGTCTTTGACTGCCCCGGTCAGGTGGAGCTGTTCACGCATCACTCGGCGCTC TCCAAGATATTCCAGCAGCTGGTGCGCGACTTGGACCTACGAGTGTGCGTGGTGAACTTG 
ATGGACAGCATCTACATTACATCGCCGTCGCAGTATGTCTCGGTACTGCTGCTGGCGCTG CGCTCAATGTTGATGATGGACCTGCCCCATATTAACGTTCTCTCTAAGATCGATATGCTG AGCTCGTACGGCGACCTGCCGTTCCGGCTCGACTACTATACCGAGGTGCAAGACTTGGAG TATCTGCAACCGCATATTGAACGCGAACACAAGGGAGCCAAGGCGTTGAGGTACCGCCGA CTAACGGAGGCCATAGGAGAGGTGGTTTCGGACTTCAACCTGGTCGCCTTCGAGGTGCTT TGCGTCGATGACAAACAGAGCATGATCAACTTGCAAAGCGCAATCGACAAGGCCAATGGT TATATTTTTGGTGCCTCCGAAGTTGGTGGCGATACTGTGTGGGCGGAGGCAACCCGCCAG GGCACTGCTGCAATTGAATATGACATTCAGGACAGATGGATCGACAACAAGGACTTTTAT GACAAGGAGGAAGAGGCTAGGCGCAAGAAGTTACTTGAGGAGCATGAGCTTCTGGAGAAA GAAGTTGATGTCAACCAGGATGATGAATGGGAACGCGCAGTGAAGGAATGGGAGTCCCAG CACTCTGTGAACTTCGTTAAA >AN2438.1 hypothetical protein (53856 - 52862) ATGAGTGAGGATCAATTGGGTCCGAACGGCGGTGTTTTGTATGCGTTGGAAGAGCTAGAG GAGAACTTTGACTTCTTGGAGGAAGGGTTGAAAGAGCTCGGAGAGGACTATATTATCTTC GATTGTCCCGGCCAGGTAGAAATTTTCACTCACCATTCGTCCTTACGGAATATCTTCTTC AAGATCCAGAAGATGGGCTATAGACTAATAGTACTACACCTAATCGACTCCTACAACCTC ACCCTGCCATCGATGTACATCTCCTCTCTTATTCTATGCTTGCGTGCCATGCTCCAAATG GACCTTCCACATCTCAACGTCCTAACAAAAATCGATAATTTGTCCAATTATACTTCGCTG CCTTTCAACCTAGATTTCTACACCGAGGTTCAGGACCTTACATACCTCCTCCCCCACTTA GAGGCAGAGTCCTCCCGGCTATCGCACGAGAAGTTCGGAGCACTGAACAACGCCATCATC ACACTGATTGAGGAGTTTGGACTCGTGGGCTTCGAAACACTGGCTGTAGAAGATAAAAAG AGCATGATGAATTTGCTCCGGGCCATTGACCGCGCAAGTGGATACGTGTTTGGGCCTGCA GAAGGCGCAAATGACTCCGTTTGGCAAGTGGCTGTTCGGGAAGGAATGGGGTCCATGGAT ATCCGTGATATTCAAGAGCGTTGGATAGATGCCAAAGACGAGTACGATGAGTTGGAACGA CGGCAGCGAGAGGAGGAGATAAAAAATCACCAGCAAGCTGCAACCTACCAGGCAGGGAAC GAGGACGACGACGATGATAACGATTACGAATTCGGGCGCAGGATGCCTGTACCAGACAGT GGAGTGAAAGTGATGCGGAAG >FG05298.1 hypothetical protein (258181 - 259340) ATGCCTTTCGCGCAACTCGTTCTCGGTAGTCCGGGCTGCGGAAAGAGTACATACTGTGAT GGCATACAGCTGACCGGTCAAGTGCATCAGTTCCTAGGCGCCATCGGGCGAGCCTGTTCA GTCGTCAATCTCGATCCTGCCAACGATCATACCAACTACCCTGCAGCTCTCGACATTCGC AGTTTGATTAAGCTCGAGGAGATTATGAAAGATGATAAATTAGGACCTAATGGCGGCATC CTGTATGCCCTCGAAGAGTTGGAACACAATTTCGAGTGGTTGGAAGAAGGACTGAAAGAA TTCAGCGAAGACTATATTCTTTTCGACTGTCCGGGACAAGTGGAACTATATACACACCAC 
AACTCCTTGCGAAACATATTCTACAAGCTCCAGAAGATTGGATTCAGGCTTGTTTCCGTC CACCTCTCCGACTCCTTCTGCCTCACGCAACCGTCGTTATACGTATCGAACGTCCTCCTC TCCCTTCGTGCGATGATCCAGATGGATATGCCACACATAAATATTCTCTCCAAGATCGAC AAAGTTGCCGACTACGACGAACTCCCTTTCAACCTCGATTACTACACAGACGTGGACGAC CTTACATATTTGACACCCCATCTTGAGACAGAGTCGCCCGCTCTGAGGAGTGAGAAATTC GGCAAGCTCAACGAGGCGATTGCGAATCTGATCGAGAGCTACGGTCTGGTGCGCTATGAA GTCCTGGCTGTCGAGAACAAGAAAAGCATGATGCATATCCTCCGTGTCATTGACCGTGCT GGTGGATACGTCTTTGGTAGTGCTGAAGGAGCCAATGATACAGTCTGGTCAGTTGCCATG AGGAACGAGTCGTCCATGTTGGGGGTGCAGGACATCCAAGAGCGTTGGATCGACCAAAAG GTGGAATATGATCAAATGGAGCGTGAGGCCGAAGAAGAACAGGCGCGCATCCAAGAAGAA CAAGCCATGGAGATGGAACAATCACAGCCACCTCCTGCGCCGACAGGTGGCATGGATCCT GATTTTGGTGACATGACGGTGCCCAAAGATAGTGGGATCAAAGTAGTTAGAAAG >MG06110.4 hypothetical protein similar to (NCU09745.1) hypothetical protein (25629 - 24026) ATGGGATTTCTAGGCGCAATAGGGAGAGCATGTTCCGTAGTAAACCTTGACCCGGCCAAT GACCATACGAGCTATCCATGTGCCCTCGACATACGAAATCTTGTCACGCTGGAGGAAATC ATGGGAGACGACAATTTGGGGCCAAACGGTGGCATCCTCTACGCTATTGAAGAGCTGGAG CATAACTTTGAGTGGTTGGAAGATGGTCTGAAAGAGCTTGGGGACGACTACATACTATTC GACTGCCCGGGCCAGGTCGAGCTGTACACACATCACAATTCATTGCGCAATATCTTCTTC AAGTTACAAAAGCTCGGCTACAGACTTGTGGTTGTTCACCTCTCGGACAGCATTTGCCTC ACTCAACCATCGTTGTACATCTCGAATCTCCTCCTCGCTTTGCGCGCCATGCTCCAGATG GATCTTTCCCATGTCAATGTCCTCACCAAAATCGACAAGGTGTCTTCATATGACAGACTA GCCTTCAACCTCGACTTTTATACCGAGGTCCACGATCTTTCGTACCTCCTCCCCGAGCTC GAAGCCGAGAATCCGTCGCTACGCAGCGAAAAGTTCGCCAAGCTAAACCGAGCCGTCGCA AACTTGATTGAAGACTTTGGGCTCGTCCGGTTCGAAGTCTTGGCTGTCGAGAATAAGAAA AGTATGATGCATTTGCTCCGGGTCCTCGATCGTGCCAACGGGTACGTTTTTGGTGGGGCC GAGGGAGCCAACGACACCGTTTGGCAAGTAGCCATGCGCAACGAGGGCTCCCTGATGGGG GTCCAAGATATCCAGGAGCGCTGGATCGATAACAAAGAGGCTTATGACGAGATGGAGCAG CGTGAATGGGAGGAACAGGTCAAGGCACAAGAAGCCATGGCCGAAGCCGATGCAGCAGCT GCTGAAGAGGGCGACGATGACTTGATGGGAGGCCCAGGTGCTCGA >NCU09745.1 (NCU09745.1) hypothetical protein (81475 - 83184) ATGACCTCCCCACTGCCAGTGCAGCAGTTTATGGGCGCCATCGGGCGACAATGCTCGGTA GTCAACCTCGACCCTGCGAACGACCACACCAACTACCCATGCGCGCTCGACATTCGCGAC 
CTTGTCACTTTGGAGGAGATTATGGCAGACGACAAATTGGGTCCCAATGGCGGTATTCTG TACGCACTTGAAGAGCTGGAAAATAACATGGAATGGCTCGAGAACGGCCTCAAGGAGCTT GGAGAAGACTATGTGCTTTTTGACTGCCCTGGTCAAGTCGAGCTCTACACCCACCACAAC TCGTTACGCAACATCTTTTACCGGTTACAGAAGCTGGGCTACAGGCTGGTAGTTGTCCAC CTTTCCGACTGCTTCTGCCTCACACAACCATCGCTCTACATTTCCAACGTCCTCCTCTCT TTGCGCGCCATGTTGCAAATGGACCTTCCCCACATCAACGTCCTGACCAAGATTGACAAG ATCTCGTCCTACGATCCTCTTCCATTCAACCTCGACTATTACACCGAAGTACAAGACCTA CGGTACCTCATGCCGTCCCTCGACGCGGAATCGCCTGCCCTGAAGAAAGGCAAGTTCACC AAGCTTAACGAGGCCGTTGCGAACATGGTTGAGCAGTTCGGCCTTGTCAGCTTCGAGGTG CTGGCAGTCGAGAACAAGAAGAGTATGATGCATCTGTTGCGCGTGATTGACCGTGCAAGT GGGTACGTCTTTGGCGGCGCTGAGGGAACGAACGACACCGTCTGGCAGGTTGCCATGCGC AACGAGTCATCATTGCCCGATGCTCTTGATATTCAAGAGAGGTGGATCGATAGCAAAGAA GAGTATGACGAGATGGAGCGGAAGGAGGAGGAAGAACAAGAAAAACTGCGGGCGGAGCAG GCACGGGCCGCTGAAGAAGCAGGTCTCGGTGACGGCTCGGTCCCTGGAGTGGCGCCACAG TTCACCAGTGGCTCGGGAATCCGTGTGACGCTTAGCCTAGTGGCCGCTTTTACCAAATAT AGCGATCTT >SPAC144.07c SPAC144.07c conserved eukaryotic protein; ATP-binding protein; similar to S. 
cerevisiae YOR262W ATGCCATTTTGTCAAGTGGTCGTTGGACCTCCGGGTTCTGGGAAATCAACTTACTGTTTC GGAATGTACCAATTATTATCTGCCATAGGAAGGAGTAGTATTATCGTCAATCTTGACCCA GCAAATGACTTTATCAAATACCCATGCGCAATTGATATTCGTAAAGTTCTCGATGTTGAG ATGATCCAAAAAGACTATGATTTAGGACCAAATGGAGCACTTATTTATGCTATGGAAGCA ATTGAATATCACGTTGAATGGTTGCTTAAGGAGCTAAAAAAGCATCGAGATTCATATGTG ATATTTGATTGCCCTGGTCAAGTTGAGTTATTTACAAACCATAATTCCTTACAAAAAATA ATCAAAACTTTGGAAAAGGAACTGGATTATAGACCTGTGTCCGTACAACTTGTAGATGCA TATTGCTGCACGAATCCTTCTGCATATGTTAGTGCACTGCTTGTTTGCCTAAAGGGGATG CTTCAGCTGGACATGCCACATGTAAATATTTTGTCGAAGGCTGATTTGCTTTGTACGTAT GGAACTTTACCAATGAAACTAGATTTTTTTACCGAAGTACAAGACCTTTCATATTTGGCG CCTTTGCTTGATAGAGATAAACGTCTTCAGCGCTATAGTGATTTAAACAAAGCTATTTGT GAACTTGTTGAAGATTTTAATCTTGTTTCTTTTGAAGTTGTTGCAGTAGAAAATAAAGCC AGTATGTTACGTGTTCTTCGAAAAATCGATCAAGCAGGTGGATATGCATATGGATCTACA GAAATTGGTGGTGATGCCGTTTGGGTGAATGCCGTTCGTCAAGGTGGAGACCCTCTTCAA GGTATTTCGCCTCAGGAAAGATGGATTGACAAGAAAGAGGAATATGACAAATATGAATGG GAATTAGAGCAAAAATCGACCATGGACGAAGATGAAAATGAAGGG >Sbay_Contig635.43 YOR262W, Contig c635 67551-68594 ATGCCTTTTGCTCAGATTGTTATTGGACCCCCGGGTTCAGGGAAGTCTACGTATTGTAAC GGATGTTCACAATTTTTTAATGCTATTGGGAGACATTCTCAGGTGGTAAATATGGATCCC GCCAATGATGCCTTACCTTATCCGTGTGCTGTGGATATCAGAGATTTTATAACTTTGGAA GAGATCATGAAAGAGCAACACTTGGGCCCTAATGGTGGTTTGATGTATGCCGTTGAATCT CTAGATAAGTCCATTGATTTATTTATACTACAGATCAAATCACTTGTAGAAGAAGAGAAG GCATATGTTGTGTTTGACTGCCCGGGACAAGTTGAGCTGTTTACGCATCATTCTTCATTA TTCAGCATTTTCAAGAAATTAGAAAAAGAACTAGATATGAGATTCTGTGTGGTGAATTTG ATTGATTGTTTTTACATGACATCTCCTTCACAATATGTCTCGATTTTGCTCCTGGCATTA AGGTCTATGCTGATGATGGACCTGCCCCATATCAACGTCTTTTCGAAGATAGATAAGTTG AAATCATATGGAGAATTGCCATTTAGATTAGATTATTATACAGAAGTTCAAGATTTGGAT TATTTGGAGCCGTATATTGAAAAAGAAGGTTCTGGTGCACTGGGAAAAAGATATAGCAAA TTGACTGAAACGATTAGTGAGCTGGTTTCTGATTTTAACCTGGTTTCCTTTGAAGTTTTG GCTGTGGATGACAAAGAAAGTATGATAAATCTCCAGGGTGTTATTGATAAAGCCAATGGT TACATATTTGGTGCATCTGAAGTGGGCGGCGACACGGTATGGGCCGAGGCCTCGAGAGAA GGTGCATTGCTAGCAAGCTATGATATTCAAGATAGGTGGATAGATAATAAAGAAAAATAT 
GATAAAGAAGAACAAGAGAAACGGGCTGCAATGGTGAAAGAGCAGGAACTGCAAAATAAA GAGGTTAATGTAGACGAAGAAGACGAGTGGGAAAATGCACTAAACGACTGGGAAGAAAAA CAAGGCACAGATTTTGTCAGG >Scas_Contig692.20 YOR262W, Contig c692 40768-41811 ATGCCATTTGCCCAAATTGTTATCGGACCCCCCGGTTCAGGAAAATCAACATACTGTAAC GGGTGTTCTCAATTTTTCAACGCCATCGGCAGGCATGGCCAAATAGTGAACATGGATCCA GCTAATGATGCTCTACCATATCCATGTGCAGTAGACATTCGAGATTTTGTGACTCTGGAG GAGATTATGCAAGAGCAACAACTGGGCCCCAATGGAGGGTTGATGTATGCTGTGGAATCG TTAGATGAATCCATCGATCTTTTCATACTACAAATAAAATCTCTAGTTCAAGAGGAGAAG GCATATTTAGTCTTTGATTGTCCTGGACAAGTAGAGTTGTTTACTCATCATTCATCTCTG TTCAAAATCTTCAAAAAATTGGAAAAGGAACTAGATATGCGATTTTGTGTGGTGAATTTG ATTGATTCTTTCTATATTACCTCCCCATCACAGTATGTTTCCATTTTGCTGTTGGCTTTG AGATCTATGTTAATGATGGACCTACCGCAAATCAATGTTTTCTCCAAGATTGATATGCTG AAATCCTATGGAGAACTACCTTTTAGATTGGATTATTACACAGAAGTGCAAGATTTAGAT TATTTACAGCCATTTATTGAGAAGGAGAGTTCCAGTGTTTTGGGTAGAAGATATAGCAAG TTAACAGAAACGATTAGTGAATTGGTTTCCGATTTTAATTTGGTCTCATTTGAAGTCTTA GCTGTAGATGATAAACAAAGCATGATTAATTTACAAAGTGTAGTAGACAAGGCTAATGGA TATATATTTGGAGCATCTGAAGTAGGTGGTGATACTGTTTGGGCAGAAGCCACGCGAGAA GGTGCAATGATGGTAAATTATGATATACAGGACAGATGGATAGATAACAAAGAAAAGTAC GATGAAGAGGAGAGAAAAAGACAAGAGGAACAAGCCAAAGAGCAGAACATGCAAGAAAAG GAGGTAGACGTGGATAATGAGGACGAATGGGAAAAGGCATTGAAGGATTGGGAAGAAAAA CAAGGAACAGGCTATGTAAGG >Sklu_Contig2277.4 YOR262W, Contig c2277 4093-5136 ATGCCCTTTGGTCAGATTGTTATCGGCCCTCCTGGTTCAGGAAAGTCTACCTATTGTAAT GGTTGCTCCCAGTTTTTTAATGCTGTCGGTAGACATGCCCAAGTAATCAACATGGATCCA GCAAATGATTCGTTACCTTACCCATGTGCCGTTGACATTCGAGATTTCATCACCTTAGAG GAAATTATGACAGAACAGCAGCTGGGGCCTAATGGTGGATTGATGTACGCCCTAGAATCT TTGGATAAATCAATCGACTTATTTGTTTTGCAGATCAAATCACTAGTTCAGGATGAACAT GCTTACGTAGTATTTGATTGTCCGGGGCAAGTGGAGCTTTTTACGCACCATTCGTCCTTG TTCCGCATATTCAAGAAGTTGGAAAGAGAACTAGATATGAGGTTATGCGTGGTTAATTTA ATCGATTGTTTTTACATCACCTCTCCTTCACAGTATGTCTCTATTCTTTTGCTAGCTTTG AGGTCGATGCTGATGATGGACTTACCACACATTAATGTCTTTTCTAAAATTGATTTGTTG AAATCCTACGGTGAGCTGCCATTCCGACTAGATTATTATACCGAAGTTCAAGAGCTAGAT TACTTGAAGCCACATATTGACAAGGAAGGGAGCAGCGTCCTTGGAAGGAAATATAGTAGG 
TTGACAGAAACCATTAGTGAACTGGTTTCTGACTTTAATCTGGTTTCCTTTGAAGTTTTG TGTGTTGATGATAAGCAGAGCATGATCAATTTGCAAAGTATTGTGGATAAAGCAAATGGT TACATATTTGGTGTTTCTGAGATCGGTGGAGATACGGTATGGGCAGAGGCAACGCGACAA GGCAGTGCAATTGCTAATTACGACATTCAAGAGAGATGGATAGATAATAAAGATATGTAC GACAGAGAGGAACAGGAAAAACGTGAACAGTTGCTCAAAGAAGAAGAGCTACAGAATAAA GAAGTAGACGTGGATAAAGGTGATGAGTGGGAAAATGCTTTAAAAGAATGGGAAGAAAAG CAAGGCATGAGTTATGTAAAA >Skud_Contig1703.7 YOR262W, Contig c1703 9292-10335 reverse complement ATGCCATTTGCTCAAATTGTTATCGGCCCACCAGGCTCGGGAAAGTCAACGTATTGTAAC GGGTGTTCGCAGTTCTTCAACGCCATTGGAAGACATTCTCAAGTGGTGAATATGGATCCC GCTAATGATGCTTTGCCTTATCCGTGTGCTGTAGATATTAGAGATTTTATAACTTTGGAA GAGGTTATGCAGGAGCAACAGTTGGGTCCTAATGGTGGTTTAATGTATGCCGTTGAATCC CTAGATAACTCCATTGATCTATTCATATTACAGATCAAGTCACTTGTAGAAGAAGAAAAG GCCTACCTTGTGTTTGACTGTCCTGGACAAGTTGAGCTATTCACGCACCATTCATCTTTA TTTAGCATTTTCAAGAAAATGGAGAAAGAATTGGATATGAGATTCTGTGTCGTAAACTTG ATTGATTGCTTTTATATGACATCTCCTTCTCAGTATGTTTCAATTTTGCTACTGGCATTA AGGTCCATGCTAATGATGGATTTGCCTCACATAAACGTTTTTTCCAAAATAGATATGTTA AAATCATATGGGGAATTACCCTTCAGATTGGATTATTATACAGAGGTCCAGGAGCTAGAT CATTTGGAGCCATATATTGAAAAGGAAGGCTCTAGCGTTCTAGGAAAAAAATATAGTAAG TTGACTGAAACGATCAAAGAATTAGTCTCCGATTTTAACTTAGTTTCTTTTGAGGTTCTG TCCGTGGATGACAAAGAAAGTATGATAAATCTCCAGGGTGTTATTGATAAAGCGAATGGC TACATATTCGGAGCATCCGAAGTTGGAGGTGATACAGTGTGGGCCGAAGCTTCGAGAGAA GGTGCATTGTTAGAAAACTACGACATACAGGATAGGTGGATAGATAATAAAGAAACGTAT GATAAAGAAGAACAAGAGAAGCGTGCATCGCTGTTAAAAGAACAAGAACTGCAGAATAAA ACGGTTGATGTGAAAGAAGAAGATGAATGGGAAAATGCATTAAAGGAGTGGGAAGAAAAG CAAGATACGGAGTTTGTCAGA >Smik_Contig1103.1 YOR262W, Contig c1103 447-1490 reverse complement ATGCCGTTTGCTCAGATTGTTATTGGCCCACCGGGTTCAGGCAAGTCCACTTATTGTAAC GGCTGCTCACAGTTCTTCAATGCCATTGGGAGACATTCTCAGGTGGTGAACATGGATCCC GCTAATGATGCTTTGCCTTATCCTTGTGCTGTGGATATCAGAGATTTTATAACGTTGGAA GAGATTATGCAAGAGCAACAGTTAGGCCCCAATGGTGGTTTAATGTATGCAGTCGAATCC TTGGATAAGTCTATTGATTTGTTTTTATTACAGATCAAATCGCTTGTAGAAGAAGAAAAA GCCTATCTTGTATTCGACTGTCCAGGCCAGGTCGAGTTATTTACTCATCACTCATCCTTA 
TTCAATATATTTAAGAAAATGGAGAAAGAATTGGACATGAGGTTCTGTGTAATAAACTTG ATTGACTGTTTTTACATGACGTCACCCTCACAATATGTCTCAATTTTACTGCTTGCACTA AGATCCATGTTGATGATGGATCTGCCCCACATAAATGTTTTTTCTAAGATAGATATGTTG AAATCATATGGAGAACTACCATTTAGACTAGATTATTATACAGAGGTACAGGATCTAGAT TATTTGGAACCGTATATTGAAAAAGAAGGCTCTAGTGTATTAGGAAAGAAATACAATAAG TTGACCGACGCAATCAAAGAGCTTGTTTCTGATTTTAACTTGGTTTCCTTTGAGGTTTTG TCCGTGGATGACAAAGAAAGTATGATAAATCTCCAGGGTGTGATTGATAAAGCAAATGGC TACATATTTGGTGCGTCTGAGGTTGGTGGTGATACAGTGTGGGCAGAGGCTTCTAGGGAA GGTGCTCTTTTAACAAGTTACGATATTCAAGATAGGTGGATAGATAATAAGGAAAAGTAT GACAAAGAAGAAGAAGAGAAACGTGTAATCTTGTTAAAAGAGCAAGAGCTGCAAAATAAA GCAGTTGACGTGAATGAAGACGATGAGTGGGAAAGTGCGCTCAAGGAATGGGAAGAAAAA CAAGGTATGGATTTTGTTAGA >Spar_21273 YOR262W, Contig c261 8817-9860 ATGCCCTTTGCTCAAATTGTTATTGGCCCACCGGGTTCAGGAAAATCAACCTATTGCAAC GGCTGTTCACAGTTTTTCAATGCCATTGGAAGACATTCTCAGGTAGTAAATATGGACCCT GCTAATGATGCGTTACCTTACCCATGTGCTGTGGATATTCGAGATTTTATAACTTTGGAG GAGATTATGCAAGAGCAACAGTTAGGCCCCAATGGTGGTTTGGTGTATGCTGTTGAATCC TTGGATAAGTCCATTGACTTGTTCATATTACAAATCAAGTCGCTTGTAGAAGAAGAAAAG GCATATCTCGTATTTGACTGTCCCGGACAAGTGGAGTTATTTACTCATCACTCATCTTTA TTCAGCATTTTTAAGAAAATGGAAAAAGAATTGGACATGAGATTCTGTGTAGTAAATTTG ATAGACTGTTTTTACATGACTTCTCCTTCACAATACATCTCCATTTTGCTACTCGCATTA AGGTCTATGTTAATGATGGATCTACCCCACATTAACGTTTTTTCTAAGATAGATATGTTG AAATCCTACGGGGAATTACCCTTTAGATTAGATTATTATACAGAGGTTCAGGATCTAGAT TATTTGGAGCCATATATCGAAAAGGAAGGCTCTAGTGTACTGGGAAAGAAATATAGCAAG TTAACTGAGACAATCAAAGAGCTTGTTTCAGATTTCAATCTGGTTTCATTTGAGGTCCTG TCTGTGGATGATAAAGAAAGTATGATAAATCTTCAAGGTGTTATAGATAAAGCAAATGGC TACATATTCGGCGCATCTGAAGTTGGCGGTGATACAGTTTGGGCTGAGGCCTCTAGAGAA GGTGCATTACTAGCAAATTACGACATTCAGGACAGATGGATAGACAATAAAGAGAAGTAC GATAAAGAGGAAGAAGAGAAACGCGCGGCGTTGCTAAAAGAACAAGAGTTGCAAAATAAA GCCGTTGATGTGAATGAAGAGGATGAGTGGGAAAATGCGCTGAAGGAATGGGAAGAAAAA CAGGGTACGGATTTCGTTAGA >YOR262W YOR262W SGDID:S0005788, Chr XV from 817289-818332 ATGCCCTTCGCTCAGATTGTTATTGGTCCACCAGGTTCAGGGAAGTCAACCTATTGCAAC GGCTGCTCACAGTTCTTCAATGCCATCGGAAGACATTCCCAGGTAGTGAATATGGATCCT 
GCTAATGATGCCTTACCTTACCCATGCGCTGTGGATATTCGTGATTTTATAACATTAGAG GAGATCATGCAAGAGCAACAGTTAGGCCCTAATGGAGGTTTGATGTATGCTGTTGAATCA TTGGATAATTCTATTGATTTGTTCATTTTACAGATCAAGTCACTTGTAGAAGAAGAAAAA GCATATCTTGTATTCGACTGTCCGGGCCAAGTGGAGCTATTTACTCATCACTCATCTTTG TTCAACATCTTTAAAAAAATGGAAAAGGAATTGGACATTAGGTTTTGTGTTGTAAATTTG ATTGACTGTTTTTACATGACATCCCCTTCACAATATATCTCGATTTTGTTACTTGCATTG AGGTCTATGTTAATGATGGATCTCCCTCACATCAACGTTTTTTCTAAAATAGATATGCTG AAATCATACGGAGAATTACCCTTTAGATTAGACTATTATACAGAGGTCCAGGATCTGGAT TATTTGGAGCCATATATTGAAAAGGAAGGCTCTAGTGTACTGGGAAAGAAATATAGCAAG TTAACTGAAACAATCAAAGAGCTAGTCTCAGATTTCAACTTAGTATCATTTGAGGTTTTG TCCGTGGATGACAAAGAAAGTATGATAAATCTTCAAGGTGTTATAGATAAAGCAAATGGC TACATATTCGGCGCATCCGAAGTTGGTGGTGATACCGTGTGGGCTGAGGCTTCGCGAGAA GGTGCATTAATAGCGAATTACGACATTCAAGACAGGTGGATAGACAATAAAGAGAAGTAT GATAAAGAAGAAGAAGAAAAACGTACGGCGTTGTTAAAAGAACAAGAATTGCAAAATAAA GCTGTTGATGTGAATGAAGAAGATGAGTGGGAAAATGCGCTGAAGGAGTGGGAAGAGAAA CAAGGAATGGATTTTGTTAGG bug2120.phd100644000766000024 3324014006427353 16175 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBEGIN_SEQUENCE HEX1549K12_01_A01.b_015-1516 BEGIN_COMMENT CHROMAT_FILE: HEX1549K12_01_A01.b_015-1516 ABI_THUMBPRINT: 0 PHRED_VERSION: 0.020425.c CALL_METHOD: phred QUALITY_LEVELS: 99 TIME: Sun Oct 8 12:48:40 2006 TRACE_ARRAY_MIN_INDEX: 0 TRACE_ARRAY_MAX_INDEX: 16302 TRIM: 34 849 0.0500 TRACE_PEAK_AREA_RATIO: 0.0020 CHEM: term DYE: big END_COMMENT BEGIN_DNA g 6 14 c 6 25 g 6 39 c 6 42 c 6 60 g 6 63 t 6 79 g 6 90 t 9 106 g 9 113 g 9 121 g 12 130 g 12 145 g 8 153 c 8 169 c 9 177 t 8 203 t 8 210 a 8 218 t 9 234 g 8 244 c 6 257 t 6 270 a 8 275 g 8 285 c 8 304 g 8 316 a 8 333 a 10 339 t 8 350 c 9 359 g 9 370 c 9 378 c 10 392 t 17 408 t 12 421 c 14 429 t 18 444 t 24 457 c 28 467 t 24 479 c 24 486 c 22 499 c 23 516 c 23 526 t 19 540 a 24 548 a 24 559 g 37 570 t 44 582 c 47 592 t 47 604 t 37 616 g 37 628 t 39 641 a 41 650 c 41 659 g 47 672 t 47 684 c 57 696 g 57 707 t 57 721 c 57 732 t 57 744 t 59 757 g 59 769 t 59 781 g 59 793 g 48 807 g 48 819 a 48 
831 g 48 841 a 48 853 g 57 864 t 57 874 t 57 886 g 68 898 a 68 909 a 68 920 t 68 930 c 68 942 a 68 951 t 68 963 c 68 975 c 68 987 c 50 999 a 50 1007 a 50 1019 t 50 1032 c 68 1044 c 68 1056 t 68 1068 g 68 1081 g 68 1094 a 68 1107 g 57 1118 c 57 1129 a 57 1139 a 57 1149 a 57 1160 g 57 1172 t 59 1184 t 68 1195 c 68 1207 g 68 1220 a 68 1232 g 68 1243 g 68 1256 a 68 1268 a 68 1279 c 68 1289 t 68 1301 t 68 1313 g 68 1325 a 68 1338 c 68 1348 c 68 1360 c 68 1372 t 68 1384 c 68 1396 c 68 1407 c 68 1419 a 57 1428 a 57 1439 g 57 1451 g 59 1463 g 59 1476 g 68 1488 c 68 1500 t 68 1513 t 68 1525 g 68 1536 g 68 1549 t 68 1562 g 68 1574 a 68 1586 g 68 1597 c 68 1608 a 68 1619 g 68 1632 g 68 1644 t 68 1656 c 68 1667 t 68 1679 g 68 1692 c 68 1703 g 68 1716 a 68 1728 g 68 1739 c 68 1751 t 68 1764 g 68 1776 a 68 1788 t 68 1799 c 68 1810 c 68 1822 t 68 1835 t 68 1847 g 68 1859 g 68 1872 t 68 1884 g 68 1896 t 68 1908 t 68 1920 g 68 1932 a 68 1943 t 68 1954 g 68 1966 t 68 1978 a 68 1989 g 68 2001 c 68 2013 t 59 2026 c 59 2038 g 68 2050 c 68 2062 c 68 2074 t 68 2087 c 68 2098 g 68 2110 a 68 2123 t 68 2134 g 68 2146 c 68 2157 t 68 2170 c 68 2182 c 68 2193 c 68 2205 t 68 2218 t 68 2230 c 68 2242 t 68 2254 t 68 2266 c 68 2277 c 68 2289 a 68 2299 c 68 2310 a 68 2320 c 68 2332 a 68 2343 g 68 2356 c 68 2369 t 68 2382 g 68 2393 c 68 2406 g 68 2418 g 68 2432 a 68 2444 t 68 2455 g 68 2467 a 68 2479 a 68 2491 g 68 2503 t 68 2514 g 68 2526 a 68 2539 t 68 2549 a 68 2561 c 68 2572 c 68 2584 t 62 2597 c 62 2609 a 62 2620 g 62 2633 t 62 2646 c 62 2657 g 62 2670 g 52 2683 a 62 2695 t 62 2706 g 62 2718 t 62 2730 g 68 2742 c 68 2753 t 68 2766 t 68 2778 g 68 2790 c 68 2802 t 68 2815 c 68 2826 c 68 2838 g 68 2850 t 68 2863 t 68 2875 c 68 2886 g 68 2898 t 68 2910 g 68 2922 g 68 2934 a 68 2947 a 68 2958 c 68 2969 a 68 2979 c 68 2990 g 68 3003 g 68 3016 g 68 3028 g 68 3040 t 68 3053 t 68 3065 c 68 3077 t 68 3089 t 68 3101 t 68 3113 g 68 3125 c 68 3137 c 68 3149 a 68 3159 g 68 3171 g 68 3184 g 68 3195 c 68 3208 
c 68 3220 a 68 3231 g 68 3243 g 68 3255 g 68 3267 c 68 3280 g 68 3292 g 68 3305 a 68 3318 c 68 3328 t 68 3340 t 68 3352 g 68 3364 c 68 3376 t 68 3389 g 68 3401 t 68 3413 c 68 3425 c 68 3436 a 68 3447 c 68 3458 c 68 3470 a 68 3481 g 68 3494 g 68 3506 a 68 3519 g 68 3531 c 68 3543 t 68 3555 g 68 3567 c 68 3579 a 68 3590 c 68 3601 c 68 3613 g 68 3625 t 68 3638 t 68 3651 c 68 3663 t 68 3675 g 68 3687 g 68 3700 t 68 3711 g 68 3723 t 68 3735 c 68 3746 t 68 3758 t 68 3770 g 68 3782 a 68 3795 a 68 3806 c 62 3817 g 62 3830 a 62 3841 g 62 3853 a 62 3866 a 62 3877 g 62 3889 a 68 3901 t 68 3911 c 68 3923 a 68 3934 c 68 3946 c 68 3958 a 68 3969 a 68 3981 g 68 3994 t 68 4006 a 68 4018 g 62 4030 t 62 4042 c 62 4054 g 62 4067 a 62 4079 g 62 4091 c 62 4103 a 68 4114 a 68 4126 g 68 4138 c 68 4149 c 68 4161 a 68 4173 g 68 4185 a 68 4198 g 68 4209 c 68 4221 g 68 4234 c 68 4245 c 68 4257 t 68 4270 g 68 4282 a 68 4294 g 68 4305 t 68 4318 a 68 4329 c 68 4340 a 68 4351 g 68 4364 g 68 4376 c 68 4388 g 68 4401 g 68 4414 t 68 4426 g 68 4437 g 68 4450 a 68 4463 g 68 4474 g 68 4487 c 68 4498 c 68 4510 g 68 4522 c 68 4534 t 68 4547 a 68 4558 t 68 4569 g 68 4581 t 68 4594 a 68 4605 c 68 4616 t 68 4628 c 68 4640 a 68 4651 g 68 4664 c 68 4676 c 68 4688 t 68 4701 c 68 4713 a 68 4724 c 68 4735 a 68 4746 g 68 4758 c 68 4771 t 68 4783 g 68 4795 g 68 4808 a 68 4821 c 68 4831 a 68 4843 g 68 4855 g 68 4867 g 68 4878 c 68 4891 c 68 4903 a 68 4914 c 68 4926 c 68 4938 a 68 4949 c 68 4961 c 68 4973 t 68 4986 g 68 4998 c 68 5011 t 68 5023 g 68 5035 c 68 5047 t 68 5059 t 68 5071 g 68 5083 a 68 5096 c 68 5106 t 68 5119 g 68 5131 a 68 5143 t 68 5154 t 68 5166 g 68 5179 c 62 5190 c 62 5202 a 62 5213 g 62 5225 c 62 5238 t 62 5250 c 62 5262 a 62 5273 c 68 5284 g 68 5297 a 62 5309 g 62 5321 a 62 5334 c 62 5345 a 62 5356 c 68 5367 t 68 5380 t 68 5392 g 68 5404 c 68 5417 c 68 5429 g 68 5442 a 68 5454 g 68 5465 g 62 5477 a 68 5490 a 68 5502 g 68 5513 a 68 5525 g 68 5537 g 68 5549 a 68 5562 g 62 5573 g 62 5585 a 62 5598 
t 62 5608 c 68 5620 c 68 5632 c 68 5644 g 68 5657 c 68 5669 t 68 5682 c 68 5694 g 68 5706 t 68 5718 g 68 5730 c 68 5742 t 68 5754 c 68 5765 t 68 5777 t 68 5790 g 68 5802 c 68 5813 t 68 5826 g 68 5837 g 68 5849 t 68 5862 g 62 5874 t 62 5886 c 62 5897 g 62 5910 a 62 5922 t 62 5933 g 62 5945 t 68 5956 c 68 5968 g 68 5981 c 68 5992 c 68 6005 g 68 6017 g 68 6030 c 68 6042 g 68 6054 t 68 6067 g 68 6078 g 68 6091 t 68 6103 c 68 6115 g 68 6127 c 68 6138 t 68 6152 g 57 6164 t 57 6176 c 59 6187 g 59 6200 c 59 6212 t 59 6224 g 57 6236 t 68 6248 a 68 6260 c 68 6270 c 68 6282 c 68 6294 g 59 6307 a 59 6320 c 57 6331 g 57 6343 a 57 6356 a 59 6368 g 59 6380 t 59 6391 g 59 6404 t 59 6415 g 68 6428 c 68 6439 c 68 6452 g 68 6464 c 68 6476 c 68 6488 c 59 6500 c 59 6512 a 59 6523 g 59 6535 g 59 6547 a 59 6560 c 59 6571 a 59 6582 c 59 6593 c 59 6606 t 59 6618 a 59 6630 g 59 6642 g 68 6655 g 68 6667 t 68 6680 a 68 6691 g 68 6703 t 68 6715 g 68 6727 g 68 6741 a 68 6752 g 68 6764 g 68 6777 c 68 6788 c 68 6800 g 68 6813 t 68 6825 g 68 6837 g 68 6849 t 68 6862 c 68 6874 g 68 6886 a 68 6899 g 68 6910 g 68 6923 g 68 6935 t 68 6946 c 68 6958 c 68 6970 c 68 6981 t 68 6994 g 68 7006 c 68 7018 t 68 7031 a 68 7042 t 68 7054 g 59 7066 t 59 7078 a 59 7090 g 59 7102 c 59 7114 g 57 7127 g 57 7140 a 57 7152 c 57 7162 g 57 7175 a 57 7187 t 59 7198 c 59 7210 c 68 7223 t 68 7235 c 68 7247 t 57 7259 t 57 7272 c 57 7284 a 57 7294 c 57 7305 t 57 7318 g 68 7330 c 68 7342 c 68 7355 t 68 7367 g 68 7379 c 68 7391 t 68 7404 g 68 7415 g 68 7429 t 68 7439 g 68 7451 t 68 7463 t 59 7475 c 59 7487 c 59 7499 g 59 7512 a 59 7524 c 59 7535 g 59 7547 t 57 7560 t 57 7572 g 68 7583 g 68 7598 t 68 7609 c 68 7621 g 59 7633 c 57 7645 t 54 7657 c 54 7668 c 54 7682 a 54 7692 t 54 7704 g 57 7716 a 59 7729 a 68 7741 c 68 7752 c 68 7764 g 68 7777 a 68 7789 c 68 7801 t 59 7813 g 59 7825 a 59 7838 c 57 7849 a 57 7861 t 59 7872 a 59 7884 g 59 7897 c 59 7909 c 59 7922 g 59 7934 a 48 7946 c 48 7958 g 43 7969 g 43 7983 a 40 7996 g 40 8006 
a 41 8019 a 43 8031 t 43 8041 g 48 8054 c 48 8065 c 46 8078 a 47 8089 a 47 8101 g 59 8113 t 59 8125 c 59 8137 c 43 8149 g 41 8161 g 37 8175 c 41 8186 c 41 8199 g 47 8211 t 47 8223 g 47 8235 t 59 8248 g 47 8260 t 47 8271 g 47 8284 a 47 8297 g 47 8309 t 47 8320 g 59 8333 a 41 8344 g 37 8355 g 37 8368 t 41 8381 a 41 8393 g 44 8405 c 47 8416 g 39 8430 a 39 8442 a 35 8454 g 31 8464 g 31 8478 c 39 8489 t 39 8501 c 47 8513 c 47 8525 c 59 8536 c 47 8548 a 41 8560 c 38 8571 a 34 8582 a 34 8596 g 35 8606 g 32 8620 c 39 8632 g 41 8644 t 37 8656 t 37 8669 g 44 8680 g 44 8694 t 44 8706 a 47 8717 c 47 8728 t 59 8740 g 43 8753 c 41 8765 g 37 8778 t 41 8790 a 37 8802 g 41 8814 c 37 8826 a 59 8838 t 39 8849 c 37 8860 c 37 8873 a 41 8884 c 37 8895 t 36 8907 t 31 8921 c 32 8931 c 32 8945 t 39 8958 c 31 8968 c 32 8982 g 39 8993 t 39 9006 c 43 9017 g 39 9030 t 40 9042 g 40 9054 c 40 9066 t 46 9078 g 47 9091 t 47 9102 c 37 9114 g 37 9127 c 37 9138 g 35 9151 g 32 9163 c 41 9175 t 34 9188 c 34 9200 a 28 9211 g 31 9223 t 24 9234 t 32 9247 t 32 9259 c 35 9271 a 35 9282 g 31 9295 c 29 9305 c 29 9318 t 34 9331 c 41 9343 t 41 9356 c 32 9366 c 37 9378 t 37 9391 c 30 9402 c 25 9414 a 32 9426 t 32 9438 c 25 9449 g 19 9460 g 19 9477 a 22 9487 g 22 9499 t 28 9512 g 34 9524 a 32 9535 g 35 9547 a 35 9560 g 37 9572 c 37 9582 t 41 9595 g 31 9607 g 34 9620 a 35 9633 t 33 9643 g 28 9656 g 25 9669 c 32 9681 a 31 9693 g 37 9706 t 37 9717 c 32 9729 g 27 9741 g 25 9755 t 31 9767 g 31 9779 a 30 9792 g 30 9804 c 25 9816 c 34 9826 c 27 9838 a 32 9851 g 35 9864 c 37 9875 t 39 9888 a 39 9900 g 39 9913 c 35 9924 t 37 9937 c 32 9948 a 25 9960 a 27 9969 c 30 9982 g 34 9995 a 28 10008 t 33 10019 g 33 10031 c 28 10043 g 28 10055 c 28 10069 t 30 10081 t 26 10092 g 29 10106 g 29 10116 c 32 10131 g 17 10143 t 15 10155 a 9 10166 g 9 10174 g 9 10191 c 13 10203 g 14 10215 g 19 10229 a 26 10240 c 13 10252 t 13 10264 g 12 10276 t 18 10289 c 12 10299 g 12 10312 a 12 10326 a 12 10335 g 12 10346 c 12 10360 g 19 10374 c 18 10384 
g 28 10397 a 13 10408 t 11 10420 c 9 10431 c 10 10449 c 8 10456 g 8 10467 g 10 10482 a 14 10497 g 11 10507 t 8 10520 g 8 10535 t 8 10552 t 8 10559 c 8 10566 c 8 10575 t 8 10593 g 12 10607 g 12 10617 t 16 10632 g 16 10644 c 10 10654 a 10 10666 c 10 10681 c 8 10689 t 10 10703 c 8 10716 a 8 10721 a 8 10731 t 13 10751 c 14 10762 c 20 10774 c 15 10785 t 19 10796 a 11 10810 g 11 10823 a 8 10835 t 8 10846 a 6 10862 g 6 10866 a 6 10886 a 6 10894 g 8 10905 g 10 10919 a 11 10930 g 11 10945 a 10 10956 g 10 10965 g 10 10977 a 10 10994 g 10 11007 c 8 11016 c 13 11029 c 16 11045 c 11 11052 a 10 11066 g 12 11073 g 9 11088 t 14 11103 c 10 11116 a 11 11127 c 10 11140 t 12 11147 c 10 11161 a 9 11170 t 10 11184 c 13 11197 t 13 11213 g 12 11223 g 10 11232 a 11 11245 a 11 11256 c 15 11271 g 10 11283 t 10 11294 g 10 11305 g 9 11313 c 9 11329 c 10 11342 t 18 11360 t 12 11370 c 10 11382 a 13 11396 t 12 11406 g 19 11418 t 19 11431 c 23 11443 t 22 11455 t 22 11465 c 18 11480 c 12 11492 t 11 11504 t 10 11512 t 8 11518 g 10 11528 a 8 11545 c 10 11563 g 10 11577 c 10 11588 c 12 11598 g 12 11613 c 12 11624 c 12 11634 a 12 11646 c 11 11658 c 12 11668 t 16 11681 c 15 11697 t 10 11707 g 10 11720 c 10 11736 a 9 11744 t 9 11757 c 10 11767 t 8 11784 t 8 11792 t 8 11802 g 8 11822 g 9 11831 t 8 11841 g 8 11857 c 8 11871 c 8 11879 g 10 11900 g 8 11909 t 9 11921 g 11 11931 a 16 11946 t 10 11958 c 10 11967 a 10 11981 c 10 11997 c 10 12005 a 10 12020 g 11 12032 g 11 12042 t 9 12053 c 9 12070 g 9 12085 t 9 12093 c 9 12105 a 9 12113 a 9 12128 c 9 12146 g 9 12154 t 9 12171 a 9 12179 g 7 12195 a 7 12204 a 8 12212 c 8 12232 c 10 12248 c 11 12258 c 9 12267 a 9 12275 c 9 12293 c 9 12301 a 9 12313 g 8 12324 c 8 12341 a 8 12349 g 8 12369 g 9 12378 g 8 12387 g 8 12397 c 8 12404 t 8 12423 t 9 12434 t 10 12442 t 10 12451 c 8 12466 t 8 12484 t 8 12491 c 7 12503 a 8 12521 c 10 12535 t 10 12549 c 12 12559 c 16 12571 c 17 12580 c 10 12591 c 9 12605 g 10 12623 t 10 12635 c 10 12650 g 14 12662 g 12 12672 t 12 12687 a 11 
12699 g 11 12711 a 11 12723 c 11 12734 g 8 12746 a 6 12759 c 7 12771 a 6 12775 g 10 12794 c 9 12803 c 10 12816 t 13 12835 c 11 12849 a 11 12863 t 12 12874 g 13 12885 c 12 12897 g 9 12908 g 9 12916 g 9 12928 c 9 12946 t 10 12964 c 10 12973 t 10 12983 g 12 12996 c 12 13007 t 12 13021 c 9 13033 g 10 13047 a 10 13063 a 10 13071 a 10 13080 a 8 13089 c 9 13101 c 9 13111 a 9 13130 t 8 13143 c 8 13154 t 6 13174 t 6 13178 c 6 13189 c 6 13199 t 10 13218 t 10 13233 a 9 13246 g 10 13260 c 10 13271 g 10 13281 t 10 13300 g 9 13312 g 7 13322 a 8 13335 a 8 13344 g 8 13348 t 8 13362 c 6 13378 c 8 13388 g 8 13407 g 8 13415 t 8 13433 t 9 13441 t 9 13450 t 8 13459 g 8 13474 g 8 13484 c 8 13490 g 8 13504 t 8 13522 t 10 13531 c 10 13548 c 10 13559 c 11 13574 c 11 13585 a 11 13594 c 11 13607 c 16 13616 c 16 13627 c 16 13640 g 16 13654 g 12 13668 g 15 13676 g 8 13686 g 8 13695 g 9 13714 g 9 13723 c 9 13742 t 9 13760 g 9 13772 c 11 13781 c 12 13790 c 10 13806 c 10 13816 c 9 13834 a 8 13842 g 8 13856 g 8 13868 g 8 13873 c 8 13890 a 8 13911 a 8 13927 a 10 13940 a 13 13956 a 8 13964 a 12 13972 a 8 13984 a 8 14008 c 8 14017 c 8 14029 t 8 14045 t 8 14053 t 8 14060 g 8 14078 c 8 14093 c 9 14102 c 9 14113 c 8 14122 g 8 14142 g 9 14158 g 9 14166 g 14 14180 g 22 14192 g 22 14204 a 15 14215 a 12 14229 c 11 14244 c 10 14258 a 10 14268 c 10 14279 c 8 14289 t 8 14304 t 8 14320 g 8 14326 c 8 14347 c 8 14357 c 7 14366 c 8 14377 c 8 14389 t 6 14393 c 6 14405 c 6 14415 t 10 14433 g 10 14442 g 10 14453 g 10 14468 c 10 14486 g 9 14495 g 9 14508 a 9 14521 a 18 14541 a 11 14552 a 8 14563 a 6 14577 t 6 14584 c 6 14603 t 6 14607 c 6 14611 a 6 14631 a 6 14646 a 8 14651 a 8 14670 c 8 14684 c 8 14698 c 11 14705 c 9 14721 c 10 14731 g 16 14743 g 13 14754 g 12 14764 g 15 14777 g 15 14796 g 10 14805 g 7 14823 g 7 14834 t 6 14842 g 6 14843 g 8 14864 a 6 14875 t 10 14887 g 8 14906 g 8 14920 c 8 14939 c 6 14946 a 6 14956 t 6 14978 a 6 14981 a 6 14993 a 6 15002 a 12 15019 a 12 15029 a 11 15039 c 11 15051 c 11 15064 c 10 
15077 t 10 15088 t 6 15099 c 6 15119 c 6 15130 t 6 15132 c 6 15146 c 8 15157 c 10 15166 c 10 15184 c 10 15196 t 10 15213 t 10 15224 c 10 15233 c 9 15250 g 10 15263 a 11 15274 t 11 15287 c 9 15301 g 9 15313 c 9 15323 g 9 15332 c 6 15343 c 8 15362 c 8 15375 g 8 15381 g 8 15398 t 8 15415 t 8 15420 g 8 15437 a 8 15451 a 11 15465 a 11 15483 a 10 15494 a 11 15505 a 15 15520 a 11 15529 a 10 15539 a 10 15548 g 8 15566 g 8 15577 c 8 15594 g 8 15600 c 8 15605 c 8 15624 c 8 15639 c 8 15654 a 8 15659 c 8 15674 t 8 15690 t 8 15697 t 8 15707 a 8 15719 a 10 15738 a 10 15748 c 10 15758 a 10 15776 c 10 15786 t 8 15800 c 8 15811 c 8 15826 a 8 15833 c 8 15838 c 8 15847 t 8 15860 c 8 15867 a 8 15887 t 7 15901 g 9 15918 g 10 15930 t 10 15939 g 10 15957 g 10 15973 g 13 15985 g 21 15995 g 28 16005 g 13 16020 g 10 16034 g 8 16045 g 8 16052 g 8 16069 a 8 16086 a 8 16096 g 8 16103 a 8 16118 g 8 16140 c 8 16148 a 8 16167 c 8 16177 a 8 16182 c 8 16197 c 8 16214 c 8 16221 c 8 16238 c 13 16253 c 9 16270 c 9 16276 c 9 16297 END_DNA END_SEQUENCE ay149291.gb100644000766000024 423614006427353 16016 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS AY149291 357 bp DNA linear PRI 25-FEB-2003 DEFINITION Homo sapiens neanderthalsensis mitochondrial D-loop hypervariable region I, partial sequence. ACCESSION AY149291 VERSION AY149291.1 GI:28557455 KEYWORDS . SOURCE mitochondrion Homo sapiens neanderthalensis ORGANISM Homo sapiens neanderthalensis Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 357) AUTHORS Schmitz,R.W., Serre,D., Bonani,G., Feine,S., Hillgruber,F., Krainitzki,H., Paabo,S. and Smith,F.H. TITLE The Neandertal type site revisited: interdisciplinary investigations of skeletal remains from the Neander Valley, Germany JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (20), 13342-13347 (2002) PUBMED 12232049 REFERENCE 2 (bases 1 to 357) AUTHORS Serre,D. 
and Paabo,S. TITLE Direct Submission JOURNAL Submitted (05-SEP-2002) Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Inselstrasse, Leipzig D-04103, Germany FEATURES Location/Qualifiers source 1..357 /organism="Homo sapiens neanderthalensis" /organelle="mitochondrion" /mol_type="genomic DNA" /isolate="Neandertal 2" /sub_species="neanderthalensis" /db_xref="taxon:63221" /country="Germany: Neandertal" D-loop <1..>357 /note="hypervariable region I" ORIGIN 1 gttctttcat gggggagcag atttgggtac cacccaagta ttgactcacc catcagcaac 61 cgctatgtat ttcgtacatt actgccagcc accatgaata ttgtacagta ccataattac 121 ttgactacct gcagtacata aaaacctaat ccacatcaac cccccccccc catgcttaca 181 agcaagcaca gcaatcaacc ttcaactgtc atacatcaac tacaactcca aagacaccct 241 tacacccact aggatatcaa caaacctacc cacccttgac agtacatagc acataaagtc 301 atttaccgta catagcacat tacagtcaaa tcccttctcg cccccatgga tgacccc // example.vcf100644000766000024 273614006427353 16537 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data##fileformat=VCFv4.0 ##fileDate=20090805 ##source=myImputationProgramV3.1 ##reference=1000GenomesPilot-NCBI36 ##phasing=partial ##INFO= ##INFO= ##INFO= ##INFO= ##INFO= ##INFO= ##FILTER= ##FILTER= ##FORMAT= ##FORMAT= ##FORMAT= ##FORMAT= #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003 20 14370 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,. 20 17330 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3 20 1110696 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4 20 1230237 . T . 
47 PASS NS=3;DP=13;AA=T GT:GQ:DP:HQ 0|0:54:7:56,60 0|0:48:4:51,51 0/0:61:2 20 1234567 microsat1 GTCT G,GTACT 50 PASS NS=3;DP=9;AA=G GT:GQ:DP 0/1:35:4 0/2:17:2 1/1:40:3 AF165282.gb100644000766000024 602014006427353 15662 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS HSATPCB01 226 bp DNA PRI 17-AUG-1999 DEFINITION Homo sapiens ATP cassette binding transporter 1 (ABC1) gene, exon 12. ACCESSION AF165282 VERSION AF165282.1 GI:5734104 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 226) AUTHORS Rust,S., Rosier,M., Funke,H., Real,J., Amoura,Z., Piette,J.C., Deleuze,J.F., Brewer,H.B., Duverger,N., Denefle,P. and Assmann,G. TITLE Tangier disease is caused by mutations in the gene encoding ATP-binding cassette transporter 1 JOURNAL Nat. Genet. 22 (4), 352-355 (1999) MEDLINE 99364413 PUBMED 10431238 REFERENCE 2 (bases 1 to 226) AUTHORS Rust,S., Rosier,M., Funke,H., Real,J., Amoura,Z., Piette,J.C., Deleuze,J.F., Brewer,H.B., Duverger,N., Denefle,P. and Assmann,G. 
TITLE Direct Submission JOURNAL Submitted (06-JUL-1999) Genomics, Rhone-Poulenc Rorer, 2 rue Gaston Cremieux, Evry 91006, France FEATURES Location/Qualifiers source 1..226 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q31" gene join(<1..226,AF165283.1:1..197,AF165284.1:1..243, AF165285.1:1..242,AF165286.1:1..225,AF165287.1:1..152, AF165288.1:1..163,AF165289.1:1..158,AF165290.1:1..241, AF165291.1:1..93,AF165292.1:1..223,AF165293.1:1..69, AF165294.1:1..134,AF165295.1:1..169,AF165296.1:1..145, AF165297.1:1..119,AF165298.1:1..209,AF165299.1:1..115, AF165300.1:1..53,AF165301.1:1..126,AF165302.1:1..95, AF165303.1:1..190,AF165304.1:1..198,AF165305.1:1..136, AF165306.1:1..165,AF165307.1:1..150,AF165308.1:1..141, AF165309.1:1..83,AF165310.1:1..>264) /gene="ABC1" mRNA join(<16..221,AF165283.1:16..192,AF165284.1:16..238, AF165285.1:16..237,AF165286.1:16..>220) /gene="ABC1" /product="ATP cassette binding transporter 1" exon 16..221 /gene="ABC1" /number=12 CDS join(<16..221,AF165283.1:16..192,AF165284.1:16..238, AF165285.1:16..237,AF165286.1:16..>220) /gene="ABC1" /codon_start=1 /db_xref="GI:5734134" BASE COUNT 69 a 46 c 58 g 53 t ORIGIN 1 ctgttcttct atcagtgtgt caacctgaac aagctagaac ccatagcaac agaagtctgg 61 ctcatcaaca agtccatgga gctgctggat gagaggaagt tctgggctgg tattgtgttc 121 actggaatta ctccaggcag cattgagctg ccccatcatg tcaagtacaa gatccgaatg 181 gacattgaca atgtggagag gacaaataaa atcaaggatg ggtaag // bug2862.pmr100644000766000024 2663414006427353 16246 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataSEQUENCE=atgNaggtgacttttatttacatcctagttattacatgttacgaaaacgacgtaaacgtttaccatattttttttcagatgtctctttggctgcctagtgaggccactgtctacttgcctcctgtcccagtatctaaNgttgtaagcacggatgaatatgttgcacgcacaaacatatattatcatgcaggaacatccagactacttgcagttggacatccctattttcctattaaaaaacctaacaataacaaaatattagttcctaaagtatcaggattacaatacagggtatttagaataNatttacctgaccccttagtgaataagtttggttttcctgacacctcattttaNaatccagatacacagcggctggtttgggcctgtgtaggtgttgaggtaggNcgtggtcagccattaggtgtgggcattagtggccatcctttattaaataaattggatgacacagaaaatgctagtgcttatgcagcaaatgcaggtgtggataatagagaatgtatatctatggattacaaacaaacacaattgtgtttaattggttgcaaaccacctataggggaacactggggcaaaggatccccatgtaNcaatgttgcagtaaNtccaggtgattgtccaccattagagttaataaacacagttattcaggatggtgatatggttgatacNggctttggtgctatggactttactacattacaggctaacaaaagtgaagttccactggatatttgtacNtctatttgcaaatatccagattatattaaaatggtgtcagaaccatatggcgacagcttatttttttatttacgaagggaacaaatgtttgttagacatttatttaatagggctggtNctgttggtgaaaatgtaccagacgatttatacattaaaggctctgggtctactgcaaatttagccagttcaaattattttcctacacctagtggttctatggttacctctgatgcccaaatattNaataaaccttattggttacaacgagcacagggccacaataatggcatttgttggggtaaccaactatttgttactgtNgtNgatactacacgcagtacaaatatgtcattatgtgctgccatatctacttcagaaNctacatataaaaaatactaactttaaNgagtacctacgacatggggaggaatatgatttacagtttatttttcaactgtgcaaaataaccttaactgcagacgttatgacatacatacattctatgaattccactattttggaggactggaattttggtNtacaacctccNccaggaggcacactagaagatacttataggtttgtaacatcccaggcaattgcttgtcaaaaacatacacctccagcacctaaagaagatgatccccttaaaaaataNactttttgggaagtaaatttaaaNgaaaagttttctgcagacctagatcagtttcctttaggacgcaaatttttactacaagcaggattNaaggccaaaccaaaatttacattaggaaaacgaaaagctacacccaccacctcatctacctctacaactgctaaacgcaaaaaacgtaagctgtaa PRIMER_PRODUCT_SIZE_RANGE=200-1200 PRIMER_OPT_SIZE=23 PRIMER_MIN_SIZE=22 PRIMER_MAX_SIZE=25 PRIMER_NUM_NS_ACCEPTED=0 PRIMER_NUM_RETURN=10 PRIMER_MISPRIMING_LIBRARY=mispriming.fasta PRIMER_LIB_AMBIGUITY_CODES_CONSENSUS=0 PRIMER_EXPLAIN_FLAG=1 PRIMER_OPT_TM=60 PRIMER_MIN_TM=59 PRIMER_MAX_TM=61 PRIMER_LEFT_EXPLAIN=considered 4449, 
too many Ns 373, GC content failed 127, low tm 2109, high tm 1284, high any compl 6, high end compl 27, high repeat similarity 31, long poly-x seq 61, ok 431 PRIMER_RIGHT_EXPLAIN=considered 4370, too many Ns 400, GC content failed 120, low tm 2106, high tm 1190, high any compl 2, high end compl 6, high repeat similarity 6, long poly-x seq 58, ok 482 PRIMER_PAIR_EXPLAIN=considered 343, unacceptable product size 320, ok 23 PRIMER_PAIR_PENALTY=0.1407 PRIMER_LEFT_PENALTY=0.081493 PRIMER_RIGHT_PENALTY=0.059210 PRIMER_LEFT_SEQUENCE=tgtgctgccatatctacttcaga PRIMER_RIGHT_SEQUENCE=taggtgctggaggtgtatgtttt PRIMER_LEFT=1116,23 PRIMER_RIGHT=1393,23 PRIMER_LEFT_TM=59.919 PRIMER_RIGHT_TM=59.941 PRIMER_LEFT_GC_PERCENT=43.478 PRIMER_RIGHT_GC_PERCENT=43.478 PRIMER_LEFT_SELF_ANY=4.00 PRIMER_RIGHT_SELF_ANY=2.00 PRIMER_LEFT_SELF_END=3.00 PRIMER_RIGHT_SELF_END=0.00 PRIMER_LEFT_MISPRIMING_SCORE=11.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_MISPRIMING_SCORE=12.00, U45909 Human papillomavirus type 45, isolate IS252, major capsid protein L1 (L1) gene, partial cds. PRIMER_PAIR_MISPRIMING_SCORE=20.00, reverse EF177179 Human papillomavirus type 56 clone Qv22608, complete genome. PRIMER_LEFT_END_STABILITY=6.7000 PRIMER_RIGHT_END_STABILITY=7.0000 PRIMER_PAIR_COMPL_ANY=5.00 PRIMER_PAIR_COMPL_END=1.00 PRIMER_PRODUCT_SIZE=278 PRIMER_PAIR_PENALTY_1=0.1407 PRIMER_LEFT_1_PENALTY=0.081493 PRIMER_RIGHT_1_PENALTY=0.059210 PRIMER_LEFT_1_SEQUENCE=tgtgctgccatatctacttcaga PRIMER_RIGHT_1_SEQUENCE=ttaggtgctggaggtgtatgttt PRIMER_LEFT_1=1116,23 PRIMER_RIGHT_1=1394,23 PRIMER_LEFT_1_TM=59.919 PRIMER_RIGHT_1_TM=59.941 PRIMER_LEFT_1_GC_PERCENT=43.478 PRIMER_RIGHT_1_GC_PERCENT=43.478 PRIMER_LEFT_1_SELF_ANY=4.00 PRIMER_RIGHT_1_SELF_ANY=2.00 PRIMER_LEFT_1_SELF_END=3.00 PRIMER_RIGHT_1_SELF_END=0.00 PRIMER_LEFT_1_MISPRIMING_SCORE=11.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. 
PRIMER_RIGHT_1_MISPRIMING_SCORE=11.00, EF202156 Human papillomavirus type 45 isolate Qv20214, complete genome. PRIMER_PAIR_1_MISPRIMING_SCORE=20.00, EF202156 Human papillomavirus type 45 isolate Qv20214, complete genome. PRIMER_LEFT_1_END_STABILITY=6.7000 PRIMER_RIGHT_1_END_STABILITY=7.0000 PRIMER_PAIR_1_COMPL_ANY=5.00 PRIMER_PAIR_1_COMPL_END=1.00 PRIMER_PRODUCT_SIZE_1=279 PRIMER_PAIR_PENALTY_2=0.1407 PRIMER_LEFT_2_PENALTY=0.081493 PRIMER_RIGHT_2_PENALTY=0.059210 PRIMER_LEFT_2_SEQUENCE=tgtgctgccatatctacttcaga PRIMER_RIGHT_2_SEQUENCE=tttaggtgctggaggtgtatgtt PRIMER_LEFT_2=1116,23 PRIMER_RIGHT_2=1395,23 PRIMER_LEFT_2_TM=59.919 PRIMER_RIGHT_2_TM=59.941 PRIMER_LEFT_2_GC_PERCENT=43.478 PRIMER_RIGHT_2_GC_PERCENT=43.478 PRIMER_LEFT_2_SELF_ANY=4.00 PRIMER_RIGHT_2_SELF_ANY=2.00 PRIMER_LEFT_2_SELF_END=3.00 PRIMER_RIGHT_2_SELF_END=0.00 PRIMER_LEFT_2_MISPRIMING_SCORE=11.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_2_MISPRIMING_SCORE=11.00, U45909 Human papillomavirus type 45, isolate IS252, major capsid protein L1 (L1) gene, partial cds. PRIMER_PAIR_2_MISPRIMING_SCORE=20.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_LEFT_2_END_STABILITY=6.7000 PRIMER_RIGHT_2_END_STABILITY=6.6000 PRIMER_PAIR_2_COMPL_ANY=5.00 PRIMER_PAIR_2_COMPL_END=2.00 PRIMER_PRODUCT_SIZE_2=280 PRIMER_PAIR_PENALTY_3=0.2276 PRIMER_LEFT_3_PENALTY=0.081493 PRIMER_RIGHT_3_PENALTY=0.146140 PRIMER_LEFT_3_SEQUENCE=tgtgctgccatatctacttcaga PRIMER_RIGHT_3_SEQUENCE=aaggggatcatcttctttaggtg PRIMER_LEFT_3=1116,23 PRIMER_RIGHT_3=1410,23 PRIMER_LEFT_3_TM=59.919 PRIMER_RIGHT_3_TM=59.854 PRIMER_LEFT_3_GC_PERCENT=43.478 PRIMER_RIGHT_3_GC_PERCENT=43.478 PRIMER_LEFT_3_SELF_ANY=4.00 PRIMER_RIGHT_3_SELF_ANY=7.00 PRIMER_LEFT_3_SELF_END=3.00 PRIMER_RIGHT_3_SELF_END=1.00 PRIMER_LEFT_3_MISPRIMING_SCORE=11.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_3_MISPRIMING_SCORE=11.00, X74481 Human papillomavirus type 52 genomic DNA. 
PRIMER_PAIR_3_MISPRIMING_SCORE=20.00, DQ057317 Human papillomavirus type 52 isolate HK1151 L1 protein (L1) gene, partial cds. PRIMER_LEFT_3_END_STABILITY=6.7000 PRIMER_RIGHT_3_END_STABILITY=7.9000 PRIMER_PAIR_3_COMPL_ANY=4.00 PRIMER_PAIR_3_COMPL_END=2.00 PRIMER_PRODUCT_SIZE_3=295 PRIMER_PAIR_PENALTY_4=0.2322 PRIMER_LEFT_4_PENALTY=0.173020 PRIMER_RIGHT_4_PENALTY=0.059210 PRIMER_LEFT_4_SEQUENCE=actgcaaatttagccagttcaaa PRIMER_RIGHT_4_SEQUENCE=tttaggtgctggaggtgtatgtt PRIMER_LEFT_4=930,23 PRIMER_RIGHT_4=1395,23 PRIMER_LEFT_4_TM=60.173 PRIMER_RIGHT_4_TM=59.941 PRIMER_LEFT_4_GC_PERCENT=34.783 PRIMER_RIGHT_4_GC_PERCENT=43.478 PRIMER_LEFT_4_SELF_ANY=8.00 PRIMER_RIGHT_4_SELF_ANY=2.00 PRIMER_LEFT_4_SELF_END=2.00 PRIMER_RIGHT_4_SELF_END=0.00 PRIMER_LEFT_4_MISPRIMING_SCORE=10.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_4_MISPRIMING_SCORE=11.00, U45909 Human papillomavirus type 45, isolate IS252, major capsid protein L1 (L1) gene, partial cds. PRIMER_PAIR_4_MISPRIMING_SCORE=19.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_LEFT_4_END_STABILITY=7.3000 PRIMER_RIGHT_4_END_STABILITY=6.6000 PRIMER_PAIR_4_COMPL_ANY=4.00 PRIMER_PAIR_4_COMPL_END=2.00 PRIMER_PRODUCT_SIZE_4=466 PRIMER_PAIR_PENALTY_5=0.2322 PRIMER_LEFT_5_PENALTY=0.173020 PRIMER_RIGHT_5_PENALTY=0.059210 PRIMER_LEFT_5_SEQUENCE=actgcaaatttagccagttcaaa PRIMER_RIGHT_5_SEQUENCE=taggtgctggaggtgtatgtttt PRIMER_LEFT_5=930,23 PRIMER_RIGHT_5=1393,23 PRIMER_LEFT_5_TM=60.173 PRIMER_RIGHT_5_TM=59.941 PRIMER_LEFT_5_GC_PERCENT=34.783 PRIMER_RIGHT_5_GC_PERCENT=43.478 PRIMER_LEFT_5_SELF_ANY=8.00 PRIMER_RIGHT_5_SELF_ANY=2.00 PRIMER_LEFT_5_SELF_END=2.00 PRIMER_RIGHT_5_SELF_END=0.00 PRIMER_LEFT_5_MISPRIMING_SCORE=10.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_5_MISPRIMING_SCORE=12.00, U45909 Human papillomavirus type 45, isolate IS252, major capsid protein L1 (L1) gene, partial cds. 
PRIMER_PAIR_5_MISPRIMING_SCORE=21.00, reverse PPHT39 Human papillomavirus ORFs. PRIMER_LEFT_5_END_STABILITY=7.3000 PRIMER_RIGHT_5_END_STABILITY=7.0000 PRIMER_PAIR_5_COMPL_ANY=4.00 PRIMER_PAIR_5_COMPL_END=3.00 PRIMER_PRODUCT_SIZE_5=464 PRIMER_PAIR_PENALTY_6=0.2322 PRIMER_LEFT_6_PENALTY=0.173020 PRIMER_RIGHT_6_PENALTY=0.059210 PRIMER_LEFT_6_SEQUENCE=actgcaaatttagccagttcaaa PRIMER_RIGHT_6_SEQUENCE=ttaggtgctggaggtgtatgttt PRIMER_LEFT_6=930,23 PRIMER_RIGHT_6=1394,23 PRIMER_LEFT_6_TM=60.173 PRIMER_RIGHT_6_TM=59.941 PRIMER_LEFT_6_GC_PERCENT=34.783 PRIMER_RIGHT_6_GC_PERCENT=43.478 PRIMER_LEFT_6_SELF_ANY=8.00 PRIMER_RIGHT_6_SELF_ANY=2.00 PRIMER_LEFT_6_SELF_END=2.00 PRIMER_RIGHT_6_SELF_END=0.00 PRIMER_LEFT_6_MISPRIMING_SCORE=10.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_6_MISPRIMING_SCORE=11.00, EF202156 Human papillomavirus type 45 isolate Qv20214, complete genome. PRIMER_PAIR_6_MISPRIMING_SCORE=20.00, reverse PPHT39 Human papillomavirus ORFs. PRIMER_LEFT_6_END_STABILITY=7.3000 PRIMER_RIGHT_6_END_STABILITY=7.0000 PRIMER_PAIR_6_COMPL_ANY=4.00 PRIMER_PAIR_6_COMPL_END=3.00 PRIMER_PRODUCT_SIZE_6=465 PRIMER_PAIR_PENALTY_7=0.2545 PRIMER_LEFT_7_PENALTY=0.173020 PRIMER_RIGHT_7_PENALTY=0.081493 PRIMER_LEFT_7_SEQUENCE=actgcaaatttagccagttcaaa PRIMER_RIGHT_7_SEQUENCE=tctgaagtagatatggcagcaca PRIMER_LEFT_7=930,23 PRIMER_RIGHT_7=1138,23 PRIMER_LEFT_7_TM=60.173 PRIMER_RIGHT_7_TM=59.919 PRIMER_LEFT_7_GC_PERCENT=34.783 PRIMER_RIGHT_7_GC_PERCENT=43.478 PRIMER_LEFT_7_SELF_ANY=8.00 PRIMER_RIGHT_7_SELF_ANY=4.00 PRIMER_LEFT_7_SELF_END=2.00 PRIMER_RIGHT_7_SELF_END=3.00 PRIMER_LEFT_7_MISPRIMING_SCORE=10.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_7_MISPRIMING_SCORE=12.00, reverse FJ385268 Human papillomavirus type 58 isolate HPV58sc78, complete genome. PRIMER_PAIR_7_MISPRIMING_SCORE=19.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. 
PRIMER_LEFT_7_END_STABILITY=7.3000 PRIMER_RIGHT_7_END_STABILITY=8.2000 PRIMER_PAIR_7_COMPL_ANY=5.00 PRIMER_PAIR_7_COMPL_END=2.00 PRIMER_PRODUCT_SIZE_7=209 PRIMER_PAIR_PENALTY_8=0.3192 PRIMER_LEFT_8_PENALTY=0.173020 PRIMER_RIGHT_8_PENALTY=0.146140 PRIMER_LEFT_8_SEQUENCE=actgcaaatttagccagttcaaa PRIMER_RIGHT_8_SEQUENCE=aaggggatcatcttctttaggtg PRIMER_LEFT_8=930,23 PRIMER_RIGHT_8=1410,23 PRIMER_LEFT_8_TM=60.173 PRIMER_RIGHT_8_TM=59.854 PRIMER_LEFT_8_GC_PERCENT=34.783 PRIMER_RIGHT_8_GC_PERCENT=43.478 PRIMER_LEFT_8_SELF_ANY=8.00 PRIMER_RIGHT_8_SELF_ANY=7.00 PRIMER_LEFT_8_SELF_END=2.00 PRIMER_RIGHT_8_SELF_END=1.00 PRIMER_LEFT_8_MISPRIMING_SCORE=10.00, J04353 Human papillomavirus type 31 (HPV-31) complete genome. PRIMER_RIGHT_8_MISPRIMING_SCORE=11.00, X74481 Human papillomavirus type 52 genomic DNA. PRIMER_PAIR_8_MISPRIMING_SCORE=19.00, X74481 Human papillomavirus type 52 genomic DNA. PRIMER_LEFT_8_END_STABILITY=7.3000 PRIMER_RIGHT_8_END_STABILITY=7.9000 PRIMER_PAIR_8_COMPL_ANY=3.00 PRIMER_PAIR_8_COMPL_END=2.00 PRIMER_PRODUCT_SIZE_8=481 PRIMER_PAIR_PENALTY_9=0.3690 PRIMER_LEFT_9_PENALTY=0.309788 PRIMER_RIGHT_9_PENALTY=0.059210 PRIMER_LEFT_9_SEQUENCE=atgctagtgcttatgcagcaaat PRIMER_RIGHT_9_SEQUENCE=taggtgctggaggtgtatgtttt PRIMER_LEFT_9=475,23 PRIMER_RIGHT_9=1393,23 PRIMER_LEFT_9_TM=60.310 PRIMER_RIGHT_9_TM=59.941 PRIMER_LEFT_9_GC_PERCENT=39.130 PRIMER_RIGHT_9_GC_PERCENT=43.478 PRIMER_LEFT_9_SELF_ANY=6.00 PRIMER_RIGHT_9_SELF_ANY=2.00 PRIMER_LEFT_9_SELF_END=3.00 PRIMER_RIGHT_9_SELF_END=0.00 PRIMER_LEFT_9_MISPRIMING_SCORE=12.00, EU918766 Human papillomavirus type 33 isolate LZcc12-33, complete genome. PRIMER_RIGHT_9_MISPRIMING_SCORE=12.00, U45909 Human papillomavirus type 45, isolate IS252, major capsid protein L1 (L1) gene, partial cds. PRIMER_PAIR_9_MISPRIMING_SCORE=22.00, reverse PPHT39 Human papillomavirus ORFs. 
PRIMER_LEFT_9_END_STABILITY=7.2000 PRIMER_RIGHT_9_END_STABILITY=7.0000 PRIMER_PAIR_9_COMPL_ANY=5.00 PRIMER_PAIR_9_COMPL_END=2.00 PRIMER_PRODUCT_SIZE_9=919 = AHCYL1.kegg100644000766000024 727714006427353 16171 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataENTRY 10768 CDS H.sapiens NAME AHCYL1 DEFINITION S-adenosylhomocysteine hydrolase-like 1 [EC:3.3.1.1] ORTHOLOG KO: K01251 adenosylhomocysteinase CLASS Metabolism; Amino Acid Metabolism; Methionine metabolism [PATH:hsa00271] Metabolism; Metabolism of Other Amino Acids; Selenoamino acid metabolism [PATH:hsa00450] POSITION 1:join(26813..26932,50794..50905,52974..53117,54122..54222,54657.. 54759,56523..56617,57185..57291,58104..58220,58427..58490,59255.. 59343,59706..59776,60133..60227,60312..60410,60811..60879,61308.. 61386,62491..62611,63434..63440) DBLINKS LocusLink: 10768 GDB: 9958257 NCBI: 21361647 SP: O43865 CODON_USAGE T C A G T 7 8 1 10 8 10 6 2 7 11 1 0 11 8 0 8 C 3 10 2 16 3 8 4 3 2 6 6 17 3 6 7 3 A 9 15 3 14 6 11 15 1 11 11 14 24 2 8 2 4 G 10 9 7 17 18 11 11 3 21 11 10 23 5 11 6 11 AASEQ 530 MSMPDAMPLPGVGEELKQAKEIEDAEKYSFMATVTKAPKKQIQFADDMQEFTKFPTKTGR RSLSRSISQSSTDSYSSAASYTDSSDDEVSPREKQQTNSKGSSNFCVKNIKQAEFGRREI EIAEQDMSALISLRKRAQGEKPLAGAKIVGCTHITAQTAVLIETLCALGAQCRWSACNIY STQNEVAAALAEAGVAVFAWKGESEDDFWWCIDRCVNMDGWQANMILDDGGDLTHWVYKK YPNVFKKIRGIVEESVTGVHRLYQLSKAGKLCVPAMNVNDSVTKQKFDNLYCCRESILDG LKRTTDVMFGGKQVVVCGYGEVGKGCCAALKALGAIVYITEIDPICALQACMDGFRVVKL NEVIRQVDVVITCTGNKNVVTREHLDRMKNSCIVCNMGHSNTEIDVTSLRTPELTWERVR SQVDHVIWPDGKRVVLLAEGRLLNLSCSTVPTFVLSITATTQALALIELYNAPEGRYKQD VYLLPKKMDEYVASLHLPSFDAHLTELTDDQAKYLGLNKNGPFKPNYYRY NTSEQ 1593 atgtcgatgcctgacgcgatgccgctgcccggggtcggggaggagctgaagcaggccaag gagatcgaggacgccgagaagtactccttcatggccaccgtcaccaaggcgcccaagaag caaatccagtttgctgatgacatgcaggagttcaccaaattccccaccaaaactggccga agatctttgtctcgctcgatctcacagtcctccactgacagctacagttcagctgcatcc tacacagatagctctgatgatgaggtttctccccgagagaagcagcaaaccaactccaag ggcagcagcaatttctgtgtgaagaacatcaagcaggcagaatttggacgccgggagatt 
gagattgcagagcaagacatgtctgctctgatttcactcaggaaacgtgctcagggggag aagcccttggctggtgctaaaatagtgggctgtacacacatcacagcccagacagcggtg ttgattgagacactctgtgccctgggggctcagtgccgctggtctgcttgtaacatctac tcaactcagaatgaagtagctgcagcactggctgaggctggagttgcagtgttcgcttgg aagggcgagtcagaagatgacttctggtggtgtattgaccgctgtgtgaacatggatggg tggcaggccaacatgatcctggatgatgggggagacttaacccactgggtttataagaag tatccaaacgtgtttaagaagatccgaggcattgtggaagagagcgtgactggtgttcac aggctgtatcagctctccaaagctgggaagctctgtgttccggccatgaacgtcaatgat tctgttaccaaacagaagtttgataacttgtactgctgccgagaatccattttggatggc ctgaagaggaccacagatgtgatgtttggtgggaaacaagtggtggtgtgtggctatggt gaggtaggcaagggctgctgtgctgctctcaaagctcttggagcaattgtctacattacc gaaatcgaccccatctgtgctctgcaggcctgcatggatgggttcagggtggtaaagcta aatgaagtcatccggcaagtcgatgtcgtaataacttgcacaggaaataagaatgtagtg acacgggagcacttggatcgcatgaaaaacagttgtatcgtatgcaatatgggccactcc aacacagaaatcgatgtgaccagcctccgcactccggagctgacgtgggagcgagtacgt tctcaggtggaccatgtcatctggccagatggcaaacgagttgtcctcctggcagagggt cgtctactcaatttgagctgctccacagttcccacctttgttctgtccatcacagccaca acacaggctttggcactgatagaactctataatgcacccgaggggcgatacaagcaggat gtgtacttgcttcctaagaaaatggatgaatacgttgccagcttgcatctgccatcattt gatgcccaccttacagagctgacagatgaccaagcaaaatatctgggactcaacaaaaat gggccattcaaacctaattattacagatactaa /// BAB68554.gb100644000766000024 342514006427353 15712 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS BAB68554 141 aa linear VRT 11-APR-2002 DEFINITION alpha D-globin [Aldabrachelys elephantina]. ACCESSION BAB68554 PID g15824047 VERSION BAB68554.1 GI:15824047 DBSOURCE accession AB072353.1 KEYWORDS . SOURCE Aldabra giant tortoise. ORGANISM Aldabrachelys elephantina Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Testudines; Cryptodira; Testudinoidea; Testudinidae; Aldabrachelys. REFERENCE 1 AUTHORS Shishikura,F. TITLE The primary structure of hemoglobin D from the Aldabra giant tortoise, Geochelone gigantea JOURNAL Zoolog. Sci. 19, 197-206 (2002) REFERENCE 2 (residues 1 to 141) AUTHORS Shishikura,F. 
TITLE Direct Submission JOURNAL Submitted (29-SEP-2001) Fumio Shishikura, Nihon University School of Medicine, Biology; Oyaguchi-kamimachi, 30-1, Itabashi-ku, Tokyo 173-8610, Japan (E-mail:fshishi@med.nihon-u.ac.jp, Tel:81-3-3972-8111(ex.2291), Fax:81-3-3972-0027) FEATURES Location/Qualifiers source 1..141 /organism="Aldabrachelys elephantina" /db_xref="taxon:167804" /note="synonym:Dipsochelys dussumieri~synonym:Geochelone gigantea" Protein 1..141 /product="alpha D-globin" CDS 1..141 /coded_by="join(AB072353.1:1..92,AB072353.1:307..511, AB072353.1:739..>864)" /note="hemoglobin D" ORIGIN 1 mlteddkqli qhvwekvleh qedfgaeale rmfivypstk tyfphfdlhh dseqirhhgk 61 kvvgalgdav khidnlsatl selsnlhayn lrvdpvnfkl lshcfqvvlg ahlgreytpq 121 vqvaydkfla avsavlaeky r // cysprot.msf100644000766000024 1057414006427353 16635 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataPileUp MSF: 393 Type: P Check: 3400 .. Name: CATH_HUMAN oo Len: 393 Check: 4989 Weight: 10.0 Name: CATH_RAT oo Len: 393 Check: 6434 Weight: 10.6 Name: ALEU_HORVU oo Len: 393 Check: 5288 Weight: 16.7 Name: PAPA_CARPA oo Len: 393 Check: 3242 Weight: 18.9 Name: CATL_HUMAN oo Len: 393 Check: 4708 Weight: 12.2 Name: CATL_RAT oo Len: 393 Check: 6797 Weight: 12.2 Name: CYS1_DICDI oo Len: 393 Check: 1942 Weight: 18.9 // CATH_HUMAN ....MWATLP LLCAGAWLLG VPVCGAAELS VN........ .......... CATH_RAT ....MWTALP LLCAGAWLLS AGAT..AELT VN........ .......... ALEU_HORVU ....MAHARV LLLALAVLAT AAVAVASSSS FADSNPIRPV TDRAASTLES PAPA_CARPA MAMIPSISKL LFVAICLFVY MGLSFGDFSI VG........ .........Y CATL_HUMAN ......MNPT LILAAFCLGI ASATLTFDHS LE........ .......... CATL_RAT ......MTPL LLLAVLCLGT ALATPKFDQT FN........ .......... CYS1_DICDI ......MKVI LLFVLAVFTV FVSSRGIPPE EQ........ .......... 
CATH_HUMAN ........SL EKFHFKSWMS KHRKTYS.TE EYHHRLQTFA SNWRKINAHN CATH_RAT ........AI EKFHFTSWMK QHQKTYS.SR EYSHRLQVFA NNWRKIQAHN ALEU_HORVU AVLGALGRTR HALRFARFAV RYGKSYESAA EVRRRFRIFS ESLEEVRSTN PAPA_CARPA SQNDLTSTER LIQLFESWML KHNKIYKNID EKIYRFEIFK DNLKYIDETN CATL_HUMAN .......... ..AQWTKWKA MHNRLYG.MN EEGWRRAVWE KNMKMIELHN CATL_RAT .......... ..AQWHQWKS THRRLYG.TN EEEWRRAVWE KNMRMIQLHN CYS1_DICDI .......... ..SQFLEFQD KFNKKYS.HE EYLERFEIFK SNLGKIEELN CATH_HUMAN ....NGNHTF KMALNQFSDM SFAEIKHKYL WSEPQNCSAT KS..NYLRGT CATH_RAT ....QRNHTF KMGLNQFSDM SFAEIKHKYL WSEPQNCSAT KS..NYLRGT ALEU_HORVU ....RKGLPY RLGINRFSDM SWEEFQATRL G.AAQTCSAT LAGNHLMRDA PAPA_CARPA ....KKNNSY WLGLNVFADM SNDEFKEKYT GSIAGNYTTT ELSYEEVLND CATL_HUMAN QEYREGKHSF TMAMNAFGDM TSEEFRQVMN GFQ....NRK PRKGKVFQEP CATL_RAT GEYSNGKHGF TMEMNAFGDM TNEEFRQIVN GYR....HQK HKKGRLFQEP CYS1_DICDI LIAINHKADT KFGVNKFADL SSDEFKNYYL NNKEAIFTDD LPVADYLDDE CATH_HUMAN G..PYPPSVD WRKKGNFVSP VKNQGACGSC WTFSTTGALE SAIAIATGKM CATH_RAT G..PYPSSMD WRKKGNVVSP VKNQGACGSC WTFSTTGALE SAVAIASGKM ALEU_HORVU A..ALPETKD WREDG.IVSP VKNQAHCGSC WTFSTTGALE AAYTQATGKN PAPA_CARPA GDVNIPEYVD WRQKG.AVTP VKNQGSCGSC WAFSAVVTIE GIIKIRTGNL CATL_HUMAN LFYEAPRSVD WREKG.YVTP VKNQGQCGSC WAFSATGALE GQMFRKTGRL CATL_RAT LMLQIPKTVD WREKG.CVTP VKNQGQCGSC WAFSASGCLE GQMFLKTGKL CYS1_DICDI FINSIPTAFD WRTRG.AVTP VKNQGQCGSC WSFSTTGNVE GQHFISQNKL CATH_HUMAN LSLAEQQLVD CAQDF..... ...NNYGCQG GLPSQAFEYI LYNKGIMGED CATH_RAT MTLAEQQLVD CAQNF..... ...NNHGCQG GLPSQAFEYI LYNKGIMGED ALEU_HORVU ISLSEQQLVD CAGGF..... ...NNFGCNG GLPSQAFEYI KYNGGIDTEE PAPA_CARPA NEYSEQELLD CDR....... 
...RSYGCNG GYPWSALQLV AQYG.IHYRN CATL_HUMAN ISLSEQNLVD CSGP....QG ....NEGCNG GLMDYAFQYV QDNGGLDSEE CATL_RAT ISLSEQNLVD CSHD....QG ....NQGCNG GLMDFAFQYI KENGGLDSEE CYS1_DICDI VSLSEQNLVD CDHECMEYEG EEACDEGCNG GLQPNAYNYI IKNGGIQTES CATH_HUMAN TYPYQGKDGY .CKFQPGKAI GFVKDVANIT IYDEEAMVEA VALYNPVSFA CATH_RAT SYPYIGKNGQ .CKFNPEKAV AFVKNVVNIT LNDEAAMVEA VALYNPVSFA ALEU_HORVU SYPYKGVNGV .CHYKAENAA VQVLDSVNIT LNAEDELKNA VGLVRPVSVA PAPA_CARPA TYPYEGVQRY .CRSREKGPY AAKTDGVRQV QPYNEGALLY SIANQPVSVV CATL_HUMAN SYPYEATEES .CKYNPKYSV ANDTGFVDIP K.QEKALMKA VATVGPISVA CATL_RAT SYPYEAKDGS .CKYRAEYAV ANDTGFVDIP Q.QEKALMKA VATVGPISVA CYS1_DICDI SYPYTAETGT QCNFNSANIG AKISNFTMIP K.NETVMAGY IVSTGPLAIA CATH_HUMAN FEVT.QDFMM YRTGIYSSTS CHKTPDKVNH AVLAVGYGEK NG.....IPY CATH_RAT FEVT.EDFMM YKSGVYSSNS CHKTPDKVNH AVLAVGYGEQ NG.....LLY ALEU_HORVU FQVI.DGFRQ YKSGVYTSDH CGTTPDDVNH AVLAVGYGVE NG.....VPY PAPA_CARPA LEAAGKDFQL YRGGIFVG.. ..PCGNKVDH AVAAVGYGPN .........Y CATL_HUMAN IDAGHESFLF YKEGIYFEPD ..CSSEDMDH GVLVVGYGFE STESDNNK.Y CATL_RAT MDASHPSLQF YSSGIYYEPN ..CSSKDLDH GVLVVGYGYE GTDSNKDK.Y CYS1_DICDI ADA..VEWQF YIGGVFDIP. ..CNPNSLDH GILIVGYSAK NTIFRKNMPY CATH_HUMAN WIVKNSWGPQ WGMNGYFLIE RGK....NMC GLAACASYPI PLV CATH_RAT WIVKNSWGSN WGNNGYFLIE RGK....NMC GLAACASYPI PQV ALEU_HORVU WLIKNSWGAD WGDNGYFKME MGK....NMC AIATCASYPV VAA PAPA_CARPA ILIKNSWGTG WGENGYIRIK RGTGNSYGVC GLYTSSFYPV KN. CATL_HUMAN WLVKNSWGEE WGMGGYVKMA KDRR...NHC GIASAASYPT V.. CATL_RAT WLVKNSWGKE WGMDGYIKIA KDRN...NHC GLATAASYPI VN. CYS1_DICDI WIVKNSWGAD WGEQGYIYLR RGK....NTC GVSNFVSTSI I.. ay007676.gb100644000766000024 655214006427353 16021 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS AY007676 1389 bp DNA linear BCT 29-OCT-2001 DEFINITION Unknown marine gamma proteobacterium NOR5 16S ribosomal RNA, partial sequence. ACCESSION AY007676 VERSION AY007676.1 GI:12000362 KEYWORDS . 
SOURCE unknown marine gamma proteobacterium NOR5 ORGANISM unknown marine gamma proteobacterium NOR5 Bacteria; Proteobacteria; Gammaproteobacteria. REFERENCE 1 (bases 1 to 1389) AUTHORS Eilers,H., Pernthaler,J., Peplies,J., Glockner,F.O., Gerdts,G. and Amann,R. TITLE Isolation of novel pelagic bacteria from the German bight and their seasonal contributions to surface picoplankton JOURNAL Appl. Environ. Microbiol. 67 (11), 5134-5142 (2001) PUBMED 11679337 REFERENCE 2 (bases 1 to 1389) AUTHORS Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O., Gerdts,G., Schuett,C. and Amann,R. TITLE Identification and seasonal dominance of culturable marine bacteria JOURNAL Unpublished REFERENCE 3 (bases 1 to 1389) AUTHORS Eilers,H., Pernthaler,J., Peplies,J., Gloeckner,F.O., Gerdts,G., Schuett,C. and Amann,R. TITLE Direct Submission JOURNAL Submitted (29-AUG-2000) Molecular Ecology, Max-Planck-Institute, Celsiusstrasse 1, Bremen 28359, Germany FEATURES Location/Qualifiers source 1..1389 /organism="unknown marine gamma proteobacterium NOR5" /mol_type="genomic DNA" /db_xref="taxon:145658" rRNA <1..>1389 /product="16S ribosomal RNA" ORIGIN 1 cgcgaaagta cttcggtatg agtagagcgg cggacgggtg agtaacgcgt aggaatctat 61 ccagtagtgg gggacaactc ggggaaactc gagctaatac cgcatacgtc ctaagggaga 121 aagcggggga tcttcggacc tcgcgctatt ggaggagcct gcgttggatt agctagttgg 181 tggggtaaag gcctaccaag gcgacgatcc atagctggtc tgagaggatg atcagccaca 241 ccgggactga gacacggccc ggactcctac gggaggcagc agtggggaat attgcgcaat 301 gggcgaaagc ctgacgcagc catgccgcgt gtgtgaagaa ggccttcggg ttgtaaagca 361 ctttcaattg ggaagaaagg ttagtagtta ataactgcta gctgtgacat tacctttaga 421 agaagcaccg gctaactccg tgccagcagc cgcggtaata cggaggtgcg agcgttaatc 481 ggaattactg ggcgtaaagc gcgcgtaggc ggtctgttaa gtcggatgtg aaagccccgg 541 gctcaacctg ggaattgcac ccgatactgg ccgactggag tgcgagagag ggaggtagaa 601 ttccacgtgt agcggtgaaa tgcgtagata tgtggaggaa taccggtggc gaaggcggcc 661 tcctggctcg acactgacgc tgaggtgcga aagcgtgggg agcaaacagg attagatacc 721 ctggtagtcc 
acgccgtaaa cgatgtctac tagccgttgg gagacttgat ttcttggtgg 781 cgaagttaac gcgataagta gaccgcctgg ggagtacggc cgcaaggtta aaactcaaat 841 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt taattcgatg caacgcgaag 901 aaccttacca ggccttgaca tcctaggaat cctgtagaga tacgggagtg ccttcgggaa 961 tctagtgaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc 1021 ccgtaacgag cgcaaccctt gtccttagtt gccagcgcgt aatggcggga actctaagga 1081 gactgccggt gacaaaccgg aggaaggtgg ggacgacgtc aagtcatcat ggcccttacg 1141 gcctgggcta cacacgtgct acaatggaac gcacagaggg cagcaaaccc gcgaggggga 1201 gcgaatccca caaaacgttt cgtagtccgg atcggagtct gcaactcgac tccgtgaagt 1261 cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgttcccg ggccttgtac 1321 acaccgcccg tcacaccatg ggagtgggtt gctccagaag tggttagcct aaccttcggg 1381 agggcgatc // atp1.matrix100644000766000024 24134314006427353 16536 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data Last position-specific scoring matrix computed, weighted observed percentages rounded down, information per position, and relative weight of gapless real matches to pseudocounts A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 1 M -2 -2 -3 -4 -2 -1 -3 -4 -2 0 1 -2 8 -1 -3 -2 -1 -2 -2 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 1.01 0.28 2 E 1 -1 -1 -1 -3 0 4 -2 -2 -3 -3 0 -3 -4 -2 3 1 -4 -3 -2 10 0 0 0 0 0 41 0 0 0 0 5 0 0 0 33 8 0 0 3 0.55 0.40 3 L -2 -3 -2 -3 -3 -2 -3 -4 -3 0 1 -3 8 0 -4 -3 0 -3 -3 1 1 0 2 1 0 0 1 0 0 3 8 0 66 2 0 0 7 0 0 9 1.03 0.64 4 S 0 0 3 1 -4 3 2 -2 -2 -4 -4 2 -3 -5 -3 2 1 -5 -4 -4 9 3 13 7 0 15 12 1 0 0 0 13 0 0 0 16 10 0 0 0 0.43 0.68 5 P -2 -4 -5 -3 -3 -4 -4 -5 -5 6 3 -3 1 0 -1 -3 -2 -4 -3 1 2 0 0 2 0 0 0 0 0 59 25 1 2 3 3 1 1 0 0 2 0.82 0.68 6 R -2 4 5 -1 -4 1 -2 -3 -2 -4 -4 2 -3 -5 -3 1 -1 -5 -4 -4 0 27 36 2 0 4 0 0 0 1 0 15 0 0 1 12 2 0 0 0 0.78 0.70 7 A 2 -4 -2 -3 -3 -3 -3 -1 -2 -2 -2 -3 -3 -4 6 1 0 -5 -4 -1 22 0 1 0 0 0 0 4 1 2 4 0 0 0 45 11 7 0 0 3 0.95 0.72 8 A 2 -3 0 4 -3 0 2 -2 -3 -2 -4 -1 -3 -4 -3 3 1 -5 -4 -3 16 0 4 22 0 3 15 
1 0 3 0 2 0 0 0 25 8 0 0 1 0.49 0.72 9 E -3 -2 -1 1 -5 1 7 -4 -2 -3 -4 -1 -4 -5 -3 -2 -3 -5 -4 -4 0 0 1 3 0 2 90 0 0 1 1 1 0 0 0 1 0 0 0 0 1.41 0.72 10 L -3 -5 -5 -5 -3 -4 -5 -5 -5 6 1 -4 0 -1 -5 -4 -2 -4 -1 3 1 0 0 0 0 1 0 0 0 68 8 0 0 1 0 0 0 0 2 19 1.06 0.72 11 T 0 0 -1 -2 -1 -1 -2 -1 -2 -3 -4 -2 -3 -3 -2 5 2 -5 -3 -2 4 4 0 0 1 2 0 2 1 1 1 0 0 0 1 69 15 0 0 1 0.84 0.72 12 N 2 -1 0 1 -4 1 2 -2 -2 -4 -4 3 -3 -4 -3 3 0 -5 -3 -3 15 1 4 8 0 4 13 2 0 0 1 19 0 0 0 30 3 0 1 0 0.46 0.73 13 L -1 -4 -5 -5 -3 -4 -5 -5 -5 6 3 -4 0 -2 -4 -3 -2 -4 -3 2 6 1 0 0 0 0 0 0 0 55 26 0 1 0 0 1 0 0 0 9 0.87 0.72 14 F -3 -4 -4 -5 -3 -4 -5 -6 -5 5 4 -4 0 -1 -5 -4 -2 -4 -3 0 0 0 1 0 0 0 0 0 0 50 46 0 0 1 0 0 1 0 0 0 1.01 0.74 15 E -2 3 -1 -2 -5 1 3 -4 0 -5 -4 6 -3 -5 -3 -2 -3 -5 -4 -4 1 17 1 1 0 4 19 0 2 0 0 55 0 0 0 1 0 0 0 0 0.96 0.74 16 S -1 -1 1 1 -5 5 4 -3 -2 -5 -4 2 -3 -5 -3 1 -1 -5 -3 -4 5 1 7 8 0 27 28 1 0 0 0 12 0 0 0 8 2 0 1 0 0.67 0.75 17 R -2 4 -2 -2 -5 6 1 -3 -2 -4 -3 2 -3 -2 -3 -2 -2 -4 -1 -2 2 26 1 0 0 44 6 1 0 0 1 11 0 1 0 0 2 0 2 4 0.84 0.76 18 I -3 -2 -5 -5 -3 -4 -5 -6 -5 7 1 -4 1 -1 -5 -4 -1 -4 -3 2 0 2 0 0 0 0 0 0 0 74 8 0 3 1 0 0 3 0 0 9 1.10 0.78 19 R 1 0 1 2 -4 1 3 -2 -2 -4 -1 3 -3 -5 -3 0 -1 -5 -4 -4 12 4 6 10 0 4 24 1 0 0 6 23 0 0 0 7 3 0 0 0 0.44 0.78 20 N -2 -1 4 0 -4 2 0 2 -2 -5 -5 3 -4 -5 -3 2 -2 -5 -4 -4 1 2 28 4 0 10 5 15 0 0 0 20 0 0 0 14 1 0 0 0 0.61 0.79 21 F -1 -4 -4 -5 -4 -2 -4 -5 -1 -1 0 -2 -2 5 -5 -2 -1 0 6 0 4 0 0 0 0 1 0 0 1 3 10 2 0 30 0 2 4 1 36 5 0.91 0.78 22 Y -2 -2 2 3 -4 1 3 2 -2 -4 -2 0 -3 -4 -1 -1 0 -4 0 -3 2 1 10 21 0 7 18 14 0 0 4 6 0 0 3 4 7 0 4 1 0.39 0.78 23 A 0 -1 0 1 -3 1 0 -1 -3 -1 -1 1 -2 -1 1 1 2 -4 -3 0 7 1 4 7 0 8 6 3 0 4 7 9 0 2 8 9 16 0 0 6 0.15 0.78 24 N -1 -1 1 1 -4 1 2 0 2 -3 -4 3 -3 -3 -2 1 1 -5 -3 -2 3 1 7 7 0 6 17 8 5 1 1 21 0 1 1 12 8 0 0 2 0.36 0.78 25 F 3 -4 -2 -4 -1 -3 -2 -3 -4 1 1 -2 0 0 -1 1 0 -4 -3 3 25 0 1 0 1 0 2 0 0 9 13 2 1 5 3 11 6 0 0 22 0.32 0.78 26 Q -2 -1 2 3 -5 2 4 -2 -2 -3 -3 1 -3 -3 -2 1 0 -5 -4 -2 1 2 
8 16 0 10 30 2 0 1 1 8 0 1 1 9 6 0 0 2 0.49 0.77 27 V 0 0 -2 -3 -3 0 -2 -4 -4 1 1 -2 2 -1 1 -1 0 -4 -2 4 8 5 2 1 0 4 2 0 0 7 14 1 4 2 7 5 4 0 1 35 0.32 0.77 28 D 0 1 0 1 -4 1 3 -2 0 -3 -3 0 -2 -1 -3 2 1 -4 -2 0 6 7 3 9 0 5 19 2 2 1 0 6 0 3 0 21 8 0 1 8 0.28 0.77 29 E -2 -2 4 2 -5 0 5 -3 -1 -5 -3 -1 -4 -5 -3 -1 0 -5 -4 -4 1 0 23 7 0 1 56 0 1 0 2 1 0 0 0 3 5 0 0 0 0.93 0.76 30 I -2 -4 -2 -3 -3 -1 1 -5 -1 3 -1 -3 -1 -3 -4 -1 2 -4 -1 4 0 0 2 0 0 2 11 0 2 18 4 1 0 0 0 2 15 0 2 40 0.52 0.77 31 G -2 -4 -2 -3 -4 -4 -4 7 -4 -6 -6 -3 -5 -5 -4 -2 -3 -4 -5 -5 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1.92 0.77 32 R -2 3 -1 -3 -4 2 0 -4 -2 -1 -3 2 -3 -3 -3 -1 4 -4 0 -1 0 21 1 0 0 9 5 0 0 4 0 12 0 1 0 2 38 0 3 3 0.59 0.77 33 V -2 -5 -5 -5 -3 -4 -5 -5 -5 4 -1 -4 -1 -3 -4 -4 -2 -5 -3 6 0 0 0 0 0 0 0 0 0 19 0 0 0 0 0 0 0 0 0 81 1.24 0.77 34 V -2 -4 -3 -4 -2 -3 1 -5 -4 3 4 -1 0 -1 -4 -3 0 -4 -3 2 2 0 1 0 0 0 10 0 0 15 44 3 1 1 0 1 5 0 0 17 0.54 0.78 35 S -1 -1 -1 -1 -4 3 1 -2 -1 -4 -4 1 -3 -1 -3 5 1 -4 0 -3 2 2 2 2 0 16 7 1 1 0 0 7 0 3 0 49 6 0 4 0 0.60 0.79 36 V -1 -5 -5 -5 -3 -4 -5 -5 -5 4 0 -4 -1 -2 -5 -3 -2 -5 -2 6 3 0 0 0 0 0 0 0 0 29 4 0 0 1 0 1 1 0 1 60 1.04 0.85 37 G 1 -3 -2 -3 -2 0 -2 6 -3 -5 -5 0 -3 -5 -4 2 -1 -4 -2 -4 9 0 1 0 1 4 1 57 0 0 0 6 0 0 0 16 4 0 2 1 0.90 0.86 38 D -4 -2 -1 8 -6 -2 0 -3 -2 -5 -6 -3 -5 -6 -4 -2 -1 -6 -5 -5 0 1 0 96 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 1.91 0.86 39 G -2 -4 -1 -2 -5 -4 -4 7 -4 -6 -6 -4 -5 -5 -4 -1 -3 -5 -5 -5 0 0 2 1 0 0 0 95 0 0 0 0 0 0 0 2 0 0 0 0 1.84 0.86 40 I -3 -5 -5 -5 -3 -5 -5 -6 -5 7 0 -5 -1 -2 -5 -4 -2 -5 -3 4 0 0 0 0 0 0 0 0 0 75 1 0 0 0 0 0 1 0 0 23 1.34 0.87 41 A 6 -4 -4 -4 0 -3 -3 -2 -4 2 -2 -3 -2 -4 -3 -1 -2 -5 -4 1 72 0 0 0 2 0 0 0 0 14 0 0 0 0 0 0 1 0 0 11 0.96 0.87 42 Q -3 7 -1 -4 -5 1 -1 -4 0 -2 -2 1 -3 2 -4 -3 -3 -4 -2 -4 0 65 2 0 0 5 2 0 2 3 4 6 0 9 0 0 0 0 1 0 1.08 0.88 43 V 0 -5 -5 -5 1 -4 -5 -5 -5 4 -1 -4 -1 -3 -4 -2 -2 -5 -3 5 9 0 0 0 3 0 0 0 0 30 1 0 0 0 0 3 0 0 0 54 0.96 0.88 44 Y -3 -2 -2 -3 -3 -3 -1 -3 5 -3 
-3 -3 -3 1 -4 2 0 -1 7 -4 0 1 1 0 0 0 3 1 17 0 1 0 0 4 0 16 5 0 48 0 1.08 0.88 45 G -2 -2 -2 -3 -3 -4 -4 7 -4 -6 -6 -3 -5 -5 -4 -2 -4 -5 -5 -5 0 2 0 0 1 0 0 97 0 0 0 0 0 0 0 0 0 0 0 0 1.91 0.88 46 L -3 -4 -5 -6 -3 -4 -5 -6 -5 1 6 -5 2 -2 -5 -4 -3 -4 -3 -1 0 0 0 0 0 0 0 0 0 3 92 0 3 0 0 1 0 0 0 1 1.33 0.88 47 N 0 -1 2 3 -5 -1 3 0 -3 -5 -4 2 -4 -5 1 1 -1 -5 -4 -4 6 3 12 20 0 1 20 6 0 0 2 12 0 0 8 9 3 0 0 0 0.48 0.88 48 E -3 -2 6 2 -5 1 2 0 -1 -5 -5 1 -4 -5 -4 -1 -2 -6 -4 -5 1 1 51 12 0 6 11 5 1 0 0 8 0 0 0 3 1 0 0 0 0.94 0.88 49 I 2 -4 -5 -4 2 -4 -4 -4 -5 3 -1 -4 -1 -3 -4 -3 -2 -5 -3 5 22 0 0 1 4 0 0 0 0 20 1 0 0 0 0 0 0 0 0 52 0.88 0.88 50 Q 0 1 -3 -4 -4 5 -1 -3 -3 -2 -1 1 7 -1 -4 -2 -3 -4 -3 -2 7 5 0 0 0 32 1 2 0 1 3 8 38 2 0 0 0 0 0 1 0.86 0.88 51 A 4 -3 -2 -4 -3 1 -3 -3 -3 -1 0 -3 -1 0 -3 1 -2 -3 2 0 48 0 2 0 0 6 0 0 0 3 12 0 1 4 0 11 1 0 8 4 0.51 0.88 52 G -2 -4 1 0 -5 -2 1 6 2 -6 -5 -3 -2 -5 -4 0 -3 -3 -2 -5 0 0 6 3 0 1 11 63 6 0 0 0 1 0 0 8 0 0 1 0 1.05 0.87 53 E -3 -2 -2 0 -6 2 7 -4 -2 -5 -5 -1 -4 -5 -3 -2 -3 -5 -4 -5 0 0 0 0 0 6 94 0 0 0 0 0 0 0 0 0 0 0 0 0 1.68 0.87 54 M -1 -4 -5 -5 -3 -3 -4 -5 -4 0 4 -1 7 -1 -5 -4 -3 -4 -3 1 5 0 0 0 0 0 0 0 0 2 40 4 41 1 0 0 0 0 0 6 1.03 0.88 55 V -3 -5 -5 -5 -1 -4 -5 -6 -5 3 3 -5 -1 -2 -5 -4 -2 -5 -3 6 0 0 0 0 1 0 0 0 0 14 24 0 0 0 0 0 0 0 0 61 1.05 0.88 56 L -1 1 -2 -1 -3 0 6 -4 -2 -3 -1 1 -3 -3 -3 -1 -1 -5 -4 -4 3 6 0 1 1 2 67 0 0 1 6 6 0 1 0 3 3 0 0 0 0.97 0.90 57 F -4 -5 -5 -6 -4 -5 -5 -5 -4 0 1 -5 -2 8 -6 -4 -4 -1 1 -2 0 0 0 0 0 0 0 0 0 6 10 0 0 84 0 1 0 0 0 0 1.74 0.90 58 A 1 -3 -1 3 -4 -1 3 -2 -1 -4 -5 -1 -4 -5 4 1 -2 -5 -4 -3 14 0 3 16 0 2 23 1 1 0 0 1 0 0 24 13 0 0 0 1 0.69 0.90 59 N -2 -3 4 3 -4 -2 1 3 0 -5 -5 -1 -4 -5 -4 3 -1 -5 -3 -5 0 0 20 15 0 1 9 20 2 0 0 3 0 0 0 28 2 0 1 0 0.68 0.90 60 G -2 0 3 1 -5 -3 -1 6 -3 -6 -6 -1 -5 -5 -4 0 -3 -5 -5 -5 0 6 13 7 0 0 3 62 0 0 0 2 0 0 0 5 0 0 0 0 1.09 0.90 61 V -1 -2 -3 -2 -3 -3 -3 -4 2 3 1 -3 -1 -3 -4 1 3 -5 -3 3 3 3 0 2 0 0 1 0 5 19 13 1 0 0 0 10 17 0 0 26 0.44 0.90 
62 K -3 1 -3 -4 -5 0 0 -4 -2 0 0 4 0 3 -4 -2 -2 -3 4 -1 0 6 0 0 0 4 4 0 0 5 7 33 3 16 0 2 2 0 14 4 0.52 0.90 63 G 0 -4 -2 -3 -5 -4 -4 7 -4 -6 -6 -4 -5 -5 -4 -2 -4 -5 -5 -5 5 0 0 0 0 0 0 95 0 0 0 0 0 0 0 0 0 0 0 0 1.88 0.89 64 M -3 -4 -5 -5 -3 -1 -4 -5 -4 4 2 -4 8 0 -5 -4 -3 -4 -2 1 0 0 0 0 0 3 0 0 0 26 16 0 45 2 0 0 0 0 1 8 1.08 0.91 65 A 6 -4 -4 -4 0 -3 -3 -3 -4 0 -3 -3 -2 -4 -3 -1 -1 -5 -4 3 70 0 0 0 1 0 0 0 0 5 0 0 0 0 0 1 2 0 0 20 0.96 0.91 66 L -3 -4 -5 -5 -4 0 -4 -5 -2 -1 5 -4 4 2 -5 -3 -3 -3 -2 0 1 0 0 0 0 4 0 0 1 0 67 0 12 8 0 1 1 0 0 5 0.96 0.92 67 N -3 -3 8 0 -5 -2 0 -1 -2 -5 -6 -2 -4 -5 -4 -1 -1 -6 -4 -5 0 0 88 1 0 0 4 2 0 0 0 0 0 0 0 2 2 0 0 0 1.71 0.92 68 L -4 -4 -6 -6 -3 -4 -5 -6 -5 1 6 -5 0 -1 -3 -5 -3 -4 -3 0 0 0 0 0 0 0 0 0 0 4 92 0 0 1 2 0 0 0 0 2 1.32 0.92 69 E -3 -1 1 3 -6 0 6 -3 -2 -5 -5 -2 -4 -5 -3 -2 -3 -5 -1 -5 0 2 7 16 0 1 72 1 0 0 0 0 0 0 0 0 0 0 3 0 1.30 0.93 70 N 0 1 2 0 -4 1 3 -4 -1 -3 -4 1 -2 -5 0 1 -1 -5 -2 0 7 9 10 4 0 7 23 0 1 1 0 8 1 0 5 12 2 0 1 10 0.35 0.93 71 E -3 -2 1 7 -5 -2 3 -3 -2 -5 -6 0 -5 -6 -4 0 -1 -6 -5 -5 0 1 7 62 0 0 17 1 1 0 0 4 0 0 0 5 3 0 0 0 1.22 0.93 72 N -2 -2 6 -1 -4 0 0 -1 0 -4 -3 -2 -4 0 -4 2 0 -5 -1 -2 2 0 49 2 0 5 5 3 2 1 1 0 0 5 0 16 5 0 2 2 0.78 0.93 73 V -2 -5 -5 -5 -2 -4 -5 -5 -5 3 -1 -4 0 -3 -5 -4 -1 -5 -3 6 1 0 0 0 1 0 0 0 0 13 0 0 1 0 0 0 2 0 0 83 1.30 0.93 74 G -2 -2 -1 -2 -5 -3 -4 7 -4 -6 -6 -3 -5 -5 -4 -1 -4 -5 -5 -5 1 3 2 1 0 1 0 91 0 0 0 0 0 0 0 3 0 0 0 0 1.75 0.95 75 I 3 -5 -5 -5 2 -4 -4 -1 -5 4 -1 -4 -2 -3 -4 -3 -2 -5 -4 4 26 0 0 0 5 0 0 5 0 28 1 0 0 0 0 0 1 0 0 35 0.75 0.95 76 V -1 -5 -5 -5 -3 -4 -5 -2 -5 3 0 -4 -1 -3 -5 -3 -2 -5 -3 6 3 0 0 0 0 0 0 4 0 12 7 0 0 0 0 1 0 0 0 73 1.10 0.97 77 V -3 -5 -5 -6 -3 -5 -5 -4 -5 4 4 -5 1 -2 -5 -4 -3 -4 -3 4 0 0 0 0 0 0 0 1 0 27 39 0 3 0 0 0 0 0 0 29 0.95 0.97 78 F -3 -4 -4 -4 -3 -3 -4 -4 -3 1 4 -4 5 5 -4 -3 -3 -2 0 -1 1 1 0 1 0 0 1 1 0 7 37 1 17 29 1 1 1 0 1 1 0.87 0.89 79 G -2 -3 2 1 -4 -3 -3 6 -3 -5 -5 -3 -4 -4 -3 -2 -3 -4 -4 -4 1 1 9 10 0 0 1 71 0 1 1 
1 0 0 1 1 1 0 0 1 1.26 0.90 80 G -2 -2 3 5 -4 0 2 -2 -2 -4 -4 0 -4 -3 3 1 -2 -4 -2 -4 2 1 12 32 0 3 11 2 0 1 1 5 0 1 13 11 1 0 2 1 0.62 0.90 81 D -2 -3 -2 4 -4 -1 1 1 -2 -2 -4 0 -3 0 -2 0 0 -3 4 -3 3 0 0 27 0 3 8 12 0 3 1 6 0 4 2 6 5 0 18 1 0.46 0.89 82 T -1 3 1 1 -3 -1 0 -3 -3 -2 0 1 -2 -2 -3 1 3 -4 -2 0 3 16 8 7 0 1 5 1 0 2 10 7 0 1 1 10 19 0 1 7 0.24 0.88 83 A 1 -1 1 2 -4 0 1 2 3 -3 -1 0 -1 -2 -3 0 -1 -4 -3 -3 13 4 7 11 0 3 8 15 9 1 8 4 1 2 1 6 4 0 0 1 0.22 0.87 84 I -3 -4 -4 -4 -3 -4 -4 -4 -4 6 2 -4 -1 -2 -4 -3 0 -4 -3 3 1 1 0 1 0 0 1 1 0 55 14 1 0 0 1 1 6 0 0 17 0.89 0.84 85 K -1 1 -1 -2 -4 2 0 -3 0 -2 -1 5 -2 -3 -3 1 0 -4 -1 -2 4 5 1 1 0 11 4 1 2 2 8 37 0 0 1 13 4 0 2 3 0.45 0.86 86 E 0 -2 -2 -1 -4 2 6 -3 -2 -4 -4 -1 -3 -4 -2 0 -2 -4 -3 -2 9 0 0 1 0 6 68 1 0 0 1 1 0 0 1 6 1 0 0 2 1.04 0.86 87 G -2 -4 -1 -2 -4 -3 -3 7 -3 -5 -5 -3 -4 -4 -3 -2 -3 -4 -4 -4 1 0 3 1 0 0 1 86 0 0 1 1 0 0 1 2 1 0 0 1 1.58 0.86 88 D -1 -3 -1 5 -4 2 0 -2 -2 -3 -3 -2 2 -3 -3 2 1 -4 -3 -2 5 0 1 38 0 9 4 2 1 0 1 1 7 1 1 14 9 0 0 3 0.57 0.86 89 L -2 0 -1 -2 -3 -1 0 -3 0 1 1 2 0 -2 -1 1 2 -4 -2 0 1 4 3 1 0 3 7 1 2 10 17 13 1 1 3 10 16 0 1 6 0.19 0.86 90 V 0 -4 -4 -4 1 -3 -4 -4 -4 2 -1 -4 -1 -3 -4 -3 -2 -4 -3 6 7 0 0 1 3 0 1 1 0 9 1 1 0 0 1 1 1 0 0 74 1.01 0.82 91 K -2 2 0 -2 -4 -1 0 -2 -1 -3 -3 5 0 -3 -3 1 -1 -2 1 -2 1 11 3 0 0 1 4 3 1 1 2 49 3 0 0 9 3 0 5 2 0.63 0.83 92 R 1 5 -2 -3 4 -1 -2 1 -3 -3 -3 -1 -3 -4 -1 0 -1 -4 -3 -1 12 44 0 1 10 1 1 12 0 0 1 1 0 0 2 5 4 0 0 5 0.66 0.81 93 T -2 -2 -2 -3 1 -2 -3 -3 -3 -2 -2 -2 1 -3 -3 1 7 -4 -3 -2 1 2 0 1 3 0 1 1 0 0 3 1 5 0 0 6 75 0 0 1 1.07 0.81 94 G -2 -2 1 -1 -4 -2 -1 6 0 -4 -4 0 -4 -2 -3 -1 -3 -3 2 -4 1 1 8 3 0 1 3 58 2 0 1 5 0 1 0 4 1 0 9 1 0.88 0.81 95 S 0 4 0 -1 -4 2 0 -3 1 -3 -4 4 -3 -3 -2 2 -1 -4 -3 -3 7 21 3 3 0 8 4 1 3 1 1 24 0 1 1 18 4 0 0 1 0.49 0.81 96 I -3 -4 -2 -2 -3 -4 -4 -4 -4 6 2 -4 -1 1 -4 -3 -2 -4 -2 2 1 0 2 3 0 0 1 1 0 53 17 1 0 5 0 1 1 0 0 13 0.76 0.78 97 V 3 -3 -4 -4 -3 -3 -3 -3 -4 1 2 -3 2 0 -3 -2 -2 -3 -2 4 26 0 0 0 0 0 1 
1 0 7 20 1 6 5 0 1 1 0 0 30 0.51 0.78 98 D -1 -1 -1 4 -4 2 3 -2 0 -4 -4 1 -3 -3 -3 2 -2 -4 -1 -4 4 1 1 21 0 10 23 2 2 0 1 8 0 1 0 21 1 0 2 1 0.54 0.79 99 V -1 -4 -4 -4 -3 -3 -4 -4 -4 4 1 -3 -1 0 -4 -3 1 -4 -3 5 2 0 0 0 0 0 1 1 0 20 9 1 0 3 0 1 9 0 0 51 0.74 0.77 100 P -2 -2 0 -1 -4 -1 -2 -2 -3 -4 -2 0 -3 -3 7 -2 -3 -4 2 -3 1 1 5 3 0 3 1 2 0 0 6 5 0 0 61 1 1 0 8 1 1.33 0.78 101 A 1 -4 -4 -4 -2 -3 -4 -4 -4 1 2 -3 -1 -2 -3 -2 -1 -4 -3 5 10 0 0 0 0 0 1 1 0 2 18 1 0 0 0 2 2 0 0 60 0.76 0.76 102 G -1 -4 -2 -2 -4 -3 -3 7 -3 -4 -4 -3 -4 0 -3 -1 -3 -4 -4 -4 2 0 0 2 0 0 1 81 0 0 1 0 0 5 0 4 0 0 0 1 1.37 0.77 103 K -1 2 0 3 -4 -1 3 -3 -1 -3 -4 3 -2 -4 0 -2 -2 -4 0 -3 5 10 4 18 0 1 24 1 1 2 1 21 1 0 4 1 0 0 4 1 0.51 0.77 104 A 3 -3 -1 -1 -2 0 3 2 -1 -4 -3 -2 -3 0 -3 -1 -2 -3 2 -2 25 0 2 3 1 4 21 19 1 0 1 1 0 4 0 3 2 0 9 2 0.39 0.77 105 M 0 -3 -4 -4 -3 -3 -3 -4 -3 0 4 -3 5 0 -4 0 -1 -3 0 1 6 0 0 0 0 0 0 1 0 3 43 0 16 4 0 9 2 0 3 10 0.52 0.72 106 L -1 -3 -4 -4 -1 -2 -3 -4 -4 2 5 -2 1 -1 -3 0 -2 -3 -3 0 5 0 0 0 1 1 1 1 0 13 58 2 2 1 1 9 1 0 0 3 0.59 0.71 107 G -1 -2 -2 -3 -4 -3 -3 6 -3 -4 -4 -3 -4 -4 -3 1 -3 -4 -4 -4 1 2 0 0 0 0 0 78 0 0 1 0 0 0 0 10 0 0 0 0 1.27 0.71 108 R -3 7 -2 -3 4 -1 -2 -4 -2 -4 -3 1 -3 -4 -3 -1 -2 -4 -3 -4 1 76 0 0 9 0 0 0 0 0 1 5 0 0 0 5 0 0 0 0 1.26 0.69 109 V -2 -4 -4 -4 -3 -4 -4 -5 -4 3 -1 -4 -1 2 -4 -3 -1 -4 -2 6 0 0 0 0 0 0 0 0 0 13 1 0 0 8 0 0 2 0 0 72 0.98 0.72 110 V -2 -4 -4 -4 -3 -4 -4 -4 -4 3 0 -4 3 -2 -4 -3 -1 -4 -3 6 2 0 0 0 0 0 0 0 0 19 3 0 9 0 0 0 2 0 0 62 0.90 0.72 111 D -3 -3 5 6 -4 -2 1 -3 -2 -5 -5 -2 -4 -5 -3 0 -1 -5 -4 -4 0 0 34 44 0 0 7 0 0 0 0 0 0 0 0 6 2 0 0 0 1.02 0.79 112 A 3 -3 -3 -3 -3 -3 -3 1 -4 -4 -4 -3 -3 -4 6 1 1 -5 -4 -2 29 0 0 0 0 0 0 8 0 0 0 0 0 0 37 9 9 0 0 2 0.86 0.79 113 M -3 -1 -4 -5 -3 -4 -4 -5 -4 1 5 -4 1 -2 -4 -3 1 -4 -3 -1 0 4 0 0 0 0 0 0 0 5 71 0 2 0 0 0 12 0 0 2 0.86 0.81 114 G 0 -4 -3 -3 -4 -4 -4 7 -4 -5 -5 -3 0 -5 -4 -1 -3 -5 -5 -2 7 0 0 0 0 0 0 83 0 0 0 0 3 0 0 4 0 0 0 3 1.50 0.87 115 V -2 1 4 -1 -5 2 2 0 3 -3 
-3 2 -2 -2 -4 -2 -1 -5 -2 -2 1 6 20 1 0 11 16 8 9 1 2 12 1 1 0 1 2 0 1 3 0.43 0.96 116 P -1 -4 -4 1 -1 -1 0 -4 -4 -3 0 -3 3 1 6 -2 -1 -5 -4 -2 4 0 0 7 2 2 8 0 0 0 8 0 8 6 47 2 4 0 0 2 0.95 0.96 117 I -3 -5 -5 -6 -3 -5 -5 -6 -5 6 3 -5 2 2 -5 -5 -3 -4 -3 2 0 0 0 0 0 0 0 0 0 53 26 0 5 7 0 0 0 0 0 9 1.04 0.96 118 D -3 -3 -1 5 -4 -2 -1 -4 -2 -2 0 -3 -2 -2 -3 -2 2 -3 3 1 0 0 0 49 0 0 0 0 0 0 11 0 0 0 0 0 13 0 15 13 0.69 0.61 119 G 1 2 2 2 -4 -2 2 1 -3 -4 0 -2 -3 -4 -4 1 -2 -5 -4 -3 13 13 13 13 0 0 13 12 0 0 13 0 0 0 0 13 0 0 0 0 0.33 0.78 120 K -2 1 -1 -1 -5 0 0 -2 -3 -4 0 6 -3 -4 -2 -1 -2 -5 -4 -3 0 6 2 2 0 1 3 2 0 0 10 68 0 0 1 3 0 0 0 2 0.88 0.77 121 G -2 -1 -2 -2 -4 -3 -1 7 -4 -6 -5 -2 -4 -5 -2 0 -3 -5 -5 -4 0 3 0 1 0 0 3 83 0 0 0 1 0 0 2 6 0 0 0 0 1.42 0.77 122 A 2 -1 0 1 -4 -1 3 -3 -3 -4 -4 -2 -4 -4 5 0 -2 -5 -1 -3 23 3 3 9 0 1 19 0 0 0 1 1 0 0 30 6 1 0 2 1 0.68 0.77 123 L -3 -4 -5 -5 -3 -4 -2 -6 -5 6 3 -4 0 -1 -3 -4 -3 -4 -3 2 0 0 0 0 0 0 3 0 0 51 31 0 0 1 1 0 0 0 0 12 0.90 0.76 124 S -1 -2 2 2 -4 2 1 -1 -1 -3 -2 2 -3 -1 -1 1 -1 -5 -3 -1 4 0 12 12 0 9 10 5 1 1 4 17 0 2 3 9 4 0 0 6 0.22 0.76 125 D 2 -3 0 1 0 -2 -1 0 -1 -3 -4 0 -3 -2 0 1 3 -4 -2 -1 17 0 4 10 2 0 3 7 2 1 0 6 0 1 4 12 23 0 1 3 0.27 0.76 126 H -1 -2 1 3 -4 -1 2 -1 2 -3 -3 2 -3 -4 -3 2 2 -5 -3 -2 2 0 5 16 0 2 14 5 5 1 1 15 0 0 0 16 13 0 0 4 0.33 0.76 127 E -1 -1 -1 1 -3 1 4 -1 1 -2 -2 2 -1 -3 -2 -1 0 -4 1 -3 6 1 2 6 0 4 31 4 3 2 3 17 1 0 1 4 5 0 6 0 0.34 0.73 128 Q -2 5 -1 -1 -4 0 0 -3 -1 -3 -2 1 -2 1 -4 1 2 0 1 -3 1 34 2 3 0 2 4 0 1 0 3 5 1 8 0 11 18 1 4 0 0.44 0.72 129 R 0 5 -2 -1 -4 -1 -1 -2 -3 -3 0 0 3 -3 -4 2 -1 0 -3 -2 6 38 0 3 0 1 1 2 0 0 8 5 8 1 0 16 3 2 0 3 0.45 0.72 130 R -1 4 -3 -3 -4 -1 -1 -4 -2 -2 -1 0 -1 -3 6 -1 -3 -5 -4 -2 3 31 0 1 0 2 2 0 1 1 8 3 1 1 39 3 0 0 0 2 0.87 0.70 131 V -1 -4 -5 -5 -1 -4 -4 -5 -5 4 1 -4 0 -2 -4 -4 -2 -4 -3 5 3 0 0 0 1 0 0 0 0 32 12 0 0 0 0 0 0 0 0 49 0.86 0.69 132 E -3 -2 -2 2 -5 1 7 -2 -2 -5 -4 -1 -4 -1 -3 -2 -3 -4 -3 -4 0 0 0 7 0 1 83 3 0 0 0 0 0 3 0 0 0 0 0 0 
1.24 0.69 133 V 0 3 -3 -4 1 -2 -2 -4 -3 1 0 1 -1 -3 -4 0 0 -4 -3 3 8 17 0 0 3 0 1 0 0 8 8 10 1 1 0 9 6 0 0 28 0.30 0.69 134 K -2 -1 -2 -2 -4 1 -2 -2 -3 1 -2 5 -1 -4 3 -2 -2 -5 -3 1 1 0 1 2 0 5 0 2 0 9 1 49 2 0 15 1 0 0 0 12 0.61 0.69 135 A 6 -3 -3 -3 -2 -2 -3 -2 -3 -3 -3 -2 -3 -4 -3 0 0 -4 -4 -2 96 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 3 0 0 0 1.19 0.69 136 P -1 -4 -4 -3 -4 -3 -3 -4 -3 -4 -3 -3 -4 -4 8 -2 -2 -5 -3 -4 3 0 0 0 0 0 0 0 0 0 2 0 0 0 91 0 2 0 1 0 2.33 0.67 137 G 0 -4 0 -2 -4 -3 -4 7 -4 -5 -5 -3 -4 -5 -2 0 -2 -4 -5 -5 5 0 3 0 0 0 0 86 0 0 0 0 0 0 1 4 1 0 0 0 1.47 0.67 138 I -2 -4 -5 -5 -3 -4 -5 -5 -5 6 0 -4 -1 -2 -4 -4 -2 -4 -3 4 0 0 0 0 0 0 0 0 0 62 0 0 0 0 0 0 0 0 0 35 1.07 0.66 139 L -2 -4 -5 -5 -2 -4 -5 -5 -5 6 1 -4 5 -2 -4 -4 -2 -4 -3 2 1 0 0 0 0 0 0 0 0 65 3 0 18 0 0 0 0 0 0 11 1.01 0.68 140 E 1 -2 -2 2 -3 1 3 -3 -2 -4 -4 -1 -3 -1 4 1 0 0 -1 -2 12 0 0 11 0 5 21 0 1 0 0 2 0 4 22 10 6 1 3 2 0.47 0.68 141 R -3 8 -2 -3 -5 -1 -1 -4 -2 -2 -4 1 -3 -4 -4 -2 -3 -4 -3 -4 0 96 0 0 0 0 1 0 0 3 0 0 0 0 0 0 0 0 0 0 1.63 0.66 142 K -1 1 -2 0 -4 4 1 -3 -1 -2 -3 5 -2 -4 -3 -1 -2 -4 -3 -3 3 6 0 6 0 26 6 0 1 3 1 41 0 0 0 4 1 0 0 1 0.67 0.67 143 S -1 -3 -1 -2 -3 -2 -2 -1 -3 -4 -3 -2 -2 -3 4 6 0 -5 -4 -3 0 0 0 0 0 0 0 2 0 0 2 1 1 1 19 73 1 0 0 0 0.98 0.67 144 V -2 -4 -5 -5 -2 -4 -4 -5 -5 2 -1 -4 -1 -2 -4 -3 -2 -5 -3 7 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 98 1.26 0.66 145 H -2 1 3 2 1 -2 -1 -2 6 -4 -4 -1 -3 -2 -3 1 1 -4 0 -4 2 7 18 13 4 0 1 1 26 0 0 3 0 1 0 13 9 0 3 0 0.59 0.66 146 E -2 0 -2 -1 -5 4 6 -4 -2 -4 -4 0 -3 -4 -3 -2 -2 -4 -1 -1 0 3 0 0 0 18 70 0 0 0 0 2 0 0 0 0 1 0 2 5 1.09 0.66 147 P -2 -4 -2 -3 -5 -3 -3 -4 -4 -5 -5 -3 -4 -5 8 -2 -3 -5 -5 -4 0 0 2 0 0 0 0 0 0 0 0 0 0 0 96 2 0 0 0 0 2.49 0.65 148 M -3 -4 -5 -5 -3 -3 -4 -5 -4 1 4 -4 7 0 -3 -4 -2 -3 -2 2 0 0 0 0 0 0 0 0 0 3 38 0 37 3 1 0 0 0 0 18 0.89 0.66 149 Q -1 -1 -1 -2 -4 7 1 -3 -1 -3 -1 -1 -1 -4 -2 -1 -2 -4 -2 -3 4 0 2 0 0 74 6 0 0 1 8 0 0 0 1 2 0 0 1 0 1.07 0.66 150 T -2 -3 -2 -3 -3 -2 -3 -3 -3 -1 -2 -2 -2 -2 -3 
0 7 -4 -3 -2 0 0 0 0 0 0 0 0 0 3 1 0 0 2 0 2 92 0 0 0 1.24 0.66 151 G -1 -3 -2 -3 -4 -3 -4 7 -4 -6 -5 -3 -4 -5 -4 -2 -3 -4 -5 -5 0 1 0 0 0 0 0 99 0 0 0 0 0 0 0 0 0 0 0 0 1.81 0.66 152 L -3 -4 -5 -5 -3 -4 -5 -5 -4 5 4 -4 0 0 -5 -3 -2 -3 1 1 0 0 0 0 0 0 0 0 0 40 45 0 0 2 0 1 1 0 6 5 0.82 0.66 153 K -2 0 0 -3 -4 -1 -1 -3 -3 -1 -1 6 -2 -4 -3 -2 0 -5 -3 -1 2 0 4 0 0 0 0 0 0 3 7 72 0 0 0 0 4 0 0 5 0.82 0.66 154 A 5 -3 -3 -3 1 -3 -3 -2 -3 0 -2 -2 -2 -4 -2 3 -1 -4 -3 1 61 0 0 0 3 0 0 0 0 4 1 0 0 0 1 21 1 0 0 9 0.71 0.66 155 V -2 -5 -5 -5 -3 -4 -5 -5 -5 6 0 -4 -1 -2 -4 -4 -2 -4 -3 5 0 0 0 0 0 0 0 0 0 57 3 0 0 0 0 0 0 0 0 40 1.08 0.66 156 D -3 -3 0 8 -5 -2 0 -3 -3 -5 -5 -2 -5 -5 -3 -2 -3 -6 -5 -5 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.86 0.66 157 S 4 -3 -2 -3 2 -2 -2 -1 -3 -3 -3 -2 -2 -4 -3 5 0 -4 -3 -2 37 0 0 0 4 0 0 1 0 0 0 0 0 0 0 53 3 0 0 1 0.80 0.66 158 L -3 -4 -5 -5 -3 -3 -4 -5 -4 0 4 -4 7 -1 -2 -4 -3 -3 -3 -1 0 0 0 0 0 0 0 0 0 2 49 0 47 0 2 0 0 0 0 0 1.12 0.67 159 V -2 -4 -5 -5 -3 -4 -4 -5 -5 5 0 -4 -1 1 -4 -3 0 -4 -2 5 0 0 0 0 0 0 0 0 0 40 1 0 0 5 0 0 6 0 0 47 0.90 0.67 160 P -2 -4 -4 -3 -4 -3 -3 -4 -4 -4 -4 -3 -4 -5 8 -2 -3 -5 -5 -4 2 0 0 0 0 0 0 0 0 0 1 0 0 0 96 1 0 0 0 0 2.50 0.67 161 I -3 -5 -5 -5 -3 -4 -5 -6 -5 7 0 -4 -1 -2 -5 -4 -2 -4 -3 3 0 0 0 0 0 0 0 0 0 80 0 0 0 0 0 0 0 0 0 20 1.24 0.69 162 G -1 -4 -2 -3 -2 -4 -4 7 -4 -6 -6 -3 -5 -5 -3 -2 -3 -4 -5 -5 0 0 0 0 1 0 0 99 0 0 0 0 0 0 1 0 0 0 0 0 1.82 0.70 163 R -3 7 -2 -3 1 -1 -2 -4 -2 -5 -3 1 -3 -5 -4 -2 -3 -5 -4 -4 0 92 0 0 3 0 0 0 0 0 2 4 0 0 0 0 0 0 0 0 1.58 0.71 164 G -2 -4 -2 -3 -4 -4 -4 7 -4 -6 -6 -3 -5 -5 -4 -2 -3 -4 -5 -5 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1.88 0.71 165 Q -3 -1 -2 -2 -5 8 0 -4 -1 -5 -4 0 -2 -5 -3 -2 -2 -4 -3 -4 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.74 0.71 166 R -3 8 -2 -3 -5 -1 -2 -4 -2 -5 -4 1 -3 -5 -4 -3 -3 -5 -4 -4 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.79 0.71 167 E -3 -2 -2 0 -6 2 7 -4 -2 -5 -5 -1 -4 -5 -3 -2 -3 -5 -4 -4 0 0 0 0 0 6 94 0 0 0 0 0 0 0 0 0 0 0 
0 0 1.55 0.71 168 L -3 -4 -5 -5 -3 -4 -5 -6 -5 0 6 -4 0 -1 -2 -4 -3 -4 -3 -1 0 0 0 0 0 0 0 0 0 0 96 0 0 0 2 0 0 0 0 0 1.26 0.76 169 L -3 -5 -5 -5 -3 -5 -5 -6 -5 7 0 -5 -1 -2 -5 -4 -3 -5 -3 2 0 0 0 0 0 0 0 0 0 90 2 0 0 0 0 0 0 0 0 7 1.41 0.77 170 I -3 -5 -5 -5 -1 -5 -5 -6 -5 7 1 -5 0 -2 -5 -4 -3 -5 -3 1 0 0 0 0 1 0 0 0 0 92 4 0 0 0 0 0 0 0 0 2 1.41 0.78 171 G -2 -4 -2 -3 -5 -4 -4 7 -4 -6 -6 -3 -5 -5 -4 -2 -4 -4 -5 -5 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1.94 0.79 172 G -4 -4 0 8 -6 -2 0 -3 -3 -5 -6 -3 -5 -6 -3 -2 -3 -6 -5 -5 0 0 2 96 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 1.90 0.80 173 R -3 8 -2 -4 -5 1 -2 -4 -2 -5 -4 1 -3 -5 -4 -3 -3 -5 -4 -5 0 96 0 0 0 3 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1.79 0.81 174 Q -1 -1 0 -2 -5 8 0 -3 -1 -5 -4 1 -2 -5 -3 -2 -2 -4 -3 -4 3 0 3 0 0 87 0 0 0 0 0 4 0 0 0 1 1 0 0 0 1.52 0.81 175 T -2 -3 -2 -3 -1 -3 -3 -4 -4 -1 -3 -3 -2 -4 -3 0 7 -4 -4 -1 0 0 0 0 1 0 0 0 0 2 0 0 0 0 0 2 94 0 0 2 1.43 0.81 176 G -2 -4 -2 -3 -5 -4 -4 7 -4 -6 -6 -3 -5 -5 -4 -2 -4 -5 -5 -5 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1.96 0.81 177 K -2 0 -2 -2 -5 0 -1 -3 -2 -4 -4 7 -3 -5 -3 -2 -2 -4 -3 -4 0 0 0 0 0 0 0 0 0 0 0 95 0 0 0 0 0 0 0 0 1.47 0.77 178 T -2 -3 -2 -3 -3 -2 -3 -3 -3 -3 -3 -2 -2 -4 -3 0 7 -4 -3 -2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 93 0 0 0 1.45 0.82 179 T 5 -2 -3 -3 -2 -1 -2 -2 -3 -3 -3 -2 -3 -4 -3 2 1 -4 -3 -2 68 1 0 0 0 2 0 0 0 0 0 0 0 0 0 18 8 0 0 0 0.92 0.82 180 I -3 -4 -5 -5 -3 -4 -5 -5 -5 6 1 -4 0 -2 -4 -4 -2 -4 -3 4 0 0 0 0 0 0 0 0 0 50 11 0 1 0 0 0 0 0 0 34 1.04 0.85 181 A 6 -3 -3 -4 4 -2 -3 -1 -4 -2 -3 -3 -3 -4 -3 -1 -2 -4 -4 -2 84 0 0 0 8 1 0 3 0 2 1 0 0 0 0 0 0 0 0 0 1.15 0.85 182 I -3 -4 -5 -5 -3 -4 -5 -5 -5 6 2 -4 0 -2 -4 -4 -1 -4 -3 3 0 0 0 0 0 0 0 0 0 58 14 0 1 1 0 0 3 0 0 19 0.99 0.85 183 D -3 -3 -1 8 -5 -2 0 -3 -3 -5 -5 -3 -5 -5 -3 0 -3 -6 -5 -5 0 0 0 92 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 1.84 0.88 184 T 3 -3 -3 -3 0 -3 -3 -3 -4 0 -3 -3 -1 -4 -3 -1 6 -4 -4 -2 26 0 0 0 2 0 0 0 0 7 0 0 2 0 0 1 61 0 0 0 0.96 0.88 185 I -3 -5 -5 -5 -3 -5 -5 -5 -5 7 0 
-4 0 -1 -5 -4 -3 -4 -3 1 0 0 0 0 0 0 0 0 0 88 1 0 1 2 0 0 0 0 0 4 1.40 0.88 186 L -3 -4 -5 -5 -3 -4 -5 -5 -5 6 4 -4 0 -2 -4 -2 -3 -4 -3 1 0 0 0 0 0 0 0 0 0 50 38 0 0 0 1 4 0 0 0 5 0.98 0.88 187 N -3 -2 8 -1 -5 -2 -2 -3 1 -5 -5 -2 -4 -5 -4 -1 -2 -6 -4 -5 2 1 93 0 0 0 0 0 2 0 0 1 0 0 0 1 0 0 0 0 1.86 0.93 188 Q -3 -1 -2 -2 -5 8 0 -4 -1 -5 -3 -1 -2 -5 -3 -2 -3 -4 -3 -4 0 0 0 0 0 97 0 0 1 0 2 0 0 0 0 0 0 0 0 0 1.87 0.93 189 K -2 2 -1 -3 -5 0 -1 -3 -3 -4 -4 7 -3 -5 -3 -2 -3 -5 -4 -2 2 9 1 0 0 1 0 0 0 0 1 81 0 0 0 0 0 0 0 2 1.31 0.89 190 Q -1 3 1 0 -3 2 1 0 1 -2 -1 1 -2 -1 -1 0 -1 -2 2 -2 5 17 6 5 1 8 8 8 3 2 6 9 1 3 3 5 2 0 7 2 0.17 0.69 191 I -1 -2 -2 -2 -2 -1 -1 -2 -2 2 0 0 2 2 -1 0 2 2 0 1 4 2 2 2 1 2 3 3 1 16 8 5 5 9 2 9 14 2 3 8 0.16 0.61 192 N -1 -1 6 -1 -2 -1 -1 -1 4 -2 -2 -1 -2 -2 -2 -1 -1 -2 0 -2 4 2 48 2 1 2 3 3 10 2 4 2 1 2 2 3 2 1 4 3 0.70 0.56 193 S 0 -1 1 2 -1 0 0 -1 -1 -1 0 0 -1 -1 -1 1 -1 3 -1 -1 7 3 7 14 1 5 6 5 1 3 8 4 1 2 3 13 4 4 2 5 0.10 0.35 194 R 0 2 0 0 -1 1 2 -1 -1 -1 -1 2 -1 -1 -1 0 0 -1 -1 -1 6 11 5 3 1 6 16 4 1 3 6 13 1 2 3 5 5 1 2 4 0.13 0.40 195 A 0 -1 0 -1 -1 -1 -1 2 -1 1 -1 -1 -1 -1 0 2 0 0 -1 -1 6 3 3 3 1 2 4 17 1 8 7 4 1 2 5 19 5 1 2 5 0.12 0.42 196 T 0 0 1 1 -1 0 -1 1 -1 -1 -1 0 -1 -1 0 0 2 0 -1 -1 8 4 6 9 1 3 4 12 1 3 7 6 1 3 4 8 13 1 2 4 0.08 0.40 197 S -1 0 2 4 -2 -1 0 0 -1 -2 -1 -1 -1 -2 -1 1 -1 -2 -1 -1 5 6 9 25 1 2 6 7 1 3 5 4 1 2 3 9 4 1 2 4 0.22 0.40 198 E 0 -1 -1 0 -2 0 4 -1 2 -2 -1 0 -1 -2 1 0 1 -2 -1 -1 7 3 3 5 1 3 27 4 6 3 6 5 1 2 6 6 9 1 2 4 0.22 0.44 199 S -1 -1 1 2 -2 -1 -1 3 -2 -3 -3 3 -3 -3 -2 2 -1 -3 -3 -3 4 1 6 12 1 1 2 22 0 1 2 19 0 1 1 21 3 0 1 1 0.39 0.66 200 E -2 0 2 1 -3 4 2 -1 -2 -3 -3 3 -3 -4 -2 1 -2 -3 -3 -3 2 2 9 8 0 20 16 3 0 2 2 21 0 1 1 11 1 0 1 1 0.45 0.71 201 T -1 -1 1 4 -4 0 -1 2 -3 -4 -4 3 -3 -1 0 0 0 -4 -3 -3 4 1 5 23 0 3 2 15 0 0 1 25 0 4 5 5 4 0 0 1 0.44 0.75 202 M -3 -4 -5 -5 -1 -4 -5 -5 -5 3 3 -4 3 -2 -5 -4 -2 -4 -3 5 0 0 0 0 1 0 0 0 0 17 30 0 6 0 0 0 1 0 0 45 0.86 0.79 203 Y -3 -1 -3 -4 -4 -2 -3 
-5 1 3 -1 1 -2 2 -3 -3 0 -2 6 0 0 3 1 0 0 1 0 0 3 24 3 11 0 6 1 0 5 0 37 6 0.71 0.80 204 C -2 0 -4 -5 11 -4 -5 -3 -5 -3 -3 -4 -3 -4 -5 -1 -3 -4 -4 -2 0 6 0 0 87 0 0 2 0 0 0 0 0 0 0 4 0 0 0 1 2.31 0.78 205 V -3 -5 -5 -5 -3 -5 -5 -6 -5 6 -1 -5 -1 1 -5 -4 -2 -4 -3 5 0 0 0 0 0 0 0 0 0 54 0 0 0 5 0 0 0 0 0 42 1.13 0.79 206 Y -4 -4 -4 -5 -5 -3 -4 -5 2 -3 -3 -4 -3 1 -5 -4 -4 0 9 -3 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 96 0 2.18 0.83 207 V -2 -5 -5 -5 1 -4 -5 -3 -5 1 -1 -4 -1 -3 -4 -4 -2 -5 -3 7 1 0 0 0 3 0 0 2 0 1 2 0 0 0 0 0 1 0 0 90 1.27 0.87 208 A 6 -4 -3 -4 0 -3 -3 -1 -4 -3 -4 -3 -3 -3 -1 0 0 -5 -4 -2 82 0 0 0 2 0 0 3 0 0 0 0 0 1 3 4 4 0 0 0 1.09 0.88 209 I -3 -5 -5 -5 0 -5 -5 -6 -5 7 -1 -5 1 -2 -5 -4 -3 -5 -3 3 0 0 0 0 2 0 0 0 0 78 0 0 2 0 0 0 0 0 0 18 1.36 0.90 210 G -2 -5 -3 -4 -5 -4 -4 7 -4 -5 -6 -4 -5 -5 -4 -2 -4 -5 -5 -2 0 0 0 0 0 0 0 96 0 0 0 0 0 0 0 0 0 0 0 4 1.91 0.93 211 Q -3 -1 -2 -2 -5 8 0 -4 0 -5 -3 -1 -2 -5 -3 -2 -1 -4 -3 -4 0 0 0 0 0 92 0 0 2 0 3 0 0 0 0 0 4 0 0 0 1.72 0.93 212 K -3 4 -2 -3 -6 -1 -2 -4 -3 -5 -5 7 -4 -5 -3 -3 -3 -5 -4 -5 0 17 0 0 0 0 0 0 0 0 0 83 0 0 0 0 0 0 0 0 1.59 1.04 213 R 3 5 -1 -1 0 1 -1 -1 -3 -2 -3 0 -2 -3 -4 0 -2 -5 -4 -4 27 36 2 3 2 7 4 3 0 2 1 3 1 1 0 6 2 0 0 0 0.60 1.04 214 S 1 -3 -2 -2 -3 -2 -2 -1 -3 -4 -4 -2 -3 -4 -3 6 1 -4 -4 -3 8 0 0 0 0 0 1 4 0 0 1 0 0 0 0 76 6 0 0 1 1.11 0.99 215 T 0 -3 0 -3 -3 -1 -1 -2 -3 -3 -3 -1 -3 -4 -3 3 6 -4 -2 -3 8 0 3 0 0 1 3 2 0 0 1 3 0 0 0 22 52 0 2 1 0.82 0.99 216 V -2 -4 -5 -5 -3 -4 -4 -3 -5 3 -1 -4 0 -3 -4 -4 -2 -4 -3 6 2 0 0 0 0 0 0 3 0 16 3 0 2 0 0 0 0 0 0 70 1.15 1.08 217 G 5 2 -2 -4 -3 -1 -2 -2 -3 -2 -1 -2 -1 -4 -3 -1 -2 -4 -4 -1 63 12 2 0 0 3 2 1 0 2 5 2 1 0 0 2 1 0 0 3 0.80 1.15 218 Q 0 1 2 -1 -4 6 0 -1 1 -4 -3 0 -3 -5 -4 0 -1 -4 -4 -3 8 8 9 2 0 45 4 4 3 0 2 3 0 0 0 6 4 0 0 2 0.75 1.17 219 L -1 -4 -2 -5 -3 -1 -4 -5 -5 4 2 -4 0 -1 -4 -2 -1 -4 -3 4 6 0 2 0 0 3 0 0 0 24 22 0 2 2 0 3 3 0 0 31 0.65 1.16 220 I -2 -2 -4 -4 0 -1 -1 -5 -1 2 -1 -2 -2 -3 -4 -2 -1 -4 1 6 3 2 0 0 2 3 3 0 2 7 4 2 0 
0 0 2 2 0 6 56 0.70 1.19 221 Q -1 1 2 0 -5 4 2 -2 0 -4 -2 3 -3 -5 -4 -1 1 -5 -4 -4 5 7 8 4 0 19 15 3 2 0 3 19 0 0 0 3 7 0 0 0 0.44 1.12 222 T 0 0 -1 -3 -4 1 -1 -4 -4 3 -1 1 0 -4 -4 0 3 -5 -4 0 8 6 3 0 0 6 4 0 0 22 4 9 3 0 0 5 23 0 0 5 0.37 1.12 223 L -3 -5 -5 -5 -4 -4 -5 -6 -5 2 4 -5 -1 3 2 -4 -3 -4 -2 1 0 0 0 0 0 0 0 0 0 11 53 0 0 13 12 0 0 0 0 10 0.86 0.94 224 E 0 3 0 0 -4 1 2 -4 2 -1 -4 0 -3 -5 2 1 0 -5 -4 -1 7 16 4 4 0 7 16 0 5 4 0 4 0 0 10 13 4 0 0 4 0.32 0.94 225 E -1 0 1 2 -5 2 5 -2 0 -5 -5 2 -4 -5 -4 -1 -1 -5 -4 -2 4 5 5 13 0 7 40 2 2 0 0 11 0 0 0 4 2 0 0 4 0.66 1.00 226 A 2 0 3 -3 -4 -1 0 -2 3 -4 -2 1 -1 1 -4 -1 1 -4 2 -2 20 6 15 0 0 2 7 2 9 0 4 8 2 5 0 2 8 0 8 2 0.31 1.00 227 N -1 -4 2 3 -5 -1 -1 6 0 -6 -6 -1 -5 -5 -4 -1 -3 -5 -5 -5 3 0 8 16 0 3 3 57 3 0 0 3 0 0 0 4 0 0 0 0 1.04 1.00 228 A 6 -4 -4 -4 3 -3 -3 -2 -4 -3 -4 -3 -3 -4 -3 1 -2 -5 -4 0 76 0 0 0 7 0 0 0 0 0 0 0 0 0 0 10 0 0 0 7 1.13 1.00 229 L 0 -4 -5 -5 -4 -4 -5 -5 -4 1 3 -4 6 1 0 0 -3 -4 2 0 9 0 0 0 0 0 0 0 0 6 33 0 21 6 6 7 0 0 6 7 0.65 1.17 230 E 0 0 0 1 -5 2 4 -1 3 -5 -2 0 -4 2 0 -1 -1 -5 -3 -4 7 4 4 8 0 9 29 4 6 0 4 5 0 9 5 4 4 0 0 0 0.41 1.17 231 Y 1 1 0 -4 -4 -2 -3 -4 3 -3 -3 1 -3 2 -4 -2 -3 -1 7 -3 12 10 6 0 0 0 0 0 7 0 0 10 0 5 0 0 0 0 51 0 0.91 0.67 232 S -2 -3 -3 -3 3 -3 -3 -4 -4 1 1 -3 -2 -3 1 1 5 -4 -3 1 0 0 0 0 7 0 0 0 0 7 16 0 0 0 7 10 45 0 0 7 0.58 0.67 233 I 1 -4 -4 -4 3 -3 -4 -4 -4 4 0 -3 -1 -3 -4 1 1 -4 -3 3 11 0 0 0 7 0 0 0 0 31 7 0 0 0 0 11 10 0 0 23 0.51 0.67 234 L -3 -4 -4 -4 -3 -3 -4 -4 -4 3 2 -4 4 3 -4 -3 -2 -3 -1 2 0 0 0 0 0 0 0 0 0 17 17 0 17 17 0 0 0 0 0 17 0.40 0.56 235 V -2 -4 2 -4 -3 -3 -4 -5 -4 3 -1 -4 4 -3 -4 -3 -2 -5 -3 5 0 0 13 0 0 0 0 0 0 15 0 0 13 0 0 0 0 0 0 60 0.87 0.72 236 A 4 -3 0 -3 2 -3 -1 -3 -3 0 -1 -3 1 0 -3 1 0 -4 -3 1 44 0 6 0 5 0 5 0 0 5 5 0 5 5 0 9 5 0 0 8 0.40 0.72 237 A 5 -1 -3 -3 -3 -2 0 1 -3 -3 -4 -2 -3 -4 -3 1 1 -4 -4 -2 66 3 0 0 0 0 7 7 0 0 0 0 0 0 0 9 7 0 0 0 0.76 0.70 238 T 1 -3 1 -2 -3 -2 0 0 0 -2 -3 -2 0 -4 1 1 4 -4 -3 -2 15 0 8 0 0 0 6 8 
3 2 2 0 2 0 6 10 39 0 0 0 0.47 0.70 239 A 4 -3 -2 1 -2 -2 -2 1 -3 -3 -3 -2 -3 -4 2 1 -2 -4 -3 -2 56 0 0 9 0 0 0 10 0 0 0 0 0 0 9 9 0 0 0 0 0.54 0.56 240 S 2 -3 3 3 -3 -2 -2 2 -2 -4 -4 -2 -3 -4 -3 2 2 -5 -4 -3 17 0 17 17 0 0 0 17 0 0 0 0 0 0 0 17 17 0 0 0 0.50 0.56 241 D 1 -3 2 2 -4 2 1 -3 -2 -4 -4 -2 -3 2 -4 1 1 -3 3 -4 11 0 11 11 0 11 11 0 0 0 0 0 0 11 0 11 11 0 11 0 0.41 0.89 242 P 2 -3 -3 -3 2 -2 0 -3 -3 -3 -1 -2 1 -4 5 3 -2 -5 -4 -3 16 0 0 0 5 0 5 0 0 0 7 0 5 0 32 30 0 0 0 0 0.74 0.67 243 A 5 -3 -3 0 -2 -3 -2 -2 -3 -3 -1 -3 -2 -4 2 1 -2 -4 -4 0 64 0 0 6 0 0 0 0 0 0 6 0 0 0 11 8 0 0 0 6 0.71 0.67 244 P 2 -3 -3 -3 -3 -3 -3 0 -4 -1 -4 -3 -3 -4 6 1 2 -5 -4 -1 16 0 0 0 0 0 0 8 0 4 0 0 0 0 43 10 15 0 0 5 0.92 0.67 245 L 1 -4 -4 -4 4 -3 1 -4 -3 2 1 -4 4 3 2 -3 -3 -3 3 -2 11 0 0 0 11 0 11 0 0 11 11 0 11 11 11 0 0 0 11 0 0.48 0.89 246 Q -2 0 -1 -2 -4 7 0 -4 2 -1 -2 1 -2 -4 -2 -1 -1 -4 -3 -2 0 2 2 0 0 74 0 0 4 4 3 5 0 0 2 2 2 0 0 2 1.06 0.63 247 F -3 -4 -1 -5 -4 -3 -4 -5 -1 -2 -2 -4 1 5 -5 -3 -3 5 7 -1 0 0 4 0 0 0 0 0 0 0 0 0 4 28 0 0 0 7 52 6 1.26 0.63 248 L -3 -4 -5 -5 -3 -4 -5 -5 -4 3 4 -4 1 1 -2 -4 -3 -3 2 0 0 0 0 0 0 0 0 0 0 23 59 0 3 4 2 0 0 0 7 3 0.78 0.63 249 A 3 -2 -1 -2 -2 -1 -1 -1 -2 -1 -2 -1 -2 -3 -2 4 2 -4 -3 1 31 0 0 0 0 0 0 0 0 0 0 0 0 0 0 44 13 0 0 13 0.44 0.28 250 P 3 -3 -3 -3 -2 -2 -2 -2 -3 0 -1 -2 -1 -3 5 -1 -1 -4 -3 3 33 0 0 0 0 0 0 0 0 0 0 0 0 0 33 0 0 0 0 33 0.65 0.22 251 Y -3 -3 -3 -4 -3 -2 -3 -4 0 -1 1 -3 3 4 -4 -3 -2 1 7 -1 0 0 0 0 0 0 0 0 0 0 13 0 13 14 0 0 0 0 61 0 0.90 0.28 252 S 1 -4 -3 -3 5 -3 -3 2 -4 -2 -3 -3 -3 -4 3 2 2 -4 -4 1 14 0 0 0 14 0 0 14 0 0 0 0 0 0 14 14 14 0 0 14 0.51 0.67 253 G 1 -3 -2 -3 -3 -3 -3 6 -3 -4 -4 -3 -4 -4 -3 1 -2 -4 -4 -4 14 0 0 0 0 0 0 69 0 0 0 0 0 0 0 8 0 0 0 0 1.05 0.59 254 C 1 -4 -4 -4 8 -3 -4 -4 -4 0 -2 -4 1 0 -4 0 1 -4 -3 2 12 0 0 0 42 0 0 0 0 4 0 0 4 4 0 5 7 0 0 20 0.89 0.59 255 A 4 -3 -2 -3 2 -2 -3 -2 -3 -2 0 -2 -2 -3 -3 2 3 -4 -3 0 41 0 0 0 5 0 0 0 0 0 11 0 0 0 0 20 20 0 0 4 0.51 0.58 256 M -1 -3 -4 -4 -2 
-3 -4 -4 -3 4 2 -3 6 1 -4 -3 -2 -3 -2 1 6 0 0 0 0 0 0 0 0 38 11 0 34 6 0 0 0 0 0 6 0.61 0.37 257 G 2 -3 -2 -3 3 -3 -3 5 -3 -4 -4 -2 -3 -4 -3 1 -2 -4 -4 -3 23 0 0 0 8 0 0 60 0 0 0 0 0 0 0 8 0 0 0 0 0.88 0.37 258 E -2 -1 -1 0 -5 3 6 -3 -1 -4 -4 2 -3 -4 -2 -1 -2 -4 -3 -4 0 0 0 0 0 12 77 0 0 0 0 11 0 0 0 0 0 0 0 0 1.01 0.37 259 Y 0 -3 2 -3 -4 2 1 -4 4 -4 0 -3 -3 2 -4 1 -3 6 3 -4 10 0 10 0 0 10 10 0 10 0 10 0 0 10 0 10 0 10 10 0 0.50 1.00 260 F -3 -4 -4 -5 -4 -4 -4 -4 -3 0 0 -4 -1 6 0 -1 -3 6 3 0 0 0 0 0 0 0 0 0 0 6 6 0 0 49 6 6 0 13 7 6 0.90 0.56 261 R 2 3 -3 -3 -2 -1 -2 -3 -2 -1 2 -1 5 -2 -3 -2 -2 -3 -2 -1 25 25 0 0 0 0 0 0 0 0 25 0 25 0 0 0 0 0 0 0 0.41 0.33 262 D -2 -1 0 5 0 -3 0 -4 1 -4 -4 1 0 0 -4 -2 -3 3 5 -2 3 3 3 34 2 0 5 0 3 0 0 8 2 3 0 3 0 3 25 2 0.74 1.04 263 N -1 1 4 0 -5 3 -1 -1 1 -2 -3 2 -3 -5 -4 1 0 -5 -4 -4 6 8 24 4 0 15 3 3 3 3 3 14 0 0 0 12 4 0 0 0 0.47 1.04 264 G -1 -1 -3 -3 -5 -3 -1 6 0 -6 -6 -1 -5 -5 0 -1 -4 -5 -5 -5 3 4 0 0 0 0 5 73 3 0 0 3 0 0 5 3 0 0 0 0 1.30 1.04 265 M 0 1 1 1 -5 2 1 0 -3 -4 -1 1 3 -4 -4 0 1 -5 2 -4 8 8 8 8 0 8 8 8 0 0 8 8 8 0 0 8 8 0 8 0 0.23 1.33 266 H 1 2 2 2 -4 -2 -2 -3 4 -5 -5 2 -4 -5 3 1 -2 -5 -3 -4 13 13 13 13 0 0 0 0 13 0 0 13 0 0 13 13 0 0 0 0 0.52 0.78 267 A 1 2 -2 2 5 -3 -3 1 -3 -3 -3 -2 -3 -4 -4 1 2 -5 -4 1 13 13 0 13 12 0 0 13 0 0 0 0 0 0 0 13 13 0 0 12 0.39 0.78 268 L -2 -4 -4 -4 5 -3 -4 -4 -4 2 1 -4 4 3 -4 1 -2 -3 -2 2 0 0 0 0 14 0 0 0 0 14 14 0 14 14 0 14 0 0 0 14 0.55 0.67 269 I -1 -4 -4 -4 1 -4 -4 -5 -4 5 1 -4 1 -2 -4 -3 2 -4 -1 2 3 0 0 0 3 0 0 0 0 53 12 0 3 0 0 0 16 0 3 9 0.69 0.62 270 I -1 -4 -4 -5 3 -4 -4 -3 -4 5 -1 -4 2 1 -4 -2 -2 -4 -2 4 3 0 0 0 6 0 0 2 0 38 0 0 5 6 0 2 0 0 0 36 0.66 0.62 271 Y -3 -2 -2 -3 -3 -2 -2 -4 6 0 0 -3 -2 2 -4 -2 0 -1 5 -2 0 0 0 0 0 0 0 0 32 7 7 0 0 9 0 0 7 0 30 0 0.66 0.39 272 D -3 -2 0 6 -5 4 1 -3 -2 -4 -4 -1 -3 -5 -3 -1 -2 -5 -4 -4 0 0 0 68 0 32 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.12 0.39 273 D -1 -2 0 5 -4 -1 0 4 -2 -4 -4 -1 -3 -4 -2 -1 -2 -4 -4 -4 0 0 0 50 0 0 0 50 0 0 0 0 0 0 0 0 
0 0 0 0 0.79 0.11 274 L -2 3 -3 -4 -3 -2 -3 -4 -3 0 4 -2 0 -2 2 -3 -2 -3 -3 1 0 23 0 0 0 0 0 0 0 0 52 0 0 0 13 0 0 0 0 13 0.52 0.39 275 S 1 -2 -1 -2 -2 -2 -2 -2 -2 -3 -3 -2 -3 -4 4 4 2 -4 -3 -2 10 0 0 0 0 0 0 0 0 0 0 0 0 0 23 46 12 0 0 0 0.58 0.39 276 K -2 3 3 -1 -4 3 -1 -2 -1 -4 -4 3 -3 -4 -3 2 -1 -4 -3 -4 0 20 20 0 0 20 0 0 0 0 0 20 0 0 0 20 0 0 0 0 0.60 0.44 277 Q -3 0 -1 -2 -5 6 0 -3 6 -5 -4 3 -2 -4 -3 -2 -2 -4 -2 -4 0 0 0 0 0 54 0 0 22 0 0 25 0 0 0 0 0 0 0 0 1.12 0.63 278 A 5 -3 -2 -3 -2 -2 -2 -2 5 -3 -3 -2 -3 -4 -3 2 -2 -4 -2 -2 67 0 0 0 0 0 0 0 19 0 0 0 0 0 0 14 0 0 0 0 0.90 0.63 279 V -1 -1 -3 -1 1 0 -1 2 -4 0 -2 -3 0 -3 -4 -1 -1 3 -1 4 4 3 0 3 3 5 3 19 0 4 0 0 2 0 0 3 3 5 3 41 0.37 0.63 280 A 5 -3 -3 -4 6 0 -3 -1 -3 -3 -3 -3 -3 -4 -3 0 0 -4 -4 -1 59 0 0 0 20 4 0 4 0 0 0 0 0 0 0 6 4 0 0 4 0.82 0.64 281 Y -3 -3 -3 -5 -4 -3 -4 -5 4 1 2 -3 -1 1 -4 -3 -3 0 7 -2 0 0 0 0 0 0 0 0 11 8 21 0 0 0 0 0 0 0 59 0 1.11 0.58 282 R 2 4 -2 -2 -3 -1 -1 -2 -2 -3 -3 0 -2 -4 5 -1 -1 -4 -3 -2 33 33 0 0 0 0 0 0 0 0 0 0 0 0 33 0 0 0 0 0 0.74 0.22 283 Q 0 -2 -2 -2 -3 3 1 -3 2 3 -2 0 -2 -3 -3 2 1 -4 -3 -1 7 0 0 0 0 20 9 0 6 23 0 6 0 0 0 23 7 0 0 0 0.34 0.50 284 M -2 -4 -4 -5 4 -3 -4 -5 -4 4 2 -4 5 -2 -4 -3 1 -4 -3 2 0 0 0 0 11 0 0 0 0 28 18 0 25 0 0 0 8 0 0 9 0.67 0.56 285 S 1 -3 -2 -3 3 -2 -2 -2 -2 -2 -3 -2 -2 -3 -3 4 3 -3 2 0 9 0 0 0 8 0 0 0 0 0 0 0 0 0 0 49 17 0 8 9 0.58 0.56 286 L -3 -4 -5 -5 -3 -4 -4 -5 -4 2 5 -4 1 3 -4 -4 -3 -3 -2 0 0 0 0 0 0 0 0 0 0 10 78 0 0 11 0 0 0 0 0 0 0.92 0.52 287 L -3 -3 -3 1 -3 -3 -3 -4 -4 1 4 -3 0 1 -4 -2 1 -3 -2 0 0 0 0 11 0 0 0 0 0 3 64 0 0 6 0 2 10 0 0 3 0.59 0.52 288 L -2 -4 -4 -4 -3 -3 -4 -4 -4 0 4 -4 1 3 2 -3 0 -3 -1 0 3 0 0 0 0 0 0 0 0 3 52 0 2 18 11 0 8 0 0 3 0.59 0.52 289 R -3 6 1 -2 -5 0 -1 -3 2 -4 -4 3 -3 -4 -3 -2 -2 -4 -3 -4 0 66 8 0 0 0 0 0 5 0 0 22 0 0 0 0 0 0 0 0 1.09 0.52 290 R -3 7 -2 -2 -5 2 1 -4 -2 -4 -4 3 -3 -4 -3 -2 -2 -4 -3 -4 0 72 0 0 0 6 10 0 0 0 0 12 0 0 0 0 0 0 0 0 1.12 0.49 291 P -2 -3 -3 -3 -4 -2 -2 -3 -3 -4 -4 -2 -4 -5 8 
-2 -2 -5 -4 -4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 2.48 0.40 292 P -1 -3 -3 -3 -4 -2 -2 -3 -3 -4 -3 -2 -3 -5 8 -1 0 -5 -4 -3 2 0 0 0 0 0 0 0 0 0 2 0 0 0 89 2 5 0 0 0 2.07 0.38 293 G 0 -3 -2 -2 -4 -3 -3 6 -3 -5 -5 -3 -4 -4 -3 -1 -3 2 -4 -4 2 0 0 0 0 0 0 91 0 0 0 0 0 0 0 2 0 4 0 0 1.39 0.38 294 R -2 7 -1 -3 -4 0 -1 -3 -1 -4 -3 2 -2 -4 -3 -2 1 -4 -3 -3 0 88 0 0 0 0 0 0 0 0 0 3 0 0 0 0 9 0 0 0 1.17 0.38 295 E -2 1 -1 0 3 1 6 -3 -1 -4 -4 0 -3 -4 -2 -1 -2 -4 -3 -3 0 6 0 0 8 0 84 0 0 0 0 2 0 0 0 0 0 0 0 0 1.00 0.36 296 A 5 -3 -2 -3 -2 -2 -2 2 -3 -3 -3 -2 -2 -4 2 0 -1 -4 -3 -2 72 0 0 0 0 0 0 16 0 0 0 0 0 0 9 0 2 0 0 0 0.70 0.36 297 F -3 -3 -3 -4 -3 -3 -3 -4 0 1 -1 -3 -1 4 -4 -2 -2 1 7 -1 0 0 0 0 0 0 0 0 0 11 0 0 0 20 0 2 0 0 68 0 1.06 0.30 298 P -2 -3 -3 -2 -4 -2 -2 -3 -3 -3 -3 -2 2 -4 8 -1 -2 -4 -4 -3 0 0 0 0 0 0 0 0 0 0 0 0 11 0 86 3 0 0 0 0 1.90 0.30 299 G -1 -3 -1 -2 -4 -3 -3 7 -3 -5 -5 -2 -4 -4 -3 -1 -3 -3 -4 -4 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1.47 0.30 300 D -3 -2 1 7 -5 -1 1 -2 -2 -4 -5 -1 -4 -5 -2 -1 -2 -5 -4 -4 0 0 0 95 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 1.36 0.30 301 V -1 -3 -4 -4 -2 -3 -3 -4 -4 3 0 -3 0 -2 -3 -1 -1 -4 -2 5 0 0 0 0 0 0 0 0 0 22 0 0 0 0 0 5 0 0 0 73 0.67 0.30 302 F -2 -3 -3 -3 -3 -3 -3 -3 -2 -1 0 -3 -1 7 -3 -3 -2 0 2 -1 1 1 1 1 0 1 1 1 0 1 4 1 0 85 1 1 1 0 0 1 0.99 0.24 303 Y -3 -3 -3 -4 -3 -2 -3 -4 2 -2 -2 -3 -2 3 -4 -3 -3 1 8 -2 0 0 0 0 0 0 0 0 2 0 0 0 0 6 0 0 0 0 93 0 1.40 0.30 304 L -2 -3 -4 -4 -1 -3 -4 -4 -4 2 5 -3 1 -1 -3 -2 -2 -3 -2 1 2 0 0 0 1 0 0 0 0 8 78 0 0 0 1 2 0 0 0 8 0.65 0.30 305 H -3 -1 0 -2 -4 0 -1 -3 9 -4 -4 -2 -2 -2 -3 -2 -3 -3 1 -4 0 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 1.57 0.28 306 S 1 -2 0 -1 -2 -1 -1 -1 -2 -3 -3 -1 -2 -3 -2 5 1 -4 -3 -2 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 92 0 0 0 0 0.78 0.28 307 R -2 7 -1 -2 -4 0 -1 -2 -1 -4 -3 2 -2 -4 -3 -1 -2 -4 -3 -3 0 92 0 0 0 1 0 2 0 0 0 1 0 0 0 4 0 0 0 0 1.10 0.28 308 L -2 -3 -4 -4 -2 -3 -4 -5 -4 1 5 -3 1 0 -1 -2 -2 -3 -2 0 0 0 0 0 0 0 0 0 0 0 96 0 0 0 2 2 0 0 0 0 0.77 
0.28 309 L -2 -3 -3 -4 -2 -3 -3 -4 -3 1 5 -3 1 -1 -3 1 -1 -3 -2 0 0 0 0 0 0 0 0 0 0 0 84 0 0 0 0 16 0 0 0 0 0.61 0.27 310 E -2 -1 -1 1 -5 1 6 1 -1 -4 -4 0 -3 -4 -2 -1 -2 -4 -3 -4 0 0 0 0 0 0 88 11 0 0 0 1 0 0 0 0 0 0 0 0 0.97 0.29 311 R -2 7 -1 -3 -4 0 -1 -3 -1 -4 -3 1 -2 -4 -3 -2 -2 -4 -3 -4 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.26 0.29 312 A 5 -2 -2 -2 0 -1 1 -1 -2 -3 -3 -1 -2 -3 -1 2 -1 -4 -3 -1 72 0 0 0 1 0 11 1 0 0 0 0 0 0 2 13 0 0 0 1 0.57 0.29 313 A 5 1 -2 -3 1 -1 -2 -1 -2 -2 -2 -1 -2 -3 -2 1 -1 -4 -3 0 74 11 0 0 3 0 0 1 0 0 1 0 0 0 0 6 1 0 0 4 0.55 0.29 314 K -2 3 -1 1 -4 1 0 -3 0 -4 -3 5 -2 -4 -2 -1 -2 -4 -3 -3 0 13 0 11 0 4 0 0 2 0 0 69 1 0 0 0 0 0 0 0 0.75 0.31 315 R -2 1 -4 -4 -2 -2 -3 -4 -2 1 4 -2 3 0 -4 -3 -2 -2 2 2 0 11 0 0 0 0 0 0 0 3 49 0 12 0 0 0 0 0 12 13 0.45 0.31 316 S 0 -2 4 2 -2 -1 -1 -1 -1 -2 -3 -1 -3 -3 -2 4 0 -4 -3 -3 2 1 27 11 1 0 1 1 1 2 1 1 0 0 0 49 2 0 0 0 0.54 0.33 317 D 0 -1 0 5 -4 0 2 -2 3 -4 -4 0 -3 -4 0 0 0 -4 -2 -3 8 1 3 41 0 3 13 1 10 0 0 6 0 0 4 2 6 0 0 0 0.53 0.36 318 Q 0 2 1 2 -4 3 4 -2 -1 -3 -3 1 -3 -4 -2 0 -1 -4 -3 -2 6 12 6 13 0 13 30 1 1 0 1 9 0 0 0 4 1 0 0 3 0.44 0.40 319 T -2 -1 3 -2 -2 -1 -2 0 4 -2 2 1 0 0 -3 -1 1 -3 0 -2 1 1 20 1 1 1 0 7 12 1 22 12 2 3 0 1 13 0 2 1 0.25 0.47 320 G 0 -3 -2 -3 -4 -2 -3 6 -3 -5 -5 1 -4 -4 -3 -1 -3 -4 -4 -3 4 0 0 0 0 1 0 84 0 0 0 9 0 0 0 1 0 0 0 1 1.27 0.47 321 A 2 -3 -1 -2 -2 -2 -1 5 -3 -4 -4 -2 -3 -4 -3 0 1 -4 -4 -3 19 0 1 0 0 1 3 62 0 0 0 1 0 0 0 4 10 0 0 0 0.89 0.47 322 G -1 -4 -2 -3 -4 -3 -3 7 -3 -5 -5 -3 -4 -5 -3 -1 -3 -4 -4 -5 0 0 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 1.65 0.46 323 S 0 -2 0 -1 -2 -1 -1 -1 -2 -4 -4 -1 -3 -4 -2 6 0 -4 -3 -3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 1.04 0.44 324 L -3 -3 -4 -5 -2 -3 -4 -5 -4 3 5 -4 3 -1 -4 -3 -1 -3 -2 0 0 0 0 0 0 0 0 0 0 21 64 0 9 0 0 1 3 0 0 1 0.74 0.44 325 T -1 -2 -1 -2 -2 -2 -2 -3 -3 -2 -2 -2 -2 -3 -2 0 7 -4 -3 -1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 1.16 0.44 326 A 6 -3 -3 -3 2 -2 -2 0 -3 -3 -3 -2 -2 -4 -2 0 -1 -4 -3 -1 
93 0 0 0 4 0 0 2 0 0 0 0 0 0 0 1 0 0 0 0 0.91 0.43 327 L -3 -4 -5 -5 -3 -4 -4 -5 -4 1 5 -4 1 1 -4 -4 -2 -3 -2 0 0 0 0 0 0 0 0 0 0 4 91 0 0 6 0 0 0 0 0 0 0.94 0.43 328 P -2 -3 -3 -3 -4 -2 -2 -3 -3 -4 -4 -2 -4 -5 8 -2 -2 -5 -4 -4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 0 0 2.49 0.42 329 V -2 -4 -4 -4 -2 -3 -4 -4 -4 5 0 -3 1 0 -4 0 -1 -4 -2 3 0 0 0 0 0 0 0 0 0 58 0 0 4 3 0 8 0 0 0 27 0.70 0.43 330 I -1 -4 -4 -4 -2 -3 -4 -5 2 6 0 -4 0 -1 -4 -3 -2 -4 -2 2 5 0 0 0 0 0 0 0 8 77 0 0 1 0 0 0 0 0 0 9 0.81 0.43 331 E -2 1 -1 0 -5 1 6 -3 -1 -5 -4 0 -3 -5 -2 -1 -2 -4 -3 -4 0 9 0 0 0 2 88 0 0 0 0 1 0 0 0 0 0 0 0 0 1.16 0.42 332 T -1 -2 2 -2 -2 -2 -2 -3 -3 -1 -2 -2 -2 -3 -2 0 6 -4 -3 -1 0 0 10 0 0 0 0 0 0 2 0 0 0 0 0 0 88 0 0 0 0.96 0.41 333 Q -2 0 -1 -1 -4 7 1 -3 -1 -3 0 0 -1 -4 -3 -1 -2 -3 -3 -3 0 0 0 2 0 75 6 0 0 0 14 3 1 0 0 0 0 0 0 0 0.92 0.42 334 A 5 -3 -2 -2 -2 -1 -1 1 -3 -2 -3 -2 -2 -3 -2 1 -1 -4 -3 1 69 0 1 1 0 1 4 9 0 0 0 0 0 0 0 6 0 0 0 9 0.61 0.42 335 G -1 -3 3 0 -4 -1 -3 6 -3 -5 -5 -2 -4 -4 -3 -1 -2 -4 -4 -4 0 0 15 4 0 2 0 77 0 0 0 0 0 0 0 1 0 0 0 0 1.18 0.43 336 D -3 -2 2 7 -5 -1 1 -2 -2 -4 -5 1 -4 -5 -3 -1 -2 -5 -4 -4 0 0 7 82 0 0 2 0 0 0 0 9 0 0 0 0 0 0 0 0 1.26 0.42 337 V -1 -4 -4 -5 -2 -4 -4 -5 -4 3 1 -4 0 -2 -4 -3 -1 -4 -2 6 0 0 0 0 0 0 0 0 0 8 8 0 0 0 0 0 0 0 0 84 0.87 0.42 338 S 0 -2 -1 -2 -1 -1 -1 -2 -2 -3 -4 1 -3 -4 -2 5 1 -4 -3 -3 3 0 0 0 0 0 0 0 0 0 0 9 0 0 0 81 7 0 0 0 0.80 0.42 339 A 6 -3 -3 -3 -2 -2 -2 0 -3 -3 -3 -2 -2 -3 -2 0 -1 -4 -3 -1 97 0 0 0 0 0 0 2 0 0 0 0 0 0 0 2 0 0 0 0 0.94 0.42 340 Y -3 -3 -4 -4 -4 -3 -3 -4 0 -2 -2 -3 -2 5 -4 -3 -3 1 8 -2 0 0 0 0 0 0 0 0 0 0 0 0 0 21 0 0 0 0 79 0 1.47 0.41 341 I -2 -4 -4 -4 -2 -4 -4 -5 -5 6 0 -4 0 -1 -4 -3 -1 -4 -2 3 0 0 0 0 0 0 0 0 0 88 0 0 0 0 0 0 1 0 0 11 0.97 0.41 342 P -1 -3 -3 -3 -4 -2 -2 -3 -3 -4 -4 -2 -4 -5 8 -1 -2 -5 -4 -3 2 0 0 0 0 0 0 0 0 0 0 0 0 0 93 3 0 0 0 1 2.23 0.41 343 T -1 -2 -1 -2 -2 -2 -2 -3 -3 -2 -2 -2 -2 -3 -2 0 7 -4 -3 -1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 0 0 0 1.10 0.40 344 N -3 -1 8 
0 -4 -1 -1 -2 0 -4 -5 -1 -3 -4 -3 0 -1 -5 -3 -4 0 0 99 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1.39 0.39 345 V -1 -4 -4 -4 -2 -3 -4 -4 -4 2 1 -3 0 -2 -4 -3 -1 -4 -2 6 1 0 0 0 0 0 0 0 0 2 8 0 1 0 0 0 0 0 0 89 0.85 0.39 346 I -2 -4 -5 -4 -2 -4 -4 -5 -5 7 1 -4 0 -1 -4 -4 -2 -4 -2 2 0 0 0 0 0 0 0 0 0 98 2 0 0 0 0 0 0 0 0 0 1.03 0.39 347 S 0 -2 0 -1 -2 0 -1 -1 -2 -4 -4 -1 -3 -4 0 6 0 -4 -3 -3 0 0 0 0 0 2 0 0 0 0 0 0 0 0 4 94 0 0 0 0 0.90 0.39 348 I -2 -4 -4 -4 -2 -4 -4 -5 -4 6 1 -4 0 -1 -4 -3 -1 -4 -2 2 0 0 1 0 0 0 0 0 0 96 2 0 0 0 0 0 1 0 0 0 0.99 0.39 349 T -1 -2 -1 -2 -2 -2 -2 -3 -3 -2 -1 -2 -2 -3 -2 0 6 -4 -3 -1 2 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 93 0 0 0 0.96 0.38 350 D -3 -3 0 7 -5 -1 0 -1 -2 -4 -4 -2 -1 -4 -3 -1 -2 -5 -4 -4 0 0 0 93 0 0 0 3 0 0 0 0 4 0 0 0 0 0 0 0 1.36 0.38 351 G 0 -3 -1 -2 -4 -3 -3 7 -3 -5 -5 -3 -4 -4 -3 -1 -3 -4 -4 -4 4 0 0 0 0 0 1 95 0 0 0 0 0 0 0 0 0 0 0 0 1.44 0.38 352 Q -2 0 -1 -1 -4 7 1 -3 3 -4 -3 1 -2 -4 -2 -1 -2 -3 -2 -3 0 0 0 0 0 85 0 0 8 0 0 4 0 0 0 0 0 0 0 0 1.13 0.40 353 I -2 0 -4 -4 -1 -3 -4 -5 -4 6 1 -3 0 -1 -4 -3 -1 -3 0 1 0 8 0 0 1 0 0 0 0 79 5 0 0 0 0 0 1 0 4 1 0.74 0.40 354 C -3 -3 -4 -4 3 -4 -4 -4 -2 -1 -1 -4 -1 7 -4 -3 -3 0 5 -1 0 2 0 0 7 0 0 0 0 0 0 0 0 67 0 0 0 0 19 4 1.09 0.40 355 L -2 -3 -4 -4 -2 -3 -4 -4 -4 0 5 -3 1 2 -2 -2 -1 -2 -2 0 0 0 0 0 0 0 0 0 0 0 81 0 2 8 1 3 3 0 0 1 0.73 0.40 356 E -1 -1 1 1 -4 1 5 -2 -1 -4 -4 0 -2 -4 -2 2 -1 -4 -3 -3 0 0 8 6 0 5 58 1 0 0 1 4 0 0 0 15 1 0 0 1 0.72 0.40 357 T 1 1 -1 -2 -2 -1 -1 -2 -2 -2 -3 -1 -2 -3 0 3 5 -4 -3 -2 11 8 0 1 0 0 1 1 0 0 0 2 0 0 4 25 45 0 0 0 0.52 0.42 358 E 0 -2 2 4 -4 0 4 -1 0 -4 -4 -1 -3 -4 -2 0 -1 -4 -3 -3 11 0 12 34 0 1 31 2 1 0 0 1 0 0 0 5 1 0 0 0 0.65 0.42 359 L -2 -3 -4 -5 -2 -3 -4 -5 -4 1 5 -4 1 -1 -4 -3 -2 -3 -2 1 0 0 0 0 0 0 0 0 0 1 87 0 1 0 0 0 0 0 1 8 0.87 0.43 360 F -2 -4 -4 -5 -3 -4 -4 -4 -3 -1 1 -4 -1 8 -4 -3 -3 0 2 -2 2 0 0 0 0 0 0 0 0 0 13 0 0 82 0 0 0 0 1 0 1.23 0.43 361 Y -1 -1 5 -1 -3 -2 -1 -2 0 -3 -2 -2 -2 1 -3 1 -1 -2 4 -3 6 3 45 0 0 0 2 0 1 0 4 0 0 6 0 
9 2 0 22 0 0.59 0.43 362 R 2 2 -1 -2 -3 3 -1 -1 -2 -3 -3 1 -2 -4 -2 3 -1 -4 -3 -3 20 14 0 0 0 14 0 3 0 0 2 9 0 0 0 35 1 0 0 0 0.43 0.43 363 G -1 -3 1 -2 -4 -2 -3 6 -3 -5 -5 -2 -4 -4 -3 -1 -3 -4 -4 -4 0 1 6 0 0 1 0 89 0 0 0 0 0 0 0 0 0 0 0 1 1.36 0.44 364 I -2 -3 -3 -3 -3 2 -2 -4 -3 5 1 -3 0 -1 -3 -3 -1 -3 -2 3 0 0 1 0 0 16 0 0 0 51 5 0 0 2 0 0 1 0 0 22 0.58 0.44 365 R -2 7 -2 -3 -4 0 -1 -3 -1 -3 -2 2 -2 -4 -3 -2 -2 -4 -3 -3 0 86 0 0 0 0 0 0 0 1 3 7 0 0 0 0 0 0 0 0 1.21 0.44 366 P -2 -3 -3 -3 -4 -2 -2 -3 -3 -4 -4 -2 -4 -5 8 -2 -2 -5 -4 -3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 96 0 0 0 0 0 2.34 0.43 367 A 6 -2 -3 -3 -2 -2 -2 -1 -3 -2 -3 -2 -2 -3 -2 0 -1 -4 -3 -1 95 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0.93 0.43 368 I -2 -4 -4 -4 -2 -3 -4 -4 -4 5 0 -3 1 -1 -4 -3 -2 -4 -2 4 0 0 0 0 0 0 0 0 0 56 2 0 3 0 0 0 0 0 0 36 0.79 0.43 369 N -3 -2 7 3 -4 -1 -1 -2 0 -5 -5 -1 -4 -4 -3 0 -1 -5 -4 -4 0 0 77 20 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 1.23 0.45 370 V 1 -4 -4 -4 -2 -3 -3 -4 -4 1 -1 -3 -1 -3 3 -2 -1 -4 -3 5 11 0 0 0 0 0 0 0 0 4 0 0 0 0 18 1 1 0 0 65 0.69 0.44 371 G -1 -4 -2 -3 -4 -3 -3 7 -3 -5 -5 -3 -4 -4 -3 -1 -3 -4 -4 -4 1 0 0 0 0 0 0 98 0 0 0 0 0 0 0 1 0 0 0 0 1.59 0.44 372 L -2 -3 -4 -4 -3 -3 -4 -5 -4 4 4 -2 0 -1 -4 -2 -2 -3 -2 1 0 1 0 0 0 1 0 0 0 32 55 2 0 1 0 2 1 0 0 5 0.67 0.45 373 S 0 -2 -1 -1 -2 -1 -1 -1 -2 -4 -4 -1 -3 -4 -1 6 0 -4 -3 -3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 99 0 0 0 0 1.02 0.44 374 V -1 -4 -4 -4 -2 -3 -4 -4 -4 2 0 -4 0 -2 -4 -3 -1 -4 -2 6 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 98 0.99 0.44 375 S 0 -2 -1 -1 -2 -1 -1 -1 -2 -4 -4 -1 -3 -4 -2 6 1 -4 -3 -3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 96 4 0 0 0 1.00 0.44 376 R -3 7 -2 -3 -5 0 -1 -4 -1 -4 -3 1 -3 -4 -3 -2 -2 -4 -3 -4 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.46 0.43 377 V -1 -4 -4 -5 -2 -4 -4 -5 -4 2 0 -4 0 -2 -4 -3 -1 -4 -2 6 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 96 0.99 0.43 378 G -1 -4 -2 -3 -4 -3 -3 7 -3 -5 -5 -3 -4 -4 -3 -1 -3 -4 -4 -4 0 0 0 0 0 0 0 98 0 0 0 0 0 0 0 2 0 0 0 0 1.58 0.43 379 S 0 -3 -1 -2 -3 -2 -2 5 -3 
-4 -4 -2 -3 -4 -3 4 -1 -4 -4 -4 0 0 0 0 0 0 0 49 0 0 0 0 0 0 0 51 0 0 0 0 0.91 0.42 380 A 4 -2 0 0 -2 -1 -1 -1 -2 -3 -3 0 -2 -4 -2 3 -1 -4 -3 -2 57 0 3 4 0 0 0 0 0 0 0 7 0 0 0 28 0 0 0 0 0.60 0.42 381 A 6 -3 -3 -3 -2 -2 -2 -1 -3 -3 -3 -2 -2 -3 -2 0 -1 -4 -3 -1 99 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0.96 0.42 382 Q -2 0 -1 -1 -4 8 1 -3 -1 -4 -3 0 -2 -5 -2 -1 -2 -3 -3 -3 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.36 0.42 383 L -2 -2 -2 -3 -1 -2 -3 -4 1 4 1 -3 0 -1 -1 -1 3 1 -1 1 1 2 1 0 1 1 0 0 4 33 16 0 1 1 3 2 29 2 2 4 0.36 0.43 384 K -1 1 -1 0 -4 0 0 -2 -2 -4 -4 6 -3 -4 1 -1 -1 -4 -2 -3 4 1 1 3 0 1 0 2 0 0 0 77 0 0 7 1 1 0 1 1 0.83 0.45 385 A 5 -3 -3 -3 0 -2 -2 0 -3 1 -2 -2 -2 -3 -2 1 -1 -4 -3 0 72 0 0 0 1 1 0 3 0 12 1 0 0 0 0 5 0 0 0 4 0.62 0.45 386 M -2 -3 -4 -4 -3 -2 -4 -4 -3 2 1 -3 8 -1 -4 -3 -1 -3 0 0 0 0 0 0 0 0 0 0 0 15 6 0 72 0 0 0 1 0 3 3 0.99 0.45 387 K -2 3 -1 -2 -4 0 0 -3 -2 -4 -4 6 -3 -4 -2 -1 -2 -4 -3 -4 0 12 1 0 0 0 0 0 0 0 0 87 0 0 0 1 0 0 0 0 1.09 0.45 388 Q -1 0 -1 -1 -4 5 1 -2 -2 -4 -2 5 -1 -4 -2 -1 -2 -4 -3 -3 3 0 1 0 0 33 5 2 0 0 3 48 1 0 0 2 0 0 0 1 0.71 0.46 389 V -2 -4 -4 -5 -2 -3 -4 -5 -4 2 2 -4 0 -2 -4 -3 -1 -4 -2 5 0 0 0 0 0 1 0 0 0 5 18 0 0 0 0 0 1 0 0 75 0.81 0.46 390 C 4 -1 -2 -3 4 -2 -2 0 -3 -3 -3 -2 -2 -4 -2 3 0 -4 -3 -2 48 3 0 0 10 0 0 6 0 0 0 0 0 0 0 30 2 0 0 0 0.60 0.46 391 G 0 -3 -2 -3 -4 -3 -3 7 -3 -5 -5 -1 -4 -4 -3 -1 -3 -4 -4 -4 4 0 0 0 0 0 0 89 0 0 0 3 0 0 0 3 0 0 0 0 1.39 0.46 392 S -1 1 0 -1 -3 -1 -2 1 -2 -4 -4 1 -2 -4 2 3 2 -4 -3 -3 1 8 4 2 0 1 0 13 0 0 0 10 0 0 9 35 17 0 0 0 0.43 0.46 393 S -2 -3 -4 -5 -3 -3 -4 -5 -4 3 4 -4 3 -1 -4 -1 -2 -3 0 1 0 0 0 0 0 0 0 0 0 16 58 0 9 0 0 5 0 0 2 9 0.65 0.46 394 K -2 5 -1 -2 -4 0 -1 -3 -2 -4 -4 6 -3 -4 -3 -2 -1 -4 -3 -4 0 34 0 0 0 0 0 0 0 0 0 63 0 0 0 0 3 0 0 0 1.01 0.47 395 L -2 -3 -4 -5 -3 -3 -4 -2 -4 1 5 -4 1 -1 -1 -3 0 -3 -2 0 1 0 0 0 0 0 0 3 0 6 78 0 1 0 3 0 7 0 1 1 0.72 0.47 396 E 0 -2 -1 3 -4 0 5 -3 -2 -1 -1 -1 -2 -3 -3 -1 -1 -4 -3 -3 11 0 1 20 0 2 51 0 0 4 5 0 1 1 0 2 2 0 
0 0 0.63 0.47 397 L -3 -3 -4 -5 -3 -2 -4 -5 -2 1 5 -4 1 0 -4 -3 -2 -2 3 -1 0 0 0 0 0 2 0 0 1 2 79 0 1 0 0 1 0 0 15 0 0.82 0.48 398 A 6 -3 -2 -3 -2 -2 -2 -1 -3 -3 -3 -2 -2 -4 -2 2 -1 -4 -3 -2 84 0 0 0 0 0 0 0 0 0 0 0 0 0 0 15 1 0 0 0 0.89 0.47 399 Q -1 0 -1 -2 -4 7 1 -3 -1 -4 -3 0 -1 -4 -3 0 -2 -3 -3 -3 3 0 1 0 0 89 2 0 0 0 1 0 1 0 0 4 0 0 0 0 1.23 0.47 400 Y -3 -3 -4 -5 -4 -3 -4 -4 0 -2 -2 -3 -2 5 -4 -3 -3 1 8 -2 0 0 0 0 0 0 0 0 0 0 0 0 0 24 0 1 0 0 75 0 1.50 0.47 401 R -1 6 -1 -1 -4 0 0 -3 -1 -3 -2 1 -2 -4 -3 -1 -2 -2 -3 -2 5 73 1 3 0 1 5 0 0 0 4 1 0 0 0 1 0 0 0 3 0.92 0.48 402 E -2 -1 -1 1 -5 1 7 -3 -1 -5 -4 0 -3 -5 -2 -1 -2 -4 -3 -4 1 0 0 2 0 0 94 0 0 0 0 1 0 0 0 1 1 0 0 0 1.29 0.48 403 V -2 -3 -4 -5 -3 -3 -4 -5 -4 1 5 -3 4 -1 -4 -3 -2 -3 -1 2 0 0 0 0 0 0 0 0 0 1 62 0 16 0 0 0 0 0 2 17 0.76 0.49 404 A 4 -2 -2 -1 -3 2 4 -2 -2 -3 -2 0 -2 -4 -2 0 -2 -4 -3 -2 47 0 0 2 0 10 30 0 0 1 2 5 0 0 0 2 0 0 0 1 0.56 0.49 405 A 5 -2 -2 -2 -2 -2 -1 -1 -3 -3 -3 -2 1 -4 -2 2 0 -4 -3 -1 71 2 0 0 0 0 3 0 0 0 0 0 4 0 0 16 3 0 0 1 0.70 0.49 406 F -3 -4 -4 -3 -4 -4 -4 -4 -3 -1 0 -4 -1 8 -5 -2 -3 0 2 -2 0 0 0 2 0 0 0 0 0 0 4 0 0 90 0 3 0 0 0 1 1.41 0.49 407 A 5 -3 -2 -2 -2 -2 -1 -1 -3 -3 -2 -2 -2 -4 -2 3 0 -4 -3 -2 60 0 0 0 0 0 2 0 0 0 4 0 0 0 0 31 3 0 0 0 0.69 0.49 408 Q -2 1 -1 0 -4 7 1 -3 -1 -4 -3 1 -1 -5 -3 -1 -2 -4 -3 -3 2 6 0 3 0 79 1 0 0 0 0 7 1 0 0 0 1 0 0 2 1.10 0.50 409 F -3 -4 -4 -5 -4 -4 -5 -3 -3 -1 0 -4 -1 8 -5 -2 -3 0 2 -2 0 0 0 0 0 0 0 2 0 0 3 0 0 92 0 3 0 0 0 0 1.47 0.49 410 G 3 -3 -2 -2 -3 -3 -3 6 -3 -4 -4 -3 -3 -4 -3 0 -2 -4 -4 -3 26 0 0 1 0 0 0 67 0 0 0 0 0 0 0 6 0 0 0 0 1.07 0.49 411 S 0 -2 1 -2 -2 -1 -2 0 -2 -4 -4 -2 -3 -4 -2 6 1 -4 -3 -3 4 0 4 0 0 1 0 5 0 0 0 0 0 0 0 83 2 0 0 0 0.88 0.49 412 D -2 -2 0 7 -5 0 1 -3 -2 -4 -3 -2 -4 -2 -3 -1 -2 -5 -4 -4 3 1 1 82 0 3 4 0 0 0 2 1 0 2 0 1 0 0 0 0 1.25 0.50 413 L -2 -4 -5 -5 -3 -3 -4 -3 -4 1 5 -4 1 -1 -2 -4 -1 -3 -2 0 1 0 0 0 0 0 0 2 0 3 88 0 1 0 2 0 2 0 0 2 0.90 0.50 414 D -3 -3 0 7 -5 0 1 -1 -2 -5 -5 -2 -4 -5 -3 -2 -2 -6 
-4 -5 0 0 0 93 0 2 1 3 0 0 0 0 0 0 0 0 0 0 0 0 1.53 0.50 415 A 4 -1 -2 2 -3 1 2 -2 -2 -3 -3 2 -3 -4 0 0 -1 -4 -3 -2 42 0 0 13 0 8 15 0 0 0 0 13 0 0 4 1 2 0 0 1 0.46 0.50 416 A 4 -2 -1 1 -2 -2 0 -1 -1 -1 -3 -2 -2 -3 -2 2 0 -4 -3 0 47 1 3 7 0 0 7 3 1 5 0 0 0 0 0 16 3 0 0 5 0.40 0.51 417 T -1 -3 -1 -2 -2 -2 -2 -3 -3 -2 -2 -2 -2 -3 -1 2 6 -4 -3 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 2 10 81 0 0 4 0.96 0.51 418 Q -1 3 -2 -2 -4 5 0 -3 -1 -3 -2 4 0 -4 -3 -2 -2 -4 -3 -3 3 17 0 0 0 43 0 0 0 0 5 27 3 0 0 0 1 0 0 0 0.73 0.50 419 A 3 1 1 0 -3 2 0 -1 -2 -3 -3 3 -1 -4 -2 0 -2 -4 -1 -3 34 6 7 6 0 11 3 3 0 1 1 22 1 0 0 3 0 0 2 0 0.37 0.51 420 L -2 2 -2 -3 -3 5 -1 -4 -2 0 2 1 1 -3 -1 -2 -1 -3 -1 -1 1 12 0 0 0 34 1 0 0 4 26 6 3 0 3 1 3 0 2 3 0.42 0.51 421 L -3 -4 -5 -5 -3 -4 -4 -5 -4 2 5 -4 2 -1 -3 -2 -2 -2 -3 1 0 0 0 0 0 0 0 0 0 8 80 0 3 0 1 4 0 0 0 5 0.88 0.54 422 N 2 -2 4 1 -3 0 3 -2 -1 -1 -3 -1 -3 -4 -3 0 0 -5 -3 -1 24 0 30 7 0 2 20 0 0 5 0 1 0 0 0 4 4 0 0 3 0.46 0.54 423 R -3 6 0 -1 -5 1 -1 -4 3 -4 -3 2 -3 -4 -3 -2 -2 -4 -2 -4 0 70 3 3 0 6 0 0 9 1 1 7 0 0 0 0 0 0 0 0 1.06 0.54 424 G -1 -4 -2 -3 -4 -3 -4 7 -4 -5 -5 -3 -4 -5 -4 -2 -3 -4 -5 -5 1 0 0 0 0 0 0 99 0 0 0 0 0 0 0 0 0 0 0 0 1.69 0.53 425 A 2 2 -1 -2 -3 3 3 -3 -1 -3 -3 2 -2 -4 -3 1 -2 -4 -2 -1 25 14 2 0 0 14 19 0 1 0 1 11 0 0 0 8 0 0 1 4 0.40 0.54 426 R -2 7 -2 -3 -3 0 -1 -4 -1 -3 -1 3 0 -4 -3 -2 -2 -4 -3 -3 1 75 0 0 0 0 0 0 1 1 5 13 2 0 0 0 0 0 0 0 1.11 0.54 427 L -2 -4 -2 -5 -3 -4 -4 -5 -4 3 4 -4 2 0 -4 -3 0 -3 -2 2 1 0 2 0 0 0 0 0 0 21 52 0 6 3 0 0 5 0 0 10 0.65 0.55 428 T -2 2 -1 -3 -3 -2 -3 -2 -3 0 -1 -2 2 -3 -3 -1 5 -4 -2 3 1 13 2 0 0 0 0 2 0 3 1 0 6 0 0 1 44 0 1 26 0.52 0.55 429 E 0 -2 -2 0 -5 1 6 -3 -2 -4 -4 -1 -3 -5 -3 -1 -2 -4 -3 -3 9 0 0 0 0 2 87 0 0 1 0 1 0 0 0 0 0 0 0 2 1.18 0.55 430 V -3 -4 -5 -5 -3 -4 -4 -5 -4 3 4 -4 2 -1 -4 -3 -2 -3 -1 3 0 0 0 0 0 0 0 0 0 20 52 0 5 0 0 0 0 0 2 20 0.75 0.54 431 P -3 -4 -5 -5 -3 -4 -4 -5 -4 1 5 -4 3 2 0 -4 -2 -3 -2 -1 0 0 0 0 0 0 0 0 0 5 72 0 7 9 6 0 1 0 0 0 0.82 0.55 432 K -1 1 0 -2 
-4 0 -1 -3 -2 -3 -4 7 -3 -5 -2 -1 -1 -5 -3 -4 3 0 3 0 0 2 0 0 0 1 0 89 0 0 0 0 1 0 0 0 1.11 0.55 433 Q -2 0 -1 -2 -4 8 1 -3 -1 -4 -4 0 -2 -5 -3 -1 -2 -3 -3 -4 0 0 0 0 0 100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1.54 0.55 434 P -1 -1 1 2 -4 -1 -1 1 -3 -4 -4 2 -3 -2 5 1 -1 -5 -4 -3 4 1 7 10 0 1 1 9 0 0 0 20 0 2 32 11 2 0 0 1 0.64 0.55 435 Q -2 1 -1 -2 0 7 0 -3 -1 -1 -1 1 -2 -4 -1 -2 -2 -3 -3 -2 1 5 2 0 2 66 2 0 1 4 6 5 0 0 3 0 0 0 0 2 0.83 0.54 436 Y -2 -2 2 -3 -4 -2 -3 -2 3 -3 -1 -3 -2 3 -4 -1 -2 0 7 -3 3 1 13 0 0 1 0 3 6 0 3 0 0 11 0 4 2 0 53 0 0.89 0.54 437 A 1 -1 0 0 -3 1 0 -2 1 -3 -3 0 -1 -4 -1 4 1 -4 -3 -1 12 1 4 4 0 7 6 0 3 0 0 6 1 0 2 42 7 0 0 4 0.40 0.56 438 P -2 -4 -3 -3 -3 -3 -3 -4 -4 -4 -4 -2 -4 -5 8 -2 -1 -5 -4 -3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 96 0 3 0 0 1 2.43 0.56 439 L -3 -3 -4 -5 -2 -2 -4 -5 -3 1 4 -3 6 0 -3 -2 -2 -3 0 1 0 1 0 0 0 1 0 0 0 5 42 0 32 3 1 3 0 0 4 8 0.72 0.57 440 P 0 0 0 1 -3 -1 -1 -2 -3 -3 -4 -1 -3 -2 5 3 2 -4 -2 -3 6 4 5 8 0 2 1 0 0 1 0 2 0 1 27 25 14 0 1 1 0.57 0.57 441 I -2 -4 -3 -4 -3 -1 -4 -5 -4 3 1 -4 0 0 -2 -3 0 -2 -2 5 0 0 1 0 0 3 0 0 0 21 9 0 2 3 2 0 5 0 0 53 0.65 0.57 442 E 2 -2 -2 -1 -4 -1 5 -2 -2 -3 -3 -1 1 -2 1 0 0 0 0 -1 20 0 1 1 0 0 47 1 0 0 0 0 5 1 6 5 4 1 4 4 0.52 0.58 443 K -2 -1 1 2 -2 2 4 -3 0 -4 -3 3 -3 -4 -3 -1 -1 -4 -2 -2 0 0 5 11 1 8 39 0 2 0 2 23 0 0 0 3 2 0 1 2 0.62 0.59 444 Q -1 -1 -2 -2 -4 7 2 -3 -1 -3 -1 0 2 -4 -3 -2 -2 -4 -3 -3 3 0 0 0 0 78 8 0 0 1 5 0 6 0 0 0 0 0 0 0 1.12 0.59 445 I 1 -4 -4 -4 -1 -4 -4 -2 -4 4 0 -4 -1 -2 -4 -2 -1 -4 -3 5 12 0 0 0 1 0 0 3 0 26 1 0 0 0 0 2 1 0 0 54 0.71 0.59 446 L 2 -4 -4 -4 1 -3 -4 -2 -4 3 2 -3 0 -2 -2 -2 -1 -4 -3 3 21 0 0 0 2 0 0 2 0 18 23 0 2 0 1 1 2 0 0 27 0.45 0.59 447 V -1 -3 -3 -4 0 -3 -3 -4 -4 4 1 -3 1 -1 -4 2 0 -4 -3 3 1 0 0 0 2 0 0 0 0 26 15 0 3 2 0 24 5 0 0 21 0.44 0.59 448 I -3 -4 -5 -5 -3 -4 -5 -5 -5 5 4 -4 1 0 -4 -4 -2 -2 -2 2 0 0 0 0 0 0 0 0 0 42 45 0 1 2 0 0 0 0 0 8 0.84 0.58 449 Y -2 -4 -4 -5 -4 -3 -4 -3 -1 -2 -1 -4 -2 5 -5 -2 -3 3 7 -2 3 0 0 0 0 0 0 1 0 1 4 0 0 29 0 3 
0 3 56 1 1.23 0.59 450 A 5 -3 -3 -3 3 -2 -3 -1 -3 -1 -1 -2 -1 -3 -3 0 1 -4 -3 -1 71 0 0 0 6 0 0 1 0 2 6 0 1 0 0 3 9 0 0 2 0.73 0.58 451 A 3 -4 -3 -4 2 -3 -4 4 -4 0 1 -3 -1 0 -3 -2 -2 -4 -3 1 26 0 0 0 4 0 0 34 0 5 13 0 1 4 0 0 0 0 0 12 0.48 0.59 452 V 0 -3 1 -1 -3 0 -1 -1 -3 0 -1 -1 -1 -3 -3 0 3 -4 -3 3 8 0 8 3 0 3 4 5 0 5 3 3 0 0 0 4 22 0 0 30 0.28 0.59 453 N -2 2 5 -1 -4 1 2 -2 2 -5 -4 3 -3 -2 -3 -1 -1 -4 -2 -4 2 10 39 0 0 5 14 1 5 0 0 18 0 2 0 3 2 0 1 0 0.66 0.59 454 G -1 -2 2 0 -4 -3 -2 6 -2 -5 -5 0 -4 -5 -4 -1 -3 -4 -4 -5 1 1 9 4 0 0 1 76 0 0 1 5 0 0 0 2 0 0 0 0 1.20 0.60 455 F -2 -3 -4 -5 -4 -3 -4 -5 3 -2 0 -3 -2 5 -5 -3 -3 0 7 -1 3 0 0 0 0 0 0 0 6 0 11 1 0 28 0 0 0 0 46 4 1.04 0.60 456 C -3 -4 -5 -5 2 -4 -5 -5 -4 2 5 -4 2 2 -5 -4 -3 -3 -2 0 0 0 0 0 4 0 0 0 0 11 71 0 3 7 0 0 0 0 1 3 0.87 0.61 457 D -2 -1 0 6 -5 -1 0 -3 -2 -4 -3 1 -1 -5 -2 -1 0 -5 -4 -4 2 3 2 67 0 1 2 0 1 0 2 11 2 0 1 1 6 0 0 0 0.99 0.61 458 R -1 0 2 5 -4 0 1 -2 -2 -3 -4 2 -3 -3 -1 0 -2 0 -4 -3 4 6 9 40 0 3 5 2 0 1 1 16 0 1 2 5 1 2 0 2 0.56 0.61 459 M -2 -3 -5 -5 -3 -4 -4 -5 -5 5 2 -4 2 0 -4 -4 -2 -4 -2 4 1 1 0 0 0 0 0 0 0 39 16 0 6 2 0 0 0 0 1 35 0.77 0.61 460 P 0 -2 -1 2 -4 -1 3 -3 -3 -4 -4 0 -4 -5 6 -1 -2 -5 -4 -3 8 0 2 10 0 1 24 0 0 0 0 6 0 0 45 3 0 0 0 0 1.05 0.60 461 L -2 -3 -3 -4 -3 -2 -3 -5 -4 2 2 -3 -1 -2 1 -2 -1 -4 -3 5 2 1 1 0 0 2 1 0 0 11 25 1 0 0 7 1 2 0 0 47 0.58 0.63 462 D 1 -1 2 3 -4 0 3 -1 -2 -4 -4 2 -3 -5 -2 1 -1 -5 -4 -3 11 1 11 20 0 3 26 4 0 0 0 12 0 0 1 11 1 0 0 1 0.50 0.63 463 R -2 1 1 4 -1 2 1 -3 -1 -3 -3 3 -3 -4 -3 -1 -2 -4 0 -3 1 8 7 26 2 12 4 0 1 1 3 26 0 0 0 3 0 0 5 1 0.50 0.63 464 I -1 -4 -5 -5 -3 -4 -4 -5 -5 5 1 -4 1 -2 -4 -3 -2 -4 -3 4 4 0 0 0 0 0 0 0 0 48 7 0 2 0 0 0 0 0 0 40 0.91 0.64 465 S 0 3 -2 -3 -4 2 -1 0 -3 -1 -1 2 -2 -4 -1 1 1 -4 -3 -1 8 20 1 0 0 10 3 7 0 3 7 12 0 0 3 11 7 0 0 6 0.24 0.64 466 Q 0 2 0 3 -4 1 2 -3 -2 -4 -3 2 -3 -2 -1 1 -1 -4 -2 -3 10 14 3 16 0 6 16 0 0 1 2 14 0 1 2 10 3 0 1 0 0.37 0.64 467 Y -2 -4 -4 -5 -2 -4 -4 -5 -2 -1 -1 -4 -2 7 -5 -4 -4 
2 5 -2 3 0 0 0 1 0 1 0 0 3 3 0 0 71 0 0 0 2 17 0 1.36 0.63 468 E -2 0 -2 -1 -4 1 6 -3 -2 -3 -1 2 -2 -2 -3 -2 -2 -4 -3 -3 1 3 0 0 0 2 71 0 0 1 7 11 0 2 0 1 0 0 0 1 0.92 0.64 469 K 0 1 1 0 -4 1 1 0 0 -3 -3 3 -2 -3 -3 1 1 -4 0 -2 10 7 6 3 0 5 11 5 2 1 1 25 0 0 0 10 8 0 3 3 0.28 0.64 470 A 2 -1 -1 2 -4 1 3 1 -3 -3 -1 1 0 -3 -3 0 -2 -4 -2 -2 17 3 2 12 0 5 19 12 0 1 8 10 3 1 0 7 1 0 1 1 0.26 0.65 471 I -3 -4 -5 -5 -3 -4 -5 -5 -4 3 5 -4 1 3 -5 -4 -3 0 1 0 0 0 0 0 0 0 0 0 0 16 61 0 2 12 0 0 0 1 4 3 0.83 0.65 472 P -2 2 -2 -4 -2 -2 -4 -5 1 2 3 -2 0 3 -3 -3 -2 -2 1 0 3 12 1 0 1 2 0 0 4 14 40 0 1 12 1 0 1 0 4 4 0.43 0.64 473 N 1 -1 2 2 -4 1 3 -2 -1 -3 -2 0 -3 -4 -2 1 0 -4 -3 -3 14 2 10 12 0 6 24 1 1 1 5 6 0 0 1 11 4 0 0 0 0.34 0.64 474 S -1 -1 -1 -3 -3 -1 0 -3 4 -3 -2 1 -2 4 -4 -1 -2 2 6 -3 4 2 2 1 0 2 5 1 10 0 3 10 0 17 0 5 2 2 33 1 0.62 0.63 475 V 0 -4 -4 -5 2 -4 -4 -4 -4 2 4 -4 2 3 -4 -3 -2 -3 -2 2 8 0 0 0 5 0 0 0 0 12 40 0 6 12 0 1 0 0 0 16 0.56 0.64 476 K -2 3 2 3 -4 1 0 -3 3 -3 -4 4 -3 -4 -2 -1 -1 -4 -2 -3 1 16 10 21 0 4 4 1 8 2 0 27 0 0 1 2 3 0 1 2 0.52 0.63 477 P 2 0 1 -1 -3 1 0 -1 0 -3 -3 0 -3 -4 1 3 2 -4 -2 -2 15 4 8 3 0 5 6 3 3 0 2 5 0 0 6 26 13 0 1 1 0.31 0.64 478 E -1 -1 4 2 -4 2 2 -2 1 -3 -4 2 -3 -4 -3 2 -1 -5 -2 -3 2 2 23 9 0 8 17 1 3 1 0 17 0 0 0 14 2 0 1 1 0.51 0.65 479 L 0 0 0 -1 -3 0 -1 -2 6 -2 0 3 -2 -1 -4 -2 -3 -3 3 -2 8 3 3 4 0 2 2 2 29 2 9 20 0 2 0 0 0 0 13 1 0.53 0.65 480 L 1 -2 -1 -2 -3 0 -2 1 -3 -3 0 1 -3 0 5 1 -1 -4 -3 -3 11 1 3 0 0 5 1 9 0 0 10 10 0 4 29 12 4 0 0 1 0.47 0.64 481 Q 0 -2 -1 4 -4 2 4 -2 -2 -3 -4 0 -3 -3 -2 0 -2 -4 -3 -3 8 0 1 29 0 8 36 2 0 2 1 5 0 1 1 5 1 0 0 0 0.66 0.63 482 A -2 -4 -4 -2 -2 -3 -1 -4 -2 4 3 -3 -1 1 -4 -1 -1 -3 0 2 2 0 0 2 0 0 4 0 1 31 27 1 0 7 0 5 2 0 3 14 0.44 0.64 483 L -2 -2 -4 -4 -1 -3 -2 -2 -4 2 4 -3 4 2 -3 -3 -3 0 -1 1 2 1 0 0 1 1 3 3 0 15 43 1 13 8 1 0 0 1 1 6 0.51 0.66 484 K 0 1 1 2 -4 1 3 -2 -2 -4 -3 3 -3 -4 -3 0 0 -4 0 -4 10 6 5 11 0 4 25 1 0 0 2 21 0 0 0 7 4 0 4 0 0.43 0.68 485 G 0 -1 0 1 -4 1 2 2 -1 -1 -3 
1 -3 -4 -3 2 1 -3 -4 -3 8 2 4 7 0 5 13 13 1 4 2 11 0 0 0 18 8 0 0 1 0.26 0.67 486 G -2 -2 -3 -2 -3 0 -3 -1 -4 6 0 -2 1 -2 -4 -2 1 -4 -3 2 2 2 1 2 0 4 1 5 0 55 3 2 3 0 0 2 10 0 0 8 0.57 0.66 487 L 1 3 1 -2 -3 1 0 -2 -2 2 1 2 -1 -1 -3 -1 -2 -4 -2 -1 11 15 9 0 0 7 4 3 0 13 13 11 1 3 0 4 1 0 0 2 0.18 0.67 488 T -1 0 1 1 -3 -1 2 -2 -3 -3 -4 0 -3 -4 -3 2 5 -4 -3 -3 1 4 7 6 0 0 14 2 0 0 0 6 0 0 0 18 41 0 1 0 0.57 0.68 489 N -2 -1 3 1 -4 0 1 3 -2 -3 -3 3 -3 -4 -2 -1 0 -5 -4 -4 2 0 15 6 0 2 11 26 0 1 3 26 0 0 1 3 4 0 0 0 0.48 0.68 490 E -1 -1 -1 3 -2 1 3 -2 -2 -4 -4 4 -3 -5 -2 -1 0 -5 -3 -2 3 1 2 20 1 5 23 2 0 0 0 29 0 0 1 2 7 0 1 3 0.57 0.69 491 R -1 0 -2 -3 -3 -3 -3 -2 -1 2 3 0 1 3 -4 -3 -2 2 2 -1 4 5 1 1 0 0 1 3 1 11 39 8 3 11 0 0 1 3 7 0 0.38 0.69 492 K -2 -2 3 3 -4 -1 1 -2 -2 -4 -2 1 -3 -1 -3 2 3 -4 -2 -3 1 0 17 14 0 0 10 2 0 0 4 7 0 3 0 20 19 0 1 0 0.42 0.70 493 M 1 -2 -1 4 -4 -1 2 -2 -2 -1 -1 1 -1 -2 0 -1 -2 -3 -1 -3 12 0 1 32 0 1 17 3 0 3 8 8 2 2 4 4 0 0 3 0 0.40 0.70 494 E -1 -2 1 4 -5 2 4 -2 -2 -4 -4 0 0 -5 0 0 -2 -5 -4 -3 3 0 5 25 0 7 36 2 0 1 1 4 3 0 4 8 1 0 0 1 0.64 0.71 495 P 1 -3 0 0 -3 -2 -1 -4 1 3 2 -2 -1 -2 -2 0 2 -4 0 0 13 0 6 7 0 0 3 0 3 18 18 2 0 1 2 5 13 0 3 6 0.19 0.71 496 D 0 -2 -1 2 -4 1 5 -4 -2 -3 -3 2 -3 -3 -3 -1 -1 -5 -4 0 6 0 2 12 0 4 47 0 0 0 2 12 0 1 0 1 2 0 0 10 0.63 0.70 497 A 3 -1 2 1 -4 1 2 -1 -2 -4 -4 3 -3 -5 -3 0 0 -5 -4 -3 25 1 9 7 0 6 12 3 0 0 0 26 0 0 0 5 4 0 0 1 0.44 0.70 498 F 2 0 -2 -1 -1 2 0 -2 -3 -1 0 3 2 1 -3 0 0 -4 -3 -1 20 2 0 2 1 13 4 1 0 3 9 19 5 6 0 5 4 0 0 4 0.22 0.70 499 L -3 -4 -5 -5 -1 -4 -5 -5 -4 2 5 -4 1 2 -5 -4 -2 -3 -1 0 1 0 0 0 1 0 0 0 0 12 74 0 1 9 0 0 1 0 1 1 0.96 0.70 500 K -1 0 2 1 -4 0 2 -3 -2 -1 -3 5 -3 -3 -3 0 0 -5 -4 -1 3 3 9 6 0 1 11 1 0 3 1 47 0 1 0 5 5 0 0 4 0.55 0.71 501 E 1 -2 1 2 -4 0 4 -2 -2 -4 -3 2 -3 -4 -3 0 -1 -4 0 -3 16 0 7 8 0 3 35 2 0 0 1 14 0 0 0 7 2 0 3 0 0.50 0.71 502 R 3 -3 -1 -4 0 -3 -2 -2 -3 2 0 -3 -1 1 -4 0 -1 -4 -1 3 35 0 3 0 2 0 1 2 0 14 8 0 1 7 0 4 1 0 1 20 0.38 0.71 503 A -1 -4 -5 -5 
-2 -3 -4 -4 -5 5 2 -1 1 0 -4 -3 -1 -4 -3 2 6 0 0 0 0 1 0 0 0 48 20 4 2 3 0 0 3 0 0 12 0.68 0.71 504 L -1 1 1 2 -4 2 3 -3 -2 -3 -3 3 -3 -5 -3 0 1 -5 -4 -3 4 5 7 13 0 7 22 0 0 1 3 23 0 0 0 5 8 0 0 1 0.45 0.71 505 A 0 -1 2 2 -4 0 4 -2 -2 -4 -4 1 -1 0 -1 1 0 -5 -3 -3 7 1 9 13 0 2 32 2 0 0 0 9 1 4 3 11 5 0 0 0 0.44 0.71 506 L -4 -4 -5 -5 -4 -5 -5 -5 -2 -2 1 -5 -1 8 -5 -4 -4 -1 4 -2 0 0 0 0 0 0 0 0 0 0 9 0 0 75 0 0 0 0 15 1 1.51 0.69 507 I -2 -4 -4 -5 -2 -3 -4 -5 -4 3 3 -4 3 -1 -4 -3 -2 -4 -2 4 0 0 0 0 0 0 0 0 0 21 33 0 11 0 0 0 0 0 0 35 0.67 0.45 K Lambda Standard Ungapped 0.1349 0.3178 Standard Gapped 0.0410 0.2670 PSI Ungapped 0.1840 0.3174 PSI Gapped 0.0563 0.2670 ay116458.gb100644000766000024 442714006427353 16017 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS AY116458 229 bp DNA linear PLN 09-APR-2003 DEFINITION Eucalyptus globulus subsp. bicostata isolate EgRT12 Ty1-copia retrotransposon reverse transcriptase-like (RT) gene, partial sequence. ACCESSION AY116458 VERSION AY116458.1 GI:22858882 KEYWORDS . SOURCE Eucalyptus globulus subsp. bicostata ORGANISM Eucalyptus globulus subsp. bicostata Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids; Myrtales; Myrtaceae; Eucalyptus. REFERENCE 1 (bases 1 to 229) AUTHORS Diez,J., Beguiristain,T., Le Tacon,F., Casacuberta,J.M. and Tagu,D. TITLE Identification of Ty1- copia retrotransposons in three ectomycorrhizal basidiomycetes: evolutionary relationships and use as molecular markers JOURNAL Curr. Genet. 43 (1), 34-44 (2003) PUBMED 12684843 REFERENCE 2 (bases 1 to 229) AUTHORS Beguiristain,T., Diez,J., Le Tacon,F., Casacuberta,J. and Tagu,D. TITLE Direct Submission JOURNAL Submitted (29-MAY-2002) UMR 1136 IaM Interactions Arbres/Micro-Organismes, INRA-Nancy, Champenoux 54280, France FEATURES Location/Qualifiers source 1..229 /organism="Eucalyptus globulus subsp. 
bicostata" /mol_type="genomic DNA" /isolate="EgRT12" /sub_species="bicostata" /db_xref="taxon:71272" gene <1..>229 /gene="RT" /note="EgRT" misc_feature <1..>229 /gene="RT" /note="similar to reverse transcriptase; unknown if correctly expressed due to presence of premature stopcodons and frameshifts" repeat_region <1..>229 /transposon="Ty1-copia retrotransposon" ORIGIN 1 aatgggaatt tagaggaaga ggtttatatg gaccaacccg aaggctttcc agttatagga 61 aaggaacaca tggtatgtaa attaaagaag tcaatatatg gacttaaaca agcttcccga 121 caatggtatc ttaagttaaa cgataccata acttcatttg gatttaagga aaacactgtt 181 gatcggtgta tatatatgaa gattagtggg agcaagttta tttttctag // pep-266.aln100644000766000024 2044314006427353 16212 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataCLUSTAL W (1.82) multiple sequence alignment Spar_21273 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA YOR262W -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Smik_Contig1103.1 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Skud_Contig1703.7 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Sbay_Contig635.43 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Scas_Contig692.20 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHGQIVNMDPA Sklu_Contig2277.4 -------------MPFGQIVIGPPGSGKSTYCN------GCSQFFNAVGRHAQVINMDPA AAL117C -------------MAYGQIVIGPPGSGKSTYCN------GCSQFFNAIGRHARIVNMDPA SPAC144.07c -------------MPFCQVVVGPPGSGKSTYCF------GMYQLLSAIGRSSIIVNLDPA MG06110.4 -----------------------------------------MGFLGAIGRACSVVNLDPA NCU09745.1 ----------------------------------MTSPLPVQQFMGAIGRQCSVVNLDPA FG05298.1 -------------MPFAQLVLGSPGCGKSTYCDGIQLTGQVHQFLGAIGRACSVVNLDPA AN2438.1 ------------------------------------------------------------ 183.m01790 MDDKELEIPVEHSTAFGQLVTGPPGAGKSTYCH------GLHQFLTAIGRPVHIINLDPA Spar_21273 NDALPYPCAVDIRDFITLEEIMQEQQLGPNGGLVYAVESLDKSIDLFILQIKSLV--EEE YOR262W NDALPYPCAVDIRDFITLEEIMQEQQLGPNGGLMYAVESLDNSIDLFILQIKSLV--EEE Smik_Contig1103.1 
NDALPYPCAVDIRDFITLEEIMQEQQLGPNGGLMYAVESLDKSIDLFLLQIKSLV--EEE Skud_Contig1703.7 NDALPYPCAVDIRDFITLEEVMQEQQLGPNGGLMYAVESLDNSIDLFILQIKSLV--EEE Sbay_Contig635.43 NDALPYPCAVDIRDFITLEEIMKEQHLGPNGGLMYAVESLDKSIDLFILQIKSLV--EEE Scas_Contig692.20 NDALPYPCAVDIRDFVTLEEIMQEQQLGPNGGLMYAVESLDESIDLFILQIKSLV--QEE Sklu_Contig2277.4 NDSLPYPCAVDIRDFITLEEIMTEQQLGPNGGLMYALESLDKSIDLFVLQIKSLV--QDE AAL117C NDSLPYQCDVDIRDFITLEEIMNEQHLGPNGGLVYAFESVEHSLSLFALQIKTLV--KDE SPAC144.07c NDFIKYPCAIDIRKVLDVEMIQKDYDLGPNGALIYAMEAIEYHVEWLLKELK-----KHR MG06110.4 NDHTSYPCALDIRNLVTLEEIMGDDNLGPNGGILYAIEELEHNFEWLEDGLK-----ELG NCU09745.1 NDHTNYPCALDIRDLVTLEEIMADDKLGPNGGILYALEELENNMEWLENGLK-----ELG FG05298.1 NDHTNYPAALDIRSLIKLEEIMKDDKLGPNGGILYALEELEHNFEWLEEGLK-----EFS AN2438.1 ---------------------MSEDQLGPNGGVLYALEELEENFDFLEEGLK-----ELG 183.m01790 VPNPPYPCSINITELITLESVMEEYNLGPNGAMLYCIEFLEANFDWLVERLDEVLAEEGG : .*****.::*..* :: .. : :. : Spar_21273 KAYLVFDCPGQVELFTHHSSLFSIFKKMEKELDMRFCVVNLIDCFYMTSPSQYISILLLA YOR262W KAYLVFDCPGQVELFTHHSSLFNIFKKMEKELDIRFCVVNLIDCFYMTSPSQYISILLLA Smik_Contig1103.1 KAYLVFDCPGQVELFTHHSSLFNIFKKMEKELDMRFCVINLIDCFYMTSPSQYVSILLLA Skud_Contig1703.7 KAYLVFDCPGQVELFTHHSSLFSIFKKMEKELDMRFCVVNLIDCFYMTSPSQYVSILLLA Sbay_Contig635.43 KAYVVFDCPGQVELFTHHSSLFSIFKKLEKELDMRFCVVNLIDCFYMTSPSQYVSILLLA Scas_Contig692.20 KAYLVFDCPGQVELFTHHSSLFKIFKKLEKELDMRFCVVNLIDSFYITSPSQYVSILLLA Sklu_Contig2277.4 HAYVVFDCPGQVELFTHHSSLFRIFKKLERELDMRLCVVNLIDCFYITSPSQYVSILLLA AAL117C NAYLVFDCPGQVELFTHHSALSKIFQQLVRDLDLRVCVVNLMDSIYITSPSQYVSVLLLA SPAC144.07c DSYVIFDCPGQVELFTNHNSLQKIIKTLEKELDYRPVSVQLVDAYCCTNPSAYVSALLVC MG06110.4 DDYILFDCPGQVELYTHHNSLRNIFFKLQK-LGYRLVVVHLSDSICLTQPSLYISNLLLA NCU09745.1 EDYVLFDCPGQVELYTHHNSLRNIFYRLQK-LGYRLVVVHLSDCFCLTQPSLYISNVLLS FG05298.1 EDYILFDCPGQVELYTHHNSLRNIFYKLQK-IGFRLVSVHLSDSFCLTQPSLYVSNVLLS AN2438.1 EDYIIFDCPGQVEIFTHHSSLRNIFFKIQK-MGYRLIVLHLIDSYNLTLPSMYISSLILC 183.m01790 NGYVVFDTPGQAELWTNHDSLKNVVEKLVK-MDYRLAAVHLSDAHYITDASKFISVVLLA . *::** ***.*::*:*.:* :. : : :. * ::* *. * .* ::* :::. 
Spar_21273 LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSSVLGKKYS YOR262W LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSSVLGKKYS Smik_Contig1103.1 LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSSVLGKKYN Skud_Contig1703.7 LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQELDHLEPYIEKEGSSVLGKKYS Sbay_Contig635.43 LRSMLMMDLPHINVFSKIDKLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSGALGKRYS Scas_Contig692.20 LRSMLMMDLPQINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLQPFIEKESSSVLGRRYS Sklu_Contig2277.4 LRSMLMMDLPHINVFSKIDLLKSYGELPFRLDYYTEVQELDYLKPHIDKEGSSVLGRKYS AAL117C LRSMLMMDLPHINVLSKIDMLSSYGDLPFRLDYYTEVQDLEYLQPHIEREHKGAKALRYR SPAC144.07c LKGMLQLDMPHVNILSKADLLCTYGTLPMKLDFFTEVQDLSYLAPLLDRDKR---LQRYS MG06110.4 LRAMLQMDLSHVNVLTKIDKVSSYDRLAFNLDFYTEVHDLSYLLPELEAENPSLRSEKFA NCU09745.1 LRAMLQMDLPHINVLTKIDKISSYDPLPFNLDYYTEVQDLRYLMPSLDAESPALKKGKFT FG05298.1 LRAMIQMDMPHINILSKIDKVADYDELPFNLDYYTDVDDLTYLTPHLETESPALRSEKFG AN2438.1 LRAMLQMDLPHLNVLTKIDNLSNYTSLPFNLDFYTEVQDLTYLLPHLEAESSRLSHEKFG 183.m01790 LRAMLQMEMPHLNVLSKIDLISTYGELPFDLSYYTEVQDLSYLLGSLDSDPR---TAKYH *:.*: :::.::*:::* * : * *.: *.::*:*.:* :* :: : :: Spar_21273 KLTETIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ YOR262W KLTETIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Smik_Contig1103.1 KLTDAIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Skud_Contig1703.7 KLTETIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Sbay_Contig635.43 KLTETISELVSDFNLVSFEVLAVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Scas_Contig692.20 KLTETISELVSDFNLVSFEVLAVDDKQSMINLQSVVDKANGYIFGAS---EVGG------ Sklu_Contig2277.4 RLTETISELVSDFNLVSFEVLCVDDKQSMINLQSIVDKANGYIFGVS---EIGG------ AAL117C RLTEAIGEVVSDFNLVAFEVLCVDDKQSMINLQSAIDKANGYIFGAS---EVGG------ SPAC144.07c DLNKAICELVEDFNLVSFEVVAVENKASMLRVLRKIDQAGGYAYGST---EIGG------ MG06110.4 KLNRAVANLIEDFGLVRFEVLAVENKKSMMHLLRVLDRANGYVFGGA---EGAN------ NCU09745.1 KLNEAVANMVEQFGLVSFEVLAVENKKSMMHLLRVIDRASGYVFGGA---EGTN------ FG05298.1 KLNEAIANLIESYGLVRYEVLAVENKKSMMHILRVIDRAGGYVFGSA---EGAN------ AN2438.1 
ALNNAIITLIEEFGLVGFETLAVEDKKSMMNLLRAIDRASGYVFGPA---EGAN------ 183.m01790 KLNKALVELIEGFSLVGFQTLAVEDKESMLNIVRLVDKMTGYIFIPSGDLEGTNAINTQA *. :: ::. :.** ::.:.*::* **:.: :*: ** : : * . Spar_21273 --DTVWAEASREGA-LLANYDIQDRWIDNKEKYDKEEEEKRAALLKEQELQNKAVDVNEE YOR262W --DTVWAEASREGA-LIANYDIQDRWIDNKEKYDKEEEEKRTALLKEQELQNKAVDVNEE Smik_Contig1103.1 --DTVWAEASREGA-LLTSYDIQDRWIDNKEKYDKEEEEKRVILLKEQELQNKAVDVNED Skud_Contig1703.7 --DTVWAEASREGA-LLENYDIQDRWIDNKETYDKEEQEKRASLLKEQELQNKTVDVKEE Sbay_Contig635.43 --DTVWAEASREGA-LLASYDIQDRWIDNKEKYDKEEQEKRAAMVKEQELQNKEVNVDEE Scas_Contig692.20 --DTVWAEATREGA-MMVNYDIQDRWIDNKEKYDEEERKRQEEQAKEQNMQEKEVDVDNE Sklu_Contig2277.4 --DTVWAEATRQGS-AIANYDIQERWIDNKDMYDREEQEKREQLLKEEELQNKEVDVDKG AAL117C --DTVWAEATRQGT-AAIEYDIQDRWIDNKDFYDKEEEARRKKLLEEHELLEKEVDVNQD SPAC144.07c --DAVWVNAVRQGGDPLQGISPQERWIDKKEEYDK----------YEWELEQK--STMDE MG06110.4 --DTVWQVAMRNEGSLMGVQDIQERWIDNKEAYDEMEQREWEEQVKAQEAMAEADAAAAE NCU09745.1 --DTVWQVAMRNESSLPDALDIQERWIDSKEEYDEMERKEEEEQEKLRAEQARAAEEAGL FG05298.1 --DTVWSVAMRNESSMLGVQDIQERWIDQKVEYDQMEREAEEEQARIQEEQAMEMEQSQP AN2438.1 --DSVWQVAVREGMGSMDIRDIQERWIDAKDEYDELERRQREEEIKNHQQAATYQAGNED 183.m01790 LFGSAMSSAKLTGRAGGDVRDVQERWMDNKEAWDEWEKKEWKREAEIRAQMGTGIPEGMK .:. * . *:**:* * :*. 
Spar_21273 DEWENALKEWEEKQGTDFVR-------------- YOR262W DEWENALKEWEEKQGMDFVR-------------- Smik_Contig1103.1 DEWESALKEWEEKQGMDFVR-------------- Skud_Contig1703.7 DEWENALKEWEEKQDTEFVR-------------- Sbay_Contig635.43 DEWENALNDWEEKQGTDFVR-------------- Scas_Contig692.20 DEWEKALKDWEEKQGTGYVR-------------- Sklu_Contig2277.4 DEWENALKEWEEKQGMSYVK-------------- AAL117C DEWERAVKEWESQHSVNFVK-------------- SPAC144.07c DENEG----------------------------- MG06110.4 -EG-----DDDLMGGPGAR--------------- NCU09745.1 GDGSVPGVAPQFTSGSGIRVTLSLVAAFTKYSDL FG05298.1 PPAPTGGMDPDFGDMTVPKDSGIKVVRK------ AN2438.1 DDDDN---DYEFGRRMPVPDSGVKVMRK------ 183.m01790 GGED--------AESTGI---------------- cysprot1.fa100644000766000024 55014006427353 16450 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>CYS1_DICDI MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHK ADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQT ESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCN PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII testaln.psi100644000766000024 20123114006427353 16622 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data QUERY MAWDMCNQDSESVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS PRGC1_HUMAN MAWDMCNQDSESVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q4W5M7_HUMAN MAWDMCNQDSESVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q5RBY0_PONPY MAWDMCNQDSESVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS PRGC1_PIG MAWDMCNQD--SVWTDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS PRGC1_RAT MAWDMCSQD--SVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS PRGC1_BOVIN MAWDMCNQD--SVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q4L229_BOVIN MAWDMCNQD--SVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS PRGC1_MOUSE MAWDMCSQD--SVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q3UP72_MOUSE MAWDMCSQD--SVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q60GU0_CHICK MAWDMCNQD--SVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDADS Q5QHW4_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN MAWDMCNQDSESVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q3LIG2_MOUSE MAWDMCSQD--SVWSDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG MAWDRCNQD--SVWRELECAALVGEDQPLCPDLPELDLSELDVSDLDADS Q811R2_RAT XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q8TAL0_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8TDE4_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q8VHJ7_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q86YN5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q8N1N9_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q86YN3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q8TDE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q86YN6_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q86YN4_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSAT Q8C1C0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSPT Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG MAWDMCNQD--SVWTDIECAALVGEDQPLCPDLPELDLSELDVNDLDTDS Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDFDSVN Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP PRGC1_HUMAN FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP Q4W5M7_HUMAN FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP Q5RBY0_PONPY FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP PRGC1_PIG FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP PRGC1_RAT FLGGLKWCSDQSEIISNQYNNEPANIFEKIDEENEANLLAVLTETLDSLP PRGC1_BOVIN FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP Q4L229_BOVIN FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP PRGC1_MOUSE FLGGLKWCSDQSEIISNQYNNEPANIFEKIDEENEANLLAVLTETLDSLP Q3UP72_MOUSE FLGGLKWCSDQSEIISNQYNNEPANIFEKIDEENEANLLAVLTETLDSLP Q60GU0_CHICK FLGGLKWYSDQSEVISSQYSNEPANIFEKIDEENEANLLAVLTETLDSIP Q5QHW4_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDEENEANLLAVLTETLDSIP Q3LIG1_HUMAN FLGGLKWCSDQSEIISNQYNNEPSNIFEKIDEENEANLLAVLTETLDSLP Q3LIG2_MOUSE FLGGLKWCSDQSEIISNQYNNEPANIFEKIDEENEANLLAVLTETLDSLP Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG 
FLGGLKWYSDQSEIISAQYGNEASNLFEKIDEENEANLLAVLTETLDSIP Q811R2_RAT CFGELQWCPETSETEPSQYSPDDSEFFQ-IDSENEA-LLAALTKTLDDIP Q8TAL0_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8TDE4_HUMAN CFGELQWCPENSETEPNQYSPDDSELFQ-IDSENEA-LLAELTKTLDDIP Q8VHJ7_MOUSE CFGELQWCPETSETEPSQYSPDDSELFQ-IDSENEA-LLAALTKTLDDIP Q86YN5_HUMAN CFGELQWCPENSETEPNQYSPDDSELFQ-IDSENEA-LLAELTKTLDDIP Q8N1N9_HUMAN CFGELQWCPENSETEPNQYSPDDSELFQ-IDSENEA-LLAELTKTLDDIP Q86YN3_HUMAN CFGELQWCPENSETEPNQYSPDDSELFQ-IDSENEA-LLAELTKTLDDIP Q8TDE5_HUMAN CFGELQWCPENSETEPNQYSPDDSELFQ-IDSENEA-LLAELTKTLDDIP Q86YN6_HUMAN CFGELQWCPENSETEPNQYSPDDSELFQ-IDSENEA-LLAELTKTLDDIP Q86YN4_HUMAN CFGELQWCPENSETEPNQYSPDDSELFQ-IDSENEA-LLAELTKTLDDIP Q8C1C0_MOUSE CFGELQWCPETSETEPSQYSPDDSELFQ-IDSENEA-LLAALTKTLDDIP Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG FLGGLKWCSDQSEIISNQYNNEPSNIFEXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE CLSELHWCNEQSDHSPAQYSAGDPELFE---EEN-AALLAALTDSLDGIV Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY VDEDGLPSFDALTDGDVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA PRGC1_HUMAN VDEDGLPSFDALTDGDVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q4W5M7_HUMAN VDEDGLPSFDALTDGDVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q5RBY0_PONPY VDEDGLPSFDALTDGDVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA PRGC1_PIG VDEDGLPSFDALTDGDVTTENEASPSSMPDGTPPPQEAEEPSLLKKLLLA PRGC1_RAT VDEDGLPSFDALTDGDVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA PRGC1_BOVIN VDEDGLPSFDALTDGDVTTENEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q4L229_BOVIN VDEDGLPSFDALTDGDVTTENEASPSSMPDGTPPPQEAEEPSLLKKLLLA PRGC1_MOUSE VDEDGLPSFDALTDGAVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q3UP72_MOUSE VDEDGLPSFDALTDGAVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q60GU0_CHICK VDEDGLPSFDALTDGDVTNEHDASPSPMPDGTPPPQEAEEPSLLKKLLLA Q5QHW4_PIG XXXXXXXXXDALTDGDVTTENEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU VDEDGLPSFEALADGDVTNASDRSCPSSPDGSPRTPEPEEPSLLKKLLLA Q3LIG1_HUMAN VDEDGLPSFDALTDGDVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q3LIG2_MOUSE VDEDGLPSFDALTDGAVTTDNEASPSSMPDGTPPPQEAEEPSLLKKLLLA Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG VDEDGLPSFEALADGDVTNASDRSCPSSPGGSPRTPEPEEPSLLKKLLLA Q811R2_RAT EDDVGLAAFPGLDEGDTPSCTPASPAPLSVPPSPASEVDELSLLQKLLLA Q8TAL0_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8TDE4_HUMAN EDDVGLAAFPALDGGDALSCTAPPSPAPEKPSAPAPEVDELSL------- Q8VHJ7_MOUSE EDDVGLAAFPELDEGDTPSCTAPPSPTLERLLSPASDVDELSLLQKLLLA Q86YN5_HUMAN EDDVGLAAFPALDGGDALSCTAPPSPAPEKPSAPAPEVDELSLLQKLLLA Q8N1N9_HUMAN 
EDDVGLAAFPALDGGDALSCTAPPSPAPEKPSAPAPEVDELSLLQKLLLA Q86YN3_HUMAN EDDVGLAAFPALDGGDALSCTAPPSPAPEKPSAPAPEVDELSLLQKLLLA Q8TDE5_HUMAN EDDVGLAAFPALDGGDALSCTAPPSPAPEKPSAPAPEVDELSLLQKLLLA Q86YN6_HUMAN EDDVGLAAFPALDGGDALSCTAPPSPAPEKPSAPAPEVDELSLLQKLLLA Q86YN4_HUMAN EDDVGLAAFPALDGGDALSCTAPPSPAPEKPSAPAPEVDELSLLQKLLLA Q8C1C0_MOUSE EDDVGLAAFPELDEGDTPSCTAPPSPTLERLLSPASDVDELSLLQKLLLA Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXA Q800H2_BRARE EDGVGLSVFPSLGDEPEEGEEEEDDLPMESEPLPSPETEDPSLLKKLLLT Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY PANTQLSYNECSGLSTQNHANHNHRIRTNPAIVKTENSWSNKAKSICQQQ PRGC1_HUMAN PANTQLSYNECSGLSTQNHANHNHRIRTNPAIVKTENSWSNKAKSICQQQ Q4W5M7_HUMAN PANTQLSYNECSGLSTQNHANHNHRIRTNPAIVKTENSWSNKAKSICQQQ Q5RBY0_PONPY PANTQLSYNECSGLSTQNHANHNHRIRTNPAIVKTENSWSNKAKSICQQQ PRGC1_PIG PANTQLSYNECSGLSTQNHANHNHRIRTNPAVVKTENSWSNKAKSICQQQ PRGC1_RAT PANTQLSYNECSGLSTQNHANHTHRIRTNPAIVKTENSWSNKAKSICQQQ PRGC1_BOVIN PANTQLSYNECSGLSTQNHANHNHRIRTNPAVVKTENSWSNKAKSICQQQ Q4L229_BOVIN PANTQLSYNECSGLSTQNHANHNHRIRTNPAVVKTENSWSNKAKSICQQQ PRGC1_MOUSE PANTQLSYNECSGLSTQNHANHTHRIRTNPAIVKTENSWSNKAKSICQQQ Q3UP72_MOUSE PANTQLSYNECSGLSTQNHANHTHRIRTNPAIVKTENSWSNKAKSICQQQ Q60GU0_CHICK PANTQLNYNECSGLSTQNHANTNHRIRTSPVVVKTENSWSNKAKSICQQQ Q5QHW4_PIG PANTQLSYNECSGLSTQNHANHNHRIRTNPAVVKTENSWSNKAKSICQQQ Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU PANSQLSYNQYTGGQAQNHASSNHRIRPPPAVVKMESTWNGKARGSSQQN Q3LIG1_HUMAN PANTQLSYNECSGLSTQNHANHNHRIRTNPAIVKTENSWSNKAKSICQQQ Q3LIG2_MOUSE PANTQLSYNECSGLSTQNHANHTHRIRTNPAIVKTENSWSNKAKSICQQQ Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG PANSQLSYNQYTGGKAQNHASSNHRIRPPPAVVKMESPWNGKSRGCSQQN Q811R2_RAT TSSPTASSDALKDGATWSQTSLSS--RSQRPCVKVDGTQDKKTPML---- Q8TAL0_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8TDE4_HUMAN ----------------------------------ADSTQDKKAPMM---- Q8VHJ7_MOUSE TSSPTASSDALKDGATWSQTSLSS--RSQRPCVKVDGTQDKKTPTL---- Q86YN5_HUMAN T-----SYPTSSSDTQKEGTAWRQRSKSQRPCVKADSTQDKKAPMM---- Q8N1N9_HUMAN T-----SYPTSSSDTQKEGTAWRQRSKSQRPCVKADSTQDKKAPMM---- Q86YN3_HUMAN T-----SYPTSSSDTQKEGTAWRQRSKSQRPCVKADSTQDKKAPMM---- Q8TDE5_HUMAN T-----SYPTSSSDTQKEGTAWRQRSKSQRPCVKADSTQDKKAPMM---- Q86YN6_HUMAN T-----SYPTSSSDTQKEGTAWRQRSKSQRPCVKADSTQDKKAPMM---- Q86YN4_HUMAN T-----SYPTSSSDTQKEGTAWRQRSKSQRPCVKADSTQDKKAPMM---- Q8C1C0_MOUSE TSSPTASSDALKDGATWSQTSLSS--RSQRPCVKVDGTQDKKTPTL---- Q76N31_HUMAN 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWSNKAKSICQQQ Q800H3_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE PANSQLSYNQYPGGKAQNHAASNQRIRPAPSCIKTENPWNSKPRGAC-PN Q800H2_BRARE PPNVPVGLESHKDSGVHRHSSRNQHVKPVRPVLKXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKSHTQSQSQH PRGC1_HUMAN KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKSHTQSQSQH Q4W5M7_HUMAN KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKSHTQSQSQH Q5RBY0_PONPY KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKSHTQSQSQH PRGC1_PIG KPQRRPCSELLKYLTTNDDPPHTKPTETRNSSRDKCTSKKKAHTQSQSQH PRGC1_RAT 
KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCASKKKSHTQPQSQH PRGC1_BOVIN KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKAHTQSQTQH Q4L229_BOVIN KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKAHTQSQTQH PRGC1_MOUSE KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCASKKKSHTQPQSQH Q3UP72_MOUSE KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCASKKKSHTQPQSQH Q60GU0_CHICK KPQRRPCSELLKYLTTNDDPPQTKPAENRNSSKEKCTSKRKPHLQSQTNH Q5QHW4_PIG KPQRRPCSELLKYLTTNDDPPHTKPTETRNSSRDKCTSKKKAHTQSQSQH Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU RPVRRHCTELLKYLTATDDILHAKNNDAKGTSRDKSGLGL ---------- Q3LIG1_HUMAN KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKSHTQSQSQH Q3LIG2_MOUSE KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCASKKKSHTQPQSQH Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG RPVRRHCTELLKYLTATDDILHPKASEAKGTSRDKS-------------- Q811R2_RAT RSQSRPCTELHKHLTSVLPCPRGKA------------------------- Q8TAL0_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8TDE4_HUMAN QSQSRSCTELHKHLTSAQCCLQ---------------------------- Q8VHJ7_MOUSE RAQSRPCTELHKHLTSVLPCPRVKA------------------------- Q86YN5_HUMAN QSQSRSCTELHKHLTSAQCCLQ---------------------------- Q8N1N9_HUMAN QSQSRSCTELHKHLTSAQCCLQ---------------------------- Q86YN3_HUMAN QSQSRSCTELHKHLTSAQCCLQ---------------------------- Q8TDE5_HUMAN QSQSRSCTELHKHLTSAQCCLQ---------------------------- Q86YN6_HUMAN QSQSRSCTELHKHLTSAQCCLQ---------------------------- Q86YN4_HUMAN QSQSRSCTELHKHLTSAQCCLQ---------------------------- Q8C1C0_MOUSE RAQSRPCTELHKHLTSVLPCPRVKA------------------------- Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP KPQRRPCSELLKYLTTNDDPPHTKPTENRNSSRDKCTSKKKAHTQSQTQH Q800H3_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE RSMRRPCTELLKYLTSSDEAFQTKAGEAKSTXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY LQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP PRGC1_HUMAN LQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP Q4W5M7_HUMAN LQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP Q5RBY0_PONPY LQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP PRGC1_PIG LQAKPTSLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP PRGC1_RAT AQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP PRGC1_BOVIN LQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP Q4L229_BOVIN LQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP PRGC1_MOUSE AQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP Q3UP72_MOUSE AQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP Q60GU0_CHICK LQAKPTSLSLPLTPESPNDPKGSPFENKTIEQTLSVELSGTAGLTPPTTP Q5QHW4_PIG 
LQAKPTSLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP Q4R5X5_MACFA XXAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLTPPTTP Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU --AKPTTLPLPLTPESPNDHKGSPFENKTIERTLSVEIAGTPGLTPPTTP Q3LIG1_HUMAN LQAKPTTLSLPLTPESPNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE AQAKPTTLSLPLTPESPNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSGTAGLTPPTTP Q4SAM8_TETNG --AKPTTLPLPLTPESPNDHKGSPFENKAIERTLSVEIAGTP--DRRQPV Q811R2_RAT ----PRCLMLAL---SQSDPLG----KKSFEESLTVELCGTAGLTPPTTP Q8TAL0_HUMAN XXXXPRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q8TDE4_HUMAN ----PRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q8VHJ7_MOUSE -----RCLMLAL---SQSDSLG----KKSFEESLTVELCGTAGLTPPTTP Q86YN5_HUMAN ----PRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q8N1N9_HUMAN ----PRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q86YN3_HUMAN ----PRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q8TDE5_HUMAN ----PRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q86YN6_HUMAN ----PRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q86YN4_HUMAN ----PRCLMLALSQSDPT------FGKKSFEQTLTVELCGTAGLTPPTTP Q8C1C0_MOUSE -----RCLMLAL---SQSDSLG----RKSFEESLTVELCGTAGLTPPTTP Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP 
LQAKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGTAGLXXXXXX Q800H3_BRARE XXXXXXXXXXXXXXXXXXXXXXXXYANKPFEQTLCMELCGTAGLTPPTTP Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXKPTTLSLPLTPESPNDPKGSPFENKTIERTLSVELSGXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY PHKANQDNPFRASPKLKSSCKTVVPPPSKKPRYSESSGTQGNNSTKKGPE PRGC1_HUMAN PHKANQDNPFRASPKLKSSCKTVVPPPSKKPRYSESSGTQGNNSTKKGPE Q4W5M7_HUMAN PHKANQDNPFRASPKLKSSCKTVVPPPSKKPRYSESSGTQGNNSTKKGPE Q5RBY0_PONPY PHKANQDNPFRASPKLKSSCKTVVPPPSKKPRYSESSGTQGNNSTKKGPE PRGC1_PIG PHKANQDNPFRASPKLKPPCKTVVPPPSKKTRYSESSGTHGNNSTKKGPE PRGC1_RAT PHKANQDNPFKASPKLKPSCKTVVPPPTKRARYSECSGTQGSHSTKKGPE PRGC1_BOVIN PHKANQDNPFRASPKLKPSCKTVVPPPSKKARYSESSCTQGSNSTKKGPE Q4L229_BOVIN PHKANQDNPFRASPKLKPSCKTVVPPPSKKARYSESSCTQGSNSTKKGPE PRGC1_MOUSE PHKANQDNPFKASPKLKPSCKTVVPPPTKRARYSECSGTQGSHSTKKGPE Q3UP72_MOUSE PHKANQDNPFKASPKLKPSCKTVVPPPTKRARYSECSGTQGSHSTKKGPE Q60GU0_CHICK PHKANQDNPFRTSPKPKSSCKTV-APPSKKPRYSESSGSQGNNPVKKGPE Q5QHW4_PIG PHKANQDNPFRASPKLKPPCKTVVPPPSKKTRYSESSGTHGNNSTKKGPE Q4R5X5_MACFA PHKANQDNPFRASPKPKSSCKTVVPPPSKKPRYSESSGTQGNNSTKKGPE Q58FA1_SHEEP XHKANQDNPFRASPKLKPSCKTVVPPPAKKARYSESSGTQGSNSTKKGPE Q3Y595_FUGRU PHKASQENPFKASLKTKLSSCSSSAFVCKRARLSESGPGAGGGPIRKGPE Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR 
PHKANQDNPFRASPKPKSSCKTVVPPPSKKARYSESSGTQGNNSTKKGPE Q4SAM8_TETNG WNRFCGASRKRESQTFPKNQERVTAPACKRARLSEPGPAAPAPGARKGPE Q811R2_RAT PYKPMEEDPFKQDTKHSPGQDTAPSLPSPETLQLTATPGASHKLPKRHPE Q8TAL0_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q8TDE4_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q8VHJ7_MOUSE PYKPMEEDPFKPDTKLSPGQDTAPSLPSPEALPLTATPGASHKLPKRHPE Q86YN5_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q8N1N9_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q86YN3_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q8TDE5_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q86YN6_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q86YN4_HUMAN PYKPTEEDPFKPDIKHSLGKEIALSLPSPEGLSLKATPGAAHKLPKKHPE Q8C1C0_MOUSE PYKPMEEDPFKPDTKLSPGQDTAPSLPSPEALPLTATPGASHKLPKRHPE Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE PHKPVEDELFK---------------PDAKADLSTKSSCLMRANMRKLPE Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY QSELYAQLSKSSVLTGGHEERKTKRPSLRLFGDHDYCQSINSKTEILINI PRGC1_HUMAN QSELYAQLSKSSVLTGGHEERKTKRPSLRLFGDHDYCQSINSKTEILINI Q4W5M7_HUMAN QSELYAQLSKSSVLTGGHEERKTKRPSLRLFGDHDYCQSINSKTEILINI Q5RBY0_PONPY QSELYAQLSKSSVLTGGHEERKTKRPSLRLFGDHDYCQSINSKTEILINI PRGC1_PIG QSELYAQLSKTSALGGGHEERKARRPSLRLFGDHDYCQSINSKAEILINI PRGC1_RAT QSELYAQLSKSSVLSRGHEERKTKRPSLRLFGDHDYCQSVNSKTDILINI PRGC1_BOVIN QSELYAQLSKTSVLTSGHEERKAKRPSLRLFGDHDYCQSINSKTEILVST Q4L229_BOVIN QSELYAQLSKTSVLTSGHEERKAKRPSLRLFGDHDYCQSINSKTEILVST PRGC1_MOUSE QSELYAQLSKSSGLSRGHEERKTKRPSLRLFGDHDYCQSLNSKTDILINI Q3UP72_MOUSE QSELYAQLSKSSGLSRGHEERKTKRPSLRLFGDHDYCQSLNSKTDILINI Q60GU0_CHICK QTELYAQLSKTTALSSGHEERKTKRPSLRLFGDHDYCQSVNSKSEIHIKI Q5QHW4_PIG QSELYAQLSKTSALGGGHEERKARRPSLRLFGDHDYCQSINSKAEILINI Q4R5X5_MACFA QSELYAQLSKSTVLTGGHEERKTKRHSLRLFGDHDYCQSINSKTEILIHI Q58FA1_SHEEP QSESYAQLSKTSVLTSGHEERKAKRPSLRLFGDHDYCQSINSKTEILVSI Q3Y595_FUGRU QTELYAQLSKASTALGGPEERRSKRAAPRGYSDHDYCQASAKKDSAALVP Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR QSELYAQLSKSSVPSSGHEERKAKRPSPRLFGDHDYCQSINSKTEILLHV Q4SAM8_TETNG QTELYAQLSKASTARGHRSSGSSQRAASRRHSDHDYCQASAGKDSVTMTT Q811R2_RAT RSELLSHLQHAT--TQPVSQAGQKRPFSCSFGDHDYCQVIRPEAAL---- Q8TAL0_HUMAN RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q8TDE4_HUMAN RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q8VHJ7_MOUSE RSELLSHLQHAT--TQPVSQAGQKRPFSCSFGDHDYCQVLRPEAAL---- Q86YN5_HUMAN 
RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q8N1N9_HUMAN RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q86YN3_HUMAN RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q8TDE5_HUMAN RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q86YN6_HUMAN RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q86YN4_HUMAN RSELLSHLRHATAQPA--SQAGQKRPFSCSFGDHDYCQVLRPEGVL---- Q8C1C0_MOUSE RSELLSHLQHAT--TQPVSQAGQKRPFSCSFGDHDYCQVLRPEAAL---- Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE QTELYAQLRR ----MGQTGDIDSKGGTQRAYGDHDYCLGESRKTTAVLNG Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY SQELQDSRQLENKDVSSDWQGQICSSTDSDQCYLRETLEASKQVSPCSTR PRGC1_HUMAN SQELQDSRQLENKDVSSDWQGQICSSTDSDQCYLRETLEASKQVSPCSTR Q4W5M7_HUMAN SQELQDSRQLENKDVSSDWQGQICSSTDSDQCYLRETLEASKQVSPCSTR Q5RBY0_PONPY SQELQDSRQLENKDVSSDWQGQICSSTDSDQCYLRETLEASKQVSPCSTR PRGC1_PIG SQELHDSRQLDSKDAASDWQRQMCSSTDSDQCYLTETSEASRQVSPGSAR PRGC1_RAT SQELQDSRQLDFKDASCDWQGHICSSTDSSQCYLRETLEASKQVSPCSTR PRGC1_BOVIN SQELHDSRQLENKDASSNGPGQIHSSTDSDPCYLRETAEVSRQVSPGSTR Q4L229_BOVIN SQELHDSRQLENKDASSNGPGQIHSSTDSDPCYLRETAEVSRQVSPGSTR PRGC1_MOUSE SQELQDSRQLDFKDASCDWQGHICSSTDSGQCYLRETLEASKQVSPCSTR Q3UP72_MOUSE SQELQDSRQLDFKDASCDWQGHICSSTDSGQCYLRETLEASKQVSPCSTR Q60GU0_CHICK SQELQDSRQLEFKDSSPGWQCQICSSLEQDQYFKKETLQTSKQGSQGNNR Q5QHW4_PIG SQELHDSRQLDSKDAASDWQRQMCSSTDSDQCYLTETSEASRQVSPGSAR Q4R5X5_MACFA SQELQDSRQLENKDVSSDWQGQICSSTDSDQCYLRETSEASKQVSPCSTR Q58FA1_SHEEP SQELHDSRQLENKDASSNGPGQIHSSTDSDLCYLRETAEVSRQVSPGSTR Q3Y595_FUGRU TVGRVEDGHVECKDSAMPPEQQDLSSVDGKAAPARPSGDDEDQLQTCASR Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR SQELPDSRQLDYKDASSAWQGQICSSTDSDQLYLRETLEASKQVSPCGTR Q4SAM8_TETNG AAMTAEDGHVECKDLAMPPSSSSSSPPSSSPPPKASSGPSAEQQNPSSAR Q811R2_RAT --QRKVLRSWEPIKVHLEDLAHQGATLPVETKTPRREADQNCDPTP-KDS Q8TAL0_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q8TDE4_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q8VHJ7_MOUSE --QRKVLRSWEPIGVHLEDLAQQGAPLPTETKAPRR--EANQNCDPTKDS Q86YN5_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q8N1N9_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q86YN3_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q8TDE5_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q86YN6_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q86YN4_HUMAN --QRKVLRSWEPSGVHLDWPQQGAPWAEA- QAPGREEDRSCDAGAPPKDS Q8C1C0_MOUSE 
--QRKVLRSWEPIGVHLEDLAQQGAPLPTETKAPRR--EANQNCDPTKDS Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE SQ------------------------------------------------ Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY KQLQDQEIRAELNKHFGHPSQAVFDDEADKTGELRDSDFSNEQFSKLPMF PRGC1_HUMAN KQLQDQEIRAELNKHFGHPSQAVFDDEADKTGELRDSDFSNEQFSKLPMF Q4W5M7_HUMAN KQLQDQEIRAELNKHFGHPSQAVFDDEADKTGELRDSDFSNEQFSKLPMF Q5RBY0_PONPY KQLQDQEIRAELDKHFGHPSQAVFDDEADKTSELRDSDFSNEQFSKLPMF PRGC1_PIG 
KQLQDQEIRAELNKHFGHPSQAVFDDEADKTSELRDSDFSNEQFSKLPMF PRGC1_RAT KQLQDQEIRAELNKHFGHPSQAVFDDKVDKTSELRDGNFSNEQFSKLPVF PRGC1_BOVIN KQLQDQEIRAELNKHFGHPSQAVFDDKADKTSELRDSDFSNEQFSKLPMF Q4L229_BOVIN KQLQDQEIRAELNKHFGHPSQAVFDDKADKTSELRDSDFSNEQFSKLPMF PRGC1_MOUSE KQLQDQEIRAELNKHFGHPCQAVFDDKSDKTSELRDGDFSNEQFSKLPVF Q3UP72_MOUSE KQLQDQEIRAELNKHFGHPCQAVFDDKSDKTSELRDGDFSNEQFSKLPVF Q60GU0_CHICK KQLQDQEIRAELNKHFGHPSQAVFDEEADKTGELRDSDYSNEQFSKLPMF Q5QHW4_PIG KQLQDQEIRAELNKHFGHPSQAVFDDEADKTSELRDSDFSNEQFSKLPMF Q4R5X5_MACFA KQLQDQEIRAELNKHFGHPSQAVFDDEADKTSELRDSDFSNEQFSKLPMF Q58FA1_SHEEP KQLQDQEIRAELNKHFGHPSQAVFDDKADRTSELRDSDFSNEQFSKLPMF Q3Y595_FUGRU KLLRDNQIRAELNKHFGHPLQALYSQXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR KQLQDQEIRAELNKHFGHPSQAVFDDEADKTSELRDSDFSNEQFSKLPMF Q4SAM8_TETNG KLLRDNEIRAELNKHFGPPLQALYSQGREPVGEGEDSYYPH----RLPSY Q811R2_RAT MQLRDHEIRASLTKHFG--------------------------------- Q8TAL0_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q8TDE4_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q8VHJ7_MOUSE MQLRDHEIRASLTKHFG--------------------------------- Q86YN5_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q8N1N9_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q86YN3_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q8TDE5_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q86YN6_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q86YN4_HUMAN TLLRDHEIRASLTKHFG--------------------------------- Q8C1C0_MOUSE MQLRDHEIRASLTKHFG--------------------------------- Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE -------------------------------------------------- Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY INSGLAMDGLFDDSEDESDKLSYPWDGTQSYSLFNVSPSCSSFNSPCRDS PRGC1_HUMAN INSGLAMDGLFDDSEDESDKLSYPWDGTQSYSLFNVSPSCSSFNSPCRDS Q4W5M7_HUMAN INSGLAMDGLFDDSEDESDKLSYPWDGTQSYSLFNVSPSCSSFNSPCRDS Q5RBY0_PONPY INSGLAMDGLFDDSEDESDKLSYPWDGTQSYSLFNVSPSCSSFNSPCRDS PRGC1_PIG INSGLAMDGLFDDSEDESDKLNSPWDGTQSYSLFDVSPSCSSFNSPCRDS PRGC1_RAT INSGLAMDGLFDDSEDENDKLSYPWDGTQSYSLFDVSPSCSSFNSPCRDS PRGC1_BOVIN INSGLAMDGLFDDSEDESDKLNSPWDGTQSYSLFDVSPSCSSFNSPCRDS Q4L229_BOVIN INSGLAMDGLFDDSEDESDKLNSPWDGTQSYSLFDVSPSCSSFNSPCRDS PRGC1_MOUSE INSGLAMDGLFDDSEDESDKLSYPWDGTQPYSLFDVSPSCSSFNSPCRDS Q3UP72_MOUSE INSGLAMDGLFDDSEDESDKLSYPWDGTQPYSLFDVSPSCSSFNSPCRDS Q60GU0_CHICK 
INSGLAMDGLFDDSEDESDKLCYPWDGTQSYSLFDVSPSCSSFNSPCRDS Q5QHW4_PIG INSGLAMDGLFDDSEDESDKLNSPWDGTQSYSLFDVSPSCSSFNSPCRDS Q4R5X5_MACFA INSGLAMDGLFDDSEDESDKLSYPWDGTQSYSLFNVSPSCSSFNSPCRDS Q58FA1_SHEEP INSGLAMDGLFDDSEDESDKLNSPWDGTQSYSLFDVSPSCSSFNSPCRDS Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR INSGLAMDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG LHPGLPFHEELELAQDREGRSLYPWEGTPLDLLFDRSPSCPSSCSPSRGS Q811R2_RAT -------------------------------------------------- Q8TAL0_HUMAN -------------------------------------------------- Q8TDE4_HUMAN -------------------------------------------------- Q8VHJ7_MOUSE -------------------------------------------------- Q86YN5_HUMAN -------------------------------------------------- Q8N1N9_HUMAN -------------------------------------------------- Q86YN3_HUMAN -------------------------------------------------- Q8TDE5_HUMAN -------------------------------------------------- Q86YN6_HUMAN -------------------------------------------------- Q86YN4_HUMAN -------------------------------------------------- Q8C1C0_MOUSE -------------------------------------------------- Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE -------------------------------------------------- Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY VSPPKSLFSQRPQRMRSRSRSFSRHRSCSRSPYSRSRSRSPGSRSSSRSC PRGC1_HUMAN VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4W5M7_HUMAN VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5RBY0_PONPY VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_PIG VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_RAT VSPPKSLF------------------------------------------ PRGC1_BOVIN VSPPKSLF------------------------------------------ Q4L229_BOVIN VSPPKSLF------------------------------------------ PRGC1_MOUSE VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3UP72_MOUSE VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q60GU0_CHICK VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW4_PIG VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4R5X5_MACFA VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP VSPPKSLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG I------------------------------------------------- Q811R2_RAT -------------------------------------------------- Q8TAL0_HUMAN -------------------------------------------------- Q8TDE4_HUMAN -------------------------------------------------- Q8VHJ7_MOUSE -------------------------------------------------- Q86YN5_HUMAN -------------------------------------------------- Q8N1N9_HUMAN -------------------------------------------------- Q86YN3_HUMAN -------------------------------------------------- Q8TDE5_HUMAN -------------------------------------------------- Q86YN6_HUMAN -------------------------------------------------- Q86YN4_HUMAN -------------------------------------------------- Q8C1C0_MOUSE -------------------------------------------------- Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE -------------------------------------------------- Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY YYYESSHYRHRTHRNSPLYVRSRSRSPYSRRPRYDSYEEYQHERLKREEY PRGC1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4W5M7_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5RBY0_PONPY XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_RAT -------------------------------------------------- PRGC1_BOVIN -------------------------------------EEYQHERLKREEY Q4L229_BOVIN -------------------------------------EEYQHERLKREEY PRGC1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3UP72_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q60GU0_CHICK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW4_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG -------------------------------------------------- Q811R2_RAT -------------------------------------------------- Q8TAL0_HUMAN -------------------------------------------------- Q8TDE4_HUMAN -------------------------------------------------- Q8VHJ7_MOUSE 
-------------------------------------------------- Q86YN5_HUMAN -------------------------------------------------- Q8N1N9_HUMAN -------------------------------------------------- Q86YN3_HUMAN -------------------------------------------------- Q8TDE5_HUMAN -------------------------------------------------- Q86YN6_HUMAN -------------------------------------------------- Q86YN4_HUMAN -------------------------------------------------- Q8C1C0_MOUSE -------------------------------------------------- Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE -------------------------------------------------- Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QUERY RREYEKRESERAKQRERQRQKAIEERRVIYVGKIRPDTTRTELRDRFEVF PRGC1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4W5M7_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5RBY0_PONPY XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_RAT ---------------------------VIYVGKIRPDTTRTELRDRFEVF PRGC1_BOVIN RREYEKRESERAKQRERQRQKAIEERRVIYVGKIRPDTTRTELRDRFEVF Q4L229_BOVIN RREYEKRESERAKQRERQRQKAIEERRVIYVGKIRPDTTRTELRDRFEVF PRGC1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3UP72_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q60GU0_CHICK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW4_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG ---------------------------VVYVGRLRSDCTRTELKRRFEVF Q811R2_RAT ---------------------------VVYIRNLSGDMSSRELKKRFEVF Q8TAL0_HUMAN ---------------------------VVYIQNLSSDMSSRELKRRFEVF Q8TDE4_HUMAN ---------------------------VVYIQNLSSDMSSRELKRRFEVF Q8VHJ7_MOUSE ---------------------------VVYIRNLSSDMSSRELKKRFEVF Q86YN5_HUMAN ---------------------------VVYIQNLSSDMSSRELKRRFEVF Q8N1N9_HUMAN ---------------------------VVYIQNLSSDMSSRELKRRFEVF Q86YN3_HUMAN ---------------------------VVYIQNLSSDMSSRELKRRFEVF Q8TDE5_HUMAN ---------------------------VVYIQNLSSDMSSRELKRRFEVF Q86YN6_HUMAN ---------------------------VVYIQNLSSDMSSRELKRRFEVF Q86YN4_HUMAN 
---------------------------VVYIQNLSSDMSSRELKRRFEVF Q8C1C0_MOUSE ---------------------------VVYIRNLSSDMSSRELKKRFEVF Q76N31_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q5VV66_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q5VV67_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q6P3U5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q9BZE5_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q91YW8_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q6P3W1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q6NZN1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q80TW6_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q5QHW3_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELRDRFEVF Q4RQH2_TETNG XXXXXXXXXXXXXXXXXXXXXXXXXXXVVYVGRIRGTMTQKELGERFSLF Q9BUJ3_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXVVFIGKIPGRMTRSELKQRFSVF Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE ---------------------------VIYIHNLPSSVTQTMLRKRFEAF Q5U183_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXIVYVGRIEQETTKEILRRKFLPY Q6NPA7_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXIVYVGRIEQETTKEILRRKFLPY Q8IPM1_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXIVYVGRIEQETTKEILRRKFLPY Q8IH74_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXIVYVGRIEQETTKEILRRKFLPY Q8IPM0_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXIVYVGRIEQETTKEILRRKFLPY Q9VN24_DROME XXXXXXXXXXXXXXXXXXXXXXXXXXXIVYVGRIEQETTKEILRRKFLPY Q7QA91_ANOGA XXXXXXXXXXXXXXXXXXXXXXXXXXXIVYVGRLESTTRKEDLQQKFQPY Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXYIKHLPDDITDDRLREIFEPF QUERY GEIEECTVNLRDDGDSYGFITYRYTCDAFAALENGYTLRRSNETDFELYF PRGC1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4W5M7_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5RBY0_PONPY 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_RAT GEIEECTVNLRDDGDSYGFITYRYTCDAFAALENGYTLRRSNETDFELYF PRGC1_BOVIN GEIEECTVNLRDDGDSYGFITYRYTCDAFAALENGYTLRRSNETDFELYF Q4L229_BOVIN GEIEECTVNLRDDGDSYGFITYRYTCDAFAALENGYTLRRSNETDFELYF PRGC1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3UP72_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q60GU0_CHICK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW4_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG GEIEECAVNLRDDGDNFGFITYRYTCDAFAALENGHTLRRSDEPQFELCF Q811R2_RAT GEIVECQVLRRKRGQKHGFITFRCSEHAALSVRNGATLRKRNEPSFHLSY Q8TAL0_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q8TDE4_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q8VHJ7_MOUSE GEIVECQVLTRKRGQKHGFITFRCSEHAALSVRNGATLRKRNEPSFHLSY Q86YN5_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q8N1N9_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q86YN3_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q8TDE5_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q86YN6_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q86YN4_HUMAN GEIEECEVLTRNRGEKYGFITYRCSEHAALSLTKGAALRKRNEPSFQLSY Q8C1C0_MOUSE GEIVECQVLTRKRGQKHGFITFRCSEHAALSVRNGATLRKRNEPSFHLSY Q76N31_HUMAN GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q5VV66_HUMAN GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q5VV67_HUMAN GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q6P3U5_HUMAN GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q9BZE5_HUMAN 
GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q91YW8_MOUSE GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q6P3W1_HUMAN GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q6NZN1_MOUSE GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q80TW6_MOUSE GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q5QHW3_PIG GEIEECTVNLRDDGDSYGFITYRYTCDAFAALENGYTLRRSNETDFELYF Q4RQH2_TETNG GEIEDCTLHFRNHGDNYGFVTYYDTKDAFAAIENGSKLRKPDELPFDLCF Q9BUJ3_HUMAN GEIEECTIHFRVQGDNYGFVTYRYAEEAFAAIESGHKLRQADEQPFDLCF Q6VAC1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE GHPEDCKVVIKKE-ERCGVITLRHTQ-------NGQTSRHRWDLLGPSGG Q5U183_DROME GSIKQITIHYKENGMKYGFVTYERAQDAFTAIDTSH--RDSQISMYDISF Q6NPA7_DROME GSIKQITIHYKENGMKYGFVTYERAQDAFTAIDTSH--RDSQISMYDISF Q8IPM1_DROME GSIKQITIHYKENGMKYGFVTYERAQDAFTAIDTSH--RDSQISMYDISF Q8IH74_DROME GSIKQITIHYKENGMKYGFVTYERAQDAFTAIDTSH--RDSQISMYDISF Q8IPM0_DROME GSIKQITIHYKENGMKYGFVTYERAQDAFTAIDTSH--RDSQISMYDISF Q9VN24_DROME GSIKQITIHYKENGMKYGFVTYERAQDAFTAIDTSH--RDSQISMYDISF Q7QA91_ANOGA GKIVKITLHMKANGSRYGFVTFEKPQHAYDAIDARGT--DPNLRNYDVSF Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA GKITSCAIMKEPNGKGFAFVCFEDKQHASAALRNGHPLEHSAKPLYXXXX QUERY CGRKQFFKSNYADLDSNSDDFDPASTKSKYDSLDFDSLLKEAQRSLRR PRGC1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4W5M7_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5RBY0_PONPY XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PRGC1_RAT CGRKQFFKSNYADLDSNSDDFDPASTKSKYDSLDFDSLLKEAQRSLRR PRGC1_BOVIN CGRKQFFKSNYADLDSNSDDFDPACIKSKYDSLDFDSLLKEAQRSLRR Q4L229_BOVIN CGRKQFFKSNYADLDSNSDDFDPACIKSKYDSLDFDSLLKEAQRSLRR PRGC1_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3UP72_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Q60GU0_CHICK XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW4_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4R5X5_MACFA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q58FA1_SHEEP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3Y595_FUGRU XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG1_HUMAN XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG2_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6R2I4_SPETR XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4SAM8_TETNG GGQKQFCKSHYADLDSHSDDFDPASTKSKYGSLDFDSLLREAQRSLRX Q811R2_RAT GGLRHFRWPRYTDYDPTSEESLPSSGKSKYEAMDFDSLLKEAQQSLHX Q8TAL0_HUMAN GGLRHLGWPRYTDYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8TDE4_HUMAN GGLRHFCWPRYTDYDSNSEEALPASGKSKYEAMDFDSLLKEAQQSLHX Q8VHJ7_MOUSE GGLRHFRWPRYTDYDPTSEESLPSSGKSKYEAMDFDSLLKEAQQSLHX Q86YN5_HUMAN GGLRHFCWPRYTDYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8N1N9_HUMAN GGLRHFCWPRYTDYDSNSEEALPASGKSKYEAMDFDSLLKEAQQSLHX Q86YN3_HUMAN GGLRHFCWPRYTDYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8TDE5_HUMAN GGLRHFCWPRYTDYDSNSEEALPASGKSKYEAMDFDSLLKEAQQSLHX Q86YN6_HUMAN GGLRHFCWPRYTDYDSNSEEALPASGKSKYEAMDFDSLLKEAQQSLHX Q86YN4_HUMAN GGLRHFCWPRYTDYDSNSEEALPASGKSKYEAMDFDSLLKEAQQSLHX Q8C1C0_MOUSE GGLRHFRWPRYTDYDPTSEESLPSSGKSKYEAMDFDSLLKEAQQSLHX Q76N31_HUMAN GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q5VV66_HUMAN GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q5VV67_HUMAN GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q6P3U5_HUMAN GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q9BZE5_HUMAN GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q91YW8_MOUSE GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q6P3W1_HUMAN GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q6NZN1_MOUSE GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q80TW6_MOUSE GGRRQFCKRSYSDLDSNREDFDPAPVKSKFDSLDFDTLLKQAQKNLRR Q5QHW3_PIG CGRKQFFKSNYADLDSNSDDFDPASTKSKYDSLDFDSLLKEAQRSLRX Q4RQH2_TETNG GGRRQFCQTSYADLDS-STEYEPFPAKGKFHALDFDTLLKQAQQNLKR Q9BUJ3_HUMAN GGRRQFCKRSYSDLDSNREDFDPAPVRAXXXXXXXXXXXXXXXXXXXX Q6VAC1_SHEEP 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H3_BRARE NGSRRFGRKRYIDLDEAG----PGPVKSKYDALDFDALLKEAQRSLHR Q5U183_DROME GGRRAFCRSSYADLDNAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q6NPA7_DROME GGRRAFCRSSYADLDNAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM1_DROME GGRRAFCRSSYADLDNAGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IH74_DROME GGRRAFCRSSYADLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q8IPM0_DROME GGRRAFCRSSYADLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q9VN24_DROME GGRRAFCRSSYADLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q7QA91_ANOGA GGRRAFCRTQYADLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q5QHW5_PIG XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q52MY8_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q800H2_BRARE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q3LIG0_MOUSE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX Q4QA88_LEIMA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX targetp.out100644000766000024 361514006427353 16600 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data### targetp v1.1 prediction results ################################## Number of query sequences: 500 Cleavage site predictions included. Using NON-PLANT networks. Name Len mTP SP other Loc RC TPlen ---------------------------------------------------------------------- BC1G_00001.1 173 0.393 0.024 0.683 _ 4 - BC1G_00002.1 120 0.619 0.040 0.458 M 5 97 BC1G_00003.1 323 0.094 0.895 0.027 S 1 21 BC1G_00004.1 361 0.402 0.072 0.479 _ 5 - BC1G_00005.1 244 0.526 0.035 0.548 _ 5 - BC1G_00006.1 35 0.234 0.036 0.819 _ 3 - BC1G_00007.1 73 0.292 0.127 0.431 _ 5 - BC1G_00008.1 349 0.088 0.958 0.078 S 1 82 BC1G_00009.1 514 0.183 0.102 0.735 _ 3 - BC1G_00010.1 440 0.114 0.088 0.865 _ 2 - ---------------------------------------------------------------------- cutoff 0.000 0.000 0.000 ### targetp v1.1 prediction results ################################## Number of query sequences: 500 Cleavage site predictions included. Using NON-PLANT networks. 
Name Len mTP SP other Loc RC TPlen ---------------------------------------------------------------------- BC1G_04501.1 215 0.185 0.038 0.843 _ 2 - BC1G_04502.1 395 0.118 0.164 0.825 _ 2 - BC1G_04503.1 199 0.515 0.062 0.436 M 5 20 BC1G_04504.1 220 0.440 0.030 0.707 _ 4 - BC1G_04505.1 67 0.382 0.049 0.610 _ 4 - ---------------------------------------------------------------------- cutoff 0.000 0.000 0.000multifa.seq100644000766000024 127714006427353 16556 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>HSEARLOBE 321bp gctcggactctatctagcagaaacctcgttcagctagtcttgcttcatggaggtttgatc tagactgcaaacgtcggtgctaaaagaccatacttccgtatgtgcctatcgggagcagtc gctgagaagtgcggaatgatccttcaatgaccgccgttaaagcctgggagtccgcgccac aatcattccatatacagcaacacgcgctacgcggacctctcggtgggtgacgattctatt gaggcgttgaagcgagaaagatattccgattcttttcgagtctatagttaaatcggactg catcatccattttagggcata >HSMETOO 134bp ggccgggatggccggacctgttctgaacatcttatatccacccgaacaagttataaacaa tttaaatctgggcggccatctataagcgtgtcttcagtatgagagtcttcggatatcacg acccattaggaaag >empty1 an empty seq well-formatted >empty2 an empty seq without subsequent empty line >MMWHISK 62bp gctgcctctatcaagggtgaaaattgcctgcccggctgggtagacactcgcaccactccc cg >empty3 empty seq at the end of the file testaln.arp100644000766000024 6235514006427353 16605 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data[Profile] Title="mtDNA sequences in the Senegalese Mandenka (hypervariable region 1)" #Data from : #Graven, L., Passarino, G., Semino, O., Boursot, P., Santachiara-Benerecetti, A. S., #Langaney, A., and Excoffier, L., 1995, Evolutionary correlation between #control region sequence and RFLP diversity pattern in the mitochondrial genome #of a Senegalese sample, Mol. Biol. Evol. 12(2):334-345 NbSamples=1 GenotypicData=0 DataType=DNA LocusSeparator=NONE MissingData='?' [Data] [[Samples]] SampleName="Mandenka" #Reference: Graven et al. 1995 Mol. Biol. Evol. 12:334-345. 
SampleSize=119 SampleData= { 01 6 ATTAGCACCCAAAGCTAAGATTCTAATTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACTATCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCATCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCTTTCTCGTCCCCATGGATGACCCCCC 02 2 ???????????????????????????????????????????TCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCCATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCATCCTTAACAGTACATGGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 03 10 ??????????????????????????TTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATCTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCATACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 04 1 ????????????????????????????????????????????????????????????????????????????????????????ACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCCCCTCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCATCCTTAACAGTACATGGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTCCTCGTCCCCATGGATGACCCCC? 
05 1 ???????????????????????????????????TTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAGCAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 06 4 ??????????????????????????TTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCCATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGTCA-CCCCTCACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCACGGATGACCCCC? 07 1 ????????????????????TTCTAATTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCA??????????????????????????????ATGCTTACAAGCAAGTACAGCAATCGACCCTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 08 2 ??????????????????????????????????????TCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGTCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 
09 1 ?????????????????????TCTAATTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTCACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCC?? 10 7 ??????????????????????????????AACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 11 2 ????????????????????????????????CTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 12 7 ??????????????????????????????????????????TTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCTCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAGGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 13 1 
????????????????????????????????????????TGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATCTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCATACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCACGGATGACCCCCC 14 1 ??????????????????????????????????????????TTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 15 2 ???????????????????????????????????????????????????????????????????GTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 
16 3 ??????????????????????????TTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 17 1 ????????????????????TTCTAATTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATCTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCATACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGA?????? 18 3 ????????????????????????????????????????TGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTAACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGAT???????? 19 1 ??????????????????????????????????????TCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACAGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGTCA-CCTCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCACGGATGACCCCC? 
20 1 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACAGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCCCCTCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGTCA-CCTCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCACGGATGACCCCCC 21 2 ???????????????????????????????ACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACAGTACCATAAATACTTGACTACCTGTAGTACATAAAAACTCAAC--CCACATCAAAA-----CCCTGCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTGTCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGCACATAAAGTCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 22 1 ???????????????????????????????ACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCCTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 23 3 ?????????????????????????ATTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAGCCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCGATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTCACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 24 1 
????????????????????????????????????????????????????????????????????TACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGCACAGCAATCAACCCTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCC? 25 3 ??????????????????????????????AACTATTCTCTGTTCTTTCATGGGGAAGCGGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 26 1 ??????????????????????????????AACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCAACATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 27 2 ?????????????????????????????AAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTATCCATCCTTAACAGTACATGGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 28 3 
???????????????????????????????????TTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATCTCGTACATTACTGCCAGCCACCATGAATATTGTACAGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAA???????????CCCCCATGCTTACAAGCAAGCACAGCGATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTGCCCATCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTTGTCCCCATGGATGAC????? 29 2 ????????????????????????????????CTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAA????????????????????????????????????????????????????????????????????????CCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 30 1 ??????????????????????????????AACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAC--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCRCCCTTAACAGTACATAGCACATAAAACCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 
31 1 ??????????????????????????????????????TCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCA???????????????CCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCATCCTTAACAGTACATGGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 32 1 ??????????????????????????????????????????TTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCAACATGAATATTGTACAGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAACAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTGCAGTCAAATCCTTTCTCGCCCCCATGGATGACCCCCC 33 2 ?????????????????????????????????????????????TTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCTCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCAT???????????? 
34 1 ?????????????????????????????????????????????????????????GCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACTACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCCC 35 2 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCAACATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGAATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 36 1 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCCATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAACCAACCTTCAACTATCACACATCAACTGCAACTCCAAGGCCA-CCCCTTACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 37 2 ??????????????????????????????????????TCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAATCCAAT--CCACATCAAAA-----CCCCCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 38 1 
?????????????????????????????????????????????TTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGCACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCCC 39 1 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCCC 40 3 ????????????????????????????????????TCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCAGCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 
41 1 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATCTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAACAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCATCCTTAACAGTACATGGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 42 1 ????????????????????????????????????????????????????????????????????????????????????????????ATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCATACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 43 1 ??????????????????????????????????????????????????????????????????????CCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCTAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGCACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCC?????????????? 
44 2 ??????????????????????????????????????????????????????????????????GGTGCCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCAACATGAATATTGTACAGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAACAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAGACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTTCAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCCC 45 2 ???????????????????????????????????TTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCC??? 46 1 ?????????????????????????????????????CTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGCACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 47 1 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCAGCATGAATATTGCACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGAATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCC?????????????? 
48 2 ????????????????????????????????????????TGTTCTTTCATGGGGAAGCAGATTTGGGTGCCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 49 1 ???????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????ATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATTAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCTACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCTTTCTCGTCCCCATGGATGACCCCCC 50 1 ????????????????????TTCTAATTTAAACTATTCTCTGTTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAACAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGG?????????? 
51 1 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAACAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCATCCTTAACAGTACATGGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCCC 52 3 ???????????????????????????????????????????????????????AAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGCACGGTACCATAAATACTTAACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCC?? 53 1 ??????????????????????????????????????????TTCTTTCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGAATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 54 2 ??????????????????????????????????????????????????????????????????GGTGCCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 
55 1 ???????????????????????????????????????????????????????????????????????????????????????????????AACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCCC 56 2 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCTCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCCC 57 1 ???????????????????????????????????????????????????GGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATCGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAG?ATATCAACAAACCTACCCACCCTTAACAGYACATAGYACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGYCCCCATGGATGACCCCCC 58 1 ???????????????????????????????????????????????TCATGGGGAAGCAGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATACCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGTCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGATGACCCCC? 
59 2 ??????????????????????????????????????????????????????????????????GGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCCCCTCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTCACCCACTAGGATATCAACAAACCTACCCACCCTTAACAGTACATAGTACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGTCCCCATGGAT???????? 60 1 ???????????????????????????????????????????????????????????AGATTTGGGTACCACCCAAGTATTGACTCACCCATCAACAACCGCTATGTATTTCGTACATTACTGCCAGCCACCATGAATATTGTACGGTACCATAAATACTTGACCACCTGTAGTACATAAAAACCCAAT--CCACATCAAAA-----CCTTCCCCCCATGCTTACAAGCAAGTACAGCAATCAACCTTCAACTATCACACATCAACTGCAACTCCAAAGCCA-CCCCTTACCCATTAGGATATCAACAAACCTACCCGCCCTTAACAGTACATAGCACATAAAGCCATTTACCGTACATAGCACATTACAGTCAAATCCCTTCTCGCCCCCATGGATGACCCCC? } bug2453.maf100644000766000024 216014006427353 16153 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data##maf version=1 scoring=tba.v8 # tba.v8 (((human chimp) baboon) (mouse rat)) # multiz.v7 # maf_project.v5 _tba_right.maf3 mouse _tba_C # single_cov2.v4 single_cov2 /dev/stdin a score=23262.0 s hg16.chr7 27578828 38 + 158545518 AAA-GGGAATGTTAACCAAATGA---ATTGTCTCTTACGGTG s panTro1.chr6 28741140 38 + 161576975 AAA-GGGAATGTTAACCAAATGA---ATTGTCTCTTACGGTG s baboon 116834 38 + 4622798 AAA-GGGAATGTTAACCAAATGA---GTTGTCTCTTATGGTG s mm4.chr6 53215344 38 + 151104725 -AATGGGAATGTTAAGCAAACGA---ATTGTCTCTCAGTGTG s rn3.chr4 81344243 40 + 187371129 -AA-GGGGATGCTAAGCCAATGAGTTGTTGTCTCTCAATGTG a score=5062.0 s hg16.chr7 27699739 6 + 158545518 TAAAGA s panTro1.chr6 28862317 6 + 161576975 TAAAGA s baboon 241163 6 + 4622798 TAAAGA s mm4.chr6 53303881 6 + 151104725 TAAAGA s rn3.chr4 81444246 6 + 187371129 taagga a score=6636.0 s hg16.chr7 27707221 13 + 158545518 gcagctgaaaaca s panTro1.chr6 28869787 13 + 161576975 gcagctgaaaaca s baboon 249182 13 + 4622798 gcagctgaaaaca s mm4.chr6 53310102 13 + 151104725 ACAGCTGAAAATAAAC12660.fa100644000766000024 106114006427353 15665 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>AAC12660 TAK1 binding protein [Homo sapiens]. MAAQRRSLLQSEQQPSWTDDLPLCHLSGVGSASNRSYSADGKGTESHPPEDSWLKFRSEN NCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLES IDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLLNNKLYV ANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQVGIICGQEST RRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLMSEGLYKALEAAH GPGQANQEIAAMIDTEFAKQTSLDAVAQAVVDRVKRIHSDTFASGGERARFCPRHEDMTL LVRNFGYPLGEMSQPTPSPAPAAGGRVYPVSVPYSSAQSTSKTSVTLSLVMPSQGQMVNG AHSASTLDEATPTLTNQSPTLTLQSTNTHTQSSSSSSDGGLFRSRPAHSLPPGEDGRVEP YVDFAEFYRLWSVDHGEQSVVTAP crab.dat.cn100644000766000024 167714006427353 16407 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data 13 sequences 1 A-salina 2 C-vittat 3 C-sp. 4 L-aequit 5 P-camtsc 6 E-tenuim 7 L-splend 8 P-bernha 9 P-acadia 10 P-p(NE) 11 P-p(GU) 12 P-l(NE) 13 P-l(GU) 14 and 2 0.097855 14 and 3 0.097855 15 and 20 0.063651 15 and 14 0.013271 15 and 1 0.155362 16 and 10 0.008043 16 and 11 0.008043 17 and 12 0.001340 17 and 13 0.001340 18 and 16 0.010389 18 and 17 0.017091 19 and 4 0.006702 19 and 5 0.006702 20 and 23 0.017147 20 and 18 0.029044 21 and 8 0.002681 21 and 9 0.002681 22 and 6 0.012064 22 and 21 0.009383 23 and 24 0.005306 23 and 19 0.023626 24 and 22 0.012958 24 and 7 0.025022 file:crab.dat constant rate for NJ (negative branches allowed) p-distance was used. 
Number of nucleotide sites compared 373 (nsite=421) outgroup: 1 A-salina fgenesh.out100644000766000024 707014006427353 16550 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data FGENESH 2.3 Prediction of potential genes in C_elegans genomic DNA Time : Wed Feb 27 13:43:15 2008 Seq name: gi|1914348|emb|Z81551.1| Caenorhabditis elegans Cosmid F56A12, complete sequence Length of sequence: 26028 Number of predicted genes 4 in +chain 1 in -chain 3 Number of predicted exons 19 in +chain 4 in -chain 15 Positions of predicted genes and exons: G Str Feature Start End Score ORF Len 1 - PolA 29 -0.94 1 - 1 CDSl 828 - 903 2.72 828 - 902 75 1 - 2 CDSf 996 - 1201 9.86 998 - 1201 204 1 - TSS 1869 -4.49 2 - PolA 9879 1.25 2 - 1 CDSl 10132 - 10275 7.05 10132 - 10275 144 2 - 2 CDSi 11180 - 11291 3.76 11180 - 11290 111 2 - 3 CDSi 11786 - 12035 1.01 11788 - 12033 246 2 - 4 CDSi 12238 - 12404 9.79 12239 - 12403 165 2 - 5 CDSf 12884 - 12900 0.72 12886 - 12900 15 2 - TSS 13142 -4.89 3 + TSS 14096 -7.69 3 + 1 CDSf 14778 - 15104 27.84 14778 - 15104 327 3 + 2 CDSi 15673 - 15802 20.10 15673 - 15801 129 3 + 3 CDSi 16175 - 16524 28.83 16177 - 16524 348 3 + 4 CDSl 16988 - 17212 15.92 16988 - 17212 225 3 + PolA 17223 1.25 4 - PolA 17420 1.25 4 - 1 CDSl 17552 - 17630 0.75 17552 - 17629 78 4 - 2 CDSi 17690 - 17848 22.67 17692 - 17847 156 4 - 3 CDSi 18221 - 18453 26.88 18223 - 18453 231 4 - 4 CDSi 18501 - 18637 19.50 18501 - 18635 135 4 - 5 CDSi 18850 - 19024 8.97 18851 - 19024 174 4 - 6 CDSi 19073 - 19156 8.63 19073 - 19156 84 4 - 7 CDSi 19206 - 19472 23.22 19206 - 19472 267 4 - 8 CDSf 20381 - 20479 21.19 20381 - 20479 99 4 - TSS 21231 -5.29 Predicted protein(s): >FGENESH: 1 2 exon (s) 828 - 1201 93 aa, chain - MSCRHRSTSTTGCKHQNILYAREKMYVWECQHGNGKKGEKLIVGMGVLELLGVSMRILYA REKVSAWEICFGFYKFSFQKAENHETCRKIIEN >FGENESH: 2 5 exon (s) 10132 - 12900 229 aa, chain - MNLLQSTSIDYHVAHNVWLQMENNLYFLRIHLLSSPIWGVGAAVNTFLIETVDQITIAGV KRGLSLLNYFLFTYGKPLPIEHIVKGPLSTKIPSMCYWQLCSKIFRKMKTPKSGKTEFLC 
VSIFVAQAFFSFKDPEARDKAETLDNDHVAGVVKIVAPITYIYTSPSILLAQTSCLERRT PVKYIDLFLCFFFLFFFPDPLVLIGTGWHAWAPLTIICSMFSDEKSTNS >FGENESH: 3 4 exon (s) 14778 - 17212 343 aa, chain + MSSIPPYFMTDHPPIDTSSYFDCYQQHQLPLQYTFTSSSNSNTSNSSTSPSHISDQFSSS GGPPYELSSHILTPSSVIPTPSPSVASASISSPTIPAFGCTMSEYSMEQMEAISTSLFQA RDGDRLVAFFKQLESLYGPNAVDHLRSEAIIVAYTYALYHSNEFETLFHLLSNRHFQQRH YNDLQDIWHHARYKESQLKRGKELNPVEKYRLRRKFPAPKTIWDGEEIVYSFKDSSRKFL KQFFRNVSEYPTQEQKREISRATGLKIVQISNWFKNRRQRDKSNNSAKCSPPSSSSSTNG GSDFLPIITPQSFNLAAAPFNMNMIYGTLRDSQSDNDQFTFNP >FGENESH: 4 8 exon (s) 17552 - 20479 410 aa, chain - MNETDDNWTVFTKLYSCQYRASDDSPQFTLWLDGPVTISAAIFSAIGTVYAIGFLRNGHL NRRMSAALYTLCLMDFMLTMTTVLFLSIEPLSILLFRTNIFYQHQDMILILYGIRNSFAM SSPMLVCYITYIRYRVVNNPLKFASHYGRSRKSISASKMSTAAQPSSTESAKFTISFPAE MFYEFHTRSNAGKRGANFRRFFRPFLVPILLVILCFVIHSTSYFEFNLITCFDEVHQTES KMLQMTQLRGDSYWYFQFKVALTMTTETLGPMLFISTLSLFTEYKMHQNVKERRRLFESQ KRSRNTLVTEELKDKASKALAVFIVVKFLILRSLPTLIDLYEVLIENSINFGPFMTKVTR ISDFLVILNSATNTLAYFGKVRFEKWLERRIRCRIVKKEAKEILSTSLTG lucy.stderr100644000766000024 10014006427353 16544 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/datalucy: ==> dropping TaLr1011A07R TaLr1010B10R has PolyA (right). testaln.msf100644000766000024 1612414006427353 16601 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data NoName MSF: 16 Type: P Fri May 25 11:28:34 2001 Check: 00 .. 
Name: 1433_LYCES/9-246 Len: 242 Check: 2603 Weight: 1.00 Name: 1434_LYCES/6-243 Len: 242 Check: 2257 Weight: 1.00 Name: 143R_ARATH/7-245 Len: 242 Check: 3905 Weight: 1.00 Name: 143B_VICFA/7-242 Len: 242 Check: 2825 Weight: 1.00 Name: 143E_HUMAN/4-239 Len: 242 Check: 2034 Weight: 1.00 Name: BMH1_YEAST/4-240 Len: 242 Check: 7237 Weight: 1.00 Name: RA24_SCHPO/6-241 Len: 242 Check: 3762 Weight: 1.00 Name: RA25_SCHPO/5-240 Len: 242 Check: 2995 Weight: 1.00 Name: 1431_ENTHI/4-239 Len: 242 Check: 2213 Weight: 1.00 Name: 1432_ENTHI/4-238 Len: 242 Check: 8344 Weight: 1.00 Name: 1433_CAEEL/5-237 Len: 242 Check: 1241 Weight: 1.00 Name: 143Z_DROME/6-239 Len: 242 Check: 8864 Weight: 1.00 Name: 1433_XENLA/1-227 Len: 242 Check: 8793 Weight: 1.00 Name: 143T_HUMAN/3-236 Len: 242 Check: 6034 Weight: 1.00 Name: 143F_MOUSE/3-240 Len: 242 Check: 5185 Weight: 1.00 Name: 143S_HUMAN/3-238 Len: 242 Check: 1679 Weight: 1.00 // 1433_LYCES/9-246 REENVYMAKL ADRAESDEEM VEFMEKVSNS LGS.EELTVE ERNLLSVAYK 1434_LYCES/6-243 REENVYLAKL AEQAERYEEM IEFMEKVAKT ADV.EELTVE ERNLLSVAYK 143R_ARATH/7-245 RDQYVYMAKL AEQAERYEEM VQFMEQLVTG ATPAEELTVE ERNLLSVAYK 143B_VICFA/7-242 RENFVYIAKL AEQAERYEEM VDSMKNVANL DV...ELTIE ERNLLSVGYK 143E_HUMAN/4-239 REDLVYQAKL AEQAERYDEM VESMKKVAGM DV...ELTVE ERNLLSVAYK BMH1_YEAST/4-240 REDSVYLAKL AEQAERYEEM VENMKTVASS GQ...ELSVE ERNLLSVAYK RA24_SCHPO/6-241 REDAVYLAKL AEQAERYEGM VENMKSVAST DQ...ELTVE ERNLLSVAYK RA25_SCHPO/5-240 RENSVYLAKL AEQAERYEEM VENMKKVACS ND...KLSVE ERNLLSVAYK 1431_ENTHI/4-239 REDCVYTAKL AEQSERYDEM VQCMKQVAEM EA...ELSIE ERNLLSVAYK 1432_ENTHI/4-238 REDLVYLSKL AEQSERYEEM VQYMKQVAEM GT...ELSVE ERNLISVAYK 1433_CAEEL/5-237 VEELVQRAKL AEQAERYDDM AAAMKKVTEQ GQ...ELSNE ERNLLSVAYK 143Z_DROME/6-239 KEELVQKAKL AEQSERYDDM AQAMKSVTET GV...ELSNE ERNLLSVAYK 1433_XENLA/1-227 .......AKL SEQAERYDDM AASMKAVTEL GA...ELSNE ERNLLSVAYK 143T_HUMAN/3-236 KTELIQKAKL AEQAERYDDM ATCMKAVTEQ GA...ELSNE ERNLLSVAYK 143F_MOUSE/3-240 REQLLQRARL AEQAERYDDM ASAMKAVTEL NE...PLSNE DRNLLSVAYK 
143S_HUMAN/3-238 RASLIQKAKL AEQAERYEDM AAFMKGAVEK GE...ELSCE ERNLLSVAYK 1433_LYCES/9-246 NVIGARRASW RIISSIEQKE ESRG.NEEHV NSIREYRSKI ENELSKICDG 1434_LYCES/6-243 NVIGARRASW RIISSIEQKE ESRG.NEDHV NTIKEYRSKI EADLSKICDG 143R_ARATH/7-245 NVIGSLRAAW RIVSSIEQKE ESRK.NDEHV SLVKDYRSKV ESELSSVCSG 143B_VICFA/7-242 NVIGARRASW RILSSIEQKE ESKG.NDVNA KRIKEYRHKV ETELSNICID 143E_HUMAN/4-239 NVIGARRASW RIISSIEQKE ENKG.GEDKL KMIREYRQMV ETELKLICCD BMH1_YEAST/4-240 NVIGARRASW RIVSSIEQKE ESKEKSEHQV ELICSYRSKI ETELTKISDD RA24_SCHPO/6-241 NVIGARRASW RIVSSIEQKE ESKG.NTAQV ELIKEYRQKI EQELDTICQD RA25_SCHPO/5-240 NIIGARRASW RIISSIEQKE ESRG.NTRQA ALIKEYRKKI EDELSDICHD 1431_ENTHI/4-239 NVIGAKRASW RIISSLEQKE QAKG.NDKHV EIIKGYRAKI EKELSTCCDD 1432_ENTHI/4-238 NVVGSRRASW RIISSLEQKE QAKG.NTQRV ELIKTYRAKI EQELSQKCDD 1433_CAEEL/5-237 NVVGARRSSW RVISSIEQKT EG...SEKKQ QLAKEYRVKV EQELNDICQD 143Z_DROME/6-239 NVVGARRSSW RVISSIEQKT EA...SARKQ QLAREYRERV EKELREICYE 1433_XENLA/1-227 NVVGARRSSW RVISSIEQKT EG...NDKRQ QMAREYREKV ETELQDICKD 143T_HUMAN/3-236 NVVGGRRSAW RVISSIEQKT DT...SDKKL QLIKDYREKV ESELRSICTT 143F_MOUSE/3-240 NVVGARRSSW RVISSIEQKT MADG.NEKKL EKVKAYREKI EKELETVCND 143S_HUMAN/3-238 NVVGGQRAAW RVLSSIEQKS NEEG.SEEKG PEVREYREKV ETELQGVCDT 1433_LYCES/9-246 ILKLLDSKLI PSA..TSGDS KVFYLKMKGD YHRYLAEFKT GAERKEAAES 1434_LYCES/6-243 ILSLLESNLI PSA..STAES KVFHLKMKGD YHRYLAEFKT GTERKEAAEN 143R_ARATH/7-245 ILKLLDSHLI PSA..GASES KVFYLKMKGD YHRYMAEFKS GDERKTAAED 143B_VICFA/7-242 VMRVIDEHLI PSA..AAGES TVFYYKMKGD YYRYLAEFKT GNEKKEAGDQ 143E_HUMAN/4-239 ILDVLDKHLI PAA..NTGES KVFYYKMKGD YHRYLAEFAT GNDRKEAAEN BMH1_YEAST/4-240 ILSVLDSHLI PSA..TTGES KVFYYKMKGD YHRYLAEFSS GDAREKATNA RA24_SCHPO/6-241 ILTVLEKHLI PNA..ASAES KVFYYKMKGD YYRYLAEFAV GEKRQHSADQ RA25_SCHPO/5-240 VLSVLEKHLI PAA..TTGES KVFYYKMKGD YYRYLAEFTV GEVCKEAADS 1431_ENTHI/4-239 VLKVIQENLL PKA..STSES KVFFKKMEGD YYRYFAEFTV DEKRKEVADK 1432_ENTHI/4-238 VLKIITEFLL KNS..TSIES KVFFKKMEGD YYRYYAEFTV DEKRKEVADK 1433_CAEEL/5-237 VLKLLDEFLI VKA..GAAES KAFYLKMKGD 
YYRYLAEVAS .EDRAAVVEK 143Z_DROME/6-239 VLGLLDKYLI PKA..SNPES KVFYLKMKGD YYRYLAEVAT GDARNTVVDD 1433_XENLA/1-227 VLDLLDRFLV PNA..TPPES KVFYLKMKGD YYRYLSEVAS GDSKQETVAS 143T_HUMAN/3-236 VLELLDKYLI ANA..TNPES KVFYLKMKGD YFRYLAEVAC GDDRKQTIDN 143F_MOUSE/3-240 VLALLDKFLI KNCNDFQYES KVFYLKMKGD YYRYLAEVAS GEKKNSVVEA 143S_HUMAN/3-238 VLGLLDSHLI KEA..GDAES RVFYLKMKGD YYRYLAEVAT GDDKKRIIDS 1433_LYCES/9-246 TLTAYKAAQD IASAELAPTH PIRLGLALNF SVFYYEILNS PDRACNLAKQ 1434_LYCES/6-243 TLLAYKSAQD IALAELAPTH PIRLGLALNF SVFYYEILNS PDRACNLAKQ 143R_ARATH/7-245 TMLAYKAAQD IAAADMAPTH PIRLGLALNF SVFYYEILNS SDKACNMAKQ 143B_VICFA/7-242 SMKAYESATT AAEAELPPTH PIRLGLALNF SVFYYEILNS PERACHLAKQ 143E_HUMAN/4-239 SLVAYKAASD IAMTELPPTH PIRLGLALNF SVFYYEILNS PDRACRLAKA BMH1_YEAST/4-240 SLEAYKTASE IATTELPPTH PIRLGLALNF SVFYYEIQNS PDKACHLAKQ RA24_SCHPO/6-241 SLEGYKAASE IATAELAPTH PIRLGLALNF SVFYYEILNS PDRACYLAKQ RA25_SCHPO/5-240 SLEAYKAASD IAVAELPPTD PMRLGLALNF SVFYYEILDS PESACHLAKQ 1431_ENTHI/4-239 SLAAYTEATE ISNAELAPTH PIRLGLALNF SVFYFEIMND ADKACQLAKQ 1432_ENTHI/4-238 SLAAYQEATD TA.ASLVPTH PIRLGLALNF SVFYYQIMND ADKACQLAKE 1433_CAEEL/5-237 SQKAYQEALD IAKDKMQPTH PIRLGLALNF SVFYYEILNT PEHACQLAKQ 143Z_DROME/6-239 SQTAYQDAFD ISKGKMQPTH PIRLGLALNF SVFYYEILNS PDKACQLAKQ 1433_XENLA/1-227 SQQAYQEAFE ISKSEMQPTH PIRLGLALNF SVFYYEILNS PEKACSLAKS 143T_HUMAN/3-236 SQGAYQEAFD ISKKEMQPTH PIRLGLALNF SVFYYEILNN PELACTLAKT 143F_MOUSE/3-240 SEAAYKEAFE ISKEHMQPTH PIRLGLALNF SVFYYEIQNA PEQACLLAKQ 143S_HUMAN/3-238 ARSAYQEAMD ISKKEMPPTN PIRLGLALNF SVFHYEIANS PEEAISLAKT 1433_LYCES/9-246 AFDEAIAELD TLGEESYKDS TLIMQLLRDN LTLWTSDMQD DG 1434_LYCES/6-243 AFDEAISELD TLGEESYKDS TLIMQLLRDN LTLWTSDNAD DV 143R_ARATH/7-245 AFEEAIAELD TLGEESYKDS TLIMQLLRDN LTLWTSDYAG AD 143B_VICFA/7-242 AFDEAISELD TLNEESYKDS TLIMQLLRDN LTLWTSDIPE DG 143E_HUMAN/4-239 AFDDAIAELD TLSEESYKDS TLIMQLLRDN LTLWTSDMQG DG BMH1_YEAST/4-240 AFDDAIAELD TLSEESYKDS TLIMQLLRDN LTLWTSDMSE SG RA24_SCHPO/6-241 AFDEAISELD SLSEESYKDS TLIMQLLRDN LTLWTSDAEY SA RA25_SCHPO/5-240 
VFDEAISELD SLSEESYKDS TLIMQLLRDN LTLWTSDAEY NQ 1431_ENTHI/4-239 AFDDAIAKLD EVPENMYKDS TLIMQLLRDN LTLWTSDACD EE 1432_ENTHI/4-238 AFDEAIQKLD EVPEESYKES TLIMQLLRDN LTLWTSDMGD DE 1433_CAEEL/5-237 AFDDAIAELD TLNEDSYKDS TLIMQLLRDN LTLWTSDVGA ED 143Z_DROME/6-239 AFDDAIAELD TLNEDSYKDS TLIMQLLRDN LTLWTSDTQG DE 1433_XENLA/1-227 AFDEAIRELD TLNEESYKDS TLIMQLLRDN LTLWTSENQG EE 143T_HUMAN/3-236 AFDEAIAELD TLNEDSYKDS TLIMQLLRDN LTLWTSDSAG EE 143F_MOUSE/3-240 AFDDAIAELD TLNEDSYKDS TLIMQLLRDN LTLWTSDQQD EE 143S_HUMAN/3-238 TFDEAMADLH TLSEDSYKDS TLIMQLLRDN LTLWTADNAG EE tblastn.out100644000766000024 544114006427353 16600 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataTBLASTN 2.2.10 [Oct-19-2004] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= HAHU | 1114 | Hemoglobin alpha chain - Human, chimpanzee, and pygmy chimpanzee (141 letters) Database: testnt.fa 2 sequences; 13,260 total letters Searching..done Score E Sequences producing significant alignments: (bits) Value gi|10040111|emb|AL390796.6|AL390796 Homo sapiens chromosome 1 cl... 
18 2.3 test6 16 6.7 >gi|10040111|emb|AL390796.6|AL390796 Homo sapiens chromosome 1 clone RP11-562F3, *** SEQUENCING IN PROGRESS ***, 16 unordered pieces /len=164198 Length = 8160 Score = 17.7 bits (34), Expect = 2.3 Identities = 8/23 (34%), Positives = 11/23 (47%) Frame = +1 Query: 56 KGHGKKVADALTNAVAHVDDMPN 78 KGH KK+ N ++ D N Sbjct: 7603 KGHLKKITSFFFNHTSNYRDEYN 7671 Score = 17.3 bits (33), Expect = 3.0 Identities = 9/28 (32%), Positives = 13/28 (46%) Frame = +1 Query: 31 RMFLSFPTTKTYFPHFDLSHGSAQVKGH 58 RM+ F TY+ +F L V G+ Sbjct: 7069 RMYFCFLKVLTYYFNFFLFFNIFYVLGY 7152 >test6 Length = 5100 Score = 16.2 bits (30), Expect = 6.7 Identities = 6/9 (66%), Positives = 7/9 (77%) Frame = +3 Query: 72 HVDDMPNAL 80 H+D M NAL Sbjct: 3822 HMDYMSNAL 3848 Score = 15.8 bits (29), Expect = 8.8 Identities = 5/7 (71%), Positives = 6/7 (85%) Frame = -2 Query: 93 VDPVNFK 99 +DP NFK Sbjct: 1814 IDP*NFK 1794 Database: testnt.fa Posted date: Feb 5, 2005 6:26 PM Number of letters in database: 13,260 Number of sequences in database: 2 Lambda K H 0.319 0.130 0.385 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1998 Number of Sequences: 2 Number of extensions: 23 Number of successful extensions: 4 Number of sequences better than 10.0: 2 Number of HSP's better than 10.0 without gapping: 3 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 4 length of query: 141 length of database: 4420 effective HSP length: 39 effective length of query: 102 effective length of database: 4342 effective search space: 442884 effective search space used: 442884 frameshift window, decay const: 40, 0.1 T: 13 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 29 (16.3 bits) S2: 29 (15.8 bits) AF305198.gb100644000766000024 725114006427353 15673 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS AF305198 1744 bp DNA linear BCT 
18-DEC-2000 DEFINITION Virginia creeper phytoplasma 16S ribosomal RNA gene, partial sequence; and tRNA-Ile gene, complete sequence. ACCESSION AF305198 VERSION AF305198.1 GI:11878216 KEYWORDS . SOURCE Virginia creeper phytoplasma ORGANISM Virginia creeper phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma; 16SrV (Elm yellows group). REFERENCE 1 (bases 1 to 1744) AUTHORS Harrison,N.A., Griffiths,H.M., Carpio,M.L. and Richardson,P.A. TITLE Detection of an Elm yellows group (16SrV) phytoplasma associated with Virginia creeper plants in southern Florida JOURNAL Unpublished REFERENCE 2 (bases 1 to 1744) AUTHORS Griffiths,H.M., Harrison,N.A., Carpio,M.L. and Richardson,P.A. TITLE Direct Submission JOURNAL Submitted (13-SEP-2000) Research and Education Center, University of Florida, 3205 College Avenue, Fort Lauderdale, FL 33314-7799, USA FEATURES Location/Qualifiers source 1..1744 /organism="Virginia creeper phytoplasma" /mol_type="genomic DNA" /strain="VC-Fl" /db_xref="taxon:145578" rRNA <1..1491 /product="16S ribosomal RNA" tRNA 1599..1675 /product="tRNA-Ile" ORIGIN 1 gcctaataca tgcaagtcga acggagaccc ttcaaaaggt cttagtggcg aacgggtgag 61 taacacgtaa gtaacctacc tttaagacga ggataacaat cggaaacggt tgctaagact 121 ggataggaaa cagaaaggca tctttttgtt tttaaaagac cttcttcgga gggtatgctt 181 aaagaggggc ttgcgccaca ttagttagtt ggtgaggtaa aggcttacca agattatgat 241 gtgtagctgg actgagaggt tgaacagcca cattgggact gagacacggc ccaaactcct 301 acgggaggca gcagtaggga attttcggca atggaggaaa ctctgaccga gcgacgccgc 361 gtgaacgatg aagtatttcg gtatgtaaag ttcttttatt gaagaagaaa aaatagtgga 421 aaaactatct tgacgttatt caatgaataa gccccggcta actatgtgcc agcagccgcg 481 gtaagacata gggggcgagc gttatccgga attattgggc gtaaagggtg cgtaggcggt 541 tagataagtc tataatttaa tttcagtgct taacgctgtc ttgttataga aactgtcttg 601 actagagtga gatagaggca agcggaattc catgtgtagc ggtaaaatgt gtaaatatat 661 ggaggaacac cagaagcgta ggcggcttgc tgggtcttta ctgacgctga ggcacgaaag 721 cgtggggagc aaacaggatt 
agataccctg gtagtccacg ctgtaaacga tgagtactaa 781 gtgtcggggg taactcggta ctgaagttaa cacattaagt actccgcctg agtagtacgt 841 acgcaagtat gaaacttaaa ggaattgacg ggactccgca caagcggtgg atcatgttgt 901 ttaattcgaa gatacacgaa aaaccttacc aggtcttgac atactctgca aagctataga 961 aatatagtgg aggttatcag ggatacaggt ggtgcatggt tgtcgtcagt tcgtgtcgtg 1021 agatgttagg ttaagtccta aaacgaacgc aacccctgtc gctagttgcc agcacgtaat 1081 ggtggggact ttagcgagac tgccaattaa acattggagg aaggtgggga taacgtcaaa 1141 tcatcatgcc ccttatgatc tgggctacaa acgtgataca atggctatta caaagagtag 1201 ctgaaacgcg agtttttagc caatctcaaa aaggtagtct cagtacggat tgaagtctgc 1261 aactcgactt tatgaagctg gaatcgctag taatcgcgaa tcagcatgtc gcggtgaata 1321 cgttctcggg gtttgtacac accgcccgtc aaaccacgaa agttagcaat acccgaaagc 1381 agtggcttaa cttcgcaaga agagggagct gtctaaggta gggttgatga ttggggttaa 1441 gtcgtaacaa ggtatcctta ccggaaggtg aggatggatc acctcctttc taaggacata 1501 catataaaaa tcatcatctt cagttttgaa agacttaggt taaaatataa gtttttcttt 1561 ttataaaaaa aagtgtttct cttatataaa agaccaaagg gcctatagct cagttggtta 1621 gagcacacgc ctgataagcg tgaggtcggt ggttcgagtc cacttaggcc caccaatttt 1681 atatcaggaa aatatttact tcgaagaaag ttctttgaaa agtagataaa catgatttgc 1741 ttaa // dq519393.gb100644000766000024 447414006427353 16021 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS DQ519393 748 bp RNA linear VRL 04-JUL-2006 DEFINITION Hepatitis delta virus genotype I delta antigen gene, complete cds. ACCESSION DQ519393 VERSION DQ519393.1 GI:109726906 KEYWORDS . SOURCE Hepatitis delta virus ORGANISM Hepatitis delta virus Viruses; Deltavirus. REFERENCE 1 (bases 1 to 748) AUTHORS Wang,T.-C. and Chao,M. 
TITLE Direct Submission JOURNAL Submitted (27-APR-2006) Microbiology and Immunology, Chang Gung Unversity, 259, Wen-Hwa 1st Road, Kwei-Shan, Tao-Yuan 333, Taiwan FEATURES Location/Qualifiers source 1..748 /organism="Hepatitis delta virus" /mol_type="genomic RNA" /db_xref="taxon:12475" /country="Taiwan" /note="genotype: I" CDS 34..675 /codon_start=1 /product="delta antigen" /protein_id="ABG45868.1" /db_xref="GI:109726907" /translation="MSQPDPRRPRKGREETLGKWIDARRRKEELERDLRKVNKTIKRL EEDNPWLGNIRGIIRKDKDGDGAPPAKRARTDQMEVDSGPRKRKHPGGFTEQERRDHR RRKALENKKKQLSSGGKSLSREEEEELRRLTEEDERRERRVAGPRVGDVNPLDGGPRG APGGGFVPSMLGVPESPFTRRGDGLDTMGTQEFPWVSPQPPPPRLPLLECTPQ" ORIGIN 1 aagagttctg aggaccgccg cccgcaatcg aggatgagcc agcccgatcc gaggagaccc 61 cggaagggga gagaagagac actcggaaag tggatcgatg caagaagaag aaaggaggag 121 ctcgagaggg acctccggaa ggtcaacaag actatcaaga gacttgagga ggacaacccc 181 tggctgggaa acatccgagg aataataaga aaagacaagg acggggatgg agctcccccg 241 gcgaagagag ccaggacgga tcagatggag gtcgactccg gacctaggaa gaggaaacac 301 ccgggagggt tcaccgaaca ggagagacgg gatcaccgaa ggaggaaggc cctggagaac 361 aagaagaagc aactctcctc gggagggaag agcctcagca gggaggaaga agaggaactt 421 cgaaggttga ccgaggaaga cgagagaagg gaaagaagag tagccggccc gcgggttggg 481 gatgtgaacc ccctcgacgg aggtccgagg ggagcgcccg gaggcgggtt cgtccccagc 541 atgctaggag tcccggagtc tcccttcact cggagagggg acggcctgga caccatggga 601 actcaggaat ttccgtgggt tagccctcag ccccctcctc cccgccttcc actcctcgag 661 tgtacccccc aataaaggat cgagggttcc actcacggtt cgcgtccgtg tccttccttt 721 cctcttcggg tcggcatggc ataagggc // badfasta.fa100644000766000024 202514006427353 16450 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>CEESC12R gcacgagtccatctccatatgccaccacaacantggtcctgtcgaaccaa caaccagcttggctcaatgacaaaatgcttcgcgcgccanaatgccaaca aatcccgtgccaccagagccaccggcgcgatatgcagatcataccgctgg aagacgatctcgatcgagccgtgcatccgatgggagaggaactctgaatg gcggactccatcaccggactagcggaagtcaacggtcggatagtccacct cacacagatgtgagctatgttcagcttcactcatccgatggaactggtag 
tagtaaggaaagaantngggagcggagaacaccaccgaataaa >CEESC13F cttgcttgaaaaatttatataaatatttaagagaagaaaaataaataatc gcatctaatgacgtctgtccttgtatccctggtttccattgactggtgca ctttcctgtctttgaggacatggacaatattcggcatcagttcctggctc tccctcctctcctggtgctccagcagaaccgttctctccattatctccct tgtctccacgtggtccacgctctcctggtgctcctggaataccttgagct ccctcgtgccgaattcctgcagcccgggggatccactagttctagagcgg ccgccaccgcggtgggagctccagcttttgttncctttagtgagggttaa tttcgagcttggcgtaatcatggtcatagctgtttcctg >CEESC13R gcacgagggagctcaaggtattccaggagcaccaggagagcgtggaccac gtggagacaagggagataatggagagaacggttctgctggagcaccagga gaggagggagagccaggaactgatgccgaatattgtccatgtcctcaaag acaggaaagtgcaccagtcaatggaaaccagggatacaagaacagacgtc attagatgcgattatttatttttcttctcttaaatatttatataaatttt tcaagcaag AB077698.gb100644000766000024 1646114006427353 15725 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS AB077698 2701 bp mRNA linear PRI 01-MAR-2002 DEFINITION Homo sapiens mRNA for hCHCR-G, complete cds. ACCESSION AB077698 VERSION AB077698.1 GI:19032344 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 AUTHORS Squillace,R.M., Chenault,D.M. and Wang,E.H. TITLE Inhibition of myogenesis by the novel Muscleblind-related protein CHCR JOURNAL Unpublished REFERENCE 2 AUTHORS Squillace,R.M. and Wang,E.H. TITLE Genomic structure, chromosomal localization, and splicing variation of the human CHCR gene, cloning and characterization of mouse CHCR JOURNAL Unpublished REFERENCE 3 (bases 1 to 2701) AUTHORS Squillace,R.M., Chenault,D.M. and Wang,E.H. TITLE Direct Submission JOURNAL Submitted (10-JAN-2002) Edith H. 
Wang, University of Washington, Pharmacology; 1959 NE Pacific Ave., Box 357280, Seattle, Washington 98195, USA (E-mail:ehwang@u.washington.edu, Tel:206-616-5376, Fax:206-685-3822) FEATURES Location/Qualifiers source 1..2701 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq26.1" gene 1..2701 /gene="CHCR" 5'UTR <1..79 /gene="CHCR" CDS 80..1144 /gene="CHCR" /note="Cys3His CCG1-Required Encoded on BAC clone RP5-842K24 (AL050310) The human CHCR (Cys3His CCG1-Required) protein is highly related to EXP/MBNL (Y13829, NM_021038, AF401998) and MBLL (NM_005757,AF061261), which together comprise the human Muscleblind family" /codon_start=1 /product="hCHCR-G" /protein_id="BAB85648.1" /db_xref="GI:19032345" /translation="MTAVNVALIRDTKWLTLEVCREFQRGTCSRADADCKFAHPPRVC HVENGRVVACFDSLKGRCTRENCKYLHPPPHLKTQLEINGRNNLIQQKTAAAMFAQQM QLMLQNAQMSSLGSFPMTPSIPANPPMAFNPYIPHPGMGLVPAELVPNTPVLIPGNPP LAMPGAVGPKLMRSDKLEVCREFQRGNCTRGENDCRYAHPTDASMIEASDNTVTICMD YIKGRCSREKCKYFHPPAHLQARLKAAHHQMNHSAASAMALQPGTLQLIPKRSALEKP NGATPVFNPTVFHCQQALTNLQLPQPAFIPAGPILCMAPASNIVPMMHGATPTTVSAA TTPATSVPFAAPTTGNQLKF" misc_feature 137..196 /gene="CHCR" /note="Cys3His, zinc finger Encoded on BAC clone RP5-842K24 (AL050310)" misc_feature 239..292 /gene="CHCR" /note="Cys3His, zinc finger Encoded on BAC clone RP5-842K24 (AL050310)" misc_feature 617..676 /gene="CHCR" /note="Cys3His, zinc finger Encoded on BAC clone RP5-842K24 (AL050310)" misc_feature 725..778 /gene="CHCR" /note="Cys3His, zinc finger Encoded on BAC clone RP5-842K24 (AL050310)" 3'UTR 1145..2659 /gene="CHCR" polyA_site 1606 /gene="CHCR" /note="Encoded on BAC clone RP5-842K24 (AL050310); PolyA_site#1 used by CHCR EST clone PLACE1010202 (AK002178)" polyA_site 2660 /gene="CHCR" /note="Encoded on BAC clone RP5-842K24 (AL050310); PolyA_site#2 used by CHCR EST clone DKFZp434G2222 (AL133625)" BASE COUNT 817 a 570 c 525 g 789 t ORIGIN 1 aattcatttt taatccttta atagtccaca gtaatattgt cctaaagagg gtacattgga 61 ttttaatttt gctttcaata tgacggctgt caatgttgcc 
ctgattcgtg ataccaagtg 121 gctgacttta gaagtctgta gagaatttca gagaggaact tgctctcgag ctgatgcaga 181 ttgcaagttt gcccatccac caagagtttg ccatgtggaa aatggtcgtg tggtggcctg 241 ttttgattct ctaaagggtc ggtgtacccg agagaactgc aagtaccttc accctcctcc 301 acacttaaaa acgcagctgg agattaatgg gcggaacaat ctgattcaac agaagactgc 361 cgcagccatg ttcgcccagc agatgcagct tatgctccaa aacgctcaaa tgtcatcact 421 tggttctttt cctatgactc catcaattcc agctaatcct cccatggctt tcaatcctta 481 cataccacat cctgggatgg gcctcgttcc tgcagaactt gtaccaaata cacctgttct 541 gattcctgga aacccacctc ttgcaatgcc aggagctgtt ggcccaaaac tgatgcgttc 601 agataaactg gaggtttgcc gagaatttca gcgtggaaat tgtacccgtg gggagaatga 661 ttgccgctat gctcacccta ctgatgcttc catgattgaa gcgagtgata atactgtgac 721 aatctgcatg gattacatca aaggtcgatg ctcgcgggag aaatgcaagt actttcatcc 781 tcctgcacac ttgcaagcca gactcaaggc agctcatcat cagatgaacc attcagctgc 841 ctctgccatg gccctgcagc ctggtacact gcaactgata ccaaagagat cagcactgga 901 aaagcccaat ggtgccaccc cggtctttaa tcccactgtt ttccactgcc aacaggctct 961 gactaacctg cagctcccac agccggcatt tatccctgca gggccaatac tgtgcatggc 1021 acccgcttca aatattgtgc ccatgatgca cggtgctaca cctaccactg tgtctgcagc 1081 aacaacacct gccaccagcg ttccgttcgc tgcaccaact acaggcaatc agctgaaatt 1141 ctgaacagca gagttatgga gtatcagaat ctttccatgg aaacctccat atggcctttc 1201 tatatatatt ctcgtatgtc ttattctacc aacacaacaa taagcgtgtt gcagtcaatg 1261 tattaagcaa agcaaacctg ccagccagca aattcaaata aaaaataaag cattaaaaat 1321 caatggagat gttaaaacaa cacaaataga aaactagtaa ctaccatcca tcctatttga 1381 attatcaagc agaacatgac cataaaattt ggtaacttgt tacattactc tttgtgattt 1441 tctaataacc atgctaagtg tatttccaca gtgagctttt ggcttactat atacattctt 1501 ggtggataaa ttgttcatct gtttttgaag tgttacctta ctattttgtt tacaagatag 1561 tctattgggt tgattcagga tgtaacaaat atattcagta ccatttcttg tgttgtattg 1621 tgttgtgctg tgttaggttt ttacatactg tagtgttttg ctgtatatgt gtggtgtttg 1681 atttcaacta aagtgttatt agtggggaac agaagtatat gtgcttaaga acatgacagg 1741 ttcatgcaaa tatgctctct ttctttagaa tatttctgta ggtttcttgg gactgacatt 1801 
taaaacgcct cacttttgaa tgtgcacaaa acctgctcat taacatgcat gtgtataatt 1861 tgtacctgca gatctgatgt tgcataatac aatcaaatta ctagattttt taaagagaga 1921 aataattacc tgcacaaagc agagaacttc ataaaacatt aacccctaat tcactcttct 1981 taaatagctt ggcaaataag actttacctt taaatgaatt tctcagcatt tatactaaaa 2041 attatgtaac gtgctcatta gattttttgt gtgtgtggct tgagaatccc atctcctaaa 2101 ttgagtgtct aaaactgagc catttgtcat cttcagctga gaaactggta cttgggagct 2161 taaaaatatg ctaattacaa gttataaatc aaacggagag atgggggcat ggagatagtt 2221 tttacgtact ggaggaaagt gtgtaaaacc atggcaatgt caccttttac acaaatgcca 2281 ttttccaaat gcaaatggct catgctcttt agactactct ttgaataaca agtaagatgc 2341 aatctagcaa aagtcagtca gggtgaaaga gaattggttg caaatgagga cttccctccc 2401 caaatggaca gtcttctctg ttgatcacag agggagcctg agtacaggct tggagaaatg 2461 gctaggacag ggaacaggga agcacttaca attattcctt gatttattca aaagaactgg 2521 gaaagatggt tgtagttgtc tttagcttcg gttcaactga gtttcgtttt gttaaacagt 2581 tcagtgaagg agaaagcacc tgtgatatat ggcaagtgtc cccctgccca aactttaaca 2641 tcagaccctc tcacatcata aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2701 a // DQ018368.gb100644000766000024 571314006427353 15712 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS DQ018368 523 bp DNA linear PLN 23-MAY-2005 DEFINITION (Populus tomentosa x P. bolleana) x P. tomentosa var. truncata BS-LRR type disease resistance protein (RGA6) gene, partial cds. ACCESSION DQ018368 VERSION DQ018368.1 GI:66271013 KEYWORDS . SOURCE (Populus tomentosa x P. bolleana) x P. tomentosa var. truncata ORGANISM (Populus tomentosa x P. bolleana) x P. tomentosa var. truncata Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids; eurosids I; Malpighiales; Salicaceae; Saliceae; Populus. REFERENCE 1 (bases 1 to 523) AUTHORS Zhang,Q., Lin,S.Z., Lin,Y.Z., Zhou,Y.L., Zhang,Z.Y., Zheng,H.Q., Chen,J.B., Wang,Z.L., Qiao,M.J., Wang,X. and Zhu,B.Q. 
TITLE Characterization and cloning of disease resistance gene from poplar JOURNAL Unpublished REFERENCE 2 (bases 1 to 523) AUTHORS Zhang,Q., Lin,S.Z., Lin,Y.Z., Zhou,Y.L. and Zhang,Z.Y. TITLE Direct Submission JOURNAL Submitted (26-APR-2005) Key Laboratory for Genetics and Breeding in Forest Trees and Ornamental Plants, MOE, Beijing Forestry University, Box 118, Qinghuadong Road, Beijing 100083, P.R.China FEATURES Location/Qualifiers source 1..523 /organism="(Populus tomentosa x P. bolleana) x P. tomentosa var. truncata" /mol_type="genomic DNA" /db_xref="taxon:328805" gene <1..>523 /gene="RGA6" mRNA <1..>523 /gene="RGA6" /product="BS-LRR type disease resistance protein" CDS <1..>523 /gene="RGA6" /codon_start=1 /product="BS-LRR type disease resistance protein" /protein_id="AAY43785.1" /db_xref="GI:66271014" /translation="GMGGIGKTTVARVVYDRIRWQFEGSCFLANVREDLAKKGGQRRL QEQLLSEILMERANICDSSRGIEMIKRRLQRKKILVVLDDVDDRKQLESLAAESKWFG PESRIIITSRDKQVLTRNGVTRIYEAEKLNDDDALMLFSQKAFKKDQPVEDFVKLSKQ VVGYANGPSTCPQS" ORIGIN 1 gggatggggg gtataggtaa gactactgtt gcaagggtag tatatgatag gattcgttgg 61 caatttgaag gtagctgttt cttagcaaat gtcagagaag atcttgctaa gaaaggtgga 121 caacgccgtt tacaggagca acttctttct gaaatcttaa tggaacgtgc taatatatgt 181 gattcttcta gaggaattga aatgataaag cggaggttac aacgtaaaaa gattcttgtt 241 gttcttgatg atgtagatga ccgtaaacaa ctagaatccc tggctgcgga gagtaaatgg 301 tttggtccag agagtagaat tatcataaca agcagagata aacaagtgtt gactagaaat 361 ggtgttacta gaatttatga ggctgagaaa ttgaatgatg atgatgctct tatgttgttt 421 agccagaaag ctttcaaaaa agaccaacct gttgaggatt ttgtgaaact atccaagcaa 481 gttgtgggtt atgctaatgg gccttccact tgccctcaaa gtc // roa1_v2.dat100644000766000024 1070714006427353 16364 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID X79536; SV 1; linear; mRNA; STD; HUM; 1198 BP. XX AC X79536; XX DT 03-JUN-1994 (Rel. 39, Created) DT 18-APR-2005 (Rel. 83, Last updated, Version 2) XX DE H.sapiens mRNA for hnRNPcore protein A1 XX KW core protein. 
XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. XX RN [2] RP 1-1198 RA Leffers H.; RT ; RL Submitted (01-JUN-1994) to the EMBL/GenBank/DDBJ databases. RL H. Leffers, Inst. of Medical Research Biochemistry & Danish Centre for RL Human Genome Research, Ole Worms Alle 170, Aarhus Univ., 8000 Aarhus C, RL DENMARK XX RN [3] RA Knudsen S.M., Leffers H.; RT "Cloning and sequencing of a splice variant of human hnRNP core protein RT A1"; RL Unpublished. XX DR H-InvDB; HIT000323756. XX FH Key Location/Qualifiers FH FT source 1..1198 FT /organism="Homo sapiens" FT /mol_type="mRNA" FT /clone_lib="lambda ZapII" FT /clone="HDP1" FT /cell_line="MRC5" FT /cell_type="fibroblast" FT /tissue_type="lung" FT /db_xref="taxon:9606" FT CDS 27..989 FT /product="hnRNPcore protein A1" FT /db_xref="GDB:127388" FT /db_xref="GOA:P09651" FT /db_xref="HGNC:5031" FT /db_xref="InterPro:IPR000504" FT /db_xref="InterPro:IPR012677" FT /db_xref="PDB:1HA1" FT /db_xref="PDB:1L3K" FT /db_xref="PDB:1PGZ" FT /db_xref="PDB:1PO6" FT /db_xref="PDB:1U1K" FT /db_xref="PDB:1U1L" FT /db_xref="PDB:1U1M" FT /db_xref="PDB:1U1N" FT /db_xref="PDB:1U1O" FT /db_xref="PDB:1U1P" FT /db_xref="PDB:1U1Q" FT /db_xref="PDB:1U1R" FT /db_xref="PDB:1UP1" FT /db_xref="PDB:2UP1" FT /db_xref="UniProtKB/Swiss-Prot:P09651" FT /protein_id="CAA56072.1" FT /translation="MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVV FT MRDPNTKRSRGFGFVTYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTV FT KKIFVGGIKEDTEEHHLRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIV FT IQKYHTVNGHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNF FT SGRGGFGGSRGGGGYGGSGDGYNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNF FT GGRSSGPYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF" FT misc_feature 995 FT /note="alternative splice site" FT polyA_signal 1180..1185 XX SQ Sequence 1198 BP; 334 A; 216 C; 336 G; 312 T; 0 other; ttaaagtctc tcttcaccct gccgtcatgt ctaagtcaga 
gtctcctaaa gagcccgaac 60 agctgaggaa gctcttcatt ggagggttga gctttgaaac aactgatgag agcctgagga 120 gccattttga gcaatgggga acgctcacgg actgtgtggt aatgagagat ccaaacacca 180 agcgctctag gggctttggg tttgtcacat atgccactgt ggaggaggtg gatgcagcta 240 tgaatgcaag gccacacaag gtggatggaa gagttgtgga accaaagaga gctgtctcca 300 gagaagattc tcaaagacca ggtgcccact taactgtgaa aaagatattt gttggtggca 360 ttaaagaaga cactgaagaa catcacctaa gagattattt tgaacagtat ggaaaaattg 420 aagtgattga aatcatgact gaccgaggca gtggcaagaa aaggggcttt gcctttgtaa 480 cctttgacga ccatgactcc gtggataaga ttgtcattca gaaataccat actgtgaatg 540 gccacaactg tgaagttaga aaagccctgt caaagcaaga gatggctagt gcttcatcca 600 gccaaagagg tcgaagtggt tctggaaact ttggtggtgg tcgtggaggt ggtttcggtg 660 ggaatgacaa cttcggtcgt ggaggaaact tcagtggtcg tggtggcttt ggtggcagcc 720 gtggtggtgg tggatatggt ggcagtgggg atggctataa tggatttggc aatgatggaa 780 gcaattttgg aggtggtgga agctacaatg attttgggaa ttacaacaat cagtcttcaa 840 attttggacc catgaaggga ggaaattttg gaggcagaag ctctggcccc tatggcggtg 900 gaggccaata ctttgcaaaa ccacgaaacc aaggtggcta tggcggttcc agcagcagca 960 gtagctatgg cagtggcaga agattttaat tagggaggag tctgctacta gtcttatcag 1020 ctcttaaaaa cagaaactca tctgtccaag ttcgtggcag aaaggaacgt ccttgtgaag 1080 acctttatct gagccactgt acttcgttat cacgccatgc agtttacatg agctgttctg 1140 cagctcgaaa ttccattttg tgaatgggtt ttttttttta ataaactgta tttaactt 1198 //alleles.fas100644000766000024 25614006427353 16473 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>testseq GGATCCATTGCTACT >allele1 GGATCCATTCCTACT >allele2 GGAT--ATTCCTCCT >allele3 GCATCCATTGCTACT >allele4 GGATCCATTGCTACT >allele5 GGATCCATTGCTACA >allele6 GGATCCATTGCTAGAgmap_f9.txt100644000766000024 53270514006427353 16533 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>NM_004448 Homo sapiens v-erb-b2 erythroblastic leukemia viral oncogene homolog 2, neuro/glioblastoma derived oncogene homolog (avian) (ERBB2), transcript variant 1, mRNA. 
4624 bp, mRNA, linear, PRI 20-DEC-2004 0 1 G +17:35109780 2524641287 G 1 E 2 G +17:35109781 2524641288 G E 1 3 A +17:35109782 2524641289 A 1 4 G +17:35109783 2524641290 G 2 E 5 G +17:35109784 2524641291 G E 2 6 A +17:35109785 2524641292 A 2 7 G +17:35109786 2524641293 G 3 V 8 G +17:35109787 2524641294 G V 3 9 T +17:35109788 2524641295 T 3 10 G +17:35109789 2524641296 G 4 E 11 G +17:35109790 2524641297 G E 4 12 A +17:35109791 2524641298 A 4 13 G +17:35109792 2524641299 G 5 E 14 G +17:35109793 2524641300 G E 5 15 A +17:35109794 2524641301 A 5 16 G +17:35109795 2524641302 G 6 E 17 G +17:35109796 2524641303 G E 6 18 A +17:35109797 2524641304 A 6 19 G +17:35109798 2524641305 G 7 G 20 G +17:35109799 2524641306 G G 7 21 G +17:35109800 2524641307 G 7 22 C +17:35109801 2524641308 C 8 C 23 T +17:35109802 2524641309 T C 8 24 G +17:35109803 2524641310 G 8 25 C +17:35109804 2524641311 C 9 L 26 T +17:35109805 2524641312 T L 9 27 T +17:35109806 2524641313 T 9 28 G +17:35109807 2524641314 G 10 R 29 A +17:35109808 2524641315 A R 10 30 G +17:35109809 2524641316 G 10 31 G +17:35109810 2524641317 G 11 K 32 A +17:35109811 2524641318 A K 11 33 A +17:35109812 2524641319 A 11 34 G +17:35109813 2524641320 G 12 Y 35 T +17:35109814 2524641321 T Y 12 36 A +17:35109815 2524641322 A 12 37 T +17:35109816 2524641323 T 13 K 38 A +17:35109817 2524641324 A K 13 39 A +17:35109818 2524641325 A 13 40 G +17:35109819 2524641326 G 14 N 41 A +17:35109820 2524641327 A N 14 42 A +17:35109821 2524641328 A 14 43 T +17:35109822 2524641329 T 15 E 44 G +17:35109823 2524641330 G E 15 45 A +17:35109824 2524641331 A 15 46 A +17:35109825 2524641332 A 16 V 47 G +17:35109826 2524641333 G V 16 48 T +17:35109827 2524641334 T 16 49 T +17:35109828 2524641335 T 17 V 50 G +17:35109829 2524641336 G V 17 51 T +17:35109830 2524641337 T 17 52 G +17:35109831 2524641338 G 18 K 53 A +17:35109832 2524641339 A K 18 54 A +17:35109833 2524641340 A 18 55 G +17:35109834 2524641341 G 19 L 56 C +17:35109835 2524641342 C L 19 57 T 
+17:35109836 2524641343 T 19 58 G +17:35109837 2524641344 G 20 R 59 A +17:35109838 2524641345 A R 20 60 G +17:35109839 2524641346 G 20 61 A +17:35109840 2524641347 A 21 62 +17:35109841 2524641348 T F 21 S 62 T +17:35109842 2524641349 T 21 63 C +17:35109843 2524641350 C 22 64 C +17:35109844 2524641351 C P 22 P 65 C +17:35109845 2524641352 C 22 66 C +17:35109846 2524641353 C 23 67 T +17:35109847 2524641354 T S 23 P 68 C +17:35109848 2524641355 C 23 69 C +17:35109849 2524641356 C 24 70 A +17:35109850 2524641357 A I 24 L 71 T +17:35109851 2524641358 T 24 72 T +17:35109852 2524641359 T 25 73 G +17:35109853 2524641360 G G 25 G 74 G +17:35109854 2524641361 G 25 75 G +17:35109855 2524641362 G 26 76 A +17:35109856 2524641363 A T 26 P 77 C +17:35109857 2524641364 C 26 78 C +17:35109858 2524641365 C 27 79 G +17:35109859 2524641366 G G 27 E 80 G +17:35109860 2524641367 G 27 81 A +17:35109861 2524641368 A 28 82 G +17:35109862 2524641369 G E 28 K 83 A +17:35109863 2524641370 A 28 84 A +17:35109864 2524641371 A 29 85 A +17:35109865 2524641372 A T 29 P 86 C +17:35109866 2524641373 C 29 87 C +17:35109867 2524641374 C 30 88 A +17:35109868 2524641375 A R 30 G 89 G +17:35109869 2524641376 G 30 90 G +17:35109870 2524641377 G 31 91 G +17:35109871 2524641378 G G 31 E 92 G +17:35109872 2524641379 G 31 93 A +17:35109873 2524641380 A 32 94 G +17:35109874 2524641381 G A 32 P 95 C +17:35109875 2524641382 C 32 96 C +17:35109876 2524641383 C 33 97 C +17:35109877 2524641384 C P 33 P 98 C +17:35109878 2524641385 C 33 99 C +17:35109879 2524641386 C 34 100 C +17:35109880 2524641387 C R 34 G 101 G +17:35109881 2524641388 G 34 102 G +17:35109882 2524641389 G 35 103 G +17:35109883 2524641390 G A 35 Q 104 C +17:35109884 2524641391 C 35 105 A +17:35109885 2524641392 A 36 106 G +17:35109886 2524641393 G A 36 P 107 C +17:35109887 2524641394 C 36 108 C +17:35109888 2524641395 C 37 109 G +17:35109889 2524641396 G A 37 R 110 C +17:35109890 2524641397 C 37 111 G +17:35109891 2524641398 G 38 112 C +17:35109892 
2524641399 C R 38 A 113 G +17:35109893 2524641400 G 38 114 C +17:35109894 2524641401 C 39 115 C +17:35109895 2524641402 C P 39 P 116 C +17:35109896 2524641403 C 39 117 C +17:35109897 2524641404 C 40 118 T +17:35109898 2524641405 T F 40 S 119 T +17:35109899 2524641406 T 40 120 C +17:35109900 2524641407 C 41 121 C +17:35109901 2524641408 C P 41 H 122 C +17:35109902 2524641409 C 41 123 A +17:35109903 2524641410 A 42 124 C +17:35109904 2524641411 C R 42 G 125 G +17:35109905 2524641412 G 42 126 G +17:35109906 2524641413 G 43 127 G +17:35109907 2524641414 G G 43 A 128 G +17:35109908 2524641415 G 43 129 C +17:35109909 2524641416 C 44 130 C +17:35109910 2524641417 C P 44 L 131 C +17:35109911 2524641418 C 44 132 T +17:35109912 2524641419 T 45 133 T +17:35109913 2524641420 T L 45 Y 134 T +17:35109914 2524641421 T 45 135 A +17:35109915 2524641422 A 46 136 C +17:35109916 2524641423 C L 46 C 137 T +17:35109917 2524641424 T 46 138 G +17:35109918 2524641425 G 47 139 C +17:35109919 2524641426 C R 47 A 140 G +17:35109920 2524641427 G 47 141 C +17:35109921 2524641428 C 48 142 C +17:35109922 2524641429 C R 48 A 143 G +17:35109923 2524641430 G 48 144 C +17:35109924 2524641431 C 49 145 G +17:35109925 2524641432 G A 49 R 146 C +17:35109926 2524641433 C 49 147 G +17:35109927 2524641434 G 50 148 C +17:35109928 2524641435 C P 50 P 149 C +17:35109929 2524641436 C 50 150 C +17:35109930 2524641437 C 51 151 G +17:35109931 2524641438 G G 51 A 152 G +17:35109932 2524641439 G 51 153 C +17:35109933 2524641440 C 52 154 C +17:35109934 2524641441 C P 52 P 155 C +17:35109935 2524641442 C 52 156 C +17:35109936 2524641443 C 53 157 C +17:35109937 2524641444 C H 53 T 158 A +17:35109938 2524641445 A 53 159 C +17:35109939 2524641446 C 54 160 C +17:35109940 2524641447 C P 54 P 161 C +17:35109941 2524641448 C 54 162 C +17:35109942 2524641449 C 55 163 T +17:35109943 2524641450 T S 55 R 164 C +17:35109944 2524641451 C 55 165 G +17:35109945 2524641452 G 56 166 C +17:35109946 2524641453 C Q 56 S 167 A 
+17:35109947 2524641454 A 56 168 G +17:35109948 2524641455 G 57 169 C +17:35109949 2524641456 C H 57 T 170 A +17:35109950 2524641457 A 57 171 C +17:35109951 2524641458 C 58 172 C +17:35109952 2524641459 C P 58 P 173 C +17:35109953 2524641460 C 58 174 C +17:35109954 2524641461 C 59 175 G +17:35109955 2524641462 G A 59 R 176 C +17:35109956 2524641463 C 59 177 G +17:35109957 2524641464 G 60 178 C +17:35109958 2524641465 C P 60 P 179 C +17:35109959 2524641466 C 60 180 C +17:35109960 2524641467 C 61 181 C +17:35109961 2524641468 C R 61 A 182 G +17:35109962 2524641469 G 61 183 C +17:35109963 2524641470 C 62 184 G +17:35109964 2524641471 G A 62 P 185 C +17:35109965 2524641472 C 62 186 C +17:35109966 2524641473 C 63 187 C +17:35109967 2524641474 C L 63 S 188 T +17:35109968 2524641475 T 63 189 C +17:35109969 2524641476 C 64 190 C +17:35109970 2524641477 C P 64 Q 191 C +17:35109971 2524641478 C 64 192 A +17:35109972 2524641479 A 65 193 G +17:35109973 2524641480 G A 65 P 194 C +17:35109974 2524641481 C 65 195 C +17:35109975 2524641482 C 66 196 G +17:35109976 2524641483 G G 66 G 197 G +17:35109977 2524641484 G 66 198 G +17:35109978 2524641485 G 67 199 T +17:35109979 2524641486 T S 67 P 200 C +17:35109980 2524641487 C 67 201 C +17:35109981 2524641488 C 68 202 A +17:35109982 2524641489 A S 68 A 203 G +17:35109983 2524641490 G 68 204 C +17:35109984 2524641491 C 69 205 C +17:35109985 2524641492 C R 69 G 206 G +17:35109986 2524641493 G 69 207 G +17:35109987 2524641494 G 70 208 A +17:35109988 2524641495 A S 70 A 209 G +17:35109989 2524641496 G 70 210 C +17:35109990 2524641497 C 71 211 C +17:35109991 2524641498 C H 71 M 212 A +17:35109992 2524641499 A 71 213 T +17:35109993 2524641500 T 72 214 G +17:35109994 2524641501 G G 72 G 215 G +17:35109995 2524641502 G 72 216 G +17:35109996 2524641503 G 73 217 G +17:35109997 2524641504 G A 73 P 218 C +17:35109998 2524641505 C 73 219 C +17:35109999 2524641506 C 74 220 G +17:35110000 2524641507 G G 74 E 221 G +17:35110001 2524641508 G 74 222 A 
+17:35110002 2524641509 A 75 223 G +17:35110003 2524641510 G A 75 P 224 C +17:35110004 2524641511 C 75 225 C +17:35110005 2524641512 C 76 226 G +17:35110006 2524641513 G A 76 Q 227 C +17:35110007 2524641514 C 76 228 A +17:35110008 2524641515 A 77 229 G +17:35110009 2524641516 G V 77 * 230 T +17:35110010 2524641517 T 77 231 G +17:35110011 2524641518 G 78 232 A +17:35110012 2524641519 A S 78 233 G +17:35110013 2524641520 G 78 234 C +17:35110014 2524641521 C 79 235 A +17:35110015 2524641522 A T 79 236 C +17:35110016 2524641523 C 79 237 C +17:35110017 2524641524 C 80 238 A +17:35110018 2524641525 A M 80 239 T +17:35110019 2524641526 T 80 240 G +17:35110020 2524641527 G 81 241 G +17:35110021 2524641528 G E 81 242 A +17:35110022 2524641529 A 81 243 G +17:35110023 2524641530 G 82 244 C +17:35110024 2524641531 C L 82 245 T +17:35110025 2524641532 T 82 246 G +17:35110026 2524641533 G 83 247 G +17:35110027 2524641534 G A 83 248 C +17:35110028 2524641535 C 83 249 G +17:35110029 2524641536 G 84 250 G +17:35110030 2524641537 G A 84 251 C +17:35110031 2524641538 C 84 252 C +17:35110032 2524641539 C 85 253 T +17:35110033 2524641540 T L 85 254 T +17:35110034 2524641541 T 85 255 G +17:35110035 2524641542 G 86 256 T +17:35110036 2524641543 T C 86 257 G +17:35110037 2524641544 G 86 258 C +17:35110038 2524641545 C 87 259 C +17:35110039 2524641546 C R 87 260 G +17:35110040 2524641547 G 87 261 C +17:35110041 2524641548 C 88 262 T +17:35110042 2524641549 T W 88 263 G +17:35110043 2524641550 G 88 264 G +17:35110044 2524641551 G 89 265 G +17:35110045 2524641552 G G 89 266 G +17:35110046 2524641553 G 89 267 G +17:35110047 2524641554 G 90 268 C +17:35110048 2524641555 C L 90 269 T +17:35110049 2524641556 T 90 270 C +17:35110050 2524641557 C 91 271 C +17:35110051 2524641558 C L 91 272 T +17:35110052 2524641559 T 91 273 C +17:35110053 2524641560 C 92 274 C +17:35110054 2524641561 C L 92 275 T +17:35110055 2524641562 T 92 276 C +17:35110056 2524641563 C 93 277 G +17:35110057 2524641564 G A 93 
278 C +17:35110058 2524641565 C 93 279 C +17:35110059 2524641566 C 94 280 C +17:35110060 2524641567 C L 94 281 T +17:35110061 2524641568 T 94 282 C +17:35110062 2524641569 C 95 283 T +17:35110063 2524641570 T L 95 284 T +17:35110064 2524641571 T 95 285 G +17:35110065 2524641572 G 96 286 C +17:35110066 2524641573 C P 96 287 C +17:35110067 2524641574 C 96 288 C +17:35110068 2524641575 C 97 289 C +17:35110069 2524641576 C P 97 290 C +17:35110070 2524641577 C 97 291 C +17:35110071 2524641578 C 98 292 G +17:35110072 2524641579 G G 98 293 G +17:35110073 2524641580 G 98 294 A +17:35110074 2524641581 A 99 295 G +17:35110075 2524641582 G A 99 296 C +17:35110076 2524641583 C 99 297 C +17:35110077 2524641584 C 100 298 G +17:35110078 2524641585 G A 100 299 C +17:35110079 2524641586 C 100 300 G +17:35110080 2524641587 G 101 301 A +17:35110081 2524641588 A S 101 302 G +17:35110082 2524641589 G 101 303 C +17:35110083 2524641590 C 102 304 A +17:35110084 2524641591 A T 102 305 C +17:35110085 2524641592 C 102 306 C +17:35110086 2524641593 C 103 307 C +17:35110087 2524641594 C Q 103 308 A +17:35110088 2524641595 A 103 309 A +17:35110089 2524641596 A 104 310 G +17:35110090 2524641597 G V 104 311 T +17:35116769 2524648276 T 104 312 G +17:35116770 2524648277 G 105 313 T +17:35116771 2524648278 T C 105 314 G +17:35116772 2524648279 G 105 315 C +17:35116773 2524648280 C 106 316 A +17:35116774 2524648281 A T 106 317 C +17:35116775 2524648282 C 106 318 C +17:35116776 2524648283 C 107 319 G +17:35116777 2524648284 G G 107 320 G +17:35116778 2524648285 G 107 321 C +17:35116779 2524648286 C 108 322 A +17:35116780 2524648287 A T 108 323 C +17:35116781 2524648288 C 108 324 A +17:35116782 2524648289 A 109 325 G +17:35116783 2524648290 G D 109 326 A +17:35116784 2524648291 A 109 327 C +17:35116785 2524648292 C 110 328 A +17:35116786 2524648293 A M 110 329 T +17:35116787 2524648294 T 110 330 G +17:35116788 2524648295 G 111 331 A +17:35116789 2524648296 A K 111 332 A +17:35116790 2524648297 A 111 
333 G +17:35116791 2524648298 G 112 334 C +17:35116792 2524648299 C L 112 335 T +17:35116793 2524648300 T 112 336 G +17:35116794 2524648301 G 113 337 C +17:35116795 2524648302 C R 113 338 G +17:35116796 2524648303 G 113 339 G +17:35116797 2524648304 G 114 340 C +17:35116798 2524648305 C L 114 341 T +17:35116799 2524648306 T 114 342 C +17:35116800 2524648307 C 115 343 C +17:35116801 2524648308 C P 115 344 C +17:35116802 2524648309 C 115 345 T +17:35116803 2524648310 T 116 346 G +17:35116804 2524648311 G A 116 347 C +17:35116805 2524648312 C 116 348 C +17:35116806 2524648313 C 117 349 A +17:35116807 2524648314 A S 117 350 G +17:35116808 2524648315 G 117 351 T +17:35116809 2524648316 T 118 352 C +17:35116810 2524648317 C P 118 353 C +17:35116811 2524648318 C 118 354 C +17:35116812 2524648319 C 119 355 G +17:35116813 2524648320 G E 119 356 A +17:35116814 2524648321 A 119 357 G +17:35116815 2524648322 G 120 358 A +17:35116816 2524648323 A T 120 359 C +17:35116817 2524648324 C 120 360 C +17:35116818 2524648325 C 121 361 C +17:35116819 2524648326 C H 121 362 A +17:35116820 2524648327 A 121 363 C +17:35116821 2524648328 C 122 364 C +17:35116822 2524648329 C L 122 365 T +17:35116823 2524648330 T 122 366 G +17:35116824 2524648331 G 123 367 G +17:35116825 2524648332 G D 123 368 A +17:35116826 2524648333 A 123 369 C +17:35116827 2524648334 C 124 370 A +17:35116828 2524648335 A M 124 371 T +17:35116829 2524648336 T 124 372 G +17:35116830 2524648337 G 125 373 C +17:35116831 2524648338 C L 125 374 T +17:35116832 2524648339 T 125 375 C +17:35116833 2524648340 C 126 376 C +17:35116834 2524648341 C R 126 377 G +17:35116835 2524648342 G 126 378 C +17:35116836 2524648343 C 127 379 C +17:35116837 2524648344 C H 127 380 A +17:35116838 2524648345 A 127 381 C +17:35116839 2524648346 C 128 382 C +17:35116840 2524648347 C L 128 383 T +17:35116841 2524648348 T 128 384 C +17:35116842 2524648349 C 129 385 T +17:35116843 2524648350 T Y 129 386 A +17:35116844 2524648351 A 129 387 C +17:35116845 
2524648352 C 130 388 C +17:35116846 2524648353 C Q 130 389 A +17:35116847 2524648354 A 130 390 G +17:35116848 2524648355 G 131 391 G +17:35116849 2524648356 G G 131 392 G +17:35116850 2524648357 G 131 393 C +17:35116851 2524648358 C 132 394 T +17:35116852 2524648359 T C 132 395 G +17:35116853 2524648360 G 132 396 C +17:35116854 2524648361 C 133 397 C +17:35116855 2524648362 C Q 133 398 A +17:35116856 2524648363 A 133 399 G +17:35116857 2524648364 G 134 400 G +17:35116858 2524648365 G V 134 401 T +17:35116859 2524648366 T 134 402 G +17:35116860 2524648367 G 135 403 G +17:35116861 2524648368 G V 135 404 T +17:35116862 2524648369 T 135 405 G +17:35116863 2524648370 G 136 406 C +17:35116864 2524648371 C Q 136 407 A +17:35116865 2524648372 A 136 408 G +17:35116866 2524648373 G 137 409 G +17:35116867 2524648374 G G 137 410 G +17:35116868 2524648375 G 137 411 A +17:35116869 2524648376 A 138 412 A +17:35116870 2524648377 A N 138 413 A +17:35116871 2524648378 A 138 414 C +17:35116872 2524648379 C 139 415 C +17:35116873 2524648380 C L 139 416 T +17:35116874 2524648381 T 139 417 G +17:35116875 2524648382 G 140 418 G +17:35116876 2524648383 G E 140 419 A +17:35116877 2524648384 A 140 420 A +17:35116878 2524648385 A 141 421 C +17:35116879 2524648386 C L 141 422 T +17:35116880 2524648387 T 141 423 C +17:35116881 2524648388 C 142 424 A +17:35116882 2524648389 A T 142 425 C +17:35116883 2524648390 C 142 426 C +17:35116884 2524648391 C 143 427 T +17:35116885 2524648392 T Y 143 428 A +17:35116886 2524648393 A 143 429 C +17:35116887 2524648394 C 144 430 C +17:35116888 2524648395 C L 144 431 T +17:35116889 2524648396 T 144 432 G +17:35116890 2524648397 G 145 433 C +17:35116891 2524648398 C P 145 434 C +17:35116892 2524648399 C 145 435 C +17:35116893 2524648400 C 146 436 A +17:35116894 2524648401 A T 146 437 C +17:35116895 2524648402 C 146 438 C +17:35116896 2524648403 C 147 439 A +17:35116897 2524648404 A N 147 440 A +17:35116898 2524648405 A 147 441 T +17:35116899 2524648406 T 148 
442 G +17:35116900 2524648407 G A 148 443 C +17:35116901 2524648408 C 148 444 C +17:35116902 2524648409 C 149 445 A +17:35116903 2524648410 A S 149 446 G +17:35116904 2524648411 G 149 447 C +17:35116905 2524648412 C 150 448 C +17:35116906 2524648413 C L 150 449 T +17:35116907 2524648414 T 150 450 G +17:35116908 2524648415 G 151 451 T +17:35116909 2524648416 T S 151 452 C +17:35116910 2524648417 C 151 453 C +17:35116911 2524648418 C 152 454 T +17:35116912 2524648419 T F 152 455 T +17:35116913 2524648420 T 152 456 C +17:35116914 2524648421 C 153 457 C +17:35116915 2524648422 C L 153 458 T +17:35116916 2524648423 T 153 459 G +17:35116917 2524648424 G 154 460 C +17:35116918 2524648425 C Q 154 461 A +17:35116919 2524648426 A 154 462 G +17:35116920 2524648427 G 155 463 G +17:35118100 2524649607 G D 155 464 A +17:35118101 2524649608 A 155 465 T +17:35118102 2524649609 T 156 466 A +17:35118103 2524649610 A I 156 467 T +17:35118104 2524649611 T 156 468 C +17:35118105 2524649612 C 157 469 C +17:35118106 2524649613 C Q 157 470 A +17:35118107 2524649614 A 157 471 G +17:35118108 2524649615 G 158 472 G +17:35118109 2524649616 G E 158 473 A +17:35118110 2524649617 A 158 474 G +17:35118111 2524649618 G 159 475 G +17:35118112 2524649619 G V 159 476 T +17:35118113 2524649620 T 159 477 G +17:35118114 2524649621 G 160 478 C +17:35118115 2524649622 C Q 160 479 A +17:35118116 2524649623 A 160 480 G +17:35118117 2524649624 G 161 481 G +17:35118118 2524649625 G G 161 482 G +17:35118119 2524649626 G 161 483 C +17:35118120 2524649627 C 162 484 T +17:35118121 2524649628 T Y 162 485 A +17:35118122 2524649629 A 162 486 C +17:35118123 2524649630 C 163 487 G +17:35118124 2524649631 G V 163 488 T +17:35118125 2524649632 T 163 489 G +17:35118126 2524649633 G 164 490 C +17:35118127 2524649634 C L 164 491 T +17:35118128 2524649635 T 164 492 C +17:35118129 2524649636 C 165 493 A +17:35118130 2524649637 A I 165 494 T +17:35118131 2524649638 T 165 495 C +17:35118132 2524649639 C 166 496 G +17:35118133 
2524649640 G A 166 497 C +17:35118134 2524649641 C 166 498 T +17:35118135 2524649642 T 167 499 C +17:35118136 2524649643 C H 167 500 A +17:35118137 2524649644 A 167 501 C +17:35118138 2524649645 C 168 502 A +17:35118139 2524649646 A N 168 503 A +17:35118140 2524649647 A 168 504 C +17:35118141 2524649648 C 169 505 C +17:35118142 2524649649 C Q 169 506 A +17:35118143 2524649650 A 169 507 A +17:35118144 2524649651 A 170 508 G +17:35118145 2524649652 G V 170 509 T +17:35118146 2524649653 T 170 510 G +17:35118147 2524649654 G 171 511 A +17:35118148 2524649655 A R 171 512 G +17:35118149 2524649656 G 171 513 G +17:35118150 2524649657 G 172 514 C +17:35118151 2524649658 C Q 172 515 A +17:35118152 2524649659 A 172 516 G +17:35118153 2524649660 G 173 517 G +17:35118154 2524649661 G V 173 518 T +17:35118155 2524649662 T 173 519 C +17:35118156 2524649663 C 174 520 C +17:35118157 2524649664 C P 174 521 C +17:35118158 2524649665 C 174 522 A +17:35118159 2524649666 A 175 523 C +17:35118160 2524649667 C L 175 524 T +17:35118161 2524649668 T 175 525 G +17:35118162 2524649669 G 176 526 C +17:35118163 2524649670 C Q 176 527 A +17:35118164 2524649671 A 176 528 G +17:35118165 2524649672 G 177 529 A +17:35118166 2524649673 A R 177 530 G +17:35118167 2524649674 G 177 531 G +17:35118168 2524649675 G 178 532 C +17:35118169 2524649676 C L 178 533 T +17:35118170 2524649677 T 178 534 G +17:35118171 2524649678 G 179 535 C +17:35118172 2524649679 C R 179 536 G +17:35118173 2524649680 G 179 537 G +17:35118174 2524649681 G 180 538 A +17:35118175 2524649682 A I 180 539 T +17:35118176 2524649683 T 180 540 T +17:35118177 2524649684 T 181 541 G +17:35118178 2524649685 G V 181 542 T +17:35118179 2524649686 T 181 543 G +17:35118180 2524649687 G 182 544 C +17:35118181 2524649688 C R 182 545 G +17:35118182 2524649689 G 182 546 A +17:35118183 2524649690 A 183 547 G +17:35118184 2524649691 G G 183 548 G +17:35118185 2524649692 G 183 549 C +17:35118186 2524649693 C 184 550 A +17:35118187 2524649694 A T 184 
551 C +17:35118188 2524649695 C 184 552 C +17:35118189 2524649696 C 185 553 C +17:35118190 2524649697 C Q 185 554 A +17:35118191 2524649698 A 185 555 G +17:35118192 2524649699 G 186 556 C +17:35118193 2524649700 C L 186 557 T +17:35118194 2524649701 T 186 558 C +17:35118195 2524649702 C 187 559 T +17:35118196 2524649703 T F 187 560 T +17:35118197 2524649704 T 187 561 T +17:35118198 2524649705 T 188 562 G +17:35118199 2524649706 G E 188 563 A +17:35118200 2524649707 A 188 564 G +17:35118201 2524649708 G 189 565 G +17:35118202 2524649709 G D 189 566 A +17:35118203 2524649710 A 189 567 C +17:35118204 2524649711 C 190 568 A +17:35118205 2524649712 A N 190 569 A +17:35118206 2524649713 A 190 570 C +17:35118207 2524649714 C 191 571 T +17:35118208 2524649715 T Y 191 572 A +17:35118209 2524649716 A 191 573 T +17:35118210 2524649717 T 192 574 G +17:35118211 2524649718 G A 192 575 C +17:35118212 2524649719 C 192 576 C +17:35118213 2524649720 C 193 577 C +17:35118214 2524649721 C L 193 578 T +17:35118215 2524649722 T 193 579 G +17:35118216 2524649723 G 194 580 G +17:35118217 2524649724 G A 194 581 C +17:35118218 2524649725 C 194 582 C +17:35118219 2524649726 C 195 583 G +17:35118220 2524649727 G V 195 584 T +17:35118221 2524649728 T 195 585 G +17:35118222 2524649729 G 196 586 C +17:35118223 2524649730 C L 196 587 T +17:35118224 2524649731 T 196 588 A +17:35118225 2524649732 A 197 589 G +17:35118226 2524649733 G D 197 590 A +17:35118227 2524649734 A 197 591 C +17:35118228 2524649735 C 198 592 A +17:35118229 2524649736 A N 198 593 A +17:35118230 2524649737 A 198 594 T +17:35118231 2524649738 T 199 595 G +17:35118232 2524649739 G G 199 596 G +17:35118233 2524649740 G 199 597 A +17:35118234 2524649741 A 200 598 G +17:35118235 2524649742 G D 200 599 A +17:35118236 2524649743 A 200 600 C +17:35118237 2524649744 C 201 601 C +17:35118238 2524649745 C P 201 602 C +17:35118239 2524649746 C 201 603 G +17:35118240 2524649747 G 202 604 C +17:35118241 2524649748 C L 202 605 T +17:35118242 
2524649749 T 202 606 G +17:35118243 2524649750 G 203 607 A +17:35118244 2524649751 A N 203 608 A +17:35118245 2524649752 A 203 609 C +17:35118246 2524649753 C 204 610 A +17:35118247 2524649754 A N 204 611 A +17:35118248 2524649755 A 204 612 T +17:35118249 2524649756 T 205 613 A +17:35118250 2524649757 A T 205 614 C +17:35118251 2524649758 C 205 615 C +17:35118252 2524649759 C 206 616 A +17:35118253 2524649760 A T 206 617 C +17:35118254 2524649761 C 206 618 C +17:35118255 2524649762 C 207 619 C +17:35118256 2524649763 C P 207 620 C +17:35118257 2524649764 C 207 621 T +17:35118258 2524649765 T 208 622 G +17:35118259 2524649766 G V 208 623 T +17:35118260 2524649767 T 208 624 C +17:35118261 2524649768 C 209 625 A +17:35118262 2524649769 A T 209 626 C +17:35118263 2524649770 C 209 627 A +17:35118264 2524649771 A 210 628 G +17:35118265 2524649772 G G 210 629 G +17:35118266 2524649773 G 210 630 G +17:35118267 2524649774 G 211 631 G +17:35118268 2524649775 G A 211 632 C +17:35118269 2524649776 C 211 633 C +17:35118270 2524649777 C 212 634 T +17:35118271 2524649778 T S 212 635 C +17:35118272 2524649779 C 212 636 C +17:35118273 2524649780 C 213 637 C +17:35118274 2524649781 C P 213 638 C +17:35118275 2524649782 C 213 639 A +17:35118276 2524649783 A 214 640 G +17:35118277 2524649784 G G 214 641 G +17:35118278 2524649785 G 214 642 A +17:35118279 2524649786 A 215 643 G +17:35118280 2524649787 G G 215 644 G +17:35118281 2524649788 G 215 645 C +17:35118282 2524649789 C 216 646 C +17:35118283 2524649790 C L 216 647 T +17:35118284 2524649791 T 216 648 G +17:35118285 2524649792 G 217 649 C +17:35118286 2524649793 C R 217 650 G +17:35118287 2524649794 G 217 651 G +17:35118288 2524649795 G 218 652 G +17:35118289 2524649796 G E 218 653 A +17:35118290 2524649797 A 218 654 G +17:35118291 2524649798 G 219 655 C +17:35118292 2524649799 C L 219 656 T +17:35118293 2524649800 T 219 657 G +17:35118294 2524649801 G 220 658 C +17:35118295 2524649802 C Q 220 659 A +17:35118296 2524649803 A 220 
660 G +17:35118297 2524649804 G 221 661 C +17:35118298 2524649805 C L 221 662 T +17:35118299 2524649806 T 221 663 T +17:35118300 2524649807 T 222 664 C +17:35118301 2524649808 C R 222 665 G +17:35118302 2524649809 G 222 666 A +17:35118303 2524649810 A 223 667 A +17:35118304 2524649811 A S 223 668 G +17:35118305 2524649812 G 223 669 C +17:35118306 2524649813 C 224 670 C +17:35118307 2524649814 C L 224 671 T +17:35118308 2524649815 T 224 672 C +17:35118309 2524649816 C 225 673 A +17:35118310 2524649817 A T 225 674 C +17:35118311 2524649818 C 225 675 A +17:35118312 2524649819 A 226 676 G +17:35118313 2524649820 G E 226 677 A +17:35119097 2524650604 A 226 678 G +17:35119098 2524650605 G 227 679 A +17:35119099 2524650606 A I 227 680 T +17:35119100 2524650607 T 227 681 C +17:35119101 2524650608 C 228 682 T +17:35119102 2524650609 T L 228 683 T +17:35119103 2524650610 T 228 684 G +17:35119104 2524650611 G 229 685 A +17:35119105 2524650612 A K 229 686 A +17:35119106 2524650613 A 229 687 A +17:35119107 2524650614 A 230 688 G +17:35119108 2524650615 G G 230 689 G +17:35119109 2524650616 G 230 690 A +17:35119110 2524650617 A 231 691 G +17:35119111 2524650618 G G 231 692 G +17:35119112 2524650619 G 231 693 G +17:35119113 2524650620 G 232 694 G +17:35119114 2524650621 G V 232 695 T +17:35119115 2524650622 T 232 696 C +17:35119116 2524650623 C 233 697 T +17:35119117 2524650624 T L 233 698 T +17:35119118 2524650625 T 233 699 G +17:35119119 2524650626 G 234 700 A +17:35119120 2524650627 A I 234 701 T +17:35119121 2524650628 T 234 702 C +17:35119122 2524650629 C 235 703 C +17:35119123 2524650630 C Q 235 704 A +17:35119124 2524650631 A 235 705 G +17:35119125 2524650632 G 236 706 C +17:35119126 2524650633 C R 236 707 G +17:35119127 2524650634 G 236 708 G +17:35119128 2524650635 G 237 709 A +17:35119129 2524650636 A N 237 710 A +17:35119130 2524650637 A 237 711 C +17:35119131 2524650638 C 238 712 C +17:35119132 2524650639 C P 238 713 C +17:35119133 2524650640 C 238 714 C +17:35119134 
2524650641 C 239 715 C +17:35119135 2524650642 C Q 239 716 A +17:35119136 2524650643 A 239 717 G +17:35119137 2524650644 G 240 718 C +17:35119138 2524650645 C L 240 719 T +17:35119139 2524650646 T 240 720 C +17:35119140 2524650647 C 241 721 T +17:35119141 2524650648 T C 241 722 G +17:35119142 2524650649 G 241 723 C +17:35119143 2524650650 C 242 724 T +17:35119144 2524650651 T Y 242 725 A +17:35119145 2524650652 A 242 726 C +17:35119146 2524650653 C 243 727 C +17:35119147 2524650654 C Q 243 728 A +17:35119148 2524650655 A 243 729 G +17:35119149 2524650656 G 244 730 G +17:35119150 2524650657 G D 244 731 A +17:35119151 2524650658 A 244 732 C +17:35119152 2524650659 C 245 733 A +17:35119153 2524650660 A T 245 734 C +17:35119154 2524650661 C 245 735 G +17:35119155 2524650662 G 246 736 A +17:35119156 2524650663 A I 246 737 T +17:35119157 2524650664 T 246 738 T +17:35119158 2524650665 T 247 739 T +17:35119159 2524650666 T L 247 740 T +17:35119160 2524650667 T 247 741 G +17:35119161 2524650668 G 248 742 T +17:35119162 2524650669 T W 248 743 G +17:35119163 2524650670 G 248 744 G +17:35119164 2524650671 G 249 745 A +17:35119165 2524650672 A K 249 746 A +17:35119166 2524650673 A 249 747 G +17:35119167 2524650674 G 250 748 G +17:35119168 2524650675 G D 250 749 A +17:35119169 2524650676 A 250 750 C +17:35119170 2524650677 C 251 751 A +17:35119171 2524650678 A I 251 752 T +17:35119172 2524650679 T 251 753 C +17:35119173 2524650680 C 252 754 T +17:35119174 2524650681 T F 252 755 T +17:35119175 2524650682 T 252 756 C +17:35119176 2524650683 C 253 757 C +17:35119177 2524650684 C H 253 758 A +17:35119178 2524650685 A 253 759 C +17:35119179 2524650686 C 254 760 A +17:35119180 2524650687 A K 254 761 A +17:35119181 2524650688 A 254 762 G +17:35119182 2524650689 G 255 763 A +17:35119183 2524650690 A N 255 764 A +17:35119184 2524650691 A 255 765 C +17:35119185 2524650692 C 256 766 A +17:35119186 2524650693 A N 256 767 A +17:35119187 2524650694 A 256 768 C +17:35119188 2524650695 C 257 
769 C +17:35119189 2524650696 C Q 257 770 A +17:35119190 2524650697 A 257 771 G +17:35119191 2524650698 G 258 772 C +17:35119192 2524650699 C L 258 773 T +17:35119193 2524650700 T 258 774 G +17:35119194 2524650701 G 259 775 G +17:35119195 2524650702 G A 259 776 C +17:35119196 2524650703 C 259 777 T +17:35119197 2524650704 T 260 778 C +17:35119198 2524650705 C L 260 779 T +17:35119199 2524650706 T 260 780 C +17:35119200 2524650707 C 261 781 A +17:35119201 2524650708 A T 261 782 C +17:35119202 2524650709 C 261 783 A +17:35119203 2524650710 A 262 784 C +17:35119204 2524650711 C L 262 785 T +17:35119205 2524650712 T 262 786 G +17:35119206 2524650713 G 263 787 A +17:35119207 2524650714 A I 263 788 T +17:35119208 2524650715 T 263 789 A +17:35119209 2524650716 A 264 790 G +17:35119210 2524650717 G D 264 791 A +17:35119211 2524650718 A 264 792 C +17:35119212 2524650719 C 265 793 A +17:35119213 2524650720 A T 265 794 C +17:35119214 2524650721 C 265 795 C +17:35119215 2524650722 C 266 796 A +17:35119216 2524650723 A N 266 797 A +17:35119217 2524650724 A 266 798 C +17:35119218 2524650725 C 267 799 C +17:35119219 2524650726 C R 267 800 G +17:35119220 2524650727 G 267 801 C +17:35119221 2524650728 C 268 802 T +17:35119222 2524650729 T S 268 803 C +17:35119223 2524650730 C 268 804 T +17:35119224 2524650731 T 269 805 C +17:35119225 2524650732 C R 269 806 G +17:35119226 2524650733 G 269 807 G +17:35119227 2524650734 G 270 808 G +17:35119228 2524650735 G A 270 809 C +17:35119229 2524650736 C 270 810 C +17:35119230 2524650737 C 271 811 T +17:35119231 2524650738 T C 271 812 G +17:35119592 2524651099 G 271 813 C +17:35119593 2524651100 C 272 814 C +17:35119594 2524651101 C H 272 815 A +17:35119595 2524651102 A 272 816 C +17:35119596 2524651103 C 273 817 C +17:35119597 2524651104 C P 273 818 C +17:35119598 2524651105 C 273 819 C +17:35119599 2524651106 C 274 820 T +17:35119600 2524651107 T C 274 821 G +17:35119601 2524651108 G 274 822 T +17:35119602 2524651109 T 275 823 T +17:35119603 
2524651110 T S 275 824 C +17:35119604 2524651111 C 275 825 T +17:35119605 2524651112 T 276 826 C +17:35119606 2524651113 C P 276 827 C +17:35119607 2524651114 C 276 828 G +17:35119608 2524651115 G 277 829 A +17:35119609 2524651116 A M 277 830 T +17:35119610 2524651117 T 277 831 G +17:35119611 2524651118 G 278 832 T +17:35119612 2524651119 T C 278 833 G +17:35119613 2524651120 G 278 834 T +17:35119614 2524651121 T 279 835 A +17:35119615 2524651122 A K 279 836 A +17:35119616 2524651123 A 279 837 G +17:35119617 2524651124 G 280 838 G +17:35119618 2524651125 G G 280 839 G +17:35119619 2524651126 G 280 840 C +17:35119620 2524651127 C 281 841 T +17:35119621 2524651128 T S 281 842 C +17:35119622 2524651129 C 281 843 C +17:35119623 2524651130 C 282 844 C +17:35119624 2524651131 C R 282 845 G +17:35119625 2524651132 G 282 846 C +17:35119626 2524651133 C 283 847 T +17:35119627 2524651134 T C 283 848 G +17:35119628 2524651135 G 283 849 C +17:35119629 2524651136 C 284 850 T +17:35119630 2524651137 T W 284 851 G +17:35119631 2524651138 G 284 852 G +17:35119632 2524651139 G 285 853 G +17:35119633 2524651140 G G 285 854 G +17:35119634 2524651141 G 285 855 A +17:35119635 2524651142 A 286 856 G +17:35119636 2524651143 G E 286 857 A +17:35119637 2524651144 A 286 858 G +17:35119638 2524651145 G 287 859 A +17:35119639 2524651146 A S 287 860 G +17:35119640 2524651147 G 287 861 T +17:35119641 2524651148 T 288 862 T +17:35119642 2524651149 T S 288 863 C +17:35119643 2524651150 C 288 864 T +17:35119644 2524651151 T 289 865 G +17:35119645 2524651152 G E 289 866 A +17:35119646 2524651153 A 289 867 G +17:35119647 2524651154 G 290 868 G +17:35119648 2524651155 G D 290 869 A +17:35119649 2524651156 A 290 870 T +17:35119650 2524651157 T 291 871 T +17:35119651 2524651158 T C 291 872 G +17:35119652 2524651159 G 291 873 T +17:35119653 2524651160 T 292 874 C +17:35119654 2524651161 C Q 292 875 A +17:35119655 2524651162 A 292 876 G +17:35119656 2524651163 G 293 877 A +17:35119657 2524651164 A S 293 
878 G +17:35119658 2524651165 G 293 879 C +17:35119659 2524651166 C 294 880 C +17:35119660 2524651167 C L 294 881 T +17:35119865 2524651372 T 294 882 G +17:35119866 2524651373 G 295 883 A +17:35119867 2524651374 A T 295 884 C +17:35119868 2524651375 C 295 885 G +17:35119869 2524651376 G 296 886 C +17:35119870 2524651377 C R 296 887 G +17:35119871 2524651378 G 296 888 C +17:35119872 2524651379 C 297 889 A +17:35119873 2524651380 A T 297 890 C +17:35119874 2524651381 C 297 891 T +17:35119875 2524651382 T 298 892 G +17:35119876 2524651383 G V 298 893 T +17:35119877 2524651384 T 298 894 C +17:35119878 2524651385 C 299 895 T +17:35119879 2524651386 T C 299 896 G +17:35119880 2524651387 G 299 897 T +17:35119881 2524651388 T 300 898 G +17:35119882 2524651389 G A 300 899 C +17:35119883 2524651390 C 300 900 C +17:35119884 2524651391 C 301 901 G +17:35119885 2524651392 G G 301 902 G +17:35119886 2524651393 G 301 903 T +17:35119887 2524651394 T 302 904 G +17:35119888 2524651395 G G 302 905 G +17:35119889 2524651396 G 302 906 C +17:35119890 2524651397 C 303 907 T +17:35119891 2524651398 T C 303 908 G +17:35119892 2524651399 G 303 909 T +17:35119893 2524651400 T 304 910 G +17:35119894 2524651401 G A 304 911 C +17:35119895 2524651402 C 304 912 C +17:35119896 2524651403 C 305 913 C +17:35119897 2524651404 C R 305 914 G +17:35119898 2524651405 G 305 915 C +17:35119899 2524651406 C 306 916 T +17:35119900 2524651407 T C 306 917 G +17:35119901 2524651408 G 306 918 C +17:35119902 2524651409 C 307 919 A +17:35119903 2524651410 A K 307 920 A +17:35119904 2524651411 A 307 921 G +17:35119905 2524651412 G 308 922 G +17:35119906 2524651413 G G 308 923 G +17:35119907 2524651414 G 308 924 G +17:35119908 2524651415 G 309 925 C +17:35119909 2524651416 C P 309 926 C +17:35119910 2524651417 C 309 927 A +17:35119911 2524651418 A 310 928 C +17:35119912 2524651419 C L 310 929 T +17:35119913 2524651420 T 310 930 G +17:35119914 2524651421 G 311 931 C +17:35119915 2524651422 C P 311 932 C +17:35119916 
2524651423 C 311 933 C +17:35119917 2524651424 C 312 934 A +17:35119918 2524651425 A T 312 935 C +17:35119919 2524651426 C 312 936 T +17:35119920 2524651427 T 313 937 G +17:35119921 2524651428 G D 313 938 A +17:35119922 2524651429 A 313 939 C +17:35119923 2524651430 C 314 940 T +17:35119924 2524651431 T C 314 941 G +17:35119925 2524651432 G 314 942 C +17:35119926 2524651433 C 315 943 T +17:35119927 2524651434 T C 315 944 G +17:35119928 2524651435 G 315 945 C +17:35119929 2524651436 C 316 946 C +17:35119930 2524651437 C H 316 947 A +17:35119931 2524651438 A 316 948 T +17:35119932 2524651439 T 317 949 G +17:35119933 2524651440 G E 317 950 A +17:35119934 2524651441 A 317 951 G +17:35119935 2524651442 G 318 952 C +17:35119936 2524651443 C Q 318 953 A +17:35119937 2524651444 A 318 954 G +17:35119938 2524651445 G 319 955 T +17:35119939 2524651446 T C 319 956 G +17:35119940 2524651447 G 319 957 T +17:35119941 2524651448 T 320 958 G +17:35119942 2524651449 G A 320 959 C +17:35119943 2524651450 C 320 960 T +17:35119944 2524651451 T 321 961 G +17:35119945 2524651452 G A 321 962 C +17:35119946 2524651453 C 321 963 C +17:35119947 2524651454 C 322 964 G +17:35119948 2524651455 G G 322 965 G +17:35119949 2524651456 G 322 966 C +17:35119950 2524651457 C 323 967 T +17:35119951 2524651458 T C 323 968 G +17:35119952 2524651459 G 323 969 C +17:35119953 2524651460 C 324 970 A +17:35119954 2524651461 A T 324 971 C +17:35119955 2524651462 C 324 972 G +17:35119956 2524651463 G 325 973 G +17:35119957 2524651464 G G 325 974 G +17:35119958 2524651465 G 325 975 C +17:35119959 2524651466 C 326 976 C +17:35119960 2524651467 C P 326 977 C +17:35119961 2524651468 C 326 978 C +17:35119962 2524651469 C 327 979 A +17:35119963 2524651470 A K 327 980 A +17:35119964 2524651471 A 327 981 G +17:35119965 2524651472 G 328 982 C +17:35119966 2524651473 C H 328 983 A +17:35119967 2524651474 A 328 984 C +17:35119968 2524651475 C 329 985 T +17:35119969 2524651476 T S 329 986 C +17:35119970 2524651477 C 329 
987 T +17:35119971 2524651478 T 330 988 G +17:35119972 2524651479 G D 330 989 A +17:35119973 2524651480 A 330 990 C +17:35119974 2524651481 C 331 991 T +17:35119975 2524651482 T C 331 992 G +17:35119976 2524651483 G 331 993 C +17:35119977 2524651484 C 332 994 C +17:35119978 2524651485 C L 332 995 T +17:35119979 2524651486 T 332 996 G +17:35119980 2524651487 G 333 997 G +17:35120119 2524651626 G A 333 998 C +17:35120120 2524651627 C 333 999 C +17:35120121 2524651628 C 334 1000 T +17:35120122 2524651629 T C 334 1001 G +17:35120123 2524651630 G 334 1002 C +17:35120124 2524651631 C 335 1003 C +17:35120125 2524651632 C L 335 1004 T +17:35120126 2524651633 T 335 1005 C +17:35120127 2524651634 C 336 1006 C +17:35120128 2524651635 C H 336 1007 A +17:35120129 2524651636 A 336 1008 C +17:35120130 2524651637 C 337 1009 T +17:35120131 2524651638 T F 337 1010 T +17:35120132 2524651639 T 337 1011 C +17:35120133 2524651640 C 338 1012 A +17:35120134 2524651641 A N 338 1013 A +17:35120135 2524651642 A 338 1014 C +17:35120136 2524651643 C 339 1015 C +17:35120137 2524651644 C H 339 1016 A +17:35120138 2524651645 A 339 1017 C +17:35120139 2524651646 C 340 1018 A +17:35120140 2524651647 A S 340 1019 G +17:35120141 2524651648 G 340 1020 T +17:35120142 2524651649 T 341 1021 G +17:35120143 2524651650 G G 341 1022 G +17:35120144 2524651651 G 341 1023 C +17:35120145 2524651652 C 342 1024 A +17:35120146 2524651653 A I 342 1025 T +17:35120147 2524651654 T 342 1026 C +17:35120148 2524651655 C 343 1027 T +17:35120149 2524651656 T C 343 1028 G +17:35120150 2524651657 G 343 1029 T +17:35120151 2524651658 T 344 1030 G +17:35120152 2524651659 G E 344 1031 A +17:35120153 2524651660 A 344 1032 G +17:35120154 2524651661 G 345 1033 C +17:35120155 2524651662 C L 345 1034 T +17:35120156 2524651663 T 345 1035 G +17:35120157 2524651664 G 346 1036 C +17:35120158 2524651665 C H 346 1037 A +17:35120159 2524651666 A 346 1038 C +17:35120160 2524651667 C 347 1039 T +17:35120161 2524651668 T C 347 1040 G 
+17:35120162 2524651669 G 347 1041 C +17:35120163 2524651670 C 348 1042 C +17:35120164 2524651671 C P 348 1043 C +17:35120165 2524651672 C 348 1044 A +17:35120166 2524651673 A 349 1045 G +17:35120167 2524651674 G A 349 1046 C +17:35120168 2524651675 C 349 1047 C +17:35120169 2524651676 C 350 1048 C +17:35120170 2524651677 C L 350 1049 T +17:35120171 2524651678 T 350 1050 G +17:35120172 2524651679 G 351 1051 G +17:35120173 2524651680 G V 351 1052 T +17:35120174 2524651681 T 351 1053 C +17:35120175 2524651682 C 352 1054 A +17:35120176 2524651683 A T 352 1055 C +17:35120177 2524651684 C 352 1056 C +17:35120178 2524651685 C 353 1057 T +17:35120179 2524651686 T Y 353 1058 A +17:35120180 2524651687 A 353 1059 C +17:35120181 2524651688 C 354 1060 A +17:35120182 2524651689 A N 354 1061 A +17:35120183 2524651690 A 354 1062 C +17:35120184 2524651691 C 355 1063 A +17:35120185 2524651692 A T 355 1064 C +17:35120186 2524651693 C 355 1065 A +17:35120187 2524651694 A 356 1066 G +17:35120188 2524651695 G D 356 1067 A +17:35120189 2524651696 A 356 1068 C +17:35120190 2524651697 C 357 1069 A +17:35120191 2524651698 A T 357 1070 C +17:35120192 2524651699 C 357 1071 G +17:35120193 2524651700 G 358 1072 T +17:35120194 2524651701 T F 358 1073 T +17:35120195 2524651702 T 358 1074 T +17:35120196 2524651703 T 359 1075 G +17:35120197 2524651704 G E 359 1076 A +17:35120198 2524651705 A 359 1077 G +17:35120199 2524651706 G 360 1078 T +17:35120200 2524651707 T S 360 1079 C +17:35120201 2524651708 C 360 1080 C +17:35120202 2524651709 C 361 1081 A +17:35120203 2524651710 A M 361 1082 T +17:35120204 2524651711 T 361 1083 G +17:35120205 2524651712 G 362 1084 C +17:35120206 2524651713 C P 362 1085 C +17:35120207 2524651714 C 362 1086 C +17:35120208 2524651715 C 363 1087 A +17:35120209 2524651716 A N 363 1088 A +17:35120210 2524651717 A 363 1089 T +17:35120211 2524651718 T 364 1090 C +17:35120212 2524651719 C P 364 1091 C +17:35120213 2524651720 C 364 1092 C +17:35120214 2524651721 C 365 1093 G 
+17:35120215 2524651722 G E 365 1094 A +17:35120216 2524651723 A 365 1095 G +17:35120217 2524651724 G 366 1096 G +17:35120218 2524651725 G G 366 1097 G +17:35120219 2524651726 G 366 1098 C +17:35120220 2524651727 C 367 1099 C +17:35120221 2524651728 C R 367 1100 G +17:35120222 2524651729 G 367 1101 G +17:35120223 2524651730 G 368 1102 T +17:35120224 2524651731 T Y 368 1103 A +17:35120225 2524651732 A 368 1104 T +17:35120226 2524651733 T 369 1105 A +17:35120227 2524651734 A T 369 1106 C +17:35120228 2524651735 C 369 1107 A +17:35120229 2524651736 A 370 1108 T +17:35120230 2524651737 T F 370 1109 T +17:35120231 2524651738 T 370 1110 C +17:35120232 2524651739 C 371 1111 G +17:35120233 2524651740 G G 371 1112 G +17:35120234 2524651741 G 371 1113 C +17:35120235 2524651742 C 372 1114 G +17:35120236 2524651743 G A 372 1115 C +17:35120237 2524651744 C 372 1116 C +17:35120238 2524651745 C 373 1117 A +17:35120239 2524651746 A S 373 1118 G +17:35120240 2524651747 G 373 1119 C +17:35120241 2524651748 C 374 1120 T +17:35120242 2524651749 T C 374 1121 G +17:35120243 2524651750 G 374 1122 T +17:35120244 2524651751 T 375 1123 G +17:35120245 2524651752 G V 375 1124 T +17:35120246 2524651753 T 375 1125 G +17:35120247 2524651754 G 376 1126 A +17:35120248 2524651755 A T 376 1127 C +17:35120249 2524651756 C 376 1128 T +17:35120250 2524651757 T 377 1129 G +17:35120251 2524651758 G A 377 1130 C +17:35120252 2524651759 C 377 1131 C +17:35120253 2524651760 C 378 1132 T +17:35120254 2524651761 T C 378 1133 G +17:35120255 2524651762 G 378 1134 T +17:35120256 2524651763 T 379 1135 C +17:35120257 2524651764 C P 379 1136 C +17:35120258 2524651765 C 379 1137 C +17:35120259 2524651766 C 380 1138 T +17:35120260 2524651767 T Y 380 1139 A +17:35121707 2524653214 A 380 1140 C +17:35121708 2524653215 C 381 1141 A +17:35121709 2524653216 A N 381 1142 A +17:35121710 2524653217 A 381 1143 C +17:35121711 2524653218 C 382 1144 T +17:35121712 2524653219 T Y 382 1145 A +17:35121713 2524653220 A 382 1146 C 
+17:35121714 2524653221 C 383 1147 C +17:35121715 2524653222 C L 383 1148 T +17:35121716 2524653223 T 383 1149 T +17:35121717 2524653224 T 384 1150 T +17:35121718 2524653225 T S 384 1151 C +17:35121719 2524653226 C 384 1152 T +17:35121720 2524653227 T 385 1153 A +17:35121721 2524653228 A T 385 1154 C +17:35121722 2524653229 C 385 1155 G +17:35121723 2524653230 G 386 1156 G +17:35121724 2524653231 G D 386 1157 A +17:35121725 2524653232 A 386 1158 C +17:35121726 2524653233 C 387 1159 G +17:35121727 2524653234 G V 387 1160 T +17:35121728 2524653235 T 387 1161 G +17:35121729 2524653236 G 388 1162 G +17:35121730 2524653237 G G 388 1163 G +17:35121731 2524653238 G 388 1164 A +17:35121732 2524653239 A 389 1165 T +17:35121733 2524653240 T S 389 1166 C +17:35121734 2524653241 C 389 1167 C +17:35121735 2524653242 C 390 1168 T +17:35121736 2524653243 T C 390 1169 G +17:35121737 2524653244 G 390 1170 C +17:35121738 2524653245 C 391 1171 A +17:35121739 2524653246 A T 391 1172 C +17:35121740 2524653247 C 391 1173 C +17:35121741 2524653248 C 392 1174 C +17:35121742 2524653249 C L 392 1175 T +17:35121743 2524653250 T 392 1176 C +17:35121744 2524653251 C 393 1177 G +17:35121745 2524653252 G V 393 1178 T +17:35121746 2524653253 T 393 1179 C +17:35121747 2524653254 C 394 1180 T +17:35121748 2524653255 T C 394 1181 G +17:35121749 2524653256 G 394 1182 C +17:35121750 2524653257 C 395 1183 C +17:35121751 2524653258 C P 395 1184 C +17:35121752 2524653259 C 395 1185 C +17:35121753 2524653260 C 396 1186 C +17:35121754 2524653261 C L 396 1187 T +17:35121755 2524653262 T 396 1188 G +17:35121756 2524653263 G 397 1189 C +17:35121757 2524653264 C H 397 1190 A +17:35121758 2524653265 A 397 1191 C +17:35121759 2524653266 C 398 1192 A +17:35121760 2524653267 A N 398 1193 A +17:35121761 2524653268 A 398 1194 C +17:35121762 2524653269 C 399 1195 C +17:35121763 2524653270 C Q 399 1196 A +17:35121764 2524653271 A 399 1197 A +17:35121765 2524653272 A 400 1198 G +17:35121766 2524653273 G E 400 1199 A 
+17:35121767 2524653274 A 400 1200 G +17:35121768 2524653275 G 401 1201 G +17:35121769 2524653276 G V 401 1202 T +17:35121770 2524653277 T 401 1203 G +17:35121771 2524653278 G 402 1204 A +17:35121772 2524653279 A T 402 1205 C +17:35121773 2524653280 C 402 1206 A +17:35121774 2524653281 A 403 1207 G +17:35121775 2524653282 G A 403 1208 C +17:35121776 2524653283 C 403 1209 A +17:35121777 2524653284 A 404 1210 G +17:35121778 2524653285 G E 404 1211 A +17:35121779 2524653286 A 404 1212 G +17:35121780 2524653287 G 405 1213 G +17:35121781 2524653288 G D 405 1214 A +17:35121782 2524653289 A 405 1215 T +17:35121783 2524653290 T 406 1216 G +17:35121784 2524653291 G G 406 1217 G +17:35121785 2524653292 G 406 1218 A +17:35121786 2524653293 A 407 1219 A +17:35121787 2524653294 A T 407 1220 C +17:35121788 2524653295 C 407 1221 A +17:35121789 2524653296 A 408 1222 C +17:35121790 2524653297 C Q 408 1223 A +17:35121791 2524653298 A 408 1224 G +17:35121792 2524653299 G 409 1225 C +17:35121793 2524653300 C R 409 1226 G +17:35121794 2524653301 G 409 1227 G +17:35121795 2524653302 G 410 1228 T +17:35121796 2524653303 T C 410 1229 G +17:35121797 2524653304 G 410 1230 T +17:35121798 2524653305 T 411 1231 G +17:35121799 2524653306 G E 411 1232 A +17:35121800 2524653307 A 411 1233 G +17:35121801 2524653308 G 412 1234 A +17:35121802 2524653309 A K 412 1235 A +17:35121803 2524653310 A 412 1236 G +17:35121804 2524653311 G 413 1237 T +17:35121805 2524653312 T C 413 1238 G +17:35121806 2524653313 G 413 1239 C +17:35121807 2524653314 C 414 1240 A +17:35121808 2524653315 A S 414 1241 G +17:35121809 2524653316 G 414 1242 C +17:35121810 2524653317 C 415 1243 A +17:35121811 2524653318 A K 415 1244 A +17:35121812 2524653319 A 415 1245 G +17:35121813 2524653320 G 416 1246 C +17:35121814 2524653321 C P 416 1247 C +17:35121815 2524653322 C 416 1248 C +17:35121816 2524653323 C 417 1249 T +17:35121817 2524653324 T C 417 1250 G +17:35121818 2524653325 G 417 1251 T +17:35121819 2524653326 T 418 1252 G 
+17:35121820 2524653327 G A 418 1253 C +17:35121821 2524653328 C 418 1254 C +17:35121822 2524653329 C 419 1255 C +17:35121823 2524653330 C R 419 1256 G +17:35121824 2524653331 G 419 1257 A +17:35121825 2524653332 A 420 1258 G +17:35121826 2524653333 G V 420 1259 T +17:35122101 2524653608 T 420 1260 G +17:35122102 2524653609 G 421 1261 T +17:35122103 2524653610 T C 421 1262 G +17:35122104 2524653611 G 421 1263 C +17:35122105 2524653612 C 422 1264 T +17:35122106 2524653613 T Y 422 1265 A +17:35122107 2524653614 A 422 1266 T +17:35122108 2524653615 T 423 1267 G +17:35122109 2524653616 G G 423 1268 G +17:35122110 2524653617 G 423 1269 T +17:35122111 2524653618 T 424 1270 C +17:35122112 2524653619 C L 424 1271 T +17:35122113 2524653620 T 424 1272 G +17:35122114 2524653621 G 425 1273 G +17:35122115 2524653622 G G 425 1274 G +17:35122116 2524653623 G 425 1275 C +17:35122117 2524653624 C 426 1276 A +17:35122118 2524653625 A M 426 1277 T +17:35122119 2524653626 T 426 1278 G +17:35122120 2524653627 G 427 1279 G +17:35122121 2524653628 G E 427 1280 A +17:35122122 2524653629 A 427 1281 G +17:35122123 2524653630 G 428 1282 C +17:35122124 2524653631 C H 428 1283 A +17:35122125 2524653632 A 428 1284 C +17:35122126 2524653633 C 429 1285 T +17:35122127 2524653634 T L 429 1286 T +17:35122128 2524653635 T 429 1287 G +17:35122129 2524653636 G 430 1288 C +17:35122130 2524653637 C R 430 1289 G +17:35122131 2524653638 G 430 1290 A +17:35122132 2524653639 A 431 1291 G +17:35122133 2524653640 G E 431 1292 A +17:35122134 2524653641 A 431 1293 G +17:35122135 2524653642 G 432 1294 G +17:35122136 2524653643 G V 432 1295 T +17:35122137 2524653644 T 432 1296 G +17:35122138 2524653645 G 433 1297 A +17:35122139 2524653646 A R 433 1298 G +17:35122140 2524653647 G 433 1299 G +17:35122141 2524653648 G 434 1300 G +17:35122142 2524653649 G A 434 1301 C +17:35122143 2524653650 C 434 1302 A +17:35122144 2524653651 A 435 1303 G +17:35122145 2524653652 G V 435 1304 T +17:35122146 2524653653 T 435 1305 T 
+17:35122147 2524653654 T 436 1306 A +17:35122148 2524653655 A T 436 1307 C +17:35122149 2524653656 C 436 1308 C +17:35122150 2524653657 C 437 1309 A +17:35122151 2524653658 A S 437 1310 G +17:35122152 2524653659 G 437 1311 T +17:35122153 2524653660 T 438 1312 G +17:35122154 2524653661 G A 438 1313 C +17:35122155 2524653662 C 438 1314 C +17:35122156 2524653663 C 439 1315 A +17:35122157 2524653664 A N 439 1316 A +17:35122158 2524653665 A 439 1317 T +17:35122159 2524653666 T 440 1318 A +17:35122160 2524653667 A I 440 1319 T +17:35122161 2524653668 T 440 1320 C +17:35122162 2524653669 C 441 1321 C +17:35122163 2524653670 C Q 441 1322 A +17:35122164 2524653671 A 441 1323 G +17:35122165 2524653672 G 442 1324 G +17:35122166 2524653673 G E 442 1325 A +17:35122167 2524653674 A 442 1326 G +17:35122168 2524653675 G 443 1327 T +17:35122169 2524653676 T F 443 1328 T +17:35122170 2524653677 T 443 1329 T +17:35122171 2524653678 T 444 1330 G +17:35122172 2524653679 G A 444 1331 C +17:35122173 2524653680 C 444 1332 T +17:35122174 2524653681 T 445 1333 G +17:35122175 2524653682 G G 445 1334 G +17:35122176 2524653683 G 445 1335 C +17:35122177 2524653684 C 446 1336 T +17:35122178 2524653685 T C 446 1337 G +17:35122179 2524653686 G 446 1338 C +17:35122180 2524653687 C 447 1339 A +17:35122181 2524653688 A K 447 1340 A +17:35122182 2524653689 A 447 1341 G +17:35122183 2524653690 G 448 1342 A +17:35122184 2524653691 A K 448 1343 A +17:35122185 2524653692 A 448 1344 G +17:35122186 2524653693 G 449 1345 A +17:35122187 2524653694 A I 449 1346 T +17:35122188 2524653695 T 449 1347 C +17:35122189 2524653696 C 450 1348 T +17:35122190 2524653697 T F 450 1349 T +17:35122191 2524653698 T 450 1350 T +17:35122192 2524653699 T 451 1351 G +17:35122193 2524653700 G G 451 1352 G +17:35122194 2524653701 G 451 1353 G +17:35122195 2524653702 G 452 1354 A +17:35122196 2524653703 A S 452 1355 G +17:35122197 2524653704 G 452 1356 C +17:35122198 2524653705 C 453 1357 C +17:35122199 2524653706 C L 453 1358 T 
+17:35122200 2524653707 T 453 1359 G +17:35122201 2524653708 G 454 1360 G +17:35122202 2524653709 G A 454 1361 C +17:35122203 2524653710 C 454 1362 A +17:35122204 2524653711 A 455 1363 T +17:35122205 2524653712 T F 455 1364 T +17:35122206 2524653713 T 455 1365 T +17:35122207 2524653714 T 456 1366 C +17:35122208 2524653715 C L 456 1367 T +17:35122209 2524653716 T 456 1368 G +17:35122210 2524653717 G 457 1369 C +17:35122211 2524653718 C P 457 1370 C +17:35122212 2524653719 C 457 1371 G +17:35122213 2524653720 G 458 1372 G +17:35122214 2524653721 G E 458 1373 A +17:35122215 2524653722 A 458 1374 G +17:35122216 2524653723 G 459 1375 A +17:35122217 2524653724 A S 459 1376 G +17:35122218 2524653725 G 459 1377 C +17:35122219 2524653726 C 460 1378 T +17:35122220 2524653727 T F 460 1379 T +17:35122221 2524653728 T 460 1380 T +17:35122222 2524653729 T 461 1381 G +17:35122223 2524653730 G D 461 1382 A +17:35122224 2524653731 A 461 1383 T +17:35122225 2524653732 T 462 1384 G +17:35122226 2524653733 G G 462 1385 G +17:35122227 2524653734 G 462 1386 G +17:35125065 2524656572 G 463 1387 G +17:35125066 2524656573 G D 463 1388 A +17:35125067 2524656574 A 463 1389 C +17:35125068 2524656575 C 464 1390 C +17:35125069 2524656576 C P 464 1391 C +17:35125070 2524656577 C 464 1392 A +17:35125071 2524656578 A 465 1393 G +17:35125072 2524656579 G A 465 1394 C +17:35125073 2524656580 C 465 1395 C +17:35125074 2524656581 C 466 1396 T +17:35125075 2524656582 T S 466 1397 C +17:35125076 2524656583 C 466 1398 C +17:35125077 2524656584 C 467 1399 A +17:35125078 2524656585 A N 467 1400 A +17:35125079 2524656586 A 467 1401 C +17:35125080 2524656587 C 468 1402 A +17:35125081 2524656588 A T 468 1403 C +17:35125082 2524656589 C 468 1404 T +17:35125083 2524656590 T 469 1405 G +17:35125084 2524656591 G A 469 1406 C +17:35125085 2524656592 C 469 1407 C +17:35125086 2524656593 C 470 1408 C +17:35125087 2524656594 C P 470 1409 C +17:35125088 2524656595 C 470 1410 G +17:35125089 2524656596 G 471 1411 C 
+17:35125090 2524656597 C L 471 1412 T +17:35125091 2524656598 T 471 1413 C +17:35125092 2524656599 C 472 1414 C +17:35125093 2524656600 C Q 472 1415 A +17:35125094 2524656601 A 472 1416 G +17:35125095 2524656602 G 473 1417 C +17:35125096 2524656603 C P 473 1418 C +17:35125097 2524656604 C 473 1419 A +17:35125098 2524656605 A 474 1420 G +17:35125099 2524656606 G E 474 1421 A +17:35125100 2524656607 A 474 1422 G +17:35125101 2524656608 G 475 1423 C +17:35125102 2524656609 C Q 475 1424 A +17:35125103 2524656610 A 475 1425 G +17:35125104 2524656611 G 476 1426 C +17:35125105 2524656612 C L 476 1427 T +17:35125106 2524656613 T 476 1428 C +17:35125107 2524656614 C 477 1429 C +17:35125108 2524656615 C Q 477 1430 A +17:35125109 2524656616 A 477 1431 A +17:35125110 2524656617 A 478 1432 G +17:35125111 2524656618 G V 478 1433 T +17:35125112 2524656619 T 478 1434 G +17:35125113 2524656620 G 479 1435 T +17:35125114 2524656621 T F 479 1436 T +17:35125115 2524656622 T 479 1437 T +17:35125116 2524656623 T 480 1438 G +17:35125117 2524656624 G E 480 1439 A +17:35125118 2524656625 A 480 1440 G +17:35125119 2524656626 G 481 1441 A +17:35125120 2524656627 A T 481 1442 C +17:35125121 2524656628 C 481 1443 T +17:35125122 2524656629 T 482 1444 C +17:35125123 2524656630 C L 482 1445 T +17:35125124 2524656631 T 482 1446 G +17:35125125 2524656632 G 483 1447 G +17:35125126 2524656633 G E 483 1448 A +17:35125127 2524656634 A 483 1449 A +17:35125128 2524656635 A 484 1450 G +17:35125129 2524656636 G E 484 1451 A +17:35125130 2524656637 A 484 1452 G +17:35125131 2524656638 G 485 1453 A +17:35125132 2524656639 A I 485 1454 T +17:35125133 2524656640 T 485 1455 C +17:35125134 2524656641 C 486 1456 A +17:35125135 2524656642 A T 486 1457 C +17:35125136 2524656643 C 486 1458 A +17:35125137 2524656644 A 487 1459 G +17:35125138 2524656645 G G 487 1460 G +17:35125225 2524656732 G 487 1461 T +17:35125226 2524656733 T 488 1462 T +17:35125227 2524656734 T Y 488 1463 A +17:35125228 2524656735 A 488 1464 C 
+17:35125229 2524656736 C 489 1465 C +17:35125230 2524656737 C L 489 1466 T +17:35125231 2524656738 T 489 1467 A +17:35125232 2524656739 A 490 1468 T +17:35125233 2524656740 T Y 490 1469 A +17:35125234 2524656741 A 490 1470 C +17:35125235 2524656742 C 491 1471 A +17:35125236 2524656743 A I 491 1472 T +17:35125237 2524656744 T 491 1473 C +17:35125238 2524656745 C 492 1474 T +17:35125239 2524656746 T S 492 1475 C +17:35125240 2524656747 C 492 1476 A +17:35125241 2524656748 A 493 1477 G +17:35125242 2524656749 G A 493 1478 C +17:35125243 2524656750 C 493 1479 A +17:35125244 2524656751 A 494 1480 T +17:35125245 2524656752 T W 494 1481 G +17:35125246 2524656753 G 494 1482 G +17:35125247 2524656754 G 495 1483 C +17:35125248 2524656755 C P 495 1484 C +17:35125249 2524656756 C 495 1485 G +17:35125250 2524656757 G 496 1486 G +17:35125251 2524656758 G D 496 1487 A +17:35125252 2524656759 A 496 1488 C +17:35125253 2524656760 C 497 1489 A +17:35125254 2524656761 A S 497 1490 G +17:35125255 2524656762 G 497 1491 C +17:35125256 2524656763 C 498 1492 C +17:35125257 2524656764 C L 498 1493 T +17:35125258 2524656765 T 498 1494 G +17:35125259 2524656766 G 499 1495 C +17:35125260 2524656767 C P 499 1496 C +17:35125261 2524656768 C 499 1497 T +17:35125262 2524656769 T 500 1498 G +17:35125263 2524656770 G D 500 1499 A +17:35125264 2524656771 A 500 1500 C +17:35125265 2524656772 C 501 1501 C +17:35125266 2524656773 C L 501 1502 T +17:35125267 2524656774 T 501 1503 C +17:35125268 2524656775 C 502 1504 A +17:35125269 2524656776 A S 502 1505 G +17:35125270 2524656777 G 502 1506 C +17:35125271 2524656778 C 503 1507 G +17:35125272 2524656779 G V 503 1508 T +17:35125273 2524656780 T 503 1509 C +17:35125274 2524656781 C 504 1510 T +17:35125275 2524656782 T F 504 1511 T +17:35125276 2524656783 T 504 1512 C +17:35125277 2524656784 C 505 1513 C +17:35125278 2524656785 C Q 505 1514 A +17:35125279 2524656786 A 505 1515 G +17:35125280 2524656787 G 506 1516 A +17:35125281 2524656788 A N 506 1517 A 
+17:35125282 2524656789 A 506 1518 C +17:35125283 2524656790 C 507 1519 C +17:35125284 2524656791 C L 507 1520 T +17:35125285 2524656792 T 507 1521 G +17:35125286 2524656793 G 508 1522 C +17:35125287 2524656794 C Q 508 1523 A +17:35125288 2524656795 A 508 1524 A +17:35125289 2524656796 A 509 1525 G +17:35125290 2524656797 G V 509 1526 T +17:35125291 2524656798 T 509 1527 A +17:35125292 2524656799 A 510 1528 A +17:35125293 2524656800 A I 510 1529 T +17:35125294 2524656801 T 510 1530 C +17:35125295 2524656802 C 511 1531 C +17:35125296 2524656803 C R 511 1532 G +17:35125297 2524656804 G 511 1533 G +17:35125298 2524656805 G 512 1534 G +17:35125299 2524656806 G G 512 1535 G +17:35125300 2524656807 G 512 1536 A +17:35125301 2524656808 A 513 1537 C +17:35125302 2524656809 C R 513 1538 G +17:35125303 2524656810 G 513 1539 A +17:35125304 2524656811 A 514 1540 A +17:35125305 2524656812 A I 514 1541 T +17:35125306 2524656813 T 514 1542 T +17:35125307 2524656814 T 515 1543 C +17:35125308 2524656815 C L 515 1544 T +17:35125309 2524656816 T 515 1545 G +17:35125310 2524656817 G 516 1546 C +17:35125311 2524656818 C H 516 1547 A +17:35125312 2524656819 A 516 1548 C +17:35125313 2524656820 C 517 1549 A +17:35125314 2524656821 A N 517 1550 A +17:35125315 2524656822 A 517 1551 T +17:35125519 2524657026 T 518 1552 G +17:35125520 2524657027 G G 518 1553 G +17:35125521 2524657028 G 518 1554 C +17:35125522 2524657029 C 519 1555 G +17:35125523 2524657030 G A 519 1556 C +17:35125524 2524657031 C 519 1557 C +17:35125525 2524657032 C 520 1558 T +17:35125526 2524657033 T Y 520 1559 A +17:35125527 2524657034 A 520 1560 C +17:35125528 2524657035 C 521 1561 T +17:35125529 2524657036 T S 521 1562 C +17:35125530 2524657037 C 521 1563 G +17:35125531 2524657038 G 522 1564 C +17:35125532 2524657039 C L 522 1565 T +17:35125533 2524657040 T 522 1566 G +17:35125534 2524657041 G 523 1567 A +17:35125535 2524657042 A T 523 1568 C +17:35125536 2524657043 C 523 1569 C +17:35125537 2524657044 C 524 1570 C 
+17:35125538 2524657045 C L 524 1571 T +17:35125539 2524657046 T 524 1572 G +17:35125540 2524657047 G 525 1573 C +17:35125541 2524657048 C Q 525 1574 A +17:35125542 2524657049 A 525 1575 A +17:35125543 2524657050 A 526 1576 G +17:35125544 2524657051 G G 526 1577 G +17:35125545 2524657052 G 526 1578 G +17:35125546 2524657053 G 527 1579 C +17:35125547 2524657054 C L 527 1580 T +17:35125548 2524657055 T 527 1581 G +17:35125549 2524657056 G 528 1582 G +17:35125550 2524657057 G G 528 1583 G +17:35125551 2524657058 G 528 1584 C +17:35125552 2524657059 C 529 1585 A +17:35125553 2524657060 A I 529 1586 T +17:35125554 2524657061 T 529 1587 C +17:35125555 2524657062 C 530 1588 A +17:35125556 2524657063 A S 530 1589 G +17:35125557 2524657064 G 530 1590 C +17:35125558 2524657065 C 531 1591 T +17:35125559 2524657066 T W 531 1592 G +17:35125560 2524657067 G 531 1593 G +17:35125561 2524657068 G 532 1594 C +17:35125562 2524657069 C L 532 1595 T +17:35125563 2524657070 T 532 1596 G +17:35125564 2524657071 G 533 1597 G +17:35125565 2524657072 G G 533 1598 G +17:35125566 2524657073 G 533 1599 G +17:35125567 2524657074 G 534 1600 C +17:35125568 2524657075 C L 534 1601 T +17:35125569 2524657076 T 534 1602 G +17:35125570 2524657077 G 535 1603 C +17:35125571 2524657078 C R 535 1604 G +17:35125572 2524657079 G 535 1605 C +17:35125573 2524657080 C 536 1606 T +17:35125574 2524657081 T S 536 1607 C +17:35125575 2524657082 C 536 1608 A +17:35125576 2524657083 A 537 1609 C +17:35125577 2524657084 C L 537 1610 T +17:35125578 2524657085 T 537 1611 G +17:35125579 2524657086 G 538 1612 A +17:35125580 2524657087 A R 538 1613 G +17:35125581 2524657088 G 538 1614 G +17:35125582 2524657089 G 539 1615 G +17:35125583 2524657090 G E 539 1616 A +17:35125584 2524657091 A 539 1617 A +17:35125585 2524657092 A 540 1618 C +17:35125586 2524657093 C L 540 1619 T +17:35125587 2524657094 T 540 1620 G +17:35125588 2524657095 G 541 1621 G +17:35125589 2524657096 G G 541 1622 G +17:35125590 2524657097 G 541 1623 C 
+17:35125591 2524657098 C 542 1624 A +17:35125592 2524657099 A S 542 1625 G +17:35125593 2524657100 G 542 1626 T +17:35125594 2524657101 T 543 1627 G +17:35125595 2524657102 G G 543 1628 G +17:35125596 2524657103 G 543 1629 A +17:35125597 2524657104 A 544 1630 C +17:35125598 2524657105 C L 544 1631 T +17:35125599 2524657106 T 544 1632 G +17:35125600 2524657107 G 545 1633 G +17:35125601 2524657108 G A 545 1634 C +17:35125602 2524657109 C 545 1635 C +17:35125603 2524657110 C 546 1636 C +17:35125604 2524657111 C L 546 1637 T +17:35125605 2524657112 T 546 1638 C +17:35125606 2524657113 C 547 1639 A +17:35125607 2524657114 A I 547 1640 T +17:35125608 2524657115 T 547 1641 C +17:35125609 2524657116 C 548 1642 C +17:35125610 2524657117 C H 548 1643 A +17:35125611 2524657118 A 548 1644 C +17:35125612 2524657119 C 549 1645 C +17:35125613 2524657120 C H 549 1646 A +17:35125614 2524657121 A 549 1647 T +17:35125615 2524657122 T 550 1648 A +17:35125616 2524657123 A N 550 1649 A +17:35125617 2524657124 A 550 1650 C +17:35125618 2524657125 C 551 1651 A +17:35125619 2524657126 A T 551 1652 C +17:35125620 2524657127 C 551 1653 C +17:35125621 2524657128 C 552 1654 C +17:35125622 2524657129 C H 552 1655 A +17:35125623 2524657130 A 552 1656 C +17:35125624 2524657131 C 553 1657 C +17:35125625 2524657132 C L 553 1658 T +17:35125626 2524657133 T 553 1659 C +17:35125627 2524657134 C 554 1660 T +17:35125628 2524657135 T C 554 1661 G +17:35125629 2524657136 G 554 1662 C +17:35125630 2524657137 C 555 1663 T +17:35125631 2524657138 T F 555 1664 T +17:35125632 2524657139 T 555 1665 C +17:35125633 2524657140 C 556 1666 G +17:35125634 2524657141 G V 556 1667 T +17:35125635 2524657142 T 556 1668 G +17:35125636 2524657143 G 557 1669 C +17:35125637 2524657144 C H 557 1670 A +17:35125638 2524657145 A 557 1671 C +17:35125639 2524657146 C 558 1672 A +17:35125640 2524657147 A T 558 1673 C +17:35125641 2524657148 C 558 1674 G +17:35125642 2524657149 G 559 1675 G +17:35125643 2524657150 G V 559 1676 T 
+17:35125644 2524657151 T 559 1677 G +17:35125645 2524657152 G 560 1678 C +17:35125646 2524657153 C P 560 1679 C +17:35125647 2524657154 C 560 1680 C +17:35125648 2524657155 C 561 1681 T +17:35125649 2524657156 T W 561 1682 G +17:35125650 2524657157 G 561 1683 G +17:35125651 2524657158 G 562 1684 G +17:35125652 2524657159 G D 562 1685 A +17:35125653 2524657160 A 562 1686 C +17:35125654 2524657161 C 563 1687 C +17:35125655 2524657162 C Q 563 1688 A +17:35125656 2524657163 A 563 1689 G +17:35125657 2524657164 G 564 1690 C +17:35125658 2524657165 C L 564 1691 T +17:35125659 2524657166 T 564 1692 C +17:35125660 2524657167 C 565 1693 T +17:35125661 2524657168 T F 565 1694 T +17:35125662 2524657169 T 565 1695 T +17:35125663 2524657170 T 566 1696 C +17:35125664 2524657171 C R 566 1697 G +17:35125665 2524657172 G 566 1698 G +17:35125666 2524657173 G 567 1699 A +17:35125667 2524657174 A N 567 1700 A +17:35125668 2524657175 A 567 1701 C +17:35125669 2524657176 C 568 1702 C +17:35125670 2524657177 C P 568 1703 C +17:35125671 2524657178 C 568 1704 G +17:35125672 2524657179 G 569 1705 C +17:35125673 2524657180 C H 569 1706 A +17:35125674 2524657181 A 569 1707 C +17:35125675 2524657182 C 570 1708 C +17:35125676 2524657183 C Q 570 1709 A +17:35125677 2524657184 A 570 1710 A +17:35125678 2524657185 A 571 1711 G +17:35125679 2524657186 G A 571 1712 C +17:35125680 2524657187 C 571 1713 T +17:35125681 2524657188 T 572 1714 C +17:35125682 2524657189 C L 572 1715 T +17:35125683 2524657190 T 572 1716 G +17:35125684 2524657191 G 573 1717 C +17:35125685 2524657192 C L 573 1718 T +17:35125686 2524657193 T 573 1719 C +17:35125687 2524657194 C 574 1720 C +17:35125688 2524657195 C H 574 1721 A +17:35125689 2524657196 A 574 1722 C +17:35125690 2524657197 C 575 1723 A +17:35125691 2524657198 A T 575 1724 C +17:35125692 2524657199 C 575 1725 T +17:35125693 2524657200 T 576 1726 G +17:35125694 2524657201 G A 576 1727 C +17:35125695 2524657202 C 576 1728 C +17:35125696 2524657203 C 577 1729 A 
+17:35125697 2524657204 A N 577 1730 A +17:35125698 2524657205 A 577 1731 C +17:35125699 2524657206 C 578 1732 C +17:35125700 2524657207 C R 578 1733 G +17:35125701 2524657208 G 578 1734 G +17:35125702 2524657209 G 579 1735 C +17:35125703 2524657210 C P 579 1736 C +17:35125704 2524657211 C 579 1737 A +17:35125705 2524657212 A 580 1738 G +17:35125706 2524657213 G E 580 1739 A +17:35125707 2524657214 A 580 1740 G +17:35125708 2524657215 G 581 1741 G +17:35125709 2524657216 G D 581 1742 A +17:35125710 2524657217 A 581 1743 C +17:35125711 2524657218 C 582 1744 G +17:35125712 2524657219 G E 582 1745 A +17:35125713 2524657220 A 582 1746 G +17:35125714 2524657221 G 583 1747 T +17:35125715 2524657222 T C 583 1748 G +17:35125716 2524657223 G 583 1749 T +17:35125717 2524657224 T 584 1750 G +17:35125718 2524657225 G V 584 1751 T +17:35126080 2524657587 T 584 1752 G +17:35126081 2524657588 G 585 1753 G +17:35126082 2524657589 G G 585 1754 G +17:35126083 2524657590 G 585 1755 C +17:35126084 2524657591 C 586 1756 G +17:35126085 2524657592 G E 586 1757 A +17:35126086 2524657593 A 586 1758 G +17:35126087 2524657594 G 587 1759 G +17:35126088 2524657595 G G 587 1760 G +17:35126089 2524657596 G 587 1761 C +17:35126090 2524657597 C 588 1762 C +17:35126091 2524657598 C L 588 1763 T +17:35126092 2524657599 T 588 1764 G +17:35126093 2524657600 G 589 1765 G +17:35126094 2524657601 G A 589 1766 C +17:35126095 2524657602 C 589 1767 C +17:35126096 2524657603 C 590 1768 T +17:35126097 2524657604 T C 590 1769 G +17:35126098 2524657605 G 590 1770 C +17:35126099 2524657606 C 591 1771 C +17:35126100 2524657607 C H 591 1772 A +17:35126101 2524657608 A 591 1773 C +17:35126102 2524657609 C 592 1774 C +17:35126103 2524657610 C Q 592 1775 A +17:35126104 2524657611 A 592 1776 G +17:35126105 2524657612 G 593 1777 C +17:35126106 2524657613 C L 593 1778 T +17:35126107 2524657614 T 593 1779 G +17:35126108 2524657615 G 594 1780 T +17:35126109 2524657616 T C 594 1781 G +17:35126110 2524657617 G 594 1782 C 
+17:35126111 2524657618 C 595 1783 G +17:35126112 2524657619 G A 595 1784 C +17:35126113 2524657620 C 595 1785 C +17:35126114 2524657621 C 596 1786 C +17:35126115 2524657622 C R 596 1787 G +17:35126116 2524657623 G 596 1788 A +17:35126117 2524657624 A 597 1789 G +17:35126118 2524657625 G G 597 1790 G +17:35126119 2524657626 G 597 1791 G +17:35126120 2524657627 G 598 1792 C +17:35126121 2524657628 C H 598 1793 A +17:35126122 2524657629 A 598 1794 C +17:35126123 2524657630 C 599 1795 T +17:35126124 2524657631 T C 599 1796 G +17:35126125 2524657632 G 599 1797 C +17:35126126 2524657633 C 600 1798 T +17:35126127 2524657634 T W 600 1799 G +17:35126128 2524657635 G 600 1800 G +17:35126129 2524657636 G 601 1801 G +17:35126130 2524657637 G G 601 1802 G +17:35126131 2524657638 G 601 1803 T +17:35126132 2524657639 T 602 1804 C +17:35126133 2524657640 C P 602 1805 C +17:35126134 2524657641 C 602 1806 A +17:35126135 2524657642 A 603 1807 G +17:35126136 2524657643 G G 603 1808 G +17:35126137 2524657644 G 603 1809 G +17:35126138 2524657645 G 604 1810 C +17:35126139 2524657646 C P 604 1811 C +17:35126140 2524657647 C 604 1812 C +17:35126141 2524657648 C 605 1813 A +17:35126142 2524657649 A T 605 1814 C +17:35126143 2524657650 C 605 1815 C +17:35126144 2524657651 C 606 1816 C +17:35126145 2524657652 C Q 606 1817 A +17:35126146 2524657653 A 606 1818 G +17:35126147 2524657654 G 607 1819 T +17:35126148 2524657655 T C 607 1820 G +17:35126149 2524657656 G 607 1821 T +17:35126150 2524657657 T 608 1822 G +17:35126151 2524657658 G V 608 1823 T +17:35126152 2524657659 T 608 1824 C +17:35126153 2524657660 C 609 1825 A +17:35126154 2524657661 A N 609 1826 A +17:35126155 2524657662 A 609 1827 C +17:35126156 2524657663 C 610 1828 T +17:35126157 2524657664 T C 610 1829 G +17:35126158 2524657665 G 610 1830 C +17:35126159 2524657666 C 611 1831 A +17:35126160 2524657667 A S 611 1832 G +17:35126161 2524657668 G 611 1833 C +17:35126162 2524657669 C 612 1834 C +17:35126163 2524657670 C Q 612 1835 A 
+17:35126164 2524657671 A 612 1836 G +17:35126165 2524657672 G 613 1837 T +17:35126166 2524657673 T F 613 1838 T +17:35126167 2524657674 T 613 1839 C +17:35126168 2524657675 C 614 1840 C +17:35126169 2524657676 C L 614 1841 T +17:35126170 2524657677 T 614 1842 T +17:35126171 2524657678 T 615 1843 C +17:35126172 2524657679 C R 615 1844 G +17:35126173 2524657680 G 615 1845 G +17:35126174 2524657681 G 616 1846 G +17:35126175 2524657682 G G 616 1847 G +17:35126176 2524657683 G 616 1848 C +17:35126177 2524657684 C 617 1849 C +17:35126178 2524657685 C Q 617 1850 A +17:35126179 2524657686 A 617 1851 G +17:35126180 2524657687 G 618 1852 G +17:35126181 2524657688 G E 618 1853 A +17:35126182 2524657689 A 618 1854 G +17:35126183 2524657690 G 619 1855 T +17:35126184 2524657691 T C 619 1856 G +17:35126185 2524657692 G 619 1857 C +17:35126186 2524657693 C 620 1858 G +17:35126187 2524657694 G V 620 1859 T +17:35126188 2524657695 T 620 1860 G +17:35126189 2524657696 G 621 1861 G +17:35126190 2524657697 G E 621 1862 A +17:35126191 2524657698 A 621 1863 G +17:35126192 2524657699 G 622 1864 G +17:35126193 2524657700 G E 622 1865 A +17:35126194 2524657701 A 622 1866 A +17:35126195 2524657702 A 623 1867 T +17:35126196 2524657703 T C 623 1868 G +17:35126197 2524657704 G 623 1869 C +17:35126198 2524657705 C 624 1870 C +17:35126199 2524657706 C R 624 1871 G +17:35126200 2524657707 G 624 1872 A +17:35126201 2524657708 A 625 1873 G +17:35126202 2524657709 G V 625 1874 T +17:35126203 2524657710 T 625 1875 A +17:35126204 2524657711 A 626 1876 C +17:35126205 2524657712 C L 626 1877 T +17:35126206 2524657713 T 626 1878 G +17:35126207 2524657714 G 627 1879 C +17:35126208 2524657715 C Q 627 1880 A +17:35126209 2524657716 A 627 1881 G +17:35126210 2524657717 G 628 1882 G +17:35126211 2524657718 G G 628 1883 G +17:35126212 2524657719 G 628 1884 G +17:35126294 2524657801 G 629 1885 C +17:35126295 2524657802 C L 629 1886 T +17:35126296 2524657803 T 629 1887 C +17:35126297 2524657804 C 630 1888 C 
+17:35126298 2524657805 C P 630 1889 C +17:35126299 2524657806 C 630 1890 C +17:35126300 2524657807 C 631 1891 A +17:35126301 2524657808 A R 631 1892 G +17:35126302 2524657809 G 631 1893 G +17:35126303 2524657810 G 632 1894 G +17:35126304 2524657811 G E 632 1895 A +17:35126305 2524657812 A 632 1896 G +17:35126306 2524657813 G 633 1897 T +17:35126307 2524657814 T Y 633 1898 A +17:35126308 2524657815 A 633 1899 T +17:35126309 2524657816 T 634 1900 G +17:35126310 2524657817 G V 634 1901 T +17:35126311 2524657818 T 634 1902 G +17:35126312 2524657819 G 635 1903 A +17:35126313 2524657820 A N 635 1904 A +17:35126314 2524657821 A 635 1905 T +17:35126315 2524657822 T 636 1906 G +17:35126316 2524657823 G A 636 1907 C +17:35126317 2524657824 C 636 1908 C +17:35126318 2524657825 C 637 1909 A +17:35126319 2524657826 A R 637 1910 G +17:35126320 2524657827 G 637 1911 G +17:35126321 2524657828 G 638 1912 C +17:35126322 2524657829 C H 638 1913 A +17:35126323 2524657830 A 638 1914 C +17:35126324 2524657831 C 639 1915 T +17:35126325 2524657832 T C 639 1916 G +17:35126326 2524657833 G 639 1917 T +17:35126327 2524657834 T 640 1918 T +17:35126328 2524657835 T L 640 1919 T +17:35126329 2524657836 T 640 1920 G +17:35126330 2524657837 G 641 1921 C +17:35126331 2524657838 C P 641 1922 C +17:35126332 2524657839 C 641 1923 G +17:35126333 2524657840 G 642 1924 T +17:35126334 2524657841 T C 642 1925 G +17:35126335 2524657842 G 642 1926 C +17:35126336 2524657843 C 643 1927 C +17:35126337 2524657844 C H 643 1928 A +17:35126338 2524657845 A 643 1929 C +17:35126339 2524657846 C 644 1930 C +17:35126340 2524657847 C P 644 1931 C +17:35126341 2524657848 C 644 1932 T +17:35126342 2524657849 T 645 1933 G +17:35126343 2524657850 G E 645 1934 A +17:35126344 2524657851 A 645 1935 G +17:35126345 2524657852 G 646 1936 T +17:35126346 2524657853 T C 646 1937 G +17:35126347 2524657854 G 646 1938 T +17:35126348 2524657855 T 647 1939 C +17:35126349 2524657856 C Q 647 1940 A +17:35126350 2524657857 A 647 1941 G 
+17:35126351 2524657858 G 648 1942 C +17:35126352 2524657859 C P 648 1943 C +17:35126353 2524657860 C 648 1944 C +17:35126354 2524657861 C 649 1945 C +17:35126355 2524657862 C Q 649 1946 A +17:35126356 2524657863 A 649 1947 G +17:35126357 2524657864 G 650 1948 A +17:35126358 2524657865 A N 650 1949 A +17:35126359 2524657866 A 650 1950 T +17:35126360 2524657867 T 651 1951 G +17:35126361 2524657868 G G 651 1952 G +17:35126362 2524657869 G 651 1953 C +17:35126363 2524657870 C 652 1954 T +17:35126364 2524657871 T S 652 1955 C +17:35126365 2524657872 C 652 1956 A +17:35126366 2524657873 A 653 1957 G +17:35126367 2524657874 G V 653 1958 T +17:35126368 2524657875 T 653 1959 G +17:35126369 2524657876 G 654 1960 A +17:35126370 2524657877 A T 654 1961 C +17:35126371 2524657878 C 654 1962 C +17:35126372 2524657879 C 655 1963 T +17:35126373 2524657880 T C 655 1964 G +17:35126374 2524657881 G 655 1965 T +17:35126375 2524657882 T 656 1966 T +17:35126376 2524657883 T F 656 1967 T +17:35126377 2524657884 T 656 1968 T +17:35126378 2524657885 T 657 1969 G +17:35126379 2524657886 G G 657 1970 G +17:35126380 2524657887 G 657 1971 A +17:35126381 2524657888 A 658 1972 C +17:35126382 2524657889 C P 658 1973 C +17:35126383 2524657890 C 658 1974 G +17:35126384 2524657891 G 659 1975 G +17:35127099 2524658606 G E 659 1976 A +17:35127100 2524658607 A 659 1977 G +17:35127101 2524658608 G 660 1978 G +17:35127102 2524658609 G A 660 1979 C +17:35127103 2524658610 C 660 1980 T +17:35127104 2524658611 T 661 1981 G +17:35127105 2524658612 G D 661 1982 A +17:35127106 2524658613 A 661 1983 C +17:35127107 2524658614 C 662 1984 C +17:35127108 2524658615 C Q 662 1985 A +17:35127109 2524658616 A 662 1986 G +17:35127110 2524658617 G 663 1987 T +17:35127111 2524658618 T C 663 1988 G +17:35127112 2524658619 G 663 1989 T +17:35127113 2524658620 T 664 1990 G +17:35127114 2524658621 G V 664 1991 T +17:35127115 2524658622 T 664 1992 G +17:35127116 2524658623 G 665 1993 G +17:35127117 2524658624 G A 665 1994 C 
+17:35127118 2524658625 C 665 1995 C +17:35127119 2524658626 C 666 1996 T +17:35127120 2524658627 T C 666 1997 G +17:35127121 2524658628 G 666 1998 T +17:35127122 2524658629 T 667 1999 G +17:35127123 2524658630 G A 667 2000 C +17:35127124 2524658631 C 667 2001 C +17:35127125 2524658632 C 668 2002 C +17:35127126 2524658633 C H 668 2003 A +17:35127127 2524658634 A 668 2004 C +17:35127128 2524658635 C 669 2005 T +17:35127129 2524658636 T Y 669 2006 A +17:35127130 2524658637 A 669 2007 T +17:35127131 2524658638 T 670 2008 A +17:35127132 2524658639 A K 670 2009 A +17:35127133 2524658640 A 670 2010 G +17:35127134 2524658641 G 671 2011 G +17:35127135 2524658642 G D 671 2012 A +17:35127136 2524658643 A 671 2013 C +17:35127137 2524658644 C 672 2014 C +17:35127138 2524658645 C P 672 2015 C +17:35127139 2524658646 C 672 2016 T +17:35127140 2524658647 T 673 2017 C +17:35127141 2524658648 C P 673 2018 C +17:35127142 2524658649 C 673 2019 C +17:35127143 2524658650 C 674 2020 T +17:35127144 2524658651 T F 674 2021 T +17:35127145 2524658652 T 674 2022 C +17:35127146 2524658653 C 675 2023 T +17:35127147 2524658654 T C 675 2024 G +17:35127148 2524658655 G 675 2025 C +17:35127149 2524658656 C 676 2026 G +17:35127150 2524658657 G V 676 2027 T +17:35127151 2524658658 T 676 2028 G +17:35127152 2524658659 G 677 2029 G +17:35127153 2524658660 G A 677 2030 C +17:35127154 2524658661 C 677 2031 C +17:35127155 2524658662 C 678 2032 C +17:35127156 2524658663 C R 678 2033 G +17:35127157 2524658664 G 678 2034 C +17:35127158 2524658665 C 679 2035 T +17:35127159 2524658666 T C 679 2036 G +17:35127160 2524658667 G 679 2037 C +17:35127161 2524658668 C 680 2038 C +17:35127162 2524658669 C P 680 2039 C +17:35127163 2524658670 C 680 2040 C +17:35127164 2524658671 C 681 2041 A +17:35127165 2524658672 A S 681 2042 G +17:35127166 2524658673 G 681 2043 C +17:35127167 2524658674 C 682 2044 G +17:35127168 2524658675 G G 682 2045 G +17:35127169 2524658676 G 682 2046 T +17:35127170 2524658677 T 683 2047 G 
+17:35127171 2524658678 G V 683 2048 T +17:35127172 2524658679 T 683 2049 G +17:35127173 2524658680 G 684 2050 A +17:35127174 2524658681 A K 684 2051 A +17:35127175 2524658682 A 684 2052 A +17:35127176 2524658683 A 685 2053 C +17:35127177 2524658684 C P 685 2054 C +17:35127178 2524658685 C 685 2055 T +17:35127179 2524658686 T 686 2056 G +17:35127180 2524658687 G D 686 2057 A +17:35127181 2524658688 A 686 2058 C +17:35127182 2524658689 C 687 2059 C +17:35127183 2524658690 C L 687 2060 T +17:35127184 2524658691 T 687 2061 C +17:35127185 2524658692 C 688 2062 T +17:35127186 2524658693 T S 688 2063 C +17:35127187 2524658694 C 688 2064 C +17:35127188 2524658695 C 689 2065 T +17:35127189 2524658696 T Y 689 2066 A +17:35127190 2524658697 A 689 2067 C +17:35127191 2524658698 C 690 2068 A +17:35127192 2524658699 A M 690 2069 T +17:35127193 2524658700 T 690 2070 G +17:35127194 2524658701 G 691 2071 C +17:35127195 2524658702 C P 691 2072 C +17:35127196 2524658703 C 691 2073 C +17:35127197 2524658704 C 692 2074 A +17:35127198 2524658705 A I 692 2075 T +17:35127199 2524658706 T 692 2076 C +17:35127200 2524658707 C 693 2077 T +17:35127201 2524658708 T W 693 2078 G +17:35127202 2524658709 G 693 2079 G +17:35127203 2524658710 G 694 2080 A +17:35127204 2524658711 A K 694 2081 A +17:35127205 2524658712 A 694 2082 G +17:35127206 2524658713 G 695 2083 T +17:35127207 2524658714 T F 695 2084 T +17:35127208 2524658715 T 695 2085 T +17:35127209 2524658716 T 696 2086 C +17:35127210 2524658717 C P 696 2087 C +17:35127211 2524658718 C 696 2088 A +17:35127212 2524658719 A 697 2089 G +17:35127213 2524658720 G D 697 2090 A +17:35127214 2524658721 A 697 2091 T +17:35127215 2524658722 T 698 2092 G +17:35127216 2524658723 G E 698 2093 A +17:35127217 2524658724 A 698 2094 G +17:35127218 2524658725 G 699 2095 G +17:35127219 2524658726 G E 699 2096 A +17:35127220 2524658727 A 699 2097 G +17:35127221 2524658728 G 700 2098 G +17:35127222 2524658729 G G 700 2099 G +17:35127223 2524658730 G 700 2100 C 
+17:35127224 2524658731 C 701 2101 G +17:35127225 2524658732 G A 701 2102 C +17:35127226 2524658733 C 701 2103 A +17:35127227 2524658734 A 702 2104 T +17:35127228 2524658735 T C 702 2105 G +17:35127229 2524658736 G 702 2106 C +17:35127230 2524658737 C 703 2107 C +17:35127231 2524658738 C Q 703 2108 A +17:35127232 2524658739 A 703 2109 G +17:35127233 2524658740 G 704 2110 C +17:35127234 2524658741 C P 704 2111 C +17:35127235 2524658742 C 704 2112 T +17:35127236 2524658743 T 705 2113 T +17:35127237 2524658744 T C 705 2114 G +17:35127238 2524658745 G 705 2115 C +17:35127239 2524658746 C 706 2116 C +17:35127240 2524658747 C P 706 2117 C +17:35127241 2524658748 C 706 2118 C +17:35127242 2524658749 C 707 2119 A +17:35127243 2524658750 A I 707 2120 T +17:35127244 2524658751 T 707 2121 C +17:35127245 2524658752 C 708 2122 A +17:35127246 2524658753 A N 708 2123 A +17:35127247 2524658754 A 708 2124 C +17:35127248 2524658755 C 709 2125 T +17:35127249 2524658756 T C 709 2126 G +17:35127250 2524658757 G 709 2127 C +17:35127251 2524658758 C 710 2128 A +17:35127252 2524658759 A T 710 2129 C +17:35127253 2524658760 C 710 2130 C +17:35127254 2524658761 C 711 2131 C +17:35127255 2524658762 C H 711 2132 A +17:35127256 2524658763 A 711 2133 C +17:35127257 2524658764 C 712 2134 T +17:35127258 2524658765 T S 712 2135 C +17:35127259 2524658766 C 712 2136 C +17:35129566 2524661073 C 713 2137 T +17:35129567 2524661074 T C 713 2138 G +17:35129568 2524661075 G 713 2139 T +17:35129569 2524661076 T 714 2140 G +17:35129570 2524661077 G V 714 2141 T +17:35129571 2524661078 T 714 2142 G +17:35129572 2524661079 G 715 2143 G +17:35129573 2524661080 G D 715 2144 A +17:35129574 2524661081 A 715 2145 C +17:35129575 2524661082 C 716 2146 C +17:35129576 2524661083 C L 716 2147 T +17:35129577 2524661084 T 716 2148 G +17:35129578 2524661085 G 717 2149 G +17:35129579 2524661086 G D 717 2150 A +17:35129580 2524661087 A 717 2151 T +17:35129581 2524661088 T 718 2152 G +17:35129582 2524661089 G D 718 2153 A 
+17:35129583 2524661090 A 718 2154 C +17:35129584 2524661091 C 719 2155 A +17:35129585 2524661092 A K 719 2156 A +17:35129586 2524661093 A 719 2157 G +17:35129587 2524661094 G 720 2158 G +17:35129588 2524661095 G G 720 2159 G +17:35129589 2524661096 G 720 2160 C +17:35129590 2524661097 C 721 2161 T +17:35129591 2524661098 T C 721 2162 G +17:35129592 2524661099 G 721 2163 C +17:35129593 2524661100 C 722 2164 C +17:35129594 2524661101 C P 722 2165 C +17:35129595 2524661102 C 722 2166 C +17:35129596 2524661103 C 723 2167 G +17:35129597 2524661104 G A 723 2168 C +17:35129598 2524661105 C 723 2169 C +17:35129599 2524661106 C 724 2170 G +17:35129600 2524661107 G E 724 2171 A +17:35129601 2524661108 A 724 2172 G +17:35129602 2524661109 G 725 2173 C +17:35129603 2524661110 C Q 725 2174 A +17:35129604 2524661111 A 725 2175 G +17:35129605 2524661112 G 726 2176 A +17:35129606 2524661113 A R 726 2177 G +17:35129607 2524661114 G 726 2178 A +17:35129608 2524661115 A 727 2179 G +17:35129609 2524661116 G A 727 2180 C +17:35129610 2524661117 C 727 2181 C +17:35129611 2524661118 C 728 2182 A +17:35129612 2524661119 A S 728 2183 G +17:35129613 2524661120 G 728 2184 C +17:35133098 2524664605 C 729 2185 C +17:35133099 2524664606 C P 729 2186 C +17:35133100 2524664607 C 729 2187 T +17:35133101 2524664608 T 730 2188 C +17:35133102 2524664609 C L 730 2189 T +17:35133103 2524664610 T 730 2190 G +17:35133104 2524664611 G 731 2191 A +17:35133105 2524664612 A T 731 2192 C +17:35133106 2524664613 C 731 2193 G +17:35133107 2524664614 G 732 2194 T +17:35133108 2524664615 T S 732 2195 C +17:35133109 2524664616 C 732 2196 C +17:35133110 2524664617 C 733 2197 A +17:35133111 2524664618 A I 733 2198 T +17:35133112 2524664619 T 733 2199 C +17:35133113 2524664620 C 734 2200 A +17:35133114 2524664621 A I 734 2201 T +17:35133115 2524664622 T 734 2202 C +17:35133116 2524664623 C 735 2203 T +17:35133117 2524664624 T S 735 2204 C +17:35133118 2524664625 C 735 2205 T +17:35133119 2524664626 T 736 2206 G 
+17:35133120 2524664627 G A 736 2207 C +17:35133121 2524664628 C 736 2208 G +17:35133122 2524664629 G 737 2209 G +17:35133123 2524664630 G V 737 2210 T +17:35133124 2524664631 T 737 2211 G +17:35133125 2524664632 G 738 2212 G +17:35133126 2524664633 G V 738 2213 T +17:35133127 2524664634 T 738 2214 T +17:35133128 2524664635 T 739 2215 G +17:35133129 2524664636 G G 739 2216 G +17:35133130 2524664637 G 739 2217 C +17:35133131 2524664638 C 740 2218 A +17:35133132 2524664639 A I 740 2219 T +17:35133133 2524664640 T 740 2220 T +17:35133134 2524664641 T 741 2221 C +17:35133135 2524664642 C L 741 2222 T +17:35133136 2524664643 T 741 2223 G +17:35133137 2524664644 G 742 2224 C +17:35133138 2524664645 C L 742 2225 T +17:35133139 2524664646 T 742 2226 G +17:35133140 2524664647 G 743 2227 G +17:35133141 2524664648 G V 743 2228 T +17:35133142 2524664649 T 743 2229 C +17:35133143 2524664650 C 744 2230 G +17:35133144 2524664651 G V 744 2231 T +17:35133145 2524664652 T 744 2232 G +17:35133146 2524664653 G 745 2233 G +17:35133147 2524664654 G V 745 2234 T +17:35133148 2524664655 T 745 2235 C +17:35133149 2524664656 C 746 2236 T +17:35133150 2524664657 T L 746 2237 T +17:35133151 2524664658 T 746 2238 G +17:35133152 2524664659 G 747 2239 G +17:35133153 2524664660 G G 747 2240 G +17:35133154 2524664661 G 747 2241 G +17:35133155 2524664662 G 748 2242 G +17:35133156 2524664663 G V 748 2243 T +17:35133157 2524664664 T 748 2244 G +17:35133158 2524664665 G 749 2245 G +17:35133159 2524664666 G V 749 2246 T +17:35133160 2524664667 T 749 2247 C +17:35133161 2524664668 C 750 2248 T +17:35133162 2524664669 T F 750 2249 T +17:35133163 2524664670 T 750 2250 T +17:35133164 2524664671 T 751 2251 G +17:35133165 2524664672 G G 751 2252 G +17:35133166 2524664673 G 751 2253 G +17:35133167 2524664674 G 752 2254 A +17:35133168 2524664675 A I 752 2255 T +17:35133169 2524664676 T 752 2256 C +17:35133170 2524664677 C 753 2257 C +17:35133171 2524664678 C L 753 2258 T +17:35133172 2524664679 T 753 2259 C 
+17:35133173 2524664680 C 754 2260 A +17:35133174 2524664681 A I 754 2261 T +17:35133175 2524664682 T 754 2262 C +17:35133176 2524664683 C 755 2263 A +17:35133177 2524664684 A K 755 2264 A +17:35133178 2524664685 A 755 2265 G +17:35133179 2524664686 G 756 2266 C +17:35133180 2524664687 C R 756 2267 G +17:35133181 2524664688 G 756 2268 A +17:35133182 2524664689 A 757 2269 C +17:35133183 2524664690 C R 757 2270 G +17:35133184 2524664691 G 757 2271 G +17:35133185 2524664692 G 758 2272 C +17:35133186 2524664693 C Q 758 2273 A +17:35133187 2524664694 A 758 2274 G +17:35133188 2524664695 G 759 2275 C +17:35133189 2524664696 C Q 759 2276 A +17:35133190 2524664697 A 759 2277 G +17:35133191 2524664698 G 760 2278 A +17:35133192 2524664699 A K 760 2279 A +17:35133193 2524664700 A 760 2280 G +17:35133194 2524664701 G 761 2281 A +17:35133195 2524664702 A I 761 2282 T +17:35133196 2524664703 T 761 2283 C +17:35133197 2524664704 C 762 2284 C +17:35133198 2524664705 C R 762 2285 G +17:35133199 2524664706 G 762 2286 G +17:35133200 2524664707 G 763 2287 A +17:35133201 2524664708 A K 763 2288 A +17:35133202 2524664709 A 763 2289 G +17:35133203 2524664710 G 764 2290 T +17:35133204 2524664711 T Y 764 2291 A +17:35133205 2524664712 A 764 2292 C +17:35133206 2524664713 C 765 2293 A +17:35133207 2524664714 A T 765 2294 C +17:35133208 2524664715 C 765 2295 G +17:35133209 2524664716 G 766 2296 A +17:35133210 2524664717 A M 766 2297 T +17:35133211 2524664718 T 766 2298 G +17:35133212 2524664719 G 767 2299 C +17:35133213 2524664720 C R 767 2300 G +17:35133214 2524664721 G 767 2301 G +17:35133215 2524664722 G 768 2302 A +17:35133216 2524664723 A R 768 2303 G +17:35133217 2524664724 G 768 2304 A +17:35133218 2524664725 A 769 2305 C +17:35133219 2524664726 C L 769 2306 T +17:35133220 2524664727 T 769 2307 G +17:35133221 2524664728 G 770 2308 C +17:35133222 2524664729 C L 770 2309 T +17:35133223 2524664730 T 770 2310 G +17:35133224 2524664731 G 771 2311 C +17:35133225 2524664732 C Q 771 2312 A 
+17:35133226 2524664733 A 771 2313 G +17:35133227 2524664734 G 772 2314 G +17:35133228 2524664735 G E 772 2315 A +17:35133229 2524664736 A 772 2316 A +17:35133230 2524664737 A 773 2317 A +17:35133231 2524664738 A T 773 2318 C +17:35133232 2524664739 C 773 2319 G +17:35133233 2524664740 G 774 2320 G +17:35133234 2524664741 G E 774 2321 A +17:35133235 2524664742 A 774 2322 G +17:35133236 2524664743 G 775 2323 C +17:35133317 2524664824 C L 775 2324 T +17:35133318 2524664825 T 775 2325 G +17:35133319 2524664826 G 776 2326 G +17:35133320 2524664827 G V 776 2327 T +17:35133321 2524664828 T 776 2328 G +17:35133322 2524664829 G 777 2329 G +17:35133323 2524664830 G E 777 2330 A +17:35133324 2524664831 A 777 2331 G +17:35133325 2524664832 G 778 2332 C +17:35133326 2524664833 C P 778 2333 C +17:35133327 2524664834 C 778 2334 G +17:35133328 2524664835 G 779 2335 C +17:35133329 2524664836 C L 779 2336 T +17:35133330 2524664837 T 779 2337 G +17:35133331 2524664838 G 780 2338 A +17:35133332 2524664839 A T 780 2339 C +17:35133333 2524664840 C 780 2340 A +17:35133334 2524664841 A 781 2341 C +17:35133335 2524664842 C P 781 2342 C +17:35133336 2524664843 C 781 2343 T +17:35133337 2524664844 T 782 2344 A +17:35133338 2524664845 A S 782 2345 G +17:35133339 2524664846 G 782 2346 C +17:35133340 2524664847 C 783 2347 G +17:35133341 2524664848 G G 783 2348 G +17:35133342 2524664849 G 783 2349 A +17:35133343 2524664850 A 784 2350 G +17:35133344 2524664851 G A 784 2351 C +17:35133345 2524664852 C 784 2352 G +17:35133346 2524664853 G 785 2353 A +17:35133347 2524664854 A M 785 2354 T +17:35133348 2524664855 T 785 2355 G +17:35133349 2524664856 G 786 2356 C +17:35133350 2524664857 C P 786 2357 C +17:35133351 2524664858 C 786 2358 C +17:35133352 2524664859 C 787 2359 A +17:35133353 2524664860 A N 787 2360 A +17:35133354 2524664861 A 787 2361 C +17:35133355 2524664862 C 788 2362 C +17:35133356 2524664863 C Q 788 2363 A +17:35133357 2524664864 A 788 2364 G +17:35133358 2524664865 G 789 2365 G 
+17:35133359 2524664866 G A 789 2366 C +17:35133360 2524664867 C 789 2367 G +17:35133361 2524664868 G 790 2368 C +17:35133362 2524664869 C Q 790 2369 A +17:35133363 2524664870 A 790 2370 G +17:35133364 2524664871 G 791 2371 A +17:35133365 2524664872 A M 791 2372 T +17:35133366 2524664873 T 791 2373 G +17:35133367 2524664874 G 792 2374 C +17:35133368 2524664875 C R 792 2375 G +17:35133369 2524664876 G 792 2376 G +17:35133370 2524664877 G 793 2377 A +17:35133371 2524664878 A I 793 2378 T +17:35133372 2524664879 T 793 2379 C +17:35133373 2524664880 C 794 2380 C +17:35133374 2524664881 C L 794 2381 T +17:35133375 2524664882 T 794 2382 G +17:35133376 2524664883 G 795 2383 A +17:35133377 2524664884 A K 795 2384 A +17:35133378 2524664885 A 795 2385 A +17:35133379 2524664886 A 796 2386 G +17:35133380 2524664887 G E 796 2387 A +17:35133381 2524664888 A 796 2388 G +17:35133382 2524664889 G 797 2389 A +17:35133383 2524664890 A T 797 2390 C +17:35133384 2524664891 C 797 2391 G +17:35133385 2524664892 G 798 2392 G +17:35133386 2524664893 G E 798 2393 A +17:35133387 2524664894 A 798 2394 G +17:35133388 2524664895 G 799 2395 C +17:35133389 2524664896 C L 799 2396 T +17:35133390 2524664897 T 799 2397 G +17:35133391 2524664898 G 800 2398 A +17:35133392 2524664899 A R 800 2399 G +17:35133393 2524664900 G 800 2400 G +17:35133394 2524664901 G 801 2401 A +17:35133395 2524664902 A K 801 2402 A +17:35133396 2524664903 A 801 2403 G +17:35133397 2524664904 G 802 2404 G +17:35133398 2524664905 G V 802 2405 T +17:35133399 2524664906 T 802 2406 G +17:35133400 2524664907 G 803 2407 A +17:35133401 2524664908 A K 803 2408 A +17:35133402 2524664909 A 803 2409 G +17:35133403 2524664910 G 804 2410 G +17:35133404 2524664911 G V 804 2411 T +17:35133405 2524664912 T 804 2412 G +17:35133406 2524664913 G 805 2413 C +17:35133407 2524664914 C L 805 2414 T +17:35133408 2524664915 T 805 2415 T +17:35133409 2524664916 T 806 2416 G +17:35133410 2524664917 G G 806 2417 G +17:35133411 2524664918 G 806 2418 A 
+17:35133412 2524664919 A 807 2419 T +17:35133413 2524664920 T S 807 2420 C +17:35133414 2524664921 C 807 2421 T +17:35133415 2524664922 T 808 2422 G +17:35133416 2524664923 G G 808 2423 G +17:35133417 2524664924 G 808 2424 C +17:35133418 2524664925 C 809 2425 G +17:35133419 2524664926 G A 809 2426 C +17:35133420 2524664927 C 809 2427 T +17:35133421 2524664928 T 810 2428 T +17:35133422 2524664929 T F 810 2429 T +17:35133423 2524664930 T 810 2430 T +17:35133424 2524664931 T 811 2431 G +17:35133425 2524664932 G G 811 2432 G +17:35133426 2524664933 G 811 2433 C +17:35133427 2524664934 C 812 2434 A +17:35133428 2524664935 A T 812 2435 C +17:35133429 2524664936 C 812 2436 A +17:35133430 2524664937 A 813 2437 G +17:35133431 2524664938 G V 813 2438 T +17:35133432 2524664939 T 813 2439 C +17:35133433 2524664940 C 814 2440 T +17:35133434 2524664941 T Y 814 2441 A +17:35133435 2524664942 A 814 2442 C +17:35133436 2524664943 C 815 2443 A +17:35133437 2524664944 A K 815 2444 A +17:35133438 2524664945 A 815 2445 G +17:35133439 2524664946 G 816 2446 G +17:35133691 2524665198 G G 816 2447 G +17:35133692 2524665199 G 816 2448 C +17:35133693 2524665200 C 817 2449 A +17:35133694 2524665201 A I 817 2450 T +17:35133695 2524665202 T 817 2451 C +17:35133696 2524665203 C 818 2452 T +17:35133697 2524665204 T W 818 2453 G +17:35133698 2524665205 G 818 2454 G +17:35133699 2524665206 G 819 2455 A +17:35133700 2524665207 A I 819 2456 T +17:35133701 2524665208 T 819 2457 C +17:35133702 2524665209 C 820 2458 C +17:35133703 2524665210 C P 820 2459 C +17:35133704 2524665211 C 820 2460 T +17:35133705 2524665212 T 821 2461 G +17:35133706 2524665213 G D 821 2462 A +17:35133707 2524665214 A 821 2463 T +17:35133708 2524665215 T 822 2464 G +17:35133709 2524665216 G G 822 2465 G +17:35133710 2524665217 G 822 2466 G +17:35133711 2524665218 G 823 2467 G +17:35133712 2524665219 G E 823 2468 A +17:35133713 2524665220 A 823 2469 G +17:35133714 2524665221 G 824 2470 A +17:35133715 2524665222 A N 824 2471 A 
+17:35133716 2524665223 A 824 2472 T +17:35133717 2524665224 T 825 2473 G +17:35133718 2524665225 G V 825 2474 T +17:35133719 2524665226 T 825 2475 G +17:35133720 2524665227 G 826 2476 A +17:35133721 2524665228 A K 826 2477 A +17:35133722 2524665229 A 826 2478 A +17:35133723 2524665230 A 827 2479 A +17:35133724 2524665231 A I 827 2480 T +17:35133725 2524665232 T 827 2481 T +17:35133726 2524665233 T 828 2482 C +17:35133727 2524665234 C P 828 2483 C +17:35133728 2524665235 C 828 2484 A +17:35133729 2524665236 A 829 2485 G +17:35133730 2524665237 G V 829 2486 T +17:35133731 2524665238 T 829 2487 G +17:35133732 2524665239 G 830 2488 G +17:35133733 2524665240 G A 830 2489 C +17:35133734 2524665241 C 830 2490 C +17:35133735 2524665242 C 831 2491 A +17:35133736 2524665243 A I 831 2492 T +17:35133737 2524665244 T 831 2493 C +17:35133738 2524665245 C 832 2494 A +17:35133739 2524665246 A K 832 2495 A +17:35133740 2524665247 A 832 2496 A +17:35133741 2524665248 A 833 2497 G +17:35133742 2524665249 G V 833 2498 T +17:35133743 2524665250 T 833 2499 G +17:35133744 2524665251 G 834 2500 T +17:35133745 2524665252 T L 834 2501 T +17:35133746 2524665253 T 834 2502 G +17:35133747 2524665254 G 835 2503 A +17:35133748 2524665255 A R 835 2504 G +17:35133749 2524665256 G 835 2505 G +17:35133750 2524665257 G 836 2506 G +17:35133751 2524665258 G E 836 2507 A +17:35133752 2524665259 A 836 2508 A +17:35133753 2524665260 A 837 2509 A +17:35133754 2524665261 A N 837 2510 A +17:35133755 2524665262 A 837 2511 C +17:35133756 2524665263 C 838 2512 A +17:35133757 2524665264 A T 838 2513 C +17:35133758 2524665265 C 838 2514 A +17:35133759 2524665266 A 839 2515 T +17:35133760 2524665267 T S 839 2516 C +17:35133761 2524665268 C 839 2517 C +17:35133762 2524665269 C 840 2518 C +17:35133763 2524665270 C P 840 2519 C +17:35133764 2524665271 C 840 2520 C +17:35133765 2524665272 C 841 2521 A +17:35133766 2524665273 A K 841 2522 A +17:35133767 2524665274 A 841 2523 A +17:35133768 2524665275 A 842 2524 G 
+17:35133769 2524665276 G A 842 2525 C +17:35133770 2524665277 C 842 2526 C +17:35133771 2524665278 C 843 2527 A +17:35133772 2524665279 A N 843 2528 A +17:35133773 2524665280 A 843 2529 C +17:35133774 2524665281 C 844 2530 A +17:35133775 2524665282 A K 844 2531 A +17:35133776 2524665283 A 844 2532 A +17:35133777 2524665284 A 845 2533 G +17:35133778 2524665285 G E 845 2534 A +17:35133779 2524665286 A 845 2535 A +17:35133780 2524665287 A 846 2536 A +17:35133781 2524665288 A I 846 2537 T +17:35133782 2524665289 T 846 2538 C +17:35133783 2524665290 C 847 2539 T +17:35133784 2524665291 T L 847 2540 T +17:35133785 2524665292 T 847 2541 A +17:35133786 2524665293 A 848 2542 G +17:35133787 2524665294 G D 848 2543 A +17:35133788 2524665295 A 848 2544 C +17:35133789 2524665296 C 849 2545 G +17:35134505 2524666012 G E 849 2546 A +17:35134506 2524666013 A 849 2547 A +17:35134507 2524666014 A 850 2548 G +17:35134508 2524666015 G A 850 2549 C +17:35134509 2524666016 C 850 2550 A +17:35134510 2524666017 A 851 2551 T +17:35134511 2524666018 T Y 851 2552 A +17:35134512 2524666019 A 851 2553 C +17:35134513 2524666020 C 852 2554 G +17:35134514 2524666021 G V 852 2555 T +17:35134515 2524666022 T 852 2556 G +17:35134516 2524666023 G 853 2557 A +17:35134517 2524666024 A M 853 2558 T +17:35134518 2524666025 T 853 2559 G +17:35134519 2524666026 G 854 2560 G +17:35134520 2524666027 G A 854 2561 C +17:35134521 2524666028 C 854 2562 T +17:35134522 2524666029 T 855 2563 G +17:35134523 2524666030 G G 855 2564 G +17:35134524 2524666031 G 855 2565 T +17:35134525 2524666032 T 856 2566 G +17:35134526 2524666033 G V 856 2567 T +17:35134527 2524666034 T 856 2568 G +17:35134528 2524666035 G 857 2569 G +17:35134529 2524666036 G G 857 2570 G +17:35134530 2524666037 G 857 2571 C +17:35134531 2524666038 C 858 2572 T +17:35134532 2524666039 T S 858 2573 C +17:35134533 2524666040 C 858 2574 C +17:35134534 2524666041 C 859 2575 C +17:35134535 2524666042 C P 859 2576 C +17:35134536 2524666043 C 859 2577 A 
+17:35134537 2524666044 A 860 2578 T +17:35134538 2524666045 T Y 860 2579 A +17:35134539 2524666046 A 860 2580 T +17:35134540 2524666047 T 861 2581 G +17:35134541 2524666048 G V 861 2582 T +17:35134542 2524666049 T 861 2583 C +17:35134543 2524666050 C 862 2584 T +17:35134544 2524666051 T S 862 2585 C +17:35134545 2524666052 C 862 2586 C +17:35134546 2524666053 C 863 2587 C +17:35134547 2524666054 C R 863 2588 G +17:35134548 2524666055 G 863 2589 C +17:35134549 2524666056 C 864 2590 C +17:35134550 2524666057 C L 864 2591 T +17:35134551 2524666058 T 864 2592 T +17:35134552 2524666059 T 865 2593 C +17:35134553 2524666060 C L 865 2594 T +17:35134554 2524666061 T 865 2595 G +17:35134555 2524666062 G 866 2596 G +17:35134556 2524666063 G G 866 2597 G +17:35134557 2524666064 G 866 2598 C +17:35134558 2524666065 C 867 2599 A +17:35134559 2524666066 A I 867 2600 T +17:35134560 2524666067 T 867 2601 C +17:35134561 2524666068 C 868 2602 T +17:35134562 2524666069 T C 868 2603 G +17:35134563 2524666070 G 868 2604 C +17:35134564 2524666071 C 869 2605 C +17:35134565 2524666072 C L 869 2606 T +17:35134566 2524666073 T 869 2607 G +17:35134567 2524666074 G 870 2608 A +17:35134568 2524666075 A T 870 2609 C +17:35134569 2524666076 C 870 2610 A +17:35134570 2524666077 A 871 2611 T +17:35134571 2524666078 T S 871 2612 C +17:35134572 2524666079 C 871 2613 C +17:35134573 2524666080 C 872 2614 A +17:35134574 2524666081 A T 872 2615 C +17:35134575 2524666082 C 872 2616 G +17:35134576 2524666083 G 873 2617 G +17:35134577 2524666084 G V 873 2618 T +17:35134578 2524666085 T 873 2619 G +17:35134579 2524666086 G 874 2620 C +17:35134580 2524666087 C Q 874 2621 A +17:35134581 2524666088 A 874 2622 G +17:35134582 2524666089 G 875 2623 C +17:35134583 2524666090 C L 875 2624 T +17:35134584 2524666091 T 875 2625 G +17:35134585 2524666092 G 876 2626 G +17:35134586 2524666093 G V 876 2627 T +17:35134587 2524666094 T 876 2628 G +17:35134588 2524666095 G 877 2629 A +17:35134589 2524666096 A T 877 2630 C 
+17:35134590 2524666097 C 877 2631 A +17:35134591 2524666098 A 878 2632 C +17:35134592 2524666099 C Q 878 2633 A +17:35134593 2524666100 A 878 2634 G +17:35134594 2524666101 G 879 2635 C +17:35134595 2524666102 C L 879 2636 T +17:35134596 2524666103 T 879 2637 T +17:35134597 2524666104 T 880 2638 A +17:35134598 2524666105 A M 880 2639 T +17:35134599 2524666106 T 880 2640 G +17:35134600 2524666107 G 881 2641 C +17:35134601 2524666108 C P 881 2642 C +17:35134602 2524666109 C 881 2643 C +17:35134603 2524666110 C 882 2644 T +17:35134604 2524666111 T Y 882 2645 A +17:35134605 2524666112 A 882 2646 T +17:35134606 2524666113 T 883 2647 G +17:35134607 2524666114 G G 883 2648 G +17:35134608 2524666115 G 883 2649 C +17:35134609 2524666116 C 884 2650 T +17:35134610 2524666117 T C 884 2651 G +17:35134611 2524666118 G 884 2652 C +17:35134612 2524666119 C 885 2653 C +17:35134613 2524666120 C L 885 2654 T +17:35134614 2524666121 T 885 2655 C +17:35134615 2524666122 C 886 2656 T +17:35134616 2524666123 T L 886 2657 T +17:35134617 2524666124 T 886 2658 A +17:35134618 2524666125 A 887 2659 G +17:35134619 2524666126 G D 887 2660 A +17:35134620 2524666127 A 887 2661 C +17:35134621 2524666128 C 888 2662 C +17:35134622 2524666129 C H 888 2663 A +17:35134623 2524666130 A 888 2664 T +17:35134624 2524666131 T 889 2665 G +17:35134625 2524666132 G V 889 2666 T +17:35134626 2524666133 T 889 2667 C +17:35134627 2524666134 C 890 2668 C +17:35134628 2524666135 C R 890 2669 G +17:35134629 2524666136 G 890 2670 G +17:35134630 2524666137 G 891 2671 G +17:35134631 2524666138 G E 891 2672 A +17:35134632 2524666139 A 891 2673 A +17:35134633 2524666140 A 892 2674 A +17:35134634 2524666141 A N 892 2675 A +17:35134635 2524666142 A 892 2676 C +17:35134636 2524666143 C 893 2677 C +17:35134637 2524666144 C R 893 2678 G +17:35134638 2524666145 G 893 2679 C +17:35134639 2524666146 C 894 2680 G +17:35134640 2524666147 G G 894 2681 G +17:35134641 2524666148 G 894 2682 A +17:35134642 2524666149 A 895 2683 C 
+17:35134643 2524666150 C R 895 2684 G +17:35134644 2524666151 G 895 2685 C +17:35134645 2524666152 C 896 2686 C +17:35134646 2524666153 C L 896 2687 T +17:35134647 2524666154 T 896 2688 G +17:35134648 2524666155 G 897 2689 G +17:35134649 2524666156 G G 897 2690 G +17:35134650 2524666157 G 897 2691 C +17:35134651 2524666158 C 898 2692 T +17:35134652 2524666159 T S 898 2693 C +17:35134653 2524666160 C 898 2694 C +17:35134654 2524666161 C 899 2695 C +17:35134655 2524666162 C Q 899 2696 A +17:35134656 2524666163 A 899 2697 G +17:35134657 2524666164 G 900 2698 G +17:35134658 2524666165 G D 900 2699 A +17:35134659 2524666166 A 900 2700 C +17:35134660 2524666167 C 901 2701 C +17:35134661 2524666168 C L 901 2702 T +17:35134662 2524666169 T 901 2703 G +17:35134663 2524666170 G 902 2704 C +17:35134664 2524666171 C L 902 2705 T +17:35134665 2524666172 T 902 2706 G +17:35134666 2524666173 G 903 2707 A +17:35134667 2524666174 A N 903 2708 A +17:35134668 2524666175 A 903 2709 C +17:35134669 2524666176 C 904 2710 T +17:35134670 2524666177 T W 904 2711 G +17:35134671 2524666178 G 904 2712 G +17:35134672 2524666179 G 905 2713 T +17:35134673 2524666180 T C 905 2714 G +17:35134674 2524666181 G 905 2715 T +17:35134675 2524666182 T 906 2716 A +17:35134676 2524666183 A M 906 2717 T +17:35134677 2524666184 T 906 2718 G +17:35134678 2524666185 G 907 2719 C +17:35134679 2524666186 C Q 907 2720 A +17:35134680 2524666187 A 907 2721 G +17:35134681 2524666188 G 908 2722 A +17:35134682 2524666189 A I 908 2723 T +17:35134683 2524666190 T 908 2724 T +17:35134684 2524666191 T 909 2725 G +17:35134685 2524666192 G A 909 2726 C +17:35134686 2524666193 C 909 2727 C +17:35134687 2524666194 C 910 2728 A +17:35134688 2524666195 A K 910 2729 A +17:35134689 2524666196 A 910 2730 G +17:35134690 2524666197 G 911 2731 G +17:35134828 2524666335 G G 911 2732 G +17:35134829 2524666336 G 911 2733 G +17:35134830 2524666337 G 912 2734 A +17:35134831 2524666338 A M 912 2735 T +17:35134832 2524666339 T 912 2736 G 
+17:35134833 2524666340 G 913 2737 A +17:35134834 2524666341 A S 913 2738 G +17:35134835 2524666342 G 913 2739 C +17:35134836 2524666343 C 914 2740 T +17:35134837 2524666344 T Y 914 2741 A +17:35134838 2524666345 A 914 2742 C +17:35134839 2524666346 C 915 2743 C +17:35134840 2524666347 C L 915 2744 T +17:35134841 2524666348 T 915 2745 G +17:35134842 2524666349 G 916 2746 G +17:35134843 2524666350 G E 916 2747 A +17:35134844 2524666351 A 916 2748 G +17:35134845 2524666352 G 917 2749 G +17:35134846 2524666353 G D 917 2750 A +17:35134847 2524666354 A 917 2751 T +17:35134848 2524666355 T 918 2752 G +17:35134849 2524666356 G V 918 2753 T +17:35134850 2524666357 T 918 2754 G +17:35134851 2524666358 G 919 2755 C +17:35134852 2524666359 C R 919 2756 G +17:35134853 2524666360 G 919 2757 G +17:35134854 2524666361 G 920 2758 C +17:35134855 2524666362 C L 920 2759 T +17:35134856 2524666363 T 920 2760 C +17:35134857 2524666364 C 921 2761 G +17:35134858 2524666365 G V 921 2762 T +17:35134859 2524666366 T 921 2763 A +17:35134860 2524666367 A 922 2764 C +17:35134861 2524666368 C H 922 2765 A +17:35134862 2524666369 A 922 2766 C +17:35134863 2524666370 C 923 2767 A +17:35134864 2524666371 A R 923 2768 G +17:35134865 2524666372 G 923 2769 G +17:35134866 2524666373 G 924 2770 G +17:35134867 2524666374 G D 924 2771 A +17:35134868 2524666375 A 924 2772 C +17:35134869 2524666376 C 925 2773 T +17:35134870 2524666377 T L 925 2774 T +17:35134871 2524666378 T 925 2775 G +17:35134872 2524666379 G 926 2776 G +17:35134873 2524666380 G A 926 2777 C +17:35134874 2524666381 C 926 2778 C +17:35134875 2524666382 C 927 2779 G +17:35134876 2524666383 G A 927 2780 C +17:35134877 2524666384 C 927 2781 T +17:35134878 2524666385 T 928 2782 C +17:35134879 2524666386 C R 928 2783 G +17:35134880 2524666387 G 928 2784 G +17:35134881 2524666388 G 929 2785 A +17:35134882 2524666389 A N 929 2786 A +17:35134883 2524666390 A 929 2787 C +17:35134884 2524666391 C 930 2788 G +17:35134885 2524666392 G V 930 2789 T 
+17:35134886 2524666393 T 930 2790 G +17:35134887 2524666394 G 931 2791 C +17:35134888 2524666395 C L 931 2792 T +17:35134889 2524666396 T 931 2793 G +17:35134890 2524666397 G 932 2794 G +17:35134891 2524666398 G V 932 2795 T +17:35134892 2524666399 T 932 2796 C +17:35134893 2524666400 C 933 2797 A +17:35134894 2524666401 A K 933 2798 A +17:35134895 2524666402 A 933 2799 G +17:35134896 2524666403 G 934 2800 A +17:35134897 2524666404 A S 934 2801 G +17:35134898 2524666405 G 934 2802 T +17:35134899 2524666406 T 935 2803 C +17:35134900 2524666407 C P 935 2804 C +17:35134901 2524666408 C 935 2805 C +17:35134902 2524666409 C 936 2806 A +17:35134903 2524666410 A N 936 2807 A +17:35134904 2524666411 A 936 2808 C +17:35134905 2524666412 C 937 2809 C +17:35134906 2524666413 C H 937 2810 A +17:35134907 2524666414 A 937 2811 T +17:35134908 2524666415 T 938 2812 G +17:35134909 2524666416 G V 938 2813 T +17:35134910 2524666417 T 938 2814 C +17:35134911 2524666418 C 939 2815 A +17:35134912 2524666419 A K 939 2816 A +17:35134913 2524666420 A 939 2817 A +17:35134914 2524666421 A 940 2818 A +17:35134915 2524666422 A I 940 2819 T +17:35134916 2524666423 T 940 2820 T +17:35134917 2524666424 T 941 2821 A +17:35134918 2524666425 A T 941 2822 C +17:35134919 2524666426 C 941 2823 A +17:35134920 2524666427 A 942 2824 G +17:35134921 2524666428 G D 942 2825 A +17:35134922 2524666429 A 942 2826 C +17:35134923 2524666430 C 943 2827 T +17:35134924 2524666431 T F 943 2828 T +17:35134925 2524666432 T 943 2829 C +17:35134926 2524666433 C 944 2830 G +17:35134927 2524666434 G G 944 2831 G +17:35134928 2524666435 G 944 2832 G +17:35134929 2524666436 G 945 2833 C +17:35134930 2524666437 C L 945 2834 T +17:35134931 2524666438 T 945 2835 G +17:35134932 2524666439 G 946 2836 G +17:35134933 2524666440 G A 946 2837 C +17:35134934 2524666441 C 946 2838 T +17:35134935 2524666442 T 947 2839 C +17:35134936 2524666443 C R 947 2840 G +17:35134937 2524666444 G 947 2841 G +17:35134938 2524666445 G 948 2842 C 
+17:35134939 2524666446 C L 948 2843 T +17:35134940 2524666447 T 948 2844 G +17:35134941 2524666448 G 949 2845 C +17:35134942 2524666449 C L 949 2846 T +17:35134943 2524666450 T 949 2847 G +17:35134944 2524666451 G 950 2848 G +17:35134945 2524666452 G D 950 2849 A +17:35134946 2524666453 A 950 2850 C +17:35134947 2524666454 C 951 2851 A +17:35134948 2524666455 A I 951 2852 T +17:35134949 2524666456 T 951 2853 T +17:35134950 2524666457 T 952 2854 G +17:35134951 2524666458 G D 952 2855 A +17:35134952 2524666459 A 952 2856 C +17:35134953 2524666460 C 953 2857 G +17:35134954 2524666461 G E 953 2858 A +17:35134955 2524666462 A 953 2859 G +17:35134956 2524666463 G 954 2860 A +17:35134957 2524666464 A T 954 2861 C +17:35134958 2524666465 C 954 2862 A +17:35134959 2524666466 A 955 2863 G +17:35134960 2524666467 G E 955 2864 A +17:35134961 2524666468 A 955 2865 G +17:35134962 2524666469 G 956 2866 T +17:35134963 2524666470 T Y 956 2867 A +17:35134964 2524666471 A 956 2868 C +17:35134965 2524666472 C 957 2869 C +17:35134966 2524666473 C H 957 2870 A +17:35134967 2524666474 A 957 2871 T +17:35134968 2524666475 T 958 2872 G +17:35134969 2524666476 G A 958 2873 C +17:35134970 2524666477 C 958 2874 A +17:35134971 2524666478 A 959 2875 G +17:35134972 2524666479 G D 959 2876 A +17:35134973 2524666480 A 959 2877 T +17:35134974 2524666481 T 960 2878 G +17:35134975 2524666482 G G 960 2879 G +17:35134976 2524666483 G 960 2880 G +17:35134977 2524666484 G 961 2881 G +17:35134978 2524666485 G G 961 2882 G +17:35134979 2524666486 G 961 2883 C +17:35134980 2524666487 C 962 2884 A +17:35134981 2524666488 A K 962 2885 A +17:35134982 2524666489 A 962 2886 G +17:35134983 2524666490 G 963 2887 G +17:35135106 2524666613 G V 963 2888 T +17:35135107 2524666614 T 963 2889 G +17:35135108 2524666615 G 964 2890 C +17:35135109 2524666616 C P 964 2891 C +17:35135110 2524666617 C 964 2892 C +17:35135111 2524666618 C 965 2893 A +17:35135112 2524666619 A I 965 2894 T +17:35135113 2524666620 T 965 2895 C 
+17:35135114 2524666621 C 966 2896 A +17:35135115 2524666622 A K 966 2897 A +17:35135116 2524666623 A 966 2898 G +17:35135117 2524666624 G 967 2899 T +17:35135118 2524666625 T W 967 2900 G +17:35135119 2524666626 G 967 2901 G +17:35135120 2524666627 G 968 2902 A +17:35135121 2524666628 A M 968 2903 T +17:35135122 2524666629 T 968 2904 G +17:35135123 2524666630 G 969 2905 G +17:35135124 2524666631 G A 969 2906 C +17:35135125 2524666632 C 969 2907 G +17:35135126 2524666633 G 970 2908 C +17:35135127 2524666634 C L 970 2909 T +17:35135128 2524666635 T 970 2910 G +17:35135129 2524666636 G 971 2911 G +17:35135130 2524666637 G E 971 2912 A +17:35135131 2524666638 A 971 2913 G +17:35135132 2524666639 G 972 2914 T +17:35135133 2524666640 T S 972 2915 C +17:35135134 2524666641 C 972 2916 C +17:35135135 2524666642 C 973 2917 A +17:35135136 2524666643 A I 973 2918 T +17:35135137 2524666644 T 973 2919 T +17:35135138 2524666645 T 974 2920 C +17:35135139 2524666646 C L 974 2921 T +17:35135140 2524666647 T 974 2922 C +17:35135141 2524666648 C 975 2923 C +17:35135142 2524666649 C R 975 2924 G +17:35135143 2524666650 G 975 2925 C +17:35135144 2524666651 C 976 2926 C +17:35135145 2524666652 C R 976 2927 G +17:35135146 2524666653 G 976 2928 G +17:35135147 2524666654 G 977 2929 C +17:35135148 2524666655 C R 977 2930 G +17:35135149 2524666656 G 977 2931 G +17:35135150 2524666657 G 978 2932 T +17:35135151 2524666658 T F 978 2933 T +17:35135152 2524666659 T 978 2934 C +17:35135153 2524666660 C 979 2935 A +17:35135154 2524666661 A T 979 2936 C +17:35135155 2524666662 C 979 2937 C +17:35135156 2524666663 C 980 2938 C +17:35135157 2524666664 C H 980 2939 A +17:35135158 2524666665 A 980 2940 C +17:35135159 2524666666 C 981 2941 C +17:35135160 2524666667 C Q 981 2942 A +17:35135161 2524666668 A 981 2943 G +17:35135162 2524666669 G 982 2944 A +17:35135163 2524666670 A S 982 2945 G +17:35135164 2524666671 G 982 2946 T +17:35135165 2524666672 T 983 2947 G +17:35135166 2524666673 G D 983 2948 A 
+17:35135167 2524666674 A 983 2949 T +17:35135168 2524666675 T 984 2950 G +17:35135169 2524666676 G V 984 2951 T +17:35135170 2524666677 T 984 2952 G +17:35135171 2524666678 G 985 2953 T +17:35135172 2524666679 T W 985 2954 G +17:35135173 2524666680 G 985 2955 G +17:35135174 2524666681 G 986 2956 A +17:35135175 2524666682 A S 986 2957 G +17:35135176 2524666683 G 986 2958 T +17:35135177 2524666684 T 987 2959 T +17:35135178 2524666685 T Y 987 2960 A +17:35135179 2524666686 A 987 2961 T +17:35135180 2524666687 T 988 2962 G +17:35135181 2524666688 G G 988 2963 G +17:35135486 2524666993 G 988 2964 T +17:35135487 2524666994 T 989 2965 G +17:35135488 2524666995 G V 989 2966 T +17:35135489 2524666996 T 989 2967 G +17:35135490 2524666997 G 990 2968 A +17:35135491 2524666998 A T 990 2969 C +17:35135492 2524666999 C 990 2970 T +17:35135493 2524667000 T 991 2971 G +17:35135494 2524667001 G V 991 2972 T +17:35135495 2524667002 T 991 2973 G +17:35135496 2524667003 G 992 2974 T +17:35135497 2524667004 T W 992 2975 G +17:35135498 2524667005 G 992 2976 G +17:35135499 2524667006 G 993 2977 G +17:35135500 2524667007 G E 993 2978 A +17:35135501 2524667008 A 993 2979 G +17:35135502 2524667009 G 994 2980 C +17:35135503 2524667010 C L 994 2981 T +17:35135504 2524667011 T 994 2982 G +17:35135505 2524667012 G 995 2983 A +17:35135506 2524667013 A M 995 2984 T +17:35135507 2524667014 T 995 2985 G +17:35135508 2524667015 G 996 2986 A +17:35135509 2524667016 A T 996 2987 C +17:35135510 2524667017 C 996 2988 T +17:35135511 2524667018 T 997 2989 T +17:35135512 2524667019 T F 997 2990 T +17:35135513 2524667020 T 997 2991 T +17:35135514 2524667021 T 998 2992 G +17:35135515 2524667022 G G 998 2993 G +17:35135516 2524667023 G 998 2994 G +17:35135517 2524667024 G 999 2995 G +17:35135518 2524667025 G A 999 2996 C +17:35135519 2524667026 C 999 2997 C +17:35135520 2524667027 C 1000 2998 A +17:35135521 2524667028 A K 1000 2999 A +17:35135522 2524667029 A 1000 3000 A +17:35135523 2524667030 A 1001 3001 C 
+17:35135524 2524667031 C P 1001 3002 C +17:35135525 2524667032 C 1001 3003 T +17:35135526 2524667033 T 1002 3004 T +17:35135527 2524667034 T Y 1002 3005 A +17:35135528 2524667035 A 1002 3006 C +17:35135529 2524667036 C 1003 3007 G +17:35135530 2524667037 G D 1003 3008 A +17:35135531 2524667038 A 1003 3009 T +17:35135532 2524667039 T 1004 3010 G +17:35135533 2524667040 G G 1004 3011 G +17:35135534 2524667041 G 1004 3012 G +17:35135535 2524667042 G 1005 3013 A +17:35135536 2524667043 A I 1005 3014 T +17:35135537 2524667044 T 1005 3015 C +17:35135538 2524667045 C 1006 3016 C +17:35135539 2524667046 C P 1006 3017 C +17:35135540 2524667047 C 1006 3018 A +17:35135541 2524667048 A 1007 3019 G +17:35135542 2524667049 G A 1007 3020 C +17:35135543 2524667050 C 1007 3021 C +17:35135544 2524667051 C 1008 3022 C +17:35135545 2524667052 C R 1008 3023 G +17:35135546 2524667053 G 1008 3024 G +17:35135547 2524667054 G 1009 3025 G +17:35135548 2524667055 G E 1009 3026 A +17:35135549 2524667056 A 1009 3027 G +17:35135550 2524667057 G 1010 3028 A +17:35135551 2524667058 A I 1010 3029 T +17:35135552 2524667059 T 1010 3030 C +17:35135553 2524667060 C 1011 3031 C +17:35135554 2524667061 C P 1011 3032 C +17:35135555 2524667062 C 1011 3033 T +17:35135556 2524667063 T 1012 3034 G +17:35135557 2524667064 G D 1012 3035 A +17:35135558 2524667065 A 1012 3036 C +17:35135559 2524667066 C 1013 3037 C +17:35135560 2524667067 C L 1013 3038 T +17:35135561 2524667068 T 1013 3039 G +17:35135562 2524667069 G 1014 3040 C +17:35135563 2524667070 C L 1014 3041 T +17:35135564 2524667071 T 1014 3042 G +17:35135565 2524667072 G 1015 3043 G +17:35135566 2524667073 G E 1015 3044 A +17:35135567 2524667074 A 1015 3045 A +17:35135568 2524667075 A 1016 3046 A +17:35135569 2524667076 A K 1016 3047 A +17:35135570 2524667077 A 1016 3048 G +17:35135571 2524667078 G 1017 3049 G +17:35135572 2524667079 G G 1017 3050 G +17:35135573 2524667080 G 1017 3051 G +17:35135574 2524667081 G 1018 3052 G +17:35135575 2524667082 G E 
1018 3053 A +17:35135576 2524667083 A 1018 3054 G +17:35135577 2524667084 G 1019 3055 C +17:35135578 2524667085 C R 1019 3056 G +17:35135579 2524667086 G 1019 3057 G +17:35135580 2524667087 G 1020 3058 C +17:35135581 2524667088 C L 1020 3059 T +17:35135582 2524667089 T 1020 3060 G +17:35135583 2524667090 G 1021 3061 C +17:35135584 2524667091 C P 1021 3062 C +17:35135585 2524667092 C 1021 3063 C +17:35135586 2524667093 C 1022 3064 C +17:35135587 2524667094 C Q 1022 3065 A +17:35135588 2524667095 A 1022 3066 G +17:35135589 2524667096 G 1023 3067 C +17:35135590 2524667097 C P 1023 3068 C +17:35135591 2524667098 C 1023 3069 C +17:35135592 2524667099 C 1024 3070 C +17:35135593 2524667100 C P 1024 3071 C +17:35135594 2524667101 C 1024 3072 C +17:35135595 2524667102 C 1025 3073 A +17:35135596 2524667103 A I 1025 3074 T +17:35135597 2524667104 T 1025 3075 C +17:35135598 2524667105 C 1026 3076 T +17:35135599 2524667106 T C 1026 3077 G +17:35135600 2524667107 G 1026 3078 C +17:35135601 2524667108 C 1027 3079 A +17:35135602 2524667109 A T 1027 3080 C +17:35135603 2524667110 C 1027 3081 C +17:35135604 2524667111 C 1028 3082 A +17:35135605 2524667112 A I 1028 3083 T +17:35135606 2524667113 T 1028 3084 T +17:35135607 2524667114 T 1029 3085 G +17:35135608 2524667115 G D 1029 3086 A +17:35135609 2524667116 A 1029 3087 T +17:35135610 2524667117 T 1030 3088 G +17:35135611 2524667118 G V 1030 3089 T +17:35135612 2524667119 T 1030 3090 C +17:35135613 2524667120 C 1031 3091 T +17:35135614 2524667121 T Y 1031 3092 A +17:35135615 2524667122 A 1031 3093 C +17:35135616 2524667123 C 1032 3094 A +17:35135617 2524667124 A M 1032 3095 T +17:35135618 2524667125 T 1032 3096 G +17:35135619 2524667126 G 1033 3097 A +17:35135620 2524667127 A I 1033 3098 T +17:35135621 2524667128 T 1033 3099 C +17:35135622 2524667129 C 1034 3100 A +17:35135623 2524667130 A M 1034 3101 T +17:35135624 2524667131 T 1034 3102 G +17:35135625 2524667132 G 1035 3103 G +17:35135626 2524667133 G V 1035 3104 T +17:35135627 
2524667134 T 1035 3105 C +17:35135628 2524667135 C 1036 3106 A +17:35135629 2524667136 A K 1036 3107 A +17:35135630 2524667137 A 1036 3108 A +17:35135631 2524667138 A 1037 3109 T +17:35135632 2524667139 T C 1037 3110 G +17:35136341 2524667848 G 1037 3111 T +17:35136342 2524667849 T 1038 3112 T +17:35136343 2524667850 T W 1038 3113 G +17:35136344 2524667851 G 1038 3114 G +17:35136345 2524667852 G 1039 3115 A +17:35136346 2524667853 A M 1039 3116 T +17:35136347 2524667854 T 1039 3117 G +17:35136348 2524667855 G 1040 3118 A +17:35136349 2524667856 A I 1040 3119 T +17:35136350 2524667857 T 1040 3120 T +17:35136351 2524667858 T 1041 3121 G +17:35136352 2524667859 G D 1041 3122 A +17:35136353 2524667860 A 1041 3123 C +17:35136354 2524667861 C 1042 3124 T +17:35136355 2524667862 T S 1042 3125 C +17:35136356 2524667863 C 1042 3126 T +17:35136357 2524667864 T 1043 3127 G +17:35136358 2524667865 G E 1043 3128 A +17:35136359 2524667866 A 1043 3129 A +17:35136360 2524667867 A 1044 3130 T +17:35136361 2524667868 T C 1044 3131 G +17:35136362 2524667869 G 1044 3132 T +17:35136363 2524667870 T 1045 3133 C +17:35136364 2524667871 C R 1045 3134 G +17:35136365 2524667872 G 1045 3135 G +17:35136366 2524667873 G 1046 3136 C +17:35136367 2524667874 C P 1046 3137 C +17:35136368 2524667875 C 1046 3138 A +17:35136369 2524667876 A 1047 3139 A +17:35136370 2524667877 A R 1047 3140 G +17:35136371 2524667878 G 1047 3141 A +17:35136372 2524667879 A 1048 3142 T +17:35136373 2524667880 T F 1048 3143 T +17:35136374 2524667881 T 1048 3144 C +17:35136375 2524667882 C 1049 3145 C +17:35136376 2524667883 C R 1049 3146 G +17:35136377 2524667884 G 1049 3147 G +17:35136378 2524667885 G 1050 3148 G +17:35136379 2524667886 G E 1050 3149 A +17:35136380 2524667887 A 1050 3150 G +17:35136381 2524667888 G 1051 3151 T +17:35136382 2524667889 T L 1051 3152 T +17:35136383 2524667890 T 1051 3153 G +17:35136384 2524667891 G 1052 3154 G +17:35136385 2524667892 G V 1052 3155 T +17:35136386 2524667893 T 1052 3156 G 
+17:35136387 2524667894 G 1053 3157 T +17:35136388 2524667895 T S 1053 3158 C +17:35136389 2524667896 C 1053 3159 T +17:35136390 2524667897 T 1054 3160 G +17:35136391 2524667898 G E 1054 3161 A +17:35136392 2524667899 A 1054 3162 A +17:35136393 2524667900 A 1055 3163 T +17:35136394 2524667901 T F 1055 3164 T +17:35136395 2524667902 T 1055 3165 C +17:35136396 2524667903 C 1056 3166 T +17:35136397 2524667904 T S 1056 3167 C +17:35136398 2524667905 C 1056 3168 C +17:35136399 2524667906 C 1057 3169 C +17:35136400 2524667907 C R 1057 3170 G +17:35136401 2524667908 G 1057 3171 C +17:35136402 2524667909 C 1058 3172 A +17:35136403 2524667910 A M 1058 3173 T +17:35136404 2524667911 T 1058 3174 G +17:35136405 2524667912 G 1059 3175 G +17:35136406 2524667913 G A 1059 3176 C +17:35136407 2524667914 C 1059 3177 C +17:35136408 2524667915 C 1060 3178 A +17:35136409 2524667916 A R 1060 3179 G +17:35136410 2524667917 G 1060 3180 G +17:35136411 2524667918 G 1061 3181 G +17:35136412 2524667919 G D 1061 3182 A +17:35136413 2524667920 A 1061 3183 C +17:35136414 2524667921 C 1062 3184 C +17:35136415 2524667922 C P 1062 3185 C +17:35136416 2524667923 C 1062 3186 C +17:35136417 2524667924 C 1063 3187 C +17:35136418 2524667925 C Q 1063 3188 A +17:35136419 2524667926 A 1063 3189 G +17:35136420 2524667927 G 1064 3190 C +17:35136421 2524667928 C R 1064 3191 G +17:35136422 2524667929 G 1064 3192 C +17:35136423 2524667930 C 1065 3193 T +17:35136424 2524667931 T F 1065 3194 T +17:35136425 2524667932 T 1065 3195 T +17:35136426 2524667933 T 1066 3196 G +17:35136427 2524667934 G V 1066 3197 T +17:35136428 2524667935 T 1066 3198 G +17:35136429 2524667936 G 1067 3199 G +17:35136430 2524667937 G V 1067 3200 T +17:35136431 2524667938 T 1067 3201 C +17:35136432 2524667939 C 1068 3202 A +17:35136433 2524667940 A I 1068 3203 T +17:35136434 2524667941 T 1068 3204 C +17:35136435 2524667942 C 1069 3205 C +17:35136436 2524667943 C Q 1069 3206 A +17:35136437 2524667944 A 1069 3207 G +17:35136438 2524667945 G 
1070 3208 A +17:35136594 2524668101 A N 1070 3209 A +17:35136595 2524668102 A 1070 3210 T +17:35136596 2524668103 T 1071 3211 G +17:35136597 2524668104 G E 1071 3212 A +17:35136598 2524668105 A 1071 3213 G +17:35136599 2524668106 G 1072 3214 G +17:35136600 2524668107 G D 1072 3215 A +17:35136601 2524668108 A 1072 3216 C +17:35136602 2524668109 C 1073 3217 T +17:35136603 2524668110 T L 1073 3218 T +17:35136604 2524668111 T 1073 3219 G +17:35136605 2524668112 G 1074 3220 G +17:35136606 2524668113 G G 1074 3221 G +17:35136607 2524668114 G 1074 3222 C +17:35136608 2524668115 C 1075 3223 C +17:35136609 2524668116 C P 1075 3224 C +17:35136610 2524668117 C 1075 3225 A +17:35136611 2524668118 A 1076 3226 G +17:35136612 2524668119 G A 1076 3227 C +17:35136613 2524668120 C 1076 3228 C +17:35136614 2524668121 C 1077 3229 A +17:35136615 2524668122 A S 1077 3230 G +17:35136616 2524668123 G 1077 3231 T +17:35136617 2524668124 T 1078 3232 C +17:35136618 2524668125 C P 1078 3233 C +17:35136619 2524668126 C 1078 3234 C +17:35136620 2524668127 C 1079 3235 T +17:35136621 2524668128 T L 1079 3236 T +17:35136622 2524668129 T 1079 3237 G +17:35136623 2524668130 G 1080 3238 G +17:35136624 2524668131 G D 1080 3239 A +17:35136625 2524668132 A 1080 3240 C +17:35136626 2524668133 C 1081 3241 A +17:35136627 2524668134 A S 1081 3242 G +17:35136628 2524668135 G 1081 3243 C +17:35136629 2524668136 C 1082 3244 A +17:35136630 2524668137 A T 1082 3245 C +17:35136631 2524668138 C 1082 3246 C +17:35136632 2524668139 C 1083 3247 T +17:35136633 2524668140 T F 1083 3248 T +17:35136634 2524668141 T 1083 3249 C +17:35136635 2524668142 C 1084 3250 T +17:35136636 2524668143 T Y 1084 3251 A +17:35136637 2524668144 A 1084 3252 C +17:35136638 2524668145 C 1085 3253 C +17:35136639 2524668146 C R 1085 3254 G +17:35136640 2524668147 G 1085 3255 C +17:35136641 2524668148 C 1086 3256 T +17:35136642 2524668149 T S 1086 3257 C +17:35136643 2524668150 C 1086 3258 A +17:35136644 2524668151 A 1087 3259 C +17:35136645 
2524668152 C L 1087 3260 T +17:35136646 2524668153 T 1087 3261 G +17:35136647 2524668154 G 1088 3262 C +17:35136648 2524668155 C L 1088 3263 T +17:35136649 2524668156 T 1088 3264 G +17:35136650 2524668157 G 1089 3265 G +17:35136651 2524668158 G E 1089 3266 A +17:35136652 2524668159 A 1089 3267 G +17:35136653 2524668160 G 1090 3268 G +17:35136654 2524668161 G D 1090 3269 A +17:35136655 2524668162 A 1090 3270 C +17:35136656 2524668163 C 1091 3271 G +17:35136657 2524668164 G D 1091 3272 A +17:35136658 2524668165 A 1091 3273 T +17:35136659 2524668166 T 1092 3274 G +17:35136660 2524668167 G D 1092 3275 A +17:35136661 2524668168 A 1092 3276 C +17:35136662 2524668169 C 1093 3277 A +17:35136663 2524668170 A M 1093 3278 T +17:35136664 2524668171 T 1093 3279 G +17:35136665 2524668172 G 1094 3280 G +17:35136666 2524668173 G G 1094 3281 G +17:35136667 2524668174 G 1094 3282 G +17:35136668 2524668175 G 1095 3283 G +17:35136669 2524668176 G D 1095 3284 A +17:35136670 2524668177 A 1095 3285 C +17:35136671 2524668178 C 1096 3286 C +17:35136672 2524668179 C L 1096 3287 T +17:35136673 2524668180 T 1096 3288 G +17:35136674 2524668181 G 1097 3289 G +17:35136675 2524668182 G V 1097 3290 T +17:35136676 2524668183 T 1097 3291 G +17:35136677 2524668184 G 1098 3292 G +17:35136678 2524668185 G D 1098 3293 A +17:35136679 2524668186 A 1098 3294 T +17:35136680 2524668187 T 1099 3295 G +17:35136681 2524668188 G A 1099 3296 C +17:35136682 2524668189 C 1099 3297 T +17:35136683 2524668190 T 1100 3298 G +17:35136684 2524668191 G E 1100 3299 A +17:35136685 2524668192 A 1100 3300 G +17:35136686 2524668193 G 1101 3301 G +17:35136687 2524668194 G E 1101 3302 A +17:35136688 2524668195 A 1101 3303 G +17:35136689 2524668196 G 1102 3304 T +17:35136690 2524668197 T Y 1102 3305 A +17:35136691 2524668198 A 1102 3306 T +17:35136692 2524668199 T 1103 3307 C +17:35136693 2524668200 C L 1103 3308 T +17:35136694 2524668201 T 1103 3309 G +17:35136695 2524668202 G 1104 3310 G +17:35136696 2524668203 G V 1104 3311 T 
+17:35136697 2524668204 T 1104 3312 A +17:35136698 2524668205 A 1105 3313 C +17:35136699 2524668206 C P 1105 3314 C +17:35136700 2524668207 C 1105 3315 C +17:35136701 2524668208 C 1106 3316 C +17:35136702 2524668209 C Q 1106 3317 A +17:35136703 2524668210 A 1106 3318 G +17:35136704 2524668211 G 1107 3319 C +17:35136705 2524668212 C Q 1107 3320 A +17:35136706 2524668213 A 1107 3321 G +17:35136707 2524668214 G 1108 3322 G +17:35136708 2524668215 G G 1108 3323 G +17:35136709 2524668216 G 1108 3324 C +17:35136710 2524668217 C 1109 3325 T +17:35136711 2524668218 T F 1109 3326 T +17:35136712 2524668219 T 1109 3327 C +17:35136713 2524668220 C 1110 3328 T +17:35136714 2524668221 T F 1110 3329 T +17:35136715 2524668222 T 1110 3330 C +17:35136716 2524668223 C 1111 3331 T +17:35136717 2524668224 T C 1111 3332 G +17:35136718 2524668225 G 1111 3333 T +17:35136719 2524668226 T 1112 3334 C +17:35136720 2524668227 C P 1112 3335 C +17:35136721 2524668228 C 1112 3336 A +17:35136722 2524668229 A 1113 3337 G +17:35136723 2524668230 G D 1113 3338 A +17:35136724 2524668231 A 1113 3339 C +17:35136725 2524668232 C 1114 3340 C +17:35136726 2524668233 C P 1114 3341 C +17:35136727 2524668234 C 1114 3342 T +17:35136728 2524668235 T 1115 3343 G +17:35136729 2524668236 G A 1115 3344 C +17:35136730 2524668237 C 1115 3345 C +17:35136731 2524668238 C 1116 3346 C +17:35136732 2524668239 C P 1116 3347 C +17:35136733 2524668240 C 1116 3348 G +17:35136734 2524668241 G 1117 3349 G +17:35136735 2524668242 G G 1117 3350 G +17:35136736 2524668243 G 1117 3351 C +17:35136737 2524668244 C 1118 3352 G +17:35136738 2524668245 G A 1118 3353 C +17:35136739 2524668246 C 1118 3354 T +17:35136740 2524668247 T 1119 3355 G +17:35136741 2524668248 G G 1119 3356 G +17:35136742 2524668249 G 1119 3357 G +17:35136743 2524668250 G 1120 3358 G +17:35136744 2524668251 G G 1120 3359 G +17:35136745 2524668252 G 1120 3360 C +17:35136746 2524668253 C 1121 3361 A +17:35136747 2524668254 A M 1121 3362 T +17:35136748 2524668255 T 
1121 3363 G +17:35136749 2524668256 G 1122 3364 G +17:35136750 2524668257 G V 1122 3365 T +17:35136751 2524668258 T 1122 3366 C +17:35136752 2524668259 C 1123 3367 C +17:35136753 2524668260 C H 1123 3368 A +17:35136754 2524668261 A 1123 3369 C +17:35136755 2524668262 C 1124 3370 C +17:35136756 2524668263 C H 1124 3371 A +17:35136757 2524668264 A 1124 3372 C +17:35136758 2524668265 C 1125 3373 A +17:35136759 2524668266 A R 1125 3374 G +17:35136760 2524668267 G 1125 3375 G +17:35136761 2524668268 G 1126 3376 C +17:35136762 2524668269 C H 1126 3377 A +17:35136763 2524668270 A 1126 3378 C +17:35136764 2524668271 C 1127 3379 C +17:35136765 2524668272 C R 1127 3380 G +17:35136766 2524668273 G 1127 3381 C +17:35136767 2524668274 C 1128 3382 A +17:35136768 2524668275 A S 1128 3383 G +17:35136769 2524668276 G 1128 3384 C +17:35136770 2524668277 C 1129 3385 T +17:35136771 2524668278 T S 1129 3386 C +17:35136772 2524668279 C 1129 3387 A +17:35136773 2524668280 A 1130 3388 T +17:35136774 2524668281 T S 1130 3389 C +17:35136775 2524668282 C 1130 3390 T +17:35136776 2524668283 T 1131 3391 A +17:35136777 2524668284 A T 1131 3392 C +17:35136778 2524668285 C 1131 3393 C +17:35136779 2524668286 C 1132 3394 A +17:35136780 2524668287 A R 1132 3395 G +17:35136781 2524668288 G 1132 3396 G +17:35136782 2524668289 G 1133 3397 A +17:35137074 2524668581 A S 1133 3398 G +17:35137075 2524668582 G 1133 3399 T +17:35137076 2524668583 T 1134 3400 G +17:35137077 2524668584 G G 1134 3401 G +17:35137078 2524668585 G 1134 3402 C +17:35137079 2524668586 C 1135 3403 G +17:35137080 2524668587 G G 1135 3404 G +17:35137081 2524668588 G 1135 3405 T +17:35137082 2524668589 T 1136 3406 G +17:35137083 2524668590 G G 1136 3407 G +17:35137084 2524668591 G 1136 3408 G +17:35137085 2524668592 G 1137 3409 G +17:35137086 2524668593 G D 1137 3410 A +17:35137087 2524668594 A 1137 3411 C +17:35137088 2524668595 C 1138 3412 C +17:35137089 2524668596 C L 1138 3413 T +17:35137090 2524668597 T 1138 3414 G +17:35137091 
2524668598 G 1139 3415 A +17:35137092 2524668599 A T 1139 3416 C +17:35137093 2524668600 C 1139 3417 A +17:35137094 2524668601 A 1140 3418 C +17:35137095 2524668602 C L 1140 3419 T +17:35137096 2524668603 T 1140 3420 A +17:35137097 2524668604 A 1141 3421 G +17:35137098 2524668605 G G 1141 3422 G +17:35137099 2524668606 G 1141 3423 G +17:35137100 2524668607 G 1142 3424 C +17:35137101 2524668608 C L 1142 3425 T +17:35137102 2524668609 T 1142 3426 G +17:35137103 2524668610 G 1143 3427 G +17:35137104 2524668611 G E 1143 3428 A +17:35137105 2524668612 A 1143 3429 G +17:35137106 2524668613 G 1144 3430 C +17:35137107 2524668614 C P 1144 3431 C +17:35137108 2524668615 C 1144 3432 C +17:35137109 2524668616 C 1145 3433 T +17:35137110 2524668617 T S 1145 3434 C +17:35137111 2524668618 C 1145 3435 T +17:35137112 2524668619 T 1146 3436 G +17:35137113 2524668620 G E 1146 3437 A +17:35137114 2524668621 A 1146 3438 A +17:35137115 2524668622 A 1147 3439 G +17:35137116 2524668623 G E 1147 3440 A +17:35137117 2524668624 A 1147 3441 G +17:35137118 2524668625 G 1148 3442 G +17:35137119 2524668626 G E 1148 3443 A +17:35137120 2524668627 A 1148 3444 G +17:35137121 2524668628 G 1149 3445 G +17:35137122 2524668629 G A 1149 3446 C +17:35137123 2524668630 C 1149 3447 C +17:35137124 2524668631 C 1150 3448 C +17:35137125 2524668632 C P 1150 3449 C +17:35137126 2524668633 C 1150 3450 C +17:35137127 2524668634 C 1151 3451 A +17:35137128 2524668635 A R 1151 3452 G +17:35137129 2524668636 G 1151 3453 G +17:35137130 2524668637 G 1152 3454 T +17:35137131 2524668638 T S 1152 3455 C +17:35137132 2524668639 C 1152 3456 T +17:35137133 2524668640 T 1153 3457 C +17:35137134 2524668641 C P 1153 3458 C +17:35137135 2524668642 C 1153 3459 A +17:35137136 2524668643 A 1154 3460 C +17:35137137 2524668644 C L 1154 3461 T +17:35137138 2524668645 T 1154 3462 G +17:35137139 2524668646 G 1155 3463 G +17:35137140 2524668647 G A 1155 3464 C +17:35137141 2524668648 C 1155 3465 A +17:35137142 2524668649 A 1156 3466 C 
+17:35137143 2524668650 C P 1156 3467 C +17:35137144 2524668651 C 1156 3468 C +17:35137145 2524668652 C 1157 3469 T +17:35137146 2524668653 T S 1157 3470 C +17:35137147 2524668654 C 1157 3471 C +17:35137148 2524668655 C 1158 3472 G +17:35137149 2524668656 G E 1158 3473 A +17:35137150 2524668657 A 1158 3474 A +17:35137151 2524668658 A 1159 3475 G +17:35137152 2524668659 G G 1159 3476 G +17:35137153 2524668660 G 1159 3477 G +17:35137154 2524668661 G 1160 3478 G +17:35137155 2524668662 G A 1160 3479 C +17:35137156 2524668663 C 1160 3480 T +17:35137157 2524668664 T 1161 3481 G +17:35137158 2524668665 G G 1161 3482 G +17:35137159 2524668666 G 1161 3483 C +17:35137160 2524668667 C 1162 3484 T +17:35137161 2524668668 T S 1162 3485 C +17:35137162 2524668669 C 1162 3486 C +17:35137163 2524668670 C 1163 3487 G +17:35137164 2524668671 G D 1163 3488 A +17:35137165 2524668672 A 1163 3489 T +17:35137166 2524668673 T 1164 3490 G +17:35137167 2524668674 G V 1164 3491 T +17:35137168 2524668675 T 1164 3492 A +17:35137169 2524668676 A 1165 3493 T +17:35137170 2524668677 T F 1165 3494 T +17:35137171 2524668678 T 1165 3495 T +17:35137172 2524668679 T 1166 3496 G +17:35137173 2524668680 G D 1166 3497 A +17:35137174 2524668681 A 1166 3498 T +17:35137175 2524668682 T 1167 3499 G +17:35137176 2524668683 G G 1167 3500 G +17:35137177 2524668684 G 1167 3501 T +17:35137178 2524668685 T 1168 3502 G +17:35137179 2524668686 G D 1168 3503 A +17:35137180 2524668687 A 1168 3504 C +17:35137181 2524668688 C 1169 3505 C +17:35137182 2524668689 C L 1169 3506 T +17:35137183 2524668690 T 1169 3507 G +17:35137184 2524668691 G 1170 3508 G +17:35137185 2524668692 G G 1170 3509 G +17:35137186 2524668693 G 1170 3510 A +17:35137187 2524668694 A 1171 3511 A +17:35137188 2524668695 A M 1171 3512 T +17:35137189 2524668696 T 1171 3513 G +17:35137190 2524668697 G 1172 3514 G +17:35137191 2524668698 G G 1172 3515 G +17:35137192 2524668699 G 1172 3516 G +17:35137193 2524668700 G 1173 3517 G +17:35137194 2524668701 G A 
1173 3518 C +17:35137195 2524668702 C 1173 3519 A +17:35137196 2524668703 A 1174 3520 G +17:35137197 2524668704 G A 1174 3521 C +17:35137198 2524668705 C 1174 3522 C +17:35137199 2524668706 C 1175 3523 A +17:35137200 2524668707 A K 1175 3524 A +17:35137201 2524668708 A 1175 3525 G +17:35137202 2524668709 G 1176 3526 G +17:35137203 2524668710 G G 1176 3527 G +17:35137204 2524668711 G 1176 3528 G +17:35137205 2524668712 G 1177 3529 C +17:35137206 2524668713 C L 1177 3530 T +17:35137207 2524668714 T 1177 3531 G +17:35137208 2524668715 G 1178 3532 C +17:35137209 2524668716 C Q 1178 3533 A +17:35137210 2524668717 A 1178 3534 A +17:35137211 2524668718 A 1179 3535 A +17:35137212 2524668719 A S 1179 3536 G +17:35137213 2524668720 G 1179 3537 C +17:35137214 2524668721 C 1180 3538 C +17:35137215 2524668722 C L 1180 3539 T +17:35137216 2524668723 T 1180 3540 C +17:35137217 2524668724 C 1181 3541 C +17:35137218 2524668725 C P 1181 3542 C +17:35137219 2524668726 C 1181 3543 C +17:35137220 2524668727 C 1182 3544 A +17:35137221 2524668728 A T 1182 3545 C +17:35137222 2524668729 C 1182 3546 A +17:35137223 2524668730 A 1183 3547 C +17:35137224 2524668731 C H 1183 3548 A +17:35137225 2524668732 A 1183 3549 T +17:35137226 2524668733 T 1184 3550 G +17:35137227 2524668734 G D 1184 3551 A +17:35137228 2524668735 A 1184 3552 C +17:35137229 2524668736 C 1185 3553 C +17:35137230 2524668737 C P 1185 3554 C +17:35137231 2524668738 C 1185 3555 C +17:35137232 2524668739 C 1186 3556 A +17:35137233 2524668740 A S 1186 3557 G +17:35137234 2524668741 G 1186 3558 C +17:35137235 2524668742 C 1187 3559 C +17:35137236 2524668743 C P 1187 3560 C +17:35137237 2524668744 C 1187 3561 T +17:35137238 2524668745 T 1188 3562 C +17:35137239 2524668746 C L 1188 3563 T +17:35137240 2524668747 T 1188 3564 A +17:35137241 2524668748 A 1189 3565 C +17:35137242 2524668749 C Q 1189 3566 A +17:35137243 2524668750 A 1189 3567 G +17:35137244 2524668751 G 1190 3568 C +17:35137245 2524668752 C R 1190 3569 G +17:35137246 
2524668753 G 1190 3570 G +17:35137247 2524668754 G 1191 3571 T +17:35137248 2524668755 T Y 1191 3572 A +17:35137249 2524668756 A 1191 3573 C +17:35137250 2524668757 C 1192 3574 A +17:35137251 2524668758 A S 1192 3575 G +17:35137252 2524668759 G 1192 3576 T +17:35137253 2524668760 T 1193 3577 G +17:35137254 2524668761 G E 1193 3578 A +17:35137255 2524668762 A 1193 3579 G +17:35137256 2524668763 G 1194 3580 G +17:35137257 2524668764 G D 1194 3581 A +17:35137258 2524668765 A 1194 3582 C +17:35137259 2524668766 C 1195 3583 C +17:35137260 2524668767 C P 1195 3584 C +17:35137261 2524668768 C 1195 3585 C +17:35137262 2524668769 C 1196 3586 A +17:35137263 2524668770 A T 1196 3587 C +17:35137264 2524668771 C 1196 3588 A +17:35137265 2524668772 A 1197 3589 G +17:35137266 2524668773 G V 1197 3590 T +17:35137267 2524668774 T 1197 3591 A +17:35137268 2524668775 A 1198 3592 C +17:35137269 2524668776 C P 1198 3593 C +17:35137270 2524668777 C 1198 3594 C +17:35137271 2524668778 C 1199 3595 C +17:35137272 2524668779 C L 1199 3596 T +17:35137273 2524668780 T 1199 3597 G +17:35137274 2524668781 G 1200 3598 C +17:35137275 2524668782 C P 1200 3599 C +17:35137276 2524668783 C 1200 3600 C +17:35137277 2524668784 C 1201 3601 T +17:35137278 2524668785 T S 1201 3602 C +17:35137279 2524668786 C 1201 3603 T +17:35137280 2524668787 T 1202 3604 G +17:35137281 2524668788 G E 1202 3605 A +17:35137282 2524668789 A 1202 3606 G +17:35137283 2524668790 G 1203 3607 A +17:35137284 2524668791 A T 1203 3608 C +17:35137285 2524668792 C 1203 3609 T +17:35137286 2524668793 T 1204 3610 G +17:35137287 2524668794 G D 1204 3611 A +17:35137288 2524668795 A 1204 3612 T +17:35137289 2524668796 T 1205 3613 G +17:35137290 2524668797 G G 1205 3614 G +17:35137291 2524668798 G 1205 3615 C +17:35137292 2524668799 C 1206 3616 T +17:35137293 2524668800 T Y 1206 3617 A +17:35137294 2524668801 A 1206 3618 C +17:35137295 2524668802 C 1207 3619 G +17:35137296 2524668803 G V 1207 3620 T +17:35137297 2524668804 T 1207 3621 T 
+17:35137298 2524668805 T 1208 3622 G +17:35137299 2524668806 G A 1208 3623 C +17:35137300 2524668807 C 1208 3624 C +17:35137301 2524668808 C 1209 3625 C +17:35137302 2524668809 C P 1209 3626 C +17:35137303 2524668810 C 1209 3627 C +17:35137304 2524668811 C 1210 3628 C +17:35137305 2524668812 C L 1210 3629 T +17:35137306 2524668813 T 1210 3630 G +17:35137307 2524668814 G 1211 3631 A +17:35137308 2524668815 A T 1211 3632 C +17:35137309 2524668816 C 1211 3633 C +17:35137310 2524668817 C 1212 3634 T +17:35137311 2524668818 T C 1212 3635 G +17:35137312 2524668819 G 1212 3636 C +17:35137313 2524668820 C 1213 3637 A +17:35137314 2524668821 A S 1213 3638 G +17:35137315 2524668822 G 1213 3639 C +17:35137316 2524668823 C 1214 3640 C +17:35137317 2524668824 C P 1214 3641 C +17:35137318 2524668825 C 1214 3642 C +17:35137319 2524668826 C 1215 3643 C +17:35137320 2524668827 C Q 1215 3644 A +17:35137321 2524668828 A 1215 3645 G +17:35137322 2524668829 G 1216 3646 C +17:35137323 2524668830 C P 1216 3647 C +17:35137324 2524668831 C 1216 3648 T +17:35137325 2524668832 T 1217 3649 G +17:35137326 2524668833 G E 1217 3650 A +17:35137468 2524668975 A 1217 3651 A +17:35137469 2524668976 A 1218 3652 T +17:35137470 2524668977 T Y 1218 3653 A +17:35137471 2524668978 A 1218 3654 T +17:35137472 2524668979 T 1219 3655 G +17:35137473 2524668980 G V 1219 3656 T +17:35137474 2524668981 T 1219 3657 G +17:35137475 2524668982 G 1220 3658 A +17:35137476 2524668983 A N 1220 3659 A +17:35137477 2524668984 A 1220 3660 C +17:35137478 2524668985 C 1221 3661 C +17:35137479 2524668986 C Q 1221 3662 A +17:35137480 2524668987 A 1221 3663 G +17:35137481 2524668988 G 1222 3664 C +17:35137482 2524668989 C P 1222 3665 C +17:35137483 2524668990 C 1222 3666 A +17:35137484 2524668991 A 1223 3667 G +17:35137485 2524668992 G D 1223 3668 A +17:35137486 2524668993 A 1223 3669 T +17:35137487 2524668994 T 1224 3670 G +17:35137488 2524668995 G V 1224 3671 T +17:35137489 2524668996 T 1224 3672 T +17:35137490 2524668997 T 
1225 3673 C +17:35137491 2524668998 C R 1225 3674 G +17:35137492 2524668999 G 1225 3675 G +17:35137493 2524669000 G 1226 3676 C +17:35137494 2524669001 C P 1226 3677 C +17:35137495 2524669002 C 1226 3678 C +17:35137496 2524669003 C 1227 3679 C +17:35137497 2524669004 C Q 1227 3680 A +17:35137498 2524669005 A 1227 3681 G +17:35137499 2524669006 G 1228 3682 C +17:35137500 2524669007 C P 1228 3683 C +17:35137501 2524669008 C 1228 3684 C +17:35137502 2524669009 C 1229 3685 C +17:35137503 2524669010 C P 1229 3686 C +17:35137504 2524669011 C 1229 3687 T +17:35137505 2524669012 T 1230 3688 T +17:35137506 2524669013 T S 1230 3689 C +17:35137507 2524669014 C 1230 3690 G +17:35137508 2524669015 G 1231 3691 C +17:35137509 2524669016 C P 1231 3692 C +17:35137510 2524669017 C 1231 3693 C +17:35137511 2524669018 C 1232 3694 C +17:35137512 2524669019 C R 1232 3695 G +17:35137513 2524669020 G 1232 3696 A +17:35137514 2524669021 A 1233 3697 G +17:35137515 2524669022 G E 1233 3698 A +17:35137516 2524669023 A 1233 3699 G +17:35137517 2524669024 G 1234 3700 G +17:35137518 2524669025 G G 1234 3701 G +17:35137519 2524669026 G 1234 3702 C +17:35137520 2524669027 C 1235 3703 C +17:35137521 2524669028 C P 1235 3704 C +17:35137522 2524669029 C 1235 3705 T +17:35137523 2524669030 T 1236 3706 C +17:35137524 2524669031 C L 1236 3707 T +17:35137525 2524669032 T 1236 3708 G +17:35137526 2524669033 G 1237 3709 C +17:35137527 2524669034 C P 1237 3710 C +17:35137528 2524669035 C 1237 3711 T +17:35137529 2524669036 T 1238 3712 G +17:35137530 2524669037 G A 1238 3713 C +17:35137531 2524669038 C 1238 3714 T +17:35137532 2524669039 T 1239 3715 G +17:35137533 2524669040 G A 1239 3716 C +17:35137534 2524669041 C 1239 3717 C +17:35137535 2524669042 C 1240 3718 C +17:35137536 2524669043 C R 1240 3719 G +17:35137537 2524669044 G 1240 3720 A +17:35137538 2524669045 A 1241 3721 C +17:35137539 2524669046 C P 1241 3722 C +17:35137540 2524669047 C 1241 3723 T +17:35137541 2524669048 T 1242 3724 G +17:35137542 
2524669049 G A 1242 3725 C +17:35137543 2524669050 C 1242 3726 T +17:35137544 2524669051 T 1243 3727 G +17:35137545 2524669052 G G 1243 3728 G +17:35137546 2524669053 G 1243 3729 T +17:35137547 2524669054 T 1244 3730 G +17:35137548 2524669055 G A 1244 3731 C +17:35137549 2524669056 C 1244 3732 C +17:35137550 2524669057 C 1245 3733 A +17:35137551 2524669058 A T 1245 3734 C +17:35137552 2524669059 C 1245 3735 T +17:35137553 2524669060 T 1246 3736 C +17:35137554 2524669061 C L 1246 3737 T +17:35137555 2524669062 T 1246 3738 G +17:35137556 2524669063 G 1247 3739 G +17:35137557 2524669064 G E 1247 3740 A +17:35137558 2524669065 A 1247 3741 A +17:35137559 2524669066 A 1248 3742 A +17:35137560 2524669067 A R 1248 3743 G +17:35137561 2524669068 G 1248 3744 G +17:35137562 2524669069 G 1249 3745 C +17:35137563 2524669070 C P 1249 3746 C +17:35137564 2524669071 C 1249 3747 C +17:35137565 2524669072 C 1250 3748 A +17:35137566 2524669073 A K 1250 3749 A +17:35137567 2524669074 A 1250 3750 G +17:35137568 2524669075 G 1251 3751 A +17:35137569 2524669076 A T 1251 3752 C +17:35137570 2524669077 C 1251 3753 T +17:35137571 2524669078 T 1252 3754 C +17:35137572 2524669079 C L 1252 3755 T +17:35137573 2524669080 T 1252 3756 C +17:35137574 2524669081 C 1253 3757 T +17:35137575 2524669082 T S 1253 3758 C +17:35137576 2524669083 C 1253 3759 C +17:35137577 2524669084 C 1254 3760 C +17:35137578 2524669085 C P 1254 3761 C +17:35137579 2524669086 C 1254 3762 A +17:35137580 2524669087 A 1255 3763 G +17:35137581 2524669088 G G 1255 3764 G +17:35137582 2524669089 G 1255 3765 G +17:35137583 2524669090 G 1256 3766 A +17:35137584 2524669091 A K 1256 3767 A +17:35137585 2524669092 A 1256 3768 G +17:35137586 2524669093 G 1257 3769 A +17:35137587 2524669094 A N 1257 3770 A +17:35137588 2524669095 A 1257 3771 T +17:35137589 2524669096 T 1258 3772 G +17:35137590 2524669097 G G 1258 3773 G +17:35137591 2524669098 G 1258 3774 G +17:35137592 2524669099 G 1259 3775 G +17:35137593 2524669100 G V 1259 3776 T 
+17:35137594 2524669101 T 1259 3777 C +17:35137595 2524669102 C 1260 3778 G +17:35137596 2524669103 G V 1260 3779 T +17:35137597 2524669104 T 1260 3780 C +17:35137598 2524669105 C 1261 3781 A +17:35137599 2524669106 A K 1261 3782 A +17:35137600 2524669107 A 1261 3783 A +17:35137601 2524669108 A 1262 3784 G +17:35137602 2524669109 G D 1262 3785 A +17:35137603 2524669110 A 1262 3786 C +17:35137604 2524669111 C 1263 3787 G +17:35137605 2524669112 G V 1263 3788 T +17:35137606 2524669113 T 1263 3789 T +17:35137607 2524669114 T 1264 3790 T +17:35137608 2524669115 T F 1264 3791 T +17:35137609 2524669116 T 1264 3792 T +17:35137610 2524669117 T 1265 3793 G +17:35137611 2524669118 G A 1265 3794 C +17:35137612 2524669119 C 1265 3795 C +17:35137613 2524669120 C 1266 3796 T +17:35137614 2524669121 T F 1266 3797 T +17:35137615 2524669122 T 1266 3798 T +17:35137616 2524669123 T 1267 3799 G +17:35137617 2524669124 G G 1267 3800 G +17:35137618 2524669125 G 1267 3801 G +17:35137619 2524669126 G 1268 3802 G +17:35137620 2524669127 G G 1268 3803 G +17:35137621 2524669128 G 1268 3804 T +17:35137622 2524669129 T 1269 3805 G +17:35137623 2524669130 G A 1269 3806 C +17:35137624 2524669131 C 1269 3807 C +17:35137625 2524669132 C 1270 3808 G +17:35137626 2524669133 G V 1270 3809 T +17:35137627 2524669134 T 1270 3810 G +17:35137628 2524669135 G 1271 3811 G +17:35137629 2524669136 G E 1271 3812 A +17:35137630 2524669137 A 1271 3813 G +17:35137631 2524669138 G 1272 3814 A +17:35137632 2524669139 A N 1272 3815 A +17:35137633 2524669140 A 1272 3816 C +17:35137634 2524669141 C 1273 3817 C +17:35137635 2524669142 C P 1273 3818 C +17:35137636 2524669143 C 1273 3819 C +17:35137637 2524669144 C 1274 3820 G +17:35137638 2524669145 G E 1274 3821 A +17:35137639 2524669146 A 1274 3822 G +17:35137640 2524669147 G 1275 3823 T +17:35137641 2524669148 T Y 1275 3824 A +17:35137642 2524669149 A 1275 3825 C +17:35137643 2524669150 C 1276 3826 T +17:35137644 2524669151 T L 1276 3827 T +17:35137645 2524669152 T 
1276 3828 G +17:35137646 2524669153 G 1277 3829 A +17:35137647 2524669154 A T 1277 3830 C +17:35137648 2524669155 C 1277 3831 A +17:35137649 2524669156 A 1278 3832 C +17:35137650 2524669157 C P 1278 3833 C +17:35137651 2524669158 C 1278 3834 C +17:35137652 2524669159 C 1279 3835 C +17:35137653 2524669160 C Q 1279 3836 A +17:35137654 2524669161 A 1279 3837 G +17:35137655 2524669162 G 1280 3838 G +17:35137656 2524669163 G G 1280 3839 G +17:35137657 2524669164 G 1280 3840 A +17:35137658 2524669165 A 1281 3841 G +17:35137659 2524669166 G G 1281 3842 G +17:35137660 2524669167 G 1281 3843 A +17:35137661 2524669168 A 1282 3844 G +17:35137662 2524669169 G A 1282 3845 C +17:35137663 2524669170 C 1282 3846 T +17:35137664 2524669171 T 1283 3847 G +17:35137665 2524669172 G A 1283 3848 C +17:35137666 2524669173 C 1283 3849 C +17:35137667 2524669174 C 1284 3850 C +17:35137668 2524669175 C P 1284 3851 C +17:35137669 2524669176 C 1284 3852 T +17:35137670 2524669177 T 1285 3853 C +17:35137671 2524669178 C Q 1285 3854 A +17:35137672 2524669179 A 1285 3855 G +17:35137673 2524669180 G 1286 3856 C +17:35137674 2524669181 C P 1286 3857 C +17:35137675 2524669182 C 1286 3858 C +17:35137676 2524669183 C 1287 3859 C +17:35137677 2524669184 C H 1287 3860 A +17:35137678 2524669185 A 1287 3861 C +17:35137679 2524669186 C 1288 3862 C +17:35137680 2524669187 C P 1288 3863 C +17:35137681 2524669188 C 1288 3864 T +17:35137682 2524669189 T 1289 3865 C +17:35137683 2524669190 C P 1289 3866 C +17:35137684 2524669191 C 1289 3867 T +17:35137685 2524669192 T 1290 3868 C +17:35137686 2524669193 C P 1290 3869 C +17:35137687 2524669194 C 1290 3870 T +17:35137688 2524669195 T 1291 3871 G +17:35137689 2524669196 G A 1291 3872 C +17:35137690 2524669197 C 1291 3873 C +17:35137691 2524669198 C 1292 3874 T +17:35137692 2524669199 T F 1292 3875 T +17:35137693 2524669200 T 1292 3876 C +17:35137694 2524669201 C 1293 3877 A +17:35137695 2524669202 A S 1293 3878 G +17:35137696 2524669203 G 1293 3879 C +17:35137697 
2524669204 C 1294 3880 C +17:35137698 2524669205 C P 1294 3881 C +17:35137699 2524669206 C 1294 3882 A +17:35137700 2524669207 A 1295 3883 G +17:35137701 2524669208 G A 1295 3884 C +17:35137702 2524669209 C 1295 3885 C +17:35137703 2524669210 C 1296 3886 T +17:35137704 2524669211 T F 1296 3887 T +17:35137705 2524669212 T 1296 3888 C +17:35137706 2524669213 C 1297 3889 G +17:35137707 2524669214 G D 1297 3890 A +17:35137708 2524669215 A 1297 3891 C +17:35137709 2524669216 C 1298 3892 A +17:35137710 2524669217 A N 1298 3893 A +17:35137711 2524669218 A 1298 3894 C +17:35137712 2524669219 C 1299 3895 C +17:35137713 2524669220 C L 1299 3896 T +17:35137714 2524669221 T 1299 3897 C +17:35137715 2524669222 C 1300 3898 T +17:35137716 2524669223 T Y 1300 3899 A +17:35137717 2524669224 A 1300 3900 T +17:35137718 2524669225 T 1301 3901 T +17:35137719 2524669226 T Y 1301 3902 A +17:35137720 2524669227 A 1301 3903 C +17:35137721 2524669228 C 1302 3904 T +17:35137722 2524669229 T W 1302 3905 G +17:35137723 2524669230 G 1302 3906 G +17:35137724 2524669231 G 1303 3907 G +17:35137725 2524669232 G D 1303 3908 A +17:35137726 2524669233 A 1303 3909 C +17:35137727 2524669234 C 1304 3910 C +17:35137728 2524669235 C Q 1304 3911 A +17:35137729 2524669236 A 1304 3912 G +17:35137730 2524669237 G 1305 3913 G +17:35137731 2524669238 G D 1305 3914 A +17:35137732 2524669239 A 1305 3915 C +17:35137733 2524669240 C 1306 3916 C +17:35137734 2524669241 C P 1306 3917 C +17:35137735 2524669242 C 1306 3918 A +17:35137736 2524669243 A 1307 3919 C +17:35137737 2524669244 C P 1307 3920 C +17:35137738 2524669245 C 1307 3921 A +17:35137739 2524669246 A 1308 3922 G +17:35137740 2524669247 G E 1308 3923 A +17:35137741 2524669248 A 1308 3924 G +17:35137742 2524669249 G 1309 3925 C +17:35137743 2524669250 C R 1309 3926 G +17:35137744 2524669251 G 1309 3927 G +17:35137745 2524669252 G 1310 3928 G +17:35137746 2524669253 G G 1310 3929 G +17:35137747 2524669254 G 1310 3930 G +17:35137748 2524669255 G 1311 3931 G 
+17:35137749 2524669256 G A 1311 3932 C +17:35137750 2524669257 C 1311 3933 T +17:35137751 2524669258 T 1312 3934 C +17:35137752 2524669259 C P 1312 3935 C +17:35137753 2524669260 C 1312 3936 A +17:35137754 2524669261 A 1313 3937 C +17:35137755 2524669262 C P 1313 3938 C +17:35137756 2524669263 C 1313 3939 C +17:35137757 2524669264 C 1314 3940 A +17:35137758 2524669265 A S 1314 3941 G +17:35137759 2524669266 G 1314 3942 C +17:35137760 2524669267 C 1315 3943 A +17:35137761 2524669268 A T 1315 3944 C +17:35137762 2524669269 C 1315 3945 C +17:35137763 2524669270 C 1316 3946 T +17:35137764 2524669271 T F 1316 3947 T +17:35137765 2524669272 T 1316 3948 C +17:35137766 2524669273 C 1317 3949 A +17:35137767 2524669274 A K 1317 3950 A +17:35137768 2524669275 A 1317 3951 A +17:35137769 2524669276 A 1318 3952 G +17:35137770 2524669277 G G 1318 3953 G +17:35137771 2524669278 G 1318 3954 G +17:35137772 2524669279 G 1319 3955 A +17:35137773 2524669280 A T 1319 3956 C +17:35137774 2524669281 C 1319 3957 A +17:35137775 2524669282 A 1320 3958 C +17:35137776 2524669283 C P 1320 3959 C +17:35137777 2524669284 C 1320 3960 T +17:35137778 2524669285 T 1321 3961 A +17:35137779 2524669286 A T 1321 3962 C +17:35137780 2524669287 C 1321 3963 G +17:35137781 2524669288 G 1322 3964 G +17:35137782 2524669289 G A 1322 3965 C +17:35137783 2524669290 C 1322 3966 A +17:35137784 2524669291 A 1323 3967 G +17:35137785 2524669292 G E 1323 3968 A +17:35137786 2524669293 A 1323 3969 G +17:35137787 2524669294 G 1324 3970 A +17:35137788 2524669295 A N 1324 3971 A +17:35137789 2524669296 A 1324 3972 C +17:35137790 2524669297 C 1325 3973 C +17:35137791 2524669298 C P 1325 3974 C +17:35137792 2524669299 C 1325 3975 A +17:35137793 2524669300 A 1326 3976 G +17:35137794 2524669301 G E 1326 3977 A +17:35137795 2524669302 A 1326 3978 G +17:35137796 2524669303 G 1327 3979 T +17:35137797 2524669304 T Y 1327 3980 A +17:35137798 2524669305 A 1327 3981 C +17:35137799 2524669306 C 1328 3982 C +17:35137800 2524669307 C L 
1328 3983 T +17:35137801 2524669308 T 1328 3984 G +17:35137802 2524669309 G 1329 3985 G +17:35137803 2524669310 G G 1329 3986 G +17:35137804 2524669311 G 1329 3987 T +17:35137805 2524669312 T 1330 3988 C +17:35137806 2524669313 C L 1330 3989 T +17:35137807 2524669314 T 1330 3990 G +17:35137808 2524669315 G 1331 3991 G +17:35137809 2524669316 G D 1331 3992 A +17:35137810 2524669317 A 1331 3993 C +17:35137811 2524669318 C 1332 3994 G +17:35137812 2524669319 G V 1332 3995 T +17:35137813 2524669320 T 1332 3996 G +17:35137814 2524669321 G 1333 3997 C +17:35137815 2524669322 C P 1333 3998 C +17:35137816 2524669323 C 1333 3999 A +17:35137817 2524669324 A 1334 4000 G +17:35137818 2524669325 G V 1334 4001 T +17:35137819 2524669326 T 1334 4002 G +17:35137820 2524669327 G 1335 4003 T +17:35137821 2524669328 T * 1335 4004 G +17:35137822 2524669329 G 1335 4005 A +17:35137823 2524669330 A 1335 4006 A +17:35137824 2524669331 A 1335 4007 C +17:35137825 2524669332 C 1335 4008 C +17:35137826 2524669333 C 1335 4009 A +17:35137827 2524669334 A 1335 4010 G +17:35137828 2524669335 G 1335 4011 A +17:35137829 2524669336 A 1335 4012 A +17:35137830 2524669337 A 1335 4013 G +17:35137831 2524669338 G 1335 4014 G +17:35137832 2524669339 G 1335 4015 C +17:35137833 2524669340 C 1335 4016 C +17:35137834 2524669341 C 1335 4017 A +17:35137835 2524669342 A 1335 4018 A +17:35137836 2524669343 A 1335 4019 G +17:35137837 2524669344 G 1335 4020 T +17:35137838 2524669345 T 1335 4021 C +17:35137839 2524669346 C 1335 4022 C +17:35137840 2524669347 C 1335 4023 G +17:35137841 2524669348 G 1335 4024 C +17:35137842 2524669349 C 1335 4025 A +17:35137843 2524669350 A 1335 4026 G +17:35137844 2524669351 G 1335 4027 A +17:35137845 2524669352 A 1335 4028 A +17:35137846 2524669353 A 1335 4029 G +17:35137847 2524669354 G 1335 4030 C +17:35137848 2524669355 C 1335 4031 C +17:35137849 2524669356 C 1335 4032 C +17:35137850 2524669357 C 1335 4033 T +17:35137851 2524669358 T 1335 4034 G +17:35137852 2524669359 G 1335 4035 
A +17:35137853 2524669360 A 1335 4036 T +17:35137854 2524669361 T 1335 4037 G +17:35137855 2524669362 G 1335 4038 T +17:35137856 2524669363 T 1335 4039 G +17:35137857 2524669364 G 1335 4040 T +17:35137858 2524669365 T 1335 4041 C +17:35137859 2524669366 C 1335 4042 C +17:35137860 2524669367 C 1335 4043 T +17:35137861 2524669368 T 1335 4044 C +17:35137862 2524669369 C 1335 4045 A +17:35137863 2524669370 A 1335 4046 G +17:35137864 2524669371 G 1335 4047 G +17:35137865 2524669372 G 1335 4048 G +17:35137866 2524669373 G 1335 4049 A +17:35137867 2524669374 A 1335 4050 G +17:35137868 2524669375 G 1335 4051 C +17:35137869 2524669376 C 1335 4052 A +17:35137870 2524669377 A 1335 4053 G +17:35137871 2524669378 G 1335 4054 G +17:35137872 2524669379 G 1335 4055 G +17:35137873 2524669380 G 1335 4056 A +17:35137874 2524669381 A 1335 4057 A +17:35137875 2524669382 A 1335 4058 G +17:35137876 2524669383 G 1335 4059 G +17:35137877 2524669384 G 1335 4060 C +17:35137878 2524669385 C 1335 4061 C +17:35137879 2524669386 C 1335 4062 T +17:35137880 2524669387 T 1335 4063 G +17:35137881 2524669388 G 1335 4064 A +17:35137882 2524669389 A 1335 4065 C +17:35137883 2524669390 C 1335 4066 T +17:35137884 2524669391 T 1335 4067 T +17:35137885 2524669392 T 1335 4068 C +17:35137886 2524669393 C 1335 4069 T +17:35137887 2524669394 T 1335 4070 G +17:35137888 2524669395 G 1335 4071 C +17:35137889 2524669396 C 1335 4072 T +17:35137890 2524669397 T 1335 4073 G +17:35137891 2524669398 G 1335 4074 G +17:35137892 2524669399 G 1335 4075 C +17:35137893 2524669400 C 1335 4076 A +17:35137894 2524669401 A 1335 4077 T +17:35137895 2524669402 T 1335 4078 C +17:35137896 2524669403 C 1335 4079 A +17:35137897 2524669404 A 1335 4080 A +17:35137898 2524669405 A 1335 4081 G +17:35137899 2524669406 G 1335 4082 A +17:35137900 2524669407 A 1335 4083 G +17:35137901 2524669408 G 1335 4084 G +17:35137902 2524669409 G 1335 4085 T +17:35137903 2524669410 T 1335 4086 G +17:35137904 2524669411 G 1335 4087 G +17:35137905 
2524669412 G 1335 4088 G +17:35137906 2524669413 G 1335 4089 A +17:35137907 2524669414 A 1335 4090 G +17:35137908 2524669415 G 1335 4091 G +17:35137909 2524669416 G 1335 4092 G +17:35137910 2524669417 G 1335 4093 C +17:35137911 2524669418 C 1335 4094 C +17:35137912 2524669419 C 1335 4095 C +17:35137913 2524669420 C 1335 4096 T +17:35137914 2524669421 T 1335 4097 C +17:35137915 2524669422 C 1335 4098 C +17:35137916 2524669423 C 1335 4099 G +17:35137917 2524669424 G 1335 4100 A +17:35137918 2524669425 A 1335 4101 C +17:35137919 2524669426 C 1335 4102 C +17:35137920 2524669427 C 1335 4103 A +17:35137921 2524669428 A 1335 4104 C +17:35137922 2524669429 C 1335 4105 T +17:35137923 2524669430 T 1335 4106 T +17:35137924 2524669431 T 1335 4107 C +17:35137925 2524669432 C 1335 4108 C +17:35137926 2524669433 C 1335 4109 A +17:35137927 2524669434 A 1335 4110 G +17:35137928 2524669435 G 1335 4111 G +17:35137929 2524669436 G 1335 4112 G +17:35137930 2524669437 G 1335 4113 G +17:35137931 2524669438 G 1335 4114 A +17:35137932 2524669439 A 1335 4115 A +17:35137933 2524669440 A 1335 4116 C +17:35137934 2524669441 C 1335 4117 C +17:35137935 2524669442 C 1335 4118 T +17:35137936 2524669443 T 1335 4119 G +17:35137937 2524669444 G 1335 4120 C +17:35137938 2524669445 C 1335 4121 C +17:35137939 2524669446 C 1335 4122 A +17:35137940 2524669447 A 1335 4123 T +17:35137941 2524669448 T 1335 4124 G +17:35137942 2524669449 G 1335 4125 C +17:35137943 2524669450 C 1335 4126 C +17:35137944 2524669451 C 1335 4127 A +17:35137945 2524669452 A 1335 4128 G +17:35137946 2524669453 G 1335 4129 G +17:35137947 2524669454 G 1335 4130 A +17:35137948 2524669455 A 1335 4131 A +17:35137949 2524669456 A 1335 4132 C +17:35137950 2524669457 C 1335 4133 C +17:35137951 2524669458 C 1335 4134 T +17:35137952 2524669459 T 1335 4135 G +17:35137953 2524669460 G 1335 4136 T +17:35137954 2524669461 T 1335 4137 C +17:35137955 2524669462 C 1335 4138 C +17:35137956 2524669463 C 1335 4139 T +17:35137957 2524669464 T 1335 4140 
A +17:35137958 2524669465 A 1335 4141 A +17:35137959 2524669466 A 1335 4142 G +17:35137960 2524669467 G 1335 4143 G +17:35137961 2524669468 G 1335 4144 A +17:35137962 2524669469 A 1335 4145 A +17:35137963 2524669470 A 1335 4146 C +17:35137964 2524669471 C 1335 4147 C +17:35137965 2524669472 C 1335 4148 T +17:35137966 2524669473 T 1335 4149 T +17:35137967 2524669474 T 1335 4150 C +17:35137968 2524669475 C 1335 4151 C +17:35137969 2524669476 C 1335 4152 T +17:35137970 2524669477 T 1335 4153 T +17:35137971 2524669478 T 1335 4154 C +17:35137972 2524669479 C 1335 4155 C +17:35137973 2524669480 C 1335 4156 T +17:35137974 2524669481 T 1335 4157 G +17:35137975 2524669482 G 1335 4158 C +17:35137976 2524669483 C 1335 4159 T +17:35137977 2524669484 T 1335 4160 T +17:35137978 2524669485 T 1335 4161 G +17:35137979 2524669486 G 1335 4162 A +17:35137980 2524669487 A 1335 4163 G +17:35137981 2524669488 G 1335 4164 T +17:35137982 2524669489 T 1335 4165 T +17:35137983 2524669490 T 1335 4166 C +17:35137984 2524669491 C 1335 4167 C +17:35137985 2524669492 C 1335 4168 C +17:35137986 2524669493 C 1335 4169 A +17:35137987 2524669494 A 1335 4170 G +17:35137988 2524669495 G 1335 4171 A +17:35137989 2524669496 A 1335 4172 T +17:35137990 2524669497 T 1335 4173 G +17:35137991 2524669498 G 1335 4174 G +17:35137992 2524669499 G 1335 4175 C +17:35137993 2524669500 C 1335 4176 T +17:35137994 2524669501 T 1335 4177 G +17:35137995 2524669502 G 1335 4178 G +17:35137996 2524669503 G 1335 4179 A +17:35137997 2524669504 A 1335 4180 A +17:35137998 2524669505 A 1335 4181 G +17:35137999 2524669506 G 1335 4182 G +17:35138000 2524669507 G 1335 4183 G +17:35138001 2524669508 G 1335 4184 G +17:35138002 2524669509 G 1335 4185 T +17:35138003 2524669510 T 1335 4186 C +17:35138004 2524669511 C 1335 4187 C +17:35138005 2524669512 C 1335 4188 A +17:35138006 2524669513 A 1335 4189 G +17:35138007 2524669514 G 1335 4190 C +17:35138008 2524669515 C 1335 4191 C +17:35138009 2524669516 C 1335 4192 T +17:35138010 
2524669517 T 1335 4193 C +17:35138011 2524669518 C 1335 4194 G +17:35138012 2524669519 G 1335 4195 T +17:35138013 2524669520 T 1335 4196 T +17:35138014 2524669521 T 1335 4197 G +17:35138015 2524669522 G 1335 4198 G +17:35138016 2524669523 G 1335 4199 A +17:35138017 2524669524 A 1335 4200 A +17:35138018 2524669525 A 1335 4201 G +17:35138019 2524669526 G 1335 4202 A +17:35138020 2524669527 A 1335 4203 G +17:35138021 2524669528 G 1335 4204 G +17:35138022 2524669529 G 1335 4205 A +17:35138023 2524669530 A 1335 4206 A +17:35138024 2524669531 A 1335 4207 C +17:35138025 2524669532 C 1335 4208 A +17:35138026 2524669533 A 1335 4209 G +17:35138027 2524669534 G 1335 4210 C +17:35138028 2524669535 C 1335 4211 A +17:35138029 2524669536 A 1335 4212 C +17:35138030 2524669537 C 1335 4213 T +17:35138031 2524669538 T 1335 4214 G +17:35138032 2524669539 G 1335 4215 G +17:35138033 2524669540 G 1335 4216 G +17:35138034 2524669541 G 1335 4217 G +17:35138035 2524669542 G 1335 4218 A +17:35138036 2524669543 A 1335 4219 G +17:35138037 2524669544 G 1335 4220 T +17:35138038 2524669545 T 1335 4221 C +17:35138039 2524669546 C 1335 4222 T +17:35138040 2524669547 T 1335 4223 T +17:35138041 2524669548 T 1335 4224 T +17:35138042 2524669549 T 1335 4225 G +17:35138043 2524669550 G 1335 4226 T +17:35138044 2524669551 T 1335 4227 G +17:35138045 2524669552 G 1335 4228 G +17:35138046 2524669553 G 1335 4229 A +17:35138047 2524669554 A 1335 4230 T +17:35138048 2524669555 T 1335 4231 T +17:35138049 2524669556 T 1335 4232 C +17:35138050 2524669557 C 1335 4233 T +17:35138051 2524669558 T 1335 4234 G +17:35138052 2524669559 G 1335 4235 A +17:35138053 2524669560 A 1335 4236 G +17:35138054 2524669561 G 1335 4237 G +17:35138055 2524669562 G 1335 4238 C +17:35138056 2524669563 C 1335 4239 C +17:35138057 2524669564 C 1335 4240 C +17:35138058 2524669565 C 1335 4241 T +17:35138059 2524669566 T 1335 4242 G +17:35138060 2524669567 G 1335 4243 C +17:35138061 2524669568 C 1335 4244 C +17:35138062 2524669569 C 1335 4245 
C +17:35138063 2524669570 C 1335 4246 A +17:35138064 2524669571 A 1335 4247 A +17:35138065 2524669572 A 1335 4248 T +17:35138066 2524669573 T 1335 4249 G +17:35138067 2524669574 G 1335 4250 A +17:35138068 2524669575 A 1335 4251 G +17:35138069 2524669576 G 1335 4252 A +17:35138070 2524669577 A 1335 4253 C +17:35138071 2524669578 C 1335 4254 T +17:35138072 2524669579 T 1335 4255 C +17:35138073 2524669580 C 1335 4256 T +17:35138074 2524669581 T 1335 4257 A +17:35138075 2524669582 A 1335 4258 G +17:35138076 2524669583 G 1335 4259 G +17:35138077 2524669584 G 1335 4260 G +17:35138078 2524669585 G 1335 4261 T +17:35138079 2524669586 T 1335 4262 C +17:35138080 2524669587 C 1335 4263 C +17:35138081 2524669588 C 1335 4264 A +17:35138082 2524669589 A 1335 4265 G +17:35138083 2524669590 G 1335 4266 T +17:35138084 2524669591 T 1335 4267 G +17:35138085 2524669592 G 1335 4268 G +17:35138086 2524669593 G 1335 4269 A +17:35138087 2524669594 A 1335 4270 T +17:35138088 2524669595 T 1335 4271 G +17:35138089 2524669596 G 1335 4272 C +17:35138090 2524669597 C 1335 4273 C +17:35138091 2524669598 C 1335 4274 A +17:35138092 2524669599 A 1335 4275 C +17:35138093 2524669600 C 1335 4276 A +17:35138094 2524669601 A 1335 4277 G +17:35138095 2524669602 G 1335 4278 C +17:35138096 2524669603 C 1335 4279 C +17:35138097 2524669604 C 1335 4280 C +17:35138098 2524669605 C 1335 4281 A +17:35138099 2524669606 A 1335 4282 G +17:35138100 2524669607 G 1335 4283 C +17:35138101 2524669608 C 1335 4284 T +17:35138102 2524669609 T 1335 4285 T +17:35138103 2524669610 T 1335 4286 G +17:35138104 2524669611 G 1335 4287 G +17:35138105 2524669612 G 1335 4288 C +17:35138106 2524669613 C 1335 4289 C +17:35138107 2524669614 C 1335 4290 C +17:35138108 2524669615 C 1335 4291 T +17:35138109 2524669616 T 1335 4292 T +17:35138110 2524669617 T 1335 4293 T +17:35138111 2524669618 T 1335 4294 C +17:35138112 2524669619 C 1335 4295 C +17:35138113 2524669620 C 1335 4296 T +17:35138114 2524669621 T 1335 4297 T +17:35138115 
2524669622 T 1335 4298 C +17:35138116 2524669623 C 1335 4299 C +17:35138117 2524669624 C 1335 4300 A +17:35138118 2524669625 A 1335 4301 G +17:35138119 2524669626 G 1335 4302 A +17:35138120 2524669627 A 1335 4303 T +17:35138121 2524669628 T 1335 4304 C +17:35138122 2524669629 C 1335 4305 C +17:35138123 2524669630 C 1335 4306 T +17:35138124 2524669631 T 1335 4307 G +17:35138125 2524669632 G 1335 4308 G +17:35138126 2524669633 G 1335 4309 G +17:35138127 2524669634 G 1335 4310 T +17:35138128 2524669635 T 1335 4311 A +17:35138129 2524669636 A 1335 4312 C +17:35138130 2524669637 C 1335 4313 T +17:35138131 2524669638 T 1335 4314 G +17:35138132 2524669639 G 1335 4315 A +17:35138133 2524669640 A 1335 4316 A +17:35138134 2524669641 A 1335 4317 A +17:35138135 2524669642 A 1335 4318 G +17:35138136 2524669643 G 1335 4319 C +17:35138137 2524669644 C 1335 4320 C +17:35138138 2524669645 C 1335 4321 T +17:35138139 2524669646 T 1335 4322 T +17:35138140 2524669647 T 1335 4323 A +17:35138141 2524669648 A 1335 4324 G +17:35138142 2524669649 G 1335 4325 G +17:35138143 2524669650 G 1335 4326 G +17:35138144 2524669651 G 1335 4327 A +17:35138145 2524669652 A 1335 4328 A +17:35138146 2524669653 A 1335 4329 G +17:35138147 2524669654 G 1335 4330 C +17:35138148 2524669655 C 1335 4331 T +17:35138149 2524669656 T 1335 4332 G +17:35138150 2524669657 G 1335 4333 G +17:35138151 2524669658 G 1335 4334 C +17:35138152 2524669659 C 1335 4335 C +17:35138153 2524669660 C 1335 4336 T +17:35138154 2524669661 T 1335 4337 G +17:35138155 2524669662 G 1335 4338 A +17:35138156 2524669663 A 1335 4339 G +17:35138157 2524669664 G 1335 4340 A +17:35138158 2524669665 A 1335 4341 G +17:35138159 2524669666 G 1335 4342 G +17:35138160 2524669667 G 1335 4343 G +17:35138161 2524669668 G 1335 4344 G +17:35138162 2524669669 G 1335 4345 A +17:35138163 2524669670 A 1335 4346 A +17:35138164 2524669671 A 1335 4347 G +17:35138165 2524669672 G 1335 4348 C +17:35138166 2524669673 C 1335 4349 G +17:35138167 2524669674 G 1335 4350 
G +17:35138168 2524669675 G 1335 4351 C +17:35138169 2524669676 C 1335 4352 C +17:35138170 2524669677 C 1335 4353 C +17:35138171 2524669678 C 1335 4354 T +17:35138172 2524669679 T 1335 4355 A +17:35138173 2524669680 A 1335 4356 A +17:35138174 2524669681 A 1335 4357 G +17:35138175 2524669682 G 1335 4358 G +17:35138176 2524669683 G 1335 4359 G +17:35138177 2524669684 G 1335 4360 A +17:35138178 2524669685 A 1335 4361 G +17:35138179 2524669686 G 1335 4362 T +17:35138180 2524669687 T 1335 4363 G +17:35138181 2524669688 G 1335 4364 T +17:35138182 2524669689 T 1335 4365 C +17:35138183 2524669690 C 1335 4366 T +17:35138184 2524669691 T 1335 4367 A +17:35138185 2524669692 A 1335 4368 A +17:35138186 2524669693 A 1335 4369 G +17:35138187 2524669694 G 1335 4370 A +17:35138188 2524669695 A 1335 4371 A +17:35138189 2524669696 A 1335 4372 C +17:35138190 2524669697 C 1335 4373 A +17:35138191 2524669698 A 1335 4374 A +17:35138192 2524669699 A 1335 4375 A +17:35138193 2524669700 A 1335 4376 A +17:35138194 2524669701 A 1335 4377 G +17:35138195 2524669702 G 1335 4378 C +17:35138196 2524669703 C 1335 4379 G +17:35138197 2524669704 G 1335 4380 A +17:35138198 2524669705 A 1335 4381 C +17:35138199 2524669706 C 1335 4382 C +17:35138200 2524669707 C 1335 4383 C +17:35138201 2524669708 C 1335 4384 A +17:35138202 2524669709 A 1335 4385 T +17:35138203 2524669710 T 1335 4386 T +17:35138204 2524669711 T 1335 4387 C +17:35138205 2524669712 C 1335 4388 A +17:35138206 2524669713 A 1335 4389 G +17:35138207 2524669714 G 1335 4390 A +17:35138208 2524669715 A 1335 4391 G +17:35138209 2524669716 G 1335 4392 A +17:35138210 2524669717 A 1335 4393 C +17:35138211 2524669718 C 1335 4394 T +17:35138212 2524669719 T 1335 4395 G +17:35138213 2524669720 G 1335 4396 T +17:35138214 2524669721 T 1335 4397 C +17:35138215 2524669722 C 1335 4398 C +17:35138216 2524669723 C 1335 4399 C +17:35138217 2524669724 C 1335 4400 T +17:35138218 2524669725 T 1335 4401 G +17:35138219 2524669726 G 1335 4402 A +17:35138220 
2524669727 A 1335 4403 A +17:35138221 2524669728 A 1335 4404 A +17:35138222 2524669729 A 1335 4405 C +17:35138223 2524669730 C 1335 4406 C +17:35138224 2524669731 C 1335 4407 T +17:35138225 2524669732 T 1335 4408 A +17:35138226 2524669733 A 1335 4409 G +17:35138227 2524669734 G 1335 4410 T +17:35138228 2524669735 T 1335 4411 A +17:35138229 2524669736 A 1335 4412 C +17:35138230 2524669737 C 1335 4413 T +17:35138231 2524669738 T 1335 4414 G +17:35138232 2524669739 G 1335 4415 C +17:35138233 2524669740 C 1335 4416 C +17:35138234 2524669741 C 1335 4417 C +17:35138235 2524669742 C 1335 4418 C +17:35138236 2524669743 C 1335 4419 C +17:35138237 2524669744 C 1335 4420 C +17:35138238 2524669745 C 1335 4421 A +17:35138239 2524669746 A 1335 4422 T +17:35138240 2524669747 T 1335 4423 G +17:35138241 2524669748 G 1335 4424 A +17:35138242 2524669749 A 1335 4425 G +17:35138243 2524669750 G 1335 4426 G +17:35138244 2524669751 G 1335 4427 A +17:35138245 2524669752 A 1335 4428 A +17:35138246 2524669753 A 1335 4429 G +17:35138247 2524669754 G 1335 4430 G +17:35138248 2524669755 G 1335 4431 A +17:35138249 2524669756 A 1335 4432 A +17:35138250 2524669757 A 1335 4433 C +17:35138251 2524669758 C 1335 4434 A +17:35138252 2524669759 A 1335 4435 G +17:35138253 2524669760 G 1335 4436 C +17:35138254 2524669761 C 1335 4437 A +17:35138255 2524669762 A 1335 4438 A +17:35138256 2524669763 A 1335 4439 T +17:35138257 2524669764 T 1335 4440 G +17:35138258 2524669765 G 1335 4441 G +17:35138259 2524669766 G 1335 4442 T +17:35138260 2524669767 T 1335 4443 G +17:35138261 2524669768 G 1335 4444 T +17:35138262 2524669769 T 1335 4445 C +17:35138263 2524669770 C 1335 4446 A +17:35138264 2524669771 A 1335 4447 G +17:35138265 2524669772 G 1335 4448 T +17:35138266 2524669773 T 1335 4449 A +17:35138267 2524669774 A 1335 4450 T +17:35138268 2524669775 T 1335 4451 C +17:35138269 2524669776 C 1335 4452 C +17:35138270 2524669777 C 1335 4453 A +17:35138271 2524669778 A 1335 4454 G +17:35138272 2524669779 G 1335 4455 
G +17:35138273 2524669780 G 1335 4456 C +17:35138274 2524669781 C 1335 4457 T +17:35138275 2524669782 T 1335 4458 T +17:35138276 2524669783 T 1335 4459 T +17:35138277 2524669784 T 1335 4460 G +17:35138278 2524669785 G 1335 4461 T +17:35138279 2524669786 T 1335 4462 A +17:35138280 2524669787 A 1335 4463 C +17:35138281 2524669788 C 1335 4464 A +17:35138282 2524669789 A 1335 4465 G +17:35138283 2524669790 G 1335 4466 A +17:35138284 2524669791 A 1335 4467 G +17:35138285 2524669792 G 1335 4468 T +17:35138286 2524669793 T 1335 4469 G +17:35138287 2524669794 G 1335 4470 C +17:35138288 2524669795 C 1335 4471 T +17:35138289 2524669796 T 1335 4472 T +17:35138290 2524669797 T 1335 4473 T +17:35138291 2524669798 T 1335 4474 T +17:35138292 2524669799 T 1335 4475 C +17:35138293 2524669800 C 1335 4476 T +17:35138294 2524669801 T 1335 4477 G +17:35138295 2524669802 G 1335 4478 T +17:35138296 2524669803 T 1335 4479 T +17:35138297 2524669804 T 1335 4480 T +17:35138298 2524669805 T 1335 4481 A +17:35138299 2524669806 A 1335 4482 G +17:35138300 2524669807 G 1335 4483 T +17:35138301 2524669808 T 1335 4484 T +17:35138302 2524669809 T 1335 4485 T +17:35138303 2524669810 T 1335 4486 T +17:35138304 2524669811 T 1335 4487 T +17:35138305 2524669812 T 1335 4488 A +17:35138306 2524669813 A 1335 4489 C +17:35138307 2524669814 C 1335 4490 T +17:35138308 2524669815 T 1335 4491 T +17:35138309 2524669816 T 1335 4492 T +17:35138310 2524669817 T 1335 4493 T +17:35138311 2524669818 T 1335 4494 T +17:35138312 2524669819 T 1335 4495 T +17:35138313 2524669820 T 1335 4496 T +17:35138314 2524669821 T 1335 4497 G +17:35138315 2524669822 G 1335 4498 T +17:35138316 2524669823 T 1335 4499 T +17:35138317 2524669824 T 1335 4500 T +17:35138318 2524669825 T 1335 4501 T +17:35138319 2524669826 T 1335 4502 G +17:35138320 2524669827 G 1335 4503 T +17:35138321 2524669828 T 1335 4504 T +17:35138322 2524669829 T 1335 4505 T +17:35138323 2524669830 T 1335 4506 T +17:35138324 2524669831 T 1335 4507 T +17:35138325 
2524669832 T 1335 4508 T +17:35138326 2524669833 T 1335 4509 T +17:35138327 2524669834 T 1335 4510 A +17:35138328 2524669835 A 1335 4511 A +17:35138329 2524669836 A 1335 4512 A +17:35138330 2524669837 A 1335 4513 G +17:35138331 2524669838 G 1335 4514 A +17:35138332 2524669839 A 1335 4515 T +17:35138333 2524669840 T 1335 4516 G +17:35138334 2524669841 G 1335 4517 A +17:35138335 2524669842 A 1335 4518 A +17:35138336 2524669843 A 1335 4519 A +17:35138337 2524669844 A 1335 4520 T +17:35138338 2524669845 T 1335 4521 A +17:35138339 2524669846 A 1335 4522 A +17:35138340 2524669847 A 1335 4523 A +17:35138341 2524669848 A 1335 4524 G +17:35138342 2524669849 G 1335 4525 A +17:35138343 2524669850 A 1335 4526 C +17:35138344 2524669851 C 1335 4527 C +17:35138345 2524669852 C 1335 4528 C +17:35138346 2524669853 C 1335 4529 A +17:35138347 2524669854 A 1335 4530 G +17:35138348 2524669855 G 1335 4531 G +17:35138349 2524669856 G 1335 4532 G +17:35138350 2524669857 G 1335 4533 G +17:35138351 2524669858 G 1335 4534 G +17:35138352 2524669859 G 1335 4535 A +17:35138353 2524669860 A 1335 4536 G +17:35138354 2524669861 G 1335 4537 A +17:35138355 2524669862 A 1335 4538 A +17:35138356 2524669863 A 1335 4539 T +17:35138357 2524669864 T 1335 4540 G +17:35138358 2524669865 G 1335 4541 G +17:35138359 2524669866 G 1335 4542 G +17:35138360 2524669867 G 1335 4543 T +17:35138361 2524669868 T 1335 4544 G +17:35138362 2524669869 G 1335 4545 T +17:35138363 2524669870 T 1335 4546 T +17:35138364 2524669871 T 1335 4547 G +17:35138365 2524669872 G 1335 4548 T +17:35138366 2524669873 T 1335 4549 A +17:35138367 2524669874 A 1335 4550 T +17:35138368 2524669875 T 1335 4551 G +17:35138369 2524669876 G 1335 4552 G +17:35138370 2524669877 G 1335 4553 G +17:35138371 2524669878 G 1335 4554 G +17:35138372 2524669879 G 1335 4555 A +17:35138373 2524669880 A 1335 4556 G +17:35138374 2524669881 G 1335 4557 G +17:35138375 2524669882 G 1335 4558 C +17:35138376 2524669883 C 1335 4559 A +17:35138377 2524669884 A 1335 4560 
A +17:35138378 2524669885 A 1335 4561 G +17:35138379 2524669886 G 1335 4562 T +17:35138380 2524669887 T 1335 4563 G +17:35138381 2524669888 G 1335 4564 T +17:35138382 2524669889 T 1335 4565 G +17:35138383 2524669890 G 1335 4566 G +17:35138384 2524669891 G 1335 4567 G +17:35138385 2524669892 G 1335 4568 G +17:35138386 2524669893 G 1335 4569 G +17:35138387 2524669894 G 1335 4570 G +17:35138388 2524669895 G 1335 4571 T +17:35138389 2524669896 T 1335 4572 C +17:35138390 2524669897 C 1335 4573 C +17:35138391 2524669898 C 1335 4574 T +17:35138392 2524669899 T 1335 4575 T +17:35138393 2524669900 T 1335 4576 C +17:35138394 2524669901 C 1335 4577 T +17:35138395 2524669902 T 1335 4578 C +17:35138396 2524669903 C 1335 4579 C +17:35138397 2524669904 C 1335 4580 A +17:35138398 2524669905 A 1335 4581 C +17:35138399 2524669906 C 1335 4582 A +17:35138400 2524669907 A 1335 4583 C +17:35138401 2524669908 C 1335 4584 C +17:35138402 2524669909 C 1335 4585 C +17:35138403 2524669910 C 1335 4586 A +17:35138404 2524669911 A 1335 4587 C +17:35138405 2524669912 C 1335 4588 T +17:35138406 2524669913 T 1335 4589 T +17:35138407 2524669914 T 1335 4590 T +17:35138408 2524669915 T 1335 4591 G +17:35138409 2524669916 G 1335 4592 T +17:35138410 2524669917 T 1335 4593 C +17:35138411 2524669918 C 1335 4594 C +17:35138412 2524669919 C 1335 4595 A +17:35138413 2524669920 A 1335 4596 T +17:35138414 2524669921 T 1335 4597 T +17:35138415 2524669922 T 1335 4598 T +17:35138416 2524669923 T 1335 4599 G +17:35138417 2524669924 G 1335 4600 C +17:35138418 2524669925 C 1335 4601 A +17:35138419 2524669926 A 1335 4602 A +17:35138420 2524669927 A 1335 4603 A +17:35138421 2524669928 A 1335 4604 T +17:35138422 2524669929 T 1335 4605 A +17:35138423 2524669930 A 1335 4606 T +17:35138424 2524669931 T 1335 4607 A +17:35138425 2524669932 A 1335 4608 T +17:35138426 2524669933 T 1335 4609 T +17:35138427 2524669934 T 1335 4610 T +17:35138428 2524669935 T 1335 4611 T +17:35138429 2524669936 T 1335 4612 G +17:35138430 
2524669937 G 1335 4613 G +17:35138431 2524669938 G 1335 4614 A +17:35138432 2524669939 A 1335 4615 A +17:35138433 2524669940 A 1335 4616 A +17:35138434 2524669941 A 1335 4617 A +17:35138435 2524669942 A 1335 4618 C +17:35138436 2524669943 C 1335 4619 A +17:35138437 2524669944 A 1335 4620 G +17:35138438 2524669945 G 1335 4621 C +17:35138439 2524669946 C 1335 4622 T +17:35138440 2524669947 T 1335 4623 A +17:35138441 2524669948 A hmmpfam.out100644000766000024 367114006427353 16561 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/datahmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: pfam Sequence file: /home/birney/src/wise2/example/road.pep - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: roa1_drome Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- SEED 146.1 6.3e-40 2 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- SEED 1/2 33 103 .. 1 77 [] 71.2 2.2e-17 SEED 2/2 124 193 .. 
1 77 [] 75.5 1.1e-18 Alignments of top-scoring domains: SEED: domain 1 of 2, from 33 to 103: score 71.2, E = 2.2e-17 *->lfVgNLppdvteedLkdlFskfGpivsikivrDiiekpketgkskGf lf+g+L + +t+e Lk++F+k G iv++ +++D + t++s+Gf roa1_drome 33 LFIGGLDYRTTDENLKAHFEKWGNIVDVVVMKD-----PRTKRSRGF 74 aFVeFeseedAekAlealnG.kelggrklrv<-* +F+++ ++ + A + +++++gr+++ roa1_drome 75 GFITYSHSSMIDEAQK--SRpHKIDGRVVEP 103 SEED: domain 2 of 2, from 124 to 193: score 75.5, E = 1.1e-18 *->lfVgNLppdvteedLkdlFskfGpivsikivrDiiekpketgkskGf lfVg L d +e+ ++d+F++fG iv+i+iv+D ketgk +Gf roa1_drome 124 LFVGALKDDHDEQSIRDYFQHFGNIVDINIVID-----KETGKKRGF 165 aFVeFeseedAekAlealnGkelggrklrv<-* aFVeF++++ ++k + ++l+g+ + v roa1_drome 166 AFVEFDDYDPVDKVVL--KQHQLNGKMVDV 193 // MSGEFTUA.gb100755000766000024 1225314006427353 16147 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS MSGEFTUA 1500 bp DNA linear BCT 18-JUN-1999 DEFINITION Mycobacterium leprae Tuf gene for elongation factor Tu, complete cds. ACCESSION D13869 VERSION D13869.1 GI:434782 KEYWORDS EF-Tu; elongation factor Tu. SOURCE Mycobacterium leprae ORGANISM Mycobacterium leprae Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales; Corynebacterineae; Mycobacteriaceae; Mycobacterium. REFERENCE 1 (bases 1 to 1500) AUTHORS Dhandayuthapani,S., Banu,M.J. and Kashiwabara,Y. TITLE Cloning and sequence determination of the gene coding for the elongation factor Tu of Mycobacterium leprae JOURNAL J. Biochem. 115 (4), 664-669 (1994) PUBMED 8089081 COMMENT Submitted (07-DEC-1992) to DDBJ by: Subranamian Dhandayuthapani National Institute for Leprosy Research 4-2-1 Aobacho, Higashimurayama-shi Tokyo 189 Japan Phone: 0423-91-8211 Fax: 0423-94-9092. 
FEATURES Location/Qualifiers source 1..1500 /organism="Mycobacterium leprae" /mol_type="genomic DNA" /isolate="Thai 53" /db_xref="taxon:1769" /clone_lib="lambda gt11" repeat_region 20..71 /rpt_type=direct /rpt_unit="20..29" /rpt_unit="62..71" repeat_region 90..231 /rpt_type=direct /rpt_unit="90..99" /rpt_unit="222..231" RBS 205..210 /standard_name="Shine-Dalgarno sequence" gene 219..1409 /gene="Tuf" CDS 219..1409 /gene="Tuf" /function="peptide chain elongtion during protein synthesis" /standard_name="EF-Tu" /note="start codon gtg" /codon_start=1 /evidence=not_experimental /transl_table=11 /product="elongation factor Tu" /protein_id="BAA02982.2" /db_xref="GI:5103687" /translation="MAKAKFERTKPHVNIGTIGHVDHGKTTLTAAITKVLHDKFPNLN ESRAFDQIDNAPEERQRGITINISHVEYQTEKRHYAHVDAPGHADYIKNMITGAAQMD GAILVVAATDGPMPQTREHVLLARQVGVPYILVALNKSDAVDDEELLELVEMEVRELL AAQEFDEDAPVVRVSALKALEGDAKWVESVTQLMDAVDESIPAPVRETDKPFLMPVED VFTITGRGTVVTGRVERGVVNVNEEVEIVGIRQTTTKTTVTGVEMFRKLLDQGQAGDN VGLLLRGIKREDVERGQVVIKPGTTTPHTEFEGQVYILSKDEGGRHTPFFNNYRPQFY FRTTDVTGVVTLPEGTEMVMPGDNTNISVTLIQPVAMDEGLRFAIREGGPTVGAGRVV KIIK" repeat_region 465..729 /rpt_type=direct /rpt_unit="465..474" /rpt_unit="720..729" repeat_region 883..913 /rpt_type=direct /rpt_unit="883..892" /rpt_unit="904..913" repeat_region 1169..1369 /rpt_type=direct /rpt_unit="1169..1180" /rpt_unit="1358..1369" ORIGIN 1 gctcggtcca agacgcaagg ccggcgaact actccatggt gttcaactcg tactccgaag 61 tgccggcgaa cgtgtcgaag gagatcatcg cgaaggcgac gggcgagtag gagttgagtg 121 gtcggaaaag cctacgagtt atgctgacgc ggtcactacc gcggataaat tactaacatt 181 gctttttaca agcaccaata gtccaggagg acacagaagt ggcgaaggcg aagttcgagc 241 ggacgaagcc gcacgtcaac atcgggacca tcggtcacgt tgaccacggc aagaccacat 301 tgaccgcggc tattaccaag gtcctgcatg acaagttccc caacctgaat gagtcgcgcg 361 cctttgacca gattgacaac gcgcccgagg agcgtcagcg cggtatcacc atcaacattt 421 cccacgtgga gtatcagacc gagaagcgtc actatgctca cgtcgacgcc ccggggcacg 481 ccgactacat taagaacatg atcaccggtg cggcccagat ggatggtgcg attctggtgg 541 tcgctgctac 
agacggcccg atgccgcaga ctcgcgagca cgtgctgctc gctcgtcagg 601 tgggtgtacc ttacatcctg gtcgcactta acaagtccga cgccgtggac gacgaggaac 661 tactcgagct tgtcgagatg gaagtccgtg agttgctggc tgcccaggaa ttcgacgagg 721 acgccccggt tgtgcgtgtc tcggcattga aggcgctcga gggtgacgcc aagtgggtcg 781 agtctgtcac acagttgatg gacgctgtcg acgagtcgat cccggctcca gtccgcgaga 841 ctgacaagcc gttcctgatg cccgttgagg acgtcttcac tatcaccggt cgtggcaccg 901 tggtcaccgg tcgggtggag cgcggcgtgg tcaacgtgaa cgaggaagtt gagatcgtcg 961 gcattcgtca gacgaccacc aagaccaccg tcacgggtgt ggaaatgttt cgtaagctgc 1021 tcgaccaagg tcaggccggt gataacgttg gtctgttgtt gcgtggcatc aagcgcgagg 1081 acgtcgagcg tggtcaggtg gtcatcaaac ccggcaccac caccccgcat accgagttcg 1141 aaggccaagt atacatcctg tccaaggacg aaggtggtcg gcacacaccg ttcttcaaca 1201 actaccgtcc gcagttctac ttccgtacca ctgatgtgac cggtgtggtg actttgccgg 1261 agggcaccga gatggtgatg ccaggtgaca acaccaacat ctcggtgacg ttgattcagc 1321 ccgttgccat ggacgagggt ttgcgcttcg cgatccgcga aggtggtccg actgttggtg 1381 ccggccgggt cgtcaagatc atcaagtaag tcgcatcaac gggccaggtg gttacaacgg 1441 tccacatttg cgaaacggcg ctcatccgaa aggataggcg cgtttctttt gagtttatgc // AY763288.gb100644000766000024 646114006427353 15730 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data LOCUS AY763288 723 bp DNA linear INV 18-JAN-2006 DEFINITION Leishmania guyanensis putative elongation factor 1 beta gene, complete cds. ACCESSION AY763288 VERSION AY763288.1 GI:54300415 KEYWORDS . SOURCE Leishmania guyanensis ORGANISM Leishmania guyanensis Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; Leishmania; Leishmania guyanensis species complex. REFERENCE 1 (bases 1 to 723) AUTHORS Walker,J., Acestor,N., Gongora,R., Quadroni,M., Segura,I., Fasel,N. and Saravia,N.G. TITLE Comparative protein profiling identifies elongation factor-1beta and tryparedoxin peroxidase as factors associated with metastasis in Leishmania guyanensis JOURNAL Mol. Biochem. Parasitol. 
145 (2), 254-264 (2006) PUBMED 16325936 REFERENCE 2 (bases 1 to 723) AUTHORS Vergel,C., Gongora,R.E., Saravia,N. and Walker,J. TITLE Direct Submission JOURNAL Submitted (24-SEP-2004) Biochemistry and Molecular Biology, Centro Internacional de Entrenamiento e Investigaciones Medicas, Av 1a Norte No. 3-03, Cali, Valle, Colombia FEATURES Location/Qualifiers source 1..723 /organism="Leishmania guyanensis" /mol_type="genomic DNA" /db_xref="taxon:5670" /clone="WHI/BT/78/M5313" /dev_stage="promastigote" /note="subgenus Viannia" mRNA <1..>723 /product="putative elongation factor 1 beta" CDS 1..723 /note="similar to elongation factor 1 beta of Leishmania major (Friedlin)" /codon_start=1 /product="putative elongation factor 1 beta" /protein_id="AAV32818.1" /db_xref="GI:54300416" /translation="MSVKDVSKKAAELEARLGGKLFLGGAKPTAEDVRMLNDLLGANH ASLYRWVKNMATYTEGERKAWGAPVRTAAPELRMPAPAAAAPAAAKKPVPAAAAPAAA KKPAPAPKAVAPAEDDDIGLFGETTEEEQAALEAKRAKDAEKKKAKKDVIAKSPILFD IKAWDDTVDLEALAQKLHAIQRDGLVWGDHKLAPVAFGVKKLQQLVVIEDDKVSGDDL EEMIMGFEDEVQSIDIVAWNKI" misc_feature 379..381 /note="putative; phosphorylation site" ORIGIN 1 atgtctgtga aggacgtgag caagaaggcc gccgagctgg aggcgaggct gggcggcaag 61 ctgttcctgg gcggcgcgaa gccgacggcg gaggacgtga ggatgctcaa cgacctgctc 121 ggcgcgaacc acgcgagcct gtaccggtgg gtgaagaaca tggcgaccta cacggagggc 181 gagcgcaagg cgtggggcgc accggtgcgc actgctgcgc cggagctgcg catgcccgcg 241 cctgccgcgg cggcgcctgc tgccgctaag aagcccgtgc ctgccgcggc ggcgcctgct 301 gccgctaaga agcccgcgcc tgcgccgaag gctgttgccc ctgcagagga cgacgacatc 361 ggcctgttcg gcgagacgac ggaggaggag caggcggcgc tggaggcgaa gagggcaaag 421 gacgcggaga agaagaaggc gaagaaggac gtgattgcga agtcgcccat cctgttcgac 481 atcaaggcgt gggacgacac ggtggacctg gaggcgctcg cgcagaagct gcacgcgatt 541 cagcgcgacg gcctggtgtg gggtgaccac aagctggcgc ccgttgcgtt cggcgtgaag 601 aagctgcagc agctggtcgt gatcgaggat gacaaagtgt ctggcgacga cctggaggag 661 atgatcatgg gcttcgagga tgaggtgcag tcgattgata tcgtcgcctg gaacaagatc 721 tga // U83300.bsml100644000766000024 
1033414006427353 16076 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data Moore W.S., DeFilippis V.R. The window of taxonomic resolution for phylogenies based on mitochondrial cytochrome b (in) Mindell D.R. (eds.). AVIAN MOLECULAR EVOLUTION AND SYSTEMATICS:81-116. Academic Press, Inc., San Diego, CA, USA (1997) Moore W.S., DeFilippis V.R. Submitted (27-DEC-1996) to the EMBL/GenBank/DDBJ databases. Biological Sciences, Wayne State University, Biological Sciences Building, Detroit, MI 48202, USA aantttggatctctcctaggcatttgcctaataacacaaattgtcacagg cctcctgcttgccacccactacactgccgacacaaccctagccttttctt ccgtcgcccatacatgccgcaacgttcaatacggctgactaatccgtaac ctccatgccaacggggcctcattctttttcatctgcatctacttacacat cggacgtggattctactacggatcctacttatttaaagaaacttgaaaca cgggagtcatccttctcctcaccctcatagccaccgccttcgtcggctac gtcctcccctgaggacaaatatcatcctgaggagcaaccgttattacaaa tttattctcagccctcccctacgtaggacaaactatcgtcgaatgagcct gaggaggattctctgtagacaaccccactctcacccgattcttcgnccta cactttctcctcccattcttaattgnaggactcaccctaattcacttcac tttcctccacgaatccggntcgaacaatcccctcggaatcgtatccgaca gngataaaatcccctttcanccctacttctccntaaaagatatcctagga ttcatattcatactcctccccctcgtnnccctagcnctattctcacctaa cctcctaggagaccnggaaaatttnacgcccgcaaaccccctagtnacac ccccccacatcaaaccagaatggtacttcctatttgcatatgctatccta cgctcaatccccaataaactaggaggagtcctagccctagctgcctcagt cctaattctattcctagcccccctccttcatacatccaaacaacgcacga tagccttccgacccttttcccaactcctattctgaatactagtcgccaac ctcctcatcctcacctgaatcggnagncaaccagnagaacatccct ATF14F8.gbk100644000766000024 54051214006427353 16105 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS ATF14F8 96892 bp DNA PLN 07-AUG-2000 DEFINITION Arabidopsis thaliana DNA chromosome 5, BAC clone F14F8 (ESSA project). ACCESSION AL391144 VERSION AL391144.1 GI:9755607 KEYWORDS . SOURCE thale cress. ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. 
REFERENCE 1 (bases 1 to 96892) AUTHORS Sato,S., Nakamura,Y., Kaneko,T., Kato,T., Asamizu,E., Kotani,H., Tabata,S., Mewes,H.W., Rudd,S., Lemcke,K. and Mayer,K.F.X. JOURNAL Unpublished REFERENCE 2 (bases 1 to 96892) AUTHORS EU Arabidopsis sequencing,project. TITLE Direct Submission JOURNAL Submitted (07-AUG-2000) MIPS, at the Max-Planck-Institut fuer Biochemie, Am Klopferspitz 18a, D-82152 Martinsried, FRG, E-mail: lemcke@mips.biochem.mpg.de,mayer@mips.biochem.mpg.de COMMENT Information on performance of analysis and a more detailed annotation of this entry and other sequences of chromosomes 3, 4 and 5 can be viewed at: http://www.mips.biochem.mpg.de/proj/thal/. FEATURES Location/Qualifiers source 1..96892 /organism="Arabidopsis thaliana" /variety="Columbia" /db_xref="taxon:3702" /chromosome="5" exon 626..652 /gene="F14F8_10" /number=1 gene 626..2147 /gene="F14F8_10" CDS join(626..652,743..1199,1277..1424,1504..1781,1870..2147) /gene="F14F8_10" /note="strong similarity to putative phytochelatin synthetase - Arabidopsis thaliana, EMBL:AJ006787" /codon_start=1 /product="putative phytochelatin synthetase" /protein_id="CAC01762.1" /db_xref="GI:9755608" /translation="MSWTADGYVATVTMNNFQIYRHIQNPGWTLGWTWAKKEVIWSMV GAQTTEQGDCSKFKGNVPHCCKKTPTVVDLLPGVPYNQQFSNCCKGGVIGAWGQDPSA AVSQFQVSAGLAGTTNKTVKLPKNFTLLGPGPGYTCGPAKIVPSTVFLTTDKRRKTQA LMTWNVTCTYSQFLARKHPSCCVSFSSFYNDTITPCPSCACGCENKKSCVKADSKILT KKGLNTPKKDNTPLLQCTHHMCPVRVHWHVKTNYKDYWRVKIAITNFNYRMNHTLWTL AIQHPNLNNVTQVFSFDYKPVSPYGSINDTGMFYGTKFYNDLLMEAGPSGNVQSEVLL QKDQKTFTFKQGWAFPRKVYFNGDECMLPPPDSYPFLPNSAQGNFASFSLTILLLLFI SIW" intron 653..742 /gene="F14F8_10" /number=1 exon 743..1199 /gene="F14F8_10" /number=2 intron 1200..1276 /gene="F14F8_10" /number=2 exon 1277..1424 /gene="F14F8_10" /number=3 intron 1425..1503 /gene="F14F8_10" /number=3 exon 1504..1781 /gene="F14F8_10" /number=4 intron 1782..1869 /gene="F14F8_10" /number=4 exon 1870..2147 /gene="F14F8_10" /number=5 gene 3192..5279 /gene="F14F8_20" CDS 
join(3192..3253,3824..4205,4309..4524,4578..4796, 4969..5125,5254..5279) /gene="F14F8_20" /note="similarity to Mitochondrial carrier protein - Ribes nigrum, EMBL:AJ007580 Contains Mitochondrial energy transfer proteins signature AA284-293" /codon_start=1 /product="putative mitochondrial carrier protein" /protein_id="CAC01763.1" /db_xref="GI:9755609" /translation="MDTPPTSRIASFGQTEINWDKLDKRRFYINGAGLFTGVTVALYP VSVVKTRLQVASKEIAERSAFSVVKGILKNDGVPGLYRGFGTVITGAVPARIIFLTAL ETTKISAFKLVAPLELSEPTQAAIANGIAGMTASLFSQAVFVPIDVVSQKLMVQGYSG HATYTGGIDVATKIIKSYGVRGLYRGFGLSVMTYSPSSAAWWASYGSSQRVIWRLAMN VLSFLEFGFATKATIPLIQYLLLLGRFLGYGGDSDATAAPSKSKIVMVQAAGGIIAGA TASSITTPLDTIKTRLQVMGHQENRPSAKQVVKKLLAEDGWKGFYRGLGPRFFSMSAW GTSMILTYEYLKRLCAIED" exon 3192..3253 /gene="F14F8_20" /number=1 intron 3254..3823 /gene="F14F8_20" /number=1 exon 3824..4205 /gene="F14F8_20" /number=2 intron 4206..4308 /gene="F14F8_20" /number=2 exon 4309..4524 /gene="F14F8_20" /number=3 intron 4525..4577 /gene="F14F8_20" /number=3 exon 4578..4796 /gene="F14F8_20" /number=4 intron 4797..4968 /gene="F14F8_20" /number=4 exon 4969..5125 /gene="F14F8_20" /number=5 intron 5126..5253 /gene="F14F8_20" /number=5 exon 5254..5279 /gene="F14F8_20" /number=6 gene 7805..9695 /gene="F14F8_30" CDS join(7805..8146,8760..9010,9099..9267,9375..9695) /gene="F14F8_30" /codon_start=1 /product="reversibly glycosylated polypeptide-3" /protein_id="CAC01764.1" /db_xref="GI:9755610" /translation="MVEPANTVGLPVNPTPLLKDELDIVIPTIRNLDFLEMWRPFLQP YHLIIVQDGDPSKKIHVPEGYDYELYNRNDINRILGPKASCISFKDSACRCFGYMVSK KKYIFTIDDDCFVAKDPSGKAVNALEQHIKNLLCPSSPFFFNTLYDPYREGADFVRGY PFSLREGVSTAVSHGLWLNIPDYDAPTQLVKPKERNTRYVDAVMTIPKGTLFPMCGMN LAFDRDLIGPAMYFGLMGDGQPIGRYDDMWAGWCIKVICDHLSLGVKTGLPYIYHSKA SNPFVNLKKEYKGIFWQEEIIPFFQNAKLSKEAVTVQQCYIELSKMVKEKLSSLDPYF DKLADAMVTWIEAWDELNPPAASGKA" exon 7805..8146 /gene="F14F8_30" /number=1 intron 8147..8759 /gene="F14F8_30" /number=1 exon 8760..9010 /gene="F14F8_30" /number=2 intron 9011..9098 /gene="F14F8_30" /number=2 exon 9099..9267 
/gene="F14F8_30" /number=3 intron 9268..9374 /gene="F14F8_30" /number=3 exon 9375..9695 /gene="F14F8_30" /number=4 exon 11254..12399 /gene="F14F8_40" /number=1 gene 11254..12643 /gene="F14F8_40" CDS join(11254..12399,12473..12643) /gene="F14F8_40" /note="similarity to several hypothetical proteins - Arabidopsis thaliana" /codon_start=1 /product="putative protein" /protein_id="CAC01765.1" /db_xref="GI:9755611" /translation="MRRRSKKIKTENNSNPETSEERNKFDEIPHDLVIEILERLPLKS VARFLTVSKLWATTIRSPDFRKSYRGGSSSEPRTLIVSDLNFKEPNPKLHFFRPSISS PSFLSSLTCPFTYPRHEEYYYHHVNGLISVGYGTDQIVINPTTGKFITLPRPKTRRKL VISFFGYDSVSDQYKVLCMTERLRGHPEEASSQHQVYTLGAKQKSWKMINCSIPHRPW SWNAVCINGVVYYIAKTGEGMFRRCLMRFDLKSDNLDLCTILPEEIQTSLHDYFLINY KGKVAIPNQPNFYTYDVWVMNQEGGKIEWLKNITFTIKPRKGFVRYLFVTGTTHTGEF ILAPTSYTDEFYVFHYNPDMNSFRKIRVQAPGVKFSFAQKASVVFSDHSLLRLDNLHI RGSTHTATGEFILAPRFYSDDLNVIHFNPDTNSFRSTKVEVYEDYE" intron 12400..12472 /gene="F14F8_40" /number=1 exon 12473..12643 /gene="F14F8_40" /number=2 CDS join(15197..15443,15612..16009) /gene="F14F8_50" /note="similarity to several hypothetical proteins - Arabidopsis thaliana" /codon_start=1 /product="putative protein" /protein_id="CAC01766.1" /db_xref="GI:9755612" /translation="MMRRRNKKTKTVISNPETLEERNKFDEIPHDLVIEILGRLPAKS VARFLTVSKLWATSIRSLDFIKSYPLGSSSKPRTLVASKQVVANPSTGRTIPLPRVKT RRTIATSFFGYDSVSDQYKVLCMTVKAYGDLRDESSQHQVFTLGAKKKSFRMIDTSII PHRPCSNGVCIDSVVYYVAKTGAGMLHLCIMRFDLSSEILDLFTSLPQEIRPPS" exon 15197..15443 /gene="F14F8_50" /number=1 gene 15197..16009 /gene="F14F8_50" intron 15444..15611 /gene="F14F8_50" /number=1 exon 15612..16009 /gene="F14F8_50" /number=2 gene 16790..26395 /gene="F14F8_60" exon complement(16790..19855) /gene="F14F8_60" /number=1 gene complement(join(16790..19855,20136..20912,21378..21497, 21654..21876,22204..22400,22527..23158,23335..23448, 23538..23938,24175..24536,24604..24715,24889..24984, 25114..25171,25257..25329,25544..25589,25900..26018, 26300..26395)) /gene="F14F8_60" CDS complement(join(16790..19855,20136..20912,21378..21497, 
21654..21876,22204..22400,22527..23158,23335..23448, 23538..23938,24175..24536,24604..24715,24889..24984, 25114..25171,25257..25329,25544..25589,25900..26018, 26300..26395)) /gene="F14F8_60" /note="Contains Prokaryotic membrane lipoprotein lipid attachment site AA801-811" /codon_start=1 /product="hypothetical protein" /protein_id="CAC01767.1" /db_xref="GI:9755613" /translation="MKSGSAAKLIVEALLQRFLPLARRRIETAQAQDGQYLRPSDPAY EQVLDSLAMIARHTPVPLLEALLRWRESESPKGANDASTFQRKLAVECIFCSACIRFV ECCPQEGLTEKLWSGLENFVFDWLINADRVVSQVEYPSLVDLRGLLLDLVAQLLGALS RISSVTERFFMELNTRRIDTSVARSETLSIINGMRYLKLGVKSEGGLNASASFVAKAN PLIRDIHKRKSELYHALCNMLSNILAPLSDGGKSQWPPSVAEPALTLWYEAVGRIRVQ LIQWMEKQSKHLGVRQSDLVSLLYAYLVITVLNMYNWMNRLAIRFLFLVIIENFAPYM QDKNHRYMALDCLHRVLRFYLSVYASSQPPNRIWDYLDSVTSQLLTVLRKGMLTQDVQ QDKLVEFCVTIAEHNLDFAMNHMLLELLKQDSPSEAKIIGLRALLALVMSPSSQYVGL EIFKGHGIGHYIPKVKAAIESILRSCHRTYSQALLTSSRTTIDAVNKEKSQGSLFQSV LKCIPYLIEEVGRSDKITEIIPQHGISIDPGVRVEAVQVLNRIVRYLPHRRFAVMRGM ANFILKLPDEFPLLIQASLGRLLELMRFWRACLVDDRQDTDAEEENKTAKGNDRFKKL SFHQAADAIEFRASDIDAVGLIFLSSVDSQIRHTALELLRCVRALRNDIRDLMIQEHP DHVMKFEAEPIYMIDVLEEHGDDIVQSCYWDSARPFDLRRDSDAIPSDVTLQSIIFES LDKNKWGRCLSELVKYAAELCPRSVQEAKSEIMHRLAHITPVEFGGKANQSQDTDNKL DQWLLYAMFVCSCPPDGKDAGSIASTRDMYHLIFPYLRFGSETHNHAATMALGRSHLE ACEIMFSELASFMEEISSETETKPKWKIQKGGRREDLRVHVSNIYRTVSENVWPGMLA RKPVFRLHYLRFIEDSTRQISLAPHESFQDMQPLRYALASVLRFLAPEFVESKSEKFD VRSRKRLFDLLLSWSDDTGNTWGQDGVSDYRREVERYKTSQHNRSKDSVDKISFDKEL NEQIEAIQWASLNAMASLLYGPCFDDNARKMSGRVISWINSLFIEPAPRVPFGYSPAD PRTPSYSKYTGEGGRGTTGRDRHRGGHQRVALAKLALKNLLLTNLDLFPACIDQCYYS DAAIADGYFSVLAEVYMRQEIPKCEIQRLLSLILYKVVDPSRQIRDDALQMLETLSMR EWAEDGIEGSGGYRAAVVGNLPDSYQQFQYKLSCKLAKDHPELSQLLCEEIMQRQLDA VDIIAQHQVLTCMAPWIENLNFWKLKDSGWSERLLKSLYYVTWRHGDQFPDEIEKLWS TIASKPRNISPVLDFLITKGIEDCDSNASAEITGAFATYFSVAKRVSLYLARICPQRT IDHLVYQLSQRMLEDSIEPIGYSANRGDSNGNFVLEFSQGPATAPQVVSVADSQPHMS PLLVRGSLDGPLRNTSGSLSWRTAGMTGRSASGPLSPMPPELNIVPVATGRSGQLLPS LVNASGPLMGVRSSTGSLRSRHVSRDSGDYLIDTPNSGEDVLHSGIAMHGVNAKELQS 
ALQGHQQHSLTHADIALILLAEIAYENDEDFREHLPLLFHVTFVSMDSSEDIVLEHCQ HLLVNLLYSLAGRHLELYEVENSDGENKQQVVSLIKYVQSKRGSMMWENEDPTVVRTD LPSAALLSALVQSMVDAIFFQGDLRETWGTEALKWAMECTSRHLACRSHQIYRALRPS VTSDACVSLLRCLHRCLSNPIPPVLGFIMEILLTLQVMVENMEPEKVILYPQLFWGCV AMMHTDFVHVYCQVLELFSRIIDRLSFRDKTTENVLLSSMPRDEFNTNDLGEFQRSES RGYEMPPSSGTLPKFEGVQPLVLKGLMSTVSHEFSIEVLSRITVPSCDSIFGDAETRL LMHITGLLPWLCLQLTQDQVMVSALPLQQQYQKACSVASNIAVWCRAKSLDELATVFV AYARGEIKRVENLLACVSPLLCNKWFPKHSALAFGHLLRLLKKGPVDYQRVILLMLKA LLQHTPMDASQSPHMYTIVSQLVESTLCWEALSVLEALLQSCSPVQGGTGGSHPQDSS YSENGTDEKTLVPQTSFKARSGPLQYAMMAATMSQPFPLGAAAAESGIPPRDVALQNT RLMLGRVLDNCALGRRDYRRLVPFVTTIANM" intron complement(19856..20135) /gene="F14F8_60" /number=1 exon complement(20136..20912) /gene="F14F8_60" /number=2 intron complement(20913..21377) /gene="F14F8_60" /number=2 exon complement(21378..21497) /gene="F14F8_60" /number=3 intron complement(21498..21653) /gene="F14F8_60" /number=3 exon complement(21654..21876) /gene="F14F8_60" /number=4 intron complement(21877..22203) /gene="F14F8_60" /number=4 exon complement(22204..22400) /gene="F14F8_60" /number=5 intron complement(22401..22526) /gene="F14F8_60" /number=5 exon complement(22527..23158) /gene="F14F8_60" /number=6 intron complement(23159..23334) /gene="F14F8_60" /number=6 exon complement(23335..23448) /gene="F14F8_60" /number=7 intron complement(23449..23537) /gene="F14F8_60" /number=7 exon complement(23538..23938) /gene="F14F8_60" /number=8 intron complement(23939..24174) /gene="F14F8_60" /number=8 exon complement(24175..24536) /gene="F14F8_60" /number=9 intron complement(24537..24603) /gene="F14F8_60" /number=9 exon complement(24604..24715) /gene="F14F8_60" /number=10 intron complement(24716..24888) /gene="F14F8_60" /number=10 exon complement(24889..24984) /gene="F14F8_60" /number=11 intron complement(24985..25113) /gene="F14F8_60" /number=11 exon complement(25114..25171) /gene="F14F8_60" /number=12 intron complement(25172..25256) /gene="F14F8_60" /number=12 exon complement(25257..25329) 
/gene="F14F8_60" /number=13 intron complement(25330..25543) /gene="F14F8_60" /number=13 exon complement(25544..25589) /gene="F14F8_60" /number=14 intron complement(25590..25899) /gene="F14F8_60" /number=14 exon complement(25900..26018) /gene="F14F8_60" /number=15 intron complement(26019..26299) /gene="F14F8_60" /number=15 exon complement(26300..26395) /gene="F14F8_60" /number=16 misc_feature 27422..29058 /note="putative Mutator like transposase" gene complement(join(29425..29769,29840..30004)) /gene="F14F8_70" exon complement(29425..29769) /gene="F14F8_70" /number=1 gene 29425..30004 /gene="F14F8_70" CDS complement(join(29425..29769,29840..30004)) /gene="F14F8_70" /note="similarity to hypothetical protein AT4g04780 - Arabidopsis thaliana, EMBL:AL161501" /codon_start=1 /product="putative protein" /protein_id="CAC01768.1" /db_xref="GI:9755614" /translation="MSTNSYYSSASSSGFRVCPPGVPSKCWCGEEIITFTSKTKENPY RRFYRCAIAMKRENEEHLFKWVDEALLDEIKMVNEKCKRVAENISDLRMNVMVNMELL NKNAKQMEEELIKKMEGELLTMKENVEELGRTCHGKVSFEDCRGCCCDCCIYCMALGK SVGVVEVLE" intron complement(29770..29839) /gene="F14F8_70" /number=1 exon complement(29840..30004) /gene="F14F8_70" /number=2 exon complement(31395..31505) /gene="F14F8_80" /number=1 gene complement(join(31395..31505,31597..31700,31806..31920, 32027..32074,32167..32256,32333..32407,32602..32754, 33036..33120,33210..33280,33373..33423,33563..33979, 34057..34188,34338..34451,34537..34654,34956..35014, 35276..35342,35433..35563,35663..35872,36327..37211)) /gene="F14F8_80" gene 31395..37211 /gene="F14F8_80" CDS complement(join(31395..31505,31597..31700,31806..31920, 32027..32074,32167..32256,32333..32407,32602..32754, 33036..33120,33210..33280,33373..33423,33563..33979, 34057..34188,34338..34451,34537..34654,34956..35014, 35276..35342,35433..35563,35663..35872,36327..37211)) /gene="F14F8_80" /note="Contains Prokaryotic membrane lipoprotein lipid attachment site AA922-932;Bacteriophage-type RNA polymerase family active site signatures 
AA708-719;Bacteriophage-type RNA polymerase family active site signatures AA781-795" /codon_start=1 /product="DNA-directed RNA polymerase (mitochondrial)" /protein_id="CAC01769.1" /db_xref="GI:9755615" /translation="MSSAQTPLFLANQTKVFDHLIPLHKPFISSPNPVSQSFPMWRNI AKQAISRSAARLNVSSQTRGLLVSSPESIFSKNLSFRFPVLGSPCHGKGFRCLSGITR REEFSKSERCLSGTLARGYTSVAEEEVLSTDVEEEPEVDELLKEMKKEKKRESHRSWR MKKQDQFGMGRTKFQNLWRRQVKIETEEWERAAAEYMELLTDMCEQKLAPNLPYVKSL FLGWFEPLRDAIAKDQELYRLGKSKATYAHYLDQLPADKISVITMHKLMGHLMTGGDN GCVKVVHAACTVGDAIEQEIRICTFLDKKKKGDDNEESGGVENETSMKEQDKLRKKVN ELIKKQKLSAVRKILQSHDYTKPWIADVRAKVGSRLIELLVRTAYIQSPADQQDNDLP DVRPAFVHTFKVAKGSMNSGRKYGVIECDPLVRKGLEKSGRYAVMPYMPMLVPPLKWS GYDKGAYLFLTSYIMKTHGAKQQREALKSAPKGQLQPVFEALDTLGSTKWRVNKRVLT VVDRIWSSGGCVADMVDRSDVPLPEKPDTEDEGILKKWKWEVKSAKKVNSERHSQRCD TELKLSVARKMKDEEAFYYPHNMDFRGRAYPMPPHLNHLGSDLCRGVLEFAEGRPMGI SGLRWLKIHLANLYAGGVDKLSLDGRLAFTENHLDDIFDSADRPLEGSRWWLQAEDPF QCLAVCISLTEALRSPSPETVLSHIPIHQDGSCNGLQHYAALGRDTLGAEAVNLVAGE KPADVYSGIATRVLDIMRRDADRDPEVFPEALRARKLLNQVDRKLVKQTVMTSVYGVT YIGARDQIKRRLKERSDFGDEKEVFGAACYAAKVTLAAIDEMFQAARAIMRWFGECAK IIASENETVRWTTPLGLPVVQPYHQMGTKLVKTSLQTLSLQHETDQVIVRRQRTAFPP NFIHSLDGSHMMMTAVACKRAGVCFAGVHDSFWTHACDVDKLNIILREKFVELYSQPI LENLLESFEQSFPHLDFPPLPERGDLDLKVVLDSPYFFN" intron complement(31506..31596) /gene="F14F8_80" /number=1 exon complement(31597..31700) /gene="F14F8_80" /number=2 intron complement(31701..31805) /gene="F14F8_80" /number=2 exon complement(31806..31920) /gene="F14F8_80" /number=3 intron complement(31921..32026) /gene="F14F8_80" /number=3 exon complement(32027..32074) /gene="F14F8_80" /number=4 intron complement(32075..32166) /gene="F14F8_80" /number=4 exon complement(32167..32256) /gene="F14F8_80" /number=5 intron complement(32257..32332) /gene="F14F8_80" /number=5 exon complement(32333..32407) /gene="F14F8_80" /number=6 intron complement(32408..32601) /gene="F14F8_80" /number=6 exon complement(32602..32754) /gene="F14F8_80" /number=7 intron complement(32755..33035) /gene="F14F8_80" 
/number=7 exon complement(33036..33120) /gene="F14F8_80" /number=8 intron complement(33121..33209) /gene="F14F8_80" /number=8 exon complement(33210..33280) /gene="F14F8_80" /number=9 intron complement(33281..33372) /gene="F14F8_80" /number=9 exon complement(33373..33423) /gene="F14F8_80" /number=10 intron complement(33424..33562) /gene="F14F8_80" /number=10 exon complement(33563..33979) /gene="F14F8_80" /number=11 intron complement(33980..34056) /gene="F14F8_80" /number=11 exon complement(34057..34188) /gene="F14F8_80" /number=12 intron complement(34189..34337) /gene="F14F8_80" /number=12 exon complement(34338..34451) /gene="F14F8_80" /number=13 intron complement(34452..34536) /gene="F14F8_80" /number=13 exon complement(34537..34654) /gene="F14F8_80" /number=14 intron complement(34655..34955) /gene="F14F8_80" /number=14 exon complement(34956..35014) /gene="F14F8_80" /number=15 intron complement(35015..35275) /gene="F14F8_80" /number=15 exon complement(35276..35342) /gene="F14F8_80" /number=16 intron complement(35343..35432) /gene="F14F8_80" /number=16 exon complement(35433..35563) /gene="F14F8_80" /number=17 intron complement(35564..35662) /gene="F14F8_80" /number=17 exon complement(35663..35872) /gene="F14F8_80" /number=18 intron complement(35873..36326) /gene="F14F8_80" /number=18 exon complement(36327..37211) /gene="F14F8_80" /number=19 exon 38393..39739 /gene="F14F8_90" /number=1 gene 38393..39739 /gene="F14F8_90" CDS 38393..39739 /gene="F14F8_90" /note="similarity to unusual floral organs protein UFO - Arabidopsis thaliana, PIR:S57710 Contains Prokaryotic membrane lipoprotein lipid attachment site AA422-432" /codon_start=1 /product="putative protein" /protein_id="CAC01770.1" /db_xref="GI:9755616" /translation="MERLGFWGLLMGSVEKSLDSGNSLACSASAKNGDEESSTSSKQV SPLKGSGSRNTSPLGRVGSRNTSPSRQKVVKTKPRGLEEETVASFGKQVVADVQMEDG IWAMLPEDLLNEILARVPPFMIFRIRSVCKKWNLILQDNSFLKFHSNVSSHGPCLLTF WKNSPQIPQCSVFSLPLKTWYKIPFTFLPPWAFWLVGSSGGLVCFSGLDGLTFRTLVC 
NPLMQSWRTLPSMHYNQQRQLIMVVDRSDKSFKVIATSDIYGDKSLPTEVYDSKTDKW SLHQIMPAVNLCSSKMAYCDSRLYLETLSPLGLMMYRLDSGQWEHIPAKFPRSLLDGY LVAGTQKRLFLVGRIGLYSTLQSMRIWELDHTKVSWVEISRMPPKYFRALLRLSAERF ECFGQDNLICFTSWNQGKGLLYNVDKKIWSWISGCALQSCNSQVCFYEPRFDASVL" gene complement(join(40286..40506,40590..40848,40971..41219, 41307..41440,41520..41757)) /gene="F14F8_100" gene 40286..41757 /gene="F14F8_100" CDS complement(join(40286..40506,40590..40848,40971..41219, 41307..41440,41520..41757)) /gene="F14F8_100" /note="similarity to RXF26 - Arabidopsis thaliana, EMBL:AB008020 Contains Protein splicing signature AA340-347" /codon_start=1 /product="putative protein" /protein_id="CAC01771.1" /db_xref="GI:9755617" /translation="MKSLLICLVLLELVWLGNGQSRDHQPLAPAFFVFGDSLVDSGNN NYIPTLARANYFPYGIDFGFPTGRFCNGRTVVDYGATYLGLPLVPPYLSPLSIGQNAL RGVNYASAAAGILDETGRHYVRGARTTFNGQISQFEITIELRLRRFFQNPADLRKYLA KSIIGINIGSNDYINNYLMPERYSTSQTYSGEDYADLLIKTLSAQISRLYNLGARKMV LAGSGPLGCIPSQLSMVTGNNTSGCVTKINNMVSMFNSRLKDLANTLNTTLPGSFFVY QNVFDLFHDMVVNPSRYGLVVSNEACCGNGRYGGALTCLPLQQPCLDRNQYVFWDAFH PTETANKIIAHNTFSKSANYSYPISVYELAKL" exon complement(40286..40506) /gene="F14F8_100" /number=1 intron complement(40507..40589) /gene="F14F8_100" /number=1 exon complement(40590..40848) /gene="F14F8_100" /number=2 intron complement(40849..40970) /gene="F14F8_100" /number=2 exon complement(40971..41219) /gene="F14F8_100" /number=3 intron complement(41220..41306) /gene="F14F8_100" /number=3 exon complement(41307..41440) /gene="F14F8_100" /number=4 intron complement(41441..41519) /gene="F14F8_100" /number=4 exon complement(41520..41757) /gene="F14F8_100" /number=5 gene 46886..48648 /gene="F14F8_110" CDS join(46886..47205,47285..47438,47517..47646,47727..47884, 47974..48177,48304..48648) /gene="F14F8_110" /note="strong similarity to serine/threonine-specific protein kinase NPK15 - Nicotiana tabacum Contains Protein kinases signatures and profile AA235-247" /codon_start=1 /product="serine/threonine-specific protein kinase-like protein" 
/protein_id="CAC01772.1" /db_xref="GI:9755618" /translation="MVNRSDLVVIGISVGLALGLLLALLLFFAIKWYYGRSHLRRCAN EQNSPTLPVHTAKRGVVIPDDRANTESSQPPENGAPTQHQPWWNNHTKDLTVSASGIP RYNYKDIQKATQNFTTVLGQGSFGPVYKAVMPNGELAAAKVHGSNSSQGDREFQTEVS LLGRLHHRNLVNLTGYCVDKSHRMLIYEFMSNGSLENLLYGGEGMQVLNWEERLQIAL DISHGIEYLHEGAVPPVIHRDLKSANILLDHSMRAKVADFGLSKEMVLDRMTSGLKGT HGYMDPTYISTNKYTMKSDIYSFGVIILELITAIHPQQNLMEYINLASMSPDGIDEIL DQKLVGNASIEEVRLLAKIANRCVHKTPRKRPSIGEVTQFILKIKQSRSRGRRQDTMS SSFGVGYEEDLSRVMSRIKDQHVELGLLAGVKEENHQERNIATT" exon 46886..47205 /gene="F14F8_110" /number=1 intron 47206..47284 /gene="F14F8_110" /number=1 exon 47285..47438 /gene="F14F8_110" /number=2 intron 47439..47516 /gene="F14F8_110" /number=2 exon 47517..47646 /gene="F14F8_110" /number=3 intron 47647..47726 /gene="F14F8_110" /number=3 exon 47727..47884 /gene="F14F8_110" /number=4 intron 47885..47973 /gene="F14F8_110" /number=4 exon 47974..48177 /gene="F14F8_110" /number=5 intron 48178..48303 /gene="F14F8_110" /number=5 exon 48304..48648 /gene="F14F8_110" /number=6 exon complement(50390..50636) /gene="F14F8_120" /number=1 gene complement(join(50390..50636,50709..50866,50944..51148, 51230..51519,51628..51793,51886..51974,52074..52141, 52255..52549)) /gene="F14F8_120" gene 50390..52549 /gene="F14F8_120" CDS complement(join(50390..50636,50709..50866,50944..51148, 51230..51519,51628..51793,51886..51974,52074..52141, 52255..52549)) /gene="F14F8_120" /note="similarity to auxin-independent growth promoter - Nicotiana tabacum, PIR:A44226 Contains Prokaryotic membrane lipoprotein lipid attachment site AA101-111" /codon_start=1 /product="putative protein" /protein_id="CAC01773.1" /db_xref="GI:9755619" /translation="MEKFLYHRKLWEMNVKLLGESKVEKLKNSFVSRPRMSLWMIRAV TVLLLWSCFVHLMALGEMWGPRLFKGWPSCFNHHQLSTAAEMTSLPTKIALPPKRVYV NNGYLMVSCNGGLNQMRAAICDMVTVARYMNVTLIVPELDKTSFWNDPSEFKDIFDVD HFISSLRDEVRILKELPPRLKKRVELGVYHEMPPISWSNMSYYQNQILPLVKKHKVLH LNRTDTRLANNGLPVEVQKLRCRVNFNGLKFTPQIEELGRRVVKILREKGPFLVLHLR 
YEMDMLAFSGCSHGCNPEEEEELTRMRYAYPWWKEKVINSELKRKDGLCPLTPEETAL TLTALGIDRNVQIYIAAGEIYGGQRRMKALTDAFPNVVRKETLLESSDLDFCRNHSSQ MAALDYLVALESDIFVPTNDGNMARVVEGHRRFLGFKKTIQLNRRFLVKLIDEYTEGL LSWDVFSSTVKAFHSTRMGSPKRRLVIPNRPKEEDYFYANPQECLQLLDEPLRVI" intron complement(50637..50708) /gene="F14F8_120" /number=1 exon complement(50709..50866) /gene="F14F8_120" /number=2 intron complement(50867..50943) /gene="F14F8_120" /number=2 exon complement(50944..51148) /gene="F14F8_120" /number=3 intron complement(51149..51229) /gene="F14F8_120" /number=3 exon complement(51230..51519) /gene="F14F8_120" /number=4 intron complement(51520..51627) /gene="F14F8_120" /number=4 exon complement(51628..51793) /gene="F14F8_120" /number=5 intron complement(51794..51885) /gene="F14F8_120" /number=5 exon complement(51886..51974) /gene="F14F8_120" /number=6 intron complement(51975..52073) /gene="F14F8_120" /number=6 exon complement(52074..52141) /gene="F14F8_120" /number=7 intron complement(52142..52254) /gene="F14F8_120" /number=7 exon complement(52255..52549) /gene="F14F8_120" /number=8 exon 57051..57238 /gene="F14F8_130" /number=1 gene 57051..58303 /gene="F14F8_130" CDS join(57051..57238,57324..57413,57893..57967,58072..58303) /gene="F14F8_130" /note="strong similarity to 40S RIBOSOMAL PROTEINs - different species" /codon_start=1 /product="ribosomal protein-like" /protein_id="CAC01774.1" /db_xref="GI:9755620" /translation="MRKLKYHEKKLIKKVNFLEWKREGNHRENEITYRYHMGSRDDYK KLVPLFALKALFYLFFFFWMVQKLTNIMKQMDPADPFRIQMTDMLLEKLYNMGVIPTR KSLTLTERLSVSSFCRRRLSTVLVHLKFAEHHKEAVTYIEQGHVRVGPETITDPAFLV TRNMEDFITWVDSSKIKRKVLEYNDTLDDYDMLA" intron 57239..57323 /gene="F14F8_130" /number=1 exon 57324..57413 /gene="F14F8_130" /number=2 intron 57414..57892 /gene="F14F8_130" /number=2 exon 57893..57967 /gene="F14F8_130" /number=3 intron 57968..58071 /gene="F14F8_130" /number=3 exon 58072..58303 /gene="F14F8_130" /number=4 exon 58702..58968 /gene="F14F8_140" /number=1 gene 58702..59507 /gene="F14F8_140" CDS 
join(58702..58968,59223..59507) /gene="F14F8_140" /note="strong similarity to plastid-specific ribosomal protein 3 precursor - Spinacia oleracea, EMBL:AF239218" /codon_start=1 /product="ribosomal protein 3 precursor-like protein" /protein_id="CAC01775.1" /db_xref="GI:9755621" /translation="MAVQANQSASFGFRTASPSQKLSSKPIAHISLSTKLKPSSRPSL SCSTWNQGQIPARHSCINPGIFAYPPSNLTFSHELPESESPPLGKKKMRVLVKPLEKP KVVLKFVWMQKDIGVALDHMIPGFGTIPLSPYYFWPRKDAWEELKTLLESKPWISELH RVFLLNQATDIINLWQSSGGDLS" intron 58969..59222 /gene="F14F8_140" /number=1 exon 59223..59507 /gene="F14F8_140" /number=2 exon complement(59695..60144) /gene="F14F8_150" /number=1 gene complement(59695..60144) /gene="F14F8_150" gene 59695..60144 /gene="F14F8_150" CDS complement(59695..60144) /gene="F14F8_150" /note="similarity to Glucosamine-6-phosphate acetyltransferase EMeg32 protein - Mus musculus, EMBL:AJ001006" /codon_start=1 /product="acetyltransferase-like protein" /protein_id="CAC01776.1" /db_xref="GI:9755622" /translation="MAETFKIRKLEISDKRKGFIELLGQLTVTGSVTDEEFDRRFEEI RSYGDDHVICVIEEETSGKIAATGSVMIEKKFLRNCGKAGHIEDVVVDSRFRGKQLGK KVVEFLMDHCKSMGCYKVILDCSVENKVFYEKCGMSNKSIQMSKYFD" exon complement(60500..61509) /gene="F14F8_160" /number=1 gene complement(join(60500..61509,61704..61899)) /gene="F14F8_160" gene 60500..61899 /gene="F14F8_160" CDS complement(join(60500..61509,61704..61899)) /gene="F14F8_160" /note="similarity to other proline-rich proteins" /codon_start=1 /product="proline-rich protein" /protein_id="CAC01777.1" /db_xref="GI:9755623" /translation="MERITTLWFWFSLMIFLGISINGGLSQGQQHVMKKTRSSAVVVG TVYCDTCFNGAFSKSPNHLISGALVAVECIDENSKPSFRQEVKTDKRGEFKVKLPFSV SKHVKKIKRCSVKLLSSSQPYCSIASSATSSSLKRLKSNHHGENTRVFSAGFFTFRPE NQPEICSQKPINLRGSKPLLPDPSFPPPLQDPPNPSPLPNLPIVPPLPNLPVPKLPVP DLPLPLVPPLLPPGPQKSASLHNKKSDSLKDKKTEALKPNFFFPPNPLNPPSIIPPNP LIPSIPTPTLPPNPLIPSPPSLPPIPLIPTPPTLPTIPLLPTPPTPTLPPIPTIPTLP PLPVLPPVPIVNPPSLPPPPPSFPVPLPPVPGLPGIPPVPLIPGIPPAPLIPGIPPLS PSFSSHHQP" intron complement(61510..61703) /gene="F14F8_160" /number=1 
exon complement(61704..61899) /gene="F14F8_160" /number=2 gene 65402..66566 /gene="F14F8_170" exon 65402..65521 /gene="F14F8_170" /number=1 CDS join(65402..65521,65615..66056,66144..66253,66540..66566) /gene="F14F8_170" /note="similarity to RING-H2 finger protein RHB1a - Arabidopsis thaliana, EMBL:AF079179" /codon_start=1 /product="putative protein" /protein_id="CAC01778.1" /db_xref="GI:9755624" /translation="MGCVSSCFGVDDFEDYPNPSSSVNRSCPCPRCLVNNFLNLYISL FRRGETRSLPSSLQATNVSIATSTSYDNFMSNTFHSTPRPLPYDADPRYFRSRRDSLV SRRDKGSSHSHEEAEPLRSDADVDSESFSVEGSKWANKLIISGEDSKEEFSRSSRRIL QSRTMSTSNEGLYITSDDEDVCPTCLEEYISENPKIVTKCSHHFHLSCIYEWMERSEN CPVCGKVMEFHETP" intron 65522..65614 /gene="F14F8_170" /number=1 exon 65615..66056 /gene="F14F8_170" /number=2 intron 66057..66143 /gene="F14F8_170" /number=2 exon 66144..66253 /gene="F14F8_170" /number=3 intron 66254..66539 /gene="F14F8_170" /number=3 exon 66540..66566 /gene="F14F8_170" /number=4 exon complement(67196..67322) /gene="F14F8_180" /number=1 gene complement(join(67196..67322,67451..67599,67737..67820, 67911..68010,68100..68161,68252..68333,69185..69369)) /gene="F14F8_180" gene 67196..69369 /gene="F14F8_180" CDS complement(join(67196..67322,67451..67599,67737..67820, 67911..68010,68100..68161,68252..68333,69185..69369)) /gene="F14F8_180" /note="sequence differences to mRNA EMBL:M55551 Contains MADS-box domain signature and profile AA3-57" /codon_start=1 /product="MADS box protein AGL2" /protein_id="CAC01779.1" /db_xref="GI:9755625" /translation="MGRGRVELKRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVA LIIFSNRGKLYEFCSSSNMLKTLDRYQKCSYGSIEVNNKPAKELENSYREYLKLKGRY ENLQRQQRNLLGEDLGPLNSKELEQLERQLDGSLKQVRSIKTQYMLDQLSDLQNKEQM LLETNRALAMKLDDMIGVRSHHMGGGGGWEGGEQNVTYAHHQAQSQGLYQPLECNPTL QMGCCFGDDDDDDRYDNPVCSEQITATTQAQAQQGNGYIPGWML" intron complement(67323..67450) /gene="F14F8_180" /number=1 exon complement(67451..67599) /gene="F14F8_180" /number=2 intron complement(67600..67736) /gene="F14F8_180" /number=2 exon complement(67737..67820) /gene="F14F8_180" 
/number=3 intron complement(67821..67910) /gene="F14F8_180" /number=3 exon complement(67911..68010) /gene="F14F8_180" /number=4 intron complement(68011..68099) /gene="F14F8_180" /number=4 exon complement(68100..68161) /gene="F14F8_180" /number=5 intron complement(68162..68251) /gene="F14F8_180" /number=5 exon complement(68252..68333) /gene="F14F8_180" /number=6 intron complement(68334..69184) /gene="F14F8_180" /number=6 exon complement(69185..69369) /gene="F14F8_180" /number=7 tRNA 73241..73314 /note="tRNA predicted as a tRNA- Ile : anticodon aat" /product="tRNA-Ile" exon 73400..73789 /gene="F14F8_190" /number=1 gene 73400..76723 /gene="F14F8_190" CDS join(73400..73789,74082..74363,74478..74565,74673..74794, 74868..75007,75092..75164,75307..75395,75470..75575, 75662..75853,75961..76155,76247..76324,76424..76723) /gene="F14F8_190" /note="strong similarity to several N2, N2-dimethylguanine tRNA methyltransferases" /codon_start=1 /product="N2, N2-dimethylguanine tRNA methyltransferase-like protein" /protein_id="CAC01780.1" /db_xref="GI:9755626" /translation="MNFSKRFVGFKQLSILRYSLPNPYGLFRVSPLCLSETATKLEAI GFIQASLKPVAKFRHLYSHIYRFKLCDLPFGFQIRVGNSISESSAPKEKNLRSAMETD LNDYTVIKEGEAEVLMHKKNQVFFNKAQVNNRDMSIAVLRAFIIKRKQEHEAMLSKRA RSSGKVVEKDVSETSKEETPTENGDDNGKTNGEHEVTTQDGPKEAAKTAYESARRELK PPRVLEALSASGLRALRYAREVEGIGQVVALDNDPASVEACQRNIKFNGLMSTSKVES HLTDARVHMLSHPKDFDVVDLDPYGAPSIFLDSAVQSVADGGLLMCTATDMAVLCGAN GEVCYSKYGSYPLKGKYCHEMALRILLASIESHANRYKRYIVPVLSVQMDFYVRVFVR VYTSASAMKNTPLKLSYVYQCIGCDSFHLQSVGRSLPKNNSVRYLPGVGPVVPQDCTH CGKKYNMGGPIWSAPIHDQEWVNSILNGVKSMKDRYPAYDRICAELPDVPLFLSLHSL SATLKCTSPSAALFRSAVINAKYRVSGSHVNPLGIKTDAPMEIIWDIMRCWVKNHPIK PQSPEHPGSVILSKEPSHQADFSRHVGSLSKAQAKKVARFLPNPEKHWGPKIRAGRTI TSKHVSLLGHEAVNGHLNNNHKEAGDEEEEEEEEEPEEDIIEGEPELKRQKTTEDFAS TS" intron 73790..74081 /gene="F14F8_190" /number=1 exon 74082..74363 /gene="F14F8_190" /number=2 intron 74364..74477 /gene="F14F8_190" /number=2 exon 74478..74565 /gene="F14F8_190" /number=3 intron 74566..74672 /gene="F14F8_190" 
/number=3 exon 74673..74794 /gene="F14F8_190" /number=4 intron 74795..74867 /gene="F14F8_190" /number=4 exon 74868..75007 /gene="F14F8_190" /number=5 intron 75008..75091 /gene="F14F8_190" /number=5 exon 75092..75164 /gene="F14F8_190" /number=6 intron 75165..75306 /gene="F14F8_190" /number=6 exon 75307..75395 /gene="F14F8_190" /number=7 intron 75396..75469 /gene="F14F8_190" /number=7 exon 75470..75575 /gene="F14F8_190" /number=8 intron 75576..75661 /gene="F14F8_190" /number=8 exon 75662..75853 /gene="F14F8_190" /number=9 intron 75854..75960 /gene="F14F8_190" /number=9 exon 75961..76155 /gene="F14F8_190" /number=10 intron 76156..76246 /gene="F14F8_190" /number=10 exon 76247..76324 /gene="F14F8_190" /number=11 intron 76325..76423 /gene="F14F8_190" /number=11 exon 76424..76723 /gene="F14F8_190" /number=12 tRNA 77114..77185 /note="tRNA predicted as a tRNA- Asp : anticodon gtc" /product="tRNA-Asp" exon 77389..78435 /gene="F14F8_200" /number=1 gene 77389..78435 /gene="F14F8_200" CDS 77389..78435 /gene="F14F8_200" /note="similarity to ring finger protein - Cicer arietinum, EMBL:AB026262" /codon_start=1 /product="putative protein" /protein_id="CAC01781.1" /db_xref="GI:9755627" /translation="MSSAKLFGCSINVNVEAEEEEGGDGGSSTNVEVSRSGNQPDCEA MSFSNQMEIGVRNTYYQFLESNSDSGSDSMYAEPEFIDFFDRESYEVDTVREVCVSSN QRVSTPGYFNIWDQDVDLGLGIGLGSRSGSGQLPGDSGGVGVEVGRGVTPVEYNLFGE EAMVVDEVLEWENFNNAIHLVQEPAYASMEGEEEEEEDEVVMEFAASIYSDAWEILLY DNMTNSAPMDLDVEVWLDSVDGYAPMDYNAIIGQMFDNETGIKGTPPASKSVVDGLPD VELTIEELSSVSIVCAICKDEVVFKEKVKRLPCKHYYHGECIIPWLGIRNTCPVCRHE LPTDDLEYERKRRA" exon 84193..84753 /gene="F14F8_210" /number=1 gene 84193..84753 /gene="F14F8_210" CDS 84193..84753 /gene="F14F8_210" /note="similarity to putative bZIP DNA-binding protein - Capsicum chinense, EMBL:AF127797 Contains bZIP transcription factors basic domain signature AA77-92;bZIP transcription factors basic domain signature AA78-92" /codon_start=1 /product="bZIP DNA-binding protein-like" /protein_id="CAC01782.1" /db_xref="GI:9755628" 
/translation="MQPNYDSSSLNNMQQQDYFNLNNYYNNLNPSTNNNNLNILQYPQ IQELNLQSPVSNNSTTSDDATEEIFVINERKQRRMVSNRESARRSRMRKQRHLDELLS QVAWLRSENHQLLDKLNQVSDNNDLVIQENSSLKEENLELRQVITSMKKLGGGIHDKY SSPSSMDELDQDFSSITDDPRTHHPS" exon complement(86945..87304) /gene="F14F8_220" /number=1 gene complement(join(86945..87304,87538..88299)) /gene="F14F8_220" gene 86945..88299 /gene="F14F8_220" CDS complement(join(86945..87304,87538..88299)) /gene="F14F8_220" /codon_start=1 /product="CONSTANS" /protein_id="CAC01783.1" /db_xref="GI:9755629" /translation="MLKQESNDIGSGENNRARPCDTCRSNACTVYCHADSAYLCMSCD AQVHSANRVASRHKRVRVCESCERAPAAFLCEADDASLCTACDSEVHSANPLARRHQR VPILPISGNSFSSMTTTHHQSEKTMTDPEKRLVVDQEEGEEGDKDAKEVASWLFPNSD KNNNNQNNGLLFSDEYLNLVDYNSSMDYKFTGEYSQHQQNCSVPQTSYGGDRVVPLKL EESRGHQCHNQQNFQFNIKYGSSGTHYNDNGSINHNAYISSMETGVVPESTACVTTAS HPRTPKGTVEQQPDPASQMITVTQLSPMDREARVLRYREKRKTRKFEKTIRYASRKAY AEIRPRVNGRFAKREIEAEEQGFNTMLMYNTGYGIVPSF" intron complement(87305..87537) /gene="F14F8_220" /number=1 exon complement(87538..88299) /gene="F14F8_220" /number=2 exon complement(91899..92249) /gene="F14F8_230" /number=1 gene complement(join(91899..92249,92359..93075)) /gene="F14F8_230" gene 91899..93075 /gene="F14F8_230" CDS complement(join(91899..92249,92359..93075)) /gene="F14F8_230" /codon_start=1 /product="CONSTANS-like 1" /protein_id="CAC01784.1" /db_xref="GI:9755630" /translation="MLKVESNWAQACDTCRSAACTVYCRADSAYLCSSCDAQVHAANR LASRHERVRVCQSCERAPAAFFCKADAASLCTTCDSEIHSANPLARRHQRVPILPISE YSYSSTATNHSCETTVTDPENRLVLGQEEEDEDEAEAASWLLPNSGKNSGNNNGFSIG DEFLNLVDYSSSDKQFTDQSNQYQLDCNVPQRSYGEDGVVPLQIEVSKGMYQEQQNFQ LSINCGSWGALRSSNGSLSHMVNVSSMDLGVVPESTTSDATVSNPRSPKAVTDQPPYP PAQMLSPRDREARVLRYREKKKMRKFEKTIRYASRKAYAEKRPRIKGRFAKKKDVDEE ANQAFSTMITFDTGYGIVPSF" intron complement(92250..92358) /gene="F14F8_230" /number=1 exon complement(92359..93075) /gene="F14F8_230" /number=2 exon complement(94278..94439) /gene="F14F8_240" /number=1 gene complement(join(94278..94439,94541..94631,94708..94851, 
94920..94985,95075..95212,95305..95406,95503..95628, 95837..95917,96250..96321,96588..96865)) /gene="F14F8_240" gene 94278..96865 /gene="F14F8_240" CDS complement(join(94278..94439,94541..94631,94708..94851, 94920..94985,95075..95212,95305..95406,95503..95628, 95837..95917,96250..96321,96588..96865)) /gene="F14F8_240" /note="similarity to sterol esterase - Rattus norvegicus, EMBL:Z22803 Contains Carboxylesterases type-B signatures AA214-229" /codon_start=1 /product="Carboxylesterase-like protein" /protein_id="CAC01785.1" /db_xref="GI:9755631" /translation="MHSPLQTQQPEQRCWPMTSTVSEIEEVLPDEDSDRTTLLNGEPL RRRVSGKSPVDEGPRRIFRQQSFGRDIGHAAAETYLITGLSFKLLRYLGLSMDDKITC PYMLCYASYAWLSSRLDLYLPSNNDGLKPVVVFVTGGAWIIGWLLVNKFMFANLVLIF RYKAWGSLLGMQLAERDIIVACLDYRNFPQGTISDMVTDASQGISFVCNNISAFGGDP NRIYLMGQSAGAHIAACALLEQATKELKGESISWTVSQIKAYFGLSGGYNLYKLVDHF HNRGLYRSIFLSIMEGEESFEKFSPEVRLKDPVVGKAASLLPPIILFHGSSDYSIPCD ESKTFTDALQAVGAKAELVLYSGKTHTDLFLQDPLRGGKDELFDDIVSVIHAEDNDGL TKDSLAPPRKRLVPELLLKLAREISPF" intron complement(94440..94540) /gene="F14F8_240" /number=1 exon complement(94541..94631) /gene="F14F8_240" /number=2 intron complement(94632..94707) /gene="F14F8_240" /number=2 exon complement(94708..94851) /gene="F14F8_240" /number=3 intron complement(94852..94919) /gene="F14F8_240" /number=3 exon complement(94920..94985) /gene="F14F8_240" /number=4 intron complement(94986..95074) /gene="F14F8_240" /number=4 exon complement(95075..95212) /gene="F14F8_240" /number=5 intron complement(95213..95304) /gene="F14F8_240" /number=5 exon complement(95305..95406) /gene="F14F8_240" /number=6 intron complement(95407..95502) /gene="F14F8_240" /number=6 exon complement(95503..95628) /gene="F14F8_240" /number=7 intron complement(95629..95836) /gene="F14F8_240" /number=7 exon complement(95837..95917) /gene="F14F8_240" /number=8 intron complement(95918..96249) /gene="F14F8_240" /number=8 exon complement(96250..96321) /gene="F14F8_240" /number=9 intron complement(96322..96587) /gene="F14F8_240" /number=9 exon 
complement(96588..96865) /gene="F14F8_240" /number=10 BASE COUNT 30580 a 17554 c 17879 g 30879 t ORIGIN 1 aggaagatgg taagctaagt tgggagatat gtttgtgtca gaagttgact catggccagt 61 ccccactttt tctttatatc ttcaacccca agttactttc tcacttctca tctctgaatc 121 attagggttt tcgtttcttt accaacaaaa gatagttgag atgttgtatt cggacgagaa 181 cacgtcagtt tctaaggtgt tctattatca atcactacca taaacattgc ttaaagttat 241 gggaaacaag aaagaataaa tttggagaat gcaacaacaa gtaggtgaaa gcccatcttc 301 agaccaaaaa gactcatcaa taatctgtaa aaaacttatg ttttaaatcc cataaacaca 361 tcaattcaaa ttcctgaaga agaaaaagcc tgaggttaaa cactgcacct gggaatcaat 421 ttccttaaat ctttactcaa aagatgaggc tcctcttcag cttctgcttc ttcttcttca 481 tgatcatctt taccgcaagt gagtcctgta tattcatttt tgcccgcaca ttaacttact 541 tttcttactc tctgattttc tttttttcac agctgcttat gatccattag atcctagtgg 601 taacattaca atcaaatggg atattatgtc ctggacggca gatggctatg tggtaagaat 661 cctctgcact tgtgcgtgag tgtatatatg cttgaaactg gcacaaactc tcatcatctc 721 tatacattct gtgcatttgc aggctacggt aactatgaac aacttccaaa tctaccggca 781 catacaaaac cctggttgga cattaggttg gacatgggca aagaaagagg tgatttggtc 841 aatggttggt gcacaaacaa cagaacaagg agactgttcc aagtttaagg gaaatgtacc 901 tcattgctgt aagaaaaccc ctacagttgt tgatctcttg ccaggtgtgc cttataatca 961 acagttctca aactgttgca aaggaggtgt aattggagct tggggtcaag atccatcagc 1021 cgctgtatcc cagtttcagg ttagtgctgg tttagctgga actacaaaca agactgtcaa 1081 gcttcctaag aacttcactt tgcttggtcc cggccctggt tacacttgcg gtcctgccaa 1141 aatcgtgccc tctaccgttt ttctcacaac tgacaaacgg cgaaaaacac aagctttgag 1201 taagcaatca atcaaaaacc ttaacttgtt ccatcatcga aatctaaaca tcaatttcaa 1261 acttatgttt ctgcagtgac atggaatgtt acctgcacat actcacagtt tttagcaaga 1321 aagcatccaa gctgttgtgt ctccttctct tctttctaca acgacaccat aactccttgc 1381 ccgtcttgtg cctgtggctg cgagaacaaa aagagctgcg tcaagtgagt aacaaatcat 1441 tgaaatccaa actctttctt agatatcctc tgaaaaaata tcaatatgaa tgttttcttg 1501 cagggctgat tctaagattc taaccaagaa aggtctcaac acaccaaaaa aggacaacac 1561 tcctttgttg caatgcacac atcacatgtg ccctgttaga gtccactggc 
acgttaaaac 1621 taactacaaa gactattggc gagtgaagat agcaatcaca aatttcaatt accggatgaa 1681 tcatacactc tggactttag caattcagca tccaaatctc aacaatgtga ctcaagtttt 1741 cagctttgac tacaaaccag tctctcctta cggatccata agtaaataaa tacctccctc 1801 aaccacaact attcaacttc aaaccaacca ttgaatgact aataacattc ttaaatcttt 1861 ttgatacaga tgatactgga atgttctatg gaacgaagtt ttacaatgat ttattaatgg 1921 aagctggacc ttcagggaat gtgcaatcag aggttttgct acagaaagat caaaagactt 1981 ttactttcaa gcaaggttgg gcttttccta gaaaagttta ctttaatggt gatgaatgta 2041 tgttacctcc accagattca tacccttttc taccaaactc tgcacaaggg aactttgctt 2101 cgttctcact caccattctt cttctcctat tcatctcaat atggtgattt gatctgatcc 2161 ttctgggttt taaaacactg aaccagaact gttttttgta aactcattgc aaagattgta 2221 aaagaaaaaa aggagttttg tgagaggaaa tgattagaga tgtgcaaggc ttttttacct 2281 tgatcactgc cattgatcaa tgactctgta cttcttggct ttttgctttg gtactaatgt 2341 gaaagtgaag catattgatt tgattgagaa agtggcaaaa caacgaaatg tttttaattg 2401 ggctcaaata aacgcatttt gaatccagcc caataaaggc ctggtttaac attcacgcgt 2461 ttcaatagat tcgcgtgtgt attattttcg ttggtctgaa aaggcaaaag gtcaaaaacg 2521 gcgttttgtt tgggagacac tttttgttcg aagatatttt ctcattctct gatctctctg 2581 tccgaaacgg ctccgttaat tttaaagtaa atattccaaa ataagtttat gtcaaggcta 2641 gtctttttga aaataagatc agaattttac ctatatatat atatttattt atcaaaaaaa 2701 gtagacacta cttacaaaga ttagtattat caacgagtaa tacaaaatgg tttgcaagtt 2761 aatttaatta aaaacaaaaa actaagattg ctaaaataca gtatttgtga acaagttgaa 2821 ctaaaatcta gaagccagtc ccgcacggtg gcttttaagc ccaaatcaat gcccctttga 2881 ttgagagatc tgaaatttag gactcaaaaa gtttgttgct gaccaaacgc gaaactttga 2941 ttaaaaacgc gtttgcgttt taaaattctc ctgtgtttgt gtatgtgtgg aaaattaatt 3001 tcttctcccg acgaatccca aaagtttgtt gaaatttgcc tccaagaaat ctgcaattag 3061 aaatttcatt tcgccgtcat cgtcttcttc ttctgattct gattaattcc agatgtcttc 3121 ctgagtctta cggagattag gtcattttcg cctcgactcg aactctgtct cgcagcttcc 3181 gtgagagagt catggacacg ccaccgactt ctcgtatcgc atcttttggt cagacggaga 3241 ttaactggga caagtacgtt cttcattttc ctcctcaatt tccatcgatt gatccgcttg 
3301 ttttctagtt ttgattgatt ctctgtggcg ttgttgttat tagtccgaac tttgtttatt 3361 gttgtttcga tccagttttt gtgattctgt gattggtttc ttcggtggat ttatcgacat 3421 gaaatggata tcgctcgttg ttttgacgga atcatttcgt ggaggtaatt gaagatttcc 3481 gaatttattg gaaagaatct ggttctcggg ctcctgagaa gctcgatttc agaaattgat 3541 ttggtgcaga agatttgttc gctagtgtta atcaaatctt tgtttaggta aaagatctag 3601 taggcatttg gatagtttta gtgttcctta ttcttacggt ataatctgag gtcctacgtc 3661 taaggttttt ggaaatgtgt gatgagtagg gccaaaacaa acttgaaacc attacaaatt 3721 tggttgtaag atttccgaga ggtacctgaa gaatcataca acaaataata tcttcatact 3781 agctctaatt atttttgctc tctctccttc ttttatcttg cagacttgac aaaaggaggt 3841 tctacattaa tggagctggc ctcttcactg gtgttacagt agctctgtat cctgtatccg 3901 ttgtgaaaac aaggcttcaa gttgcttcta aagagattgc tgagagaagt gccttttctg 3961 tagttaaagg aattttaaag aatgatggtg ttcctggtct gtaccgaggt tttggtactg 4021 tcattacagg tgctgtacct gcaagaatca tatttctaac tgctcttgag accactaaga 4081 tttctgcttt taagttggtt gcacctttgg agttaagtga acctacacaa gccgccattg 4141 caaatggaat tgctggcatg acagcatctc ttttctcaca ggctgtgttt gtcccaattg 4201 atgttgtatg tattatatac aacactaccg ctttgtaaga tttgtaaatg ctgtgaatgt 4261 tctatcacta tgagttgctg aaaaccatat ttctcctatt ttctgcaggt tagccaaaag 4321 ttgatggtac aaggatactc aggtcatgct acatatactg gtggtatcga tgttgccaca 4381 aaaatcatta agtcatatgg tgtaagggga ttatacagag ggtttggtct gtctgttatg 4441 acctattctc cttcaagtgc cgcttggtgg gctagctatg gatcaagcca acgtgttatc 4501 tggaggttag ccatgaatgt tctagttaat tgatgatagt ttccatttta gtttgattgc 4561 aagaacttgc atttcagtcg ttcctggaat ttggatttgc taccaaagct acaattccgc 4621 tgatacaata tctgctactg ttgggcagat tcttaggtta tggtggtgac tcggatgcaa 4681 ctgctgctcc tagtaagtca aaaattgtta tggtccaggc tgctggagga attattgctg 4741 gtgcaacagc atcctcaatt acaacaccat tagacacaat caaaacgcga ctgcaggtat 4801 aaagtgttcc cattcatagt aaaaacttga ttttattttg catttacttt ttttaactct 4861 aatgaatata gaagagcttc cgtgcctgct gtcctggttc tcaaatagat aacttcgggt 4921 attgctacat agaaactatg ctaaacaaac ggtaaaatct ctgtgcaggt catgggacat 4981 
caagaaaata gaccttcagc gaaacaagtg gtgaaaaaac tgctagcaga agatggctgg 5041 aaaggattct ataggggttt gggcccaaga ttctttagca tgtcggcttg gggaacctcg 5101 atgatattga cttacgaata cttaagtaag agttcttttc ttttataccc aaaatctcac 5161 tttgagccac tctcaaaacc aaccaagatt cttcttttga aatgctagct gctcctctga 5221 catgtttcac tctttcatgg tttttttttc cagagcgtct gtgtgcaata gaagattaga 5281 aagcttcact actgttcgtt ggtttttctc cttctactaa ctcgagtaga cggcatcacc 5341 ttgattcaaa tctcggctat tgggagcttc catgttgaca attggaccat tctttcaact 5401 ttgaagtctg agttccgtta ttgtaatcat tctttcaact taatttgcgt attggagtgt 5461 attcttacag cagaagaaat tctagcttac gttcttctgc tttatgtagc ttctgagatt 5521 taagaagctt caaaacaatg caatcacttc atgaaatctt atagtcgtac ggatgtacgt 5581 gttatcgtaa cagtcaaaat ttgagttgtt gcaaatggct tctcatttgc ataactgtct 5641 tcacaaagta tcctcattct taggatatca gccaaagcta acctctggtc attctcattg 5701 atagcaatat aatcatcatc aaacctctgg tcattctcat tgatagcgac atatacatca 5761 tcaagtactt tattataact atgatcattc tcattgatgc tgtgtctctt agcaaaaaaa 5821 aggccactag ccataggcta acaccaaatc accaagagcg cttactgggt tttattttag 5881 ctcatatacg taaatctatt actccagttc ctaagattta agagtgttta ttatcctaaa 5941 caaaaagcca ccgatatatc taattgttat tcaaaatcag ataaaataaa ataaatcgta 6001 tatatacgaa aagtttcaag ataaacaaga aaagtagtta gtatttgacc ttttttagtt 6061 tatttcctcg tagcaggctc atttttttcc cctctctgac tcagcagatt aatttctcca 6121 ttaggataat aacttaaata tttccggaga cttaaatatt cttacaacgt ctttatcaac 6181 aaaaacaaaa ttcttgaata gcgatctctt ttcttttaat taaaaaaaaa gactttaaaa 6241 atgaggataa aatcaaatat aataatgcat tattgatagt aacttaaaca tgcatggagc 6301 tcttaattat ttttgggaga cttttagata tttatgaaat aaaataaagt gtagatgttt 6361 taaaaataaa aataaaataa agtgtcgatc acctaccgac ctatacgtgc atgtattctc 6421 gtttatacga tagttacatg atttatgcaa taatcataga cgacgtgcat cgacgagcgg 6481 cccacgacca cgacgtaagg attacggtcc acgtcgtaag gattacgttt taaatttata 6541 gtttatcaat tgtatatcgt aaataataaa tttagaatga ctgagaattt gttgagaaat 6601 gaaaatatta gagaatttat cgaatgattc gaatctaaag cgaataataa tttttttgat 6661 gatttatgtc 
cttttgtttc gtcgacattt atgatttaag tcctagcata tactatattg 6721 tacttattac cttatacaat acgatttgta tatatacgaa ctttaactaa aaataaagtt 6781 aaatgttgaa gagtttaaga cactaccata gctcccccac atccacatgc actatttctt 6841 cgtaacatat caaattaaaa ccatgcattt attaagttgt cagttaagga aacaaaaagt 6901 tgtgaatgta tttgatttta atttattttt tcagtttttt tcttttaggt agaaagaatg 6961 aaaccttagt tttgtacata ataaaaacct tactatctta cattccagtc aacaaattaa 7021 atacagtgag tacattttgt tagatatacg ttcatttcat tagcacatac gttatatact 7081 tatatagtaa tcataatcat aaaactgtac atatgaattt atttggttgc aaaataaagt 7141 tagcttcaat caattataat aaatgcttca ctattttaga gttttggttt atttaggggt 7201 atttaaatct aaataaaact gaaaatgaag caaaagcaaa cttaaattaa agtacatctt 7261 ttattgggtc cgaaaaactg aaattaaatt tcgtagtagt cttaaatgat tttgaccttt 7321 ttaaataata tattcaaatg tgtttcaaac acgaatcaaa ctataccaaa aaaaaaaaaa 7381 aagttggata aaaaataaaa cctgactaca cctcaacttt ggatcaaaat ctatgaatat 7441 attttcaaaa ttatcttagt caaattttaa attaattaat tatttatata aaatttaata 7501 attatcataa ccttggatta aatttatcta cagtcaaaaa ttaattttaa atcaattaat 7561 taatagcatt attacaatcc ctaattgtac gggacgaata aaaaagtaga aaactcaagt 7621 tcctttcttt accatacagc tttttcgatt ggagttgaat aagtcttcat ctgacacgtg 7681 taaccctggc acatgccgtc cactaaaaca cgtgcgagat ctgtataaat caaacctacg 7741 cgtttcatct ctcttttcaa aactcaccga cgcgatccga tctcatctct ctcatttcga 7801 aaccatggtt gagccggcga atactgttgg tcttccggtg aacccgactc cgttgctgaa 7861 agatgagctc gatatcgtga ttccgactat cagaaacctc gatttcctcg agatgtggag 7921 gccttttctt cagccttacc atctgatcat cgtccaggac ggagatccat cgaagaagat 7981 ccatgtccct gaaggttacg actacgagct ctacaacagg aacgacatta accgaatcct 8041 cggacctaag gcttcttgta tctcgtttaa ggattctgct tgtcgatgct ttgggtacat 8101 ggtgtctaag aagaagtata tcttcaccat tgatgacgat tgcttcgtaa gttacttgaa 8161 ttttgagttt tgtattcgtt tttatgcttg atttgagagt tttgtcaatt ttggttctag 8221 atctgttttt ttgagcttat ttgtttgtgt ttgtgtggat ttttcaagtt cattgcttga 8281 atttcgtaga tttggtgaga gatcaattat acgattcact aaatttgacg gatcttaggt 8341 ttgtgagata atccttggtt 
cgattagcta ggcaattcaa tgttttgtac cagatccata 8401 gatctgcttg ttgagtctga atatgttttc acttttgtgt aattagccat gatctctaat 8461 gtttacttgt agattttctg tgagctgatg tctcttttgt tgacgacatt gttgttgagc 8521 tgatatctct gagtcattat agctaccttt acgatatggt tgcacgtcct tgttcatcac 8581 ttttttcttt tgttttacct ttttgagatt tgtggggcat atccaaggat gagtctcgat 8641 gacgcttgtg tttagtttat aattttctga gttttttttg gaggaactct ttgatcaatg 8701 gcttgatctg gattttaacc gctttttaat tcatgtattt ctttgatgtg tacatgtagg 8761 ttgccaagga tccatcaggc aaagcagtga acgctcttga gcaacacatc aagaaccttc 8821 tctgcccatc gtctcccttt ttcttcaaca ccttgtatga tccttaccgt gaaggtgctg 8881 atttcgtccg tggataccct ttcagtctcc gtgaaggtgt ttccactgct gtttcccatg 8941 gtctttggct caacatccct gactacgatg ccccgaccca actcgtgaag cctaaggaga 9001 ggaacaccag gtgacaataa ttatcatcat aacatgttta tgtgtttttt tgtcaggata 9061 ttcaaatgtc agtttttgct aaacgtttga tatgtcaggt atgtggatgc tgtcatgacc 9121 atcccaaagg gaacactttt cccaatgtgt ggtatgaact tggcttttga ccgtgatttg 9181 attggcccgg ctatgtactt tggtctcatg ggtgatggtc agcctattgg tcgttacgac 9241 gatatgtggg ctggttggtg catcaaggta atttcttctt attcccttgt aagactcata 9301 attgagtata gctaaatatg aagcacatgc tctgtactaa gcgatacctc catttggggt 9361 tgaatctttt ataggtgatc tgtgaccact tgagcttggg agtgaagacc ggtttaccgt 9421 atatctacca cagcaaagcg agcaaccctt ttgttaacct gaagaaggaa tacaagggaa 9481 tcttctggca ggaggagatc attccgttct tccagaacgc aaagctatcg aaagaagcag 9541 taactgttca gcaatgctac attgagctct caaagatggt caaggagaag ttgagctcct 9601 tagacccgta ctttgacaag cttgcagatg ccatggttac atggattgaa gcttgggatg 9661 agcttaaccc accagcagcc agtggcaaag cttgagagca gtatgagcca aaaagaaaaa 9721 gccaccaaag ttttggttat ttttagctca aattatcgtt acttttaaat ttctgatttt 9781 acgaaccttt cttgcttttt ttacacattt gagtagtttt catcatcagt actttctcat 9841 tgtccggtta tggtttttgc atttggttta aatatcaccg gtttatttat aaacagtggt 9901 ggattagtag tactattttc tgagtttttt tctttgtttc attaataaaa aggccttttc 9961 ataggtgttt gcaattagtt tttttccccc attaatcatc gattatcata ggtatgttat 10021 ggctttaaat ggtataagga aattgcttat 
agaccaaaaa aaagttgaat tgctattgag 10081 agagctttta caaaagaaag agcattgttc aataagcttt tcacatttgg tcgatatttt 10141 gatcaaccta tcataggtat ctcaattaat aaaccggaat gttaatatgt tttgcaagat 10201 atccctacaa tcaaaccgtg aagacccgac ccggttgcgc aagatccatt ttcctacttg 10261 gttaaatggt caagcccatt aaccatacta taggcctgtt gccattttcg accgacctct 10321 ctatttctac ttactactta aagaagcaag aggagacgta gaggttcata cagggtttaa 10381 aaacaagagc aagtcaccat gatgatagca agaaatcagt ccaagaagtt gaagacggag 10441 aacaattcac aaactttgca aactcgaagg aagggaacaa atatgtcgtt ccttttgacc 10501 tagtgatcga gattccgtag ctaggtctgt cctcttatcc aaggtatggg ggaaaatagt 10561 ccgcagtaga cttttcatga catctttccc gtttacgtca ctgtcgtctc agcctgatcg 10621 tctcttcgtt gttttcattg atttcattaa tatcttacgt tttttgcggt gattcctctt 10681 ttgtggaaaa gtattatcac catataaacc agttggattc actttatgtg atgctggcac 10741 tatattgttt atgcctctca tctcaagttc tcaactattc tttttgcggt ttcctcattt 10801 cgtgatgaga ttcttagaca ttgcatgtgt attgtgcttt tcttcttctt atcaagaacc 10861 aactttgttc aaattagatt ctcaaagtgg gctttctttt agcatctacg cgattgtgtg 10921 gttgactact tttcttttaa gtataatttc atagttttgt tttatttcat tttagtatat 10981 acagtactac aactgtgtag ttaacttact acagtagcat gtatttcatt tgaccaaaac 11041 tttggtatat caattaacgt ttttactttt tactttttac tttttacact cctggtttac 11101 ccaattttgt gcaaaatgat gcttatccat gaagaccgga cccggttatg taagatccac 11161 gaggtttatt acgttcggat ctgctccttt atctccattt ctcagctggg gtagagagag 11221 agatctaggg tttcgaaaga agagcgatcg atcatgagaa gacgcagcaa aaagatcaaa 11281 accgagaaca acagcaatcc agaaacgtcg gaagagagaa acaagtttga tgaaatccct 11341 catgacctag tgatcgagat acttgaaaga ttgcctttga aatctgtagc caggtttctc 11401 acagtatcca agttatgggc aacaactata cgcagtccag atttcagaaa atcttaccgg 11461 ggtggatctt cgtcggagcc tcgtacccta atcgtttccg acctaaattt taaggaacca 11521 aacccaaagt tgcatttctt caggccgtct atatcatcac cgtcttttct atcaagtctg 11581 acatgtccgt tcacatatcc tcggcacgaa gagtactatt atcatcatgt taatggattg 11641 ataagcgttg gatatggtac agaccaaatc gtaattaacc ccacgactgg aaaattcata 11701 actttaccaa 
gacccaaaac aaggagaaag ctcgtaataa gttttttcgg gtatgattca 11761 gtgagtgatc aatacaaagt gttgtgcatg acggaaagac tgcgtggcca tccagaagaa 11821 gcatcatctc aacatcaagt gtacacattg ggagctaaac agaaatcatg gaaaatgatt 11881 aattgtagca tccctcaccg tccttggtcg tggaacgccg tgtgcataaa tggtgttgtg 11941 tattacattg ctaaaacggg ggagggtatg tttcgtcggt gcttaatgag atttgatttg 12001 aagtctgaca atttggatct ttgcactatt ttacctgaag agattcaaac aagtctacat 12061 gattactttt tgatcaacta caaggggaaa gtagccatac ccaatcaacc taatttctat 12121 acatatgatg tgtgggttat gaatcaggaa ggtggaaaaa ttgaatggtt gaagaatata 12181 actttcacta ttaaacctcg gaagggtttt gttcgttatc tattcgtcac aggcactact 12241 catacgggtg agtttatttt ggcaccaacg tcctataccg atgagtttta tgtcttccat 12301 tacaatcccg acatgaatag ttttagaaag ataagggttc aagcacccgg agttaagttt 12361 agttttgctc agaaagcttc agttgttttt tcggatcacg tagagagtgt ttggttgttg 12421 taggagagac atggttttga tacccataat catcagtatc gaaccttgga agagtttatt 12481 gcgtttggat aatttacaca ttagaggctc tactcatacg gctacgggag agtttatttt 12541 ggcaccacgg ttctattctg atgatcttaa tgttatccat ttcaatcccg acacgaatag 12601 ttttagaagc actaaggttg aagtatacga agactatgag tgaaagcgtc atggtacaag 12661 agcaatggtt tttcgggatt acgtagtctt aagtttttct aggagagaca gtttcatgtt 12721 ttgataccta tcgtatttga aatgtttgca ttgatagacc cttttttagg gaaatttttg 12781 tcacttcatt gcttaattta gttaaaccag ttggatttgt ctactttctt ttctcctttt 12841 acaactatta tttatttcat ctgttttagg acatgcttta tatgaatgaa tcatttatca 12901 tttatcaatc atttagtatc taaaagagtt tatgctcttt gatatatcta cgaagaataa 12961 tactaaggtg ggagtcaatg tttgaaatat tgataagccg gaagacgtaa gtaagggctt 13021 gattggttct ccctctgcca cccgcaaacg ctgcgtttgc gggtgatagc ggttattagc 13081 ggttggcacc aatcatagaa accgctagat accgcttcgg accgctcgaa atcgccagat 13141 ttcaaaagct ccttcccgca agcatttgcg gttgcgggcg gtagcggtta agttaaaaaa 13201 aaatattaat taattaaaat aattatgtct tccacccaaa ccttatactt attcttatca 13261 gaaaacccta atcaatgaaa ccctattctc tcaattatgg catccaccca aactgacaaa 13321 aatcaaagtc aaaatcaaga atcaaacgta cgtatgatct atcctttcat ccattcatta 
13381 gctagtgtaa ttctcggaat atttcatggc atcgttgttt ttttttattt tgtttttttt 13441 tgagtttttt gattgaatac atgttttgat ttgttttctc gattagaaaa tcatttggtc 13501 aaacgagacc acatttgcgt tacttcaacg ggttattgtg gagaaataaa cggaagaatc 13561 caaagacccc aaaatcagat ttctaagtga tgttcaaaaa gaaaatattc ttaagaatat 13621 taacgtttgg aaaaaatttg agttggagat gtgttaaaaa tcgtttggat atcttgaaga 13681 aactttatca tatgtataga ggcaatccgg aaaacccgcg agttcaacgt tatttacagt 13741 ttatcacact acttgatgca atttttggtg atgtaccaaa tgcataaagt attaaacttt 13801 ttcttacaaa atttatttat ttttgaattt ttagttttat ttatggtttt aattattaag 13861 tatgtttaat ttttcaaata ttatcaattt atcataataa tatattgtat tttttttctt 13921 ttaatagtaa tatattatat ttattttggt tattttttat ttaattacta tcgcacccgc 13981 tggtttacca gtcataaaac tcccgcaaac gcacccataa ccaaacgctc aaccagtcgt 14041 tcaaaacact tgataacgct tgaaaccgca accgcccgtt tccgcaaact cccgcaaccg 14101 caaccgcacc cgctgcgttt aaaccagtca ggccctaagt gatgcatctt gactaaataa 14161 aatttatgtt aaattacaaa agttgtcgat ataacagccg atttgtctat actactaagg 14221 ccaggactta gtagttgcgt tgattcctct atatcatgtg tatgtccgtt cccatatcag 14281 gtaccattct cattatgtta ataacttgat tgtggtcaaa agcaaatcat aattttaatg 14341 actttattaa agtatgaatc caataactcg agatttatga agattttaaa ttacatttta 14401 caaatcaaaa tctaataaca ttaaacttta acagagtctt ataaaatctt aattgaataa 14461 cacaagattt tttaacattt caagtcacta aaaatccata taaatcacaa accaataaca 14521 ccttttagtt aaaaatacag aaatccaaat ctcatggttt aaggtgagat ttgaattaat 14581 tttacaaaaa tcatatgaac ttccctaaaa ttatcaaaat catttaaaaa ctcaagaaac 14641 tcaaatcact ttaaattcta atttgaatac acgccctttc taagtttcaa tataaaagag 14701 cacttgcttt tccggattac gtagagagtc ttaggtgttt agtaggagag gcagtttcat 14761 ggcgttgatc tctctttttt aactttggtt gattactcta ttgattactt actcttattt 14821 cacaaacaat cttacctctt tttataaaat cattcggact ttgtctcctt gcttctgggg 14881 tattattgac actatagttt ctgtctacca aaacgagttc gtgataacat tattatacat 14941 atgaaaagag gtcatgattt aatcctcaac ccaatcccac aaaaggtata aagatgcata 15001 tatatacgtg aagacgggac ccgggtacgt aaatccactt 
taccagtcgg gtatagatat 15061 gtaggttcaa aaaaagcgca ttggttatac cttgtctcca tatttataag ttttaaatta 15121 ttactatata ctcagctgga aaaaaagcat acaaaagaca gagaaatcta gggtttcgaa 15181 accagagcga tcgatcatga tgagaagacg caacaaaaag accaaaaccg tgatcagcaa 15241 tccagaaacg ttggaagaga gaaacaagtt tgatgaaatc cctcatgacc tagtgatcga 15301 gattcttggg agattgcctg ccaaatctgt agctaggttt ctcacagtat ccaagttatg 15361 ggcaactagt atccgcagtc tagatttcat caaatcttac ccgcttggat cttcgtcgaa 15421 gcctcgtacc ttagttgctt ctagtgaacc agatctcaat atggaacacc acgaatgata 15481 cttcttctcg cagtcttcct catcaacgtc tcttgtatca cgtgtgagtg tgacatgtcc 15541 gctcccatat tactatgatc atcactaata tcatcatgtt aatgggttga taatcattgg 15601 acatggtcca gagcaagtcg tggctaaccc tagcacaggt cgaaccatac ctttaccaag 15661 agtcaaaacc aggagaacga tagcaacaag ttttttcggg tatgattctg ttagtgatca 15721 atacaaagtg ttgtgtatga cggtaaaagc gtatggtgat ctgagggatg agtcatctca 15781 acatcaagta ttcacattgg gagctaaaaa gaaatcattc agaatgatag atactagtat 15841 tattcctcac cgtccttgct ctaacggtgt gtgcatagat agtgttgtgt attatgttgc 15901 taaaacgggg gcgggaatgt tgcatctatg cataatgaga tttgacttga gttctgaaat 15961 attggatctt tttactagtt tgcctcaaga gattcgacct ccaagttgaa ttattgagaa 16021 tttgataaac tacgagggga aactagccat accaacggaa actacttcat atacatatgt 16081 tgtgtgggtt atagatcagg attctgaaaa acatgaatgc ttgaagaaat taactttcag 16141 tattgaacca gcttggaaga attcatttgt taatctacgc ctcataggct acactactca 16201 tacgtgtgag tttattttgc atcaccacac tataattatg agatttatgt ctcccattgc 16261 aatcccgaca cgaatagttt tagacgcact aaggttgaag catccgcagc gttcaagttt 16321 ccttttataa gagcaatggt ttttccggat tacgtagaga gtgttaggat gttgtaggag 16381 aggcagtttc atgctgtata atactctctt tataaaactt tagcacttat aaagagagta 16441 ttcattcgac gactctaatg tacaaaatcg aggatgtaat tgtacattac aattcacgat 16501 tacaattcac gactatacgt gcacattaca attcacgatt ttggttgact tcactatcta 16561 ttagttcttt atgttttcac aatttcacag aaaaaaactc tgatcataat aaagtaacaa 16621 ataaaacttt ccttcttaca tgtctggaat acattggact ctactaaccg cgatcataca 16681 agttagttgc tacgacacca 
aactattcac acacagacac attgcttctt taaaacttct 16741 ggaaagcata caaattacca aaaataaatt caaaacttta ctgaaaatgt tacatgtttg 16801 cgatcgttgt gacaaaaggt accaatcttc tgtaatctct tcttcctaga gcgcaattat 16861 caagaacccg acccaacatt agccttgtgt tctgtaaagc cacatctctc ggaggtatcc 16921 ctgactctgc cgcagctgca ccaagaggga aaggttgcga cattgttgca gccatcattg 16981 cgtactgaag aggtccgctt cgggctttaa atgaagtctg aggtacaagc gttttctcat 17041 cagtcccgtt ttcagagtaa ctcgaatcct gagggtggga tccaccggtg cccccttgta 17101 ccgggctgca gctctgaaga agagcttcga gaacactcaa tgcctcccaa cacagagtgc 17161 tttcgactag ctgtgaaaca attgtgtaca tgtggggact ctgcgaagca tccatgggag 17221 tgtgttggag caatgccttg agcataagta atatgactcg ttggtaatcc actggtcctt 17281 tttttagcag tcttaacaag tgaccgaaag ctaaagctga gtgtttcggg aaccatttgt 17341 tgcagagcag aggagagaca catgcaagga gattttcaac tcttttaatc tcgccacgag 17401 catatgcgac aaaaacagtc gctaattcat cgagtgattt tgcccgacac cagacagcaa 17461 tgttagatgc cacagaacat gccttttgat attgctgctg gagtggtaac gcagaaacca 17521 tgacttggtc ttgggtcagc tgcaagcaaa gccaaggaag tagacctgtg atatgcataa 17581 ggagtcttgt ctcagcatcg ccaaatatag agtcacatga aggcactgta atccttgata 17641 ggacctcaat agagaactcg tggctaacag tagacatcag tcctttgagg acaagaggtt 17701 gtacaccttc gaactttgga agagttccac ttgacggcgg catctcgtac cctctagatt 17761 ctgacctttg gaattccccc aaatcattag tgttgaattc atcccgaggc atacttgaca 17821 agagtacatt ttcagttgtt ttgtcacgaa atgatagccg gtcaataata cgggagaaga 17881 gctcaagaac ctggcaatag acatggacga agtctgtgtg catcatggcg acacagcccc 17941 agaagagctg tgggtagagt atgactttct ctggctccat gttctctacc atgacctgta 18001 gcgtcagaag gatttccata ataaaaccca aaactggagg aatgggattg ctaagacatc 18061 gatgaagaca gcgtagaagg gagacacatg catcacttgt cacacttggg cgtagcgcac 18121 ggtatatttg gtgtgaccga catgctagat gtcttgacgt gcattccata gcccatttga 18181 gagcttcagt tccccaagtt tctcgaagat ctccttggaa gaatatagca tcaaccatac 18241 tctgaacaag tgcagacaaa agagcggcac taggaagatc tgttctcaca accgttggat 18301 cttcattctc ccacatcatg cttcctcgtt ttgattgcac atacttaatt aaactcacaa 18361 
cctgttgttt gttctctcca tcgctgtttt ccacctcata tagctccaaa tgccgacccg 18421 ctagtgagta caataaattc acaagtaaat gttgacaatg ctctagcaca atgtcttcag 18481 agctatccat cgaaacaaaa gtgacgtgga agagcaaagg cagatgttcc ctaaaatctt 18541 catcattctc atatgcgatt tctgcaagaa gaatcagagc tatgtcagca tgagtgagtg 18601 agtgctgttg atgtccctgc aaggctgatt gaagttcctt tgcattgacg ccgtgcattg 18661 cgatccctga atgcaacaca tcttctccag aatttggtgt atctatgagg taatctccgc 18721 tatcacgtga gacatggcga ctcctcaaac ttcctgtaga gctacgaact cccattaaag 18781 gtcctgatgc attcaccaat gatggcagga gctgaccaga acgaccagta gctacaggaa 18841 caatattcaa ctcaggaggc attggactta aaggcccaga agcacttcgc ccagtcatcc 18901 ctgctgttct ccagcttagg cttccacttg tattcctaag gggaccatca agggagcctc 18961 tgacaagaag tggtgacata tgtggttggc tgtctgcaac tgaaacaact tgaggtgctg 19021 tggcaggtcc ctgagagaac tccagcacaa agtttccatt ggaatcaccc cggtttgcac 19081 tataaccaat gggttcaata ctgtcttcca gcatccgctg tgacagttgg taaaccaggt 19141 gatcaatagt acgctgagga catattcgtg cgaggtacaa acttacacgc ttagcaactg 19201 aaaaatatgt agcaaatgca ccagttattt cggcagatgc atttgaatca cagtcctcaa 19261 ttccctttgt gatcaagaaa tccaatacag ggctgatgtt tctgggctta ctagcaatgg 19321 tactccaaag tttttcaatt tcatcaggga attgatcccc gtgtcgccat gtcacatagt 19381 aaagactttt tagtaatctt tcactccaac ccgaatcctt gagtttccag aaattaaggt 19441 tctcaatcca tggagccata caggtcaaga cctgatgctg agcaattata tcaacagcat 19501 caagctgacg ctgcataatt tcttcacaaa gcagctggct taactccgga tgatccttgg 19561 cgagtttgca ggagagcttg tattggaact gttgatatga atctggaaga ttacctacaa 19621 ctgctgctct atagcctcct gaaccttcta taccatcctc agcccactca cgcatagaaa 19681 gtgtctcaag catttgaaga gcatcatcgc gaatttgtct agatggatcg actaccttgt 19741 agagaattag acttaagagc ctctggattt cacattttgg tatttcttgt cgcatgtaaa 19801 cctcagccaa cacactgaag tacccatcag ctatagccgc atcagagtag taacactggg 19861 aaaagtacag atgaattagg tcatgcagtt tagaagagtc taatcaagga tgtagtcaaa 19921 cagatgagca aagcatagaa gtgaaaaaag cttattaaga gaattcagat gcttaaactg 19981 gagctaataa aacaaaacta tctgatacgg aaccatcact gcatggataa 
gttgtcatca 20041 ttggtcaaga tttatcaaac atctgaccta gtaattacca agacgacagt ataattaaac 20101 acaacttcca tgcattcagt cagatagaac attacctgat caatgcaggc aggaaagaga 20161 tccaagtttg taagtagaag attcttcagg gccaattttg ccaacgcaac acgttgatga 20221 ccacctctat gcctatcacg acctgttgtc cctcgtccac cttctcccgt gtatttggag 20281 taagatggag ttcttggatc agcaggcgag taaccaaaag ggaccctggg agcaggctca 20341 atgaacaaac tatttatcca agatatcaca cggccgctca tttttcttgc attgtcatca 20401 aaacacggtc cataaagaag ggaagccatt gcattcaatg aggcccactg gatggcttca 20461 atttgctcat tcagttcctt gtcaaaagaa atcttgtcta cagaatcttt ggaccggtta 20521 tgctgtgagg tcttatagcg ctctacttca cggcgataat cactaacacc atcctgtccc 20581 cacgtatttc cagtgtcatc ggaccaggaa agaagcagat cgaatagacg tttcctactt 20641 ctaacatcaa acttctctga ttttgattca acaaattcag gggcaaggaa tcttaaaaca 20701 gacgcaagtg catatcgcag aggctgcatg tcctgaaagc tttcatgagg tgccaacgat 20761 atttgtcttg ttgaatcttc aatgaaccgc agataatgga ggcggaaaac tggtttacga 20821 gcaagcattc caggccagac attctcagac actgttcgat atatattaga gacatgaaca 20881 cggaggtcct ctcttcgacc tcctttttgg atctgcattc atatcatagg aacacatcgt 20941 tttagcgtca agtctcaata ttaatcaccc agctgtaaat caatgtaaca aaaatggtga 21001 agaggaatcc aacggtcatt aatttcatat tcagggaaaa acttagtaat tcgacatccg 21061 tagcagccca tgtgtgagaa aatactggat acttccgttc cctcaactct aatgattcaa 21121 aatgaagctc aataaagaca catagaggta caaataatgg caatgacttc tagtactttt 21181 catccatgag atttccctta cttgttctcc ccttttttta ttcatttgct taggtaagtt 21241 ttgtaatgaa tctatagtga taatgagtac aaccgtaaat ctgtgcagcc cagcagcata 21301 gtgaatgctg taggatttcc aggtataatt ctgaaagaaa gtattctaca ttgttttaca 21361 atttgacaag tatatacctt ccactttggc ttcgtctctg tctctgatga aatctcctcc 21421 atgaaagatg caagctcgct aaacataatt tcacatgctt ccaaatgaga gcgcccgaga 21481 gccatagttg cagcatgcta catgcaagga agtataacgt aaagattgta aaaacttatc 21541 taggagaaaa aattcaacca aaaccacatt aacctaacaa gacaagtagg tttaaaagac 21601 tgacaaaact accaacttca gactaaaaat gcagagacac aaaaacaact tacgttgtgg 21661 gtctctgacc caaatctaag ataagggaaa 
ataaggtggt acatgtctct ggttgatgca 21721 atgctacccg catctttgcc atcaggtgga cacgaacaca caaacatagc atacagaagc 21781 cattgatcca gtttgttatc tgtatcttgt gactggtttg cctttccacc aaactcaaca 21841 ggtgtaatat gggccaggcg gtgcataatt tcagacctgg aaatgaaaag ggcactttag 21901 tttccttcat gttattaaga atagcaaaag agttatggca ataaaatgag acaaaaaaat 21961 tcttcctaaa ttttaaaatt cccacagtaa agtgaaattt atttgcacgt tatactaggg 22021 catgaatttt actgaaggat tgttgatcca taaagcctac tttacttttt taatctttat 22081 agactggtag gacagcgttg acggtcagag aagactcaag aaaacacacc aaggatgatg 22141 tagttaggaa aatatacttg gagcaaacac actaaagata atgcagctaa gaaatatact 22201 tacttggctt cttgaacaga gcgtgggcag agctcagcag catatttaac aagctcgctg 22261 agacatcgac cccatttgtt tttatcaagg ctctcaaaga taatggattg aagagtcaca 22321 tcagaaggaa tggcatcgga atctcgtctt aaatcaaaag gacgggcaga atcccaatag 22381 caactttgaa caatatcatc ctattatgtg aaagaaataa aggatacaaa tcttaatttt 22441 ctataaagcc agacggtttt aaaccataat caaccaaaag gctaaaaagc ctctgaaatg 22501 gtgagatcta cacagaagct acttacccca tgttcttcca agacatcgat catgtatatg 22561 ggttcagctt caaatttcat aacatggtct gggtgctctt gaatcatgag atcccgaata 22621 tcatttctca aagcacgtac acatcgtaat aattctaggg cagtgtgccg aatctgactg 22681 tcgactgaac taagaaatat tagcccaaca gcatcaatat cagaagcgcg aaactcaatt 22741 gcatctgctg cttgatgaaa agataatttc ttgaacctat catttccctt tgctgttttg 22801 ttttcttctt cagcatcagt atcttgtctg tcatctacta aacaagccct ccagaaacgc 22861 atgagttcca acagacgtcc tagtgatgct tgaatgagga gtgggaattc atcaggaagc 22921 tttaagatga agtttgccat tcctctcatg actgcaaaac gacgatgagg aaggtatcta 22981 acaatccggt tcaggacttg cactgcttcc actcgaactc caggatcaat acttatacca 23041 tgctgaggta ttatttccgt gatcttatca ctccggccaa cttcctcaat cagataaggt 23101 atgcacttca gcactgactg gaagagagat ccttgagact tttctttatt cacagcatct 23161 gcatataaca tgaaggttag cgaaaagtgt tttcggaaag taacagccag gggatcctag 23221 gtagaaaatc aaaaataata tcacacttaa atacacacaa catataaaga tttcttaacc 23281 tcctcacaaa accatgtaaa tcagactaag ctttttgcgt aaaactaagt ttaccgattg 23341 tagtcctcga 
tgatgttaga agagcttgac tgtaggtcct gtggcaggat cttaaaattg 23401 actcaattgc tgctttcacc tttggtatgt aatggcctat cccatgacct attaagtgaa 23461 cactaagaaa agttaagtgt atgaaaggaa aagcttattg agacctacta acaacctaat 23521 tcaggttaac gggttaccct taaatatctc caacccaaca tactggcttg aaggcgacat 23581 aacaagggca agtaaagcac ggagaccaat aatctttgct tcgcttggac tatcttgctt 23641 taacagttct agtaacatgt gattcatagc aaagtcaagg ttatgttcag ctatggtcac 23701 acagaactca acgagtttat cttgctggac atcctgagtg agcattcctt ttctcagaac 23761 tgttagcaat tgtgatgtca cactatccaa gtaatcccat atacgatttg gtggctggga 23821 cgaagcatag acactcaaat aaaacctcaa cacccggtga agacaatcaa gtgccatgta 23881 gcgatggttc ttatcctgca tataaggcgc aaaattctct ataatgacca agaacaagct 23941 tgaaaaagaa caatgaagtc acaggtgctc aacatacttg aaaaaaaata agctgctgca 24001 gttacaactg cacatcaggg tttgggacgt gataattggt tccatgcatt cttacttaat 24061 ttaaattacg aatacaggca aacttactct cagaagcttg tacagttgtt ccatgtgaga 24121 gcttaaatta tgatgaaata taagcgggtc accgagacag agaagaagac tcacaagcgg 24181 atagccaacc tattcatcca attatacatg ttaaggactg taatcacaag gtaggcatac 24241 aaaagtgata ctaaatcaga ttgacgtaca ccaagatgtt tactctgttt ctccatccac 24301 tggattagct gcactctgat gcgtccaaca gcctcatacc agagtgtcag tgcaggttct 24361 gcaactgaag gaggccactg gcttttgcca ccatcggaaa gtggcgccaa gatgttagag 24421 agcatattgc aaagcgcatg ataaagctca cttttacgtt tgtgtatatc ccggataaga 24481 gggtttgctt ttgctacaaa ggaggcagac gcattcagcc caccctcact cttaacctgt 24541 aaataaacat actgtcacgt tcactttctg aagcctgcct tttctaagat atctgagaca 24601 taccccaagc ttcaagtaac gcatcccatt aatgatgctg agagtttcac ttcttgcaac 24661 gcttgtatca atccggcgag tattgagttc catgaaaaac ctctccgtga cagagctgaa 24721 tctgttagtg aaaattaaat aaatggataa gagtggttgt ggcttaaaat agatcatcac 24781 tttcacaaat tcttttctgc caactaaact tgtgaatgga agtccacata ttcttttcat 24841 ttatacttta gcatgcttgg aggattgata acttacaaaa tgaattacct aatacgggac 24901 agtgcaccca ggagttgagc aaccaagtca agaaggagac ccctcaaatc gaccagcgat 24961 gggtattcca cttggctaac aaccctgtaa cggagaagca cccagaactt agaaattcag 
25021 gtttttacaa ctcgaaaaag gaatgccaaa ggataaaaga agtgtttagc gaatcatata 25081 aaattcaacc gttccagatt ctgaaatcac tacctgtcag cgttgattag ccaatcaaaa 25141 acaaagttct caagtccaga ccatagcttc tctgccattc acgagagatg ttcttctttt 25201 agcaaggcag taaccacatt ttgttgaaaa gaatctaaaa agagagacta acaaaccagt 25261 gagcccttcc tgcgggcaac actccacaaa gcgaatgcaa gctgagcaga agatgcactc 25321 aacagccaac tgattccagt aatgtaaaac catattttag aaagttgaaa accaatatcg 25381 aaaattgatt aaataagcca ttaaaatatt agggttcata aatggaggta agagtactaa 25441 gaaacagaga agtaagagta aaacagtctg gtcagttgga tctagtgtac atagttacca 25501 aaaccccgag gaattgttcc tatgaagagt cgaaagaaaa tacctttctc tggaaagtcg 25561 aggcatcatt tgcaccttta ggtgattcac tgcaagccac aaacaaacac aatattagta 25621 atggtctaga gaagtggctt cacccaatca tagataacat ctagtcaata aattcaccta 25681 tacgaagaaa aagtgataat ccacataaga attatgataa tgcaccaaag agtccaaaag 25741 aaacactgaa acatagaaga tcaatgactc aaacctaaaa ataatgcaag atgtagaaat 25801 aatagcctcg tgggagacaa ggtcttctag acgtatttca tacagacaag agtcagatac 25861 catctaattg ttaagagaga gaaataacaa tatacccacc tttccctcca tctaagaaga 25921 gcttctaaaa gaggaacagg tgtatgacgg gcaatcatag ccaatgaatc caacacctgc 25981 tcataagcag gatctgatgg acgaaggtac tgtccatcct gcaaacatca taccaagaac 26041 cacatcacta ccaaagagct atttaagaac aaatacgtat actagtcagt aagctacagt 26101 cggaacagcg gcatttccaa ttcaatgtaa tacttccagt aaaccgtact caagactaag 26161 tttcaactac tgagaatgaa actatctaat ttgcagatac caactcctta gagcactgaa 26221 aacttaaact acctgactct atatggattc aaatgtgtgt taaaaaaaaa aaaaaaggct 26281 tattgagaag gagaggaacc tgagcctgag ctgtctcgat tcgtcgtctt gcgagaggca 26341 ggaaacgctg aagaagcgct tctactatca atttcgcagc actcccagac ttcatcacgt 26401 tcagcaagaa tagatcctca aaagcgaaat gtcctctaaa ttaaattaag aaaaattcaa 26461 caataagatt tatagacccg aatcgaacag aatacgagac tgaatctcgt gaaattttcc 26521 cagaaataat gagaaatacc aaaggagagc accacgaatc acgaaatcat tctagggttt 26581 agttcgtcga tccggcaaga gaatcgctaa gaaatttctt tgatcgacga cgagcttcct 26641 cgttcctctt ttcttatttg tttttgtttt caatatattt 
tttttgtttc tttttttttt 26701 ctgaaaagtg tttaacgtta agatgtttaa taaaaatcat ttgttaacta acaacattta 26761 aggtagattc aattaaataa aacaaacaaa atacaacaag tagtacagta tagtatcaat 26821 ctcgggatct tttgtctgaa actaattata tttttgtcaa cgtctcgaaa gtaaattaga 26881 gaagctatat agattttttg ttttttcatt tcgcataatt gacaaaaaat ttacaagact 26941 attaattacc atgcttattt tctttggatt ggctgatttt tgaggtataa actatttcga 27001 ttctcagctt tcactcccag acttttgggc ccttaaattt ccaccatcaa ctttcatatt 27061 cctcaaaatt acatcatcta ctttcgaatt catgcttaaa catacatttt caattgaacc 27121 attcatttaa ccgggattaa caacaaatct gcgtttactg taccgcgatt gcgattaacc 27181 cgactaatac ccagacccat taagaaaacg acgtcgtttt catttcgttt aactcatcaa 27241 aaagagaaag aaaacgacgt cgttttcatt cttctttaac tctcaaatcg atttggtcgt 27301 tcttctttaa tctcgacact gtgaaattag ggttcttgag agagaaagac aaaaggggat 27361 ttggagtttt cttcttcgat tcgaattatt gggtttcgat ttgggtttcg aatagggaga 27421 aatgaagaat tggtgggagg aaggattgat ttaccaaagc gatccagatg accccgattt 27481 cgaacctcca gagagtgaca ttgaagccga cgatggaagt gatagtggtg atagcggagt 27541 cgaggaggac gaagccaccg gagttgaggg agacgacgtt gggatagatg gagacagaga 27601 atctgatgga gaggagaata gagaagacga tggtgacata gcgtctgatg gagacgtcaa 27661 tctagaagac gatggagaca gagatgaaag tcaaaagaag aggacgagag aagagaaaag 27721 aaaggaggaa gaggctgaag aaggtcaaaa gaagaagaga aaacagaaga aggatgagcc 27781 ggaagaagat ttggccgaga ggtttgagtt cgagatagag gaagcagtag ctatgtggta 27841 tgatgagtta aagattagaa ggaacgaaat accagaaagt aataatgaag aagaagatca 27901 tgtgatcact agagataaga agattaggtt agcttcggat gataggttag ctataggcag 27961 aacattcttc actggttttg aattcaaaga ggttgtcttg cattatgcaa tgaagcatag 28021 gataaatgct aaacaaaata ggtgggaaaa ggataaaatt agctttagat gtgctcaaag 28081 gaaggagtgt gaatggtatg tctatgcctc atactctcac gagaggcaat tgtgggtttt 28141 gaagaccaaa tgtctggatc attcttgtac ttctaatgga aaatgcaagc tgttgaagag 28201 gaaagtgatt ggtagattgt ttatggataa gctaaggttg cagcctaatt tcatgcctct 28261 tgacattcag aggcatatca aggagcaatg gaagctagtt agtactattg gacaagtgca 28321 agatggaaga cttctagctc 
ttaagtggct gaaagaagaa tatgcccaac agtttgcaca 28381 ccttcgaggc tatgtggcag aaatcttgag tacaaacaaa ggatcaactg caattgttga 28441 taccattagg gatgcaaacg agaatgatgt cttcaaccgg atttatgttt gtcttggagc 28501 aatgaagaat gtgttttact tctgtaggcc tctcattggg atagatggaa cgtttttgaa 28561 gcatgcggtt aagggatgtc tgttcactgc tatagctcat gatgcgaaca accaaatcta 28621 tccagtggcg tgggcaactg ttcagtctaa gaatgcagac aattggctct ggtttttgaa 28681 tcagctgaag catgacttgg aactaaagga tggcagtggc tatgtggtca tatcagatcg 28741 ttgcaaggtt agtattatct tatgtctaat tgtttaatta tgtatagatg atagtgcttt 28801 atagatgcta atgtgtttgt atttgacagg ggattattag tgctgtcaag aagaagaatc 28861 aagagattaa ggtttgtttc tgttttaggc tgctgtttgg ttagccattt tggtgctcat 28921 gtttgaatgt tgtttctgtt ttaggaggct ggaacatctc agcctacaat ggagctccaa 28981 gaaacaacac atggagctga tacaatcaca ctcactcaga gaagcagtca atgggatcaa 29041 tcggaccagt tggactagtt ctagttctca atcctctttt gtctattttg gaacaatctt 29101 aacaatcttc atcctctttt gtctattttg gatgatgtat tattcggtta atgtttagga 29161 tgttctttat cctcttgagg ctttgtagga tgatgtatta ttcggttaat gttttggatg 29221 ttctttatcc ctcttttgta tggattctta tctattctta tgttatgcat cgattcacat 29281 tagacaaaga cacataagac ataacataga tagattcaca aaccctccat cgtttaatct 29341 cgacacttac acaatccaaa acaaagacaa acatagccaa acccctacaa gcttcaaaac 29401 agagactaat ccaatactac aatttcactc caaaacctca acaacaccta cactcttccc 29461 caaagccata caatagatgc aacaatcaca acagcaaccc ctacagtctt caaagctgac 29521 tttgccatga catgtcctac ctagctcttc cacattctcc ttcatggtca agagttcacc 29581 ctccatcttc ttaatcagct cttcctccat ctgcttagcg tttttattaa ggagctccat 29641 attaaccatc acgttcattc tgaggtcact aatattctct gcaactctct tacacttctc 29701 atttaccatc ttaatttcgt ccaagagagc ttcatcaacc catttgaaca agtgctcttc 29761 attttctctc tataaaacag aatttccaat catttactcc ttcaatcaat caaatatatc 29821 acaaaatcga aaggtgtacc ttcattgcta tcgcacaacg gtagaatctc cggtatggat 29881 tctcctttgt tttcgatgtg aatgtaatga tctcttctcc acaccaacat ttcgagggga 29941 caccaggagg acagacacga aatcctgaac tggaagcgga tgagtagtag ctgttggtgc 30001 
tcattttctt ataaaattgt cgaatgaaga accctaatcg actaaatcga ttttgcagaa 30061 gaagaagaat gaaaacgacg tcgttttctt tctctttttg atgagttaaa cgaaatgaaa 30121 acgacgtcgt tttcttaaca ggtctgagta ttagtcgggt taatcgcaat cgcggtacag 30181 taaacgcgga tttgtggtta atctcggtta aatgaatggt tcaattggaa atgtatgttt 30241 aagcatgaat ccgaaagtag atgatgtaat tttaggaata tgaaagttga tggtggaaat 30301 ttaaggatcc aaaagtccga aggtgaaaac tgagaatcga aatagtttat ggctcgaaaa 30361 tcagccaacc cattttcttt taataagtct gtggatgtgt tggcaaaatt ggtttaagcg 30421 atttgtcgct ttaatgctta tgagccggta cgtttccgtt gagctgatga ttacagattt 30481 tgattttaga gatttgcaat ttgtgttgaa gaaatgaagt ggtttccgac gtttattacc 30541 atagccttaa gctgttattt ctttgtggag cttcgattag tttttccagc taaaattcca 30601 agaaaataca tgtccgattt tattatcggt ggtgatcttt taccggaatt gttgctctga 30661 cggcgaagga tgttatcgga atactttggt tactcatcta acacgtgtta tgttgttgac 30721 acgttcttaa ggatggagtc aaattgaagt tttcttcttg catcgacatt tctcttttgg 30781 gtctgataga tcttgttaac atctgtaatc gggcttttga agtccatttt gtatgtttgg 30841 tttgtttaat acaatatatt acacttatgg gaaaaaaaac aaaaaaaaaa acagacgtgt 30901 tacatatata ttttcaatct atattttctg tgaaaaaatg gcttccacca attcagtttc 30961 tcagttaata gtagtaatcc atttcgtata gtgaacacca gtcaaggctt gtggtacagt 31021 tcttgctatt ctacagtgta ccgacgggtt tacacatgac atcaaatagt ttctaaccaa 31081 cttccggtac tcccaaaccg ggatttctct cagatcagaa gctacaactc tcatctcagg 31141 aggtttcatt aagcttcgtc agcacaagta aattcaaagg atttcttcta ctccttctct 31201 ctgtctgcgt ctctgtatac aaaacccagc agcttttctt ttgttctggc ccttgcaaga 31261 agtttgtctt tcttctcgtc catctaacac gtacagctca agccttgggg ttgatactat 31321 gttctgaaag aatattgttt cttgagagct tcagagatgt ctgagaatct tagtggaaga 31381 cattactttc tttatcagtt gaagaaataa ggtgaatcta acaccacttt taaatccaaa 31441 tctcctcgtt ctggcagagg aggaaagtct aaatgaggaa acgattgctc aaagctctcc 31501 agcaactgtt aaaaagggaa caaaaaacaa actcagttca tgaagcagtc ttctacatgc 31561 catatatact gaaccatttg aacatatgac acttacattc tctagtatcg gttgtgaata 31621 cagctcaaca aacttttccc gtagtattat gtttagttta tccacatcac 
acgcatgcgt 31681 ccaaaaagag tcatgaactc ctgttagaat aatttttaca tgccatgtgt taattgacac 31741 aaagaccaag gttaagaaga agatggaggt ccaaaagatt taacttgtga aaaaaaggac 31801 agaacctgca aaacacacgc ctgctctttt acaggcaacc gcagtcatca tcatatgaga 31861 cccatccagg gagtgaataa aatttggagg aaaagctgtc ctttgtcgcc tcacaatgac 31921 ctacaaaagc ccacgaaact cttttaggaa taggcctctg catcgtactc tgtttcctta 31981 cttatctttt gtttctatgt aaaaccaagg taaagtggaa gcagacctga tcagtttcat 32041 gctgaagcga tagggtctgg agggatgtct ttacctgcat tcattttgat agaaagtgaa 32101 aacagtcaca ttcagattca acagacgaga gtaatggtgt cagtgataac ttttaggaag 32161 acttacgagt tttgttccca tttggtggta aggttgtaca acaggaagac ccaatggggt 32221 tgtccatcga actgtttcat tttctgaagc aataatctgc tcatgaaaat tattaaggat 32281 tattaaaagt ccaaaagcat atttaattga cgctggaaaa aagtctacac accttcgcac 32341 attcaccaaa ccaacgcatg atggcgcgtg cagcttgaaa catctcatct atagcagcta 32401 atgttacctg ccccaaacat aacaaaacaa aagattatcc cctcatagca tttttacttt 32461 tgagaagttt ccaacgtaca tgagaactag aaagtagtaa ttaatcattt gttaaagcac 32521 acgcgttctt tgaagattag atgaaggaaa ttgccaaatt aagattttct agtgccagtg 32581 ctttatctgt tgctaatata cctttgctgc atagcaagca gccccaaaaa cttctttttc 32641 atcaccaaaa tcacttcgtt ccttcaacct tctctttatt tgatcccgag cgccaatgta 32701 ggtgacacca tagactgatg tcataaccgt ctgctttaca agcttacgat ccacctacag 32761 attttgaaca accgtcaaca agttgtatca gctgaccagt tatgtgaatg agtaaacaat 32821 gtacccaaac aaaacagata tcaggcaact tgctcacatg ccaatactac tacatataca 32881 ttagaacggt tcattaaaat ttattggtga tttaacaatg ttttttgagg atgtaatgta 32941 tgagatttct actccactat gcaagcacgg aagcggaaac cgagaaacac aaggttagga 33001 caatggaaaa agaatgaaaa gaatttcata cggacctggt taagtaattt tcttgcacgc 33061 aatgcctctg gaaaaacttc aggatctctg tctgcatctc ggcgcataat atcaagaacc 33121 ctgcaggtat atgcatatga aattttgaat ggacatttat ttactcgatt aaacttaagt 33181 tagctttaag cagtagatgc agagcttacc tggtagctat tcctgaataa acatctgccg 33241 gcttctcacc tgcaactaga ttaacagctt ctgctcctaa ctgataatca tcttataaaa 33301 ccataatcag taaagaaaga tgacaagcag 
ccaacaaaaa gaaccataga agattaaaaa 33361 caagagaatt actgtgtctc tcccaagagc ggcataatgc tgtaaaccat tgcaggaacc 33421 atcctggcca gaaccaaagt gaaggaaggg aagttcattt gaagaaacac acagattgaa 33481 cgagaaacta caacgtaaag tcaaaaaaaa attgaagcct tcaggagaag atgccaaagt 33541 ggcaacttct accagaaagt acctgatgta taggaatatg tgacagaact gtctctgggg 33601 atgggcttct cagagcttca gtcagactta tgcagacagc caagcactga aatgggtctt 33661 cagcctgcag ccaccatctg cttccttcaa gtggtctgtc tgccgaatca aatatgtcat 33721 ccaagtgatt ttcagtgaaa gctagccgtc catcaagtga taacttatct acaccaccag 33781 catacaagtt tgctaagtgt atcttcagcc agcgtaagcc tgaaattccc ataggccttc 33841 cctcagcaaa ctccaaaaca ccccgacaca aatcagagcc aagatgattt aagtgtgggg 33901 gcatgggata tgcacgaccc cggaagtcca tattgtgggg atagtaaaaa gcttcctcat 33961 ctttcatttt ccgtgctacc tgctcacatt taaacagtta tgaacagaac tctcaaactg 34021 gcacattact ttaataacaa ggcggataaa gctcaccgaa agcttgagtt ctgtgtcaca 34081 tcgctgagaa tgtctctcgc tgttcacctt tttagcagat ttgacttccc acttccattt 34141 cttaagaatg ccctcatctt cagtatccgg cttttctggt aaaggaacct gcagtttgaa 34201 attaaagaat taaagaggcc tcgtgatgca caagggatag aaaatatgtc tgcgccaaca 34261 ataactaatg tcaaactgtt gtacagatga aaaacagtag ccattataag atcaaagtaa 34321 cacatggagt aactcacatc actccgatcc accatatcag caacacatcc gccactgctc 34381 catatcctat ctacaaccgt taagactcgc ttatttactc tccatttagt acttccaagc 34441 gtatccaggg cctaaaaatg tccataatat atttacagaa attaaaactg taactgattc 34501 ccacataacg cacgtgaaga aacattaatc ttctacctca aagactggtt gtagttgtcc 34561 tttaggtgcg ctcttaagtg cctctctttg ttgcttggct ccatgagttt tcattatata 34621 agacgtcaag aacaagtaag cacctttgtc atatctgcga aacacaaaat aaaagtgaaa 34681 tgacttacat aagtttcacc aaaagtaaag tattacaagc atcttatgaa gaaatgttga 34741 cagttctcga agctatacaa atctaaattt cctttctctt aaacttagat agataagctt 34801 taaggtttct gtcttctctg agtaactaag ctggccacaa aagaatgtac atttttaatc 34861 atactgaatt cattttggaa tttatgtttg agataccatg caatgcaatt atttaatcat 34921 ttggggtaag tatacagcaa agaaagctca tttaccccga ccatttgaga ggaggaacca 34981 gcattggcat 
gtatggcatc actgcatatc tcccctgcac cacaaagaaa aaagcgacat 35041 tcacatttcc aacaatatat agattagatg tctcagttat accttagtct tcaaatccca 35101 gaaatattta atctaattta caggggctac ccaagtcaaa ccctctttag atataacatt 35161 cgacaaaatg accataataa atgactccga gctctgaaaa agagtcaaca ctatttttgt 35221 cgattttggg ttaaatttgt tctgataaca aacaaaacgc gaatagatca cctacacttt 35281 tttccaagcc tttgcggacc aaagggtcac actcgattac accatatttt ctcccagaat 35341 tcctgttgta caagaaagac tggaatatga gcctatgaca ttcagggcaa gaagataagc 35401 ctagttgaaa gtcatagacc gaaatttctc acatgcttcc tttcgctacc ttgaaggtat 35461 gcacaaatgc aggtcggaca tcaggcagat cattatcctg ctgatcagct ggagactgta 35521 tataagctgt tcttacgagt aactctatta gacgacttcc aacctggaat atatattaga 35581 aaatatcaaa atattagcaa cttatgcagc caacataatc tttctacctc tccaacaatt 35641 ttgcagccaa gcataaacag accttagccc taacatctgc gatccatggt ttcgtatagt 35701 catgtgactg caagatcttt ctaactgctg acaacttctg tttctttatc aactcattga 35761 ccttttttct taacttatct tgttccttca tagaagtttc attttcaact cccccactct 35821 cctcattgtc atcccctttt ttcttcttat ccaagaatgt gcatattctt atctatcaat 35881 tcacatagag aaaaacaaca ccattagaat acaaaccatc caaaacacca tagagaagat 35941 caagacaatc aaggaaccct aaaggctaat aatacaagag aaaatattac ttcacacctc 36001 catataaagg ttgtttttat atgattaatt acaactgtaa catgctaaaa tgaaggagtt 36061 ccattcacaa atagtgaata ttttatcaat atcatgttcc acatattaaa gcaatcggtt 36121 ttgctttaag atcaggctaa cagttccagt atccagctat ggtatacata gagtttaatc 36181 ctcaatggtt taggttcact agtgagggaa acagtttaag agatcaaatt tcccactata 36241 ccggcaaacc aagaaaatta ccggcagatg atatgatata attatataaa caaagtagaa 36301 accagatcaa gatgaagcta cccaacctct tgttcaatgg catcgcctac tgtacacgca 36361 gcatgaacaa ccttaacaca accgttatca ccaccagtca tcaaatgccc catcaacttg 36421 tgcattgtaa taacagatat cttgtcggca ggcaactgat caaggtaatg tgcataagtt 36481 gctttgctct tccctaatct atacaactcc tggtctttag caattgcatc tcttaatggt 36541 tcgaaccaac ccagaaataa agacttcaca taaggcagat taggcgcaag cttttgctca 36601 cacatatctg tcaaaagctc catgtactct gcagcagctc tttcccattc ttcagtctcg 
36661 atcttaactt gtcttctcca taaattctgg aacttggtac gacccattcc aaattggtct 36721 tgcttcttca tccgccacga acgatgactt tctctcttct tctctttctt catctccttc 36781 aacaattcat ctacctcagg ctcttcctca acatctgttg ataagacctc ctcttcggct 36841 acactcgtgt accctctagc caaagtccca ctaaggcacc tctcgctctt ggaaaactcc 36901 tctcgcctcg tgatcccact taaacaacga aaaccctttc cgtgacatgg tgatcccaag 36961 accggaaacc gaaagctcaa atttttcgag aaaatagact ccggagaaga cacaagaagg 37021 ccacgagttt gggaactgac gtttaatcta gcagctgacc tcgaaattgc ttgtttagca 37081 atgtttctcc acatggggaa ggattgggaa actgggtttg gagaagaaat aaagggttta 37141 tgcaatggaa tcaagtgatc gaatacctta gtctgatttg ccaagaacag tggggtttga 37201 gcactggaca tcaatcgaac ttaaaaattg agtaaaatcg ataaaccaga acaataacca 37261 actcgagatt gaataaattc gatcgcatga gcaaagaaaa atggcaaatt gggaagagtt 37321 tcatagacgg aagaaaataa aaacaaatgg tgagtgaagc gtacacgacg aagccgcctt 37381 aaaccctcgc ttaaaaaccc ttgctttctt aagcctctgt tttttttttc tgggttgtct 37441 ctgtttctca catgtcaaac aattaaaaaa taaattaaaa atatctaccc taaattcgac 37501 ccgtatagga ggatcttacc cgacccggga agtatctgat taaggcctct aataacggcc 37561 caaagtctct ttactatgca aaaaggcttg acgaatttat ggagcccaat tagttaaata 37621 tatatataca aaatgaattt tattatagtt tcttctatta tttttaccat ttcatacaac 37681 tttttagtgg aacccaatta atcaacaaca aaaaaaggtg gaacccaatt acaatgccga 37741 atataattaa aatatttaga tttttgtatg aataattcac ttggatttta tttttttggt 37801 caacaatatc aatttgtgtt aataataatt attctagtct attatttgta aataaatttg 37861 gactaaaaat ctatttttct tgggatgtgt cgactagtgg ttgggttgct agttgtgttt 37921 caccaatgat aactcacacc tgattattcg ctccaaattt gatcgtttcc acctcatagt 37981 tttaatatat taccatttaa ccccttaact tttgtctgag aagtaacaag cctgatttgt 38041 ttcctttcct cggaactgaa gctacaaaga aattgtatta ggtctctctc tctatctctc 38101 tcttcttcgt gttactaaaa aggacgaagc ttgttgcata atatgttgag gtaaattact 38161 aattactgat ccaaagttcg aatctttgct ccaactccag gctagctgat tgcgtagctt 38221 ccgattgatt tctacctgag ttttgagttc ctttgtggcc acttcgttgt tcttctgctg 38281 ggttttttgc tcgaggatct gatacttctg tttggtcgat 
gatcgagtga tcttcgttgg 38341 gttttgggga tctaagtcgt ctatatagct aatggtttgg atttgagttt gaatggagcg 38401 tttaggattt tggggattgc taatgggtag tgtggaaaag tcattggatt ctggaaattc 38461 gttggcttgc tctgcatctg ctaagaatgg agacgaagag agtagtactt catcgaagca 38521 agtttcacca ttgaagggtt ctgggtcgag aaatactagt cctttaggtc gagttgggtc 38581 gagaaacacg agtccttcta ggcagaaagt ggtgaagacg aagcctcgtg gtctagagga 38641 agaaacagtt gcttcatttg gtaaacaagt tgttgctgat gtgcagatgg aagatggtat 38701 atgggcaatg cttccagagg atttgctcaa tgagatttta gctagggttc caccgtttat 38761 gatatttcga atccggtctg tttgtaaaaa atggaacttg attcttcagg ataatagttt 38821 tctcaagttt cactcaaatg tgtcatctca tgggccttgt cttctcactt tctggaagaa 38881 ctcgccgcag attccgcaat gctcagtttt tagtttgcca ttgaagacat ggtacaaaat 38941 tccattcacg tttttgcctc catgggcttt ttggttggtt ggttcttcag gtggtctcgt 39001 ttgtttttcg ggtcttgatg gtctaacttt cagaacttta gtatgcaatc ctctgatgca 39061 gagttggagg actctaccga gtatgcacta taaccaacaa aggcaattga ttatggtcgt 39121 ggatcgctca gacaaatcgt tcaaagtcat agccacaagt gatatatacg gggataagtc 39181 acttcctact gaagtttatg attccaaaac tgacaaatgg tccttacatc agataatgcc 39241 tgcggtgaac ttatgctcct cgaaaatggc ttattgtgat tcccggttat atctagaaac 39301 tctttcgcct cttggtttga tgatgtatcg gcttgattca gggcaatggg aacacattcc 39361 agctaaattc ccgagatctt tgttggatgg ttacttagtt gctggaactc agaagagatt 39421 gtttctcgtg ggaaggattg gcctctacag tactctccaa agcatgagaa tatgggagct 39481 tgatcacaca aaggtctctt gggtagagat aagtagaatg ccaccaaagt acttccgagc 39541 acttctgaga ctttcggctg agaggttcga gtgttttgga caagataatt tgatctgctt 39601 tacgtcttgg aatcaaggaa aaggtcttct atacaatgtg gataagaaaa tttggtcttg 39661 gatttccggt tgtgctcttc agtcatgcaa cagccaagtg tgcttttatg agccaagatt 39721 tgatgcatct gtcctctgaa caataagtta tcgtctgtct cacatcattc ttgaaaactt 39781 acaagttcgc cagcaaaaca tgtcagaaat atgaaatcaa agagggtttg atgtgtacct 39841 tcagtgttaa tgaagacctg gtcagcaatg atatgcttca ccaatggtta acaatatcga 39901 ggagaaaaac tgtaagataa acttgtttct agctttctgt aaattagcat tcactcgata 39961 tgaaaacttt ctcaatatcc 
cctgtctcgt ccgcgtgaaa tatatataca cttaatttta 40021 gtttcctgtg ctgaattttc cttttctata tcgtaaattt aagcttctct ttgcaaatag 40081 tgactgctag ttcaagaata tcttgttggc taacacttat gagaatgaga aagttttaag 40141 ttataacatt ggaatataga tgtgatctta aacatttggg caattggtac atacaaatcc 40201 catacttaat gtcttctaac tagaaactag aaaaccgagg agactcttga aatggggagg 40261 taatctagct ttgagataca tttctctaca atttagctaa ttcatagaca ctgatagggt 40321 aggagtagtt tgcagacttg ctgaaggtat tgtgagctat tattttgttg gcagtttctg 40381 ttggatgaaa tgcatcccaa aagacatatt gattcctatc caagcaaggt tgctgcaatg 40441 gaaggcatgt taaggctcct ccatatctcc cgttaccgca gcacgcttcg tttgatacta 40501 caagacctgc aaaaagaaga actaaaactc agtactgata tatagaaata agagtaaacc 40561 cttataaata gggattgagg tcattttacc atatctagaa ggattcacaa ccatatcatg 40621 aaatagatca aagacgtttt gatagacaaa gaaagatcct ggcagagttg tgttgagagt 40681 atttgctaga tctttcagac ggctattgaa cattgaaacc atattgttga tttttgtcac 40741 acacccgctg gtgttgttgc cagttaccat agatagctga ctaggtatgc aacctaatgg 40801 tcctgaccca gctaacacca tttttcttgc acctaagttg tatagtctct gcaaaatgtt 40861 tgtaaaagaa cacagttttc actttctgag tatgagatga atatctgcat cactattgtc 40921 aattttgtac taagagatga tgggagagtc ataaaagatc attgacttac ggatatttga 40981 gctgagagag tcttgatcaa gagatctgca taatcttctc cactgtaggt ttggctggtg 41041 gagtatctct caggcataag gtagttgttg atatagtcat tgcttcctat attgatccca 41101 attatcgatt ttgcaagata ctttctcaga tctgcagggt tttgaaagaa acgccggagg 41161 cgtaactcga tcgtaatctc aaactgcgat atctgtccat taaatgtagt tcttgctccc 41221 tggaaaaaca gaggtaaatg aacctatatg taagaagaag aaaaggtttt aaaatcttta 41281 ttatgaaaga gtttaagttg cttgaccctc acataatgtc gaccggtttc atctaaaatc 41341 ccagctgctg cagatgcgta gttaacccct cgtaaggcat tttgcccaat ggataaagga 41401 gataagtatg gtggcaccaa tggcaagccg aggtacgttg ctacattacg acaaaattgt 41461 gagaaagtta agttaaaaat tgtgaaacca atattatgat ctttagtctt tagctgtacc 41521 tccataatca acaacggtac ggccattgca gaaacgacca gtggggaagc caaaatcaat 41581 tccataaggg aaataattag ctcgtgcaag agttggaatg taattgttgt ttccactatc 41641 
aactaaagaa tctccaaaga caaagaaagc tggagcaaga ggttgatgat ctctagattg 41701 gccattgcct aaccagacaa gctctaacaa cacaagacaa atcaacaaac ttttcatgct 41761 tggttactat ctttatctct ttctctgttt tgagttcttt gagataagct tatgaagaaa 41821 acaacgctac tacatatctt tatggtgata tagaaagaga catggttacg tgacataaga 41881 gaccgtgttt ggttcagagg aactggtcgt ggtaaatgat ctcatggagt gtagtggtta 41941 actgttactg ttctgaaaga tttgtatttt ttctttttct catttactgc ttcttttatt 42001 ttcttactct attcgtcttc tttttgatag agacagcttt aagaaaaagc caacctctaa 42061 acctccttat ttcgtaaata tgtttactag tcattgatgg ttttgctcat agcttctgaa 42121 ttctgattga aatataaaag tgaccactct gttatgtaat gatggttttt cagatgtgtc 42181 taaaccggct tatagtttat gataaccgga tggatgtagc tagtgtaatg ggcccttatt 42241 tgagggctga atcttcgggt ttaggtgaat tttcatgatc catctatgtt tcattagtta 42301 atatcatttg ggattagtat tttgtacttc ctacgaatat taacagctag agtatcattt 42361 gaaaacgtta gttaaatgca tagtaggatt attttttata acaaaagtat taaatttgtg 42421 aaggttttca aggaacggac cagaatcgga gaactcgtcc tcgtgtctca aaaagaatat 42481 atcaatttcg atgtcactac aataataagg tacaatatat ttcattagaa atggtccaac 42541 aaagcatatc atttagcaaa tcgatggtca agggatcttt cttgggtaaa atacaaaaac 42601 aaaaaagatc agacttattt agaaacccta agcaaccaat gttattagtt ttcaatcaag 42661 caactaatca agcttttcat tataaagata gctttgaaat ttatatatca actcggaaat 42721 atttcaagaa aatatgtgtg actgcaaata atcattccca aatgaaacaa gagttgattc 42781 gaaataaatg aaaaagcaat ttcctatcgt gaaaaatgtc tccagttttt cgaaacagtt 42841 atgtggaata tggatgatgt tttagtttct caataaaata cttctttagt taataaacct 42901 cctttgcaat taccaaggag aagtatcttt ttgtcggaaa accctagcta gcctcttgtt 42961 tctacaaatt ctataacatc cctatatttg tataatatcg aaatcaaggt caagtattga 43021 agaactgata aatactttac aaaagcaacg caactacttt tcagtacaaa cgaccaagga 43081 tataatatga agaagacgag tttgaagtta atgaccctcg ttttagggtt ttgcttcgtc 43141 atttatcttc ttcaagggcc tcgaggtttg aatctactca ttcttttgat ggtcatatga 43201 tgttgatgtt gatgatgatg atgatgatga agatgcatat acatatactg tttacttgca 43261 tggttctaat atccttgaat atgatatatg tttcttctct tcgtacatgc 
atttatgtaa 43321 ttgttgcatt ctcacaagat ccatcgtttt agaataaaac tttaaatatg ttagtgagtt 43381 acaaatttta ttttaagacc tgtagtctta acaattatat ttataaatgt aggcggttcg 43441 agaaatggag atctcttgat agcacgaaag gtaatgacac atatcgaaag ttttgagtta 43501 aattaatatt tggttttatg tttgatcatt ttcattttag gattcagtta tttgtcagag 43561 aatcaacgtt gattcagcta tccatattgg aacaatttta aaaatgaatc atcatcaata 43621 tcttcttctt agatttggtc tattttccgc agttgatatc tctggaaccg attgaaacaa 43681 aaaatgcagc gagatcgttg aaagattcaa tatcaacaga tttagaggaa gaggttgatc 43741 gtctgatgga gcacgaatat ccttcaccag taaagccgag gaaaagaact ccggttcaca 43801 atggcgtgcg taatcgtcac taattgaccc ttgttctatt gttctttcaa ttagtagttt 43861 aataagtttg gtaaaatcat aaaatggacg tatcctgtaa tgctgaatat tagtatatta 43921 cttatacaaa gactttatca actatttgga gtgcttgtat tttttgagat ttaaaaaaaa 43981 atctcattgt tttggagata tttaaaattt tcgaaattag tatattccac ttacatatac 44041 ttgtatataa atattgtttt atcattcttt gtacttctta aatagatcgg aaaactagat 44101 cataacccgt gttatacagc acggacatta atattttgta taaattaaat tgtatacttt 44161 gtttgaatta cataagaaat agtatagtat aaaagtgatg ccataatatt tttcaatgtt 44221 ttactattta catataatgc attagattga ctttgtttaa aaatgttatt gttataataa 44281 atacaacata tgtagtttta agtaaatgtg taaaatttgt atagaagtta ttttggtaag 44341 aaacatgttt cctttgtatt agaggatcta atttttggtc taatttagaa ttagtttttg 44401 tatagtattt tggttatatt tgtggaccta attggggcct atattagatt atctttaata 44461 gggctgacga aaaaacggac ccgataacca actcaatact cgaaccaaaa aatcgggtta 44521 gggcgggtta aaattttagg aaatttcctt attgggtaga gttttagtaa acccgtggat 44581 atccgattgg accggaaatt acccgttatc taaaaagagt attcaaaaac ccaaacatta 44641 atttaatatc caaaatatta attatatgat attattttat ttgattttaa atatatagta 44701 aaatgcgagt tgtatatgtt ttcttgatat tatttatatt gtttagtgtt taaaattata 44761 cacttgtatt ttgattgtta attttagagt ttcacctgta atataccatc ttatattaat 44821 atcgatttaa acccgtcaat tctaggattt tccagcttgt attaaaaatt gaatcacatc 44881 atacacataa aaaaatctaa tatgttatta attattgttg tatataagat tataaattct 44941 taaaataata tgcatgaaat tgaatataat 
atttaaatta tgacccagta cttagtaata 45001 aattttctta aatctatttt tgacccgtta taatattttt tcatgtattg aacagtttat 45061 attcgttttt aaaagtttaa attatggcat atgcgaaaaa actctaatta tttttttata 45121 acgatgatat tattttttcg caaaaataga atcatataaa gatgagaggt gaactataat 45181 aattaataaa aaattaatat gataatttag atatcaaatc taatttgttg attttaattg 45241 gttaattttt tggaaattaa taatgtattt cattttttaa tgaaatttaa ttaattaaat 45301 tagtatttga ctttttaatt tttaaagaga tgaattaatt tactctttaa attttatttc 45361 taatggcata cctatgtaat tacttaaaaa aataaggtta tatttaaaat gtatttccca 45421 aataatatag taggataaaa gtgttttgtc atggaagtag tgttaattac tttgaattac 45481 atttcacacg agaatacgtt aacaaaaaca tagttttatg acatttctct aacttttttg 45541 gaattaaatg tggacataca ttacttgtat atattttgat gaaaaaaata aagcaaacca 45601 agttggattt ggtcaagtcg tcagttctga actatatcga ataagaaaac tcaattattt 45661 attatcattt tgcataccaa cttgaagtac aacttttttc ttcattttta tgtctttggg 45721 tcatgcttca aaagttcaaa tcattcacct aaacaaataa aaaggtaata taaactccat 45781 ggtttagcaa taactctcat tttgtatcaa aattagttaa acaactgtgc aagagagctt 45841 aaaatatcac atgtgaacta aacaaaatag cttaaaggat aaaattatta gaagatacta 45901 ttttagtaaa gataaaatgt gaataatata tgggctaaat ttataatgct aaaaagtatt 45961 aagcctacca aaaagtaata gttacaaatt aaatatttca gaataaaatg attttattaa 46021 acgcgctttt aaagcgaaca aatattcttt cggaccaaaa aaaaaaaagg aggaagaaaa 46081 aaatgttgga gttagctctg tttctttgtt catcagttta ctttatggac gaggtcctca 46141 tgtagatttg actttgtttc aaagcggagg aagaaggaga aggcaacact aaaagatcaa 46201 aaccttagat ctttgcttac gctttcgcct ggttaggtaa gtgtttaaga atgttcaatg 46261 gctttatgct gctcctagga ttttgatttt gcaattctct tggttggtaa aaaaggcttc 46321 tgggtgtttt tccttcttat ttcgagattt gatcgaaccc acttgatttc tatcttgatt 46381 agttggtttt actagtgtta cagaacaatt tgtgattttg ggctttgatt ctctgcgaat 46441 atagttgcgt gttagctcta gggtatcttc tagcaaacca aaagtgggat ctttggttta 46501 tcatggagat ctcgatccaa tttcggttag aattcaaaga aggaaaaaat tgaatctttg 46561 agaagctttt gttattgagt taattcagtg atgatcctat gcttgtgatg tgtgaaaatg 46621 tttttgttgc 
ttgtccttgt cttttgtata aatcactgag tagcttcgtt tagtgtctca 46681 ccaatgattt aaccttggtt tgctagttta ctgaacatgg agtacaatgt ctagatggag 46741 ttggaattgt atgtctagta ataatgtgtg taatgcttca tgatgcaaaa ccttgctttt 46801 gaattttttg taggaacggt ttttcagaaa caaaagattc catttttaga ttcaattccc 46861 ttctggtttg aaaaatgata tcaaaatggt gaatagaagt gatttggtgg tgattggcat 46921 ctcggttggg cttgcacttg gtctcttgct cgctctgctt ttgttcttcg ccataaaatg 46981 gtactatggc cgctctcacc tcaggcgatg cgctaatgaa cagaattccc cgactctacc 47041 tgttcacact gctaaaagag gtgtagtaat ccctgatgat agagcaaaca cagaatcgtc 47101 acagccacct gagaatggag caccaactca acatcagcca tggtggaaca accacaccaa 47161 agatctcact gtatctgcat ccggcatacc tagatataac tacaagtgag tattcagtct 47221 gaagctttag tttgcgtgaa tttgttattg ggttgttgta gcttatggtc tgaaacttct 47281 ttagggatat tcagaaagca acacaaaatt tcacaaccgt tctaggacaa ggatcttttg 47341 gtcctgtcta caaagcggtt atgcccaatg gagaattagc tgcagcgaaa gttcatggct 47401 ctaattcaag tcaaggagac agagagtttc aaaccgaggt aagtgtacag atccactcat 47461 ttccctggtc catgcacaat ggagataaca gagcagtatt ggtttcttgt ttccaggtat 47521 ctttacttgg gagactgcat caccggaatc ttgtgaactt gacaggatac tgtgtcgata 47581 aaagtcaccg gatgttgatc tatgagttca tgagtaatgg aagtttggag aatcttttgt 47641 atggcggtga gtcgtccttt gttttcttct atcaatataa gacgcagcaa gaaattaatc 47701 acaagatttt ctactttact tttcaggtga aggaatgcaa gtcttgaatt gggaagagcg 47761 gcttcaaatc gctcttgaca tctcccacgg cattgaatac cttcacgaag gggccgtacc 47821 gccagttatt caccgtgatc ttaagtcagc aaacattttg ttagatcatt ccatgagagc 47881 taaggtaaga gagcaaatat ttaatacgct tactcaaaac tgtttggtta aaccttataa 47941 aattttggtt tttttttttt cttgttggta taggtcgcgg atttcgggtt gtcgaaagag 48001 atggttttag atagaatgac ttccggattg aagggtactc acggctacat ggatccaaca 48061 tacatttcga ctaacaaata cacgatgaag agcgacattt acagtttcgg tgtcatcatt 48121 cttgagctca ttactgcaat ccatccccaa cagaatctga tggaatacat caacctggta 48181 agttaaaatc cctggtctat gcaaccaaac cgaataaccc aaaaatctga aattaaaacc 48241 caaatcgaac caaaactcag tgagtgattg tgagatcaat gattctgttt ttctgatgat 
48301 caggcttcga tgagtccaga tggtatcgac gaaatactcg atcagaaact agtgggaaac 48361 gcaagcattg aagaagtgag gttactggcg aagattgcaa acaggtgtgt gcataagaca 48421 ccaagaaaaa gaccatctat tggagaagta acacagttca tactaaagat caaacaaagt 48481 cggtctcgag gaagaagaca ggacacgatg tcttcatcgt ttggtgttgg ttatgaggaa 48541 gatctgtcaa gggttatgag caggattaag gatcagcatg ttgagttagg gttattggct 48601 ggtgttaagg aagagaatca tcaagagagg aacattgcaa caacatagta actctttact 48661 ttaggagtaa gtctttttgt acatattacc acatgagtct caaaagtaag atttctccct 48721 cttctgcaaa aagaaaagag tttttgcttg ctgacaacaa caacaaaaat ttgaggttat 48781 gttataaaga ctcttaacaa aagatatatc tagttcttat ttttgggtta agtttttgtg 48841 tacttttcaa tccgaaaaat cattaccaac gttatagcat cattcatacc atcatcatca 48901 taatagtcat aacactaatc taaaaagttt gtaagaagaa ggaattacaa gttcactaag 48961 tactacttca taaacaatag acacctatat tttgaaacca ttaattaata atggtatctt 49021 catcaacttt tttgtgtact tgttgtgctg cgaaagctat gtgtctcggt ggatccaaca 49081 atattaatat tcaatcaata agcaattaaa gggagttgtt gacaaaaaaa caaaacaaag 49141 aaaaactttg gagacatatc gaaacagcta aaacatcatg tcacataact tgtaatagcg 49201 atgtaaaatc ctttaaagaa accaataatc atactgaaat attgttgtat ttgcaaggag 49261 atgattatag tgtcgataag atcaagaacc tctacgaaat ttgatgagta gccgacgaag 49321 tagaatttct tggatcaccc cttgatatat ttaaacccta ttgaatatat cttggagcac 49381 ttgtataatt atcataatag ttattttctc tcttggatca ctcgaataca cttgagacta 49441 catatattca tgctctttaa ataatctgta ttttctcgag atataataat ggtttgatct 49501 tcttaaaata aatttgaaat gaacatcatg ggttagtgag ttagggtcat tttccacttg 49561 ctctatcact tctcaaaatg attgcaaaaa tggaagtgtt ttggttggct ttgaactttt 49621 gaacccaaag tagtttcaac atattaaata ggtgaaaatt tagtgatctt gcactaacca 49681 atatataaga ggagctgatt ccactctagg tgtttaggta ggtagtggag catattgaat 49741 aagttggtga aaattagtga tcttatacta aaccaatata ataatagtca caaatttcat 49801 aactaaaagt tgacaaaaaa taaacataat ttttaaaaag atggatacaa aaagtaataa 49861 aatttcagtt gaaaatctaa atttaagatt ttgtgtagta ttgtattttc ttttctgttt 49921 ttgttattta taatgtcttt atctaggcaa aatcgcttga 
aaatattacg aaatttttat 49981 ttaaataaaa tactctcaga cacctagctg ctataaccga ttctaagaag tagagcaatg 50041 ttaattgagc aatctgataa tgccatttta gctatagagt tttttttatc ttttttatgt 50101 aaaacatatt gggattgatt ttatcaagag agaaaaaaat aatatgaaaa atatatatct 50161 gtaaccaagg tttaattaca caaaatggca cttattattt agctaacaaa actggcaagg 50221 ccatggatgc caaactttac agtgtaagct ttaatcaaca ttctaatctc catccatgga 50281 agctctttca gaaatctcta caccaagaac actaaataac ttcaatctgt aaaactcact 50341 gtaaattcct ctttcaagct gttctctcgc cagtggtgga gtcacattct caaatgactc 50401 tcaatggttc atctaacagc tgcagacatt cttgtgggtt ggcgtagaag tagtcttctt 50461 ccttgggtct attgggaatc actaaccgtc tcttagggct tcccattcga gtagagtgaa 50521 atgccttcac cgtggatgaa aatacatccc aactcaacaa tccttcagta tactcatcta 50581 tcagcttaac taggaacctc ctattcagct ggattgtctt cttgaacccc aagaatctgg 50641 aaaaaatggt gtcagcctcg ggatgataaa tatgggaaga aagaagaaga aagataacgg 50701 tatttcacct gcgatgacct tcaacaactc ttgccatgtt cccatcattg gttggaacaa 50761 atatgtcgct ctcaagtgcc actaggtaat caagtgcagc catttgagat gaatggttcc 50821 ggcaaaaatc cagatcagag gattcgagta gggtttcttt ccggacctga gttagttaag 50881 ataatggagt caatggaaga tataagtttg aaaaaattgc agaggctgaa atgacctact 50941 taccacattt ggaaaagcgt ctgttaaagc cttcatccgc ctttgaccac catagatttc 51001 tccagcagct atgtaaattt gaacatttcg gtcaataccc aacgcggtca gagtgagagc 51061 ggtttcctca ggagttaaag ggcaaaggcc atctttcctc ttcagctcag agtttatgac 51121 tttctctttc caccatggat aagcatatct gcaaacaaag agaatttaca tcaatagatc 51181 catcaaagtc ggaacaaaag caggaagatt tttcggggat tttttatacc tcattcttgt 51241 tagttcttct tcttcctcgg ggttgcaacc atgtgaacaa ccagaaaatg ctaacatatc 51301 catctcatat ctgagatgca ggacaagaaa gggacctttc tctctcagaa tcttgactac 51361 tcgtctacct aattcctcaa tctgaggagt aaacttaagc ccattgaaat ttactcggca 51421 cctcagcttc tgaacctcca caggcagtcc attattagca agtcgggtat ccgttctgtt 51481 cagatgtaaa accttatgtt ttttcaccag tggaagaatc tgtatccaga tattatttag 51541 tttgaaaaga attagcaaac tgatttaggt ataaaaacca tagtgaagac taatctcgaa 51601 caaaaagacc aaacttggat 
gactaacctg attttggtag taagacatgt ttgaccaact 51661 aataggaggc atttcgtggt acactccaag ctcaaccctt ttcttaagcc ttggaggtaa 51721 ctctttgagt atccgaactt catctcttaa cgaacttatg aagtgatcca catcgaatat 51781 gtctttaaac tcactgcccc gaagaagaaa acattgtaag agattgttga atcggaaaat 51841 aaaaaaaaac agagttcaca aagaagcttc aagtgagata cataccttgg atcgttccaa 51901 aaagaggtct tgtcaagctc cggcacaata agtgtaacat tcatgtatct tgcaacggtt 51961 accatatcac atatctgtag aaagtagacg aaaaacaaag taagcaaatc tatcactaga 52021 agatgaaacc ttgtaaagag atgttatttc caatggttta gttatcatct tacagctgct 52081 cgcatttgat tgagtcctcc attgcaggaa accataagat aaccattgtt cacatatacc 52141 cctgagaaga aaaaacaaag agcttagttt ccagattcat ctccaagacc aagtaaggta 52201 cattatccga ttcaaccaca aatttcatca atcaatgttc tcaaaagaac ttacttttag 52261 gtggaagagc aattttggta ggaagagacg tcatctctgc ggcagtggac agttgatggt 52321 gattgaaaca agaaggccaa cctttaaaca atcttggtcc ccacatctct cccaaagcca 52381 tcaaatgaac aaaacagctc caaagcaaca atacagtaac agcacgaatc atccataagc 52441 tcatacgagg cctcgaaaca aaagagttct tgagtttctc aactttgctc tctcctaaaa 52501 gcttaacatt catctcccat agtttcctgt ggtaaagaaa cttctccatt ttacacattt 52561 ttctatatca cccaccaacc ccactctcac agaagattct cacagttccc aaaacccctc 52621 agaggaattt catcgaacac ttaaagggta agcctggaaa aactaaccac agcttaaaag 52681 actgcagatt taaaaggaca ggaatgaaac aatataatag gtttggtgct caagctttta 52741 attatcctct ctaataacac tcagagaaaa tgtaatctcc acacaaatct gtgagcatta 52801 atcacacctc gaaaaaccca gatttttagt tttaaagtag cccacgatga tctcgactta 52861 ataatctgac taaaacccag atcttgctct attaaataaa agaagaatct ctctctctct 52921 ctcaggtttt aattaagagt caactttcaa ctctcaattg atggaagctc acacttccag 52981 ttacagaaaa aatgacacag agaaggcttt gaaattcagc aatggcgatc gatcaaagta 53041 gccgtacaaa aaaaaaaatg ggagacgaga gtctccgtgg agaaccggtc ggtgttgacc 53101 ggagaaggaa gaagaaggac gtcggagaaa tagggagaag atttcggtgt ggagaaaaaa 53161 agagacaaga atgtgcaaat ttcagaactt ttttctacca aggaggaaaa aaaaaagaag 53221 cagtctttag gtttcttctt ttttgtcggc aaaaactgga gtgtcattta taaacttttt 53281 
ttttacagag aaatattttt tgtaatattc tttgcaattt aaaatatatt ttttgataca 53341 gagccaaagt agacctttaa ttaaatttat aaaatacatt gtattatttt gctattagtg 53401 gatatgtata gttatataca tacacttata tgtattttgg ataacttaaa caaggtacaa 53461 atttgaattt tggttgacca aatacattca agctattttt taagttttat tttccagagg 53521 gataaattaa tatatttgga agttaaataa cagagtttca gttatatcta aaccacatta 53581 gcaggttgcc cctagactaa ctctgtcatt aagcaaaact aatccacaag ttaacttcca 53641 gaaagtattt gacaagaaaa taaaatcaaa ttacatattt aacatacata aatatgtgtt 53701 aacccaatca ttttcaagat tggttgaaaa acgaaattaa attttatttg agttaactaa 53761 taaaataaat ttatcgataa atggaaatta atacggcaag attcattttc tgtcggttct 53821 ttataaaaat aaaattaaaa aattgttcaa aagggcatat tctagacttg tggatcgctc 53881 gttttgtgac agcaatgaca aagagagatc caaataaata ttgaatattt ggcaaataaa 53941 taaaataaaa aagtaggcct acaatatact atataataat taaatatata tggatggacc 54001 aactctattt attgctcaca gttgtcattt gtcaatctaa aataaaatat agaaaagtat 54061 gtggaagtgt attatgttta ttgattaaaa agaattaaac gataaaaagg tccaaaacat 54121 tagatcaaaa cttaagctag ctttgttacc ttgatttcga ctgaagtatg gtataaatct 54181 ggccccgagc cgactgaact aacattttca gcattatgtt attcgaaatt tcgacagtat 54241 gataatgagt atatttaaaa gaagcattat gtataatttt gagagaagaa aagattgagc 54301 attttttagt atgttttttt tttttagggt gaaaattgga tctaaggata tttataggaa 54361 acaaatggat gaatacataa gaaggaatta atatgtttcg tcatgaacca agtaaccaac 54421 ataaatagat acattttaaa ctttttataa aagcagccat taatagaaga tggtctttta 54481 aaattggatt tgttttattt gcttttcaaa ttttatcaca attttgtctt gttcatagtt 54541 tttaagaaaa atattaacca caactttgtc ttgttcatat tttattttac aaaattggct 54601 aaaatcacat tctatttatt tatgagtttt aaaatataaa ttagattttg gtgatattca 54661 atttataata ttttaataaa ttttcagtag tctctctaga aatatacaag tttttttaat 54721 gtacctctaa atattttaaa atcaagaaaa tcaatctaga tatggttctt tataagaaag 54781 tacacaagcc aacaaacact cgtagtctcg tagatgtttt tatcaagtat atatatatac 54841 attcattata ctattttcaa atcgttggta aatcaactga taattttttt ttcctatcca 54901 ttatatataa aacaaatata aatatgtcgc ctattacatc ataaaatttg 
aggaagtaaa 54961 aaaaaaattg aaatttcaaa cctatcaact aaattttctg atttaacaga tttaatttat 55021 ttttattttt ttagtttaaa aaaactcatt cattaaacat gtgaaagtga ctaaccacga 55081 tgtctagctg tagcggcagt ttaataaagc tttaagcaat gatctcgctc gccggccaag 55141 tttagaaaca atgaaagatc aaattgaatg atttgttttc ttttgtcatt cgtgaagaca 55201 atttaaaaag atacacgtgt agatgtgagg ttatgaacag gttattgttt taaaacacag 55261 cattatatca gccaatatat atcaatatat agacgatgtt cggtccaaaa atctagatca 55321 atctccttca tgtttgctat tcgaaaatga ttgttattta tatcaaatcg tgcgtgtaac 55381 atcgattgat aaatattgga aatttaagta aaggccaatt gtcacgtaat gacttgaaag 55441 catctccaat gtaattcagt atttatttct tttcacttga cgatagaaga atttgattag 55501 agatgagtta cttattgagc gaattaatgt acgttatttt ctatgaccat atacattcca 55561 ccgacatcgg acattacact gctgtcggaa cttaatgatt tatttttcta aatacgaata 55621 catttctcca atggaaaaaa tctaagagtt ttaagatttt tctataactg taattgtcat 55681 ttcaacgaag agtttaagaa gaaaacagag agaaaagcgt gtaaatatct gtagttttaa 55741 tcaaatgggt catttctatt tcagtctata tatttaatca aaattgcaat tcacaaattt 55801 taatgcttac gtattttaag taacattaaa tgaacatctt cgtcctaaaa gttacttgat 55861 attaatagaa aggtaaagta acagaacaaa gcattaacga tatatttata tttcaaaact 55921 tacacacaag tattcagaat taagaaaaga acaaaacaaa taaaggagac gtttctgcga 55981 ttatgttttt gcttgaaata gaacaaacaa acacgtattg gtaaagaaac caataacttt 56041 gaagatagct taaacgcaaa atatatagtg aaaaaaggga aatcaacaat tattatatat 56101 ccccacgaaa ctcggtgtcg gtggctcaat catcttaaca ctctaaactt ttaagcccat 56161 tcgtgcgcat gcgtgtatgt atatacgtgt gcattggatt tgaaattttg gtcgttatcg 56221 agagttctag tagattatgc attttgtgta acaagtgtaa tataattggg accctttatt 56281 actttctata aaacttaatt cttttaacct atagatcacc cgtttgtcaa ttatatcaat 56341 caagggtccc ttattagctg cattttccga tcgataatag agaagatcat ccaatgcttt 56401 acaatcactt ccatgcgacc tcacatacaa ttacttccat tgcatagtat gtagtaaaac 56461 tatatatttt gcttccaatg gatttgatgc ttgtttatgc ttagaacttc caaataatct 56521 tttttaacgt agcgaattga ttttgcattc aaagcaccct tagtaaatcc gtttcctagt 56581 ctagtcccat aattaaggta aaatctacat 
gggcttctca ggcccataat gagaagttta 56641 aaaggctcaa ctcaacaaga tttagttatg ctaaggcctt acaagttaca atcaaacaga 56701 ggatagggac cgtcggattt atctgacttc ctctccttcg attagggttt ctgaattact 56761 actcctctgc caattagggt ttaagcctct ccagcatttt ctttcagctc aaacccaaga 56821 acctcatcgc cgattatttc ttcatcctta ctctatcccg cgccgattaa tctctctcag 56881 gtaaactact tcaattttac tcatagacta attgtacaga gattgataaa tcctatgttg 56941 ttcccattag aatgttcaag ggtttatgat tattcaggat ttcaaatctg aattattgga 57001 caaataaacc ctaatttcgt ataatttgca tatcatcagt accgtccgag atgagaaagc 57061 tcaagtatca cgagaagaag cttataaaga aagtaaattt cttagaatgg aaaagagaag 57121 gtaaccatcg ggaaaacgag attacgtatc gataccacat gggctctcgt gatgattaca 57181 aaaagttggt acctttattt gctcttaaag ctctgttcta cttattcttc ttcttctggt 57241 tcgagtgttt tgtgatgaac aatggtcttt ttctgattct tgagaatgtt ttttcttttg 57301 ggcaggtatt caggattgtg taggatggtg cagaaactga cgaatataat gaaacaaatg 57361 gatccagcag atccttttcg tattcagatg actgatatgc ttttggagaa gctgtgagtt 57421 ttttaaatat atataccagt ttgttcaagt atagagctta ttacaagcaa tggttctttg 57481 ttttctctag atttaggtcg tagaacatga gacttgatca ttttctgcag ttttttacac 57541 cgatcttatc ttagagacct cctgttgttt ttcttgttaa ccacttttag agataattaa 57601 aattgggggt ccttatgctt ctatatagat tttaggaccg ttggtttgaa atccattaga 57661 ggttgaattc atcctggtaa aatgtgtgat ctaatgggtt tctttcagaa tctgtttctt 57721 gaatagcttc tacatagagc taagttacta tataatatag gcttattgta tagggaaaaa 57781 gagccatcct actgaaactt gttgattgcg tagttccact ctgcaatgag aaaacttgtt 57841 ctcttaatgg ctgactgact tgtgtttctt acatgtacgc ttgaaacggc agatataaca 57901 tgggtgttat accgactagg aaaagcttga ctctaactga gcggttatca gtttcatcct 57961 tctgtaggta agagtatatg caagagccta tataggcagt tacttaaacc cttgatagcc 58021 aaatgttctc atttcgtttg cctcatatcg gttctttttt tttatttgca ggcgtaggct 58081 atctactgtg ctggttcacc tgaagtttgc agagcaccac aaagaagctg tgacatacat 58141 agagcaagga cacgttcgtg taggaccaga aacaattact gatccagctt tcttagtaac 58201 tcggaacatg gaagatttca tcacttgggt tgattcatcc aagattaaac gaaaggttct 58261 tgagtacaat 
gacacattgg atgattatga catgcttgct tagctttgag gaagttaaaa 58321 aagactgatg ttttgtaaaa gccaaaaaag gttcactgct actactattg cagggtttgt 58381 ttggattttt gaatatgcaa atgtgctact attcccttgt ctttaatctt actttcgatc 58441 tatttgcaag aagaaacttg aaaacaaatg tcatgaaaat attgaatcta actcatatta 58501 tcgtcaaatt gtataagttg gagccacaga aatggatgaa cataaaatat taaatctaat 58561 tattatttga cacttcaaat tacaaaaccc gtttgattcg atccgacccg acccgaccag 58621 aaaatcgcaa aacctgtgtt tttgctttcc ccggtctctc aaaataaaat tgctcgctta 58681 gaagcttcaa acgcagctga aatggcggtt caagcaaatc agagtgcttc ttttgggttc 58741 agaacagctt caccttctca gaagctctct tcgaaaccca tagctcatat ctctctatcg 58801 acgaagctca aaccctcttc tcgaccatcg ctttcgtgct ctacttggaa tcagggtcaa 58861 attccggcga gacactcctg tatcaatccc ggaatattcg catatccccc ctccaatctt 58921 accttctctc atgaactacc agaatctgaa tctccgccac ttggaaaggt cagtcatcgt 58981 caaaaaactt ctcttggaac aaattggggt tataagcttg ttagacaaag ttgcagcctt 59041 tttttgtcat ttgaacgaaa atgtggtaaa ccaaatgtat tgtctccatc tatttaattg 59101 tggtagaaga ggcagtgtgt gctagtgtta aacctccctt atgttgcttc attgtgaaaa 59161 cttttgtgaa aggaagtaga gatgtatgat tcttcccttg ttatcgtcat cttatattgc 59221 agaagaagat gcgtgtgtta gtgaagccgc tagagaaacc aaaggtggta ctgaagtttg 59281 tatggatgca aaaggacata ggagttgcat tagaccatat gattccagga tttggaacaa 59341 tcccacttag tccatactac ttctggccta ggaaagatgc ttgggaagag ctcaaaactt 59401 tgttagagag caagccttgg atctctgagc ttcatcgtgt cttcctcctt aaccaagcta 59461 cagacatcat caatctttgg caatcaagcg gtggagattt gtcttgattt ggattcacca 59521 gatatatctg tttatcatag gtatcattaa cttttcaaag tctaaagctt ttgtgttgtt 59581 tagtctatca ttagtcctct gtgtaatcat cgaaactcga aaaaacttct atcattaaaa 59641 tggcgaattc aaaacaagaa aacaaaagat catcatcatc atcatgatga acgtttaatc 59701 gaagtactta gacatttgaa tcgatttatt actcatccca catttctcat agaacacttt 59761 gttctccaca ctacaatcta gaatcacctt atagcaaccc attgatttgc aatgatccat 59821 aagaaactca acaactttct tccccagctg tttcccgcga aaccttgaat ccacaacaac 59881 gtcttcaatg tgcccagctt taccgcaatt cctcagaaac ttcttctcta tcatcacact 
59941 acccgtagca gcgatttttc ccgaagtttc ttcttcgatc acgcagatca cgtggtcgtc 60001 accatacgat ctgatttctt cgaatcgccg atcgaattct tcgtctgtta ctgatccggt 60061 gacggttagt tgacctagaa gctcgatgaa tccttttctc ttatcggaga tctccagttt 60121 tcggatcttg aatgtctcag ccatagaatc agatcggata attttgaatt tgagtttttt 60181 tttataagat ccagtcactc actggaactg aatataatca cccatagaaa aacaaatgtg 60241 ataattacga aaaggtactt attcttttag atagttacaa aaagtttccg aatctcttaa 60301 tgaagtcaaa atgggccatt tgttttttaa attgactttt gatgtaacaa aagcagcaag 60361 ataaaaactc catagtaaac aatatttatt gaaatgacaa caagccaaca agtgaaaatg 60421 cagaggaaaa aagtaacaat ttaaagcgga tgatacatat agattacata attggaaatg 60481 taaggaaagg attgctaatt tagggttgat ggtgactaga aaaggaaggt gaaagaggag 60541 ggattcccgg gatgagaggg gccggaggga ttccagggat aagagggacc ggagggatgc 60601 cagggagtcc agggaccgga gggagcggaa cagggaaaga aggtggtggt ggcggtagtg 60661 aaggcggatt cacgataggg acaggtggaa gaaccggaag aggtggaaga gtcggaattg 60721 ttggaatcgg tggcagagta ggagtcgggg gagtaggaag taacggaatt gtagggagag 60781 ttgggggagt agggattagt gggattggag ggagacttgg gggagaagga ataagcgggt 60841 ttggagggag agtaggggtt gggatagaag gaatgagcgg gtttggaggg atgatagatg 60901 gcgggtttaa cgggtttggc ggaaagaaga aattgggttt tagtgcttcg gttttcttgt 60961 ctttcaagga atcagatttc ttgttatgca aggaagctga tttctgaggt cctggtggaa 61021 gaagaggagg aactagagga agaggtaagt ccggaacagg taactttgga acaggtaagt 61081 ttggaagagg aggaacgata gggagatttg ggagagggga aggattagga ggatcttgga 61141 gcggtggagg aaatgaagga tccggtaaaa gaggcttgga accgcggaga ttgataggtt 61201 tttggctaca aatctctggt tggttttcgg gtctgaaagt gaagaaccct gcagagaaaa 61261 ccctagtatt ctctccatga tggtttgatt taaggcgttt gagagaagaa gaagtagcag 61321 acgaagctat agaacagtaa ggctgtgagc tacttagcaa cttcacagag catctcttga 61381 tcttcttcac atgtttgctc acagagaatg gtagcttcac tttgaattcg ccgcgcttgt 61441 ctgttttcac ttcttgtctg aaactcggtt tcgagttctc gtcaatgcat tccactgcaa 61501 ctaaagcacc tgcaacaaga tccacaaaac ttttcataat tctttctaag attttagacc 61561 cacaaacaag aaaaaaccct ataagtttcc aaaacagagt 
gctctgttct agaatccaaa 61621 aaacattatt ttccaagtag tgttaccaag aaaacacgag caagactgca agaggaaaaa 61681 aaaacaaaaa tcagaaacca gacctgagat taagtggttg ggtgatttcg agaaggcgcc 61741 gttaaaacaa gtgtcacagt agacagttcc tactacaact gcagaagaac gtgtcttctt 61801 cataacatgt tgctgtcctt gagacaaacc tccattgata gagattccta agaaaatcat 61861 aagactaaac caaaaccata aggtcgttat tctctccata attatcgact ctatctctca 61921 ctatcggagg aaagaggggg tatatatata ggagtgtggt gtgaagatga gtgagccacg 61981 aaaaacgaag agattgttaa aagatgagtg aagcttatct ctttttgttt tgccatgtta 62041 ataagaatca tgggaaaaat acaaaatcga actacaaagc tagctactat aatgtcttgt 62101 gaccctatct ttctacagct cagtagctga attattgtaa aaaatagtaa gaggtgagaa 62161 aaaagacaac actaatattt tggattcacg ttcgcgtgtc gattagattt gatcgaacag 62221 tacacggtaa aaacacgttc aaagtaaaat accaaacctg ttatatcatt gtttcttgaa 62281 ttgcggggtt gctaaaacta aaaaaattca gacacatatg attttttttt ttttttctga 62341 gttaaaggtt ccgaagatga acatatctga gatacttttg tgttgaaaat atctgttatc 62401 atcatctttc ccttatttac attctgaatt ttggatgaga ctaaaacatg acatgatgtt 62461 ttcatttcca aactgtgaat cactgtctgt catgtttgat gtttatagta tagtctacat 62521 ccatgttttt ttaaagaatg acatttacaa tcattattgt tagaaggata tcctaattgt 62581 caatattaca tcgaaatatt tatttttaga acatccgaga atagatgatt agttactgtg 62641 gtgatgattt tgatagttaa ctaatatcag taacagtcta tcatggacgg atttacatat 62701 ttctaatcgg taatccaaac tagatgtaag aagaggttac agagattcat taattcgcaa 62761 tataattgtt ctaagaaact tatacattgc ttattataca tttgtttata gatttggaga 62821 aacatatagc cagtgaggct aaagttgata actagactta agtaagttat agtttctatt 62881 ttctttaaca gaggaaacaa tacatgcaat ttaagaaagt gaaggcttat gggaatgaga 62941 atggggaagg gaatacagtt ggagagtctc tgtcgattca cgagacacta aggatgtcca 63001 cacaatcata tatacccaat tcttacttct gtcttcttat agagtactac atacatactc 63061 attcttgata agggacaaaa aattaaataa aatgagttaa agaatgtgaa tgttttttaa 63121 tggtattaaa gactctgaat gttcttctcg aattttcata ttttaacttt tgatttttgt 63181 gttttttttt aatcaggaac gccgtatgtg ctaaaaaaaa taccatttca tgagaataaa 63241 caaattctat tagtcgacga 
cataaaaaac aacaaacaaa cactatggtg gattatagat 63301 agactaggga atattgtaac tttataaaaa atattgaaaa aatatgaaaa cattgtaaac 63361 taaaaatcat taatttctag ataatatagg actaaataat aaaatgggcc aagatcaaat 63421 taacccaaac tgtagtttac aacttggttg ttcccattcc ttgatatggg ctaagtgaca 63481 taaaaagccc gactctttat tgtccactaa caattggtct tttaacttct tgatggatta 63541 attgtgggtg tagaagatta ttttgactat ataaaaacag aaaattaact aaatattttc 63601 gatatattaa ataactagat gatagatgat ttttttttgc agtaatttca taaatttagt 63661 ttcataaatt agtttggttg aattcaaatt tagtaaattg tcaaatttga gacatagatg 63721 atagaactaa gattagtaat tagtagcgct atatatgact caatggatgt gaacaagatg 63781 ataatttaaa aaaatgtttt ataaaattct ttttgtccaa tgaacaatag gttaaaacgg 63841 acaaatttac ttttgtgatt taattattgc catgcatttt ggcatatagt cactctgttt 63901 cataatataa gctgtttaat ttttgtcatt taacaatttt aatacgtgta acatttcgtc 63961 atgcaatgtt attttatgtg tagatatcta atttattaaa tgtactttta tttcagttat 64021 aaaagttata cataatctgc gattataaaa ttttatatta atataatgcg ttaaacatta 64081 agtatctgtt tacgataata ccaaaataaa aaacaaggga aaaattccat cccaaatttt 64141 tgcttttgtt acttttgaat agcaaatctc taatgtatta tttttctaaa ataaaatctt 64201 cctatcctaa tgtcatcatc tacaaaatca ttaaaatatg taatgtctga ttaactaaaa 64261 taattttatc aactagtata aaataagaga atcaaccaaa aaagtttggt ttaaattgta 64321 tagagagagt cctccattag cgtccagcaa caacgcccac ccactttcac accaacttga 64381 acacactact cctcctcctt cactttttcc ttcaatctct cttctcgctg acgacttgga 64441 aataaagtga gggaaaatat tcctccaact tcactttcct gagaatctgc aaataaagca 64501 attcaacttc cttttctttc tttctaaatg aaatttcgtt ctttgtctca gtctctctct 64561 ctttcttcgg tcgagtccga tcaaaattga aaacccattg agttcctcgc caaaaggtat 64621 agtcttcgtt atttcttctg taattggata atggggtttg atcgattctg attagtacaa 64681 tcccaaagtg ttgttctttg ttttctggta aattttctct gctttttttg tgttctgttt 64741 taaaacaatt attctgagtt aagcaaagca acttgtggtt ttgttgtgta atgatatcaa 64801 gtgggttatt ttttttatta taagtttctc tgttttattg cgtttgttag ggattatcta 64861 cctaaattag gtttcaactt ttaatcggtt agttgcttgc atttcacttt gaccactttt 64921 
gccctttttt gtatatcccc aattgaagcc tcgagttttc tttttgacca aacacaattt 64981 tgtactgagt ttggtgaatg tttgtttgat gtctcttgtt gggaatctct ctgctttaaa 65041 gttacttact gtcaatgctg gcaataaatc tttagttttt tactattttc ataatataat 65101 ggattgtttt gggggttgtg atgagttttg gatagtaatt gaagaggttt aaaactccct 65161 taaaaaacta cagtccaaac agtgatcaga tcattctaca ggtgaaattt gagagaatga 65221 atcaacgaga gaagctgatt gttggtgaaa tgtttggaga gtttctgtaa atagacgcct 65281 cggatttggc attgttagtg aaggttaaag gagtttccta gaagtttcaa agcaatttgc 65341 ctccaagttc ttgacttata aaacatattg gaactaacat ttcaagaaag aagagaagga 65401 gatgggctgt gtttcttctt gctttggggt cgatgacttt gaggattacc ccaatccaag 65461 tagttctgtt aacagaagct gcccatgccc gagatgtctt gttaacaact tccttaacct 65521 ggtaatgctc tttttgcagt ttttactttt cacctgctac ttatatgaaa taaaggagag 65581 ttcattgaag cttcggattt gatctcttct gcagtatatc tctttattca gaagaggcga 65641 aacccgctct ctcccatcct ccttacaggc tactaatgta tcgatagcta catcaacttc 65701 gtatgataac tttatgtcta atacattcca ttctactcca aggcctctgc cttatgacgc 65761 tgatccaaga tacttccggt caaggcgtga ttcgcttgtt tctagacgcg ataagggttc 65821 aagtcattct catgaggaag ctgagccctt aagaagtgat gctgatgtgg attctgagtc 65881 tttctcagtg gaaggaagca aatgggctaa caagcttatt atctctggtg aagattccaa 65941 agaagagttc tctagatcct ctcggaggat tcttcagtca aggacaatgt ctactagtaa 66001 cgaaggcctg tatataacat ctgatgatga agatgtctgt ccaacgtgtc ttgaaggtat 66061 tcattttcat cttttttttt ctttttcttg gatcaatgca tttcaaagaa gcttagtatc 66121 agattatttc gtgtgttttg tagaatacat atcagagaac ccgaagattg ttacaaagtg 66181 ttctcaccat ttccacctca gttgcattta tgaatggatg gagagaagtg aaaactgccc 66241 tgtttgtgga aaggtaaaca cttttcatgt tcttccaagc cagttttcgg cttctcattt 66301 actagtgttt cagagtcaga ctcagacacc atcacgcgta aatagacgag tgatccagag 66361 atcaatagaa tgaaagatgt acaacaaccc tagaagcaca tgcgtaaaat ctttacttgt 66421 cttgtctatg acaaatatgg ctaatcattc ttcctgaaac tcagaacatt tgaattattt 66481 ctcattcatg ttgactaaag aatatttttt tctctgctgt gaattcgtct catttgcagg 66541 tgatggagtt ccacgaaaca ccgtaacttg ttgccattga tcctgtcttg 
tactgcctag 66601 accggacctg gggagattga caaggcagca gagagatgtg aatatatttt ttaacaatat 66661 atgatattgt atatgatggg gaaatgagga aacgtctctc tgcacgtttg gcttatggag 66721 ttctgaggaa accctcttga gagaaaacaa cagcagcagc agaacagatg cacacaggaa 66781 tctgacatgt ttcttaggat ttgcctttgg cgcagtgttt tgttcaaggc atgaaacttt 66841 ttacatcatt ttgtatgatt acttttaacc ttttatgaat ttatgaatat atatgtgatg 66901 cttgtatttt ggcatatagt ttacattggg tatttgggtc aaggagagat cacattgtaa 66961 cccaaatgtt tctcattaca tgataataaa ccagtgtcat ggttcagtgt taaaacaagt 67021 tccacacatt tgaattctta atataagcaa ggattcccaa aagatacaaa tgtcttgaag 67081 atcctcagga ccattatata tagccactta tgtctacaaa tccaaagtct tgtgtctata 67141 cttttatata tataaaagat cttttgttgg tgagctttaa cacagcacat gactctcaga 67201 gcatccaccc cgggatgtaa ccgtttccct gctgcgcctg agcttgggtt gtcgcagtta 67261 tttgctctga gcataccgga ttatcatacc tgtcatcatc atcatcatca ccaaaacagc 67321 aacttaaaag gttcacacac attgatgatc ctctttacac caagtaacta aagaacctta 67381 tataagtaca atcttagtag tgaagtacac aagctgattt gcagatgatt gtttaaggca 67441 aaggatttac cccatttgca gagttggatt gcattcaaga ggctggtata gtccctgaga 67501 ctgagcttga tgatgcgcgt aggtaacatt ctgttcacca ccttcccatc ctcctcctcc 67561 tcccatatga tgacttctca caccaatcat atcatccagc taaacattta catttcacca 67621 aaagactata atcagaaaca agaaaactag agatgtttcc aaaaatgttt agaaacttaa 67681 actctgtgaa agagattgat cgcaaggcac caaatgcatt tattctgtaa ttataccttc 67741 attgccaaag ctctattggt ttcaagcaac atttgctctt tattttgaag atccgagagc 67801 tggtcaagca tgtactgtgt ctgcaaacaa tcatcatacc accatagtag ttataattaa 67861 acaacacaaa ttaatctcat ttgaatcatt gattccatgt ataaagatac cttgatggac 67921 cgaacttgct tgagagagcc gtccagttga cgctcaagct gctctaactc ctttgaattc 67981 aaaggtccta aatcctcccc aagaagattt ctgcaccaac aaaagccaca cacacctaca 68041 aatatgaatc tacatggata cagaattgag taaatatacg gaggtagaca gatatgtacc 68101 tctgttgacg ttgaaggttc tcatatctac ccttaagctt cagatattct ctgtagctgt 68161 tctgaaatat ttcacgaaaa caaactttta aaaaagatct ttcttgaaat gatatgatag 68221 gaatctaaaa tagtatttga attaagaaca 
cctcaagttc tttggcaggt ttgttgttga 68281 cttcaatgga tccatagctg catttctggt accgatcaag tgtcttgagc atgctgaaaa 68341 taactcaaaa agagagaata agagtttatt aaaaagatca tacttgactt tggagatcta 68401 aacctataaa tcaccataaa aaaaaagctc ttcaagtttt aaagtattct tccatcaaaa 68461 ttgaagattg tttccattat ttatctgatt atatacgagt tgatttggat ttgattgttt 68521 caactttcaa acgtaaagtt tcgtatttat atttacaaat atcaatagtt caatcctatc 68581 atttgcaagc aatatcttac gttttcgttt gattcacaaa ttgtgatttg aaaaaaggaa 68641 ttaatcgcta tattttcttg aatgcatgta agcaaaacta ttttaacata taattaaaaa 68701 tctattaaac attgttttta agaaaaaaca aacaatcaaa gccttcacag atatacacgc 68761 gtttgaagtt ttaaccacca taaagtatga tgaactaaca aatggttaat gcatgtaacc 68821 atatattatc attctacgtt ggatacttgt aattttcata gaaatctgtg aacaactttt 68881 aagtttccaa tcacacagac atatattaat tatataaaac ttttcagatt gcatatcagg 68941 atttggaatc tattcgatta caaaattcac agttctaaat cattgacaac aaaaaaacca 69001 gatcttaaat ttcaagaatt ggaaaaaaac cctagaaaat gaaatcaaga aatctaatta 69061 actcaacttc gtagaaatca agatcgtgtg atctatatat aacaagtcct tgagacaaag 69121 atctaagctc agatctagca aaaactcaat caagaaactg ataaagagtg agagagttgt 69181 ttactttgag gagctgcaaa actcatagag ctttccacgg ttggagaaga tgatgagagc 69241 aacttcagca tcacagagaa cagacaattc ataagctttc ttcaacaaac cgttcctacg 69301 ctttgcaaac gttacttgtc tgttgatttt gttctctatc ctcttcagct ctactcttcc 69361 tcttcccatt tctttttgtt accttaattt ccacttttta tgtctataca caaatatggg 69421 tatgtaatgt ttatctatat acagacacac aaaaacataa gttgggtatc tctttttcag 69481 aggaaagatc tgatcttttt gagtttttat ggtctggaga attgatggta tttggaaaaa 69541 agaaattaac cctaatttca agaacaaaag gtactagctt aaggaagaag aagaggaaac 69601 ttctgtttgt ctgatttttt atttataggg ttattgattc tttttcctct ctttaattgt 69661 gtttggaagt gtgtggaggt tgtgaaatct taatgaaaat tttgttttga atcaatggaa 69721 tctattgaga gcaaaagata agaaatgtgt gagggtgagg ttagggtttg gtgtgttttc 69781 tcatttctag cccatctctc tgcttcttta ttcttcttta ctttcattcc cacgctctct 69841 ttaagcacgt gtgatacagt caccaaataa ttggctcttg tgatacaatt accaattaaa 69901 cttgtatata 
tatgtttata tgttatttaa aggataaatt tttatttcgt attacttact 69961 ttagagtttt cattataggg ttattattta atttgaacat ctatgtgttt ctcctcaggt 70021 agaattcatg tttgtgtata tatgggtgtc tactatttat ggaatattta tacaaaacat 70081 ctaaacatat cattgcttga aaattctatg tttatatcat aatagcaaat tgtttatgtt 70141 tagttatata taacacaaca atagtactct aagaaaaaca gcatcatgca cgtttgatgg 70201 ttgactagta actagacaca cgaggcacaa aggttatcag ttgattaaca tgtgtatgta 70261 tatacggtct ttcacttatc acattacaaa cttgaaagta aaatttgttt agcgatatta 70321 agaaagttta tatatagaga cctcgtacta gctggaaatg tatatctaaa atgaaagtac 70381 actccctctt acatacagtt taatatattt tcaacataaa acatatgtac atagtactag 70441 cttatttttt aaccaaaact tttgggtttg atagtttaac aactcggtca agaatctcat 70501 attttcttac ttcaaataat tgccaatata ttctagttct ttttgttttt aaattatacc 70561 tggtaatttt ctaacgagag tctcataatt gcaagttgta gtagtcttgc agctggatat 70621 gaagcgcgtg aattaaaact aaataatact tacaataaat gaaagcaacg tatcatatca 70681 ttttctagtt gttttcagaa gccgcagaaa agcatcttct accttaaaat gggtttccat 70741 gtgtcgatgc cttattggat catactttct tactttttaa ttcagtttaa aataaattga 70801 aaaatgagat aaacatttaa gtggttcaat tatcataaaa aacactaaaa ctgataaaat 70861 agccacataa tataggtcat atcaaataaa tgagaaataa tatttgtatg cctaaatcaa 70921 gttatccatg tttgaaggtg tttgcagtga ggtatatatg tttttgtcta ataaagatta 70981 cagttatgat tatgtaacat aaattagata atatttacag ctcgcgatac actacaattg 71041 aaacgtggtt aattagtgaa actgaaattt gaagaagagg atgaaatatg aatgatattc 71101 gttaaaatga tctcttagat tgtatagtca ttgtccccca ttgccaaaag atcaaccact 71161 tacataaagt aaaacgcatc tttagttaaa agctaaccaa aagagtggaa catatttatt 71221 tttggattgc atttgcgaat ttcatcaata tatggttttg ttcaattggg aaatttaaat 71281 aactccacta aaatgtacaa agtaattttt tttcgacaaa gaatctatga agctaaaaaa 71341 tctacccata agtcactttt agcagaaata ccaccatata gacaagggat tattgttgat 71401 tttgccattt tttatagaga tcttttgaaa ttgccatttt tactaaaatc ttacggataa 71461 gccattttct aagatttaaa aatatatgga tttcggactt tttttggtaa aaatacgtag 71521 tttacacaaa atactaaacc ctaagaacaa ccacaaccga gtaaacccaa acataatgaa 
71581 taaaacgtat attcattcac gtaacctgaa aaatggcaaa attcacaaac tattaaaaaa 71641 agggcaaaat ccacaaatgc catcccaaaa aatggcattt tcataattga ttctatagac 71701 aattattttt ctgcctatta actttatttg ttttgataaa gataaccaac ctactatcag 71761 ttatggttga ttttttttat tcatattaaa aaaaacaaga aaacgaagta actttattta 71821 agttaaacaa aaattcaaga aaaatgtgtg tttgcaccaa ttatggtaca tcaactgcca 71881 aaaacccacc tatagttgct cactgaaaga aaggtccagc agagcatttg gttactagat 71941 gaatttctcc ggtttaatga cgataaaacc cggttaaagt taacctactc agtaattcat 72001 aagcatccgg tacatctaag caaagtgcct tatttgcagt aacagatcaa gaattggaat 72061 aagagacttc tcgaataaac aagacattct tcaactttaa cattaccatt accaatggac 72121 cgattacact tatcatagtg tcaagaagat gaaacaacta taaaataccg agaaaatggt 72181 caagattttg agcattcagg gcaagggaaa caacctttgg cttggcaatt atcgcaagcc 72241 actgatcttg tgtttgatga gtcactgaaa ccaaagaatc ctcctgattt gattgttcct 72301 gttcctttgc aacggccaca agctacattc ttaagatctc cacattttcg acacatgcca 72361 agagatctgc aaaacttgtg tttagaaaca tatgatgatg gcaagttata gattcaaaaa 72421 caagtctaat gtcattcagc catggacata caacactcta gaacatttct tatagccgtc 72481 agcaaccaca atgaacagtc ttaagatcat gtctcattac aaaggtcaga ttttcagaat 72541 ccataatcaa gatcaagaat gattcaatac ctgaatattt caaaaatcaa aagatctttc 72601 agtgaggaat acagaatttc aggaaaacca ttactttcac atggtgtgaa ctcagaaaaa 72661 aaatgtaatc cattggagat tgaaactaaa aacataactc aacaatggat tttttttttc 72721 attcagaatc tatccttcca aactacagat cacaaagaaa ctcttaagga tgaaaaaata 72781 gtttaaatca aaccctcttg gattaaaatc aagcagctca ttgcaattct aagcagctac 72841 tacaagcatc atcattatca cataacaaca aaacaaatcc acacggttta aatccaaaac 72901 aggatataac acgacatggg ttccactcag aacatgaaag attgagcaaa aagttggatt 72961 ttgtggaatt tcagaaatct tatgtaatga acttgaaaag ctttattacc ttctctgaga 73021 agcagcgata aaagcatcaa ctttaggagc agcgatggtg gcaccaagaa gcagagcccc 73081 aactgccaaa cccgcaactt cgtttgacgt taaactctgc atcgtcgtct tcctttttct 73141 cttctcttct ctcgttcctt cttcttaact tgatgataat gatatcctga atttatttca 73201 aacgggtaaa tattttacat atctagagct tatcagatat 
ggcctattag ctcagttggt 73261 tagagcgtcg tgctaataac gcgaaggtca caggttcgat ccctgtatag gccacttttt 73321 tgtttttcct tcttttaata atttcaattt tgaccccgta tcttttgatt tcaccaattt 73381 gtatcccaaa gttaaataaa tgaatttctc caaacgattt gtagggttta aacaacttag 73441 tatactccgt tactcgcttc caaacccgta cggcctcttt agagtttcac ctttgtgtct 73501 ctctgaaact gcaacaaagc ttgaagcgat agggtttatt caagcttcac tcaaaccagt 73561 cgctaaattt cgacatcttt attcacatat ctatagattc aagctttgtg atttgccatt 73621 cgggtttcag attcgtgttg gtaattcaat ttctgaatcc tctgctccta aagagaagaa 73681 tctgagatca gcaatggaaa cggatctcaa tgattatact gtcattaagg aaggagaagc 73741 tgaggttctt atgcacaaga agaaccaagt cttcttcaac aaagctcagg tttgctatat 73801 gatttgattt tctgcaaagg ttcttcctct ttggtgataa ttgattgaat tatgtcaagg 73861 gcaatcttag gattgatcaa gtattagttt tgagcatttt atcttggttt aagatttcaa 73921 atttagtgtt tttttcttgt tatggattag tgtcatgctt gtgttagtga gaaatttgaa 73981 ggcaactgat cccaaatcct tcttagcaat tatgttctct tgtgtgaatc catatggata 74041 taagtgtgag ctaaactaat tgcgtgttgt tttatggtta ggttaacaat agggacatgt 74101 ctattgctgt gctaagggca tttataatca aacgcaagca agagcatgag gctatgttat 74161 ctaaaagagc tagatcatct gggaaagtgg ttgagaagga tgtctctgaa acttccaagg 74221 aagaaactcc tactgaaaac ggtgacgata atggaaaaac caatggagaa cacgaagtaa 74281 caactcagga tggaccaaag gaagctgcta agaccgcata tgaatctgca cgaagggaac 74341 tcaaaccacc aagagtgctt gaggtttacc tatattcttt cctttatgac tgtaatagct 74401 gtacttttgt tttcttattt cgttgcttat tatgcgtttc tagttgataa ctttttcttg 74461 tgttatttat tttataggcc ctgtcagctt cagggctacg ggctttgagg tatgctcgtg 74521 aagttgaagg aattggtcaa gttgtggctt tagataatga cccaggtatt ttttccttct 74581 caacgtttgt gcttgtgaag aggttcgaga ctctactttt ctttcgtcct aacatcccag 74641 aataatatag tatttccatt ttcctattaa agcatcggtt gaagcctgcc agagaaacat 74701 taagttcaat ggtttgatgt ctacttcaaa ggtggagtca catcttactg atgctcgtgt 74761 tcacatgctc agccacccaa aagattttga tgtggtaagt atccaatttt tgcaaaatct 74821 tttcatctta ttgatcacat aactaacttg gttcagtgat gtcttaggtt gatcttgacc 74881 catatggtgc gccgtctatt 
ttccttgact cagctgttca atcagttgcc gatggtggtt 74941 tgctgatgtg tacagcaact gacatggcag tgttatgtgg agctaacggc gaggtctgct 75001 attccaagtg agattttact ccttcattac cttattctga aaacaaatat ctttgatttt 75061 gtctgaatgt cttaaatctg tctgttgtta gatatggctc ttatccactg aaagggaaat 75121 attgtcatga gatggctttg aggatcctcc tcgccagcat cgaggtaatt ttttgctacc 75181 gttgaattga ttcactctcg ccaaatcttc caggctttga ttgttttaaa gatgctcgac 75241 tggtatttct tttctaaaaa gcttttgttt ttgatatatt gactttggtc atatttttat 75301 ttgcagagcc atgcaaatcg ctacaagcgg tatattgttc ctgttctatc ggtccaaatg 75361 gatttctacg ttcgtgtttt tgtccgcgtc tacacgtgag ttttcctttc ttttcaaact 75421 gttcctctca attaaaggga agtaaagatt gaagctttat gaaatgcagt tcggcgagtg 75481 caatgaagaa tactccacta aagctctcat atgtctacca atgcattggt tgtgattcct 75541 ttcatcttca gtctgttgga agatctctcc ctaaggtttt gtggctgtct cgttgcaatg 75601 ttagtttagt agtttttgaa agctgccttt tctgataatc ctcttctata atttcttaca 75661 gaataacagt gtgagatatc tacctggagt tggtcctgtt gttcctcagg attgcactca 75721 ttgtggaaag aaatataata tgggtggacc aatatggtct gctccaatcc acgatcaaga 75781 atgggtgaat tcgatactaa acggtgttaa atccatgaaa gacagatatc ctgcttacga 75841 ccgaatttgc gctgttctta ccacaatctc agaggcaaaa ctctcctcca cactctgata 75901 gctttagttt ttggttttga tcaatttctc taaatcccta aatgtttggc ttgtctccag 75961 gaattgccag atgttcctct ctttttgagc ctgcatagtc tctctgcaac gctaaaatgt 76021 acttcaccat cagctgcttt gtttcgatca gcggtaatca atgcaaagta ccgtgtctct 76081 gggtcccatg tgaacccgct tgggattaaa actgatgctc caatggagat tatctgggac 76141 atcatgcggt gctgggtaaa ccaaccattt aagtctaaag gcacaagctt tgaatgtcat 76201 ttcttttcct cattgataat gaatgaactc gtttgtttga ttgtaggtga aaaatcatcc 76261 cataaagccg caatcacctg aacatcctgg aagtgtgatt ctatctaaag aaccatctca 76321 tcaggtctct ctttcactaa gccattgttg taatgcttca ttcattgttt atatatttaa 76381 agttccaaac ttgaatcttt aaacaccgct ttctttcgga caggctgact tttcgcgcca 76441 cgtcggttcg cttagtaaag cacaggcgaa gaaagtagcc cggtttctac caaacccaga 76501 gaagcattgg ggtcccaaga taagggcagg tcgtacaatc acaagcaaac atgtatcgct 76561 
tcttggtcat gaagcagtaa acggtcatct caataacaat cataaagaag caggagacga 76621 agaagaagaa gaagaagaag aagagcccga agaagatatc atcgagggtg agccagagct 76681 caaacgccaa aagacaacag aagattttgc ctcaacatca tagggcgaat gtttacacaa 76741 gaattttata tttttgtttg caaaagattt tctcttttct tttcttaaac atttgttatc 76801 acagttctgc gtttttgatc ctagcacttc accatttttg cagacttcta caacaaacaa 76861 cttataagtc tctattggat tcttcagatc tgttagagac tatttaaaaa aaccttgtaa 76921 tgtgtaagtt ttcatcagta acacttttct atgaatcttt catcattatt tccagttttt 76981 attggcaata tgattgatac ataaacgagg aaaacaataa tcatggagta tcattaactt 77041 ttgatcctaa gcagcaaaca aattgtataa caaacaagtt cattaacgat aaaacagaaa 77101 gaagagcaaa taagtcgttg tagtatagtg gtaagtattc ccgcctgtca cgcgggtgac 77161 ccgggttcga tccccggcaa cggcgttact tttttctttt tccttttttc cttttcaaag 77221 ttttccacgg cataatcgta aagcccagcc cacgagtgaa agtaacagag acgatacacg 77281 tcataatgtg atcatcttaa aatctttttt tatgtataga gagatacgaa tcttttaacc 77341 ttgtcaaatt ttgtttttct tttctctctg aacccgaaaa ctccagcaat gtcgtcggcg 77401 aaacttttcg gttgttctat taacgttaac gttgaggcag aggaagaaga aggcggtgac 77461 ggtggtagct caaccaacgt ggaggtttct cgatctggta accaaccgga ttgtgaagct 77521 atgagtttct ccaatcagat ggaaatcggc gttcgcaata catattacca gtttctagag 77581 tcgaattcag attctggatc ggattctatg tatgcagaac ctgaattcat cgatttcttc 77641 gatcgagagt cgtacgaggt cgacacggtc cgtgaggttt gtgtaagttc gaatcagagg 77701 gttagtactc caggttattt caatatttgg gatcaggatg tagatttagg gcttggaatc 77761 gggttaggtt caaggtcggg ttcgggtcag ttacctggtg attcaggtgg ggttggggtc 77821 gaagttggta ggggtgttac tcctgttgag tataatctct ttggagagga agctatggtg 77881 gttgatgaag tattggaatg ggagaatttc aataacgcta tccacttggt tcaagaacct 77941 gcctatgcta gtatggaggg agaagaagaa gaagaagaag acgaagtagt aatggaattt 78001 gcagcatcca tttatagtga tgcttgggaa atattattgt acgataacat gacaaactct 78061 gctcctatgg atttggatgt tgaagtttgg ctagatagtg tagatggata tgctcctatg 78121 gattacaatg ctatcatagg acaaatgttt gataacgaga ctggaatcaa aggaactcct 78181 cctgcatcca aaagtgtagt agatggtctt cctgatgtgg agcttaccat 
tgaagaattg 78241 agcagtgtga gcattgtttg cgcgatatgc aaagatgagg ttgtgtttaa ggagaaagtt 78301 aagaggcttc cttgtaagca ttactatcat ggagagtgta taataccttg gttggggata 78361 aggaatactt gcccggtttg tcggcatgag cttcctactg atgatctgga gtatgaaagg 78421 aagaggagag catgaaggcg tggtagtggc gggttggaaa gggagtcgat gctcgggtag 78481 agaaggagta actgtgcgtt atatcttcga taattgtcta atcagtaacg atctgttcct 78541 cgttgatatg atgtacatat tggttctaga taataggaag catacatttt ttagccatat 78601 ctttttgcaa gttatttagg ttcattaatc ttattagact ttgcaggtaa ttgcagtcag 78661 aggttatatt taattgttga tccgactttg attagttcta aatatgattt ttttcggatg 78721 tggttttgtt gctttaacag cgcaagattc tactgtaaaa cgtgaaagaa agtgaaaagc 78781 tcaatcatgt tatttttacc gtttgtgtta tctaggattt gaatcaatta gagatgagat 78841 gttgttgcca tggggctaat ccattgtctg atgtgatgaa tccggcggat ccagagtcaa 78901 gtcatgtgag tctgtgacca ccttgtggtt aaccaaaaac atattaaaac gatggatttg 78961 gtccggttaa acgtggtctt ctgggaatgg gtctcatatg gacctgatct aatccaaagt 79021 acgggccttt tagtccacta aagaatctct agtttaaatt ggtgtaattt ggttttattt 79081 ggaagctaca aatactaaga atttccacat gaacgtgtat ttgggtatcc acacgaccat 79141 actctataca caattttttt tgttcgcatg cgtttgaatt ttgccaatta ggtcaatttt 79201 taattagttg tttaaaagct tttgaattat ttatttatat tttcatgttt cattttcact 79261 aactacttga aatatgaacc gaaagtccat tagcaataag attgttaaac aaaactaaag 79321 ttaaaagtca ttttcactgg cagtggctcc ctcttttctc ttatagtgac tctacatttc 79381 taaagagcca aacttcaaac attgagtgtc gatttttttt aataaaaatt ctttttaata 79441 atcttagatt ttttttaatt tttttttctg gctcttggat acccatttgg ctcaacacta 79501 tttgcttatc ctctcatatg cgataatcat gtctcttgtc tcattcgatg taacaaatac 79561 tccacacaga aaatttctta acttctaaaa ctataagctc ctctatgaaa tcttaatttt 79621 aaaccttgtg cgtcatttac gttgttcaca tcaaccatag agagttgcaa ttaactactt 79681 gcaaatgatc tagtatatgg aatttggcaa gaatttccat gttcctcagt ttagactcat 79741 cttaccaaag ctatatacag tatatcttta aatgactaaa catttgtgac gttttaaaac 79801 tttatttgtg tggatatgtt tcgaagtaca aacggattaa ataagtgaac tgcaatgaaa 79861 aaagtataag aaaatcaaaa taaaacgtaa 
tgtagttgga gtgggtgaat aacatcacat 79921 caaaacaaat tcatcacacc atttgaaatg ttttcattag ataattatta cctcaatcat 79981 tttgggattg acatttgcgc tcgagtcttc agtgacgttc tcccactttg tcttcttgtc 80041 tcttaggatt ctctctcttc tccttaaggg acattaagct ttctagacca aaatgcacat 80101 caactacgta ctctcaacca ataataacta gacaagatta tcaagttaaa aattcaacgt 80161 aaatcatatt ttagactaat aatttcatgg ggtaaacttg tgattgtgtt gtgtgattaa 80221 atttgcatgg gtaaaatttg ttatcttata gttcgttttc ctactcaaat tgtaaaatct 80281 ggctttgata tatacaaaac ataagcacac taattctttt taaaaactaa aaagaactaa 80341 aataaacaaa ggaaaacaca gttaggcaat gataagtggg ttatttgaga attcgtaaaa 80401 ttattagcat aatcataact attatgatta ggactggcaa tatatagttg taaaaattgg 80461 cagtaatagt ggttagtgga gcaagacctt agtaaagttt acaagtatct tagcatattt 80521 gttttggcct ttctccatat ctttctatta aattagagaa atgatggact acggtcgtgc 80581 ctaagataca ataatttaat agatcatgta gttttgtaat atttgattat ctgataaaga 80641 ttatttaaac aagagatcaa atttacaccg caaatattaa taaaaagtta ggtatatata 80701 tatatatata tagttaaaac atatttataa gtattcttaa tttggcaaat tggatagaat 80761 tatttaaagg tacgaaaaca aaataccata gtataaaatg gatatacaat tataatattt 80821 tcataaaaac aaaaggtgta gaattataaa aattatcgta aaaggtgtga tgatacaata 80881 gactttgatt tggtgattgg cttatcctta atccaccaat caggtgtacc atcatgacat 80941 catccatttt tcattaaaaa aacattataa caataaatta atatgttgat gatttttgga 81001 gtttgcttaa caaagaagat cgtggaattt cgaattctca tatggtatta aagattaaga 81061 ggccgatgaa tataatctga ccaaaaaaga gtttcaagaa caattaccaa cgagtcaaaa 81121 tataatatat atgttagaaa aatgtgacaa aagttatatg tttttgtttt agttagaata 81181 gcatgtttag atacgacaac atttacatgg catgtactat ttagattaaa ttacagttac 81241 acaaaattat aaatttctta acagaataga agatctaagg gaattctttg gtaaaaaaaa 81301 tgaaaaatca aggtaggcca atttcaggtg gtagtgtcat tgatgggacc aaaacaatga 81361 aacattcaaa aacactttat agtttatgat ccagtgaaaa ataatactaa aagaatatca 81421 gaaaaggaaa cactaagaaa aagcaaattg gaaaaaagta gaaaatgata aaataagaaa 81481 aagaacttga ggctgtggaa gaagaacaaa ccgtgcaaag gacatgagac tgaaataaag 81541 atgaaataga 
ctttgtctaa atgatatgac ttgatggtgt gtatttgcct ttcgtttgac 81601 taacacagtg atatgcacac tctattttat tctaatttgc catattatgc atcaacaggg 81661 aatttcaaag tttaagtcag tcaaatattt ttaggtgttg tgaatttcac attacaaagt 81721 tataaagtag ttttaaagtc accagatgag aatctatgag atcaacaagt acagccaaat 81781 gaatatccag tctgatgaat taaaaaaaac tatactacac atgttcaaca taaactatag 81841 tgtttcttga atatatgtgg tttcctcaaa tagtaaggtg catcagtaaa acccgaaatt 81901 agttattgaa ataaacaagc tccattattg tataactttc agtcgacacc caaaaagtct 81961 atctaaagag gaccaatatc aatatgcaca tatttttaat taacattgcc tttctatgga 82021 tggattatta taaacccatg catgcctaag tttttggtta tatttatctt aaagaaagat 82081 atgtttttaa tagtagaata gtagcaagaa agaaaagagg agaccaaaaa aacgattttt 82141 tcaattctcc aatcattaat tgtacaaaac tcaaaaacga ttttcctaaa gtttgatatc 82201 gttatataaa aatccagaca tctattctct taatcttgac caactaagag ataacactcg 82261 ctcattttat aacttagtta gttagttaaa tcaatgaatt ttcttttggt tttagtttgt 82321 aatcactaat caatgatcat tatacaagtg tgtgtggttt taaactaaat aaaaggtatt 82381 gttttaagtg gttgttgagt catcggctcc tcaattgtat tatagcctct gccttagcct 82441 ttgtaatgaa catcataaaa cgacaaggca agcctgccta acacatcttg ctttcctagt 82501 gcaagtaagg agtttgtttt tgtaaatagt ggtctttgac catatatcta tatataataa 82561 aagaagagtt gaattaataa ttctgttctt gtatcaatat gatcttcttt gtattgattc 82621 tataagtata aaaacatacc acttgagaca aacaaaaaac cataaatagg gttaaaggat 82681 gaatcatgtg aatagaccat gcatggttcg agtatttcca ttgcgctaag gactgaagac 82741 tggaaatgtt ttcaatgaca agaataaaca agtatttcac atatctctaa gattaattag 82801 tactatatat gatatgggat gattggttaa gagatcatgc ataatcaatg ggttactaag 82861 tactgtagat ttgggttttg tatacaatgt atattgaaac ttcggtatgc catgtttgtg 82921 tgtggaaaaa ataggaaaag attgtcattt tttcggtaaa attctaattt attaattatt 82981 gtattactaa catttatata ttaattatta agtctttagg tttgctttat ttttgtttac 83041 gtctgtctgt gtgatattag tattatagta gaagaccaca tttttttata gtttatctag 83101 gacctaatat atgcagaaat caatataata attaattaaa gaagagaagc aaatcttaaa 83161 ggtcctatag atatataatc ttcttctttt ttctaaagag gaagaagaag agaaattggt 
83221 gattaatttt acgtagatgt tagaactaat tcatttagaa aaagatatat actttataaa 83281 ttgacttatt tggcatcatc gccactttat atattggcta acaaaagcat cgtgtttcga 83341 aactaaaata attttgaaaa ccccacaact aaaaactaca atttttttta tctctgatgt 83401 aagcacataa cattgatcta aagcttgatt ttatccacaa tatatatgta catatggtac 83461 ccaagtcctt atgtgtcatt atcatcataa tgcatgaaca tgatatagaa accttttcca 83521 gttcatgtcc tgcatgaaca tcatgatcat gtttcttgta tgtgtatcgt ttagagatac 83581 cactagatgg ccatggccct gttagtctgc tactcacatg tacacgacaa gatgccacca 83641 cttggttgtt aaactaatcc tataatgaag tagcaccttc acgtgtcatc atatatacac 83701 atccatatat aatttatgtg aatatgatac actatgatct atgcatacaa gccatgaagc 83761 caaggttact ataaaattca agcattcaat tccttcccaa gtctcattct catgcatgga 83821 caaccaatga atatacaata tatgtatggg gacttctaac caacatgttt atcgatcttt 83881 acaacagcaa agtagccata gcagatttac acagcaacgt gtaaatactt ttttaaactt 83941 ctctcagtaa gtcatttaca agaaaaaaaa ttatgtccat agccaatggg ttatgtctta 84001 tatcgacata atctaaatta ttttttaaaa agcaaaaccc tcttaagcca atgtcacgaa 84061 agccaccaat ttgcaaaaaa agttgtggtc cacttgtatg cctttctcag ctgtataaac 84121 actccttctc atccaatctc caccattaaa ctcaccctct ttgagttcta aagagagaga 84181 gagaaattga ccatgcagcc aaattatgat agctcaagtc ttaacaacat gcaacaacaa 84241 gactacttca acttgaacaa ctactacaac aacttaaacc cttcaaccaa taacaacaac 84301 ctcaatatcc tccaataccc tcaaattcaa gaactcaacc tacaatctcc ggtaagcaac 84361 aactccacaa cttccgatga cgcaactgaa gaaatcttcg tcatcaacga gaggaagcaa 84421 agacgtatgg tatctaacag agagtcagca agaagatcaa gaatgagaaa gcaaagacac 84481 ttagatgagc ttctctcaca ggttgcttgg cttcgaagcg agaaccacca gcttttagat 84541 aagcttaacc aagtctccga caacaatgat cttgttattc aagagaactc gagtcttaaa 84601 gaagaaaact tggagcttcg tcaggttatc acatccatga agaagcttgg aggaggcata 84661 catgacaaat attcttctcc gtcgtctatg gatgaattgg atcaagactt ttcttctatc 84721 acagatgatc caagaactca tcatccatca tgagttgttc tgatcgagaa aaacattact 84781 tcaagtttct gcctctaatc ttatctattt tctttctttg gtatcacttt ttaaaccgtt 84841 taactacgta ccaaagtttg ggttttattt gtttttctgt 
tgttttgtca gaagataatt 84901 aaatgatgga tatttgaaga tgatgctgta aacaaagatt tataattata tacataaatt 84961 tggaaaagaa gagtataatt ttataataat tatacgtaga gaaataaaaa gaagacaaac 85021 aactaccacc aaaccctctt cacccaatag agaagccaag tttgatatac ttgaagaaaa 85081 gtaaattatt caaaaggccg cggatcttgt taattattct gcattctcag tgaggatagt 85141 tcgaggagga tctttctgaa atcaatgaga tgattaaaaa tatcattcat ttacttaaaa 85201 ctatccgttg tgcaattatg aagttcgttg tcgaaatgct tagaataact catcgtgaat 85261 gataagatta ttatcttttg ataagataaa gatagaaaat gatcatgcaa caacaaaaaa 85321 ggaaacttcc tgaaagaaga taagcttagt tcggaagact acttgggccg tcccctgaaa 85381 cagaagttta tcgcgcaaga ataattgggc cttatatcag atcctttggc ccagcccgtt 85441 agtattagac aagaccagtt gcgtgcgtca ttgttgaata tttcgaagtt tgatgatcga 85501 tcgtcgctca aaatttaggg cttttcactt cacaatcccg atttggggtt ttctggtaaa 85561 ctcctttaat ctcagctcgt cagacaaaat cgatctggga tgaatttgga tctaattgag 85621 caaaaaattg tgttttttca atctattgaa tttcacatcc ttaaaccctt gcatattcaa 85681 tctattgaat ttcgcatatt tttcctcaaa tttatccaaa atcatcccaa atctattgtg 85741 acgggttgaa agtaagtttc tttatcgaaa atcagtttga atgaatgtta gggtttatat 85801 tttgatggtg actgattccg atttgtataa atgttagggt ttatactggt ttctgactct 85861 ctggtaagta tagagatgat tatatattaa atggtgtttt aaggcgtgtt cagatatagc 85921 tgtgcttgtg ttgttctgtt ttggcatttt tcatttgttc tgtctttact tcttcttagt 85981 cagtctgtgg atttcctttg taatggttat tggttagtta ggaactactc tcaggccttc 86041 ttcacatgac accaaagttt tttgtggatt atgtggagga aattgctatt ttagtatatg 86101 gtcattgaag caagataaga accttagaaa gcagagtgtt ataatttttc tgtcaaactc 86161 atgctgtagt tgcagacttt ttcctacttc agattaagga gaatggtttt ttttttgtaa 86221 gcaagaagtt aaaagagtga gaatctatga aggcccaggg accatgtaaa gtttgggaat 86281 ggttttaaaa tttgatggct ttgagaatcg atctgaacta tccttagatt tttggaaaag 86341 tgcaaaaaag tgcaaagtgc agcatcctta tatcacttta tgaacttttc acgtgaggtc 86401 tcgaatccat cacttaacat taacacatag tataagctat acacttggtt gttattactt 86461 atttcttctt acagttcttg aagccgagaa catatatggc tgatggtttc tacaattagg 86521 aacgtgatat gactcatttt 
agtaagaagt tgtgatttac agctgaggag tctgtgttat 86581 caacttgaat tgtagaatct ttttgttgtc agggaggcgt gaaagtgtca catgttcagg 86641 ttatcgggaa gttattggct ttgaaccttg gcttccctct tagtagttag caaggtgaaa 86701 tctgcgtttt cacttcttga tgatccatat ttcacaaatg gggatctgta ccaattagac 86761 tggttttgtt taactaatgc agagaaagaa gaatactata gttttaatta gccaaaacta 86821 caaacccatt tgcacaacag tagtaatact tcttgaagct aagaatttca ccaaacctaa 86881 cataatctca aactaaaagt aatttacagc ttgcaatcta gtttttcttt ttgccacagg 86941 agtatcagaa tgaaggaaca atcccatatc ctgtgttgta cattagcatc gtgttgaacc 87001 cttgctcctc ggcttcgatt tctctctttg cgaaccggcc attgacccgc ggtcttatct 87061 ctgcatatgc cttcctcgaa gcatacctta ttgtcttctc aaatttcctt gtcttcctct 87121 tctctctgta tctcaggacc ctggcttctc tgtccattgg actgagttgt gttactgtta 87181 tcatctggct tgcagggtca ggttgttgct ctactgtccc tttgggcgtt cttgggtgtg 87241 aagctgttgt gacacatgct gttgactccg gcacaacacc agtttccatg gatgaaatgt 87301 atgcctaatg acagaaacaa agacattctc aatccccctg gtaagaggac taaagagagt 87361 cgatgcaaaa ggaagcatat attgttttta ttatagactt ctggtgaaat ttctatgtgc 87421 tatagatcaa ctgttggcag tggatcagtg gatttgtgtg tatgatgaaa gaattcaccc 87481 tgctgcgtta tgggaagatg ctaaattgaa ggggtaacaa atatacaaaa gccttacgtt 87541 atggttaatg gaaccattgt cgttgtagtg agtccctgag gagccatatt tgatattgaa 87601 ctgaaaattc tgttggttat ggcactggtg gccccttgat tcttcaagtt taagcggaac 87661 aactctatct cccccgtagc tcgtctgtgg tacgctgcag ttttgttggt gttgactgta 87721 ttcacctgtg aatttgtagt ccatactcga gttgtaatcc acaaggttta gatactcatc 87781 actaaacaat aacccattgt tttggttatt gttatttttg tctgaattag ggaacagcca 87841 cgaagcaacc tccttggcat ccttatcacc ttcttcacct tcctcttgat ccaccaccag 87901 tctcttctct ggatcggtca ttgttttctc gctttggtgg tgagtagtgg tcatggagct 87961 gaaagagttt ccagaaattg gtagaattgg aactcgctga tggcgtctag caagtgggtt 88021 tgcagaatga acctctgaat cacaggctgt gcatagagag gcatcatctg cctcacacaa 88081 aaaagcagcc ggagcacgct cacatgactc gcagacccgg acacgtttat ggcgggaagc 88141 aacgcgattg gcagagtgaa cttgagcatc acagctcatg cacaagtagg cagaatctgc 88201 
atggcaatac acggtgcagg cgtttgaccg gcatgtgtca cagggtcgtg ccctgttgtt 88261 ctctccacta cctatgtcgt tactctcttg tttcaacata ataactcaga tgtagtaagt 88321 ttgatggtgt gggagctggt atctgaaaga aggggctaat aaagcttata tctggtgtga 88381 gagaataagt agaggggaga tcgaatcctg agtgcctaca cgtggcacag cgttagtggc 88441 tcttacatgt tattggatcc tcttgcagct agttgagctt ttgggaaggc ctatgtgtaa 88501 agtgtaaagt gtaaagtgta aagtgtaaac atctcctcct atgcacttct tcgttgggac 88561 ccgttcacaa gttgcttagg ttttgccact tgccatactg agattctcaa gaggaagtgg 88621 tccggataat gccatggtgt tgcaggcaaa ttgttactat tttttcgtat caccttttac 88681 tattttcata tgcattttat ttaaagtaaa gctagattcg ttttatctct ttggctttat 88741 acaaagttta agtggttgaa tattcatcct gaaccttgtg attactgttc aggtatctct 88801 ccgagtaagg cttcgagtgt cagagccatc accaactgtt atgggttgac tacctataca 88861 aaggttagac aaactcgtga atatggtata gttgttgttc cattatttgt gaatgcatct 88921 gcatcattta gaaagatttg caattgaaac tttcgaagga ttttctgatg aaaagtgtta 88981 taatgtttaa tctggttcat aaagtcatgg taatgagaat catatcggaa aagtgacatg 89041 aaaaacatag agacggtgga ttttctagga tcaaactcga atccgccaag tagtcttgag 89101 actacaaatt agaatttacc tgacgcacaa gttaaaactt aaaaggcatt ttggagtcca 89161 catgagcaat aattagattt atcatatata gtatgtagaa atttgcactt ggtaaccatt 89221 atagtaagta gttacaaaac atgactcatg agtgttgtca catctctgta atttgggttt 89281 atgatgatac tctatactat ttattgtagc tttattagat ctcaacctgt ggaagtaatc 89341 taatacaaac gacacttgta tggaattttc aggacatgtt tctaagatca cagaggtaat 89401 tgtgaagttt tctgctacac tcatttacta gtccacatga tccatagatg caatttaacc 89461 atcttcactc tatttggtga cccattcatg tgtttagcat cgtgggtttt cctgtactaa 89521 taaccattag tacttggagt tgccttgttg gtgttatgct cataatgttt actatgtgga 89581 agctacagga tacgatactc tctaggtttg aggactagct ccagttacgc ttttgggtcc 89641 agtgaatgta tgatacccat tatgtggtcc aaaaagaatg agatttgacg gacgctctag 89701 tggccaaatt catatggcta ccttcactct atgccttaag agctgaaaag tgagtgtttt 89761 ttttttgtat aaaatgctgg tcctaggata cacttttcgc atctcttcct ctttcatatt 89821 tgcttacacc gcacttctct ttcccatttt gaactttaaa ctttgaaaag 
tttttttttg 89881 cttggtaatt ggaacttaac atacaagttc ttgttattta ctggttttac gatgaatgta 89941 atggcattta gaattgttat gaaatggtta tatgatcatt gaattctaaa gaaaagattc 90001 taggttcttt agtaagcatt tttggagatc tttatcctgc caagttggat aagttgaggc 90061 atctttggct tttagtgctt tggtgatgaa attttttgga gattgtaacg tggttcgagt 90121 tcatgaagag taaattgtgt attggccttt tcacttgtag ctcaagaaaa aagtagtttt 90181 gcttattatt gctcgttgac agtctctaat taagtttatt gttgtagcgg tttggaactt 90241 ctctcacatg tgtaggcctt cggataactg ttacgagtaa tagatcatat gtaacaatat 90301 cctcaaagtt gcatcgagat tttcaagaaa acactattgc ttatttgttt tcattcttta 90361 tcaggtttcc gtggtttcga atactttgct tctttcagat taacgaaaac cctatttagt 90421 tgcaaagact gagaagagaa taagagtcga gggaccatat aaagcttgta attgtacgtt 90481 tatgcggatt ctccactttc catggctgtg agaaactatc tgaggaaaag tgcagtgacc 90541 tttggcaaaa ggtaatcact tttatgcact ttcacgtgct agctcgaatc cacaccaaat 90601 agtaacattc tattggctct gaaactcata gtgctccaag tggttgtcat tgcattttaa 90661 aaattcctga caagaaattt agtagaaact ttgctacttc aagtttcctt ctttatgata 90721 tgctagctcg tattatatgc aacatgggat atctttgttt tgcttgtgtg gcttataagt 90781 ttatgttgca tgatcaaaac agatcactga ataaattttc tggtggtctg agaactcgct 90841 gcagtgaatg aatgagatcc atgtaagtgg ccccaaaaga acataattta caacactctg 90901 gtaggtaaat tcatgtggat actaaaactg actcatttat ggctaaaaag tattgatttg 90961 gtacatgaat tcatcagctt aaccgttgaa gtgggaagct atcctagtgg taaataagtt 91021 gatctctggc cctttccagg tggcaaaaag tgaaggcttt acagtgtcac cctttttctt 91081 ttggttgcgt cgttattctt tagtgccagt atataccttg gaaaaagtga gctattattc 91141 taaaagatag accaagaaat agatttctca cactgaatat ctttgttcat ttggatggtc 91201 cataggcatt gacatattct gctagtagtg gttgtgagtg atagaatctt gttatccgag 91261 acaacctcgt gggaatggat ttagcaccaa tgctaatcac taatcagtac aagaatatgc 91321 gttgatgtaa ttgtacctat tagcattgtt cactcaggat tcttggttat ttgttacctg 91381 actattcagt gtgtgatttt attggagttt tacatgttta cacttgtctc tgtaatctac 91441 ttttggtata aaatcatatt gagtacaact ttcttttctc agggatgtgc gaaattgtct 91501 ttggggcggc ataagtggaa tcatttatct 
tctacgtgat aaagtgatgt ggaaagtgtc 91561 accaagaaag gaatcattaa ctttgaactt aggcaaggtg agttacgtgt tgactgttct 91621 tggtacagct agctatacta atgcatatct ctttgtagca tcagtttgaa tttagactgt 91681 agggcagtat ttcatagata agaggacttc taccaagtgg gcaaaatttg tattaactta 91741 cagcagctaa gtatgagtgt atatctaatt aactcgtact agttaggcaa aactagtaaa 91801 ctacagaccc tatatgcata aaaccgtggt aataacttgc aggaatttca aaccagaagt 91861 aatttacaag catgcagtct agttttgcca tagcagcatc agaatgatgg aacaattcca 91921 tatccggtgt caaatgttat cattgtggag aaagcttggt ttgcctcttc atcgacatct 91981 ttcttctttg caaaccggcc cttgatccgt ggtcttttct ctgcatacgc tttccttgaa 92041 gcatatctta tcgtcttctc aaatttcctc atcttcttct tctctctgta tctcaggact 92101 ctagcttctc tgtcccttgg actgagcatc tgagctggag ggtaaggtgg ttggtctgtt 92161 accgctttgg gcgatcttgg gtttgatact gttgcgtcac tcgttgttga ctccggcaca 92221 actcccaggt ccatagatga aacattcacc taatgacagt aacaaagaca ttcttagaaa 92281 tttacttcat ggatttgtgt gaaagaagca gcatgctggt catagtaacg gatctgaatt 92341 aaatatatga aagcttacca tatgactgag ggaaccattg gagcttcgaa gagctcccca 92401 ggagccacag ttgatactca gctgaaagtt ctgttgctct tggtacatgc cctttgatac 92461 ttcaatttga agtggaacaa ctccatcttc cccatagctc ctctgaggta cgttgcagtc 92521 tagttgatac tgattggatt gatctgtgaa ttgcttatca ctcgaactat aatcaacaag 92581 gttcagaaac tcatccccaa tcgagaagcc attgttgtta ccactgtttt tccctgaatt 92641 aggcaacaac catgaagccg cctctgcttc atcttcatcc tcttcttctt gaccaagcac 92701 aagtctgttc tctggatctg tcactgttgt ctcacatgaa tggttagtgg ccgtggaact 92761 gtaagagtac tcagaaatgg gcagaattgg aactcgttga tggcgtctag caagtgggtt 92821 tgcggaatga atctctgaat cacaggttgt gcatagagat gcagcatctg ccttgcagaa 92881 aaaggcagcc ggggctcgct cacatgattg acagactcga acacgttcat ggcgggaagc 92941 aagacgattg gcagcatgaa cttgagcatc acaactggag cacaagtagg cagaatcagc 93001 ccggcagtac acggtgcagg cggctgatcg gcatgtatca caggcttgtg cccagttact 93061 ctctactttc aacatactaa ccgcagatgt tgtagttgtt tgatggtgag tgagatcgta 93121 tgtaaaggca gcaggggact aagaaggcta tattagttgt tgaatggaaa tggaaaggaa 93181 ggatagaatt 
ctgagggctg gacacgtggc gagaagtttg tgggtttgat gttgagtgat 93241 catcttgtgg ctggctgagt ttttgagaag gggtccagtg cagagatgct tacagctttg 93301 gaatcatcaa ccagcacgaa atgtgaaaga aagaaaaaaa ctaattgtct tcttatatct 93361 ttgcaatggc accaatagtt actacttttt ctctctttct tcctcaataa gtgtaaacac 93421 ctcttcctat gtaatcttct ataggaccca tccacatgtt acctaggttc tgaaacgtgc 93481 aacttctaac tatttttgta tcatctacta ctattttggt aaagctggaa actttagtag 93541 ctctttggct tgaccattag cctaatgttt atcttgattc ttaacttttg ccatgttcta 93601 actgttcagg tttctctcat cgtctcatga tttcaattat aaaaattaaa tgatcactgt 93661 ttagcattgt taaaaagatt atgactctta accatttgtt atcttcgaat gatgctaggt 93721 gtcttttttg gtgatatgtt ggagataata gatatggcat gtcctaactt tgtttccttc 93781 cacatcaagt cggtcaagct gatttggcca acggtggaga tccaaggagc caattggaat 93841 ctgccaagtc gctacgagtc tctagagatt aggagccaca agttgaagca aatggtgggt 93901 ccaaattaag gtgaataacg atagttatat caacatcacc tgtggtctgt ttatacaagt 93961 ccaatttgct tatgatcacc atcatatttc atagttacaa aacatgtata agtctatcaa 94021 gtgtttgtca catctcaatc acctttttga gaatatgata aaactcgata ttatctttac 94081 tagcttttat tagatttcaa acttgtggaa gtaatcatat tcatgtgttc aaatctacat 94141 gggtcaaata aatttggtgg agcaacaaag tttgttttta tttacatgat gttctactga 94201 tgcaattaaa aacacagcaa ctaacaaagg taaaaactga ggatcaacaa acaaagcttg 94261 ttttcctttc ttggcagtca gaaagggcta atctcacgag ccagtttcag taacaactct 94321 ggtacaagac gctttctggg aggagccaac gagtctttgg tcagcccatc attgtcctcc 94381 gcatgtatca cagagactat gtcatcaaag agttcgtctt tacctcctcg taagggatcc 94441 tataaacaat actctcatga gttttattgt actcacactg agctaagtgt ttgtgtgtgt 94501 gacacaatgt aaggtaagtt gactgattta tgtggcttac ctgaagaaat aagtcagtat 94561 gtgtctttcc gctgtataaa acaagctcgg cttttgctcc aacagcttgc agagcgtctg 94621 taaaagtttt gctgcaaacg aggtggtttt agtatcccaa aaggtttcat cagtaatgta 94681 tcacttcaag aagatgatca tatatacctt tcatcgcatg gtatggaata atctgaggat 94741 ccgtggaaaa gtataatagg aggtaaaaga gaagcagctt ttccaacaac tgggtctttc 94801 agtcttactt ccggagagaa tttctcaaag gactcttctc cttccattat gctgcaagtt 
94861 ggagttgaac attagatctg atcagactgc gtttaggatt tggttaacca agcatttacc 94921 ttaggaaaat cgagcgatac agaccccgat tatggaagtg atcaaccaac ttgtatagat 94981 tgtacctatt gaagttcaaa atgataagca gatcaattgc tgtagaccaa aaagcacata 95041 gaagccaacg ggaatccaaa agcttagaag ataccctcca gataatccaa aataagcttt 95101 tatctgggac actgtccagg agatgctctc tccttttaat tctttagtag cttgttccaa 95161 tagagcacaa gcggctatat gggcaccagc tgattgcccc atcaggtaga tcctagagag 95221 acaagcagat acatgcagtt taatagccga gagaccaatt ggaagatcaa atggtaaata 95281 aaaacaaagc caatgagatg aaacctgttg gggtcacctc caaatgcaga gatgttattg 95341 cagacaaatg agattccttg agaagcatca gtcaccatat cactaattgt tccctgagga 95401 aagttcctat accataaaga acggattgct tggtaaatgc atgaaagttc ttcctatatt 95461 cataaaagac gactagtttt ttgaattgtg taatttgttt acctgtagtc aaggcatgct 95521 acaatgatat ctctttctgc tagctgcatt cccaagagcg agccccaagc tttgtaccta 95581 aatatcagta caagattagc aaacatgaac ttattaacaa gcaaccagct agtacaagaa 95641 gatgaaactt gagtgagtca gtgagataag aatacagagc ccatcatgat ctttactttt 95701 ttctttttgt aactcaaaca tcaacaaaag aacagtgaag gggaaaactt ctataagtgg 95761 gacacgagaa attgtaaaac aaacaaattt caacattatg cagaagagaa tgctgtaaat 95821 ctgatgtgca acccacccaa taatccaagc tccacccgtc acaaaaacca caaccggctt 95881 caagccatcg ttgttgcttg gcaagtacag atccagccta tcaaatcacc gttcatacat 95941 caataatctc aagacaaagg tgatgtgata aaagcaggca ttcaacatac cacttaaggc 96001 cgatgtgaaa ggagaaggta agattctatt cagttcacac acaaacctgt tccttggttg 96061 atctccatac actatactcc tccggacttg ctttgagaaa aaataactat atgcaactgc 96121 acaagataat acaaacttga gacgaggaat caaagactaa ttttccaatt aaaacaacat 96181 cttaatcaaa aggaaaaatc ctttctatga atttatttat cagcaaagaa aaaaaaaaaa 96241 aaatcaaacc ttgaagaaag ccaggcataa gaagcatagc ataacatgta agggcaagta 96301 attttgtcat ccatcgatag cctaccctga acaaaactta acaattagat caatccttgc 96361 tcaactggaa atataaaaag aagaagttaa acaagagcaa aacaacagca catggagtaa 96421 ctttcataat tagagtaaag ttccaaaata aaagacagag aacttttata tatcattgga 96481 ttctttgcag agcaaggcaa agccagctaa tatctacacg 
aattacctcc ctaagcgtaa 96541 tcatcaatat acctaaatcc aaaaacaggg acaatcgaga tcagtacccg aggtatcgaa 96601 gaagcttgaa gctaagtcca gtaattagat acgtttcagc tgccgcgtgg ccgatatcgc 96661 gaccgaacga ttgctgccgg aaaatccgac gaggtccttc gtcaaccggc gacttcccgg 96721 aaactcgacg acgtaacggt tcaccgttaa gaagagtcgt tctatcggag tcctcatcgg 96781 gtagaacttc ctcgatctcc gataccgtcg aggtcatcgg ccagcatcgc tgctccggct 96841 gctgagtctg aagaggcgaa tgcatctttt ccgccgtgaa atagtggatc tc // chad100.scf100644000766000024 23533614006427353 16265 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data.scf"h:2.00e() Rq+R\ @!\@F@x@@@@@@@@@@@.@@R@ %@@%j@@@@@@@@@@@@@@@@y9y+9\6%j 2%()8'[#phB'@sf> ))L6yBRP,_n~AzM)]`   J ~*7FZ4p7EZsAEn`+/=JRPOE@/@@@@@&j@\;@ @@x@@lS@R2's I^ E;A 3I lG[f%k$~?+U1_Z4 WFh Jk hn&G 8B&Acw0&4!cm9Bk8GW'%(ZB{&$D  =|27UiX..q54jVB2#fy@<YE>T3%(E!X c  r >  ! = ` r,bp+JF ,`T.tt07@s8AyL#@K.x  te($e: "J&-Z5:4?rBg D DB?<\7m3.)`&#NY $#W@qxQi0 ;G~d''^]w(x87|71}!hj,4 ? ?$f|&N &VLvI2 OSJLC;y; @G^>R['A5GN   U 3MyI\=Bwu>< j0 3Y97lV#Rr}#/NT4g3Y[    HOM1C[h($HR   )~xzb{E&DPyYDc6XDYO&RU.X& #mdfe d4vm$+~`X!E<~u.6r<PfNk Lj[#CD"YgIk15  K 1#c9 , lPD=w'y,)| V;\ NT^9\(` Xw; S4g~*64Nd.: 7o7P'{F =x Ox5  B  H Kb ) d9&eXz36mTbsWIr A R_vN  9(|@ E  \D ]B WU?^xX!es'%F_/?(J"BM fI4`h<QR{.|:. R G  C, OTL:=$ 7  0 yeqXBzBeRe)*mM ,P.I iP$TF0Ec"<3 O% U8<t`>|i3   Y~3= EDi&.b=1/U+ .`TdY[ cs~ 3< Q [ @[tS  (? h:B?hR"- lf|=F+D$[hG8jt2RmZ C&]Lvn6\f)$aV'kCJ'EL1Z Fn) gXa y   g: $M%?|NoaE =&`^  E&  [  njA_Q} x F Z1&1u7jxr69   I!R?xl **_AAmoHhv1]h(LMMUAS Gu#5oh!a6? ?&\ <~6  l a|>%; l ] WW" <)#ds96`1 pE c ;V1s. IP`"AKFg   ~FbJP@UQ*oQI# ogE&d2   Z,1jj0O3'j DA$'^'$X DI8|Wc, &R! I! fXs JDE (e}j 1 JP [ 'Cn1^ V[XX^%&]Sr,DTlZd?*{m"G%O9sJ?0i93 2uB;q41l Ikd6[C) "^vf3].,|T- Ptw( 32uS{3Uj ` ,7 %sdR%_p#xIc|\me O;f$W C-m (V _y 8]p?S&Dk* 1s7]wE#SG *dThHQ!ER f)c s*:O ~$"Y? 
UDP!7m7UN%KY) IQ5d'3jEkjCbk6QoD<4"a>i\ppUg$Jen2 L pLxE l  ] ( U&{ *eO|9"tnj41r   ' b ~ ( p4P $!x   D 5rh`l94\Ezl.\NT!0$bp>H9m)@6 P&` ~rc FX n/n$bls"#:3NLy#c) 7oB` X ( Q B7p "w;Z  Y 6g @0p's=j $6)^ ,   1 ' =T-Arl^,_y=ym,kT^^$h25Bqr 6 > % B h%3Ll. u0v;,@F,IABUcZBJ >Cb,=,$}5t6=v KnlH xAPe4   ; = ~ .S<{.BJG ulgh>PuY n2 ,q 0  M^DEi"AK.bUcK>|4J ( dB , `J|CS]+ x *w@")y e  U>v*2oxawZ>e,T i/H / >r0i$d>02j`G7 w \ ~m>%br/)HC5d/kNB] wy3) 9  v]Au%m,BAm 1">"0  @z5y|M- B /  ~   H   /f em   '/ 7 N" X W E-|u42N ShdJ}[J2 ZBk Q  0 "YN L^LQ4&k Qt=m=+ {E:H"3OO+gZZ 1C \6-?_r#n4uR7Wn Xs0 G1L l!D |I;Ph%bD-P EFNU1-# )@ats)8O)gZVI9Eb(&.mC5 4p(e6Y& znp $ 2@u K)_^%Kx?6 p;@di,u <Q = ,O N8Il s_SW*L 7 D1B!P<SDI<tFjl M+cOy'NJBvq ,|=7i6X@+GCM#dUL VmK  W$ K 5{{6  WQ$ Z 8 h AN ?  G  SY) j i#' O&;! [GV R)bz# R]t6 q    C  /wp5,k Bl s U OM# 2@Lc)ni`y:G 73}'8XR  I !`Rk#AM>(/vGMclo+GV!VA?O>GXD2 :  t=/n2h(<6T(( 7 k ?  FwwGq =i|e(_ \Q  L@ C  N7 5G S %] B5C"GRo/> ~[$8hY(At `$ 6 r 5 f#H.! /nQ5v.0 * qg H5M9Nny 0 r;: rB)Xl$ Um E  %O S #N5ZQzN*94$aQG0*vL ,[_" M s-I N#v .I0!t>U @) !   B B t u  BE)(Ukv`?VD'?Da,wz0LEA Z;dA:a'^\:t8}j*^ V9  @ n& 1Y ]q od7i&pj gBf$X '_iB 2zee/pNJ`rd2Y9 Rc&6 d !qCK" 54iJ 2 d > Dv7]M/+sU ;T2 - %.m B[&2 zd 8,K}qKeKogHx2Y) .5 |  8 ;f &'vG'%op/bMo1 9j ^ &{  }"w>1l&  4:z}A6xUUq9t7Oy@9! ?R hq G x Yz`%Ma) E 3q /&=L06"_hnT(& ^Emv4 1o1! [#C-E M jB.q &&w9H%z H#Ng57~;3g37 > & sS 7.ms; h 9*[8 p~ T  it!=|B$ GyLH<d>#(gY D 5f 8?a' }^a~Zsn/GK~@ x .*Vk3%^~a+ 8a4q NI @U[t  y2?[0 ?P2|8u/ 1)I$s%"? \ 1 yFl:)~d|R d*\8dX-I{S!:l-$]* *06Nv?& M@E W 5A#y"j STv#\= hBpT?W(3~' k ;] _ & UMd yT{]v%M 2]|xF?%C HFS= [ 3F9 //6 SF9p>vw [C]?qyT89D P>pZFe ) j .^ iDO, {y |70\ EcQI=DB8]?rPB` $"COYK-+BL%> zP@MoZ }.Hg$%8 %(mx@~C 0 k   JmnD( 'vR? Mmr[ -)ct-C"( j/K<PX .lW0@B%.Tj17gqGTW(EI#{$Emr9v"J_)RB?  SOp     7  )  j 3[,3E +~lI)\ !ks8 F  l ? Erg&  _);( F3wWV,ZE*:u?  
Y}P^4!XbHF kF~=fEGmDqyXw=%4b$ &n[5  K{~ T&V2oy?8r_2BCi$HEEm4S5#fXHo 2hgh6/P74 d/f^54qZ )cZ.1/$E4"RRQL3%u e i SOG!(V2$E7tR4'DaSo4sn>< ht7r=nc25d|xX#(jhW;  PX%A;CO- .?+k 0XqLAYV(b`-M-# t  JY%o  MB /mnAX*7Wus=2@Lo4 u '"h!=]  Xy O  S )\*cEgK,oggGM S1 V"'E x*LUD'ehM; bvJ`  <'Rk 1NlmR?a5X0&6F! ?O 7RFt&hvZA ek3YM*lxIuAvv<!Uo;3P)mlJ5e'[  /~j[%)5aL,71 $s]63c BcFv@]sBsQj7 i ]0Go1(ht,JX .  } $  x<hrU|D/D}%laB?HC! OMT $<Qw?L  X`@n<9 b8F LN Xsd0 }  x1 C[! .A<!<oZ4E| UlI$0] ) 9wD^;,1fa)q5skB-XU-PnpX.  H ,KF[1%4Y u)/X]?2RSt?@ 7J +M^&b w{ w&SIr;7"zX*N*% N RK{:< 5nl2CB-xg 0TyH$ve*N<< *0G(lrEl&!mY ) a*E6K#Y_d;_#y >,)|%6gO)?|o4@yiM,e bJzL.a V~hDX4&7f5amT@i An <No nR(<VZBth{(Xk],9$wG## p^>} F^&0tp4E4xMcRb  ;uV',   =b`7)U9BT, (kkf|F#  KEv# "ME$t !ETP>()Uu+OB ) I13L,-5Kva7,d k6J{}L*{[}5z[?j,# z~.-c(ce4 tXWv;p{J8z;YQk)"9pA{L.}:AJKM)IP" B[.0 >RjR/@Z\H  !L{Kk)`~0?  7  S. T`8 -P _ Y?EyL,+ ;8I)r }.lyGL/g+C%b E] SL $0 4,_kqB[$P&AH5q8b[6 7R XI (,c&iE*} ZVIb  T ? vQ*o w+3XpL3 $ #  ArjC%6hxQ /vlO190 <'ae:cw5  nyWUm 9|-VW6!:Z c W: L =C)5fM Wk<  +^n!T-3943+UvP6$W  R3mJoG,g   z, eB " @ k t= p\ 1sXV=)|H gNT > jn8X mAkG*/Hgzx^.28"Pj " Q x"fyI+u  >4yoMm6&= n%MrYeL$GMfF{. 9L- qKsa4  C O'/Zck#bkA'Z# >]A-jz  & 7  +  9[dM !Cks l*:VmlV- -fiI.1". Z CmsE8x@PY^/A3 *+[Hio#K]3r#b.  $P  yLB :RE 'z[ N )}a+;vK5mv'YZ;'~ S+&C^lfM&(g+SlZn<; 0Byb'1Zoa52g?!#_4 K _B7o$o;V C 1eq<O H   u R L_W9t1U/eV7j=ZfA) Eur54s)LcQ*1YPJ a(]z R7-# 57? H\ZOc;$`$[" r6 > .%$ A*db2=$E 8 .   R         s{*FI/)W^ Y}L#)pq< AX\G2cBMY=*.8 X n@a"e-9fU5#Zw N+iPyS63# Q    l491q \+6#)@o/ 9uCR- 1  1]eX+   .d%F}st A*"::.1cHc1 3?6 %S=iaL5u; q.=sh;v 8@(4Yf~SN+?w[!}K'04<h~yU!MmGQ-3K @r^o_MU1 K(_S/@ |? 
)^O0l8<IV.A=  G8_^oE*2=7% +^ )TV,zuG%X@|u<Q5'  ;iD'[X}/zX'(SL1{ ;LK%7Gx4ah@& y V 5   :l C wZ]@uvI*f     :?jzO 16NS D$"L+a<!#!<&AdW9# }\$?D'p )::,E{ 3fl'Y1n *RGj% a @*o0 D_@S*= I:o}SI1 ,EQlH,wuL2$!*<gY ~vU(@=4D.?,5-4-G.z//J-)x$p v W9cT}A[aDo5k+["F4.7S |>l}"If??#  }0OFxrN6&&%   / \ 'Q`5g=  i"OnpK1e 1TD` Qe9 Xe-n^ {gG^1$$5SNrO9+D<$/nWF( ;e~vw 8V]H.1WlG.%'/YwlQ1e :7{(oSHEu9P\;]1>3BYhzb#>A$SVK3/`coO5\"}~ d8W*C5Hj#BowI)C cqlU=1p [oI-,^HmT.+2(,nRN4"S #Gvw:*U^o>v(g;Hz M_W5F Cx,]:{yQo2 FB;qhv,I\lQ K)'6W'En oLB$,MR)s^8") P <c{K((.$  !Et0T:'{ aJ@=:. _:gO@* (( $GxREX4 4eK_U45G':*Bn5`p%Gt) ~D ,d)aRh/ M0*9,   ,b]be<"Q/>r[ 4Z  8dI<nD[^D3& 5`Q 7P-Nc_B  |{  %wUD  oM4G$ke= l4 &eO*6dE(/OX @  :ls  oO0)x    r ;           yuy    /6& ;h53v{M01Yc%'x^%C04/6Jtk*v>Qn7'* aFC(OD:6QX_5Fn5b(2  Y?  9`NM_4J [[qH-        N $ E skH/Klz v_E; -]D@}+i * 8JTZ?7* $ M|$eb  %XQ8$GS *[;("(:GZ%H_A38QA?[:$ [ Giyt\82N]\J+`)4`Y80 z  +:=1 !gB$tSaJ=8#?Qoz1As~Y@0'!E :^TL0BUZ.WhrPz@p$4UE,.r%qKLm .  A   0Xe %zP<PXH<%)7jRRb/  &$@mZPOT\baV @AtsYO1 ."'BqCL`,B+-!? lU 4}dD-) N Zbf@$!#An! },!`e lP(=K1N*8&#"    d D0*.6<8&D{a qz'T47%"! {:2[}X=+$T VTs k#>L#FR"` F?3x&{"*b  U}K(y3 &OG8sVC?."}S(   *Rj43cN@& &I{M>YzuV@/W#"<r #nQ9_'-TQ E<uuN1w):HscE!09#GQ_{  ]OPX20z[|B8?m/is\PK=.f#u'G uP$5! 
n^YY\\VH3q:%CkwXA0".96%&C n'idK6% #KoE4#nZWgXPC(|TUn9v&pbQB6,# z"@?guW(@N1j'!!r5  "I?ia5**g5\&^j  = G`dB u,!vDiwN ( h~X]D0!Y  E4{!cN9CzBK]vP[  )BOePMEC0I`hmUCj725@Uhr y( &;pJIU-`q !8>0 :kYh{4^MFKYokb+3d--DS, `eH08D A 3         |, YM4z9vyU6 #K ACT"T2G<GETTGVCdGkC}CT GCA G"G"T'A'C(CG.GTCC$G-G> AK TW T`Cj-Ct.C*G*G*G+T!C!GAC C!C#A"C"G"C(G(T)"C4#C@#GG#CT#T_#Ch(Aq-A{3A-A'C'A'G(A(G(T(A(A(G(A(G(C(T3T&3G0.A:.AD.AM.CY.Te.Tp8T{8G8G3G.G.A.T3A.C.A.A3A3G3A.T#G#G(#C8#GA#TL#CY'Ta'Al'Ty'G8G8G8T8G3T-T-T-C-G-G-A3A .A(T#(G,(C:(TF(TR(G].Gh#Tq"A{"C"G"C"C"A&C&T&T&A(A(C(T(T(C(A$(A0(G=-GG-CR-A`(Gl-Tx+G.T+C+A+A+G+A+T+C+G+T#C#T#G#C)#T2#C>+CH+AT+T_+Cn+Gz+G+C+A+T+C+G+A+G+C+C+C-G -G$G"#T-#A5#CF#GQ#T^+Tj3Tt3A+A+G+A+C+A+G+T$T$G$C$T $C +T -G *3T 78T B8C O8T Z8C f8C q3A }3A 3G 3A 3A 3A 3A -A -G -C -C +A +G +C -T $-C 0+C >-T J-G T-C a3C l3A x8A 3G 3G 3C 3C 3A -A -A -A -C -C -G 3T 3C #3T .3C ;3T H8G S+T `+T m+G v+C +T +A +A +C +G +A +T +G +A +A +C +T +C #+G ,+C :+C E+A Q3A \3G j3T v3G 3G 8T 8A 8T 8G 8G 3T 3C 3C +T +G +A +C !+A -+G :8G G8A S8G `8A k8A w8T 3T 3T -T -C +C +T +G +C +C +G 3G 3A8T!8G-3G:3TE3CS3T_3Tk8Tw8T+G+G+A+C+A+G8A8T8C8A8G8A3G3A*3T43CA3CM-CY-Ae-Gs-A}+G+T+A+C+T+T+A+A+A+C+G+G-T8G&8A28A>8GK3TW+Tc+Gm+C|+T+G-G3A-G+A+C+T+A+T+G+G +C+T#+A0+T=+GH8AS8C`8Cl3Cx3A3T3T3T+G+G+A+C+T+T3G8G 8C8A 8A,8G98AE3AP3G[3Ci3Cv3T3G8A3G-A-A-C-T-T-T+G+C(T(A(A(+A43T>3AI3CV3Ca8Am3Ay3G8C8C8T8T8T8G-A-G-C-T-G -A3T"3C/3C93AF+TR+G]-Cl-Gw-A-G8A8T3G'G&G&C&T&A&T(G8T8T!8G-8G:8GF8AR8G]3Ci(Av(G(C(T(G3G8T3T3T3C3A3T3C-A-T-C)3C53CA3TN8GY8Ad3Ao3G{3C(T(T(T(A(A(A3C3A3A3A-T-A-C-G,3G8-AD#GO#C^#Tf#As#A}#C'T(G3T8G8G8T8C8C8T8G8A8A3G'3C43CB*GM*TX*Ce8Tq8G}8G3T3T3C3A3A-G(A-C-T-G3G -G(G#(C2%T<(CI(TU(G_(Cm(Tz-T-C3T3T8G8A8T8G8G3A3A3A3C3A-C&(A2(T@3TL(GW(Ad%An'C|%T%A%C(T.T(T,G(G(C(A(A G AA C* A6 TA(CN(C["Ce AqA| T C A A"C(C(T.T(G(T(T.C (T(C"%G+%C9%CF(GQ.T\(Ag(Gt!TTGCTGAGGTTG T T(C&"T0(C=(CI(TU0C`"Gm"Gx0T0G0G(A(G(C(C(G(A(GTACT A&C1(A=0GJ"AU Aa"Tl Cv A$C$C A A"C GG A T TGG"A0T&0T10T=(TI GTA^GlGxACAAGCTAC A(C(C(C 0C G #G 0A 7 G E G RT ZC gC rA }(T T T 
G (A T C "C "T T T A! G! G!A!)C!7T!CT!OG!YC!fT!oA!{A!G!G!A!C!C!C!C!G!A!A!C"A" A"G"#G"0A";G"EC"ST"]C"jT"sC"C"T"C"A"A"GSIGN=A=587,C=301,G=615,T=409 SPAC= 11.91 PRIM=0 MACH=SRC3700 DYEP=DT3700POP6{BD}v5.mob NAME=ML4942R LANE=89 GELN= PROC= RTRK= CONV=phred version=0.980904.e COMM=MeristemLib_DH12075 SRCE=ABI 373A or 377psi_xml.dat100644000766000024 1000414006427353 16554 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data adenine specific DNA methyltransferase (dpnA) HP0050 Helicobacter pylori 26695 hypothetical HP0001 Helicobacter pylori 26695 GTP-binding membrane protein (lepA) HP0355 Helicobacter pylori 26695 bogus-binding membrane protein (lepA) HP0355 Helicobacter pylori 26695 Two hybrid test Two hybrid test Two hybrid test seqfile.pir100644000766000024 213214006427353 16536 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>P1;CCHU cytochrome c [validated] - human MGDVEKGKKIFIMKCSQCHTVEKGGKHKTGPNLHGLFGRKTGQAPGYSYTAANKNKGIIWGEDTLMEYLENPKKYIPGTKMIFVGIKKKEERADLIAYLKKATNE* >P1;CCCZ cytochrome c - chimpanzee (tentative sequence) GDVEKGKKIFIMKCSQCHTVEKGGKHKTGPNLHGLFGRKTGQAPGYSYTAANKNKGIIWGEDTLMEYLENPKKYIPGTKMIFVGIKKKEERADLIAYLKKATNE* >P1;CCST cytochrome c - snapping turtle (tentative sequence) GDVEK.GKKIF.VQKCAQCHTVEKGGKH.KTGPNLNGL.IGRKTGQAEGF.SYTEANKN.KGITWG.EETLM.EY.LENPKKY.IPGTKM.IF.AGIKKKAERADL.IAY.LKDATSK* >P1;CCFG cytochrome c - bullfrog (tentative sequence) GDVEKGKKIF(V,Q.K.C.A.Q.C.H.T.C,E.K.G.G.K.H)KVGPNLYGLIGRKTGQAAGFSYTDANKNKGITW(G.E,D,T.L.M.E.Y)LENPKKYIPGTKMIFAGI(K.K.K.G.E.R.Q)DLIAY(L.K.S,A,C,S,K)* >P1;CCND cytochrome c - love-in-a-mist (tentative sequence) AS.F.BZAPAGBSAS(G.E.K)I.F.KTKCAZCHTVBZGAGH.KZGP(N.L)H.G.L.F.GRQSGT.VAG.Y.SY.SAANKN.KAVN.W.EEKT.L.Y.DYLLNPKK.Y.IP(G.T.K.M)VFPGL.KKPZZRABL.LA.Y.LKESTA* >F1;C44264 ALL-1/AF-4 clone 25 mutant fusion protein - human (fragment) /EKPPPVNKQENAGTLNIFSTLSNGNSSKQKIPADGVHRIRVDFKTYSNEVHCVEEILKEMTHSWPPPLTAIHTPSTAEPSKFPFPTKDSQHVSSVTQNQKQYDTSSKTHSNSQQGTSSMLEDDLQLSDSEDSDS/* ProtPsm.t100644000766000024 
455614006427353 16532 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Matrix# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 14); use_ok('Bio::Matrix::PSM::IO'); } # Test psiblast reading functionality. my $psmIO = Bio::Matrix::PSM::IO->new(-format => 'psiblast', -file => test_input_file('atp1.matrix')); ok $psmIO; my $psm = $psmIO->next_psm; ok $psm; # Verify that getting IUPAC sequence is functional my $IUPAC = 'MEMSINPSEISSIIKEQIENYDTKAEVSEVGTVLSVGDGIARVYGLDNVMAGEMVEFPSGVKGMALNLEEDNVGVVLLGDDTGIKEGDLVKRTGKIVEVPVGEALLGRVVDPLGNPIDAKGPIKTDERRPVEVKAPGIIPRKSVHEPLQTGLKAIDSLVPIGRGQRELIIGDRQTGKTAIAIDTIINQKRINDESTDEGKKVYCIYVAIGQKRSTVAQVVQTLREAGALEYTIIVAATAAAPAPAQYLSAYAGCAIGEAFADNGAAACIIHDDLSRQAVAYAIISLLLRRPPGREAYPGDVFYLHSRLLERAAKLSDELGGGSLTALPIIETQAGDVSAYIPTNVISITDGQIFLETDLFNSGIRPAINVGLSVSRVGSAAQIKAMKKVAGSLKLELAQYRELAAFAQFGSDLDAATQAQLNRGARLTELLKQPQYSPLPVEEQVVILYAGVNGYLDDIPVEDIRDFEKELLEYLKSNHPEILESIRTGKLSDEIEKALKEAIKEFV'; is $psm->IUPAC, $IUPAC; ## Lets try to compress and uncompress the log odds and the ## frequencies, see if there is no considerable loss of data. 
SKIP: { skip('TODO: Module incomplete',10); my $fA=$psm->get_compressed_freq('A'); my @check=Bio::Matrix::PSM::SiteMatrix::_uncompress_string($fA,1,1); my @A=$psm->get_array('A'); my ($var,$max) = (0,0); for (my $i = 0; $i<@check;$i++) { my $diff=abs(abs($check[$i])-abs($A[$i])); $var += $diff; $max=$diff if ($diff>$max); } my $avg=$var/@check; cmp_ok $avg,'<',0.01; #Loss of data under 1 percent is $psm->sequence_match_weight('CAGAAAAATAAAATGGCCACCACCC'),2015; my $lA=$psm->get_compressed_logs('A'); @check=Bio::Matrix::PSM::SiteMatrix::_uncompress_string($lA,1000,2); @A=$psm->get_logs_array('A'); ($var,$max) = (0,0); for (my $i = 0;$i<@check;$i++) { my $diff=abs(abs($check[$i])-abs($A[$i])); $var += $diff; $max=$diff if ($diff>$max); } $avg=$var/@check; cmp_ok $avg,'<',10; #Loss of data under 1 percent my $matrix=$psm->matrix; ok $matrix; my $psm2=$psm; $psm2->matrix($matrix); is $psm,$psm2; is $IUPAC,'CAGAAAAATWVAATYCCCACCHCCC'; is $IUPAC,$psm2->IUPAC; is $IUPAC,$matrix->IUPAC; my $instances=$psm->instances; ok $instances; foreach my $instance (@{$instances}) { my $id=$instance->primary_id; is $instance->strand,1; last if (ok $id); } } Species.pm100644000766000024 4626114006427353 16461 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio# # BioPerl module for Bio::Species # # Please direct questions and support issues to # # Cared for by James Gilbert # Reimplemented by Sendu Bala # Re-reimplemented by Chris Fields # # You may distribute this module under the same terms as perl itself # POD documentation - main docs before the code =head1 NAME Bio::Species - Generic species object. 
=head1 SYNOPSIS

    $species = Bio::Species->new(-classification => [@classification]);
                                    # Can also pass classification
                                    # array to new as below

    $species->classification(qw( sapiens Homo Hominidae
                                 Catarrhini Primates Eutheria
                                 Mammalia Vertebrata Chordata
                                 Metazoa Eukaryota ));

    $genus = $species->genus();

    $bi = $species->binomial();     # $bi is now "Homo sapiens"

    # For storing common name
    $species->common_name("human");

    # For storing subspecies
    $species->sub_species("accountant");

=head1 DESCRIPTION

Provides a very simple object for storing phylogenetic information. The
classification is stored in an array, which is a list of nodes in a
phylogenetic tree. Access to getting and setting species and genus is
provided, but not to any of the other node types (eg: "phylum", "class",
"order", "family"). There's plenty of scope for making the model more
sophisticated, if this is ever needed.

Methods are also provided for storing common names and subspecies.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to the
Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution.
Bug reports can be submitted via the web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR

James Gilbert

=head1 CONTRIBUTORS

Sendu Bala, bix@sendu.me.uk
Chris Fields, cjfields at bioperl dot org

=head1 APPENDIX

The rest of the documentation details each of the object
methods. Internal methods are usually preceded with a _

=cut

#' Let the code begin...

package Bio::Species;
$Bio::Species::VERSION = '1.7.8';
use strict;
use warnings;

use Bio::DB::Taxonomy;
use Bio::Tree::Tree;
use Bio::Taxon;
use base qw(Bio::Root::Root Bio::Tree::NodeI);

=head2 new

 Title   : new
 Usage   : my $obj = Bio::Species->new(-classification => \@class)
 Function: Build a new Species object
 Returns : Bio::Species object
 Args    : -ncbi_taxid     => NCBI taxonomic ID (optional)
           -classification => arrayref of classification

=cut

sub new {
    my($class, @args) = @_;

    my $self = $class->SUPER::new(@args);

    # Bio::Species is now just a proxy object that just observes the NodeI
    # interface methods but delegates them to the proper classes (Bio::Taxon and
    # Bio::Tree::Tree). This will be supplanted by the much simpler
    # Bio::Taxon/Bio::DB::Taxonomy modules in the future.

    # Using a proxy allows proper GC w/o using weaken(). This just wraps the
    # older instances, which have no reciprocal refs (thus no circular refs).
    # This can then run proper cleanup

    $self->taxon(Bio::Taxon->new(@args));

    my ($org, $sp, $var, $classification) =
        $self->_rearrange([qw(ORGANELLE
                              SUB_SPECIES
                              VARIANT
                              CLASSIFICATION)], @args);

    if (defined $classification && ref($classification) eq "ARRAY" && @{$classification}) {
        $self->classification(@$classification);
    }
    else {
        $self->tree(Bio::Tree::Tree->new());
    }

    defined $org && $self->organelle($org);
    defined $sp  && $self->sub_species($sp);
    defined $var && $self->variant($var);

    return $self;
}

=head2 classification

 Title   : classification
 Usage   : $self->classification(@class_array);
           @classification = $self->classification();
 Function: Get/set the lineage of this species.
The array provided must be in the order ... ---> SPECIES, GENUS ---> KINGDOM ---> etc. Example : $obj->classification(qw( 'Homo sapiens' Homo Hominidae Catarrhini Primates Eutheria Mammalia Vertebrata Chordata Metazoa Eukaryota)); Returns : Classification array Args : Classification array OR A reference to the classification array. In the latter case if there is a second argument and it evaluates to true, names will not be validated. NB: in any case, names are never validated anyway. =cut sub classification { my ($self, @vals) = @_; my $taxon = $self->taxon; if (@vals) { if (ref($vals[0]) eq 'ARRAY') { @vals = @{$vals[0]}; } $vals[1] ||= ''; # make sure the lineage contains us as first or second element # (lineage may have subspecies, species, genus ...) my $name = $taxon->node_name; my ($genus, $species) = (quotemeta($vals[1]), quotemeta($vals[0])); if ($name && ($name !~ m{$species}i && $name !~ m{$genus}i) && $name !~ m{$genus $species}i) { if ($name =~ /^$genus $species\s*(.+)/) { # just assume the problem is someone tried to make a Bio::Species starting at subspecies #*** no idea if this is appropriate! 
just a possible fix related to bug 2092 $self->sub_species($1); $name = $taxon->node_name("$vals[1] $vals[0]"); } else { $self->warn("The supplied lineage does not start near '$name' (I was supplied '".join(" | ", @vals)."')"); } } # create a lineage for ourselves my $db = Bio::DB::Taxonomy->new(-source => 'list', -names => [reverse @vals]); unless ($taxon->scientific_name) { # assume we're supposed to be the leaf of the supplied lineage $self->taxon->scientific_name($vals[0]); } unless ($taxon->rank) { # and that we are rank species $taxon->rank('species'); } $taxon->db_handle($db); $self->tree(Bio::Tree::Tree->new(-node => $taxon)); } @vals = (); foreach my $node ($self->tree->get_lineage_nodes($taxon), $taxon) { unshift(@vals, $node->scientific_name || next); } return @vals; } =head2 ncbi_taxid Title : ncbi_taxid Usage : $obj->ncbi_taxid($newval) Function: Get/set the NCBI Taxon ID Returns : the NCBI Taxon ID as a string Args : newvalue to set or undef to unset (optional) =cut =head2 common_name Title : common_name Usage : $self->common_name( $common_name ); $common_name = $self->common_name(); Function: Get or set the common name of the species Example : $self->common_name('human') Returns : The common name in a string Args : String, which is the common name (optional) =cut =head2 division Title : division Usage : $obj->division($newval) Function: Genbank Division for a species Returns : value of division (a scalar) Args : value of division (a scalar) =cut =head2 species Title : species Usage : $self->species( $species ); $species = $self->species(); Function: Get or set the species name. Note that this is NOT genus and species -- use $self->binomial() for that. 
Example : $self->species('sapiens'); Returns : species name as string (NOT genus and species) Args : species name as string (NOT genus and species) =cut sub species { my ($self, $species) = @_; if ($species) { $self->{_species} = $species; } unless (defined $self->{_species}) { # work it out from our nodes my $species_taxon = $self->tree->find_node(-rank => 'species'); unless ($species_taxon) { # just assume we are rank species $species_taxon = $self->taxon; } $species = $species_taxon->scientific_name; # # munge it like the Bio::SeqIO modules used to do # (more or less copy/pasted from old Bio::SeqIO::genbank, hence comments # referring to 'ORGANISM' etc.) # my $root = $self->tree->get_root_node; unless ($root) { $self->tree(Bio::Tree::Tree->new(-node => $species_taxon)); $root = $self->tree->get_root_node; } my @spflds = split(' ', $species); if (@spflds > 1 && $root->node_name ne 'Viruses') { $species = undef; # does the next term start with uppercase? # yes: valid genus; no then unconventional # e.g. leaf litter basidiomycete sp. Collb2-39 my $genus; if ($spflds[0] =~ m/^[A-Z]/) { $genus = shift(@spflds); } else { undef $genus; } my $sub_species; if (@spflds) { while (my $fld = shift @spflds) { $species .= "$fld "; # does it have subspecies or varieties? last if ($fld =~ m/(sp\.|var\.)/); } chop $species; # last space $sub_species = join ' ',@spflds if(@spflds); } else { $species = 'sp.'; } # does ORGANISM start with any words which make its genus undefined? # these are in @unkn_genus # this in case species starts with uppercase so isn't caught above. # alter common name if required my $unconv = 0; # is it unconventional species name? my @unkn_genus = ('unknown','unclassified','uncultured','unidentified'); foreach (@unkn_genus) { if ($genus && $genus =~ m/$_/i) { $species = $genus . " " . 
$species; undef $genus; $unconv = 1; last; } elsif ($species =~ m/$_/i) { $unconv = 1; last; } } if (!$unconv && !$sub_species && $species =~ s/^(\w+)\s(\w+)$/$1/) { # need to extract subspecies from conventional ORGANISM format. # Will the 'word' in a two element species name # e.g. $species = 'thummi thummi' => $species='thummi' & # $sub_species='thummi' $sub_species = $2; } $self->genus($genus) if $genus; $self->sub_species($sub_species) if $sub_species; } $self->{_species} = $species; } return $self->{_species}; } =head2 genus Title : genus Usage : $self->genus( $genus ); $genus = $self->genus(); Function: Get or set the scientific genus name. Example : $self->genus('Homo'); Returns : Scientific genus name as string Args : Scientific genus name as string =cut sub genus { my ($self, $genus) = @_; # TODO: instead of caching the raw name, cache the actual node instance. if ($genus) { $self->{_genus} = $genus; } unless (defined $self->{_genus}) { my $genus_taxon = $self->tree->find_node(-rank => 'genus'); unless ($genus_taxon) { # just assume our ancestor is rank genus $genus_taxon = $self->taxon->ancestor; } $self->{_genus} = $genus_taxon->scientific_name if $genus_taxon; } return $self->{_genus}; } =head2 sub_species Title : sub_species Usage : $obj->sub_species($newval) Function: Get or set the scientific subspecies name. Returns : value of sub_species Args : newvalue (optional) =cut sub sub_species { my ($self, $sub) = @_; # TODO: instead of caching the raw name, cache the actual node instance. if (!defined $self->{'_sub_species'}) { my $ss_taxon = $self->tree->find_node(-rank => 'subspecies'); if ($ss_taxon) { if ($sub) { $ss_taxon->scientific_name($sub); # *** weakening ref to our root node in species() to solve a # memory leak means that we have a subspecies taxon to set # during the first call to species(), but it has vanished by # the time a user subsequently calls sub_species() to get the # value. 
So we 'cheat' and just store the subspecies name in # our self hash, instead of the tree. Is this a problem for # a Species object? Can't decide --sendu # This can now be changed to deal with this information on the # fly. For now, the caching remains, but maybe we should just # let these things deal with mutable data as needed? -- cjfields $self->{'_sub_species'} = $sub; } return $ss_taxon->scientific_name; } else { # should we create a node here to be added to the tree? } } # fall back to direct storage on self $self->{'_sub_species'} = $sub if $sub; return $self->{'_sub_species'}; } =head2 variant Title : variant Usage : $obj->variant($newval) Function: Get/set variant information for this species object (strain, isolate, etc). Example : Returns : value of variant (a scalar) Args : new value (a scalar or undef, optional) =cut sub variant{ my ($self, $var) = @_; # TODO: instead of caching the raw name, cache the actual node instance. if (!defined $self->{'_variant'}) { my $var_taxon = $self->tree->find_node(-rank => 'variant'); if ($var_taxon) { if ($var) { $var_taxon->scientific_name($var); } return $var_taxon->scientific_name; } else { # should we create a node here to be added to the tree? } } # fall back to direct storage on self $self->{'_variant'} = $var if $var; return $self->{'_variant'}; } =head2 binomial Title : binomial Usage : $binomial = $self->binomial(); $binomial = $self->binomial('FULL'); Function: Returns a string "Genus species", or "Genus species subspecies", if the first argument is 'FULL' (and the species has a subspecies). Args : Optionally the string 'FULL' to get the full name including the subspecies. 
Note : This is just munged from the taxon() name =cut sub binomial { my ($self, $full) = @_; my $rank = $self->taxon->rank || 'no rank'; my ($species, $genus) = ($self->species, $self->genus); unless (defined $species) { $species = 'sp.'; $self->warn("requested binomial but classification was not set"); } $genus = '' unless( defined $genus); $species =~ s/$genus\s+//; my $bi = "$genus $species"; if (defined($full) && $full =~ /full/i) { my $ssp = $self->sub_species; if ($ssp) { $ssp =~ s/$bi\s+//; $ssp =~ s/$species\s+//; $bi .= " $ssp"; } } return $bi; } =head2 validate_species_name Title : validate_species_name Usage : $result = $self->validate_species_name($string); Function: Validate the species portion of the binomial Args : string Notes : The string following the "genus name" in the NCBI binomial is so variable that it's not clear that this is a useful function. Consider the binomials "Simian 11 rotavirus (serotype 3 / strain SA11-Patton)", or "St. Thomas 3 rotavirus", straight from GenBank. This is particularly problematic in microbes and viruses. As such, this isn't actually used automatically by any Bio::Species method. =cut sub validate_species_name { my( $self, $string ) = @_; return 1 if $string eq "sp."; return 1 if $string =~ /strain/; return 1 if $string =~ /^[a-z][\w\s-]+$/i; $self->throw("Invalid species name '$string'"); } sub validate_name { return 1; } =head2 organelle Title : organelle Usage : $self->organelle( $organelle ); $organelle = $self->organelle(); Function: Get or set the organelle name Example : $self->organelle('Chloroplast') Returns : The organelle name in a string Args : String, which is the organelle name Note : TODO: We currently do not know where the organelle definition will eventually go. This is stored in the source seqfeature, though, so the information isn't lost. 
=cut sub organelle { my($self) = shift; return $self->{'_organelle'} = shift if @_; return $self->{'_organelle'}; } =head2 Delegation The following methods delegate to the internal Bio::Taxon instance. This is mainly to allow code continue using older methods, with the mind to migrate to using Bio::Taxon and related methods when this class is deprecated. =cut sub node_name {shift->taxon->node_name(@_)} sub scientific_name {shift->taxon->node_name(@_)} sub id {shift->taxon->id(@_)} sub object_id {shift->taxon->id(@_)} sub ncbi_taxid {shift->taxon->ncbi_taxid(@_)} sub rank {shift->taxon->rank(@_)} sub division {shift->taxon->division(@_)} sub common_names {shift->taxon->common_names(@_)} sub common_name {shift->taxon->common_names(@_)} sub genetic_code {shift->taxon->genetic_code(@_)} sub mitochondrial_genetic_code {shift->taxon->mitochondrial_genetic_code(@_)} sub create_date { shift->taxon->create_date(@_)} sub pub_date { shift->taxon->pub_date(@_)} sub update_date { shift->taxon->update_date(@_)} sub db_handle { shift->taxon->db_handle(@_)} sub parent_id { shift->taxon->parent_id(@_)} sub parent_taxon_id { shift->taxon->parent_id(@_)} sub version { shift->taxon->version(@_)} sub authority { shift->taxon->authority(@_)} sub namespace { shift->taxon->namespace(@_)} sub ancestor { shift->taxon->ancestor(@_)} sub get_Parent_Node { shift->taxon->get_Parent_Node(@_)} sub each_Descendent { shift->taxon->each_Descendent(@_)} sub get_Children_Nodes { shift->taxon->get_Children_Nodes(@_)} sub remove_Descendant { shift->taxon->remove_Descendant(@_)} sub name { shift->taxon->name(@_)} =head2 taxon Title : taxon Usage : $obj->taxon Function : retrieve the internal Bio::Taxon instance Returns : A Bio::Taxon. 
If one is not previously set, an instance is created lazily Args : Bio::Taxon (optional) =cut sub taxon { my ($self, $taxon) = @_; if (!$self->{taxon} || $taxon) { $taxon ||= Bio::Taxon->new(); $self->{taxon} = $taxon; } $self->{taxon}; } =head2 tree Title : tree Usage : $obj->tree Function : Returns a Bio::Tree::Tree object Returns : A Bio::Tree::Tree. If one is not previously set, an instance is created lazily Args : Bio::Tree::Tree (optional) =cut sub tree { my ($self, $tree) = @_; if (!$self->{tree} || $tree) { $tree ||= Bio::Tree::Tree->new(); delete $tree->{_root_cleanup_methods}; $self->{tree} = $tree; } $self->{tree}; } sub DESTROY { my $self = shift; $self->tree->cleanup_tree; delete $self->{tree}; $self->taxon->node_cleanup; } 1; AlignIO.pm100644000766000024 3635114006427353 16347 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio# # BioPerl module for Bio::AlignIO # # based on the Bio::SeqIO module # by Ewan Birney # and Lincoln Stein # # Copyright Peter Schattner # # You may distribute this module under the same terms as perl itself # # History # September, 2000 AlignIO written by Peter Schattner # POD documentation - main docs before the code =head1 NAME Bio::AlignIO - Handler for AlignIO Formats =head1 SYNOPSIS use Bio::AlignIO; $inputfilename = "testaln.fasta"; $in = Bio::AlignIO->new(-file => $inputfilename , -format => 'fasta'); $out = Bio::AlignIO->new(-file => ">out.aln.pfam" , -format => 'pfam'); while ( my $aln = $in->next_aln() ) { $out->write_aln($aln); } # OR use Bio::AlignIO; open MYIN, '<', 'testaln.fasta' or die "Could not read file 'testaln.fasta': $!\n"; $in = Bio::AlignIO->newFh(-fh => \*MYIN, -format => 'fasta'); open my $MYOUT, '>', 'testaln.pfam' or die "Could not write file 'testaln.pfam': $!\n"; $out = Bio::AlignIO->newFh(-fh => $MYOUT, -format => 'pfam'); # World's smallest Fasta<->pfam format converter: print $out $_ while <$in>; =head1 DESCRIPTION L is a handler module for the formats in the AlignIO set, for example, L. 
It is the officially sanctioned way of getting at the alignment objects.
The resulting alignment is a L<Bio::Align::AlignI>-compliant object.

The idea is that you request an object for a particular format.
All the objects have a notion of an internal file that is read
from or written to. A particular AlignIO object instance is configured
for either input or output; you can think of it as a stream object.

Each object has functions:

   $stream->next_aln();

And:

   $stream->write_aln($aln);

Also:

   $stream->type() # returns 'INPUT' or 'OUTPUT'

As an added bonus, you can recover a filehandle that is tied to the
AlignIO object, allowing you to use the standard E<lt>E<gt> and print
operations to read and write alignment objects:

    use Bio::AlignIO;

    # read from standard input
    $stream = Bio::AlignIO->newFh(-format => 'Fasta');

    while ( $aln = <$stream> ) {
        # do something with $aln
    }

And:

    print $stream $aln; # when stream is in output mode

L<Bio::AlignIO> is patterned on the L<Bio::SeqIO> module and shares
most of its features. One significant difference is that
L<Bio::AlignIO> usually handles IO for only a single alignment at a time,
whereas L<Bio::SeqIO> handles IO for multiple sequences in a single stream.
The principal reason for this is that whereas simultaneously handling
multiple sequences is a common requirement, simultaneous handling of
multiple alignments is not. The only current exception is format
C<bl2seq> which parses results of the BLAST C<bl2seq> program and which
may produce several alignment pairs. This set of alignment pairs can
be read using multiple calls to L</next_aln>.

=head1 CONSTRUCTORS

=head2 Bio::AlignIO-E<gt>new()

   $seqIO = Bio::AlignIO->new(-file => 'filename',   -format=>$format);
   $seqIO = Bio::AlignIO->new(-fh   => \*FILEHANDLE, -format=>$format);
   $seqIO = Bio::AlignIO->new(-format => $format);
   $seqIO = Bio::AlignIO->new(-fh => \*STDOUT, -format => $format);

The L</new> class method constructs a new L<Bio::AlignIO> object.
The returned object can be used to retrieve or print alignment
objects.
L accepts the following parameters: =over 4 =item -file A file path to be opened for reading or writing. The usual Perl conventions apply: 'file' # open file for reading '>file' # open file for writing '>>file' # open file for appending '+new(-fh => \*STDIN); Note that you must pass filehandles as references to globs. If neither a filehandle nor a filename is specified, then the module will read from the @ARGV array or STDIN, using the familiar EE semantics. =item -format Specify the format of the file. Supported formats include: bl2seq Bl2seq Blast output clustalw clustalw (.aln) format emboss EMBOSS water and needle format fasta FASTA format maf Multiple Alignment Format mase mase (seaview) format mega MEGA format meme MEME format msf msf (GCG) format nexus Swofford et al NEXUS format pfam Pfam sequence alignment format phylip Felsenstein PHYLIP format prodom prodom (protein domain) format psi PSI-BLAST format selex selex (hmmer) format stockholm stockholm format Currently only those formats which were implemented in L have been incorporated into L. Specifically, C, C and C have only been implemented for input. See the specific module (e.g. L) for notes on supported versions. If no format is specified and a filename is given, then the module will attempt to deduce it from the filename suffix. If this is unsuccessful, C format is assumed. The format name is case insensitive; C, C and C are all treated equivalently. =back =head2 Bio::AlignIO-EnewFh() $fh = Bio::AlignIO->newFh(-fh => \*FILEHANDLE, -format=>$format); # read from STDIN or use @ARGV: $fh = Bio::AlignIO->newFh(-format => $format); This constructor behaves like L, but returns a tied filehandle rather than a L object. You can read sequences from this object using the familiar EE operator, and write to it using L. The usual array and $_ semantics work. 
For example, you can read all sequence objects into an array like this:

  @sequences = <$fh>;

Other operations, such as read(), sysread(), write(), close(), and printf()
are not supported.

=over 1

=item -flush

By default, all files (or filehandles) opened for writing alignments
will be flushed after each write_aln(), making the file immediately
usable. If you do not need this facility and would like to marginally
improve the efficiency of writing multiple sequences to the same file
(or filehandle), pass the -flush option '0' or any other value that
evaluates as defined but false:

  my $clustal = Bio::AlignIO->new( -file => " "clustalw" );
  my $msf = Bio::AlignIO->new(-file   => ">prot.msf",
                              -format => "msf",
                              -flush  => 0 ); # go as fast as we can!
  while($seq = $clustal->next_aln) { $msf->write_aln($seq) }

=back

=head1 OBJECT METHODS

See below for more detailed summaries. The main methods are:

=head2 $alignment = $AlignIO-E<gt>next_aln()

Fetch an alignment from a formatted file.

=head2 $AlignIO-E<gt>write_aln($aln)

Write the specified alignment to a file.

=head2 TIEHANDLE(), READLINE(), PRINT()

These provide the tie interface. See L<perltie> for more details.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution.
Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Peter Schattner Email: schattner@alum.mit.edu =head1 CONTRIBUTORS Jason Stajich, jason@bioperl.org =head1 APPENDIX The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ =cut # 'Let the code begin... package Bio::AlignIO; $Bio::AlignIO::VERSION = '1.7.8'; use strict; use Bio::Seq; use Bio::LocatableSeq; use Bio::SimpleAlign; use Bio::Tools::GuessSeqFormat; use base qw(Bio::Root::Root Bio::Root::IO); =head2 new Title : new Usage : $stream = Bio::AlignIO->new(-file => $filename, -format => 'Format') Function: Returns a new seqstream Returns : A Bio::AlignIO::Handler initialised with the appropriate format Args : -file => $filename -format => format -fh => filehandle to attach to -displayname_flat => 1 [optional] to force the displayname to not show start/end information =cut sub new { my ($caller,@args) = @_; my $class = ref($caller) || $caller; # or do we want to call SUPER on an object if $caller is an # object? 
if( $class =~ /Bio::AlignIO::(\S+)/ ) { my ($self) = $class->SUPER::new(@args); $self->_initialize(@args); return $self; } else { my %param = @args; @param{ map { lc $_ } keys %param } = values %param; # lowercase keys my $format = $param{'-format'} || $class->_guess_format( $param{-file} || $ARGV[0] ); unless ($format) { if ($param{-file}) { $format = Bio::Tools::GuessSeqFormat->new(-file => $param{-file}||$ARGV[0] )->guess; } elsif ($param{-fh}) { $format = Bio::Tools::GuessSeqFormat->new(-fh => $param{-fh}||$ARGV[0] )->guess; } } $format = "\L$format"; # normalize capitalization to lower case $class->throw("Unknown format given or could not determine it [$format]") unless $format; return unless( $class->_load_format_module($format) ); return "Bio::AlignIO::$format"->new(@args); } } =head2 newFh Title : newFh Usage : $fh = Bio::AlignIO->newFh(-file=>$filename,-format=>'Format') Function: does a new() followed by an fh() Example : $fh = Bio::AlignIO->newFh(-file=>$filename,-format=>'Format') $sequence = <$fh>; # read a sequence object print $fh $sequence; # write a sequence object Returns : filehandle tied to the Bio::AlignIO::Fh class Args : =cut sub newFh { my $class = shift; return unless my $self = $class->new(@_); return $self->fh; } =head2 fh Title : fh Usage : $obj->fh Function: Example : $fh = $obj->fh; # make a tied filehandle $sequence = <$fh>; # read a sequence object print $fh $sequence; # write a sequence object Returns : filehandle tied to the Bio::AlignIO::Fh class Args : =cut sub fh { my $self = shift; my $class = ref($self) || $self; my $s = Symbol::gensym; tie $$s,$class,$self; return $s; } =head2 format Title : format Usage : $format = $stream->format() Function: Get the alignment format Returns : alignment format Args : none =cut # format() method inherited from Bio::Root::IO # _initialize is where the heavy stuff will happen when new is called sub _initialize { my($self,@args) = @_; my ($flat,$alphabet,$width) = 
$self->_rearrange([qw(DISPLAYNAME_FLAT ALPHABET WIDTH)], @args); $self->force_displayname_flat($flat) if defined $flat; $self->alphabet($alphabet); $self->width($width) if defined $width; $self->_initialize_io(@args); 1; } =head2 _load_format_module Title : _load_format_module Usage : *INTERNAL AlignIO stuff* Function: Loads up (like use) a module at run time on demand Example : Returns : Args : =cut sub _load_format_module { my ($self,$format) = @_; my $module = "Bio::AlignIO::" . $format; my $ok; eval { $ok = $self->_load_module($module); }; if ( $@ ) { print STDERR <next_aln Function: reads the next $aln object from the stream Returns : a Bio::Align::AlignI compliant object Args : =cut sub next_aln { my ($self,$aln) = @_; $self->throw("Sorry, you cannot read from a generic Bio::AlignIO object."); } =head2 write_aln Title : write_aln Usage : $stream->write_aln($aln) Function: writes the $aln object into the stream Returns : 1 for success and 0 for error Args : Bio::Seq object =cut sub write_aln { my ($self,$aln) = @_; $self->throw("Sorry, you cannot write to a generic Bio::AlignIO object."); } =head2 _guess_format Title : _guess_format Usage : $obj->_guess_format($filename) Function: Example : Returns : guessed format of filename (lower case) Args : =cut sub _guess_format { my $class = shift; return unless $_ = shift; return 'clustalw' if /\.aln$/i; return 'emboss' if /\.(water|needle)$/i; return 'metafasta' if /\.metafasta$/; return 'fasta' if /\.(fasta|fast|seq|fa|fsa|nt|aa)$/i; return 'maf' if /\.maf/i; return 'mega' if /\.(meg|mega)$/i; return 'meme' if /\.meme$/i; return 'msf' if /\.(msf|pileup|gcg)$/i; return 'nexus' if /\.(nexus|nex)$/i; return 'pfam' if /\.(pfam|pfm)$/i; return 'phylip' if /\.(phylip|phlp|phyl|phy|ph)$/i; return 'psi' if /\.psi$/i; return 'stockholm' if /\.stk$/i; return 'selex' if /\.(selex|slx|selx|slex|sx)$/i; return 'xmfa' if /\.xmfa$/i; } sub DESTROY { my $self = shift; $self->close(); } sub TIEHANDLE { my $class = shift; return 
    bless {'alignio' => shift},$class;
}

sub READLINE {
    my $self = shift;
    return $self->{'alignio'}->next_aln() || undef unless wantarray;
    my (@list,$obj);
    push @list,$obj while $obj = $self->{'alignio'}->next_aln();
    return @list;
}

sub PRINT {
    my $self = shift;
    $self->{'alignio'}->write_aln(@_);
}

=head2 force_displayname_flat

 Title   : force_displayname_flat
 Usage   : $obj->force_displayname_flat($newval)
 Function:
 Example :
 Returns : value of force_displayname_flat (a scalar)
 Args    : on set, new value (a scalar or undef, optional)

=cut

sub force_displayname_flat{
    my $self = shift;
    return $self->{'_force_displayname_flat'} = shift if @_;
    return $self->{'_force_displayname_flat'} || 0;
}

=head2 alphabet

 Title   : alphabet
 Usage   : $obj->alphabet($newval)
 Function: Get/Set alphabet for purpose of passing to Bio::LocatableSeq creation
 Example : $obj->alphabet('dna');
 Returns : value of alphabet (a scalar)
 Args    : on set, new value (a scalar or undef, optional)

=cut

sub alphabet {
    my $self = shift;
    my $value = shift;
    if ( defined $value ) {
        $self->throw("Invalid alphabet $value") unless $value eq 'rna' || $value eq 'protein' || $value eq 'dna';
        $self->{'_alphabet'} = $value;
    }
    return $self->{'_alphabet'};
}

1;

# BioPerl-1.7.8/lib/Bio/Root/IO.pm

package Bio::Root::IO;
$Bio::Root::IO::VERSION = '1.7.8';
use strict;
use Symbol;
use IO::Handle;
use File::Copy;
use Fcntl;
use base qw(Bio::Root::Root);

# as of 2016, worked on most systems, but will test this in a RC
my %modes = ( 0 => 'r', 1 => 'w', 2 => 'rw' );

=head1 NAME

Bio::Root::IO - BioPerl base IO handling class

=head1 SYNOPSIS

    # Use stream I/O in your module
    $self->{'io'} = Bio::Root::IO->new(-file => "myfile");
    $self->{'io'}->_print("some stuff");
    my $line = $self->{'io'}->_readline();
    $self->{'io'}->_pushback($line);
    $self->{'io'}->close();

    # obtain platform-compatible filenames
    $path = Bio::Root::IO->catfile($dir, $subdir, $filename);
    # obtain a temporary file (created in $TEMPDIR)
    ($handle) = $io->tempfile();

=head1 DESCRIPTION

This module provides methods that will usually be needed for any sort
of file- or stream-related input/output, e.g., keeping track of a file
handle, transient printing and reading from the file handle, a close
method, automatically closing the handle on garbage collection, etc.

To use this for your own code you will either want to inherit from
this module, or instantiate an object for every file or stream you are
dealing with. In the first case this module will most likely not be
the first class off which your class inherits; therefore you need to
call _initialize_io() with the named parameters in order to set file
handle, open file, etc. automatically.

Most methods start with an underscore, indicating they are private. In
OO speak, they are not private but protected, that is, use them in
your module code, but client code of your module will usually not want
to call them (except those not starting with an underscore).

In addition this module contains a couple of convenience methods for
cross-platform safe tempfile creation and similar tasks. Some of the
modules these methods rely on may not be available on all platforms.
At present, loading File::Spec and File::Temp is attempted. This
module defines $PATHSEP, $TEMPDIR, and $ROOTDIR, which will always be
set, and $OPENFLAGS, which will be set if either of File::Spec or
File::Temp fails.

The -noclose boolean (accessed via the noclose method) prevents a
filehandle from being closed when the IO object is cleaned up. This is
special behavior for cases where an object like a parser shares a
filehandle with an object like an indexer, where it is not proper to
close the filehandle because it will continue to be reused until the
end of the stream is reached. In general you won't want to play with
this flag.
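
The run-time loading of File::Spec and File::Temp mentioned above can be sketched roughly as follows. This is a minimal, standalone illustration using only core modules, not the module's own code: the variable names ($have_spec, $have_temp) and the fallback to the current directory are assumptions made for this sketch.

```perl
use strict;
use warnings;

# Flags recording which helper modules could be loaded (names are
# hypothetical, chosen for this sketch only).
my ($have_spec, $have_temp) = (0, 0);
my ($tempdir, $rootdir);

# Load the modules at run time so that a failure is not fatal.
eval {
    require File::Spec;
    $have_spec = 1;
    $tempdir   = File::Spec->tmpdir();
    $rootdir   = File::Spec->rootdir();
    require File::Temp;
    $have_temp = 1;
};

# Assumed fallback for this sketch: use the current directory when no
# usable temporary directory was found.
$tempdir = '.' unless defined $tempdir && -d $tempdir && -w $tempdir;

# Create a temporary file only if File::Temp is available.
my ($fh, $file);
if ($have_temp) {
    ($fh, $file) = File::Temp::tempfile(DIR => $tempdir, UNLINK => 1);
    print {$fh} "some stuff\n";
    close $fh or die "Could not close '$file': $!";
}

print "tempdir: $tempdir\n";
```

Wrapping both require calls in a single eval means a failure of File::Spec also skips File::Temp, which is why the two flags are tracked separately.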
=head1 AUTHOR Hilmar Lapp =cut our ($FILESPECLOADED, $FILETEMPLOADED, $FILEPATHLOADED, $TEMPDIR, $PATHSEP, $ROOTDIR, $OPENFLAGS, $VERBOSE, $ONMAC, $HAS_EOL, ); my $TEMPCOUNTER; my $HAS_WIN32 = 0; BEGIN { $TEMPCOUNTER = 0; $FILESPECLOADED = 0; $FILETEMPLOADED = 0; $FILEPATHLOADED = 0; $VERBOSE = 0; # try to load those modules that may cause trouble on some systems eval { require File::Path; $FILEPATHLOADED = 1; }; if( $@ ) { print STDERR "Cannot load File::Path: $@" if( $VERBOSE > 0 ); # do nothing } # If on Win32, attempt to find Win32 package if($^O =~ /mswin/i) { eval { require Win32; $HAS_WIN32 = 1; }; } # Try to provide a path separator. Why doesn't File::Spec export this, # or did I miss it? if ($^O =~ /mswin/i) { $PATHSEP = "\\"; } elsif($^O =~ /macos/i) { $PATHSEP = ":"; } else { # unix $PATHSEP = "/"; } eval { require File::Spec; $FILESPECLOADED = 1; $TEMPDIR = File::Spec->tmpdir(); $ROOTDIR = File::Spec->rootdir(); require File::Temp; # tempfile creation $FILETEMPLOADED = 1; }; if( $@ ) { if(! defined($TEMPDIR)) { # File::Spec failed # determine tempdir if (defined $ENV{'TEMPDIR'} && -d $ENV{'TEMPDIR'} ) { $TEMPDIR = $ENV{'TEMPDIR'}; } elsif( defined $ENV{'TMPDIR'} && -d $ENV{'TMPDIR'} ) { $TEMPDIR = $ENV{'TMPDIR'}; } if($^O =~ /mswin/i) { $TEMPDIR = 'C:\TEMP' unless $TEMPDIR; $ROOTDIR = 'C:'; } elsif($^O =~ /macos/i) { $TEMPDIR = "" unless $TEMPDIR; # what is a reasonable default on Macs? $ROOTDIR = ""; # what is reasonable?? } else { # unix $TEMPDIR = "/tmp" unless $TEMPDIR; $ROOTDIR = "/"; } if (!( -d $TEMPDIR && -w $TEMPDIR )) { $TEMPDIR = '.'; # last resort } } # File::Temp failed (alone, or File::Spec already failed) # determine open flags for tempfile creation using Fcntl $OPENFLAGS = O_CREAT | O_EXCL | O_RDWR; for my $oflag (qw/FOLLOW BINARY LARGEFILE EXLOCK NOINHERIT TEMPORARY/){ my ($bit, $func) = (0, "Fcntl::O_" . 
                                               $oflag);
            no strict 'refs';
            $OPENFLAGS |= $bit if eval { $bit = &$func(); 1 };
        }
    }
    $ONMAC = "\015" eq "\n";
}

=head2 new

 Title   : new
 Usage   : my $io = Bio::Root::IO->new( -file => 'data.txt' );
 Function: Create new class instance. It automatically calls C<_initialize_io>.
 Args    : Same named parameters as C<_initialize_io>.
 Returns : A Bio::Root::IO object

=cut

sub new {
    my ($caller, @args) = @_;
    my $self = $caller->SUPER::new(@args);
    $self->_initialize_io(@args);
    return $self;
}

=head2 _initialize_io

 Title   : _initialize_io
 Usage   : $io->_initialize_io(@params);
 Function: Initializes filehandle and other properties from the parameters.
 Args    : The following named parameters are currently recognized:
              -file     name of file to read or write to
              -fh       file handle to read or write to (mutually exclusive
                        with -file and -string)
              -input    name of file, or filehandle (GLOB or IO::Handle object)
                        to read or write to
              -string   string to read from (will be converted to filehandle)
              -url      name of URL to open
              -flush    boolean flag to autoflush after each write
              -noclose  boolean flag, when set to true will not close a
                        filehandle (must explicitly call close($io->_fh))
              -retries  number of times to try a web fetch before failure
              -ua_parms when using -url, hashref of key => value parameters
                        to pass to LWP::UserAgent->new().
A useful value might be, for example, {timeout => 60 } (ua defaults to 180s) Returns : True =cut sub _initialize_io { my($self, @args) = @_; $self->_register_for_cleanup(\&_io_cleanup); my ($input, $noclose, $file, $fh, $string, $flush, $url, $retries, $ua_parms) = $self->_rearrange([qw(INPUT NOCLOSE FILE FH STRING FLUSH URL RETRIES UA_PARMS)], @args); my $mode; if ($url) { $retries ||= 5; require LWP::UserAgent; my $ua = LWP::UserAgent->new(%$ua_parms); my $http_result; my ($handle, $tempfile) = $self->tempfile(); CORE::close($handle); for (my $try = 1 ; $try <= $retries ; $try++) { $http_result = $ua->get($url, ':content_file' => $tempfile); $self->warn("[$try/$retries] tried to fetch $url, but server ". "threw ". $http_result->code . ". retrying...") if !$http_result->is_success; last if $http_result->is_success; } $self->throw("Failed to fetch $url, server threw ".$http_result->code) if !$http_result->is_success; $file = $tempfile; $mode = '>'; } delete $self->{'_readbuffer'}; delete $self->{'_filehandle'}; $self->noclose( $noclose) if defined $noclose; # determine whether the input is a file(name) or a stream if ($input) { if (ref(\$input) eq 'SCALAR') { # we assume that a scalar is a filename if ($file && ($file ne $input)) { $self->throw("Input file given twice: '$file' and '$input' disagree"); } $file = $input; } elsif (ref($input) && ((ref($input) eq 'GLOB') || $input->isa('IO::Handle'))) { # input is a stream $fh = $input; } else { # let's be strict for now $self->throw("Unable to determine type of input $input: ". "not string and not GLOB"); } } if (defined($file) && defined($fh)) { $self->throw("Providing both a file and a filehandle for reading - ". "only one please!"); } if ($string) { if (defined($file) || defined($fh)) { $self->throw("File or filehandle provided with -string, ". 
"please unset if you are using -string as a file"); } open $fh, '<', \$string or $self->throw("Could not read string: $!"); } if (defined($file) && ($file ne '')) { $self->file($file); ($mode, $file) = $self->cleanfile; $mode ||= '<'; my $action = ($mode =~ m/>/) ? 'write' : 'read'; $fh = Symbol::gensym(); open $fh, $mode, $file or $self->throw("Could not $action file '$file': $!"); } if (defined $fh) { # check filehandle to ensure it's one of: # a GLOB reference, as in: open(my $fh, "myfile"); # an IO::Handle or IO::String object # the UNIVERSAL::can added to fix Bug2863 unless ( ( ref $fh and ( ref $fh eq 'GLOB' ) ) or ( ref $fh and ( UNIVERSAL::can( $fh, 'can' ) ) and ( $fh->isa('IO::Handle') or $fh->isa('IO::String') ) ) ) { $self->throw("Object $fh does not appear to be a file handle"); } if ($HAS_EOL) { binmode $fh, ':raw:eol(LF-Native)'; } $self->_fh($fh); # if $fh not provided, defaults to STDIN and STDOUT } $self->_flush_on_write(defined $flush ? $flush : 1); return 1; } =head2 _fh Title : _fh Usage : $io->_fh($newval); Function: Get or set the file handle for the stream encapsulated. Args : Optional filehandle to use Returns : Filehandle for the stream =cut sub _fh { my ($self, $value) = @_; if ( defined $value) { $self->{'_filehandle'} = $value; } return $self->{'_filehandle'}; } =head2 mode Title : mode Usage : $io->mode(); $io->mode(-force => 1); Function: Determine if the object was opened for reading or writing Args : -force: Boolean. Once mode() has been called, the mode is cached for further calls to mode(). Use this argument to override this behavior and re-check the object's mode. Returns : Mode of the object: 'r' for readable 'w' for writable 'rw' for readable and writable '?' if mode could not be determined (e.g. for a -url) =cut sub mode { my ($self, %arg) = @_; # Method 1: IO::Handle::fdopen # my $iotest = new IO::Handle; # $iotest->fdopen( dup(fileno($fh)) , 'r' ); # if ($iotest->error == 0) { ... 
} # It did not actually seem to work under any platform, since there would no # error if the filehandle had been opened writable only. It could not be # hacked around when dealing with unseekable (piped) filehandles. # Method 2: readline, a.k.a. the <> operator # no warnings "io"; # my $line = <$fh>; # if (defined $line) { # $self->{'_mode'} = 'r'; # ... # It did not work well either because <> returns undef, i.e. querying the # mode() after having read an entire file returned 'w'. if ( $arg{-force} || not exists $self->{'_mode'} ) { # Determine stream mode my $mode; my $fh = $self->_fh; if (defined $fh) { # use fcntl if not Windows-based if ($^O !~ /MSWin32/) { my $m = fcntl($fh, F_GETFL, 0) || 0; $mode = exists $modes{$m & 3} ? $modes{$m & 3} : '?'; } else { # Determine read/write status of filehandle no warnings 'io'; if ( defined( read $fh, my $content, 0 ) ) { # Successfully read 0 bytes $mode = 'r' } if ( defined( syswrite $fh, '') ) { # Successfully wrote 0 bytes $mode ||= ''; $mode .= 'w'; } } } else { # Stream does not have a filehandle... cannot determine mode $mode = '?'; } # Save mode for future use $self->{'_mode'} = $mode; } return $self->{'_mode'}; } =head2 file Title : file Usage : $io->file('>'.$file); my $file = $io->file; Function: Get or set the name of the file to read or write. Args : Optional file name (including its mode, e.g. '<' for reading or '>' for writing) Returns : A string representing the filename and its mode. =cut sub file { my ($self, $value) = @_; if ( defined $value) { $self->{'_file'} = $value; } return $self->{'_file'}; } =head2 cleanfile Title : cleanfile Usage : my ($mode, $file) = $io->cleanfile; Function: Get the name of the file to read or write, stripped of its mode ('>', '<', '+>', '>>', etc). Args : None Returns : In array context, an array of the mode and the clean filename. 
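
As an illustration of the mode stripping described above, here is a small standalone sketch that applies the same pattern cleanfile() uses to a few file specifications. The helper name split_mode and the sample file names are hypothetical and exist only for this example.

```perl
use strict;
use warnings;

# Hypothetical helper: splits an optional open mode ('<', '>', '>>',
# '+<', '+>', ...) from the file name, using the same pattern as
# cleanfile().
sub split_mode {
    my ($spec) = @_;
    return $spec =~ m/^ (\+?[><]{1,2})?\s*(.*) $/x;
}

for my $spec ('>out.fa', '>> log.txt', '+<data.aln', 'plain.txt') {
    my ($mode, $file) = split_mode($spec);
    # $mode is undef when the specification carries no mode prefix
    printf "%-12s mode=%-6s file=%s\n",
        $spec, defined $mode ? $mode : '(none)', $file;
}
```

When no mode prefix is present the first element is undef, which is why _initialize_io falls back to '<' (read) in that case.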
=cut

sub cleanfile {
    my ($self) = @_;
    return ($self->{'_file'} =~ m/^ (\+?[><]{1,2})?\s*(.*) $/x);
}

=head2 format

 Title   : format
 Usage   : $io->format()
 Function: Get the format of a Bio::Root::IO sequence file or filehandle. Every
           object inheriting Bio::Root::IO is guaranteed to have a format.
 Args    : None
 Returns : Format of the file or filehandle, e.g. fasta, fastq, genbank, embl.

=cut

sub format {
    my ($self) = @_;
    my $format = (split '::', ref($self))[-1];
    return $format;
}

=head2 variant

 Title   : variant
 Usage   : $io->variant($newval)
 Function: Get or set the variant of a Bio::Root::IO sequence file or
           filehandle. The format variant depends on the specific format
           used. Note that not all formats have variants. Also, the
           Bio::Root::IO-implementing modules that require access to variants
           need to define a global hash that has the allowed variants as its
           keys.
 Args    : Optional new variant (a string, converted to lower case)
 Returns : Variant of the file or filehandle, e.g. sanger, solexa or illumina for
           the fastq format, or undef for formats that do not have variants.

=cut

sub variant {
    my ($self, $variant) = @_;
    if (defined $variant) {
        $variant = lc $variant;
        my $var_name = '%'.ref($self).'::variant';
        my %ok_variants = eval $var_name; # e.g. %Bio::Assembly::IO::ace::variant
        if (scalar keys %ok_variants == 0) {
            $self->throw("Could not validate variant because global variant ".
                "$var_name was not set or was empty\n");
        }
        if (not exists $ok_variants{$variant}) {
            $self->throw("$variant is not a valid variant of the " .
                $self->format . ' format');
        }
        $self->{variant} = $variant;
    }
    return $self->{variant};
}

=head2 _print

 Title   : _print
 Usage   : $io->_print(@lines)
 Function: Print lines of text to the IO stream object.
Args : List of strings to print Returns : True on success, undef on failure =cut sub _print { my $self = shift; my $fh = $self->_fh() || \*STDOUT; my $ret = print $fh @_; return $ret; } =head2 _insert Title : _insert Usage : $io->_insert($string,1) Function: Insert some text in a file at the given line number (1-based). Args : * string to write in file * line number to insert the string at Returns : True =cut sub _insert { my ($self, $string, $line_num) = @_; # Line number check if ($line_num < 1) { $self->throw("Could not insert text at line $line_num: the minimum ". "line number possible is 1."); } # File check my ($mode, $file) = $self->cleanfile; if (not defined $file) { $self->throw('Could not insert a line: IO object was initialized with '. 'something else than a file.'); } # Everything that needs to be written is written before we read it $self->flush; # Edit the file line by line (no slurping) $self->close; my $temp_file; my $number = 0; while (-e "$file.$number.temp") { $number++; } $temp_file = "$file.$number.temp"; copy($file, $temp_file); open my $fh1, '<', $temp_file or $self->throw("Could not read temporary file '$temp_file': $!"); open my $fh2, '>', $file or $self->throw("Could not write file '$file': $!"); while (my $line = <$fh1>) { if ($. == $line_num) { # right line for new data print $fh2 $string . $line; } else { print $fh2 $line; } } CORE::close $fh1; CORE::close $fh2; unlink $temp_file or $self->throw("Could not delete temporary file '$temp_file': $!"); # Line number check (again) if ( $. > 0 && $line_num > $. ) { $self->throw("Could not insert text at line $line_num: there are only ". "$. lines in file '$file'"); } # Re-open the file in append mode to be ready to add text at the end of it # when the next _print() statement comes open my $new_fh, '>>', $file or $self->throw("Could not append to file '$file': $!"); $self->_fh($new_fh); # If file is empty and we're inserting at line 1, simply append text to file if ( $. 
== 0 && $line_num == 1 ) { $self->_print($string); } return 1; } =head2 _readline Title : _readline Usage : local $Bio::Root::IO::HAS_EOL = 1; my $io = Bio::Root::IO->new(-file => 'data.txt'); my $line = $io->_readline(); $io->close; Function: Read a line of input and normalize all end of line characters. End of line characters are typically "\n" on Linux platforms, "\r\n" on Windows and "\r" on older Mac OS. By default, the _readline() method uses the value of $/, Perl's input record separator, to detect the end of each line. This means that you will not get the expected lines if your input has Mac-formatted end of line characters. Also, note that the current implementation does not handle pushed back input correctly unless the pushed back input ends with the value of $/. For each line parsed, its line ending, e.g. "\r\n" is converted to "\n", unless you provide the -raw argument. Altogether it is easier to let the PerlIO::eol module automatically detect the proper end of line character and normalize it to "\n". Do so by setting $Bio::Root::IO::HAS_EOL to 1. Args : -raw : Avoid converting end of line characters to "\n" This option has no effect when using $Bio::Root::IO::HAS_EOL = 1. 
 Returns : Line of input, or undef when there is nothing to read anymore

=cut

sub _readline {
    my ($self, %param) = @_;
    my $fh = $self->_fh or return;
    my $line;

    # if the buffer has been filled by _pushback then return the buffer
    # contents, rather than read from the filehandle
    if( @{$self->{'_readbuffer'} || [] } ) {
        $line = shift @{$self->{'_readbuffer'}};
    } else {
        $line = <$fh>;
    }

    # Note: In Windows the "-raw" parameter has no effect, because Perl already discards
    # the '\r' from the line when reading in text mode from the filehandle
    # ($line = <$fh>), and puts it back automatically when printing
    if( !$HAS_EOL && !$param{-raw} && (defined $line) ) {
        # don't strip line endings if -raw or $HAS_EOL is specified
        $line =~ s/\015\012/\012/g;         # Change all CR/LF pairs to LF
        $line =~ tr/\015/\n/ unless $ONMAC; # Change all single CRs to NEWLINE
    }
    return $line;
}

=head2 _pushback

 Title   : _pushback
 Usage   : $io->_pushback($newvalue)
 Function: Puts a line previously read with _readline back into a buffer.
           The buffer can hold as many lines as system memory permits.

           Note that this is only supported for pushing back data ending with
           the current, localized value of $/. Using this method to push
           modified data back onto the buffer stack is not supported; see bug
           843.
 Args    : newvalue
 Returns : True, or nothing if newvalue is false (no data is pushed back)

=cut

# fix for bug 843, this reveals some unsupported behavior

#sub _pushback {
#    my ($self, $value) = @_;
#    if (index($value, $/) >= 0) {
#        push @{$self->{'_readbuffer'}}, $value;
#    } else {
#        $self->throw("Pushing modified data back not supported: $value");
#    }
#}

sub _pushback {
    my ($self, $value) = @_;
    return unless $value;
    unshift @{$self->{'_readbuffer'}}, $value;
    return 1;
}

=head2 close

 Title   : close
 Usage   : $io->close()
 Function: Closes the file handle associated with this IO instance,
           except if -noclose was specified.
Args : None Returns : True =cut sub close { my ($self) = @_; # do not close if we explicitly asked not to return if $self->noclose; if( defined( my $fh = $self->{'_filehandle'} )) { $self->flush; return if ref $fh eq 'GLOB' && ( \*STDOUT == $fh || \*STDERR == $fh || \*STDIN == $fh ); # don't close IO::Strings CORE::close $fh unless ref $fh && $fh->isa('IO::String'); } $self->{'_filehandle'} = undef; delete $self->{'_readbuffer'}; return 1; } =head2 flush Title : flush Usage : $io->flush() Function: Flushes the filehandle Args : None Returns : True =cut sub flush { my ($self) = shift; if( !defined $self->{'_filehandle'} ) { $self->throw("Flush failed: no filehandle was active"); } if( ref($self->{'_filehandle'}) =~ /GLOB/ ) { my $oldh = select($self->{'_filehandle'}); $| = 1; select($oldh); } else { $self->{'_filehandle'}->flush(); } return 1; } =head2 noclose Title : noclose Usage : $io->noclose($newval) Function: Get or set the NOCLOSE flag - setting this to true will prevent a filehandle from being closed when an object is cleaned up or explicitly closed. 
Args : Optional new value (a scalar or undef) Returns : Value of noclose (a scalar) =cut sub noclose { my $self = shift; return $self->{'_noclose'} = shift if @_; return $self->{'_noclose'}; } =head2 _io_cleanup =cut sub _io_cleanup { my ($self) = @_; $self->close(); my $v = $self->verbose; # we are planning to cleanup temp files no matter what if ( exists($self->{'_rootio_tempfiles'}) and ref($self->{'_rootio_tempfiles'}) =~ /array/i and not $self->save_tempfiles ) { if( $v > 0 ) { warn( "going to remove files ", join(",", @{$self->{'_rootio_tempfiles'}}), "\n"); } unlink (@{$self->{'_rootio_tempfiles'}} ); } # cleanup if we are not using File::Temp if ( $self->{'_cleanuptempdir'} and exists($self->{'_rootio_tempdirs'}) and ref($self->{'_rootio_tempdirs'}) =~ /array/i and not $self->save_tempfiles ) { if( $v > 0 ) { warn( "going to remove dirs ", join(",", @{$self->{'_rootio_tempdirs'}}), "\n"); } $self->rmtree( $self->{'_rootio_tempdirs'}); } } =head2 exists_exe Title : exists_exe Usage : $exists = $io->exists_exe('clustalw'); $exists = Bio::Root::IO->exists_exe('clustalw') $exists = Bio::Root::IO::exists_exe('clustalw') Function: Determines whether the given executable exists either as file or within the path environment. The latter requires File::Spec to be installed. On Win32-based system, .exe is automatically appended to the program name unless the program name already ends in .exe. Args : Name of the executable Returns : 1 if the given program is callable as an executable, and 0 otherwise =cut sub exists_exe { my ($self, $exe) = @_; $self->throw("Must pass a defined value to exists_exe") unless defined $exe; $exe = $self if (!(ref($self) || $exe)); $exe .= '.exe' if(($^O =~ /mswin/i) && ($exe !~ /\.(exe|com|bat|cmd)$/i)); return $exe if ( -f $exe && -x $exe ); # full path and exists # Ewan's comment. I don't think we need this. 
People should not be # asking for a program with a pathseparator starting it # $exe =~ s/^$PATHSEP//; # Not a full path, or does not exist. Let's see whether it's in the path. if($FILESPECLOADED) { for my $dir (File::Spec->path()) { my $f = Bio::Root::IO->catfile($dir, $exe); return $f if( -f $f && -x $f ); } } return 0; } =head2 tempfile Title : tempfile Usage : my ($handle,$tempfile) = $io->tempfile(); Function: Create a temporary filename and a handle opened for reading and writing. Caveats: If you do not have File::Temp on your system you should avoid specifying TEMPLATE and SUFFIX. Args : Named parameters compatible with File::Temp: DIR (defaults to $Bio::Root::IO::TEMPDIR), TEMPLATE, SUFFIX. Returns : A 2-element array, consisting of temporary handle and temporary file name. =cut sub tempfile { my ($self, @args) = @_; my ($tfh, $file); my %params = @args; # map between naming with and without dash for my $key (keys(%params)) { if( $key =~ /^-/ ) { my $v = $params{$key}; delete $params{$key}; $params{uc(substr($key,1))} = $v; } else { # this is to upper case my $v = $params{$key}; delete $params{$key}; $params{uc($key)} = $v; } } $params{'DIR'} = $TEMPDIR if(! exists($params{'DIR'})); unless (exists $params{'UNLINK'} && defined $params{'UNLINK'} && ! $params{'UNLINK'} ) { $params{'UNLINK'} = 1; } else { $params{'UNLINK'} = 0; } if($FILETEMPLOADED) { if(exists($params{'TEMPLATE'})) { my $template = $params{'TEMPLATE'}; delete $params{'TEMPLATE'}; ($tfh, $file) = File::Temp::tempfile($template, %params); } else { ($tfh, $file) = File::Temp::tempfile(%params); } } else { my $dir = $params{'DIR'}; $file = $self->catfile( $dir, (exists($params{'TEMPLATE'}) ? $params{'TEMPLATE'} : sprintf( "%s.%s.%s", $ENV{USER} || 'unknown', $$, $TEMPCOUNTER++)) ); # sneakiness for getting around long filenames on Win32? 
        if( $HAS_WIN32 ) {
            $file = Win32::GetShortPathName($file);
        }

        # Try to make sure this will be marked close-on-exec
        # XXX: Win32 doesn't respect this, nor the proper fcntl,
        #      but may have O_NOINHERIT. This may or may not be in Fcntl.
        local $^F = 2;
        # Store caller's umask
        my $umask = umask();
        # Set a known umask
        umask(066);
        # Attempt to open the file
        if ( sysopen($tfh, $file, $OPENFLAGS, 0600) ) {
            # Reset umask
            umask($umask);
        }
        else {
            $self->throw("Could not write temporary file '$file': $!");
        }
    }

    if( $params{'UNLINK'} ) {
        push @{$self->{'_rootio_tempfiles'}}, $file;
    }

    return wantarray ? ($tfh,$file) : $tfh;
}

=head2 tempdir

 Title   : tempdir
 Usage   : my ($tempdir) = $io->tempdir(CLEANUP=>1);
 Function: Creates and returns the name of a new temporary directory.

           Note that you should not use this function for obtaining "the"
           temp directory. Use $Bio::Root::IO::TEMPDIR for that. Calling this
           method will in fact create a new directory.
 Args    : args - ( key CLEANUP ) indicates whether or not to cleanup
           dir on object destruction, other keys as specified by File::Temp
 Returns : The name of a new temporary directory.

=cut

sub tempdir {
    my ($self, @args) = @_;
    if ($FILETEMPLOADED && File::Temp->can('tempdir')) {
        return File::Temp::tempdir(@args);
    }

    # we have to do this ourselves, not good
    # we are planning to cleanup temp files no matter what
    my %params = @args;
    print "cleanup is " . $params{CLEANUP} . "\n";
    $self->{'_cleanuptempdir'} = ( defined $params{CLEANUP} &&
                                   $params{CLEANUP} == 1);
    my $tdir = $self->catfile( $TEMPDIR,
                               sprintf("dir_%s-%s-%s",
                                       $ENV{USER} || 'unknown',
                                       $$,
                                       $TEMPCOUNTER++));
    mkdir($tdir, 0755);
    push @{$self->{'_rootio_tempdirs'}}, $tdir;
    return $tdir;
}

=head2 catfile

 Title   : catfile
 Usage   : $path = Bio::Root::IO->catfile(@dirs, $filename);
 Function: Constructs a full pathname in a cross-platform safe way.

           If File::Spec exists on your system, this routine will merely
           delegate to it. Otherwise it tries to make a good guess.

           You should use this method whenever you construct a path name
           from directory and filename. Otherwise you risk breaking the
           cross-platform compatibility of your code.

           You can call this method both as a class and an instance method.

 Args    : components of the pathname (directories and filename, NOT an
           extension)
 Returns : a string

=cut

sub catfile {
    my ($self, @args) = @_;

    return File::Spec->catfile(@args) if $FILESPECLOADED;
    # this is clumsy and not very appealing, but how do we specify the
    # root directory?
    if($args[0] eq '/') {
        $args[0] = $ROOTDIR;
    }
    return join($PATHSEP, @args);
}

=head2 rmtree

 Title   : rmtree
 Usage   : Bio::Root::IO->rmtree($dirname );
 Function: Remove a full directory tree

           If File::Path exists on your system, this routine will merely
           delegate to it. Otherwise it runs a local version of that code.

           You should use this method to remove directories which contain
           files.

           You can call this method both as a class and an instance method.

 Args    : roots   - rootdir to delete or reference to list of dirs

           verbose - a boolean value, which if TRUE will cause
                     C<rmtree> to print a message each time it
                     examines a file, giving the name of the file, and
                     indicating whether it's using C<rmdir> or
                     C<unlink> to remove it, or that it's skipping it.
                     (defaults to FALSE)

           safe    - a boolean value, which if TRUE will cause C<rmtree>
                     to skip any files to which you do not have delete
                     access (if running under VMS) or write access (if
                     running under another OS). This will change in the
                     future when a criterion for 'delete permission'
                     under OSs other than VMS is settled.
(defaults to FALSE) Returns : number of files successfully deleted =cut # taken straight from File::Path VERSION = "1.0403" sub rmtree { my ($self, $roots, $verbose, $safe) = @_; if ( $FILEPATHLOADED ) { return File::Path::rmtree ($roots, $verbose, $safe); } my $force_writable = ($^O eq 'os2' || $^O eq 'dos' || $^O eq 'MSWin32' || $^O eq 'amigaos' || $^O eq 'cygwin'); my $Is_VMS = $^O eq 'VMS'; my @files; my $count = 0; $verbose ||= 0; $safe ||= 0; if ( defined($roots) && length($roots) ) { $roots = [$roots] unless ref $roots; } else { $self->warn("No root path(s) specified\n"); return 0; } my $root; for $root (@{$roots}) { $root =~ s#/\z##; (undef, undef, my $rp) = lstat $root or next; $rp &= 07777; # don't forget setuid, setgid, sticky bits if ( -d _ ) { # notabene: 0777 is for making readable in the first place, # it's also intended to change it to writable in case we have # to recurse in which case we are better than rm -rf for # subtrees with strange permissions chmod(0777, ($Is_VMS ? VMS::Filespec::fileify($root) : $root)) or $self->warn("Could not make directory '$root' read+writable: $!") unless $safe; if (opendir DIR, $root){ @files = readdir DIR; closedir DIR; } else { $self->warn("Could not read directory '$root': $!"); @files = (); } # Deleting large numbers of files from VMS Files-11 filesystems # is faster if done in reverse ASCIIbetical order @files = reverse @files if $Is_VMS; ($root = VMS::Filespec::unixify($root)) =~ s#\.dir\z## if $Is_VMS; @files = map("$root/$_", grep $_!~/^\.{1,2}\z/s,@files); $count += $self->rmtree([@files],$verbose,$safe); if ($safe && ($Is_VMS ? !&VMS::Filespec::candelete($root) : !-w $root)) { print "skipped '$root'\n" if $verbose; next; } chmod 0777, $root or $self->warn("Could not make directory '$root' writable: $!") if $force_writable; print "rmdir '$root'\n" if $verbose; if (rmdir $root) { ++$count; } else { $self->warn("Could not remove directory '$root': $!"); chmod($rp, ($Is_VMS ? 
VMS::Filespec::fileify($root) : $root)) or $self->warn("and can't restore permissions to " . sprintf("0%o",$rp) . "\n"); } } else { if ( $safe and ($Is_VMS ? !&VMS::Filespec::candelete($root) : !(-l $root || -w $root)) ) { print "skipped '$root'\n" if $verbose; next; } chmod 0666, $root or $self->warn( "Could not make file '$root' writable: $!") if $force_writable; warn "unlink '$root'\n" if $verbose; # delete all versions under VMS for (;;) { unless (unlink $root) { $self->warn("Could not unlink file '$root': $!"); if ($force_writable) { chmod $rp, $root or $self->warn("and can't restore permissions to " . sprintf("0%o",$rp) . "\n"); } last; } ++$count; last unless $Is_VMS && lstat $root; } } } return $count; } =head2 _flush_on_write Title : _flush_on_write Usage : $io->_flush_on_write($newval) Function: Boolean flag to indicate whether to flush the filehandle on writing when the end of a component is finished (Sequences, Alignments, etc) Args : Optional new value Returns : Value of _flush_on_write =cut sub _flush_on_write { my ($self, $value) = @_; if (defined $value) { $self->{'_flush_on_write'} = $value; } return $self->{'_flush_on_write'}; } =head2 save_tempfiles Title : save_tempfiles Usage : $io->save_tempfiles(1) Function: Boolean flag to indicate whether to retain tempfiles/tempdir Args : Value evaluating to TRUE or FALSE Returns : Boolean value : 1 = save tempfiles/tempdirs, 0 = remove (default) =cut sub save_tempfiles { my $self = shift; if (@_) { my $value = shift; $self->{save_tempfiles} = $value ? 
                                                   1 : 0;
    }
    return $self->{save_tempfiles} || 0;
}

1;

# BioPerl-1.7.8/lib/Bio/DB/SeqI.pm

#
# BioPerl module for Bio::DB::SeqI.pm
#
# Please direct questions and support issues to <bioperl-l@bioperl.org>
#
# Cared for by Ewan Birney
#
# Copyright Ewan Birney
#
# You may distribute this module under the same terms as perl itself
#

=head1 NAME

Bio::DB::SeqI - Abstract Interface for Sequence databases

=head1 SYNOPSIS

   # get a Bio::DB::SeqI somehow

   $seq = $seqdb->get_Seq_by_id('some-id');
   $seq = $seqdb->get_Seq_by_acc('some-accession-number');

   @ids = $seqdb->get_all_primary_ids();
   $stream = $seqdb->get_PrimarySeq_stream();
   while ( my $seq = $stream->next_seq() ) {
      # $seq is a PrimarySeqI compliant object
   }

=head1 DESCRIPTION

Abstract interface for a sequence database

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Ewan Birney

Email birney@ebi.ac.uk

=head1 APPENDIX

The rest of the documentation details each of the object methods.
Internal methods are usually preceded with a _

=cut

package Bio::DB::SeqI;
$Bio::DB::SeqI::VERSION = '1.7.8';
use strict;

use base qw(Bio::DB::RandomAccessI);

=head1 Methods inherited from Bio::DB::RandomAccessI

=head2 get_Seq_by_id

 Title   : get_Seq_by_id
 Usage   : $seq = $db->get_Seq_by_id('ROA1_HUMAN')
 Function: Gets a Bio::Seq object by its name
 Returns : a Bio::Seq object
 Args    : the id (as a string) of a sequence
 Throws  : "id does not exist" exception

=head2 get_Seq_by_acc

 Title   : get_Seq_by_acc
 Usage   : $seq = $db->get_Seq_by_acc('X77802');
 Function: Gets a Bio::Seq object by accession number
 Returns : A Bio::Seq object
 Args    : accession number (as a string)
 Throws  : "acc does not exist" exception

=head2 get_Seq_by_version

 Title   : get_Seq_by_version
 Usage   : $seq = $db->get_Seq_by_version('X77802.1');
 Function: Gets a Bio::Seq object by sequence version
 Returns : A Bio::Seq object
 Args    : accession.version (as a string)
 Throws  : "acc.version does not exist" exception

=head1 Methods [that were] specific for Bio::DB::SeqI

=head2 get_PrimarySeq_stream

 Title   : get_PrimarySeq_stream
 Usage   : $stream = $db->get_PrimarySeq_stream()
 Function: Makes a Bio::SeqIO compliant object
           which provides a single method, next_seq
 Returns : Bio::SeqIO
 Args    : none

=cut

sub get_PrimarySeq_stream{
   my ($self,@args) = @_;

   $self->throw("Object did not provide a PrimarySeq stream object");
}

=head2 get_all_primary_ids

 Title   : get_all_primary_ids
 Usage   : @ids = $seqdb->get_all_primary_ids()
 Function: gives an array of all the primary_ids of the
           sequence objects in the database. These
           may be ids (display style) or accession numbers
           or something else completely different - they
           *are not* meaningful outside of this database
           implementation.
Example : Returns : an array of strings Args : none =cut sub get_all_primary_ids{ my ($self,@args) = @_; $self->throw("Object did not provide a get_all_ids method"); } =head2 get_Seq_by_primary_id Title : get_Seq_by_primary_id Usage : $seq = $db->get_Seq_by_primary_id($primary_id_string); Function: Gets a Bio::Seq object by the primary id. The primary id in these cases has to come from $db->get_all_primary_ids. There is no other way to get (or guess) the primary_ids in a database. The other possibility is to get Bio::PrimarySeqI objects via the get_PrimarySeq_stream and the primary_id field on these objects are specified as the ids to use here. Returns : A Bio::Seq object Args : accession number (as a string) Throws : "acc does not exist" exception =cut sub get_Seq_by_primary_id { my ($self,@args) = @_; $self->throw("Abstract database call of get_Seq_by_primary_id. Your database". " has not implemented this method!"); } 1; Flat.pm100644000766000024 4157414006427353 16243 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio/DB# # # BioPerl module for Bio::DB::Flat # # Please direct questions and support issues to # # Cared for by Lincoln Stein # # You may distribute this module under the same terms as perl itself # POD documentation - main docs before the code =head1 NAME Bio::DB::Flat - Interface for indexed flat files =head1 SYNOPSIS $db = Bio::DB::Flat->new(-directory => '/usr/share/embl', -dbname => 'mydb', -format => 'embl', -index => 'bdb', -write_flag => 1); $db->build_index('/usr/share/embl/primate.embl', '/usr/share/embl/protists.embl'); $seq = $db->get_Seq_by_id('HSFOS'); @sequences = $db->get_Seq_by_acc('DIV' => 'primate'); $raw = $db->fetch_raw('HSFOS'); =head1 DESCRIPTION This object provides the basic mechanism to associate positions in files with primary and secondary name spaces. Unlike Bio::Index::Abstract (see L), this is specialized to work with the "flat index" and BerkeleyDB indexed flat file formats worked out at the 2002 BioHackathon. 
This object is a general front end to the underlying databases. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to one of the Bioperl mailing lists. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Support Please direct usage questions or support issues to the mailing list: I rather than to the module maintainer directly. Many experienced and reponsive experts will be able look at the problem and quickly address it. Please include a thorough description of the problem with code and data examples if at all possible. =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Lincoln Stein Email - lstein@cshl.org =head1 APPENDIX The rest of the documentation details each of the object methods. Internal methods are usually preceded with an "_" (underscore). =cut # Let the code begin... package Bio::DB::Flat; $Bio::DB::Flat::VERSION = '1.7.8'; use File::Spec; use base qw(Bio::Root::Root Bio::DB::RandomAccessI); use constant CONFIG_FILE_NAME => 'config.dat'; =head2 new Title : new Usage : my $db = Bio::DB::Flat->new( -directory => $root_directory, -dbname => 'mydb', -write_flag => 1, -index => 'bdb', -verbose => 0, -out => 'outputfile', -format => 'genbank'); Function: create a new Bio::DB::Flat object Returns : new Bio::DB::Flat object Args : -directory Root directory containing "config.dat" -write_flag If true, allows creation/updating. -verbose Verbose messages -out File to write to when write_seq invoked -index 'bdb' or 'binarysearch' Status : Public The required -directory argument indicates where the flat file indexes will be stored. 
The build_index() and write_seq() methods will automatically create subdirectories of this root directory. Each subdirectory will contain a human-readable configuration file named "config.dat" that specifies where the individual indexes are stored. The required -dbname argument gives a name to the database index. The index files will actually be stored in a like-named subdirectory underneath the root directory. The -write_flag enables writing new entries into the database as well as the creation of the indexes. By default the indexes will be opened read only. -index is one of "bdb" or "binarysearch" and indicates the type of index to generate. "bdb" corresponds to Berkeley DB. You *must* be using BerkeleyDB version 2 or higher, and have the Perl BerkeleyDB extension installed (DB_File will *not* work). "binarysearch" corresponds to the OBDA "flat" indexed file. The -out argument specifies the output file for writing objects created with write_seq(). The -format argument specifies the format of the input file or files. If the file suffix is one that Bioperl can already associate with a format then this is optional. 
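As an illustration of the -index argument described above, here is a minimal sketch of how a user-supplied value could be mapped to the scheme name recorded in "config.dat". The helper name normalize_index_scheme is hypothetical and is not part of Bio::DB::Flat; the two patterns mirror the permissive matching in this module's _initialize() method, and the sketch does not touch any database.

```perl
use strict;
use warnings;

# Hypothetical helper, for illustration only (not a Bio::DB::Flat method).
# It maps a -index value to the indexing scheme name, using the same two
# permissive patterns as _initialize().
sub normalize_index_scheme {
    my ($index) = @_;
    return 'BerkeleyDB/1' if $index =~ /bdb/;
    return 'flat/1'       if $index =~ /^(flat|binary)/;
    return $index;    # returned unchanged; new() rejects unknown schemes later
}

for my $arg (qw(bdb binarysearch flat)) {
    printf "%-12s => %s\n", $arg, normalize_index_scheme($arg);
}
```

Any value that matches neither pattern passes through as-is, which is why new() still checks the result against the two known scheme names before choosing a subclass.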
=cut sub new { my $class = shift; $class = ref($class) if ref($class); my $self = $class->SUPER::new(@_); # first we initialize ourselves my ($flat_directory,$dbname,$format) = $self->_rearrange([qw(DIRECTORY DBNAME FORMAT)],@_); defined $flat_directory or $self->throw('Please supply a -directory argument'); defined $dbname or $self->throw('Please supply a -dbname argument'); # set values from configuration file $self->directory($flat_directory); $self->dbname($dbname); $self->throw("Base directory $flat_directory doesn't exist") unless -e $flat_directory; $self->throw("$flat_directory isn't a directory") unless -d _; my $dbpath = File::Spec->catfile($flat_directory,$dbname); unless (-d $dbpath) { $self->debug("creating db directory $dbpath\n"); mkdir $dbpath,0777 or $self->throw("Can't create $dbpath: $!"); } $self->_read_config(); # but override with initialization values $self->_initialize(@_); $self->throw('you must specify an indexing scheme') unless $self->indexing_scheme; # now we figure out what subclass to instantiate my $index_type = $self->indexing_scheme eq 'BerkeleyDB/1' ? 'BDB' :$self->indexing_scheme eq 'flat/1' ? 'Binary' :$self->throw("unknown indexing scheme: " . $self->indexing_scheme); $format = $self->file_format; # because Michele and Lincoln did it differently # Michele's way is via a standalone concrete class if ($index_type eq 'Binary') { my $child_class = 'Bio::DB::Flat::BinarySearch'; eval "use $child_class"; $self->throw($@) if $@; push @_, ('-format', $format); return $child_class->new(@_); } # Lincoln uses Bio::SeqIO style delegation. 
my $child_class= "Bio\:\:DB\:\:Flat\:\:$index_type\:\:\L$format"; eval "use $child_class"; $self->throw($@) if $@; # rebless & reinitialize with the new class # (this prevents subclasses from forgetting to call our own initialization) bless $self,$child_class; $self->_initialize(@_); $self->_set_namespaces(@_); $self; } sub _initialize { my $self = shift; my ($flat_write_flag,$dbname,$flat_indexing,$flat_verbose,$flat_outfile,$flat_format) = $self->_rearrange([qw(WRITE_FLAG DBNAME INDEX VERBOSE OUT FORMAT)],@_); $self->write_flag($flat_write_flag) if defined $flat_write_flag; if (defined $flat_indexing) { # very permissive $flat_indexing = 'BerkeleyDB/1' if $flat_indexing =~ /bdb/; $flat_indexing = 'flat/1' if $flat_indexing =~ /^(flat|binary)/; $self->indexing_scheme($flat_indexing); } $self->verbose($flat_verbose) if defined $flat_verbose; $self->dbname($dbname) if defined $dbname; $self->out_file($flat_outfile) if defined $flat_outfile; $self->file_format($flat_format) if defined $flat_format; } sub _set_namespaces { my $self = shift; $self->primary_namespace($self->default_primary_namespace) unless defined $self->{flat_primary_namespace}; $self->secondary_namespaces($self->default_secondary_namespaces) unless defined $self->{flat_secondary_namespaces}; $self->file_format($self->default_file_format) unless defined $self->{flat_format}; } =head2 new_from_registry Title : new_from_registry Usage : $db = Bio::DB::Flat->new_from_registry(%config) Function: creates a new Bio::DB::Flat object in a Bio::DB::Registry- compatible fashion Returns : new Bio::DB::Flat Args : provided by the registry, see below Status : Public The following registry-configuration tags are recognized: location Root of the indexed flat file; corresponds to the new() method's -directory argument. 
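Note that the code for new_from_registry() also requires a "dbname" tag, which it passes to new() as -dbname. The following is a minimal sketch of that mapping, assuming a plain hash of registry tags; the function name registry_to_new_args is hypothetical and only illustrates the checks, it does not construct a database object.

```perl
use strict;
use warnings;

# Hypothetical helper, for illustration only. It applies the same two
# required-tag checks as new_from_registry() and returns the argument
# list that would be handed to new().
sub registry_to_new_args {
    my (%config) = @_;
    my $location = $config{location}
        or die "location tag must be specified.\n";
    my $dbname = $config{dbname}
        or die "dbname tag must be specified.\n";
    return ( -directory => $location, -dbname => $dbname );
}

# Example values are assumptions, not paths shipped with BioPerl.
my %args = registry_to_new_args(
    location => '/usr/share/embl',
    dbname   => 'mydb',
);
print "$_ => $args{$_}\n" for sort keys %args;
```

A registry entry that omits either tag fails before any index directory is opened.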
=cut sub new_from_registry { my ($self,%config) = @_; my $location = $config{'location'} or $self->throw('location tag must be specified.'); my $dbname = $config{'dbname'} or $self->throw('dbname tag must be specified.'); my $db = $self->new(-directory => $location, -dbname => $dbname, ); $db; } # accessors sub directory { my $self = shift; my $d = $self->{flat_directory}; $self->{flat_directory} = shift if @_; $d; } sub write_flag { my $self = shift; my $d = $self->{flat_write_flag}; $self->{flat_write_flag} = shift if @_; $d; } sub verbose { my $self = shift; my $d = $self->{flat_verbose}; $self->{flat_verbose} = shift if @_; $d; } sub out_file { my $self = shift; my $d = $self->{flat_outfile}; $self->{flat_outfile} = shift if @_; $d; } sub dbname { my $self = shift; my $d = $self->{flat_dbname}; $self->{flat_dbname} = shift if @_; $d; } sub primary_namespace { my $self = shift; my $d = $self->{flat_primary_namespace}; $self->{flat_primary_namespace} = shift if @_; $d; } # get/set secondary namespace(s) # pass an array ref. # get an array ref in scalar context, list in list context. sub secondary_namespaces { my $self = shift; my $d = $self->{flat_secondary_namespaces}; $self->{flat_secondary_namespaces} = (ref($_[0]) eq 'ARRAY' ? shift : [@_]) if @_; return unless $d; $d = [$d] if $d && ref($d) ne 'ARRAY'; # just paranoia return wantarray ? 
@$d : $d; } # return the file format sub file_format { my $self = shift; my $d = $self->{flat_format}; $self->{flat_format} = shift if @_; $d; } # return the alphabet sub alphabet { my $self = shift; my $d = $self->{flat_alphabet}; $self->{flat_alphabet} = shift if @_; $d; } sub parse_one_record { my $self = shift; my $fh = shift; my $parser = $self->{cached_parsers}{fileno($fh)} ||= Bio::SeqIO->new(-fh=>$fh,-format=>$self->default_file_format); my $seq = $parser->next_seq or return; $self->{flat_alphabet} ||= $seq->alphabet; my $ids = $self->seq_to_ids($seq); return $ids; } # return the indexing scheme sub indexing_scheme { my $self = shift; my $d = $self->{flat_indexing}; $self->{flat_indexing} = shift if @_; $d; } sub add_flat_file { my $self = shift; my ($file_path,$file_length,$nf) = @_; # check that file_path is absolute unless (File::Spec->file_name_is_absolute($file_path)) { $file_path = File::Spec->rel2abs($file_path); } -r $file_path or $self->throw("flat file $file_path cannot be read: $!"); my $current_size = -s _; if (defined $file_length) { $current_size == $file_length or $self->throw("flat file $file_path has changed size. Was $file_length bytes; now $current_size"); } else { $file_length = $current_size; } unless (defined $nf) { $self->{flat_file_index} = 0 unless exists $self->{flat_file_index}; $nf = $self->{flat_file_index}++; } $self->{flat_flat_file_path}{$nf} = $file_path; $self->{flat_flat_file_no}{$file_path} = $nf; $nf; } sub write_config { my $self = shift; $self->write_flag or $self->throw("cannot write configuration file because write_flag is not set"); my $path = $self->_config_path; open my $F, '>', $path or $self->throw("Could not write file '$path': $!"); my $index_type = $self->indexing_scheme; print $F "index\t$index_type\n"; my $format = $self->file_format; my $alphabet = $self->alphabet; my $alpha = $alphabet ? 
"/$alphabet" : ''; print $F "format\tURN:LSID:open-bio.org:${format}${alpha}\n"; my @filenos = $self->_filenos or $self->throw("cannot write config file because no flat files defined"); for my $nf (@filenos) { my $path = $self->{flat_flat_file_path}{$nf}; my $size = -s $path; print $F join("\t","fileid_$nf",$path,$size),"\n"; } # write primary namespace my $primary_ns = $self->primary_namespace or $self->throw('cannot write config file because no primary namespace defined'); print $F join("\t",'primary_namespace',$primary_ns),"\n"; # write secondary namespaces my @secondary = $self->secondary_namespaces; print $F join("\t",'secondary_namespaces',@secondary),"\n"; close $F or $self->throw("close error on $path: $!"); } sub files { my $self = shift; return unless $self->{flat_flat_file_no}; return keys %{$self->{flat_flat_file_no}}; } sub write_seq { my $self = shift; my $seq = shift; $self->write_flag or $self->throw("cannot write sequences because write_flag is not set"); my $file = $self->out_file or $self->throw('no outfile defined; use the -out argument to new()'); my $seqio = $self->{flat_cached_parsers}{$file} ||= Bio::SeqIO->new(-Format => $self->file_format, -file => ">$file") or $self->throw("couldn't create Bio::SeqIO object"); my $fh = $seqio->_fh or $self->throw("couldn't get filehandle from Bio::SeqIO object"); my $offset = tell($fh); $seqio->write_seq($seq); my $length = tell($fh)-$offset; my $ids = $self->seq_to_ids($seq); $self->_store_index($ids,$file,$offset,$length); $self->{flat_outfile_dirty}++; } sub close { my $self = shift; return unless $self->{flat_outfile_dirty}; $self->write_config; delete $self->{flat_outfile_dirty}; delete $self->{flat_cached_parsers}{$self->out_file}; } sub _filenos { my $self = shift; return unless $self->{flat_flat_file_path}; return keys %{$self->{flat_flat_file_path}}; } # read the configuration file sub _read_config { my $self = shift; my $path = $self->_config_path; return unless -e $path; open my $F, '<', $path 
or $self->throw("Could not read file '$path': $!"); my %config; while (<$F>) { chomp; my ($tag,@values) = split "\t"; $config{$tag} = \@values; } CORE::close $F or $self->throw("close error on $path: $!"); $config{index}[0] =~ m~(flat/1|BerkeleyDB/1)~ or $self->throw("invalid configuration file $path: no index line"); $self->indexing_scheme($1); if ($config{format}) { # handle LSID format if ($config{format}[0] =~ /^URN:LSID:open-bio\.org:(\w+)(?:\/(\w+))/) { $self->file_format($1); $self->alphabet($2); } else { # compatibility with older versions $self->file_format($config{format}[0]); } } # set up primary namespace my $primary_namespace = $config{primary_namespace}[0] or $self->throw("invalid configuration file $path: no primary namespace defined"); $self->primary_namespace($primary_namespace); # set up secondary namespaces (may be empty) $self->secondary_namespaces($config{secondary_namespaces}); # get file paths and their normalization information my @normalized_files = grep {$_ ne ''} map {/^fileid_(\S+)/ && $1} keys %config; for my $nf (@normalized_files) { my ($file_path,$file_length) = @{$config{"fileid_${nf}"}}; $self->add_flat_file($file_path,$file_length,$nf); } 1; } sub _config_path { my $self = shift; $self->_catfile($self->_config_name); } sub _catfile { my $self = shift; my $component = shift; File::Spec->catfile($self->directory,$self->dbname,$component); } sub _config_name { CONFIG_FILE_NAME } sub _path2fileno { my $self = shift; my $path = shift; return $self->add_flat_file($path) unless exists $self->{flat_flat_file_no}{$path}; $self->{flat_flat_file_no}{$path}; } sub _fileno2path { my $self = shift; my $fileno = shift; $self->{flat_flat_file_path}{$fileno}; } sub _files { my $self = shift; my $paths = $self->{flat_flat_file_no}; return keys %$paths; } =head2 fetch Title : fetch Usage : $index->fetch( $id ) Function: Returns a Bio::Seq object from the index Example : $seq = $index->fetch( 'dJ67B12' ) Returns : Bio::Seq object Args : ID 
Deprecated. Use get_Seq_by_id instead. =cut sub fetch { shift->get_Seq_by_id(@_) } =head2 To Be Implemented in Subclasses The following methods MUST be implemented by subclasses. =cut # create real live Bio::Seq object sub get_Seq_by_id { my $self = shift; my $id = shift; $self->throw_not_implemented; } # fetch array of Bio::Seq objects sub get_Seq_by_acc { my $self = shift; return $self->get_Seq_by_id(shift) if @_ == 1; my ($ns,$key) = @_; $self->throw_not_implemented; } sub fetch_raw { my ($self,$id,$namespace) = @_; $self->throw_not_implemented; } sub default_file_format { my $self = shift; $self->throw_not_implemented; } sub _store_index { my $self = shift; my ($ids,$file,$offset,$length) = @_; $self->throw_not_implemented; } =head2 May Be Overridden in Subclasses The following methods MAY be overridden by subclasses. =cut sub default_primary_namespace { return "ACC"; } sub default_secondary_namespaces { return; } sub seq_to_ids { my $self = shift; my $seq = shift; my %ids; $ids{$self->primary_namespace} = $seq->accession_number; \%ids; } sub DESTROY { my $self = shift; $self->close; } 1; Qual.pm100644000766000024 4022114006427353 16243 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio/DB# # BioPerl module for Bio::DB::Qual # # You may distribute this module under the same terms as perl itself # =head1 NAME Bio::DB::Qual - Fast indexed access to quality files =head1 SYNOPSIS use Bio::DB::Qual; # create database from directory of qual files my $db = Bio::DB::Qual->new('/path/to/qual/files/'); my @ids = $db->get_all_primary_ids; # Simple access my @qualarr = @{$db->qual('CHROMOSOME_I',4_000_000 => 4_100_000)}; my @revqual = @{$db->qual('CHROMOSOME_I',4_100_000 => 4_000_000)}; my $length = $db->length('CHROMOSOME_I'); my $header = $db->header('CHROMOSOME_I'); # Access to sequence objects. See Bio::PrimarySeqI. 
my $obj = $db->get_Qual_by_id('CHROMOSOME_I'); my @qual = @{$obj->qual}; my @subqual = @{$obj->subqual(4_000_000 => 4_100_000)}; my $length = $obj->length; # Loop through sequence objects my $stream = $db->get_PrimarySeq_stream; while (my $qual = $stream->next_seq) { # Bio::Seq::PrimaryQual operations } # Filehandle access my $fh = Bio::DB::Qual->newFh('/path/to/qual/files/'); while (my $qual = <$fh>) { # Bio::Seq::PrimaryQual operations } # Tied hash access (each value is a reference to an array of scores) tie my %qualities,'Bio::DB::Qual','/path/to/qual/files/'; print "@{ $qualities{'CHROMOSOME_I:1,20000'} }"; =head1 DESCRIPTION Bio::DB::Qual provides indexed access to a single qual file, several files, or a directory of files. It provides random access to each quality score entry without having to read the file from the beginning. Access to subqualities (portions of a quality score) is provided, although, unlike Bio::DB::Fasta, the full quality score has to be brought into memory. Bio::DB::Qual is based on Bio::DB::IndexedBase. See this module's documentation for details. The qual files should contain decimal quality scores. Entries may have any line length up to 65,536 characters, and different line lengths are allowed in the same file. However, within a quality score entry, all lines must be the same length except for the last. An error will be thrown if this is not the case. The module uses /^>(\S+)/ to extract the primary ID of each quality score from the qual header. See -makeid in Bio::DB::IndexedBase to pass a callback routine to reversibly modify this primary ID, e.g. if you wish to extract a specific portion of the gi|gb|abc|xyz GenBank IDs. =head1 DATABASE CREATION AND INDEXING The object-oriented constructor is new(), the filehandle constructor is newFh() and the tied hash constructor is tie(). They all allow one to index a single qual file, several files, or a directory of files. See Bio::DB::IndexedBase. 
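To make the indexing step concrete, the sketch below shows how a primary ID can be pulled from a qual header with the pattern /^>(\S+)/ that _calculate_offsets() matches against, and what a -makeid style callback might look like. Both function names and the sample header are hypothetical; the callback is assumed to receive the header line and return the ID to index under, so check Bio::DB::IndexedBase for the exact contract.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the default ID extraction: the first
# whitespace-delimited token after '>', or undef for a non-header line.
sub primary_id_from_header {
    my ($line) = @_;
    return $line =~ /^>(\S+)/ ? $1 : undef;
}

# Hypothetical -makeid style callback: keep only the accession field of a
# gi|...|gb|...| header, falling back to the default extraction otherwise.
sub accession_from_header {
    my ($line) = @_;
    return $line =~ /^>gi\|\d+\|\w+\|([^|\s]+)/
        ? $1
        : primary_id_from_header($line);
}

my $header = ">gi|12345|gb|AB000001.1| sample read\n";    # made-up header
print primary_id_from_header($header), "\n";
print accession_from_header($header),  "\n";
```

A callback like the second function would be passed as -makeid => \&accession_from_header when building the index, under the assumption stated above.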
=head1 SEE ALSO L L L =head1 LIMITATIONS When a quality score is deleted from one of the qual files, this deletion is not detected by the module and removed from the index. As a result, a "ghost" entry will remain in the index and will return garbage results if accessed. Currently, the only way to accommodate deletions is to rebuild the entire index, either by deleting it manually, or by passing -reindex=E1 to new() when initializing the module. All quality score lines for a given quality score must have the same length except for the last (not sure why there is this limitation). This is not problematic for sequences but could be annoying for quality scores. A workaround is to make sure that your quality scores fit on no more than 2 lines. Another solution could be to padd them with blank spaces so that each line has the same number of characters (maybe this padding should be implemented in Bio::SeqIO::qual?). =head1 AUTHOR Florent E Angly Eflorent . angly @ gmail-dot-comE. Module largely based on and adapted from Bio::DB::Fasta by Lincoln Stein. Copyright (c) 2007 Florent E Angly. This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. =head1 APPENDIX The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ For BioPerl-style access, the following methods are provided: =head2 get_Seq_by_id Title : get_Seq_by_id, get_Seq_by_acc, get_Seq_by_version, get_Seq_by_primary_id, get_Qual_by_id, get_qual_by_acc, get_qual_by_version, get_qual_by_primary_id, Usage : my $seq = $db->get_Seq_by_id($id); Function: Given an ID, fetch the corresponding sequence from the database. Returns : A Bio::PrimarySeq::Fasta object (Bio::PrimarySeqI compliant) Note that to save resource, Bio::PrimarySeq::Fasta sequence objects only load the sequence string into memory when requested using seq(). 
See L for methods provided by the sequence objects returned from get_Seq_by_id() and get_PrimarySeq_stream(). Args : ID =head2 get_PrimarySeq_stream Title : get_Seq_stream, get_PrimarySeq_stream Usage : my $stream = $db->get_Seq_stream(); Function: Get a stream of Bio::PrimarySeq::Fasta objects. The stream supports a single method, next_seq(). Each call to next_seq() returns a new Bio::PrimarySeq::Fasta sequence object, until no more sequences remain. Returns : A Bio::DB::Indexed::Stream object Args : None =head1 For simple access, the following methods are provided: =cut package Bio::DB::Qual; $Bio::DB::Qual::VERSION = '1.7.8'; use strict; use IO::File; use File::Spec; use base qw(Bio::DB::IndexedBase); our $obj_class = 'Bio::Seq::PrimaryQual::Qual'; our $file_glob = '*.{qual,QUAL,qa,QA}'; =head2 new Title : new Usage : my $db = Bio::DB::Qual->new( $path, %options); Function: Initialize a new database object. When indexing a directory, files ending in .qual,qa are indexed by default. Returns : A new Bio::DB::Qual object Args : A single file, or path to dir, or arrayref of files Optional arguments: see Bio::DB::IndexedBase =cut sub _calculate_offsets { # Bio::DB::IndexedBase calls this to calculate offsets my ($self, $fileno, $file, $offsets) = @_; my $fh = IO::File->new($file) or $self->throw("Could not open $file: $!"); binmode $fh; warn "Indexing $file\n" if $self->{debug}; my ($offset, @ids, $linelen, $headerlen, $count, $qual_lines, $last_line, $numres, %offsets); my ($l3_len, $l2_len, $l_len, $blank_lines) = (0, 0, 0, 0); my $termination_length = $self->{termination_length}; while (my $line = <$fh>) { # Account for crlf-terminated Windows files if (index($line, '>') == 0) { if ($line =~ /^>(\S+)/) { print STDERR "Indexed $count quality scores...\n" if $self->{debug} && (++$count%1000) == 0; $self->_check_linelength($linelen); my $pos = tell($fh); if (@ids) { my $strlen = $pos - $offset - length($line); $strlen -= $termination_length * $qual_lines; my $ppos = 
&{$self->{packmeth}}($offset, $strlen, $numres, $linelen, $headerlen, Bio::DB::IndexedBase::NA, $fileno); for my $id (@ids) { $offsets->{$id} = $ppos; } $numres = 0; } @ids = $self->_makeid($line); ($offset, $headerlen, $linelen, $qual_lines) = ($pos, length $line, 0, 0); ($l3_len, $l2_len, $l_len, $blank_lines) = (0, 0, 0, 0); } else { # Catch bad header lines, bug 3172 $self->throw("FASTA header doesn't match '>(\\S+)': $line"); } } elsif ($line !~ /\S/) { # Skip blank line $blank_lines++; next; } else { # Need to check every line :( $l3_len = $l2_len; $l2_len = $l_len; $l_len = length $line; if (Bio::DB::IndexedBase::DIE_ON_MISSMATCHED_LINES) { if ( ($l3_len > 0) && ($l2_len > 0) && ($l3_len != $l2_len) ) { my $fap = substr($line, 0, 20).".."; $self->throw("Each line of the qual entry must be the same ". "length except the last. Line above #$. '$fap' is $l2_len". " != $l3_len chars."); } if ($blank_lines) { # Blank lines not allowed in entry $self->throw("Blank lines can only precede header lines, ". 
"found preceding line #$."); } } $linelen ||= length $line; $qual_lines++; $numres += scalar(split /\s+/, $line); } $last_line = $line; } # Process last entry $self->_check_linelength($linelen); my $pos = tell($fh); if (@ids) { my $strlen = $pos - $offset; if ($linelen == 0) { $strlen = 0; } else { if ($last_line !~ /\s$/) { $qual_lines--; } $strlen -= $termination_length * $qual_lines; } my $ppos = &{$self->{packmeth}}($offset, $strlen, $numres, $linelen, $headerlen, Bio::DB::IndexedBase::NA, $fileno); for my $id (@ids) { $offsets->{$id} = $ppos; } } return \%offsets; } # for backward compatibility sub get_PrimaryQual_stream { my $self = shift; return $self->get_PrimarySeq_stream; } # for backward compatibility sub get_Qual_by_id { my ($self, $id) = @_; return $self->get_Seq_by_id($id); } *get_qual_by_version = *get_qual_by_primary_id = *get_qual_by_acc = \&get_Qual_by_id; =head2 qual Title : qual, quality, subqual Usage : # All quality scores my @qualarr = @{$qualdb->subqual($id)}; # Subset of the quality scores my @subqualarr = @{$qualdb->subqual($id, $start, $stop, $strand)}; # or... my @subqualarr = @{$qualdb->subqual($compound_id)}; Function: Get a subqual of an entry in the database. For your convenience, the sequence to extract can be specified with any of the following compound IDs: $db->qual("$id:$start,$stop") $db->qual("$id:$start..$stop") $db->qual("$id:$start-$stop") $db->qual("$id:$start,$stop/$strand") $db->qual("$id:$start..$stop/$strand") $db->qual("$id:$start-$stop/$strand") $db->qual("$id/$strand") If $stop is less than $start, then the reverse complement of the sequence is returned. Avoid using it if possible since this goes against Bio::Seq conventions. Returns : Reference to an array of quality scores Args : Compound ID of entry to retrieve or ID, optional start (defaults to 1), optional end (defaults to the number of quality scores for this sequence), and strand (defaults to 1). 
=cut sub subqual { my ($self, $id, $start, $stop, $strand) = @_; # Quality values in a quality score can have 1 or 2 digits and are separated # by one (or several?) spaces. Thus contrary to Bio::DB::Fasta, here there # is no easy way match the position of a quality value to its position in # the quality string. # As a consequence, if a subqual of the quality is requested, we still need # to grab the full quality string first - performance penalty for big # quality scores :( # I think there is no way around starting at the begining of the quality # score but maybe there is a resource-efficient way of starting at the # begining of the quality score and stopping when the the position of the # last quality value requested is reached?? $self->throw('Need to provide a sequence ID') if not defined $id; ($id, $start, $stop, $strand) = $self->_parse_compound_id($id, $start, $stop, $strand); # Position in quality string my $string_start = 1; my $string_stop = $self->strlen($id); # Fetch full quality string my $fh = $self->_fh($id) or return; my $filestart = $self->_calc_offset($id, $string_start); my $filestop = $self->_calc_offset($id, $string_stop ); seek($fh, $filestart,0); my $data; read($fh, $data, $filestop-$filestart+1); # Process quality score $data =~ tr/\n\r//d; #strip control characters my $subqual = 0; $subqual = 1 if ( $start || $stop ); my @data; if ( $subqual || ($strand == -1) ) { @data = split / /, $data, $stop+1; my $length = scalar(@data); $start = 1 if $start < 1; $stop = $length if $stop > $length; pop @data if ($stop != $length); splice @data, 0, $start-1; @data = reverse(@data) if $strand == -1; $data = join ' ', @data; } else { @data = split / /, $data; } return \@data; } *qual = *quality = \&subqual; =head2 header Title : header Usage : my $header = $db->header($id); Function: Get the header line (ID and description fields) of the specified entry. 
Returns : String Args : ID of entry =cut sub header { my ($self, $id) = @_; $self->throw('Need to provide a sequence ID') if not defined $id; my ($offset, $headerlen) = (&{$self->{unpackmeth}}($self->{offsets}{$id}))[0,4]; $offset -= $headerlen; my $data; my $fh = $self->_fh($id) or return; seek($fh, $offset, 0); read($fh, $data, $headerlen); # On Windows chomp removes '\n' but leaves '\r' # when reading '\r\n' in binary mode, $data =~ tr/\n\r//d; #strip control characters substr($data, 0, 1) = ''; return $data; } #------------------------------------------------------------- # Tied hash overrides # sub FETCH { return shift->subqual(@_); } #------------------------------------------------------------- # Bio::Seq::PrimaryQual compatibility # # Usage is the same as in Bio::Seq::PrimaryQual package Bio::Seq::PrimaryQual::Qual; $Bio::Seq::PrimaryQual::Qual::VERSION = '1.7.8'; use overload '""' => 'display_id'; use base qw(Bio::Root::Root Bio::Seq::PrimaryQual); sub new { my ($class, @args) = @_; my $self = $class->SUPER::new(@args); my ($db, $id, $start, $stop) = $self->_rearrange( [qw(DATABASE ID START STOP)], @args); $self->{db} = $db; $self->{id} = $id; $self->{stop} = $stop || $db->length($id); $self->{start} = $start || ($self->{stop} > 0 ? 1 : 0); # handle 0-length seqs return $self; } sub qual { my $self = shift; my $qual = $self->{db}->qual($self->{id}, $self->{start}, $self->{stop}); return $qual; } sub subqual { my ($self, $start, $stop) = @_; return $self->trunc($start, $stop)->qual; } sub trunc { # Override Bio::Seq::QualI trunc() method. This way, we create an object # that does not store the quality array in memory. my ($self, $start, $stop) = @_; $self->throw( "$stop is smaller than $start. If you want to truncate and reverse ". "complement, you must call trunc followed by revcom." 
) if $start > $stop; if ($self->{start} <= $self->{stop}) { $start = $self->{start}+$start-1; $stop = $self->{start}+$stop-1; } else { $start = $self->{start}-($start-1); $stop = $self->{start}-($stop-1); } my $obj = $self->new( -database => $self->{db}, -id => $self->{id}, -start => $start, -stop => $stop ); return $obj; } sub display_id { my $self = shift; return $self->{id}; } sub primary_id { my $self = shift; return overload::StrVal($self); } sub revcom { # Override Bio::QualI revcom() with optimized method. my $self = shift; return $self->new(@{$self}{'db', 'id', 'stop', 'start'}); } sub length { # Get length from quality location, not the quality array (too expensive) my $self = shift; return $self->{start} < $self->{stop} ? $self->{stop} - $self->{start} + 1 : $self->{start} - $self->{stop} + 1 ; } sub description { my $self = shift; my $header = $self->{'db'}->header($self->{id}); # remove the id from the header; split with limit 2 yields (id, description) $header = (split(/\s+/, $header, 2))[1]; return $header; } *desc = \&description; 1; bp_search2table100644000766000024 446614006427353 16745 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_search2table - turn SearchIO parseable reports into tab delimited format like NCBI's -m 9 =head1 SYNOPSIS bp_search2table -f fasta -i file.FASTA -o output.table =head1 DESCRIPTION Turn SearchIO reports into a tabular format like NCBI's -m 9 output. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. 
Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR Jason Stajich jason_at_bioperl-dot-org =cut use strict; use warnings; use Bio::SearchIO; use Getopt::Long; my ($format, $file,$output) = ('blast'); GetOptions( 'f|format:s' => \$format, 'i|input:s' => \$file, 'o|output:s' => \$output); if( @ARGV ) { $file = shift; } my $in = Bio::SearchIO->new(-format => $format, -file => $file); my $out; if( $output ) { open $out, '>', $output or die "Could not write file '$output': $!\n"; } else { $out = \*STDOUT; } while( my $r = $in->next_result ) { while( my $hit = $r->next_hit ) { while( my $hsp = $hit->next_hsp ) { my $mismatchcount = $hsp->length('total') - ($hsp->num_conserved + $hsp->gaps('total')); print $out join("\t", ( $r->query_name, $hit->name, sprintf("%.2f",$hsp->percent_identity), $hsp->length('total'), $mismatchcount, $hsp->gaps('total'), # flip start/end on rev strand $hsp->query->strand < 0 ? ( $hsp->query->end, $hsp->query->start ) : ( $hsp->query->start, $hsp->query->end ), $hsp->hit->strand < 0 ? ( $hsp->hit->end, $hsp->hit->start ) : ( $hsp->hit->start, $hsp->hit->end ), $hsp->evalue, # change this to $hsp->sw_score # if you would rather have that # it will only work for FASTA parsing though! 
                $hsp->bits)),"\n";
        }
    }
}
bp_genbank2gff3100644000766000024 23343514006427353 16703 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl

=pod

=head1 NAME

bp_genbank2gff3.pl -- Genbank-E<gt>gbrowse-friendly GFF3

=head1 SYNOPSIS

  bp_genbank2gff3.pl [options] filename(s)

  # process a directory containing GenBank flatfiles
  perl bp_genbank2gff3.pl --dir path_to_files --zip

  # process a single file, ignore explicit exons and introns
  perl bp_genbank2gff3.pl --filter exon --filter intron file.gbk.gz

  # process a list of files
  perl bp_genbank2gff3.pl *gbk.gz

  # process data from URL, with Chado GFF model (-noCDS), and pipe to database loader
  curl ftp://ftp.ncbi.nih.gov/genomes/Saccharomyces_cerevisiae/CHR_X/NC_001142.gbk \
  | perl bp_genbank2gff3.pl -noCDS -in stdin -out stdout \
  | perl gmod_bulk_load_gff3.pl -dbname mychado -organism fromdata

 Options:
    --noinfer  -r  don't infer exon/mRNA subfeatures
    --conf     -i  path to the curation configuration file that contains user preferences
                   for Genbank entries (must be YAML format)
                   (if --manual is passed without --conf, user will be prompted to
                    create the file if any manual input is saved)
    --sofile   -l  path to the so.obo file to use for feature type mapping
                   (--sofile live will download the latest online revision)
    --manual   -m  when trying to guess the proper SO term, if more than
                   one option matches the primary tag, the converter will
                   wait for user input to choose the correct one
                   (only works with --sofile)
    --dir      -d  path to a list of genbank flatfiles
    --outdir   -o  location to write GFF files (can be 'stdout' or '-' for pipe)
    --zip      -z  compress GFF3 output files with gzip
    --summary  -s  print a summary of the features in each contig
    --filter   -x  genbank feature type(s) to ignore
    --split    -y  split output to separate GFF and fasta files for
                   each genbank record
    --nolump   -n  separate file for each reference sequence
                   (default is to lump all records together into one
                   output file for each input file)
    --ethresh  -e  error threshold for unflattener set
                   this high (>2) to ignore all unflattener errors
    --[no]CDS  -c  Keep CDS-exons, or convert to alternate gene-RNA-protein-exon
                   model. --CDS is default. Use --CDS to keep default GFF gene model,
                   use --noCDS to convert to g-r-p-e.
    --format   -f  Input format (SeqIO types): GenBank, Swiss or Uniprot, EMBL work
                   (GenBank is default)
    --GFF_VERSION  3 is default, 2 and 2.5 and other Bio::Tools::GFF versions available
    --quiet        don't talk about what is being processed
    --typesource   SO sequence type for source (e.g. chromosome; region; contig)
    --help     -h  display this message

=head1 DESCRIPTION

This script uses Bio::SeqFeature::Tools::Unflattener and
Bio::Tools::GFF to convert GenBank flatfiles to GFF3 with gene
containment hierarchies mapped for optimal display in gbrowse.

The input files are assumed to be gzipped GenBank flatfiles for refseq
contigs. The files may contain multiple GenBank records. Either a
single file or an entire directory can be processed. By default, the
DNA sequence is embedded in the GFF, but it can be saved into a separate
FASTA file with the --split (-y) option.

If an input file contains multiple records, the default behaviour is
to dump all GFF and sequence to a file of the same name (with .gff
appended). Using the 'nolump' option will create a separate file for
each genbank record. Using the 'split' option will create separate
GFF and Fasta files for each genbank record.

=head2 Notes

=head3 'split' and 'nolump' produce many files

In cases where the input files contain many GenBank records (for
example, the chromosome files for the mouse genome build), a very
large number of output files will be produced if the 'split' or
'nolump' options are selected. If you do have lists of files E<gt> 6000,
use the --long_list option in bp_bulk_load_gff.pl or
bp_fast_load_gff.pl to load the gff and/or fasta files.

=head3 Designed for RefSeq

This script is designed for RefSeq genomic sequence entries. It may
work for third party annotations but this has not been tested.
But see below, Uniprot/Swissprot works, EMBL and possibly EMBL/Ensembl
if you don't mind some gene model unflattener errors (dgg).

=head3 G-R-P-E Gene Model

Don Gilbert reworked this script to produce GFF3 suited to
loading into GMOD Chado databases. Most of the changes, I believe, are
suited for general use. One main chado-specific addition is the
--[no]cds2protein flag.

My favorite GFF is to set the above as ON by default (disable with --nocds2prot).
For general use it probably should be OFF, enabled with --cds2prot.

This writes GFF with an alternate, but useful, gene model,
instead of the consensus model for GFF3

  [ gene > mRNA> (exon,CDS,UTR) ]

This alternate is

  gene > mRNA > polypeptide > exon

which means the only feature with DNA bases is the exon. The others
specify only location ranges on a genome. Exon of course is a child
of mRNA and protein/peptide.

The protein/polypeptide feature is an important one, having all the
annotations of the GenBank CDS feature: protein ID, translation, GO
terms, Dbxrefs to other proteins.

UTRs, introns, CDS-exons are all inferred from the primary exon bases
inside/outside appropriate higher feature ranges. Other special gene
model features remain the same.

Several other improvements and bugfixes, minor but useful, are included:

  * IO pipes now work:
    curl ftp://ncbigenomes/... | bp_genbank2gff3 --in stdin --out stdout | gff2chado ...

  * GenBank main record fields are added to source feature, e.g. organism, date,
    and the sourcetype, commonly chromosome for genomes, is used.

  * Gene Model handling for ncRNA, pseudogenes is added.

  * GFF header is cleaner, more informative.
    --GFF_VERSION flag allows choice of v2 as well as default v3

  * GFF ##FASTA inclusion is improved, and
    CDS translation sequence is moved to FASTA records.

  * FT -> GFF attribute mapping is improved.

  * --format choice of SeqIO input formats (GenBank default).
    Uniprot/Swissprot and EMBL work and produce useful GFF.
  * SeqFeature::Tools::TypeMapper has a few FT -> SOFA additions
    and more flexible usage.

=head1 TODO

=head2 Are these additions desired?

  * filter input records by taxon (e.g. keep only organism=xxx or taxa level = classYYY)
  * handle Entrezgene, other non-sequence SeqIO structures (really should change
    those parsers to produce consistent annotation tags).

=head2 Related bugfixes/tests

These items from Bioperl mail were tested (sample data generating errors),
and found corrected:

  From: Ed Green (eva.mpg.de)
  Subject: genbank2gff3.pl on new human RefSeq
  Date: 2006-03-13 21:22:26 GMT
    -- unspecified errors (sample data works now).

  From: Eric Just (northwestern.edu)
  Subject: genbank2gff3.pl
  Date: 2007-01-26 17:08:49 GMT
    -- bug fixed in genbank2gff3 for multi-record handling

This error is for a /trans_splice gene that is hard to handle, and
Unflattener/genbank2 doesn't

  From: Chad Matsalla (dieselwurks.com)
  Subject: genbank2gff3.PLS and the unflatenner - Inconsistent order?
  Date: 2005-07-15 19:51:48 GMT

=head1 AUTHOR

Sheldon McKay (mckays@cshl.edu)

Copyright (c) 2004 Cold Spring Harbor Laboratory.
=head2 AUTHOR of hacks for GFF2Chado loading Don Gilbert (gilbertd@indiana.edu) =cut use strict; use warnings; use Pod::Usage; use Bio::Root::RootI; use Bio::SeqIO; use File::Spec; use Bio::SeqFeature::Tools::Unflattener; use Bio::SeqFeature::Tools::TypeMapper; use Bio::SeqFeature::Tools::IDHandler; use Bio::Location::SplitLocationI; use Bio::Location::Simple; use Bio::Tools::GFF; use Getopt::Long; use List::Util qw(first); use Bio::OntologyIO; use YAML qw(Dump LoadFile DumpFile); use File::Basename; use vars qw/$split @filter $zip $outdir $help $ethresh $ONTOLOGY %FEATURES %DESCENDANTS @RETURN $MANUAL @GFF_LINE_FEAT $CONF $YAML $TYPE_MAP $SYN_MAP $noinfer $SO_FILE $file @files $dir $summary $nolump $source_type %proteinfa %exonpar $didheader $verbose $DEBUG $GFF_VERSION $gene_id $rna_id $tnum $ncrna_id $rnum %method %id %seen/; use constant SO_URL => 'http://song.cvs.sourceforge.net/viewvc/*checkout*/song/ontology/so.obo'; use constant ALPHABET => [qw(a b c d e f g h i j k l m n o p q r s t u v w x y z)]; use constant ALPHABET_TO_NUMBER => { a => 0, b => 1, c => 2, d => 3, e => 4, f => 5, g => 6, h => 7, i => 8, j => 9, k => 10, l => 11, m => 12, n => 13, o => 14, p => 15, q => 16, r => 17, s => 18, t => 19, u => 20, v => 21, w => 22, x => 23, y => 24, z => 25, }; use constant ALPHABET_DIVISOR => 26; use constant GM_NEW_TOPLEVEL => 2; use constant GM_NEW_PART => 1; use constant GM_DUP_PART => 0; use constant GM_NOT_PART => -1; # Options cycle in multiples of 2 because of left side/right side pairing. # You can make this number odd, but displayed matches will still round up use constant OPTION_CYCLE => 6; $GFF_VERSION = 3; # allow v2 ... $verbose = 1; # right default? -nov to turn off # dgg: change the gene model to Gene/mRNA/Polypeptide/exons... 
my $CDSkeep= 1; # default should be ON (prior behavior), see gene_features() my $PROTEIN_TYPE = 'polypeptide'; # for noCDSkeep; # protein = flybase chado usage; GMOD Perls use 'polypeptide' with software support my $FORMAT="GenBank"; # swiss ; embl; genbank ; ** guess from SOURCEID ** my $SOURCEID= $FORMAT; # "UniProt" "GenBank" "EMBL" should work # other Bio::SeqIO formats may work. TEST: EntrezGene < problematic tags; InterPro KEGG my %TAG_MAP = ( db_xref => 'Dbxref', name => 'Name', note => 'Note', # also pull GO: ids into Ontology_term synonym => 'Alias', symbol => 'Alias', # is symbol still used? # protein_id => 'Dbxref', also seen Dbxref tags: EC_number # translation: handled in gene_features ); $| = 1; my $quiet= !$verbose; my $ok= GetOptions( 'd|dir|input:s' => \$dir, 'z|zip' => \$zip, 'h|help' => \$help, 's|summary' => \$summary, 'r|noinfer' => \$noinfer, 'i|conf=s' => \$CONF, 'sofile=s' => \$SO_FILE, 'm|manual' => \$MANUAL, 'o|outdir|output:s'=> \$outdir, 'x|filter:s'=> \@filter, 'y|split' => \$split, "ethresh|e=s"=>\$ethresh, 'c|CDS!' => \$CDSkeep, 'f|format=s' => \$FORMAT, 'typesource=s' => \$source_type, 'GFF_VERSION=s' => \$GFF_VERSION, 'quiet!' => \$quiet, # swap quiet to verbose 'DEBUG!' => \$DEBUG, 'n|nolump' => \$nolump); my $lump = 1 unless $nolump || $split; $verbose= !$quiet; # look for help request pod2usage(2) if $help || !$ok; # keep SOURCEID as-is and change FORMAT for SeqIO types; # note SeqIO uses file.suffix to guess type; not useful here $SOURCEID= $FORMAT; $FORMAT = "swiss" if $FORMAT =~/UniProt|trembl/; $verbose =1 if($DEBUG); # initialize handlers my $unflattener = Bio::SeqFeature::Tools::Unflattener->new; # for ensembl genomes (-trust_grouptag=>1); $unflattener->error_threshold($ethresh) if $ethresh; $unflattener->verbose(1) if($DEBUG); # $unflattener->group_tag('gene') if($FORMAT =~ /embl/i) ; #? ensembl only? 
# ensembl parsing is still problematic, forget this my $tm = Bio::SeqFeature::Tools::TypeMapper->new; my $idh = Bio::SeqFeature::Tools::IDHandler->new; # dgg $source_type ||= "region"; # should really parse from FT.source contents below #my $FTSOmap = $tm->FT_SO_map(); my $FTSOmap; my $FTSOsynonyms; if (defined($SO_FILE) && $SO_FILE eq 'live') { print "\nDownloading the latest SO file from ".SO_URL."\n\n"; use LWP::UserAgent; my $ua = LWP::UserAgent->new(timeout => 30); my $request = HTTP::Request->new(GET => SO_URL); my $response = $ua->request($request); if ($response->status_line =~ /200/) { use File::Temp qw/ tempfile /; my ($fh, $fn) = tempfile(); print $fh $response->content; $SO_FILE = $fn; } else { print "Couldn't download SO file online...skipping validation.\n" . "HTTP Status was " . $response->status_line . "\n" and undef $SO_FILE } } if ($SO_FILE) { my (%terms, %syn); my $parser = Bio::OntologyIO->new( -format => "obo", -file => $SO_FILE ); $ONTOLOGY = $parser->next_ontology(); for ($ONTOLOGY->get_all_terms) { my $feat = $_; $terms{$feat->name} = $feat->name; #$terms{$feat->name} = $feat; my @syn = $_->each_synonym; push @{$syn{$_}}, $feat->name for @syn; #push @{$syn{$_}}, $feat for @syn; } $FTSOmap = \%terms; $FTSOsynonyms = \%syn; my %hardTerms = %{ $tm->FT_SO_map() }; map { $FTSOmap->{$_} ||= $hardTerms{$_} } keys %hardTerms; } else { my %terms = %{ $tm->FT_SO_map() }; while (my ($k,$v) = each %terms) { $FTSOmap->{$k} = ref($v) ? 
shift @$v : $v; } } $TYPE_MAP = $FTSOmap; $SYN_MAP = $FTSOsynonyms; # #convert $FTSOmap undefined to valid SO : moved to TypeMapper->map_types( -undefined => "region") # stringify filter list if applicable my $filter = join ' ', @filter if @filter; # determine input files my $stdin=0; # dgg: let dir == stdin == '-' for pipe use if ($dir && ($dir eq '-' || $dir eq 'stdin')) { $stdin=1; $dir=''; @files=('stdin'); } elsif ( $dir ) { if ( -d $dir ) { opendir DIR, $dir or die "could not open $dir for reading: $!"; @files = map { "$dir/$_";} grep { /\.gb.*/ } readdir DIR; closedir DIR; } else { die "$dir is not a directory\n"; } } else { @files = @ARGV; $dir = ''; } # we should have some files by now pod2usage(2) unless @files; my $stdout=0; # dgg: let outdir == stdout == '-' for pipe use if($outdir && ($outdir eq '-' || $outdir eq 'stdout')) { warn("std. output chosen: cannot split\n") if($split); warn("std. output chosen: cannot zip\n") if($zip); warn("std. output chosen: cannot nolump\n") if($nolump); $stdout=1; $lump=1; $split= 0; $zip= 0; # unless we pipe stdout thru gzip } elsif ( $outdir && !-e $outdir ) { mkdir($outdir) or die "could not create directory $outdir: $!\n"; } elsif ( !$outdir ) { $outdir = $dir || '.'; } for my $file ( @files ) { # dgg ; allow 'stdin' / '-' input ? chomp $file; die "$! $file" unless($stdin || -e $file); print "# Input: $file\n" if($verbose); my ($lump_fh, $lumpfa_fh, $outfile, $outfa); if ($stdout) { $lump_fh= *STDOUT; $lump="stdout$$"; $outfa= "stdout$$.fa"; # this is a temp file ... 
see below open $lumpfa_fh, ">$outfa" or die "Could not create a lump outfile called ($outfa) because ($!)\n"; } elsif ( $lump ) { my ($vol,$dirs,$fileonly) = File::Spec->splitpath($file); $lump = File::Spec->catfile($outdir, $fileonly.'.gff'); ($outfa = $lump) =~ s/\.gff/\.fa/; open $lump_fh, ">$lump" or die "Could not create a lump outfile called ($lump) because ($!)\n"; open $lumpfa_fh, ">$outfa" or die "Could not create a lump outfile called ($outfa) because ($!)\n"; } # open input file, unzip if req'd if ($stdin) { *FH = *STDIN; } elsif ( $file =~ /\.gz/ ) { open FH, "gunzip -c $file |"; } else { open FH, '<', $file; } my $in = Bio::SeqIO->new(-fh => \*FH, -format => $FORMAT, -debug=>$DEBUG); my $gffio = Bio::Tools::GFF->new( -noparse => 1, -gff_version => $GFF_VERSION ); while ( my $seq = $in->next_seq() ) { my $seq_name = $seq->accession_number; my $end = $seq->length; my @to_print; # arrange disposition of GFF output $outfile = $lump || File::Spec->catfile($outdir, $seq_name.'.gff'); my $out; if ( $lump ) { $outfile = $lump; $out = $lump_fh; } else { $outfile = File::Spec->catfile($outdir, $seq_name.'.gff'); open $out, ">$outfile"; } # filter out unwanted features my $source_feat= undef; my @source= filter($seq); $source_feat= $source[0]; ($source_type,$source_feat)= getSourceInfo( $seq, $source_type, $source_feat ) ; # always; here we build main prot $source_feat; # if @source; # abort if there are no features warn "$seq_name has no features, skipping\n" and next if !$seq->all_SeqFeatures; $FTSOmap->{'source'} = $source_type; ## $FTSOmap->{'CDS'}= $PROTEIN_TYPE; # handle this in gene_features # construct a GFF header # add: get source_type from attributes of source feature? chromosome=X tag # also combine 1st ft line here with source ft from $seq .. 
my($header,$info)= gff_header($seq_name, $end, $source_type, $source_feat); print $out $header; print "# working on $info\n" if($verbose); # unflatten gene graphs, apply SO types, etc; this also does TypeMapper .. unflatten_seq($seq); # Note that we use our own get_all_SeqFeatures function # to rescue cloned exons @GFF_LINE_FEAT = (); for my $feature ( get_all_SeqFeatures($seq) ) { my $method = $feature->primary_tag; next if($SOURCEID =~/UniProt|swiss|trembl/i && $method ne $source_type); $feature->seq_id($seq->id) unless($feature->seq_id); $feature->source_tag($SOURCEID); # dgg; need to convert some Genbank to GFF tags: note->Note; db_xref->Dbxref; ## also, pull any GO:000 ids from /note tag and put into Ontology_term maptags2gff($feature); # current gene name. The unflattened gene features should be in order so any # exons, CDSs, etc that follow will belong to this gene my $gene_name; if ( $method eq 'gene' || $method eq 'pseudogene' ) { @to_print= print_held($out, $gffio, \@to_print); $gene_id = $gene_name= gene_name($feature); } else { $gene_name= gene_name($feature); } #?? should gene_name from /locus_tag,/gene,/product,/transposon=xxx # be converted to or added as Name=xxx (if not ID= or as well) ## problematic: convert_to_name ($feature); # drops /locus_tag,/gene, tags convert_to_name($feature); ## dgg: extended to protein|polypeptide ## this test ($feature->has_tag('gene') ||) is not good: repeat_regions over genes ## in yeast have that genbank tag; why? ## these include pseudogene ... 
## Note we also have mapped types to SO, so these RNA's are now transcripts: # pseudomRNA => "pseudogenic_transcript", # pseudotranscript" => "pseudogenic_transcript", # misc_RNA=>'processed_transcript', warn "#at: $method $gene_id/$gene_name\n" if $DEBUG; if ( $method =~ /(gene|RNA|CDS|exon|UTR|protein|polypeptide|transcript)/ || ( $gene_id && $gene_name eq $gene_id ) ) { my $action = gene_features($feature, $gene_id, $gene_name); # -1, 0, 1, 2 result if ($action == GM_DUP_PART) { # ignore, this is dupl. exon with new parent ... } elsif ($action == GM_NOT_PART) { add_generic_id( $feature, $gene_name, "nocount"); my $gff = $gffio->gff_string($feature); push @GFF_LINE_FEAT, $feature; #print $out "$gff\n" if $gff; } elsif ($action > 0) { # hold off print because exon etc. may get 2nd, 3rd parents @to_print= print_held($out, $gffio, \@to_print) if ($action == GM_NEW_TOPLEVEL); push(@to_print, $feature); } } # otherwise handle as generic feats with IDHandler labels else { add_generic_id( $feature, $gene_name, ""); my $gff= $gffio->gff_string($feature); push @GFF_LINE_FEAT, $feature; #print $out "$gff\n" if $gff; } } # don't like doing this after others; do after each new gene id? @to_print= print_held($out, $gffio, \@to_print); gff_validate(@GFF_LINE_FEAT); for my $feature (@GFF_LINE_FEAT) { my $gff= $gffio->gff_string($feature); print $out "$gff\n" if $gff; } # deal with the corresponding DNA my ($fa_out,$fa_outfile); my $dna = $seq->seq; if($dna || %proteinfa) { $method{'RESIDUES'} += length($dna); $dna =~ s/(\S{60})/$1\n/g; $dna .= "\n"; if ($split) { $fa_outfile = $outfile; $fa_outfile =~ s/gff$/fa/; open $fa_out, ">$fa_outfile" or die $!; print $fa_out ">$seq_name\n$dna" if $dna; foreach my $aid (sort keys %proteinfa) { my $aa= delete $proteinfa{$aid}; $method{'RESIDUES(tr)'} += length($aa); $aa =~ s/(\S{60})/$1\n/g; print $fa_out ">$aid\n$aa\n"; } } else { ## problem here when multiple GB Seqs in one file; all FASTA needs to go at end of $out ## see e.g. 
Mouse: mm_ref_chr19.gbk has NT_082868 and NT_039687 parts in one .gbk ## maybe write this to temp .fa then cat to end of lumped gff $out print $lumpfa_fh ">$seq_name\n$dna" if $dna; foreach my $aid (sort keys %proteinfa) { my $aa= delete $proteinfa{$aid}; $method{'RESIDUES(tr)'} += length($aa); $aa =~ s/(\S{60})/$1\n/g; print $lumpfa_fh ">$aid\n$aa\n"; } } %proteinfa=(); } if ( $zip && !$lump ) { system "gzip -f $outfile"; system "gzip -f $fa_outfile" if($fa_outfile); $outfile .= '.gz'; $fa_outfile .= '.gz' if $split; } # print "\n>EOF\n" if($stdout); #?? need this if summary goes to stdout after FASTA print "# GFF3 saved to $outfile" unless( !$verbose || $stdout || $lump); print ($split ? "; DNA saved to $fa_outfile\n" : "\n") unless($stdout|| $lump); # dgg: moved to after all inputs; here it prints cumulative sum for each record #if ( $summary ) { # print "# Summary:\n# Feature\tCount\n# -------\t-----\n"; # # for ( keys %method ) { # print "# $_ $method{$_}\n"; # } # print "# \n"; # } } print "# GFF3 saved to $outfile\n" if( $verbose && $lump); if ( $summary ) { print "# Summary:\n# Feature\tCount\n# -------\t-----\n"; for ( keys %method ) { print "# $_ $method{$_}\n"; } print "# \n"; } ## FIXME for piped output w/ split FA files ... close($lumpfa_fh) if $lumpfa_fh; if (!$split && $outfa && $lump_fh) { print $lump_fh "##FASTA\n"; # GFF3 spec open $lumpfa_fh, $outfa or warn "reading FA $outfa: $!"; while( <$lumpfa_fh>) { print $lump_fh $_; } # is $lump_fh still open? 
close($lumpfa_fh); unlink($outfa); } if ( $zip && $lump ) { system "gzip -f $lump"; } close FH; } sub typeorder { return 1 if ($_[0] =~ /gene/); return 2 if ($_[0] =~ /RNA|transcript/); return 3 if ($_[0] =~ /protein|peptide/); return 4 if ($_[0] =~ /exon|CDS/); return 3; # default before exon (smallest part) } sub sort_by_feattype { my($at,$bt)= ($a->primary_tag, $b->primary_tag); return (typeorder($at) <=> typeorder($bt)) || ($at cmp $bt); ## || ($a->name() cmp $b->name()); } sub print_held { my($out,$gffio,$to_print)= @_; return unless(@$to_print); @$to_print = sort sort_by_feattype @$to_print; # put exons after mRNA, otherwise chado loader chokes while ( my $feature = shift @$to_print) { my $gff= $gffio->gff_string($feature); # $gff =~ s/\'/./g; # dang bug in encode push @GFF_LINE_FEAT, $feature; #print $out "$gff\n"; } return (); # @to_print } sub maptags2gff { my $f = shift; ## should copy/move locus_tag to Alias, if not ID/Name/Alias already # but see below /gene /locus_tag usage foreach my $tag (keys %TAG_MAP) { if ($f->has_tag($tag)) { my $newtag= $TAG_MAP{$tag}; my @v= $f->get_tag_values($tag); $f->remove_tag($tag); $f->add_tag_value($newtag,@v); ## also, pull any GO:000 ids from /note tag and put into Ontology_term ## ncbi syntax in CDS /note is now '[goid GO:0005886]' OR '[goid 0005624]' if ($tag eq 'note') { map { s/\[goid (\d+)/\[goid GO:$1/g; } @v; my @go= map { m/(GO:\d+)/g } @v; $f->add_tag_value('Ontology_term',@go) if(@go); } } } } sub getSourceInfo { my ($seq, $source_type, $sf) = @_; my $is_swiss= ($SOURCEID =~/UniProt|swiss|trembl/i); my $is_gene = ($SOURCEID =~/entrezgene/i); my $is_rich = (ref($seq) =~ /RichSeq/); my $seq_name= $seq->accession_number(); unless($sf) { # make one $source_type= $is_swiss ? $PROTEIN_TYPE : $is_gene ? "eneg" # "gene" # "region" # : $is_rich ? 
$seq->molecule : $source_type; $sf = Bio::SeqFeature::Generic->direct_new(); my $len = $seq->length(); $len=1 if($len<1); my $start = 1; ##$start= $len if ($len<1); my $loc= $seq->can('location') ? $seq->location() : new Bio::Location::Simple( -start => $start, -end => $len); $sf->location( $loc ); $sf->primary_tag($source_type); $sf->source_tag($SOURCEID); $sf->seq_id( $seq_name); #? $sf->display_name($seq->id()); ## Name or Alias ? $sf->add_tag_value( Alias => $seq->id()); # unless id == accession $seq->add_SeqFeature($sf); ## $source_feat= $sf; } if ($sf->has_tag("chromosome")) { $source_type= "chromosome"; my ($chrname) = $sf->get_tag_values("chromosome"); ## PROBLEM with Name <> ID, RefName for Gbrowse; use Alias instead ## e.g. Mouse chr 19 has two IDs in NCBI genbank now $sf->add_tag_value( Alias => $chrname ); } # pull GB Comment, Description for source ft ... # add reference - can be long, not plain string... warn "# $SOURCEID:$seq_name fields = ", join(",", $seq->annotation->get_all_annotation_keys()),"\n" if $DEBUG; # GenBank fields: keyword,comment,reference,date_changed # Entrezgene fields 850293 =ALIAS_SYMBOL,RefSeq status,chromosome,SGD,dblink,Entrez Gene Status,OntologyTerm,LOCUS_SYNONYM # is this just for main $seq object or for all seqfeatures ? my %AnnotTagMap= ( 'gene_name' => 'Alias', 'ALIAS_SYMBOL' => 'Alias', # Entrezgene 'LOCUS_SYNONYM' => 'Alias', #? 'symbol' => 'Alias', 'synonym' => 'Alias', 'dblink' => 'Dbxref', 'product' => 'product', 'Reference' => 'reference', 'OntologyTerm' => 'Ontology_term', 'comment' => 'Note', 'comment1' => 'Note', # various map-type locations # gene accession tag is named per source db !?? # 'Index terms' => keywords ?? ); my ($desc)= $seq->annotation->get_Annotations("desc") || ( $seq->desc() ); my ($date)= $seq->annotation->get_Annotations("dates") || $seq->annotation->get_Annotations("update-date") || $is_rich ? 
$seq->get_dates() : (); my ($comment)= $seq->annotation->get_Annotations("comment"); my ($species)= $seq->annotation->get_Annotations("species"); if (!$species && $seq->can('species') && defined $seq->species() && $seq->species()->can('binomial') ) { $species= $seq->species()->binomial(); } # update source feature with main GB fields $sf->add_tag_value( ID => $seq_name ) unless $sf->has_tag('ID'); $sf->add_tag_value( Note => $desc ) if($desc && ! $sf->has_tag('Note')); $sf->add_tag_value( organism => $species ) if($species && ! $sf->has_tag('organism')); $sf->add_tag_value( comment1 => $comment ) if(!$is_swiss && $comment && ! $sf->has_tag('comment1')); $sf->add_tag_value( date => $date ) if($date && ! $sf->has_tag('date')); $sf->add_tag_value( Dbxref => $SOURCEID.':'.$seq_name ) if $is_swiss || $is_gene; foreach my $atag (sort keys %AnnotTagMap) { my $gtag= $AnnotTagMap{$atag}; next unless($gtag); my @anno = map{ if (ref $_ && $_->can('get_all_values')) { split( /[,;] */, join ";", $_->get_all_values) } elsif (ref $_ && $_->can('display_text')) { split( /[,;] */, $_->display_text) } elsif (ref $_ && $_->can('value')) { split( /[,;] */, $_->value) } else { (); } } $seq->annotation->get_Annotations($atag); foreach(@anno) { $sf->add_tag_value( $gtag => $_ ); } } #my @genes = map{ split( /[,;] */, "$_"); } $seq->annotation->get_Annotations('gene_name'); #$sf->add_tag_value( Alias => $_ ) foreach(@genes); # #my @dblink= map { "$_"; } $seq->annotation->get_Annotations("dblink"); # add @all #$sf->add_tag_value( Dbxref => $_ ) foreach(@dblink); return (wantarray)? ($source_type,$sf) : $source_type; #? } sub gene_features { my ($f, $gene_id, $genelinkID) = @_; local $_ = $f->primary_tag; $method{$_}++; if ( /gene/ ) { $f->add_tag_value( ID => $gene_id ) unless($f->has_tag('ID')); # check is same value!? 
$tnum = $rnum= 0; $ncrna_id= $rna_id = ''; return GM_NEW_TOPLEVEL; } elsif ( /mRNA/ ) { return GM_NOT_PART unless $gene_id; return GM_NOT_PART if($genelinkID && $genelinkID ne $gene_id); ($rna_id = $gene_id ) =~ s/gene/mRNA/; $rna_id .= '.t0' . ++$tnum; $f->add_tag_value( ID => $rna_id ); $f->add_tag_value( Parent => $gene_id ); } elsif ( /RNA|transcript/) { ## misc_RNA here; missing exons ... flattener problem? # all of {t,nc,sn}RNA can have gene models now ## but problem in Worm chr: mRNA > misc_RNA > CDS with same locus tag ## CDS needs to use mRNA, not misc_RNA, rna_id ... ## also need to fix cases where tRNA,... lack a 'gene' parent: make this one top-level if($gene_id) { return GM_NOT_PART if($genelinkID && $genelinkID ne $gene_id); ($ncrna_id = $gene_id) =~ s/gene/ncRNA/; $ncrna_id .= '.r0' . ++$rnum; $f->add_tag_value( Parent => $gene_id ); $f->add_tag_value( ID => $ncrna_id ); } else { unless ($f->has_tag('ID')) { if($genelinkID) { $f->add_tag_value( ID => $genelinkID ) ; } else { $idh->generate_unique_persistent_id($f); } } ($ncrna_id)= $f->get_tag_values('ID'); return GM_NEW_TOPLEVEL; # this feat now acts as gene-top-level; need to print @to_print to flush prior exons? } } elsif ( /exon/ ) { # can belong to any kind of RNA return GM_NOT_PART unless ($rna_id||$ncrna_id); return GM_NOT_PART if($genelinkID && $genelinkID ne $gene_id); ## we are getting duplicate Parents here, which chokes chado loader, with reason... ## problem is when mRNA and ncRNA have same exons, both ids are active, called twice ## check all Parents for my $expar ($rna_id, $ncrna_id) { next unless($expar); if ( $exonpar{$expar} and $f->has_tag('Parent') ) { my @vals = $f->get_tag_values('Parent'); next if (grep {$expar eq $_} @vals); } $exonpar{$expar}++; $f->add_tag_value( Parent => $expar); # last; #? 
could be both } # now we can skip cloned exons # dgg note: multiple parents get added and printed for each unique exon return GM_DUP_PART if ++$seen{$f} > 1; } elsif ( /CDS|protein|polypeptide/ ) { return GM_NOT_PART unless $rna_id; ## ignore $ncrna_id ?? return GM_NOT_PART if($genelinkID && $genelinkID ne $gene_id); #?? (my $pro_id = $rna_id) =~ s/\.t/\.p/; if( ! $CDSkeep && /CDS/) { $f->primary_tag($PROTEIN_TYPE); ## duplicate problem is Location .. if ($f->location->isa("Bio::Location::SplitLocationI")) { # my($b,$e)=($f->start, $f->end); # is this all we need? my($b,$e)=(-1,0); foreach my $l ($f->location->each_Location) { $b = $l->start if($b<0 || $b > $l->start); $e = $l->end if($e < $l->end); } $f->location( Bio::Location::Simple->new( -start => $b, -end => $e, -strand => $f->strand) ); } $f->add_tag_value( Derives_from => $rna_id ); } else { $f->add_tag_value( Parent => $rna_id ); } $f->add_tag_value( ID => $pro_id ); move_translation_fasta($f, $pro_id); #if( $f->has_tag('translation')) { # my ($aa) = $f->get_tag_values("translation"); # $proteinfa{$pro_id}= $aa; # $f->remove_tag("translation"); # $f->add_tag_value("translation","length.".length($aa)); # hack for odd chado gbl problem #} } elsif ( /region/ ) { $f->primary_tag('gene_component_region'); $f->add_tag_value( Parent => $gene_id ); } else { return GM_NOT_PART unless $gene_id; $f->add_tag_value( Parent => $gene_id ); } ## return GM_DUP_PART if /exon/ && ++$seen{$f} > 1; return GM_NEW_PART; } ## was generic_features > add_generic_id sub add_generic_id { my ($f, $ft_name, $flags) = @_; my $method = $f->primary_tag; $method{$method}++ unless($flags =~ /nocount/); ## double counts GM_NOT_PART from above if ($f->has_tag('ID')) { } elsif ( $f->has_tag($method) ) { my ($name) = $f->get_tag_values($method); $f->add_tag_value( ID => "$method:$name" ); } elsif($ft_name) { # is this unique ? 
$f->add_tag_value( ID => $ft_name ); } else { $idh->generate_unique_persistent_id($f); } move_translation_fasta( $f, ($f->get_tag_values("ID"))[0] ) if($method =~ /CDS/); # return $io->gff_string($f); } sub move_translation_fasta { my ($f, $ft_id) = @_; if( $ft_id && $f->has_tag('translation') ) { my ($aa) = $f->get_tag_values("translation"); if($aa && $aa !~ /^length/) { $proteinfa{$ft_id}= $aa; $f->remove_tag("translation"); $f->add_tag_value("translation","length.".length($aa)); # hack for odd chado gbl problem } } } sub gff_header { my ($name, $end, $source_type, $source_feat) = @_; $source_type ||= "region"; my $info = "$source_type:$name"; my $head = "##gff-version $GFF_VERSION\n". "##sequence-region $name 1 $end\n". "# conversion-by bp_genbank2gff3.pl\n"; if ($source_feat) { ## dgg: these header comment fields are not useful when have multi-records, diff organisms for my $key (qw(organism Note date)) { my $value; if ($source_feat->has_tag($key)) { ($value) = $source_feat->get_tag_values($key); } if ($value) { $head .= "# $key $value\n"; $info .= ", $value"; } } $head = "" if $didheader; } else { $head .= "$name\t$SOURCEID\t$source_type\t1\t$end\t.\t.\t.\tID=$name\n"; } $didheader++; return (wantarray) ? ($head,$info) : $head; } sub unflatten_seq { my $seq = shift; ## print "# working on $source_type:", $seq->accession, "\n"; my $uh_oh = "Possible gene unflattening error with" . $seq->accession_number . ": consult STDERR\n"; eval { $unflattener->unflatten_seq( -seq => $seq, -noinfer => $noinfer, -use_magic => 1 ); }; # deal with unflattening errors if ( $@ ) { warn $seq->accession_number . 
" Unflattening error:\n"; warn "Details: $@\n"; print "# ".$uh_oh; } return 0 if !$seq || !$seq->all_SeqFeatures; # map feature types to the sequence ontology ## $tm->map_types_to_SO( -seq => $seq ); #$tm->map_types( -seq => $seq, -type_map => $FTSOmap, -undefined => "region" ); #dgg map_types( $tm, -seq => $seq, -type_map => $FTSOmap, -syn_map => $FTSOsynonyms, -undefined => "region" ); #nml } sub filter { my $seq = shift; ## return unless $filter; my @feats; my @sources; # dgg; pick source features here; only 1 always? if ($filter) { for my $f ( $seq->remove_SeqFeatures ) { my $m = $f->primary_tag; push @sources, $f if ($m eq 'source'); # dgg? but leave in @feats ? push @feats, $f unless $filter =~ /$m/i; } $seq->add_SeqFeature($_) foreach @feats; } else { for my $f ( $seq->get_SeqFeatures ){ my $m = $f->primary_tag; push @sources, $f if ($m eq 'source'); # dgg? but leave in @feats ? } } return @sources; } # The default behaviour of Bio::FeatureHolderI:get_all_SeqFeatures # changed to filter out cloned features. We have to implement the old # method. These two subroutines were adapted from the v1.4 Bio::FeatureHolderI sub get_all_SeqFeatures { my $seq = shift; my @flatarr; foreach my $feat ( $seq->get_SeqFeatures ){ push(@flatarr,$feat); _add_flattened_SeqFeatures(\@flatarr,$feat); } return @flatarr; } sub gene_name { my $g = shift; my $gene_id = ''; # zero it; if ($g->has_tag('locus_tag')) { ($gene_id) = $g->get_tag_values('locus_tag'); } elsif ($g->has_tag('gene')) { ($gene_id) = $g->get_tag_values('gene'); } elsif ($g->has_tag('ID')) { # for non-Genbank > Entrezgene ($gene_id) = $g->get_tag_values('ID'); } ## See Unflattener comment: # on rare occasions, records will have no /gene or /locus_tag # but it WILL have /product tags. These serve the same purpose # for grouping. For an example, see AY763288 (also in t/data) # eg. 
product=tRNA-Asp ; product=similar to crooked neck protein elsif ($g->has_tag('product')) { my ($name)= $g->get_tag_values('product'); ($gene_id) = $name unless($name =~ / /); # a description not name } ## dgg; also handle transposon=xxxx ID/name # ID=GenBank:repeat_region:NC_004353:1278337:1281302;transposon=HeT-A{}1685;Dbxref=FLYBASE:FBti0059746 elsif ($g->has_tag('transposon')) { my ($name)= $g->get_tag_values('transposon'); ($gene_id) = $name unless($name =~ / /); # a description not name } return $gene_id; } # same list as gene_name .. change tag to generic Name sub convert_to_name { my $g = shift; my $gene_id = ''; # zero it; if ($g->has_tag('gene')) { ($gene_id) = $g->get_tag_values('gene'); $g->remove_tag('gene'); $g->add_tag_value('Name', $gene_id); } elsif ($g->has_tag('locus_tag')) { ($gene_id) = $g->get_tag_values('locus_tag'); $g->remove_tag('locus_tag'); $g->add_tag_value('Name', $gene_id); } elsif ($g->has_tag('product')) { my ($name)= $g->get_tag_values('product'); ($gene_id) = $name unless($name =~ / /); # a description not name ## $g->remove_tag('product'); $g->add_tag_value('Name', $gene_id); } elsif ($g->has_tag('transposon')) { my ($name)= $g->get_tag_values('transposon'); ($gene_id) = $name unless($name =~ / /); # a description not name ## $g->remove_tag('transposon'); $g->add_tag_value('Name', $gene_id); } elsif ($g->has_tag('ID')) { my ($name)= $g->get_tag_values('ID'); $g->add_tag_value('Name', $name); } return $gene_id; } sub _add_flattened_SeqFeatures { my ($arrayref,$feat) = @_; my @subs = (); if ($feat->isa("Bio::FeatureHolderI")) { @subs = $feat->get_SeqFeatures; } elsif ($feat->isa("Bio::SeqFeatureI")) { @subs = $feat->sub_SeqFeature; } else { warn ref($feat)." is neither a FeatureHolderI nor a SeqFeatureI. ". 
"Don't know how to flatten."; } for my $sub (@subs) { push(@$arrayref,$sub); _add_flattened_SeqFeatures($arrayref,$sub); } } sub map_types { my ($self, @args) = @_; my($sf, $seq, $type_map, $syn_map, $undefmap) = $self->_rearrange([qw(FEATURE SEQ TYPE_MAP SYN_MAP UNDEFINED )], @args); if (!$sf && !$seq) { $self->throw("you need to pass in either -feature or -seq"); } my @sfs = ($sf); if ($seq) { $seq->isa("Bio::SeqI") || $self->throw("$seq NOT A SeqI"); @sfs = $seq->get_all_SeqFeatures; } $type_map = $type_map || $self->typemap; # dgg: was type_map; foreach my $feat (@sfs) { $feat->isa("Bio::SeqFeatureI") || $self->throw("$feat NOT A SeqFeatureI"); $feat->isa("Bio::FeatureHolderI") || $self->throw("$feat NOT A FeatureHolderI"); my $primary_tag = $feat->primary_tag; #if ($primary_tag =~ /^pseudo(.*)$/) { # $primary_tag = $1; # $feat->primary_tag($primary_tag); #} my $mtype = $type_map->{$primary_tag}; if ($mtype) { if (ref($mtype)) { if (ref($mtype) eq 'ARRAY') { my $soID; ($mtype, $soID) = @$mtype; if ($soID && ref($ONTOLOGY)) { my ($term) = $ONTOLOGY->find_terms(-identifier => $soID); $mtype = $term->name if $term; } # if SO ID is undefined AND we have an ontology to search, we want to delete # the feature type hash entry in order to force a fuzzy search elsif (! 
defined $soID && ref($ONTOLOGY)) { undef $mtype; delete $type_map->{$primary_tag}; } elsif ($undefmap && $mtype eq 'undefined') { # dgg $mtype= $undefmap; } $type_map->{$primary_tag} = $mtype if $mtype; } elsif (ref($mtype) eq 'CODE') { $mtype = $mtype->($feat); } else { $self->throw('must be scalar or CODE ref'); } } elsif ($undefmap && $mtype eq 'undefined') { # dgg $mtype= $undefmap; } $feat->primary_tag($mtype); } if ($CONF) { conf_read(); my %perfect_matches; while (my ($p_tag,$rules) = each %$YAML) { RULE: for my $rule (@$rules) { for my $tags (@$rule) { while (my ($tag,$values) = each %$tags) { for my $value (@$values) { if ($feat->has_tag($tag)) { for ($feat->get_tag_values($tag)) { next RULE unless $_ =~ /\Q$value\E/; } } elsif ($tag eq 'primary_tag') { next RULE unless $value eq $feat->primary_tag; } elsif ($tag eq 'location') { next RULE unless $value eq $feat->start.'..'.$feat->end; } else { next RULE } } } } $perfect_matches{$p_tag}++; } } if (scalar(keys %perfect_matches) == 1) { $mtype = $_ for keys %perfect_matches; } elsif (scalar(keys %perfect_matches) > 1) { warn "There are conflicting rules in the config file for the" . " following types: "; warn "\t$_\n" for keys %perfect_matches; warn "Until conflict resolution is built into the converter," . " you will have to manually edit the config file to remove the" . " conflict. Sorry :(. Skipping user preference for this entry"; sleep(2); } } if ( ! $mtype && $syn_map) { if ($feat->has_tag('note')) { my @all_matches; my @note = $feat->each_tag_value('note'); for my $k (keys %$syn_map) { if ($k =~ /"(.+)"/) { my $syn = $1; for my $note (@note) { # look through the notes to see if the description # is an exact match for synonyms if ( $syn eq $note ) { my @map = @{$syn_map->{$k}}; my $best_guess = $map[0]; unshift @{$all_matches[-1]}, [$best_guess]; $mtype = $MANUAL ? manual_curation($feat, $best_guess, \@all_matches) : $best_guess; print '#' x 78 . "\nGuessing the proper SO term for GenBank" . 
" entry:\n\n" . GenBank_entry($feat) . "\nis:\t$mtype\n" . '#' x 78 . "\n\n"; } else { # check both primary tag and and note against # SO synonyms for best matching description SO_fuzzy_match( $k, $primary_tag, $note, $syn, \@all_matches); } } } } #unless ($mtype) { for my $note (@note) { for my $name (values %$type_map) { # check primary tag against SO names for best matching # descriptions //NML also need to check against # definition && camel case split terms SO_fuzzy_match($name, $primary_tag, $note, $name, \@all_matches); } } #} if (scalar(@all_matches) && !$mtype) { my $top_matches = first { defined $_ } @{$all_matches[-1]}; my $best_guess = $top_matches->[0]; # if guess has quotes, it is a synonym term. we need to # look up the corresponding name term # otherwise, guess is a name, so we can use it directly if ($best_guess =~ /"(.+)"/) { $best_guess = $syn_map->{$best_guess}->[0]; } @RETURN = @all_matches; $mtype = $MANUAL ? manual_curation($feat, $best_guess, \@all_matches) : $best_guess; print '#' x 78 . "\nGuessing the proper SO term for GenBank" . " entry:\n\n" . GenBank_entry($feat) . "\nis:\t$mtype\n" . '#' x 78 . "\n\n"; } } $mtype ||= $undefmap; $feat->primary_tag($mtype); } } } sub SO_fuzzy_match { my $candidate = shift; my $primary_tag = shift; my $note = shift; my $SO_terms = shift; my $best_matches_ref = shift; my $modifier = shift; $modifier ||= ''; my @feat_terms; for ( split(" |_", $primary_tag) ) { #my @camelCase = /(?:[A-Z]|[a-z])(?:[A-Z]+|[a-z]*)(?=$|[A-Z])/g; my @camelCase = /(?:[A-Z]|[a-z])(?:[A-Z]+|[a-z]*)(?=$|[A-Z]|[;:.,])/g; push @feat_terms, @camelCase; } for ( split(" |_", $note) ) { #my @camelCase = /(?:[A-Z]|[a-z])(?:[A-Z]+|[a-z]*)(?=$|[A-Z])/g; #my @camelCase = /(?:[A-Z]|[a-z])(?:[A-Z]+|[a-z]*)(?=$|[A-Z]|[;:.,])/g; (my $word = $_) =~ s/[;:.,]//g; push @feat_terms, $word; } my @SO_terms = split(" |_", $SO_terms); # fuzzy match works on a simple point system. When 2 words match, # the $plus counter adds one. 
    # When they don't, the $minus counter adds
    # one. This is used to sort similar matches together. Better matches
    # are found at the end of the array, near the top.

    # NML: can we improve best match by using synonym tags
    # EXACT,RELATED,NARROW,BROAD?

    my ($plus, $minus) = (0, 0);
    my %feat_terms;
    my %SO_terms;

    #unique terms
    map {$feat_terms{$_} = 1} @feat_terms;
    map {$SO_terms{$_} = 1} @SO_terms;

    for my $st (keys %SO_terms) {
        for my $ft (keys %feat_terms) {
            ($st =~ m/$modifier\Q$ft\E/) ? $plus++ : $minus++;
        }
    }

    push @{$$best_matches_ref[$plus][$minus]}, $candidate if $plus;
}

sub manual_curation {
    my ($feat, $default_opt, $all_matches) = @_;
    my @all_matches = @$all_matches;

    # convert all SO synonyms into names and filter
    # all matches into unique term list because
    # synonyms can map to multiple duplicate names

    my (@unique_SO_terms, %seen);
    for (reverse @all_matches) {
        for (@$_) {
            for (@$_) {
                #my @names;
                if ($_ =~ /"(.+)"/) {
                    for (@{$SYN_MAP->{$_}}) {
                        push @unique_SO_terms, $_ unless $seen{$_};
                        $seen{$_}++;
                    }
                } else {
                    push @unique_SO_terms, $_ unless $seen{$_};
                    $seen{$_}++;
                }
            }
        }
    }

    my $s = scalar(@unique_SO_terms);

    my $choice = 0;

    my $more =
        "[a]uto : automatic input (selects best guess for remaining entries)\r" .
        "[f]ind : search for other SO terms matching your query (e.g. f gene)\r" .
        "[i]nput : add a specific term\r" .
        "[r]eset : reset to the beginning of matches\r" .
        "[s]kip : skip this entry (selects best guess for this entry)\r" ;

    $more .=
        "[n]ext : view the next ".OPTION_CYCLE." terms\r" .
        "[p]rev : view the previous ".OPTION_CYCLE." terms" if ($s > OPTION_CYCLE);

    my $msg = #"\n\n" . '-' x 156 . "\n"
        "The converter found $s possible matches for the following GenBank entry: ";

    my $directions =
        "Type a number to select the SO term that best matches" .
        " the GenBank entry, or use any of the following options:\r" .
        '_' x 76 . "\r$more";

    # lookup filtered list to pull out definitions
    my @options = map {
        my $term = $_;
        my %term;
        for (['name', 'name'], ['def', 'definition'], ['synonym', 'each_synonym']) {
            my ($label, $method) = @$_;
            $term{$label} = \@{[$term->$method]};
        }
        [++$choice, $_->name, ($_->definition || 'none'), \%term, $_->each_synonym ];
    } map { $ONTOLOGY->find_terms(-name => $_) } @unique_SO_terms;

    my $option = options_cycle(0, OPTION_CYCLE, $msg, $feat, $directions,
                               $default_opt, @options);

    if ($option eq 'skip') { return $default_opt }
    elsif ($option eq 'auto') {
        $MANUAL = 0;
        return $default_opt;
    }
    else { return $option }
}

sub options_cycle {
    my ($start, $stop, $msg, $feat, $directions, $best_guess, @opt) = @_;

    #NML: really should only call GenBank_entry once. Will need to change
    #method to return array & shift off header
    my $entry = GenBank_entry($feat, "\r");

    my $total = scalar(@opt);

    ($start,$stop) = (0, OPTION_CYCLE)
        if ( ($start < 0) && ($stop > 0) );

    ($start,$stop) = (0, OPTION_CYCLE)
        if ( ( ($stop - $start) < OPTION_CYCLE ) && $stop < $total);

    ($start,$stop) = ($total - OPTION_CYCLE, $total) if $start < 0;
    ($start,$stop) = (0, OPTION_CYCLE) if $start >= $total;

    $stop = $total if $stop > $total;

    my $dir_copy = $directions;
    my $msg_copy = $msg;

    my $format =
        "format STDOUT = \n" .
        '-' x 156 . "\n" .
        '^' . '<' x 77 . '| Available Commands:' . "\n" .
        '$msg_copy' . "\n" .
        '-' x 156 . "\n" .
        ' ' x 78 . "|\n" .
        '^' . '<' x 77 . '| ^' . '<' x 75 . '~' . "\n" .
        '$entry' . ' ' x 74 . '$dir_copy,' . "\n" .
        (' ' x 20 . '^' . '<' x 57 . '| ^' . '<' x 75 . '~' . "\n" .
         ' ' x 20 . '$entry,' . ' ' x 53 . '$dir_copy,' . "\n") x 1000 . ".\n";

    {
        # eval throws redefined warning that breaks formatting.
        # Turning off warnings just for the eval to fix this.
        no warnings 'redefine';
        eval $format;
    }

    write;

    print '-' x 156 . "\n" .
        'Showing results ' . ( $stop ? ( $start + 1 ) : $start ) .
        " - $stop of possible SO term matches: (best guess is \"$best_guess\")" .
        "\n" . '-' x 156 .
        "\n";

    for (my $i = $start; $i < $stop; $i+=2) {
        my ($left, $right) = @opt[$i,$i+1];

        my ($nL, $nmL, $descL, $termL, @synL) = @$left;
        #odd numbered lists can cause fatal undefined errors, so check
        #to make sure we have data
        my ($nR, $nmR, $descR, $termR, @synR) = ref($right) ? @$right : (undef, undef, undef);

        my $format = "format STDOUT = \n";

        $format .=
            ' ' x 78 . "|\n" .
            '@>>: name: ^' . '<' x 64 . '~' . ' |' .
                ( ref($right) ? ('@>>: name: ^' . '<' x 64 . '~' ) : '' ) . "\n" .
            '$nL,' . ' ' x 7 . '$nmL,' .
                ( ref($right) ? (' ' x 63 . '$nR,' . ' ' x 7 . "\$nmR,") : '' ) . "\n" .
            ' ' x 11 . '^' . '<' x 61 . '...~' . ' |' .
                (ref($right) ? (' ^' . '<' x 61 . '...~') : '') . "\n" .
            ' ' x 11 . '$nmL,' .
                (ref($right) ? (' ' x 74 . '$nmR,') : '') . "\n" .
            #' ' x 78 . '|' . "\n" .
            ' def: ^' . '<' x 65 . ' |' .
                (ref($right) ? (' def: ^' . '<' x 64 . '~') : '') . "\n" .
            ' ' x 11 . '$descL,' .
                (ref($right) ? (' ' x 72 . '$descR,') : '') . "\n" .
            (' ^' . '<' x 65 . ' |' .
                (ref($right) ? (' ^' . '<' x 64 . '~') : '') . "\n" .
             ' ' x 11 . '$descL,' .
                (ref($right) ? (' ' x 72 . '$descR,') : '') . "\n") x 5 .
            ' ^' . '<' x 61 . '...~ |' .
                (ref($right) ? (' ^' . '<' x 61 . '...~') : '') . "\n" .
            ' ' x 11 . '$descL,' .
                (ref($right) ? (' ' x 72 . '$descR,') : '') . "\n" .
            ".\n";

        {
            # eval throws redefined warning that breaks formatting.
            # Turning off warnings just for the eval to fix this.
            no warnings 'redefine';
            eval $format;
        }
        write;
    }
    print '-' x 156 . "\nenter a command:";

    # read the user's command from standard input
    while (<STDIN>) {
        (my $input = $_) =~ s/\s+$//;

        if ($input =~ /^\d+$/) {
            if ( $input && defined $opt[$input-1] ) {
                return $opt[$input-1]->[1]
            } else {
                print "\nThat number is not an option.
Please enter a valid number.\n:";
            }
        } elsif ($input =~ /^n/i || $input =~ /next/i ) {
            return options_cycle($start + OPTION_CYCLE, $stop + OPTION_CYCLE, $msg,
                                 $feat, $directions, $best_guess, @opt)
        } elsif ($input =~ /^p/i || $input =~ /prev/i ) {
            return options_cycle($start - OPTION_CYCLE, $stop - OPTION_CYCLE, $msg,
                                 $feat, $directions, $best_guess, @opt)
        } elsif ( $input =~ /^s/i || $input =~ /skip/i ) { return 'skip' }
        elsif ( $input =~ /^a/i || $input =~ /auto/i ) { return 'auto' }
        elsif ( $input =~ /^r/i || $input =~ /reset/i ) {
            return manual_curation($feat, $best_guess, \@RETURN );
        } elsif ( $input =~ /^f/i || $input =~ /find/i ) {
            my ($query, @query_results);

            if ($input =~ /(?:^f|find)\s+?(.*?)$/) { $query = $1; }
            else {
                #do a SO search
                print "Type your search query\n:";
                while (<STDIN>) { chomp($query = $_); last }
            }

            for (keys(%$TYPE_MAP), keys(%$SYN_MAP)) {
                SO_fuzzy_match($_, $query, '', $_, \@query_results, '(?i)');
            }

            return manual_curation($feat, $best_guess, \@query_results);
        } elsif ( $input =~ /^i/i || $input =~ /input/i ) {
            #NML fast input for later
            #my $query;
            #if ($input =~ /(?:^i|input)\s+?(.*?)$/) { $query = $1 };

            #manual input
            print "Type the term you want to use\n:";
            while (<STDIN>) {
                chomp(my $input = $_);

                if (! $TYPE_MAP->{$input}) {
                    print "\"$input\" doesn't appear to be a valid SO term. Are ".
                          "you sure you want to use it? (y or n)\n:";

                    while (<STDIN>) {
                        chomp(my $choice = $_);

                        if ($choice eq 'y') {
                            print "\nWould you like to save your preference for " .
                                  "future use (so you don't have to redo manual " .
                                  "curation for this feature every time you run " .
                                  "the converter)? (y or n)\n";

                            #NML: all these while loops are a mess. Really should condense it.
                            while (<STDIN>) {
                                chomp(my $choice = $_);

                                if ($choice eq 'y') {
                                    curation_save($feat, $input);
                                    return $input;
                                } elsif ($choice eq 'n') {
                                    return $input
                                } else {
                                    print "\nDidn't recognize that command. Please " .
                                          "type y or n.\n:"
                                }
                            }
                        } elsif ($choice eq 'n') {
                            return options_cycle($start, $stop, $msg, $feat,
                                                 $directions, $best_guess, @opt)
                        } else {
                            print "\nDidn't recognize that command. Please " .
                                  "type y or n.\n:"
                        }
                    }
                } else {
                    print "\nWould you like to save your preference for " .
                          "future use (so you don't have to redo manual " .
                          "curation for this feature every time you run " .
                          "the converter)? (y or n)\n";

                    #NML: all these while loops are a mess. Really should condense it.
                    while (<STDIN>) {
                        chomp(my $choice = $_);

                        if ($choice eq 'y') {
                            curation_save($feat, $input);
                            return $input;
                        } elsif ($choice eq 'n') {
                            return $input
                        } else {
                            print "\nDidn't recognize that command. Please " .
                                  "type y or n.\n:"
                        }
                    }
                }
            }
        } else {
            print "\nDidn't recognize that command. Please re-enter your choice.\n:"
        }
    }
}

sub GenBank_entry {
    my ($f, $delimiter, $num) = @_;
    $delimiter ||= "\n";

    my $entry =
        ($num ? ' [1] ' : ' ' x 5) . $f->primary_tag .
        ($num
            ? ' ' x (12 - length $f->primary_tag ) . ' [2] '
            : ' ' x (15 - length $f->primary_tag)
        ) .
        $f->start.'..'.$f->end . "$delimiter";

    if ($num) {
        words_tag($f, \$entry);
    } else {
        for my $tag ($f->all_tags) {
            for my $val ( $f->each_tag_value($tag) ) {
                $entry .= ' ' x 20;
                #$entry .= "/$tag=\"$val\"$delimiter";
                $entry .= $val eq '_no_value' ?
"/$tag$delimiter" : "/$tag=\"$val\"$delimiter"; } } } return $entry; } sub gff_validate { warn "Validating GFF file\n" if $DEBUG; my @feat = @_; my (%parent2child, %all_ids, %descendants, %reserved); for my $f (@feat) { for my $aTags (['Parent', \%parent2child], ['ID', \%all_ids]) { map { push @{$$aTags[1]->{$_}}, $f } $f->get_tag_values($$aTags[0]) if $f->has_tag($$aTags[0]); } } if ($SO_FILE) { while (my ($parentID, $aChildren) = each %parent2child) { parent_validate($parentID, $aChildren, \%all_ids, \%descendants, \%reserved); } } id_validate(\%all_ids, \%reserved); } sub parent_validate { my ($parentID, $aChildren, $hAll, $hDescendants, $hReserved) = @_; my $aParents = $hAll->{$parentID}; map { my $child = $_; $child->add_tag_value( validation_error => "feature tried to add Parent tag, but no Parent found with ID $parentID" ); my %parents; map { $parents{$_} = 1 } $child->get_tag_values('Parent'); delete $parents{$parentID}; my @parents = keys %parents; $child->remove_tag('Parent'); unless ($child->has_tag('ID')) { my $id = gene_name($child); $child->add_tag_value('ID', $id); push @{$hAll->{$id}}, $child } $child->add_tag_value('Parent', @parents) if @parents; } @$aChildren and return unless scalar(@$aParents); my $par = join(',', map { $_->primary_tag } @$aParents); warn scalar(@$aParents)." POSSIBLE PARENT(S): $par" if $DEBUG; #NML manual curation needs to happen here my %parentsToRemove; CHILD: for my $child (@$aChildren) { my $childType = $child->primary_tag; warn "WORKING ON $childType at ".$child->start.' 
to '.$child->end if $DEBUG; for (my $i = 0; $i < scalar(@$aParents); $i++) { my $parent = $aParents->[$i]; my $parentType = $parent->primary_tag; warn "CHECKING $childType against $parentType" if $DEBUG; #cache descendants so we don't have to do repeat searches unless ($hDescendants->{$parentType}) { for my $term ($ONTOLOGY->find_terms( -name => $parentType ) ) { map { $hDescendants->{$parentType}{$_->name}++ } $ONTOLOGY->get_descendant_terms($term); } # NML: hopefully temporary fix. # SO doesn't consider exon/CDS to be a child of mRNA # even though common knowledge dictates that they are # This cheat fixes the false positives for now if ($parentType eq 'mRNA') { $hDescendants->{$parentType}{'exon'} = 1; $hDescendants->{$parentType}{'CDS'} = 1; } } warn "\tCAN $childType at " . $child->start . ' to ' . $child->end . " be a child of $parentType?" if $DEBUG; if ($hDescendants->{$parentType}{$childType}) { warn "\tYES, $childType can be a child of $parentType" if $DEBUG; #NML need to deal with multiple children matched to multiple different #parents. This model only assumes the first parent id that matches a child will #be the reserved feature. $hReserved->{$parentID}{$parent}{'parent'} = $parent; push @{$hReserved->{$parentID}{$parent}{'children'}}, $child; #mark parent for later removal from all IDs #so we don't accidentally change any parents $parentsToRemove{$i}++; next CHILD; } } #NML shouldn't have to check this; somehow child can lose Parent #it's happening W3110 #need to track this down if ( $child->has_tag('Parent') ) { warn "\tNO, @{[$child->primary_tag]} cannot be a child of $parentID" if $DEBUG; my %parents; map { $parents{$_} = 1 } $child->get_tag_values('Parent'); delete $parents{$parentID}; my @parents = keys %parents; warn 'VALIDATION ERROR '.$child->primary_tag." at ".$child->start . ' to ' . $child->end . 
" cannot be a child of ID $parentID" if $DEBUG; $child->add_tag_value( validation_error => "feature cannot be a child of $parentID"); $child->remove_tag('Parent'); unless ($child->has_tag('ID')) { my $id = gene_name($child); $child->add_tag_value('ID', $id); push @{$hAll->{$id}}, $child } $child->add_tag_value('Parent', @parents) if @parents; } } #delete $aParents->[$_] for keys %parentsToRemove; splice(@$aParents, $_, 1) for keys %parentsToRemove; } sub id_validate { my ($hAll, $hReserved) = @_; for my $id (keys %$hAll) { #since 1 feature can have this id, #let's just shift it off and uniquify #the rest (unless it's reserved) shift @{$hAll->{$id}} unless $hReserved->{$id}; for my $feat (@{$hAll->{$id}}) { id_uniquify(0, $id, $feat, $hAll); } } for my $parentID (keys %$hReserved) { my @keys = keys %{$hReserved->{$parentID}}; shift @keys; for my $k (@keys) { my $parent = $hReserved->{$parentID}{$k}{'parent'}; my $aChildren= $hReserved->{$parentID}{$k}{'children'}; my $value = id_uniquify(0, $parentID, $parent, $hAll); for my $child (@$aChildren) { my %parents; map { $parents{$_}++ } $child->get_tag_values('Parent'); $child->remove_tag('Parent'); delete $parents{$parentID}; $parents{$value}++; my @parents = keys %parents; $child->add_tag_value('Parent', @parents); } } } } sub id_uniquify { my ($count, $value, $feat, $hAll) = @_; warn "UNIQUIFYING $value" if $DEBUG; if (! $count) { $feat->add_tag_value(Alias => $value); $value .= ('.' . $feat->primary_tag) } elsif ($count == 1) { $value .= ".$count" } else { chop $value; $value .= $count } $count++; warn "ENDED UP WITH $value" if $DEBUG; if ( $hAll->{$value} ) { warn "$value IS ALREADY TAKEN" if $DEBUG; id_uniquify($count, $value, $feat, $hAll); } else { #warn "something's breaking ".$feat->primary_tag.' at '.$feat->start.' to '.$feat->end; $feat->remove_tag('ID'); $feat->add_tag_value('ID', $value); push @{$hAll->{$value}}, $value; } $value; } sub conf_read { print "\nCannot read $CONF. 
Change file permissions and retry, " .
        "or enter another file\n" and conf_locate() unless -r $CONF;

    print "\nCannot write $CONF. Change file permissions and retry, " .
        "or enter another file\n" and conf_locate() unless -w $CONF;

    $YAML = LoadFile($CONF);
}

sub conf_create {
    my ($path, $input) = @_;

    print "Cannot write to $path. Change directory permissions and retry " .
        "or enter another save path\n" and conf_locate() unless -w $path;

    $CONF = $input;

    open(FH, '>', $CONF);
    close(FH);
    conf_read();
}

sub conf_write { DumpFile($CONF, $YAML) }

sub conf_locate {
    print "\nEnter the location of a previously saved config, or a new " .
        "path and file name to create a new config (this step is " .
        "necessary to save any preferences)";

    print "\n\nenter a command:";

    # read the config path from standard input
    while (<STDIN>) {
        chomp(my $input = $_);
        my ($fn, $path, $suffix) = fileparse($input, qr/\.[^.]*/);

        if (-e $input && (! -d $input)) {
            print "\nReading $input...\n";
            $CONF = $input;

            conf_read();
            last;
        } elsif (! -d $input && $fn.$suffix) {
            print "Creating $input...\n";
            conf_create($path, $input);
            last;
        } elsif (-e $input && -d $input) {
            print "You only entered a directory. " .
                "Please enter BOTH a directory and filename\n";
        } else {
            print "$input does not appear to be a valid path. Please enter a " .
                "valid directory and filename\n";
        }
        print "\nenter a command:";
    }
}

sub curation_save {
    my ($feat, $input) = @_;

    #my $error = "Enter the location of a previously saved config, or a new " .
    #    "path and file name to create a new config (this step is " .
    #    "necessary to save any preferences)\n";

    if (!$CONF) {
        print "\n\n";
        conf_locate();
    } elsif (! -e $CONF) {
        print "\n\nThe config file you have chosen doesn't exist.\n";
        conf_locate();
    } else { conf_read() }

    my $entry = GenBank_entry($feat, "\r", 1);

    my $msg = "Term entered: $input";
    my $directions = "Please select any/all tags that provide evidence for the term you have entered. You may enter multiple tags by separating them by commas/dashes (e.g. 1,3,5-7).
For tags with more than one word value (e.g. 'note'), you have the option of either selecting the entire note as evidence, or specific keywords. If a tag has multiple keywords, they will be tagged alphabetically for selection. To select a specific keyword in a tag field, you must enter the tag number followed by the keyword letter (e.g. 3a). Multiple keywords may be selected by entering each letter separated by commas/dashes (e.g. 3b,f,4a-c). The more tags you select, the more specific the GenBank entry will have to be to match your curation. To match the GenBank entry exactly as it appears, type every number (start-end), or just type 'all'. Remember, once the converter saves your preference, you will no longer be prompted to choose a feature type for any matching entries until you edit the curation.ini file.";

    my $msg_copy = $msg;
    my $dir_copy = $directions;

    my $format =
        "format STDOUT = \n" .
        '-' x 156 . "\n" .
        '^' . '<' x 77 . '| Directions:' . "\n" .
        '$msg_copy' . "\n" .
        '-' x 156 . "\n" .
        ' ' x 78 . "|\n" .
        '^' . '<' x 77 . '| ^' . '<' x 75 . '~' . "\n" .
        '$entry' . ' ' x 74 . '$dir_copy,' . "\n" .
        (' ' x 15 . '^' . '<' x 62 . '| ^' . '<' x 75 . '~' . "\n" .
         ' ' x 15 . '$entry,' . ' ' x 58 . '$dir_copy,' . "\n") x 20 . ".\n";

    {
        # eval throws redefined warning that breaks formatting.
        # Turning off warnings just for the eval to fix this.
        no warnings 'redefine';
        eval $format;
    }

    write;
    print '-' x 156 . "\nenter a command:";

    my @tags = words_tag($feat);

    my $final = {};
    my $choices;

    # read the user's tag selection from standard input
    while (<STDIN>) {
        chomp(my $choice = $_);

        if (scalar(keys %$final) && $choice =~ /^y/i) { last }
        elsif (scalar(keys %$final) && $choice =~ /^n/i) { curation_save($feat, $input) }
        elsif (scalar(keys %$final)) { print "\nInvalid selection.
Please try again\n"; } elsif ($choice eq 'all') { $choice = ''; for (my $i=1; $i < scalar(@tags); $i++) { $choice .= "$i,"; } chop $choice; } #print "CHOICE [$choice]"; my @selections; for (split(/(?<=\w)[^[:alnum:]\-]+(?=\d)/, $choice)) { if ($_ =~ /(\d+)(?:\D*)-(\d+)(.*)/) { for ($1..$2) { push @selections, $_ } my $dangling_alphas = $3; alpha_expand($dangling_alphas, \@selections); } else { alpha_expand($_, \@selections); } } foreach my $numbers (@selections) { my @c = split(/(?=[\w])/, $numbers); s/\W+//g foreach @c; my $num; { $^W = 0; $num = 0 + shift @c; } my $tag = $tags[$num]; if (ref $tag && scalar(@c)) { my $no_value; foreach (@c) { if (defined $tag->{$_}) { $choices .= "${num}[$_] "; my ($t,$v) = @{$tag->{$_}}; push @{${$final->{$input}}[0]{$t}}, $v; } else { $no_value++ } } if ($no_value) { _selection_add($tag,$final,$input,\$choices,$num); #my ($t,$v) = @{$tag->{'all'}}; #unless (defined ${$final->{$input}}[0]{$t}) { #$choices .= "$num, "; #push @{${$final->{$input}}[0]{$t}}, $v #} } $choices = substr($choices, 0, -2); $choices .= ', '; } elsif (ref $tag) { _selection_add($tag,$final,$input,\$choices,$num); #my ($t,$v) = @{$tag->{'all'}}; #unless (defined ${$final->{$input}}[0]{$t}) { #$choices .= "$num, "; #push @{${$final->{$input}}[0]{$t}}, $v #} } } $choices = substr($choices, 0, -2) if $choices; if ($final) { print "\nYou have chosen the following tags:\n$choices\n"; print "This will be written to the config file as:\n"; print Dump $final; print "\nIs this correct? (y or n)\n"; } else { print "\nInvalid selection. 
Please try again\n" } } push @{$YAML->{$input}}, $final->{$input}; conf_write(); } # words_tag() splits each tag value string into multiple words so that the # user can select the parts he/she wants to use for curation # it can tag 702 (a - zz) separate words; this should be enough sub words_tag { my ($feat, $entry) = @_; my @tags; @tags[1,2] = ({'all' => ['primary_tag', $feat->primary_tag]}, {'all' => ['location', $feat->start.'..'.$feat->end]}); my $i = 3; foreach my $tag ($feat->all_tags) { foreach my $value ($feat->each_tag_value($tag)) { my ($string, $tagged_string); my @words = split(/(?=\w+?)/, $value); my $pos = 0; foreach my $word (@words) { (my $sanitized_word = $word) =~ s/\W+?//g; $string .= $word; my $lead = int($pos/ALPHABET_DIVISOR); my $lag = $pos % ALPHABET_DIVISOR; my $a = $lead ? ${(ALPHABET)}[$lead-1] : ''; $a .= $lag ? ${(ALPHABET)}[$lag] : 'a'; $tagged_string .= " ($a) $word"; $tags[$i]{$a} = [$tag, $sanitized_word]; $pos++; } $value = $tagged_string if scalar(@words) > 1; $$entry .= "[$i] /$tag=\"$value\"\r"; $tags[$i]{'all'} = [$tag, $string]; } $i++; } return @tags; } sub alpha_expand { my ($dangling_alphas, $selections) = @_; if (defined($dangling_alphas) && $dangling_alphas =~ /(\d)*([[:alpha:]]+)-([[:alpha:]]+)/) { my $digit = $1; push @$selections, $digit if $digit; my $start = $2; my $stop = $3; my @starts = split('', $start); my @stops = split('', $stop); my ($final_start, $final_stop); for ([\$final_start, \@starts], [\$final_stop, \@stops]) { my ($final, $splits) = @$_; my $int = ${(ALPHABET_TO_NUMBER)}{$$splits[0]}; my $rem; if ($$splits[1]) { $rem = ${(ALPHABET_TO_NUMBER)}{$$splits[1]}; $int++ } else { $rem = $int; $int = 0; } $$final = $int * ALPHABET_DIVISOR; $$final += $rem; } my $last_number = pop @$selections; for my $pos ($final_start..$final_stop) { my $lead = int($pos/ALPHABET_DIVISOR); my $lag = $pos % ALPHABET_DIVISOR; my $alpha = $lead ? ${(ALPHABET)}[$lead-1] : ''; $alpha .= $lag ? 
                ${(ALPHABET)}[$lag] : 'a';
            push @$selections, $last_number.$alpha;
        }
    } elsif (defined($dangling_alphas)) {
        if ($dangling_alphas =~ /^\d/) {
            push @$selections, $dangling_alphas;
        } elsif ($dangling_alphas =~ /^\D/) {
            #print "$dangling_alphas ".Dumper @$selections;
            my $last_number = pop @$selections;
            $last_number ||= '';
            push @$selections, $last_number.$dangling_alphas;
            #$$selections[-1] .= $dangling_alphas;
        }
    }
}

sub _selection_add {
    my ($tag, $final, $input, $choices, $num) = @_;
    my ($t,$v) = @{$tag->{'all'}};
    unless (defined ${$final->{$input}}[0]{$t}) {
        $$choices .= "$num, ";
        push @{${$final->{$input}}[0]{$t}}, $v
    }
}
bp_search2tribe100644000766000024 512214006427353 16751 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
# Author: Jason Stajich
# Description: Turn SearchIO parseable report(s) into a TRIBE matrix
#

=head1 NAME

bp_search2tribe - Turn SearchIO parseable report(s) into TRIBE matrix

=head1 SYNOPSIS

Usage:
  bp_search2tribe [-o outputfile] [-f reportformat] [-w/--weight] file1 file2 ..

=head1 DESCRIPTION

This script is probably too slow for most people's uses. It is better
to use something like scripts/searchio/fastam9_to_table, -m 9 output
from BLAST, or the blast2table from the BLAST O'Reilly book to get a
tabular output from these programs and then feed the table into MCL
with the mcxdeblast script and the --m9 option.

This script will turn a protein Search report (BLASTP, FASTP, SSEARCH)
into a Markov Matrix for TribeMCL clustering.

The options are:

   -o filename          - the output filename [default STDOUT]
   -f format            - search result format (blast, fasta)
                          (ssearch is fasta format). default is blast.
   -w or --weight VALUE - Change the default weight for E(0.0) hits
                          to VALUE (default=200 (i.e. 1e-200) )
   -h                   - this help menu

Additionally specify the filenames you want to process on the
command-line. If no files are specified then STDIN input is assumed.
You specify this by doing: bp_search2tribe E<lt> file1 file2 file3

=head1 AUTHOR

Jason Stajich, jason-at-bioperl-dot-org

=cut

use strict;
use warnings;
use Bio::SearchIO;
use Bio::SearchIO::FastHitEventBuilder; # employ a speedup
use Getopt::Long;

use constant DEFAULT_WEIGHT => 200;
use constant DEFAULT_FORMAT => 'blast';

my ($format,@files,$output,$weight);
$weight = DEFAULT_WEIGHT; # default weight value
$format = DEFAULT_FORMAT;
my ($help);
GetOptions(
    'f|format:s' => \$format,
    'o|output:s' => \$output,
    'w|weight:i' => \$weight,
    'h|help'     => sub{ exec('perldoc',$0); exit(0) },
);

my $outfh;
if( $output ) {
    open $outfh, '>', $output or die "Could not write output file '$output': $!\n";
} else {
    $outfh = *STDOUT;
}

my $parser = new Bio::SearchIO(-format => $format, -fh => \*ARGV);
# Let's throw away HSP events
$parser->attach_EventHandler(new Bio::SearchIO::FastHitEventBuilder);

while( my $report = $parser->next_result ) {
    my $q = $report->query_name;
    while( my $hit = $report->next_hit ) {
        my $evalue = $hit->significance;
        $evalue =~ s/^e/1e/i;
        if( $evalue == 0 ) {
            $evalue = "1e-$weight";
        } else {
            $evalue = sprintf("%e",$evalue);
        }
        print $outfh join("\t",$q,$hit->name, split('e-',$evalue)), "\n";
    }
}
Compatible.t100644000766000024 346014006427353 16631 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tree
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 5, -requires_module => 'Set::Scalar');

    use_ok('Bio::Tree::Compatible');
    use_ok('Bio::TreeIO');
}

# these tests are done with direct access to Bio::Tree::Compatible methods,
# instead of via creating a Bio::Tree::Compatible->new() object or similar...
# the docs seem to indicate that is normally possible? TODO?
my $in = Bio::TreeIO->new(-format => 'newick',
                          -fh => \*DATA);

# the common labels of (((A,B)C,D),(E,F,G)); and ((A,B)H,E,(J,(K)G)I);
# are [A,B,E,G]

my $t1 = $in->next_tree;
my $t2 = $in->next_tree;
my $common = Bio::Tree::Compatible::common_labels($t1,$t2);
my $labels = Set::Scalar->new(qw(A B E G));
ok($common->is_equal($labels));

# the topological restrictions of (((A,B)C,D),(E,F,G)); and
# ((A,B)H,E,(J,(K)G)I); to their common labels, [A,B,E,G], are,
# respectively, ((A,B),(E,G)); and ((A,B),E,(G));

Bio::Tree::Compatible::topological_restriction($t1,$common);
Bio::Tree::Compatible::topological_restriction($t2,$common);
my $t3 = $in->next_tree;
my $t4 = $in->next_tree;
# ok($t1->is_equal($t3)); # is_equal method missing in Bio::Tree::Tree
# ok($t2->is_equal($t4)); # is_equal method missing in Bio::Tree::Tree

# the topological restrictions of (((A,B)C,D),(E,F,G)); and
# ((A,B)H,E,(J,(K)G)I); to their common labels, [A,B,E,G], are
# compatible

my ($incompat, $ilabels, $inodes) = Bio::Tree::Compatible::is_compatible($t3,$t4);
ok(!$incompat);

# (((B,A),C),D); and ((A,(D,B)),C); are incompatible

my $t5 = $in->next_tree;
my $t6 = $in->next_tree;
($incompat, $ilabels, $inodes) = Bio::Tree::Compatible::is_compatible($t5,$t6);
ok($incompat);

__DATA__
(((A,B)C,D),(E,F,G));
((A,B)H,E,(J,(K)G)I);
((A,B),(E,G));
((A,B),E,(G));
(((B,A),C),D);
((A,(D,B)),C);
TreeIO000755000766000024 014006427353 15351 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tree
nhx.t100644000766000024 741114006427353 16476 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tree/TreeIO
# -*-Perl-*- Test Harness script for Bioperl
# $Id: TreeIO.t 14580 2008-03-01 17:01:30Z cjfields $

use strict;

BEGIN {
    use Bio::Root::Test;
    use File::Temp qw(tempfile);

    test_begin(-tests => 19);

    use_ok('Bio::TreeIO');
}

my $verbose = 0; #test_debug();

my $nl = qr/\n/;
my $cr = qr/\r/;

my $treeio = Bio::TreeIO->new(
    -format  => 'nhx',
    -verbose => $verbose,
    -file    => test_input_file('test.nhx'),
);
my $tree;
ok($treeio);
$tree
= $treeio->next_tree; isa_ok($tree, 'Bio::Tree::TreeI'); my @nodes = $tree->get_nodes; is(@nodes, 12, "Total Nodes"); #print STDERR "TREE: ".$tree->as_text('nhx')."\n"; my $adhy = $tree->find_node('ADHY'); is($adhy->branch_length, 0.1); is(($adhy->get_tag_values('S'))[0], 'nematode'); is(($adhy->get_tag_values('E'))[0], '1.1.1.1'); test_roundtrip('((a,b),c);','simple newick'); test_roundtrip('((x:0.05,y:0.06),a:0.1[&&NHX:G=dummy]);','bug 1471 test'); test_roundtrip('((x:0.05[&&NHX:label=x],y:0.06)[&&NHX:label=int_node],a:0.1[&&NHX:label=a]);','different combinations of label, NHX, and branch length'); test_roundtrip('(a:1,b:2,c:3,d:4)TEST:1.2345;','doot node branch length'); test_roundtrip('(A:0.1,B:0.2,(C:0.3,D:0.4)E:0.5)F;','Example from Wikipedia'); test_roundtrip('(((ADH2:0.1[&&NHX:E=1.1.1.1:S=human],ADH1:0.11[&&NHX:E=1.1.1.1:S=human]):0.05[&&NHX:B=100:D=Y:E=1.1.1.1:S=Primates],ADHY:0.1[&&NHX:E=1.1.1.1:S=nematode],ADHX:0.12[&&NHX:E=1.1.1.1:S=insect]):0.1[&&NHX:D=N:E=1.1.1.1:S=Metazoa],(ADH4:0.09[&&NHX:E=1.1.1.1:S=yeast],ADH3:0.13[&&NHX:E=1.1.1.1:S=yeast],ADH2:0.12[&&NHX:E=1.1.1.1:S=yeast],ADH1:0.11[&&NHX:E=1.1.1.1:S=yeast]):0.1[&&NHX:S=Fungi])[&&NHX:D=N:E=1.1.1.1];','ADH NHX tree'); test_roundtrip('(gene1_Hu[&&NHX:S=Hu_Homo_sapiens],(gene2_Hu[&&NHX:S=Hu_Homo_sapiens],gene2_Mu[&&NHX:S=Mu_Mus_musculus]));','notung nhx example http://www.cs.cmu.edu/~aiton/split/Manual-2.6.master014.html'); test_roundtrip('(cow_gene1,(mouse_gene2,cow_gene2)[&&NHX:B=100]);','notung nhx bootstrap http://www.cs.cmu.edu/~aiton/split/Manual-2.6.master014.html'); # Read in some larger trees from data files... test_roundtrip(read_file(test_input_file('nhx-bacteria.nhx')),'r-sig-phylo mailing list http://www.mail-archive.com/r-sig-phylo@r-project.org/msg00516.html'); test_roundtrip(read_file(test_input_file('ex1.nucl.nhx')),'treebest example nhx'); # Note: these files aren't reproduced exactly in their online form. 
We need to round-trip them once # before including them in the test, because the ordering of annotation keys is not a well-defined # part of the NHX format. Since nhx.pm sorts the keys before output, once they've been through # one time, the ordering becomes stable. test_roundtrip(read_file(test_input_file('wellcome_tol.nhx')),'Wellcome Trust ToL (from http://iphylo.blogspot.com/2009/02/thoughts-on-wellcome-interactive-tree.html)'); # Uncomment to run (takes a long time!!) #test_roundtrip(read_file(test_input_file('tol-2010-02-18.nhx')),'Tolweb.org converted to NHX'); test_roundtrip(read_file(test_input_file('biorecipe.nhx')),'Biorecipes NHX file (http://www.biorecipes.com/Orthologues/StatusPage/pics/TreeEukaryota.nt)'); sub test_roundtrip { my $string = shift; my $desc = shift; my $in = Bio::TreeIO->new(-format => 'nhx', -string => $string, -verbose => $verbose ); my $t = $in->next_tree; my $out; if (defined $t) { $out = $t->as_text('nhx'); } $desc = "Roundtrip: $desc"; return is($out,$string,$desc); } sub read_file { my $file = shift; local $/=undef; my $string; open my $IN, '<', $file or die "Could not read file '$file': $!\n"; binmode $IN; $string = <$IN>; close $IN; $string =~ s/$nl//g; $string =~ s/$cr//g; # For files with Windows line-endings #print STDERR "STR: $string\n"; return $string; } Sigcleave.t100644000766000024 174514006427353 16661 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 18); use_ok('Bio::PrimarySeq'); use_ok('Bio::Tools::Sigcleave'); } #load n-terminus of MGR5_HUMAN as test seq my $protein = "MVLLLILSVLLLKEDVRGSAQSSERRVVAHMPGDIIIGALFSVHHQPTVDKVHERKCGAVREQYGI"; ok my $seq= Bio::PrimarySeq->new(-seq => $protein); ok my $sig = Bio::Tools::Sigcleave->new(); ok $sig->seq($seq); ok my $sout = $sig->seq; is $sout->seq, $protein; is $sig->threshold, 3.5; is $sig->threshold(5), 5; is $sig->matrix, 'eucaryotic'; is 
$sig->matrix('procaryotic'), 'procaryotic';
is $sig->matrix('eucaryotic'), 'eucaryotic';

like $sig->pretty_print, qr/Maximum score 7/;
ok my %results = $sig->signals;

is $results{9}, 5.2, "unable to get raw sigcleave results";

$sig = Bio::Tools::Sigcleave->new(-seq=>$protein, -threshold=>5);
ok %results = $sig->signals;
is $results{9}, 5.2, "unable to get raw sigcleave results";
is $sig->result_count, 5;
asciitree.t100644000766000024 156014006427353 16602 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO
# -*-Perl-*- Test Harness script for Bioperl
# $Id$
# `make test'. After `make install' it should work as `perl test.t'

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 2);

    use_ok('Bio::SeqIO');
}

my $verbose = test_debug();

# asciitree is a write-only format
my $in = Bio::SeqIO->new(-format => 'genbank',
                         -verbose => $verbose,
                         -file => test_input_file('AE003644_Adh-genomic.gb'));
my $seq = $in->next_seq;

my $out_file = test_output_file();
my $out = Bio::SeqIO->new(-file => ">".$out_file,
                          -verbose => $verbose,
                          -format => 'asciitree');
$out->write_seq($seq);

# this is a bug and is failing on some systems like IRIX (not sure why, maybe
# File::Temp?)
if (-s $out_file) {
    ok(1, "File exists, has contents on ".$^O);
} else {
    TODO: {
        local $TODO = "Output doesn't exist on ".$^O;
        ok(-s $out_file);
    }
}
locuslink.t100644000766000024 506514006427353 16641 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 26,
               -requires_module => 'Graph::Directed');

    use_ok('Bio::SeqIO::locuslink');
    use_ok('Bio::SeqFeature::Generic');
    use_ok('Bio::SeqFeature::AnnotationAdaptor');
}

my $seqin = Bio::SeqIO->new(-file => test_input_file('test.locuslink'),
                            -format => 'locuslink');
ok $seqin;
isa_ok($seqin, 'Bio::SeqIO');
my $seqout = Bio::SeqIO->new(-file => ">".test_output_file(),
                             -format => 'embl');

# process and write to output
my @seqs = ();

while(my $seq = $seqin->next_seq()) {
    push(@seqs, $seq);

    # create an artificial feature to stick the annotation on
    my $fea = Bio::SeqFeature::Generic->new(-start => 1, -end => 9999,
                                            -strand => 1,
                                            -primary => 'annotation');
    my $ac = Bio::SeqFeature::AnnotationAdaptor->new(-feature => $fea);
    foreach my $k ($seq->annotation->get_all_annotation_keys()) {
        foreach my $ann ($seq->annotation->get_Annotations($k)) {
            next unless $ann->isa("Bio::Annotation::SimpleValue");
            $ac->add_Annotation($ann);
        }
    }
    $seq->add_SeqFeature($fea);
    $seqout->write_seq($seq);
}

is (scalar(@seqs), 2);

is ($seqs[0]->desc, "amiloride binding protein 1 (amine oxidase (copper-containing))");
is ($seqs[0]->accession, "26");
is ($seqs[0]->display_id, "ABP1");
is ($seqs[0]->species->binomial, "Homo sapiens");

my @dblinks = $seqs[0]->annotation->get_Annotations('dblink');
my %counts = map { ($_->database(),0) } @dblinks;
foreach (@dblinks) {
    $counts{$_->database()}++;
}

is ($counts{GenBank}, 11);
is ($counts{RefSeq}, 4);
is ($counts{UniGene}, 1);
is ($counts{Pfam}, 1);
is ($counts{STS}, 2);
is ($counts{MIM}, 1);
is ($counts{PUBMED}, 6);
is (scalar(@dblinks), 27);

is ($seqs[1]->desc, "v-abl Abelson murine leukemia viral 
oncogene homolog 2 (arg, Abelson-related gene)"); is ($seqs[1]->display_id, "ABL2"); my $ac = $seqs[1]->annotation; my @keys = $ac->get_all_annotation_keys(); is (scalar(@keys), 19); my ($cmt) = $ac->get_Annotations('comment'); is (length($cmt->text), 403); my @isoforms = qw(a b); foreach ($ac->get_Annotations('PRODUCT')) { is ($_->value, "v-abl Abelson murine leukemia viral oncogene homolog 2 isoform ". shift(@isoforms)); } my @goann = (); foreach my $k (@keys) { foreach my $ann ($ac->get_Annotations($k)) { next unless $ann->isa("Bio::Ontology::TermI"); push(@goann, $ann); } } is (scalar(@goann), 4); @goann = sort { $a->as_text() cmp $b->as_text() } @goann; is ($goann[2]->as_text, "cellular component|cytoplasm|"); MultiFile.t100644000766000024 131614006427353 16523 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 5); use_ok 'Bio::SeqIO::MultiFile'; } my $verbose = test_debug(); # Test multiple files, with a specified format ok my $mf = Bio::SeqIO::MultiFile->new( -format => 'Fasta' , -verbose => $verbose, -files => [ test_input_file('multi_1.fa'), test_input_file('multi_2.fa')], ); my $count = 0; while (my $seq = $mf->next_seq() ) { $count++; } is $count, 12; # Automatically determine format ok $mf = Bio::SeqIO::MultiFile->new( -verbose => $verbose, -files => [ test_input_file('multi_1.fa'), test_input_file('multi_2.fa')], ); is $mf->format, 'fasta'; metafasta.t100644000766000024 76114006427353 16561 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 6); use_ok('Bio::SeqIO::metafasta'); } my $verbose = test_debug(); my $io = Bio::SeqIO->new(-format => 'metafasta', -verbose => $verbose, -file => test_input_file('test.metafasta')); isa_ok($io, 'Bio::SeqIO'); ok(my $seq = $io->next_seq); isa_ok($seq, 'Bio::Seq::Meta'); is($seq->seq, 
"ABCDEFHIJKLMNOPQRSTUVWXYZ"); is($seq->display_id,'test'); PrimaryQual.t100644000766000024 1022114006427353 16662 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 70); use_ok('Bio::SeqIO'); use_ok('Bio::Seq::Quality'); use_ok('Bio::Seq::PrimaryQual'); } my $DEBUG = test_debug(); my $verbose = -1 unless $DEBUG; # redirect STDERR to STDOUT open (STDERR, ">&STDOUT"); my $string_quals = "10 20 30 40 50 40 30 20 10"; print("Quals are $string_quals\n") if($DEBUG); ok my $qualobj = Bio::Seq::PrimaryQual->new( -qual => $string_quals, -id => 'QualityFragment-12', -accession_number => 'X78121', ); is $qualobj->display_id, 'QualityFragment-12'; is $qualobj->accession_number, 'X78121'; my @q2 = split / /, $string_quals; $qualobj = Bio::Seq::PrimaryQual->new( -qual => \@q2, -primary_id => 'chads primary_id', -desc => 'chads desc', -accession_number => 'chads accession_number', -id => 'chads id', -header => 'chads header' ); is $qualobj->primary_id, 'chads primary_id'; isa_ok $qualobj->qual(), 'ARRAY'; my $newqualstring = "50 90 1000 20 12 0 0"; ok $qualobj->qual($newqualstring); is join(' ', @{$qualobj->qual()}), $newqualstring; my @newqualarray = split / /,$newqualstring; ok $qualobj->qual(\@newqualarray); is join(' ', @{$qualobj->qual()}), $newqualstring; is $qualobj->validate_qual($string_quals ), 1; is $qualobj->validate_qual("" ), 1; is $qualobj->validate_qual("0" ), 1; is $qualobj->validate_qual(undef ), 1; is $qualobj->validate_qual(" " ), 1; is $qualobj->validate_qual("10 20 30 30" ), 1; is $qualobj->validate_qual(" 20 9 5 " ), 1; is $qualobj->validate_qual("+1 9.3 50e-1"), 1; is $qualobj->validate_qual(" 4" ), 1; is $qualobj->validate_qual("chad" ), 0; is $qualobj->validate_qual("10 one" ), 0; ok $qualobj->qual("10 20 30 30"); ok $qualobj->qual("+1 9.3 50e-1"); throws_ok { $qualobj->qual("chad"); } qr/.+/; throws_ok { $qualobj->validate_qual("chad", 1) } 
qr/.+/; ok $qualobj->qual("4 10"); is $qualobj->length(), 2; $qualobj->qual("10 20 30 40 50 40 30 20 10"); ok my @subquals = @{$qualobj->subqual(3,6);}; is @subquals, 4; is "30 20 10", join(' ',@{$qualobj->subqual(7,9)}); throws_ok { $qualobj->subqual(-1,6); } qr/EX/; ok $qualobj->subqual(1,6); ok $qualobj->subqual(1,9); throws_ok { $qualobj->subqual(9,1); } qr/EX/; is $qualobj->display_id(), "chads id"; is $qualobj->display_id("chads new display_id"), "chads new display_id"; is $qualobj->display_id(), "chads new display_id"; is $qualobj->accession_number(), "chads accession_number"; is $qualobj->accession_number("chads new accession_number"), "chads new accession_number"; is $qualobj->accession_number(), "chads new accession_number"; is $qualobj->primary_id(), "chads primary_id"; is $qualobj->primary_id("chads new primary_id"), "chads new primary_id"; is $qualobj->primary_id(), "chads new primary_id"; is $qualobj->desc(), "chads desc"; is $qualobj->desc("chads new desc"), "chads new desc"; is $qualobj->desc(), "chads new desc"; is $qualobj->display_id(), "chads new display_id"; is $qualobj->display_id("chads new id"), ("chads new id"); is $qualobj->display_id(), "chads new id"; is $qualobj->header(), "chads header"; ok my $in_qual = Bio::SeqIO->new( -file => test_input_file('qualfile.qual'), -format => 'qual', -verbose => $verbose, ); ok my $pq = $in_qual->next_seq(); is $pq->qual()->[99] , '39'; # spot check boundary is $pq->qual()->[100], '39'; # spot check boundary ok my $out_qual = Bio::SeqIO->new( -file => ">".test_output_file(), -format => 'qual', -verbose => $verbose, ); ok $out_qual->write_seq(-source => $pq); ok my $swq545 = Bio::Seq::Quality->new ( -seq => "ATA", -qual => $pq ); ok $out_qual->write_seq(-source => $swq545); ok $in_qual = Bio::SeqIO->new( -file => test_input_file('qualfile.qual'), -format => 'qual', -verbose => $verbose, ); ok my $out_qual2 = Bio::SeqIO->new( -file => ">".test_output_file(), -format => 'qual', -verbose => $verbose, ); 
while ( my $batch_qual = $in_qual->next_seq() ) {
    ok $out_qual2->write_seq(-source => $batch_qual);
}
GOterm.t100644000766000024 747214006427353 16671 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Ontology
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;
use warnings;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 62,
               -requires_module => 'Graph::Directed');

    use_ok('Bio::Ontology::GOterm');
    use_ok('Bio::Ontology::Ontology');
    use_ok('Bio::Annotation::DBLink');
}

my $obj = Bio::Ontology::GOterm->new();

# note that all *dblink* methods are now deprecated...

isa_ok( $obj,"Bio::Ontology::GOterm");

$obj->init();

like( $obj->to_string(), qr'-- GO id:' );

is( $obj->GO_id( "GO:0003947" ), "GO:0003947" );
is( $obj->GO_id(), "GO:0003947" );

is( $obj->get_dbxrefs(), 0 );

my @dblinks = (Bio::Annotation::DBLink->new(-primary_id => 'dAA'),
               Bio::Annotation::DBLink->new(-primary_id => 'dAB'));

$obj->add_dbxref( -dbxrefs => \@dblinks );
is( scalar($obj->get_dbxrefs()), 2 );
my @df1 = $obj->get_dbxrefs();
is( $df1[ 0 ]->display_text, "dAA" );
is( $df1[ 1 ]->display_text, "dAB" );
is( $obj->get_dbxrefs(), 2 );

my @df2 = $obj->remove_dbxrefs();
is( $df2[ 0 ]->display_text, "dAA" );
is( $df2[ 1 ]->display_text, "dAB" );

is( $obj->get_dbxrefs(), 0 );
is( $obj->remove_dbxrefs(), 0 );

is( $obj->get_secondary_GO_ids(), 0 );

$obj->add_secondary_GO_id( ( "GO:0000000", "GO:1234567" ) );
is( scalar($obj->get_secondary_GO_ids()), 2 );
my @si1 = $obj->get_secondary_GO_ids();
is( $si1[ 0 ], "GO:0000000" );
is( $si1[ 1 ], "GO:1234567" );
is( $obj->get_secondary_GO_ids(), 2 );

my @si2 = $obj->remove_secondary_GO_ids();
is( $si2[ 0 ], "GO:0000000" );
is( $si2[ 1 ], "GO:1234567" );

is( $obj->get_secondary_GO_ids(), 0 );
is( $obj->remove_secondary_GO_ids(), 0 );

is( $obj->identifier( "0003947" ), "0003947" );
is( $obj->identifier(), "0003947" );

is( $obj->name( "N-acetylgalactosaminyltransferase" ), "N-acetylgalactosaminyltransferase" );
is( $obj->name(), 
"N-acetylgalactosaminyltransferase" ); is( $obj->definition( "Catalysis of ..." ), "Catalysis of ..." ); is( $obj->definition(), "Catalysis of ..." ); is( $obj->version( "666" ), "666" ); is( $obj->version(), "666" ); ok( $obj->ontology( "category 1 name" ) ); is( $obj->ontology()->name(), "category 1 name" ); my $ont = Bio::Ontology::Ontology->new(); ok( $ont->name( "category 2 name" ) ); ok( $obj->ontology( $ont ) ); is( $obj->ontology()->name(), "category 2 name" ); is( $obj->is_obsolete( 1 ), 1 ); is( $obj->is_obsolete(), 1 ); is( $obj->comment( "Consider the term ..." ), "Consider the term ..." ); is( $obj->comment(), "Consider the term ..." ); is( $obj->get_synonyms(), 0 ); $obj->add_synonym( ( "AA", "AB" ) ); my @al1 = $obj->get_synonyms(); is( scalar(@al1), 2 ); is( $al1[ 0 ], "AA" ); is( $al1[ 1 ], "AB" ); my @al2 = $obj->remove_synonyms(); is( $al2[ 0 ], "AA" ); is( $al2[ 1 ], "AB" ); is( $obj->get_synonyms(), 0 ); is( $obj->remove_synonyms(), 0 ); $obj->add_synonym( ( "AA", "AB" ) ); $obj->add_dbxref( -dbxrefs => \@dblinks ); $obj->add_secondary_GO_id( ( "GO:1234567", "GO:1234567" ) ); $obj->init(); is( $obj->identifier(), undef ); # don't make up identifiers is( $obj->name(), undef ); is( $obj->definition(), undef ); is( $obj->is_obsolete(), 0 ); is( $obj->comment(), undef ); $obj = Bio::Ontology::GOterm->new( -go_id => "0016847", -name => "1-aminocyclopropane-1-carboxylate synthase", -definition => "Catalysis of ...", -is_obsolete => 0, -version => "6.6.6", -ontology => "cat", -comment => "X" ); is( $obj->identifier(), "GO:0016847" ); is( $obj->name(), "1-aminocyclopropane-1-carboxylate synthase" ); is( $obj->definition(), "Catalysis of ..." 
);
is( $obj->is_obsolete(), 0 );
is( $obj->comment(), "X" );
is( $obj->version(), "6.6.6" );
is( $obj->ontology()->name(), "cat" );
obo.t100644000766000024 2114614006427353 16574 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Ontology/IO
# -*-Perl-*- Test Harness script for Bioperl
# $Id$

use strict;

BEGIN {
    use Bio::Root::Test;

    test_begin(-tests => 92,
               -requires_module => 'Graph');

    use_ok('Bio::OntologyIO');
    use_ok('Bio::Ontology::RelationshipType');
}

my $IS_A = Bio::Ontology::RelationshipType->get_instance( "IS_A" );
my $PART_OF = Bio::Ontology::RelationshipType->get_instance( "PART_OF" );

my $parser = Bio::OntologyIO->new(
    -format => "obo",
    -file => test_input_file('so.obo'));

my $ont = $parser->next_ontology();
ok ($ont);
is ($ont->name(), "sequence");

my @roots = $ont->get_root_terms();
is (scalar(@roots), 1);
is ($roots[0]->name(), "Sequence_Ontology");
is ($roots[0]->identifier(), "SO:0000000");

my @terms = sort {$a->name cmp $b->name} $ont->get_child_terms($roots[0]);
is (scalar(@terms), 5);
my ($term) = grep { $_->name() eq "variation_operation"; } @terms;
ok $term;
($term) = grep { $_->name() eq "sequence_attribute"; } @terms;
ok $term;
($term) = grep { $_->name() eq "consequences_of_mutation"; } @terms;
ok $term;
($term) = grep { $_->name() eq "chromosome_variation"; } @terms;
ok $term;
($term) = grep { $_->name() eq "located_sequence_feature"; } @terms;
ok $term;

@terms = sort {$a->name cmp $b->name} $ont->get_child_terms($terms[4]);
is (scalar(@terms), 5);
($term) = grep { $_->name() eq "translocate"; } @terms;
ok $term;
($term) = grep { $_->name() eq "delete"; } @terms;
ok $term;
($term) = grep { $_->name() eq "insert"; } @terms;
ok $term;
($term) = grep { $_->name() eq "substitute"; } @terms;
ok $term;
($term) = grep { $_->name() eq "invert"; } @terms;
ok $term;

my $featterm = $terms[0];
@terms = sort {$a->name cmp $b->name} $ont->get_child_terms($featterm);
is (scalar(@terms), 2);

# substitution has two parents, see whether this is handled
@terms 
= $ont->find_terms(-name => "substitution"); $term = $terms[0]; is ($term->name(), "substitution"); # search using obo terms; @terms = $ont->find_identically_named_terms($term); is scalar @terms, 1; @terms = $ont->find_identical_terms($term); is scalar @terms, 1; @terms = $ont->find_similar_terms($term); is scalar @terms, 7; @terms = $ont->get_ancestor_terms($term); is (scalar(@terms), 6); is (scalar(grep { $_->name() eq "region"; } @terms), 1); is (scalar(grep { $_->name() eq "sequence_variant"; } @terms), 1); # processed_transcript has part-of and is-a children @terms = $ont->find_terms(-name => "processed_transcript");; $term = $terms[0]; @terms = $ont->get_child_terms($term); is (scalar(@terms), 5); @terms = $ont->get_child_terms($term, $PART_OF); is (scalar(@terms), 2); @terms = $ont->get_child_terms($term, $IS_A); is (scalar(@terms), 3); @terms = $ont->get_child_terms($term, $PART_OF, $IS_A); is (scalar(@terms), 5); # TF_binding_site has 2 parents and different relationships in the two # paths up (although the relationships to its two parents are of the # same type, namely is-a) @terms = $ont->find_terms(-name => "TF_binding_site");; $term = $terms[0]; @terms = $ont->get_parent_terms($term); is (scalar(@terms), 2); my ($pterm) = grep { $_->name eq "regulatory_region"; } @terms; ok $pterm; @terms = $ont->get_parent_terms($term, $PART_OF); is (scalar(@terms), 0); @terms = $ont->get_parent_terms($term, $IS_A); is (scalar(@terms), 2); @terms = $ont->get_parent_terms($term, $PART_OF, $IS_A); is (scalar(@terms), 2); # pull out all relationships my @rels = $ont->get_relationships(); my @relset = grep { $_->object_term->name eq "Sequence_Ontology"; } @rels; is (scalar(@relset), 5); @relset = grep { $_->subject_term->name eq "Sequence_Ontology"; } @rels; is (scalar(@relset), 0); # relationships for a specific term only ($term) = $ont->find_terms(-identifier => "SO:0000082"); ok ($term); is ($term->identifier, "SO:0000082"); is ($term->name, 
"processed_transcript_attribute"); @rels = $ont->get_relationships($term); is (scalar(@rels), 5); @relset = grep { $_->predicate_term->name eq "IS_A"; } @rels; is (scalar(@relset), 5); @relset = grep { $_->object_term->identifier eq "SO:0000082"; } @rels; is (scalar(@relset), 4); @relset = grep { $_->subject_term->identifier eq "SO:0000082"; } @rels; is (scalar(@relset), 1); #### --- testing obo parsers for regulates relationships my $parser2 = Bio::OntologyIO->new ( -format => 'obo', -file => test_input_file('regulation_test.obo')); isa_ok($parser2,'Bio::OntologyIO', 'got a ontology IO handler'); my @ontologies; while (my $ont = $parser2->next_ontology()) { isa_ok($ont,'Bio::Ontology::Ontology','got ontology parser2'); isa_ok($ont->engine,'Bio::Ontology::OBOEngine','got OBO engine object'); push @ontologies,$ont; } my $molont = $ontologies[1]; my $bioont = $ontologies[2]; is($ontologies[0]->name(),'gene_ontology','Gene ontology'); is($bioont->name(),'biological_process','biological process'); is($molont->name(),'molecular_function','molecular function'); my ($broot) = $bioont->get_root_terms(); is($broot->name(),'biological_process','Got root'); my ($mroot) = $molont->get_root_terms(); is($mroot->name(),'molecular_function','Got root'); ## -- testing newly introduced relationships is($ontologies[0]->get_relationship_type('REGULATES')->name,'REGULATES','Got regulates from gene_ontology'); is($ontologies[0]->get_relationship_type('POSITIVELY_REGULATES')->name,'POSITIVELY_REGULATES','Got positively regulates from gene_ontology'); is($bioont->get_relationship_type('REGULATES')->name,'REGULATES','Got regulates from biological_process'); is($bioont->get_relationship_type('POSITIVELY_REGULATES')->name,'POSITIVELY_REGULATES','Got positively regulates from biological_process'); ## -- getting relationships for various ontologies my @onto_pred = sort {$a->name cmp $b->name} $ontologies[0]->get_predicate_terms(); my @bio_pred = sort {$a->name cmp $b->name} 
$bioont->get_predicate_terms(); is(scalar @onto_pred,6,'Got predicates for gene_ontology'); is(scalar @bio_pred,2,'Got predicates for biological_process'); is($onto_pred[4]->name(),'REGULATES','Got regulates predicate'); is($bio_pred[0]->name(),'POSITIVELY_REGULATES','Got positively regulates predicate'); my @bio_rel = $bioont->get_relationships(); my @mol_rel = $molont->get_relationships(); is(scalar @bio_rel,11,'Got relationships for biological_process'); is(scalar @mol_rel,2,'Got relationships for molecular_function'); is($mol_rel[0]->predicate_term->name(),'IS_A','Got is a relationship from molecular_function'); ## ---- ## -- testing the regulates relationships between term1s my $REG = Bio::Ontology::RelationshipType->get_instance('REGULATES'); my $PREG = Bio::Ontology::RelationshipType->get_instance('POSITIVELY_REGULATES'); my ($term1) = $bioont->find_terms(-identifier => 'GO:0050790'); isa_ok($term1,'Bio::Ontology::Term','Got term object'); is($term1->identifier(),'GO:0050790', 'Got term id'); is($term1->name(),'regulation of catalytic activity', 'Got term name'); my ($parent) = $bioont->get_parent_terms($term1,$REG); isa_ok($parent,'Bio::Ontology::Term','Got regulated object'); is($parent->identifier(),'GO:0003824','Got regulated term1 id'); ## -- now testing the other way my ($child) = $bioont->get_child_terms($parent,$REG); isa_ok($child,'Bio::Ontology::Term','Got term1 object'); is($child->identifier(),$term1->identifier(),'Got back the child'); my ($term2) = $bioont->find_terms(-identifier => 'GO:0043085'); isa_ok($term2,'Bio::Ontology::Term','Got term object'); is($term2->identifier(),'GO:0043085', 'Got term id'); is($term2->name(),'positive regulation of catalytic activity', 'Got term name'); my ($parent2) = $bioont->get_parent_terms($term2,$PREG); isa_ok($parent2,'Bio::Ontology::Term','Got regulated object'); is($parent2->identifier(),'GO:0003824','Got regulated term1 id'); is($parent->name(),$parent2->name(),'Got identical regulation'); my ($child2) 
= $bioont->get_child_terms($parent2,$PREG); isa_ok($child2,'Bio::Ontology::Term','Got term1 object'); is($child2->identifier(),$term2->identifier(),'Got back the child'); #### --- testing obo parsers for secondary identifiers my $parser3 = Bio::OntologyIO->new ( -format => 'obo', -file => test_input_file('sp_subset.obo')); isa_ok($parser3,'Bio::OntologyIO', 'got a ontology IO handler'); my $sp_ont = $parser3->next_ontology(); ok ($sp_ont); is ($sp_ont->name(), "solanaceae_phenotype"); #the term 'plant size has 4 secondary identifiers @terms = $sp_ont->find_terms(-name => "plant size"); $term = $terms[0]; is ($term->name(), "plant size"); my @xrefs = $term->get_secondary_ids(); is(scalar(@xrefs) , 4); my ($xref) = grep { $_ eq "PATO:0000117"; } @xrefs; ok $xref; ($xref) = grep { $_ eq "PO:0000003"; } @xrefs; ok $xref; ($xref) = grep { $_ eq "PO:0007130"; } @xrefs; ok $xref; ($xref) = grep { $_ eq "TO:0000207"; } @xrefs; ok $xref; AlignIO.t100644000766000024 456614006427353 16427 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 27, -requires_module => 'Data::Stag'); use_ok('Bio::AlignIO'); } my $DEBUG = test_debug(); my ($str,$aln,$strout,$status); # general filehandle tests # not all parsers support output (noted as 0) my %files = ( # file format I O 'testaln.phylip' => ['phylip', 1, 1], 'testaln.psi' => ['psi', 1, 1], 'testaln.arp' => ['arp', 1, 0], 'testaln.pfam' => ['pfam', 1, 1], 'testaln.msf' => ['msf', 1, 1], 'testaln.fasta' => ['fasta', 1, 1], 'testaln.selex' => ['selex', 1, 1], 'testaln.mase' => ['mase', 1, 0], 'testaln.prodom' => ['prodom', 1, 0], 'testaln.clustalw' => ['clustalw', 1, 1], 'testaln.metafasta' => ['metafasta', 1, 1], 'testaln.nexus' => ['nexus', 1, 1], 'testaln.po' => ['po', 1, 1], 'testaln.xmfa' => ['xmfa', 1, 1], ); # input file handles $aln = Bio::AlignIO->new( -file => test_input_file('longnames.aln'), 
-format=>'clustalw', )->next_aln(); isa_ok($aln, 'Bio::AnnotatableI'); while (my ($file, $fdata) = each %files) { my ($format, $in, $out) = @{$fdata}; if ($in) { my $fhin = Bio::AlignIO->newFh( '-file' => test_input_file($file), '-format' => $format); my $fhout = Bio::AlignIO->newFh( '-file' => ">".test_output_file(), '-format' => 'clustalw'); while ( $aln = <$fhin>) { cmp_ok($aln->num_sequences, '>=', 2, "input filehandle method test : $format"); last; } } } # output file handles while (my ($file, $fdata) = each %files) { my ($format, $in, $out) = @{$fdata}; if ($out) { my $status = 0; my $fhin = Bio::AlignIO->newFh( '-file' => test_input_file('testaln.clustalw'), '-format' => 'clustalw'); my $fhout = Bio::AlignIO->newFh( '-file' => '>'.test_output_file(), '-format' => $format); while ( $aln = <$fhin> ) { $status = print $fhout $aln; last; } is $status, 1, "filehandle output test : $format"; } } Tiling.t100644000766000024 3334114006427353 16557 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO#-*-perl-*- #$Id$ use strict; use warnings; use vars qw($EXHAUSTIVE $VERBOSE); BEGIN { use Bio::Root::Test; $EXHAUSTIVE = $ENV{BIOPERL_TILING_EXHAUSTIVE_TESTS}; $VERBOSE = $ENV{BIOPERL_TILING_VERBOSE_TESTS}; test_begin(-tests => ($EXHAUSTIVE ? 
6519 : 1141) ); } use_ok('Bio::Search::Tiling::MapTiling'); use_ok('Bio::Search::Tiling::MapTileUtils'); use_ok('Bio::SearchIO'); use_ok('Bio::Search::Hit::BlastHit'); use_ok('File::Spec'); my ($blio, $result, $hit, $tiling, $hsp); my @normal_formats = qw( blast wublast blastn wublastn blastp wublastp multiblast megablast rpsblast psiblast ); my @xltd_formats = qw( blastx wublastx tblastn wutblastn tblastx wutblastx ); # an exhaustive listing of search reports in # t/data my %test_files = ( 'blast' => [qw( ecolitst.bls frac_problems.blast frac_problems2.blast frac_problems3.blast bl2seq.out )], 'multiblast' => [qw( multi_blast.bls )], 'blastn' => [qw( a_thaliana.blastn bl2seq.blastn new_blastn.txt hsinsulin.blastcl3.blastn )], 'wublastn' =>[qw( brassica_ATH.WUBLASTN echofilter.wublastn )], 'blastp' => [qw( blastp2215.blast no_hsps.blastp catalase-webblast.BLASTP )], 'wublastp' => [qw( dcr1_sp.WUBLASTP ecolitst.wublastp contig-by-hand.wublastp ecolitst.noseqs.wublastp )], 'blastx' => [qw( bl2seq.blastx.out )], 'wublastx' => [qw( dnaEbsub_ecoli.wublastx )], 'wublast' => [qw( tricky.wublast )], 'tblastn' => [qw( tblastn.out 1ZZ19XR301R-Alignment.tblastn )], 'wutblastn' => [qw( dnaEbsub_ecoli.wutblastn )], 'tblastx' => [qw( bl2seq.tblastx.out HUMBETGLOA.tblastx )], 'wutblastx' => [qw( dnaEbsub_ecoli.wutblastx )], 'megablast' => [qw( 503384.MEGABLAST.2 )], 'rpsblast' => [qw( ecoli_domains.rpsblast )], 'psiblast' => [qw( psiblastreport.out )], 'bug2942' => [qw( bug2942.blastx )] ); # a subset of search reports for # run-o-the-mill regression tests my %example_files = ( 'blast' => [qw( ecolitst.bls )], 'blastn' => [qw( a_thaliana.blastn )], 'wublastn' =>[qw( brassica_ATH.WUBLASTN )], 'blastp' => [qw( no_hsps.blastp catalase-webblast.BLASTP )], 'wublastp' => [qw( dcr1_sp.WUBLASTP )], 'blastx' => [qw( bl2seq.blastx.out )], 'wublastx' => [qw( dnaEbsub_ecoli.wublastx )], 'wublast' => [qw( tricky.wublast )], 'tblastn' => [qw( tblastn.out )], 'wutblastn' => [qw( 
dnaEbsub_ecoli.wutblastn )], 'tblastx' => [qw( HUMBETGLOA.tblastx )], 'wutblastx' => [qw( dnaEbsub_ecoli.wutblastx )], 'megablast' => [qw( 503384.MEGABLAST.2 )] ); ok( $blio = Bio::SearchIO->new( -file => test_input_file('dcr1_sp.WUBLASTP'), -format => 'blast', ), 'parse data file'); $result = $blio->next_result; while ( $_ = $result->next_hit ) { last if $_->name =~ /ASPTN/; } ok($hit = $_, 'got test hit'); ok($tiling = Bio::Search::Tiling::MapTiling->new($hit), 'create tiling'); # TilingI compliance isa_ok($tiling, 'Bio::Search::Tiling::TilingI'); foreach ( qw( next_tiling rewind_tilings identities conserved length ) ) { ok( $tiling->$_, "implements '$_'" ); } # regression test on original calculations my @orig_id_results = ( 387,388,388,381,382,389 ); my @orig_cn_results = ( 622,619,628,608,611,613 ); my @id_results = ( $tiling->identities('query', 'exact'), $tiling->identities('query', 'est'), $tiling->identities('query', 'max'), $tiling->identities('subject', 'exact'), $tiling->identities('subject', 'est'), $tiling->identities('subject', 'max') ); my @cn_results = ( $tiling->conserved('query', 'exact'), $tiling->conserved('query', 'est'), $tiling->conserved('query', 'max'), $tiling->conserved('subject', 'exact'), $tiling->conserved('subject', 'est'), $tiling->conserved('subject', 'max') ); map { $_ = int($_) } @id_results, @cn_results; is_deeply(\@id_results, \@orig_id_results, 'identities regression test'); is_deeply(\@cn_results, \@orig_cn_results, 'conserved regression test'); # tiling iterator regression tests my ($qn, $sn)=(0,0); while ($tiling->next_tiling('query')) {$qn++}; while ($tiling->next_tiling('subject')) {$sn++}; is ($qn, 8, 'tiling iterator regression test(1)'); is ($sn, 128, 'tiling iterator regression test(2)'); $tiling->rewind('subject'); while ($tiling->next_tiling('subject')) {$sn++}; is ($sn, 256, 'tiling iterator regression test(3, rewind)'); diag("Old blast.t tiling tests") if $VERBOSE; ok($blio = Bio::SearchIO->new( '-format' => 
'blast', '-file' => test_input_file('ecolitst.wublastp') ), "ecolitst.wublastp"); $result = $blio->next_result; $result->next_hit; $hit = $result->next_hit; $tiling = Bio::Search::Tiling::MapTiling->new($hit); # Test HSP contig data returned by SearchUtils::tile_hsps() # Second hit has two hsps that overlap. # compare with the contig made by hand for these two contigs # in t/data/contig-by-hand.wublastp # (in this made-up file, the hsps from ecolitst.wublastp # were aligned and contiged, and Length, Identities, Positives # were counted, by a human (maj) ) my $hand_hit = Bio::SearchIO->new( -format=>'blast', -file=>test_input_file('contig-by-hand.wublastp') )->next_result->next_hit; my $hand_hsp = $hand_hit->next_hsp; my @hand_qrng = $hand_hsp->range('query'); my @hand_srng = $hand_hsp->range('hit'); my @hand_matches = $hand_hit->matches; is(($tiling->range('query'))[0], $hand_qrng[0]); is(($tiling->range('query'))[1], $hand_qrng[1]); is(sprintf("%d",$tiling->identities('query')), $hand_matches[0]); is(sprintf("%d",$tiling->conserved('query')), $hand_matches[1]); is(($tiling->range('hit'))[0], $hand_srng[0]); is(($tiling->range('hit'))[1], $hand_srng[1]); is(sprintf("%d",$tiling->identities('hit')), $hand_matches[0]); is(sprintf("%d",$tiling->conserved('hit')), $hand_matches[1]); ok( $blio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('dnaEbsub_ecoli.wublastx') ), "dnaEbsub_ecoli.wublastx"); $hit = $blio->next_result->next_hit; $tiling = Bio::Search::Tiling::MapTiling->new($hit); is(sprintf("%.3f",$tiling->frac_identical(-type=>'query',-denom=>'aligned',-context=>'p2')), '0.364'); is(sprintf("%.3f",$tiling->frac_identical(-type=>'hit',-denom=>'aligned',-context=>'all')), '0.366'); is(sprintf("%.3f",$tiling->frac_conserved(-type=>'query',-denom=>'aligned',-context=>'p2')), '0.537'); is(sprintf("%.3f",$tiling->frac_conserved(-type=>'hit',-denom=>'aligned',-context=>'all')), '0.540'); 
is(sprintf("%.2f",$tiling->frac_aligned_query(-context=>'p2')), '0.62'); is(sprintf("%.2f",$tiling->frac_aligned_hit(-context=>'all')), '0.71'); ok( $blio = Bio::SearchIO->new( '-format' => 'blast', '-file' => test_input_file('tricky.wublast') ), "tricky.wublast"); $hit = $blio->next_result->next_hit; $tiling = Bio::Search::Tiling::MapTiling->new($hit); cmp_ok sprintf("%.3f",$tiling->frac_identical(-denom => 'aligned')), '>', 0.2, 'tricky.wublast(1)'; cmp_ok sprintf("%.3f",$tiling->frac_conserved(-denom => 'aligned')), '<=', 1, 'tricky.wublast(2)'; is(sprintf("%.2f",$tiling->frac_aligned_query), '0.92', 'tricky.wublast(3)'); is(sprintf("%.2f",$tiling->frac_aligned_hit), '0.91','tricky.wublast(4)'); diag("New tiling tests") if $VERBOSE; # select test file set based on the environment variable # BIOPERL_TILING_EXHAUSTIVE_TESTS my $files = ($EXHAUSTIVE ? \%test_files : \%example_files); foreach my $alg (@normal_formats, @xltd_formats) { diag("*******$alg files*******") if ($files->{$alg} && $VERBOSE); foreach my $tf (@{$files->{$alg}}) { ok( $blio = Bio::SearchIO->new( -format=>'blast', -file=>test_input_file($tf) ), "$tf" ); $result = $blio->next_result; my $hit_count = 0; # compare the per-aligned-base identity avg over hsps # with frac_identical (bzw, conserved) HIT: while ( $hit = $result->next_hit ) { ++$hit_count; # quiet the "No HSPs" warning with -verbose => -1 ok( $tiling = Bio::Search::Tiling::MapTiling->new(-hit=>$hit,-verbose=>-1), "tile $tf hit $hit_count #hsps ".scalar $tiling->hsps ); my @hsps = $tiling->hsps; unless (@hsps) { diag( "--no hsps for $tf hit $hit_count") if $VERBOSE; next HIT; } my ($dpct, $est, $fast,$exact, $max); my $tol = 0.10; # % difference accepted as approx. 
equal ## loop through contexts: for my $type (qw( query hit )) { for my $context ($tiling->contexts($type)) { diag(" --- $type $context ---") if $VERBOSE; if (scalar($tiling->contexts($type, $context)) == 1) { # equality ($dpct, $est, $fast) = $tiling->cmp_frac($type,'identical','aligned', 'est', 'fast', $context); is( $est,$fast, substr($type,0,1)." id: est ($est) = fast ($fast)"); ($dpct, $est, $fast) = $tiling->cmp_frac($type,'conserved','aligned', 'est', 'fast', $context); is( $est,$fast, substr($type,0,1)." cn: est ($est) = fast ($fast)"); } else { # comparisons ($dpct, $est, $fast) = $tiling->cmp_frac($type,'identical','aligned', 'est', 'fast', $context); # cmp_ok( $dpct, "<", $tol, # substr($type,0,1)." id: est ($est) ~ fast ($fast)"); ($dpct, $exact, $max) = $tiling->cmp_frac($type,'identical','aligned', 'exact', 'max', $context); cmp_ok( abs($exact-$est)/$exact, "<" , $tol, substr($type,0,1)." id: exact ($exact) ~ est ($est)"); cmp_ok( $exact, "<=" , $max, substr($type,0,1)." id: exact ($exact) <= max ($max)"); ($dpct, $est, $fast) = $tiling->cmp_frac($type,'conserved','aligned', 'est', 'fast', $context); # cmp_ok( $dpct, "<", $tol, # substr($type,0,1)." cn: est ($est) ~ fast ($fast)"); ($dpct, $exact, $max) = $tiling->cmp_frac($type,'conserved','aligned', 'exact', 'max', $context); cmp_ok( abs($exact-$est)/$exact, "<" , $tol, substr($type,0,1)." cn: exact ($exact) ~ est ($est)"); cmp_ok( $exact, "<=" , $max, substr($type,0,1)." 
cn: exact ($exact) <= max ($max)"); } } } } } } # bug 2942 my %expected_ranges = ( 'm0' => [7, 11037], #query 'm1' => [1770, 10865], #query 'm2' => [2462, 14599], #query 'all' => [231, 3563] #subject ); $blio = Bio::SearchIO->new( -file=>test_input_file( $test_files{'bug2942'}->[0] ), -format => 'blast' ); $hit = $blio->next_result->next_hit; $tiling = Bio::Search::Tiling::MapTiling->new($hit); for ( 'm0', 'm1', 'm2' ) { is_deeply( [$tiling->range('query',$_)], $expected_ranges{$_}, "bug2942: query $_: range correct"); } is_deeply( [$tiling->range('subject', 'all')], $expected_ranges{'all'}, "bug2942: subject all : range correct" ); # test get_tiled_alns $blio = Bio::SearchIO->new( -file=>test_input_file( 'dcr1_sp.WUBLASTP' ) ); $result = $blio->next_result; while ($hit = $result->next_hit) { last if $hit->name =~ /ASPTN/; } $tiling = Bio::Search::Tiling::MapTiling->new($hit); ok my @alns = $tiling->get_tiled_alns, "get_tiled_alns"; is scalar @alns, 6, "got all alns"; for my $aln ( @alns ) { my (@aint, @qint, @sint); my $qs = $aln->get_seq_by_id('query'); my $ss = $aln->get_seq_by_id('subject'); ok my @qfeats = $qs->get_SeqFeatures; foreach (@qfeats) { push @aint, [$_->start, $_->end]; push @qint, [($_->get_tag_values('query_start'))[0], ($_->get_tag_values('query_end'))[0] ]; } is( eval(join('+', map {$$_[1]-$$_[0]+1} @aint)), eval(join('+', map {$$_[1]-$$_[0]+1} @qint)), "aln and qfeat lengths correspond" ); is( $qs->length - $qs->num_gaps('-'), eval(join('+', map {$$_[1]-$$_[0]+1} @qint)), "q length correct"); ok my @hfeats = $ss->get_SeqFeatures; @aint = (); ok ( @qfeats == @hfeats, "features on q and s correspond"); foreach (@hfeats) { push @aint, [$_->start, $_->end]; push @sint, [($_->get_tag_values('subject_start'))[0], ($_->get_tag_values('subject_end'))[0] ]; } is( eval(join('+', map {$$_[1]-$$_[0]+1} @aint)), eval(join('+', map {$$_[1]-$$_[0]+1} @sint)), "aln and hfeat lengths correspond" ); is( $ss->length - $ss->num_gaps('-'), eval(join('+', map 
{$$_[1]-$$_[0]+1} @sint)), "s length correct"); } 1; package Bio::Search::Tiling::MapTiling; sub cmp_frac { my ($tiling, $type, $method, $denom, @actions) = @_; my ($a, $b); my $context = ($actions[2] ? $actions[2] : 'all'); $a = $tiling->frac(-type=>$type, -method=>$method, -denom=>$denom, -action=>$actions[0], -context=>$context); $b = $tiling->frac(-type=>$type, -method=>$method, -denom=>$denom, -action=>$actions[1], -context=>$context); return ( abs($a-$b)/$a, f(5,$a), f(5,$b) ); } sub f { my ($d,$val) = @_; sprintf("%.${d}f",$val) } 1; SeqFeature000755000766000024 014006427353 15367 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tGene.t100644000766000024 635514006427353 16603 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqFeature# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 28); use_ok('Bio::SeqIO'); use_ok('Bio::SeqFeature::Gene::Transcript'); use_ok('Bio::SeqFeature::Gene::UTR'); use_ok('Bio::SeqFeature::Gene::Exon'); use_ok('Bio::SeqFeature::Gene::Poly_A_site'); use_ok('Bio::SeqFeature::Gene::GeneStructure'); use_ok('Bio::Location::Fuzzy'); } my ( $seqio, $geneseq, $gene, $transcript, $poly_A_site1, $poly_A_site2, $fiveprimeUTR, $exon); # tests for Bio::SeqFeature::Gene::* objects # using information from acc: AB077698 as a guide ok $seqio = Bio::SeqIO->new( -format => 'genbank', -file => test_input_file('AB077698.gb'), ); ok $geneseq = $seqio->next_seq(); ok $gene = Bio::SeqFeature::Gene::GeneStructure->new( -primary => 'gene', -start => 1, -end => 2701, -strand => 1, ); ok $transcript = Bio::SeqFeature::Gene::Transcript->new( -primary => 'CDS', -start => 80, -end => 1144, -tag => { 'gene' => "CHCR", 'note' => "Cys3His CCG1-Required Encoded on BAC clone RP5-842K24 (AL050310) The human CHCR (Cys3His CCG1-Required) protein is highly related to EXP/MBNL (Y13829, NM_021038, AF401998) and MBLL (NM_005757,AF061261), which together comprise the human Muscleblind family", 'codon_start' => 1, 
'protein_id' => 'BAB85648.1', } ); ok $poly_A_site1 = Bio::SeqFeature::Gene::Poly_A_site->new( -primary => 'polyA_site', -start => 2660, -end => 2660, -tag => { 'note' => "Encoded on BAC clone RP5-842K24 (AL050310); PolyA_site#2 used by CHCR EST clone DKFZp434G2222 (AL133625)" } ); ok $poly_A_site2 = Bio::SeqFeature::Gene::Poly_A_site->new( -primary => 'polyA_site', -start => 1606, -end => 1606, -tag => { 'note' => "Encoded on BAC clone RP5-842K24 (AL050310); PolyA_site#1 used by CHCR EST clone PLACE1010202 (AK002178)", } ); ok $fiveprimeUTR = Bio::SeqFeature::Gene::UTR->new(-primary => "utr5prime"); ok $fiveprimeUTR->location( Bio::Location::Fuzzy->new( -start => "<1", -end => 79, ) ); ok my $threeprimeUTR = Bio::SeqFeature::Gene::UTR->new( -primary => "utr3prime", -start => 1145, -end => 2659, ); # Did a quick est2genome against genomic DNA (this is on Chr X) to # get the gene structure by hand since it is not in the file # --Jason ok $exon = Bio::SeqFeature::Gene::Exon->new( -primary => 'exon', -start => 80, -end => 177, ); ok $geneseq->add_SeqFeature($exon); ok $geneseq->add_SeqFeature($fiveprimeUTR); ok $geneseq->add_SeqFeature($threeprimeUTR); ok $geneseq->add_SeqFeature($poly_A_site1); ok $geneseq->add_SeqFeature($poly_A_site2); ok $transcript->add_utr($fiveprimeUTR, 'utr5prime'); ok $transcript->add_utr($threeprimeUTR, 'utr3prime'); ok $transcript->add_exon($exon); # API only supports a single poly-A site per transcript at this point $transcript->poly_A_site($poly_A_site2); $geneseq->add_SeqFeature($transcript); $gene->add_transcript($transcript); $geneseq->add_SeqFeature($gene); my ($t) = $gene->transcripts(); # get 1st transcript ok(defined $t); is($t->mrna->length, 1693, 'mRNA spliced length'); is($gene->utrs, 2, 'has 2 UTRs'); Align000755000766000024 014006427353 14355 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/tUtilities.t100644000766000024 325014006427353 16655 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Align# -*-Perl-*- Test Harness script 
for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 14); use_ok('Bio::Align::Utilities', qw(:all)); use_ok('Bio::SimpleAlign'); use_ok('Bio::PrimarySeq'); use_ok('Bio::LocatableSeq'); use_ok('Bio::AlignIO'); } my $DEBUG = test_debug(); my $aa_align = Bio::SimpleAlign->new(); $aa_align->add_seq(Bio::LocatableSeq->new(-id => "n1", -seq => "MLIDVG-MLVLR")); $aa_align->add_seq(Bio::LocatableSeq->new(-id => "n2", -seq => "MLIDVRTPLALR")); $aa_align->add_seq(Bio::LocatableSeq->new(-id => "n3", -seq => "MLI-VR-SLALR")); my %dnaseqs = (); $dnaseqs{'n1'} = Bio::PrimarySeq->new(-id => "n1dna", -seq => 'atgctgatagacgtaggcatgctagtactgaga'); $dnaseqs{'n2'} = Bio::PrimarySeq->new(-id => "n2dna", -seq => 'atgctgatcgacgtacgcaccccgctagcactcaga'); $dnaseqs{'n3'} = Bio::PrimarySeq->new(-id => "n3dna", -seq => 'atgttgattgtacgctcgcttgcacttaga'); my $dna_aln; ok( $dna_aln = &aa_to_dna_aln($aa_align, \%dnaseqs)); if( $DEBUG ) { Bio::AlignIO->new(-format=>'clustalw')->write_aln($dna_aln); } is $dna_aln->length, 36; is $dna_aln->num_residues, 99; is $dna_aln->num_sequences, 3; is $dna_aln->consensus_string(50), "atgctgat?gacgtacgc????cgctagcact?aga"; my @dnaseqs = $dna_aln->each_seq; is $dnaseqs[0]->display_id, 'n1dna'; $dna_aln->verbose(-1); my $replicates; ok $replicates = &bootstrap_replicates($dna_aln,3); is scalar @$replicates, 3; my $repl_aln = pop @$replicates; is $repl_aln->num_sequences, 3; ##use IO::String; ##use Bio::AlignIO; ##my $string; ##my $out = IO::String->new($string); ## ##my $strout = Bio::AlignIO->new(-fh => $out,'-format' => 'pfam'); ##$strout->write_aln($repl_aln); ##is $string, ""; TreeBuild.t100644000766000024 313714006427353 16565 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Align# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 13); for my $mod ( qw(Bio::Align::DNAStatistics Bio::Align::ProteinStatistics Bio::Align::Utilities Bio::AlignIO Bio::Tree::DistanceFactory 
Bio::TreeIO) ) { use_ok($mod); } } my $debug = test_debug(); use Bio::Align::Utilities qw(:all); my $in = Bio::AlignIO->new(-format => 'clustalw', -file => test_input_file('pep-266.aln')); my $aln = $in->next_aln(); isa_ok($aln, 'Bio::SimpleAlign','SimpleAlign object parsed out'); my $pstats = Bio::Align::ProteinStatistics->new(-verbose => $debug); my $matrix = $pstats->distance(-method => 'Kimura', -align => $aln); isa_ok($matrix,'Bio::Matrix::MatrixI','Protein distance matrix retrieved'); my $treebuilder = Bio::Tree::DistanceFactory->new(-method => 'NJ'); my $tree = $treebuilder->make_tree($matrix); isa_ok($tree,'Bio::Tree::TreeI', 'Tree object gotten back'); my ($cn) = $tree->find_node('183.m01790'); # brlens checked against tree generated by PHYLIP NJ is($tree->find_node('AN2438.1')->branch_length, '0.28221','NJ calculated Branch length'); is($tree->find_node('FG05298.1')->branch_length, '0.20593','NJ calculated Branch length'); # simple topology test - make sure these 2 are sister is($tree->find_node('YOR262W')->ancestor->id, $tree->find_node('Smik_Contig1103.1')->ancestor->id, 'Make sure two nodes are sister'); # TODO? 
# UPGMA tests # test the bootstrap my $replicates = &bootstrap_replicates($aln,10); is(scalar @$replicates, 10,'10 replicates formulated'); AlignUtil.t100644000766000024 313114006427353 16570 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Align# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { use Bio::Root::Test; test_begin( -tests => 47 ); use_ok( 'Bio::Align::Utilities', qw( aa_to_dna_aln bootstrap_replicates cat dna_to_aa_aln ) ); use_ok('Bio::AlignIO'); use_ok('Bio::SeqIO'); } my $in = Bio::AlignIO->new( -format => 'clustalw', -file => test_input_file('pep-266.aln') ); my $pep_aln = $in->next_aln(); isa_ok( $pep_aln, 'Bio::Align::AlignI' ); $in->close(); # aa_to_dna_aln my $seqin = Bio::SeqIO->new( -format => 'fasta', -file => test_input_file('cds-266.fas') ); my %dna_seq; while ( my $seq = $seqin->next_seq ) { $dna_seq{ $seq->display_id } = $seq; } my $dna_aln = aa_to_dna_aln( $pep_aln, \%dna_seq ); my @aa_seqs = $pep_aln->each_seq; for my $dna_seq ( $dna_aln->each_seq ) { my $peptrans = $dna_seq->translate(); my $aaseq = shift @aa_seqs; is( $peptrans->seq(), $aaseq->seq() ); } # dna_to_aa_aln my $aa_aln = dna_to_aa_aln($dna_aln); my @pep_seqs = $aa_aln->each_seq; for my $dna_seq ( $dna_aln->each_seq ) { my $peptrans = $dna_seq->translate(); my $aaseq = shift @pep_seqs; is( $peptrans->seq, $aaseq->seq ); } # bootstrap_replicates my $bootstraps = bootstrap_replicates( $pep_aln, 10 ); is( scalar @$bootstraps, 10 ); # cat my $sub_aln1 = $pep_aln->slice( 1, 100 ); my $sub_aln2 = $pep_aln->slice( 101, 200 ); my $sub_aln3 = $pep_aln->slice( 1, 200 ); my $cat_aln = cat( $sub_aln1, $sub_aln2 ); my @seq = $sub_aln3->each_seq; for my $seq ( $cat_aln->each_seq ) { my $refseq = shift @seq; is( $seq->seq, $refseq->seq ); } blosum62.bla100644000766000024 403014006427353 16522 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data# Matrix made by matblas from blosum62.iij # * column uses minimum score # BLOSUM Clustered Scoring Matrix in 1/2 Bit Units # 
Blocks Database = /data/blocks_5.0/blocks.dat # Cluster Percentage: >= 62 # Entropy = 0.6979, Expected = -0.5209 A R N D C Q E G H I L K M F P S T W Y V B Z X * 4 -1 -2 -2 0 -1 -1 0 -2 -1 -1 -1 -1 -2 -1 1 0 -3 -2 0 -2 -1 0 -4 -1 5 0 -2 -3 1 0 -2 0 -3 -2 2 -1 -3 -2 -1 -1 -3 -2 -3 -1 0 -1 -4 -2 0 6 1 -3 0 0 0 1 -3 -3 0 -2 -3 -2 1 0 -4 -2 -3 3 0 -1 -4 -2 -2 1 6 -3 0 2 -1 -1 -3 -4 -1 -3 -3 -1 0 -1 -4 -3 -3 4 1 -1 -4 0 -3 -3 -3 9 -3 -4 -3 -3 -1 -1 -3 -1 -2 -3 -1 -1 -2 -2 -1 -3 -3 -2 -4 -1 1 0 0 -3 5 2 -2 0 -3 -2 1 0 -3 -1 0 -1 -2 -1 -2 0 3 -1 -4 -1 0 0 2 -4 2 5 -2 0 -3 -3 1 -2 -3 -1 0 -1 -3 -2 -2 1 4 -1 -4 0 -2 0 -1 -3 -2 -2 6 -2 -4 -4 -2 -3 -3 -2 0 -2 -2 -3 -3 -1 -2 -1 -4 -2 0 1 -1 -3 0 0 -2 8 -3 -3 -1 -2 -1 -2 -1 -2 -2 2 -3 0 0 -1 -4 -1 -3 -3 -3 -1 -3 -3 -4 -3 4 2 -3 1 0 -3 -2 -1 -3 -1 3 -3 -3 -1 -4 -1 -2 -3 -4 -1 -2 -3 -4 -3 2 4 -2 2 0 -3 -2 -1 -2 -1 1 -4 -3 -1 -4 -1 2 0 -1 -3 1 1 -2 -1 -3 -2 5 -1 -3 -1 0 -1 -3 -2 -2 0 1 -1 -4 -1 -1 -2 -3 -1 0 -2 -3 -2 1 2 -1 5 0 -2 -1 -1 -1 -1 1 -3 -1 -1 -4 -2 -3 -3 -3 -2 -3 -3 -3 -1 0 0 -3 0 6 -4 -2 -2 1 3 -1 -3 -3 -1 -4 -1 -2 -2 -1 -3 -1 -1 -2 -2 -3 -3 -1 -2 -4 7 -1 -1 -4 -3 -2 -2 -1 -2 -4 1 -1 1 0 -1 0 0 0 -1 -2 -2 0 -1 -2 -1 4 1 -3 -2 -2 0 0 0 -4 0 -1 0 -1 -1 -1 -1 -2 -2 -1 -1 -1 -1 -2 -1 1 5 -2 -2 0 -1 -1 0 -4 -3 -3 -4 -4 -2 -2 -3 -2 -2 -3 -2 -3 -1 1 -4 -3 -2 11 2 -3 -4 -3 -2 -4 -2 -2 -2 -3 -2 -1 -2 -3 2 -1 -1 -2 -1 3 -3 -2 -2 2 7 -1 -3 -2 -1 -4 0 -3 -3 -3 -1 -2 -2 -3 -3 3 1 -2 1 -1 -2 -2 0 -3 -1 4 -3 -2 -1 -4 -2 -1 3 4 -3 0 1 -1 0 -3 -4 0 -3 -3 -2 0 -1 -4 -3 -3 4 1 -1 -4 -1 0 0 1 -3 3 4 -2 0 -3 -3 1 -1 -3 -1 0 -1 -3 -2 -2 1 4 -1 -4 0 -1 -1 -1 -2 -1 -1 -1 -1 -1 -1 -1 -1 -1 -2 0 0 -2 -1 -1 -1 -1 -1 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 -4 1 HM138502.gbk100644000766000024 1043714006427353 16075 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS HM138502 1410 bp cRNA linear VRL 22-APR-2010 DEFINITION Influenza A virus (A/California/07/2009(H1N1)) segment 6 neuraminidase (NA) gene, complete 
cds. ACCESSION HM138502 VERSION HM138502.1 GI:295002854 DBLINK BioProject: PRJNA37813 KEYWORDS . SOURCE Influenza A virus (A/California/07/2009(H1N1)) ORGANISM Influenza A virus (A/California/07/2009(H1N1)) Viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Influenzavirus A. REFERENCE 1 (bases 1 to 1410) AUTHORS Starick,E. TITLE Direct Submission JOURNAL Submitted (21-APR-2010) Friedrich-Loeffler-Institut, Suedufer 10, Greifswald-Insel Riems 17493, Germany COMMENT Swine influenza A (H1N1) virus isolated during human swine flu outbreak of 2009. ##GISAID_EpiFlu(TM)Data-START## Isolate :: A/California/07/09 Subtype :: H1N1 Lineage :: swl ##GISAID_EpiFlu(TM)Data-END## FEATURES Location/Qualifiers source 1..1410 /organism="Influenza A virus (A/California/07/2009(H1N1))" /mol_type="viral cRNA" /strain="A/California/07/2009" /serotype="H1N1" /host="Homo sapiens" /db_xref="taxon:641809" /segment="6" /country="USA" /collection_date="09-Apr-2009" /note="lineage: swl" gene 1..1410 /gene="NA" CDS 1..1410 /gene="NA" /codon_start=1 /product="neuraminidase" /protein_id="ADF58339.1" /db_xref="GI:295002855" /translation="MNPNQKIITIGSVCMTIGMANLILQIGNIISIWISHSIQLGNQN QIETCNQSVITYENNTWVNQTYVNISNTNFAAGQSVVSVKLAGNSSLCPVSGWAIYSK DNSVRIGSKGDVFVIREPFISCSPLECRTFFLTQGALLNDKHSNGTIKDRSPYRTLMS CPIGEVPSPYNSRFESVAWSASACHDGINWLTIGISGPDNGAVAVLKYNGIITDTIKS WRNNILRTQESECACVNGSCFTVMTDGPSNGQASYKIFRIEKGKIVKSVEMNAPNYHY EECSCYPDSSEITCVCRDNWHGSNRPWVSFNQNLEYQIGYICSGIFGDNPRPNDKTGS CGPVSSNGANGVKGFSFKYGNGVWIGRTKSISSRNGFEMIWDPNGWTGTDNNFSIKQD IVGINEWSGYSGSFVQHPELTGLDCIRPCFWVELIRGRPKENTIWTSGSSISFCGVNS DTVGWSWPDGAELPFTIDK" ORIGIN 1 atgaatccaa accaaaagat aataaccatt ggttcggtct gtatgacaat tggaatggct 61 aacttaatat tacaaattgg aaacataatc tcaatatgga ttagccactc aattcaactt 121 gggaatcaaa atcagattga aacatgcaat caaagcgtca ttacttatga aaacaacact 181 tgggtaaatc agacatatgt taacatcagc aacaccaact ttgctgctgg acagtcagtg 241 gtttccgtga aattagcagg caattcctct ctctgccctg ttagtggatg ggctatatac 301 agtaaagaca acagtgtaag aatcggttcc 
aagggggatg tgtttgtcat aagggaacca 361 ttcatatcat gctccccctt ggaatgcaga accttcttct tgactcaagg ggccttgcta 421 aatgacaaac attccaatgg aaccattaaa gacaggagcc catatcgaac cctaatgagc 481 tgtcctattg gtgaagttcc ctctccatac aactcaagat ttgagtcagt cgcttggtca 541 gcaagtgctt gtcatgatgg catcaattgg ctaacaattg gaatttctgg cccagacaat 601 ggggcagtgg ctgtgttaaa gtacaacggc ataataacag acactatcaa gagttggaga 661 aacaatatat tgagaacaca agagtctgaa tgtgcatgtg taaatggttc ttgctttact 721 gtaatgaccg atggaccaag taatggacag gcctcataca agatcttcag aatagaaaag 781 ggaaagatag tcaaatcagt cgaaatgaat gcccctaatt atcactatga ggaatgctcc 841 tgttatcctg attctagtga aatcacatgt gtgtgcaggg ataactggca tggctcgaat 901 cgaccgtggg tgtctttcaa ccagaatctg gaatatcaga taggatacat atgcagtggg 961 attttcggag acaatccacg ccctaatgat aagacaggca gttgtggtcc agtatcgtct 1021 aatggagcaa atggagtaaa agggttttca ttcaaatacg gcaatggtgt ttggataggg 1081 agaactaaaa gcattagttc aagaaacggt tttgagatga tttgggatcc gaacggatgg 1141 actgggacag acaataactt ctcaataaag caagatatcg taggaataaa tgagtggtca 1201 ggatatagcg ggagttttgt tcagcatcca gaactaacag ggctggattg tataagacct 1261 tgcttctggg ttgaactaat cagagggcga cccaaagaga acacaatctg gactagcggg 1321 agcagcatat ccttttgtgg tgtaaacagt gacactgtgg gttggtcttg gccagacggt 1381 gctgagttgc catttaccat tgacaagtaa // genemark.out100644000766000024 1516614006427353 16747 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataGeneMark.hmm (Version 2.2a) Sequence name: Hvrn.contig8 Sequence length: 50124 bp G+C content: 44.82% Matrices file: /home/software/analysis/gene-prediction/genemark/matdir/osativa.mtx (Oryza sativa) Thu Mar 22 10:25:00 2001 Predicted genes/exons Gene Exon Strand Exon Exon Range Exon Start/End # # Type Length Frame 1 1 - Initial 1805 2176 372 3 1 2 5 - Terminal 3108 3229 122 3 2 2 4 - Internal 3869 4501 633 1 2 2 3 - Internal 4820 4888 69 1 2 2 2 - Internal 4981 5061 81 1 2 2 1 - Initial 5296 5656 361 1 1 3 2 - Terminal 7171 7288 118 3 3 3 1 - Initial 7540 7787 248 2 1 4 1 + Single 15431 15757 327 1 3 5 1 + Initial 
17526 17696 171 1 3 5 2 + Internal 17772 17887 116 1 2 5 3 + Internal 18005 18074 70 3 3 5 4 + Internal 18456 18539 84 1 3 5 5 + Internal 18628 18714 87 1 3 5 6 + Internal 18807 18870 64 1 1 5 7 + Internal 19944 20038 95 2 3 5 8 + Internal 20139 20293 155 1 2 5 9 + Terminal 20779 20788 10 3 3 6 5 - Terminal 23000 23061 62 3 2 6 4 - Internal 23397 24101 705 1 2 6 3 - Internal 24708 24821 114 1 2 6 2 - Internal 25079 25356 278 1 3 6 1 - Initial 26970 26977 8 2 1 7 3 - Terminal 34218 34310 93 3 1 7 2 - Internal 35900 36301 402 3 1 7 1 - Initial 36392 36448 57 3 1 8 1 + Initial 36531 37064 534 1 3 8 2 + Terminal 37153 37161 9 1 3 9 3 - Terminal 37880 37917 38 3 2 9 2 - Internal 38938 39006 69 1 2 9 1 - Initial 39080 40214 1135 1 1 10 2 - Terminal 41091 41554 464 3 2 10 1 - Initial 41635 41713 79 1 1 11 1 - Single 41744 42061 318 3 1 12 1 + Initial 42171 42212 42 1 3 12 2 + Terminal 42432 42824 393 1 3 13 7 - Terminal 43798 43932 135 3 1 13 6 - Internal 44220 44297 78 3 1 13 5 - Internal 47595 47685 91 3 3 13 4 - Internal 48393 48526 134 2 1 13 3 - Internal 48643 49024 382 3 3 13 2 - Internal 49118 49149 32 2 1 13 1 - Initial 49457 49507 51 3 1 Predicted gene sequence(s): >Hvrn.contig8|GeneMark.hmm|gene 1|124_aa MEVAVKGYADASFDTDPDDSKSQTGYVFILNGGAVSWCSSKQSVVADSRCEAEYMAALEA AKEGVWMKQFMTDLGVVSSALDPLTLLCDNTRAIALAKEPRFHNKTRHIKRRFNLIRDYV EGED >Hvrn.contig8|GeneMark.hmm|gene 2|421_aa MAHAKVTLNFNTFLEKAKLKDDGSNFVDWARNLKLLLQAGKKDYVLNVALGDEPPAAADQ DAKNAWLACKEDYSVVQCAVLYGLEPGLQRCFERHGAYEMFQELKFIFQKNARIERYETS ESELRKEHQVLMVNKATSFKRSGKGKKGYGSLEAQLSKYLAGKKAAKEKSENNGCSISMS NIFYGHAPNVRGLFILNLDSDNTHIHNIETKRVRVNNDSAMFLWHCRLGHIGVKRMKKLH TDGLLESLDFDSLDTCEPCLMGKMTKTPFSGTMERASDLLEIIHTDVCGPMSAEARGGYR YFLTFIDDLSRYGYVYLMKHKSETFEKFKQFQSEVENHRNKKIKFLRSDHGGEYLSFEFG AHLRQCGIVSQLTPLGTPQRNEAMVGPDSNKWLEAMKSEIGSMYGNKVWTLEVLPEGRKA I >Hvrn.contig8|GeneMark.hmm|gene 3|121_aa MVRRQRLIYRMTSFDYRKVFGHYRECTESDEWVPNVHREGPTHPGKPIGPRGGAPALGGL VGQPKRALCAKDRKSKRKKKRKRSRYFTTTGAPSRCRRTHLLIRLACWIKKAEIIIELYV C 
>Hvrn.contig8|GeneMark.hmm|gene 4|108_aa MFTTPKAGGGMYLCLSVGWGIVGRRRVMSGCGQGSEMGLVGLRTRRHWAKTGRGGAAGGA ASIGDGPRRAADKATLGEDGPGRGVGRGGVGRRRVASGGGDREEDEWS >Hvrn.contig8|GeneMark.hmm|gene 5|283_aa MDAAVQEAKLLRQVNALIVAHLRDQNLTQAAAAVAAATMTPKADASLPNHLLRLVAKGLA AEREEAARGGGAPPAFDSAGGGGLARPLGTSAVDFSVQNVRGPSKTFPKHETRHISDHKN VARCAKFSPDGKHFATGSGDTSIKFFEVSKIKQTMLGDSKEGPGRPVVRTFYDHVQLLTQ LLVHSTDKVSSFVTNIPGTDHPVAHLYDVNTFTCFLSANPQDSSAAINQVRYSGTGSMYV TASKDGSLRIWDGVSAECVRPIIGAHGSVEATSAIFTKDESGF >Hvrn.contig8|GeneMark.hmm|gene 6|388_aa MGSVVFLEGSEGNLQALKDTLQAYQVASAQKVNLQKSSILDGKGCRDEDKGTLKQTIGID SEALSERYSGLPTVVGRLKDGSFEYVRERSKGKVSGSVGKASVALQFPSSLCARVLKARY FKECTIMNTTCPNAMFWKVLSSEKWVPVAIPPVSEGPHGELASWLLRWFAEVGDPERELM VHAVYGLWLARNEARDGKRIVDPRVVEENVYQHIIEWNAIHMKKPRSTTPTLAVRWSPPE QGWLKANSDGALAKLRDRGGGGVVLRDHDGAYRGGACYVFRDVSDPEVVEILACRKAVHL AVQTGATRVHVEVDSKGMAAMLNDQAKNLSAAGPIVEEIKLLGRTLQGFIVSRVRRSGNH GAHLLAREVRSVYTHVILKQPLFDTCRL >Hvrn.contig8|GeneMark.hmm|gene 7|183_aa MVLTEKEAKGFVFSGPVEEAWGLHHDAQFRDLGNNLFLVHFGGEGDWKHSRNNGPWQFDF MILKGYDGKTRPSEMVFDSVEAWVRVEDLPLDRRTREFGEALGNWLGEVVKVDVERDGFA KGKYLRVRAKIFVYEPVVRYFNLKESVDDEVETAEGQAGPLEAEAEARRGASVSAHSFGR WGK >Hvrn.contig8|GeneMark.hmm|gene 8|180_aa MASTVSPWSETPQDILGLVIDRLHSSPDHEEPRLSAAWSRFLLAVPVAAANRRGFQRARR TRHSAAADRARFRAVCRSWHLAMRQHVSTPRVLPWIILSDGYFFTPSDNGCRAPRRLPSL PKNARCIGSTDGWLALDCTDARNVHTYLLHNPFSDTTVPLPELDPIIANVSEFFAVRKAA >Hvrn.contig8|GeneMark.hmm|gene 9|413_aa MPLKFWDETFSTAVYLINRVPSRVIHNQTPLERLFGLTPNYTFLRIFGCAVWPNLRPFNK HKLEYRSKQCVFIGYNYLHKGYKCLDVSTGRVYVSQDVIFDEHIFPFASLHPNAGAQLRA ELVLLPPTLLNLSSPLTPSAAPNDPMAISTIYAPTSANSVQDSAGISHDFMQPNVSTDLV ATENPGLHASESATAAPGAGDPPLQASGSAAAAPGSSPGFVHQPAASVGRSPASTSDPAR QPDASAARPPVSDPVRPTTVATALFPASDLVRSPQEIRLQRRAPPTAPWIGRGLPRVVGP PCLLPWTREISLDVVTRYRLLRLRPMQRRRCPMQRPPRLLFLLVCHLIRYLLTLRCPVVS STICNPCNQHLHPLGLILGEPENLKEAIADPKWKAAMDEEFDWAGCPDDRRST >Hvrn.contig8|GeneMark.hmm|gene 10|180_aa MAAAGKPLDDDELVSYILQGLDSDYNPEARIDAQNGSNTNSFSINLASKGGSRNNNDTRP SGPGGGNPAAYRGAGGGFFPNTLVAPPPSGGRDETCQICKRQGHATWHCFKRYDKNFNPP 
PKRQGGGGGNNSGGGGNSSGGNTKSANTVPAAYDVDTNWYLDTGAMDHVTGELEKLAMHD >Hvrn.contig8|GeneMark.hmm|gene 11|105_aa MGYLDGTMAEPPAVLTTETDVAGKKEISSTPNPAHVLWYTQDQQVLTFLLASLSRDVLLQ VHSLASATGVWTAIQQMFASHSRARHIQLRGQLGNTKKGDSPVAI >Hvrn.contig8|GeneMark.hmm|gene 12|144_aa MVELEEEDDMSMEEVALMTNNSNYLIILIRPGKGVWLPKPDTAPFNLFIDIVFLQGKLYG ITQAEDLASVSIDFDDCGMPTVTTVERLIKHPPLESCEFDVWSDAGEKLEADGDMGDEDQ VENGGEDHDEALNEVDARIQKENR >Hvrn.contig8|GeneMark.hmm|gene 13|300_aa MSTATSLWDKAALMMREELAVAAVVAGCLDMTKLYVVGAGMFSCVTVALYPVSVIKTRMQ VASGEAMRRNALATFKNILKVDGVPGLYRGFGTVITGAIPARIIFLTALEKTKATSLKLV EPLQLSESMEAALANGLGGLTASLCSQAVFVPIDVVSQKLMVQGYSGHVRYKGGIDVVQK IMKADGPRGLYRGFGLSVMTALGRLDDKEDTPSQLKIVGVQATGGMVAGATSLEDNPLSD NVPQFAETSSAGSPLEKERVRQRASATISVTRDCQCSRRPTIGGVRQLGRSLPMRRDGAT testaln.xmfa100644000766000024 422414006427353 16725 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>1:1-598 + chrY TCCAAGTCGGCTTTATGTTTGCTTCTGCCAGGCATTCTAGATGCCCCATGTCTAGGATCT CTTTAGGCAGGAGAGAGGGTGATGGTGTAGGAGGACCCATTTCTTGGCTTGCAGATTCCA ATAATAAAAAAGTCACAGATTTAAACCCCAAACTTTGATGAAATGCAGGTCTAGGGTTTT AAAATATAATGAGAGTTAAATACTTTTGTATTTTCTTCATCCAGAGATGGGGCAAGCTTC CTCATCTGCTCGTTCATGGGTGATTTATATTTTCCCCACTCCATCCTTTTCCTAAGGTAT TTTTTTTTTAGGGACAATGGCTTTTTGCAGAGTACTCAGTTCCAGCTCCGGGGGCACCGG TTGAGCCCTTACCGTCCTGCCCCTAAACATCCAGACCTCAAGTTAGAGAGGGGAGTAACA TTTGGGGGGTGCCCACACCTAGGAGGACCAATCCTTCTGGTTTCCTTAGGGATGCAGGAA TTTGGGGGGGGGGGGCTCAGTGCTAAAACCAGTAGAGTCCTGGGCAAACGAGTATGACTG AAGATGCTTTGAACACCCTAGCGTTATGTCGATCGCATGCATCGTAGTGTCGCTGATG >2:5000-5534 - chr17 TGCAGATTGGCCTT-TGTTTCGTTTTTC-AAGCGTT-TAAA--CGCCTTGCCTAAGAATC TTTT--GCAGGGAAGGGGATAGTGAACTGGGAAAACCTGGCTCTTCCTTTCGAGATTCCA GTAACAAACATGTCATAACTATAAACGCCAAACTTGG--AGAGCGCAGGAATGGAAGGTC AAACACCAATGAGAGTTAGATGGTTTTGGGTTT----------------------GCT-- CTAGTCTGCACG-------GTGCTCCCCGTCCCCTCACGTCCGTGCTTTTCCTCAGGATG ATGCCTTGCCAGAACACCGGTGTGCTGCAAGGTGCTCAGCTCCAAATCGGGCTGCACCGC TTCAGCTTTCCCCATCCAGCCA--ACGCAGGAAGGCCTGGAGCTACAGAGTTTAGAGCCA TCTCTCCGCTGCTCAT--------TAACCAACCATTCCAGCT-------GTCTGTAGTGG 
GTTTTTTTCTT----CTCTACACTAAAATGAGGACAGTCCAGGCCCTTTG--TTAGACTG AAGATGCTTTGAACACCCTAGCGTTATGTCGATCGCATGCATCGTAGTGTCGCTGATG >3:19000-19537 - chr7 TCCAGACTGTCTTT-TGCTCCCTTTTTCCGAGCATT-TAAAAATACCATGCCTAAGAATC TTTT--GCAGGGAAGGGGATAGCGAGCTGGGAAGGCCTATTTCTTCATTTCGAGATTCTG GTAATAAACATGTCATAAATATAAATGCCAAACTCCG--GAAATGCAGGTGTAGAGCGTC AGATTCTATTTGGACTTAAATGATGTGGTGTTTT---------------------GCT-- CTAATTTCTACC-------GTGCTCTCCGTTCC-TCAAGTCCATGCATTTCCTTAGGGTG CTGCCTTTCCAGAGTACTGGTATGCTGCAGGGTGCTCAGTTCCACATCTGTCTGCACTAT TTCAAAGTTTCCC-TCCAGCCC--ACACAACTATGCCTAGAGCTA--GAGGTTAGAACCG TCTGTCCA-TGCTCTT--------TAACCAACCACTCCAGAT-------AGGTGTGGTGG TTTTTTTTTTTTTTTCTCTGTACTAAAATTAGGACAGTCCAGGCCTGTTG--TTAGACCA AAGATGCTTTGAACACCCTAGCGTTATGTCGATCGCATGCATCGTAGTGTCGCTGATG = score = 111 >1:1000-1059 + chrY CACTCTAATAGTAAAGTTTCTTTTGCTGTGCAGAAGCTCTTTAGTTTAATTAGATCCCAT >2:6000-6055 + chr17 CACTCTAATACTAAACTTTCTTTTCCTCTCCACA----CTTTACTTTAATTACATCCCAT >3:20000-20055 - chr12 CACTCTAATAGTAAAGTTTCTT----TGTGCAGAAGCTCTTAGTTTTAATTAGATCCCAT = score = 11 >TelAviv_01:1-9 + TelAviv_01 AAAAAAAAA test.genbank100644000766000024 3714314006427353 16732 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS DDU63596 310 bp DNA INV 14-MAY-1999 DEFINITION Dictyostelium discoideum Tdd-4 transposable element flanking sequence, clone p427/428 right end. ACCESSION U63596 NID g2393749 KEYWORDS . SOURCE Dictyostelium discoideum. ORGANISM Dictyostelium discoideum Eukaryota; Dictyosteliida; Dictyostelium. REFERENCE 1 (bases 1 to 310) AUTHORS Wells,D.J. TITLE Tdd-4, a DNA transposon of Dictyostelium that encodes proteins similar to LTR retroelement integrases JOURNAL Nucleic Acids Res. 27 (11), 2408-2415 (1999) REFERENCE 2 (bases 1 to 310) AUTHORS Wells,D.J. and Welker,D.L. TITLE Dictyostelium discoideum Tdd-4 transposable element, right end flanking sequence from clone p427/428 JOURNAL Unpublished REFERENCE 3 (bases 1 to 310) AUTHORS Wells,D.J. and Welker,D.L. 
TITLE Direct Submission JOURNAL Submitted (11-JUL-1996) Biology, Utah State Univ., Logan, UT 84322-5305, USA FEATURES Location/Qualifiers source 1..310 /organism="Dictyostelium discoideum" /strain="AX4" /db_xref="taxon:44689" /clone="p427/428" misc_feature 5.12 /note="Fuzzy location" misc_feature join(J00194:(100..202),1..245,256..258) /note="Location partly in another entry" BASE COUNT 118 a 46 c 67 g 79 t ORIGIN 1 gtgacagttg gctgtcagac atacaatgat tgtttagaag aggagaagat tgatccggag 61 taccgtgata gtattttaaa aactatgaaa gcgggaatac ttaatggtaa actagttaga 121 ttatgtgacg tgccaagggg tgtagatgta gaaattgaaa caactggtct aaccgattca 181 gaaggagaaa gtgaatcaaa agaagaagag tgatgatgaa tagccaccat tactgcatac 241 tgtagccctt acccttgtcg caccattagc cattaataaa aataaaaaat tatataaaaa 301 ttacacccat // LOCUS DDU63595 83 bp DNA INV 14-MAY-1999 DEFINITION Dictyostelium discoideum Tdd-4 transposable element flanking sequence, clone p427/428 left end. ACCESSION U63595 NID g2393748 KEYWORDS . SOURCE Dictyostelium discoideum. ORGANISM Dictyostelium discoideum Eukaryota; Dictyosteliida; Dictyostelium. REFERENCE 1 (bases 1 to 83) AUTHORS Wells,D.J. TITLE Tdd-4, a DNA transposon of Dictyostelium that encodes proteins similar to LTR retroelement integrases JOURNAL Nucleic Acids Res. 27 (11), 2408-2415 (1999) REFERENCE 2 (bases 1 to 83) AUTHORS Wells,D.J. and Welker,D.L. TITLE Dictyostelium discoideum Tdd-4 transposable element, left end flanking sequence from clone p427/428 JOURNAL Unpublished REFERENCE 3 (bases 1 to 83) AUTHORS Wells,D.J. and Welker,D.L. 
TITLE Direct Submission JOURNAL Submitted (11-JUL-1996) Biology, Utah State Univ., Logan, UT 84322-5305, USA FEATURES Location/Qualifiers source 1..83 /organism="Dictyostelium discoideum" /strain="AX4" /db_xref="taxon:44689" /clone="p427/428" BASE COUNT 31 a 16 c 12 g 24 t ORIGIN 1 ttcgaaggat atctcaaggc agttaataat tactatgatg attgtaaaat attccaaagt 61 ttcccagacc caccaataat gac // LOCUS HUMBDNF 918 bp DNA PRI 31-OCT-1994 DEFINITION Human brain-derived neurotrophic factor (BDNF) gene, complete cds. ACCESSION M37762 VERSION M37762.1 GI:179402 KEYWORDS neurotrophic factor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 918) AUTHORS Jones,K.R. and Reichardt,L.F. TITLE Molecular cloning of a human gene that is a member of the nerve growth factor family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (20), 8060-8064 (1990) MEDLINE 91045937 COMMENT Draft entry and computer-readable sequence for [Proc. Natl. Acad. Sci. U.S.A. (1990) In press] kindly submitted by K.R.Jones, 13-AUG-1990. 
FEATURES Location/Qualifiers source 1..918 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" sig_peptide 76..123 /gene="NTF3" /note="G00-125-917; putative" /product="brain-derived neurotrophic factor" CDS 76..819 /gene="BDNF" /note="putative" /codon_start=1 /db_xref="GDB:G00-125-916" /product="brain-derived neurotrophic factor" /protein_id="AAA51820.1" /db_xref="GI:179403" /translation="MTILFLTMVISYFGCMKAAPMKEANIRGQGGLAYPGVRTHGTLE SVNGPKAGSRGLTSLADTFEHVIEELLDEDQKVRPNEENNKDADLYTSRVMLSSQVPL EPPLLFLLEEYKNYLDAANMSMRVRRHSDPARRGELSVCDSISEWVTAADKKTAVDMS GGTVTVLEKVPVSKGQLKQYFYETKCNPMGYTKEGCRGIDKRHWNSQCRTTQSYVRAL TMDSKKRIGWRFIRIDTSCVCTLTIKRGR" gene 76..816 /gene="NTF3" /map="12p13" gene 76..819 /gene="BDNF" /map="11p13" mat_peptide 124..816 /gene="NTF3" /note="G00-125-917; putative" /product="brain-derived neurotrophic factor" BASE COUNT 269 a 192 c 237 g 220 t ORIGIN 1 ggtgaaagaa agccctaacc agttttctgt cttgtttctg ctttctccct acagttccac 61 caggtgagaa gagtgatgac catccttttc cttactatgg ttatttcata ctttggttgc 121 atgaaggctg cccccatgaa agaagcaaac atccgaggac aaggtggctt ggcctaccca 181 ggtgtgcgga cccatgggac tctggagagc gtgaatgggc ccaaggcagg ttcaagaggc 241 ttgacatcat tggctgacac tttcgaacac gtgatagaag agctgttgga tgaggaccag 301 aaagttcggc ccaatgaaga aaacaataag gacgcagact tgtacacgtc cagggtgatg 361 ctcagtagtc aagtgccttt ggagcctcct cttctctttc tgctggagga atacaaaaat 421 tacctagatg ctgcaaacat gtccatgagg gtccggcgcc actctgaccc tgcccgccga 481 ggggagctga gcgtgtgtga cagtattagt gagtgggtaa cggcggcaga caaaaagact 541 gcagtggaca tgtcgggcgg gacggtcaca gtccttgaaa aggtccctgt atcaaaaggc 601 caactgaagc aatacttcta cgagaccaag tgcaatccca tgggttacac aaaagaaggc 661 tgcaggggca tagacaaaag gcattggaac tcccagtgcc gaactaccca gtcgtacgtg 721 cgggccctta ccatggatag caaaaagaga attggctggc gattcataag gatagacact 781 tcttgtgtat gtacattgac cattaaaagg ggaagatagt ggatttatgt tgtatagatt 841 agattatatt gagacaaaaa ttatctattt gtatatatac ataacagggt aaattattca 901 gttaagaaaa aaataatt // LOCUS NT_010368 161485 bp DNA CON 
16-NOV-2000 DEFINITION Homo sapiens chromosome 15 working draft sequence segment, complete sequence. ACCESSION NT_010368 VERSION NT_010368.1 GI:11433101 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 161485) AUTHORS International Human Genome Project collaborators. TITLE Toward the complete sequence of the human genome JOURNAL Unpublished COMMENT GENOME ANNOTATION REFSEQ: NCBI contigs are derived from assembled genomic sequence data. They may include both draft and finished sequence. COMPLETENESS: not full length. FEATURES Location/Qualifiers source 1..310 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" source order(1..100,251..300,300..310) /note="Doctored from Accession AC011224 sequenced by Whitehead Institute for Biomedical Research" /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RP11-10K20" variation 244 /replace="T" /replace="A" /db_xref="dbSNP:140670" ORIGIN 1 gtgacagttg gctgtcagac atacaatgat tgtttagaag aggagaagat tgatccggag 61 taccgtgata gtattttaaa aactatgaaa gcgggaatac ttaatggtaa actagttaga 121 ttatgtgacg tgccaagggg tgtagatgta gaaattgaaa caactggtct aaccgattca 181 gaaggagaaa gtgaatcaaa agaagaagag tgatgatgaa tagccaccat tactgcatac 241 tgtagccctt acccttgtcg caccattagc cattaataaa aataaaaaat tatataaaaa 301 ttacacccat // LOCUS HUMBETGLOA 3002 bp DNA linear PRI 26-AUG-1994 DEFINITION Human haplotype C4 beta-globin gene, complete cds. ACCESSION L26462 VERSION L26462.1 GI:432453 KEYWORDS beta-globin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3002) AUTHORS Fullerton,S.M., Harding,R.M., Boyce,A.J. and Clegg,J.B. TITLE Molecular and population genetic analysis of allelic sequence diversity at the human beta-globin locus JOURNAL Proc. Natl. Acad. 
Sci. U.S.A. 91 (5), 1805-1809 (1994) MEDLINE 94173918 PUBMED 7907422 FEATURES Location/Qualifiers source 1..3002 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="C4" /note="sequence found in a Melanesian population" variation 111 /replace="t" variation 263 /note="Rsa I polymorphism" /replace="t" variation 273 /replace="c" variation 286..287 /note="2 bp insertion of AT" /replace="" variation 288 /replace="t" variation 295..296 /note="1 bp deletion of C or 2 bp deletion of CT" /replace="" variation 347 /replace="c" variation 476 /replace="t" variation 500 /replace="c" CDS join(866..957,1088..1310,2161..2289) /codon_start=1 /product="beta-globin" /protein_id="AAA21100.1" /db_xref="GI:532506" /translation="MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFE SFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPE NFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH" exon <866..957 /number=1 variation 874 /replace="c" intron 958..1087 /number=1 exon 1088..1310 /number=2 intron 1311..2160 /number=2 variation 1326 /note="Ava II polymorphism" /replace="g" variation 1384 /replace="g" variation 1391 /replace="t" variation 1976 /replace="t" exon 2161..>2289 /number=3 variation 2522 /replace="c" variation 2602 /replace="a" variation 2604 /replace="c" variation 2760 /note="Hinf I polymorphism" /replace="t" variation 2913 /replace="g" BASE COUNT 810 a 601 c 599 g 992 t ORIGIN 1 acctcctatt tgacaccact gattacccca ttgatagtca cactttgggt tgtaagtgac 61 tttttattta tttgtatttt tgactgcatt aagaggtctc tagtttttta cctcttgttt 121 cccaaaacct aataagtaac taatgcacag agcacattga tttgtattta ttctattttt 181 agacataatt tattagcatg catgagcaaa ttaagaaaaa caacaacaaa tgaatgcata 241 tatatgtata tgtatgtgtg tacatataca catatatata tatatatatt ttttcttttc 301 ttaccagaag gttttaatcc aaataaggag aagatatgct tagaactgag gtagagtttt 361 catccattct gtcctgtaag tattttgcat attctggaga cgcaggaaga gatccatcta 421 catatcccaa agctgaatta tggtagacaa aactcttcca cttttagtgc atcaacttct 481 tatttgtgta ataagaaaat tgggaaaacg atcttcaata tgcttaccaa 
gctgtgattc 541 caaatattac gtaaatacac ttgcaaagga ggatgttttt agtagcaatt tgtactgatg 601 gtatggggcc aagagatata tcttagaggg agggctgagg gtttgaagtc caactcctaa 661 gccagtgcca gaagagccaa ggacaggtac ggctgtcatc acttagacct caccctgtgg 721 agccacaccc tagggttggc caatctactc ccaggagcag ggagggcagg agccagggct 781 gggcataaaa gtcagggcag agccatctat tgcttacatt tgcttctgac acaactgtgt 841 tcactagcaa cctcaaacag acaccatggt gcatctgact cctgaggaga agtctgccgt 901 tactgccctg tggggcaagg tgaacgtgga tgaagttggt ggtgaggccc tgggcaggtt 961 ggtatcaagg ttacaagaca ggtttaagga gaccaataga aactgggcat gtggagacag 1021 agaagactct tgggtttctg ataggcactg actctctctg cctattggtc tattttccca 1081 cccttaggct gctggtggtc tacccttgga cccagaggtt ctttgagtcc tttggggatc 1141 tgtccactcc tgatgctgtt atgggcaacc ctaaggtgaa ggctcatggc aagaaagtgc 1201 tcggtgcctt tagtgatggc ctggctcacc tggacaacct caagggcacc tttgccacac 1261 tgagtgagct gcactgtgac aagctgcacg tggatcctga gaacttcagg gtgagtctat 1321 gggacccttg atgttttctt tccccttctt ttctatggtt aagttcatgt cataggaagg 1381 ggataagtaa cagggtacag tttagaatgg gaaacagacg aatgattgca tcagtgtgga 1441 agtctcagga tcgttttagt ttcttttatt tgctgttcat aacaattgtt ttcttttgtt 1501 taattcttgc tttctttttt tttcttctcc gcaattttta ctattatact taatgcctta 1561 acattgtgta taacaaaagg aaatatctct gagatacatt aagtaactta aaaaaaaact 1621 ttacacagtc tgcctagtac attactattt ggaatatatg tgtgcttatt tgcatattca 1681 taatctccct actttatttt cttttatttt taattgatac ataatcatta tacatattta 1741 tgggttaaag tgtaatgttt taatatgtgt acacatattg accaaatcag ggtaattttg 1801 catttgtaat tttaaaaaat gctttcttct tttaatatac ttttttgttt atcttatttc 1861 taatactttc cctaatctct ttctttcagg gcaataatga tacaatgtat catgcctctt 1921 tgcaccattc taaagaataa cagtgataat ttctgggtta aggcaatagc aatatctctg 1981 catataaata tttctgcata taaattgtaa ctgatgtaag aggtttcata ttgctaatag 2041 cagctacaat ccagctacca ttctgctttt attttatggt tgggataagg ctggattatt 2101 ctgagtccaa gctaggccct tttgctaatc atgttcatac ctcttatctt cctcccacag 2161 ctcctgggca acgtgctggt ctgtgtgctg gcccatcact ttggcaaaga attcacccca 2221 
ccagtgcagg ctgcctatca gaaagtggtg gctggtgtgg ctaatgccct ggcccacaag 2281 tatcactaag ctcgctttct tgctgtccaa tttctattaa aggttccttt gttccctaag 2341 tccaactact aaactggggg atattatgaa gggccttgag catctggatt ctgcctaata 2401 aaaaacattt attttcattg caatgatgta tttaaattat ttctgaatat tttactaaaa 2461 agggaatgtg ggaggtcagt gcatttaaaa cataaagaaa tgaagagcta gttcaaacct 2521 tgggaaaata cactatatct taaactccat gaaagaaggt gaggctgcaa acagctaatg 2581 cacattggca acagccctga tgcatatgcc ttattcatcc ctcagaaaag gattcaagta 2641 gaggcttgat ttggaggtta aagttttgct atgctgtatt ttacattact tattgtttta 2701 gctgtcctca tgaatgtctt ttcactaccc atttgcttat cctgcatctc tcagccttga 2761 ctccactcag ttctcttgct tagagatacc acctttcccc tgaagtgttc cttccatgtt 2821 ttacggcgag atggtttctc ctcgcctggc cactcagcct tagttgtctc tgttgtctta 2881 tagaggtcta cttgaagaag gaaaaacagg ggtcatggtt tgactgtcct gtgagccctt 2941 cttccctgcc tcccccactc acagtgaccc ggaatctgca gtgctagtct cccggaacta 3001 tc // genewise.out100644000766000024 3712114006427353 16757 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/datagenewise $Name: not supported by cvs2svn $ (unreleased release) This program is freely distributed under a GPL. See source directory Copyright (c) GRL limited: portions of the code are from separate copyright Query protein: SINFRUP00000067802 Comp Matrix: blosum62.bla Gap open: 12 Gap extension: 2 Start/End default Target Sequence Scaffold_2042.1 Strand: forward Start/End (protein) default Gene Paras: human.gf Codon Table: codon.table Subs error: 1e-05 Indel error: 1e-05 Model splice? model Model codon bias? flat Model intron bias? 
tied Null model syn Algorithm 623 genewise output Score 2054.68 bits over entire alignment Scores as bits over a synchronous coding model Warning: The bits scores is not probablistically correct for single seqs See WWW help for more info SINFRUP00000067 1 LDEDQLSHSSSRLSRSPLKGVKKAKIMQCKVTLLDNSDYTIDVE LDEDQLSHSSSRLSRSPLKGVKKAKIMQCKVTLLDNSDYTIDVE LDEDQLSHSSSRLSRSPLKGVKKAKIMQCKVTLLDNSDYTIDVE Scaffold_2042.122265 cgggcttcaaacctatccaggaagaaactagatcgatgtaaggg taaaatcaggggtcgcctagtaacattagatcttaacaactata gcgcgactccttccgtcgaacgaggaggtacaggcccccattag SINFRUP00000067 45 KGARGQMLFDKVCEHLNLLERD KGARGQMLFDKVCEHLNLLERD KGARGQMLFDKVCEHLNLLERD Scaffold_2042.122397 GTGAGCA Intron 1 CAGaggagcactgagtgccatcgag <0-----[22397 : 24223]-0>agcggatttaatgaatattaga aacgcagcttaatgtatacgac SINFRUP00000067 67 YFGITYRDVENQK NWLDPSKEL YFGITYRDVENQK NWLDPSKEL YFGITYRDVENQK NWLDPSKEL Scaffold_2042.124290 ttgaatagggacaGTAAATA Intron 2 CAGatcgctagc atgtcagataaaa<0-----[24329 : 24470]-0>agtaccaat ctccatactgtgg tggtttagg SINFRUP00000067 89 KKQIR GPWNFAFNVKFYPP KKQIR GPWNFAFNVKFYPP KKQIR T:T[act] GPWNFAFNVKFYPP Scaffold_2042.124498 aacacAGTAAGCA Intron 3 CAGCTgctatgtagattcc aaatg <1-----[24514 : 24586]-1> gcgatctatatacc aggcg tcgctcctcgtcta SINFRUP00000067 109 DPVQLAEDIT YYLCLQLRD DPVQLAEDIT YYLCLQLRD DPVQLAEDIT R:R[agg] YYLCLQLRD Scaffold_2042.124631 gcgccgggaaAGGTAAGAG Intron 4 CAGGttttcccag actatcaatc <2-----[24663 : 24816]-2> aatgtatga ttcggtgcca ctgtgggat SINFRUP00000067 129 DVVSGRLPCSFATHTVLGSYTVQSELGDYDHEELGSDYLSELRLAPNQT DVVSGRLPCSFATHTVLGSYTVQSELGDYDHEELGSDYLSELRLAPNQT DVVSGRLPCSFATHTVLGSYTVQSELGDYDHEELGSDYLSELRLAPNQT Scaffold_2042.124845 gggtgccctttgacagcgttagctgcggtgcggcgagtcagcccgcaca attcggtcgctccacttgcactacatgaaaaaatggaatgatgtccaac tgtgctgccttcccatactcgggtagactctgggccccctggctaccgc SINFRUP00000067 178 KELEEKVMELHKSY GMTPG KELEEKVMELHKSY GMTPG KELEEKVMELHKSY K:K[aag] GMTPG Scaffold_2042.124992 agcggagagccaatAAGTTGGTT Intron 5 CAGGgaacg aataaattataaga <2-----[25036 : 25161]-2> gtccg 
agtgagcgaccgcc ggatg SINFRUP00000067 198 EAEMHFLENAKKLSMYGVDLHHAK EAEMHFLENAKKLSMYGVDLHHAK EAEMHFLENAKKLSMYGVDLHHAK Scaffold_2042.125178 gggactcgagaactatgggcccgaGTAATAA Intron 6 acatattaacaatctagtataaca<0-----[25250 : 26185] aaggccgatcgggcgccttccctg SINFRUP00000067 222 DSEGVEIMLGVCSSGLLIYRDRLRINRFAWPKILKISYKRNNFYIK DSEGVEIMLGVCSSGLLIYRDRLRINRFAWPKILKISYKRNNFYIK DSEGVEIMLGVCSSGLLIYRDRLRINRFAWPKILKISYKRNNFYIK Scaffold_2042.126183 TAGgtggggaacggttagccatagatcaaatgtcaacaattaaaattaa -0>acagtatttgtgcggtttagagtgtagtcgcattatcaagaatata ctgatgcggatcgccgtccacgggccgttgtaacgttcggtcccca SINFRUP00000067 268 IRPGE FEQFESTIGFKLPNHRA IRPGE FEQFESTIGFKLPNHRA IRPGE FEQFESTIGFKLPNHRA Scaffold_2042.126324 accggGTAATTA Intron 7 CAGtgctgaaagtaccaccg tgcga<0-----[26339 : 26457]-0>taatagctgtatcaagc ctgcg tggtgcgctcgtctttt SINFRUP00000067 290 AKRLWKVCVEHHTFFRQVMKEVL AKRLWKVCVEHHTFFRQVMKEVL AKRLWKVCVEHHTFFRQVMKEVL G:G[ggg] Scaffold_2042.126509 gaactagtggccattacgaaggcGGGTTGCCA Intron 8 cagtgatgtaaacttgattaatt <2-----[26580 : 26864] cagcggctggtccccgacggggg SINFRUP00000067 313 LVSPEAPPKKFLSLGSKFRYSGRTQAQTRRASSQIIRPAPFFERS LVSPEAPPKKFLSLGSKFRYSGRTQAQTRRASSQIIRPAPFFERS LVSPEAPPKKFLSLGSKFRYSGRTQAQTRRASSQIIRPAPFFERS Scaffold_2042.126862 CAGGcgtcggccaatcacgtatctagaacgcaccgatcaaacgcttgct -2> ttccacccaattgtgcatgagggcacacggcgcattgcccttagc agttggtaggcgttctgtcctcaggtgcagcctgctattgctgtc SINFRUP00000067 359 SSKRYNMSRSLDG SSADGV SSKRYNMSRSLDG SSADGV SSKRYNMSRSLDG D:D[gac] SSADGV Scaffold_2042.127001 taactaatcatggGGTAGGAT Intron 9 AAGACatgggg cgagaatcggtag <1-----[27041 : 27314]-1> gccagt ccacccgcttata ctttag SINFRUP00000067 379 ARVKGNIITTVTPERKAEEENGEQEDAKNDAADKPDPAASSPLHQETKV ARVKGNIITTVTPERKAEEENGEQEDAKNDAADKPDPAASSPLHQETKV ARVKGNIITTVTPERKAEEENGEQEDAKNDAADKPDPAASSPLHQETKV Scaffold_2042.127335 gcgagaaaaagacgaaggggaggcgggaaggggacgcggtaccccgaag cgtagattcctccagacaaaagaaaacaaaccaacaccccgctaaacat tatgatccctatagaaaaagcagggtcgctgacgactttcctccaaaga SINFRUP00000067 428 TL DPLRSELSLPSSPISFT TL DPLRSELSLPSSPISFT TL 
D:D[gac] DPLRSELSLPSSPISFT Scaffold_2042.127482 acGGTCAACC Intron 10 CAGACgccctgctccttcatta ct <1-----[27489 : 28576]-1> actgcatctcccctctc aa tccctgctccattcact SINFRUP00000067 448 KIRRRRRENAHKRASSVSPGKSSTGRCRRQALADRKAALLDEQALLLVA KIRRRRRENAHKRASSVSPGKSSTGRCRRQALADRKAALLDEQALLLVA KIRRRRRENAHKRASSVSPGKSSTGRCRRQALADRKAALLDEQALLLVA Scaffold_2042.128630 aaccacagagcacgttgacgaaaagctcccgcggcaggccggcgtccgg atgggggaacaagccctgcgaggcgggggactcagaccttaaacttttc aagggcggcatgaccactacgccagtcctgctttcatggatgacggcga SINFRUP00000067 497 RKQRLEQGRNHGGTLFSFSLHLPDLSSVLDEDGYITFPDLSKMRFLPEC RKQRLEQGRNHGGTLFSFSLHLPDLSSVLDEDGYITFPDLSKMRFLPEC RKQRLEQGRNHGGTLFSFSLHLPDLSSVLDEDGYITFPDLSKMRFLPEC Scaffold_2042.128777 cacacgcgaacggacttttccccgcttgtggggtaatcgctaactccgt gaagtaaggaaggcttctctatcatccttaaagatctcatcatgttcag aggagggcgttccacccccgcgccgctcgcactccccctgcagccctgt SINFRUP00000067 546 AQNFLPIKS YLCSLLLFTL AQNFLPIKS YLCSLLLFTL AQNFLPIKS P:P[cct] YLCSLLLFTL Scaffold_2042.128924 gcatccaatCCGTCACTC Intron 11 CAGTtcttccctac caattctac <2-----[28953 : 30816]-2> atgcttttct ggccgccga ttcggatctc SINFRUP00000067 566 FTFFFLCLLNSLHLSSFAVSRKQTRNLKYRLRYTGLTFIDRPCSMHQIL FTFFFLCLLNSLHLSSFAVSRKQTRNLKYRLRYTGLTFIDRPCSMHQIL FTFFFLCLLNSLHLSSFAVSRKQTRNLKYRLRYTGLTFIDRPCSMHQIL Scaffold_2042.130848 tatttctccatccctttggacacaaacatacatagcatagccttaccac tcttttgttactatcctctggaacgataagtgacgtcttagcgctaatt cccttttaatttccctctctgggggtgacacgctgcttctgccggtact SINFRUP00000067 615 DILTRCPEPSASTISHFAVSILAYCF DILTRCPEPSASTISHFAVSILAYCF DILTRCPEPSASTISHFAVSILAYCF Scaffold_2042.130995 gacaatcgctgtaaactggtatgttt attcggcaccccctgatctcttcagt cccaaccgagaccctcttcccgtccc SINFRUP00000067 641 EVVKHQTNISELKRSFLET EVVKHQTNISELKRSFLET S:S[tct] EVVKHQTNISELKRSFLET Scaffold_2042.131073 TGTTGTCA Intron 12 CAGCTgggaccaaaagcacttcga <1-----[31074 : 31980]-1> attaaacatgatagcttac ggcacgccccaggcccggg SINFRUP00000067 661 GDGTQGPTEWEKRLSSSPMCSPRFDEAPMIEPLDVSV GDGTQGPTEWEKRLSSSPMCSPRFDEAPMIEPLDVSV GDGTQGPTEWEKRLSSSPMCSPRFDEAPMIEPLDVSV Scaffold_2042.132040 
gggacgcagtgaactttcattcatgggcaagccggag gagcagccagaagtcccctgccgtaaccttactatgt ttcgacgcagggggcgtcgcaaattgaggtgggtacc SINFRUP00000067 698 ERTSDSQEDVSERSMLKVS ERTSDSQEDVSERSMLKVS D:D[gat] ERTSDSQEDVSERSMLKVS Scaffold_2042.132151 GGTCTGCA Intron 13 CAGATgcatgtcgggtgatataga <1-----[32152 : 33223]-1> agccacaaatcagcttatg gactccggcggggagggcc SINFRUP00000067 718 PRAVTQEVSQAISDKKGRLIILKDAEHKEEDEEPGEAEGVEKELPPSKE PRAVTQEVSQAISDKKGRLIILKDAEHKEEDEEPGEAEGVEKELPPSKE PRAVTQEVSQAISDKKGRLIILKDAEHKEEDEEPGEAEGVEKELPPSKE Scaffold_2042.133283 caggacggtcgaagaagacaatagggcagggggcgggggggagtcctag cgctcaatcactgaaaggttttaacaaaaaaaacgacagtaaatcccaa cattagagggcctcggagttcgattataggcaacagtacgggagtaaag SINFRUP00000067 767 KEMVKEDAADVLASITREIKKQHPKVEIKTNGTQQINGSDSPKKAMVSW KEMVKEDAADVLASITREIKKQHPKVEIKTNGTQQINGSDSPKKAMVSW KEMVKEDAADVLASITREIKKQHPKVEIKTNGTQQINGSDSPKKAMVSW Scaffold_2042.133430 agagaggggggcgtaaagaaacccaggaaaagaccaagagtcaagagtt aattaaaccattcctcgataaaacatatacagcaataggaccaacttcg aagtaacctttgctaagacgggtgggaaatccgagtcttcgtagcgggg SINFRUP00000067 816 ISEEVKTVATKEVSEGESVEAPQQTAEIFTFEEEQSKSSPTQITVSESS ISEEVKTVATKEVSEGESVEAPQQTAEIFTFEEEQSKSSPTQITVSESS ISEEVKTVATKEVSEGESVEAPQQTAEIFTFEEEQSKSSPTQITVSESS Scaffold_2042.133577 atgggaaggaaggagggagggcccaggatatgggctatacacaagtgtt tcaatactccaatgagagtaccaaccattctaaaacacgccatctcacc ctggggtcgagaatgagctagggagggccctagggagccgcgcattata SINFRUP00000067 865 ASFAVV LDTTDAGTKEMPV ASFAVV LDTTDAGTKEMPV ASFAVV H:H[cat] LDTTDAGTKEMPV Scaffold_2042.133724 gttgggCAGTATAAA Intron 14 GAGTcgaagggaagacg cctctt <2-----[33744 : 35243]-2> taccacgcaatct tttgga tcctttacaggcg SINFRUP00000067 885 IHTETKTITYESAE VDTNGDLD IHTETKTITYESAE VDTNGDLD IHTETKTITYESAE VDTNGDLD Scaffold_2042.135284 acagaaaaatgtggGTACACA Intron 15 CAGggaaggtg tacacactcaacca<0-----[35326 : 36271]-0>tacagata ccagaatcccgtag tctttcgc SINFRUP00000067 907 PGVLLSAQTITSEITSTTTTTHITK PGVLLSAQTITSEITSTTTTTHITK PGVLLSAQTITSEITSTTTTTHITK Scaffold_2042.136296 cggccagcaaatgaaaaaaaacaaaGTCAGTT Intron 16 
cgtttgcactccatcgcccccatca<0-----[36371 : 36567] cttcgttgcctaatccccacaccag SINFRUP00000067 932 MVKGGISETRIEKRIVISGDADIDHDQ MVKGGISETRIEKRIVISGDADIDHDQ MVKGGISETRIEKRIVISGDADIDHDQ Scaffold_2042.136565 CAGagaggatgaaagaaagatggggagcgc -0>ttaggtcacgtaagtttcgacataaaa ggaagtaaaacgagcccaatacctctg SINFRUP00000067 959 ALAQAIKEAKEQHPDMSVTKVV ALAQAIKEAKEQHPDMSVTKVV ALAQAIKEAKEQHPDMSVTKVV Scaffold_2042.136649 GTCTCTT Intron 17 CAGgcgcgaaggagcccgatgaagg <0-----[36649 : 36954]-0>ctcactaacaaaacatctcatt tgtgcaagtaagttcgagcaag SINFRUP00000067 981 VHKETEITPEEGED VHKETEITPEEGED VHKETEITPEEGED Scaffold_2042.137021 gcagagaacggggg taaacatccaagaa atagagcgaagggc // Gene 1 Gene 22265 37062 Exon 22265 22396 phase 0 Supporting 22265 22396 1 44 Exon 24224 24328 phase 0 Supporting 24224 24328 45 79 Exon 24471 24513 phase 0 Supporting 24471 24512 80 93 Exon 24587 24662 phase 1 Supporting 24589 24660 95 118 Exon 24817 25035 phase 2 Supporting 24818 25033 120 191 Exon 25162 25249 phase 2 Supporting 25163 25249 193 221 Exon 26186 26338 phase 0 Supporting 26186 26338 222 272 Exon 26458 26579 phase 0 Supporting 26458 26577 273 312 Exon 26865 27040 phase 2 Supporting 26866 27039 314 371 Exon 27315 27488 phase 1 Supporting 27317 27487 373 429 Exon 28577 28952 phase 1 Supporting 28579 28950 431 554 Exon 30817 31073 phase 2 Supporting 30818 31072 556 640 Exon 31981 32151 phase 1 Supporting 31983 32150 642 697 Exon 33224 33743 phase 1 Supporting 33226 33741 699 870 Exon 35244 35325 phase 2 Supporting 35245 35325 872 898 Exon 36272 36370 phase 0 Supporting 36272 36370 899 931 Exon 36568 36648 phase 0 Supporting 36568 36648 932 958 Exon 36955 37062 phase 0 Supporting 36955 37062 959 994 // Glimmer2.out100644000766000024 5370614006427353 16636 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataGC Proportion = 33.3% Minimum gene length = 90 Minimum overlap length = 30 Minimum overlap percent = 10.0% Threshold score = 90 Use independent scores = True Ignore independent score on orfs longer than 481 Use strict 
independent model = True Use first start codon = True Orf Gene Lengths Gene -- Frame Scores - Indep ID# Fr Start Start End Orf Gene Score F1 F2 F3 R1 R2 R3 Score F3 654 681 779 126 99 0 99 _ 0 _ _ _ 0 0 -1.452 R1 984 957 772 213 186 0 99 _ _ 0 _ _ 0 0 -1.383 F2 917 917 1009 93 93 0 99 0 _ _ _ _ 0 0 -1.335 1 F1 292 292 1620 1329 1329 99 99 _ _ _ _ _ 0 396 -1.219 R2 1631 1607 1515 117 93 0 99 _ _ _ 0 _ 0 0 -1.331 F2 1592 1706 1837 246 132 0 _ 0 _ _ _ _ 99 0 -1.346 R2 1844 1829 1737 108 93 0 _ 0 _ _ 0 _ 99 0 -1.351 F3 1860 1881 2000 141 120 2 _ 7 2 _ _ 21 68 2 -1.372 F2 1841 1856 2095 255 240 0 _ 0 _ _ _ _ 99 7 -1.373 R3 2152 2017 1865 288 153 0 _ 8 _ _ _ 0 90 21 -1.397 F3 2004 2028 2174 171 147 0 _ _ 0 _ _ _ 99 0 -1.385 2 R1 2385 2349 2233 153 117 99 _ _ _ 99 _ _ 0 99 -1.208 R1 3039 3033 2944 96 90 14 _ _ _ 14 _ _ 85 14 -1.262 R3 3253 3235 3098 156 138 0 _ _ 0 _ _ 0 99 0 -1.356 F3 3087 3105 3254 168 150 0 _ _ 0 _ _ 0 99 0 -1.336 R3 3463 3439 3323 141 117 1 _ _ 98 _ _ 1 0 1 -1.235 F1 3550 3589 3699 150 111 0 0 _ 99 _ _ _ 0 0 -1.281 R3 3727 3712 3611 117 102 0 _ _ 99 _ _ 0 0 0 -1.247 R1 4038 4026 3916 123 111 0 _ _ 99 0 _ _ 0 0 -1.376 R2 4097 4091 3999 99 93 0 _ _ 99 _ 0 _ 0 0 -1.296 F1 4009 4009 4110 102 102 0 0 _ 99 _ _ _ 0 0 -1.341 F1 4171 4189 4311 141 123 0 0 _ 99 _ 0 0 0 0 -1.416 3 F3 3258 3279 4397 1140 1119 99 _ _ 99 _ _ _ 0 791 -1.201 R3 4405 4300 4184 222 117 0 0 _ 99 _ 0 0 0 0 -1.360 R2 4481 4436 4323 159 114 0 _ _ _ _ 0 _ 99 0 -1.359 F2 4454 4460 4549 96 90 0 _ 0 _ _ _ _ 99 0 -1.348 R1 5025 4962 4801 225 162 0 _ 0 _ 0 _ _ 99 0 -1.394 F2 4790 4811 5026 237 216 0 _ 0 _ 0 _ _ 99 0 -1.350 4 F1 5038 5062 6156 1119 1095 99 99 _ _ _ _ _ 0 99 -1.184 R2 6386 6383 6216 171 168 0 0 _ 99 _ 0 _ 0 0 -1.345 F1 6217 6292 6432 216 141 0 0 _ 99 _ _ _ 0 0 -1.391 R3 6487 6469 6338 150 132 1 _ _ 98 _ _ 1 0 1 -1.268 5 F3 6105 6156 6617 513 462 99 _ _ 99 _ _ _ 0 395 -1.205 R2 6722 6704 6600 123 105 0 0 _ _ _ 0 _ 99 0 -1.380 F1 6601 6628 6723 123 96 0 0 _ _ _ 0 0 99 0 -1.504 R3 
6880 6856 6743 138 114 0 _ 99 0 _ _ 0 0 0 -1.541 R3 7033 6988 6884 150 105 0 _ 99 _ _ _ 0 0 0 -1.423 F3 7062 7062 7187 126 126 0 _ 99 0 _ 0 0 0 0 -1.379 R2 7196 7067 6975 222 93 0 _ 99 _ _ 0 _ 0 0 -1.504 R3 7267 7237 7037 231 201 0 _ 99 _ _ _ 0 0 0 -1.426 F3 7299 7299 7415 117 117 0 _ 99 0 _ _ _ 0 0 -1.388 R2 7502 7481 7389 114 93 0 _ 99 _ _ 0 0 0 0 -1.329 F3 7419 7425 7514 96 90 0 _ 98 0 1 _ 0 0 0 -1.341 R3 7525 7468 7337 189 132 0 _ 99 _ _ _ 0 0 0 -1.333 R1 7683 7524 7423 261 102 0 _ 96 _ 0 _ 0 2 1 -1.288 F3 7896 7899 8036 141 138 0 _ 99 0 _ 0 0 0 0 -1.479 R3 8050 8002 7895 156 108 0 _ 99 0 _ 0 0 0 0 -1.540 R2 8147 8069 7875 273 195 0 _ 99 _ _ 0 _ 0 0 -1.384 R3 8305 8293 8123 183 171 0 _ 99 _ _ _ 0 0 0 -1.399 6 F2 6674 6686 8602 1929 1917 99 _ 99 _ _ _ _ 0 1481 -1.218 R3 8986 8935 8819 168 117 0 _ 99 _ _ _ 0 0 0 -1.463 F3 9003 9003 9197 195 195 0 _ 99 0 _ _ _ 0 0 -1.421 R1 9246 9240 9142 105 99 0 _ 99 _ 0 _ _ 0 0 -1.309 R3 10417 10360 10199 219 162 0 _ 99 _ _ _ 0 0 0 -1.428 R3 10576 10576 10421 156 156 0 _ 99 _ _ _ 0 0 0 -1.380 R1 10851 10809 10576 276 234 0 _ 99 _ 0 _ _ 0 0 -1.340 R1 11097 11088 10999 99 90 0 _ 99 _ 0 _ _ 0 0 -1.362 7 F2 8606 8624 11125 2520 2502 99 _ 99 _ _ _ _ 0 792 -1.206 R2 11399 11345 11205 195 141 0 _ _ 99 _ 0 _ 0 0 -1.459 R3 11584 11536 11396 189 141 0 _ _ 99 _ _ 0 0 0 -1.375 R1 11742 11724 11635 108 90 0 _ 0 99 0 0 _ 0 0 -1.396 8 F3 11118 11133 12119 1002 987 99 _ _ 99 _ _ _ 0 396 -1.229 F1 11974 11974 12129 156 156 0 0 _ _ _ _ 81 18 0 -1.379 9 R3 12142 12130 11921 222 210 94 _ _ _ _ _ 94 5 175 -1.287 R2 12149 12146 12012 138 135 0 _ _ _ _ 0 _ 99 0 -1.381 R3 12469 12355 12167 303 189 0 _ _ 0 _ _ 0 99 0 -1.466 F3 12123 12129 12470 348 342 6 _ _ 6 _ _ _ 93 6 -1.391 R2 12494 12494 12309 186 186 0 _ _ _ _ 0 _ 99 0 -1.454 F2 12422 12473 12565 144 93 6 _ 6 _ _ _ _ 93 6 -1.318 R1 12651 12636 12541 111 96 0 _ _ 1 0 _ _ 98 0 -1.441 F3 12477 12489 12662 186 174 0 _ _ 0 _ _ _ 99 1 -1.389 R3 13084 13069 12977 108 93 0 99 _ _ _ 0 0 0 0 -1.347 R2 
13169 13166 12972 198 195 0 99 _ _ _ 0 _ 0 0 -1.282 F3 13176 13218 13313 138 96 0 99 _ 0 _ _ _ 0 0 -1.296 10 F1 12670 12715 13449 780 735 99 99 _ _ _ _ _ 0 396 -1.211 R1 13470 13449 13294 177 156 0 99 _ _ 0 _ _ 0 0 -1.444 R3 13627 13624 13535 93 90 0 _ _ _ _ _ 0 99 0 -1.409 R2 13691 13688 13593 99 96 0 _ _ _ _ 0 _ 99 0 -1.364 11 F3 13656 13671 13838 183 168 99 _ _ 99 _ _ _ 0 99 -1.293 R2 13889 13871 13740 150 132 4 _ _ _ _ 4 _ 95 4 -1.327 F1 13762 13774 13917 156 144 0 0 _ _ _ _ _ 99 0 -1.376 F2 13928 13949 14074 147 126 1 0 1 _ 98 _ _ 0 1 -1.301 F1 13921 13963 14088 168 126 0 0 _ _ 99 _ _ 0 0 -1.425 R2 14222 14192 14091 132 102 0 _ _ _ 84 0 _ 15 0 -1.473 F1 14128 14128 14223 96 96 0 0 _ 0 98 0 _ 1 0 -1.481 F3 14103 14133 14330 228 198 0 _ _ 0 99 _ _ 0 0 -1.405 F2 14216 14270 14362 147 93 38 0 38 _ 50 _ _ 11 38 -1.336 F1 14248 14296 14397 150 102 0 0 _ _ 96 _ _ 3 0 -1.454 R2 14603 14594 14439 165 156 0 _ _ _ 99 0 _ 0 0 -1.305 12 R1 14847 14805 13807 1041 999 99 _ _ _ 99 _ _ 0 822 -1.279 F2 15419 15440 15544 126 105 0 _ 0 0 _ 94 _ 5 0 -1.496 R3 15682 15682 15530 153 153 0 _ _ 0 _ 99 0 0 0 -1.381 F2 15584 15599 15688 105 90 0 0 0 0 _ 99 _ 0 0 -1.518 F3 15210 15219 15746 537 528 0 _ _ 0 _ 99 _ 0 0 -1.512 13 R2 15791 15776 15219 573 558 99 _ _ _ _ 99 _ 0 490 -1.295 F1 16213 16219 16326 114 108 0 0 _ 99 _ _ _ 0 0 -1.297 F2 16253 16268 16369 117 102 0 _ 0 99 _ _ _ 0 0 -1.351 R3 17203 17101 17006 198 96 0 _ _ 98 _ _ 0 0 0 -1.297 F2 17198 17207 17335 138 129 0 _ 0 99 0 _ _ 0 0 -1.393 R1 17370 17349 17158 213 192 0 _ _ 99 0 _ _ 0 0 -1.339 14 F3 15849 15855 17459 1611 1605 99 _ _ 99 _ _ _ 0 593 -1.232 R3 17677 17623 17483 195 141 0 _ _ 99 _ _ 0 0 0 -1.339 R2 18242 18137 18045 198 93 0 _ _ 99 _ 0 _ 0 0 -1.320 15 F3 17463 17484 18374 912 891 99 _ _ 99 _ _ _ 0 297 -1.206 R2 18620 18554 18438 183 117 0 0 _ _ 99 0 _ 0 0 -1.344 F1 18439 18535 18633 195 99 0 0 _ _ 99 _ _ 0 0 -1.318 F2 18677 18719 18811 135 93 4 _ 4 _ 95 _ 0 0 4 -1.188 F1 18808 18835 18927 120 93 0 0 _ _ 99 _ _ 0 0 
-1.497 R2 19040 18995 18885 156 111 0 _ _ _ 99 0 _ 0 0 -1.367 F3 18978 18981 19124 147 144 0 _ _ 0 99 _ _ 0 0 -1.309 R2 19235 19232 19044 192 189 0 _ _ _ 99 0 _ 0 0 -1.301 R3 19357 19357 19211 147 147 0 _ _ 0 99 _ 0 0 0 -1.360 F3 19188 19269 19391 204 123 0 _ _ 0 99 _ _ 0 0 -1.269 16 R1 19449 19407 18322 1128 1086 99 _ _ _ 99 _ _ 0 986 -1.187 F3 19443 19473 19571 129 99 0 _ _ 0 _ _ _ 99 0 -1.312 R2 19631 19628 19527 105 102 23 _ _ _ _ 23 _ 76 23 -1.268 F2 19610 19616 19720 111 105 42 _ 42 _ _ _ _ 57 42 -1.247 R3 19873 19843 19718 156 126 0 _ _ 0 _ _ 0 99 0 -1.478 F3 19719 19740 19895 177 156 0 _ _ 0 _ _ _ 99 0 -1.482 F1 20041 20068 20175 135 108 0 0 _ 70 _ _ 28 1 0 -1.444 R3 20185 20110 19976 210 135 0 _ _ 99 _ _ 0 0 28 -1.372 17 F3 19977 19980 20192 216 213 99 _ _ 99 _ _ _ 0 268 -1.320 F1 20179 20266 20364 186 99 0 0 0 _ _ _ 99 0 0 -1.365 R1 20445 20418 20305 141 114 0 _ 0 _ 0 _ 99 0 0 -1.318 F2 20261 20288 20446 186 159 0 _ 0 _ _ _ 99 0 0 -1.376 F1 20404 20413 20601 198 189 0 0 _ _ _ _ 99 0 0 -1.361 F3 20469 20583 20756 288 174 0 _ _ 0 _ _ 99 0 0 -1.274 18 R3 20824 20749 20189 636 561 99 _ _ _ _ _ 99 0 594 -1.215 F1 20752 20773 20910 159 138 4 4 2 _ _ _ _ 92 4 -1.340 R1 21078 21039 20842 237 198 0 _ 99 _ 0 _ _ 0 0 -1.243 F3 21063 21069 21212 150 144 0 _ 99 0 _ _ _ 0 0 -1.350 R1 21597 21585 21496 102 90 0 _ 99 _ 0 0 _ 0 0 -1.313 F1 21508 21520 21624 117 105 0 0 99 _ _ 0 _ 0 0 -1.361 19 F2 20765 20801 21703 939 903 99 _ 99 _ _ _ _ 0 497 -1.192 R3 22108 22078 21914 195 165 0 _ 99 _ _ _ 0 0 0 -1.325 F3 21918 21945 22109 192 165 0 _ 99 0 _ _ 0 0 0 -1.374 F3 22113 22116 22238 126 123 0 _ 99 0 0 0 _ 0 0 -1.273 R2 22277 22232 22020 258 213 0 _ 99 _ _ 0 _ 0 0 -1.294 R1 22320 22194 22081 240 114 0 _ 99 _ 0 0 _ 0 0 -1.237 20 F2 21845 21851 22351 507 501 99 _ 99 _ _ _ _ 0 594 -1.193 R3 22465 22462 22241 225 222 0 _ _ _ _ _ 0 99 0 -1.358 R2 22490 22487 22398 93 90 0 _ _ 99 _ 0 _ 0 0 -1.415 F1 22402 22420 22554 153 135 0 0 _ 99 _ _ _ 0 0 -1.354 R2 22598 22592 22494 105 99 0 _ 
_ 99 _ 0 _ 0 0 -1.425 R3 22720 22681 22547 174 135 0 _ _ 99 _ _ 0 0 0 -1.269 F1 22657 22663 22776 120 114 0 0 _ 99 _ 0 _ 0 0 -1.353 21 F3 22332 22359 23099 768 741 99 _ _ 99 _ _ _ 0 594 -1.221 R2 23501 23471 23271 231 201 0 _ _ 99 _ 0 _ 0 0 -1.351 22 F3 23112 23118 23723 612 606 99 _ _ 99 _ _ _ 0 198 -1.186 R2 24008 24008 23910 99 99 0 _ _ _ _ 0 _ 99 0 -1.382 F1 23920 23920 24033 114 114 0 0 _ _ _ _ _ 99 0 -1.364 F1 24220 24241 24330 111 90 0 0 _ 16 _ _ _ 83 0 -1.424 F2 24320 24365 24454 135 90 0 _ 0 19 _ _ _ 79 0 -1.403 R1 24507 24507 24412 96 96 0 _ _ 99 0 _ _ 0 0 -1.387 F2 24467 24536 24637 171 102 0 0 0 99 _ _ _ 0 0 -1.483 R3 24673 24670 24548 126 123 5 0 _ 94 _ _ 5 0 5 -1.270 F1 24496 24562 24723 228 162 0 0 _ 99 _ _ _ 0 0 -1.385 R1 25095 25089 24967 129 123 0 _ _ 99 0 _ _ 0 0 -1.450 R2 25343 25304 25185 159 120 0 _ _ 99 _ 0 _ 0 0 -1.591 F1 25225 25267 25362 138 96 0 0 0 99 _ _ _ 0 0 -1.484 F2 25208 25331 25444 237 114 0 _ 0 99 _ _ _ 0 0 -1.501 F1 25774 25774 25863 90 90 0 0 0 99 0 _ _ 0 0 -1.385 23 F3 24150 24216 25895 1746 1680 99 _ _ 99 _ _ _ 0 1020 -1.261 R3 26158 26125 25970 189 156 0 _ _ _ _ _ 0 99 0 -1.368 F3 26055 26064 26216 162 153 83 _ _ 83 _ _ _ 16 83 -1.262 R2 26309 26285 26133 177 153 1 _ _ _ _ 1 _ 98 1 -1.294 R1 26370 26328 26212 159 117 0 _ _ _ 0 _ _ 99 0 -1.327 F3 26463 26592 26684 222 93 19 _ _ 19 14 _ 7 59 19 -1.301 R3 26698 26647 26477 222 171 84 _ _ 0 _ _ 84 14 91 -1.290 R1 26703 26697 26557 147 141 0 _ _ _ 0 _ 92 7 14 -1.342 F2 26672 26684 26890 219 207 0 _ 0 _ _ _ _ 99 0 -1.325 F2 26945 26954 27067 123 114 2 _ 2 _ 97 _ _ 0 2 -1.253 24 R1 27117 27090 26731 387 360 99 _ _ _ 99 _ _ 0 196 -1.219 R1 27276 27231 27142 135 90 0 _ _ _ 0 _ _ 99 0 -1.239 R1 27405 27375 27280 126 96 4 _ _ _ 4 _ _ 95 4 -1.307 R2 27743 27743 27654 90 90 0 _ _ _ _ 0 _ 99 0 -1.310 F2 27833 27836 27934 102 99 0 _ 0 _ _ _ _ 99 0 -1.427 F1 27904 27952 28092 189 141 7 7 _ _ _ _ _ 92 7 -1.317 25 F2 28001 28025 28141 141 117 95 _ 95 _ _ _ _ 4 95 -1.284 R1 28272 28203 28081 
192 123 70 _ _ _ 70 _ _ 29 70 -1.319 F1 28240 28240 28362 123 123 0 0 _ 99 _ _ 0 0 0 -1.434 R3 28390 28375 28154 237 222 0 _ _ 99 _ _ 0 0 0 -1.331 R2 28544 28544 28383 162 162 0 _ _ 99 _ 0 _ 0 0 -1.403 F1 28387 28417 28548 162 132 0 0 _ 99 _ _ _ 0 0 -1.393 F1 28591 28663 28791 201 129 0 0 _ 99 _ 0 _ 0 0 -1.460 R2 28988 28940 28812 177 129 0 _ _ 99 _ 0 0 0 0 -1.442 R3 29005 28885 28766 240 120 0 _ _ 99 _ _ 0 0 0 -1.343 R1 29019 28977 28885 135 93 0 _ _ 99 0 0 0 0 0 -1.354 26 F3 28155 28185 29195 1041 1011 99 _ _ 99 _ _ _ 0 891 -1.249 F1 28987 28993 29244 258 252 0 0 _ _ _ _ _ 99 0 -1.434 R2 29579 29561 29430 150 132 0 99 _ _ _ 0 _ 0 0 -1.207 R1 29622 29616 29521 102 96 0 98 _ _ 0 _ _ 1 0 -1.321 R3 29866 29836 29702 165 135 0 99 _ 0 _ 0 0 0 0 -1.290 27 F1 29263 29263 29946 684 684 99 99 _ _ _ _ _ 0 395 -1.187 End = 29940 Original Genes = 27 Potential Genes = 27 Avg Olaps = 0.1 Potential Changes = 2 Potential Rejects = 2 Sure Rejects = 2 Original Genes = 27 Potential Genes = 25 Avg Olaps = 0.0 Potential Changes = 0 Potential Rejects = 0 Sure Rejects = 0 Original Genes = 27 Potential Genes = 25 Avg Olaps = 0.0 Potential Changes = 0 Potential Rejects = 0 Sure Rejects = 0 Putative Genes: 1 292 1620 [+1 L=1329 r=-1.219] 2 2349 2233 [-1 L= 117 r=-1.208] 3 3279 4397 [+3 L=1119 r=-1.201] 4 5062 6156 [+1 L=1095 r=-1.184] 5 6156 6617 [+3 L= 462 r=-1.205] 6 6686 8602 [+2 L=1917 r=-1.218] 7 8624 11125 [+2 L=2502 r=-1.206] 8 11133 12119 [+3 L= 987 r=-1.229] 10 12715 13449 [+1 L= 735 r=-1.211] 12 14805 13807 [-1 L= 999 r=-1.279] 13 15776 15219 [-3 L= 558 r=-1.295] 14 15855 17459 [+3 L=1605 r=-1.232] 15 17484 18374 [+3 L= 891 r=-1.206] 16 19407 18322 [-1 L=1086 r=-1.187] 17 19980 20192 [+3 L= 213 r=-1.320] 18 20749 20189 [-2 L= 561 r=-1.215] 19 20801 21703 [+2 L= 903 r=-1.192] 20 21851 22351 [+2 L= 501 r=-1.193] 21 22359 23099 [+3 L= 741 r=-1.221] 22 23118 23723 [+3 L= 606 r=-1.186] 23 24216 25895 [+3 L=1680 r=-1.261] 24 27090 26731 [-1 L= 360 r=-1.219] 25 28025 28141 [+2 L= 117 
r=-1.284] 26 28185 29195 [+3 L=1011 r=-1.249] 27 29263 6 [+1 L= 684 r=-1.187] AF032047.gbk100644000766000024 475614006427353 16043 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS ALSEB2MP1 291 bp DNA linear PRI 30-JUN-1998 DEFINITION Alouatta seniculus beta-2-microglobulin precursor, gene, exon 1. ACCESSION AF032047 VERSION AF032047.1 GI:3264957 KEYWORDS . SEGMENT 1 of 2 SOURCE Alouatta seniculus (howler monkey) ORGANISM Alouatta seniculus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Primates; Platyrrhini; Cebidae; Alouattinae; Alouatta. REFERENCE 1 (bases 1 to 291) AUTHORS Canavez,F.C., Ladasky,J.J., Muniz,J.A., Seuanez,H.N., Parham,P. and Cavanez,C. TITLE beta2-Microglobulin in neotropical primates (Platyrrhini) JOURNAL Immunogenetics 48 (2), 133-140 (1998) MEDLINE 98298008 PUBMED 9634477 REFERENCE 2 (bases 1 to 291) AUTHORS Canavez,F.C., Ladasky,J.J., Seuanez,H.N. and Parham,P. TITLE Direct Submission JOURNAL Submitted (31-OCT-1997) Structural Biology, Stanford University, Fairchild Building, Campus West Dr., Room #D-100, Stanford, CA 94305-5126, USA FEATURES Location/Qualifiers source 1..291 /organism="Alouatta seniculus" /mol_type="genomic DNA" /db_xref="taxon:9503" mRNA join(<136..202,AF032048.1:67..345,AF032048.1:1162..1189) /product="beta-2-microglobulin precursor" sig_peptide 136..195 CDS join(136..202,AF032048.1:67..345,AF032048.1:1162..1175) /codon_start=1 /product="beta-2-microglobulin precursor" /protein_id="AAC52091.1" /db_xref="GI:3264961" /translation="MARFVVVALLALLSLSGLEAIQHAPKIQVYSRHPAENGKPNFLN CYVSGFHPSDIEVDLLKNGKKIEKVEHSDLSFSKDWSFYLLYYTEFTPNEKDEYACRV SHVTFPTPKTVKWDRTM" sig_peptide 136..195 exon <136..202 /number=1 intron 203..>291 /number=1 BASE COUNT 31 a 103 c 78 g 79 t ORIGIN 1 cggtccccgc gggccttctc ctgattggct gtccccgcgg gccttgtcct tattggctgt 61 gcccgactcc gtataacata aatagaggcg tcgagttgcg cgggcattac tgcagcggac 121 cgcacttggg tcgagatggc tcgcttcgtg gtggtagccc tgctcgcgct actctctctg 181 
tctggcctgg aggctatcca gcgtaagtct ctcctcccgt ccggcgctgg tcctttccct 241 cccgccccaa ctctctgtag ccctctctgt gctctctcgt tccgttacct g // blast.report100644000766000024 5330314006427353 16762 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataTBLASTN 2.0.4 [Feb-24-1998] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|1401126 (504 letters) Database: Non-redundant GenBank+EMBL+DDBJ+PDB sequences 336,723 sequences; 677,679,054 total letters Searchingdone Score E Sequences producing significant alignments: (bits) Value gb|U49928|HSU49928 Homo sapiens TAK1 binding protein (TAB1) mRNA... 1009 0.0 emb|Z36985|PTPP2CMR P.tetraurelia mRNA for protein phosphatase t... 58 4e-07 emb|X77116|ATMRABI1 A.thaliana mRNA for ABI1 protein 53 1e-05 gb|U12856|ATU12856 Arabidopsis thaliana Col-0 abscisic acid inse... 53 1e-05 dbj|D38109|ATHPP2CA Arabidopsis thaliana mRNA for protein phosph... 52 3e-05 emb|Y08965|ATABI2RNA A.thaliana mRNA for ABI2 protein 47 8e-04 emb|AL010222|PFSC04009 Plasmodium falciparum DNA *** SEQUENCING ... 45 0.004 gb|AC002409|ATAC002409 Arabidopsis thaliana chromosome II BAC T2... 43 0.012 emb|Z98762|SPAC4A8 S.pombe chromosome I cosmid c4A8 42 0.020 gb|AF006827|AF006827 Magnaporthe grisea adenylate cyclase (MAC1)... 41 0.045 emb|Y13936|HSY13936 Homo sapiens mRNA for protein phosphatase 2C... 41 0.059 gb|U81159|BTU81159 Bos taurus magnesium-dependent calcium inhibi... 40 0.077 emb|Y10438|SPFKBAD Streptomyces sp. MA6548 fkbA gene and partial... 40 0.077 gb|AF012921|AF012921 Magnaporthe grisea adenylate cyclase (mac1)... 40 0.10 gb|U42383|MMU42383 Mus musculus fibroblast growth factor inducib... 40 0.10 gb|AF023665|AF023665 Plasmodium falciparum protein phosphatase 2... 39 0.13 gb|M24942|YSPCYR1A Yeast (S.pombe) adenylate cyclase (CYR1) gene... 
39 0.17 gb|M26699|YSPADC Yeast (S.pombe) cyr1 gene encoding adenylyl cyc... 39 0.17 gb|L43413|PANADCY Podospora anserina adenyl cyclase gene, exons 1-4 38 0.30 gb|M89651|HUMMMDBC Human DNA from cosmid DNA MMDB (f10080) and M... 38 0.30 emb|Z99161|SPAC11G7 S.pombe chromosome I cosmid c11G7 38 0.30 emb|X56042|SKADECYC Saccharomyces kluyveri gene for adenylyl cyc... 37 0.88 gb|M29235|CELPOLII C.elegans RNA polymerase II largest subunit (... 36 1.1 dbj|D45132|HUMHOXY1 Human kidney mRNA for zinc-finger DNA-bindin... 36 1.1 gb|U70654|MMU70654 Mus musculus musculus sex determining protein... 36 1.1 gb|U53333|CELF36A4 Caenorhabditis elegans cosmid F36A4. 36 1.1 gb|L29028|CREWP6A Chlamydomonas eugametos WP6 mRNA, complete cds 36 1.1 gb|U70653|MMU70653 Mus musculus musculus sex determining protein... 36 1.1 gb|U17838|HSU17838 Human zinc finger protein RIZ mRNA, complete cds 36 1.1 emb|X56432|DMRUNTR D.melanogaster mRNA for runt segmentation gene 36 1.5 emb|X67204|MMSRYLOC M.musculus DNA sequence of Sry locus 36 1.5 gb|U00051|CELF42G9 Caenorhabditis elegans cosmid F42G9. 36 1.5 gb|U70655|MMU70655 Mus musculus sex determining protein (Sry) ge... 36 1.5 gb|AF019985|AF019985 Dictyostelium discoideum Spalten (spnA) mRN... 36 2.0 emb|X78886|ATABI1G A.thaliana (Landsberg erecta) ABI1 gene 36 2.0 gb|U70651|MMU70651 Mus musculus musculus sex determining protein... 36 2.0 gb|U70652|MMU70652 Mus musculus musculus sex determining protein... 36 2.0 emb|X58924|HVRPL11E H.volcanii genes for ribosomal proteins L11e... 36 2.0 emb|X05806|CHAMPER Acetabularia plastid DNA homologous to Drosop... 36 2.0 gb|U82833|OSU82833 Oryza sativa S-adenosyl-L-methionine syntheta... 35 2.6 emb|Z27084|HLHOLLI H.lanatus mRNA for allergen Hol-lI 35 2.6 dbj|D00676|SH1RSP40 Pseudorabies virus genome, RSp40 and pk genes 35 2.6 emb|Y10421|CCATCOLE1 C.curvatus strain ATCC 20509 Ole1 gene 35 2.6 gb|L14320|HSBBICP4A Bovine herpesvirus type 1 early-intermediate... 
35 2.6 emb|AJ004801|BHV1CGEN Bovine herpesvirus 1 complete genome 35 2.6 dbj|AB010074|AB010074 Arabidopsis thaliana genomic DNA, chromoso... 35 3.4 gb|AC004490|AC004490 Homo sapiens chromosome 19, cosmid R29381, ... 35 3.4 dbj|AB011474|AB011474 Arabidopsis thaliana genomic DNA, chromoso... 35 3.4 gb|U03645|MMU03645 Mus musculus domesticus Torino (Sry) gene, co... 35 3.4 gb|U70641|MMU70641 Mus musculus domesticus sex determining prote... 35 3.4 gb|U70650|MMU70650 Mus musculus domesticus sex determining prote... 35 3.4 gb|U70642|MMU70642 Mus musculus domesticus sex determining prote... 35 3.4 gb|U70647|MMU70647 Mus musculus domesticus sex determining prote... 35 3.4 emb|Y11840|ATABI2 Arabidopsis thaliana ABI2 gene 35 3.4 emb|Y08966|ATABI2DNA A.thaliana gene encoding ABI2 protein 35 3.4 emb|X16144|SVGRA S. violaceoruber DNA for granaticin polyketide ... 35 3.4 emb|X16300|SVPKS Streptomyces violaceoruber polyketide synthase ... 35 3.4 gb|U70657|MMU70657 Mus musculus castaneus sex determining protei... 35 3.4 gb|M57417|HUMMUCCF Human (cystic fibrosis patient) mucin mRNA, p... 35 3.4 gb|L04286|DDIGP100 Dictyostelium discoideum glycoprotein gp100 (... 35 3.4 emb|Z46938|BTTESDNA B.taurus DNA (protamine gene cluster) 35 3.4 gb|L13054|DROZESTEL Drosophila melanogaster zeste (z) gene, part... 34 4.4 gb|L13058|DROZESTEP Drosophila melanogaster zeste (z) gene, part... 34 4.4 gb|M60590|YSCAAGLCS S.cerevisiae a-agglutinin core subunit (AGA1... 34 4.4 gb|AF009521|AF009521 Mus spretus sex determining protein (Sry) g... 34 4.4 emb|Z71659|SCYNR044W S.cerevisiae chromosome XIV reading frame O... 34 4.4 gb|U19361|PMU19361 Petromyzon marinus neurofilament subunit NF-1... 34 4.4 gb|U70646|MMU70646 Mus musculus domesticus sex determining prote... 34 4.4 gb|U23477|DDU23477 Dictyostelium discoideum phosphatidylinositol... 34 4.4 emb|X55695|LEEXTEN15 Tomato mRNA for a glycine-rich protein (clo... 
34 4.4 emb|Z95556|MTCY07A7 Mycobacterium tuberculosis cosmid SCY07A7 34 4.4 dbj|AB007645|AB007645 Arabidopsis thaliana genomic DNA, chromoso... 34 5.8 emb|X64346|HSGEND Herpesvirus saimiri complete genome DNA 34 5.8 emb|Z95620|SPBC3D6 S.pombe chromosome II cosmid c3D6 34 5.8 gb|U70644|MMU70644 Mus musculus domesticus sex determining prote... 34 5.8 gb|U70649|MMU70649 Mus musculus domesticus sex determining prote... 34 5.8 gb|U43491|SCU43491 Saccharomyces cerevisiae cosmid clone pEOA156... 34 5.8 dbj|D00909|NEUNAC N.crassa nac gene coding for adenylate cyclase... 34 5.8 gb|U70645|MMU70645 Mus musculus domesticus sex determining prote... 34 5.8 gb|AF009519|AF009519 Mus musculus domesticus sex determining pro... 34 5.8 gb|M77174|MUSPERPA Mouse perlecan mRNA, complete cds. 34 5.8 emb|Z74917|SCYOR009W S.cerevisiae chromosome XV reading frame OR... 34 5.8 gb|J04054|MUSPCGBM Mouse basement membrane proteoglycan mRNA, pa... 34 5.8 emb|Z47072|CEF26C11 Caenorhabditis elegans cosmid F26C11, comple... 34 5.8 gb|U70643|MMU70643 Mus musculus sex determining protein (Sry) ge... 34 5.8 gb|AC003671|AC003671 Arabidopsis thaliana chromosome 1 BAC F17O7... 34 5.8 emb|X16481|RN11ZNBP Rat mRNA for zinc(2+) binding protein 34 5.8 gb|U46156|SSU46156 Synechococcus sp. CcmK (ccmK) gene, complete ... 34 5.8 gb|AC000098|YUP8H12 Arabidopsis thaliana chromosome 1 YAC yUP8H1... 34 5.8 emb|Z71781|SCCIVL37K S.cerevisiae chromosome IV left arm (EU) DN... 34 5.8 emb|Y13332|SSTO1AMY Streptomyces sp. TO1 amy gene 34 5.8 gb|M30473|NEULEURSC N.crassa cytoplasmic leucyl-tRNA synthetase ... 34 5.8 emb|Z74085|SCYDL037C S.cerevisiae chromosome IV reading frame OR... 34 5.8 gb|AF029858|AF029858 Sorghum bicolor cytochrome P450 CYP71E1 (CY... 34 5.8 gb|S47414|S47414 glycine-rich protein {clone atGRP-5} [Arabidops... 34 7.6 emb|X60294|SCSEC1A S.cereale Sec1 gene for omega secalin 34 7.6 emb|Z37975|BTPLAKOPH B.taurus mRNA for plakophilin. 
34 7.6 emb|X60295|SCSEC1B S.cereale Sec1 gene for omega secalin 34 7.6 dbj|D87895|D87895 Aspergillus nidulans DNA for chitinase, comple... 34 7.6 gb|AF017789|AF017789 Homo sapiens putative transcription factor ... 34 7.6 >gb|U49928|HSU49928 Homo sapiens TAK1 binding protein (TAB1) mRNA, complete cds. Length = 3096 Score = 1009 bits (2580), Expect = 0.0 Identities = 504/504 (100%), Positives = 504/504 (100%) Query: 1 MAAQRRSLLQSEQQPSWTDDLPLCHLSGVGSASNRSYSADGKGTESHPPEDSWLKFRSEN 60 MAAQRRSLLQSEQQPSWTDDLPLCHLSGVGSASNRSYSADGKGTESHPPEDSWLKFRSEN Sbjct: 21 MAAQRRSLLQSEQQPSWTDDLPLCHLSGVGSASNRSYSADGKGTESHPPEDSWLKFRSEN 200 Query: 61 NCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLES 120 NCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLES Sbjct: 201 NCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLES 380 Query: 121 IDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLLNNKLYV 180 IDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLLNNKLYV Sbjct: 381 IDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLLNNKLYV 560 Query: 181 ANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQVGIICGQEST 240 ANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQVGIICGQEST Sbjct: 561 ANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQVGIICGQEST 740 Query: 241 RRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLMSEGLYKALEAAH 300 RRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLMSEGLYKALEAAH Sbjct: 741 RRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLMSEGLYKALEAAH 920 Query: 301 GPGQANQEIAAMIDTEFAKQTSLDAVAQAVVDRVKRIHSDTFASGGERARFCPRHEDMTL 360 GPGQANQEIAAMIDTEFAKQTSLDAVAQAVVDRVKRIHSDTFASGGERARFCPRHEDMTL Sbjct: 921 GPGQANQEIAAMIDTEFAKQTSLDAVAQAVVDRVKRIHSDTFASGGERARFCPRHEDMTL 1100 Query: 361 LVRNFGYPLGEMSQPTPSPAPAAGGRVYPVSVPYSSAQSTSKTSVTLSLVMPSQGQMVNG 420 LVRNFGYPLGEMSQPTPSPAPAAGGRVYPVSVPYSSAQSTSKTSVTLSLVMPSQGQMVNG Sbjct: 1101LVRNFGYPLGEMSQPTPSPAPAAGGRVYPVSVPYSSAQSTSKTSVTLSLVMPSQGQMVNG 1280 Query: 421 AHSASTLDEATPTLTNQSPTLTLQSTNTHTQSSSSSSDGGLFRSRPAHSLPPGEDGRVEP 480 
AHSASTLDEATPTLTNQSPTLTLQSTNTHTQSSSSSSDGGLFRSRPAHSLPPGEDGRVEP Sbjct: 1281AHSASTLDEATPTLTNQSPTLTLQSTNTHTQSSSSSSDGGLFRSRPAHSLPPGEDGRVEP 1460 Query: 481 YVDFAEFYRLWSVDHGEQSVVTAP 504 YVDFAEFYRLWSVDHGEQSVVTAP Sbjct: 1461YVDFAEFYRLWSVDHGEQSVVTAP 1532 >emb|Z36985|PTPP2CMR P.tetraurelia mRNA for protein phosphatase type 2C Length = 969 Score = 57.8 bits (137), Expect = 4e-07 Identities = 64/261 (24%), Positives = 112/261 (42%), Gaps = 2/261 (0%) Query: 64 LYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLESIDD 123 ++GVF+G+ G V FV + ELL + + E+ F E++ + Sbjct: 182 VFGVFDGHGGREVA*FVEKHFVDELLKNK------------------NFKEQKFEEALKE 307 Query: 124 ALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREIS-GGAMAVVAVLLNNKLYVAN 182 + L L P+ QK L K + + S G A VA++ N LYVAN Sbjct: 308 TFLKMDELL-----------LTPEGQKELN*YKATDTDESYAGCTANVALIYKNTLYVAN 454 Query: 183 VGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQVGIICGQESTRR 242 G +R++LC++ + ++VDH +N +E R+ + G G + + +R Sbjct: 455 AGDSRSVLCRNNTN----HDMSVDHKPDNPEEKSRIERAG---GFVSDGRVNGNLNLSRA 613 Query: 243 IGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLMSEGLYKALEAAHGP 302 +GD + K D + IIA P++ + L F+++ +G+++ L Sbjct: 614 LGDLEYKR-----DNKLRSNE*LIIALPDVKKTE-LTP*DKFILMGCDGVFETLNH*ELL 775 Query: 303 GQANQEIA-AMIDTEFAKQTSLD 324 Q N I A + E K+ + D Sbjct: 776 KQVNSTIG*AQVTEELLKKAAED 844 >emb|X77116|ATMRABI1 A.thaliana mRNA for ABI1 protein Length = 1981 Score = 52.7 bits (124), Expect = 1e-05 Identities = 59/242 (24%), Positives = 105/242 (43%), Gaps = 6/242 (2%) Query: 55 KFRSENNCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVE 114 +F ++ +GV++G+ G++V N+ +R+ L AE A + +L +E Sbjct: 918 RFDPQSAAHFFGVYDGHGGSQVANYCRERMHLAL------AEEI-AKEKPMLCDGDTWLE 1076 Query: 115 RSFLESIDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLL 174 + + L + ++S PE V G+ +VVAV+ Sbjct: 1077 KWKKALFNSFLRVDSEIESVAPETV-------------------------GSTSVVAVVF 1181 Query: 175 NNKLYVANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQ---- 230 + ++VAN G +RA+LC+ G L+VDH + EDE R+ G GK+ Q Sbjct: 1182 
PSHIFVANCGDSRAVLCR----GKTALPLSVDHKPDREDEAARIEAAG---GKVIQWNGA 1340 Query: 231 --VGIICGQESTRRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLM 288 G++ +R IGD +K II +PE+ + + L+L Sbjct: 1341 RVFGVLA---MSRSIGDRYLK--------------PSIIPDPEVTAVKRVK-EDDCLILA 1466 Query: 289 SEGLYKAL 296 S+G++ + Sbjct: 1467 SDGVWDVM 1490 >gb|U12856|ATU12856 Arabidopsis thaliana Col-0 abscisic acid insensitive protein (ABI1) mRNA, complete cds. Length = 2000 Score = 52.7 bits (124), Expect = 1e-05 Identities = 59/242 (24%), Positives = 105/242 (43%), Gaps = 6/242 (2%) Query: 55 KFRSENNCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVE 114 +F ++ +GV++G+ G++V N+ +R+ L AE A + +L +E Sbjct: 918 RFDPQSAAHFFGVYDGHGGSQVANYCRERMHLAL------AEEI-AKEKPMLCDGDTWLE 1076 Query: 115 RSFLESIDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLL 174 + + L + ++S PE V G+ +VVAV+ Sbjct: 1077 KWKKALFNSFLRVDSEIESVAPETV-------------------------GSTSVVAVVF 1181 Query: 175 NNKLYVANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQ---- 230 + ++VAN G +RA+LC+ G L+VDH + EDE R+ G GK+ Q Sbjct: 1182 PSHIFVANCGDSRAVLCR----GKTALPLSVDHKPDREDEAARIEAAG---GKVIQWNGA 1340 Query: 231 --VGIICGQESTRRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLM 288 G++ +R IGD +K II +PE+ + + L+L Sbjct: 1341 RVFGVLA---MSRSIGDRYLK--------------PSIIPDPEVTAVKRVK-EDDCLILA 1466 Query: 289 SEGLYKAL 296 S+G++ + Sbjct: 1467 SDGVWDVM 1490 >dbj|D38109|ATHPP2CA Arabidopsis thaliana mRNA for protein phosphatase 2C Length = 1371 Score = 51.5 bits (121), Expect = 3e-05 Identities = 73/290 (25%), Positives = 132/290 (45%), Gaps = 13/290 (4%) Query: 47 HPPEDSWLKFRSENNCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVL 106 HP S+L+ SEN+ F YGVF+G+ + V +RL +++ ++ A + + Sbjct: 447 HP---SFLQRNSENHHF-YGVFDGHGCSHVAEKCRERLH-DIVKKEVEVM-ASDEWTETM 608 Query: 107 LQAFDVVERSFLESIDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGA 166 +++F +++ + + + A+ + PQ + G+ Sbjct: 609 VKSFQKMDKEVSQRECNLVVNGAT--RSMKNSCRCELQSPQCDAV-------------GS 743 Query: 167 
MAVVAVLLNNKLYVANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAG 226 AVV+V+ K+ V+N G +RA+LC++ V L+VDH + DEL R+ Q G G Sbjct: 744 TAVVSVVTPEKIIVSNCGDSRAVLCRNGV----AIPLSVDHKPDRPDELIRIQQAG---G 902 Query: 227 KI------KQVGIICGQESTRRIGD-YKVKYGYTDIDLLSAAKSKP----IIAEPEIHGA 275 ++ + +G++ +R IGD Y Y D ++ ++ I+A + Sbjct: 903 RVIYWDGARVLGVLA---MSRAIGDNYLKPYVIPDPEVTVTDRTDEDECLILASDGLWDV 1073 Query: 276 QPLDGVTGF--LVLMSEGLYKALEAAHGPGQANQEIAAMIDTEFAKQTSLDAVAQAVVDR 333 P + G + L G +AAH A + A ++ + S D V+ VVD Sbjct: 1074VPNETACGVARMCLRGAGAGDDSDAAH---NACSDAALLLTKLALARQSSDNVSVVVVDL 1244 Query: 334 VKR 336 KR Sbjct: 1245RKR 1253 >emb|Y08965|ATABI2RNA A.thaliana mRNA for ABI2 protein Length = 1470 Score = 46.9 bits (109), Expect = 8e-04 Identities = 55/241 (22%), Positives = 100/241 (40%), Gaps = 7/241 (2%) Query: 56 FRSENNCFLYGVFNGYDGNRVTNFVAQRLSAELL--LGQLNAEHAEADV-----RRVLLQ 108 F + +GV++G+ G++V N+ +R+ L + + E + D ++ L Sbjct: 504 FNPHLSAHFFGVYDGHGGSQVANYCRERMHLALTEEIVKEKPEFCDGDTWQEKWKKALFN 683 Query: 109 AFDVVERSFLESIDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMA 168 +F V+ S +E++ A PE V G+ + Sbjct: 684 SFMRVD-SEIETVAHA-----------PETV-------------------------GSTS 752 Query: 169 VVAVLLNNKLYVANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKI 228 VVAV+ ++VAN G +RA+LC+ G L+VDH + +DE R+ G + Sbjct: 753 VVAVVFPTHIFVANCGDSRAVLCR----GKTPLALSVDHKPDRDDEAARIEAAGGKVIRW 920 Query: 229 KQVGIICGQESTRRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLM 288 + +R IGD +K +I +PE+ + + L+L Sbjct: 921 NGARVFGVLAMSRSIGDRYLK--------------PSVIPDPEVTSVRRVK-EDDCLILA 1055 Query: 289 SEGLYKAL 296 S+GL+ + Sbjct: 1056SDGLWDVM 1079 >emb|AL010222|PFSC04009 Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS *** from contig 4-9, complete sequence [Plasmodium falciparum] Length = 5332 Score = 44.5 bits (103), Expect = 0.004 Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 15/183 (8%) Query: 163 SGGAMAVVAVLLNNKLYVANVGTNRALLCKSTVDGLQVTQLNVDHTTE-NEDELFRLSQL 221 S G A V+V+ N LYVAN+G +R ++ K+ + L VDH N+ E R+ + Sbjct: 2760 
SSGTTACVSVIFKNMLYVANIGDSRCIISKNG----RAIVLTVDHRASINKKEQDRILKS 2593 Query: 222 GLDAGKIKQVGIICGQESTRRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGV 281 G G + G + G R G+ + K K +I EP++ + D Sbjct: 2592 G---GILDDEGYLGGCLGVCR--------GFGSFHKKTKEKLKGLICEPDLFHIKLTDD- 2449 Query: 282 TGFLVLMSEGLYKALEAAHGPGQANQEIAAMIDTEFA---------KQTSLDAVAQAVV- 331 FL++ +G++ + + + D + A K+ SLD ++ VV Sbjct: 2448 DEFLIICCDGIFDVITSQEAVNTVKNSLIQSRDAKTAAEALCQLAYKKKSLDNLSVLVVI 2269 Query: 332 ----DRVKRIHSDTFASG 345 D+ ++ S +SG Sbjct: 2268 FQNPDKNNKVSSINESSG 2215 >gb|AC002409|ATAC002409 Arabidopsis thaliana chromosome II BAC T20B5 genomic sequence, complete sequence [Arabidopsis thaliana] Length = 72839 Score = 43.0 bits (99), Expect = 0.012 Identities = 21/57 (36%), Positives = 38/57 (65%) Query: 165 GAMAVVAVLLNNKLYVANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQL 221 G A+ ++L+ NKL+VANVG +RA+LC++ ++++ D+ TE+E L+ + L Sbjct: 24355 GCTAIASLLVENKLFVANVGDSRAILCRAG-HPFALSKVR*DYHTESELSLYSIGAL 24188 >emb|Z98762|SPAC4A8 S.pombe chromosome I cosmid c4A8 Length = 43895 Score = 42.2 bits (97), Expect = 0.020 Identities = 56/230 (24%), Positives = 104/230 (44%), Gaps = 20/230 (8%) Query: 63 FLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLESID 122 F YG+F+G+ G + F++ L + LN D ++L + V ++ + Sbjct: 3939 FFYGLFDGHGGTECSEFLSTNLGKIIENQDLN------DTEKILKEVHSV--GGYMAGLK 3784 Query: 123 DALAEKASLQSQLPEGVPQHQLPPQY-QKILERLKTLER----EISGGAMAVVAVLLNNK 177 + + LQS+ + + + +L + Q ++ L R GA+ VA++ + Sbjct: 3783 PPFSLRTVLQSRDEDLLWRARLYYSFLQADMDYLTNYARPSPDSAVPGAVGTVAIITSKN 3604 Query: 178 -----------LYVANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGL--- 223 +++A+VG RALLC S + +L H + +E RL + + Sbjct: 3603 NLSYWESDSYIIHLAHVGDTRALLCDSRTG--RAHRLTFQHHPADVEEARRLRRYNMGFS 3430 Query: 224 -DAGKIKQVGIICGQESTRRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQPLDGVT 282 D+ K+ + +TR GD GY K ++AEP++ L Sbjct: 3429 RDSFGQKRFAWVA---NTRSFGD-----GY-------KLKKLGVVAEPQLTSIHSLRDDW 3295 Query: 283 GFLVLMSEGL 292 FL L+S+G+ Sbjct: 3294 SFLTLLSDGI 3265 >gb|AF006827|AF006827 Magnaporthe 
grisea adenylate cyclase (MAC1) gene, complete cds Length = 8678 Score = 41.0 bits (94), Expect = 0.045 Identities = 63/259 (24%), Positives = 111/259 (42%), Gaps = 8/259 (3%) Query: 14 QPSWTDDLPLCHLSGVGSASNR-SYS-ADGKGTESHPPEDSWL--KFRSENNCFLYGVFN 69 QPS + C + GS++ Y+ AD G H + +F + L G+F+ Sbjct: 5314 QPSIPEQSEDCRVRTSGSSAGYLPYAMADTLGKNEHLSTVDLVVPRFNASETETLLGLFD 5493 Query: 70 GY----DGNRVTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLESIDDAL 125 G G+++ ++ + GQ+ A A + D + R+FL +++ L Sbjct: 5494 GQALSSGGSKIAKYLHENF------GQILATELRALKTGLKETPEDALRRAFL-ALNKEL 5652 Query: 126 AEKASLQSQLPEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLLNNKLYVANVGT 185 A S+ VP H+ Q IL + +++ G +A V L LYVANVG Sbjct: 5653 VTIAIQHSEDRPSVP-HRSGSQAHVILNK-----EDLNSGGVATVVYLQGQDLYVANVGD 5814 Query: 186 NRALLCKSTVDGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQVGIICGQESTRRIGD 245 +A++ +S +T+ H E R+ + G G + + G + Q R Sbjct: 5815 AQAMIIQSDQTHKMLTR---KHDPAEPTERSRIREAG---GWVSRNGKLNDQLGVSR--- 5967 Query: 246 YKVKYGYTDIDLLSAAKSKPIIAEPEI 272 +GY +DL+ A ++ P ++ I Sbjct: 5968 ---AFGY--VDLMPAVQAAPHVSHVAI 6033 CPU time: 78.72 user secs. 0.85 sys. secs 79.57 total secs. 
Database: Non-redundant GenBank+EMBL+DDBJ+PDB sequences Posted date: Apr 16, 1998 9:38 AM Number of letters in database: 677,679,054 Number of sequences in database: 336,723 Lambda K H 0.313 0.130 0.370 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 374080624 Number of Sequences: 336723 Number of extensions: 5779396 Number of successful extensions: 37522 Number of sequences better than 10: 214 Number of HSP's better than 10.0 without gapping: 32 Number of HSP's successfully gapped in prelim test: 85 Number of HSP's that attempted gapping in prelim test: 35679 Number of HSP's gapped (non-prelim): 445 length of query: 504 length of database: 225893018 effective HSP length: 62 effective length of query: 442 effective length of database: 205016192 effective search space: 90617156864 frameshift window, decay const: 50, 0.1 T: 13 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 42 (21.9 bits) S2: 74 (33.2 bits) version2.scf100644000766000024 36676514006427353 16727 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data.scf7RX02.00lH-K 4a$@m @2a@KL @.;@v@@@@U@@@@@@@@@@@@@@@@@@@@@@@@@@,@@@@@@F@@@@@@@@@@@@@@@@@@@@@@>@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@e@@@@@@@@@@@@t@@@g@@@<@a@}@@1@L>@@@@;@@0@@@@2@@ @@@@@@@@@@@5&@s@J@z@2@n@ W@@@b@8@D@@+@@@D @ @Os@/C@@W@J@@,@\Vq,GP@Y@@j@H@@F~@y@U@`@@S7@6@;@jC@N,\ob -,@@@@.@F^@V7@a@T@=@$@@@@@)@G@b@wS@x@dOH d*@@@@U@@@H@@@l@w@U@@ @@.@S@\@~@@W@O@,@m@@#@@@@K~D@d@3@R@-2@tB_QKrZo#|x "'bIX|zmKP?? 
x %/BPlr *c1A1"* X"3s+x=Q0{R0mH\J3@@ M=BA#| 4 x({l1  (<@I@ @ @@@ @ @@@@@@-@@a@I@L@&@@o@A@@ @d[R (%YH4-MJJha"iWK+Fg$"fxL/ 20 J?ez t cNl$:&K O (!@@@ U #|< p D8u}+oQiD#5>@@@@;@@@@@ @@@)@0/)'@@@@ M2 >.@@ @TB@g@|2rEXgl!1E^L1~}j~t]P7+}IYF]<qk-+@@< gcC[C XB/\7#0d:#5'N=1&- +#  :us ?, | y( -) Z ^ GcYG!(z+)M"a)-34:.&")= \@}8:P kQ?8;DZLQP5MD 890,/A87JYL^o:@w@t@e@N@5W eJO4^&2L}NR%  f{*L=M  VX@S @K @B@:@3d-'  b@ *%6D& i'u(%[#k6|@@@@@ @ @2 y F ` _ *04 ,> %f "+3>' b/7l:6,%n s2cZW~YZX'PVC04%b^] )u: GK DN!4#.U!.)< DD8BU>1<?;<=*?58W>GFO;NR5IZg+;] $\ W O48E?z>BJ2Q@R@J@9@(0!.P8KI M"D55T Qv *d: w2^k:G6(,@5%@B@L@R@S@PWJC <85M. 'x!,Igjp { {W+55C Ft]x}HaNC;L5."& io '5 hI=d3ADN(D7u#(hq;k*'2R$5B0/+#g0& !D K0KHl@@@@@@)Rq#+ j $Ivk|1O(D "tm  < ]3"-E3.$  C!;8KYVD/Ui </GGg@@@@_)L@)HeD'//C&Yg;O )/+ Hg/)  AK4Q$o'3"` +rRV:q& 9  % : Ws%\7pdY D5B,0?4@N?@RP@La@<om(wy w{yD. 9   O PV6'! v"$&#m' 9Ww1  R Ks`P:u6  B[u k zh*W Jh?04H+bv#rFvm%Z+Be00*-LJ$ 0L`5f)I. 2$@@@@@@@@@@@@M''054/%r*3:G  v*)q;HO0N@G@<@1& $ L  c 9 | 0B@@@ @g *|G=N)XkYtRHD0$ 61H ,VG\&\6Y\D_KnhIRu>N.e,vlyN2n  5@'*+,|~/p"7 G,._f|n@) ^9 6` $!!&.%C \bo wNp`Mu>]7673)3H)t@ @ @ @ @@ S` +l3  *7 .*.!@P ZUJ@, : 776@ 2@(@@@ @@@$@}@@@@0@>@@@@:@@@@l@u@l@M@@S@T ).z-M&,l*('>&$k_!+ An! ;)6H'Zgenj<`K 4 pY?1 < &)=2WT1j(rMl[c E12Y$VK`$.8^@<=B AZ=e=CmO]h j`3-L6$,"4K/cx'-11?.H&qY, ?$k(,-t'=+v&UC"f5e i8duOA)K\8| >| Na(3e, 891$ N}C PkqI @mF;OPE5#!'D'_'H!g5  %0N6v8t4,%#Q !*7CDH09L-Il m%o@w@n@l@s@@Bf3~vKph[K;R13@W'.pze_mqSUd<Y+&N$vC8}/-'%@(n0 N;! F1HNANVFm7&- RB ]khW@ 2=pFb U D!)H,&c h#-p/)R.  !>E_ Mw_}G;b0jO+ki^udEz*9@UbaFS=p'|4 ,X@P4nYHYVP B .4`*(;./ 1C=NOAU`Hm1t"vwYL|!`G'(i$s '!#.:=C IKK7M PB*Ta.Xm+Z!VM6f <!&  P #(2}@C t<O-}umbYTDj5'|,  >  ; u Bn4_ Go<1xD*T`^?_$W2G 3v! @@@@^@AxtVtG~+y[*19 _4z (@|}5:O<>A@9@'}Gg, !,1",z#[n g"c_ PYA4NY >*z+ xlUo1# g ,:3c41+$ gU>")%52N@aP.g\YbdB_'V:GD7*/#A$cN-M=>Q,hP| m){,f$O'9#'Yr J/ ,Jhr+T6`4!)Z@ @  B.0{($&BN]!qp\BF& =F  -l67\. 
J;h6JG 9;rb2hBf,$s> @ Ta * XV @@@@@>2!j  t&-*!Q ]5* 6C +  Q.`(5'))7$A!B=F4-( *$.I5/,9P;$x3:'IyP&.oSkh$\#:O. 6@k@$@(@(@&@"<!T#',+0,1)+# 0+]NIy|APWtGM%*&3w74Z .'s!;b r/%8CC :) \)75b|b;*4Y( qF ] $Fl j\#+;LOG7v_vd0 G)1@G@Z@_@Q@=@'p!V] gNu##+&+0/'| O z |    ( X $( 09wQ8po7RhUyO3WC ft[}% ~y>r]l%hjr.}lehc#yf UMG-<2 +'{ r0 8,531,~>/NDm$b x-\=%h cE*jYLZ6aLaVWCk-N  F +oK ; [8R zax{| /!'EW("ym81 G%SCT$VAs0SJG^wr7/pS"8 PKT&*$w~M6 U2 :BsD;, ) }(C&8 IQORC1! ;ve'j D!%?++*H0;PEJr BP 3x "gi;$[vv[; <  0A~L #Iws;(VJ!+1 yBOO?D3 "i iPc4&)48 4*Y/ v   s# G w'3DN mn|a6xF_`A9)Z@CO6)0L 0 9)U Y{x ,{fV7TczY!Qj`s?A.s0XI xf L  D NcEy]>}u pts}rkrW=,,*:jUq)5n>PB>1@: -O dK#  & E CV]S , m;!    !VD29I7Qb`j]lKH`@*! &*-70}2<2 0,( %s)#b#%@U' *+,)$$V\! VnVR + P2G(U6,NU)"G 4\   \)(5$4<;6F83)/#+ /$ M#eve]c "s&9I5"JeV8 Rw)kl-TC='E8ZeNwf~AdlbBzE"_ IY:1+o&! } */.y$)"$ 1@ Lr SUSSVr%] /c6en=^eGPT;_'xd^@L [69 t"'*<+'9? r+n<X'17;8/6f3d249?@KHwNqS8WZV\ ]z[IWT(R<NNO ^[V^cYrQ~F%9}*pkb8WF S*V3 .\3K`*'d_vXvNdCJ;..53J 358I=L B FGQFC@7= W<#r<>ozA3_C2 ?Dn!B <23( " Tk l J&H/ +1 . I* 1FjQL&Hf<  o&= `Z!w"v%?,C4>vFL_LMPES FR IQIMnJAH3B+$=U8!a52/$,.+{D+i0[6<t@-Ab>8S1(jl WEi2!d" 8z TGqP,BSk[Tq,BRR.4H IK6?aj0+ %7#IE]M H&:)' T  jD  Gm!I  ` U9%_e0tPb*x ']2<B<HC@u&:62n+#ab " %-z%>!HH@S>-  mNA!&&Ss+4/yw*/4'+ $6nl7`bVr. B 52! 3zHX#apLb [K[6%e.r, C8)j{u[S 31p F)L+|*!e*08\@C MLNSkDZ/8`-1fh2h>f6P]eNz9$&  cwVY6P+(O"5ENHJ7%An.1V +"n!S2TCtNQOLKGC?6O),R` U1 +02pw1C(=K #$,:j *9I)Fv&_)NqYjI%"$,$,&X ##:`]#  %'g)>,.Y /-?*%K"#(a.6v:81J%1B6HR4L>'g w _ 8[@ ZW*b ?" | N  1Y fz8E w )y6M? ;/K9$v 1(F(3~6E2D'} 0 MB h|}z}!JQ{cS=G#' 7QZe 11j  [ mD #?> p\ *  D     f#V ,A0{G\ hL e o Wv Gb; :<Km V.7 =E,1L+i ~p0{ b: Dv(e )Z<F lej2d ;Ls8S4a $x\G;&c`yT-p " V!G 8]K\^WEMXGLF%FD;;ss0D($ 'U.8@?w8E0 + , 0u0Q+'4!Kx   u&?[*E'"3"] * ) - C, 2% g5F{u k_`"N8#Y/:hJbbIO2fM8eWpiR7 ]e/3W1 MT#D 2!;:&-0y,\#J.'*"A '3>;Z;`7J35<LEJ J0:Cb9/)1&c%|%&yQ%^".>k;MgG )5 *,  &  1  6Pv%$ ,*1, !!1@2JMJOG@% 1  ! =]3XK1`kh&YAp *.VX5^ F`8".;zEE</;"nBjg)*374Pp*B[cPM I T= . 
| H+' c+*"S^+ &KpN  ~'] W]7 79 x  P !"4M(q  S w] 8  F+"Y +!c(2=\EHA3P$'?:SGF;!+_  cC'i '2y<sl@F>"4 'Hy "-+6`$?:#ZR8]3]0S_Q4&>? +PbpyM{zvjq68hk&[5I>47: d/y u\2!"5lU /i#$> Q ]#4!  i%V:Vpsh"#tm[Ey::DSbmItuVr)kd`R+Cw353+2$ %"#%$)a7<,qQ1f6wr<?tw@{CFJuN]TjDX'XR G7L9d%+#!$#3,A 88DOVY)TN iI) 6-$%3E@VL5 Xgw~ I +RhkY6 . e d <d r(@Ypc -ucaJ4c#\1 T7eh!6 ,+4:26)  mC9$ uc 7)pW53:w<_<4o'Bu#e\ &Y!WQE4#  k"QJ9~$* =Pb!s]%>&&#' W q s2eFSVH_F`H[FP7B3')"K%t2Gay n$ML/To/Z3GV7f#,e:%SQ4b f[F"x1KQ }4!#0$H@cH{<  ~n&d,Tm,. &#.l 478=KTg|Q05W#mqeM,,@R[ZV]S'@RJ+QuL > / $    :c)7w'>H88"G- 0K8E 29 /. :Q o{!"[6?Q'r} _ D2*)**&#l"EQ y:&'0<7 Bd@3 $z[&0($   291 $1 <+A2>i 14M''3-BS^^VH;3 5 A S f! s,v2"o..b$4T0I 'DFJ NO5M O(H f-As+;o&7_5I1#6-5))P# #q( 9 T x  "%c%@" 19T +ux=VR:i$ `=  wj(\ GKp8&  )~*.b <+OO"Ca8q +}#}4rEeR[WUTTIX:^+elqvz {(y ?r&`d'O$8'#7 W)= FHD;`,5 x$!Q3&6@((G%(J$0G):C1?@:;>;0>6"A-K(Z/oC]uyd TD" 7,-0 &. #+))P5/E=RQWfTwH8*    !!4 E K C 5z!y| .{\#q%f"\ UQ N}LI LL)M3M; I@BA7?)/:Q6x2 /#,%((#+.13p-4NL40w4!4(4A3^0u'zi M,.9CLXk"ZM3  utPQ85. "-1 $82@?KH[OpQ Q J>0c!B"  $,39I?/GDNZRkSsLr@]i1,Z!K)>85<.<-"1.&3' =' J&Y#2gFvZgidYJ:)%/+z6H]:kE<2:$8 4|3 ]3/C5=8 4I=.PK$QWLVEI>5=(C.ON\|fi d+Y1L/@&71|-b(G)#/;KSR4HK!8] 3%c ?]E K?42 %! ' 6/ KI de {y q``AL#6 #!)4%?2H;L8;J _4A)2)#>R a)kq5 mr?hC\@$L8%9,!&kG $ " 2G^ q~#1:x: n1 e$ `^^_`_a fm+v"C|%Z|$isjb^LK38,* 2 @ Sg$z$  !8'R-rj4Zy;D~B1y J$nQ$aT2U!QNJ Jr><.-! " g&B#6 3H @V+E\M2BWsB7KL'>H49 1'5g<E D- I% ,L) >J0 LC6N87 E+44!/$*%!$---"<%2J FQYR iNqI,oA6c::O048%'%:c% )(,%BWhe> k dR;% -1"JG)e]-uq,p&\D9 FlnQ7   )w&.S5,6>#"A < 2$,$C /Z:j BqDn>f)0V4!B< /A@<(4. *+ ' #Ge' .2$02x,H+m0]"h+ij"eoXn Eb4O(:"+ (!3 M'p*(% #$(- o1L0!9*61 Q/n) # } ^)>!E%2bEzXfslXh<\ K 9* .AO&R2F =6F-J7GR@x3%    - 0: BB MG WOH3FH:F1@47F)d->INKB4$ ma $\5aFoOQL@2$  |^ G:9?!$J!1T@XNUYK _@[6K.7*# & ! 0B%S3 [? [A W:R- L )DM9o* + NhxK5& % 4:o;\3R*R.Y Bfdr{|ue?PR=E/+1#>5DMF*YN2\JIU=^F+r1  *Dl`Hu %  x laXRL E !<-4028GK&9Uf0Zw"Xw Sg OM M1JF'<-1/% .*#'2%<&>+44)> G ID 9 (4 `   ! 
" {gQ >217D S ` g!h'a1R<?G+MJ@)0 ? %X &j$l$ _4FA-II#C+$: 5L2@,L+V,[/W1 H15- "Z&&!.:@2AE> Y8f0m & x # % !Z 0#:Sg q+pKf$lV)E'5!%pS= 5;J[ hj he jy .<EF<-yT 4/ C$U'[4'X S'Iq'6&#&u&f&`' m()'"  # /l):[-AJ+E6$E&@ 9.19);&2'!%,#4 =CD A;3, ' " $/52 (!+ 4 < B# E% F! C = 3 ',)5/73/2".$'(& = \ $*)"  $4 B sE`?K16"$ (. 1)-.$-") 3& G*Z9)mQ7lA>2!th\S uJ _FJG@OC\Sigrso m`XG=/   %#*:'V {  #    ) 3;? @ ? |<j9X3G+;" 558 <:(1 5&'B)I$ &I;?a/  (%K%isb' 6386 ~-Y!H#J )\-u2 <JYhirHv2 t1 pG loig (c.X-G)4s!"E /CT&ZDUeG7+# s R&2./ '!' 10=EFXGfBp7p*gT < %" 7Manqk% [-F-$2%-".$''+2= L Y! b: ePeccdaW_E[<QFAd. "2?M H#IC*80+0 (.%? BH\IpEn>Z7 @4#7 !="(F#'N SR J.;>)H#N $V  f$'$'06 3*i:&-Hf      ![/8;*E ?MvT^j Mxb K ou qe VK B : 1 $z]I < 5 5;H Z jq! h% Q&8%#!'  )*, $1!*7'/->5?(C=LA;R90O ,"F<78 AM%Y2`;c<c6d,e" 'fFdhZH 3  "  y W  9$).1> /K &Q T VWWTL A "7 62K5Z>bG6cK f`D`5i$#|%"Q, "2xPRr2 .CVcigIc-YL > 1&2P i x m+U2 766 2 2<, Dc& Q" V#R)D20:]@ 9B @% <37>#4?206?*(G I D 8)* 9&Q-q*&!-/ *    #1<@=5)&E P G,& ,.-+*e - D5)C'S(d#u qX;$ 5 T u  #+^1=9"G"Z5qIX` #a +]2Y~4U e1TL(Q4L!B4$ ! 0< >63[&  p ],X 2`2o+ |# y(k>]Y Yoc},|= J|Jq@c/UF~8[-I"M !b# . 5 5,f!A*!9 GML B 3,$ >Q "a/l8t9x2x%sk bZQF7 ((#> &O')Q-,F3055469 7<<=?E; rR4\)a^UHy;F1* & #"$/"1I$:a% 9r$/s"h[ R P T.WFT)^H;u5K"P I7%jS@ 6 (62>4L ,[ !d f  a VJ?" 5!,+%:EF "<40L(f*5D" P+ V0 R1 G/ u9+ `,$ J!5#  # 8Rmw dR B*6"/.,,)5"&>&J)]- s26 9 : t;V;: >) B%G+J 1-J0;C+)D6@E&S?c3 m'r!s2p<l>)d67Y&EKL= K0 B%I4%6H :     X> D e&)% ?    &k& %D*,+$ 'K#u $/ ;YG3M#J$@2*$<NZ"_ -^ 7Z ;X8X-\ ad ]"L' 8( #$    @ n  *// +('2&8(c8/L178%>&B A<3"' >^yz ` B!$4J.!_ V+r {4 9 81q$yJj& Y F3&#8 HU\`-(^I ,[c-Vu)Ov!Fe<I1-'$(++")4 &D$"RDWeUL=s, W6 ";Xx! #   ~ Y 9 !$$"$!! ;$P %` %] %K (3/:G T ^b$a1Z=NA@B1E #Tq# 2@&J5KAE[G:2F,@ 7. /(F%^'o+ y/0.(). -( }ty '!?Z oyxm [{Ik9[-P 'N%Y&q (, 044,19)x FXSC` ;j>nDkFc?X0N G DCB"@ < "7?2c .++)'#,7d; E6,+ ! ,0) ,8#C"E)) ?/+55(*=!!IWf!p's-j2X3?*2(4,8%4 ) ,485.E0Av&R\Z L8#T' '9K R*L(P<9z)FK G: )+1 V23.& / J*f8{'?)>{%6j+U B76;C IKHDBDFGE?71 3<I#U*Z/S1! 
A4'.5*9( =&C#G"H#G(A/ 94,.5Y$0%'0 8O<%:-38 (?FJM QW  ]b f% fE"aeYOCs 8V&,51 !89"5./:=(EZ!NrXybiiNl0jaR!C$5 &+&(?%*i%0$8 $@%E%F&E&@$9T 2-+%     & =&U/l:}DJKH@6 ~-9y%_ri \K9(  uK &) 2R)644>+6DvE@ 6f'| =  &$    %,1 6;!9A>5@[$*:y+00#2z1 h,[%\q" +.),!EE -   O!'!!'02>6+E 3TC(8)^  Z ! !   ),* "!k15X@ ONGiMFzT=_/{h mkYd BQ *:% #"-%7'>&B%? #5 !&(6 /Q2l 1-("p[ H<'685G5P1Q&L @2!  !(&*D%8a AwD|An8R.5 %  !!(+&?8',XB+2lE'4u@0v3'x$   # g4!1? C=&./ 4."  #08.8@20MF#TZRh Is:~+,:DG|FS@(.83/=&B?49%c X/   )7 =e O%[+a0]1Q. ?(-$w#V(A3 ;DCW Sicvl| k{`tRhEY=L;@ ?7D0G+D'9" * (+) "  '<Sl-If |}&xm-fZ2UN4"EH6$8I8%0N;#*O?!'HC#:G )KO SUUQ JA7-%   !/;BEECABDKS [# _;_RZg PmBc 3R$B ?0 RH~!]'k (oM%i yZDe-) "N& !') ,'4SBVo&)$(.&# T, { h S =(   #% . A`    + /)#+ &g9",A=sHKD"6!H%Z] P9)9}D_IAE(;,1Iau    v m e_] YT L-@82@#CDFKS]hqwxt l_,OH?j2)$# $ ## .d3'>06 'F S\ _ &Z JN r? /   v L , # 4& ^) ' "!*q3e8h9u5-" ri it$)+123: /B&JS^&[>,_!._ (YM< )3 Jakg XB%+5C L Q QM7 GU ?q 6+ w Z =  & %.50E-T$2bLrettd K1y dO=0 ' "$2<A >  4 &,$ A- T3b3l.p%po mj"f.a6\<W=T;R'3R6'QBOJI O@P6,O*HN iOT[en r ndS>*  ! + 2 _5 50%&2>GL L F=;]- ~")e+E''    +255X77 9 > F OY f`@cb [PMB 1 < 'e3@N{YAb"i !n :p cq q p!m#g%]'O)<)X)(3$  $* 08@ FIG'A,7-)+ ' ,Ee  !/ @ N U Uo M? @ /  '&J5oAI IB4y$S2     '4< @!>J8- "" i  s&4+.+s &M80 29CNW$^1b;a DYKMQ ;!V)#Y![ ZWS M E=4) %'I4"s<,?8=F9W2j,~c&:#  #Gt3 QB4L;Q TVXo^%gr~F #3Tyw mbXMC 7," %'%fD 38 N n f N B%G2\ ;|!> :,2=( Rka B0/<Rhu q _sEa*Q C'5- (//, &  %8Ney , ;E{Fi&=V&.B1#   #,+ ?3(P9&BY>3dWA:MB9<B0)D"H N U]dfbeX']dAPb?-\4  ! +3|7x7u2 r* n!f\ P C5* '/37 9&;*>,D *I%M$N7 OJKX4 E\M<Xd2Ln&:g&R7  /#T2>FIH'B#: 3%--)4(9)s;)o8)q1's$!sk] L <0,*.92D4L2 P(PLE >5/- 067^AJPQMC5'^F943.!%*4>HOT#T0P;E C6E%4B N8 h)w x jS8'%0!502Q)~  *4994 + !$ %%%% $"#&w*.c. 3O/7>-70(6&!667 8 85.$3 Qr!0?K O K @!0# !   j I+-9DK!L#G%?'4-)5 @ N'ZBfd nsuto ib[ WV!Z/b~;mv#Az)@+9)+#{iR 8"g =$6HTXTJ <.##2?HN,T7 Y:\5\)XN @1% $ 4M)k8DI F?4*!  %+.02j$2W)0F+-7)()$!  !. 
8>;)$25*$>.E- I) J!G(D@?X: h5j1\-'E(1,#9AH PW]emtw yv oe!Z2M CAO5W)W Q D  4#% , 4;BIR \d#0k/Mm7nj8b3W(I<2+(y'a*K.91*59 <> ? ? <83.*'&%&0 %9$@#D!E!EDCA' >H9s 4.(     oG%$ +//, &  # ' + - .+&' 2#=!EJJ G@84 115;@BB=  5-)%H l ! #!,!a8BF(Ta konhaZVVY_ehh bW3I ^7 %  uZJI (Z4y@J"P&R'Q'$M-G( A:2) g%?1 ? HML E: 0' %" 4!A$J)L0J7A=4C%H K LIB9.# #' ,0/4D7X;a?`BT FBJ/!O*U0] 1e -m$t {2FWad_{UsIh=]4Q/ C/52&':+F-R+^( g!*k4g?[ IHQ2Y^^YQD5( " *+342:*<92 ((5@ FIHE A <8+4P2/- * $ %FUP:$,6C Qu_Rn4{       vcN8%$Cg   q1S@<M 1V2Y@UVI r8& `>!   'Eh   sZ,D:3F*O)&U10U?5OP6Dc25u-$'# !! $ )/6z=pDdH VM HP;R 0S'TSS RRR RS1TFV^V%vU1S9O>G@>=27$.w #O.,@R^ b!_=UYFy4 #z$d)L*(3'9 J[ kx { o]J7',R }!   " &*.4|8s:m9g6a0W)K <-  )020'        '!<!P& ^*c,`,V+H)8)+(' & $ ## % $((0*3<, >F+GK'LL NEM7I(D= 5+ ! 3"F!4UO`mc _VI:.#'k4U@DJ>PASJTTRZO\K\GYDTBQAP@S@[@l>;73/+w(S&3% % %$"   ) A[s ~ hN5   /If    $ , 2 7 ;v@hE\IT KLKGJ@!H;'F6)F5'F5 H7H9H; F<D=@>=@8A4B/A*<%3!(  !. %: *G /T 2a6n9y<}<{9q2`'L7#    !"#&"",+238:<?=@;?4;*5 /) %" ? o "73HD LW %Ci)/{,,+($sugpZoLr=w- ~    zkW C1% #+ 1 5 5 3 1 0 037> H Tb'sBc  ulwel`k\w[[^binqpi]Ng>O0<&/ !). ';$*B+1 C3:@:E!9AP*1FY/*I_- Kc&JfHg Ch =h6c0Y*J%6$#$& * /59 $:"(8")1 *'*,05&:E;o; 70 '6$P+W4K=/E M S X [ l[ [[ ZYfT }ME:/$ \:"" ,3$655F2R,[&_chn!u)~3> I|QmXX^@ c)ed -a8Z?PBD@8 ;+3)   *<H I>$.4F W fqvvq i!a#Y% S( P) O( P$ SYc n{ !?f uc"P'>*. ."037m:$C<0$=?>O>^=k 8u 0} $  (?WipmbSE6*|u n  hb^ X+ S5 N!;I!:C3>(8421146 :=?%C,H/N!,S#.%W&BY)TY*` V(bS#[OIL4I HGEDDB@=;9::,;L:t 5/%'2> GMN"J( A,4+&'#  "',3':-B1I2M0P*Ou!MpLoIq FsBu<x2~& &3?I PQN "G*=246,7&6$2#/u$,m&+h')i'(q'%' !&%&"'))- -,2)$ 6")9-;1< 3<4;3f92E50+0-** %'%%(0: G U d q  |(9HQRJ < z+ s m e _ X R LHF C@ = 7#0%%)%.$#4 9 = A  DEFD&?079.@$CC@ 9  /'!#.34F 8V:a;d;`:V8H4=.7 '7!>GR^ i&u,4 ;AGrN`UO[ @a5d.d -a$/]'4X';R%DL#PF_?q6+! 
$' *,-)/2193}>4f@4O?2;=-,9("3 #/ *& ##!'* ,",&,,,2, 7,9,$:,8:*J9&X8!`7e5h4l5#o7,s<6uB>wH CxJ DxH AyA <z64y*+u""l/`=PI ?S.!Z'_+b -b ,` (\"VN-G<@K9X3c.l!*s&(u+'q2%c9%MA$6F"!H FA9!0$&',3# 94DIP\[pek ke\&Q2G<=E6sK0YN+BO&.O$N%L) I/C7;?3E,I &J$I#E$%=1'3>)(K+X+c* m'v $| (=Rdt xof\jPPD46(   %"33>F DYDh<r(.t2 p9g?\@O?F9=27)334 8=CHKM K G>)34%>F IIF?6 -#)>R cr  "D%r'*(4(;'>%'>#*;6/*& n$.##$g%8'+ 0 6@.KOXzf r| /E&J,@}1)z3 u2p.i(a%X,P/rJ-YF&@C+A> ;8 6 679 ;<;71#(',0" 3=4_420.+' !!-f:DF(QY ^^Z QF*;70?' <"/"  (2BT#f'w)*)'%!~(r4f<Z @NBCA9>18*0%&!  8&V(x(&%"!   !t%K)),..-+(&$#!@b *!:"|H"WS 6Z]]\ YWU"S*S2T;=WBf[D`Cf=j3m& m"k!f_YV2MC :' 24 *> %C"D">8$5b(' /7A M/[+gry|}u{ZzLxLvWtjpm i d a ] YVUUWbYCZ)XUP"J,C2;431 )()B*` 4y;?A@>q9a2O(=+  (@" ]%z#&- 242,# wkb!_(a,g.p.{-+*)7(C'K&U&](i{+tf/Q3@83;,=*>y->p2=h:;cD8^P3X_,S t$NGA :1!*! $ kK1   &,38<@ FKQV&(X>5VZBRtNKXB^:`2`+^'}[% hW%QS&<M',E(%=*'3+ 2), G- a,}+'"(0#7)=.B0F0H.I)H"FDA <$5'-+",- +( %" !  " "cD$*+/12&1: 0O/d.u ,)#q#U$ :#   -?O[ekor s-sCs#Vs*\t.Rv.>{,((" "( w.g2X4M5 D7?:;>"7B'3H( 0N'-S"*X&[!!] &_+b/ f1l0u,$ (>Wkxzwk[E u/bR C!9 #2%0 %01%2@&6I'9J'<@&'<0-'<1)92,511/,8'#AJS Z`a`!]8 YRSjLy)F~7>tC7^N2 CU/+W/S2I7;@ *K U_fijifb ]6VLMa,Cm?9qP0o])kc$ic"h_"gX"cO%]D*T:2J0;A(E:"N5T3 V5%V:,SB5ON@KZKEcV> fa6cl,$Zv"2N=CD<F:C;=>3@'@<5 /--2#9#A EFC :,w&k)^'R!G=6 20002B5P9X>ZDUI HM6O %O#M) I-D1=465 -2 $, "" 3D Vh y #)-15:>A@;3)"& +!0" 6$i=%ND&5I(!L)K( E$:!-   ""$"'!'!"!$( *06:?EE#P F'[B(c9(j",&n'$q("q% ojaW L@ 5,#  # / < I S Z_#_5[IS]Hm={4/ /39!A(&I/(P5(W8'_:!d:h8i7 g8b;YAPIHRBZ?c>j<|s:kz7]2P-D):%2} *y&v#u #u&v+w3y=zK{[|l||zuof$[%N$@0m!R7  &>Z $ s,/$#/5*+Ex/"Td3aP3kA0v:*=!GZs Im  m I lU{J#tJ.mU8di@ZEOHBJ7J,J# IHGG!G%F)E"+B&-@,->0-A2-D4-I4-O1v.T+h.V$\+UV(QW"M ^GmA =9630y .[/=2$ :&G,W3j9~=>;4* u*d3T3H*?8 1,!).+:/C5 I=ID%EH3;H@,GFCH?E<><5@+I!Uds   "!{#i'+V'7 E$<8!9323 '9%A,J6 P?TIVRT]Ng Gs<~#//";E N[l~}|zwsmgb]Z XU"Q.L7H@E EDHEIGIJGL FNBO>Q7Q.R!RcPC M)ID?;86%5A3c0. 
*'&%"$x'$O*#++!*!$*4< BIMON KG0BF<] 6l1r+q!'o+'l3*m;/o@4oC8hC8ZB7F>4/9325+9$?E!J.M;PJRYThUyVVVUS P LHDA >#9& 4&,#$ %+/2 3'2/.5m(8O 775)3 % / +(7 GV c  k  o ldYO F@ ;6 0')."36 5(19*Q n #+/1/,(!$*| 3|8;:7 1 * "& CYgml%f.\8O?@A1? %8+%- $4,65/ ?JT] d i}kTj4e] Q B3%%)) % %7GUb!#k!+r0v4u7p9d</V ?CH @X=?m6;354+6"764u3d4Y7W=\CfIpMuOuO pOfP \P,TQ6QP>TMB\GAl>;21#&   " 0 5.%-24Z380 ./2 7>FO W_f j k h)b:ZHRQ HQ@M8D!1;-+17 &(>"@="6) ,/%3 688:> E N W ^ a ` \ V P/ MJ Jh H~ F A 8 ,p _ Q J LVfz! $%&(*,-~-l- Y+ E) 2'!&(.5>FKK HDA?)@5AB CMAR?'O";2B&9:0&9? ;B?DCBG=G4G)ECCEGHH G"D0@<";?%47$.)'" #,7AJRWZ\^ `Hb|e im5nX# mf*i_/cH1['0T/ O.K.G.F.D.C{,C)F%KQ U UDP^ Di5f#V:   l'%T,'?0&01%'1" $0,.>+W' "v$# "  ! (v 1] ;E D/ K M IA5( & '20);69@:GA6O<+Q5!N-"F%!;0&  #"% +,(782FM:Tl@cBnAw<{"5~#/~ '~ zu mbRBt1k##n.|:IYk}*6@FJJD<0f"J7 ,'(r/f<^NX dUxTUX" Z&x\)`]+F\*-Y(U$ Q!M I!G#F'G-I3 M8Q?5UCSZDu]@_7a+a`^ZoSNL1D<3+#    $ .!9>BdKQ U V U R LE;1'^$8-8 CLRTRMF<2&    "'+.000/ /.. ../"/%0%0$0#0$ -% (% ""        ##',++;3-G9.O;0R:5Q6<L1ED+P9$[-ab ] TF8 +#%,0237 ="G (N.R4N9@>.@@> :4.("#/!< 7IRTn]dhjjgpdZ_DZ0W U V X[%^-_3^6]6Y5U3O1H-B(<!879 > (D 5K @OERDS<R2O)J (D /<=4Q,g' {$$%*!'1),732<A8@P@C_GCmuMAxfR<UU5BT+-O!}Hw=q2l& i hh j kn p p p m hb"Z(O*D(8"*   '!60A?$CM*<W2.Z 9 UAJI <#P-#SR N H/AP<w8654 2 //+5&.,<JU _gg=ns w y y w toic\T!K6BN6!f-$|%' *,%.4.?,E(tF#RC1<3$)- :IX dlmgYF#0@` #0m>NM1Z dhf)_0)W29M0HC*U:$]3].V*I* ;+#"/0&")5(*;'/@#7D @HIKRO[Ub\feeocyadly ygTA/ lW C1#!#'#2%?' L-X7aB'fN8fYLaacXexLd@`5Y-R(K'G'A(<~*5r+.e,&V-!D/ 20" 1'1* /,++ $' +@ Yr(9K]lw| |uj\OCw< _:C<+@FLP UVUOG9 9a *     -<!Gp$QH%Y(%_ $b!db\ R #D&4*$"-%0 %4$9 "@"E#J)#J/"E1:1+/-, * ( '$(0,:4BAGOF]@f8m,o$ n9jW cwXI8%F brxx u r p p'm/e5V:B>-A CB? :'540>-D+ H( J%L"LKE =0j='4>BA /;^4 -((%Y$s$s$Y# (" i9  !$! # "! 
$' (%"  # % #!%&)6+H ,\+l)z$  $,//*# n ] TV ` p!&)&*1+;-Az0BX7=:?2K# Xc kpo1jWaXPIC?=< >a@8BB?"6#L)*-, 'Djy #r-V6+>DJPWv _tfym s y | } zumcX}N`'EA2 <&@6M!3W/3];6^@8Z>7R61,K+'9E@@== 2:"71 ,))-6 ?I#O'R%SQ N LKKM QW] d%2i&Tl#{nnl g.^>SMCWr3]Y#^@Z* TN H C ?< ;:9 7,&4N*1v-,/'1#2 2331+!q$"W"E#= $<%:&7'1%'!!  *2/9E>ZBgFiH]KFL/ N Q UX[\![)!YGUiPH A 9 2, r(U$!9%,"*5-:/ =0=0:/5/-0&&0 >0X.n+{%#}'u*f+P )7%$' 6 CNU VFOxB$1) .51]1o 0g/G//25X7.:=@DIL!N4MJGd?~4( & ( % ~ ka ^`b^U I ?:$= &F$%Q&\& e$ m  swx sh(X:IJ>W:a<hDjO i^er] T"J$@#$6'"-)&* )' $! z'Y -<2% 6/6 ?1M)W)]7] C&WL2MO<AID5<K)+RY _ fn) y267&4+ /+ +&), 4 >| Lq \c lS}C4+)#,'q$0,V)0/9&(2!33 5 8A%P6cEvQY\YO@-o^ K 79a'!&%" *>Q`jm!$i(%^1#O9!>C -H# H(rB0d6:[&BSIM LJ JHEF=B 17#) * Ko   gD% )'3-703/)',Ij  ' 8ELI!@ $1&!*#0 )8.~B0WK06Q.S+Q&L FDF LWc p{  y<nfbWP M MOQQM F;0! &&)*)&! }$t(f-V0H3@3D3R1c-s(x"pY?%"(,0 2#!3+A3.k2*1 . + ' #    b ;  $:2 `=CD%C+A,)?+4A(9F"y8P\0YM#c Nkb nkdYK"=-3*$ +7*B-F4C><YL14Z% hs|  xl \ J$ :( /*)):)([/"7>D IOV_b=o ~  &$- H3t66"4m&/\(*P(%G&"E$G#J!N!}Q PK C 7+   "&"(#!%2 ? E F ?4' &C.'3;7P8b 7o {2x 5,}${Vs)d P;$&50V> {M\ it}$-\2:4 2 .+'%$ ">hn `XV X]#c5gHhWe`]aO`=\*X V VVWY[w]x ___&^, \2 Z6X8S6N0H%B!<&8-562>.G)N"y SMS'N F; ,&+-'0G2n6;AFHF_ B9;4, "''"-1 59 ;!<$:&+5)L/)q"'' }Y!6,: G R7 WX3X)~DQ3SH8Z<9Y04N'-="$) m !I !,& )04 > <M>_1;rO5o* ]:vfY PJF)B3@:=A:F6G4% G0; G+SE&hF vK}V}euvj ] R J F E|E fCM@6="#<*@0 J2 [3 s2 0//.........//0111/ + ' !     ##)'.)3*4) 1%)  ' ;O`k#q& q% k"c yZ `PSIVDkBCHO[!!h& u(})&&9{Kn[b\iLFtA/z>|=y8r.h"[NB857 ?JU^cca^ [YX"V"RMG &C0@6@8 C5 G0 M * P# R" Q1 L<EA <?28*2$3!@ [ !0!CT=^R`S[CQ'D6 ,(!(%.$7CAoK UaoNk3& 5<x=!g6,[+ <SUMxJ#I)I+M )U"%]&d8jImWl`zic_daS\ZTSQ\IHeBAn=;s<9u@8qF:jK=^MDMKK:FT&=\3^ *\#U M D@>$@%F O\ m gI .  "0<B D#BJ>!} 9,$26%)@:$He M NMzH[D) BBDg J7QY#"b,&j 4%r9"x;{:{!5x,q g[ NA70.1 8B$K.Q3Q0L&C6+%#Dj/"B&R+pZ/Z[1JU2@K0=>.?3(D*!I'M)K, B136#< C K! T/_9 .h:Gq3 cw'*x|. 
,"nW<%#4DKH = +  ' 4r=bAY<X/] !i v  #:|(Rp:fdKmYWgN]VB_?4 ])$YV *T1U 3Y0a&*j<*!tW:~uEJF;-G}ix|u{uft?t so i!bv Z{RKF B%@+=09426*o4"E0$+ %!!*3:>?> =?C &J /T6^;f0<kK8lf1hy(a|YqQ\ KGG8F8HJJm*K:HE?J/G =. % 8K%X1^:[u@TaBIXB?aC7{D4F4H7I;G>B=88z+/K#%   ) C'`8~IPMA 0fH- *@ X'h'?p!roe W?H!j=/t8< W8I;T@\Bc!BBg =g7`0W+J>);d*,. 4)9/=1;.4-)o(@#GR%` h#h+^7JC5O!X_cc^V J)=B/.aG!^ nv#u1l?^LNW? rb2Kh+*j'g(`,S4C>0JT[7^t[V  O DK WH <JMPgR/Q M!G#B1"=W >zE Q_pc|>%6|KmaX vA+ z'w4u?rFnF h>`0'W 0O3J+I MUa l0vO|w}zun! 'g"Da#V]"X] L`5d in ruz   u cQA7| 5z=|Oisq\IH(:2/ 049 A"'M';]+(Tn/4j|2?5I6R}6Xm4]X0^@+\*&V!O G!>&4,+2!(6/919-6%322!3% 3% 2 0 ,&! $%4U1C.@RP^]fbhae V] ER /F ; 4/17 AL8Xqaf f1b?Z%PG>`:, 9 = CKPQMD8-%!"'-266"7f%6& 3& /'$)Q "cY8 ^%"9(+-$2-69+<;#<75/*!   2Kcw %<-T@iRyv\n{_b|ZV%yPJ1tC=9o50<l'%;i3l(u % 4F0V)[f8rrFk{PF}VzX rUrdP!:SJ-@F40C2!@*??><:! 56/T'$t09@CC@ :2!'x-J6 $: 83,:.6l+;,73/>" LZ e#]m%-o mha@Z eS N JH$sG3OFA*FF D@?38#. 5$ Y'8HWc)k3l <hjC\&MGK82G5HA"P8N*D#25 L f !8] wuoUr 7!' 3>FLVa )it9ieF_U*MKC4O738M/$5G9-AY ;7 2.)$ `5%&&0!;"E,L5O;=M>hG=>:45*/")$! #.)t>1bK8_Q>hNAwEB5B%Az#@c* >F3<,=:E8%H7@G4kB28..)&$!   #)06* =KDJK#S] iSv ,6 7 -"7|S mt _SJ(C1?3</:&73,l$E##3 G X df]$J/ 47( 9U9876H7e:h>Q&C$/J3Q3Wn/]E'b6e?e\ e db$a0`:]k8WK-O= DE:e1+ '''),6.#B6HAGM@Y]6`2,b$]TF9/$+-+21281?-D *G &F %B#= 5-*"48 7 6;/ NA0mM@PKIQ :N)D 4- #6:t 8E 0!&A n  # .8 >B C D FX L2 U`l$t.y*5uM7io3U*@$--"2f!1C++"<" Qdq t -lU[C# ,(* %!!   
&+(!# $ .JK1    ( 7FU$_* d,b|)YX$ K6 ;!*%..;:F FP(MWGM_hEf7m&qtw q[mAf2]3TD M`G,BE@ ^@p?(w@FqCY_F cFLf.SgZn ]~ ZP#?3+D :Q F[ 4b,d5c=y^A/R?B8./-BgQE] )dg ea< \cW$R1L9F<A9< 38 {*5 W 2 D, K$ n    *&  ,6/<C=S ;9]D/^-$X,L;=Km.[5!hpr0mT%_w3JD3Viaz<&29z <n&8e) /^("Y&Y&& Y''[)"]/`4b6d5e0e&EauZQ F=8$8;=YVH .kRvZu^!hZ-3RN;_;=H'*RV S J@)9 46A:NAYFaFd?`0~V!F2  !$,)5 -(9949N;3k<)7".%#(x*L* '#  $6lH &fW ;p`Qe `e da]ZLM7=#,  ^ ; '/4 78 :;; 96/(C.!q@MSP$G, :4);?A~?U=4<> CK$&T:3^Q: ea8 he/f["_FT/ F7 (#/;DIJGC> 903K*j   )<!N'Y-[2X3P3vK3KG5&E; DFBT=c,4oT(w~woaN9&s_ \(i5@HNP'O5J?@E2Di#=E,1' ;$FN R S*U>%UQ4U`CRjONsXF];_/_$^]["!Y. $U>NM)DZ?9`T.bdw%`mL\k/Yc&WV-WJ<X@GX:FW7:W9* T=QBLGEL;NI1N&MJ F$ C$>82+L '! (.:&L1+`N+rj&~q pU[8D- "/,B5P>-SELILo8Q%TTRN|InBo: 1 (     S$4B JKWE ;32_::M-e" 8})W+"&/7~8i 1-S&7?.1' # " }$R(<.<7PApN\!g& p* v+ {y+j{*d&j!%x+.,t &Y=&| ^?%#8Oc)o>rRj`\eIa5S$@/- @HE7' '8 I V]^\ZY[_'d0@i1im+o o m jfXb"0_#_"c *kOw)w ;IQQ|JU?.x3l&bV K> 1&&B ` x !#4o+GU1Y83g/p% s q. mlgda}^YP/CM5br(!+ G f&}/6o8Lt2.]%G 4 *$ $% #}0p=[GAL *K E ;&1P(" !!#"'#.%6*?1\H8)RA ZH_M'_PDXRcLS=X.`$kuzV"8&+ ,*&#g #H$2'-:h-T3 j6x#5:.1%~}~~oZD31a[!1  #+ )V,+m&K5* u*F- *'044/a:("9'2(% &! W(- ;!G" Q",Xb[WN!+A,L/;KL. ]k!s.u9r=n<h7c/_(["TJ=0%&4 F $Y'j'v(%|/"{/rv.fp-]j-\g,"ce,$se*$e"edcccdgmyl D$  . 9< 7,3Mi hB z!  y"l 1[=G)C59D%J?R7Q-J'A#&9#:1&M)*] - g/o0y / /.).;0 I4 N<HF9N'TVj TC N"G0 AE!?U%=_!?`B]FTHLLDO?Q=Q>OA JEAL3R&$Z(b% h l!k+#h>.bW4\n3 Y{,\}% bv!(kl$*tj/$xx?uOi\Va?]( TI C C VK3X !$f(*"n*1%m$6%b8 R8A6 6441</ K. \1?j8]pAxnMeXqXaUMg7FhD eF_JVLLJAH4 G'I 1OQWr`fiij\l$;o7sK vZx_y\yR{F+9D1b/38;90#k J|.wv wz~ "#| rd(S:BK /0UQ!UxN(A139%>?<4'(5COmY>]Y N(>,23.=G6HY1H_'?S1="&$-"2)03></SA&b<f._ N7"    1(E;9VaG bPhRiNeE`9[.V%SnRaRcSqR N"G$>g(3J-*-2%6# 8&;7+b32-7';#;!9!4"-"$a!KN h "d#A0! 6ENV'_8\ HPUU=]j)!_].]7 8Y=Q9F-:,   $p,H2 #&6, 9/:) 9' &65N2H ,]'r! 1S&d~2ei:W$P=<+9<1*<5$=6'A3.G,4M#6S1V(Y Z Z Z X Tq M: D !: +23 /.[). 
27 9'9^356:0;+5 ((-&<#+D:ED@B<68#7764 /* 'D$'t*,*3&=FMO M E:},}& 7K ,s^;_mDGtB3q6(e&(Q-: 0$+# - H d3|l}& -g,DK/91.(%1 :)@v8BgE=ZK2QI%T?j0 !   -CYxk?v| |C{#yy4vDnP5` WRMWV5RG!I,=2*(&!D'7f-P6d>mD~gEsTxAt<\:%E17*2$5"? J{TT]1d ino ni#_ &S&H"@<> A2DqC= 4 K)#a "%K &$'+*2/54y1:w&=r=]9H2-)"/M86-7p j!65J^.l>rJlP[QBN+ID?= ==?}'@Z* @=&=)9"3' +< #_ #>6\M!Y]+/f3g5`2WU-"J(A*91 5> 2L=0X_-]']UJ> 2)|"X&9,".,* '5$$X;*>/; 466<,5@2&- &"+h8ZCdIJGC =0782%,#'z3D;!> <=3b& %0%i68D7K!3[-f&n#!tJyt~")0{6l<Z?Gd?7G>-)9'1& *($**F&(&!" &d.8 A kK  SWLWTMD ;3,&!  %$0 ,7 := ICXPdemokd[S%L6JXLI-cG%wD8 ?I5R2p(OLTBh9/}    '14+f 9 ;d,?.O>XHZ^GU5;K*@5 "-*+,+%0$7(6>1I B0UC&V @M:;4(21 1 0$+9 Pf y 5S(s.+ !T-*; IpQ\PPF 5L5RR$m^}l zxg O=9C|R}][J 5  ziZ RP R!W#[ ]Y R H< 2*,Z,2>ON`cp Y|2~nt9fUD1 6I ,] 'j)l"0c"79N+MC71\J"2^M.SK%?E*9* !5 CF> /   !N #`. E7B=Q@ $m@ &=!5( *;!Gr* O_.QL(M@#E>,8M 5)m=EMROUkUeR:NI%F_3F'?J IPOXT\# S](3M\(E@Z$S.XRYF_4e#jnm'!f,/\.@O/RA.d50y,5&<!E KL F,;5-y8 L6%2#//,/4#e43.?(9M@4]BIm?3z8/'e{ /n ]"J+6/#*")  Lw%/ 7:q#9H45)B/J*M" KE?  ;60 ( 2E Jycq r  h )V=CN3rZ*=^)Y.P2#D5(95*603']'/.1 5: <]9#B/49"DDLa L"C(5'% $  6 HRU$V Q1+I9 @: 823&24 <FQZ#b*'i+.k *+m+#o2o)B m2Zi8vb9X8L5>2$//w*!,T),4",/4>JY h* vT  zocYSxSbVZZY"[[ W\L^:d (r  2Nj ~ e,G>,:$ .6=%@.& ?i!;4,g" w :* <N\&,d*>e*Jb$LZ>S+M K J I)F'O?85G)LG7' \" 7+ !5@&K*6T5E\>M]DLYGIOHJ@JV/PqY fqw-wJoZk_1M 9'#-s2 `0 T'$PEQ eX `is*e:CD 'D:&,+3N<y <{5e)N;. s& H%))# 2! ,=,HJ5iV =`BdEcFZHMKy=Q{.Y#`fg`S@Y+5%5 GZ ir u#q*g4YHJn< 0*+2 = LZvc od}[J)42 78]59 1/!0.49: >>;@3?);"7 !5(576 I8Z9b8,`65Q28;-6.&$0H(g$ %)1: A FJ7!NH"QKT= UTQ OpO@Q W] b!"b;'[W (Or '?%/$'#&#.#; !IL U [Z S|I-<0~ &; " +7DQY ^) \U U I :+xA&$ ?*U] +y &'pVf'e m}(/2Rl5yY3J+ > 47H2 [2j1s0u-r(m!d+[7Q=E#>7.<':9C6I5oK3YJ/FH*6H"*+L2%Q1"X '^b a \TJ A:%622< +<!5$).32-&',#"0$91(T-20k"E;yWHzbSpfX]aRFUB1C0"2%  $ +13 5d8 : =@C$E/Fm6E77 D0D%EIHL O P%M6FE<MK 2F)8#(!%'-B :^*Gw:SH\uO``N_DFZ+:S/I&A";,"6E 1f . 
( &E K5$ 1=&A7I?E-8J.1H J+>q'1#& "v-Q?+ Rcl j Z#D0-:(<4 8<0: ' 0 _#" *o5 =?r=- 6:.q@"+=:1-&-;2$>9,7?1*@/=%9668/=QBxFGGFEyERH/JMOM HI ? 4*!&  '1p';@ 1C, 7G57FS0Cu$>: 9:;j<Y 9V1`%r%.6 >A<1""%& (*("5 Y1bA IK+K8BD5OG%X%`g'i3f<,b;G]5eX/W0Z8`@g@o7ut*ew^{b~ n/|eozRt"6kf baP`#!a6aM^`YgP"cC&V5#E)6!+$" *16?7e3(% .1, "!   V ,+ 3 5(1 N*&u)+ -0z3Y6L 6[ 4 1 . .19F Usf<r(x/w/o)d\ Z _gptsj`V P N S Zci n*o0m. k(h% c/[JSwKC> <@ <I;?9)7&50 7=>+KG>W$SC]&]0Z eNi9j&i[fCbK[pTLF/B.B#B$R+C_-Dg,Bvj)??f(9\'3M(/<&.-30!=MKgZ|#g+ lx3gm9&[k;8Jv:I:6Q/0N-)B4#0DXl| %s,vZ2hA7 [+8R7#Q20S,6W$5Z0-WJ O_ Ch 7Z /E1**< BNZc fv;+dp2S4;{1%l-Z)I&%?$4=E#?D%CR>eq:xF ;D4 Q]/%`V8UoIAUO+;J%9'  GAT'G5A;THTQCcCjGyG~CTTCG GACGACTCT#A* G; A@GLG[AcTrC~"C C#C#A#T#T(T(T3T3A3T-A-G-T 8T,(T8(TD(TN(AZ(Th(Ct.T.T8G*T*A8A-T(A(G(A(T(G*T*T #T+!A7!GG#AQ#T^#Tj.Tw8T8T(C(G(T(T"G!T!AAT T A T& T3T@TMCY0Tf0Tr0T~0A(TTGTTGAAATT A G-!T8%AE(TR(C`8Tm8Cz8T8G*G!GTAAT!T!T*A8T0C,A'*T3!A@!TNTZCg Tt C TGG*A,A*A*A*T*G*A*T *T&T#&A.-C=-TH-AU(Tc%Cn%Az$C(T#A%G&A(T(A*C(T-T&C(A*T%A,(A8(GG(AR(T_-Tk8Tv.A*T*A*A+T8C8T+T3T-A(T (T (A (T '(G 3,A @,A M8A ['A g(T t#C ~$A $T (C *T *C *T 2A 2T ,T /T ,T 8T 8C !,A .,A ;*A H*T U(T _(A l(T z%T %A (T (T (A -T %A (T #C #T #A #T (C (A ((A 5(A A,G N,T [$T h$T sC T G !T !C #T #T #C (A $T &T %A %T %A "*T .*C <*T G*A T#T b#T m#A {*G %C *A $T $A $T #C #T #A *T *C *T *T !*T ,*A 8*T E8C R*T _*T l*T w*A *T %C %C *C *T *A *T *C *A "C"T "A*T%A2 T>CNTYAfTsC~!ATATGGT TC A T CT( T4 G> TL TT CeAu Az C CG A T CAG ACTCG&A3TD TT C[ Gc Cn Cu A T C G C C T C T A A C G G, A7 TE GSGXCf CkGxCT C C C C C T C T C A" T, A=CECR T` Cl G~CTCCCC T CG ACA T C' C2 C< CI CW G`TkC|TCGCC A CCCT ATCC$G2 CB CR C] Ci Cu T T C A T CACC C C C C C% T6TCARTT Cc Cx ACACC C T C A C C C C C+ C3 G; CK A\ Th CyGCGCAC C C ACGACC) A2 C@ CP C[ Gi Aw A G AACCGCC C TT A C( T9 CC CM CS Ae Ax GTACGCC C C G A C C T(C8C?AVT_Ch Av CC C TATGCGGTAC C A& C6 T? 
CLCZC_AtC|CACACCCA G T C C T/ A7 CATR T] Tf Cn GCCCGC A C A T CGG" C1 CC CS C_GoCy T TCAGAC AGC T C C$C6ALARCcTi AxC~G C AACCC ACGCTT$G7TH TN C\TeTGTTCACACTCGA A T% A3C: TICMG]AmAwT CTCTCATT AC T C C - G : C C G U G ] A l C uT C C G C C G C A C C T!G! T!(G!8C!=A!FC!NC!_ A!sT!T! A!A!C!T! G! T! G!T!A"G"C"G"'C"5 C"= T"QG"WA"l A"rC"C"G" G" C"A"C" C" T" C" T#G# A# T#& T#- A#< C#HC#PA#\ C#k T#T#C#C#T# C#C#A#C#C#A#G$C$A$ C$4 A$@ G$O T$X C$fC$uT$A$T$ T$A$C$C$G$ C$A$T$G%T%C%G%#C%5T%9C%LT%RG%TC%`T%tA%A%G%A%C%A%G%T% G% C% A& A& G& A&$C&/ T&? C&L T&\ G&c C&wG&G&T&C&G&C& T&C& T& G& A'C' C' C'' G'/C'EA'JT'YC'f C'q G'x C' C' A'G'G'G'C'A' C'C'T( C( T(' C(5 A(;C(BC(W C(a T(k C(z G( C( T(G(G( C( C( A(C(C(C(C)G)C),C)5 C)F C)V C)d C)o T)| C) T)C) C)C)T)G)C) C) C*C*T*!T*&C*9A*GT*T T*eC*vC*C* C* C* A* A* A* C*C*G* C+ T+ T+1 T+9 C+I A+W A+a C+xG+G+G+A+C+A+ C+A+ C+ C, C,C,T,!C,:C,@G,MC,dG,zG,C,G,G,A,C,C,A,C,A,A,C-T-'C-(G-=C-KC-PG-fT-x C-{ G- G- C-C-A- C-C-A-C- T- C- A. C. A./ C.= C.GT.S T.c C.m C.y C. T. C.C. T. C. C. T.T.C/ C/ C/" C/. 
C/> A/G C/W A/b T/n C/{A/C/G/C/ C/A/ A/ C/ C/ C0 C0 G0 T0.G09G0FG0[A0cC0hG0yG0C0 T0 C0 T0 C0C0 C0 G0 C0 G1 G1 C1, T15 A1C C1V G1\A1hC1xG1yC1G1C1A1A1C1C1C1 C1 C2 C2 C2& T2/ C2; G2L C2\ C2c G2i C2s T2 T2C2C2C2C2C2 C2 C2A3A3C3T32T36C3IC3TC3b A3r C3 G3 G3 G3 C3 T3 C3 C3 C3 C3 T4 C4C4G4 C46C4C C4P C4_T4fT4yA4{C4C4C4 G4 C4 G4 A4 G4G4A5 G5 C5 T5( T57 C5B A5N C5b C5l C5w G5 C5 G5 A5 A5 C5 C5 A5 C5 C5 T6 C6 C6' C62 C6;C6JC6Q T6bT6nT6 C6 C6 C6 A6 A6C6A6G6C6A7C7 C7GSIGN=A=42,C=41,G=25,T=111 SPAC= 12.74 PRIM=1523 MACH=377-96 DYEP=DT {BD Set Any-Primer} NAME=IIABP1D4373 LANE=50 GELN=Gel File PROC= RTRK= CONV=phred version=0.990722.h COMM= SRCE=ABI 373A or 377version3.scf100644000766000024 36676614006427353 16731 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data.scf7RX03.00lpf0ns">@G\snHB9*#         #!# " !"$'$%# $,22-3:<$#6876 742*%,+,*++,*:?A?>N8><41 /22--0+' 9OVQO]&PTTO ,/-(,MXM-%+- :<>7':7961  DDJFJEJML"GHC7 7-13* KX_VTsBX[TL $**' @QM6!##0=@964<BB$BDC>~%@DA>.-)' 8YgX4"H\ZB&(&,23/,'.1-%#$'% %'&##$'#7LN? 98:0#0786*+/)"(*+!04766662&-*' 1HN>+CMB-&AJB( !  032*$+./. ,64(# -,*$ 003-%'++%($& !""+0- "'*'.-&"$$*)($$#     )'"3J?*FB8,!:3.!31*$ (()%&"+DA#  #-+# (&&&# " !()%$<JG5*55&+53# &#  %&"""''%2<7  $AD#  "''!",%! !-*# %&'& $  )0*:=$                                                                      rRh|{yqrw|A35@D@=9>k#TUEBJIMOM?"$&   /A<75=@A3041358<B1tQcK@8%  D^^ODb xAg~{^Q6w"2?NelX      ,dcdWA$I]ou@           GJRML s9<YXA         MNF3+tx9AE@9CKKK'qAMUV)       1DB95 $45-(2400;;7     >A>94<DDC ;>A73v)?KQ??9&~%?B4           ahcMEMZDSZWV $AA;325+;\i`<j(?SMP8      $ggg_A50]`g]V .<8& +94!  (10'%()-4            $6;7,'5AJ4 " (1$.ED1 3+& "-,     +52#  !               &&$  #*/('"!      !"   +%%*  %            -*#        %! $    $#        $>D7 3GI4#'2! ! 36)                                     !                            #"        "                           #" !                    !          
,$                                        *                                             '                                                                                                                                                                                                                                                                                         )                                      $   &       &#                       #!   )"          '%    &   ,     "    ( &        #       +                                        3,     2#            "           " !TYKi| +49,kukTde7[eN2 &'%##!%++$* /,42-   B>9)&05:S- ,6,$07%0.,#"%+-. %! "28930/_zuR&@ENK<#?@;: 8>>=%`ba[!<UQ`_UL%&$")*,  djkZZh^afaM 3#$  #'()BA=69;?>/ 7VTKE*l-fIMUN. Hgnd^4Rei`X%5>@Cxo-C@80#$%#! OQXT:\f-G[U<"%    MSM@5HVz}],&#"HJE<ehD@?3%  %"!%     ))$      "$ )42(# %.(                                                                                               qs[(,CN/ "Spvw4xZP  (+ (/ -LB-w}"7FL "  "## #)*"NvuI}NhkS%@ZU<'RgcF/2:9+ '48@3?EG   +MQ?' 1JQD$6NQ=JefN ,1;98JOJ6.mo8EKJ HHC6yu I|Oy~*4CC6 @>7./6501.0/ ,26 ""  _d2%,^gI!),E?12cM.">wz]8p~q8{|Gqua0'()'  ,\R-" ,36  6/! )Ln5)':=-<YeQ  !1=6#nUHmxf</33+$=>;. 2xc(v#jC1eor:CNO,QI6mp :JE1*8@) 3BE7O #"7ZG "'(   3.'"(% 72+#(.-,#;9/'(/8:%$# !  (0, +?KC.3?:& !"580-)#771( ..+% *)$PVN! $%!+9=/n "*/& /?@11@;)&   )$"9</ "04)  473.'42.. .;5'.)$!>=1,210+,&  570* *0]_D   762%+*(#!! 6=8 &24&1-#  ! ?JA BKF*  )';LM<$!%$##   *,$   310&#(*+! 82+ &GSP) 36- 6;3/64!   ;=($0;9  ,% 4<8"   (&"    !     #+% #8:"       #%               */'      $%""#                                                                             '5;HQcjy~#*;@L[cr~ ,8DNZht +7GQ^jw &3@MYfr~-8ER`mz'3@NZgt #.=HUcnz,8GR_kv   ' 3 @ M [ g t ~  ! . ; H U _ l z    ( 5 A N [ h s  " . < G T b m {   ! , 8 E R _ l w  %2>NYfs~ (4>LTeuz&3DT[cnu,7ESXfkx ",=ER`l~ '2<IW`k|$2BR]iu%6CRTcx+3;K\hy)2@P[iw(9CMSex (8?V_hv&6?LZ_t|/7AR]fn"1CS_oy$6LRcix~$7HN\e %3:IM]mw    - : C U ] l u !! 
!(!8!=!F!N!_!s!!!!!!!!!!""""'"5"="Q"W"l"r""""""""""## ##&#-#<#H#P#\#k###########$$$ $4$@$O$X$f$u$$$$$$$$$$$%%%%#%5%9%L%R%T%`%t%%%%%%%%%%&&&&$&/&?&L&\&c&w&&&&&&&&&&'' ''''/'E'J'Y'f'q'x''''''''''((('(5(;(B(W(a(k(z((((((((((())),)5)F)V)d)o)|)))))))))***!*&*9*G*T*e*v**********++ +1+9+I+W+a+x+++++++++,,,,!,:,@,M,d,z,,,,,,,,,,,--'-(-=-K-P-f-x-{-----------. . ./.=.G.S.c.m.y.........///"/./>/G/W/b/n/{/////////0 000.090F0[0c0h0y00000000001 11,151C1V1\1h1x1y1111111112 22&2/2;2L2\2c2i2s22222222233332363I3T3b3r33333333334444 464C4P4_4f4y4{44444444455 55(575B5N5b5l5w5555555555666'626;6J6Q6b6n666666666677 7 #3-(*8((!#!0 %*,!*,***&-%#&((%((.**-(,,8'$2,,*(%(%#((((%%*#*$#****"!                                 " #(((80 -%$*&8#(*8#(!#*%#*8%%**"                                                             -8((!(" 8*!*%((,*                      ##((33-8((((..*-(**###.88((! 000(!(88!!8*! ***&-((((-*(-8*+8+3(((($**2,/,8*((%((-(##,$$!##$&%**##$$#***********"*                            GATGATTCCGGCTTCGGACGACTCTAGAGGATCCCCATTTTTATAGTTTTTATCTTGTAATAGATGTTTAGATTTTTCGTTGTAATTATTTTCTTTATTGTTGAAATTAGTATCTCTGGGTAATTTATCATATTCTCTGGAAAATGATTTACTATCACTAGATACTTCATAAGATTTATAATCTTTATTATGAAAATCATCTCTATTTTTCAAATTATTATTATATCTATCAAAGTTTCTGTCTTCATTATATCTATTAGCATATCTATCTTTATCTTTATCCCTATCACTATATCTATCATATGGTTCATCTTGTTCAACCGATCAGACTCGATTCGCCATCGCCTCTAACGGATGGCCGCTCCCCCTCTCATACCTCGCTCCCCTCGACATCCCCCGTCTCGCCACCCTATCCGCCCCCTTCATCACCCCCCCTTATCCACACCCTCACCCCCCGCATCGCGCACCCACGACCACCCGAAGAACCGCCCTTACTCCCAAGTACGCCCCGACCTCCATCACCCTATGCGGTACCACTCCCACCACACCCAGTCCTACTTTCGCCCGCACATCGGCCCCGCTTCAGACAGCTCCCAACTACGCAACCCACGCTTGTTCTTGTTCACACTCGAATACTCGAATCTCTCATTACTCCGCGGACTCCGCCGCACCTGTGCACCATTAACTGTGTAGCGCCTGAACCGGCACCTCTGATTACCACTTCCTCCACCAGCACAGTCCTATTACCGCATGTCGCTCTGCTAAGACAGTGCAAGACTCTGCGGTCGCTCTGACCCGCATCCGCCAGGGCACCTCTCACCCTCGCTGGCCACCCCGCCCCCCTCTCCCTGCCCCTTCATTCCCCCAAACCGCTTTCAACGGGACACACCCCTCCGCGGCGGACCACAACTCGCCGTCGGCCACCACTCACACCTTCCCTCCTCCTTCCCCCACATCACGCCAACCCCGTGGGACGGCTCTCCCGCGGCTACGACGCGCAACCCCCCCTCGCCGCTTCCCCCCC
AACTTCCCACGGGCTCCCCTCCGCCCCTTACCCGCGAGGAGCTTCACCCGCGAACCACCTCCCCCCTTTCCCAACAGCACCGSIGN=A=42,C=41,G=25,T=111 SPAC= 12.74 PRIM=1523 MACH=377-96 DYEP=DT {BD Set Any-Primer} NAME=IIABP1D4373 LANE=50 GELN=Gel File PROC= RTRK= CONV=phred version=0.990722.h COMM= SRCE=ABI 373A or 377rpsblast.bls100644000766000024 1305314006427353 16752 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataRPS-BLAST 2.2.18 [Mar-02-2008] Database: CDD.v.2.12 24,291 sequences; 6,026,850 total letters Searching..................................................done Query= orf10 (129 letters) ***** No hits found ****** Query= orf20 (765 letters) Score E Sequences producing significant alignments: (bits) Value gnl|CDD|82074 PRK07003, PRK07003, DNA polymerase III subunits ga... 41 0.001 >gnl|CDD|82074 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau. Length = 825 Score = 41.0 bits (95), Expect = 0.001 Identities = 51/212 (24%), Positives = 65/212 (30%) Query: 101 GTGTTTCAGAAGATATAATAAAAAAAATAGAAAAAAAAAATTGGAAACTAATAGCAATTT 160 G GA+ A A AA AAAA AA A AA ATA + Sbjct: 384 GARAAAAVGASAVPAVTGATGATGAALAPKAAAARAATRAEAPPAAPARPATAERGDDAS 443 Query: 161 TATTAACTCATAATCATATTGATCATGTAGGAGGAGTAGAAGAAATTATAAGAAGATATC 220 A A A + A A +G A +A +A AA +A Sbjct: 444 DGDAPVPAKANARASADSRCDERDAQPPADSGSASGAASSAPPDTAFESAPRAAAPSAAT 503 Query: 221 CAAACGTAACTGTTTTTGGTCCTGAAGAAACAAAAACAAGAAATGTAAATAAAATTGTAA 280 AA A + AA + AA A + ATG AAA G Sbjct: 504 PAAVPDARAPAAASAEDAVEAAAPAAPESRPPTPAAGAPVSRATGAAAALDVLRNAGMRV 563 Query: 281 AACAAGGAGATGTAATAAAATTGTTAAAAAGT 312 ++ GA A+ A + T Sbjct: 564 SSGRGAGAARPAAGPAASPTKPAPRAQVSVPT 595 Query= orf30 (225 letters) ***** No hits found ****** Query= orf40 (858 letters) Score E Sequences producing significant alignments: (bits) Value gnl|CDD|33678 COG3889, COG3889, Predicted solute binding protein... 47 2e-05 >gnl|CDD|33678 COG3889, COG3889, Predicted solute binding protein [General function prediction only]. 
Length = 872 Score = 47.3 bits (112), Expect = 2e-05 Identities = 33/147 (22%), Positives = 48/147 (32%), Gaps = 3/147 (2%) Query: 505 TTGGGTGTTTTTATTTTAATTTTATTTTATAGTGTAAAGATGAAGGGATTAATTGGTTTT 564 G T + A TT G TT+ T Sbjct: 721 VKIGQALTVYGSLEVFPAGENWGFIPTTK--RVKVRIMDPASGTGTSITTSGTFTAEVPQ 778 Query: 565 TGTAAAGAACTTACTTTACAACCTTTTAATCATCCTGTATTTTTTATTTTTAATTTTTTA 624 + T +A + T+ TT+ T T T T++ + TT+ T+T+TT Sbjct: 779 SPTKTETTLSYSAYSNTSILIE-TTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTT 837 Query: 625 TTAGAATTAGTTTCTTTATTATCAAAA 651 TT+ + TT G C AA A Sbjct: 838 TTSPSQTTTGGGICGPIVIIVGLAALA 864 Score = 45.4 bits (107), Expect = 5e-05 Identities = 30/134 (22%), Positives = 44/134 (32%), Gaps = 5/134 (3%) Query: 522 AATTTTATTTTATAGTGTAAAGATGAAGGGATTAATTGGTTTTTGTAAAGAACTTACTTT 581 A T + AG T A+ GT+ TT T + Sbjct: 725 QALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTF-----TAEVPQS 779 Query: 582 ACAACCTTTTAATCATCCTGTATTTTTTATTTTTAATTTTTTATTAGAATTAGTTTCTTT 641 T + +A T T+ T T T T TT++ + T + T T+T TTT Sbjct: 780 PTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTT 839 Query: 642 ATTATCAAAACCAA 655 + + T C Sbjct: 840 SPSQTTTGGGICGP 853 Score = 45.0 bits (106), Expect = 8e-05 Identities = 28/132 (21%), Positives = 41/132 (31%) Query: 105 AGATTTATGTAATTTTAAGTTTGTTGAACCAGGAAAAATTGTTTCTCATTTTTGGGTTTT 164 G T G+ T A+ TGT+ T T T + T Sbjct: 723 IGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTK 782 Query: 165 AAATATTGATTCAATCATATTTTCTCTTGTATTAGGATGTTTTTTTTTGAGCATCTTTTA 224 T + A + + T+ T T T + T TT+ T + T TT+ + Sbjct: 783 TETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPS 842 Query: 225 CACAGTGGCAAA 236 G G C Sbjct: 843 QTTTGGGICGPI 854 Score = 42.3 bits (99), Expect = 4e-04 Identities = 26/96 (27%), Positives = 39/96 (40%) Query: 368 TTGTTTGGGTTTTTTTAATGAATCTTATGGATTTAATTCCAATTGATTTTTTTCCATTTA 427 +GT T T+ T T + T T T + + T+ TT T T T TTT+ Sbjct: 759 ASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTS 818 Query: 428 TTTCTGAGAGGTTTTTTCATTTTCCAGCAATGCGTA 463 + 
+ T + T+T+T TT+ CG Sbjct: 819 SPSPTQTTSPTQTSTSTTTTTSPSQTTTGGGICGPI 854 Query= orf50 (189 letters) ***** No hits found ****** Query= orf60 (387 letters) ***** No hits found ****** Query= orf70 (600 letters) ***** No hits found ****** Query= orf80 (270 letters) ***** No hits found ****** Query= orf90 (987 letters) ***** No hits found ****** Query= orf100 (624 letters) ***** No hits found ****** Query= orf110 (435 letters) ***** No hits found ****** Database: CDD.v.2.12 Posted date: Aug 13, 2008 12:02 PM Number of letters in database: 6,026,850 Number of sequences in database: 24,291 Lambda K H 0.320 0.134 0.503 Gapped Lambda K H 0.267 0.0579 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 24291 Number of Hits to DB: 98,336,059 Number of extensions: 7540086 Number of successful extensions: 79485 Number of sequences better than 1.0e-03: 11 Number of HSP's gapped: 59896 Number of HSP's successfully gapped: 1457 Length of database: 6,026,850 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 86 (37.2 bits) empty.bl2seq100644000766000024 120514006427353 16642 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataQuery= (468 letters) Lambda K H 1.37 0.711 1.31 Gapped Lambda K H 1.37 0.711 1.31 Matrix: blastn matrix:1 -3 Gap Penalties: Existence: 5, Extension: 2 Number of Hits to DB: 0 Number of Sequences: 0 Number of extensions: 0 Number of successful extensions: 0 Number of sequences better than 10.0: 0 length of query: 468 length of database: 2400 effective HSP length: 9 effective length of query: 459 effective length of database: 2355 effective search space: 1080945 effective search space used: 1080945 T: 0 A: 30 X1: 6 (11.9 bits) X2: 15 (29.7 bits) S1: 12 (24.3 bits) S2: 9 (18.3 bits) GO.defs.test100644000766000024 310714006427353 16523 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data!version: $Revision: 1.1 $ !date: Fri Sep 20 14:08:27 
GMT 2002
!saved-by: gwg
!autogenerated-by: DAG-Edit version 1.311
!
!Gene Ontology definitions
!

term: cellular_component
goid: GO:0005575
definition: The part of a cell of which a gene product is a component; for purpose of GO includes the extracellular environment of cells; a gene product may be a component of one or more parts of a cell; this term includes gene products that are parts of macromolecular complexes, by the definition that all members of a complex normally co-purify under all except extreme conditions.
definition_reference: GO:curators

term: ascus
goid: GO:0005627
definition: A sac-like fruiting body (ascomycete fungi); contains ascospores (typically eight in number).
definition_reference: ISBN:0198547684

term: lipid particle
goid: GO:0005811
definition: Any particle of coalesced lipids in the cytoplasm of a cell.
definition_reference: GO:mah

term: spore wall (sensu Fungi)
goid: GO:0005619
definition: The specialized cell wall of the ascospore (spore), as described in Saccharomyces.
definition_reference: ISBN:0879693568

term: membrane fraction
goid: GO:0005624
definition: That fraction of cells, prepared by disruptive biochemical methods, that includes the plasma and other membranes.
definition_reference: FB:ma

term: microsome
goid: GO:0005792
definition: Any of the small, heterogeneous, artifactual, vesicular particles, 50-150 nm in diameter, that are formed when some eukaryotic cells are homogenized and that sediment on centrifugation at 100000 g.
definition_reference: ISBN:0198506732 testaln2.arp100644000766000024 220314006427353 16631 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data[Profile] Title="An example of DNA sequence data" NbSamples=3 GenotypicData=0 DataType=DNA LocusSeparator=NONE [Data] [[Samples]] SampleName="Population 1" SampleSize=6 SampleData= { 000 3 GACTCTCTACGTAGCATCCGATGACGATA 001 1 GACTGTCTGCGTAGCATACGACGACGATA 002 2 GCCTGTCTGCGTAGCATAGGATGACGATA } SampleName="Population 2" SampleSize=8 SampleData= { 000 1 GACTCTCTACGTAGCATCCGATGACGATA 001 1 GACTGTCTGCGTAGCATACGACGACGATA 002 1 GCCTGTCTGCGTAGCATAGGATGACGATA 003 1 GCCTGTCTGCCTAGCATACGATCACGATA 004 1 GCCTGTCTGCGTACCATACGATGACGATA 005 1 GCCTGTCCGCGTAGCGTACGATGACGATA 006 1 GCCCGTGTGCGTAGCATACGATGGCGATA 007 1 GCCTGTCTGCGTAGCATGCGACGACGATA } SampleName="Population 3" SampleSize=6 SampleData= { 023 1 GCCTGTCTGCGTAGCATACGATGACGGTA 024 1 GCCTGTCTGCGTAGCGTACGATGACGATA 025 1 GCCTGTCTGCGTAGCATACGATGACGATA 026 1 GCCTGTCCGCGTAGCATACGGTGACGGTA 027 1 GCCTGTCTGCGTGGCATACGATGACGATG 028 1 GCCTGTCTGCGTAGCATACGATGACGATA } [[Structure]] StructureName="A group of 3 populations analyzed for DNA" NbGroups=1 Group= { "Population 1" "Population 2" "Population 3" } NC_006346.gb100644000766000024 551714006427353 16040 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS NC_006346 21657 bp DNA circular VRT 12-JAN-2005 DEFINITION Bolitoglossa n. sp. RLM-2004 mitochondrion, complete genome. ACCESSION NC_006346 VERSION NC_006346.1 GI:53686544 KEYWORDS . SOURCE mitochondrion Bolitoglossa n. sp. RLM-2004 (mushroomtongue salamander) ORGANISM Bolitoglossa n. sp. RLM-2004 Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia; Batrachia; Caudata; Salamandroidea; Plethodontidae; Plethodontinae; Bolitoglossini; Bolitoglossa. REFERENCE 1 (bases 1 to 21657) AUTHORS Mueller,R.L., Macey,J.R., Jaekel,M., Wake,D.B. and Boore,J.L. 
TITLE Morphological homoplasy, life history evolution, and historical biogeography of plethodontid salamanders inferred from complete mitochondrial genomes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 101 (38), 13820-13825 (2004) PUBMED 15365171 REFERENCE 2 (bases 1 to 21657) AUTHORS . CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (01-OCT-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 21657) AUTHORS Mueller,R.L., Macey,J.R., Jaekel,M., Wake,D.B. and Boore,J.L. TITLE Direct Submission JOURNAL Submitted (18-AUG-2004) Museum of Vertebrate Zoology and Evolutionary Genomics, University of California at Berkeley and DOE Joint Genome Institute, 3101 Valley Life Sciences Bldg., Berkeley, CA 94720-3160, USA COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The reference sequence was derived from AY728235. FEATURES Location/Qualifiers source 1..21657 /organism="Bolitoglossa n. sp. RLM-2004" /organelle="mitochondrion" /mol_type="genomic DNA" /db_xref="taxon:291262" /common="mushroomtongue salamander" tRNA 1..64 /product="tRNA-Phe" rRNA 65..96 /product="s-rRNA" D-loop 152..216 /note="putative control region" ORIGIN 1 gcaagtgtag tttataaaaa catagcactg aaaatgctaa aataaatatt atatttcact 61 agcaatagat ttggtcctaa tctttttatt aattataact ataattatac atgcaagttt 121 caccacccca gtgagtaagc cccacatccc cgcttagtga tgaaggagct ggtatcaggc 181 atacacaccc aaaacaccac gcttagccac acctacacag gaactcagca gtaattaaca 241 ttaaaccata agtgaaaact tgatttagta atagtattta aagttggtaa atctcgtgcc 301 agccaccgcg gttacacgaa agacttaaat taatatatgc ggcccaaagg acagtttaag 361 gttttatagg aacttaagag cctttaaggc cgttatacgt ttaaaaggtc aaaaacacaa 421 atcaccccac caacttgaat ctgttaaagc cgggggacac actgggatta gataccccac // cysprot1b.fa100644000766000024 261214006427353 16633 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>CATH_RAT MWTALPLLCAGAWLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRN 
HTFKMGLNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCW TFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNG QCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHA VLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASYPIPQV >CATL_HUMAN MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGK HSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWA FSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES CKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLV VGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV >CATL_RAT MTPLLLLAVLCLGTALATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGK HGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWA FSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS CKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLV VGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIVN >PAPA_CARPA MAMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFE IFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDW RQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQ YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGG IFVGPCGNKVDHAVAAVGYGPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN interpro.xml100644000766000024 45602614006427353 17035 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data Kringle

Kringles are autonomous structural domains, found throughout the blood clotting and fibrinolytic proteins. Kringle domains are believed to play a role in binding mediators (e.g., membranes, other proteins or phospholipids), and in the regulation of proteolytic activity. Kringle domains are characterised by a triple loop, 3-disulphide bridge structure, whose conformation is defined by a number of hydrogen bonds and small pieces of anti-parallel beta-sheet. They are found in a varying number of copies in some plasma proteins including prothrombin and urokinase-type plasminogen activator, which are serine proteases belonging to MEROPS peptidase family S1A.

Steroid or nuclear hormone receptors (4A nuclear receptor, NRs) constitute an important superfamily of transcription regulators that are involved in widely diverse physiological functions, including control of embryonic development, cell differentiation and homeostasis. Members of the superfamily include the steroid hormone receptors and receptors for thyroid hormone, retinoids, 1,25-dihydroxy-vitamin D3 and a variety of other ligands. The proteins function as dimeric molecules in nuclei to regulate the transcription of target genes in a ligand-responsive manner. In addition to C-terminal ligand-binding domains, these nuclear receptors contain a highly-conserved, N-terminal zinc-finger that mediates specific binding to target DNA sequences, termed ligand-responsive elements. In the absence of ligand, steroid hormone receptors are thought to be weakly associated with nuclear components; hormone binding greatly increases receptor affinity.

NRs are extremely important in medical research, a large number of them being implicated in diseases such as cancer, diabetes, hormone resistance syndromes, etc. While several NRs act as ligand-inducible transcription factors, many do not yet have a defined ligand and are accordingly termed 'orphan' receptors. During the last decade, more than 300 NRs have been described, many of which are orphans, which cannot easily be named due to current nomenclature confusions in the literature. However, a new system has recently been introduced in an attempt to rationalise the increasingly complex set of names used to describe superfamily members.

Patthy L. Evolution of the proteases of blood coagulation and fibrinolysis by assembly from modules. Cell 1985
Patthy L, Trexler M, Vali Z, Banyai L, Varadi A. Kringles: modules specialized for protein binding. Homology of the gelatin-binding region of fibronectin with the kringle structures of proteases. FEBS Lett. 1984
McMullen BA, Fujikawa K. Amino acid sequence of the heavy chain of human alpha-factor XIIa (activated Hageman factor). J. Biol. Chem. 1985
Ikeo K, Takahashi K, Gojobori T. Evolutionary origin of numerous kringles in human and simian apolipoprotein(a). FEBS Lett. 1991
De Vos P, Schmitt J, Verhoeven G, Stunnenberg HG. Human androgen receptor expressed in HeLa cells activates transcription in vitro. Nucleic Acids Res. 1994
Schwabe JW, Teichmann SA. Nuclear receptors: the evolution of diversity. Sci. STKE 2004
Atkinson RA, Williams RJ. Solution structure of the kringle 4 domain from human plasminogen by 1H nuclear magnetic resonance spectroscopy and distance geometry. J. Mol. Biol. 1990
Castellino FJ, Beals JM. The genetic relationships between the kringle domains of human plasminogen, prothrombin, tissue plasminogen activator, urokinase, and coagulation factor XII. J. Mol. Evol. 1987
Nishikawa J, Kitaura M, Imagawa M, Nishihara T. Vitamin D receptor contains multiple dimerization interfaces that are functionally different. Nucleic Acids Res. 1995
Retinoid X receptor

Steroid or nuclear hormone receptors (4A nuclear receptor, NRs) constitute an important superfamily of transcription regulators that are involved in widely diverse physiological functions, including control of embryonic development, cell differentiation and homeostasis. Members of the superfamily include the steroid hormone receptors and receptors for thyroid hormone, retinoids, 1,25-dihydroxy-vitamin D3 and a variety of other ligands []. The proteins function as dimeric molecules in nuclei to regulate the transcription of target genes in a ligand-responsive manner [, ]. In addition to C-terminal ligand-binding domains, these nuclear receptors contain a highly-conserved, N-terminal zinc-finger that mediates specific binding to target DNA sequences, termed ligand-responsive elements. In the absence of ligand, steroid hormone receptors are thought to be weakly associated with nuclear components; hormone binding greatly increases receptor affinity.

NRs are extremely important in medical research, a large number of them being implicated in diseases such as cancer, diabetes, hormone resistance syndromes, etc. While several NRs act as ligand-inducible transcription factors, many do not yet have a defined ligand and are accordingly termed 'orphan' receptors. During the last decade, more than 300 NRs have been described, many of which are orphans, which cannot easily be named due to current nomenclature confusions in the literature. However, a new system has recently been introduced in an attempt to rationalise the increasingly complex set of names used to describe superfamily members.

The retinoic acid (retinoid X) receptor consists of 3 functional and structural domains: an N-terminal (modulatory) domain; a DNA binding domain that mediates specific binding to target DNA sequences (ligand-responsive elements); and a hormone binding domain. The N-terminal domain differs between retinoic acid isoforms; the small highly-conserved DNA-binding domain (~65 residues) occupies the central portion of the protein; and the ligand binding domain lies at the receptor C terminus.

Synonym(s): 2B nuclear receptor

Molecular Function DNA binding Molecular Function ligand-dependent nuclear receptor activity Molecular Function steroid binding Cellular Component nucleus Biological Process regulation of transcription, DNA-dependent Molecular Function zinc ion binding Nishikawa J, Kitaura M, Imagawa M, Nishihara T. Vitamin D receptor contains multiple dimerization interfaces that are functionally different. Nucleic Acids Res. 1995 De Vos P, Schmitt J, Verhoeven G, Stunnenberg HG. Human androgen receptor expressed in HeLa cells activates transcription in vitro. Nucleic Acids Res. 1994 Schwabe JW, Teichmann SA. Nuclear receptors: the evolution of diversity. Sci. STKE 2004
Helix-turn-helix, AraC type

Many bacterial transcription regulation proteins bind DNA through a 'helix-turn-helix' (HTH) motif. One major subfamily of these proteins [, ] is related to the arabinose operon regulatory protein AraC []. Except for celD [], all of these proteins seem to be positive transcriptional factors.

Although the sequences belonging to this family differ somewhat in length, in nearly every case the HTH motif is situated towards the C terminus in the third quarter of most of the sequences. The minimal DNA binding domain spans roughly 100 residues and comprises two HTH subdomains; the classical HTH domain and another HTH subdomain with similarity to the classical HTH domain but with an insertion of one residue in the turn-region. The N-terminal and central regions of these proteins are presumed to interact with effector molecules and may be involved in dimerisation [].

The known structure of MarA shows that the AraC domain is alpha-helical and that both HTH subdomains bind the major groove of the DNA. The two HTH subdomains are separated by only 27 angstroms, which causes the cognate DNA to bend.

Molecular Function transcription factor activity Cellular Component intracellular Biological Process regulation of transcription, DNA-dependent Molecular Function sequence-specific DNA binding Parker LL, Hall BG. Characterization and nucleotide sequence of the cryptic cel operon of Escherichia coli K12. Genetics 1990 Henikoff S, Wallace JC, Brown JP. Finding protein similarities with nucleotide sequence databases. Meth. Enzymol. 1990 Gallegos MT, Michan C, Ramos JL. The XylS/AraC family of regulators. Nucleic Acids Res. 1993 Bustos SA, Schleif RF. Functional domains of the AraC protein. Proc. Natl. Acad. Sci. U.S.A. 1993
Metallothionein, vertebrate

Metallothioneins (MT) are small proteins that bind heavy metals, such as zinc, copper, cadmium and nickel. They have a high content of cysteine residues that bind the metal ions through clusters of thiolate bonds [, ]. An empirical classification into three classes has been proposed by Fowler and coworkers [] and Kojima []. Members of class I are defined to include polypeptides related in the positions of their cysteines to equine MT-1B, and include mammalian MTs as well as MTs from crustaceans and molluscs. Class II groups MTs from a variety of species, including sea urchins, fungi, insects and cyanobacteria. Class III MTs are atypical polypeptides composed of gamma-glutamylcysteinyl units [].

This original classification system has been found to be limited, in the sense that it does not allow clear differentiation of patterns of structural similarities, either between or within classes. Consequently, all class I and class II MTs (the proteinaceous sequences) have now been grouped into families of phylogenetically-related and thus alignable sequences. This system subdivides the MT superfamily into families, subfamilies, subgroups, and isolated isoforms and alleles.

The metallothionein superfamily comprises all polypeptides that resemble equine renal metallothionein in several respects []: e.g., low molecular weight; high metal content; amino acid composition with high Cys and low aromatic residue content; unique sequence with characteristic distribution of cysteines; and spectroscopic manifestations indicative of metal thiolate clusters. An MT family subsumes MTs that share particular sequence-specific features and are thought to be evolutionarily related. The inclusion of an MT within a family presupposes that its amino acid sequence is alignable with that of all members. Fifteen MT families have been characterised, each family being identified by its number and its taxonomic range: e.g., Family 1: vertebrate MTs [see http://www.bioc.unizh.ch/mtpage/protali.html].

The members of family 1 are recognised by the sequence pattern K-x(1,2)-C-C-x-C-C-P-x(2)-C located at the beginning of the third exon. The taxonomic range of the members extends to vertebrates. Known characteristics: 60 to 68 AAs; 20 Cys (21 in one case), 19 of which are totally conserved; the protein sequence is divided into two structural domains containing 9 and 11 Cys, which bind 3 and 4 bivalent metal ions, respectively. The gene is composed of 3 exons and 2 introns, and the splicing sites are conserved. Family 1 includes subfamilies m1, m2, m3, m4, m, a, a1, a2, b, ba and t, all of which hit the same InterPro entry.
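The family 1 signature above is written in PROSITE-style syntax, which maps directly onto a regular expression. The following is a minimal sketch of that translation; the function names and the test sequence are hypothetical illustrations, not part of InterPro or BioPerl, and only the pattern elements used in this entry are supported.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a simple PROSITE-style pattern into a Python regex string.

    Supports only literal residues, 'x' (any residue) and repeat counts
    such as x(2) or x(1,2), which is all the family 1 pattern needs.
    """
    parts = []
    for element in pattern.split("-"):
        match = re.fullmatch(r"([A-Za-z])(?:\((\d+)(?:,(\d+))?\))?", element)
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        residue, low, high = match.groups()
        token = "." if residue == "x" else residue.upper()
        if low is not None and high is not None:
            token += "{%s,%s}" % (low, high)   # x(1,2) -> .{1,2}
        elif low is not None:
            token += "{%s}" % low              # x(2)   -> .{2}
        parts.append(token)
    return "".join(parts)


FAMILY1_PATTERN = "K-x(1,2)-C-C-x-C-C-P-x(2)-C"
FAMILY1_REGEX = re.compile(prosite_to_regex(FAMILY1_PATTERN))


def find_family1_signature(sequence: str):
    """Return (0-based start, matched text) of the first hit, or None."""
    hit = FAMILY1_REGEX.search(sequence.upper())
    return None if hit is None else (hit.start(), hit.group())


if __name__ == "__main__":
    # Made-up peptide used only to exercise the pattern.
    print(prosite_to_regex(FAMILY1_PATTERN))
    print(find_family1_signature("AAKSCCSCCPVGCAA"))
```

A hit only shows that the signature is present; it does not by itself establish family membership, which also depends on the alignment criteria described above.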

Molecular Function metal ion binding Kagi JH, Kojima Y. Chemistry and biochemistry of metallothionein. Experientia Suppl. 1987 Kagi JH. Overview of metallothionein. Meth. Enzymol. 1991 Kojima Y. Definitions and nomenclature of metallothioneins. Meth. Enzymol. 1991 Fowler BA, Hildebrand CE, Kojima Y, Webb M. Nomenclature of metallothionein. Experientia Suppl. 1987
Tubby, C-terminal

Tubby, an autosomal recessive mutation mapping to mouse chromosome 7, was recently found to be the result of a splicing defect in a novel gene with unknown function. This mutation maps to the tub gene [, ]. The mouse tubby mutation is the cause of maturity-onset obesity, insulin resistance and sensory deficits. By contrast with the rapid juvenile-onset weight gain seen in diabetes (db) and obese (ob) mice, obesity in tubby mice develops gradually, and strongly resembles the late-onset obesity observed in the human population. Excessive deposition of adipose tissue culminates in a two-fold increase of body weight. Tubby mice also suffer retinal degeneration and neurosensory hearing loss. The tripartite character of the tubby phenotype is highly similar to human obesity syndromes, such as Alstrom and Bardet-Biedl. These phenotypes indicate a vital role for tubby proteins, but no biochemical function has yet been ascribed to any family member []; it has been suggested, however, that the phenotypic features of tubby mice may be the result of cellular apoptosis triggered by expression of the mutated tub gene. TUB is the founding member of the tubby-like proteins, the TULPs. TULPs are found in multicellular organisms from both the plant and animal kingdoms. Ablation of members of this protein family causes disease phenotypes that are indicative of their importance in nervous-system function and development [].

Mammalian TUB is a hydrophilic protein of ~500 residues. The N-terminal portion of the protein is conserved neither in length nor sequence, but, in TUB, contains the nuclear localisation signal and may have transcriptional-activation activity. The C-terminal 250 residues are highly conserved. The C-terminal extremity contains a cysteine residue that might play an important role in the normal functioning of these proteins. The crystal structure of the C-terminal core domain from mouse tubby has been determined to 1.9 angstroms resolution. This domain is arranged as a 12-stranded, all anti-parallel, closed beta-barrel that surrounds a central alpha helix (which is at the extreme carboxyl terminus of the protein) that forms most of the hydrophobic core. Structural analyses suggest that TULPs constitute a unique family of bipartite transcription factors [].

Kleyn PW, Fan W, Kovats SG, Lee JJ, Pulido JC, Wu Y, Berkemeier LR, Misumi DJ, Holmgren L, Charlat O, Woolf EA, Tayber O, Brody T, Shu P, Hawkins F, Kennedy B, Baldini L, Ebeling C, Alperin GD, Deeds J, Lakey ND, Culpepper J, Chen H, Glucksmann-Kuis MA, Carlson GA, Duyk GM, Moore KJ. Identification and characterization of the mouse obesity gene tubby: a member of a novel gene family. Cell 1996 Noben-Trauth K, Naggert JK, North MA, Nishina PM. A candidate gene for the mouse mutation tubby. Nature 1996 Boggon TJ, Shan WS, Santagata S, Myers SC, Shapiro L. Implication of tubby proteins as transcription factors by structure-based functional analysis. Science 1999 Carroll K, Gomez C, Shapiro L. Tubby proteins: the plot thickens. Nat. Rev. Mol. Cell Biol. 2004
C2 calcium-dependent membrane targeting

The C2 domain is a Ca2+-dependent membrane-targeting module found in many cellular proteins involved in signal transduction or membrane trafficking. C2 domains are unique among membrane targeting domains in that they show a wide range of lipid selectivity for the major components of cell membranes, including phosphatidylserine and phosphatidylcholine. This C2 domain is about 116 amino-acid residues long and, in Protein Kinase C, is located between the two copies of the C1 domain (which bind phorbol esters and diacylglycerol) and the protein kinase catalytic domain. Regions with significant homology [] to the C2 domain have been found in many proteins. The C2 domain is thought to be involved in calcium-dependent phospholipid binding [] and in membrane targeting processes such as subcellular localisation.

The 3D structure of the C2 domain of synaptotagmin has been reported []; the domain forms an eight-stranded beta sandwich constructed around a conserved four-stranded motif, designated a C2 key []. Calcium binds in a cup-shaped depression formed by the N- and C-terminal loops of the C2-key motif. Structural analyses of several C2 domains have shown them to consist of similar tertiary structures in which three Ca2+-binding loops are located at the end of an eight-stranded antiparallel beta sandwich.

Sutton RB, Davletov BA, Berghuis AM, Sudhof TC, Sprang SR. Structure of the first C2 domain of synaptotagmin I: a novel Ca2+/phospholipid-binding fold. Cell 1995 Davletov BA, Sudhof TC. A single C2 domain from synaptotagmin I is sufficient for high affinity Ca2+/phospholipid binding. J. Biol. Chem. 1993 Brose N, Hofmann K, Hata Y, Sudhof TC. Mammalian homologues of Caenorhabditis elegans unc-13 gene define novel family of C2-domain proteins. J. Biol. Chem. 1995
Protein phosphatase 2A, regulatory subunit PR55

Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase implicated in many cellular processes, including the regulation of metabolic enzymes and proteins involved in signal transduction [, ]. PP2A is a trimer composed of a 36 kDa catalytic subunit, a 65 kDa regulatory subunit (subunit A) and a variable third subunit (subunit B) [, ].

One form of the third subunit is a 55 kDa protein (PR55), which exists in Drosophila melanogaster and yeast, and has up to three forms in mammals [, ]. PR55 may act as a substrate recognition unit, or may help to target the enzyme to the correct subcellular location [].

Cellular Component protein phosphatase type 2A complex Biological Process signal transduction Molecular Function protein phosphatase type 2A regulator activity Mayer RE, Hendrix P, Cron P, Matthies R, Stone SR, Goris J, Merlevede W, Hofsteenge J, Hemmings BA. Structure of the 55-kDa regulatory subunit of protein phosphatase 2A: evidence for a neuronal-specific isoform. Biochemistry 1991 Pallas DC, Weller W, Jaspers S, Miller TB, Lane WS, Roberts TM. The third subunit of protein phosphatase 2A (PP2A), a 55-kilodalton protein which is apparently substituted for by T antigens in complexes with the 36- and 63-kilodalton PP2A subunits, bears little resemblance to T antigens. J. Virol. 1992
Proteinase inhibitor I25, cystatin

Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain, either by interaction with a second peptidase or by autocatalytic cleavage, activates the zymogen. Other inhibitors interact directly with proteinases using a simple noncovalent lock and key mechanism, while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.

The cystatins are cysteine proteinase inhibitors belonging to MEROPS inhibitor family I25, clan IH [, , ]. They mainly inhibit peptidases belonging to peptidase families C1 (papain family) and C13 (legumain family). The cystatin family includes:

  • The Type 1 cystatins, which are intracellular cystatins that are present in the cytosol of many cell types, but can also appear in body fluids at significant concentrations. They are single-chain polypeptides of about 100 residues, which have neither disulphide bonds nor carbohydrate side chains.
  • The Type 2 cystatins, which are mainly extracellular secreted polypeptides synthesised with a 19-28 residue signal peptide. They are broadly distributed and found in most body fluids.
  • The Type 3 cystatins, which are multidomain proteins. The mammalian representatives of this group are the kininogens. There are three different kininogens in mammals: H- (high molecular mass) and L- (low molecular mass) kininogen, which are found in a number of species, and T-kininogen, which is found only in rat.
  • Unclassified cystatins. These are cystatin-like proteins found in a range of organisms: plant phytocystatins, fetuin in mammals, insect cystatins and a puff adder venom cystatin which inhibits metalloproteases of the MEROPS peptidase family M12 (astacin/adamalysin). A number of the cystatin-like proteins have also been shown to be devoid of inhibitory activity.

All true cystatins inhibit cysteine peptidases of the papain family (MEROPS peptidase family C1), and some also inhibit legumain family enzymes (MEROPS peptidase family C13). These peptidases play key roles in physiological processes, such as intracellular protein degradation (cathepsins B, H and L), are pivotal in the remodelling of bone (cathepsin K), and may be important in the control of antigen presentation (cathepsin S, mammalian legumain). Moreover, the activities of such peptidases are increased in pathophysiological conditions, such as cancer metastasis and inflammation. Additionally, such peptidases are essential for several pathogenic parasites and bacteria. Thus cystatins not only have the capacity to regulate normal body processes in animals, and perhaps cause disease when down-regulated, but may also participate in defence against biotic and abiotic stress in other organisms.

Molecular Function cysteine-type endopeptidase inhibitor activity Turk V, Bode W. The cystatins: protein inhibitors of cysteine proteinases. FEBS Lett. 1991 Rawlings ND, Barrett AJ. Evolution of proteins of the cystatin superfamily. J. Mol. Evol. 1990 Abrahamson M, Alvarez-Fernandez M, Nathanson CM. Cystatins. Biochem. Soc. Symp. 2003
Ubiquitin-activating enzyme, E1-like

The post-translational attachment of ubiquitin to proteins (ubiquitinylation) alters the function, location or trafficking of a protein, or targets it to the 26S proteasome for degradation [, , ]. Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1), a ubiquitin-conjugating enzyme (E2), and a ubiquitin ligase (E3), which work sequentially in a cascade []. The E1 enzyme is responsible for activating ubiquitin, the first step in ubiquitinylation. The E1 enzyme hydrolyses ATP and adenylates the C-terminal glycine residue of ubiquitin, and then links this residue to the active site cysteine of E1, yielding a ubiquitin-thioester and free AMP. To be fully active, E1 must non-covalently bind to and adenylate a second ubiquitin molecule. The E1 enzyme can then transfer the thioester-linked ubiquitin molecule to a cysteine residue on the ubiquitin-conjugating enzyme, E2, in an ATP-dependent reaction.

Biological Process protein modification process Molecular Function small protein activating enzyme activity Burger AM, Seth AK. The ubiquitin-mediated protein degradation pathway in cancer: therapeutic implications. Eur. J. Cancer 2004 Passmore LA, Barford D. Getting into position: the catalytic mechanisms of protein ubiquitylation. Biochem. J. 2004 Pickart CM, Fushman D. Polyubiquitin chains: polymeric protein signals. 2004 Sun L, Chen ZJ. The novel functions of ubiquitination in signaling. Curr. Opin. Cell Biol. 2004
Retroviral VpR/VpX protein

Human immunodeficiency virus (HIV) is the human retrovirus associated with AIDS (acquired immune deficiency syndrome), and SIV its simian counterpart. Three main groups of primate lentivirus are known, designated Human immunodeficiency virus 1 (HIV-1), Human immunodeficiency virus 2 (HIV-2)/Simian immunodeficiency virus - mac (SIVMAC)/Simian immunodeficiency virus - sm (SIVSM) and Simian immunodeficiency virus - agm (SIVAGM). Simian immunodeficiency virus - mnd (SIVMND) has been suggested to represent a fourth distinct group []. These groups are believed to have diverged from a common ancestor long before the spread of AIDS in humans. Genetic variation in HIV-1 and HIV-2 has been studied extensively, and the nucleotide sequences reported for several strains [].

ORF analysis has revealed two open reading frames, yielding the so-called R- and X-ORF proteins, whose functions are unknown, but which show a high degree of sequence similarity.

Hasegawa A, Tsujimoto H, Maki N, Ishikawa K, Miura T, Fukasawa M, Miki K, Hayami M. Genomic divergence of HIV-2 from Ghana. AIDS Res. Hum. Retroviruses 1989 Tsujimoto H, Hasegawa A, Maki N, Fukasawa M, Miura T, Speidel S, Cooper RW, Moriyama EN, Gojobori T, Hayami M. Sequence of a novel simian immunodeficiency virus from a wild-caught African mandrill. Nature 1989
Peptidase M7, snapalysin

In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:

  • Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.
  • Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.
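The naming rule described in the list above (the first character of a family identifier encodes the catalytic type, which in turn determines the kind of nucleophile) can be sketched as a small lookup. The function name and return format are hypothetical; this is an illustration of the rule as stated in the text, not an interface to the MEROPS database.

```python
# Catalytic types keyed by the first character of a MEROPS family identifier,
# exactly as listed in the text above.
CATALYTIC_TYPES = {
    "A": "aspartic",
    "C": "cysteine",
    "G": "glutamic acid",
    "M": "metallo",
    "S": "serine",
    "T": "threonine",
    "U": "unknown",
}

# Serine, threonine and cysteine peptidases use the amino acid as nucleophile;
# aspartic, glutamic and metallopeptidases use an activated water molecule.
AMINO_ACID_NUCLEOPHILE = {"C", "S", "T"}
WATER_NUCLEOPHILE = {"A", "G", "M"}


def describe_family(family_id: str) -> tuple:
    """Return (catalytic type, nucleophile) for an identifier such as 'M7'."""
    if not family_id:
        raise ValueError("empty family identifier")
    letter = family_id[0].upper()
    if letter not in CATALYTIC_TYPES:
        raise ValueError(f"unrecognised catalytic type letter: {letter!r}")
    if letter in AMINO_ACID_NUCLEOPHILE:
        nucleophile = "amino acid residue"
    elif letter in WATER_NUCLEOPHILE:
        nucleophile = "activated water molecule"
    else:
        nucleophile = "not specified"
    return CATALYTIC_TYPES[letter], nucleophile


if __name__ == "__main__":
    for family in ("M7", "C1", "C13", "M12"):
        print(family, describe_family(family))
```

The sketch covers family identifiers only. Clan identifiers follow the same first-letter convention but additionally use 'P' for clans of mixed catalytic type, which is deliberately left out here.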

In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.

Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys, and at least one other residue, which may play an electrophilic role, is required for catalysis. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].
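As a rough illustration of how the loose HEXXH motif could be located in a sequence, the sketch below scans for His-Glu-X-X-His with a regular expression. The function name and example sequences are hypothetical. The stricter 'abXHEbbHbc' form is not implemented, because it would require choosing residue sets for 'uncharged' and 'hydrophobic' that the text does not specify.

```python
import re

# Lookahead so that overlapping occurrences are all reported.
HEXXH = re.compile(r"(?=(HE..H))")


def find_hexxh(sequence: str) -> list:
    """Return a list of (0-based position, matched residues) for every HEXXH hit."""
    return [(m.start(), m.group(1)) for m in HEXXH.finditer(sequence.upper())]


if __name__ == "__main__":
    # Made-up peptide containing a single HEXXH occurrence.
    print(find_hexxh("AAVVAHELTHAVTD"))
```

Because the motif is short and, as noted above, relatively common, a hit from this scan is only a starting point and says nothing on its own about metal binding.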

This group of metallopeptidases belongs to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.

With a molecular weight of around 16 kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.

Molecular Function metalloendopeptidase activity Cellular Component extracellular region Biological Process proteolysis Molecular Function zinc ion binding Rawlings ND, Barrett AJ. Evolutionary families of metallopeptidases. Meth. Enzymol. 1995
PAS

PAS domains occur in many signalling proteins, where they act as signal sensor domains. PAS domains appear in archaea, bacteria and eukaryotes. Several PAS-domain proteins are known to detect their signal by way of an associated cofactor. Haem, flavin, and a 4-hydroxycinnamyl chromophore are used in different proteins. The PAS domain was named after three proteins that it occurs in:

  • Per - period circadian protein
  • Arnt - Ah receptor nuclear translocator protein
  • Sim - single-minded protein

PAS domains are often associated with PAC domains. It appears that these domains are directly linked, and that together they form the conserved 3D PAS fold. The division between the PAS and PAC domains is caused by major differences in sequences in the region connecting these two motifs []. In human PAS kinase, this region has been shown to be very flexible, and adopts different conformations depending on the bound ligand []. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels [].

    Molecular Function signal transducer activity Biological Process signal transduction Zhulin IB, Taylor BL, Dixon R. PAS domain S-boxes in Archaea, Bacteria and sensors for oxygen and redox. Trends Biochem. Sci. 1997 Hefti MH, Francoijs KJ, de Vries SC, Dixon R, Vervoort J. The PAS fold. A redefinition of the PAS domain based upon structural prediction. Eur. J. Biochem. 2004 Amezcua CA, Harper SM, Rutter J, Gardner KH. Structure and interactions of PAS kinase N-terminal PAS domain: model for intramolecular kinase regulation. Structure 2002
    Fimbrial biogenesis outer membrane usher protein

    In Gram-negative bacteria the biogenesis of fimbriae (or pili) requires a two-component assembly and transport system composed of a periplasmic chaperone and an outer membrane protein that has been termed a molecular 'usher' [, , ].

    The usher protein is rather large (from 86 to 100 kDa) and seems to be mainly composed of membrane-spanning beta-sheets, a structure reminiscent of porins. Although the degree of sequence similarity of these proteins is not very high, they share a number of characteristics. One of these is the presence of two pairs of cysteines, the first located in the N-terminal part and the second at the C-terminal extremity, which are probably involved in disulphide bonds. The best conserved region is located in the central part of these proteins.

    Molecular Function transporter activity Biological Process transport Cellular Component membrane Schifferli DM, Alrutz MA. Permissive linker insertion sites in the outer membrane protein of 987P fimbriae of Escherichia coli. J. Bacteriol. 1994 Jacob-Dubuisson F, Striker R, Hultgren SJ. Chaperone-assisted self-assembly of pili independent of cellular energy. J. Biol. Chem. 1994 Van Rosmalen M, Saier MH Jr. Structural and evolutionary relationships between two families of bacterial extracytoplasmic chaperone proteins which function cooperatively in fimbrial assembly. Res. Microbiol. 1993
    P2Y4 purinoceptor

    G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).

    The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].

    In addition to their role in energy metabolism, purines (especially adenosine and adenine nucleotides) produce a wide range of pharmacological effects mediated by activation of cell surface receptors []. ATP is a co-transmitter in sympathetic nerves in the autonomic nervous system, where it exerts an important physiological role in the regulation of smooth muscle activity, stimulating relaxation of intestinal smooth muscle and contraction of the bladder. Receptors for adenine nucleotides are involved in a number of other physiological pathways, including stimulation of platelet activation by ADP, which is released from the vascular endothelium following injury. ATP has excitatory effects in the CNS []. Distinct receptors exist for adenosine. The main effects of adenosine in the periphery include vasodilation, bronchoconstriction, immunosuppression, inhibition of platelet aggregation, cardiac depression, stimulation of nociceptive afferents, inhibition of neurotransmitter release, and inhibition of the release of hormones. In the CNS, adenosine exerts a pre- and post-synaptic depressant action, reducing motor activity, depressing respiration, inducing sleep and relieving anxiety. The physiological role of adenosine is believed to be to adjust energy demands in line with oxygen supply [].

    Purinoceptors have been classified as P1 or P2, depending on their preference for adenosine or adenine nucleotides respectively. Adenosine receptors (P1 purinoceptors) are characterised by their affinity for adenosine and by the ability of methylxanthines to act as antagonists []. Adenosine has very low affinity for P2 purinoceptors.

    The P2Y receptor is found in smooth muscle (e.g., taenia caeci) and in vascular tissue, where it induces vasodilation through endothelium-dependent release of nitric oxide. The receptor activates phosphoinositide metabolism through a pertussis-toxin-insensitive G-protein, probably belonging to the Gi/Go class [].

    A new subtype of P2 purinoceptors has been isolated []. Its deduced amino acid sequence is consistent with a GPCR that is 51% identical to the human P2Y2 receptor and 35% identical to the chicken P2Y1 receptor []. P2Y4 is expressed in the placenta, with low levels in the lung and vascular smooth muscle. In cells stably expressing the receptor, UTP and UDP have been shown to stimulate the formation of inositol phosphates with equivalent potency and maximal effect, while ATP behaves as a partial agonist, and ADP is almost inactive []. The receptor is thus a new member of the P2 purinergic receptor family that functionally behaves as a pyrimidinergic receptor []. P2Y4 can couple to both Gi and Gq proteins to activate phospholipase C [].

    Biological Process G-protein coupled receptor protein signaling pathway Cellular Component integral to membrane Molecular Function purinergic nucleotide receptor activity, G-protein coupled Birnbaumer L. G proteins in signal transduction. Annu. Rev. Pharmacol. Toxicol. 1990 Casey PJ, Gilman AG. G protein involvement in receptor-effector coupling. J. Biol. Chem. 1988 Communi D, Pirotton S, Parmentier M, Boeynaems JM. Cloning and functional expression of a human uridine nucleotide receptor. J. Biol. Chem. 1995 Attwood TK, Findlay JB. Design of a discriminating fingerprint for G-protein-coupled receptors. Protein Eng. 1993 Communi D, Janssens R, Suarez-Huerta N, Robaye B, Boeynaems JM. Advances in signalling by extracellular nucleotides. the role and transduction mechanisms of P2Y receptors. Cell. Signal. 2000 Attwood TK, Findlay JB. Fingerprinting G-protein-coupled receptors. Protein Eng. 1994 Watson S, Arkinstall S. Adenosine and adenine nucleotides. ISBN:0127384405 1994
    Anaphylatoxin/fibulin

    Complement components C3, C4 and C5 are large glycoproteins that have important functions in the immune response and host defence []. They have a wide variety of biological activities and are proteolytically activated by cleavage at a specific site, forming a- and b-fragments []. A-fragments form distinct structural domains of approximately 76 amino acids, coded for by a single exon within the complement protein gene. The C3a, C4a and C5a components are referred to as anaphylatoxins [, ]: they cause smooth muscle contraction, histamine release from mast cells, and enhanced vascular permeability []. They also mediate chemotaxis, inflammation, and generation of cytotoxic oxygen radicals []. The proteins are highly hydrophilic, with a mainly alpha-helical structure held together by 3 disulphide bridges [].

    Fibulins are secreted glycoproteins that become incorporated into a fibrillar extracellular matrix when expressed by cultured cells or added exogenously to cell monolayers [, ]. The five known members of the family share an elongated structure and many calcium-binding sites, owing to the presence of tandem arrays of epidermal growth factor-like domains. They have overlapping binding sites for several basement-membrane proteins, tropoelastin, fibrillin, fibronectin and proteoglycans, and they participate in diverse supramolecular structures. The amino-terminal domain I of fibulin consists of three anaphylatoxin-like (AT) modules, each approximately 40 residues long and containing four or six cysteines. The structure of an AT module was determined for the complement-derived anaphylatoxin C3a, and was found to be a compact alpha-helical fold that is stabilised by three disulphide bridges in the pattern Cys1-4, Cys2-5 and Cys3-6 (where Cys is cysteine). The bulk of the remaining portion of the fibulin molecule is a series of nine EGF-like repeats [].

    Cellular Component extracellular region Gennaro R, Simonic T, Negri A, Mottola C, Secchi C, Ronchi S, Romeo D. C5a fragment of bovine complement. Purification, bioassays, amino-acid sequence and other structural studies. Eur. J. Biochem. 1986 Ogata RT, Rosa PA, Zepf NE. Sequence of the gene for murine complement component C4. J. Biol. Chem. 1989 Argraves WS, Tran H, Burgess WH, Dickerson K. Fibulin is an extracellular matrix and plasma glycoprotein with repeated domain structure. J. Cell Biol. 1990 Timpl R, Sasaki T, Kostka G, Chu ML. Fibulins: a versatile family of extracellular matrix proteins. Nat. Rev. Mol. Cell Biol. 2003 Pan TC, Sasaki T, Zhang RZ, Fassler R, Timpl R, Chu ML. Structure and expression of fibulin-2, a novel extracellular matrix protein with multiple EGF-like repeats and consensus motifs for calcium binding. J. Cell Biol. 1993 Fritzinger DC, Petrella EC, Connelly MB, Bredehorst R, Vogel CW. Primary structure of cobra complement component C3. J. Immunol. 1992
    Hok/gef cell toxic protein

    The hok/gef family of Gram-negative bacterial proteins are toxic to cells when over-expressed, killing the cells from within by interfering with a vital function in the cell membrane []. Some family members (flm) increase the stability of unstable RNA [], some (pnd) induce the degradation of stable RNA at higher than optimum growth temperatures [], while others affect the release of cellular magnesium by membrane alterations []. The proteins are short (50-70 residues), consisting of an N-terminal hydrophobic (possibly membrane spanning) domain, and a C-terminal periplasmic region, which contains the toxic domain. The C-terminal region contains a conserved cysteine residue that mediates homo-dimerisation in the gef protein, although dimerisation is not necessary for the toxic effect [].

    Cellular Component membrane Sakikawa T, Akimoto S, Ohnishi Y. The pnd gene in E. coli plasmid R16: nucleotide sequence and gene expression leading to cell Mg2+ release and stable RNA degradation. Biochim. Biophys. Acta 1989 Golub EI, Panzer HA. The F factor of Escherichia coli carries a locus of stable plasmid inheritance stm, similar to the parB locus of plasmid RI. Mol. Gen. Genet. 1988 Poulsen LK, Refn A, Molin S, Andersson P. Topographic analysis of the toxic Gef protein from Escherichia coli. Mol. Microbiol. 1991
    Carboxyl transferase

    Members in this domain include biotin-dependent carboxylases [, ]. The carboxyl transferase domain carries out the following reaction: transcarboxylation from biotin to an acceptor molecule. There are two recognised types of carboxyl transferase. One uses acyl-CoA and the other uses a 2-oxo acid as the acceptor molecule of carbon dioxide. All of the members in this family utilise acyl-CoA as the acceptor molecule.

    Molecular Function ligase activity Toh H, Kondo H, Tanabe T. Molecular evolution of biotin-dependent carboxylases. Eur. J. Biochem. 1993 Thornton CG, Kumar GK, Haase FC, Phillips NF, Woo SB, Park VM, Magner WJ, Shenoy BC, Wood HG, Samols D. Primary structure of the monomer of the 12S subunit of transcarboxylase as deduced from DNA and characterization of the product expressed in Escherichia coli. J. Bacteriol. 1993
    Phosphofructokinase

    The enzyme-catalysed transfer of a phosphoryl group from ATP is an important reaction in a wide variety of biological processes []. One enzyme that utilises this reaction is phosphofructokinase (PFK), which catalyses the phosphorylation of fructose-6-phosphate to fructose-1,6-bisphosphate, a key regulatory step in the glycolytic pathway [, ]. PFK exists as a homotetramer in bacteria and mammals (where each monomer possesses 2 similar domains), and as an octamer in yeast (where there are 4 alpha- (PFK1) and 4 beta-chains (PFK2), the latter, like the mammalian monomers, possessing 2 similar domains []).

    PFK is ~300 amino acids in length, and structural studies of the bacterial enzyme have shown it comprises two similar (alpha/beta) lobes: one involved in ATP binding and the other housing both the substrate-binding site and the allosteric site (a regulatory binding site distinct from the active site, but that affects enzyme activity). The identical tetramer subunits adopt 2 different conformations: in a 'closed' state, the bound magnesium ion bridges the phosphoryl groups of the enzyme products (ADP and fructose-1,6-bisphosphate); and in an 'open' state, the magnesium ion binds only the ADP [], as the 2 products are now further apart. These conformations are thought to be successive stages of a reaction pathway that requires subunit closure to bring the 2 molecules sufficiently close to react [].

    Deficiency in PFK leads to glycogenosis type VII (Tarui's disease), an autosomal recessive disorder characterised by severe nausea, vomiting, muscle cramps and myoglobinuria in response to bursts of intense or vigorous exercise []. Sufferers are usually able to lead a reasonably ordinary life by learning to adjust activity levels [].

    Molecular Function 6-phosphofructokinase activity Cellular Component 6-phosphofructokinase complex Biological Process glycolysis Raben N, Exelbert R, Spiegel R, Sherman JB, Nakajima H, Plotz P, Heinisch J. Functional expression of human mutant phosphofructokinase in yeast: genetic defects in French Canadian and Swiss patients with phosphofructokinase deficiency. Am. J. Hum. Genet. 1995 Shirakihara Y, Evans PR. Crystal structure of the complex of phosphofructokinase from Escherichia coli with its reaction products. J. Mol. Biol. 1988 Hellinga HW, Evans PR. Mutations in the active site of Escherichia coli phosphofructokinase. Nature 1987 Wegener G, Krause U. Different modes of activating phosphofructokinase, a key regulatory enzyme of glycolysis, in working vertebrate muscle. Biochem. Soc. Trans. 2002
    Frizzled cysteine-rich domain

    The Frizzled CRD (cysteine rich domain) is conserved in diverse proteins including several receptor tyrosine kinases [, , ]. In Drosophila melanogaster, members of the Frizzled family of tissue-polarity genes encode proteins that appear to function as cell-surface receptors for Wnts. The Frizzled genes belong to the seven transmembrane class of receptors (7TMR) and have in their extracellular region a cysteine-rich domain that has been implicated as the Wnt binding domain. The cysteine-rich domain of Frizzled shares sequence similarity with several receptor tyrosine kinases that have roles in development, including the muscle-specific receptor tyrosine kinase (MuSK), the neuronal specific kinase (NSK2), and ROR1 and ROR2. The structure of this domain is known and is composed mainly of alpha helices. This domain contains ten conserved cysteines that form five disulphide bridges.

    Xu YK, Nusse R. The Frizzled CRD domain is conserved in diverse proteins including several receptor tyrosine kinases. Curr. Biol. 1998 Saldanha J, Singh J, Mahadevan D. Identification of a Frizzled-like cysteine rich domain in the extracellular region of developmental receptor tyrosine kinases. Protein Sci. 1998 Rehn M, Pihlajaniemi T, Hofmann K, Bucher P. The frizzled motif: in how many different protein families does it occur? Trends Biochem. Sci. 1998
    Alkylbase DNA glycosidase, conserved site

    Alkylbase DNA glycosidases [] are DNA repair enzymes that hydrolyse the deoxyribose N-glycosidic bond to excise various alkylated bases from a damaged DNA polymer. In Escherichia coli there are two alkylbase DNA glycosidases: one (gene tag) which is constitutively expressed and which is specific for the removal of 3-methyladenine (), and one (gene alkA) which is induced during adaptation to alkylation and which can remove a variety of alkylation products (). Tag and alkA do not share any region of sequence similarity. In yeast there is an alkylbase DNA glycosidase (gene MAG1) [, ], which can remove 3-methyladenine or 7-methyladenine and which is structurally related to alkA. MAG and alkA are both proteins of about 300 amino acid residues. While the C- and N-terminal ends appear to be unrelated, there is a central region of about 130 residues which is well conserved.

    Molecular Function alkylbase DNA N-glycosylase activity Biological Process DNA repair Lindahl T, Sedgwick B, Sekiguchi M, Nakabeppu Y. Regulation and expression of the adaptive response to alkylating agents. Annu. Rev. Biochem. 1988 Berdal KG, Bjoras M, Bjelland S, Seeberg E. Cloning and expression in Escherichia coli of a gene for an alkylbase DNA glycosylase from Saccharomyces cerevisiae; a homologue to the bacterial alkA gene. EMBO J. 1990 Chen J, Derfler B, Samson L. Saccharomyces cerevisiae 3-methyladenine DNA glycosylase has homology to the AlkA glycosylase of E. coli and is induced in response to DNA alkylation damage. EMBO J. 1990
    test.tigrxml100644000766000024 623214006427353 16766 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data
    chr9 Cryptococcus neoformans
    MSAHSGSCIPSTSCPSSTVSINGTCVTCPLDCATCSTASTCSTCPSDRPILKNGRCIAYCATDTYYDTSTGTCQACDWTCKNCVGEGSAMCSSCSDGYMLKDGVCVDALCGDAGFANGFGMCFSSFVHKSQKRYLGLLALVGVAIIAGIASWWYVRRERRKTRQATKEFGKRLDERNVNDRLSALRLEKVFGFNRVTFGRGGDRSARTTQEDGGKKNKLRELLLPSKRRSGNEEMEMKKSNFAPDKERDCYDSWRTSNFGKDNWVAPPPYVPSQGVPTPVDVKHTFNKRDSLDSIPTPSHQTFAPSSSTSSFTITRPATPPRKLQNPYLGSTIIHSMSTPSPPPHSRSLMPPPRPGMGRRESGNSFSSGSLWTPMTGMTSITKITADKERDVRRYSGRQDRQMDVERRPTDYDLL* MPRFYENKYPEVDQLVMVQVQSIEDMGAYVKLLEYDNIEGMILLSELSRRRIRSVQKLIRVGRNEVVVVMRVDPDKGYIDLSKRRVSAEEVVKCEEQYEKGKAVDSIITQVAKKRGVTPESLYEKIAWPLHRQYGHAYEAFKLSISEPEAVFGSLELDEETLADLRSGIARRLTPKPVKVRADIEVKCFSYAGIDAIKRALTAGEAVSTPDVPIKVRLVAPPLYVMSTTSTDKNAAIELMEKAVEVIGETVRKDKGDITIKMKPKVVSETEDAELKALMEQFEAANMDQAGDDESSEEDE*
    match.output100755000766000024 360751614006427353 17057 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataSearch for sites by WeightMatrix library: ../../cgi-bin/data/matrix.dat Sequence file: default.seq.EXAMPLE Site selection profile: minSUM_good104.prf prf to minimize sum of both errors, high qual. Inspecting sequence ID RNTATFL V$MYOD_01 | 4338 (+) | 1.000 | 0.945 | atcCAGGTggtg V$MYOD_01 | 7390 (+) | 1.000 | 0.932 | gtcCAGGTggta V$MYOD_01 | 8503 (+) | 1.000 | 0.941 | ggaCAGGTgcgg V$MYOD_01 | 8767 (-) | 1.000 | 0.937 | cagcACCTGgga V$E47_01 | 33 (-) | 0.693 | 0.779 | gtgaacAACTTcaga V$E47_01 | 40 (+) | 0.934 | 0.858 | acttCAGATgtcatt V$E47_01 | 122 (-) | 0.924 | 0.753 | ctggctAACTGctca V$E47_01 | 143 (-) | 0.703 | 0.776 | ccagacATTTGctca V$E47_01 | 670 (+) | 0.712 | 0.753 | actgTAGCTgctctt V$E47_01 | 715 (+) | 0.934 | 0.829 | tttaCAGATggttgt V$E47_01 | 1025 (+) | 0.924 | 0.854 | ttggCAGTTgaatca V$E47_01 | 1124 (+) | 0.769 | 0.812 | tatgCACGTgtgttc V$E47_01 | 1174 (+) | 0.703 | 0.765 | aaagGAGATgggcac V$E47_01 | 1355 (-) | 1.000 | 0.860 | tgggtcACCTGatgc V$E47_01 | 1390 (+) | 1.000 | 0.930 | agggCAGGTgaggag V$E47_01 | 1482 (+) | 0.703 | 0.750 | ctggCAGAGggaggg V$E47_01 | 1678 (+) | 0.769 | 0.753 | ccttCGGGTgtcagc V$E47_01 | 1707 (-) | 0.703 | 0.760 | agcaccCTCTGcctt V$E47_01 | 1783 (-) | 1.000 | 0.927 | tggagcACCTGccat V$E47_01 | 1857 (-) | 0.924 | 0.823 | agagccAACTGaaga V$E47_01 | 1968 (+) | 1.000 | 0.807 | tgagCAGGTtggcct V$E47_01 | 2005 (-) | 0.703 | 0.773 | ctatacATCTCctga V$E47_01 | 2423 (+) | 0.769 | 0.753 | atcaCAGGGgttggg V$E47_01 | 2544 (+) | 1.000 | 0.755 | ataaCAGGTttctgt V$E47_01 | 2580 (-) | 0.693 | 0.780 | tacaacAAATGcagt V$E47_01 | 2642 (-) | 1.000 | 0.761 | cccctgACCTGcgta V$E47_01 | 2771 (+) | 0.769 | 0.768 | tccgCATGTgagatt V$E47_01 | 2824 (+) | 0.924 | 0.842 | tgtcCAGTTgtctgc V$E47_01 | 2834 (+) | 0.703 | 0.755 | tctgCAAATggcatc V$E47_01 | 2968 (+) | 0.934 | 0.898 | acggCAGATgcgcag V$E47_01 | 3007 (+) | 0.769 | 0.820 | tgcgCAGGCgtctcg 
V$E47_01 | 3049 (-) | 1.000 | 0.819 | ggacatACCTGcgag V$E47_01 | 3295 (-) | 0.703 | 0.787 | gtgaacATCTCcatg V$E47_01 | 3555 (+) | 0.712 | 0.768 | ttagTAGCTgtgtgg V$E47_01 | 3607 (+) | 0.924 | 0.766 | tctgCAGTTttcaaa V$E47_01 | 3899 (+) | 1.000 | 0.748 | atcaCAGGTattaat V$E47_01 | 4041 (+) | 0.769 | 0.806 | ccagCAGGGggaaat V$E47_01 | 4116 (+) | 0.934 | 0.825 | ttccCAGATgttaaa V$E47_01 | 4184 (+) | 0.703 | 0.756 | tgagCAGAAggctgc V$E47_01 | 4337 (+) | 1.000 | 0.871 | catcCAGGTggtgga V$E47_01 | 4350 (+) | 0.769 | 0.797 | gaggCAGGAggatca V$E47_01 | 4525 (-) | 0.693 | 0.770 | gacaacAAATGcgtc V$E47_01 | 4563 (-) | 0.712 | 0.799 | ggacacAGTTGcggt V$E47_01 | 4564 (+) | 0.924 | 0.807 | gacaCAGTTgcggtt V$E47_01 | 4589 (-) | 0.769 | 0.807 | tcctccTCCTGcccg V$E47_01 | 4645 (-) | 0.769 | 0.781 | cccggcACCTTctct V$E47_01 | 4736 (+) | 0.703 | 0.761 | ctcgCTGATggtcct V$E47_01 | 4763 (-) | 0.712 | 0.757 | ggtcccAGCAGcgag V$E47_01 | 4809 (+) | 0.693 | 0.767 | ctggCATTTgttacc V$E47_01 | 4902 (-) | 0.693 | 0.756 | ttggccAAGTGcatg V$E47_01 | 4956 (+) | 0.712 | 0.760 | tctgCAGCAggaagt V$E47_01 | 4979 (-) | 1.000 | 0.748 | gcccctACCTGtccg V$E47_01 | 5056 (+) | 1.000 | 0.787 | taagCAGGTagtgag V$E47_01 | 5125 (+) | 0.934 | 0.844 | gaaaCAGATggtgcg V$E47_01 | 5286 (+) | 0.934 | 0.838 | ggctCAGATgggcta V$E47_01 | 5493 (-) | 0.703 | 0.758 | ccttccCTCTGctcc V$E47_01 | 5871 (-) | 0.703 | 0.773 | tggaccATCTCctgc V$E47_01 | 5874 (-) | 0.769 | 0.784 | accatcTCCTGctgg V$E47_01 | 6033 (-) | 0.934 | 0.839 | ggacccATCTGgtgc V$E47_01 | 6061 (-) | 0.769 | 0.779 | ccctgcACGTGctgc V$E47_01 | 6062 (+) | 0.769 | 0.785 | cctgCACGTgctgcc V$E47_01 | 6152 (-) | 0.769 | 0.778 | tggctcTCCTGcctc V$E47_01 | 6259 (-) | 0.934 | 0.907 | tgtcccATCTGccga V$E47_01 | 6494 (+) | 0.934 | 0.877 | attgCAGATgcggcc V$E47_01 | 6656 (+) | 0.934 | 0.813 | caaaCAGATgaagat V$E47_01 | 6779 (+) | 0.934 | 0.784 | acagCAGATctaatg V$E47_01 | 6842 (-) | 0.924 | 0.821 | actgccAACTGgtga V$E47_01 | 6985 (+) | 0.769 | 0.750 | ttagCTGGTgaggag 
V$E47_01 | 7006 (+) | 0.769 | 0.802 | aaggCAGGCggatgt V$E47_01 | 7010 (+) | 0.703 | 0.783 | caggCGGATgtttcc V$E47_01 | 7025 (-) | 0.943 | 0.803 | tgcatcAGCTGaaga V$E47_01 | 7026 (+) | 0.943 | 0.808 | gcatCAGCTgaagag V$E47_01 | 7112 (+) | 0.769 | 0.774 | accaCATGTgtgcaa V$E47_01 | 7160 (+) | 0.712 | 0.774 | cctgGAGCTggcgtt V$E47_01 | 7172 (+) | 0.934 | 0.912 | gttgCAGATggtccc V$E47_01 | 7389 (+) | 1.000 | 0.864 | tgtcCAGGTggtaga V$E47_01 | 7509 (+) | 0.934 | 0.826 | gtgtCAGATggcctg V$E47_01 | 7531 (+) | 0.924 | 0.831 | gttaCAGTTggttgt V$E47_01 | 7545 (-) | 0.924 | 0.828 | tgagccAACTGtgta V$E47_01 | 7587 (-) | 0.693 | 0.780 | tcaaacAAATGctct V$E47_01 | 7617 (-) | 0.712 | 0.759 | ctctccAGCCGcctg V$E47_01 | 7677 (-) | 0.943 | 0.856 | gctcacAGCTGtaaa V$E47_01 | 7678 (+) | 0.943 | 0.860 | ctcaCAGCTgtaaag V$E47_01 | 7710 (+) | 0.703 | 0.753 | gaggCAGAAggatca V$E47_01 | 7722 (+) | 0.924 | 0.753 | tcagCAGTTtgggcc V$E47_01 | 7810 (-) | 0.693 | 0.754 | aacgacAACTCcaaa V$E47_01 | 8166 (+) | 1.000 | 0.801 | gaggCAGGTagcacc V$E47_01 | 8238 (+) | 0.934 | 0.798 | caccCAGATgaaggt V$E47_01 | 8244 (+) | 0.769 | 0.820 | gatgAAGGTgtgttc V$E47_01 | 8274 (-) | 0.943 | 0.893 | tgatgcAGCTGccct V$E47_01 | 8275 (+) | 0.943 | 0.892 | gatgCAGCTgccctg V$E47_01 | 8301 (-) | 0.703 | 0.772 | atgcccCTCTGccct V$E47_01 | 8345 (-) | 0.769 | 0.791 | tggaccACTTGccaa V$E47_01 | 8355 (-) | 0.769 | 0.757 | gccaacCCCTGtgca V$E47_01 | 8367 (-) | 0.712 | 0.775 | gcagccTGCTGctct V$E47_01 | 8387 (-) | 1.000 | 0.752 | ctgtatACCTGaagc V$E47_01 | 8418 (-) | 0.943 | 0.830 | gtaagcAGCTGttct V$E47_01 | 8419 (+) | 0.943 | 0.933 | taagCAGCTgttcta V$E47_01 | 8501 (-) | 0.712 | 0.807 | agggacAGGTGcggg V$E47_01 | 8502 (+) | 1.000 | 0.865 | gggaCAGGTgcggga V$E47_01 | 8518 (-) | 0.712 | 0.782 | actcacAGATGcttc V$E47_01 | 8519 (+) | 0.934 | 0.809 | ctcaCAGATgcttca V$E47_01 | 8749 (-) | 0.769 | 0.748 | gcccacACCTTgtcc V$E47_01 | 8765 (-) | 1.000 | 0.836 | cccagcACCTGggag V$E47_01 | 8766 (+) | 0.712 | 0.773 | ccagCACCTgggagg 
V$E47_01 | 8788 (+) | 0.703 | 0.775 | aaagAAGATgtagag V$E47_01 | 8806 (-) | 0.712 | 0.787 | ggggccAGCTTctgg V$E47_01 | 8967 (+) | 1.000 | 0.889 | gaacCAGGTgtggtg V$E47_01 | 8982 (-) | 0.769 | 0.751 | gtacacACCTTtaat V$E47_01 | 8998 (+) | 0.703 | 0.769 | ccagCACATggaggc V$E47_01 | 9008 (+) | 0.769 | 0.771 | gaggCAGGGgcaggc V$E47_01 | 9018 (+) | 0.934 | 0.782 | caggCAGATctttgt V$E47_01 | 9245 (-) | 0.943 | 0.788 | taggagAGCTGctca V$E47_01 | 9338 (-) | 0.769 | 0.761 | gaggacCCCTGaagt V$E47_01 | 9598 (+) | 0.693 | 0.757 | ccagCATTTgggaag V$E47_01 | 10617 (+) | 0.693 | 0.765 | gaggGAGTTgtagaa V$E47_01 | 10692 (-) | 0.769 | 0.762 | ggcaacACCCGtgtt V$E47_01 | 10886 (-) | 0.769 | 0.763 | cactgcACATGcgca V$E47_01 | 11001 (-) | 1.000 | 0.779 | cagattACCTGcttt V$E47_01 | 11072 (-) | 0.943 | 0.768 | gttacaAGCTGctct V$E47_01 | 11544 (+) | 0.703 | 0.749 | atggTAGATgggtat V$E47_01 | 11707 (-) | 0.924 | 0.910 | caccccAACTGccct V$E47_01 | 11805 (+) | 0.769 | 0.792 | ttggAAGGTggccca V$E47_01 | 11958 (-) | 1.000 | 0.848 | gactgcACCTGagct V$E47_01 | 11959 (+) | 0.712 | 0.754 | actgCACCTgagctc V$VMYB_01 | 36 (+) | 0.790 | 0.786 | aacAACTTca V$VMYB_01 | 69 (-) | 0.904 | 0.888 | taCTGTTgtt V$VMYB_01 | 125 (+) | 0.938 | 0.893 | gctAACTGct V$VMYB_01 | 161 (-) | 1.000 | 0.916 | ctCCGTTccc V$VMYB_01 | 452 (-) | 0.904 | 0.856 | gtCTGTTttg V$VMYB_01 | 570 (+) | 0.852 | 0.792 | ataAACGTct V$VMYB_01 | 580 (-) | 0.904 | 0.838 | acCTGTTgaa V$VMYB_01 | 589 (+) | 0.904 | 0.878 | attAACAGac V$VMYB_01 | 790 (+) | 0.910 | 0.865 | cttAACCGct V$VMYB_01 | 858 (-) | 0.852 | 0.798 | gaTCGTTgcc V$VMYB_01 | 984 (-) | 0.938 | 0.899 | caCAGTTaat V$VMYB_01 | 1027 (-) | 0.938 | 0.869 | ggCAGTTgaa V$VMYB_01 | 1057 (+) | 0.790 | 0.785 | aacAACTTta V$VMYB_01 | 1140 (+) | 0.910 | 0.854 | taaAACCGtt V$VMYB_01 | 1143 (-) | 1.000 | 0.966 | aaCCGTTact V$VMYB_01 | 1461 (-) | 0.852 | 0.798 | ggACGTTtct V$VMYB_01 | 1526 (-) | 0.904 | 0.865 | agCTGTTttt V$VMYB_01 | 1860 (+) | 0.938 | 0.887 | gccAACTGaa V$VMYB_01 | 2375 (-) | 
0.938 | 0.877 | caCAGTTttc V$VMYB_01 | 2543 (+) | 0.904 | 0.898 | aatAACAGgt V$VMYB_01 | 2553 (-) | 0.904 | 0.880 | ttCTGTTgtg V$VMYB_01 | 2679 (-) | 1.000 | 0.930 | acCCGTTgca V$VMYB_01 | 2826 (-) | 0.938 | 0.905 | tcCAGTTgtc V$VMYB_01 | 2865 (+) | 0.904 | 0.837 | ctgAACAGaa V$VMYB_01 | 3345 (-) | 0.904 | 0.845 | tcCTGTTcat V$VMYB_01 | 3400 (-) | 0.938 | 0.923 | tcCAGTTata V$VMYB_01 | 3443 (+) | 0.904 | 0.866 | ggtAACAGtt V$VMYB_01 | 3446 (-) | 0.938 | 0.903 | aaCAGTTaat V$VMYB_01 | 3510 (-) | 0.904 | 0.870 | tgCTGTTaga V$VMYB_01 | 3546 (-) | 0.904 | 0.876 | ttCTGTTctt V$VMYB_01 | 3576 (-) | 0.790 | 0.795 | gcAAGTTatt V$VMYB_01 | 3609 (-) | 0.938 | 0.891 | tgCAGTTttc V$VMYB_01 | 3621 (-) | 0.852 | 0.797 | atGCGTTtat V$VMYB_01 | 3815 (-) | 0.852 | 0.799 | caTCGTTttc V$VMYB_01 | 3999 (+) | 0.852 | 0.841 | aacAACGAta V$VMYB_01 | 4214 (-) | 0.938 | 0.890 | ctCAGTTaaa V$VMYB_01 | 4319 (+) | 0.938 | 0.875 | taaAACTGgg V$VMYB_01 | 4371 (-) | 0.790 | 0.791 | taAAGTTatc V$VMYB_01 | 4537 (+) | 0.904 | 0.840 | gtcAACAGcc V$VMYB_01 | 4566 (-) | 0.938 | 0.872 | caCAGTTgcg V$VMYB_01 | 4572 (-) | 0.910 | 0.875 | tgCGGTTaga V$VMYB_01 | 4942 (+) | 0.938 | 0.848 | ctgAACTGgg V$VMYB_01 | 5096 (-) | 0.762 | 0.785 | ttGGGTTatt V$VMYB_01 | 5124 (+) | 0.904 | 0.837 | ggaAACAGat V$VMYB_01 | 5149 (-) | 0.938 | 0.856 | ggCAGTTtac V$VMYB_01 | 5279 (+) | 0.904 | 0.878 | aacAACAGgc V$VMYB_01 | 5322 (-) | 0.910 | 0.833 | atCGGTTcag V$VMYB_01 | 5727 (+) | 0.938 | 0.865 | ctaAACTGga V$VMYB_01 | 6107 (-) | 0.904 | 0.849 | tcCTGTTgac V$VMYB_01 | 6386 (+) | 0.938 | 0.868 | acgAACTGgc V$VMYB_01 | 6403 (+) | 0.852 | 0.800 | cagAACGCtc V$VMYB_01 | 6541 (+) | 1.000 | 0.928 | acaAACGGct V$VMYB_01 | 6571 (-) | 0.938 | 0.911 | caCAGTTatg V$VMYB_01 | 6655 (+) | 0.904 | 0.833 | gcaAACAGat V$VMYB_01 | 6775 (+) | 0.904 | 0.861 | caaAACAGca V$VMYB_01 | 6845 (+) | 0.938 | 0.872 | gccAACTGgt V$VMYB_01 | 6976 (+) | 0.852 | 0.796 | aggAACGAct V$VMYB_01 | 7168 (-) | 0.852 | 0.807 | tgGCGTTgca V$VMYB_01 | 7185 (+) | 
1.000 | 0.911 | ccgAACGGcc V$VMYB_01 | 7491 (-) | 0.904 | 0.867 | gtCTGTTctt V$VMYB_01 | 7521 (+) | 0.938 | 0.868 | ctgAACTGaa V$VMYB_01 | 7533 (-) | 0.938 | 0.895 | taCAGTTggt V$VMYB_01 | 7548 (+) | 0.938 | 0.868 | gccAACTGtg V$VMYB_01 | 7600 (+) | 0.938 | 0.853 | ctgAACTGct V$VMYB_01 | 7724 (-) | 0.938 | 0.858 | agCAGTTtgg V$VMYB_01 | 7807 (+) | 0.852 | 0.841 | aacAACGAca V$VMYB_01 | 7930 (-) | 0.852 | 0.799 | atTCGTTtgt V$VMYB_01 | 7966 (-) | 0.790 | 0.785 | taTAGTTgtt V$VMYB_01 | 8003 (-) | 0.904 | 0.855 | gcCTGTTggt V$VMYB_01 | 8197 (-) | 0.904 | 0.831 | gcCTGTTtcc V$VMYB_01 | 8215 (-) | 0.938 | 0.880 | ctCAGTTctg V$VMYB_01 | 8424 (-) | 0.904 | 0.847 | agCTGTTcta V$VMYB_01 | 8673 (-) | 0.910 | 0.866 | acCGGTTagg V$VMYB_01 | 8723 (+) | 0.938 | 0.901 | aagAACTGta V$VMYB_01 | 8729 (+) | 0.852 | 0.813 | tgtAACGTcg V$VMYB_01 | 8840 (+) | 0.904 | 0.866 | caaAACAGaa V$VMYB_01 | 8845 (+) | 1.000 | 0.934 | cagAACGGtt V$VMYB_01 | 8848 (-) | 0.910 | 0.828 | aaCGGTTtaa V$VMYB_01 | 8890 (+) | 0.852 | 0.784 | tcaAACGTac V$VMYB_01 | 8901 (+) | 0.904 | 0.846 | acaAACAGat V$VMYB_01 | 9117 (+) | 0.910 | 0.865 | aaaAACCGgg V$VMYB_01 | 9133 (-) | 0.938 | 0.910 | ttCAGTTttt V$VMYB_01 | 9175 (+) | 0.852 | 0.799 | atgAACGCca V$VMYB_01 | 9316 (-) | 0.852 | 0.795 | agGCGTTggg V$VMYB_01 | 9431 (+) | 0.904 | 0.908 | aatAACAGga V$VMYB_01 | 9562 (-) | 0.910 | 0.856 | gtCGGTTtct V$VMYB_01 | 9763 (+) | 0.904 | 0.850 | gaaAACAGtc V$VMYB_01 | 9814 (+) | 0.790 | 0.786 | agtAACTTca V$VMYB_01 | 10056 (+) | 0.938 | 0.907 | aagAACTGaa V$VMYB_01 | 10061 (+) | 0.910 | 0.823 | ctgAACCGgg V$VMYB_01 | 10159 (-) | 0.852 | 0.814 | tgTCGTTttc V$VMYB_01 | 10195 (-) | 0.938 | 0.884 | gtCAGTTctg V$VMYB_01 | 10308 (+) | 0.938 | 0.888 | tttAACTGtt V$VMYB_01 | 10311 (-) | 0.904 | 0.864 | aaCTGTTttt V$VMYB_01 | 10399 (+) | 0.852 | 0.784 | atgAACGAgg V$VMYB_01 | 10441 (+) | 0.852 | 0.798 | tgcAACGTct V$VMYB_01 | 10663 (+) | 0.904 | 0.849 | gtcAACAGga V$VMYB_01 | 10965 (-) | 0.904 | 0.840 | gcCTGTTgga V$VMYB_01 | 
11037 (-) | 0.904 | 0.874 | caCTGTTgtt V$VMYB_01 | 11056 (+) | 0.852 | 0.808 | gttAACGTtg V$VMYB_01 | 11058 (-) | 0.852 | 0.827 | taACGTTgtg V$VMYB_01 | 11068 (-) | 0.938 | 0.900 | atCAGTTaca V$VMYB_01 | 11097 (-) | 0.904 | 0.819 | caCTGTTtaa V$VMYB_01 | 11106 (+) | 0.852 | 0.815 | aagAACGAct V$VMYB_01 | 11138 (-) | 0.938 | 0.882 | ctCAGTTttg V$VMYB_01 | 11217 (+) | 0.938 | 0.852 | ttgAACTGcc V$VMYB_01 | 11225 (-) | 0.938 | 0.867 | ccCAGTTtgt V$VMYB_01 | 11235 (+) | 0.938 | 0.900 | tctAACTGga V$VMYB_01 | 11304 (+) | 0.852 | 0.852 | aatAACGAct V$VMYB_01 | 11353 (+) | 0.852 | 0.785 | tcgAACGAga V$VMYB_01 | 11494 (-) | 0.910 | 0.853 | tgCGGTTgaa V$VMYB_01 | 11517 (-) | 1.000 | 0.961 | taCCGTTgtc V$VMYB_01 | 11654 (-) | 0.904 | 0.869 | taCTGTTaac V$VMYB_01 | 11710 (+) | 0.938 | 0.873 | cccAACTGcc V$VMYB_01 | 11762 (+) | 0.904 | 0.869 | gttAACAGga V$VMYB_01 | 11831 (+) | 0.904 | 0.855 | agaAACAGca V$VMYB_01 | 11921 (+) | 0.852 | 0.795 | cccAACGCcc V$CMYB_01 | 63 (+) | 0.988 | 0.917 | cataattactGTTGTtga V$CMYB_01 | 574 (+) | 0.989 | 0.882 | acgtctacctGTTGAatt V$CMYB_01 | 715 (+) | 0.988 | 0.876 | tttacagatgGTTGTgag V$CMYB_01 | 733 (+) | 0.989 | 0.862 | ccaccacgtgGTTGCtgg V$CMYB_01 | 788 (-) | 0.895 | 0.861 | ctcTTAACcgctgagcca V$CMYB_01 | 1021 (+) | 0.989 | 0.930 | gtgattggcaGTTGAatc V$CMYB_01 | 1858 (-) | 1.000 | 0.899 | gagCCAACtgaagaaggg V$CMYB_01 | 2547 (+) | 0.988 | 0.893 | acaggtttctGTTGTgaa V$CMYB_01 | 2673 (+) | 0.989 | 0.874 | ctcttaacccGTTGCatc V$CMYB_01 | 2820 (+) | 0.988 | 0.907 | gctctgtccaGTTGTctg V$CMYB_01 | 3235 (-) | 1.000 | 0.869 | cagCCAACattcccagcc V$CMYB_01 | 3504 (+) | 0.906 | 0.863 | ctaccctgctGTTAGagg V$CMYB_01 | 4535 (-) | 0.989 | 0.940 | gcgTCAACagcctcttag V$CMYB_01 | 4560 (+) | 0.989 | 0.921 | gctggacacaGTTGCggt V$CMYB_01 | 5277 (-) | 0.988 | 0.931 | ccaACAACaggctcagat V$CMYB_01 | 6101 (+) | 0.989 | 0.878 | gatcagtcctGTTGAcca V$CMYB_01 | 6843 (-) | 1.000 | 0.908 | ctgCCAACtggtgagcac V$CMYB_01 | 7183 (-) | 0.880 | 0.874 | tccCGAACggccacgggg 
V$CMYB_01 | 7527 (+) | 1.000 | 0.940 | tgaagttacaGTTGGttg V$CMYB_01 | 7531 (+) | 0.988 | 0.860 | gttacagttgGTTGTgag V$CMYB_01 | 7546 (-) | 1.000 | 0.905 | gagCCAACtgtgtagatg V$CMYB_01 | 7960 (+) | 0.988 | 0.859 | ccagactataGTTGTttg V$CMYB_01 | 7997 (+) | 1.000 | 0.929 | cactgtgcctGTTGGtgt V$CMYB_01 | 9556 (+) | 0.869 | 0.866 | tcgcctgtcgGTTTCtgg V$CMYB_01 | 10401 (+) | 1.000 | 0.858 | gaacgaggtgGTTGGcat V$CMYB_01 | 10661 (-) | 0.989 | 0.875 | atgTCAACaggaggatcc V$CMYB_01 | 10690 (-) | 0.989 | 0.863 | ctgGCAACacccgtgttt V$CMYB_01 | 10959 (+) | 1.000 | 0.913 | gtctgtgcctGTTGGagc V$CMYB_01 | 11031 (+) | 0.988 | 0.938 | tcctaccactGTTGTtgg V$CMYB_01 | 11488 (+) | 0.989 | 0.907 | ttggaatgcgGTTGAatt V$CMYB_01 | 11511 (+) | 0.988 | 0.923 | ggatattaccGTTGTccg V$CMYB_01 | 11708 (-) | 1.000 | 0.960 | accCCAACtgccctccac V$CMYB_01 | 11919 (-) | 1.000 | 0.872 | ggcCCAACgcccattggc V$AP4_01 | 7025 (+) | 1.000 | 0.865 | tgcatCAGCTgaagagtt V$MEF2_01 | 627 (-) | 1.000 | 0.755 | acacttttTTTTAaac V$MEF2_01 | 1157 (-) | 0.800 | 0.747 | gtagttctTTTTCaaa V$MEF2_01 | 2405 (+) | 1.000 | 0.755 | cctTAAAAaaaacaaa V$MEF2_01 | 2527 (-) | 1.000 | 0.757 | catattacTTTTAgag V$MEF2_01 | 3753 (-) | 0.800 | 0.730 | ttagtcatTCTTAaaa V$MEF2_01 | 4219 (+) | 0.800 | 0.747 | ttaAAAAAataaattc V$MEF2_01 | 4422 (+) | 1.000 | 0.736 | gtcTAAAAaaaaaaaa V$MEF2_01 | 5097 (-) | 0.600 | 0.763 | tgggttatTATTTgtg V$MEF2_01 | 5806 (+) | 0.800 | 0.731 | aaaAAAAAatagccct V$MEF2_01 | 7637 (-) | 0.800 | 0.797 | agaattatTTTAAaaa V$MEF2_01 | 8486 (-) | 0.600 | 0.749 | ggggttatTTTCTaca V$MEF2_01 | 8697 (+) | 0.800 | 0.763 | ttcTGAAAataccatc V$MEF2_01 | 9833 (-) | 1.000 | 0.799 | agggataaTTTTAaag V$MEF2_01 | 11297 (+) | 0.800 | 0.746 | aaaTAATAataacgac V$ELK1_01 | 606 (-) | 0.933 | 0.849 | ttaaaCATCCtgtcgt V$ELK1_01 | 921 (-) | 1.000 | 0.804 | tagcaCTTCCacgaaa V$ELK1_01 | 1455 (+) | 0.900 | 0.812 | tgagccGGACGtttct V$ELK1_01 | 1726 (-) | 0.933 | 0.785 | gggctCATCCtgaact V$ELK1_01 | 1735 (-) | 1.000 | 0.873 | ctgaaCTTCCtataga 
V$ELK1_01 | 2151 (-) | 1.000 | 0.824 | ccttcCTTCCtcttcc V$ELK1_01 | 2157 (-) | 1.000 | 0.773 | ttcctCTTCCtttcct V$ELK1_01 | 2201 (-) | 1.000 | 0.812 | tctttCTTCCtttctt V$ELK1_01 | 2682 (-) | 0.933 | 0.773 | cgttgCATCCtggtca V$ELK1_01 | 2703 (-) | 1.000 | 0.803 | ttctcCTTCCtcttcc V$ELK1_01 | 2985 (-) | 1.000 | 0.867 | atctaCTTCCgctcct V$ELK1_01 | 3029 (-) | 1.000 | 0.876 | accgaCTTCCgctcca V$ELK1_01 | 3338 (-) | 0.900 | 0.812 | atttcCCTCCtgttca V$ELK1_01 | 4147 (-) | 0.900 | 0.776 | tggtgCCTCCcatttg V$ELK1_01 | 4332 (-) | 0.933 | 0.779 | ggtgaCATCCaggtgg V$ELK1_01 | 4350 (+) | 0.900 | 0.794 | gaggcaGGAGGatcag V$ELK1_01 | 4959 (+) | 1.000 | 0.879 | gcagcaGGAAGtcagg V$ELK1_01 | 5728 (+) | 1.000 | 0.852 | taaactGGAAGccata V$ELK1_01 | 6085 (-) | 0.775 | 0.787 | tgtgaGTTCCtgtgtg V$ELK1_01 | 6272 (+) | 1.000 | 0.778 | gaagaaGGAAGcctct V$ELK1_01 | 6518 (+) | 0.775 | 0.830 | agaacaCGAAGttcac V$ELK1_01 | 6795 (-) | 1.000 | 0.815 | tggctCTTCCtcccct V$ELK1_01 | 6855 (-) | 0.900 | 0.792 | gagcaCGTCCagaccg V$ELK1_01 | 6874 (+) | 1.000 | 0.849 | cttataGGAAGcacca V$ELK1_01 | 7009 (+) | 0.933 | 0.773 | gcaggcGGATGtttcc V$ELK1_01 | 7227 (+) | 1.000 | 0.931 | ccagcaGGAAGttcat V$ELK1_01 | 7981 (+) | 1.000 | 0.801 | ttcaagGGAAGgactt V$ELK1_01 | 8078 (-) | 1.000 | 0.836 | agaaaCTTCCtataca V$ELK1_01 | 8155 (-) | 1.000 | 0.779 | attagCTTCCagaggc V$ELK1_01 | 8190 (-) | 0.775 | 0.775 | ttgctCTGCCtgtttc V$ELK1_01 | 9432 (+) | 1.000 | 0.931 | ataacaGGAAGcccaa V$ELK1_01 | 9465 (+) | 0.933 | 0.904 | tgtacaGGATGttcta V$ELK1_01 | 10009 (+) | 0.900 | 0.792 | gggactGGAGGtatag V$ELK1_01 | 10071 (+) | 1.000 | 0.774 | tcttctGGAAGggcag V$ELK1_01 | 10254 (+) | 1.000 | 0.870 | tgtgcaGGAAGtgatt V$ELK1_01 | 10664 (+) | 0.900 | 0.836 | tcaacaGGAGGatcca V$ELK1_01 | 10841 (+) | 1.000 | 0.794 | atcgcaGGAAGaagcc V$SP1_01 | 118 (+) | 0.962 | 0.923 | gaGGCTGgct V$SP1_01 | 1380 (+) | 0.911 | 0.915 | tgGGCGAggt V$SP1_01 | 3245 (-) | 0.962 | 0.932 | tccCAGCCca V$SP1_01 | 4823 (+) | 1.000 | 0.965 | caGGCGGggt 
V$SP1_01 | 5077 (+) | 0.962 | 0.923 | gaGGCTGgct V$SP1_01 | 5514 (-) | 0.972 | 0.925 | tccCTGCCtt V$SP1_01 | 5895 (-) | 1.000 | 0.938 | ggcCCGCCcc V$SP1_01 | 8121 (-) | 0.972 | 0.943 | accCTGCCtg V$SP1_01 | 8404 (-) | 0.962 | 0.917 | agcCAGCCcg V$SP1_01 | 9008 (+) | 0.972 | 0.919 | gaGGCAGggg V$SP1_01 | 9014 (+) | 0.972 | 0.923 | ggGGCAGgca V$SP1_01 | 9683 (+) | 0.909 | 0.915 | gaGGGGGggt V$SP1_01 | 11847 (+) | 0.894 | 0.918 | ggGGTGGggt P$O2_01 | 1566 (-) | 0.800 | 0.849 | aatctacggggAATGC I$CF2II_01 | 656 (+) | 1.000 | 0.973 | tTATATata I$CF2II_01 | 657 (-) | 1.000 | 0.973 | tatATATAa I$CF2II_02 | 656 (+) | 1.000 | 0.945 | tTATATata I$CF2II_02 | 657 (-) | 1.000 | 0.945 | tatATATAa I$CF2II_02 | 4578 (-) | 1.000 | 0.894 | tagATATAg F$REPCAR1_01 | 7619 (+) | 0.886 | 0.874 | ctcCAGCCgcctggcgg F$ABF1_01 | 2908 (+) | 0.988 | 0.904 | ccacatctctgtggACGGAccc F$ABF1_01 | 3708 (+) | 0.978 | 0.917 | cttgatcaatatttACGGGtga I$E74A_01 | 4958 (+) | 0.947 | 0.908 | tgcagcAGGAAgtcagg I$E74A_01 | 5469 (-) | 0.947 | 0.930 | tactatTTCCTgatttt V$ATF_01 | 35 (-) | 0.800 | 0.802 | gaacaaCTTCAgat V$ATF_01 | 1236 (-) | 0.800 | 0.800 | ctttgaCATCAggg V$ATF_01 | 1443 (-) | 0.800 | 0.804 | tagtgaAGTCActg V$ATF_01 | 1458 (+) | 0.800 | 0.797 | gccGGACGtttctc V$ATF_01 | 1839 (-) | 1.000 | 0.923 | tggggtCGTCAaca V$ATF_01 | 2963 (+) | 1.000 | 0.930 | cagTGACGgcagat V$ATF_01 | 2963 (-) | 0.800 | 0.800 | cagtgaCGGCAgat V$ATF_01 | 3069 (-) | 0.800 | 0.804 | tagctaGGTCAcag V$ATF_01 | 4530 (-) | 1.000 | 0.921 | caaatgCGTCAaca V$ATF_01 | 4861 (-) | 0.800 | 0.799 | gagggaGGTCAgca V$ATF_01 | 5818 (+) | 0.800 | 0.801 | cccTGGCGtggcac V$ATF_01 | 7988 (-) | 0.800 | 0.808 | gaaggaCTTCActg V$ATF_01 | 8208 (+) | 0.800 | 0.811 | ctcTGACCtcagtt V$ATF_01 | 8320 (-) | 1.000 | 0.932 | cttctgCGTCAgcg V$ATF_01 | 8609 (-) | 1.000 | 0.905 | tcattcCGTCAatc V$ATF_01 | 8728 (+) | 0.800 | 0.803 | ctgTAACGtcgggg V$ATF_01 | 9174 (-) | 0.800 | 0.800 | catgaaCGCCAcag V$ATF_01 | 9344 (+) | 0.800 | 0.808 | cccTGAAGtctctt 
V$ATF_01 | 11108 (-) | 0.800 | 0.806 | gaacgaCTTCAgat V$ATF_01 | 11315 (+) | 1.000 | 0.949 | cggTGACGacgatg V$ATF_01 | 11330 (+) | 1.000 | 0.912 | tgaTGACGatgtaa I$DFD_01 | 46 (+) | 1.000 | 0.997 | gatgtcATTACttggg I$DFD_01 | 61 (+) | 1.000 | 0.999 | gccataATTACtgttg I$DFD_01 | 174 (+) | 1.000 | 0.996 | gctggcATTACtaccc I$DFD_01 | 493 (+) | 0.991 | 0.988 | tgcaacATTATtatat I$DFD_01 | 583 (+) | 0.996 | 0.986 | tgttgaATTAAcagac I$DFD_01 | 984 (-) | 0.996 | 0.983 | cacagTTAATaatttt I$DFD_01 | 987 (-) | 0.991 | 0.991 | agttaATAATtttcat I$DFD_01 | 1900 (+) | 0.991 | 0.987 | tattttATTATtttat I$DFD_01 | 1990 (+) | 0.991 | 0.981 | accaagATTATctggc I$DFD_01 | 2524 (+) | 1.000 | 0.996 | ttacatATTACtttta I$DFD_01 | 3103 (-) | 0.991 | 0.990 | ctataCTAATttccat I$DFD_01 | 3121 (+) | 0.996 | 0.992 | ggctatATTAAtttac I$DFD_01 | 3123 (-) | 0.996 | 0.996 | ctataTTAATttacat I$DFD_01 | 3409 (-) | 0.996 | 0.983 | aaaagTTAATgattct I$DFD_01 | 3446 (-) | 0.996 | 0.983 | aacagTTAATaaaaca I$DFD_01 | 3729 (+) | 0.991 | 0.991 | actaaaATTATtattt I$DFD_01 | 3732 (+) | 0.991 | 0.988 | aaaattATTATttcta I$DFD_01 | 3871 (-) | 0.996 | 0.982 | aggagTTAATacaagc I$DFD_01 | 3902 (+) | 0.996 | 0.993 | acaggtATTAAtaaag I$DFD_01 | 3904 (-) | 0.996 | 0.992 | aggtaTTAATaaagct I$DFD_01 | 4026 (+) | 0.996 | 0.996 | catggaATTAAtattc I$DFD_01 | 4028 (-) | 0.996 | 0.992 | tggaaTTAATattcca I$DFD_01 | 4056 (-) | 0.991 | 0.991 | aggcaATAATtaaaat I$DFD_01 | 4246 (+) | 1.000 | 0.997 | gattgtATTACttggt I$DFD_01 | 4443 (+) | 0.996 | 0.996 | cacacaATTAAtattc I$DFD_01 | 4445 (-) | 0.996 | 0.993 | cacaaTTAATattcat I$DFD_01 | 4710 (-) | 1.000 | 0.989 | cttggGTAATtcactg I$DFD_01 | 5003 (+) | 1.000 | 0.986 | ccttgtATTACccatt I$DFD_01 | 5097 (+) | 0.991 | 0.988 | tgggttATTATttgtg I$DFD_01 | 5584 (-) | 0.991 | 0.987 | ggataATAATaaaata I$DFD_01 | 5786 (-) | 0.991 | 0.991 | atgaaATAATtaaata I$DFD_01 | 6806 (+) | 1.000 | 0.986 | cccctcATTACcttct I$DFD_01 | 6954 (+) | 0.991 | 0.988 | ctgtccATTATtctgg I$DFD_01 | 7351 (-) | 0.996 | 
0.986 | acatgTTAATcatggg I$DFD_01 | 7398 (+) | 1.000 | 0.999 | ggtagaATTACtgaag I$DFD_01 | 7634 (+) | 0.991 | 0.991 | ggtagaATTATtttaa I$DFD_01 | 7826 (+) | 0.996 | 0.986 | caaacaATTAAcccac I$DFD_01 | 8681 (-) | 1.000 | 0.989 | ggaagGTAATtagcct I$DFD_01 | 8950 (+) | 0.991 | 0.991 | tttaaaATTAGtaggg I$DFD_01 | 9748 (+) | 0.996 | 0.992 | caatatATTAAtattg I$DFD_01 | 9750 (-) | 0.996 | 0.992 | atataTTAATattgaa I$DFD_01 | 9928 (-) | 1.000 | 0.989 | tatggGTAATttgcat I$DFD_01 | 10124 (+) | 0.991 | 0.990 | aataaaATTAGttaag I$DFD_01 | 10136 (+) | 0.991 | 0.991 | taagaaATTATtatta I$DFD_01 | 10139 (+) | 0.991 | 0.987 | gaaattATTATtattg I$DFD_01 | 10142 (+) | 0.991 | 0.987 | attattATTATtgttg I$DFD_01 | 10169 (+) | 0.991 | 0.987 | gtccttATTAGtgtgt I$DFD_01 | 10913 (-) | 0.991 | 0.988 | tgcaaATAATagtcta I$DFD_01 | 10998 (+) | 1.000 | 0.990 | aatcagATTACctgct I$DFD_01 | 11198 (+) | 0.991 | 0.991 | tgattgATTATtaatt I$DFD_01 | 11201 (+) | 0.996 | 0.993 | ttgattATTAAtttgt I$DFD_01 | 11203 (-) | 0.996 | 0.996 | gattaTTAATttgttt I$DFD_01 | 11294 (-) | 0.991 | 0.987 | aagaaATAATaataac I$DFD_01 | 11297 (-) | 0.991 | 0.988 | aaataATAATaacgac I$DFD_01 | 11509 (+) | 1.000 | 0.986 | gtggatATTACcgttg I$DFD_01 | 11691 (-) | 0.996 | 0.986 | taaagTTAATcactgt I$FTZ_01 | 63 (-) | 0.800 | 0.834 | cATAATtactgt I$FTZ_01 | 467 (-) | 0.800 | 0.815 | cTTATTtgacac I$FTZ_01 | 790 (-) | 0.800 | 0.790 | cTTAACcgctga I$FTZ_01 | 2596 (+) | 1.000 | 0.777 | ctgactATTAAa I$FTZ_01 | 2947 (+) | 0.800 | 0.777 | ctgcctATTACg I$FTZ_01 | 3127 (-) | 1.000 | 0.778 | aTTAATttacat I$FTZ_01 | 3272 (+) | 0.800 | 0.790 | gatgccATTATg I$FTZ_01 | 3306 (+) | 0.800 | 0.819 | catgtaGTTAAg I$FTZ_01 | 3366 (-) | 0.800 | 0.785 | cTGAATagccta I$FTZ_01 | 3430 (-) | 1.000 | 0.971 | cTTAATtgactt I$FTZ_01 | 3499 (-) | 0.800 | 0.826 | cTTAACtaccct I$FTZ_01 | 3861 (+) | 0.800 | 0.831 | gtaccaTTTAAg I$FTZ_01 | 3906 (+) | 0.800 | 0.797 | gtattaATAAAg I$FTZ_01 | 4026 (+) | 1.000 | 0.794 | catggaATTAAt I$FTZ_01 | 4058 (+) | 1.000 | 0.807 | 
gcaataATTAAa I$FTZ_01 | 4295 (-) | 0.800 | 0.779 | cTAAATttgacc I$FTZ_01 | 4443 (+) | 1.000 | 0.820 | cacacaATTAAt I$FTZ_01 | 4548 (-) | 0.800 | 0.780 | cTTAGTtttgat I$FTZ_01 | 5010 (+) | 1.000 | 0.907 | ttacccATTAAg I$FTZ_01 | 5318 (-) | 0.800 | 0.788 | cTTCATcggttc I$FTZ_01 | 5434 (-) | 0.800 | 0.794 | cTTTATttgtat I$FTZ_01 | 5788 (+) | 1.000 | 0.810 | gaaataATTAAa I$FTZ_01 | 5964 (+) | 0.800 | 0.783 | agtgttTTTAAg I$FTZ_01 | 6190 (+) | 0.800 | 0.820 | cacccaTTTAAg I$FTZ_01 | 6458 (-) | 0.800 | 0.787 | cTTATTtcctga I$FTZ_01 | 6589 (+) | 0.800 | 0.848 | aaaacaAATAAg I$FTZ_01 | 7060 (-) | 1.000 | 0.937 | cTTAATttgttt I$FTZ_01 | 7826 (+) | 1.000 | 0.835 | caaacaATTAAc I$FTZ_01 | 8634 (+) | 0.800 | 0.797 | agcttaTTTAAg I$FTZ_01 | 8704 (+) | 0.800 | 0.790 | aataccATCAAg I$FTZ_01 | 8854 (+) | 1.000 | 0.911 | ttaaaaATTAAg I$FTZ_01 | 8920 (+) | 0.800 | 0.780 | ctcggaATTCAg I$FTZ_01 | 8991 (-) | 1.000 | 0.792 | tTTAATtccagc I$FTZ_01 | 9907 (-) | 0.800 | 0.820 | cTTACTtactgg I$FTZ_01 | 9911 (-) | 0.800 | 0.785 | cTTACTggattt I$FTZ_01 | 10128 (+) | 0.800 | 0.814 | aaattaGTTAAg I$FTZ_01 | 10504 (+) | 0.800 | 0.832 | aagtcaATTATg I$FTZ_01 | 10813 (-) | 0.800 | 0.808 | cTAAATcgcagt I$FTZ_01 | 11124 (-) | 0.800 | 0.778 | cTTAACtcgctg I$FTZ_01 | 11155 (+) | 0.800 | 0.787 | atgaaaATGAAg I$FTZ_01 | 11196 (-) | 0.800 | 0.818 | cTTGATtgatta I$FTZ_01 | 11207 (-) | 1.000 | 0.798 | aTTAATttgttt I$FTZ_01 | 11272 (+) | 1.000 | 0.943 | aaggaaATTAAg I$FTZ_01 | 11419 (+) | 0.800 | 0.785 | atttctATTTAg I$FTZ_01 | 11684 (+) | 0.800 | 0.846 | agaccaATAAAg I$FTZ_01 | 11695 (-) | 1.000 | 0.787 | gTTAATcactgt I$KR_01 | 1595 (+) | 1.000 | 0.923 | actGGGTTtg I$KR_01 | 2676 (-) | 1.000 | 1.000 | ttAACCCgtt I$KR_01 | 6439 (-) | 1.000 | 0.948 | tcAACCCagt I$KR_01 | 6747 (-) | 1.000 | 0.923 | aaAACCCact I$KR_01 | 7833 (-) | 1.000 | 0.966 | ttAACCCact I$KR_01 | 8484 (+) | 1.000 | 0.941 | aggGGGTTat I$HB_01 | 220 (-) | 1.000 | 0.983 | tTTTTTtttc I$HB_01 | 1532 (-) | 1.000 | 0.981 | tTTTTTgttc I$HB_01 | 
2515 (+) | 1.000 | 0.969 | gaaaAAAAAt I$HB_01 | 2900 (-) | 1.000 | 0.975 | tTTTTTatcc I$HB_01 | 4425 (+) | 1.000 | 0.968 | taaaAAAAAa I$HB_01 | 4491 (+) | 1.000 | 0.981 | gaacAAAAAa I$HB_01 | 5801 (+) | 1.000 | 0.974 | acaaAAAAAa I$HB_01 | 5802 (+) | 1.000 | 0.976 | caaaAAAAAa I$HB_01 | 6825 (-) | 1.000 | 0.983 | tTTTTTatgt I$HB_01 | 7051 (-) | 1.000 | 0.969 | aTTTTTtttc I$HB_01 | 10337 (-) | 1.000 | 0.968 | gTTTTTattg I$HB_01 | 11469 (-) | 1.000 | 0.974 | tTTTTTattt I$HB_01 | 11503 (-) | 1.000 | 0.968 | aTTTTTgtgg V$E2F_01 | 1718 (-) | 0.800 | 0.757 | cctTTTCCgggctca V$E2F_01 | 5772 (+) | 1.000 | 0.783 | aaggagaGAAAAata V$E2F_01 | 6125 (-) | 1.000 | 0.816 | ttgTTTTCtcggcgt V$E2F_01 | 7087 (-) | 0.800 | 0.766 | tgcTTTGCgcgcacc V$E2F_01 | 8544 (+) | 1.000 | 0.819 | gggccgaGAAAAtga V$E2F_01 | 9119 (+) | 1.000 | 0.808 | aaaccggGAAAAcct V$E2F_01 | 10161 (-) | 1.000 | 0.761 | tcgTTTTCgtcctta V$E2F_01 | 10827 (-) | 1.000 | 0.776 | gccTTTTCaggccca V$ELK1_02 | 404 (+) | 1.000 | 0.921 | caaggCGGAAtggg V$ELK1_02 | 1718 (-) | 1.000 | 0.955 | ccttTTCCGggctc V$ELK1_02 | 2563 (-) | 1.000 | 0.946 | aaatTTCCGtaagt V$ELK1_02 | 2766 (-) | 1.000 | 0.918 | tagaTTCCGcatgt V$ELK1_02 | 2846 (-) | 1.000 | 0.921 | atcaTTCCGtcctt V$ELK1_02 | 2987 (-) | 1.000 | 0.958 | ctacTTCCGctcct V$ELK1_02 | 3031 (-) | 1.000 | 0.961 | cgacTTCCGctcca V$ELK1_02 | 3161 (-) | 1.000 | 0.919 | tccaTTCCGcctgt V$ELK1_02 | 4959 (+) | 0.938 | 0.939 | gcagcAGGAAgtca V$ELK1_02 | 5471 (-) | 0.938 | 0.918 | ctatTTCCTgattt V$ELK1_02 | 5728 (+) | 0.924 | 0.918 | taaacTGGAAgcca V$ELK1_02 | 6087 (-) | 0.938 | 0.918 | tgagTTCCTgtgtg V$ELK1_02 | 6459 (-) | 0.938 | 0.922 | ttatTTCCTgataa V$ELK1_02 | 7227 (+) | 0.938 | 0.943 | ccagcAGGAAgttc V$ELK1_02 | 8608 (-) | 1.000 | 0.921 | atcaTTCCGtcaat V$ELK1_02 | 8917 (+) | 1.000 | 0.923 | atactCGGAAttca V$ELK1_02 | 9432 (+) | 0.938 | 0.928 | ataacAGGAAgccc V$ELK1_02 | 10254 (+) | 0.938 | 0.939 | tgtgcAGGAAgtga V$ELK1_02 | 10841 (+) | 0.938 | 0.919 | atcgcAGGAAgaag F$ABAA_01 | 3129 
(+) | 1.000 | 0.983 | taatttaCATTCccacaac F$HSF_01 | 1160 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 1538 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 1748 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 2871 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 3545 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 3550 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 3693 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 4680 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 5185 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 5662 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 6404 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 6518 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 6914 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 7412 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 7495 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 7695 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 7805 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 8181 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 8219 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 8428 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 8534 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 8724 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 8846 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 8966 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 9364 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 9404 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 9475 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 9534 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 9780 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 10051 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 10057 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 10199 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 10773 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 11107 (+) | 1.000 | 1.000 | AGAAC F$HSF_01 | 11233 (-) | 1.000 | 1.000 | GTTCT F$HSF_01 | 11885 (-) | 1.000 | 1.000 | GTTCT F$MATA1_01 | 349 (-) | 1.000 | 0.962 | ttctaCATCA F$MATA1_01 | 9800 (-) | 1.000 | 0.996 | atgtaCATCA F$MATALPHA2_01 | 7 (+) | 1.000 | 0.974 | gCATGTacaa F$MATALPHA2_01 | 9 (-) | 0.950 | 0.921 | atgtACAAGt F$MATALPHA2_01 | 15 (-) | 1.000 | 0.956 | aagtACATGt F$MATALPHA2_01 | 19 (+) | 1.000 | 0.977 | aCATGTgaat F$MATALPHA2_01 | 78 (+) | 0.905 | 0.895 | tGATGTtgat 
F$MATALPHA2_01 | 273 (+) | 0.950 | 0.918 | cCTTGTgcat F$MATALPHA2_01 | 322 (+) | 0.950 | 0.915 | tCCTGTgctg F$MATALPHA2_01 | 349 (-) | 0.905 | 0.901 | ttctACATCa F$MATALPHA2_01 | 421 (+) | 0.950 | 0.927 | gCTTGTccaa F$MATALPHA2_01 | 485 (-) | 0.916 | 0.886 | gtcaACATTg F$MATALPHA2_01 | 544 (+) | 0.950 | 0.914 | gCTTGTgttt F$MATALPHA2_01 | 580 (+) | 0.950 | 0.925 | aCCTGTtgaa F$MATALPHA2_01 | 606 (-) | 0.905 | 0.891 | ttaaACATCc F$MATALPHA2_01 | 613 (+) | 0.950 | 0.930 | tCCTGTcgta F$MATALPHA2_01 | 618 (-) | 1.000 | 0.920 | tcgtACATGa F$MATALPHA2_01 | 637 (-) | 0.916 | 0.898 | ttaaACATTt F$MATALPHA2_01 | 837 (-) | 0.950 | 0.919 | taaaACAAGt F$MATALPHA2_01 | 879 (-) | 0.972 | 0.891 | acaaACACGa F$MATALPHA2_01 | 935 (-) | 0.950 | 0.909 | aaccACAAGt F$MATALPHA2_01 | 963 (-) | 0.950 | 0.950 | atttACAGGa F$MATALPHA2_01 | 999 (+) | 1.000 | 0.903 | tCATGTgggg F$MATALPHA2_01 | 1129 (+) | 0.972 | 0.928 | aCGTGTgttc F$MATALPHA2_01 | 1153 (+) | 0.905 | 0.886 | tTATGTagtt F$MATALPHA2_01 | 1237 (-) | 0.905 | 0.914 | tttgACATCa F$MATALPHA2_01 | 1267 (+) | 0.972 | 0.937 | cCGTGTgtac F$MATALPHA2_01 | 1282 (-) | 0.950 | 0.921 | gatgACAGGg F$MATALPHA2_01 | 1298 (+) | 0.950 | 0.912 | aCCTGTgtag F$MATALPHA2_01 | 1421 (-) | 0.972 | 0.893 | tgagACACGc F$MATALPHA2_01 | 1628 (+) | 1.000 | 0.958 | gCATGTccag F$MATALPHA2_01 | 1772 (+) | 0.950 | 0.910 | gCCTGTcctc F$MATALPHA2_01 | 1796 (-) | 0.950 | 0.947 | attcACAGGa F$MATALPHA2_01 | 2358 (+) | 0.950 | 0.915 | aCTTGTattt F$MATALPHA2_01 | 2422 (-) | 0.950 | 0.923 | aatcACAGGg F$MATALPHA2_01 | 2522 (-) | 0.905 | 0.886 | aattACATAt F$MATALPHA2_01 | 2543 (-) | 0.950 | 0.925 | aataACAGGt F$MATALPHA2_01 | 2615 (+) | 1.000 | 0.965 | cCATGTgtat F$MATALPHA2_01 | 2774 (+) | 1.000 | 0.920 | gCATGTgaga F$MATALPHA2_01 | 3131 (-) | 0.916 | 0.908 | atttACATTc F$MATALPHA2_01 | 3169 (+) | 0.950 | 0.924 | gCCTGTacat F$MATALPHA2_01 | 3171 (-) | 0.916 | 0.897 | ctgtACATTa F$MATALPHA2_01 | 3305 (+) | 1.000 | 0.956 | cCATGTagtt F$MATALPHA2_01 | 3345 (+) | 0.950 | 
0.935 | tCCTGTtcat F$MATALPHA2_01 | 3457 (-) | 0.950 | 0.912 | aaacACAGGg F$MATALPHA2_01 | 3640 (-) | 1.000 | 0.958 | taccACATGg F$MATALPHA2_01 | 3694 (-) | 0.950 | 0.923 | gaacACAAGa F$MATALPHA2_01 | 3743 (-) | 0.916 | 0.896 | ttctACATTt F$MATALPHA2_01 | 3877 (-) | 0.950 | 0.923 | taatACAAGc F$MATALPHA2_01 | 3898 (-) | 0.950 | 0.915 | catcACAGGt F$MATALPHA2_01 | 3925 (-) | 0.916 | 0.910 | aattACATTa F$MATALPHA2_01 | 3941 (+) | 0.950 | 0.920 | aCTTGTtcat F$MATALPHA2_01 | 4070 (+) | 0.905 | 0.901 | aTATGTaaaa F$MATALPHA2_01 | 4077 (+) | 0.916 | 0.890 | aAATGTatac F$MATALPHA2_01 | 4121 (+) | 0.905 | 0.900 | aGATGTtaaa F$MATALPHA2_01 | 4399 (-) | 0.950 | 0.904 | caagACAGGt F$MATALPHA2_01 | 4418 (+) | 0.950 | 0.927 | cCTTGTctaa F$MATALPHA2_01 | 4698 (+) | 0.905 | 0.890 | tGATGTccta F$MATALPHA2_01 | 4893 (+) | 1.000 | 0.908 | cCATGTtcct F$MATALPHA2_01 | 5003 (+) | 0.950 | 0.920 | cCTTGTatta F$MATALPHA2_01 | 5210 (+) | 0.950 | 0.923 | tCTTGTgctt F$MATALPHA2_01 | 5239 (-) | 1.000 | 0.968 | gattACATGg F$MATALPHA2_01 | 5249 (-) | 1.000 | 0.974 | gttcACATGt F$MATALPHA2_01 | 5253 (+) | 1.000 | 0.950 | aCATGTggtc F$MATALPHA2_01 | 5279 (-) | 0.950 | 0.914 | aacaACAGGc F$MATALPHA2_01 | 5364 (-) | 0.972 | 0.930 | aaacACACGg F$MATALPHA2_01 | 5605 (-) | 0.950 | 0.932 | taacACAAGa F$MATALPHA2_01 | 6006 (+) | 0.972 | 0.943 | aCGTGTccaa F$MATALPHA2_01 | 6107 (+) | 0.950 | 0.932 | tCCTGTtgac F$MATALPHA2_01 | 6123 (+) | 0.950 | 0.914 | gCTTGTtttc F$MATALPHA2_01 | 6394 (+) | 0.950 | 0.912 | gCCTGTgttc F$MATALPHA2_01 | 6517 (-) | 0.972 | 0.944 | aagaACACGa F$MATALPHA2_01 | 6829 (+) | 0.905 | 0.889 | tTATGTgcta F$MATALPHA2_01 | 6892 (-) | 0.950 | 0.933 | taagACAAGa F$MATALPHA2_01 | 6913 (-) | 0.950 | 0.908 | gagaACAAGg F$MATALPHA2_01 | 7111 (-) | 1.000 | 0.958 | taccACATGt F$MATALPHA2_01 | 7115 (+) | 1.000 | 0.897 | aCATGTgtgc F$MATALPHA2_01 | 7347 (-) | 1.000 | 0.970 | ttcaACATGt F$MATALPHA2_01 | 7351 (+) | 1.000 | 0.979 | aCATGTtaat F$MATALPHA2_01 | 7736 (-) | 0.950 | 0.918 | catcACAGGc 
F$MATALPHA2_01 | 8102 (+) | 0.950 | 0.898 | tCCTGTcaca F$MATALPHA2_01 | 8149 (-) | 0.916 | 0.895 | gaaaACATTa F$MATALPHA2_01 | 8361 (+) | 0.950 | 0.909 | cCCTGTgcag F$MATALPHA2_01 | 8495 (-) | 0.950 | 0.926 | ttctACAGGg F$MATALPHA2_01 | 8756 (+) | 0.950 | 0.907 | cCTTGTcctc F$MATALPHA2_01 | 8999 (-) | 1.000 | 0.944 | cagcACATGg F$MATALPHA2_01 | 9051 (-) | 1.000 | 0.962 | gtctACATGg F$MATALPHA2_01 | 9370 (-) | 0.950 | 0.929 | tatcACAGGg F$MATALPHA2_01 | 9431 (-) | 0.950 | 0.940 | aataACAGGa F$MATALPHA2_01 | 9464 (-) | 0.950 | 0.927 | ctgtACAGGa F$MATALPHA2_01 | 9517 (+) | 1.000 | 0.969 | cCATGTccaa F$MATALPHA2_01 | 9588 (+) | 0.950 | 0.923 | cCCTGTaatc F$MATALPHA2_01 | 9710 (+) | 0.877 | 0.890 | tGGTGTaaaa F$MATALPHA2_01 | 9737 (+) | 0.916 | 0.903 | tAATGTccat F$MATALPHA2_01 | 9798 (+) | 0.916 | 0.890 | aAATGTacat F$MATALPHA2_01 | 9800 (-) | 0.905 | 0.896 | atgtACATCa F$MATALPHA2_01 | 9805 (-) | 0.905 | 0.889 | catcACATAa F$MATALPHA2_01 | 9940 (+) | 1.000 | 0.906 | gCATGTatgt F$MATALPHA2_01 | 9944 (+) | 0.905 | 0.894 | gTATGTtata F$MATALPHA2_01 | 10251 (+) | 0.950 | 0.912 | gCTTGTgcag F$MATALPHA2_01 | 10420 (-) | 0.887 | 0.899 | tttaACACTa F$MATALPHA2_01 | 10659 (+) | 0.916 | 0.915 | tAATGTcaac F$MATALPHA2_01 | 10663 (-) | 0.950 | 0.932 | gtcaACAGGa F$MATALPHA2_01 | 10700 (+) | 0.972 | 0.942 | cCGTGTttat F$MATALPHA2_01 | 10756 (+) | 0.972 | 0.930 | cCGTGTgttt F$MATALPHA2_01 | 10803 (+) | 0.950 | 0.918 | cCTTGTgcat F$MATALPHA2_01 | 10888 (-) | 1.000 | 0.957 | ctgcACATGc F$MATALPHA2_01 | 10934 (+) | 0.950 | 0.930 | tCTTGTggac F$MATALPHA2_01 | 11336 (+) | 0.905 | 0.896 | cGATGTaaat F$MATALPHA2_01 | 11762 (-) | 0.950 | 0.946 | gttaACAGGa V$CETS1P54_01 | 609 (-) | 0.974 | 0.964 | aacaTCCTGt V$CETS1P54_01 | 1458 (+) | 1.000 | 0.968 | gCCGGAcgtt V$CETS1P54_01 | 4962 (+) | 0.974 | 0.973 | gCAGGAagtc V$CETS1P54_01 | 7230 (+) | 0.974 | 0.972 | gCAGGAagtt V$CETS1P54_01 | 9435 (+) | 0.974 | 0.967 | aCAGGAagcc V$CETS1P54_01 | 9468 (+) | 0.974 | 0.964 | aCAGGAtgtt V$CETS1P54_01 | 
10257 (+) | 0.974 | 0.973 | gCAGGAagtg V$P53_01 | 1 (-) | 0.785 | 0.632 | gttcatgcatgtaCAAGTac V$P53_01 | 1626 (+) | 0.879 | 0.634 | caGCATGtccagcctttcct V$P53_01 | 2971 (+) | 0.785 | 0.694 | gcAGATGcgcaggcatctac V$P53_01 | 2971 (-) | 0.785 | 0.694 | gcagatgcgcaggCATCTac V$P53_01 | 7991 (+) | 0.570 | 0.630 | ggACTTCactgtgcctgttg V$P53_01 | 9385 (-) | 0.785 | 0.645 | tgtcagcccctggAATGTgg V$VMAF_01 | 72 (+) | 0.874 | 0.700 | tgttGTTGAtgttgatagg V$VMAF_01 | 78 (+) | 0.874 | 0.756 | tgatGTTGAtagggcctat V$VMAF_01 | 128 (-) | 0.653 | 0.734 | aactgctcagTCAATccag V$VMAF_01 | 295 (+) | 0.580 | 0.718 | ttttACAGActgagctatc V$VMAF_01 | 395 (+) | 0.645 | 0.727 | atttGCCAAcaaggcggaa V$VMAF_01 | 476 (-) | 0.874 | 0.714 | cacttacaagTCAACattg V$VMAF_01 | 1224 (+) | 0.910 | 0.702 | ttctGATGActgctttgac V$VMAF_01 | 1231 (-) | 0.642 | 0.756 | gactgctttgACATCaggg V$VMAF_01 | 1250 (-) | 0.910 | 0.721 | cagcattctgTCATCatcc V$VMAF_01 | 1278 (+) | 0.910 | 0.780 | ctggGATGAcagggacctg V$VMAF_01 | 1349 (-) | 0.870 | 0.797 | cttgcctgggTCACCtgat V$VMAF_01 | 1615 (-) | 0.732 | 0.728 | ggtctcgtggACAGCatgt V$VMAF_01 | 1678 (-) | 1.000 | 0.726 | ccttcgggtgTCAGCtcag V$VMAF_01 | 1837 (-) | 0.874 | 0.713 | attggggtcgTCAACacaa V$VMAF_01 | 2328 (-) | 0.718 | 0.760 | tcttgctatgTGACCctac V$VMAF_01 | 2929 (+) | 1.000 | 0.763 | cttaGCTGAttgcgaagtc V$VMAF_01 | 2936 (-) | 0.801 | 0.801 | gattgcgaagTCTGCctat V$VMAF_01 | 2973 (-) | 0.642 | 0.716 | agatgcgcagGCATCtact V$VMAF_01 | 3067 (-) | 0.642 | 0.701 | tatagctaggTCACAgggt V$VMAF_01 | 3208 (-) | 0.600 | 0.736 | gtatgctaggCAAGCactt V$VMAF_01 | 3224 (-) | 1.000 | 0.742 | cttttaccagTCAGCcaac V$VMAF_01 | 3341 (-) | 0.910 | 0.708 | tccctcctgtTCATCagac V$VMAF_01 | 3941 (+) | 0.723 | 0.714 | acttGTTCAttcagccatc V$VMAF_01 | 4398 (-) | 1.000 | 0.748 | ccaagacaggTCAGCcaca V$VMAF_01 | 4528 (-) | 0.874 | 0.702 | aacaaatgcgTCAACagcc V$VMAF_01 | 4788 (+) | 1.000 | 0.752 | gattGCTGActtgcataat V$VMAF_01 | 4859 (-) | 1.000 | 0.781 | ctgagggaggTCAGCagca V$VMAF_01 | 4966 (-) | 
1.000 | 0.728 | gaagtcaggcTCAGCccct V$VMAF_01 | 4994 (+) | 0.801 | 0.738 | aatgGCAGAccttgtatta V$VMAF_01 | 5076 (+) | 0.747 | 0.725 | agagGCTGGctctgactgg V$VMAF_01 | 5235 (+) | 0.763 | 0.705 | gaacGATTAcatgggttca V$VMAF_01 | 5257 (-) | 0.910 | 0.803 | gtggtcacagTCATCctgg V$VMAF_01 | 5267 (-) | 0.621 | 0.712 | tcatcctgggCCAACaaca V$VMAF_01 | 5310 (-) | 0.910 | 0.742 | aatggcttctTCATCggtt V$VMAF_01 | 6047 (-) | 0.732 | 0.712 | cagccctgtgACAGCcctg V$VMAF_01 | 6067 (+) | 0.732 | 0.752 | acgtGCTGCcacagtctct V$VMAF_01 | 6289 (+) | 0.910 | 0.931 | tgatGATGActgagcaggg V$VMAF_01 | 6493 (+) | 0.801 | 0.720 | gattGCAGAtgcggccaca V$VMAF_01 | 6524 (+) | 0.723 | 0.715 | cgaaGTTCAccctggagac V$VMAF_01 | 6642 (-) | 0.801 | 0.794 | ggcacctcagTCTGCaaac V$VMAF_01 | 6693 (+) | 1.000 | 0.774 | gagtGCTGAagtagagaga V$VMAF_01 | 7019 (-) | 1.000 | 0.789 | gtttcctgcaTCAGCtgaa V$VMAF_01 | 7162 (+) | 0.747 | 0.761 | tggaGCTGGcgttgcagat V$VMAF_01 | 7295 (+) | 1.000 | 0.757 | ccaaGCTGAgagagtaagg V$VMAF_01 | 7603 (+) | 1.000 | 0.714 | aactGCTGAgccatctctc V$VMAF_01 | 7603 (-) | 0.657 | 0.756 | aactgctgagCCATCtctc V$VMAF_01 | 7712 (-) | 1.000 | 0.709 | ggcagaaggaTCAGCagtt V$VMAF_01 | 8122 (-) | 0.801 | 0.798 | ccctgcctggTCTGCtttc V$VMAF_01 | 8268 (+) | 0.910 | 0.801 | acctGATGAtgcagctgcc V$VMAF_01 | 8277 (+) | 0.732 | 0.733 | tgcaGCTGCcctggctccc V$VMAF_01 | 8318 (-) | 1.000 | 0.889 | agcttctgcgTCAGCgcca V$VMAF_01 | 8396 (-) | 0.747 | 0.742 | tgaagccaagCCAGCccgg V$VMAF_01 | 8409 (-) | 0.853 | 0.711 | gcccggcctgTAAGCagct V$VMAF_01 | 8581 (-) | 0.607 | 0.727 | tggggcaaagGCAACatct V$VMAF_01 | 9033 (-) | 0.732 | 0.704 | gagttcaaggGCAGCctgg V$VMAF_01 | 9042 (-) | 0.676 | 0.738 | ggcagcctggTCTACatgg V$VMAF_01 | 9071 (-) | 0.732 | 0.767 | gctagccaggGCAGCatag V$VMAF_01 | 9265 (-) | 0.732 | 0.742 | cagggcagcgACAGCcacc V$VMAF_01 | 9284 (-) | 1.000 | 0.748 | aagctgagagTCAGCttgt V$VMAF_01 | 9377 (-) | 1.000 | 0.750 | gggagagctgTCAGCccct V$VMAF_01 | 9546 (-) | 0.671 | 0.728 | tcctgcgtagTCGCCtgtc V$VMAF_01 | 9583 
(-) | 0.763 | 0.738 | tatagccctgTAATCccag V$VMAF_01 | 9608 (+) | 1.000 | 0.742 | ggaaGCTGAggtgggagga V$VMAF_01 | 9631 (-) | 1.000 | 0.861 | gagttcaaggTCAGCttgg V$VMAF_01 | 10030 (-) | 0.870 | 0.709 | agtttgtgagTCACCgtgt V$VMAF_01 | 10269 (-) | 0.848 | 0.749 | ttttgccggcTGAGCaatc V$VMAF_01 | 10278 (-) | 0.910 | 0.752 | ctgagcaatcTCATCaggc V$VMAF_01 | 10422 (-) | 0.712 | 0.708 | taacactaagTCTTCgcaa V$VMAF_01 | 10628 (+) | 0.648 | 0.704 | agaaAGTGActgaggaaga V$VMAF_01 | 10814 (-) | 0.789 | 0.728 | taaatcgcagTCCGCcttt V$VMAF_01 | 11047 (-) | 0.718 | 0.723 | ggttcctttgTTAACgttg V$VMAF_01 | 11052 (+) | 0.718 | 0.709 | ctttGTTAAcgttgtgatc V$VMAF_01 | 11056 (-) | 0.758 | 0.729 | gttaacgttgTGATCagtt V$VMAF_01 | 11150 (+) | 0.910 | 0.725 | ctctGATGAaaatgaagct V$VMAF_01 | 11312 (+) | 0.870 | 0.759 | ctacGGTGAcgacgatgat V$VMAF_01 | 11315 (+) | 0.712 | 0.708 | cggtGACGAcgatgatgat V$VMAF_01 | 11324 (+) | 0.910 | 0.717 | cgatGATGAtgacgatgta V$VMAF_01 | 11327 (+) | 0.910 | 0.849 | tgatGATGAcgatgtaaat V$VMAF_01 | 11450 (+) | 0.870 | 0.729 | ttttGGTGAatcagtgaaa V$VMAF_01 | 11696 (-) | 0.870 | 0.757 | ttaatcactgTCACCccaa F$GCN4_01 | 1345 (-) | 0.829 | 0.847 | aaggcttgcctgGGTCAcctgatgctc F$GCN4_01 | 3925 (-) | 1.000 | 0.856 | aattacattatcAGTCActtgttcatt F$GCN4_01 | 4524 (-) | 0.829 | 0.856 | agacaacaaatgCGTCAacagcctctt F$GCN4_01 | 6407 (+) | 1.000 | 0.847 | acgctcagaaTGACTcctctggctttc F$GCN4_01 | 6547 (-) | 1.000 | 0.849 | ggctgaggtaagAGTCAtggttgccac F$GCN4_01 | 8314 (-) | 0.829 | 0.846 | ctgcagcttctgCGTCAgcgccagtat F$GCN4_01 | 9312 (-) | 0.829 | 0.852 | agggaggcgttgGGTCAgagcctctgg F$GCN4_01 | 10026 (-) | 1.000 | 0.971 | ccgtagtttgtgAGTCAccgtgtgggt V$CREB_01 | 2966 (+) | 1.000 | 0.879 | TGACGgca V$CREB_01 | 8612 (-) | 1.000 | 0.869 | ttcCGTCA V$CREBP1_01 | 3440 (-) | 0.831 | 0.891 | ttaGGTAA V$CREBP1_01 | 4273 (+) | 0.831 | 0.891 | TTACCtaa V$CREBP1CJUN_01 | 45 (-) | 0.848 | 0.781 | agATGTCa V$CREBP1CJUN_01 | 1239 (+) | 0.848 | 0.891 | tGACATca V$CREBP1CJUN_01 | 1239 (-) | 0.780 | 
0.842 | tgACATCa V$CREBP1CJUN_01 | 1416 (+) | 0.780 | 0.821 | tGCCGTga V$CREBP1CJUN_01 | 1446 (+) | 0.780 | 0.842 | tGAAGTca V$CREBP1CJUN_01 | 1446 (-) | 0.840 | 0.885 | tgAAGTCa V$CREBP1CJUN_01 | 2966 (+) | 0.780 | 0.842 | tGACGGca V$CREBP1CJUN_01 | 2966 (-) | 0.780 | 0.842 | tgACGGCa V$CREBP1CJUN_01 | 3291 (+) | 0.780 | 0.821 | tGAGGTga V$CREBP1CJUN_01 | 4334 (+) | 0.848 | 0.778 | tGACATcc V$CREBP1CJUN_01 | 4864 (-) | 0.844 | 0.776 | ggAGGTCa V$CREBP1CJUN_01 | 6550 (+) | 0.780 | 0.817 | tGAGGTaa V$CREBP1CJUN_01 | 7527 (+) | 0.780 | 0.842 | tGAAGTta V$CREBP1CJUN_01 | 7582 (-) | 0.844 | 0.866 | tcAGGTCa V$CREBP1CJUN_01 | 7882 (-) | 0.840 | 0.776 | agAAGTCa V$CREBP1CJUN_01 | 8211 (+) | 0.844 | 0.888 | tGACCTca V$CREBP1CJUN_01 | 8211 (-) | 0.780 | 0.842 | tgACCTCa V$CREBP1CJUN_01 | 8266 (+) | 0.844 | 0.866 | tGACCTga V$CREBP1CJUN_01 | 8612 (-) | 0.780 | 0.817 | ttCCGTCa V$CREBP1CJUN_01 | 8731 (-) | 1.000 | 0.862 | taACGTCg V$CREBP1CJUN_01 | 9816 (-) | 0.780 | 0.842 | taACTTCa V$CREBP1CJUN_01 | 9844 (-) | 0.840 | 0.885 | taAAGTCa V$CREBP1CJUN_01 | 10502 (-) | 0.840 | 0.776 | aaAAGTCa V$CREBP1CJUN_01 | 10659 (-) | 0.848 | 0.891 | taATGTCa V$CREBP1CJUN_01 | 11783 (+) | 0.780 | 0.817 | tGATGTaa V$SOX5_01 | 947 (+) | 1.000 | 0.992 | ggaACAATga V$SOX5_01 | 3143 (+) | 1.000 | 0.996 | acaACAATat V$SOX5_01 | 7331 (+) | 1.000 | 0.995 | ggaACAATag V$SOX5_01 | 7826 (+) | 1.000 | 0.994 | caaACAATta V$SOX5_01 | 8556 (+) | 1.000 | 0.995 | tgaACAATgt V$SOX5_01 | 9744 (+) | 1.000 | 0.874 | catACAATat V$SOX5_01 | 9780 (+) | 1.000 | 0.991 | agaACAATtt V$SOX5_01 | 10149 (-) | 1.000 | 0.996 | ttATTGTtgt V$SOX5_01 | 10341 (-) | 1.000 | 0.997 | ttATTGTttt V$SOX5_01 | 10459 (+) | 1.000 | 0.996 | tcaACAATgc V$SOX5_01 | 10855 (-) | 1.000 | 0.872 | ccATTGTaga I$DL_01 | 330 (+) | 0.906 | 0.816 | tggaaATTCCc I$DL_01 | 330 (-) | 1.000 | 0.924 | tGGAAAttccc I$DL_01 | 1667 (+) | 0.906 | 0.801 | tggagATTCCt I$DL_01 | 1716 (+) | 1.000 | 0.773 | tgcctTTTCCg I$DL_01 | 1825 (-) | 0.897 | 0.764 | 
gGGAAGacacc I$DL_01 | 1953 (+) | 1.000 | 0.779 | ggaacTTTCCt I$DL_01 | 3048 (-) | 0.766 | 0.763 | gGGACAtacct I$DL_01 | 3154 (+) | 0.897 | 0.797 | tggagGTTCCa I$DL_01 | 3321 (+) | 1.000 | 0.825 | ggcctTTTCCt I$DL_01 | 4613 (-) | 0.766 | 0.809 | aGGTAAaacca I$DL_01 | 4637 (-) | 0.806 | 0.854 | aGCAAAatccc I$DL_01 | 4713 (+) | 0.671 | 0.771 | gggtaATTCAc I$DL_01 | 4884 (-) | 0.897 | 0.774 | tGGAACattcc I$DL_01 | 4949 (+) | 0.766 | 0.782 | gggacTTTCTg I$DL_01 | 5164 (+) | 0.766 | 0.782 | gggacTTTCAc I$DL_01 | 6135 (+) | 0.897 | 0.763 | ggcgtCTTCCa I$DL_01 | 6204 (+) | 0.671 | 0.768 | ggggtATGCCa I$DL_01 | 6708 (-) | 0.766 | 0.773 | gAGAAAagcca I$DL_01 | 6744 (-) | 0.766 | 0.766 | tTGAAAaccca I$DL_01 | 6795 (+) | 0.897 | 0.845 | tggctCTTCCt I$DL_01 | 6879 (-) | 0.897 | 0.829 | aGGAAGcacca I$DL_01 | 7001 (-) | 1.000 | 0.772 | tGGAAAaggca I$DL_01 | 7015 (+) | 1.000 | 0.809 | ggatgTTTCCt I$DL_01 | 8700 (-) | 0.806 | 0.791 | tGAAAAtacca I$DL_01 | 9123 (-) | 0.766 | 0.804 | cGGGAAaacct I$DL_01 | 9124 (-) | 1.000 | 0.761 | gGGAAAacctt I$DL_01 | 9580 (-) | 0.671 | 0.773 | tGGTATagccc I$DL_01 | 10640 (-) | 0.897 | 0.761 | aGGAAGacacc I$DL_01 | 10691 (-) | 0.766 | 0.810 | tGGCAAcaccc I$DL_01 | 10741 (+) | 0.766 | 0.767 | gggggTTTCAg I$DL_01 | 10846 (-) | 0.897 | 0.805 | aGGAAGaagcc I$DL_01 | 11087 (+) | 0.806 | 0.815 | gggaaTTTTCc I$DL_01 | 11088 (+) | 1.000 | 0.832 | ggaatTTTCCa I$DL_01 | 11477 (+) | 1.000 | 0.811 | ttgttTTTCCc I$SN_02 | 32 (-) | 0.972 | 0.930 | tgtgAACAActtca I$SN_02 | 440 (-) | 0.972 | 0.951 | tttaATCAGgttgt I$SN_02 | 577 (+) | 1.000 | 0.909 | tctacCTGTTgaat I$SN_02 | 636 (-) | 0.953 | 0.908 | tttaAACATttatt I$SN_02 | 731 (-) | 0.924 | 0.905 | agccACCACgtggt I$SN_02 | 735 (+) | 0.924 | 0.908 | accacGTGGTtgct I$SN_02 | 836 (-) | 0.972 | 0.886 | ataaAACAAgttaa I$SN_02 | 1195 (+) | 0.953 | 0.900 | gaaagCTGGTtata I$SN_02 | 1358 (+) | 0.972 | 0.886 | gtcacCTGATgctc I$SN_02 | 1523 (+) | 1.000 | 0.889 | tagagCTGTTtttt I$SN_02 | 1966 (-) | 0.972 | 0.933 | 
tttgAGCAGgttgg I$SN_02 | 2542 (-) | 1.000 | 0.893 | gaatAACAGgtttc I$SN_02 | 2579 (-) | 0.972 | 0.921 | ttacAACAAatgca I$SN_02 | 3442 (-) | 1.000 | 0.916 | aggtAACAGttaat I$SN_02 | 3641 (+) | 0.905 | 0.912 | accacATGGTggaa I$SN_02 | 3938 (+) | 0.972 | 0.895 | gtcacTTGTTcatt I$SN_02 | 4118 (+) | 0.953 | 0.921 | cccagATGTTaaaa I$SN_02 | 4524 (-) | 0.972 | 0.904 | agacAACAAatgcg I$SN_02 | 4811 (+) | 0.972 | 0.939 | ggcatTTGTTacca I$SN_02 | 5054 (-) | 0.972 | 0.951 | tctaAGCAGgtagt I$SN_02 | 5123 (-) | 1.000 | 0.945 | aggaAACAGatggt I$SN_02 | 5252 (+) | 0.924 | 0.902 | cacatGTGGTcaca I$SN_02 | 6064 (+) | 0.943 | 0.935 | tgcacGTGCTgcca I$SN_02 | 6596 (+) | 0.924 | 0.890 | ataagGTGGTgcta I$SN_02 | 6654 (-) | 1.000 | 0.958 | tgcaAACAGatgaa I$SN_02 | 6920 (+) | 0.943 | 0.907 | aggaaTTGATtgta I$SN_02 | 7346 (-) | 0.953 | 0.908 | tttcAACATgttaa I$SN_02 | 7391 (+) | 0.924 | 0.897 | tccagGTGGTagaa I$SN_02 | 7426 (+) | 0.972 | 0.902 | tttatTTGTTttta I$SN_02 | 7586 (-) | 0.972 | 0.897 | gtcaAACAAatgct I$SN_02 | 7590 (+) | 0.924 | 0.899 | aacaaATGCTctga I$SN_02 | 7922 (-) | 0.943 | 0.884 | aggtATCAAttcgt I$SN_02 | 8181 (+) | 0.972 | 0.927 | agaacTTGTTtgct I$SN_02 | 8417 (-) | 0.972 | 0.958 | tgtaAGCAGctgtt I$SN_02 | 8421 (+) | 1.000 | 0.904 | agcagCTGTTctag I$SN_02 | 8521 (+) | 0.924 | 0.894 | cacagATGCTtcaa I$SN_02 | 8764 (-) | 0.943 | 0.918 | tcccAGCACctggg I$SN_02 | 8965 (-) | 0.953 | 0.880 | gagaACCAGgtgtg I$SN_02 | 8996 (-) | 0.943 | 0.906 | ttccAGCACatgga I$SN_02 | 9536 (-) | 0.972 | 0.906 | aacaAACAAgtcct I$SN_02 | 9596 (-) | 0.924 | 0.901 | tcccAGCATttggg I$SN_02 | 10308 (+) | 1.000 | 0.923 | tttaaCTGTTttta I$SN_02 | 10374 (+) | 0.953 | 0.899 | tcaaaATGTTgcaa I$SN_02 | 10995 (-) | 0.972 | 0.924 | tggaATCAGattac I$SN_02 | 11064 (-) | 0.972 | 0.921 | tgtgATCAGttaca I$SN_02 | 11208 (+) | 0.972 | 0.909 | ttaatTTGTTtgaa I$SN_02 | 11740 (-) | 0.924 | 0.888 | tctcACCAAatgac I$SN_02 | 11777 (+) | 0.943 | 0.894 | gatacTTGATgtaa F$GCR1_01 | 4963 (-) | 1.000 | 0.934 | caGGAAGtc 
F$GCR1_01 | 5732 (-) | 1.000 | 0.967 | ctGGAAGcc F$GCR1_01 | 6276 (-) | 1.000 | 0.956 | aaGGAAGcc F$GCR1_01 | 8158 (+) | 1.000 | 0.935 | agCTTCCag F$GCR1_01 | 9436 (-) | 1.000 | 0.967 | caGGAAGcc F$GCR1_01 | 10639 (-) | 1.000 | 0.967 | gaGGAAGac F$ADR1_01 | 105 (+) | 1.000 | 0.996 | gGGGGT F$ADR1_01 | 972 (-) | 1.000 | 0.997 | ACCCCt F$ADR1_01 | 1187 (-) | 1.000 | 0.996 | ACCCCc F$ADR1_01 | 1612 (+) | 1.000 | 0.996 | gGGGGT F$ADR1_01 | 1695 (+) | 1.000 | 0.997 | aGGGGT F$ADR1_01 | 1839 (+) | 1.000 | 1.000 | tGGGGT F$ADR1_01 | 2428 (+) | 1.000 | 0.997 | aGGGGT F$ADR1_01 | 2641 (-) | 1.000 | 0.997 | ACCCCt F$ADR1_01 | 4827 (+) | 1.000 | 0.999 | cGGGGT F$ADR1_01 | 6000 (+) | 1.000 | 1.000 | tGGGGT F$ADR1_01 | 6203 (+) | 1.000 | 0.999 | cGGGGT F$ADR1_01 | 7197 (+) | 1.000 | 0.996 | gGGGGT F$ADR1_01 | 7957 (-) | 1.000 | 1.000 | ACCCCa F$ADR1_01 | 8359 (-) | 1.000 | 0.997 | ACCCCt F$ADR1_01 | 8485 (+) | 1.000 | 0.996 | gGGGGT F$ADR1_01 | 8737 (+) | 1.000 | 0.999 | cGGGGT F$ADR1_01 | 9342 (-) | 1.000 | 0.997 | ACCCCt F$ADR1_01 | 9687 (+) | 1.000 | 0.996 | gGGGGT F$ADR1_01 | 10741 (+) | 1.000 | 0.996 | gGGGGT F$ADR1_01 | 11616 (+) | 1.000 | 0.997 | aGGGGT F$ADR1_01 | 11708 (-) | 1.000 | 1.000 | ACCCCa F$ADR1_01 | 11824 (+) | 1.000 | 1.000 | tGGGGT F$ADR1_01 | 11846 (+) | 1.000 | 0.996 | gGGGGT F$ADR1_01 | 11851 (+) | 1.000 | 1.000 | tGGGGT F$ADR1_01 | 11859 (+) | 1.000 | 0.996 | gGGGGT V$NFKAPPAB50_01 | 105 (+) | 0.856 | 0.804 | GGGGGtggcc V$NFKAPPAB50_01 | 331 (+) | 0.630 | 0.790 | GGAAAttccc V$NFKAPPAB50_01 | 331 (-) | 0.864 | 0.845 | ggaaaTTCCC V$NFKAPPAB50_01 | 1183 (+) | 0.845 | 0.828 | GGGCAccccc V$NFKAPPAB50_01 | 1183 (-) | 0.856 | 0.893 | gggcaCCCCC V$NFKAPPAB50_01 | 1202 (-) | 0.845 | 0.795 | ggttaTACCC V$NFKAPPAB50_01 | 3393 (+) | 0.766 | 0.781 | GAGGAtctcc V$NFKAPPAB50_01 | 6205 (+) | 0.845 | 0.780 | GGGTAtgcca V$NFKAPPAB50_01 | 10114 (-) | 1.000 | 0.780 | tgccaTCCCC V$NFKAPPAB65_01 | 330 (+) | 0.862 | 0.900 | tggaaATTCC V$NFKAPPAB65_01 | 331 (-) | 1.000 | 1.000 
| GGAAAttccc V$NFKAPPAB65_01 | 1667 (+) | 0.862 | 0.891 | tggagATTCC V$CREL_01 | 331 (-) | 1.000 | 0.990 | GGAAAttccc V$NFKAPPAB_01 | 331 (-) | 1.000 | 1.000 | ggaaaTTCCC V$NFKAPPAB_01 | 3048 (+) | 0.986 | 0.930 | GGGACatacc V$NMYC_01 | 734 (+) | 1.000 | 0.993 | caccACGTGgtt V$NMYC_01 | 734 (-) | 1.000 | 0.993 | cacCACGTggtt V$NMYC_01 | 1125 (-) | 1.000 | 0.988 | atgCACGTgtgt V$NMYC_01 | 6063 (+) | 1.000 | 0.986 | ctgcACGTGctg V$NMYC_01 | 6063 (-) | 1.000 | 0.987 | ctgCACGTgctg V$MYOGNF1_01 | 110 (+) | 0.900 | 0.622 | tggccagtgaggCTGGCtaactgctcagt V$MYOGNF1_01 | 306 (+) | 0.616 | 0.626 | gagctatctctcCTAGTcctgtgctggaa V$MYOGNF1_01 | 415 (-) | 0.728 | 0.635 | gggtgagcttgtCCAAAataccacgttta V$MYOGNF1_01 | 451 (+) | 1.000 | 0.623 | tgtctgttttggTTGGCttatttgacact V$MYOGNF1_01 | 710 (+) | 0.779 | 0.708 | atctctttacagATGGTtgtgagccacca V$MYOGNF1_01 | 720 (-) | 0.851 | 0.623 | agatggttgtgaGCCACcacgtggttgct V$MYOGNF1_01 | 857 (+) | 0.715 | 0.629 | cgatcgttgcccTTAGAaatgcacaaaca V$MYOGNF1_01 | 873 (-) | 0.715 | 0.628 | aaatgcacaaacACGAAatgggacagact V$MYOGNF1_01 | 991 (+) | 0.762 | 0.665 | aataattttcatGTGGGgacgcacctata V$MYOGNF1_01 | 1361 (+) | 0.817 | 0.671 | acctgatgctcgTTTGCagtgggcgaggt V$MYOGNF1_01 | 1371 (-) | 0.687 | 0.679 | cgtttgcagtggGCGAGgtagggcaggtg V$MYOGNF1_01 | 2306 (+) | 0.929 | 0.675 | tgtctgtgtgttTTGGAtacagtcttgct V$MYOGNF1_01 | 2988 (-) | 0.929 | 0.672 | tacttccgctccTCCAAggtgcgcaggcg V$MYOGNF1_01 | 3225 (-) | 1.000 | 0.638 | ttttaccagtcaGCCAAcattcccagccc V$MYOGNF1_01 | 3273 (+) | 0.911 | 0.673 | atgccattatgaTTGGGatgaggtgaaca V$MYOGNF1_01 | 3307 (+) | 1.000 | 0.632 | atgtagttaagaTTGGCcttttcctgatt V$MYOGNF1_01 | 3388 (-) | 0.829 | 0.637 | gccctgaggatcTCCAGttataaaagtta V$MYOGNF1_01 | 3588 (+) | 0.911 | 0.651 | actctcctatgtTTGGGtttctgcagttt V$MYOGNF1_01 | 3667 (+) | 0.668 | 0.675 | cacagacttagaATTGCcaagtgccaaga V$MYOGNF1_01 | 3677 (-) | 1.000 | 0.705 | gaattgccaagtGCCAAgaacacaagaga V$MYOGNF1_01 | 3755 (+) | 0.637 | 0.623 | agtcattcttaaAAGGCaatgctccttcc V$MYOGNF1_01 
| 4129 (+) | 0.715 | 0.636 | aaacccagaatcTAGGTgtggtgcctccc V$MYOGNF1_01 | 4263 (-) | 0.715 | 0.667 | gttaggatgcttACCTAagcacacattag V$MYOGNF1_01 | 4542 (+) | 0.746 | 0.648 | cagcctcttagtTTTGAtgctggacacag V$MYOGNF1_01 | 4718 (+) | 1.000 | 0.623 | attcactgtgatTTGGCcctcgctgatgg V$MYOGNF1_01 | 4731 (+) | 0.779 | 0.673 | tggccctcgctgATGGTcctctcccctct V$MYOGNF1_01 | 4890 (+) | 1.000 | 0.702 | attccatgttccTTGGCcaagtgcatgct V$MYOGNF1_01 | 4969 (-) | 0.698 | 0.657 | gtcaggctcagcCCCTAcctgtccgaatg V$MYOGNF1_01 | 5106 (-) | 0.746 | 0.719 | atttgtgagtgcTCAAAaggaaacagatg V$MYOGNF1_01 | 5145 (-) | 0.715 | 0.716 | cctaggcagtttACGAAaagggactttca V$MYOGNF1_01 | 5630 (-) | 0.665 | 0.674 | cactgggataaaGACAAaataaaaagaag V$MYOGNF1_01 | 5726 (-) | 0.851 | 0.625 | actaaactggaaGCCATaatatagacgca V$MYOGNF1_01 | 5858 (+) | 0.779 | 0.631 | agttggtttcccATGGAccatctcctgct V$MYOGNF1_01 | 5874 (+) | 0.817 | 0.732 | accatctcctgcTGGGCatggggcccgcc V$MYOGNF1_01 | 5879 (+) | 0.762 | 0.633 | ctcctgctgggcATGGGgcccgcccctaa V$MYOGNF1_01 | 5884 (-) | 0.718 | 0.679 | gctgggcatgggGCCCGcccctaagagta V$MYOGNF1_01 | 6169 (-) | 0.746 | 0.665 | cttctgcagggtTCCCAgagacacccatt V$MYOGNF1_01 | 6575 (-) | 0.746 | 0.663 | gttatgcaaaacACAAAacaaataaggtg V$MYOGNF1_01 | 6717 (+) | 0.597 | 0.630 | cacaacctttaaAGGGTgagcagcccttt V$MYOGNF1_01 | 6931 (-) | 0.911 | 0.641 | gtatgtgcagtaCCCAAgtgatactgtcc V$MYOGNF1_01 | 7336 (-) | 0.597 | 0.626 | aatagacacattTCAACatgttaatcatg V$MYOGNF1_01 | 7379 (-) | 0.829 | 0.672 | atattgcttgtgTCCAGgtggtagaatta V$MYOGNF1_01 | 7490 (+) | 0.929 | 0.693 | cgtctgttcttcTTGGAgggtgtcagatg V$MYOGNF1_01 | 7526 (+) | 0.929 | 0.813 | ctgaagttacagTTGGTtgtgagccaact V$MYOGNF1_01 | 7536 (-) | 1.000 | 0.626 | agttggttgtgaGCCAActgtgtagatgc V$MYOGNF1_01 | 7937 (+) | 0.746 | 0.696 | tgtgtgtttagtTTTGAgataccccagac V$MYOGNF1_01 | 7947 (-) | 0.811 | 0.633 | gttttgagatacCCCAGactatagttgtt V$MYOGNF1_01 | 8332 (+) | 0.829 | 0.636 | cgccagtatttgCTGGAccacttgccaac V$MYOGNF1_01 | 8426 (+) | 0.817 | 0.660 | 
ctgttctagagcTTTGCaacccgcatcct V$MYOGNF1_01 | 8826 (-) | 0.929 | 0.677 | tcgatgttcaaaACCAAaacagaacggtt V$MYOGNF1_01 | 8827 (-) | 0.728 | 0.628 | cgatgttcaaaaCCAAAacagaacggttt V$MYOGNF1_01 | 9045 (+) | 0.851 | 0.623 | agcctggtctacATGGCaagttccaggct V$MYOGNF1_01 | 9309 (+) | 0.911 | 0.667 | aggagggaggcgTTGGGtcagagcctctg V$MYOGNF1_01 | 9364 (+) | 0.597 | 0.623 | gttctctatcacAGGGAgagctgtcagcc V$MYOGNF1_01 | 9389 (+) | 0.779 | 0.702 | agcccctggaatGTGGTtctatgtctaga V$MYOGNF1_01 | 9838 (+) | 0.687 | 0.647 | taattttaaagtCAGGCaggcagcattaa V$MYOGNF1_01 | 9903 (+) | 0.829 | 0.718 | aagacttacttaCTGGAtttttgcctatg V$MYOGNF1_01 | 9918 (+) | 0.746 | 0.655 | gatttttgcctaTGGGTaatttgcatgta V$MYOGNF1_01 | 10481 (-) | 0.646 | 0.622 | accagccacataTCAAGgatcaaaagtca V$MYOGNF1_01 | 10501 (+) | 0.851 | 0.627 | caaaagtcaattATGGCtgtagccattat V$MYOGNF1_01 | 10645 (-) | 0.698 | 0.626 | gacacccagtgtCCTAAtgtcaacaggag V$MYOGNF1_01 | 10753 (+) | 0.817 | 0.636 | gatccgtgtgttTGGGCactagaactctg V$MYOGNF1_01 | 10878 (-) | 0.634 | 0.663 | ggtgggagcactGCACAtgcgcagagacg V$MYOGNF1_01 | 10958 (+) | 0.929 | 0.623 | ggtctgtgcctgTTGGAgcaaggagcctg V$MYOGNF1_01 | 11083 (-) | 0.779 | 0.664 | ctctgggaatttTCCACtgtttaaagaac V$MYOGNF1_01 | 11234 (-) | 0.929 | 0.656 | ttctaactggagTCCAAaactctgtggtg V$MYOGNF1_01 | 11252 (-) | 1.000 | 0.630 | actctgtggtgtGCCAAgaaaaggaaatt V$MYOGNF1_01 | 11279 (-) | 0.817 | 0.699 | ttaaggcataggGCAAAgaaataataata V$MYOGNF1_01 | 11415 (+) | 0.787 | 0.706 | ctgcatttctatTTAGCtatccgcatact V$MYOGNF1_01 | 11476 (+) | 0.929 | 0.622 | tttgtttttcccTTGGAatgcggttgaat V$MYOGNF1_01 | 11574 (-) | 0.829 | 0.645 | tcattgcttataTCCAGgagtgagactag V$MYOGNF1_01 | 11761 (-) | 0.829 | 0.623 | agttaacaggatTCCAGatacttgatgta V$MYOGNF1_01 | 11787 (-) | 0.811 | 0.674 | gtaaggacaaatCCCAGattggaaggtgg V$MYOGNF1_01 | 11793 (+) | 0.929 | 0.662 | acaaatcccagaTTGGAaggtggcccagg V$MYOGNF1_01 | 11845 (+) | 0.762 | 0.626 | ggggggtggggtATGGGggtaggtccggg V$MYOGNF1_01 | 11915 (-) | 0.817 | 0.651 | ttcgggcccaacGCCCAttggctgaaact 
V$YY1_01 | 522 (-) | 0.922 | 0.925 | ttttaaaaATGATgggt V$YY1_01 | 714 (-) | 1.000 | 0.963 | ctttacagATGGTtgtg V$YY1_01 | 798 (+) | 0.998 | 0.955 | ctgaGCCATttctccag V$YY1_01 | 882 (-) | 0.996 | 0.944 | aacacgaaATGGGacag V$YY1_01 | 1173 (-) | 0.996 | 0.949 | gaaaggagATGGGcacc V$YY1_01 | 1658 (-) | 1.000 | 0.970 | cttggcaaATGGAgatt V$YY1_01 | 1883 (+) | 0.996 | 0.975 | gcctCCCATtttatatt V$YY1_01 | 2610 (+) | 0.998 | 0.940 | aagaGCCATgtgtatgc V$YY1_01 | 2833 (-) | 0.998 | 0.963 | gtctgcaaATGGCatca V$YY1_01 | 3145 (-) | 1.000 | 0.974 | aacaatatATGGAggtt V$YY1_01 | 3300 (+) | 1.000 | 0.980 | catcTCCATgtagttaa V$YY1_01 | 3375 (-) | 0.996 | 0.956 | ctaatcatATGGGgccc V$YY1_01 | 3638 (-) | 1.000 | 0.938 | cctaccacATGGTggaa V$YY1_01 | 3836 (+) | 1.000 | 0.920 | aattTCCATtctcataa V$YY1_01 | 3859 (+) | 1.000 | 0.982 | cagtACCATttaaggag V$YY1_01 | 3983 (+) | 1.000 | 0.966 | agacACCATctcatgaa V$YY1_01 | 4106 (+) | 1.000 | 0.990 | aacaTCCATtttcccag V$YY1_01 | 4151 (+) | 0.996 | 0.950 | gcctCCCATttgtaatc V$YY1_01 | 4888 (+) | 1.000 | 0.973 | acatTCCATgttccttg V$YY1_01 | 5124 (-) | 1.000 | 0.942 | ggaaacagATGGTgcgg V$YY1_01 | 5237 (-) | 0.996 | 0.954 | acgattacATGGGttca V$YY1_01 | 5285 (-) | 0.996 | 0.932 | aggctcagATGGGctag V$YY1_01 | 5303 (-) | 0.998 | 0.979 | catccaaaATGGCttct V$YY1_01 | 5443 (+) | 0.998 | 0.969 | tattGCCATataacctt V$YY1_01 | 5870 (+) | 1.000 | 0.938 | atggACCATctcctgct V$YY1_01 | 6032 (+) | 0.996 | 0.937 | aggaCCCATctggtgca V$YY1_01 | 6188 (+) | 0.996 | 0.960 | gacaCCCATttaagccg V$YY1_01 | 6258 (+) | 0.996 | 0.947 | ttgtCCCATctgccgaa V$YY1_01 | 6992 (-) | 1.000 | 0.955 | gtgaggagATGGAaaag V$YY1_01 | 7171 (-) | 1.000 | 0.943 | cgttgcagATGGTcccg V$YY1_01 | 7508 (-) | 0.998 | 0.943 | ggtgtcagATGGCctga V$YY1_01 | 7608 (+) | 0.998 | 0.951 | ctgaGCCATctctccag V$YY1_01 | 8572 (-) | 0.996 | 0.968 | ttgattaaATGGGgcaa V$YY1_01 | 8997 (-) | 1.000 | 0.944 | tccagcacATGGAggca V$YY1_01 | 9049 (-) | 0.998 | 0.956 | tggtctacATGGCaagt V$YY1_01 | 9512 (+) | 0.996 | 0.938 | 
tttcCCCATgtccaaca V$YY1_01 | 10103 (+) | 0.998 | 0.974 | ctgaGCCATcttgccat V$YY1_01 | 10384 (+) | 1.000 | 0.979 | gcaaTCCATataaaaat V$YY1_01 | 11342 (+) | 0.998 | 0.917 | aaatGCCATtatcgaac V$YY1_01 | 11536 (-) | 1.000 | 0.935 | tcccacatATGGTagat V$YY1_01 | 11543 (-) | 0.996 | 0.966 | tatggtagATGGGtatt V$YY1_01 | 11720 (+) | 1.000 | 0.982 | ctccACCATtttctcaa I$SN_01 | 1785 (-) | 1.000 | 0.948 | gagcACCTGccat I$SN_01 | 8502 (+) | 1.000 | 0.983 | gggaCAGGTgcgg I$SN_01 | 11960 (-) | 1.000 | 0.939 | ctgcACCTGagct V$IRF1_01 | 224 (-) | 0.978 | 0.879 | tttttcctTTTTG V$IRF1_01 | 900 (-) | 0.811 | 0.861 | cttttcacTTTCT V$IRF1_01 | 1956 (-) | 0.934 | 0.882 | actttcctTTTTT V$IRF1_01 | 2508 (+) | 0.934 | 0.882 | AAAAAgggaaaaa V$IRF1_01 | 5942 (+) | 1.000 | 0.858 | GAAAAgtaaaatt V$IRF1_01 | 7848 (+) | 0.978 | 0.869 | CAAAAgcgacagc V$IRF1_01 | 8834 (+) | 0.978 | 0.857 | CAAAAccaaaaca V$IRF1_01 | 9104 (+) | 0.978 | 0.873 | CAAAAgcaaaaca F$PHO4_01 | 1125 (+) | 1.000 | 0.979 | atgCACGTgtgt V$TAL1BETAE47_01 | 39 (+) | 1.000 | 0.877 | aacttCAGATgtcatt V$TAL1BETAE47_01 | 714 (+) | 1.000 | 0.972 | ctttaCAGATggttgt V$TAL1BETAE47_01 | 1355 (-) | 0.894 | 0.809 | tgggtcACCTGatgct V$TAL1BETAE47_01 | 1358 (+) | 0.780 | 0.794 | gtcacCTGATgctcgt V$TAL1BETAE47_01 | 1707 (-) | 0.780 | 0.791 | agcaccCTCTGccttt V$TAL1BETAE47_01 | 1783 (-) | 0.894 | 0.837 | tggagcACCTGccatt V$TAL1BETAE47_01 | 1857 (-) | 0.885 | 0.814 | agagccAACTGaagaa V$TAL1BETAE47_01 | 2823 (+) | 0.885 | 0.814 | ctgtcCAGTTgtctgc V$TAL1BETAE47_01 | 2967 (+) | 1.000 | 0.847 | gacggCAGATgcgcag V$TAL1BETAE47_01 | 3638 (+) | 0.780 | 0.828 | cctacCACATggtgga V$TAL1BETAE47_01 | 3697 (+) | 0.780 | 0.807 | cacaaGAGATgcttga V$TAL1BETAE47_01 | 3984 (-) | 0.780 | 0.815 | gacaccATCTCatgaa V$TAL1BETAE47_01 | 4115 (+) | 1.000 | 0.939 | tttccCAGATgttaaa V$TAL1BETAE47_01 | 4336 (+) | 0.894 | 0.883 | acatcCAGGTggtgga V$TAL1BETAE47_01 | 4358 (+) | 0.780 | 0.802 | aggatCAGAAgtttaa V$TAL1BETAE47_01 | 4563 (+) | 0.885 | 0.791 | ggacaCAGTTgcggtt 
V$TAL1BETAE47_01 | 4735 (+) | 0.780 | 0.793 | cctcgCTGATggtcct V$TAL1BETAE47_01 | 5124 (+) | 1.000 | 0.994 | ggaaaCAGATggtgcg V$TAL1BETAE47_01 | 5285 (+) | 1.000 | 0.884 | aggctCAGATgggcta V$TAL1BETAE47_01 | 5616 (-) | 0.780 | 0.799 | accaacATCTAataca V$TAL1BETAE47_01 | 5871 (-) | 0.780 | 0.816 | tggaccATCTCctgct V$TAL1BETAE47_01 | 5980 (-) | 0.780 | 0.787 | gataacTTCTGgatta V$TAL1BETAE47_01 | 6033 (-) | 1.000 | 0.915 | ggacccATCTGgtgca V$TAL1BETAE47_01 | 6259 (-) | 1.000 | 0.881 | tgtcccATCTGccgaa V$TAL1BETAE47_01 | 6493 (+) | 1.000 | 0.843 | gattgCAGATgcggcc V$TAL1BETAE47_01 | 6655 (+) | 1.000 | 0.895 | gcaaaCAGATgaagat V$TAL1BETAE47_01 | 6842 (-) | 0.885 | 0.850 | actgccAACTGgtgag V$TAL1BETAE47_01 | 7025 (+) | 0.882 | 0.786 | tgcatCAGCTgaagag V$TAL1BETAE47_01 | 7025 (-) | 0.882 | 0.827 | tgcatcAGCTGaagag V$TAL1BETAE47_01 | 7171 (+) | 1.000 | 0.946 | cgttgCAGATggtccc V$TAL1BETAE47_01 | 7388 (+) | 0.894 | 0.887 | gtgtcCAGGTggtaga V$TAL1BETAE47_01 | 7508 (+) | 1.000 | 0.897 | ggtgtCAGATggcctg V$TAL1BETAE47_01 | 7530 (+) | 0.885 | 0.892 | agttaCAGTTggttgt V$TAL1BETAE47_01 | 7545 (-) | 0.885 | 0.841 | tgagccAACTGtgtag V$TAL1BETAE47_01 | 7587 (+) | 0.780 | 0.808 | tcaaaCAAATgctctg V$TAL1BETAE47_01 | 7677 (+) | 0.882 | 0.812 | gctcaCAGCTgtaaag V$TAL1BETAE47_01 | 7677 (-) | 0.882 | 0.811 | gctcacAGCTGtaaag V$TAL1BETAE47_01 | 8237 (+) | 1.000 | 0.856 | acaccCAGATgaaggt V$TAL1BETAE47_01 | 8265 (+) | 0.780 | 0.792 | ttgacCTGATgatgca V$TAL1BETAE47_01 | 8418 (+) | 0.882 | 0.870 | gtaagCAGCTgttcta V$TAL1BETAE47_01 | 8418 (-) | 0.882 | 0.879 | gtaagcAGCTGttcta V$TAL1BETAE47_01 | 8501 (+) | 0.894 | 0.803 | agggaCAGGTgcggga V$TAL1BETAE47_01 | 8518 (+) | 1.000 | 0.935 | actcaCAGATgcttca V$TAL1BETAE47_01 | 8590 (-) | 0.780 | 0.803 | ggcaacATCTTtaata V$TAL1BETAE47_01 | 8765 (-) | 0.894 | 0.848 | cccagcACCTGggagg V$TAL1BETAE47_01 | 8809 (-) | 0.780 | 0.791 | gccagcTTCTGgtact V$TAL1BETAE47_01 | 8966 (+) | 0.894 | 0.826 | agaacCAGGTgtggtg V$TAL1BETAE47_01 | 10709 (-) | 0.780 | 0.806 | 
tgaaacATCTCtgcgt V$TAL1BETAE47_01 | 11536 (+) | 0.780 | 0.819 | tcccaCATATggtaga V$TAL1BETAE47_01 | 11707 (-) | 0.885 | 0.803 | caccccAACTGccctc V$TAL1ALPHAE47_01 | 39 (+) | 1.000 | 0.868 | aacttCAGATgtcatt V$TAL1ALPHAE47_01 | 143 (-) | 0.810 | 0.769 | ccagacATTTGctcat V$TAL1ALPHAE47_01 | 714 (+) | 1.000 | 0.968 | ctttaCAGATggttgt V$TAL1ALPHAE47_01 | 732 (+) | 0.696 | 0.768 | gccacCACGTggttgc V$TAL1ALPHAE47_01 | 732 (-) | 0.696 | 0.769 | gccaccACGTGgttgc V$TAL1ALPHAE47_01 | 1024 (+) | 0.875 | 0.760 | attggCAGTTgaatca V$TAL1ALPHAE47_01 | 1173 (+) | 0.772 | 0.743 | gaaagGAGATgggcac V$TAL1ALPHAE47_01 | 1192 (+) | 0.655 | 0.745 | caggaAAGCTggttat V$TAL1ALPHAE47_01 | 1259 (-) | 0.772 | 0.788 | gtcatcATCCGtgtgt V$TAL1ALPHAE47_01 | 1355 (-) | 0.886 | 0.790 | tgggtcACCTGatgct V$TAL1ALPHAE47_01 | 1358 (+) | 0.772 | 0.786 | gtcacCTGATgctcgt V$TAL1ALPHAE47_01 | 1389 (+) | 0.886 | 0.777 | tagggCAGGTgaggag V$TAL1ALPHAE47_01 | 1658 (+) | 0.810 | 0.750 | cttggCAAATggagat V$TAL1ALPHAE47_01 | 1707 (-) | 0.772 | 0.790 | agcaccCTCTGccttt V$TAL1ALPHAE47_01 | 1783 (-) | 0.886 | 0.837 | tggagcACCTGccatt V$TAL1ALPHAE47_01 | 1857 (-) | 0.875 | 0.796 | agagccAACTGaagaa V$TAL1ALPHAE47_01 | 1873 (-) | 1.000 | 0.763 | gggaggATCTGcctcc V$TAL1ALPHAE47_01 | 1993 (-) | 1.000 | 0.769 | aagattATCTGgctat V$TAL1ALPHAE47_01 | 2580 (+) | 0.810 | 0.766 | tacaaCAAATgcagta V$TAL1ALPHAE47_01 | 2611 (-) | 0.810 | 0.791 | agagccATGTGtatgc V$TAL1ALPHAE47_01 | 2626 (+) | 0.810 | 0.785 | ctaggCAAATgctcta V$TAL1ALPHAE47_01 | 2726 (-) | 0.772 | 0.755 | ataatcTTCTGccttt V$TAL1ALPHAE47_01 | 2823 (+) | 0.875 | 0.802 | ctgtcCAGTTgtctgc V$TAL1ALPHAE47_01 | 2833 (+) | 0.810 | 0.761 | gtctgCAAATggcatc V$TAL1ALPHAE47_01 | 2906 (-) | 0.772 | 0.745 | atccacATCTCtgtgg V$TAL1ALPHAE47_01 | 2967 (+) | 1.000 | 0.853 | gacggCAGATgcgcag V$TAL1ALPHAE47_01 | 3295 (-) | 0.772 | 0.777 | gtgaacATCTCcatgt V$TAL1ALPHAE47_01 | 3375 (+) | 0.818 | 0.767 | ctaatCATATggggcc V$TAL1ALPHAE47_01 | 3375 (-) | 0.818 | 0.794 | ctaatcATATGgggcc 
V$TAL1ALPHAE47_01 | 3638 (+) | 0.810 | 0.841 | cctacCACATggtgga V$TAL1ALPHAE47_01 | 3638 (-) | 0.704 | 0.773 | cctaccACATGgtgga V$TAL1ALPHAE47_01 | 3697 (+) | 0.772 | 0.803 | cacaaGAGATgcttga V$TAL1ALPHAE47_01 | 3984 (-) | 0.772 | 0.805 | gacaccATCTCatgaa V$TAL1ALPHAE47_01 | 4115 (+) | 1.000 | 0.941 | tttccCAGATgttaaa V$TAL1ALPHAE47_01 | 4152 (-) | 0.810 | 0.780 | cctcccATTTGtaatc V$TAL1ALPHAE47_01 | 4183 (+) | 0.772 | 0.753 | ctgagCAGAAggctgc V$TAL1ALPHAE47_01 | 4336 (+) | 0.886 | 0.872 | acatcCAGGTggtgga V$TAL1ALPHAE47_01 | 4358 (+) | 0.772 | 0.789 | aggatCAGAAgtttaa V$TAL1ALPHAE47_01 | 4509 (-) | 0.772 | 0.782 | aacaacATCAGaccaa V$TAL1ALPHAE47_01 | 4525 (+) | 0.810 | 0.768 | gacaaCAAATgcgtca V$TAL1ALPHAE47_01 | 4563 (+) | 0.875 | 0.792 | ggacaCAGTTgcggtt V$TAL1ALPHAE47_01 | 4735 (+) | 0.772 | 0.785 | cctcgCTGATggtcct V$TAL1ALPHAE47_01 | 4750 (-) | 0.772 | 0.754 | tctcccCTCTGtaggt V$TAL1ALPHAE47_01 | 4808 (-) | 0.810 | 0.779 | tctggcATTTGttacc V$TAL1ALPHAE47_01 | 4924 (+) | 1.000 | 0.786 | acttaCAGATcatgga V$TAL1ALPHAE47_01 | 5124 (+) | 1.000 | 0.994 | ggaaaCAGATggtgcg V$TAL1ALPHAE47_01 | 5124 (-) | 0.701 | 0.757 | ggaaacAGATGgtgcg V$TAL1ALPHAE47_01 | 5249 (+) | 0.704 | 0.778 | gttcaCATGTggtcac V$TAL1ALPHAE47_01 | 5249 (-) | 0.810 | 0.772 | gttcacATGTGgtcac V$TAL1ALPHAE47_01 | 5285 (+) | 1.000 | 0.871 | aggctCAGATgggcta V$TAL1ALPHAE47_01 | 5616 (-) | 0.772 | 0.791 | accaacATCTAataca V$TAL1ALPHAE47_01 | 5871 (-) | 0.772 | 0.805 | tggaccATCTCctgct V$TAL1ALPHAE47_01 | 5980 (-) | 0.772 | 0.786 | gataacTTCTGgatta V$TAL1ALPHAE47_01 | 6033 (+) | 0.701 | 0.748 | ggaccCATCTggtgca V$TAL1ALPHAE47_01 | 6033 (-) | 1.000 | 0.907 | ggacccATCTGgtgca V$TAL1ALPHAE47_01 | 6112 (+) | 0.772 | 0.766 | ttgacCAGAGggcttg V$TAL1ALPHAE47_01 | 6259 (-) | 1.000 | 0.884 | tgtcccATCTGccgaa V$TAL1ALPHAE47_01 | 6283 (+) | 0.772 | 0.749 | cctctCTGATgatgac V$TAL1ALPHAE47_01 | 6493 (+) | 1.000 | 0.845 | gattgCAGATgcggcc V$TAL1ALPHAE47_01 | 6632 (+) | 0.772 | 0.746 | gcaccCAGAAggcacc 
V$TAL1ALPHAE47_01 | 6655 (+) | 1.000 | 0.894 | gcaaaCAGATgaagat V$TAL1ALPHAE47_01 | 6842 (+) | 0.693 | 0.753 | actgcCAACTggtgag V$TAL1ALPHAE47_01 | 6842 (-) | 0.875 | 0.829 | actgccAACTGgtgag V$TAL1ALPHAE47_01 | 7009 (+) | 0.772 | 0.785 | gcaggCGGATgtttcc V$TAL1ALPHAE47_01 | 7025 (+) | 0.883 | 0.778 | tgcatCAGCTgaagag V$TAL1ALPHAE47_01 | 7025 (-) | 0.883 | 0.821 | tgcatcAGCTGaagag V$TAL1ALPHAE47_01 | 7109 (+) | 0.810 | 0.768 | tgtacCACATgtgtgc V$TAL1ALPHAE47_01 | 7109 (-) | 0.704 | 0.774 | tgtaccACATGtgtgc V$TAL1ALPHAE47_01 | 7111 (-) | 0.810 | 0.775 | taccacATGTGtgcaa V$TAL1ALPHAE47_01 | 7171 (+) | 1.000 | 0.938 | cgttgCAGATggtccc V$TAL1ALPHAE47_01 | 7388 (+) | 0.886 | 0.874 | gtgtcCAGGTggtaga V$TAL1ALPHAE47_01 | 7485 (-) | 0.772 | 0.768 | gcgcacGTCTGttctt V$TAL1ALPHAE47_01 | 7508 (+) | 1.000 | 0.890 | ggtgtCAGATggcctg V$TAL1ALPHAE47_01 | 7530 (+) | 0.875 | 0.884 | agttaCAGTTggttgt V$TAL1ALPHAE47_01 | 7545 (-) | 0.875 | 0.831 | tgagccAACTGtgtag V$TAL1ALPHAE47_01 | 7553 (+) | 0.772 | 0.754 | ctgtgTAGATgctgga V$TAL1ALPHAE47_01 | 7575 (-) | 0.772 | 0.756 | atccccATCAGgtcaa V$TAL1ALPHAE47_01 | 7587 (+) | 0.810 | 0.836 | tcaaaCAAATgctctg V$TAL1ALPHAE47_01 | 7609 (-) | 0.772 | 0.772 | tgagccATCTCtccag V$TAL1ALPHAE47_01 | 7677 (+) | 0.883 | 0.820 | gctcaCAGCTgtaaag V$TAL1ALPHAE47_01 | 7677 (-) | 0.883 | 0.819 | gctcacAGCTGtaaag V$TAL1ALPHAE47_01 | 7883 (-) | 0.810 | 0.753 | gaagtcATTTGgtttt V$TAL1ALPHAE47_01 | 8217 (+) | 0.772 | 0.771 | cagttCTGATgttggc V$TAL1ALPHAE47_01 | 8237 (+) | 1.000 | 0.849 | acaccCAGATgaaggt V$TAL1ALPHAE47_01 | 8265 (+) | 0.772 | 0.787 | ttgacCTGATgatgca V$TAL1ALPHAE47_01 | 8274 (+) | 0.883 | 0.776 | tgatgCAGCTgccctg V$TAL1ALPHAE47_01 | 8274 (-) | 0.883 | 0.766 | tgatgcAGCTGccctg V$TAL1ALPHAE47_01 | 8315 (-) | 0.772 | 0.754 | tgcagcTTCTGcgtca V$TAL1ALPHAE47_01 | 8345 (-) | 0.696 | 0.744 | tggaccACTTGccaac V$TAL1ALPHAE47_01 | 8418 (+) | 0.883 | 0.872 | gtaagCAGCTgttcta V$TAL1ALPHAE47_01 | 8418 (-) | 0.883 | 0.885 | gtaagcAGCTGttcta 
V$TAL1ALPHAE47_01 | 8501 (+) | 0.886 | 0.808 | agggaCAGGTgcggga V$TAL1ALPHAE47_01 | 8518 (+) | 1.000 | 0.940 | actcaCAGATgcttca V$TAL1ALPHAE47_01 | 8590 (-) | 0.772 | 0.812 | ggcaacATCTTtaata V$TAL1ALPHAE47_01 | 8667 (-) | 0.658 | 0.761 | caaaccACCGGttagg V$TAL1ALPHAE47_01 | 8765 (-) | 0.886 | 0.842 | cccagcACCTGggagg V$TAL1ALPHAE47_01 | 8809 (-) | 0.772 | 0.784 | gccagcTTCTGgtact V$TAL1ALPHAE47_01 | 8966 (+) | 0.886 | 0.818 | agaacCAGGTgtggtg V$TAL1ALPHAE47_01 | 8966 (-) | 0.693 | 0.762 | agaaccAGGTGtggtg V$TAL1ALPHAE47_01 | 8997 (+) | 0.810 | 0.769 | tccagCACATggaggc V$TAL1ALPHAE47_01 | 9017 (+) | 1.000 | 0.783 | gcaggCAGATctttgt V$TAL1ALPHAE47_01 | 9148 (+) | 0.818 | 0.772 | agacaCATATgtgagt V$TAL1ALPHAE47_01 | 9148 (-) | 0.818 | 0.775 | agacacATATGtgagt V$TAL1ALPHAE47_01 | 9327 (-) | 0.772 | 0.768 | cagagcCTCTGgagga V$TAL1ALPHAE47_01 | 9497 (-) | 1.000 | 0.771 | tagaaaATCTGaaagt V$TAL1ALPHAE47_01 | 9597 (-) | 0.810 | 0.789 | cccagcATTTGggaag V$TAL1ALPHAE47_01 | 10095 (-) | 0.772 | 0.774 | cttaacCTCTGagcca V$TAL1ALPHAE47_01 | 10104 (-) | 0.772 | 0.750 | tgagccATCTTgccat V$TAL1ALPHAE47_01 | 10709 (-) | 0.772 | 0.810 | tgaaacATCTCtgcgt V$TAL1ALPHAE47_01 | 11367 (-) | 1.000 | 0.753 | gattctATCTGtgctt V$TAL1ALPHAE47_01 | 11536 (+) | 0.818 | 0.852 | tcccaCATATggtaga V$TAL1ALPHAE47_01 | 11536 (-) | 0.818 | 0.776 | tcccacATATGgtaga V$TAL1ALPHAE47_01 | 11619 (+) | 0.772 | 0.751 | ggtacCAGAAggaccc V$TAL1ALPHAE47_01 | 11707 (-) | 0.875 | 0.802 | caccccAACTGccctc V$TAL1ALPHAE47_01 | 11733 (-) | 0.772 | 0.780 | tcaaacATCTCaccaa V$TAL1ALPHAE47_01 | 11741 (+) | 0.810 | 0.754 | ctcacCAAATgactag V$TAL1ALPHAE47_01 | 11770 (+) | 1.000 | 0.762 | gattcCAGATacttga V$TAL1ALPHAE47_01 | 11958 (-) | 0.886 | 0.763 | gactgcACCTGagctc I$HAIRY_01 | 5818 (-) | 1.000 | 0.887 | ccctgGCGTGgcac I$HAIRY_01 | 10866 (-) | 1.000 | 0.902 | atgcaGCGTGgtgg V$YY1_02 | 56 (+) | 1.000 | 0.853 | cttggGCCATaattactgtt V$YY1_02 | 199 (+) | 0.908 | 0.761 | acaccACCATgcctggtatt V$YY1_02 | 383 (+) | 1.000 | 
0.765 | gagatGCCATtcatttgcca V$YY1_02 | 403 (-) | 0.881 | 0.777 | acaaggcggaATGGGtgagc V$YY1_02 | 523 (-) | 0.881 | 0.810 | tttaaaaatgATGGGtgatg V$YY1_02 | 712 (-) | 0.908 | 0.807 | ctctttacagATGGTtgtga V$YY1_02 | 797 (+) | 1.000 | 0.799 | gctgaGCCATttctccagcc V$YY1_02 | 880 (-) | 0.881 | 0.772 | caaacacgaaATGGGacaga V$YY1_02 | 1171 (-) | 0.881 | 0.805 | aagaaaggagATGGGcaccc V$YY1_02 | 1788 (+) | 1.000 | 0.802 | cacctGCCATtcacaggatt V$YY1_02 | 1882 (+) | 0.881 | 0.796 | tgcctCCCATtttatattta V$YY1_02 | 2450 (-) | 1.000 | 0.793 | tggtagcgctATGGCctagc V$YY1_02 | 2609 (+) | 1.000 | 0.815 | caagaGCCATgtgtatgcta V$YY1_02 | 2831 (-) | 1.000 | 0.809 | ttgtctgcaaATGGCatcat V$YY1_02 | 3143 (-) | 0.881 | 0.772 | acaacaatatATGGAggttc V$YY1_02 | 3156 (+) | 0.881 | 0.760 | gaggtTCCATtccgcctgta V$YY1_02 | 3270 (+) | 1.000 | 0.853 | gtgatGCCATtatgattggg V$YY1_02 | 3519 (+) | 0.780 | 0.765 | aggagGCCACcttggcccga V$YY1_02 | 3636 (-) | 0.908 | 0.808 | tacctaccacATGGTggaat V$YY1_02 | 3949 (+) | 1.000 | 0.837 | attcaGCCATcctactgagc V$YY1_02 | 4105 (+) | 0.881 | 0.795 | aaacaTCCATtttcccagat V$YY1_02 | 4150 (+) | 0.881 | 0.788 | tgcctCCCATttgtaatcct V$YY1_02 | 4733 (-) | 0.908 | 0.763 | gccctcgctgATGGTcctct V$YY1_02 | 4887 (+) | 0.881 | 0.781 | aacatTCCATgttccttggc V$YY1_02 | 4985 (-) | 1.000 | 0.844 | acctgtccgaATGGCagacc V$YY1_02 | 5122 (-) | 0.908 | 0.797 | aaggaaacagATGGTgcggt V$YY1_02 | 5283 (-) | 0.881 | 0.774 | acaggctcagATGGGctagc V$YY1_02 | 5296 (+) | 1.000 | 0.820 | ggctaGCCATccaaaatggc V$YY1_02 | 5301 (-) | 1.000 | 0.880 | gccatccaaaATGGCttctt V$YY1_02 | 5442 (+) | 1.000 | 0.824 | gtattGCCATataacctttt V$YY1_02 | 5733 (+) | 1.000 | 0.762 | tggaaGCCATaatatagacg V$YY1_02 | 5869 (+) | 0.908 | 0.785 | catggACCATctcctgctgg V$YY1_02 | 6031 (+) | 0.881 | 0.780 | taggaCCCATctggtgcagc V$YY1_02 | 6257 (+) | 0.881 | 0.774 | tttgtCCCATctgccgaaga V$YY1_02 | 6891 (-) | 0.780 | 0.770 | gtaagacaagAGGGCataag V$YY1_02 | 6960 (-) | 0.881 | 0.787 | attattctggATGGAaagga V$YY1_02 | 6990 (-) | 0.881 
| 0.770 | tggtgaggagATGGAaaagg V$YY1_02 | 7169 (-) | 0.908 | 0.802 | ggcgttgcagATGGTcccga V$YY1_02 | 7506 (-) | 1.000 | 0.853 | agggtgtcagATGGCctgaa V$YY1_02 | 7607 (+) | 1.000 | 0.844 | gctgaGCCATctctccagcc V$YY1_02 | 7729 (+) | 1.000 | 0.799 | tttggGCCATcacaggctgg V$YY1_02 | 8649 (-) | 0.908 | 0.761 | acaattcccaATGGTtttca V$YY1_02 | 8702 (+) | 0.908 | 0.774 | aaaatACCATcaaggagtct V$YY1_02 | 8995 (-) | 0.881 | 0.761 | attccagcacATGGAggcag V$YY1_02 | 9047 (-) | 1.000 | 0.828 | cctggtctacATGGCaagtt V$YY1_02 | 10102 (+) | 1.000 | 0.902 | tctgaGCCATcttgccatcc V$YY1_02 | 10110 (+) | 1.000 | 0.833 | atcttGCCATccccaataaa V$YY1_02 | 10503 (-) | 1.000 | 0.810 | aaagtcaattATGGCtgtag V$YY1_02 | 10517 (+) | 1.000 | 0.764 | ctgtaGCCATtataggaggc V$YY1_02 | 10849 (+) | 1.000 | 0.768 | aagaaGCCATtgtagagatg V$YY1_02 | 11341 (+) | 1.000 | 0.800 | taaatGCCATtatcgaacga V$YY1_02 | 11534 (-) | 0.908 | 0.756 | aatcccacatATGGTagatg V$YY1_02 | 11541 (-) | 0.881 | 0.785 | catatggtagATGGGtattc V$YY1_02 | 11719 (+) | 0.908 | 0.794 | cctccACCATtttctcaaac V$TAL1BETAITF2_01 | 714 (+) | 1.000 | 0.954 | ctttaCAGATggttgt V$TAL1BETAITF2_01 | 5124 (+) | 1.000 | 0.997 | ggaaaCAGATggtgcg V$TAL1BETAITF2_01 | 6033 (-) | 1.000 | 0.907 | ggacccATCTGgtgca V$TAL1BETAITF2_01 | 7171 (+) | 1.000 | 0.916 | cgttgCAGATggtccc V$TAL1BETAITF2_01 | 8518 (+) | 1.000 | 0.905 | actcaCAGATgcttca V$E47_02 | 39 (+) | 0.866 | 0.833 | aacttCAGATgtcatt V$E47_02 | 284 (+) | 0.836 | 0.843 | caagtCAAGTgtttta V$E47_02 | 714 (+) | 0.866 | 0.855 | ctttaCAGATggttgt V$E47_02 | 732 (+) | 0.845 | 0.834 | gccacCACGTggttgc V$E47_02 | 732 (-) | 0.845 | 0.835 | gccaccACGTGgttgc V$E47_02 | 1123 (+) | 0.845 | 0.825 | gtatgCACGTgtgttc V$E47_02 | 1123 (-) | 0.845 | 0.819 | gtatgcACGTGtgttc V$E47_02 | 1355 (-) | 1.000 | 0.903 | tgggtcACCTGatgct V$E47_02 | 1389 (+) | 1.000 | 0.898 | tagggCAGGTgaggag V$E47_02 | 1783 (-) | 1.000 | 0.960 | tggagcACCTGccatt V$E47_02 | 2422 (+) | 0.766 | 0.817 | aatcaCAGGGgttggg V$E47_02 | 2611 (+) | 0.842 | 0.823 | 
agagcCATGTgtatgc V$E47_02 | 2823 (+) | 0.866 | 0.832 | ctgtcCAGTTgtctgc V$E47_02 | 2967 (+) | 0.866 | 0.824 | gacggCAGATgcgcag V$E47_02 | 3638 (-) | 0.842 | 0.837 | cctaccACATGgtgga V$E47_02 | 3898 (+) | 1.000 | 0.822 | catcaCAGGTattaat V$E47_02 | 4115 (+) | 0.866 | 0.880 | tttccCAGATgttaaa V$E47_02 | 4336 (+) | 1.000 | 0.939 | acatcCAGGTggtgga V$E47_02 | 4563 (+) | 0.866 | 0.824 | ggacaCAGTTgcggtt V$E47_02 | 5124 (+) | 0.866 | 0.861 | ggaaaCAGATggtgcg V$E47_02 | 5249 (+) | 0.842 | 0.850 | gttcaCATGTggtcac V$E47_02 | 6061 (+) | 0.845 | 0.841 | ccctgCACGTgctgcc V$E47_02 | 6061 (-) | 0.845 | 0.817 | ccctgcACGTGctgcc V$E47_02 | 6259 (-) | 0.866 | 0.814 | tgtcccATCTGccgaa V$E47_02 | 6493 (+) | 0.866 | 0.819 | gattgCAGATgcggcc V$E47_02 | 6655 (+) | 0.866 | 0.824 | gcaaaCAGATgaagat V$E47_02 | 6842 (-) | 0.866 | 0.815 | actgccAACTGgtgag V$E47_02 | 6940 (+) | 0.836 | 0.820 | gtaccCAAGTgatact V$E47_02 | 7025 (+) | 0.891 | 0.826 | tgcatCAGCTgaagag V$E47_02 | 7025 (-) | 0.891 | 0.849 | tgcatcAGCTGaagag V$E47_02 | 7109 (-) | 0.842 | 0.844 | tgtaccACATGtgtgc V$E47_02 | 7111 (+) | 0.842 | 0.840 | taccaCATGTgtgcaa V$E47_02 | 7171 (+) | 0.866 | 0.858 | cgttgCAGATggtccc V$E47_02 | 7388 (+) | 1.000 | 0.943 | gtgtcCAGGTggtaga V$E47_02 | 7530 (+) | 0.866 | 0.855 | agttaCAGTTggttgt V$E47_02 | 7545 (-) | 0.866 | 0.816 | tgagccAACTGtgtag V$E47_02 | 7677 (+) | 0.891 | 0.881 | gctcaCAGCTgtaaag V$E47_02 | 7677 (-) | 0.891 | 0.873 | gctcacAGCTGtaaag V$E47_02 | 8274 (+) | 0.891 | 0.840 | tgatgCAGCTgccctg V$E47_02 | 8274 (-) | 0.891 | 0.842 | tgatgcAGCTGccctg V$E47_02 | 8345 (-) | 0.836 | 0.834 | tggaccACTTGccaac V$E47_02 | 8355 (-) | 0.766 | 0.816 | gccaacCCCTGtgcag V$E47_02 | 8418 (+) | 0.891 | 0.912 | gtaagCAGCTgttcta V$E47_02 | 8418 (-) | 0.891 | 0.888 | gtaagcAGCTGttcta V$E47_02 | 8501 (+) | 1.000 | 0.919 | agggaCAGGTgcggga V$E47_02 | 8518 (+) | 0.866 | 0.865 | actcaCAGATgcttca V$E47_02 | 8765 (-) | 1.000 | 0.947 | cccagcACCTGggagg V$E47_02 | 8966 (+) | 1.000 | 0.938 | agaacCAGGTgtggtg 
V$E47_02 | 8997 (-) | 0.842 | 0.830 | tccagcACATGgaggc V$E47_02 | 10692 (-) | 0.766 | 0.817 | ggcaacACCCGtgttt V$E47_02 | 11958 (-) | 1.000 | 0.913 | gactgcACCTGagctc V$CP2_01 | 7 (+) | 0.936 | 0.798 | gcatgtACAAG V$CP2_01 | 103 (-) | 1.000 | 0.877 | CTGGGggtggc V$CP2_01 | 136 (+) | 0.961 | 0.798 | agtcaaTCCAG V$CP2_01 | 274 (-) | 0.936 | 0.818 | CTTGTgcatgc V$CP2_01 | 329 (-) | 0.961 | 0.766 | CTGGAaattcc V$CP2_01 | 475 (+) | 0.936 | 0.762 | acacttACAAG V$CP2_01 | 747 (-) | 1.000 | 0.770 | CTGGGaattga V$CP2_01 | 1184 (+) | 1.000 | 0.798 | ggcaccCCCAG V$CP2_01 | 1200 (-) | 0.987 | 0.881 | CTGGTtatacc V$CP2_01 | 1329 (+) | 1.000 | 0.822 | tcactcCCCAG V$CP2_01 | 1349 (-) | 0.923 | 0.754 | CTTGCctgggt V$CP2_01 | 1583 (+) | 0.974 | 0.762 | tggagaGCCAG V$CP2_01 | 1596 (-) | 1.000 | 0.889 | CTGGGtttggg V$CP2_01 | 1637 (+) | 0.897 | 0.774 | gcctttCCTAG V$CP2_01 | 1939 (+) | 0.871 | 0.758 | gccttgGCTAG V$CP2_01 | 1941 (-) | 0.948 | 0.806 | CTTGGctaggc V$CP2_01 | 2001 (-) | 0.974 | 0.774 | CTGGCtataca V$CP2_01 | 2290 (-) | 0.884 | 0.798 | CTAGTttttgt V$CP2_01 | 2462 (+) | 0.923 | 0.873 | ggcctaGCAAG V$CP2_01 | 2465 (-) | 0.871 | 0.810 | CTAGCaagcgc V$CP2_01 | 2483 (-) | 1.000 | 0.838 | CTGGGttcggt V$CP2_01 | 2650 (-) | 0.714 | 0.761 | CTGCGtaatgc V$CP2_01 | 2758 (+) | 0.858 | 0.770 | tctacaTCTAG V$CP2_01 | 2765 (-) | 0.858 | 0.837 | CTAGAttccgc V$CP2_01 | 2795 (-) | 0.987 | 0.750 | CTGGTcttctc V$CP2_01 | 2820 (+) | 0.961 | 0.877 | gctctgTCCAG V$CP2_01 | 2995 (+) | 0.948 | 0.857 | gctcctCCAAG V$CP2_01 | 3039 (+) | 0.714 | 0.812 | gctccaCCCGG V$CP2_01 | 3065 (+) | 0.871 | 0.810 | ggtataGCTAG V$CP2_01 | 3266 (-) | 0.948 | 0.893 | CTTGGtgatgc V$CP2_01 | 3942 (-) | 0.936 | 0.798 | CTTGTtcattc V$CP2_01 | 3970 (-) | 0.974 | 0.762 | CTGGCtgtatg V$CP2_01 | 4126 (+) | 1.000 | 0.770 | ttaaaaCCCAG V$CP2_01 | 4133 (+) | 0.858 | 0.770 | ccagaaTCTAG V$CP2_01 | 4140 (-) | 0.897 | 0.794 | CTAGGtgtggt V$CP2_01 | 4197 (+) | 0.910 | 0.814 | gcaagtTCAAG V$CP2_01 | 4241 (-) | 1.000 | 0.790 | 
CTGGGgattgt V$CP2_01 | 4256 (-) | 0.948 | 0.770 | CTTGGttgtta V$CP2_01 | 4324 (-) | 1.000 | 0.869 | CTGGGtctggt V$CP2_01 | 4330 (-) | 0.987 | 0.750 | CTGGTgacatc V$CP2_01 | 4333 (+) | 0.961 | 0.821 | gtgacaTCCAG V$CP2_01 | 4560 (+) | 0.714 | 0.761 | gctggaCACAG V$CP2_01 | 4604 (-) | 0.923 | 0.762 | CTTGCtggaag V$CP2_01 | 4736 (-) | 0.871 | 0.766 | CTCGCtgatgg V$CP2_01 | 4767 (+) | 0.871 | 0.766 | ccagcaGCGAG V$CP2_01 | 4779 (-) | 0.961 | 0.814 | CTGGAtaaaga V$CP2_01 | 4809 (-) | 0.974 | 0.754 | CTGGCatttgt V$CP2_01 | 4947 (-) | 1.000 | 0.770 | CTGGGactttc V$CP2_01 | 5004 (-) | 0.936 | 0.750 | CTTGTattacc V$CP2_01 | 5030 (-) | 1.000 | 0.790 | CTGGGatgctc V$CP2_01 | 5081 (-) | 0.974 | 0.853 | CTGGCtctgac V$CP2_01 | 5091 (-) | 0.987 | 0.893 | CTGGTttgggt V$CP2_01 | 5139 (+) | 0.897 | 0.857 | ggtcaaCCTAG V$CP2_01 | 5526 (+) | 0.961 | 0.798 | ctcctaTCCAG V$CP2_01 | 5553 (+) | 0.884 | 0.750 | tctctcACTAG V$CP2_01 | 5564 (+) | 0.987 | 0.750 | aaaacaACCAG V$CP2_01 | 5694 (-) | 0.884 | 0.766 | CTAGTtcatac V$CP2_01 | 5885 (-) | 1.000 | 0.790 | CTGGGcatggg V$CP2_01 | 5920 (+) | 0.871 | 0.810 | tccccaGCGAG V$CP2_01 | 5988 (-) | 0.961 | 0.834 | CTGGAttaggg V$CP2_01 | 6041 (-) | 0.987 | 0.770 | CTGGTgcagcc V$CP2_01 | 6124 (-) | 0.936 | 0.817 | CTTGTtttctc V$CP2_01 | 6151 (-) | 0.974 | 0.762 | CTGGCtctcct V$CP2_01 | 6206 (+) | 0.948 | 0.758 | ggtatgCCAAG V$CP2_01 | 6240 (-) | 0.974 | 0.921 | CTGGCtacggc V$CP2_01 | 6391 (-) | 0.974 | 0.786 | CTGGCctgtgt V$CP2_01 | 6437 (+) | 1.000 | 0.921 | ggtcaaCCCAG V$CP2_01 | 6490 (-) | 0.961 | 0.786 | CTGGAttgcag V$CP2_01 | 6629 (+) | 1.000 | 0.833 | ggggcaCCCAG V$CP2_01 | 6794 (-) | 0.974 | 0.873 | CTGGCtcttcc V$CP2_01 | 6850 (-) | 0.987 | 0.802 | CTGGTgagcac V$CP2_01 | 6862 (+) | 0.897 | 0.806 | tccagaCCGAG V$CP2_01 | 6937 (+) | 0.714 | 0.761 | gcagtaCCCAA V$CP2_01 | 6966 (-) | 0.961 | 0.766 | CTGGAtggaaa V$CP2_01 | 7161 (-) | 0.961 | 0.865 | CTGGAgctggc V$CP2_01 | 7629 (-) | 0.974 | 0.762 | CTGGCggtaga V$CP2_01 | 7686 (+) | 0.974 | 0.774 | 
tgtaaaGCCAG V$CP2_01 | 7844 (-) | 0.974 | 0.822 | CTGGCaaaagc V$CP2_01 | 8036 (-) | 0.923 | 0.861 | CTTGCtgggcc V$CP2_01 | 8040 (-) | 1.000 | 0.770 | CTGGGccttga V$CP2_01 | 8097 (-) | 1.000 | 0.838 | CTGGGtcctgt V$CP2_01 | 8133 (-) | 0.701 | 0.753 | CTGCTttctgc V$CP2_01 | 8172 (+) | 0.987 | 0.802 | ggtagcACCAG V$CP2_01 | 8185 (-) | 0.936 | 0.849 | CTTGTttgctc V$CP2_01 | 8234 (+) | 1.000 | 0.877 | cccacaCCCAG V$CP2_01 | 8327 (+) | 0.974 | 0.774 | gtcagcGCCAG V$CP2_01 | 8399 (+) | 0.974 | 0.806 | agccaaGCCAG V$CP2_01 | 8452 (+) | 0.948 | 0.758 | cctaagCCAAG V$CP2_01 | 8817 (-) | 0.987 | 0.750 | CTGGTactctc V$CP2_01 | 8963 (+) | 0.987 | 0.869 | gggagaACCAG V$CP2_01 | 9060 (+) | 0.961 | 0.845 | gcaagtTCCAG V$CP2_01 | 9069 (+) | 0.974 | 0.794 | aggctaGCCAG V$CP2_01 | 9072 (-) | 0.871 | 0.822 | CTAGCcagggc V$CP2_01 | 9189 (+) | 0.910 | 0.766 | gaccacTCAAG V$CP2_01 | 9219 (+) | 0.923 | 0.822 | gcaaagGCAAG V$CP2_01 | 9229 (+) | 0.961 | 0.766 | gttagcTCCAG V$CP2_01 | 9272 (+) | 0.688 | 0.752 | gcgacaGCCAC V$CP2_01 | 9276 (+) | 0.948 | 0.766 | cagccaCCAAG V$CP2_01 | 9298 (-) | 0.936 | 0.790 | CTTGTgtgccc V$CP2_01 | 9394 (-) | 0.961 | 0.798 | CTGGAatgtgg V$CP2_01 | 9438 (+) | 0.948 | 0.778 | ggaagcCCAAG V$CP2_01 | 9520 (+) | 0.936 | 0.770 | tgtccaACAAG V$CP2_01 | 9535 (+) | 0.936 | 0.861 | gaacaaACAAG V$CP2_01 | 9570 (-) | 1.000 | 0.857 | CTGGGtgtggt V$CP2_01 | 9915 (-) | 0.961 | 0.766 | CTGGAtttttg V$CP2_01 | 10007 (-) | 1.000 | 0.770 | CTGGGactgga V$CP2_01 | 10228 (+) | 0.897 | 0.762 | gtggaaCCTAG V$CP2_01 | 10235 (-) | 0.897 | 0.758 | CTAGGttggtg V$CP2_01 | 10248 (-) | 0.688 | 0.808 | CAGGCttgtgc V$CP2_01 | 10513 (-) | 0.688 | 0.764 | ATGGCtgtagc V$CP2_01 | 10781 (+) | 1.000 | 0.838 | gtcggaCCCAG V$CP2_01 | 10969 (+) | 0.923 | 0.790 | gttggaGCAAG V$CP2_01 | 11219 (+) | 1.000 | 0.802 | gaactgCCCAG V$CP2_01 | 11580 (+) | 0.961 | 0.766 | cttataTCCAG V$CP2_01 | 11793 (+) | 1.000 | 0.770 | acaaatCCCAG V$DELTAEF1_01 | 374 (-) | 1.000 | 0.985 | tctAGGTGaga V$DELTAEF1_01 | 1357 
(+) | 1.000 | 0.987 | ggtCACCTgat V$DELTAEF1_01 | 1392 (-) | 1.000 | 0.982 | ggcAGGTGagg V$DELTAEF1_01 | 3290 (-) | 1.000 | 0.985 | atgAGGTGaac V$DELTAEF1_01 | 6596 (-) | 1.000 | 0.985 | ataAGGTGgtg V$DELTAEF1_01 | 7391 (-) | 1.000 | 0.980 | tccAGGTGgta V$DELTAEF1_01 | 9613 (-) | 1.000 | 0.985 | ctgAGGTGgga V$DELTAEF1_01 | 11960 (+) | 1.000 | 0.991 | ctgCACCTgag V$CETS1P54_02 | 223 (-) | 0.993 | 0.965 | ttttTTCCTtttt V$CETS1P54_02 | 326 (+) | 0.983 | 0.968 | gtgcTGGAAattc V$CETS1P54_02 | 608 (-) | 0.971 | 0.972 | aaacATCCTgtcg V$CETS1P54_02 | 766 (+) | 0.983 | 0.957 | cctcTGGAAgagt V$CETS1P54_02 | 1558 (+) | 0.983 | 0.969 | cttcTGGAAatct V$CETS1P54_02 | 1718 (-) | 1.000 | 0.966 | ccttTTCCGggct V$CETS1P54_02 | 1737 (-) | 0.993 | 0.971 | gaacTTCCTatag V$CETS1P54_02 | 1948 (+) | 0.983 | 0.960 | aggcTGGAActtt V$CETS1P54_02 | 2159 (-) | 0.993 | 0.963 | cctcTTCCTttcc V$CETS1P54_02 | 2203 (-) | 0.993 | 0.964 | tttcTTCCTttct V$CETS1P54_02 | 2563 (-) | 1.000 | 0.977 | aaatTTCCGtaag V$CETS1P54_02 | 2987 (-) | 1.000 | 0.979 | ctacTTCCGctcc V$CETS1P54_02 | 3031 (-) | 1.000 | 0.977 | cgacTTCCGctcc V$CETS1P54_02 | 3323 (-) | 0.993 | 0.965 | ccttTTCCTgatt V$CETS1P54_02 | 3835 (-) | 0.983 | 0.971 | aaatTTCCAttct V$CETS1P54_02 | 3852 (-) | 0.960 | 0.963 | aaatATCCAgtac V$CETS1P54_02 | 4622 (-) | 0.971 | 0.960 | caatATCCTttgg V$CETS1P54_02 | 4686 (+) | 0.993 | 0.959 | tggcAGGAAactt V$CETS1P54_02 | 4960 (+) | 0.993 | 0.974 | cagcAGGAAgtca V$CETS1P54_02 | 5119 (+) | 0.993 | 0.962 | caaaAGGAAacag V$CETS1P54_02 | 5471 (-) | 0.993 | 0.980 | ctatTTCCTgatt V$CETS1P54_02 | 5729 (+) | 0.983 | 0.963 | aaacTGGAAgcca V$CETS1P54_02 | 6087 (-) | 0.993 | 0.982 | tgagTTCCTgtgt V$CETS1P54_02 | 6459 (-) | 0.993 | 0.979 | ttatTTCCTgata V$CETS1P54_02 | 7017 (-) | 0.993 | 0.958 | atgtTTCCTgcat V$CETS1P54_02 | 7228 (+) | 0.993 | 0.978 | cagcAGGAAgttc V$CETS1P54_02 | 7245 (+) | 0.993 | 0.965 | tctcAGGAAaaag V$CETS1P54_02 | 8080 (-) | 0.993 | 0.971 | aaacTTCCTatac V$CETS1P54_02 | 8157 (-) | 0.983 | 0.957 | 
tagcTTCCAgagg V$CETS1P54_02 | 8909 (+) | 0.993 | 0.956 | attcAGGAAtact V$CETS1P54_02 | 8992 (-) | 0.983 | 0.957 | ttaaTTCCAgcac V$CETS1P54_02 | 9061 (-) | 0.983 | 0.960 | caagTTCCAggct V$CETS1P54_02 | 9433 (+) | 0.993 | 0.971 | taacAGGAAgccc V$CETS1P54_02 | 9466 (+) | 0.971 | 0.972 | gtacAGGATgttc V$CETS1P54_02 | 9872 (+) | 0.993 | 0.961 | aatgAGGAAatca V$CETS1P54_02 | 10255 (+) | 0.993 | 0.974 | gtgcAGGAAgtga V$CETS1P54_02 | 10587 (-) | 0.993 | 0.971 | caatTTCCTatat V$CETS1P54_02 | 10842 (+) | 0.993 | 0.960 | tcgcAGGAAgaag V$CETS1P54_02 | 11269 (+) | 0.993 | 0.981 | gaaaAGGAAatta V$GATA1_01 | 82 (+) | 1.000 | 0.990 | gttGATAGgg V$GATA1_01 | 91 (-) | 1.000 | 0.990 | gcCTATCttc V$GATA1_01 | 137 (-) | 0.997 | 0.982 | gtCAATCcag V$GATA1_01 | 536 (+) | 0.993 | 0.989 | ggtGATGCgc V$GATA1_01 | 608 (-) | 0.990 | 0.982 | aaACATCctg V$GATA1_01 | 1113 (+) | 0.987 | 0.982 | gctGATTTgt V$GATA1_01 | 1177 (+) | 1.000 | 0.995 | ggaGATGGgc V$GATA1_01 | 1239 (-) | 0.990 | 0.983 | tgACATCagg V$GATA1_01 | 1362 (+) | 0.993 | 0.988 | cctGATGCtc V$GATA1_01 | 1668 (+) | 0.990 | 0.985 | ggaGATTCct V$GATA1_01 | 1805 (-) | 0.988 | 0.981 | atTCATCagg V$GATA1_01 | 1995 (-) | 0.988 | 0.982 | gaTTATCtgg V$GATA1_01 | 2007 (-) | 0.990 | 0.982 | atACATCtcc V$GATA1_01 | 2933 (+) | 0.997 | 0.992 | gctGATTGcg V$GATA1_01 | 2971 (+) | 0.993 | 0.987 | gcaGATGCgc V$GATA1_01 | 3083 (+) | 0.990 | 0.984 | ggtGATTCta V$GATA1_01 | 3269 (+) | 0.993 | 0.986 | ggtGATGCca V$GATA1_01 | 3297 (-) | 0.990 | 0.984 | gaACATCtcc V$GATA1_01 | 3329 (+) | 0.990 | 0.984 | cctGATTCta V$GATA1_01 | 3393 (-) | 0.988 | 0.982 | gaGGATCtcc V$GATA1_01 | 3986 (-) | 1.000 | 0.983 | caCCATCtca V$GATA1_01 | 4119 (+) | 0.990 | 0.983 | ccaGATGTta V$GATA1_01 | 4243 (+) | 0.997 | 0.989 | gggGATTGta V$GATA1_01 | 4374 (-) | 0.988 | 0.983 | agTTATCtgg V$GATA1_01 | 4639 (-) | 0.987 | 0.985 | caAAATCccg V$GATA1_01 | 4739 (+) | 1.000 | 0.993 | gctGATGGtc V$GATA1_01 | 5040 (+) | 0.997 | 0.989 | cagGATTGct V$GATA1_01 | 5335 (+) | 1.000 | 0.994 | 
gccGATAGtt V$GATA1_01 | 5345 (-) | 0.993 | 0.983 | ttGCATCccc V$GATA1_01 | 5371 (+) | 0.987 | 0.984 | cggGATTTct V$GATA1_01 | 5477 (+) | 0.987 | 0.983 | cctGATTTtt V$GATA1_01 | 5527 (-) | 1.000 | 0.987 | tcCTATCcag V$GATA1_01 | 5533 (+) | 0.990 | 0.982 | ccaGATTCac V$GATA1_01 | 5841 (-) | 0.993 | 0.988 | agGCATCtcc V$GATA1_01 | 5873 (-) | 1.000 | 0.994 | gaCCATCtcc V$GATA1_01 | 5977 (+) | 0.988 | 0.984 | ggtGATAAct V$GATA1_01 | 5995 (+) | 1.000 | 0.983 | aggGATGGgg V$GATA1_01 | 6035 (-) | 1.000 | 0.996 | acCCATCtgg V$GATA1_01 | 6261 (-) | 1.000 | 0.992 | tcCCATCtgc V$GATA1_01 | 6373 (+) | 0.987 | 0.986 | cgaGATTTgg V$GATA1_01 | 6479 (+) | 0.988 | 0.982 | cagGATGAgg V$GATA1_01 | 6490 (+) | 0.997 | 0.983 | ctgGATTGca V$GATA1_01 | 6497 (+) | 0.993 | 0.989 | gcaGATGCgg V$GATA1_01 | 6753 (-) | 1.000 | 0.982 | caCTATCcct V$GATA1_01 | 6966 (+) | 1.000 | 0.983 | ctgGATGGaa V$GATA1_01 | 6996 (+) | 1.000 | 0.990 | ggaGATGGaa V$GATA1_01 | 7013 (+) | 0.990 | 0.983 | gcgGATGTtt V$GATA1_01 | 7024 (-) | 0.993 | 0.986 | ctGCATCagc V$GATA1_01 | 7152 (-) | 0.993 | 0.991 | ccGTATCtcc V$GATA1_01 | 7175 (+) | 1.000 | 0.992 | gcaGATGGtc V$GATA1_01 | 7557 (+) | 0.993 | 0.981 | gtaGATGCtg V$GATA1_01 | 7571 (-) | 0.987 | 0.984 | ccAAATCccc V$GATA1_01 | 7577 (-) | 1.000 | 0.998 | ccCCATCagg V$GATA1_01 | 7717 (-) | 0.988 | 0.982 | aaGGATCagc V$GATA1_01 | 8241 (+) | 0.988 | 0.982 | ccaGATGAag V$GATA1_01 | 8269 (+) | 0.988 | 0.985 | cctGATGAtg V$GATA1_01 | 8272 (+) | 0.993 | 0.983 | gatGATGCag V$GATA1_01 | 8706 (-) | 1.000 | 0.987 | taCCATCaag V$GATA1_01 | 9419 (-) | 1.000 | 0.981 | aaCTATCcca V$GATA1_01 | 9622 (+) | 0.995 | 0.987 | gagGATCGgg V$GATA1_01 | 9622 (-) | 0.988 | 0.983 | gaGGATCggg V$GATA1_01 | 10106 (-) | 1.000 | 0.992 | agCCATCttg V$GATA1_01 | 10114 (-) | 1.000 | 0.992 | tgCCATCccc V$GATA1_01 | 10286 (-) | 0.988 | 0.983 | tcTCATCagg V$GATA1_01 | 10792 (+) | 0.990 | 0.984 | gggGATGTga V$GATA1_01 | 10837 (-) | 1.000 | 0.982 | gcCCATCgca V$GATA1_01 | 11196 (+) | 0.997 | 0.984 | 
cttGATTGat V$GATA1_01 | 11429 (-) | 1.000 | 0.993 | agCTATCcgc V$GATA1_01 | 11547 (+) | 1.000 | 0.988 | gtaGATGGgt V$GATA1_01 | 11774 (+) | 0.993 | 0.988 | ccaGATACtt V$GATA1_01 | 11800 (+) | 0.997 | 0.991 | ccaGATTGga V$GATA2_01 | 82 (+) | 1.000 | 0.983 | gttGATAGgg V$GATA2_01 | 91 (-) | 1.000 | 0.993 | gcCTATCttc V$GATA2_01 | 307 (-) | 1.000 | 0.994 | agCTATCtct V$GATA2_01 | 566 (+) | 0.992 | 0.983 | ggaGATAAac V$GATA2_01 | 1177 (+) | 0.987 | 0.983 | ggaGATGGgc V$GATA2_01 | 1995 (-) | 0.992 | 0.986 | gaTTATCtgg V$GATA2_01 | 3178 (-) | 0.994 | 0.983 | ttATATCccc V$GATA2_01 | 3930 (-) | 0.992 | 0.984 | caTTATCagt V$GATA2_01 | 3953 (-) | 0.987 | 0.982 | agCCATCcta V$GATA2_01 | 3977 (+) | 1.000 | 0.986 | tatGATAGac V$GATA2_01 | 3986 (-) | 0.987 | 0.982 | caCCATCtca V$GATA2_01 | 4374 (-) | 0.992 | 0.990 | agTTATCtgg V$GATA2_01 | 4622 (-) | 0.994 | 0.984 | caATATCctt V$GATA2_01 | 4739 (+) | 0.987 | 0.981 | gctGATGGtc V$GATA2_01 | 5335 (+) | 1.000 | 0.995 | gccGATAGtt V$GATA2_01 | 5633 (+) | 0.992 | 0.984 | tggGATAAag V$GATA2_01 | 5873 (-) | 0.987 | 0.982 | gaCCATCtcc V$GATA2_01 | 5977 (+) | 0.992 | 0.990 | ggtGATAAct V$GATA2_01 | 6035 (-) | 0.987 | 0.982 | acCCATCtgg V$GATA2_01 | 6753 (-) | 1.000 | 0.993 | caCTATCcct V$GATA2_01 | 6947 (+) | 0.995 | 0.987 | agtGATACtg V$GATA2_01 | 7152 (-) | 0.995 | 0.992 | ccGTATCtcc V$GATA2_01 | 7175 (+) | 0.987 | 0.982 | gcaGATGGtc V$GATA2_01 | 7266 (-) | 0.994 | 0.985 | aaATATCcta V$GATA2_01 | 7512 (+) | 0.987 | 0.983 | tcaGATGGcc V$GATA2_01 | 7577 (-) | 0.987 | 0.983 | ccCCATCagg V$GATA2_01 | 7611 (-) | 0.987 | 0.981 | agCCATCtct V$GATA2_01 | 7733 (-) | 0.987 | 0.983 | ggCCATCaca V$GATA2_01 | 7788 (-) | 0.992 | 0.986 | aaTTATCcca V$GATA2_01 | 7951 (+) | 0.995 | 0.991 | tgaGATACcc V$GATA2_01 | 8062 (-) | 0.992 | 0.982 | gtTTATCtgg V$GATA2_01 | 9367 (-) | 1.000 | 0.990 | ctCTATCaca V$GATA2_01 | 9419 (-) | 1.000 | 0.994 | aaCTATCcca V$GATA2_01 | 9732 (+) | 0.992 | 0.984 | tatGATAAtg V$GATA2_01 | 9833 (+) | 0.992 | 0.984 | aggGATAAtt 
V$GATA2_01 | 10106 (-) | 0.987 | 0.982 | agCCATCttg V$GATA2_01 | 10114 (-) | 0.987 | 0.984 | tgCCATCccc V$GATA2_01 | 11369 (-) | 1.000 | 0.985 | ttCTATCtgt V$GATA2_01 | 11429 (-) | 1.000 | 0.999 | agCTATCcgc V$GATA2_01 | 11774 (+) | 0.995 | 0.989 | ccaGATACtt V$GATA3_01 | 83 (+) | 1.000 | 0.993 | ttGATAGgg V$GATA3_01 | 6753 (-) | 1.000 | 0.992 | caCTATCcc V$GATA3_01 | 7798 (+) | 0.985 | 0.979 | aaGATAAag V$GATA3_01 | 9367 (-) | 1.000 | 0.992 | ctCTATCac V$EVI1_01 | 2178 (-) | 0.889 | 0.813 | ccttcttTTCTTctct V$EVI1_01 | 2791 (-) | 0.922 | 0.823 | ttagctgGTCTTctct V$EVI1_01 | 5652 (+) | 0.889 | 0.776 | aaagAAGAAaagaaca V$EVI1_04 | 11572 (-) | 0.907 | 0.783 | tATCATtgcttatat V$MZF1_01 | 1002 (+) | 1.000 | 0.996 | tgtGGGGA V$MZF1_01 | 1333 (-) | 1.000 | 0.979 | TCCCCagg V$MZF1_01 | 1497 (+) | 1.000 | 0.995 | gggGGGGA V$MZF1_01 | 2432 (+) | 1.000 | 0.983 | gttGGGGA V$MZF1_01 | 3183 (-) | 1.000 | 0.996 | TCCCCcct V$MZF1_01 | 3781 (-) | 1.000 | 0.996 | TCCCCcct V$MZF1_01 | 4045 (+) | 1.000 | 0.979 | cagGGGGA V$MZF1_01 | 4240 (+) | 1.000 | 0.981 | gctGGGGA V$MZF1_01 | 4752 (-) | 1.000 | 0.994 | TCCCCtct V$MZF1_01 | 5350 (-) | 1.000 | 0.994 | TCCCCtct V$MZF1_01 | 5461 (-) | 1.000 | 0.996 | TCCCCaca V$MZF1_01 | 5490 (-) | 1.000 | 0.980 | TCCCCttc V$MZF1_01 | 5920 (-) | 1.000 | 0.981 | TCCCCagc V$MZF1_01 | 6805 (-) | 1.000 | 0.990 | TCCCCtca V$MZF1_01 | 7576 (-) | 1.000 | 0.985 | TCCCCatc V$MZF1_01 | 8204 (-) | 1.000 | 0.994 | TCCCCtct V$MZF1_01 | 9514 (-) | 1.000 | 0.983 | TCCCCatg V$MZF1_01 | 9977 (+) | 1.000 | 0.983 | tatGGGGA V$MZF1_01 | 10119 (-) | 1.000 | 0.985 | TCCCCaat V$MZF1_01 | 10789 (+) | 1.000 | 0.979 | cagGGGGA V$MZF1_02 | 1176 (+) | 0.898 | 0.873 | aggagATGGGcac V$MZF1_02 | 1488 (+) | 1.000 | 0.897 | gagggAGGGGggg V$MZF1_02 | 2639 (-) | 1.000 | 0.856 | ctaCCCCTgacct V$MZF1_02 | 3183 (-) | 1.000 | 0.916 | tccCCCCTctctt V$MZF1_02 | 3890 (-) | 0.857 | 0.877 | ttgACCCTcatca V$MZF1_02 | 4041 (+) | 1.000 | 0.876 | ccagcAGGGGgaa V$MZF1_02 | 4301 (-) | 0.857 | 0.869 
| ttgACCCTcaata V$MZF1_02 | 4750 (-) | 1.000 | 0.904 | tctCCCCTctgta V$MZF1_02 | 5348 (-) | 1.000 | 0.871 | catCCCCTctttg V$MZF1_02 | 5489 (-) | 0.875 | 0.862 | atcCCCTTccctc V$MZF1_02 | 5994 (+) | 0.898 | 0.860 | tagggATGGGgta V$MZF1_02 | 6803 (-) | 1.000 | 0.881 | cctCCCCTcatta V$MZF1_02 | 8141 (+) | 0.844 | 0.858 | tgctgAGGGAaaa V$MZF1_02 | 8202 (-) | 1.000 | 0.970 | tttCCCCTctgac V$MZF1_02 | 8301 (-) | 1.000 | 0.937 | atgCCCCTctgcc V$MZF1_02 | 9008 (+) | 1.000 | 0.866 | gaggcAGGGGcag V$MZF1_02 | 9679 (+) | 1.000 | 0.876 | agtggAGGGGggg V$MZF1_02 | 9924 (+) | 0.898 | 0.859 | tgcctATGGGtaa V$MZF1_02 | 11546 (+) | 0.898 | 0.863 | ggtagATGGGtat V$MZF1_02 | 11611 (+) | 1.000 | 0.866 | gagaaAGGGGtac V$ZID_01 | 1851 (-) | 1.000 | 0.898 | cacaataGAGCCa V$IK1_01 | 2378 (-) | 1.000 | 0.955 | agttTTCCCagag V$IK1_01 | 3134 (-) | 1.000 | 0.966 | tacaTTCCCacaa V$IK1_01 | 3240 (-) | 1.000 | 0.948 | aacaTTCCCagcc V$IK1_01 | 6176 (-) | 1.000 | 0.946 | agggTTCCCagag V$IK1_01 | 8649 (-) | 1.000 | 0.955 | acaaTTCCCaatg V$IK1_01 | 11083 (+) | 1.000 | 0.959 | ctctGGGAAtttt V$IK2_01 | 163 (-) | 1.000 | 0.986 | ccgtTCCCAatg V$IK2_01 | 191 (-) | 1.000 | 0.958 | tgccTCCCAcac V$IK2_01 | 745 (+) | 1.000 | 0.978 | tgcTGGGAattg V$IK2_01 | 888 (+) | 1.000 | 0.962 | aaaTGGGAcaga V$IK2_01 | 1276 (+) | 1.000 | 0.983 | ctcTGGGAtgac V$IK2_01 | 1821 (+) | 0.969 | 0.962 | ttcAGGGAagac V$IK2_01 | 1882 (-) | 1.000 | 0.955 | tgccTCCCAttt V$IK2_01 | 2021 (+) | 1.000 | 0.973 | tgcTGGGAttaa V$IK2_01 | 2379 (-) | 1.000 | 0.993 | gtttTCCCAgag V$IK2_01 | 2712 (-) | 1.000 | 0.986 | ctctTCCCAaat V$IK2_01 | 3135 (-) | 1.000 | 0.979 | acatTCCCAcaa V$IK2_01 | 3241 (-) | 1.000 | 0.979 | acatTCCCAgcc V$IK2_01 | 3283 (+) | 1.000 | 0.977 | gatTGGGAtgag V$IK2_01 | 4113 (-) | 1.000 | 0.980 | atttTCCCAgat V$IK2_01 | 4150 (-) | 1.000 | 0.955 | tgccTCCCAttt V$IK2_01 | 4761 (-) | 1.000 | 0.961 | taggTCCCAgca V$IK2_01 | 4830 (+) | 0.969 | 0.959 | ggtAGGGAacga V$IK2_01 | 4945 (+) | 1.000 | 0.960 | aacTGGGActtt V$IK2_01 | 
5028 (+) | 1.000 | 0.969 | gacTGGGAtgct V$IK2_01 | 5229 (+) | 1.000 | 0.983 | gacTGGGAacga V$IK2_01 | 5368 (+) | 0.978 | 0.959 | acaCGGGAtttc V$IK2_01 | 5510 (-) | 0.969 | 0.962 | ggttTCCCTgcc V$IK2_01 | 5630 (+) | 1.000 | 0.975 | cacTGGGAtaaa V$IK2_01 | 5710 (-) | 1.000 | 0.983 | ggaaTCCCAcaa V$IK2_01 | 5862 (-) | 1.000 | 0.988 | ggttTCCCAtgg V$IK2_01 | 6177 (-) | 1.000 | 0.992 | gggtTCCCAgag V$IK2_01 | 6257 (-) | 1.000 | 0.961 | tttgTCCCAtct V$IK2_01 | 7446 (+) | 1.000 | 0.955 | gtgTGGGAgtgt V$IK2_01 | 7698 (+) | 1.000 | 0.961 | actTGGGAggta V$IK2_01 | 7789 (-) | 1.000 | 0.977 | attaTCCCAaag V$IK2_01 | 8143 (+) | 0.969 | 0.964 | ctgAGGGAaaac V$IK2_01 | 8229 (-) | 1.000 | 0.958 | tggcTCCCAcac V$IK2_01 | 8650 (-) | 1.000 | 0.987 | caatTCCCAatg V$IK2_01 | 8760 (-) | 1.000 | 0.963 | gtccTCCCAgca V$IK2_01 | 8771 (+) | 1.000 | 0.963 | accTGGGAggcc V$IK2_01 | 9120 (+) | 0.978 | 0.971 | aacCGGGAaaac V$IK2_01 | 9257 (+) | 1.000 | 0.956 | tcaTGGGAcagg V$IK2_01 | 9420 (-) | 1.000 | 0.968 | actaTCCCAtaa V$IK2_01 | 9592 (-) | 1.000 | 0.981 | gtaaTCCCAgca V$IK2_01 | 9603 (+) | 1.000 | 0.985 | attTGGGAagct V$IK2_01 | 10005 (+) | 1.000 | 0.962 | ctcTGGGActgg V$IK2_01 | 10611 (+) | 1.000 | 0.958 | gtgTGGGAggga V$IK2_01 | 10877 (+) | 1.000 | 0.963 | tggTGGGAgcac V$IK2_01 | 11083 (+) | 1.000 | 0.980 | ctcTGGGAattt V$IK2_01 | 11532 (-) | 1.000 | 0.972 | caaaTCCCAcat V$IK2_01 | 11794 (-) | 1.000 | 0.972 | caaaTCCCAgat V$IK3_01 | 162 (-) | 1.000 | 0.794 | tccgTTCCCaatg V$IK3_01 | 326 (+) | 0.887 | 0.737 | gtgcTGGAAattc V$IK3_01 | 327 (+) | 0.778 | 0.815 | tgctGGAAAttcc V$IK3_01 | 332 (-) | 1.000 | 0.870 | gaaaTTCCCtcag V$IK3_01 | 745 (+) | 1.000 | 0.871 | tgctGGGAAttga V$IK3_01 | 923 (-) | 0.887 | 0.746 | gcacTTCCAcgaa V$IK3_01 | 942 (+) | 0.915 | 0.784 | agttAGGAAcaat V$IK3_01 | 1132 (-) | 0.915 | 0.845 | tgtgTTCCTaaaa V$IK3_01 | 1189 (+) | 0.915 | 0.773 | ccccAGGAAagct V$IK3_01 | 1203 (-) | 0.778 | 0.733 | gttaTACCCattg V$IK3_01 | 1558 (+) | 0.887 | 0.742 | cttcTGGAAatct V$IK3_01 | 
1570 (+) | 1.000 | 0.904 | tacgGGGAAtgca V$IK3_01 | 1637 (-) | 0.915 | 0.763 | gcctTTCCTagtg V$IK3_01 | 1669 (-) | 0.915 | 0.797 | gagaTTCCTcctt V$IK3_01 | 1737 (-) | 0.915 | 0.753 | gaacTTCCTatag V$IK3_01 | 1793 (-) | 0.778 | 0.762 | gccaTTCACagga V$IK3_01 | 1821 (+) | 1.000 | 0.873 | ttcaGGGAAgaca V$IK3_01 | 2021 (+) | 0.778 | 0.743 | tgctGGGATtaaa V$IK3_01 | 2378 (-) | 1.000 | 0.873 | agttTTCCCagag V$IK3_01 | 2399 (-) | 0.915 | 0.781 | tgctTTCCTtaaa V$IK3_01 | 2509 (+) | 1.000 | 0.771 | aaaaGGGAAaaaa V$IK3_01 | 2536 (+) | 0.778 | 0.767 | tttaGAGAAtaac V$IK3_01 | 2562 (-) | 0.778 | 0.737 | gaaaTTTCCgtaa V$IK3_01 | 2711 (-) | 1.000 | 0.796 | cctcTTCCCaaat V$IK3_01 | 2766 (-) | 0.887 | 0.735 | tagaTTCCGcatg V$IK3_01 | 2783 (-) | 0.915 | 0.754 | attcTTCCTtagc V$IK3_01 | 2846 (-) | 0.887 | 0.754 | atcaTTCCGtcct V$IK3_01 | 3134 (-) | 1.000 | 0.872 | tacaTTCCCacaa V$IK3_01 | 3156 (-) | 0.887 | 0.750 | gaggTTCCAttcc V$IK3_01 | 3240 (-) | 1.000 | 0.892 | aacaTTCCCagcc V$IK3_01 | 3336 (-) | 1.000 | 0.747 | ctatTTCCCtcct V$IK3_01 | 3676 (-) | 0.778 | 0.753 | agaaTTGCCaagt V$IK3_01 | 3776 (-) | 1.000 | 0.738 | ctccTTCCCccct V$IK3_01 | 4024 (+) | 0.887 | 0.747 | ctcaTGGAAttaa V$IK3_01 | 4035 (-) | 0.887 | 0.785 | aataTTCCAgcag V$IK3_01 | 4112 (-) | 1.000 | 0.822 | cattTTCCCagat V$IK3_01 | 4161 (-) | 0.693 | 0.735 | tgtaATCCTagta V$IK3_01 | 4392 (-) | 0.887 | 0.789 | tgagTTCCAagac V$IK3_01 | 4686 (+) | 0.915 | 0.737 | tggcAGGAAactt V$IK3_01 | 4830 (+) | 1.000 | 0.794 | ggtaGGGAAcgag V$IK3_01 | 4880 (+) | 0.887 | 0.772 | gctcTGGAAcatt V$IK3_01 | 4887 (-) | 0.887 | 0.789 | aacaTTCCAtgtt V$IK3_01 | 4894 (-) | 0.915 | 0.782 | catgTTCCTtggc V$IK3_01 | 4945 (+) | 0.778 | 0.753 | aactGGGACtttc V$IK3_01 | 4960 (+) | 0.915 | 0.734 | cagcAGGAAgtca V$IK3_01 | 5006 (-) | 0.778 | 0.737 | tgtaTTACCcatt V$IK3_01 | 5229 (+) | 1.000 | 0.837 | gactGGGAAcgat V$IK3_01 | 5421 (-) | 1.000 | 0.795 | atctTTCCCtcca V$IK3_01 | 5441 (-) | 0.778 | 0.744 | tgtaTTGCCatat V$IK3_01 | 5456 (-) | 1.000 | 
0.796 | ccttTTCCCcaca V$IK3_01 | 5491 (-) | 1.000 | 0.740 | ccccTTCCCtctg V$IK3_01 | 5509 (-) | 1.000 | 0.838 | aggtTTCCCtgcc V$IK3_01 | 5519 (-) | 0.915 | 0.750 | gcctTTCCTccta V$IK3_01 | 5705 (+) | 0.915 | 0.871 | acttAGGAAtccc V$IK3_01 | 5709 (-) | 0.778 | 0.758 | aggaATCCCacaa V$IK3_01 | 5861 (-) | 1.000 | 0.796 | tggtTTCCCatgg V$IK3_01 | 5915 (-) | 1.000 | 0.751 | ttgtTTCCCcagc V$IK3_01 | 6087 (-) | 0.915 | 0.760 | tgagTTCCTgtgt V$IK3_01 | 6137 (-) | 0.887 | 0.773 | cgtcTTCCAtgcc V$IK3_01 | 6176 (-) | 1.000 | 0.863 | agggTTCCCagag V$IK3_01 | 6201 (+) | 0.778 | 0.776 | gccgGGGTAtgcc V$IK3_01 | 6206 (-) | 0.665 | 0.736 | ggtaTGCCAaggt V$IK3_01 | 6273 (+) | 0.915 | 0.751 | aagaAGGAAgcct V$IK3_01 | 6315 (-) | 1.000 | 0.774 | ctacTTCCCtgac V$IK3_01 | 6797 (-) | 0.915 | 0.757 | gctcTTCCTcccc V$IK3_01 | 6875 (+) | 0.915 | 0.761 | ttatAGGAAgcac V$IK3_01 | 6916 (+) | 0.915 | 0.771 | aacaAGGAAttga V$IK3_01 | 6972 (+) | 0.915 | 0.766 | ggaaAGGAAcgac V$IK3_01 | 7016 (-) | 0.778 | 0.750 | gatgTTTCCtgca V$IK3_01 | 7228 (+) | 0.915 | 0.743 | cagcAGGAAgttc V$IK3_01 | 7245 (+) | 0.915 | 0.749 | tctcAGGAAaaag V$IK3_01 | 7326 (+) | 0.887 | 0.742 | cgtcTGGAAcaat V$IK3_01 | 7952 (-) | 0.778 | 0.737 | gagaTACCCcaga V$IK3_01 | 7982 (+) | 1.000 | 0.819 | tcaaGGGAAggac V$IK3_01 | 8070 (-) | 0.665 | 0.737 | ggtaTTGCAgaaa V$IK3_01 | 8080 (-) | 0.915 | 0.739 | aaacTTCCTatac V$IK3_01 | 8143 (+) | 1.000 | 0.822 | ctgaGGGAAaaca V$IK3_01 | 8608 (-) | 0.887 | 0.779 | atcaTTCCGtcaa V$IK3_01 | 8649 (-) | 1.000 | 0.849 | acaaTTCCCaatg V$IK3_01 | 8676 (+) | 0.915 | 0.741 | ggttAGGAAggta V$IK3_01 | 8909 (+) | 0.915 | 0.860 | attcAGGAAtact V$IK3_01 | 8918 (+) | 0.887 | 0.846 | tactCGGAAttca V$IK3_01 | 8992 (-) | 0.887 | 0.740 | ttaaTTCCAgcac V$IK3_01 | 9120 (+) | 1.000 | 0.893 | aaccGGGAAaacc V$IK3_01 | 9159 (-) | 0.915 | 0.819 | tgagTTCCTaaga V$IK3_01 | 9300 (-) | 0.778 | 0.756 | tgtgTGCCCagga V$IK3_01 | 9361 (-) | 0.778 | 0.734 | agtgTTCTCtatc V$IK3_01 | 9391 (+) | 0.887 | 0.760 | ccccTGGAAtgtg 
V$IK3_01 | 9433 (+) | 0.915 | 0.798 | taacAGGAAgccc V$IK3_01 | 9509 (-) | 1.000 | 0.769 | aagtTTCCCcatg V$IK3_01 | 9591 (-) | 0.778 | 0.795 | tgtaATCCCagca V$IK3_01 | 9603 (+) | 1.000 | 0.779 | atttGGGAAgctg V$IK3_01 | 9872 (+) | 0.915 | 0.743 | aatgAGGAAatca V$IK3_01 | 10115 (-) | 0.778 | 0.755 | gccaTCCCCaata V$IK3_01 | 10636 (+) | 0.915 | 0.769 | actgAGGAAgaca V$IK3_01 | 10688 (+) | 0.778 | 0.770 | ctctGGCAAcacc V$IK3_01 | 10842 (+) | 0.915 | 0.737 | tcgcAGGAAgaag V$IK3_01 | 10877 (+) | 0.778 | 0.754 | tggtGGGAGcact V$IK3_01 | 10991 (+) | 0.887 | 0.749 | gcccTGGAAtcag V$IK3_01 | 11083 (+) | 1.000 | 0.875 | ctctGGGAAtttt V$IK3_01 | 11479 (-) | 1.000 | 0.811 | gtttTTCCCttgg V$IK3_01 | 11485 (+) | 0.887 | 0.814 | ccctTGGAAtgcg V$IK3_01 | 11768 (-) | 0.887 | 0.831 | aggaTTCCAgata P$ATHB1_01 | 4057 (-) | 1.000 | 0.978 | ggcaATAATtaaaa I$ABDB_01 | 46 (-) | 0.769 | 0.802 | gatgtcATTACttg I$ABDB_01 | 58 (-) | 0.745 | 0.819 | tgggccATAATtac I$ABDB_01 | 174 (-) | 0.769 | 0.793 | gctggcATTACtac I$ABDB_01 | 385 (-) | 0.799 | 0.853 | gatgccATTCAttt I$ABDB_01 | 396 (-) | 0.684 | 0.793 | tttgccAACAAggc I$ABDB_01 | 564 (-) | 1.000 | 0.842 | taggagATAAAcgt I$ABDB_01 | 618 (+) | 0.771 | 0.817 | tcgTACATgacact I$ABDB_01 | 1790 (-) | 0.799 | 0.853 | cctgccATTCAcag I$ABDB_01 | 1840 (-) | 0.813 | 0.835 | ggggtcGTCAAcac I$ABDB_01 | 1975 (-) | 0.779 | 0.812 | gttggcCTCAAact I$ABDB_01 | 2022 (-) | 0.960 | 0.816 | gctgggATTAAagg I$ABDB_01 | 2661 (+) | 0.951 | 0.936 | actTTGATgacact I$ABDB_01 | 2736 (+) | 1.000 | 0.954 | gccTTTATgccttt I$ABDB_01 | 2979 (-) | 0.790 | 0.806 | gcaggcATCTActt I$ABDB_01 | 3272 (-) | 0.705 | 0.792 | gatgccATTATgat I$ABDB_01 | 3401 (-) | 1.000 | 0.856 | ccagttATAAAagt I$ABDB_01 | 3472 (+) | 0.862 | 0.819 | tgaTTTACtgccta I$ABDB_01 | 3843 (-) | 1.000 | 0.807 | attctcATAAAata I$ABDB_01 | 3902 (-) | 0.960 | 0.802 | acaggtATTAAtaa I$ABDB_01 | 3916 (+) | 1.000 | 0.789 | agcTTTATgaatta I$ABDB_01 | 4026 (-) | 0.960 | 0.831 | catggaATTAAtat I$ABDB_01 | 4199 (+) | 0.789 
| 0.846 | aagTTCAAggcttg I$ABDB_01 | 4332 (-) | 0.790 | 0.819 | ggtgacATCCAggt I$ABDB_01 | 4453 (+) | 0.932 | 0.814 | ataTTCATttcaca I$ABDB_01 | 4694 (+) | 0.951 | 0.909 | aacTTGATgtccta I$ABDB_01 | 5010 (-) | 0.960 | 0.824 | ttacccATTAAgcc I$ABDB_01 | 5298 (-) | 0.790 | 0.847 | ctagccATCCAaaa I$ABDB_01 | 5438 (+) | 0.839 | 0.798 | attTGTATtgccat I$ABDB_01 | 5444 (-) | 0.839 | 0.893 | attgccATATAacc I$ABDB_01 | 5483 (+) | 0.951 | 0.794 | tttTTGATcccctt I$ABDB_01 | 5631 (-) | 1.000 | 0.831 | actgggATAAAgac I$ABDB_01 | 5735 (-) | 0.745 | 0.815 | gaagccATAATata I$ABDB_01 | 6139 (+) | 0.758 | 0.795 | tctTCCATgcctct I$ABDB_01 | 6193 (+) | 0.857 | 0.861 | ccaTTTAAgccggg I$ABDB_01 | 6444 (+) | 0.741 | 0.829 | ccaGTCATggcaaa I$ABDB_01 | 6759 (+) | 1.000 | 0.840 | cccTTTATaccatg I$ABDB_01 | 6825 (+) | 1.000 | 0.791 | tttTTTATgtgcta I$ABDB_01 | 6836 (+) | 0.862 | 0.811 | ctaTTTACtgccaa I$ABDB_01 | 6900 (-) | 0.754 | 0.793 | gagggcATAAGtgg I$ABDB_01 | 7200 (-) | 0.932 | 0.828 | ggtgctATGAAttg I$ABDB_01 | 7438 (+) | 1.000 | 0.793 | tatTTTATgtgtgg I$ABDB_01 | 7656 (+) | 0.839 | 0.835 | tttTGTATgccagg I$ABDB_01 | 7681 (-) | 0.862 | 0.795 | acagctGTAAAgcc I$ABDB_01 | 7731 (-) | 0.759 | 0.840 | tgggccATCACagg I$ABDB_01 | 7920 (-) | 0.951 | 0.799 | ttaggtATCAAttc I$ABDB_01 | 8258 (-) | 0.786 | 0.815 | cgtggcATTGAcct I$ABDB_01 | 8466 (+) | 0.683 | 0.790 | tcaTCTAAggcaaa I$ABDB_01 | 8868 (-) | 0.826 | 0.827 | tctggcATAGAata I$ABDB_01 | 8989 (+) | 0.960 | 0.844 | cctTTAATtccagc I$ABDB_01 | 9051 (+) | 0.771 | 0.838 | gtcTACATggcaag I$ABDB_01 | 9403 (+) | 0.826 | 0.842 | ggtTCTATgtctag I$ABDB_01 | 9422 (-) | 1.000 | 0.833 | tatcccATAAAtaa I$ABDB_01 | 9717 (+) | 0.951 | 0.797 | aaaTTGATctcttt I$ABDB_01 | 9772 (-) | 0.826 | 0.879 | catgccATAGAaca I$ABDB_01 | 9840 (+) | 0.816 | 0.823 | attTTAAAgtcagg I$ABDB_01 | 9857 (-) | 0.857 | 0.806 | gcagcaTTAAAgaa I$ABDB_01 | 9875 (-) | 0.951 | 0.826 | gaggaaATCAAttt I$ABDB_01 | 9897 (+) | 0.816 | 0.831 | tttTTAAAgactta I$ABDB_01 | 10214 (-) | 0.790 
| 0.816 | agtggcATCTAaga I$ABDB_01 | 10450 (+) | 1.000 | 0.842 | tctTTTATatcaac I$ABDB_01 | 10507 (+) | 0.745 | 0.829 | tcaATTATggctgt I$ABDB_01 | 10519 (-) | 0.705 | 0.806 | gtagccATTATagg I$ABDB_01 | 11175 (+) | 0.733 | 0.812 | atcTTTTTggctgc I$ABDB_01 | 11272 (-) | 0.960 | 0.844 | aaggaaATTAAggc I$ABDB_01 | 11338 (+) | 0.799 | 0.810 | atgTAAATgccatt I$ABDB_01 | 11343 (-) | 0.705 | 0.802 | aatgccATTATcga I$ABDB_01 | 11667 (+) | 0.932 | 0.800 | attTTCATaaccaa I$ABDB_01 | 11693 (+) | 0.960 | 0.817 | aagTTAATcactgt I$BRCZ1_01 | 2407 (+) | 1.000 | 0.902 | ttaaaaaaaACAAAaaat I$BRCZ1_01 | 5792 (+) | 1.000 | 0.894 | taattaaatACAAAaaaa I$BRCZ1_01 | 6578 (+) | 1.000 | 0.899 | atgcaaaacACAAAacaa I$BRCZ1_01 | 7426 (-) | 1.000 | 0.912 | tttaTTTGTttttatttt I$BRCZ1_01 | 7769 (+) | 1.000 | 0.897 | tctcaaaaaACAAAacaa I$BRCZ1_01 | 11472 (-) | 1.000 | 0.914 | tttaTTTGTttttccctt I$BRCZ2_01 | 311 (+) | 0.948 | 0.865 | atctctcCTAGTcctg I$BRCZ2_01 | 1637 (+) | 0.948 | 0.871 | gcctttcCTAGTgcta I$BRCZ2_01 | 1648 (+) | 0.940 | 0.852 | tgctaatCTACTtggc I$BRCZ2_01 | 2283 (+) | 0.948 | 0.945 | ttctcttCTAGTtttt I$BRCZ2_01 | 2717 (-) | 1.000 | 0.870 | cccaAATAGataatct I$BRCZ2_01 | 3083 (+) | 1.000 | 0.933 | ggtgattCTATTtttc I$BRCZ2_01 | 3101 (+) | 0.940 | 0.850 | atctataCTAATttcc I$BRCZ2_01 | 3329 (+) | 1.000 | 0.856 | cctgattCTATTtccc I$BRCZ2_01 | 3365 (-) | 1.000 | 0.851 | actgAATAGcctaatc I$BRCZ2_01 | 3587 (+) | 0.887 | 0.858 | aactctcCTATGtttg I$BRCZ2_01 | 3738 (+) | 0.841 | 0.850 | attatttCTACAtttt I$BRCZ2_01 | 3800 (-) | 1.000 | 0.907 | acagAATAGaaaaaac I$BRCZ2_01 | 4079 (+) | 0.901 | 0.901 | atgtataCTATAtttt I$BRCZ2_01 | 4161 (+) | 0.948 | 0.850 | tgtaatcCTAGTattc I$BRCZ2_01 | 5464 (+) | 1.000 | 0.854 | ccacataCTATTtcct I$BRCZ2_01 | 5809 (-) | 1.000 | 0.925 | aaaaAATAGccctggc I$BRCZ2_01 | 6247 (+) | 1.000 | 0.929 | cggctttCTATTtgtc I$BRCZ2_01 | 6600 (+) | 1.000 | 0.875 | ggtggtgCTATTtgct I$BRCZ2_01 | 6829 (+) | 1.000 | 0.883 | ttatgtgCTATTtact I$BRCZ2_01 | 7042 (+) | 1.000 | 0.957 | 
tagttgaCTATTtttt I$BRCZ2_01 | 7217 (-) | 0.940 | 0.907 | aaaaAGTAGgccagca I$BRCZ2_01 | 7332 (-) | 1.000 | 0.871 | gaacAATAGacacatt I$BRCZ2_01 | 8952 (-) | 0.940 | 0.879 | taaaATTAGtagggag I$BRCZ2_01 | 9527 (-) | 0.948 | 0.875 | caagACTAGaacaaac I$BRCZ2_01 | 10126 (-) | 0.940 | 0.907 | taaaATTAGttaagaa I$BRCZ2_01 | 10916 (-) | 1.000 | 0.903 | aaatAATAGtctagcg I$BRCZ2_01 | 11416 (+) | 1.000 | 0.905 | tgcatttCTATTtagc I$BRCZ2_01 | 11440 (+) | 1.000 | 0.901 | tacttatCTATTttgg I$BRCZ2_01 | 11748 (-) | 0.948 | 0.882 | aatgACTAGaaagagt I$BRCZ2_01 | 11897 (-) | 1.000 | 0.897 | aaccAATAGcacgaag I$BRCZ3_01 | 835 (+) | 0.939 | 0.906 | tataaAACAAgttaa I$BRCZ3_01 | 1034 (+) | 0.939 | 0.853 | gaatcAACAAagcag I$BRCZ3_01 | 1089 (+) | 1.000 | 0.908 | tggtcAACTAaaatt I$BRCZ3_01 | 1144 (-) | 0.851 | 0.846 | accgtTACTTtatgt I$BRCZ3_01 | 1153 (-) | 1.000 | 0.854 | ttatgTAGTTctttt I$BRCZ3_01 | 1895 (-) | 0.834 | 0.861 | atattTATTTtatta I$BRCZ3_01 | 1903 (-) | 0.834 | 0.855 | tttatTATTTtattt I$BRCZ3_01 | 2286 (-) | 1.000 | 0.953 | tcttcTAGTTtttgt I$BRCZ3_01 | 2410 (+) | 0.939 | 0.924 | aaaaaAACAAaaaat I$BRCZ3_01 | 2601 (+) | 0.879 | 0.887 | tattaAACCAagagc I$BRCZ3_01 | 3101 (+) | 0.849 | 0.850 | atctaTACTAatttc I$BRCZ3_01 | 3124 (-) | 0.851 | 0.856 | tatatTAATTtacat I$BRCZ3_01 | 3254 (-) | 0.879 | 0.855 | acttaTGGTTttctt I$BRCZ3_01 | 3305 (-) | 1.000 | 0.847 | ccatgTAGTTaagat I$BRCZ3_01 | 3656 (+) | 0.851 | 0.863 | cagtaAAGTAacaca I$BRCZ3_01 | 3994 (+) | 0.939 | 0.868 | catgaAACAAcgata I$BRCZ3_01 | 4487 (+) | 0.939 | 0.846 | ttttgAACAAaaaac I$BRCZ3_01 | 4494 (+) | 0.939 | 0.927 | caaaaAACAAaacaa I$BRCZ3_01 | 4499 (+) | 0.939 | 0.927 | aacaaAACAAaacaa I$BRCZ3_01 | 4504 (+) | 0.939 | 0.892 | aacaaAACAAcatca I$BRCZ3_01 | 4545 (-) | 1.000 | 0.947 | cctctTAGTTttgat I$BRCZ3_01 | 4614 (+) | 0.879 | 0.857 | ggtaaAACCAatatc I$BRCZ3_01 | 4661 (-) | 0.939 | 0.903 | ggactTTGTTttgct I$BRCZ3_01 | 4673 (-) | 1.000 | 0.854 | gctcaTAGTTctgtg I$BRCZ3_01 | 5335 (-) | 1.000 | 0.927 | gccgaTAGTTttgca 
I$BRCZ3_01 | 5378 (-) | 1.000 | 0.979 | tctttTAGTTttatt I$BRCZ3_01 | 5384 (-) | 0.834 | 0.852 | agtttTATTTtacct I$BRCZ3_01 | 5561 (+) | 0.939 | 0.873 | tagaaAACAAccagg I$BRCZ3_01 | 5610 (+) | 0.879 | 0.850 | caagaAACCAacatc I$BRCZ3_01 | 5690 (-) | 1.000 | 0.873 | cccacTAGTTcatac I$BRCZ3_01 | 5832 (+) | 0.939 | 0.880 | tgtgaAACAAggcat I$BRCZ3_01 | 5856 (-) | 0.879 | 0.847 | tgagtTGGTTtccca I$BRCZ3_01 | 5906 (-) | 1.000 | 0.917 | aagagTAGTTtgttt I$BRCZ3_01 | 5910 (-) | 0.939 | 0.889 | gtagtTTGTTtcccc I$BRCZ3_01 | 5940 (+) | 0.851 | 0.853 | gagaaAAGTAaaatt I$BRCZ3_01 | 6120 (-) | 0.939 | 0.897 | agggcTTGTTttctc I$BRCZ3_01 | 6586 (+) | 0.939 | 0.910 | cacaaAACAAataag I$BRCZ3_01 | 7037 (-) | 1.000 | 0.903 | agagtTAGTTgacta I$BRCZ3_01 | 7061 (-) | 0.939 | 0.903 | ttaatTTGTTtttgt I$BRCZ3_01 | 7067 (-) | 0.939 | 0.904 | tgtttTTGTTtgtgt I$BRCZ3_01 | 7209 (+) | 0.939 | 0.861 | aattgAACAAaaagt I$BRCZ3_01 | 7426 (-) | 0.939 | 0.907 | tttatTTGTTtttat I$BRCZ3_01 | 7564 (+) | 0.879 | 0.854 | ctggaAACCAaatcc I$BRCZ3_01 | 7585 (+) | 0.939 | 0.895 | ggtcaAACAAatgct I$BRCZ3_01 | 7772 (+) | 0.939 | 0.927 | caaaaAACAAaacaa I$BRCZ3_01 | 7777 (+) | 0.939 | 0.923 | aacaaAACAAaaatt I$BRCZ3_01 | 7819 (+) | 0.939 | 0.894 | tccaaAACAAacaat I$BRCZ3_01 | 7823 (+) | 0.939 | 0.881 | aaacaAACAAttaac I$BRCZ3_01 | 7887 (-) | 0.879 | 0.871 | tcattTGGTTtttat I$BRCZ3_01 | 7927 (-) | 0.879 | 0.849 | tcaatTCGTTtgtgt I$BRCZ3_01 | 7940 (-) | 1.000 | 0.959 | gtgttTAGTTttgag I$BRCZ3_01 | 7963 (-) | 1.000 | 0.872 | gactaTAGTTgtttg I$BRCZ3_01 | 7966 (-) | 0.939 | 0.873 | tatagTTGTTtgttt I$BRCZ3_01 | 7970 (-) | 0.939 | 0.900 | gttgtTTGTTtttca I$BRCZ3_01 | 8055 (-) | 1.000 | 0.972 | aggtcTAGTTtatct I$BRCZ3_01 | 8181 (-) | 0.939 | 0.893 | agaacTTGTTtgctc I$BRCZ3_01 | 8832 (+) | 0.879 | 0.869 | ttcaaAACCAaaaca I$BRCZ3_01 | 8931 (+) | 1.000 | 0.984 | gtataAACTAaagca I$BRCZ3_01 | 9108 (+) | 0.939 | 0.914 | agcaaAACAAaaaac I$BRCZ3_01 | 9414 (+) | 1.000 | 0.938 | tagaaAACTAtccca I$BRCZ3_01 | 9535 (+) | 0.939 | 0.894 | 
gaacaAACAAgtcct I$BRCZ3_01 | 9784 (-) | 0.939 | 0.854 | caattTTGTTcagta I$BRCZ3_01 | 10024 (-) | 1.000 | 0.898 | gaccgTAGTTtgtga I$BRCZ3_01 | 10127 (-) | 1.000 | 0.875 | aaaatTAGTTaagaa I$BRCZ3_01 | 10323 (-) | 0.834 | 0.848 | attttTATTTtatag I$BRCZ3_01 | 10330 (-) | 1.000 | 0.940 | ttttaTAGTTtttat I$BRCZ3_01 | 10339 (-) | 0.939 | 0.891 | ttttaTTGTTttaat I$BRCZ3_01 | 11185 (-) | 0.939 | 0.885 | ctgccTTGTTtcttg I$BRCZ3_01 | 11208 (-) | 0.939 | 0.895 | ttaatTTGTTtgaac I$BRCZ3_01 | 11472 (-) | 0.939 | 0.910 | tttatTTGTTtttcc I$BRCZ3_01 | 11630 (-) | 0.879 | 0.858 | gacccTCGTTtaaat I$BRCZ3_01 | 11878 (-) | 1.000 | 0.863 | ggactTAGTTctcac I$BRCZ3_01 | 11935 (+) | 1.000 | 0.918 | gctgaAACTAtttca I$BRCZ4_01 | 221 (-) | 0.936 | 0.926 | tttttTTTCCttt I$BRCZ4_01 | 631 (-) | 0.974 | 0.926 | tttttTTTAAaca I$BRCZ4_01 | 647 (-) | 1.000 | 0.916 | attcaTTTATtat I$BRCZ4_01 | 1893 (-) | 1.000 | 0.967 | ttataTTTATttt I$BRCZ4_01 | 1898 (-) | 1.000 | 0.941 | tttatTTTATtat I$BRCZ4_01 | 2360 (-) | 1.000 | 0.959 | ttgtaTTTATttt I$BRCZ4_01 | 2404 (+) | 0.974 | 0.917 | tccTTAAAaaaaa I$BRCZ4_01 | 4003 (+) | 1.000 | 0.917 | acgATAAAtaaat I$BRCZ4_01 | 4421 (+) | 0.961 | 0.917 | tgtCTAAAaaaaa I$BRCZ4_01 | 5393 (-) | 1.000 | 0.933 | ttaccTTTATatt I$BRCZ4_01 | 5645 (+) | 1.000 | 0.942 | aaaATAAAaagaa I$BRCZ4_01 | 5792 (+) | 0.974 | 0.942 | taaTTAAAtacaa I$BRCZ4_01 | 6670 (-) | 0.961 | 0.934 | ttttaTTTAGctt I$BRCZ4_01 | 7052 (-) | 0.948 | 0.942 | tttttTTTCTtaa I$BRCZ4_01 | 7431 (-) | 1.000 | 0.950 | ttgttTTTATttt I$BRCZ4_01 | 8930 (+) | 1.000 | 0.933 | agtATAAActaaa I$BRCZ4_01 | 10321 (-) | 1.000 | 0.926 | atattTTTATttt I$BRCZ4_01 | 10326 (-) | 1.000 | 0.916 | tttatTTTATagt V$PBX1_01 | 1050 (+) | 1.000 | 0.956 | aaaAATCAa V$PBX1_01 | 3908 (+) | 0.948 | 0.943 | attAATAAa V$PBX1_01 | 10340 (-) | 0.948 | 0.948 | tTTATTgtt V$PBX1_01 | 11197 (-) | 1.000 | 1.000 | tTGATTgat V$PBX1_01 | 11201 (-) | 1.000 | 0.968 | tTGATTatt V$PBX1_01 | 11686 (+) | 0.948 | 0.950 | accAATAAa V$PAX6_01 | 473 (-) | 0.660 | 
0.705 | tgacacttacaAGTCAacatt V$PAX6_01 | 525 (-) | 0.722 | 0.770 | taaaaatgatgGGTGAtgcgc V$PAX6_01 | 711 (+) | 0.688 | 0.689 | tctctTTACAgatggttgtga V$PAX6_01 | 908 (-) | 0.682 | 0.707 | tttctttcatgCGTAGcactt V$PAX6_01 | 1339 (+) | 0.749 | 0.756 | gggctTAAGGcttgcctgggt V$PAX6_01 | 2462 (-) | 0.812 | 0.723 | ggcctagcaagCGCGAaggcc V$PAX6_01 | 2519 (+) | 0.688 | 0.687 | aaaaaTTACAtattactttta V$PAX6_01 | 2735 (+) | 0.802 | 0.697 | tgcctTTATGcctttttcttt V$PAX6_01 | 2870 (-) | 0.735 | 0.687 | cagaactccacCGTGTatgcg V$PAX6_01 | 3464 (+) | 0.842 | 0.742 | gggccTCATGatttactgcct V$PAX6_01 | 3697 (-) | 0.832 | 0.729 | cacaagagatgCTTGAtcaat V$PAX6_01 | 3714 (-) | 0.722 | 0.695 | caatatttacgGGTGActaaa V$PAX6_01 | 3715 (+) | 0.960 | 0.855 | aatatTTACGggtgactaaaa V$PAX6_01 | 3751 (-) | 0.792 | 0.784 | ttttagtcattCTTAAaaggc V$PAX6_01 | 3863 (-) | 0.665 | 0.709 | accatttaaggAGTTAataca V$PAX6_01 | 3864 (+) | 0.749 | 0.713 | ccattTAAGGagttaatacaa V$PAX6_01 | 3907 (-) | 0.763 | 0.742 | tattaataaagCTTTAtgaat V$PAX6_01 | 3915 (+) | 0.802 | 0.745 | aagctTTATGaattacattat V$PAX6_01 | 4091 (-) | 0.604 | 0.687 | ttttactgattCTAAAacatc V$PAX6_01 | 4283 (-) | 0.778 | 0.693 | acacattagacCCTAAatttg V$PAX6_01 | 4306 (-) | 0.802 | 0.750 | cctcaatactgCATAAaactg V$PAX6_01 | 4525 (-) | 0.925 | 0.829 | gacaacaaatgCGTCAacagc V$PAX6_01 | 5166 (+) | 0.735 | 0.764 | gacttTCACTtgtgactgtgt V$PAX6_01 | 5182 (+) | 0.644 | 0.702 | tgtgtTCTAGcaggatttgtc V$PAX6_01 | 5376 (+) | 0.604 | 0.711 | tttctTTTAGttttattttac V$PAX6_01 | 6426 (-) | 0.647 | 0.689 | tggctttcatcGGTCAaccca V$PAX6_01 | 6570 (-) | 0.613 | 0.701 | ccacagttatgCAAAAcacaa V$PAX6_01 | 6720 (-) | 0.722 | 0.687 | aacctttaaagGGTGAgcagc V$PAX6_01 | 6721 (+) | 0.763 | 0.694 | accttTAAAGggtgagcagcc V$PAX6_01 | 7017 (+) | 0.673 | 0.758 | atgttTCCTGcatcagctgaa V$PAX6_01 | 7028 (-) | 0.665 | 0.684 | atcagctgaagAGTTAgttga V$PAX6_01 | 7029 (+) | 0.757 | 0.690 | tcagcTGAAGagttagttgac V$PAX6_01 | 7071 (+) | 0.613 | 0.735 | tttgtTTGTGtatgagtgctt V$PAX6_01 | 7415 (+) 
| 0.763 | 0.794 | actttTAAAGgtttatttgtt V$PAX6_01 | 7682 (-) | 0.630 | 0.716 | cagctgtaaagCCAGAacttg V$PAX6_01 | 7915 (+) | 0.778 | 0.739 | tttacTTAGGtatcaattcgt V$PAX6_01 | 8465 (-) | 0.613 | 0.688 | ctcatctaaggCAAAAataag V$PAX6_01 | 8518 (-) | 0.757 | 0.753 | actcacagatgCTTCAagaac V$PAX6_01 | 8537 (-) | 0.812 | 0.697 | acacagtgggcCGAGAaaatg V$PAX6_01 | 9112 (-) | 0.831 | 0.715 | aaacaaaaaacCGGGAaaacc V$PAX6_01 | 9163 (-) | 0.842 | 0.722 | ttcctaagatgCATGAacgcc V$PAX6_01 | 9765 (-) | 0.561 | 0.698 | aaacagtcatgCCATAgaaca V$PAX6_01 | 9896 (+) | 0.763 | 0.761 | tttttTAAAGacttacttact V$PAX6_01 | 10267 (+) | 0.756 | 0.695 | attttTGCCGgctgagcaatc V$PAX6_01 | 10337 (+) | 0.584 | 0.702 | gttttTATTGttttaattcaa V$PAX6_01 | 10746 (-) | 0.735 | 0.723 | tttcagtgatcCGTGTgtttg V$PAX6_01 | 11099 (+) | 0.763 | 0.709 | ctgttTAAAGaacgacttcag V$PAX6_01 | 11156 (-) | 0.589 | 0.705 | tgaaaatgaagCTGCAgagat V$PAX6_01 | 11190 (+) | 0.654 | 0.711 | ttgttTCTTGattgattatta V$PAX6_01 | 11397 (+) | 0.688 | 0.719 | attttGTACGcatttattctg V$PAX6_01 | 11448 (+) | 0.579 | 0.708 | tatttTGGTGaatcagtgaaa V$PAX6_01 | 11941 (-) | 0.647 | 0.683 | actatttcaagGGTCAggact V$PAX6_01 | 11942 (+) | 0.832 | 0.709 | ctattTCAAGggtcaggactg V$PAX2_01 | 133 (+) | 0.979 | 0.767 | ctcaGTCAAtccagacatt V$PAX2_01 | 134 (-) | 0.822 | 0.803 | tcagtcaatcCAGACattt V$PAX2_01 | 283 (+) | 0.979 | 0.839 | gcaaGTCAAgtgttttaca V$PAX2_01 | 520 (-) | 0.893 | 0.744 | gattttaaaaATGATgggt V$PAX2_01 | 527 (-) | 0.902 | 0.783 | aaaatgatggGTGATgcgc V$PAX2_01 | 613 (+) | 0.825 | 0.770 | tcctGTCGTacatgacact V$PAX2_01 | 614 (-) | 0.991 | 0.875 | cctgtcgtacATGACactt V$PAX2_01 | 644 (+) | 0.857 | 0.798 | tttaTTCATttattatata V$PAX2_01 | 909 (+) | 0.857 | 0.757 | ttctTTCATgcgtagcact V$PAX2_01 | 939 (+) | 0.847 | 0.784 | acaaGTTAGgaacaatgat V$PAX2_01 | 1087 (+) | 0.979 | 0.798 | tttgGTCAActaaaatttg V$PAX2_01 | 1219 (-) | 0.991 | 0.767 | ggagtttctgATGACtgct V$PAX2_01 | 1258 (+) | 0.893 | 0.756 | tgtcATCATccgtgtgtac V$PAX2_01 | 1273 (-) | 0.991 | 
0.768 | gtactctgggATGACaggg V$PAX2_01 | 1817 (+) | 0.837 | 0.808 | ggtgTTCAGggaagacacc V$PAX2_01 | 2327 (-) | 1.000 | 0.763 | gtcttgctatGTGACccta V$PAX2_01 | 2586 (-) | 0.970 | 0.759 | aaatgcagtaCTGACtatt V$PAX2_01 | 2635 (-) | 0.970 | 0.819 | tgctctacccCTGACctgc V$PAX2_01 | 2666 (-) | 0.856 | 0.759 | gatgacactcTTAACccgt V$PAX2_01 | 2961 (+) | 0.826 | 0.744 | tacaGTGACggcagatgcg V$PAX2_01 | 3038 (-) | 0.833 | 0.761 | cgctccacccGGGACatac V$PAX2_01 | 3199 (+) | 0.819 | 0.767 | ctgcCTCAAgtatgctagg V$PAX2_01 | 3229 (+) | 0.970 | 0.784 | accaGTCAGccaacattcc V$PAX2_01 | 3308 (+) | 0.856 | 0.762 | tgtaGTTAAgattggcctt V$PAX2_01 | 3464 (+) | 0.831 | 0.779 | gggcCTCATgatttactgc V$PAX2_01 | 3699 (-) | 0.881 | 0.821 | caagagatgcTTGATcaat V$PAX2_01 | 3708 (+) | 0.881 | 0.780 | cttgATCAAtatttacggg V$PAX2_01 | 3716 (-) | 1.000 | 0.862 | atatttacggGTGACtaaa V$PAX2_01 | 3752 (+) | 0.991 | 0.832 | tttaGTCATtcttaaaagg V$PAX2_01 | 3880 (-) | 0.979 | 0.772 | tacaagctccTTGACcctc V$PAX2_01 | 3916 (-) | 0.797 | 0.745 | agctttatgaATTACatta V$PAX2_01 | 3928 (-) | 0.826 | 0.762 | tacattatcaGTCACttgt V$PAX2_01 | 3934 (+) | 1.000 | 0.794 | atcaGTCACttgttcattc V$PAX2_01 | 3988 (+) | 0.831 | 0.760 | ccatCTCATgaaacaacga V$PAX2_01 | 4403 (+) | 0.970 | 0.783 | acagGTCAGccacaacctt V$PAX2_01 | 4461 (-) | 0.970 | 0.747 | ttcacactgaCTGACcagc V$PAX2_01 | 4552 (-) | 0.813 | 0.800 | gttttgatgcTGGACacag V$PAX2_01 | 4687 (-) | 0.881 | 0.754 | ggcaggaaacTTGATgtcc V$PAX2_01 | 4783 (-) | 0.970 | 0.752 | ataaagattgCTGACttgc V$PAX2_01 | 4845 (-) | 0.805 | 0.763 | atcctgagacTTCACtgag V$PAX2_01 | 4864 (+) | 0.970 | 0.749 | ggagGTCAGcagcaaagct V$PAX2_01 | 4928 (+) | 0.893 | 0.746 | acagATCATggagactgaa V$PAX2_01 | 4965 (+) | 0.970 | 0.746 | ggaaGTCAGgctcagcccc V$PAX2_01 | 5034 (+) | 0.810 | 0.751 | gatgCTCAGgattgctgtg V$PAX2_01 | 5062 (+) | 0.796 | 0.756 | ggtaGTGAGagatgagagg V$PAX2_01 | 5166 (+) | 0.866 | 0.796 | gactTTCACttgtgactgt V$PAX2_01 | 5167 (-) | 1.000 | 0.843 | actttcacttGTGACtgtg V$PAX2_01 | 5468 (-) | 
0.872 | 0.796 | atactatttcCTGATtttt V$PAX2_01 | 5968 (-) | 0.902 | 0.742 | tttttaagtgGTGATaact V$PAX2_01 | 6281 (-) | 0.893 | 0.751 | agcctctctgATGATgact V$PAX2_01 | 6288 (-) | 0.810 | 0.779 | ctgatgatgaCTGAGcagg V$PAX2_01 | 6313 (-) | 0.970 | 0.812 | atctacttccCTGACtgaa V$PAX2_01 | 6345 (+) | 0.866 | 0.750 | ttctTTCACgaaagctcag V$PAX2_01 | 6443 (+) | 0.991 | 0.820 | cccaGTCATggcaaactta V$PAX2_01 | 6456 (-) | 0.872 | 0.817 | aacttatttcCTGATaaaa V$PAX2_01 | 6556 (+) | 0.991 | 0.822 | aagaGTCATggttgccaca V$PAX2_01 | 6571 (+) | 0.868 | 0.798 | cacaGTTATgcaaaacaca V$PAX2_01 | 6760 (-) | 0.857 | 0.762 | cctttataccATGAAcaaa V$PAX2_01 | 7323 (+) | 0.822 | 0.755 | gtacGTCTGgaacaataga V$PAX2_01 | 7951 (-) | 0.822 | 0.764 | tgagatacccCAGACtata V$PAX2_01 | 8037 (-) | 0.979 | 0.791 | ttgctgggccTTGACtcga V$PAX2_01 | 8102 (+) | 1.000 | 0.752 | tcctGTCACacactctccc V$PAX2_01 | 8200 (-) | 0.970 | 0.762 | tgtttcccctCTGACctca V$PAX2_01 | 8385 (-) | 0.837 | 0.753 | atctgtatacCTGAAgcca V$PAX2_01 | 8564 (+) | 0.843 | 0.762 | gtccGTCTTtgattaaatg V$PAX2_01 | 8612 (+) | 0.979 | 0.754 | ttccGTCAAtcactttgag V$PAX2_01 | 8706 (+) | 0.881 | 0.749 | taccATCAAggagtctgaa V$PAX2_01 | 8906 (+) | 0.837 | 0.752 | cagaTTCAGgaatactcgg V$PAX2_01 | 9165 (-) | 0.857 | 0.777 | cctaagatgcATGAAcgcc V$PAX2_01 | 9252 (+) | 0.831 | 0.777 | gctgCTCATgggacagggc V$PAX2_01 | 9336 (-) | 0.837 | 0.755 | tggaggacccCTGAAgtct V$PAX2_01 | 9518 (-) | 0.843 | 0.768 | catgtccaacAAGACtaga V$PAX2_01 | 9541 (+) | 0.825 | 0.748 | acaaGTCCTgcgtagtcgc V$PAX2_01 | 9766 (+) | 0.991 | 0.822 | aacaGTCATgccatagaac V$PAX2_01 | 9844 (+) | 0.970 | 0.819 | taaaGTCAGgcaggcagca V$PAX2_01 | 10184 (+) | 0.837 | 0.743 | tgtgTTCAGcagtcagttc V$PAX2_01 | 10191 (+) | 0.970 | 0.747 | agcaGTCAGttctgtcctt V$PAX2_01 | 10242 (+) | 0.970 | 0.853 | ggtgGTCAGgcttgtgcag V$PAX2_01 | 10286 (+) | 0.872 | 0.751 | tctcATCAGgcttcggaca V$PAX2_01 | 10468 (-) | 0.831 | 0.756 | catctcacttTAGACcagc V$PAX2_01 | 10488 (+) | 0.881 | 0.843 | acatATCAAggatcaaaag V$PAX2_01 
| 10502 (+) | 0.979 | 0.798 | aaaaGTCAAttatggctgt V$PAX2_01 | 10623 (-) | 1.000 | 0.772 | gttgtagaaaGTGACtgag V$PAX2_01 | 11062 (-) | 0.806 | 0.752 | gttgtgatcaGTTACaagc V$PAX2_01 | 11064 (+) | 0.872 | 0.744 | tgtgATCAGttacaagctg V$PAX2_01 | 11187 (-) | 0.881 | 0.795 | gccttgtttcTTGATtgat V$PAX2_01 | 11191 (-) | 0.881 | 0.772 | tgtttcttgaTTGATtatt V$PAX2_01 | 11569 (+) | 0.893 | 0.766 | tcttATCATtgcttatatc V$PAX2_01 | 11663 (-) | 0.868 | 0.742 | cacaattttcATAACcaaa V$PAX2_01 | 11772 (-) | 0.881 | 0.789 | ttccagatacTTGATgtaa V$PAX2_01 | 11942 (+) | 0.846 | 0.827 | ctatTTCAAgggtcaggac V$PAX2_01 | 11949 (+) | 0.970 | 0.793 | aaggGTCAGgactgcacct V$S8_01 | 61 (-) | 1.000 | 0.969 | gccaTAATTactgttg V$S8_01 | 3428 (-) | 1.000 | 0.993 | tgctTAATTgacttag V$S8_01 | 4056 (+) | 1.000 | 0.972 | aggcaatAATTAaaat V$S8_01 | 4058 (-) | 1.000 | 0.981 | gcaaTAATTaaaatat V$S8_01 | 4441 (+) | 1.000 | 0.987 | accacacAATTAatat V$S8_01 | 5786 (+) | 1.000 | 0.984 | atgaaatAATTAaata V$S8_01 | 5788 (-) | 1.000 | 0.986 | gaaaTAATTaaataca V$S8_01 | 7824 (+) | 1.000 | 0.991 | aacaaacAATTAaccc V$S8_01 | 8681 (+) | 1.000 | 0.985 | ggaaggtAATTAgcct V$S8_01 | 8683 (-) | 1.000 | 0.982 | aaggTAATTagccttt V$S8_01 | 10502 (+) | 1.000 | 0.990 | aaaagtcAATTAtggc V$CDP_02 | 648 (+) | 0.703 | 0.777 | ttcATTTAttatata V$CDP_02 | 1107 (+) | 0.773 | 0.809 | gaaATCGCtgatttg V$CDP_02 | 1751 (-) | 0.912 | 0.912 | acactgaTCTATaag V$CDP_02 | 1754 (+) | 0.773 | 0.805 | ctgATCTAtaagcct V$CDP_02 | 2719 (+) | 0.912 | 0.908 | caaATAGAtaatctt V$CDP_02 | 3095 (-) | 0.912 | 0.819 | tttcgaaTCTATact V$CDP_02 | 3706 (-) | 0.930 | 0.911 | tgcttgaTCAATatt V$CDP_02 | 3999 (+) | 0.773 | 0.813 | aacAACGAtaaataa V$CDP_02 | 4103 (-) | 0.907 | 0.811 | taaaacaTCCATttt V$CDP_02 | 5233 (+) | 0.773 | 0.810 | gggAACGAttacatg V$CDP_02 | 5316 (-) | 0.773 | 0.789 | ttcttcaTCGGTtca V$CDP_02 | 6428 (-) | 0.773 | 0.797 | gctttcaTCGGTcaa V$CDP_02 | 6431 (+) | 0.773 | 0.805 | ttcATCGGtcaaccc V$CDP_02 | 6921 (+) | 0.930 | 0.831 | ggaATTGAttgtatg 
V$CDP_02 | 7372 (+) | 0.930 | 0.809 | agaATTGAtattgct V$CDP_02 | 7920 (-) | 0.930 | 0.813 | ttaggtaTCAATtcg V$CDP_02 | 9716 (+) | 0.930 | 0.833 | aaaATTGAtctcttt V$CDP_02 | 9875 (-) | 0.930 | 0.837 | gaggaaaTCAATtta V$CDP_02 | 10381 (-) | 0.907 | 0.825 | gttgcaaTCCATata V$CDP_02 | 11194 (-) | 0.773 | 0.801 | ttcttgaTTGATtat V$CDP_02 | 11197 (+) | 0.930 | 0.943 | ttgATTGAttattaa V$CDP_02 | 11346 (-) | 0.773 | 0.797 | gccattaTCGAAcga V$CDP_02 | 11439 (-) | 0.912 | 0.912 | atacttaTCTATttt V$CDP_02 | 11693 (-) | 0.703 | 0.779 | aagttaaTCACTgtc V$CLOX_01 | 11197 (+) | 0.941 | 0.947 | ttgATTGAttattaa V$CDPCR1_01 | 856 (-) | 0.858 | 0.888 | acgaTCGTTg V$CDPCR1_01 | 3709 (-) | 0.929 | 0.869 | ttgaTCAATa V$CDPCR1_01 | 6923 (+) | 0.929 | 0.918 | aATTGAttgt V$CDPCR1_01 | 7923 (-) | 0.929 | 0.868 | ggtaTCAATt V$CDPCR1_01 | 9718 (+) | 0.929 | 0.885 | aATTGAtctc V$CDPCR1_01 | 9878 (-) | 0.929 | 0.865 | gaaaTCAATt V$CDPCR1_01 | 10384 (-) | 0.910 | 0.917 | gcaaTCCATa V$CDPCR3_01 | 62 (-) | 1.000 | 0.845 | CCATAattactgttg V$CDPCR3_01 | 1008 (+) | 0.745 | 0.676 | gacgcacctaTATGT V$CDPCR3_01 | 1015 (-) | 0.980 | 0.696 | CTATAtgtgattggc V$CDPCR3_01 | 1471 (-) | 0.996 | 0.727 | CAATAtttgctctgg V$CDPCR3_01 | 1735 (+) | 0.980 | 0.700 | ctgaacttccTATAG V$CDPCR3_01 | 1759 (-) | 0.980 | 0.683 | CTATAagcctctcgc V$CDPCR3_01 | 1853 (-) | 0.996 | 0.746 | CAATAgagccaactg V$CDPCR3_01 | 2005 (-) | 0.980 | 0.703 | CTATAcatctcctga V$CDPCR3_01 | 2615 (-) | 0.795 | 0.701 | CCATGtgtatgctag V$CDPCR3_01 | 3051 (-) | 0.745 | 0.714 | ACATAcctgcgagtg V$CDPCR3_01 | 3057 (+) | 0.980 | 0.774 | ctgcgagtggTATAG V$CDPCR3_01 | 3103 (-) | 0.980 | 0.705 | CTATActaatttcca V$CDPCR3_01 | 3123 (-) | 0.980 | 0.695 | CTATAttaatttaca V$CDPCR3_01 | 3141 (-) | 0.745 | 0.713 | CCACAacaatatatg V$CDPCR3_01 | 3142 (+) | 1.000 | 0.860 | cacaacaataTATGG V$CDPCR3_01 | 3147 (-) | 0.996 | 0.774 | CAATAtatggaggtt V$CDPCR3_01 | 3247 (+) | 1.000 | 0.702 | ccagcccactTATGG V$CDPCR3_01 | 3276 (-) | 0.786 | 0.684 | CCATTatgattggga 
V$CDPCR3_01 | 3372 (+) | 1.000 | 0.754 | agcctaatcaTATGG V$CDPCR3_01 | 3711 (+) | 0.745 | 0.691 | gatcaatattTACGG V$CDPCR3_01 | 3714 (-) | 0.996 | 0.904 | CAATAtttacgggtg V$CDPCR3_01 | 4004 (-) | 0.998 | 0.710 | CGATAaataaatatt V$CDPCR3_01 | 4086 (-) | 0.980 | 0.680 | CTATAttttactgat V$CDPCR3_01 | 4316 (-) | 0.745 | 0.691 | GCATAaaactgggtc V$CDPCR3_01 | 4447 (-) | 0.781 | 0.693 | CAATTaatattcatt V$CDPCR3_01 | 4572 (+) | 0.980 | 0.709 | tgcggttagaTATAG V$CDPCR3_01 | 4622 (-) | 0.996 | 0.777 | CAATAtcctttgggt V$CDPCR3_01 | 4674 (+) | 0.745 | 0.685 | ctcatagttcTGTGG V$CDPCR3_01 | 4781 (-) | 0.743 | 0.752 | GGATAaagattgctg V$CDPCR3_01 | 5085 (+) | 0.745 | 0.689 | ctctgactggTTTGG V$CDPCR3_01 | 5337 (-) | 0.998 | 0.765 | CGATAgttttgcatc V$CDPCR3_01 | 5433 (+) | 0.996 | 0.688 | actttatttgTATTG V$CDPCR3_01 | 5448 (-) | 1.000 | 0.682 | CCATAtaaccttttc V$CDPCR3_01 | 5466 (-) | 0.745 | 0.704 | ACATActatttcctg V$CDPCR3_01 | 6143 (-) | 0.795 | 0.687 | CCATGcctctggctc V$CDPCR3_01 | 6866 (+) | 0.980 | 0.836 | gaccgaggctTATAG V$CDPCR3_01 | 6958 (-) | 0.786 | 0.757 | CCATTattctggatg V$CDPCR3_01 | 7335 (-) | 0.996 | 0.699 | CAATAgacacatttc V$CDPCR3_01 | 7370 (+) | 0.996 | 0.781 | aaagaattgaTATTG V$CDPCR3_01 | 7928 (-) | 0.781 | 0.818 | CAATTcgtttgtgtg V$CDPCR3_01 | 7956 (+) | 0.980 | 0.729 | taccccagacTATAG V$CDPCR3_01 | 7965 (-) | 0.980 | 0.854 | CTATAgttgtttgtt V$CDPCR3_01 | 8087 (-) | 0.980 | 0.840 | CTATAcagctctggg V$CDPCR3_01 | 8241 (-) | 0.745 | 0.704 | CCAGAtgaaggtgtg V$CDPCR3_01 | 8560 (-) | 0.791 | 0.733 | CAATGtccgtctttg V$CDPCR3_01 | 8650 (-) | 0.781 | 0.674 | CAATTcccaatggtt V$CDPCR3_01 | 9236 (-) | 0.745 | 0.705 | CCAGAgccctaggag V$CDPCR3_01 | 9426 (-) | 1.000 | 0.715 | CCATAaataacagga V$CDPCR3_01 | 9748 (-) | 0.996 | 0.678 | CAATAtattaatatt V$CDPCR3_01 | 9749 (+) | 0.996 | 0.766 | aatatattaaTATTG V$CDPCR3_01 | 9784 (-) | 0.781 | 0.690 | CAATTttgttcagta V$CDPCR3_01 | 9967 (+) | 1.000 | 0.687 | ttctggttccTATGG V$CDPCR3_01 | 10123 (-) | 0.996 | 0.746 | CAATAaaattagtta 
V$CDPCR3_01 | 10140 (+) | 0.996 | 0.757 | aaattattatTATTG V$CDPCR3_01 | 10171 (-) | 0.745 | 0.765 | CCTTAttagtgtgtg V$CDPCR3_01 | 10389 (-) | 1.000 | 0.675 | CCATAtaaaaatgaa V$CDPCR3_01 | 10502 (+) | 1.000 | 0.697 | aaaagtcaatTATGG V$CDPCR3_01 | 10697 (+) | 0.745 | 0.737 | cacccgtgttTATGA V$CDPCR3_01 | 10726 (+) | 0.996 | 0.681 | ctgtagctttTATTG V$CDPCR3_01 | 11170 (+) | 0.745 | 0.735 | cagagatcttTTTGG V$CDPCR3_01 | 11437 (-) | 0.745 | 0.676 | GCATActtatctatt V$CDPCR3_01 | 11533 (+) | 1.000 | 0.679 | aaatcccacaTATGG V$CDPCR3_01 | 11800 (-) | 0.745 | 0.690 | CCAGAttggaaggtg V$CDPCR3HD_01 | 1754 (-) | 0.983 | 0.964 | ctgATCTAta V$CDPCR3HD_01 | 2721 (+) | 0.983 | 0.911 | aaTAGATaat V$CDPCR3HD_01 | 2982 (-) | 0.983 | 0.861 | ggcATCTAct V$CDPCR3HD_01 | 3098 (-) | 0.983 | 0.951 | cgaATCTAta V$CDPCR3HD_01 | 3709 (-) | 0.996 | 0.952 | ttgATCAAta V$CDPCR3HD_01 | 4106 (-) | 0.983 | 0.937 | aacATCCAtt V$CDPCR3HD_01 | 5484 (+) | 0.996 | 0.870 | ttTTGATccc V$CDPCR3HD_01 | 6097 (+) | 0.983 | 0.858 | tgTGGATcag V$CDPCR3HD_01 | 6310 (-) | 0.983 | 0.862 | ctgATCTAct V$CDPCR3HD_01 | 6783 (-) | 0.983 | 0.855 | cagATCTAat V$CDPCR3HD_01 | 6923 (+) | 0.996 | 0.944 | aaTTGATtgt V$CDPCR3HD_01 | 7374 (+) | 0.996 | 0.927 | aaTTGATatt V$CDPCR3HD_01 | 7923 (-) | 0.996 | 0.957 | ggtATCAAtt V$CDPCR3HD_01 | 8380 (+) | 0.996 | 0.867 | ctTTGATctg V$CDPCR3HD_01 | 9718 (+) | 0.996 | 0.986 | aaTTGATctc V$CDPCR3HD_01 | 9878 (-) | 0.996 | 0.959 | gaaATCAAtt V$CDPCR3HD_01 | 10217 (-) | 0.983 | 0.850 | ggcATCTAag V$CDPCR3HD_01 | 10384 (-) | 0.983 | 0.952 | gcaATCCAta V$CDPCR3HD_01 | 10672 (-) | 0.983 | 0.851 | aggATCCAca V$CDPCR3HD_01 | 11199 (+) | 0.996 | 0.934 | gaTTGATtat V$CDPCR3HD_01 | 11442 (-) | 0.983 | 0.931 | cttATCTAtt V$CDPCR3HD_01 | 11546 (+) | 0.983 | 0.861 | ggTAGATggg V$NRF2_01 | 3032 (-) | 1.000 | 0.946 | gaCTTCCgct V$CEBPB_01 | 333 (-) | 0.965 | 0.928 | aaATTCCctcagac V$CEBPB_01 | 489 (-) | 1.000 | 0.956 | acATTGCaacatta V$CEBPB_01 | 803 (-) | 0.996 | 0.951 | ccATTTCtccagcc V$CEBPB_01 | 
1100 (+) | 0.996 | 0.975 | aatttgaGAAATcg V$CEBPB_01 | 1463 (-) | 0.966 | 0.972 | acGTTTCtcaatat V$CEBPB_01 | 2368 (-) | 0.976 | 0.935 | atTTTGCcacagtt V$CEBPB_01 | 2555 (+) | 0.996 | 0.979 | ctgttgtGAAATtt V$CEBPB_01 | 2564 (+) | 0.965 | 0.929 | aatttccGTAAGta V$CEBPB_01 | 2564 (-) | 0.941 | 0.927 | aaTTTCCgtaagta V$CEBPB_01 | 2712 (-) | 0.950 | 0.939 | ctCTTCCcaaatag V$CEBPB_01 | 2803 (-) | 0.965 | 0.947 | ctCTTACtgaacac V$CEBPB_01 | 3424 (-) | 0.970 | 0.952 | tgGTTGCttaattg V$CEBPB_01 | 3437 (+) | 0.950 | 0.943 | gacttagGTAACag V$CEBPB_01 | 3569 (+) | 0.985 | 0.966 | gcctttgGCAAGtt V$CEBPB_01 | 3644 (+) | 0.965 | 0.942 | acatggtGGAATca V$CEBPB_01 | 3739 (-) | 0.996 | 0.955 | ttATTTCtacattt V$CEBPB_01 | 4207 (-) | 0.985 | 0.939 | ggCTTGCctcagtt V$CEBPB_01 | 4270 (-) | 0.965 | 0.943 | tgCTTACctaagca V$CEBPB_01 | 4708 (+) | 0.981 | 0.963 | gacttggGTAATtc V$CEBPB_01 | 4795 (-) | 0.985 | 0.981 | gaCTTGCataatct V$CEBPB_01 | 5152 (+) | 0.972 | 0.955 | agtttacGAAAAgg V$CEBPB_01 | 5152 (-) | 0.956 | 0.928 | agTTTACgaaaagg V$CEBPB_01 | 5456 (-) | 0.972 | 0.945 | ccTTTTCcccacat V$CEBPB_01 | 5471 (-) | 0.996 | 0.928 | ctATTTCctgattt V$CEBPB_01 | 6547 (+) | 0.965 | 0.927 | ggctgagGTAAGag V$CEBPB_01 | 6564 (-) | 0.970 | 0.926 | tgGTTGCcacagtt V$CEBPB_01 | 6573 (+) | 0.976 | 0.958 | cagttatGCAAAac V$CEBPB_01 | 6608 (-) | 0.976 | 0.964 | taTTTGCtaaactc V$CEBPB_01 | 7054 (-) | 0.972 | 0.968 | ttTTTTCttaattt V$CEBPB_01 | 7343 (-) | 0.996 | 0.964 | acATTTCaacatgt V$CEBPB_01 | 7402 (-) | 0.981 | 0.936 | gaATTACtgaagaa V$CEBPB_01 | 8071 (-) | 1.000 | 0.961 | gtATTGCagaaact V$CEBPB_01 | 8578 (+) | 0.976 | 0.949 | aaatgggGCAAAgg V$CEBPB_01 | 8623 (+) | 0.972 | 0.956 | actttgaGAAAAgc V$CEBPB_01 | 8693 (-) | 0.981 | 0.945 | gcCTTTCtgaaaat V$CEBPB_01 | 8907 (+) | 0.965 | 0.948 | agattcaGGAATac V$CEBPB_01 | 9098 (-) | 0.965 | 0.944 | tgCTTACaaaagca V$CEBPB_01 | 9509 (-) | 0.966 | 0.943 | aaGTTTCcccatgt V$CEBPB_01 | 9601 (+) | 0.950 | 0.954 | gcatttgGGAAGct V$CEBPB_01 | 9707 (+) | 0.956 | 
0.929 | tggtggtGTAAAat V$CEBPB_01 | 9789 (+) | 0.956 | 0.931 | ttgttcaGTAAAtg V$CEBPB_01 | 9871 (+) | 0.996 | 0.956 | gaatgagGAAATca V$CEBPB_01 | 10070 (+) | 0.950 | 0.931 | gtcttctGGAAGgg V$CEBPB_01 | 10132 (+) | 0.996 | 0.971 | tagttaaGAAATta V$CEBPB_01 | 10369 (-) | 0.981 | 0.980 | atCTTTCaaaatgt V$CEBPB_01 | 10377 (+) | 1.000 | 0.962 | aaatgttGCAATcc V$CEBPB_01 | 10622 (+) | 0.981 | 0.933 | agttgtaGAAAGtg V$CEBPB_01 | 10703 (+) | 0.966 | 0.948 | tgtttatGAAACat V$CEBPB_01 | 10968 (+) | 0.985 | 0.928 | tgttggaGCAAGga V$CEBPB_01 | 11666 (-) | 0.972 | 0.964 | aaTTTTCataacca V$CEBPB_01 | 11726 (-) | 0.972 | 0.947 | caTTTTCtcaaaca V$CEBPB_01 | 11780 (+) | 0.965 | 0.925 | acttgatGTAAGga I$ELF1_01 | 929 (-) | 1.000 | 0.940 | ccacgAAACCacaagt I$ELF1_01 | 1137 (-) | 1.000 | 0.922 | tcctaAAACCgttact I$ELF1_01 | 2600 (-) | 1.000 | 0.919 | ctattAAACCaagagc I$ELF1_01 | 3254 (+) | 1.000 | 0.914 | acttatGGTTTtcttg I$ELF1_01 | 4124 (-) | 1.000 | 0.909 | tgttaAAACCcagaat I$ELF1_01 | 4434 (-) | 1.000 | 0.929 | aaaaaAAACCacacaa I$ELF1_01 | 4613 (-) | 1.000 | 0.931 | aggtaAAACCaatatc I$ELF1_01 | 5609 (-) | 1.000 | 0.904 | acaagAAACCaacatc I$ELF1_01 | 6743 (-) | 1.000 | 0.922 | tttgaAAACCcactat I$ELF1_01 | 7887 (+) | 1.000 | 0.919 | tcatttGGTTTttatt I$ELF1_01 | 8064 (+) | 0.966 | 0.948 | ttatctGGTATtgcag I$ELF1_01 | 8655 (+) | 1.000 | 0.934 | cccaatGGTTTtcaaa I$ELF1_01 | 8700 (-) | 0.966 | 0.904 | tgaaaATACCatcaag I$ELF1_01 | 8831 (-) | 1.000 | 0.997 | gttcaAAACCaaaaca I$ELF1_01 | 8845 (+) | 1.000 | 0.941 | cagaacGGTTTaaaaa I$ELF1_01 | 9114 (-) | 1.000 | 0.922 | acaaaAAACCgggaaa I$ELF1_01 | 9705 (+) | 0.966 | 0.906 | ggtggtGGTGTaaaat I$ELF1_01 | 10938 (+) | 0.966 | 0.958 | gtggacGGTGTtgtag I$ELF1_01 | 11548 (+) | 0.966 | 0.918 | tagatgGGTATtcaac V$CREB_02 | 1842 (-) | 1.000 | 0.949 | ggtCGTCAacac V$CREB_02 | 2962 (+) | 1.000 | 0.886 | acagTGACGgca V$CREB_02 | 4533 (-) | 1.000 | 0.963 | atgCGTCAacag V$CREB_02 | 8323 (-) | 1.000 | 0.956 | ctgCGTCAgcgc V$CREB_02 | 8612 (-) | 1.000 | 
0.883 | ttcCGTCAatca V$CREB_02 | 11314 (+) | 1.000 | 0.948 | acggTGACGacg V$CREB_02 | 11329 (+) | 1.000 | 0.944 | atgaTGACGatg V$TAXCREB_01 | 1278 (+) | 0.780 | 0.782 | ctgggaTGACAggga V$TAXCREB_01 | 1841 (-) | 1.000 | 0.854 | gggtCGTCAacacaa V$TAXCREB_01 | 3285 (+) | 0.780 | 0.791 | ttgggaTGAGGtgaa V$TAXCREB_01 | 4532 (-) | 1.000 | 0.844 | aatgCGTCAacagcc V$TAXCREB_01 | 5149 (+) | 0.906 | 0.782 | ggcagtTTACGaaaa V$TAXCREB_01 | 6433 (-) | 0.780 | 0.826 | catcGGTCAacccag V$TAXCREB_01 | 8322 (-) | 1.000 | 0.900 | tctgCGTCAgcgcca V$TAXCREB_01 | 8611 (-) | 1.000 | 0.811 | attcCGTCAatcact V$TAXCREB_01 | 9381 (-) | 0.780 | 0.801 | gagcTGTCAgcccct V$TAXCREB_01 | 11312 (+) | 1.000 | 0.793 | ctacggTGACGacga V$TAXCREB_01 | 11700 (-) | 0.780 | 0.786 | tcacTGTCAccccaa V$TAXCREB_02 | 23 (-) | 0.600 | 0.631 | gtgaatgtgTGTGAa V$TAXCREB_02 | 407 (-) | 0.600 | 0.672 | ggcggaatgGGTGAg V$TAXCREB_02 | 566 (-) | 0.800 | 0.622 | ggagataaaCGTCTa V$TAXCREB_02 | 722 (-) | 0.600 | 0.673 | atggttgtgAGCCAc V$TAXCREB_02 | 798 (+) | 0.600 | 0.664 | cTGAGCcatttctcc V$TAXCREB_02 | 1703 (+) | 0.800 | 0.686 | aTGAAGcaccctctg V$TAXCREB_02 | 1767 (+) | 0.600 | 0.631 | cTCTCGcctgtcctc V$TAXCREB_02 | 1875 (-) | 0.600 | 0.734 | gaggatctgCCTCCc V$TAXCREB_02 | 1971 (-) | 0.800 | 0.629 | gcaggttggCCTCAa V$TAXCREB_02 | 2001 (-) | 0.600 | 0.620 | ctggctataCATCTc V$TAXCREB_02 | 2435 (-) | 0.600 | 0.664 | ggggatttaGCTCAg V$TAXCREB_02 | 2644 (-) | 0.800 | 0.644 | cctgacctgCGTAAt V$TAXCREB_02 | 2880 (-) | 0.600 | 0.655 | ccgtgtatgCGTACt V$TAXCREB_02 | 3006 (-) | 0.800 | 0.646 | gtgcgcaggCGTCTc V$TAXCREB_02 | 3485 (+) | 0.800 | 0.629 | aTGAGGcagaatagc V$TAXCREB_02 | 3699 (-) | 0.600 | 0.624 | caagagatgCTTGAt V$TAXCREB_02 | 3890 (+) | 0.800 | 0.642 | tTGACCctcatcaca V$TAXCREB_02 | 4143 (-) | 0.600 | 0.642 | ggtgtggtgCCTCCc V$TAXCREB_02 | 4204 (-) | 0.800 | 0.638 | caaggcttgCCTCAg V$TAXCREB_02 | 4527 (-) | 1.000 | 0.673 | caacaaatgCGTCAa V$TAXCREB_02 | 4832 (-) | 0.600 | 0.633 | tagggaacgAGTGAt V$TAXCREB_02 | 4969 
(+) | 0.600 | 0.675 | gTCAGGctcagcccc V$TAXCREB_02 | 5678 (+) | 0.600 | 0.646 | cAGACAcagagaccc V$TAXCREB_02 | 5747 (+) | 0.800 | 0.646 | tAGACGcaaaggacc V$TAXCREB_02 | 5891 (+) | 0.600 | 0.622 | aTGGGGcccgcccct V$TAXCREB_02 | 6000 (-) | 0.600 | 0.683 | tggggtacgTGTCCa V$TAXCREB_02 | 6258 (+) | 0.600 | 0.631 | tTGTCCcatctgccg V$TAXCREB_02 | 6327 (+) | 0.800 | 0.692 | cTGAAGcacacactg V$TAXCREB_02 | 6550 (-) | 0.800 | 0.710 | tgaggtaagAGTCAt V$TAXCREB_02 | 6575 (+) | 0.600 | 0.631 | gTTATGcaaaacaca V$TAXCREB_02 | 7338 (+) | 0.600 | 0.620 | tAGACAcatttcaac V$TAXCREB_02 | 7538 (-) | 0.600 | 0.666 | ttggttgtgAGCCAa V$TAXCREB_02 | 7608 (+) | 0.600 | 0.675 | cTGAGCcatctctcc V$TAXCREB_02 | 7845 (-) | 0.800 | 0.631 | tggcaaaagCGACAg V$TAXCREB_02 | 8104 (+) | 0.600 | 0.651 | cTGTCAcacactctc V$TAXCREB_02 | 8106 (+) | 0.600 | 0.657 | gTCACAcactctccc V$TAXCREB_02 | 8273 (+) | 0.800 | 0.743 | aTGATGcagctgccc V$TAXCREB_02 | 8317 (-) | 1.000 | 0.741 | cagcttctgCGTCAg V$TAXCREB_02 | 8395 (+) | 0.800 | 0.644 | cTGAAGccaagccag V$TAXCREB_02 | 8746 (+) | 0.600 | 0.624 | gTGGCCcacaccttg V$TAXCREB_02 | 8805 (-) | 0.600 | 0.629 | gggggccagCTTCTg V$TAXCREB_02 | 9123 (-) | 0.800 | 0.620 | cgggaaaacCTTCAg V$TAXCREB_02 | 9168 (+) | 0.600 | 0.629 | aAGATGcatgaacgc V$TAXCREB_02 | 9315 (-) | 0.800 | 0.752 | gaggcgttgGGTCAg V$TAXCREB_02 | 9376 (-) | 0.800 | 0.631 | agggagagcTGTCAg V$TAXCREB_02 | 9923 (-) | 0.600 | 0.640 | ttgcctatgGGTAAt V$TAXCREB_02 | 10029 (-) | 0.800 | 0.695 | tagtttgtgAGTCAc V$TAXCREB_02 | 10637 (+) | 0.800 | 0.672 | cTGAGGaagacaccc V$TAXCREB_02 | 10708 (+) | 0.600 | 0.657 | aTGAAAcatctctgc V$TAXCREB_02 | 10741 (-) | 0.600 | 0.699 | gggggtttcAGTGAt V$CEBPA_01 | 393 (-) | 0.922 | 0.930 | tcatttgCCAACaa V$CEBPA_01 | 420 (-) | 0.920 | 0.931 | agcttgtCCAAAat V$CEBPA_01 | 489 (+) | 1.000 | 0.941 | acATTGCaacatta V$CEBPA_01 | 1085 (+) | 0.920 | 0.930 | atTTTGGtcaacta V$CEBPA_01 | 1100 (+) | 0.909 | 0.921 | aaTTTGAgaaatcg V$CEBPA_01 | 1100 (-) | 0.929 | 0.934 | aatttgaGAAATcg 
V$CEBPA_01 | 1161 (-) | 0.909 | 0.921 | ttcttttTCAAAga V$CEBPA_01 | 1463 (-) | 0.932 | 0.938 | acgtttcTCAATat V$CEBPA_01 | 1656 (+) | 0.917 | 0.924 | taCTTGGcaaatgg V$CEBPA_01 | 2368 (+) | 0.977 | 0.921 | atTTTGCcacagtt V$CEBPA_01 | 2555 (-) | 0.929 | 0.934 | ctgttgtGAAATtt V$CEBPA_01 | 2564 (-) | 0.940 | 0.933 | aatttccGTAAGta V$CEBPA_01 | 2803 (+) | 0.940 | 0.932 | ctCTTACtgaacac V$CEBPA_01 | 2935 (+) | 1.000 | 0.926 | tgATTGCgaagtct V$CEBPA_01 | 3424 (+) | 0.979 | 0.973 | tgGTTGCttaattg V$CEBPA_01 | 3437 (-) | 0.945 | 0.945 | gacttagGTAACag V$CEBPA_01 | 3569 (-) | 0.974 | 0.972 | gcctttgGCAAGtt V$CEBPA_01 | 3677 (+) | 1.000 | 0.920 | gaATTGCcaagtgc V$CEBPA_01 | 3706 (-) | 0.932 | 0.942 | tgcttgaTCAATat V$CEBPA_01 | 4270 (+) | 0.940 | 0.947 | tgCTTACctaagca V$CEBPA_01 | 4708 (+) | 0.917 | 0.922 | gaCTTGGgtaattc V$CEBPA_01 | 4708 (-) | 0.966 | 0.962 | gacttggGTAATtc V$CEBPA_01 | 4795 (+) | 0.974 | 0.973 | gaCTTGCataatct V$CEBPA_01 | 4899 (+) | 0.917 | 0.922 | tcCTTGGccaagtg V$CEBPA_01 | 4899 (-) | 0.917 | 0.925 | tccttggCCAAGtg V$CEBPA_01 | 5152 (+) | 0.943 | 0.946 | agTTTACgaaaagg V$CEBPA_01 | 5355 (+) | 0.909 | 0.923 | tcTTTGAcaaaaca V$CEBPA_01 | 5442 (+) | 1.000 | 0.922 | gtATTGCcatataa V$CEBPA_01 | 6564 (+) | 0.979 | 0.920 | tgGTTGCcacagtt V$CEBPA_01 | 6573 (-) | 0.977 | 0.971 | cagttatGCAAAac V$CEBPA_01 | 6608 (+) | 0.977 | 0.974 | taTTTGCtaaactc V$CEBPA_01 | 7116 (-) | 1.000 | 0.922 | catgtgtGCAATgc V$CEBPA_01 | 7208 (+) | 0.932 | 0.937 | gaATTGAacaaaaa V$CEBPA_01 | 7402 (+) | 0.966 | 0.955 | gaATTACtgaagaa V$CEBPA_01 | 8071 (+) | 1.000 | 0.984 | gtATTGCagaaact V$CEBPA_01 | 8907 (-) | 0.929 | 0.926 | agattcaGGAATac V$CEBPA_01 | 9098 (+) | 0.940 | 0.949 | tgCTTACaaaagca V$CEBPA_01 | 9447 (-) | 0.943 | 0.951 | aggtttaCCAATct V$CEBPA_01 | 9789 (-) | 0.943 | 0.937 | ttgttcaGTAAAtg V$CEBPA_01 | 10128 (+) | 0.909 | 0.920 | aaATTAGttaagaa V$CEBPA_01 | 10132 (-) | 0.929 | 0.935 | tagttaaGAAATta V$CEBPA_01 | 10275 (-) | 1.000 | 0.924 | cggctgaGCAATct V$CEBPA_01 | 
10377 (-) | 1.000 | 0.937 | aaatgttGCAATcc V$CEBPA_01 | 10430 (-) | 1.000 | 0.925 | agtcttcGCAATgc V$CEBPA_01 | 10703 (-) | 0.908 | 0.922 | tgtttatGAAACat V$CEBPA_01 | 11449 (+) | 0.920 | 0.922 | atTTTGGtgaatca V$CEBPA_01 | 11666 (+) | 0.906 | 0.921 | aaTTTTCataacca V$CEBPB_02 | 489 (+) | 1.000 | 0.919 | acaTTGCAacatta V$CEBPB_02 | 1085 (+) | 0.842 | 0.886 | attTTGGTcaacta V$CEBPB_02 | 1100 (-) | 0.861 | 0.889 | aatttgAGAAAtcg V$CEBPB_02 | 1463 (+) | 0.861 | 0.884 | acgTTTCTcaatat V$CEBPB_02 | 2368 (+) | 0.994 | 0.913 | attTTGCCacagtt V$CEBPB_02 | 2555 (-) | 0.865 | 0.890 | ctgttgTGAAAttt V$CEBPB_02 | 3424 (+) | 0.996 | 0.981 | tggTTGCTtaattg V$CEBPB_02 | 3437 (-) | 0.882 | 0.894 | gacttaGGTAAcag V$CEBPB_02 | 3569 (-) | 0.994 | 0.954 | gcctttGGCAAgtt V$CEBPB_02 | 4207 (+) | 0.994 | 0.915 | ggcTTGCCtcagtt V$CEBPB_02 | 4270 (+) | 0.882 | 0.888 | tgcTTACCtaagca V$CEBPB_02 | 4708 (-) | 0.882 | 0.899 | gacttgGGTAAttc V$CEBPB_02 | 4795 (+) | 1.000 | 0.970 | gacTTGCAtaatct V$CEBPB_02 | 5152 (+) | 0.888 | 0.885 | agtTTACGaaaagg V$CEBPB_02 | 5442 (+) | 0.994 | 0.886 | gtaTTGCCatataa V$CEBPB_02 | 6564 (+) | 0.994 | 0.913 | tggTTGCCacagtt V$CEBPB_02 | 6573 (-) | 1.000 | 0.985 | cagttaTGCAAaac V$CEBPB_02 | 6608 (+) | 0.996 | 0.970 | tatTTGCTaaactc V$CEBPB_02 | 7054 (+) | 0.861 | 0.886 | tttTTTCTtaattt V$CEBPB_02 | 8071 (+) | 1.000 | 0.955 | gtaTTGCAgaaact V$CEBPB_02 | 8578 (-) | 0.994 | 0.923 | aaatggGGCAAagg V$CEBPB_02 | 8936 (-) | 0.996 | 0.884 | aactaaAGCAActt V$CEBPB_02 | 9098 (+) | 0.888 | 0.889 | tgcTTACAaaagca V$CEBPB_02 | 10132 (-) | 0.861 | 0.883 | tagttaAGAAAtta V$CEBPB_02 | 10377 (-) | 1.000 | 0.924 | aaatgtTGCAAtcc V$CEBPB_02 | 10686 (-) | 0.994 | 0.894 | gactctGGCAAcac V$CEBPB_02 | 10968 (-) | 0.996 | 0.916 | tgttggAGCAAgga V$CEBPB_02 | 11284 (-) | 0.994 | 0.889 | gcatagGGCAAaga V$MYCMAX_01 | 733 (+) | 1.000 | 0.989 | ccacCACGTggttg V$MYCMAX_01 | 733 (-) | 1.000 | 0.989 | ccaccACGTGgttg V$MYCMAX_01 | 1124 (+) | 1.000 | 0.866 | tatgCACGTgtgtt V$MYCMAX_01 | 1124 (-) 
| 1.000 | 0.866 | tatgcACGTGtgtt V$MYCMAX_01 | 6062 (+) | 1.000 | 0.899 | cctgCACGTgctgc V$MYCMAX_01 | 6062 (-) | 1.000 | 0.899 | cctgcACGTGctgc V$MAX_01 | 733 (+) | 1.000 | 0.992 | ccacCACGTggttg V$MAX_01 | 733 (-) | 1.000 | 0.992 | ccaccACGTGgttg V$MAX_01 | 1124 (+) | 1.000 | 0.937 | tatgCACGTgtgtt V$MAX_01 | 1124 (-) | 1.000 | 0.937 | tatgcACGTGtgtt V$MAX_01 | 6062 (+) | 1.000 | 0.957 | cctgCACGTgctgc V$MAX_01 | 6062 (-) | 1.000 | 0.957 | cctgcACGTGctgc I$DL_02 | 898 (-) | 1.000 | 0.939 | gacTTTTCact I$DL_02 | 1548 (+) | 1.000 | 0.889 | actGAAAActc I$DL_02 | 1717 (-) | 1.000 | 0.896 | gccTTTTCcgg I$DL_02 | 2377 (-) | 1.000 | 0.962 | cagTTTTCcca I$DL_02 | 2501 (+) | 1.000 | 0.880 | tccGAAAAaaa I$DL_02 | 2512 (+) | 1.000 | 0.939 | aggGAAAAaaa I$DL_02 | 2750 (-) | 1.000 | 0.937 | ttcTTTTCtct I$DL_02 | 3259 (-) | 1.000 | 0.912 | tggTTTTCttg I$DL_02 | 4111 (-) | 1.000 | 0.950 | ccaTTTTCcca I$DL_02 | 5455 (-) | 1.000 | 0.920 | accTTTTCccc I$DL_02 | 5776 (+) | 1.000 | 0.958 | agaGAAAAata I$DL_02 | 5939 (+) | 1.000 | 0.946 | ggaGAAAAgta I$DL_02 | 6125 (-) | 1.000 | 0.944 | ttgTTTTCtcg I$DL_02 | 6707 (+) | 1.000 | 0.990 | agaGAAAAgcc I$DL_02 | 6743 (+) | 1.000 | 0.899 | tttGAAAAccc I$DL_02 | 7867 (+) | 1.000 | 0.898 | ttaGAAAAgct I$DL_02 | 7976 (-) | 1.000 | 0.891 | tgtTTTTCaag I$DL_02 | 8146 (+) | 1.000 | 0.927 | aggGAAAAcat I$DL_02 | 8548 (+) | 1.000 | 0.950 | cgaGAAAAtga I$DL_02 | 8627 (+) | 1.000 | 0.980 | tgaGAAAAgct I$DL_02 | 8660 (-) | 1.000 | 0.896 | tggTTTTCaaa I$DL_02 | 8698 (+) | 1.000 | 0.877 | tctGAAAAtac I$DL_02 | 9123 (+) | 1.000 | 0.980 | cggGAAAAcct I$DL_02 | 9413 (+) | 1.000 | 0.881 | ctaGAAAActa I$DL_02 | 9496 (+) | 1.000 | 0.895 | ataGAAAAtct I$DL_02 | 10827 (-) | 1.000 | 0.876 | gccTTTTCagg I$DL_02 | 11478 (-) | 1.000 | 0.989 | tgtTTTTCcct I$DL_02 | 11725 (-) | 1.000 | 0.950 | ccaTTTTCtca V$USF_01 | 285 (+) | 0.812 | 0.843 | aagtCAAGTgtttt V$USF_01 | 285 (-) | 0.812 | 0.843 | aagtcAAGTGtttt V$USF_01 | 733 (+) | 1.000 | 0.991 | ccacCACGTggttg V$USF_01 
| 733 (-) | 1.000 | 0.991 | ccaccACGTGgttg V$USF_01 | 1124 (+) | 1.000 | 0.962 | tatgCACGTgtgtt V$USF_01 | 1124 (-) | 1.000 | 0.962 | tatgcACGTGtgtt V$USF_01 | 1356 (+) | 0.812 | 0.850 | gggtCACCTgatgc V$USF_01 | 1356 (-) | 0.812 | 0.850 | gggtcACCTGatgc V$USF_01 | 3639 (+) | 0.822 | 0.853 | ctacCACATggtgg V$USF_01 | 3639 (-) | 0.822 | 0.853 | ctaccACATGgtgg V$USF_01 | 3936 (+) | 0.812 | 0.843 | cagtCACTTgttca V$USF_01 | 3936 (-) | 0.812 | 0.843 | cagtcACTTGttca V$USF_01 | 6062 (+) | 1.000 | 0.971 | cctgCACGTgctgc V$USF_01 | 6062 (-) | 1.000 | 0.971 | cctgcACGTGctgc V$USF_02 | 285 (+) | 0.915 | 0.935 | aagtCAAGTgtttt V$USF_02 | 285 (-) | 0.915 | 0.935 | aagtcAAGTGtttt V$USF_02 | 733 (+) | 1.000 | 0.994 | ccacCACGTggttg V$USF_02 | 733 (-) | 1.000 | 0.994 | ccaccACGTGgttg V$USF_02 | 1124 (+) | 1.000 | 0.962 | tatgCACGTgtgtt V$USF_02 | 1124 (-) | 1.000 | 0.962 | tatgcACGTGtgtt V$USF_02 | 1356 (+) | 0.935 | 0.947 | gggtCACCTgatgc V$USF_02 | 1356 (-) | 0.935 | 0.947 | gggtcACCTGatgc V$USF_02 | 1784 (+) | 0.935 | 0.945 | ggagCACCTgccat V$USF_02 | 1784 (-) | 0.935 | 0.945 | ggagcACCTGccat V$USF_02 | 3639 (+) | 0.938 | 0.950 | ctacCACATggtgg V$USF_02 | 3639 (-) | 0.938 | 0.950 | ctaccACATGgtgg V$USF_02 | 3936 (+) | 0.915 | 0.933 | cagtCACTTgttca V$USF_02 | 3936 (-) | 0.915 | 0.933 | cagtcACTTGttca V$USF_02 | 5250 (+) | 0.938 | 0.938 | ttcaCATGTggtca V$USF_02 | 5250 (-) | 0.938 | 0.938 | ttcacATGTGgtca V$USF_02 | 6062 (+) | 1.000 | 0.972 | cctgCACGTgctgc V$USF_02 | 6062 (-) | 1.000 | 0.972 | cctgcACGTGctgc V$USF_02 | 7110 (+) | 0.938 | 0.936 | gtacCACATgtgtg V$USF_02 | 7110 (-) | 0.938 | 0.936 | gtaccACATGtgtg V$USF_02 | 8967 (+) | 0.935 | 0.934 | gaacCAGGTgtggt V$USF_02 | 8967 (-) | 0.935 | 0.934 | gaaccAGGTGtggt V$MYCMAX_02 | 734 (+) | 1.000 | 0.984 | cacCACGTggtt V$MYCMAX_02 | 734 (-) | 1.000 | 0.964 | caccACGTGgtt V$MYCMAX_02 | 1125 (+) | 1.000 | 0.942 | atgCACGTgtgt V$MYCMAX_02 | 1125 (-) | 1.000 | 0.929 | atgcACGTGtgt V$MYCMAX_02 | 3640 (+) | 0.941 | 0.921 | 
tacCACATggtg V$MYCMAX_02 | 5251 (-) | 0.941 | 0.953 | tcacATGTGgtc V$MYCMAX_02 | 6063 (-) | 1.000 | 0.960 | ctgcACGTGctg V$MYCMAX_02 | 7111 (+) | 0.941 | 0.946 | tacCACATgtgt V$MYCMAX_02 | 8999 (+) | 0.941 | 0.919 | cagCACATggag V$PBX1_02 | 11194 (-) | 1.000 | 0.982 | ttcttGATTGattat F$MCM1_01 | 7574 (+) | 0.867 | 0.895 | aatCCCCAtcaggtca V$GATA1_02 | 80 (+) | 0.993 | 0.979 | atgtTGATAgggcc V$GATA1_02 | 89 (-) | 1.000 | 0.970 | gggccTATCTtcgc V$GATA1_02 | 305 (-) | 1.000 | 0.944 | tgagcTATCTctcc V$GATA1_02 | 564 (+) | 1.000 | 0.951 | taggAGATAaacgt V$GATA1_02 | 1993 (-) | 1.000 | 0.950 | aagatTATCTggct V$GATA1_02 | 2316 (+) | 0.986 | 0.942 | ttttGGATAcagtc V$GATA1_02 | 2720 (+) | 1.000 | 0.955 | aaatAGATAatctt V$GATA1_02 | 3928 (-) | 0.993 | 0.962 | tacatTATCAgtca V$GATA1_02 | 3975 (+) | 0.993 | 0.934 | tgtaTGATAgacac V$GATA1_02 | 4372 (-) | 1.000 | 0.935 | aaagtTATCTggct V$GATA1_02 | 4575 (+) | 1.000 | 0.945 | ggttAGATAtagtc V$GATA1_02 | 4620 (-) | 0.986 | 0.956 | accaaTATCCtttg V$GATA1_02 | 4777 (+) | 0.986 | 0.945 | gtctGGATAaagat V$GATA1_02 | 5333 (+) | 0.991 | 0.948 | gcgcCGATAgtttt V$GATA1_02 | 5525 (-) | 0.986 | 0.969 | cctccTATCCagat V$GATA1_02 | 5580 (+) | 0.986 | 0.943 | taaaGGATAataat V$GATA1_02 | 5631 (+) | 0.986 | 0.937 | actgGGATAaagac V$GATA1_02 | 5975 (+) | 0.993 | 0.935 | gtggTGATAacttc V$GATA1_02 | 6463 (+) | 0.993 | 0.941 | ttccTGATAaaatg V$GATA1_02 | 6751 (-) | 0.986 | 0.969 | cccacTATCCcttt V$GATA1_02 | 6945 (+) | 0.993 | 0.959 | caagTGATActgtc V$GATA1_02 | 7150 (-) | 1.000 | 0.975 | cgccgTATCTcctg V$GATA1_02 | 7308 (+) | 0.986 | 0.940 | gtaaGGATAtagaa V$GATA1_02 | 7373 (+) | 0.993 | 0.957 | gaatTGATAttgct V$GATA1_02 | 7786 (-) | 0.986 | 0.939 | aaaatTATCCcaaa V$GATA1_02 | 7795 (+) | 1.000 | 0.955 | ccaaAGATAaagaa V$GATA1_02 | 7900 (+) | 0.993 | 0.947 | atttTGATAttagc V$GATA1_02 | 7949 (+) | 1.000 | 0.938 | tttgAGATAcccca V$GATA1_02 | 8060 (-) | 1.000 | 0.944 | tagttTATCTggta V$GATA1_02 | 8602 (-) | 0.993 | 0.950 | aatatTATCAttcc V$GATA1_02 | 
9365 (-) | 0.993 | 0.948 | ttctcTATCAcagg V$GATA1_02 | 9417 (-) | 0.986 | 0.941 | aaaacTATCCcata V$GATA1_02 | 9730 (+) | 0.993 | 0.956 | tgtaTGATAatgtc V$GATA1_02 | 9831 (+) | 0.986 | 0.942 | gaagGGATAatttt V$GATA1_02 | 10486 (-) | 0.993 | 0.958 | ccacaTATCAagga V$GATA1_02 | 10592 (-) | 0.993 | 0.952 | tcctaTATCAaaaa V$GATA1_02 | 11346 (-) | 0.991 | 0.961 | gccatTATCGaacg V$GATA1_02 | 11367 (-) | 1.000 | 0.952 | gattcTATCTgtgc V$GATA1_02 | 11439 (-) | 1.000 | 0.963 | atactTATCTattt V$GATA1_02 | 11507 (+) | 0.986 | 0.936 | ttgtGGATAttacc V$GATA1_02 | 11567 (-) | 0.993 | 0.973 | actctTATCAttgc V$GATA1_02 | 11772 (+) | 1.000 | 0.960 | ttccAGATActtga V$GATA1_04 | 81 (+) | 0.991 | 0.956 | tgtTGATAgggcc V$GATA1_04 | 89 (-) | 1.000 | 0.962 | gggccTATCTtcg V$GATA1_04 | 305 (-) | 1.000 | 0.959 | tgagcTATCTctc V$GATA1_04 | 565 (+) | 1.000 | 0.979 | aggAGATAaacgt V$GATA1_04 | 1993 (-) | 1.000 | 0.981 | aagatTATCTggc V$GATA1_04 | 2721 (+) | 1.000 | 0.967 | aatAGATAatctt V$GATA1_04 | 3928 (-) | 0.991 | 0.974 | tacatTATCAgtc V$GATA1_04 | 3976 (+) | 0.991 | 0.954 | gtaTGATAgacac V$GATA1_04 | 4372 (-) | 1.000 | 0.983 | aaagtTATCTggc V$GATA1_04 | 5976 (+) | 0.991 | 0.962 | tggTGATAacttc V$GATA1_04 | 6464 (+) | 0.991 | 0.976 | tccTGATAaaatg V$GATA1_04 | 7150 (-) | 1.000 | 0.952 | cgccgTATCTcct V$GATA1_04 | 7796 (+) | 1.000 | 0.972 | caaAGATAaagaa V$GATA1_04 | 8060 (-) | 1.000 | 0.987 | tagttTATCTggt V$GATA1_04 | 8602 (-) | 0.991 | 0.963 | aatatTATCAttc V$GATA1_04 | 9365 (-) | 0.991 | 0.947 | ttctcTATCAcag V$GATA1_04 | 9731 (+) | 0.991 | 0.964 | gtaTGATAatgtc V$GATA1_04 | 11367 (-) | 1.000 | 0.963 | gattcTATCTgtg V$GATA1_04 | 11439 (-) | 1.000 | 0.982 | atactTATCTatt V$GATA1_04 | 11567 (-) | 0.991 | 0.970 | actctTATCAttg V$GATA1_04 | 11773 (+) | 1.000 | 0.949 | tccAGATActtga V$HFH1_01 | 568 (-) | 1.000 | 0.842 | agaTAAACgtct V$HFH1_01 | 604 (-) | 1.000 | 0.843 | tttTAAACatcc V$HFH1_01 | 635 (-) | 1.000 | 0.873 | tttTAAACattt V$HFH1_01 | 2410 (-) | 0.841 | 0.852 | aaaAAAACaaaa 
V$HFH1_01 | 4008 (-) | 0.790 | 0.835 | aaaTAAATattc V$HFH1_01 | 5718 (-) | 0.841 | 0.845 | acaAAAACacta V$HFH1_01 | 7064 (+) | 0.841 | 0.834 | atttGTTTTtgt V$HFH1_01 | 7070 (+) | 0.853 | 0.846 | ttttGTTTGtgt V$HFH1_01 | 7421 (+) | 1.000 | 0.893 | aaagGTTTAttt V$HFH1_01 | 7429 (+) | 0.841 | 0.844 | atttGTTTTtat V$HFH1_01 | 7823 (-) | 0.853 | 0.893 | aaaCAAACaatt V$HFH1_01 | 7938 (+) | 1.000 | 0.903 | gtgtGTTTAgtt V$HFH1_01 | 7969 (+) | 0.853 | 0.862 | agttGTTTGttt V$HFH1_01 | 8058 (+) | 1.000 | 0.869 | tctaGTTTAtct V$HFH1_01 | 8553 (-) | 0.790 | 0.848 | aaaTGAACaatg V$HFH1_01 | 10311 (+) | 0.841 | 0.858 | aactGTTTTtat V$HFH1_01 | 10456 (-) | 0.790 | 0.842 | ataTCAACaatg V$HFH1_01 | 10700 (+) | 1.000 | 0.865 | ccgtGTTTAtga V$HFH1_01 | 11097 (+) | 1.000 | 0.861 | cactGTTTAaag V$HFH1_01 | 11467 (+) | 0.790 | 0.847 | aattTTTTAttt V$HFH1_01 | 11575 (+) | 0.790 | 0.842 | cattGCTTAtat V$FOXD3_01 | 215 (+) | 0.906 | 0.921 | taTTTTTttttt V$FOXD3_01 | 1893 (+) | 0.944 | 0.918 | ttATATTtattt V$FOXD3_01 | 1905 (+) | 0.948 | 0.916 | taTTATTttatt V$FOXD3_01 | 2410 (-) | 1.000 | 0.949 | aaaaaAACAAaa V$FOXD3_01 | 2414 (-) | 0.906 | 0.937 | aaacaAAAAAtc V$FOXD3_01 | 4008 (-) | 0.944 | 0.958 | aaataAATATtc V$FOXD3_01 | 4221 (-) | 0.948 | 0.917 | aaaaaAATAAat V$FOXD3_01 | 5805 (-) | 0.906 | 0.921 | aaaaaAAAAAta V$FOXD3_01 | 7047 (+) | 0.884 | 0.912 | gaCTATTttttt V$FOXD3_01 | 7425 (+) | 0.948 | 0.931 | gtTTATTtgttt V$FOXD3_01 | 7781 (-) | 0.901 | 0.929 | aaacaAAAATta V$FOXD3_01 | 7823 (-) | 1.000 | 0.997 | aaacaAACAAtt V$FOXD3_01 | 7969 (+) | 1.000 | 0.937 | agTTGTTtgttt V$FOXD3_01 | 8553 (-) | 1.000 | 0.929 | aaatgAACAAtg V$FOXD3_01 | 11203 (+) | 0.948 | 0.915 | gaTTATTaattt V$FOXD3_01 | 11293 (-) | 0.948 | 0.938 | aaagaAATAAta V$FOXD3_01 | 11467 (+) | 0.906 | 0.929 | aaTTTTTtattt V$FOXD3_01 | 11471 (+) | 0.948 | 0.926 | ttTTATTtgttt V$HNF3B_01 | 210 (+) | 1.000 | 0.903 | cctggTATTTttttt V$HNF3B_01 | 212 (+) | 0.930 | 0.898 | tggtaTTTTTttttt V$HNF3B_01 | 213 (+) | 0.930 | 0.918 | 
ggtatTTTTTttttt V$HNF3B_01 | 503 (+) | 1.000 | 0.894 | ttataTATTTgtctt V$HNF3B_01 | 847 (-) | 1.000 | 0.938 | taaatAAATAcgatc V$HNF3B_01 | 954 (-) | 1.000 | 0.906 | tgatcAAATAtttac V$HNF3B_01 | 1469 (+) | 1.000 | 0.910 | ctcaaTATTTgctct V$HNF3B_01 | 1524 (+) | 1.000 | 0.917 | agagcTGTTTttttg V$HNF3B_01 | 1891 (+) | 1.000 | 0.945 | ttttaTATTTatttt V$HNF3B_01 | 1895 (+) | 1.000 | 0.907 | atattTATTTtatta V$HNF3B_01 | 1903 (+) | 1.000 | 0.938 | tttatTATTTtattt V$HNF3B_01 | 2358 (+) | 1.000 | 0.910 | acttgTATTTatttt V$HNF3B_01 | 2409 (-) | 1.000 | 0.928 | aaaaaAAACAaaaaa V$HNF3B_01 | 2413 (-) | 0.930 | 0.910 | aaaacAAAAAatcac V$HNF3B_01 | 3452 (-) | 1.000 | 0.894 | taataAAACAcaggg V$HNF3B_01 | 4007 (-) | 1.000 | 0.966 | taaatAAATAttctg V$HNF3B_01 | 4220 (-) | 1.000 | 0.923 | taaaaAAATAaattc V$HNF3B_01 | 4429 (-) | 0.930 | 0.890 | aaaaaAAAAAaaacc V$HNF3B_01 | 4430 (-) | 0.930 | 0.894 | aaaaaAAAAAaacca V$HNF3B_01 | 4498 (-) | 1.000 | 0.925 | aaacaAAACAaaaca V$HNF3B_01 | 4503 (-) | 1.000 | 0.893 | aaacaAAACAacatc V$HNF3B_01 | 5121 (-) | 1.000 | 0.900 | aaaggAAACAgatgg V$HNF3B_01 | 5590 (-) | 1.000 | 0.913 | taataAAATAtaata V$HNF3B_01 | 5717 (-) | 1.000 | 0.896 | cacaaAAACActaaa V$HNF3B_01 | 5792 (-) | 1.000 | 0.927 | taattAAATAcaaaa V$HNF3B_01 | 5805 (-) | 0.930 | 0.894 | aaaaaAAAAAtagcc V$HNF3B_01 | 5807 (-) | 1.000 | 0.930 | aaaaaAAATAgccct V$HNF3B_01 | 6589 (-) | 1.000 | 0.928 | aaaacAAATAaggtg V$HNF3B_01 | 7045 (+) | 1.000 | 0.953 | ttgacTATTTttttt V$HNF3B_01 | 7068 (+) | 1.000 | 0.901 | gttttTGTTTgtgta V$HNF3B_01 | 7423 (+) | 1.000 | 0.943 | aggttTATTTgtttt V$HNF3B_01 | 7776 (-) | 1.000 | 0.896 | aaacaAAACAaaaat V$HNF3B_01 | 7780 (-) | 0.930 | 0.904 | aaaacAAAAAttatc V$HNF3B_01 | 7822 (-) | 1.000 | 0.966 | aaaacAAACAattaa V$HNF3B_01 | 7936 (+) | 1.000 | 0.916 | ttgtgTGTTTagttt V$HNF3B_01 | 7967 (+) | 1.000 | 0.926 | atagtTGTTTgtttt V$HNF3B_01 | 9534 (-) | 1.000 | 0.906 | agaacAAACAagtcc V$HNF3B_01 | 10309 (+) | 1.000 | 0.920 | ttaacTGTTTttata V$HNF3B_01 | 10317 (+) | 
1.000 | 0.890 | ttttaTATTTttatt V$HNF3B_01 | 10319 (+) | 0.930 | 0.894 | ttataTTTTTatttt V$HNF3B_01 | 10340 (+) | 1.000 | 0.896 | tttatTGTTTtaatt V$HNF3B_01 | 10911 (-) | 1.000 | 0.905 | tatgcAAATAatagt V$HNF3B_01 | 11292 (-) | 1.000 | 0.928 | caaagAAATAataat V$HNF3B_01 | 11464 (+) | 0.930 | 0.908 | tgaaaTTTTTtattt V$HNF3B_01 | 11465 (+) | 0.930 | 0.891 | gaaatTTTTTatttg V$HNF3B_01 | 11469 (+) | 1.000 | 0.938 | tttttTATTTgtttt V$HNF3B_01 | 11674 (-) | 0.930 | 0.895 | taaccAAAAAagacc V$HNF1_01 | 987 (+) | 1.000 | 0.948 | aGTTAAtaattttca V$HNF1_01 | 2535 (-) | 0.918 | 0.820 | ttttagagaATAACa V$HNF1_01 | 3412 (+) | 1.000 | 0.916 | aGTTAAtgattctgg V$HNF1_01 | 3449 (+) | 1.000 | 0.815 | aGTTAAtaaaacaca V$HNF1_01 | 4056 (-) | 0.788 | 0.823 | aggcaataaTTAAAa V$HNF1_01 | 5099 (+) | 0.918 | 0.884 | gGTTATtatttgtga V$HNF1_01 | 5786 (-) | 0.788 | 0.812 | atgaaataaTTAAAt V$HNF1_01 | 7354 (+) | 1.000 | 0.867 | tGTTAAtcatgggca V$HNF1_01 | 7824 (-) | 1.000 | 0.909 | aacaaacaaTTAACc V$HNF1_01 | 8488 (+) | 0.918 | 0.849 | gGTTATtttctacag V$HNF1_01 | 8852 (-) | 0.788 | 0.809 | gtttaaaaaTTAAGa V$HNF1_01 | 11694 (+) | 1.000 | 0.886 | aGTTAAtcactgtca V$HNF4_01 | 7082 (-) | 1.000 | 0.864 | atgagtgCTTTGcgcgcac V$HNF4_01 | 8373 (-) | 1.000 | 0.934 | tgctgctCTTTGatctgta V$HNF4_01 | 8579 (+) | 1.000 | 0.884 | aatggggCAAAGgcaacat V$OCT1_01 | 2622 (+) | 0.784 | 0.845 | tatgcTAGGCaaatgctct V$OCT1_01 | 2829 (+) | 0.784 | 0.848 | agttgTCTGCaaatggcat V$OCT1_01 | 3099 (+) | 0.784 | 0.819 | gaatcTATACtaatttcca V$OCT1_01 | 3838 (-) | 0.831 | 0.819 | tttccattcTCATAaaata V$OCT1_01 | 4066 (+) | 0.837 | 0.806 | taaaaTATGTaaaatgtat V$OCT1_01 | 4791 (-) | 1.000 | 0.864 | tgctgacttGCATAatctc V$OCT1_01 | 5439 (-) | 0.834 | 0.829 | tttgtattgCCATAtaacc V$OCT1_01 | 6203 (+) | 1.000 | 0.838 | cggggTATGCcaaggtttc V$OCT1_01 | 6572 (+) | 1.000 | 0.890 | acagtTATGCaaaacacaa V$OCT1_01 | 9727 (+) | 0.831 | 0.848 | ctttgTATGAtaatgtcca V$OCT1_01 | 9931 (-) | 0.822 | 0.880 | gggtaatttGCATGtatgt V$OCT1_01 | 10906 (+) | 
1.000 | 0.970 | gctacTATGCaaataatag V$OCT1_01 | 11662 (-) | 0.831 | 0.881 | acacaatttTCATAaccaa V$OCT1_02 | 2 (-) | 0.809 | 0.792 | ttcatGCATGtacaa V$OCT1_02 | 641 (+) | 0.992 | 0.827 | acattTATTCattta V$OCT1_02 | 869 (+) | 0.814 | 0.769 | ttagaAATGCacaaa V$OCT1_02 | 957 (+) | 0.758 | 0.777 | tcaaaTATTTacagg V$OCT1_02 | 1119 (+) | 1.000 | 0.854 | ttgtgTATGCacgtg V$OCT1_02 | 1466 (-) | 0.753 | 0.776 | tttctCAATAtttgc V$OCT1_02 | 2001 (+) | 0.940 | 0.813 | ctggcTATACatctc V$OCT1_02 | 2315 (-) | 0.942 | 0.783 | gttttGGATAcagtc V$OCT1_02 | 2521 (-) | 0.766 | 0.793 | aaattACATAttact V$OCT1_02 | 2537 (-) | 0.992 | 0.775 | ttagaGAATAacagg V$OCT1_02 | 2582 (+) | 0.814 | 0.768 | caacaAATGCagtac V$OCT1_02 | 2593 (-) | 0.753 | 0.778 | gtactGACTAttaaa V$OCT1_02 | 2617 (+) | 1.000 | 0.801 | atgtgTATGCtaggc V$OCT1_02 | 2736 (+) | 1.000 | 0.777 | gccttTATGCctttt V$OCT1_02 | 2880 (+) | 1.000 | 0.803 | ccgtgTATGCgtact V$OCT1_02 | 2890 (-) | 0.805 | 0.817 | gtactGCATCttttt V$OCT1_02 | 2900 (+) | 0.942 | 0.788 | tttttTATCCacatc V$OCT1_02 | 2944 (-) | 0.761 | 0.785 | agtctGCCTAttacg V$OCT1_02 | 2953 (-) | 0.992 | 0.769 | attacGAATAcagtg V$OCT1_02 | 3176 (+) | 0.942 | 0.810 | cattaTATCCcccct V$OCT1_02 | 3204 (+) | 1.000 | 0.824 | tcaagTATGCtaggc V$OCT1_02 | 3363 (-) | 0.992 | 0.850 | ccactGAATAgccta V$OCT1_02 | 3577 (+) | 0.992 | 0.826 | caagtTATTCaactc V$OCT1_02 | 3709 (-) | 0.753 | 0.778 | ttgatCAATAtttac V$OCT1_02 | 3712 (+) | 0.758 | 0.784 | atcaaTATTTacggg V$OCT1_02 | 3850 (+) | 0.942 | 0.918 | taaaaTATCCagtac V$OCT1_02 | 4007 (-) | 0.758 | 0.812 | taaatAAATAttctg V$OCT1_02 | 4010 (+) | 0.992 | 0.905 | ataaaTATTCtgctc V$OCT1_02 | 4029 (-) | 0.753 | 0.805 | ggaatTAATAttcca V$OCT1_02 | 4032 (+) | 0.992 | 0.911 | attaaTATTCcagca V$OCT1_02 | 4050 (+) | 0.761 | 0.783 | ggaaaTAGGCaataa V$OCT1_02 | 4066 (+) | 0.766 | 0.786 | taaaaTATGTaaaat V$OCT1_02 | 4076 (-) | 0.940 | 0.841 | aaaatGTATActata V$OCT1_02 | 4103 (+) | 0.752 | 0.775 | taaaaCATCCatttt V$OCT1_02 | 4167 (+) | 0.992 | 0.878 | 
cctagTATTCaggag V$OCT1_02 | 4311 (-) | 1.000 | 0.835 | atactGCATAaaact V$OCT1_02 | 4446 (-) | 0.753 | 0.809 | acaatTAATAttcat V$OCT1_02 | 4449 (+) | 0.992 | 0.965 | attaaTATTCatttc V$OCT1_02 | 4473 (-) | 0.814 | 0.768 | gaccaGCATTtttct V$OCT1_02 | 4620 (+) | 0.942 | 0.868 | accaaTATCCtttgg V$OCT1_02 | 4776 (-) | 0.942 | 0.791 | agtctGGATAaagat V$OCT1_02 | 4795 (-) | 1.000 | 0.918 | gacttGCATAatctc V$OCT1_02 | 4884 (+) | 0.802 | 0.787 | tggaaCATTCcatgt V$OCT1_02 | 5525 (+) | 0.942 | 0.789 | cctccTATCCagatt V$OCT1_02 | 5579 (-) | 0.942 | 0.794 | ctaaaGGATAataat V$OCT1_02 | 5780 (+) | 0.761 | 0.785 | aaaaaTATGAaataa V$OCT1_02 | 6203 (+) | 1.000 | 0.801 | cggggTATGCcaagg V$OCT1_02 | 6468 (+) | 0.814 | 0.770 | gataaAATGCtcagg V$OCT1_02 | 6572 (+) | 1.000 | 0.834 | acagtTATGCaaaac V$OCT1_02 | 6751 (+) | 0.942 | 0.786 | cccacTATCCcttta V$OCT1_02 | 6899 (-) | 1.000 | 0.779 | agaggGCATAagtgg V$OCT1_02 | 6957 (+) | 0.992 | 0.826 | tccatTATTCtggat V$OCT1_02 | 7042 (-) | 0.753 | 0.780 | tagttGACTAttttt V$OCT1_02 | 7264 (+) | 0.942 | 0.865 | taaaaTATCCtaaaa V$OCT1_02 | 7307 (-) | 0.942 | 0.809 | agtaaGGATAtagaa V$OCT1_02 | 7656 (+) | 1.000 | 0.771 | ttttgTATGCcaggc V$OCT1_02 | 7786 (+) | 0.942 | 0.790 | aaaatTATCCcaaag V$OCT1_02 | 8083 (+) | 0.940 | 0.787 | cttccTATACagctc V$OCT1_02 | 8384 (-) | 0.940 | 0.779 | gatctGTATAcctga V$OCT1_02 | 8596 (-) | 0.753 | 0.780 | atcttTAATAttatc V$OCT1_02 | 8867 (-) | 1.000 | 0.771 | ctctgGCATAgaata V$OCT1_02 | 8872 (-) | 0.992 | 0.932 | gcataGAATAttcta V$OCT1_02 | 8875 (+) | 0.992 | 0.934 | tagaaTATTCtagaa V$OCT1_02 | 8910 (-) | 0.992 | 0.852 | ttcagGAATActcgg V$OCT1_02 | 9079 (-) | 1.000 | 0.828 | gggcaGCATAgtgaa V$OCT1_02 | 9138 (-) | 0.992 | 0.833 | tttttGAATAagaca V$OCT1_02 | 9165 (+) | 0.805 | 0.819 | cctaaGATGCatgaa V$OCT1_02 | 9417 (+) | 0.942 | 0.787 | aaaacTATCCcataa V$OCT1_02 | 9751 (-) | 0.753 | 0.780 | tatatTAATAttgaa V$OCT1_02 | 9754 (+) | 0.753 | 0.782 | attaaTATTGaaaac V$OCT1_02 | 9830 (-) | 0.942 | 0.791 | agaagGGATAatttt 
V$OCT1_02 | 9912 (-) | 0.756 | 0.778 | ttactGGATTtttgc V$OCT1_02 | 9939 (-) | 0.749 | 0.777 | tgcatGTATGttata V$OCT1_02 | 9956 (-) | 0.749 | 0.774 | accctGTATGtttct V$OCT1_02 | 10906 (+) | 1.000 | 0.888 | gctacTATGCaaata V$OCT1_02 | 11084 (-) | 0.806 | 0.791 | tctggGAATTttcca V$OCT1_02 | 11198 (-) | 0.753 | 0.784 | tgattGATTAttaat V$OCT1_02 | 11279 (-) | 1.000 | 0.777 | ttaagGCATAgggca V$OCT1_02 | 11412 (-) | 0.814 | 0.771 | attctGCATTtctat V$OCT1_02 | 11432 (-) | 1.000 | 0.835 | tatccGCATActtat V$OCT1_02 | 11496 (-) | 0.806 | 0.817 | cggttGAATTtttgt V$OCT1_02 | 11506 (-) | 0.942 | 0.925 | tttgtGGATAttacc V$OCT1_02 | 11548 (-) | 0.709 | 0.773 | tagatGGGTAttcaa V$OCT1_02 | 11551 (+) | 0.992 | 0.857 | atgggTATTCaacca V$OCT1_02 | 11579 (+) | 0.942 | 0.864 | gcttaTATCCaggag V$OCT1_03 | 47 (-) | 1.000 | 0.995 | atgtcATTACttg V$OCT1_03 | 61 (+) | 0.996 | 0.988 | gccATAATtactg V$OCT1_03 | 62 (-) | 1.000 | 0.992 | ccataATTACtgt V$OCT1_03 | 175 (-) | 1.000 | 0.936 | ctggcATTACtac V$OCT1_03 | 438 (+) | 0.982 | 0.980 | cgtTTAATcaggt V$OCT1_03 | 494 (-) | 0.996 | 0.943 | gcaacATTATtat V$OCT1_03 | 497 (-) | 0.996 | 0.991 | acattATTATata V$OCT1_03 | 650 (-) | 0.996 | 0.986 | catttATTATata V$OCT1_03 | 710 (-) | 0.934 | 0.936 | atctcTTTACaga V$OCT1_03 | 959 (-) | 0.934 | 0.932 | aaataTTTACagg V$OCT1_03 | 986 (+) | 0.982 | 0.977 | cagTTAATaattt V$OCT1_03 | 989 (+) | 0.996 | 0.940 | ttaATAATtttca V$OCT1_03 | 1647 (+) | 0.985 | 0.931 | gtgCTAATctact V$OCT1_03 | 1901 (-) | 0.996 | 0.989 | attttATTATttt V$OCT1_03 | 1991 (-) | 0.996 | 0.942 | ccaagATTATctg V$OCT1_03 | 2518 (-) | 1.000 | 0.945 | aaaaaATTACata V$OCT1_03 | 2525 (-) | 1.000 | 0.945 | tacatATTACttt V$OCT1_03 | 2573 (-) | 0.934 | 0.933 | aagtaTTTACaac V$OCT1_03 | 2651 (+) | 1.000 | 0.937 | tgcGTAATgcact V$OCT1_03 | 2723 (+) | 0.996 | 0.941 | tagATAATcttct V$OCT1_03 | 2898 (-) | 0.930 | 0.930 | tctttTTTATcca V$OCT1_03 | 2948 (-) | 1.000 | 0.948 | tgcctATTACgaa V$OCT1_03 | 3172 (-) | 0.996 | 0.940 | tgtacATTATatc V$OCT1_03 | 
3273 (-) | 0.996 | 0.948 | atgccATTATgat V$OCT1_03 | 3372 (+) | 0.985 | 0.983 | agcCTAATcatat V$OCT1_03 | 3411 (+) | 0.982 | 0.975 | aagTTAATgattc V$OCT1_03 | 3428 (+) | 0.982 | 0.930 | tgcTTAATtgact V$OCT1_03 | 3448 (+) | 0.982 | 0.974 | cagTTAATaaaac V$OCT1_03 | 3626 (+) | 0.996 | 0.943 | tttATAATagtac V$OCT1_03 | 3714 (-) | 0.934 | 0.931 | caataTTTACggg V$OCT1_03 | 3730 (-) | 0.996 | 0.941 | ctaaaATTATtat V$OCT1_03 | 3733 (-) | 0.996 | 0.989 | aaattATTATttc V$OCT1_03 | 3827 (+) | 0.996 | 0.991 | ttcATAATaaatt V$OCT1_03 | 3906 (+) | 0.982 | 0.975 | gtaTTAATaaagc V$OCT1_03 | 3921 (-) | 1.000 | 0.934 | tatgaATTACatt V$OCT1_03 | 3926 (-) | 0.996 | 0.945 | attacATTATcag V$OCT1_03 | 4058 (+) | 0.996 | 0.989 | gcaATAATtaaaa V$OCT1_03 | 4059 (-) | 0.982 | 0.975 | caataATTAAaat V$OCT1_03 | 4159 (+) | 1.000 | 0.936 | tttGTAATcctag V$OCT1_03 | 4247 (-) | 1.000 | 0.935 | attgtATTACttg V$OCT1_03 | 4282 (-) | 0.985 | 0.932 | cacacATTAGacc V$OCT1_03 | 4444 (-) | 0.982 | 0.933 | acacaATTAAtat V$OCT1_03 | 4712 (+) | 1.000 | 0.934 | tggGTAATtcact V$OCT1_03 | 4799 (+) | 0.996 | 0.941 | tgcATAATctctg V$OCT1_03 | 5004 (-) | 1.000 | 0.934 | cttgtATTACcca V$OCT1_03 | 5011 (-) | 0.982 | 0.934 | tacccATTAAgcc V$OCT1_03 | 5098 (-) | 0.996 | 0.984 | gggttATTATttg V$OCT1_03 | 5235 (-) | 1.000 | 0.945 | gaacgATTACatg V$OCT1_03 | 5411 (-) | 0.982 | 0.978 | atcttATTAAatc V$OCT1_03 | 5583 (+) | 0.996 | 0.991 | aggATAATaataa V$OCT1_03 | 5586 (+) | 0.996 | 0.991 | ataATAATaaaat V$OCT1_03 | 5596 (+) | 0.996 | 0.939 | aatATAATataac V$OCT1_03 | 5634 (+) | 0.930 | 0.931 | gggATAAAgacaa V$OCT1_03 | 5738 (+) | 0.996 | 0.943 | gccATAATataga V$OCT1_03 | 5788 (+) | 0.996 | 0.990 | gaaATAATtaaat V$OCT1_03 | 5789 (-) | 0.982 | 0.976 | aaataATTAAata V$OCT1_03 | 6807 (-) | 1.000 | 0.995 | ccctcATTACctt V$OCT1_03 | 6834 (-) | 0.934 | 0.930 | tgctaTTTACtgc V$OCT1_03 | 6955 (-) | 0.996 | 0.942 | tgtccATTATtct V$OCT1_03 | 7058 (+) | 0.982 | 0.932 | ttcTTAATttgtt V$OCT1_03 | 7257 (-) | 0.982 | 0.978 | 
gtctcATTAAaat V$OCT1_03 | 7353 (+) | 0.982 | 0.978 | atgTTAATcatgg V$OCT1_03 | 7399 (-) | 1.000 | 0.933 | gtagaATTACtga V$OCT1_03 | 7784 (-) | 0.996 | 0.940 | caaaaATTATccc V$OCT1_03 | 7827 (-) | 0.982 | 0.932 | aaacaATTAAccc V$OCT1_03 | 7903 (-) | 0.985 | 0.931 | ttgatATTAGctt V$OCT1_03 | 8150 (-) | 0.985 | 0.933 | aaaacATTAGctt V$OCT1_03 | 8570 (-) | 0.982 | 0.975 | ctttgATTAAatg V$OCT1_03 | 8600 (-) | 0.996 | 0.943 | ttaatATTATcat V$OCT1_03 | 8683 (+) | 1.000 | 0.989 | aagGTAATtagcc V$OCT1_03 | 8684 (-) | 0.985 | 0.978 | aggtaATTAGcct V$OCT1_03 | 8855 (-) | 0.982 | 0.930 | taaaaATTAAgac V$OCT1_03 | 8951 (-) | 0.985 | 0.933 | ttaaaATTAGtag V$OCT1_03 | 9589 (+) | 1.000 | 0.934 | cctGTAATcccag V$OCT1_03 | 9733 (+) | 0.996 | 0.942 | atgATAATgtcca V$OCT1_03 | 9752 (+) | 0.982 | 0.930 | ataTTAATattga V$OCT1_03 | 9834 (+) | 0.996 | 0.943 | gggATAATtttaa V$OCT1_03 | 9930 (+) | 1.000 | 0.945 | tggGTAATttgca V$OCT1_03 | 10125 (-) | 0.985 | 0.932 | ataaaATTAGtta V$OCT1_03 | 10137 (-) | 0.996 | 0.942 | aagaaATTATtat V$OCT1_03 | 10140 (-) | 0.996 | 0.990 | aaattATTATtat V$OCT1_03 | 10143 (-) | 0.996 | 0.989 | ttattATTATtgt V$OCT1_03 | 10170 (-) | 0.985 | 0.980 | tccttATTAGtgt V$OCT1_03 | 10505 (-) | 0.996 | 0.942 | agtcaATTATggc V$OCT1_03 | 10520 (-) | 0.996 | 0.944 | tagccATTATagg V$OCT1_03 | 10655 (+) | 0.985 | 0.935 | gtcCTAATgtcaa V$OCT1_03 | 10700 (-) | 0.930 | 0.931 | ccgtgTTTATgaa V$OCT1_03 | 10915 (+) | 0.996 | 0.942 | caaATAATagtct V$OCT1_03 | 10999 (-) | 1.000 | 0.948 | atcagATTACctg V$OCT1_03 | 11199 (-) | 0.996 | 0.987 | gattgATTATtaa V$OCT1_03 | 11202 (-) | 0.982 | 0.974 | tgattATTAAttt V$OCT1_03 | 11205 (+) | 0.982 | 0.930 | ttaTTAATttgtt V$OCT1_03 | 11296 (+) | 0.996 | 0.990 | gaaATAATaataa V$OCT1_03 | 11299 (+) | 0.996 | 0.991 | ataATAATaacga V$OCT1_03 | 11344 (-) | 0.996 | 0.946 | atgccATTATcga V$OCT1_03 | 11387 (-) | 0.982 | 0.977 | aacttATTAAatt V$OCT1_03 | 11510 (-) | 1.000 | 0.942 | tggatATTACcgt V$OCT1_03 | 11693 (+) | 0.982 | 0.974 | aagTTAATcactg 
V$OCT1_04 | 352 (+) | 0.877 | 0.874 | tacatcagATTCAaatcctttct V$OCT1_04 | 988 (-) | 0.952 | 0.913 | gttaataattTTCATgtggggac V$OCT1_04 | 3104 (-) | 0.948 | 0.933 | tatactaattTCCATgcggctat V$OCT1_04 | 3124 (-) | 0.961 | 0.954 | tatattaattTACATtcccacaa V$OCT1_04 | 3830 (-) | 0.948 | 0.946 | ataataaattTCCATtctcataa V$OCT1_04 | 4056 (+) | 0.829 | 0.873 | aggcaataATTAAaatatgtaaa V$OCT1_04 | 4064 (+) | 0.961 | 0.899 | attaaaatATGTAaaatgtatac V$OCT1_04 | 4446 (-) | 0.952 | 0.895 | acaattaataTTCATttcacact V$OCT1_04 | 5336 (-) | 1.000 | 0.906 | ccgatagtttTGCATcccctctt V$OCT1_04 | 5398 (-) | 0.952 | 0.897 | tttatattttTTCATcttattaa V$OCT1_04 | 5431 (-) | 0.847 | 0.879 | ccactttattTGTATtgccatat V$OCT1_04 | 6570 (+) | 1.000 | 0.935 | ccacagttATGCAaaacacaaaa V$OCT1_04 | 9929 (-) | 1.000 | 0.956 | atgggtaattTGCATgtatgtta V$OCT1_04 | 10904 (+) | 1.000 | 0.959 | acgctactATGCAaataatagtc V$OCT1_04 | 11147 (+) | 0.952 | 0.922 | gtactctgATGAAaatgaagctg V$OCT1_04 | 11330 (+) | 0.961 | 0.941 | tgatgacgATGTAaatgccatta V$OCT1_04 | 11660 (-) | 0.952 | 0.949 | taacacaattTTCATaaccaaaa V$AHR_01 | 1423 (-) | 1.000 | 0.812 | agaCACGCaatcagtggc V$AHR_01 | 5813 (+) | 1.000 | 0.799 | aatagccctgGCGTGgca V$AHR_01 | 6348 (-) | 0.800 | 0.805 | tttCACGAaagctcaggg V$AHR_01 | 7160 (+) | 0.800 | 0.801 | cctggagctgGCGTTgca V$AHR_01 | 9065 (-) | 0.800 | 0.828 | ttcCAGGCtagccagggc I$BCD_01 | 2025 (+) | 1.000 | 1.000 | ggGATTAa V$LYF1_01 | 193 (-) | 1.000 | 0.890 | ccTCCCAca V$LYF1_01 | 1277 (+) | 1.000 | 0.878 | tcTGGGAtg V$LYF1_01 | 2381 (-) | 1.000 | 0.905 | ttTCCCAga V$LYF1_01 | 2714 (-) | 1.000 | 0.956 | ctTCCCAaa V$LYF1_01 | 3284 (+) | 1.000 | 0.862 | atTGGGAtg V$LYF1_01 | 4115 (-) | 1.000 | 0.905 | ttTCCCAga V$LYF1_01 | 5993 (+) | 0.861 | 0.843 | ttAGGGAtg V$LYF1_01 | 6179 (-) | 1.000 | 0.840 | gtTCCCAga V$LYF1_01 | 7269 (-) | 0.859 | 0.852 | taTCCTAaa V$LYF1_01 | 7699 (+) | 1.000 | 0.921 | ctTGGGAgg V$LYF1_01 | 7791 (-) | 1.000 | 0.951 | taTCCCAaa V$LYF1_01 | 8115 (-) | 1.000 | 0.842 | tcTCCCAcc 
V$LYF1_01 | 8762 (-) | 1.000 | 0.869 | ccTCCCAgc V$LYF1_01 | 8772 (+) | 1.000 | 0.859 | ccTGGGAgg V$LYF1_01 | 8960 (+) | 0.861 | 0.844 | gtAGGGAga V$LYF1_01 | 9422 (-) | 1.000 | 0.852 | taTCCCAta V$LYF1_01 | 9604 (+) | 1.000 | 0.956 | ttTGGGAag V$LYF1_01 | 10612 (+) | 1.000 | 0.890 | tgTGGGAgg V$LYF1_01 | 11084 (+) | 1.000 | 0.840 | tcTGGGAat F$NIT2_01 | 310 (+) | 1.000 | 0.996 | TATCTc F$NIT2_01 | 567 (-) | 1.000 | 0.996 | gAGATA F$NIT2_01 | 2723 (-) | 1.000 | 1.000 | tAGATA F$NIT2_01 | 4578 (-) | 1.000 | 1.000 | tAGATA F$NIT2_01 | 7155 (+) | 1.000 | 0.996 | TATCTc F$NIT2_01 | 7952 (-) | 1.000 | 0.996 | gAGATA F$NIT2_01 | 11444 (+) | 1.000 | 1.000 | TATCTa V$PAX5_01 | 6306 (-) | 0.988 | 0.824 | ggcactgatctacttCCCTGactgaagc V$PAX5_01 | 7672 (+) | 0.965 | 0.858 | tggtggctCACAGctgtaaagccagaac V$PAX5_01 | 10848 (+) | 0.999 | 0.826 | gaagaagcCATTGtagagatgcagcgtg V$PAX5_02 | 406 (+) | 0.711 | 0.690 | aggcggaatgggtGAGCTtgtccaaaat V$PAX5_02 | 527 (+) | 0.866 | 0.658 | aaaatgatgggtgATGCGcttgtgtttt V$PAX5_02 | 533 (-) | 1.000 | 0.718 | atgggtgatgCGCTTgtgttttaacctt V$PAX5_02 | 661 (+) | 0.657 | 0.659 | tataagtacactgTAGCTgctcttcaga V$PAX5_02 | 670 (-) | 0.738 | 0.634 | actgtagctgCTCTTcagacacaccaga V$PAX5_02 | 691 (+) | 0.657 | 0.646 | caccagaagagggCATCGgatctcttta V$PAX5_02 | 702 (-) | 0.738 | 0.637 | ggcatcggatCTCTTtacagatggttgt V$PAX5_02 | 799 (+) | 0.657 | 0.679 | tgagccatttctcCAGCCcgacactttt V$PAX5_02 | 856 (-) | 0.738 | 0.713 | acgatcgttgCCCTTagaaatgcacaaa V$PAX5_02 | 903 (+) | 0.866 | 0.683 | ttcactttctttcATGCGtagcacttcc V$PAX5_02 | 964 (-) | 0.657 | 0.648 | tttacaggacCCCTAagcatcacagtta V$PAX5_02 | 965 (+) | 0.738 | 0.723 | ttacaggacccctAAGCAtcacagttaa V$PAX5_02 | 1000 (-) | 0.839 | 0.642 | catgtggggaCGCACctatatgtgattg V$PAX5_02 | 1130 (+) | 0.738 | 0.676 | cgtgtgttcctaaAACCGttactttatg V$PAX5_02 | 1178 (-) | 0.611 | 0.707 | gagatgggcaCCCCCaggaaagctggtt V$PAX5_02 | 1391 (+) | 0.711 | 0.704 | gggcaggtgaggaGAGTGggtccattgc V$PAX5_02 | 1418 (-) | 0.785 | 0.638 | 
ccgtgagacaCGCAAtcagtggcaatag V$PAX5_02 | 1443 (+) | 0.711 | 0.686 | tagtgaagtcactGAGCCggacgtttct V$PAX5_02 | 1939 (-) | 0.657 | 0.724 | gccttggctaGGCTGgaactttcctttt V$PAX5_02 | 2022 (+) | 0.711 | 0.631 | gctgggattaaagGAGTGtaccactgta V$PAX5_02 | 2033 (-) | 0.657 | 0.710 | aggagtgtacCACTGtacctggcttctc V$PAX5_02 | 2096 (-) | 0.657 | 0.662 | ctctctgtctCTCTGtctctctctctct V$PAX5_02 | 2104 (-) | 0.711 | 0.685 | ctctctgtctCTCTCtctctctctctct V$PAX5_02 | 2321 (-) | 0.657 | 0.659 | gatacagtctTGCTAtgtgaccctacgc V$PAX5_02 | 2331 (-) | 0.657 | 0.628 | tgctatgtgaCCCTAcgctcttctcaaa V$PAX5_02 | 2446 (-) | 0.919 | 0.709 | tcagtggtagCGCTAtggcctagcaagc V$PAX5_02 | 2457 (+) | 1.000 | 0.682 | gctatggcctagcAAGCGcgaaggccct V$PAX5_02 | 2663 (-) | 0.738 | 0.710 | tttgatgacaCTCTTaacccgttgcatc V$PAX5_02 | 2688 (-) | 0.711 | 0.689 | atcctggtcaCACTCttctccttcctct V$PAX5_02 | 2849 (+) | 0.711 | 0.713 | attccgtcctttaGAGCTgaacagaact V$PAX5_02 | 2968 (-) | 0.785 | 0.652 | acggcagatgCGCAGgcatctacttccg V$PAX5_02 | 2999 (+) | 0.900 | 0.698 | ctccaaggtgcgcAGGCGtctcggcagt V$PAX5_02 | 3028 (-) | 0.973 | 0.640 | caccgacttcCGCTCcacccgggacata V$PAX5_02 | 3487 (-) | 0.738 | 0.682 | gaggcagaatAGCTTaactaccctgctg V$PAX5_02 | 3709 (+) | 0.604 | 0.639 | ttgatcaatatttACGGGtgactaaaat V$PAX5_02 | 3764 (-) | 0.711 | 0.633 | taaaaggcaaTGCTCcttcccccctttt V$PAX5_02 | 3906 (-) | 0.738 | 0.629 | gtattaataaAGCTTtatgaattacatt V$PAX5_02 | 4520 (+) | 0.866 | 0.655 | accaagacaacaaATGCGtcaacagcct V$PAX5_02 | 4564 (-) | 0.738 | 0.664 | gacacagttgCGGTTagatatagtctcc V$PAX5_02 | 4758 (+) | 0.919 | 0.685 | ctgtaggtcccagCAGCGagtctggata V$PAX5_02 | 4963 (-) | 0.711 | 0.640 | caggaagtcaGGCTCagcccctacctgt V$PAX5_02 | 5026 (-) | 0.711 | 0.670 | gggactgggaTGCTCaggattgctgtgg V$PAX5_02 | 5069 (-) | 0.657 | 0.628 | agagatgagaGGCTGgctctgactggtt V$PAX5_02 | 5242 (-) | 0.604 | 0.630 | tacatgggttCACATgtggtcacagtca V$PAX5_02 | 5308 (+) | 0.657 | 0.722 | aaaatggcttcttCATCGgttcagtgcg V$PAX5_02 | 5482 (-) | 0.738 | 0.644 | 
ttttttgatcCCCTTccctctgctccca V$PAX5_02 | 5682 (-) | 0.657 | 0.631 | cacagagaccCACTAgttcatacactta V$PAX5_02 | 5808 (+) | 0.819 | 0.713 | aaaaaaatagcccTGGCGtggcactgtg V$PAX5_02 | 5857 (-) | 0.604 | 0.662 | gagttggtttCCCATggaccatctcctg V$PAX5_02 | 5889 (-) | 0.873 | 0.669 | gcatggggccCGCCCctaagagtagttt V$PAX5_02 | 5911 (+) | 0.919 | 0.717 | tagtttgtttcccCAGCGagactccctt V$PAX5_02 | 6215 (-) | 0.973 | 0.690 | aggtttctcaCGCTCtgcatgctgtctg V$PAX5_02 | 6232 (-) | 0.657 | 0.716 | catgctgtctGGCTAcggctttctattt V$PAX5_02 | 6398 (-) | 0.973 | 0.669 | gtgttcagaaCGCTCagaatgactcctc V$PAX5_02 | 6586 (+) | 0.638 | 0.631 | cacaaaacaaataAGGTGgtgctatttg V$PAX5_02 | 6715 (+) | 0.738 | 0.630 | gccacaacctttaAAGGGtgagcagccc V$PAX5_02 | 6933 (+) | 0.738 | 0.631 | atgtgcagtacccAAGTGatactgtcca V$PAX5_02 | 7080 (+) | 0.839 | 0.657 | gtatgagtgctttGCGCGcacctagtct V$PAX5_02 | 7275 (+) | 0.657 | 0.639 | aaaataagacccaTAGAGtaccaagctg V$PAX5_02 | 7474 (-) | 0.839 | 0.646 | gagtgtgctaCGCGCacgtctgttcttc V$PAX5_02 | 7492 (+) | 0.711 | 0.637 | tctgttcttcttgGAGGGtgtcagatgg V$PAX5_02 | 7838 (+) | 1.000 | 0.630 | ccacttctggcaaAAGCGacagcgtaca V$PAX5_02 | 8195 (-) | 0.638 | 0.675 | ctgcctgtttCCCCTctgacctcagttc V$PAX5_02 | 8283 (-) | 0.738 | 0.664 | tgccctggctCCCTTcgcatgcccctct V$PAX5_02 | 8340 (-) | 0.738 | 0.656 | tttgctggacCACTTgccaacccctgtg V$PAX5_02 | 8385 (+) | 0.738 | 0.650 | atctgtatacctgAAGCCaagccagccc V$PAX5_02 | 8421 (+) | 0.711 | 0.669 | agcagctgttctaGAGCTttgcaacccg V$PAX5_02 | 8496 (+) | 0.839 | 0.684 | tctacagggacagGTGCGggacactcac V$PAX5_02 | 8813 (-) | 0.711 | 0.644 | gcttctggtaCTCTCgatgttcaaaacc V$PAX5_02 | 9083 (-) | 0.711 | 0.644 | agcatagtgaAGCTCtgcttacaaaagc V$PAX5_02 | 9094 (+) | 0.738 | 0.675 | gctctgcttacaaAAGCAaaacaaaaaa V$PAX5_02 | 9170 (-) | 0.819 | 0.637 | gatgcatgaaCGCCAcagagaccactca V$PAX5_02 | 9323 (-) | 0.657 | 0.644 | gggtcagagcCTCTGgaggacccctgaa V$PAX5_02 | 9576 (-) | 0.611 | 0.638 | gtggtggtatAGCCCtgtaatcccagca V$PAX5_02 | 9603 (+) | 0.638 | 0.643 | 
atttgggaagctgAGGTGggaggatcgg V$PAX5_02 | 10051 (+) | 0.738 | 0.679 | gttctaagaactgAACCGggtcttctgg V$PAX5_02 | 10213 (+) | 0.711 | 0.660 | cagtggcatctaaGAGTGgaacctaggt V$PAX5_02 | 10240 (-) | 0.738 | 0.690 | ttggtggtcaGGCTTgtgcaggaagtga V$PAX5_02 | 10265 (-) | 0.638 | 0.644 | tgatttttgcCGGCTgagcaatctcatc V$PAX5_02 | 10549 (+) | 0.819 | 0.668 | aagtatgtacagcCGGCGgtacagcgtt V$PAX5_02 | 10667 (+) | 0.657 | 0.663 | acaggaggatccaCAGCAggactctggc V$PAX5_02 | 10822 (+) | 0.638 | 0.631 | agtccgccttttcAGGCCcatcgcagga V$PAX5_02 | 10856 (+) | 0.919 | 0.670 | cattgtagagatgCAGCGtggtggtggg V$PAX5_02 | 10895 (-) | 0.919 | 0.652 | tgcgcagagaCGCTActatgcaaataat V$PAX5_02 | 10914 (+) | 0.919 | 0.647 | gcaaataatagtcTAGCGcctcttgtgg V$PAX5_02 | 11155 (-) | 0.657 | 0.643 | atgaaaatgaAGCTGcagagatcttttt V$PAX5_02 | 11326 (-) | 0.657 | 0.635 | atgatgatgaCGATGtaaatgccattat V$PAX5_02 | 11578 (+) | 0.711 | 0.652 | tgcttatatccagGAGTGagactagagt V$PAX5_02 | 11602 (+) | 0.738 | 0.631 | gagtgcacagagaAAGGGgtaccagaag V$PAX5_02 | 11882 (-) | 0.711 | 0.640 | ttagttctcaCTCTCaaccaatagcacg V$PAX5_02 | 11905 (+) | 0.611 | 0.635 | gcacgaaggcttcGGGCCcaacgcccat V$BRN2_01 | 498 (-) | 1.000 | 0.912 | cattATTATatatttg V$BRN2_01 | 647 (-) | 0.993 | 0.928 | attcATTTAttatata V$BRN2_01 | 651 (-) | 1.000 | 0.932 | atttATTATatataag V$BRN2_01 | 1886 (-) | 1.000 | 0.913 | tcccATTTTatattta V$BRN2_01 | 2519 (-) | 1.000 | 0.899 | aaaaATTACatattac V$BRN2_01 | 2719 (+) | 1.000 | 0.906 | caaatagATAATcttc V$BRN2_01 | 2765 (-) | 0.917 | 0.921 | ctagATTCCgcatgtg V$BRN2_01 | 2949 (-) | 1.000 | 0.929 | gcctATTACgaataca V$BRN2_01 | 2950 (+) | 0.904 | 0.865 | cctattaCGAATacag V$BRN2_01 | 3823 (+) | 1.000 | 0.944 | tcaattcATAATaaat V$BRN2_01 | 3827 (+) | 0.993 | 0.940 | ttcataaTAAATttcc V$BRN2_01 | 4051 (+) | 0.917 | 0.874 | gaaatagGCAATaatt V$BRN2_01 | 4069 (+) | 1.000 | 0.918 | aatatgtAAAATgtat V$BRN2_01 | 4154 (-) | 0.987 | 0.907 | tcccATTTGtaatcct V$BRN2_01 | 4155 (+) | 1.000 | 0.960 | cccatttGTAATccta V$BRN2_01 | 5005 (-) | 1.000 
| 0.929 | ttgtATTACccattaa V$BRN2_01 | 5301 (+) | 1.000 | 0.962 | gccatccAAAATggct V$BRN2_01 | 5781 (+) | 1.000 | 0.917 | aaaatatGAAATaatt V$BRN2_01 | 5842 (+) | 0.904 | 0.896 | ggcatctCCAATgttg V$BRN2_01 | 5988 (-) | 0.987 | 0.973 | ctggATTAGggatggg V$BRN2_01 | 6490 (-) | 0.917 | 0.920 | ctggATTGCagatgcg V$BRN2_01 | 6959 (-) | 0.917 | 0.929 | cattATTCTggatgga V$BRN2_01 | 7311 (+) | 0.917 | 0.869 | aggatatAGAATgtac V$BRN2_01 | 8593 (+) | 0.993 | 0.948 | aacatctTTAATatta V$BRN2_01 | 8648 (-) | 0.917 | 0.910 | gacaATTCCcaatggt V$BRN2_01 | 8649 (+) | 0.904 | 0.860 | acaattcCCAATggtt V$BRN2_01 | 9485 (-) | 0.987 | 0.926 | ctttATTTGcaataga V$BRN2_01 | 9486 (+) | 0.917 | 0.865 | tttatttGCAATagaa V$BRN2_01 | 9493 (+) | 1.000 | 0.915 | gcaatagAAAATctga V$BRN2_01 | 9881 (-) | 0.993 | 0.907 | atcaATTTAaaattct V$BRN2_01 | 9882 (+) | 1.000 | 0.909 | tcaatttAAAATtctt V$BRN2_01 | 9926 (+) | 1.000 | 0.920 | cctatggGTAATttgc V$BRN2_01 | 10115 (+) | 0.904 | 0.936 | gccatccCCAATaaaa V$BRN2_01 | 10455 (+) | 0.917 | 0.878 | tatatcaACAATgcat V$BRN2_01 | 10585 (-) | 1.000 | 0.912 | ttcaATTTCctatatc V$BRN2_01 | 11196 (-) | 0.911 | 0.870 | cttgATTGAttattaa V$BRN2_01 | 11200 (-) | 1.000 | 0.929 | attgATTATtaatttg V$BRN2_01 | 11201 (+) | 0.993 | 0.909 | ttgattaTTAATttgt V$BRN2_01 | 11408 (-) | 0.917 | 0.894 | atttATTCTgcatttc V$BRN2_01 | 11464 (-) | 1.000 | 0.885 | tgaaATTTTttatttg V$BRN2_01 | 11572 (-) | 0.917 | 0.867 | tatcATTGCttatatc V$HSF1_01 | 1953 (-) | 0.974 | 0.976 | ggaacTTTCC V$HSF1_01 | 4885 (+) | 0.976 | 0.969 | GGAACattcc V$HSF1_01 | 4885 (-) | 0.977 | 0.978 | ggaacATTCC V$HSF1_01 | 8078 (+) | 0.997 | 0.970 | AGAAActtcc V$HSF1_01 | 8876 (+) | 1.000 | 0.986 | AGAATattct V$HSF1_01 | 8876 (-) | 1.000 | 0.986 | agaatATTCT V$HSF2_01 | 1953 (+) | 0.996 | 0.986 | GGAACtttcc V$HSF2_01 | 1953 (-) | 0.995 | 0.986 | ggaacTTTCC V$HSF2_01 | 2561 (+) | 0.990 | 0.982 | TGAAAtttcc V$HSF2_01 | 2561 (-) | 0.995 | 0.982 | tgaaaTTTCC V$HSF2_01 | 2781 (-) | 0.997 | 0.977 | agattCTTCC V$HSF2_01 | 
4885 (+) | 0.996 | 0.987 | GGAACattcc V$HSF2_01 | 4885 (-) | 0.999 | 0.988 | ggaacATTCC V$HSF2_01 | 7015 (-) | 0.995 | 0.969 | ggatgTTTCC V$HSF2_01 | 7715 (+) | 0.998 | 0.968 | AGAAGgatca V$HSF2_01 | 8078 (+) | 0.996 | 0.986 | AGAAActtcc V$HSF2_01 | 8078 (-) | 0.997 | 0.993 | agaaaCTTCC V$HSF2_01 | 8524 (-) | 0.992 | 0.974 | agatgCTTCA V$HSF2_01 | 8876 (+) | 1.000 | 0.996 | AGAATattct V$HSF2_01 | 8876 (-) | 1.000 | 0.996 | agaatATTCT V$HSF2_01 | 8914 (+) | 0.999 | 0.968 | GGAATactcg V$HSF2_01 | 10709 (+) | 0.990 | 0.973 | TGAAAcatct V$HSF2_01 | 11088 (+) | 0.999 | 0.987 | GGAATtttcc V$HSF2_01 | 11088 (-) | 0.995 | 0.986 | ggaatTTTCC P$SBF1_01 | 520 (-) | 1.000 | 0.921 | gatTTTAAaaatga P$SBF1_01 | 983 (+) | 1.000 | 0.934 | tcacagTTAATaat P$SBF1_01 | 2401 (+) | 1.000 | 0.924 | ctttccTTAAAaaa P$SBF1_01 | 3307 (+) | 0.934 | 0.922 | atgtagTTAAGatt P$SBF1_01 | 3445 (+) | 1.000 | 0.941 | taacagTTAATaaa P$SBF1_01 | 4059 (+) | 1.000 | 0.924 | caataaTTAAAata P$SBF1_01 | 7257 (+) | 1.000 | 0.924 | gtctcaTTAAAata P$SBF1_01 | 10305 (-) | 1.000 | 0.915 | tttTTTAActgttt P$SBF1_01 | 10417 (-) | 1.000 | 0.939 | attTTTAAcactaa P$AG_01 | 164 (+) | 0.967 | 0.775 | cgttCCCAAtgctggcat P$AG_01 | 165 (+) | 0.878 | 0.858 | gttcCCAATgctggcatt P$AG_01 | 1134 (+) | 0.994 | 0.776 | tgttCCTAAaaccgttac P$AG_01 | 1551 (-) | 0.875 | 0.790 | gaaaactctTCTGGaaat P$AG_01 | 2381 (+) | 0.875 | 0.768 | tttcCCAGAgctgagagt P$AG_01 | 2713 (+) | 0.967 | 0.813 | tcttCCCAAatagataat P$AG_01 | 2714 (+) | 1.000 | 0.809 | cttcCCAAAtagataatc P$AG_01 | 2785 (+) | 0.915 | 0.786 | tcttCCTTAgctggtctt P$AG_01 | 3638 (+) | 0.881 | 0.771 | cctaCCACAtggtggaat P$AG_01 | 3860 (+) | 0.798 | 0.780 | agtaCCATTtaaggagtt P$AG_01 | 4394 (+) | 0.840 | 0.795 | agttCCAAGacaggtcag P$AG_01 | 5009 (+) | 0.845 | 0.807 | attaCCCATtaagccacg P$AG_01 | 5984 (-) | 0.994 | 0.825 | acttctggaTTAGGgatg P$AG_01 | 6266 (+) | 0.971 | 0.846 | tctgCCGAAgaaggaagc P$AG_01 | 6369 (-) | 1.000 | 0.784 | taagcgagaTTTGGtaga P$AG_01 | 7573 (-) | 0.869 | 0.788 
| aaatccccaTCAGGtcaa P$AG_01 | 7834 (-) | 0.875 | 0.856 | taacccactTCTGGcaaa P$AG_01 | 8135 (-) | 0.875 | 0.770 | gctttctgcTGAGGgaaa P$AG_01 | 8462 (-) | 0.915 | 0.785 | ggtctcatcTAAGGcaaa P$AG_01 | 9595 (-) | 1.000 | 0.851 | atcccagcaTTTGGgaag P$AG_01 | 11442 (-) | 1.000 | 0.795 | cttatctatTTTGGtgaa P$AG_01 | 11534 (+) | 0.881 | 0.858 | aatcCCACAtatggtaga P$AG_01 | 11534 (-) | 0.920 | 0.875 | aatcccacaTATGGtaga P$AG_01 | 11673 (+) | 1.000 | 0.774 | ataaCCAAAaaagaccaa P$AG_01 | 11795 (-) | 0.878 | 0.773 | aaatcccagATTGGaagg V$SRF_01 | 11534 (+) | 0.812 | 0.772 | aatcCCACAtatggtaga F$STRE_01 | 1609 (+) | 1.000 | 1.000 | tcAGGGGg F$STRE_01 | 8482 (+) | 1.000 | 0.999 | taAGGGGg V$ARP1_01 | 50 (+) | 0.811 | 0.727 | tcattACTTGggccat V$ARP1_01 | 254 (+) | 0.721 | 0.698 | tgtaaGGTTGaactca V$ARP1_01 | 698 (-) | 0.833 | 0.746 | agagggCATCGgatct V$ARP1_01 | 791 (+) | 0.833 | 0.805 | ttaacCGCTGagccat V$ARP1_01 | 952 (-) | 0.789 | 0.701 | aatgatCAAATattta V$ARP1_01 | 1126 (+) | 0.693 | 0.703 | tgcacGTGTGttccta V$ARP1_01 | 1239 (-) | 0.924 | 0.734 | tgacatCAGGGcagca V$ARP1_01 | 1604 (-) | 0.924 | 0.820 | ggggctCAGGGggtct V$ARP1_01 | 1688 (-) | 0.924 | 0.737 | tcagctCAGGGgtaga V$ARP1_01 | 2038 (+) | 0.833 | 0.731 | tgtacCACTGtacctg V$ARP1_01 | 2411 (-) | 0.820 | 0.705 | aaaaaaCAAAAaatca V$ARP1_01 | 2638 (+) | 0.924 | 0.718 | tctacCCCTGacctgc V$ARP1_01 | 2677 (+) | 0.909 | 0.752 | taaccCGTTGcatcct V$ARP1_01 | 3072 (-) | 0.882 | 0.767 | ctaggtCACAGggtga V$ARP1_01 | 3456 (-) | 0.924 | 0.767 | aaaacaCAGGGcctca V$ARP1_01 | 3523 (+) | 1.000 | 0.725 | ggccaCCTTGgcccga V$ARP1_01 | 3661 (-) | 0.882 | 0.715 | aagtaaCACAGactta V$ARP1_01 | 3685 (-) | 0.843 | 0.726 | aagtgcCAAGAacaca V$ARP1_01 | 4020 (-) | 0.924 | 0.726 | tgctctCATGGaatta V$ARP1_01 | 4198 (-) | 1.000 | 0.719 | caagttCAAGGcttgc V$ARP1_01 | 4357 (-) | 0.744 | 0.732 | gaggatCAGAAgttta V$ARP1_01 | 5213 (+) | 0.744 | 0.704 | tgtgcTTCTGagcgga V$ARP1_01 | 5238 (-) | 0.924 | 0.801 | cgattaCATGGgttca V$ARP1_01 | 5315 (-) | 0.833 | 
0.741 | cttcttCATCGgttca V$ARP1_01 | 5479 (+) | 0.820 | 0.819 | tgattTTTTGatcccc V$ARP1_01 | 5826 (+) | 0.882 | 0.736 | tggcaCTGTGaaacaa V$ARP1_01 | 6045 (+) | 0.924 | 0.700 | tgcagCCCTGtgacag V$ARP1_01 | 6207 (-) | 1.000 | 0.701 | gtatgcCAAGGtttct V$ARP1_01 | 6556 (-) | 0.924 | 0.699 | aagagtCATGGttgcc V$ARP1_01 | 6763 (+) | 0.924 | 0.732 | ttataCCATGaacaaa V$ARP1_01 | 7125 (-) | 1.000 | 0.721 | aatgctCAAGGaggcc V$ARP1_01 | 7234 (-) | 0.833 | 0.790 | gaagttCATCGtctca V$ARP1_01 | 7601 (+) | 0.676 | 0.819 | tgaacTGCTGagccat V$ARP1_01 | 8203 (+) | 0.901 | 0.747 | ttcccCTCTGacctca V$ARP1_01 | 8258 (+) | 0.909 | 0.708 | cgtggCATTGacctga V$ARP1_01 | 8266 (+) | 0.676 | 0.706 | tgaccTGATGatgcag V$ARP1_01 | 8302 (+) | 0.901 | 0.731 | tgcccCTCTGccctgc V$ARP1_01 | 8453 (-) | 1.000 | 0.763 | ctaagcCAAGGtctca V$ARP1_01 | 8526 (-) | 0.843 | 0.724 | atgcttCAAGAacaca V$ARP1_01 | 8706 (-) | 1.000 | 0.719 | taccatCAAGGagtct V$ARP1_01 | 9032 (-) | 1.000 | 0.711 | tgagttCAAGGgcagc V$ARP1_01 | 9090 (+) | 0.901 | 0.731 | tgaagCTCTGcttaca V$ARP1_01 | 9190 (-) | 0.843 | 0.706 | accactCAAGAgagca V$ARP1_01 | 9259 (-) | 0.924 | 0.705 | atgggaCAGGGcagcg V$ARP1_01 | 9294 (+) | 0.725 | 0.734 | tcagcTTGTGtgccca V$ARP1_01 | 9321 (-) | 0.901 | 0.802 | ttgggtCAGAGcctct V$ARP1_01 | 9339 (+) | 0.924 | 0.701 | aggacCCCTGaagtct V$ARP1_01 | 9387 (+) | 0.924 | 0.765 | tcagcCCCTGgaatgt V$ARP1_01 | 9713 (+) | 0.721 | 0.698 | tgtaaAATTGatctct V$ARP1_01 | 10096 (+) | 0.901 | 0.836 | ttaacCTCTGagccat V$ARP1_01 | 10349 (-) | 0.977 | 0.806 | ttaattCAAAGtgtca V$ARP1_01 | 10357 (-) | 0.901 | 0.781 | aagtgtCAGAGaatct V$ARP1_01 | 10495 (-) | 0.820 | 0.722 | aaggatCAAAAgtcaa V$ARP1_01 | 10979 (+) | 0.882 | 0.703 | ggagcCTGTGtggccc V$ARP1_01 | 10987 (+) | 0.924 | 0.725 | tgtggCCCTGgaatca V$ARP1_01 | 11456 (+) | 0.814 | 0.797 | tgaatCAGTGaaattt V$ARP1_01 | 11924 (-) | 0.833 | 0.737 | aacgccCATTGgctga V$ARP1_01 | 11949 (-) | 0.767 | 0.779 | aagggtCAGGActgca V$RORA1_01 | 1074 (+) | 1.000 | 0.968 | atgtgtaGGTCAt 
V$RORA1_01 | 9632 (+) | 1.000 | 0.976 | agttcaaGGTCAg V$COUP_01 | 946 (-) | 0.919 | 0.857 | aggaacaatGATCA V$COUP_01 | 3567 (+) | 0.796 | 0.817 | TGGCCtttggcaag V$COUP_01 | 7119 (-) | 0.919 | 0.858 | gtgtgcaatGCTCA V$COUP_01 | 8376 (+) | 0.697 | 0.813 | TGCTCtttgatctg V$COUP_01 | 10489 (-) | 0.919 | 0.831 | catatcaagGATCA V$COUP_01 | 10496 (-) | 0.701 | 0.834 | aggatcaaaAGTCA V$COUP_01 | 10799 (+) | 0.919 | 0.870 | TGAGCcttgtgcat V$COUP_01 | 11943 (-) | 1.000 | 0.875 | tatttcaagGGTCA V$OCT1_05 | 992 (+) | 0.934 | 0.917 | ataaTTTTCatgtg V$OCT1_05 | 3108 (+) | 0.934 | 0.905 | ctaaTTTCCatgcg V$OCT1_05 | 3128 (+) | 0.918 | 0.887 | ttaaTTTACattcc V$OCT1_05 | 6575 (-) | 1.000 | 0.875 | gttatGCAAAacac V$OCT1_05 | 7350 (-) | 0.852 | 0.890 | aacatGTTAAtcat V$OCT1_05 | 9933 (+) | 1.000 | 0.933 | gtaaTTTGCatgta V$OCT1_05 | 10909 (-) | 1.000 | 0.964 | actatGCAAAtaat V$OCT1_05 | 11664 (+) | 0.934 | 0.899 | acaaTTTTCataac V$OCT1_06 | 18 (-) | 0.901 | 0.936 | tacaTGTGAatgtg V$OCT1_06 | 44 (-) | 1.000 | 0.911 | cagaTGTCAttact V$OCT1_06 | 278 (-) | 0.964 | 0.897 | tgcaTGCAAgtcaa V$OCT1_06 | 357 (+) | 0.865 | 0.888 | cagatTCAAAtcct V$OCT1_06 | 383 (-) | 0.964 | 0.900 | gagaTGCCAttcat V$OCT1_06 | 498 (+) | 0.946 | 0.922 | cattaTTATAtatt V$OCT1_06 | 621 (-) | 0.982 | 0.922 | tacaTGACActttt V$OCT1_06 | 750 (-) | 0.911 | 0.891 | ggaaTTGAActcag V$OCT1_06 | 820 (+) | 0.946 | 0.911 | cacttTTTAAtact V$OCT1_06 | 907 (+) | 0.982 | 0.920 | ctttcTTTCAtgcg V$OCT1_06 | 979 (-) | 0.964 | 0.899 | agcaTCACAgttaa V$OCT1_06 | 992 (+) | 0.982 | 0.930 | ataatTTTCAtgtg V$OCT1_06 | 2355 (+) | 0.911 | 0.913 | caaacTTGTAttta V$OCT1_06 | 2519 (+) | 1.000 | 0.950 | aaaaaTTACAtatt V$OCT1_06 | 3108 (+) | 0.946 | 0.950 | ctaatTTCCAtgcg V$OCT1_06 | 3128 (+) | 1.000 | 0.907 | ttaatTTACAttcc V$OCT1_06 | 3270 (-) | 0.964 | 0.897 | gtgaTGCCAttatg V$OCT1_06 | 3834 (+) | 0.946 | 0.889 | taaatTTCCAttct V$OCT1_06 | 3923 (-) | 0.946 | 0.912 | tgaaTTACAttatc V$OCT1_06 | 4061 (-) | 0.946 | 0.889 | ataaTTAAAatatg V$OCT1_06 | 
4315 (-) | 0.928 | 0.902 | tgcaTAAAActggg V$OCT1_06 | 4793 (+) | 0.964 | 0.888 | ctgacTTGCAtaat V$OCT1_06 | 4886 (+) | 0.946 | 0.891 | gaacaTTCCAtgtt V$OCT1_06 | 5026 (+) | 0.946 | 0.891 | gggacTGGGAtgct V$OCT1_06 | 5619 (-) | 0.982 | 0.900 | aacaTCTAAtacac V$OCT1_06 | 5783 (-) | 0.982 | 0.901 | aataTGAAAtaatt V$OCT1_06 | 5790 (+) | 0.964 | 0.931 | aataaTTAAAtaca V$OCT1_06 | 5948 (+) | 0.928 | 0.888 | taaaaTTTTAtttg V$OCT1_06 | 6666 (-) | 0.865 | 0.889 | aagaTTTTAtttag V$OCT1_06 | 6783 (-) | 0.982 | 0.919 | cagaTCTAAtgctg V$OCT1_06 | 7259 (+) | 0.865 | 0.896 | ctcatTAAAAtatc V$OCT1_06 | 7342 (-) | 0.964 | 0.917 | cacaTTTCAacatg V$OCT1_06 | 7897 (+) | 0.964 | 0.899 | tttatTTTGAtatt V$OCT1_06 | 9933 (+) | 0.964 | 0.922 | gtaatTTGCAtgta V$OCT1_06 | 10109 (+) | 0.946 | 0.891 | catctTGCCAtccc V$OCT1_06 | 10211 (+) | 0.964 | 0.922 | cacagTGGCAtcta V$OCT1_06 | 10217 (-) | 0.982 | 0.928 | ggcaTCTAAgagtg V$OCT1_06 | 10466 (-) | 0.982 | 0.908 | tgcaTCTCActtta V$OCT1_06 | 10808 (-) | 0.982 | 0.911 | tgcaTCTAAatcgc V$OCT1_06 | 10814 (-) | 0.928 | 0.902 | taaaTCGCAgtccg V$OCT1_06 | 11341 (-) | 0.964 | 0.922 | taaaTGCCAttatc V$OCT1_06 | 11389 (-) | 0.946 | 0.907 | cttaTTAAAttttg V$OCT1_06 | 11634 (+) | 0.964 | 0.907 | ctcgtTTAAAtgta V$OCT1_06 | 11794 (-) | 0.946 | 0.891 | caaaTCCCAgattg I$HSF_04 | 8876 (+) | 1.000 | 0.985 | agaatatTCTAGaat F$HSF_04 | 8876 (+) | 1.000 | 0.998 | agaatatTCTAGaat V$AP1FJ_Q2 | 44 (-) | 0.975 | 0.934 | cagatGTCATt V$AP1FJ_Q2 | 132 (-) | 0.963 | 0.949 | gctcaGTCAAt V$AP1FJ_Q2 | 1077 (-) | 0.975 | 0.952 | tgtagGTCATt V$AP1FJ_Q2 | 1086 (-) | 0.963 | 0.946 | ttttgGTCAAc V$AP1FJ_Q2 | 1228 (+) | 0.975 | 0.960 | gATGACtgctt V$AP1FJ_Q2 | 1254 (-) | 0.975 | 0.960 | attctGTCATc V$AP1FJ_Q2 | 1353 (-) | 1.000 | 0.976 | cctggGTCACc V$AP1FJ_Q2 | 1445 (-) | 1.000 | 0.973 | gtgaaGTCACt V$AP1FJ_Q2 | 1682 (-) | 0.983 | 0.950 | cgggtGTCAGc V$AP1FJ_Q2 | 2595 (+) | 0.983 | 0.945 | aCTGACtatta V$AP1FJ_Q2 | 2666 (+) | 0.975 | 0.935 | gATGACactct V$AP1FJ_Q2 | 2689 (-) | 
1.000 | 0.938 | tcctgGTCACa V$AP1FJ_Q2 | 2964 (+) | 1.000 | 0.956 | aGTGACggcag V$AP1FJ_Q2 | 3071 (-) | 1.000 | 0.960 | gctagGTCACa V$AP1FJ_Q2 | 3228 (-) | 0.983 | 0.948 | taccaGTCAGc V$AP1FJ_Q2 | 3434 (+) | 0.963 | 0.947 | aTTGACttagg V$AP1FJ_Q2 | 3725 (+) | 1.000 | 0.994 | gGTGACtaaaa V$AP1FJ_Q2 | 3751 (-) | 0.975 | 0.966 | ttttaGTCATt V$AP1FJ_Q2 | 3933 (-) | 1.000 | 0.986 | tatcaGTCACt V$AP1FJ_Q2 | 4332 (+) | 1.000 | 0.971 | gGTGACatcca V$AP1FJ_Q2 | 4466 (+) | 0.983 | 0.966 | aCTGACtgacc V$AP1FJ_Q2 | 4532 (-) | 0.963 | 0.952 | aatgcGTCAAc V$AP1FJ_Q2 | 4792 (+) | 0.983 | 0.950 | gCTGACttgca V$AP1FJ_Q2 | 4863 (-) | 0.983 | 0.949 | gggagGTCAGc V$AP1FJ_Q2 | 4964 (-) | 0.983 | 0.954 | aggaaGTCAGg V$AP1FJ_Q2 | 5176 (+) | 1.000 | 0.951 | tGTGACtgtgt V$AP1FJ_Q2 | 5255 (-) | 1.000 | 0.955 | atgtgGTCACa V$AP1FJ_Q2 | 5261 (-) | 0.975 | 0.943 | tcacaGTCATc V$AP1FJ_Q2 | 6053 (+) | 1.000 | 0.947 | tGTGACagccc V$AP1FJ_Q2 | 6293 (+) | 0.975 | 0.965 | gATGACtgagc V$AP1FJ_Q2 | 6322 (+) | 0.983 | 0.953 | cCTGACtgaag V$AP1FJ_Q2 | 6415 (+) | 0.975 | 0.952 | aATGACtcctc V$AP1FJ_Q2 | 6433 (-) | 0.963 | 0.935 | catcgGTCAAc V$AP1FJ_Q2 | 7506 (-) | 0.983 | 0.943 | agggtGTCAGa V$AP1FJ_Q2 | 7881 (-) | 0.975 | 0.955 | tagaaGTCATt V$AP1FJ_Q2 | 8322 (-) | 0.983 | 0.972 | tctgcGTCAGc V$AP1FJ_Q2 | 8611 (-) | 0.963 | 0.943 | attccGTCAAt V$AP1FJ_Q2 | 9288 (-) | 0.983 | 0.952 | tgagaGTCAGc V$AP1FJ_Q2 | 9320 (-) | 0.983 | 0.945 | gttggGTCAGa V$AP1FJ_Q2 | 9381 (-) | 0.983 | 0.949 | gagctGTCAGc V$AP1FJ_Q2 | 9635 (-) | 0.983 | 0.940 | tcaagGTCAGc V$AP1FJ_Q2 | 9843 (-) | 0.983 | 0.936 | ttaaaGTCAGg V$AP1FJ_Q2 | 10034 (-) | 1.000 | 0.995 | tgtgaGTCACc V$AP1FJ_Q2 | 10190 (-) | 0.983 | 0.951 | cagcaGTCAGt V$AP1FJ_Q2 | 10241 (-) | 0.983 | 0.940 | tggtgGTCAGg V$AP1FJ_Q2 | 10632 (+) | 1.000 | 0.980 | aGTGACtgagg V$AP1FJ_Q2 | 11316 (+) | 1.000 | 0.974 | gGTGACgacga V$AP1FJ_Q2 | 11331 (+) | 0.975 | 0.936 | gATGACgatgt V$AP1FJ_Q2 | 11700 (-) | 1.000 | 0.955 | tcactGTCACc V$AP1FJ_Q2 | 11748 (+) | 0.975 | 0.938 | 
aATGACtagaa V$AP1_Q2 | 132 (-) | 1.000 | 0.950 | gctcAGTCAat V$AP1_Q2 | 480 (-) | 1.000 | 0.938 | tacaAGTCAac V$AP1_Q2 | 1228 (+) | 1.000 | 0.961 | gaTGACTgctt V$AP1_Q2 | 1254 (-) | 0.967 | 0.946 | attcTGTCAtc V$AP1_Q2 | 1353 (-) | 0.962 | 0.950 | cctgGGTCAcc V$AP1_Q2 | 1445 (-) | 1.000 | 0.948 | gtgaAGTCAct V$AP1_Q2 | 2595 (+) | 1.000 | 0.954 | acTGACTatta V$AP1_Q2 | 3228 (-) | 1.000 | 0.957 | taccAGTCAgc V$AP1_Q2 | 3434 (+) | 1.000 | 0.950 | atTGACTtagg V$AP1_Q2 | 3725 (+) | 1.000 | 0.989 | ggTGACTaaaa V$AP1_Q2 | 3751 (-) | 1.000 | 0.972 | ttttAGTCAtt V$AP1_Q2 | 3933 (-) | 1.000 | 0.982 | tatcAGTCAct V$AP1_Q2 | 4332 (+) | 0.967 | 0.941 | ggTGACAtcca V$AP1_Q2 | 4466 (+) | 1.000 | 0.960 | acTGACTgacc V$AP1_Q2 | 4532 (-) | 0.967 | 0.941 | aatgCGTCAac V$AP1_Q2 | 4792 (+) | 1.000 | 0.955 | gcTGACTtgca V$AP1_Q2 | 5261 (-) | 1.000 | 0.957 | tcacAGTCAtc V$AP1_Q2 | 6293 (+) | 1.000 | 0.967 | gaTGACTgagc V$AP1_Q2 | 6415 (+) | 1.000 | 0.944 | aaTGACTcctc V$AP1_Q2 | 7044 (+) | 1.000 | 0.941 | gtTGACTattt V$AP1_Q2 | 7881 (-) | 1.000 | 0.955 | tagaAGTCAtt V$AP1_Q2 | 8322 (-) | 0.967 | 0.967 | tctgCGTCAgc V$AP1_Q2 | 9288 (-) | 1.000 | 0.960 | tgagAGTCAgc V$AP1_Q2 | 10034 (-) | 1.000 | 0.991 | tgtgAGTCAcc V$AP1_Q2 | 10190 (-) | 1.000 | 0.946 | cagcAGTCAgt V$AP1_Q2 | 10632 (+) | 1.000 | 0.973 | agTGACTgagg V$AP1_Q2 | 11316 (+) | 0.967 | 0.953 | ggTGACGacga V$AP1_Q2 | 11700 (-) | 0.967 | 0.939 | tcacTGTCAcc V$AP1_Q2 | 11748 (+) | 1.000 | 0.945 | aaTGACTagaa V$AP1_Q6 | 132 (-) | 1.000 | 0.971 | gctcAGTCAat V$AP1_Q6 | 1228 (+) | 1.000 | 0.939 | gaTGACTgctt V$AP1_Q6 | 1353 (-) | 0.938 | 0.942 | cctgGGTCAcc V$AP1_Q6 | 1445 (-) | 1.000 | 0.938 | gtgaAGTCAct V$AP1_Q6 | 3434 (+) | 1.000 | 0.973 | atTGACTtagg V$AP1_Q6 | 3725 (+) | 1.000 | 0.986 | ggTGACTaaaa V$AP1_Q6 | 3751 (-) | 1.000 | 0.983 | ttttAGTCAtt V$AP1_Q6 | 3933 (-) | 1.000 | 0.977 | tatcAGTCAct V$AP1_Q6 | 4466 (+) | 1.000 | 0.977 | acTGACTgacc V$AP1_Q6 | 4532 (-) | 0.941 | 0.939 | aatgCGTCAac V$AP1_Q6 | 4964 (-) | 1.000 | 
0.938 | aggaAGTCAgg V$AP1_Q6 | 6293 (+) | 1.000 | 0.981 | gaTGACTgagc V$AP1_Q6 | 6322 (+) | 1.000 | 0.976 | ccTGACTgaag V$AP1_Q6 | 6415 (+) | 1.000 | 0.954 | aaTGACTcctc V$AP1_Q6 | 6555 (-) | 1.000 | 0.943 | taagAGTCAtg V$AP1_Q6 | 7881 (-) | 1.000 | 0.938 | tagaAGTCAtt V$AP1_Q6 | 8046 (+) | 1.000 | 0.939 | ctTGACTcgag V$AP1_Q6 | 8322 (-) | 0.941 | 0.944 | tctgCGTCAgc V$AP1_Q6 | 9288 (-) | 1.000 | 0.946 | tgagAGTCAgc V$AP1_Q6 | 9320 (-) | 0.938 | 0.937 | gttgGGTCAga V$AP1_Q6 | 10034 (-) | 1.000 | 0.999 | tgtgAGTCAcc V$AP1_Q6 | 10190 (-) | 1.000 | 0.935 | cagcAGTCAgt V$AP1_Q6 | 10632 (+) | 1.000 | 0.978 | agTGACTgagg V$AP1_Q6 | 11748 (+) | 1.000 | 0.934 | aaTGACTagaa V$AP4_Q5 | 7028 (+) | 1.000 | 0.984 | atCAGCTgaa V$AP4_Q5 | 7028 (-) | 1.000 | 0.976 | atcAGCTGaa V$AP4_Q5 | 7680 (+) | 1.000 | 0.987 | caCAGCTgta V$AP4_Q5 | 8277 (-) | 1.000 | 0.978 | tgcAGCTGcc V$AP4_Q5 | 8421 (+) | 1.000 | 0.985 | agCAGCTgtt V$AP4_Q5 | 8421 (-) | 1.000 | 0.983 | agcAGCTGtt V$AP4_Q6 | 7680 (+) | 1.000 | 0.976 | caCAGCTgta V$AP4_Q6 | 8421 (-) | 1.000 | 0.954 | agcAGCTGtt V$CREB_Q2 | 2964 (+) | 1.000 | 0.895 | agTGACGgcaga V$CREB_Q2 | 4531 (-) | 1.000 | 0.915 | aaatgCGTCAac V$CREB_Q2 | 8321 (-) | 1.000 | 0.942 | ttctgCGTCAgc V$CREB_Q2 | 8610 (-) | 1.000 | 0.909 | cattcCGTCAat V$CREB_Q2 | 11316 (+) | 1.000 | 0.892 | ggTGACGacgat V$CREB_Q4 | 1840 (-) | 1.000 | 0.876 | ggggtCGTCAac V$CREB_Q4 | 2964 (+) | 1.000 | 0.930 | agTGACGgcaga V$CREB_Q4 | 4531 (-) | 1.000 | 0.891 | aaatgCGTCAac V$CREB_Q4 | 8321 (-) | 1.000 | 0.914 | ttctgCGTCAgc V$CREB_Q4 | 8610 (-) | 1.000 | 0.914 | cattcCGTCAat V$CREB_Q4 | 11316 (+) | 1.000 | 0.901 | ggTGACGacgat V$CREB_Q4 | 11331 (+) | 1.000 | 0.863 | gaTGACGatgta V$CREBP1_Q2 | 2964 (+) | 1.000 | 0.877 | agTGACGgcaga V$E2_Q6 | 11305 (+) | 1.000 | 0.930 | ataacgactaCGGTGa P$GBP_Q6 | 6002 (-) | 1.000 | 0.955 | gggtACGTGtcc P$GBP_Q6 | 10441 (-) | 0.980 | 0.953 | tgcaACGTCtct V$MYB_Q6 | 125 (+) | 1.000 | 0.975 | gctAACTGct V$MYB_Q6 | 452 (-) | 0.964 | 0.952 | 
gtCTGTTttg V$MYB_Q6 | 589 (+) | 0.964 | 0.955 | attAACAGac V$MYB_Q6 | 841 (-) | 0.979 | 0.954 | acAAGTTaaa V$MYB_Q6 | 858 (-) | 0.955 | 0.948 | gaTCGTTgcc V$MYB_Q6 | 939 (-) | 0.979 | 0.956 | acAAGTTagg V$MYB_Q6 | 984 (-) | 1.000 | 0.967 | caCAGTTaat V$MYB_Q6 | 1027 (-) | 1.000 | 0.995 | ggCAGTTgaa V$MYB_Q6 | 1143 (-) | 0.980 | 0.948 | aaCCGTTact V$MYB_Q6 | 1316 (+) | 0.968 | 0.965 | cctAACTCcc V$MYB_Q6 | 1368 (-) | 0.955 | 0.952 | gcTCGTTtgc V$MYB_Q6 | 1461 (-) | 0.959 | 0.951 | ggACGTTtct V$MYB_Q6 | 1735 (+) | 0.979 | 0.975 | ctgAACTTcc V$MYB_Q6 | 1860 (+) | 1.000 | 0.961 | gccAACTGaa V$MYB_Q6 | 1952 (+) | 0.979 | 0.964 | tggAACTTtc V$MYB_Q6 | 1982 (+) | 0.968 | 0.954 | tcaAACTCac V$MYB_Q6 | 2354 (+) | 0.979 | 0.951 | tcaAACTTgt V$MYB_Q6 | 2375 (-) | 1.000 | 0.964 | caCAGTTttc V$MYB_Q6 | 2393 (-) | 0.968 | 0.959 | gaGAGTTgct V$MYB_Q6 | 2679 (-) | 0.980 | 0.953 | acCCGTTgca V$MYB_Q6 | 2826 (-) | 1.000 | 0.966 | tcCAGTTgtc V$MYB_Q6 | 3400 (-) | 1.000 | 0.961 | tcCAGTTata V$MYB_Q6 | 3409 (-) | 0.979 | 0.950 | aaAAGTTaat V$MYB_Q6 | 3446 (-) | 1.000 | 0.971 | aaCAGTTaat V$MYB_Q6 | 3499 (+) | 0.975 | 0.973 | cttAACTAcc V$MYB_Q6 | 3576 (-) | 0.979 | 0.973 | gcAAGTTatt V$MYB_Q6 | 3584 (+) | 0.968 | 0.960 | ttcAACTCtc V$MYB_Q6 | 3609 (-) | 1.000 | 0.961 | tgCAGTTttc V$MYB_Q6 | 4197 (-) | 0.979 | 0.973 | gcAAGTTcaa V$MYB_Q6 | 4214 (-) | 1.000 | 0.964 | ctCAGTTaaa V$MYB_Q6 | 4319 (+) | 1.000 | 0.966 | taaAACTGgg V$MYB_Q6 | 4364 (-) | 0.979 | 0.951 | agAAGTTtaa V$MYB_Q6 | 4391 (-) | 0.968 | 0.952 | gtGAGTTcca V$MYB_Q6 | 4537 (+) | 0.964 | 0.964 | gtcAACAGcc V$MYB_Q6 | 4566 (-) | 1.000 | 0.967 | caCAGTTgcg V$MYB_Q6 | 4920 (+) | 0.979 | 0.969 | cacAACTTac V$MYB_Q6 | 4942 (+) | 1.000 | 0.973 | ctgAACTGgg V$MYB_Q6 | 5149 (-) | 1.000 | 0.995 | ggCAGTTtac V$MYB_Q6 | 5279 (+) | 0.964 | 0.961 | aacAACAGgc V$MYB_Q6 | 5338 (-) | 0.975 | 0.964 | gaTAGTTttg V$MYB_Q6 | 5693 (-) | 0.975 | 0.951 | acTAGTTcat V$MYB_Q6 | 5727 (+) | 1.000 | 0.966 | ctaAACTGga V$MYB_Q6 | 5980 (+) | 0.979 | 
0.953 | gatAACTTct V$MYB_Q6 | 6011 (+) | 0.968 | 0.958 | tccAACTCtc V$MYB_Q6 | 6086 (-) | 0.968 | 0.954 | gtGAGTTcct V$MYB_Q6 | 6386 (+) | 1.000 | 0.993 | acgAACTGgc V$MYB_Q6 | 6524 (-) | 0.979 | 0.951 | cgAAGTTcac V$MYB_Q6 | 6541 (+) | 0.980 | 0.952 | acaAACGGct V$MYB_Q6 | 6571 (-) | 1.000 | 0.966 | caCAGTTatg V$MYB_Q6 | 6845 (+) | 1.000 | 0.978 | gccAACTGgt V$MYB_Q6 | 7040 (-) | 0.975 | 0.969 | gtTAGTTgac V$MYB_Q6 | 7185 (+) | 0.980 | 0.975 | ccgAACGGcc V$MYB_Q6 | 7233 (-) | 0.979 | 0.973 | ggAAGTTcat V$MYB_Q6 | 7491 (-) | 0.964 | 0.950 | gtCTGTTctt V$MYB_Q6 | 7521 (+) | 1.000 | 0.959 | ctgAACTGaa V$MYB_Q6 | 7533 (-) | 1.000 | 0.960 | taCAGTTggt V$MYB_Q6 | 7548 (+) | 1.000 | 0.969 | gccAACTGtg V$MYB_Q6 | 7600 (+) | 1.000 | 0.975 | ctgAACTGct V$MYB_Q6 | 7694 (+) | 0.979 | 0.949 | cagAACTTgg V$MYB_Q6 | 7724 (-) | 1.000 | 0.973 | agCAGTTtgg V$MYB_Q6 | 8003 (-) | 0.964 | 0.962 | gcCTGTTggt V$MYB_Q6 | 8078 (+) | 0.979 | 0.970 | agaAACTTcc V$MYB_Q6 | 8180 (+) | 0.979 | 0.953 | cagAACTTgt V$MYB_Q6 | 8197 (-) | 0.964 | 0.959 | gcCTGTTtcc V$MYB_Q6 | 8215 (-) | 1.000 | 0.963 | ctCAGTTctg V$MYB_Q6 | 8723 (+) | 1.000 | 0.955 | aagAACTGta V$MYB_Q6 | 8845 (+) | 0.980 | 0.948 | cagAACGGtt V$MYB_Q6 | 8942 (+) | 0.979 | 0.949 | agcAACTTtt V$MYB_Q6 | 9031 (-) | 0.968 | 0.955 | gtGAGTTcaa V$MYB_Q6 | 9060 (-) | 0.979 | 0.969 | gcAAGTTcca V$MYB_Q6 | 9133 (-) | 1.000 | 0.954 | ttCAGTTttt V$MYB_Q6 | 9158 (-) | 0.968 | 0.954 | gtGAGTTcct V$MYB_Q6 | 9225 (-) | 0.979 | 0.976 | gcAAGTTagc V$MYB_Q6 | 9416 (+) | 0.975 | 0.964 | gaaAACTAtc V$MYB_Q6 | 9507 (-) | 0.979 | 0.967 | gaAAGTTtcc V$MYB_Q6 | 9562 (-) | 0.968 | 0.954 | gtCGGTTtct V$MYB_Q6 | 9629 (-) | 0.968 | 0.961 | ggGAGTTcaa V$MYB_Q6 | 9763 (+) | 0.964 | 0.953 | gaaAACAGtc V$MYB_Q6 | 10056 (+) | 1.000 | 0.954 | aagAACTGaa V$MYB_Q6 | 10195 (-) | 1.000 | 0.986 | gtCAGTTctg V$MYB_Q6 | 10308 (+) | 1.000 | 0.969 | tttAACTGtt V$MYB_Q6 | 10619 (-) | 0.968 | 0.961 | ggGAGTTgta V$MYB_Q6 | 10965 (-) | 0.964 | 0.960 | gcCTGTTgga V$MYB_Q6 | 
11068 (-) | 1.000 | 0.964 | atCAGTTaca V$MYB_Q6 | 11124 (+) | 0.968 | 0.968 | cttAACTCgc V$MYB_Q6 | 11138 (-) | 1.000 | 0.963 | ctCAGTTttg V$MYB_Q6 | 11217 (+) | 1.000 | 0.991 | ttgAACTGcc V$MYB_Q6 | 11225 (-) | 1.000 | 0.969 | ccCAGTTtgt V$MYB_Q6 | 11235 (+) | 1.000 | 0.962 | tctAACTGga V$MYB_Q6 | 11710 (+) | 1.000 | 0.997 | cccAACTGcc V$MYOD_Q6 | 1027 (-) | 0.984 | 0.980 | ggcAGTTGaa V$MYOD_Q6 | 1392 (-) | 1.000 | 0.991 | ggcAGGTGag V$MYOD_Q6 | 1786 (+) | 1.000 | 0.998 | agCACCTgcc V$MYOD_Q6 | 2970 (-) | 0.989 | 0.988 | ggcAGATGcg V$MYOD_Q6 | 5127 (-) | 0.989 | 0.962 | aacAGATGgt V$MYOD_Q6 | 6262 (+) | 0.989 | 0.988 | ccCATCTgcc V$MYOD_Q6 | 7511 (-) | 0.989 | 0.987 | gtcAGATGgc V$MYOD_Q6 | 8277 (+) | 0.989 | 0.989 | tgCAGCTgcc V$MYOD_Q6 | 8421 (+) | 0.989 | 0.962 | agCAGCTgtt V$MYOD_Q6 | 8504 (-) | 1.000 | 0.998 | gacAGGTGcg V$MYOD_Q6 | 11710 (+) | 0.984 | 0.984 | ccCAACTgcc V$NFY_Q6 | 1021 (-) | 1.000 | 0.945 | gtgATTGGcag V$NFY_Q6 | 4618 (+) | 1.000 | 0.951 | aaaCCAATatc V$NFY_Q6 | 9451 (+) | 1.000 | 0.946 | ttaCCAATctc V$NFY_Q6 | 11684 (+) | 1.000 | 0.951 | agaCCAATaaa V$NFY_Q6 | 11896 (+) | 1.000 | 0.962 | caaCCAATagc V$NFY_Q6 | 11928 (-) | 1.000 | 0.955 | cccATTGGctg V$SRF_Q6 | 11536 (+) | 0.948 | 0.942 | tcCCACAtatggta V$USF_Q6 | 735 (+) | 0.987 | 0.940 | aCCACGtggt V$USF_Q6 | 735 (-) | 0.987 | 0.940 | accaCGTGGt V$USF_Q6 | 1616 (-) | 0.987 | 0.934 | gtctCGTGGa V$USF_Q6 | 3938 (+) | 0.945 | 0.929 | gTCACTtgtt V$USF_Q6 | 5021 (+) | 0.987 | 0.969 | gCCACGggac V$USF_Q6 | 6049 (-) | 0.945 | 0.939 | gcccTGTGAc V$USF_Q6 | 7192 (+) | 0.987 | 0.942 | gCCACGgggg V$AP1_Q4 | 132 (-) | 1.000 | 0.957 | gctcAGTCAat V$AP1_Q4 | 1353 (-) | 0.935 | 0.938 | cctgGGTCAcc V$AP1_Q4 | 1445 (-) | 1.000 | 0.940 | gtgaAGTCAct V$AP1_Q4 | 3434 (+) | 1.000 | 0.955 | atTGACTtagg V$AP1_Q4 | 3725 (+) | 1.000 | 0.992 | ggTGACTaaaa V$AP1_Q4 | 3751 (-) | 1.000 | 0.973 | ttttAGTCAtt V$AP1_Q4 | 3933 (-) | 1.000 | 0.985 | tatcAGTCAct V$AP1_Q4 | 4466 (+) | 1.000 | 0.969 | acTGACTgacc V$AP1_Q4 | 
6293 (+) | 1.000 | 0.970 | gaTGACTgagc V$AP1_Q4 | 6322 (+) | 1.000 | 0.948 | ccTGACTgaag V$AP1_Q4 | 10034 (-) | 1.000 | 0.994 | tgtgAGTCAcc V$AP1_Q4 | 10632 (+) | 1.000 | 0.987 | agTGACTgagg V$AP2_Q6 | 100 (-) | 0.992 | 0.979 | cgcctGGGGGtg V$AP2_Q6 | 1186 (+) | 0.992 | 0.893 | caCCCCCaggaa V$AP2_Q6 | 1332 (+) | 0.934 | 0.907 | ctCCCCAgggct V$AP2_Q6 | 1332 (-) | 0.905 | 0.872 | ctcccCAGGGct V$AP2_Q6 | 1483 (-) | 0.953 | 0.862 | tggcaGAGGGag V$AP2_Q6 | 1676 (-) | 0.942 | 0.894 | ctcctTCGGGtg V$AP2_Q6 | 1709 (+) | 0.953 | 0.857 | caCCCTCtgcct V$AP2_Q6 | 4598 (+) | 1.000 | 0.854 | tgCCCGCttgct V$AP2_Q6 | 5887 (-) | 0.934 | 0.854 | gggcaTGGGGcc V$AP2_Q6 | 7190 (-) | 0.923 | 0.861 | cggccACGGGgg V$AP2_Q6 | 7191 (-) | 0.944 | 0.893 | ggccaCGGGGgt V$AP2_Q6 | 10786 (-) | 0.992 | 0.866 | acccaGGGGGat V$AP2_Q6 | 11866 (-) | 0.992 | 0.902 | ggtccGGGGGag V$AP2_Q6 | 11911 (-) | 0.942 | 0.855 | aggctTCGGGcc V$AP2_Q6 | 11926 (+) | 0.892 | 0.884 | cgCCCATtggct V$CEBP_Q2 | 420 (-) | 0.936 | 0.934 | agcttgtCCAAAat V$CEBP_Q2 | 489 (+) | 1.000 | 0.968 | acATTGCaacatta V$CEBP_Q2 | 1085 (+) | 0.936 | 0.933 | atTTTGGtcaacta V$CEBP_Q2 | 1463 (-) | 0.944 | 0.944 | acgtttcTCAATat V$CEBP_Q2 | 2368 (+) | 0.984 | 0.942 | atTTTGCcacagtt V$CEBP_Q2 | 2555 (+) | 0.936 | 0.936 | ctGTTGTgaaattt V$CEBP_Q2 | 3424 (+) | 0.987 | 0.981 | tgGTTGCttaattg V$CEBP_Q2 | 3569 (-) | 0.984 | 0.968 | gcctttgGCAAGtt V$CEBP_Q2 | 3578 (-) | 0.931 | 0.936 | aagttatTCAACtc V$CEBP_Q2 | 3706 (-) | 0.944 | 0.938 | tgcttgaTCAATat V$CEBP_Q2 | 4207 (+) | 0.984 | 0.945 | ggCTTGCctcagtt V$CEBP_Q2 | 4708 (+) | 0.936 | 0.942 | gaCTTGGgtaattc V$CEBP_Q2 | 4795 (+) | 0.984 | 0.980 | gaCTTGCataatct V$CEBP_Q2 | 5442 (+) | 1.000 | 0.948 | gtATTGCcatataa V$CEBP_Q2 | 6564 (+) | 0.987 | 0.942 | tgGTTGCcacagtt V$CEBP_Q2 | 6573 (-) | 0.984 | 0.981 | cagttatGCAAAac V$CEBP_Q2 | 6608 (+) | 0.984 | 0.971 | taTTTGCtaaactc V$CEBP_Q2 | 7208 (+) | 0.944 | 0.941 | gaATTGAacaaaaa V$CEBP_Q2 | 8071 (+) | 1.000 | 0.984 | gtATTGCagaaact V$CEBP_Q2 | 8468 
(-) | 0.984 | 0.938 | atctaagGCAAAaa V$CEBP_Q2 | 8578 (-) | 0.984 | 0.939 | aaatgggGCAAAgg V$CEBP_Q2 | 8936 (-) | 0.987 | 0.950 | aactaaaGCAACtt V$CEBP_Q2 | 9138 (+) | 0.928 | 0.934 | ttTTTGAataagac V$CEBP_Q2 | 9447 (-) | 0.952 | 0.944 | aggtttaCCAATct V$CEBP_Q2 | 9488 (+) | 0.984 | 0.944 | taTTTGCaatagaa V$CEBP_Q2 | 10377 (-) | 1.000 | 0.965 | aaatgttGCAATcc V$CEBP_Q2 | 10686 (-) | 0.987 | 0.933 | gactctgGCAACac V$CEBP_Q2 | 10968 (-) | 0.984 | 0.942 | tgttggaGCAAGga V$CEBP_Q2 | 11284 (-) | 0.984 | 0.933 | gcataggGCAAAga V$ER_Q6 | 1078 (-) | 1.000 | 0.928 | gtaGGTCAttttggtcaac V$ER_Q6 | 1087 (-) | 1.000 | 0.928 | tttGGTCAactaaaatttg V$ER_Q6 | 1354 (-) | 1.000 | 0.935 | ctgGGTCAcctgatgctcg V$ER_Q6 | 2327 (+) | 1.000 | 0.908 | gtcttgctatgTGACCcta V$ER_Q6 | 2635 (+) | 1.000 | 0.904 | tgctctaccccTGACCtgc V$ER_Q6 | 2690 (-) | 1.000 | 0.947 | cctGGTCAcactcttctcc V$ER_Q6 | 3072 (-) | 1.000 | 0.931 | ctaGGTCAcagggtgattc V$ER_Q6 | 3880 (+) | 1.000 | 0.914 | tacaagctcctTGACCctc V$ER_Q6 | 4291 (+) | 1.000 | 0.910 | gaccctaaattTGACCctc V$ER_Q6 | 4403 (-) | 1.000 | 0.922 | acaGGTCAgccacaacctt V$ER_Q6 | 5136 (-) | 1.000 | 0.909 | tgcGGTCAacctaggcagt V$ER_Q6 | 5256 (-) | 1.000 | 0.931 | tgtGGTCAcagtcatcctg V$ER_Q6 | 6102 (+) | 1.000 | 0.926 | atcagtcctgtTGACCaga V$ER_Q6 | 7582 (-) | 1.000 | 0.906 | tcaGGTCAaacaaatgctc V$ER_Q6 | 8255 (+) | 1.000 | 0.916 | gttcgtggcatTGACCtga V$ER_Q6 | 9321 (-) | 1.000 | 0.928 | ttgGGTCAgagcctctgga V$ER_Q6 | 9636 (-) | 1.000 | 0.950 | caaGGTCAgcttgggctac V$GR_Q6 | 8 (-) | 0.984 | 0.872 | catGTACAagtacatgtga V$GR_Q6 | 14 (-) | 0.984 | 0.909 | caaGTACAtgtgaatgtgt V$GR_Q6 | 32 (-) | 1.000 | 0.883 | tgtGAACAacttcagatgt V$GR_Q6 | 224 (+) | 0.864 | 0.859 | tttttccttttTGTCTttt V$GR_Q6 | 265 (+) | 0.986 | 0.870 | actcaggtcctTGTGCatg V$GR_Q6 | 314 (+) | 0.986 | 0.910 | tctcctagtccTGTGCtgg V$GR_Q6 | 413 (+) | 0.989 | 0.887 | atgggtgagctTGTCCaaa V$GR_Q6 | 538 (+) | 0.875 | 0.851 | tgatgcgcttgTGTTTtaa V$GR_Q6 | 605 (+) | 0.882 | 0.850 | tttaaacatccTGTCGtac 
V$GR_Q6 | 663 (-) | 0.984 | 0.924 | taaGTACActgtagctgct V$GR_Q6 | 874 (-) | 0.986 | 0.898 | aatGCACAaacacgaaatg V$GR_Q6 | 890 (-) | 0.989 | 0.904 | atgGGACAgacttttcact V$GR_Q6 | 945 (-) | 1.000 | 0.881 | tagGAACAatgatcaaata V$GR_Q6 | 1009 (+) | 0.865 | 0.871 | acgcacctataTGTGAttg V$GR_Q6 | 1123 (+) | 1.000 | 0.901 | gtatgcacgtgTGTTCcta V$GR_Q6 | 1145 (+) | 0.876 | 0.878 | ccgttactttaTGTAGttc V$GR_Q6 | 1247 (+) | 0.869 | 0.873 | gggcagcattcTGTCAtca V$GR_Q6 | 1261 (+) | 0.984 | 0.961 | catcatccgtgTGTACtct V$GR_Q6 | 1526 (+) | 1.000 | 0.948 | agctgttttttTGTTCtaa V$GR_Q6 | 1620 (+) | 0.989 | 0.902 | cgtggacagcaTGTCCagc V$GR_Q6 | 1620 (-) | 0.989 | 0.905 | cgtGGACAgcatgtccagc V$GR_Q6 | 1746 (-) | 1.000 | 0.934 | ataGAACActgatctataa V$GR_Q6 | 1764 (+) | 0.989 | 0.908 | agcctctcgccTGTCCtcc V$GR_Q6 | 1808 (+) | 1.000 | 0.881 | catcagggtggTGTTCagg V$GR_Q6 | 2027 (+) | 0.984 | 0.869 | gattaaaggagTGTACcac V$GR_Q6 | 2035 (+) | 0.984 | 0.878 | gagtgtaccacTGTACctg V$GR_Q6 | 2301 (+) | 0.861 | 0.847 | gtgtgtgtctgTGTGTttt V$GR_Q6 | 2303 (+) | 0.875 | 0.875 | gtgtgtctgtgTGTTTtgg V$GR_Q6 | 2411 (-) | 0.875 | 0.853 | aaaAAACAaaaaatcacag V$GR_Q6 | 2545 (+) | 0.892 | 0.862 | taacaggtttcTGTTGtga V$GR_Q6 | 2579 (-) | 0.892 | 0.866 | ttaCAACAaatgcagtact V$GR_Q6 | 2808 (-) | 1.000 | 0.913 | actGAACAccgtgctctgt V$GR_Q6 | 2813 (+) | 0.989 | 0.903 | acaccgtgctcTGTCCagt V$GR_Q6 | 2864 (-) | 1.000 | 0.916 | gctGAACAgaactccaccg V$GR_Q6 | 3022 (-) | 0.984 | 0.927 | gcaGTACAccgacttccgc V$GR_Q6 | 3046 (-) | 0.989 | 0.890 | ccgGGACAtacctgcgagt V$GR_Q6 | 3161 (+) | 0.984 | 0.889 | tccattccgccTGTACatt V$GR_Q6 | 3170 (-) | 0.984 | 0.897 | cctGTACAttatatccccc V$GR_Q6 | 3294 (-) | 1.000 | 0.883 | ggtGAACAtctccatgtag V$GR_Q6 | 3337 (+) | 1.000 | 0.904 | tatttccctccTGTTCatc V$GR_Q6 | 3454 (-) | 0.875 | 0.880 | ataAAACAcagggcctcat V$GR_Q6 | 3538 (+) | 1.000 | 0.962 | aggtttggttcTGTTCttt V$GR_Q6 | 3691 (-) | 1.000 | 0.955 | caaGAACAcaagagatgct V$GR_Q6 | 3795 (-) | 0.989 | 0.906 | tctGGACAgaatagaaaaa V$GR_Q6 
| 3933 (+) | 1.000 | 0.881 | tatcagtcactTGTTCatt V$GR_Q6 | 4134 (+) | 0.878 | 0.851 | cagaatctaggTGTGGtgc V$GR_Q6 | 4150 (+) | 0.863 | 0.859 | tgcctcccattTGTAAtcc V$GR_Q6 | 4278 (-) | 0.986 | 0.962 | taaGCACAcattagaccct V$GR_Q6 | 4439 (-) | 0.878 | 0.857 | aaaCCACAcaattaatatt V$GR_Q6 | 4488 (-) | 1.000 | 0.909 | tttGAACAaaaaacaaaac V$GR_Q6 | 4495 (-) | 0.875 | 0.853 | aaaAAACAaaacaaaacaa V$GR_Q6 | 4524 (-) | 0.892 | 0.868 | agaCAACAaatgcgtcaac V$GR_Q6 | 4560 (-) | 0.989 | 0.907 | gctGGACAcagttgcggtt V$GR_Q6 | 4690 (+) | 0.989 | 0.946 | aggaaacttgaTGTCCtag V$GR_Q6 | 4883 (-) | 1.000 | 0.885 | ctgGAACAttccatgttcc V$GR_Q6 | 4885 (+) | 1.000 | 0.877 | ggaacattccaTGTTCctt V$GR_Q6 | 4977 (+) | 0.989 | 0.880 | cagcccctaccTGTCCgaa V$GR_Q6 | 5173 (+) | 1.000 | 0.959 | acttgtgactgTGTTCtag V$GR_Q6 | 5202 (+) | 0.986 | 0.932 | ctgtggcctctTGTGCttc V$GR_Q6 | 5361 (-) | 0.875 | 0.870 | acaAAACAcacgggatttc V$GR_Q6 | 5602 (-) | 0.879 | 0.870 | ataTAACAcaagaaaccaa V$GR_Q6 | 5660 (-) | 1.000 | 0.953 | aaaGAACAcaataagagac V$GR_Q6 | 5998 (+) | 0.989 | 0.882 | gatggggtacgTGTCCaac V$GR_Q6 | 6248 (+) | 0.989 | 0.914 | ggctttctattTGTCCcat V$GR_Q6 | 6329 (-) | 0.986 | 0.967 | gaaGCACAcactggatttc V$GR_Q6 | 6388 (+) | 1.000 | 0.917 | gaactggcctgTGTTCaga V$GR_Q6 | 6516 (-) | 1.000 | 0.955 | aaaGAACAcgaagttcacc V$GR_Q6 | 6580 (-) | 0.875 | 0.846 | gcaAAACAcaaaacaaata V$GR_Q6 | 6769 (-) | 1.000 | 0.906 | catGAACAaaacagcagat V$GR_Q6 | 6821 (+) | 0.986 | 0.941 | tcccttttttaTGTGCtat V$GR_Q6 | 6912 (-) | 1.000 | 0.923 | ggaGAACAaggaattgatt V$GR_Q6 | 6923 (+) | 0.986 | 0.898 | aattgattgtaTGTGCagt V$GR_Q6 | 6944 (+) | 0.989 | 0.889 | ccaagtgatacTGTCCatt V$GR_Q6 | 7056 (+) | 0.875 | 0.849 | ttttcttaattTGTTTttg V$GR_Q6 | 7098 (+) | 0.984 | 0.895 | cacctagtctgTGTACcac V$GR_Q6 | 7109 (+) | 0.986 | 0.909 | tgtaccacatgTGTGCaat V$GR_Q6 | 7210 (-) | 1.000 | 0.936 | attGAACAaaaagtaggcc V$GR_Q6 | 7311 (+) | 0.984 | 0.869 | aggatatagaaTGTACgtc V$GR_Q6 | 7329 (-) | 1.000 | 0.886 | ctgGAACAatagacacatt V$GR_Q6 | 
7378 (+) | 0.989 | 0.910 | gatattgcttgTGTCCagg V$GR_Q6 | 7421 (+) | 0.875 | 0.854 | aaaggtttattTGTTTtta V$GR_Q6 | 7466 (+) | 0.986 | 0.935 | caggatttgagTGTGCtac V$GR_Q6 | 7483 (+) | 1.000 | 0.959 | acgcgcacgtcTGTTCttc V$GR_Q6 | 7778 (-) | 0.875 | 0.857 | acaAAACAaaaattatccc V$GR_Q6 | 7803 (-) | 1.000 | 0.931 | aaaGAACAacgacaactcc V$GR_Q6 | 7858 (-) | 0.984 | 0.881 | agcGTACActtagaaaagc V$GR_Q6 | 7989 (+) | 0.986 | 0.887 | aaggacttcacTGTGCctg V$GR_Q6 | 8243 (+) | 1.000 | 0.891 | agatgaaggtgTGTTCgtg V$GR_Q6 | 8353 (+) | 0.986 | 0.882 | ttgccaaccccTGTGCagc V$GR_Q6 | 8416 (+) | 1.000 | 0.931 | ctgtaagcagcTGTTCtag V$GR_Q6 | 8500 (-) | 0.989 | 0.868 | cagGGACAggtgcgggaca V$GR_Q6 | 8511 (-) | 0.989 | 0.873 | gcgGGACActcacagatgc V$GR_Q6 | 8532 (-) | 1.000 | 0.996 | caaGAACAcagtgggccga V$GR_Q6 | 8552 (+) | 0.989 | 0.864 | aaaatgaacaaTGTCCgtc V$GR_Q6 | 8554 (-) | 1.000 | 0.903 | aatGAACAatgtccgtctt V$GR_Q6 | 8748 (+) | 0.989 | 0.936 | ggcccacacctTGTCCtcc V$GR_Q6 | 8819 (+) | 1.000 | 0.908 | ggtactctcgaTGTTCaaa V$GR_Q6 | 8839 (-) | 0.875 | 0.871 | ccaAAACAgaacggtttaa V$GR_Q6 | 8896 (-) | 0.878 | 0.878 | gtaCCACAaacagattcag V$GR_Q6 | 8964 (+) | 0.878 | 0.852 | ggagaaccaggTGTGGtgg V$GR_Q6 | 8979 (-) | 0.984 | 0.889 | gtgGTACAcacctttaatt V$GR_Q6 | 8998 (-) | 0.986 | 0.925 | ccaGCACAtggaggcaggg V$GR_Q6 | 9259 (-) | 0.989 | 0.866 | atgGGACAgggcagcgaca V$GR_Q6 | 9291 (+) | 0.986 | 0.902 | gagtcagcttgTGTGCcca V$GR_Q6 | 9352 (+) | 1.000 | 0.961 | tctcttctcagTGTTCtct V$GR_Q6 | 9454 (+) | 0.984 | 0.868 | ccaatctctgcTGTACagg V$GR_Q6 | 9463 (+) | 1.000 | 0.952 | gctgtacaggaTGTTCtag V$GR_Q6 | 9463 (-) | 0.984 | 0.899 | gctGTACAggatgttctag V$GR_Q6 | 9509 (+) | 0.989 | 0.903 | aagtttccccaTGTCCaac V$GR_Q6 | 9532 (-) | 1.000 | 0.963 | ctaGAACAaacaagtcctg V$GR_Q6 | 9564 (+) | 0.878 | 0.847 | cggtttctgggTGTGGtgg V$GR_Q6 | 9729 (+) | 0.989 | 0.895 | ttgtatgataaTGTCCata V$GR_Q6 | 9778 (-) | 1.000 | 0.942 | ataGAACAattttgttcag V$GR_Q6 | 9779 (+) | 1.000 | 0.923 | tagaacaatttTGTTCagt V$GR_Q6 | 9790 
(+) | 0.984 | 0.885 | tgttcagtaaaTGTACatc V$GR_Q6 | 9799 (-) | 0.984 | 0.866 | aatGTACAtcacataagta V$GR_Q6 | 9936 (+) | 0.879 | 0.846 | atttgcatgtaTGTTAtaa V$GR_Q6 | 10175 (+) | 1.000 | 0.916 | attagtgtgtgTGTTCagc V$GR_Q6 | 10192 (+) | 0.989 | 0.950 | gcagtcagttcTGTCCttc V$GR_Q6 | 10243 (+) | 0.986 | 0.879 | gtggtcaggctTGTGCagg V$GR_Q6 | 10297 (-) | 0.989 | 0.871 | ttcGGACAttttttaactg V$GR_Q6 | 10543 (+) | 0.984 | 0.902 | actctaaagtaTGTACagc V$GR_Q6 | 10552 (-) | 0.984 | 0.887 | tatGTACAgccggcggtac V$GR_Q6 | 10564 (-) | 0.984 | 0.869 | gcgGTACAgcgtttgcttt V$GR_Q6 | 10643 (+) | 0.989 | 0.960 | aagacacccagTGTCCtaa V$GR_Q6 | 10795 (+) | 0.986 | 0.874 | gatgtgagcctTGTGCatc V$GR_Q6 | 10887 (-) | 0.986 | 0.863 | actGCACAtgcgcagagac V$GR_Q6 | 10951 (+) | 0.986 | 0.919 | tagctgcggtcTGTGCctg V$GR_Q6 | 11135 (+) | 0.984 | 0.937 | gcactcagtttTGTACtct V$GR_Q6 | 11221 (+) | 1.000 | 0.980 | actgcccagttTGTTCtaa V$GR_Q6 | 11245 (+) | 0.878 | 0.853 | gtccaaaactcTGTGGtgt V$GR_Q6 | 11250 (+) | 0.986 | 0.862 | aaactctgtggTGTGCcaa V$GR_Q6 | 11365 (+) | 0.986 | 0.975 | atgattctatcTGTGCttt V$GR_Q6 | 11390 (+) | 0.984 | 0.878 | ttattaaatttTGTACgca V$GR_Q6 | 11467 (+) | 0.875 | 0.847 | aattttttattTGTTTttc V$GR_Q6 | 11512 (+) | 0.989 | 0.852 | gatattaccgtTGTCCgta V$GR_Q6 | 11603 (-) | 0.986 | 0.902 | agtGCACAgagaaaggggt V$GR_Q6 | 11641 (+) | 0.984 | 0.937 | aaatgtatttcTGTACtgt V$GR_Q6 | 11788 (-) | 0.989 | 0.943 | taaGGACAaatcccagatt V$NF1_Q6 | 388 (-) | 1.000 | 0.957 | gccattcatttGCCAAca V$NF1_Q6 | 461 (+) | 1.000 | 0.954 | ggTTGGCttatttgacac V$NF1_Q6 | 1023 (+) | 1.000 | 0.958 | gaTTGGCagttgaatcaa V$NF1_Q6 | 1657 (+) | 1.000 | 0.963 | acTTGGCaaatggagatt V$NF1_Q6 | 1849 (-) | 1.000 | 0.952 | aacacaatagaGCCAAct V$NF1_Q6 | 1940 (+) | 1.000 | 0.959 | ccTTGGCtaggctggaac V$NF1_Q6 | 1974 (+) | 1.000 | 0.974 | ggTTGGCctcaaactcac V$NF1_Q6 | 3226 (-) | 1.000 | 0.959 | tttaccagtcaGCCAAca V$NF1_Q6 | 3317 (+) | 1.000 | 0.966 | gaTTGGCcttttcctgat V$NF1_Q6 | 3528 (+) | 1.000 | 0.962 | ccTTGGCccgaggtttgg 
V$NF1_Q6 | 3571 (+) | 1.000 | 0.964 | ctTTGGCaagttattcaa V$NF1_Q6 | 3671 (-) | 1.000 | 0.952 | gacttagaattGCCAAgt V$NF1_Q6 | 3678 (-) | 1.000 | 0.977 | aattgccaagtGCCAAga V$NF1_Q6 | 4728 (+) | 1.000 | 0.958 | atTTGGCcctcgctgatg V$NF1_Q6 | 4894 (-) | 1.000 | 0.968 | catgttccttgGCCAAgt V$NF1_Q6 | 4900 (+) | 1.000 | 0.970 | ccTTGGCcaagtgcatgc V$NF1_Q6 | 5265 (-) | 1.000 | 0.958 | agtcatcctggGCCAAca V$NF1_Q6 | 6200 (-) | 1.000 | 0.968 | agccggggtatGCCAAgg V$NF1_Q6 | 6834 (-) | 1.000 | 0.961 | tgctatttactGCCAAct V$NF1_Q6 | 7537 (-) | 1.000 | 0.990 | gttggttgtgaGCCAAct V$NF1_Q6 | 8226 (+) | 1.000 | 0.973 | tgTTGGCtcccacaccca V$NF1_Q6 | 8344 (-) | 1.000 | 0.979 | ctggaccacttGCCAAcc V$NF1_Q6 | 8389 (-) | 1.000 | 0.960 | gtatacctgaaGCCAAgc V$NF1_Q6 | 8446 (-) | 1.000 | 0.958 | ccgcatcctaaGCCAAgg V$NF1_Q6 | 10410 (+) | 1.000 | 0.957 | ggTTGGCatttttaacac V$NF1_Q6 | 11179 (+) | 1.000 | 0.955 | ttTTGGCtgccttgtttc V$NF1_Q6 | 11253 (-) | 1.000 | 0.969 | ctctgtggtgtGCCAAga V$NF1_Q6 | 11930 (+) | 1.000 | 0.963 | caTTGGCtgaaactattt V$NFKB_Q6 | 328 (+) | 0.863 | 0.857 | gcTGGAAattccct V$NFKB_Q6 | 329 (-) | 1.000 | 0.974 | ctggaaaTTCCCtc V$NFKB_Q6 | 1004 (+) | 1.000 | 0.875 | tgGGGACgcaccta V$NFKB_Q6 | 1951 (+) | 0.805 | 0.866 | ctGGAACtttcctt V$NFKB_Q6 | 3046 (+) | 1.000 | 0.889 | ccGGGACatacctg V$NFKB_Q6 | 4947 (+) | 1.000 | 0.866 | ctGGGACtttctgc V$OCT1_Q6 | 992 (-) | 0.893 | 0.918 | ataaTTTTCatgtgg V$OCT1_Q6 | 2831 (+) | 1.000 | 0.855 | ttgtctGCAAAtggc V$OCT1_Q6 | 3108 (-) | 0.888 | 0.917 | ctaaTTTCCatgcgg V$OCT1_Q6 | 3128 (-) | 0.909 | 0.930 | ttaaTTTACattccc V$OCT1_Q6 | 3834 (-) | 0.888 | 0.874 | taaaTTTCCattctc V$OCT1_Q6 | 5340 (-) | 1.000 | 0.875 | tagtTTTGCatcccc V$OCT1_Q6 | 5435 (-) | 0.838 | 0.884 | tttaTTTGTattgcc V$OCT1_Q6 | 6574 (+) | 1.000 | 0.927 | agttatGCAAAacac V$OCT1_Q6 | 6966 (+) | 0.888 | 0.861 | ctggatGGAAAggaa V$OCT1_Q6 | 7349 (+) | 0.792 | 0.860 | caacatGTTAAtcat V$OCT1_Q6 | 9486 (-) | 1.000 | 0.884 | tttaTTTGCaataga V$OCT1_Q6 | 9933 (-) | 1.000 | 0.996 | 
gtaaTTTGCatgtat V$OCT1_Q6 | 10408 (-) | 0.883 | 0.859 | gtggTTGGCattttt V$OCT1_Q6 | 10908 (+) | 1.000 | 0.981 | tactatGCAAAtaat V$OCT1_Q6 | 11151 (+) | 0.893 | 0.902 | tctgatGAAAAtgaa V$OCT1_Q6 | 11334 (+) | 0.909 | 0.903 | gacgatGTAAAtgcc V$OCT1_Q6 | 11664 (-) | 0.893 | 0.882 | acaaTTTTCataacc V$SP1_Q6 | 1487 (+) | 0.927 | 0.911 | agagGGAGGgggg V$SP1_Q6 | 1492 (+) | 0.904 | 0.902 | gaggGGGGGggga V$SP1_Q6 | 1493 (+) | 0.904 | 0.907 | agggGGGGGggag V$SP1_Q6 | 5894 (-) | 1.000 | 0.956 | gggcCCGCCccta V$SP1_Q6 | 11845 (+) | 0.915 | 0.922 | ggggGGTGGggta V$SP1_Q6 | 11870 (+) | 0.927 | 0.912 | cgggGGAGGgact F$ABF_C | 2911 (+) | 0.931 | 0.954 | cATCTCtgtggacgg F$ABF_C | 3711 (+) | 0.906 | 0.944 | gATCAAtatttacgg F$ABF_C | 6859 (-) | 0.719 | 0.861 | acgtccagaCCGAGg F$ABF_C | 6889 (-) | 0.796 | 0.878 | acgtaagacAAGAGg F$ABF_C | 7859 (-) | 0.714 | 0.827 | gcgtacactTAGAAa F$ABF_C | 9318 (-) | 0.714 | 0.826 | gcgttgggtCAGAGc F$ABF_C | 10026 (-) | 0.877 | 0.909 | ccgtagtttGTGAGt F$ABF_C | 11914 (+) | 0.719 | 0.832 | cTTCGGgcccaacgc V$AP1_C | 133 (+) | 1.000 | 0.880 | ctcAGTCAa V$AP1_C | 133 (-) | 0.808 | 0.864 | cTCAGTcaa V$AP1_C | 440 (+) | 0.964 | 0.894 | tttAATCAg V$AP1_C | 644 (+) | 0.955 | 0.876 | tttATTCAt V$AP1_C | 644 (-) | 0.808 | 0.852 | tTTATTcat V$AP1_C | 1032 (+) | 0.964 | 0.956 | ttgAATCAa V$AP1_C | 1032 (-) | 0.955 | 0.949 | tTGAATcaa V$AP1_C | 3435 (+) | 0.808 | 0.864 | ttgACTTAg V$AP1_C | 3435 (-) | 1.000 | 0.880 | tTGACTtag V$AP1_C | 3471 (-) | 0.964 | 0.861 | aTGATTtac V$AP1_C | 3580 (+) | 0.955 | 0.874 | gttATTCAa V$AP1_C | 3726 (+) | 0.833 | 0.867 | gtgACTAAa V$AP1_C | 3726 (-) | 1.000 | 0.904 | gTGACTaaa V$AP1_C | 3752 (+) | 1.000 | 0.907 | tttAGTCAt V$AP1_C | 3752 (-) | 0.833 | 0.869 | tTTAGTcat V$AP1_C | 3922 (-) | 0.955 | 0.854 | aTGAATtac V$AP1_C | 3934 (+) | 1.000 | 0.884 | atcAGTCAc V$AP1_C | 3934 (-) | 0.808 | 0.854 | aTCAGTcac V$AP1_C | 3946 (+) | 0.955 | 0.864 | ttcATTCAg V$AP1_C | 4467 (+) | 0.808 | 0.850 | ctgACTGAc V$AP1_C | 4467 (-) | 1.000 | 
0.897 | cTGACTgac V$AP1_C | 4715 (+) | 0.955 | 0.851 | gtaATTCAc V$AP1_C | 6294 (+) | 0.808 | 0.868 | atgACTGAg V$AP1_C | 6294 (-) | 1.000 | 0.884 | aTGACTgag V$AP1_C | 6323 (-) | 1.000 | 0.895 | cTGACTgaa V$AP1_C | 7355 (+) | 0.964 | 0.885 | gttAATCAt V$AP1_C | 8323 (+) | 0.788 | 0.852 | ctgCGTCAg V$AP1_C | 8572 (-) | 0.964 | 0.879 | tTGATTaaa V$AP1_C | 8616 (+) | 0.964 | 0.858 | gtcAATCAc V$AP1_C | 9141 (+) | 0.808 | 0.864 | ttgAATAAg V$AP1_C | 9141 (-) | 0.955 | 0.873 | tTGAATaag V$AP1_C | 9321 (+) | 0.788 | 0.850 | ttgGGTCAg V$AP1_C | 10035 (+) | 1.000 | 0.983 | gtgAGTCAc V$AP1_C | 10035 (-) | 0.989 | 0.976 | gTGAGTcac V$AP1_C | 10633 (+) | 0.808 | 0.866 | gtgACTGAg V$AP1_C | 10633 (-) | 1.000 | 0.881 | gTGACTgag V$AP1_C | 11197 (-) | 0.964 | 0.859 | tTGATTgat V$AP1_C | 11455 (+) | 0.964 | 0.974 | gtgAATCAg V$AP1_C | 11455 (-) | 0.955 | 0.952 | gTGAATcag V$AP1_C | 11695 (+) | 0.964 | 0.882 | gttAATCAc V$CEBP_C | 323 (+) | 0.893 | 0.817 | cctgtgctGGAAAttccc V$CEBP_C | 330 (-) | 0.944 | 0.900 | tggaaATTCCctcagaca V$CEBP_C | 486 (-) | 1.000 | 0.822 | tcaacATTGCaacattat V$CEBP_C | 800 (-) | 0.955 | 0.848 | gagccATTTCtccagccc V$CEBP_C | 921 (-) | 0.941 | 0.819 | tagcaCTTCCacgaaacc V$CEBP_C | 1099 (+) | 0.955 | 0.811 | aaatttgaGAAATcgctg V$CEBP_C | 1130 (-) | 0.903 | 0.894 | cgtgtGTTCCtaaaaccg V$CEBP_C | 1460 (-) | 0.914 | 0.838 | cggacGTTTCtcaatatt V$CEBP_C | 1556 (+) | 0.955 | 0.838 | ctcttctgGAAATctacg V$CEBP_C | 1818 (+) | 0.941 | 0.834 | gtgttcagGGAAGacacc V$CEBP_C | 2365 (-) | 0.949 | 0.844 | tttatTTTGCcacagttt V$CEBP_C | 2396 (-) | 0.951 | 0.843 | agttgCTTTCcttaaaaa V$CEBP_C | 2554 (+) | 0.955 | 0.862 | tctgttgtGAAATttccg V$CEBP_C | 2561 (-) | 0.893 | 0.818 | tgaaaTTTCCgtaagtat V$CEBP_C | 2563 (+) | 0.982 | 0.817 | aaatttccGTAAGtattt V$CEBP_C | 2573 (-) | 0.935 | 0.806 | aagtaTTTACaacaaatg V$CEBP_C | 2709 (-) | 0.941 | 0.810 | ttcctCTTCCcaaataga V$CEBP_C | 2800 (-) | 0.982 | 0.893 | cttctCTTACtgaacacc V$CEBP_C | 3132 (-) | 0.944 | 0.824 | tttacATTCCcacaacaa 
V$CEBP_C | 3320 (-) | 0.904 | 0.847 | tggccTTTTCctgattct V$CEBP_C | 3436 (+) | 0.945 | 0.873 | tgacttagGTAACagtta V$CEBP_C | 3568 (+) | 0.996 | 0.940 | ggcctttgGCAAGttatt V$CEBP_C | 3599 (-) | 0.914 | 0.810 | tttggGTTTCtgcagttt V$CEBP_C | 3643 (+) | 0.944 | 0.815 | cacatggtGGAATcagta V$CEBP_C | 3921 (-) | 0.985 | 0.857 | tatgaATTACattatcag V$CEBP_C | 4021 (+) | 0.944 | 0.832 | gctctcatGGAATtaata V$CEBP_C | 4204 (-) | 0.996 | 0.843 | caaggCTTGCctcagtta V$CEBP_C | 4267 (-) | 0.982 | 0.853 | ggatgCTTACctaagcac V$CEBP_C | 4390 (-) | 0.903 | 0.834 | agtgaGTTCCaagacagg V$CEBP_C | 4602 (+) | 0.941 | 0.861 | cgcttgctGGAAGgtaaa V$CEBP_C | 4707 (+) | 0.985 | 0.966 | agacttggGTAATtcact V$CEBP_C | 4792 (-) | 0.996 | 0.871 | gctgaCTTGCataatctc V$CEBP_C | 4896 (-) | 0.792 | 0.824 | tgttcCTTGGccaagtgc V$CEBP_C | 4948 (-) | 0.951 | 0.837 | tgggaCTTTCtgcagcag V$CEBP_C | 5149 (-) | 0.935 | 0.820 | ggcagTTTACgaaaaggg V$CEBP_C | 5439 (-) | 1.000 | 0.845 | tttgtATTGCcatataac V$CEBP_C | 5912 (-) | 0.914 | 0.860 | agtttGTTTCcccagcga V$CEBP_C | 6342 (-) | 0.951 | 0.813 | gatttCTTTCacgaaagc V$CEBP_C | 6546 (+) | 0.982 | 0.898 | cggctgagGTAAGagtca V$CEBP_C | 6572 (+) | 0.949 | 0.917 | acagttatGCAAAacaca V$CEBP_C | 6605 (-) | 0.949 | 0.884 | tgctaTTTGCtaaactcc V$CEBP_C | 6872 (+) | 0.941 | 0.818 | ggcttataGGAAGcacca V$CEBP_C | 7051 (-) | 0.904 | 0.809 | attttTTTTCttaatttg V$CEBP_C | 7243 (+) | 0.904 | 0.820 | cgtctcagGAAAAagtct V$CEBP_C | 7326 (+) | 0.770 | 0.832 | cgtctggaACAATagaca V$CEBP_C | 7340 (-) | 0.955 | 0.824 | gacacATTTCaacatgtt V$CEBP_C | 7558 (+) | 0.893 | 0.812 | tagatgctGGAAAccaaa V$CEBP_C | 8068 (-) | 1.000 | 0.816 | ctggtATTGCagaaactt V$CEBP_C | 8071 (+) | 0.914 | 0.801 | gtattgcaGAAACttcct V$CEBP_C | 8140 (+) | 0.893 | 0.838 | ctgctgagGGAAAacatt V$CEBP_C | 8488 (-) | 0.904 | 0.838 | ggttaTTTTCtacaggga V$CEBP_C | 8577 (+) | 0.949 | 0.806 | taaatgggGCAAAggcaa V$CEBP_C | 8606 (-) | 0.944 | 0.807 | ttatcATTCCgtcaatca V$CEBP_C | 8693 (+) | 0.904 | 0.818 | gcctttctGAAAAtacca 
V$CEBP_C | 8906 (+) | 0.944 | 0.832 | cagattcaGGAATactcg V$CEBP_C | 8990 (-) | 0.944 | 0.826 | ctttaATTCCagcacatg V$CEBP_C | 9095 (-) | 0.982 | 0.803 | ctctgCTTACaaaagcaa V$CEBP_C | 9506 (-) | 0.914 | 0.818 | tgaaaGTTTCcccatgtc V$CEBP_C | 9600 (+) | 0.941 | 0.833 | agcatttgGGAAGctgag V$CEBP_C | 9870 (+) | 0.955 | 0.916 | agaatgagGAAATcaatt V$CEBP_C | 10069 (+) | 0.941 | 0.847 | ggtcttctGGAAGggcag V$CEBP_C | 10131 (+) | 0.955 | 0.812 | ttagttaaGAAATtatta V$CEBP_C | 10252 (+) | 0.941 | 0.848 | cttgtgcaGGAAGtgatt V$CEBP_C | 10366 (-) | 0.951 | 0.849 | agaatCTTTCaaaatgtt V$CEBP_C | 10376 (+) | 1.000 | 0.915 | aaaatgttGCAATccata V$CEBP_C | 10621 (+) | 0.951 | 0.860 | gagttgtaGAAAGtgact V$CEBP_C | 10685 (+) | 0.959 | 0.866 | ggactctgGCAACacccg V$CEBP_C | 10967 (+) | 0.996 | 0.829 | ctgttggaGCAAGgagcc V$CEBP_C | 11149 (+) | 0.904 | 0.812 | actctgatGAAAAtgaag V$CEBP_C | 11283 (+) | 0.949 | 0.831 | ggcataggGCAAAgaaat V$CEBP_C | 11642 (-) | 0.955 | 0.819 | aatgtATTTCtgtactgt B$CRP_C | 25 (+) | 1.000 | 0.814 | gaatgtgTGTGAacaacttcagatgt B$CRP_C | 3886 (-) | 1.000 | 0.828 | ctccttgaccctcaTCACAggtatta B$CRP_C | 4136 (+) | 0.716 | 0.783 | gaatctaGGTGTggtgcctcccattt V$GATA_C | 84 (+) | 0.959 | 0.957 | tGATAGggcct V$GATA_C | 88 (-) | 0.959 | 0.963 | agggcCTATCt V$GATA_C | 304 (-) | 0.959 | 0.912 | ctgagCTATCt V$GATA_C | 568 (+) | 1.000 | 0.959 | aGATAAacgtc V$GATA_C | 1992 (-) | 1.000 | 0.940 | caagaTTATCt V$GATA_C | 2320 (+) | 0.953 | 0.915 | gGATACagtct V$GATA_C | 2724 (+) | 1.000 | 0.942 | aGATAAtcttc V$GATA_C | 2899 (-) | 1.000 | 0.956 | cttttTTATCc V$GATA_C | 3927 (-) | 1.000 | 0.938 | ttacaTTATCa V$GATA_C | 3979 (+) | 0.959 | 0.927 | tGATAGacacc V$GATA_C | 4004 (+) | 1.000 | 0.946 | cGATAAataaa V$GATA_C | 4371 (-) | 1.000 | 0.943 | taaagTTATCt V$GATA_C | 4579 (+) | 0.951 | 0.920 | aGATATagtct V$GATA_C | 4781 (+) | 1.000 | 0.959 | gGATAAagatt V$GATA_C | 5524 (-) | 0.959 | 0.935 | tcctcCTATCc V$GATA_C | 5584 (+) | 1.000 | 0.942 | gGATAAtaata V$GATA_C | 5635 (+) | 1.000 | 0.964 
| gGATAAagaca V$GATA_C | 5979 (+) | 1.000 | 0.943 | tGATAActtct V$GATA_C | 6467 (+) | 1.000 | 0.945 | tGATAAaatgc V$GATA_C | 7149 (-) | 0.953 | 0.944 | gcgccGTATCt V$GATA_C | 7312 (+) | 0.951 | 0.915 | gGATATagaat V$GATA_C | 7785 (-) | 1.000 | 0.936 | aaaaaTTATCc V$GATA_C | 7799 (+) | 1.000 | 0.965 | aGATAAagaac V$GATA_C | 7953 (+) | 0.953 | 0.907 | aGATACcccag V$GATA_C | 8059 (-) | 1.000 | 0.956 | ctagtTTATCt V$GATA_C | 8601 (-) | 1.000 | 0.937 | taataTTATCa V$GATA_C | 9364 (-) | 0.959 | 0.922 | gttctCTATCa V$GATA_C | 9734 (+) | 1.000 | 0.942 | tGATAAtgtcc V$GATA_C | 9835 (+) | 1.000 | 0.935 | gGATAAtttta V$GATA_C | 10451 (-) | 0.951 | 0.912 | cttttATATCa V$GATA_C | 10485 (-) | 0.951 | 0.928 | gccacATATCa V$GATA_C | 10591 (-) | 0.951 | 0.909 | ttcctATATCa V$GATA_C | 11345 (-) | 1.000 | 0.930 | tgccaTTATCg V$GATA_C | 11366 (-) | 0.959 | 0.927 | tgattCTATCt V$GATA_C | 11426 (-) | 0.959 | 0.906 | tttagCTATCc V$GATA_C | 11438 (-) | 1.000 | 0.989 | catacTTATCt V$GATA_C | 11566 (-) | 1.000 | 0.981 | aactcTTATCa V$GATA_C | 11578 (-) | 0.951 | 0.913 | tgcttATATCc F$GCN4_C | 134 (-) | 1.000 | 0.814 | tcAGTCAatc F$GCN4_C | 358 (-) | 0.892 | 0.832 | agATTCAaat F$GCN4_C | 622 (+) | 0.777 | 0.770 | acaTGACAct F$GCN4_C | 729 (-) | 0.777 | 0.792 | tgAGCCAcca F$GCN4_C | 799 (-) | 0.777 | 0.837 | tgAGCCAttt F$GCN4_C | 1030 (+) | 0.892 | 0.868 | agtTGAATca F$GCN4_C | 1033 (-) | 0.922 | 0.888 | tgAATCAaca F$GCN4_C | 1227 (+) | 1.000 | 0.795 | tgaTGACTgc F$GCN4_C | 1309 (-) | 0.777 | 0.787 | agAGTCCcct F$GCN4_C | 1355 (-) | 0.777 | 0.794 | tgGGTCAcct F$GCN4_C | 1447 (-) | 1.000 | 0.816 | gaAGTCActg F$GCN4_C | 1586 (-) | 0.777 | 0.768 | agAGCCAgca F$GCN4_C | 1684 (-) | 0.777 | 0.769 | ggTGTCAgct F$GCN4_C | 1803 (-) | 0.892 | 0.879 | ggATTCAtca F$GCN4_C | 1857 (-) | 0.777 | 0.777 | agAGCCAact F$GCN4_C | 2594 (+) | 1.000 | 0.799 | tacTGACTat F$GCN4_C | 2611 (-) | 0.777 | 0.770 | agAGCCAtgt F$GCN4_C | 2665 (+) | 0.777 | 0.797 | tgaTGACAct F$GCN4_C | 2843 (-) | 0.699 | 0.770 | ggCATCAttc 
F$GCN4_C | 3082 (+) | 0.922 | 0.891 | gggTGATTct F$GCN4_C | 3328 (+) | 0.922 | 0.845 | tccTGATTct F$GCN4_C | 3415 (+) | 0.922 | 0.933 | taaTGATTct F$GCN4_C | 3433 (+) | 1.000 | 0.815 | aatTGACTta F$GCN4_C | 3578 (+) | 0.699 | 0.768 | aagTTATTca F$GCN4_C | 3651 (-) | 0.922 | 0.903 | ggAATCAgta F$GCN4_C | 3724 (+) | 1.000 | 0.793 | gggTGACTaa F$GCN4_C | 3753 (-) | 1.000 | 0.836 | ttAGTCAttc F$GCN4_C | 3872 (-) | 0.777 | 0.804 | ggAGTTAata F$GCN4_C | 3935 (-) | 1.000 | 0.825 | tcAGTCActt F$GCN4_C | 4094 (+) | 0.922 | 0.904 | tacTGATTct F$GCN4_C | 4147 (+) | 0.777 | 0.783 | tggTGCCTcc F$GCN4_C | 4465 (+) | 1.000 | 0.809 | cacTGACTga F$GCN4_C | 4534 (-) | 0.777 | 0.782 | tgCGTCAaca F$GCN4_C | 4839 (-) | 0.777 | 0.788 | cgAGTGAtcc F$GCN4_C | 5079 (+) | 0.777 | 0.769 | ggcTGGCTct F$GCN4_C | 5263 (-) | 1.000 | 0.798 | acAGTCAtcc F$GCN4_C | 5535 (-) | 0.892 | 0.900 | agATTCActc F$GCN4_C | 5536 (+) | 0.777 | 0.806 | gatTCACTct F$GCN4_C | 6143 (+) | 0.777 | 0.770 | ccaTGCCTct F$GCN4_C | 6185 (-) | 0.777 | 0.785 | agAGACAccc F$GCN4_C | 6292 (+) | 1.000 | 0.805 | tgaTGACTga F$GCN4_C | 6414 (+) | 1.000 | 0.990 | gaaTGACTcc F$GCN4_C | 6557 (-) | 1.000 | 0.932 | agAGTCAtgg F$GCN4_C | 6792 (+) | 0.777 | 0.768 | tgcTGGCTct F$GCN4_C | 7037 (-) | 0.777 | 0.801 | agAGTTAgtt F$GCN4_C | 7043 (+) | 1.000 | 0.777 | agtTGACTat F$GCN4_C | 7545 (-) | 0.777 | 0.785 | tgAGCCAact F$GCN4_C | 7609 (-) | 0.777 | 0.807 | tgAGCCAtct F$GCN4_C | 7637 (-) | 0.699 | 0.773 | agAATTAttt F$GCN4_C | 7672 (+) | 0.777 | 0.792 | tggTGGCTca F$GCN4_C | 7883 (-) | 1.000 | 0.828 | gaAGTCAttt F$GCN4_C | 8045 (+) | 1.000 | 0.900 | cctTGACTcg F$GCN4_C | 8226 (+) | 0.777 | 0.773 | tgtTGGCTcc F$GCN4_C | 8324 (-) | 0.777 | 0.778 | tgCGTCAgcg F$GCN4_C | 8617 (-) | 0.922 | 0.768 | tcAATCActt F$GCN4_C | 8861 (+) | 0.777 | 0.768 | ttaAGACTct F$GCN4_C | 8886 (-) | 0.922 | 0.852 | agAATCAaac F$GCN4_C | 8907 (-) | 0.892 | 0.823 | agATTCAgga F$GCN4_C | 9290 (-) | 1.000 | 0.933 | agAGTCAgct F$GCN4_C | 9767 (-) | 1.000 | 0.769 | 
acAGTCAtgc F$GCN4_C | 10033 (+) | 0.892 | 0.847 | ttgTGAGTca F$GCN4_C | 10036 (-) | 1.000 | 0.957 | tgAGTCAccg F$GCN4_C | 10104 (-) | 0.777 | 0.807 | tgAGCCAtct F$GCN4_C | 10192 (-) | 1.000 | 0.800 | gcAGTCAgtt F$GCN4_C | 10366 (-) | 0.699 | 0.771 | agAATCTttc F$GCN4_C | 10503 (-) | 1.000 | 0.807 | aaAGTCAatt F$GCN4_C | 10631 (+) | 1.000 | 0.825 | aagTGACTga F$GCN4_C | 10682 (+) | 0.777 | 0.769 | gcaGGACTct F$GCN4_C | 10996 (-) | 0.922 | 0.846 | ggAATCAgat F$GCN4_C | 11363 (+) | 0.922 | 0.875 | gcaTGATTct F$GCN4_C | 11453 (+) | 0.892 | 0.876 | tggTGAATca F$GCN4_C | 11456 (-) | 0.922 | 0.914 | tgAATCAgtg F$GCN4_C | 11696 (-) | 0.922 | 0.768 | ttAATCActg F$GCN4_C | 11747 (+) | 1.000 | 0.820 | aaaTGACTag F$GCN4_C | 11759 (-) | 0.777 | 0.774 | agAGTTAaca V$GRE_C | 296 (+) | 0.722 | 0.771 | tttacagactGAGCTa V$GRE_C | 761 (-) | 0.978 | 0.740 | cAGGACctctggaaga V$GRE_C | 1147 (+) | 0.714 | 0.734 | gttactttatGTAGTt V$GRE_C | 1263 (+) | 0.952 | 0.746 | tcatccgtgtGTACTc V$GRE_C | 1399 (-) | 0.741 | 0.755 | gAGGAGagtgggtcca V$GRE_C | 1591 (-) | 0.960 | 0.738 | cAGCACtgggtttggg V$GRE_C | 1621 (-) | 0.741 | 0.786 | gTGGACagcatgtcca V$GRE_C | 1622 (+) | 0.741 | 0.792 | tggacagcatGTCCAg V$GRE_C | 1631 (+) | 0.741 | 0.730 | tgtccagcctTTCCTa V$GRE_C | 1785 (-) | 0.960 | 0.730 | gAGCACctgccattca V$GRE_C | 2591 (-) | 0.952 | 0.776 | cAGTACtgactattaa V$GRE_C | 2810 (+) | 0.762 | 0.798 | tgaacaccgtGCTCTg V$GRE_C | 2844 (+) | 0.978 | 0.743 | gcatcattccGTCCTt V$GRE_C | 3023 (-) | 0.952 | 0.800 | cAGTACaccgacttcc V$GRE_C | 3171 (-) | 0.714 | 0.742 | cTGTACattatatccc V$GRE_C | 3521 (-) | 0.741 | 0.731 | gAGGCCaccttggccc V$GRE_C | 3540 (+) | 1.000 | 0.760 | gtttggttctGTTCTt V$GRE_C | 3692 (-) | 1.000 | 0.741 | aAGAACacaagagatg V$GRE_C | 3965 (-) | 0.960 | 0.824 | gAGCACtggctgtatg V$GRE_C | 4073 (+) | 0.714 | 0.741 | tgtaaaatgtATACTa V$GRE_C | 4279 (-) | 0.960 | 0.842 | aAGCACacattagacc V$GRE_C | 4670 (+) | 1.000 | 0.738 | tttgctcataGTTCTg V$GRE_C | 4692 (+) | 0.978 | 0.817 | 
gaaacttgatGTCCTa V$GRE_C | 5175 (+) | 1.000 | 0.759 | ttgtgactgtGTTCTa V$GRE_C | 5204 (+) | 0.960 | 0.774 | gtggcctcttGTGCTt V$GRE_C | 5661 (-) | 1.000 | 0.783 | aAGAACacaataagag V$GRE_C | 6330 (-) | 0.960 | 0.798 | aAGCACacactggatt V$GRE_C | 6517 (-) | 1.000 | 0.877 | aAGAACacgaagttca V$GRE_C | 6913 (-) | 1.000 | 0.758 | gAGAACaaggaattga V$GRE_C | 7468 (+) | 0.960 | 0.744 | ggatttgagtGTGCTa V$GRE_C | 7531 (+) | 0.762 | 0.740 | gttacagttgGTTGTg V$GRE_C | 7804 (-) | 1.000 | 0.751 | aAGAACaacgacaact V$GRE_C | 8418 (+) | 1.000 | 0.820 | gtaagcagctGTTCTa V$GRE_C | 8533 (-) | 1.000 | 0.880 | aAGAACacagtgggcc V$GRE_C | 8750 (+) | 0.978 | 0.853 | cccacaccttGTCCTc V$GRE_C | 8845 (-) | 1.000 | 0.758 | cAGAACggtttaaaaa V$GRE_C | 8871 (+) | 0.762 | 0.767 | ggcatagaatATTCTa V$GRE_C | 8965 (-) | 1.000 | 0.789 | gAGAACcaggtgtggt V$GRE_C | 8966 (+) | 0.722 | 0.742 | agaaccaggtGTGGTg V$GRE_C | 8999 (-) | 0.960 | 0.738 | cAGCACatggaggcag V$GRE_C | 9354 (+) | 1.000 | 0.782 | tcttctcagtGTTCTc V$GRE_C | 9440 (-) | 0.722 | 0.746 | aAGCCCaaggtttacc V$GRE_C | 9464 (-) | 0.714 | 0.758 | cTGTACaggatgttct V$GRE_C | 9465 (+) | 1.000 | 0.959 | tgtacaggatGTTCTa V$GRE_C | 9533 (-) | 1.000 | 0.748 | tAGAACaaacaagtcc V$GRE_C | 9779 (-) | 1.000 | 0.739 | tAGAACaattttgttc V$GRE_C | 9950 (-) | 0.762 | 0.760 | tATAACaccctgtatg V$GRE_C | 10062 (+) | 0.762 | 0.769 | tgaaccgggtCTTCTg V$GRE_C | 10079 (-) | 0.741 | 0.732 | aAGGGCagccagtgct V$GRE_C | 10080 (+) | 0.960 | 0.763 | agggcagccaGTGCTc V$GRE_C | 10189 (+) | 1.000 | 0.750 | tcagcagtcaGTTCTg V$GRE_C | 10194 (+) | 0.978 | 0.740 | agtcagttctGTCCTt V$GRE_C | 10298 (-) | 0.741 | 0.738 | tCGGACattttttaac V$GRE_C | 10566 (+) | 0.722 | 0.825 | ggtacagcgtTTGCTt V$GRE_C | 10645 (+) | 0.978 | 0.852 | gacacccagtGTCCTa V$GRE_C | 10683 (-) | 0.978 | 0.732 | cAGGACtctggcaaca V$GRE_C | 10883 (-) | 0.960 | 0.750 | gAGCACtgcacatgcg V$GRE_C | 11072 (+) | 0.762 | 0.830 | gttacaagctGCTCTg V$GRE_C | 11223 (+) | 1.000 | 0.848 | tgcccagtttGTTCTa V$GRE_C | 11367 (+) | 
0.960 | 0.796 | gattctatctGTGCTt V$HNF1_C | 387 (+) | 0.670 | 0.805 | tGCCATtcatttgccaa V$HNF1_C | 985 (-) | 0.750 | 0.808 | acagttaataaTTTTCa V$HNF1_C | 987 (+) | 1.000 | 0.880 | aGTTAAtaattttcatg V$HNF1_C | 3412 (+) | 1.000 | 0.821 | aGTTAAtgattctggtt V$HNF1_C | 4056 (+) | 0.816 | 0.837 | aGGCAAtaattaaaata V$HNF1_C | 6834 (+) | 0.796 | 0.792 | tGCTATttactgccaac V$HNF1_C | 8145 (-) | 0.942 | 0.825 | gagggaaaacaTTAGCt V$HNF1_C | 9876 (+) | 0.771 | 0.795 | aGGAAAtcaatttaaaa V$HNF1_C | 10298 (-) | 1.000 | 0.819 | tcggacattttTTAACt V$HNF1_C | 11694 (+) | 1.000 | 0.857 | aGTTAAtcactgtcacc V$NFKB_C | 330 (-) | 0.973 | 0.983 | tggaaaTTCCCt V$NFKB_C | 1952 (+) | 0.760 | 0.851 | tGGAACtttcct V$NFKB_C | 4637 (-) | 0.977 | 0.867 | agcaaaATCCCg V$NFKB_C | 4948 (+) | 1.000 | 0.870 | tGGGACtttctg V$NFKB_C | 5163 (+) | 1.000 | 0.878 | aGGGACtttcac V$NFKB_C | 11086 (+) | 0.973 | 0.868 | tGGGAAttttcc V$NFKB_C | 11087 (+) | 0.737 | 0.854 | gGGAATtttcca V$NFY_C | 458 (+) | 0.800 | 0.811 | ttTGGTTggcttat V$NFY_C | 529 (+) | 0.800 | 0.789 | aaTGATGggtgatg V$NFY_C | 1020 (+) | 1.000 | 0.893 | tgTGATTggcagtt V$NFY_C | 1113 (+) | 1.000 | 0.785 | gcTGATTtgtgtat V$NFY_C | 1424 (-) | 1.000 | 0.794 | gacacgcAATCAgt V$NFY_C | 1789 (-) | 0.800 | 0.785 | acctgccATTCAca V$NFY_C | 2933 (+) | 1.000 | 0.816 | gcTGATTgcgaagt V$NFY_C | 3225 (-) | 0.800 | 0.834 | ttttaccAGTCAgc V$NFY_C | 3280 (+) | 1.000 | 0.851 | taTGATTgggatga V$NFY_C | 5086 (+) | 0.800 | 0.817 | tcTGACTggtttgg V$NFY_C | 9186 (-) | 0.800 | 0.780 | agagaccACTCAag V$NFY_C | 9449 (-) | 0.800 | 0.814 | gtttaccAATCTct V$NFY_C | 10759 (+) | 0.800 | 0.797 | tgTGTTTgggcact V$NFY_C | 11196 (+) | 1.000 | 0.855 | ctTGATTgattatt V$NFY_C | 11800 (+) | 0.800 | 0.776 | ccAGATTggaaggt V$OCT_C | 279 (-) | 0.761 | 0.835 | gcatGCAAGtcaa V$OCT_C | 992 (+) | 0.884 | 0.901 | ataaTTTTCatgt V$OCT_C | 2626 (-) | 1.000 | 0.830 | ctagGCAAAtgct V$OCT_C | 3108 (+) | 0.859 | 0.896 | ctaaTTTCCatgc V$OCT_C | 3128 (+) | 0.859 | 0.882 | ttaaTTTACattc V$OCT_C | 3834 
(+) | 0.859 | 0.822 | taaaTTTCCattc V$OCT_C | 4793 (+) | 0.761 | 0.855 | ctgaCTTGCataa V$OCT_C | 5340 (+) | 1.000 | 0.814 | tagtTTTGCatcc V$OCT_C | 5435 (+) | 0.761 | 0.817 | tttaTTTGTattg V$OCT_C | 5748 (-) | 1.000 | 0.799 | agacGCAAAggac V$OCT_C | 6576 (-) | 1.000 | 0.900 | ttatGCAAAacac V$OCT_C | 6968 (-) | 0.859 | 0.810 | ggatGGAAAggaa V$OCT_C | 7351 (-) | 0.742 | 0.822 | acatGTTAAtcat V$OCT_C | 9486 (+) | 1.000 | 0.819 | tttaTTTGCaata V$OCT_C | 9731 (-) | 0.767 | 0.801 | gtatGATAAtgtc V$OCT_C | 9933 (+) | 1.000 | 0.965 | gtaaTTTGCatgt V$OCT_C | 10910 (-) | 1.000 | 0.983 | ctatGCAAAtaat V$OCT_C | 11153 (-) | 0.884 | 0.886 | tgatGAAAAtgaa V$OCT_C | 11336 (-) | 0.859 | 0.809 | cgatGTAAAtgcc V$OCT_C | 11664 (+) | 0.884 | 0.852 | acaaTTTTCataa V$PADS_C | 933 (-) | 1.000 | 0.957 | gaaACCACa V$PADS_C | 3359 (-) | 1.000 | 0.952 | cagACCACt V$PADS_C | 5049 (+) | 1.000 | 0.953 | tGTGGTcta V$PADS_C | 9187 (-) | 1.000 | 0.993 | gagACCACt F$RAP1_C | 3595 (-) | 1.000 | 0.969 | tatgttTGGGTttc V$SRF_C | 3860 (-) | 0.871 | 0.886 | agtaCCATTtaagga V$SRF_C | 3863 (+) | 0.925 | 0.893 | accattTAAGGagtt V$SRF_C | 11534 (-) | 0.836 | 0.882 | aatcCCACAtatggt V$SRF_C | 11537 (+) | 1.000 | 0.874 | cccacaTATGGtaga V$TATA_C | 239 (-) | 1.000 | 0.885 | tTTTTAaatc V$TATA_C | 294 (-) | 1.000 | 0.923 | gTTTTAcaga V$TATA_C | 502 (-) | 0.853 | 0.811 | aTTATAtatt V$TATA_C | 521 (+) | 1.000 | 0.889 | atttTAAAAa V$TATA_C | 521 (-) | 1.000 | 0.886 | aTTTTAaaaa V$TATA_C | 550 (-) | 1.000 | 0.837 | gTTTTAacct V$TATA_C | 558 (-) | 0.790 | 0.833 | cTTTTCtagg V$TATA_C | 603 (-) | 1.000 | 0.903 | tTTTTAaaca V$TATA_C | 634 (-) | 1.000 | 0.903 | tTTTTAaaca V$TATA_C | 655 (-) | 0.853 | 0.811 | aTTATAtata V$TATA_C | 657 (+) | 0.853 | 0.817 | tataTATAAg V$TATA_C | 823 (-) | 1.000 | 0.828 | tTTTTAatac V$TATA_C | 831 (+) | 0.853 | 0.881 | actaTATAAa V$TATA_C | 833 (+) | 1.000 | 0.926 | tataTAAAAc V$TATA_C | 1061 (+) | 1.000 | 0.950 | acttTAAAAt V$TATA_C | 1136 (+) | 1.000 | 0.818 | ttccTAAAAc V$TATA_C | 1743 
(+) | 0.841 | 0.867 | cctaTAGAAc V$TATA_C | 1890 (-) | 1.000 | 0.927 | aTTTTAtatt V$TATA_C | 1901 (-) | 1.000 | 0.870 | aTTTTAttat V$TATA_C | 1909 (-) | 1.000 | 0.867 | aTTTTAtttg V$TATA_C | 2404 (+) | 1.000 | 0.911 | tcctTAAAAa V$TATA_C | 2534 (-) | 1.000 | 0.929 | cTTTTAgaga V$TATA_C | 2599 (+) | 0.790 | 0.827 | actaTTAAAc V$TATA_C | 2901 (-) | 1.000 | 0.885 | tTTTTAtcca V$TATA_C | 3404 (+) | 1.000 | 0.936 | gttaTAAAAg V$TATA_C | 3451 (+) | 1.000 | 0.869 | ttaaTAAAAc V$TATA_C | 3727 (+) | 1.000 | 0.814 | tgacTAAAAt V$TATA_C | 3750 (-) | 1.000 | 0.816 | tTTTTAgtca V$TATA_C | 3760 (+) | 1.000 | 0.851 | ttctTAAAAg V$TATA_C | 3846 (+) | 1.000 | 0.889 | ctcaTAAAAt V$TATA_C | 4062 (+) | 1.000 | 0.822 | taatTAAAAt V$TATA_C | 4071 (+) | 1.000 | 0.857 | tatgTAAAAt V$TATA_C | 4090 (-) | 1.000 | 0.863 | aTTTTActga V$TATA_C | 4099 (+) | 1.000 | 0.859 | attcTAAAAc V$TATA_C | 4123 (+) | 1.000 | 0.822 | atgtTAAAAc V$TATA_C | 4216 (+) | 1.000 | 0.823 | cagtTAAAAa V$TATA_C | 4315 (+) | 1.000 | 0.903 | tgcaTAAAAc V$TATA_C | 4421 (+) | 1.000 | 0.877 | tgtcTAAAAa V$TATA_C | 4423 (+) | 0.790 | 0.830 | tctaAAAAAa V$TATA_C | 5385 (-) | 1.000 | 0.866 | gTTTTAtttt V$TATA_C | 5439 (-) | 0.841 | 0.810 | tTTGTAttgc V$TATA_C | 5447 (+) | 0.853 | 0.836 | gccaTATAAc V$TATA_C | 5559 (+) | 0.790 | 0.827 | actaGAAAAc V$TATA_C | 5576 (-) | 0.841 | 0.830 | cTTCTAaagg V$TATA_C | 5589 (+) | 1.000 | 0.870 | ataaTAAAAt V$TATA_C | 5645 (+) | 1.000 | 0.869 | aaaaTAAAAa V$TATA_C | 5765 (+) | 1.000 | 0.817 | agggTAAAAg V$TATA_C | 5952 (-) | 1.000 | 0.867 | aTTTTAtttg V$TATA_C | 5968 (-) | 1.000 | 0.844 | tTTTTAagtg V$TATA_C | 6466 (+) | 1.000 | 0.868 | ctgaTAAAAt V$TATA_C | 6510 (+) | 1.000 | 0.872 | caaaTAAAAg V$TATA_C | 6669 (-) | 1.000 | 0.867 | aTTTTAttta V$TATA_C | 6826 (-) | 1.000 | 0.889 | tTTTTAtgtg V$TATA_C | 7260 (+) | 1.000 | 0.889 | tcatTAAAAt V$TATA_C | 7270 (+) | 1.000 | 0.819 | atccTAAAAt V$TATA_C | 7416 (-) | 1.000 | 0.956 | cTTTTAaagg V$TATA_C | 7434 (-) | 1.000 | 0.869 | tTTTTAtttt 
V$TATA_C | 7439 (-) | 1.000 | 0.886 | aTTTTAtgtg V$TATA_C | 7643 (+) | 1.000 | 0.889 | atttTAAAAa V$TATA_C | 7643 (-) | 1.000 | 0.886 | aTTTTAaaaa V$TATA_C | 7895 (-) | 1.000 | 0.869 | tTTTTAtttt V$TATA_C | 8851 (+) | 1.000 | 0.903 | ggttTAAAAa V$TATA_C | 8948 (+) | 1.000 | 0.886 | ttttTAAAAt V$TATA_C | 8948 (-) | 1.000 | 0.889 | tTTTTAaaat V$TATA_C | 9412 (+) | 0.790 | 0.827 | tctaGAAAAc V$TATA_C | 9493 (+) | 0.841 | 0.810 | gcaaTAGAAa V$TATA_C | 9711 (+) | 1.000 | 0.874 | ggtgTAAAAt V$TATA_C | 9742 (+) | 0.841 | 0.827 | tccaTACAAt V$TATA_C | 9775 (+) | 0.841 | 0.826 | gccaTAGAAc V$TATA_C | 9825 (-) | 1.000 | 0.860 | aTTTTAgaag V$TATA_C | 9840 (-) | 1.000 | 0.950 | aTTTTAaagt V$TATA_C | 9884 (+) | 1.000 | 0.883 | aattTAAAAt V$TATA_C | 9897 (-) | 1.000 | 0.953 | tTTTTAaaga V$TATA_C | 10122 (+) | 1.000 | 0.934 | ccaaTAAAAt V$TATA_C | 10306 (-) | 1.000 | 0.823 | tTTTTAactg V$TATA_C | 10316 (-) | 1.000 | 0.930 | tTTTTAtatt V$TATA_C | 10324 (-) | 1.000 | 0.869 | tTTTTAtttt V$TATA_C | 10329 (-) | 1.000 | 0.994 | aTTTTAtagt V$TATA_C | 10338 (-) | 1.000 | 0.936 | tTTTTAttgt V$TATA_C | 10346 (-) | 1.000 | 0.821 | gTTTTAattc V$TATA_C | 10388 (+) | 0.853 | 0.840 | tccaTATAAa V$TATA_C | 10390 (+) | 1.000 | 0.930 | cataTAAAAa V$TATA_C | 10418 (-) | 1.000 | 0.826 | tTTTTAacac V$TATA_C | 10451 (-) | 1.000 | 0.933 | cTTTTAtatc V$TATA_C | 10732 (-) | 1.000 | 0.939 | cTTTTAttgg V$TATA_C | 11470 (-) | 1.000 | 0.869 | tTTTTAtttg V$USF_C | 19 (-) | 0.929 | 0.940 | acATGTGa V$USF_C | 736 (+) | 1.000 | 0.996 | cCACGTgg V$USF_C | 736 (-) | 1.000 | 0.996 | ccACGTGg V$USF_C | 999 (-) | 0.929 | 0.943 | tcATGTGg V$USF_C | 1127 (+) | 1.000 | 0.992 | gCACGTgt V$USF_C | 1127 (-) | 1.000 | 0.995 | gcACGTGt V$USF_C | 1393 (-) | 0.918 | 0.934 | gcAGGTGa V$USF_C | 2615 (-) | 0.929 | 0.937 | ccATGTGt V$USF_C | 2774 (-) | 0.929 | 0.942 | gcATGTGa V$USF_C | 3642 (+) | 0.929 | 0.941 | cCACATgg V$USF_C | 4340 (-) | 0.918 | 0.933 | ccAGGTGg V$USF_C | 5251 (+) | 0.929 | 0.940 | tCACATgt V$USF_C | 5253 (-) 
| 0.929 | 0.943 | acATGTGg V$USF_C | 6065 (+) | 1.000 | 0.994 | gCACGTgc V$USF_C | 6065 (-) | 1.000 | 0.994 | gcACGTGc V$USF_C | 7113 (+) | 0.929 | 0.943 | cCACATgt V$USF_C | 7115 (-) | 0.929 | 0.939 | acATGTGt V$USF_C | 7392 (-) | 0.918 | 0.933 | ccAGGTGg V$USF_C | 9001 (+) | 0.929 | 0.935 | gCACATgg V$USF_C | 10890 (+) | 0.929 | 0.939 | gCACATgc P$MYBPH3_01 | 1139 (+) | 0.984 | 0.972 | ctaaaaCCGTTac P$MYBPH3_01 | 10309 (-) | 0.967 | 0.963 | ttAACTGttttta P$MYBPH3_02 | 7036 (+) | 1.000 | 0.933 | aagagtTAGTTga V$SREBP1_02 | 195 (+) | 0.600 | 0.747 | tcCCACAccac V$SREBP1_02 | 532 (-) | 1.000 | 0.782 | gatgGGTGAtg V$SREBP1_02 | 780 (-) | 0.800 | 0.790 | gtcgGGTGCtc V$SREBP1_02 | 1278 (-) | 0.800 | 0.764 | ctggGATGAca V$SREBP1_02 | 1327 (+) | 0.800 | 0.747 | tcTCACTcccc V$SREBP1_02 | 2336 (+) | 0.800 | 0.764 | tgTGACCctac V$SREBP1_02 | 3181 (+) | 0.800 | 0.790 | taTCCCCcctc V$SREBP1_02 | 3232 (+) | 0.800 | 0.752 | agTCAGCcaac V$SREBP1_02 | 3245 (+) | 0.600 | 0.747 | tcCCAGCccac V$SREBP1_02 | 3290 (-) | 1.000 | 0.747 | atgaGGTGAac V$SREBP1_02 | 4144 (-) | 0.800 | 0.752 | gtgtGGTGCct V$SREBP1_02 | 4328 (-) | 1.000 | 0.764 | gtctGGTGAca V$SREBP1_02 | 5135 (-) | 0.800 | 0.747 | gtgcGGTCAac V$SREBP1_02 | 5424 (+) | 0.600 | 0.747 | ttTCCCTccac V$SREBP1_02 | 5763 (-) | 0.800 | 0.747 | gtagGGTAAaa V$SREBP1_02 | 6437 (+) | 0.800 | 0.764 | ggTCAACccag V$SREBP1_02 | 6992 (-) | 0.800 | 0.782 | gtgaGGAGAtg V$SREBP1_02 | 7831 (+) | 0.600 | 0.778 | aaTTAACccac V$SREBP1_02 | 8542 (-) | 0.600 | 0.747 | gtggGCCGAga V$SREBP1_02 | 8664 (+) | 0.600 | 0.747 | ttTCAAAccac V$SREBP1_02 | 8741 (-) | 0.800 | 0.764 | gtgtGGTGGcc V$SREBP1_02 | 8960 (-) | 0.800 | 0.747 | gtagGGAGAac V$SREBP1_02 | 8974 (-) | 0.800 | 0.790 | gtgtGGTGGta V$SREBP1_02 | 9320 (-) | 0.800 | 0.747 | gttgGGTCAga V$SREBP1_02 | 9574 (-) | 0.800 | 0.790 | gtgtGGTGGta V$SREBP1_02 | 9618 (-) | 0.600 | 0.790 | gtggGAGGAtc V$SREBP1_02 | 9686 (-) | 0.800 | 0.782 | ggggGGTGGtg V$SREBP1_02 | 9709 (-) | 0.600 | 0.747 | gtggTGTAAaa 
V$SREBP1_02 | 10876 (-) | 0.600 | 0.747 | gtggTGGGAgc V$SREBP1_02 | 11257 (-) | 0.600 | 0.764 | gtggTGTGCca V$SREBP1_02 | 11704 (+) | 1.000 | 0.869 | tgTCACCccaa V$SREBP1_02 | 11823 (-) | 1.000 | 0.852 | ttggGGTGAga V$HAND1E47_01 | 39 (-) | 0.853 | 0.812 | aactTCAGAtgtcatt V$HAND1E47_01 | 114 (+) | 0.856 | 0.841 | cagtgagGCTGGctaa V$HAND1E47_01 | 139 (-) | 1.000 | 0.939 | caatCCAGAcatttgc V$HAND1E47_01 | 203 (+) | 0.868 | 0.837 | caccatgCCTGGtatt V$HAND1E47_01 | 240 (+) | 1.000 | 0.904 | ttttaaaTCTGGgctg V$HAND1E47_01 | 295 (-) | 0.859 | 0.816 | ttttACAGActgagct V$HAND1E47_01 | 352 (-) | 0.853 | 0.814 | tacaTCAGAttcaaat V$HAND1E47_01 | 446 (+) | 0.859 | 0.820 | caggttgTCTGTtttg V$HAND1E47_01 | 589 (-) | 0.859 | 0.831 | attaACAGActtactt V$HAND1E47_01 | 680 (-) | 0.853 | 0.820 | ctctTCAGAcacacca V$HAND1E47_01 | 689 (-) | 1.000 | 0.882 | cacaCCAGAagagggc V$HAND1E47_01 | 739 (+) | 0.856 | 0.814 | cgtggttGCTGGgaat V$HAND1E47_01 | 761 (+) | 1.000 | 0.906 | caggaccTCTGGaaga V$HAND1E47_01 | 1270 (+) | 1.000 | 0.903 | tgtgtacTCTGGgatg V$HAND1E47_01 | 1286 (+) | 0.868 | 0.829 | acagggaCCTGGacct V$HAND1E47_01 | 1474 (+) | 1.000 | 0.908 | tatttgcTCTGGcaga V$HAND1E47_01 | 1553 (+) | 1.000 | 0.921 | aaactctTCTGGaaat V$HAND1E47_01 | 1873 (+) | 0.871 | 0.832 | gggaggaTCTGCctcc V$HAND1E47_01 | 1943 (+) | 0.856 | 0.825 | tggctagGCTGGaact V$HAND1E47_01 | 1993 (+) | 1.000 | 0.933 | aagattaTCTGGctat V$HAND1E47_01 | 2043 (+) | 0.868 | 0.841 | cactgtaCCTGGcttc V$HAND1E47_01 | 2076 (+) | 0.859 | 0.819 | gtctctgTCTGTctct V$HAND1E47_01 | 2301 (+) | 0.859 | 0.834 | gtgtgtgTCTGTgtgt V$HAND1E47_01 | 2381 (-) | 1.000 | 0.892 | tttcCCAGAgctgaga V$HAND1E47_01 | 2726 (+) | 0.871 | 0.816 | ataatctTCTGCcttt V$HAND1E47_01 | 2827 (+) | 0.871 | 0.844 | ccagttgTCTGCaaat V$HAND1E47_01 | 2939 (+) | 0.871 | 0.837 | tgcgaagTCTGCctat V$HAND1E47_01 | 2967 (-) | 0.871 | 0.833 | gacgGCAGAtgcgcag V$HAND1E47_01 | 3226 (-) | 0.868 | 0.826 | tttaCCAGTcagccaa V$HAND1E47_01 | 3243 (-) | 0.856 | 0.824 | 
attcCCAGCccactta V$HAND1E47_01 | 3354 (-) | 0.859 | 0.819 | tcagACAGAccactga V$HAND1E47_01 | 3397 (-) | 0.868 | 0.809 | atctCCAGTtataaaa V$HAND1E47_01 | 3415 (+) | 1.000 | 0.918 | taatgatTCTGGttgc V$HAND1E47_01 | 3600 (+) | 0.871 | 0.828 | ttgggttTCTGCagtt V$HAND1E47_01 | 3664 (-) | 0.859 | 0.810 | taacACAGActtagaa V$HAND1E47_01 | 3788 (+) | 1.000 | 0.889 | tttttctTCTGGacag V$HAND1E47_01 | 3854 (-) | 0.868 | 0.827 | atatCCAGTaccattt V$HAND1E47_01 | 4115 (-) | 1.000 | 0.920 | tttcCCAGAtgttaaa V$HAND1E47_01 | 4129 (-) | 1.000 | 0.929 | aaacCCAGAatctagg V$HAND1E47_01 | 4322 (+) | 1.000 | 0.940 | aactgggTCTGGtgac V$HAND1E47_01 | 4372 (+) | 1.000 | 0.930 | aaagttaTCTGGctac V$HAND1E47_01 | 4771 (+) | 1.000 | 0.956 | cagcgagTCTGGataa V$HAND1E47_01 | 4801 (+) | 1.000 | 0.926 | cataatcTCTGGcatt V$HAND1E47_01 | 4818 (-) | 0.868 | 0.816 | gttaCCAGGcggggta V$HAND1E47_01 | 4875 (+) | 1.000 | 0.899 | gcaaagcTCTGGaaca V$HAND1E47_01 | 4939 (+) | 0.868 | 0.811 | agactgaACTGGgact V$HAND1E47_01 | 4994 (-) | 0.871 | 0.852 | aatgGCAGAccttgta V$HAND1E47_01 | 5022 (+) | 0.868 | 0.848 | ccacgggACTGGgatg V$HAND1E47_01 | 5073 (+) | 0.856 | 0.849 | atgagagGCTGGctct V$HAND1E47_01 | 5083 (+) | 0.868 | 0.813 | ggctctgACTGGtttg V$HAND1E47_01 | 5223 (+) | 0.868 | 0.844 | agcggagACTGGgaac V$HAND1E47_01 | 5529 (-) | 1.000 | 0.942 | ctatCCAGAttcactc V$HAND1E47_01 | 5567 (-) | 0.868 | 0.827 | acaaCCAGGcttctaa V$HAND1E47_01 | 5673 (-) | 0.859 | 0.848 | agagACAGAcacagag V$HAND1E47_01 | 5724 (+) | 0.868 | 0.811 | acactaaACTGGaagc V$HAND1E47_01 | 5980 (+) | 1.000 | 0.912 | gataactTCTGGatta V$HAND1E47_01 | 6033 (+) | 1.000 | 0.890 | ggacccaTCTGGtgca V$HAND1E47_01 | 6112 (-) | 1.000 | 0.872 | ttgaCCAGAgggcttg V$HAND1E47_01 | 6143 (+) | 1.000 | 0.930 | ccatgccTCTGGctct V$HAND1E47_01 | 6179 (-) | 1.000 | 0.913 | gttcCCAGAgacaccc V$HAND1E47_01 | 6232 (+) | 1.000 | 0.940 | catgctgTCTGGctac V$HAND1E47_01 | 6417 (+) | 1.000 | 0.913 | tgactccTCTGGcttt V$HAND1E47_01 | 6440 (-) | 0.868 | 0.812 | caacCCAGTcatggca 
V$HAND1E47_01 | 6482 (+) | 0.868 | 0.837 | gatgaggACTGGattg V$HAND1E47_01 | 6493 (-) | 0.871 | 0.844 | gattGCAGAtgcggcc V$HAND1E47_01 | 6632 (-) | 1.000 | 0.886 | gcacCCAGAaggcacc V$HAND1E47_01 | 6645 (+) | 0.871 | 0.826 | acctcagTCTGCaaac V$HAND1E47_01 | 6778 (-) | 0.871 | 0.813 | aacaGCAGAtctaatg V$HAND1E47_01 | 6859 (-) | 1.000 | 0.914 | acgtCCAGAccgaggc V$HAND1E47_01 | 6958 (+) | 1.000 | 0.895 | ccattatTCTGGatgg V$HAND1E47_01 | 7135 (-) | 1.000 | 0.890 | gaggCCAGAagagggc V$HAND1E47_01 | 7159 (+) | 0.856 | 0.827 | tcctggaGCTGGcgtt V$HAND1E47_01 | 7171 (-) | 0.871 | 0.808 | cgttGCAGAtggtccc V$HAND1E47_01 | 7321 (+) | 1.000 | 0.948 | atgtacgTCTGGaaca V$HAND1E47_01 | 7556 (+) | 0.856 | 0.811 | tgtagatGCTGGaaac V$HAND1E47_01 | 7617 (-) | 0.856 | 0.826 | ctctCCAGCcgcctgg V$HAND1E47_01 | 7621 (+) | 0.868 | 0.818 | ccagccgCCTGGcggt V$HAND1E47_01 | 7661 (-) | 0.868 | 0.839 | tatgCCAGGcatggtg V$HAND1E47_01 | 7689 (-) | 1.000 | 0.917 | aaagCCAGAacttggg V$HAND1E47_01 | 7737 (+) | 0.856 | 0.808 | atcacagGCTGGctac V$HAND1E47_01 | 7836 (+) | 1.000 | 0.912 | acccactTCTGGcaaa V$HAND1E47_01 | 7956 (-) | 1.000 | 0.918 | taccCCAGActatagt V$HAND1E47_01 | 8060 (+) | 1.000 | 0.941 | tagtttaTCTGGtatt V$HAND1E47_01 | 8072 (-) | 0.871 | 0.830 | tattGCAGAaacttcc V$HAND1E47_01 | 8089 (+) | 1.000 | 0.884 | atacagcTCTGGgtcc V$HAND1E47_01 | 8125 (+) | 0.871 | 0.819 | tgcctggTCTGCtttc V$HAND1E47_01 | 8159 (-) | 1.000 | 0.923 | gcttCCAGAggcaggt V$HAND1E47_01 | 8175 (-) | 1.000 | 0.889 | agcaCCAGAacttgtt V$HAND1E47_01 | 8237 (-) | 1.000 | 0.911 | acacCCAGAtgaaggt V$HAND1E47_01 | 8402 (-) | 0.856 | 0.819 | caagCCAGCccggcct V$HAND1E47_01 | 8518 (-) | 0.859 | 0.825 | actcACAGAtgcttca V$HAND1E47_01 | 8712 (+) | 0.853 | 0.825 | caaggagTCTGAagaa V$HAND1E47_01 | 8765 (+) | 0.868 | 0.817 | cccagcaCCTGGgagg V$HAND1E47_01 | 8809 (+) | 1.000 | 0.929 | gccagctTCTGGtact V$HAND1E47_01 | 8861 (+) | 1.000 | 0.939 | ttaagacTCTGGcata V$HAND1E47_01 | 8901 (-) | 0.859 | 0.819 | acaaACAGAttcagga V$HAND1E47_01 | 
9017 (-) | 0.871 | 0.811 | gcagGCAGAtctttgt V$HAND1E47_01 | 9040 (+) | 0.868 | 0.808 | agggcagCCTGGtcta V$HAND1E47_01 | 9063 (-) | 0.868 | 0.834 | agttCCAGGctagcca V$HAND1E47_01 | 9232 (-) | 1.000 | 0.919 | agctCCAGAgccctag V$HAND1E47_01 | 9327 (+) | 1.000 | 0.902 | cagagccTCTGGagga V$HAND1E47_01 | 9386 (+) | 0.868 | 0.818 | gtcagccCCTGGaatg V$HAND1E47_01 | 9497 (+) | 0.853 | 0.810 | tagaaaaTCTGAaagt V$HAND1E47_01 | 9562 (+) | 1.000 | 0.920 | gtcggttTCTGGgtgt V$HAND1E47_01 | 9961 (+) | 1.000 | 0.919 | gtatgttTCTGGttcc V$HAND1E47_01 | 9999 (+) | 1.000 | 0.905 | taggtccTCTGGgact V$HAND1E47_01 | 10005 (+) | 0.868 | 0.837 | ctctgggACTGGaggt V$HAND1E47_01 | 10067 (+) | 1.000 | 0.908 | cgggtctTCTGGaagg V$HAND1E47_01 | 10478 (-) | 0.856 | 0.827 | tagaCCAGCcacatat V$HAND1E47_01 | 10682 (+) | 1.000 | 0.932 | gcaggacTCTGGcaac V$HAND1E47_01 | 10986 (+) | 0.868 | 0.825 | gtgtggcCCTGGaatc V$HAND1E47_01 | 11077 (+) | 1.000 | 0.901 | aagctgcTCTGGgaat V$HAND1E47_01 | 11112 (-) | 0.853 | 0.822 | gactTCAGAtccctta V$HAND1E47_01 | 11407 (+) | 0.871 | 0.811 | catttatTCTGCattt V$HAND1E47_01 | 11619 (-) | 1.000 | 0.897 | ggtaCCAGAaggaccc V$HAND1E47_01 | 11770 (-) | 1.000 | 0.959 | gattCCAGAtacttga V$HAND1E47_01 | 11796 (-) | 1.000 | 0.929 | aatcCCAGAttggaag V$STAT_01 | 927 (+) | 0.933 | 0.936 | TTCCAcgaa V$STAT_01 | 927 (-) | 0.780 | 0.866 | ttccACGAA V$STAT_01 | 1136 (+) | 0.956 | 0.906 | TTCCTaaaa V$STAT_01 | 1559 (-) | 0.824 | 0.878 | ttctGGAAA V$STAT_01 | 1821 (-) | 1.000 | 0.969 | ttcaGGGAA V$STAT_01 | 2403 (+) | 0.956 | 0.890 | TTCCTtaaa V$STAT_01 | 2566 (+) | 0.824 | 0.881 | TTTCCgtaa V$STAT_01 | 3138 (+) | 1.000 | 0.933 | TTCCCacaa V$STAT_01 | 3844 (+) | 0.854 | 0.883 | TTCTCataa V$STAT_01 | 4605 (+) | 0.780 | 0.866 | TTGCTggaa V$STAT_01 | 4605 (-) | 0.933 | 0.908 | ttgcTGGAA V$STAT_01 | 6875 (-) | 0.956 | 0.945 | ttatAGGAA V$STAT_01 | 7842 (-) | 0.824 | 0.878 | ttctGGCAA V$STAT_01 | 8612 (+) | 0.948 | 0.886 | TTCCGtcaa V$STAT_01 | 10052 (-) | 0.809 | 0.868 | ttctAAGAA V$STAT3_01 
| 921 (-) | 0.775 | 0.829 | tagcacttccaCGAAAccaca P$P_01 | 3637 (+) | 1.000 | 0.997 | acCTACCac P$P_01 | 10236 (-) | 0.992 | 0.988 | taGGTTGgt P$P_01 | 11860 (-) | 1.000 | 0.998 | ggGGTAGgt V$VMYB_02 | 1144 (-) | 1.000 | 0.990 | aCCGTTact V$VMYB_02 | 11518 (-) | 1.000 | 0.977 | aCCGTTgtc N$SKN1_02 | 8604 (+) | 1.000 | 0.987 | tattATCATtcc N$SKN1_02 | 9730 (-) | 1.000 | 0.983 | tgtATGATaatg V$MEF2_02 | 2526 (-) | 0.829 | 0.797 | acatatTACTTttagagaataa V$MEF2_02 | 3085 (-) | 1.000 | 0.898 | tgattcTATTTttcgaatctat V$MEF2_02 | 3734 (-) | 1.000 | 0.793 | aattatTATTTctacattttta V$MEF2_02 | 3997 (+) | 1.000 | 0.800 | gaaacaacgatAAATAaatatt V$MEF2_02 | 5801 (+) | 1.000 | 0.832 | acaaaaaaaaaAAATAgccctg V$MEF2_02 | 6249 (-) | 1.000 | 0.795 | gctttcTATTTgtcccatctgc V$MEF2_02 | 6831 (-) | 1.000 | 0.815 | atgtgcTATTTactgccaactg V$MEF2_02 | 7044 (-) | 1.000 | 0.822 | gttgacTATTTtttttcttaat V$MEF2_02 | 8692 (+) | 1.000 | 0.814 | agcctttctgaAAATAccatca V$MEF2_02 | 8944 (+) | 0.838 | 0.823 | caactttttaaAATTAgtaggg V$MEF2_02 | 9402 (-) | 0.803 | 0.834 | tggttcTATGTctagaaaacta V$MEF2_02 | 9419 (+) | 1.000 | 0.806 | aactatcccatAAATAacagga V$MEF2_02 | 10118 (+) | 0.838 | 0.809 | atccccaataaAATTAgttaag V$MEF2_02 | 10316 (-) | 1.000 | 0.826 | tttttaTATTTttattttatag V$MEF2_02 | 10908 (+) | 0.853 | 0.816 | tactatgcaaaTAATAgtctag V$MEF2_02 | 11746 (+) | 0.766 | 0.798 | caaatgactagAAAGAgttaac V$MEF2_02 | 11937 (-) | 1.000 | 0.867 | tgaaacTATTTcaagggtcagg V$MEF2_03 | 956 (-) | 1.000 | 0.828 | atcaaaTATTTacaggacccct V$MEF2_03 | 3085 (-) | 1.000 | 0.869 | tgattcTATTTttcgaatctat V$MEF2_03 | 6831 (-) | 1.000 | 0.794 | atgtgcTATTTactgccaactg V$MEF2_03 | 8692 (+) | 1.000 | 0.844 | agcctttctgaAAATAccatca V$MEF2_03 | 9402 (-) | 0.843 | 0.877 | tggttcTATGTctagaaaacta V$MEF2_03 | 11937 (-) | 1.000 | 0.886 | tgaaacTATTTcaagggtcagg V$MEF2_04 | 513 (-) | 0.615 | 0.674 | gtcttcTGATTttaaaaatgat V$MEF2_04 | 956 (-) | 1.000 | 0.707 | atcaaaTATTTacaggacccct V$MEF2_04 | 1158 (-) | 0.815 | 0.675 | 
tagttcTTTTTcaaagaaagga V$MEF2_04 | 1890 (-) | 1.000 | 0.669 | attttaTATTTattttattatt V$MEF2_04 | 1957 (-) | 0.815 | 0.707 | ctttccTTTTTtgagcaggttg V$MEF2_04 | 2285 (-) | 0.796 | 0.662 | ctcttcTAGTTtttgtgtgtgt V$MEF2_04 | 2399 (+) | 0.815 | 0.683 | tgctttccttaAAAAAaacaaa V$MEF2_04 | 2526 (-) | 0.796 | 0.661 | acatatTACTTttagagaataa V$MEF2_04 | 2828 (+) | 0.796 | 0.655 | cagttgtctgcAAATGgcatca V$MEF2_04 | 3085 (-) | 1.000 | 0.839 | tgattcTATTTttcgaatctat V$MEF2_04 | 3118 (-) | 0.796 | 0.659 | tgcggcTATATtaatttacatt V$MEF2_04 | 3723 (+) | 0.796 | 0.715 | cgggtgactaaAATTAttattt V$MEF2_04 | 3742 (-) | 0.796 | 0.683 | tttctaCATTTttagtcattct V$MEF2_04 | 4417 (+) | 0.815 | 0.711 | accttgtctaaAAAAAaaaaaa V$MEF2_04 | 4479 (-) | 0.815 | 0.656 | cattttTCTTTtgaacaaaaaa V$MEF2_04 | 5431 (-) | 1.000 | 0.676 | ccacttTATTTgtattgccata V$MEF2_04 | 5795 (+) | 0.815 | 0.659 | ttaaatacaaaAAAAAaaaata V$MEF2_04 | 6249 (-) | 1.000 | 0.734 | gctttcTATTTgtcccatctgc V$MEF2_04 | 6500 (+) | 1.000 | 0.692 | gatgcggccacAAATAaaagaa V$MEF2_04 | 6646 (+) | 0.818 | 0.654 | cctcagtctgcAAACAgatgaa V$MEF2_04 | 6831 (-) | 1.000 | 0.680 | atgtgcTATTTactgccaactg V$MEF2_04 | 6926 (-) | 0.796 | 0.675 | tgattgTATGTgcagtacccaa V$MEF2_04 | 7266 (+) | 0.611 | 0.689 | aaatatcctaaAATAAgaccca V$MEF2_04 | 7765 (+) | 0.818 | 0.669 | agggtctcaaaAAACAaaacaa V$MEF2_04 | 8692 (+) | 1.000 | 0.750 | agcctttctgaAAATAccatca V$MEF2_04 | 8892 (+) | 0.818 | 0.682 | aaacgtaccacAAACAgattca V$MEF2_04 | 8944 (+) | 0.796 | 0.712 | caactttttaaAATTAgtaggg V$MEF2_04 | 9402 (-) | 0.796 | 0.802 | tggttcTATGTctagaaaacta V$MEF2_04 | 9419 (+) | 1.000 | 0.671 | aactatcccatAAATAacagga V$MEF2_04 | 9483 (+) | 0.796 | 0.684 | tactttatttgCAATAgaaaat V$MEF2_04 | 9720 (-) | 0.815 | 0.694 | ttgatcTCTTTgtatgataatg V$MEF2_04 | 9890 (-) | 0.815 | 0.660 | aaattcTTTTTtaaagacttac V$MEF2_04 | 10316 (-) | 1.000 | 0.677 | tttttaTATTTttattttatag V$MEF2_04 | 10908 (+) | 0.796 | 0.657 | tactatgcaaaTAATAgtctag V$MEF2_04 | 11094 (-) | 0.818 | 0.690 | 
ttccacTGTTTaaagaacgact V$MEF2_04 | 11442 (-) | 1.000 | 0.683 | cttatcTATTTtggtgaatcag V$MEF2_04 | 11660 (-) | 0.796 | 0.657 | taacacAATTTtcataaccaaa V$MEF2_04 | 11746 (+) | 0.815 | 0.712 | caaatgactagAAAGAgttaac V$MEF2_04 | 11937 (-) | 1.000 | 0.745 | tgaaacTATTTcaagggtcagg V$AHRARNT_01 | 1423 (-) | 1.000 | 0.927 | agaCACGCaatcagtg V$AHRARNT_01 | 5815 (+) | 1.000 | 0.929 | tagccctgGCGTGgca V$AHRARNT_01 | 10713 (+) | 1.000 | 0.926 | acatctctGCGTGctg V$ARNT_01 | 431 (-) | 1.000 | 0.916 | aatacCACGTttaatc V$ARNT_01 | 732 (+) | 1.000 | 0.961 | gccaccACGTGgttgc V$ARNT_01 | 732 (-) | 1.000 | 0.975 | gccacCACGTggttgc V$ARNT_01 | 1123 (+) | 1.000 | 0.983 | gtatgcACGTGtgttc V$ARNT_01 | 1123 (-) | 1.000 | 0.974 | gtatgCACGTgtgttc V$ARNT_01 | 1613 (+) | 0.831 | 0.838 | ggggtcTCGTGgacag V$ARNT_01 | 5364 (-) | 0.831 | 0.839 | aaacaCACGGgatttc V$ARNT_01 | 6000 (+) | 1.000 | 0.883 | tggggtACGTGtccaa V$ARNT_01 | 6061 (+) | 1.000 | 0.950 | ccctgcACGTGctgcc V$ARNT_01 | 6061 (-) | 1.000 | 0.971 | ccctgCACGTgctgcc V$ARNT_01 | 6853 (-) | 1.000 | 0.883 | gtgagCACGTccagac V$ARNT_01 | 6883 (-) | 1.000 | 0.886 | agcacCACGTaagaca V$ARNT_01 | 7189 (-) | 0.831 | 0.827 | acggcCACGGgggtgc V$ARNT_01 | 7483 (-) | 1.000 | 0.890 | acgcgCACGTctgttc V$AHRARNT_02 | 1419 (-) | 1.000 | 0.804 | cgtgagaCACGCaatcagt V$AHRARNT_02 | 1573 (+) | 0.800 | 0.770 | ggggaatGCATGgagagcc V$AHRARNT_02 | 6216 (-) | 1.000 | 0.855 | ggtttctCACGCtctgcat V$AHRARNT_02 | 8101 (-) | 0.800 | 0.778 | gtcctgtCACACactctcc V$BARBIE_01 | 1170 (+) | 0.979 | 0.940 | aaagAAAGGagatgg V$BARBIE_01 | 5427 (-) | 0.966 | 0.929 | ccctccACTTTattt V$BARBIE_01 | 6243 (-) | 1.000 | 0.923 | gctacgGCTTTctat V$BARBIE_01 | 6716 (-) | 0.979 | 0.919 | ccacaaCCTTTaaag V$NKX25_01 | 288 (+) | 1.000 | 1.000 | tcAAGTG V$NKX25_01 | 346 (-) | 1.000 | 0.897 | CACTTct V$NKX25_01 | 476 (-) | 1.000 | 0.939 | CACTTac V$NKX25_01 | 628 (-) | 1.000 | 0.897 | CACTTtt V$NKX25_01 | 820 (-) | 1.000 | 0.897 | CACTTtt V$NKX25_01 | 905 (-) | 1.000 | 0.897 | CACTTtc 
V$NKX25_01 | 924 (-) | 1.000 | 0.897 | CACTTcc V$NKX25_01 | 2660 (-) | 1.000 | 0.886 | CACTTtg V$NKX25_01 | 3204 (+) | 0.891 | 0.903 | tcAAGTA V$NKX25_01 | 3222 (-) | 1.000 | 0.897 | CACTTtt V$NKX25_01 | 3253 (-) | 1.000 | 0.939 | CACTTat V$NKX25_01 | 3683 (+) | 1.000 | 0.942 | ccAAGTG V$NKX25_01 | 3940 (-) | 1.000 | 0.953 | CACTTgt V$NKX25_01 | 4694 (-) | 0.869 | 0.884 | AACTTga V$NKX25_01 | 4906 (+) | 1.000 | 0.942 | ccAAGTG V$NKX25_01 | 5172 (-) | 1.000 | 0.953 | CACTTgt V$NKX25_01 | 5432 (-) | 1.000 | 0.944 | CACTTta V$NKX25_01 | 5704 (-) | 1.000 | 0.928 | CACTTag V$NKX25_01 | 5971 (+) | 1.000 | 0.986 | ttAAGTG V$NKX25_01 | 6906 (+) | 1.000 | 0.939 | atAAGTG V$NKX25_01 | 6944 (+) | 1.000 | 0.942 | ccAAGTG V$NKX25_01 | 7839 (-) | 1.000 | 0.897 | CACTTct V$NKX25_01 | 7864 (-) | 1.000 | 0.928 | CACTTag V$NKX25_01 | 8350 (-) | 1.000 | 0.953 | CACTTgc V$NKX25_01 | 8622 (-) | 1.000 | 0.886 | CACTTtg V$NKX25_01 | 9676 (+) | 1.000 | 0.897 | agAAGTG V$NKX25_01 | 10260 (+) | 1.000 | 0.897 | ggAAGTG V$NKX25_01 | 10355 (+) | 1.000 | 0.886 | caAAGTG V$NKX25_01 | 10473 (-) | 1.000 | 0.944 | CACTTta V$NKX25_01 | 10629 (+) | 1.000 | 0.897 | gaAAGTG V$NKX25_01 | 11779 (-) | 0.891 | 0.903 | TACTTga V$PPARA_01 | 370 (+) | 0.748 | 0.685 | tttctctaggtGAGAGatgc V$PPARA_01 | 2454 (-) | 0.939 | 0.709 | agcgCTATGgcctagcaagc V$PPARA_01 | 2621 (+) | 0.625 | 0.715 | gtatgctaggcAAATGctct V$PPARA_01 | 3067 (+) | 0.899 | 0.702 | tatagctaggtCACAGggtg V$PPARA_01 | 3435 (+) | 0.773 | 0.759 | ttgacttaggtAACAGttaa V$PPARA_01 | 3525 (-) | 0.751 | 0.740 | ccacCTTGGcccgaggtttg V$PPARA_01 | 3567 (-) | 1.000 | 0.675 | tggcCTTTGgcaagttattc V$PPARA_01 | 4049 (+) | 0.564 | 0.702 | gggaaataggcAATAAttaa V$PPARA_01 | 4555 (+) | 0.899 | 0.695 | ttgatgctggaCACAGttgc V$PPARA_01 | 4803 (-) | 0.899 | 0.680 | taatCTCTGgcatttgttac V$PPARA_01 | 4896 (-) | 0.751 | 0.673 | tgttCCTTGgccaagtgcat V$PPARA_01 | 5453 (-) | 0.875 | 0.761 | taacCTTTTccccacatact V$PPARA_01 | 6813 (-) | 0.625 | 0.723 | ttacCTTCTcccttttttat 
V$PPARA_01 | 7785 (+) | 1.000 | 0.682 | aaaaattatccCAAAGataa V$PPARA_01 | 7909 (-) | 0.875 | 0.713 | ttagCTTTTacttaggtatc V$PPARA_01 | 8204 (-) | 0.899 | 0.713 | tcccCTCTGacctcagttct V$PPARA_01 | 8211 (-) | 0.649 | 0.713 | tgacCTCAGttctgatgttg V$PPARA_01 | 8303 (-) | 0.899 | 0.673 | gcccCTCTGccctgcagctt V$PPARA_01 | 8465 (+) | 0.751 | 0.688 | ctcatctaaggCAAAAataa V$PPARA_01 | 8556 (-) | 0.690 | 0.715 | tgaaCAATGtccgtctttga V$PPARA_01 | 8575 (+) | 1.000 | 0.733 | attaaatggggCAAAGgcaa V$PPARA_01 | 9986 (+) | 0.564 | 0.681 | cagaagagggtATTAGgtcc V$PPARA_01 | 10097 (-) | 0.899 | 0.709 | taacCTCTGagccatcttgc V$PPARA_01 | 10196 (-) | 0.649 | 0.680 | tcagTTCTGtccttccacag V$PPARA_01 | 11014 (+) | 0.748 | 0.728 | ttagggtaggaGAGAGatcc V$PPARA_01 | 11274 (+) | 0.939 | 0.733 | ggaaattaaggCATAGggca V$PPARA_01 | 11281 (+) | 1.000 | 0.687 | aaggcatagggCAAAGaaat V$PPARA_01 | 11937 (+) | 0.751 | 0.689 | tgaaactatttCAAGGgtca V$EGR1_01 | 2975 (+) | 0.809 | 0.778 | atgcGCAGGcat V$EGR1_01 | 3006 (+) | 0.809 | 0.894 | gtgcGCAGGcgt V$EGR1_01 | 7444 (+) | 1.000 | 0.844 | atgtGTGGGagt V$EGR1_01 | 9211 (-) | 0.945 | 0.894 | gctCCTACgcaa V$EGR1_01 | 10609 (+) | 1.000 | 0.823 | gtgtGTGGGagg V$EGR1_01 | 11867 (+) | 0.885 | 0.785 | gtccGGGGGagg V$EGR2_01 | 193 (-) | 1.000 | 0.812 | cctCCCACacca V$EGR2_01 | 7444 (+) | 1.000 | 0.862 | atgtGTGGGagt V$EGR2_01 | 8115 (-) | 1.000 | 0.784 | tctCCCACcctg V$EGR2_01 | 8231 (-) | 1.000 | 0.792 | gctCCCACaccc V$EGR2_01 | 9211 (-) | 0.970 | 0.903 | gctCCTACgcaa V$EGR2_01 | 10609 (+) | 1.000 | 0.859 | gtgtGTGGGagg V$EGR2_01 | 11015 (+) | 0.970 | 0.770 | taggGTAGGaga V$EGR2_01 | 11839 (+) | 1.000 | 0.805 | cagaGTGGGggg F$PACC_01 | 7621 (-) | 0.976 | 0.941 | ccagccgcCTGGCggta V$OCT1_07 | 588 (-) | 1.000 | 0.864 | AATTAacagact V$OCT1_07 | 994 (-) | 0.930 | 0.867 | AATTTtcatgtg V$OCT1_07 | 2521 (-) | 0.705 | 0.825 | AAATTacatatt V$OCT1_07 | 3102 (+) | 1.000 | 0.899 | tctatacTAATT V$OCT1_07 | 3110 (-) | 0.930 | 0.876 | AATTTccatgcg V$OCT1_07 | 3122 (+) | 1.000 | 
0.884 | gctatatTAATT V$OCT1_07 | 3130 (-) | 0.930 | 0.876 | AATTTacattcc V$OCT1_07 | 3836 (-) | 0.930 | 0.880 | AATTTccattct V$OCT1_07 | 4063 (-) | 1.000 | 0.888 | AATTAaaatatg V$OCT1_07 | 4069 (+) | 0.705 | 0.825 | aatatgtAAAAT V$OCT1_07 | 4448 (-) | 1.000 | 0.827 | AATTAatattca V$OCT1_07 | 9730 (+) | 0.776 | 0.866 | tgtatgaTAATG V$OCT1_07 | 9738 (-) | 0.705 | 0.835 | AATGTccataca V$OCT1_07 | 9935 (-) | 0.930 | 0.891 | AATTTgcatgta V$OCT1_07 | 10909 (+) | 0.705 | 0.828 | actatgcAAATA V$OCT1_07 | 11666 (-) | 0.930 | 0.938 | AATTTtcataac V$GFI1_01 | 237 (+) | 0.960 | 0.938 | tctttttaAATCTgggctgtaagg V$GFI1_01 | 1104 (-) | 1.000 | 0.946 | tgagaaatcgcTGATTtgtgtatg V$GFI1_01 | 1423 (+) | 1.000 | 0.923 | agacacgcAATCAgtggcaatagt V$GFI1_01 | 2414 (+) | 1.000 | 0.971 | aaacaaaaAATCAcaggggttggg V$GFI1_01 | 4634 (+) | 0.940 | 0.931 | ggtagcaaAATCCcggcaccttct V$GFI1_01 | 4715 (-) | 1.000 | 0.982 | gtaattcactgTGATTtggccctc V$GFI1_01 | 8562 (-) | 1.000 | 0.918 | atgtccgtcttTGATTaaatgggg V$GFI1_01 | 9448 (+) | 0.960 | 0.918 | ggtttaccAATCTctgctgtacag V$GFI1_01 | 9872 (+) | 1.000 | 0.917 | aatgaggaAATCAatttaaaattc V$GFI1_01 | 10254 (-) | 1.000 | 0.941 | tgtgcaggaagTGATTtttgccgg V$GFI1_01 | 10808 (+) | 0.939 | 0.932 | tgcatctaAATCGcagtccgcctt V$GFI1_01 | 11450 (+) | 1.000 | 0.923 | ttttggtgAATCAgtgaaattttt V$GFI1_01 | 11690 (+) | 1.000 | 0.942 | ataaagttAATCActgtcacccca V$GFI1_01 | 11788 (+) | 0.940 | 0.922 | taaggacaAATCCcagattggaag V$TATA_01 | 496 (-) | 0.936 | 0.916 | aacattatTATATat V$TATA_01 | 503 (+) | 0.936 | 0.896 | ttATATAtttgtctt V$TATA_01 | 636 (-) | 1.000 | 0.870 | tttaaacaTTTATtc V$TATA_01 | 644 (-) | 1.000 | 0.864 | tttattcaTTTATta V$TATA_01 | 649 (-) | 0.936 | 0.917 | tcatttatTATATat V$TATA_01 | 651 (-) | 0.936 | 0.902 | atttattaTATATaa V$TATA_01 | 656 (+) | 0.936 | 0.907 | ttATATAtaagtaca V$TATA_01 | 658 (+) | 0.936 | 0.929 | atATATAagtacact V$TATA_01 | 825 (-) | 0.936 | 0.874 | tttaatacTATATaa V$TATA_01 | 832 (+) | 0.936 | 0.938 | ctATATAaaacaagt V$TATA_01 | 834 
(+) | 1.000 | 0.971 | atATAAAacaagtta V$TATA_01 | 848 (+) | 1.000 | 0.875 | aaATAAAtacgatcg V$TATA_01 | 1744 (+) | 0.884 | 0.884 | ctATAGAacactgat V$TATA_01 | 1884 (-) | 1.000 | 0.980 | cctcccatTTTATat V$TATA_01 | 1890 (-) | 1.000 | 0.866 | attttataTTTATtt V$TATA_01 | 1895 (-) | 1.000 | 0.873 | atatttatTTTATta V$TATA_01 | 1903 (-) | 1.000 | 0.875 | tttattatTTTATtt V$TATA_01 | 2005 (+) | 0.884 | 0.865 | ctATACAtctcctga V$TATA_01 | 2357 (-) | 1.000 | 0.872 | aacttgtaTTTATtt V$TATA_01 | 2731 (-) | 1.000 | 0.875 | cttctgccTTTATgc V$TATA_01 | 2895 (-) | 1.000 | 0.899 | gcatctttTTTATcc V$TATA_01 | 3094 (-) | 0.884 | 0.866 | ttttcgaaTCTATac V$TATA_01 | 3405 (+) | 1.000 | 0.976 | ttATAAAagttaatg V$TATA_01 | 3452 (+) | 1.000 | 0.882 | taATAAAacacaggg V$TATA_01 | 3618 (-) | 1.000 | 0.938 | caaatgcgTTTATaa V$TATA_01 | 3831 (+) | 1.000 | 0.858 | taATAAAtttccatt V$TATA_01 | 3847 (+) | 1.000 | 0.883 | tcATAAAatatccag V$TATA_01 | 4004 (+) | 1.000 | 0.873 | cgATAAAtaaatatt V$TATA_01 | 4008 (+) | 1.000 | 0.866 | aaATAAAtattctgc V$TATA_01 | 4225 (+) | 1.000 | 0.861 | aaATAAAttctcagg V$TATA_01 | 4316 (+) | 1.000 | 0.898 | gcATAAAactgggtc V$TATA_01 | 4781 (+) | 1.000 | 0.859 | ggATAAAgattgctg V$TATA_01 | 5379 (-) | 1.000 | 0.878 | cttttagtTTTATtt V$TATA_01 | 5390 (-) | 1.000 | 0.944 | attttaccTTTATat V$TATA_01 | 5590 (+) | 1.000 | 0.871 | taATAAAatataata V$TATA_01 | 5635 (+) | 1.000 | 0.863 | ggATAAAgacaaaat V$TATA_01 | 5646 (+) | 1.000 | 0.893 | aaATAAAaagaagaa V$TATA_01 | 5946 (-) | 1.000 | 0.872 | agtaaaatTTTATtt V$TATA_01 | 6467 (+) | 1.000 | 0.884 | tgATAAAatgctcag V$TATA_01 | 6511 (+) | 1.000 | 0.890 | aaATAAAagaacacg V$TATA_01 | 6663 (-) | 1.000 | 0.877 | atgaagatTTTATtt V$TATA_01 | 6754 (-) | 1.000 | 0.957 | actatcccTTTATac V$TATA_01 | 6820 (-) | 1.000 | 0.902 | ctccctttTTTATgt V$TATA_01 | 7314 (+) | 0.884 | 0.877 | atATAGAatgtacgt V$TATA_01 | 7428 (-) | 1.000 | 0.892 | tatttgttTTTATtt V$TATA_01 | 7433 (-) | 1.000 | 0.886 | gtttttatTTTATgt V$TATA_01 | 7889 (-) | 1.000 | 0.895 
| atttggttTTTATtt V$TATA_01 | 8380 (-) | 0.884 | 0.859 | ctttgatcTGTATac V$TATA_01 | 8931 (+) | 1.000 | 0.933 | gtATAAActaaagca V$TATA_01 | 9426 (+) | 1.000 | 0.889 | ccATAAAtaacagga V$TATA_01 | 10123 (+) | 1.000 | 0.883 | caATAAAattagtta V$TATA_01 | 10310 (-) | 1.000 | 0.985 | taactgttTTTATat V$TATA_01 | 10318 (-) | 1.000 | 0.886 | tttatattTTTATtt V$TATA_01 | 10323 (-) | 1.000 | 0.975 | atttttatTTTATag V$TATA_01 | 10332 (-) | 1.000 | 0.894 | ttatagttTTTATtg V$TATA_01 | 10391 (+) | 1.000 | 0.984 | atATAAAaatgaacg V$TATA_01 | 10445 (-) | 1.000 | 0.985 | acgtctctTTTATat V$TATA_01 | 10697 (-) | 1.000 | 0.860 | cacccgtgTTTATga V$TATA_01 | 10726 (-) | 1.000 | 0.897 | ctgtagctTTTATtg V$TATA_01 | 11401 (-) | 1.000 | 0.881 | tgtacgcaTTTATtc V$TATA_01 | 11464 (-) | 1.000 | 0.884 | tgaaatttTTTATtt V$CAAT_01 | 108 (+) | 0.950 | 0.927 | ggtggCCAGTga V$CAAT_01 | 1022 (-) | 1.000 | 0.938 | tgATTGGcagtt V$CAAT_01 | 5088 (-) | 0.950 | 0.938 | tgACTGGtttgg V$CAAT_01 | 6439 (+) | 0.950 | 0.923 | tcaacCCAGTca V$CAAT_01 | 11682 (+) | 1.000 | 0.938 | aaagaCCAATaa V$CAAT_01 | 11894 (+) | 1.000 | 0.923 | ctcaaCCAATag V$CAAT_01 | 11929 (-) | 1.000 | 0.950 | ccATTGGctgaa V$GC_01 | 1487 (+) | 0.956 | 0.907 | agagGGAGGggggg V$GC_01 | 1491 (+) | 0.935 | 0.884 | ggagGGGGGgggga V$GC_01 | 1492 (+) | 0.935 | 0.906 | gaggGGGGGgggag V$GC_01 | 1493 (+) | 0.935 | 0.903 | agggGGGGGggagg V$GC_01 | 1495 (+) | 0.935 | 0.876 | ggggGGGGGaggga V$GC_01 | 1498 (+) | 0.956 | 0.879 | ggggGGAGGgaggg V$GC_01 | 1870 (+) | 0.956 | 0.876 | gaagGGAGGatctg V$GC_01 | 3037 (-) | 0.954 | 0.896 | ccgctCCACCcggg V$GC_01 | 5893 (-) | 1.000 | 0.939 | ggggcCCGCCccta V$GC_01 | 9617 (+) | 0.956 | 0.874 | ggtgGGAGGatcgg V$GC_01 | 9683 (+) | 0.935 | 0.870 | gaggGGGGGtggtg V$GC_01 | 9686 (+) | 0.954 | 0.885 | ggggGGTGGtggtg V$GC_01 | 11845 (+) | 0.954 | 0.931 | ggggGGTGGggtat V$GC_01 | 11870 (+) | 0.956 | 0.919 | cgggGGAGGgactt V$HLF_01 | 3439 (-) | 1.000 | 0.896 | cttagGTAAC V$HLF_01 | 6575 (+) | 0.892 | 0.885 | GTTATgcaaa 
V$HLF_01 | 8008 (-) | 1.000 | 0.893 | ttggtGTAAC V$OLF1_01 | 1327 (+) | 1.000 | 0.882 | tctcacTCCCCagggcttaagg V$STAF_01 | 6631 (+) | 1.000 | 0.833 | ggcACCCAgaaggcacctcagt V$STAF_02 | 5502 (+) | 1.000 | 0.897 | tgcTCCCAggtttccctgcct I$CROC_01 | 846 (+) | 1.000 | 0.952 | ttaaaTAAATacgatc I$CROC_01 | 1891 (-) | 1.000 | 0.991 | ttttatATTTAtttta I$CROC_01 | 2358 (-) | 1.000 | 0.966 | acttgtATTTAttttg I$CROC_01 | 4002 (+) | 1.000 | 0.949 | aacgaTAAATaaatat I$CROC_01 | 4006 (+) | 1.000 | 0.983 | ataaaTAAATattctg I$CROC_01 | 9424 (+) | 1.000 | 0.928 | tcccaTAAATaacagg V$XFD1_01 | 709 (-) | 1.000 | 0.864 | gatctcTTTACaga V$XFD1_01 | 843 (+) | 0.917 | 0.874 | aagTTAAAtaaata V$XFD1_01 | 847 (+) | 0.959 | 0.920 | taaATAAAtacgat V$XFD1_01 | 958 (-) | 1.000 | 0.930 | caaataTTTACagg V$XFD1_01 | 1892 (-) | 0.959 | 0.914 | tttataTTTATttt V$XFD1_01 | 2359 (-) | 0.959 | 0.914 | cttgtaTTTATttt V$XFD1_01 | 2572 (-) | 1.000 | 0.906 | taagtaTTTACaac V$XFD1_01 | 3713 (-) | 1.000 | 0.923 | tcaataTTTACggg V$XFD1_01 | 4003 (+) | 0.959 | 0.864 | acgATAAAtaaata V$XFD1_01 | 4007 (+) | 0.959 | 0.927 | taaATAAAtattct V$XFD1_01 | 4780 (+) | 0.959 | 0.859 | tggATAAAgattgc V$XFD1_01 | 5792 (+) | 0.917 | 0.887 | taaTTAAAtacaaa V$XFD1_01 | 6669 (-) | 0.917 | 0.894 | attttaTTTAGctt V$XFD1_01 | 6822 (-) | 0.959 | 0.880 | ccctttTTTATgtg V$XFD1_01 | 6833 (-) | 1.000 | 0.890 | gtgctaTTTACtgc V$XFD1_01 | 8634 (-) | 0.917 | 0.859 | agcttaTTTAAgaa V$XFD1_01 | 9425 (+) | 0.959 | 0.883 | cccATAAAtaacag V$XFD1_01 | 10302 (-) | 0.917 | 0.867 | acatttTTTAActg V$XFD1_01 | 10312 (-) | 0.959 | 0.880 | actgttTTTATatt V$XFD1_01 | 10390 (+) | 0.959 | 0.874 | catATAAAaatgaa V$XFD1_01 | 10699 (-) | 0.959 | 0.859 | cccgtgTTTATgaa V$XFD1_01 | 11420 (-) | 0.917 | 0.867 | tttctaTTTAGcta V$XFD1_01 | 11466 (-) | 0.959 | 0.894 | aaatttTTTATttg V$XFD2_01 | 77 (-) | 0.781 | 0.839 | ttgatgTTGATagg V$XFD2_01 | 603 (+) | 0.877 | 0.898 | tttTTAAAcatcct V$XFD2_01 | 634 (+) | 0.877 | 0.911 | tttTTAAAcattta V$XFD2_01 | 847 (+) | 1.000 
| 0.868 | taaATAAAtacgat V$XFD2_01 | 958 (-) | 0.901 | 0.848 | caaataTTTACagg V$XFD2_01 | 1892 (-) | 1.000 | 0.908 | tttataTTTATttt V$XFD2_01 | 2359 (-) | 1.000 | 0.882 | cttgtaTTTATttt V$XFD2_01 | 2897 (-) | 1.000 | 0.837 | atctttTTTATcca V$XFD2_01 | 4003 (+) | 1.000 | 0.874 | acgATAAAtaaata V$XFD2_01 | 4007 (+) | 1.000 | 0.889 | taaATAAAtattct V$XFD2_01 | 4780 (+) | 1.000 | 0.839 | tggATAAAgattgc V$XFD2_01 | 5645 (+) | 1.000 | 0.872 | aaaATAAAaagaag V$XFD2_01 | 6822 (-) | 1.000 | 0.839 | ccctttTTTATgtg V$XFD2_01 | 7428 (-) | 0.781 | 0.848 | tatttgTTTTTatt V$XFD2_01 | 7430 (-) | 1.000 | 0.872 | tttgttTTTATttt V$XFD2_01 | 7798 (+) | 1.000 | 0.858 | aagATAAAgaacaa V$XFD2_01 | 7891 (-) | 1.000 | 0.871 | ttggttTTTATttt V$XFD2_01 | 7937 (-) | 0.877 | 0.873 | tgtgtgTTTAGttt V$XFD2_01 | 8930 (+) | 1.000 | 0.840 | agtATAAActaaag V$XFD2_01 | 9425 (+) | 1.000 | 0.854 | cccATAAAtaacag V$XFD2_01 | 10312 (-) | 1.000 | 0.886 | actgttTTTATatt V$XFD2_01 | 10320 (-) | 1.000 | 0.892 | tatattTTTATttt V$XFD2_01 | 10334 (-) | 1.000 | 0.858 | atagttTTTATtgt V$XFD2_01 | 10390 (+) | 1.000 | 0.904 | catATAAAaatgaa V$XFD2_01 | 10699 (-) | 1.000 | 0.914 | cccgtgTTTATgaa V$XFD2_01 | 11096 (-) | 0.877 | 0.892 | ccactgTTTAAaga V$XFD2_01 | 11466 (-) | 1.000 | 0.859 | aaatttTTTATttg V$XFD3_01 | 30 (+) | 0.917 | 0.913 | tgtGTGAAcaactt V$XFD3_01 | 482 (+) | 0.959 | 0.852 | caaGTCAAcattgc V$XFD3_01 | 1843 (+) | 0.959 | 0.845 | gtcGTCAAcacaat V$XFD3_01 | 4215 (+) | 0.917 | 0.861 | tcaGTTAAaaaaat V$XFD3_01 | 4534 (+) | 0.959 | 0.881 | tgcGTCAAcagcct V$XFD3_01 | 10131 (+) | 0.917 | 0.849 | ttaGTTAAgaaatt V$XFD3_01 | 10660 (+) | 0.959 | 0.858 | aatGTCAAcaggag V$XFD3_01 | 11051 (-) | 0.917 | 0.870 | cctttgTTAACgtt V$XFD3_01 | 11653 (-) | 0.917 | 0.869 | gtactgTTAACaca V$XFD3_01 | 11655 (+) | 0.917 | 0.858 | actGTTAAcacaat V$XFD3_01 | 11759 (+) | 0.917 | 0.850 | agaGTTAAcaggat I$GCM_01 | 8442 (+) | 1.000 | 0.995 | caacCCGCAtcct V$AML1_01 | 936 (-) | 1.000 | 1.000 | ACCACa V$AML1_01 | 3641 (-) | 1.000 | 
1.000 | ACCACa V$AML1_01 | 4145 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 4441 (-) | 1.000 | 1.000 | ACCACa V$AML1_01 | 4572 (+) | 0.949 | 0.956 | tGCGGT V$AML1_01 | 5049 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 5136 (+) | 0.949 | 0.956 | tGCGGT V$AML1_01 | 5256 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 7112 (-) | 1.000 | 1.000 | ACCACa V$AML1_01 | 8742 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 8898 (-) | 1.000 | 1.000 | ACCACa V$AML1_01 | 8975 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 9400 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 9575 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 10955 (+) | 0.949 | 0.956 | tGCGGT V$AML1_01 | 11256 (+) | 1.000 | 1.000 | tGTGGT V$AML1_01 | 11494 (+) | 0.949 | 0.956 | tGCGGT V$P53_02 | 1346 (+) | 0.982 | 0.969 | aggCTTGCct V$P53_02 | 4206 (+) | 0.982 | 0.969 | aggCTTGCct V$P53_02 | 4401 (-) | 0.965 | 0.972 | agACAGGtca F$MAT1MC_01 | 1034 (-) | 1.000 | 0.886 | gaatcaaCAAAGcagcaa F$MAT1MC_01 | 7802 (-) | 0.800 | 0.867 | taaagaaCAACGacaact F$MAT1MC_01 | 11046 (+) | 1.000 | 0.885 | tggttcCTTTGttaacgt F$MAT1MC_02 | 1850 (-) | 1.000 | 0.960 | acACAATaga F$MAT1MC_02 | 7332 (-) | 1.000 | 0.975 | gaACAATaga F$MAT1MC_02 | 10340 (+) | 1.000 | 0.957 | tttATTGTtt V$LMO2COM_01 | 1785 (-) | 1.000 | 0.981 | gagcACCTGcca V$LMO2COM_01 | 4338 (+) | 1.000 | 0.975 | atcCAGGTggtg V$LMO2COM_01 | 7390 (+) | 1.000 | 0.973 | gtcCAGGTggta V$LMO2COM_01 | 8503 (+) | 1.000 | 0.986 | ggaCAGGTgcgg V$LMO2COM_01 | 8767 (-) | 1.000 | 0.985 | cagcACCTGgga V$LMO2COM_01 | 8968 (+) | 1.000 | 0.973 | aacCAGGTgtgg V$LMO2COM_01 | 11960 (-) | 1.000 | 0.986 | ctgcACCTGagc V$LMO2COM_02 | 83 (+) | 0.985 | 0.980 | tTGATAggg V$LMO2COM_02 | 7152 (-) | 1.000 | 0.993 | ccgTATCTc V$RFX1_01 | 484 (+) | 1.000 | 0.876 | agtcaacattGCAACat V$RFX1_01 | 741 (-) | 1.000 | 0.879 | tgGTTGCtgggaattga V$RFX1_01 | 4816 (-) | 0.992 | 0.917 | ttGTTACcaggcggggt V$RFX1_01 | 4875 (+) | 0.982 | 0.881 | gcaaagctctGGAACat V$RFX1_01 | 4895 (-) | 0.982 | 0.911 | atGTTCCttggccaagt V$RFX1_01 | 5224 (+) | 0.982 | 
0.899 | gcggagactgGGAACga V$RFX1_01 | 5509 (-) | 0.982 | 0.898 | agGTTTCcctgcctttc V$RFX1_01 | 5825 (+) | 0.982 | 0.916 | gtggcactgtGAAACaa V$RFX1_01 | 5861 (-) | 0.982 | 0.875 | tgGTTTCccatggacca V$RFX1_01 | 6967 (+) | 0.982 | 0.869 | tggatggaaaGGAACga V$RFX1_01 | 8069 (+) | 0.982 | 0.867 | tggtattgcaGAAACtt V$RFX1_01 | 8431 (+) | 1.000 | 0.905 | ctagagctttGCAACcc V$RFX1_01 | 8581 (+) | 1.000 | 0.868 | tggggcaaagGCAACat V$RFX1_01 | 9160 (-) | 0.982 | 0.904 | gaGTTCCtaagatgcat V$RFX1_01 | 9509 (-) | 0.982 | 0.870 | aaGTTTCcccatgtcca V$RFX1_01 | 9564 (-) | 0.982 | 0.883 | cgGTTTCtgggtgtggt V$RFX1_01 | 10432 (+) | 1.000 | 0.908 | tcttcgcaatGCAACgt V$RFX1_01 | 10530 (+) | 1.000 | 0.925 | aggaggccgtGCAACtc V$RFX1_01 | 10683 (+) | 1.000 | 0.877 | caggactctgGCAACac V$RFX1_01 | 10743 (-) | 0.982 | 0.889 | ggGTTTCagtgatccgt V$RFX1_01 | 11046 (-) | 0.982 | 0.905 | tgGTTCCtttgttaacg V$RFX1_02 | 163 (-) | 0.982 | 0.898 | ccGTTCCcaatgctggca V$RFX1_02 | 483 (+) | 1.000 | 0.903 | aagtcaacattGCAACat V$RFX1_02 | 860 (-) | 1.000 | 0.823 | tcGTTGCccttagaaatg V$RFX1_02 | 936 (+) | 0.982 | 0.823 | accacaagttaGGAACaa V$RFX1_02 | 1133 (-) | 0.982 | 0.828 | gtGTTCCtaaaaccgtta V$RFX1_02 | 1145 (-) | 0.992 | 0.825 | ccGTTACtttatgtagtt V$RFX1_02 | 1220 (-) | 0.982 | 0.833 | gaGTTTCtgatgactgct V$RFX1_02 | 2395 (-) | 1.000 | 0.838 | gaGTTGCtttccttaaaa V$RFX1_02 | 2549 (-) | 0.982 | 0.891 | agGTTTCtgttgtgaaat V$RFX1_02 | 2681 (-) | 1.000 | 0.824 | ccGTTGCatcctggtcac V$RFX1_02 | 3424 (-) | 1.000 | 0.896 | tgGTTGCttaattgactt V$RFX1_02 | 3433 (+) | 0.992 | 0.901 | aattgacttagGTAACag V$RFX1_02 | 3602 (-) | 0.982 | 0.870 | ggGTTTCtgcagttttca V$RFX1_02 | 3652 (+) | 0.992 | 0.849 | gaatcagtaaaGTAACac V$RFX1_02 | 3986 (+) | 0.982 | 0.834 | caccatctcatGAAACaa V$RFX1_02 | 4824 (+) | 0.982 | 0.860 | aggcggggtagGGAACga V$RFX1_02 | 4895 (-) | 0.982 | 0.845 | atGTTCCttggccaagtg V$RFX1_02 | 5114 (+) | 0.982 | 0.840 | gtgctcaaaagGAAACag V$RFX1_02 | 5223 (+) | 0.982 | 0.829 | agcggagactgGGAACga V$RFX1_02 | 
5509 (-) | 0.982 | 0.829 | agGTTTCcctgcctttcc V$RFX1_02 | 5602 (+) | 0.982 | 0.905 | atataacacaaGAAACca V$RFX1_02 | 5861 (-) | 0.982 | 0.897 | tgGTTTCccatggaccat V$RFX1_02 | 5915 (-) | 0.982 | 0.909 | ttGTTTCcccagcgagac V$RFX1_02 | 6088 (-) | 0.982 | 0.819 | gaGTTCCtgtgtggatca V$RFX1_02 | 6177 (-) | 0.982 | 0.934 | ggGTTCCcagagacaccc V$RFX1_02 | 6215 (-) | 0.982 | 0.883 | agGTTTCtcacgctctgc V$RFX1_02 | 6564 (-) | 1.000 | 0.940 | tgGTTGCcacagttatgc V$RFX1_02 | 6966 (+) | 0.982 | 0.838 | ctggatggaaaGGAACga V$RFX1_02 | 7017 (-) | 0.982 | 0.844 | atGTTTCctgcatcagct V$RFX1_02 | 7170 (-) | 1.000 | 0.887 | gcGTTGCagatggtcccg V$RFX1_02 | 7320 (+) | 0.982 | 0.850 | aatgtacgtctGGAACaa V$RFX1_02 | 7529 (-) | 0.992 | 0.841 | aaGTTACagttggttgtg V$RFX1_02 | 7556 (+) | 0.982 | 0.820 | tgtagatgctgGAAACca V$RFX1_02 | 8068 (+) | 0.982 | 0.854 | ctggtattgcaGAAACtt V$RFX1_02 | 8199 (-) | 0.982 | 0.873 | ctGTTTCccctctgacct V$RFX1_02 | 8580 (+) | 1.000 | 0.955 | atggggcaaagGCAACat V$RFX1_02 | 8932 (+) | 1.000 | 0.883 | tataaactaaaGCAACtt V$RFX1_02 | 9509 (-) | 0.982 | 0.836 | aaGTTTCcccatgtccaa V$RFX1_02 | 9804 (+) | 0.992 | 0.908 | acatcacataaGTAACtt V$RFX1_02 | 9970 (-) | 0.982 | 0.904 | tgGTTCCtatggggatca V$RFX1_02 | 10379 (-) | 1.000 | 0.831 | atGTTGCaatccatataa V$RFX1_02 | 10529 (+) | 1.000 | 0.853 | taggaggccgtGCAACtc V$RFX1_02 | 10682 (+) | 1.000 | 0.912 | gcaggactctgGCAACac V$RFX1_02 | 11046 (-) | 0.982 | 0.891 | tgGTTCCtttgttaacgt V$RFX1_02 | 11070 (-) | 0.992 | 0.827 | caGTTACaagctgctctg V$RFX1_02 | 11190 (-) | 0.982 | 0.845 | ttGTTTCttgattgatta I$ZESTE_Q2 | 2012 (+) | 1.000 | 0.946 | tctcctGAGTGctggg I$ZESTE_Q2 | 5106 (+) | 1.000 | 0.946 | atttgtGAGTGctcaa I$ZESTE_Q2 | 6619 (+) | 1.000 | 0.934 | ctccgtGAGTGgggca I$ZESTE_Q2 | 7078 (+) | 1.000 | 0.952 | gtgtatGAGTGctttg I$ZESTE_Q2 | 7468 (+) | 1.000 | 0.994 | ggatttGAGTGtgcta I$ZESTE_Q2 | 8019 (+) | 1.000 | 0.947 | tttagtGAGTGtcaga I$ZESTE_Q2 | 8512 (-) | 1.000 | 0.937 | cgggaCACTCacagat I$ZESTE_Q2 | 9187 (-) | 1.000 | 0.989 | 
gagacCACTCaagaga I$ZESTE_Q2 | 11131 (-) | 1.000 | 0.950 | cgctgCACTCagtttt V$TCF11MAFG_01 | 125 (-) | 0.941 | 0.926 | gctaactgctcaGTCAAtccag V$TCF11MAFG_01 | 525 (-) | 0.741 | 0.740 | taaaaatgatggGTGATgcgct V$TCF11MAFG_01 | 580 (+) | 0.784 | 0.730 | acctgTTGAAttaacagactta V$TCF11MAFG_01 | 723 (+) | 0.833 | 0.764 | tggttGTGAGccaccacgtggt V$TCF11MAFG_01 | 790 (-) | 0.741 | 0.753 | cttaaccgctgaGCCATttctc V$TCF11MAFG_01 | 1027 (+) | 0.784 | 0.760 | ggcagTTGAAtcaacaaagcag V$TCF11MAFG_01 | 1228 (-) | 0.741 | 0.779 | gatgactgctttGACATcaggg V$TCF11MAFG_01 | 2325 (-) | 0.704 | 0.779 | cagtcttgctatGTGACcctac V$TCF11MAFG_01 | 2455 (+) | 0.741 | 0.764 | gcgctATGGCctagcaagcgcg V$TCF11MAFG_01 | 2813 (-) | 0.682 | 0.728 | acaccgtgctctGTCCAgttgt V$TCF11MAFG_01 | 2970 (-) | 0.741 | 0.747 | ggcagatgcgcaGGCATctact V$TCF11MAFG_01 | 3064 (-) | 0.963 | 0.792 | tggtatagctagGTCACagggt V$TCF11MAFG_01 | 3366 (-) | 0.844 | 0.738 | ctgaatagcctaATCATatggg V$TCF11MAFG_01 | 3480 (+) | 0.870 | 0.744 | tgcctATGAGgcagaatagctt V$TCF11MAFG_01 | 3700 (-) | 0.784 | 0.817 | aagagatgcttgATCAAtattt V$TCF11MAFG_01 | 3744 (-) | 1.000 | 0.771 | tctacatttttaGTCATtctta V$TCF11MAFG_01 | 3834 (-) | 0.870 | 0.719 | taaatttccattCTCATaaaat V$TCF11MAFG_01 | 3926 (-) | 0.963 | 0.755 | attacattatcaGTCACttgtt V$TCF11MAFG_01 | 3938 (-) | 0.777 | 0.758 | gtcacttgttcaTTCAGccatc V$TCF11MAFG_01 | 4190 (-) | 0.784 | 0.728 | gaaggctgcaagTTCAAggctt V$TCF11MAFG_01 | 4462 (+) | 0.933 | 0.816 | tcacaCTGACtgaccagcattt V$TCF11MAFG_01 | 4843 (+) | 0.803 | 0.715 | tgatcCTGAGacttcactgagg V$TCF11MAFG_01 | 5307 (-) | 0.844 | 0.774 | caaaatggcttcTTCATcggtt V$TCF11MAFG_01 | 5353 (+) | 0.941 | 0.812 | cctctTTGACaaaacacacggg V$TCF11MAFG_01 | 6289 (+) | 1.000 | 0.911 | tgatgATGACtgagcagggcac V$TCF11MAFG_01 | 6975 (+) | 0.741 | 0.718 | aaggaACGACttagctggtgag V$TCF11MAFG_01 | 7600 (-) | 0.741 | 0.847 | ctgaactgctgaGCCATctctc V$TCF11MAFG_01 | 7666 (-) | 0.833 | 0.760 | caggcatggtggCTCACagctg V$TCF11MAFG_01 | 8265 (+) | 0.777 | 0.763 | 
ttgacCTGATgatgcagctgcc V$TCF11MAFG_01 | 8268 (+) | 0.844 | 0.741 | acctgATGATgcagctgccctg V$TCF11MAFG_01 | 8315 (-) | 0.933 | 0.795 | tgcagcttctgcGTCAGcgcca V$TCF11MAFG_01 | 9313 (-) | 0.933 | 0.733 | gggaggcgttggGTCAGagcct V$TCF11MAFG_01 | 9543 (-) | 0.704 | 0.718 | aagtcctgcgtaGTCGCctgtc V$TCF11MAFG_01 | 9628 (-) | 0.933 | 0.733 | cgggagttcaagGTCAGcttgg V$TCF11MAFG_01 | 9836 (-) | 0.933 | 0.716 | gataattttaaaGTCAGgcagg V$TCF11MAFG_01 | 10027 (-) | 0.963 | 0.779 | cgtagtttgtgaGTCACcgtgt V$TCF11MAFG_01 | 10628 (+) | 0.963 | 0.806 | agaaaGTGACtgaggaagacac V$TCF11MAFG_01 | 11156 (+) | 0.844 | 0.792 | tgaaaATGAAgctgcagagatc V$TCF11MAFG_01 | 11327 (+) | 1.000 | 0.790 | tgatgATGACgatgtaaatgcc V$TCF11MAFG_01 | 11447 (-) | 0.777 | 0.762 | ctattttggtgaATCAGtgaaa V$TCF11_01 | 49 (+) | 1.000 | 0.986 | GTCATtacttggg V$TCF11_01 | 464 (-) | 0.887 | 0.883 | tggcttatTTGAC V$TCF11_01 | 485 (+) | 0.887 | 0.887 | GTCAAcattgcaa V$TCF11_01 | 522 (-) | 0.887 | 0.887 | ttttaaaaATGAT V$TCF11_01 | 616 (-) | 1.000 | 0.992 | tgtcgtacATGAC V$TCF11_01 | 945 (-) | 0.887 | 0.887 | taggaacaATGAT V$TCF11_01 | 1082 (+) | 1.000 | 0.999 | GTCATtttggtca V$TCF11_01 | 1091 (+) | 0.887 | 0.882 | GTCAActaaaatt V$TCF11_01 | 1221 (-) | 1.000 | 0.983 | agtttctgATGAC V$TCF11_01 | 1259 (+) | 1.000 | 0.995 | GTCATcatccgtg V$TCF11_01 | 1275 (-) | 1.000 | 0.985 | actctgggATGAC V$TCF11_01 | 1358 (+) | 0.905 | 0.892 | GTCACctgatgct V$TCF11_01 | 1450 (+) | 0.905 | 0.902 | GTCACtgagccgg V$TCF11_01 | 1729 (+) | 0.883 | 0.882 | CTCATcctgaact V$TCF11_01 | 2329 (-) | 0.905 | 0.902 | cttgctatGTGAC V$TCF11_01 | 2659 (-) | 1.000 | 0.995 | gcactttgATGAC V$TCF11_01 | 2694 (+) | 0.905 | 0.902 | GTCACactcttct V$TCF11_01 | 2957 (-) | 0.905 | 0.906 | cgaatacaGTGAC V$TCF11_01 | 3076 (+) | 0.905 | 0.891 | GTCACagggtgat V$TCF11_01 | 3273 (-) | 0.887 | 0.888 | atgccattATGAT V$TCF11_01 | 3427 (-) | 0.887 | 0.884 | ttgcttaaTTGAC V$TCF11_01 | 3468 (+) | 0.883 | 0.883 | CTCATgatttact V$TCF11_01 | 3718 (-) | 0.905 | 0.891 | atttacggGTGAC 
V$TCF11_01 | 3756 (+) | 1.000 | 0.997 | GTCATtcttaaaa V$TCF11_01 | 3882 (-) | 0.887 | 0.883 | caagctccTTGAC V$TCF11_01 | 3938 (+) | 0.905 | 0.893 | GTCACttgttcat V$TCF11_01 | 3970 (-) | 0.887 | 0.885 | ctggctgtATGAT V$TCF11_01 | 4293 (-) | 0.887 | 0.886 | ccctaaatTTGAC V$TCF11_01 | 4325 (-) | 0.905 | 0.891 | tgggtctgGTGAC V$TCF11_01 | 4932 (+) | 0.887 | 0.883 | ATCATggagactg V$TCF11_01 | 5140 (+) | 0.887 | 0.887 | GTCAAcctaggca V$TCF11_01 | 5169 (-) | 0.905 | 0.892 | tttcacttGTGAC V$TCF11_01 | 5260 (+) | 0.905 | 0.900 | GTCACagtcatcc V$TCF11_01 | 5266 (+) | 1.000 | 0.999 | GTCATcctgggcc V$TCF11_01 | 5350 (-) | 0.887 | 0.883 | tcccctctTTGAC V$TCF11_01 | 6046 (-) | 0.905 | 0.892 | gcagccctGTGAC V$TCF11_01 | 6286 (-) | 1.000 | 0.994 | ctctgatgATGAC V$TCF11_01 | 6408 (-) | 1.000 | 0.998 | cgctcagaATGAC V$TCF11_01 | 6447 (+) | 1.000 | 0.983 | GTCATggcaaact V$TCF11_01 | 6560 (+) | 1.000 | 0.997 | GTCATggttgcca V$TCF11_01 | 7037 (-) | 0.887 | 0.883 | agagttagTTGAC V$TCF11_01 | 7886 (+) | 1.000 | 0.986 | GTCATttggtttt V$TCF11_01 | 8106 (+) | 0.905 | 0.899 | GTCACacactctc V$TCF11_01 | 9293 (+) | 0.882 | 0.884 | GTCAGcttgtgtg V$TCF11_01 | 9640 (+) | 0.882 | 0.883 | GTCAGcttgggct V$TCF11_01 | 9725 (-) | 0.887 | 0.883 | ctctttgtATGAT V$TCF11_01 | 9770 (+) | 1.000 | 0.985 | GTCATgccataga V$TCF11_01 | 9865 (-) | 0.883 | 0.883 | aaagaagaATGAG V$TCF11_01 | 10039 (+) | 0.905 | 0.906 | GTCACcgtgtggg V$TCF11_01 | 10246 (+) | 0.882 | 0.883 | GTCAGgcttgtgc V$TCF11_01 | 10506 (+) | 0.887 | 0.884 | GTCAAttatggct V$TCF11_01 | 10625 (-) | 0.905 | 0.903 | tgtagaaaGTGAC V$TCF11_01 | 11309 (-) | 0.905 | 0.906 | cgactacgGTGAC V$TCF11_01 | 11318 (-) | 0.887 | 0.884 | tgacgacgATGAT V$TCF11_01 | 11321 (-) | 0.887 | 0.886 | cgacgatgATGAT V$TCF11_01 | 11324 (-) | 1.000 | 0.994 | cgatgatgATGAC V$TCF11_01 | 11357 (-) | 0.887 | 0.885 | acgagagcATGAT V$TCF11_01 | 11705 (+) | 0.905 | 0.892 | GTCACcccaactg V$TCF11_01 | 11741 (-) | 1.000 | 0.987 | ctcaccaaATGAC V$NFY_01 | 1018 (-) | 1.000 | 0.959 | 
tatgtgATTGGcagtt V$NFY_01 | 3278 (-) | 1.000 | 0.934 | attatgATTGGgatga V$NFY_01 | 11682 (+) | 1.000 | 0.945 | aaagaCCAATaaagtt V$NFY_01 | 11894 (+) | 1.000 | 0.941 | ctcaaCCAATagcacg V$NFY_01 | 11925 (-) | 1.000 | 0.974 | acgcccATTGGctgaa F$HAP234_01 | 1018 (-) | 1.000 | 0.956 | tatgtgATTGGcagtt F$HAP234_01 | 10731 (-) | 1.000 | 0.975 | gcttttATTGGggggt V$HFH3_01 | 212 (+) | 0.981 | 0.925 | tggTATTTttttt V$HFH3_01 | 214 (+) | 0.931 | 0.890 | gtaTTTTTttttt V$HFH3_01 | 228 (+) | 0.931 | 0.906 | tccTTTTTgtctt V$HFH3_01 | 505 (+) | 0.981 | 0.942 | ataTATTTgtctt V$HFH3_01 | 847 (-) | 0.981 | 0.976 | taaatAAATAcga V$HFH3_01 | 876 (-) | 1.000 | 0.915 | tgcacAAACAcga V$HFH3_01 | 1526 (+) | 1.000 | 0.932 | agcTGTTTttttg V$HFH3_01 | 1893 (+) | 0.981 | 0.966 | ttaTATTTatttt V$HFH3_01 | 2291 (+) | 0.931 | 0.908 | tagTTTTTgtgtg V$HFH3_01 | 2360 (+) | 0.981 | 0.966 | ttgTATTTatttt V$HFH3_01 | 2409 (-) | 1.000 | 0.938 | aaaaaAAACAaaa V$HFH3_01 | 2413 (-) | 0.931 | 0.934 | aaaacAAAAAatc V$HFH3_01 | 4007 (-) | 0.981 | 0.980 | taaatAAATAttc V$HFH3_01 | 4220 (-) | 0.981 | 0.930 | taaaaAAATAaat V$HFH3_01 | 4425 (-) | 0.931 | 0.896 | taaaaAAAAAaaa V$HFH3_01 | 4431 (-) | 0.931 | 0.891 | aaaaaAAAAAacc V$HFH3_01 | 4478 (+) | 0.931 | 0.891 | gcaTTTTTctttt V$HFH3_01 | 5102 (+) | 0.981 | 0.890 | tatTATTTgtgag V$HFH3_01 | 5434 (+) | 0.981 | 0.946 | cttTATTTgtatt V$HFH3_01 | 5645 (-) | 0.931 | 0.927 | aaaatAAAAAgaa V$HFH3_01 | 5717 (-) | 1.000 | 0.915 | cacaaAAACActa V$HFH3_01 | 5777 (-) | 0.981 | 0.894 | gagaaAAATAtga V$HFH3_01 | 5792 (-) | 0.981 | 0.893 | taattAAATAcaa V$HFH3_01 | 5798 (-) | 0.931 | 0.917 | aatacAAAAAaaa V$HFH3_01 | 5807 (-) | 0.981 | 0.928 | aaaaaAAATAgcc V$HFH3_01 | 6506 (-) | 0.981 | 0.893 | gccacAAATAaaa V$HFH3_01 | 6589 (-) | 0.981 | 0.961 | aaaacAAATAagg V$HFH3_01 | 6823 (+) | 0.931 | 0.898 | cctTTTTTatgtg V$HFH3_01 | 7047 (+) | 0.981 | 0.925 | gacTATTTttttt V$HFH3_01 | 7064 (+) | 1.000 | 0.914 | attTGTTTttgtt V$HFH3_01 | 7066 (+) | 0.931 | 0.929 | ttgTTTTTgtttg 
V$HFH3_01 | 7070 (+) | 1.000 | 0.974 | tttTGTTTgtgta V$HFH3_01 | 7425 (+) | 0.981 | 0.973 | gttTATTTgtttt V$HFH3_01 | 7429 (+) | 1.000 | 0.920 | attTGTTTttatt V$HFH3_01 | 7431 (+) | 0.931 | 0.928 | ttgTTTTTatttt V$HFH3_01 | 7771 (-) | 1.000 | 0.897 | tcaaaAAACAaaa V$HFH3_01 | 7780 (-) | 0.931 | 0.931 | aaaacAAAAAtta V$HFH3_01 | 7798 (-) | 0.931 | 0.905 | aagatAAAGAaca V$HFH3_01 | 7822 (-) | 1.000 | 0.979 | aaaacAAACAatt V$HFH3_01 | 7892 (+) | 0.931 | 0.930 | tggTTTTTatttt V$HFH3_01 | 7938 (+) | 1.000 | 0.901 | gtgTGTTTagttt V$HFH3_01 | 7969 (+) | 1.000 | 0.982 | agtTGTTTgtttt V$HFH3_01 | 8783 (-) | 0.931 | 0.902 | gagacAAAGAaga V$HFH3_01 | 8898 (-) | 1.000 | 0.908 | accacAAACAgat V$HFH3_01 | 9111 (-) | 0.931 | 0.937 | aaaacAAAAAacc V$HFH3_01 | 9425 (-) | 0.981 | 0.894 | cccatAAATAaca V$HFH3_01 | 9534 (-) | 1.000 | 0.921 | agaacAAACAagt V$HFH3_01 | 9723 (+) | 0.931 | 0.910 | atcTCTTTgtatg V$HFH3_01 | 10311 (+) | 1.000 | 0.928 | aacTGTTTttata V$HFH3_01 | 10313 (+) | 0.931 | 0.905 | ctgTTTTTatatt V$HFH3_01 | 10319 (+) | 0.981 | 0.910 | ttaTATTTttatt V$HFH3_01 | 10321 (+) | 0.931 | 0.925 | ataTTTTTatttt V$HFH3_01 | 10390 (-) | 0.931 | 0.912 | catatAAAAAtga V$HFH3_01 | 10700 (+) | 1.000 | 0.906 | ccgTGTTTatgaa V$HFH3_01 | 10911 (-) | 0.981 | 0.906 | tatgcAAATAata V$HFH3_01 | 11292 (-) | 0.981 | 0.920 | caaagAAATAata V$HFH3_01 | 11467 (+) | 0.931 | 0.921 | aatTTTTTatttg V$HFH3_01 | 11471 (+) | 0.981 | 0.969 | tttTATTTgtttt V$HFH3_01 | 11643 (+) | 0.981 | 0.910 | atgTATTTctgta V$FREAC2_01 | 601 (+) | 1.000 | 0.887 | acttttTAAACatcct V$FREAC2_01 | 632 (+) | 1.000 | 0.890 | ttttttTAAACattta V$FREAC2_01 | 958 (-) | 0.855 | 0.850 | caaatATTTAcaggac V$FREAC2_01 | 2572 (-) | 0.855 | 0.847 | taagtATTTAcaacaa V$FREAC2_01 | 3713 (-) | 0.855 | 0.856 | tcaatATTTAcgggtg V$FREAC2_01 | 7937 (-) | 1.000 | 0.915 | tgtgtGTTTAgttttg V$FREAC2_01 | 10699 (-) | 1.000 | 0.913 | cccgtGTTTAtgaaac V$FREAC2_01 | 11096 (-) | 1.000 | 0.886 | ccactGTTTAaagaac V$FREAC3_01 | 3713 (-) | 1.000 | 0.866 | 
tcaataTTTACgggtg V$FREAC4_01 | 28 (+) | 0.831 | 0.766 | tgtgtgTGAACaactt V$FREAC4_01 | 480 (+) | 0.840 | 0.802 | tacaagTCAACattgc V$FREAC4_01 | 579 (-) | 0.840 | 0.761 | tacctGTTGAattaac V$FREAC4_01 | 601 (+) | 1.000 | 0.765 | acttttTAAACatcct V$FREAC4_01 | 632 (+) | 1.000 | 0.780 | ttttttTAAACattta V$FREAC4_01 | 709 (-) | 0.792 | 0.781 | gatctCTTTAcagatg V$FREAC4_01 | 1031 (+) | 0.840 | 0.750 | gttgaaTCAACaaagc V$FREAC4_01 | 1049 (+) | 0.840 | 0.763 | caaaaaTCAACaactt V$FREAC4_01 | 1266 (-) | 0.792 | 0.808 | tccgtGTGTActctgg V$FREAC4_01 | 3290 (+) | 0.831 | 0.786 | atgaggTGAACatctc V$FREAC4_01 | 3450 (+) | 0.792 | 0.752 | gttaatAAAACacagg V$FREAC4_01 | 4076 (-) | 0.792 | 0.754 | aaaatGTATActatat V$FREAC4_01 | 4266 (-) | 0.792 | 0.850 | aggatGCTTAcctaag V$FREAC4_01 | 4532 (+) | 0.840 | 0.767 | aatgcgTCAACagcct V$FREAC4_01 | 5119 (+) | 0.792 | 0.832 | caaaagGAAACagatg V$FREAC4_01 | 5558 (+) | 0.792 | 0.753 | cactagAAAACaacca V$FREAC4_01 | 5912 (-) | 0.792 | 0.779 | agtttGTTTCcccagc V$FREAC4_01 | 6106 (-) | 0.840 | 0.785 | gtcctGTTGAccagag V$FREAC4_01 | 6122 (-) | 0.792 | 0.751 | ggcttGTTTTctcggc V$FREAC4_01 | 7820 (+) | 0.792 | 0.749 | ccaaaaCAAACaatta V$FREAC4_01 | 7937 (-) | 1.000 | 0.855 | tgtgtGTTTAgttttg V$FREAC4_01 | 8183 (-) | 0.792 | 0.772 | aacttGTTTGctctgc V$FREAC4_01 | 8384 (-) | 0.792 | 0.767 | gatctGTATAcctgaa V$FREAC4_01 | 9094 (-) | 0.792 | 0.814 | gctctGCTTAcaaaag V$FREAC4_01 | 9444 (-) | 1.000 | 0.765 | ccaagGTTTAccaatc V$FREAC4_01 | 10453 (+) | 0.840 | 0.759 | tttataTCAACaatgc V$FREAC4_01 | 10658 (+) | 0.840 | 0.869 | ctaatgTCAACaggag V$FREAC4_01 | 10699 (-) | 1.000 | 0.798 | cccgtGTTTAtgaaac V$FREAC4_01 | 11051 (-) | 0.831 | 0.809 | cctttGTTAAcgttgt V$FREAC4_01 | 11096 (-) | 1.000 | 0.812 | ccactGTTTAaagaac V$FREAC4_01 | 11653 (+) | 0.831 | 0.787 | gtactgTTAACacaat V$FREAC4_01 | 11653 (-) | 0.831 | 0.801 | gtactGTTAAcacaat V$FREAC4_01 | 11757 (+) | 0.831 | 0.767 | aaagagTTAACaggat V$FREAC7_01 | 71 (-) | 0.814 | 0.848 | ctgttgTTGATgttga V$FREAC7_01 
| 211 (-) | 0.832 | 0.832 | ctggtaTTTTTttttt V$FREAC7_01 | 504 (-) | 0.848 | 0.840 | tatataTTTGTcttct V$FREAC7_01 | 565 (+) | 1.000 | 0.858 | aggagATAAAcgtcta V$FREAC7_01 | 601 (+) | 0.832 | 0.836 | actttTTAAAcatcct V$FREAC7_01 | 632 (+) | 0.832 | 0.845 | tttttTTAAAcattta V$FREAC7_01 | 638 (-) | 1.000 | 0.835 | taaacaTTTATtcatt V$FREAC7_01 | 646 (-) | 1.000 | 0.832 | tattcaTTTATtatat V$FREAC7_01 | 841 (+) | 0.832 | 0.830 | acaagTTAAAtaaata V$FREAC7_01 | 845 (+) | 1.000 | 0.960 | gttaaATAAAtacgat V$FREAC7_01 | 874 (+) | 0.848 | 0.862 | aatgcACAAAcacgaa V$FREAC7_01 | 1049 (+) | 0.814 | 0.833 | caaaaATCAAcaactt V$FREAC7_01 | 1525 (-) | 0.832 | 0.847 | gagctgTTTTTttgtt V$FREAC7_01 | 1892 (-) | 1.000 | 0.965 | tttataTTTATtttat V$FREAC7_01 | 2359 (-) | 1.000 | 0.954 | cttgtaTTTATtttgc V$FREAC7_01 | 2407 (+) | 0.832 | 0.862 | ttaaaAAAAAcaaaaa V$FREAC7_01 | 2572 (-) | 0.848 | 0.833 | taagtaTTTACaacaa V$FREAC7_01 | 2897 (-) | 1.000 | 0.918 | atctttTTTATccaca V$FREAC7_01 | 3620 (-) | 1.000 | 0.843 | aatgcgTTTATaatag V$FREAC7_01 | 3713 (-) | 0.848 | 0.830 | tcaataTTTACgggtg V$FREAC7_01 | 3805 (+) | 0.832 | 0.839 | atagaAAAAAcatcgt V$FREAC7_01 | 4001 (+) | 1.000 | 0.943 | caacgATAAAtaaata V$FREAC7_01 | 4005 (+) | 1.000 | 0.959 | gataaATAAAtattct V$FREAC7_01 | 4218 (+) | 0.832 | 0.831 | gttaaAAAAAtaaatt V$FREAC7_01 | 4222 (+) | 1.000 | 0.837 | aaaaaATAAAttctca V$FREAC7_01 | 4491 (+) | 0.832 | 0.842 | gaacaAAAAAcaaaac V$FREAC7_01 | 4778 (+) | 1.000 | 0.913 | tctggATAAAgattgc V$FREAC7_01 | 5101 (-) | 0.848 | 0.831 | ttattaTTTGTgagtg V$FREAC7_01 | 5433 (-) | 0.848 | 0.843 | actttaTTTGTattgc V$FREAC7_01 | 5632 (+) | 1.000 | 0.904 | ctgggATAAAgacaaa V$FREAC7_01 | 5643 (+) | 1.000 | 0.921 | acaaaATAAAaagaag V$FREAC7_01 | 5715 (+) | 0.832 | 0.845 | cccacAAAAAcactaa V$FREAC7_01 | 5775 (+) | 0.832 | 0.834 | gagagAAAAAtatgaa V$FREAC7_01 | 5790 (+) | 0.832 | 0.836 | aataaTTAAAtacaaa V$FREAC7_01 | 5962 (-) | 0.832 | 0.829 | ccagtgTTTTTaagtg V$FREAC7_01 | 6251 (-) | 0.848 | 0.830 | 
tttctaTTTGTcccat V$FREAC7_01 | 6587 (+) | 0.848 | 0.845 | acaaaACAAAtaaggt V$FREAC7_01 | 6669 (-) | 0.832 | 0.838 | attttaTTTAGctttt V$FREAC7_01 | 6822 (-) | 1.000 | 0.927 | ccctttTTTATgtgct V$FREAC7_01 | 6833 (-) | 0.848 | 0.829 | gtgctaTTTACtgcca V$FREAC7_01 | 7063 (-) | 0.832 | 0.854 | aatttgTTTTTgtttg V$FREAC7_01 | 7069 (-) | 0.848 | 0.869 | tttttgTTTGTgtatg V$FREAC7_01 | 7420 (-) | 1.000 | 0.866 | taaaggTTTATttgtt V$FREAC7_01 | 7424 (-) | 0.848 | 0.852 | ggtttaTTTGTtttta V$FREAC7_01 | 7428 (-) | 0.832 | 0.859 | tatttgTTTTTatttt V$FREAC7_01 | 7430 (-) | 1.000 | 0.925 | tttgttTTTATtttat V$FREAC7_01 | 7769 (+) | 0.832 | 0.847 | tctcaAAAAAcaaaac V$FREAC7_01 | 7796 (+) | 1.000 | 0.920 | caaagATAAAgaacaa V$FREAC7_01 | 7820 (+) | 0.848 | 0.855 | ccaaaACAAAcaatta V$FREAC7_01 | 7891 (-) | 1.000 | 0.926 | ttggttTTTATtttga V$FREAC7_01 | 7937 (-) | 0.832 | 0.844 | tgtgtgTTTAGttttg V$FREAC7_01 | 7968 (-) | 0.848 | 0.863 | tagttgTTTGTttttc V$FREAC7_01 | 8057 (-) | 1.000 | 0.859 | gtctagTTTATctggt V$FREAC7_01 | 8472 (+) | 0.832 | 0.829 | aaggcAAAAAtaaggg V$FREAC7_01 | 8896 (+) | 0.848 | 0.850 | gtaccACAAAcagatt V$FREAC7_01 | 8928 (+) | 1.000 | 0.856 | tcagtATAAActaaag V$FREAC7_01 | 9423 (+) | 1.000 | 0.954 | atcccATAAAtaacag V$FREAC7_01 | 9532 (+) | 0.848 | 0.846 | ctagaACAAAcaagtc V$FREAC7_01 | 9960 (-) | 0.832 | 0.843 | tgtatgTTTCTggttc V$FREAC7_01 | 10310 (-) | 0.832 | 0.853 | taactgTTTTTatatt V$FREAC7_01 | 10312 (-) | 1.000 | 0.928 | actgttTTTATatttt V$FREAC7_01 | 10318 (-) | 0.832 | 0.841 | tttataTTTTTatttt V$FREAC7_01 | 10320 (-) | 1.000 | 0.926 | tatattTTTATtttat V$FREAC7_01 | 10334 (-) | 1.000 | 0.914 | atagttTTTATtgttt V$FREAC7_01 | 10388 (+) | 1.000 | 0.934 | tccatATAAAaatgaa V$FREAC7_01 | 10453 (+) | 0.814 | 0.846 | tttatATCAAcaatgc V$FREAC7_01 | 10699 (-) | 1.000 | 0.966 | cccgtgTTTATgaaac V$FREAC7_01 | 11096 (-) | 0.832 | 0.833 | ccactgTTTAAagaac V$FREAC7_01 | 11187 (-) | 0.832 | 0.845 | gccttgTTTCTtgatt V$FREAC7_01 | 11466 (-) | 1.000 | 0.927 | aaatttTTTATttgtt 
V$FREAC7_01 | 11470 (-) | 0.848 | 0.847 | tttttaTTTGTttttc V$FREAC7_01 | 11474 (-) | 0.832 | 0.843 | tatttgTTTTTccctt V$FREAC7_01 | 11642 (-) | 0.832 | 0.832 | aatgtaTTTCTgtact V$HFH8_01 | 603 (-) | 1.000 | 0.867 | tttttAAACAtcc V$HFH8_01 | 634 (-) | 1.000 | 0.867 | tttttAAACAttt V$HFH8_01 | 847 (-) | 0.892 | 0.908 | taaatAAATAcga V$HFH8_01 | 876 (-) | 1.000 | 0.906 | tgcacAAACAcga V$HFH8_01 | 1526 (+) | 1.000 | 0.904 | agcTGTTTttttg V$HFH8_01 | 1893 (+) | 0.892 | 0.901 | ttaTATTTatttt V$HFH8_01 | 2360 (+) | 0.892 | 0.900 | ttgTATTTatttt V$HFH8_01 | 2409 (-) | 1.000 | 0.898 | aaaaaAAACAaaa V$HFH8_01 | 2898 (+) | 0.870 | 0.872 | tctTTTTTatcca V$HFH8_01 | 3807 (-) | 1.000 | 0.883 | agaaaAAACAtcg V$HFH8_01 | 4003 (-) | 0.892 | 0.869 | acgatAAATAaat V$HFH8_01 | 4007 (-) | 0.892 | 0.910 | taaatAAATAttc V$HFH8_01 | 4493 (-) | 1.000 | 0.872 | acaaaAAACAaaa V$HFH8_01 | 4780 (-) | 0.870 | 0.872 | tggatAAAGAttg V$HFH8_01 | 5434 (+) | 0.892 | 0.841 | cttTATTTgtatt V$HFH8_01 | 5634 (-) | 0.870 | 0.861 | gggatAAAGAcaa V$HFH8_01 | 5645 (-) | 0.870 | 0.881 | aaaatAAAAAgaa V$HFH8_01 | 5717 (-) | 1.000 | 0.902 | cacaaAAACActa V$HFH8_01 | 5963 (+) | 1.000 | 0.887 | cagTGTTTttaag V$HFH8_01 | 6589 (-) | 0.892 | 0.838 | aaaacAAATAagg V$HFH8_01 | 6823 (+) | 0.870 | 0.896 | cctTTTTTatgtg V$HFH8_01 | 7064 (+) | 1.000 | 0.896 | attTGTTTttgtt V$HFH8_01 | 7070 (+) | 1.000 | 0.923 | tttTGTTTgtgta V$HFH8_01 | 7425 (+) | 0.892 | 0.838 | gttTATTTgtttt V$HFH8_01 | 7429 (+) | 1.000 | 0.901 | attTGTTTttatt V$HFH8_01 | 7431 (+) | 0.870 | 0.883 | ttgTTTTTatttt V$HFH8_01 | 7771 (-) | 1.000 | 0.880 | tcaaaAAACAaaa V$HFH8_01 | 7798 (-) | 0.870 | 0.878 | aagatAAAGAaca V$HFH8_01 | 7822 (-) | 1.000 | 0.919 | aaaacAAACAatt V$HFH8_01 | 7892 (+) | 0.870 | 0.883 | tggTTTTTatttt V$HFH8_01 | 7938 (+) | 1.000 | 0.877 | gtgTGTTTagttt V$HFH8_01 | 7969 (+) | 1.000 | 0.919 | agtTGTTTgtttt V$HFH8_01 | 7973 (+) | 1.000 | 0.876 | gttTGTTTttcaa V$HFH8_01 | 8898 (-) | 1.000 | 0.884 | accacAAACAgat V$HFH8_01 | 9425 (-) | 
0.892 | 0.885 | cccatAAATAaca V$HFH8_01 | 9534 (-) | 1.000 | 0.905 | agaacAAACAagt V$HFH8_01 | 9961 (+) | 1.000 | 0.867 | gtaTGTTTctggt V$HFH8_01 | 10311 (+) | 1.000 | 0.904 | aacTGTTTttata V$HFH8_01 | 10313 (+) | 0.870 | 0.887 | ctgTTTTTatatt V$HFH8_01 | 10321 (+) | 0.870 | 0.885 | ataTTTTTatttt V$HFH8_01 | 10335 (+) | 0.870 | 0.857 | tagTTTTTattgt V$HFH8_01 | 10390 (-) | 0.870 | 0.897 | catatAAAAAtga V$HFH8_01 | 10455 (-) | 0.783 | 0.834 | tatatCAACAatg V$HFH8_01 | 10700 (+) | 1.000 | 0.966 | ccgTGTTTatgaa V$HFH8_01 | 11097 (+) | 1.000 | 0.861 | cacTGTTTaaaga V$HFH8_01 | 11188 (+) | 1.000 | 0.882 | cctTGTTTcttga V$HFH8_01 | 11467 (+) | 0.870 | 0.899 | aatTTTTTatttg V$HFH8_01 | 11471 (+) | 0.892 | 0.837 | tttTATTTgtttt V$HFH8_01 | 11475 (+) | 1.000 | 0.877 | attTGTTTttccc V$HFH8_01 | 11828 (-) | 1.000 | 0.861 | gtgagAAACAgca V$NFAT_Q6 | 223 (-) | 1.000 | 0.995 | tttTTTCCtttt V$NFAT_Q6 | 327 (+) | 1.000 | 0.969 | tgctGGAAAttc V$NFAT_Q6 | 1190 (+) | 1.000 | 0.969 | cccaGGAAAgct V$NFAT_Q6 | 1559 (+) | 1.000 | 0.967 | ttctGGAAAtct V$NFAT_Q6 | 1637 (-) | 1.000 | 0.962 | gccTTTCCtagt V$NFAT_Q6 | 1718 (-) | 1.000 | 0.956 | cctTTTCCgggc V$NFAT_Q6 | 1955 (-) | 1.000 | 0.975 | aacTTTCCtttt V$NFAT_Q6 | 2164 (-) | 1.000 | 0.962 | tccTTTCCttct V$NFAT_Q6 | 2172 (-) | 1.000 | 0.967 | ttcTTTCCttct V$NFAT_Q6 | 2378 (-) | 1.000 | 0.966 | agtTTTCCcaga V$NFAT_Q6 | 2399 (-) | 1.000 | 0.965 | tgcTTTCCttaa V$NFAT_Q6 | 2510 (+) | 1.000 | 0.974 | aaagGGAAAaaa V$NFAT_Q6 | 2563 (-) | 1.000 | 0.948 | aaaTTTCCgtaa V$NFAT_Q6 | 3109 (-) | 1.000 | 0.964 | taaTTTCCatgc V$NFAT_Q6 | 3323 (-) | 1.000 | 0.982 | cctTTTCCtgat V$NFAT_Q6 | 3336 (-) | 1.000 | 0.946 | ctaTTTCCctcc V$NFAT_Q6 | 3835 (-) | 1.000 | 0.973 | aaaTTTCCattc V$NFAT_Q6 | 4046 (+) | 1.000 | 0.950 | agggGGAAAtag V$NFAT_Q6 | 4112 (-) | 1.000 | 0.968 | catTTTCCcaga V$NFAT_Q6 | 4687 (+) | 1.000 | 0.969 | ggcaGGAAActt V$NFAT_Q6 | 5120 (+) | 1.000 | 0.979 | aaaaGGAAAcag V$NFAT_Q6 | 5421 (-) | 1.000 | 0.944 | atcTTTCCctcc V$NFAT_Q6 | 5456 
(-) | 1.000 | 0.959 | cctTTTCCccac V$NFAT_Q6 | 5471 (-) | 1.000 | 0.969 | ctaTTTCCtgat V$NFAT_Q6 | 5509 (-) | 1.000 | 0.946 | aggTTTCCctgc V$NFAT_Q6 | 5519 (-) | 1.000 | 0.963 | gccTTTCCtcct V$NFAT_Q6 | 5861 (-) | 1.000 | 0.961 | tggTTTCCcatg V$NFAT_Q6 | 5915 (-) | 1.000 | 0.955 | ttgTTTCCccag V$NFAT_Q6 | 6459 (-) | 1.000 | 0.969 | ttaTTTCCtgat V$NFAT_Q6 | 6968 (+) | 1.000 | 0.957 | ggatGGAAAgga V$NFAT_Q6 | 6998 (+) | 1.000 | 0.981 | agatGGAAAagg V$NFAT_Q6 | 7017 (-) | 1.000 | 0.973 | atgTTTCCtgca V$NFAT_Q6 | 7246 (+) | 1.000 | 0.991 | ctcaGGAAAaag V$NFAT_Q6 | 7562 (+) | 1.000 | 0.968 | tgctGGAAAcca V$NFAT_Q6 | 8144 (+) | 1.000 | 0.966 | tgagGGAAAaca V$NFAT_Q6 | 8199 (-) | 1.000 | 0.956 | ctgTTTCCcctc V$NFAT_Q6 | 9121 (+) | 1.000 | 0.964 | accgGGAAAacc V$NFAT_Q6 | 9509 (-) | 1.000 | 0.951 | aagTTTCCccat V$NFAT_Q6 | 9873 (+) | 1.000 | 0.968 | atgaGGAAAtca V$NFAT_Q6 | 10587 (-) | 1.000 | 0.979 | caaTTTCCtata V$NFAT_Q6 | 11090 (-) | 1.000 | 0.998 | aatTTTCCactg V$NFAT_Q6 | 11270 (+) | 1.000 | 0.977 | aaaaGGAAAtta V$NFAT_Q6 | 11479 (-) | 1.000 | 0.978 | gttTTTCCcttg F$DDE1_B | 796 (-) | 1.000 | 0.928 | cGCTGAgcc F$DDE1_B | 1484 (-) | 0.943 | 0.908 | gGCAGAggg F$DDE1_B | 1711 (+) | 0.943 | 0.908 | cccTCTGCc F$DDE1_B | 3231 (+) | 1.000 | 0.899 | cagTCAGCc F$DDE1_B | 4181 (-) | 1.000 | 0.990 | gGCTGAgca F$DDE1_B | 4405 (+) | 1.000 | 0.946 | aggTCAGCc F$DDE1_B | 4973 (+) | 1.000 | 0.992 | ggcTCAGCc F$DDE1_B | 4997 (-) | 0.943 | 0.901 | gGCAGAcct F$DDE1_B | 6225 (+) | 0.943 | 0.899 | cgcTCTGCa F$DDE1_B | 6547 (-) | 1.000 | 0.946 | gGCTGAggt F$DDE1_B | 7606 (-) | 1.000 | 0.937 | tGCTGAgcc F$DDE1_B | 8141 (-) | 1.000 | 0.899 | tGCTGAggg F$DDE1_B | 8191 (+) | 0.943 | 0.944 | tgcTCTGCc F$DDE1_B | 8305 (+) | 0.943 | 0.908 | cccTCTGCc F$DDE1_B | 9093 (+) | 0.943 | 0.901 | agcTCTGCt F$DDE1_B | 9611 (-) | 1.000 | 0.901 | aGCTGAggt F$DDE1_B | 9638 (+) | 1.000 | 0.901 | aggTCAGCt F$DDE1_B | 10276 (-) | 1.000 | 0.990 | gGCTGAgca F$STRE_B | 1610 (+) | 1.000 | 1.000 | cAGGGGg F$STRE_B | 
4045 (+) | 1.000 | 1.000 | cAGGGGg F$STRE_B | 10789 (+) | 1.000 | 1.000 | cAGGGGg V$ATATA_B | 658 (+) | 0.972 | 0.953 | atataTAAGT V$ATATA_B | 8637 (+) | 0.926 | 0.938 | ttattTAAGA V$BEL1_B | 7704 (-) | 1.000 | 0.892 | gaggtagaggcagaaggaTCAGCagttt V$LDSPOLYA_B | 508 (+) | 0.966 | 0.907 | tattTGTCTtctgatt V$LDSPOLYA_B | 874 (-) | 0.972 | 0.898 | aatgcacAAACAcgaa V$LDSPOLYA_B | 1266 (+) | 1.000 | 0.920 | tccgTGTGTactctgg V$LDSPOLYA_B | 3452 (-) | 1.000 | 0.939 | taataaaACACAgggc V$LDSPOLYA_B | 3559 (+) | 1.000 | 0.919 | tagcTGTGTggccttt V$LDSPOLYA_B | 3659 (-) | 1.000 | 0.911 | taaagtaACACAgact V$LDSPOLYA_B | 5119 (-) | 0.972 | 0.908 | caaaaggAAACAgatg V$LDSPOLYA_B | 5600 (-) | 1.000 | 0.899 | taatataACACAagaa V$LDSPOLYA_B | 5658 (-) | 1.000 | 0.928 | gaaaagaACACAataa V$LDSPOLYA_B | 5668 (-) | 0.966 | 0.906 | caataagAGACAgaca V$LDSPOLYA_B | 7332 (-) | 0.966 | 0.921 | gaacaatAGACAcatt V$LDSPOLYA_B | 7935 (+) | 1.000 | 0.905 | tttgTGTGTttagttt V$LDSPOLYA_B | 7937 (+) | 0.972 | 0.908 | tgtgTGTTTagttttg V$LDSPOLYA_B | 10178 (+) | 1.000 | 0.924 | agtgTGTGTgttcagc V$LPOLYA_B | 3910 (+) | 1.000 | 0.993 | tAATAAag V$LPOLYA_B | 11688 (+) | 1.000 | 1.000 | cAATAAag V$MTATA_B | 233 (-) | 0.951 | 0.835 | tttgtcttTTTAAatct V$MTATA_B | 238 (+) | 0.951 | 0.928 | ctttTTAAAtctgggct V$MTATA_B | 288 (-) | 0.901 | 0.859 | tcaagtgtTTTACagac V$MTATA_B | 515 (-) | 0.951 | 0.860 | cttctgatTTTAAaaat V$MTATA_B | 520 (+) | 0.951 | 0.856 | gattTTAAAaatgatgg V$MTATA_B | 566 (+) | 1.000 | 0.809 | ggagATAAAcgtctacc V$MTATA_B | 597 (-) | 0.951 | 0.867 | acttacttTTTAAacat V$MTATA_B | 602 (+) | 0.951 | 0.872 | ctttTTAAAcatcctgt V$MTATA_B | 628 (-) | 0.951 | 0.860 | cactttttTTTAAacat V$MTATA_B | 633 (+) | 0.951 | 0.853 | ttttTTAAAcatttatt V$MTATA_B | 707 (-) | 0.901 | 0.846 | cggatctcTTTACagat V$MTATA_B | 830 (+) | 0.877 | 0.811 | tactATATAaaacaagt V$MTATA_B | 832 (+) | 1.000 | 0.906 | ctatATAAAacaagtta V$MTATA_B | 846 (+) | 1.000 | 0.806 | ttaaATAAAtacgatcg V$MTATA_B | 956 (-) | 0.901 | 0.820 | 
atcaaataTTTACagga V$MTATA_B | 1055 (-) | 0.951 | 0.828 | tcaacaacTTTAAaata V$MTATA_B | 1060 (+) | 0.951 | 0.853 | aactTTAAAataagatg V$MTATA_B | 1144 (-) | 1.000 | 0.815 | accgttacTTTATgtag V$MTATA_B | 1742 (+) | 0.846 | 0.827 | tcctATAGAacactgat V$MTATA_B | 1750 (-) | 0.846 | 0.823 | aacactgaTCTATaagc V$MTATA_B | 1757 (+) | 0.801 | 0.810 | atctATAAGcctctcgc V$MTATA_B | 1884 (-) | 1.000 | 0.913 | cctcccatTTTATattt V$MTATA_B | 2003 (+) | 0.846 | 0.825 | ggctATACAtctcctga V$MTATA_B | 2570 (-) | 0.901 | 0.860 | cgtaagtaTTTACaaca V$MTATA_B | 2850 (-) | 0.901 | 0.811 | ttccgtccTTTAGagct V$MTATA_B | 3398 (-) | 0.827 | 0.809 | tctccagtTATAAaagt V$MTATA_B | 3403 (+) | 1.000 | 0.893 | agttATAAAagttaatg V$MTATA_B | 3450 (+) | 1.000 | 0.860 | gttaATAAAacacaggg V$MTATA_B | 3618 (-) | 1.000 | 0.857 | caaatgcgTTTATaata V$MTATA_B | 3845 (+) | 1.000 | 0.812 | tctcATAAAatatccag V$MTATA_B | 3908 (+) | 1.000 | 0.833 | attaATAAAgctttatg V$MTATA_B | 3911 (-) | 1.000 | 0.838 | aataaagcTTTATgaat V$MTATA_B | 4070 (+) | 0.901 | 0.810 | atatGTAAAatgtatac V$MTATA_B | 4098 (+) | 0.901 | 0.836 | gattCTAAAacatccat V$MTATA_B | 4314 (+) | 1.000 | 0.867 | ctgcATAAAactgggtc V$MTATA_B | 4361 (-) | 0.951 | 0.847 | atcagaagTTTAAagtt V$MTATA_B | 4366 (+) | 0.951 | 0.842 | aagtTTAAAgttatctg V$MTATA_B | 4779 (+) | 1.000 | 0.810 | ctggATAAAgattgctg V$MTATA_B | 5390 (-) | 1.000 | 0.841 | attttaccTTTATattt V$MTATA_B | 5413 (+) | 0.951 | 0.816 | cttaTTAAAtctttccc V$MTATA_B | 5427 (-) | 1.000 | 0.814 | ccctccacTTTATttgt V$MTATA_B | 6465 (+) | 1.000 | 0.814 | cctgATAAAatgctcag V$MTATA_B | 6716 (-) | 0.951 | 0.872 | ccacaaccTTTAAaggg V$MTATA_B | 6721 (+) | 0.951 | 0.872 | acctTTAAAgggtgagc V$MTATA_B | 6754 (-) | 1.000 | 0.880 | actatcccTTTATacca V$MTATA_B | 6759 (+) | 0.827 | 0.817 | ccctTTATAccatgaac V$MTATA_B | 6820 (-) | 1.000 | 0.824 | ctccctttTTTATgtgc V$MTATA_B | 6866 (-) | 0.801 | 0.816 | gaccgaggCTTATagga V$MTATA_B | 7410 (-) | 0.951 | 0.843 | gaagaactTTTAAaggt V$MTATA_B | 7415 (+) | 0.951 | 0.841 | 
acttTTAAAggtttatt V$MTATA_B | 7637 (-) | 0.951 | 0.844 | agaattatTTTAAaaat V$MTATA_B | 7642 (+) | 0.951 | 0.833 | tattTTAAAaatgcttt V$MTATA_B | 7683 (+) | 0.901 | 0.854 | agctGTAAAgccagaac V$MTATA_B | 7797 (+) | 1.000 | 0.811 | aaagATAAAgaacaacg V$MTATA_B | 8085 (+) | 0.846 | 0.862 | tcctATACAgctctggg V$MTATA_B | 8845 (-) | 0.951 | 0.906 | cagaacggTTTAAaaat V$MTATA_B | 8850 (+) | 0.951 | 0.864 | cggtTTAAAaattaaga V$MTATA_B | 8929 (+) | 1.000 | 0.880 | cagtATAAActaaagca V$MTATA_B | 8942 (-) | 0.951 | 0.867 | agcaacttTTTAAaatt V$MTATA_B | 8947 (+) | 0.951 | 0.853 | ctttTTAAAattagtag V$MTATA_B | 9424 (+) | 1.000 | 0.820 | tcccATAAAtaacagga V$MTATA_B | 9710 (+) | 0.901 | 0.823 | tggtGTAAAattgatct V$MTATA_B | 9819 (-) | 0.901 | 0.845 | cttcaaatTTTAGaagg V$MTATA_B | 9834 (-) | 0.951 | 0.859 | gggataatTTTAAagtc V$MTATA_B | 9839 (+) | 0.951 | 0.849 | aattTTAAAgtcaggca V$MTATA_B | 9878 (-) | 0.951 | 0.829 | gaaatcaaTTTAAaatt V$MTATA_B | 9883 (+) | 0.951 | 0.851 | caatTTAAAattctttt V$MTATA_B | 9891 (-) | 0.951 | 0.858 | aattctttTTTAAagac V$MTATA_B | 9896 (+) | 0.951 | 0.877 | ttttTTAAAgacttact V$MTATA_B | 9947 (+) | 0.819 | 0.811 | tgttATAACaccctgta V$MTATA_B | 10017 (+) | 0.846 | 0.821 | aggtATAGAccgtagtt V$MTATA_B | 10310 (-) | 1.000 | 0.851 | taactgttTTTATattt V$MTATA_B | 10323 (-) | 1.000 | 0.855 | atttttatTTTATagtt V$MTATA_B | 10389 (+) | 1.000 | 0.934 | ccatATAAAaatgaacg V$MTATA_B | 10445 (-) | 1.000 | 0.882 | acgtctctTTTATatca V$MTATA_B | 10468 (-) | 0.901 | 0.837 | catctcacTTTAGacca V$MTATA_B | 10697 (-) | 1.000 | 0.831 | cacccgtgTTTATgaaa V$MTATA_B | 10809 (+) | 0.901 | 0.889 | gcatCTAAAtcgcagtc V$MTATA_B | 11094 (-) | 0.951 | 0.853 | ttccactgTTTAAagaa V$MTATA_B | 11099 (+) | 0.951 | 0.897 | ctgtTTAAAgaacgact V$MTATA_B | 11336 (+) | 0.901 | 0.810 | cgatGTAAAtgccatta V$MTATA_B | 11630 (-) | 0.951 | 0.856 | gaccctcgTTTAAatgt V$MTATA_B | 11635 (+) | 0.951 | 0.827 | tcgtTTAAAtgtatttc V$MUSCLE_INI_B | 100 (-) | 1.000 | 0.875 | cgcctggGGGTGgccagtgag V$MUSCLE_INI_B | 191 
(+) | 0.930 | 0.877 | tgcctcccaCACCAccatgcc V$MUSCLE_INI_B | 1274 (-) | 0.932 | 0.856 | tactctgGGATGacagggacc V$MUSCLE_INI_B | 1321 (+) | 0.985 | 0.869 | ctcccttctCACTCcccaggg V$MUSCLE_INI_B | 1823 (+) | 1.000 | 0.855 | cagggaagaCACCCattgggg V$MUSCLE_INI_B | 2011 (-) | 0.985 | 0.903 | atctcctGAGTGctgggatta V$MUSCLE_INI_B | 4328 (+) | 0.932 | 0.858 | gtctggtgaCATCCaggtggt V$MUSCLE_INI_B | 5036 (-) | 0.901 | 0.854 | tgctcagGATTGctgtggtct V$MUSCLE_INI_B | 5816 (-) | 0.981 | 0.900 | agccctgGCGTGgcactgtga V$MUSCLE_INI_B | 6522 (+) | 1.000 | 0.857 | cacgaagttCACCCtggagac V$MUSCLE_INI_B | 6624 (+) | 1.000 | 0.872 | tgagtggggCACCCagaaggc V$MUSCLE_INI_B | 7192 (-) | 1.000 | 0.875 | gccacggGGGTGctatgaatt V$MUSCLE_INI_B | 8229 (+) | 1.000 | 0.907 | tggctcccaCACCCagatgaa V$MUSCLE_INI_B | 8732 (-) | 1.000 | 0.917 | aacgtcgGGGTGtggtggccc V$MUSCLE_INI_B | 8734 (-) | 0.974 | 0.862 | cgtcgggGTGTGgtggcccac V$MUSCLE_INI_B | 9183 (+) | 0.985 | 0.883 | cacagagacCACTCaagagag V$MUSCLE_INI_B | 9271 (+) | 0.930 | 0.859 | agcgacagcCACCAagctgag V$MUSCLE_INI_B | 9390 (-) | 0.917 | 0.855 | gcccctgGAATGtggttctat V$MUSCLE_INI_B | 9565 (-) | 1.000 | 0.862 | ggtttctGGGTGtggtggtat V$MUSCLE_INI_B | 9682 (-) | 1.000 | 0.879 | ggaggggGGGTGgtggtggtg V$MUSCLE_INI_B | 10867 (-) | 0.930 | 0.904 | tgcagcgTGGTGgtgggagca V$MUSCLE_INI_B | 11814 (-) | 0.915 | 0.873 | ggcccagGGTTGgggtgagaa V$MUSCLE_INI_B | 11834 (-) | 0.985 | 0.892 | aacagcaGAGTGgggggtggg V$MUSCLE_INI_B | 11841 (-) | 1.000 | 0.869 | gagtgggGGGTGgggtatggg V$MYCMAX_B | 735 (+) | 1.000 | 0.948 | aCCACGtggt V$MYCMAX_B | 735 (-) | 1.000 | 0.948 | accaCGTGGt V$MYCMAX_B | 2614 (+) | 0.958 | 0.933 | gCCATGtgta V$MYCMAX_B | 7192 (+) | 1.000 | 0.938 | gCCACGgggg V$MINI19_B | 2011 (-) | 0.998 | 0.866 | atctcctGAGTGctgggatta V$MINI19_B | 6224 (-) | 0.986 | 0.851 | acgctctGCATGctgtctggc V$MINI19_B | 6647 (+) | 0.953 | 0.856 | ctcagtctgCAAACagatgaa V$MINI19_B | 8229 (+) | 0.998 | 0.851 | tggctcccaCACCCagatgaa V$MINI19_B | 9183 (+) | 0.998 | 0.847 | 
cacagagacCACTCaagagag V$MINI19_B | 9565 (-) | 0.998 | 0.866 | ggtttctGGGTGtggtggtat V$MINI19_B | 10979 (-) | 0.995 | 0.845 | ggagcctGTGTGgccctggaa V$MINI19_B | 11814 (-) | 0.955 | 0.847 | ggcccagGGTTGgggtgagaa V$MINI20_B | 2011 (-) | 0.949 | 0.867 | atctcctgagTGCTGggatta V$MINI20_B | 4848 (+) | 0.960 | 0.858 | ctgagaCTTCActgagggagg V$MINI20_B | 6647 (+) | 0.944 | 0.855 | ctcagtCTGCAaacagatgaa V$MINI20_B | 9390 (-) | 0.937 | 0.867 | gcccctggaaTGTGGttctat V$MINI20_B | 11814 (-) | 1.000 | 0.857 | ggcccagggtTGGGGtgagaa V$PAX3_B | 1138 (-) | 0.868 | 0.881 | cctaaaacCGTTActttatgt V$PAX3_B | 2956 (-) | 0.772 | 0.811 | acgaatacAGTGAcggcagat V$PAX3_B | 2958 (+) | 0.904 | 0.797 | gaatacagTGACGgcagatgc V$PAX3_B | 5919 (-) | 0.818 | 0.812 | ttccccagCGAGActcccttg V$PAX3_B | 6214 (+) | 1.000 | 0.856 | aaggtttcTCACGctctgcat V$PAX3_B | 6553 (+) | 0.797 | 0.813 | ggtaagagTCATGgttgccac V$PAX3_B | 10239 (+) | 0.772 | 0.801 | gttggtggTCAGGcttgtgca V$WHN_B | 10900 (+) | 1.000 | 0.992 | agaGACGCtac V$ZF5_B | 812 (-) | 0.888 | 0.756 | caGCCCGacactt V$ZF5_B | 1004 (+) | 0.919 | 0.816 | tggggaCGCACct V$ZF5_B | 1005 (-) | 0.841 | 0.756 | ggGGACGcaccta V$ZF5_B | 1125 (-) | 0.922 | 0.792 | atGCACGtgtgtt V$ZF5_B | 1366 (-) | 0.888 | 0.787 | atGCTCGtttgca V$ZF5_B | 1404 (-) | 0.738 | 0.750 | gaGTGGGtccatt V$ZF5_B | 1587 (-) | 0.753 | 0.749 | gaGCCAGcactgg V$ZF5_B | 2340 (+) | 0.919 | 0.768 | accctaCGCTCtt V$ZF5_B | 2466 (-) | 0.786 | 0.779 | taGCAAGcgcgaa V$ZF5_B | 2470 (-) | 1.000 | 0.845 | aaGCGCGaaggcc V$ZF5_B | 2811 (+) | 0.922 | 0.769 | gaacacCGTGCtc V$ZF5_B | 2883 (+) | 0.841 | 0.820 | tgtatgCGTACtg V$ZF5_B | 2918 (-) | 0.841 | 0.769 | gtGGACGgaccct V$ZF5_B | 2974 (+) | 0.738 | 0.745 | gatgcgCAGGCat V$ZF5_B | 3004 (-) | 0.919 | 0.758 | agGTGCGcaggcg V$ZF5_B | 3069 (-) | 0.753 | 0.747 | taGCTAGgtcaca V$ZF5_B | 3964 (+) | 0.753 | 0.770 | tgagcaCTGGCtg V$ZF5_B | 4389 (-) | 0.783 | 0.755 | taGTGAGttccaa V$ZF5_B | 4597 (-) | 0.888 | 0.811 | ctGCCCGcttgct V$ZF5_B | 4598 (+) | 0.786 | 0.759 | 
tgcccgCTTGCtg V$ZF5_B | 4730 (+) | 0.864 | 0.782 | ttggccCTCGCtg V$ZF5_B | 4771 (-) | 0.864 | 0.764 | caGCGAGtctgga V$ZF5_B | 4907 (+) | 0.771 | 0.792 | caagtgCATGCtc V$ZF5_B | 5075 (+) | 0.753 | 0.763 | gagaggCTGGCtc V$ZF5_B | 5108 (-) | 0.783 | 0.757 | ttGTGAGtgctca V$ZF5_B | 5132 (+) | 0.807 | 0.763 | atggtgCGGTCaa V$ZF5_B | 5220 (-) | 0.919 | 0.749 | ctGAGCGgagact V$ZF5_B | 5329 (-) | 0.919 | 0.845 | caGTGCGccgata V$ZF5_B | 5893 (+) | 0.919 | 0.779 | ggggccCGCCCct V$ZF5_B | 5894 (-) | 0.888 | 0.802 | ggGCCCGccccta V$ZF5_B | 5924 (-) | 0.864 | 0.798 | caGCGAGactccc V$ZF5_B | 6062 (+) | 0.922 | 0.839 | cctgcaCGTGCtg V$ZF5_B | 6063 (-) | 0.922 | 0.797 | ctGCACGtgctgc V$ZF5_B | 6219 (+) | 0.919 | 0.751 | ttctcaCGCTCtg V$ZF5_B | 6329 (+) | 0.768 | 0.761 | gaagcaCACACtg V$ZF5_B | 6381 (+) | 0.807 | 0.777 | ggtagaCGAACtg V$ZF5_B | 6402 (+) | 0.919 | 0.794 | tcagaaCGCTCag V$ZF5_B | 6403 (-) | 0.841 | 0.814 | caGAACGctcaga V$ZF5_B | 6498 (+) | 0.807 | 0.750 | cagatgCGGCCac V$ZF5_B | 6627 (+) | 0.768 | 0.747 | gtggggCACCCag V$ZF5_B | 6854 (+) | 0.841 | 0.792 | tgagcaCGTCCag V$ZF5_B | 6855 (-) | 0.922 | 0.846 | gaGCACGtccaga V$ZF5_B | 7088 (+) | 1.000 | 0.798 | gctttgCGCGCac V$ZF5_B | 7090 (+) | 0.919 | 0.782 | tttgcgCGCACct V$ZF5_B | 7091 (-) | 1.000 | 0.925 | ttGCGCGcaccta V$ZF5_B | 7143 (-) | 0.738 | 0.762 | aaGAGGGcgccgt V$ZF5_B | 7145 (-) | 0.919 | 0.808 | gaGGGCGccgtat V$ZF5_B | 7478 (+) | 1.000 | 0.790 | gtgctaCGCGCac V$ZF5_B | 7480 (+) | 0.919 | 0.750 | gctacgCGCACgt V$ZF5_B | 7739 (+) | 0.753 | 0.756 | cacaggCTGGCta V$ZF5_B | 7854 (+) | 0.841 | 0.757 | cgacagCGTACac V$ZF5_B | 7870 (+) | 0.783 | 0.765 | gaaaagCTCACta V$ZF5_B | 8030 (+) | 0.786 | 0.775 | tcagagCTTGCtg V$ZF5_B | 8403 (-) | 0.753 | 0.768 | aaGCCAGcccggc V$ZF5_B | 8407 (-) | 0.888 | 0.794 | caGCCCGgcctgt V$ZF5_B | 9012 (+) | 0.738 | 0.759 | caggggCAGGCag V$ZF5_B | 9156 (-) | 0.783 | 0.751 | atGTGAGttccta V$ZF5_B | 9197 (-) | 0.783 | 0.790 | aaGAGAGcaccaa V$ZF5_B | 10084 (-) | 0.753 | 0.769 | 
caGCCAGtgctct V$ZF5_B | 10242 (+) | 0.738 | 0.751 | ggtggtCAGGCtt V$ZF5_B | 10531 (+) | 0.922 | 0.832 | ggaggcCGTGCaa V$ZF5_B | 10532 (-) | 0.807 | 0.757 | gaGGCCGtgcaac V$ZF5_B | 10716 (+) | 0.922 | 0.802 | tctctgCGTGCtg V$ZF5_B | 10821 (-) | 0.807 | 0.789 | caGTCCGcctttt V$ZF5_B | 10951 (+) | 0.807 | 0.764 | tagctgCGGTCtg V$ZF5_B | 11023 (-) | 0.783 | 0.753 | gaGAGAGatccta V$ZF5_B | 11106 (-) | 0.841 | 0.779 | aaGAACGacttca V$ZF5_B | 11400 (-) | 0.841 | 0.766 | ttGTACGcattta V$ZF5_B | 11600 (-) | 0.768 | 0.766 | taGAGTGcacaga V$ZF5_B | 11903 (-) | 0.922 | 0.788 | taGCACGaaggct F$GCR1_B | 5728 (-) | 1.000 | 0.940 | taaactGGAAGcca V$ATF_B | 1236 (-) | 0.800 | 0.835 | ctttgaCATCAg V$ATF_B | 1238 (+) | 0.800 | 0.834 | tTGACAtcaggg V$ATF_B | 1445 (+) | 0.800 | 0.840 | gTGAAGtcactg V$ATF_B | 2963 (-) | 0.800 | 0.840 | cagtgaCGGCAg V$ATF_B | 2965 (+) | 1.000 | 0.853 | gTGACGgcagat V$ATF_B | 4530 (-) | 1.000 | 0.834 | caaatgCGTCAa V$ATF_B | 8208 (-) | 0.800 | 0.835 | ctctgaCCTCAg V$ATF_B | 8609 (-) | 1.000 | 0.835 | tcattcCGTCAa V$ATF_B | 11317 (+) | 1.000 | 0.838 | gTGACGacgatg V$ETS1_B | 604 (-) | 0.943 | 0.908 | ttttaaacATCCTgt V$ETS1_B | 1733 (-) | 1.000 | 0.915 | tcctgaacTTCCTat V$ETS1_B | 2149 (-) | 1.000 | 0.899 | ttccttccTTCCTct V$ETS1_B | 2701 (-) | 1.000 | 0.894 | tcttctccTTCCTct V$ETS1_B | 4962 (+) | 1.000 | 0.957 | gcAGGAAgtcaggct V$ETS1_B | 5467 (-) | 1.000 | 0.915 | catactatTTCCTga V$ETS1_B | 6083 (-) | 1.000 | 0.907 | tctgtgagTTCCTgt V$ETS1_B | 6455 (-) | 1.000 | 0.925 | aaacttatTTCCTga V$ETS1_B | 6793 (-) | 1.000 | 0.901 | gctggctcTTCCTcc V$ETS1_B | 7230 (+) | 1.000 | 0.976 | gcAGGAAgttcatcg V$ETS1_B | 8076 (-) | 1.000 | 0.940 | gcagaaacTTCCTat V$ETS1_B | 9435 (+) | 1.000 | 0.919 | acAGGAAgcccaagg V$ETS1_B | 9468 (+) | 0.943 | 0.924 | acAGGATgttctagc V$ETS1_B | 9874 (+) | 1.000 | 0.893 | tgAGGAAatcaattt V$ETS1_B | 10257 (+) | 1.000 | 0.986 | gcAGGAAgtgatttt V$ETS1_B | 10844 (+) | 1.000 | 0.899 | gcAGGAAgaagccat V$ETS2_B | 606 (-) | 0.880 | 0.860 | 
ttaaacATCCTgtc V$ETS2_B | 1190 (+) | 1.000 | 0.830 | cccAGGAAagctgg V$ETS2_B | 1667 (-) | 1.000 | 0.800 | tggagaTTCCTcct V$ETS2_B | 1735 (-) | 1.000 | 0.858 | ctgaacTTCCTata V$ETS2_B | 2007 (-) | 0.861 | 0.814 | atacatCTCCTgag V$ETS2_B | 2143 (-) | 1.000 | 0.829 | tctctcTTCCTtcc V$ETS2_B | 2147 (-) | 1.000 | 0.833 | tcttccTTCCTtcc V$ETS2_B | 2151 (-) | 1.000 | 0.846 | ccttccTTCCTctt V$ETS2_B | 2157 (-) | 1.000 | 0.826 | ttcctcTTCCTttc V$ETS2_B | 2201 (-) | 1.000 | 0.844 | tctttcTTCCTttc V$ETS2_B | 2682 (-) | 0.880 | 0.812 | cgttgcATCCTggt V$ETS2_B | 2703 (-) | 1.000 | 0.845 | ttctccTTCCTctt V$ETS2_B | 2781 (-) | 1.000 | 0.822 | agattcTTCCTtag V$ETS2_B | 3321 (-) | 1.000 | 0.839 | ggccttTTCCTgat V$ETS2_B | 3338 (-) | 0.861 | 0.823 | atttccCTCCTgtt V$ETS2_B | 4687 (+) | 1.000 | 0.887 | ggcAGGAAacttga V$ETS2_B | 4961 (+) | 1.000 | 0.936 | agcAGGAAgtcagg V$ETS2_B | 5058 (+) | 0.785 | 0.800 | agcAGGTAgtgaga V$ETS2_B | 5120 (+) | 1.000 | 0.847 | aaaAGGAAacagat V$ETS2_B | 5190 (+) | 0.880 | 0.833 | agcAGGATttgtct V$ETS2_B | 5263 (-) | 0.880 | 0.810 | acagtcATCCTggg V$ETS2_B | 5469 (-) | 1.000 | 0.913 | tactatTTCCTgat V$ETS2_B | 5873 (-) | 0.861 | 0.822 | gaccatCTCCTgct V$ETS2_B | 6085 (-) | 1.000 | 0.892 | tgtgagTTCCTgtg V$ETS2_B | 6274 (+) | 1.000 | 0.865 | agaAGGAAgcctct V$ETS2_B | 6457 (-) | 1.000 | 0.937 | acttatTTCCTgat V$ETS2_B | 6795 (-) | 1.000 | 0.846 | tggctcTTCCTccc V$ETS2_B | 6876 (+) | 1.000 | 0.863 | tatAGGAAgcacca V$ETS2_B | 6917 (+) | 1.000 | 0.841 | acaAGGAAttgatt V$ETS2_B | 6973 (+) | 1.000 | 0.816 | gaaAGGAAcgactt V$ETS2_B | 7015 (-) | 1.000 | 0.891 | ggatgtTTCCTgca V$ETS2_B | 7152 (-) | 0.861 | 0.811 | ccgtatCTCCTgga V$ETS2_B | 7229 (+) | 1.000 | 0.953 | agcAGGAAgttcat V$ETS2_B | 7246 (+) | 1.000 | 0.842 | ctcAGGAAaaagtc V$ETS2_B | 7464 (+) | 0.880 | 0.841 | tgcAGGATttgagt V$ETS2_B | 8078 (-) | 1.000 | 0.903 | agaaacTTCCTata V$ETS2_B | 8677 (+) | 1.000 | 0.813 | gttAGGAAggtaat V$ETS2_B | 8910 (+) | 1.000 | 0.806 | ttcAGGAAtactcg V$ETS2_B | 9434 (+) | 
1.000 | 0.915 | aacAGGAAgcccaa V$ETS2_B | 9467 (+) | 0.880 | 0.872 | tacAGGATgttcta V$ETS2_B | 9539 (-) | 0.861 | 0.817 | aaacaaGTCCTgcg V$ETS2_B | 9873 (+) | 1.000 | 0.867 | atgAGGAAatcaat V$ETS2_B | 9906 (-) | 0.785 | 0.821 | acttacTTACTgga V$ETS2_B | 10256 (+) | 1.000 | 0.982 | tgcAGGAAgtgatt V$ETS2_B | 10585 (-) | 1.000 | 0.842 | ttcaatTTCCTata V$ETS2_B | 10637 (+) | 1.000 | 0.806 | ctgAGGAAgacacc V$ETS2_B | 10843 (+) | 1.000 | 0.879 | cgcAGGAAgaagcc V$ETS2_B | 11270 (+) | 1.000 | 0.828 | aaaAGGAAattaag V$GABP_B | 692 (+) | 0.800 | 0.824 | accAGAAGaggg V$GABP_B | 768 (+) | 1.000 | 0.906 | tctGGAAGagta V$GABP_B | 922 (-) | 1.000 | 0.888 | agcaCTTCCacg V$GABP_B | 1554 (-) | 0.800 | 0.799 | aactCTTCTgga V$GABP_B | 1717 (-) | 0.800 | 0.814 | gcctTTTCCggg V$GABP_B | 1736 (-) | 1.000 | 0.804 | tgaaCTTCCtat V$GABP_B | 2144 (-) | 1.000 | 0.842 | ctctCTTCCttc V$GABP_B | 2158 (-) | 1.000 | 0.863 | tcctCTTCCttt V$GABP_B | 2710 (-) | 1.000 | 0.849 | tcctCTTCCcaa V$GABP_B | 2986 (-) | 1.000 | 0.826 | tctaCTTCCgct V$GABP_B | 3030 (-) | 1.000 | 0.823 | ccgaCTTCCgct V$GABP_B | 4607 (+) | 1.000 | 0.820 | gctGGAAGgtaa V$GABP_B | 4962 (+) | 1.000 | 0.821 | gcaGGAAGtcag V$GABP_B | 5490 (-) | 1.000 | 0.858 | tcccCTTCCctc V$GABP_B | 5731 (+) | 1.000 | 0.807 | actGGAAGccat V$GABP_B | 6796 (-) | 1.000 | 0.891 | ggctCTTCCtcc V$GABP_B | 7138 (+) | 0.800 | 0.820 | gccAGAAGaggg V$GABP_B | 7230 (+) | 1.000 | 0.870 | gcaGGAAGttca V$GABP_B | 7837 (-) | 0.800 | 0.825 | cccaCTTCTggc V$GABP_B | 8156 (-) | 1.000 | 0.813 | ttagCTTCCaga V$GABP_B | 8191 (-) | 0.800 | 0.813 | tgctCTGCCtgt V$GABP_B | 9374 (+) | 0.800 | 0.804 | acaGGGAGagct V$GABP_B | 9435 (+) | 1.000 | 0.859 | acaGGAAGccca V$GABP_B | 10074 (+) | 1.000 | 0.933 | tctGGAAGggca V$GABP_B | 10257 (+) | 1.000 | 0.914 | gcaGGAAGtgat V$GABP_B | 10844 (+) | 1.000 | 0.822 | gcaGGAAGaagc V$OCT1_B | 5342 (-) | 1.000 | 0.940 | gtttTGCATc V$OCT1_B | 6577 (+) | 1.000 | 0.953 | tATGCAaaac V$OCT1_B | 9935 (-) | 1.000 | 0.992 | aattTGCATg V$OCT1_B | 
10911 (+) | 1.000 | 1.000 | tATGCAaata V$OCT1_B | 11666 (-) | 0.821 | 0.881 | aattTTCATa P$RAV1_01 | 68 (-) | 1.000 | 0.951 | ttacTGTTGttg P$RAV1_01 | 492 (+) | 1.000 | 0.972 | ttgCAACAttat P$RAV1_01 | 1035 (+) | 1.000 | 0.941 | aatCAACAaagc P$RAV1_01 | 1845 (+) | 1.000 | 0.953 | cgtCAACAcaat P$RAV1_01 | 2552 (-) | 1.000 | 0.970 | tttcTGTTGtga P$RAV1_01 | 2579 (+) | 1.000 | 0.951 | ttaCAACAaatg P$RAV1_01 | 3141 (+) | 1.000 | 0.940 | ccaCAACAatat P$RAV1_01 | 3236 (+) | 1.000 | 0.937 | agcCAACAttcc P$RAV1_01 | 4524 (+) | 1.000 | 0.952 | agaCAACAaatg P$RAV1_01 | 4680 (-) | 0.947 | 0.949 | gttcTGTGGcag P$RAV1_01 | 5199 (-) | 0.947 | 0.938 | tgtcTGTGGcct P$RAV1_01 | 6504 (+) | 0.947 | 0.935 | cggCCACAaata P$RAV1_01 | 8222 (-) | 1.000 | 0.936 | ctgaTGTTGgct P$RAV1_01 | 8589 (+) | 1.000 | 0.950 | aggCAACAtctt P$RAV1_01 | 9179 (+) | 0.947 | 0.940 | acgCCACAgaga P$RAV1_01 | 10149 (-) | 1.000 | 0.942 | ttatTGTTGttg P$RAV1_01 | 10376 (-) | 1.000 | 0.965 | aaaaTGTTGcaa P$RAV1_01 | 10483 (+) | 0.947 | 0.937 | cagCCACAtatc P$RAV1_01 | 10691 (+) | 1.000 | 0.937 | tggCAACAcccg P$RAV1_01 | 11036 (-) | 1.000 | 0.938 | ccacTGTTGttg P$RAV1_02 | 98 (+) | 0.969 | 0.964 | ttcGCCTGgggg P$RAV1_02 | 442 (-) | 1.000 | 0.921 | taatCAGGTtgt P$RAV1_02 | 1358 (+) | 1.000 | 0.998 | gtcACCTGatgc P$RAV1_02 | 1390 (-) | 1.000 | 0.987 | agggCAGGTgag P$RAV1_02 | 1606 (-) | 0.967 | 0.967 | ggctCAGGGggt P$RAV1_02 | 1690 (-) | 0.967 | 0.963 | agctCAGGGgta P$RAV1_02 | 1769 (+) | 0.969 | 0.959 | ctcGCCTGtcct P$RAV1_02 | 1777 (+) | 0.972 | 0.968 | tccTCCTGgagc P$RAV1_02 | 1786 (+) | 1.000 | 0.983 | agcACCTGccat P$RAV1_02 | 2011 (+) | 0.972 | 0.969 | atcTCCTGagtg P$RAV1_02 | 2423 (-) | 0.967 | 0.954 | atcaCAGGGgtt P$RAV1_02 | 2640 (+) | 0.967 | 0.963 | tacCCCTGacct P$RAV1_02 | 3007 (-) | 0.969 | 0.954 | tgcgCAGGCgtc P$RAV1_02 | 3166 (+) | 0.969 | 0.954 | tccGCCTGtaca P$RAV1_02 | 3342 (+) | 0.972 | 0.955 | cccTCCTGttca P$RAV1_02 | 4041 (-) | 0.967 | 0.952 | ccagCAGGGgga P$RAV1_02 | 4172 (-) | 0.972 | 0.965 | 
tattCAGGAggc P$RAV1_02 | 4337 (-) | 1.000 | 0.988 | catcCAGGTggt P$RAV1_02 | 4350 (-) | 0.972 | 0.962 | gaggCAGGAgga P$RAV1_02 | 4592 (+) | 0.972 | 0.957 | tccTCCTGcccg P$RAV1_02 | 4819 (-) | 0.969 | 0.960 | ttacCAGGCggg P$RAV1_02 | 5877 (+) | 0.972 | 0.960 | atcTCCTGctgg P$RAV1_02 | 6155 (+) | 0.972 | 0.964 | ctcTCCTGcctc P$RAV1_02 | 7006 (-) | 0.969 | 0.957 | aaggCAGGCgga P$RAV1_02 | 7156 (+) | 0.972 | 0.972 | atcTCCTGgagc P$RAV1_02 | 7389 (-) | 1.000 | 0.989 | tgtcCAGGTggt P$RAV1_02 | 7624 (+) | 0.969 | 0.959 | gccGCCTGgcgg P$RAV1_02 | 8358 (+) | 0.967 | 0.953 | aacCCCTGtgca P$RAV1_02 | 8390 (+) | 1.000 | 0.922 | tatACCTGaagc P$RAV1_02 | 8502 (-) | 1.000 | 0.983 | gggaCAGGTgcg P$RAV1_02 | 8768 (+) | 1.000 | 0.987 | agcACCTGggag P$RAV1_02 | 8967 (-) | 1.000 | 0.991 | gaacCAGGTgtg P$RAV1_02 | 9008 (-) | 0.967 | 0.955 | gaggCAGGGgca P$RAV1_02 | 9304 (-) | 0.972 | 0.963 | tgccCAGGAggg P$RAV1_02 | 9341 (+) | 0.967 | 0.962 | gacCCCTGaagt P$RAV1_02 | 9389 (+) | 0.967 | 0.959 | agcCCCTGgaat P$RAV1_02 | 9555 (+) | 0.969 | 0.956 | gtcGCCTGtcgg P$RAV1_02 | 10664 (-) | 0.972 | 0.956 | tcaaCAGGAgga P$RAV1_02 | 10785 (-) | 0.967 | 0.963 | gaccCAGGGgga P$RAV1_02 | 11584 (-) | 0.972 | 0.962 | tatcCAGGAgtg P$RAV1_02 | 11961 (+) | 1.000 | 0.991 | tgcACCTGagct P$GAMYB_01 | 591 (+) | 0.937 | 0.947 | TAACAgac P$GAMYB_01 | 721 (-) | 1.000 | 0.943 | gatGGTTG P$GAMYB_01 | 1027 (-) | 0.978 | 0.982 | ggcAGTTG P$GAMYB_01 | 4539 (+) | 0.980 | 0.983 | CAACAgcc P$GAMYB_01 | 5281 (+) | 0.980 | 0.968 | CAACAggc P$GAMYB_01 | 7040 (-) | 0.978 | 0.935 | gttAGTTG P$GAMYB_01 | 7537 (-) | 1.000 | 0.953 | gttGGTTG P$GAMYB_01 | 8003 (-) | 0.980 | 0.968 | gccTGTTG P$GAMYB_01 | 10407 (-) | 1.000 | 0.953 | ggtGGTTG P$GAMYB_01 | 10965 (-) | 0.980 | 0.968 | gccTGTTG P$GAMYB_01 | 11494 (-) | 1.000 | 0.948 | tgcGGTTG P$GAMYB_01 | 11712 (+) | 0.978 | 0.982 | CAACTgcc P$GAMYB_01 | 11923 (+) | 0.988 | 0.935 | CAACGccc V$GATA1_05 | 448 (-) | 0.800 | 0.802 | ggTTGTCtgt V$GATA1_05 | 566 (+) | 1.000 | 0.942 | 
ggaGATAAac V$GATA1_05 | 1258 (-) | 0.800 | 0.803 | tgTCATCatc V$GATA1_05 | 1995 (-) | 1.000 | 0.948 | gaTTATCtgg V$GATA1_05 | 2722 (+) | 1.000 | 0.929 | ataGATAAtc V$GATA1_05 | 2902 (-) | 1.000 | 0.940 | ttTTATCcac V$GATA1_05 | 3930 (-) | 1.000 | 0.945 | caTTATCagt V$GATA1_05 | 4002 (+) | 1.000 | 0.932 | aacGATAAat V$GATA1_05 | 4374 (-) | 1.000 | 0.978 | agTTATCtgg V$GATA1_05 | 4779 (+) | 1.000 | 0.918 | ctgGATAAag V$GATA1_05 | 4817 (-) | 0.800 | 0.816 | tgTTACCagg V$GATA1_05 | 5582 (+) | 1.000 | 0.932 | aagGATAAta V$GATA1_05 | 5633 (+) | 1.000 | 0.917 | tggGATAAag V$GATA1_05 | 5977 (+) | 1.000 | 0.961 | ggtGATAAct V$GATA1_05 | 6465 (+) | 1.000 | 0.974 | cctGATAAaa V$GATA1_05 | 7788 (-) | 1.000 | 0.905 | aaTTATCcca V$GATA1_05 | 7797 (+) | 1.000 | 0.941 | aaaGATAAag V$GATA1_05 | 8062 (-) | 1.000 | 0.959 | gtTTATCtgg V$GATA1_05 | 8604 (-) | 1.000 | 0.949 | taTTATCatt V$GATA1_05 | 9732 (+) | 1.000 | 0.927 | tatGATAAtg V$GATA1_05 | 9833 (+) | 1.000 | 0.908 | aggGATAAtt V$GATA1_05 | 11064 (-) | 0.800 | 0.818 | tgTGATCagt V$GATA1_05 | 11348 (-) | 1.000 | 0.914 | caTTATCgaa V$GATA1_05 | 11441 (-) | 1.000 | 0.937 | acTTATCtat V$GATA1_05 | 11569 (-) | 1.000 | 0.960 | tcTTATCatt V$GATA1_05 | 11655 (+) | 0.800 | 0.818 | actGTTAAca V$GATA1_06 | 153 (-) | 0.839 | 0.820 | gcTCATCtct V$GATA1_06 | 566 (+) | 1.000 | 0.938 | ggaGATAAac V$GATA1_06 | 1699 (+) | 0.839 | 0.819 | gtaGATGAag V$GATA1_06 | 1753 (-) | 0.850 | 0.828 | acTGATCtat V$GATA1_06 | 1995 (-) | 1.000 | 0.949 | gaTTATCtgg V$GATA1_06 | 2722 (+) | 1.000 | 0.953 | ataGATAAtc V$GATA1_06 | 2902 (-) | 1.000 | 0.918 | ttTTATCcac V$GATA1_06 | 3930 (-) | 1.000 | 0.922 | caTTATCagt V$GATA1_06 | 4002 (+) | 1.000 | 0.900 | aacGATAAat V$GATA1_06 | 4374 (-) | 1.000 | 0.949 | agTTATCtgg V$GATA1_06 | 4779 (+) | 1.000 | 0.912 | ctgGATAAag V$GATA1_06 | 4801 (-) | 0.839 | 0.830 | caTAATCtct V$GATA1_06 | 4928 (+) | 0.850 | 0.841 | acaGATCAtg V$GATA1_06 | 5069 (+) | 0.839 | 0.858 | agaGATGAga V$GATA1_06 | 5407 (-) | 0.839 | 0.835 | 
ttTCATCtta V$GATA1_06 | 5582 (+) | 1.000 | 0.928 | aagGATAAta V$GATA1_06 | 5633 (+) | 1.000 | 0.897 | tggGATAAag V$GATA1_06 | 5977 (+) | 1.000 | 0.886 | ggtGATAAct V$GATA1_06 | 6465 (+) | 1.000 | 0.941 | cctGATAAaa V$GATA1_06 | 6659 (+) | 0.839 | 0.834 | acaGATGAag V$GATA1_06 | 7788 (-) | 1.000 | 0.871 | aaTTATCcca V$GATA1_06 | 7797 (+) | 1.000 | 0.972 | aaaGATAAag V$GATA1_06 | 8062 (-) | 1.000 | 0.952 | gtTTATCtgg V$GATA1_06 | 8241 (+) | 0.839 | 0.831 | ccaGATGAag V$GATA1_06 | 8381 (-) | 0.850 | 0.862 | ttTGATCtgt V$GATA1_06 | 8464 (-) | 0.839 | 0.844 | tcTCATCtaa V$GATA1_06 | 8604 (-) | 1.000 | 0.939 | taTTATCatt V$GATA1_06 | 9719 (-) | 0.850 | 0.820 | atTGATCtct V$GATA1_06 | 9732 (+) | 1.000 | 0.904 | tatGATAAtg V$GATA1_06 | 9833 (+) | 1.000 | 0.887 | aggGATAAtt V$GATA1_06 | 11348 (-) | 1.000 | 0.907 | caTTATCgaa V$GATA1_06 | 11441 (-) | 1.000 | 0.958 | acTTATCtat V$GATA1_06 | 11569 (-) | 1.000 | 0.948 | tcTTATCatt V$GATA2_02 | 82 (+) | 0.833 | 0.815 | gttGATAGgg V$GATA2_02 | 91 (-) | 0.833 | 0.826 | gcCTATCttc V$GATA2_02 | 153 (-) | 0.813 | 0.813 | gcTCATCtct V$GATA2_02 | 307 (-) | 0.833 | 0.821 | agCTATCtct V$GATA2_02 | 566 (+) | 1.000 | 0.961 | ggaGATAAac V$GATA2_02 | 952 (+) | 0.819 | 0.799 | aatGATCAaa V$GATA2_02 | 1065 (+) | 0.785 | 0.794 | taaAATAAga V$GATA2_02 | 1258 (-) | 0.813 | 0.795 | tgTCATCatc V$GATA2_02 | 1649 (-) | 0.813 | 0.807 | gcTAATCtac V$GATA2_02 | 1699 (+) | 0.813 | 0.792 | gtaGATGAag V$GATA2_02 | 1753 (-) | 0.819 | 0.819 | acTGATCtat V$GATA2_02 | 1995 (-) | 1.000 | 0.954 | gaTTATCtgg V$GATA2_02 | 2181 (-) | 0.785 | 0.796 | tcTTTTCttc V$GATA2_02 | 2186 (-) | 0.785 | 0.801 | tcTTCTCtct V$GATA2_02 | 2663 (+) | 0.813 | 0.793 | tttGATGAca V$GATA2_02 | 2722 (+) | 1.000 | 0.961 | ataGATAAtc V$GATA2_02 | 2751 (-) | 0.785 | 0.801 | tcTTTTCtct V$GATA2_02 | 2799 (-) | 0.785 | 0.794 | tcTTCTCtta V$GATA2_02 | 2902 (-) | 1.000 | 0.943 | ttTTATCcac V$GATA2_02 | 3178 (-) | 0.845 | 0.801 | ttATATCccc V$GATA2_02 | 3309 (+) | 0.785 | 0.796 | gtaGTTAAga 
V$GATA2_02 | 3374 (-) | 0.813 | 0.794 | ccTAATCata V$GATA2_02 | 3895 (-) | 0.813 | 0.795 | ccTCATCaca V$GATA2_02 | 3930 (-) | 1.000 | 0.950 | caTTATCagt V$GATA2_02 | 4002 (+) | 1.000 | 0.944 | aacGATAAat V$GATA2_02 | 4357 (+) | 0.819 | 0.796 | gagGATCAga V$GATA2_02 | 4374 (-) | 1.000 | 0.967 | agTTATCtgg V$GATA2_02 | 4577 (+) | 0.845 | 0.819 | ttaGATATag V$GATA2_02 | 4779 (+) | 1.000 | 0.931 | ctgGATAAag V$GATA2_02 | 4801 (-) | 0.813 | 0.793 | caTAATCtct V$GATA2_02 | 4928 (+) | 0.819 | 0.798 | acaGATCAtg V$GATA2_02 | 5069 (+) | 0.813 | 0.828 | agaGATGAga V$GATA2_02 | 5236 (+) | 0.813 | 0.793 | aacGATTAca V$GATA2_02 | 5407 (-) | 0.813 | 0.801 | ttTCATCtta V$GATA2_02 | 5527 (-) | 0.833 | 0.807 | tcCTATCcag V$GATA2_02 | 5582 (+) | 1.000 | 0.944 | aagGATAAta V$GATA2_02 | 5633 (+) | 1.000 | 0.931 | tggGATAAag V$GATA2_02 | 5655 (+) | 0.785 | 0.796 | gaaGAAAAga V$GATA2_02 | 5977 (+) | 1.000 | 0.954 | ggtGATAAct V$GATA2_02 | 6309 (-) | 0.819 | 0.814 | acTGATCtac V$GATA2_02 | 6465 (+) | 1.000 | 0.959 | cctGATAAaa V$GATA2_02 | 6550 (+) | 0.785 | 0.794 | tgaGGTAAga V$GATA2_02 | 6659 (+) | 0.813 | 0.797 | acaGATGAag V$GATA2_02 | 6892 (+) | 0.785 | 0.794 | taaGACAAga V$GATA2_02 | 6947 (+) | 0.833 | 0.797 | agtGATACtg V$GATA2_02 | 7152 (-) | 0.833 | 0.830 | ccGTATCtcc V$GATA2_02 | 7274 (+) | 0.785 | 0.794 | taaAATAAga V$GATA2_02 | 7310 (+) | 0.845 | 0.795 | aagGATATag V$GATA2_02 | 7375 (+) | 0.845 | 0.807 | attGATATtg V$GATA2_02 | 7788 (-) | 1.000 | 0.923 | aaTTATCcca V$GATA2_02 | 7797 (+) | 1.000 | 0.969 | aaaGATAAag V$GATA2_02 | 7902 (+) | 0.845 | 0.810 | tttGATATta V$GATA2_02 | 7922 (-) | 0.833 | 0.805 | agGTATCaat V$GATA2_02 | 7951 (+) | 0.833 | 0.812 | tgaGATACcc V$GATA2_02 | 8062 (-) | 1.000 | 0.959 | gtTTATCtgg V$GATA2_02 | 8381 (-) | 0.819 | 0.813 | ttTGATCtgt V$GATA2_02 | 8464 (-) | 0.813 | 0.820 | tcTCATCtaa V$GATA2_02 | 8604 (-) | 1.000 | 0.961 | taTTATCatt V$GATA2_02 | 9367 (-) | 0.833 | 0.794 | ctCTATCaca V$GATA2_02 | 9719 (-) | 0.819 | 0.800 | atTGATCtct V$GATA2_02 
| 9732 (+) | 1.000 | 0.943 | tatGATAAtg V$GATA2_02 | 9833 (+) | 1.000 | 0.931 | aggGATAAtt V$GATA2_02 | 9980 (+) | 0.819 | 0.796 | gggGATCAga V$GATA2_02 | 10131 (+) | 0.785 | 0.794 | ttaGTTAAga V$GATA2_02 | 10286 (-) | 0.813 | 0.806 | tcTCATCagg V$GATA2_02 | 10454 (-) | 0.845 | 0.817 | ttATATCaac V$GATA2_02 | 10488 (-) | 0.845 | 0.821 | acATATCaag V$GATA2_02 | 10594 (-) | 0.845 | 0.804 | ctATATCaaa V$GATA2_02 | 11000 (+) | 0.813 | 0.793 | tcaGATTAcc V$GATA2_02 | 11064 (+) | 0.819 | 0.797 | tgtGATCAgt V$GATA2_02 | 11064 (-) | 0.819 | 0.806 | tgTGATCagt V$GATA2_02 | 11348 (-) | 1.000 | 0.935 | caTTATCgaa V$GATA2_02 | 11369 (-) | 0.833 | 0.826 | ttCTATCtgt V$GATA2_02 | 11441 (-) | 1.000 | 0.985 | acTTATCtat V$GATA2_02 | 11509 (+) | 0.845 | 0.796 | gtgGATATta V$GATA2_02 | 11569 (-) | 1.000 | 0.985 | tcTTATCatt V$GATA2_02 | 11581 (-) | 0.845 | 0.799 | ttATATCcag V$GATA2_02 | 11774 (+) | 0.833 | 0.802 | ccaGATACtt V$GATA2_03 | 2902 (-) | 1.000 | 0.912 | ttTTATCcac V$GATA2_03 | 3178 (-) | 0.954 | 0.905 | ttATATCccc V$GATA2_03 | 5069 (+) | 0.871 | 0.908 | agaGATGAga V$GATA2_03 | 5582 (+) | 1.000 | 0.931 | aagGATAAta V$GATA2_03 | 6465 (+) | 1.000 | 0.944 | cctGATAAaa V$GATA2_03 | 8381 (-) | 0.908 | 0.920 | ttTGATCtgt V$GATA2_03 | 8604 (-) | 1.000 | 0.943 | taTTATCatt V$GATA2_03 | 11369 (-) | 0.942 | 0.945 | ttCTATCtgt V$GATA2_03 | 11569 (-) | 1.000 | 0.947 | tcTTATCatt V$GATA3_02 | 566 (+) | 1.000 | 0.950 | ggaGATAAac V$GATA3_02 | 1995 (-) | 1.000 | 0.931 | gaTTATCtgg V$GATA3_02 | 2722 (+) | 1.000 | 0.936 | ataGATAAtc V$GATA3_02 | 2902 (-) | 1.000 | 0.933 | ttTTATCcac V$GATA3_02 | 3930 (-) | 1.000 | 0.916 | caTTATCagt V$GATA3_02 | 4002 (+) | 1.000 | 0.919 | aacGATAAat V$GATA3_02 | 4374 (-) | 1.000 | 0.937 | agTTATCtgg V$GATA3_02 | 5069 (+) | 0.917 | 0.926 | agaGATGAga V$GATA3_02 | 5407 (-) | 0.917 | 0.902 | ttTCATCtta V$GATA3_02 | 5582 (+) | 1.000 | 0.949 | aagGATAAta V$GATA3_02 | 5633 (+) | 1.000 | 0.919 | tggGATAAag V$GATA3_02 | 5977 (+) | 1.000 | 0.928 | ggtGATAAct 
V$GATA3_02 | 6465 (+) | 1.000 | 0.942 | cctGATAAaa V$GATA3_02 | 7788 (-) | 1.000 | 0.921 | aaTTATCcca V$GATA3_02 | 7797 (+) | 1.000 | 0.949 | aaaGATAAag V$GATA3_02 | 8062 (-) | 1.000 | 0.929 | gtTTATCtgg V$GATA3_02 | 8381 (-) | 0.928 | 0.914 | ttTGATCtgt V$GATA3_02 | 8604 (-) | 1.000 | 0.956 | taTTATCatt V$GATA3_02 | 9719 (-) | 0.928 | 0.899 | atTGATCtct V$GATA3_02 | 9732 (+) | 1.000 | 0.914 | tatGATAAtg V$GATA3_02 | 9833 (+) | 1.000 | 0.928 | aggGATAAtt V$GATA3_02 | 11348 (-) | 1.000 | 0.905 | caTTATCgaa V$GATA3_02 | 11441 (-) | 1.000 | 0.943 | acTTATCtat V$GATA3_02 | 11569 (-) | 1.000 | 0.957 | tcTTATCatt V$GATA3_03 | 43 (+) | 0.955 | 0.900 | tcaGATGTca V$GATA3_03 | 242 (-) | 0.968 | 0.926 | ttAAATCtgg V$GATA3_03 | 706 (-) | 0.915 | 0.913 | tcGGATCtct V$GATA3_03 | 952 (+) | 0.977 | 0.926 | aatGATCAaa V$GATA3_03 | 4119 (+) | 0.955 | 0.927 | ccaGATGTta V$GATA3_03 | 5069 (+) | 0.932 | 0.926 | agaGATGAga V$GATA3_03 | 5407 (-) | 0.932 | 0.903 | ttTCATCtta V$GATA3_03 | 5417 (-) | 0.968 | 0.964 | ttAAATCttt V$GATA3_03 | 5582 (+) | 0.958 | 0.900 | aagGATAAta V$GATA3_03 | 6665 (+) | 0.968 | 0.944 | gaaGATTTta V$GATA3_03 | 6782 (+) | 1.000 | 0.956 | gcaGATCTaa V$GATA3_03 | 7902 (+) | 0.981 | 0.905 | tttGATATta V$GATA3_03 | 8381 (-) | 0.977 | 0.968 | ttTGATCtgt V$GATA3_03 | 8604 (-) | 0.958 | 0.922 | taTTATCatt V$GATA3_03 | 10495 (+) | 0.977 | 0.904 | aagGATCAaa V$GATA3_03 | 11026 (+) | 0.915 | 0.931 | agaGATCCta V$GATA3_03 | 11064 (-) | 0.977 | 0.906 | tgTGATCagt V$GATA3_03 | 11116 (-) | 1.000 | 0.912 | tcAGATCcct V$GATA3_03 | 11171 (+) | 1.000 | 0.915 | agaGATCTtt V$GATA3_03 | 11369 (-) | 0.896 | 0.905 | ttCTATCtgt V$GATA3_03 | 11569 (-) | 0.958 | 0.904 | tcTTATCatt P$DOF1_01 | 7799 (+) | 1.000 | 0.988 | agaTAAAGaac P$DOF1_01 | 8936 (+) | 1.000 | 0.998 | aacTAAAGcaa P$DOF2_01 | 1038 (+) | 1.000 | 0.985 | caacAAAGCag P$DOF2_01 | 2398 (-) | 1.000 | 0.984 | ttGCTTTcctt P$DOF2_01 | 3910 (+) | 1.000 | 0.996 | taatAAAGCtt P$DOF2_01 | 3915 (-) | 1.000 | 0.996 | aaGCTTTatga 
P$DOF2_01 | 6677 (-) | 1.000 | 0.990 | taGCTTTtgaa P$DOF2_01 | 6708 (+) | 1.000 | 0.989 | gagaAAAGCca P$DOF2_01 | 7652 (-) | 1.000 | 0.990 | atGCTTTtgta P$DOF2_01 | 7685 (+) | 1.000 | 0.997 | ctgtAAAGCca P$DOF2_01 | 7846 (+) | 1.000 | 0.988 | ggcaAAAGCga P$DOF2_01 | 7868 (+) | 1.000 | 0.991 | tagaAAAGCtc P$DOF2_01 | 7910 (-) | 1.000 | 0.993 | taGCTTTtact P$DOF2_01 | 8628 (+) | 1.000 | 0.987 | gagaAAAGCtt P$DOF2_01 | 8936 (+) | 1.000 | 0.998 | aactAAAGCaa P$DOF2_01 | 9102 (+) | 1.000 | 0.991 | tacaAAAGCaa P$DOF2_01 | 10729 (-) | 1.000 | 0.994 | taGCTTTtatt P$DOF2_01 | 11009 (-) | 1.000 | 0.997 | ctGCTTTaggg P$DOF3_01 | 2027 (+) | 0.992 | 0.988 | gattAAAGGag P$DOF3_01 | 2199 (-) | 0.992 | 0.987 | ttTCTTTcttc P$DOF3_01 | 2271 (-) | 0.992 | 0.987 | ttTCTTTcttc P$DOF3_01 | 2398 (-) | 1.000 | 0.990 | ttGCTTTcctt P$DOF3_01 | 2506 (+) | 0.992 | 0.986 | aaaaAAAGGga P$DOF3_01 | 2749 (-) | 0.992 | 0.992 | ttTCTTTtctc P$DOF3_01 | 5518 (-) | 0.992 | 0.988 | tgCCTTTcctc P$DOF3_01 | 5940 (+) | 0.994 | 0.994 | gagaAAAGTaa P$DOF3_01 | 6426 (-) | 1.000 | 0.995 | tgGCTTTcatc P$DOF3_01 | 6708 (+) | 1.000 | 0.999 | gagaAAAGCca P$DOF3_01 | 7086 (-) | 1.000 | 0.989 | gtGCTTTgcgc P$DOF3_01 | 7652 (-) | 1.000 | 0.989 | atGCTTTtgta P$DOF3_01 | 7846 (+) | 1.000 | 0.990 | ggcaAAAGCga P$DOF3_01 | 7868 (+) | 1.000 | 0.988 | tagaAAAGCtc P$DOF3_01 | 8133 (-) | 1.000 | 0.987 | ctGCTTTctgc P$DOF3_01 | 8377 (-) | 0.992 | 0.987 | gcTCTTTgatc P$DOF3_01 | 8628 (+) | 1.000 | 0.994 | gagaAAAGCtt P$DOF3_01 | 8936 (+) | 1.000 | 0.993 | aactAAAGCaa P$DOF3_01 | 9102 (+) | 1.000 | 0.992 | tacaAAAGCaa P$DOF3_01 | 10576 (-) | 1.000 | 0.990 | ttGCTTTggtt P$DOF3_01 | 10729 (-) | 1.000 | 0.991 | taGCTTTtatt P$CDC5_01 | 794 (-) | 1.000 | 0.982 | accGCTGAgcc P$CDC5_01 | 4973 (+) | 1.000 | 0.869 | ggcTCAGCccc P$EMBP1_Q2 | 735 (+) | 1.000 | 0.929 | acCACGTggt P$EMBP1_Q2 | 735 (-) | 1.000 | 0.929 | accACGTGgt P$EMBP1_Q2 | 6003 (-) | 1.000 | 0.906 | ggtACGTGtc P$TAF1_Q2 | 735 (+) | 1.000 | 0.983 | accACGTGgt P$TAF1_Q2 | 
735 (-) | 1.000 | 0.983 | acCACGTggt P$CPRF2_Q2 | 735 (+) | 1.000 | 0.998 | acCACGTggt P$CPRF2_Q2 | 735 (-) | 1.000 | 0.998 | accACGTGgt P$O2_02 | 434 (+) | 1.000 | 0.959 | acCACGTtta P$O2_02 | 434 (-) | 0.919 | 0.896 | accACGTTta P$O2_02 | 571 (+) | 0.919 | 0.879 | taAACGTcta P$O2_02 | 571 (-) | 0.946 | 0.890 | taaACGTCta P$O2_02 | 735 (+) | 1.000 | 0.990 | acCACGTggt P$O2_02 | 735 (-) | 1.000 | 0.990 | accACGTGgt P$O2_02 | 1126 (+) | 1.000 | 0.949 | tgCACGTgtg P$O2_02 | 1126 (-) | 1.000 | 0.944 | tgcACGTGtg P$O2_02 | 1460 (+) | 0.946 | 0.872 | cgGACGTttc P$O2_02 | 1460 (-) | 0.919 | 0.861 | cggACGTTtc P$O2_02 | 6003 (+) | 0.924 | 0.874 | ggTACGTgtc P$O2_02 | 6003 (-) | 1.000 | 0.936 | ggtACGTGtc P$O2_02 | 6064 (+) | 1.000 | 0.941 | tgCACGTgct P$O2_02 | 6064 (-) | 1.000 | 0.936 | tgcACGTGct P$O2_02 | 6856 (+) | 1.000 | 0.928 | agCACGTcca P$O2_02 | 6856 (-) | 0.946 | 0.892 | agcACGTCca P$O2_02 | 6886 (+) | 1.000 | 0.977 | acCACGTaag P$O2_02 | 6886 (-) | 0.924 | 0.901 | accACGTAag P$O2_02 | 7322 (+) | 0.924 | 0.872 | tgTACGTctg P$O2_02 | 7322 (-) | 0.946 | 0.887 | tgtACGTCtg P$O2_02 | 7486 (+) | 1.000 | 0.934 | cgCACGTctg P$O2_02 | 7486 (-) | 0.946 | 0.892 | cgcACGTCtg P$O2_02 | 8730 (+) | 0.919 | 0.891 | gtAACGTcgg P$O2_02 | 8730 (-) | 0.946 | 0.916 | gtaACGTCgg P$O2_02 | 8891 (+) | 0.919 | 0.860 | caAACGTacc P$O2_02 | 10442 (+) | 0.919 | 0.898 | gcAACGTctc P$O2_02 | 10442 (-) | 0.946 | 0.895 | gcaACGTCtc P$O2_02 | 11057 (+) | 0.919 | 0.884 | ttAACGTtgt P$O2_02 | 11057 (-) | 0.919 | 0.893 | ttaACGTTgt P$TGA1A_Q2 | 1126 (+) | 1.000 | 0.981 | tgCACGTgtg P$TGA1A_Q2 | 1126 (-) | 1.000 | 0.981 | tgcACGTGtg P$TGA1A_Q2 | 7486 (+) | 1.000 | 0.986 | cgCACGTctg P$TGA1A_Q2 | 7486 (-) | 0.992 | 0.980 | cgcACGTCtg V$PAX4_04 | 28 (-) | 0.797 | 0.773 | tgtgtgtgaacaacttcagatgtcaTTACT V$PAX4_04 | 534 (-) | 0.943 | 0.778 | tgggtgatgcgcttgtgttttaaccTTTTC V$PAX4_04 | 1510 (-) | 0.936 | 0.780 | ggagggagggaggtagagctgttttTTTGT V$PAX4_04 | 1878 (-) | 0.936 | 0.768 | 
gatctgcctcccattttatatttatTTTAT V$PAX4_04 | 2519 (+) | 1.000 | 0.765 | AAAAAttacatattacttttagagaataac V$PAX4_04 | 4430 (+) | 1.000 | 0.776 | AAAAAaaaaaaaccacacaattaatattca V$PAX4_04 | 4434 (+) | 1.000 | 0.818 | AAAAAaaaccacacaattaatattcatttc V$PAX4_04 | 4493 (+) | 0.936 | 0.846 | ACAAAaaacaaaacaaaacaacatcagacc V$PAX4_04 | 5082 (-) | 0.936 | 0.768 | tggctctgactggtttgggttattaTTTGT V$PAX4_04 | 5805 (+) | 1.000 | 0.799 | AAAAAaaaaatagccctggcgtggcactgt V$PAX4_04 | 7030 (-) | 1.000 | 0.779 | cagctgaagagttagttgactatttTTTTT V$PAX4_04 | 7785 (+) | 1.000 | 0.816 | AAAAAttatcccaaagataaagaacaacga V$PAX4_04 | 7923 (-) | 0.887 | 0.799 | ggtatcaattcgtttgtgtgtttagTTTTG V$PAX4_04 | 8856 (+) | 1.000 | 0.835 | AAAAAttaagactctggcatagaatattct V$PAX4_04 | 9416 (+) | 0.943 | 0.767 | GAAAActatcccataaataacaggaagccc V$PAX4_04 | 9461 (-) | 0.936 | 0.764 | ctgctgtacaggatgttctagctacTTTAT V$PAX4_04 | 9703 (-) | 0.936 | 0.784 | gtggtggtggtgtaaaattgatctcTTTGT V$PAX4_04 | 10243 (-) | 1.000 | 0.805 | gtggtcaggcttgtgcaggaagtgaTTTTT V$PAX4_04 | 11187 (-) | 0.936 | 0.817 | gccttgtttcttgattgattattaaTTTGT V$PAX4_04 | 11295 (+) | 0.950 | 0.779 | AGAAAtaataataacgactacggtgacgac V$PAX4_04 | 11374 (-) | 0.936 | 0.771 | tctgtgctttgaaaacttattaaatTTTGT V$PAX4_04 | 11452 (-) | 0.809 | 0.779 | ttggtgaatcagtgaaattttttatTTGTT V$PAX4_04 | 11677 (+) | 0.823 | 0.774 | CCAAAaaagaccaataaagttaatcactgt F$FACBCA_Q2 | 362 (+) | 0.853 | 0.899 | tcaaatcctttctctagGTGAGagat F$FACBCA_Q2 | 362 (-) | 0.989 | 0.849 | tcaaATCCTttctctaggtgagagat F$FACBCA_Q2 | 952 (+) | 0.930 | 0.810 | aatgatcaaatatttacAGGACccct F$FACBCA_Q2 | 952 (-) | 0.808 | 0.845 | aatgATCAAatatttacaggacccct F$FACBCA_Q2 | 1272 (-) | 0.806 | 0.802 | tgtaCTCTGggatgacagggacctgg F$FACBCA_Q2 | 1473 (-) | 0.691 | 0.797 | atatTTGCTctggcagagggaggggg F$FACBCA_Q2 | 1718 (+) | 0.786 | 0.804 | ccttttccgggctcatcCTGAActtc F$FACBCA_Q2 | 1718 (-) | 0.933 | 0.809 | ccttTTCCGggctcatcctgaacttc F$FACBCA_Q2 | 1737 (+) | 0.830 | 0.874 | gaacttcctatagaacaCTGATctat F$FACBCA_Q2 | 2016 (-) | 
0.677 | 0.813 | ctgaGTGCTgggattaaaggagtgta F$FACBCA_Q2 | 2419 (+) | 1.000 | 0.827 | aaaaatcacaggggttgGGGATttag F$FACBCA_Q2 | 2419 (-) | 0.853 | 0.873 | aaaaATCACaggggttggggatttag F$FACBCA_Q2 | 2496 (+) | 0.956 | 0.858 | tcagctccgaaaaaaaaGGGAAaaaa F$FACBCA_Q2 | 2496 (-) | 0.977 | 0.891 | tcagCTCCGaaaaaaaagggaaaaaa F$FACBCA_Q2 | 2902 (+) | 0.896 | 0.879 | ttttatccacatctctgTGGACggac F$FACBCA_Q2 | 2902 (-) | 0.955 | 0.878 | ttttATCCAcatctctgtggacggac F$FACBCA_Q2 | 2992 (+) | 0.724 | 0.796 | tccgctcctccaaggtgCGCAGgcgt F$FACBCA_Q2 | 3031 (+) | 0.941 | 0.936 | cgacttccgctccacccGGGACatac F$FACBCA_Q2 | 3031 (-) | 0.933 | 0.857 | cgacTTCCGctccacccgggacatac F$FACBCA_Q2 | 3374 (+) | 0.830 | 0.803 | cctaatcatatggggccCTGAGgatc F$FACBCA_Q2 | 3986 (+) | 0.842 | 0.793 | caccatctcatgaaacaACGATaaat F$FACBCA_Q2 | 4227 (-) | 0.699 | 0.826 | ataaATTCTcagggctggggattgta F$FACBCA_Q2 | 4742 (+) | 0.665 | 0.800 | gatggtcctctcccctcTGTAGgtcc F$FACBCA_Q2 | 6256 (+) | 0.773 | 0.853 | atttgtcccatctgccgAAGAAggaa F$FACBCA_Q2 | 6256 (-) | 0.941 | 0.809 | atttGTCCCatctgccgaagaaggaa F$FACBCA_Q2 | 6355 (-) | 0.830 | 0.819 | aaagCTCAGggtactaagcgagattt F$FACBCA_Q2 | 6753 (+) | 0.798 | 0.842 | cactatccctttataccATGAAcaaa F$FACBCA_Q2 | 6959 (-) | 0.699 | 0.847 | cattATTCTggatggaaaggaacgac F$FACBCA_Q2 | 8228 (+) | 0.798 | 0.817 | ttggctcccacacccagATGAAggtg F$FACBCA_Q2 | 8228 (-) | 1.000 | 0.847 | ttggCTCCCacacccagatgaaggtg F$FACBCA_Q2 | 9453 (+) | 0.989 | 0.822 | accaatctctgctgtacAGGATgttc F$FACBCA_Q2 | 9453 (-) | 0.829 | 0.883 | accaATCTCtgctgtacaggatgttc F$FACBCA_Q2 | 9664 (+) | 0.955 | 0.788 | ccttgtctcaaaagaagTGGAGgggg F$FACBCA_Q2 | 9664 (-) | 0.770 | 0.842 | ccttGTCTCaaaagaagtggaggggg F$FACBCA_Q2 | 9969 (+) | 0.762 | 0.837 | ctggttcctatggggatCAGAAgagg F$FACBCA_Q2 | 9969 (-) | 0.945 | 0.839 | ctggTTCCTatggggatcagaagagg F$FACBCA_Q2 | 11531 (+) | 0.783 | 0.802 | gcaaatcccacatatggTAGATgggt F$FACBCA_Q2 | 11531 (-) | 1.000 | 0.798 | gcaaATCCCacatatggtagatgggt F$FACBCA_Q2 | 11737 (-) | 0.853 | 0.819 | 
acatCTCACcaaatgactagaaagag F$FACBALL_Q2 | 190 (-) | 0.822 | 0.821 | gtgcctcccacaccaccaTGCCTggt F$FACBALL_Q2 | 283 (+) | 0.920 | 0.805 | gcaAGTCAagtgttttacagactgag F$FACBALL_Q2 | 309 (+) | 0.882 | 0.785 | ctaTCTCTcctagtcctgtgctggaa F$FACBALL_Q2 | 362 (+) | 0.992 | 0.870 | tcaAATCCtttctctaggtgagagat F$FACBALL_Q2 | 362 (-) | 0.939 | 0.884 | tcaaatcctttctctaggTGAGAgat F$FACBALL_Q2 | 952 (+) | 0.935 | 0.877 | aatGATCAaatatttacaggacccct F$FACBALL_Q2 | 952 (-) | 0.976 | 0.860 | aatgatcaaatatttacaGGACCcct F$FACBALL_Q2 | 1272 (+) | 0.874 | 0.830 | tgtACTCTgggatgacagggacctgg F$FACBALL_Q2 | 1272 (-) | 0.976 | 0.795 | tgtactctgggatgacagGGACCtgg F$FACBALL_Q2 | 1322 (+) | 0.871 | 0.818 | tccCTTCTcactccccagggcttaag F$FACBALL_Q2 | 1428 (-) | 0.928 | 0.793 | cgcaatcagtggcaatagTGAAGtca F$FACBALL_Q2 | 1672 (-) | 0.781 | 0.791 | attcctccttcgggtgtcAGCTCagg F$FACBALL_Q2 | 1683 (+) | 0.928 | 0.806 | gggTGTCAgctcaggggtagatgaag F$FACBALL_Q2 | 1709 (+) | 0.871 | 0.843 | cacCCTCTgccttttccgggctcatc F$FACBALL_Q2 | 1718 (+) | 0.996 | 0.840 | cctTTTCCgggctcatcctgaacttc F$FACBALL_Q2 | 1718 (-) | 0.931 | 0.875 | ccttttccgggctcatccTGAACttc F$FACBALL_Q2 | 1737 (+) | 0.985 | 0.853 | gaaCTTCCtatagaacactgatctat F$FACBALL_Q2 | 1737 (-) | 0.935 | 0.899 | gaacttcctatagaacacTGATCtat F$FACBALL_Q2 | 1926 (+) | 0.871 | 0.803 | gtaTGTCTatgtagccttggctaggc F$FACBALL_Q2 | 1985 (-) | 0.903 | 0.792 | aactcaccaagattatctGGCTAtac F$FACBALL_Q2 | 2016 (-) | 0.988 | 0.803 | ctgagtgctgggattaaaGGAGTgta F$FACBALL_Q2 | 2419 (+) | 0.935 | 0.873 | aaaAATCAcaggggttggggatttag F$FACBALL_Q2 | 2419 (-) | 0.992 | 0.875 | aaaaatcacaggggttggGGATTtag F$FACBALL_Q2 | 2496 (+) | 0.988 | 0.868 | tcaGCTCCgaaaaaaaagggaaaaaa F$FACBALL_Q2 | 2496 (-) | 0.996 | 0.914 | tcagctccgaaaaaaaagGGAAAaaa F$FACBALL_Q2 | 2902 (+) | 1.000 | 0.906 | tttTATCCacatctctgtggacggac F$FACBALL_Q2 | 2902 (-) | 0.974 | 0.864 | ttttatccacatctctgtGGACGgac F$FACBALL_Q2 | 2987 (+) | 0.985 | 0.830 | ctaCTTCCgctcctccaaggtgcgca F$FACBALL_Q2 | 2995 (+) | 0.985 | 0.856 | 
gctCCTCCaaggtgcgcaggcgtctc F$FACBALL_Q2 | 2995 (-) | 0.891 | 0.841 | gctcctccaaggtgcgcaGGCGTctc F$FACBALL_Q2 | 3003 (+) | 0.805 | 0.794 | aagGTGCGcaggcgtctcggcagtac F$FACBALL_Q2 | 3031 (+) | 0.985 | 0.961 | cgaCTTCCgctccacccgggacatac F$FACBALL_Q2 | 3031 (-) | 0.985 | 0.866 | cgacttccgctccacccgGGACAtac F$FACBALL_Q2 | 3040 (-) | 0.842 | 0.793 | ctccacccgggacataccTGCGAgtg F$FACBALL_Q2 | 3270 (+) | 0.891 | 0.840 | gtgATGCCattatgattgggatgagg F$FACBALL_Q2 | 3270 (-) | 0.989 | 0.857 | gtgatgccattatgattgGGATGagg F$FACBALL_Q2 | 3349 (+) | 0.932 | 0.793 | gttCATCAgacagaccactgaatagc F$FACBALL_Q2 | 3349 (-) | 0.931 | 0.801 | gttcatcagacagaccacTGAATagc F$FACBALL_Q2 | 3374 (+) | 0.935 | 0.857 | cctAATCAtatggggccctgaggatc F$FACBALL_Q2 | 3374 (-) | 0.928 | 0.796 | cctaatcatatggggcccTGAGGatc F$FACBALL_Q2 | 3502 (-) | 0.985 | 0.809 | aactaccctgctgttagaGGAGGcca F$FACBALL_Q2 | 3946 (-) | 0.931 | 0.794 | ttcattcagccatcctacTGAGCact F$FACBALL_Q2 | 3986 (+) | 0.875 | 0.805 | cacCATCTcatgaaacaacgataaat F$FACBALL_Q2 | 3986 (-) | 0.915 | 0.795 | caccatctcatgaaacaaCGATAaat F$FACBALL_Q2 | 4039 (-) | 0.899 | 0.789 | ttccagcagggggaaataGGCAAtaa F$FACBALL_Q2 | 4334 (+) | 0.989 | 0.846 | tgaCATCCaggtggtggaggcaggag F$FACBALL_Q2 | 4334 (-) | 0.888 | 0.816 | tgacatccaggtggtggaGGCAGgag F$FACBALL_Q2 | 4392 (-) | 0.774 | 0.789 | tgagttccaagacaggtcAGCCAcaa F$FACBALL_Q2 | 4584 (-) | 0.809 | 0.789 | tagtctcctcctcctgccCGCTTgct F$FACBALL_Q2 | 4807 (+) | 0.831 | 0.804 | ctcTGGCAtttgttaccaggcggggt F$FACBALL_Q2 | 4867 (+) | 0.835 | 0.838 | ggtCAGCAgcaaagctctggaacatt F$FACBALL_Q2 | 4974 (+) | 0.892 | 0.790 | gctCAGCCcctacctgtccgaatggc F$FACBALL_Q2 | 4974 (-) | 0.902 | 0.792 | gctcagcccctacctgtcCGAATggc F$FACBALL_Q2 | 5049 (-) | 0.939 | 0.785 | tgtggtctaagcaggtagTGAGAgat F$FACBALL_Q2 | 5256 (+) | 0.920 | 0.785 | tgtGGTCAcagtcatcctgggccaac F$FACBALL_Q2 | 5500 (+) | 0.988 | 0.820 | tctGCTCCcaggtttccctgcctttc F$FACBALL_Q2 | 5500 (-) | 0.822 | 0.821 | tctgctcccaggtttcccTGCCTttc F$FACBALL_Q2 | 5921 (+) | 0.806 | 0.852 
| cccCAGCGagactcccttggagaaaa F$FACBALL_Q2 | 6055 (-) | 0.831 | 0.787 | tgacagccctgcacgtgcTGCCAcag F$FACBALL_Q2 | 6069 (+) | 0.888 | 0.784 | gtgCTGCCacagtctctgtgagttcc F$FACBALL_Q2 | 6069 (-) | 0.931 | 0.821 | gtgctgccacagtctctgTGAGTtcc F$FACBALL_Q2 | 6128 (+) | 0.911 | 0.788 | tttTCTCGgcgtcttccatgcctctg F$FACBALL_Q2 | 6160 (+) | 0.985 | 0.819 | ctgCCTCCtcttctgcagggttccca F$FACBALL_Q2 | 6256 (+) | 0.985 | 0.859 | attTGTCCcatctgccgaagaaggaa F$FACBALL_Q2 | 6256 (-) | 0.871 | 0.846 | atttgtcccatctgccgaAGAAGgaa F$FACBALL_Q2 | 6277 (+) | 0.894 | 0.797 | aggAAGCCtctctgatgatgactgag F$FACBALL_Q2 | 6277 (-) | 0.920 | 0.804 | aggaagcctctctgatgaTGACTgag F$FACBALL_Q2 | 6355 (+) | 0.931 | 0.822 | aaaGCTCAgggtactaagcgagattt F$FACBALL_Q2 | 6355 (-) | 0.911 | 0.838 | aaagctcagggtactaagCGAGAttt F$FACBALL_Q2 | 6434 (+) | 0.920 | 0.810 | atcGGTCAacccagtcatggcaaact F$FACBALL_Q2 | 6753 (+) | 1.000 | 0.866 | cacTATCCctttataccatgaacaaa F$FACBALL_Q2 | 6753 (-) | 0.931 | 0.863 | cactatccctttataccaTGAACaaa F$FACBALL_Q2 | 6816 (+) | 0.996 | 0.840 | cctTCTCCcttttttatgtgctattt F$FACBALL_Q2 | 6816 (-) | 0.846 | 0.814 | ccttctcccttttttatgTGCTAttt F$FACBALL_Q2 | 6862 (-) | 0.985 | 0.821 | tccagaccgaggcttataGGAAGcac F$FACBALL_Q2 | 6881 (-) | 0.871 | 0.801 | gaagcaccacgtaagacaAGAGGgca F$FACBALL_Q2 | 6959 (-) | 0.988 | 0.823 | cattattctggatggaaaGGAACgac F$FACBALL_Q2 | 7016 (-) | 0.928 | 0.784 | gatgtttcctgcatcagcTGAAGagt F$FACBALL_Q2 | 7294 (+) | 0.781 | 0.804 | accAAGCTgagagagtaaggatatag F$FACBALL_Q2 | 7294 (-) | 1.000 | 0.793 | accaagctgagagagtaaGGATAtag F$FACBALL_Q2 | 7613 (+) | 0.882 | 0.829 | ccaTCTCTccagccgcctggcggtag F$FACBALL_Q2 | 7729 (-) | 0.903 | 0.797 | tttgggccatcacaggctGGCTAcat F$FACBALL_Q2 | 8025 (+) | 0.928 | 0.841 | gagTGTCAgagcttgctgggccttga F$FACBALL_Q2 | 8025 (-) | 0.879 | 0.808 | gagtgtcagagcttgctgGGCCTtga F$FACBALL_Q2 | 8112 (+) | 0.996 | 0.797 | cacTCTCCcaccctgcctggtctgct F$FACBALL_Q2 | 8157 (-) | 0.777 | 0.803 | tagcttccagaggcaggtAGCACcag F$FACBALL_Q2 | 8228 (+) | 0.988 | 
0.832 | ttgGCTCCcacacccagatgaaggtg F$FACBALL_Q2 | 8228 (-) | 0.928 | 0.896 | ttggctcccacacccagaTGAAGgtg F$FACBALL_Q2 | 8292 (+) | 0.900 | 0.782 | tccCTTCGcatgcccctctgccctgc F$FACBALL_Q2 | 8298 (+) | 0.891 | 0.787 | cgcATGCCcctctgccctgcagcttc F$FACBALL_Q2 | 8306 (-) | 0.834 | 0.793 | cctctgccctgcagcttcTGCGTcag F$FACBALL_Q2 | 8328 (+) | 0.891 | 0.845 | tcaGCGCCagtatttgctggaccact F$FACBALL_Q2 | 8328 (-) | 0.976 | 0.810 | tcagcgccagtatttgctGGACCact F$FACBALL_Q2 | 8365 (+) | 0.892 | 0.784 | gtgCAGCCtgctgctctttgatctgt F$FACBALL_Q2 | 8365 (-) | 0.935 | 0.800 | gtgcagcctgctgctcttTGATCtgt F$FACBALL_Q2 | 8395 (+) | 0.894 | 0.850 | ctgAAGCCaagccagcccggcctgta F$FACBALL_Q2 | 8395 (-) | 0.879 | 0.837 | ctgaagccaagccagcccGGCCTgta F$FACBALL_Q2 | 8616 (+) | 0.935 | 0.783 | gtcAATCActttgagaaaagcttatt F$FACBALL_Q2 | 8706 (-) | 0.874 | 0.800 | taccatcaaggagtctgaAGAACtgt F$FACBALL_Q2 | 8775 (-) | 0.875 | 0.828 | gggaggccgagacaaagaAGATGtag F$FACBALL_Q2 | 8828 (-) | 0.874 | 0.818 | gatgttcaaaaccaaaacAGAACggt F$FACBALL_Q2 | 8992 (+) | 0.988 | 0.813 | ttaATTCCagcacatggaggcagggg F$FACBALL_Q2 | 8992 (-) | 0.888 | 0.837 | ttaattccagcacatggaGGCAGggg F$FACBALL_Q2 | 9041 (+) | 0.892 | 0.827 | gggCAGCCtggtctacatggcaagtt F$FACBALL_Q2 | 9041 (-) | 0.899 | 0.785 | gggcagcctggtctacatGGCAAgtt F$FACBALL_Q2 | 9061 (+) | 0.988 | 0.790 | caaGTTCCaggctagccagggcagca F$FACBALL_Q2 | 9236 (-) | 0.838 | 0.813 | ccagagccctaggagagcTGCTCatg F$FACBALL_Q2 | 9292 (+) | 0.778 | 0.847 | agtCAGCTtgtgtgcccaggagggag F$FACBALL_Q2 | 9329 (-) | 0.928 | 0.797 | gagcctctggaggaccccTGAAGtct F$FACBALL_Q2 | 9378 (+) | 0.781 | 0.829 | ggaGAGCTgtcagcccctggaatgtg F$FACBALL_Q2 | 9420 (-) | 0.985 | 0.805 | actatcccataaataacaGGAAGccc F$FACBALL_Q2 | 9453 (+) | 0.878 | 0.875 | accAATCTctgctgtacaggatgttc F$FACBALL_Q2 | 9453 (-) | 0.989 | 0.834 | accaatctctgctgtacaGGATGttc F$FACBALL_Q2 | 9488 (-) | 0.939 | 0.803 | tatttgcaatagaaaatcTGAAAgtt F$FACBALL_Q2 | 9664 (+) | 0.871 | 0.804 | cctTGTCTcaaaagaagtggaggggg F$FACBALL_Q2 | 9664 (-) | 0.985 
| 0.828 | ccttgtctcaaaagaagtGGAGGggg F$FACBALL_Q2 | 9963 (+) | 0.882 | 0.791 | atgTTTCTggttcctatggggatcag F$FACBALL_Q2 | 9969 (+) | 0.988 | 0.818 | ctgGTTCCtatggggatcagaagagg F$FACBALL_Q2 | 9969 (-) | 0.871 | 0.891 | ctggttcctatggggatcAGAAGagg F$FACBALL_Q2 | 10242 (+) | 0.920 | 0.828 | ggtGGTCAggcttgtgcaggaagtga F$FACBALL_Q2 | 10781 (-) | 0.931 | 0.835 | gtcggacccagggggatgTGAGCctt F$FACBALL_Q2 | 10832 (-) | 0.871 | 0.786 | ttcaggcccatcgcaggaAGAAGcca F$FACBALL_Q2 | 10849 (-) | 0.831 | 0.832 | aagaagccattgtagagaTGCAGcgt F$FACBALL_Q2 | 11116 (+) | 0.992 | 0.800 | tcaGATCCcttaactcgctgcactca F$FACBALL_Q2 | 11116 (-) | 0.834 | 0.814 | tcagatcccttaactcgcTGCACtca F$FACBALL_Q2 | 11348 (-) | 0.935 | 0.797 | cattatcgaacgagagcaTGATTcta F$FACBALL_Q2 | 11479 (+) | 0.996 | 0.805 | gttTTTCCcttggaatgcggttgaat F$FACBALL_Q2 | 11531 (-) | 0.875 | 0.818 | gcaaatcccacatatggtAGATGggt F$FACBALL_Q2 | 11666 (-) | 0.863 | 0.802 | aattttcataaccaaaaaAGACCaat F$FACBALL_Q2 | 11737 (+) | 0.939 | 0.782 | acaTCTCAccaaatgactagaaagag F$FACBALL_Q2 | 11737 (-) | 0.882 | 0.807 | acatctcaccaaatgactAGAAAgag F$FACBALL_Q2 | 11890 (-) | 0.900 | 0.786 | cactctcaaccaatagcaCGAAGgct P$AGL3_01 | 3143 (-) | 1.000 | 0.881 | acaacaataTATGGaggt P$AGL3_01 | 3590 (-) | 0.989 | 0.887 | tctcctatgTTTGGgttt P$AGL3_01 | 3860 (+) | 0.963 | 0.877 | agtaCCATTtaaggagtt P$AGL3_01 | 11534 (+) | 0.940 | 0.916 | aatcCCACAtatggtaga P$AGL3_01 | 11534 (-) | 1.000 | 0.886 | aatcccacaTATGGtaga P$AGL3_02 | 3590 (-) | 0.998 | 0.879 | tctcctatgTTTGGgttt P$AGL3_02 | 11534 (+) | 0.959 | 0.923 | aatcCCACAtatggtaga P$AGL3_02 | 11534 (-) | 1.000 | 0.874 | aatcccacaTATGGtaga V$MSX1_01 | 8 (-) | 0.897 | 0.863 | cATGTAcaa V$MSX1_01 | 14 (-) | 0.901 | 0.866 | cAAGTAcat V$MSX1_01 | 15 (+) | 0.897 | 0.863 | aagTACATg V$MSX1_01 | 21 (+) | 0.873 | 0.847 | atgTGAATg V$MSX1_01 | 51 (+) | 0.901 | 0.891 | catTACTTg V$MSX1_01 | 124 (+) | 0.988 | 0.884 | ggcTAACTg V$MSX1_01 | 373 (+) | 0.849 | 0.863 | ctcTAGGTg V$MSX1_01 | 476 (-) | 0.988 | 0.927 | cACTTAcaa 
V$MSX1_01 | 618 (+) | 0.897 | 0.864 | tcgTACATg V$MSX1_01 | 642 (-) | 0.996 | 0.874 | cATTTAttc V$MSX1_01 | 650 (-) | 0.996 | 0.885 | cATTTAtta V$MSX1_01 | 671 (+) | 0.849 | 0.895 | ctgTAGCTg V$MSX1_01 | 986 (-) | 0.988 | 0.887 | cAGTTAata V$MSX1_01 | 1013 (+) | 0.936 | 0.860 | accTATATg V$MSX1_01 | 1015 (+) | 0.928 | 0.903 | ctaTATGTg V$MSX1_01 | 1019 (+) | 0.877 | 0.850 | atgTGATTg V$MSX1_01 | 1430 (+) | 0.865 | 0.859 | caaTCAGTg V$MSX1_01 | 1430 (-) | 0.877 | 0.883 | cAATCAgtg V$MSX1_01 | 1471 (+) | 0.940 | 0.911 | caaTATTTg V$MSX1_01 | 1795 (-) | 0.873 | 0.912 | cATTCAcag V$MSX1_01 | 1910 (+) | 0.940 | 0.854 | tttTATTTg V$MSX1_01 | 1961 (+) | 0.853 | 0.858 | cctTTTTTg V$MSX1_01 | 2045 (+) | 0.889 | 0.922 | ctgTACCTg V$MSX1_01 | 2068 (+) | 0.806 | 0.865 | ctgTCTCTg V$MSX1_01 | 2074 (+) | 0.806 | 0.865 | ctgTCTCTg V$MSX1_01 | 2331 (+) | 0.928 | 0.854 | tgcTATGTg V$MSX1_01 | 2719 (-) | 0.940 | 0.862 | cAAATAgat V$MSX1_01 | 2789 (+) | 0.849 | 0.856 | cctTAGCTg V$MSX1_01 | 2803 (+) | 0.900 | 0.899 | ctcTTACTg V$MSX1_01 | 2880 (+) | 0.814 | 0.871 | ccgTGTATg V$MSX1_01 | 2928 (+) | 0.849 | 0.856 | cctTAGCTg V$MSX1_01 | 2963 (-) | 0.865 | 0.907 | cAGTGAcgg V$MSX1_01 | 3147 (+) | 0.936 | 0.908 | caaTATATg V$MSX1_01 | 3253 (-) | 0.988 | 0.945 | cACTTAtgg V$MSX1_01 | 3402 (-) | 0.988 | 0.880 | cAGTTAtaa V$MSX1_01 | 3411 (+) | 0.908 | 0.871 | aagTTAATg V$MSX1_01 | 3429 (+) | 1.000 | 0.885 | gctTAATTg V$MSX1_01 | 3448 (-) | 0.988 | 0.887 | cAGTTAata V$MSX1_01 | 3865 (-) | 0.996 | 0.958 | cATTTAagg V$MSX1_01 | 4216 (-) | 0.988 | 0.887 | cAGTTAaaa V$MSX1_01 | 4447 (-) | 1.000 | 0.896 | cAATTAata V$MSX1_01 | 4465 (+) | 0.865 | 0.875 | cacTGACTg V$MSX1_01 | 4465 (-) | 0.865 | 0.906 | cACTGActg V$MSX1_01 | 4857 (-) | 0.865 | 0.875 | cACTGAggg V$MSX1_01 | 4981 (+) | 0.889 | 0.891 | cccTACCTg V$MSX1_01 | 5060 (-) | 0.889 | 0.891 | cAGGTAgtg V$MSX1_01 | 5085 (+) | 0.865 | 0.875 | ctcTGACTg V$MSX1_01 | 5102 (+) | 0.940 | 0.854 | tatTATTTg V$MSX1_01 | 5434 (+) | 0.940 | 0.919 | 
cttTATTTg V$MSX1_01 | 5465 (-) | 0.928 | 0.885 | cACATActa V$MSX1_01 | 5544 (+) | 0.841 | 0.858 | ctcTTTCTg V$MSX1_01 | 5668 (-) | 0.912 | 0.907 | cAATAAgag V$MSX1_01 | 5678 (-) | 0.806 | 0.865 | cAGACAcag V$MSX1_01 | 5704 (-) | 0.988 | 0.896 | cACTTAgga V$MSX1_01 | 5953 (+) | 0.940 | 0.854 | tttTATTTg V$MSX1_01 | 5969 (+) | 0.988 | 0.887 | tttTAAGTg V$MSX1_01 | 6078 (+) | 0.806 | 0.865 | cagTCTCTg V$MSX1_01 | 6194 (-) | 0.996 | 0.882 | cATTTAagc V$MSX1_01 | 6252 (+) | 0.940 | 0.862 | ttcTATTTg V$MSX1_01 | 6321 (+) | 0.865 | 0.875 | cccTGACTg V$MSX1_01 | 6333 (-) | 0.806 | 0.865 | cACACActg V$MSX1_01 | 6445 (-) | 0.865 | 0.859 | cAGTCAtgg V$MSX1_01 | 6510 (-) | 0.940 | 0.854 | cAAATAaaa V$MSX1_01 | 6573 (-) | 0.988 | 0.869 | cAGTTAtgc V$MSX1_01 | 6593 (-) | 0.940 | 0.919 | cAAATAagg V$MSX1_01 | 6605 (+) | 0.940 | 0.862 | tgcTATTTg V$MSX1_01 | 6621 (+) | 0.865 | 0.907 | ccgTGAGTg V$MSX1_01 | 6904 (+) | 0.988 | 0.869 | gcaTAAGTg V$MSX1_01 | 6929 (+) | 0.928 | 0.885 | ttgTATGTg V$MSX1_01 | 7830 (-) | 1.000 | 0.885 | cAATTAacc V$MSX1_01 | 7864 (-) | 0.988 | 0.895 | cACTTAgaa V$MSX1_01 | 7965 (+) | 0.861 | 0.856 | ctaTAGTTg V$MSX1_01 | 8017 (+) | 0.900 | 0.892 | cctTTAGTg V$MSX1_01 | 8263 (-) | 0.873 | 0.847 | cATTGAcct V$MSX1_01 | 8335 (+) | 0.940 | 0.958 | cagTATTTg V$MSX1_01 | 8467 (-) | 0.857 | 0.861 | cATCTAagg V$MSX1_01 | 8517 (-) | 0.865 | 0.906 | cACTCAcag V$MSX1_01 | 8566 (+) | 0.818 | 0.873 | ccgTCTTTg V$MSX1_01 | 8574 (+) | 0.996 | 0.882 | gatTAAATg V$MSX1_01 | 8618 (-) | 0.877 | 0.850 | cAATCActt V$MSX1_01 | 8900 (-) | 0.841 | 0.889 | cACAAAcag V$MSX1_01 | 9135 (+) | 0.853 | 0.897 | cagTTTTTg V$MSX1_01 | 9485 (+) | 0.940 | 0.919 | cttTATTTg V$MSX1_01 | 9564 (+) | 0.841 | 0.889 | cggTTTCTg V$MSX1_01 | 9794 (+) | 0.996 | 0.997 | cagTAAATg V$MSX1_01 | 9794 (-) | 0.900 | 0.891 | cAGTAAatg V$MSX1_01 | 10307 (+) | 0.988 | 0.887 | tttTAACTg V$MSX1_01 | 10390 (-) | 0.936 | 0.851 | cATATAaaa V$MSX1_01 | 10508 (-) | 1.000 | 0.953 | cAATTAtgg V$MSX1_01 | 10732 (+) | 
0.912 | 0.900 | cttTTATTg V$MSX1_01 | 10915 (-) | 0.940 | 0.854 | cAAATAata V$MSX1_01 | 10957 (+) | 0.806 | 0.865 | cggTCTGTg V$MSX1_01 | 11070 (-) | 0.988 | 0.927 | cAGTTAcaa V$MSX1_01 | 11234 (+) | 0.988 | 0.895 | ttcTAACTg V$MSX1_01 | 11338 (+) | 0.996 | 0.932 | atgTAAATg V$MSX1_01 | 11369 (+) | 0.928 | 0.854 | ttcTATCTg V$MSX1_01 | 11407 (-) | 0.996 | 0.874 | cATTTAttc V$MSX1_01 | 11471 (+) | 0.940 | 0.854 | tttTATTTg V$MSX1_01 | 11539 (-) | 0.928 | 0.903 | cACATAtgg V$MSX1_01 | 11637 (+) | 0.996 | 0.882 | gttTAAATg V$MSX1_01 | 11775 (-) | 0.928 | 0.885 | cAGATActt P$ABF1_02 | 728 (-) | 1.000 | 0.762 | gtgagccaccaCGTGGttgctggga V$AMEF2_Q6 | 627 (-) | 1.000 | 0.825 | acacttttTTTTAaacat V$AMEF2_Q6 | 2403 (+) | 1.000 | 0.879 | ttcctTAAAAaaaacaaa V$AMEF2_Q6 | 3801 (+) | 0.885 | 0.825 | cagaaTAGAAaaaacatc V$AMEF2_Q6 | 4420 (+) | 1.000 | 0.820 | ttgtcTAAAAaaaaaaaa V$AMEF2_Q6 | 4779 (+) | 0.885 | 0.830 | ctggaTAAAGattgctga V$AMEF2_Q6 | 8850 (+) | 1.000 | 0.830 | cggttTAAAAattaagac V$AMEF2_Q6 | 9833 (-) | 1.000 | 0.868 | agggataaTTTTAaagtc V$AMEF2_Q6 | 10411 (-) | 1.000 | 0.882 | gttggcatTTTTAacact V$MMEF2_Q6 | 234 (-) | 1.000 | 0.880 | ttgtctTTTTAaatct V$MMEF2_Q6 | 520 (+) | 1.000 | 0.883 | gatttTAAAAatgatg V$MMEF2_Q6 | 598 (-) | 1.000 | 0.880 | cttactTTTTAaacat V$MMEF2_Q6 | 629 (-) | 1.000 | 0.943 | acttttTTTTAaacat V$MMEF2_Q6 | 2403 (+) | 1.000 | 0.926 | ttcctTAAAAaaaaca V$MMEF2_Q6 | 2896 (-) | 1.000 | 0.896 | catcttTTTTAtccac V$MMEF2_Q6 | 3745 (-) | 1.000 | 0.885 | ctacatTTTTAgtcat V$MMEF2_Q6 | 4420 (+) | 1.000 | 0.912 | ttgtcTAAAAaaaaaa V$MMEF2_Q6 | 7259 (+) | 1.000 | 0.902 | ctcatTAAAAtatcct V$MMEF2_Q6 | 7642 (+) | 1.000 | 0.890 | tatttTAAAAatgctt V$MMEF2_Q6 | 8633 (-) | 0.943 | 0.877 | aagcttATTTAagaag V$MMEF2_Q6 | 8850 (+) | 1.000 | 0.923 | cggttTAAAAattaag V$MMEF2_Q6 | 9710 (+) | 1.000 | 0.878 | tggtgTAAAAttgatc V$MMEF2_Q6 | 9835 (-) | 1.000 | 0.895 | ggataaTTTTAaagtc V$MMEF2_Q6 | 9892 (-) | 1.000 | 0.922 | attcttTTTTAaagac V$MMEF2_Q6 | 10389 (+) | 1.000 | 
0.902 | ccataTAAAAatgaac V$MMEF2_Q6 | 10413 (-) | 1.000 | 0.923 | tggcatTTTTAacact V$SOX9_B1 | 945 (+) | 1.000 | 0.963 | taggaACAATgatc V$SOX9_B1 | 7329 (+) | 1.000 | 0.978 | ctggaACAATagac V$SOX9_B1 | 8554 (+) | 1.000 | 0.956 | aatgaACAATgtcc V$SOX9_B1 | 10147 (-) | 1.000 | 0.943 | tattATTGTtgttg V$SOX9_B1 | 10339 (-) | 1.000 | 0.950 | ttttATTGTtttaa V$SOX9_B1 | 10457 (+) | 1.000 | 0.952 | tatcaACAATgcat V$HNF4_01_B | 856 (-) | 0.800 | 0.804 | acgatCGTTGccctt V$HNF4_01_B | 3566 (-) | 1.000 | 0.865 | gtggcCTTTGgcaag V$HNF4_01_B | 4723 (-) | 0.800 | 0.815 | ctgtgATTTGgccct V$HNF4_01_B | 5452 (-) | 0.800 | 0.806 | ataacCTTTTcccca V$HNF4_01_B | 7084 (-) | 1.000 | 0.835 | gagtgCTTTGcgcgc V$HNF4_01_B | 8375 (-) | 1.000 | 0.894 | ctgctCTTTGatctg V$HNF4_01_B | 8581 (+) | 1.000 | 0.918 | tggggCAAAGgcaac V$HNF4_01_B | 10496 (+) | 0.800 | 0.819 | aggatCAAAAgtcaa V$HNF4_01_B | 11287 (+) | 1.000 | 0.825 | tagggCAAAGaaata V$HNF4_01_B | 11789 (+) | 0.800 | 0.818 | aaggaCAAATcccag V$AREB6_01 | 265 (-) | 1.000 | 0.923 | actCAGGTccttg V$AREB6_01 | 575 (+) | 1.000 | 0.939 | cgtctACCTGttg V$AREB6_01 | 1293 (+) | 1.000 | 0.950 | cctggACCTGtgt V$AREB6_01 | 1356 (+) | 1.000 | 0.908 | gggtcACCTGatg V$AREB6_01 | 1391 (-) | 1.000 | 0.945 | gggCAGGTgagga V$AREB6_01 | 1784 (+) | 1.000 | 0.900 | ggagcACCTGcca V$AREB6_01 | 1969 (-) | 1.000 | 0.905 | gagCAGGTtggcc V$AREB6_01 | 2044 (+) | 1.000 | 0.962 | actgtACCTGgct V$AREB6_01 | 2643 (+) | 1.000 | 0.947 | ccctgACCTGcgt V$AREB6_01 | 3050 (+) | 1.000 | 0.959 | gacatACCTGcga V$AREB6_01 | 3900 (-) | 1.000 | 0.941 | tcaCAGGTattaa V$AREB6_01 | 4338 (-) | 1.000 | 0.918 | atcCAGGTggtgg V$AREB6_01 | 4401 (-) | 1.000 | 0.947 | agaCAGGTcagcc V$AREB6_01 | 4980 (+) | 1.000 | 0.964 | cccctACCTGtcc V$AREB6_01 | 5057 (-) | 1.000 | 0.923 | aagCAGGTagtga V$AREB6_01 | 7390 (-) | 1.000 | 0.905 | gtcCAGGTggtag V$AREB6_01 | 7580 (-) | 1.000 | 0.909 | catCAGGTcaaac V$AREB6_01 | 8167 (-) | 1.000 | 0.939 | aggCAGGTagcac V$AREB6_01 | 8263 (+) | 1.000 | 0.912 | 
cattgACCTGatg V$AREB6_01 | 8388 (+) | 1.000 | 0.947 | tgtatACCTGaag V$AREB6_01 | 8503 (-) | 1.000 | 0.952 | ggaCAGGTgcggg V$AREB6_01 | 8766 (+) | 1.000 | 0.914 | ccagcACCTGgga V$AREB6_01 | 8968 (-) | 1.000 | 0.939 | aacCAGGTgtggt V$AREB6_01 | 11002 (+) | 1.000 | 0.930 | agattACCTGctt V$AREB6_01 | 11959 (+) | 1.000 | 0.953 | actgcACCTGagc V$AREB6_02 | 1357 (+) | 1.000 | 0.978 | ggtCACCTgatg V$AREB6_02 | 1391 (-) | 1.000 | 0.978 | gggcAGGTGagg V$AREB6_02 | 3289 (-) | 1.000 | 0.973 | gatgAGGTGaac V$AREB6_02 | 8503 (-) | 1.000 | 0.979 | ggacAGGTGcgg V$AREB6_02 | 8984 (+) | 1.000 | 0.983 | acaCACCTttaa V$AREB6_02 | 11960 (+) | 1.000 | 0.971 | ctgCACCTgagc V$AREB6_03 | 1125 (+) | 0.852 | 0.877 | atgCACGTgtgt V$AREB6_03 | 1357 (+) | 1.000 | 0.907 | ggtCACCTgatg V$AREB6_03 | 1391 (-) | 1.000 | 0.922 | gggcAGGTGagg V$AREB6_03 | 1785 (+) | 1.000 | 0.958 | gagCACCTgcca V$AREB6_03 | 4338 (-) | 1.000 | 0.929 | atccAGGTGgtg V$AREB6_03 | 7390 (-) | 1.000 | 0.924 | gtccAGGTGgta V$AREB6_03 | 8503 (-) | 1.000 | 0.994 | ggacAGGTGcgg V$AREB6_03 | 8520 (-) | 0.852 | 0.875 | tcacAGATGctt V$AREB6_03 | 8767 (+) | 1.000 | 0.984 | cagCACCTggga V$AREB6_03 | 8968 (-) | 1.000 | 0.969 | aaccAGGTGtgg V$AREB6_03 | 11960 (+) | 1.000 | 0.972 | ctgCACCTgagc V$CART1_01 | 62 (+) | 1.000 | 0.850 | ccaTAATTactgttgttg V$CART1_01 | 834 (-) | 0.783 | 0.757 | atataaaacaAGTTAaat V$CART1_01 | 839 (-) | 0.854 | 0.754 | aaacaagttaAATAAata V$CART1_01 | 987 (+) | 0.924 | 0.757 | agtTAATAattttcatgt V$CART1_01 | 990 (+) | 1.000 | 0.827 | taaTAATTttcatgtggg V$CART1_01 | 1201 (+) | 0.778 | 0.799 | tggTTATAcccattgctt V$CART1_01 | 1361 (+) | 0.797 | 0.733 | accTGATGctcgtttgca V$CART1_01 | 1890 (+) | 0.778 | 0.788 | attTTATAtttattttat V$CART1_01 | 1904 (+) | 0.854 | 0.738 | ttaTTATTttatttgaga V$CART1_01 | 1985 (-) | 0.933 | 0.765 | aactcaccaaGATTAtct V$CART1_01 | 2512 (-) | 1.000 | 0.831 | agggaaaaaaAATTAcat V$CART1_01 | 2519 (-) | 0.924 | 0.749 | aaaaattacaTATTActt V$CART1_01 | 2591 (-) | 0.924 | 0.800 | 
cagtactgacTATTAaac V$CART1_01 | 2652 (+) | 0.951 | 0.805 | gcgTAATGcactttgatg V$CART1_01 | 3082 (+) | 0.847 | 0.744 | gggTGATTctatttttcg V$CART1_01 | 3106 (+) | 1.000 | 0.829 | tacTAATTtccatgcggc V$CART1_01 | 3126 (+) | 1.000 | 0.929 | tatTAATTtacattccca V$CART1_01 | 3279 (+) | 0.847 | 0.789 | ttaTGATTgggatgaggt V$CART1_01 | 3328 (+) | 0.847 | 0.747 | tccTGATTctatttccct V$CART1_01 | 3402 (+) | 0.778 | 0.768 | cagTTATAaaagttaatg V$CART1_01 | 3402 (-) | 0.783 | 0.772 | cagttataaaAGTTAatg V$CART1_01 | 3406 (-) | 0.839 | 0.735 | tataaaagttAATGAttc V$CART1_01 | 3415 (+) | 0.847 | 0.752 | taaTGATTctggttgctt V$CART1_01 | 3429 (+) | 1.000 | 0.750 | gctTAATTgacttaggta V$CART1_01 | 3707 (+) | 0.779 | 0.798 | gctTGATCaatatttacg V$CART1_01 | 3724 (-) | 1.000 | 0.812 | gggtgactaaAATTAtta V$CART1_01 | 3897 (-) | 0.924 | 0.757 | tcatcacaggTATTAata V$CART1_01 | 3907 (+) | 0.924 | 0.786 | tatTAATAaagctttatg V$CART1_01 | 3920 (-) | 0.951 | 0.766 | ttatgaattaCATTAtca V$CART1_01 | 4059 (+) | 1.000 | 0.842 | caaTAATTaaaatatgta V$CART1_01 | 4448 (+) | 0.924 | 0.878 | aatTAATAttcatttcac V$CART1_01 | 5005 (-) | 0.951 | 0.816 | ttgtattaccCATTAagc V$CART1_01 | 5433 (+) | 0.854 | 0.852 | actTTATTtgtattgcca V$CART1_01 | 5578 (-) | 0.854 | 0.740 | tctaaaggatAATAAtaa V$CART1_01 | 5581 (-) | 0.854 | 0.749 | aaaggataatAATAAaat V$CART1_01 | 6585 (-) | 0.854 | 0.754 | acacaaaacaAATAAggt V$CART1_01 | 6669 (+) | 0.854 | 0.742 | attTTATTtagcttttga V$CART1_01 | 6949 (-) | 0.951 | 0.829 | tgatactgtcCATTAttc V$CART1_01 | 7059 (+) | 1.000 | 0.828 | tctTAATTtgtttttgtt V$CART1_01 | 7251 (-) | 0.951 | 0.801 | gaaaaagtctCATTAaaa V$CART1_01 | 7257 (+) | 0.839 | 0.748 | gtcTCATTaaaatatcct V$CART1_01 | 7348 (-) | 0.847 | 0.743 | tcaacatgttAATCAtgg V$CART1_01 | 7424 (+) | 0.854 | 0.739 | ggtTTATTtgtttttatt V$CART1_01 | 7570 (-) | 0.797 | 0.806 | accaaatcccCATCAggt V$CART1_01 | 7778 (-) | 1.000 | 0.834 | acaaaacaaaAATTAtcc V$CART1_01 | 7884 (+) | 0.839 | 0.732 | aagTCATTtggtttttat V$CART1_01 | 7895 (+) | 0.854 | 
0.741 | tttTTATTttgatattag V$CART1_01 | 8678 (-) | 1.000 | 0.741 | ttaggaaggtAATTAgcc V$CART1_01 | 8684 (+) | 1.000 | 0.838 | aggTAATTagcctttctg V$CART1_01 | 8945 (-) | 1.000 | 0.766 | aactttttaaAATTAgta V$CART1_01 | 9164 (-) | 0.790 | 0.769 | tcctaagatgCATGAacg V$CART1_01 | 9484 (+) | 0.854 | 0.765 | actTTATTtgcaatagaa V$CART1_01 | 9734 (+) | 0.951 | 0.803 | tgaTAATGtccatacaat V$CART1_01 | 9795 (-) | 0.797 | 0.807 | agtaaatgtaCATCAcat V$CART1_01 | 9862 (-) | 0.839 | 0.735 | attaaagaagAATGAgga V$CART1_01 | 9931 (+) | 1.000 | 0.825 | gggTAATTtgcatgtatg V$CART1_01 | 10119 (-) | 1.000 | 0.744 | tccccaataaAATTAgtt V$CART1_01 | 10123 (-) | 0.783 | 0.764 | caataaaattAGTTAaga V$CART1_01 | 10137 (-) | 0.924 | 0.882 | aagaaattatTATTAttg V$CART1_01 | 10140 (+) | 0.854 | 0.850 | aaaTTATTattattgttg V$CART1_01 | 10143 (+) | 0.854 | 0.755 | ttaTTATTattgttgttg V$CART1_01 | 10146 (+) | 0.854 | 0.759 | ttaTTATTgttgttgtcg V$CART1_01 | 10307 (+) | 0.783 | 0.752 | tttTAACTgtttttatat V$CART1_01 | 10329 (+) | 0.778 | 0.758 | attTTATAgtttttattg V$CART1_01 | 10347 (+) | 1.000 | 0.798 | tttTAATTcaaagtgtca V$CART1_01 | 10382 (-) | 0.778 | 0.787 | ttgcaatccaTATAAaaa V$CART1_01 | 10388 (-) | 0.839 | 0.741 | tccatataaaAATGAacg V$CART1_01 | 10499 (-) | 1.000 | 0.829 | atcaaaagtcAATTAtgg V$CART1_01 | 10907 (-) | 0.854 | 0.754 | ctactatgcaAATAAtag V$CART1_01 | 11150 (+) | 0.797 | 0.766 | ctcTGATGaaaatgaagc V$CART1_01 | 11150 (-) | 0.839 | 0.795 | ctctgatgaaAATGAagc V$CART1_01 | 11196 (-) | 0.924 | 0.746 | cttgattgatTATTAatt V$CART1_01 | 11206 (+) | 1.000 | 0.797 | tatTAATTtgtttgaact V$CART1_01 | 11267 (-) | 1.000 | 0.832 | aagaaaaggaAATTAagg V$CART1_01 | 11294 (-) | 0.854 | 0.837 | aagaaataatAATAAcga V$CART1_01 | 11300 (+) | 0.924 | 0.808 | taaTAATAacgactacgg V$CART1_01 | 11338 (+) | 0.774 | 0.856 | atgTAAATgccattatcg V$CART1_01 | 11338 (-) | 0.951 | 0.857 | atgtaaatgcCATTAtcg V$CART1_01 | 11381 (-) | 0.924 | 0.778 | tttgaaaactTATTAaat V$CART1_01 | 11387 (+) | 0.854 | 0.751 | aacTTATTaaattttgta 
V$CART1_01 | 11470 (+) | 0.854 | 0.742 | tttTTATTtgtttttccc V$CART1_01 | 11662 (-) | 0.805 | 0.812 | acacaattttCATAAcca V$CART1_01 | 11679 (-) | 0.854 | 0.752 | aaaaaagaccAATAAagt V$CART1_01 | 11688 (-) | 0.847 | 0.805 | caataaagttAATCActg V$CART1_01 | 11694 (+) | 0.933 | 0.778 | agtTAATCactgtcaccc V$TGIF_01 | 45 (+) | 1.000 | 0.965 | agaTGTCAtta V$TGIF_01 | 470 (-) | 1.000 | 0.932 | attTGACActt V$TGIF_01 | 622 (-) | 1.000 | 0.931 | acaTGACActt V$TGIF_01 | 9382 (+) | 1.000 | 0.972 | agcTGTCAgcc V$TGIF_01 | 10357 (+) | 1.000 | 0.929 | aagTGTCAgag V$MEIS1_01 | 1254 (-) | 1.000 | 0.996 | attcTGTCAtca V$MEIS1_01 | 1281 (+) | 1.000 | 0.996 | ggaTGACAggga V$MEIS1_01 | 6052 (+) | 1.000 | 0.998 | ctgTGACAgccc V$MEIS1_01 | 8101 (-) | 1.000 | 0.998 | gtccTGTCAcac V$MEIS1_01 | 9381 (-) | 1.000 | 0.996 | gagcTGTCAgcc V$MEIS1_01 | 11700 (-) | 1.000 | 0.995 | tcacTGTCAccc V$MEIS1AHOXA9_01 | 1026 (+) | 0.821 | 0.789 | TGGCAgttgaatca V$MEIS1AHOXA9_01 | 1137 (-) | 0.668 | 0.820 | tcctaaaacCGTTA V$MEIS1AHOXA9_01 | 1230 (+) | 0.847 | 0.800 | TGACTgctttgaca V$MEIS1AHOXA9_01 | 1249 (-) | 1.000 | 0.731 | gcagcattcTGTCA V$MEIS1AHOXA9_01 | 1783 (-) | 0.821 | 0.736 | tggagcaccTGCCA V$MEIS1AHOXA9_01 | 2545 (+) | 0.821 | 0.802 | TAACAggtttctgt V$MEIS1AHOXA9_01 | 3223 (-) | 0.847 | 0.765 | acttttaccAGTCA V$MEIS1AHOXA9_01 | 3445 (+) | 0.821 | 0.741 | TAACAgttaataaa V$MEIS1AHOXA9_01 | 3928 (-) | 0.847 | 0.732 | tacattatcAGTCA V$MEIS1AHOXA9_01 | 4926 (+) | 0.805 | 0.762 | TTACAgatcatgga V$MEIS1AHOXA9_01 | 5088 (+) | 0.847 | 0.793 | TGACTggtttgggt V$MEIS1AHOXA9_01 | 6428 (-) | 0.875 | 0.778 | gctttcatcGGTCA V$MEIS1AHOXA9_01 | 7301 (+) | 0.821 | 0.742 | TGAGAgagtaagga V$MEIS1AHOXA9_01 | 7679 (+) | 0.805 | 0.739 | TCACAgctgtaaag V$MEIS1AHOXA9_01 | 8720 (-) | 0.805 | 0.733 | ctgaagaacTGTAA V$MEIS1AHOXA9_01 | 9376 (-) | 1.000 | 0.781 | agggagagcTGTCA V$MEIS1AHOXA9_01 | 9506 (+) | 0.805 | 0.788 | TGAAAgtttcccca V$MEIS1AHOXA9_01 | 9760 (-) | 0.847 | 0.768 | attgaaaacAGTCA V$MEIS1AHOXA9_01 | 10185 (-) | 
0.847 | 0.747 | gtgttcagcAGTCA V$MEIS1AHOXA9_01 | 10310 (+) | 0.668 | 0.742 | TAACTgtttttata V$MEIS1AHOXA9_01 | 10363 (-) | 0.805 | 0.745 | cagagaatcTTTCA V$MEIS1AHOXA9_01 | 11453 (-) | 0.652 | 0.747 | tggtgaatcAGTGA V$MEIS1BHOXA9_02 | 625 (+) | 1.000 | 0.859 | TGACActttttttt V$MEIS1BHOXA9_02 | 10351 (-) | 1.000 | 0.913 | aattcaaagTGTCA V$FOXJ2_01 | 210 (-) | 0.960 | 0.898 | cctggTATTTtttttttt V$FOXJ2_01 | 503 (-) | 0.960 | 0.917 | ttataTATTTgtcttctg V$FOXJ2_01 | 844 (+) | 0.960 | 0.948 | agttaaatAAATAcgatc V$FOXJ2_01 | 873 (+) | 1.000 | 0.894 | aaatgcacAAACAcgaaa V$FOXJ2_01 | 1524 (-) | 1.000 | 0.921 | agagcTGTTTttttgttc V$FOXJ2_01 | 1891 (-) | 0.960 | 0.964 | ttttaTATTTattttatt V$FOXJ2_01 | 2358 (-) | 0.960 | 0.949 | acttgTATTTattttgcc V$FOXJ2_01 | 2406 (+) | 1.000 | 0.934 | cttaaaaaAAACAaaaaa V$FOXJ2_01 | 2410 (+) | 0.917 | 0.906 | aaaaaaacAAAAAatcac V$FOXJ2_01 | 4000 (+) | 0.960 | 0.907 | acaacgatAAATAaatat V$FOXJ2_01 | 4004 (+) | 0.960 | 0.956 | cgataaatAAATAttctg V$FOXJ2_01 | 4217 (+) | 0.960 | 0.895 | agttaaaaAAATAaattc V$FOXJ2_01 | 4490 (+) | 1.000 | 0.898 | tgaacaaaAAACAaaaca V$FOXJ2_01 | 5432 (-) | 0.960 | 0.900 | cacttTATTTgtattgcc V$FOXJ2_01 | 5642 (+) | 0.917 | 0.924 | gacaaaatAAAAAgaaga V$FOXJ2_01 | 5714 (+) | 1.000 | 0.909 | tcccacaaAAACActaaa V$FOXJ2_01 | 5804 (+) | 0.960 | 0.901 | aaaaaaaaAAATAgccct V$FOXJ2_01 | 6586 (+) | 0.960 | 0.926 | cacaaaacAAATAaggtg V$FOXJ2_01 | 6821 (-) | 0.917 | 0.897 | tccctTTTTTatgtgcta V$FOXJ2_01 | 7045 (-) | 0.960 | 0.909 | ttgacTATTTtttttctt V$FOXJ2_01 | 7062 (-) | 1.000 | 0.907 | taattTGTTTttgtttgt V$FOXJ2_01 | 7064 (-) | 0.917 | 0.899 | atttgTTTTTgtttgtgt V$FOXJ2_01 | 7068 (-) | 1.000 | 0.939 | gttttTGTTTgtgtatga V$FOXJ2_01 | 7423 (-) | 0.960 | 0.937 | aggttTATTTgtttttat V$FOXJ2_01 | 7427 (-) | 1.000 | 0.914 | ttattTGTTTttatttta V$FOXJ2_01 | 7429 (-) | 0.917 | 0.927 | atttgTTTTTattttatg V$FOXJ2_01 | 7768 (+) | 1.000 | 0.897 | gtctcaaaAAACAaaaca V$FOXJ2_01 | 7777 (+) | 0.917 | 0.905 | aacaaaacAAAAAttatc V$FOXJ2_01 | 
7795 (+) | 0.917 | 0.902 | ccaaagatAAAGAacaac V$FOXJ2_01 | 7819 (+) | 1.000 | 0.972 | tccaaaacAAACAattaa V$FOXJ2_01 | 7890 (-) | 0.917 | 0.931 | tttggTTTTTattttgat V$FOXJ2_01 | 7936 (-) | 1.000 | 0.908 | ttgtgTGTTTagttttga V$FOXJ2_01 | 7967 (-) | 1.000 | 0.963 | atagtTGTTTgtttttca V$FOXJ2_01 | 8895 (+) | 1.000 | 0.898 | cgtaccacAAACAgattc V$FOXJ2_01 | 9108 (+) | 0.917 | 0.896 | agcaaaacAAAAAaccgg V$FOXJ2_01 | 9422 (+) | 0.960 | 0.896 | tatcccatAAATAacagg V$FOXJ2_01 | 9531 (+) | 1.000 | 0.911 | actagaacAAACAagtcc V$FOXJ2_01 | 10309 (-) | 1.000 | 0.917 | ttaacTGTTTttatattt V$FOXJ2_01 | 10311 (-) | 0.917 | 0.905 | aactgTTTTTatattttt V$FOXJ2_01 | 10319 (-) | 0.917 | 0.933 | ttataTTTTTattttata V$FOXJ2_01 | 10333 (-) | 0.917 | 0.901 | tatagTTTTTattgtttt V$FOXJ2_01 | 10387 (+) | 0.917 | 0.896 | atccatatAAAAAtgaac V$FOXJ2_01 | 10698 (-) | 1.000 | 0.905 | acccgTGTTTatgaaaca V$FOXJ2_01 | 11186 (-) | 1.000 | 0.895 | tgcctTGTTTcttgattg V$FOXJ2_01 | 11289 (+) | 0.960 | 0.893 | gggcaaagAAATAataat V$FOXJ2_01 | 11465 (-) | 0.917 | 0.925 | gaaatTTTTTatttgttt V$FOXJ2_01 | 11469 (-) | 0.960 | 0.929 | tttttTATTTgtttttcc V$E2F_Q3 | 2472 (-) | 1.000 | 0.946 | gcGCGAAg V$E2F_Q6 | 2472 (-) | 1.000 | 0.930 | gCGCGAag V$E2F1_Q3 | 2472 (-) | 1.000 | 0.997 | gCGCGAag V$E2F1_Q3 | 8331 (-) | 0.982 | 0.911 | gCGCCAgt V$E2F1_Q3 | 8511 (-) | 0.943 | 0.932 | gCGGGAca V$E2F1_Q3 | 9122 (-) | 0.943 | 0.908 | cCGGGAaa V$E2F1_Q4 | 1720 (+) | 0.819 | 0.870 | tTTTCCgg V$E2F1_Q4 | 2472 (-) | 0.973 | 0.907 | gcGCGAAg V$E2F1_Q4 | 2502 (-) | 0.785 | 0.846 | ccGAAAAa V$E2F1_Q4 | 3718 (+) | 0.785 | 0.845 | aTTTACgg V$E2F1_Q4 | 5750 (-) | 0.919 | 0.845 | acGCAAAg V$E2F1_Q4 | 6129 (+) | 0.839 | 0.885 | tTTCTCgg V$E2F1_Q4 | 7089 (+) | 0.919 | 0.869 | cTTTGCgc V$E2F1_Q4 | 8547 (-) | 0.839 | 0.885 | ccGAGAAa V$E2F1_Q4 | 8614 (-) | 0.866 | 0.903 | ccGTCAAt V$E2F1_Q4 | 9122 (-) | 0.873 | 0.909 | ccGGGAAa V$E2F1_Q4 | 9217 (-) | 0.919 | 0.845 | acGCAAAg V$E2F1_Q4 | 10270 (+) | 0.900 | 0.928 | tTTGCCgg V$E2F1_Q6 | 2472 
(-) | 1.000 | 0.982 | gCGCGAag V$E2F1_Q6 | 7089 (+) | 0.886 | 0.892 | ctTTGCGc V$E2F1_Q6 | 9122 (-) | 0.932 | 0.908 | cCGGGAaa V$E2F1_Q6 | 10270 (+) | 0.897 | 0.880 | ttTGCCGg V$TITF1_Q3 | 38 (-) | 0.880 | 0.823 | caaCTTCAga V$TITF1_Q3 | 281 (+) | 0.857 | 0.782 | atGCAAGtca V$TITF1_Q3 | 286 (+) | 1.000 | 0.984 | agTCAAGtgt V$TITF1_Q3 | 542 (-) | 0.880 | 0.809 | gcgCTTGTgt V$TITF1_Q3 | 627 (-) | 0.737 | 0.775 | acaCTTTTtt V$TITF1_Q3 | 679 (-) | 0.880 | 0.765 | gctCTTCAga V$TITF1_Q3 | 839 (+) | 0.880 | 0.815 | aaACAAGtta V$TITF1_Q3 | 937 (+) | 0.880 | 0.765 | ccACAAGtta V$TITF1_Q3 | 1059 (-) | 0.857 | 0.772 | caaCTTTAaa V$TITF1_Q3 | 1346 (-) | 0.857 | 0.810 | aggCTTGCct V$TITF1_Q3 | 1444 (+) | 0.880 | 0.816 | agTGAAGtca V$TITF1_Q3 | 1655 (-) | 0.857 | 0.795 | ctaCTTGGca V$TITF1_Q3 | 1864 (+) | 0.880 | 0.830 | acTGAAGaag V$TITF1_Q3 | 1938 (-) | 0.857 | 0.799 | agcCTTGGct V$TITF1_Q3 | 2028 (+) | 0.857 | 0.768 | atTAAAGgag V$TITF1_Q3 | 2356 (-) | 0.880 | 0.863 | aaaCTTGTat V$TITF1_Q3 | 2606 (+) | 0.857 | 0.766 | aaCCAAGagc V$TITF1_Q3 | 3202 (+) | 1.000 | 0.919 | ccTCAAGtat V$TITF1_Q3 | 3215 (+) | 0.857 | 0.768 | agGCAAGcac V$TITF1_Q3 | 3311 (+) | 0.857 | 0.817 | agTTAAGatt V$TITF1_Q3 | 3550 (-) | 0.857 | 0.784 | gttCTTTAgt V$TITF1_Q3 | 3583 (+) | 0.762 | 0.766 | atTCAACtct V$TITF1_Q3 | 3657 (+) | 0.857 | 0.812 | agTAAAGtaa V$TITF1_Q3 | 3681 (+) | 0.857 | 0.774 | tgCCAAGtgc V$TITF1_Q3 | 3696 (+) | 0.880 | 0.809 | acACAAGaga V$TITF1_Q3 | 3705 (-) | 1.000 | 0.851 | atgCTTGAtc V$TITF1_Q3 | 3759 (-) | 0.857 | 0.763 | attCTTAAaa V$TITF1_Q3 | 3866 (+) | 0.857 | 0.768 | atTTAAGgag V$TITF1_Q3 | 3886 (-) | 1.000 | 0.820 | ctcCTTGAcc V$TITF1_Q3 | 3911 (+) | 0.857 | 0.808 | aaTAAAGctt V$TITF1_Q3 | 4201 (+) | 1.000 | 0.818 | gtTCAAGgct V$TITF1_Q3 | 4206 (-) | 0.857 | 0.810 | aggCTTGCct V$TITF1_Q3 | 4253 (-) | 0.857 | 0.804 | ttaCTTGGtt V$TITF1_Q3 | 4416 (-) | 0.880 | 0.824 | aacCTTGTct V$TITF1_Q3 | 4693 (-) | 1.000 | 0.888 | aaaCTTGAtg V$TITF1_Q3 | 4707 (-) | 0.857 | 0.870 | 
agaCTTGGgt V$TITF1_Q3 | 4794 (-) | 0.857 | 0.782 | tgaCTTGCat V$TITF1_Q3 | 4851 (-) | 0.880 | 0.872 | agaCTTCAct V$TITF1_Q3 | 5171 (-) | 0.880 | 0.809 | tcaCTTGTga V$TITF1_Q3 | 5208 (-) | 0.880 | 0.763 | cctCTTGTgc V$TITF1_Q3 | 5431 (-) | 0.857 | 0.839 | ccaCTTTAtt V$TITF1_Q3 | 5578 (+) | 0.857 | 0.784 | tcTAAAGgat V$TITF1_Q3 | 5607 (+) | 0.880 | 0.802 | acACAAGaaa V$TITF1_Q3 | 5969 (+) | 0.857 | 0.786 | ttTTAAGtgg V$TITF1_Q3 | 6121 (-) | 0.880 | 0.764 | gggCTTGTtt V$TITF1_Q3 | 6195 (+) | 0.857 | 0.765 | atTTAAGccg V$TITF1_Q3 | 6326 (+) | 0.880 | 0.802 | acTGAAGcac V$TITF1_Q3 | 6336 (-) | 0.762 | 0.795 | acaCTGGAtt V$TITF1_Q3 | 6697 (+) | 0.880 | 0.800 | gcTGAAGtag V$TITF1_Q3 | 6894 (+) | 0.880 | 0.821 | agACAAGagg V$TITF1_Q3 | 6942 (+) | 0.857 | 0.835 | acCCAAGtga V$TITF1_Q3 | 7032 (+) | 0.880 | 0.791 | gcTGAAGagt V$TITF1_Q3 | 7128 (+) | 1.000 | 0.836 | gcTCAAGgag V$TITF1_Q3 | 7382 (-) | 0.880 | 0.802 | ttgCTTGTgt V$TITF1_Q3 | 7407 (+) | 0.880 | 0.802 | acTGAAGaac V$TITF1_Q3 | 7525 (+) | 0.880 | 0.839 | acTGAAGtta V$TITF1_Q3 | 7696 (-) | 0.857 | 0.777 | gaaCTTGGga V$TITF1_Q3 | 7980 (+) | 1.000 | 0.813 | ttTCAAGgga V$TITF1_Q3 | 7991 (-) | 0.880 | 0.816 | ggaCTTCAct V$TITF1_Q3 | 8015 (-) | 0.857 | 0.822 | aacCTTTAgt V$TITF1_Q3 | 8043 (-) | 1.000 | 0.852 | ggcCTTGAct V$TITF1_Q3 | 8182 (-) | 0.880 | 0.815 | gaaCTTGTtt V$TITF1_Q3 | 8349 (-) | 0.857 | 0.802 | ccaCTTGCca V$TITF1_Q3 | 8529 (+) | 1.000 | 0.787 | ctTCAAGaac V$TITF1_Q3 | 8595 (-) | 0.857 | 0.772 | catCTTTAat V$TITF1_Q3 | 8639 (+) | 0.857 | 0.779 | atTTAAGaag V$TITF1_Q3 | 8709 (+) | 1.000 | 0.812 | caTCAAGgag V$TITF1_Q3 | 8859 (+) | 0.857 | 0.801 | aaTTAAGact V$TITF1_Q3 | 8937 (+) | 0.857 | 0.784 | acTAAAGcaa V$TITF1_Q3 | 9035 (+) | 1.000 | 0.783 | gtTCAAGggc V$TITF1_Q3 | 9088 (+) | 0.880 | 0.779 | agTGAAGctc V$TITF1_Q3 | 9129 (-) | 0.880 | 0.840 | aacCTTCAgt V$TITF1_Q3 | 9193 (+) | 1.000 | 0.900 | acTCAAGaga V$TITF1_Q3 | 9223 (+) | 0.857 | 0.805 | agGCAAGtta V$TITF1_Q3 | 9295 (-) | 0.880 | 0.823 | cagCTTGTgt 
V$TITF1_Q3 | 9345 (+) | 0.880 | 0.814 | ccTGAAGtct V$TITF1_Q3 | 9482 (-) | 0.857 | 0.832 | ctaCTTTAtt V$TITF1_Q3 | 9539 (+) | 0.880 | 0.808 | aaACAAGtcc V$TITF1_Q3 | 9633 (+) | 1.000 | 0.769 | gtTCAAGgtc V$TITF1_Q3 | 9662 (-) | 0.880 | 0.768 | gacCTTGTct V$TITF1_Q3 | 9674 (+) | 0.761 | 0.766 | aaAGAAGtgg V$TITF1_Q3 | 9862 (+) | 0.857 | 0.779 | atTAAAGaag V$TITF1_Q3 | 10133 (+) | 0.857 | 0.768 | agTTAAGaaa V$TITF1_Q3 | 10249 (-) | 0.880 | 0.770 | aggCTTGTgc V$TITF1_Q3 | 10472 (-) | 0.857 | 0.791 | tcaCTTTAga V$TITF1_Q3 | 10491 (+) | 1.000 | 0.870 | taTCAAGgat V$TITF1_Q3 | 10505 (+) | 0.762 | 0.796 | agTCAATtat V$TITF1_Q3 | 10545 (+) | 0.857 | 0.840 | tcTAAAGtat V$TITF1_Q3 | 10932 (-) | 0.880 | 0.763 | cctCTTGTgg V$TITF1_Q3 | 11111 (-) | 0.880 | 0.816 | cgaCTTCAga V$TITF1_Q3 | 11121 (-) | 0.857 | 0.764 | tccCTTAAct V$TITF1_Q3 | 11160 (+) | 0.880 | 0.799 | aaTGAAGctg V$TITF1_Q3 | 11193 (-) | 1.000 | 0.869 | tttCTTGAtt V$TITF1_Q3 | 11439 (-) | 0.737 | 0.776 | ataCTTATct V$TITF1_Q3 | 11596 (-) | 0.762 | 0.798 | agaCTAGAgt V$TITF1_Q3 | 11689 (+) | 0.857 | 0.797 | aaTAAAGtta V$TITF1_Q3 | 11778 (-) | 1.000 | 0.895 | ataCTTGAtg V$TITF1_Q3 | 11945 (+) | 1.000 | 0.869 | ttTCAAGggt V$IPF1_Q4 | 48 (-) | 1.000 | 0.920 | tgtCATTActtg V$IPF1_Q4 | 61 (+) | 0.869 | 0.874 | gccaTAATTact V$IPF1_Q4 | 176 (-) | 1.000 | 0.915 | tggCATTActac V$IPF1_Q4 | 495 (-) | 1.000 | 0.861 | caaCATTAttat V$IPF1_Q4 | 1647 (+) | 0.869 | 0.821 | gtgcTAATCtac V$IPF1_Q4 | 1902 (-) | 0.891 | 0.835 | tttTATTAtttt V$IPF1_Q4 | 3173 (-) | 1.000 | 0.927 | gtaCATTAtatc V$IPF1_Q4 | 3274 (-) | 1.000 | 0.873 | tgcCATTAtgat V$IPF1_Q4 | 3411 (+) | 1.000 | 0.918 | aagtTAATGatt V$IPF1_Q4 | 3734 (-) | 0.891 | 0.870 | aatTATTAtttc V$IPF1_Q4 | 3906 (+) | 0.891 | 0.862 | gtatTAATAaag V$IPF1_Q4 | 3927 (-) | 1.000 | 0.846 | ttaCATTAtcag V$IPF1_Q4 | 4030 (+) | 0.891 | 0.845 | gaatTAATAttc V$IPF1_Q4 | 4058 (+) | 0.869 | 0.854 | gcaaTAATTaaa V$IPF1_Q4 | 4283 (-) | 1.000 | 0.891 | acaCATTAgacc V$IPF1_Q4 | 5012 (-) | 1.000 | 0.864 
| accCATTAagcc V$IPF1_Q4 | 5099 (-) | 0.891 | 0.841 | ggtTATTAtttg V$IPF1_Q4 | 5412 (-) | 0.891 | 0.871 | tctTATTAaatc V$IPF1_Q4 | 5586 (+) | 0.891 | 0.830 | ataaTAATAaaa V$IPF1_Q4 | 5788 (+) | 0.869 | 0.859 | gaaaTAATTaaa V$IPF1_Q4 | 6785 (+) | 1.000 | 0.882 | gatcTAATGctg V$IPF1_Q4 | 6808 (-) | 1.000 | 0.898 | cctCATTAcctt V$IPF1_Q4 | 6956 (-) | 1.000 | 0.842 | gtcCATTAttct V$IPF1_Q4 | 7258 (-) | 1.000 | 0.910 | tctCATTAaaat V$IPF1_Q4 | 7904 (-) | 0.891 | 0.823 | tgaTATTAgctt V$IPF1_Q4 | 8151 (-) | 1.000 | 0.875 | aaaCATTAgctt V$IPF1_Q4 | 8685 (-) | 0.869 | 0.837 | ggtAATTAgcct V$IPF1_Q4 | 9733 (+) | 1.000 | 0.908 | atgaTAATGtcc V$IPF1_Q4 | 9993 (-) | 0.891 | 0.843 | gggTATTAggtc V$IPF1_Q4 | 10141 (-) | 0.891 | 0.823 | aatTATTAttat V$IPF1_Q4 | 10144 (-) | 0.891 | 0.830 | tatTATTAttgt V$IPF1_Q4 | 10521 (-) | 1.000 | 0.839 | agcCATTAtagg V$IPF1_Q4 | 10655 (+) | 1.000 | 0.944 | gtccTAATGtca V$IPF1_Q4 | 11203 (-) | 0.891 | 0.845 | gatTATTAattt V$IPF1_Q4 | 11274 (-) | 0.869 | 0.850 | ggaAATTAaggc V$IPF1_Q4 | 11296 (+) | 0.891 | 0.877 | gaaaTAATAata V$IPF1_Q4 | 11299 (+) | 0.891 | 0.844 | ataaTAATAacg V$IPF1_Q4 | 11345 (-) | 1.000 | 0.846 | tgcCATTAtcga V$IPF1_Q4 | 11388 (-) | 0.891 | 0.823 | actTATTAaatt V$IPF1_Q4 | 11511 (-) | 0.891 | 0.822 | ggaTATTAccgt V$IPF1_Q4 | 11693 (+) | 0.869 | 0.842 | aagtTAATCact V$IPF1_Q4 | 11784 (+) | 0.783 | 0.819 | gatgTAAGGaca V$CHX10_01 | 8684 (+) | 1.000 | 0.997 | aggTAATTagcctt P$C1_Q2 | 126 (+) | 0.981 | 0.951 | ctAACTGctca P$C1_Q2 | 400 (+) | 0.985 | 0.953 | ccAACAAggcg P$C1_Q2 | 456 (-) | 0.996 | 0.960 | gtttTGGTTgg P$C1_Q2 | 607 (+) | 0.966 | 0.948 | taAACATcctg P$C1_Q2 | 719 (-) | 0.996 | 0.957 | cagaTGGTTgt P$C1_Q2 | 737 (-) | 0.996 | 0.946 | cacgTGGTTgc P$C1_Q2 | 933 (+) | 0.996 | 0.943 | gaAACCAcaag P$C1_Q2 | 937 (-) | 0.981 | 0.948 | ccacAAGTTag P$C1_Q2 | 1025 (-) | 0.981 | 0.954 | ttggCAGTTga P$C1_Q2 | 1317 (+) | 0.981 | 0.964 | ctAACTCcctt P$C1_Q2 | 1366 (-) | 0.981 | 0.946 | atgcTCGTTtg P$C1_Q2 | 1594 (-) | 0.978 | 
0.944 | cactGGGTTtg P$C1_Q2 | 1736 (+) | 0.981 | 0.948 | tgAACTTccta P$C1_Q2 | 1969 (-) | 0.978 | 0.955 | gagcAGGTTgg P$C1_Q2 | 1983 (+) | 0.981 | 0.952 | caAACTCacca P$C1_Q2 | 2391 (-) | 0.981 | 0.944 | ctgaGAGTTgc P$C1_Q2 | 2426 (-) | 0.978 | 0.946 | acagGGGTTgg P$C1_Q2 | 2481 (-) | 0.978 | 0.944 | ccctGGGTTcg P$C1_Q2 | 2604 (+) | 0.996 | 0.946 | taAACCAagag P$C1_Q2 | 3238 (+) | 0.966 | 0.956 | ccAACATtccc P$C1_Q2 | 3306 (-) | 1.000 | 0.964 | catgTAGTTaa P$C1_Q2 | 3500 (+) | 1.000 | 0.966 | ttAACTAccct P$C1_Q2 | 3508 (-) | 0.966 | 0.950 | cctgCTGTTag P$C1_Q2 | 3534 (-) | 0.978 | 0.958 | cccgAGGTTtg P$C1_Q2 | 3585 (+) | 0.981 | 0.944 | tcAACTCtcct P$C1_Q2 | 4195 (-) | 0.981 | 0.946 | ctgcAAGTTca P$C1_Q2 | 4257 (-) | 0.985 | 0.972 | ttggTTGTTag P$C1_Q2 | 4538 (+) | 0.966 | 0.946 | tcAACAGcctc P$C1_Q2 | 4570 (-) | 0.978 | 0.955 | gttgCGGTTag P$C1_Q2 | 5088 (-) | 0.996 | 0.946 | tgacTGGTTtg P$C1_Q2 | 5147 (-) | 0.981 | 0.948 | taggCAGTTta P$C1_Q2 | 5277 (+) | 0.985 | 0.969 | ccAACAAcagg P$C1_Q2 | 5336 (-) | 1.000 | 0.957 | ccgaTAGTTtt P$C1_Q2 | 5564 (+) | 0.985 | 0.950 | aaAACAAccag P$C1_Q2 | 5691 (-) | 1.000 | 0.946 | ccacTAGTTca P$C1_Q2 | 5907 (-) | 1.000 | 0.962 | agagTAGTTtg P$C1_Q2 | 6012 (+) | 0.981 | 0.960 | ccAACTCtcct P$C1_Q2 | 6024 (+) | 0.981 | 0.943 | tcAACTCtagg P$C1_Q2 | 6387 (+) | 0.981 | 0.946 | cgAACTGgcct P$C1_Q2 | 6522 (-) | 0.981 | 0.944 | cacgAAGTTca P$C1_Q2 | 6615 (+) | 0.981 | 0.944 | taAACTCcgtg P$C1_Q2 | 7034 (-) | 0.981 | 0.945 | tgaaGAGTTag P$C1_Q2 | 7038 (-) | 1.000 | 0.963 | gagtTAGTTga P$C1_Q2 | 7068 (-) | 0.985 | 0.945 | gtttTTGTTtg P$C1_Q2 | 7186 (+) | 0.963 | 0.955 | cgAACGGccac P$C1_Q2 | 7231 (-) | 0.981 | 0.961 | caggAAGTTca P$C1_Q2 | 7531 (-) | 0.981 | 0.953 | gttaCAGTTgg P$C1_Q2 | 7535 (-) | 0.996 | 0.950 | cagtTGGTTgt P$C1_Q2 | 7549 (+) | 0.981 | 0.945 | ccAACTGtgta P$C1_Q2 | 7588 (+) | 0.985 | 0.945 | caAACAAatgc P$C1_Q2 | 7722 (-) | 0.981 | 0.948 | tcagCAGTTtg P$C1_Q2 | 7826 (+) | 0.985 | 0.945 | caAACAAttaa P$C1_Q2 | 7928 (-) 
| 0.981 | 0.947 | caatTCGTTtg P$C1_Q2 | 7967 (-) | 0.985 | 0.949 | atagTTGTTtg P$C1_Q2 | 8001 (-) | 0.966 | 0.946 | gtgcCTGTTgg P$C1_Q2 | 8356 (+) | 0.978 | 0.946 | ccAACCCctgt P$C1_Q2 | 8667 (+) | 0.996 | 0.992 | caAACCAccgg P$C1_Q2 | 8671 (-) | 0.978 | 0.944 | ccacCGGTTag P$C1_Q2 | 8796 (-) | 0.981 | 0.946 | tgtaGAGTTgg P$C1_Q2 | 8934 (+) | 1.000 | 0.942 | taAACTAaagc P$C1_Q2 | 9223 (-) | 0.981 | 0.949 | aggcAAGTTag P$C1_Q2 | 9314 (-) | 0.963 | 0.942 | ggagGCGTTgg P$C1_Q2 | 9417 (+) | 1.000 | 0.951 | aaAACTAtccc P$C1_Q2 | 9538 (+) | 0.985 | 0.942 | caAACAAgtcc P$C1_Q2 | 9627 (-) | 0.981 | 0.949 | tcggGAGTTca P$C1_Q2 | 10025 (-) | 1.000 | 0.962 | accgTAGTTtg P$C1_Q2 | 10405 (-) | 0.996 | 0.989 | gaggTGGTTgg P$C1_Q2 | 10617 (-) | 0.981 | 0.943 | gaggGAGTTgt P$C1_Q2 | 10756 (-) | 0.966 | 0.951 | ccgtGTGTTtg P$C1_Q2 | 10963 (-) | 0.966 | 0.946 | gtgcCTGTTgg P$C1_Q2 | 11038 (-) | 0.985 | 0.952 | actgTTGTTgg P$C1_Q2 | 11125 (+) | 0.981 | 0.948 | ttAACTCgctg P$C1_Q2 | 11218 (+) | 0.981 | 0.950 | tgAACTGccca P$C1_Q2 | 11711 (+) | 0.981 | 0.967 | ccAACTGccct P$C1_Q2 | 11816 (-) | 0.978 | 0.955 | cccaGGGTTgg P$C1_Q2 | 11895 (+) | 0.996 | 0.950 | tcAACCAatag P$C1_Q2 | 11922 (+) | 0.963 | 0.950 | ccAACGCccat P$O2_Q2 | 15 (+) | 0.858 | 0.811 | aagtaCATGTgaa P$O2_Q2 | 37 (-) | 0.862 | 0.842 | acaACTTCagatg P$O2_Q2 | 74 (+) | 0.890 | 0.857 | ttgttGATGTtga P$O2_Q2 | 282 (-) | 0.844 | 0.823 | tgcAAGTCaagtg P$O2_Q2 | 350 (-) | 0.890 | 0.888 | tctACATCagatt P$O2_Q2 | 431 (+) | 0.969 | 0.885 | aatacCACGTtta P$O2_Q2 | 434 (-) | 0.906 | 0.836 | accACGTTtaatc P$O2_Q2 | 571 (-) | 1.000 | 0.837 | taaACGTCtacct P$O2_Q2 | 732 (+) | 0.969 | 0.865 | gccacCACGTggt P$O2_Q2 | 735 (-) | 0.969 | 0.865 | accACGTGgttgc P$O2_Q2 | 1077 (-) | 0.844 | 0.811 | tgtAGGTCatttt P$O2_Q2 | 1126 (-) | 0.969 | 0.818 | tgcACGTGtgttc P$O2_Q2 | 1238 (-) | 0.890 | 0.812 | ttgACATCagggc P$O2_Q2 | 3287 (+) | 0.862 | 0.847 | gggatGAGGTgaa P$O2_Q2 | 3432 (+) | 0.844 | 0.811 | taattGACTTagg P$O2_Q2 | 4510 (-) | 0.890 | 
0.823 | acaACATCagacc P$O2_Q2 | 4694 (+) | 0.890 | 0.829 | aacttGATGTcct P$O2_Q2 | 6695 (+) | 0.862 | 0.832 | gtgctGAAGTaga P$O2_Q2 | 6853 (+) | 0.969 | 0.810 | gtgagCACGTcca P$O2_Q2 | 6883 (+) | 0.969 | 0.834 | agcacCACGTaag P$O2_Q2 | 6886 (-) | 0.906 | 0.856 | accACGTAagaca P$O2_Q2 | 7581 (-) | 0.844 | 0.819 | atcAGGTCaaaca P$O2_Q2 | 7881 (-) | 0.844 | 0.818 | tagAAGTCatttg P$O2_Q2 | 8730 (-) | 1.000 | 0.812 | gtaACGTCggggt P$O2_Q2 | 8789 (+) | 0.890 | 0.839 | aagaaGATGTaga P$O2_Q2 | 9052 (-) | 0.858 | 0.810 | tctACATGgcaag P$O2_Q2 | 9610 (+) | 0.862 | 0.828 | aagctGAGGTggg P$O2_Q2 | 9635 (-) | 0.844 | 0.841 | tcaAGGTCagctt P$O2_Q2 | 9801 (-) | 0.890 | 0.845 | tgtACATCacata P$O2_Q2 | 9815 (-) | 0.862 | 0.814 | gtaACTTCaaatt P$O2_Q2 | 10442 (-) | 1.000 | 0.821 | gcaACGTCtcttt P$O2_Q2 | 11054 (+) | 0.906 | 0.853 | ttgttAACGTtgt P$O2_Q2 | 11057 (-) | 0.906 | 0.812 | ttaACGTTgtgat P$O2_Q2 | 11110 (-) | 0.862 | 0.848 | acgACTTCagatc P$O2_Q2 | 11332 (+) | 0.890 | 0.836 | atgacGATGTaaa P$O2_Q2 | 11779 (+) | 0.890 | 0.834 | tacttGATGTaag V$VDR_Q3 | 415 (+) | 1.000 | 0.838 | GGGTGagcttgtcca V$VDR_Q3 | 1320 (-) | 0.846 | 0.876 | actcccttctCACTC V$VDR_Q3 | 1381 (+) | 0.846 | 0.860 | GGGCGaggtagggca V$VDR_Q3 | 1391 (+) | 0.843 | 0.854 | GGGCAggtgaggaga V$VDR_Q3 | 1488 (+) | 0.802 | 0.846 | GAGGGaggggggggg V$VDR_Q3 | 1490 (+) | 0.846 | 0.867 | GGGAGggggggggga V$VDR_Q3 | 1496 (+) | 0.956 | 0.881 | GGGGGgggagggagg V$VDR_Q3 | 1499 (+) | 0.956 | 0.855 | GGGGGagggagggag V$VDR_Q3 | 1500 (+) | 0.953 | 0.880 | GGGGAgggagggagg V$VDR_Q3 | 1574 (+) | 0.843 | 0.831 | GGGAAtgcatggaga V$VDR_Q3 | 1604 (+) | 0.956 | 0.832 | GGGGCtcagggggtc V$VDR_Q3 | 1605 (+) | 0.837 | 0.839 | GGGCTcagggggtct V$VDR_Q3 | 1613 (+) | 0.947 | 0.848 | GGGGTctcgtggaca V$VDR_Q3 | 4047 (+) | 0.956 | 0.877 | GGGGGaaataggcaa V$VDR_Q3 | 5503 (-) | 0.843 | 0.841 | gctcccaggtTTCCC V$VDR_Q3 | 5766 (+) | 0.997 | 0.829 | GGGTAaaaggagaga V$VDR_Q3 | 6024 (-) | 1.000 | 0.837 | tcaactctagGACCC V$VDR_Q3 | 6180 (-) | 1.000 | 
0.825 | ttcccagagaCACCC V$VDR_Q3 | 6902 (+) | 0.843 | 0.838 | GGGCAtaagtggaga V$VDR_Q3 | 8194 (-) | 0.953 | 0.853 | tctgcctgttTCCCC V$VDR_Q3 | 8759 (-) | 0.802 | 0.835 | tgtcctcccaGCACC V$VDR_Q3 | 8963 (+) | 0.846 | 0.850 | GGGAGaaccaggtgt V$VDR_Q3 | 9572 (+) | 1.000 | 0.831 | GGGTGtggtggtata V$VDR_Q3 | 9585 (-) | 0.837 | 0.826 | tagccctgtaATCCC V$VDR_Q3 | 9629 (+) | 0.846 | 0.829 | GGGAGttcaaggtca V$VDR_Q3 | 9686 (+) | 0.956 | 0.826 | GGGGGgtggtggtgg V$VDR_Q3 | 9689 (+) | 1.000 | 0.847 | GGGTGgtggtggtgg V$VDR_Q3 | 11706 (-) | 0.843 | 0.858 | tcaccccaacTGCCC V$VDR_Q3 | 11852 (+) | 0.947 | 0.866 | GGGGTatgggggtag V$VDR_Q3 | 11853 (+) | 0.997 | 0.841 | GGGTAtgggggtagg V$VDR_Q3 | 11861 (+) | 0.997 | 0.826 | GGGTAggtccggggg V$SPZ1_01 | 1488 (+) | 1.000 | 0.920 | gagGGAGGggggggg V$SPZ1_01 | 1491 (+) | 0.998 | 0.918 | ggaGGGGGgggggag V$SPZ1_01 | 1494 (+) | 0.998 | 0.927 | gggGGGGGggaggga V$SPZ1_01 | 1495 (+) | 0.998 | 0.950 | gggGGGGGgagggag V$SPZ1_01 | 1499 (+) | 1.000 | 0.952 | gggGGAGGgagggag V$SPZ1_01 | 1503 (+) | 1.000 | 0.951 | gagGGAGGgagggag V$SPZ1_01 | 1507 (+) | 1.000 | 0.951 | gagGGAGGgagggag V$SPZ1_01 | 1511 (+) | 1.000 | 0.940 | gagGGAGGgaggtag V$SPZ1_01 | 1871 (+) | 1.000 | 0.921 | aagGGAGGatctgcc V$SPZ1_01 | 3177 (-) | 0.998 | 0.917 | attatatCCCCCctc V$SPZ1_01 | 3178 (-) | 0.998 | 0.940 | ttatatcCCCCCtct V$SPZ1_01 | 3336 (-) | 1.000 | 0.948 | ctatttcCCTCCtgt V$SPZ1_01 | 3776 (-) | 0.998 | 0.956 | ctccttcCCCCCttt V$SPZ1_01 | 4175 (+) | 1.000 | 0.920 | tcaGGAGGctgagca V$SPZ1_01 | 4353 (+) | 1.000 | 0.920 | gcaGGAGGatcagaa V$SPZ1_01 | 5421 (-) | 1.000 | 0.951 | atctttcCCTCCact V$SPZ1_01 | 7501 (+) | 1.000 | 0.940 | cttGGAGGgtgtcag V$SPZ1_01 | 8112 (-) | 0.989 | 0.922 | cactctcCCACCctg V$SPZ1_01 | 8482 (+) | 0.998 | 0.925 | taaGGGGGttatttt V$SPZ1_01 | 9614 (+) | 0.989 | 0.929 | tgaGGTGGgaggatc V$SPZ1_01 | 9618 (+) | 1.000 | 0.921 | gtgGGAGGatcggga V$SPZ1_01 | 9682 (+) | 0.998 | 0.926 | ggaGGGGGggtggtg V$SPZ1_01 | 9683 (+) | 0.998 | 0.967 | gagGGGGGgtggtgg 
V$SPZ1_01 | 10613 (+) | 1.000 | 0.944 | gtgGGAGGgagttgt V$SPZ1_01 | 10737 (+) | 0.998 | 0.960 | attGGGGGgtttcag V$SPZ1_01 | 10738 (+) | 0.998 | 0.915 | ttgGGGGGtttcagt V$SPZ1_01 | 10875 (+) | 0.989 | 0.927 | ggtGGTGGgagcact V$SPZ1_01 | 11712 (-) | 1.000 | 0.928 | caactgcCCTCCacc V$SPZ1_01 | 11842 (+) | 0.998 | 0.967 | agtGGGGGgtggggt V$SPZ1_01 | 11871 (+) | 1.000 | 0.944 | gggGGAGGgacttag V$AR_Q2 | 2810 (-) | 0.875 | 0.837 | TGAACaccgtgctct V$AR_Q2 | 3024 (-) | 0.939 | 0.793 | AGTACaccgacttcc V$AR_Q2 | 4692 (+) | 0.899 | 0.785 | gaaacttgatGTCCT V$AR_Q2 | 6518 (+) | 0.875 | 0.853 | agaacacgaaGTTCA V$AR_Q2 | 6518 (-) | 1.000 | 0.877 | AGAACacgaagttca V$AR_Q2 | 7805 (-) | 1.000 | 0.867 | AGAACaacgacaact V$AR_Q2 | 8534 (-) | 1.000 | 0.783 | AGAACacagtgggcc V$AR_Q2 | 8966 (-) | 1.000 | 0.784 | AGAACcaggtgtggt V$AR_Q2 | 9465 (+) | 1.000 | 0.883 | tgtacaggatGTTCT V$AR_Q2 | 9465 (-) | 0.814 | 0.873 | TGTACaggatgttct V$AR_Q2 | 10062 (-) | 0.875 | 0.783 | TGAACcgggtcttct V$AR_Q2 | 10080 (+) | 0.899 | 0.789 | agggcagccaGTGCT V$AR_Q2 | 10194 (+) | 0.899 | 0.809 | agtcagttctGTCCT V$AR_Q2 | 11223 (+) | 1.000 | 0.785 | tgcccagtttGTTCT V$ZIC1_01 | 105 (+) | 0.817 | 0.781 | ggGGGTGgc V$ZIC1_01 | 106 (+) | 1.000 | 0.905 | ggGGTGGcc V$ZIC1_01 | 137 (-) | 0.819 | 0.774 | gtCAATCca V$ZIC1_01 | 182 (-) | 0.940 | 0.860 | taCTACCcg V$ZIC1_01 | 200 (-) | 1.000 | 0.762 | caCCACCat V$ZIC1_01 | 249 (+) | 0.782 | 0.778 | tgGGCTGta V$ZIC1_01 | 419 (+) | 0.833 | 0.801 | gaGCTTGtc V$ZIC1_01 | 446 (+) | 0.940 | 0.858 | caGGTTGtc V$ZIC1_01 | 459 (+) | 0.940 | 0.757 | ttGGTTGgc V$ZIC1_01 | 460 (+) | 0.901 | 0.754 | tgGTTGGct V$ZIC1_01 | 576 (-) | 0.940 | 0.749 | gtCTACCtg V$ZIC1_01 | 687 (-) | 0.718 | 0.817 | gaCACACca V$ZIC1_01 | 782 (+) | 0.682 | 0.773 | cgGGTGCtc V$ZIC1_01 | 1008 (-) | 0.722 | 0.785 | gaCGCACct V$ZIC1_01 | 1184 (-) | 0.817 | 0.787 | ggCACCCcc V$ZIC1_01 | 1279 (+) | 0.782 | 0.750 | tgGGATGac V$ZIC1_01 | 1330 (-) | 0.817 | 0.791 | caCTCCCca V$ZIC1_01 | 1346 (+) | 0.833 | 0.771 | 
agGCTTGcc V$ZIC1_01 | 1404 (+) | 0.778 | 0.764 | gaGTGGGtc V$ZIC1_01 | 1516 (+) | 0.842 | 0.782 | agGGAGGta V$ZIC1_01 | 1610 (+) | 0.877 | 0.817 | caGGGGGtc V$ZIC1_01 | 1632 (-) | 0.894 | 0.787 | gtCCAGCct V$ZIC1_01 | 1681 (+) | 0.817 | 0.781 | tcGGGTGtc V$ZIC1_01 | 1696 (+) | 0.940 | 0.747 | ggGGTAGat V$ZIC1_01 | 1812 (+) | 1.000 | 0.874 | agGGTGGtg V$ZIC1_01 | 1840 (+) | 0.944 | 0.955 | ggGGTCGtc V$ZIC1_01 | 1943 (+) | 0.833 | 0.801 | tgGCTAGgc V$ZIC1_01 | 1972 (+) | 0.940 | 0.766 | caGGTTGgc V$ZIC1_01 | 1973 (+) | 0.901 | 0.815 | agGTTGGcc V$ZIC1_01 | 2429 (+) | 0.940 | 0.770 | ggGGTTGgg V$ZIC1_01 | 2637 (-) | 0.940 | 0.753 | ctCTACCcc V$ZIC1_01 | 2792 (+) | 0.894 | 0.848 | taGCTGGtc V$ZIC1_01 | 2921 (-) | 0.687 | 0.789 | gaCGGACcc V$ZIC1_01 | 3032 (-) | 0.782 | 0.749 | gaCTTCCgc V$ZIC1_01 | 3040 (-) | 1.000 | 0.780 | ctCCACCcg V$ZIC1_01 | 3050 (-) | 0.682 | 0.759 | gaCATACct V$ZIC1_01 | 3245 (-) | 0.894 | 0.751 | tcCCAGCcc V$ZIC1_01 | 3357 (-) | 0.682 | 0.794 | gaCAGACca V$ZIC1_01 | 3502 (-) | 0.940 | 0.830 | aaCTACCct V$ZIC1_01 | 3523 (-) | 1.000 | 0.796 | ggCCACCtt V$ZIC1_01 | 3636 (-) | 0.743 | 0.753 | taCCTACca V$ZIC1_01 | 4206 (+) | 0.833 | 0.771 | agGCTTGcc V$ZIC1_01 | 4244 (+) | 0.819 | 0.794 | ggGATTGta V$ZIC1_01 | 4257 (+) | 0.940 | 0.752 | ttGGTTGtt V$ZIC1_01 | 4341 (+) | 1.000 | 0.806 | caGGTGGtg V$ZIC1_01 | 4413 (-) | 0.940 | 0.752 | caCAACCtt V$ZIC1_01 | 4469 (-) | 0.682 | 0.794 | gaCTGACca V$ZIC1_01 | 4473 (-) | 0.894 | 0.784 | gaCCAGCat V$ZIC1_01 | 4632 (+) | 0.940 | 0.795 | tgGGTAGca V$ZIC1_01 | 4740 (+) | 0.880 | 0.789 | ctGATGGtc V$ZIC1_01 | 4828 (+) | 0.940 | 0.770 | ggGGTAGgg V$ZIC1_01 | 4862 (+) | 0.842 | 0.863 | agGGAGGtc V$ZIC1_01 | 4963 (+) | 0.782 | 0.755 | caGGAAGtc V$ZIC1_01 | 5060 (+) | 0.940 | 0.767 | caGGTAGtg V$ZIC1_01 | 5140 (-) | 0.940 | 0.769 | gtCAACCta V$ZIC1_01 | 5295 (+) | 0.833 | 0.798 | ggGCTAGcc V$ZIC1_01 | 5296 (-) | 0.833 | 0.806 | ggCTAGCca V$ZIC1_01 | 5688 (-) | 0.778 | 0.773 | gaCCCACta V$ZIC1_01 | 5857 (+) | 0.901 
| 0.748 | gaGTTGGtt V$ZIC1_01 | 5873 (-) | 0.880 | 0.830 | gaCCATCtc V$ZIC1_01 | 5895 (-) | 0.771 | 0.757 | ggCCCGCcc V$ZIC1_01 | 5896 (-) | 0.842 | 0.797 | gcCCGCCcc V$ZIC1_01 | 5930 (-) | 0.817 | 0.763 | gaCTCCCtt V$ZIC1_01 | 6056 (-) | 0.782 | 0.823 | gaCAGCCct V$ZIC1_01 | 6121 (+) | 0.833 | 0.787 | ggGCTTGtt V$ZIC1_01 | 6385 (-) | 0.845 | 0.796 | gaCGAACtg V$ZIC1_01 | 6438 (-) | 0.940 | 0.852 | gtCAACCca V$ZIC1_01 | 6540 (-) | 0.841 | 0.776 | gaCAAACgg V$ZIC1_01 | 6598 (+) | 1.000 | 0.791 | aaGGTGGtg V$ZIC1_01 | 6717 (-) | 0.940 | 0.752 | caCAACCtt V$ZIC1_01 | 6753 (-) | 0.819 | 0.783 | caCTATCcc V$ZIC1_01 | 6986 (+) | 0.894 | 0.758 | taGCTGGtg V$ZIC1_01 | 7176 (+) | 0.880 | 0.819 | caGATGGtc V$ZIC1_01 | 7393 (+) | 1.000 | 0.817 | caGGTGGta V$ZIC1_01 | 7503 (+) | 0.757 | 0.752 | tgGAGGGtg V$ZIC1_01 | 7505 (+) | 0.817 | 0.790 | gaGGGTGtc V$ZIC1_01 | 7538 (+) | 0.940 | 0.758 | ttGGTTGtg V$ZIC1_01 | 7701 (+) | 0.842 | 0.817 | tgGGAGGta V$ZIC1_01 | 7813 (-) | 0.642 | 0.760 | gaCAACTcc V$ZIC1_01 | 8117 (-) | 1.000 | 0.793 | tcCCACCct V$ZIC1_01 | 8354 (-) | 0.901 | 0.761 | tgCCAACcc V$ZIC1_01 | 8355 (-) | 0.940 | 0.861 | gcCAACCcc V$ZIC1_01 | 8400 (-) | 0.833 | 0.801 | gcCAAGCca V$ZIC1_01 | 8405 (-) | 0.782 | 0.747 | gcCAGCCcg V$ZIC1_01 | 8440 (-) | 0.940 | 0.775 | tgCAACCcg V$ZIC1_01 | 8669 (-) | 1.000 | 0.783 | aaCCACCgg V$ZIC1_01 | 8737 (+) | 0.817 | 0.770 | cgGGGTGtg V$ZIC1_01 | 8743 (+) | 1.000 | 0.793 | gtGGTGGcc V$ZIC1_01 | 8760 (-) | 0.842 | 0.789 | gtCCTCCca V$ZIC1_01 | 8774 (+) | 0.842 | 0.812 | tgGGAGGcc V$ZIC1_01 | 8976 (+) | 1.000 | 0.799 | gtGGTGGta V$ZIC1_01 | 9014 (+) | 0.782 | 0.758 | ggGGCAGgc V$ZIC1_01 | 9069 (+) | 0.833 | 0.771 | agGCTAGcc V$ZIC1_01 | 9070 (-) | 0.833 | 0.806 | ggCTAGCca V$ZIC1_01 | 9189 (-) | 0.703 | 0.807 | gaCCACTca V$ZIC1_01 | 9338 (-) | 0.626 | 0.749 | gaGGACCcc V$ZIC1_01 | 9419 (-) | 0.819 | 0.778 | aaCTATCcc V$ZIC1_01 | 9549 (+) | 0.642 | 0.768 | tgCGTAGtc V$ZIC1_01 | 9576 (+) | 1.000 | 0.799 | gtGGTGGta V$ZIC1_01 | 9685 
(+) | 0.877 | 0.821 | ggGGGGGtg V$ZIC1_01 | 9688 (+) | 1.000 | 0.900 | ggGGTGGtg V$ZIC1_01 | 9691 (+) | 1.000 | 0.788 | gtGGTGGtg V$ZIC1_01 | 9694 (+) | 1.000 | 0.788 | gtGGTGGtg V$ZIC1_01 | 9697 (+) | 1.000 | 0.788 | gtGGTGGtg V$ZIC1_01 | 9700 (+) | 1.000 | 0.788 | gtGGTGGtg V$ZIC1_01 | 9703 (+) | 1.000 | 0.788 | gtGGTGGtg V$ZIC1_01 | 9706 (+) | 1.000 | 0.788 | gtGGTGGtg V$ZIC1_01 | 10114 (-) | 0.880 | 0.747 | tgCCATCcc V$ZIC1_01 | 10115 (-) | 0.782 | 0.758 | gcCATCCcc V$ZIC1_01 | 10237 (+) | 0.901 | 0.810 | agGTTGGtg V$ZIC1_01 | 10240 (+) | 1.000 | 0.888 | ttGGTGGtc V$ZIC1_01 | 10249 (+) | 0.833 | 0.766 | agGCTTGtg V$ZIC1_01 | 10405 (+) | 1.000 | 0.812 | gaGGTGGtt V$ZIC1_01 | 10408 (+) | 0.940 | 0.748 | gtGGTTGgc V$ZIC1_01 | 10409 (+) | 0.901 | 0.770 | tgGTTGGca V$ZIC1_01 | 10480 (-) | 0.894 | 0.931 | gaCCAGCca V$ZIC1_01 | 10645 (-) | 0.817 | 0.748 | gaCACCCag V$ZIC1_01 | 10739 (+) | 0.877 | 0.824 | tgGGGGGtt V$ZIC1_01 | 10792 (+) | 0.782 | 0.759 | ggGGATGtg V$ZIC1_01 | 10873 (+) | 1.000 | 0.788 | gtGGTGGtg V$ZIC1_01 | 11016 (+) | 0.940 | 0.754 | agGGTAGga V$ZIC1_01 | 11220 (-) | 0.782 | 0.762 | aaCTGCCca V$ZIC1_01 | 11705 (-) | 0.817 | 0.773 | gtCACCCca V$ZIC1_01 | 11809 (+) | 1.000 | 0.796 | aaGGTGGcc V$ZIC1_01 | 11844 (+) | 0.877 | 0.830 | tgGGGGGtg V$ZIC1_01 | 11847 (+) | 1.000 | 0.809 | ggGGTGGgg V$ZIC1_01 | 11860 (+) | 0.940 | 0.765 | ggGGTAGgt V$ZIC1_01 | 11861 (+) | 0.743 | 0.825 | ggGTAGGtc V$ZIC3_01 | 106 (+) | 1.000 | 0.939 | gGGGTGgcc V$ZIC3_01 | 182 (-) | 0.921 | 0.867 | tacTACCCg V$ZIC3_01 | 414 (+) | 1.000 | 0.909 | tGGGTGagc V$ZIC3_01 | 534 (+) | 1.000 | 0.919 | tGGGTGatg V$ZIC3_01 | 782 (+) | 1.000 | 0.969 | cGGGTGctc V$ZIC3_01 | 1183 (-) | 1.000 | 0.913 | gggCACCCc V$ZIC3_01 | 1185 (-) | 0.966 | 0.877 | gcaCCCCCa V$ZIC3_01 | 1310 (-) | 0.887 | 0.872 | gagTCCCCt V$ZIC3_01 | 1318 (-) | 0.948 | 0.857 | taaCTCCCt V$ZIC3_01 | 1330 (-) | 0.887 | 0.851 | cacTCCCCa V$ZIC3_01 | 1355 (+) | 0.916 | 0.853 | tGGGTCacc V$ZIC3_01 | 1516 (+) | 0.948 | 0.887 | 
aGGGAGgta V$ZIC3_01 | 1612 (+) | 0.878 | 0.864 | gGGGGTctc V$ZIC3_01 | 1682 (+) | 1.000 | 0.852 | cGGGTGtca V$ZIC3_01 | 1706 (-) | 1.000 | 0.911 | aagCACCCt V$ZIC3_01 | 1812 (+) | 1.000 | 0.937 | aGGGTGgtg V$ZIC3_01 | 1829 (-) | 1.000 | 0.864 | agaCACCCa V$ZIC3_01 | 1840 (+) | 0.916 | 0.921 | gGGGTCgtc V$ZIC3_01 | 3040 (-) | 1.000 | 0.874 | ctcCACCCg V$ZIC3_01 | 3081 (+) | 1.000 | 0.911 | aGGGTGatt V$ZIC3_01 | 3181 (-) | 0.966 | 0.876 | tatCCCCCc V$ZIC3_01 | 3502 (-) | 0.921 | 0.871 | aacTACCCt V$ZIC3_01 | 3723 (+) | 1.000 | 0.862 | cGGGTGact V$ZIC3_01 | 4712 (+) | 0.921 | 0.853 | tGGGTAatt V$ZIC3_01 | 4862 (+) | 0.948 | 0.948 | aGGGAGgtc V$ZIC3_01 | 5008 (-) | 0.921 | 0.847 | tatTACCCa V$ZIC3_01 | 5097 (+) | 0.912 | 0.846 | tGGGTTatt V$ZIC3_01 | 5371 (+) | 0.860 | 0.848 | cGGGATttc V$ZIC3_01 | 5896 (-) | 0.933 | 0.869 | gccCGCCCc V$ZIC3_01 | 6056 (-) | 0.845 | 0.862 | gacAGCCCt V$ZIC3_01 | 6187 (-) | 1.000 | 0.864 | agaCACCCa V$ZIC3_01 | 6438 (-) | 0.912 | 0.861 | gtcAACCCa V$ZIC3_01 | 6528 (-) | 1.000 | 0.900 | gttCACCCt V$ZIC3_01 | 6630 (-) | 1.000 | 0.922 | gggCACCCa V$ZIC3_01 | 6729 (+) | 1.000 | 0.900 | aGGGTGagc V$ZIC3_01 | 6746 (-) | 0.912 | 0.897 | gaaAACCCa V$ZIC3_01 | 6938 (-) | 0.921 | 0.853 | cagTACCCa V$ZIC3_01 | 7196 (+) | 0.966 | 0.871 | cGGGGGtgc V$ZIC3_01 | 7198 (+) | 1.000 | 0.904 | gGGGTGcta V$ZIC3_01 | 7506 (+) | 1.000 | 0.849 | aGGGTGtca V$ZIC3_01 | 7701 (+) | 0.948 | 0.895 | tGGGAGgta V$ZIC3_01 | 7731 (+) | 0.849 | 0.848 | tGGGCCatc V$ZIC3_01 | 7954 (-) | 0.878 | 0.873 | gatACCCCa V$ZIC3_01 | 8098 (+) | 0.916 | 0.850 | tGGGTCctg V$ZIC3_01 | 8117 (-) | 1.000 | 0.865 | tccCACCCt V$ZIC3_01 | 8235 (-) | 1.000 | 0.850 | ccaCACCCa V$ZIC3_01 | 8355 (-) | 0.912 | 0.852 | gccAACCCc V$ZIC3_01 | 8484 (+) | 0.966 | 0.873 | aGGGGGtta V$ZIC3_01 | 8760 (-) | 0.948 | 0.891 | gtcCTCCCa V$ZIC3_01 | 8774 (+) | 0.948 | 0.905 | tGGGAGgcc V$ZIC3_01 | 8962 (+) | 0.948 | 0.857 | aGGGAGaac V$ZIC3_01 | 9338 (-) | 0.916 | 0.895 | gagGACCCc V$ZIC3_01 | 9376 (+) | 0.948 
| 0.857 | aGGGAGagc V$ZIC3_01 | 9571 (+) | 1.000 | 0.850 | tGGGTGtgg V$ZIC3_01 | 9628 (+) | 0.948 | 0.921 | cGGGAGttc V$ZIC3_01 | 9685 (+) | 0.966 | 0.908 | gGGGGGgtg V$ZIC3_01 | 9688 (+) | 1.000 | 0.936 | gGGGTGgtg V$ZIC3_01 | 9930 (+) | 0.921 | 0.853 | tGGGTAatt V$ZIC3_01 | 9952 (-) | 1.000 | 0.901 | taaCACCCt V$ZIC3_01 | 10067 (+) | 0.916 | 0.895 | cGGGTCttc V$ZIC3_01 | 10618 (+) | 0.948 | 0.864 | aGGGAGttg V$ZIC3_01 | 10644 (-) | 1.000 | 0.864 | agaCACCCa V$ZIC3_01 | 10694 (-) | 1.000 | 0.910 | caaCACCCg V$ZIC3_01 | 10739 (+) | 0.966 | 0.917 | tGGGGGgtt V$ZIC3_01 | 10740 (+) | 0.966 | 0.878 | gGGGGGttt V$ZIC3_01 | 10741 (+) | 0.878 | 0.860 | gGGGGTttc V$ZIC3_01 | 10790 (+) | 0.966 | 0.883 | aGGGGGatg V$ZIC3_01 | 10880 (+) | 0.948 | 0.865 | tGGGAGcac V$ZIC3_01 | 11552 (+) | 0.921 | 0.904 | tGGGTAttc V$ZIC3_01 | 11704 (-) | 1.000 | 0.852 | tgtCACCCc V$ZIC3_01 | 11844 (+) | 0.966 | 0.917 | tGGGGGgtg V$ZIC3_01 | 11847 (+) | 1.000 | 0.870 | gGGGTGggg V$ZIC3_01 | 11923 (-) | 0.933 | 0.859 | caaCGCCCa V$NKX3A_01 | 660 (+) | 1.000 | 0.953 | atatAAGTAcac V$NKX3A_01 | 2569 (+) | 1.000 | 0.944 | ccgtAAGTAttt V$NKX3A_01 | 11437 (-) | 1.000 | 0.955 | gcaTACTTatct V$IRF7_01 | 897 (-) | 0.976 | 0.931 | agacttttcacTTTCTtt V$IRF7_01 | 3328 (-) | 1.000 | 0.905 | tcctgattctaTTTCCct V$IRF7_01 | 6516 (+) | 0.850 | 0.880 | aaAGAACacgaagttcac V$IRF7_01 | 8548 (+) | 0.976 | 0.896 | cgAGAAAatgaacaatgt V$IRF7_01 | 10579 (-) | 1.000 | 0.915 | ctttggttcaaTTTCCta V$IRF7_01 | 11154 (+) | 0.971 | 0.919 | gaTGAAAatgaagctgca V$IRF7_01 | 11266 (+) | 0.976 | 0.950 | caAGAAAaggaaattaag V$MRF2_01 | 211 (-) | 1.000 | 0.894 | ctgGTATTtttttt V$MRF2_01 | 425 (+) | 1.000 | 0.924 | gtccaaAATACcac V$MRF2_01 | 510 (-) | 0.859 | 0.844 | tttGTCTTctgatt V$MRF2_01 | 822 (+) | 1.000 | 0.843 | ctttttAATACtat V$MRF2_01 | 1848 (+) | 0.914 | 0.874 | caacacAATAGagc V$MRF2_01 | 1904 (-) | 0.906 | 0.852 | ttaTTATTttattt V$MRF2_01 | 2359 (-) | 1.000 | 0.856 | cttGTATTtatttt V$MRF2_01 | 2572 (-) | 1.000 | 0.855 | 
taaGTATTtacaac V$MRF2_01 | 3142 (+) | 0.931 | 0.851 | cacaacAATATatg V$MRF2_01 | 3798 (+) | 0.914 | 0.853 | ggacagAATAGaaa V$MRF2_01 | 4011 (-) | 0.931 | 0.871 | taaATATTctgctc V$MRF2_01 | 4063 (+) | 0.931 | 0.850 | aattaaAATATgta V$MRF2_01 | 4168 (-) | 1.000 | 0.879 | ctaGTATTcaggag V$MRF2_01 | 4304 (+) | 1.000 | 0.905 | accctcAATACtgc V$MRF2_01 | 5004 (-) | 1.000 | 0.860 | cttGTATTacccat V$MRF2_01 | 5439 (-) | 1.000 | 0.915 | tttGTATTgccata V$MRF2_01 | 5582 (+) | 0.906 | 0.861 | aaggatAATAAtaa V$MRF2_01 | 5595 (+) | 0.931 | 0.855 | aaatatAATATaac V$MRF2_01 | 5641 (+) | 0.906 | 0.851 | agacaaAATAAaaa V$MRF2_01 | 5663 (+) | 0.906 | 0.863 | gaacacAATAAgag V$MRF2_01 | 5737 (+) | 0.931 | 0.877 | agccatAATATaga V$MRF2_01 | 5777 (+) | 0.931 | 0.861 | gagaaaAATATgaa V$MRF2_01 | 5792 (+) | 1.000 | 0.882 | taattaAATACaaa V$MRF2_01 | 6583 (+) | 0.847 | 0.864 | aaacacAAAACaaa V$MRF2_01 | 7046 (-) | 0.914 | 0.843 | tgaCTATTtttttt V$MRF2_01 | 7330 (+) | 0.914 | 0.854 | tggaacAATAGaca V$MRF2_01 | 7376 (-) | 0.931 | 0.852 | ttgATATTgcttgt V$MRF2_01 | 7895 (-) | 0.906 | 0.864 | tttTTATTttgata V$MRF2_01 | 7903 (-) | 0.931 | 0.848 | ttgATATTagcttt V$MRF2_01 | 8068 (-) | 1.000 | 0.891 | ctgGTATTgcagaa V$MRF2_01 | 8600 (-) | 0.931 | 0.877 | ttaATATTatcatt V$MRF2_01 | 8698 (+) | 1.000 | 0.857 | tctgaaAATACcat V$MRF2_01 | 9489 (+) | 0.914 | 0.857 | atttgcAATAGaaa V$MRF2_01 | 9755 (-) | 0.931 | 0.864 | ttaATATTgaaaac V$MRF2_01 | 10143 (-) | 0.906 | 0.862 | ttaTTATTattgtt V$MRF2_01 | 10146 (-) | 0.906 | 0.894 | ttaTTATTgttgtt V$MRF2_01 | 10318 (-) | 0.931 | 0.872 | tttATATTtttatt V$MRF2_01 | 10338 (-) | 0.906 | 0.871 | tttTTATTgtttta V$MRF2_01 | 11298 (+) | 0.906 | 0.851 | aataatAATAAcga V$MRF2_01 | 11444 (-) | 0.914 | 0.859 | tatCTATTttggtg V$MRF2_01 | 11510 (-) | 0.931 | 0.846 | tggATATTaccgtt V$MRF2_01 | 11552 (-) | 1.000 | 0.867 | tggGTATTcaacca I$DRI_01 | 64 (+) | 1.000 | 0.955 | atAATTActg I$DRI_01 | 439 (-) | 1.000 | 0.984 | gttTAATCag I$DRI_01 | 586 (+) | 1.000 | 0.985 | 
tgAATTAaca I$DRI_01 | 990 (-) | 1.000 | 0.951 | taaTAATTtt I$DRI_01 | 1648 (-) | 1.000 | 0.947 | tgcTAATCta I$DRI_01 | 2025 (+) | 1.000 | 0.983 | ggGATTAaag I$DRI_01 | 3106 (-) | 1.000 | 0.956 | tacTAATTtc I$DRI_01 | 3126 (-) | 1.000 | 0.999 | tatTAATTta I$DRI_01 | 3429 (-) | 1.000 | 0.987 | gctTAATTga I$DRI_01 | 3732 (+) | 1.000 | 0.951 | aaAATTAtta I$DRI_01 | 3923 (+) | 1.000 | 0.948 | tgAATTAcat I$DRI_01 | 4029 (+) | 1.000 | 0.993 | ggAATTAata I$DRI_01 | 4059 (-) | 1.000 | 0.956 | caaTAATTaa I$DRI_01 | 4061 (+) | 1.000 | 0.984 | atAATTAaaa I$DRI_01 | 4446 (+) | 1.000 | 0.996 | acAATTAata I$DRI_01 | 5789 (-) | 1.000 | 0.956 | aaaTAATTaa I$DRI_01 | 5791 (+) | 1.000 | 0.984 | atAATTAaat I$DRI_01 | 5989 (+) | 1.000 | 0.948 | tgGATTAggg I$DRI_01 | 7059 (-) | 1.000 | 0.981 | tctTAATTtg I$DRI_01 | 7354 (-) | 1.000 | 0.981 | tgtTAATCat I$DRI_01 | 7401 (+) | 1.000 | 0.955 | agAATTActg I$DRI_01 | 7637 (+) | 1.000 | 0.951 | agAATTAttt I$DRI_01 | 7829 (+) | 1.000 | 0.982 | acAATTAacc I$DRI_01 | 8572 (+) | 1.000 | 0.989 | ttGATTAaat I$DRI_01 | 8857 (+) | 1.000 | 0.982 | aaAATTAaga I$DRI_01 | 8953 (+) | 1.000 | 0.957 | aaAATTAgta I$DRI_01 | 8990 (-) | 1.000 | 0.983 | cttTAATTcc I$DRI_01 | 10127 (+) | 1.000 | 0.957 | aaAATTAgtt I$DRI_01 | 10139 (+) | 1.000 | 0.951 | gaAATTAtta I$DRI_01 | 10347 (-) | 1.000 | 0.988 | tttTAATTca I$DRI_01 | 11201 (+) | 1.000 | 0.956 | ttGATTAtta I$DRI_01 | 11206 (-) | 1.000 | 0.994 | tatTAATTtg I$DRI_01 | 11275 (+) | 1.000 | 0.981 | gaAATTAagg I$DRI_01 | 11694 (-) | 1.000 | 0.980 | agtTAATCac V$FAC1_01 | 68 (-) | 1.000 | 0.907 | ttactGTTGTtgat V$FAC1_01 | 215 (-) | 0.904 | 0.868 | tatttTTTTTtttt V$FAC1_01 | 289 (-) | 0.978 | 0.871 | caagtGTTTTacag V$FAC1_01 | 425 (+) | 0.885 | 0.870 | gtccAAAATaccac V$FAC1_01 | 451 (-) | 0.978 | 0.948 | tgtctGTTTTggtt V$FAC1_01 | 510 (-) | 0.854 | 0.888 | tttgtCTTCTgatt V$FAC1_01 | 545 (-) | 0.978 | 0.891 | cttgtGTTTTaacc V$FAC1_01 | 834 (+) | 0.978 | 0.878 | atatAAAACaagtt V$FAC1_01 | 1528 (-) | 0.904 | 0.868 | 
ctgttTTTTTgttc V$FAC1_01 | 1959 (-) | 0.904 | 0.884 | ttcctTTTTTgagc V$FAC1_01 | 2291 (-) | 0.926 | 0.909 | tagttTTTGTgtgt V$FAC1_01 | 2310 (-) | 0.978 | 0.981 | tgtgtGTTTTggat V$FAC1_01 | 2363 (-) | 0.885 | 0.877 | tatttATTTTgcca V$FAC1_01 | 2407 (+) | 0.904 | 0.873 | ttaaAAAAAacaaa V$FAC1_01 | 2409 (+) | 0.978 | 0.922 | aaaaAAAACaaaaa V$FAC1_01 | 2540 (+) | 0.956 | 0.866 | gagaATAACaggtt V$FAC1_01 | 2552 (-) | 1.000 | 0.944 | tttctGTTGTgaaa V$FAC1_01 | 2577 (+) | 1.000 | 0.890 | atttACAACaaatg V$FAC1_01 | 3139 (+) | 1.000 | 0.931 | tcccACAACaatat V$FAC1_01 | 3452 (+) | 0.978 | 0.886 | taatAAAACacagg V$FAC1_01 | 3689 (+) | 0.934 | 0.888 | gccaAGAACacaag V$FAC1_01 | 3694 (+) | 0.919 | 0.880 | gaacACAAGagatg V$FAC1_01 | 3788 (-) | 0.854 | 0.866 | tttttCTTCTggac V$FAC1_01 | 3807 (+) | 0.978 | 0.880 | agaaAAAACatcgt V$FAC1_01 | 4100 (+) | 0.978 | 0.867 | ttctAAAACatcca V$FAC1_01 | 4426 (+) | 0.904 | 0.868 | aaaaAAAAAaaaaa V$FAC1_01 | 4427 (+) | 0.904 | 0.868 | aaaaAAAAAaaaaa V$FAC1_01 | 4428 (+) | 0.904 | 0.868 | aaaaAAAAAaaaaa V$FAC1_01 | 4481 (-) | 0.897 | 0.894 | tttttCTTTTgaac V$FAC1_01 | 4489 (+) | 0.926 | 0.867 | ttgaACAAAaaaca V$FAC1_01 | 4491 (+) | 0.904 | 0.924 | gaacAAAAAacaaa V$FAC1_01 | 4493 (+) | 0.978 | 0.896 | acaaAAAACaaaac V$FAC1_01 | 4496 (+) | 0.926 | 0.904 | aaaaACAAAacaaa V$FAC1_01 | 4498 (+) | 0.978 | 0.935 | aaacAAAACaaaac V$FAC1_01 | 4501 (+) | 0.926 | 0.869 | caaaACAAAacaac V$FAC1_01 | 4503 (+) | 0.978 | 0.928 | aaacAAAACaacat V$FAC1_01 | 4506 (+) | 1.000 | 0.887 | caaaACAACatcag V$FAC1_01 | 4522 (+) | 1.000 | 0.891 | caagACAACaaatg V$FAC1_01 | 4663 (-) | 0.978 | 0.937 | actttGTTTTgctc V$FAC1_01 | 5206 (-) | 0.919 | 0.874 | ggcctCTTGTgctt V$FAC1_01 | 5276 (+) | 1.000 | 0.901 | gccaACAACaggct V$FAC1_01 | 5357 (+) | 0.926 | 0.869 | tttgACAAAacaca V$FAC1_01 | 5359 (+) | 0.978 | 0.945 | tgacAAAACacacg V$FAC1_01 | 5560 (+) | 0.978 | 0.879 | ctagAAAACaacca V$FAC1_01 | 5600 (+) | 0.956 | 0.871 | taatATAACacaag V$FAC1_01 | 5605 (+) | 0.919 | 0.879 | 
taacACAAGaaacc V$FAC1_01 | 5641 (+) | 0.885 | 0.891 | agacAAAATaaaaa V$FAC1_01 | 5658 (+) | 0.934 | 0.889 | gaaaAGAACacaat V$FAC1_01 | 5663 (+) | 0.907 | 0.883 | gaacACAATaagag V$FAC1_01 | 5714 (+) | 0.926 | 0.872 | tcccACAAAaacac V$FAC1_01 | 5717 (+) | 0.978 | 0.919 | cacaAAAACactaa V$FAC1_01 | 5737 (+) | 0.863 | 0.870 | agccATAATataga V$FAC1_01 | 5799 (+) | 0.904 | 0.904 | atacAAAAAaaaaa V$FAC1_01 | 5801 (+) | 0.904 | 0.868 | acaaAAAAAaaaaa V$FAC1_01 | 5803 (+) | 0.904 | 0.868 | aaaaAAAAAaaata V$FAC1_01 | 5962 (-) | 0.978 | 0.890 | ccagtGTTTTtaag V$FAC1_01 | 6514 (+) | 0.934 | 0.896 | taaaAGAACacgaa V$FAC1_01 | 6578 (+) | 0.978 | 0.978 | atgcAAAACacaaa V$FAC1_01 | 6583 (+) | 0.926 | 0.943 | aaacACAAAacaaa V$FAC1_01 | 6585 (+) | 0.978 | 0.962 | acacAAAACaaata V$FAC1_01 | 6770 (+) | 0.926 | 0.874 | atgaACAAAacagc V$FAC1_01 | 6772 (+) | 0.978 | 0.908 | gaacAAAACagcag V$FAC1_01 | 7063 (-) | 0.978 | 0.905 | aatttGTTTTtgtt V$FAC1_01 | 7066 (-) | 0.926 | 0.870 | ttgttTTTGTttgt V$FAC1_01 | 7428 (-) | 0.978 | 0.920 | tatttGTTTTtatt V$FAC1_01 | 7436 (-) | 0.882 | 0.885 | tttatTTTATgtgt V$FAC1_01 | 7451 (-) | 0.978 | 0.936 | ggagtGTTTTgcct V$FAC1_01 | 7769 (+) | 0.904 | 0.912 | tctcAAAAAacaaa V$FAC1_01 | 7771 (+) | 0.978 | 0.883 | tcaaAAAACaaaac V$FAC1_01 | 7774 (+) | 0.926 | 0.904 | aaaaACAAAacaaa V$FAC1_01 | 7776 (+) | 0.978 | 0.961 | aaacAAAACaaaaa V$FAC1_01 | 7801 (+) | 0.934 | 0.873 | ataaAGAACaacga V$FAC1_01 | 7818 (+) | 0.978 | 0.951 | ctccAAAACaaaca V$FAC1_01 | 7895 (-) | 0.885 | 0.877 | tttttATTTTgata V$FAC1_01 | 7972 (-) | 0.978 | 0.905 | tgtttGTTTTtcaa V$FAC1_01 | 8530 (+) | 0.934 | 0.878 | ttcaAGAACacagt V$FAC1_01 | 8837 (+) | 0.978 | 0.922 | aaccAAAACagaac V$FAC1_01 | 9107 (+) | 0.978 | 0.960 | aagcAAAACaaaaa V$FAC1_01 | 9112 (+) | 0.904 | 0.892 | aaacAAAAAaccgg V$FAC1_01 | 9359 (-) | 0.934 | 0.875 | tcagtGTTCTctat V$FAC1_01 | 9428 (+) | 0.956 | 0.887 | ataaATAACaggaa V$FAC1_01 | 9530 (+) | 0.934 | 0.867 | gactAGAACaaaca V$FAC1_01 | 9760 (+) | 0.978 | 0.869 | 
attgAAAACagtca V$FAC1_01 | 10149 (-) | 1.000 | 0.921 | ttattGTTGTtgtc V$FAC1_01 | 10152 (-) | 1.000 | 0.908 | ttgttGTTGTcgtt V$FAC1_01 | 10310 (-) | 0.978 | 0.876 | taactGTTTTtata V$FAC1_01 | 10341 (-) | 0.978 | 0.881 | ttattGTTTTaatt V$FAC1_01 | 10930 (-) | 0.919 | 0.876 | cgcctCTTGTggac V$FAC1_01 | 10942 (-) | 1.000 | 0.903 | acggtGTTGTagct V$FAC1_01 | 11036 (-) | 1.000 | 0.887 | ccactGTTGTtggt V$FAC1_01 | 11444 (-) | 0.885 | 0.871 | tatctATTTTggtg V$FAC1_01 | 11474 (-) | 0.978 | 0.916 | tatttGTTTTtccc V$FAC1_01 | 11501 (-) | 0.926 | 0.881 | gaattTTTGTggat V$FAC1_01 | 11675 (+) | 0.904 | 0.868 | aaccAAAAAagacc V$STAT5A_01 | 924 (-) | 0.995 | 0.974 | caCTTCCacgaaacc V$STAT5A_01 | 1818 (+) | 0.995 | 0.930 | gtgttcagGGAAGac V$STAT5A_01 | 10049 (+) | 0.908 | 0.932 | gggttctaAGAACtg V$STAT5A_01 | 10049 (-) | 0.908 | 0.930 | ggGTTCTaagaactg V$STAT5B_01 | 924 (+) | 0.951 | 0.909 | caCTTCCacgaaacc V$STAT5B_01 | 924 (-) | 0.922 | 0.946 | cacttccaCGAAAcc V$STAT5B_01 | 1164 (-) | 0.968 | 0.901 | tttttcaaAGAAAgg V$STAT5B_01 | 10049 (+) | 0.923 | 0.907 | ggGTTCTaagaactg V$STAT5B_01 | 10049 (-) | 0.923 | 0.906 | gggttctaAGAACtg V$STAT5A_02 | 730 (-) | 0.786 | 0.724 | gagccaccacgtggTTGCTgggaa V$STAT5A_02 | 745 (+) | 0.998 | 0.736 | tgctgGGAATtgaactcaggacct V$STAT5A_02 | 912 (-) | 0.988 | 0.812 | tttcatgcgtagcaCTTCCacgaa V$STAT5A_02 | 927 (+) | 0.980 | 0.760 | ttccaCGAAAccacaagttaggaa V$STAT5A_02 | 1121 (-) | 0.991 | 0.747 | gtgtatgcacgtgtGTTCCtaaaa V$STAT5A_02 | 1152 (-) | 0.977 | 0.776 | tttatgtagttcttTTTCAaagaa V$STAT5A_02 | 1167 (+) | 0.996 | 0.786 | ttcaaAGAAAggagatgggcaccc V$STAT5A_02 | 1641 (-) | 0.814 | 0.732 | ttcctagtgctaatCTACTtggca V$STAT5A_02 | 1806 (-) | 0.974 | 0.767 | ttcatcagggtggtGTTCAgggaa V$STAT5A_02 | 1821 (+) | 0.988 | 0.738 | ttcagGGAAGacacccattggggt V$STAT5A_02 | 3078 (-) | 0.655 | 0.743 | cacagggtgattctATTTTtcgaa V$STAT5A_02 | 3123 (-) | 0.998 | 0.727 | ctatattaatttacATTCCcacaa V$STAT5A_02 | 3829 (-) | 1.000 | 0.725 | cataataaatttccATTCTcataa V$STAT5A_02 
| 4024 (+) | 0.998 | 0.825 | ctcatGGAATtaatattccagcag V$STAT5A_02 | 4024 (-) | 0.998 | 0.738 | ctcatggaattaatATTCCagcag V$STAT5A_02 | 5021 (-) | 0.789 | 0.792 | gccacgggactgggATGCTcagga V$STAT5A_02 | 6766 (+) | 0.974 | 0.713 | taccaTGAACaaaacagcagatct V$STAT5A_02 | 7187 (-) | 0.782 | 0.727 | gaacggccacggggGTGCTatgaa V$STAT5A_02 | 7581 (-) | 0.789 | 0.781 | atcaggtcaaacaaATGCTctgaa V$STAT5A_02 | 7596 (+) | 0.974 | 0.724 | tgctcTGAACtgctgagccatctc V$STAT5A_02 | 7981 (+) | 0.798 | 0.747 | ttcaaGGGAAggacttcactgtgc V$STAT5A_02 | 8241 (-) | 0.977 | 0.862 | ccagatgaaggtgtGTTCGtggca V$STAT5A_02 | 8256 (+) | 0.788 | 0.731 | ttcgtGGCATtgacctgatgatgc V$STAT5A_02 | 9390 (-) | 0.993 | 0.729 | gcccctggaatgtgGTTCTatgtc V$STAT5A_02 | 9652 (+) | 0.996 | 0.794 | tacttAGAAAgaccttgtctcaaa V$STAT5A_02 | 10991 (+) | 0.998 | 0.717 | gccctGGAATcagattacctgctt V$STAT5A_02 | 11263 (+) | 0.996 | 0.722 | tgccaAGAAAaggaaattaaggca V$STAT5A_02 | 11378 (+) | 0.977 | 0.750 | tgcttTGAAAacttattaaatttt V$STAT5A_02 | 11639 (+) | 0.805 | 0.715 | ttaaaTGTATttctgtactgttaa V$STAT5A_02 | 11639 (-) | 0.817 | 0.718 | ttaaatgtatttctGTACTgttaa I$OVO_01 | 1141 (-) | 0.998 | 0.991 | aaaaCCGTTacttta I$OVO_01 | 9560 (-) | 0.999 | 0.963 | ctgtCGGTTtctggg I$OVO_01 | 11066 (-) | 0.999 | 0.969 | tgatCAGTTacaagc V$GATA6_01 | 82 (+) | 0.989 | 0.987 | gttGATAGgg V$GATA6_01 | 91 (-) | 0.989 | 0.988 | gcCTATCttc V$GATA6_01 | 307 (-) | 0.989 | 0.987 | agCTATCtct V$GATA6_01 | 566 (+) | 1.000 | 0.997 | ggaGATAAac V$GATA6_01 | 1649 (-) | 0.987 | 0.985 | gcTAATCtac V$GATA6_01 | 1992 (+) | 0.987 | 0.985 | caaGATTAtc V$GATA6_01 | 1995 (-) | 1.000 | 0.997 | gaTTATCtgg V$GATA6_01 | 2722 (+) | 1.000 | 0.999 | ataGATAAtc V$GATA6_01 | 2725 (-) | 0.987 | 0.985 | gaTAATCttc V$GATA6_01 | 2902 (-) | 1.000 | 0.993 | ttTTATCcac V$GATA6_01 | 3374 (-) | 0.987 | 0.984 | ccTAATCata V$GATA6_01 | 3930 (-) | 1.000 | 0.998 | caTTATCagt V$GATA6_01 | 3977 (+) | 0.989 | 0.986 | tatGATAGac V$GATA6_01 | 4002 (+) | 1.000 | 0.998 | aacGATAAat V$GATA6_01 | 
4374 (-) | 1.000 | 0.997 | agTTATCtgg V$GATA6_01 | 4779 (+) | 1.000 | 0.992 | ctgGATAAag V$GATA6_01 | 4801 (-) | 0.987 | 0.985 | caTAATCtct V$GATA6_01 | 5236 (+) | 0.987 | 0.985 | aacGATTAca V$GATA6_01 | 5335 (+) | 0.989 | 0.985 | gccGATAGtt V$GATA6_01 | 5582 (+) | 1.000 | 0.994 | aagGATAAta V$GATA6_01 | 5633 (+) | 1.000 | 0.991 | tggGATAAag V$GATA6_01 | 5977 (+) | 1.000 | 0.997 | ggtGATAAct V$GATA6_01 | 6465 (+) | 1.000 | 0.997 | cctGATAAaa V$GATA6_01 | 7375 (+) | 0.987 | 0.985 | attGATATtg V$GATA6_01 | 7788 (-) | 1.000 | 0.991 | aaTTATCcca V$GATA6_01 | 7797 (+) | 1.000 | 0.999 | aaaGATAAag V$GATA6_01 | 8062 (-) | 1.000 | 0.997 | gtTTATCtgg V$GATA6_01 | 8604 (-) | 1.000 | 0.999 | taTTATCatt V$GATA6_01 | 9367 (-) | 0.989 | 0.985 | ctCTATCaca V$GATA6_01 | 9732 (+) | 1.000 | 0.997 | tatGATAAtg V$GATA6_01 | 9833 (+) | 1.000 | 0.993 | aggGATAAtt V$GATA6_01 | 10488 (-) | 0.987 | 0.984 | acATATCaag V$GATA6_01 | 11200 (+) | 0.987 | 0.985 | attGATTAtt V$GATA6_01 | 11348 (-) | 1.000 | 0.996 | caTTATCgaa V$GATA6_01 | 11369 (-) | 0.989 | 0.987 | ttCTATCtgt V$GATA6_01 | 11441 (-) | 1.000 | 0.999 | acTTATCtat V$GATA6_01 | 11569 (-) | 1.000 | 0.999 | tcTTATCatt V$POU3F2_02 | 19 (+) | 0.674 | 0.719 | ACATGtgaat V$POU3F2_02 | 78 (+) | 0.783 | 0.775 | TGATGttgat V$POU3F2_02 | 149 (+) | 0.674 | 0.710 | ATTTGctcat V$POU3F2_02 | 499 (-) | 0.674 | 0.798 | attatTATAT V$POU3F2_02 | 589 (-) | 0.652 | 0.820 | attaaCAGAC V$POU3F2_02 | 1017 (+) | 0.891 | 0.719 | ATATGtgatt V$POU3F2_02 | 1063 (-) | 0.783 | 0.775 | tttaaAATAA V$POU3F2_02 | 1559 (+) | 0.783 | 0.743 | TTCTGgaaat V$POU3F2_02 | 1893 (+) | 0.783 | 0.775 | TTATAtttat V$POU3F2_02 | 1899 (+) | 0.783 | 0.775 | TTATTttatt V$POU3F2_02 | 1907 (+) | 0.783 | 0.775 | TTATTttatt V$POU3F2_02 | 2522 (-) | 0.891 | 0.719 | aattaCATAT V$POU3F2_02 | 2565 (-) | 0.783 | 0.743 | atttcCGTAA V$POU3F2_02 | 3103 (+) | 0.652 | 0.812 | CTATActaat V$POU3F2_02 | 3123 (+) | 0.652 | 0.820 | CTATAttaat V$POU3F2_02 | 3137 (-) | 0.783 | 0.743 | attccCACAA 
V$POU3F2_02 | 3256 (+) | 1.000 | 0.743 | TTATGgtttt V$POU3F2_02 | 3369 (-) | 0.783 | 0.767 | aatagCCTAA V$POU3F2_02 | 3494 (-) | 0.783 | 0.767 | aatagCTTAA V$POU3F2_02 | 3627 (+) | 0.783 | 0.743 | TTATAatagt V$POU3F2_02 | 3843 (-) | 1.000 | 0.855 | attctCATAA V$POU3F2_02 | 4064 (-) | 0.674 | 0.831 | attaaAATAT V$POU3F2_02 | 4070 (+) | 0.891 | 0.719 | ATATGtaaaa V$POU3F2_02 | 4088 (+) | 0.674 | 0.719 | ATATTttact V$POU3F2_02 | 4121 (+) | 0.674 | 0.719 | AGATGttaaa V$POU3F2_02 | 4669 (+) | 0.783 | 0.767 | TTTTGctcat V$POU3F2_02 | 4796 (-) | 1.000 | 0.767 | acttgCATAA V$POU3F2_02 | 5444 (-) | 0.891 | 0.798 | attgcCATAT V$POU3F2_02 | 5586 (-) | 0.783 | 0.743 | ataatAATAA V$POU3F2_02 | 5591 (-) | 0.674 | 0.719 | aataaAATAT V$POU3F2_02 | 6576 (+) | 1.000 | 0.767 | TTATGcaaaa V$POU3F2_02 | 6609 (+) | 0.674 | 0.710 | ATTTGctaaa V$POU3F2_02 | 6770 (-) | 0.783 | 0.775 | atgaaCAAAA V$POU3F2_02 | 6831 (+) | 0.674 | 0.710 | ATGTGctatt V$POU3F2_02 | 7262 (-) | 0.674 | 0.831 | attaaAATAT V$POU3F2_02 | 7267 (-) | 0.783 | 0.743 | aatatCCTAA V$POU3F2_02 | 7268 (+) | 0.674 | 0.710 | ATATCctaaa V$POU3F2_02 | 7351 (+) | 0.674 | 0.831 | ACATGttaat V$POU3F2_02 | 7641 (+) | 0.783 | 0.775 | TTATTttaaa V$POU3F2_02 | 7898 (+) | 0.783 | 0.775 | TTATTttgat V$POU3F2_02 | 7901 (+) | 0.783 | 0.743 | TTTTGatatt V$POU3F2_02 | 8605 (-) | 0.652 | 0.787 | attatCATTC V$POU3F2_02 | 8697 (+) | 0.783 | 0.743 | TTCTGaaaat V$POU3F2_02 | 8704 (-) | 0.783 | 0.743 | aatacCATCA V$POU3F2_02 | 8815 (+) | 0.783 | 0.743 | TTCTGgtact V$POU3F2_02 | 9154 (+) | 0.891 | 0.719 | ATATGtgagt V$POU3F2_02 | 9731 (+) | 0.869 | 0.900 | GTATGataat V$POU3F2_02 | 9750 (+) | 0.674 | 0.831 | ATATAttaat V$POU3F2_02 | 9936 (-) | 0.674 | 0.710 | atttgCATGT V$POU3F2_02 | 10143 (+) | 0.783 | 0.743 | TTATTattat V$POU3F2_02 | 10321 (+) | 0.674 | 0.719 | ATATTtttat V$POU3F2_02 | 10910 (+) | 0.869 | 0.812 | CTATGcaaat V$POU3F2_02 | 11153 (+) | 0.783 | 0.743 | TGATGaaaat V$POU3F2_02 | 11299 (-) | 0.783 | 0.743 | ataatAATAA V$POU3F2_02 | 
11658 (-) | 0.783 | 0.775 | gttaaCACAA V$POU3F2_02 | 11667 (-) | 1.000 | 0.855 | attttCATAA V$POU3F2_02 | 11673 (-) | 0.783 | 0.743 | ataacCAAAA V$HIF1_Q5 | 6062 (-) | 1.000 | 0.966 | cctgCACGTgct V$HIF1_Q5 | 6064 (+) | 1.000 | 0.962 | tgcACGTGctgc V$HIF1_Q5 | 6884 (-) | 1.000 | 0.940 | gcacCACGTaag V$HIF1_Q5 | 7484 (-) | 1.000 | 0.935 | cgcgCACGTctg V$AP2REP_01 | 114 (+) | 1.000 | 0.927 | CAGTGag V$AP2REP_01 | 126 (-) | 0.918 | 0.854 | ctAACTG V$AP2REP_01 | 300 (-) | 0.918 | 0.866 | caGACTG V$AP2REP_01 | 667 (-) | 1.000 | 0.909 | taCACTG V$AP2REP_01 | 1029 (+) | 0.918 | 0.897 | CAGTTga V$AP2REP_01 | 1267 (+) | 0.907 | 0.856 | CCGTGtg V$AP2REP_01 | 1377 (+) | 1.000 | 1.000 | CAGTGgg V$AP2REP_01 | 1418 (+) | 0.907 | 0.844 | CCGTGag V$AP2REP_01 | 1434 (+) | 1.000 | 0.962 | CAGTGgc V$AP2REP_01 | 1450 (-) | 1.000 | 0.889 | gtCACTG V$AP2REP_01 | 1592 (-) | 1.000 | 0.897 | agCACTG V$AP2REP_01 | 1750 (-) | 1.000 | 0.909 | aaCACTG V$AP2REP_01 | 1861 (-) | 0.918 | 0.927 | ccAACTG V$AP2REP_01 | 2041 (-) | 1.000 | 0.970 | acCACTG V$AP2REP_01 | 2309 (+) | 0.907 | 0.856 | CTGTGtg V$AP2REP_01 | 2373 (-) | 0.907 | 0.880 | gcCACAG V$AP2REP_01 | 2447 (+) | 1.000 | 0.970 | CAGTGgt V$AP2REP_01 | 2805 (-) | 0.932 | 0.866 | ctTACTG V$AP2REP_01 | 2828 (+) | 0.918 | 0.897 | CAGTTgt V$AP2REP_01 | 2876 (-) | 0.870 | 0.854 | tcCACCG V$AP2REP_01 | 2889 (-) | 0.932 | 0.866 | cgTACTG V$AP2REP_01 | 2916 (+) | 0.907 | 0.887 | CTGTGga V$AP2REP_01 | 2963 (+) | 1.000 | 0.889 | CAGTGac V$AP2REP_01 | 3231 (+) | 0.918 | 0.854 | CAGTCag V$AP2REP_01 | 3251 (-) | 0.854 | 0.870 | ccCACTT V$AP2REP_01 | 3362 (-) | 1.000 | 0.970 | acCACTG V$AP2REP_01 | 3562 (+) | 0.907 | 0.856 | CTGTGtg V$AP2REP_01 | 3959 (-) | 0.932 | 0.939 | ccTACTG V$AP2REP_01 | 3966 (-) | 1.000 | 0.897 | agCACTG V$AP2REP_01 | 4310 (-) | 0.932 | 0.848 | aaTACTG V$AP2REP_01 | 4463 (-) | 1.000 | 0.939 | caCACTG V$AP2REP_01 | 4467 (-) | 0.918 | 0.854 | ctGACTG V$AP2REP_01 | 4568 (+) | 0.918 | 0.889 | CAGTTgc V$AP2REP_01 | 4683 (+) | 0.907 | 
0.880 | CTGTGgc V$AP2REP_01 | 4719 (-) | 1.000 | 0.897 | ttCACTG V$AP2REP_01 | 4758 (+) | 0.839 | 0.856 | CTGTAgg V$AP2REP_01 | 4855 (-) | 1.000 | 0.897 | ttCACTG V$AP2REP_01 | 5021 (-) | 0.907 | 0.880 | gcCACGG V$AP2REP_01 | 5048 (+) | 0.907 | 0.887 | CTGTGgt V$AP2REP_01 | 5087 (-) | 0.918 | 0.854 | ctGACTG V$AP2REP_01 | 5202 (+) | 0.907 | 0.880 | CTGTGgc V$AP2REP_01 | 5329 (+) | 1.000 | 0.927 | CAGTGcg V$AP2REP_01 | 5367 (-) | 0.907 | 0.856 | caCACGG V$AP2REP_01 | 5430 (-) | 0.854 | 0.840 | tcCACTT V$AP2REP_01 | 5628 (-) | 1.000 | 0.909 | taCACTG V$AP2REP_01 | 5690 (-) | 0.830 | 0.849 | ccCACTA V$AP2REP_01 | 5761 (+) | 0.839 | 0.856 | CTGTAgg V$AP2REP_01 | 5827 (-) | 1.000 | 0.889 | ggCACTG V$AP2REP_01 | 5963 (+) | 1.000 | 0.909 | CAGTGtt V$AP2REP_01 | 5973 (+) | 0.854 | 0.840 | AAGTGgt V$AP2REP_01 | 6074 (-) | 0.907 | 0.880 | gcCACAG V$AP2REP_01 | 6084 (+) | 0.907 | 0.844 | CTGTGag V$AP2REP_01 | 6094 (+) | 0.907 | 0.856 | CTGTGtg V$AP2REP_01 | 6306 (-) | 1.000 | 0.889 | ggCACTG V$AP2REP_01 | 6323 (-) | 0.918 | 0.854 | ctGACTG V$AP2REP_01 | 6335 (-) | 1.000 | 0.939 | caCACTG V$AP2REP_01 | 6387 (-) | 0.918 | 0.854 | cgAACTG V$AP2REP_01 | 6569 (-) | 0.907 | 0.880 | gcCACAG V$AP2REP_01 | 6621 (+) | 0.907 | 0.844 | CCGTGag V$AP2REP_01 | 6649 (+) | 0.918 | 0.866 | CAGTCtg V$AP2REP_01 | 6751 (-) | 0.830 | 0.849 | ccCACTA V$AP2REP_01 | 6846 (-) | 0.918 | 0.927 | ccAACTG V$AP2REP_01 | 6908 (+) | 0.854 | 0.840 | AAGTGga V$AP2REP_01 | 6950 (-) | 0.932 | 0.841 | gaTACTG V$AP2REP_01 | 7192 (-) | 0.907 | 0.880 | gcCACGG V$AP2REP_01 | 7535 (+) | 0.918 | 0.927 | CAGTTgg V$AP2REP_01 | 7549 (-) | 0.918 | 0.927 | ccAACTG V$AP2REP_01 | 7678 (-) | 0.907 | 0.844 | ctCACAG V$AP2REP_01 | 7726 (+) | 0.918 | 0.866 | CAGTTtg V$AP2REP_01 | 7837 (-) | 0.854 | 0.870 | ccCACTT V$AP2REP_01 | 7995 (-) | 1.000 | 0.897 | ttCACTG V$AP2REP_01 | 8005 (+) | 0.825 | 0.844 | CTGTTgg V$AP2REP_01 | 8335 (+) | 0.932 | 0.848 | CAGTAtt V$AP2REP_01 | 8348 (-) | 0.854 | 0.840 | acCACTT V$AP2REP_01 | 8519 (-) 
| 0.907 | 0.844 | ctCACAG V$AP2REP_01 | 8540 (+) | 1.000 | 1.000 | CAGTGgg V$AP2REP_01 | 8670 (-) | 0.870 | 0.854 | acCACCG V$AP2REP_01 | 8929 (+) | 0.932 | 0.848 | CAGTAta V$AP2REP_01 | 9181 (-) | 0.907 | 0.880 | gcCACAG V$AP2REP_01 | 9360 (+) | 1.000 | 0.909 | CAGTGtt V$AP2REP_01 | 9560 (+) | 0.825 | 0.844 | CTGTCgg V$AP2REP_01 | 9678 (+) | 0.854 | 0.840 | AAGTGga V$AP2REP_01 | 9911 (-) | 0.932 | 0.866 | ctTACTG V$AP2REP_01 | 10043 (+) | 0.907 | 0.856 | CCGTGtg V$AP2REP_01 | 10088 (+) | 1.000 | 0.897 | CAGTGct V$AP2REP_01 | 10193 (+) | 0.918 | 0.854 | CAGTCag V$AP2REP_01 | 10209 (-) | 0.907 | 0.887 | tcCACAG V$AP2REP_01 | 10213 (+) | 1.000 | 0.962 | CAGTGgc V$AP2REP_01 | 10651 (+) | 1.000 | 0.902 | CAGTGtc V$AP2REP_01 | 10676 (-) | 0.907 | 0.887 | tcCACAG V$AP2REP_01 | 10749 (+) | 1.000 | 0.897 | CAGTGat V$AP2REP_01 | 10756 (+) | 0.907 | 0.856 | CCGTGtg V$AP2REP_01 | 10779 (+) | 0.825 | 0.844 | CTGTCgg V$AP2REP_01 | 10821 (+) | 0.918 | 0.854 | CAGTCcg V$AP2REP_01 | 10884 (-) | 1.000 | 0.897 | agCACTG V$AP2REP_01 | 10967 (+) | 0.825 | 0.844 | CTGTTgg V$AP2REP_01 | 10984 (+) | 0.907 | 0.856 | CTGTGtg V$AP2REP_01 | 11035 (-) | 1.000 | 0.970 | acCACTG V$AP2REP_01 | 11095 (-) | 1.000 | 0.970 | tcCACTG V$AP2REP_01 | 11227 (+) | 0.918 | 0.866 | CAGTTtg V$AP2REP_01 | 11236 (-) | 0.918 | 0.854 | ctAACTG V$AP2REP_01 | 11255 (+) | 0.907 | 0.887 | CTGTGgt V$AP2REP_01 | 11461 (+) | 1.000 | 0.897 | CAGTGaa V$AP2REP_01 | 11699 (-) | 1.000 | 0.897 | atCACTG V$AP2REP_01 | 11711 (-) | 0.918 | 0.927 | ccAACTG V$AP2ALPHA_01 | 101 (+) | 0.998 | 0.998 | GCCTGgggg V$AP2ALPHA_01 | 1334 (-) | 0.999 | 0.999 | ccccAGGGC V$AP2ALPHA_01 | 1341 (-) | 0.996 | 0.986 | gcttAAGGC V$AP2ALPHA_01 | 3388 (+) | 0.999 | 0.990 | GCCCTgagg V$AP2ALPHA_01 | 3569 (+) | 0.996 | 0.992 | GCCTTtggc V$AP2ALPHA_01 | 6147 (+) | 0.997 | 0.993 | GCCTCtggc V$AP2ALPHA_01 | 6147 (-) | 0.995 | 0.994 | gcctCTGGC V$AP2ALPHA_01 | 7192 (+) | 0.995 | 0.996 | GCCACgggg V$AP2ALPHA_01 | 7226 (+) | 0.995 | 0.984 | GCCAGcagg 
V$AP2ALPHA_01 | 7461 (+) | 0.998 | 0.986 | GCCTGcagg V$AP2ALPHA_01 | 7627 (+) | 0.998 | 0.983 | GCCTGgcgg V$AP2ALPHA_01 | 9925 (+) | 0.994 | 0.990 | GCCTAtggg V$AP2ALPHA_01 | 11913 (-) | 1.000 | 0.989 | gcttCGGGC V$AP2GAMMA_01 | 101 (+) | 0.993 | 0.994 | GCCTGgggg V$AP2GAMMA_01 | 1334 (-) | 0.995 | 0.995 | ccccAGGGC V$AP2GAMMA_01 | 1341 (-) | 0.987 | 0.954 | gcttAAGGC V$AP2GAMMA_01 | 2480 (+) | 0.995 | 0.982 | GCCCTgggt V$AP2GAMMA_01 | 3388 (+) | 0.995 | 0.964 | GCCCTgagg V$AP2GAMMA_01 | 3533 (+) | 1.000 | 0.984 | GCCCGaggt V$AP2GAMMA_01 | 3569 (+) | 0.987 | 0.977 | GCCTTtggc V$AP2GAMMA_01 | 3569 (-) | 0.969 | 0.970 | gcctTTGGC V$AP2GAMMA_01 | 5021 (+) | 0.975 | 0.964 | GCCACggga V$AP2GAMMA_01 | 5889 (-) | 0.998 | 0.955 | gcatGGGGC V$AP2GAMMA_01 | 6116 (-) | 0.995 | 0.952 | ccagAGGGC V$AP2GAMMA_01 | 6147 (+) | 0.991 | 0.980 | GCCTCtggc V$AP2GAMMA_01 | 6147 (-) | 0.977 | 0.976 | gcctCTGGC V$AP2GAMMA_01 | 6421 (-) | 0.977 | 0.963 | tcctCTGGC V$AP2GAMMA_01 | 7192 (+) | 0.975 | 0.979 | GCCACgggg V$AP2GAMMA_01 | 7461 (+) | 0.993 | 0.958 | GCCTGcagg V$AP2GAMMA_01 | 7627 (+) | 0.993 | 0.955 | GCCTGgcgg V$AP2GAMMA_01 | 8162 (-) | 0.991 | 0.967 | tccaGAGGC V$AP2GAMMA_01 | 9241 (+) | 0.995 | 0.978 | GCCCTagga V$AP2GAMMA_01 | 9331 (+) | 0.991 | 0.967 | GCCTCtgga V$AP2GAMMA_01 | 9390 (+) | 0.998 | 0.973 | GCCCCtgga V$AP2GAMMA_01 | 9442 (+) | 0.992 | 0.977 | GCCCAaggt V$AP2GAMMA_01 | 9925 (+) | 0.985 | 0.977 | GCCTAtggg V$AP2GAMMA_01 | 10560 (+) | 0.987 | 0.971 | GCCGGcggt V$AP2GAMMA_01 | 11284 (-) | 0.995 | 0.952 | gcatAGGGC V$AP2GAMMA_01 | 11815 (+) | 0.992 | 0.979 | GCCCAgggt V$AP2GAMMA_01 | 11913 (-) | 1.000 | 0.964 | gcttCGGGC V$AP2GAMMA_01 | 11920 (+) | 0.992 | 0.950 | GCCCAacgc V$AP2GAMMA_01 | 11928 (-) | 0.969 | 0.964 | cccaTTGGC V$TBP_01 | 241 (+) | 0.934 | 0.927 | tTTAAAtc V$TBP_01 | 569 (+) | 1.000 | 0.891 | gATAAAcg V$TBP_01 | 642 (-) | 1.000 | 0.945 | caTTTATt V$TBP_01 | 650 (-) | 1.000 | 0.945 | caTTTATt V$TBP_01 | 661 (+) | 0.881 | 0.895 | tATAAGta V$TBP_01 | 835 
(+) | 1.000 | 0.923 | tATAAAac V$TBP_01 | 845 (+) | 0.934 | 0.910 | gTTAAAta V$TBP_01 | 849 (+) | 1.000 | 0.959 | aATAAAta V$TBP_01 | 962 (-) | 0.932 | 0.940 | taTTTACa V$TBP_01 | 1150 (-) | 1.000 | 0.901 | acTTTATg V$TBP_01 | 1890 (-) | 1.000 | 0.935 | atTTTATa V$TBP_01 | 1896 (-) | 1.000 | 0.959 | taTTTATt V$TBP_01 | 1901 (-) | 1.000 | 0.894 | atTTTATt V$TBP_01 | 1909 (-) | 1.000 | 0.894 | atTTTATt V$TBP_01 | 2363 (-) | 1.000 | 0.959 | taTTTATt V$TBP_01 | 2576 (-) | 0.932 | 0.940 | taTTTACa V$TBP_01 | 2737 (-) | 1.000 | 0.889 | ccTTTATg V$TBP_01 | 2901 (-) | 1.000 | 0.905 | ttTTTATc V$TBP_01 | 3130 (-) | 0.932 | 0.937 | aaTTTACa V$TBP_01 | 3406 (+) | 1.000 | 0.923 | tATAAAag V$TBP_01 | 3624 (-) | 1.000 | 0.923 | cgTTTATa V$TBP_01 | 3717 (-) | 0.932 | 0.900 | taTTTACg V$TBP_01 | 3832 (+) | 1.000 | 0.957 | aATAAAtt V$TBP_01 | 3848 (+) | 1.000 | 0.895 | cATAAAat V$TBP_01 | 3917 (-) | 1.000 | 0.889 | gcTTTATg V$TBP_01 | 4005 (+) | 1.000 | 0.968 | gATAAAta V$TBP_01 | 4009 (+) | 1.000 | 0.959 | aATAAAta V$TBP_01 | 4226 (+) | 1.000 | 0.957 | aATAAAtt V$TBP_01 | 4782 (+) | 1.000 | 0.911 | gATAAAga V$TBP_01 | 5396 (-) | 1.000 | 0.929 | ccTTTATa V$TBP_01 | 5433 (-) | 1.000 | 0.900 | acTTTATt V$TBP_01 | 5591 (+) | 1.000 | 0.894 | aATAAAat V$TBP_01 | 5636 (+) | 1.000 | 0.911 | gATAAAga V$TBP_01 | 5647 (+) | 1.000 | 0.896 | aATAAAaa V$TBP_01 | 5794 (+) | 0.934 | 0.901 | aTTAAAta V$TBP_01 | 5952 (-) | 1.000 | 0.894 | atTTTATt V$TBP_01 | 6468 (+) | 1.000 | 0.903 | gATAAAat V$TBP_01 | 6669 (-) | 1.000 | 0.894 | atTTTATt V$TBP_01 | 6673 (-) | 0.917 | 0.895 | taTTTAGc V$TBP_01 | 6760 (-) | 1.000 | 0.929 | ccTTTATa V$TBP_01 | 6826 (-) | 1.000 | 0.898 | ttTTTATg V$TBP_01 | 6837 (-) | 0.932 | 0.899 | taTTTACt V$TBP_01 | 7434 (-) | 1.000 | 0.896 | ttTTTATt V$TBP_01 | 7439 (-) | 1.000 | 0.895 | atTTTATg V$TBP_01 | 7800 (+) | 1.000 | 0.911 | gATAAAga V$TBP_01 | 7895 (-) | 1.000 | 0.896 | ttTTTATt V$TBP_01 | 8061 (-) | 1.000 | 0.903 | agTTTATc V$TBP_01 | 8638 (-) | 0.934 | 0.903 | 
taTTTAAg V$TBP_01 | 8932 (+) | 1.000 | 0.935 | tATAAAct V$TBP_01 | 9427 (+) | 1.000 | 0.961 | cATAAAta V$TBP_01 | 9484 (-) | 1.000 | 0.900 | acTTTATt V$TBP_01 | 9884 (-) | 0.934 | 0.939 | aaTTTAAa V$TBP_01 | 10124 (+) | 1.000 | 0.894 | aATAAAat V$TBP_01 | 10316 (-) | 1.000 | 0.937 | ttTTTATa V$TBP_01 | 10324 (-) | 1.000 | 0.896 | ttTTTATt V$TBP_01 | 10329 (-) | 1.000 | 0.935 | atTTTATa V$TBP_01 | 10338 (-) | 1.000 | 0.896 | ttTTTATt V$TBP_01 | 10392 (+) | 1.000 | 0.937 | tATAAAaa V$TBP_01 | 10451 (-) | 1.000 | 0.923 | ctTTTATa V$TBP_01 | 10703 (-) | 1.000 | 0.898 | tgTTTATg V$TBP_01 | 10812 (+) | 0.917 | 0.912 | tCTAAAtc V$TBP_01 | 11339 (+) | 0.932 | 0.925 | tGTAAAtg V$TBP_01 | 11392 (+) | 0.934 | 0.899 | aTTAAAtt V$TBP_01 | 11407 (-) | 1.000 | 0.945 | caTTTATt V$TBP_01 | 11424 (-) | 0.917 | 0.895 | taTTTAGc V$TBP_01 | 11470 (-) | 1.000 | 0.896 | ttTTTATt V$TBP_01 | 11638 (+) | 0.934 | 0.927 | tTTAAAtg V$TBP_01 | 11689 (+) | 1.000 | 0.900 | aATAAAgt V$FOXO1_01 | 10701 (-) | 1.000 | 0.990 | cgTGTTTatg V$FOXO4_02 | 30 (-) | 1.000 | 0.903 | tgtgtgAACAActt V$FOXO4_02 | 71 (+) | 1.000 | 0.836 | ctgTTGTTgatgtt V$FOXO4_02 | 219 (+) | 0.800 | 0.773 | tttTTTTTttcctt V$FOXO4_02 | 220 (+) | 0.800 | 0.759 | tttTTTTTtccttt V$FOXO4_02 | 396 (-) | 1.000 | 0.792 | tttgccAACAAggc V$FOXO4_02 | 482 (-) | 0.800 | 0.749 | caagtcAACATtgc V$FOXO4_02 | 508 (+) | 0.800 | 0.747 | tatTTGTCttctga V$FOXO4_02 | 603 (-) | 0.800 | 0.748 | tttttaAACATcct V$FOXO4_02 | 630 (+) | 0.800 | 0.725 | cttTTTTTtaaaca V$FOXO4_02 | 834 (-) | 1.000 | 0.787 | atataaAACAAgtt V$FOXO4_02 | 843 (-) | 0.800 | 0.745 | aagttaAATAAata V$FOXO4_02 | 985 (-) | 0.800 | 0.738 | acagttAATAAttt V$FOXO4_02 | 1033 (-) | 1.000 | 0.792 | tgaatcAACAAagc V$FOXO4_02 | 1051 (-) | 1.000 | 0.833 | aaaatcAACAActt V$FOXO4_02 | 1088 (-) | 0.800 | 0.750 | ttggtcAACTAaaa V$FOXO4_02 | 1116 (+) | 0.800 | 0.733 | gatTTGTGtatgca V$FOXO4_02 | 1367 (+) | 0.800 | 0.728 | tgcTCGTTtgcagt V$FOXO4_02 | 1843 (-) | 0.800 | 0.752 | gtcgtcAACACaat 
V$FOXO4_02 | 2169 (+) | 0.800 | 0.745 | tccTTCTTtccttc V$FOXO4_02 | 2177 (+) | 0.800 | 0.758 | tccTTCTTttcttc V$FOXO4_02 | 2395 (+) | 0.800 | 0.789 | gagTTGCTttcctt V$FOXO4_02 | 2404 (-) | 0.800 | 0.724 | tccttaAAAAAaac V$FOXO4_02 | 2409 (-) | 1.000 | 0.821 | aaaaaaAACAAaaa V$FOXO4_02 | 2501 (-) | 0.800 | 0.749 | tccgaaAAAAAagg V$FOXO4_02 | 2511 (-) | 0.800 | 0.760 | aagggaAAAAAaat V$FOXO4_02 | 2512 (-) | 0.800 | 0.753 | agggaaAAAAAatt V$FOXO4_02 | 2600 (-) | 0.800 | 0.729 | ctattaAACCAaga V$FOXO4_02 | 2661 (+) | 0.800 | 0.747 | actTTGATgacact V$FOXO4_02 | 2747 (+) | 0.800 | 0.753 | tttTTCTTttctct V$FOXO4_02 | 2828 (+) | 0.800 | 0.760 | cagTTGTCtgcaaa V$FOXO4_02 | 2897 (+) | 0.800 | 0.726 | atcTTTTTtatcca V$FOXO4_02 | 3256 (+) | 0.800 | 0.756 | ttaTGGTTttcttg V$FOXO4_02 | 3292 (-) | 0.800 | 0.785 | gaggtgAACATctc V$FOXO4_02 | 3424 (+) | 0.800 | 0.762 | tggTTGCTtaattg V$FOXO4_02 | 3447 (-) | 0.800 | 0.754 | acagttAATAAaac V$FOXO4_02 | 3548 (+) | 0.800 | 0.750 | ctgTTCTTtagtag V$FOXO4_02 | 3691 (-) | 0.800 | 0.761 | caagaaCACAAgag V$FOXO4_02 | 3814 (+) | 0.800 | 0.744 | acaTCGTTttcaat V$FOXO4_02 | 3940 (+) | 1.000 | 0.787 | cacTTGTTcattca V$FOXO4_02 | 3993 (-) | 1.000 | 0.793 | tcatgaAACAAcga V$FOXO4_02 | 4003 (-) | 0.800 | 0.734 | acgataAATAAata V$FOXO4_02 | 4215 (-) | 0.800 | 0.752 | tcagttAAAAAaat V$FOXO4_02 | 4216 (-) | 0.800 | 0.743 | cagttaAAAAAata V$FOXO4_02 | 4258 (+) | 1.000 | 0.746 | tggTTGTTaggatg V$FOXO4_02 | 4421 (-) | 0.800 | 0.725 | tgtctaAAAAAaaa V$FOXO4_02 | 4486 (-) | 1.000 | 0.795 | cttttgAACAAaaa V$FOXO4_02 | 4493 (-) | 1.000 | 0.801 | acaaaaAACAAaac V$FOXO4_02 | 4498 (-) | 1.000 | 0.812 | aaacaaAACAAaac V$FOXO4_02 | 4503 (-) | 1.000 | 0.831 | aaacaaAACAAcat V$FOXO4_02 | 4522 (-) | 1.000 | 0.829 | caagacAACAAatg V$FOXO4_02 | 4534 (-) | 0.800 | 0.764 | tgcgtcAACAGcct V$FOXO4_02 | 4663 (+) | 1.000 | 0.809 | actTTGTTttgctc V$FOXO4_02 | 4787 (+) | 0.800 | 0.750 | agaTTGCTgacttg V$FOXO4_02 | 4813 (+) | 1.000 | 0.804 | catTTGTTaccagg V$FOXO4_02 | 5121 
(-) | 0.800 | 0.757 | aaaggaAACAGatg V$FOXO4_02 | 5273 (-) | 1.000 | 0.817 | tgggccAACAAcag V$FOXO4_02 | 5560 (-) | 1.000 | 0.911 | ctagaaAACAAcca V$FOXO4_02 | 5610 (-) | 0.800 | 0.790 | caagaaACCAAcat V$FOXO4_02 | 5655 (-) | 0.800 | 0.788 | gaagaaAAGAAcac V$FOXO4_02 | 5660 (-) | 0.800 | 0.758 | aaagaaCACAAtaa V$FOXO4_02 | 5831 (-) | 1.000 | 0.773 | ctgtgaAACAAggc V$FOXO4_02 | 5857 (+) | 0.800 | 0.768 | gagTTGGTttccca V$FOXO4_02 | 5858 (+) | 0.800 | 0.745 | agtTGGTTtcccat V$FOXO4_02 | 5912 (+) | 1.000 | 0.874 | agtTTGTTtcccca V$FOXO4_02 | 5952 (+) | 0.800 | 0.745 | attTTATTtgccag V$FOXO4_02 | 6106 (+) | 0.800 | 0.742 | gtcCTGTTgaccag V$FOXO4_02 | 6122 (+) | 1.000 | 0.878 | ggcTTGTTttctcg V$FOXO4_02 | 6456 (+) | 0.800 | 0.725 | aacTTATTtcctga V$FOXO4_02 | 6534 (-) | 0.800 | 0.736 | cctggaGACAAacg V$FOXO4_02 | 6585 (-) | 1.000 | 0.789 | acacaaAACAAata V$FOXO4_02 | 6652 (-) | 0.800 | 0.734 | tctgcaAACAGatg V$FOXO4_02 | 6669 (+) | 0.800 | 0.747 | attTTATTtagctt V$FOXO4_02 | 6767 (-) | 1.000 | 0.800 | accatgAACAAaac V$FOXO4_02 | 6822 (+) | 0.800 | 0.745 | cccTTTTTtatgtg V$FOXO4_02 | 6910 (-) | 1.000 | 0.809 | gtggagAACAAgga V$FOXO4_02 | 7014 (+) | 0.800 | 0.753 | cggATGTTtcctgc V$FOXO4_02 | 7039 (+) | 0.800 | 0.752 | agtTAGTTgactat V$FOXO4_02 | 7050 (+) | 0.800 | 0.767 | tatTTTTTttctta V$FOXO4_02 | 7063 (+) | 1.000 | 0.821 | aatTTGTTtttgtt V$FOXO4_02 | 7069 (+) | 1.000 | 0.806 | tttTTGTTtgtgta V$FOXO4_02 | 7073 (+) | 0.800 | 0.740 | tgtTTGTGtatgag V$FOXO4_02 | 7208 (-) | 1.000 | 0.807 | gaattgAACAAaaa V$FOXO4_02 | 7383 (+) | 0.800 | 0.727 | tgcTTGTGtccagg V$FOXO4_02 | 7428 (+) | 1.000 | 0.820 | tatTTGTTtttatt V$FOXO4_02 | 7563 (-) | 0.800 | 0.737 | gctggaAACCAaat V$FOXO4_02 | 7564 (-) | 0.800 | 0.756 | ctggaaACCAAatc V$FOXO4_02 | 7584 (-) | 1.000 | 0.779 | aggtcaAACAAatg V$FOXO4_02 | 7655 (+) | 0.800 | 0.745 | cttTTGTAtgccag V$FOXO4_02 | 7771 (-) | 1.000 | 0.798 | tcaaaaAACAAaac V$FOXO4_02 | 7776 (-) | 1.000 | 0.812 | aaacaaAACAAaaa V$FOXO4_02 | 7798 (-) | 0.800 | 0.775 
| aagataAAGAAcaa V$FOXO4_02 | 7801 (-) | 1.000 | 0.753 | ataaagAACAAcga V$FOXO4_02 | 7818 (-) | 1.000 | 0.796 | ctccaaAACAAaca V$FOXO4_02 | 7822 (-) | 1.000 | 0.792 | aaaacaAACAAtta V$FOXO4_02 | 7937 (+) | 0.800 | 0.745 | tgtGTGTTtagttt V$FOXO4_02 | 7968 (+) | 1.000 | 0.825 | tagTTGTTtgtttt V$FOXO4_02 | 7972 (+) | 1.000 | 0.804 | tgtTTGTTtttcaa V$FOXO4_02 | 8057 (+) | 0.800 | 0.745 | gtcTAGTTtatctg V$FOXO4_02 | 8146 (-) | 0.800 | 0.740 | agggaaAACATtag V$FOXO4_02 | 8183 (+) | 1.000 | 0.866 | aacTTGTTtgctct V$FOXO4_02 | 8196 (+) | 0.800 | 0.730 | tgcCTGTTtcccct V$FOXO4_02 | 8487 (+) | 0.800 | 0.773 | gggTTATTttctac V$FOXO4_02 | 8552 (-) | 1.000 | 0.805 | aaaatgAACAAtgt V$FOXO4_02 | 8569 (+) | 0.800 | 0.744 | tctTTGATtaaatg V$FOXO4_02 | 8642 (-) | 0.800 | 0.753 | taagaaGACAAttc V$FOXO4_02 | 8657 (+) | 0.800 | 0.740 | caaTGGTTttcaaa V$FOXO4_02 | 8930 (-) | 0.800 | 0.738 | agtataAACTAaag V$FOXO4_02 | 9107 (-) | 1.000 | 0.812 | aagcaaAACAAaaa V$FOXO4_02 | 9296 (+) | 0.800 | 0.728 | agcTTGTGtgccca V$FOXO4_02 | 9413 (-) | 0.800 | 0.741 | ctagaaAACTAtcc V$FOXO4_02 | 9425 (-) | 0.800 | 0.754 | cccataAATAAcag V$FOXO4_02 | 9484 (+) | 0.800 | 0.745 | actTTATTtgcaat V$FOXO4_02 | 9530 (-) | 1.000 | 0.736 | gactagAACAAaca V$FOXO4_02 | 9534 (-) | 1.000 | 0.778 | agaacaAACAAgtc V$FOXO4_02 | 9760 (-) | 0.800 | 0.743 | attgaaAACAGtca V$FOXO4_02 | 9786 (+) | 1.000 | 0.794 | attTTGTTcagtaa V$FOXO4_02 | 9875 (-) | 0.800 | 0.754 | gaggaaATCAAttt V$FOXO4_02 | 10131 (-) | 0.800 | 0.756 | ttagttAAGAAatt V$FOXO4_02 | 10149 (+) | 1.000 | 0.729 | ttaTTGTTgttgtc V$FOXO4_02 | 10152 (+) | 1.000 | 0.852 | ttgTTGTTgtcgtt V$FOXO4_02 | 10158 (+) | 0.800 | 0.789 | ttgTCGTTttcgtc V$FOXO4_02 | 10302 (+) | 0.800 | 0.730 | acaTTTTTtaactg V$FOXO4_02 | 10303 (+) | 0.800 | 0.752 | catTTTTTaactgt V$FOXO4_02 | 10341 (+) | 1.000 | 0.798 | ttaTTGTTttaatt V$FOXO4_02 | 10415 (+) | 0.800 | 0.740 | gcaTTTTTaacact V$FOXO4_02 | 10455 (-) | 1.000 | 0.797 | tatatcAACAAtgc V$FOXO4_02 | 10660 (-) | 0.800 | 0.758 | 
aatgtcAACAGgag V$FOXO4_02 | 10699 (+) | 0.800 | 0.730 | cccGTGTTtatgaa V$FOXO4_02 | 10911 (-) | 0.800 | 0.743 | tatgcaAATAAtag V$FOXO4_02 | 10933 (+) | 0.800 | 0.738 | ctcTTGTGgacggt V$FOXO4_02 | 11039 (+) | 1.000 | 0.729 | ctgTTGTTggttcc V$FOXO4_02 | 11042 (+) | 0.800 | 0.778 | ttgTTGGTtccttt V$FOXO4_02 | 11051 (+) | 1.000 | 0.911 | cctTTGTTaacgtt V$FOXO4_02 | 11100 (-) | 0.800 | 0.742 | tgtttaAAGAAcga V$FOXO4_02 | 11187 (+) | 1.000 | 0.774 | gccTTGTTtcttga V$FOXO4_02 | 11198 (+) | 0.800 | 0.724 | tgaTTGATtattaa V$FOXO4_02 | 11210 (+) | 1.000 | 0.779 | aatTTGTTtgaact V$FOXO4_02 | 11288 (-) | 0.800 | 0.739 | agggcaAAGAAata V$FOXO4_02 | 11466 (+) | 0.800 | 0.739 | aaaTTTTTtatttg V$FOXO4_02 | 11474 (+) | 1.000 | 0.798 | tatTTGTTtttccc V$FOXO4_02 | 11574 (+) | 0.800 | 0.738 | tcaTTGCTtatatc V$FOXO4_02 | 11632 (+) | 0.800 | 0.735 | cccTCGTTtaaatg V$FOXO4_02 | 11653 (+) | 0.800 | 0.739 | gtaCTGTTaacaca V$FOXO4_02 | 11655 (-) | 0.800 | 0.755 | actgttAACACaat V$FOXO4_02 | 11670 (-) | 0.800 | 0.740 | ttcataACCAAaaa V$FOXO4_02 | 11759 (-) | 0.800 | 0.747 | agagttAACAGgat V$FOXO3_01 | 30 (-) | 1.000 | 0.867 | tgtgtgAACAActt V$FOXO3_01 | 71 (+) | 1.000 | 0.835 | ctgTTGTTgatgtt V$FOXO3_01 | 396 (-) | 1.000 | 0.778 | tttgccAACAAggc V$FOXO3_01 | 603 (-) | 0.829 | 0.778 | tttttaAACATcct V$FOXO3_01 | 634 (-) | 0.829 | 0.776 | tttttaAACATtta V$FOXO3_01 | 834 (-) | 1.000 | 0.796 | atataaAACAAgtt V$FOXO3_01 | 843 (-) | 0.793 | 0.776 | aagttaAATAAata V$FOXO3_01 | 1033 (-) | 1.000 | 0.795 | tgaatcAACAAagc V$FOXO3_01 | 1051 (-) | 1.000 | 0.834 | aaaatcAACAActt V$FOXO3_01 | 2409 (-) | 1.000 | 0.836 | aaaaaaAACAAaaa V$FOXO3_01 | 3292 (-) | 0.829 | 0.771 | gaggtgAACATctc V$FOXO3_01 | 3940 (+) | 1.000 | 0.793 | cacTTGTTcattca V$FOXO3_01 | 3993 (-) | 1.000 | 0.786 | tcatgaAACAAcga V$FOXO3_01 | 4486 (-) | 1.000 | 0.784 | cttttgAACAAaaa V$FOXO3_01 | 4493 (-) | 1.000 | 0.790 | acaaaaAACAAaac V$FOXO3_01 | 4498 (-) | 1.000 | 0.824 | aaacaaAACAAaac V$FOXO3_01 | 4503 (-) | 1.000 | 0.830 | 
aaacaaAACAAcat V$FOXO3_01 | 4522 (-) | 1.000 | 0.796 | caagacAACAAatg V$FOXO3_01 | 4663 (+) | 1.000 | 0.812 | actTTGTTttgctc V$FOXO3_01 | 5121 (-) | 0.829 | 0.775 | aaaggaAACAGatg V$FOXO3_01 | 5273 (-) | 1.000 | 0.771 | tgggccAACAAcag V$FOXO3_01 | 5560 (-) | 1.000 | 0.867 | ctagaaAACAAcca V$FOXO3_01 | 5912 (+) | 1.000 | 0.854 | agtTTGTTtcccca V$FOXO3_01 | 6122 (+) | 1.000 | 0.847 | ggcTTGTTttctcg V$FOXO3_01 | 6585 (-) | 1.000 | 0.793 | acacaaAACAAata V$FOXO3_01 | 6669 (+) | 0.793 | 0.775 | attTTATTtagctt V$FOXO3_01 | 6767 (-) | 1.000 | 0.786 | accatgAACAAaac V$FOXO3_01 | 6910 (-) | 1.000 | 0.774 | gtggagAACAAgga V$FOXO3_01 | 7063 (+) | 1.000 | 0.829 | aatTTGTTtttgtt V$FOXO3_01 | 7069 (+) | 1.000 | 0.828 | tttTTGTTtgtgta V$FOXO3_01 | 7208 (-) | 1.000 | 0.812 | gaattgAACAAaaa V$FOXO3_01 | 7428 (+) | 1.000 | 0.838 | tatTTGTTtttatt V$FOXO3_01 | 7584 (-) | 1.000 | 0.775 | aggtcaAACAAatg V$FOXO3_01 | 7771 (-) | 1.000 | 0.792 | tcaaaaAACAAaac V$FOXO3_01 | 7776 (-) | 1.000 | 0.832 | aaacaaAACAAaaa V$FOXO3_01 | 7798 (-) | 0.793 | 0.789 | aagataAAGAAcaa V$FOXO3_01 | 7818 (-) | 1.000 | 0.790 | ctccaaAACAAaca V$FOXO3_01 | 7822 (-) | 1.000 | 0.822 | aaaacaAACAAtta V$FOXO3_01 | 7937 (+) | 0.837 | 0.811 | tgtGTGTTtagttt V$FOXO3_01 | 7968 (+) | 1.000 | 0.830 | tagTTGTTtgtttt V$FOXO3_01 | 7972 (+) | 1.000 | 0.809 | tgtTTGTTtttcaa V$FOXO3_01 | 8183 (+) | 1.000 | 0.854 | aacTTGTTtgctct V$FOXO3_01 | 8552 (-) | 1.000 | 0.822 | aaaatgAACAAtgt V$FOXO3_01 | 9107 (-) | 1.000 | 0.830 | aagcaaAACAAaaa V$FOXO3_01 | 9534 (-) | 1.000 | 0.784 | agaacaAACAAgtc V$FOXO3_01 | 9786 (+) | 1.000 | 0.797 | attTTGTTcagtaa V$FOXO3_01 | 10152 (+) | 1.000 | 0.822 | ttgTTGTTgtcgtt V$FOXO3_01 | 10341 (+) | 1.000 | 0.826 | ttaTTGTTttaatt V$FOXO3_01 | 10455 (-) | 1.000 | 0.829 | tatatcAACAAtgc V$FOXO3_01 | 10660 (-) | 0.829 | 0.789 | aatgtcAACAGgag V$FOXO3_01 | 10699 (+) | 0.837 | 0.785 | cccGTGTTtatgaa V$FOXO3_01 | 11051 (+) | 1.000 | 0.899 | cctTTGTTaacgtt V$FOXO3_01 | 11187 (+) | 1.000 | 0.783 | 
gccTTGTTtcttga V$FOXO3_01 | 11210 (+) | 1.000 | 0.778 | aatTTGTTtgaact V$FOXO3_01 | 11474 (+) | 1.000 | 0.787 | tatTTGTTtttccc V$FOXO3_01 | 11574 (+) | 0.793 | 0.773 | tcaTTGCTtatatc V$CDC5_01 | 3873 (+) | 0.926 | 0.864 | gagTTAATacaa V$CDC5_01 | 4121 (-) | 1.000 | 0.885 | agatGTTAAaac V$CDC5_01 | 9944 (-) | 0.867 | 0.873 | gtatGTTATaac V$LUN1_01 | 736 (-) | 1.000 | 0.711 | ccacgtggttgcTGGGA V$LUN1_01 | 8764 (+) | 1.000 | 0.685 | TCCCAgcacctgggagg V$AR_01 | 1622 (+) | 0.975 | 0.890 | tggacagcaTGTCCa V$AR_01 | 1622 (-) | 0.975 | 0.886 | tGGACAgcatgtcca V$AR_01 | 2810 (+) | 0.747 | 0.828 | tgaacaccgTGCTCt V$AR_01 | 2810 (-) | 1.000 | 0.835 | tGAACAccgtgctct V$AR_01 | 4692 (+) | 0.975 | 0.797 | gaaacttgaTGTCCt V$AR_01 | 6518 (+) | 0.905 | 0.869 | agaacacgaAGTTCa V$AR_01 | 6518 (-) | 1.000 | 0.909 | aGAACAcgaagttca V$AR_01 | 6856 (-) | 0.896 | 0.783 | aGCACGtccagaccg V$AR_01 | 8534 (+) | 0.737 | 0.807 | agaacacagTGGGCc V$AR_01 | 8534 (-) | 1.000 | 0.902 | aGAACAcagtgggcc V$AR_01 | 9465 (+) | 1.000 | 0.955 | tgtacaggaTGTTCt V$AR_01 | 9465 (-) | 0.977 | 0.898 | tGTACAggatgttct V$AR_01 | 10080 (+) | 0.885 | 0.807 | agggcagccAGTGCt V$AR_01 | 10645 (+) | 0.975 | 0.790 | gacacccagTGTCCt V$AR_01 | 11223 (+) | 1.000 | 0.854 | tgcccagttTGTTCt V$AR_01 | 11223 (-) | 0.737 | 0.785 | tGCCCAgtttgttct V$AR_01 | 11619 (-) | 0.866 | 0.791 | gGTACCagaaggacc V$PITX2_Q2 | 2023 (-) | 1.000 | 0.995 | ctggGATTAaa V$PITX2_Q2 | 4161 (+) | 1.000 | 0.905 | tgTAATCctag V$PITX2_Q2 | 9591 (+) | 1.000 | 0.995 | tgTAATCccag V$NKX22_01 | 288 (+) | 0.938 | 0.946 | tCAAGTgttt I$MTTFA_01 | 5582 (-) | 1.000 | 0.975 | aagGATAAtaa I$MTTFA_01 | 11440 (+) | 1.000 | 0.991 | tacTTATCtat I$MTTFA_01 | 11568 (+) | 1.000 | 0.982 | ctcTTATCatt V$NKX62_Q2 | 63 (-) | 1.000 | 0.970 | catAATTActgt V$NKX62_Q2 | 585 (-) | 1.000 | 0.946 | ttgAATTAacag V$NKX62_Q2 | 648 (-) | 0.980 | 0.960 | ttcATTTAttat V$NKX62_Q2 | 843 (+) | 0.980 | 0.948 | aagtTAAATaaa V$NKX62_Q2 | 847 (+) | 0.980 | 0.977 | taaaTAAATacg V$NKX62_Q2 | 989 
(+) | 1.000 | 0.977 | ttaaTAATTttc V$NKX62_Q2 | 1894 (-) | 0.980 | 0.979 | tatATTTAtttt V$NKX62_Q2 | 2361 (-) | 0.980 | 0.979 | tgtATTTAtttt V$NKX62_Q2 | 2574 (-) | 0.980 | 0.942 | agtATTTAcaac V$NKX62_Q2 | 3105 (+) | 1.000 | 0.956 | atacTAATTtcc V$NKX62_Q2 | 3125 (+) | 1.000 | 0.971 | atatTAATTtac V$NKX62_Q2 | 3731 (-) | 1.000 | 0.979 | taaAATTAttat V$NKX62_Q2 | 3734 (-) | 0.932 | 0.940 | aatTATTAtttc V$NKX62_Q2 | 3830 (+) | 0.980 | 0.962 | ataaTAAATttc V$NKX62_Q2 | 4007 (+) | 0.980 | 0.977 | taaaTAAATatt V$NKX62_Q2 | 4028 (-) | 1.000 | 0.971 | tggAATTAatat V$NKX62_Q2 | 4058 (+) | 1.000 | 0.979 | gcaaTAATTaaa V$NKX62_Q2 | 4060 (-) | 1.000 | 0.957 | aatAATTAaaat V$NKX62_Q2 | 4224 (+) | 0.980 | 0.962 | aaaaTAAATtct V$NKX62_Q2 | 4445 (-) | 1.000 | 0.972 | cacAATTAatat V$NKX62_Q2 | 5414 (+) | 0.980 | 0.951 | ttatTAAATctt V$NKX62_Q2 | 5788 (+) | 1.000 | 0.998 | gaaaTAATTaaa V$NKX62_Q2 | 5790 (-) | 1.000 | 0.954 | aatAATTAaata V$NKX62_Q2 | 5792 (+) | 0.980 | 0.968 | taatTAAATaca V$NKX62_Q2 | 6835 (-) | 0.980 | 0.955 | gctATTTActgc V$NKX62_Q2 | 7400 (-) | 1.000 | 0.949 | tagAATTActga V$NKX62_Q2 | 7636 (-) | 1.000 | 0.979 | tagAATTAtttt V$NKX62_Q2 | 8683 (+) | 1.000 | 0.968 | aaggTAATTagc V$NKX62_Q2 | 8952 (-) | 1.000 | 0.953 | taaAATTAgtag V$NKX62_Q2 | 10126 (-) | 1.000 | 0.953 | taaAATTAgtta V$NKX62_Q2 | 10138 (-) | 1.000 | 0.980 | agaAATTAttat V$NKX62_Q2 | 10346 (+) | 1.000 | 0.940 | gtttTAATTcaa V$NKX62_Q2 | 11205 (+) | 1.000 | 0.969 | ttatTAATTtgt V$NKX62_Q2 | 11390 (+) | 0.980 | 0.951 | ttatTAAATttt V$BACH2_01 | 10034 (+) | 1.000 | 0.983 | tgTGAGTcacc V$MAZR_01 | 1492 (+) | 1.000 | 0.949 | gagggGGGGGgga V$MAZR_01 | 1493 (+) | 1.000 | 0.968 | aggggGGGGGgag V$MAZR_01 | 1494 (+) | 1.000 | 0.979 | gggggGGGGGagg V$MAZR_01 | 9682 (+) | 1.000 | 0.939 | ggaggGGGGGtgg V$BACH1_01 | 10032 (+) | 1.000 | 0.904 | tttgTGAGTcaccgt P$ANT_01 | 161 (+) | 0.984 | 0.875 | ctccgttCCCAAtg P$ANT_01 | 3439 (-) | 0.886 | 0.790 | ctTAGGTaacagtt P$ANT_01 | 6262 (+) | 0.894 | 0.845 | 
cccatctGCCGAag P$ANT_01 | 8648 (+) | 0.984 | 0.879 | gacaattCCCAAtg P$ANT_01 | 10789 (-) | 0.774 | 0.825 | caGGGGGatgtgag P$TEIL_01 | 5 (+) | 0.976 | 0.912 | ATGCAtgt P$TEIL_01 | 15 (-) | 1.000 | 0.943 | aagTACAT P$TEIL_01 | 1009 (+) | 0.874 | 0.904 | ACGCAcct P$TEIL_01 | 1125 (+) | 0.976 | 0.930 | ATGCAcgt P$TEIL_01 | 1155 (+) | 1.000 | 0.883 | ATGTAgtt P$TEIL_01 | 2045 (+) | 0.869 | 0.901 | CTGTAcct P$TEIL_01 | 2657 (+) | 0.976 | 0.925 | ATGCActt P$TEIL_01 | 3307 (+) | 1.000 | 0.883 | ATGTAgtt P$TEIL_01 | 4107 (-) | 0.969 | 0.907 | acaTCCAT P$TEIL_01 | 4453 (-) | 0.992 | 0.916 | ataTTCAT P$TEIL_01 | 4908 (-) | 0.976 | 0.925 | aagTGCAT P$TEIL_01 | 7235 (-) | 0.992 | 0.937 | aagTTCAT P$TEIL_01 | 7321 (+) | 1.000 | 0.948 | ATGTAcgt P$TEIL_01 | 8276 (+) | 0.976 | 0.922 | ATGCAgct P$TEIL_01 | 9169 (-) | 0.976 | 0.964 | agaTGCAT P$TEIL_01 | 9739 (-) | 0.969 | 0.916 | atgTCCAT P$TEIL_01 | 9800 (+) | 1.000 | 0.940 | ATGTAcat P$TEIL_01 | 9800 (-) | 1.000 | 0.940 | atgTACAT P$TEIL_01 | 9942 (+) | 1.000 | 0.931 | ATGTAtgt P$TEIL_01 | 10465 (+) | 0.976 | 0.964 | ATGCAtct P$TEIL_01 | 11161 (+) | 0.992 | 0.934 | ATGAAgct P$TEIL_01 | 11643 (+) | 1.000 | 0.925 | ATGTAttt P$ATHB5_01 | 4059 (+) | 0.971 | 0.964 | caaTAATTa P$LIM1_01 | 202 (+) | 1.000 | 0.922 | CCACCatgcctg P$LIM1_01 | 427 (+) | 0.796 | 0.827 | CCAAAataccac P$LIM1_01 | 451 (-) | 0.796 | 0.826 | tgtctgtTTTGG P$LIM1_01 | 454 (-) | 0.813 | 0.837 | ctgttttGGTTG P$LIM1_01 | 733 (+) | 1.000 | 0.895 | CCACCacgtggt P$LIM1_01 | 739 (-) | 0.789 | 0.819 | cgtggttGCTGG P$LIM1_01 | 929 (+) | 0.968 | 0.950 | CCACGaaaccac P$LIM1_01 | 937 (+) | 0.977 | 0.884 | CCACAagttagg P$LIM1_01 | 995 (-) | 0.977 | 0.820 | attttcaTGTGG P$LIM1_01 | 1080 (-) | 0.796 | 0.819 | aggtcatTTTGG P$LIM1_01 | 1314 (+) | 0.791 | 0.836 | CCCCTaactccc P$LIM1_01 | 1596 (-) | 0.805 | 0.821 | ctgggttTGGGG P$LIM1_01 | 1613 (-) | 0.968 | 0.971 | ggggtctCGTGG P$LIM1_01 | 1980 (+) | 0.845 | 0.846 | CCTCAaactcac P$LIM1_01 | 2310 (-) | 0.796 | 0.832 | tgtgtgtTTTGG 
P$LIM1_01 | 2998 (+) | 0.868 | 0.820 | CCTCCaaggtgc P$LIM1_01 | 3141 (+) | 0.977 | 0.939 | CCACAacaatat P$LIM1_01 | 3148 (-) | 0.868 | 0.843 | aatatatGGAGG P$LIM1_01 | 3238 (+) | 0.819 | 0.832 | CCAACattccca P$LIM1_01 | 3259 (-) | 0.787 | 0.841 | tggttttCTTGG P$LIM1_01 | 3418 (-) | 0.813 | 0.843 | tgattctGGTTG P$LIM1_01 | 3510 (-) | 0.831 | 0.829 | tgctgttAGAGG P$LIM1_01 | 3525 (+) | 1.000 | 0.847 | CCACCttggccc P$LIM1_01 | 3641 (-) | 1.000 | 0.893 | accacatGGTGG P$LIM1_01 | 4306 (+) | 0.845 | 0.836 | CCTCAatactgc P$LIM1_01 | 4339 (-) | 1.000 | 0.903 | tccaggtGGTGG P$LIM1_01 | 4342 (-) | 0.868 | 0.885 | aggtggtGGAGG P$LIM1_01 | 4412 (+) | 0.977 | 0.911 | CCACAaccttgt P$LIM1_01 | 4919 (+) | 0.977 | 0.922 | CCACAacttaca P$LIM1_01 | 5278 (+) | 0.790 | 0.818 | CAACAacaggct P$LIM1_01 | 5428 (+) | 0.868 | 0.834 | CCTCCactttat P$LIM1_01 | 5691 (+) | 0.963 | 0.908 | CCACTagttcat P$LIM1_01 | 5716 (+) | 0.977 | 0.951 | CCACAaaaacac P$LIM1_01 | 5967 (-) | 0.963 | 0.815 | gtttttaAGTGG P$LIM1_01 | 6374 (-) | 0.813 | 0.826 | gagatttGGTAG P$LIM1_01 | 6507 (+) | 0.977 | 0.898 | CCACAaataaaa P$LIM1_01 | 6593 (-) | 1.000 | 0.829 | caaataaGGTGG P$LIM1_01 | 6716 (+) | 0.977 | 0.904 | CCACAaccttta P$LIM1_01 | 6752 (+) | 0.963 | 0.915 | CCACTatccctt P$LIM1_01 | 6887 (+) | 0.968 | 0.838 | CCACGtaagaca P$LIM1_01 | 6902 (-) | 0.963 | 0.826 | gggcataAGTGG P$LIM1_01 | 7388 (-) | 1.000 | 0.820 | gtgtccaGGTGG P$LIM1_01 | 7666 (-) | 1.000 | 0.922 | caggcatGGTGG P$LIM1_01 | 7702 (-) | 0.831 | 0.859 | gggaggtAGAGG P$LIM1_01 | 7735 (+) | 0.789 | 0.817 | CCATCacaggct P$LIM1_01 | 7820 (+) | 0.796 | 0.827 | CCAAAacaaaca P$LIM1_01 | 7962 (-) | 0.776 | 0.825 | agactatAGTTG P$LIM1_01 | 8000 (-) | 0.819 | 0.814 | tgtgcctGTTGG P$LIM1_01 | 8032 (-) | 0.789 | 0.813 | agagcttGCTGG P$LIM1_01 | 8251 (-) | 0.968 | 0.930 | gtgtgttCGTGG P$LIM1_01 | 8336 (-) | 0.789 | 0.834 | agtatttGCTGG P$LIM1_01 | 8738 (-) | 1.000 | 0.995 | ggggtgtGGTGG P$LIM1_01 | 8798 (-) | 0.828 | 0.815 | tagagttGGGGG P$LIM1_01 | 8899 (+) | 
0.977 | 0.910 | CCACAaacagat P$LIM1_01 | 8971 (-) | 1.000 | 0.954 | caggtgtGGTGG P$LIM1_01 | 9000 (-) | 0.868 | 0.842 | agcacatGGAGG P$LIM1_01 | 9279 (+) | 1.000 | 0.912 | CCACCaagctga P$LIM1_01 | 9330 (-) | 0.868 | 0.869 | agcctctGGAGG P$LIM1_01 | 9571 (-) | 1.000 | 0.989 | tgggtgtGGTGG P$LIM1_01 | 9610 (-) | 1.000 | 0.841 | aagctgaGGTGG P$LIM1_01 | 9675 (-) | 0.868 | 0.831 | aagaagtGGAGG P$LIM1_01 | 9686 (-) | 1.000 | 0.980 | ggggggtGGTGG P$LIM1_01 | 9689 (-) | 1.000 | 0.975 | gggtggtGGTGG P$LIM1_01 | 9692 (-) | 1.000 | 0.969 | tggtggtGGTGG P$LIM1_01 | 9695 (-) | 1.000 | 0.969 | tggtggtGGTGG P$LIM1_01 | 9698 (-) | 1.000 | 0.969 | tggtggtGGTGG P$LIM1_01 | 9701 (-) | 1.000 | 0.969 | tggtggtGGTGG P$LIM1_01 | 10008 (-) | 0.868 | 0.865 | tgggactGGAGG P$LIM1_01 | 10235 (-) | 1.000 | 0.945 | ctaggttGGTGG P$LIM1_01 | 10400 (-) | 1.000 | 0.835 | tgaacgaGGTGG P$LIM1_01 | 10618 (-) | 0.790 | 0.821 | agggagtTGTAG P$LIM1_01 | 10730 (-) | 0.783 | 0.818 | agcttttATTGG P$LIM1_01 | 10868 (-) | 1.000 | 0.914 | gcagcgtGGTGG P$LIM1_01 | 10871 (-) | 1.000 | 0.935 | gcgtggtGGTGG P$LIM1_01 | 10930 (-) | 0.977 | 0.942 | cgcctctTGTGG P$LIM1_01 | 10962 (-) | 0.819 | 0.814 | tgtgcctGTTGG P$LIM1_01 | 11173 (-) | 0.796 | 0.813 | agatcttTTTGG P$LIM1_01 | 11501 (-) | 0.977 | 0.928 | gaattttTGTGG P$LIM1_01 | 11560 (+) | 0.813 | 0.861 | CAACCaaactct P$LIM1_01 | 11677 (+) | 0.796 | 0.835 | CCAAAaaagacc P$LIM1_01 | 11709 (+) | 0.805 | 0.824 | CCCCAactgccc P$LIM1_01 | 11719 (+) | 0.868 | 0.835 | CCTCCaccattt P$LIM1_01 | 11722 (+) | 1.000 | 0.950 | CCACCattttct P$LIM1_01 | 11804 (-) | 1.000 | 0.826 | attggaaGGTGG P$LIM1_01 | 11852 (-) | 0.828 | 0.875 | ggggtatGGGGG V$LHX3_01 | 5791 (-) | 1.000 | 0.979 | ataATTAAat V$ATF3_Q6 | 1236 (+) | 0.881 | 0.887 | cttTGACAtcaggg V$ATF3_Q6 | 1236 (-) | 0.877 | 0.893 | ctttgaCATCAggg V$ATF3_Q6 | 1443 (+) | 0.861 | 0.872 | tagTGAAGtcactg V$ATF3_Q6 | 1443 (-) | 0.862 | 0.844 | tagtgaAGTCActg V$ATF3_Q6 | 2963 (-) | 0.809 | 0.862 | cagtgaCGGCAgat V$ATF3_Q6 | 4331 (+) 
| 0.881 | 0.811 | tggTGACAtccagg V$ATF3_Q6 | 4848 (-) | 0.861 | 0.810 | ctgagaCTTCActg V$ATF3_Q6 | 4861 (-) | 0.862 | 0.802 | gagggaGGTCAgca V$ATF3_Q6 | 4962 (-) | 0.862 | 0.803 | gcaggaAGTCAggc V$ATF3_Q6 | 8208 (+) | 0.862 | 0.864 | ctcTGACCtcagtt V$ATF3_Q6 | 8208 (-) | 0.877 | 0.884 | ctctgaCCTCAgtt V$ATF3_Q6 | 9344 (+) | 0.861 | 0.801 | cccTGAAGtctctt V$ATF3_Q6 | 11108 (-) | 0.861 | 0.806 | gaacgaCTTCAgat V$ATF4_Q2 | 6322 (+) | 0.933 | 0.875 | ccTGACTgaagc V$ATF4_Q2 | 8046 (+) | 0.933 | 0.874 | ctTGACTcgagg V$ATF4_Q2 | 8321 (-) | 1.000 | 0.889 | ttctgCGTCAgc V$ATF4_Q2 | 8610 (-) | 1.000 | 0.867 | cattcCGTCAat V$ATF4_Q2 | 9319 (-) | 0.888 | 0.857 | cgttgGGTCAga V$E2F_03 | 2468 (-) | 0.992 | 0.887 | gcaagCGCGAag V$E2F_03 | 7089 (+) | 0.940 | 0.891 | ctTTGCGcgcac V$AP1_01 | 131 (+) | 1.000 | 0.894 | tgctcAGTCAatc V$AP1_01 | 438 (+) | 0.948 | 0.889 | cgtttAATCAggt V$AP1_01 | 796 (-) | 0.871 | 0.890 | cgcTGAGCcattt V$AP1_01 | 1030 (+) | 0.948 | 0.925 | agttgAATCAaca V$AP1_01 | 1030 (-) | 0.946 | 0.923 | agtTGAATcaaca V$AP1_01 | 1352 (+) | 0.903 | 0.895 | gcctgGGTCAcct V$AP1_01 | 3433 (-) | 1.000 | 0.897 | aatTGACTtaggt V$AP1_01 | 3724 (-) | 1.000 | 0.925 | gggTGACTaaaat V$AP1_01 | 3750 (+) | 1.000 | 0.919 | tttttAGTCAttc V$AP1_01 | 3932 (+) | 1.000 | 0.909 | ttatcAGTCActt V$AP1_01 | 3944 (+) | 0.946 | 0.886 | tgttcATTCAgcc V$AP1_01 | 4465 (-) | 1.000 | 0.923 | cacTGACTgacca V$AP1_01 | 4531 (+) | 0.909 | 0.899 | aaatgCGTCAaca V$AP1_01 | 6292 (-) | 1.000 | 0.915 | tgaTGACTgagca V$AP1_01 | 6321 (-) | 1.000 | 0.922 | cccTGACTgaagc V$AP1_01 | 6414 (-) | 1.000 | 0.901 | gaaTGACTcctct V$AP1_01 | 6554 (+) | 1.000 | 0.902 | gtaagAGTCAtgg V$AP1_01 | 7606 (-) | 0.871 | 0.890 | tgcTGAGCcatct V$AP1_01 | 8321 (+) | 0.909 | 0.910 | ttctgCGTCAgcg V$AP1_01 | 9287 (+) | 1.000 | 0.922 | ctgagAGTCAgct V$AP1_01 | 9319 (+) | 0.903 | 0.904 | cgttgGGTCAgag V$AP1_01 | 10033 (+) | 1.000 | 0.977 | ttgtgAGTCAccg V$AP1_01 | 10033 (-) | 0.968 | 0.945 | ttgTGAGTcaccg V$AP1_01 | 10631 (-) | 1.000 | 
0.899 | aagTGACTgagga V$AP1_01 | 11453 (+) | 0.948 | 0.940 | tggtgAATCAgtg V$AP1_01 | 11453 (-) | 0.946 | 0.929 | tggTGAATcagtg V$PPARA_02 | 191 (-) | 0.810 | 0.736 | tgcctcccacaCCACCatg V$PPARA_02 | 1078 (+) | 1.000 | 0.901 | gtaGGTCAttttggtcaac V$PPARA_02 | 1348 (-) | 0.945 | 0.761 | gcttgcctgggTCACCtga V$PPARA_02 | 1379 (+) | 0.768 | 0.744 | gtgGGCGAggtagggcagg V$PPARA_02 | 1384 (+) | 0.805 | 0.834 | cgaGGTAGggcaggtgagg V$PPARA_02 | 1393 (+) | 0.945 | 0.755 | gcaGGTGAggagagtgggt V$PPARA_02 | 1497 (+) | 0.768 | 0.716 | gggGGGGAgggagggaggg V$PPARA_02 | 2666 (-) | 0.950 | 0.779 | gatgacactctTAACCcgt V$PPARA_02 | 3257 (+) | 0.796 | 0.817 | tatGGTTTtcttggtgatg V$PPARA_02 | 3520 (-) | 0.823 | 0.714 | ggaggccacctTGGCCcga V$PPARA_02 | 3880 (-) | 1.000 | 0.745 | tacaagctcctTGACCctc V$PPARA_02 | 4977 (-) | 0.812 | 0.723 | cagcccctaccTGTCCgaa V$PPARA_02 | 5746 (-) | 0.856 | 0.733 | atagacgcaaaGGACCtgt V$PPARA_02 | 6022 (-) | 0.856 | 0.720 | tttcaactctaGGACCcat V$PPARA_02 | 6434 (+) | 1.000 | 0.772 | atcGGTCAacccagtcatg V$PPARA_02 | 6754 (-) | 0.786 | 0.735 | actatccctttATACCatg V$PPARA_02 | 6802 (-) | 0.940 | 0.791 | tcctcccctcaTTACCttc V$PPARA_02 | 7188 (+) | 0.823 | 0.831 | aacGGCCAcgggggtgcta V$PPARA_02 | 8200 (-) | 1.000 | 0.842 | tgtttcccctcTGACCtca V$PPARA_02 | 8299 (-) | 0.823 | 0.736 | gcatgcccctcTGCCCtgc V$PPARA_02 | 8673 (+) | 0.950 | 0.858 | accGGTTAggaaggtaatt V$PPARA_02 | 9038 (+) | 0.823 | 0.782 | caaGGGCAgcctggtctac V$PPARA_02 | 9067 (+) | 0.773 | 0.714 | ccaGGCTAgccagggcagc V$PPARA_02 | 9117 (-) | 0.796 | 0.766 | aaaaaccgggaAAACCttc V$PPARA_02 | 9177 (-) | 0.846 | 0.712 | gaacgccacagAGACCact V$PPARA_02 | 9636 (+) | 1.000 | 0.783 | caaGGTCAgcttgggctac V$PPARA_02 | 9687 (+) | 0.810 | 0.826 | gggGGTGGtggtggtggtg V$PPARA_02 | 9690 (+) | 0.810 | 0.818 | ggtGGTGGtggtggtggtg V$PPARA_02 | 9693 (+) | 0.810 | 0.818 | ggtGGTGGtggtggtggtg V$PPARA_02 | 9696 (+) | 0.810 | 0.818 | ggtGGTGGtggtggtggtg V$PPARA_02 | 9699 (+) | 0.810 | 0.790 | ggtGGTGGtggtggtgtaa V$PPARA_02 | 
9968 (+) | 0.805 | 0.713 | tctGGTTCctatggggatc V$PPARA_02 | 9998 (+) | 0.856 | 0.791 | ttaGGTCCtctgggactgg V$PPARA_02 | 10242 (+) | 1.000 | 0.735 | ggtGGTCAggcttgtgcag V$PPARA_02 | 11313 (+) | 0.945 | 0.746 | tacGGTGAcgacgatgatg V$PPARA_02 | 11618 (-) | 0.856 | 0.783 | gggtaccagaaGGACCctc V$PPARA_02 | 11695 (-) | 0.945 | 0.732 | gttaatcactgTCACCcca V$PPARA_02 | 11705 (-) | 0.823 | 0.742 | gtcaccccaacTGCCCtcc V$PPARA_02 | 11808 (+) | 0.810 | 0.714 | gaaGGTGGcccagggttgg V$PPARA_02 | 11850 (+) | 0.773 | 0.824 | gtgGGGTAtgggggtaggt V$PPARA_02 | 11859 (+) | 0.805 | 0.727 | gggGGTAGgtccgggggag V$PPARG_03 | 3438 (+) | 0.800 | 0.817 | acttaggtaACAGTtaa V$PPARG_03 | 5453 (-) | 1.000 | 0.904 | taaCCTTTtccccacat V$PPARG_03 | 5762 (+) | 1.000 | 0.802 | tgtagggtaAAAGGaga V$PPARG_03 | 8204 (-) | 0.900 | 0.797 | tccCCTCTgacctcagt V$PPARG_03 | 8578 (+) | 1.000 | 0.837 | aaatggggcAAAGGcaa V$PPARG_03 | 11516 (-) | 0.840 | 0.825 | ttaCCGTTgtccgtagc V$NERF_Q2 | 570 (-) | 0.800 | 0.801 | ataaacgtctACCTGttg V$NERF_Q2 | 603 (-) | 1.000 | 0.770 | tttttaaacaTCCTGtcg V$NERF_Q2 | 735 (-) | 0.800 | 0.724 | accacgtggtTGCTGgga V$NERF_Q2 | 766 (+) | 0.800 | 0.785 | cctCTGGAagagtagtcg V$NERF_Q2 | 1189 (+) | 1.000 | 0.775 | cccCAGGAaagctggtta V$NERF_Q2 | 1723 (-) | 1.000 | 0.794 | tccgggctcaTCCTGaac V$NERF_Q2 | 1732 (-) | 0.800 | 0.780 | atcctgaactTCCTAtag V$NERF_Q2 | 1770 (-) | 1.000 | 0.775 | tcgcctgtccTCCTGgag V$NERF_Q2 | 2140 (-) | 0.800 | 0.809 | ctctctctctTCCTTcct V$NERF_Q2 | 2144 (-) | 0.800 | 0.814 | ctctcttcctTCCTTcct V$NERF_Q2 | 2148 (-) | 0.800 | 0.812 | cttccttcctTCCTCttc V$NERF_Q2 | 2154 (-) | 0.800 | 0.790 | tccttcctctTCCTTtcc V$NERF_Q2 | 2198 (-) | 0.800 | 0.799 | ctttctttctTCCTTtct V$NERF_Q2 | 2679 (-) | 1.000 | 0.816 | acccgttgcaTCCTGgtc V$NERF_Q2 | 2700 (-) | 0.800 | 0.801 | ctcttctcctTCCTCttc V$NERF_Q2 | 2778 (-) | 0.800 | 0.827 | gtgagattctTCCTTagc V$NERF_Q2 | 2797 (-) | 0.800 | 0.782 | ggtcttctctTACTGaac V$NERF_Q2 | 3318 (-) | 1.000 | 0.795 | attggcctttTCCTGatt V$NERF_Q2 | 
3335 (-) | 1.000 | 0.808 | tctatttcccTCCTGttc V$NERF_Q2 | 4173 (+) | 1.000 | 0.773 | attCAGGAggctgagcag V$NERF_Q2 | 4351 (+) | 1.000 | 0.762 | aggCAGGAggatcagaag V$NERF_Q2 | 4585 (-) | 1.000 | 0.787 | agtctcctccTCCTGccc V$NERF_Q2 | 4596 (-) | 0.800 | 0.828 | cctgcccgctTGCTGgaa V$NERF_Q2 | 4605 (+) | 0.800 | 0.762 | ttgCTGGAaggtaaaacc V$NERF_Q2 | 4686 (+) | 1.000 | 0.800 | tggCAGGAaacttgatgt V$NERF_Q2 | 4945 (-) | 0.800 | 0.784 | aactgggactTTCTGcag V$NERF_Q2 | 4960 (+) | 1.000 | 0.933 | cagCAGGAagtcaggctc V$NERF_Q2 | 4975 (-) | 0.800 | 0.847 | ctcagcccctACCTGtcc V$NERF_Q2 | 5057 (+) | 0.800 | 0.814 | aagCAGGTagtgagagat V$NERF_Q2 | 5189 (+) | 1.000 | 0.740 | tagCAGGAtttgtctgtg V$NERF_Q2 | 5260 (-) | 1.000 | 0.803 | gtcacagtcaTCCTGggc V$NERF_Q2 | 5466 (-) | 1.000 | 0.806 | acatactattTCCTGatt V$NERF_Q2 | 5538 (-) | 0.800 | 0.827 | ttcactctctTTCTGtct V$NERF_Q2 | 5729 (+) | 0.800 | 0.753 | aaaCTGGAagccataata V$NERF_Q2 | 6060 (-) | 0.800 | 0.718 | gccctgcacgTGCTGcca V$NERF_Q2 | 6082 (-) | 1.000 | 0.772 | ctctgtgagtTCCTGtgt V$NERF_Q2 | 6273 (+) | 0.800 | 0.808 | aagAAGGAagcctctctg V$NERF_Q2 | 6454 (-) | 1.000 | 0.865 | caaacttattTCCTGata V$NERF_Q2 | 6476 (+) | 1.000 | 0.792 | gctCAGGAtgaggactgg V$NERF_Q2 | 6519 (+) | 0.800 | 0.772 | gaaCACGAagttcaccct V$NERF_Q2 | 6792 (-) | 0.800 | 0.789 | tgctggctctTCCTCccc V$NERF_Q2 | 6875 (+) | 0.800 | 0.813 | ttaTAGGAagcaccacgt V$NERF_Q2 | 7012 (-) | 1.000 | 0.808 | ggcggatgttTCCTGcat V$NERF_Q2 | 7149 (-) | 1.000 | 0.718 | gcgccgtatcTCCTGgag V$NERF_Q2 | 7228 (+) | 1.000 | 0.928 | cagCAGGAagttcatcgt V$NERF_Q2 | 7245 (+) | 1.000 | 0.789 | tctCAGGAaaaagtctca V$NERF_Q2 | 7364 (+) | 0.800 | 0.792 | gggCAGAAagaattgata V$NERF_Q2 | 7463 (+) | 1.000 | 0.735 | ctgCAGGAtttgagtgtg V$NERF_Q2 | 7490 (-) | 0.800 | 0.783 | cgtctgttctTCTTGgag V$NERF_Q2 | 7663 (+) | 0.800 | 0.723 | tgcCAGGCatggtggctc V$NERF_Q2 | 8028 (-) | 0.800 | 0.748 | tgtcagagctTGCTGggc V$NERF_Q2 | 8075 (-) | 0.800 | 0.794 | tgcagaaactTCCTAtac V$NERF_Q2 | 8116 (-) | 0.800 | 0.821 | 
ctcccaccctGCCTGgtc V$NERF_Q2 | 8128 (-) | 0.800 | 0.820 | ctggtctgctTTCTGctg V$NERF_Q2 | 8152 (-) | 0.800 | 0.791 | aacattagctTCCAGagg V$NERF_Q2 | 8167 (+) | 0.800 | 0.803 | aggCAGGTagcaccagaa V$NERF_Q2 | 8187 (-) | 0.800 | 0.759 | tgtttgctctGCCTGttt V$NERF_Q2 | 8441 (-) | 0.800 | 0.727 | gcaacccgcaTCCTAagc V$NERF_Q2 | 8676 (+) | 0.800 | 0.802 | ggtTAGGAaggtaattag V$NERF_Q2 | 8687 (-) | 0.800 | 0.762 | taattagcctTTCTGaaa V$NERF_Q2 | 8909 (+) | 1.000 | 0.758 | attCAGGAatactcggaa V$NERF_Q2 | 9015 (+) | 0.800 | 0.798 | gggCAGGCagatctttgt V$NERF_Q2 | 9305 (+) | 1.000 | 0.809 | gccCAGGAgggaggcgtt V$NERF_Q2 | 9372 (+) | 0.800 | 0.785 | tcaCAGGGagagctgtca V$NERF_Q2 | 9433 (+) | 1.000 | 0.898 | taaCAGGAagcccaaggt V$NERF_Q2 | 9466 (+) | 1.000 | 0.764 | gtaCAGGAtgttctagct V$NERF_Q2 | 9847 (+) | 0.800 | 0.777 | agtCAGGCaggcagcatt V$NERF_Q2 | 9851 (+) | 0.800 | 0.800 | aggCAGGCagcattaaag V$NERF_Q2 | 9903 (-) | 0.800 | 0.838 | aagacttactTACTGgat V$NERF_Q2 | 10072 (+) | 0.800 | 0.785 | cttCTGGAagggcagcca V$NERF_Q2 | 10255 (+) | 1.000 | 0.953 | gtgCAGGAagtgattttt V$NERF_Q2 | 10636 (+) | 0.800 | 0.764 | actGAGGAagacacccag V$NERF_Q2 | 10665 (+) | 1.000 | 0.778 | caaCAGGAggatccacag V$NERF_Q2 | 10842 (+) | 1.000 | 0.921 | tcgCAGGAagaagccatt V$NERF_Q2 | 11903 (+) | 0.800 | 0.791 | tagCACGAaggcttcggg V$RP58_01 | 4118 (-) | 1.000 | 0.974 | cccAGATGttaa V$HTF_01 | 427 (+) | 1.000 | 0.855 | ccaaaatacCACGTttaatcaggt V$HTF_01 | 5996 (-) | 1.000 | 0.858 | gggatggggtACGTGtccaactct V$HTF_01 | 6879 (+) | 1.000 | 0.901 | aggaagcacCACGTaagacaagag V$HTF_01 | 7479 (+) | 1.000 | 0.866 | tgctacgcgCACGTctgttcttct V$ARNT_02 | 13 (+) | 0.800 | 0.798 | acaagtaCATGTgaatgtgt V$ARNT_02 | 13 (-) | 0.800 | 0.798 | acaagtacATGTGaatgtgt V$ARNT_02 | 282 (+) | 0.800 | 0.811 | tgcaagtCAAGTgttttaca V$ARNT_02 | 282 (-) | 0.800 | 0.811 | tgcaagtcAAGTGttttaca V$ARNT_02 | 730 (+) | 1.000 | 0.918 | gagccacCACGTggttgctg V$ARNT_02 | 730 (-) | 1.000 | 0.918 | gagccaccACGTGgttgctg V$ARNT_02 | 993 (+) | 0.800 | 0.791 | 
taattttCATGTggggacgc V$ARNT_02 | 993 (-) | 0.800 | 0.791 | taattttcATGTGgggacgc V$ARNT_02 | 1121 (+) | 1.000 | 0.919 | gtgtatgCACGTgtgttcct V$ARNT_02 | 1121 (-) | 1.000 | 0.919 | gtgtatgcACGTGtgttcct V$ARNT_02 | 1353 (+) | 0.800 | 0.846 | cctgggtCACCTgatgctcg V$ARNT_02 | 1353 (-) | 0.800 | 0.846 | cctgggtcACCTGatgctcg V$ARNT_02 | 1387 (+) | 0.800 | 0.811 | ggtagggCAGGTgaggagag V$ARNT_02 | 1387 (-) | 0.800 | 0.811 | ggtagggcAGGTGaggagag V$ARNT_02 | 1611 (+) | 0.800 | 0.797 | agggggtCTCGTggacagca V$ARNT_02 | 1611 (-) | 0.800 | 0.797 | agggggtcTCGTGgacagca V$ARNT_02 | 1781 (+) | 0.800 | 0.791 | cctggagCACCTgccattca V$ARNT_02 | 1781 (-) | 0.800 | 0.791 | cctggagcACCTGccattca V$ARNT_02 | 2768 (+) | 0.800 | 0.804 | gattccgCATGTgagattct V$ARNT_02 | 2768 (-) | 0.800 | 0.804 | gattccgcATGTGagattct V$ARNT_02 | 3933 (+) | 0.800 | 0.815 | tatcagtCACTTgttcattc V$ARNT_02 | 3933 (-) | 0.800 | 0.815 | tatcagtcACTTGttcattc V$ARNT_02 | 5016 (+) | 0.800 | 0.804 | attaagcCACGGgactggga V$ARNT_02 | 5016 (-) | 0.800 | 0.804 | attaagccACGGGactggga V$ARNT_02 | 5165 (+) | 0.800 | 0.800 | ggactttCACTTgtgactgt V$ARNT_02 | 5165 (-) | 0.800 | 0.800 | ggactttcACTTGtgactgt V$ARNT_02 | 5245 (+) | 0.800 | 0.795 | atgggttCACATgtggtcac V$ARNT_02 | 5245 (-) | 0.800 | 0.795 | atgggttcACATGtggtcac V$ARNT_02 | 5362 (+) | 0.800 | 0.808 | caaaacaCACGGgatttctt V$ARNT_02 | 5362 (-) | 0.800 | 0.808 | caaaacacACGGGatttctt V$ARNT_02 | 5998 (+) | 0.800 | 0.790 | gatggggTACGTgtccaact V$ARNT_02 | 5998 (-) | 1.000 | 0.790 | gatggggtACGTGtccaact V$ARNT_02 | 6059 (+) | 1.000 | 0.926 | agccctgCACGTgctgccac V$ARNT_02 | 6059 (-) | 1.000 | 0.926 | agccctgcACGTGctgccac V$ARNT_02 | 6881 (+) | 1.000 | 0.798 | gaagcacCACGTaagacaag V$ARNT_02 | 6881 (-) | 0.800 | 0.798 | gaagcaccACGTAagacaag V$ARNT_02 | 6938 (+) | 0.800 | 0.799 | cagtaccCAAGTgatactgt V$ARNT_02 | 6938 (-) | 0.800 | 0.799 | cagtacccAAGTGatactgt V$MYCMAX_03 | 730 (+) | 1.000 | 0.994 | gagccacCACGTggttgctg V$MYCMAX_03 | 730 (-) | 1.000 | 0.994 | 
gagccaccACGTGgttgctg V$AFP1_Q6 | 8057 (-) | 0.905 | 0.893 | gtctagTTTAT V$CEBPDELTA_Q6 | 490 (+) | 1.000 | 0.967 | cATTGCaacatt V$CEBPDELTA_Q6 | 9872 (-) | 0.986 | 0.966 | aatgagGAAATc V$CEBPDELTA_Q6 | 10133 (-) | 0.986 | 0.977 | agttaaGAAATt V$CEBPDELTA_Q6 | 10378 (-) | 1.000 | 0.957 | aatgttGCAATc V$CEBPGAMMA_Q6 | 37 (+) | 0.907 | 0.802 | acaACTTCagatg V$CEBPGAMMA_Q6 | 238 (-) | 0.907 | 0.853 | cttttTAAATctg V$CEBPGAMMA_Q6 | 467 (+) | 0.845 | 0.849 | cttATTTGacact V$CEBPGAMMA_Q6 | 499 (+) | 0.752 | 0.784 | attATTATatatt V$CEBPGAMMA_Q6 | 515 (-) | 0.691 | 0.795 | cttctGATTTtaa V$CEBPGAMMA_Q6 | 518 (+) | 0.845 | 0.920 | ctgATTTTaaaaa V$CEBPGAMMA_Q6 | 522 (-) | 0.845 | 0.809 | ttttaAAAATgat V$CEBPGAMMA_Q6 | 585 (+) | 0.752 | 0.800 | ttgAATTAacaga V$CEBPGAMMA_Q6 | 648 (+) | 0.907 | 0.780 | ttcATTTAttata V$CEBPGAMMA_Q6 | 652 (+) | 0.752 | 0.792 | tttATTATatata V$CEBPGAMMA_Q6 | 908 (-) | 0.721 | 0.791 | tttctTTCATgcg V$CEBPGAMMA_Q6 | 984 (+) | 0.752 | 0.784 | cacAGTTAataat V$CEBPGAMMA_Q6 | 1058 (+) | 0.752 | 0.794 | acaACTTTaaaat V$CEBPGAMMA_Q6 | 1061 (-) | 0.845 | 0.875 | actttAAAATaag V$CEBPGAMMA_Q6 | 1098 (+) | 0.845 | 0.809 | aaaATTTGagaaa V$CEBPGAMMA_Q6 | 1102 (-) | 1.000 | 0.810 | tttgaGAAATcgc V$CEBPGAMMA_Q6 | 1235 (-) | 0.907 | 0.890 | gctttGACATcag V$CEBPGAMMA_Q6 | 1254 (-) | 0.814 | 0.880 | attctGTCATcat V$CEBPGAMMA_Q6 | 1887 (+) | 0.845 | 0.814 | cccATTTTatatt V$CEBPGAMMA_Q6 | 1911 (+) | 0.845 | 0.848 | tttATTTGagaca V$CEBPGAMMA_Q6 | 2552 (-) | 0.660 | 0.820 | tttctGTTGTgaa V$CEBPGAMMA_Q6 | 2557 (-) | 1.000 | 0.847 | gttgtGAAATttc V$CEBPGAMMA_Q6 | 3613 (-) | 0.845 | 0.836 | gttttCAAATgcg V$CEBPGAMMA_Q6 | 3819 (-) | 0.691 | 0.806 | gttttCAATTcat V$CEBPGAMMA_Q6 | 3822 (+) | 0.845 | 0.887 | ttcAATTCataat V$CEBPGAMMA_Q6 | 3919 (-) | 0.845 | 0.807 | tttatGAATTaca V$CEBPGAMMA_Q6 | 3992 (+) | 0.721 | 0.837 | ctcATGAAacaac V$CEBPGAMMA_Q6 | 4096 (+) | 0.691 | 0.831 | ctgATTCTaaaac V$CEBPGAMMA_Q6 | 4120 (+) | 0.752 | 0.786 | cagATGTTaaaac V$CEBPGAMMA_Q6 | 4157 (-) | 
0.907 | 0.784 | catttGTAATcct V$CEBPGAMMA_Q6 | 4214 (+) | 0.752 | 0.881 | ctcAGTTAaaaaa V$CEBPGAMMA_Q6 | 4367 (-) | 0.752 | 0.783 | agtttAAAGTtat V$CEBPGAMMA_Q6 | 4456 (+) | 1.000 | 0.913 | ttcATTTCacact V$CEBPGAMMA_Q6 | 5413 (-) | 0.907 | 0.807 | cttatTAAATctt V$CEBPGAMMA_Q6 | 5556 (+) | 0.660 | 0.822 | ctcACTAGaaaac V$CEBPGAMMA_Q6 | 5637 (+) | 0.660 | 0.794 | ataAAGACaaaat V$CEBPGAMMA_Q6 | 5783 (-) | 1.000 | 0.903 | aatatGAAATaat V$CEBPGAMMA_Q6 | 5791 (+) | 0.814 | 0.791 | ataATTAAataca V$CEBPGAMMA_Q6 | 5795 (+) | 0.752 | 0.847 | ttaAATACaaaaa V$CEBPGAMMA_Q6 | 5967 (-) | 0.814 | 0.827 | gttttTAAGTggt V$CEBPGAMMA_Q6 | 5985 (-) | 0.691 | 0.779 | cttctGGATTagg V$CEBPGAMMA_Q6 | 6447 (+) | 0.752 | 0.792 | gtcATGGCaaact V$CEBPGAMMA_Q6 | 6549 (+) | 0.660 | 0.786 | ctgAGGTAagagt V$CEBPGAMMA_Q6 | 7057 (-) | 0.752 | 0.794 | tttctTAATTtgt V$CEBPGAMMA_Q6 | 7203 (-) | 0.845 | 0.828 | gctatGAATTgaa V$CEBPGAMMA_Q6 | 7404 (+) | 0.660 | 0.791 | attACTGAagaac V$CEBPGAMMA_Q6 | 7581 (+) | 0.752 | 0.800 | atcAGGTCaaaca V$CEBPGAMMA_Q6 | 7640 (+) | 0.845 | 0.895 | attATTTTaaaaa V$CEBPGAMMA_Q6 | 7814 (+) | 0.752 | 0.780 | acaACTCCaaaac V$CEBPGAMMA_Q6 | 7900 (-) | 0.845 | 0.908 | attttGATATtag V$CEBPGAMMA_Q6 | 7947 (-) | 0.845 | 0.798 | gttttGAGATacc V$CEBPGAMMA_Q6 | 8070 (+) | 0.845 | 0.812 | ggtATTGCagaaa V$CEBPGAMMA_Q6 | 8207 (-) | 0.752 | 0.809 | cctctGACCTcag V$CEBPGAMMA_Q6 | 8219 (-) | 0.752 | 0.804 | gttctGATGTtgg V$CEBPGAMMA_Q6 | 8636 (+) | 0.907 | 0.928 | cttATTTAagaag V$CEBPGAMMA_Q6 | 8852 (-) | 0.845 | 0.785 | gtttaAAAATtaa V$CEBPGAMMA_Q6 | 8881 (-) | 0.691 | 0.829 | attctAGAATcaa V$CEBPGAMMA_Q6 | 8948 (-) | 0.845 | 0.908 | tttttAAAATtag V$CEBPGAMMA_Q6 | 9028 (-) | 0.691 | 0.814 | tttgtGAGTTcaa V$CEBPGAMMA_Q6 | 9137 (-) | 0.752 | 0.860 | gttttTGAATaag V$CEBPGAMMA_Q6 | 9488 (-) | 0.845 | 0.814 | tatttGCAATaga V$CEBPGAMMA_Q6 | 9770 (+) | 0.752 | 0.782 | gtcATGCCataga V$CEBPGAMMA_Q6 | 9788 (-) | 0.660 | 0.799 | tttgtTCAGTaaa V$CEBPGAMMA_Q6 | 9815 (+) | 0.907 | 0.861 | 
gtaACTTCaaatt V$CEBPGAMMA_Q6 | 9822 (+) | 0.845 | 0.812 | caaATTTTagaag V$CEBPGAMMA_Q6 | 9837 (+) | 0.845 | 0.839 | ataATTTTaaagt V$CEBPGAMMA_Q6 | 9840 (-) | 0.752 | 0.875 | attttAAAGTcag V$CEBPGAMMA_Q6 | 9859 (+) | 0.814 | 0.790 | agcATTAAagaag V$CEBPGAMMA_Q6 | 9881 (+) | 0.691 | 0.826 | atcAATTTaaaat V$CEBPGAMMA_Q6 | 9882 (+) | 0.907 | 0.832 | tcaATTTAaaatt V$CEBPGAMMA_Q6 | 10130 (+) | 0.752 | 0.850 | attAGTTAagaaa V$CEBPGAMMA_Q6 | 10134 (-) | 1.000 | 0.830 | gttaaGAAATtat V$CEBPGAMMA_Q6 | 10305 (-) | 0.752 | 0.782 | tttttTAACTgtt V$CEBPGAMMA_Q6 | 10326 (+) | 0.845 | 0.837 | tttATTTTatagt V$CEBPGAMMA_Q6 | 10345 (-) | 0.752 | 0.803 | tgtttTAATTcaa V$CEBPGAMMA_Q6 | 10348 (+) | 0.845 | 0.859 | tttAATTCaaagt V$CEBPGAMMA_Q6 | 10388 (+) | 0.752 | 0.820 | tccATATAaaaat V$CEBPGAMMA_Q6 | 10450 (-) | 0.752 | 0.814 | tctttTATATcaa V$CEBPGAMMA_Q6 | 10471 (+) | 0.752 | 0.810 | ctcACTTTagacc V$CEBPGAMMA_Q6 | 10593 (+) | 0.845 | 0.875 | cctATATCaaaaa V$CEBPGAMMA_Q6 | 11110 (+) | 0.907 | 0.821 | acgACTTCagatc V$CEBPGAMMA_Q6 | 11142 (-) | 0.752 | 0.786 | gttttGTACTctg V$CEBPGAMMA_Q6 | 11389 (-) | 0.907 | 0.795 | cttatTAAATttt V$CEBPGAMMA_Q6 | 11648 (-) | 0.752 | 0.782 | tttctGTACTgtt V$CRX_Q4 | 2023 (-) | 1.000 | 0.977 | ctggGATTAaagg V$CRX_Q4 | 2723 (+) | 1.000 | 0.943 | tagaTAATCttct V$CRX_Q4 | 3903 (-) | 0.948 | 0.933 | caggTATTAataa V$CRX_Q4 | 4027 (-) | 0.955 | 0.926 | atggAATTAatat V$CRX_Q4 | 4159 (+) | 1.000 | 0.957 | tttgTAATCctag V$CRX_Q4 | 4274 (+) | 0.955 | 0.932 | taccTAAGCacac V$CRX_Q4 | 4799 (+) | 1.000 | 0.961 | tgcaTAATCtctg V$CRX_Q4 | 6611 (+) | 0.948 | 0.935 | ttgcTAAACtccg V$CRX_Q4 | 7399 (-) | 0.955 | 0.928 | gtagAATTActga V$CRX_Q4 | 8989 (+) | 0.955 | 0.940 | ccttTAATTccag V$CRX_Q4 | 9589 (+) | 1.000 | 0.973 | cctgTAATCccag V$CRX_Q4 | 10999 (-) | 1.000 | 0.943 | atcaGATTAcctg V$DBP_Q6 | 149 (-) | 1.000 | 0.974 | atTTGCT V$DBP_Q6 | 1047 (+) | 1.000 | 0.979 | AGCAAaa V$DBP_Q6 | 1475 (-) | 1.000 | 0.974 | atTTGCT V$DBP_Q6 | 2621 (-) | 0.985 | 0.987 | gtATGCT 
V$DBP_Q6 | 3208 (-) | 0.985 | 0.987 | gtATGCT V$DBP_Q6 | 4280 (+) | 0.985 | 0.987 | AGCACac V$DBP_Q6 | 4637 (+) | 1.000 | 0.979 | AGCAAaa V$DBP_Q6 | 4669 (-) | 1.000 | 0.979 | ttTTGCT V$DBP_Q6 | 4788 (-) | 1.000 | 0.968 | gaTTGCT V$DBP_Q6 | 5043 (-) | 1.000 | 0.968 | gaTTGCT V$DBP_Q6 | 5212 (-) | 0.985 | 0.966 | ttGTGCT V$DBP_Q6 | 6331 (+) | 0.985 | 0.987 | AGCACac V$DBP_Q6 | 6609 (-) | 1.000 | 0.974 | atTTGCT V$DBP_Q6 | 6831 (-) | 0.985 | 0.961 | atGTGCT V$DBP_Q6 | 7476 (-) | 0.985 | 0.987 | gtGTGCT V$DBP_Q6 | 8131 (-) | 0.965 | 0.969 | gtCTGCT V$DBP_Q6 | 8188 (-) | 1.000 | 1.000 | gtTTGCT V$DBP_Q6 | 8339 (-) | 1.000 | 0.974 | atTTGCT V$DBP_Q6 | 9000 (+) | 0.985 | 0.961 | AGCACat V$DBP_Q6 | 9108 (+) | 1.000 | 0.979 | AGCAAaa V$DBP_Q6 | 10281 (+) | 1.000 | 0.968 | AGCAAtc V$DBP_Q6 | 10574 (-) | 1.000 | 1.000 | gtTTGCT V$DBP_Q6 | 11530 (+) | 1.000 | 0.974 | AGCAAat V$EFC_Q6 | 206 (-) | 0.820 | 0.756 | catgcctgGTATTt V$EFC_Q6 | 477 (+) | 0.820 | 0.788 | aCTTACaagtcaac V$EFC_Q6 | 1653 (+) | 0.640 | 0.818 | aTCTACttggcaaa V$EFC_Q6 | 2621 (+) | 0.640 | 0.805 | gTATGCtaggcaaa V$EFC_Q6 | 3208 (+) | 0.640 | 0.755 | gTATGCtaggcaag V$EFC_Q6 | 3262 (-) | 0.820 | 0.774 | ttttcttgGTGATg V$EFC_Q6 | 3679 (-) | 0.730 | 0.754 | attgccaaGTGCCa V$EFC_Q6 | 4374 (+) | 0.719 | 0.803 | aGTTATctggctac V$EFC_Q6 | 4896 (+) | 0.820 | 0.790 | tGTTCCttggccaa V$EFC_Q6 | 5169 (-) | 0.910 | 0.788 | tttcacttGTGACt V$EFC_Q6 | 6940 (-) | 0.820 | 0.766 | gtacccaaGTGATa V$EFC_Q6 | 8063 (-) | 0.820 | 0.756 | tttatctgGTATTg V$EFC_Q6 | 10433 (+) | 0.640 | 0.792 | cTTCGCaatgcaac V$EFC_Q6 | 10434 (-) | 0.910 | 0.825 | ttcgcaatGCAACg V$EFC_Q6 | 10905 (+) | 0.820 | 0.892 | cGCTACtatgcaaa V$EFC_Q6 | 11184 (-) | 0.730 | 0.752 | gctgccttGTTTCt V$EFC_Q6 | 11482 (-) | 0.730 | 0.828 | tttcccttGGAATg I$EVE_Q6 | 1227 (+) | 0.982 | 0.921 | tgatgACTGC I$EVE_Q6 | 1244 (+) | 0.940 | 0.942 | tcaggGCAGC I$EVE_Q6 | 4182 (-) | 0.958 | 0.954 | GCTGAgcaga I$EVE_Q6 | 5918 (+) | 0.952 | 0.922 | tttccCCAGC I$EVE_Q6 | 6057 (+) | 
0.994 | 0.930 | acagcCCTGC I$EVE_Q6 | 6548 (-) | 0.958 | 0.944 | GCTGAggtaa I$EVE_Q6 | 8189 (+) | 1.000 | 0.977 | tttgcTCTGC I$EVE_Q6 | 8308 (+) | 0.994 | 0.979 | tctgcCCTGC I$EVE_Q6 | 9091 (+) | 1.000 | 0.939 | gaagcTCTGC I$EVE_Q6 | 10882 (+) | 0.982 | 0.929 | ggagcACTGC I$EVE_Q6 | 11954 (+) | 0.982 | 0.970 | tcaggACTGC V$FOXM1_01 | 484 (-) | 1.000 | 0.882 | AGTCAacat V$FOXM1_01 | 1652 (-) | 0.957 | 0.908 | AATCTactt V$FOXM1_01 | 2586 (+) | 0.827 | 0.885 | aaatGCAGT V$FOXM1_01 | 2793 (+) | 0.827 | 0.879 | agctGGTCT V$FOXM1_01 | 2892 (-) | 0.827 | 0.909 | ACTGCatct V$FOXM1_01 | 3087 (-) | 0.935 | 0.942 | ATTCTattt V$FOXM1_01 | 3333 (-) | 0.935 | 0.942 | ATTCTattt V$FOXM1_01 | 3755 (-) | 1.000 | 0.909 | AGTCAttct V$FOXM1_01 | 3985 (-) | 0.827 | 0.909 | ACACCatct V$FOXM1_01 | 4890 (-) | 0.957 | 0.904 | ATTCCatgt V$FOXM1_01 | 5535 (+) | 0.827 | 0.909 | agatTCACT V$FOXM1_01 | 5852 (+) | 1.000 | 0.882 | atgtTGAGT V$FOXM1_01 | 6413 (+) | 1.000 | 0.909 | agaaTGACT V$FOXM1_01 | 7314 (+) | 0.935 | 0.893 | atatAGAAT V$FOXM1_01 | 9088 (-) | 0.827 | 0.879 | AGTGAagct V$FOXM1_01 | 9296 (+) | 0.827 | 0.879 | agctTGTGT V$FOXM1_01 | 9880 (-) | 0.978 | 0.964 | AATCAattt V$FOXM1_01 | 10386 (-) | 0.978 | 0.916 | AATCCatat V$FOXM1_01 | 10464 (-) | 0.805 | 0.898 | AATGCatct V$FOXM1_01 | 10602 (+) | 1.000 | 0.885 | aaaaGGAGT V$FOXM1_01 | 10681 (+) | 1.000 | 0.879 | agcaGGACT V$FOXM1_01 | 11107 (+) | 0.957 | 0.887 | agaaCGACT V$FOXM1_01 | 11238 (+) | 1.000 | 0.946 | aactGGAGT V$FOXM1_01 | 11368 (-) | 0.935 | 0.966 | ATTCTatct V$FOXM1_01 | 11662 (-) | 0.827 | 0.885 | ACACAattt V$FOXM1_01 | 11755 (+) | 0.978 | 0.898 | agaaAGAGT V$FXR_Q3 | 439 (-) | 0.871 | 0.842 | GTTTAatcaggttg V$FXR_Q3 | 842 (+) | 0.742 | 0.827 | caagttaaaTAAAT V$FXR_Q3 | 3412 (-) | 0.871 | 0.805 | AGTTAatgattctg V$FXR_Q3 | 3725 (-) | 0.897 | 0.795 | GGTGActaaaatta V$FXR_Q3 | 3882 (+) | 0.948 | 0.807 | caagctcctTGACC V$FXR_Q3 | 4463 (+) | 0.948 | 0.799 | cacactgacTGACC V$FXR_Q3 | 4714 (-) | 0.897 | 0.833 | GGTAAttcactgtg 
V$FXR_Q3 | 8615 (-) | 0.787 | 0.836 | CGTCAatcactttg V$FXR_Q3 | 8682 (+) | 0.713 | 0.820 | gaaggtaatTAGCC V$FXR_Q3 | 8685 (-) | 0.897 | 0.816 | GGTAAttagccttt V$FXR_Q3 | 10034 (-) | 0.735 | 0.822 | TGTGAgtcaccgtg V$FXR_Q3 | 11694 (-) | 0.871 | 0.804 | AGTTAatcactgtc V$GATA4_Q3 | 151 (-) | 0.845 | 0.841 | ttgctcaTCTCT V$GATA4_Q3 | 208 (-) | 0.907 | 0.918 | tgcctggTATTT V$GATA4_Q3 | 1040 (+) | 0.791 | 0.829 | ACAAAgcagcaa V$GATA4_Q3 | 1099 (+) | 0.814 | 0.842 | AAATTtgagaaa V$GATA4_Q3 | 1111 (-) | 0.791 | 0.854 | tcgctgaTTTGT V$GATA4_Q3 | 1882 (-) | 0.791 | 0.856 | tgcctccCATTT V$GATA4_Q3 | 2048 (-) | 0.791 | 0.842 | tacctggCTTCT V$GATA4_Q3 | 2052 (-) | 0.845 | 0.865 | tggcttcTCTCT V$GATA4_Q3 | 2184 (-) | 0.845 | 0.873 | tttcttcTCTCT V$GATA4_Q3 | 2271 (-) | 0.791 | 0.852 | tttctttCTTCT V$GATA4_Q3 | 2275 (-) | 0.907 | 0.904 | tttcttcTTTCT V$GATA4_Q3 | 2505 (+) | 0.814 | 0.906 | AAAAAaaaggga V$GATA4_Q3 | 2506 (+) | 0.814 | 0.834 | AAAAAaagggaa V$GATA4_Q3 | 2749 (-) | 0.845 | 0.879 | tttctttTCTCT V$GATA4_Q3 | 3183 (-) | 0.845 | 0.854 | tccccccTCTCT V$GATA4_Q3 | 3781 (-) | 0.814 | 0.838 | tccccccTTTTT V$GATA4_Q3 | 4150 (-) | 0.791 | 0.856 | tgcctccCATTT V$GATA4_Q3 | 4720 (-) | 0.814 | 0.881 | tcactgtGATTT V$GATA4_Q3 | 4806 (-) | 0.791 | 0.859 | tctctggCATTT V$GATA4_Q3 | 4930 (+) | 0.907 | 0.866 | AGATCatggaga V$GATA4_Q3 | 5045 (-) | 0.752 | 0.815 | ttgctgtGGTCT V$GATA4_Q3 | 5178 (-) | 0.814 | 0.864 | tgactgtGTTCT V$GATA4_Q3 | 5393 (-) | 0.907 | 0.855 | ttaccttTATAT V$GATA4_Q3 | 5475 (-) | 0.814 | 0.875 | ttcctgaTTTTT V$GATA4_Q3 | 5523 (-) | 0.769 | 0.837 | ttcctccTATCC V$GATA4_Q3 | 5650 (+) | 0.814 | 0.857 | AAAAAgaagaaa V$GATA4_Q3 | 5834 (+) | 0.715 | 0.821 | TGAAAcaaggca V$GATA4_Q3 | 6707 (+) | 0.845 | 0.871 | AGAGAaaagcca V$GATA4_Q3 | 6819 (-) | 0.814 | 0.827 | tctccctTTTTT V$GATA4_Q3 | 6821 (-) | 0.814 | 0.906 | tccctttTTTAT V$GATA4_Q3 | 7046 (-) | 0.814 | 0.841 | tgactatTTTTT V$GATA4_Q3 | 8377 (-) | 0.907 | 0.820 | gctctttGATCT V$GATA4_Q3 | 9023 (+) | 0.907 | 0.821 
| AGATCtttgtga V$GATA4_Q3 | 9116 (+) | 0.814 | 0.828 | AAAAAaccggga V$GATA4_Q3 | 9117 (+) | 0.814 | 0.815 | AAAAAccgggaa V$GATA4_Q3 | 9430 (+) | 0.907 | 0.928 | AAATAacaggaa V$GATA4_Q3 | 10448 (-) | 0.907 | 0.936 | tctctttTATAT V$GATA4_Q3 | 11186 (-) | 0.907 | 0.918 | tgccttgTTTCT V$GATA4_Q3 | 11577 (-) | 0.769 | 0.818 | ttgcttaTATCC V$GATA4_Q3 | 11731 (-) | 0.884 | 0.817 | tctcaaaCATCT F$GBF_Q6 | 103 (+) | 1.000 | 0.914 | ctgGGGGTg F$GBF_Q6 | 971 (-) | 1.000 | 0.947 | gACCCCtaa F$GBF_Q6 | 1186 (-) | 1.000 | 0.914 | cACCCCcag F$GBF_Q6 | 11857 (+) | 1.000 | 0.914 | atgGGGGTa V$GCM_Q2 | 6803 (+) | 0.938 | 0.937 | cctcCCCTCatt V$GCM_Q2 | 8441 (+) | 1.000 | 0.952 | gcaaCCCGCatc V$GCM_Q2 | 11856 (-) | 0.922 | 0.941 | tatGGGGGtagg P$GT1_Q6 | 65 (-) | 0.888 | 0.923 | taATTAC P$GT1_Q6 | 82 (+) | 0.932 | 0.953 | GTTGAta P$GT1_Q6 | 551 (-) | 0.984 | 0.959 | ttTTAAC P$GT1_Q6 | 605 (-) | 0.932 | 0.924 | ttTAAAC P$GT1_Q6 | 636 (-) | 0.932 | 0.924 | ttTAAAC P$GT1_Q6 | 901 (-) | 1.000 | 0.971 | ttTTCAC P$GT1_Q6 | 962 (-) | 1.000 | 1.000 | taTTTAC P$GT1_Q6 | 988 (+) | 0.984 | 0.989 | GTTAAta P$GT1_Q6 | 1124 (-) | 0.922 | 0.946 | taTGCAC P$GT1_Q6 | 2576 (-) | 1.000 | 1.000 | taTTTAC P$GT1_Q6 | 2905 (-) | 0.948 | 0.964 | taTCCAC P$GT1_Q6 | 3450 (+) | 0.984 | 0.989 | GTTAAta P$GT1_Q6 | 3625 (+) | 0.932 | 0.953 | GTTTAta P$GT1_Q6 | 3717 (-) | 1.000 | 1.000 | taTTTAC P$GT1_Q6 | 3742 (-) | 0.948 | 0.935 | ttTCTAC P$GT1_Q6 | 3875 (+) | 0.984 | 0.989 | GTTAAta P$GT1_Q6 | 4125 (+) | 0.984 | 0.959 | GTTAAaa P$GT1_Q6 | 4218 (+) | 0.984 | 0.959 | GTTAAaa P$GT1_Q6 | 4368 (+) | 0.932 | 0.924 | GTTTAaa P$GT1_Q6 | 5697 (+) | 0.906 | 0.935 | GTTCAta P$GT1_Q6 | 5745 (-) | 0.928 | 0.950 | taTAGAC P$GT1_Q6 | 6022 (-) | 0.932 | 0.924 | ttTCAAC P$GT1_Q6 | 6762 (-) | 0.948 | 0.935 | ttTATAC P$GT1_Q6 | 6837 (-) | 1.000 | 1.000 | taTTTAC P$GT1_Q6 | 7346 (-) | 0.932 | 0.924 | ttTCAAC P$GT1_Q6 | 8494 (-) | 0.948 | 0.935 | ttTCTAC P$GT1_Q6 | 8686 (+) | 0.888 | 0.923 | GTAATta P$GT1_Q6 | 8852 (+) | 0.932 | 
0.924 | GTTTAaa P$GT1_Q6 | 8931 (+) | 0.948 | 0.935 | GTATAaa P$GT1_Q6 | 8932 (-) | 0.932 | 0.953 | taTAAAC P$GT1_Q6 | 9712 (+) | 0.948 | 0.935 | GTGTAaa P$GT1_Q6 | 9741 (+) | 0.902 | 0.932 | GTCCAta P$GT1_Q6 | 10020 (-) | 0.928 | 0.950 | taTAGAC P$GT1_Q6 | 10307 (-) | 0.984 | 0.959 | ttTTAAC P$GT1_Q6 | 10419 (-) | 0.984 | 0.959 | ttTTAAC P$GT1_Q6 | 10457 (-) | 0.932 | 0.953 | taTCAAC P$GT1_Q6 | 10552 (-) | 0.922 | 0.946 | taTGTAC P$GT1_Q6 | 10626 (+) | 0.948 | 0.935 | GTAGAaa P$GT1_Q6 | 11093 (-) | 0.948 | 0.935 | ttTCCAC P$GT1_Q6 | 11101 (+) | 0.932 | 0.924 | GTTTAaa P$GT1_Q6 | 11509 (+) | 0.948 | 0.964 | GTGGAta P$GT1_Q6 | 11637 (+) | 0.932 | 0.924 | GTTTAaa P$GT1_Q6 | 11697 (-) | 0.888 | 0.923 | taATCAC V$HNF4ALPHA_Q6 | 417 (+) | 0.779 | 0.832 | gtgagCTTGTcca V$HNF4ALPHA_Q6 | 856 (+) | 0.869 | 0.855 | acgatCGTTGccc V$HNF4ALPHA_Q6 | 948 (-) | 0.869 | 0.835 | gaaCAATGatcaa V$HNF4ALPHA_Q6 | 1735 (+) | 0.769 | 0.840 | ctgaaCTTCCtat V$HNF4ALPHA_Q6 | 1952 (+) | 0.898 | 0.808 | tggaaCTTTCctt V$HNF4ALPHA_Q6 | 2657 (+) | 1.000 | 0.816 | atgcaCTTTGatg V$HNF4ALPHA_Q6 | 2923 (+) | 0.829 | 0.815 | cggacCCTTAgct V$HNF4ALPHA_Q6 | 3319 (+) | 0.923 | 0.807 | ttggcCTTTTcct V$HNF4ALPHA_Q6 | 3566 (+) | 1.000 | 0.843 | gtggcCTTTGgca V$HNF4ALPHA_Q6 | 3768 (-) | 0.869 | 0.840 | aggCAATGctcct V$HNF4ALPHA_Q6 | 4873 (-) | 1.000 | 0.881 | cagCAAAGctctg V$HNF4ALPHA_Q6 | 5452 (+) | 0.923 | 0.829 | ataacCTTTTccc V$HNF4ALPHA_Q6 | 6351 (-) | 0.898 | 0.858 | cacGAAAGctcag V$HNF4ALPHA_Q6 | 6469 (-) | 0.793 | 0.838 | ataAAATGctcag V$HNF4ALPHA_Q6 | 6521 (-) | 0.872 | 0.855 | acaCGAAGttcac V$HNF4ALPHA_Q6 | 7121 (-) | 0.869 | 0.884 | gtgCAATGctcaa V$HNF4ALPHA_Q6 | 7230 (-) | 0.769 | 0.807 | gcaGGAAGttcat V$HNF4ALPHA_Q6 | 7411 (+) | 0.923 | 0.884 | aagaaCTTTTaaa V$HNF4ALPHA_Q6 | 7694 (+) | 0.856 | 0.846 | cagaaCTTGGgag V$HNF4ALPHA_Q6 | 7713 (-) | 0.816 | 0.818 | gcaGAAGGatcag V$HNF4ALPHA_Q6 | 7868 (-) | 0.923 | 0.866 | tagAAAAGctcac V$HNF4ALPHA_Q6 | 7908 (+) | 0.923 | 0.805 | attagCTTTTact 
V$HNF4ALPHA_Q6 | 8040 (+) | 0.918 | 0.815 | ctgggCCTTGact V$HNF4ALPHA_Q6 | 8375 (+) | 1.000 | 0.808 | ctgctCTTTGatc V$HNF4ALPHA_Q6 | 8432 (+) | 1.000 | 0.896 | tagagCTTTGcaa V$HNF4ALPHA_Q6 | 9022 (+) | 1.000 | 0.861 | cagatCTTTGtga V$HNF4ALPHA_Q6 | 9204 (-) | 0.918 | 0.811 | cacCAAGGctcct V$HNF4ALPHA_Q6 | 9632 (-) | 0.766 | 0.848 | agtTCAAGgtcag V$HNF4ALPHA_Q6 | 10491 (-) | 0.918 | 0.855 | tatCAAGGatcaa V$HNF4ALPHA_Q6 | 10798 (+) | 0.918 | 0.874 | gtgagCCTTGtgc V$HNF4ALPHA_Q6 | 11172 (+) | 0.923 | 0.825 | gagatCTTTTtgg V$HNF4ALPHA_Q6 | 11375 (+) | 1.000 | 0.830 | ctgtgCTTTGaaa V$HNF4ALPHA_Q6 | 11945 (-) | 0.918 | 0.916 | tttCAAGGgtcag V$HNF6_Q6 | 3432 (-) | 1.000 | 0.945 | taATTGActtag V$HNF6_Q6 | 9877 (+) | 1.000 | 0.961 | ggaaaTCAATtt V$HNF6_Q6 | 10502 (+) | 1.000 | 0.951 | aaaagTCAATta V$LBP1_Q6 | 4959 (-) | 0.955 | 0.962 | gcAGCAG V$LBP1_Q6 | 7621 (-) | 0.970 | 0.954 | ccAGCCG V$LBP1_Q6 | 8278 (-) | 1.000 | 1.000 | gcAGCTG V$LBP1_Q6 | 8279 (+) | 1.000 | 1.000 | CAGCTgc V$LBP1_Q6 | 8372 (+) | 0.955 | 0.962 | CTGCTgc V$LBP1_Q6 | 8422 (-) | 1.000 | 1.000 | gcAGCTG V$LFA1_Q6 | 1605 (+) | 0.986 | 0.990 | GGGCTcag V$LFA1_Q6 | 5486 (-) | 0.972 | 0.962 | ttgATCCC V$LFA1_Q6 | 9237 (-) | 0.986 | 0.972 | cagAGCCC V$LFA1_Q6 | 9981 (+) | 0.972 | 0.980 | GGGATcag V$LFA1_Q6 | 11117 (-) | 0.972 | 0.962 | cagATCCC V$MAZ_Q6 | 1489 (+) | 1.000 | 0.970 | aGGGAGgg V$MAZ_Q6 | 1500 (+) | 1.000 | 1.000 | gGGGAGgg V$MAZ_Q6 | 1504 (+) | 1.000 | 0.970 | aGGGAGgg V$MAZ_Q6 | 1508 (+) | 1.000 | 0.970 | aGGGAGgg V$MAZ_Q6 | 1512 (+) | 1.000 | 0.970 | aGGGAGgg V$MAZ_Q6 | 11872 (+) | 1.000 | 1.000 | gGGGAGgg P$OCSBF1_01 | 436 (+) | 1.000 | 1.000 | CACGT P$OCSBF1_01 | 737 (+) | 1.000 | 1.000 | CACGT P$OCSBF1_01 | 738 (-) | 1.000 | 1.000 | ACGTG P$OCSBF1_01 | 1128 (+) | 1.000 | 1.000 | CACGT P$OCSBF1_01 | 1129 (-) | 1.000 | 1.000 | ACGTG P$OCSBF1_01 | 6006 (-) | 1.000 | 1.000 | ACGTG P$OCSBF1_01 | 6066 (+) | 1.000 | 1.000 | CACGT P$OCSBF1_01 | 6067 (-) | 1.000 | 1.000 | ACGTG P$OCSBF1_01 | 6858 
(+) | 1.000 | 1.000 | CACGT P$OCSBF1_01 | 6888 (+) | 1.000 | 1.000 | CACGT P$OCSBF1_01 | 7488 (+) | 1.000 | 1.000 | CACGT V$PEA3_Q6 | 610 (+) | 1.000 | 1.000 | aCATCCt V$PEA3_Q6 | 1730 (+) | 1.000 | 0.934 | tCATCCt V$PEA3_Q6 | 1739 (+) | 1.000 | 1.000 | aCTTCCt V$PEA3_Q6 | 2147 (+) | 1.000 | 0.934 | tCTTCCt V$PEA3_Q6 | 2161 (+) | 1.000 | 0.934 | tCTTCCt V$PEA3_Q6 | 2205 (+) | 1.000 | 0.934 | tCTTCCt V$PEA3_Q6 | 2686 (+) | 1.000 | 0.934 | gCATCCt V$PEA3_Q6 | 2785 (+) | 1.000 | 0.934 | tCTTCCt V$PEA3_Q6 | 2989 (+) | 1.000 | 0.978 | aCTTCCg V$PEA3_Q6 | 3033 (+) | 1.000 | 0.978 | aCTTCCg V$PEA3_Q6 | 4266 (-) | 1.000 | 0.934 | aGGATGc V$PEA3_Q6 | 4964 (-) | 1.000 | 1.000 | aGGAAGt V$PEA3_Q6 | 5267 (+) | 1.000 | 0.934 | tCATCCt V$PEA3_Q6 | 6277 (-) | 1.000 | 0.934 | aGGAAGc V$PEA3_Q6 | 6480 (-) | 1.000 | 0.934 | aGGATGa V$PEA3_Q6 | 6799 (+) | 1.000 | 0.934 | tCTTCCt V$PEA3_Q6 | 6879 (-) | 1.000 | 0.934 | aGGAAGc V$PEA3_Q6 | 7014 (-) | 1.000 | 0.978 | cGGATGt V$PEA3_Q6 | 7232 (-) | 1.000 | 1.000 | aGGAAGt V$PEA3_Q6 | 8082 (+) | 1.000 | 1.000 | aCTTCCt V$PEA3_Q6 | 8448 (+) | 1.000 | 0.934 | gCATCCt V$PEA3_Q6 | 9437 (-) | 1.000 | 0.934 | aGGAAGc V$PEA3_Q6 | 9470 (-) | 1.000 | 1.000 | aGGATGt V$PEA3_Q6 | 10259 (-) | 1.000 | 1.000 | aGGAAGt V$PEA3_Q6 | 10640 (-) | 1.000 | 0.934 | aGGAAGa V$PEA3_Q6 | 10846 (-) | 1.000 | 0.934 | aGGAAGa V$PTF1BETA_Q6 | 8194 (-) | 1.000 | 0.930 | tctgcctgtTTCCC V$PTF1BETA_Q6 | 9504 (-) | 1.000 | 0.950 | tctgaaagtTTCCC V$PU1_Q6 | 2156 (-) | 1.000 | 1.000 | CTTCCtct V$PU1_Q6 | 2708 (-) | 1.000 | 1.000 | CTTCCtct V$PU1_Q6 | 2990 (-) | 1.000 | 0.963 | CTTCCgct V$PU1_Q6 | 3034 (-) | 1.000 | 0.963 | CTTCCgct V$PU1_Q6 | 3779 (-) | 1.000 | 0.931 | CTTCCccc V$PU1_Q6 | 6800 (-) | 1.000 | 0.962 | CTTCCtcc V$PU1_Q6 | 9435 (+) | 1.000 | 0.926 | acaGGAAG V$PU1_Q6 | 10207 (-) | 1.000 | 0.937 | CTTCCaca V$PU1_Q6 | 10638 (+) | 1.000 | 0.984 | tgaGGAAG P$RITA1_01 | 736 (+) | 1.000 | 0.997 | cCACGTg P$RITA1_01 | 737 (-) | 1.000 | 0.997 | cACGTGg P$RITA1_01 | 1128 
(-) | 1.000 | 0.984 | cACGTGt P$RITA1_01 | 6887 (+) | 1.000 | 0.987 | cCACGTa I$SGF3_Q6 | 10 (-) | 0.950 | 0.966 | tGTACAa I$SGF3_Q6 | 34 (-) | 0.976 | 0.984 | tGAACAa I$SGF3_Q6 | 275 (+) | 0.976 | 0.984 | tTGTGCa I$SGF3_Q6 | 423 (+) | 0.937 | 0.957 | tTGTCCa I$SGF3_Q6 | 645 (+) | 1.000 | 1.000 | tTATTCa I$SGF3_Q6 | 876 (-) | 0.976 | 0.984 | tGCACAa I$SGF3_Q6 | 1123 (+) | 1.000 | 0.945 | gTATGCa I$SGF3_Q6 | 2904 (+) | 0.961 | 0.973 | tTATCCa I$SGF3_Q6 | 3581 (+) | 1.000 | 1.000 | tTATTCa I$SGF3_Q6 | 3943 (+) | 0.976 | 0.984 | tTGTTCa I$SGF3_Q6 | 4171 (+) | 1.000 | 0.945 | gTATTCa I$SGF3_Q6 | 4315 (-) | 1.000 | 1.000 | tGCATAa I$SGF3_Q6 | 4490 (-) | 0.976 | 0.984 | tGAACAa I$SGF3_Q6 | 4780 (-) | 0.961 | 0.973 | tGGATAa I$SGF3_Q6 | 4799 (-) | 1.000 | 1.000 | tGCATAa I$SGF3_Q6 | 6576 (+) | 1.000 | 1.000 | tTATGCa I$SGF3_Q6 | 6771 (-) | 0.976 | 0.984 | tGAACAa I$SGF3_Q6 | 7212 (-) | 0.976 | 0.984 | tGAACAa I$SGF3_Q6 | 8556 (-) | 0.976 | 0.984 | tGAACAa I$SGF3_Q6 | 9142 (-) | 1.000 | 1.000 | tGAATAa I$SGF3_Q6 | 9789 (+) | 0.976 | 0.984 | tTGTTCa I$SGF3_Q6 | 10253 (+) | 0.976 | 0.984 | tTGTGCa I$SGF3_Q6 | 10805 (+) | 0.976 | 0.984 | tTGTGCa I$SGF3_Q6 | 11555 (+) | 1.000 | 0.945 | gTATTCa F$STE12_Q4 | 3995 (+) | 1.000 | 1.000 | aTGAAAc F$STE12_Q4 | 10708 (+) | 1.000 | 1.000 | aTGAAAc F$STE12_Q4 | 10745 (-) | 1.000 | 0.980 | gTTTCAg F$STE12_Q4 | 11936 (+) | 1.000 | 0.980 | cTGAAAc V$SP3_Q3 | 163 (-) | 0.728 | 0.867 | cCGTTCccaatgct V$SP3_Q3 | 1172 (+) | 0.906 | 0.800 | agaaaggaGATGGg V$SP3_Q3 | 1330 (-) | 0.765 | 0.880 | cACTCCccagggct V$SP3_Q3 | 1487 (+) | 0.981 | 0.810 | agagggagGGGGGg V$SP3_Q3 | 1489 (+) | 0.981 | 0.844 | agggagggGGGGGg V$SP3_Q3 | 1638 (-) | 0.775 | 0.797 | cCTTTCctagtgct V$SP3_Q3 | 1794 (-) | 0.728 | 0.801 | cCATTCacaggatt V$SP3_Q3 | 2341 (-) | 0.944 | 0.825 | cCCTACgctcttct V$SP3_Q3 | 3246 (-) | 0.925 | 0.891 | cCCAGCccacttat V$SP3_Q3 | 3778 (-) | 0.812 | 0.810 | cCTTCCcccctttt V$SP3_Q3 | 5020 (+) | 0.690 | 0.801 | agccacggGACTGg V$SP3_Q3 | 5427 (-) | 
1.000 | 0.836 | cCCTCCactttatt V$SP3_Q3 | 5462 (-) | 0.925 | 0.790 | cCCCACatactatt V$SP3_Q3 | 5492 (-) | 0.962 | 0.897 | cCCTTCcctctgct V$SP3_Q3 | 5898 (-) | 0.746 | 0.823 | cCGCCCctaagagt V$SP3_Q3 | 6294 (+) | 0.981 | 0.840 | atgactgaGCAGGg V$SP3_Q3 | 6960 (+) | 0.765 | 0.795 | attattctGGATGg V$SP3_Q3 | 9003 (+) | 0.981 | 0.803 | acatggagGCAGGg V$SP3_Q3 | 9674 (+) | 1.000 | 0.815 | aaagaagtGGAGGg V$SP3_Q3 | 10608 (+) | 1.000 | 0.840 | agtgtgtgGGAGGg V$SP3_Q3 | 10870 (+) | 0.944 | 0.862 | agcgtggtGGTGGg V$SP3_Q3 | 11718 (-) | 1.000 | 0.883 | cCCTCCaccatttt V$SP3_Q3 | 11819 (+) | 0.709 | 0.809 | agggttggGGTGAg V$SP3_Q3 | 11837 (+) | 0.981 | 0.875 | agcagagtGGGGGg V$SP3_Q3 | 11840 (+) | 0.746 | 0.808 | agagtgggGGGTGg V$SP3_Q3 | 11842 (+) | 0.925 | 0.839 | agtgggggGTGGGg I$SRYBETA_Q6 | 154 (+) | 0.905 | 0.948 | CTCATctct I$SRYBETA_Q6 | 5069 (-) | 0.905 | 0.948 | agagATGAG V$TCF4_Q5 | 1167 (-) | 1.000 | 0.975 | ttCAAAGa V$TCF4_Q5 | 2661 (+) | 1.000 | 0.975 | aCTTTGat V$TCF4_Q5 | 6741 (+) | 1.000 | 1.000 | cCTTTGaa V$TCF4_Q5 | 10353 (-) | 1.000 | 0.983 | ttCAAAGt V$TCF4_Q5 | 11379 (+) | 1.000 | 0.992 | gCTTTGaa V$TEF_Q6 | 633 (-) | 0.821 | 0.913 | TTTTTtaaacat V$TEF_Q6 | 1255 (-) | 0.836 | 0.877 | TTCTGtcatcat V$TEF_Q6 | 1899 (-) | 0.985 | 0.904 | TTATTttattat V$TEF_Q6 | 3739 (-) | 0.985 | 0.918 | TTATTtctacat V$TEF_Q6 | 4269 (+) | 0.836 | 0.901 | atgcttaCCTAA V$TEF_Q6 | 7270 (+) | 0.985 | 0.894 | atcctaaAATAA V$TEF_Q6 | 10454 (-) | 1.000 | 0.926 | TTATAtcaacaa V$TEF_Q6 | 11656 (+) | 0.877 | 0.861 | ctgttaaCACAA V$TEL2_Q6 | 2569 (-) | 0.800 | 0.824 | ccgTAAGTat V$TEL2_Q6 | 2987 (+) | 1.000 | 0.830 | ctACTTCcgc V$TEL2_Q6 | 4963 (-) | 1.000 | 0.937 | cagGAAGTca V$TEL2_Q6 | 5060 (-) | 0.800 | 0.830 | cagGTAGTga V$TEL2_Q6 | 5471 (+) | 0.800 | 0.877 | ctATTTCctg V$TEL2_Q6 | 6459 (+) | 0.800 | 0.877 | ttATTTCctg V$TEL2_Q6 | 7231 (-) | 1.000 | 0.925 | cagGAAGTtc V$TEL2_Q6 | 9908 (+) | 0.800 | 0.877 | ttACTTActg V$TEL2_Q6 | 10258 (-) | 1.000 | 0.953 | cagGAAGTga V$TEL2_Q6 | 
10845 (-) | 0.800 | 0.877 | cagGAAGAag P$WRKY_Q2 | 130 (+) | 1.000 | 0.857 | ctgctcAGTCA P$WRKY_Q2 | 1075 (+) | 0.973 | 0.787 | tgtgtaGGTCA P$WRKY_Q2 | 1084 (+) | 0.973 | 0.787 | cattttGGTCA P$WRKY_Q2 | 1252 (+) | 0.919 | 0.809 | gcattcTGTCA P$WRKY_Q2 | 1351 (+) | 0.973 | 0.798 | tgcctgGGTCA P$WRKY_Q2 | 3069 (+) | 0.973 | 0.780 | tagctaGGTCA P$WRKY_Q2 | 3436 (-) | 1.000 | 0.794 | TGACTtaggta P$WRKY_Q2 | 3727 (-) | 1.000 | 0.784 | TGACTaaaatt P$WRKY_Q2 | 3749 (+) | 1.000 | 0.857 | atttttAGTCA P$WRKY_Q2 | 3931 (+) | 1.000 | 0.827 | attatcAGTCA P$WRKY_Q2 | 3947 (+) | 0.900 | 0.872 | tcattcAGCCA P$WRKY_Q2 | 4468 (-) | 1.000 | 0.827 | TGACTgaccag P$WRKY_Q2 | 5359 (-) | 0.919 | 0.794 | TGACAaaacac P$WRKY_Q2 | 6295 (-) | 1.000 | 0.857 | TGACTgagcag P$WRKY_Q2 | 6324 (-) | 1.000 | 0.872 | TGACTgaagca P$WRKY_Q2 | 6431 (+) | 0.973 | 0.868 | ttcatcGGTCA P$WRKY_Q2 | 8320 (+) | 0.919 | 0.787 | cttctgCGTCA P$WRKY_Q2 | 8609 (+) | 0.919 | 0.882 | tcattcCGTCA P$WRKY_Q2 | 10032 (+) | 1.000 | 0.856 | tttgtgAGTCA P$WRKY_Q2 | 10634 (-) | 1.000 | 0.912 | TGACTgaggaa P$WRKY_Q2 | 11333 (-) | 0.919 | 0.783 | TGACGatgtaa F$XBP1_Q2 | 8051 (-) | 1.000 | 0.984 | CTCGAgg F$XBP1_Q2 | 8825 (-) | 1.000 | 0.984 | CTCGAtg V$XPF1_Q6 | 679 (-) | 1.000 | 0.934 | gctctTCAGA V$XPF1_Q6 | 1357 (-) | 0.958 | 0.945 | ggtcaCCTGA V$XPF1_Q6 | 3349 (-) | 1.000 | 0.990 | gttcaTCAGA V$XPF1_Q6 | 6111 (-) | 0.958 | 0.945 | gttgaCCAGA V$XPF1_Q6 | 8719 (+) | 1.000 | 1.000 | TCTGAagaac V$XPF1_Q6 | 9162 (-) | 0.842 | 0.918 | gttccTAAGA V$ALPHACP1_01 | 110 (+) | 0.800 | 0.791 | tggCCAGTgag V$ALPHACP1_01 | 1021 (-) | 1.000 | 0.761 | gtgATTGGcag V$ALPHACP1_01 | 1209 (-) | 0.800 | 0.746 | cccATTGCttg V$ALPHACP1_01 | 1426 (+) | 0.800 | 0.795 | cacGCAATcag V$ALPHACP1_01 | 1832 (+) | 0.800 | 0.739 | cacCCATTggg V$ALPHACP1_01 | 1834 (-) | 1.000 | 0.767 | cccATTGGggt V$ALPHACP1_01 | 2445 (-) | 0.800 | 0.791 | ctcAGTGGtag V$ALPHACP1_01 | 3479 (+) | 0.800 | 0.818 | ctgCCTATgag V$ALPHACP1_01 | 4527 (+) | 0.800 | 0.746 | caaCAAATgcg 
V$ALPHACP1_01 | 5011 (+) | 0.800 | 0.756 | tacCCATTaag V$ALPHACP1_01 | 5087 (-) | 0.800 | 0.774 | ctgACTGGttt V$ALPHACP1_01 | 5664 (+) | 0.800 | 0.756 | aacACAATaag V$ALPHACP1_01 | 6590 (+) | 0.800 | 0.763 | aaaCAAATaag V$ALPHACP1_01 | 9256 (-) | 0.800 | 0.763 | ctcATGGGaca V$ALPHACP1_01 | 9911 (-) | 0.800 | 0.763 | cttACTGGatt V$ALPHACP1_01 | 11177 (-) | 0.800 | 0.818 | cttTTTGGctg V$ALPHACP1_01 | 11743 (+) | 0.800 | 0.739 | cacCAAATgac V$ALPHACP1_01 | 11928 (-) | 1.000 | 0.887 | cccATTGGctg N$UNC86_Q2 | 3829 (+) | 0.907 | 0.920 | cataaTAAAT N$UNC86_Q2 | 5303 (+) | 0.969 | 0.936 | catccAAAAT V$AP3_Q6 | 1468 (+) | 0.978 | 0.943 | tCTCAAta V$AP3_Q6 | 1665 (-) | 0.927 | 0.942 | aaTGGAGa V$AP3_Q6 | 3744 (+) | 0.949 | 0.959 | tCTACAtt V$AP3_Q6 | 4294 (+) | 1.000 | 0.961 | cCTAAAtt V$AP3_Q6 | 7059 (+) | 0.934 | 0.947 | tCTTAAtt V$AP3_Q6 | 8859 (-) | 0.934 | 0.947 | aaTTAAGa V$AP3_Q6 | 9825 (-) | 1.000 | 0.931 | atTTTAGa V$AP3_Q6 | 10474 (-) | 1.000 | 0.941 | acTTTAGa V$AP3_Q6 | 10545 (+) | 1.000 | 0.941 | tCTAAAgt V$AP3_Q6 | 10812 (+) | 1.000 | 0.941 | tCTAAAtc V$ATF1_Q6 | 1239 (-) | 0.820 | 0.886 | tGACATcaggg V$ATF1_Q6 | 2963 (+) | 0.902 | 0.910 | cagtgACGGCa V$E12_Q6 | 1392 (+) | 1.000 | 0.985 | ggCAGGTgagg V$E12_Q6 | 1785 (-) | 1.000 | 0.983 | gagcACCTGcc V$E12_Q6 | 8504 (+) | 1.000 | 0.977 | gaCAGGTgcgg V$ETF_Q6 | 5898 (-) | 1.000 | 0.996 | CCGCCcc I$EN_Q6 | 3432 (-) | 1.000 | 0.987 | TAATTga I$EN_Q6 | 10507 (+) | 1.000 | 0.987 | tcAATTA V$HEB_Q6 | 8370 (+) | 0.976 | 0.984 | gCCTGCtg V$ICSBP_Q6 | 5120 (+) | 1.000 | 0.886 | aaaagGAAACag V$ICSBP_Q6 | 11158 (+) | 0.783 | 0.874 | aaaatGAAGCtg P$ROM_Q2 | 9614 (-) | 1.000 | 0.948 | tgAGGTGgg V$SMAD3_Q6 | 893 (-) | 1.000 | 0.902 | ggaCAGACt V$SMAD3_Q6 | 2081 (+) | 1.000 | 1.000 | tGTCTGtct V$SMAD3_Q6 | 2944 (+) | 1.000 | 0.945 | aGTCTGcct V$SMAD3_Q6 | 3232 (-) | 0.910 | 0.922 | agtCAGCCa V$SMAD3_Q6 | 3356 (-) | 1.000 | 0.954 | agaCAGACc V$SMAD3_Q6 | 5675 (-) | 1.000 | 1.000 | agaCAGACa V$SMAD3_Q6 | 6237 (+) | 1.000 | 
0.972 | tGTCTGgct V$SMAD3_Q6 | 8563 (+) | 0.847 | 0.896 | tGTCCGtct V$SMAD3_Q6 | 9847 (-) | 0.874 | 0.898 | agtCAGGCa V$SMAD3_Q6 | 9851 (-) | 0.874 | 0.898 | aggCAGGCa V$SMAD3_Q6 | 11182 (+) | 0.910 | 0.922 | tGGCTGcct P$SPF1_Q2 | 828 (-) | 1.000 | 0.949 | aatACTATat P$SPF1_Q2 | 5467 (-) | 1.000 | 0.949 | catACTATtt P$SPF1_Q2 | 7045 (-) | 1.000 | 0.945 | ttgACTATtt V$TEF1_Q6 | 410 (+) | 1.000 | 1.000 | GGAATg V$TEF1_Q6 | 1575 (+) | 1.000 | 1.000 | GGAATg V$TEF1_Q6 | 2848 (-) | 1.000 | 1.000 | cATTCC V$TEF1_Q6 | 3136 (-) | 1.000 | 1.000 | cATTCC V$TEF1_Q6 | 3163 (-) | 1.000 | 1.000 | cATTCC V$TEF1_Q6 | 3242 (-) | 1.000 | 1.000 | cATTCC V$TEF1_Q6 | 4889 (-) | 1.000 | 1.000 | cATTCC V$TEF1_Q6 | 8610 (-) | 1.000 | 1.000 | cATTCC V$TEF1_Q6 | 9396 (+) | 1.000 | 1.000 | GGAATg V$TEF1_Q6 | 11490 (+) | 1.000 | 1.000 | GGAATg V$TFIII_Q6 | 1316 (-) | 0.873 | 0.913 | CCTAActcc V$TFIII_Q6 | 1384 (+) | 0.973 | 0.965 | cgagGTAGG V$TFIII_Q6 | 1487 (+) | 1.000 | 1.000 | agagGGAGG V$TFIII_Q6 | 1498 (+) | 1.000 | 0.932 | ggggGGAGG V$TFIII_Q6 | 1502 (+) | 1.000 | 1.000 | ggagGGAGG V$TFIII_Q6 | 1506 (+) | 1.000 | 1.000 | ggagGGAGG V$TFIII_Q6 | 1510 (+) | 1.000 | 1.000 | ggagGGAGG V$TFIII_Q6 | 1514 (+) | 1.000 | 1.000 | ggagGGAGG V$TFIII_Q6 | 1675 (-) | 1.000 | 0.949 | CCTCCttcg V$TFIII_Q6 | 2155 (-) | 0.919 | 0.945 | CCTTCctct V$TFIII_Q6 | 2707 (-) | 0.919 | 0.945 | CCTTCctct V$TFIII_Q6 | 4349 (+) | 0.919 | 0.945 | ggagGCAGG V$TFIII_Q6 | 4860 (+) | 1.000 | 0.991 | tgagGGAGG V$TFIII_Q6 | 6159 (-) | 0.919 | 0.945 | CCTGCctcc V$TFIII_Q6 | 7023 (-) | 0.919 | 0.914 | CCTGCatca V$TFIII_Q6 | 8165 (+) | 0.919 | 0.945 | agagGCAGG V$TFIII_Q6 | 8243 (+) | 0.919 | 0.923 | agatGAAGG V$TFIII_Q6 | 9007 (+) | 0.919 | 0.945 | ggagGCAGG V$TFIII_Q6 | 9310 (+) | 1.000 | 1.000 | ggagGGAGG V$TFIII_Q6 | 11859 (+) | 0.973 | 0.913 | ggggGTAGG V$TFIII_Q6 | 11870 (+) | 1.000 | 0.915 | cgggGGAGG V$TFIIA_Q6 | 3513 (+) | 0.961 | 0.889 | tgttaGAGGAgg V$TFIIA_Q6 | 5932 (-) | 0.961 | 0.901 | ctCCCTTggaga I$ZEN_Q6 | 23 
(-) | 1.000 | 0.960 | gtgAATGT I$ZEN_Q6 | 641 (+) | 1.000 | 0.928 | ACATTtat I$ZEN_Q6 | 649 (+) | 0.981 | 0.914 | TCATTtat I$ZEN_Q6 | 985 (+) | 0.888 | 0.913 | ACAGTtaa I$ZEN_Q6 | 3413 (-) | 0.981 | 0.971 | gttAATGA I$ZEN_Q6 | 3431 (-) | 0.869 | 0.899 | ttaATTGA I$ZEN_Q6 | 3447 (+) | 0.888 | 0.913 | ACAGTtaa I$ZEN_Q6 | 3655 (+) | 0.869 | 0.899 | TCAGTaaa I$ZEN_Q6 | 3864 (+) | 0.907 | 0.928 | CCATTtaa I$ZEN_Q6 | 4092 (-) | 0.869 | 0.899 | tttACTGA I$ZEN_Q6 | 4215 (+) | 0.869 | 0.899 | TCAGTtaa I$ZEN_Q6 | 4446 (+) | 0.888 | 0.913 | ACAATtaa I$ZEN_Q6 | 4457 (+) | 0.981 | 0.898 | TCATTtca I$ZEN_Q6 | 5724 (+) | 0.869 | 0.899 | ACACTaaa I$ZEN_Q6 | 6193 (+) | 0.907 | 0.928 | CCATTtaa I$ZEN_Q6 | 7260 (+) | 0.981 | 0.986 | TCATTaaa I$ZEN_Q6 | 7343 (+) | 1.000 | 0.912 | ACATTtca I$ZEN_Q6 | 7829 (+) | 0.888 | 0.913 | ACAATtaa I$ZEN_Q6 | 8576 (-) | 0.907 | 0.928 | ttaAATGG I$ZEN_Q6 | 9793 (+) | 0.869 | 0.899 | TCAGTaaa I$ZEN_Q6 | 9796 (-) | 1.000 | 0.986 | gtaAATGT I$ZEN_Q6 | 9860 (+) | 0.907 | 0.928 | GCATTaaa I$ZEN_Q6 | 10309 (-) | 0.888 | 0.913 | ttaACTGT I$ZEN_Q6 | 10340 (-) | 0.888 | 0.913 | tttATTGT I$ZEN_Q6 | 11340 (-) | 0.907 | 0.914 | gtaAATGC I$ZEN_Q6 | 11639 (-) | 1.000 | 1.000 | ttaAATGT V$ZTA_Q2 | 10036 (-) | 1.000 | 0.985 | TGAGTcaccgtgt V$MYOGENIN_Q6 | 1027 (+) | 0.973 | 0.981 | ggCAGTTg V$MYOGENIN_Q6 | 1392 (+) | 1.000 | 1.000 | ggCAGGTg V$MYOGENIN_Q6 | 1788 (-) | 1.000 | 1.000 | cACCTGcc V$MYOGENIN_Q6 | 2970 (+) | 0.964 | 0.974 | ggCAGATg V$MYOGENIN_Q6 | 6264 (-) | 0.964 | 0.974 | cATCTGcc V$MYOGENIN_Q6 | 8279 (-) | 1.000 | 1.000 | cAGCTGcc V$MYOGENIN_Q6 | 8421 (+) | 1.000 | 0.989 | agCAGCTg V$MYOGENIN_Q6 | 8423 (-) | 1.000 | 0.971 | cAGCTGtt V$MYOGENIN_Q6 | 8504 (+) | 1.000 | 0.982 | gaCAGGTg V$MYOGENIN_Q6 | 8969 (+) | 1.000 | 0.971 | acCAGGTg V$MYOGENIN_Q6 | 11712 (-) | 0.973 | 0.981 | cAACTGcc F$TBP_Q6 | 500 (+) | 1.000 | 0.934 | ttaTTATAt F$TBP_Q6 | 653 (+) | 1.000 | 0.934 | ttaTTATAt F$TBP_Q6 | 658 (-) | 0.903 | 0.906 | aTATATaag F$TBP_Q6 | 660 (-) | 
1.000 | 0.963 | aTATAAgta F$TBP_Q6 | 834 (-) | 1.000 | 0.971 | aTATAAaac F$TBP_Q6 | 1890 (+) | 1.000 | 0.961 | attTTATAt F$TBP_Q6 | 3405 (-) | 1.000 | 0.935 | tTATAAaag F$TBP_Q6 | 3624 (+) | 1.000 | 0.931 | cgtTTATAa F$TBP_Q6 | 5396 (+) | 1.000 | 0.986 | cctTTATAt F$TBP_Q6 | 6760 (+) | 1.000 | 0.930 | cctTTATAc F$TBP_Q6 | 10316 (+) | 1.000 | 0.999 | tttTTATAt F$TBP_Q6 | 10329 (+) | 1.000 | 0.917 | attTTATAg F$TBP_Q6 | 10391 (-) | 1.000 | 0.999 | aTATAAaaa F$TBP_Q6 | 10451 (+) | 1.000 | 0.987 | cttTTATAt F$TBP_Q6 | 11578 (+) | 1.000 | 0.958 | tgcTTATAt V$ZF5_01 | 105 (+) | 0.844 | 0.759 | GGGGGtgg V$ZF5_01 | 163 (-) | 0.694 | 0.796 | ccgTTCCC V$ZF5_01 | 728 (+) | 0.799 | 0.781 | GTGAGcca V$ZF5_01 | 1379 (+) | 0.799 | 0.855 | GTGGGcga V$ZF5_01 | 1381 (+) | 1.000 | 0.852 | GGGCGagg V$ZF5_01 | 1585 (+) | 0.805 | 0.785 | GAGAGcca V$ZF5_01 | 1603 (-) | 0.783 | 0.770 | tggGGCTC V$ZF5_01 | 1605 (+) | 0.822 | 0.808 | GGGCTcag V$ZF5_01 | 1724 (-) | 0.783 | 0.856 | ccgGGCTC V$ZF5_01 | 1783 (-) | 0.777 | 0.766 | tggAGCAC V$ZF5_01 | 2018 (+) | 0.792 | 0.788 | GAGTGctg V$ZF5_01 | 2343 (-) | 0.961 | 0.762 | ctaCGCTC V$ZF5_01 | 2470 (+) | 0.671 | 0.770 | AAGCGcga V$ZF5_01 | 2472 (+) | 1.000 | 0.777 | GCGCGaag V$ZF5_01 | 2485 (+) | 0.653 | 0.770 | GGGTTcgg V$ZF5_01 | 2816 (-) | 0.811 | 0.874 | ccgTGCTC V$ZF5_01 | 2818 (+) | 0.777 | 0.777 | GTGCTctg V$ZF5_01 | 2918 (+) | 0.648 | 0.766 | GTGGAcgg V$ZF5_01 | 3006 (+) | 0.955 | 0.896 | GTGCGcag V$ZF5_01 | 3059 (+) | 0.844 | 0.759 | GCGAGtgg V$ZF5_01 | 3248 (-) | 0.799 | 0.792 | cagCCCAC V$ZF5_01 | 3384 (-) | 0.822 | 0.796 | tggGGCCC V$ZF5_01 | 3386 (+) | 0.822 | 0.808 | GGGCCctg V$ZF5_01 | 3963 (-) | 0.777 | 0.777 | ctgAGCAC V$ZF5_01 | 3965 (+) | 0.811 | 0.800 | GAGCActg V$ZF5_01 | 4019 (-) | 0.805 | 0.796 | ctgCTCTC V$ZF5_01 | 4597 (-) | 0.844 | 0.822 | ctgCCCGC V$ZF5_01 | 4601 (-) | 0.623 | 0.750 | ccgCTTGC V$ZF5_01 | 4731 (-) | 0.805 | 0.785 | tggCCCTC V$ZF5_01 | 4834 (+) | 0.694 | 0.785 | GGGAAcga V$ZF5_01 | 5114 (+) | 0.777 | 
0.766 | GTGCTcaa V$ZF5_01 | 5222 (+) | 0.961 | 0.752 | GAGCGgag V$ZF5_01 | 5233 (+) | 0.694 | 0.785 | GGGAAcga V$ZF5_01 | 5329 (-) | 0.850 | 0.826 | cagTGCGC V$ZF5_01 | 5331 (+) | 0.955 | 0.896 | GTGCGccg V$ZF5_01 | 5501 (-) | 0.844 | 0.822 | ctgCTCCC V$ZF5_01 | 5887 (+) | 0.850 | 0.762 | GGGCAtgg V$ZF5_01 | 5892 (-) | 0.822 | 0.796 | tggGGCCC V$ZF5_01 | 5894 (+) | 0.822 | 0.769 | GGGCCcgc V$ZF5_01 | 6152 (-) | 0.805 | 0.785 | tggCTCTC V$ZF5_01 | 6222 (-) | 0.961 | 0.825 | tcaCGCTC V$ZF5_01 | 6303 (-) | 0.777 | 0.777 | cagGGCAC V$ZF5_01 | 6305 (+) | 0.850 | 0.826 | GGGCActg V$ZF5_01 | 6628 (-) | 0.777 | 0.766 | tggGGCAC V$ZF5_01 | 6693 (+) | 0.792 | 0.788 | GAGTGctg V$ZF5_01 | 6732 (+) | 0.799 | 0.792 | GTGAGcag V$ZF5_01 | 6855 (+) | 0.811 | 0.761 | GAGCAcgt V$ZF5_01 | 7009 (+) | 0.623 | 0.750 | GCAGGcgg V$ZF5_01 | 7091 (-) | 1.000 | 0.915 | ttgCGCGC V$ZF5_01 | 7093 (+) | 1.000 | 0.813 | GCGCGcac V$ZF5_01 | 7093 (-) | 0.955 | 0.857 | gcgCGCAC V$ZF5_01 | 7119 (+) | 0.786 | 0.772 | GTGTGcaa V$ZF5_01 | 7145 (+) | 0.805 | 0.758 | GAGGGcgc V$ZF5_01 | 7147 (+) | 1.000 | 0.926 | GGGCGccg V$ZF5_01 | 7199 (+) | 0.831 | 0.802 | GGGTGcta V$ZF5_01 | 7476 (+) | 0.786 | 0.772 | GTGTGcta V$ZF5_01 | 7481 (-) | 1.000 | 0.788 | ctaCGCGC V$ZF5_01 | 7483 (-) | 0.955 | 0.857 | acgCGCAC V$ZF5_01 | 7485 (+) | 0.850 | 0.787 | GCGCAcgt V$ZF5_01 | 7544 (+) | 0.799 | 0.781 | GTGAGcca V$ZF5_01 | 7675 (-) | 0.799 | 0.781 | tggCTCAC V$ZF5_01 | 8229 (-) | 0.844 | 0.811 | tggCTCCC V$ZF5_01 | 8235 (-) | 0.831 | 0.750 | ccaCACCC V$ZF5_01 | 8288 (-) | 0.844 | 0.811 | tggCTCCC V$ZF5_01 | 8509 (+) | 0.955 | 0.810 | GTGCGgga V$ZF5_01 | 8542 (+) | 0.799 | 0.792 | GTGGGccg V$ZF5_01 | 8739 (+) | 0.831 | 0.750 | GGGTGtgg V$ZF5_01 | 8747 (-) | 0.799 | 0.781 | tggCCCAC V$ZF5_01 | 8805 (+) | 0.844 | 0.811 | GGGGGcca V$ZF5_01 | 9201 (+) | 0.811 | 0.789 | GAGCAcca V$ZF5_01 | 9237 (-) | 0.822 | 0.808 | cagAGCCC V$ZF5_01 | 9239 (+) | 0.783 | 0.770 | GAGCCcta V$ZF5_01 | 9248 (+) | 0.805 | 0.796 | GAGAGctg V$ZF5_01 
| 9303 (+) | 0.777 | 0.777 | GTGCCcag V$ZF5_01 | 9379 (+) | 0.805 | 0.796 | GAGAGctg V$ZF5_01 | 9572 (+) | 0.831 | 0.750 | GGGTGtgg V$ZF5_01 | 9687 (+) | 0.844 | 0.759 | GGGGGtgg V$ZF5_01 | 10088 (-) | 0.811 | 0.800 | cagTGCTC V$ZF5_01 | 10719 (-) | 0.779 | 0.779 | ctgCGTGC V$ZF5_01 | 10721 (+) | 0.831 | 0.813 | GCGTGctg V$ZF5_01 | 10764 (-) | 0.777 | 0.766 | ttgGGCAC V$ZF5_01 | 10766 (+) | 0.850 | 0.815 | GGGCActa V$ZF5_01 | 10782 (-) | 0.653 | 0.758 | tcgGACCC V$ZF5_01 | 10883 (+) | 0.811 | 0.800 | GAGCActg V$ZF5_01 | 10938 (+) | 0.648 | 0.766 | GTGGAcgg V$ZF5_01 | 11133 (-) | 0.792 | 0.788 | ctgCACTC V$ZF5_01 | 11260 (+) | 0.786 | 0.772 | GTGTGcca V$ZF5_01 | 11604 (+) | 0.804 | 0.796 | GTGCAcag V$ZF5_01 | 11715 (-) | 0.805 | 0.796 | ctgCCCTC V$ZF5_01 | 11846 (+) | 0.844 | 0.759 | GGGGGtgg V$ZF5_01 | 11916 (-) | 0.822 | 0.871 | tcgGGCCC V$ZF5_01 | 11918 (+) | 0.822 | 0.796 | GGGCCcaa V$ZF5_01 | 11923 (-) | 1.000 | 0.788 | caaCGCCC V$ZF5_01 | 11966 (-) | 0.783 | 0.782 | ctgAGCTC V$PAX8_01 | 1 (-) | 0.935 | 0.872 | gtTCATGcatgtaca V$PAX8_01 | 24 (+) | 0.953 | 0.880 | tgaatgtgTGTGAac V$PAX8_01 | 107 (+) | 0.973 | 0.839 | gggtggccAGTGAgg V$PAX8_01 | 132 (-) | 0.855 | 0.814 | gcTCAGTcaatccag V$PAX8_01 | 136 (-) | 0.862 | 0.841 | agTCAATccagacat V$PAX8_01 | 295 (+) | 0.855 | 0.849 | ttttacagACTGAgc V$PAX8_01 | 408 (+) | 0.963 | 0.884 | gcggaatgGGTGAgc V$PAX8_01 | 528 (+) | 0.963 | 0.939 | aaatgatgGGTGAtg V$PAX8_01 | 912 (-) | 0.935 | 0.872 | ttTCATGcgtagcac V$PAX8_01 | 981 (-) | 0.953 | 0.814 | caTCACAgttaataa V$PAX8_01 | 1261 (-) | 0.897 | 0.844 | caTCATCcgtgtgta V$PAX8_01 | 1274 (+) | 0.897 | 0.873 | tactctggGATGAca V$PAX8_01 | 1327 (-) | 0.973 | 0.937 | tcTCACTccccaggg V$PAX8_01 | 1411 (+) | 1.000 | 0.810 | tccattgcCGTGAga V$PAX8_01 | 1728 (-) | 0.897 | 0.887 | gcTCATCctgaactt V$PAX8_01 | 2011 (+) | 0.822 | 0.821 | atctcctgAGTGCtg V$PAX8_01 | 2292 (+) | 0.802 | 0.825 | agtttttgTGTGTgt V$PAX8_01 | 2294 (+) | 0.802 | 0.825 | tttttgtgTGTGTgt V$PAX8_01 | 2551 (+) | 
0.953 | 0.810 | gtttctgtTGTGAaa V$PAX8_01 | 2668 (-) | 0.822 | 0.811 | tgACACTcttaaccc V$PAX8_01 | 2693 (-) | 0.953 | 0.874 | ggTCACActcttctc V$PAX8_01 | 2845 (-) | 0.908 | 0.813 | caTCATTccgtcctt V$PAX8_01 | 3040 (-) | 0.834 | 0.830 | ctCCACCcgggacat V$PAX8_01 | 3075 (+) | 0.963 | 0.880 | ggtcacagGGTGAtt V$PAX8_01 | 3232 (-) | 0.845 | 0.860 | agTCAGCcaacattc V$PAX8_01 | 3261 (+) | 0.963 | 0.827 | gttttcttGGTGAtg V$PAX8_01 | 3281 (+) | 0.897 | 0.829 | atgattggGATGAgg V$PAX8_01 | 3467 (-) | 0.935 | 0.824 | ccTCATGatttactg V$PAX8_01 | 3700 (+) | 0.889 | 0.844 | aagagatgCTTGAtc V$PAX8_01 | 3717 (+) | 0.963 | 0.942 | tatttacgGGTGAct V$PAX8_01 | 3755 (-) | 0.908 | 0.894 | agTCATTcttaaaag V$PAX8_01 | 3933 (-) | 0.855 | 0.813 | taTCAGTcacttgtt V$PAX8_01 | 3945 (-) | 0.908 | 0.844 | gtTCATTcagccatc V$PAX8_01 | 3969 (+) | 0.887 | 0.810 | actggctgTATGAta V$PAX8_01 | 3991 (-) | 0.935 | 0.827 | tcTCATGaaacaacg V$PAX8_01 | 4279 (-) | 0.802 | 0.822 | aaGCACAcattagac V$PAX8_01 | 4382 (+) | 0.973 | 0.836 | ggctacatAGTGAgt V$PAX8_01 | 4406 (-) | 0.845 | 0.833 | ggTCAGCcacaacct V$PAX8_01 | 4460 (-) | 0.953 | 0.853 | ttTCACActgactga V$PAX8_01 | 4716 (+) | 0.953 | 0.828 | taattcacTGTGAtt V$PAX8_01 | 4718 (-) | 0.973 | 0.814 | atTCACTgtgatttg V$PAX8_01 | 4833 (+) | 0.973 | 0.871 | agggaacgAGTGAtc V$PAX8_01 | 4933 (+) | 0.855 | 0.840 | tcatggagACTGAac V$PAX8_01 | 4968 (-) | 0.882 | 0.814 | agTCAGGctcagccc V$PAX8_01 | 4974 (-) | 0.845 | 0.826 | gcTCAGCccctacct V$PAX8_01 | 5101 (+) | 0.953 | 0.839 | ttattattTGTGAgt V$PAX8_01 | 5105 (+) | 0.822 | 0.834 | tatttgtgAGTGCtc V$PAX8_01 | 5168 (+) | 0.953 | 0.831 | ctttcactTGTGAct V$PAX8_01 | 5169 (-) | 0.973 | 0.852 | ttTCACTtgtgactg V$PAX8_01 | 5265 (-) | 0.897 | 0.826 | agTCATCctgggcca V$PAX8_01 | 5364 (-) | 0.802 | 0.818 | aaACACAcgggattt V$PAX8_01 | 5537 (-) | 0.973 | 0.886 | atTCACTctctttct V$PAX8_01 | 5555 (-) | 0.973 | 0.840 | tcTCACTagaaaaca V$PAX8_01 | 5697 (-) | 0.887 | 0.844 | gtTCATAcacttagg V$PAX8_01 | 5969 (+) | 0.963 | 0.812 | 
ttttaagtGGTGAta V$PAX8_01 | 6089 (+) | 0.824 | 0.832 | agttcctgTGTGGat V$PAX8_01 | 6220 (-) | 1.000 | 0.910 | tcTCACGctctgcat V$PAX8_01 | 6289 (+) | 0.855 | 0.861 | tgatgatgACTGAgc V$PAX8_01 | 6318 (+) | 0.855 | 0.820 | cttccctgACTGAag V$PAX8_01 | 6332 (-) | 0.802 | 0.814 | gcACACActggattt V$PAX8_01 | 6407 (+) | 0.908 | 0.834 | acgctcagAATGAct V$PAX8_01 | 6474 (+) | 0.897 | 0.832 | atgctcagGATGAgg V$PAX8_01 | 6528 (-) | 0.963 | 0.925 | gtTCACCctggagac V$PAX8_01 | 6540 (+) | 0.845 | 0.815 | gacaaacgGCTGAgg V$PAX8_01 | 6614 (+) | 1.000 | 0.844 | ctaaactcCGTGAgt V$PAX8_01 | 6630 (-) | 0.812 | 0.815 | ggGCACCcagaaggc V$PAX8_01 | 6646 (-) | 0.855 | 0.847 | ccTCAGTctgcaaac V$PAX8_01 | 6723 (+) | 0.963 | 0.943 | ctttaaagGGTGAgc V$PAX8_01 | 6761 (+) | 0.935 | 0.818 | ctttatacCATGAac V$PAX8_01 | 6983 (+) | 0.963 | 0.824 | acttagctGGTGAgg V$PAX8_01 | 7073 (+) | 0.887 | 0.886 | tgtttgtgTATGAgt V$PAX8_01 | 7439 (+) | 0.824 | 0.839 | attttatgTGTGGga V$PAX8_01 | 7535 (+) | 0.953 | 0.828 | cagttggtTGTGAgc V$PAX8_01 | 7747 (+) | 0.953 | 0.821 | ggctacatTGTGAgt V$PAX8_01 | 8014 (+) | 0.973 | 0.820 | taacctttAGTGAgt V$PAX8_01 | 8018 (+) | 0.822 | 0.831 | ctttagtgAGTGTca V$PAX8_01 | 8105 (-) | 0.953 | 0.855 | tgTCACAcactctcc V$PAX8_01 | 9022 (+) | 0.953 | 0.812 | cagatcttTGTGAgt V$PAX8_01 | 9166 (+) | 0.935 | 0.885 | ctaagatgCATGAac V$PAX8_01 | 9189 (-) | 0.845 | 0.858 | gaCCACTcaagagag V$PAX8_01 | 9255 (-) | 0.935 | 0.830 | gcTCATGggacaggg V$PAX8_01 | 9565 (+) | 0.812 | 0.823 | ggtttctgGGTGTgg V$PAX8_01 | 9724 (+) | 0.887 | 0.835 | tctctttgTATGAta V$PAX8_01 | 9769 (-) | 0.935 | 0.898 | agTCATGccatagaa V$PAX8_01 | 9847 (-) | 0.882 | 0.813 | agTCAGGcaggcagc V$PAX8_01 | 9864 (+) | 0.908 | 0.853 | taaagaagAATGAgg V$PAX8_01 | 10026 (+) | 0.953 | 0.844 | ccgtagttTGTGAgt V$PAX8_01 | 10174 (+) | 0.802 | 0.834 | tattagtgTGTGTgt V$PAX8_01 | 10219 (+) | 0.845 | 0.814 | catctaagAGTGGaa V$PAX8_01 | 10245 (-) | 0.882 | 0.811 | ggTCAGGcttgtgca V$PAX8_01 | 10269 (+) | 0.845 | 0.848 | 
ttttgccgGCTGAgc V$PAX8_01 | 10289 (-) | 0.882 | 0.817 | caTCAGGcttcggac V$PAX8_01 | 10470 (-) | 0.973 | 0.828 | tcTCACTttagacca V$PAX8_01 | 10624 (+) | 0.973 | 0.819 | ttgtagaaAGTGAct V$PAX8_01 | 10714 (+) | 0.849 | 0.820 | catctctgCGTGCtg V$PAX8_01 | 10831 (-) | 0.882 | 0.812 | ttTCAGGcccatcgc V$PAX8_01 | 10864 (+) | 0.872 | 0.842 | agatgcagCGTGGtg V$PAX8_01 | 11192 (+) | 0.862 | 0.868 | gtttcttgATTGAtt V$PAX8_01 | 11356 (+) | 0.935 | 0.848 | aacgagagCATGAtt V$PAX8_01 | 11584 (+) | 0.973 | 0.899 | tatccaggAGTGAga V$PAX8_01 | 11704 (-) | 0.963 | 0.851 | tgTCACCccaactgc V$PAX8_01 | 11731 (-) | 0.842 | 0.812 | tcTCAAAcatctcac V$PAX8_01 | 11819 (+) | 0.963 | 0.859 | agggttggGGTGAga V$PAX8_01 | 11841 (+) | 0.834 | 0.832 | gagtggggGGTGGgg V$PAX8_01 | 11887 (-) | 0.973 | 0.932 | tcTCACTctcaacca V$PAX8_01 | 11893 (-) | 0.852 | 0.859 | tcTCAACcaatagca V$CACBINDINGPROTEIN_Q6 | 105 (+) | 1.000 | 0.941 | gggGGTGGc V$CACBINDINGPROTEIN_Q6 | 118 (+) | 0.980 | 0.941 | gagGCTGGc V$CACBINDINGPROTEIN_Q6 | 1488 (+) | 0.934 | 0.955 | gagGGAGGg V$CACBINDINGPROTEIN_Q6 | 1499 (+) | 0.934 | 0.941 | gggGGAGGg V$CACBINDINGPROTEIN_Q6 | 1503 (+) | 0.934 | 0.955 | gagGGAGGg V$CACBINDINGPROTEIN_Q6 | 1507 (+) | 0.934 | 0.955 | gagGGAGGg V$CACBINDINGPROTEIN_Q6 | 1511 (+) | 0.934 | 0.955 | gagGGAGGg V$CACBINDINGPROTEIN_Q6 | 2429 (+) | 0.922 | 0.933 | gggGTTGGg V$CACBINDINGPROTEIN_Q6 | 4237 (+) | 0.980 | 0.928 | aggGCTGGg V$CACBINDINGPROTEIN_Q6 | 5077 (+) | 0.980 | 0.941 | gagGCTGGc V$CACBINDINGPROTEIN_Q6 | 9008 (+) | 0.914 | 0.941 | gagGCAGGg V$CACBINDINGPROTEIN_Q6 | 11846 (+) | 1.000 | 0.987 | gggGGTGGg V$CACBINDINGPROTEIN_Q6 | 11871 (+) | 0.934 | 0.941 | gggGGAGGg V$CACCCBINDINGFACTOR_Q6| 8235 (-) | 1.000 | 0.934 | ccaCACCCagatgaag V$CACCCBINDINGFACTOR_Q6| 9564 (+) | 1.000 | 0.912 | cggtttctGGGTGtgg V$COREBINDINGFACTOR_Q6 | 934 (-) | 1.000 | 1.000 | aaaCCACA V$COREBINDINGFACTOR_Q6 | 4439 (-) | 1.000 | 1.000 | aaaCCACA V$COREBINDINGFACTOR_Q6 | 5049 (+) | 1.000 | 0.983 | TGTGGtct 
V$COREBINDINGFACTOR_Q6 | 5256 (+) | 1.000 | 0.972 | TGTGGtca V$COREBINDINGFACTOR_Q6 | 9400 (+) | 1.000 | 0.948 | TGTGGttc V$COREBINDINGFACTOR_Q6 | 11256 (+) | 1.000 | 0.971 | TGTGGtgt I$GAGAFACTOR_Q6 | 378 (+) | 0.993 | 0.898 | ggtGAGAGatg I$GAGAFACTOR_Q6 | 2055 (-) | 0.993 | 0.944 | cttCTCTCtct I$GAGAFACTOR_Q6 | 2057 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2059 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2061 (-) | 0.993 | 0.989 | tctCTCTCtgt I$GAGAFACTOR_Q6 | 2085 (-) | 0.993 | 0.968 | tgtCTCTCtgt I$GAGAFACTOR_Q6 | 2093 (-) | 0.993 | 0.968 | tgtCTCTCtgt I$GAGAFACTOR_Q6 | 2101 (-) | 0.993 | 0.968 | tgtCTCTCtgt I$GAGAFACTOR_Q6 | 2109 (-) | 0.993 | 0.972 | tgtCTCTCtct I$GAGAFACTOR_Q6 | 2111 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2113 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2115 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2117 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2119 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2121 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2123 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2125 (-) | 0.993 | 0.953 | tctCTCTCttt I$GAGAFACTOR_Q6 | 2133 (-) | 0.993 | 0.986 | tttCTCTCtct I$GAGAFACTOR_Q6 | 2135 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2137 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2139 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2141 (-) | 0.993 | 0.939 | tctCTCTCttc I$GAGAFACTOR_Q6 | 2187 (-) | 0.993 | 0.944 | cttCTCTCtct I$GAGAFACTOR_Q6 | 2189 (-) | 0.993 | 0.993 | tctCTCTCtct I$GAGAFACTOR_Q6 | 2191 (-) | 0.993 | 0.953 | tctCTCTCttt I$GAGAFACTOR_Q6 | 2343 (-) | 1.000 | 0.894 | ctaCGCTCttc I$GAGAFACTOR_Q6 | 2991 (-) | 1.000 | 0.901 | ttcCGCTCctc I$GAGAFACTOR_Q6 | 3186 (-) | 0.993 | 0.910 | cccCTCTCttt I$GAGAFACTOR_Q6 | 5065 (+) | 0.993 | 0.913 | agtGAGAGatg I$GAGAFACTOR_Q6 | 5219 (+) | 1.000 | 0.912 | tctGAGCGgag I$GAGAFACTOR_Q6 | 5539 (-) | 0.993 | 0.953 | tcaCTCTCttt I$GAGAFACTOR_Q6 | 5551 (-) | 0.993 | 0.960 | 
tgtCTCTCact I$GAGAFACTOR_Q6 | 5772 (+) | 0.993 | 0.911 | aagGAGAGaaa I$GAGAFACTOR_Q6 | 6222 (-) | 1.000 | 0.980 | tcaCGCTCtgc I$GAGAFACTOR_Q6 | 6703 (+) | 0.993 | 0.924 | gtaGAGAGaaa I$GAGAFACTOR_Q6 | 7299 (+) | 0.993 | 0.908 | gctGAGAGagt I$GAGAFACTOR_Q6 | 7301 (+) | 0.993 | 0.965 | tgaGAGAGtaa I$GAGAFACTOR_Q6 | 9286 (+) | 0.993 | 0.941 | gctGAGAGtca I$GAGAFACTOR_Q6 | 11020 (+) | 0.993 | 0.898 | tagGAGAGaga I$GAGAFACTOR_Q6 | 11887 (-) | 0.904 | 0.902 | tctCACTCtca I$GAGAFACTOR_Q6 | 11889 (-) | 0.993 | 0.919 | tcaCTCTCaac V$HNF3ALPHA_Q6 | 215 (+) | 0.972 | 0.916 | TATTTtttttt V$HNF3ALPHA_Q6 | 508 (+) | 0.972 | 0.931 | TATTTgtcttc V$HNF3ALPHA_Q6 | 846 (-) | 0.972 | 0.931 | ttaaatAAATA V$HNF3ALPHA_Q6 | 875 (-) | 1.000 | 0.928 | atgcacAAACA V$HNF3ALPHA_Q6 | 953 (-) | 0.972 | 0.925 | atgatcAAATA V$HNF3ALPHA_Q6 | 1474 (+) | 0.972 | 0.956 | TATTTgctctg V$HNF3ALPHA_Q6 | 1533 (+) | 0.909 | 0.915 | TTTTTgttcta V$HNF3ALPHA_Q6 | 1896 (+) | 0.972 | 0.939 | TATTTatttta V$HNF3ALPHA_Q6 | 2363 (+) | 0.972 | 0.939 | TATTTattttg V$HNF3ALPHA_Q6 | 2408 (-) | 1.000 | 0.937 | taaaaaAAACA V$HNF3ALPHA_Q6 | 2412 (-) | 0.909 | 0.930 | aaaaacAAAAA V$HNF3ALPHA_Q6 | 3597 (+) | 1.000 | 0.922 | TGTTTgggttt V$HNF3ALPHA_Q6 | 3806 (-) | 1.000 | 0.922 | tagaaaAAACA V$HNF3ALPHA_Q6 | 4006 (-) | 0.972 | 0.931 | ataaatAAATA V$HNF3ALPHA_Q6 | 4219 (-) | 0.972 | 0.907 | ttaaaaAAATA V$HNF3ALPHA_Q6 | 4489 (-) | 0.909 | 0.906 | ttgaacAAAAA V$HNF3ALPHA_Q6 | 4497 (-) | 1.000 | 0.910 | aaaacaAAACA V$HNF3ALPHA_Q6 | 4502 (-) | 1.000 | 0.910 | aaaacaAAACA V$HNF3ALPHA_Q6 | 5120 (-) | 1.000 | 0.924 | aaaaggAAACA V$HNF3ALPHA_Q6 | 5437 (+) | 0.972 | 0.943 | TATTTgtattg V$HNF3ALPHA_Q6 | 5806 (-) | 0.972 | 0.916 | aaaaaaAAATA V$HNF3ALPHA_Q6 | 6588 (-) | 0.972 | 0.978 | caaaacAAATA V$HNF3ALPHA_Q6 | 6608 (+) | 0.972 | 0.932 | TATTTgctaaa V$HNF3ALPHA_Q6 | 6651 (-) | 1.000 | 0.905 | gtctgcAAACA V$HNF3ALPHA_Q6 | 7050 (+) | 0.972 | 0.912 | TATTTtttttc V$HNF3ALPHA_Q6 | 7073 (+) | 1.000 | 0.942 | TGTTTgtgtat V$HNF3ALPHA_Q6 | 
7211 (-) | 0.909 | 0.906 | ttgaacAAAAA V$HNF3ALPHA_Q6 | 7428 (+) | 0.972 | 0.979 | TATTTgttttt V$HNF3ALPHA_Q6 | 7583 (-) | 1.000 | 0.910 | caggtcAAACA V$HNF3ALPHA_Q6 | 7775 (-) | 1.000 | 0.910 | aaaacaAAACA V$HNF3ALPHA_Q6 | 7779 (-) | 0.909 | 0.930 | caaaacAAAAA V$HNF3ALPHA_Q6 | 7821 (-) | 1.000 | 1.000 | caaaacAAACA V$HNF3ALPHA_Q6 | 7941 (+) | 1.000 | 0.933 | TGTTTagtttt V$HNF3ALPHA_Q6 | 7972 (+) | 1.000 | 1.000 | TGTTTgttttt V$HNF3ALPHA_Q6 | 8187 (+) | 1.000 | 0.977 | TGTTTgctctg V$HNF3ALPHA_Q6 | 8897 (-) | 1.000 | 0.911 | taccacAAACA V$HNF3ALPHA_Q6 | 9110 (-) | 0.909 | 0.930 | caaaacAAAAA V$HNF3ALPHA_Q6 | 9488 (+) | 0.972 | 0.905 | TATTTgcaata V$HNF3ALPHA_Q6 | 9533 (-) | 1.000 | 0.985 | tagaacAAACA V$HNF3ALPHA_Q6 | 10910 (-) | 0.972 | 0.926 | ctatgcAAATA V$HNF3ALPHA_Q6 | 11214 (+) | 1.000 | 0.918 | TGTTTgaactg V$HNF3ALPHA_Q6 | 11474 (+) | 0.972 | 0.979 | TATTTgttttt V$HP1SITEFACTOR_Q6 | 6954 (-) | 0.941 | 0.889 | CTGTCcattatt V$HP1SITEFACTOR_Q6 | 8947 (-) | 0.858 | 0.870 | CTTTTtaaaatt V$USF2_Q6 | 737 (+) | 1.000 | 1.000 | CACGTg V$USF2_Q6 | 737 (-) | 1.000 | 1.000 | cACGTG V$USF2_Q6 | 1128 (+) | 1.000 | 1.000 | CACGTg V$USF2_Q6 | 1128 (-) | 1.000 | 1.000 | cACGTG V$USF2_Q6 | 6066 (+) | 1.000 | 1.000 | CACGTg V$USF2_Q6 | 6066 (-) | 1.000 | 1.000 | cACGTG V$SF1_Q6 | 1937 (+) | 1.000 | 0.932 | tagCCTTG V$SF1_Q6 | 9636 (-) | 1.000 | 1.000 | CAAGGtca F$ROX1_Q6 | 3145 (-) | 1.000 | 0.972 | aACAATata F$ROX1_Q6 | 7333 (-) | 1.000 | 0.997 | aACAATaga F$ROX1_Q6 | 10340 (+) | 1.000 | 0.980 | tttATTGTt F$ROX1_Q6 | 10461 (-) | 1.000 | 0.983 | aACAATgca V$CDX2_Q5 | 60 (-) | 0.854 | 0.759 | ggccATAATtactg V$CDX2_Q5 | 291 (+) | 0.823 | 0.758 | agtgtTTTACagac V$CDX2_Q5 | 566 (-) | 1.000 | 0.863 | ggagATAAAcgtct V$CDX2_Q5 | 639 (+) | 1.000 | 0.776 | aaacaTTTATtcat V$CDX2_Q5 | 647 (+) | 1.000 | 0.812 | attcaTTTATtata V$CDX2_Q5 | 820 (+) | 0.798 | 0.773 | cacttTTTAAtact V$CDX2_Q5 | 821 (+) | 0.896 | 0.757 | actttTTAATacta V$CDX2_Q5 | 832 (-) | 1.000 | 0.752 | ctatATAAAacaag 
V$CDX2_Q5 | 952 (-) | 0.854 | 0.766 | aatgATCAAatatt V$CDX2_Q5 | 1221 (+) | 0.750 | 0.779 | agtttCTGATgact V$CDX2_Q5 | 1757 (-) | 0.896 | 0.754 | atctATAAGcctct V$CDX2_Q5 | 1887 (+) | 1.000 | 0.779 | cccatTTTATattt V$CDX2_Q5 | 1898 (+) | 1.000 | 0.808 | tttatTTTATtatt V$CDX2_Q5 | 1906 (+) | 1.000 | 0.807 | attatTTTATttga V$CDX2_Q5 | 2024 (-) | 0.896 | 0.772 | tgggATTAAaggag V$CDX2_Q5 | 2540 (-) | 0.854 | 0.753 | gagaATAACaggtt V$CDX2_Q5 | 2659 (+) | 0.854 | 0.807 | gcactTTGATgaca V$CDX2_Q5 | 2734 (+) | 1.000 | 0.769 | ctgccTTTATgcct V$CDX2_Q5 | 3121 (-) | 0.677 | 0.791 | ggctATATTaattt V$CDX2_Q5 | 3403 (-) | 1.000 | 0.894 | agttATAAAagtta V$CDX2_Q5 | 3409 (+) | 0.896 | 0.775 | aaaagTTAATgatt V$CDX2_Q5 | 3450 (-) | 1.000 | 0.799 | gttaATAAAacaca V$CDX2_Q5 | 3470 (+) | 0.823 | 0.762 | catgaTTTACtgcc V$CDX2_Q5 | 3579 (-) | 0.694 | 0.772 | agttATTCAactct V$CDX2_Q5 | 3621 (+) | 1.000 | 0.786 | atgcgTTTATaata V$CDX2_Q5 | 3733 (+) | 0.854 | 0.840 | aaattATTATttct V$CDX2_Q5 | 3826 (-) | 0.854 | 0.754 | attcATAATaaatt V$CDX2_Q5 | 3829 (-) | 1.000 | 0.764 | cataATAAAtttcc V$CDX2_Q5 | 3845 (-) | 1.000 | 0.777 | tctcATAAAatatc V$CDX2_Q5 | 3908 (-) | 1.000 | 0.808 | attaATAAAgcttt V$CDX2_Q5 | 3914 (+) | 1.000 | 0.803 | aaagcTTTATgaat V$CDX2_Q5 | 4002 (-) | 1.000 | 0.803 | aacgATAAAtaaat V$CDX2_Q5 | 4060 (-) | 0.896 | 0.843 | aataATTAAaatat V$CDX2_Q5 | 4314 (-) | 1.000 | 0.787 | ctgcATAAAactgg V$CDX2_Q5 | 5001 (+) | 0.798 | 0.757 | gacctTGTATtacc V$CDX2_Q5 | 5382 (+) | 1.000 | 0.798 | ttagtTTTATttta V$CDX2_Q5 | 5412 (-) | 0.896 | 0.759 | tcttATTAAatctt V$CDX2_Q5 | 5436 (+) | 0.798 | 0.808 | ttattTGTATtgcc V$CDX2_Q5 | 5446 (-) | 0.823 | 0.820 | tgccATATAacctt V$CDX2_Q5 | 5481 (+) | 0.854 | 0.805 | attttTTGATcccc V$CDX2_Q5 | 5585 (-) | 0.854 | 0.803 | gataATAATaaaat V$CDX2_Q5 | 5588 (-) | 1.000 | 0.808 | aataATAAAatata V$CDX2_Q5 | 5737 (-) | 0.854 | 0.785 | agccATAATataga V$CDX2_Q5 | 5943 (-) | 0.823 | 0.756 | aaaaGTAAAatttt V$CDX2_Q5 | 5949 (+) | 1.000 | 0.857 | aaaatTTTATttgc 
V$CDX2_Q5 | 6465 (-) | 1.000 | 0.809 | cctgATAAAatgct V$CDX2_Q5 | 6666 (+) | 1.000 | 0.779 | aagatTTTATttag V$CDX2_Q5 | 6757 (+) | 1.000 | 0.806 | atcccTTTATacca V$CDX2_Q5 | 6823 (+) | 1.000 | 0.784 | cctttTTTATgtgc V$CDX2_Q5 | 7258 (-) | 0.896 | 0.812 | tctcATTAAaatat V$CDX2_Q5 | 7421 (+) | 1.000 | 0.777 | aaaggTTTATttgt V$CDX2_Q5 | 7733 (-) | 0.708 | 0.773 | ggccATCACaggct V$CDX2_Q5 | 8685 (-) | 0.791 | 0.798 | ggtaATTAGccttt V$CDX2_Q5 | 8870 (-) | 0.798 | 0.768 | tggcATAGAatatt V$CDX2_Q5 | 8987 (+) | 0.896 | 0.791 | cacctTTAATtcca V$CDX2_Q5 | 9492 (-) | 0.798 | 0.758 | tgcaATAGAaaatc V$CDX2_Q5 | 9774 (-) | 0.798 | 0.821 | tgccATAGAacaat V$CDX2_Q5 | 9837 (+) | 0.798 | 0.801 | ataatTTTAAagtc V$CDX2_Q5 | 9943 (+) | 0.854 | 0.771 | tgtatGTTATaaca V$CDX2_Q5 | 9947 (-) | 0.854 | 0.823 | tgttATAACaccct V$CDX2_Q5 | 10121 (-) | 1.000 | 0.847 | cccaATAAAattag V$CDX2_Q5 | 10140 (+) | 0.854 | 0.817 | aaattATTATtatt V$CDX2_Q5 | 10143 (+) | 0.854 | 0.760 | ttattATTATtgtt V$CDX2_Q5 | 10313 (+) | 1.000 | 0.773 | ctgttTTTATattt V$CDX2_Q5 | 10321 (+) | 1.000 | 0.865 | atattTTTATttta V$CDX2_Q5 | 10326 (+) | 1.000 | 0.808 | tttatTTTATagtt V$CDX2_Q5 | 10335 (+) | 1.000 | 0.802 | tagttTTTATtgtt V$CDX2_Q5 | 10343 (+) | 0.798 | 0.773 | attgtTTTAAttca V$CDX2_Q5 | 10415 (+) | 0.798 | 0.758 | gcattTTTAAcact V$CDX2_Q5 | 10448 (+) | 1.000 | 0.818 | tctctTTTATatca V$CDX2_Q5 | 10505 (+) | 0.854 | 0.791 | agtcaATTATggct V$CDX2_Q5 | 10729 (+) | 1.000 | 0.764 | tagctTTTATtggg V$CDX2_Q5 | 10914 (-) | 0.854 | 0.766 | gcaaATAATagtct V$CDX2_Q5 | 11203 (-) | 0.896 | 0.785 | gattATTAAtttgt V$CDX2_Q5 | 11274 (-) | 0.896 | 0.761 | ggaaATTAAggcat V$CDX2_Q5 | 11295 (-) | 0.854 | 0.772 | agaaATAATaataa V$CDX2_Q5 | 11388 (-) | 0.896 | 0.808 | acttATTAAatttt V$CDX2_Q5 | 11467 (+) | 1.000 | 0.807 | aatttTTTATttgt V$CDX2_Q5 | 11565 (+) | 0.896 | 0.824 | aaactCTTATcatt V$CDX2_Q5 | 11665 (+) | 0.854 | 0.856 | caattTTCATaacc V$CDX2_Q5 | 11686 (-) | 1.000 | 0.751 | accaATAAAgttaa V$CDX2_Q5 | 11691 (+) | 0.896 | 
0.757 | taaagTTAATcact F$HAC1_Q2 | 2974 (-) | 0.898 | 0.900 | gATGCGcaggca V$OSF2_Q6 | 936 (+) | 1.000 | 0.967 | aCCACAag V$OSF2_Q6 | 3563 (-) | 1.000 | 0.918 | tgTGTGGc V$OSF2_Q6 | 3641 (+) | 1.000 | 0.960 | aCCACAtg V$OSF2_Q6 | 4143 (-) | 1.000 | 0.971 | ggTGTGGt V$OSF2_Q6 | 4411 (+) | 1.000 | 0.907 | gCCACAac V$OSF2_Q6 | 4441 (+) | 1.000 | 0.991 | aCCACAca V$OSF2_Q6 | 4570 (-) | 0.915 | 0.907 | gtTGCGGt V$OSF2_Q6 | 4682 (-) | 1.000 | 0.921 | tcTGTGGc V$OSF2_Q6 | 5047 (-) | 1.000 | 0.973 | gcTGTGGt V$OSF2_Q6 | 5201 (-) | 1.000 | 0.921 | tcTGTGGc V$OSF2_Q6 | 5254 (-) | 1.000 | 0.960 | caTGTGGt V$OSF2_Q6 | 6506 (+) | 1.000 | 0.927 | gCCACAaa V$OSF2_Q6 | 6715 (+) | 1.000 | 0.907 | gCCACAac V$OSF2_Q6 | 7112 (+) | 1.000 | 0.960 | aCCACAtg V$OSF2_Q6 | 8740 (-) | 1.000 | 0.971 | ggTGTGGt V$OSF2_Q6 | 8898 (+) | 1.000 | 1.000 | aCCACAaa V$OSF2_Q6 | 8973 (-) | 1.000 | 0.971 | ggTGTGGt V$OSF2_Q6 | 9181 (+) | 1.000 | 0.921 | gCCACAga V$OSF2_Q6 | 9398 (-) | 1.000 | 0.953 | aaTGTGGt V$OSF2_Q6 | 9573 (-) | 1.000 | 0.971 | ggTGTGGt V$OSF2_Q6 | 10485 (+) | 1.000 | 0.921 | gCCACAta V$OSF2_Q6 | 10985 (-) | 1.000 | 0.918 | tgTGTGGc V$OSF2_Q6 | 11254 (-) | 1.000 | 0.993 | tcTGTGGt F$CAT8_Q6 | 407 (-) | 1.000 | 0.981 | ggcggAATGG F$CAT8_Q6 | 803 (+) | 1.000 | 0.976 | CCATTtctcc F$CAT8_Q6 | 3162 (+) | 1.000 | 0.981 | CCATTccgcc F$CAT8_Q6 | 4111 (+) | 1.000 | 0.959 | CCATTttccc F$CAT8_Q6 | 5014 (+) | 1.000 | 0.974 | CCATTaagcc V$SMAD4_Q6 | 116 (-) | 0.876 | 0.803 | gtgAGGCTggctaac V$SMAD4_Q6 | 120 (-) | 1.000 | 0.843 | ggcTGGCTaactgct V$SMAD4_Q6 | 138 (+) | 0.990 | 0.800 | tcaatccAGACAttt V$SMAD4_Q6 | 448 (-) | 0.990 | 0.795 | ggtTGTCTgttttgg V$SMAD4_Q6 | 679 (+) | 0.990 | 0.804 | gctcttcAGACAcac V$SMAD4_Q6 | 724 (+) | 1.000 | 0.797 | ggttgtgAGCCAcca V$SMAD4_Q6 | 890 (+) | 0.866 | 0.800 | atgggacAGACTttt V$SMAD4_Q6 | 1348 (-) | 0.949 | 0.801 | gctTGCCTgggtcac V$SMAD4_Q6 | 1581 (+) | 1.000 | 0.795 | catggagAGCCAgca V$SMAD4_Q6 | 1822 (+) | 0.990 | 0.826 | tcagggaAGACAccc V$SMAD4_Q6 | 1852 
(+) | 1.000 | 0.812 | acaatagAGCCAact V$SMAD4_Q6 | 1999 (-) | 1.000 | 0.826 | atcTGGCTatacatc V$SMAD4_Q6 | 2078 (-) | 0.990 | 0.839 | ctcTGTCTgtctctc V$SMAD4_Q6 | 2082 (-) | 0.990 | 0.832 | gtcTGTCTctctgtc V$SMAD4_Q6 | 2090 (-) | 0.990 | 0.793 | ctcTGTCTctctgtc V$SMAD4_Q6 | 2098 (-) | 0.990 | 0.793 | ctcTGTCTctctgtc V$SMAD4_Q6 | 2106 (-) | 0.990 | 0.799 | ctcTGTCTctctctc V$SMAD4_Q6 | 2829 (-) | 0.990 | 0.833 | agtTGTCTgcaaatg V$SMAD4_Q6 | 2974 (+) | 0.949 | 0.868 | gatgcgcAGGCAtct V$SMAD4_Q6 | 3229 (+) | 1.000 | 0.846 | accagtcAGCCAaca V$SMAD4_Q6 | 3353 (+) | 0.825 | 0.799 | atcagacAGACCact V$SMAD4_Q6 | 3946 (+) | 1.000 | 0.869 | ttcattcAGCCAtcc V$SMAD4_Q6 | 4344 (+) | 0.949 | 0.795 | gtggtggAGGCAgga V$SMAD4_Q6 | 4403 (+) | 1.000 | 0.807 | acaggtcAGCCAcaa V$SMAD4_Q6 | 4465 (-) | 0.959 | 0.817 | cacTGACTgaccagc V$SMAD4_Q6 | 4985 (-) | 0.868 | 0.799 | accTGTCCgaatggc V$SMAD4_Q6 | 5132 (+) | 0.837 | 0.817 | atggtgcGGTCAacc V$SMAD4_Q6 | 5293 (+) | 1.000 | 0.869 | atgggctAGCCAtcc V$SMAD4_Q6 | 5672 (+) | 0.990 | 0.791 | aagagacAGACAcag V$SMAD4_Q6 | 6149 (-) | 1.000 | 0.801 | ctcTGGCTctcctgc V$SMAD4_Q6 | 6234 (-) | 0.990 | 0.825 | tgcTGTCTggctacg V$SMAD4_Q6 | 6623 (+) | 0.827 | 0.809 | gtgagtgGGGCAccc V$SMAD4_Q6 | 6792 (-) | 1.000 | 0.806 | tgcTGGCTcttcctc V$SMAD4_Q6 | 7540 (+) | 1.000 | 0.855 | ggttgtgAGCCAact V$SMAD4_Q6 | 7604 (+) | 1.000 | 0.801 | actgctgAGCCAtct V$SMAD4_Q6 | 7743 (-) | 1.000 | 0.809 | ggcTGGCTacattgt V$SMAD4_Q6 | 8226 (-) | 1.000 | 0.842 | tgtTGGCTcccacac V$SMAD4_Q6 | 8362 (+) | 0.876 | 0.825 | cctgtgcAGCCTgct V$SMAD4_Q6 | 8397 (+) | 1.000 | 0.815 | gaagccaAGCCAgcc V$SMAD4_Q6 | 8714 (-) | 0.866 | 0.803 | aggAGTCTgaagaac V$SMAD4_Q6 | 9012 (+) | 0.949 | 0.878 | caggggcAGGCAgat V$SMAD4_Q6 | 9038 (+) | 0.876 | 0.795 | caagggcAGCCTggt V$SMAD4_Q6 | 9270 (+) | 1.000 | 0.806 | cagcgacAGCCAcca V$SMAD4_Q6 | 9285 (+) | 0.959 | 0.793 | agctgagAGTCAgct V$SMAD4_Q6 | 9382 (-) | 0.851 | 0.860 | agcTGTCAgcccctg V$SMAD4_Q6 | 9848 (+) | 0.949 | 0.885 | gtcaggcAGGCAgca 
V$SMAD4_Q6 | 10078 (+) | 1.000 | 0.857 | gaagggcAGCCAgtg V$SMAD4_Q6 | 10242 (+) | 0.825 | 0.794 | ggtggtcAGGCTtgt V$SMAD4_Q6 | 10477 (+) | 1.000 | 0.806 | ttagaccAGCCAcat V$SMAD4_Q6 | 10637 (+) | 0.990 | 0.859 | ctgaggaAGACAccc V$SMAD4_Q6 | 11179 (-) | 1.000 | 0.797 | tttTGGCTgccttgt V$CIZ_01 | 3808 (+) | 1.000 | 0.998 | gAAAAAaca V$CIZ_01 | 5402 (-) | 1.000 | 0.999 | tatTTTTTc V$CIZ_01 | 7251 (+) | 1.000 | 0.999 | gAAAAAgtc P$ZAP1_01 | 5135 (-) | 1.000 | 0.937 | gtgcgGTCAA P$ZAP1_01 | 6433 (-) | 1.000 | 0.946 | catcgGTCAA V$E2F1DP1_01 | 2472 (-) | 0.982 | 0.947 | gCGCGAag V$E2F1DP1_01 | 9122 (-) | 1.000 | 0.931 | cCGGGAaa V$E2F1DP1RB_01 | 2472 (-) | 1.000 | 0.917 | gCGCGAag V$CETS168_Q6 | 610 (-) | 0.933 | 0.923 | aCATCCtg V$CETS168_Q6 | 769 (+) | 1.000 | 0.914 | ctGGAAGa V$CETS168_Q6 | 925 (-) | 1.000 | 0.851 | aCTTCCac V$CETS168_Q6 | 1459 (+) | 0.900 | 0.890 | ccGGACGt V$CETS168_Q6 | 1730 (-) | 0.933 | 0.888 | tCATCCtg V$CETS168_Q6 | 1739 (-) | 1.000 | 0.862 | aCTTCCta V$CETS168_Q6 | 1777 (-) | 0.900 | 0.861 | tCCTCCtg V$CETS168_Q6 | 2147 (-) | 1.000 | 0.827 | tCTTCCtt V$CETS168_Q6 | 2151 (-) | 1.000 | 0.827 | cCTTCCtt V$CETS168_Q6 | 2155 (-) | 1.000 | 0.843 | cCTTCCtc V$CETS168_Q6 | 2161 (-) | 1.000 | 0.827 | tCTTCCtt V$CETS168_Q6 | 2205 (-) | 1.000 | 0.827 | tCTTCCtt V$CETS168_Q6 | 2686 (-) | 0.933 | 0.947 | gCATCCtg V$CETS168_Q6 | 2707 (-) | 1.000 | 0.843 | cCTTCCtc V$CETS168_Q6 | 2785 (-) | 1.000 | 0.827 | tCTTCCtt V$CETS168_Q6 | 2989 (-) | 1.000 | 0.872 | aCTTCCgc V$CETS168_Q6 | 3033 (-) | 1.000 | 0.872 | aCTTCCgc V$CETS168_Q6 | 3342 (-) | 0.900 | 0.861 | cCCTCCtg V$CETS168_Q6 | 3518 (+) | 0.900 | 0.822 | gaGGAGGc V$CETS168_Q6 | 3778 (-) | 1.000 | 0.823 | cCTTCCcc V$CETS168_Q6 | 4176 (+) | 0.900 | 0.920 | caGGAGGc V$CETS168_Q6 | 4265 (+) | 0.933 | 0.832 | taGGATGc V$CETS168_Q6 | 4336 (-) | 0.933 | 0.896 | aCATCCag V$CETS168_Q6 | 4354 (+) | 0.900 | 0.861 | caGGAGGa V$CETS168_Q6 | 4592 (-) | 0.900 | 0.861 | tCCTCCtg V$CETS168_Q6 | 4603 (-) | 0.775 | 0.820 | 
gCTTGCtg V$CETS168_Q6 | 4608 (+) | 1.000 | 0.914 | ctGGAAGg V$CETS168_Q6 | 4689 (+) | 0.775 | 0.820 | caGGAAAc V$CETS168_Q6 | 4963 (+) | 1.000 | 0.976 | caGGAAGt V$CETS168_Q6 | 5031 (+) | 0.933 | 0.812 | tgGGATGc V$CETS168_Q6 | 5267 (-) | 0.933 | 0.888 | tCATCCtg V$CETS168_Q6 | 5347 (-) | 0.933 | 0.828 | gCATCCcc V$CETS168_Q6 | 5732 (+) | 1.000 | 0.973 | ctGGAAGc V$CETS168_Q6 | 6162 (-) | 0.900 | 0.822 | gCCTCCtc V$CETS168_Q6 | 6276 (+) | 1.000 | 0.885 | aaGGAAGc V$CETS168_Q6 | 6317 (-) | 1.000 | 0.842 | aCTTCCct V$CETS168_Q6 | 6479 (+) | 0.933 | 0.888 | caGGATGa V$CETS168_Q6 | 6799 (-) | 1.000 | 0.843 | tCTTCCtc V$CETS168_Q6 | 6859 (-) | 0.900 | 0.870 | aCGTCCag V$CETS168_Q6 | 6878 (+) | 1.000 | 0.885 | taGGAAGc V$CETS168_Q6 | 6966 (+) | 0.933 | 0.861 | ctGGATGg V$CETS168_Q6 | 7013 (+) | 0.933 | 0.818 | gcGGATGt V$CETS168_Q6 | 7019 (-) | 0.775 | 0.820 | gTTTCCtg V$CETS168_Q6 | 7231 (+) | 1.000 | 0.976 | caGGAAGt V$CETS168_Q6 | 8035 (-) | 0.775 | 0.820 | gCTTGCtg V$CETS168_Q6 | 8082 (-) | 1.000 | 0.862 | aCTTCCta V$CETS168_Q6 | 8135 (-) | 0.775 | 0.820 | gCTTTCtg V$CETS168_Q6 | 8159 (-) | 1.000 | 0.973 | gCTTCCag V$CETS168_Q6 | 8170 (+) | 0.775 | 0.820 | caGGTAGc V$CETS168_Q6 | 8394 (+) | 0.775 | 0.813 | ccTGAAGc V$CETS168_Q6 | 8448 (-) | 0.933 | 0.832 | gCATCCta V$CETS168_Q6 | 8679 (+) | 1.000 | 0.827 | taGGAAGg V$CETS168_Q6 | 8813 (-) | 0.775 | 0.813 | gCTTCTgg V$CETS168_Q6 | 9308 (+) | 0.900 | 0.861 | caGGAGGg V$CETS168_Q6 | 9335 (+) | 0.900 | 0.834 | ctGGAGGa V$CETS168_Q6 | 9436 (+) | 1.000 | 1.000 | caGGAAGc V$CETS168_Q6 | 9469 (+) | 0.933 | 0.923 | caGGATGt V$CETS168_Q6 | 9606 (+) | 1.000 | 0.865 | tgGGAAGc V$CETS168_Q6 | 9854 (+) | 0.775 | 0.820 | caGGCAGc V$CETS168_Q6 | 10013 (+) | 0.900 | 0.870 | ctGGAGGt V$CETS168_Q6 | 10075 (+) | 1.000 | 0.914 | ctGGAAGg V$CETS168_Q6 | 10206 (-) | 1.000 | 0.816 | cCTTCCac V$CETS168_Q6 | 10258 (+) | 1.000 | 0.976 | caGGAAGt V$CETS168_Q6 | 10639 (+) | 1.000 | 0.843 | gaGGAAGa V$CETS168_Q6 | 10668 (+) | 0.900 | 0.861 | 
caGGAGGa V$CETS168_Q6 | 10845 (+) | 1.000 | 0.941 | caGGAAGa V$CETS168_Q6 | 11913 (-) | 0.775 | 0.813 | gCTTCGgg V$POU1F1_Q6 | 643 (-) | 1.000 | 1.000 | atttATTCAt V$ELF1_Q6 | 2155 (-) | 1.000 | 0.963 | ccTTCCTcttcc V$ELF1_Q6 | 2707 (-) | 1.000 | 0.963 | ccTTCCTcttcc V$ELF1_Q6 | 4959 (+) | 1.000 | 0.958 | gcagcAGGAAgt V$ELF1_Q6 | 5473 (-) | 1.000 | 0.958 | atTTCCTgattt V$ELF1_Q6 | 5521 (-) | 1.000 | 0.945 | ctTTCCTcctat V$ELF1_Q6 | 9432 (+) | 1.000 | 0.949 | ataacAGGAAgc V$ELF1_Q6 | 9871 (+) | 1.000 | 0.980 | gaatgAGGAAat V$ELF1_Q6 | 10589 (-) | 1.000 | 0.948 | atTTCCTatatc V$ELF1_Q6 | 11268 (+) | 1.000 | 0.963 | agaaaAGGAAat V$IRF1_Q6 | 903 (+) | 1.000 | 1.000 | ttCACTT V$IRF1_Q6 | 1863 (-) | 0.980 | 0.986 | AACTGaa V$IRF1_Q6 | 5170 (+) | 1.000 | 1.000 | ttCACTT V$IRF1_Q6 | 7524 (-) | 0.980 | 0.986 | AACTGaa V$IRF1_Q6 | 9133 (+) | 0.980 | 0.986 | ttCAGTT V$IRF1_Q6 | 10059 (-) | 0.980 | 0.986 | AACTGaa V$SREBP1_Q6 | 198 (+) | 0.978 | 0.984 | CACACca V$SREBP1_Q6 | 689 (+) | 0.978 | 0.984 | CACACca V$SREBP1_Q6 | 5821 (-) | 0.989 | 0.992 | tgGCGTG V$SREBP1_Q6 | 10177 (-) | 0.978 | 0.905 | taGTGTG V$SREBP1_Q6 | 11258 (-) | 0.978 | 0.984 | tgGTGTG V$SREBP1_Q6 | 11707 (+) | 1.000 | 1.000 | CACCCca V$SREBP1_Q6 | 11824 (-) | 1.000 | 1.000 | tgGGGTG V$HMGIY_Q6 | 224 (-) | 1.000 | 0.931 | ttTTTCC V$HMGIY_Q6 | 294 (-) | 0.964 | 0.922 | gtTTTAC V$HMGIY_Q6 | 331 (+) | 1.000 | 1.000 | GGAAAtt V$HMGIY_Q6 | 899 (-) | 0.942 | 0.926 | acTTTTC V$HMGIY_Q6 | 994 (-) | 0.942 | 0.948 | aaTTTTC V$HMGIY_Q6 | 1194 (+) | 1.000 | 0.939 | GGAAAgc V$HMGIY_Q6 | 1551 (+) | 0.942 | 0.916 | GAAAAct V$HMGIY_Q6 | 1563 (+) | 1.000 | 0.960 | GGAAAtc V$HMGIY_Q6 | 1638 (-) | 1.000 | 0.923 | ccTTTCC V$HMGIY_Q6 | 1719 (-) | 1.000 | 0.939 | ctTTTCC V$HMGIY_Q6 | 1956 (-) | 1.000 | 0.979 | acTTTCC V$HMGIY_Q6 | 2165 (-) | 1.000 | 0.923 | ccTTTCC V$HMGIY_Q6 | 2173 (-) | 1.000 | 0.915 | tcTTTCC V$HMGIY_Q6 | 2368 (-) | 0.957 | 0.955 | atTTTGC V$HMGIY_Q6 | 2378 (-) | 0.942 | 0.916 | agTTTTC V$HMGIY_Q6 | 2379 (-) | 
1.000 | 0.955 | gtTTTCC V$HMGIY_Q6 | 2400 (-) | 1.000 | 0.939 | gcTTTCC V$HMGIY_Q6 | 2514 (+) | 1.000 | 0.931 | GGAAAaa V$HMGIY_Q6 | 2564 (-) | 1.000 | 1.000 | aaTTTCC V$HMGIY_Q6 | 3092 (-) | 0.942 | 0.942 | atTTTTC V$HMGIY_Q6 | 3110 (-) | 1.000 | 1.000 | aaTTTCC V$HMGIY_Q6 | 3130 (-) | 0.964 | 0.967 | aaTTTAC V$HMGIY_Q6 | 3324 (-) | 1.000 | 0.939 | ctTTTCC V$HMGIY_Q6 | 3337 (-) | 1.000 | 0.936 | taTTTCC V$HMGIY_Q6 | 3473 (-) | 0.964 | 0.927 | gaTTTAC V$HMGIY_Q6 | 3612 (-) | 0.942 | 0.916 | agTTTTC V$HMGIY_Q6 | 3658 (+) | 0.964 | 0.946 | GTAAAgt V$HMGIY_Q6 | 3836 (-) | 1.000 | 1.000 | aaTTTCC V$HMGIY_Q6 | 4050 (+) | 1.000 | 0.936 | GGAAAta V$HMGIY_Q6 | 4074 (+) | 0.964 | 0.962 | GTAAAat V$HMGIY_Q6 | 4090 (-) | 0.964 | 0.962 | atTTTAC V$HMGIY_Q6 | 4113 (-) | 1.000 | 0.995 | atTTTCC V$HMGIY_Q6 | 4480 (-) | 0.942 | 0.942 | atTTTTC V$HMGIY_Q6 | 4615 (+) | 0.964 | 0.922 | GTAAAac V$HMGIY_Q6 | 4638 (+) | 0.957 | 0.955 | GCAAAat V$HMGIY_Q6 | 4668 (-) | 0.957 | 0.915 | gtTTTGC V$HMGIY_Q6 | 4691 (+) | 1.000 | 0.968 | GGAAAct V$HMGIY_Q6 | 5152 (-) | 0.964 | 0.935 | agTTTAC V$HMGIY_Q6 | 5342 (-) | 0.957 | 0.915 | gtTTTGC V$HMGIY_Q6 | 5390 (-) | 0.964 | 0.962 | atTTTAC V$HMGIY_Q6 | 5422 (-) | 1.000 | 0.915 | tcTTTCC V$HMGIY_Q6 | 5457 (-) | 1.000 | 0.939 | ctTTTCC V$HMGIY_Q6 | 5472 (-) | 1.000 | 0.936 | taTTTCC V$HMGIY_Q6 | 5510 (-) | 1.000 | 0.928 | ggTTTCC V$HMGIY_Q6 | 5520 (-) | 1.000 | 0.923 | ccTTTCC V$HMGIY_Q6 | 5779 (+) | 0.942 | 0.942 | GAAAAat V$HMGIY_Q6 | 5862 (-) | 1.000 | 0.928 | ggTTTCC V$HMGIY_Q6 | 5942 (+) | 0.942 | 0.926 | GAAAAgt V$HMGIY_Q6 | 5947 (+) | 0.964 | 0.962 | GTAAAat V$HMGIY_Q6 | 6453 (+) | 0.957 | 0.929 | GCAAAct V$HMGIY_Q6 | 6460 (-) | 1.000 | 0.936 | taTTTCC V$HMGIY_Q6 | 6580 (+) | 0.957 | 0.915 | GCAAAac V$HMGIY_Q6 | 6972 (+) | 1.000 | 0.923 | GGAAAgg V$HMGIY_Q6 | 7002 (+) | 1.000 | 0.939 | GGAAAag V$HMGIY_Q6 | 7250 (+) | 1.000 | 0.931 | GGAAAaa V$HMGIY_Q6 | 7456 (-) | 0.957 | 0.915 | gtTTTGC V$HMGIY_Q6 | 7566 (+) | 1.000 | 0.928 | GGAAAcc 
V$HMGIY_Q6 | 8148 (+) | 1.000 | 0.955 | GGAAAac V$HMGIY_Q6 | 9109 (+) | 0.957 | 0.915 | GCAAAac V$HMGIY_Q6 | 9125 (+) | 1.000 | 0.955 | GGAAAac V$HMGIY_Q6 | 9416 (+) | 0.942 | 0.916 | GAAAAct V$HMGIY_Q6 | 9510 (-) | 1.000 | 0.968 | agTTTCC V$HMGIY_Q6 | 9714 (+) | 0.964 | 0.962 | GTAAAat V$HMGIY_Q6 | 9877 (+) | 1.000 | 0.960 | GGAAAtc V$HMGIY_Q6 | 9935 (-) | 0.957 | 0.961 | aaTTTGC V$HMGIY_Q6 | 10588 (-) | 1.000 | 1.000 | aaTTTCC V$HMGIY_Q6 | 11090 (-) | 0.942 | 0.948 | aaTTTTC V$HMGIY_Q6 | 11091 (-) | 1.000 | 0.995 | atTTTCC V$HMGIY_Q6 | 11274 (+) | 1.000 | 1.000 | GGAAAtt V$HMGIY_Q6 | 11384 (+) | 0.942 | 0.916 | GAAAAct V$HMGIY_Q6 | 11480 (-) | 1.000 | 0.931 | ttTTTCC V$HMGIY_Q6 | 11531 (+) | 0.957 | 0.921 | GCAAAtc V$HMGIY_Q6 | 11666 (-) | 0.942 | 0.948 | aaTTTTC V$AML1_Q6 | 936 (-) | 1.000 | 1.000 | aCCACA V$AML1_Q6 | 3641 (-) | 1.000 | 1.000 | aCCACA V$AML1_Q6 | 4145 (+) | 1.000 | 1.000 | TGTGGt V$AML1_Q6 | 4441 (-) | 1.000 | 1.000 | aCCACA V$AML1_Q6 | 5049 (+) | 1.000 | 1.000 | TGTGGt V$AML1_Q6 | 5256 (+) | 1.000 | 1.000 | TGTGGt V$AML1_Q6 | 7112 (-) | 1.000 | 1.000 | aCCACA V$AML1_Q6 | 8742 (+) | 1.000 | 1.000 | TGTGGt V$AML1_Q6 | 8898 (-) | 1.000 | 1.000 | aCCACA V$AML1_Q6 | 8975 (+) | 1.000 | 1.000 | TGTGGt V$AML1_Q6 | 9400 (+) | 1.000 | 1.000 | TGTGGt V$AML1_Q6 | 9575 (+) | 1.000 | 1.000 | TGTGGt V$AML1_Q6 | 11256 (+) | 1.000 | 1.000 | TGTGGt V$P53_DECAMER_Q2 | 419 (-) | 1.000 | 0.936 | gagCTTGTcc V$P53_DECAMER_Q2 | 1346 (-) | 0.977 | 0.973 | aggCTTGCct V$P53_DECAMER_Q2 | 3215 (+) | 0.977 | 0.929 | agGCAAGcac V$P53_DECAMER_Q2 | 4206 (-) | 0.977 | 0.973 | aggCTTGCct V$P53_DECAMER_Q2 | 4401 (+) | 0.930 | 0.930 | agACAGGtca V$P53_DECAMER_Q2 | 4416 (-) | 1.000 | 0.941 | aacCTTGTct V$P53_DECAMER_Q2 | 6121 (-) | 1.000 | 0.916 | gggCTTGTtt V$P53_DECAMER_Q2 | 6894 (+) | 1.000 | 0.937 | agACAAGagg V$P53_DECAMER_Q2 | 8754 (-) | 1.000 | 0.911 | cacCTTGTcc V$P53_DECAMER_Q2 | 9223 (+) | 0.977 | 0.921 | agGCAAGtta V$P53_DECAMER_Q2 | 9539 (+) | 1.000 | 0.917 | aaACAAGtcc 
V$P53_DECAMER_Q2 | 9662 (-) | 1.000 | 0.950 | gacCTTGTct V$DR1_Q3 | 947 (+) | 0.855 | 0.818 | ggaaCAATGatca V$DR1_Q3 | 4896 (-) | 0.906 | 0.850 | tgttCCTTGgcca V$DR1_Q3 | 5453 (-) | 0.849 | 0.812 | taacCTTTTcccc V$DR1_Q3 | 6392 (-) | 0.904 | 0.836 | tggcCTGTGttca V$DR1_Q3 | 7120 (+) | 0.855 | 0.812 | tgtgCAATGctca V$DR1_Q3 | 8376 (-) | 1.000 | 0.838 | tgctCTTTGatct V$DR1_Q3 | 8556 (-) | 0.759 | 0.830 | tgaaCAATGtccg V$DR1_Q3 | 8582 (+) | 1.000 | 0.840 | ggggCAAAGgcaa V$DR1_Q3 | 10799 (-) | 0.906 | 0.841 | tgagCCTTGtgca V$PPAR_DR1_Q2 | 3567 (+) | 0.854 | 0.839 | tGGCCTttggcaa V$PPAR_DR1_Q2 | 5453 (+) | 0.824 | 0.848 | tAACCTtttcccc V$PPAR_DR1_Q2 | 8204 (+) | 0.739 | 0.836 | tCCCCTctgacct V$HNF4_DR1_Q3 | 857 (+) | 0.846 | 0.805 | cgatCGTTGccct V$HNF4_DR1_Q3 | 947 (-) | 0.846 | 0.836 | ggaaCAATGatca V$HNF4_DR1_Q3 | 3567 (+) | 1.000 | 0.876 | tggcCTTTGgcaa V$HNF4_DR1_Q3 | 4724 (+) | 0.777 | 0.814 | tgtgATTTGgccc V$HNF4_DR1_Q3 | 4896 (+) | 0.893 | 0.872 | tgttCCTTGgcca V$HNF4_DR1_Q3 | 6392 (+) | 0.830 | 0.805 | tggcCTGTGttca V$HNF4_DR1_Q3 | 7120 (-) | 0.846 | 0.825 | tgtgCAATGctca V$HNF4_DR1_Q3 | 8041 (+) | 0.893 | 0.830 | tgggCCTTGactc V$HNF4_DR1_Q3 | 8376 (+) | 1.000 | 0.866 | tgctCTTTGatct V$HNF4_DR1_Q3 | 8556 (+) | 0.692 | 0.817 | tgaaCAATGtccg V$HNF4_DR1_Q3 | 8582 (-) | 1.000 | 0.832 | ggggCAAAGgcaa V$HNF4_DR1_Q3 | 10497 (-) | 0.757 | 0.805 | ggatCAAAAgtca V$HNF4_DR1_Q3 | 10799 (+) | 0.893 | 0.864 | tgagCCTTGtgca V$HNF4_DR1_Q3 | 11281 (-) | 0.846 | 0.803 | aaggCATAGggca V$HNF4_DR1_Q3 | 11790 (-) | 0.777 | 0.824 | aggaCAAATccca V$HNF4_DR1_Q3 | 11919 (-) | 0.846 | 0.808 | ggccCAACGccca V$COUP_DR1_Q6 | 3567 (+) | 0.759 | 0.824 | tGGCCTttggcaa V$COUP_DR1_Q6 | 11944 (-) | 0.917 | 0.835 | atttcaaGGGTCa V$AML_Q6 | 932 (-) | 1.000 | 0.968 | cgaaaCCACAagtta V$AML_Q6 | 3637 (-) | 1.000 | 0.920 | acctaCCACAtggtg V$AML_Q6 | 4140 (+) | 1.000 | 0.919 | ctaggTGTGGtgcct V$AML_Q6 | 4437 (-) | 1.000 | 0.974 | aaaaaCCACAcaatt V$AML_Q6 | 4567 (+) | 0.901 | 0.902 | acagtTGCGGttaga 
V$AML_Q6 | 5044 (+) | 1.000 | 0.963 | attgcTGTGGtctaa V$AML_Q6 | 5131 (+) | 0.901 | 0.883 | gatggTGCGGtcaac V$AML_Q6 | 5251 (+) | 1.000 | 0.952 | tcacaTGTGGtcaca V$AML_Q6 | 6711 (-) | 1.000 | 0.885 | aaaagCCACAacctt V$AML_Q6 | 7108 (-) | 1.000 | 0.928 | gtgtaCCACAtgtgt V$AML_Q6 | 8737 (+) | 1.000 | 0.940 | cggggTGTGGtggcc V$AML_Q6 | 8894 (-) | 1.000 | 0.934 | acgtaCCACAaacag V$AML_Q6 | 8970 (+) | 1.000 | 0.922 | ccaggTGTGGtggta V$AML_Q6 | 9395 (+) | 1.000 | 0.933 | tggaaTGTGGttcta V$AML_Q6 | 9570 (+) | 1.000 | 0.930 | ctgggTGTGGtggta V$AML_Q6 | 10950 (+) | 0.901 | 0.883 | gtagcTGCGGtctgt V$AML_Q6 | 11251 (+) | 1.000 | 0.972 | aactcTGTGGtgtgc V$CEBP_Q3 | 150 (-) | 0.991 | 0.959 | tTTGCTcatctc V$CEBP_Q3 | 165 (-) | 0.989 | 0.961 | gTTCCCaatgct V$CEBP_Q3 | 324 (+) | 0.987 | 0.959 | ctgtgcTGGAAa V$CEBP_Q3 | 335 (-) | 0.989 | 0.966 | aTTCCCtcagac V$CEBP_Q3 | 396 (-) | 1.000 | 0.961 | tTTGCCaacaag V$CEBP_Q3 | 491 (-) | 0.997 | 0.980 | aTTGCAacatta V$CEBP_Q3 | 805 (-) | 0.982 | 0.958 | aTTTCTccagcc V$CEBP_Q3 | 1100 (+) | 0.982 | 0.976 | aatttgAGAAAt V$CEBP_Q3 | 1135 (-) | 0.980 | 0.979 | gTTCCTaaaacc V$CEBP_Q3 | 1465 (-) | 0.982 | 0.976 | gTTTCTcaatat V$CEBP_Q3 | 1556 (+) | 0.987 | 0.976 | ctcttcTGGAAa V$CEBP_Q3 | 1557 (+) | 0.991 | 0.959 | tcttctGGAAAt V$CEBP_Q3 | 1655 (+) | 1.000 | 0.967 | ctacttGGCAAa V$CEBP_Q3 | 2370 (-) | 1.000 | 0.981 | tTTGCCacagtt V$CEBP_Q3 | 2381 (-) | 0.989 | 0.959 | tTTCCCagagct V$CEBP_Q3 | 2402 (-) | 0.980 | 0.974 | tTTCCTtaaaaa V$CEBP_Q3 | 2523 (-) | 0.990 | 0.959 | aTTACAtattac V$CEBP_Q3 | 2555 (+) | 0.988 | 0.981 | ctgttgTGAAAt V$CEBP_Q3 | 2564 (+) | 0.980 | 0.970 | aatttcCGTAAg V$CEBP_Q3 | 2566 (-) | 0.976 | 0.972 | tTTCCGtaagta V$CEBP_Q3 | 2578 (-) | 0.990 | 0.971 | tTTACAacaaat V$CEBP_Q3 | 2623 (+) | 1.000 | 0.962 | atgctaGGCAAa V$CEBP_Q3 | 2714 (-) | 0.989 | 0.988 | cTTCCCaaatag V$CEBP_Q3 | 2805 (-) | 0.984 | 0.974 | cTTACTgaacac V$CEBP_Q3 | 3137 (-) | 0.989 | 0.970 | aTTCCCacaaca V$CEBP_Q3 | 3210 (+) | 1.000 | 0.962 | atgctaGGCAAg 
V$CEBP_Q3 | 3325 (-) | 0.991 | 0.958 | tTTTCCtgattc V$CEBP_Q3 | 3426 (-) | 0.991 | 0.986 | gTTGCTtaattg V$CEBP_Q3 | 3437 (+) | 0.993 | 0.988 | gacttaGGTAAc V$CEBP_Q3 | 3569 (+) | 1.000 | 0.996 | gcctttGGCAAg V$CEBP_Q3 | 3614 (-) | 0.988 | 0.960 | tTTTCAaatgcg V$CEBP_Q3 | 3644 (+) | 0.987 | 0.962 | acatggTGGAAt V$CEBP_Q3 | 3679 (-) | 1.000 | 0.967 | aTTGCCaagtgc V$CEBP_Q3 | 3741 (-) | 0.982 | 0.965 | aTTTCTacattt V$CEBP_Q3 | 4038 (-) | 0.987 | 0.957 | aTTCCAgcaggg V$CEBP_Q3 | 4051 (+) | 1.000 | 0.969 | gaaataGGCAAt V$CEBP_Q3 | 4067 (+) | 0.990 | 0.960 | aaaataTGTAAa V$CEBP_Q3 | 4115 (-) | 0.989 | 0.959 | tTTCCCagatgt V$CEBP_Q3 | 4155 (+) | 0.990 | 0.961 | cccattTGTAAt V$CEBP_Q3 | 4209 (-) | 1.000 | 0.976 | cTTGCCtcagtt V$CEBP_Q3 | 4272 (-) | 0.993 | 0.986 | cTTACCtaagca V$CEBP_Q3 | 4708 (+) | 0.993 | 0.987 | gacttgGGTAAt V$CEBP_Q3 | 4797 (-) | 0.997 | 0.991 | cTTGCAtaatct V$CEBP_Q3 | 5009 (-) | 0.993 | 0.960 | aTTACCcattaa V$CEBP_Q3 | 5152 (+) | 0.977 | 0.972 | agtttaCGAAAa V$CEBP_Q3 | 5154 (-) | 0.980 | 0.978 | tTTACGaaaagg V$CEBP_Q3 | 5444 (-) | 1.000 | 0.969 | aTTGCCatataa V$CEBP_Q3 | 5458 (-) | 0.991 | 0.967 | tTTTCCccacat V$CEBP_Q3 | 5473 (-) | 0.991 | 0.958 | aTTTCCtgattt V$CEBP_Q3 | 5781 (+) | 0.988 | 0.957 | aaaataTGAAAt V$CEBP_Q3 | 5917 (-) | 0.991 | 0.965 | gTTTCCccagcg V$CEBP_Q3 | 6179 (-) | 0.989 | 0.959 | gTTCCCagagac V$CEBP_Q3 | 6494 (-) | 0.997 | 0.959 | aTTGCAgatgcg V$CEBP_Q3 | 6547 (+) | 0.993 | 0.969 | ggctgaGGTAAg V$CEBP_Q3 | 6566 (-) | 1.000 | 0.981 | gTTGCCacagtt V$CEBP_Q3 | 6573 (+) | 0.997 | 0.992 | cagttaTGCAAa V$CEBP_Q3 | 6610 (-) | 0.991 | 0.991 | tTTGCTaaactc V$CEBP_Q3 | 7056 (-) | 0.982 | 0.978 | tTTTCTtaattt V$CEBP_Q3 | 7172 (-) | 0.997 | 0.958 | gTTGCAgatggt V$CEBP_Q3 | 7345 (-) | 0.988 | 0.969 | aTTTCAacatgt V$CEBP_Q3 | 7404 (-) | 0.984 | 0.974 | aTTACTgaagaa V$CEBP_Q3 | 7559 (+) | 0.987 | 0.960 | agatgcTGGAAa V$CEBP_Q3 | 7840 (+) | 1.000 | 0.968 | acttctGGCAAa V$CEBP_Q3 | 8006 (+) | 0.990 | 0.965 | tgttggTGTAAc V$CEBP_Q3 | 8073 (-) | 
0.997 | 0.986 | aTTGCAgaaact V$CEBP_Q3 | 8141 (+) | 0.989 | 0.966 | tgctgaGGGAAa V$CEBP_Q3 | 8352 (-) | 1.000 | 0.962 | cTTGCCaacccc V$CEBP_Q3 | 8434 (+) | 0.997 | 0.965 | gagcttTGCAAc V$CEBP_Q3 | 8468 (+) | 1.000 | 0.964 | atctaaGGCAAa V$CEBP_Q3 | 8493 (-) | 0.982 | 0.962 | tTTTCTacaggg V$CEBP_Q3 | 8578 (+) | 1.000 | 0.977 | aaatggGGCAAa V$CEBP_Q3 | 8623 (+) | 0.982 | 0.974 | actttgAGAAAa V$CEBP_Q3 | 8652 (-) | 0.989 | 0.961 | aTTCCCaatggt V$CEBP_Q3 | 8694 (+) | 0.988 | 0.976 | cctttcTGAAAa V$CEBP_Q3 | 8695 (-) | 0.982 | 0.971 | cTTTCTgaaaat V$CEBP_Q3 | 8907 (+) | 0.980 | 0.972 | agattcAGGAAt V$CEBP_Q3 | 8995 (-) | 0.987 | 0.959 | aTTCCAgcacat V$CEBP_Q3 | 9100 (-) | 0.990 | 0.988 | cTTACAaaagca V$CEBP_Q3 | 9450 (-) | 0.993 | 0.967 | tTTACCaatctc V$CEBP_Q3 | 9486 (+) | 0.997 | 0.968 | tttattTGCAAt V$CEBP_Q3 | 9490 (-) | 0.997 | 0.965 | tTTGCAatagaa V$CEBP_Q3 | 9511 (-) | 0.991 | 0.966 | gTTTCCccatgt V$CEBP_Q3 | 9512 (-) | 0.989 | 0.957 | tTTCCCcatgtc V$CEBP_Q3 | 9601 (+) | 0.989 | 0.988 | gcatttGGGAAg V$CEBP_Q3 | 9707 (+) | 0.990 | 0.967 | tggtggTGTAAa V$CEBP_Q3 | 9756 (+) | 0.988 | 0.959 | taatatTGAAAa V$CEBP_Q3 | 9789 (+) | 0.984 | 0.975 | ttgttcAGTAAa V$CEBP_Q3 | 9871 (+) | 0.991 | 0.969 | gaatgaGGAAAt V$CEBP_Q3 | 9922 (-) | 1.000 | 0.966 | tTTGCCtatggg V$CEBP_Q3 | 9926 (+) | 0.993 | 0.958 | cctatgGGTAAt V$CEBP_Q3 | 10070 (+) | 0.987 | 0.976 | gtcttcTGGAAg V$CEBP_Q3 | 10132 (+) | 0.982 | 0.978 | tagttaAGAAAt V$CEBP_Q3 | 10371 (-) | 0.988 | 0.986 | cTTTCAaaatgt V$CEBP_Q3 | 10377 (+) | 0.997 | 0.980 | aaatgtTGCAAt V$CEBP_Q3 | 10589 (-) | 0.991 | 0.959 | aTTTCCtatatc V$CEBP_Q3 | 10622 (+) | 0.982 | 0.962 | agttgtAGAAAg V$CEBP_Q3 | 10686 (+) | 1.000 | 0.969 | gactctGGCAAc V$CEBP_Q3 | 10703 (+) | 0.988 | 0.981 | tgtttaTGAAAc V$CEBP_Q3 | 10907 (+) | 0.997 | 0.960 | ctactaTGCAAa V$CEBP_Q3 | 10968 (+) | 0.991 | 0.965 | tgttggAGCAAg V$CEBP_Q3 | 11081 (+) | 0.989 | 0.959 | tgctctGGGAAt V$CEBP_Q3 | 11150 (+) | 0.988 | 0.964 | ctctgaTGAAAa V$CEBP_Q3 | 11284 (+) | 1.000 | 
0.963 | gcatagGGCAAa V$CEBP_Q3 | 11576 (-) | 0.991 | 0.959 | aTTGCTtatatc V$CEBP_Q3 | 11668 (-) | 0.988 | 0.983 | tTTTCAtaacca V$CEBP_Q3 | 11728 (-) | 0.982 | 0.975 | tTTTCTcaaaca V$CEBP_Q3 | 11780 (+) | 0.990 | 0.965 | acttgaTGTAAg V$ETS_Q4 | 606 (+) | 0.922 | 0.908 | ttaaacATCCTg V$ETS_Q4 | 1192 (-) | 1.000 | 0.911 | cAGGAAagctgg V$ETS_Q4 | 1726 (+) | 0.922 | 0.898 | gggctcATCCTg V$ETS_Q4 | 1735 (+) | 1.000 | 0.935 | ctgaacTTCCTa V$ETS_Q4 | 2143 (+) | 1.000 | 0.934 | tctctcTTCCTt V$ETS_Q4 | 2147 (+) | 1.000 | 0.924 | tcttccTTCCTt V$ETS_Q4 | 2151 (+) | 1.000 | 0.938 | ccttccTTCCTc V$ETS_Q4 | 2157 (+) | 1.000 | 0.937 | ttcctcTTCCTt V$ETS_Q4 | 2201 (+) | 1.000 | 0.925 | tctttcTTCCTt V$ETS_Q4 | 2682 (+) | 0.922 | 0.902 | cgttgcATCCTg V$ETS_Q4 | 2703 (+) | 1.000 | 0.941 | ttctccTTCCTc V$ETS_Q4 | 2781 (+) | 1.000 | 0.931 | agattcTTCCTt V$ETS_Q4 | 2985 (+) | 0.914 | 0.896 | atctacTTCCGc V$ETS_Q4 | 3321 (+) | 1.000 | 0.924 | ggccttTTCCTg V$ETS_Q4 | 4689 (-) | 1.000 | 0.919 | cAGGAAacttga V$ETS_Q4 | 4963 (-) | 1.000 | 0.977 | cAGGAAgtcagg V$ETS_Q4 | 5122 (-) | 1.000 | 0.899 | aAGGAAacagat V$ETS_Q4 | 5263 (+) | 0.922 | 0.900 | acagtcATCCTg V$ETS_Q4 | 5469 (+) | 1.000 | 0.938 | tactatTTCCTg V$ETS_Q4 | 5517 (+) | 1.000 | 0.902 | ctgcctTTCCTc V$ETS_Q4 | 6085 (+) | 1.000 | 0.919 | tgtgagTTCCTg V$ETS_Q4 | 6276 (-) | 1.000 | 0.936 | aAGGAAgcctct V$ETS_Q4 | 6457 (+) | 1.000 | 0.944 | acttatTTCCTg V$ETS_Q4 | 6479 (-) | 0.922 | 0.902 | cAGGATgaggac V$ETS_Q4 | 6795 (+) | 1.000 | 0.948 | tggctcTTCCTc V$ETS_Q4 | 6878 (-) | 1.000 | 0.927 | tAGGAAgcacca V$ETS_Q4 | 6919 (-) | 1.000 | 0.902 | aAGGAAttgatt V$ETS_Q4 | 7015 (+) | 1.000 | 0.916 | ggatgtTTCCTg V$ETS_Q4 | 7231 (-) | 1.000 | 0.985 | cAGGAAgttcat V$ETS_Q4 | 7248 (-) | 1.000 | 0.915 | cAGGAAaaagtc V$ETS_Q4 | 8078 (+) | 1.000 | 0.943 | agaaacTTCCTa V$ETS_Q4 | 8679 (-) | 1.000 | 0.926 | tAGGAAggtaat V$ETS_Q4 | 8912 (-) | 1.000 | 0.898 | cAGGAAtactcg V$ETS_Q4 | 9436 (-) | 1.000 | 0.966 | cAGGAAgcccaa V$ETS_Q4 | 9469 (-) | 0.922 | 
0.910 | cAGGATgttcta V$ETS_Q4 | 9875 (-) | 1.000 | 0.921 | gAGGAAatcaat V$ETS_Q4 | 10258 (-) | 1.000 | 0.998 | cAGGAAgtgatt V$ETS_Q4 | 10585 (+) | 1.000 | 0.895 | ttcaatTTCCTa V$ETS_Q4 | 10639 (-) | 1.000 | 0.933 | gAGGAAgacacc V$ETS_Q4 | 10845 (-) | 1.000 | 0.959 | cAGGAAgaagcc V$ETS_Q4 | 11272 (-) | 1.000 | 0.896 | aAGGAAattaag V$MYB_Q3 | 127 (-) | 1.000 | 0.962 | tAACTGctcag V$MYB_Q3 | 158 (+) | 0.980 | 0.945 | tctctCCGTTc V$MYB_Q3 | 449 (+) | 0.962 | 0.956 | gttgtCTGTTt V$MYB_Q3 | 591 (-) | 0.962 | 0.959 | tAACAGactta V$MYB_Q3 | 981 (+) | 1.000 | 0.964 | catcaCAGTTa V$MYB_Q3 | 1024 (+) | 1.000 | 0.997 | attggCAGTTg V$MYB_Q3 | 1862 (-) | 1.000 | 0.970 | cAACTGaagaa V$MYB_Q3 | 2372 (+) | 1.000 | 0.963 | tgccaCAGTTt V$MYB_Q3 | 2823 (+) | 1.000 | 0.970 | ctgtcCAGTTg V$MYB_Q3 | 3397 (+) | 1.000 | 0.970 | atctcCAGTTa V$MYB_Q3 | 3443 (+) | 1.000 | 0.965 | ggtaaCAGTTa V$MYB_Q3 | 3606 (+) | 1.000 | 0.966 | ttctgCAGTTt V$MYB_Q3 | 4211 (+) | 1.000 | 0.967 | tgcctCAGTTa V$MYB_Q3 | 4321 (-) | 1.000 | 0.965 | aAACTGggtct V$MYB_Q3 | 4539 (-) | 0.962 | 0.964 | cAACAGcctct V$MYB_Q3 | 4563 (+) | 1.000 | 0.968 | ggacaCAGTTg V$MYB_Q3 | 4944 (-) | 1.000 | 0.966 | gAACTGggact V$MYB_Q3 | 5146 (+) | 1.000 | 0.992 | ctaggCAGTTt V$MYB_Q3 | 5281 (-) | 0.962 | 0.963 | cAACAGgctca V$MYB_Q3 | 5729 (-) | 1.000 | 0.969 | aAACTGgaagc V$MYB_Q3 | 6388 (-) | 1.000 | 0.993 | gAACTGgcctg V$MYB_Q3 | 6568 (+) | 1.000 | 0.967 | tgccaCAGTTa V$MYB_Q3 | 6847 (-) | 1.000 | 0.962 | cAACTGgtgag V$MYB_Q3 | 7187 (-) | 0.980 | 0.977 | gAACGGccacg V$MYB_Q3 | 7488 (+) | 0.962 | 0.957 | cacgtCTGTTc V$MYB_Q3 | 7523 (-) | 1.000 | 0.968 | gAACTGaagtt V$MYB_Q3 | 7530 (+) | 1.000 | 0.974 | agttaCAGTTg V$MYB_Q3 | 7550 (-) | 1.000 | 0.965 | cAACTGtgtag V$MYB_Q3 | 7602 (-) | 1.000 | 0.959 | gAACTGctgag V$MYB_Q3 | 7721 (+) | 1.000 | 0.959 | atcagCAGTTt V$MYB_Q3 | 8000 (+) | 0.962 | 0.963 | tgtgcCTGTTg V$MYB_Q3 | 8194 (+) | 0.962 | 0.958 | tctgcCTGTTt V$MYB_Q3 | 8212 (+) | 1.000 | 0.962 | gacctCAGTTc V$MYB_Q3 | 8725 (-) | 
1.000 | 0.970 | gAACTGtaacg V$MYB_Q3 | 9130 (+) | 1.000 | 0.969 | accttCAGTTt V$MYB_Q3 | 9559 (+) | 0.968 | 0.964 | cctgtCGGTTt V$MYB_Q3 | 9765 (-) | 0.962 | 0.956 | aAACAGtcatg V$MYB_Q3 | 10058 (-) | 1.000 | 0.970 | gAACTGaaccg V$MYB_Q3 | 10192 (+) | 1.000 | 0.996 | gcagtCAGTTc V$MYB_Q3 | 10310 (-) | 1.000 | 0.963 | tAACTGttttt V$MYB_Q3 | 10962 (+) | 0.962 | 0.963 | tgtgcCTGTTg V$MYB_Q3 | 11065 (+) | 1.000 | 0.962 | gtgatCAGTTa V$MYB_Q3 | 11135 (+) | 1.000 | 0.963 | gcactCAGTTt V$MYB_Q3 | 11219 (-) | 1.000 | 0.993 | gAACTGcccag V$MYB_Q3 | 11222 (+) | 1.000 | 0.961 | ctgccCAGTTt V$MYB_Q3 | 11237 (-) | 1.000 | 0.970 | tAACTGgagtc V$MYB_Q3 | 11514 (+) | 0.980 | 0.951 | tattaCCGTTg V$MYB_Q3 | 11712 (-) | 1.000 | 0.996 | cAACTGccctc V$NFKB_Q6_01 | 326 (+) | 1.000 | 0.996 | gtgctGGAAAttccct V$NFY_Q6_01 | 1022 (-) | 1.000 | 0.972 | tgATTGGcagttg V$NFY_Q6_01 | 11893 (+) | 1.000 | 0.960 | tctcaaCCAATag V$NFY_Q6_01 | 11929 (-) | 1.000 | 0.974 | ccATTGGctgaaa V$SREBP_Q3 | 99 (-) | 0.817 | 0.768 | tcGCCTGggggt V$SREBP_Q3 | 130 (+) | 0.807 | 0.828 | ctgctCAGTCaa V$SREBP_Q3 | 134 (+) | 0.776 | 0.828 | tcagtCAATCca V$SREBP_Q3 | 193 (+) | 0.936 | 0.873 | cctccCACACca V$SREBP_Q3 | 249 (-) | 0.892 | 0.812 | tgGGCTGtaagg V$SREBP_Q3 | 413 (-) | 1.000 | 0.784 | atGGGTGagctt V$SREBP_Q3 | 459 (-) | 0.861 | 0.780 | ttGGTTGgctta V$SREBP_Q3 | 533 (-) | 1.000 | 0.791 | atGGGTGatgcg V$SREBP_Q3 | 684 (+) | 0.936 | 0.842 | tcagaCACACca V$SREBP_Q3 | 781 (-) | 1.000 | 0.877 | tcGGGTGctctt V$SREBP_Q3 | 1279 (-) | 0.861 | 0.895 | tgGGATGacagg V$SREBP_Q3 | 1355 (+) | 0.741 | 0.780 | tgggtCACCTga V$SREBP_Q3 | 1421 (+) | 0.926 | 0.803 | tgagaCACGCaa V$SREBP_Q3 | 1681 (-) | 1.000 | 0.872 | tcGGGTGtcagc V$SREBP_Q3 | 1947 (-) | 0.892 | 0.822 | taGGCTGgaact V$SREBP_Q3 | 2297 (-) | 0.936 | 0.821 | ttGTGTGtgtgt V$SREBP_Q3 | 2334 (+) | 0.741 | 0.794 | tatgtGACCCta V$SREBP_Q3 | 3080 (-) | 1.000 | 0.812 | caGGGTGattct V$SREBP_Q3 | 3230 (+) | 0.892 | 0.884 | ccagtCAGCCaa V$SREBP_Q3 | 3243 (+) | 0.892 | 0.843 | 
attccCAGCCca V$SREBP_Q3 | 3286 (-) | 0.861 | 0.896 | tgGGATGaggtg V$SREBP_Q3 | 3291 (-) | 0.741 | 0.805 | tgAGGTGaacat V$SREBP_Q3 | 3354 (+) | 0.828 | 0.770 | tcagaCAGACca V$SREBP_Q3 | 3722 (-) | 1.000 | 0.778 | acGGGTGactaa V$SREBP_Q3 | 3951 (+) | 0.861 | 0.802 | tcagcCATCCta V$SREBP_Q3 | 4257 (-) | 0.861 | 0.777 | ttGGTTGttagg V$SREBP_Q3 | 4265 (-) | 0.861 | 0.802 | taGGATGcttac V$SREBP_Q3 | 4329 (-) | 0.741 | 0.792 | tcTGGTGacatc V$SREBP_Q3 | 4438 (+) | 0.936 | 0.848 | aaaacCACACaa V$SREBP_Q3 | 4512 (+) | 0.828 | 0.881 | aacatCAGACca V$SREBP_Q3 | 4711 (-) | 0.741 | 0.781 | ttGGGTAattca V$SREBP_Q3 | 5031 (-) | 0.861 | 0.810 | tgGGATGctcag V$SREBP_Q3 | 5092 (-) | 0.797 | 0.770 | tgGTTTGggtta V$SREBP_Q3 | 5096 (-) | 0.741 | 0.790 | ttGGGTTattat V$SREBP_Q3 | 5137 (+) | 0.861 | 0.889 | gcggtCAACCta V$SREBP_Q3 | 5298 (+) | 0.861 | 0.791 | ctagcCATCCaa V$SREBP_Q3 | 5764 (-) | 0.741 | 0.794 | taGGGTAaaagg V$SREBP_Q3 | 5821 (-) | 0.926 | 0.866 | tgGCGTGgcact V$SREBP_Q3 | 6392 (-) | 0.817 | 0.776 | tgGCCTGtgttc V$SREBP_Q3 | 6435 (+) | 0.861 | 0.886 | tcggtCAACCca V$SREBP_Q3 | 6526 (+) | 1.000 | 0.795 | aagttCACCCtg V$SREBP_Q3 | 6728 (-) | 1.000 | 0.805 | aaGGGTGagcag V$SREBP_Q3 | 7102 (-) | 0.828 | 0.773 | taGTCTGtgtac V$SREBP_Q3 | 7472 (-) | 0.915 | 0.797 | ttGAGTGtgcta V$SREBP_Q3 | 7517 (-) | 0.817 | 0.856 | tgGCCTGaactg V$SREBP_Q3 | 7538 (-) | 0.861 | 0.773 | ttGGTTGtgagc V$SREBP_Q3 | 7583 (+) | 0.797 | 0.821 | caggtCAAACaa V$SREBP_Q3 | 7612 (+) | 0.656 | 0.770 | gccatCTCTCca V$SREBP_Q3 | 7936 (-) | 0.936 | 0.828 | ttGTGTGtttag V$SREBP_Q3 | 7956 (+) | 0.828 | 0.780 | tacccCAGACta V$SREBP_Q3 | 8009 (-) | 0.678 | 0.773 | tgGTGTAacctt V$SREBP_Q3 | 8129 (-) | 0.828 | 0.791 | tgGTCTGctttc V$SREBP_Q3 | 8397 (+) | 0.786 | 0.776 | gaagcCAAGCca V$SREBP_Q3 | 8444 (+) | 0.861 | 0.792 | acccgCATCCta V$SREBP_Q3 | 8662 (+) | 0.797 | 0.845 | gttttCAAACca V$SREBP_Q3 | 8961 (-) | 0.741 | 0.785 | taGGGAGaacca V$SREBP_Q3 | 9063 (+) | 0.817 | 0.772 | agttcCAGGCta V$SREBP_Q3 | 9187 (+) | 
0.915 | 0.838 | gagacCACTCaa V$SREBP_Q3 | 9299 (-) | 0.936 | 0.821 | ttGTGTGcccag V$SREBP_Q3 | 9321 (-) | 0.741 | 0.787 | ttGGGTCagagc V$SREBP_Q3 | 9729 (-) | 0.797 | 0.828 | ttGTATGataat V$SREBP_Q3 | 9767 (+) | 0.786 | 0.845 | acagtCATGCca V$SREBP_Q3 | 10177 (-) | 0.936 | 0.842 | taGTGTGtgtgt V$SREBP_Q3 | 10236 (-) | 0.861 | 0.820 | taGGTTGgtggt V$SREBP_Q3 | 10877 (-) | 0.678 | 0.777 | tgGTGGGagcac V$SREBP_Q3 | 11181 (-) | 0.892 | 0.791 | ttGGCTGccttg V$SREBP_Q3 | 11197 (-) | 0.776 | 0.813 | ttGATTGattat V$SREBP_Q3 | 11213 (-) | 0.797 | 0.811 | ttGTTTGaactg V$SREBP_Q3 | 11258 (-) | 0.936 | 0.852 | tgGTGTGccaag V$SREBP_Q3 | 11313 (-) | 0.741 | 0.808 | taCGGTGacgac V$SREBP_Q3 | 11555 (+) | 0.861 | 0.856 | gtattCAACCaa V$SREBP_Q3 | 11600 (-) | 0.915 | 0.818 | taGAGTGcacag V$SREBP_Q3 | 11702 (+) | 1.000 | 0.988 | actgtCACCCca V$SREBP_Q3 | 11736 (+) | 0.678 | 0.780 | aacatCTCACca V$SREBP_Q3 | 11738 (+) | 0.741 | 0.783 | catctCACCAaa V$SREBP_Q3 | 11824 (-) | 1.000 | 0.978 | tgGGGTGagaaa V$SREBP_Q3 | 11891 (+) | 0.861 | 0.863 | actctCAACCaa V$SREBP_Q3 | 11932 (-) | 0.892 | 0.873 | ttGGCTGaaact V$STAT_Q6 | 556 (+) | 0.984 | 0.986 | accttTTCTAgga V$STAT_Q6 | 3259 (+) | 0.994 | 0.935 | tggttTTCTTggt V$STAT_Q6 | 3689 (-) | 0.994 | 0.937 | gccAAGAAcacaa V$STAT_Q6 | 4131 (-) | 1.000 | 0.937 | accCAGAAtctag V$STAT_Q6 | 6125 (+) | 0.981 | 0.930 | ttgttTTCTCggc V$STAT_Q6 | 6634 (-) | 1.000 | 0.938 | accCAGAAggcac V$STAT_Q6 | 7494 (+) | 0.994 | 0.992 | tgttcTTCTTgga V$STAT_Q6 | 8546 (-) | 0.981 | 0.933 | gccGAGAAaatga V$STAT_Q6 | 9563 (+) | 1.000 | 0.938 | tcggtTTCTGggt V$STAT_Q6 | 10053 (-) | 0.994 | 0.940 | tctAAGAActgaa V$STAT_Q6 | 11264 (-) | 0.994 | 0.945 | gccAAGAAaagga P$EMBP1_02 | 434 (-) | 1.000 | 0.951 | acCACGTtta P$EMBP1_02 | 735 (+) | 1.000 | 0.928 | accACGTGgt P$EMBP1_02 | 735 (-) | 1.000 | 0.928 | acCACGTggt P$EMBP1_02 | 1126 (-) | 1.000 | 0.943 | tgCACGTgtg P$EMBP1_02 | 6886 (-) | 1.000 | 0.926 | acCACGTaag V$GATA_Q6 | 84 (+) | 0.984 | 0.978 | tGATAGg V$GATA_Q6 | 
92 (-) | 0.984 | 0.984 | cCTATCt V$GATA_Q6 | 308 (-) | 0.984 | 0.976 | gCTATCt V$GATA_Q6 | 568 (+) | 1.000 | 0.995 | aGATAAa V$GATA_Q6 | 1996 (-) | 1.000 | 0.990 | aTTATCt V$GATA_Q6 | 2724 (+) | 1.000 | 0.990 | aGATAAt V$GATA_Q6 | 3931 (-) | 1.000 | 0.985 | aTTATCa V$GATA_Q6 | 4375 (-) | 1.000 | 0.992 | gTTATCt V$GATA_Q6 | 5979 (+) | 1.000 | 0.987 | tGATAAc V$GATA_Q6 | 6467 (+) | 1.000 | 0.989 | tGATAAa V$GATA_Q6 | 7799 (+) | 1.000 | 0.995 | aGATAAa V$GATA_Q6 | 8063 (-) | 1.000 | 0.995 | tTTATCt V$GATA_Q6 | 8605 (-) | 1.000 | 0.985 | aTTATCa V$GATA_Q6 | 9734 (+) | 1.000 | 0.985 | tGATAAt V$GATA_Q6 | 11370 (-) | 0.984 | 0.979 | tCTATCt V$GATA_Q6 | 11442 (-) | 1.000 | 1.000 | cTTATCt V$GATA_Q6 | 11570 (-) | 1.000 | 0.994 | cTTATCa V$HNF1_Q6 | 986 (+) | 1.000 | 0.950 | cagTTAATaattttcatg V$HNF1_Q6 | 3411 (+) | 1.000 | 0.926 | aagTTAATgattctggtt V$HNF1_Q6 | 4054 (-) | 1.000 | 0.891 | ataggcaataATTAAaat V$HNF1_Q6 | 5098 (+) | 0.935 | 0.879 | gggTTATTatttgtgagt V$HNF1_Q6 | 5784 (-) | 1.000 | 0.883 | atatgaaataATTAAata V$HNF1_Q6 | 7353 (+) | 1.000 | 0.898 | atgTTAATcatgggcaga V$HNF1_Q6 | 7822 (-) | 1.000 | 0.927 | aaaacaaacaATTAAccc V$HNF1_Q6 | 11693 (+) | 1.000 | 0.869 | aagTTAATcactgtcacc V$HNF3_Q6 | 213 (-) | 0.977 | 0.923 | ggTATTTtttttt V$HNF3_Q6 | 229 (-) | 0.952 | 0.927 | ccTTTTTgtcttt V$HNF3_Q6 | 506 (-) | 0.977 | 0.944 | taTATTTgtcttc V$HNF3_Q6 | 846 (+) | 0.977 | 0.949 | ttaaatAAATAcg V$HNF3_Q6 | 875 (+) | 1.000 | 0.959 | atgcacAAACAcg V$HNF3_Q6 | 953 (+) | 0.977 | 0.918 | atgatcAAATAtt V$HNF3_Q6 | 1472 (-) | 0.977 | 0.943 | aaTATTTgctctg V$HNF3_Q6 | 1527 (-) | 1.000 | 0.936 | gcTGTTTttttgt V$HNF3_Q6 | 1531 (-) | 0.952 | 0.939 | ttTTTTTgttcta V$HNF3_Q6 | 1894 (-) | 0.977 | 0.944 | taTATTTatttta V$HNF3_Q6 | 2292 (-) | 0.952 | 0.916 | agTTTTTgtgtgt V$HNF3_Q6 | 2361 (-) | 0.977 | 0.947 | tgTATTTattttg V$HNF3_Q6 | 2408 (+) | 1.000 | 0.943 | taaaaaAAACAaa V$HNF3_Q6 | 2412 (+) | 0.952 | 0.951 | aaaaacAAAAAat V$HNF3_Q6 | 3595 (-) | 1.000 | 0.914 | taTGTTTgggttt 
V$HNF3_Q6 | 3806 (+) | 1.000 | 0.930 | tagaaaAAACAtc V$HNF3_Q6 | 4006 (+) | 0.977 | 0.958 | ataaatAAATAtt V$HNF3_Q6 | 4219 (+) | 0.977 | 0.934 | ttaaaaAAATAaa V$HNF3_Q6 | 4424 (+) | 0.952 | 0.917 | ctaaaaAAAAAaa V$HNF3_Q6 | 4489 (+) | 0.952 | 0.943 | ttgaacAAAAAac V$HNF3_Q6 | 4492 (+) | 1.000 | 0.930 | aacaaaAAACAaa V$HNF3_Q6 | 5103 (-) | 0.977 | 0.915 | atTATTTgtgagt V$HNF3_Q6 | 5120 (+) | 1.000 | 0.924 | aaaaggAAACAga V$HNF3_Q6 | 5400 (-) | 0.977 | 0.931 | taTATTTtttcat V$HNF3_Q6 | 5435 (-) | 0.977 | 0.951 | ttTATTTgtattg V$HNF3_Q6 | 5644 (+) | 0.952 | 0.927 | caaaatAAAAAga V$HNF3_Q6 | 5797 (+) | 0.952 | 0.931 | aaatacAAAAAaa V$HNF3_Q6 | 5806 (+) | 0.977 | 0.924 | aaaaaaAAATAgc V$HNF3_Q6 | 6253 (-) | 0.977 | 0.924 | tcTATTTgtccca V$HNF3_Q6 | 6588 (+) | 0.977 | 0.966 | caaaacAAATAag V$HNF3_Q6 | 6606 (-) | 0.977 | 0.930 | gcTATTTgctaaa V$HNF3_Q6 | 6651 (+) | 1.000 | 0.937 | gtctgcAAACAga V$HNF3_Q6 | 7048 (-) | 0.977 | 0.924 | acTATTTtttttc V$HNF3_Q6 | 7065 (-) | 1.000 | 0.920 | ttTGTTTttgttt V$HNF3_Q6 | 7067 (-) | 0.952 | 0.944 | tgTTTTTgtttgt V$HNF3_Q6 | 7071 (-) | 1.000 | 0.971 | ttTGTTTgtgtat V$HNF3_Q6 | 7211 (+) | 0.952 | 0.949 | ttgaacAAAAAgt V$HNF3_Q6 | 7426 (-) | 0.977 | 0.971 | ttTATTTgttttt V$HNF3_Q6 | 7430 (-) | 1.000 | 0.928 | ttTGTTTttattt V$HNF3_Q6 | 7432 (-) | 0.952 | 0.923 | tgTTTTTatttta V$HNF3_Q6 | 7770 (+) | 1.000 | 0.938 | ctcaaaAAACAaa V$HNF3_Q6 | 7779 (+) | 0.952 | 0.950 | caaaacAAAAAtt V$HNF3_Q6 | 7821 (+) | 1.000 | 0.989 | caaaacAAACAat V$HNF3_Q6 | 7893 (-) | 0.952 | 0.921 | ggTTTTTattttg V$HNF3_Q6 | 7939 (-) | 1.000 | 0.919 | tgTGTTTagtttt V$HNF3_Q6 | 7970 (-) | 1.000 | 0.984 | gtTGTTTgttttt V$HNF3_Q6 | 7974 (-) | 1.000 | 0.913 | ttTGTTTttcaag V$HNF3_Q6 | 8185 (-) | 1.000 | 0.957 | ctTGTTTgctctg V$HNF3_Q6 | 8336 (-) | 0.977 | 0.922 | agTATTTgctgga V$HNF3_Q6 | 8897 (+) | 1.000 | 0.940 | taccacAAACAga V$HNF3_Q6 | 9110 (+) | 0.952 | 0.944 | caaaacAAAAAac V$HNF3_Q6 | 9533 (+) | 1.000 | 0.974 | tagaacAAACAag V$HNF3_Q6 | 9724 (-) | 0.947 | 0.917 | 
tcTCTTTgtatga V$HNF3_Q6 | 9918 (-) | 0.952 | 0.914 | gaTTTTTgcctat V$HNF3_Q6 | 10312 (-) | 1.000 | 0.938 | acTGTTTttatat V$HNF3_Q6 | 10322 (-) | 0.952 | 0.923 | taTTTTTatttta V$HNF3_Q6 | 10701 (-) | 1.000 | 0.918 | cgTGTTTatgaaa V$HNF3_Q6 | 10910 (+) | 0.977 | 0.940 | ctatgcAAATAat V$HNF3_Q6 | 11189 (-) | 1.000 | 0.931 | ctTGTTTcttgat V$HNF3_Q6 | 11468 (-) | 0.952 | 0.921 | atTTTTTatttgt V$HNF3_Q6 | 11472 (-) | 0.977 | 0.971 | ttTATTTgttttt V$HNF3_Q6 | 11502 (-) | 0.952 | 0.915 | aaTTTTTgtggat V$SMAD_Q6 | 301 (+) | 0.996 | 0.974 | AGACTgagc V$SMAD_Q6 | 343 (+) | 1.000 | 0.984 | AGACActtc V$SMAD_Q6 | 595 (+) | 0.996 | 0.969 | AGACTtact V$SMAD_Q6 | 686 (+) | 1.000 | 0.987 | AGACAcacc V$SMAD_Q6 | 1423 (+) | 1.000 | 0.984 | AGACAcgca V$SMAD_Q6 | 1611 (-) | 0.989 | 0.990 | agggGGTCT V$SMAD_Q6 | 1829 (+) | 1.000 | 0.997 | AGACAccca V$SMAD_Q6 | 1919 (+) | 1.000 | 0.977 | AGACAgagt V$SMAD_Q6 | 1925 (-) | 1.000 | 0.973 | agtaTGTCT V$SMAD_Q6 | 2065 (-) | 1.000 | 0.974 | tctcTGTCT V$SMAD_Q6 | 2071 (-) | 1.000 | 0.974 | tctcTGTCT V$SMAD_Q6 | 2077 (-) | 1.000 | 0.974 | tctcTGTCT V$SMAD_Q6 | 2081 (-) | 1.000 | 0.977 | tgtcTGTCT V$SMAD_Q6 | 2089 (-) | 1.000 | 0.974 | tctcTGTCT V$SMAD_Q6 | 2097 (-) | 1.000 | 0.974 | tctcTGTCT V$SMAD_Q6 | 2105 (-) | 1.000 | 0.974 | tctcTGTCT V$SMAD_Q6 | 2302 (-) | 1.000 | 0.984 | tgtgTGTCT V$SMAD_Q6 | 2322 (-) | 0.996 | 0.976 | atacAGTCT V$SMAD_Q6 | 2940 (-) | 0.996 | 0.980 | gcgaAGTCT V$SMAD_Q6 | 3011 (-) | 0.985 | 0.981 | caggCGTCT V$SMAD_Q6 | 3356 (+) | 1.000 | 0.980 | AGACAgacc V$SMAD_Q6 | 3983 (+) | 1.000 | 1.000 | AGACAccat V$SMAD_Q6 | 4290 (+) | 0.989 | 0.974 | AGACCctaa V$SMAD_Q6 | 4323 (-) | 0.989 | 0.974 | actgGGTCT V$SMAD_Q6 | 4401 (+) | 1.000 | 0.977 | AGACAggtc V$SMAD_Q6 | 4772 (-) | 0.996 | 0.983 | agcgAGTCT V$SMAD_Q6 | 4851 (+) | 0.996 | 0.982 | AGACTtcac V$SMAD_Q6 | 4939 (+) | 0.996 | 0.976 | AGACTgaac V$SMAD_Q6 | 5228 (+) | 0.996 | 0.971 | AGACTggga V$SMAD_Q6 | 5547 (-) | 1.000 | 0.977 | tttcTGTCT V$SMAD_Q6 | 5675 (+) | 1.000 | 
0.977 | AGACAgaca V$SMAD_Q6 | 5679 (+) | 1.000 | 0.982 | AGACAcaga V$SMAD_Q6 | 5687 (+) | 0.989 | 0.976 | AGACCcact V$SMAD_Q6 | 5748 (+) | 0.985 | 0.970 | AGACGcaaa V$SMAD_Q6 | 5929 (+) | 0.996 | 0.997 | AGACTccct V$SMAD_Q6 | 6075 (-) | 0.996 | 0.971 | ccacAGTCT V$SMAD_Q6 | 6133 (-) | 0.985 | 0.981 | tcggCGTCT V$SMAD_Q6 | 6187 (+) | 1.000 | 0.997 | AGACAccca V$SMAD_Q6 | 6233 (-) | 1.000 | 0.993 | atgcTGTCT V$SMAD_Q6 | 6646 (-) | 0.996 | 0.971 | cctcAGTCT V$SMAD_Q6 | 7281 (+) | 0.989 | 0.971 | AGACCcata V$SMAD_Q6 | 7339 (+) | 1.000 | 0.984 | AGACAcatt V$SMAD_Q6 | 7763 (-) | 0.989 | 0.974 | tgagGGTCT V$SMAD_Q6 | 8052 (-) | 0.989 | 0.970 | tcgaGGTCT V$SMAD_Q6 | 8713 (-) | 0.996 | 0.994 | aaggAGTCT V$SMAD_Q6 | 8864 (+) | 0.996 | 0.978 | AGACTctgg V$SMAD_Q6 | 9148 (+) | 1.000 | 0.982 | AGACAcata V$SMAD_Q6 | 9346 (-) | 0.996 | 0.980 | ctgaAGTCT V$SMAD_Q6 | 9904 (+) | 0.996 | 0.969 | AGACTtact V$SMAD_Q6 | 10065 (-) | 0.989 | 0.974 | accgGGTCT V$SMAD_Q6 | 10644 (+) | 1.000 | 0.997 | AGACAccca V$SMAD_Q6 | 10902 (+) | 0.985 | 0.973 | AGACGctac V$SMAD_Q6 | 10954 (-) | 0.989 | 0.980 | ctgcGGTCT V$YY1_Q6 | 61 (+) | 1.000 | 0.905 | GCCATaatt V$YY1_Q6 | 718 (-) | 0.902 | 0.863 | acagATGGT V$YY1_Q6 | 802 (+) | 1.000 | 0.942 | GCCATttct V$YY1_Q6 | 886 (-) | 0.889 | 0.853 | cgaaATGGG V$YY1_Q6 | 1662 (-) | 0.915 | 0.868 | gcaaATGGA V$YY1_Q6 | 1887 (+) | 0.889 | 0.895 | CCCATttta V$YY1_Q6 | 2456 (-) | 1.000 | 0.857 | cgctATGGC V$YY1_Q6 | 2614 (+) | 1.000 | 0.942 | GCCATgtgt V$YY1_Q6 | 2837 (-) | 1.000 | 0.939 | gcaaATGGC V$YY1_Q6 | 3149 (-) | 0.915 | 0.860 | atatATGGA V$YY1_Q6 | 3275 (+) | 1.000 | 0.912 | GCCATtatg V$YY1_Q6 | 3304 (+) | 0.915 | 0.867 | TCCATgtag V$YY1_Q6 | 3642 (-) | 0.902 | 0.864 | ccacATGGT V$YY1_Q6 | 3954 (+) | 1.000 | 0.899 | GCCATccta V$YY1_Q6 | 3987 (+) | 0.902 | 0.857 | ACCATctca V$YY1_Q6 | 4110 (+) | 0.915 | 0.919 | TCCATtttc V$YY1_Q6 | 4155 (+) | 0.889 | 0.850 | CCCATttgt V$YY1_Q6 | 4892 (+) | 0.915 | 0.919 | TCCATgttc V$YY1_Q6 | 4991 (-) | 1.000 | 0.855 | 
ccgaATGGC V$YY1_Q6 | 5128 (-) | 0.902 | 0.863 | acagATGGT V$YY1_Q6 | 5307 (-) | 1.000 | 0.997 | caaaATGGC V$YY1_Q6 | 5447 (+) | 1.000 | 0.925 | GCCATataa V$YY1_Q6 | 5738 (+) | 1.000 | 0.898 | GCCATaata V$YY1_Q6 | 5874 (+) | 0.902 | 0.860 | ACCATctcc V$YY1_Q6 | 6036 (+) | 0.889 | 0.856 | CCCATctgg V$YY1_Q6 | 6446 (-) | 1.000 | 0.857 | agtcATGGC V$YY1_Q6 | 6996 (-) | 0.915 | 0.871 | ggagATGGA V$YY1_Q6 | 7175 (-) | 0.902 | 0.860 | gcagATGGT V$YY1_Q6 | 7512 (-) | 1.000 | 0.939 | tcagATGGC V$YY1_Q6 | 7612 (+) | 1.000 | 0.945 | GCCATctct V$YY1_Q6 | 7734 (+) | 1.000 | 0.853 | GCCATcaca V$YY1_Q6 | 9001 (-) | 0.915 | 0.868 | gcacATGGA V$YY1_Q6 | 9053 (-) | 1.000 | 0.938 | ctacATGGC V$YY1_Q6 | 10107 (+) | 1.000 | 1.000 | GCCATcttg V$YY1_Q6 | 10115 (+) | 1.000 | 0.851 | GCCATcccc V$YY1_Q6 | 10388 (+) | 0.915 | 0.854 | TCCATataa V$YY1_Q6 | 10509 (-) | 1.000 | 0.905 | aattATGGC V$YY1_Q6 | 10522 (+) | 1.000 | 0.902 | GCCATtata V$YY1_Q6 | 10854 (+) | 1.000 | 0.902 | GCCATtgta V$YY1_Q6 | 11346 (+) | 1.000 | 0.904 | GCCATtatc V$YY1_Q6 | 11540 (-) | 0.902 | 0.857 | acatATGGT V$YY1_Q6 | 11724 (+) | 0.902 | 0.908 | ACCATtttc V$TTF1_Q6 | 10 (+) | 0.991 | 0.944 | tgtACAAGtaca V$TTF1_Q6 | 52 (-) | 0.993 | 0.961 | attaCTTGGgcc V$TTF1_Q6 | 98 (-) | 0.872 | 0.871 | ttcgCCTGGggg V$TTF1_Q6 | 270 (-) | 0.991 | 0.956 | ggtcCTTGTgca V$TTF1_Q6 | 280 (+) | 0.988 | 0.950 | catGCAAGtcaa V$TTF1_Q6 | 285 (+) | 1.000 | 0.978 | aagTCAAGtgtt V$TTF1_Q6 | 400 (+) | 0.991 | 0.946 | ccaACAAGgcgg V$TTF1_Q6 | 418 (-) | 0.991 | 0.934 | tgagCTTGTcca V$TTF1_Q6 | 478 (+) | 0.991 | 0.951 | cttACAAGtcaa V$TTF1_Q6 | 541 (-) | 0.991 | 0.973 | tgcgCTTGTgtt V$TTF1_Q6 | 838 (+) | 0.991 | 0.937 | aaaACAAGttaa V$TTF1_Q6 | 936 (+) | 0.991 | 0.958 | accACAAGttag V$TTF1_Q6 | 1212 (-) | 0.993 | 0.952 | attgCTTGGagt V$TTF1_Q6 | 1345 (-) | 0.988 | 0.951 | aaggCTTGCctg V$TTF1_Q6 | 1654 (-) | 0.993 | 0.950 | tctaCTTGGcaa V$TTF1_Q6 | 1937 (-) | 0.993 | 0.941 | tagcCTTGGcta V$TTF1_Q6 | 1988 (+) | 0.993 | 0.937 | tcaCCAAGatta 
V$TTF1_Q6 | 2011 (-) | 0.878 | 0.883 | atctCCTGAgtg V$TTF1_Q6 | 2325 (-) | 0.988 | 0.934 | cagtCTTGCtat V$TTF1_Q6 | 2355 (-) | 0.991 | 0.944 | caaaCTTGTatt V$TTF1_Q6 | 2465 (+) | 0.988 | 0.957 | ctaGCAAGcgcg V$TTF1_Q6 | 2605 (+) | 0.993 | 0.964 | aaaCCAAGagcc V$TTF1_Q6 | 2998 (+) | 0.993 | 0.955 | cctCCAAGgtgc V$TTF1_Q6 | 3201 (+) | 1.000 | 0.970 | gccTCAAGtatg V$TTF1_Q6 | 3214 (+) | 0.988 | 0.944 | tagGCAAGcact V$TTF1_Q6 | 3262 (-) | 0.993 | 0.944 | ttttCTTGGtga V$TTF1_Q6 | 3525 (-) | 0.993 | 0.937 | ccacCTTGGccc V$TTF1_Q6 | 3573 (+) | 0.988 | 0.939 | ttgGCAAGttat V$TTF1_Q6 | 3680 (+) | 0.993 | 0.966 | ttgCCAAGtgcc V$TTF1_Q6 | 3687 (+) | 0.993 | 0.950 | gtgCCAAGaaca V$TTF1_Q6 | 3695 (+) | 0.991 | 0.980 | aacACAAGagat V$TTF1_Q6 | 3704 (-) | 1.000 | 0.944 | gatgCTTGAtca V$TTF1_Q6 | 3878 (+) | 0.991 | 0.942 | aatACAAGctcc V$TTF1_Q6 | 3885 (-) | 1.000 | 0.954 | gctcCTTGAccc V$TTF1_Q6 | 3938 (-) | 0.991 | 0.953 | gtcaCTTGTtca V$TTF1_Q6 | 4194 (+) | 0.988 | 0.938 | gctGCAAGttca V$TTF1_Q6 | 4200 (+) | 1.000 | 0.949 | agtTCAAGgctt V$TTF1_Q6 | 4205 (-) | 0.988 | 0.938 | aaggCTTGCctc V$TTF1_Q6 | 4252 (-) | 0.993 | 0.958 | attaCTTGGttg V$TTF1_Q6 | 4395 (+) | 0.993 | 0.940 | gttCCAAGacag V$TTF1_Q6 | 4415 (-) | 0.991 | 0.937 | caacCTTGTcta V$TTF1_Q6 | 4518 (+) | 0.993 | 0.938 | agaCCAAGacaa V$TTF1_Q6 | 4600 (-) | 0.988 | 0.961 | cccgCTTGCtgg V$TTF1_Q6 | 4692 (-) | 1.000 | 0.951 | gaaaCTTGAtgt V$TTF1_Q6 | 4706 (-) | 0.993 | 0.956 | tagaCTTGGgta V$TTF1_Q6 | 4793 (-) | 0.988 | 0.937 | ctgaCTTGCata V$TTF1_Q6 | 4897 (-) | 0.993 | 0.947 | gttcCTTGGcca V$TTF1_Q6 | 4903 (+) | 0.993 | 0.962 | tggCCAAGtgca V$TTF1_Q6 | 5000 (-) | 0.991 | 0.942 | agacCTTGTatt V$TTF1_Q6 | 5170 (-) | 0.991 | 0.971 | ttcaCTTGTgac V$TTF1_Q6 | 5207 (-) | 0.991 | 0.978 | gcctCTTGTgct V$TTF1_Q6 | 5606 (+) | 0.991 | 0.968 | aacACAAGaaac V$TTF1_Q6 | 5835 (+) | 0.991 | 0.932 | gaaACAAGgcat V$TTF1_Q6 | 5932 (-) | 0.993 | 0.959 | ctccCTTGGaga V$TTF1_Q6 | 6120 (-) | 0.991 | 0.937 | agggCTTGTttt V$TTF1_Q6 | 6209 (+) | 
0.993 | 0.945 | atgCCAAGgttt V$TTF1_Q6 | 6893 (+) | 0.991 | 0.968 | aagACAAGaggg V$TTF1_Q6 | 6914 (+) | 0.991 | 0.940 | agaACAAGgaat V$TTF1_Q6 | 6941 (+) | 0.993 | 0.977 | tacCCAAGtgat V$TTF1_Q6 | 7127 (+) | 1.000 | 0.961 | tgcTCAAGgagg V$TTF1_Q6 | 7292 (+) | 0.993 | 0.928 | gtaCCAAGctga V$TTF1_Q6 | 7381 (-) | 0.991 | 0.957 | attgCTTGTgtc V$TTF1_Q6 | 7497 (-) | 0.993 | 0.964 | tcttCTTGGagg V$TTF1_Q6 | 7695 (-) | 0.993 | 0.969 | agaaCTTGGgag V$TTF1_Q6 | 7979 (+) | 1.000 | 0.961 | tttTCAAGggaa V$TTF1_Q6 | 8032 (-) | 0.988 | 0.943 | agagCTTGCtgg V$TTF1_Q6 | 8042 (-) | 1.000 | 0.951 | gggcCTTGActc V$TTF1_Q6 | 8181 (-) | 0.991 | 0.949 | agaaCTTGTttg V$TTF1_Q6 | 8348 (-) | 0.988 | 0.960 | accaCTTGCcaa V$TTF1_Q6 | 8398 (+) | 0.993 | 0.945 | aagCCAAGccag V$TTF1_Q6 | 8455 (+) | 0.993 | 0.947 | aagCCAAGgtct V$TTF1_Q6 | 8528 (+) | 1.000 | 0.957 | gctTCAAGaaca V$TTF1_Q6 | 8708 (+) | 1.000 | 0.963 | ccaTCAAGgagt V$TTF1_Q6 | 8753 (-) | 0.991 | 0.942 | acacCTTGTcct V$TTF1_Q6 | 8768 (-) | 0.872 | 0.876 | agcaCCTGGgag V$TTF1_Q6 | 9034 (+) | 1.000 | 0.966 | agtTCAAGggca V$TTF1_Q6 | 9057 (+) | 0.988 | 0.945 | atgGCAAGttcc V$TTF1_Q6 | 9192 (+) | 1.000 | 0.994 | cacTCAAGagag V$TTF1_Q6 | 9203 (+) | 0.993 | 0.937 | gcaCCAAGgctc V$TTF1_Q6 | 9222 (+) | 0.988 | 0.943 | aagGCAAGttag V$TTF1_Q6 | 9279 (+) | 0.993 | 0.945 | ccaCCAAGctga V$TTF1_Q6 | 9294 (-) | 0.991 | 0.961 | tcagCTTGTgtg V$TTF1_Q6 | 9441 (+) | 0.993 | 0.954 | agcCCAAGgttt V$TTF1_Q6 | 9523 (+) | 0.991 | 0.950 | ccaACAAGacta V$TTF1_Q6 | 9538 (+) | 0.991 | 0.951 | caaACAAGtcct V$TTF1_Q6 | 9632 (+) | 1.000 | 0.945 | agtTCAAGgtca V$TTF1_Q6 | 9641 (-) | 0.993 | 0.951 | tcagCTTGGgct V$TTF1_Q6 | 9661 (-) | 0.991 | 0.940 | agacCTTGTctc V$TTF1_Q6 | 10108 (-) | 0.988 | 0.942 | ccatCTTGCcat V$TTF1_Q6 | 10248 (-) | 0.991 | 0.946 | caggCTTGTgca V$TTF1_Q6 | 10490 (+) | 1.000 | 0.951 | ataTCAAGgatc V$TTF1_Q6 | 10800 (-) | 0.991 | 0.949 | gagcCTTGTgca V$TTF1_Q6 | 10931 (-) | 0.991 | 0.978 | gcctCTTGTgga V$TTF1_Q6 | 10972 (+) | 0.988 | 0.933 | 
ggaGCAAGgagc V$TTF1_Q6 | 11072 (+) | 0.991 | 0.935 | gttACAAGctgc V$TTF1_Q6 | 11185 (-) | 0.991 | 0.929 | ctgcCTTGTttc V$TTF1_Q6 | 11192 (-) | 1.000 | 0.965 | gtttCTTGAttg V$TTF1_Q6 | 11262 (+) | 0.993 | 0.949 | gtgCCAAGaaaa V$TTF1_Q6 | 11483 (-) | 0.993 | 0.960 | ttccCTTGGaat V$TTF1_Q6 | 11777 (-) | 1.000 | 0.959 | gataCTTGAtgt V$TTF1_Q6 | 11944 (+) | 1.000 | 0.970 | attTCAAGggtc V$TTF1_Q6 | 11961 (-) | 0.878 | 0.870 | tgcaCCTGAgct V$OCT_Q6 | 993 (+) | 0.957 | 0.959 | taattTTCATg V$OCT_Q6 | 3109 (+) | 0.950 | 0.955 | taattTCCATg V$OCT_Q6 | 3129 (+) | 0.963 | 0.966 | taattTACATt V$OCT_Q6 | 3835 (+) | 0.950 | 0.948 | aaattTCCATt V$OCT_Q6 | 4071 (-) | 0.963 | 0.923 | tATGTAaaatg V$OCT_Q6 | 4451 (+) | 0.957 | 0.890 | taataTTCATt V$OCT_Q6 | 5341 (+) | 1.000 | 0.936 | agtttTGCATc V$OCT_Q6 | 5403 (+) | 0.957 | 0.906 | attttTTCATc V$OCT_Q6 | 6577 (-) | 1.000 | 0.960 | tATGCAaaaca V$OCT_Q6 | 6969 (-) | 0.950 | 0.908 | gATGGAaagga V$OCT_Q6 | 6999 (-) | 0.950 | 0.900 | gATGGAaaagg V$OCT_Q6 | 9737 (+) | 0.950 | 0.881 | taatgTCCATa V$OCT_Q6 | 9799 (-) | 0.963 | 0.882 | aATGTAcatca V$OCT_Q6 | 9934 (+) | 1.000 | 0.990 | taattTGCATg V$OCT_Q6 | 10911 (-) | 1.000 | 1.000 | tATGCAaataa V$OCT_Q6 | 11154 (-) | 0.957 | 0.955 | gATGAAaatga V$OCT_Q6 | 11337 (-) | 0.963 | 0.950 | gATGTAaatgc V$OCT_Q6 | 11411 (+) | 1.000 | 0.876 | tattcTGCATt V$OCT_Q6 | 11665 (+) | 0.957 | 0.958 | caattTTCATa V$USF_Q6_01 | 734 (+) | 1.000 | 0.907 | caccACGTGgtt V$USF_Q6_01 | 734 (-) | 1.000 | 0.908 | cacCACGTggtt V$USF_Q6_01 | 1125 (+) | 1.000 | 0.880 | atgcACGTGtgt V$USF_Q6_01 | 1357 (-) | 0.927 | 0.930 | ggtCACCTgatg V$USF_Q6_01 | 1391 (+) | 0.927 | 0.905 | gggcAGGTGagg V$USF_Q6_01 | 2772 (+) | 0.907 | 0.886 | ccgcATGTGaga V$USF_Q6_01 | 3937 (-) | 0.907 | 0.926 | agtCACTTgttc V$USF_Q6_01 | 6942 (+) | 0.907 | 0.884 | acccAAGTGata V$HIF1_Q3 | 6061 (-) | 1.000 | 0.951 | ccctgCACGTgctg V$HIF1_Q3 | 6063 (+) | 1.000 | 0.954 | ctgcACGTGctgcc V$HIF1_Q3 | 6853 (-) | 1.000 | 0.942 | gtgagCACGTccag V$HIF1_Q3 | 7483 
(-) | 1.000 | 0.950 | acgcgCACGTctgt P$MYBAS1_01 | 124 (+) | 1.000 | 0.995 | ggCTAACtgct P$MYBAS1_01 | 789 (+) | 0.997 | 0.992 | tcTTAACcgct P$MYBAS1_01 | 1027 (-) | 0.997 | 0.990 | ggcaGTTGAat P$MYBAS1_01 | 1143 (-) | 0.999 | 0.987 | aaccGTTACtt P$MYBAS1_01 | 1315 (+) | 1.000 | 0.993 | ccCTAACtccc P$MYBAS1_01 | 2428 (-) | 1.000 | 0.992 | agggGTTGGgg P$MYBAS1_01 | 3442 (+) | 0.999 | 0.987 | agGTAACagtt P$MYBAS1_01 | 4536 (+) | 0.997 | 0.994 | cgTCAACagcc P$MYBAS1_01 | 5278 (+) | 0.999 | 0.988 | caACAACaggc P$MYBAS1_01 | 5615 (+) | 1.000 | 0.986 | aaCCAACatct P$MYBAS1_01 | 5979 (+) | 0.999 | 0.985 | tgATAACttct P$MYBAS1_01 | 6844 (+) | 1.000 | 0.986 | tgCCAACtggt P$MYBAS1_01 | 8003 (-) | 1.000 | 0.988 | gcctGTTGGtg P$MYBAS1_01 | 8354 (+) | 1.000 | 0.985 | tgCCAACccct P$MYBAS1_01 | 8485 (-) | 0.999 | 0.985 | ggggGTTATtt P$MYBAS1_01 | 8589 (+) | 0.999 | 0.986 | agGCAACatct P$MYBAS1_01 | 8673 (-) | 1.000 | 0.987 | accgGTTAGga P$MYBAS1_01 | 9316 (-) | 1.000 | 0.990 | aggcGTTGGgt P$MYBAS1_01 | 10619 (-) | 0.999 | 0.987 | gggaGTTGTag P$MYBAS1_01 | 10965 (-) | 1.000 | 0.986 | gcctGTTGGag P$MYBAS1_01 | 11709 (+) | 1.000 | 1.000 | ccCCAACtgcc P$MYBAS1_01 | 11920 (+) | 1.000 | 0.990 | gcCCAACgccc V$MYC_Q2 | 736 (-) | 1.000 | 1.000 | ccACGTG V$MYC_Q2 | 737 (+) | 1.000 | 1.000 | CACGTgg V$MYC_Q2 | 1127 (-) | 1.000 | 1.000 | gcACGTG V$MYC_Q2 | 1128 (+) | 1.000 | 0.994 | CACGTgt V$MYC_Q2 | 6065 (-) | 1.000 | 1.000 | gcACGTG V$MYC_Q2 | 6066 (+) | 1.000 | 1.000 | CACGTgc V$AP2_Q3 | 95 (-) | 1.000 | 0.956 | atcttcGCCTGggggt V$AP2_Q3 | 1209 (-) | 0.869 | 0.827 | cccattGCTTGgagtt V$AP2_Q3 | 1333 (+) | 0.893 | 0.847 | tccccAGGGCttaagg V$AP2_Q3 | 1375 (+) | 0.903 | 0.853 | tgcagTGGGCgaggta V$AP2_Q3 | 1411 (-) | 0.827 | 0.843 | tccattGCCGTgagac V$AP2_Q3 | 1452 (-) | 0.934 | 0.822 | cactgaGCCGGacgtt V$AP2_Q3 | 3527 (-) | 1.000 | 0.898 | accttgGCCCGaggtt V$AP2_Q3 | 3563 (-) | 0.893 | 0.818 | tgtgtgGCCTTtggca V$AP2_Q3 | 4174 (+) | 0.871 | 0.838 | ttcagGAGGCtgagca V$AP2_Q3 | 4186 (+) | 0.893 
| 0.851 | agcagAAGGCtgcaag V$AP2_Q3 | 4232 (+) | 0.893 | 0.850 | ttctcAGGGCtgggga V$AP2_Q3 | 4599 (+) | 0.755 | 0.839 | gcccgCTTGCtggaag V$AP2_Q3 | 5015 (-) | 0.796 | 0.819 | cattaaGCCACgggac V$AP2_Q3 | 5279 (+) | 1.000 | 0.842 | aacaaCAGGCtcagat V$AP2_Q3 | 5883 (-) | 0.830 | 0.857 | tgctggGCATGgggcc V$AP2_Q3 | 6043 (-) | 0.893 | 0.874 | ggtgcaGCCCTgtgac V$AP2_Q3 | 6141 (-) | 0.871 | 0.824 | ttccatGCCTCtggct V$AP2_Q3 | 6225 (-) | 0.830 | 0.847 | cgctctGCATGctgtc V$AP2_Q3 | 6540 (+) | 0.827 | 0.818 | gacaaACGGCtgaggt V$AP2_Q3 | 6839 (-) | 0.828 | 0.821 | tttactGCCAActggt V$AP2_Q3 | 7455 (-) | 1.000 | 0.886 | tgttttGCCTGcagga V$AP2_Q3 | 7618 (-) | 0.805 | 0.841 | tctccaGCCGCctggc V$AP2_Q3 | 7621 (-) | 1.000 | 0.897 | ccagccGCCTGgcggt V$AP2_Q3 | 7624 (+) | 0.925 | 0.918 | gccgcCTGGCggtaga V$AP2_Q3 | 7663 (-) | 0.830 | 0.846 | tgccagGCATGgtggc V$AP2_Q3 | 7997 (-) | 1.000 | 0.870 | cactgtGCCTGttggt V$AP2_Q3 | 8029 (-) | 0.869 | 0.834 | gtcagaGCTTGctggg V$AP2_Q3 | 8088 (-) | 0.762 | 0.818 | tatacaGCTCTgggtc V$AP2_Q3 | 8364 (-) | 1.000 | 0.908 | tgtgcaGCCTGctgct V$AP2_Q3 | 8538 (+) | 0.903 | 0.859 | cacagTGGGCcgagaa V$AP2_Q3 | 9235 (-) | 0.893 | 0.833 | tccagaGCCCTaggag V$AP2_Q3 | 9278 (+) | 0.869 | 0.902 | gccacCAAGCtgagag V$AP2_Q3 | 9552 (-) | 1.000 | 0.850 | gtagtcGCCTGtcggt V$AP2_Q3 | 9919 (-) | 0.903 | 0.846 | atttttGCCTAtgggt V$AP2_Q3 | 10267 (-) | 0.934 | 0.853 | atttttGCCGGctgag V$AP2_Q3 | 10287 (+) | 1.000 | 0.873 | ctcatCAGGCttcgga V$AP2_Q3 | 10554 (-) | 0.934 | 0.864 | tgtacaGCCGGcggta V$AP2_Q3 | 10556 (+) | 0.934 | 0.841 | tacagCCGGCggtaca V$AP2_Q3 | 10865 (-) | 0.843 | 0.835 | gatgcaGCGTGgtggt V$AP2_Q3 | 10959 (-) | 1.000 | 0.857 | gtctgtGCCTGttgga V$AP2_Q3 | 11283 (+) | 0.893 | 0.867 | ggcatAGGGCaaagaa V$AP2_Q3 | 11809 (-) | 0.903 | 0.821 | aaggtgGCCCAgggtt V$AP2_Q3 | 11912 (+) | 1.000 | 0.875 | ggcttCGGGCccaacg V$AP2_Q3 | 11914 (-) | 0.903 | 0.831 | cttcggGCCCAacgcc V$AP2_Q3 | 11921 (-) | 0.903 | 0.860 | cccaacGCCCAttggc V$AP2_Q3 | 11927 (+) | 0.828 | 0.851 
| gcccaTTGGCtgaaac V$CREB_Q3 | 1845 (+) | 1.000 | 0.993 | CGTCAa V$CREB_Q3 | 2965 (-) | 1.000 | 0.999 | gTGACG V$CREB_Q3 | 4536 (+) | 1.000 | 0.993 | CGTCAa V$CREB_Q3 | 8326 (+) | 1.000 | 0.999 | CGTCAg V$CREB_Q3 | 8615 (+) | 1.000 | 0.993 | CGTCAa V$CREB_Q3 | 11317 (-) | 1.000 | 0.999 | gTGACG V$CREB_Q3 | 11332 (-) | 1.000 | 1.000 | aTGACG V$PIT1_Q6 | 3825 (+) | 1.000 | 0.881 | aaTTCATaataaatttcc V$PIT1_Q6 | 3911 (-) | 1.000 | 0.883 | aataaagctttATGAAtt V$PIT1_Q6 | 5775 (-) | 1.000 | 0.879 | gagagaaaaatATGAAat V$PIT1_Q6 | 10388 (-) | 1.000 | 0.917 | tccatataaaaATGAAcg V$PIT1_Q6 | 11150 (-) | 1.000 | 0.874 | ctctgatgaaaATGAAgc V$E2F_Q2 | 4825 (+) | 0.962 | 0.964 | GGCGGg V$E2F_Q2 | 5332 (-) | 1.000 | 0.946 | tGCGCC V$E2F_Q2 | 5897 (-) | 0.962 | 0.964 | cCCGCC V$E2F_Q2 | 7148 (+) | 1.000 | 0.951 | GGCGCc V$E2F_Q2 | 7148 (-) | 1.000 | 0.951 | gGCGCC V$E2F_Q2 | 8330 (-) | 1.000 | 0.951 | aGCGCC V$E2F_Q2 | 10928 (-) | 1.000 | 0.951 | aGCGCC V$E2A_Q2 | 1359 (+) | 1.000 | 0.932 | tCACCTgatgctcg V$E2A_Q2 | 1387 (-) | 1.000 | 0.943 | ggtagggcAGGTGa V$E2A_Q2 | 1787 (+) | 1.000 | 0.944 | gCACCTgccattca V$E2A_Q2 | 2609 (-) | 0.909 | 0.917 | caagagccATGTGt V$E2A_Q2 | 7023 (-) | 0.942 | 0.925 | cctgcatcAGCTGa V$E2A_Q2 | 8278 (+) | 0.942 | 0.937 | gCAGCTgccctggc V$E2A_Q2 | 8499 (-) | 1.000 | 0.996 | acagggacAGGTGc V$E2A_Q2 | 8964 (-) | 1.000 | 0.934 | ggagaaccAGGTGt V$LEF1_Q2 | 1040 (+) | 1.000 | 0.928 | aCAAAG V$LEF1_Q2 | 1168 (+) | 1.000 | 1.000 | tCAAAG V$LEF1_Q2 | 1236 (-) | 1.000 | 1.000 | CTTTGa V$LEF1_Q2 | 2662 (-) | 1.000 | 1.000 | CTTTGa V$LEF1_Q2 | 4664 (-) | 1.000 | 0.928 | CTTTGt V$LEF1_Q2 | 5356 (-) | 1.000 | 1.000 | CTTTGa V$LEF1_Q2 | 6742 (-) | 1.000 | 1.000 | CTTTGa V$LEF1_Q2 | 8380 (-) | 1.000 | 1.000 | CTTTGa V$LEF1_Q2 | 8570 (-) | 1.000 | 1.000 | CTTTGa V$LEF1_Q2 | 8624 (-) | 1.000 | 1.000 | CTTTGa V$LEF1_Q2 | 8786 (+) | 1.000 | 0.928 | aCAAAG V$LEF1_Q2 | 9027 (-) | 1.000 | 0.928 | CTTTGt V$LEF1_Q2 | 9727 (-) | 1.000 | 0.928 | CTTTGt V$LEF1_Q2 | 10354 (+) | 
1.000 | 1.000 | tCAAAG V$LEF1_Q2 | 11052 (-) | 1.000 | 0.928 | CTTTGt V$LEF1_Q2 | 11380 (-) | 1.000 | 1.000 | CTTTGa V$NF1_Q6_01 | 1974 (-) | 1.000 | 0.964 | ggTTGGCctcaaactca V$NF1_Q6_01 | 3317 (-) | 1.000 | 0.964 | gaTTGGCcttttcctga V$NF1_Q6_01 | 3679 (+) | 1.000 | 0.980 | attgccaagtGCCAAga V$NF1_Q6_01 | 7538 (+) | 1.000 | 0.992 | ttggttgtgaGCCAAct V$NF1_Q6_01 | 11254 (+) | 1.000 | 0.967 | tctgtggtgtGCCAAga V$EGR_Q6 | 1379 (+) | 0.979 | 0.910 | gtGGGCGaggt V$EGR_Q6 | 1492 (+) | 1.000 | 0.915 | gaGGGGGgggg V$EGR_Q6 | 1495 (+) | 1.000 | 0.906 | ggGGGGGggag V$EGR_Q6 | 8228 (-) | 0.959 | 0.922 | ttggCTCCCac V$EGR_Q6 | 10613 (+) | 0.959 | 0.929 | gtGGGAGggag V$EGR_Q6 | 10879 (+) | 0.959 | 0.906 | gtGGGAGcact V$EGR_Q6 | 11843 (+) | 1.000 | 0.929 | gtGGGGGgtgg V$PAX_Q6 | 329 (+) | 1.000 | 0.885 | CTGGAaattcc V$PAX_Q6 | 1561 (+) | 1.000 | 0.829 | CTGGAaatcta V$PAX_Q6 | 1951 (+) | 1.000 | 0.924 | CTGGAactttc V$PAX_Q6 | 2023 (+) | 0.900 | 0.815 | CTGGGattaaa V$PAX_Q6 | 4333 (-) | 1.000 | 0.852 | gtgacaTCCAG V$PAX_Q6 | 4391 (-) | 0.728 | 0.855 | gtgagtTCCAA V$PAX_Q6 | 4883 (+) | 1.000 | 0.822 | CTGGAacattc V$PAX_Q6 | 4947 (+) | 0.900 | 0.871 | CTGGGactttc V$PAX_Q6 | 6086 (-) | 0.857 | 0.924 | gtgagtTCCTG V$PAX_Q6 | 7161 (+) | 1.000 | 0.812 | CTGGAgctggc V$PAX_Q6 | 8023 (-) | 0.702 | 0.841 | gtgagtGTCAG V$PAX_Q6 | 8327 (-) | 0.883 | 0.809 | gtcagcGCCAG V$PAX_Q6 | 8817 (+) | 0.883 | 0.882 | CTGGTactctc V$PAX_Q6 | 8991 (-) | 1.000 | 0.868 | tttaatTCCAG V$PAX_Q6 | 9031 (-) | 0.760 | 0.872 | gtgagtTCAAG V$PAX_Q6 | 9060 (-) | 1.000 | 0.920 | gcaagtTCCAG V$PAX_Q6 | 9229 (-) | 1.000 | 0.892 | gttagcTCCAG V$PAX_Q6 | 9629 (-) | 0.760 | 0.817 | gggagtTCAAG V$FOX_Q2 | 847 (-) | 0.934 | 0.900 | taaatAAATAcga V$FOX_Q2 | 1893 (+) | 0.934 | 0.916 | ttaTATTTatttt V$FOX_Q2 | 2360 (+) | 0.934 | 0.912 | ttgTATTTatttt V$FOX_Q2 | 2409 (-) | 1.000 | 0.915 | aaaaaAAACAaaa V$FOX_Q2 | 2413 (-) | 0.881 | 0.915 | aaaacAAAAAatc V$FOX_Q2 | 4007 (-) | 0.934 | 0.953 | taaatAAATAttc V$FOX_Q2 | 6589 (-) | 
0.934 | 0.894 | aaaacAAATAagg V$FOX_Q2 | 7047 (+) | 0.934 | 0.893 | gacTATTTttttt V$FOX_Q2 | 7425 (+) | 0.934 | 0.925 | gttTATTTgtttt V$FOX_Q2 | 7780 (-) | 0.881 | 0.900 | aaaacAAAAAtta V$FOX_Q2 | 7822 (-) | 1.000 | 0.964 | aaaacAAACAatt V$FOX_Q2 | 7934 (+) | 0.854 | 0.888 | gttTGTGTgttta V$FOX_Q2 | 7969 (+) | 1.000 | 0.928 | agtTGTTTgtttt V$FOX_Q2 | 11471 (+) | 0.934 | 0.918 | tttTATTTgtttt P$KNOX3_01 | 621 (+) | 1.000 | 0.975 | tacaTGACActt P$KNOX3_01 | 1280 (+) | 1.000 | 0.977 | gggaTGACAggg P$KNOX3_01 | 1354 (-) | 0.991 | 0.977 | ctgGGTCAcctg P$KNOX3_01 | 1683 (-) | 1.000 | 0.979 | gggTGTCAgctc P$KNOX3_01 | 2334 (+) | 0.991 | 0.977 | tatgTGACCcta P$KNOX3_01 | 2962 (+) | 0.997 | 0.982 | acagTGACGgca P$KNOX3_01 | 3072 (-) | 0.991 | 0.976 | ctaGGTCAcagg P$KNOX3_01 | 3934 (-) | 0.981 | 0.978 | atcAGTCActtg P$KNOX3_01 | 5084 (+) | 0.981 | 0.977 | gctcTGACTggt P$KNOX3_01 | 6051 (+) | 1.000 | 0.991 | cctgTGACAgcc P$KNOX3_01 | 7507 (-) | 1.000 | 0.982 | gggTGTCAgatg P$KNOX3_01 | 8025 (-) | 1.000 | 0.980 | gagTGTCAgagc P$KNOX3_01 | 8102 (-) | 1.000 | 0.993 | tccTGTCAcaca P$KNOX3_01 | 8323 (-) | 0.997 | 0.978 | ctgCGTCAgcgc P$KNOX3_01 | 9382 (-) | 1.000 | 0.990 | agcTGTCAgccc P$KNOX3_01 | 10357 (-) | 1.000 | 0.986 | aagTGTCAgaga P$KNOX3_01 | 11701 (-) | 1.000 | 0.990 | cacTGTCAcccc P$HAHB4_01 | 3415 (+) | 1.000 | 0.984 | tAATGAttc V$NRF2_Q4 | 130 (+) | 1.000 | 0.942 | ctgctcAGTCAat V$NRF2_Q4 | 1031 (-) | 0.905 | 0.860 | gtTGAATcaacaa V$NRF2_Q4 | 4466 (-) | 1.000 | 0.904 | acTGACTgaccag V$NRF2_Q4 | 6293 (-) | 1.000 | 0.963 | gaTGACTgagcag V$NRF2_Q4 | 6415 (-) | 1.000 | 0.883 | aaTGACTcctctg V$NRF2_Q4 | 10632 (-) | 1.000 | 0.881 | agTGACTgaggaa V$NRF2_Q4 | 11452 (+) | 0.873 | 0.845 | ttggtgAATCAgt V$CEBP_Q2_01 | 491 (+) | 1.000 | 0.955 | aTTGCAacatta V$CEBP_Q2_01 | 805 (+) | 0.971 | 0.926 | aTTTCTccagcc V$CEBP_Q2_01 | 1100 (-) | 0.971 | 0.966 | aatttgAGAAAt V$CEBP_Q2_01 | 1135 (+) | 0.957 | 0.948 | gTTCCTaaaacc V$CEBP_Q2_01 | 1465 (+) | 0.971 | 0.974 | gTTTCTcaatat 
V$CEBP_Q2_01 | 1556 (-) | 0.963 | 0.949 | ctcttcTGGAAa V$CEBP_Q2_01 | 2370 (+) | 0.995 | 0.943 | tTTGCCacagtt V$CEBP_Q2_01 | 2402 (+) | 0.957 | 0.949 | tTTCCTtaaaaa V$CEBP_Q2_01 | 2555 (-) | 0.977 | 0.970 | ctgttgTGAAAt V$CEBP_Q2_01 | 2564 (-) | 0.973 | 0.960 | aatttcCGTAAg V$CEBP_Q2_01 | 2566 (+) | 0.954 | 0.952 | tTTCCGtaagta V$CEBP_Q2_01 | 2578 (+) | 0.982 | 0.928 | tTTACAacaaat V$CEBP_Q2_01 | 2714 (+) | 0.958 | 0.955 | cTTCCCaaatag V$CEBP_Q2_01 | 2805 (+) | 0.976 | 0.965 | cTTACTgaacac V$CEBP_Q2_01 | 3426 (+) | 0.994 | 0.987 | gTTGCTtaattg V$CEBP_Q2_01 | 3437 (-) | 0.977 | 0.970 | gacttaGGTAAc V$CEBP_Q2_01 | 3569 (-) | 0.995 | 0.982 | gcctttGGCAAg V$CEBP_Q2_01 | 3644 (-) | 0.963 | 0.927 | acatggTGGAAt V$CEBP_Q2_01 | 3741 (+) | 0.971 | 0.931 | aTTTCTacattt V$CEBP_Q2_01 | 3926 (+) | 0.982 | 0.923 | aTTACAttatca V$CEBP_Q2_01 | 4209 (+) | 0.995 | 0.945 | cTTGCCtcagtt V$CEBP_Q2_01 | 4272 (+) | 0.977 | 0.970 | cTTACCtaagca V$CEBP_Q2_01 | 4708 (-) | 0.977 | 0.974 | gacttgGGTAAt V$CEBP_Q2_01 | 4797 (+) | 1.000 | 0.996 | cTTGCAtaatct V$CEBP_Q2_01 | 5152 (-) | 0.968 | 0.960 | agtttaCGAAAa V$CEBP_Q2_01 | 5154 (+) | 0.973 | 0.957 | tTTACGaaaagg V$CEBP_Q2_01 | 5444 (+) | 0.995 | 0.932 | aTTGCCatataa V$CEBP_Q2_01 | 5458 (+) | 0.972 | 0.929 | tTTTCCccacat V$CEBP_Q2_01 | 5917 (+) | 0.972 | 0.923 | gTTTCCccagcg V$CEBP_Q2_01 | 6547 (-) | 0.977 | 0.929 | ggctgaGGTAAg V$CEBP_Q2_01 | 6566 (+) | 0.995 | 0.943 | gTTGCCacagtt V$CEBP_Q2_01 | 6573 (-) | 1.000 | 0.987 | cagttaTGCAAa V$CEBP_Q2_01 | 6610 (+) | 0.994 | 0.984 | tTTGCTaaactc V$CEBP_Q2_01 | 7056 (+) | 0.971 | 0.973 | tTTTCTtaattt V$CEBP_Q2_01 | 7345 (+) | 0.977 | 0.934 | aTTTCAacatgt V$CEBP_Q2_01 | 7404 (+) | 0.976 | 0.964 | aTTACTgaagaa V$CEBP_Q2_01 | 8006 (-) | 0.982 | 0.932 | tgttggTGTAAc V$CEBP_Q2_01 | 8073 (+) | 1.000 | 0.982 | aTTGCAgaaact V$CEBP_Q2_01 | 8468 (-) | 0.995 | 0.926 | atctaaGGCAAa V$CEBP_Q2_01 | 8578 (-) | 0.995 | 0.954 | aaatggGGCAAa V$CEBP_Q2_01 | 8623 (-) | 0.971 | 0.963 | actttgAGAAAa V$CEBP_Q2_01 | 8694 
(-) | 0.977 | 0.956 | cctttcTGAAAa V$CEBP_Q2_01 | 8695 (+) | 0.971 | 0.956 | cTTTCTgaaaat V$CEBP_Q2_01 | 8907 (-) | 0.957 | 0.957 | agattcAGGAAt V$CEBP_Q2_01 | 8936 (-) | 0.994 | 0.926 | aactaaAGCAAc V$CEBP_Q2_01 | 9100 (+) | 0.982 | 0.972 | cTTACAaaagca V$CEBP_Q2_01 | 9490 (+) | 1.000 | 0.928 | tTTGCAatagaa V$CEBP_Q2_01 | 9511 (+) | 0.972 | 0.933 | gTTTCCccatgt V$CEBP_Q2_01 | 9601 (-) | 0.958 | 0.958 | gcatttGGGAAg V$CEBP_Q2_01 | 9707 (-) | 0.982 | 0.937 | tggtggTGTAAa V$CEBP_Q2_01 | 9789 (-) | 0.976 | 0.965 | ttgttcAGTAAa V$CEBP_Q2_01 | 9871 (-) | 0.972 | 0.933 | gaatgaGGAAAt V$CEBP_Q2_01 | 10070 (-) | 0.963 | 0.953 | gtcttcTGGAAg V$CEBP_Q2_01 | 10132 (-) | 0.971 | 0.968 | tagttaAGAAAt V$CEBP_Q2_01 | 10371 (+) | 0.977 | 0.974 | cTTTCAaaatgt V$CEBP_Q2_01 | 10377 (-) | 1.000 | 0.955 | aaatgtTGCAAt V$CEBP_Q2_01 | 10686 (-) | 0.995 | 0.924 | gactctGGCAAc V$CEBP_Q2_01 | 10703 (-) | 0.977 | 0.966 | tgtttaTGAAAc V$CEBP_Q2_01 | 10968 (-) | 0.994 | 0.942 | tgttggAGCAAg V$CEBP_Q2_01 | 11150 (-) | 0.977 | 0.924 | ctctgaTGAAAa V$CEBP_Q2_01 | 11284 (-) | 0.995 | 0.934 | gcatagGGCAAa V$CEBP_Q2_01 | 11668 (+) | 0.977 | 0.972 | tTTTCAtaacca V$CEBP_Q2_01 | 11728 (+) | 0.971 | 0.964 | tTTTCTcaaaca V$CEBP_Q2_01 | 11780 (-) | 0.982 | 0.929 | acttgaTGTAAg V$MYB_Q5_01 | 68 (-) | 0.984 | 0.958 | ttaCTGTTg V$MYB_Q5_01 | 127 (+) | 1.000 | 0.964 | tAACTGctc V$MYB_Q5_01 | 160 (-) | 0.992 | 0.959 | tctCCGTTc V$MYB_Q5_01 | 451 (-) | 0.984 | 0.978 | tgtCTGTTt V$MYB_Q5_01 | 591 (+) | 0.984 | 0.982 | tAACAGact V$MYB_Q5_01 | 792 (+) | 0.989 | 0.953 | tAACCGctg V$MYB_Q5_01 | 983 (-) | 1.000 | 0.970 | tcaCAGTTa V$MYB_Q5_01 | 1026 (-) | 1.000 | 1.000 | tggCAGTTg V$MYB_Q5_01 | 1142 (-) | 0.992 | 0.956 | aaaCCGTTa V$MYB_Q5_01 | 1862 (+) | 1.000 | 0.974 | cAACTGaag V$MYB_Q5_01 | 2374 (-) | 1.000 | 0.965 | ccaCAGTTt V$MYB_Q5_01 | 2552 (-) | 0.984 | 0.958 | tttCTGTTg V$MYB_Q5_01 | 2678 (-) | 0.992 | 0.958 | aacCCGTTg V$MYB_Q5_01 | 2825 (-) | 1.000 | 0.974 | gtcCAGTTg V$MYB_Q5_01 | 2867 (+) | 0.984 | 
0.954 | gAACAGaac V$MYB_Q5_01 | 3344 (-) | 0.984 | 0.954 | ctcCTGTTc V$MYB_Q5_01 | 3399 (-) | 1.000 | 0.972 | ctcCAGTTa V$MYB_Q5_01 | 3445 (-) | 1.000 | 0.964 | taaCAGTTa V$MYB_Q5_01 | 3509 (-) | 0.984 | 0.957 | ctgCTGTTa V$MYB_Q5_01 | 3545 (-) | 0.984 | 0.954 | gttCTGTTc V$MYB_Q5_01 | 3608 (-) | 1.000 | 0.968 | ctgCAGTTt V$MYB_Q5_01 | 4213 (-) | 1.000 | 0.969 | cctCAGTTa V$MYB_Q5_01 | 4321 (+) | 1.000 | 0.965 | aAACTGggt V$MYB_Q5_01 | 4539 (+) | 0.984 | 0.984 | cAACAGcct V$MYB_Q5_01 | 4565 (-) | 1.000 | 0.971 | acaCAGTTg V$MYB_Q5_01 | 4571 (-) | 0.989 | 0.961 | ttgCGGTTa V$MYB_Q5_01 | 4944 (+) | 1.000 | 0.967 | gAACTGgga V$MYB_Q5_01 | 5148 (-) | 1.000 | 0.994 | aggCAGTTt V$MYB_Q5_01 | 5281 (+) | 0.984 | 0.984 | cAACAGgct V$MYB_Q5_01 | 5729 (+) | 1.000 | 0.968 | aAACTGgaa V$MYB_Q5_01 | 6106 (-) | 0.984 | 0.958 | gtcCTGTTg V$MYB_Q5_01 | 6388 (+) | 1.000 | 0.995 | gAACTGgcc V$MYB_Q5_01 | 6570 (-) | 1.000 | 0.969 | ccaCAGTTa V$MYB_Q5_01 | 6847 (+) | 1.000 | 0.966 | cAACTGgtg V$MYB_Q5_01 | 7187 (+) | 0.992 | 0.988 | gAACGGcca V$MYB_Q5_01 | 7490 (-) | 0.984 | 0.980 | cgtCTGTTc V$MYB_Q5_01 | 7523 (+) | 1.000 | 0.969 | gAACTGaag V$MYB_Q5_01 | 7532 (-) | 1.000 | 0.974 | ttaCAGTTg V$MYB_Q5_01 | 7550 (+) | 1.000 | 0.971 | cAACTGtgt V$MYB_Q5_01 | 7602 (+) | 1.000 | 0.962 | gAACTGctg V$MYB_Q5_01 | 7723 (-) | 1.000 | 0.960 | cagCAGTTt V$MYB_Q5_01 | 8002 (-) | 0.984 | 0.984 | tgcCTGTTg V$MYB_Q5_01 | 8196 (-) | 0.984 | 0.978 | tgcCTGTTt V$MYB_Q5_01 | 8214 (-) | 1.000 | 0.967 | cctCAGTTc V$MYB_Q5_01 | 8672 (-) | 0.989 | 0.953 | cacCGGTTa V$MYB_Q5_01 | 8725 (+) | 1.000 | 0.969 | gAACTGtaa V$MYB_Q5_01 | 8847 (+) | 0.992 | 0.954 | gAACGGttt V$MYB_Q5_01 | 9119 (+) | 0.989 | 0.954 | aAACCGgga V$MYB_Q5_01 | 9132 (-) | 1.000 | 0.968 | cttCAGTTt V$MYB_Q5_01 | 9433 (+) | 0.984 | 0.956 | tAACAGgaa V$MYB_Q5_01 | 9561 (-) | 0.989 | 0.983 | tgtCGGTTt V$MYB_Q5_01 | 9765 (+) | 0.984 | 0.978 | aAACAGtca V$MYB_Q5_01 | 10058 (+) | 1.000 | 0.969 | gAACTGaac V$MYB_Q5_01 | 10063 (+) | 0.989 | 0.956 | 
gAACCGggt V$MYB_Q5_01 | 10194 (-) | 1.000 | 0.995 | agtCAGTTc V$MYB_Q5_01 | 10310 (+) | 1.000 | 0.964 | tAACTGttt V$MYB_Q5_01 | 10665 (+) | 0.984 | 0.958 | cAACAGgag V$MYB_Q5_01 | 10964 (-) | 0.984 | 0.984 | tgcCTGTTg V$MYB_Q5_01 | 11036 (-) | 0.984 | 0.955 | ccaCTGTTg V$MYB_Q5_01 | 11067 (-) | 1.000 | 0.964 | gatCAGTTa V$MYB_Q5_01 | 11137 (-) | 1.000 | 0.965 | actCAGTTt V$MYB_Q5_01 | 11219 (+) | 1.000 | 0.996 | gAACTGccc V$MYB_Q5_01 | 11224 (-) | 1.000 | 0.965 | gccCAGTTt V$MYB_Q5_01 | 11237 (+) | 1.000 | 0.972 | tAACTGgag V$MYB_Q5_01 | 11493 (-) | 0.989 | 0.963 | atgCGGTTg V$MYB_Q5_01 | 11516 (-) | 0.992 | 0.966 | ttaCCGTTg V$MYB_Q5_01 | 11653 (-) | 0.984 | 0.956 | gtaCTGTTa V$MYB_Q5_01 | 11712 (+) | 1.000 | 1.000 | cAACTGccc V$MYB_Q5_01 | 11764 (+) | 0.984 | 0.956 | tAACAGgat V$AP2_Q6_01 | 100 (-) | 1.000 | 0.991 | cGCCTGggggtgg V$AP2_Q6_01 | 1331 (+) | 0.869 | 0.873 | actccccAGGGCt V$AP2_Q6_01 | 2479 (-) | 0.869 | 0.893 | gGCCCTgggttcg V$AP2_Q6_01 | 3532 (-) | 0.986 | 0.963 | gGCCCGaggtttg V$AP2_Q6_01 | 3566 (+) | 0.863 | 0.886 | gtggcctTTGGCa V$AP2_Q6_01 | 3568 (-) | 0.883 | 0.903 | gGCCTTtggcaag V$AP2_Q6_01 | 5020 (-) | 0.884 | 0.916 | aGCCACgggactg V$AP2_Q6_01 | 6144 (+) | 0.980 | 0.969 | catgcctCTGGCt V$AP2_Q6_01 | 6146 (-) | 0.904 | 0.881 | tGCCTCtggctct V$AP2_Q6_01 | 6418 (+) | 0.980 | 0.961 | gactcctCTGGCt V$AP2_Q6_01 | 7191 (-) | 0.884 | 0.920 | gGCCACgggggtg V$AP2_Q6_01 | 7622 (+) | 0.980 | 0.877 | cagccgcCTGGCg V$AP2_Q6_01 | 7626 (-) | 1.000 | 0.881 | cGCCTGgcggtag V$AP2_Q6_01 | 8159 (+) | 0.904 | 0.904 | gcttccaGAGGCa V$AP2_Q6_01 | 9240 (-) | 0.869 | 0.898 | aGCCCTaggagag V$AP2_Q6_01 | 9330 (-) | 0.904 | 0.899 | aGCCTCtggagga V$AP2_Q6_01 | 9389 (-) | 0.890 | 0.906 | aGCCCCtggaatg V$AP2_Q6_01 | 9924 (-) | 0.883 | 0.880 | tGCCTAtgggtaa V$AP2_Q6_01 | 10559 (-) | 0.986 | 0.946 | aGCCGGcggtaca V$AP2_Q6_01 | 11814 (-) | 0.869 | 0.893 | gGCCCAgggttgg V$CREB_Q2_01 | 1237 (+) | 0.904 | 0.899 | tttgaCATCAgggc V$CREB_Q2_01 | 1840 (+) | 1.000 | 0.916 | 
ggggtCGTCAacac V$CREB_Q2_01 | 2962 (-) | 1.000 | 0.917 | acagTGACGgcaga V$CREB_Q2_01 | 4531 (+) | 1.000 | 0.922 | aaatgCGTCAacag V$CREB_Q2_01 | 8321 (+) | 1.000 | 0.929 | ttctgCGTCAgcgc V$CREB_Q2_01 | 8610 (+) | 1.000 | 0.909 | cattcCGTCAatca V$CREB_Q2_01 | 11314 (-) | 1.000 | 0.912 | acggTGACGacgat V$CREB_Q2_01 | 11329 (-) | 1.000 | 0.899 | atgaTGACGatgta V$CREB_Q4_01 | 1239 (-) | 0.914 | 0.920 | tgaCATCAggg V$CREB_Q4_01 | 1842 (-) | 1.000 | 0.915 | ggtCGTCAaca V$CREB_Q4_01 | 2963 (+) | 1.000 | 0.939 | cagTGACGgca V$CREB_Q4_01 | 4533 (-) | 1.000 | 0.923 | atgCGTCAaca V$CREB_Q4_01 | 8323 (-) | 1.000 | 0.941 | ctgCGTCAgcg V$CREB_Q4_01 | 8612 (-) | 1.000 | 0.914 | ttcCGTCAatc V$CREB_Q4_01 | 11315 (+) | 1.000 | 0.936 | cggTGACGacg V$CREB_Q4_01 | 11330 (+) | 1.000 | 0.914 | tgaTGACGatg V$E2F_Q6_01 | 7088 (-) | 0.949 | 0.926 | gctTTGCGcgca V$SRF_Q5_01 | 11538 (+) | 0.990 | 0.926 | ccacaTATGGtagat V$AP1_Q2_01 | 1350 (-) | 0.978 | 0.962 | ttgcctgGGTCA V$AP1_Q2_01 | 3436 (+) | 1.000 | 0.958 | TGACTtaggtaa V$AP1_Q2_01 | 3727 (+) | 1.000 | 0.961 | TGACTaaaatta V$AP1_Q2_01 | 3748 (-) | 1.000 | 0.956 | catttttAGTCA V$AP1_Q2_01 | 4529 (-) | 0.978 | 0.966 | acaaatgCGTCA V$AP1_Q2_01 | 6324 (+) | 1.000 | 0.954 | TGACTgaagcac V$AP1_Q2_01 | 8319 (-) | 0.978 | 0.960 | gcttctgCGTCA V$AP1_Q2_01 | 9317 (-) | 0.978 | 0.970 | ggcgttgGGTCA V$AP1_Q2_01 | 10031 (-) | 1.000 | 0.970 | gtttgtgAGTCA V$AP1_Q6_01 | 133 (-) | 1.000 | 0.964 | ctcAGTCAa V$AP1_Q6_01 | 1078 (-) | 0.961 | 0.945 | gtaGGTCAt V$AP1_Q6_01 | 1087 (-) | 0.961 | 0.938 | tttGGTCAa V$AP1_Q6_01 | 1255 (-) | 0.963 | 0.937 | ttcTGTCAt V$AP1_Q6_01 | 1354 (-) | 0.961 | 0.962 | ctgGGTCAc V$AP1_Q6_01 | 3072 (-) | 0.961 | 0.942 | ctaGGTCAc V$AP1_Q6_01 | 3435 (+) | 1.000 | 0.974 | tTGACTtag V$AP1_Q6_01 | 3726 (+) | 1.000 | 0.977 | gTGACTaaa V$AP1_Q6_01 | 3752 (-) | 1.000 | 0.980 | tttAGTCAt V$AP1_Q6_01 | 3934 (-) | 1.000 | 0.967 | atcAGTCAc V$AP1_Q6_01 | 4467 (+) | 1.000 | 0.966 | cTGACTgac V$AP1_Q6_01 | 4533 (-) | 0.960 | 0.957 | 
atgCGTCAa V$AP1_Q6_01 | 5358 (+) | 0.963 | 0.940 | tTGACAaaa V$AP1_Q6_01 | 6294 (+) | 1.000 | 0.970 | aTGACTgag V$AP1_Q6_01 | 6323 (+) | 1.000 | 0.966 | cTGACTgaa V$AP1_Q6_01 | 6416 (+) | 1.000 | 0.946 | aTGACTcct V$AP1_Q6_01 | 6556 (-) | 1.000 | 0.935 | aagAGTCAt V$AP1_Q6_01 | 8323 (-) | 0.960 | 0.960 | ctgCGTCAg V$AP1_Q6_01 | 9321 (-) | 0.961 | 0.961 | ttgGGTCAg V$AP1_Q6_01 | 10035 (+) | 0.932 | 0.935 | gTGAGTcac V$AP1_Q6_01 | 10035 (-) | 1.000 | 0.997 | gtgAGTCAc V$AP1_Q6_01 | 10633 (+) | 1.000 | 0.967 | gTGACTgag V$AP1_Q6_01 | 11455 (-) | 0.930 | 0.933 | gtgAATCAg V$AP1_Q4_01 | 134 (+) | 1.000 | 0.964 | tcAGTCAa V$AP1_Q4_01 | 1355 (+) | 0.957 | 0.959 | tgGGTCAc V$AP1_Q4_01 | 3435 (-) | 1.000 | 0.975 | tTGACTta V$AP1_Q4_01 | 3726 (-) | 1.000 | 0.976 | gTGACTaa V$AP1_Q4_01 | 3753 (+) | 1.000 | 0.979 | ttAGTCAt V$AP1_Q4_01 | 3935 (+) | 1.000 | 0.967 | tcAGTCAc V$AP1_Q4_01 | 4467 (-) | 1.000 | 0.968 | cTGACTga V$AP1_Q4_01 | 4534 (+) | 0.957 | 0.956 | tgCGTCAa V$AP1_Q4_01 | 6294 (-) | 1.000 | 0.971 | aTGACTga V$AP1_Q4_01 | 6323 (-) | 1.000 | 0.968 | cTGACTga V$AP1_Q4_01 | 8324 (+) | 0.957 | 0.959 | tgCGTCAg V$AP1_Q4_01 | 9322 (+) | 0.957 | 0.959 | tgGGTCAg V$AP1_Q4_01 | 10036 (+) | 1.000 | 0.997 | tgAGTCAc V$AP1_Q4_01 | 10633 (-) | 1.000 | 0.967 | gTGACTga V$AP4_Q6_01 | 1787 (-) | 0.924 | 0.936 | gCACCTgcc V$AP4_Q6_01 | 2970 (+) | 0.924 | 0.936 | ggcAGATGc V$AP4_Q6_01 | 7028 (+) | 1.000 | 0.962 | atcAGCTGa V$AP4_Q6_01 | 7029 (-) | 1.000 | 0.934 | tCAGCTgaa V$AP4_Q6_01 | 7680 (+) | 1.000 | 0.957 | cacAGCTGt V$AP4_Q6_01 | 7681 (-) | 1.000 | 0.947 | aCAGCTgta V$AP4_Q6_01 | 8277 (+) | 1.000 | 0.966 | tgcAGCTGc V$AP4_Q6_01 | 8278 (-) | 1.000 | 0.995 | gCAGCTgcc V$AP4_Q6_01 | 8370 (+) | 0.924 | 0.941 | gccTGCTGc V$AP4_Q6_01 | 8421 (+) | 1.000 | 0.973 | agcAGCTGt V$AP4_Q6_01 | 8422 (-) | 1.000 | 0.997 | gCAGCTgtt V$AP4_Q6_01 | 8504 (+) | 0.924 | 0.938 | gacAGGTGc V$MYOD_Q6_01 | 1354 (-) | 1.000 | 0.965 | ctgggtcACCTGatgctc V$MYOD_Q6_01 | 1388 (+) | 1.000 | 0.957 | 
gtagggCAGGTgaggaga V$MYOD_Q6_01 | 4335 (+) | 1.000 | 0.960 | gacatcCAGGTggtggag V$MYOD_Q6_01 | 8500 (+) | 1.000 | 0.985 | cagggaCAGGTgcgggac V$MYOD_Q6_01 | 8965 (+) | 1.000 | 0.971 | gagaacCAGGTgtggtgg V$OCT1_Q5_01 | 993 (+) | 0.985 | 0.983 | taaTTTTCatg V$OCT1_Q5_01 | 3109 (+) | 0.981 | 0.981 | taaTTTCCatg V$OCT1_Q5_01 | 3129 (+) | 0.986 | 0.989 | taaTTTACatt V$OCT1_Q5_01 | 3835 (+) | 0.981 | 0.965 | aaaTTTCCatt V$OCT1_Q5_01 | 4071 (-) | 0.986 | 0.909 | tatGTAAAatg V$OCT1_Q5_01 | 5341 (+) | 1.000 | 0.911 | agtTTTGCatc V$OCT1_Q5_01 | 6577 (-) | 1.000 | 0.943 | tatGCAAAaca V$OCT1_Q5_01 | 6969 (-) | 0.981 | 0.921 | gatGGAAAgga V$OCT1_Q5_01 | 9934 (+) | 1.000 | 0.992 | taaTTTGCatg V$OCT1_Q5_01 | 10911 (-) | 1.000 | 1.000 | tatGCAAAtaa V$OCT1_Q5_01 | 11154 (-) | 0.985 | 0.980 | gatGAAAAtga V$OCT1_Q5_01 | 11337 (-) | 0.986 | 0.959 | gatGTAAAtgc V$OCT1_Q5_01 | 11665 (+) | 0.985 | 0.966 | caaTTTTCata V$SP1_Q6_01 | 1489 (+) | 0.949 | 0.935 | agGGAGGggg V$SP1_Q6_01 | 1493 (+) | 0.934 | 0.924 | agGGGGGggg V$SP1_Q6_01 | 1494 (+) | 0.934 | 0.942 | ggGGGGGggg V$SP1_Q6_01 | 1495 (+) | 0.934 | 0.941 | ggGGGGGgga V$SP1_Q6_01 | 1496 (+) | 0.934 | 0.910 | ggGGGGGgag V$SP1_Q6_01 | 1500 (+) | 0.949 | 0.921 | ggGGAGGgag V$SP1_Q6_01 | 5895 (-) | 1.000 | 0.968 | ggcCCGCCcc V$SP1_Q6_01 | 10614 (+) | 0.949 | 0.904 | tgGGAGGgag V$SP1_Q6_01 | 11847 (+) | 0.945 | 0.951 | ggGGTGGggt V$SP1_Q6_01 | 11872 (+) | 0.949 | 0.930 | ggGGAGGgac V$SP1_Q4_01 | 1487 (+) | 0.964 | 0.952 | agagGGAGGgggg V$SP1_Q4_01 | 1491 (+) | 0.947 | 0.939 | ggagGGGGGgggg V$SP1_Q4_01 | 1492 (+) | 0.947 | 0.944 | gaggGGGGGggga V$SP1_Q4_01 | 1493 (+) | 0.947 | 0.952 | agggGGGGGggag V$SP1_Q4_01 | 1494 (+) | 0.947 | 0.919 | ggggGGGGGgagg V$SP1_Q4_01 | 1498 (+) | 0.964 | 0.931 | ggggGGAGGgagg V$SP1_Q4_01 | 5894 (-) | 1.000 | 0.958 | gggcCCGCCccta V$SP1_Q4_01 | 11845 (+) | 0.958 | 0.959 | ggggGGTGGggta V$SP1_Q4_01 | 11870 (+) | 0.964 | 0.937 | cgggGGAGGgact V$SP1_Q2_01 | 1488 (-) | 0.972 | 0.955 | gagGGAGGgg V$SP1_Q2_01 | 1492 
(-) | 0.957 | 0.945 | gagGGGGGgg V$SP1_Q2_01 | 1493 (-) | 0.957 | 0.963 | aggGGGGGgg V$SP1_Q2_01 | 1494 (-) | 0.957 | 0.969 | gggGGGGGgg V$SP1_Q2_01 | 1495 (-) | 0.957 | 0.930 | gggGGGGGga V$SP1_Q2_01 | 1496 (-) | 0.957 | 0.918 | gggGGGGGag V$SP1_Q2_01 | 1499 (-) | 0.972 | 0.941 | gggGGAGGga V$SP1_Q2_01 | 1503 (-) | 0.972 | 0.917 | gagGGAGGga V$SP1_Q2_01 | 1507 (-) | 0.972 | 0.917 | gagGGAGGga V$SP1_Q2_01 | 1511 (-) | 0.972 | 0.917 | gagGGAGGga V$SP1_Q2_01 | 5896 (+) | 1.000 | 0.953 | gcCCGCCcct V$SP1_Q2_01 | 10613 (-) | 0.972 | 0.920 | gtgGGAGGga V$SP1_Q2_01 | 11846 (-) | 0.966 | 0.976 | gggGGTGGgg V$SP1_Q2_01 | 11871 (-) | 0.972 | 0.941 | gggGGAGGga I$ZESTE_Q2_01 | 7469 (+) | 1.000 | 0.994 | gatttGAGTGt I$ZESTE_Q2_01 | 9191 (-) | 1.000 | 0.990 | cCACTCaagag V$NFAT_Q4_01 | 329 (+) | 1.000 | 0.996 | ctGGAAAttc V$NFAT_Q4_01 | 1561 (+) | 1.000 | 0.982 | ctGGAAAtct V$NFAT_Q4_01 | 1955 (-) | 1.000 | 0.985 | aacTTTCCtt V$NFAT_Q4_01 | 2563 (-) | 1.000 | 0.981 | aaaTTTCCgt V$NFAT_Q4_01 | 3109 (-) | 1.000 | 0.991 | taaTTTCCat V$NFAT_Q4_01 | 3835 (-) | 1.000 | 0.997 | aaaTTTCCat V$NFAT_Q4_01 | 4689 (+) | 1.000 | 0.987 | caGGAAActt V$NFAT_Q4_01 | 10587 (-) | 1.000 | 0.982 | caaTTTCCta V$NFAT_Q4_01 | 11090 (-) | 1.000 | 1.000 | aatTTTCCac V$NFAT_Q4_01 | 11272 (+) | 1.000 | 0.980 | aaGGAAAtta V$E2F1_Q3_01 | 1380 (+) | 0.834 | 0.847 | tggGCGAGgtagggca V$E2F1_Q3_01 | 2463 (-) | 0.936 | 0.850 | gcctagcaAGCGCgaa V$E2F1_Q3_01 | 4823 (+) | 0.871 | 0.813 | cagGCGGGgtagggaa V$E2F1_Q3_01 | 5820 (+) | 0.834 | 0.808 | ctgGCGTGgcactgtg V$E2F1_Q3_01 | 9176 (+) | 0.830 | 0.843 | tgaACGCCacagagac V$E2F1_Q6_01 | 2471 (-) | 1.000 | 0.912 | aGCGCGaagg V$E2F1_Q6_01 | 9121 (-) | 0.870 | 0.882 | aCCGGGaaaa V$MEF2_Q6_01 | 3121 (+) | 0.962 | 0.936 | ggcTATATtaat P$TAF1_01 | 735 (+) | 1.000 | 0.992 | accACGTGgt P$TAF1_01 | 735 (-) | 1.000 | 0.992 | acCACGTggt P$CPRF3_01 | 735 (+) | 1.000 | 0.982 | acCACGTggt P$CPRF3_01 | 735 (-) | 1.000 | 0.982 | accACGTGgt P$CPRF2_01 | 735 (+) | 1.000 | 1.000 | 
acCACGTggt P$CPRF2_01 | 735 (-) | 1.000 | 1.000 | accACGTGgt V$CP2_02 | 102 (+) | 0.941 | 0.862 | CCTGGgggtggccag V$CP2_02 | 103 (-) | 0.968 | 0.869 | ctgggggtggCCAGT V$CP2_02 | 121 (+) | 1.000 | 0.777 | GCTGGctaactgctc V$CP2_02 | 133 (-) | 0.957 | 0.790 | ctcagtcaatCCAGA V$CP2_02 | 328 (+) | 1.000 | 0.826 | GCTGGaaattccctc V$CP2_02 | 1199 (+) | 1.000 | 0.873 | GCTGGttatacccat V$CP2_02 | 1200 (-) | 0.743 | 0.805 | ctggttatacCCATT V$CP2_02 | 1580 (-) | 1.000 | 0.773 | gcatggagagCCAGC V$CP2_02 | 1714 (+) | 0.705 | 0.767 | TCTGCcttttccggg V$CP2_02 | 1724 (-) | 0.759 | 0.804 | ccgggctcatCCTGA V$CP2_02 | 1800 (+) | 0.770 | 0.809 | ACAGGattcatcagg V$CP2_02 | 1950 (+) | 1.000 | 0.800 | GCTGGaactttcctt V$CP2_02 | 2680 (-) | 0.743 | 0.793 | cccgttgcatCCTGG V$CP2_02 | 2764 (+) | 0.732 | 0.790 | TCTAGattccgcatg V$CP2_02 | 2794 (+) | 1.000 | 0.822 | GCTGGtcttctctta V$CP2_02 | 2817 (-) | 0.968 | 0.874 | cgtgctctgtCCAGT V$CP2_02 | 3046 (-) | 0.802 | 0.780 | ccgggacataCCTGC V$CP2_02 | 3645 (-) | 0.700 | 0.762 | catggtggaaTCAGT V$CP2_02 | 3848 (-) | 0.968 | 0.787 | cataaaatatCCAGT V$CP2_02 | 4313 (+) | 0.715 | 0.794 | ACTGCataaaactgg V$CP2_02 | 4329 (+) | 0.957 | 0.867 | TCTGGtgacatccag V$CP2_02 | 4330 (-) | 0.941 | 0.924 | ctggtgacatCCAGG V$CP2_02 | 4343 (+) | 0.802 | 0.765 | GGTGGtggaggcagg V$CP2_02 | 4379 (+) | 0.957 | 0.783 | TCTGGctacatagtg V$CP2_02 | 4757 (-) | 1.000 | 0.796 | tctgtaggtcCCAGC V$CP2_02 | 4778 (+) | 0.957 | 0.807 | TCTGGataaagattg V$CP2_02 | 4882 (+) | 0.957 | 0.769 | TCTGGaacattccat V$CP2_02 | 4946 (+) | 0.968 | 0.829 | ACTGGgactttctgc V$CP2_02 | 5004 (-) | 0.743 | 0.790 | cttgtattacCCATT V$CP2_02 | 5029 (+) | 0.968 | 0.876 | ACTGGgatgctcagg V$CP2_02 | 5080 (+) | 1.000 | 0.978 | GCTGGctctgactgg V$CP2_02 | 5191 (+) | 0.802 | 0.761 | GCAGGatttgtctgt V$CP2_02 | 5549 (+) | 0.690 | 0.779 | TCTGTctctcactag V$CP2_02 | 5987 (+) | 0.957 | 0.804 | TCTGGattagggatg V$CP2_02 | 6040 (+) | 0.957 | 0.861 | TCTGGtgcagccctg V$CP2_02 | 6041 (-) | 0.770 | 0.766 | ctggtgcagcCCTGT 
V$CP2_02 | 6097 (+) | 0.759 | 0.807 | TGTGGatcagtcctg V$CP2_02 | 6123 (+) | 0.802 | 0.782 | GCTTGttttctcggc V$CP2_02 | 6183 (-) | 0.743 | 0.815 | ccagagacacCCATT V$CP2_02 | 6202 (+) | 0.716 | 0.770 | CCGGGgtatgccaag V$CP2_02 | 6239 (+) | 0.957 | 0.860 | TCTGGctacggcttt V$CP2_02 | 6424 (+) | 0.957 | 0.801 | TCTGGctttcatcgg V$CP2_02 | 6434 (-) | 0.968 | 0.853 | atcggtcaacCCAGT V$CP2_02 | 6489 (+) | 0.968 | 0.817 | ACTGGattgcagatg V$CP2_02 | 6626 (-) | 0.957 | 0.824 | agtggggcacCCAGA V$CP2_02 | 6769 (-) | 0.733 | 0.798 | catgaacaaaACAGC V$CP2_02 | 6793 (+) | 1.000 | 0.905 | GCTGGctcttcctcc V$CP2_02 | 6849 (+) | 0.968 | 0.818 | ACTGGtgagcacgtc V$CP2_02 | 6988 (+) | 1.000 | 0.769 | GCTGGtgaggagatg V$CP2_02 | 7160 (+) | 0.941 | 0.761 | CCTGGagctggcgtt V$CP2_02 | 7744 (+) | 1.000 | 0.810 | GCTGGctacattgtg V$CP2_02 | 7843 (+) | 0.957 | 0.801 | TCTGGcaaaagcgac V$CP2_02 | 8035 (+) | 0.802 | 0.849 | GCTTGctgggccttg V$CP2_02 | 8169 (-) | 0.957 | 0.820 | gcaggtagcaCCAGA V$CP2_02 | 8184 (+) | 0.770 | 0.789 | ACTTGtttgctctgc V$CP2_02 | 8324 (-) | 0.968 | 0.768 | tgcgtcagcgCCAGT V$CP2_02 | 8816 (+) | 0.957 | 0.770 | TCTGGtactctcgat V$CP2_02 | 8960 (-) | 0.941 | 0.879 | gtagggagaaCCAGG V$CP2_02 | 9057 (-) | 0.941 | 0.770 | atggcaagttCCAGG V$CP2_02 | 9226 (-) | 0.957 | 0.874 | caagttagctCCAGA V$CP2_02 | 9269 (-) | 0.802 | 0.799 | gcagcgacagCCACC V$CP2_02 | 9297 (+) | 0.802 | 0.769 | GCTTGtgtgcccagg V$CP2_02 | 9334 (+) | 0.957 | 0.853 | TCTGGaggacccctg V$CP2_02 | 9532 (-) | 0.770 | 0.842 | ctagaacaaaCAAGT V$CP2_02 | 9578 (+) | 0.802 | 0.764 | GGTGGtatagccctg V$CP2_02 | 9617 (+) | 0.802 | 0.771 | GGTGGgaggatcggg V$CP2_02 | 9968 (+) | 0.957 | 0.794 | TCTGGttcctatggg V$CP2_02 | 10247 (+) | 0.759 | 0.821 | TCAGGcttgtgcagg V$CP2_02 | 10689 (+) | 0.957 | 0.771 | TCTGGcaacacccgt V$CP2_02 | 10778 (-) | 0.941 | 0.832 | tctgtcggacCCAGG V$CP2_02 | 10845 (-) | 0.743 | 0.800 | caggaagaagCCATT V$CP2_02 | 10966 (-) | 0.743 | 0.794 | cctgttggagCAAGG V$CP2_02 | 11129 (-) | 0.700 | 0.772 | 
ctcgctgcacTCAGT V$CP2_02 | 11216 (-) | 0.968 | 0.785 | tttgaactgcCCAGT P$PCF2_01 | 1406 (-) | 1.000 | 0.997 | GTGGGtccat P$PCF2_01 | 5892 (+) | 0.877 | 0.905 | tggggCCCGC P$AGL15_01 | 1959 (+) | 0.880 | 0.890 | ttccttttTTGAGcag P$AGL15_01 | 1959 (-) | 0.894 | 0.881 | ttcCTTTTttgagcag P$AGL15_01 | 9404 (+) | 0.900 | 0.877 | gttctatgTCTAGaaa P$AGL15_01 | 9404 (-) | 0.827 | 0.878 | gttCTATGtctagaaa P$AGL15_01 | 11535 (+) | 0.946 | 0.893 | atcccacaTATGGtag P$AGL15_01 | 11535 (-) | 0.852 | 0.889 | atcCCACAtatggtag P$AG_02 | 165 (+) | 0.849 | 0.715 | GTTCCcaatgctggca P$AG_02 | 166 (+) | 0.972 | 0.817 | TTCCCaatgctggcat P$AG_02 | 166 (-) | 0.857 | 0.817 | ttcccaatgctGGCAT P$AG_02 | 320 (-) | 1.000 | 0.712 | agtcctgtgctGGAAA P$AG_02 | 554 (+) | 0.745 | 0.724 | TAACCttttctaggag P$AG_02 | 554 (-) | 0.718 | 0.734 | taaccttttctAGGAG P$AG_02 | 875 (+) | 0.678 | 0.735 | ATGCAcaaacacgaaa P$AG_02 | 875 (-) | 0.821 | 0.732 | atgcacaaacaCGAAA P$AG_02 | 908 (+) | 0.869 | 0.711 | TTTCTttcatgcgtag P$AG_02 | 908 (-) | 0.697 | 0.714 | tttctttcatgCGTAG P$AG_02 | 1135 (+) | 0.849 | 0.718 | GTTCCtaaaaccgtta P$AG_02 | 1135 (-) | 0.565 | 0.712 | gttcctaaaacCGTTA P$AG_02 | 1148 (+) | 0.869 | 0.710 | TTACTttatgtagttc P$AG_02 | 1166 (+) | 0.821 | 0.718 | TTTCAaagaaaggaga P$AG_02 | 1166 (-) | 0.724 | 0.708 | tttcaaagaaaGGAGA P$AG_02 | 1715 (+) | 0.871 | 0.712 | CTGCCttttccgggct P$AG_02 | 1715 (-) | 0.559 | 0.706 | ctgccttttccGGGCT P$AG_02 | 1791 (+) | 0.871 | 0.715 | CTGCCattcacaggat P$AG_02 | 1791 (-) | 0.704 | 0.719 | ctgccattcacAGGAT P$AG_02 | 1959 (+) | 0.842 | 0.780 | TTCCTtttttgagcag P$AG_02 | 1959 (-) | 0.740 | 0.782 | ttccttttttgAGCAG P$AG_02 | 2350 (+) | 0.746 | 0.708 | CTTCTcaaacttgtat P$AG_02 | 2350 (-) | 0.683 | 0.705 | cttctcaaactTGTAT P$AG_02 | 2714 (+) | 0.876 | 0.753 | CTTCCcaaatagataa P$AG_02 | 2714 (-) | 0.635 | 0.750 | cttcccaaataGATAA P$AG_02 | 2715 (+) | 0.972 | 0.717 | TTCCCaaatagataat P$AG_02 | 2786 (+) | 0.876 | 0.709 | CTTCCttagctggtct P$AG_02 | 2832 (+) | 0.593 | 0.711 | 
TGTCTgcaaatggcat P$AG_02 | 2832 (-) | 0.857 | 0.717 | tgtctgcaaatGGCAT P$AG_02 | 3087 (+) | 0.732 | 0.717 | ATTCTatttttcgaat P$AG_02 | 3087 (-) | 0.683 | 0.717 | attctatttttCGAAT P$AG_02 | 3639 (+) | 0.876 | 0.819 | CTACCacatggtggaa P$AG_02 | 3639 (-) | 0.793 | 0.822 | ctaccacatggTGGAA P$AG_02 | 3861 (+) | 0.849 | 0.734 | GTACCatttaaggagt P$AG_02 | 3861 (-) | 0.586 | 0.724 | gtaccatttaaGGAGT P$AG_02 | 4038 (+) | 0.863 | 0.795 | ATTCCagcagggggaa P$AG_02 | 4038 (-) | 0.972 | 0.798 | attccagcaggGGGAA P$AG_02 | 4039 (+) | 0.793 | 0.805 | TTCCAgcagggggaaa P$AG_02 | 4039 (-) | 1.000 | 0.804 | ttccagcagggGGAAA P$AG_02 | 4116 (+) | 0.972 | 0.730 | TTCCCagatgttaaaa P$AG_02 | 4210 (+) | 0.994 | 0.708 | TTGCCtcagttaaaaa P$AG_02 | 4395 (+) | 0.849 | 0.718 | GTTCCaagacaggtca P$AG_02 | 4395 (-) | 0.724 | 0.707 | gttccaagacaGGTCA P$AG_02 | 4598 (-) | 0.793 | 0.712 | tgcccgcttgcTGGAA P$AG_02 | 4680 (+) | 0.718 | 0.740 | GTTCTgtggcaggaaa P$AG_02 | 4680 (-) | 1.000 | 0.736 | gttctgtggcaGGAAA P$AG_02 | 4819 (+) | 1.000 | 0.788 | TTACCaggcggggtag P$AG_02 | 4819 (-) | 0.876 | 0.790 | ttaccaggcggGGTAG P$AG_02 | 4926 (+) | 0.821 | 0.716 | TTACAgatcatggaga P$AG_02 | 4954 (+) | 0.869 | 0.796 | TTTCTgcagcaggaag P$AG_02 | 4954 (-) | 0.876 | 0.792 | tttctgcagcaGGAAG P$AG_02 | 5010 (+) | 1.000 | 0.739 | TTACCcattaagccac P$AG_02 | 5349 (+) | 0.835 | 0.710 | ATCCCctctttgacaa P$AG_02 | 5349 (-) | 0.629 | 0.707 | atcccctctttGACAA P$AG_02 | 5985 (+) | 0.746 | 0.807 | CTTCTggattagggat P$AG_02 | 5985 (-) | 0.835 | 0.803 | cttctggattaGGGAT P$AG_02 | 6260 (+) | 0.821 | 0.710 | GTCCCatctgccgaag P$AG_02 | 6260 (-) | 0.697 | 0.714 | gtcccatctgcCGAAG P$AG_02 | 6267 (+) | 0.871 | 0.827 | CTGCCgaagaaggaag P$AG_02 | 6267 (-) | 0.876 | 0.823 | ctgccgaagaaGGAAG P$AG_02 | 6370 (-) | 0.876 | 0.704 | aagcgagatttGGTAG P$AG_02 | 6441 (+) | 0.580 | 0.710 | AACCCagtcatggcaa P$AG_02 | 6441 (-) | 0.994 | 0.720 | aacccagtcatGGCAA P$AG_02 | 7835 (+) | 0.580 | 0.754 | AACCCacttctggcaa P$AG_02 | 7835 (-) | 0.994 | 0.764 | 
aacccacttctGGCAA P$AG_02 | 8001 (+) | 0.843 | 0.784 | GTGCCtgttggtgtaa P$AG_02 | 8001 (-) | 0.821 | 0.787 | gtgcctgttggTGTAA P$AG_02 | 8137 (+) | 0.869 | 0.818 | TTTCTgctgagggaaa P$AG_02 | 8137 (-) | 1.000 | 0.817 | tttctgctgagGGAAA P$AG_02 | 8161 (+) | 0.793 | 0.720 | TTCCAgaggcaggtag P$AG_02 | 8161 (-) | 0.876 | 0.716 | ttccagaggcaGGTAG P$AG_02 | 8266 (+) | 0.724 | 0.721 | TGACCtgatgatgcag P$AG_02 | 8266 (-) | 0.691 | 0.732 | tgacctgatgaTGCAG P$AG_02 | 8463 (-) | 0.994 | 0.706 | gtctcatctaaGGCAA P$AG_02 | 9097 (+) | 0.740 | 0.731 | CTGCTtacaaaagcaa P$AG_02 | 9097 (-) | 0.864 | 0.731 | ctgcttacaaaAGCAA P$AG_02 | 9404 (+) | 0.718 | 0.753 | GTTCTatgtctagaaa P$AG_02 | 9404 (-) | 0.869 | 0.753 | gttctatgtctAGAAA P$AG_02 | 9596 (-) | 0.972 | 0.770 | tcccagcatttGGGAA P$AG_02 | 9819 (+) | 0.697 | 0.729 | CTTCAaattttagaag P$AG_02 | 9819 (-) | 0.746 | 0.729 | cttcaaattttAGAAG P$AG_02 | 10372 (+) | 0.821 | 0.829 | TTTCAaaatgttgcaa P$AG_02 | 10372 (-) | 0.815 | 0.829 | tttcaaaatgtTGCAA P$AG_02 | 10590 (+) | 1.000 | 0.708 | TTTCCtatatcaaaaa P$AG_02 | 10963 (+) | 0.843 | 0.802 | GTGCCtgttggagcaa P$AG_02 | 10963 (-) | 0.864 | 0.805 | gtgcctgttggAGCAA P$AG_02 | 11262 (+) | 0.843 | 0.767 | GTGCCaagaaaaggaa P$AG_02 | 11262 (-) | 0.842 | 0.767 | gtgccaagaaaAGGAA P$AG_02 | 11263 (-) | 1.000 | 0.756 | tgccaagaaaaGGAAA P$AG_02 | 11279 (-) | 0.994 | 0.740 | ttaaggcatagGGCAA P$AG_02 | 11516 (+) | 1.000 | 0.762 | TTACCgttgtccgtag P$AG_02 | 11516 (-) | 0.697 | 0.762 | ttaccgttgtcCGTAG P$AG_02 | 11535 (+) | 0.835 | 0.861 | ATCCCacatatggtag P$AG_02 | 11535 (-) | 0.876 | 0.861 | atcccacatatGGTAG I$GRH_01 | 2547 (+) | 1.000 | 0.976 | acaGGTTT I$GRH_01 | 5090 (+) | 1.000 | 1.000 | actGGTTT V$AR_02 | 2804 (+) | 0.800 | 0.856 | tcttactgaacaccgTGCTCtgtccag V$AR_02 | 6512 (-) | 1.000 | 0.944 | aataaaaGAACAcgaagttcaccctgg V$AR_02 | 8528 (-) | 1.000 | 0.917 | gcttcaaGAACAcagtgggccgagaaa V$AR_02 | 9459 (+) | 1.000 | 0.949 | ctctgctgtacaggaTGTTCtagctac V$AR_02 | 10056 (+) | 0.800 | 0.821 | 
aagaactgaaccgggTCTTCtggaagg V$AR_02 | 11217 (+) | 1.000 | 0.885 | ttgaactgcccagttTGTTCtaactgg V$PR_01 | 6512 (-) | 1.000 | 0.961 | aataaaaGAACAcgaagttcaccctgg V$PR_01 | 9459 (+) | 1.000 | 0.969 | ctctgctgtacaggaTGTTCtagctac V$GR_01 | 1522 (+) | 1.000 | 0.924 | gtagagctgttttttTGTTCtaaggga V$GR_01 | 1742 (-) | 1.000 | 0.922 | tcctataGAACActgatctataagcct V$GR_01 | 2686 (+) | 0.800 | 0.818 | gcatcctggtcacacTCTTCtccttcc V$GR_01 | 2804 (+) | 0.800 | 0.830 | tcttactgaacaccgTGCTCtgtccag V$GR_01 | 2804 (-) | 1.000 | 0.822 | tcttactGAACAccgtgctctgtccag V$GR_01 | 3534 (+) | 1.000 | 0.931 | cccgaggtttggttcTGTTCtttagta V$GR_01 | 3687 (-) | 1.000 | 0.920 | gtgccaaGAACAcaagagatgcttgat V$GR_01 | 5169 (+) | 1.000 | 0.926 | tttcacttgtgactgTGTTCtagcagg V$GR_01 | 5656 (-) | 1.000 | 0.914 | aagaaaaGAACAcaataagagacagac V$GR_01 | 6512 (-) | 1.000 | 0.970 | aataaaaGAACAcgaagttcaccctgg V$GR_01 | 6908 (-) | 1.000 | 0.915 | aagtggaGAACAaggaattgattgtat V$GR_01 | 7479 (+) | 1.000 | 0.912 | tgctacgcgcacgtcTGTTCttcttgg V$GR_01 | 7799 (-) | 1.000 | 0.951 | agataaaGAACAacgacaactccaaaa V$GR_01 | 8412 (+) | 1.000 | 0.933 | cggcctgtaagcagcTGTTCtagagct V$GR_01 | 8528 (-) | 1.000 | 0.967 | gcttcaaGAACAcagtgggccgagaaa V$GR_01 | 9034 (+) | 0.800 | 0.823 | agttcaagggcagccTGGTCtacatgg V$GR_01 | 9348 (+) | 1.000 | 0.942 | gaagtctcttctcagTGTTCtctatca V$GR_01 | 9426 (-) | 0.800 | 0.820 | ccataaaTAACAggaagcccaaggttt V$GR_01 | 9459 (+) | 1.000 | 0.984 | ctctgctgtacaggaTGTTCtagctac V$GR_01 | 9528 (-) | 1.000 | 0.946 | aagactaGAACAaacaagtcctgcgta V$GR_01 | 9774 (-) | 1.000 | 0.928 | tgccataGAACAattttgttcagtaaa V$GR_01 | 10056 (+) | 0.800 | 0.824 | aagaactgaaccgggTCTTCtggaagg V$GR_01 | 11217 (+) | 1.000 | 0.966 | ttgaactgcccagttTGTTCtaactgg V$AR_03 | 2804 (+) | 0.800 | 0.854 | tcttactgaacaccgTGCTCtgtccag V$AR_03 | 2804 (-) | 1.000 | 0.817 | tcttactGAACAccgtgctctgtccag V$AR_03 | 6512 (-) | 1.000 | 0.970 | aataaaaGAACAcgaagttcaccctgg V$AR_03 | 6908 (-) | 1.000 | 0.803 | aagtggaGAACAaggaattgattgtat V$AR_03 | 7799 (-) 
| 1.000 | 0.853 | agataaaGAACAacgacaactccaaaa V$AR_03 | 8412 (+) | 1.000 | 0.821 | cggcctgtaagcagcTGTTCtagagct V$AR_03 | 8528 (-) | 1.000 | 0.930 | gcttcaaGAACAcagtgggccgagaaa V$AR_03 | 9034 (+) | 0.800 | 0.804 | agttcaagggcagccTGGTCtacatgg V$AR_03 | 9348 (+) | 1.000 | 0.836 | gaagtctcttctcagTGTTCtctatca V$AR_03 | 9459 (+) | 1.000 | 0.964 | ctctgctgtacaggaTGTTCtagctac V$AR_03 | 9528 (-) | 1.000 | 0.843 | aagactaGAACAaacaagtcctgcgta V$AR_03 | 10056 (+) | 0.800 | 0.843 | aagaactgaaccgggTCTTCtggaagg V$AR_03 | 11217 (+) | 1.000 | 0.903 | ttgaactgcccagttTGTTCtaactgg V$PR_02 | 6512 (-) | 1.000 | 0.964 | aataaaaGAACAcgaagttcaccctgg V$PR_02 | 9459 (+) | 1.000 | 0.975 | ctctgctgtacaggaTGTTCtagctac P$ABI4_01 | 1183 (+) | 0.892 | 0.908 | gggcaCCCCCa P$ABI4_01 | 1813 (-) | 0.897 | 0.872 | gGGTGGtgttc P$ABI4_01 | 4144 (-) | 0.795 | 0.864 | gTGTGGtgcct P$ABI4_01 | 6599 (-) | 0.897 | 0.909 | aGGTGGtgcta P$ABI4_01 | 7196 (-) | 0.892 | 0.910 | cGGGGGtgcta P$ABI4_01 | 9686 (-) | 0.892 | 0.870 | gGGGGGtggtg P$ABI4_01 | 9689 (-) | 0.897 | 0.874 | gGGTGGtggtg P$ABI4_01 | 11845 (-) | 0.892 | 0.867 | gGGGGGtgggg V$DR4_Q2 | 2658 (+) | 0.754 | 0.793 | tgcactttgaTGACAct V$DR4_Q2 | 5000 (+) | 0.884 | 0.844 | agaccttgtaTTACCca V$DR4_Q2 | 5247 (-) | 0.853 | 0.840 | ggGTTCAcatgtggtca V$DR4_Q2 | 7135 (-) | 0.807 | 0.824 | gaGGCCAgaagagggcg V$DR4_Q2 | 9981 (-) | 0.813 | 0.794 | ggGATCAgaagagggta V$DR4_Q2 | 10243 (-) | 1.000 | 0.820 | gtGGTCAggcttgtgca V$DR3_Q4 | 294 (+) | 0.897 | 0.757 | gttttacagacTGAGCtatct V$DR3_Q4 | 411 (-) | 0.941 | 0.802 | gaatgGGTGAgcttgtccaaa V$DR3_Q4 | 434 (-) | 0.894 | 0.741 | accacGTTTAatcaggttgtc V$DR3_Q4 | 596 (+) | 0.894 | 0.791 | gacttacttttTAAACatcct V$DR3_Q4 | 1076 (-) | 1.000 | 0.788 | gtgtaGGTCAttttggtcaac V$DR3_Q4 | 1348 (+) | 0.941 | 0.761 | gcttgcctgggTCACCtgatg V$DR3_Q4 | 1377 (-) | 0.822 | 0.802 | cagtgGGCGAggtagggcagg V$DR3_Q4 | 1396 (-) | 0.808 | 0.762 | ggtgaGGAGAgtgggtccatt V$DR3_Q4 | 1602 (-) | 0.897 | 0.751 | ttgggGCTCAgggggtctcgt V$DR3_Q4 | 1618 (-) 
| 0.867 | 0.781 | ctcgtGGACAgcatgtccagc V$DR3_Q4 | 1620 (+) | 0.867 | 0.773 | cgtggacagcaTGTCCagcct V$DR3_Q4 | 1694 (-) | 0.808 | 0.782 | cagggGTAGAtgaagcaccct V$DR3_Q4 | 1723 (-) | 0.897 | 0.747 | tccggGCTCAtcctgaacttc V$DR3_Q4 | 1725 (+) | 1.000 | 0.806 | cgggctcatccTGAACttcct V$DR3_Q4 | 2327 (+) | 1.000 | 0.748 | gtcttgctatgTGACCctacg V$DR3_Q4 | 2665 (+) | 0.906 | 0.775 | tgatgacactcTTAACccgtt V$DR3_Q4 | 2799 (+) | 1.000 | 0.771 | tcttctcttacTGAACaccgt V$DR3_Q4 | 2808 (+) | 0.804 | 0.748 | actgaacaccgTGCTCtgtcc V$DR3_Q4 | 3070 (-) | 1.000 | 0.812 | agctaGGTCAcagggtgattc V$DR3_Q4 | 3307 (-) | 0.906 | 0.780 | atgtaGTTAAgattggccttt V$DR3_Q4 | 3520 (+) | 0.881 | 0.795 | ggaggccacctTGGCCcgagg V$DR3_Q4 | 3644 (-) | 0.848 | 0.803 | acatgGTGGAatcagtaaagt V$DR3_Q4 | 3770 (+) | 0.848 | 0.755 | gcaatgctcctTCCCCccttt V$DR3_Q4 | 3880 (+) | 1.000 | 0.811 | tacaagctcctTGACCctcat V$DR3_Q4 | 4043 (-) | 0.848 | 0.782 | agcagGGGGAaataggcaata V$DR3_Q4 | 4342 (-) | 0.848 | 0.778 | aggtgGTGGAggcaggaggat V$DR3_Q4 | 4461 (+) | 1.000 | 0.755 | ttcacactgacTGACCagcat V$DR3_Q4 | 5256 (+) | 0.838 | 0.752 | tgtggtcacagTCATCctggg V$DR3_Q4 | 5282 (-) | 0.897 | 0.806 | aacagGCTCAgatgggctagc V$DR3_Q4 | 5321 (-) | 1.000 | 0.796 | catcgGTTCAgtgcgccgata V$DR3_Q4 | 6248 (+) | 0.867 | 0.750 | ggctttctattTGTCCcatct V$DR3_Q4 | 6381 (+) | 0.881 | 0.779 | ggtagacgaacTGGCCtgtgt V$DR3_Q4 | 6395 (-) | 1.000 | 0.768 | cctgtGTTCAgaacgctcaga V$DR3_Q4 | 6471 (-) | 0.897 | 0.752 | aaaatGCTCAggatgaggact V$DR3_Q4 | 6523 (-) | 1.000 | 0.790 | acgaaGTTCAccctggagaca V$DR3_Q4 | 6794 (+) | 0.848 | 0.742 | ctggctcttccTCCCCtcatt V$DR3_Q4 | 6847 (-) | 0.941 | 0.814 | caactGGTGAgcacgtccaga V$DR3_Q4 | 7123 (-) | 0.897 | 0.754 | gcaatGCTCAaggaggccaga V$DR3_Q4 | 7220 (-) | 0.881 | 0.764 | aagtaGGCCAgcaggaagttc V$DR3_Q4 | 7350 (-) | 0.906 | 0.792 | aacatGTTAAtcatgggcaga V$DR3_Q4 | 8192 (+) | 0.813 | 0.765 | gctctgcctgtTTCCCctctg V$DR3_Q4 | 8200 (+) | 1.000 | 0.792 | tgtttcccctcTGACCtcagt V$DR3_Q4 | 8671 (-) | 0.894 | 0.784 | 
ccaccGGTTAggaaggtaatt V$DR3_Q4 | 8959 (-) | 0.808 | 0.739 | agtagGGAGAaccaggtgtgg V$DR3_Q4 | 9010 (-) | 0.907 | 0.779 | ggcagGGGCAggcagatcttt V$DR3_Q4 | 9461 (-) | 0.867 | 0.740 | ctgctGTACAggatgttctag V$DR3_Q4 | 9634 (-) | 1.000 | 0.758 | ttcaaGGTCAgcttgggctac V$DR3_Q4 | 9675 (-) | 0.848 | 0.737 | aagaaGTGGAgggggggtggt V$DR3_Q4 | 9988 (-) | 0.801 | 0.764 | gaagaGGGTAttaggtcctct V$DR3_Q4 | 10076 (-) | 0.907 | 0.787 | tggaaGGGCAgccagtgctct V$DR3_Q4 | 10250 (-) | 0.907 | 0.739 | ggcttGTGCAggaagtgattt V$DR3_Q4 | 10978 (+) | 0.881 | 0.737 | aggagcctgtgTGGCCctgga V$DR3_Q4 | 11046 (+) | 0.906 | 0.741 | tggttcctttgTTAACgttgt V$DR3_Q4 | 11055 (+) | 0.897 | 0.736 | tgttaacgttgTGATCagtta V$DR3_Q4 | 11123 (+) | 0.907 | 0.780 | ccttaactcgcTGCACtcagt V$DR3_Q4 | 11311 (-) | 0.941 | 0.752 | actacGGTGAcgacgatgatg V$DR3_Q4 | 11326 (-) | 0.838 | 0.743 | atgatGATGAcgatgtaaatg V$DR3_Q4 | 11449 (-) | 0.941 | 0.747 | attttGGTGAatcagtgaaat V$DR3_Q4 | 11884 (+) | 0.941 | 0.740 | agttctcactcTCAACcaata V$ETS_Q6 | 925 (+) | 1.000 | 0.948 | aCTTCCac V$ETS_Q6 | 1192 (-) | 0.954 | 0.944 | caGGAAAg V$ETS_Q6 | 1739 (+) | 1.000 | 0.987 | aCTTCCta V$ETS_Q6 | 2147 (+) | 1.000 | 0.976 | tCTTCCtt V$ETS_Q6 | 2151 (+) | 1.000 | 0.978 | cCTTCCtt V$ETS_Q6 | 2155 (+) | 1.000 | 0.988 | cCTTCCtc V$ETS_Q6 | 2161 (+) | 1.000 | 0.976 | tCTTCCtt V$ETS_Q6 | 2205 (+) | 1.000 | 0.976 | tCTTCCtt V$ETS_Q6 | 2707 (+) | 1.000 | 0.988 | cCTTCCtc V$ETS_Q6 | 2785 (+) | 1.000 | 0.976 | tCTTCCtt V$ETS_Q6 | 2989 (+) | 1.000 | 0.958 | aCTTCCgc V$ETS_Q6 | 3033 (+) | 1.000 | 0.958 | aCTTCCgc V$ETS_Q6 | 3325 (+) | 0.954 | 0.942 | tTTTCCtg V$ETS_Q6 | 3778 (+) | 1.000 | 0.939 | cCTTCCcc V$ETS_Q6 | 4689 (-) | 0.954 | 0.948 | caGGAAAc V$ETS_Q6 | 4963 (-) | 1.000 | 0.999 | caGGAAGt V$ETS_Q6 | 5473 (+) | 0.954 | 0.957 | aTTTCCtg V$ETS_Q6 | 5521 (+) | 0.954 | 0.945 | cTTTCCtc V$ETS_Q6 | 6089 (+) | 0.938 | 0.942 | aGTTCCtg V$ETS_Q6 | 6276 (-) | 1.000 | 0.981 | aaGGAAGc V$ETS_Q6 | 6317 (+) | 1.000 | 0.942 | aCTTCCct V$ETS_Q6 | 6461 (+) | 
0.954 | 0.957 | aTTTCCtg V$ETS_Q6 | 6799 (+) | 1.000 | 0.986 | tCTTCCtc V$ETS_Q6 | 6878 (-) | 1.000 | 0.978 | taGGAAGc V$ETS_Q6 | 7019 (+) | 0.954 | 0.948 | gTTTCCtg V$ETS_Q6 | 7231 (-) | 1.000 | 0.999 | caGGAAGt V$ETS_Q6 | 7248 (-) | 0.954 | 0.942 | caGGAAAa V$ETS_Q6 | 8082 (+) | 1.000 | 0.987 | aCTTCCta V$ETS_Q6 | 8679 (-) | 1.000 | 0.975 | taGGAAGg V$ETS_Q6 | 9436 (-) | 1.000 | 0.990 | caGGAAGc V$ETS_Q6 | 9875 (-) | 0.954 | 0.958 | gaGGAAAt V$ETS_Q6 | 10258 (-) | 1.000 | 0.999 | caGGAAGt V$ETS_Q6 | 10589 (+) | 0.954 | 0.945 | aTTTCCta V$ETS_Q6 | 10639 (-) | 1.000 | 0.986 | gaGGAAGa V$ETS_Q6 | 10845 (-) | 1.000 | 0.985 | caGGAAGa V$ETS_Q6 | 11272 (-) | 0.954 | 0.948 | aaGGAAAt V$IRF_Q6_01 | 225 (-) | 0.968 | 0.972 | tTTTCCttttt V$IRF_Q6_01 | 901 (-) | 1.000 | 0.999 | tTTTCActttc V$IRF_Q6_01 | 1957 (-) | 0.968 | 0.973 | cTTTCCttttt V$IRF_Q6_01 | 2509 (+) | 0.968 | 0.978 | aaaagGGAAAa V$IRF_Q6_01 | 11269 (+) | 0.968 | 0.976 | gaaaaGGAAAt V$E2A_Q6 | 1360 (+) | 1.000 | 0.984 | CACCTgat V$E2A_Q6 | 1392 (-) | 1.000 | 1.000 | ggcAGGTG V$E2A_Q6 | 1788 (+) | 1.000 | 1.000 | CACCTgcc V$E2A_Q6 | 2970 (-) | 0.976 | 0.982 | ggcAGATG V$E2A_Q6 | 6264 (+) | 0.976 | 0.982 | CATCTgcc V$E2A_Q6 | 8279 (+) | 0.976 | 0.982 | CAGCTgcc V$E2A_Q6 | 8504 (-) | 1.000 | 1.000 | gacAGGTG V$E2A_Q6 | 8770 (+) | 1.000 | 0.980 | CACCTggg V$E2A_Q6 | 8969 (-) | 1.000 | 0.986 | accAGGTG V$SMAD_Q6_01 | 140 (+) | 1.000 | 0.974 | aatcCAGACat V$SMAD_Q6_01 | 296 (+) | 1.000 | 0.974 | tttaCAGACtg V$SMAD_Q6_01 | 338 (+) | 1.000 | 0.962 | ccctCAGACac V$SMAD_Q6_01 | 450 (-) | 1.000 | 0.966 | ttGTCTGtttt V$SMAD_Q6_01 | 590 (+) | 1.000 | 0.965 | ttaaCAGACtt V$SMAD_Q6_01 | 681 (+) | 1.000 | 0.980 | tcttCAGACac V$SMAD_Q6_01 | 892 (+) | 1.000 | 0.957 | gggaCAGACtt V$SMAD_Q6_01 | 1928 (-) | 0.925 | 0.915 | atGTCTAtgta V$SMAD_Q6_01 | 2080 (-) | 1.000 | 0.976 | ctGTCTGtctc V$SMAD_Q6_01 | 2305 (-) | 1.000 | 0.971 | gtGTCTGtgtg V$SMAD_Q6_01 | 2831 (-) | 1.000 | 0.983 | ttGTCTGcaaa V$SMAD_Q6_01 | 2943 (-) | 1.000 | 
0.983 | aaGTCTGccta V$SMAD_Q6_01 | 3351 (+) | 1.000 | 0.977 | tcatCAGACag V$SMAD_Q6_01 | 3355 (+) | 1.000 | 0.954 | cagaCAGACca V$SMAD_Q6_01 | 3665 (+) | 1.000 | 0.956 | aacaCAGACtt V$SMAD_Q6_01 | 3978 (+) | 0.925 | 0.908 | atgaTAGACac V$SMAD_Q6_01 | 4326 (-) | 1.000 | 0.953 | ggGTCTGgtga V$SMAD_Q6_01 | 4420 (-) | 0.925 | 0.913 | ttGTCTAaaaa V$SMAD_Q6_01 | 4513 (+) | 1.000 | 0.933 | acatCAGACca V$SMAD_Q6_01 | 4775 (-) | 1.000 | 0.978 | gaGTCTGgata V$SMAD_Q6_01 | 4995 (+) | 1.000 | 0.953 | atggCAGACct V$SMAD_Q6_01 | 5198 (-) | 1.000 | 0.951 | ttGTCTGtggc V$SMAD_Q6_01 | 5674 (+) | 1.000 | 0.974 | gagaCAGACac V$SMAD_Q6_01 | 6236 (-) | 1.000 | 1.000 | ctGTCTGgcta V$SMAD_Q6_01 | 6649 (-) | 1.000 | 0.975 | caGTCTGcaaa V$SMAD_Q6_01 | 6860 (+) | 1.000 | 0.951 | cgtcCAGACcg V$SMAD_Q6_01 | 7102 (-) | 1.000 | 0.969 | taGTCTGtgta V$SMAD_Q6_01 | 7325 (-) | 1.000 | 0.936 | acGTCTGgaac V$SMAD_Q6_01 | 7489 (-) | 1.000 | 0.938 | acGTCTGttct V$SMAD_Q6_01 | 7957 (+) | 1.000 | 0.946 | acccCAGACta V$SMAD_Q6_01 | 8129 (-) | 1.000 | 0.943 | tgGTCTGcttt V$SMAD_Q6_01 | 8716 (-) | 1.000 | 0.967 | gaGTCTGaaga V$SMAD_Q6_01 | 9409 (-) | 0.925 | 0.916 | atGTCTAgaaa V$SMAD_Q6_01 | 10957 (-) | 1.000 | 0.939 | cgGTCTGtgcc V$RFX_Q6 | 71 (+) | 0.925 | 0.897 | ctGTTGTtg V$RFX_Q6 | 80 (+) | 0.938 | 0.890 | atGTTGAta V$RFX_Q6 | 383 (+) | 0.934 | 0.921 | gaGATGCca V$RFX_Q6 | 398 (-) | 0.913 | 0.888 | tgCCAACaa V$RFX_Q6 | 484 (-) | 0.938 | 0.920 | agTCAACat V$RFX_Q6 | 492 (-) | 1.000 | 0.937 | ttGCAACat V$RFX_Q6 | 537 (+) | 0.934 | 0.912 | gtGATGCgc V$RFX_Q6 | 582 (+) | 0.938 | 0.898 | ctGTTGAat V$RFX_Q6 | 701 (-) | 0.934 | 0.929 | ggGCATCgg V$RFX_Q6 | 741 (+) | 1.000 | 0.907 | tgGTTGCtg V$RFX_Q6 | 860 (+) | 1.000 | 0.954 | tcGTTGCcc V$RFX_Q6 | 978 (-) | 0.934 | 0.889 | aaGCATCac V$RFX_Q6 | 1029 (+) | 0.938 | 0.892 | caGTTGAat V$RFX_Q6 | 1090 (-) | 0.938 | 0.898 | ggTCAACta V$RFX_Q6 | 1363 (+) | 0.934 | 0.901 | ctGATGCtc V$RFX_Q6 | 1845 (-) | 0.938 | 0.928 | cgTCAACac V$RFX_Q6 | 1859 (-) | 0.913 | 0.912 
| agCCAACtg V$RFX_Q6 | 2395 (+) | 1.000 | 0.943 | gaGTTGCtt V$RFX_Q6 | 2555 (+) | 0.925 | 0.924 | ctGTTGTga V$RFX_Q6 | 2681 (+) | 1.000 | 0.948 | ccGTTGCat V$RFX_Q6 | 2828 (+) | 0.925 | 0.923 | caGTTGTct V$RFX_Q6 | 2842 (-) | 0.934 | 0.920 | tgGCATCat V$RFX_Q6 | 2972 (+) | 0.934 | 0.920 | caGATGCgc V$RFX_Q6 | 2981 (-) | 0.934 | 0.897 | agGCATCta V$RFX_Q6 | 3141 (-) | 0.925 | 0.888 | ccACAACaa V$RFX_Q6 | 3236 (-) | 0.913 | 0.897 | agCCAACat V$RFX_Q6 | 3270 (+) | 0.934 | 0.927 | gtGATGCca V$RFX_Q6 | 3424 (+) | 1.000 | 0.905 | tgGTTGCtt V$RFX_Q6 | 4524 (-) | 0.925 | 0.895 | agACAACaa V$RFX_Q6 | 4536 (-) | 0.938 | 0.942 | cgTCAACag V$RFX_Q6 | 4568 (+) | 1.000 | 0.984 | caGTTGCgg V$RFX_Q6 | 4919 (-) | 0.925 | 0.896 | ccACAACtt V$RFX_Q6 | 5139 (-) | 0.938 | 0.895 | ggTCAACct V$RFX_Q6 | 5278 (-) | 0.925 | 0.897 | caACAACag V$RFX_Q6 | 5841 (-) | 0.934 | 0.917 | agGCATCtc V$RFX_Q6 | 5852 (+) | 0.938 | 0.911 | atGTTGAgt V$RFX_Q6 | 6109 (+) | 0.938 | 0.938 | ctGTTGAcc V$RFX_Q6 | 6437 (-) | 0.938 | 0.902 | ggTCAACcc V$RFX_Q6 | 6498 (+) | 0.934 | 0.925 | caGATGCgg V$RFX_Q6 | 6564 (+) | 1.000 | 0.943 | tgGTTGCca V$RFX_Q6 | 6844 (-) | 0.913 | 0.916 | tgCCAACtg V$RFX_Q6 | 7024 (-) | 0.934 | 0.896 | ctGCATCag V$RFX_Q6 | 7042 (+) | 0.938 | 0.900 | taGTTGAct V$RFX_Q6 | 7170 (+) | 1.000 | 0.936 | gcGTTGCag V$RFX_Q6 | 7547 (-) | 0.913 | 0.912 | agCCAACtg V$RFX_Q6 | 7812 (-) | 0.925 | 0.912 | cgACAACtc V$RFX_Q6 | 8225 (+) | 0.913 | 0.897 | atGTTGGct V$RFX_Q6 | 8439 (-) | 1.000 | 0.921 | ttGCAACcc V$RFX_Q6 | 8446 (-) | 0.934 | 0.887 | ccGCATCct V$RFX_Q6 | 8523 (+) | 0.934 | 0.897 | caGATGCtt V$RFX_Q6 | 8589 (-) | 1.000 | 0.975 | agGCAACat V$RFX_Q6 | 8800 (+) | 0.913 | 0.892 | gaGTTGGgg V$RFX_Q6 | 8941 (-) | 1.000 | 0.936 | aaGCAACtt V$RFX_Q6 | 9318 (+) | 0.913 | 0.890 | gcGTTGGgt V$RFX_Q6 | 10155 (+) | 0.925 | 0.897 | ttGTTGTcg V$RFX_Q6 | 10216 (-) | 0.934 | 0.901 | tgGCATCta V$RFX_Q6 | 10379 (+) | 1.000 | 0.937 | atGTTGCaa V$RFX_Q6 | 10440 (-) | 1.000 | 0.927 | atGCAACgt V$RFX_Q6 | 
10538 (-) | 1.000 | 0.932 | gtGCAACtc V$RFX_Q6 | 10662 (-) | 0.938 | 0.944 | tgTCAACag V$RFX_Q6 | 10691 (-) | 1.000 | 0.986 | tgGCAACac V$RFX_Q6 | 11039 (+) | 0.925 | 0.897 | ctGTTGTtg V$RFX_Q6 | 11060 (+) | 0.925 | 0.898 | acGTTGTga V$RFX_Q6 | 11519 (+) | 0.925 | 0.921 | ccGTTGTcc V$RFX_Q6 | 11709 (-) | 0.913 | 0.906 | ccCCAACtg V$RFX_Q6 | 11920 (-) | 0.913 | 0.888 | gcCCAACgc V$AHRHIF_Q6 | 1423 (-) | 1.000 | 0.980 | agaCACGCa V$AHRHIF_Q6 | 6220 (-) | 1.000 | 0.990 | tctCACGCt V$AHRHIF_Q6 | 7485 (-) | 0.982 | 0.980 | gcgCACGTc V$AHRHIF_Q6 | 10720 (+) | 1.000 | 0.986 | tGCGTGctg V$EBF_Q6 | 1332 (+) | 0.976 | 0.934 | cTCCCCagggc V$EBF_Q6 | 5919 (+) | 0.976 | 0.923 | tTCCCCagcga V$LEF1TCF1_Q4 | 1036 (-) | 1.000 | 0.962 | atcaaCAAAGc V$LEF1TCF1_Q4 | 1164 (-) | 1.000 | 0.915 | tttttCAAAGa V$LEF1TCF1_Q4 | 1235 (+) | 1.000 | 0.894 | gCTTTGacatc V$LEF1TCF1_Q4 | 2661 (+) | 1.000 | 0.985 | aCTTTGatgac V$LEF1TCF1_Q4 | 4663 (+) | 1.000 | 0.933 | aCTTTGttttg V$LEF1TCF1_Q4 | 4871 (-) | 1.000 | 0.926 | agcagCAAAGc V$LEF1TCF1_Q4 | 5355 (+) | 1.000 | 0.881 | tCTTTGacaaa V$LEF1TCF1_Q4 | 6741 (+) | 1.000 | 0.935 | cCTTTGaaaac V$LEF1TCF1_Q4 | 8379 (+) | 1.000 | 0.972 | tCTTTGatctg V$LEF1TCF1_Q4 | 8436 (+) | 1.000 | 0.865 | gCTTTGcaacc V$LEF1TCF1_Q4 | 8569 (+) | 1.000 | 0.943 | tCTTTGattaa V$LEF1TCF1_Q4 | 8623 (+) | 1.000 | 0.888 | aCTTTGagaaa V$LEF1TCF1_Q4 | 8707 (-) | 0.863 | 0.869 | accatCAAGGa V$LEF1TCF1_Q4 | 9026 (+) | 1.000 | 0.865 | tCTTTGtgagt V$LEF1TCF1_Q4 | 9726 (+) | 1.000 | 0.886 | tCTTTGtatga V$LEF1TCF1_Q4 | 10350 (-) | 1.000 | 0.921 | taattCAAAGt V$LEF1TCF1_Q4 | 10578 (+) | 1.000 | 0.888 | gCTTTGgttca V$LEF1TCF1_Q4 | 11051 (+) | 1.000 | 0.947 | cCTTTGttaac V$LEF1TCF1_Q4 | 11379 (+) | 1.000 | 0.924 | gCTTTGaaaac V$PAX6_Q2 | 324 (+) | 1.000 | 0.862 | ctgtgcTGGAAatt V$PAX6_Q2 | 4393 (-) | 1.000 | 0.891 | gagTTCCAagacag V$PAX6_Q2 | 4942 (+) | 0.861 | 0.873 | ctgaacTGGGActt V$PAX6_Q2 | 9033 (-) | 0.821 | 0.870 | gagTTCAAgggcag V$PAX6_Q2 | 9631 (-) | 0.821 | 0.921 | 
gagTTCAAggtcag V$TBP_Q6 | 568 (-) | 1.000 | 0.971 | agATAAA V$TBP_Q6 | 644 (+) | 1.000 | 0.974 | TTTATtc V$TBP_Q6 | 652 (+) | 1.000 | 0.969 | TTTATta V$TBP_Q6 | 834 (-) | 1.000 | 0.997 | atATAAA V$TBP_Q6 | 848 (-) | 1.000 | 0.971 | aaATAAA V$TBP_Q6 | 1152 (+) | 1.000 | 0.969 | TTTATgt V$TBP_Q6 | 1892 (+) | 1.000 | 0.997 | TTTATat V$TBP_Q6 | 1898 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 1903 (+) | 1.000 | 0.969 | TTTATta V$TBP_Q6 | 1911 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 2365 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 2739 (+) | 1.000 | 0.972 | TTTATgc V$TBP_Q6 | 2903 (+) | 1.000 | 0.974 | TTTATcc V$TBP_Q6 | 3405 (-) | 1.000 | 0.994 | ttATAAA V$TBP_Q6 | 3452 (-) | 1.000 | 0.969 | taATAAA V$TBP_Q6 | 3626 (+) | 1.000 | 0.994 | TTTATaa V$TBP_Q6 | 3742 (+) | 0.923 | 0.926 | TTTCTac V$TBP_Q6 | 3831 (-) | 1.000 | 0.969 | taATAAA V$TBP_Q6 | 3847 (-) | 1.000 | 0.966 | tcATAAA V$TBP_Q6 | 3910 (-) | 1.000 | 0.969 | taATAAA V$TBP_Q6 | 3919 (+) | 1.000 | 0.966 | TTTATga V$TBP_Q6 | 4004 (-) | 1.000 | 0.971 | cgATAAA V$TBP_Q6 | 4008 (-) | 1.000 | 0.971 | aaATAAA V$TBP_Q6 | 4225 (-) | 1.000 | 0.971 | aaATAAA V$TBP_Q6 | 4316 (-) | 1.000 | 0.972 | gcATAAA V$TBP_Q6 | 4781 (-) | 1.000 | 0.974 | ggATAAA V$TBP_Q6 | 5387 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 5398 (+) | 1.000 | 0.997 | TTTATat V$TBP_Q6 | 5435 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 5590 (-) | 1.000 | 0.969 | taATAAA V$TBP_Q6 | 5635 (-) | 1.000 | 0.974 | ggATAAA V$TBP_Q6 | 5646 (-) | 1.000 | 0.971 | aaATAAA V$TBP_Q6 | 5954 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 6467 (-) | 1.000 | 0.969 | tgATAAA V$TBP_Q6 | 6511 (-) | 1.000 | 0.971 | aaATAAA V$TBP_Q6 | 6671 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 6762 (+) | 1.000 | 1.000 | TTTATac V$TBP_Q6 | 6828 (+) | 1.000 | 0.969 | TTTATgt V$TBP_Q6 | 7426 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 7436 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 7441 (+) | 1.000 | 0.969 | TTTATgt V$TBP_Q6 | 7799 (-) | 1.000 | 0.971 | agATAAA V$TBP_Q6 | 7897 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 
8063 (+) | 1.000 | 0.971 | TTTATct V$TBP_Q6 | 8494 (+) | 0.923 | 0.926 | TTTCTac V$TBP_Q6 | 8931 (-) | 1.000 | 1.000 | gtATAAA V$TBP_Q6 | 9426 (-) | 1.000 | 0.969 | ccATAAA V$TBP_Q6 | 9486 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 10123 (-) | 1.000 | 0.971 | caATAAA V$TBP_Q6 | 10318 (+) | 1.000 | 0.997 | TTTATat V$TBP_Q6 | 10326 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 10331 (+) | 1.000 | 0.997 | TTTATag V$TBP_Q6 | 10340 (+) | 1.000 | 0.971 | TTTATtg V$TBP_Q6 | 10391 (-) | 1.000 | 0.997 | atATAAA V$TBP_Q6 | 10453 (+) | 1.000 | 0.997 | TTTATat V$TBP_Q6 | 10626 (-) | 0.923 | 0.926 | gtAGAAA V$TBP_Q6 | 10705 (+) | 1.000 | 0.966 | TTTATga V$TBP_Q6 | 10734 (+) | 1.000 | 0.971 | TTTATtg V$TBP_Q6 | 11409 (+) | 1.000 | 0.974 | TTTATtc V$TBP_Q6 | 11472 (+) | 1.000 | 0.971 | TTTATtt V$TBP_Q6 | 11688 (-) | 1.000 | 0.971 | caATAAA V$CREBATF_Q6 | 1239 (-) | 0.893 | 0.897 | tgaCATCAg V$CREBATF_Q6 | 1842 (-) | 1.000 | 0.937 | ggtCGTCAa V$CREBATF_Q6 | 2965 (+) | 1.000 | 0.954 | gTGACGgca V$CREBATF_Q6 | 4533 (-) | 1.000 | 0.948 | atgCGTCAa V$CREBATF_Q6 | 8323 (-) | 1.000 | 0.951 | ctgCGTCAg V$CREBATF_Q6 | 8612 (-) | 1.000 | 0.947 | ttcCGTCAa V$CREBATF_Q6 | 11317 (+) | 1.000 | 0.947 | gTGACGacg V$CREBATF_Q6 | 11332 (+) | 1.000 | 0.938 | aTGACGatg V$KROX_Q6 | 1494 (-) | 1.000 | 0.896 | gggggGGGGGaggg V$KROX_Q6 | 9682 (-) | 1.000 | 0.879 | ggaggGGGGGtggt V$PEBP_Q6 | 932 (+) | 1.000 | 0.921 | cgaaACCACaagtta V$PEBP_Q6 | 3637 (+) | 1.000 | 0.891 | acctACCACatggtg V$PEBP_Q6 | 4140 (-) | 1.000 | 0.886 | ctaggtGTGGTgcct V$PEBP_Q6 | 4437 (+) | 1.000 | 0.936 | aaaaACCACacaatt V$PEBP_Q6 | 4567 (-) | 0.910 | 0.891 | acagttGCGGTtaga V$PEBP_Q6 | 5044 (-) | 1.000 | 0.925 | attgctGTGGTctaa V$PEBP_Q6 | 5251 (-) | 1.000 | 0.932 | tcacatGTGGTcaca V$PEBP_Q6 | 7108 (+) | 1.000 | 0.911 | gtgtACCACatgtgt V$PEBP_Q6 | 8737 (-) | 1.000 | 0.907 | cggggtGTGGTggcc V$PEBP_Q6 | 8894 (+) | 1.000 | 0.918 | acgtACCACaaacag V$PEBP_Q6 | 8970 (-) | 1.000 | 0.885 | ccaggtGTGGTggta V$PEBP_Q6 | 9570 (-) | 1.000 | 0.885 | 
ctgggtGTGGTggta V$PEBP_Q6 | 11251 (-) | 1.000 | 0.963 | aactctGTGGTgtgc V$CHCH_01 | 185 (-) | 0.997 | 0.994 | tACCCG V$CHCH_01 | 782 (+) | 0.997 | 0.997 | CGGGTg V$CHCH_01 | 1572 (+) | 1.000 | 0.997 | CGGGGa V$CHCH_01 | 1682 (+) | 0.997 | 0.997 | CGGGTg V$CHCH_01 | 3043 (-) | 0.997 | 0.997 | cACCCG V$CHCH_01 | 3047 (+) | 0.998 | 0.991 | CGGGAc V$CHCH_01 | 3532 (-) | 0.994 | 0.987 | gGCCCG V$CHCH_01 | 3723 (+) | 0.997 | 0.997 | CGGGTg V$CHCH_01 | 4598 (-) | 0.994 | 0.991 | tGCCCG V$CHCH_01 | 4827 (+) | 1.000 | 0.989 | CGGGGt V$CHCH_01 | 5025 (+) | 0.998 | 0.991 | CGGGAc V$CHCH_01 | 5895 (-) | 0.994 | 0.987 | gGCCCG V$CHCH_01 | 6203 (+) | 1.000 | 0.989 | CGGGGt V$CHCH_01 | 7182 (-) | 0.998 | 0.991 | gTCCCG V$CHCH_01 | 7196 (+) | 1.000 | 1.000 | CGGGGg V$CHCH_01 | 8512 (+) | 0.998 | 0.991 | CGGGAc V$CHCH_01 | 8737 (+) | 1.000 | 0.989 | CGGGGt V$CHCH_01 | 9123 (+) | 0.998 | 0.995 | CGGGAa V$CHCH_01 | 9628 (+) | 0.998 | 0.998 | CGGGAg V$CHCH_01 | 10067 (+) | 0.997 | 0.990 | CGGGTc V$CHCH_01 | 10697 (-) | 0.997 | 0.997 | cACCCG V$CHCH_01 | 11870 (+) | 1.000 | 1.000 | CGGGGg V$CHCH_01 | 11917 (+) | 0.994 | 0.987 | CGGGCc V$TAL1_Q6 | 5127 (-) | 1.000 | 0.991 | aacaGATGGt V$TAL1_Q6 | 6036 (+) | 1.000 | 0.983 | cCCATCtggt V$TAL1_Q6 | 6845 (+) | 0.977 | 0.976 | gCCAACtggt V$TAL1_Q6 | 7511 (-) | 1.000 | 0.975 | gtcaGATGGc V$DEC_Q1 | 17 (+) | 0.987 | 0.918 | gtaCATGTgaatg V$DEC_Q1 | 1391 (+) | 0.973 | 0.920 | gggCAGGTgagga V$DEC_Q1 | 1859 (+) | 0.894 | 0.905 | agcCAACTgaaga V$DEC_Q1 | 2772 (+) | 0.987 | 0.917 | ccgCATGTgagat V$DEC_Q1 | 5168 (-) | 1.000 | 0.921 | ctttcACTTGtga V$DEC_Q1 | 5248 (-) | 0.987 | 0.938 | ggttcACATGtgg V$DEC_Q1 | 6942 (+) | 1.000 | 0.938 | accCAAGTgatac V$DEC_Q1 | 10887 (-) | 0.987 | 0.905 | actgcACATGcgc V$PBX_Q3 | 132 (-) | 1.000 | 0.919 | gctcagtCAATC V$PBX_Q3 | 8611 (-) | 1.000 | 0.931 | attccgtCAATC V$PBX_Q3 | 11199 (+) | 1.000 | 0.931 | GATTGattatta V$AIRE_01 | 102 (+) | 0.891 | 0.826 | cctgggggTGGCCagtgaggctggct V$AIRE_01 | 181 (-) | 0.891 | 
0.844 | ttactacccgtgcCTCCCacaccacc V$AIRE_01 | 182 (-) | 0.907 | 0.858 | tactacccgtgccTCCCAcaccacca V$AIRE_01 | 187 (-) | 0.952 | 0.829 | cccgtgcctcccaCACCAccatgcct V$AIRE_01 | 190 (-) | 0.952 | 0.879 | gtgcctcccacacCACCAtgcctggt V$AIRE_01 | 1072 (+) | 0.931 | 0.893 | agatgtgtAGGTCattttggtcaact V$AIRE_01 | 1271 (+) | 0.907 | 0.855 | gtgtactcTGGGAtgacagggacctg V$AIRE_01 | 1272 (+) | 0.940 | 0.862 | tgtactctGGGATgacagggacctgg V$AIRE_01 | 1308 (-) | 0.891 | 0.845 | aagagtcccctaaCTCCCttctcact V$AIRE_01 | 1338 (+) | 0.906 | 0.836 | agggcttaAGGCTtgcctgggtcacc V$AIRE_01 | 1347 (-) | 0.925 | 0.845 | ggcttgcctgggtCACCTgatgctcg V$AIRE_01 | 1378 (+) | 0.960 | 0.853 | agtgggcgAGGTAgggcaggtgagga V$AIRE_01 | 1383 (+) | 0.891 | 0.834 | gcgaggtaGGGCAggtgaggagagtg V$AIRE_01 | 1482 (+) | 0.891 | 0.849 | ctggcagaGGGAGgggggggggaggg V$AIRE_01 | 1566 (+) | 0.927 | 0.878 | aatctacgGGGAAtgcatggagagcc V$AIRE_01 | 1597 (+) | 0.904 | 0.850 | tgggtttgGGGCTcagggggtctcgt V$AIRE_01 | 1628 (-) | 0.929 | 0.907 | gcatgtccagcctTTCCTagtgctaa V$AIRE_01 | 1728 (-) | 0.929 | 0.874 | gctcatcctgaacTTCCTatagaaca V$AIRE_01 | 1775 (-) | 0.925 | 0.856 | tgtcctcctggagCACCTgccattca V$AIRE_01 | 1808 (+) | 0.952 | 0.857 | catcagggTGGTGttcagggaagaca V$AIRE_01 | 1935 (+) | 0.933 | 0.869 | tgtagcctTGGCTaggctggaacttt V$AIRE_01 | 2016 (+) | 0.907 | 0.889 | ctgagtgcTGGGAttaaaggagtgta V$AIRE_01 | 2675 (-) | 0.942 | 0.925 | cttaacccgttgcATCCTggtcacac V$AIRE_01 | 3031 (-) | 0.886 | 0.857 | cgacttccgctccACCCGggacatac V$AIRE_01 | 3232 (-) | 0.907 | 0.865 | agtcagccaacatTCCCAgcccactt V$AIRE_01 | 3251 (+) | 1.000 | 0.894 | cccacttaTGGTTttcttggtgatgc V$AIRE_01 | 4334 (+) | 0.925 | 0.870 | tgacatccAGGTGgtggaggcaggag V$AIRE_01 | 4976 (-) | 0.893 | 0.858 | tcagcccctacctGTCCGaatggcag V$AIRE_01 | 5007 (-) | 0.933 | 0.880 | gtattacccattaAGCCAcgggactg V$AIRE_01 | 5024 (+) | 0.940 | 0.883 | acgggactGGGATgctcaggattgct V$AIRE_01 | 5277 (+) | 0.906 | 0.868 | ccaacaacAGGCTcagatgggctagc V$AIRE_01 | 5492 (-) | 0.907 | 0.830 | 
cccttccctctgcTCCCAggtttccc V$AIRE_01 | 5500 (-) | 0.927 | 0.844 | tctgctcccaggtTTCCCtgcctttc V$AIRE_01 | 5510 (-) | 0.929 | 0.868 | ggtttccctgcctTTCCTcctatcca V$AIRE_01 | 5748 (+) | 0.900 | 0.877 | agacgcaaAGGACctgtagggtaaaa V$AIRE_01 | 6177 (-) | 0.923 | 0.866 | gggttcccagagaCACCCatttaagc V$AIRE_01 | 6187 (-) | 0.899 | 0.855 | agacacccatttaAGCCGgggtatgc V$AIRE_01 | 6753 (-) | 0.987 | 0.949 | cactatccctttaTACCAtgaacaaa V$AIRE_01 | 6801 (-) | 0.960 | 0.901 | ttcctcccctcatTACCTtctccctt V$AIRE_01 | 6894 (+) | 0.891 | 0.856 | agacaagaGGGCAtaagtggagaaca V$AIRE_01 | 6903 (+) | 0.920 | 0.852 | ggcataagTGGAGaacaaggaattga V$AIRE_01 | 6959 (+) | 0.969 | 0.899 | cattattcTGGATggaaaggaacgac V$AIRE_01 | 7178 (-) | 0.891 | 0.830 | gatggtcccgaacGGCCAcgggggtg V$AIRE_01 | 7694 (+) | 0.891 | 0.857 | cagaacttGGGAGgtagaggcagaag V$AIRE_01 | 8112 (-) | 0.893 | 0.856 | cactctcccacccTGCCTggtctgct V$AIRE_01 | 8199 (-) | 0.931 | 0.866 | ctgtttcccctctGACCTcagttctg V$AIRE_01 | 8356 (-) | 0.906 | 0.843 | ccaacccctgtgcAGCCTgctgctct V$AIRE_01 | 8395 (-) | 0.904 | 0.882 | ctgaagccaagccAGCCCggcctgta V$AIRE_01 | 8667 (+) | 0.966 | 0.895 | caaaccacCGGTTaggaaggtaatta V$AIRE_01 | 8730 (+) | 0.890 | 0.842 | gtaacgtcGGGGTgtggtggcccaca V$AIRE_01 | 8750 (-) | 0.891 | 0.841 | cccacaccttgtcCTCCCagcacctg V$AIRE_01 | 8905 (+) | 0.929 | 0.876 | acagattcAGGAAtactcggaattca V$AIRE_01 | 8955 (+) | 0.891 | 0.840 | aattagtaGGGAGaaccaggtgtggt V$AIRE_01 | 8983 (-) | 0.956 | 0.900 | tacacacctttaaTTCCAgcacatgg V$AIRE_01 | 8998 (+) | 0.920 | 0.845 | ccagcacaTGGAGgcaggggcaggca V$AIRE_01 | 9041 (+) | 0.958 | 0.861 | gggcagccTGGTCtacatggcaagtt V$AIRE_01 | 9061 (+) | 0.906 | 0.859 | caagttccAGGCTagccagggcagca V$AIRE_01 | 9061 (-) | 0.933 | 0.861 | caagttccaggctAGCCAgggcagca V$AIRE_01 | 9116 (-) | 0.973 | 0.904 | aaaaaaccgggaaAACCTtcagtttt V$AIRE_01 | 9176 (-) | 0.958 | 0.881 | tgaacgccacagaGACCActcaagag V$AIRE_01 | 9563 (+) | 0.920 | 0.876 | tcggtttcTGGGTgtggtggtatagc V$AIRE_01 | 9582 (-) | 0.940 | 0.893 | 
gtatagccctgtaATCCCagcatttg V$AIRE_01 | 9583 (-) | 0.907 | 0.872 | tatagccctgtaaTCCCAgcatttgg V$AIRE_01 | 9599 (+) | 0.927 | 0.869 | cagcatttGGGAAgctgaggtgggag V$AIRE_01 | 9611 (+) | 0.907 | 0.856 | agctgaggTGGGAggatcgggagttc V$AIRE_01 | 9612 (+) | 0.891 | 0.828 | gctgaggtGGGAGgatcgggagttca V$AIRE_01 | 9621 (+) | 0.891 | 0.854 | ggaggatcGGGAGttcaaggtcagct V$AIRE_01 | 9630 (+) | 0.931 | 0.872 | ggagttcaAGGTCagcttgggctact V$AIRE_01 | 9681 (+) | 0.923 | 0.837 | tggaggggGGGTGgtggtggtggtgg V$AIRE_01 | 9684 (+) | 0.952 | 0.842 | agggggggTGGTGgtggtggtggtgg V$AIRE_01 | 9687 (+) | 0.952 | 0.845 | gggggtggTGGTGgtggtggtggtgg V$AIRE_01 | 9690 (+) | 0.952 | 0.853 | ggtggtggTGGTGgtggtggtggtgt V$AIRE_01 | 9693 (+) | 0.952 | 0.884 | ggtggtggTGGTGgtggtggtgtaaa V$AIRE_01 | 9962 (+) | 1.000 | 0.927 | tatgtttcTGGTTcctatggggatca V$AIRE_01 | 9992 (+) | 0.931 | 0.867 | agggtattAGGTCctctgggactgga V$AIRE_01 | 10000 (+) | 0.907 | 0.837 | aggtcctcTGGGActggaggtataga V$AIRE_01 | 10059 (+) | 0.886 | 0.849 | aactgaacCGGGTcttctggaagggc V$AIRE_01 | 10597 (+) | 0.893 | 0.855 | tatcaaaaAGGAGtgtgtgggaggga V$AIRE_01 | 10775 (+) | 0.893 | 0.847 | aactctgtCGGACccagggggatgtg V$AIRE_01 | 11265 (+) | 0.929 | 0.882 | ccaagaaaAGGAAattaaggcatagg V$AIRE_01 | 11617 (-) | 0.929 | 0.878 | ggggtaccagaagGACCCtcgtttaa V$AIRE_01 | 11703 (-) | 0.891 | 0.877 | ctgtcaccccaacTGCCCtccaccat V$AIRE_01 | 11802 (+) | 0.925 | 0.837 | agattggaAGGTGgcccagggttggg V$AIRE_01 | 11844 (+) | 0.890 | 0.833 | tggggggtGGGGTatgggggtaggtc V$AIRE_01 | 11853 (+) | 0.958 | 0.846 | gggtatggGGGTAggtccgggggagg V$AIRE_02 | 643 (+) | 1.000 | 0.639 | atTTATTcatttattatatataagt V$AIRE_02 | 830 (-) | 0.679 | 0.625 | tactatataaaacaagttAAATAaa V$AIRE_02 | 1047 (-) | 0.673 | 0.621 | agcaaaaatcaacaacttTAAAAta V$AIRE_02 | 2404 (-) | 0.802 | 0.622 | tccttaaaaaaaacaaaaAATCAca V$AIRE_02 | 3351 (-) | 0.802 | 0.639 | tcatcagacagaccactgAATAGcc V$AIRE_02 | 3389 (-) | 0.623 | 0.618 | ccctgaggatctccagttATAAAag V$AIRE_02 | 3758 (+) | 0.877 | 0.642 | 
caTTCTTaaaaggcaatgctccttc V$AIRE_02 | 4248 (+) | 0.802 | 0.674 | ttGTATTacttggttgttaggatgc V$AIRE_02 | 4251 (+) | 0.745 | 0.621 | taTTACTtggttgttaggatgctta V$AIRE_02 | 4430 (-) | 0.802 | 0.700 | aaaaaaaaaaaaccacacAATTAat V$AIRE_02 | 4434 (-) | 0.802 | 0.639 | aaaaaaaaccacacaattAATATtc V$AIRE_02 | 5001 (-) | 0.481 | 0.626 | gaccttgtattacccattAAGCCac V$AIRE_02 | 5314 (+) | 0.877 | 0.683 | gcTTCTTcatcggttcagtgcgccg V$AIRE_02 | 5966 (+) | 0.877 | 0.704 | tgTTTTTaagtggtgataacttctg V$AIRE_02 | 6181 (-) | 0.481 | 0.626 | tcccagagacacccatttAAGCCgg V$AIRE_02 | 7056 (+) | 0.877 | 0.681 | ttTTCTTaatttgtttttgtttgtg V$AIRE_02 | 7413 (+) | 0.481 | 0.638 | gaACTTTtaaaggtttatttgtttt V$AIRE_02 | 7414 (+) | 0.679 | 0.619 | aaCTTTTaaaggtttatttgttttt V$AIRE_02 | 7782 (-) | 0.783 | 0.639 | aacaaaaattatcccaaaGATAAag V$AIRE_02 | 7817 (-) | 0.472 | 0.697 | actccaaaacaaacaattAACCCac V$AIRE_02 | 7922 (+) | 0.585 | 0.623 | agGTATCaattcgtttgtgtgttta V$AIRE_02 | 8838 (-) | 0.877 | 0.651 | accaaaacagaacggtttAAAAAtt V$AIRE_02 | 9179 (-) | 0.679 | 0.619 | acgccacagagaccactcAAGAGag V$AIRE_02 | 9413 (-) | 1.000 | 0.731 | ctagaaaactatcccataAATAAca V$AIRE_02 | 9824 (+) | 0.673 | 0.619 | aaTTTTAgaagggataattttaaag V$AIRE_02 | 10119 (-) | 0.877 | 0.678 | tccccaataaaattagttAAGAAat V$AIRE_02 | 10304 (+) | 0.877 | 0.666 | atTTTTTaactgtttttatattttt V$AIRE_02 | 10325 (+) | 1.000 | 0.636 | ttTTATTttatagtttttattgttt V$AIRE_02 | 10326 (+) | 0.679 | 0.655 | ttTATTTtatagtttttattgtttt V$AIRE_02 | 10733 (+) | 1.000 | 0.651 | ttTTATTggggggtttcagtgatcc V$AIRE_02 | 11192 (+) | 0.877 | 0.657 | gtTTCTTgattgattattaatttgt V$AIRE_02 | 11203 (+) | 1.000 | 0.672 | gaTTATTaatttgtttgaactgccc V$AIRE_02 | 11665 (-) | 0.679 | 0.665 | caattttcataaccaaaaAAGACca V$AIRE_02 | 11675 (-) | 0.481 | 0.651 | aaccaaaaaagaccaataAAGTTaa V$DEAF1_01 | 1708 (+) | 1.000 | 0.821 | gcaccctctgccttTTCCGggctca V$HELIOSA_02 | 224 (-) | 1.000 | 0.998 | tttTTCCTttt V$HELIOSA_02 | 422 (-) | 0.881 | 0.902 | cttGTCCAaaa V$HELIOSA_02 | 767 (+) | 0.965 | 
0.851 | ctcTGGAAgag V$HELIOSA_02 | 806 (-) | 0.877 | 0.885 | tttCTCCAgcc V$HELIOSA_02 | 943 (+) | 1.000 | 0.901 | gttAGGAAcaa V$HELIOSA_02 | 1133 (-) | 1.000 | 0.907 | gtgTTCCTaaa V$HELIOSA_02 | 1190 (+) | 1.000 | 0.858 | cccAGGAAagc V$HELIOSA_02 | 1559 (+) | 0.965 | 0.847 | ttcTGGAAatc V$HELIOSA_02 | 1638 (-) | 1.000 | 0.870 | cctTTCCTagt V$HELIOSA_02 | 1719 (-) | 0.942 | 0.931 | cttTTCCGggc V$HELIOSA_02 | 1822 (+) | 0.942 | 0.851 | tcaGGGAAgac V$HELIOSA_02 | 1956 (-) | 1.000 | 0.876 | actTTCCTttt V$HELIOSA_02 | 2146 (-) | 1.000 | 0.883 | ctcTTCCTtcc V$HELIOSA_02 | 2160 (-) | 1.000 | 0.885 | ctcTTCCTttc V$HELIOSA_02 | 2165 (-) | 1.000 | 0.872 | cctTTCCTtct V$HELIOSA_02 | 2173 (-) | 1.000 | 0.877 | tctTTCCTtct V$HELIOSA_02 | 2204 (-) | 1.000 | 0.890 | ttcTTCCTttc V$HELIOSA_02 | 2379 (-) | 0.942 | 0.948 | gttTTCCCaga V$HELIOSA_02 | 2400 (-) | 1.000 | 0.878 | gctTTCCTtaa V$HELIOSA_02 | 2510 (+) | 0.942 | 0.954 | aaaGGGAAaaa V$HELIOSA_02 | 2702 (-) | 0.912 | 0.917 | cttCTCCTtcc V$HELIOSA_02 | 2712 (-) | 0.942 | 0.848 | ctcTTCCCaaa V$HELIOSA_02 | 2784 (-) | 1.000 | 0.892 | ttcTTCCTtag V$HELIOSA_02 | 2903 (-) | 0.894 | 0.910 | tttATCCAcat V$HELIOSA_02 | 3110 (-) | 0.965 | 0.846 | aatTTCCAtgc V$HELIOSA_02 | 3324 (-) | 1.000 | 0.985 | cttTTCCTgat V$HELIOSA_02 | 3836 (-) | 0.965 | 0.847 | aatTTCCAttc V$HELIOSA_02 | 4036 (-) | 0.965 | 0.869 | ataTTCCAgca V$HELIOSA_02 | 4113 (-) | 0.942 | 0.948 | attTTCCCaga V$HELIOSA_02 | 4162 (-) | 0.929 | 0.851 | gtaATCCTagt V$HELIOSA_02 | 4687 (+) | 1.000 | 0.856 | ggcAGGAAact V$HELIOSA_02 | 4777 (+) | 0.894 | 0.902 | gtcTGGATaaa V$HELIOSA_02 | 4881 (+) | 0.965 | 0.865 | ctcTGGAAcat V$HELIOSA_02 | 4895 (-) | 1.000 | 0.899 | atgTTCCTtgg V$HELIOSA_02 | 5120 (+) | 1.000 | 0.879 | aaaAGGAAaca V$HELIOSA_02 | 5457 (-) | 0.942 | 0.933 | cttTTCCCcac V$HELIOSA_02 | 5472 (-) | 1.000 | 0.875 | tatTTCCTgat V$HELIOSA_02 | 5520 (-) | 1.000 | 0.862 | cctTTCCTcct V$HELIOSA_02 | 5580 (+) | 0.929 | 0.942 | taaAGGATaat V$HELIOSA_02 | 5631 (+) | 0.870 | 
0.896 | actGGGATaaa V$HELIOSA_02 | 5935 (+) | 0.877 | 0.893 | cctTGGAGaaa V$HELIOSA_02 | 6138 (-) | 0.965 | 0.861 | gtcTTCCAtgc V$HELIOSA_02 | 6257 (-) | 0.858 | 0.880 | tttGTCCCatc V$HELIOSA_02 | 6460 (-) | 1.000 | 0.875 | tatTTCCTgat V$HELIOSA_02 | 6798 (-) | 1.000 | 0.873 | ctcTTCCTccc V$HELIOSA_02 | 6817 (-) | 0.853 | 0.883 | cttCTCCCttt V$HELIOSA_02 | 6908 (+) | 0.877 | 0.893 | aagTGGAGaac V$HELIOSA_02 | 6998 (+) | 0.965 | 0.965 | agaTGGAAaag V$HELIOSA_02 | 7018 (-) | 1.000 | 0.867 | tgtTTCCTgca V$HELIOSA_02 | 7246 (+) | 1.000 | 0.982 | ctcAGGAAaaa V$HELIOSA_02 | 7327 (+) | 0.965 | 0.868 | gtcTGGAAcaa V$HELIOSA_02 | 7715 (+) | 0.929 | 0.850 | agaAGGATcag V$HELIOSA_02 | 7789 (-) | 0.870 | 0.896 | attATCCCaaa V$HELIOSA_02 | 8144 (+) | 0.942 | 0.950 | tgaGGGAAaac V$HELIOSA_02 | 8757 (-) | 0.916 | 0.910 | cttGTCCTccc V$HELIOSA_02 | 8910 (+) | 1.000 | 0.900 | ttcAGGAAtac V$HELIOSA_02 | 8960 (+) | 0.853 | 0.879 | gtaGGGAGaac V$HELIOSA_02 | 9121 (+) | 0.942 | 0.941 | accGGGAAaac V$HELIOSA_02 | 9831 (+) | 0.870 | 0.890 | gaaGGGATaat V$HELIOSA_02 | 9873 (+) | 1.000 | 0.873 | atgAGGAAatc V$HELIOSA_02 | 10493 (+) | 0.929 | 0.856 | tcaAGGATcaa V$HELIOSA_02 | 10588 (-) | 1.000 | 0.878 | aatTTCCTata V$HELIOSA_02 | 10637 (+) | 1.000 | 0.880 | ctgAGGAAgac V$HELIOSA_02 | 10843 (+) | 1.000 | 0.878 | cgcAGGAAgaa V$HELIOSA_02 | 11091 (-) | 0.965 | 0.951 | attTTCCActg V$HELIOSA_02 | 11270 (+) | 1.000 | 0.881 | aaaAGGAAatt V$HELIOSA_02 | 11480 (-) | 0.942 | 0.946 | tttTTCCCttg V$HELIOSA_02 | 11521 (-) | 0.858 | 0.883 | gttGTCCGtag V$HELIOSA_02 | 11787 (+) | 0.916 | 0.929 | gtaAGGACaaa P$AGP1_01 | 6782 (+) | 1.000 | 0.993 | gcAGATCtaa V$SRF_Q5_02 | 11534 (-) | 0.926 | 0.901 | aatcCCACAtatggtagat V$HES1_Q2 | 1614 (+) | 1.000 | 0.976 | gggtcTCGTGgacag V$HES1_Q2 | 5207 (+) | 0.989 | 0.943 | gcctcTTGTGcttct V$HES1_Q2 | 10248 (+) | 0.989 | 0.947 | caggcTTGTGcagga V$HES1_Q2 | 10800 (+) | 0.989 | 0.958 | gagccTTGTGcatct V$HES1_Q2 | 10931 (+) | 0.989 | 0.955 | gcctcTTGTGgacgg V$HMGIY_Q3 | 
210 (+) | 0.890 | 0.904 | cctggTATTTttttt V$HMGIY_Q3 | 211 (+) | 0.960 | 0.933 | ctggtATTTTttttt V$HMGIY_Q3 | 328 (-) | 1.000 | 0.879 | gctggAAATTccctc V$HMGIY_Q3 | 901 (+) | 0.908 | 0.883 | ttttcACTTTctttc V$HMGIY_Q3 | 989 (-) | 0.905 | 0.902 | ttaatAATTTtcatg V$HMGIY_Q3 | 990 (+) | 0.960 | 0.869 | taataATTTTcatgt V$HMGIY_Q3 | 1080 (+) | 0.960 | 0.883 | aggtcATTTTggtca V$HMGIY_Q3 | 1891 (+) | 0.890 | 0.864 | ttttaTATTTatttt V$HMGIY_Q3 | 1896 (+) | 0.960 | 0.917 | tatttATTTTattat V$HMGIY_Q3 | 1904 (+) | 0.960 | 0.887 | ttattATTTTatttg V$HMGIY_Q3 | 1951 (+) | 0.908 | 0.880 | ctggaACTTTccttt V$HMGIY_Q3 | 2358 (+) | 0.890 | 0.878 | acttgTATTTatttt V$HMGIY_Q3 | 2416 (-) | 0.867 | 0.882 | acaaaAAATCacagg V$HMGIY_Q3 | 2515 (-) | 0.960 | 0.906 | gaaaaAAAATtacat V$HMGIY_Q3 | 2516 (-) | 1.000 | 0.981 | aaaaaAAATTacata V$HMGIY_Q3 | 2558 (-) | 1.000 | 0.899 | ttgtgAAATTtccgt V$HMGIY_Q3 | 2559 (+) | 1.000 | 0.892 | tgtgaAATTTccgta V$HMGIY_Q3 | 2656 (+) | 0.908 | 0.895 | aatgcACTTTgatga V$HMGIY_Q3 | 3105 (-) | 0.905 | 0.870 | atactAATTTccatg V$HMGIY_Q3 | 3125 (-) | 0.905 | 0.879 | atattAATTTacatt V$HMGIY_Q3 | 3607 (+) | 0.928 | 0.863 | tctgcAGTTTtcaaa V$HMGIY_Q3 | 3830 (-) | 1.000 | 0.960 | ataatAAATTtccat V$HMGIY_Q3 | 3831 (-) | 0.905 | 0.897 | taataAATTTccatt V$HMGIY_Q3 | 4085 (+) | 0.960 | 0.896 | actatATTTTactga V$HMGIY_Q3 | 4219 (-) | 0.960 | 0.936 | ttaaaAAAATaaatt V$HMGIY_Q3 | 4224 (-) | 1.000 | 0.910 | aaaatAAATTctcag V$HMGIY_Q3 | 4432 (-) | 0.851 | 0.874 | aaaaaAAAAAccaca V$HMGIY_Q3 | 4475 (+) | 0.960 | 0.922 | ccagcATTTTtcttt V$HMGIY_Q3 | 4490 (-) | 0.851 | 0.863 | tgaacAAAAAacaaa V$HMGIY_Q3 | 4635 (-) | 0.960 | 0.912 | gtagcAAAATcccgg V$HMGIY_Q3 | 4658 (+) | 0.908 | 0.873 | ctcggACTTTgtttt V$HMGIY_Q3 | 4808 (+) | 0.875 | 0.875 | tctggCATTTgttac V$HMGIY_Q3 | 5379 (+) | 0.928 | 0.873 | cttttAGTTTtattt V$HMGIY_Q3 | 5385 (+) | 0.960 | 0.860 | gttttATTTTacctt V$HMGIY_Q3 | 5397 (+) | 0.890 | 0.874 | ctttaTATTTtttca V$HMGIY_Q3 | 5589 (-) | 0.960 | 0.900 | ataatAAAATataat 
V$HMGIY_Q3 | 5640 (-) | 0.960 | 0.877 | aagacAAAATaaaaa V$HMGIY_Q3 | 5778 (-) | 0.865 | 0.868 | agaaaAATATgaaat V$HMGIY_Q3 | 5806 (-) | 0.960 | 0.874 | aaaaaAAAATagccc V$HMGIY_Q3 | 5807 (-) | 0.890 | 0.896 | aaaaaAAATAgccct V$HMGIY_Q3 | 5907 (+) | 0.928 | 0.891 | agagtAGTTTgtttc V$HMGIY_Q3 | 5939 (-) | 0.908 | 0.875 | ggagaAAAGTaaaat V$HMGIY_Q3 | 5944 (-) | 0.960 | 0.864 | aaagtAAAATtttat V$HMGIY_Q3 | 5945 (-) | 1.000 | 0.868 | aagtaAAATTttatt V$HMGIY_Q3 | 5946 (+) | 1.000 | 0.910 | agtaaAATTTtattt V$HMGIY_Q3 | 6466 (-) | 0.960 | 0.876 | ctgatAAAATgctca V$HMGIY_Q3 | 6603 (+) | 0.890 | 0.885 | ggtgcTATTTgctaa V$HMGIY_Q3 | 6664 (+) | 0.960 | 0.874 | tgaagATTTTattta V$HMGIY_Q3 | 6832 (+) | 0.890 | 0.880 | tgtgcTATTTactgc V$HMGIY_Q3 | 7046 (+) | 0.960 | 0.862 | tgactATTTTttttc V$HMGIY_Q3 | 7058 (+) | 1.000 | 0.913 | ttcttAATTTgtttt V$HMGIY_Q3 | 7890 (+) | 0.851 | 0.865 | tttggTTTTTatttt V$HMGIY_Q3 | 7895 (+) | 0.960 | 0.890 | tttttATTTTgatat V$HMGIY_Q3 | 8547 (-) | 0.960 | 0.886 | ccgagAAAATgaaca V$HMGIY_Q3 | 9712 (-) | 1.000 | 0.864 | gtgtaAAATTgatct V$HMGIY_Q3 | 9780 (+) | 1.000 | 0.876 | agaacAATTTtgttc V$HMGIY_Q3 | 9885 (+) | 0.905 | 0.889 | atttaAAATTctttt V$HMGIY_Q3 | 9914 (+) | 0.960 | 0.885 | actggATTTTtgcct V$HMGIY_Q3 | 9930 (+) | 1.000 | 0.879 | tgggtAATTTgcatg V$HMGIY_Q3 | 10122 (-) | 0.960 | 0.886 | ccaatAAAATtagtt V$HMGIY_Q3 | 10123 (-) | 1.000 | 0.868 | caataAAATTagtta V$HMGIY_Q3 | 10135 (-) | 1.000 | 0.912 | ttaagAAATTattat V$HMGIY_Q3 | 10298 (+) | 0.875 | 0.870 | tcggaCATTTtttaa V$HMGIY_Q3 | 10318 (+) | 0.960 | 0.873 | tttatATTTTtattt V$HMGIY_Q3 | 10391 (-) | 0.960 | 0.904 | atataAAAATgaacg V$HMGIY_Q3 | 10583 (+) | 1.000 | 0.944 | ggttcAATTTcctat V$HMGIY_Q3 | 11085 (+) | 1.000 | 0.906 | ctgggAATTTtccac V$HMGIY_Q3 | 11153 (-) | 0.960 | 0.898 | tgatgAAAATgaagc V$HMGIY_Q3 | 11205 (+) | 1.000 | 0.918 | ttattAATTTgtttg V$HMGIY_Q3 | 11271 (-) | 1.000 | 0.902 | aaaggAAATTaaggc V$HMGIY_Q3 | 11391 (+) | 1.000 | 0.911 | tattaAATTTtgtac V$HMGIY_Q3 | 11444 (+) | 
0.960 | 0.860 | tatctATTTTggtga V$HMGIY_Q3 | 11461 (+) | 0.905 | 0.875 | cagtgAAATTtttta V$HMGIY_Q3 | 11462 (+) | 1.000 | 0.986 | agtgaAATTTtttat V$HMGIY_Q3 | 11469 (+) | 0.890 | 0.867 | tttttTATTTgtttt V$HMGIY_Q3 | 11497 (+) | 1.000 | 0.872 | ggttgAATTTttgtg V$HMGIY_Q3 | 11498 (+) | 0.960 | 0.906 | gttgaATTTTtgtgg V$HMGIY_Q3 | 11687 (-) | 0.908 | 0.868 | ccaatAAAGTtaatc V$HNF1_Q6_01 | 984 (-) | 1.000 | 0.942 | cacagTTAATaattttcatgt V$HNF1_Q6_01 | 1043 (+) | 0.910 | 0.853 | aagcagcaaaaATCAAcaact V$HNF1_Q6_01 | 1887 (-) | 0.933 | 0.860 | cccatTTTATatttattttat V$HNF1_Q6_01 | 3409 (-) | 1.000 | 0.892 | aaaagTTAATgattctggttg V$HNF1_Q6_01 | 3901 (+) | 0.933 | 0.844 | cacaggtattaATAAAgcttt V$HNF1_Q6_01 | 4053 (+) | 1.000 | 0.902 | aataggcaataATTAAaatat V$HNF1_Q6_01 | 5581 (+) | 0.933 | 0.891 | aaaggataataATAAAatata V$HNF1_Q6_01 | 5775 (+) | 0.910 | 0.845 | gagagaaaaatATGAAataat V$HNF1_Q6_01 | 5783 (+) | 1.000 | 0.868 | aatatgaaataATTAAataca V$HNF1_Q6_01 | 7821 (+) | 1.000 | 0.919 | caaaacaaacaATTAAcccac V$HNF1_Q6_01 | 8849 (+) | 1.000 | 0.890 | acggtttaaaaATTAAgactc V$HNF1_Q6_01 | 10388 (+) | 0.910 | 0.879 | tccatataaaaATGAAcgagg V$HNF1_Q6_01 | 11196 (-) | 0.910 | 0.869 | cttgaTTGATtattaatttgt V$HNF1_Q6_01 | 11691 (-) | 1.000 | 0.865 | taaagTTAATcactgtcaccc V$HNF3_Q6_01 | 226 (+) | 0.946 | 0.918 | tttccTTTTTgtcttttt V$HNF3_Q6_01 | 503 (+) | 0.988 | 0.950 | ttataTATTTgtcttctg V$HNF3_Q6_01 | 840 (-) | 0.988 | 0.936 | aacaagttAAATAaatac V$HNF3_Q6_01 | 844 (-) | 0.988 | 0.945 | agttaaatAAATAcgatc V$HNF3_Q6_01 | 873 (-) | 1.000 | 0.947 | aaatgcacAAACAcgaaa V$HNF3_Q6_01 | 951 (-) | 0.988 | 0.946 | caatgatcAAATAtttac V$HNF3_Q6_01 | 1469 (+) | 0.988 | 0.978 | ctcaaTATTTgctctggc V$HNF3_Q6_01 | 1528 (+) | 0.946 | 0.919 | ctgttTTTTTgttctaag V$HNF3_Q6_01 | 1891 (+) | 0.988 | 0.963 | ttttaTATTTattttatt V$HNF3_Q6_01 | 2358 (+) | 0.988 | 0.953 | acttgTATTTattttgcc V$HNF3_Q6_01 | 2406 (-) | 1.000 | 0.929 | cttaaaaaAAACAaaaaa V$HNF3_Q6_01 | 2410 (-) | 0.946 | 0.941 | aaaaaaacAAAAAatcac 
V$HNF3_Q6_01 | 3592 (+) | 1.000 | 0.934 | tcctaTGTTTgggtttct V$HNF3_Q6_01 | 3804 (-) | 1.000 | 0.918 | aatagaaaAAACAtcgtt V$HNF3_Q6_01 | 4000 (-) | 0.988 | 0.926 | acaacgatAAATAaatat V$HNF3_Q6_01 | 4004 (-) | 0.988 | 0.950 | cgataaatAAATAttctg V$HNF3_Q6_01 | 4495 (-) | 1.000 | 0.921 | aaaaaacaAAACAaaaca V$HNF3_Q6_01 | 5118 (-) | 1.000 | 0.932 | tcaaaaggAAACAgatgg V$HNF3_Q6_01 | 5432 (+) | 0.988 | 0.945 | cacttTATTTgtattgcc V$HNF3_Q6_01 | 5642 (-) | 0.946 | 0.927 | gacaaaatAAAAAgaaga V$HNF3_Q6_01 | 5789 (-) | 0.988 | 0.948 | aaataattAAATAcaaaa V$HNF3_Q6_01 | 5951 (+) | 0.988 | 0.924 | aatttTATTTgccagtgt V$HNF3_Q6_01 | 6250 (+) | 0.988 | 0.926 | ctttcTATTTgtcccatc V$HNF3_Q6_01 | 6586 (-) | 0.988 | 0.966 | cacaaaacAAATAaggtg V$HNF3_Q6_01 | 6603 (+) | 0.988 | 0.942 | ggtgcTATTTgctaaact V$HNF3_Q6_01 | 6649 (-) | 1.000 | 0.938 | cagtctgcAAACAgatga V$HNF3_Q6_01 | 6668 (+) | 0.988 | 0.928 | gatttTATTTagcttttg V$HNF3_Q6_01 | 7045 (+) | 0.988 | 0.928 | ttgacTATTTtttttctt V$HNF3_Q6_01 | 7064 (+) | 0.946 | 0.922 | atttgTTTTTgtttgtgt V$HNF3_Q6_01 | 7068 (+) | 1.000 | 0.959 | gttttTGTTTgtgtatga V$HNF3_Q6_01 | 7209 (-) | 0.946 | 0.922 | aattgaacAAAAAgtagg V$HNF3_Q6_01 | 7423 (+) | 0.988 | 0.957 | aggttTATTTgtttttat V$HNF3_Q6_01 | 7427 (+) | 1.000 | 0.919 | ttattTGTTTttatttta V$HNF3_Q6_01 | 7429 (+) | 0.946 | 0.925 | atttgTTTTTattttatg V$HNF3_Q6_01 | 7581 (-) | 1.000 | 0.929 | atcaggtcAAACAaatgc V$HNF3_Q6_01 | 7773 (-) | 1.000 | 0.929 | aaaaaacaAAACAaaaat V$HNF3_Q6_01 | 7777 (-) | 0.946 | 0.935 | aacaaaacAAAAAttatc V$HNF3_Q6_01 | 7819 (-) | 1.000 | 0.987 | tccaaaacAAACAattaa V$HNF3_Q6_01 | 7890 (+) | 0.946 | 0.920 | tttggTTTTTattttgat V$HNF3_Q6_01 | 7936 (+) | 1.000 | 0.959 | ttgtgTGTTTagttttga V$HNF3_Q6_01 | 7967 (+) | 1.000 | 0.972 | atagtTGTTTgtttttca V$HNF3_Q6_01 | 8182 (+) | 1.000 | 0.971 | gaactTGTTTgctctgcc V$HNF3_Q6_01 | 8333 (+) | 0.988 | 0.931 | gccagTATTTgctggacc V$HNF3_Q6_01 | 8469 (-) | 0.946 | 0.920 | tctaaggcAAAAAtaagg V$HNF3_Q6_01 | 8895 (-) | 1.000 | 0.919 | 
cgtaccacAAACAgattc V$HNF3_Q6_01 | 9108 (-) | 0.946 | 0.925 | agcaaaacAAAAAaccgg V$HNF3_Q6_01 | 9483 (+) | 0.988 | 0.933 | tacttTATTTgcaataga V$HNF3_Q6_01 | 9531 (-) | 1.000 | 0.962 | actagaacAAACAagtcc V$HNF3_Q6_01 | 9915 (+) | 0.946 | 0.923 | ctggaTTTTTgcctatgg V$HNF3_Q6_01 | 10309 (+) | 1.000 | 0.919 | ttaacTGTTTttatattt V$HNF3_Q6_01 | 10319 (+) | 0.946 | 0.926 | ttataTTTTTattttata V$HNF3_Q6_01 | 10756 (+) | 1.000 | 0.926 | ccgtgTGTTTgggcacta V$HNF3_Q6_01 | 10908 (-) | 0.988 | 0.959 | tactatgcAAATAatagt V$HNF3_Q6_01 | 11209 (+) | 1.000 | 0.929 | taattTGTTTgaactgcc V$HNF3_Q6_01 | 11285 (-) | 0.946 | 0.920 | catagggcAAAGAaataa V$HNF3_Q6_01 | 11419 (+) | 0.988 | 0.929 | atttcTATTTagctatcc V$HNF3_Q6_01 | 11469 (+) | 0.988 | 0.969 | tttttTATTTgtttttcc V$IPF1_Q4_01 | 46 (+) | 1.000 | 0.949 | gatgtCATTActtgg V$IPF1_Q4_01 | 60 (-) | 0.929 | 0.908 | ggccaTAATTactgt V$IPF1_Q4_01 | 174 (+) | 1.000 | 0.930 | gctggCATTActacc V$IPF1_Q4_01 | 493 (+) | 1.000 | 0.924 | tgcaaCATTAttata V$IPF1_Q4_01 | 3171 (+) | 1.000 | 0.904 | ctgtaCATTAtatcc V$IPF1_Q4_01 | 3272 (+) | 1.000 | 0.929 | gatgcCATTAtgatt V$IPF1_Q4_01 | 3410 (-) | 1.000 | 0.944 | aaagtTAATGattct V$IPF1_Q4_01 | 4281 (+) | 1.000 | 0.924 | gcacaCATTAgaccc V$IPF1_Q4_01 | 5010 (+) | 1.000 | 0.924 | ttaccCATTAagcca V$IPF1_Q4_01 | 5097 (+) | 0.914 | 0.904 | tgggtTATTAtttgt V$IPF1_Q4_01 | 6806 (+) | 1.000 | 0.953 | cccctCATTAccttc V$IPF1_Q4_01 | 6954 (+) | 1.000 | 0.912 | ctgtcCATTAttctg V$IPF1_Q4_01 | 7256 (+) | 1.000 | 0.952 | agtctCATTAaaata V$IPF1_Q4_01 | 8149 (+) | 1.000 | 0.909 | gaaaaCATTAgcttc V$IPF1_Q4_01 | 9732 (-) | 1.000 | 0.930 | tatgaTAATGtccat V$IPF1_Q4_01 | 10519 (+) | 1.000 | 0.932 | gtagcCATTAtagga V$IPF1_Q4_01 | 10654 (-) | 1.000 | 0.919 | tgtccTAATGtcaac V$IPF1_Q4_01 | 11343 (+) | 1.000 | 0.934 | aatgcCATTAtcgaa V$SOX_Q6 | 398 (-) | 0.877 | 0.793 | tgccaaCAAGGcg V$SOX_Q6 | 945 (-) | 0.901 | 0.847 | taggaaCAATGat V$SOX_Q6 | 1035 (-) | 1.000 | 0.833 | aatcaaCAAAGca V$SOX_Q6 | 1163 (-) | 1.000 | 0.797 | 
ctttttCAAAGaa V$SOX_Q6 | 1532 (+) | 0.781 | 0.778 | ttTTTTGttctaa V$SOX_Q6 | 2196 (+) | 0.781 | 0.791 | ctCTTTCtttctt V$SOX_Q6 | 2602 (-) | 0.781 | 0.799 | attaaaCCAAGag V$SOX_Q6 | 2660 (+) | 1.000 | 0.793 | caCTTTGatgaca V$SOX_Q6 | 2897 (+) | 0.781 | 0.800 | atCTTTTttatcc V$SOX_Q6 | 3661 (-) | 0.781 | 0.782 | aagtaaCACAGac V$SOX_Q6 | 3995 (-) | 0.877 | 0.835 | atgaaaCAACGat V$SOX_Q6 | 4488 (-) | 0.781 | 0.777 | tttgaaCAAAAaa V$SOX_Q6 | 4546 (+) | 0.781 | 0.831 | ctCTTAGttttga V$SOX_Q6 | 4662 (+) | 1.000 | 0.853 | gaCTTTGttttgc V$SOX_Q6 | 5354 (+) | 1.000 | 0.829 | ctCTTTGacaaaa V$SOX_Q6 | 5833 (-) | 0.877 | 0.772 | gtgaaaCAAGGca V$SOX_Q6 | 6587 (-) | 0.781 | 0.800 | acaaaaCAAATaa V$SOX_Q6 | 6822 (+) | 0.781 | 0.773 | ccCTTTTttatgt V$SOX_Q6 | 6912 (-) | 0.877 | 0.835 | ggagaaCAAGGaa V$SOX_Q6 | 7210 (-) | 0.781 | 0.799 | attgaaCAAAAag V$SOX_Q6 | 7778 (-) | 0.781 | 0.800 | acaaaaCAAAAat V$SOX_Q6 | 7790 (-) | 1.000 | 0.803 | ttatccCAAAGat V$SOX_Q6 | 7803 (-) | 0.877 | 0.818 | aaagaaCAACGac V$SOX_Q6 | 7820 (-) | 0.781 | 0.800 | ccaaaaCAAACaa V$SOX_Q6 | 8378 (+) | 1.000 | 0.905 | ctCTTTGatctgt V$SOX_Q6 | 8568 (+) | 1.000 | 0.847 | gtCTTTGattaaa V$SOX_Q6 | 8706 (-) | 0.877 | 0.784 | taccatCAAGGag V$SOX_Q6 | 8781 (-) | 1.000 | 0.862 | ccgagaCAAAGaa V$SOX_Q6 | 9025 (+) | 1.000 | 0.868 | atCTTTGtgagtt V$SOX_Q6 | 9109 (-) | 0.781 | 0.800 | gcaaaaCAAAAaa V$SOX_Q6 | 9532 (-) | 0.781 | 0.773 | ctagaaCAAACaa V$SOX_Q6 | 9725 (+) | 1.000 | 0.861 | ctCTTTGtatgat V$SOX_Q6 | 9893 (+) | 0.781 | 0.799 | ttCTTTTttaaag V$SOX_Q6 | 10457 (-) | 0.901 | 0.774 | tatcaaCAATGca V$SOX_Q6 | 10488 (-) | 0.877 | 0.825 | acatatCAAGGat V$SOX_Q6 | 10577 (+) | 1.000 | 0.776 | tgCTTTGgttcaa V$SOX_Q6 | 10970 (-) | 0.877 | 0.787 | ttggagCAAGGag V$SOX_Q6 | 11050 (+) | 1.000 | 0.861 | tcCTTTGttaacg P$PEND_01 | 7056 (-) | 1.000 | 0.892 | ttTTCTTa P$PEND_01 | 8642 (+) | 1.000 | 0.920 | tAAGAAga P$PEND_01 | 10055 (+) | 1.000 | 0.956 | tAAGAAct P$PEND_01 | 10136 (+) | 1.000 | 0.972 | tAAGAAat V$PBX1_03 | 72 (-) 
| 1.000 | 0.875 | tgttgTTGATgt V$PBX1_03 | 135 (+) | 0.895 | 0.851 | caGTCAAtccag V$PBX1_03 | 256 (-) | 0.884 | 0.820 | taaggTTGAAct V$PBX1_03 | 532 (-) | 0.840 | 0.844 | gatggGTGATgc V$PBX1_03 | 954 (+) | 1.000 | 0.898 | tgATCAAatatt V$PBX1_03 | 1034 (+) | 1.000 | 0.871 | gaATCAAcaaag V$PBX1_03 | 1052 (+) | 1.000 | 0.872 | aaATCAAcaact V$PBX1_03 | 1911 (-) | 0.900 | 0.819 | tttatTTGAGac V$PBX1_03 | 1979 (+) | 0.900 | 0.839 | gcCTCAAactca V$PBX1_03 | 2351 (+) | 0.900 | 0.836 | ttCTCAAacttg V$PBX1_03 | 2659 (-) | 1.000 | 0.839 | gcactTTGATga V$PBX1_03 | 3351 (+) | 0.854 | 0.858 | tcATCAGacaga V$PBX1_03 | 3430 (-) | 0.895 | 0.837 | cttaaTTGACtt V$PBX1_03 | 3704 (-) | 1.000 | 0.967 | gatgcTTGATca V$PBX1_03 | 3710 (+) | 1.000 | 0.840 | tgATCAAtattt V$PBX1_03 | 3932 (+) | 0.854 | 0.877 | ttATCAGtcact V$PBX1_03 | 3973 (-) | 0.847 | 0.852 | gctgtATGATag V$PBX1_03 | 4550 (-) | 1.000 | 0.819 | tagttTTGATgc V$PBX1_03 | 4692 (-) | 1.000 | 0.841 | gaaacTTGATgt V$PBX1_03 | 4837 (-) | 0.840 | 0.817 | aacgaGTGATcc V$PBX1_03 | 5138 (+) | 0.895 | 0.829 | cgGTCAAcctag V$PBX1_03 | 5481 (-) | 1.000 | 0.876 | attttTTGATcc V$PBX1_03 | 6100 (+) | 0.854 | 0.818 | ggATCAGtcctg V$PBX1_03 | 6436 (+) | 0.895 | 0.826 | cgGTCAAcccag V$PBX1_03 | 6920 (-) | 1.000 | 0.861 | aggaaTTGATtg V$PBX1_03 | 7371 (-) | 1.000 | 0.863 | aagaaTTGATat V$PBX1_03 | 7584 (+) | 0.895 | 0.890 | agGTCAAacaaa V$PBX1_03 | 7898 (-) | 1.000 | 0.821 | ttattTTGATat V$PBX1_03 | 7924 (+) | 1.000 | 0.861 | gtATCAAttcgt V$PBX1_03 | 8377 (-) | 1.000 | 0.895 | gctctTTGATct V$PBX1_03 | 8567 (-) | 1.000 | 0.892 | cgtctTTGATta V$PBX1_03 | 8614 (+) | 0.895 | 0.914 | ccGTCAAtcact V$PBX1_03 | 8663 (+) | 0.884 | 0.819 | ttTTCAAaccac V$PBX1_03 | 8708 (+) | 1.000 | 0.892 | ccATCAAggagt V$PBX1_03 | 8887 (+) | 1.000 | 0.923 | gaATCAAacgta V$PBX1_03 | 9715 (-) | 1.000 | 0.866 | taaaaTTGATct V$PBX1_03 | 9728 (-) | 0.847 | 0.850 | tttgtATGATaa V$PBX1_03 | 9879 (+) | 1.000 | 0.865 | aaATCAAtttaa V$PBX1_03 | 10456 (+) | 1.000 | 0.874 | 
atATCAAcaatg V$PBX1_03 | 10490 (+) | 1.000 | 0.890 | atATCAAggatc V$PBX1_03 | 10497 (+) | 1.000 | 0.826 | ggATCAAaagtc V$PBX1_03 | 10504 (+) | 0.895 | 0.837 | aaGTCAAttatg V$PBX1_03 | 10596 (+) | 1.000 | 0.877 | atATCAAaaagg V$PBX1_03 | 11192 (-) | 1.000 | 0.870 | gtttcTTGATtg V$PBX1_03 | 11196 (-) | 1.000 | 0.994 | cttgaTTGATta V$PBX1_03 | 11212 (-) | 0.884 | 0.881 | tttgtTTGAAct V$PBX1_03 | 11494 (-) | 0.884 | 0.826 | tgcggTTGAAtt V$PBX1_03 | 11543 (-) | 0.850 | 0.852 | tatggTAGATgg V$PBX1_03 | 11556 (+) | 0.884 | 0.874 | taTTCAAccaaa V$PBX1_03 | 11730 (+) | 0.900 | 0.893 | ttCTCAAacatc V$PBX1_03 | 11777 (-) | 1.000 | 0.897 | gatacTTGATgt V$PBX1_03 | 11892 (+) | 0.900 | 0.891 | ctCTCAAccaat V$TBX5_01 | 4140 (+) | 1.000 | 0.971 | ctaGGTGTggtg V$TBX5_01 | 8247 (+) | 1.000 | 0.960 | gaaGGTGTgttc V$TBX5_01 | 8749 (-) | 1.000 | 0.970 | gcccACACCttg V$TBX5_01 | 8970 (+) | 1.000 | 0.968 | ccaGGTGTggtg V$TBX5_01 | 8982 (-) | 1.000 | 0.961 | gtacACACCttt V$TBX5_02 | 1681 (+) | 1.000 | 0.968 | tcgGGTGTca V$TBX5_02 | 1814 (+) | 1.000 | 0.962 | ggtGGTGTtc V$TBX5_02 | 2811 (-) | 1.000 | 0.964 | gaACACCgtg V$TBX5_02 | 4140 (+) | 1.000 | 0.969 | ctaGGTGTgg V$TBX5_02 | 8235 (-) | 1.000 | 0.958 | ccACACCcag V$TBX5_02 | 8737 (+) | 1.000 | 0.960 | cggGGTGTgg V$TBX5_02 | 8751 (-) | 1.000 | 0.970 | ccACACCttg V$TBX5_02 | 8970 (+) | 1.000 | 0.970 | ccaGGTGTgg V$TBX5_02 | 9570 (+) | 1.000 | 0.958 | ctgGGTGTgg V$TBX5_02 | 9952 (-) | 1.000 | 0.979 | taACACCctg V$TBX5_02 | 10694 (-) | 1.000 | 0.964 | caACACCcgt V$TBX5_02 | 10941 (+) | 1.000 | 0.959 | gacGGTGTtg V$TBX5_02 | 11256 (+) | 1.000 | 0.965 | tgtGGTGTgc P$ID1_01 | 224 (+) | 0.855 | 0.888 | tTTTTCctttt P$ID1_01 | 233 (+) | 1.000 | 0.975 | tTTGTCttttt P$ID1_01 | 5637 (-) | 1.000 | 0.961 | ataaaGACAAa P$ID1_01 | 10157 (+) | 1.000 | 0.905 | gTTGTCgtttt P$ID1_01 | 11787 (-) | 1.000 | 0.911 | gtaagGACAAa V$LEF1_Q2_01 | 1166 (+) | 1.000 | 0.979 | tttCAAAGaa V$LEF1_Q2_01 | 2660 (-) | 1.000 | 0.999 | caCTTTGatg V$LEF1_Q2_01 | 6740 (-) | 
1.000 | 0.986 | ccCTTTGaaa V$LEF1_Q2_01 | 8378 (-) | 1.000 | 0.997 | ctCTTTGatc V$LEF1_Q2_01 | 8568 (-) | 1.000 | 0.988 | gtCTTTGatt V$LEF1_Q2_01 | 8622 (-) | 1.000 | 0.977 | caCTTTGaga V$LEF1_Q2_01 | 10352 (+) | 1.000 | 0.983 | attCAAAGtg V$LEF1_Q2_01 | 11378 (-) | 1.000 | 0.978 | tgCTTTGaaa V$HSF1_Q6 | 8874 (-) | 1.000 | 0.952 | atagaatattCTAGAat V$TFE_Q6 | 19 (+) | 1.000 | 0.987 | aCATGTga V$TFE_Q6 | 999 (+) | 1.000 | 0.999 | tCATGTgg V$TFE_Q6 | 2615 (+) | 1.000 | 0.982 | cCATGTgt V$TFE_Q6 | 2774 (+) | 1.000 | 0.984 | gCATGTga V$TFE_Q6 | 3642 (-) | 1.000 | 0.983 | ccACATGg V$TFE_Q6 | 5251 (-) | 1.000 | 0.987 | tcACATGt V$TFE_Q6 | 5253 (+) | 1.000 | 0.987 | aCATGTgg V$TFE_Q6 | 7113 (-) | 1.000 | 0.987 | ccACATGt V$TFE_Q6 | 7115 (+) | 1.000 | 0.986 | aCATGTgt V$TFE_Q6 | 9001 (-) | 1.000 | 0.983 | gcACATGg V$TFE_Q6 | 10890 (-) | 1.000 | 0.983 | gcACATGc F$RIM101_01 | 53 (-) | 1.000 | 0.994 | ttaCTTGG F$RIM101_01 | 1213 (-) | 1.000 | 0.997 | ttgCTTGG F$RIM101_01 | 3263 (-) | 1.000 | 1.000 | tttCTTGG F$RIM101_01 | 3690 (+) | 1.000 | 0.967 | CCAAGaac F$RIM101_01 | 4253 (-) | 1.000 | 0.994 | ttaCTTGG F$RIM101_01 | 4398 (+) | 1.000 | 0.950 | CCAAGaca F$RIM101_01 | 4521 (+) | 1.000 | 0.950 | CCAAGaca F$RIM101_01 | 4898 (-) | 1.000 | 0.992 | ttcCTTGG F$RIM101_01 | 11265 (+) | 1.000 | 1.000 | CCAAGaaa V$HNF4_Q6_01 | 2657 (-) | 1.000 | 0.891 | atgcaCTTTGatga V$HNF4_Q6_01 | 3566 (-) | 1.000 | 0.914 | gtggcCTTTGgcaa V$HNF4_Q6_01 | 8375 (-) | 1.000 | 0.941 | ctgctCTTTGatct V$HNF4_Q6_01 | 8582 (+) | 1.000 | 0.897 | ggggCAAAGgcaac V$EBOX_Q6_01 | 286 (-) | 0.980 | 0.973 | agtcAAGTGt V$EBOX_Q6_01 | 734 (-) | 1.000 | 0.972 | caccACGTGg V$EBOX_Q6_01 | 736 (+) | 1.000 | 0.973 | cCACGTggtt V$EBOX_Q6_01 | 1127 (+) | 1.000 | 0.957 | gCACGTgtgt V$EBOX_Q6_01 | 1359 (+) | 0.998 | 0.965 | tCACCTgatg V$EBOX_Q6_01 | 1391 (-) | 0.998 | 0.988 | gggcAGGTGa V$EBOX_Q6_01 | 1787 (+) | 0.998 | 0.984 | gCACCTgcca V$EBOX_Q6_01 | 2613 (-) | 0.989 | 0.977 | agccATGTGt V$EBOX_Q6_01 | 3642 (+) | 0.989 | 
0.964 | cCACATggtg V$EBOX_Q6_01 | 3681 (-) | 0.980 | 0.970 | tgccAAGTGc V$EBOX_Q6_01 | 4338 (-) | 0.998 | 0.958 | atccAGGTGg V$EBOX_Q6_01 | 4904 (-) | 0.980 | 0.974 | ggccAAGTGc V$EBOX_Q6_01 | 6065 (+) | 1.000 | 0.964 | gCACGTgctg V$EBOX_Q6_01 | 7113 (+) | 0.989 | 0.957 | cCACATgtgt V$EBOX_Q6_01 | 7390 (-) | 0.998 | 0.960 | gtccAGGTGg V$EBOX_Q6_01 | 8349 (+) | 0.980 | 0.979 | cCACTTgcca V$EBOX_Q6_01 | 8503 (-) | 0.998 | 0.988 | ggacAGGTGc V$EBOX_Q6_01 | 8968 (-) | 0.998 | 0.962 | aaccAGGTGt V$EBOX_Q6_01 | 11962 (+) | 0.998 | 0.959 | gCACCTgagc V$YY1_Q6_02 | 58 (+) | 1.000 | 0.932 | tggGCCATaat V$YY1_Q6_02 | 530 (-) | 0.968 | 0.918 | atgATGGGtga V$YY1_Q6_02 | 719 (-) | 0.972 | 0.949 | cagATGGTtgt V$YY1_Q6_02 | 799 (+) | 1.000 | 0.957 | tgaGCCATttc V$YY1_Q6_02 | 887 (-) | 0.968 | 0.930 | gaaATGGGaca V$YY1_Q6_02 | 1178 (-) | 0.968 | 0.937 | gagATGGGcac V$YY1_Q6_02 | 1663 (-) | 0.983 | 0.957 | caaATGGAgat V$YY1_Q6_02 | 1790 (+) | 1.000 | 0.919 | cctGCCATtca V$YY1_Q6_02 | 1884 (+) | 0.968 | 0.967 | cctCCCATttt V$YY1_Q6_02 | 2611 (+) | 1.000 | 0.958 | agaGCCATgtg V$YY1_Q6_02 | 2838 (-) | 1.000 | 0.963 | caaATGGCatc V$YY1_Q6_02 | 3150 (-) | 0.983 | 0.962 | tatATGGAggt V$YY1_Q6_02 | 3272 (+) | 1.000 | 0.935 | gatGCCATtat V$YY1_Q6_02 | 3301 (+) | 0.983 | 0.955 | atcTCCATgta V$YY1_Q6_02 | 3380 (-) | 0.968 | 0.935 | catATGGGgcc V$YY1_Q6_02 | 3643 (-) | 0.972 | 0.954 | cacATGGTgga V$YY1_Q6_02 | 3837 (+) | 0.983 | 0.917 | attTCCATtct V$YY1_Q6_02 | 3860 (+) | 0.972 | 0.934 | agtACCATtta V$YY1_Q6_02 | 3951 (+) | 1.000 | 0.938 | tcaGCCATcct V$YY1_Q6_02 | 3984 (+) | 0.972 | 0.938 | gacACCATctc V$YY1_Q6_02 | 4107 (+) | 0.983 | 0.980 | acaTCCATttt V$YY1_Q6_02 | 4152 (+) | 0.968 | 0.948 | cctCCCATttg V$YY1_Q6_02 | 4889 (+) | 0.983 | 0.966 | catTCCATgtt V$YY1_Q6_02 | 4932 (-) | 0.983 | 0.928 | atcATGGAgac V$YY1_Q6_02 | 4992 (-) | 1.000 | 0.921 | cgaATGGCaga V$YY1_Q6_02 | 5129 (-) | 0.972 | 0.939 | cagATGGTgcg V$YY1_Q6_02 | 5242 (-) | 0.968 | 0.931 | tacATGGGttc V$YY1_Q6_02 | 5290 (-) | 
0.968 | 0.934 | cagATGGGcta V$YY1_Q6_02 | 5308 (-) | 1.000 | 0.982 | aaaATGGCttc V$YY1_Q6_02 | 5444 (+) | 1.000 | 0.966 | attGCCATata V$YY1_Q6_02 | 5735 (+) | 1.000 | 0.933 | gaaGCCATaat V$YY1_Q6_02 | 5871 (+) | 0.972 | 0.932 | tggACCATctc V$YY1_Q6_02 | 6033 (+) | 0.968 | 0.931 | ggaCCCATctg V$YY1_Q6_02 | 6189 (+) | 0.968 | 0.945 | acaCCCATtta V$YY1_Q6_02 | 6259 (+) | 0.968 | 0.931 | tgtCCCATctg V$YY1_Q6_02 | 6447 (-) | 1.000 | 0.917 | gtcATGGCaaa V$YY1_Q6_02 | 6954 (+) | 0.983 | 0.926 | ctgTCCATtat V$YY1_Q6_02 | 6997 (-) | 0.983 | 0.950 | gagATGGAaaa V$YY1_Q6_02 | 7176 (-) | 0.972 | 0.935 | cagATGGTccc V$YY1_Q6_02 | 7359 (-) | 0.968 | 0.911 | atcATGGGcag V$YY1_Q6_02 | 7513 (-) | 1.000 | 0.962 | cagATGGCctg V$YY1_Q6_02 | 7609 (+) | 1.000 | 0.956 | tgaGCCATctc V$YY1_Q6_02 | 8577 (-) | 0.968 | 0.935 | taaATGGGgca V$YY1_Q6_02 | 9002 (-) | 0.983 | 0.964 | cacATGGAggc V$YY1_Q6_02 | 9054 (-) | 1.000 | 0.966 | tacATGGCaag V$YY1_Q6_02 | 9513 (+) | 0.968 | 0.940 | ttcCCCATgtc V$YY1_Q6_02 | 10104 (+) | 1.000 | 0.978 | tgaGCCATctt V$YY1_Q6_02 | 10385 (+) | 0.983 | 0.944 | caaTCCATata V$YY1_Q6_02 | 10510 (-) | 1.000 | 0.946 | attATGGCtgt V$YY1_Q6_02 | 10519 (+) | 1.000 | 0.940 | gtaGCCATtat V$YY1_Q6_02 | 10851 (+) | 1.000 | 0.924 | gaaGCCATtgt V$YY1_Q6_02 | 11343 (+) | 1.000 | 0.935 | aatGCCATtat V$YY1_Q6_02 | 11541 (-) | 0.972 | 0.950 | catATGGTaga V$YY1_Q6_02 | 11548 (-) | 0.968 | 0.937 | tagATGGGtat V$YY1_Q6_02 | 11721 (+) | 0.972 | 0.975 | tccACCATttt V$COUPTF_Q6 | 3562 (+) | 0.927 | 0.857 | ctgtgTGGCCtttggcaagttat V$COUPTF_Q6 | 6387 (+) | 0.927 | 0.846 | cgaacTGGCCtgtgttcagaacg V$COUPTF_Q6 | 9626 (-) | 1.000 | 0.845 | atcgggagttcaaGGTCAgcttg V$GLI_Q2 | 10238 (+) | 1.000 | 0.924 | ggttGGTGGtca V$NKX25_Q5 | 10470 (+) | 1.000 | 0.963 | tctCACTTta V$AP2ALPHA_02 | 98 (+) | 1.000 | 0.946 | ttCGCCTgggggtgg V$AP2ALPHA_02 | 1331 (-) | 0.946 | 0.921 | actccccaGGGCTta V$AP2ALPHA_02 | 3566 (+) | 0.979 | 0.906 | gtGGCCTttggcaag V$AP2ALPHA_02 | 3566 (-) | 0.911 | 0.912 | 
gtggccttTGGCAag V$AP2ALPHA_02 | 6144 (+) | 0.991 | 0.926 | caTGCCTctggctct V$AP2ALPHA_02 | 6144 (-) | 0.908 | 0.915 | catgcctcTGGCTct V$AP2ALPHA_02 | 7189 (+) | 0.898 | 0.901 | acGGCCAcgggggtg V$AP2ALPHA_02 | 9238 (+) | 0.946 | 0.906 | agAGCCCtaggagag V$AP2ALPHA_02 | 9328 (+) | 0.989 | 0.888 | agAGCCTctggagga V$AP2ALPHA_02 | 9922 (+) | 0.991 | 0.922 | ttTGCCTatgggtaa V$AP2ALPHA_03 | 98 (+) | 1.000 | 0.782 | ttCGCCTgggggtgg V$AP2ALPHA_03 | 98 (-) | 0.609 | 0.782 | ttcgcctgGGGGTgg V$AP2ALPHA_03 | 1331 (+) | 0.595 | 0.733 | acTCCCCagggctta V$AP2ALPHA_03 | 1331 (-) | 0.890 | 0.733 | actccccaGGGCTta V$AP2ALPHA_03 | 1338 (+) | 0.673 | 0.815 | agGGCTTaaggcttg V$AP2ALPHA_03 | 1338 (-) | 0.980 | 0.815 | agggcttaAGGCTtg V$AP2ALPHA_03 | 1349 (+) | 0.966 | 0.760 | ctTGCCTgggtcacc V$AP2ALPHA_03 | 1349 (-) | 0.595 | 0.760 | cttgcctgGGTCAcc V$AP2ALPHA_03 | 1373 (+) | 0.586 | 0.725 | ttTGCAGtgggcgag V$AP2ALPHA_03 | 1373 (-) | 0.910 | 0.725 | tttgcagtGGGCGag V$AP2ALPHA_03 | 1762 (+) | 0.980 | 0.747 | taAGCCTctcgcctg V$AP2ALPHA_03 | 1762 (-) | 0.562 | 0.747 | taagcctcTCGCCtg V$AP2ALPHA_03 | 2477 (+) | 0.864 | 0.729 | aaGGCCCtgggttcg V$AP2ALPHA_03 | 2477 (-) | 0.609 | 0.729 | aaggccctGGGTTcg V$AP2ALPHA_03 | 3530 (+) | 0.864 | 0.758 | ttGGCCCgaggtttg V$AP2ALPHA_03 | 3530 (-) | 0.699 | 0.758 | ttggcccgAGGTTtg V$AP2ALPHA_03 | 3566 (+) | 0.954 | 0.878 | gtGGCCTttggcaag V$AP2ALPHA_03 | 3566 (-) | 0.855 | 0.878 | gtggccttTGGCAag V$AP2ALPHA_03 | 4184 (+) | 0.599 | 0.764 | tgAGCAGaaggctgc V$AP2ALPHA_03 | 4184 (-) | 0.980 | 0.764 | tgagcagaAGGCTgc V$AP2ALPHA_03 | 5197 (+) | 0.685 | 0.741 | ttTGTCTgtggcctc V$AP2ALPHA_03 | 5197 (-) | 0.843 | 0.741 | tttgtctgTGGCCtc V$AP2ALPHA_03 | 5886 (+) | 0.673 | 0.746 | tgGGCATggggcccg V$AP2ALPHA_03 | 5886 (-) | 0.864 | 0.746 | tgggcatgGGGCCcg V$AP2ALPHA_03 | 6071 (+) | 0.855 | 0.770 | gcTGCCAcagtctct V$AP2ALPHA_03 | 6071 (-) | 0.699 | 0.770 | gctgccacAGTCTct V$AP2ALPHA_03 | 6144 (+) | 0.966 | 0.905 | caTGCCTctggctct V$AP2ALPHA_03 | 6144 (-) | 0.869 | 0.905 
| catgcctcTGGCTct V$AP2ALPHA_03 | 6239 (+) | 0.685 | 0.772 | tcTGGCTacggcttt V$AP2ALPHA_03 | 6239 (-) | 0.881 | 0.772 | tctggctaCGGCTtt V$AP2ALPHA_03 | 6418 (+) | 0.719 | 0.785 | gaCTCCTctggcttt V$AP2ALPHA_03 | 6418 (-) | 0.869 | 0.785 | gactcctcTGGCTtt V$AP2ALPHA_03 | 6863 (+) | 0.609 | 0.771 | ccAGACCgaggctta V$AP2ALPHA_03 | 6863 (-) | 0.980 | 0.771 | ccagaccgAGGCTta V$AP2ALPHA_03 | 7189 (+) | 0.843 | 0.731 | acGGCCAcgggggtg V$AP2ALPHA_03 | 7189 (-) | 0.629 | 0.731 | acggccacGGGGGtg V$AP2ALPHA_03 | 8159 (+) | 0.574 | 0.751 | gcTTCCAgaggcagg V$AP2ALPHA_03 | 8159 (-) | 0.966 | 0.751 | gcttccagAGGCAgg V$AP2ALPHA_03 | 8367 (+) | 0.980 | 0.782 | gcAGCCTgctgctct V$AP2ALPHA_03 | 8367 (-) | 0.599 | 0.782 | gcagcctgCTGCTct V$AP2ALPHA_03 | 8581 (+) | 0.562 | 0.740 | tgGGGCAaaggcaac V$AP2ALPHA_03 | 8581 (-) | 0.966 | 0.740 | tggggcaaAGGCAac V$AP2ALPHA_03 | 8862 (+) | 0.699 | 0.751 | taAGACTctggcata V$AP2ALPHA_03 | 8862 (-) | 0.855 | 0.751 | taagactcTGGCAta V$AP2ALPHA_03 | 9238 (+) | 0.890 | 0.795 | agAGCCCtaggagag V$AP2ALPHA_03 | 9238 (-) | 0.719 | 0.795 | agagccctAGGAGag V$AP2ALPHA_03 | 9328 (+) | 0.980 | 0.785 | agAGCCTctggagga V$AP2ALPHA_03 | 9328 (-) | 0.608 | 0.785 | agagcctcTGGAGga V$AP2ALPHA_03 | 9439 (+) | 0.890 | 0.767 | gaAGCCCaaggttta V$AP2ALPHA_03 | 9439 (-) | 0.699 | 0.767 | gaagcccaAGGTTta V$AP2ALPHA_03 | 9922 (+) | 0.966 | 0.751 | ttTGCCTatgggtaa V$AP2ALPHA_03 | 9922 (-) | 0.588 | 0.751 | tttgcctaTGGGTaa V$AP2ALPHA_03 | 10557 (+) | 0.881 | 0.726 | acAGCCGgcggtaca V$AP2ALPHA_03 | 10557 (-) | 0.586 | 0.726 | acagccggCGGTAca V$AP2ALPHA_03 | 10683 (+) | 0.673 | 0.744 | caGGACTctggcaac V$AP2ALPHA_03 | 10683 (-) | 0.855 | 0.744 | caggactcTGGCAac V$AP2ALPHA_03 | 10799 (+) | 0.980 | 0.761 | tgAGCCTtgtgcatc V$AP2ALPHA_03 | 10799 (-) | 0.595 | 0.761 | tgagccttGTGCAtc V$AP2ALPHA_03 | 11281 (+) | 0.673 | 0.764 | aaGGCATagggcaaa V$AP2ALPHA_03 | 11281 (-) | 0.876 | 0.764 | aaggcataGGGCAaa V$AP2ALPHA_03 | 11812 (+) | 0.864 | 0.716 | gtGGCCCagggttgg V$AP2ALPHA_03 | 11812 (-) 
| 0.609 | 0.716 | gtggcccaGGGTTgg V$AP2ALPHA_03 | 11910 (+) | 0.673 | 0.759 | aaGGCTTcgggccca V$AP2ALPHA_03 | 11910 (-) | 0.864 | 0.759 | aaggcttcGGGCCca V$AP2ALPHA_03 | 11917 (+) | 0.864 | 0.744 | cgGGCCCaacgccca V$AP2ALPHA_03 | 11917 (-) | 0.673 | 0.744 | cgggcccaACGCCca N$TRA1_01 | 106 (+) | 0.998 | 0.902 | ggGGTGGc N$TRA1_01 | 192 (-) | 1.000 | 0.941 | gCCTCCca N$TRA1_01 | 201 (-) | 0.998 | 0.881 | aCCACCat N$TRA1_01 | 1489 (+) | 1.000 | 0.912 | agGGAGGg N$TRA1_01 | 1493 (+) | 0.985 | 0.900 | agGGGGGg N$TRA1_01 | 1494 (+) | 0.985 | 0.892 | ggGGGGGg N$TRA1_01 | 1495 (+) | 0.985 | 0.892 | ggGGGGGg N$TRA1_01 | 1496 (+) | 0.985 | 0.892 | ggGGGGGg N$TRA1_01 | 1497 (+) | 0.985 | 0.879 | ggGGGGGa N$TRA1_01 | 1500 (+) | 1.000 | 0.904 | ggGGAGGg N$TRA1_01 | 1504 (+) | 1.000 | 0.912 | agGGAGGg N$TRA1_01 | 1508 (+) | 1.000 | 0.912 | agGGAGGg N$TRA1_01 | 1512 (+) | 1.000 | 0.912 | agGGAGGg N$TRA1_01 | 1516 (+) | 1.000 | 0.971 | agGGAGGt N$TRA1_01 | 1812 (+) | 0.998 | 0.969 | agGGTGGt N$TRA1_01 | 1872 (+) | 1.000 | 0.899 | agGGAGGa N$TRA1_01 | 1883 (-) | 1.000 | 0.941 | gCCTCCca N$TRA1_01 | 3041 (-) | 0.998 | 0.895 | tCCACCcg N$TRA1_01 | 3153 (+) | 1.000 | 0.882 | atGGAGGt N$TRA1_01 | 3183 (-) | 0.985 | 0.887 | tCCCCCct N$TRA1_01 | 3637 (-) | 0.856 | 0.887 | aCCTACca N$TRA1_01 | 3781 (-) | 0.985 | 0.887 | tCCCCCct N$TRA1_01 | 4151 (-) | 1.000 | 0.941 | gCCTCCca N$TRA1_01 | 4862 (+) | 1.000 | 0.971 | agGGAGGt N$TRA1_01 | 5897 (-) | 0.981 | 0.888 | cCCGCCcc N$TRA1_01 | 6802 (-) | 1.000 | 0.891 | tCCTCCcc N$TRA1_01 | 7701 (+) | 1.000 | 1.000 | tgGGAGGt N$TRA1_01 | 8118 (-) | 0.998 | 0.910 | cCCACCct N$TRA1_01 | 8761 (-) | 1.000 | 0.928 | tCCTCCca N$TRA1_01 | 8774 (+) | 1.000 | 0.941 | tgGGAGGc N$TRA1_01 | 9312 (+) | 1.000 | 0.912 | agGGAGGc N$TRA1_01 | 9619 (+) | 1.000 | 0.928 | tgGGAGGa N$TRA1_01 | 9684 (+) | 0.985 | 0.900 | agGGGGGg N$TRA1_01 | 9685 (+) | 0.985 | 0.951 | ggGGGGGt N$TRA1_01 | 9688 (+) | 0.998 | 0.961 | ggGGTGGt N$TRA1_01 | 10013 (+) | 1.000 | 0.880 | ctGGAGGt 
N$TRA1_01 | 10240 (+) | 0.998 | 0.910 | ttGGTGGt N$TRA1_01 | 10614 (+) | 1.000 | 0.941 | tgGGAGGg N$TRA1_01 | 10739 (+) | 0.985 | 0.988 | tgGGGGGt N$TRA1_01 | 11844 (+) | 0.985 | 0.988 | tgGGGGGt N$TRA1_01 | 11847 (+) | 0.998 | 0.902 | ggGGTGGg N$TRA1_01 | 11872 (+) | 1.000 | 0.904 | ggGGAGGg N$TRA1_02 | 1516 (+) | 0.979 | 0.920 | aggGAGGT N$TRA1_02 | 1812 (+) | 1.000 | 0.936 | aggGTGGT N$TRA1_02 | 4441 (-) | 1.000 | 0.957 | ACCACaca N$TRA1_02 | 4862 (+) | 0.979 | 0.920 | aggGAGGT N$TRA1_02 | 7701 (+) | 0.979 | 0.984 | tggGAGGT N$TRA1_02 | 9190 (-) | 1.000 | 0.930 | ACCACtca N$TRA1_02 | 9685 (+) | 0.975 | 0.917 | gggGGGGT N$TRA1_02 | 9688 (+) | 1.000 | 0.936 | gggGTGGT N$TRA1_02 | 10240 (+) | 1.000 | 0.921 | ttgGTGGT N$TRA1_02 | 10739 (+) | 0.975 | 0.981 | tggGGGGT N$TRA1_02 | 11844 (+) | 0.975 | 0.981 | tggGGGGT P$ARR10_01 | 707 (-) | 1.000 | 1.000 | cgGATCT P$ARR10_01 | 1072 (+) | 0.979 | 0.960 | AGATGtg P$ARR10_01 | 1754 (-) | 1.000 | 0.934 | ctGATCT P$ARR10_01 | 1876 (-) | 1.000 | 0.978 | agGATCT P$ARR10_01 | 2781 (+) | 0.996 | 0.975 | AGATTct P$ARR10_01 | 2909 (-) | 0.979 | 0.960 | caCATCT P$ARR10_01 | 2973 (+) | 0.979 | 0.982 | AGATGcg P$ARR10_01 | 3098 (-) | 0.996 | 0.997 | cgAATCT P$ARR10_01 | 3298 (-) | 0.979 | 0.938 | aaCATCT P$ARR10_01 | 3394 (-) | 1.000 | 0.978 | agGATCT P$ARR10_01 | 3703 (+) | 0.979 | 0.960 | AGATGct P$ARR10_01 | 4121 (+) | 0.979 | 0.938 | AGATGtt P$ARR10_01 | 4135 (-) | 0.996 | 0.975 | agAATCT P$ARR10_01 | 5619 (-) | 0.979 | 0.938 | aaCATCT P$ARR10_01 | 6310 (-) | 1.000 | 0.934 | ctGATCT P$ARR10_01 | 6375 (+) | 0.996 | 0.975 | AGATTtg P$ARR10_01 | 6499 (+) | 0.979 | 0.982 | AGATGcg P$ARR10_01 | 6667 (+) | 0.996 | 0.953 | AGATTtt P$ARR10_01 | 6783 (-) | 1.000 | 0.978 | caGATCT P$ARR10_01 | 7153 (-) | 0.993 | 0.994 | cgTATCT P$ARR10_01 | 7559 (+) | 0.979 | 0.960 | AGATGct P$ARR10_01 | 8524 (+) | 0.979 | 0.960 | AGATGct P$ARR10_01 | 8593 (-) | 0.979 | 0.938 | aaCATCT P$ARR10_01 | 9022 (-) | 1.000 | 0.978 | caGATCT P$ARR10_01 | 9023 (+) | 
1.000 | 0.956 | AGATCtt P$ARR10_01 | 9500 (-) | 0.996 | 0.953 | aaAATCT P$ARR10_01 | 10366 (-) | 0.996 | 0.975 | agAATCT P$ARR10_01 | 10712 (-) | 0.979 | 0.938 | aaCATCT P$ARR10_01 | 11028 (+) | 1.000 | 0.978 | AGATCct P$ARR10_01 | 11118 (+) | 1.000 | 0.934 | AGATCcc P$ARR10_01 | 11173 (+) | 1.000 | 0.956 | AGATCtt P$ARR10_01 | 11736 (-) | 0.979 | 0.938 | aaCATCT P$ARR10_01 | 11776 (+) | 0.993 | 0.972 | AGATAct F$MCM1_02 | 214 (-) | 0.977 | 0.741 | gTATTTtttttttttcctttttgtctt F$MCM1_02 | 223 (-) | 0.907 | 0.689 | tTTTTTcctttttgtctttttaaatct F$MCM1_02 | 474 (+) | 0.939 | 0.652 | gacacttacaagtcaacattgCAACAt F$MCM1_02 | 1046 (+) | 0.977 | 0.649 | cagcaaaaatcaacaactttaAAATAa F$MCM1_02 | 1155 (-) | 0.701 | 0.693 | aTGTAGttctttttcaaagaaaggaga F$MCM1_02 | 2212 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2216 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2220 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2224 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2228 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2232 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2236 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2240 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2244 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2248 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2252 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttctttc F$MCM1_02 | 2256 (-) | 0.907 | 0.655 | tTCTTTctttctttctttctttcttct F$MCM1_02 | 2260 (-) | 0.907 | 0.652 | tTCTTTctttctttctttcttctttct F$MCM1_02 | 2264 (-) | 0.907 | 0.673 | tTCTTTctttctttcttctttctcttc F$MCM1_02 | 2360 (-) | 0.762 | 0.674 | tTGTATttattttgccacagttttccc F$MCM1_02 | 2503 (+) | 0.682 | 0.671 | cgaaaaaaaagggaaaaaaaaTTACAt F$MCM1_02 | 2555 (-) | 0.939 | 0.693 | cTGTTGtgaaatttccgtaagtattta F$MCM1_02 | 3309 (-) | 0.696 | 0.655 | gTAGTTaagattggccttttcctgatt F$MCM1_02 | 3581 (-) | 0.885 | 0.661 | 
tTATTCaactctcctatgtttgggttt F$MCM1_02 | 3985 (+) | 0.669 | 0.650 | acaccatctcatgaaacaacgATAAAt F$MCM1_02 | 4038 (+) | 0.916 | 0.857 | attccagcagggggaaataggCAATAa F$MCM1_02 | 4415 (+) | 0.907 | 0.695 | caaccttgtctaaaaaaaaaaAAAAAa F$MCM1_02 | 4482 (+) | 1.000 | 0.698 | ttttcttttgaacaaaaaacaAAACAa F$MCM1_02 | 4490 (+) | 0.939 | 0.652 | tgaacaaaaaacaaaacaaaaCAACAt F$MCM1_02 | 4518 (+) | 0.939 | 0.699 | agaccaagacaacaaatgcgtCAACAg F$MCM1_02 | 4598 (+) | 0.719 | 0.672 | tgcccgcttgctggaaggtaaAACCAa F$MCM1_02 | 5382 (-) | 0.696 | 0.676 | tTAGTTttattttacctttatattttt F$MCM1_02 | 5414 (-) | 0.897 | 0.690 | tTATTAaatctttccctccactttatt F$MCM1_02 | 5442 (-) | 0.916 | 0.663 | gTATTGccatataaccttttccccaca F$MCM1_02 | 5625 (+) | 0.977 | 0.660 | taatacactgggataaagacaAAATAa F$MCM1_02 | 5631 (+) | 0.907 | 0.655 | actgggataaagacaaaataaAAAGAa F$MCM1_02 | 5647 (+) | 0.916 | 0.650 | aataaaaagaagaaaagaacaCAATAa F$MCM1_02 | 5757 (+) | 0.654 | 0.675 | ggacctgtagggtaaaaggagAGAAAa F$MCM1_02 | 5768 (+) | 0.977 | 0.650 | gtaaaaggagagaaaaatatgAAATAa F$MCM1_02 | 5776 (+) | 0.977 | 0.692 | agagaaaaatatgaaataattAAATAc F$MCM1_02 | 5815 (+) | 1.000 | 0.676 | tagccctggcgtggcactgtgAAACAa F$MCM1_02 | 6256 (-) | 0.654 | 0.672 | aTTTGTcccatctgccgaagaaggaag F$MCM1_02 | 6569 (+) | 1.000 | 0.696 | gccacagttatgcaaaacacaAAACAa F$MCM1_02 | 6607 (-) | 0.977 | 0.661 | cTATTTgctaaactccgtgagtggggc F$MCM1_02 | 6644 (+) | 0.816 | 0.664 | cacctcagtctgcaaacagatGAAGAt F$MCM1_02 | 7389 (+) | 0.816 | 0.665 | tgtccaggtggtagaattactGAAGAa F$MCM1_02 | 7431 (-) | 1.000 | 0.650 | tTGTTTttattttatgtgtgggagtgt F$MCM1_02 | 7641 (-) | 0.977 | 0.663 | tTATTTtaaaaatgcttttgtatgcca F$MCM1_02 | 7802 (+) | 1.000 | 0.648 | taaagaacaacgacaactccaAAACAa F$MCM1_02 | 7898 (-) | 0.977 | 0.663 | tTATTTtgatattagcttttacttagg F$MCM1_02 | 9514 (+) | 0.908 | 0.666 | tccccatgtccaacaagactaGAACAa F$MCM1_02 | 10315 (-) | 0.907 | 0.654 | gTTTTTatatttttattttatagtttt F$MCM1_02 | 10372 (+) | 0.669 | 0.691 | tttcaaaatgttgcaatccatATAAAa F$MCM1_02 | 10735 (-) | 
0.916 | 0.665 | tTATTGgggggtttcagtgatccgtgt F$MCM1_02 | 11279 (+) | 0.897 | 0.734 | ttaaggcatagggcaaagaaaTAATAa F$MCM1_02 | 11469 (-) | 0.907 | 0.669 | tTTTTTatttgtttttcccttggaatg F$MCM1_02 | 11505 (-) | 0.654 | 0.718 | tTTTGTggatattaccgttgtccgtag P$BHLH66_01 | 6065 (+) | 1.000 | 1.000 | gCACGTgc P$BHLH66_01 | 6065 (-) | 1.000 | 1.000 | gcACGTGc P$ERF2_01 | 7148 (-) | 0.899 | 0.924 | GGCGCcg P$ERF2_01 | 7623 (+) | 1.000 | 0.922 | agCCGCC P$ERF2_01 | 10560 (-) | 0.856 | 0.892 | GCCGGcg V$GFI1B_01 | 4721 (-) | 1.000 | 0.910 | cactgTGATTtg V$GFI1B_01 | 10260 (-) | 1.000 | 0.941 | ggaagTGATTtt P$AGL1_01 | 165 (+) | 0.924 | 0.881 | gttcCCAATgctggcatt P$AGL1_01 | 6266 (+) | 0.973 | 0.904 | tctgCCGAAgaaggaagc P$AGL1_01 | 7834 (-) | 0.974 | 0.880 | taacccactTCTGGcaaa P$AGL1_01 | 11534 (+) | 0.975 | 0.941 | aatcCCACAtatggtaga P$AGL1_01 | 11534 (-) | 0.991 | 0.930 | aatcccacaTATGGtaga P$AGL1_02 | 165 (+) | 0.991 | 0.890 | gTTCCCaatgctggcatt P$AGL1_02 | 165 (-) | 0.967 | 0.882 | gttcccaatgctGGCATt P$AGL1_02 | 4037 (+) | 0.965 | 0.880 | tATTCCagcagggggaaa P$AGL1_02 | 4037 (-) | 0.991 | 0.874 | tattccagcaggGGGAAa P$AGL1_02 | 6266 (+) | 0.960 | 0.905 | tCTGCCgaagaaggaagc P$AGL1_02 | 6266 (-) | 0.957 | 0.868 | tctgccgaagaaGGAAGc P$AGL1_02 | 7834 (-) | 1.000 | 0.874 | taacccacttctGGCAAa P$AGL1_02 | 11534 (+) | 0.959 | 0.935 | aATCCCacatatggtaga P$AGL1_02 | 11534 (-) | 0.959 | 0.932 | aatcccacatatGGTAGa P$AGL2_01 | 11534 (+) | 0.940 | 0.942 | aatcCCACAtatggtaga P$AGL2_01 | 11534 (-) | 1.000 | 0.942 | aatcccacaTATGGtaga P$AGL2_02 | 11534 (+) | 0.956 | 0.937 | aatcCCACAtatggtaga P$AGL2_02 | 11534 (-) | 1.000 | 0.941 | aatcccacaTATGGtaga P$AG_03 | 164 (+) | 0.967 | 0.775 | cgttCCCAAtgctggcat P$AG_03 | 165 (+) | 0.878 | 0.858 | gttcCCAATgctggcatt P$AG_03 | 1134 (+) | 0.994 | 0.776 | tgttCCTAAaaccgttac P$AG_03 | 1551 (-) | 0.875 | 0.790 | gaaaactctTCTGGaaat P$AG_03 | 2381 (+) | 0.875 | 0.768 | tttcCCAGAgctgagagt P$AG_03 | 2713 (+) | 0.967 | 0.813 | tcttCCCAAatagataat P$AG_03 | 2714 (+) 
| 1.000 | 0.809 | cttcCCAAAtagataatc P$AG_03 | 2785 (+) | 0.915 | 0.786 | tcttCCTTAgctggtctt P$AG_03 | 3638 (+) | 0.881 | 0.771 | cctaCCACAtggtggaat P$AG_03 | 3860 (+) | 0.798 | 0.780 | agtaCCATTtaaggagtt P$AG_03 | 4394 (+) | 0.840 | 0.795 | agttCCAAGacaggtcag P$AG_03 | 5009 (+) | 0.845 | 0.807 | attaCCCATtaagccacg P$AG_03 | 5984 (-) | 0.994 | 0.825 | acttctggaTTAGGgatg P$AG_03 | 6266 (+) | 0.971 | 0.846 | tctgCCGAAgaaggaagc P$AG_03 | 6369 (-) | 1.000 | 0.784 | taagcgagaTTTGGtaga P$AG_03 | 7573 (-) | 0.869 | 0.788 | aaatccccaTCAGGtcaa P$AG_03 | 7834 (-) | 0.875 | 0.856 | taacccactTCTGGcaaa P$AG_03 | 8135 (-) | 0.875 | 0.770 | gctttctgcTGAGGgaaa P$AG_03 | 8462 (-) | 0.915 | 0.785 | ggtctcatcTAAGGcaaa P$AG_03 | 9595 (-) | 1.000 | 0.851 | atcccagcaTTTGGgaag P$AG_03 | 11442 (-) | 1.000 | 0.795 | cttatctatTTTGGtgaa P$AG_03 | 11534 (+) | 0.881 | 0.858 | aatcCCACAtatggtaga P$AG_03 | 11534 (-) | 0.920 | 0.875 | aatcccacaTATGGtaga P$AG_03 | 11673 (+) | 1.000 | 0.774 | ataaCCAAAaaagaccaa P$AG_03 | 11795 (-) | 0.878 | 0.773 | aaatcccagATTGGaagg P$AGL3_03 | 3590 (-) | 0.998 | 0.879 | tctcctatgTTTGGgttt P$AGL3_03 | 11534 (+) | 0.959 | 0.923 | aatcCCACAtatggtaga P$AGL3_03 | 11534 (-) | 1.000 | 0.874 | aatcccacaTATGGtaga P$ABZ1_01 | 6884 (-) | 1.000 | 0.973 | gcacCACGTaagac V$GFI1_Q6 | 242 (+) | 0.984 | 0.979 | ttaAATCTgggct V$GFI1_Q6 | 1428 (+) | 1.000 | 0.977 | cgcAATCAgtggc V$GFI1_Q6 | 2419 (+) | 1.000 | 0.991 | aaaAATCAcaggg V$GFI1_Q6 | 3467 (-) | 1.000 | 0.975 | cctcaTGATTtac V$GFI1_Q6 | 4639 (+) | 0.986 | 0.986 | caaAATCCcggca V$GFI1_Q6 | 4721 (-) | 1.000 | 0.987 | cactgTGATTtgg V$GFI1_Q6 | 7355 (+) | 1.000 | 0.976 | gttAATCAtgggc V$GFI1_Q6 | 10260 (-) | 1.000 | 0.988 | ggaagTGATTttt V$GFI1_Q6 | 11695 (+) | 1.000 | 0.975 | gttAATCActgtc V$CMAF_01 | 168 (+) | 1.000 | 0.823 | cccaaTGCTGgcattacta V$CMAF_01 | 478 (-) | 0.785 | 0.805 | cttacaagtCAACAttgca V$CMAF_01 | 1037 (-) | 1.000 | 0.810 | tcaacaaagCAGCAaaaat V$CMAF_01 | 1107 (+) | 0.816 | 0.820 | gaaatCGCTGatttgtgta 
V$CMAF_01 | 1222 (+) | 0.799 | 0.827 | gtttcTGATGactgctttg V$CMAF_01 | 1241 (-) | 1.000 | 0.814 | acatcagggCAGCAttctg V$CMAF_01 | 1582 (-) | 1.000 | 0.828 | atggagagcCAGCActggg V$CMAF_01 | 1617 (-) | 1.000 | 0.821 | tctcgtggaCAGCAtgtcc V$CMAF_01 | 1680 (-) | 0.822 | 0.826 | ttcgggtgtCAGCTcaggg V$CMAF_01 | 1839 (-) | 0.785 | 0.818 | tggggtcgtCAACAcaata V$CMAF_01 | 2660 (+) | 0.799 | 0.823 | cacttTGATGacactctta V$CMAF_01 | 2763 (-) | 0.867 | 0.828 | atctagattCCGCAtgtga V$CMAF_01 | 2826 (-) | 0.871 | 0.867 | tccagttgtCTGCAaatgg V$CMAF_01 | 2927 (+) | 0.822 | 0.808 | cccttAGCTGattgcgaag V$CMAF_01 | 3011 (-) | 0.867 | 0.833 | caggcgtctCGGCAgtaca V$CMAF_01 | 3226 (-) | 0.803 | 0.831 | tttaccagtCAGCCaacat V$CMAF_01 | 3599 (-) | 0.871 | 0.831 | tttgggtttCTGCAgtttt V$CMAF_01 | 4400 (-) | 0.803 | 0.822 | aagacaggtCAGCCacaac V$CMAF_01 | 4530 (-) | 0.785 | 0.814 | caaatgcgtCAACAgcctc V$CMAF_01 | 4786 (+) | 1.000 | 0.979 | aagatTGCTGacttgcata V$CMAF_01 | 4861 (-) | 1.000 | 0.956 | gagggaggtCAGCAgcaaa V$CMAF_01 | 4948 (-) | 0.871 | 0.815 | tgggactttCTGCAgcagg V$CMAF_01 | 5041 (+) | 1.000 | 0.807 | aggatTGCTGtggtctaag V$CMAF_01 | 6065 (+) | 1.000 | 0.817 | gcacgTGCTGccacagtct V$CMAF_01 | 6163 (-) | 0.871 | 0.837 | cctcctcttCTGCAgggtt V$CMAF_01 | 6220 (-) | 0.871 | 0.831 | tctcacgctCTGCAtgctg V$CMAF_01 | 6229 (+) | 1.000 | 0.827 | ctgcaTGCTGtctggctac V$CMAF_01 | 6263 (+) | 0.867 | 0.832 | ccatcTGCCGaagaaggaa V$CMAF_01 | 6287 (+) | 0.799 | 0.814 | tctgaTGATGactgagcag V$CMAF_01 | 6491 (+) | 0.871 | 0.856 | tggatTGCAGatgcggcca V$CMAF_01 | 6644 (-) | 0.871 | 0.884 | cacctcagtCTGCAaacag V$CMAF_01 | 6691 (+) | 1.000 | 0.946 | cagagTGCTGaagtagaga V$CMAF_01 | 6771 (-) | 1.000 | 0.807 | tgaacaaaaCAGCAgatct V$CMAF_01 | 6787 (+) | 1.000 | 0.834 | tctaaTGCTGgctcttcct V$CMAF_01 | 7021 (-) | 0.822 | 0.816 | ttcctgcatCAGCTgaaga V$CMAF_01 | 7169 (+) | 0.871 | 0.839 | ggcgtTGCAGatggtcccg V$CMAF_01 | 7219 (-) | 1.000 | 0.819 | aaagtaggcCAGCAggaag V$CMAF_01 | 7601 (+) | 1.000 | 0.930 | 
tgaacTGCTGagccatctc V$CMAF_01 | 7714 (-) | 1.000 | 0.940 | cagaaggatCAGCAgtttg V$CMAF_01 | 8070 (+) | 0.871 | 0.838 | ggtatTGCAGaaacttcct V$CMAF_01 | 8136 (+) | 1.000 | 0.944 | ctttcTGCTGagggaaaac V$CMAF_01 | 8320 (-) | 0.816 | 0.836 | cttctgcgtCAGCGccagt V$CMAF_01 | 9073 (-) | 1.000 | 0.823 | tagccagggCAGCAtagtg V$CMAF_01 | 9286 (-) | 0.822 | 0.829 | gctgagagtCAGCTtgtgt V$CMAF_01 | 9379 (-) | 0.803 | 0.810 | gagagctgtCAGCCcctgg V$CMAF_01 | 9633 (-) | 0.822 | 0.819 | gttcaaggtCAGCTtgggc V$CMAF_01 | 9849 (-) | 1.000 | 0.824 | tcaggcaggCAGCAttaaa V$CMAF_01 | 10181 (-) | 1.000 | 0.916 | gtgtgtgttCAGCAgtcag V$CMAF_01 | 10719 (+) | 1.000 | 0.809 | ctgcgTGCTGtagctttta V$CMAF_01 | 11163 (+) | 0.871 | 0.835 | gaagcTGCAGagatctttt V$CMAF_01 | 11325 (+) | 0.799 | 0.824 | gatgaTGATGacgatgtaa V$CMAF_01 | 11406 (-) | 0.871 | 0.831 | gcatttattCTGCAtttct V$CMAF_01 | 11426 (-) | 0.867 | 0.837 | tttagctatCCGCAtactt V$CMAF_01 | 11827 (-) | 1.000 | 0.807 | ggtgagaaaCAGCAgagtg V$HIC1_02 | 697 (-) | 1.000 | 0.951 | aagaGGGCAtcggat V$HIC1_02 | 1179 (-) | 1.000 | 0.956 | agatGGGCAccccca V$HIC1_02 | 4592 (+) | 1.000 | 0.969 | tcctccTGCCCgctt V$HIC1_02 | 5883 (-) | 1.000 | 0.957 | tgctGGGCAtggggc V$HIC1_02 | 6626 (-) | 1.000 | 0.962 | agtgGGGCAcccaga V$HIC1_02 | 8277 (+) | 1.000 | 0.966 | tgcagcTGCCCtggc V$HIC1_02 | 8304 (+) | 1.000 | 0.957 | cccctcTGCCCtgca V$HIC1_02 | 9011 (-) | 1.000 | 0.952 | gcagGGGCAggcaga V$HIC1_02 | 9037 (-) | 1.000 | 0.955 | tcaaGGGCAgcctgg V$HIC1_02 | 10077 (-) | 1.000 | 0.953 | ggaaGGGCAgccagt V$HIC1_02 | 11710 (+) | 1.000 | 0.955 | cccaacTGCCCtcca V$HIC1_03 | 1176 (-) | 1.000 | 0.943 | aggagatGGGCAccccca V$HIC1_03 | 6037 (-) | 0.975 | 0.929 | ccatctgGTGCAgccctg V$HIC1_03 | 6623 (-) | 1.000 | 0.954 | gtgagtgGGGCAcccaga V$HIC1_03 | 8358 (-) | 0.975 | 0.928 | aacccctGTGCAgcctgc V$HIC1_03 | 9034 (-) | 1.000 | 0.924 | agttcaaGGGCAgcctgg V$PLZF_02 | 231 (+) | 0.979 | 0.751 | tttttgtcttttTAAATctgggctgtaag V$PLZF_02 | 513 (+) | 0.978 | 0.737 | 
gtcttctgatttTAAAAatgatgggtgat V$PLZF_02 | 559 (+) | 0.976 | 0.868 | ttttctaggagaTAAACgtctacctgttg V$PLZF_02 | 626 (+) | 0.976 | 0.741 | gacactttttttTAAACatttattcattt V$PLZF_02 | 631 (-) | 0.979 | 0.742 | ttttttttaaacATTTAttcatttattat V$PLZF_02 | 639 (-) | 0.979 | 0.733 | aaacatttattcATTTAttatatataagt V$PLZF_02 | 812 (-) | 0.978 | 0.725 | cagcccgacactTTTTAatactatataaa V$PLZF_02 | 825 (+) | 0.978 | 0.738 | tttaatactataTAAAAcaagttaaataa V$PLZF_02 | 951 (-) | 0.979 | 0.766 | caatgatcaaatATTTAcaggacccctaa V$PLZF_02 | 1050 (-) | 1.000 | 0.734 | aaaaatcaacaaCTTTAaaataagatgtg V$PLZF_02 | 1053 (+) | 0.978 | 0.711 | aatcaacaacttTAAAAtaagatgtgtag V$PLZF_02 | 1085 (+) | 0.978 | 0.790 | attttggtcaacTAAAAtttgagaaatcg V$PLZF_02 | 1879 (-) | 0.978 | 0.707 | atctgcctcccaTTTTAtatttattttat V$PLZF_02 | 1898 (-) | 0.978 | 0.708 | tttattttattaTTTTAtttgagacagag V$PLZF_02 | 2427 (-) | 0.979 | 0.729 | caggggttggggATTTAgctcagtggtag V$PLZF_02 | 2565 (-) | 0.979 | 0.748 | atttccgtaagtATTTAcaacaaatgcag V$PLZF_02 | 2845 (-) | 1.000 | 0.727 | catcattccgtcCTTTAgagctgaacaga V$PLZF_02 | 2890 (-) | 0.978 | 0.716 | gtactgcatcttTTTTAtccacatctctg V$PLZF_02 | 3119 (-) | 0.979 | 0.783 | gcggctatattaATTTAcattcccacaac V$PLZF_02 | 3213 (-) | 0.978 | 0.720 | ctaggcaagcacTTTTAccagtcagccaa V$PLZF_02 | 3396 (+) | 0.978 | 0.714 | gatctccagttaTAAAAgttaatgattct V$PLZF_02 | 3462 (-) | 0.979 | 0.719 | cagggcctcatgATTTActgcctatgagg V$PLZF_02 | 3706 (-) | 0.979 | 0.774 | tgcttgatcaatATTTAcgggtgactaaa V$PLZF_02 | 3719 (+) | 0.978 | 0.761 | tttacgggtgacTAAAAttattatttcta V$PLZF_02 | 3739 (-) | 0.978 | 0.746 | ttatttctacatTTTTAgtcattcttaaa V$PLZF_02 | 3822 (+) | 0.979 | 0.767 | ttcaattcataaTAAATttccattctcat V$PLZF_02 | 3901 (+) | 1.000 | 0.717 | cacaggtattaaTAAAGctttatgaatta V$PLZF_02 | 3906 (-) | 1.000 | 0.764 | gtattaataaagCTTTAtgaattacatta V$PLZF_02 | 3995 (+) | 0.979 | 0.748 | atgaaacaacgaTAAATaaatattctgct V$PLZF_02 | 4054 (+) | 0.978 | 0.711 | ataggcaataatTAAAAtatgtaaaatgt V$PLZF_02 | 4079 (-) | 0.978 | 0.746 | 
atgtatactataTTTTActgattctaaaa V$PLZF_02 | 4091 (+) | 0.978 | 0.815 | ttttactgattcTAAAAcatccattttcc V$PLZF_02 | 4115 (+) | 0.978 | 0.735 | tttcccagatgtTAAAAcccagaatctag V$PLZF_02 | 4208 (+) | 0.978 | 0.719 | gcttgcctcagtTAAAAaaataaattctc V$PLZF_02 | 4284 (+) | 0.979 | 0.725 | cacattagacccTAAATttgaccctcaat V$PLZF_02 | 4307 (+) | 0.978 | 0.815 | ctcaatactgcaTAAAActgggtctggtg V$PLZF_02 | 4604 (+) | 0.978 | 0.749 | cttgctggaaggTAAAAccaatatccttt V$PLZF_02 | 4772 (+) | 1.000 | 0.740 | agcgagtctggaTAAAGattgctgacttg V$PLZF_02 | 5368 (-) | 0.978 | 0.735 | acacgggatttcTTTTAgttttattttac V$PLZF_02 | 5374 (-) | 0.978 | 0.789 | gatttcttttagTTTTAttttacctttat V$PLZF_02 | 5385 (-) | 1.000 | 0.732 | gttttattttacCTTTAtattttttcatc V$PLZF_02 | 5406 (+) | 0.979 | 0.768 | ttttcatcttatTAAATctttccctccac V$PLZF_02 | 5411 (-) | 0.756 | 0.730 | atcttattaaatCTTTCcctccactttat V$PLZF_02 | 5781 (-) | 0.735 | 0.716 | aaaatatgaaatAATTAaatacaaaaaaa V$PLZF_02 | 5936 (+) | 0.978 | 0.789 | cttggagaaaagTAAAAttttatttgcca V$PLZF_02 | 5941 (-) | 0.978 | 0.753 | agaaaagtaaaaTTTTAtttgccagtgtt V$PLZF_02 | 6662 (-) | 0.979 | 0.787 | gatgaagattttATTTAgcttttgaagag V$PLZF_02 | 6711 (-) | 1.000 | 0.765 | aaaagccacaacCTTTAaagggtgagcag V$PLZF_02 | 6714 (+) | 1.000 | 0.738 | agccacaaccttTAAAGggtgagcagccc V$PLZF_02 | 6749 (-) | 1.000 | 0.776 | aacccactatccCTTTAtaccatgaacaa V$PLZF_02 | 6815 (-) | 0.978 | 0.750 | accttctcccttTTTTAtgtgctatttac V$PLZF_02 | 7049 (+) | 0.735 | 0.725 | ctattttttttcTTAATttgtttttgttt V$PLZF_02 | 7262 (+) | 0.978 | 0.746 | attaaaatatccTAAAAtaagacccatag V$PLZF_02 | 7405 (-) | 0.978 | 0.714 | ttactgaagaacTTTTAaaggtttatttg V$PLZF_02 | 7408 (+) | 1.000 | 0.868 | ctgaagaactttTAAAGgtttatttgttt V$PLZF_02 | 7413 (-) | 0.976 | 0.765 | gaacttttaaagGTTTAtttgtttttatt V$PLZF_02 | 7521 (+) | 0.756 | 0.751 | ctgaactgaagtTACAGttggttgtgagc V$PLZF_02 | 7635 (+) | 0.978 | 0.796 | gtagaattatttTAAAAatgcttttgtat V$PLZF_02 | 7776 (-) | 0.735 | 0.742 | aaacaaaacaaaAATTAtcccaaagataa V$PLZF_02 | 7790 (+) | 1.000 | 0.749 
| ttatcccaaagaTAAAGaacaacgacaac V$PLZF_02 | 7884 (-) | 0.978 | 0.726 | aagtcatttggtTTTTAttttgatattag V$PLZF_02 | 7902 (-) | 0.978 | 0.740 | tttgatattagcTTTTActtaggtatcaa V$PLZF_02 | 8927 (+) | 1.000 | 0.728 | ttcagtataaacTAAAGcaactttttaaa V$PLZF_02 | 8978 (-) | 1.000 | 0.748 | ggtggtacacacCTTTAattccagcacat V$PLZF_02 | 9437 (-) | 0.976 | 0.717 | aggaagcccaagGTTTAccaatctctgct V$PLZF_02 | 9473 (-) | 1.000 | 0.757 | atgttctagctaCTTTAtttgcaatagaa V$PLZF_02 | 9703 (+) | 0.978 | 0.765 | gtggtggtggtgTAAAAttgatctctttg V$PLZF_02 | 9785 (+) | 0.979 | 0.718 | aattttgttcagTAAATgtacatcacata V$PLZF_02 | 9814 (-) | 0.978 | 0.738 | agtaacttcaaaTTTTAgaagggataatt V$PLZF_02 | 9829 (-) | 0.978 | 0.712 | tagaagggataaTTTTAaagtcaggcagg V$PLZF_02 | 9886 (-) | 0.978 | 0.719 | tttaaaattcttTTTTAaagacttactta V$PLZF_02 | 9889 (+) | 1.000 | 0.743 | aaaattctttttTAAAGacttacttactg V$PLZF_02 | 10114 (+) | 0.978 | 0.707 | tgccatccccaaTAAAAttagttaagaaa V$PLZF_02 | 10295 (-) | 0.978 | 0.722 | gcttcggacattTTTTAactgtttttata V$PLZF_02 | 10308 (+) | 0.735 | 0.742 | tttaactgttttTATATttttattttata V$PLZF_02 | 10321 (+) | 0.756 | 0.742 | atatttttatttTATAGtttttattgttt V$PLZF_02 | 10382 (+) | 0.978 | 0.793 | ttgcaatccataTAAAAatgaacgaggtg V$PLZF_02 | 10463 (-) | 1.000 | 0.730 | caatgcatctcaCTTTAgaccagccacat V$PLZF_02 | 10535 (+) | 1.000 | 0.729 | gccgtgcaactcTAAAGtatgtacagccg V$PLZF_02 | 11089 (-) | 0.976 | 0.717 | gaattttccactGTTTAaagaacgacttc V$PLZF_02 | 11092 (+) | 1.000 | 0.733 | ttttccactgttTAAAGaacgacttcaga V$PLZF_02 | 11382 (+) | 0.979 | 0.785 | ttgaaaacttatTAAATtttgtacgcatt V$PLZF_02 | 11628 (+) | 0.979 | 0.729 | aggaccctcgttTAAATgtatttctgtac V$CETS1P54_03 | 606 (-) | 0.917 | 0.919 | ttaaacATCCTgtcgt V$CETS1P54_03 | 702 (+) | 0.988 | 0.884 | ggcatCGGATctcttt V$CETS1P54_03 | 1259 (-) | 0.988 | 0.905 | gtcatcATCCGtgtgt V$CETS1P54_03 | 1455 (+) | 0.926 | 0.923 | tgagcCGGACgtttct V$CETS1P54_03 | 1716 (-) | 1.000 | 0.916 | tgccttTTCCGggctc V$CETS1P54_03 | 1726 (-) | 0.917 | 0.880 | gggctcATCCTgaact 
V$CETS1P54_03 | 1735 (-) | 0.929 | 0.896 | ctgaacTTCCTataga V$CETS1P54_03 | 2151 (-) | 0.929 | 0.880 | ccttccTTCCTcttcc V$CETS1P54_03 | 2561 (-) | 1.000 | 0.924 | tgaaatTTCCGtaagt V$CETS1P54_03 | 2682 (-) | 0.917 | 0.889 | cgttgcATCCTggtca V$CETS1P54_03 | 2985 (-) | 1.000 | 0.968 | atctacTTCCGctcct V$CETS1P54_03 | 3029 (-) | 1.000 | 0.975 | accgacTTCCGctcca V$CETS1P54_03 | 4685 (+) | 0.929 | 0.882 | gtggcAGGAAacttga V$CETS1P54_03 | 4959 (+) | 0.929 | 0.928 | gcagcAGGAAgtcagg V$CETS1P54_03 | 5469 (-) | 0.929 | 0.890 | tactatTTCCTgattt V$CETS1P54_03 | 5728 (+) | 0.886 | 0.880 | taaacTGGAAgccata V$CETS1P54_03 | 6272 (+) | 0.929 | 0.882 | gaagaAGGAAgcctct V$CETS1P54_03 | 6457 (-) | 0.929 | 0.894 | acttatTTCCTgataa V$CETS1P54_03 | 6874 (+) | 0.929 | 0.883 | cttatAGGAAgcacca V$CETS1P54_03 | 7009 (+) | 0.988 | 0.950 | gcaggCGGATgtttcc V$CETS1P54_03 | 7227 (+) | 0.929 | 0.923 | ccagcAGGAAgttcat V$CETS1P54_03 | 8078 (-) | 0.929 | 0.901 | agaaacTTCCTataca V$CETS1P54_03 | 8917 (+) | 1.000 | 0.881 | atactCGGAAttcagt V$CETS1P54_03 | 9432 (+) | 0.929 | 0.912 | ataacAGGAAgcccaa V$CETS1P54_03 | 9465 (+) | 0.917 | 0.924 | tgtacAGGATgttcta V$CETS1P54_03 | 10254 (+) | 0.929 | 0.929 | tgtgcAGGAAgtgatt V$CETS1P54_03 | 10841 (+) | 0.929 | 0.893 | atcgcAGGAAgaagcc V$CETS1P54_03 | 11427 (-) | 0.988 | 0.880 | ttagctATCCGcatac V$CBF_01 | 789 (-) | 1.000 | 0.994 | tcttaACCGCtgagcc V$CBF_01 | 2766 (-) | 0.994 | 0.976 | tagatTCCGCatgtga V$CBF_01 | 2987 (-) | 0.994 | 0.982 | ctactTCCGCtcctcc V$CBF_01 | 3031 (-) | 0.994 | 0.982 | cgactTCCGCtccacc V$CBF_01 | 3113 (+) | 0.981 | 0.979 | ttccatGCGGCtatat V$CBF_01 | 4436 (-) | 0.978 | 0.975 | aaaaaACCACacaatt V$CBF_01 | 4567 (+) | 1.000 | 0.993 | acagttGCGGTtagat V$CBF_01 | 5131 (+) | 1.000 | 0.979 | gatggtGCGGTcaacc V$CBF_01 | 7007 (+) | 0.994 | 0.983 | aggcagGCGGAtgttt V$CBF_01 | 7626 (+) | 1.000 | 0.977 | cgcctgGCGGTagaat V$CBF_01 | 9395 (+) | 0.978 | 0.973 | tggaatGTGGTtctat V$CBF_01 | 10950 (+) | 1.000 | 0.978 | gtagctGCGGTctgtg V$CBF_01 | 11429 
(-) | 0.994 | 0.986 | agctaTCCGCatactt V$CBF_01 | 11489 (+) | 1.000 | 0.994 | tggaatGCGGTtgaat V$CBF_02 | 789 (-) | 1.000 | 0.967 | tcttaACCGCtgagcc V$CBF_02 | 931 (-) | 0.989 | 0.982 | acgaaACCACaagtta V$CBF_02 | 3113 (+) | 0.965 | 0.961 | ttccatGCGGCtatat V$CBF_02 | 3636 (-) | 0.989 | 0.963 | tacctACCACatggtg V$CBF_02 | 4140 (+) | 0.989 | 0.959 | ctaggtGTGGTgcctc V$CBF_02 | 4436 (-) | 0.989 | 0.981 | aaaaaACCACacaatt V$CBF_02 | 4567 (+) | 1.000 | 0.993 | acagttGCGGTtagat V$CBF_02 | 5044 (+) | 0.989 | 0.977 | attgctGTGGTctaag V$CBF_02 | 5131 (+) | 1.000 | 0.985 | gatggtGCGGTcaacc V$CBF_02 | 5251 (+) | 0.989 | 0.976 | tcacatGTGGTcacag V$CBF_02 | 7107 (-) | 0.989 | 0.967 | tgtgtACCACatgtgt V$CBF_02 | 8737 (+) | 0.989 | 0.960 | cggggtGTGGTggccc V$CBF_02 | 8893 (-) | 0.989 | 0.963 | aacgtACCACaaacag V$CBF_02 | 8970 (+) | 0.989 | 0.961 | ccaggtGTGGTggtac V$CBF_02 | 9395 (+) | 0.989 | 0.980 | tggaatGTGGTtctat V$CBF_02 | 9570 (+) | 0.989 | 0.965 | ctgggtGTGGTggtat V$CBF_02 | 10950 (+) | 1.000 | 0.982 | gtagctGCGGTctgtg V$CBF_02 | 11251 (+) | 0.989 | 0.966 | aactctGTGGTgtgcc V$CBF_02 | 11429 (-) | 0.965 | 0.960 | agctaTCCGCatactt V$CBF_02 | 11489 (+) | 1.000 | 0.991 | tggaatGCGGTtgaat V$BRCA_01 | 38 (-) | 0.988 | 0.960 | CAACTtca V$BRCA_01 | 69 (+) | 1.000 | 0.964 | tacTGTTG V$BRCA_01 | 72 (+) | 1.000 | 0.974 | tgtTGTTG V$BRCA_01 | 78 (+) | 1.000 | 0.971 | tgaTGTTG V$BRCA_01 | 445 (+) | 0.997 | 0.963 | tcaGGTTG V$BRCA_01 | 458 (+) | 0.997 | 0.994 | tttGGTTG V$BRCA_01 | 495 (-) | 1.000 | 0.958 | CAACAtta V$BRCA_01 | 580 (+) | 1.000 | 0.956 | accTGTTG V$BRCA_01 | 1027 (+) | 0.988 | 0.956 | ggcAGTTG V$BRCA_01 | 1038 (-) | 1.000 | 0.976 | CAACAaag V$BRCA_01 | 1056 (-) | 1.000 | 0.958 | CAACAact V$BRCA_01 | 1093 (-) | 0.988 | 0.986 | CAACTaaa V$BRCA_01 | 1848 (-) | 1.000 | 0.999 | CAACAcaa V$BRCA_01 | 1862 (-) | 0.988 | 0.989 | CAACTgaa V$BRCA_01 | 2428 (+) | 0.997 | 0.957 | aggGGTTG V$BRCA_01 | 2553 (+) | 1.000 | 1.000 | ttcTGTTG V$BRCA_01 | 2582 (-) | 1.000 | 0.981 | 
CAACAaat V$BRCA_01 | 2826 (+) | 0.988 | 0.961 | tccAGTTG V$BRCA_01 | 3144 (-) | 1.000 | 0.961 | CAACAata V$BRCA_01 | 3422 (+) | 0.997 | 0.966 | tctGGTTG V$BRCA_01 | 4001 (-) | 0.994 | 0.956 | CAACGata V$BRCA_01 | 4256 (+) | 0.997 | 0.973 | cttGGTTG V$BRCA_01 | 4511 (-) | 1.000 | 0.971 | CAACAtca V$BRCA_01 | 4527 (-) | 1.000 | 0.981 | CAACAaat V$BRCA_01 | 4539 (-) | 1.000 | 0.967 | CAACAgcc V$BRCA_01 | 4922 (-) | 0.988 | 0.973 | CAACTtac V$BRCA_01 | 5142 (-) | 0.997 | 0.970 | CAACCtag V$BRCA_01 | 5278 (-) | 1.000 | 0.974 | CAACAaca V$BRCA_01 | 5281 (-) | 1.000 | 0.962 | CAACAggc V$BRCA_01 | 5855 (+) | 0.988 | 0.989 | ttgAGTTG V$BRCA_01 | 6107 (+) | 1.000 | 0.972 | tccTGTTG V$BRCA_01 | 6440 (-) | 0.997 | 0.976 | CAACCcag V$BRCA_01 | 7040 (+) | 0.988 | 0.976 | gttAGTTG V$BRCA_01 | 7168 (+) | 0.994 | 0.971 | tggCGTTG V$BRCA_01 | 7537 (+) | 0.997 | 0.984 | gttGGTTG V$BRCA_01 | 7809 (-) | 0.994 | 0.969 | CAACGaca V$BRCA_01 | 7815 (-) | 0.988 | 0.966 | CAACTcca V$BRCA_01 | 8003 (+) | 1.000 | 0.962 | gccTGTTG V$BRCA_01 | 8223 (+) | 1.000 | 0.971 | tgaTGTTG V$BRCA_01 | 8357 (-) | 0.997 | 0.957 | CAACCcct V$BRCA_01 | 8442 (-) | 0.997 | 0.958 | CAACCcgc V$BRCA_01 | 9524 (-) | 1.000 | 0.969 | CAACAaga V$BRCA_01 | 10150 (+) | 1.000 | 0.961 | tatTGTTG V$BRCA_01 | 10153 (+) | 1.000 | 0.974 | tgtTGTTG V$BRCA_01 | 10235 (+) | 0.997 | 0.970 | ctaGGTTG V$BRCA_01 | 10407 (+) | 0.997 | 0.961 | ggtGGTTG V$BRCA_01 | 10619 (+) | 0.988 | 0.955 | gggAGTTG V$BRCA_01 | 10665 (-) | 1.000 | 0.972 | CAACAgga V$BRCA_01 | 10694 (-) | 1.000 | 0.966 | CAACAccc V$BRCA_01 | 10943 (+) | 1.000 | 0.956 | cggTGTTG V$BRCA_01 | 10965 (+) | 1.000 | 0.962 | gccTGTTG V$BRCA_01 | 11040 (+) | 1.000 | 0.974 | tgtTGTTG V$BRCA_01 | 11494 (+) | 0.997 | 0.974 | tgcGGTTG V$BRCA_01 | 11517 (+) | 0.994 | 0.958 | tacCGTTG V$BRCA_01 | 11560 (-) | 0.997 | 0.994 | CAACCaaa V$BRCA_01 | 11712 (-) | 0.988 | 0.956 | CAACTgcc V$BRCA_01 | 11896 (-) | 0.997 | 0.978 | CAACCaat V$BRCA_01 | 11923 (-) | 0.994 | 0.961 | CAACGccc 
I$ABDA_Q6 | 391 (-) | 0.991 | 0.984 | attcATTTGc I$ABDA_Q6 | 585 (-) | 0.984 | 0.978 | ttgaATTAAc I$ABDA_Q6 | 639 (-) | 1.000 | 0.979 | aaacATTTAt I$ABDA_Q6 | 647 (-) | 1.000 | 0.981 | attcATTTAt I$ABDA_Q6 | 846 (+) | 1.000 | 0.990 | tTAAATaaat I$ABDA_Q6 | 957 (+) | 0.991 | 0.979 | tCAAATattt I$ABDA_Q6 | 959 (-) | 1.000 | 0.993 | aaatATTTAc I$ABDA_Q6 | 988 (+) | 0.984 | 0.980 | gTTAATaatt I$ABDA_Q6 | 991 (-) | 0.986 | 0.986 | aataATTTTc I$ABDA_Q6 | 1097 (-) | 0.991 | 0.983 | taaaATTTGa I$ABDA_Q6 | 1471 (-) | 0.991 | 0.979 | caatATTTGc I$ABDA_Q6 | 1563 (+) | 0.995 | 0.978 | gGAAATctac I$ABDA_Q6 | 1893 (-) | 1.000 | 0.982 | ttatATTTAt I$ABDA_Q6 | 1897 (-) | 0.986 | 0.977 | atttATTTTa I$ABDA_Q6 | 1905 (-) | 0.986 | 0.977 | tattATTTTa I$ABDA_Q6 | 1910 (-) | 0.991 | 0.982 | ttttATTTGa I$ABDA_Q6 | 2360 (-) | 1.000 | 0.978 | ttgtATTTAt I$ABDA_Q6 | 2561 (-) | 0.995 | 0.981 | tgaaATTTCc I$ABDA_Q6 | 2573 (-) | 1.000 | 0.989 | aagtATTTAc I$ABDA_Q6 | 3107 (-) | 0.995 | 0.987 | actaATTTCc I$ABDA_Q6 | 3127 (-) | 1.000 | 0.999 | attaATTTAc I$ABDA_Q6 | 3334 (-) | 0.995 | 0.986 | ttctATTTCc I$ABDA_Q6 | 3413 (+) | 0.984 | 0.978 | gTTAATgatt I$ABDA_Q6 | 3430 (-) | 0.982 | 0.976 | cttaATTGAc I$ABDA_Q6 | 3450 (+) | 0.984 | 0.981 | gTTAATaaaa I$ABDA_Q6 | 3470 (-) | 1.000 | 0.986 | catgATTTAc I$ABDA_Q6 | 3714 (-) | 1.000 | 0.988 | caatATTTAc I$ABDA_Q6 | 3731 (+) | 0.986 | 0.981 | tAAAATtatt I$ABDA_Q6 | 3737 (-) | 0.995 | 0.980 | tattATTTCt I$ABDA_Q6 | 3833 (-) | 0.995 | 0.992 | ataaATTTCc I$ABDA_Q6 | 3862 (-) | 1.000 | 0.982 | taccATTTAa I$ABDA_Q6 | 3875 (+) | 0.984 | 0.974 | gTTAATacaa I$ABDA_Q6 | 4006 (+) | 1.000 | 0.984 | aTAAATaaat I$ABDA_Q6 | 4010 (+) | 1.000 | 0.975 | aTAAATattc I$ABDA_Q6 | 4060 (-) | 0.984 | 0.978 | aataATTAAa I$ABDA_Q6 | 4109 (-) | 0.986 | 0.974 | atccATTTTc I$ABDA_Q6 | 4227 (+) | 1.000 | 0.975 | aTAAATtctc I$ABDA_Q6 | 4295 (+) | 1.000 | 0.976 | cTAAATttga I$ABDA_Q6 | 4295 (-) | 0.991 | 0.977 | ctaaATTTGa I$ABDA_Q6 | 4455 (-) | 0.995 | 0.982 | attcATTTCa 
I$ABDA_Q6 | 5102 (-) | 0.991 | 0.976 | tattATTTGt I$ABDA_Q6 | 5386 (-) | 0.986 | 0.977 | ttttATTTTa I$ABDA_Q6 | 5417 (+) | 1.000 | 0.983 | tTAAATcttt I$ABDA_Q6 | 5469 (-) | 0.995 | 0.986 | tactATTTCc I$ABDA_Q6 | 5593 (+) | 0.986 | 0.975 | tAAAATataa I$ABDA_Q6 | 5787 (+) | 0.995 | 0.985 | tGAAATaatt I$ABDA_Q6 | 5790 (-) | 0.984 | 0.978 | aataATTAAa I$ABDA_Q6 | 5795 (+) | 1.000 | 0.984 | tTAAATacaa I$ABDA_Q6 | 5948 (+) | 0.986 | 0.979 | tAAAATttta I$ABDA_Q6 | 5948 (-) | 0.986 | 0.979 | taaaATTTTa I$ABDA_Q6 | 5953 (-) | 0.991 | 0.988 | ttttATTTGc I$ABDA_Q6 | 6457 (-) | 0.995 | 0.983 | acttATTTCc I$ABDA_Q6 | 6509 (+) | 0.991 | 0.976 | aCAAATaaaa I$ABDA_Q6 | 6670 (-) | 1.000 | 0.983 | ttttATTTAg I$ABDA_Q6 | 6834 (-) | 1.000 | 0.979 | tgctATTTAc I$ABDA_Q6 | 7060 (-) | 0.991 | 0.974 | cttaATTTGt I$ABDA_Q6 | 7435 (-) | 0.986 | 0.977 | ttttATTTTa I$ABDA_Q6 | 7639 (-) | 0.986 | 0.977 | aattATTTTa I$ABDA_Q6 | 7828 (-) | 0.984 | 0.979 | aacaATTAAc I$ABDA_Q6 | 8335 (-) | 0.991 | 0.975 | cagtATTTGc I$ABDA_Q6 | 8635 (-) | 1.000 | 0.976 | gcttATTTAa I$ABDA_Q6 | 9485 (-) | 0.991 | 0.982 | ctttATTTGc I$ABDA_Q6 | 9715 (+) | 0.986 | 0.975 | tAAAATtgat I$ABDA_Q6 | 9796 (+) | 1.000 | 0.984 | gTAAATgtac I$ABDA_Q6 | 9821 (+) | 0.991 | 0.983 | tCAAATttta I$ABDA_Q6 | 9836 (-) | 0.986 | 0.974 | gataATTTTa I$ABDA_Q6 | 9877 (+) | 0.995 | 0.987 | gGAAATcaat I$ABDA_Q6 | 9881 (-) | 1.000 | 0.988 | atcaATTTAa I$ABDA_Q6 | 9888 (+) | 0.986 | 0.974 | tAAAATtctt I$ABDA_Q6 | 10138 (+) | 0.995 | 0.983 | aGAAATtatt I$ABDA_Q6 | 10325 (-) | 0.986 | 0.977 | ttttATTTTa I$ABDA_Q6 | 10348 (-) | 0.980 | 0.976 | tttaATTCAa I$ABDA_Q6 | 10506 (+) | 0.982 | 0.976 | gTCAATtatg I$ABDA_Q6 | 10585 (-) | 0.995 | 0.990 | ttcaATTTCc I$ABDA_Q6 | 10914 (+) | 0.991 | 0.988 | gCAAATaata I$ABDA_Q6 | 11157 (+) | 0.986 | 0.974 | gAAAATgaag I$ABDA_Q6 | 11207 (-) | 0.991 | 0.979 | attaATTTGt I$ABDA_Q6 | 11274 (+) | 0.995 | 0.989 | gGAAATtaag I$ABDA_Q6 | 11295 (+) | 0.995 | 0.980 | aGAAATaata I$ABDA_Q6 | 11340 (+) | 1.000 | 0.975 | 
gTAAATgcca I$ABDA_Q6 | 11393 (+) | 1.000 | 0.986 | tTAAATtttg I$ABDA_Q6 | 11421 (-) | 1.000 | 0.977 | ttctATTTAg I$ABDA_Q6 | 11464 (+) | 0.995 | 0.986 | tGAAATtttt I$ABDA_Q6 | 11471 (-) | 0.991 | 0.976 | ttttATTTGt I$ABDA_Q6 | 11639 (+) | 1.000 | 0.985 | tTAAATgtat I$ABDA_Q6 | 11663 (-) | 0.986 | 0.975 | cacaATTTTc I$ABDA_Q6 | 11940 (-) | 0.995 | 0.980 | aactATTTCa I$ANTP_Q6_01 | 64 (-) | 0.970 | 0.932 | aTAATTactgtt I$ANTP_Q6_01 | 843 (+) | 0.987 | 0.915 | aagttaAATAAa I$ANTP_Q6_01 | 1892 (-) | 0.911 | 0.942 | tTTATAtttatt I$ANTP_Q6_01 | 1906 (-) | 0.987 | 0.944 | aTTATTttattt I$ANTP_Q6_01 | 2516 (+) | 0.970 | 0.976 | aaaaaaAATTAc I$ANTP_Q6_01 | 2739 (-) | 1.000 | 0.963 | tTTATGcctttt I$ANTP_Q6_01 | 3128 (-) | 0.970 | 0.926 | tTAATTtacatt I$ANTP_Q6_01 | 3255 (-) | 1.000 | 0.928 | cTTATGgttttc I$ANTP_Q6_01 | 3999 (+) | 0.911 | 0.915 | aacaacGATAAa I$ANTP_Q6_01 | 5398 (-) | 0.911 | 0.938 | tTTATAtttttt I$ANTP_Q6_01 | 5435 (-) | 0.987 | 0.981 | tTTATTtgtatt I$ANTP_Q6_01 | 5582 (+) | 0.987 | 0.929 | aaggatAATAAt I$ANTP_Q6_01 | 5585 (+) | 0.987 | 0.918 | gataatAATAAa I$ANTP_Q6_01 | 6589 (+) | 0.987 | 0.976 | aaaacaAATAAg I$ANTP_Q6_01 | 6671 (-) | 0.987 | 0.915 | tTTATTtagctt I$ANTP_Q6_01 | 7061 (-) | 0.970 | 0.965 | tTAATTtgtttt I$ANTP_Q6_01 | 7426 (-) | 0.987 | 0.976 | tTTATTtgtttt I$ANTP_Q6_01 | 7782 (+) | 0.970 | 0.953 | aacaaaAATTAt I$ANTP_Q6_01 | 9866 (+) | 0.934 | 0.920 | aagaagAATGAg I$ANTP_Q6_01 | 10142 (-) | 0.987 | 0.984 | aTTATTattatt I$ANTP_Q6_01 | 10145 (-) | 0.987 | 0.957 | aTTATTattgtt I$ANTP_Q6_01 | 10148 (-) | 0.987 | 0.965 | aTTATTgttgtt I$ANTP_Q6_01 | 10331 (-) | 0.911 | 0.938 | tTTATAgttttt I$ANTP_Q6_01 | 10503 (+) | 0.970 | 0.921 | aaagtcAATTAt I$ANTP_Q6_01 | 11271 (+) | 0.970 | 0.925 | aaaggaAATTAa I$ANTP_Q6_01 | 11298 (+) | 0.987 | 0.984 | aataatAATAAc I$ANTP_Q6_01 | 11342 (+) | 0.982 | 0.927 | aaatgcCATTAt I$ANTP_Q6_01 | 11389 (-) | 0.987 | 0.920 | cTTATTaaattt I$ANTP_Q6_01 | 11442 (-) | 0.911 | 0.920 | cTTATCtatttt I$ANTP_Q6_01 | 11472 (-) | 
0.987 | 0.976 | tTTATTtgtttt I$ANTP_Q6_01 | 11666 (+) | 1.000 | 0.933 | aattttCATAAc I$ANTP_Q6_01 | 11683 (+) | 0.987 | 0.963 | aagaccAATAAa I$BYN_Q6 | 11793 (+) | 0.891 | 0.857 | acaaaTCCCAgattg I$CEBP_Q6 | 25 (+) | 1.000 | 0.910 | gaatgTGTGTgaacaacttcaga I$CEBP_Q6 | 1115 (+) | 1.000 | 0.859 | tgattTGTGTatgcacgtgtgtt I$CEBP_Q6 | 1232 (+) | 0.940 | 0.864 | actgcTTTGAcatcagggcagca I$CEBP_Q6 | 1251 (+) | 0.885 | 0.860 | agcatTCTGTcatcatccgtgtg I$CEBP_Q6 | 2341 (-) | 0.940 | 0.867 | ccctacgctcttcTCAAActtgt I$CEBP_Q6 | 2361 (-) | 1.000 | 0.881 | tgtatttattttgCCACAgtttt I$CEBP_Q6 | 4270 (-) | 1.000 | 0.861 | tgcttacctaagcACACAttaga I$CEBP_Q6 | 4679 (+) | 1.000 | 0.887 | agttcTGTGGcaggaaacttgat I$CEBP_Q6 | 5251 (+) | 1.000 | 0.876 | tcacaTGTGGtcacagtcatcct I$CEBP_Q6 | 6062 (-) | 1.000 | 0.871 | cctgcacgtgctgCCACAgtctc I$CEBP_Q6 | 6440 (-) | 0.967 | 0.878 | caacccagtcatgGCAAActtat I$CEBP_Q6 | 6494 (-) | 1.000 | 0.903 | attgcagatgcggCCACAaataa I$CEBP_Q6 | 7782 (-) | 0.967 | 0.926 | aacaaaaattatcCCAAAgataa I$CEBP_Q6 | 7897 (+) | 0.940 | 0.880 | tttatTTTGAtattagcttttac I$CEBP_Q6 | 7932 (+) | 1.000 | 0.882 | tcgttTGTGTgtttagttttgag I$CEBP_Q6 | 11394 (+) | 0.967 | 0.904 | taaatTTTGTacgcatttattct I$CEBP_Q6 | 11525 (-) | 1.000 | 0.881 | tccgtagcaaatcCCACAtatgg I$CEBP_Q6 | 11649 (-) | 1.000 | 0.909 | ttctgtactgttaACACAatttt I$DEAF1_Q6 | 4988 (-) | 1.000 | 0.977 | tgtCCGAAtggc I$DEAF1_Q6 | 7182 (-) | 1.000 | 0.959 | gtcCCGAAcggc I$DEAF1_Q6 | 9558 (+) | 0.936 | 0.923 | gcctGTCGGttt I$KR_Q6 | 553 (+) | 0.878 | 0.904 | ttaaCCTTTtct I$KR_Q6 | 624 (+) | 0.930 | 0.920 | atgaCACTTttt I$KR_Q6 | 708 (+) | 0.930 | 0.889 | ggatCTCTTtac I$KR_Q6 | 816 (+) | 0.930 | 0.914 | ccgaCACTTttt I$KR_Q6 | 901 (+) | 0.930 | 0.878 | ttttCACTTtct I$KR_Q6 | 2126 (+) | 0.930 | 0.874 | ctctCTCTTtct I$KR_Q6 | 2192 (+) | 0.930 | 0.874 | ctctCTCTTtct I$KR_Q6 | 2508 (-) | 1.000 | 0.936 | aaaAAGGGaaaa I$KR_Q6 | 2656 (+) | 0.930 | 0.873 | aatgCACTTtga I$KR_Q6 | 2676 (+) | 0.816 | 0.859 | ttaaCCCGTtgc I$KR_Q6 | 3187 
(+) | 0.930 | 0.870 | ccctCTCTTtct I$KR_Q6 | 3218 (+) | 0.930 | 0.897 | caagCACTTtta I$KR_Q6 | 3781 (+) | 1.000 | 0.930 | tcccCCCTTttt I$KR_Q6 | 3860 (+) | 0.866 | 0.861 | agtaCCATTtaa I$KR_Q6 | 4107 (+) | 0.866 | 0.854 | acatCCATTttc I$KR_Q6 | 5159 (-) | 1.000 | 0.930 | gaaAAGGGactt I$KR_Q6 | 5350 (+) | 0.930 | 0.874 | tcccCTCTTtga I$KR_Q6 | 5428 (+) | 0.930 | 0.868 | cctcCACTTtat I$KR_Q6 | 5452 (+) | 0.878 | 0.903 | ataaCCTTTtcc I$KR_Q6 | 5540 (+) | 0.930 | 0.872 | cactCTCTTtct I$KR_Q6 | 6725 (-) | 1.000 | 0.963 | ttaAAGGGtgag I$KR_Q6 | 6736 (+) | 1.000 | 0.946 | gcagCCCTTtga I$KR_Q6 | 6755 (+) | 1.000 | 0.948 | ctatCCCTTtat I$KR_Q6 | 6818 (+) | 1.000 | 0.934 | ttctCCCTTttt I$KR_Q6 | 7848 (-) | 0.917 | 0.867 | caaAAGCGacag I$KR_Q6 | 8374 (+) | 0.930 | 0.869 | gctgCTCTTtga I$KR_Q6 | 8618 (+) | 0.930 | 0.899 | caatCACTTtga I$KR_Q6 | 9721 (+) | 0.930 | 0.898 | tgatCTCTTtgt I$KR_Q6 | 10354 (-) | 0.930 | 0.915 | tcaAAGTGtcag I$KR_Q6 | 10445 (+) | 0.930 | 0.875 | acgtCTCTTtta I$KR_Q6 | 10469 (+) | 0.930 | 0.872 | atctCACTTtag I$KR_Q6 | 10628 (-) | 0.930 | 0.870 | agaAAGTGactg I$KR_Q6 | 11612 (-) | 1.000 | 0.953 | agaAAGGGgtac I$KR_Q6 | 11721 (+) | 0.866 | 0.870 | tccaCCATTttc I$KR_Q6 | 11755 (-) | 0.930 | 0.943 | agaAAGAGttaa I$MAD_Q6 | 3013 (-) | 0.981 | 0.982 | ggCGTCTc I$MAD_Q6 | 5747 (+) | 0.981 | 0.964 | tAGACGca I$MAD_Q6 | 6135 (-) | 0.981 | 0.958 | ggCGTCTt I$MAD_Q6 | 6383 (+) | 0.981 | 0.964 | tAGACGaa I$MAD_Q6 | 7241 (-) | 0.981 | 0.982 | atCGTCTc I$MAD_Q6 | 7853 (+) | 0.940 | 0.943 | gCGACAgc I$MAD_Q6 | 8565 (-) | 0.981 | 0.958 | tcCGTCTt I$MAD_Q6 | 8732 (-) | 1.000 | 0.953 | aaCGTCGg I$MAD_Q6 | 9272 (+) | 0.940 | 0.943 | gCGACAgc I$MAD_Q6 | 10444 (-) | 0.981 | 0.960 | aaCGTCTc I$MAD_Q6 | 10901 (+) | 0.981 | 0.982 | gAGACGct I$MAD_Q6 | 11320 (+) | 1.000 | 0.975 | aCGACGat I$PRD_Q6 | 992 (-) | 1.000 | 0.986 | atAATTT I$PRD_Q6 | 1108 (+) | 0.949 | 0.947 | AAATCgc I$PRD_Q6 | 2421 (+) | 0.949 | 0.933 | AAATCac I$PRD_Q6 | 2521 (+) | 1.000 | 0.979 | AAATTac 
I$PRD_Q6 | 3108 (-) | 1.000 | 0.959 | ctAATTT I$PRD_Q6 | 3128 (-) | 1.000 | 0.972 | ttAATTT I$PRD_Q6 | 3471 (-) | 0.949 | 0.939 | atGATTT I$PRD_Q6 | 3733 (+) | 1.000 | 0.986 | AAATTat I$PRD_Q6 | 4725 (-) | 0.949 | 0.933 | gtGATTT I$PRD_Q6 | 7061 (-) | 1.000 | 0.972 | ttAATTT I$PRD_Q6 | 7787 (+) | 1.000 | 0.986 | AAATTat I$PRD_Q6 | 8858 (+) | 1.000 | 0.972 | AAATTaa I$PRD_Q6 | 8954 (+) | 1.000 | 0.959 | AAATTag I$PRD_Q6 | 9717 (+) | 1.000 | 0.987 | AAATTga I$PRD_Q6 | 9783 (-) | 1.000 | 1.000 | acAATTT I$PRD_Q6 | 9837 (-) | 1.000 | 0.986 | atAATTT I$PRD_Q6 | 9882 (-) | 1.000 | 0.987 | tcAATTT I$PRD_Q6 | 9933 (-) | 1.000 | 0.979 | gtAATTT I$PRD_Q6 | 10128 (+) | 1.000 | 0.959 | AAATTag I$PRD_Q6 | 10140 (+) | 1.000 | 0.986 | AAATTat I$PRD_Q6 | 10264 (-) | 0.949 | 0.933 | gtGATTT I$PRD_Q6 | 10586 (-) | 1.000 | 0.987 | tcAATTT I$PRD_Q6 | 10815 (+) | 0.949 | 0.947 | AAATCgc I$PRD_Q6 | 11208 (-) | 1.000 | 0.972 | ttAATTT I$PRD_Q6 | 11276 (+) | 1.000 | 0.972 | AAATTaa I$PRD_Q6 | 11664 (-) | 1.000 | 1.000 | acAATTT I$TCF_Q6 | 144 (-) | 0.957 | 0.945 | cagacaTTTGCtcatc I$TCF_Q6 | 434 (-) | 0.937 | 0.915 | accacgTTTAAtcagg I$TCF_Q6 | 452 (-) | 0.964 | 0.956 | gtctgtTTTGGttggc I$TCF_Q6 | 646 (-) | 0.901 | 0.910 | tattcaTTTATtatat I$TCF_Q6 | 819 (-) | 0.937 | 0.928 | acacttTTTAAtacta I$TCF_Q6 | 906 (-) | 0.928 | 0.932 | actttcTTTCAtgcgt I$TCF_Q6 | 952 (+) | 1.000 | 0.980 | aatgaTCAAAtattta I$TCF_Q6 | 991 (-) | 0.928 | 0.924 | aataatTTTCAtgtgg I$TCF_Q6 | 1035 (+) | 0.964 | 0.959 | aatcaACAAAgcagca I$TCF_Q6 | 1043 (+) | 0.957 | 0.953 | aagcaGCAAAaatcaa I$TCF_Q6 | 1081 (-) | 0.964 | 0.962 | ggtcatTTTGGtcaac I$TCF_Q6 | 1470 (-) | 0.957 | 0.936 | tcaataTTTGCtctgg I$TCF_Q6 | 1529 (-) | 0.964 | 0.959 | tgttttTTTGTtctaa I$TCF_Q6 | 1885 (-) | 0.928 | 0.924 | ctcccaTTTTAtattt I$TCF_Q6 | 1896 (-) | 0.928 | 0.927 | tatttaTTTTAttatt I$TCF_Q6 | 1897 (-) | 0.901 | 0.916 | atttatTTTATtattt I$TCF_Q6 | 1904 (-) | 0.928 | 0.916 | ttattaTTTTAtttga I$TCF_Q6 | 2024 (+) | 0.937 | 0.929 | 
tgggaTTAAAggagtg I$TCF_Q6 | 2178 (-) | 0.891 | 0.910 | ccttctTTTCTtctct I$TCF_Q6 | 2411 (+) | 0.964 | 0.962 | aaaaaACAAAaaatca I$TCF_Q6 | 2579 (+) | 0.964 | 0.949 | ttacaACAAAtgcagt I$TCF_Q6 | 2598 (+) | 0.937 | 0.931 | gactaTTAAAccaaga I$TCF_Q6 | 2657 (-) | 1.000 | 0.978 | atgcacTTTGAtgaca I$TCF_Q6 | 2896 (-) | 0.928 | 0.932 | catcttTTTTAtccac I$TCF_Q6 | 3403 (+) | 0.928 | 0.933 | agttaTAAAAgttaat I$TCF_Q6 | 3449 (+) | 0.901 | 0.910 | agttaATAAAacacag I$TCF_Q6 | 3450 (+) | 0.928 | 0.920 | gttaaTAAAAcacagg I$TCF_Q6 | 3535 (-) | 0.964 | 0.935 | ccgaggTTTGGttctg I$TCF_Q6 | 3725 (+) | 0.901 | 0.911 | ggtgaCTAAAattatt I$TCF_Q6 | 3845 (+) | 0.928 | 0.912 | tctcaTAAAAtatcca I$TCF_Q6 | 3991 (+) | 0.928 | 0.910 | tctcaTGAAAcaacga I$TCF_Q6 | 4060 (+) | 0.937 | 0.938 | aataaTTAAAatatgt I$TCF_Q6 | 4488 (+) | 0.964 | 0.952 | tttgaACAAAaaacaa I$TCF_Q6 | 4495 (+) | 0.964 | 0.969 | aaaaaACAAAacaaaa I$TCF_Q6 | 4500 (+) | 0.964 | 0.969 | acaaaACAAAacaaca I$TCF_Q6 | 4524 (+) | 0.964 | 0.952 | agacaACAAAtgcgtc I$TCF_Q6 | 4548 (-) | 1.000 | 0.981 | cttagtTTTGAtgctg I$TCF_Q6 | 4633 (+) | 0.957 | 0.951 | gggtaGCAAAatcccg I$TCF_Q6 | 4659 (-) | 0.964 | 0.958 | tcggacTTTGTtttgc I$TCF_Q6 | 4664 (-) | 0.957 | 0.952 | ctttgtTTTGCtcata I$TCF_Q6 | 4809 (-) | 0.964 | 0.942 | ctggcaTTTGTtacca I$TCF_Q6 | 4870 (+) | 0.957 | 0.940 | cagcaGCAAAgctctg I$TCF_Q6 | 5380 (-) | 0.928 | 0.918 | ttttagTTTTAtttta I$TCF_Q6 | 5381 (-) | 0.901 | 0.915 | tttagtTTTATtttac I$TCF_Q6 | 5401 (-) | 0.928 | 0.924 | atatttTTTCAtctta I$TCF_Q6 | 5412 (+) | 0.937 | 0.920 | tcttaTTAAAtctttc I$TCF_Q6 | 5479 (-) | 1.000 | 0.988 | tgatttTTTGAtcccc I$TCF_Q6 | 5588 (+) | 0.928 | 0.925 | aataaTAAAAtataat I$TCF_Q6 | 5643 (+) | 0.901 | 0.914 | acaaaATAAAaagaag I$TCF_Q6 | 5644 (+) | 0.928 | 0.928 | caaaaTAAAAagaaga I$TCF_Q6 | 5652 (+) | 0.891 | 0.911 | aaagaAGAAAagaaca I$TCF_Q6 | 5782 (+) | 0.928 | 0.925 | aaataTGAAAtaatta I$TCF_Q6 | 5790 (+) | 0.937 | 0.933 | aataaTTAAAtacaaa I$TCF_Q6 | 5908 (-) | 0.964 | 0.945 | gagtagTTTGTttccc 
I$TCF_Q6 | 5947 (-) | 0.928 | 0.920 | gtaaaaTTTTAtttgc I$TCF_Q6 | 6372 (-) | 0.964 | 0.948 | gcgagaTTTGGtagac I$TCF_Q6 | 6424 (-) | 0.928 | 0.933 | tctggcTTTCAtcggt I$TCF_Q6 | 6465 (+) | 0.928 | 0.911 | cctgaTAAAAtgctca I$TCF_Q6 | 6509 (+) | 0.928 | 0.930 | acaaaTAAAAgaacac I$TCF_Q6 | 6587 (+) | 0.964 | 0.952 | acaaaACAAAtaaggt I$TCF_Q6 | 6604 (-) | 0.957 | 0.942 | gtgctaTTTGCtaaac I$TCF_Q6 | 6664 (-) | 0.928 | 0.913 | tgaagaTTTTAtttag I$TCF_Q6 | 6769 (+) | 0.964 | 0.949 | catgaACAAAacagca I$TCF_Q6 | 6782 (+) | 0.917 | 0.914 | gcagaTCTAAtgctgg I$TCF_Q6 | 6821 (-) | 0.928 | 0.930 | tcccttTTTTAtgtgc I$TCF_Q6 | 7059 (-) | 0.964 | 0.947 | tcttaaTTTGTttttg I$TCF_Q6 | 7065 (-) | 0.964 | 0.950 | tttgttTTTGTttgtg I$TCF_Q6 | 7210 (+) | 0.964 | 0.958 | attgaACAAAaagtag I$TCF_Q6 | 7258 (+) | 0.937 | 0.929 | tctcaTTAAAatatcc I$TCF_Q6 | 7424 (-) | 0.964 | 0.948 | ggtttaTTTGTtttta I$TCF_Q6 | 7429 (-) | 0.928 | 0.929 | atttgtTTTTAtttta I$TCF_Q6 | 7430 (-) | 0.901 | 0.914 | tttgttTTTATtttat I$TCF_Q6 | 7434 (-) | 0.928 | 0.929 | tttttaTTTTAtgtgt I$TCF_Q6 | 7566 (+) | 0.964 | 0.952 | ggaaaCCAAAtcccca I$TCF_Q6 | 7586 (+) | 0.964 | 0.948 | gtcaaACAAAtgctct I$TCF_Q6 | 7773 (+) | 0.964 | 0.969 | aaaaaACAAAacaaaa I$TCF_Q6 | 7778 (+) | 0.964 | 0.960 | acaaaACAAAaattat I$TCF_Q6 | 7820 (+) | 0.964 | 0.939 | ccaaaACAAAcaatta I$TCF_Q6 | 7884 (-) | 0.964 | 0.955 | aagtcaTTTGGttttt I$TCF_Q6 | 7890 (-) | 0.928 | 0.926 | tttggtTTTTAttttg I$TCF_Q6 | 7896 (-) | 1.000 | 0.989 | ttttatTTTGAtatta I$TCF_Q6 | 7968 (-) | 0.964 | 0.949 | tagttgTTTGTttttc I$TCF_Q6 | 8183 (-) | 0.957 | 0.942 | aacttgTTTGCtctgc I$TCF_Q6 | 8334 (-) | 0.957 | 0.941 | ccagtaTTTGCtggac I$TCF_Q6 | 8375 (-) | 1.000 | 0.989 | ctgctcTTTGAtctgt I$TCF_Q6 | 8464 (+) | 0.917 | 0.917 | tctcaTCTAAggcaaa I$TCF_Q6 | 8565 (-) | 1.000 | 0.977 | tccgtcTTTGAttaaa I$TCF_Q6 | 8571 (+) | 0.937 | 0.920 | tttgaTTAAAtggggc I$TCF_Q6 | 8593 (-) | 0.937 | 0.928 | aacatcTTTAAtatta I$TCF_Q6 | 8834 (+) | 0.964 | 0.951 | caaaaCCAAAacagaa I$TCF_Q6 | 8885 (+) 
| 1.000 | 0.973 | tagaaTCAAAcgtacc I$TCF_Q6 | 8933 (+) | 0.901 | 0.915 | ataaaCTAAAgcaact I$TCF_Q6 | 8985 (-) | 0.937 | 0.927 | cacaccTTTAAttcca I$TCF_Q6 | 9104 (+) | 0.957 | 0.952 | caaaaGCAAAacaaaa I$TCF_Q6 | 9109 (+) | 0.964 | 0.960 | gcaaaACAAAaaaccg I$TCF_Q6 | 9532 (+) | 0.964 | 0.937 | ctagaACAAAcaagtc I$TCF_Q6 | 9782 (-) | 0.964 | 0.958 | aacaatTTTGTtcagt I$TCF_Q6 | 9858 (+) | 0.937 | 0.924 | cagcaTTAAAgaagaa I$TCF_Q6 | 10311 (-) | 0.928 | 0.933 | aactgtTTTTAtattt I$TCF_Q6 | 10319 (-) | 0.928 | 0.925 | ttatatTTTTAtttta I$TCF_Q6 | 10320 (-) | 0.901 | 0.915 | tatattTTTATtttat I$TCF_Q6 | 10324 (-) | 0.928 | 0.926 | tttttaTTTTAtagtt I$TCF_Q6 | 10333 (-) | 0.928 | 0.934 | tatagtTTTTAttgtt I$TCF_Q6 | 10342 (-) | 0.937 | 0.935 | tattgtTTTAAttcaa I$TCF_Q6 | 10389 (+) | 0.928 | 0.924 | ccataTAAAAatgaac I$TCF_Q6 | 10446 (-) | 0.928 | 0.926 | cgtctcTTTTAtatca I$TCF_Q6 | 10495 (+) | 1.000 | 0.991 | aaggaTCAAAagtcaa I$TCF_Q6 | 10569 (-) | 0.957 | 0.933 | acagcgTTTGCtttgg I$TCF_Q6 | 10574 (-) | 0.964 | 0.954 | gtttgcTTTGGttcaa I$TCF_Q6 | 10594 (+) | 1.000 | 0.978 | ctataTCAAAaaggag I$TCF_Q6 | 10704 (+) | 0.928 | 0.915 | gtttaTGAAAcatctc I$TCF_Q6 | 10727 (-) | 0.928 | 0.922 | tgtagcTTTTAttggg I$TCF_Q6 | 10807 (+) | 0.917 | 0.918 | gtgcaTCTAAatcgca I$TCF_Q6 | 11047 (-) | 0.964 | 0.953 | ggttccTTTGTtaacg I$TCF_Q6 | 11151 (+) | 0.928 | 0.926 | tctgaTGAAAatgaag I$TCF_Q6 | 11206 (-) | 0.964 | 0.947 | tattaaTTTGTttgaa I$TCF_Q6 | 11224 (-) | 0.964 | 0.939 | gcccagTTTGTtctaa I$TCF_Q6 | 11388 (+) | 0.937 | 0.929 | acttaTTAAAttttgt I$TCF_Q6 | 11445 (-) | 0.964 | 0.959 | atctatTTTGGtgaat I$TCF_Q6 | 11465 (-) | 0.928 | 0.930 | gaaattTTTTAtttgt I$TCF_Q6 | 11470 (-) | 0.964 | 0.957 | tttttaTTTGTttttc I$TCF_Q6 | 11526 (+) | 0.957 | 0.932 | ccgtaGCAAAtcccac I$TCF_Q6 | 11558 (+) | 0.964 | 0.937 | ttcaaCCAAActctta I$TCF_Q6 | 11663 (-) | 0.928 | 0.927 | cacaatTTTCAtaacc I$TCF_Q6 | 11672 (+) | 0.964 | 0.946 | cataaCCAAAaaagac I$TCF_Q6 | 11740 (+) | 0.964 | 0.945 | tctcaCCAAAtgacta I$ABDB_Q6 | 
835 (+) | 1.000 | 0.916 | tATAAAa I$ABDB_Q6 | 912 (-) | 0.868 | 0.896 | tTTCATg I$ABDB_Q6 | 997 (-) | 0.868 | 0.896 | tTTCATg I$ABDB_Q6 | 1891 (-) | 1.000 | 0.916 | tTTTATa I$ABDB_Q6 | 1902 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 1910 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 2902 (-) | 1.000 | 0.916 | tTTTATc I$ABDB_Q6 | 3406 (+) | 1.000 | 0.916 | tATAAAa I$ABDB_Q6 | 3453 (+) | 1.000 | 0.949 | aATAAAa I$ABDB_Q6 | 3848 (+) | 1.000 | 1.000 | cATAAAa I$ABDB_Q6 | 3994 (+) | 0.868 | 0.896 | cATGAAa I$ABDB_Q6 | 4317 (+) | 1.000 | 1.000 | cATAAAa I$ABDB_Q6 | 5386 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 5591 (+) | 1.000 | 0.949 | aATAAAa I$ABDB_Q6 | 5647 (+) | 1.000 | 0.949 | aATAAAa I$ABDB_Q6 | 5953 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 6468 (+) | 1.000 | 0.916 | gATAAAa I$ABDB_Q6 | 6512 (+) | 1.000 | 0.949 | aATAAAa I$ABDB_Q6 | 6670 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 6827 (-) | 1.000 | 1.000 | tTTTATg I$ABDB_Q6 | 7435 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 7440 (-) | 1.000 | 1.000 | tTTTATg I$ABDB_Q6 | 7896 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 9427 (+) | 1.000 | 0.896 | cATAAAt I$ABDB_Q6 | 10124 (+) | 1.000 | 0.949 | aATAAAa I$ABDB_Q6 | 10317 (-) | 1.000 | 0.916 | tTTTATa I$ABDB_Q6 | 10325 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 10330 (-) | 1.000 | 0.916 | tTTTATa I$ABDB_Q6 | 10339 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 10392 (+) | 1.000 | 0.916 | tATAAAa I$ABDB_Q6 | 10452 (-) | 1.000 | 0.916 | tTTTATa I$ABDB_Q6 | 10733 (-) | 1.000 | 0.949 | tTTTATt I$ABDB_Q6 | 11471 (-) | 1.000 | 0.949 | tTTTATt I$BRK_Q6 | 5820 (-) | 1.000 | 0.992 | cTGGCGt I$BRK_Q6 | 7167 (-) | 1.000 | 0.992 | cTGGCGt I$BRK_Q6 | 7629 (-) | 1.000 | 0.992 | cTGGCGg I$BRK_Q6 | 8331 (+) | 1.000 | 1.000 | gCGCCAg I$CAD_Q6 | 228 (+) | 0.990 | 0.984 | tccTTTTTgt I$CAD_Q6 | 877 (-) | 0.990 | 0.979 | gcACAAAcac I$CAD_Q6 | 1048 (-) | 0.990 | 0.972 | gcAAAAAtca I$CAD_Q6 | 1115 (+) | 0.990 | 0.971 | tgaTTTGTgt I$CAD_Q6 | 1137 (-) | 0.936 | 0.936 | tcCTAAAacc I$CAD_Q6 | 1149 (+) | 1.000 
| 0.984 | tacTTTATgt I$CAD_Q6 | 1220 (+) | 0.983 | 0.974 | gagTTTCTga I$CAD_Q6 | 1530 (+) | 0.990 | 0.981 | gttTTTTTgt I$CAD_Q6 | 1718 (+) | 0.962 | 0.961 | cctTTTCCgg I$CAD_Q6 | 1961 (+) | 0.990 | 0.987 | cctTTTTTga I$CAD_Q6 | 2291 (+) | 0.990 | 0.978 | tagTTTTTgt I$CAD_Q6 | 2293 (+) | 0.990 | 0.981 | gttTTTGTgt I$CAD_Q6 | 2416 (-) | 0.990 | 0.979 | acAAAAAatc I$CAD_Q6 | 2502 (-) | 0.969 | 0.965 | ccGAAAAaaa I$CAD_Q6 | 2549 (+) | 0.983 | 0.968 | aggTTTCTgt I$CAD_Q6 | 2563 (+) | 0.962 | 0.943 | aaaTTTCCgt I$CAD_Q6 | 2736 (+) | 1.000 | 0.992 | gccTTTATgc I$CAD_Q6 | 3091 (+) | 0.969 | 0.965 | tatTTTTCga I$CAD_Q6 | 3602 (+) | 0.983 | 0.972 | gggTTTCTgc I$CAD_Q6 | 3716 (+) | 0.979 | 0.960 | ataTTTACgg I$CAD_Q6 | 3761 (-) | 0.926 | 0.933 | tcTTAAAagg I$CAD_Q6 | 3847 (-) | 1.000 | 0.992 | tcATAAAata I$CAD_Q6 | 3916 (+) | 1.000 | 0.983 | agcTTTATga I$CAD_Q6 | 4316 (-) | 1.000 | 0.985 | gcATAAAact I$CAD_Q6 | 4493 (-) | 0.990 | 0.980 | acAAAAAaca I$CAD_Q6 | 4951 (+) | 0.983 | 0.969 | gacTTTCTgc I$CAD_Q6 | 5104 (+) | 0.990 | 0.976 | ttaTTTGTga I$CAD_Q6 | 5151 (+) | 0.979 | 0.967 | cagTTTACga I$CAD_Q6 | 5157 (-) | 0.969 | 0.962 | acGAAAAggg I$CAD_Q6 | 5480 (+) | 0.990 | 0.982 | gatTTTTTga I$CAD_Q6 | 5544 (+) | 0.983 | 0.968 | ctcTTTCTgt I$CAD_Q6 | 5716 (-) | 0.990 | 0.982 | ccACAAAaac I$CAD_Q6 | 5718 (-) | 0.990 | 0.978 | acAAAAAcac I$CAD_Q6 | 5750 (-) | 0.969 | 0.966 | acGCAAAgga I$CAD_Q6 | 5801 (-) | 0.990 | 0.982 | acAAAAAaaa I$CAD_Q6 | 6507 (-) | 0.990 | 0.974 | ccACAAAtaa I$CAD_Q6 | 6585 (-) | 0.990 | 0.980 | acACAAAaca I$CAD_Q6 | 6825 (+) | 1.000 | 0.991 | tttTTTATgt I$CAD_Q6 | 7066 (+) | 0.990 | 0.980 | ttgTTTTTgt I$CAD_Q6 | 7072 (+) | 0.990 | 0.980 | ttgTTTGTgt I$CAD_Q6 | 7087 (+) | 0.969 | 0.959 | tgcTTTGCgc I$CAD_Q6 | 7215 (-) | 0.990 | 0.976 | acAAAAAgta I$CAD_Q6 | 7271 (-) | 0.936 | 0.938 | tcCTAAAata I$CAD_Q6 | 7366 (-) | 0.983 | 0.973 | gcAGAAAgaa I$CAD_Q6 | 7438 (+) | 1.000 | 0.989 | tatTTTATgt I$CAD_Q6 | 7771 (-) | 0.990 | 0.983 | tcAAAAAaca I$CAD_Q6 | 7783 
(-) | 0.990 | 0.971 | acAAAAAtta I$CAD_Q6 | 7932 (+) | 0.990 | 0.986 | tcgTTTGTgt I$CAD_Q6 | 8076 (-) | 0.983 | 0.969 | gcAGAAActt I$CAD_Q6 | 8134 (+) | 0.983 | 0.971 | tgcTTTCTgc I$CAD_Q6 | 8475 (-) | 0.990 | 0.974 | gcAAAAAtaa I$CAD_Q6 | 8693 (+) | 0.983 | 0.980 | gccTTTCTga I$CAD_Q6 | 8899 (-) | 0.990 | 0.976 | ccACAAAcag I$CAD_Q6 | 9025 (+) | 0.990 | 0.976 | atcTTTGTga I$CAD_Q6 | 9114 (-) | 0.990 | 0.979 | acAAAAAacc I$CAD_Q6 | 9135 (+) | 0.990 | 0.976 | cagTTTTTga I$CAD_Q6 | 9217 (-) | 0.969 | 0.965 | acGCAAAggc I$CAD_Q6 | 9426 (-) | 1.000 | 0.983 | ccATAAAtaa I$CAD_Q6 | 9564 (+) | 0.983 | 0.969 | cggTTTCTgg I$CAD_Q6 | 9917 (+) | 0.990 | 0.971 | ggaTTTTTgc I$CAD_Q6 | 9963 (+) | 0.983 | 0.971 | atgTTTCTgg I$CAD_Q6 | 10029 (+) | 0.990 | 0.981 | tagTTTGTga I$CAD_Q6 | 10161 (+) | 0.969 | 0.968 | tcgTTTTCgt I$CAD_Q6 | 10265 (+) | 0.990 | 0.972 | tgaTTTTTgc I$CAD_Q6 | 10599 (-) | 0.990 | 0.987 | tcAAAAAgga I$CAD_Q6 | 10702 (+) | 1.000 | 0.990 | gtgTTTATga I$CAD_Q6 | 11010 (+) | 0.936 | 0.931 | tgcTTTAGgg I$CAD_Q6 | 11175 (+) | 0.990 | 0.974 | atcTTTTTgg I$CAD_Q6 | 11176 (+) | 0.926 | 0.936 | tctTTTTGgc I$CAD_Q6 | 11501 (+) | 0.990 | 0.970 | gaaTTTTTgt I$CAD_Q6 | 11503 (+) | 0.990 | 0.979 | attTTTGTgg I$CAD_Q6 | 11645 (+) | 0.983 | 0.966 | gtaTTTCTgt I$CAD_Q6 | 11677 (-) | 0.990 | 0.989 | ccAAAAAaga I$CF1A_Q6 | 645 (-) | 1.000 | 0.939 | ttattcatttATTATa I$CF1A_Q6 | 1891 (-) | 0.877 | 0.882 | ttttatatttATTTTa I$CF1A_Q6 | 3820 (-) | 0.911 | 0.947 | ttttcaattcATAATa I$CF1A_Q6 | 3829 (+) | 1.000 | 0.883 | cATAATaaatttccat I$CF1A_Q6 | 4054 (-) | 1.000 | 0.887 | ataggcaataATTAAa I$CF1A_Q6 | 5406 (-) | 1.000 | 0.935 | ttttcatcttATTAAa I$CF1A_Q6 | 5588 (+) | 1.000 | 0.875 | aATAATaaaatataat I$CF1A_Q6 | 5784 (-) | 1.000 | 0.944 | atatgaaataATTAAa I$CF1A_Q6 | 5790 (+) | 1.000 | 0.902 | aATAATtaaatacaaa I$CF1A_Q6 | 7636 (+) | 0.947 | 0.945 | tAGAATtattttaaaa I$CF1A_Q6 | 10135 (-) | 1.000 | 0.875 | ttaagaaattATTATt I$CF1A_Q6 | 11064 (+) | 0.881 | 0.878 | tGTGATcagttacaag 
V$LRF_Q2 | 968 (+) | 0.981 | 0.972 | caggACCCC V$LRF_Q2 | 1184 (+) | 1.000 | 0.975 | ggcaCCCCC V$LRF_Q2 | 1612 (-) | 1.000 | 0.977 | GGGGGtctc V$LRF_Q2 | 1613 (-) | 0.981 | 0.972 | GGGGTctcg V$LRF_Q2 | 1840 (-) | 0.981 | 0.969 | GGGGTcgtc V$LRF_Q2 | 7197 (-) | 1.000 | 0.975 | GGGGGtgct V$LRF_Q2 | 8485 (-) | 1.000 | 0.969 | GGGGGttat V$LRF_Q2 | 8805 (-) | 1.000 | 0.987 | GGGGGccag V$LRF_Q2 | 9338 (+) | 0.981 | 0.977 | gaggACCCC V$LRF_Q2 | 10741 (-) | 1.000 | 0.972 | GGGGGtttc I$OVO_Q6 | 69 (-) | 1.000 | 0.928 | taCTGTTg I$OVO_Q6 | 127 (+) | 0.987 | 0.967 | tAACTGct I$OVO_Q6 | 591 (+) | 1.000 | 0.966 | tAACAGac I$OVO_Q6 | 792 (+) | 0.975 | 0.956 | tAACCGct I$OVO_Q6 | 984 (-) | 0.987 | 0.978 | caCAGTTa I$OVO_Q6 | 1143 (-) | 0.994 | 0.989 | aaCCGTTa I$OVO_Q6 | 2545 (+) | 1.000 | 0.978 | tAACAGgt I$OVO_Q6 | 3400 (-) | 0.987 | 0.973 | tcCAGTTa I$OVO_Q6 | 3445 (+) | 1.000 | 0.994 | tAACAGtt I$OVO_Q6 | 3446 (-) | 0.987 | 0.983 | aaCAGTTa I$OVO_Q6 | 3510 (-) | 1.000 | 0.984 | tgCTGTTa I$OVO_Q6 | 4214 (-) | 0.987 | 0.966 | ctCAGTTa I$OVO_Q6 | 4572 (-) | 0.975 | 0.961 | tgCGGTTa I$OVO_Q6 | 6571 (-) | 0.987 | 0.978 | caCAGTTa I$OVO_Q6 | 8673 (-) | 0.975 | 0.956 | acCGGTTa I$OVO_Q6 | 9433 (+) | 1.000 | 0.984 | tAACAGga I$OVO_Q6 | 10310 (+) | 0.987 | 0.983 | tAACTGtt I$OVO_Q6 | 11068 (-) | 0.987 | 0.971 | atCAGTTa I$OVO_Q6 | 11237 (+) | 0.987 | 0.973 | tAACTGga I$OVO_Q6 | 11517 (-) | 0.994 | 0.922 | taCCGTTg I$OVO_Q6 | 11654 (-) | 1.000 | 1.000 | taCTGTTa I$OVO_Q6 | 11764 (+) | 1.000 | 0.984 | tAACAGga I$SD_Q6 | 23 (-) | 1.000 | 0.932 | gtGAATG I$SD_Q6 | 51 (+) | 0.934 | 0.939 | CATTAct I$SD_Q6 | 156 (+) | 0.930 | 0.936 | CATCTct I$SD_Q6 | 179 (+) | 0.934 | 0.939 | CATTAct I$SD_Q6 | 331 (-) | 0.940 | 0.941 | ggAAATT I$SD_Q6 | 334 (+) | 0.940 | 0.941 | AATTCcc I$SD_Q6 | 382 (-) | 0.930 | 0.936 | agAGATG I$SD_Q6 | 390 (+) | 1.000 | 0.936 | CATTCat I$SD_Q6 | 409 (-) | 1.000 | 0.996 | cgGAATG I$SD_Q6 | 490 (+) | 0.926 | 0.931 | CATTGca I$SD_Q6 | 526 (-) | 1.000 | 0.929 | aaAAATG 
I$SD_Q6 | 642 (+) | 1.000 | 0.936 | CATTTat I$SD_Q6 | 650 (+) | 1.000 | 0.936 | CATTTat I$SD_Q6 | 749 (-) | 0.940 | 0.941 | ggGAATT I$SD_Q6 | 804 (+) | 1.000 | 1.000 | CATTTct I$SD_Q6 | 871 (-) | 1.000 | 1.000 | agAAATG I$SD_Q6 | 886 (-) | 1.000 | 0.996 | cgAAATG I$SD_Q6 | 1177 (-) | 0.930 | 0.932 | ggAGATG I$SD_Q6 | 1211 (+) | 0.926 | 0.932 | CATTGct I$SD_Q6 | 1279 (-) | 0.930 | 0.935 | tgGGATG I$SD_Q6 | 1574 (-) | 1.000 | 0.996 | ggGAATG I$SD_Q6 | 1795 (+) | 1.000 | 0.932 | CATTCac I$SD_Q6 | 1889 (+) | 1.000 | 0.928 | CATTTta I$SD_Q6 | 2010 (+) | 0.930 | 0.932 | CATCTcc I$SD_Q6 | 2561 (-) | 0.940 | 0.944 | tgAAATT I$SD_Q6 | 2564 (+) | 0.940 | 0.941 | AATTTcc I$SD_Q6 | 2653 (-) | 0.934 | 0.935 | cgTAATG I$SD_Q6 | 2848 (+) | 1.000 | 0.996 | CATTCcg I$SD_Q6 | 2911 (+) | 0.930 | 0.936 | CATCTct I$SD_Q6 | 3052 (+) | 0.923 | 0.929 | CATACct I$SD_Q6 | 3110 (+) | 0.940 | 0.941 | AATTTcc I$SD_Q6 | 3136 (+) | 1.000 | 0.996 | CATTCcc I$SD_Q6 | 3163 (+) | 1.000 | 0.996 | CATTCcg I$SD_Q6 | 3242 (+) | 1.000 | 0.996 | CATTCcc I$SD_Q6 | 3286 (-) | 0.930 | 0.935 | tgGGATG I$SD_Q6 | 3300 (+) | 0.930 | 0.932 | CATCTcc I$SD_Q6 | 3337 (+) | 0.932 | 0.934 | TATTTcc I$SD_Q6 | 3701 (-) | 0.930 | 0.936 | agAGATG I$SD_Q6 | 3740 (+) | 0.932 | 0.938 | TATTTct I$SD_Q6 | 3748 (+) | 1.000 | 0.929 | CATTTtt I$SD_Q6 | 3758 (+) | 1.000 | 0.929 | CATTCtt I$SD_Q6 | 3836 (+) | 0.940 | 0.941 | AATTTcc I$SD_Q6 | 3865 (+) | 1.000 | 0.935 | CATTTaa I$SD_Q6 | 3948 (+) | 1.000 | 0.932 | CATTCag I$SD_Q6 | 3989 (+) | 0.930 | 0.935 | CATCTca I$SD_Q6 | 4028 (-) | 0.940 | 0.944 | tgGAATT I$SD_Q6 | 4037 (+) | 0.932 | 0.937 | TATTCca I$SD_Q6 | 4050 (-) | 0.932 | 0.934 | ggAAATA I$SD_Q6 | 4075 (-) | 1.000 | 0.928 | taAAATG I$SD_Q6 | 4458 (+) | 1.000 | 0.999 | CATTTca I$SD_Q6 | 4479 (+) | 1.000 | 0.929 | CATTTtt I$SD_Q6 | 4889 (+) | 1.000 | 0.999 | CATTCca I$SD_Q6 | 5031 (-) | 0.930 | 0.935 | tgGGATG I$SD_Q6 | 5069 (-) | 0.930 | 0.936 | agAGATG I$SD_Q6 | 5348 (+) | 0.930 | 0.932 | CATCCcc I$SD_Q6 | 5472 (+) | 
0.932 | 0.934 | TATTTcc I$SD_Q6 | 5787 (-) | 0.932 | 0.937 | tgAAATA I$SD_Q6 | 5844 (+) | 0.930 | 0.932 | CATCTcc I$SD_Q6 | 5876 (+) | 0.930 | 0.932 | CATCTcc I$SD_Q6 | 5995 (-) | 0.930 | 0.936 | agGGATG I$SD_Q6 | 6194 (+) | 1.000 | 0.935 | CATTTaa I$SD_Q6 | 6460 (+) | 0.932 | 0.934 | TATTTcc I$SD_Q6 | 6470 (-) | 1.000 | 0.928 | taAAATG I$SD_Q6 | 6811 (+) | 0.934 | 0.935 | CATTAcc I$SD_Q6 | 6920 (-) | 0.940 | 0.945 | agGAATT I$SD_Q6 | 6996 (-) | 0.930 | 0.932 | ggAGATG I$SD_Q6 | 7122 (-) | 0.926 | 0.931 | tgCAATG I$SD_Q6 | 7317 (-) | 1.000 | 0.928 | taGAATG I$SD_Q6 | 7344 (+) | 1.000 | 0.999 | CATTTca I$SD_Q6 | 7614 (+) | 0.930 | 0.936 | CATCTct I$SD_Q6 | 7648 (-) | 1.000 | 0.929 | aaAAATG I$SD_Q6 | 8576 (-) | 1.000 | 0.935 | ttAAATG I$SD_Q6 | 8610 (+) | 1.000 | 0.996 | CATTCcg I$SD_Q6 | 8651 (+) | 0.940 | 0.941 | AATTCcc I$SD_Q6 | 8913 (-) | 0.932 | 0.938 | agGAATA I$SD_Q6 | 8922 (-) | 0.940 | 0.941 | cgGAATT I$SD_Q6 | 8994 (+) | 0.940 | 0.944 | AATTCca I$SD_Q6 | 9395 (-) | 1.000 | 0.999 | tgGAATG I$SD_Q6 | 9796 (-) | 1.000 | 0.932 | gtAAATG I$SD_Q6 | 9869 (-) | 1.000 | 0.929 | aaGAATG I$SD_Q6 | 10117 (+) | 0.930 | 0.932 | CATCCcc I$SD_Q6 | 10138 (-) | 0.940 | 0.945 | agAAATT I$SD_Q6 | 10303 (+) | 1.000 | 0.929 | CATTTtt I$SD_Q6 | 10395 (-) | 1.000 | 0.929 | aaAAATG I$SD_Q6 | 10416 (+) | 1.000 | 0.929 | CATTTtt I$SD_Q6 | 10468 (+) | 0.930 | 0.935 | CATCTca I$SD_Q6 | 10489 (+) | 0.923 | 0.928 | CATATca I$SD_Q6 | 10588 (+) | 0.940 | 0.941 | AATTTcc I$SD_Q6 | 10714 (+) | 0.930 | 0.936 | CATCTct I$SD_Q6 | 10792 (-) | 0.930 | 0.932 | ggGGATG I$SD_Q6 | 10862 (-) | 0.930 | 0.936 | agAGATG I$SD_Q6 | 11087 (-) | 0.940 | 0.941 | ggGAATT I$SD_Q6 | 11274 (-) | 0.940 | 0.941 | ggAAATT I$SD_Q6 | 11295 (-) | 0.932 | 0.938 | agAAATA I$SD_Q6 | 11340 (-) | 1.000 | 0.932 | gtAAATG I$SD_Q6 | 11407 (+) | 1.000 | 0.936 | CATTTat I$SD_Q6 | 11418 (+) | 1.000 | 1.000 | CATTTct I$SD_Q6 | 11464 (-) | 0.940 | 0.944 | tgAAATT I$SD_Q6 | 11489 (-) | 1.000 | 0.999 | tgGAATG I$SD_Q6 | 11575 (+) | 
0.926 | 0.932 | CATTGct I$SD_Q6 | 11639 (-) | 1.000 | 0.935 | ttAAATG I$SD_Q6 | 11646 (+) | 0.932 | 0.938 | TATTTct I$SD_Q6 | 11738 (+) | 0.930 | 0.935 | CATCTca I$SD_Q6 | 11943 (+) | 0.932 | 0.937 | TATTTca I$TWI_Q6 | 711 (-) | 0.951 | 0.886 | tctctttACAGAtg I$TWI_Q6 | 1130 (+) | 1.000 | 0.874 | cgTGTGTtcctaaa I$TWI_Q6 | 2577 (-) | 0.941 | 0.888 | atttacaACAAAtg I$TWI_Q6 | 4522 (-) | 0.941 | 0.879 | caagacaACAAAtg I$TWI_Q6 | 5121 (-) | 0.951 | 0.879 | aaaggaaACAGAtg I$TWI_Q6 | 5359 (-) | 1.000 | 0.917 | tgacaaaACACAcg I$TWI_Q6 | 6093 (+) | 1.000 | 0.885 | ccTGTGTggatcag I$TWI_Q6 | 6395 (+) | 1.000 | 0.900 | ccTGTGTtcagaac I$TWI_Q6 | 7116 (+) | 1.000 | 0.904 | caTGTGTgcaatgc I$TWI_Q6 | 7490 (+) | 0.951 | 0.882 | cgTCTGTtcttctt I$TWI_Q6 | 7584 (-) | 0.941 | 0.894 | aggtcaaACAAAtg I$TWI_Q6 | 8971 (+) | 0.917 | 0.894 | caGGTGTggtggta I$TWI_Q6 | 9145 (-) | 0.980 | 0.910 | ataagacACATAtg I$TWI_Q6 | 9153 (+) | 0.980 | 0.898 | caTATGTgagttcc I$TWI_Q6 | 9802 (-) | 0.980 | 0.879 | gtacatcACATAag I$TWI_Q6 | 10757 (+) | 1.000 | 0.890 | cgTGTGTttgggca I$TWI_Q6 | 11533 (-) | 0.980 | 0.880 | aaatcccACATAtg V$MOVOB_01 | 103 (+) | 1.000 | 0.959 | ctGGGGG V$MOVOB_01 | 194 (-) | 0.906 | 0.910 | CTCCCac V$MOVOB_01 | 196 (-) | 0.938 | 0.941 | CCCACac V$MOVOB_01 | 1001 (+) | 0.938 | 0.900 | atGTGGG V$MOVOB_01 | 1188 (-) | 1.000 | 0.959 | CCCCCag V$MOVOB_01 | 1404 (+) | 0.938 | 0.939 | gaGTGGG V$MOVOB_01 | 1488 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 1492 (+) | 1.000 | 0.998 | gaGGGGG V$MOVOB_01 | 1493 (+) | 1.000 | 0.958 | agGGGGG V$MOVOB_01 | 1494 (+) | 1.000 | 0.998 | ggGGGGG V$MOVOB_01 | 1495 (+) | 1.000 | 0.998 | ggGGGGG V$MOVOB_01 | 1496 (+) | 1.000 | 0.998 | ggGGGGG V$MOVOB_01 | 1497 (+) | 1.000 | 0.998 | ggGGGGG V$MOVOB_01 | 1499 (+) | 0.906 | 0.908 | ggGGGAG V$MOVOB_01 | 1503 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 1507 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 1511 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 1515 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 1610 (+) | 
1.000 | 0.957 | caGGGGG V$MOVOB_01 | 3140 (-) | 0.938 | 0.903 | CCCACaa V$MOVOB_01 | 3184 (-) | 1.000 | 0.958 | CCCCCct V$MOVOB_01 | 3185 (-) | 1.000 | 0.998 | CCCCCtc V$MOVOB_01 | 3782 (-) | 1.000 | 0.958 | CCCCCct V$MOVOB_01 | 3783 (-) | 1.000 | 0.957 | CCCCCtt V$MOVOB_01 | 4045 (+) | 1.000 | 0.957 | caGGGGG V$MOVOB_01 | 4861 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 5463 (-) | 0.938 | 0.900 | CCCACat V$MOVOB_01 | 5690 (-) | 0.938 | 0.901 | CCCACta V$MOVOB_01 | 5715 (-) | 0.938 | 0.903 | CCCACaa V$MOVOB_01 | 6625 (+) | 0.938 | 0.939 | gaGTGGG V$MOVOB_01 | 6751 (-) | 0.938 | 0.901 | CCCACta V$MOVOB_01 | 7195 (+) | 1.000 | 0.960 | acGGGGG V$MOVOB_01 | 7446 (+) | 0.938 | 0.941 | gtGTGGG V$MOVOB_01 | 7448 (+) | 0.906 | 0.910 | gtGGGAG V$MOVOB_01 | 8116 (-) | 0.906 | 0.910 | CTCCCac V$MOVOB_01 | 8118 (-) | 0.938 | 0.939 | CCCACcc V$MOVOB_01 | 8232 (-) | 0.906 | 0.910 | CTCCCac V$MOVOB_01 | 8234 (-) | 0.938 | 0.941 | CCCACac V$MOVOB_01 | 8483 (+) | 1.000 | 0.957 | aaGGGGG V$MOVOB_01 | 8750 (-) | 0.938 | 0.941 | CCCACac V$MOVOB_01 | 8803 (+) | 1.000 | 0.962 | ttGGGGG V$MOVOB_01 | 9311 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 9618 (+) | 0.906 | 0.910 | gtGGGAG V$MOVOB_01 | 9683 (+) | 1.000 | 0.998 | gaGGGGG V$MOVOB_01 | 9684 (+) | 1.000 | 0.958 | agGGGGG V$MOVOB_01 | 9685 (+) | 1.000 | 0.998 | ggGGGGG V$MOVOB_01 | 10045 (+) | 0.938 | 0.941 | gtGTGGG V$MOVOB_01 | 10611 (+) | 0.938 | 0.941 | gtGTGGG V$MOVOB_01 | 10613 (+) | 0.906 | 0.910 | gtGGGAG V$MOVOB_01 | 10617 (+) | 0.906 | 0.908 | gaGGGAG V$MOVOB_01 | 10738 (+) | 1.000 | 0.962 | ttGGGGG V$MOVOB_01 | 10739 (+) | 1.000 | 0.960 | tgGGGGG V$MOVOB_01 | 10789 (+) | 1.000 | 0.957 | caGGGGG V$MOVOB_01 | 10877 (+) | 0.938 | 0.901 | tgGTGGG V$MOVOB_01 | 10879 (+) | 0.906 | 0.910 | gtGGGAG V$MOVOB_01 | 11537 (-) | 0.938 | 0.900 | CCCACat V$MOVOB_01 | 11841 (+) | 0.938 | 0.939 | gaGTGGG V$MOVOB_01 | 11843 (+) | 1.000 | 1.000 | gtGGGGG V$MOVOB_01 | 11844 (+) | 1.000 | 0.960 | tgGGGGG V$MOVOB_01 | 11848 (+) | 0.938 | 0.939 | 
ggGTGGG V$MOVOB_01 | 11857 (+) | 1.000 | 0.959 | atGGGGG V$MOVOB_01 | 11869 (+) | 1.000 | 0.960 | ccGGGGG V$MOVOB_01 | 11871 (+) | 0.906 | 0.908 | ggGGGAG V$RUSH1A_02 | 11 (-) | 0.993 | 0.972 | gtACAAGtac V$RUSH1A_02 | 227 (+) | 0.997 | 0.994 | ttcCTTTTtg V$RUSH1A_02 | 271 (+) | 0.993 | 0.992 | gtcCTTGTgc V$RUSH1A_02 | 401 (-) | 0.993 | 0.986 | caACAAGgcg V$RUSH1A_02 | 555 (+) | 0.997 | 0.997 | aacCTTTTct V$RUSH1A_02 | 599 (+) | 0.997 | 0.974 | ttaCTTTTta V$RUSH1A_02 | 627 (+) | 0.997 | 0.973 | acaCTTTTtt V$RUSH1A_02 | 660 (-) | 1.000 | 0.978 | atATAAGtac V$RUSH1A_02 | 819 (+) | 0.997 | 0.972 | acaCTTTTta V$RUSH1A_02 | 839 (-) | 0.993 | 0.973 | aaACAAGtta V$RUSH1A_02 | 897 (+) | 0.997 | 0.970 | agaCTTTTca V$RUSH1A_02 | 937 (-) | 0.993 | 0.973 | ccACAAGtta V$RUSH1A_02 | 1322 (+) | 0.989 | 0.987 | tccCTTCTca V$RUSH1A_02 | 1716 (+) | 0.997 | 0.990 | tgcCTTTTcc V$RUSH1A_02 | 1867 (-) | 0.989 | 0.986 | gaAGAAGgga V$RUSH1A_02 | 1959 (+) | 0.997 | 0.994 | ttcCTTTTtt V$RUSH1A_02 | 2168 (+) | 0.989 | 0.988 | ttcCTTCTtt V$RUSH1A_02 | 2176 (+) | 0.989 | 0.988 | ttcCTTCTtt V$RUSH1A_02 | 2356 (+) | 0.993 | 0.974 | aaaCTTGTat V$RUSH1A_02 | 2507 (-) | 0.997 | 0.993 | aaAAAAGgga V$RUSH1A_02 | 2531 (+) | 0.997 | 0.975 | ttaCTTTTag V$RUSH1A_02 | 2743 (+) | 0.997 | 0.990 | tgcCTTTTtc V$RUSH1A_02 | 3149 (-) | 0.971 | 0.972 | atATATGgag V$RUSH1A_02 | 3221 (+) | 0.997 | 0.973 | gcaCTTTTac V$RUSH1A_02 | 3252 (+) | 1.000 | 0.976 | ccaCTTATgg V$RUSH1A_02 | 3321 (+) | 0.997 | 0.990 | ggcCTTTTcc V$RUSH1A_02 | 3379 (-) | 0.971 | 0.970 | tcATATGggg V$RUSH1A_02 | 3407 (-) | 0.997 | 0.976 | atAAAAGtta V$RUSH1A_02 | 3763 (-) | 0.997 | 0.991 | ttAAAAGgca V$RUSH1A_02 | 3784 (+) | 0.997 | 0.992 | cccCTTTTtc V$RUSH1A_02 | 3862 (+) | 0.967 | 0.970 | tacCATTTaa V$RUSH1A_02 | 3939 (+) | 0.993 | 0.969 | tcaCTTGTtc V$RUSH1A_02 | 4187 (-) | 0.989 | 0.984 | gcAGAAGgct V$RUSH1A_02 | 4362 (-) | 0.989 | 0.970 | tcAGAAGttt V$RUSH1A_02 | 4416 (+) | 0.993 | 0.994 | aacCTTGTct V$RUSH1A_02 | 4650 (+) | 0.989 | 
0.990 | cacCTTCTct V$RUSH1A_02 | 5001 (+) | 0.993 | 0.994 | gacCTTGTat V$RUSH1A_02 | 5118 (-) | 0.997 | 0.995 | tcAAAAGgaa V$RUSH1A_02 | 5158 (-) | 0.997 | 0.993 | cgAAAAGgga V$RUSH1A_02 | 5171 (+) | 0.993 | 0.970 | tcaCTTGTga V$RUSH1A_02 | 5454 (+) | 0.997 | 0.997 | aacCTTTTcc V$RUSH1A_02 | 5768 (-) | 0.997 | 0.995 | gtAAAAGgag V$RUSH1A_02 | 5836 (-) | 0.993 | 0.987 | aaACAAGgca V$RUSH1A_02 | 5941 (-) | 0.997 | 0.975 | agAAAAGtaa V$RUSH1A_02 | 5982 (+) | 0.989 | 0.969 | taaCTTCTgg V$RUSH1A_02 | 6272 (-) | 0.989 | 0.987 | gaAGAAGgaa V$RUSH1A_02 | 6455 (+) | 1.000 | 0.979 | aaaCTTATtt V$RUSH1A_02 | 6594 (-) | 1.000 | 0.999 | aaATAAGgtg V$RUSH1A_02 | 6636 (-) | 0.989 | 0.984 | ccAGAAGgca V$RUSH1A_02 | 6814 (+) | 0.989 | 0.990 | tacCTTCTcc V$RUSH1A_02 | 6821 (+) | 0.997 | 0.993 | tccCTTTTtt V$RUSH1A_02 | 6904 (-) | 1.000 | 0.976 | gcATAAGtgg V$RUSH1A_02 | 6915 (-) | 0.993 | 0.991 | gaACAAGgaa V$RUSH1A_02 | 7002 (-) | 0.997 | 0.990 | ggAAAAGgca V$RUSH1A_02 | 7216 (-) | 0.997 | 0.974 | caAAAAGtag V$RUSH1A_02 | 7251 (-) | 0.997 | 0.970 | gaAAAAGtct V$RUSH1A_02 | 7413 (+) | 0.997 | 0.976 | gaaCTTTTaa V$RUSH1A_02 | 7713 (-) | 0.989 | 0.988 | gcAGAAGgat V$RUSH1A_02 | 8182 (+) | 0.993 | 0.973 | gaaCTTGTtt V$RUSH1A_02 | 8479 (-) | 1.000 | 0.996 | aaATAAGggg V$RUSH1A_02 | 8754 (+) | 0.993 | 0.993 | cacCTTGTcc V$RUSH1A_02 | 8944 (+) | 0.997 | 0.975 | caaCTTTTta V$RUSH1A_02 | 9662 (+) | 0.993 | 0.993 | gacCTTGTct V$RUSH1A_02 | 9809 (-) | 1.000 | 0.978 | acATAAGtaa V$RUSH1A_02 | 9828 (-) | 0.989 | 0.987 | ttAGAAGgga V$RUSH1A_02 | 10169 (+) | 1.000 | 0.997 | gtcCTTATta V$RUSH1A_02 | 10387 (+) | 0.971 | 0.972 | atcCATATaa V$RUSH1A_02 | 10500 (-) | 0.997 | 0.970 | tcAAAAGtca V$RUSH1A_02 | 10600 (-) | 0.997 | 0.994 | caAAAAGgag V$RUSH1A_02 | 10801 (+) | 0.993 | 0.987 | agcCTTGTgc V$RUSH1A_02 | 10826 (+) | 0.997 | 0.990 | cgcCTTTTca V$RUSH1A_02 | 11186 (+) | 0.993 | 0.987 | tgcCTTGTtt V$RUSH1A_02 | 11268 (-) | 0.997 | 0.995 | agAAAAGgaa V$RUSH1A_02 | 11386 (+) | 1.000 | 0.979 | 
aaaCTTATta V$RUSH1A_02 | 11439 (+) | 1.000 | 0.978 | ataCTTATct V$RUSH1A_02 | 11540 (-) | 0.971 | 0.973 | acATATGgta V$RUSH1A_02 | 11623 (-) | 0.989 | 0.988 | ccAGAAGgac V$RUSH1A_02 | 11723 (+) | 0.967 | 0.969 | cacCATTTtc V$HOXA7_01 | 3316 (-) | 1.000 | 1.000 | aGATTGg V$HOXA7_01 | 9454 (+) | 1.000 | 1.000 | cCAATCt V$HOXA7_01 | 11802 (-) | 1.000 | 1.000 | aGATTGg V$SZF11_01 | 1336 (+) | 1.000 | 0.822 | cCAGGGcttaaggct V$SZF11_01 | 5763 (+) | 0.878 | 0.813 | gTAGGGtaaaaggag V$SZF11_01 | 7504 (+) | 0.896 | 0.811 | gGAGGGtgtcagatg V$SZF11_01 | 8276 (-) | 1.000 | 0.823 | atgcagctgCCCTGg V$RBPJK_Q4 | 3138 (+) | 1.000 | 0.988 | TTCCCaca V$RBPJK_01 | 3137 (-) | 1.000 | 0.934 | atTCCCAcaac V$RBPJK_01 | 6218 (-) | 0.914 | 0.941 | ttTCTCAcgct V$CACD_01 | 105 (-) | 0.960 | 0.973 | GGGGGtgg V$CACD_01 | 197 (+) | 0.768 | 0.843 | ccaCACCA V$CACD_01 | 535 (-) | 1.000 | 0.844 | GGGTGatg V$CACD_01 | 1185 (+) | 0.960 | 0.833 | gcaCCCCC V$CACD_01 | 1381 (-) | 0.983 | 0.925 | GGGCGagg V$CACD_01 | 1490 (-) | 0.948 | 0.917 | GGGAGggg V$CACD_01 | 1494 (-) | 0.960 | 0.925 | GGGGGggg V$CACD_01 | 1495 (-) | 0.960 | 0.925 | GGGGGggg V$CACD_01 | 1496 (-) | 0.960 | 0.925 | GGGGGggg V$CACD_01 | 1499 (-) | 0.960 | 0.909 | GGGGGagg V$CACD_01 | 1598 (-) | 0.766 | 0.841 | GGGTTtgg V$CACD_01 | 1813 (-) | 1.000 | 0.860 | GGGTGgtg V$CACD_01 | 2431 (-) | 0.848 | 0.849 | GGTTGggg V$CACD_01 | 3040 (+) | 1.000 | 0.835 | ctcCACCC V$CACD_01 | 3288 (-) | 0.859 | 0.841 | GGATGagg V$CACD_01 | 4142 (-) | 0.784 | 0.853 | AGGTGtgg V$CACD_01 | 4239 (-) | 0.870 | 0.864 | GGCTGggg V$CACD_01 | 4326 (-) | 0.751 | 0.831 | GGGTCtgg V$CACD_01 | 4343 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 4412 (+) | 0.848 | 0.897 | ccaCAACC V$CACD_01 | 4748 (+) | 0.948 | 0.901 | cctCTCCC V$CACD_01 | 5821 (-) | 0.751 | 0.831 | TGGCGtgg V$CACD_01 | 5997 (-) | 0.859 | 0.857 | GGATGggg V$CACD_01 | 6481 (-) | 0.859 | 0.841 | GGATGagg V$CACD_01 | 6547 (-) | 0.870 | 0.848 | GGCTGagg V$CACD_01 | 6716 (+) | 0.848 | 0.897 | ccaCAACC V$CACD_01 
| 7197 (-) | 0.960 | 0.833 | GGGGGtgc V$CACD_01 | 7450 (-) | 0.948 | 0.844 | GGGAGtgt V$CACD_01 | 8235 (+) | 1.000 | 1.000 | ccaCACCC V$CACD_01 | 8446 (+) | 0.859 | 0.851 | ccgCATCC V$CACD_01 | 8739 (-) | 1.000 | 1.000 | GGGTGtgg V$CACD_01 | 8751 (+) | 0.784 | 0.853 | ccaCACCT V$CACD_01 | 8972 (-) | 0.784 | 0.853 | AGGTGtgg V$CACD_01 | 9572 (-) | 1.000 | 1.000 | GGGTGtgg V$CACD_01 | 9686 (-) | 0.960 | 0.833 | GGGGGgtg V$CACD_01 | 9687 (-) | 0.960 | 0.973 | GGGGGtgg V$CACD_01 | 9689 (-) | 1.000 | 0.860 | GGGTGgtg V$CACD_01 | 9690 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 9693 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 9696 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 9699 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 9702 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 9705 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 10619 (-) | 0.948 | 0.873 | GGGAGttg V$CACD_01 | 10694 (+) | 1.000 | 0.908 | caaCACCC V$CACD_01 | 10875 (-) | 0.808 | 0.870 | GGTGGtgg V$CACD_01 | 11821 (-) | 0.848 | 0.849 | GGTTGggg V$CACD_01 | 11845 (-) | 0.960 | 0.833 | GGGGGgtg V$CACD_01 | 11846 (-) | 0.960 | 0.973 | GGGGGtgg V$CACD_01 | 11848 (-) | 1.000 | 0.952 | GGGTGggg V$CACD_01 | 11853 (-) | 0.751 | 0.831 | GGGTAtgg V$CACD_01 | 11859 (-) | 0.960 | 0.855 | GGGGGtag V$CACD_01 | 11871 (-) | 0.960 | 0.909 | GGGGGagg V$CACD_01 | 11923 (+) | 0.983 | 0.896 | caaCGCCC V$OTX_Q1 | 1648 (-) | 1.000 | 0.943 | tgcTAATC V$OTX_Q1 | 1995 (+) | 1.000 | 0.993 | GATTAtct V$OTX_Q1 | 2724 (-) | 1.000 | 0.993 | agaTAATC V$OTX_Q1 | 3429 (+) | 0.905 | 0.914 | GCTTAatt V$OTX_Q1 | 4160 (-) | 1.000 | 0.946 | ttgTAATC V$OTX_Q1 | 5239 (+) | 1.000 | 0.997 | GATTAcat V$OTX_Q1 | 5416 (-) | 0.905 | 0.912 | attAAATC V$OTX_Q1 | 6195 (-) | 0.905 | 0.912 | attTAAGC V$OTX_Q1 | 7354 (-) | 1.000 | 0.944 | tgtTAATC V$OTX_Q1 | 8574 (+) | 1.000 | 0.998 | GATTAaat V$OTX_Q1 | 11003 (+) | 1.000 | 0.993 | GATTAcct V$OTX_Q1 | 11203 (+) | 1.000 | 0.948 | GATTAtta V$OTX_Q1 | 11694 (-) | 1.000 | 0.995 | agtTAATC V$WT1_Q6 | 1488 (-) | 1.000 | 0.991 | 
gagGGAGGg V$WT1_Q6 | 1491 (-) | 0.998 | 0.994 | ggaGGGGGg V$WT1_Q6 | 1492 (-) | 0.998 | 0.990 | gagGGGGGg V$WT1_Q6 | 1494 (-) | 0.998 | 0.999 | gggGGGGGg V$WT1_Q6 | 1495 (-) | 0.998 | 0.999 | gggGGGGGg V$WT1_Q6 | 1499 (-) | 1.000 | 1.000 | gggGGAGGg V$WT1_Q6 | 1503 (-) | 1.000 | 0.991 | gagGGAGGg V$WT1_Q6 | 1507 (-) | 1.000 | 0.991 | gagGGAGGg V$WT1_Q6 | 1511 (-) | 1.000 | 0.991 | gagGGAGGg V$WT1_Q6 | 7448 (-) | 0.986 | 0.981 | gtgGGAGTg V$WT1_Q6 | 9682 (-) | 0.998 | 0.994 | ggaGGGGGg V$WT1_Q6 | 9683 (-) | 0.998 | 0.990 | gagGGGGGg V$WT1_Q6 | 9685 (-) | 0.985 | 0.987 | gggGGGGTg V$WT1_Q6 | 10613 (-) | 1.000 | 0.993 | gtgGGAGGg V$WT1_Q6 | 10875 (-) | 0.992 | 0.980 | ggtGGTGGg V$WT1_Q6 | 11718 (+) | 1.000 | 0.986 | cCCTCCacc V$WT1_Q6 | 11846 (-) | 0.992 | 0.993 | gggGGTGGg V$WT1_Q6 | 11871 (-) | 1.000 | 1.000 | gggGGAGGg V$KAISO_01 | 4685 (-) | 1.000 | 0.991 | gtgGCAGGaa V$KAISO_01 | 5188 (-) | 1.000 | 0.998 | ctaGCAGGat V$KAISO_01 | 10841 (-) | 1.000 | 0.997 | atcGCAGGaa V$ZNF219_01 | 1490 (-) | 1.000 | 0.989 | gggagGGGGGgg V$ZNF219_01 | 1494 (-) | 1.000 | 0.995 | gggggGGGGGag V$NANOG_01 | 1408 (+) | 1.000 | 0.983 | gggtCCATTgcc V$NANOG_01 | 3769 (-) | 0.937 | 0.941 | ggcAATGCtcct V$NANOG_01 | 11925 (+) | 1.000 | 0.941 | acgcCCATTggc V$OCT4_01 | 9051 (-) | 0.955 | 0.837 | gtcTACATggcaagt V$OCT4_01 | 9489 (-) | 0.794 | 0.886 | attTGCAAtagaaaa V$OCT4_01 | 10345 (+) | 0.793 | 0.833 | tgttttaATTCAaag V$OCT4_01 | 10805 (-) | 1.000 | 0.882 | ttgTGCATctaaatc V$OCT4_01 | 11413 (-) | 1.000 | 0.830 | ttcTGCATttctatt V$OCT4_01 | 11667 (-) | 0.933 | 0.865 | attTTCATaaccaaa P$BPC1_Q2 | 369 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 907 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 1172 (+) | 1.000 | 1.000 | AGAAAg P$BPC1_Q2 | 2132 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2198 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2202 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2210 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2214 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2218 (-) | 1.000 | 1.000 | cTTTCT 
P$BPC1_Q2 | 2222 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2226 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2230 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2234 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2238 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2242 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2246 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2250 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2254 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2258 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2262 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2266 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2270 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2274 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 2281 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 3193 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 4953 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 5546 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 6250 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 7368 (+) | 1.000 | 1.000 | AGAAAg P$BPC1_Q2 | 8136 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 8695 (-) | 1.000 | 1.000 | cTTTCT P$BPC1_Q2 | 9657 (+) | 1.000 | 1.000 | AGAAAg P$BPC1_Q2 | 10628 (+) | 1.000 | 1.000 | AGAAAg P$BPC1_Q2 | 11612 (+) | 1.000 | 1.000 | AGAAAg P$BPC1_Q2 | 11755 (+) | 1.000 | 1.000 | AGAAAg P$SED_Q2 | 1173 (-) | 1.000 | 0.984 | gAAAGGagat P$SED_Q2 | 1957 (+) | 1.000 | 0.992 | ctttCCTTTt P$SED_Q2 | 3409 (-) | 0.985 | 0.984 | aAAAGTtaat P$SED_Q2 | 3765 (-) | 1.000 | 0.996 | aAAAGGcaat P$SED_Q2 | 5120 (-) | 1.000 | 0.997 | aAAAGGaaac P$SED_Q2 | 5452 (+) | 1.000 | 0.999 | ataaCCTTTt P$SED_Q2 | 5770 (-) | 1.000 | 0.992 | aAAAGGagag P$SED_Q2 | 7871 (-) | 0.985 | 0.983 | aAAAGCtcac P$SED_Q2 | 7908 (+) | 0.985 | 0.984 | attaGCTTTt P$SED_Q2 | 8013 (+) | 1.000 | 0.985 | gtaaCCTTTa P$SED_Q2 | 8631 (-) | 0.985 | 0.985 | aAAAGCttat P$SED_Q2 | 10502 (-) | 0.985 | 0.983 | aAAAGTcaat P$SED_Q2 | 11270 (-) | 1.000 | 0.997 | aAAAGGaaat P$PBF_Q2 | 229 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 368 (-) | 1.000 | 0.965 | CCTTTc P$PBF_Q2 | 557 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 601 (-) | 0.963 | 
0.964 | ACTTTt P$PBF_Q2 | 629 (-) | 0.963 | 0.964 | ACTTTt P$PBF_Q2 | 821 (-) | 0.963 | 0.964 | ACTTTt P$PBF_Q2 | 899 (-) | 0.963 | 0.964 | ACTTTt P$PBF_Q2 | 1173 (+) | 1.000 | 0.965 | gAAAGG P$PBF_Q2 | 1638 (-) | 1.000 | 0.965 | CCTTTc P$PBF_Q2 | 1718 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 1961 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 2030 (+) | 1.000 | 0.986 | tAAAGG P$PBF_Q2 | 2165 (-) | 1.000 | 0.965 | CCTTTc P$PBF_Q2 | 2209 (-) | 1.000 | 0.965 | CCTTTc P$PBF_Q2 | 2509 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 2533 (-) | 0.963 | 0.964 | ACTTTt P$PBF_Q2 | 2737 (-) | 1.000 | 0.986 | CCTTTa P$PBF_Q2 | 2745 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 2856 (-) | 1.000 | 0.986 | CCTTTa P$PBF_Q2 | 3223 (-) | 0.963 | 0.964 | ACTTTt P$PBF_Q2 | 3323 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 3409 (+) | 0.963 | 0.964 | aAAAGT P$PBF_Q2 | 3765 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 3786 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 5120 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 5160 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 5396 (-) | 1.000 | 0.986 | CCTTTa P$PBF_Q2 | 5456 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 5520 (-) | 1.000 | 0.965 | CCTTTc P$PBF_Q2 | 5580 (+) | 1.000 | 0.986 | tAAAGG P$PBF_Q2 | 5770 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 5943 (+) | 0.963 | 0.964 | aAAAGT P$PBF_Q2 | 6020 (-) | 1.000 | 0.965 | CCTTTc P$PBF_Q2 | 6679 (-) | 0.963 | 0.964 | GCTTTt P$PBF_Q2 | 6711 (+) | 0.963 | 0.964 | aAAAGC P$PBF_Q2 | 6722 (-) | 1.000 | 0.986 | CCTTTa P$PBF_Q2 | 6726 (+) | 1.000 | 0.986 | tAAAGG P$PBF_Q2 | 6760 (-) | 1.000 | 0.986 | CCTTTa P$PBF_Q2 | 6823 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 6973 (+) | 1.000 | 0.965 | gAAAGG P$PBF_Q2 | 7004 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 7218 (+) | 0.963 | 0.964 | aAAAGT P$PBF_Q2 | 7253 (+) | 0.963 | 0.964 | aAAAGT P$PBF_Q2 | 7415 (-) | 0.963 | 0.964 | ACTTTt P$PBF_Q2 | 7420 (+) | 1.000 | 0.986 | tAAAGG P$PBF_Q2 | 7654 (-) | 0.963 | 0.964 | GCTTTt P$PBF_Q2 | 7849 (+) | 0.963 | 0.964 | aAAAGC P$PBF_Q2 | 7871 (+) | 0.963 | 0.964 | aAAAGC 
P$PBF_Q2 | 7912 (-) | 0.963 | 0.964 | GCTTTt P$PBF_Q2 | 8017 (-) | 1.000 | 0.986 | CCTTTa P$PBF_Q2 | 8631 (+) | 0.963 | 0.964 | aAAAGC P$PBF_Q2 | 8694 (-) | 1.000 | 0.965 | CCTTTc P$PBF_Q2 | 8946 (-) | 0.963 | 0.964 | ACTTTt P$PBF_Q2 | 8989 (-) | 1.000 | 0.986 | CCTTTa P$PBF_Q2 | 9105 (+) | 0.963 | 0.964 | aAAAGC P$PBF_Q2 | 10502 (+) | 0.963 | 0.964 | aAAAGT P$PBF_Q2 | 10602 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 10731 (-) | 0.963 | 0.964 | GCTTTt P$PBF_Q2 | 10828 (-) | 1.000 | 1.000 | CCTTTt P$PBF_Q2 | 11270 (+) | 1.000 | 1.000 | aAAAGG P$PBF_Q2 | 11613 (+) | 1.000 | 0.965 | gAAAGG V$SOX10_Q6 | 949 (-) | 0.991 | 0.986 | aACAATg V$SOX10_Q6 | 1039 (-) | 1.000 | 0.995 | aACAAAg V$SOX10_Q6 | 3999 (-) | 0.953 | 0.952 | aACAACg V$SOX10_Q6 | 4664 (+) | 1.000 | 0.995 | cTTTGTt V$SOX10_Q6 | 7752 (+) | 0.991 | 0.992 | cATTGTg V$SOX10_Q6 | 7807 (-) | 0.953 | 0.952 | aACAACg V$SOX10_Q6 | 8558 (-) | 0.991 | 0.986 | aACAATg V$SOX10_Q6 | 8785 (-) | 1.000 | 1.000 | gACAAAg V$SOX10_Q6 | 9027 (+) | 1.000 | 1.000 | cTTTGTg V$SOX10_Q6 | 9727 (+) | 1.000 | 0.998 | cTTTGTa V$SOX10_Q6 | 10461 (-) | 0.991 | 0.986 | aACAATg V$SOX10_Q6 | 10856 (+) | 0.991 | 0.990 | cATTGTa V$SOX10_Q6 | 11052 (+) | 1.000 | 0.995 | cTTTGTt V$SOX10_Q6 | 11061 (+) | 0.953 | 0.958 | cGTTGTg V$SOX10_Q6 | 11520 (+) | 0.953 | 0.958 | cGTTGTc Total sequences length=11973 Total number of found sites=13438 Frequency of sites per nucleotide=1.122359 bug2869.tree100644000766000024 565714006427353 16400 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data((((((((EMBOSS_045:-0.50000,EMBOSS_025:-0.50000):1.79848, ((EMBOSS_027:-0.50000,EMBOSS_047:-0.50000):0.99824,EMBOSS_034:0.49824):0.80024):0.34147, EMBOSS_028:1.63995):0.22536,(((EMBOSS_092:-0.50000,EMBOSS_056:-0.50000):1.34396, EMBOSS_023:0.84396):0.80613,((EMBOSS_013:-0.50000,EMBOSS_082:-0.50000):0.00000, EMBOSS_094:-0.50000):2.15010):0.21521):0.06350,(((((EMBOSS_074:-0.50000, EMBOSS_091:-0.50000):1.31318,EMBOSS_029:0.81318):0.71221, 
((EMBOSS_100:-0.50000,EMBOSS_003:-0.50000):1.23223,EMBOSS_057:0.73223):0.79316):0.12013, ((EMBOSS_039:-0.50000,EMBOSS_070:-0.50000):1.87273,((EMBOSS_066:-0.50000, EMBOSS_033:-0.50000):0.98761,EMBOSS_058:0.48761):0.88512):0.27280):0.23680, (((((EMBOSS_041:-0.50000,EMBOSS_032:-0.50000):0.00000,EMBOSS_044:-0.50000):1.48759, EMBOSS_018:0.98759):0.33203,(EMBOSS_048:-0.50000,EMBOSS_086:-0.50000):1.81961):0.41964, (EMBOSS_012:-0.50000,EMBOSS_073:-0.50000):2.23925):0.14308):0.04648):0.02893, (((((EMBOSS_055:1.46450,EMBOSS_008:1.46450):0.08397,EMBOSS_036:1.54848):0.17955, (EMBOSS_076:-0.50000,EMBOSS_038:-0.50000):2.22802):0.04384, EMBOSS_005:1.77186):0.12747,((((EMBOSS_075:-0.50000,EMBOSS_096:-0.50000):1.25986, EMBOSS_022:0.75986):0.82110,((EMBOSS_019:-0.50000,EMBOSS_031:-0.50000):1.19971, EMBOSS_098:0.69971):0.88125):0.23169,(((EMBOSS_097:-0.50000, EMBOSS_014:-0.50000):1.10683,EMBOSS_042:0.60683):0.57915, EMBOSS_009:1.18599):0.62667):0.08668):0.05840):0.00595,(((((EMBOSS_001:1.42164, EMBOSS_089:1.42164):0.15376,EMBOSS_093:1.57540):0.15734, (((EMBOSS_037:-0.50000,EMBOSS_004:-0.50000):0.00000,EMBOSS_067:-0.50000):1.63853, (EMBOSS_071:-0.50000,EMBOSS_015:-0.50000):1.63853):0.59421):0.17397, (((((EMBOSS_054:-0.50000,EMBOSS_026:-0.50000):1.22934,EMBOSS_052:0.72934):0.95330, EMBOSS_095:1.68264):0.15887,((EMBOSS_068:1.44021,EMBOSS_072:1.44021):0.17219, ((((EMBOSS_065:-0.50000,EMBOSS_062:-0.50000):0.00000,EMBOSS_059:-0.50000):1.53425, EMBOSS_046:1.03425):0.27633,EMBOSS_087:1.31058):0.30183):0.22910):0.02311, (((EMBOSS_080:-0.50000,EMBOSS_002:-0.50000):0.98801,EMBOSS_021:0.48801):1.00237, ((EMBOSS_035:-0.50000,EMBOSS_088:-0.50000):1.31447,EMBOSS_010:0.81447):0.67592):0.37423):0.04210):0.04173, ((((EMBOSS_078:-0.50000,EMBOSS_063:-0.50000):0.97078,EMBOSS_053:0.47078):1.01179, (EMBOSS_049:-0.50000,EMBOSS_030:-0.50000):1.98257):0.35719, (((EMBOSS_079:-0.50000,EMBOSS_060:-0.50000):1.03209,EMBOSS_007:0.53209):1.08158, 
((EMBOSS_040:-0.50000,EMBOSS_077:-0.50000):1.91905,(EMBOSS_017:-0.50000, EMBOSS_016:-0.50000):1.91905):0.19462):0.22610):0.10868):0.01525):0.01135, ((((((EMBOSS_011:-0.50000,EMBOSS_043:-0.50000):0.00000,EMBOSS_064:-0.50000):1.93262, (EMBOSS_090:-0.50000,EMBOSS_006:-0.50000):1.93262):0.13110, ((EMBOSS_051:-0.50000,EMBOSS_069:-0.50000):1.20725,EMBOSS_085:0.70725):0.85647):0.22219, ((EMBOSS_050:-0.50000,EMBOSS_024:-0.50000):1.28332,(EMBOSS_083:-0.50000, EMBOSS_084:-0.50000):1.28332):1.00259):0.12842,(((EMBOSS_099:-0.50000, EMBOSS_061:-0.50000):1.27575,EMBOSS_020:0.77575):1.03830, EMBOSS_081:1.81405):0.10028):0.06072); spidey.test1100644000766000024 1405314006427353 16676 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data--SPIDEY version 1.40-- Genomic: lcl|chr2 No definition line found, 145732769 bp mRNA: lcl|tmpseq_0 No definition line found, 1110 bp Strand: minus Number of exons: 6 Exon 1(-): 36375691-36375798 (gen) 1-108 (mRNA) id 97.2% mismatches 3 gaps 0 splice site (d a): 1 0 Exon 2(-): 36369345-36369492 (gen) 109-256 (mRNA) id 100.0% mismatches 0 gaps 0 splice site (d a): 0 1 Exon 3(-): 36367232-36367437 (gen) 257-462 (mRNA) id 100.0% mismatches 0 gaps 0 splice site (d a): 1 1 Exon 4(-): 36364083-36364229 (gen) 463-609 (mRNA) id 100.0% mismatches 0 gaps 0 splice site (d a): 1 1 Exon 5(-): 36358231-36358489 (gen) 610-868 (mRNA) id 100.0% mismatches 0 gaps 0 splice site (d a): 1 1 Exon 6(-): 36356457-36356698 (gen) 869-1110 (mRNA) id 100.0% mismatches 0 gaps 0 splice site (d a): 0 1 Number of splice sites: 4 mRNA coverage: 100% overall percent identity: 99.7% Missing mRNA ends: neither Genomic: lcl|chr2 No definition line found mRNA: lcl|tmpseq_0 No definition line found Exon 1: 36375798-36375691 (gen) 1-108 (mRNA) CCTCTTTTTCTTTGCAGGGTATATACCCAGTTACTTAGACAAGGATGAGCTATGTGTAGT | |||||||||||||||||||||||||||||||||||||||||||||| ATGTCAGGGTATATACCCAGTTACTTAGACAAGGATGAGCTATGTGTAGT M S G Y I P S Y L D K D E L C V V 
ATGTGGGGACAAAGCCACCGGATATCATTATCGCTGCATCACTTGTGAAGGTTGCAAGGT |||||||||||||||||||||||||||||||||||||||||||||||||||||||||| ATGTGGGGACAAAGCCACCGGATATCATTATCGCTGCATCACTTGTGAAGGTTGCAAG C G D K A T G Y H Y R C I T C E G C K AAATGGCA Exon 2: 36369492-36369345 (gen) 109-256 (mRNA) TTGCACTTAGGGATTTTTCAGAAGAACCATTCAGAAAAACCTCCATCCAACCTATTCCTG |||||||||||||||||||||||||||||||||||||||||||||||||| GGATTTTTCAGAAGAACCATTCAGAAAAACCTCCATCCAACCTATTCCTG G F F R R T I Q K N L H P T Y S C TAAATATGAAGGAAAATGTGTGATAGACAAAGTAACAAGAAATCAGTGCCAGGAATGTCG |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| TAAATATGAAGGAAAATGTGTGATAGACAAAGTAACAAGAAATCAGTGCCAGGAATGTCG K Y E G K C V I D K V T R N Q C Q E C R CTTCAAAAAATGTATCTTTGTTGGCATGGCAACAGATTGTGAGTATAT |||||||||||||||||||||||||||||||||||||| CTTCAAAAAATGTATCTTTGTTGGCATGGCAACAGATT F K K C I F V G M A T D Exon 3: 36367437-36367232 (gen) 257-462 (mRNA) TCCCTGCTAGTGGTGTTGGATGACAGCAAGAGGCTGGCAAAGAGGAAGCTGATAGAAGAA |||||||||||||||||||||||||||||||||||||||||||||||||| TGGTGTTGGATGACAGCAAGAGGCTGGCAAAGAGGAAGCTGATAGAAGAA L V L D D S K R L A K R K L I E E AATCGAGAGAAGAGGCGTCGGGAAGAGCTGCAGAAAACGATTGGTCACAAACCAGAACCA |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| AATCGAGAGAAGAGGCGTCGGGAAGAGCTGCAGAAAACGATTGGTCACAAACCAGAACCA N R E K R R R E E L Q K T I G H K P E P ACAGATGAGGAATGGGAGCTGATCAAAATTGTCACTGAAGCACATGTGGCCACCAATGCA |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| ACAGATGAGGAATGGGAGCTGATCAAAATTGTCACTGAAGCACATGTGGCCACCAATGCA T D E E W E L I K I V T E A H V A T N A CAAGGAAGCCACTGGAAGCAGAAAAGGAAATTTCTGGTAGGGACTA |||||||||||||||||||||||||||||||||||| CAAGGAAGCCACTGGAAGCAGAAAAGGAAATTTCTG Q G S H W K Q K R K F L Exon 4: 36364229-36364083 (gen) 463-609 (mRNA) ATATCCTTAGCCAGAAGACATTGGGCAAGCACCAATAGTTAATGCCCCAGAAGGGGGGAA |||||||||||||||||||||||||||||||||||||||||||||||||| CCAGAAGACATTGGGCAAGCACCAATAGTTAATGCCCCAGAAGGGGGGAA P E D I G Q A P I V N A P E G G K AGTGGATTTAGAAGCCTTCAGCCAGTTTACAAAAATTATCACACCAGCGATTACAAGAGT 
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| AGTGGATTTAGAAGCCTTCAGCCAGTTTACAAAAATTATCACACCAGCGATTACAAGAGT V D L E A F S Q F T K I I T P A I T R V GGTGGATTTTGCCAAAAAGTTGCCTATGTTTTGTGAGGTAAGACAAA ||||||||||||||||||||||||||||||||||||| GGTGGATTTTGCCAAAAAGTTGCCTATGTTTTGTGAG V D F A K K L P M F C E Exon 5: 36358489-36358231 (gen) 610-868 (mRNA) ATTTCTGCAGCTGCCATGTGAAGACCAGATCATCCTTCTGAAAGGCTGCTGTATGGAGAT |||||||||||||||||||||||||||||||||||||||||||||||||| CTGCCATGTGAAGACCAGATCATCCTTCTGAAAGGCTGCTGTATGGAGAT L P C E D Q I I L L K G C C M E I AATGTCCCTCCGAGCAGCAGTTCGCTATGACCCCGAGAGTGAGACTTTAACGCTAAATGG |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| AATGTCCCTCCGAGCAGCAGTTCGCTATGACCCCGAGAGTGAGACTTTAACGCTAAATGG M S L R A A V R Y D P E S E T L T L N G GGAGATGGCGGTGACAAGGGGCCAGCTGAAAAATGGGGGTCTTGGCGTAGTTTCTGATGC |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| GGAGATGGCGGTGACAAGGGGCCAGCTGAAAAATGGGGGTCTTGGCGTAGTTTCTGATGC E M A V T R G Q L K N G G L G V V S D A CATTTTTGACCTGGGCATGTCTCTTTCTTCATTTAACCTGGATGACACCGAGGTTGCCCT |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| CATTTTTGACCTGGGCATGTCTCTTTCTTCATTTAACCTGGATGACACCGAGGTTGCCCT I F D L G M S L S S F N L D D T E V A L TCTCCAGGCTGTCCTGCTCATGTCATCAGGTGAGAACAG ||||||||||||||||||||||||||||| TCTCCAGGCTGTCCTGCTCATGTCATCAG L Q A V L L M S S Exon 6: 36356698-36356457 (gen) 869-1110 (mRNA) GTATCTGCAGATCGCCCAGGCCTTGTTTGCGTCGAGAGAATAGAAAAGTGTCAAGAGGGT |||||||||||||||||||||||||||||||||||||||||||||||||| ATCGCCCAGGCCTTGTTTGCGTCGAGAGAATAGAAAAGTGTCAAGAGGGT D R P G L V C V E R I E K C Q E G TTCCTCCTGGCATTTGAACACTACATTAATTACAGAAAACACCATGTTGCACATTTTTGG |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| TTCCTCCTGGCATTTGAACACTACATTAATTACAGAAAACACCATGTTGCACATTTTTGG F L L A F E H Y I N Y R K H H V A H F W CCAAAACTGCTGATGAAAGTGACAGATCTGCGAATGATTGGAGCCTGCCATGCCAGCCGC |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| CCAAAACTGCTGATGAAAGTGACAGATCTGCGAATGATTGGAGCCTGCCATGCCAGCCGC P K L L 
M K V T D L R M I G A C H A S R TTCCTGCACATGAAGGTGGAGTGCCCCACAGAACTCTTCCCTCCATTGTTCCTGGAGGTG |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| TTCCTGCACATGAAGGTGGAGTGCCCCACAGAACTCTTCCCTCCATTGTTCCTGGAGGTG F L H M K V E C P T E L F P P L F L E V TTTGAGGATTAGAGAGACTGGA |||||||||||| TTTGAGGATTAG F E D * KF527485.gbk100644000766000024 1110314006427353 16074 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS KF527485 1444 bp cRNA linear VRL 02-OCT-2013 DEFINITION Influenza A virus (A/California/07/2009(H1N1)) segment 5 nucleocapsid protein (NP) gene, partial cds. ACCESSION KF527485 VERSION KF527485.1 GI:537446968 KEYWORDS . SOURCE Influenza A virus (A/California/07/2009(H1N1)) ORGANISM Influenza A virus (A/California/07/2009(H1N1)) Viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Influenzavirus A. REFERENCE 1 (bases 1 to 1444) AUTHORS Le,V.L., Courtney,C.L., Steel,J. and Compans,R.W. TITLE Closely Related Influenza Viruses Induce Contrasting Respiratory Tract Immunopathology JOURNAL PLoS ONE 8 (9), E76708 (2013) PUBMED 24086762 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 1444) AUTHORS Le,V.L., Courtney,C.L., Steel,J. and Compans,R.W. TITLE Direct Submission JOURNAL Submitted (07-AUG-2013) Microbiology and Immunology, Emory University, 1518 Clifton Rd Suite 5000, Atlanta, GA 30322, USA COMMENT ##Assembly-Data-START## Assembly Method :: Lasergene v. 
10 Sequencing Technology :: ABI37XL; Sanger dideoxy sequencing ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..1444 /organism="Influenza A virus (A/California/07/2009(H1N1))" /mol_type="viral cRNA" /strain="A/California/07/2009" /serotype="H1N1" /host="Homo sapiens" /db_xref="taxon:641809" /segment="5" /country="USA" /collection_date="09-Apr-2009" /note="passage details: MDCK 6" gene <1..>1444 /gene="NP" CDS <1..>1444 /gene="NP" /codon_start=2 /product="nucleocapsid protein" /protein_id="AGU69828.1" /db_xref="GI:537446969" /translation="ASQGTKRSYEQMETGGERQDATEIRASVGRMIGGIGRFYIQMCT ELKLSDYDGRLIQNSITIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYRRVGGK WMRELILYDKEEIRRVWRQANNGEDATAGLTHIMIWHSNLNDATYQRTRALVRTGMDP RMCSLMQGSTLPRRSGAAGAAVKGVGTIAMELIRMIKRGINDRNFWRGENGRRTRVAY ERMCNILKGKFQTAAQRAMMDQVRESRNPGNAEIEDLIFLARSALILRGSVAHKSCLP ACVYGLAVASGHDFEREGYSLVGIDPFKLLQNSQVVSLMRPNENPAHKSQLVWMACHS AAFEDLRVSSFIRGKKVIPRGKLSTRGVQIASNENVETMDSNTLELRSRYWAIRTRSG GNTNQQKASAGQISVQPTFSVQRNLPFERATVMAAFSGNNEGRTSDMRTEVIRMMESA KPEDLSFQGRGVFELSDEKATNPIVPSFDMS" ORIGIN 1 ggcgtctcaa ggcaccaaac gatcatatga acaaatggag actggtgggg agcgccagga 61 tgccacagaa atcagagcat ctgtcggaag aatgattggt ggaatcggga gattctacat 121 ccaaatgtgc actgaactca aactcagtga ttatgatgga cgactaatcc agaatagcat 181 aacaatagag aggatggtgc tttctgcttt tgatgagaga agaaataaat acctagaaga 241 gcatcccagt gctgggaagg accctaagaa aacaggagga cccatatata gaagagtagg 301 cggaaagtgg atgagagaac tcatccttta tgacaaagaa gaaataagga gagtttggcg 361 ccaagcaaac aatggcgaag atgcaacagc aggtcttact catatcatga tttggcattc 421 caacctgaat gatgccacat atcagagaac aagagcgctt gttcgcaccg gaatggatcc 481 cagaatgtgc tctctaatgc aaggttcaac acttcccaga aggtctggtg ccgcaggtgc 541 tgcggtgaaa ggagttggaa caatagcaat ggagttaatc agaatgatca aacgtggaat 601 caatgaccga aatttctgga ggggtgaaaa tggacgaagg acaagggttg cttatgaaag 661 aatgtgcaat atcctcaaag gaaaatttca aacagctgcc cagagggcaa tgatggatca 721 agtaagagaa agtcgaaacc caggaaacgc tgagattgaa gacctcattt tcctggcacg 781 gtcagcactc attctgaggg gatcagttgc 
acataaatcc tgcctgcctg cttgtgtgta 841 tgggcttgca gtagcaagtg ggcatgactt tgaaagggaa gggtactcac tggtcgggat 901 agacccattc aaattactcc aaaacagcca agtggtcagc ctgatgagac caaatgaaaa 961 cccagctcac aagagtcaat tggtgtggat ggcatgccac tctgctgcat ttgaagattt 1021 aagagtatca agtttcataa gaggaaagaa agtgattcca agaggaaagc tttccacaag 1081 aggggtccag attgcttcaa atgagaatgt ggaaaccatg gactccaata ccctggaact 1141 gagaagcaga tactgggcca taaggaccag gagtggagga aataccaatc aacaaaaggc 1201 atccgcaggc cagatcagtg tgcagcctac attctcagtg cagcggaatc tcccttttga 1261 aagagcaacc gttatggcag cattcagcgg gaacaatgaa ggacggacat ccgacatgcg 1321 aacagaagtt ataagaatga tggaaagtgc aaagccagaa gatttgtcct tccaggggcg 1381 gggagtcttc gagctctcgg acgaaaaggc aacgaacccg atcgtgcctt cctttgacat 1441 gagt //ex1.nucl.nhx100644000766000024 245414006427353 16555 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data(((05_YEAST:0.493471[&&NHX:S=YEAST], 04_SCHPO:0.318204[&&NHX:S=SCHPO] ):0.157022[&&NHX:D=N:S=Ascomycota], 03_ARATH:0.308649[&&NHX:S=ARATH] ):0.04141[&&NHX:D=N:S=Eukaryota], ((((((((08_RAT:0.102249[&&NHX:S=RAT], (13_RAT:0.095242[&&NHX:S=RAT], 10_RAT:0.031982[&&NHX:S=RAT] ):0.018615[&&NHX:D=Y:S=RAT] ):0.014489[&&NHX:D=Y:S=RAT], 11_MOUSE:0.042825[&&NHX:S=MOUSE] ):0.037198[&&NHX:D=N:S=Murinae], 12_HUMAN:0.060992[&&NHX:E=$-PANTR:S=HUMAN] ):0.109605[&&NHX:D=N:E=$-CANFA:S=Eutheria], 09_CHICK:0.105274[&&NHX:S=CHICK] ):0.086729[&&NHX:D=N:E=$-XENTR:S=Amniota], 06_BRARE:0.356832[&&NHX:E=$-Tetraodontidae:S=BRARE] ):0.073642[&&NHX:D=N:S=Euteleostomi], (((18_HUMAN:0.077476[&&NHX:E=$-PANTR:S=HUMAN], 17_MOUSE:0.066898[&&NHX:E=$-RAT:S=MOUSE] ):0.142228[&&NHX:D=N:E=$-CANFA:S=Eutheria], 16_CHICK:0.196657[&&NHX:S=CHICK] ):0.083315[&&NHX:D=N:E=$-XENTR:S=Amniota], (15_FUGRU:0.125305[&&NHX:E=$-TETNG:S=FUGRU], 14_BRARE:0.192069[&&NHX:S=BRARE] ):0.133036[&&NHX:D=N:S=Clupeocephala] ):0.096936[&&NHX:D=N:S=Euteleostomi] ):0.06101[&&NHX:D=Y:E=$-CIOIN:S=Euteleostomi], 07_DROME:0.262868[&&NHX:E=$-ANOGA-APIME:S=DROME] ):0.1118[&&NHX:D=N:S=Bilateria], 
(02_CAEBR:0.168649[&&NHX:S=CAEBR], 01_CAEEL:0.166054[&&NHX:S=CAEEL] ):0.371596[&&NHX:D=N:Loglk=-18289.490411:S=Caenorhabditis] ):0.083573[&&NHX:D=N:S=Bilateria] )[&&NHX:D=N:S=Eukaryota]; ecolitst.bls100644000766000024 2601314006427353 16746 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBLASTP 2.1.3 [Apr-11-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|1786183|gb|AAC73113.1| (AE000111) aspartokinase I, homoserine dehydrogenase I [Escherichia coli] (820 letters) Database: ecoli.aa 4289 sequences; 1,358,990 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAC73113.1| (AE000111) aspartokinase I, homoserine dehydrogen... 1567 0.0 gb|AAC76922.1| (AE000468) aspartokinase II and homoserine dehydr... 332 1e-91 gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensitive [E... 
184 3e-47 gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia coli] 42 3e-04 >gb|AAC73113.1| (AE000111) aspartokinase I, homoserine dehydrogenase I [Escherichia coli] Length = 820 Score = 1567 bits (4058), Expect = 0.0 Identities = 806/820 (98%), Positives = 806/820 (98%) Query: 1 MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60 MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA Sbjct: 1 MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60 Query: 61 LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120 LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA Sbjct: 61 LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120 Query: 121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP Sbjct: 121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180 Query: 181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV Sbjct: 181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240 Query: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD Sbjct: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300 Query: 301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF Sbjct: 301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360 Query: 361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL Sbjct: 361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420 Query: 421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQXXXXXXXXXXXXXXAL 480 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQ AL Sbjct: 421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL 480 Query: 481 
LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL Sbjct: 481 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540 Query: 541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR Sbjct: 541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600 Query: 601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA Sbjct: 601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660 Query: 661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA Sbjct: 661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720 Query: 721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF Sbjct: 721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780 Query: 781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV Sbjct: 781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820 >gb|AAC76922.1| (AE000468) aspartokinase II and homoserine dehydrogenase II [Escherichia coli] Length = 810 Score = 332 bits (850), Expect = 1e-91 Identities = 243/821 (29%), Positives = 403/821 (48%), Gaps = 44/821 (5%) Query: 5 KFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDALPNI 64 KFGG+S+A+ + +LRVA I+ ++ + V+SA TN L+ ++ + + + + + Sbjct: 16 KFGGSSLADVKCYLRVAGIMAEYSQPDDMM-VVSAAGSTTNQLINWLKLSQTDRLSAHQV 74 Query: 65 SDAERIF-AELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINAALI 123 R + +L++GL A+ L + FV + GI+ D++ A ++ Sbjct: 75 QQTLRRYQCDLISGLLPAEEADSL--ISAFVSDLERLAALLDSGIN------DAVYAEVV 126 Query: 124 CRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIPADH 183 GE S +M+ VL +G +D E L A + + E ++ H Sbjct: 127 GHGEVWSARLMSAVLNQQGLPAAWLDAREFLRAER---AAQPQVDEGLSYPLLQQLLVQH 183 
Query: 184 ---MVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240 +++ GF + N GE V+LGRNGSDYSA + A IW+DV GVY+ DPR+V Sbjct: 184 PGKRLVVTGFISRNNAGETVLLGRNGSDYSATQIGALAGVSRVTIWSDVAGVYSADPRKV 243 Query: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300 DA LL + EA EL+ A VLH RT+ P++ +I ++ + P G++R Sbjct: 244 KDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSEIDLQLRCSYTPDQ-----GSTRI 298 Query: 301 EDELP----VKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEY 356 E L + +++ +++ + P + + + RA++ + + + Sbjct: 299 ERVLASGTGARIVTSHDDVCLIEFQVPASQDFKLAHKEIDQILKRAQVRPLAVGVHNDRQ 358 Query: 357 SISFCVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKF 416 + FC A + + E GL L + + LA++++VG G+ +F Sbjct: 359 LLQFCYTSEVADSALKILDEA-------GLPGELRLRQGLALVAMVGAGVTRNPLHCHRF 411 Query: 417 FAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQXXXXXXXXXXXX 476 + L + Q S+ V+ + ++ HQ +F ++ Sbjct: 412 WQQLKGQPVEFTW--QSDDGISLVAVLRTGPTESLIQGLHQSVFRAEKRIGLVLFGKGNI 469 Query: 477 XXALLEQLKRQQSWLKNKH-IDLRVCGVANSKALLTNVHGLN----LENWQEELAQAKEP 531 LE R+QS L + + + GV +S+ L + GL+ L + +E + E Sbjct: 470 GSRWLELFAREQSTLSARTGFEFVLAGVVDSRRSLLSYDGLDASRALAFFNDEAVEQDEE 529 Query: 532 FNLGRLIRLVKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQ 591 L ++ + + V++D T+SQ +ADQY DF GFHV++ NK A S + Y Q Sbjct: 530 ----SLFLWMRAHPYDDLVVLDVTASQQLADQYLDFASHGFHVISANKLAGASDSNKYRQ 585 Query: 592 LRYAAEKSRRKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGM 651 + A EK+ R +LY+ VGAGLP+ +++L+++GD ++ SGI SG+LS++F + D + Sbjct: 586 IHDAFEKTGRHWLYNATVGAGLPINHTVRDLIDSGDTILSISGIFSGTLSWLFLQFDGSV 645 Query: 652 SFSEATTLAREMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNA 711 F+E A + G TEPDPRDDLSG DV RKL+ILARE G +E + +E ++PA Sbjct: 646 PFTELVDQAWQQGLTEPDPRDDLSGKDVMRKLVILAREAGYNIEPDQVRVESLVPAHCEG 705 Query: 712 EGDVAAFMANLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKV 771 G + F N +L++ R+ AR+ G VLRYV D +G RV + V + PL + Sbjct: 706 -GSIDHFFENGDELNEQMVQRLEAAREMGLVLRYVARFDANGKARVGVEAVREDHPLASL 764 Query: 772 
KNGENALAFYSHYYQPLPLVLRGYGAGNDVTAAGVFADLLR 812 +N A S +Y+ PLV+RG GAG DVTA + +D+ R Sbjct: 765 LPCDNVFAIESRWYRDNPLVIRGPGAGRDVTAGAIQSDINR 805 >gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensitive [Escherichia coli] Length = 449 Score = 184 bits (467), Expect = 3e-47 Identities = 142/471 (30%), Positives = 228/471 (48%), Gaps = 41/471 (8%) Query: 3 VLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQ---- 58 V KFGGTSVA+ + R ADI+ S+A V VLSA A ITN LVA+ E G+ Sbjct: 6 VSKFGGTSVADFDAMNRSADIVLSDANVRLV--VLSASAGITNLLVALAEGLEPGERFEK 63 Query: 59 -DALPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCP-- 115 DA+ NI F + + + + +I+ +L I++L + Sbjct: 64 LDAIRNIQ-------------------FAILERLRYPNVIREEIERLLENITVLAEAAAL 104 Query: 116 ---DSINAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTR 172 ++ L+ GE MS + +L R D + + + + DIA Sbjct: 105 ATSPALTDELVSHGELMSTLLFVEILRERDVQAQWFDVRKVMRTNDRFGRAEPDIAALAE 164 Query: 173 RIAASRIPA--DHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVD 230 A +P + +V+ GF KG LGR GSDY+AA+LA L A +IWTDV Sbjct: 165 LAALQLLPRLNEGLVITQGFIGSENKGRTTTLGRGGSDYTAALLAEALHASRVDIWTDVP 224 Query: 231 GVYTCDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQA 290 G+YT DPR V A+ + +++ EA E++ FGAKVLHP T+ P + IP + ++ +P+A Sbjct: 225 GIYTTDPRVVSAAKRIDEIAFAEAAEMATFGAKVLHPATLLPAVRSDIPVFVGSSKDPRA 284 Query: 291 PGTLIGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLIT 350 GTL+ ++ E+ + ++ N + ++ M G A VF ++R ISV LIT Sbjct: 285 GGTLV-CNKTENPPLFRALALRRNQTLLTLHSLNMLHSRGFLAEVFGILARHNISVDLIT 343 Query: 351 QSSSEYSISFCVPQSDCV-RAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTL 409 +SE S++ + + + + + +EL L + V E LA+++++G+ + Sbjct: 344 --TSEVSVALTLDTTGSTSTGDTLLTQSLLMEL--SALCRVEVEEGLALVALIGNDLSKA 399 Query: 410 RGISAKFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLF 460 G+ + F L NI + I G+S ++ +V +DA V+ H LF Sbjct: 400 CGVGKEVFGVLEPFNIRM--ICYGASSHNLCFLVPGEDAEQVVQKLHSNLF 448 >gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia coli] Length = 241 Score = 41.6 bits (96), Expect = 3e-04 Identities = 
28/97 (28%), Positives = 44/97 (44%), Gaps = 8/97 (8%) Query: 199 LVVLGRNGSDYSAAVLAACLR-----ADCCEIWTDVDGVYTCDPRQVPDARLLKSMSYQE 253 +++ G+ + AACLR AD T VDGV+T DP + P A + + ++Y E Sbjct: 132 VILSAGTGNPFFTTDSAACLRGIEIEADVVLKATKVDGVFTADPAKDPTATMYEQLTYSE 191 Query: 254 AMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQA 290 +E KV+ T ++P + N P A Sbjct: 192 VLEKE---LKVMDLAAFTLARDHKLPIRVFNMNKPGA 225 Database: ecoli.aa Posted date: Dec 6, 2001 1:58 PM Number of letters in database: 1,358,990 Number of sequences in database: 4289 Lambda K H 0.319 0.135 0.383 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2022122 Number of Sequences: 4289 Number of extensions: 82424 Number of successful extensions: 256 Number of sequences better than 1.0e-03: 4 Number of HSP's better than 0.0 without gapping: 3 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 243 Number of HSP's gapped (non-prelim): 4 length of query: 820 length of database: 1,358,990 effective HSP length: 47 effective length of query: 773 effective length of database: 1,157,407 effective search space: 894675611 effective search space used: 894675611 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 92 (40.0 bits) protpars.phy100644000766000024 5155514006427353 17023 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data 3 5127 SINFRUP001 .......... ......DDQV VLQCTASVLK EQIKLCLSCE GFGNRLCFLE SINFRUP002 .......... ......DDQV VLQCTASVLK EQIKLCLSCE GFGNRLCFLE ENSP000003 .MGDAEGEDE VQFLRTDDEV VLQCSATVLK EQLKLCLAAE GFGNRLCFLE TTSNAQNVPP DLAICTFILE QSLSVRALQE MLANTVEMTE AVDLDKWSSQ TTSNAQNVPP DLAICTFILE QSLSVRALQE MLANTVEMTE AVDLDKWSSQ PTSNAQNVPP DLAICCFVLE QSLSVRALQE MLANT..... 
.VEAGVESSQ GGGHRTLLYG HAILLRHNHS GMYLSCLTTS RSLTDKLAFD VGLQEDSTGE GGGHRTLLYG HAILLRHNHS GMYLSCLTTS RSLTDKLAFD VGLQEDSTGE GGGHRTLLYG HAILLRHAHS RMYLSCLTTS RSMTDKLAFD VGLQEDATGE ACWWTIHPAS KQRSEGEKVR VGDDLILVSV SSERYLHLSY ASGDLMVDAS ACWWTIHPAS KQRSEGEKVR VGDDLILVSV SSERYLHLSY ASGDLMVDAS ACWWTMHPAS KQRSEGEKVR VGDDIILVSV SSERYLHLST ASGELQVDAS FMQTLWNMNP ISSGCELAEG FLTGGHVLRL FHGHMDECLA IATPEEGEEK FMQTLWNMNP ISSGCELAEG FLTGGHVLRL FHGHMDECLA IATPEEGEEK FMQTLWNMNP ICSRCE..EG FVTGGHVLRL FHGHMDECLT ISPADS.DDQ RRMAHYEGGS VCSQARSLWR LEPLRISWSG SHMKWGQSFR IRHITTGRYL RRMAHYEGGS VCSQARSLWR LEPLRISWSG SHMKWGQSFR IRHITTGRYL RRLVYYEGGA VCTHARSLWR LEPLRISWSG SHLRWGQPLR VRHVTTGQYL CLDEEKGLLV VDPERANTKL SAFCFRASKE KVDVAQKRDV EGMGIPEIKY CLDEEKGLLV VDPERANTKL SAFCFRASKE KVDVAQKRDV EGMGIPEIKY ALTEDQGLVV VDASKAHTKA TSFCFRISKE KLDVAPKRDV EGMGPPEIKY GESMCFVQHV STGLWLTYAS LDAKAARLGM MKRKVILHQE GHMDDALTVS GESMCFVQHV STGLWLTYAS LDAKAARLGM MKRKVILHQE GHMDDALTVS GESLCFVQHV ASGLWLTYAA PDPKALRLGV LKKKAMLHQE GHMDDALSLT RSQTEESQAA RMIYSTVGLF RQFIKGLDTL TGKNKSPGAL S...LPLEGV RSQTEESQAA RMIYSTVGLF RQFIKGLDTL TGKNKSPGAL S...LPLEGV RCQQEESQAA RMIHSTNGLY NQFIKSLDSF SGKPRGSGPP AGTALPIEGV ILSLQDLIFY FRPPDEELEH EEKQTKLRSL RNRQNLFQEE GMITIVLECI ILSLQDLIFY FRPPDEELEH EEKQTKLRSL RNRQNLFQEE GMITIVLECI ILSLQDLIIY FEPPSEDLQH EEKQSKLRSL RNRQSLFQEE GMLSMVLNCI DRLNVYNTAA HFSEFAGEEA AESWKEIVNL LYELLASLIR GNRSNCALFC DRLNVYNTAA HFSEFAGEEA AESWKEIVNL LYELLASLIR GNRSNCALFC DRLNVYTTAA HFAEFAGEEA AESWKEIVNL LYELLASLIR GNRSNCALFS DNLDWLVSKL DRLEASSGIL EVLYCVLIES PEVLNIIQEN HIKSIISLLD DNLDWLVSKL DRLEASSGIL EVLYCVLIES PEVLNIIQEN HIKSIISLLD TNLDWLVSKL DRLEASSGIL EVLYCVLIES PEVLNIIQEN HIKSIISLLD KHGRNHKVLD VLRSLCVCNG VAVRSNQNLI TENLLPGRDL LLQTNIVNYV KHGRNHKVLD VLRSLCVCNG VAVRSNQNLI TENLLPGRDL LLQTNIVNYV KHGRNHKVLD VLCSLCVCNG VAVRSNQDLI TENLLPGREL LLQTNLINYV TSVRPNIFLG TCEGSTQYKK WYYEVMVDHV EAFVTAQATH LRVGWAMTEG TSVRPNIFLG TCEGSTQYKK WYYEVMVDHV EAFVTAQATH LRVGWAMTEG TSIRPNIFVG RAEGTTQYSK WYFEVMVDEV TPFLTAQATH LRVGWALTEG 
YSPYPGGGEG WGGNGVGDDL YSYSFDGLHL WSGTVPRQVA SPNAHTLAAD YSPYPGGGEG WGGNGVGDDL YSYSFDGLHL WSGTVPRQVA SPNAHTLAAD YTPYPGAGEG WGGNGVGDDL YSYGFDGLHL WTGHVARPVT SPGQHLLAPE DVVSCCLDLS VPSISFRING HPVQGMFENF NVDSLFFPVI SFSAGVKARF DVVSCCLDLS VPSISFRING HPVQGMFENF NVDSLFFPVI SFSAGVKARF DVISCCLDLS VPSISFRING CPVQGVFESF NLDGLFFPVV SFSAGVKVRF LLGGRHGDFK FMPPPGYAPC YEALLPRERM RIEPIKEYKH DFNGVRNLLG LLGGRHGDFK FMPPPGYAPC YEALLPRERM RIEPIKEYKH DFNGVRNLLG LLGGRHGEFK FLPPPGYAPC HEAVLPRERL HLEPIKEYRR EGPRGPHLVG PTLSLTHTSF TPCPVDTVQI VLPPHLERIR EKLAENIHEL WAVTRIEQGW PTLSLTHTSF TPCPVDTVQI VLPPHLERIR EKLAENIHEL WAVTRIEQGW PSRCLSHTDF VPCPVDTVQI VLPPHLERIR EKLAENIHEL WALTRIEQGW TYGSFRDDNK KLHPCLVDFQ SLPEPERNYN LQMSAETLKC VCAV...A.. TYGSFRDDNK KLHPCLVDFQ SLPEPERNYN LQMSAETLKC VCAV...A.. TYGPVRDDNK RLHPCLVDFH SLPEPERNYN LQMSGETLKT LLALGCHVGM ......ETLH DCVSSR.YVM SNAYKPAPLD LSHVKLTPNQ NQLVEKLAEN ......ETLH DCVSSR.YVM SNAYKPAPLD LSHVKLTPNQ NQLVEKLAEN ADEKAEDNLK KTKLPKTYMM SNGYKPAPLD LSHVRLTPAQ TTLVDRLAEN GHNVWARDRV RQGWTYSIVQ DILNKRNPRL VPYILLDERT KKTNRDSVNN GHNVWARDRV RQGWTYSIVQ DILNKRNPRL VPYILLDERT KKTNRDSVNN GHNVWARDRV GQGWSYSAVQ DIPARRNPRL VPYRLLDEAT KRSNRDSLCQ AVRTLIGYGY NIEPPDQEST GHGLENTRGD KVRIFRAEKS YAVTQGKWYF AVRTLIGYGY NIEPPDQEST GHGLENTRGD KVRIFRAEKS YAVTQGKWYF AVRTLLGYGY NIEPPDQEPS Q.VENQSRCD RVRIFRAEKS YTVQSGRWYF EFEAVTTGEM RVGWARPNVH SDTELGADEL AYVFNGNKA. ........QR EFEAVTTGEM RVGWARPNVH SDTELGADEL AYVFNGNKA. ........QR EFEAVTTGEM RVGWARPELR PDVELGADEL AYVFNGHRG. 
........QR WHIGNEPFGR QWQSGDVVGC MIDLTEMNIM FTLNGEMLIS DSGSEMAFKD WHIGNEPFGR QWQSGDVVGC MIDLTEMNIM FTLNGEMLIS DSGSEMAFKD WHLGSEPFGR PWQPGDVVGC MIDLTENTII FTLNGEVLMS DSGSETAFRE IEIGEGFIPV CTLGLSQVGR INLGQNVSSL RYFAICGLQE GFEPFAINMK IEIGEGFIPV CTLGLSQVGR INLGQNVSSL RYFAICGLQE GFEPFAINMK IEIGDGFLPV CSLGPGQVGH LNLGQDVSSL RFFAICGLQE GFEPFAINMQ RDTTMWFSKS LPQFVPVPAD HNHIEVSRVD GTVDSAPCLK LTHKTYGSQN RDTTMWFSKS LPQFVPVPAD HNHIEVSRVD GTVDSAPCLK LTHKTYGSQN RPVTTWFSKG LPQFEPVPLE HPHYEVSRVD GTVDTPPCLR LTHRTWGSQN ANTDMLFLRL SMPIQFHATF KVPAGTTPLT RALTIP...E DVAVVEPDSE ANTDMLFLRL SMPIQFHATF KVPAGTTPLT RALTIP...E DVAVVEPDSE SLVEMLFLRL SLPVQFHQHF RCTAGATPLA PPGLQPPAED EARAAEPDPD FEVLKKSASR KEQEEDKKEP SVPKEI.... ........L. .AENEKDTMS FEVLKKSASR KEQEEDKKEP SVPKEI.... ........L. .AENEKDTMS YENLRRSAGG WSEAENGKEG TAKEGAPGGT PQAGGEAQPA RAENEKDATT EKGKKRGFFS KAKKAAMTPL A.....PPPP PTVPRLVEDV VPDD.RDDPE EKGKKRGFFS KAKKAAMTPL A.....PPPP PTVPRLVEDV VPDD.RDDPE EKNKKRGFLF KAKKVAMMTQ P......PAT PTLPRLPHDV VPADNRDDPE IILSTTTYYY SVRIFAGQEP SGVWVGWVTP DYHQYDQTFD LSKVRSVTVT IILSTTTYYY SVRIFAGQEP SGVWVGWVTP DYHQYDQTFD LSKVRSVTVT IILNTTTYYY SVRVFAGQEP SCVWAGWVTP DYHQHDMSFD LSKVRVVTVT VGDDKGNIYN SMKRSNCYMV WGDDLVS.NH QTRFSQEDMV IGCLVDLATG VGDDKGNIYN SMKRSNCYMV WGDDLVS.NH QTRFSQEDMV IGCLVDLATG MGDEQGNVHS SLKCSNCYMV WGGDFVSPGQ QGRISHTDLV IGCLVDLATG LMTFTANGKE INTFYQVEPN TKLFPAVFVQ PLSQNMVQLE LGKLKNIMPI LMTFTANGKE INTFYQVEPN TKLFPAVFVQ PLSQNMVQLE LGKLKNIMPI LMTFTANGKE SNTFFQVEPN TKLFPAVFVL PTHQNVIQFE LGKQKNIMPL SAAMFRSERN NPVPQCPPRL DVQMLTPVIW SRMPNRFLNP DVGRVSERLG SAAMFRSERN NPVPQCPPRL DVQMLTPVIW SRMPNRFLNP DVGRVSERLG SAAMFQSERK NPAPQCPPRL EMQMLMPVSW SRMPNHFLQV ETRRAGERLG WVVECTEPLI MMALHIPEEN RCIDILELSE RQDLMKFHYH TLMLYCAVCA WVVECTEPLI MMALHIPEEN RCIDILELSE RQDLMKFHYH TLMLYCAVCA WAVQCQEPLT MMALHIPEEN RCMDILELSE RLDLQRFHSH TLRLYRAVCA LGNNRVAHAL CSHVDESQLF YATENTYLPG PLRSGYYDLL ISIHLESAKR LGNNRVAHAL CSHVDESQLF YATENTYLPG PLRSGYYDLL ISIHLESAKR LGNNRVAHAL CSHVDQAQLL HALEDAHLPG PLRAGYYDLL ISIHLESACR 
ARLGTNREFI VPMTEETLSI KLYPDAV... ...KAHSLPG VGLTTCLRPK ARLGTNREFI VPMTEETLSI KLYPDAV... ...KAHSLPG VGLTTCLRPK SRRSMLSEYI VPLTPETRAI TLFPPGRSTE NGHPRHGLPG VGVTTSLRPP LHFS...... SINFVGTDLD LYTLSPVFPL QELKNRAISM LTEAVLDGSQ LHFS...... SINFVGTDLD LYTLSPVFPL QELKNRAISM LTEAVLDGSQ HHFSPPCFVA ALPAAGAAEA PARLSPAIPL EALRDKALRM LGEAVRDGGQ AMRDPVGGSV EFHFVPILKL ISTLLIMGIF NDDDTKHILK MIDPNVFSGK AMRDPVGGSV EFHFVPILKL ISTLLIMGIF NDDDTKHILK MIDPNVFSGK HARDPVGGSV EFQFVPVLKL VSTLLVMGIF GDEDVKQILK MIEPEVFTEE DDEE...... ETDKPVEGGP AEGEGDKAKG EESEEAAELE D...EGVGKV DDEE...... ETDKPVEGGP AEGEGDKAKG EESEEAAELE D...EGVGKV EEEE...... ..DEEEEGEE EDEEE..... .........K E...EDEEET DGEKMEEEKE AEVVAVDLKD EEEGLEEGLL QMKLPESVKL QMCTLLQFFC DGEKMEEEKE AEVVAVDLKD EEEGLEEGLL QMKLPESVKL QMCTLLQFFC AQEKEDEEKE EEEAAE..GE KEEGLEEGLL QMKLPESVKL QMCHLLEYFC DCELRHRVEA IVAYSDKFVH NIQDNQRIRY NQLMRAFTMS AAETARKTRE DCELRHRVEA IVAYSDKFVH NIQDNQRIRY NQLMRAFTMS AAETARKTRE DQELQHRVES LAAFAERYVD KLQANQRSRY GLLIKAFSMT AAETARRTRE FRSPPQDQVL LLTNFKHSLE EEECPVPDNV RETLKEFHND LLLHCGIHIE FRSPPQDQVL LLTNFKHSLE EEECPVPDNV RETLKEFHND LLLHCGIHIE FRSPPQEQIN MLLQFKDGTD EEDCPLPEEI RQDLLDFHQD LLAHCGIQLD EEPVEEEVDT SLRGRLLSLV DKIKSIRGKK TEEKPE.VEE ETKPSTLQEL EEPVEEEVDT SLRGRLLSLV DKIKSIRGKK TEEKPE.VEE ETKPSTLQEL GEEEEPEEET TLGSRLMSLL EKVRLVKKKE EKPEEERSAE ESKPRSLQEL ISHTMIHWAQ ESFIQNPELV RLMFSLLHRQ YDGLGELIRA LPKAYAINAV ISHTMIHWAQ ESFIQNPELV RLMFSLLHRQ YDGLGELIRA LPKAYAINAV VSHMVVRWAQ EDFVQSPELV RAMFSLLHRQ YDGLGELLRA LPRAYTISPS SVQDTMDLLE CLGQIRSLLI VQMGPEEERL MIQSIGNIMN NKVFYQHPNL SVQDTMDLLE CLGQIRSLLI VQMGPEEERL MIQSIGNIMN NKVFYQHPNL SVEDTMSLLE CLGQIRSLLI VQMGPQEENL MIQSIGNIMN NKVFYQHPNL MRALGMHETV MEVMVNVLGG GGDSKEIRFP QMVTNCCRFL CYFCRISRQN MRALGMHETV MEVMVNVLGG GGDSKEIRFP QMVTNCCRFL CYFCRISRQN MRALGMHETV MEVMVNVLGG G.ESKEIRFP KMVTSCCRFL CYFCRISRQN QRSMFDHLSY LLQNSSIGLG MRGSTPLDVA AASCIDNNEL ALALQEQDLE QRSMFDHLSY LLQNSSIGLG MRGSTPLDVA AASCIDNNEL ALALQEQDLE QRSMFDHLSY LLENSGIGLG MQGSTPLDVA AASVIDNNEL ALALQEQDLE MVVTYLAGCG 
LQMCPMLLSK CYPDIGWNPC GGERYLDFLR FAVFVNGESV MVVTYLAGCG LQMCPMLLSK CYPDIGWNPC GGERYLDFLR FAVFVNGESV KVVSYLAGCG LQSCPMLVAK GYPDIGWNPC GGERYLDFLR FAVFVNGESV EENANVVVRL LIRRPECFGP ALRGEGGNGL LAAMEEAIKI SEDPARDGPT EENANVVVRL LIRRPECFGP ALRGEGGNGL LAAMEEAIKI SEDPARDGPT EENANVVVRL LIRKPECFGP ALRGEGGSGL LAAIEEAIRI SEDPARDGPG VKKDRRF.MF GGEEQQEENR VHLGNAIMSF YSALIDLLGR CAPEMHLIQA VKKDRRF.MF GGEEQQEENR VHLGNAIMSF YSALIDLLGR CAPEMHLIQA IRRDRRR.EH FGEEPPEENR VHLGHAIMSF YAALIDLLGR CAPEMHLIQA GKGEALRIRA ILRSLVPIED LVGVISLPVQ IPSYGKDSQI VEPKMSASFV GKGEALRIRA ILRSLVPIED LVGVISLPVQ IPSYGKDSQI VEPKMSASFV GKGEALRIRA ILRSLVPLED LVGIISLPLQ IPTLGKDGAL VQPKMSASFV PDHKASMVLF LDRVYGIDNQ DFLLHVLEVG FLPDMRAAAS LDTVAFSTTE PDHKASMVLF LDRVYGIDNQ DFLLHVLEVG FLPDMRAAAS LDTVAFSTTE PDHKASMVLF LDRVYGIENQ DFLLHVLDVG FLPDMRAAAS LDTATFSTTE MALALNRYLC SAVLPLLTKC APLFAGTDHR AIMIDSMLHT IYRLSRGRAL MALALNRYLC SAVLPLLTKC APLFAGTDHR AIMIDSMLHT IYRLSRGRAL MALALNRYLC LAVLPLITKC APLFAGTEHR AIMVDSMLHT VYRLSRGRSL TKAQRDVIEE CLMSLCKYLR PSMLQHLLRR LVFDVPILNE YAKMPLKLLT TKAQRDVIEE CLMSLCKYLR PSMLQHLLRR LVFDVPILNE YAKMPLKLLT TKAQRDVIED CLMSLCRYIR PSMLQHLLRR LVFDVPILNE FAKMPLKLLT NHYERCWKYY CLPNGWANFG VTSEEELHLS RKLFWGIFES LAHKKFDAEL NHYERCWKYY CLPNGWANFG VTSEEELHLS RKLFWGIFES LAHKKFDAEL NHYERCWKYY CLPTGWANFG VTSEEELHLT RKLFWGIFDS LAHKKYDPEL FKIAMPCLCA IAGAIPPDYV DASYSSHTEK KASVDAEGNF DPKPVETTNT FKIAMPCLCA IAGAIPPDYV DASYSSHTEK KASVDAEGNF DPKPVETTNT YRMAMPCLCA IAGALPPDYV DASYSSKAEK KATVDAEGNF DPRPVETLNV IIPERLDAFI NKYAEHTHDK WAFEKIQNNW TYGEVLDEDA KTHPMLRPYK IIPERLDAFI NKYAEHTHDK WAFEKIQNNW TYGEVLDEDA KTHPMLRPYK IIPEKLDSFI NKFAEYTHEK WAFDKIQNNW SYGENIDEEL KTHPMLRPYK TFSEKDKEIY RWPIKESIKA MLAWEWTLEK ARDGEGEVEK KAATRKISQT TFSEKDKEIY RWPIKESIKA MLAWEWTLEK ARDGEGEVEK KAATRKISQT TFSEKDKEIY RWPIKESLKA MIAWEWTIEK AREGEEEKTE KKKTRKISQS AQATYDPSHG YSPQPIDISG MTLSRELQSM AEQLAENYHN TWGRKKKVEL AQATYDPSHG YSPQPIDISG MTLSRELQSM AEQLAENYHN TWGRKKKVEL AQ.TYDPREG YNPQPPDLSA VTLSRELQAM AEQLAENYHN TWGRKKKQEL QSKGGGTHPL LVPYDTLTAK 
EKARDREKAQ DLLKFLQLNG YAVTR..GMK QSKGGGTHPL LVPYDTLTAK EKARDREKAQ DLLKFLQLNG YAVTR..GMK EAKGGGTHPL LVPYDTLTAK EKARDREKAQ ELLKFLQMNG YAVTRHAGLK DMEQDISSIE KRFAYGFLQK LLKWMDIAQE FIAHLEAVVS SGRVEKSPHE DMEQDISSIE KRFAYGFLQK LLKWMDIAQE FIAHLEAVVS SGRVEKSPHE DMELDSSSIE KRFAFGFLQQ LLRWMDISQE FIAHLEAVVS SGRVEKSPHE QEIKFFAKIL LPLVNQYFKN HCLYFLSTPA KVLGSGGHSS NKEKEMIASI QEIKFFAKIL LPLVNQYFKN HCLYFLSTPA KVLGSGGHSS NKEKEMIASI QEIKFFAKIL LPLINQYFTN HCLYFLSTPA KVLGSGGHAS NKEKEMITSL FCKLAALVRH RVSLFGTDAS AVVNCLHILS RSLDARTVMK SGPEIVKAGL FCKLAALVRH RVSLFGTDAS AVVNCLHILS RSLDARTVMK SGPEIVKAGL FCKLAALVRH RVSLFGTDAP AVVNCLHILA RSLDARTVMK SGPEIVKAGL RQFFESAADD IEKMVENLKL GKVSSRNQ.V KGVSQNINYT TIALLPVLTS RQFFESAADD IEKMVENLKL GKVSSRNQ.V KGVSQNINYT TIALLPVLTS RSFFESASED IEKMVENLRL GKVSQARTQV KGVGQNLTYT TVALLPVLTT LFDHIAQHQF GDDVILDDLQ ISCYRIMCSI YSLGTVKTPH AEKQRPALGE LFDHIAQHQF GDDVILDDLQ ISCYRIMCSI YSLGTVKTPH AEKQRPALGE LFQHIAQHQF GDDVILDDVQ VSCYRTLCSI YSLGTTKNTY VEKLRPALGE CLAHLAAAMP VAFLEPTLNE FNTFSVYTTK TPRERSILGL PSQVEELCPD CLAHLAAAMP VAFLEPTLNE FNTFSVYTTK TPRERSILGL PSQVEELCPD CLARLAAAMP VAFLEPQLNE YNACSVYTTK SPRERAILGL PNSVEEMCPD IPELEVLMKD IHDLAESGAR YTEMPHVIEI TLPMLCNYLP RWWERGLEN. IPELEVLMKD IHDLAESGAR YTEMPHVIEI TLPMLCNYLP RWWERGLEN. IPVLERLMAD IGGLAESGAR YTEMPHVIEI TLPMLCSYLP RWWERGPEAP ...FPEQEGQ ICTSVTSEQL NQLLGSIMKI VVNNLGIDEA SWMKRLAVFA ...FPEQEGQ ICTSVTSEQL NQLLGSIMKI VVNNLGIDEA SWMKRLAVFA PSALPAGAPP PCTAVTSDHL NSLLGNILRI IVNNLGIDEA SWMKRLAVFA QPIVSRAKPE MLKSHFIPTM EKLKKRCGKV VAEEDHLRME GKTEVDSENG QPIVSRAKPE MLKSHFIPTM EKLKKRCGKV VAEEDHLRME GKTEVDSENG QPIVSRARPE LLQSHFIPTI GRLRKRAGKV VSEEEQLRLE AKAEAQEGEL TIRDEFAVLC RDLYALYPLL IRYVDNSRAR WLTNPDPDAE ELFRMVGEVF TIRDEFAVLC RDLYALYPLL IRYVDNSRAR WLTNPDPDAE ELFRMVGEVF LVRDEFSVLC RDLYALYPLL IRYVDNNRAQ WLTEPNPSAE ELFRMVGEIF IFWSKSHNFK REEQNFVVMN EINNMSFLTA DSKSKMSKS. ........GG IFWSKSHNFK REEQNFVVMN EINNMSFLTA DSKSKMSKS. 
........GG IYWSKSHNFK REEQNFVVQN EINNMSFLTA DNKSKMAKVG ACPVSPQSGG SEQERTKKKR RGDRYSVQTS LIVAALKKLL PIGLNMCSPA DQELINLAKI SEQERTKKKR RGDRYSVQTS LIVAALKKLL PIGLNMCSPA DQELINLAKI SDQERTKKKR RGDRYSVQTS LIVATLKKML PIGLNMCAPT DQDLITLAKT RYSLKDTDEE VREFLHNNLH LQGKVE.DPA MRWQMSLYKE MAGKAEDAED RYSLKDTDEE VREFLHNNLH LQGKVE.DPA MRWQMSLYKE MAGKAEDAED RYALKDTDEE VREFLHNNLH LQGKVEGSPS LRWQMALYRG VPGREEDADD PEKVVKRVQE VSAVLYHIEV TEHPFKSKKM VWHKLLSKQR RRAVVACFRM PEKVVKRVQE VSAVLYHIEV TEHPFKSKKM VWHKLLSKQR RRAVVACFRM PEKIVRRVQE VSAVLYYLDQ TEHPYKSKKA VWHKLLSKQR RRAVVACFRM TPLYNIITHR ATNMFLDAYK RNWLETEGYS FEDKMIDDLS VSLDHIRSE. TPLYNIITHR ATNMFLDAYK RNWLETEGYS FEDKMIDDLS VSLDHIRSE. TPLYNLPTHR ACNMFLESYK AAWILTEDHS FEDRMIDDLS KAGEQEEEEE ....KKPDPL HQLILHFSRT ALTEKMKLDV DHLYMSYADI MAKGFSVSPP ....KKPDPL HQLILHFSRT ALTEKMKLDV DHLYMSYADI MAKGFSVSPP EVEEKKPDPL HQLVLHFSRT ALTEKSKLDE DYLYMAYADI MAKSCHLEEG CSASQ..... ........EK EMEKQRLLYQ QSRLHNRGAA EMVLQMISAC CSASQ..... ........EK EMEKQRLLYQ QSRLHNRGAA EMVLQMISAC GENGE...AE EEVEVSFEEK QMEKQRLLYQ QARLHTRGAA EMVLQMISAC KGEPGAMVSS TLKLGISILN GGNSDVQQKM LDYLKDKKDV GFFLSIQSLM KGEPGAMVSS TLKLGISILN GGNSDVQQKM LDYLKDKKDV GFFLSIQSLM KGETGAMVSS TLKLGISILN GGNAEVQQKM LDYLKDKKEV GFFQSIQALM QTCSVLDLNA FERQNKAEGL GMVSEEGTNE KVMADDEFTC DLFRFLQLLC QTCSVLDLNA FERQNKAEGL GMVSEEGTNE KVMADDEFTC DLFRFLQLLC QTCSVLDLNA FERQNKAEGL GMVNEDGTGE KVMADDEFTQ DLFRFLQLLC EGHNNDFQNY LRTQTGSTTT INVIICTVDY LLRLQESISD FYWYYSGKDI EGHNNDFQNY LRTQTGSTTT INVIICTVDY LLRLQESISD FYWYYSGKDI EGHNNDFQNY LRTQTGNTTT INIIICTVDY LLRLQESISD FYWYYSGKDV IDEPGKRNFS KAMNVAKQVF NSLTEYIQGP CTGNQQSLAH SRLWDAVVGF IDEPGKRNFS KAMNVAKQVF NSLTEYIQGP CTGNQQSLAH SRLWDAVVGF IEEQGKRNFS KAMSVAKQVF NSLTEYIQGP CTGNQQSLAH SRLWDAVVGF LHVFAHMMMK LAQ....... ..DSSQIGLL KELLDLQKDM VVMLLSLLEG LHVFAHMMMK LAQ....... ..DSSQIGLL KELLDLQKDM VVMLLSLLEG LHVFAHMMMK LAQ....... 
..DSSQIELL KELLDLQKDM VVMLLSLLEG NVVNGTIAKQ MVDMLVESSS NVEMILKFFD MFLKLKDIVA SDAFRDYVTD NVVNGTIAKQ MVDMLVESSS NVEMILKFFD MFLKLKDIVA SDAFRDYVTD NVVNGMIARQ MVDMLVESSS NVEMILKFFD MFLKLKDIVG SEAFQDYVTD PRGLISKKDF SKAMDSQKQY TPAEIQFLLS CSEADENEMI NFEEFADRFQ PRGLISKKDF SKAMDSQKQY TPAEIQFLLS CSEADENEMI NFEEFADRFQ PRGLISKKDF QKAMDSQKQF SGPEIQFLLS CSEADENEMI NCEEFANRFQ EPAKDIGFNI AVLLTNLSEH VPHDTRLQNF LEQAESVLNY FRPFLGRIEI EPAKDIGFNI AVLLTNLSEH VPHDTRLQNF LEQAESVLNY FRPFLGRIEI EPARDIGFNV AVLLTNLSEH VPHDPRLHNF LELAESILEY FRPYLGRIEI MGASRKIERI YFEISEANRN QWEMPQVRES KRQFIFDVVN EGGESEKMEM MGASRKIERI YFEISEANRN QWEMPQVRES KRQFIFDVVN EGGESEKMEM MGASRRIERI YFEISETNRA QWEMPQVKES KRQFIFDVVN EGGEAEKMEL FVNFCEDTIF EMNIA...AH A......... .......... .......... FVNFCEDTIF EMNIA...AH A......... .......... .......... FVSFCEDTIF EMQIAAQISE PEGEPETDED EGAGAAEAGA EGAEEGAAGL .....PESTS AFADFLKSVV NFFNMFTFRN LRRRYRRFRK MTVKEMVIGL .....PESTS AFADFLKSVV NFFNMFTFRN LRRRYRRFRK MTVKEMVIGL EGTAATAAAG ATARVVAAAG RALRGLSYRS LRRRVRRLRR LTAREAATAV ATFVYTVVMG ILMFVYSICK GFFTLIWKVL FGGGLVESAK KMTVTDILAS ATFVYTVVMG ILMFVYSICK GFFTLIWKVL FGGGLVESAK KMTVTDILAS AALLWAAVTR AGAAGAGAAA GALGLLWGSL FGGGLVEGAK KVTVTELLAG MPDPTQDEVH GELPPEPGSR EDQD..TEGG ADLLDPVGGE EEEEDSEERE MPDPTQDEVH GELPPEPGSR EDQD..TEGG ADLLDPVGGE EEEEDSEERE MPDPTSDEVH GEQPAGPGGD ADGEGASEGA GDAAEG.AGD EEEAVHEAGP GGRLPGFNTP .......... GGLGDFGETT PEEPPTPEGT PLLKRKLVSR GGRLPGFNTP .......... GGLGDFGETT PEEPPTPEGT PLLKRKLVSR GGADGAVAVT DGGPFRPEGA GGLGDMGDTT PAEPPTPEGS PILKRKLGVD HNQIGGQGEE ENAEHEEPPQ ETEKADTENG EKAKKPEAEP EVKEEEPVEE HNQIGGQGEE ENAEHEEPPQ ETEKADTENG EKAKKPEAEP EVKEEEPVEE GVEEE..LPP EPEPEPEPEL EPEKADAENG EKEEV....P EPTPEP.... EEITVKAKAK KSKKPVEEGF ELWNELEIQR VKFMNYLSRN FYNLRYLALF EEITVKAKAK KSKKPVEEGF ELWNELEIQR VKFMNYLSRN FYNLRYLALF PKKQAPPSPP PKKE..EAGG EFWGELEVQR VKFLNYLSRN FYTLRFLALF IAFALNFILL FYKVSDSPP. GEED.....F EGSGLFEGSG LFEGSGVQED IAFALNFILL FYKVSDSPP. GEED.....F EGSGLFEGSG LFEGSGVQED LAFAINFILL FYKVSDSPP. 
GEDD.....M EGSAAGDVSG AGSG.GSSGW GSGLDDGGED DDEEGPLYYF LEESTGYMEP AMAFLSIVHT IISFLCIIGY GSGLDDGGED DDEEGPLYYF LEESTGYMEP AMAFLSIVHT IISFLCIIGY GLGAGEEAEG DEDENMVYYF LEESTGYMEP ALRCLSLLHT LVAFLCIIGY NCLKVPLVIF KREKELARKL EFDGVYVTEQ PEDDDIKGQW DRLVLNTPSF NCLKVPLVIF KREKELARKL EFDGVYVTEQ PEDDDIKGQW DRLVLNTPSF NCLKVPLVIF KREKELARKL EFDGLYITEQ PEDDDVKGQW DRLVLNTPSF PNNYWDKFVK RKVLDKYGDI YGRERIAELL GMDLASLDVS AMTHEKKPEP PNNYWDKFVK RKVLDKYGDI YGRERIAELL GMDLASLDVS AMTHEKKPEP PSNYWDKFVK RKVLDKHGDI YGRERIAELL GMDLATLEIT AHNERK.PNP DTSMFSWITS IDIKYQIWKF GVVFTDNTFL YLVWYFLMSI LGHYNNFFFA DTSMFSWITS IDIKYQIWKF GVVFTDNTFL YLVWYFLMSI LGHYNNFFFA PPGLLTWLMS IDVKYQIWKF GVIFTDNSFL YLGWYMVMSL LGHYNNFFFA AHLLDIAMGV KTLRTILSSV THNGKQLMMT VGLLAVVVYL YTVVAFNFFR AHLLDIAMGV KTLRTILSSV THNGKQLMMT VGLLAVVVYL YTVVAFNFFR AHLLDIAMGV KTLRTILSSV THNGKQLVMT VGLLAVVVYL YTVVAFNFFR KFYNKSEDED EPDMKCDDMM TCYLFHMYVG VRAGGGIGDE IEDPAGDEYE KFYNKSEDED EPDMKCDDMM TCYLFHMYVG VRAGGGIGDE IEDPAGDEYE KFYNKSEDED EPDMKCDDMM TCYLFHMYVG VRAGGGIGDE IEDPAGDEYE LYRVVFDITF FFFVIVILLA IIQGLIIDAF GELRDQQEQV REDMETKCFI LYRVVFDITF FFFVIVILLA IIQGLIIDAF GELRDQQEQV REDMETKCFI LYRVVFDITF FFFVIVILLA IIQGLIIDAF GELRDQQEQV KEDMETKCFI CGIGSDYFDT TPHGFETHTL EEHNLANYMF FLMYLINKDE TEHTGQESYV CGIGSDYFDT TPHGFETHTL EEHNLANYMF FLMYLINKDE TEHTGQESYV CGIGSDYFDT TPHGFETHTL EEHNLANYMF FLMYLINKDE TEHTGQESYV WKMYQERCWD FFPAGDCFRK QYEDQL. WKMYQERCWD FFPAGDCFRK QYEDQL. 
WKMYQERCWD FFPAGDCFRK QYEDQLS tab3part.mif100644000766000024 316014006427353 16611 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataDIP:4369E DIP:3111N PIR:C64526 GI:2313124 DIP:3109N PIR:B64525 GI:2313117 DIP:4370E DIP:3112N PIR:F64526 GI:2313132 DIP:3109N PIR:B64525 GI:2313117 DIP:4384E DIP:3112N PIR:F64526 GI:2313132 DIP:3111N PIR:C64526 GI:2313124 DIP:4371E DIP:3059N PIR:F64554 GI:2313368 DIP:3109N PIR:B64525 GI:2313117 DIP:4372E DIP:3113N PIR:E64567 GI:2313486 DIP:3109N PIR:B64525 GI:2313117 DIP:4373E DIP:3114N PIR:C64572 GI:2313522 DIP:3109N PIR:B64525 GI:2313117 DIP:4374E DIP:3115N SWP:O25424 PIR:C64610 GI:2313847 DIP:3109N PIR:B64525 GI:2313117 DIP:4375E DIP:3116N SWP:P56082 PIR:E64661 GI:2314284 DIP:3109N PIR:B64525 GI:2313117 DIP:4376E DIP:3117N SWP:P56032 PIR:F64684 GI:2314480 DIP:3109N PIR:B64525 GI:2313117 DIP:4377E DIP:3118N PIR:H64693 GI:2314561 DIP:3109N PIR:B64525 GI:2313117 DIP:4378E DIP:3119N PIR:E64525 GI:2313120 DIP:3119N PIR:E64525 GI:2313120 DIP:4379E DIP:3053N PIR:A64696 GI:2314583 DIP:3119N PIR:E64525 GI:2313120 DIP:4380E DIP:3120N PIR:G64525 GI:2313121 DIP:3120N PIR:G64525 GI:2313121 DIP:4381E DIP:3121N PIR:H64525 GI:2313122 DIP:3120N PIR:G64525 GI:2313121 DIP:4382E DIP:3122N PIR:G64606 GI:2313818 DIP:3120N PIR:G64525 GI:2313121 DIP:4383E DIP:3103N PIR:B64528 GI:2313138 DIP:3123N PIR:E64527 GI:2313147 DIP:4389E DIP:3103N PIR:B64528 GI:2313138 DIP:3123N PIR:E64527 GI:2313147 DIP:4390E DIP:3103N PIR:B64528 GI:2313138 DIP:3123N PIR:E64527 GI:2313147 DIP:4391E DIP:3122N PIR:G64606 GI:2313818 DIP:3120N PIR:G64525 GI:2313121 DIP:5000E DIP:3122N PIR:G64606 GI:2313818 DIP:1111N PIR:G5 GI:4 DIP:5001E DIP:3122N PIR:G64606 GI:2313818 DIP:2222N PIR:G6 GI:2 primedseq.fa100644000766000024 34514006427353 16657 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>Test1 AGCTTTTCATTCTGACTGCAACGGGCAATATGTCTCTGTGTGGATTAAAAAAAGAGTGTCTGATAGCAGC TTCTGAACTGGTTACCTGCCGTGAGTAAATTAAAATTTTATTGACTTAGGTCACTAAATACTTTAACCAA 
TATAGGCATAGCGCACAGACAGATAAAAATTACAGAGTACACAACATCCATGAAACGCATTAGCACCACC AAAAAAAA bug3086.embl100644000766000024 772614006427353 16347 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID NC_003888; SV 3; linear; unassigned DNA; STD; UNC; 8667507 BP. XX AC NC_003888; XX DT 03-MAR-2010 XX DE Streptomyces coelicolor A3(2), complete genome. XX KW complete genome XX OS Streptomyces coelicolor A3(2) (2) OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales; OC Streptomycineae; Streptomycetaceae; Streptomyces. XX RN [1] RP 1-8667507 RX PUBMED; 12000953. RA Bentley,S.D., Chater,K.F., Cerdeno-Tarraga,A.M., Challis,G.L., RA Thomson,N.R., James,K.D., Harris,D.E., Quail,M.A., Kieser,H., Harper,D., RA Bateman,A., Brown,S., Chandra,G., Chen,C.W., Collins,M., Cronin,A., RA Fraser,A., Goble,A., Hidalgo,J., Hornsby,T., Howarth,S., Huang,C.H., RA Kieser,T., Larke,L., Murphy,L., Oliver,K., O'Neil,S., Rabbinowitsch,E., RA Rajandream,M.A., Rutherford,K., Rutter,S., Seeger,K., Saunders,D., RA Sharp,S., Squares,R., Squares,S., Taylor,K., Warren,T., Wietzorrek,A., RA Woodward,J., Barrell,B.G., Parkhill,J. and Hopwood,D.A.; RT Complete genome sequence of the model actinomycete Streptomyces coelicolor RT A3(2); RL Nature 417 (6885), 141-147 (2002) XX RN [2] RP 1-8667507 RX PUBMED; 8843436. RA Redenbach,M., Kieser,H.M., Denapaite,D., Eichner,A., Cullum,J., RA Kinashi,H. and Hopwood,D.A.; RT A set of ordered cosmids and a detailed genetic and physical map for the 8 RT Mb Streptomyces coelicolor A3(2) chromosome; RL Mol. Microbiol. 21 (1), 77-96 (1996) XX RN [3] RP 1-8667507 RA ; RT Direct Submission; RL Submitted (28-MAY-2002) National Center for Biotechnology Information, RL NIH, Bethesda, MD 20894, USA XX RN [4] RP 1-8667507 RA Bentley,S.D.; RT Direct Submission; RL Submitted (09-MAY-2002) Sanger Institute, Wellcome Trust Genome Campus, RL Hinxton, Cambridge CB10 1SA, United Kingdom XX CC PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI CC review. 
The reference sequence was derived from AL645882. On Jun 22, 2003 CC this sequence version replaced gi:31340543. COMPLETENESS: full length. XX FH Key Location/Qualifiers FH FT gene 5241975..5243546 FT /locus_tag="SCO4814" FT /db_xref="GeneID:1100255" FT /gene_synonym="SCD63A.25" FT /gene="purH" FT CDS 5241975..5243546 FT /locus_tag="SCO4814" FT /gene_synonym="SCD63A.25" FT /protein_id="NP_628971.1" FT /gene="purH" FT /transl_table=11 FT /note="involved in de novo purine biosynthesis" FT /db_xref="GI:21223192" FT /db_xref="GeneID:1100255" FT /codon_start=1 FT /translation="MTATAGSNKRAIRRALVSVYDKTGLEDLARGLHEAGVELVSTGST FT AGRIAAAGVPVTKVEELTGFPECLDGRVKTLHPKVHAGILADLRLESHRQQLDELGVAP FT FDLVVVNLYPFRETVASGATPDECVEQIDIGGPSMVRAAAKNHPSVAVVTSPARYADVL FT LAVEGGGFDLAARKRLAAEAFQHTAAYDVAVASWFAAEYAPVDESGFPDFLGATYERAN FT TLRYGENPHQPAALYTSPEGGGLAQAEQLHGKEMSYNNYTDTDAARRAAYDHAEPCVAI FT IKHANPCGIAIGADVAEAHRKAHDCDPVSAYGGVIAVNRPVSKEMAERVAGIFTEVIVA FT PDYEDGALEALTKKKNIRVLRAPAAPAAPVEVKPIDGGALLQVTDRLQAEGDDPATWTL FT ATGEALSEAELAELAFAWRACRAVKSNAILLAKDGASVGVGMGQVNRVDSAKLAVERAG FT AERAQGAYAASDAFFPFPDGLEILTGAGVKAVVQPGGSVRDELVVEAAKKAGVTMYFTG FT TRHFFH" FT /product="bifunctional FT phosphoribosylaminoimidazolecarboxamide FT formyltransferase/IMP cyclohydrolase" FT /EC_number="3.5.4.10" FT /EC_number="2.1.2.3" FT misc_feature 5242401..5243345 FT /locus_tag="SCO4814" FT /gene_synonym="SCD63A.25" FT /gene="purH" FT /note="Pfam match to entry PF01808 AICARFT_IMPCHas, FT AICARFT/IMPCHase bienzyme, score 508.80, E-value 4.2e-149" // dnaE-bsub.fa100644000766000024 410514006427353 16504 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>gi|142864|gb|M10040.1|BACDNAE B.subtilis dnaE gene encoding DNA primase, complete cds GTACGACGGAGTGTTATAAGATGGGAAATCGGATACCAGATGAAATTGTGGATCAGGTGCAAAAGTCGGC AGATATCGTTGAAGTCATAGGTGATTATGTTCAATTAAAGAAGCAAGGCCGAAACTACTTTGGACTCTGT CCTTTTCATGGAGAAAGCACACCTTCGTTTTCCGTATCGCCCGACAAACAGATTTTTCATTGCTTTGGCT GCGGAGCGGGCGGCAATGTTTTCTCTTTTTTAAGGCAGATGGAAGGCTATTCTTTTGCCGAGTCGGTTTC 
TCACCTTGCTGACAAATACCAAATTGATTTTCCAGATGATATAACAGTCCATTCCGGAGCCCGGCCAGAG TCTTCTGGAGAACAAAAAATGGCTGAGGCACATGAGCTCCTGAAGAAATTTTACCATCATTTGTTAATAA ATACAAAAGAAGGTCAAGAGGCACTGGATTATCTGCTTTCTAGGGGCTTTACGAAAGAGCTGATTAATGA ATTTCAGATTGGCTATGCTCTTGATTCTTGGGACTTTATCACGAAATTCCTTGTAAAGAGGGGATTTAGT GAGGCGCAAATGGAAAAAGCGGGTCTCCTGATCAGACGCGAAGACGGAAGCGGATATTTCGACCGCTTCA GAAACCGTGTCATGTTTCCGATCCATGATCATCACGGGGCTGTTGTTGCTTTCTCAGGCAGGGCTCTTGG CAGCCAGCAGCCTAAGTATATGAACAGTCCTGAAACCCCGCTCTTTCATAAAAGCAAACTGCTTTACAAT TTTTATAAGGCCCGCCTTCATATCAGAAAGCAGGAAAGAGCAGTCTTATTTGAAGGGTTTGCTGATGTCT ATACGGCCGTAAGCTCGGATGTAAAGGAAAGCATAGCCACGATGGGAACGTCTCTTACAGATGATCATGT CAAGATCCTGAGAAGAAACGTCGAAGAAATCATTCTTTGCTATGACTCTGATAAAGCCGGTTATGAAGCC ACCTTAAAAGCTTCGGAGCTTCTGCAAAAAAAAGGCTGCAAAGTCAGAGTTGCAATGATTCCTGACGGAT TGGACCCTGATGATTACATCAAAAAATTCGGCGGGGAAAAATTTAAAAACGACATTATTGACGCAAGTGT CACCGTAATGGCGTTCAAAATGCAATATTTCCGAAAAGGAAAGAACCTGTCCGATGAAGGCGACCGCCTA GCTTACATTAAAGACGTACTGAAAGAAATCAGCACGCTTTCAGGGTCTCTAGAGCAGGAAGTCTATGTAA AGCAGCTTGCTTCAGAGTTTTCGCTTTCACAGGAGTCTTTAACTGAGCAGCTGTCTGTTTTCAGCAAGCA AAACAAACCTGCTGACAATAGCGGTGAAACTAAAACGCGGCGAGCGCATCTGACGACAAAAGCAAGGCAA AAACGTTTGCGTCCGGCGTATGAAAATGCAGAAAGGCTGTTACTCGCTCACATGCTTCGAGATCGGAGCG TCATCAAAAAAGTGATTGACCGGGTAGGGTTTCAATTTAATATTGATGAGCACCGGGCATTAGCCGCTTA TCTTTATGCTTTTTATGAAGAGGGAGCCGAGCTGACGCCTCAGCATCTGATGGCCAGGGTGACGGATGAT CATATAAGCCAGCTCTTGTCCGATATATTAATGCTTCAGGTTAATCAAGAGCTTAGCGAAGCCGAGTTAT CAGATTATGTAAAAAAAGTGTTGAATCAAAGAAATTGGTCAATGATAAAAGAAAAAGAGGCGGAAAGAGC CGAAGCAGAAAGGCAAAAAGATTTTTTAAGAGCTGCTTCTTTGGCTCAAGAAATCGTTACATTGAACCGA TCTTTAAAATAACTGGAGAACTGATGAGGAGCATTTATTGGCAATGATTCCTTGCGGAGGAGCAAATAGA TCGCTTAACCTCATCATGAATTGTCATTTCATTATTCGCACATTGTTAAAGGCAGTTCACATAGAAAACG CCTGAATGGACCGAATAAGAATCATACCGCTTATAGAATTC cysprot1a.fa100644000766000024 211114006427353 16624 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>CYS1_DICDI MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHK 
ADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQT ESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCN PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII >ALEU_HORVU MAHARVLLLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGK SYESAAEVRRRFRIFSESLEEVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRD AAALPETKDWREDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNG GLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAF QVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCA IATCASYPVVAA >CATH_HUMAN MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHNN GNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGS CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQGK DGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVN HAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV multiseq.bls100644000766000024 3305414006427353 16766 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10038724|dbj|BAB12759.1| DNA-binding protein hu-alpha [Buchnera sp. 
APS] (92 letters) Database: mycge 1 sequences; 607 total letters Searchingdone ***** No hits found ****** Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.310 0.127 0.323 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 99 Number of Sequences: 1 Number of extensions: 4 Number of successful extensions: 0 Number of sequences better than 1.0e-15: 0 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 92 length of database: 607 effective HSP length: 24 effective length of query: 68 effective length of database: 583 effective search space: 39644 effective search space used: 39644 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 42 (21.8 bits) S2: 156 (65.2 bits) BLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10038744|dbj|BAB12779.1| DNA primase [Buchnera sp. APS] (577 letters) Database: mycge 1 sequences; 607 total letters Searchingdone Score E Sequences producing significant alignments: (bits) Value gi|12045104|ref|NP_072915.1| DNA primase (dnaE) [Mycoplasma gen... 
125 7e-33 >gi|12045104|ref|NP_072915.1| DNA primase (dnaE) [Mycoplasma genitalium] Length = 607 Score = 125 bits (310), Expect = 7e-33 Identities = 118/461 (25%), Positives = 222/461 (47%), Gaps = 57/461 (12%) Query: 10 ITELLSRTNIIELI-NTRLELKKYGKNYQTNCPFHHDKTPSFTVSNEKQFYYCFGCNAHG 68 + ELL + I E+I + ++++ G + CPFH DK PS ++S+ K + C+ CNA G Sbjct: 8 LDELLKQIKITEIIQHYGVKIQTKGNSLLALCPFHDDKNPSMSISSSKNIFKCWACNAAG 67 Query: 69 NAIDFLIQYEHLSFIESIEELALIHGVKIPFENTVQNSIYVKKQKLYLLMEKICKLY--- 125 N I F+ +++ L + ++++ I G+K+ N+ + KQK Y + Y Sbjct: 68 NGIAFIQKHDQLDWKTALKKAIEICGIKLENWNSNLLTKVDPKQKRYWEINNALITYYQT 127 Query: 126 --KKNINVTHLANKYLARRGINQNMIDFFLIGFSSLKWNEFYKKINISKEFEQELLINNI 183 K+ N + N + +R +N+ +I+ F +G + +++ + + E+ IN Sbjct: 128 RLKRETNPNGM-NYLVEKRKLNKTLIEQFQLGLAFHNEDKY-----LCESMERYPFINPK 181 Query: 184 I---------ATDKNGY-IYD------RFQGRIIFPIQDNHGRIIGFGGRSLNDMSP-KY 226 I T++ G +D FQ +I+ PI D +G +GF RS+++++ KY Sbjct: 182 IKPSELYLFSKTNQQGLGFFDFNTKKATFQNQIMIPIHDFNGNPVGFSARSVDNINKLKY 241 Query: 227 LNSPETDIFYKRKQIYGLYQVIKKCSKPVYLLVVEGYIDVITLTQYNIDYAVSILGTSTT 286 NS + + F K + ++ +++ K ++ L +VEGY DV TLT + AV+++G + Sbjct: 242 KNSADHEFFKKGELLFNFHRLNKNLNQ---LFIVEGYFDVFTLTNSKFE-AVALMGLALN 297 Query: 287 TEHIQLL---FKNTDIIICCYDGDDAGKNAAWKTLKKALPYISDKKTLKFILL--PNQED 341 I+ + FK ++ D D +G+NA + ++K +++ + I+ N +D Sbjct: 298 DVQIKAIKAHFKELQTLVLALDNDASGQNAVFSLIEK----LNNNNFIVEIVQWEHNYKD 353 Query: 342 PDTIIRKEGREKF----QKRIDNAITMSKFFFKNILKNINLSSDDDKFHLSVHALPLINT 397 D + +G E+ KR + + FF K L +++ F L T Sbjct: 354 WDELYLNKGSEQVILQANKRQNLIEYLVSFFKKQQLDQRVITNKIIAF------LTKNQT 407 Query: 398 ISSD-TIRIYLRQILARMIGILDDNQFEKFLYEKETKNTQK 437 I +D + I+L + L +++ D EK LYE K+ +K Sbjct: 408 ILNDHSFLIFLIKNLVKLLEYSD----EKTLYETVLKHKEK 444 Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.322 0.140 0.406 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 932 Number 
of Sequences: 1 Number of extensions: 63 Number of successful extensions: 4 Number of sequences better than 1.0e-15: 1 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 1 Number of HSP's gapped (non-prelim): 1 length of query: 577 length of database: 607 effective HSP length: 24 effective length of query: 553 effective length of database: 583 effective search space: 322399 effective search space used: 322399 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.9 bits) S2: 164 (68.3 bits) BLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10038814|dbj|BAB12849.1| integration host factor alpha-subunit [Buchnera sp. 
APS] (102 letters) Database: mycge 1 sequences; 607 total letters Searchingdone ***** No hits found ****** Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.321 0.138 0.372 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 101 Number of Sequences: 1 Number of extensions: 4 Number of successful extensions: 0 Number of sequences better than 1.0e-15: 0 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 102 length of database: 607 effective HSP length: 21 effective length of query: 81 effective length of database: 586 effective search space: 47466 effective search space used: 47466 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.9 bits) S2: 157 (65.6 bits) BLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10038947|dbj|BAB12982.1| DNA-binding protein H-ns [Buchnera sp. 
APS] (135 letters) Database: mycge 1 sequences; 607 total letters Searchingdone ***** No hits found ****** Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.319 0.137 0.390 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 132 Number of Sequences: 1 Number of extensions: 3 Number of successful extensions: 0 Number of sequences better than 1.0e-15: 0 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 135 length of database: 607 effective HSP length: 23 effective length of query: 112 effective length of database: 584 effective search space: 65408 effective search space used: 65408 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.7 bits) S2: 158 (66.0 bits) BLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10038982|dbj|BAB13017.1| integration host factor beta-subunit [Buchnera sp. 
APS] (94 letters) Database: mycge 1 sequences; 607 total letters Searchingdone ***** No hits found ****** Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.319 0.136 0.372 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 114 Number of Sequences: 1 Number of extensions: 6 Number of successful extensions: 0 Number of sequences better than 1.0e-15: 0 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 94 length of database: 607 effective HSP length: 21 effective length of query: 73 effective length of database: 586 effective search space: 42778 effective search space used: 42778 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.7 bits) S2: 157 (65.6 bits) BLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10038996|dbj|BAB13030.1| cold shock-like protein cspC [Buchnera sp. 
APS] (69 letters) Database: mycge 1 sequences; 607 total letters Searchingdone ***** No hits found ****** Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.316 0.136 0.399 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 100 Number of Sequences: 1 Number of extensions: 5 Number of successful extensions: 0 Number of sequences better than 1.0e-15: 0 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 69 length of database: 607 effective HSP length: 20 effective length of query: 49 effective length of database: 587 effective search space: 28763 effective search space used: 28763 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.6 bits) S2: 155 (64.8 bits) BLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10039073|dbj|BAB13107.1| carbon storage regulator [Buchnera sp. 
APS] (57 letters) Database: mycge 1 sequences; 607 total letters Searchingdone ***** No hits found ****** Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.318 0.138 0.362 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 64 Number of Sequences: 1 Number of extensions: 2 Number of successful extensions: 0 Number of sequences better than 1.0e-15: 0 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 57 length of database: 607 effective HSP length: 22 effective length of query: 35 effective length of database: 585 effective search space: 20475 effective search space used: 20475 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.7 bits) S2: 154 (64.4 bits) BLASTP 2.1.2 [Oct-19-2000] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= gi|10039151|dbj|BAB13185.1| cold shock-like protein cspE [Buchnera sp. 
APS] (69 letters) Database: mycge 1 sequences; 607 total letters Searchingdone ***** No hits found ****** Database: mycge Posted date: May 8, 2001 3:12 PM Number of letters in database: 607 Number of sequences in database: 1 Lambda K H 0.313 0.132 0.375 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 80 Number of Sequences: 1 Number of extensions: 2 Number of successful extensions: 0 Number of sequences better than 1.0e-15: 0 Number of HSP's better than 0.0 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 0 length of query: 69 length of database: 607 effective HSP length: 21 effective length of query: 48 effective length of database: 586 effective search space: 28128 effective search space used: 28128 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 42 (21.9 bits) S2: 155 (64.8 bits) testaln.pfam100644000766000024 1016014006427353 16731 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data1433_LYCES/9-246 REENVYMAKLADRAESDEEMVEFMEKVSNSLGS.EELTVEERNLLSVAYKNVIGARRASWRIISSIEQKEESRG.NEEHVNSIREYRSKIENELSKICDGILKLLDSKLIPSA..TSGDSKVFYLKMKGDYHRYLAEFKTGAERKEAAESTLTAYKAAQDIASAELAPTHPIRLGLALNFSVFYYEILNSPDRACNLAKQAFDEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSDMQDDG 1434_LYCES/6-243 REENVYLAKLAEQAERYEEMIEFMEKVAKTADV.EELTVEERNLLSVAYKNVIGARRASWRIISSIEQKEESRG.NEDHVNTIKEYRSKIEADLSKICDGILSLLESNLIPSA..STAESKVFHLKMKGDYHRYLAEFKTGTERKEAAENTLLAYKSAQDIALAELAPTHPIRLGLALNFSVFYYEILNSPDRACNLAKQAFDEAISELDTLGEESYKDSTLIMQLLRDNLTLWTSDNADDV 143R_ARATH/7-245 RDQYVYMAKLAEQAERYEEMVQFMEQLVTGATPAEELTVEERNLLSVAYKNVIGSLRAAWRIVSSIEQKEESRK.NDEHVSLVKDYRSKVESELSSVCSGILKLLDSHLIPSA..GASESKVFYLKMKGDYHRYMAEFKSGDERKTAAEDTMLAYKAAQDIAAADMAPTHPIRLGLALNFSVFYYEILNSSDKACNMAKQAFEEAIAELDTLGEESYKDSTLIMQLLRDNLTLWTSDYAGAD 143B_VICFA/7-242 
RENFVYIAKLAEQAERYEEMVDSMKNVANLDV...ELTIEERNLLSVGYKNVIGARRASWRILSSIEQKEESKG.NDVNAKRIKEYRHKVETELSNICIDVMRVIDEHLIPSA..AAGESTVFYYKMKGDYYRYLAEFKTGNEKKEAGDQSMKAYESATTAAEAELPPTHPIRLGLALNFSVFYYEILNSPERACHLAKQAFDEAISELDTLNEESYKDSTLIMQLLRDNLTLWTSDIPEDG 143E_HUMAN/4-239 REDLVYQAKLAEQAERYDEMVESMKKVAGMDV...ELTVEERNLLSVAYKNVIGARRASWRIISSIEQKEENKG.GEDKLKMIREYRQMVETELKLICCDILDVLDKHLIPAA..NTGESKVFYYKMKGDYHRYLAEFATGNDRKEAAENSLVAYKAASDIAMTELPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQGDG BMH1_YEAST/4-240 REDSVYLAKLAEQAERYEEMVENMKTVASSGQ...ELSVEERNLLSVAYKNVIGARRASWRIVSSIEQKEESKEKSEHQVELICSYRSKIETELTKISDDILSVLDSHLIPSA..TTGESKVFYYKMKGDYHRYLAEFSSGDAREKATNASLEAYKTASEIATTELPPTHPIRLGLALNFSVFYYEIQNSPDKACHLAKQAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMSESG RA24_SCHPO/6-241 REDAVYLAKLAEQAERYEGMVENMKSVASTDQ...ELTVEERNLLSVAYKNVIGARRASWRIVSSIEQKEESKG.NTAQVELIKEYRQKIEQELDTICQDILTVLEKHLIPNA..ASAESKVFYYKMKGDYYRYLAEFAVGEKRQHSADQSLEGYKAASEIATAELAPTHPIRLGLALNFSVFYYEILNSPDRACYLAKQAFDEAISELDSLSEESYKDSTLIMQLLRDNLTLWTSDAEYSA RA25_SCHPO/5-240 RENSVYLAKLAEQAERYEEMVENMKKVACSND...KLSVEERNLLSVAYKNIIGARRASWRIISSIEQKEESRG.NTRQAALIKEYRKKIEDELSDICHDVLSVLEKHLIPAA..TTGESKVFYYKMKGDYYRYLAEFTVGEVCKEAADSSLEAYKAASDIAVAELPPTDPMRLGLALNFSVFYYEILDSPESACHLAKQVFDEAISELDSLSEESYKDSTLIMQLLRDNLTLWTSDAEYNQ 1431_ENTHI/4-239 REDCVYTAKLAEQSERYDEMVQCMKQVAEMEA...ELSIEERNLLSVAYKNVIGAKRASWRIISSLEQKEQAKG.NDKHVEIIKGYRAKIEKELSTCCDDVLKVIQENLLPKA..STSESKVFFKKMEGDYYRYFAEFTVDEKRKEVADKSLAAYTEATEISNAELAPTHPIRLGLALNFSVFYFEIMNDADKACQLAKQAFDDAIAKLDEVPENMYKDSTLIMQLLRDNLTLWTSDACDEE 1432_ENTHI/4-238 REDLVYLSKLAEQSERYEEMVQYMKQVAEMGT...ELSVEERNLISVAYKNVVGSRRASWRIISSLEQKEQAKG.NTQRVELIKTYRAKIEQELSQKCDDVLKIITEFLLKNS..TSIESKVFFKKMEGDYYRYYAEFTVDEKRKEVADKSLAAYQEATDTA.ASLVPTHPIRLGLALNFSVFYYQIMNDADKACQLAKEAFDEAIQKLDEVPEESYKESTLIMQLLRDNLTLWTSDMGDDE 1433_CAEEL/5-237 
VEELVQRAKLAEQAERYDDMAAAMKKVTEQGQ...ELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEG...SEKKQQLAKEYRVKVEQELNDICQDVLKLLDEFLIVKA..GAAESKAFYLKMKGDYYRYLAEVAS.EDRAAVVEKSQKAYQEALDIAKDKMQPTHPIRLGLALNFSVFYYEILNTPEHACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDVGAED 143Z_DROME/6-239 KEELVQKAKLAEQSERYDDMAQAMKSVTETGV...ELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEA...SARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKA..SNPESKVFYLKMKGDYYRYLAEVATGDARNTVVDDSQTAYQDAFDISKGKMQPTHPIRLGLALNFSVFYYEILNSPDKACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDE 1433_XENLA/1-227 .......AKLSEQAERYDDMAASMKAVTELGA...ELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEG...NDKRQQMAREYREKVETELQDICKDVLDLLDRFLVPNA..TPPESKVFYLKMKGDYYRYLSEVASGDSKQETVASSQQAYQEAFEISKSEMQPTHPIRLGLALNFSVFYYEILNSPEKACSLAKSAFDEAIRELDTLNEESYKDSTLIMQLLRDNLTLWTSENQGEE 143T_HUMAN/3-236 KTELIQKAKLAEQAERYDDMATCMKAVTEQGA...ELSNEERNLLSVAYKNVVGGRRSAWRVISSIEQKTDT...SDKKLQLIKDYREKVESELRSICTTVLELLDKYLIANA..TNPESKVFYLKMKGDYFRYLAEVACGDDRKQTIDNSQGAYQEAFDISKKEMQPTHPIRLGLALNFSVFYYEILNNPELACTLAKTAFDEAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDSAGEE 143F_MOUSE/3-240 REQLLQRARLAEQAERYDDMASAMKAVTELNE...PLSNEDRNLLSVAYKNVVGARRSSWRVISSIEQKTMADG.NEKKLEKVKAYREKIEKELETVCNDVLALLDKFLIKNCNDFQYESKVFYLKMKGDYYRYLAEVASGEKKNSVVEASEAAYKEAFEISKEHMQPTHPIRLGLALNFSVFYYEIQNAPEQACLLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDQQDEE 143S_HUMAN/3-238 RASLIQKAKLAEQAERYEDMAAFMKGAVEKGE...ELSCEERNLLSVAYKNVVGGQRAAWRVLSSIEQKSNEEG.SEEKGPEVREYREKVETELQGVCDTVLGLLDSHLIKEA..GDAESRVFYLKMKGDYYRYLAEVATGDDKKRIIDSARSAYQEAMDISKKEMPPTNPIRLGLALNFSVFHYEIANSPEEAISLAKTTFDEAMADLHTLSEDSYKDSTLIMQLLRDNLTLWTADNAGEE tab2part.mif100644000766000024 35214006427353 16570 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataDIP:5000E DIP:3122N PIR:G64606 GI:2313818 DIP:3054N PIR:G64525 GI:2313121 DIP:5001E DIP:3122N PIR:G64606 GI:2313818 DIP:3054N PIR:G64525 GI:2313121 DIP:5002E DIP:3122N PIR:G64606 GI:2313818 DIP:3054N PIR:G64525 GI:2313121 test.embl2sq100644000766000024 3241714006427353 16671 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID SC10H5 standard; 
DNA; PRO; 4870 BP. XX AC AL031232; XX DE Streptomyces coelicolor cosmid 10H5. XX KW integral membrane protein. XX OS Streptomyces coelicolor OC Eubacteria; Firmicutes; Actinomycetes; Streptomycetes; OC Streptomycetaceae; Streptomyces. XX RN [1] RP 1-4870 RA Oliver K., Harris D.; RT ; RL Unpublished. XX RN [2] RP 1-4870 RA Parkhill J., Barrell B.G., Rajandream M.A.; RT ; RL Submitted (10-AUG-1998) to the EMBL/GenBank/DDBJ databases. RL Streptomyces coelicolor sequencing project, RL Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA RL E-mail: barrell@sanger.ac.uk RL Cosmids supplied by Prof. David A. Hopwood, [3] RL John Innes Centre, Norwich Research Park, Colney, RL Norwich, Norfolk NR4 7UH, UK. XX RN [3] RP 1-4870 RA Redenbach M., Kieser H.M., Denapaite D., Eichner A., RA Cullum J., Kinashi H., Hopwood D.A.; RT "A set of ordered cosmids and a detailed genetic and physical RT map for the 8 Mb Streptomyces coelicolor A3(2) chromosome."; RL Mol. Microbiol. 21(1):77-96(1996). XX CC Notes: CC CC Streptomyces coelicolor sequencing at The Sanger Centre is funded CC by the BBSRC. CC CC Details of S. coelicolor sequencing at the Sanger Centre CC are available on the World Wide Web. CC (URL; http://www.sanger.ac.uk/Projects/S_coelicolor/) CC CC CDS are numbered using the following system eg SC7B7.01c. CC SC (S. coelicolor), 7B7 (cosmid name), .01 (first CDS), CC c (complementary strand). CC CC The more significant matches with motifs in the PROSITE CC database are also included but some of these may be fortuitous. CC CC The length in codons is given for each CDS. CC CC Usually the highest scoring match found by fasta -o is given for CC CDS which show significant similarity to other CDS in the database. CC The position of possible ribosome binding site sequences are CC given where these have been used to deduce the initiation codon. 
CC CC Gene prediction is based on positional base preference in codons CC using a specially developed Hidden Markov Model (Krogh et al., CC Nucleic Acids Research, 22(22):4768-4778(1994)) and the FramePlot CC program of Bibb et al., Gene 30:157-66(1984) as implemented at CC http://www.nih.go.jp/~jun/cgi-bin/frameplot.pl. CAUTION: We may CC not have predicted the correct initiation codon. Where possible CC we choose an initiation codon (atg, gtg, ttg or (att)) which is CC preceded by an upstream ribosome binding site sequence (optimally CC 5-13bp before the initiation codon). If this cannot be identified CC we choose the most upstream initiation codon. CC CC IMPORTANT: This sequence MAY NOT be the entire insert of CC the sequenced clone. It may be shorter because we only CC sequence overlapping sections once, or longer, because we CC arrange for a small overlap between neighbouring submissions. CC CC Cosmid 10H5 lies to the right of 3A7 on the AseI-B genomic restriction CC fragment. XX FH Key Location/Qualifiers FH FT source 1..4870 FT /organism="Streptomyces coelicolor" FT /strain="A3(2)" FT /clone="cosmid 10H5" FT CDS complement(<1..327) FT /note="SC10H5.01c, unknown, partial CDS, len >109 aa; FT possible integral membrane protein" FT /gene="SC10H5.01c" FT /product="hypothetical protein SC10H5.01c" FT CDS complement(350..805) FT /note="SC10H5.02c, probable integral membrane protein, len: FT 151 aa; similar to S. coelicolor hypothetical protein FT TR:O54194 (EMBL:AL021411) SC7H1.35 (155 aa), fasta scores; FT opt: 431 z-score: 749.8 E(): 0, 53.5% identity in 114 aa FT overlap." 
FT /product="putative integral membrane protein" FT /gene="SC10H5.02c" FT RBS complement(812..815) FT /note="possible RBS upstream of SC10H5.02c" FT CDS complement(837..1301) FT /note="SC10H5.03c, probable integral membrane protein, len: FT 154 aa" FT /product="putative integral membrane protein" FT /gene="SC10H5.03c" FT RBS complement(1308..1312) FT /note="possible RBS upstream of SC10H5.03c" FT CDS complement(1427..1735) FT /note="SC10H5.04c, unknown, len: 103 aa; possible membrane" FT /gene="SC10H5.04c" FT /product="hypothetical protein SC10H5.04c" FT RBS complement(1738..1741) FT /note="possible RBS upstream of SC10H5.05c" FT misc_feature 1800^1801 FT /note="Zero-length feature added to test Bioperl parsing" FT CDS 1933..2022 FT /note="SC10H5.05, questionable ORF, len: 29 aa" FT /gene="SC10H5.05" FT /product="hypothetical protein SC10H5.05" FT CDS 2019..2642 FT /note="SC10H5.06, probable membrane protein, len: 207 aa; FT similar to S. coelicolor TR:O54192 SC7H1.33c (191 aa), FT fasta scores; opt: 312 z-score: 355.2 E(): 1.6e-12, 36.8% FT identity in 182 aa overlap" FT /product="putative membrane protein" FT /gene="SC10H5.06" FT RBS 2627..2631 FT /note="possible RBS upstream of SC10H5.07" FT CDS 2639..4048 FT /note="SC10H5.07, unknown, len: 469 aa" FT /gene="SC10H5.07" FT /product="hypothetical protein SC10H5.07" FT CDS complement(4100..4297) FT /note="SC10H5.08c, unknown, len: 65 aa" FT /gene="SC10H5.08c" FT /product="hypothetical protein SC10H5.08c" FT RBS complement(4314..4319) FT /note="possible RBS upstream of SC10H5.08c" FT CDS complement(4439..>4870) FT /note="SC10H5.09c, probable integral membrane protein, FT partial CDS len: >143 aa; some similarity in C-terminus to FT S. coelicolor hypothetical protein TR:O54106 FT (EMBL:AL021529) SC10A5.15 (114 aa), fasta scores; opt: 145 FT z-score: 233.8 E(): 9.2e-06, 33.3% identity in 81 aa FT overlap. 
Overlaps and extends SC3A7.01c" FT /product="putative integral membrane protein" FT /gene="SC10H5.09c" FT misc_feature 4769..4870 FT /note="overlap with cosmid 3A7 from 1 to 102" XX SQ Sequence 4870 BP; 769 A; 1717 C; 1693 G; 691 T; 0 SQ other; gatcagtaga cccagcgaca gcagggcggg gcccagcagg ccggccgtgg cgtagagcgc 60 gaggacggcg accggcgtgg ccaccgacag gatggctgcg gcgacgcgga cgacaccgga 120 gtgtgccagg gcccaccaca cgccgatggc cgcgagcgcg agtcccgcgc tgccgaacag 180 ggcccacagc acactgcgca gaccggcggc cacgagtggc gccaggacgg tgcccagcag 240 gagcagcagg gtgacgtggg cgcgcgctgc actgtggccg ccccgtccgc ccgacgcgcg 300 cggctcgtca tctcgcggtc ccaccaccgg tcggccccat tactcgtcct caaccctgtg 360 gcgactgacg ttccccggac aggtcgtacc gattgccgcc acgccccacc acgcacaggg 420 cccagacgac gaagcctgac atggtgatca tgacgacgga ccacaccggg tagtacggca 480 gcgagaggaa gttggcgatg atcaccagcc cggcgatggc gaccccggtg acacgtgccc 540 acatcgccgt tttgagcagc ccggcgctga cgaccatggc gagcgcgccg agcgcgagat 600 ggatccaccc ccacccggtg agatcgaact ggaaaacgta gttgggcgtg gtgacgaaga 660 cgtcgtcctc ggcgatggcc atgatgcccc ggaagaggct gagcagcccg gcgaggaaga 720 gcatcaccgc cgcgaaggcg gtaaggcccg tcgcccattc ctgcctcgcg gtgtgtgccg 780 ggtggtgggt atgtgacgtg gtcatctcgg acctcgtttc gtggaatgcg gatgcttcag 840 cgagcggagg cgccggtgcc cgccgcgccc gtgtgccctg ccgggccgtg accggacagg 900 accaattcct tcgccttgcg gaactcctcg tccgtgatgg caccccggtc tcggatctcg 960 gagagccggg ccagctcgtc gacgctgctg gacccgccgc ccacggtctt cctgatgtag 1020 gcgtcgaact cctcctgctg agcccgtgcc cgcgttgtct cccggctgcc catgttcttg 1080 ccgcgagcga tcacgtagac gaaaacgccc aggaagggca ggaggatgca gaacaccaac 1140 cagccggcct tcgcccagcc actcagtccg tcgtcccgga agatgtcggt gacgacgcgg 1200 aagagcagga cgaaccacat gatccacagg aagatcatca gcatcgtcca gaaggcaccc 1260 agcagtgggt agtcgtacgc caggtaggtc tgtgcactca tgtccgtcct ccgtcctccg 1320 gggcgcggcc cggcggccct cgttccgtac tgacatcagg gtggtcacgg gtcccaccgg 1380 tcggcatcac ccggcacggg tgagtggggc gccgaggccg tcgtggtcag gcccgggaca 1440 ccggtgtgac cctggtggaa ggacgcgtcc cgtggggcac gcaccgccgg ccgagggcga 
1500 ccaccgcctc ggtcagtccg agcaggccca gccacaggcc gagaagtcgg gtcagggcac 1560 gggccgactc ggcgggcagc gcgaggacga cgattccggc gacgtcgacg gccagcgggt 1620 tgcgcaggcc cagcactccg gccggggcgc ccggcaccag cgtggcgagg gccgatgcca 1680 tgagccaggt ccaggaaccc ccaagcctgg cgaggacgtg cgccggatcg ctcaatgctc 1740 cggtgaccgc cccgcccgac ccgtctccct tgtcggcagg ttccgccgca tcacgcggaa 1800 cggagatggc tcccctgtgg atcgggcggc cgctgcgggg ccgcccggtt ggtcggtcgg 1860 tgagcgccgg actccccctt cagctcttcc agggtcgggg tcgacaccga ggtcctggat 1920 cacccgtcag gggtgatccg ggcatgccgt cgtggcggtg aggtgggata cgggaacgat 1980 cggcccacgg gggaccggac gagacgaaga gacgtgagat gagcgatacg aactcgggcg 2040 gcgggcgcca ggccgcttcc ggaccggccc cacgtggccg actccctttc cgccggcgcg 2100 tggccctggt cgctgtcgca cgtcccctga tcgtcacggt cggtctcgtc accgcctact 2160 acctgcttcc cctggacgag agactcagcg ccggcaccct ggtgtcgctg gtgtgcggac 2220 tgctcgcagt ccttctggtg ttctgctggg aggtgcgggc catcacgcgc tccccgcatc 2280 cgcgtctgag agcgatcgag ggcctggccg ccacgctggt gctgttcctg gtcctcttcg 2340 ccggctccta ctacctgctg ggtcgctccg cgcccggctc cttcagcgag ccgctgaaca 2400 ggacggacgc gctgtacttc actctgacca cgttcgccac cgtcggcttc ggggacatca 2460 ccgcacgctc cgagaccggg cggatcctca cgatggcgca gatgacggga gggctactgc 2520 tcgtcggagt cgccgcccgg gtgctggcga gcgcagtgca ggcggggctg caccgacagg 2580 gccggggacc ggcggcatcg ccacgctccg gtgctgcgga ggagccggag gccggaccat 2640 gaccgtaccc ggtggcttca ccgcctccct gccgccggcc gagcgagccg cgtacggcag 2700 gaaggcccgt aaaagggcct cacgttcgtg ccacggctgg tacgagccgg ggcagcggcg 2760 gcctgacccc gtcgacctgc tggagcgcca gtccggcgag cgtgtcccgg cactcgtgcc 2820 catccgctac ggtcgcatgc tggagtcgcc gttccgcttc taccgcggtg cggcagcgat 2880 catggcggcg gacctggcac ccctgcccag cagcggactc caggtgcaat tgtgcgggga 2940 cgcgcacccg ttgaacttcc ggctcctggc ctcaccggag cgccggctgg tcttcgacat 3000 caacgacttc gacgagacgc tgcccggccc cttcgagtgg gacgtcaaac ggctggcggc 3060 cggattcgtg atcgcggccc ggtcgaacgg cttctcgtcc aaggaacaga accgcaccgt 3120 tcgggcctgt gtgcgggcct accgggagcg catgagggag ttcgccgtca tgccgaccct 3180 
ggacatctgg tacgcccagg acgacgccga ccacgtacgg caactgctgg ctacggaggc 3240 cagaggagaa gctgagcagc ggctcaggga cgcggctgcg aaggcccgca cacgcaccca 3300 catgagggcg ttcgcgaagc tcacccgcgt cacggccgag ggccggcgca tcacccccga 3360 cccgccgctg atcaccccac tcggcgatct gctcaccgac ccggccgaag ccggccggga 3420 ggaggaactg cggtccgtcg tgaacggcta cgcacggtcc ctgccgcccg agcgccggca 3480 cctgctgcgt cactaccggc ttgtggacat ggcgcgcaag gtggtcggcg tcggcagtgt 3540 cggcacccgc tgctgggtac tgcttctgct cggcagggac gacgacgatc ctctgctgct 3600 ccaggccaag gaagcctcgg aatcggtgct ggcggcccac acgggcggcg aacgctacga 3660 ccatcagggc cgcagggtcg tggccggcca gcgtctgatc cagaccaccg gtgacatctt 3720 tctcggctgg gcgcgcgtca ccggcttcga cggaaaggcc cgggacttct acgtgcgtca 3780 actgtgggac tggaagggcg tcgcgcggcc ggaaaccatg gggcccgacc tgctctccct 3840 cttcgcccgg ctgtgcggtg cctgcctggc gagggcccac gcccgttccg gtgaccccgt 3900 cgcgctcgcc gcgtacctgg gcggcagcga ccgcttcgac ggcgcgctca ccgagttcgc 3960 ccagtcctac gccgatcaga atgaacgcga ccacgaagct ctgctggcgg cctgccgctc 4020 cggcagggtc acggccgccc gtttgtgagg ccgacccggg aacggccggc gggctggcac 4080 acaccgccgc cggtcggcgt cattccggaa gctgccgcat ctccaggacg cgcaggccca 4140 gcgactggca gcgggtgagc aacccgtaca gatgggcctc gtcgatcacc gtgccgaaca 4200 gcacggtctg gccggacatg acgacgtgct ccagctccgg gaacgcgttg gccagcgtcc 4260 gtgacaggtg tccctcgacg cggatctcgt agcgcacgag cggtcctttc accgtaggag 4320 ctcgggacac cgcccggggc tccgggtcgg acggtgctct tggtgacgag cctgcgcctc 4380 gtcgccctcc ggtgccctca cccagcacag gtgactccaa ccgcagtgtc agtgcctttc 4440 agtgcgtcac tgtgatcttg acgacgacga tcaccaggcc gagcagtacg ttgaccgtcg 4500 cggtgacggc caccagtcgt cgcgaggcgc ccgcgcggtg cgccgcggcg acggaccagc 4560 ccacctgacc ggcgacggcg acggacagcg ccagccacag ggtgcccggg acgtccagcc 4620 ccagtacggg gctgacggcg atggccgcgg ccggaggcac ggcggccttg acgatcggcc 4680 actcctcgcg gcacacacgc agaatcaccc gccggtccgg agtgtgccgc gcgagacgcg 4740 ctccgaacag ttcggcgtgg acgtgagcga tccagaacac caagctggtg agcaacagca 4800 gaagaaccag ttcggcgcgg gggaacgagc ccagggtgcc ggcgccgatc acgacggagg 4860 ctgcgagcat 
4870 // BC000007.gbk100644000766000024 1563414006427353 16045 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS BC000007 981 bp mRNA linear PRI 09-DEC-2005 DEFINITION Homo sapiens px19-like protein, mRNA (cDNA clone MGC:1082 IMAGE:3505068), complete cds. ACCESSION BC000007 VERSION BC000007.2 GI:33875090 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 981) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 981) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (03-NOV-2000) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced gi:12652536. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 7 Row: f Column: 3 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31543450. Differences found between this sequence and the human reference genome (build 36) are described in misc_difference features below and these differences were also compared to chimpanzee genomic sequences available as of 09/15/2004. 
FEATURES Location/Qualifiers source 1..981 /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:1082 IMAGE:3505068" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..981 /gene="PX19" /note="synonyms: CGI-106, PRELI" /db_xref="GeneID:27166" /db_xref="MIM:605733" CDS 174..833 /gene="PX19" /codon_start=1 /product="PX19 protein" /protein_id="AAH00007.1" /db_xref="GI:12652537" /db_xref="GeneID:27166" /db_xref="MIM:605733" /translation="MVKYFLGQSVLRSSWDQVFAAFWQRYPNPYSKHVLTEDIVHREV TPDQKLLSRRLLTKTNRMPRWAERLFPANVAHSVYVLEDSIVDPQNQTMTTFTWNINH ARLMVVEERCVYCVNSDNSGWTEIRREAWVSSSLFGVSRAVQEFGLARFKSNVTKTMK GFEYILAKLQGEAPSKTLVETAKEAKEKAKETALAATEKAKDLASKAATKKQQQQQQF V" misc_difference 623 /gene="PX19" /note="'G' in cDNA is 'A' in the human genome; no amino acid change. The chimpanzee genome agrees with the cDNA sequence, suggesting that this difference is unlikely to be due to an artifact." misc_difference 878 /gene="PX19" /note="'C' in cDNA is 'T' in the human genome. The chimpanzee genome agrees with the cDNA sequence, suggesting that this difference is unlikely to be due to an artifact." misc_difference 925..981 /gene="PX19" /note="polyA tail: 57 bases do not align to the human genome." 
ORIGIN 1 ctcatggcgg cggcggcggc ggcggcagct gcttgggcgc ggtgcggtgg tgactgagct 61 acgagcctgg cggcgggtgt gcgccgagcc ccggcccggc ccggccctcg cgtgcctccc 121 aggctccgca cccctgatgc tgcgcgggtg ctgagcccgc ttcggccggg acgatggtga 181 agtatttcct gggccagagc gtgctccgga gttcctggga ccaagtgttc gccgccttct 241 ggcagcggta cccgaatccc tatagcaaac atgtcttgac ggaagacata gtacaccggg 301 aggtgacccc tgaccagaaa ctgctgtccc ggcgactcct gaccaagacc aacaggatgc 361 cacgctgggc cgagcgacta tttcctgcca atgttgctca ctcggtgtac gtcctggagg 421 actctattgt ggacccacag aatcagacca tgactacctt cacctggaac atcaaccacg 481 cccggctgat ggtggtggag gaacgatgtg tttactgtgt gaactctgac aacagtggct 541 ggactgaaat ccgccgggaa gcctgggtct cctctagctt atttggtgtc tccagagctg 601 tccaggaatt tggtcttgcc cggttcaaaa gcaacgtgac caagactatg aagggttttg 661 aatatatctt ggctaagctg caaggcgagg ccccttccaa aacacttgtt gagacagcca 721 aggaagccaa ggagaaggca aaggagacgg cactggcagc tacagagaag gccaaggacc 781 tcgccagcaa ggcggccacc aagaagcagc agcagcagca acagtttgtg tagccagtct 841 accaccacca cagcacccca gacagctagg cttagcccct ctgccctccc ttcattgtac 901 tttatcatta aaaatcaact tccaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 961 aaaaaaaaaa aaaaaaaaat a // roa1.genbank100644000766000024 1034214006427353 16605 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS AI129902 37 bp mRNA EST 27-OCT-1998 DEFINITION qc41b07.x1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:1712149 3' similar to SW:ROA1_SCHAM P21522 HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A1, A2/B1 HOMOLOG. ;contains MSR1.b2 MSR1 repetitive element ;, mRNA sequence. ACCESSION AI129902 NID g3598416 VERSION AI129902.1 GI:3598416 KEYWORDS EST. SOURCE human. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 37) AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), Tumor Gene Index JOURNAL Unpublished (1997) COMMENT On May 8, 1995 this sequence version replaced gi:800643 Contact: Robert Strausberg, Ph.D. Tel: (301) 496-1550 Email: Robert_Strausberg@nih.gov This clone is available royalty-free through LLNL ; contact the IMAGE Consortium (info@image.llnl.gov) for further information. Trace considered overall poor quality Insert Length: 525 Std Error: 0.00 Seq primer: -40m13 fwd. ET from Amersham High quality sequence stop: 1. FEATURES Location/Qualifiers source 1..37 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="IMAGE:1712149" /clone_lib="Soares_pregnant_uterus_NbHPU" /sex="female" /dev_stage="adult" /lab_host="DH10B" /note="Organ: uterus; Vector: pT7T3-Pac; Site_1: Not I; Site_2: Eco RI; 1st strand cDNA was primed with a Not I - oligo(dT) primer [5' AACTGGAAGAATTCGCGGCCGCCTTTTTTTTTTTTTTTTTT 3'], double-stranded cDNA was ligated to Eco RI adaptors (Pharmacia), digested with Not I and cloned into the Not I and Eco RI sites of the modified pT7T3 vector. Library went through one round of normalization. Library constructed by M. Fatima Bonaldo." BASE COUNT 5 a 28 c 2 g 2 t ORIGIN 1 ctccgcgcca actcccccca cccccccccc acacccc // LOCUS BAB68554 141 aa linear VRT 11-APR-2002 DEFINITION alpha D-globin [Aldabrachelys elephantina]. ACCESSION BAB68554 PID g15824047 VERSION BAB68554.1 GI:15824047 DBSOURCE accession AB072353.1 KEYWORDS . SOURCE Aldabra giant tortoise. ORGANISM Aldabrachelys elephantina Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Testudines; Cryptodira; Testudinoidea; Testudinidae; Aldabrachelys. REFERENCE 1 AUTHORS Shishikura,F. TITLE The primary structure of hemoglobin D from the Aldabra giant tortoise, Geochelone gigantea JOURNAL Zoolog. Sci. 19, 197-206 (2002) REFERENCE 2 (residues 1 to 141) AUTHORS Shishikura,F. 
TITLE Direct Submission JOURNAL Submitted (29-SEP-2001) Fumio Shishikura, Nihon University School of Medicine, Biology; Oyaguchi-kamimachi, 30-1, Itabashi-ku, Tokyo 173-8610, Japan (E-mail:fshishi@med.nihon-u.ac.jp, Tel:81-3-3972-8111(ex.2291), Fax:81-3-3972-0027) FEATURES Location/Qualifiers source 1..141 /organism="Aldabrachelys elephantina" /db_xref="taxon:167804" /note="synonym:Dipsochelys dussumieri~synonym:Geochelone gigantea" Protein 1..141 /product="alpha D-globin" CDS 1..141 /coded_by="join(AB072353.1:1..92,AB072353.1:307..511, AB072353.1:739..>864)" /note="hemoglobin D" ORIGIN 1 mlteddkqli qhvwekvleh qedfgaeale rmfivypstk tyfphfdlhh dseqirhhgk 61 kvvgalgdav khidnlsatl selsnlhayn lrvdpvnfkl lshcfqvvlg ahlgreytpq 121 vqvaydkfla avsavlaeky r // Q8GBD3.swiss100644000766000024 417714006427353 16367 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID CH10_ACEAC Reviewed; 97 AA. AC Q8GBD3; DT 11-APR-2003, integrated into UniProtKB/Swiss-Prot. DT 01-MAR-2003, sequence version 1. DT 12-JUN-2007, entry version 26. DE 10 kDa chaperonin (Protein Cpn10) (groES protein). GN Name=groS; Synonyms=groES; OS Acetobacter aceti. OC Bacteria; Proteobacteria; Alphaproteobacteria; Rhodospirillales; OC Acetobacteraceae; Acetobacter; Acetobacter subgen. Acetobacter. OX NCBI_TaxID=435; RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA]. RC STRAIN=IFO 3283 / LMG 1513; RX PubMed=16233284; DOI=10.1263/jbb.94.140; RA Okamoto-Kainuma A., Yan W., Kadono S., Tayama K., Koizumi Y., RA Yanagida F.; RT "Cloning and characterization of groESL operon in Acetobacter aceti."; RL J. Biosci. Bioeng. 94:140-147(2002). CC -!- FUNCTION: Binds to Cpn60 in the presence of Mg-ATP and suppresses CC the ATPase activity of the latter. CC -!- SUBUNIT: Heptamer of 7 subunits arranged in a ring (By CC similarity). CC -!- SUBCELLULAR LOCATION: Cytoplasm (By similarity). CC -!- SIMILARITY: Belongs to the groES chaperonin family. 
CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AB081586; BAC16231.1; -; Genomic_DNA. DR HSSP; P09621; 1HX5. DR GO; GO:0006457; P:protein folding; IEA:HAMAP. DR HAMAP; MF_00580; -; 1. DR InterPro; IPR001476; Chaprnin_Cpn10. DR Gene3D; G3DSA:2.30.33.40; Chaprnin_Cpn10; 1. DR PANTHER; PTHR10772; Chaprnin_Cpn10; 1. DR Pfam; PF00166; Cpn10; 1. DR PRINTS; PR00297; CHAPERONIN10. DR ProDom; PD000566; Chaprnin_Cpn10; 1. DR PROSITE; PS00681; CHAPERONINS_CPN10; 1. KW Chaperone. FT CHAIN 1 97 10 kDa chaperonin. FT /FTId=PRO_0000174677. SQ SEQUENCE 97 AA; 10514 MW; 4DF9365DE3CDA23E CRC64; MTKFRPLHDR VVVRRLEGEQ KTAGGIIIPD TAQEKPMEGE VVAVGPGARN EQGQIVALDV KAGDRVLFGK WSGTEVKIDG EELLIMKESD IMGVVTA // sim4.for.for100644000766000024 55214006427353 16527 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data seq1 = human.genomic, 5368 bp seq2 = est.for (>hs_est), 479 bp 695-813 (1-119) 100% -> 1377-1500 (120-243) 99% -> 1797-1935 (244-382) 100% -> 2084-2180 (383-479) 100% seq1 = human.genomic, 5368 bp seq2 = est.for (>hs_est), 479 bp 695-813 (1-119) 100% -> 1377-1500 (120-243) 99% -> 1797-1935 (244-382) 100% -> 2084-2180 (383-479) 100% hybrid2.gff3100644000766000024 101514006427353 16503 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data##gff-version 3 chr17 UCSC mRNA 62467934 62469545 . - . ID=A00469;Dbxref=AFFX-U133:205840_x_at,Locuslink:2688,Genbank-mRNA:A00469,Swissprot:P01241,PFAM:PF00103,AFFX-U95:1332_f_at,Swissprot:SOMA_HUMAN;Note=growth%20hormone%201;Alias=GH1 chr17 UCSC CDS 62468039 62468236 . - 1 Parent=A00469 chr17 UCSC CDS 62468490 62468654 . - 2 Parent=A00469 chr17 UCSC CDS 62468747 62468866 . - 1 Parent=A00469 chr17 UCSC CDS 62469076 62469236 . - 1 Parent=A00469 chr17 UCSC CDS 62469497 62469506 . 
- 0 Parent=A00469 >A00469 GATTACA GATTACA NC_008536.gb100644000766000024 2651714006427353 16066 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS NC_008536 2001 bp DNA linear BCT 21-JUL-2008 DEFINITION Solibacter usitatus Ellin6076, complete genome. ACCESSION NC_008536 REGION: 1000..3000 VERSION NC_008536.1 GI:116619145 PROJECT GenomeProject:12638 KEYWORDS . SOURCE Solibacter usitatus Ellin6076 ORGANISM Solibacter usitatus Ellin6076 Bacteria; Acidobacteria; Solibacteres; Solibacterales; Solibacteraceae; Solibacter. REFERENCE 1 (bases 1 to 2001) AUTHORS Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Thompson,L.S., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Janssen,P.H., Kuske,C.R. and Richardson,P. CONSRTM US DOE Joint Genome Institute TITLE Complete sequence of Solibacter usitatus Ellin6076 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2001) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (24-OCT-2006) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 2001) AUTHORS Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Thompson,L.S., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Janssen,P.H., Kuske,C.R. and Richardson,P. CONSRTM US DOE Joint Genome Institute TITLE Direct Submission JOURNAL Submitted (06-OCT-2006) US DOE Joint Genome Institute, 2800 Mitchell Drive B100, Walnut Creek, CA 94598-1698, USA COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from CP000473. URL -- http://www.jgi.doe.gov JGI Project ID: 3634513 Source DNA and bacteria available from Cheryl R. Kuske (kuske@lanl.gov) Contacts: Cheryl R. 
Kuske (kuske@lanl.gov) Paul Richardson (microbes@cuba.jgi-psf.org) Quality assurance done by JGI-Stanford Annotation done by JGI-ORNL and JGI-PGF Finishing done by JGI-LANL Finished microbial genomes have been curated to close all gaps with greater than 98% coverage of at least two independent clones. Each base pair has a minimum q (quality) value of 30 and the total error rate is less than one per 50000. The JGI and collaborators endorse the principles for the distribution and use of large scale sequencing data adopted by the larger genome sequencing community and urge users of this data to follow them. It is our intention to publish the work of this project in a timely fashion and we welcome collaborative interaction on the project and analysis. (http://www.genome.gov/page.cfm?pageID=10506376). COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..2001 /organism="Solibacter usitatus Ellin6076" /mol_type="genomic DNA" /strain="Ellin6076" /db_xref="taxon:234267" gene <1..403 /locus_tag="Acid_0001" /db_xref="GeneID:4431688" CDS <1..403 /locus_tag="Acid_0001" /note="KEGG: aba:Acid345_0001 chromosomal replication initiator protein DnaA TIGRFAM: chromosomal replication initiator protein DnaA PFAM: Chromosomal replication initiator, DnaA C-terminal domain; Chromosomal replication initiator, DnaA SMART: AAA ATPase" /codon_start=1 /transl_table=11 /product="chromosomal replication initiator protein DnaA" /protein_id="YP_821302.1" /db_xref="GI:116619146" /db_xref="InterPro:IPR001957" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR013159" /db_xref="InterPro:IPR013317" /db_xref="GeneID:4431688" /translation="MNYWDQIRHYLQSKVSADGYDNWLKSTAFVGQNGDTLYVSVPDR ETRAWLETEYTQLIQGGIQALGLPVRHVSFEAEQSRPAAPAATIAVNANTEPDAAATV LNPKFTFNSFVVGACNQFAHAAAKSVATNPSRSYNPLFLYGGVGMGKTHLMHAIGRQL IDQFGSMRIIYTSSERFMNEMISCIRTERMQQFHQRYREADVLLIDDIQLLGNKERTQ EEFFHTFNELHDHQKQIVISSDSPPKDIPGLLERLRSRFEWGLMADIQPPDLETKMAI LDKKAEIEGVELPDDVRTFMASKTKSNVRELEGALVKLIAYSSLTGTPIHLQMAQQVL 
KHLVHVQDRRVTMDSIQKAVAEKFSIKQSQLKEKSNTKKVVYPRQVAMYLVKELTNAS LPEIGRAFGGKHHTTVIHSINKIEKDRQSDPELNRLLHSLMDSLQ" gene 638..1756 /locus_tag="Acid_0002" /db_xref="GeneID:4431689" CDS 638..1756 /locus_tag="Acid_0002" /EC_number="2.7.7.7" /note="KEGG: aba:Acid345_0002 DNA polymerase III, beta subunit TIGRFAM: DNA polymerase III, beta subunit PFAM: DNA polymerase III, beta chain" /codon_start=1 /transl_table=11 /product="DNA polymerase III, beta subunit" /protein_id="YP_821303.1" /db_xref="GI:116619147" /db_xref="InterPro:IPR001001" /db_xref="GeneID:4431689" /translation="MEFTVSKSDLVRELSLSQGVVEKKTTIPILSNVLLEAADDRITL TATDLELGIRCSCPARVKKEGSGTVPARKLLDYMRLLPEGDVNMKFLDNHWASITSGR SRTRIAGMSRESFPELPQMPEPIAQVPVKTLASMIARTSFAISMEESRFTLNGALLLM RPEGLTMVATDGHRLAYVQAEPAESGSADKPFRALVPKKAMSELTKLSDDAGEGAMAI VAGDDNHLFFQIGHRLLITRKLTGNFPDYERVLPKDHQFMAKLEKGEIRSAIERVAQF ADERSRAIRVQFNSGEVRVFSSSVETGESEESVPAEYTGPDIEIGFNAQYLLDFLRAI SQDHVAFELKDQKSAGELRPAGDAIADQYRYVVMPMRI" gene 1782..>2001 /locus_tag="Acid_0003" /db_xref="GeneID:4431690" CDS 1782..>2001 /locus_tag="Acid_0003" /EC_number="5.99.1.3" /note="KEGG: aba:Acid345_0008 DNA gyrase, B subunit TIGRFAM: DNA gyrase, B subunit PFAM: DNA gyrase, subunit B domain protein; ATP-binding region, ATPase domain protein domain protein; TOPRIM domain protein; DNA topoisomerase, type IIA, subunit B, region 2 domain protein SMART: DNA topoisomerase II" /codon_start=1 /transl_table=11 /product="DNA gyrase subunit B" /protein_id="YP_821304.1" /db_xref="GI:116619148" /db_xref="InterPro:IPR000565" /db_xref="InterPro:IPR001241" /db_xref="InterPro:IPR002288" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR006171" /db_xref="InterPro:IPR011557" /db_xref="InterPro:IPR011558" /db_xref="InterPro:IPR013506" /db_xref="GeneID:4431690" /translation="MPNEVNPVGGNGNVYDSSSIKVLEGLEAVRLRPAMYIGSTGEMG LHHLVYEVVDNSVDEALAGYATHIEVTIHDDNSVTVEDDGRGIPVGIHEEEGVSAAQV VMTKLHAGGKFDSNSYKVSGGLHGVGVSCVNALSEKLELEIWREGYTWTQEYSCGKPK ADLVRAGKAGKRTGTRVTFKPDSTIMEVAVFNYDTLAQRLRELAFLNKGLKITLTDER 
VDPEKVTPFEYQGGISEFIKHLNRGKSVLHEKPIHFEAEREMPEKGTITMEVALQYND AYSENVFSFANNINTVDGGTHLSGFRSALTRTINAFGQKNGLFKDVKENLSGDDVREG LTAVVSVKVPQPQFEGQTKGKLNSDIAGYMTQFVNDKLSEYFDKNSAVGRKIVGKAIE ASRAREAARKARDLTRRKGALDSGGLPGKLADCQEKDPKLCELFLVEGESAGGTAKQG RERRFQAILPLKGKILNVEKARYDKMLGHEEIRCMITALGTGIGKDDFDVAKLRYDKI IIMTDADVDGSHIRTLLLTFFFRHMNELITRGKVYIAQPPLYRIKKGKSEKYIKDEKE FTKEIMRRATENLTVEIHSNGDGKPKSSLEGQELRTFLLNLDEFQQIFHKVERRLRDS RAVDVLTNVDLHVDNKAEFQVEANLKPVFEAMKAKGLNPQMKKDEEHSAWAVVFHDST GAERSVGLQLAAQPEYRRFRALARTIAKFNEPPFVVVKNEHRDAQPDWVELLEYVKNE GKKDASVQRYKGLGEMNAEQLADTTMNPEKRTLLEVRLEDAVQCEEIFSTLMGEDVES RRKFIEENALDVKNLDV" ORIGIN 1 ccgcgaactc gaaggcgccc tggttaaatt gatcgcgtac tcctccctca ccggaacgcc 61 gatccacctg cagatggcgc aacaggtcct gaagcacctg gtccacgtgc aggaccgccg 121 cgtcaccatg gattccatcc aaaaggccgt cgcggagaaa ttcagcatta agcagtccca 181 gctcaaggaa aagagcaata ccaagaaggt ggtctatccg cggcaggtgg caatgtacct 241 ggtcaaggag ctcaccaacg cgtcgctacc tgagatcggg cgcgctttcg gcggaaagca 301 tcacaccacc gtgatccact cgatcaacaa aatcgagaag gatcgccaga gcgaccctga 361 attgaacagg ttactacaca gcctaatgga ctcattacaa tgaccttcca ccgttttcca 421 caattgccct tgcaccgggc atggggaaat ctgtgtatat cgcatgacag cgttggtttt 481 gcgcaatttt ccgcgccgca atccgcagct tgacgtagca gaaagtggct gagtttagta 541 taatttagat agtcttcaac actttgcaca gggcctatga atacggttat ttatcaggcc 601 tatctataga gacagcaaag aactaggagc cgtgggcatg gaattcaccg tcagcaagag 661 cgatctggta cgtgagctga gtctctcgca gggtgtggtg gaaaaaaaga ccaccattcc 721 aatcctgtcg aacgtgctcc tggaggccgc ggacgatcgc attacgttga ctgcgaccga 781 cctggaactg ggcatccgct gttcttgccc ggcgcgcgtc aagaaagaag ggtccggcac 841 ggtgccggcg cgcaaactgc tcgattacat gcgtctgttg cccgaaggcg acgtgaacat 901 gaagttcctg gacaatcact gggccagtat caccagcgga cgttctcgca ccaggattgc 961 gggcatgtcg cgcgagagct tccccgaact gccgcagatg ccggagccga tcgcgcaggt 1021 gcccgtcaaa acgctggcct ccatgatcgc ccgtacgtcc ttcgccatct ccatggagga 1081 gtcgcgcttt acgctcaacg gcgcgctcct gctgatgcgc cccgaaggcc tcaccatggt 1141 cgccaccgat ggccaccgcc tcgcctacgt gcaggccgag 
cccgcggaga gcggcagcgc 1201 cgacaaaccc ttccgcgccc tcgtccccaa gaaggcgatg agcgaattga ccaagctcag 1261 cgatgatgcc ggcgaaggcg ccatggccat cgtcgccggt gatgacaatc acctgttctt 1321 ccaaattggc caccgcctgc tgatcacgcg aaagctcacc ggcaacttcc cggattacga 1381 gcgcgtcttg cccaaagatc accagttcat ggccaagctc gagaagggcg aaatccgctc 1441 ggcaatcgaa cgcgtggcgc agttcgccga tgaacggtcg cgcgccattc gggtgcaatt 1501 caacagcggc gaagtccgcg tcttcagttc cagcgtcgaa accggcgaga gcgaagagag 1561 cgttcccgcc gagtacacgg gtcccgatat cgagatcggt ttcaatgccc aatacctgct 1621 cgatttcctg cgtgcgattt ctcaggatca cgttgccttc gaattgaagg atcaaaaaag 1681 cgcgggcgaa ctccgtcccg ccggcgatgc catcgcggac cagtaccgct atgtagtcat 1741 gccgatgcgt atttgaagtt tagtcaaacc agtaaggaac tatgccaaac gaagtcaatc 1801 ccgtcggggg aaatggaaac gtctacgatt ccagcagtat caaagtcctg gaaggcctcg 1861 aagcagtgcg cctgcgccct gccatgtaca tcgggtctac aggcgagatg gggctgcacc 1921 atctggtcta tgaggtcgtc gataattccg tcgatgaagc tctagccggc tacgccaccc 1981 atatcgaggt gacgatccac g // bug2982.embl100644000766000024 11055514006427353 16406 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID AJ229040; SV 1; linear; genomic DNA; CON; HUM; 958952 BP. XX AC AJ229040; XX DT 22-JAN-2004 (Rel. 78, Created) DT 24-MAR-2007 (Rel. 91, Last updated, Version 2) XX DE Homo sapiens 959 kb contig between AML1 and CBR1 on chromosome 21q22 XX KW . XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. XX RN [1] RP 897811-898239 RX DOI; 10.1093/nar/23.14.2762. RX PUBMED; 7651838. RA Miyoshi H., Ohira M., Shimizu K., Mitani K., Hirai H., Imai T., RA Yokoyama K., Soeda E., Ohki M.; RT "Alternative splicing and genomic structure of the AML1 gene involved in RT acute myeloid leukemia"; RL Nucleic Acids Res. 23(14):2762-2769(1995). XX RN [2] RP 897811-898239 RX DOI; 10.1089/dna.1996.15.175. RX PUBMED; 8634147. 
RA Levanon D., Bernstein Y., Negreanu V., Ghozi M.C., Bar-Am I., Aloya R., RA Goldenberg D., Lotem J., Groner Y.; RT "A large variety of alternatively spliced and differentially expressed RT mRNAs are encoded by the human acute myeloid leukemia gene AML1"; RL DNA Cell Biol. 15(3):175-185(1996). XX RN [3] RP 896685-897802 RX DOI; 10.1073/pnas.93.5.1935. RX PUBMED; 8700862. RA Ghozi M.C., Bernstein Y., Negreanu V., Levanon D., Groner Y.; RT "Expression of the human acute myeloid leukemia gene AML1 is regulated by RT two promoter regions"; RL Proc. Natl. Acad. Sci. U.S.A. 93(5):1935-1940(1996). XX RN [4] RP 1-958952 RA Blechschmidt K., Rump A., Nordsiek G., Drescher B., Weber J., Rosenthal A.; RT "Sequencing and Analysis of 960 kb between AML1 and CBR1 on Chromosome RT 21q22.2"; RL Unpublished. XX RN [5] RP 1-958952 RA Blechschmidt K., Rump A., Nordsiek G., Drescher B., Weber J., RA Schattevoy R., Rosenthal A., Yaspo M.L., Osoegawa K., Soeda E.; RT ; RL Submitted (07-MAY-1998) to the EMBL/GenBank/DDBJ databases. RL Karin Blechschmidt, Institute of Molecular Biotechnology, Dept. of Genome RL Analysis, Beutenbergstrasse 11, 07745 Jena, GERMANY XX FH Key Location/Qualifiers FH FT source 1..958952 FT /organism="Homo sapiens" FT /chromosome="21" FT /map="q22" FT /mol_type="genomic DNA" FT /db_xref="taxon:9606" XX CO join(AJ229041.1:1..323000,AJ229042.1:51..348050,AJ229043.1:51..288002) // ID AL954800; SV 2; linear; genomic DNA; CON; HUM; 87191216 BP. XX AC AL954800; XX DT 28-JAN-2003 (Rel. 74, Created) DT 14-APR-2004 (Rel. 79, Last updated, Version 5) XX DE Human chromosome 14 complete sequence XX KW complete genome. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. 
XX RN [1] RC In collaboration with Kyoto University Graduate School of Medicine, RC Institute for Systems Biology, Seattle and Washington University Genome RC Sequencing Center. RP 1-87191216 RA Genoscope; RT ; RL Submitted (24-JAN-2003) to the EMBL/GenBank/DDBJ databases. RL Genoscope - Centre National de Sequencage BP 191 91006 EVRY cedex. FRANCE XX RN [2] RP 1-87191216 RX DOI; 10.1038/nature01348. RX PUBMED; 12508121. RA Heilig R., Eckenberg R., Petit J.L., Fonknechten N., Da Silva C., RA Cattolico L., Levy M., Barbe V., de Berardinis V., Ureta-Vidal A., RA Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., Sun H., RA Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., Bruls T., RA Jaillon O., Friedlander L., Samson G., Brottier P., Cure S., Segurens B., RA Aniere F., Samain S., Crespeau H., Abbasi N., Aiach N., Boscus D., RA Dickhoff R., Dors M., Dubois I., Friedman C., Gouyvenoux M., James R., RA Madan A., Mairey-Estrada B., Mangenot S., Martins N., Menard M., Oztas S., RA Ratcliffe A., Shaffer T., Trask B., Vacherie B., Bellemere C., Belser C., RA Besnard-Gonnet M., Bartol-Mavel D., Boutard M., Briez-Silla S., RA Combette S., Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., RA Muselet D., Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., RA Trybou A., Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., RA Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., RA Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., Verdier J., RA Verdier-Discalla C., Hillier L., Fulton L., McPherson J., Matsuda F., RA Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., Quetier F., RA Waterston R., Hood L., Weissenbach J.; RT "The DNA sequence and analysis of human chromosome 14"; RL Nature 421(6923):601-607(2003). 
XX CC Annotation is available here : http://www.genoscope.cns.fr/cgi-bin/ggb/ggb XX FH Key Location/Qualifiers FH FT source 1..87191216 FT /organism="Homo sapiens" FT /chromosome="14" FT /mol_type="genomic DNA" FT /db_xref="taxon:9606" XX CO join(complement(AL512310.3:2094..185605), CO complement(AL391156.3:116..136599),complement(AL359218.4:329..156319), CO AL163152.4:54909..158201,AL356019.5:51149..176114, CO complement(AL355075.6:1..204005),complement(AL163195.5:1..114969), CO AL163636.7:67022..202337,AL133371.3:22302..160471, CO complement(AL355922.4:1992..95036),AL161668.6:54801..202193, CO AL157687.6:139001..196377,AL135744.4:7175..193768,AL161747.5:9066..110538, CO AE000658.1:105..250472,AE000659.1:2..251142,AE000660.1:2..251067, CO AE000661.1:2..247820,AE000662.1:2..69153,AL160314.7:32033..208920, CO complement(AL135998.6:408..165498),AL132780.5:8574..191946, CO complement(AL117258.4:25866..156931),AL049829.4:1274..194296, CO AL132855.4:46251..155912,AL135999.4:19191..196311, CO complement(AL160237.4:1..165160),AL136419.3:332..169476, CO complement(AL136295.3:1..176705),complement(AL096870.5:1..179920), CO complement(AL132800.4:1..177786),AL136018.4:11095..171414, CO AL161663.4:2691..169539,complement(AL137164.3:2002..118674), CO AL161666.4:2341..165438,complement(AL163052.4:1791..126816), CO AL132827.2:653..118925,complement(AL079352.3:2052..175251), CO AL132715.3:22001..197337,complement(AL359396.5:7349..156039), CO AL132633.2:2..84580,complement(AL049831.2:1..218406), CO complement(AL121841.5:558..194413),complement(AL133316.5:139424..157126), CO complement(AL132716.6:2680..152403),complement(AL079343.4:2006..171481), CO AL132718.5:19161..128000,AL110292.4:9..177427, CO complement(AL136522.6:553..170744),complement(AL356015.3:2001..128566), CO AL390334.4:25901..168011,complement(AL391748.3:13230..137037), CO complement(AL390894.2:691..175037),AL391158.5:991..202437, CO complement(AL445384.3:100433..122879),AL163194.5:1094..161813, CO 
complement(AL163639.3:1..68131),complement(AL139023.6:4382..151180), CO AL133162.4:1..196431,complement(AL121766.5:20241..126302), CO AL049777.5:1..168313,complement(AL122126.4:26722..148909), CO AL138974.4:2..32228,complement(AL135878.4:2006..143693), CO AL117693.5:31322..172938,AL133166.3:401..103590,AL158058.5:19871..125338, CO complement(AL135858.3:1..163722),complement(AL356756.4:660..164090), CO complement(AL445884.4:33300..47979),complement(AL355053.4:1..124077), CO AL133372.4:356..190470,complement(AL121840.4:1810..83535), CO AL079305.3:2003..188736,complement(AL132985.4:4200..76347), CO AL117355.5:2000..137548,complement(AL121852.3:754..165136), CO complement(AL513133.6:715..27714),AL049830.3:2..212559, CO complement(AL121808.4:1..99259),complement(AL136418.4:1..191213), CO AL139353.3:32858..169772,AL163973.1:2181..115463,AL390798.3:2771..152139, CO AL359400.4:54341..173676,AL355112.4:14701..126371,AL352984.4:41971..157133, CO complement(AL161665.5:1..161106),complement(AL136298.5:1..178278), CO AL132988.4:11495..146615,AL117672.5:2..164431, CO complement(AL049781.5:2..188164),complement(AL109769.5:970..113373), CO AL355886.4:36293..164606,complement(AL139296.4:1..178149), CO complement(AL158802.1:2000..95921),AL133305.3:7971..157528, CO AL161851.4:2..185732,AL157689.3:27171..214457,AL358340.6:41992..199163, CO AL358274.3:33761..144617,AL162191.6:5194..204662, CO complement(AL356806.4:1..91833),AL445363.7:55024..181350, CO complement(AL445883.3:80200..161151),AL355885.4:1982..116066, CO AL121603.4:2..118657,AL049776.3:2..194380,complement(AL121594.6:1..183519), CO AL133163.3:885..181496,AL160231.4:38601..194173, CO complement(AL137818.4:27480..161167),complement(AL162311.5:1..167127), CO complement(AL133304.4:2005..141396),AL162511.5:2841..163399, CO AL356803.2:61030..66254,AL137226.3:687..177925, CO complement(AL132857.4:13991..166862),AL079303.3:11000..197948, CO complement(AL162464.5:16270..168567),complement(AL121775.3:605..170576), CO 
AL079304.3:2219..161225,AL121857.6:113628..150200, CO complement(AL109759.4:8522..201771),AL121790.4:2..202240, CO AL136296.3:53807..192404,complement(AL163151.1:607..136552), CO complement(AL359233.5:2977..153326),complement(AL392023.3:1..108999), CO AL450109.3:63787..96734,AL161751.3:2..87743,AL355835.3:51580..172374, CO complement(AL499608.3:1..28000),complement(AL357094.4:2176..157828), CO AL359950.4:24001..194191,AL391694.4:2..7823, CO complement(AL132994.4:794..179329),complement(AL109628.5:22098..190016), CO AL132639.4:2..194543,complement(AL157791.4:2742..80074), CO AL049875.2:538..161119,AL117191.6:10964..155188, CO complement(AL355052.3:1..135259),complement(AL133233.2:656..129943), CO complement(AL121767.6:24204..184869),complement(AL049828.3:1146..203079), CO AL352977.4:2..173149,complement(AL355889.1:1999..154110), CO AL390800.4:13181..167872,complement(AL451071.3:1819..129825), CO AL390815.5:55176..147140,complement(AL390801.4:90990..166214), CO AL391516.4:2..165171,AL390335.3:119496..176502, CO complement(AL356596.3:414..172769),AL121821.7:8710..126746, CO AL109662.3:2..166180,complement(AL138498.5:706..146439), CO AL355839.4:438..158864,complement(AL391397.4:287..139949), CO AL583864.3:2..282,AL445074.4:2..142537,AL450442.2:6951..143035, CO AL442163.5:21095..167027,complement(AL389914.3:11003..76031), CO complement(AL356800.4:20129..157812),complement(AL163153.4:1..166562), CO AL390097.2:6877..91231,AL445383.5:25619..170914,AL583809.3:69492..169712, CO AL442183.4:44001..151033,complement(AL358913.4:1893..179861), CO AL133239.4:3183..129637,AL160211.2:44506..164513, CO complement(AL135961.4:38520..117179),complement(AL109766.5:1999..182447), CO AL355888.3:17231..212852,complement(AL161752.4:991..135786), CO AL356022.3:42536..135726,AL358293.4:8261..197927, CO complement(AL160233.3:13890..78284),AL049870.3:2001..205003, CO complement(AL121809.6:1..206369),complement(AL133501.6:64930..158397), CO 
complement(AL162632.4:1..159416),AL139354.6:21392..199538, CO AL359398.2:70461..174399,AL158111.5:72339..171424,AL133449.5:84247..123840, CO complement(AL133224.4:51367..176945),AL161664.8:1..170521, CO AL358341.3:56489..136105,AL512342.2:45..838, CO complement(AL355099.5:116..155826),complement(AL512414.2:1..101079), CO AL591768.2:52166..157502,AL591769.3:39501..170856,AL591771.1:40243..154205, CO complement(AL359951.4:1..175834),AL157792.3:8413..193038, CO AL358832.3:3751..176098,complement(AL079306.3:12833..181521), CO AL162551.4:1124..180960,AL121576.5:140439..190918, CO complement(AL161415.2:1..159442),complement(AL359212.2:2000..116077), CO AL359332.2:79981..162487,AL512358.2:36451..144551, CO complement(AL358335.4:1..143099),AL512359.2:34667..156865, CO AL583762.1:2..3583,complement(AL358336.4:1996..177161), CO AL512360.2:48221..190972,AL358338.3:9393..121786, CO complement(AL110505.5:1..196440),complement(AL358339.2:2001..73407), CO AL512361.3:62201..156305,AL163612.5:34354..171345, CO complement(AL139099.3:1..148139),AL591767.2:43974..187435, CO AL627171.1:63401..172074,AL117692.5:11326..157400,AL109758.3:2..207512, CO AL359397.3:17582..154136,AL118556.4:262..168599, CO complement(AL606834.1:1738..166769),AL133485.3:15379..133980, CO complement(AL358334.3:1997..172999),AL591770.3:5131..158392, CO complement(AL591807.1:1059..142790),AL358332.3:3143..159264, CO complement(AL122125.4:1171..187209),AL079307.7:2..188548, CO complement(AL358333.4:2002..166899),AL118557.5:25781..191132, CO complement(AL365475.1:418..187946),complement(AL355833.4:1..56941), CO complement(AL157971.4:2003..158614),AL133453.3:48071..211127, CO AL139317.5:5860..190646,complement(AL352979.4:1..139729), CO complement(AL356020.3:1..150470),complement(AL390799.4:1..152375), CO AL365295.4:27814..192059,complement(AL445886.3:117180..142997), CO complement(AL163953.3:2009..160985),AL162759.4:9751..172275, CO AL138479.4:11511..165226,complement(AL136297.3:485..144739), CO 
AL356802.5:81918..112125,AL049778.3:2..205001, CO complement(AL359792.3:31684..180352),AL138994.5:2..83427, CO AL133444.4:1001..191200,complement(AL160471.5:1161..191558), CO AL352982.3:135073..138227,complement(AL139316.5:1..190439), CO complement(AL158801.4:1..153599),complement(AL356801.5:685..133539), CO AL138499.4:11979..185713,complement(AL355773.4:1..205075), CO complement(AL163952.5:2000..186743),AL138995.5:39601..161525, CO AL355073.5:2..174986,complement(AL162633.3:691..39837), CO AL359234.4:33176..114847,complement(AL355103.3:1..131915), CO complement(AL161757.4:95..239559),complement(AL162831.6:1..97299), CO AL137100.4:42356..165389,complement(AL391152.3:1..129109), CO AL139195.4:13522..86352,AL355834.4:17101..220292, CO complement(AL161804.4:1451..100043),AL136520.3:25124..169746, CO complement(AL132668.4:30460..149499),AL049838.3:2..210930, CO AL121579.4:16023..156232,AL132989.5:27..144806,AL139021.6:12769..175179, CO complement(AL135752.6:1..98592),AL133312.3:95445..153072, CO AL121819.6:23451..142695,AL049873.3:739..147539, CO complement(AL117356.5:1..153457),complement(AL359219.4:976..113123), CO AL133502.4:1305..172046,AL159140.4:33905..164085, CO complement(AL162471.3:24000..26314),complement(AL121694.4:1993..173206), CO AL139194.7:571..157120,AL133299.4:30891..162939,AL157911.4:21681..174561, CO AL163853.4:84112..157570,complement(AL157756.3:1..104542), CO AL132778.6:7317..169268,AL122057.4:105275..158823,AL049874.3:2..193047, CO complement(AL132777.4:1..159423),complement(AL160236.4:1..157634), CO complement(AL160234.3:1..155293),AL359220.4:48358..194695, CO AL138996.4:1921..182267,complement(AL355916.2:2002..170999), CO AL137129.4:59161..186110,complement(AL137918.4:1658..160309), CO AL356018.5:27076..107424,AL390816.4:1..206404, CO complement(AL391154.4:1..162399),complement(AL391155.3:2001..105721), CO AL389895.3:21071..164054,complement(AL355101.2:1..156276), CO complement(AL109985.5:14361..155961),complement(AL132666.8:1..143426), CO 
complement(AL137191.5:56670..133019),AL049871.4:1993..167587, CO complement(AL118555.6:47030..96140),AL132992.4:1698..133186, CO complement(AL136038.5:726..149549),complement(AL161670.4:2004..148991), CO AL162832.6:5331..156928,AL359235.3:19311..149635, CO complement(AL355094.3:1..139046),complement(AL161756.6:1538..144357), CO complement(AL122035.6:1299..111809),complement(AL049869.6:1..194546), CO complement(AL121774.5:1..154083),complement(AL135745.5:806..84550), CO complement(AL139022.4:1..188472),AL355076.5:11755..213081, CO AL359236.4:69001..179874,AL355840.2:40881..179937, CO complement(AL161871.6:40600..81861),AL109847.5:2001..184774, CO AL513142.3:2..9520,complement(AL928655.1:726..22205),AL928760.1:734..9995, CO AL928766.1:2..22955,AL589183.2:600..17576, CO complement(AL391261.3:2003..157710),AL356021.4:66041..182663, CO AL157997.5:15001..168075,AL359232.4:38491..150544, CO complement(AL355093.3:8..142045),complement(AL139295.6:16230..142099), CO AL133241.3:2000..157347,AL117667.4:2..90331, CO complement(AL049835.3:1996..187563),AL159179.3:44061..106736, CO complement(AL135978.4:2001..172995),AL139785.5:20581..154246, CO complement(AL132640.4:16811..177687),AL049779.6:2..186608, CO complement(AL121595.5:1940..163986),complement(AL133370.4:1..153888), CO complement(AL157915.3:52459..121485),AL136294.4:2..88026, CO complement(AL122013.5:46..173009),complement(AL589168.3:15793..134499), CO complement(AL121820.4:154..116160),complement(AL133313.2:162226..179463), CO complement(AL132986.4:19254..182129),AL117694.5:413..164284, CO complement(AL391262.3:1424..121889),AL359317.3:33746..207092, CO complement(AL157996.8:1987..187098),AL133445.4:32351..189924, CO complement(AL355837.6:2647..91514),AL157789.6:60001..223844, CO complement(AL135747.4:897..150824),complement(AL160191.3:655..186922), CO AL356804.4:6389..184109,AL391223.4:2..17677,AL357153.4:2..203186, CO complement(AC004816.1:1..165628),complement(AC004818.1:1..21425), CO 
complement(AC004825.2:1..124021),complement(AL445903.3:23081..49040), CO complement(AC005230.1:1..111498),complement(AC004817.2:1..128198), CO complement(AC005476.4:310..107531),complement(AL391735.2:240..966), CO complement(AC005994.2:1..90770),complement(AC004968.1:1..108622), CO complement(AC004974.1:1..112631),complement(AC004900.2:15202..113050), CO complement(AL391749.4:106912..155299),complement(AC005993.2:1..141759), CO complement(AC005477.1:1..106060),complement(AC005227.2:60880..99850), CO complement(AC005857.1:2..100459),complement(AC005226.1:1..68766), CO complement(AC005157.1:1..164075),complement(AC005533.2:1..82147), CO complement(AC004828.2:81..183049),AL392024.3:111013..139439, CO complement(AC006360.2:1..136851),complement(AC007160.3:176..87400), CO complement(AL442663.3:26000..162999),complement(AC004858.2:1..123139), CO complement(AC004846.2:583..154416),complement(AL391733.3:1..667), CO complement(AC005280.3:1..136748),complement(AC005225.2:1..166574), CO complement(AC006146.2:1..103948),complement(AC005520.2:1..150923), CO complement(AC005480.3:1..94108),complement(AC005484.2:1..131743), CO complement(AC006349.3:1..111737),complement(AC005519.3:20108..163783), CO complement(AC005479.2:1..138424),AC013451.8:23015..105123, CO AC007956.5:2..232647,AC006530.4:84..117603, CO complement(AL049780.4:757..194552),complement(AL691403.2:29915..124711), CO AF111167.2:781..200267,AC009363.4:11932..44312,AC007182.3:239..190959, CO AC009399.5:4243..144171,AF107885.2:1154..190126,AC008015.5:6219..116006, CO AC016526.6:222..179234,AC016543.6:1980..191159,AC008050.6:504..174975, CO AC007376.9:60484..171453,AF111169.2:2..181916,AC007686.5:15408..179628, CO AC007375.6:2..178331,AC007954.7:82514..184385,AF111168.2:2..229730, CO AC008044.4:19300..102033,AC008372.6:981..266238, CO complement(AL049832.3:116842..161874),AC007955.4:2..161312, CO AF099810.1:23608..170589,AC007056.4:2..180481,AC008045.5:8782..183640, CO 
AC009396.5:2..162427,AC011440.5:5936..89643,AC026888.6:2..207681, CO AC022469.5:26897..153812,AC018514.7:127..155144,AC012099.4:8966..153594, CO AC008056.6:408..161584,AC007058.2:15039..94599,AF123462.1:342..178104, CO AC007514.5:6136..176831,AC007372.4:2..152796, CO complement(AL049837.4:20361..22355),AC010849.3:2..149659, CO AC018513.5:13250..183488,AC022404.7:2..231060,AC010072.5:7108..109619, CO AC007262.4:2..159737,AC010582.6:2..2618,complement(AL136040.5:260..197419), CO complement(AL121769.4:1995..166503),AL136039.4:4371..173852, CO AL160192.3:1..185228,complement(AL355838.3:1..146009), CO AL358275.4:15970..178614,complement(AL162811.6:186..158499), CO complement(AL357095.4:1993..155871),AL356799.3:29181..158955, CO complement(AL390076.4:2002..89476),AL163171.4:44571..146582, CO AL355095.4:64121..151193,AL359238.4:12286..165668,AL162872.4:32988..171880, CO complement(AL163613.2:1..139447),AL355072.3:10926..167003, CO AL356032.4:29877..167936,AL355887.4:72914..159182, CO complement(AL161713.4:1..147829),AL391732.3:475..1308, CO complement(AL162891.4:1..162612),complement(AL356807.3:1810..110460), CO AL390802.2:30354..109783,AL583743.3:30193..161991, CO complement(AL583742.2:1..83199),complement(AL512311.3:2002..129791), CO AL445385.4:13681..162501,complement(AL513143.3:120541..155001), CO AL157690.7:213..162528,complement(AL163642.4:1..184036), CO complement(AL357172.3:657..197550),complement(AL157957.4:23060..134864), CO complement(AL133369.4:25748..101601),AL049775.2:1702..181433, CO complement(AL132641.3:26..119421),complement(AL137228.4:140..121358), CO complement(AL357092.4:1..176089),complement(AL445593.3:1..109678), CO complement(AL359239.4:1..141816),AL161753.4:54806..163933, CO complement(AL359333.6:1..109249),complement(AL359680.4:2000..77590), CO AL356016.2:36751..154292,complement(AL365294.4:1804..145749), CO AL391517.3:5916..125242,AL358292.4:2..180211, CO complement(AL352955.7:970..146726),AL157688.4:29088..189917, CO 
complement(AL135746.3:1..127819),complement(AL158059.3:1993..114725), CO AL359237.4:45171..160230,AL136501.4:10306..178584,AL157955.5:16031..145390, CO AL133279.7:243..196743,AL049834.3:2..171405, CO complement(AL162171.4:1419..181119),AL121768.4:25842..211240, CO complement(AL133238.3:1275..171709),AL137785.6:16043..153730, CO complement(AL138478.3:75..75669),complement(AL357093.4:1..155099), CO complement(AL157736.3:87480..89442),AL356805.5:2..157157, CO AL163032.3:44931..120467,complement(AL137230.3:684..196215), CO complement(AL163011.3:1993..89641),AL137128.4:19101..226656, CO AL355074.5:7431..181635,AL161662.6:85251..195369,AL512791.3:47382..89068, CO complement(AL096869.8:367..227284),AL139193.4:90242..141396, CO complement(AL122020.5:714..148122),AL159191.4:60248..105885, CO AL133454.6:2..163176,complement(AL121784.5:10165..170529), CO AL135818.3:706..88440,AL133153.3:7..231490, CO complement(AL133373.5:644..141113),complement(AL121839.3:1561..179597), CO complement(AL590328.2:35676..105579),AL049872.3:1868..170669, CO AL121773.5:13121..158489,complement(AL133240.4:1..74439), CO complement(AL118559.6:1..191792),AL159141.5:28493..174318, CO complement(AL136332.5:34450..101876),complement(AL132987.4:1..167509), CO complement(AL117192.5:1..204505),complement(AL110118.7:831..132850), CO AL132838.4:4285..58790,AL122023.3:2..170429, CO complement(AL136338.4:1..141176),complement(AL157858.5:1594..179993), CO AL132642.4:2..180643,AL079302.7:32452..196082,AL121838.4:2..98377, CO complement(AL117259.6:19505..181441),complement(AL132708.3:1..145585), CO AL132990.4:105808..144053,AL049839.3:2..212527,AL121612.5:26541..180914, CO complement(AL391542.2:381..778),complement(AL390254.6:1..177792), CO AL356017.3:22744..118900,complement(AL117187.7:841..152932), CO AL138539.7:1..148355,AL133467.4:7781..208793,AL139020.5:34211..164609, CO complement(AL133167.4:1..163095),complement(AL137190.5:1..155056), CO complement(AL355102.5:2001..157206),AL359240.4:6621..168761, CO 
AL163051.5:38805..160867,complement(AL137786.3:1..197689), CO complement(AL133168.6:1..179113),AL049833.5:31058..192406, CO complement(AL158800.6:1..163979),complement(AL352976.3:806..163215), CO complement(AL445885.1:76203..90809),AL355098.3:2..151095, CO AL163872.2:86211..167898,AL391015.3:5..2759, CO complement(AL355097.5:1289..164522),AL445594.3:57389..81891, CO AL352981.4:76..157670,complement(AL163932.5:1..159904), CO complement(AL391153.3:1306..120388),AL163760.4:23906..173004, CO AL132719.4:8044..169507,AL096821.3:17951..169086,AL132796.4:2..159803, CO AL162151.5:2..230568,AL359681.5:32793..100000, CO complement(AL109767.7:115149..175332),complement(AL132819.6:1..140073), CO complement(AL110504.6:103330..206733),complement(AL160313.4:1993..164184), CO AL136000.4:9831..199592,complement(AL133368.6:1564..132229), CO AL157912.5:64883..117912,AL133523.5:20681..171053,AL157871.5:41501..178689, CO complement(AL135838.6:1511..83209),AL845552.2:2..128569, CO complement(AL163974.7:2002..72697),AL132711.5:67171..175375, CO AL117190.6:2..165045,complement(AL132709.5:1..196669), CO complement(AL355836.3:1080..142978),complement(AL359682.4:1..60830), CO complement(AL355096.4:64330..185480),AL049836.3:2..205774, CO complement(AL355032.6:1..86128),complement(AL137779.6:1..141674), CO AL118558.6:6885..192127,complement(AL133223.5:1..116259), CO AL359402.3:62672..117291,AL352978.6:37781..139455,AL137229.4:10231..123780, CO AL136293.5:29141..224035,complement(AL132801.5:2008..154098), CO AL117209.7:13601..157604,AL133455.4:18991..137833, CO complement(AL161669.5:928..180185),AL138976.5:52018..167357, CO complement(AL133367.4:1..181458),complement(AL139300.6:1..145044), CO complement(AL049840.8:1..165471),complement(AL132712.4:11633..99672), CO AL136001.6:847..184780,complement(AL359399.3:310..80288), CO complement(AL590079.3:3700..62642),complement(AL512790.1:282..18582), CO complement(AL512357.4:1..173320),complement(AL512362.4:36499..161862), CO 
AL583722.6:1..193000,complement(AL590327.3:10857..38088), CO AL590326.3:382..29282,AL583810.5:6120..93029,AL512802.2:10401..96567, CO AL512356.6:2..156461,AL512355.6:31161..196273,AL928654.1:15252..211420, CO complement(AL928742.3:574..124699),complement(AL928761.2:1..12040), CO complement(AL928769.1:1501..11357),complement(AL928765.1:2..7340), CO complement(AL928767.1:25..10609),complement(AL928762.2:14..2395), CO complement(AL928768.1:1..8825),complement(AL901608.1:13168..34978), CO AL122127.6:2..168209,complement(AB019441.1:1..125028), CO complement(AB019440.1:1..200000),complement(AB019439.1:1..200000), CO complement(AB019438.1:1..200000),complement(AB019437.1:1..200000)) // ID BA000005; SV 3; linear; genomic DNA; CON; HUM; 33543332 BP. XX AC BA000005; XX DT 23-JAN-2003 (Rel. 74, Created) DT 16-JUL-2008 (Rel. 96, Last updated, Version 5) XX DE Homo sapiens genomic DNA, chromosome 21q. XX KW . XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. 
XX RN [1] RP 1-33543332 RG The Chromosome 21 Mapping and Sequencing Consortium RA Hattori M., Fujiyama A., Taylor T.D., Watanabe H., Yada T., Park H.S., RA Toyoda A., Ishii K., Totoki Y., Choi D.K., Soeda E., Ohki M., Takagi T., RA Sakaki Y., Taudien S., Blechschmidt K., Polley A., Menzel U., Delabar J., RA Kumpf K., Lehmann R., Patterson D., Reichwald K., Rump A., Schillhabel M., RA Schudy A., Zimmermann W., Rosenthal A., Kudoh J., Shibuya K., Kawasaki K., RA Asakawa S., Shintani A., Sasaki T., Nagamine K., Mitsuyama S., RA Antonarakis S.E., Minoshima S., Shimizu N., Nordsiek G., Hornischer K., RA Barandt P., Scharfe M., Schoen O., Desario A., Reichelt J., Kauer G., RA Bloecker H., Ramser J., Beck A., Klages S., Hennig S., Riesselmann L., RA Dagand E., Wehrmaeyer S., Borzym K., Gardiner K., Nizetic D., Francis F., RA Lehrach H., Reinhardt R., Yaspo M.L.; RT ; RL Submitted (09-MAY-2000) to the EMBL/GenBank/DDBJ databases. RL * RIKEN Genomic Sciences Center Human Genome Research Group * Institute of RL Molecular Biotechnology Genome Analysis * Keio University School of RL Medicine, Dept. of Molecular Biology * GBF, Dept. of Genome Analysis * RL Max-Planck Institute for Molecular Genetics (addresses see below) XX RN [2] RX DOI; 10.1038/35012518. RX PUBMED; 10830953. 
RA Hattori M., Fujiyama A., Taylor T.D., Watanabe H., Yada T., Park H.S., RA Toyoda A., Ishii K., Totoki Y., Choi D.K., Soeda E., Ohki M., Takagi T., RA Sakaki Y., Taudien S., Blechschmidt K., Polley A., Menzel U., Delabar J., RA Kumpf K., Lehmann R., Patterson D., Reichwald K., Rump A., Schillhabel M., RA Schudy A., Zimmermann W., Rosenthal A., Kudoh J., Shibuya K., Kawasaki K., RA Asakawa S., Shintani A., Sasaki K., Nagamine K., Mitsuyama S., RA Antonarakis S.E., Minoshima S., Shimizu N., Nordsiek G., Hornischer K., RA Barandt P., Scharfe M., Schoen O., Desario A., Reichelt J., Kauer G., RA Bloecker H., Ramser J., Beck A., Klages S., Hennig S., Riesselmann L., RA Dagand E., Wehrmeyer S., Borzym K., Gardiner K., Nizetic D., Francis F., RA Lehrach H., Reinhardt R., Yaspo M.L.; RT "The DNA sequence of human chromosome 21"; RL Nature 405(6784):311-319(2000). XX CC The chromosome 21 mapping and sequencing consortium CC consisting of CC * RIKEN Genomic Sciences Center, Human Genome Research Group, CC * Sagamihara 228-8555, Japan, CC * e.mail: hattori@gsc.riken.go.jp CC * URL: http://hgp.gsc.riken.go.jp/ CC and CC * Institute of Molecular Biotechnology, Genome Analysis, CC * Beutenbergstrasse 11, D-07745 Jena, Germany, CC * e.mail: gscj-submit@genome.imb-jena.de CC * URL: http://genome.imb-jena.de/ CC and CC * Keio University School of Medicine, Molecular Biology, CC * Tokyo 160-8582, Japan, CC * e.mail: nshimizu@dmb-med.keio.ac.jp CC * URL: http://www.dmb.med.keio.ac.jp/ CC and CC * GBF, Dept. 
of Genome Analysis, CC * Mascheroder Weg 1, D-38124 Braunschweig, Germany, CC * e.mail: info.genome@gbf.de CC * URL: http://genome.gbf.de/ CC and CC * Max-Planck Institute for Molecular Genetics, CC * Ihnestrasse 73, D-14195 Berlin, Germany, CC * e.mail: info-chr21@molgen.mpg.de CC * URL: http://chr21.rz-berlin.mpg.de/ XX FH Key Location/Qualifiers FH FT source 1..33543332 FT /organism="Homo sapiens" FT /chromosome="21" FT /map="21q" FT /mol_type="genomic DNA" FT /db_xref="taxon:9606" XX CO join(AL163202.2:1..340000,AL163203.2:1001..340000,AL163204.2:1001..340000, CO AP001660.1:1001..340000,AL163206.2:1001..340000,AL163207.2:1001..340000, CO AL163208.2:1001..340000,AL163209.2:1001..340000,AL163210.2:1001..340000, CO AP001666.1:1001..340000,AP001667.1:1001..340000,AL163213.2:1001..340000, CO AP001669.1:1001..340000,AP001670.1:1001..340000,AP001671.1:1001..340000, CO AL163217.2:1001..340000,AL163218.2:1001..340000,AP001674.1:1001..340000, CO AP001675.1:1001..340000,AP001676.1:1001..340000,AP001677.1:1001..340000, CO AP001678.1:1001..340000,AP001679.1:1001..340000,AP001680.1:1001..340000, CO AP001681.1:1001..340000,AL163227.2:1001..340000,AP001683.1:1001..340000, CO AP001684.1:1001..340000,AP001685.1:1001..340000,AP001686.1:1001..340000, CO AP001687.1:1001..340000,AP001688.1:1001..340000,AP001689.1:1001..340000, CO AP001690.1:1001..340000,AP001691.1:1001..340000,AP001692.1:1001..340000, CO AP001693.1:1001..340000,AP001694.1:1001..340000,AP001695.1:1001..340000, CO AP001696.1:1001..340000,AP001697.1:1001..340000,AP001698.1:1001..340000, CO AP001699.1:1001..340000,AP001700.1:1001..340000,AL163246.2:1001..340000, CO AL163247.2:1001..340000,AL163248.2:1001..340000,AL163249.2:1001..340000, CO AP001705.1:1001..340000,AP001706.1:1001..340000,AP001707.1:1001..340000, CO AP001708.1:1001..340000,AP001709.1:1001..340000,AP001710.1:1001..340000, CO AP001711.1:1001..340000,AP001712.1:1001..340000,AP001713.1:1001..340000, CO 
AP001714.1:1001..340000,AP001715.1:1001..340000,AP001716.1:1001..340000, CO AP001717.1:1001..340000,AP001718.1:1001..340000,AP001719.1:1001..340000, CO AP001720.1:1001..340000,AP001721.1:1001..340000,AL163267.2:1001..340000, CO AL163268.2:1001..340000,AP001724.1:1001..340000,AP001725.1:1001..340000, CO AP001726.1:1001..340000,AP001727.1:1001..340000,AP001728.1:1001..340000, CO AP001729.1:1001..340000,AP001730.1:1001..340000,AP001731.1:1001..340000, CO AP001732.1:1001..340000,AL163278.2:1001..340000,AL163279.2:1001..340000, CO AL163280.2:1001..340000,AL163281.2:1001..340000,AL163282.2:1001..340000, CO AL163283.2:1001..340000,AL163284.2:1001..340000,AL163285.2:1001..340000, CO AP001741.1:1001..39322,gap(unk100),AP001743.1:1..219256,gap(unk100), CO AP001745.1:1..336578,AP001746.1:1001..340000,AP001747.1:1001..340000, CO AP001748.1:1001..340000,AP001749.1:1001..26076,gap(unk100), CO AP001751.1:1..340000,AP001752.1:1001..340000,AP001753.1:1001..340000, CO AP001754.1:1001..340000,AL163300.2:1001..340000,AL163301.2:1001..340000, CO AL163302.2:1001..340000,AL163303.2:1001..340000,AP001759.1:1001..340000, CO AP001760.1:1001..340000,AP001761.1:1001..39800) // ID BA000006; SV 2; linear; genomic DNA; CON; HUM; 962365 BP. XX AC BA000006; XX DT 23-JAN-2003 (Rel. 74, Created) DT 24-MAR-2007 (Rel. 91, Last updated, Version 3) XX DE Homo sapiens genomic DNA, chromosome 22q11.2, Cat Eye Syndrome region. XX KW . XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. XX RN [1] RP 1-962365 RA Shimizu N.; RT ; RL Submitted (05-JUN-2000) to the EMBL/GenBank/DDBJ databases. RL Nobuyoshi Shimizu, Keio University, School of Medicine, Molecular Biology; RL 35 Shinanomachi, Shinjuku-ku, Tokyo 160-8582, Japan RL (E-mail:nshimizu@dmb.med.keio.ac.jp, Tel:81-3-3351-2370, RL Fax:81-3-3351-2370) XX RN [2] RX DOI; 10.1038/990031. RX PUBMED; 10591208. 
RA Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M., Clamp M., RA Smink L.J., Ainscough R., Almeida J.P., Babbage A., Bagguley C., Bailey J., RA Barlow K., Bates K.N., Beasley O., Bird C.P., Blakey S., Bridgeman A.M., RA Buck D., Burgess J., Burrill W.D., Burton J., Carder C., Carter N.P., RA Chen Y., Clark G., Clegg S.M., Cobley V., Cole C.G., Collier R.E., RA Connor R.E., Conroy D., Corby N., Coville G.J., Cox A.V., Davis J., RA Dawson E., Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., RA Ellington A., Evans K.L., Fey J.M., Fleming K., French L., Garner A.A., RA Gilbert J.G.R., Goward M.E., Grafham D., Griffiths M.N., Hall C., Hall R., RA Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S., Hunt S.E., Jones M.C., RA Kershaw J., Kimberley A., King A., Laird G.K., Langford C.F., RA Leversha M.A., Lloyd C., Lloyd D.M., Martyn I.D., Mashreghi-Mohammadi M., RA Matthews L., McCann O.T., McClay J., McLaren S., McMurray A.A., Milne S.A., RA Mortimore B.J., Odell C.N., Pavitt R., Pearce A.V., Pearson D., RA Phillimore B.J., Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., RA Rogers L., Ross M.T., Scott C.E., Sehra H.K., Skuce C.D., Smalley S., RA Smith M.L., Soderlund C., Spragon L., Steward C.A., Sulston J.E., RA Swann R.M., Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D., RA Williams L., Williams S., Williamson H., Wilmer T.E., Wilming L., RA Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N., RA Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J., Shintani A., RA Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S., Roe B.A., Chen F., Chu L., RA Crabtree J., Deschamps S., Do A., Do T., Dorman A., Fang F., Fu Y., Hu P., RA Hua A., Kenton S., Lai H., Lao H.I., Lewis J., Lewis S., Lin S.-P., Loh P., RA Malaj E., Nguyen T., Pan H., Phan S., Qi S., Qian Y., Ray L., Ren Q., RA Shaull S., Sloan D., Song L., Wang Q., Wang Y., Wang Z., White J., RA Willingham D., Wu H., Yao Z., Zhan M., Zhang G., Chissoe S., Murray J., RA Miller 
N., Minx P., Fulton R., Johnson D., Bemis G., Bentley D., RA Bradshaw H., Bourne S., Cordes M., Du Z., Fulton L., Goela D., Graves T., RA Hawkins J., Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., RA Rohlfing T., Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., RA Nelson J., Korf I., Bedell J.A., Hillier L., Mardis E., Waterston R., RA Wilson R., Emanuel B.S., Shaikh T., Kurahashi H., Saitta S., Budarf M.L., RA McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E., Edelmann L., Kim U.J., RA Shizuya H., Simon M.I., Dumanski J.P., Peyrard M., Kedra D., Seroussi E., RA Fransson I., Tapia I., Bruder C.E., O'Brien K.P.; RT "The DNA sequence of human chromosome 22"; RL Nature 402(6761):489-495(1999). XX FH Key Location/Qualifiers FH FT source 1..962365 FT /organism="Homo sapiens" FT /chromosome="22" FT /map="22q11.2" FT /mol_type="genomic DNA" FT /db_xref="taxon:9606" XX CO join(AP000522.1:1..37693,AP000523.1:2273..41306,AP000524.1:474..4455, CO AP000525.1:846..39301,AP000526.1:22650..36885,AP000527.1:7214..40562, CO AP000528.1:6429..42301,AP000529.1:6550..37981,gap(unk100), CO AP000530.1:1..32968,AP000531.1:12539..34029,AP000532.1:7998..37599, CO AP000533.1:2490..45597,AP000534.1:2502..37769,AP000535.1:4727..39151, CO AP000536.1:5212..39668,AP000537.1:6785..43734,AP000538.1:15492..33760, CO AP000539.1:10187..42207,AP000540.2:18532..41100,AP000541.1:2678..34987, CO AP000542.1:8603..41389,gap(unk100),AP000543.1:1..36955, CO AP000544.1:19046..38993,AP000545.1:4338..37247,AP000546.1:4243..39984, CO AP000547.1:4740..123288,AP000365.1:33343..111123) testaln.list100644000766000024 7114006427353 16701 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataBAB68554 gb|443893|124775 P814153 P84139 P841414 P851414 testaln.mega100644000766000024 255714006427353 16712 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data#mega !Title: Amino acid sequences of hemoglobin alpha chains; !Format datatype=protein identical=. indel=-; [ M. Nei and S. 
Kumar (2000) Molecular Evolution and Phylogenetics. Oxford University Press. New York. Data for Example 2.1. ] #Human V-LSPADKTN VKAAWGKVGA HAGEYGAEAL ERMFLSFPTT KTYFPHF-DL SHGSAQVKGH #Horse ....A..... .....S...G .......... .....G.... .......... ........A. #Cow ....A...G. .........G ..A....... .......... .......... .......... #Kangaroo ....A...GH ...I.....G .....A..G. ..T.H..... .......... ......IQA. #Newt MK..AE..H. ..TT.DHIKG .EEAL..... F...T.L.A. R....AK... .E..SFLHS. #Carp S...DK..AA ..I..A.ISP K.DDI..... G..LTVY.Q. ....A.WA.. .P..GP..-. #Human GKKVA-DALT NAVAHVDDMP NALSALSDLH AHKLRVDPVN FKLLSHCLLV TLAAHLPAEF #Horse .......G.. L..G.L..L. G...D..N.. .......... .........S ...V...ND. #Cow .A....A... K..E.L..L. G...E..... .......... ......S... ...S...SD. #Kangaroo ...I.....G Q..E.I..L. GT..K..... .......... .......... .F....GDA. #Newt ....M.G..S .....I..ID A..CK...K. .QD.M...A. .PK.A.NI.. VMGI..K.HL #Carp ....IMG.VG D..SKI..LV GG.AS..E.. .S......A. ..I.ANHIV. GIMFY..GD. #Human TPAVHASLDK FLASVSTVLT SKYR #Horse .......... ..S....... .... #Cow .......... ...N...... .... #Kangaroo ..E....... ...A...... .... #Newt .YP..C.V.. ..DV.GH... .... #Carp P.E..M.V.. .FQNLALA.S E... 
tab1part.mif100644000766000024 1527614006427353 16642 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataDIP:4305E DIP:3048N PIR:B64526 GI:2313123 DIP:3047N SWP:O24853 PIR:A64520 GI:2313078 DIP:4307E DIP:3050N PIR:H64618 GI:2313921 DIP:3047N SWP:O24853 PIR:A64520 GI:2313078 DIP:4308E DIP:3051N PIR:B64520 GI:2313079 DIP:3051N PIR:B64520 GI:2313079 DIP:4309E DIP:3052N SWP:P56036 PIR:H64669 GI:2314362 DIP:3051N PIR:B64520 GI:2313079 DIP:4310E DIP:3053N PIR:A64696 GI:2314583 DIP:3051N PIR:B64520 GI:2313079 DIP:4311E DIP:3054N PIR:C64701 GI:2314631 DIP:3051N PIR:B64520 GI:2313079 DIP:4312E DIP:3055N SWP:P55993 PIR:H64530 GI:2313167 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4313E DIP:3057N SWP:P55991 PIR:D64534 GI:2313198 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4314E DIP:3058N SWP:P56112 PIR:G64541 GI:2313264 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4315E DIP:3059N PIR:F64554 GI:2313368 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4316E DIP:3060N PIR:E64559 GI:2313410 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4317E DIP:3061N PIR:G64592 GI:2313700 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4318E DIP:3062N SWP:O25336 PIR:G64596 GI:2313737 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4319E DIP:3063N PIR:H64599 GI:2313761 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4320E DIP:3064N SWP:P56143 PIR:B64609 GI:2313838 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4321E DIP:3065N PIR:A64642 GI:2314119 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4322E DIP:3066N PIR:B64679 GI:2314439 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4323E DIP:3067N PIR:F64688 GI:2314520 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4324E DIP:3053N PIR:A64696 GI:2314583 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4325E DIP:3068N PIR:C64696 GI:2314584 DIP:3056N SWP:P56155 PIR:E64520 GI:2313082 DIP:4326E DIP:3070N PIR:E64595 GI:2313728 DIP:3069N SWP:P42383 PIR:S36237 GI:2313084 DIP:4327E DIP:3071N PIR:B64648 GI:2314168 DIP:3069N SWP:P42383 PIR:S36237 GI:2313084 
DIP:4328E DIP:3072N PIR:D64660 GI:2314290 DIP:3069N SWP:P42383 PIR:S36237 GI:2313084 DIP:4329E DIP:3073N PIR:H64671 GI:2314377 DIP:3069N SWP:P42383 PIR:S36237 GI:2313084 DIP:4330E DIP:3074N SWP:P48225 PIR:C71986 GI:2313085 DIP:3074N SWP:P48225 PIR:C71986 GI:2313085 DIP:4331E DIP:3053N PIR:A64696 GI:2314583 DIP:3074N SWP:P48225 PIR:C71986 GI:2313085 DIP:4332E DIP:3075N PIR:H64521 GI:2313097 DIP:3075N PIR:H64521 GI:2313097 DIP:4333E DIP:3076N PIR:F64524 GI:2313125 DIP:3075N PIR:H64521 GI:2313097 DIP:4334E DIP:3077N PIR:D64548 GI:2313317 DIP:3075N PIR:H64521 GI:2313097 DIP:4335E DIP:3078N PIR:G64574 GI:2313548 DIP:3075N PIR:H64521 GI:2313097 DIP:4336E DIP:3079N PIR:G64605 GI:2313813 DIP:3075N PIR:H64521 GI:2313097 DIP:4337E DIP:3080N PIR:A64607 GI:2313821 DIP:3075N PIR:H64521 GI:2313097 DIP:4338E DIP:3081N PIR:G64614 GI:2313885 DIP:3075N PIR:H64521 GI:2313097 DIP:4339E DIP:3082N SWP:Q59465 PIR:G64618 GI:2313920 DIP:3075N PIR:H64521 GI:2313097 DIP:4340E DIP:3083N PIR:A64641 GI:2314107 DIP:3075N PIR:H64521 GI:2313097 DIP:4341E DIP:3084N PIR:A64647 GI:2314162 DIP:3075N PIR:H64521 GI:2313097 DIP:4342E DIP:3085N PIR:G64667 GI:2314343 DIP:3075N PIR:H64521 GI:2313097 DIP:4343E DIP:3086N PIR:C64678 GI:2314432 DIP:3075N PIR:H64521 GI:2313097 DIP:4344E DIP:3087N PIR:B64701 GI:2314626 DIP:3075N PIR:H64521 GI:2313097 DIP:4345E DIP:3088N PIR:B64706 GI:2314662 DIP:3075N PIR:H64521 GI:2313097 DIP:4346E DIP:3090N PIR:F64546 GI:2313304 DIP:3089N PIR:D64523 GI:2313105 DIP:4347E DIP:3091N PIR:B64564 GI:2313455 DIP:3089N PIR:D64523 GI:2313105 DIP:4348E DIP:3092N PIR:G64585 GI:2313642 DIP:3089N PIR:D64523 GI:2313105 DIP:4349E DIP:3093N SWP:P55980 PIR:C64588 GI:2313664 DIP:3089N PIR:D64523 GI:2313105 DIP:4350E DIP:3094N PIR:G64595 GI:2313726 DIP:3089N PIR:D64523 GI:2313105 DIP:4351E DIP:3082N SWP:Q59465 PIR:G64618 GI:2313920 DIP:3089N PIR:D64523 GI:2313105 DIP:4352E DIP:3083N PIR:A64641 GI:2314107 DIP:3089N PIR:D64523 GI:2313105 DIP:4353E DIP:3095N SWP:O06758 PIR:A64650 GI:2314189 
DIP:3089N PIR:D64523 GI:2313105 DIP:4354E DIP:3096N SWP:P55989 PIR:H64653 GI:2314221 DIP:3089N PIR:D64523 GI:2313105 DIP:4355E DIP:3097N PIR:F64672 GI:2314381 DIP:3089N PIR:D64523 GI:2313105 DIP:4356E DIP:3087N PIR:B64701 GI:2314626 DIP:3089N PIR:D64523 GI:2313105 DIP:4357E DIP:3098N SWP:P56105 PIR:F64703 GI:2314647 DIP:3089N PIR:D64523 GI:2313105 DIP:4358E DIP:3099N PIR:G64707 GI:2314677 DIP:3089N PIR:D64523 GI:2313105 DIP:4359E DIP:3100N SWP:O26074 PIR:F64713 GI:2314730 DIP:3089N PIR:D64523 GI:2313105 DIP:4360E DIP:3102N SWP:P56061 PIR:F64520 GI:2313083 DIP:3101N PIR:G64523 GI:2313114 DIP:4361E DIP:3103N PIR:B64528 GI:2313138 DIP:3101N PIR:G64523 GI:2313114 DIP:4362E DIP:3104N SWP:O08314 PIR:A64555 GI:2313378 DIP:3101N PIR:G64523 GI:2313114 DIP:4363E DIP:3095N SWP:O06758 PIR:A64650 GI:2314189 DIP:3101N PIR:G64523 GI:2313114 DIP:4364E DIP:3105N PIR:A64709 GI:2314693 DIP:3101N PIR:G64523 GI:2313114 DIP:4365E DIP:3106N SWP:O26087 PIR:G64715 GI:2314750 DIP:3101N PIR:G64523 GI:2313114 DIP:4366E DIP:3107N SWP:P56066 PIR:H64523 GI:2313107 DIP:3107N SWP:P56066 PIR:H64523 GI:2313107 DIP:4367E DIP:3108N PIR:C64524 GI:2313110 DIP:3108N PIR:C64524 GI:2313110 DIP:4368E DIP:3110N PIR:A64522 GI:2313091 DIP:3109N PIR:B64525 GI:2313117 DIP:4369E DIP:3111N PIR:C64526 GI:2313124 DIP:3109N PIR:B64525 GI:2313117 DIP:4370E DIP:3112N PIR:F64526 GI:2313132 DIP:3109N PIR:B64525 GI:2313117 DIP:4384E DIP:3112N PIR:F64526 GI:2313132 DIP:3111N PIR:C64526 GI:2313124 DIP:4371E DIP:3059N PIR:F64554 GI:2313368 DIP:3109N PIR:B64525 GI:2313117 DIP:4372E DIP:3113N PIR:E64567 GI:2313486 DIP:3109N PIR:B64525 GI:2313117 DIP:4373E DIP:3114N PIR:C64572 GI:2313522 DIP:3109N PIR:B64525 GI:2313117 DIP:4374E DIP:3115N SWP:O25424 PIR:C64610 GI:2313847 DIP:3109N PIR:B64525 GI:2313117 DIP:4375E DIP:3116N SWP:P56082 PIR:E64661 GI:2314284 DIP:3109N PIR:B64525 GI:2313117 DIP:4376E DIP:3117N SWP:P56032 PIR:F64684 GI:2314480 DIP:3109N PIR:B64525 GI:2313117 DIP:4377E DIP:3118N PIR:H64693 GI:2314561 DIP:3109N 
PIR:B64525 GI:2313117 DIP:4378E DIP:3119N PIR:E64525 GI:2313120 DIP:3119N PIR:E64525 GI:2313120 DIP:4379E DIP:3053N PIR:A64696 GI:2314583 DIP:3119N PIR:E64525 GI:2313120 DIP:4380E DIP:3120N PIR:G64525 GI:2313121 DIP:3120N PIR:G64525 GI:2313121 DIP:4381E DIP:3121N PIR:H64525 GI:2313122 DIP:3120N PIR:G64525 GI:2313121 DIP:4382E DIP:3122N PIR:G64606 GI:2313818 DIP:3120N PIR:G64525 GI:2313121 DIP:4383E DIP:3103N PIR:B64528 GI:2313138 DIP:3123N PIR:E64527 GI:2313147 DIP:4389E DIP:3103N PIR:B64528 GI:2313138 DIP:3123N PIR:E64527 GI:2313147 DIP:4390E DIP:3103N PIR:B64528 GI:2313138 DIP:3123N PIR:E64527 GI:2313147 DIP:4391E DIP:3122N PIR:G64606 GI:2313818 DIP:3120N PIR:G64525 GI:2313121 testaln.mase100644000766000024 175314006427353 16723 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data;; saved by seaview on Wed Jan 7 11:14:33 1998 ;;# of segments=14 all seqs ;; 25,25 27,27 30,30 96,96 98,98 117,117 149,149 160,160 164,164 173,173 ;; 258,258 282,282 289,289 291,291 ;no comment AK1H_ECOLI CPDSINAALICRGEKMSIAIMAGVLEARGH-N--VTVIDPVEKLLAVG-HYLESTVDIAE STRRIAASRIP------A-DHMVLMAGFTAGN-EKGELVVLGRNGSDYSAAVLAACLRAD CCEIWTDVNGVYTCDP-------------RQVPDARLLKSMSYQEAMELSY--FGAKVLH PRTITPIAQFQIPCLIKNTGNPQAPGTL-IG--ASRDEDELP----VKGISNLN------ NMAMFSVSGP-GMKGMVGMAARVFAAMS-------RARISVVLITQSSSEYSISFCVPQS DCVRAERAMLEEFY-----LELKEGLLEPLAVAERLAIISV-VGDGLRTLRGISAKF--- ---FAALARANINIVAIA ;no comment AKH_HAEIN -----------------VEDAVKATIDCRGEKLSIAMMKAWFEARGY-S--VHIVDPVKQ LLAKG-GYLESSVEIEESTKRVDAANIA--K-DKVVLMAGF---TAGNEKGELVLLGRNG SDYSAAC-----------------LAACLGASVCEIWTDVDGVYTCDP--RLVPDARLLP TLSYREAMELSYFGAKVIHPRTIGPLLPQNIPCVIKNTGNPSAPGSI-ID--GNVKSESL Q----VKGITNLDNLAMFNVSGPGMQGM---VGMASRVFSAMSGAGISVILITQSSSEYS ---ISFCVPVKSAEVAKTVLETEFA-----NELNEHQLEPIEVIKDLSIISV-VGDGMKQ AKGIAARF------FSALAQAN transfac.dat100644000766000024 454314006427353 16675 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataVV TRANSFAC MATRIX TABLE, Release 6.4 - licensed - 2002-12-02, (C) Biobase GmbH XX // AC M00001 
XX ID V$MYOD_01 XX DT 19.10.1992 (created); ewi. DT 22.10.1997 (updated); dbo. CO Copyright (C), Biobase GmbH. XX NA MyoD XX DE myoblast determination gene product XX BF T00526; MyoD; Species: mouse, Mus musculus. XX P0 A C G T 01 1 2 2 0 S 02 2 1 2 0 R 03 3 0 1 1 A 04 0 5 0 0 C 05 5 0 0 0 A 06 0 0 4 1 G 07 0 1 4 0 G 08 0 0 0 5 T 09 0 0 5 0 G 10 0 1 2 2 K 11 0 2 0 3 Y 12 1 0 3 1 G XX BA 5 functional elements in 3 genes XX CC no comment XX // AC M00002 XX ID V$E47_01 XX DT 19.10.1992 (created); ewi. DT 18.07.2000 (updated); ewi. CO Copyright (C), Biobase GmbH. XX NA E47 XX DE E47 XX BF T00207; E47; Species: human, Homo sapiens. XX P0 A C G T 01 4 4 3 0 V 02 2 5 4 0 S 03 3 2 4 2 N 04 2 0 9 0 G 05 0 11 0 0 C 06 11 0 0 0 A 07 0 0 11 0 G 08 1 2 8 0 G 09 0 0 0 11 T 10 0 0 11 0 G 11 0 0 4 7 K 12 1 4 3 3 N 13 1 6 2 2 C 14 1 4 4 2 N 15 1 4 2 3 N XX BA 11 selected strong binding sites for E47, E47-MyoD, E12+MyoD and (weak) for BA E12 XX BS R05108; Start: 3; Length: 15; Gaps: 17; Orientation: p. BS R05109; Start: 1; Length: 15; Gaps:; Orientation: p. BS R05110; Start: 5; Length: 15; Gaps:; Orientation: p. BS R05111; Start: 5; Length: 15; Gaps:; Orientation: p. BS R05112; Start: 8; Length: 15; Gaps:; Orientation: p. BS R05113; Start: 9; Length: 15; Gaps:; Orientation: p. BS R05114; Start: 8; Length: 15; Gaps:; Orientation: p. BS R05115; Start: 7; Length: 15; Gaps:; Orientation: p. BS R05116; Start: 11; Length: 15; Gaps:; Orientation: p. BS R05117; Start: 5; Length: 15; Gaps:; Orientation: p. BS R05118; Start: 5; Length: 15; Gaps:; Orientation: p. 
XX sim4.for.rev100644000766000024 30614006427353 16532 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data seq1 = human.genomic, 5368 bp seq2 = hn_est.rev (>REVCOMP), 479 bp (complement) 695-813 (1-119) 100% -> 1377-1500 (120-243) 99% -> 1797-1935 (244-382) 100% -> 2084-2180 (383-479) 100% GlimmerM.out100644000766000024 403314006427353 16636 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataGlimmerM (Version 3.0) Sequence name: gi|23613028|ref|NC_004326.1| Sequence length: 69930 bp Predicted genes/exons Gene Exon Strand Exon Exon Range Exon # # Type Length 1 1 - Single 461 523 63 2 1 + Initial 1095 1103 9 2 2 + Internal 1167 1208 42 2 3 + Terminal 1404 1445 42 3 1 - Single 7055 7162 108 4 1 + Initial 7273 7356 84 4 2 + Internal 7434 7598 165 4 3 + Internal 7861 7898 38 4 4 + Terminal 7957 7975 19 5 1 + Initial 20929 23815 2887 5 2 + Terminal 23910 23956 47 6 1 + Initial 24046 26310 2265 6 2 + Internal 27215 28399 1185 6 3 + Internal 28848 28855 8 6 4 + Internal 29246 29309 64 6 5 + Terminal 29502 29516 15 7 2 - Terminal 29593 29600 8 7 1 - Initial 29919 30636 718 8 8 - Terminal 30737 30744 8 8 7 - Internal 31124 31150 27 8 6 - Internal 31814 31883 70 8 5 - Internal 32469 32515 47 8 4 - Internal 33199 34126 928 8 3 - Internal 34647 34688 42 8 2 - Internal 35216 35334 119 8 1 - Initial 36613 36853 241 9 1 + Initial 36912 37101 190 9 2 + Internal 37285 37300 16 9 3 + Terminal 37999 38005 7 10 5 - Terminal 42047 42104 58 10 4 - Internal 42781 42795 15 10 3 - Internal 42864 42888 25 10 2 - Internal 43211 43217 7 10 1 - Initial 43372 46329 2958 bug3021.gmap100644000766000024 324714006427353 16333 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>NM_004448 0 1 A -17:41277408 2541449272 A 0 2 A -17:41277407 2541449271 A 0 3 A -17:41277406 2541449270 A 0 4 A -17:41277405 2541449269 A 0 5 C -17:41277404 2541449268 C 0 6 T -17:41277403 2541449267 T 0 7 G -17:41277402 2541449266 G 0 8 C -17:41277401 2541449265 C 0 9 G -17:41277400 2541449264 G 0 10 A -17:41277399 
2541449263 A 0 11 C -17:41277398 2541449262 C 0 12 T -17:41277397 2541449261 T 0 13 G -17:41277396 2541449260 G 0 14 C -17:41277395 2541449259 C 0 15 G -17:41277394 2541449258 G 0 16 C -17:41277393 2541449257 C 0 17 G -17:41277392 2541449256 G 0 18 G -17:41277391 2541449255 G 0 19 C -17:41277390 2541449254 C 0 20 G -17:41277389 2541449253 G 0 21 T -17:41277388 2541449252 T 0 22 G -17:41277387 2541449251 G 0 23 A -17:41277386 2541449250 A 0 24 G -17:41277385 2541449249 G 0 25 C -17:41277384 2541449248 C 0 26 T -17:41277383 2541449247 T 0 27 C -17:41277382 2541449246 C 0 28 G -17:41277381 2541449245 G 0 29 C -17:41277380 2541449244 C 0 30 T -17:41277379 2541449243 T 0 31 G -17:41277378 2541449242 G 0 32 A -17:41277377 2541449241 A 0 33 G -17:41277376 2541449240 G 0 34 A -17:41277375 2541449239 A 0 35 C -17:41277374 2541449238 C 0 36 T -17:41277373 2541449237 T 0 37 T -17:41277372 2541449236 T 0 38 C -17:41277371 2541449235 C 0 39 C -17:41277370 2541449234 C 0 40 T -17:41277369 2541449233 T 0 41 G -17:41277368 2541449232 G 0 42 G -17:41277367 2541449231 G 0 43 A -17:41277366 2541449230 A 0 44 C -17:41277365 2541449229 C 0 45 C -17:41277364 2541449228 G 0 46 C -17:41277363 2541449227 G 0 47 C -17:41277362 2541449226 G 0 48 G -17:41277361 2541449225 G 0 49 C -17:41277360 2541449224 G 0 50 A -17:41277359 2541449223 A masta.t100644000766000024 242314006427353 16531 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Matrix/IO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 16); use_ok('Bio::Matrix::PSM::IO'); } #Let's try masta formats here my $mio = Bio::Matrix::PSM::IO->new(-format=>'masta', -file=>test_input_file('masta.dat')); my $masta_w_dat = test_output_file(); my $wmio=Bio::Matrix::PSM::IO->new(-format=>'masta', -file=>">".$masta_w_dat); $wmio->_flush_on_write(1); ok $mio; ok $wmio; my @cons; my $carry; while (my $site=$mio->next_matrix) { ok $site; push @cons,$site->consensus; $carry=$site if 
($site->id eq 'm1logs'); } is $cons[0],'CAGAAAAATNGAATNCCCACCCCCC'; is $cons[1],'CAGAAAAATAGAATCCCCACCCCCC'; is $cons[2],'CAGAAAAATNNAATNCCCACCNCCC'; $wmio->write_psm($carry,'PWM'); $carry->id('m1freq'); $wmio->write_psm($carry,'PFM'); $carry->id('m1seq'); $wmio->write_psm($carry,'SEQ'); $wmio->DESTROY; my $chio=Bio::Matrix::PSM::IO->new(-format=>'masta', -file=>$masta_w_dat); ok $chio; my $site=$chio->next_matrix; is $site->id,'m1logs'; is $site->consensus,'CAGAAAAATAGAATCCCCACCCCCC'; $site=$chio->next_matrix; is $site->id,'m1freq'; is $site->consensus,'CAGAAAAATAGAATCCCCACCCCCC'; $site=$chio->next_matrix; is $site->id,'m1seq'; is $site->consensus,'CAGAAAAATAGAATCCCCACCCCCC'; examples000755000766000024 014006427353 14676 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8longorf.pl100644000766000024 1022114006427353 17055 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/examples#!/usr/bin/perl # longorf.pl v0208020920 # (c) Dan Kortschak 2002 use vars qw($USAGE); use strict; use Getopt::Long; use Bio::SeqIO; $USAGE = "longorf [--help] [--notstrict] [--verbose] [--graph] [--width printwidth] [--format seqformat] --input seqfile\n"; my ($sequencefile,$sequenceformat,$notstrict,$graph,$verb,$printwidth,$help) = (undef, 'fasta', undef, undef,undef,50, undef); &GetOptions('input|i=s' => \$sequencefile, 'format|f=s' => \$sequenceformat, 'notstrict|n' => \$notstrict, 'width|w=s' => \$printwidth, 'graph|g' => \$graph, 'verbose|v' => \$verb, 'help|h' => \$help, ); if ($help) { exec('perldoc', $0); die; } if (!defined $sequencefile) { die($USAGE . 
"\nPlease specify an input filename.\n"); } sub longestORF { my $best=0; my ($bests,$beste,$beststrand)=(-1,-1,0); my $bestorf=""; my $relaxed=$_[1]; my $dna=Bio::Seq->new(-seq => $_[0]); my %strand=('+'=>$dna->seq, '-'=>$dna->revcom->seq); foreach my $direction (keys %strand) { my @starts=(); my @ends=(); if ($relaxed) { for (my $frame=0;$frame<3;$frame++) { unless ($strand{$direction}=~m/^.{$frame}(taa|tga|tag)/i) { push @starts,$frame+1; } } } while ($strand{$direction}=~m/(atg)/gi) { push @starts,pos($strand{$direction})-2; } while ($strand{$direction}=~m/(taa|tga|tag)/gi) { push @ends,pos($strand{$direction})-2; } push @ends,($dna->length-2,$dna->length-1,$dna->length); for my $s (@starts) { for my $e (@ends) { if ($e%3==$s%3 and $e>$s) { if ($e-$s>$best) { $best=$e-$s; ($bests,$beste,$beststrand)=($s,$e,$direction); $bestorf=Bio::Seq->new(-seq=>$strand{$direction})->subseq($s,$e); } last } else { next } } } } return ($best,$bests,$beste,$beststrand,$bestorf); } my $seqio = new Bio::SeqIO('-format' => $sequenceformat, '-file' => $sequencefile ); my ($length,$start,$end,$direction,$sequence); my $count=0; my @lengths; my $totallength=0; while (my $dna = $seqio->next_seq) { $count++; ($length,$start,$end,$direction,$sequence)=longestORF($dna->seq,$notstrict); if ($verb) { print $dna->display_id," ",$dna->desc,": "; print "$length, $start, $end ($direction)\n$sequence\n\n",Bio::Seq->new(-seq=>$sequence)->translate->seq,"\n\n--\n\n"; } $totallength+=$length; $lengths[$length/3]++; } print "Average ORF length: ", $totallength/$count,"\n\n"; print "Length distribution is:\n"; if ($graph) { my $length; my $maxlength=0; for ($length=0;$length<@lengths;$length++) { $lengths[$length]=0 unless $lengths[$length]; $maxlength=$lengths[$length] if ($lengths[$length]>$maxlength); } for ($length=0;$length<@lengths;$length++) { print $length*3,"\t",$lengths[$length],"\t|"; print "#"x(($lengths[$length])*$printwidth/$maxlength); print "\n"; } } else { for 
($length=0;$length<@lengths;$length++) { print $length*3,"\t",($lengths[$length]or"0"),"\n"; } } __END__ =head1 NAME longorf.pl - perl script to find the longest ORF of a sequence =head1 SYNOPSIS % longorf.pl [-h] [-n] [-v] [-g] [-w printwidth] [-f seqformat] -i seqfile =head1 DESCRIPTION This script will examine a set of nucleotide sequences and determine the longest ORF in each sequence. ORFs may start at the canonical ATG or at the beginning of the sequence if the notstrict option is chosen. The script will output a list of the longest ORF lengths, starts, ends and strands with the ORF and amino acid sequence if the verbose option is chosen. A histogram of the longest ORFs in the input set may be printed by choosing the graph option. =head1 FEEDBACK This script is not supported by anyone. =head1 AUTHOR - Dan Kortschak =cut SeqUtils.pm100644000766000024 16173714006427353 16665 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio# BioPerl module for Bio::SeqUtils # # Please direct questions and support issues to # # Cared for by Heikki Lehvaslaiho # # Copyright Heikki Lehvaslaiho # # You may distribute this module under the same terms as perl itself # POD documentation - main docs before the code =head1 NAME Bio::SeqUtils - Additional methods for PrimarySeq objects =head1 SYNOPSIS use Bio::SeqUtils; # get a Bio::PrimarySeqI compliant object, $seq, somehow $util = Bio::SeqUtils->new(); $polypeptide_3char = $util->seq3($seq); # or $polypeptide_3char = Bio::SeqUtils->seq3($seq); # set the sequence string (stored in one char code in the object) Bio::SeqUtils->seq3($seq, $polypeptide_3char); # translate a sequence in all six frames @seqs = Bio::SeqUtils->translate_6frames($seq); # inplace editing of the sequence Bio::SeqUtils->mutate($seq, Bio::LiveSeq::Mutation->new(-seq => 'c', -pos => 3 )); # mutate a sequence to desired similarity% $newseq = Bio::SeqUtils-> evolve ($seq, $similarity, $transition_transversion_rate); # concatenate two or more sequences with 
annotations and features, # the first sequence will be modified Bio::SeqUtils->cat(@seqs); my $catseq=$seqs[0]; # truncate a sequence, retaining features and adjusting their # coordinates if necessary my $truncseq = Bio::SeqUtils->trunc_with_features($seq, 100, 200); # reverse complement a sequence and its features my $revcomseq = Bio::SeqUtils->revcom_with_features($seq); # simulate cloning of a fragment into a vector. Cut the vector at # positions 1000 and 1100 (deleting positions 1001 to 1099) and # "ligate" a fragment into the sites. The fragment is # reverse-complemented in this example (option "flip"). # All features of the vector and fragment are preserved and # features that are affected by the deletion/insertion are # modified accordingly. # $vector and $fragment must be Bio::SeqI compliant objects my $new_molecule = Bio::SeqUtils->ligate( -vector => $vector, -fragment => $fragment, -left => 1000, -right => 1100, -flip => 1 ); # delete a segment of a sequence (from pos 1000 to 1100, inclusive), # again preserving features and annotations my $new_molecule = Bio::SeqUtils->delete( $seq, 1000, 1100 ); # insert a fragment into a recipient between positions 1000 and # 1001. $recipient_seq is a Bio::SeqI compliant object my $new_molecule = Bio::SeqUtils->insert( $recipient_seq, $fragment_seq, 1000 ); =head1 DESCRIPTION This class is a holder of methods that work on Bio::PrimarySeqI- compliant sequence objects, e.g. Bio::PrimarySeq and Bio::Seq. These methods are not part of the Bio::PrimarySeqI interface and should in general not be essential to the primary function of sequence objects. If you are thinking of adding essential functions, it might be better to create your own sequence class. See L<Bio::PrimarySeqI>, L<Bio::PrimarySeq>, and L<Bio::Seq> for more. The methods take as their first argument a sequence object. It is possible to use methods without first creating a SeqUtils object, i.e. use it as an anonymous hash. 
The first two methods, seq3() and seq3in(), give out or read in protein sequences coded in three letter IUPAC amino acid codes. The next two methods, translate_3frames() and translate_6frames(), wrap around the standard translate method to give back an array of three forward or all six frame translations. The mutate() method mutates the sequence string with a mutation description object. The cat() method concatenates two or more sequences. The first sequence is modified by addition of the remaining sequences. All annotations and sequence features will be transferred. The revcom_with_features() and trunc_with_features() methods are similar to the revcom() and trunc() methods from Bio::Seq, but also adjust any features associated with the sequence as appropriate. There are also methods that simulate molecular cloning with rich sequence objects. The delete() method cuts a segment out of a sequence and re-joins the left and right fragments (like splicing or digesting and re-ligating a molecule). Positions (and types) of sequence features are adjusted accordingly: Features that span the deleted segment are converted to split features to indicate the disruption. (Sub)Features that extend into the deleted segment are truncated. A new molecule is created and returned. The insert() method inserts a fragment (which can be a rich Bio::Seq object) into another sequence object adding all annotations and features to the final product. Features that span the insertion site are converted to split features to indicate the disruption. A new feature is added to indicate the inserted fragment itself. A new molecule is created and returned. The ligate() method simulates digesting a recipient (vector) and ligating a fragment into it, which can also be flipped if needed. It is simply a combination of a deletion and an insertion step and returns a new molecule. The rules for modifying feature locations outlined above are also used here, e.g. 
features that span the cut sites are converted to split features with truncated sub-locations. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to one of the Bioperl mailing lists. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Support Please direct usage questions or support issues to the mailing list: I<bioperl-l@bioperl.org> rather than to the module maintainer directly. Many experienced and responsive experts will be able to look at the problem and quickly address it. Please include a thorough description of the problem with code and data examples if at all possible. =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Heikki Lehvaslaiho Email: heikki-at-bioperl-dot-org =head1 CONTRIBUTORS Roy R. Chaudhuri - roy.chaudhuri at gmail.com Frank Schwach - frank.schwach@sanger.ac.uk =head1 APPENDIX The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ =cut # Let the code begin... 
package Bio::SeqUtils; $Bio::SeqUtils::VERSION = '1.7.8'; use strict; use warnings; use Scalar::Util qw(blessed); use parent qw(Bio::Root::Root); # new inherited from RootI our %ONECODE = ( 'Ala' => 'A', 'Asx' => 'B', 'Cys' => 'C', 'Asp' => 'D', 'Glu' => 'E', 'Phe' => 'F', 'Gly' => 'G', 'His' => 'H', 'Ile' => 'I', 'Lys' => 'K', 'Leu' => 'L', 'Met' => 'M', 'Asn' => 'N', 'Pro' => 'P', 'Gln' => 'Q', 'Arg' => 'R', 'Ser' => 'S', 'Thr' => 'T', 'Val' => 'V', 'Trp' => 'W', 'Xaa' => 'X', 'Tyr' => 'Y', 'Glx' => 'Z', 'Ter' => '*', 'Sec' => 'U', 'Pyl' => 'O', 'Xle' => 'J' ); our %THREECODE = ( 'A' => 'Ala', 'B' => 'Asx', 'C' => 'Cys', 'D' => 'Asp', 'E' => 'Glu', 'F' => 'Phe', 'G' => 'Gly', 'H' => 'His', 'I' => 'Ile', 'K' => 'Lys', 'L' => 'Leu', 'M' => 'Met', 'N' => 'Asn', 'P' => 'Pro', 'Q' => 'Gln', 'R' => 'Arg', 'S' => 'Ser', 'T' => 'Thr', 'V' => 'Val', 'W' => 'Trp', 'Y' => 'Tyr', 'Z' => 'Glx', 'X' => 'Xaa', '*' => 'Ter', 'U' => 'Sec', 'O' => 'Pyl', 'J' => 'Xle' ); =head2 seq3 Title : seq3 Usage : $string = Bio::SeqUtils->seq3($seq) Function: Read only method that returns the amino acid sequence as a string of three letter codes. The alphabet has to be 'protein'. Output follows the IUPAC standard plus 'Ter' for terminator. Any unknown character, including the default unknown character 'X', is changed into 'Xaa'. The non-standard amino acid selenocysteine is recognized (Sec, U). 
Returns : A scalar Args : character used for stop in the protein sequence optional, defaults to '*' string used to separate the output amino acid codes, optional, defaults to '' =cut sub seq3 { my ( $self, $seq, $stop, $sep ) = @_; $seq->isa('Bio::PrimarySeqI') || $self->throw('Not a Bio::PrimarySeqI object but [$self]'); $seq->alphabet eq 'protein' || $self->throw('Not a protein sequence'); if ( defined $stop ) { length $stop != 1 and $self->throw('One character stop needed, not [$stop]'); $THREECODE{$stop} = "Ter"; } $sep ||= ''; my $aa3s; foreach my $aa ( split //, uc $seq->seq ) { $THREECODE{$aa} and $aa3s .= $THREECODE{$aa} . $sep, next; $aa3s .= 'Xaa' . $sep; } $sep and substr( $aa3s, -( length $sep ), length $sep ) = ''; return $aa3s; } =head2 seq3in Title : seq3in Usage : $seq = Bio::SeqUtils->seq3in($seq, 'MetGlyTer') Function: Method for changing of the sequence of a Bio::PrimarySeqI sequence object. The three letter amino acid input string is converted into one letter code. Any unknown character triplet, including the default 'Xaa', is converted into 'X'. 
Returns : Bio::PrimarySeq object Args : sequence string optional character to be used for stop in the protein sequence, defaults to '*' optional character to be used for unknown in the protein sequence, defaults to 'X' =cut sub seq3in { my ( $self, $seq, $string, $stop, $unknown ) = @_; $seq->isa('Bio::PrimarySeqI') || $self->throw("Not a Bio::PrimarySeqI object but [$self]"); $seq->alphabet eq 'protein' || $self->throw('Not a protein sequence'); if ( defined $stop ) { length $stop != 1 and $self->throw("One character stop needed, not [$stop]"); $ONECODE{'Ter'} = $stop; } if ( defined $unknown ) { length $unknown != 1 and $self->throw("One character stop needed, not [$unknown]"); $ONECODE{'Xaa'} = $unknown; } my ( $aas, $aa3 ); my $length = ( length $string ) - 2; for ( my $i = 0 ; $i < $length ; $i += 3 ) { $aa3 = substr( $string, $i, 3 ); $aa3 = ucfirst( lc($aa3) ); $ONECODE{$aa3} and $aas .= $ONECODE{$aa3}, next; $aas .= $ONECODE{'Xaa'}; } $seq->seq($aas); return $seq; } =head2 translate_3frames Title : translate_3frames Usage : @prots = Bio::SeqUtils->translate_3frames($seq) Function: Translate a nucleotide sequence in three forward frames. The IDs of the sequences are appended with '-0F', '-1F', '-2F'. Returns : An array of seq objects Args : sequence object same arguments as to Bio::PrimarySeqI::translate =cut sub translate_3frames { my ( $self, $seq, @args ) = @_; $self->throw( 'Object [$seq] ' . 'of class [' . ref($seq) . '] can not be translated.' ) unless $seq->can('translate'); my ( $stop, $unknown, $frame, $tableid, $fullCDS, $throw ) = @args; my @seqs; my $f = 0; while ( $f != 3 ) { my $translation = $seq->translate( $stop, $unknown, $f, $tableid, $fullCDS, $throw ); $translation->id( $seq->id . "-" . $f . 
"F" ); push @seqs, $translation; $f++; } return @seqs; } =head2 translate_6frames Title : translate_6frames Usage : @prots = Bio::SeqUtils->translate_6frames($seq) Function: translate a nucleotide sequence in all six frames The IDs of the sequences are appended with '-0F', '-1F', '-2F', '-0R', '-1R', '-2R'. Returns : An array of seq objects Args : sequence object same arguments as to Bio::PrimarySeqI::translate =cut sub translate_6frames { my ( $self, $seq, @args ) = @_; my @seqs = $self->translate_3frames( $seq, @args ); my @seqs2 = $self->translate_3frames( $seq->revcom, @args ); foreach my $seq2 (@seqs2) { my ($tmp) = $seq2->id; $tmp =~ s/F$/R/g; $seq2->id($tmp); } return @seqs, @seqs2; } =head2 valid_aa Title : valid_aa Usage : my @aa = $table->valid_aa Function: Retrieves a list of the valid amino acid codes. The list is ordered so that first 21 codes are for unique amino acids. The rest are ['B', 'Z', 'X', '*']. Returns : array of all the valid amino acid codes Args : [optional] $code => [0 -> return list of 1 letter aa codes, 1 -> return list of 3 letter aa codes, 2 -> return associative array of both ] =cut sub valid_aa { my ( $self, $code ) = @_; if ( !$code ) { my @codes; foreach my $c ( sort values %ONECODE ) { push @codes, $c unless ( $c =~ /[BZX\*]/ ); } push @codes, qw(B Z X *); # so they are in correct order ? return @codes; } elsif ( $code == 1 ) { my @codes; foreach my $c ( sort keys %ONECODE ) { push @codes, $c unless ( $c =~ /(Asx|Glx|Xaa|Ter)/ ); } push @codes, ( 'Asx', 'Glx', 'Xaa', 'Ter' ); return @codes; } elsif ( $code == 2 ) { my %codes = %ONECODE; foreach my $c ( keys %ONECODE ) { my $aa = $ONECODE{$c}; $codes{$aa} = $c; } return %codes; } else { $self->warn( "unrecognized code in " . ref($self) . " method valid_aa()" ); return (); } } =head2 mutate Title : mutate Usage : Bio::SeqUtils->mutate($seq,$mutation1, $mutation2); Function: Inplace editing of the sequence. 
The second argument can be a Bio::LiveSeq::Mutation object or an array of them. The mutations are applied sequentially checking only that their position is within the current sequence. Insertions are inserted before the given position. Returns : boolean Args : sequence object mutation, a Bio::LiveSeq::Mutation object, or an array of them See L. =cut sub mutate { my ( $self, $seq, @mutations ) = @_; $self->throw( 'Object [$seq] ' . 'of class [' . ref($seq) . '] should be a Bio::PrimarySeqI ' ) unless $seq->isa('Bio::PrimarySeqI'); $self->throw( 'Object [$mutations[0]] ' . 'of class [' . ref( $mutations[0] ) . '] should be a Bio::LiveSeq::Mutation' ) unless $mutations[0]->isa('Bio::LiveSeq::Mutation'); foreach my $mutation (@mutations) { $self->throw('Attempting to mutate sequence beyond its length') unless $mutation->pos - 1 <= $seq->length; my $string = $seq->seq; substr $string, $mutation->pos - 1, $mutation->len, $mutation->seq; $seq->seq($string); } 1; } =head2 cat Title : cat Usage : Bio::SeqUtils->cat(@seqs); my $catseq=$seqs[0]; Function: Concatenates a list of Bio::Seq objects, adding them all on to the end of the first sequence. Annotations and sequence features are copied over from any additional objects, and the coordinates of any copied features are adjusted appropriately. Returns : a boolean Args : array of sequence objects Note that annotations have no sequence locations. If you concatenate sequences with the same annotations they will all be added. =cut sub cat { my ( $self, $seq, @seqs ) = @_; $self->throw( 'Object [$seq] ' . 'of class [' . ref($seq) . '] should be a Bio::PrimarySeqI ' ) unless $seq->isa('Bio::PrimarySeqI'); for my $catseq (@seqs) { $self->throw( 'Object [$catseq] ' . 'of class [' . ref($catseq) . '] should be a Bio::PrimarySeqI ' ) unless $catseq->isa('Bio::PrimarySeqI'); $self->throw( 'Trying to concatenate sequences with different alphabets: ' . $seq->display_id . '(' . $seq->alphabet . ') and ' . $catseq->display_id . '(' . 
$catseq->alphabet . ')' ) unless $catseq->alphabet eq $seq->alphabet; my $length = $seq->length; $seq->seq( $seq->seq . $catseq->seq ); # move annotations if ( $seq->isa("Bio::AnnotatableI") and $catseq->isa("Bio::AnnotatableI") ) { foreach my $key ( $catseq->annotation->get_all_annotation_keys() ) { foreach my $value ( $catseq->annotation->get_Annotations($key) ) { $seq->annotation->add_Annotation( $key, $value ); } } } # move SeqFeatures if ( $seq->isa('Bio::SeqI') and $catseq->isa('Bio::SeqI') ) { for my $feat ( $catseq->get_SeqFeatures ) { $seq->add_SeqFeature( $self->_coord_adjust( $feat, $length ) ); } } } 1; } =head2 trunc_with_features Title : trunc_with_features Usage : $trunc=Bio::SeqUtils->trunc_with_features($seq, $start, $end); Function: Like Bio::Seq::trunc, but keeps features (adjusting coordinates where necessary. Features that partially overlap the region have their location changed to a Bio::Location::Fuzzy. Returns : A new sequence object Args : A sequence object, start coordinate, end coordinate (inclusive) =cut sub trunc_with_features { use Bio::Range; my ( $self, $seq, $start, $end ) = @_; $self->throw( 'Object [$seq] ' . 'of class [' . ref($seq) . 
'] should be a Bio::SeqI ' ) unless $seq->isa('Bio::SeqI'); my $trunc = $seq->trunc( $start, $end ); my $truncrange = Bio::Range->new( -start => $start, -end => $end, -strand => 0 ); # make sure that there is no annotation or features in $trunc # (->trunc() now clone objects except for Bio::Seq::LargePrimarySeq) $trunc->annotation->remove_Annotations; $trunc->remove_SeqFeatures; # move annotations foreach my $key ( $seq->annotation->get_all_annotation_keys() ) { foreach my $value ( $seq->annotation->get_Annotations($key) ) { $trunc->annotation->add_Annotation( $key, $value ); } } # move features foreach ( grep { $_ = $self->_coord_adjust( $_, 1 - $start, $end + 1 - $start ) if $_->overlaps($truncrange) } $seq->get_SeqFeatures ) { $trunc->add_SeqFeature($_); } return $trunc; } =head2 delete Title : delete Function: cuts a segment out of a sequence and re-joins the left and right fragments (like splicing or digesting and re-ligating a molecule). Positions (and types) of sequence features are adjusted accordingly: Features that span the cut site are converted to split featuress to indicate the disruption. Features that extend into the cut-out fragment are truncated. A new molecule is created and returned. Usage : my $cutseq = Bio::SeqUtils::PbrTools->cut( $seq, 1000, 1100 ); Args : a Bio::PrimarySeqI compliant object to cut, first nt of the segment to be deleted last nt of the segment to be deleted optional: hash-ref of options: clone_obj: if true, clone the input sequence object rather than calling "new" on the object's class Returns : a new Bio::Seq object =cut sub delete { my $self = shift; my ( $seq, $left, $right, $opts_ref ) = @_; $self->throw( 'was expecting 3-4 paramters but got ' . @_ ) unless @_ == 3 || @_ == 4; $self->throw( 'Object of class [' . ref($seq) . 
'] should be a Bio::PrimarySeqI ' ) unless blessed($seq) && $seq->isa('Bio::PrimarySeqI'); $self->throw("Left coordinate ($left) must be >= 1") if $left < 1; if ( $right > $seq->length ) { $self->throw( "Right coordinate ($right) must be less than " . 'sequence length (' . $seq->length . ')' ); } # piece together the sequence string of the remaining fragments my $left_seq = $seq->subseq( 1, $left - 1 ); my $right_seq = $seq->subseq( $right + 1, $seq->length ); if ( !$left_seq || !$right_seq ) { $self->throw( 'could not assemble sequences. At least one of the fragments is empty' ); } my $seq_str = $left_seq . $right_seq; # create the new seq object with the same class as the recipient # or (if requested), make a clone of the existing object. In the # latter case we need to remove sequence features from the cloned # object instead of copying them my $product; if ( $opts_ref->{clone_obj} ) { $product = $self->_new_seq_via_clone( $seq, $seq_str ); } else { $product = $self->_new_seq_from_old( $seq, { seq => $seq_str } ); } # move sequence features if ( $product->isa('Bio::SeqI') && $seq->isa('Bio::SeqI') ) { for my $feat ( $seq->get_SeqFeatures ) { my $adjfeat = $self->_coord_adjust_deletion( $feat, $left, $right ); $product->add_SeqFeature($adjfeat) if $adjfeat; } } # add a feature to annotatde the deletion my $deletion_feature = Bio::SeqFeature::Generic->new( -primary_tag => 'misc_feature', -tag => { note => 'deletion of ' . ( $right - $left + 1 ) . 'bp' }, -location => Bio::Location::Simple->new( -start => $left - 1, -end => $left, -location_type => 'IN-BETWEEN' ) ); $product->add_SeqFeature($deletion_feature); return $product; } =head2 insert Title : insert Function: inserts a fragment (a Bio::Seq object) into a nother sequence object adding all annotations and features to the final product. Features that span the insertion site are converted to split features to indicate the disruption. A new feature is added to indicate the inserted fragment itself. 
A new molecule is created and returned. Usage : # insert a fragment after pos 1000 my $insert_seq = Bio::SeqUtils::PbrTools->insert( $recipient_seq, $fragment_seq, 1000 ); Args : recipient sequence (a Bio::PrimarySeqI compliant object), a fragmetn to insert (Bio::PrimarySeqI compliant object), insertion position (fragment is inserted to the right of this pos) pos=0 will prepend the fragment to the recipient optional: hash-ref of options: clone_obj: if true, clone the input sequence object rather than calling "new" on the object's class Returns : a new Bio::Seq object =cut sub insert { my $self = shift; my ( $recipient, $fragment, $insert_pos, $opts_ref ) = @_; $self->throw( 'was expecting 3-4 paramters but got ' . @_ ) unless @_ == 3 || @_ == 4; $self->throw( 'Recipient object of class [' . ref($recipient) . '] should be a Bio::PrimarySeqI ' ) unless blessed($recipient) && $recipient->isa('Bio::PrimarySeqI'); $self->throw( 'Fragment object of class [' . ref($fragment) . '] should be a Bio::PrimarySeqI ' ) unless blessed($fragment) && $fragment->isa('Bio::PrimarySeqI'); $self->throw( 'Can\'t concatenate sequences with different alphabets: ' . 'recipient is ' . $recipient->alphabet . ' and fragment is ' . $fragment->alphabet ) unless $recipient->alphabet eq $fragment->alphabet; if ( $insert_pos < 0 or $insert_pos > $recipient->length ) { $self->throw( "insertion position ($insert_pos) must be between 0 and " . 'recipient sequence length (' . $recipient->length . ')' ); } if ( $fragment->can('is_circular') && $fragment->is_circular ) { $self->throw('Can\'t insert circular fragments'); } if ( !$recipient->seq ) { $self->throw( 'Recipient has no sequence, can not insert into this object'); } # construct raw sequence of the new molecule my $left_seq = $insert_pos > 0 ? $recipient->subseq( 1, $insert_pos ) : ''; my $mid_seq = $fragment->seq; my $right_seq = $insert_pos < $recipient->length ? 
$recipient->subseq( $insert_pos + 1, $recipient->length ) : ''; my $seq_str = $left_seq . $mid_seq . $right_seq; # create the new seq object with the same class as the recipient # or (if requested), make a clone of the existing object. In the # latter case we need to remove sequence features from the cloned # object instead of copying them my $product; if ( $opts_ref->{clone_obj} ) { $product = $self->_new_seq_via_clone( $recipient, $seq_str ); } else { my @desc; push @desc, 'Inserted fragment: ' . $fragment->desc if defined $fragment->desc; push @desc, 'Recipient: ' . $recipient->desc if defined $recipient->desc; $product = $self->_new_seq_from_old( $recipient, { seq => $seq_str, display_id => $recipient->display_id, accession_number => $recipient->accession_number || '', alphabet => $recipient->alphabet, desc => join( '; ', @desc ), verbose => $recipient->verbose || $fragment->verbose, is_circular => $recipient->is_circular || 0, } ); } # if clone_obj # move annotations from fragment to product if ( $product->isa("Bio::AnnotatableI") && $fragment->isa("Bio::AnnotatableI") ) { foreach my $key ( $fragment->annotation->get_all_annotation_keys ) { foreach my $value ( $fragment->annotation->get_Annotations($key) ) { $product->annotation->add_Annotation( $key, $value ); } } } # move sequence features to product with adjusted coordinates if ( $product->isa('Bio::SeqI') ) { # for the fragment, just shift the features to new position if ( $fragment->isa('Bio::SeqI') ) { for my $feat ( $fragment->get_SeqFeatures ) { my $adjfeat = $self->_coord_adjust( $feat, $insert_pos ); $product->add_SeqFeature($adjfeat) if $adjfeat; } } # for recipient, shift and modify features according to insertion. 
        if ( $recipient->isa('Bio::SeqI') ) {
            for my $feat ( $recipient->get_SeqFeatures ) {
                my $adjfeat =
                  $self->_coord_adjust_insertion( $feat, $insert_pos,
                    $fragment->length );
                $product->add_SeqFeature($adjfeat) if $adjfeat;
            }
        }
    }

    # add a feature to annotate the insertion
    my $insertion_feature = Bio::SeqFeature::Generic->new(
        -start       => $insert_pos + 1,
        -end         => $insert_pos + $fragment->length,
        -primary_tag => 'misc_feature',
        -tag         => { note => 'inserted fragment' },
    );
    $product->add_SeqFeature($insertion_feature);
    return $product;
}

=head2 ligate

  Title   : ligate
  Function: pastes a fragment (which can also have features) into a recipient
            sequence between two "cut" sites, preserving features and adjusting
            their locations.
            This is a shortcut for deleting a segment from a sequence object
            followed by an insertion of a fragment and is supposed to be used
            to simulate in-vitro cloning where a recipient (a vector) is
            digested and a fragment is then ligated into the recipient
            molecule. The fragment can be flipped (reverse-complemented with
            all its features).
            A new sequence object is returned to represent the product of the
            reaction.
            Features and annotations are transferred from the insert to the
            product and features on the recipient are adjusted according to
            the methods L</delete> and L</insert>:
            Features spanning the insertion site will be split up into two
            sub-locations.
            (Sub-)features in the deleted region are themselves deleted.
            (Sub-)features that extend into the deleted region are truncated.
            The class of the product object depends on the class of the
            recipient (vector) sequence object. If it is not possible to
            instantiate a new object of that class, a Bio::PrimarySeq object
            is created instead.
  Usage   : # insert the flipped fragment between positions 1000 and 1100 of the
            # vector, i.e.
everything between these two positions is deleted and # replaced by the fragment my $new_molecule = Bio::Sequtils::Pbrtools->ligate( -recipient => $vector, -fragment => $fragment, -left => 1000, -right => 1100, -flip => 1, -clone_obj => 1 ); args : recipient: the recipient/vector molecule fragment: molecule that is to be ligated into the vector left: left cut site (fragment will be inserted to the right of this position) optional: right: right cut site (fragment will be inseterted to the left of this position). defaults to left+1 flip: boolean, if true, the fragment is reverse-complemented (including features) before inserting clone_obj: if true, clone the recipient object to create the product instead of calling "new" on its class returns : a new Bio::Seq object of the ligated fragments =cut sub ligate { my $self = shift; my ( $recipient, $fragment, $left, $right, $flip, $clone_obj ) = $self->_rearrange( [qw(RECIPIENT FRAGMENT LEFT RIGHT FLIP CLONE_OBJ )], @_ ); $self->throw("missing required parameter 'recipient'") unless $recipient; $self->throw("missing required parameter 'fragment'") unless $fragment; $self->throw("missing required parameter 'left'") unless defined $left; $right ||= $left + 1; $self->throw( "Fragment must be a Bio::PrimarySeqI compliant object but it is a " . ref($fragment) ) unless blessed($fragment) && $fragment->isa('Bio::PrimarySeqI'); $fragment = $self->revcom_with_features($fragment) if $flip; my $opts_ref = {}; $opts_ref->{clone_obj} = 1 if $clone_obj; # clone in two steps: first delete between the insertion sites, # then insert the fragment. Step 1 is skipped if insert positions # are adjacent (no deletion) my ( $product1, $product2 ); eval { if ( $right == $left + 1 ) { $product1 = $recipient; } else { $product1 = $self->delete( $recipient, $left + 1, $right - 1, $opts_ref ); } }; $self->throw( "Failed in step 1 (cut recipient): " . 
$@ ) if $@; eval { $product2 = $self->insert( $product1, $fragment, $left, $opts_ref ) }; $self->throw( "Failed in step 2 (insert fragment): " . $@ ) if $@; return $product2; } =head2 _coord_adjust_deletion title : _coord_adjust_deletion function: recursively adjusts coordinates of seqfeatures on a molecule where a segment has been deleted. (sub)features that span the deletion site become split features. (sub)features that extend into the deletion site are truncated. A note is added to the feature to inform about the size and position of the deletion. usage : my $adjusted_feature = Bio::Sequtils::_coord_adjust_deletion( $feature, $start, $end ); args : a Bio::SeqFeatureI compliant object, start (inclusive) position of the deletion site, end (inclusive) position of the deletion site returns : a Bio::SeqFeatureI compliant object =cut sub _coord_adjust_deletion { my ( $self, $feat, $left, $right ) = @_; $self->throw( 'object [$feat] ' . 'of class [' . ref($feat) . '] should be a Bio::SeqFeatureI ' ) unless $feat->isa('Bio::SeqFeatureI'); $self->throw('missing coordinates: need a left and a right position') unless defined $left && defined $right; if ( $left > $right ) { if ( $feat->can('is_circular') && $feat->is_circular ) { # todo handle circular molecules $self->throw( 'can not yet handle deletions in circular molecules if deletion spans origin' ); } else { $self->throw( "left coordinate ($left) must be less than right ($right)" . 
" but it was greater" ); } } my $deletion = Bio::Location::Simple->new( -start => $left, -end => $right, ); my $del_length = $right - $left + 1; my @adjsubfeat; for my $subfeat ( $feat->get_SeqFeatures ) { my $adjsubfeat = $self->_coord_adjust_deletion( $subfeat, $left, $right ); push @adjsubfeat, $adjsubfeat if $adjsubfeat; } my @loc; my $note; for ( $feat->location->each_Location ) { next if $deletion->contains($_); # this location will be deleted; my $strand = $_->strand; my $type = $_->location_type; my $start = $_->start; my $start_type = $_->can('start_pos_type') ? $_->start_pos_type : undef; my $end = $_->end; my $end_type = $_->can('end_pos_type') ? $_->end_pos_type : undef; my @newcoords = (); if ( $start < $deletion->start && $end > $deletion->end ) { # split the feature @newcoords = ( [ $start, ( $deletion->start - 1 ), $start_type, $end_type ], [ ( $deletion->start ), $end - $del_length, $start_type, $end_type ] ); $note = $del_length . 'bp internal deletion between pos ' . ( $deletion->start - 1 ) . ' and ' . $deletion->start; } elsif ( $_->start < $deletion->start && $_->end >= $deletion->start ) { # truncate feature end @newcoords = ( [ $start, ( $deletion->start - 1 ), $start_type, $end_type ] ); $note = ( $end - $deletion->start + 1 ) . 'bp deleted from feature '; if ( $feat->strand ) { $note .= $feat->strand == 1 ? "3' " : "5' "; } $note .= 'end'; } elsif ( $_->start <= $deletion->end && $_->end > $deletion->end ) { # truncate feature start and shift end @newcoords = ( [ ( $deletion->start ), $end - $del_length, $start_type, $end_type ] ); $note = ( $deletion->end - $start + 1 ) . 'bp deleted from feature '; if ( $feat->strand ) { $note .= $feat->strand == 1 ? 
"5' end" : "3' end"; } else { $note .= 'start'; } } elsif ( $start >= $deletion->end ) { # just shift entire location @newcoords = ( [ $start - $del_length, $end - $del_length, $start_type, $end_type ] ); } else { # not affected by deletion @newcoords = ( [ $start, $end, $start_type, $end_type ] ); } # if we have no coordinates, we return nothing # the feature is deleted return unless @newcoords; my @subloc = $self->_location_objects_from_coordinate_list( \@newcoords, $strand, $type ); push @loc, $self->_single_loc_object_from_collection(@subloc); } # each location # create new feature based on original one and move annotation across my $newfeat = Bio::SeqFeature::Generic->new( -primary => $feat->primary_tag ); foreach my $key ( $feat->annotation->get_all_annotation_keys() ) { foreach my $value ( $feat->annotation->get_Annotations($key) ) { $newfeat->annotation->add_Annotation( $key, $value ); } } foreach my $key ( $feat->get_all_tags() ) { $newfeat->add_tag_value( $key, $feat->get_tag_values($key) ); } # If we have a note about the deleted bases, add it if ($note) { $newfeat->add_tag_value( 'note', $note ); } # set modified location(s) for the new feature and # add its subfeatures if any my $loc = $self->_single_loc_object_from_collection(@loc); $loc ? $newfeat->location($loc) : return; $newfeat->add_SeqFeature($_) for @adjsubfeat; return $newfeat; } =head2 _coord_adjust_insertion title : _coord_adjust_insertion function: recursively adjusts coordinates of seqfeatures on a molecule where another sequence has been inserted. (sub)features that span the insertion site become split features and a note is added about the size and positin of the insertion. 
Features with an IN-BETWEEN location at the insertion site are lost (such features can only exist between adjacent bases) usage : my $adjusted_feature = Bio::Sequtils::_coord_adjust_insertion( $feature, $insert_pos, $insert_length ); args : a Bio::SeqFeatureI compliant object, insertion position (insert to the right of this position) length of inserted fragment returns : a Bio::SeqFeatureI compliant object =cut sub _coord_adjust_insertion { my ( $self, $feat, $insert_pos, $insert_len ) = @_; $self->throw( 'object [$feat] ' . 'of class [' . ref($feat) . '] should be a Bio::SeqFeatureI ' ) unless $feat->isa('Bio::SeqFeatureI'); $self->throw('missing insert position') unless defined $insert_pos; $self->throw('missing insert length') unless defined $insert_len; my @adjsubfeat; for my $subfeat ( $feat->get_SeqFeatures ) { push @adjsubfeat, $self->_coord_adjust_insertion( $subfeat, $insert_pos, $insert_len ); } my @loc; my $note; for ( $feat->location->each_Location ) { # loose IN-BETWEEN features at the insertion site next if ( $_->location_type eq 'IN-BETWEEN' && $_->start == $insert_pos ); my $strand = $_->strand; my $type = $_->location_type; my $start = $_->start; my $start_type = $_->can('start_pos_type') ? $_->start_pos_type : undef; my $end = $_->end; my $end_type = $_->can('end_pos_type') ? $_->end_pos_type : undef; my @newcoords = (); if ( $start <= $insert_pos && $end > $insert_pos ) { # split the feature @newcoords = ( [ $start, $insert_pos, $start_type, $end_type ], [ ( $insert_pos + 1 + $insert_len ), $end + $insert_len, $start_type, $end_type ] ); $note = $insert_len . 'bp internal insertion between pos ' . $insert_pos . ' and ' . 
( $insert_pos + $insert_len + 1 ); } elsif ( $start > $insert_pos ) { # just shift entire location @newcoords = ( [ $start + $insert_len, $end + $insert_len, $start_type, $end_type ] ); } else { # not affected @newcoords = ( [ $start, $end, $start_type, $end_type ] ); } # if we have deleted all coordinates, return nothing # (possible if all locations are IN-BETWEEN) return unless @newcoords; my @subloc = $self->_location_objects_from_coordinate_list( \@newcoords, $strand, $type ); # put together final location which could be a split now push @loc, $self->_single_loc_object_from_collection(@subloc); } # each location # create new feature based on original one and move annotation across my $newfeat = Bio::SeqFeature::Generic->new( -primary => $feat->primary_tag ); foreach my $key ( $feat->annotation->get_all_annotation_keys() ) { foreach my $value ( $feat->annotation->get_Annotations($key) ) { $newfeat->annotation->add_Annotation( $key, $value ); } } foreach my $key ( $feat->get_all_tags() ) { $newfeat->add_tag_value( $key, $feat->get_tag_values($key) ); } # If we have a note about the inserted bases, add it if ($note) { $newfeat->add_tag_value( 'note', $note ); } # set modified location(s) for the new feature and # add its subfeatures if any my $loc = $self->_single_loc_object_from_collection(@loc); $loc ? $newfeat->location($loc) : return; $newfeat->add_SeqFeature($_) for @adjsubfeat; return $newfeat; } =head2 _single_loc_object_from_collection Title : _single_loc_object_from_collection Function: takes an array of location objects. 
            Returns either a split location object if there is more than one
            location in the array or returns the single location if there is
            only one
  Usage   : my $loc = $self->_single_loc_object_from_collection( @sublocs );
  Args    : array of Bio::Location objects
  Returns : a single Bio::Location object containing all locations

=cut

sub _single_loc_object_from_collection {
    my ( $self, @locs ) = @_;
    my $loc;
    if ( @locs > 1 ) {
        $loc = Bio::Location::Split->new;
        $loc->add_sub_Location(@locs);
    }
    elsif ( @locs == 1 ) {
        $loc = shift @locs;
    }
    return $loc;
}    # _single_loc_object_from_collection

=head2 _location_objects_from_coordinate_list

  Title   : _location_objects_from_coordinate_list
  Function: takes an array-ref of start/end coordinates, a strand and a
            type and returns a list of Bio::Location objects (Fuzzy by
            default, Simple in case of in-between coordinates).
            If location type is not "IN-BETWEEN", individual types may be
            passed in for start and end location as per Bio::Location::Fuzzy
            documentation.
  Usage   : my @loc_objs = $self->_location_objects_from_coordinate_list(
                             \@coords,
                             $strand,
                             $type
            );
  Args    : array-ref of array-refs each containing:
              start, end [, start-type, end-type]
            where types are optional. If given, each must be one of
            ('BEFORE', 'AFTER', 'EXACT', 'WITHIN', 'BETWEEN')
            strand (all locations must be on same strand)
            location-type (EXACT, IN-BETWEEN etc)
  Returns : list of Bio::Location objects

=cut

sub _location_objects_from_coordinate_list {
    my $self = shift;
    my ( $coords_ref, $strand, $type ) = @_;
    $self->throw( 'expected 3 parameters but got ' .
@_ ) unless @_ == 3; $self->throw('first argument must be an ARRAY reference#') unless ref($coords_ref) eq 'ARRAY'; my @loc; foreach my $coords_set (@$coords_ref) { my ( $start, $end, $start_type, $end_type ) = @$coords_set; # taken from Bio::SeqUtils::_coord_adjust if ( $type ne 'IN-BETWEEN' ) { my $loc = Bio::Location::Fuzzy->new( -start => $start, -end => $end, -strand => $strand, -location_type => $type ); $loc->start_pos_type($start_type) if $start_type; $loc->end_pos_type($end_type) if $end_type; push @loc, $loc; } else { push @loc, Bio::Location::Simple->new( -start => $start, -end => $end, -strand => $strand, -location_type => $type ); } } # each coords_set return @loc; } # _location_objects_from_coordinate_list =head2 _new_seq_via_clone Title : _new_seq_via_clone Function: clone a sequence object using Bio::Root::Root::clone and set the new sequence string sequence features are removed. Usage : my $new_seq = $self->_new_seq_via_clone( $seq_obj, $seq_str ); Args : original seq object [, new sequence string] Returns : a clone of the original sequence object, optionally with new sequence string =cut sub _new_seq_via_clone { my ( $self, $in_seq_obj, $seq_str ) = @_; my $out_seq_obj = $in_seq_obj->clone; $out_seq_obj->remove_SeqFeatures if $out_seq_obj->can('remove_SeqFeatures'); if ( blessed $out_seq_obj->seq && $out_seq_obj->seq->isa('Bio::PrimarySeq') ) { $out_seq_obj->seq->seq($seq_str); } else { $out_seq_obj->seq($seq_str); } return $out_seq_obj; } # _new_seq_via_clone =head2 _new_seq_from_old Title : _new_seq_from_old Function: creates a new sequence obejct, if possible of the same class as the old and adds attributes to it. Also copies annotation across to the new object. Usage : my $new_seq = $self->_new_seq_from_old( $seq_obj, { seq => $seq_str, display_id => 'some_ID'}); Args : old sequence object hashref of attributes for the new sequence (sequence string etc.) 
  Returns : a new Bio::Seq object

=cut

sub _new_seq_from_old {
    my ( $self, $in_seq_obj, $attr ) = @_;
    $self->throw('attributes must be a hashref')
      if $attr && ref($attr) ne 'HASH';

    my $seqclass;
    if ( $in_seq_obj->can_call_new ) {
        $seqclass = ref($in_seq_obj);
    }
    else {
        # fall back to the generic sequence class; Perl package names are
        # case-sensitive, so this must be spelled Bio::PrimarySeq
        $seqclass = 'Bio::PrimarySeq';
        $self->_attempt_to_load_seq;
    }

    my $out_seq_obj = $seqclass->new(
        -seq        => $attr->{seq}        || $in_seq_obj->seq,
        -display_id => $attr->{display_id} || $in_seq_obj->display_id,
        -accession_number => $attr->{accession_number}
          || $in_seq_obj->accession_number
          || '',
        -alphabet    => $in_seq_obj->alphabet,
        -desc        => $attr->{desc}    || $in_seq_obj->desc,
        -verbose     => $attr->{verbose} || $in_seq_obj->verbose,
        -is_circular => $attr->{is_circular}
          || $in_seq_obj->is_circular
          || 0,
    );

    # move the annotation across to the product
    if (   $out_seq_obj->isa("Bio::AnnotatableI")
        && $in_seq_obj->isa("Bio::AnnotatableI") )
    {
        foreach my $key ( $in_seq_obj->annotation->get_all_annotation_keys ) {
            foreach my $value ( $in_seq_obj->annotation->get_Annotations($key) )
            {
                $out_seq_obj->annotation->add_Annotation( $key, $value );
            }
        }
    }
    return $out_seq_obj;
}    # _new_seq_from_old

=head2 _coord_adjust

  Title   : _coord_adjust
  Usage   : my $newfeat=Bio::SeqUtils->_coord_adjust($feature, 100, $seq->length);
  Function: Recursive subroutine to adjust the coordinates of a feature
            and all its subfeatures. If a sequence length is specified, then
            any adjusted features that have locations beyond the boundaries of
            the sequence are converted to Bio::Location::Fuzzy objects.
  Returns : A Bio::SeqFeatureI compliant object.
  Args    : A Bio::SeqFeatureI compliant object,
            the number of bases to add to the coordinates
            (optional) the length of the parent sequence

=cut

sub _coord_adjust {
    my ( $self, $feat, $add, $length ) = @_;
    $self->throw( 'Object [$feat] '
          . 'of class ['
          . ref($feat) .
'] should be a Bio::SeqFeatureI ' ) unless $feat->isa('Bio::SeqFeatureI'); my @adjsubfeat; for my $subfeat ( $feat->get_SeqFeatures ) { push @adjsubfeat, $self->_coord_adjust( $subfeat, $add, $length ); } my @loc; for ( $feat->location->each_Location ) { my @coords = ( $_->start, $_->end ); my $strand = $_->strand; my $type = $_->location_type; foreach (@coords) { $self->throw("can not handle negative feature positions (got: $_)") if $_ < 0; if ( $add + $_ < 1 ) { $_ = '<1'; } elsif ( defined $length and $add + $_ > $length ) { $_ = ">$length"; } else { $_ = $add + $_; } } push @loc, $self->_location_objects_from_coordinate_list( [ \@coords ], $strand, $type ); } my $newfeat = Bio::SeqFeature::Generic->new( -primary => $feat->primary_tag ); foreach my $key ( $feat->annotation->get_all_annotation_keys() ) { foreach my $value ( $feat->annotation->get_Annotations($key) ) { $newfeat->annotation->add_Annotation( $key, $value ); } } foreach my $key ( $feat->get_all_tags() ) { $newfeat->add_tag_value( $key, $feat->get_tag_values($key) ); } my $loc = $self->_single_loc_object_from_collection(@loc); $loc ? $newfeat->location($loc) : return; $newfeat->add_SeqFeature($_) for @adjsubfeat; return $newfeat; } =head2 revcom_with_features Title : revcom_with_features Usage : $revcom=Bio::SeqUtils->revcom_with_features($seq); Function: Like Bio::Seq::revcom, but keeps features (adjusting coordinates as appropriate. Returns : A new sequence object Args : A sequence object =cut sub revcom_with_features { my ( $self, $seq ) = @_; $self->throw( 'Object [$seq] ' . 'of class [' . ref($seq) . 
'] should be a Bio::SeqI ' ) unless $seq->isa('Bio::SeqI'); my $revcom = $seq->revcom; # make sure that there is no annotation or features in $trunc # (->revcom() now clone objects except for Bio::Seq::LargePrimarySeq) $revcom->annotation->remove_Annotations; $revcom->remove_SeqFeatures; #move annotations foreach my $key ( $seq->annotation->get_all_annotation_keys() ) { foreach my $value ( $seq->annotation->get_Annotations($key) ) { $revcom->annotation->add_Annotation( $key, $value ); } } #move features for ( map { $self->_feature_revcom( $_, $seq->length ) } reverse $seq->get_SeqFeatures ) { $revcom->add_SeqFeature($_); } return $revcom; } =head2 _feature_revcom Title : _feature_revcom Usage : my $newfeat=Bio::SeqUtils->_feature_revcom($feature, $seq->length); Function: Recursive subroutine to reverse complement a feature and all its subfeatures. The length of the parent sequence must be specified. Returns : A Bio::SeqFeatureI compliant object. Args : A Bio::SeqFeatureI compliant object, the length of the parent sequence =cut sub _feature_revcom { my ( $self, $feat, $length ) = @_; $self->throw( 'Object [$feat] ' . 'of class [' . ref($feat) . 
          '] should be a Bio::SeqFeatureI ' )
      unless $feat->isa('Bio::SeqFeatureI');
    my @adjsubfeat;
    for my $subfeat ( $feat->get_SeqFeatures ) {
        push @adjsubfeat, $self->_feature_revcom( $subfeat, $length );
    }
    my @loc;
    for ( $feat->location->each_Location ) {
        my $type = $_->location_type;
        my $strand;
        if    ( $_->strand == -1 ) { $strand = 1 }
        elsif ( $_->strand == 1 )  { $strand = -1 }
        else                       { $strand = $_->strand }
        my $newend =
          $self->_coord_revcom( $_->start, $_->start_pos_type, $length );
        my $newstart =
          $self->_coord_revcom( $_->end, $_->end_pos_type, $length );
        my $newstart_type = $_->end_pos_type;
        $newstart_type = 'BEFORE' if $_->end_pos_type eq 'AFTER';
        $newstart_type = 'AFTER'  if $_->end_pos_type eq 'BEFORE';
        my $newend_type = $_->start_pos_type;
        $newend_type = 'BEFORE' if $_->start_pos_type eq 'AFTER';
        $newend_type = 'AFTER'  if $_->start_pos_type eq 'BEFORE';
        push @loc,
          $self->_location_objects_from_coordinate_list(
            [ [ $newstart, $newend, $newstart_type, $newend_type ] ],
            $strand, $type );
    }
    my $newfeat =
      Bio::SeqFeature::Generic->new( -primary => $feat->primary_tag );
    foreach my $key ( $feat->annotation->get_all_annotation_keys() ) {
        foreach my $value ( $feat->annotation->get_Annotations($key) ) {
            $newfeat->annotation->add_Annotation( $key, $value );
        }
    }
    foreach my $key ( $feat->get_all_tags() ) {
        $newfeat->add_tag_value( $key, $feat->get_tag_values($key) );
    }

    my $loc = $self->_single_loc_object_from_collection(@loc);
    $loc ? $newfeat->location($loc) : return;

    $newfeat->add_SeqFeature($_) for @adjsubfeat;
    return $newfeat;
}

sub _coord_revcom {
    my ( $self, $coord, $type, $length ) = @_;
    if ( $type eq 'BETWEEN' or $type eq 'WITHIN' ) {
        $coord =~ s/(\d+)(\D*)(\d+)/$length+1-$3.$2.$length+1-$1/ge;
    }
    else {
        $coord =~ s/(\d+)/$length+1-$1/ge;

        # reverse-complementing swaps the "before" and "after" markers
        $coord =~ tr/<>/></;
        $coord = '>' . $coord
          if $type eq 'BEFORE' and substr( $coord, 0, 1 ) ne '>';
        $coord = '<' .
$coord if $type eq 'AFTER' and substr( $coord, 0, 1 ) ne '<'; } return $coord; } =head2 evolve Title : evolve Usage : my $newseq = Bio::SeqUtils-> evolve($seq, $similarity, $transition_transversion_rate); Function: Mutates the sequence by point mutations until the similarity of the new sequence has decreased to the required level. Transition/transversion rate is adjustable. Returns : A new Bio::PrimarySeq object Args : sequence object percentage similarity (e.g. 80) tr/tv rate, optional, defaults to 1 (= 1:1) Set the verbosity of the Bio::SeqUtils object to positive integer to see the mutations as they happen. This method works only on nucleotide sequences. It prints a warning if you set the target similarity to be less than 25%. Transition/transversion ratio is an observed attribute of an sequence comparison. We are dealing here with the transition/transversion rate that we set for our model of sequence evolution. =cut sub evolve { my ( $self, $seq, $sim, $rate ) = @_; $rate ||= 1; $self->throw( 'Object [$seq] ' . 'of class [' . ref($seq) . '] should be a Bio::PrimarySeqI ' ) unless $seq->isa('Bio::PrimarySeqI'); $self->throw( "[$sim] " . ' should be a positive integer or float under 100' ) unless $sim =~ /^[+\d.]+$/ and $sim <= 100; $self->warn( "Nucleotide sequences are 25% similar by chance. Do you really want to set similarity to [$sim]%?\n" ) unless $sim > 25; $self->throw('Only nucleotide sequences are supported') if $seq->alphabet eq 'protein'; # arrays of possible changes have transitions as first items my %changes; $changes{'a'} = [ 't', 'c', 'g' ]; $changes{'t'} = [ 'a', 'c', 'g' ]; $changes{'c'} = [ 'g', 'a', 't' ]; $changes{'g'} = [ 'c', 'a', 't' ]; # given the desired rate, find out where cut off points need to be # when random numbers are generated from 0 to 100 # we are ignoring identical mutations (e.g. 
A->A) to speed things up
    my $bin_size           = 100 / ( $rate + 2 );
    my $transition         = 100 - ( 2 * $bin_size );
    my $first_transversion = $transition + $bin_size;

    # unify the look of sequence strings
    my $string = lc $seq->seq;    # lower case
    $string =~ s/u/t/g;           # simplify our life; modules should deal
                                  # with the change anyway

    # store the original sequence string
    my $oristring = $string;
    my $length    = $seq->length;

    # stop evolving if the limit has been reached
    until ( $self->_get_similarity( $oristring, $string ) <= $sim ) {

        # find the location in the string to change
        my $loc = int( rand $length ) + 1;

        # nucleotide to change
        my $oldnuc = substr $string, $loc - 1, 1;
        my $newnuc;

        # nucleotide it is changed to
        my $choose = rand(100);
        if ( $choose < $transition ) {
            $newnuc = $changes{$oldnuc}[0];
        }
        elsif ( $choose < $first_transversion ) {
            $newnuc = $changes{$oldnuc}[1];
        }
        else {
            $newnuc = $changes{$oldnuc}[2];
        }

        # do the change
        substr $string, $loc - 1, 1, $newnuc;
        $self->debug("$loc$oldnuc>$newnuc\n");
    }

    return Bio::PrimarySeq->new(
        -id          => $seq->id . "-$sim",
        -description => $seq->description,
        -seq         => $string
    );
}

sub _get_similarity {
    my ( $self, $oriseq, $seq ) = @_;
    my $len = length($oriseq);
    my $c   = 0;    # number of identical positions
    for ( my $i = 0 ; $i < $len ; $i++ ) {
        $c++ if substr( $oriseq, $i, 1 ) eq substr( $seq, $i, 1 );
    }
    return 100 * $c / $len;
}

1;

# BioPerl-1.7.8/lib/Bio/WebAgent.pm
#
# BioPerl module for Bio::WebAgent
#
# Please direct questions and support issues to <bioperl-l@bioperl.org>
#
# Cared for by Heikki Lehvaslaiho, heikki-at-bioperl-dot-org
# For copyright and disclaimer see below.
#

# POD documentation - main docs before the code

=head1 NAME

Bio::WebAgent - A base class for Web (any protocol) access

=head1 SYNOPSIS

  # This is an abstract superclass for bioperl modules accessing web
  # resources - normally you do not instantiate it but one of its
  # subclasses.
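
Although the class is normally subclassed, a minimal sketch of direct use may help show how the constructor arguments map onto methods. It assumes BioPerl and LWP are installed; the URL is a placeholder and C<fetch_politely> is a hypothetical helper name, not part of Bio::WebAgent. The constructor strips the leading dash from each argument name and calls the method of that name if the object has one, so C<-delay> and C<-url> end up in delay() and url().

```perl
use strict;
use warnings;
use Bio::WebAgent;

# -delay and -url are passed on to the delay() and url() methods
my $agent = Bio::WebAgent->new(
    -delay => 5,
    -url   => 'http://www.example.org/',    # placeholder URL (assumption)
);

printf "delay: %d s, url: %s\n", $agent->delay, $agent->url;

# Hypothetical helper: throttle first, then fetch with the get() method
# inherited from LWP::UserAgent.
sub fetch_politely {
    my ($ua) = @_;
    $ua->sleep;    # waits only if the previous call was too recent
    return $ua->get( $ua->url );
}

# Run the script with any argument to actually perform the request.
print fetch_politely($agent)->status_line, "\n" if @ARGV;
```

The network call is kept behind a command-line switch so that the sketch can be run offline to inspect the accessor values.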
=head1 DESCRIPTION

This abstract superclass is a subclass of L<LWP::UserAgent> which
allows protocol independent access of remote locations over
the Net.

It takes care of error handling, proxies and various net protocols.
BioPerl classes accessing the net should inherit from it. For
details, see L<LWP::UserAgent>.

The interface is still evolving. For now, two public methods have been
copied from Bio::DB::WebDBSeqI: delay() and delay_policy(). These are
used to prevent overwhelming the server with rapidly repeated requests.
Ideally there should be a common abstract superclass with these. See
L</delay>.

=head1 SEE ALSO

L<LWP::UserAgent>,
L<Bio::Root::Root>

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR

Heikki Lehvaslaiho, heikki-at-bioperl-dot-org

=head1 COPYRIGHT

Copyright (c) 2003, Heikki Lehvaslaiho and EMBL-EBI.
All Rights Reserved.

This module is free software; you can redistribute it and/or modify
it under the same terms as Perl itself.

=head1 DISCLAIMER

This software is provided "as is" without warranty of any kind.

=head1 APPENDIX

The rest of the documentation details each of the object
methods. Internal methods are usually preceded with a _

=cut

# Let the code begin...
package Bio::WebAgent;
$Bio::WebAgent::VERSION = '1.7.8';
use vars qw($LAST_INVOCATION_TIME);
use strict;

use base qw(LWP::UserAgent Bio::Root::Root);

sub new {
    my $class = shift;

    # We make env_proxy the default here, but it can be
    # over-ridden by $self->env_proxy later,
    # or by new(env_proxy=>0) at constructor time

    my $self = $class->SUPER::new(env_proxy => 1);

    while( @_ ) {
        my $key = shift;
        $key =~ s/^-//;
        my $value = shift;
        $self->can($key) || next;
        $self->$key($value);
    }

    return $self; # success - we hope!
}

# -----------------------------------------------------------------------------

=head2 url

 Usage   : $agent->url
 Returns : URL to reach out to Net
 Args    : string

=cut

sub url {
    my ($self,$value) = @_;
    if( defined $value) {
        $self->{'_url'} = $value;
    }
    return $self->{'_url'};
}

=head2 delay

 Title   : delay
 Usage   : $secs = $self->delay([$secs])
 Function: get/set number of seconds to delay between fetches
 Returns : number of seconds to delay
 Args    : new value

NOTE: the default is to use the value specified by delay_policy().
This can be overridden by calling this method, or by passing the
-delay argument to new().

=cut

sub delay {
    my ($self, $value) = @_;
    if ($value) {
        $self->throw("Need a positive integer, not [$value]")
            unless $value >= 0;
        $self->{'_delay'} = int $value;
    }
    return $self->{'_delay'} || $self->delay_policy;
}

=head2 delay_policy

 Title   : delay_policy
 Usage   : $secs = $self->delay_policy
 Function: return number of seconds to delay between calls to remote db
 Returns : number of seconds to delay
 Args    : none

NOTE: The default delay policy is 3s. Override in subclasses to
implement other delays. The timer has only second resolution, so the delay
will actually be +/- 1s.
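
Overriding the policy in a subclass could look like the following sketch. It assumes BioPerl and LWP are installed; the package name C<My::SlowAgent> and the 10 second value are made up for illustration. An explicit call to delay() still takes precedence over the policy, but as delay() is written a value of 0 is ignored, so the policy value applies in that case.

```perl
package My::SlowAgent;    # hypothetical subclass name
use strict;
use warnings;
use base 'Bio::WebAgent';

# Wait 10 seconds between calls instead of the default 3.
sub delay_policy { return 10 }

package main;

my $agent = My::SlowAgent->new;
print $agent->delay, "\n";    # 10 (no explicit delay set, policy applies)

$agent->delay(2);
print $agent->delay, "\n";    # 2 (explicit value overrides the policy)

$agent->delay(0);
print $agent->delay, "\n";    # 2 (a zero value is not stored)
```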
=cut sub delay_policy { my $self = shift; return 3; } =head2 sleep Title : sleep Usage : $self->sleep Function: sleep for a number of seconds indicated by the delay policy Returns : none Args : none NOTE: This method keeps track of the last time it was called and only imposes a sleep if it was called more recently than the delay_policy() allows. =cut sub sleep { my $self = shift; $LAST_INVOCATION_TIME ||= 0; if (time - $LAST_INVOCATION_TIME < $self->delay) { my $delay = $self->delay - (time - $LAST_INVOCATION_TIME); $self->debug("sleeping for $delay seconds\n"); sleep $delay; } $LAST_INVOCATION_TIME = time; } 1; __END__ SearchIO.pm100644000766000024 5355314006427353 16525 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio# # BioPerl module for Bio::SearchIO # # Please direct questions and support issues to # # Cared for by Jason Stajich # # Copyright Jason Stajich # # You may distribute this module under the same terms as perl itself # POD documentation - main docs before the code =head1 NAME Bio::SearchIO - Driver for parsing Sequence Database Searches (BLAST, FASTA, ...) =head1 SYNOPSIS use Bio::SearchIO; # format can be 'fasta', 'blast', 'exonerate', ... my $searchio = Bio::SearchIO->new( -format => 'blastxml', -file => 'blastout.xml' ); while ( my $result = $searchio->next_result() ) { while( my $hit = $result->next_hit ) { # process the Bio::Search::Hit::HitI object while( my $hsp = $hit->next_hsp ) { # process the Bio::Search::HSP::HSPI object } } } =head1 DESCRIPTION This is a driver for instantiating a parser for report files from sequence database searches. This object serves as a wrapper for the format parsers in Bio::SearchIO::* - you should not need to ever use those format parsers directly. (For people used to the SeqIO system it, we are deliberately using the same pattern). Once you get a SearchIO object, calling next_result() gives you back a L compliant object, which is an object that represents one Blast/Fasta/HMMER whatever report. 
A list of module names and formats is below: blast BLAST (WUBLAST, NCBIBLAST,bl2seq) fasta FASTA -m9 and -m0 blasttable BLAST -m9 or -m8 output (both NCBI and WUBLAST tabular) megablast MEGABLAST psl UCSC PSL format waba WABA output axt AXT format sim4 Sim4 hmmer HMMER2 hmmpfam and hmmsearch or HMMER3 hmmscan and hmmsearch exonerate Exonerate CIGAR and VULGAR format blastxml NCBI BLAST XML wise Genewise -genesf format Also see the SearchIO HOWTO: http://bioperl.org/howtos/SearchIO_HOWTO.html =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Support Please direct usage questions or support issues to the mailing list: I rather than to the module maintainer directly. Many experienced and reponsive experts will be able look at the problem and quickly address it. Please include a thorough description of the problem with code and data examples if at all possible. =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR - Jason Stajich & Steve Chervitz Email jason-at-bioperl.org Email sac-at-bioperl.org =head1 APPENDIX The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ =cut # Let the code begin... package Bio::SearchIO; $Bio::SearchIO::VERSION = '1.7.8'; use strict; use warnings; # Object preamble - inherits from Bio::Root::IO use Bio::SearchIO::SearchResultEventBuilder; # Special exception class for exceptions during parsing. # End users should not ever see these. # For an example of usage, see blast.pm. 
@Bio::SearchIO::InternalParserError::ISA = qw(Bio::Root::Exception); use Symbol; use base qw(Bio::Root::IO Bio::Event::EventGeneratorI Bio::AnalysisParserI); =head2 new Title : new Usage : my $obj = Bio::SearchIO->new(); Function: Builds a new Bio::SearchIO object Returns : Bio::SearchIO initialized with the correct format Args : -file => $filename -format => format -fh => filehandle to attach to -result_factory => object implementing Bio::Factory::ObjectFactoryI -hit_factory => object implementing Bio::Factory::ObjectFactoryI -hsp_factory => object implementing Bio::Factory::ObjectFactoryI -writer => object implementing Bio::SearchIO::SearchWriterI -output_format => output format, which will dynamically load writer -inclusion_threshold => e-value threshold for inclusion in the PSI-BLAST score matrix model -signif => float or scientific notation number to be used as a P- or Expect value cutoff -check_all_hits => boolean. Check all hits for significance against significance criteria. Default = false. If false, stops processing hits after the first non-significant hit or the first hit that fails the hit_filter call. This speeds parsing, taking advantage of the fact that the hits are processed in the order they appear in the report. -min_query_len => integer to be used as a minimum for query sequence length. Reports with query sequences below this length will not be processed. default = no minimum length. -best => boolean. Only process the best hit of each report; default = false. See L, L Any factory objects in the arguments are passed along to the SearchResultEventBuilder object which holds these factories and sets default ones if none are supplied as arguments. =cut # TODO: The below don't seem to be implemented (e.g. in Bio::SearchIO::blast) # # -score => integer or scientific notation number to be used # as a blast score value cutoff # -bits => integer or scientific notation number to be used # as a bit score value cutoff # -overlap => integer. 
The amount of overlap to permit between # adjacent HSPs when tiling HSPs. A reasonable value is 2. # default = $Bio::SearchIO::blast::MAX_HSP_OVERLAP. sub new { my($caller,@args) = @_; my $class = ref($caller) || $caller; # or do we want to call SUPER on an object if $caller is an # object? if( $class =~ /Bio::SearchIO::(\S+)/ ) { my ($self) = $class->SUPER::new(@args); $self->_initialize(@args); return $self; } else { my %param = @args; @param{ map { lc $_ } keys %param } = values %param; # lowercase keys my $format = $param{'-format'} || $class->_guess_format( $param{'-file'} || $ARGV[0] ) || 'blast'; my $output_format = $param{'-output_format'}; my $writer = undef; if( defined $output_format ) { if( defined $param{'-writer'} ) { my $dummy = Bio::Root::Root->new(); $dummy->throw("Both writer and output format specified - not good"); } if( $output_format =~ /^blast$/i ) { $output_format = 'TextResultWriter'; } my $output_module = "Bio::SearchIO::Writer::".$output_format; $class->_load_module($output_module); $writer = $output_module->new(@args); push(@args,"-writer",$writer); } # normalize capitalization to lower case $format = "\L$format"; return unless( $class->_load_format_module($format) ); return "Bio::SearchIO::${format}"->new(@args); } } sub _initialize { my($self, @args) = @_; $self->{'_handler'} = undef; # not really necessary unless we put more in RootI #$self->SUPER::_initialize(@args); # initialize the IO part $self->_initialize_io(@args); $self->attach_EventHandler(Bio::SearchIO::SearchResultEventBuilder->new(@args)); $self->{'_reporttype'} = ''; $self->{_notfirsttime} = 0; my ($min_qlen, $check_all, $overlap, $best, $it, $writer ) = $self->_rearrange([qw( MIN_LENGTH CHECK_ALL_HITS OVERLAP BEST INCLUSION_THRESHOLD WRITER)], @args); # note: $overlap isn't used for some reason $writer && $self->writer( $writer ); defined $it && $self->inclusion_threshold($it); defined $min_qlen && $self->min_query_length($min_qlen); defined $best && 
$self->best_hit_only($best);
    defined $check_all && $self->check_all_hits($check_all);
}

=head2 newFh

 Title   : newFh
 Usage   : $fh = Bio::SearchIO->newFh(-file=>$filename,
                                      -format=>'Format')
 Function: does a new() followed by an fh()
 Example : $fh = Bio::SearchIO->newFh(-file=>$filename,
                                      -format=>'Format')
           $result = <$fh>;   # read a ResultI object
           print $fh $result; # write a ResultI object
 Returns : filehandle tied to the Bio::SearchIO::Fh class
 Args    :

=cut

sub newFh {
    my $class = shift;
    return unless my $self = $class->new(@_);
    return $self->fh;
}

=head2 fh

 Title   : fh
 Usage   : $obj->fh
 Function:
 Example : $fh = $obj->fh;      # make a tied filehandle
           $result = <$fh>;     # read a ResultI object
           print $fh $result;   # write a ResultI object
 Returns : filehandle tied to the Bio::SearchIO::Fh class
 Args    :

=cut

sub fh {
    my $self = shift;
    my $class = ref($self) || $self;
    my $s = Symbol::gensym;
    tie $$s,$class,$self;
    return $s;
}

=head2 format

 Title   : format
 Usage   : $format = $obj->format()
 Function: Get the search format
 Returns : search format
 Args    : none

=cut

# format() method inherited from Bio::Root::IO

=head2 attach_EventHandler

 Title   : attach_EventHandler
 Usage   : $parser->attach_EventHandler($handler)
 Function: Adds an event handler to listen for events
 Returns : none
 Args    : Bio::SearchIO::EventHandlerI

See L<Bio::SearchIO::EventHandlerI>

=cut

sub attach_EventHandler{
    my ($self,$handler) = @_;
    return if( ! $handler );
    if( ! $handler->isa('Bio::SearchIO::EventHandlerI') ) {
        $self->warn("Ignoring request to attach handler ".ref($handler). ' because it is not a Bio::SearchIO::EventHandlerI');
    }
    $self->{'_handler'} = $handler;
    return;
}

=head2 _eventHandler

 Title   : _eventHandler
 Usage   : private
 Function: Get the EventHandler
 Returns : Bio::SearchIO::EventHandlerI
 Args    : none

See L<Bio::SearchIO::EventHandlerI>

=cut

sub _eventHandler{
    my ($self) = @_;
    return $self->{'_handler'};
}

=head2 next_result

 Title   : next_result
 Usage   : $result = $stream->next_result
 Function: Reads the next ResultI object from the stream and returns it.
Certain driver modules may encounter entries in the stream that are either misformatted or that use syntax not yet understood by the driver. If such an incident is recoverable, e.g., by dismissing a feature of a feature table or some other non-mandatory part of an entry, the driver will issue a warning. In the case of a non-recoverable situation an exception will be thrown. Do not assume that you can resume parsing the same stream after catching the exception. Note that you can always turn recoverable errors into exceptions by calling $stream->verbose(2) (see Bio::Root::RootI POD page). Returns : A Bio::Search::Result::ResultI object Args : n/a See L =cut sub next_result { my ($self) = @_; $self->throw_not_implemented; } =head2 write_result Title : write_result Usage : $stream->write_result($result_result, @other_args) Function: Writes data from the $result_result object into the stream. : Delegates to the to_string() method of the associated : WriterI object. Returns : 1 for success and 0 for error Args : Bio::Search:Result::ResultI object, : plus any other arguments for the Writer Throws : Bio::Root::Exception if a Writer has not been set. See L =cut sub write_result { my ($self, $result, @args) = @_; if( not ref($self->{'_result_writer'}) ) { $self->throw("ResultWriter not defined."); } @args = $self->{'_notfirsttime'} unless( @args ); my $str = $self->writer->to_string( $result, @args); $self->{'_notfirsttime'} = 1; $self->_print( "$str" ) if defined $str; $self->flush if $self->_flush_on_write && defined $self->_fh; return 1; } =head2 write_report Title : write_report Usage : $stream->write_report(SearchIO stream, @other_args) Function: Writes data directly from the SearchIO stream object into the : writer. This is mainly useful if one has multiple ResultI objects : in a SearchIO stream and you don't want to reiterate header/footer : between each call. 
Returns : 1 for success and 0 for error Args : Bio::SearchIO stream object, : plus any other arguments for the Writer Throws : Bio::Root::Exception if a Writer has not been set. See L =cut sub write_report { my ($self, $result, @args) = @_; if( not ref($self->{'_result_writer'}) ) { $self->throw("ResultWriter not defined."); } @args = $self->{'_notfirsttime'} unless( @args ); my $str = $self->writer->to_string( $result, @args); $self->{'_notfirsttime'} = 1; $self->_print( "$str" ) if defined $str; $self->flush if $self->_flush_on_write && defined $self->_fh; return 1; } =head2 writer Title : writer Usage : $writer = $stream->writer; Function: Sets/Gets a SearchWriterI object to be used for this searchIO. Returns : 1 for success and 0 for error Args : Bio::SearchIO::SearchWriterI object (when setting) Throws : Bio::Root::Exception if a non-Bio::SearchIO::SearchWriterI object is passed in. =cut sub writer { my ($self, $writer) = @_; if( ref($writer) and $writer->isa( 'Bio::SearchIO::SearchWriterI' )) { $self->{'_result_writer'} = $writer; } elsif( defined $writer ) { $self->throw("Can't set ResultWriter. Not a Bio::SearchIO::SearchWriterI: $writer"); } return $self->{'_result_writer'}; } =head2 result_count Title : result_count Usage : $num = $stream->result_count; Function: Gets the number of Blast results that have been successfully parsed at the point of the method call. This is not the total # of results in the file. Returns : integer Args : none Throws : none =cut sub result_count { my $self = shift; $self->throw_not_implemented; } =head2 inclusion_threshold Title : inclusion_threshold Usage : my $incl_thresh = $isreb->inclusion_threshold; : $isreb->inclusion_threshold(1e-5); Function: Get/Set the e-value threshold for inclusion in the PSI-BLAST score matrix model (blastpgp) that was used for generating the reports being parsed. 
Returns : number (real) Default value: $Bio::SearchIO::IteratedSearchResultEventBuilder::DEFAULT_INCLUSION_THRESHOLD Args : number (real) (e.g., 0.0001 or 1e-4 ) =cut # Delegates to the event handler. sub inclusion_threshold { shift->_eventHandler->inclusion_threshold(@_); } =head2 max_significance Usage : $obj->max_significance(); Purpose : Set/Get the P or Expect value used as significance screening cutoff. This is the value of the -signif parameter supplied to new(). Hits with P or E-value above this are skipped. Returns : Scientific notation number with this format: 1.0e-05. Argument : Scientific notation number or float (when setting) Comments : Screening of significant hits uses the data provided on the : description line. For NCBI BLAST1 and WU-BLAST, this data : is P-value. for NCBI BLAST2 it is an Expect value. =cut sub max_significance { shift->{'_handler_cache'}->max_significance(@_) } =head2 signif Synonym for L =cut sub signif { shift->max_significance(@_) } =head2 min_score Usage : $obj->min_score(); Purpose : Set/Get the Blast score used as screening cutoff. This is the value of the -score parameter supplied to new(). Hits with scores below this are skipped. Returns : Integer or scientific notation number. Argument : Integer or scientific notation number (when setting) Comments : Screening of significant hits uses the data provided on the : description line. =cut sub min_score { shift->{'_handler_cache'}->min_score(@_) } =head2 min_query_length Usage : $obj->min_query_length(); Purpose : Gets the query sequence length used as screening criteria. This is the value of the -min_query_len parameter supplied to new(). Hits with sequence length below this are skipped. Returns : Integer Argument : n/a =cut sub min_query_length { my $self = shift; if (@_) { my $min_qlen = shift; if ( $min_qlen =~ /\D/ or $min_qlen <= 0 ) { $self->throw( -class => 'Bio::Root::BadParameter', -text => "Invalid minimum query length value: $min_qlen\n" . 
"Value must be an integer > 0. Value not set.", -value => $min_qlen ); } $self->{'_confirm_qlength'} = 1; $self->{'_min_query_length'} = $min_qlen; } return $self->{'_min_query_length'}; } =head2 best_hit_only Title : best_hit_only Usage : print "only getting best hit.\n" if $obj->best_hit_only; Purpose : Set/Get the indicator for whether or not to process only : the best BlastHit. Returns : Boolean (1 | 0) Argument : Boolean (1 | 0) (when setting) =cut sub best_hit_only { my $self = shift; if (@_) { $self->{'_best'} = shift; } $self->{'_best'}; } =head2 check_all_hits Title : check_all_hits Usage : print "checking all hits.\n" if $obj->check_all_hits; Purpose : Set/Get the indicator for whether or not to process all hits. : If false, the parser will stop processing hits after the : the first non-significance hit or the first hit that fails : any hit filter. Returns : Boolean (1 | 0) Argument : Boolean (1 | 0) (when setting) =cut sub check_all_hits { my $self = shift; if (@_) { $self->{'_check_all'} = shift; } $self->{'_check_all'}; } =head2 _load_format_module Title : _load_format_module Usage : *INTERNAL SearchIO stuff* Function: Loads up (like use) a module at run time on demand Example : Returns : Args : =cut sub _load_format_module { my ($self,$format) = @_; my $module = "Bio::SearchIO::" . $format; my $ok; eval { $ok = $self->_load_module($module); }; if ( $@ ) { print STDERR <_guess_format($filename) Function: Example : Returns : guessed format of filename (lower case) Args : =cut sub _guess_format { my $class = shift; return unless $_ = shift; return 'blast' if (/\.(blast|t?bl\w)$/i ); return 'fasta' if (/\. (?: t? fas (?:ta)? | m\d+ | (?: t? (?: fa | fx | fy | ff | fs ) ) | (?: (?:ss | os | ps) (?:earch)? 
)) $/ix ); return 'blastxml' if ( /\.(blast)?xml$/i); return 'exonerate' if ( /\.exon(erate)?/i ); } sub close { my $self = shift; if( $self->writer ) { $self->_print($self->writer->end_report()); $self->{'_result_writer'}= undef; } $self->SUPER::close(@_); } sub DESTROY { my $self = shift; $self->close() if defined $self->_fh; $self->SUPER::DESTROY; } sub TIEHANDLE { my $class = shift; return bless {processor => shift}, $class; } sub READLINE { my $self = shift; return $self->{'processor'}->next_result() || undef unless wantarray; my (@list, $obj); push @list, $obj while $obj = $self->{'processor'}->next_result(); return @list; } sub PRINT { my $self = shift; $self->{'processor'}->write_result(@_); } 1; __END__ Seq000755000766000024 014006427353 15067 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/BioMeta.pm100644000766000024 4701614006427353 16503 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio/Seq# # BioPerl module for Bio::Seq::Meta # # Please direct questions and support issues to # # Cared for by Heikki Lehvaslaiho # # Copyright Heikki Lehvaslaiho # # You may distribute this module under the same terms as perl itself # POD documentation - main docs before the code =head1 NAME Bio::Seq::Meta - Generic superclass for sequence objects with residue-based meta information =head1 SYNOPSIS use Bio::LocatableSeq; use Bio::Seq::Meta; use Bio::Tools::OddCodes; use Bio::SeqIO; my $seq = Bio::Seq::Meta->new(-id=>'test', -seq=>'ACTGCTAGCT', -start=>2434, -end=>2443, -strand=>1, -verbose=>1, # to see warnings ); # the existing sequence object can be a Bio::PrimarySeq, too # to test this is a meta seq object $seq->isa("Bio::Seq::Meta") || $seq->throw("$seq is not a Bio::Seq::Meta"); $seq->meta('1234567890'); $seq = Bio::Seq::Meta->new(-id=>'test', -seq=>'HACILMIFGT', -start=>2434, -end=>2443, -strand=>1, -meta=>'1234567890', -verbose=>1, # to see warnings ); # accessors $string = $seq->meta_text(); $substring = $seq->submeta_text(2,5); $unique_key = 
$seq->accession_number(); # storing output from Bio::Tools::OddCodes as meta data my $protcodes = Bio::Tools::OddCodes->new(-seq => $seq); my @codes = qw(structural chemical functional charge hydrophobic); map { $seq->named_meta($_, ${$protcodes->$_($seq) } )} @codes; my $out = Bio::SeqIO->new(-format=>'metafasta'); $out->write_seq($seq); =head1 DESCRIPTION This class implements generic methods for sequences with residue-based meta information. Meta sequences with meta data are Bio::LocatableSeq objects with additional methods to store that meta information. See L and L. The meta information in this class is always one character per residue long and blank values are space characters (ASCII 32). After the latest rewrite, the meta information no longer covers all the residues automatically. Methods to check the length of meta information (L)and to see if the ends are flushed to the sequence have been added (L). To force the old functionality, set L to true. It is assumed that meta data values do not depend on the nucleotide sequence strand value. Application specific implementations should inherit from this class to override and add to these methods. L allows for more complex meta values (scalars or objects) to be used. =head2 Method naming Character based meta data is read and set by method meta() and its variants. These are the suffixes and prefixes used in the variants: [named_] [sub] meta [_text] =over 3 =item _text Suffix B<_text> guaranties that output is a string. Note that it does not limit the input. In this implementation, the output is always text, so these methods are redundant. =item sub Prefix B, like in subseq(), means that the method applies to sub region of the sequence range and takes start and end as arguments. Unlike subseq(), these methods are able to set values. If the range is not defined, it defaults to the complete sequence. 
=item named

Prefix B<named> in method names allows the user to attach multiple
meta strings to one sequence by explicitly naming them. The name is
always the first argument to the method. The "unnamed" methods use the
class-wide default name for the meta data and are thus special cases
of the "named" methods.

Note that internally names are keys in a hash and any misspelling of a
name will silently store the data under a wrong name. The used names
(keys) can be retrieved using method meta_names(). See L</meta_names>.

=back

=head1 NOTE

This Bio::Seq::MetaI implementation inherits from Bio::LocatableSeq, which
itself inherits from Bio::PrimarySeq. It is not a Bio::SeqI, so bless-ing
objects of this class into a Bio::SeqI, or vice versa, will not work as
expected (see bug 2262). This may be addressed in a future refactor of
Bio::LocatableSeq.

=head1 SEE ALSO

L, L, L

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Support

Please direct usage questions or support issues to the mailing list:

I<bioperl-l@bioperl.org>

rather than to the module maintainer directly. Many experienced and
responsive experts will be able to look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR - Heikki Lehvaslaiho

Email heikki-at-bioperl-dot-org

=head1 CONTRIBUTORS

Chad Matsalla, bioinformatics@dieselwurks.com

Aaron Mackey, amackey@virginia.edu

=head1 APPENDIX

The rest of the documentation details each of the object methods.
Internal methods are usually preceded with a _ =cut # Let the code begin... package Bio::Seq::Meta; $Bio::Seq::Meta::VERSION = '1.7.8'; use vars qw($DEFAULT_NAME $GAP $META_GAP); use strict; #use overload '""' => \&to_string; use base qw(Bio::LocatableSeq Bio::Seq::MetaI); BEGIN { $DEFAULT_NAME = 'DEFAULT'; $GAP = '-'; $META_GAP = ' '; } =head2 new Title : new Usage : $metaseq = Bio::Seq::Meta->new ( -meta => 'aaaaaaaabbbbbbbb', -seq => 'TKLMILVSHIVILSRM' -id => 'human_id', -accession_number => 'S000012', ); Function: Constructor for Bio::Seq::Meta class, meta data being in a string. Note that you can provide an empty quality string. Returns : a new Bio::Seq::Meta object =cut sub new { my ($class, @args) = @_; my $self = $class->SUPER::new(@args); my($meta, $forceflush, $nm) = $self->_rearrange([qw(META FORCE_FLUSH NAMED_META)], @args); #$self->{'_meta'} = {}; $self->{'_meta'}->{$DEFAULT_NAME} = ""; $meta && $self->meta($meta); if ($nm && ref($nm) eq 'HASH') { while (my ($name, $meta) = each %$nm) { $self->named_meta($name, $meta); } } $forceflush && $self->force_flush($forceflush); return $self; } =head2 meta Title : meta Usage : $meta_values = $obj->meta($values_string); Function: Get and set method for the meta data starting from residue position one. Since it is dependent on the length of the sequence, it needs to be manipulated after the sequence. The length of the returned value always matches the length of the sequence, if force_flush() is set. See L. Returns : meta data in a string Args : new value, string, optional =cut sub meta { shift->named_meta($DEFAULT_NAME, shift); } =head2 meta_text Title : meta_text Usage : $meta_values = $obj->meta_text($values_arrayref); Function: Variant of meta() guarantied to return a textual representation of meta data. For details, see L. 
Returns : a string Args : new value, optional =cut sub meta_text { shift->meta(shift); } =head2 named_meta Title : named_meta() Usage : $meta_values = $obj->named_meta($name, $values_arrayref); Function: A more general version of meta(). Each meta data set needs to be named. See also L. Returns : a string Args : scalar, name of the meta data set new value, optional =cut sub named_meta { my ($self, $name, $value) = @_; $name ||= $DEFAULT_NAME; if( defined $value) { $self->throw("I need a scalar value, not [". ref($value). "]") if ref($value); # test for length my $diff = $self->length - CORE::length($value); if ($diff > 0) { $value .= (" " x $diff); } $self->{'_meta'}->{$name} = $value; #$self->_test_gap_positions($name) if $self->verbose > 0; } return " " x $self->length if $self->force_flush && not defined $self->{'_meta'}->{$name}; $self->_do_flush if $self->force_flush; return $self->{'_meta'}->{$name}; } =head2 _test_gap_positions Title : _test_gap_positions Usage : $meta_values = $obj->_test_gap_positions($name); Function: Internal test for correct position of gap characters. Gap being only '-' this time. This method is called from named_meta() when setting meta data but only if verbose is positive as this can be an expensive process on very long sequences. Set verbose(1) to see warnings when gaps do not align in sequence and meta data and turn them into errors by setting verbose(2). Returns : true on success, prints warnings Args : none =cut sub _test_gap_positions { my $self = shift; my $name = shift; my $success = 1; $self->seq || return $success; my $len = CORE::length($self->seq); for (my $i=0; $i < $len; $i++) { my $s = substr $self->{seq}, $i, 1; my $m = substr $self->{_meta}->{$name}, $i, 1; $self->warn("Gap mismatch [$m/$s] in column [". ($i+1). "] of [$name] meta data in seq [". $self->id. 
"]") and $success = 0 if ($s eq $META_GAP) && $s ne $m; } return $success; } =head2 named_meta_text Title : named_meta_text() Usage : $meta_values = $obj->named_meta_text($name, $values_arrayref); Function: Variant of named_meta() guarantied to return a textual representation of the named meta data. For details, see L. Returns : a string Args : scalar, name of the meta data set new value, optional =cut sub named_meta_text { shift->named_meta(@_); } =head2 submeta Title : submeta Usage : $subset_of_meta_values = $obj->submeta(10, 20, $value_string); $subset_of_meta_values = $obj->submeta(10, undef, $value_string); Function: Get and set method for meta data for subsequences. Numbering starts from 1 and the number is inclusive, ie 1-2 are the first two residue of the sequence. Start cannot be larger than end but can be equal. If the second argument is missing the returned values should extend to the end of the sequence. The return value may be a string or an array reference, depending on the implementation. If in doubt, use submeta_text() which is a variant guarantied to return a string. See L. Returns : A reference to an array or a string Args : integer, start position integer, end position, optional when a third argument present new value, optional =cut sub submeta { shift->named_submeta($DEFAULT_NAME, @_); } =head2 submeta_text Title : submeta_text Usage : $meta_values = $obj->submeta_text(20, $value_string); Function: Variant of submeta() guarantied to return a textual representation of meta data. For details, see L. Returns : a string Args : new value, optional =cut sub submeta_text { shift->submeta(@_); } =head2 named_submeta Title : named_submeta Usage : $subset_of_meta_values = $obj->named_submeta($name, 10, 20, $value_string); $subset_of_meta_values = $obj->named_submeta($name, 10); Function: Variant of submeta() guarantied to return a textual representation of meta data. For details, see L. 
Returns : A reference to an array or a string Args : scalar, name of the meta data set integer, start position integer, end position, optional when a third argument present new value, optional =cut sub named_submeta { my ($self, $name, $start, $end, $value) = @_; $name ||= $DEFAULT_NAME; $start ||=1; $start =~ /^[+]?\d+$/ and $start > 0 or $self->throw("Need at least a positive integer start value"); if ($value) { $end ||= $start+length($value)-1; $self->warn("You are setting meta values beyond the length of the sequence\n". "[$start > ". length($self->seq)."] in sequence ". $self->id) if $start > length $self->seq; # pad meta data if needed $self->{_meta}->{$name} = () unless defined $self->{_meta}->{$name}; if (length($self->{_meta}->{$name}) < $start) { $self->{'_meta'}->{$name} .= " " x ( $start - length($self->{'_meta'}->{$name}) -1); } my $tail = ''; $tail = substr ($self->{_meta}->{$name}, $start-1+length($value)) if length($self->{_meta}->{$name}) >= $start-1+length($value); substr ($self->{_meta}->{$name}, --$start) = $value; $self->{_meta}->{$name} .= $tail; return substr ($self->{_meta}->{$name}, $start, $end - $start + 1); } else { $end or $end = length $self->seq; # pad meta data if needed if (length($self->{_meta}->{$name}) < $end) { $self->{'_meta'}->{$name} .= " " x ( $start - length($self->{'_meta'}->{$name})); } return substr ($self->{_meta}->{$name}, $start-1, $end - $start + 1) } } =head2 named_submeta_text Title : named_submeta_text Usage : $meta_values = $obj->named_submeta_text($name, 20, $value_string); Function: Variant of submeta() guarantied to return a textual representation of meta data. For details, see L. 
 Returns : a string
 Args    : scalar, name of the meta data
           integer, start position, optional
           integer, end position, optional
           new value, optional

=cut

sub named_submeta_text {
    shift->named_submeta(@_);
}

=head2 meta_names

 Title   : meta_names
 Usage   : @meta_names = $obj->meta_names()
 Function: Retrieves an array of meta data set names. The default
           (unnamed) set name, when it holds data, is guaranteed to
           be the first name.
 Returns : an array of names
 Args    : none

=cut

sub meta_names {
    my ($self) = @_;

    my @r;
    foreach ( sort keys %{$self->{'_meta'}} ) {
        push (@r, $_) unless $_ eq $DEFAULT_NAME;
    }
    unshift @r, $DEFAULT_NAME if $self->{'_meta'}->{$DEFAULT_NAME};
    return @r;
}

=head2 meta_length

 Title   : meta_length()
 Usage   : $meta_len = $obj->meta_length();
 Function: return the number of elements in the meta set
 Returns : integer
 Args    : -

=cut

sub meta_length {
    my ($self) = @_;
    return $self->named_meta_length($DEFAULT_NAME);
}

=head2 named_meta_length

 Title   : named_meta_length()
 Usage   : $meta_len = $obj->named_meta_length($name);
 Function: return the number of elements in the named meta set
 Returns : integer
 Args    : scalar, name of the meta data set, optional
           (defaults to the unnamed set)

=cut

sub named_meta_length {
    my ($self, $name) = @_;
    $name ||= $DEFAULT_NAME;
    return length ($self->{'_meta'}->{$name});
}

=head2 force_flush

 Title   : force_flush()
 Usage   : $force_flush = $obj->force_flush(1);
 Function: Automatically pad with empty values or truncate meta values
           to sequence length. Not done by default.
 Returns : boolean 1 or 0
 Args    : optional boolean value

Note that if you turn this forced padding off, the previously padded
values are not changed.

=cut

sub force_flush {
    my ($self, $value) = @_;

    if (defined $value) {
        if ($value) {
            $self->{force_flush} = 1;
            $self->_do_flush;
        } else {
            $self->{force_flush} = 0;
        }
    }

    return $self->{force_flush};
}

=head2 _do_flush

 Title   : _do_flush
 Usage   :
 Function: Internal method that forces meta values to be the same
           length as the sequence.
           Called from force_flush()
 Returns :
 Args    :

=cut

sub _do_flush {
    my ($self) = @_;

    foreach my $name ( ('DEFAULT', $self->meta_names) ) {

        # elongation: pad short meta strings with the gap character
        if ($self->length > $self->named_meta_length($name)) {
            $self->{'_meta'}->{$name} .= $META_GAP x ($self->length - $self->named_meta_length($name));
        }
        # truncation: cut long meta strings down to the sequence length
        elsif ( $self->length < $self->named_meta_length($name) ) {
            $self->{_meta}->{$name} = substr($self->{_meta}->{$name}, 0, $self->length);
        }
    }
}

=head2 is_flush

 Title   : is_flush
 Usage   : $is_flush = $obj->is_flush()
           or $is_flush = $obj->is_flush($my_meta_name)
 Function: Boolean to tell if all meta values are flush with the
           sequence length.
           Returns true if force_flush() is set.
           Set verbosity to a positive value to see failed meta sets.
 Returns : boolean 1 or 0
 Args    : optional name of the meta set

=cut

sub is_flush {
    # Unpack both arguments; a bare shift would leave $name undefined
    my ($self, $name) = @_;

    return 1 if $self->force_flush;

    my $sticky = '';

    if ($name) {
        $sticky .= "$name " if $self->length != $self->named_meta_length($name);
    } else {
        foreach my $m ($self->meta_names) {
            $sticky .= "$m "
                if ($self->named_meta_length($m) > 0)
                && ($self->length != $self->named_meta_length($m));
        }
    }

    if ($sticky) {
        print "These meta sets are not flush: $sticky\n" if $self->verbose;
        return 0;
    }

    return 1;
}

=head1 Bio::PrimarySeqI methods

=head2 revcom

 Title   : revcom
 Usage   : $newseq = $seq->revcom();
 Function: Produces a new Bio::Seq::MetaI implementing object where
           the order of residues and their meta information is reversed.
 Returns : A new (fresh) Bio::Seq::Meta object
 Args    : none
 Throws  : if the object returns false on is_flush()

Note: The method does not change the meta values; it only reorders them.

=cut

sub revcom {
    my $self = shift;

    $self->throw("Can not get a reverse complement. 
The object is not flush.") unless $self->is_flush; my $new = $self->SUPER::revcom; foreach (keys %{$self->{_meta}}) { $new->named_meta($_, scalar reverse $self->{_meta}->{$_} ); }; return $new; } =head2 trunc Title : trunc Usage : $subseq = $seq->trunc(10,100); Function: Provides a truncation of a sequence together with meta data Returns : a fresh Bio::Seq::Meta implementing object Args : Two integers denoting first and last residue of the sub-sequence. =cut sub trunc { my ($self, $start, $end) = @_; # test arguments $start =~ /^[+]?\d+$/ and $start > 0 or $self->throw("Need at least a positive integer start value as start"); $end =~ /^[+]?\d+$/ and $end > 0 or $self->throw("Need at least a positive integer start value as end"); $end >= $start or $self->throw("End position has to be larger or equal to start"); $end <= $self->length or $self->throw("End position can not be larger than sequence length"); my $new = $self->SUPER::trunc($start, $end); $start--; foreach (keys %{$self->{_meta}}) { $new->named_meta($_, substr($self->{_meta}->{$_}, $start, $end - $start) ); }; return $new; } sub to_string { my ($self) = @_; my $out = Bio::SeqIO->new(-format=>'metafasta'); $out->write_seq($self); return 1; } 1; Fasta.pm100644000766000024 3450014006427353 16402 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/lib/Bio/DB# # BioPerl module for Bio::DB::Fasta # # You may distribute this module under the same terms as perl itself # =head1 NAME Bio::DB::Fasta - Fast indexed access to fasta files =head1 SYNOPSIS use Bio::DB::Fasta; # Create database from a directory of Fasta files my $db = Bio::DB::Fasta->new('/path/to/fasta/files/'); my @ids = $db->get_all_primary_ids; # Simple access my $seqstr = $db->seq('CHROMOSOME_I', 4_000_000 => 4_100_000); my $revseq = $db->seq('CHROMOSOME_I', 4_100_000 => 4_000_000); my $length = $db->length('CHROMOSOME_I'); my $header = $db->header('CHROMOSOME_I'); my $alphabet = $db->alphabet('CHROMOSOME_I'); # Access to sequence objects. 
See Bio::PrimarySeqI. my $seq = $db->get_Seq_by_id('CHROMOSOME_I'); my $seqstr = $seq->seq; my $subseq = $seq->subseq(4_000_000 => 4_100_000); my $trunc = $seq->trunc(4_000_000 => 4_100_000); my $length = $seq->length; # Loop through sequence objects my $stream = $db->get_PrimarySeq_stream; while (my $seq = $stream->next_seq) { # Bio::PrimarySeqI stuff } # Filehandle access my $fh = Bio::DB::Fasta->newFh('/path/to/fasta/files/'); while (my $seq = <$fh>) { # Bio::PrimarySeqI stuff } # Tied hash access tie %sequences,'Bio::DB::Fasta','/path/to/fasta/files/'; print $sequences{'CHROMOSOME_I:1,20000'}; =head1 DESCRIPTION Bio::DB::Fasta provides indexed access to a single Fasta file, several files, or a directory of files. It provides persistent random access to each sequence entry (either as a Bio::PrimarySeqI-compliant object or a string), and to subsequences within each entry, allowing you to retrieve portions of very large sequences without bringing the entire sequence into memory. Bio::DB::Fasta is based on Bio::DB::IndexedBase. See this module's documentation for details. The Fasta files may contain any combination of nucleotide and protein sequences; during indexing the module guesses the molecular type. Entries may have any line length up to 65,536 characters, and different line lengths are allowed in the same file. However, within a sequence entry, all lines must be the same length except for the last. An error will be thrown if this is not the case. The module uses /^E(\S+)/ to extract the primary ID of each sequence from the Fasta header. See -makeid in Bio::DB::IndexedBase to pass a callback routine to reversibly modify this primary ID, e.g. if you wish to extract a specific portion of the gi|gb|abc|xyz GenBank IDs. =head1 DATABASE CREATION AND INDEXING The object-oriented constructor is new(), the filehandle constructor is newFh() and the tied hash constructor is tie(). They all allow one to index a single Fasta file, several files, or a directory of files. 
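The rule that every line of an entry except the last has the same length is what makes random access cheap: the byte offset of any residue follows from arithmetic alone. The sketch below illustrates that idea in plain Perl. The helper name residue_offset and its parameters are assumptions made for this example; it is not the routine the module itself uses.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of Bio::DB::Fasta: map a 1-based residue
# position to a byte offset in the file.
#   $data_start - byte offset of the first residue (just after the header)
#   $line_len   - bytes per full sequence line, INCLUDING the line terminator
#   $term_len   - bytes in the line terminator (1 for "\n", 2 for "\r\n")
#   $pos        - 1-based residue position
sub residue_offset {
    my ($data_start, $line_len, $term_len, $pos) = @_;
    my $per_line   = $line_len - $term_len;           # residues on a full line
    my $full_lines = int(($pos - 1) / $per_line);     # complete lines before $pos
    return $data_start + $full_lines * $line_len + ($pos - 1) % $per_line;
}

# A tiny in-memory "file": a 6-byte header, then lines of 10 residues each.
my $file   = ">seq1\nABCDEFGHIJ\nKLMNOPQRST\nUVWXY\n";
my $offset = residue_offset(6, 11, 1, 25);
print substr($file, $offset, 1), "\n";   # prints "Y", the 25th residue
```

With an offset in hand, a reader can seek() straight to the first residue of a region and read only the bytes it needs, which is why a mismatch in line lengths has to be treated as an error.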
See Bio::DB::IndexedBase. =head1 SEE ALSO L L L =head1 AUTHOR Lincoln Stein Elstein@cshl.orgE. Copyright (c) 2001 Cold Spring Harbor Laboratory. This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See DISCLAIMER.txt for disclaimers of warranty. =head1 APPENDIX The rest of the documentation details each of the object methods. Internal methods are usually preceded with a _ For BioPerl-style access, the following methods are provided: =head2 get_Seq_by_id Title : get_Seq_by_id, get_Seq_by_acc, get_Seq_by_primary_id Usage : my $seq = $db->get_Seq_by_id($id); Function: Given an ID, fetch the corresponding sequence from the database. Returns : A Bio::PrimarySeq::Fasta object (Bio::PrimarySeqI compliant) Note that to save resource, Bio::PrimarySeq::Fasta sequence objects only load the sequence string into memory when requested using seq(). See L for methods provided by the sequence objects returned from get_Seq_by_id() and get_PrimarySeq_stream(). Args : ID =head2 get_PrimarySeq_stream Title : get_PrimarySeq_stream Usage : my $stream = $db->get_PrimarySeq_stream(); Function: Get a stream of Bio::PrimarySeq::Fasta objects. The stream supports a single method, next_seq(). Each call to next_seq() returns a new Bio::PrimarySeq::Fasta sequence object, until no more sequences remain. Returns : A Bio::DB::Indexed::Stream object Args : None =head1 For simple access, the following methods are provided: =cut package Bio::DB::Fasta; $Bio::DB::Fasta::VERSION = '1.7.8'; use strict; use IO::File; use File::Spec; use Bio::PrimarySeqI; use base qw(Bio::DB::IndexedBase); our $obj_class = 'Bio::PrimarySeq::Fasta'; our $file_glob = '*.{fa,FA,fasta,FASTA,fast,FAST,dna,DNA,fna,FNA,faa,FAA,fsa,FSA}'; =head2 new Title : new Usage : my $db = Bio::DB::Fasta->new( $path, %options); Function: Initialize a new database object. When indexing a directory, files ending in .fa,fasta,fast,dna,fna,faa,fsa are indexed by default. 
Returns : A new Bio::DB::Fasta object. Args : A single file, or path to dir, or arrayref of files Optional arguments: see Bio::DB::IndexedBase =cut sub _calculate_offsets { # Bio::DB::IndexedBase calls this to calculate offsets my ($self, $fileno, $file, $offsets) = @_; my $fh = IO::File->new($file) or $self->throw( "Could not open $file: $!"); binmode $fh; warn "Indexing $file\n" if $self->{debug}; my ($offset, @ids, $linelen, $alphabet, $headerlen, $count, $seq_lines, $last_line, %offsets); my ($l3_len, $l2_len, $l_len, $blank_lines) = (0, 0, 0, 0); my $termination_length = $self->{termination_length}; while (my $line = <$fh>) { # Account for crlf-terminated Windows files if (index($line, '>') == 0) { if ($line =~ /^>(\S+)/) { print STDERR "Indexed $count sequences...\n" if $self->{debug} && (++$count%1000) == 0; # please, do not enforce arbitrary line length requirements. # It's good practice but not enforced. #$self->_check_linelength($linelen); my $pos = tell($fh); if (@ids) { my $strlen = $pos - $offset - length($line); $strlen -= $termination_length * $seq_lines; my $ppos = &{$self->{packmeth}}($offset, $strlen, $strlen, $linelen, $headerlen, $alphabet, $fileno); $alphabet = Bio::DB::IndexedBase::NA; for my $id (@ids) { $offsets->{$id} = $ppos; } } @ids = $self->_makeid($line); ($offset, $headerlen, $linelen, $seq_lines) = ($pos, length $line, 0, 0); ($l3_len, $l2_len, $l_len, $blank_lines) = (0, 0, 0, 0); } else { # Catch bad header lines, bug 3172 $self->throw("FASTA header doesn't match '>(\\S+)': $line"); } } elsif ($line !~ /\S/) { # Skip blank line $blank_lines++; next; } else { # Need to check every line :( $l3_len = $l2_len; $l2_len = $l_len; $l_len = length $line; if (Bio::DB::IndexedBase::DIE_ON_MISSMATCHED_LINES) { if ( ($l3_len > 0) && ($l2_len > 0) && ($l3_len != $l2_len) ) { my $fap = substr($line, 0, 20).".."; $self->throw("Each line of the fasta entry must be the same ". "length except the last. Line above #$. '$fap' is $l2_len". 
" != $l3_len chars."); } if ($blank_lines) { # Blank lines not allowed in entry $self->throw("Blank lines can only precede header lines, ". "found preceding line #$."); } } $linelen ||= length $line; $alphabet ||= $self->_guess_alphabet($line); $seq_lines++; } $last_line = $line; } # Process last entry $self->_check_linelength($linelen); my $pos = tell $fh; if (@ids) { my $strlen = $pos - $offset; if ($linelen == 0) { # yet another pesky empty chr_random.fa file $strlen = 0; } else { if ($last_line !~ /\s$/) { $seq_lines--; } $strlen -= $termination_length * $seq_lines; } my $ppos = &{$self->{packmeth}}($offset, $strlen, $strlen, $linelen, $headerlen, $alphabet, $fileno); for my $id (@ids) { $offsets->{$id} = $ppos; } } return \%offsets; } =head2 seq Title : seq, sequence, subseq Usage : # Entire sequence string my $seqstr = $db->seq($id); # Subsequence my $subseqstr = $db->seq($id, $start, $stop, $strand); # or... my $subseqstr = $db->seq($compound_id); Function: Get a subseq of a sequence from the database. For your convenience, the sequence to extract can be specified with any of the following compound IDs: $db->seq("$id:$start,$stop") $db->seq("$id:$start..$stop") $db->seq("$id:$start-$stop") $db->seq("$id:$start,$stop/$strand") $db->seq("$id:$start..$stop/$strand") $db->seq("$id:$start-$stop/$strand") $db->seq("$id/$strand") In the case of DNA or RNA sequence, if $stop is less than $start, then the reverse complement of the sequence is returned. Avoid using it if possible since this goes against Bio::Seq conventions. Returns : A string Args : ID of sequence to retrieve or Compound ID of subsequence to fetch or ID, optional start (defaults to 1), optional end (defaults to length of sequence) and optional strand (defaults to 1). 
=cut sub subseq { my ($self, $id, $start, $stop, $strand) = @_; $self->throw('Need to provide a sequence ID') if not defined $id; ($id, $start, $stop, $strand) = $self->_parse_compound_id($id, $start, $stop, $strand); my $data; my $fh = $self->_fh($id) or return; my $filestart = $self->_calc_offset($id, $start); my $filestop = $self->_calc_offset($id, $stop ); seek($fh, $filestart,0); read($fh, $data, $filestop-$filestart+1); $data =~ tr/\n\r//d; #strip control characters if ($strand == -1) { # Reverse-complement the sequence $data = Bio::PrimarySeqI::_revcom_from_string($self, $data, $self->alphabet($id)); } return $data; } *seq = *sequence = \&subseq; =head2 length Title : length Usage : my $length = $qualdb->length($id); Function: Get the number of residues in the indicated sequence. Returns : Number Args : ID of entry =head2 header Title : header Usage : my $header = $db->header($id); Function: Get the header line (ID and description fields) of the specified sequence. Returns : String Args : ID of sequence =cut sub header { my ($self, $id) = @_; $self->throw('Need to provide a sequence ID') if not defined $id; my ($offset, $headerlen) = (&{$self->{unpackmeth}}($self->{offsets}{$id}))[0,4]; $offset -= $headerlen; my $data; my $fh = $self->_fh($id) or return; seek($fh, $offset, 0); read($fh, $data, $headerlen); # On Windows chomp remove '\n' but leaves '\r' # when reading '\r\n' in binary mode $data =~ tr/\n\r//d; #strip control characters substr($data, 0, 1) = ''; return $data; } =head2 alphabet Title : alphabet Usage : my $alphabet = $db->alphabet($id); Function: Get the molecular type of the indicated sequence: dna, rna or protein Returns : String Args : ID of sequence =cut #------------------------------------------------------------- # Bio::PrimarySeqI compatibility # package Bio::PrimarySeq::Fasta; $Bio::PrimarySeq::Fasta::VERSION = '1.7.8'; use overload '""' => 'display_id'; use base qw(Bio::Root::Root Bio::PrimarySeqI); sub new { my ($class, @args) = @_; 
my $self = $class->SUPER::new(@args); my ($db, $id, $start, $stop) = $self->_rearrange( [qw(DATABASE ID START STOP)], @args); $self->{db} = $db; $self->{id} = $id; $self->{stop} = $stop || $db->length($id); $self->{start} = $start || ($self->{stop} > 0 ? 1 : 0); # handle 0-length seqs return $self; } sub fetch_sequence { return shift->seq(@_); } sub seq { my $self = shift; return $self->{db}->seq($self->{id}, $self->{start}, $self->{stop}); } sub subseq { my $self = shift; return $self->trunc(@_)->seq(); } sub trunc { # Override Bio::PrimarySeqI trunc() method. This way, we create an object # that does not store the sequence in memory. my ($self, $start, $stop) = @_; $self->throw("Stop cannot be smaller than start") if $stop < $start; if ($self->{start} <= $self->{stop}) { $start = $self->{start}+$start-1; $stop = $self->{start}+$stop-1; } else { $start = $self->{start}-($start-1); $stop = $self->{start}-($stop-1); } return $self->new( $self->{db}, $self->{id}, $start, $stop ); } sub is_circular { my $self = shift; return $self->{is_circular}; } sub display_id { my $self = shift; return $self->{id}; } sub accession_number { my $self = shift; return 'unknown'; } sub primary_id { # Following Bio::PrimarySeqI, since this sequence has no accession number, # its primary_id should be a stringified memory location. my $self = shift; return overload::StrVal($self); } sub can_call_new { return 0; } sub alphabet { my $self = shift; return $self->{db}->alphabet($self->{id}); } sub revcom { # Override Bio::PrimarySeqI revcom() with optimized method. my $self = shift; return $self->new(@{$self}{'db', 'id', 'stop', 'start'}); } sub length { # Get length from sequence location, not the sequence string (too expensive) my $self = shift; return $self->{start} < $self->{stop} ? 
        $self->{stop}  - $self->{start} + 1 :
        $self->{start} - $self->{stop}  + 1 ;
}

sub description {
    my $self = shift;
    my $header = $self->{'db'}->header($self->{id});
    # Remove the ID from the header
    return (split(/\s+/, $header, 2))[1];
}
*desc = \&description;

1;
author-mojibake.t100644000766000024 35314006427353 16712 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t
#!perl

BEGIN {
  unless ($ENV{AUTHOR_TESTING}) {
    print qq{1..0 # SKIP these tests are for testing by the author\n};
    exit
  }
}

use strict;
use warnings qw(all);

use Test::More;
use Test::Mojibake;

all_files_encoding_ok();
bp_taxonomy2tree100644000766000024 543314006427353 17221 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl

=head1 NAME

bp_taxonomy2tree - Building a taxonomic tree based on the full lineages of a set of species names

=head1 DESCRIPTION

This script looks up the provided species names in the NCBI Taxonomy database,
retrieves their full lineages and merges them into a Newick taxonomic tree
that is printed to the screen.

  bp_taxonomy2tree.pl -s Orangutan -s Gorilla -s Chimpanzee -s Human
  bp_taxonomy2tree.pl -s Orangutan -s Gorilla -s Chimpanzee -s "Homo Sapiens"

You can also provide -d to specify the directory in which to store index files,
-o to specify the location of your NCBI nodes file, and -a for the NCBI names
file. Alternatively, use the option -e to query the web-based Entrez taxonomy
database if you do not have the NCBI flatfiles installed.

This script requires that the bioperl-run pkg also be installed.

Providing the nodes.dmp and names.dmp files from the NCBI Taxonomy
dump (see Bio::DB::Taxonomy::flatfile for more info) is only necessary
the first time the script is run. That run creates the local indexes and may
take quite a long time. Once created, however, these indexes allow fast
species to taxon id and taxon id to species name lookups.
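The idea of merging full lineages into one tree and printing it as Newick can be sketched without any database access. The example below is a simplified stand-in, not the script's own code: the lineages are hand-written and abbreviated, a nested hash replaces the Bio::Tree objects, and the newick() function is a hypothetical helper that skips internal nodes with a single child, mimicking the contraction of linear paths.

```perl
use strict;
use warnings;

# Hand-written, abbreviated lineages (root first). The real script
# retrieves full lineages from the NCBI Taxonomy database.
my @lineages = (
    [qw(Hominidae Ponginae Pongo)],
    [qw(Hominidae Homininae Gorilla)],
    [qw(Hominidae Homininae Pan)],
    [qw(Hominidae Homininae Homo)],
);

# Merge all lineages into one nested hash: each key is a node name,
# each value is the hash of that node's children.
my %tree;
for my $lineage (@lineages) {
    my $node = \%tree;
    $node = $node->{$_} //= {} for @$lineage;
}

# Serialize to Newick. Internal nodes with exactly one child are skipped,
# so a linear path collapses onto its lowest node.
sub newick {
    my ($name, $children) = @_;
    my @kids = sort keys %$children;
    return $name unless @kids;
    return newick($kids[0], $children->{$kids[0]}) if @kids == 1;
    return '(' . join(',', map { newick($_, $children->{$_}) } @kids) . ")$name";
}

my ($root) = keys %tree;
my $newick = newick($root, $tree{$root}) . ';';
print "$newick\n";   # prints "((Gorilla,Homo,Pan)Homininae,Pongo)Hominidae;"
```

Note how Ponginae disappears from the output: it has only one descendant in this data set, so it carries no branching information.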
=head1 AUTHOR - Gabriel Valiente, reimplemented by Sendu Bala Email valiente@lsi.upc.edu Email bix@sendu.me.uk =cut use strict; use warnings; use Bio::DB::Taxonomy; use Bio::TreeIO; use Bio::Tree::Compatible; use Getopt::Long; my @species; my $index_dir = "./db/"; my $nodesfile = "nodes.dmp"; my $namesfile = "names.dmp"; my $use_entrez = 0; # the input to the script is an array of species names GetOptions( 's|species=s' => \@species, 'd|dir:s' => \$index_dir, 'o|nodesfile:s' => \$nodesfile, 'a|namesfile:s' => \$namesfile, 'e|entrez' => \$use_entrez, 'h|help' => sub { system('perldoc', $0); exit }, ); my $db = Bio::DB::Taxonomy->new( -source => $use_entrez ? 'entrez' : 'flatfile', -directory => $index_dir, -nodesfile => $nodesfile, -namesfile => $namesfile ); # the full lineages of the species are merged into a single tree my $tree; for my $name (@species) { my $node = $db->get_taxon(-name => $name); if ($node) { if ($tree) { $tree->merge_lineage($node); } else { $tree = Bio::Tree::Tree->new(-node => $node); } } else { warn "no NCBI Taxonomy node for species ",$name,"\n"; } } # simple paths are contracted by removing degree one nodes $tree->contract_linear_paths; # convert tree ids to their names for nice output with TreeIO foreach my $node ($tree->get_nodes) { $node->id($node->node_name); } # the tree is output in Newick format my $output = Bio::TreeIO->new(-format => 'newick'); $output->write_tree($tree); $output->close; 1; bp_unflatten_seq100644000766000024 1635714006427353 17300 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl use strict; use warnings; # Author Chris Mungall =head1 NAME bp_unflatten_seq - unflatten a genbank or genbank-style feature file into a nested SeqFeature hierarchy =head1 SYNOPSIS bp_unflatten_seq.PLS -e 3 -gff ~/cvs/bioperl-live/t/data/AE003644_Adh-genomic.gb bp_unflatten_seq.PLS --detail ~/cvs/bioperl-live/t/data/AE003644_Adh-genomic.gb bp_unflatten_seq.PLS -i foo.embl --from embl --to chadoxml -o out.chado.xml 
bp_unflatten_seq.PLS --notypemap --detail --to asciitree -ethresh 2 AE003644_Adh-genomic.gb =head1 DESCRIPTION This script will B a genbank or genbank-style file of SeqFeatures into a nested hierarchy. See L In a GenBank/EMBL representation, features are 'flat' - for example, there is no link between an mRNA and a CDS, other than implicit links (eg via tags or via splice site coordinates) which may be hard to code for. This is most easily illustrated with the default output format, B An unflattened genbank feature set may look like this (AB077698) Seq: AB077698 databank_entry 1..2701[+] gene mRNA CDS hCHCR-G 80..1144[+] exon 80..1144[+] five_prime_UTR 1..79[+] located_sequence_feature 137..196[+] located_sequence_feature 239..292[+] located_sequence_feature 617..676[+] located_sequence_feature 725..778[+] three_prime_UTR 1145..2659[+] polyA_site 1606..1606[+] polyA_site 2660..2660[+] Or like this (portion of AE003734) gene mRNA CG3320-RA CDS CG3320-PA 53126..54971[-] exon 52204..53323[-] exon 53404..53631[-] exon 53688..53735[-] exon 53798..53918[-] exon 54949..55287[-] mRNA CG3320-RB CDS CG3320-PB 53383..54971[-] exon 52204..53631[-] exon 53688..53735[-] exon 53798..53918[-] exon 54949..55287[-] The unflattening will also 'normalize' the containment hierarchy (in the sense of standardising it - e.g. 
making sure there is always a transcript record, even if genbank just specifies CDS and gene) By default, the GenBank types will be mapped to SO types See L =head1 COMMAND LINE ARGUMENTS =over =item -i|input FILE input file (can also be specified as last argument) =item -from FORMAT input format (defaults to genbank) probably doesn't make so much sense to use this for non-flat formats; ie other than embl/genbank =item -to FORMAT output format (defaults to asciitree) should really be a format that is nested SeqFeature aware; I think this is only asciitree, chadoxml and gff3 =item -gff with export to GFF3 format (pre-3 GFFs make no sense with unflattened sequences, as they have no set way of representing feature graphs) =item -o|output FILE outfile defaults to STDOUT =item -detail show extra detail on features (asciitree mode only) =item -e|ethresh INT sets the error threshold on unflattening by default this script will throw a wobbly if it encounters weird stuff in the genbank file - raise the error threshold to signal these to be ignored (and reported on STDERR) =item -nomagic suppress use_magic in unflattening (see L =item -notypemap suppress type mapping (see L =back =head1 TODO L allows fine-grained control over the unflattening process - need to add more options to allow this control at the command line =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. 
Bug reports can be submitted via email or the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR Chris Mungall Ecjm-at-bioperl.orgE =cut use Bio::SeqIO; use Bio::SeqFeature::Tools::Unflattener; use Bio::SeqFeature::Tools::TypeMapper; use Bio::SeqFeature::Tools::IDHandler; use Bio::Tools::GFF; use Getopt::Long; my ($input,$from,$to,$output,$verbosity,$ethresh,$nomagic,$group_tag,$detail, $notypemap); $from = 'genbank'; $to = 'asciitree'; $ethresh = 3; my $gff; my @remove_types = (); GetOptions( 'i|input:s' => \$input, 'from:s' => \$from, 'to:s' => \$to, 'o|output:s'=> \$output, "verbosity|v=s"=>\$verbosity, "ethresh|e=s"=>\$ethresh, "remove_type=s@"=>\@remove_types, "nomagic"=>\$nomagic, "notypemap"=>\$notypemap, "group_tag"=>\$group_tag, "detail"=>\$detail, "gff"=>\$gff, "h|help"=>sub { system("perldoc $0"); exit 0; }, ); if ($to =~ /^gff/i) { $gff = 1; } $input = $input || shift if @ARGV; my $in = new Bio::SeqIO(-file => $input, -format => $from); my $out; my @out_opt = $output ? 
(-file => ">$output") : (); unless ($gff) { $out = new Bio::SeqIO(-format=>$to, @out_opt); $out->show_detail($detail) if $out->can("show_detail") && $detail; } my $unflattener = Bio::SeqFeature::Tools::Unflattener->new; $unflattener->verbose($verbosity); $unflattener->error_threshold($ethresh); my $tm = Bio::SeqFeature::Tools::TypeMapper->new; my $idhandler = Bio::SeqFeature::Tools::IDHandler->new; while( my $seq = $in->next_seq ) { $unflattener->remove_types(-seq=>$seq, -types=>\@remove_types) if @remove_types; $unflattener->unflatten_seq(-seq=>$seq, -use_magic=>!$nomagic, -group_tag=>$group_tag, ); $unflattener->report_problems(\*STDERR); $tm->map_types_to_SO(-seq=>$seq) unless $notypemap; my @seq_args = ($seq); if ($to eq 'chadoxml') { @seq_args = (-seq=>$seq, -nounflatten=>1) } if ($gff) { my $gffio = Bio::Tools::GFF->new(@out_opt, -noparse=>1, -gff_version => 3); $idhandler->set_ParentIDs_from_hierarchy($seq); foreach my $feature ($seq->get_all_SeqFeatures) { $gffio->write_feature($feature); } $gffio->close(); } else { $out->write_seq(@seq_args); } } __END__ bp_taxid4species100644000766000024 611214006427353 17145 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl # Author: Jason Stajich # Purpose: Retrieve the NCBI Taxa ID for organism(s) # TODO: add rest of POD # use LWP::UserAgent; use XML::Twig; use strict; use warnings; use Getopt::Long; use Data::Dumper; my $verbose = 0; my $plain = 0; my $help = 0; my $USAGE = "taxid4species: [-v] [-p] \"Genus1 species1\" \"Genus2 species2\""; GetOptions('v|verbose' => \$verbose, 'p|plain' => \$plain, 'h|help' => \$help); die("$USAGE\n") if $help; my $ua = new LWP::UserAgent(); my $urlbase = 'https://www.ncbi.nlm.nih.gov/entrez/eutils/'; my $esearch = 'esearch.fcgi?db=taxonomy&usehistory=y&term='; my $esummary = 'esummary.fcgi?db=taxonomy&query_key=QUERYKEY&WebEnv=WEBENV'; my (@organisms) = @ARGV; die("must provide valid organism") unless @organisms; my $organismstr = join(" OR ", @organisms); 
$organismstr =~ s/\s/\+/g; # Esearch my $response = $ua->get($urlbase . $esearch . $organismstr); my $t = XML::Twig->new(); print $response->content,"\n"if($verbose); $t->parse($response->content); my $root = $t->root; my $querykey = $root->first_child('QueryKey')->text; my $webenv = $root->first_child('WebEnv')->text; # Esummary $esummary =~ s/QUERYKEY/$querykey/; $esummary =~ s/WEBENV/$webenv/; $response = $ua->get($urlbase . $esummary); $t = XML::Twig->new(); print $response->content,"\n"if($verbose); $t->parse($response->content); $root = $t->root; # Parse XML my %taxinfo; foreach my $docsum ($root->children) { foreach my $item ($docsum->children('Item')) { if ($item->{att}{Name} eq 'ScientificName') { my $sciname = $item->text; $taxinfo{lc $sciname}{sciname} = $sciname; $taxinfo{lc $sciname}{tid} = $docsum->first_child_text('Id'); last; } } } # Output in same order as given on command line foreach my $orgn (@organisms) { if (exists $taxinfo{lc $orgn}) { my $tid = $taxinfo{lc $orgn}{tid}; if ($plain) { print $tid, "\n"; } else { print join(", ", "'$orgn'", $tid), "\n"; } } else { print "'$orgn' not found\n"; } } =head1 NAME bp_taxid4species - simple script which returns the NCBI Taxonomic id for a requested species =head1 SYNOPSIS bp_taxid4species [-v] [-p] [-h] "Genus1 species1" "Genus2 species2" Options: -v verbose -p plain -h help =head1 DESCRIPTION This simple script shows how to get the taxa id from NCBI Entrez and will return a list of taxa ids for requested organisms. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. 
Bug reports can be submitted via the web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR

Jason Stajich jason-at-bioperl-dot-org

=cut
bp_translate_seq100644000766000024 276714006427353 17255 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
use strict;
use warnings;

=head1 NAME

bp_translate_seq - translates a sequence

=head1 SYNOPSIS

  bp_translate_seq < cdna_cds.fa > protein.fa

=head1 DESCRIPTION

The script will translate one fasta file (on stdin) to protein on stdout.

=head1 FEEDBACK

=head2 Mailing Lists

User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

=head2 Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via
email or the web:

  https://github.com/bioperl/bioperl-live/issues

=head1 AUTHOR

  Ewan Birney E<lt>birney@ebi.ac.ukE<gt>

=cut

use Bio::SeqIO;
use Getopt::Long;

my ($format,$outfile) = 'fasta';

GetOptions(
    'f|format:s'        => \$format,
    'o|out|outfile:s'   => \$outfile,
);

my $oformat = 'fasta';

# this implicitly uses the <> file stream
my $seqin = Bio::SeqIO->new( -format => $format, -fh => \*ARGV);
my $seqout;
if( $outfile ) {
    $seqout = Bio::SeqIO->new( -format => $oformat, -file => ">$outfile" );
} else {
    # defaults to writing to STDOUT
    $seqout = Bio::SeqIO->new( -format => $oformat );
}

while( (my $seq = $seqin->next_seq()) ) {
    my $pseq = $seq->translate();
    $seqout->write_seq($pseq);
}

__END__
bp_filter_search100644000766000024 326014006427353 17207 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin
#!/usr/bin/perl
use strict;
use warnings;

=head1 NAME

bp_filter_search - filters searchio results, outputting a tab delimited summary

=head1 SYNOPSIS

  #bp_filter_search -format blast -score 200 < 
search.bl > search.tab =head1 DESCRIPTION This script filters searchio results allowing a number of different filters to be applied before outputting to stdout in a tab delimited format. =head1 FEEDBACK =head2 Mailing Lists User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated. bioperl-l@bioperl.org - General discussion http://bioperl.org/wiki/Mailing_lists - About the mailing lists =head2 Reporting Bugs Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via email or the web: https://github.com/bioperl/bioperl-live/issues =head1 AUTHOR Ewan Birney =cut use Bio::SearchIO; use Getopt::Long; my ($format,$score); $format = 'blast'; $score = 150; GetOptions( 'format:s' => \$format, 'score:s' => \$score, ); my $searchin = Bio::SearchIO->new( -format => $format); while( (my $result = $searchin->next_result()) ) { while( (my $hit = $result->next_hit())) { if( $score ) { if( $hit->raw_score < $score ) { next; } } foreach my $hsp ( $hit->hsps() ) { print $result->query_name,"\t",$hit->score,"\t",$hsp->start,"\t",$hsp->end,"\t",$hsp->strand,"\t",$hsp->hseq_id,"\t",$hsp->hstart,"\t",$hsp->hend,"\t",$hsp->strand,"\n"; } } } bp_bioflat_index100644000766000024 701714006427353 17210 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/bin#!/usr/bin/perl =head1 NAME bp_bioflat_index.pl - index sequence files using Bio::DB::Flat =head1 DESCRIPTION Create or update a biological sequence database indexed with the Bio::DB::Flat indexing scheme. The arguments are a list of flat files containing the sequence information to be indexed. =head1 USAGE bp_bioflat_index.pl file1 file2 file3... Options: --create Create or reinitialize the index. If not specified, the index must already exist. --format The format of the sequence files. 
Must be one of "genbank", "swissprot", "embl" or "fasta". --location Path to the directory in which the index files are stored. --dbname The symbolic name of the database to be created. --indextype Type of index to create. Either "bdb" or "flat". "binarysearch" is the same as "flat". Options can be abbreviated. For example, use -i for --indextype. The following environment variables will be used as defaults if the corresponding options are not provided: OBDA_FORMAT format of sequence file OBDA_LOCATION path to directory in which index files are stored OBDA_DBNAME name of database OBDA_INDEX type of index to create =cut use strict; use warnings; use Bio::Root::Root; use Bio::Root::IO; use Bio::DB::Flat; use Getopt::Long; use File::Path qw(mkpath rmtree); my ($CREATE,$FORMAT,$LOCATION,$DBNAME,$INDEXTYPE); GetOptions( 'create' => \$CREATE, 'format:s' => \$FORMAT, 'location:s' => \$LOCATION, 'dbname:s' => \$DBNAME, 'indextype:s' => \$INDEXTYPE ); $FORMAT = $ENV{OBDA_FORMAT} unless defined $FORMAT; $LOCATION = $ENV{OBDA_LOCATION} unless defined $LOCATION; $DBNAME = $ENV{OBDA_DBNAME} unless defined $DBNAME; $INDEXTYPE = $ENV{OBDA_INDEXTYPE} unless defined $INDEXTYPE; my $root = 'Bio::Root::Root'; my $io = 'Bio::Root::IO'; # confirm that database directory is there defined $LOCATION or $root->throw("please provide a base directory with the --location option"); -d $LOCATION or $root->throw("$LOCATION is not a valid directory; use --create to create a new index"); defined $DBNAME or $root->throw("please provide a database name with the --dbname option"); defined $FORMAT or $root->throw("please specify the format for the input files with the --format option"); unless (defined $INDEXTYPE) { $INDEXTYPE = 'flat'; $root->warn('setting index type to "flat", use the --indextype option to override'); } # Confirm that database is there and that --create flag is sensible. 
my $path = $io->catfile($LOCATION,$DBNAME,'config.dat'); if (-e $path) { if ($CREATE) { $root->warn("existing index detected; deleting."); rmtree($io->catfile($LOCATION,$DBNAME),1,1); } else { $root->warn("existing index detected; ignoring --indextype and --format options."); undef $INDEXTYPE; } } elsif (!$CREATE) { $root->throw("Cannot find database config file at location $path; use --create to create a new index"); } # open for writing/updating my $db = Bio::DB::Flat->new(-directory => $LOCATION, -dbname => $DBNAME, $INDEXTYPE ? ( -index => $INDEXTYPE ) : (), -write_flag => 1, -format => $FORMAT) or $root->throw("can't create Bio::DB::Flat object"); my $entries = $db->build_index(@ARGV); print STDERR "(Re)indexed $entries entries.\n "; __END__ tRNAscanSE.t100644000766000024 175314006427353 16657 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 14); use_ok('Bio::Tools::tRNAscanSE'); } my $verbose = test_debug(); my $parser = Bio::Tools::tRNAscanSE->new(-verbose => $verbose, -file => test_input_file('yeast.tRNAscanSE')); isa_ok($parser, 'Bio::Tools::tRNAscanSE') ; my @genes; while( my $gene = $parser->next_prediction ) { push @genes, $gene; } is (scalar(@genes), 287); is($genes[2]->seq_id, 'I', 'seq_id'); my ($codon) = $genes[2]->get_tag_values('Codon'); is($codon, 'TTG', 'codon'); is($genes[2]->start, 181135, 'start'); is($genes[2]->end, 181248, 'end'); is($genes[2]->strand, 1, 'strand'); my @exons = $genes[2]->get_SeqFeatures ; is ( scalar(@exons), 2, 'exons' ); is($exons[0]->end,181172, 'end' ); is($exons[0]->start,$genes[2]->start, 'start'); is($exons[1]->start,181205, 'start'); is($exons[1]->end,$genes[2]->end, 'end'); is($exons[0]->seq_id, $genes[2]->seq_id, 'seq_id'); Pseudowise.t100644000766000024 206014006427353 17075 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; 
BEGIN { use Bio::Root::Test; test_begin(-tests => 21); use_ok('Bio::Tools::Pseudowise'); } my $inputfilename= test_input_file('pseudowise.out'); my $parser = Bio::Tools::Pseudowise->new(-file => $inputfilename); my @gene; while (my $gene= $parser->next_prediction){ push @gene, $gene; } my ($g) = @gene; my @e = $g->sub_SeqFeature; is ($g->primary_tag, 'pseudogene'); is ($g->source_tag, 'pseudowise'); is(($g->get_tag_values('Synonymous'))[0],7); is(($g->get_tag_values('Nonsynonymous'))[0],18); is(($g->get_tag_values('Ka/Ks'))[0],2.57); is(($g->get_tag_values('Unlikely'))[0],0); is(($g->get_tag_values('Identical'))[0],5); is(($g->get_tag_values('Stop'))[0],0); is(($g->get_tag_values('Total codons'))[0],30); is(($g->get_tag_values('Frameshift'))[0],0); is(($g->get_tag_values('Intron'))[0],1); is($g->start,163); is($g->end,626); is($g->strand,1); is($e[0]->start, 163); is($e[0]->end,213); is($e[0]->strand,1); is($e[1]->start,585); is($e[1]->end,626); is($e[1]->strand,1); Est2Genome.t100644000766000024 425014006427353 16721 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 61); use_ok('Bio::Tools::Est2Genome'); } my $parser = Bio::Tools::Est2Genome->new(-file => test_input_file('hs_est.est2genome')); ok($parser); my $feature_set = $parser->parse_next_gene; like(ref($feature_set), qr/ARRAY/i ); is(scalar @$feature_set, 7); my @exons = grep { $_->primary_tag eq 'Exon' } @$feature_set; my @introns = grep { $_->primary_tag eq 'Intron' } @$feature_set; my @expected_exons = ( [695,813,1,1,119,1], [1377,1493,1,120,236,1], [1789,1935,1,237,382,1], [2084,2180,1,383,479,1]); my @expected_introns = ( [814,1376,1], [1494,1788,1], [1936,2083,1] ); foreach my $e ( @exons ) { my $test_e = shift @expected_exons; my $i = 0; is($e->query->start, $test_e->[$i++]); is($e->query->end, $test_e->[$i++]); is($e->query->strand, $test_e->[$i++]); is($e->hit->start, 
$test_e->[$i++]); is($e->hit->end, $test_e->[$i++]); is($e->hit->strand, $test_e->[$i++]); } ok(! @expected_exons); foreach my $intron ( @introns ) { my $test_i = shift @expected_introns; my $i = 0; is($intron->start, $test_i->[$i++]); is($intron->end, $test_i->[$i++]); is($intron->strand, $test_i->[$i++]); } ok(! @expected_introns); $parser = Bio::Tools::Est2Genome->new(-file => test_input_file('hs_est.est2genome')); ok($parser); my $gene = $parser->parse_next_gene(1); @expected_exons = ( [695,813,1,1,119,1], [1377,1493,1,120,236,1], [1789,1935,1,237,382,1], [2084,2180,1,383,479,1]); @expected_introns = ( [814,1376,1], [1494,1788,1], [1936,2083,1] ); foreach my $trans($gene->transcripts){ my @exons = $trans->exons; foreach my $e(@exons){ my $test_e = shift @expected_exons; my $i = 0; is($e->start, $test_e->[$i++]); is($e->end, $test_e->[$i++]); is($e->strand, $test_e->[$i++]); } my @introns = $trans->introns; foreach my $intron ( @introns ) { my $test_i = shift @expected_introns; my $i = 0; is($intron->start, $test_i->[$i++]); is($intron->end, $test_i->[$i++]); is($intron->strand, $test_i->[$i++]); } } Genomewise.t100644000766000024 257614006427353 17064 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools# -*-Perl-*- Test Harness script for Bioperl # $Id: Genomewise.t 11525 2007-06-27 10:16:38Z sendu $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 21); use_ok('Bio::Tools::Genomewise'); } my $inputfilename = test_input_file("genomewise.out"); my $parser = Bio::Tools::Genomewise->new(-file => $inputfilename); my @gene; while (my $gene= $parser->next_prediction){ push @gene, $gene; } my @t = $gene[0]->transcripts; my @e = $t[0]->exons; is ($t[0]->source_tag, 'genomewise'); is ($e[0]->source_tag, 'genomewise'); is ($t[0]->primary_tag, 'transcript'); is ($e[0]->primary_tag, 'exon'); is (scalar($t[0]->exons), 5); is ($t[0]->start, 4761); is ($t[0]->end, 6713); is ($e[0]->start,4761); is ($e[0]->end, 4874); my ($phase) = $e[0]->get_tag_values('phase'); 
is ($phase,0); open my $FH, '<', $inputfilename or die "Could not read file '$inputfilename': $!\n"; $parser = Bio::Tools::Genomewise->new(-fh => $FH); while (my $gene= $parser->next_prediction){ push @gene, $gene; } @t = $gene[1]->transcripts; @e = $t[0]->exons; is ($t[0]->source_tag, 'genomewise'); is ($e[0]->source_tag, 'genomewise'); is ($t[0]->primary_tag, 'transcript'); is ($e[0]->primary_tag, 'exon'); is (scalar($t[0]->exons), 3); is ($t[0]->start, 9862); is ($t[0]->end, 10316); is ($e[1]->start,10024); is ($e[1]->end, 10211); ($phase) = $e[2]->get_tag_values('phase'); is ($phase,2); Phylo000755000766000024 014006427353 15516 5ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/ToolsGerp.t100644000766000024 232714006427353 16744 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Tools/Phylo# -*-Perl-*- Test Harness script for Bioperl # $Id: gerp.t,v 1.15 2007/06/27 10:16:38 sendu Exp $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 33, -requires_module => 'URI::Escape'); use_ok('Bio::Tools::Phylo::Gerp'); } ok my $parser = Bio::Tools::Phylo::Gerp->new(-file => test_input_file('ENr111.mfa.example.elems')); my $count = 0; my @expected = ([qw(334180 334352 449 1.03744e-165)], [qw(337735 337915 458.2 5.02405e-164)], [qw(262604 262861 473.1 3.64789e-117)], [qw(285427 285608 386.1 8.42494e-113)], [qw(309563 309744 383.6 2.88895e-111)]); while (my $feat = $parser->next_result) { $count++; my @exp = @{shift(@expected)}; isa_ok $feat, 'Bio::SeqFeature::Generic'; is $feat->source_tag, 'GERP', 'correct source'; is $feat->start, shift(@exp), 'feature start correct'; is $feat->end, shift(@exp), 'feature end correct'; is $feat->score, shift(@exp), 'feature score correct'; my ($p_value) = $feat->annotation->get_Annotations('pvalue'); is ref $p_value ? 
$p_value->value : $p_value, shift(@exp), 'feature pvalue correct'; } is $count, 5, "correct number of results parsed out"; SeqBuilder.t100644000766000024 726514006427353 16701 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 137); use_ok('Bio::SeqIO'); } my $verbosity = test_debug; my ($seqio, $seq); # predeclare variables for strict # default mode ok $seqio = Bio::SeqIO->new('-file' => test_input_file('test.genbank'), '-format' => 'GenBank'); $seqio->verbose($verbosity); my $numseqs = 0; my @loci = qw(U63596 U63595 M37762 NT_010368 L26462); my @numfeas = (3,1,6,3,26); while ($seq = $seqio->next_seq) { is $seq->accession_number, $loci[$numseqs++]; ok $seq->annotation->get_Annotations; is scalar($seq->top_SeqFeatures), $numfeas[$numseqs-1]; ok $seq->species->binomial; ok $seq->seq; ok $seq->desc; ok $seq->id; } is $numseqs, 5; # minimalistic mode $seqio = Bio::SeqIO->new('-file' => test_input_file('test.genbank'), '-format' => 'GenBank'); ok $seqio; $seqio->verbose($verbosity); ok my $seqbuilder = $seqio->sequence_builder; isa_ok $seqbuilder, "Bio::Factory::ObjectBuilderI"; $seqbuilder->want_none; $seqbuilder->add_wanted_slot('display_id','accession_number','desc'); $numseqs = 0; while ($seq = $seqio->next_seq) { is $seq->accession_number, $loci[$numseqs++]; is scalar(grep { ! 
($_->tagname eq "keyword" || $_->tagname eq "date_changed" || $_->tagname eq "secondary_accession"); } $seq->annotation->get_Annotations), 0; if ($numseqs <= 3) { is scalar($seq->top_SeqFeatures), 0; } else { is scalar($seq->top_SeqFeatures), $numfeas[$numseqs-1]; } is $seq->species, undef; is $seq->seq, undef; ok $seq->desc; ok $seq->id; # switch on features for the last 2 seqs $seqbuilder->add_wanted_slot('features') if $numseqs == 3; } is $numseqs, 5; # everything but no sequence, and no features $seqio = Bio::SeqIO->new('-file' => test_input_file('test.genbank'), '-format' => 'GenBank'); ok $seqio; $seqio->verbose($verbosity); $seqbuilder = $seqio->sequence_builder; # want-all is default $seqbuilder->add_unwanted_slot('seq','features'); $numseqs = 0; while ($seq = $seqio->next_seq) { is $seq->accession_number, $loci[$numseqs++]; ok scalar($seq->annotation->get_Annotations); if ($numseqs <= 3) { is scalar($seq->top_SeqFeatures), 0; } else { is scalar($seq->top_SeqFeatures), $numfeas[$numseqs-1]; } ok $seq->species->binomial; is $seq->seq, undef; ok $seq->desc; ok $seq->id; # switch on features for the last 2 seqs if ($numseqs == 3) { $seqbuilder->add_unwanted_slot( grep { $_ ne 'features'; } $seqbuilder->remove_unwanted_slots ); } } is $numseqs, 5; # skip sequences less than 100bp or accession like 'NT_*' $seqio = Bio::SeqIO->new('-file' => test_input_file('test.genbank'), '-format' => 'GenBank'); ok $seqio; $seqio->verbose($verbosity); $seqbuilder = $seqio->sequence_builder; # we could have as well combined the two conditions into one, but we want to # test the implicit AND here $seqbuilder->add_object_condition(sub { my $h = shift; return 0 if($h->{'-length'} < 100); return 1; }); $seqbuilder->add_object_condition(sub { my $h = shift; return 0 if($h->{'-display_id'} =~ /^NT_/); return 1; }); $numseqs = 0; my $i = 0; while ($seq = $seqio->next_seq) { $numseqs++; is $seq->accession_number, $loci[$i]; ok scalar($seq->annotation->get_Annotations); is 
scalar($seq->top_SeqFeatures), $numfeas[$i]; ok $seq->species->binomial; ok $seq->seq; ok $seq->desc; ok $seq->id; $i += 2; } is $numseqs, 3; Splicedseq.t100644000766000024 1067014006427353 16750 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 27); use_ok('Bio::Seq'); use_ok('Bio::SeqIO'); } ok my $str = Bio::SeqIO->new(-file => test_input_file('U58726.gb'), -format => 'GenBank'); my $seq; ok ( $seq = $str->next_seq() ); # Here is a cute way to verify the sequence by seeing if # the translation matches what is annotated in the file -js foreach my $ft ( grep { $_->primary_tag eq 'CDS'} $seq->top_SeqFeatures ) { if( $ft->has_tag('translation') ) { my ($translation) = $ft->get_tag_values('translation'); my $t = $ft->spliced_seq(-nosort => 1); my $pepseq = $t->translate()->seq(); chop($pepseq); # chop is to remove stop codon is($translation, $pepseq); } } my $stream = Bio::SeqIO->new(-file => test_input_file('M12730.gb'), -format => 'genbank'); # Jump down to M12730 which lists CDS join(1959..2355,1..92) while ($seq->accession ne "M12730") { $seq = $stream->next_seq; } ok(my @features = $seq->get_SeqFeatures(), "get_SeqFeatures()"); my $feat; foreach my $feat2 ( @features ) { next unless ($feat2->primary_tag eq "CDS"); my @db_xrefs = $feat2->get_tag_values("db_xref"); if (grep { $_ eq "GI:150830" } @db_xrefs) { $feat = $feat2; last; } } my ($protein_seq) = $feat->get_tag_values("translation"); like($protein_seq, qr(^MKERYGTVYKGSQRLIDE.*ANEKQENALYLIIILSRTSIT$), "protein sequence"); my ($nucleotide_seq) = $feat->spliced_seq(-nosort => 1)->seq; like($nucleotide_seq, qr(^ATGAAAGAAAGATATGGA.*TCAAGGACTAGTATAACATAA$), "nucleotide sequence - correct CDS range"); is(length($nucleotide_seq), 489, "nucleotide length"); # Test for Fix spliced seq #72 my $str2 = Bio::SeqIO->new(-file => test_input_file('AF032047.gbk'), -format => 'GenBank'); my @feats =
$str2-> next_seq -> get_SeqFeatures; # feat[1] has 2 exons from remote sequence AF032048.1 my $len_nodb; warnings_like { $len_nodb = length($feats[1]->spliced_seq()->seq); } [ {carped => qr/cannot get remote location for/}, {carped => qr/cannot get remote location for/} ], "appropriate warning if db not provided for remote sequence"; ok($len_nodb == 374, "correct number of Ns added if remote sequence not provided"); # Test for cut by origin features my $seq_obj = Bio::Seq->new(-display_id => 'NC_008309', -seq => 'AAAAACCCCCGGGGGTTTTT'); $seq_obj->is_circular(1); my $loc_obj = Bio::Factory::FTLocationFactory->from_string('join(16..20,1..2)'); my $cut_feat = Bio::SeqFeature::Generic->new(-primary_tag => 'CDS', -location => $loc_obj, -tag => { locus_tag => 'HS_1792', product => 'hypothetical protein', protein_id => 'YP_718205.1', } ); $seq_obj->add_SeqFeature($cut_feat); is $cut_feat->seq->seq, 'TTTTTAA', 'cut by origin sequence using $feat->seq'; is $cut_feat->spliced_seq->seq, 'TTTTTAA', 'cut by origin sequence using $feat->spliced_seq'; is $cut_feat->start, 16, 'cut by origin start using $feat->start'; is $cut_feat->end, 2, 'cut by origin end using $feat->end'; is $cut_feat->location->start, 16, 'cut by origin start using $feat->location->start'; is $cut_feat->location->end, 2, 'cut by origin end using $feat->location->end'; SKIP: { test_skip(-tests => 3, -requires_modules => [qw(Bio::DB::GenBank LWP::UserAgent )], -requires_networking => 1); my $db_in; eval { ok $db_in = Bio::DB::GenBank->new(); my $seq_obj = $db_in->get_Seq_by_id('AF032048.1'); }; if ($@) { print "$@\n"; skip "Warning: Problem accessing GenBank entry AF032048.1 " . 
"to test spliced_seq on remote DBs", 2; } my $len_w_db; warning_is { $len_w_db = length($feats[1]->spliced_seq(-db => $db_in)->seq) } [], "no warnings if GenBank db provided for remote sequence"; ok($len_w_db == 374, "correct length if remote sequence is provided") } largefasta.t100644000766000024 247414006427353 16750 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqIO# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 16); use_ok('Bio::SeqIO::largefasta'); } my $tmpfile = test_output_file(); my $seqio = Bio::SeqIO->new('-format' => 'largefasta', '-file' => test_input_file('genomic-seq.fasta'), ); isa_ok($seqio, 'Bio::SeqIO'); my $pseq = $seqio->next_seq(); $pseq->alphabet('dna'); $pseq->desc('this is my description');; my $plength = $pseq->length(); my $last_3 = $pseq->subseq($plength-3,$plength); is defined $pseq, 1; is $plength > 0, 1; is length($pseq->subseq(100, 299)), 200; is $pseq->trunc(100,199)->length(), 100; is $pseq->alphabet(), 'dna'; is $pseq->display_id(), 'HSBA536C5'; is $pseq->accession_number(), 'unknown'; is $pseq->desc, 'this is my description'; is open(OUT, '>', $tmpfile), 1; my $seqout = Bio::SeqIO->new('-format' => 'largefasta', '-fh' => \*OUT ); is defined $seqout, 1; is $seqout->write_seq($pseq), 1; $seqout->close(); close(OUT); my $seqin = Bio::SeqIO->new('-format' => 'largefasta', '-file' => $tmpfile); my $pseq2 = $seqin->next_seq; is ($plength, $pseq2->length()); is ($pseq->display_id(), $pseq2->display_id()); is ($pseq->desc(), $pseq2->desc()); LocatableSeq.t100644000766000024 2047314006427353 16765 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Seq# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 119); use_ok('Bio::LocatableSeq'); use_ok('Bio::AlignIO'); } my ($str, $aln, $seq, $loc); # basic tests ok $seq = Bio::LocatableSeq->new( -seq => '--atg---gta--', -strand => 1, -alphabet => 'dna' ); is 
$seq->alphabet, 'dna'; is $seq->start, 1; is $seq->end, 6; is $seq->strand, 1; is $seq->num_gaps, 1; is $seq->column_from_residue_number(4), 9; is $seq->column_from_residue_number(3), 5; ok $loc = $seq->location_from_column(4); isa_ok $loc,'Bio::Location::Simple'; is $loc->to_FTstring, 2; ok $loc = $seq->location_from_column(6); isa_ok $loc,'Bio::Location::Simple'; is $loc->start, 3; is $loc->location_type, 'IN-BETWEEN'; is $loc->to_FTstring, '3^4'; is $loc = $seq->location_from_column(2), undef; TODO: { local $TODO = "Need to fix columns before start of seq w/ start > 1"; $seq->start(90); is $loc = $seq->location_from_column(2), undef; } $str = Bio::AlignIO->new(-file=> test_input_file('testaln.pfam')); ok defined($str); isa_ok $str,'Bio::AlignIO'; $aln = $str->next_aln(); ok $seq = $aln->get_seq_by_pos(1); is ref($seq), 'Bio::LocatableSeq'; is $seq->get_nse, '1433_LYCES/9-246'; is $seq->id, '1433_LYCES'; # test invalid sequence throws_ok{ $seq = Bio::LocatableSeq->new( -seq => '//!\\' ) } qr/.+/; # test revcom and trunc $seq = Bio::LocatableSeq->new( -seq => '--atg---gta--', -strand => 1, -alphabet => 'dna' ); my $seq2 = $seq->trunc(1,9); is $seq2->seq, '--atg---g'; is $seq2->start, 1; is $seq2->end, 4; is $seq2->strand, $seq->strand; $seq2 = $seq->trunc(3,8); is $seq2->seq, 'atg---'; is $seq2->start, 1; is $seq2->end, 3; is $seq->strand(-1), -1; is $seq->start, 1; is $seq->end, 6; $seq2 = $seq->trunc(3,8); is $seq2->seq, 'atg---'; is $seq2->start, 4; is $seq2->end, 6; $seq2 = $seq->revcom(); is $seq2->seq, '--tac---cat--'; is $seq2->start, $seq->start; is $seq2->end, $seq->end; is $seq2->strand, $seq->strand * -1; is $seq2->column_from_residue_number(4), 9; is $seq2->column_from_residue_number(3), 5; # test column-mapping for -1 strand sequence $seq = Bio::LocatableSeq->new( -seq => '--atg---gtaa-', -strand => -1, -alphabet => 'dna' ); is $seq->column_from_residue_number(5),5; is $seq->column_from_residue_number(4),9; ok $loc = $seq->location_from_column(4); 
isa_ok $loc,'Bio::Location::Simple'; is $loc->to_FTstring, 6; ok $loc = $seq->location_from_column(6); isa_ok $loc,'Bio::Location::Simple'; is $loc->start, 4; is $loc->location_type, 'IN-BETWEEN'; is $loc->to_FTstring, '4^5'; # more tests for trunc() with strand -1 ok $seq = Bio::LocatableSeq->new( -seq => '--atg---gta--', -strand => -1, -alphabet => 'dna' ); is $seq->alphabet, 'dna'; is $seq->start, 1; is $seq->end, 6; is $seq->strand, -1; is $seq->num_gaps, 1; is $seq->column_from_residue_number(4), 5; ok $seq2 = $seq->trunc(1,9); is $seq2->seq, '--atg---g'; is $seq2->start, 3; is $seq2->end, 6; is $seq2->strand, $seq->strand; is $seq->location_from_column(3)->start, 6; is $seq->location_from_column(11)->start, 1; is $seq->location_from_column(9)->start, 3; ok $seq2 = $seq->trunc(7,12); is $seq2->seq, '--gta-'; is $seq2->start, 1; is $seq2->end, 3; ok $seq2 = $seq->trunc(2,6); is $seq2->seq, '-atg-'; is $seq2->start, 4; is $seq2->end, 6; ok $seq2 = $seq->trunc(4,7); is $seq2->seq, 'tg--'; is $seq2->start, 4; is $seq2->end, 5; ok $seq = Bio::LocatableSeq->new(); is $seq->seq, undef; is $seq->start, undef; is $seq->end, undef; my $nse; eval{$nse = $seq->get_nse}; ok($@); is ($nse, undef); $seq->force_nse(1); eval{$nse = $seq->get_nse}; ok(!$@); is ($nse, '/0-0'); # test mapping # mapping only supported for 1 => 1, 3 => 1, or 1 => 3 mapping relationships eval{$seq = Bio::LocatableSeq->new( -mapping => [40 => 2], );}; ok($@); like($@, qr/Mapping values other than 1 or 3 are not currently supported/); eval{$seq = Bio::LocatableSeq->new( -mapping => [3 => 3], );}; ok($@); # sequence is translated to protein, retains original DNA coordinates # mapping is 1 residue for every 3 coordinate positions $seq = Bio::LocatableSeq->new( -seq => 'KKKAIDLVGVDKARENRQAIYLGASAIAEF', -strand => -1, -mapping => [1 => 3], -start => 1, -end => 90, -alphabet => 'dna' ); is $seq->seq, 'KKKAIDLVGVDKARENRQAIYLGASAIAEF'; is $seq->start, 1; is $seq->end, 90; # sequence is reverse-translated to 
DNA, retains original protein coordinates # mapping is 3 residues for every 1 coordinate positions $seq = Bio::LocatableSeq->new( -seq => 'aaraaraargcnathgayytngtnggngtngayaargcnmgngaraaymgncargcnathtayytnggngcnwsngcnathgcngartty', -strand => -1, -mapping => [3 => 1], -start => 1, -end => 30, -alphabet => 'protein' ); is $seq->seq, 'aaraaraargcnathgayytngtnggngtngayaargcnmgngaraaymgncargcnathtayytnggngcnwsngcnathgcngartty'; is $seq->start, 1; is $seq->end, 30; # frameshifts (FASTA-like) # support for this is preliminary # this is a real example from a TFASTY report $seq = Bio::LocatableSeq->new( -seq => 'MGSSSTDRELLSAADVGRTVSRIAHQIIEKTALDDPAERTRVVLLGIPTRGVILATRLAAKIKEFAGEDVPHGALDITLYRDDLNFKPPRPLEATSIPAF\GGVDDAIVILVDDVLYSGRSVRSALDALRDIGRPRIVQLAVLVDRGHRELPI--/DYVGKNVPTSRSESVHVLLSEHDDRDGVVISK', -strand => 1, -mapping => [1 => 3], -start => 1, -end => 552, -frameshifts => { # position, frameshift 298 => -1, 455 => 1 }, -alphabet => 'dna' ); is $seq->seq, 'MGSSSTDRELLSAADVGRTVSRIAHQIIEKTALDDPAERTRVVLLGIPTRGVILATRLAAKIKEFAGEDVPHGALDITLYRDDLNFKPPRPLEATSIPAF\GGVDDAIVILVDDVLYSGRSVRSALDALRDIGRPRIVQLAVLVDRGHRELPI--/DYVGKNVPTSRSESVHVLLSEHDDRDGVVISK'; is $seq->start, 1; is $seq->end, 552; $seq->verbose(2); eval { $seq->end(554);}; ok $@; like $@, qr/Overriding value \[554\] with value 552/; lives_ok { $seq = Bio::LocatableSeq->new( -seq => 'LSYC*', -strand => 0, -start => 1, -end => 5, -verbose => 2 );} '* is counted in length'; throws_ok { $seq = Bio::LocatableSeq->new( -seq => 'LSYC*', -strand => 0, -start => 1, -end => 6, -verbose => 2 );} qr/Overriding value \[6\] with value 5/, '* is counted in length, but end is wrong'; # setting symbols (class variables) - demonstrate scoping issues when using # globals with and w/o localization. 
To be fixed in a future BioPerl version # see bug 2715 my $temp; { $temp = $Bio::LocatableSeq::GAP_SYMBOLS; $Bio::LocatableSeq::GAP_SYMBOLS = '-\?'; $seq = Bio::LocatableSeq->new( -seq => '??atg-?-gta-?', -strand => 1, -start => 10, -end => 15, -alphabet => 'dna', ); is $Bio::LocatableSeq::GAP_SYMBOLS, '-\?'; is $seq->start, 10; is $seq->end, 15; } is $Bio::LocatableSeq::GAP_SYMBOLS, '-\?'; is $seq->end(15), 15; $Bio::LocatableSeq::GAP_SYMBOLS = $temp; is $Bio::LocatableSeq::GAP_SYMBOLS, '\-\.=~'; { local $Bio::LocatableSeq::GAP_SYMBOLS = '-\?'; $seq = Bio::LocatableSeq->new( -seq => '??atg-?-gta-?', -strand => 1, -start => 10, -end => 15, -alphabet => 'dna', ); is $Bio::LocatableSeq::GAP_SYMBOLS, '-\?'; is $seq->start, 10; is $seq->end, 15; } is $seq->end, 15; # note, recalling the end() method uses old $GAP_SYMBOLS, which # no longer are set (this argues for locally set symbols) TODO: { local $TODO = 'Bio::LocatableSeq global variables have scoping issues'; is $Bio::LocatableSeq::GAP_SYMBOLS, '-\?'; # this should be 15 isnt $seq->end(19), 19; } Registry.t100644000766000024 526214006427353 16725 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/LocalDB# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 14); $ENV{OBDA_SEARCH_PATH} = 't/data/registry/flat;t/data/registry/bdb'; use_ok('Bio::DB::Registry'); use_ok('Bio::DB::Flat'); } # we need a temp directory t/tmp since t/tmp is specified in the registry files my $tmpdir = File::Spec->catfile(qw(t tmp)); mkdir($tmpdir,0777); SKIP: { skip "unable to create temp dir '$tmpdir', skipping tests", 12 unless -d $tmpdir; my $flat = Bio::DB::Flat->new(-directory => $tmpdir, -dbname => 'testflat', -format => 'fasta', -index => 'binarysearch', -write_flag => 1 ); my $entries = $flat->build_index(test_input_file('cysprot.fa')); is $entries, 7; SKIP: { test_skip(-tests => 2, -requires_modules => [qw(DB_File)]); my $bdb = Bio::DB::Flat->new(-directory => $tmpdir, 
-dbname => 'testbdb', -format => 'fasta', -index => 'bdb', -write_flag => 1 ); ok defined($bdb); $entries = $bdb->build_index(test_input_file('cysprot.fa')); is $entries, 7; } SKIP: { test_skip(-tests => 9, -requires_modules => [qw(LWP::UserAgent HTTP::Request::Common)], -requires_networking => 1); my $registry = Bio::DB::Registry->new(); ok defined($registry); my @available_services = $registry->services; ok grep /testflat/,@available_services; my $db = $registry->get_database('testflat'); ok defined($db); my $seq = $db->get_Seq_by_id("ALEU_HORVU"); ok defined($seq); my $sequence = $seq->seq; is $sequence, "MAHARVLLLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAIATCASYPVVAA"; SKIP: { test_skip(-tests => 4, -requires_modules => [qw(DB_File)]); ok grep /testbdb/,@available_services; $db = $registry->get_database('testbdb'); ok defined($db); $seq = $db->get_Seq_by_id("ALEU_HORVU"); ok defined($seq); $sequence = $seq->seq; is $sequence, "MAHARVLLLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYRLGINRFSDMSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYPYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAIATCASYPVVAA"; } } } END { File::Path::rmtree($tmpdir) if ($tmpdir && (-d $tmpdir)); } clustalw.t100644000766000024 254014006427353 16771 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/AlignIO# -*-Perl-*- Test Harness script for Bioperl # $Id: clustalw.t 14971 2008-10-28 16:08:52Z cjfields $ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 6); use_ok('Bio::AlignIO::clustalw'); } my 
$DEBUG = test_debug(); my ($str,$aln,$strout,$status); # CLUSTAL my $io = Bio::AlignIO->new( -file => test_input_file("testaln.clustalw") ); $aln = $io->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is $aln->consensus_string, "MNEGEHQIKLDELFEKLLRARKIFKNKDVLRHSWEPKDLPHRHEQIEA". "LAQILVPVLRGETMKIIFCGHHACELGEDRGTKGFVIDELKDVDEDRNGKVDVIEINCEHMDTHYRVLPNIAKLF". "DDCTGIGVPMHGGPTDEVTAKLKQVIDMKERFVIIVLDEIDKLVKKSGDEVLYSLTRINTELKRAKVSVIGISND". "LKFKEYLDPRVLSSLSEEEVVFPPYDANQLRDILTQRAEEAFYPGVLDEGVIPLCAALAAREHGDARKALDLLRV". "AGEIAEREGASKVTEKHVWKAQEKIEQDMMEEVIKTLPLQSKVLLYAIVLLDENGDLPANTGDVYAVYRELCEYI". "DLEPLTQRRISDLINELDMLGIINAKVVSKGRYGRTKEIRLMVTSYKIRNVLRYDYSIQPLLTISLKSEQRRLI", "clustalw consensus_string test"; my $outfile = test_output_file(); $strout = Bio::AlignIO->new( '-file' => ">$outfile", '-format' => 'clustalw'); $status = $strout->write_aln($aln); is $status, 1, "clustalw output test"; undef $strout; $str = Bio::AlignIO->new( '-file'=> $outfile, '-format' => 'clustalw'); $aln = $str->next_aln($aln); isa_ok($aln,'Bio::Align::AlignI'); is $aln->get_seq_by_pos(1)->get_nse, 'P84139/1-420', "clustalw input test"; gmap_f9.t100644000766000024 1141614006427353 16652 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SearchIO# -*-Perl-*- Test Harness script for Bioperl # $Id: gmap_f9.t 14995 2008-11-16 06:20:00Z cjfields $ use strict; use warnings; BEGIN { use Bio::Root::Test; test_begin(-tests => 54); use_ok('Bio::SearchIO'); } my $searchio = Bio::SearchIO->new(-format => 'gmap_f9', -file => test_input_file('gmap_f9.txt')); my $result = $searchio->next_result; isa_ok($result, 'Bio::Search::Result::GenericResult', 'Did we get a Result?'); is($result->num_hits(), 1, 'Did we get the expected number of hits?'); is($result->algorithm(), 'gmap', 'Did we get the expected algorithm?'); is($result->query_name(), 'NM_004448', 'Did we get the expected query_name?'); my $hit = $result->next_hit; isa_ok($hit, 'Bio::Search::Hit::GenericHit', 'Did we get a Hit?'); _check_hit($hit, {name => 
'17', length => 4624, num_hsps => 27, query_length => 4623 } ); my $hsp = $hit->next_hsp; _check_hsp($hsp, {algorithm => 'GMAP', query_gaps => 1, hit_gaps => 0, query_length => 310, hit_length => 311, qseq => 'GGAGGAGGTGGAGGAGGAGG', # first 20 bases hseq => 'GGAGGAGGTGGAGGAGGAGG', # ditto query => {start => 1, end => 310, strand => 1}, hit => {start => 35109780, end => 35110090, strand => 1}, homology_string => 'GGAGGAGGTGGAGGAGGAGG', seq_inds_query_gap => [(61)] } ); my $searchio_rev = Bio::SearchIO->new(-format => 'gmap_f9', -file => test_input_file('gmap_f9-reverse-strand.txt')); my $result_rev = $searchio_rev->next_result; isa_ok($result_rev, 'Bio::Search::Result::GenericResult', 'Did we get a Result?'); is($result_rev->num_hits(), 1, 'Did we get the expected number of hits?'); is($result_rev->algorithm(), 'gmap', 'Did we get the expected algorithm?'); is($result_rev->query_name(), 'NM_004448', 'Did we get the expected query_name?'); $hit = $result_rev->next_hit; _check_hit($hit, {name => '17', length => 4624, num_hsps => 27, query_length => 4623 } ); $hsp = $hit->next_hsp; _check_hsp($hsp, {algorithm => 'GMAP', query_gaps => 0, hit_gaps => 0, query_length => 974, hit_length => 974, qseq => 'TAGCTGTTTTCCAAAATATA', # first 20 bases hseq => 'TAGCTGTTTTCCAAAATATA', # ditto query => {start => 1, end => 974, strand => 1}, hit => {start => 35137468, end => 35138441, strand => -1}, homology_string => 'TAGCTGTTTTCCAAAATATA', seq_inds_query_gap => [()] } ); $searchio = Bio::SearchIO->new(-format => 'gmap_f9', -file => test_input_file('gmap_f9-multiple_results.txt')); my $result_count = 0; while (my $result = $searchio->next_result) { $result_count++; } is($result_count, 58, "Can we loop over multiple results properly (expecting 58)?"); # bug 3021 $searchio = Bio::SearchIO->new(-format => 'gmap_f9', -file => test_input_file('bug3021.gmap')); $result = $searchio->next_result; is($result->query_name, 'NM_004448', 'simple query_name now caught, bug 3021'); exit(0); sub 
_check_hit { my ($hit, $info) = @_; isa_ok($hit, 'Bio::Search::Hit::HitI'); is($hit->name, $info->{name}, 'Check the name'); is($hit->length, $info->{length}, 'Check the hit length'); is($hit->num_hsps, $info->{num_hsps}, 'Check the number of hsps'); is($hit->query_length, $info->{query_length}, 'Check the query length'); } sub _check_hsp { my($hsp, $info) = @_; isa_ok($hsp, 'Bio::Search::HSP::HSPI'); is($hsp->algorithm, $info->{algorithm}, 'Check the algorithm'); is($hsp->gaps('query'), $info->{query_gaps}, 'Count gaps in the query'); is($hsp->gaps('hit'), $info->{hit_gaps}, 'Count gaps in the hit'); is($hsp->length('query'), $info->{query_length}, 'Length of the query'); is($hsp->length('hit'), $info->{hit_length}, 'Length of the hit'); is(substr($hsp->query_string, 0, 20), $info->{qseq}, 'Query sequence'); is(substr($hsp->hit_string, 0, 20), $info->{hseq}, 'Hit sequence'); is($hsp->query->start, $info->{query}->{start}, "Check query start"); is($hsp->query->end, $info->{query}->{end}, "Check query end"); is($hsp->query->strand, $info->{query}->{strand}, "Check query strand"); is(substr($hsp->homology_string, 0, 20), $info->{homology_string}, 'Check the homology string'); is_deeply([$hsp->seq_inds('query', 'gap')], $info->{seq_inds_query_gap}, 'Check seq_inds'); is($hsp->hit->start, $info->{hit}->{start}, "Check hit start"); is($hsp->hit->end, $info->{hit}->{end}, "Check hit end"); is($hsp->hit->strand, $info->{hit}->{strand}, "Check hit strand"); } Range.t100644000766000024 670014006427353 16753 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqFeature# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 49); use_ok('Bio::Range'); } my $range = Bio::Range->new(-start=>10, -end=>20, -strand=>1); isa_ok($range,'Bio::Range', 'BioRange object'); is($range->strand, 1); my $range2 = Bio::Range->new(-start=>15, -end=>25, -strand=>1); isa_ok($range2,'Bio::Range', 'BioRange object'); is($range2->strand, 1); my $r =
Bio::Range->new(); is ( $r->strand(0), 0 ) ; is ( $r->start(27), 27 ); is ( $r->end(28), 28 ) ; ok(! defined $r->intersection($range2)); $r = $range->union($range2); is($r->start, 10); is($r->end, 25); $r = $range->intersection($range2); is ( $r->start, 15 ) ; is ( $r->end, 20 ); is ( $r->strand, 1 ); # intersection and union can also take lists my $range3 = Bio::Range->new(-start=>18,-end=>30); isa_ok($range3,'Bio::Range', 'BioRange object'); $r = $range->intersection([$range2, $range3]); ok( ( $r->start == 18 ) && ( $r->end == 20 )); $r = Bio::Range->intersection([$range, $range2, $range3]); ok($r->start == 18 && $r->end == 20); $r = $range->union($range2, $range3); ok( ( $r->start == 10 ) && ( $r->end == 30 ) ); $r = Bio::Range->union($range, $range2, $range3); ok( ( $r->start == 10 ) && ( $r->end == 30 ) ); $range3->start(21); ok (! $range->intersection([$range2, $range3])); ok (! $range->contains($range2)); ok (! $range2->contains($range)); ok ($range->overlaps($range2)); ok ($range2->overlaps($range)); # testing strand $range3 = Bio::Range->new(-start => 15, -end => 25, -strand => 1); my $range4 = Bio::Range->new(-start => 15, -end => 25, -strand => -1); isa_ok($range4,'Bio::Range', 'BioRange object'); my $range5 = Bio::Range->new(-start => 15, -end => 25, -strand => 0); isa_ok($range5,'Bio::Range', 'BioRange object'); my $range6 = Bio::Range->new(-start => 20, -end => 30, -strand => -1); isa_ok($range6,'Bio::Range', 'BioRange object'); ok $range3->_ignore($range4), ' 1 & -1' ; ok $range3->_weak($range3),' 1 & 1 true' ; ok $range3->_weak($range5), ' 1 & 0 true' ; ok (! $range3->_weak($range4), ' 1 & -1 false' ); ok $range3->_strong($range3), ' 1 & 1 true' ; ok (! $range3->_strong($range5), ' 1 & 0 false' ); ok (! $range3->_strong($range4), ' 1 & -1 false' ); ok ! ( $range3->overlaps($range4,'weak')); ok ! ( $range4->overlaps($range3,'weak')); ok ! ( $range3->overlaps($range4,'strong')); ok ! 
( $range4->overlaps($range3,'strong')); $range3->strand(0); ok ( $range3->overlaps($range4,'weak')); ok ( $range4->overlaps($range3,'weak')); ok ! ( $range3->overlaps($range4,'strong')); ok ! ( $range4->overlaps($range3,'strong')); # if strands are different then intersection() should return 0... $r = $range3->intersection($range4); is ( $r->strand, 0 ); # or if both strands are -1 then -1 should be returned $r = $range6->intersection($range4); is ( $r->strand, -1 ); # test implementation of offsetStranded: $r = Bio::Range->new(-start => 30, -end => 40, -strand => -1); isa_ok($r, 'Bio::Range', 'Bio::Range object') ; is ($r->offsetStranded(-5,10)->toString, '(20, 45) strand=-1'); is ($r->offsetStranded(+5,-10)->toString, '(30, 40) strand=-1'); $r->strand(1); is ($r->offsetStranded(-5,10)->toString, '(25, 50) strand=1'); is ($r->offsetStranded(+5,-10)->toString, '(30, 40) strand=1'); Clone.t100644000766000024 503314006427353 16755 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/SeqFeature# -*-Perl-*- Test Harness script for Bioperl use strict; BEGIN { use Bio::Root::Test; use Bio::SeqFeature::Generic; use Bio::Location::Split; test_begin(-tests => 17); } my $DEBUG = test_debug(); my $orig = Bio::SeqFeature::Generic->new( -start => 40, -end => 80, -strand => 1, -primary => 'exon', -source => 'internal', -tag => { silly => 20, new => 1 } ); # ---------- # Verify simple attributes work and are independent of each other ok(my $clone = $orig->clone(), 'clone()'); ok($clone->start(140), 'start() clone set'); is($clone->start(), 140, 'start() clone get'); is($orig->start(), 40, 'start() original unchanged'); # ---------- # Verify that arguments passed into clone() are applied to the cloned object # and that the attributes are still independent. 
ok($clone = $orig->clone(-start => 150, -end => 157), 'clone() with arguments'); is($orig->start(), 40, 'start() orig get'); is($orig->end(), 80, 'end() orig get'); is($clone->start(), 150, 'start() clone get'); is($clone->end(), 157, 'end() clone get'); ok($clone->start(140), 'start() clone set'); is($clone->start(), 140, 'start() clone get'); is($orig->start(), 40, 'start() original unchanged'); # ---------- # Verify that object attributes can be cloned, and are independent after cloning my $splitlocation = Bio::Location::Split->new(); $splitlocation->add_sub_Location(Bio::Location::Simple->new( -start=>1, -end=>30, -strand=>1 )); $splitlocation->add_sub_Location(Bio::Location::Simple->new( -start=>50, -end=>61, -strand=>1 )); ok($orig->location($splitlocation), 'location() Bio::Location::Split'); ok($clone = $orig->clone(), 'clone()'); ok(($clone->location->sub_Location())[1]->start(51), 'start() clone set'); is(($clone->location->sub_Location())[1]->start, 51, 'start() clone get'); is(($orig->location->sub_Location())[1]->start, 50, 'start() original unchanged'); AlignStats.t100644000766000024 1170214006427353 16774 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/Align# -*-Perl-*- Test Harness script for Bioperl # $Id$ use strict; BEGIN { use Bio::Root::Test; test_begin(-tests => 45); use_ok('Bio::Align::DNAStatistics'); use_ok('Bio::Align::ProteinStatistics'); use_ok('Bio::AlignIO'); } my $debug = test_debug(); my $in = Bio::AlignIO->new(-format => 'emboss', -file => test_input_file('insulin.water')); my $aln = $in->next_aln(); isa_ok($aln, 'Bio::Align::AlignI'); my $stats = Bio::Align::DNAStatistics->new(-verbose => $debug); is( $stats->transversions($aln),4); is( $stats->transitions($aln),9); is( $stats->pairwise_stats->number_of_gaps($aln),21); is( $stats->pairwise_stats->number_of_comparable_bases($aln),173); is( $stats->pairwise_stats->number_of_differences($aln),13); is( $stats->pairwise_stats->score_nuc($aln), 224); is( 
$stats->pairwise_stats->score_nuc( -aln => $aln, -match => 1, -mismatch => -1, -gap_open => -1, -gap_ext => -1), 126); my $d = $stats->distance(-align => $aln, -method=> 'f81'); is( $d->get_entry('hs_insulin','seq2'), '0.07918'); $d = $stats->distance(-align=> $aln, -method => 'JC'); is( $d->get_entry('hs_insulin','seq2'), '0.07918'); $d = $stats->distance(-align=> $aln, -method => 'Kimura'); is( $d->get_entry('hs_insulin','seq2'), '0.07984'); $d = $stats->distance(-align=> $aln, -method => 'TajimaNei'); is( $d->get_entry('seq2','hs_insulin'), '0.08106'); $d = $stats->distance(-align=> $aln, -method => 'Tamura'); is( $d->get_entry('seq2','hs_insulin'), '0.08037'); #$d = $stats->distance(-align => $aln, # -method => 'JinNei'); #is( $d->get_entry('seq2','hs_insulin'), 0.0850); $in = Bio::AlignIO->new(-format => 'clustalw', -file => test_input_file('hs_owlmonkey.aln')); $aln = $in->next_aln(); isa_ok($aln,'Bio::Align::AlignI'); is( $stats->transversions($aln),10); is( $stats->transitions($aln),17); is( $stats->pairwise_stats->number_of_gaps($aln),19); is( $stats->pairwise_stats->number_of_comparable_bases($aln),170); is( $stats->pairwise_stats->number_of_differences($aln),27); is( $stats->pairwise_stats->score_nuc($aln), 134); is( $stats->pairwise_stats->score_nuc( -aln => $aln, -match => 1, -mismatch => -1, -gap_open => -1, -gap_ext => -1), 97); # now test the distance calculations $d = $stats->distance(-align => $aln, -method => 'jc'); is( $d->get_entry('human','owlmonkey'), 0.17847); $d = $stats->distance(-align => $aln, -method=> 'f81'); is( $d->get_entry('human','owlmonkey'), '0.17847'); $d = $stats->distance(-align => $aln, -method => 'uncorrected'); is( $d->get_entry('human','owlmonkey'), 0.15882); $d = $stats->distance(-align => $aln, -method => 'Kimura'); is( $d->get_entry('human','owlmonkey'), 0.18105); $d = $stats->distance(-align => $aln, -method => 'TajimaNei'); is( $d->get_entry('human','owlmonkey'), 0.18489); $d = $stats->distance(-align => $aln, 
-method => 'Tamura'); is( $d->get_entry('human','owlmonkey'), 0.18333); #$d = $stats->distance(-align => $aln, # -method => 'JinNei'); #is( $d->get_entry('human','owlmonkey'), 0.2079); ### now test Nei_gojobori methods, hiding the expected warnings so we can # avoid printing them ### $stats->verbose($debug ? $debug : -1); my ($alnobj, $result); $in = Bio::AlignIO->new(-format => 'fasta', -file => test_input_file('nei_gojobori_test.aln')); $alnobj = $in->next_aln(); isa_ok($alnobj,'Bio::Align::AlignI'); $result = $stats->calc_KaKs_pair($alnobj, 'seq1', 'seq2'); is (sprintf ("%.1f", $result->[0]{'S'}), 40.5); is (sprintf ("%.1f", $result->[0]{'z_score'}), '4.5'); $result = $stats->calc_all_KaKs_pairs($alnobj); is (int( $result->[1]{'S'}), 41); is (int( $result->[1]{'z_score'}), 4); $result = $stats->calc_average_KaKs($alnobj, 100); is (sprintf ("%.4f", $result->{'D_n'}), 0.1628); $stats->verbose($debug); # now test Protein Distances my $pstats = Bio::Align::ProteinStatistics->new(); $in = Bio::AlignIO->new(-format => 'clustalw', -file => test_input_file('testaln.clustalw')); $alnobj = $in->next_aln(); isa_ok($alnobj,'Bio::Align::AlignI'); $result = $pstats->distance(-method => 'Kimura', -align => $alnobj); isa_ok($result, 'Bio::Matrix::PhylipDist'); is ($result->get_entry('P84139','P814153'), '0.01443'); is ($result->get_entry('P841414','P851414'), '0.01686'); is ($result->get_entry('P84139','P851414'), '3.58352'); my $seq = Bio::Seq->new(-id=>'NOT3MUL', -seq=>'gatac'); isa_ok($seq, 'Bio::PrimarySeqI'); eval { Bio::Align::DNAStatistics->count_syn_sites($seq); }; like($@, qr/not integral number of codons/); # bug 2901 $in = Bio::AlignIO->new(-file => test_input_file('bug2901.fa'), -format => 'fasta'); $stats = Bio::Align::DNAStatistics->new(-verbose => 2); $aln = $in->next_aln(); my $matrix; throws_ok { $matrix = $stats->distance(-align=>$aln,-method=>'Uncorrected'); } qr/No distance calculated between seq3 and seq4/, "Warn if seqs don't overlap"; $stats->verbose(-1); 
$matrix = $stats->distance(-align=>$aln,-method=>'Uncorrected'); like($matrix->print_matrix, qr/-1/); GO.defs.test2100644000766000024 166014006427353 16607 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data!version: $Revision: 1.1 $ !date: Fri Sep 20 14:08:27 GMT 2002 !saved-by: gwg !autogenerated-by: DAG-Edit version 1.311 ! !Gene Ontology definitions ! term: cellular_component goid: GO:0003673 definition: The part of a cell of which a gene product is a component; for purpose of GO includes the extracellular environment of cells; a gene product may be a component of one or more parts of a cell; this term includes gene products that are parts of macromolecular complexes, by the definition that all members of a complex normally co-purify under all except extreme conditions. definition_reference: GO:curators term: ascus goid: GO:0005575 definition: A sac-like fruiting body (ascomycete fungi); contains ascospores (typically eight in number). definition_reference: ISBN:0198547684 term: lipid particle goid: GO:0018895 definition: Any particle of coalesced lipids in the cytoplasm of a cell. definition_reference: GO:mah testaln.nexus100644000766000024 105314006427353 17131 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data#NEXUS [TITLE: Four Anthropoidea] begin data; dimensions ntax=4 nchar=50; format interleave datatype=RNA missing=N gap=-; matrix 'Homo sapiens' AGUCGAGUC---GCAGAAACGCAUGAC-GAC Pan_paniscus AGUCGCGUCG--GCAGAAACGCAUGACGGAC Gorilla_gorilla AGUCGCGUCG--GCAGAUACGCAUCACGGAC Pongo_pigmaeus AGUCGCGUCGAAGCAGA--CGCAUGACGGAC 'Homo sapiens' CACAUUUU-CCUUGCAAAG Pan_paniscus CACAUCAU-CCUUGCAAAG Gorilla_gorilla -ACAUCAUCCCUCGCAGAG Pongo_pigmaeus CACAUCAUCCCUUGCAGAG ; endblock; begin assumptions; options deftype=unord; NC_001284.gbk100644000766000024 223424614006427353 16274 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS NC_001284 366924 bp DNA circular PLN 16-OCT-2003 DEFINITION Arabidopsis thaliana mitochondrion, complete genome. 
ACCESSION NC_001284 VERSION NC_001284.2 GI:26556996 KEYWORDS . SOURCE mitochondrion Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids; eurosids II; Brassicales; Brassicaceae; Arabidopsis. REFERENCE 1 AUTHORS Giege,P. and Brennicke,A. TITLE RNA editing in Arabidopsis mitochondria effects 441 C to U changes in ORFs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 96 (26), 15324-15329 (1999) MEDLINE 20079652 PUBMED 10611383 REFERENCE 2 (bases 1 to 366924) AUTHORS Unseld,M., Marienfeld,J.R., Brandt,P. and Brennicke,A. TITLE The mitochondrial genome of Arabidopsis thaliana contains 57 genes in 366,924 nucleotides JOURNAL Nat. Genet. 15 (1), 57-61 (1997) MEDLINE 97141919 PUBMED 8988169 REFERENCE 3 (bases 1 to 366924) AUTHORS Marienfeld,J., Unseld,M., Brandt,P. and Brennicke,A. TITLE Genomic recombination of the mitochondrial atp6 gene in Arabidopsis thaliana at the protein processing site creates two different presequences JOURNAL DNA Res. 3 (5), 287-290 (1996) MEDLINE 97191539 PUBMED 9039497 REFERENCE 4 (bases 1 to 366924) AUTHORS Marienfeld,J.R. TITLE Direct Submission JOURNAL Submitted (30-SEP-1996) J.R. Marienfeld, Institut fuer Genbiologische Forschung GmbH, Ihnestrasse 63, 14195 Berlin, FRG COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from Y08501 and Y08502. On Dec 12, 2002 this sequence version replaced gi:13449290. 
FEATURES Location/Qualifiers source 1..366924 /organism="Arabidopsis thaliana" /organelle="mitochondrion" /mol_type="genomic DNA" /db_xref="taxon:3702" /sub_clone="pUC19" /sub_clone="pUC18" /sub_clone="pBluescript SK" /sub_clone="pBluescript KS" /clone_lib="Lorist X" CDS complement(join(327890..328078,329735..330306, 332945..333105,79740..80132,81113..81297)) /EC_number="1.6.99.3" /codon_start=1 /exception="trans-splicing" /label=nad2_cds /product="NADH dehydrogenase subunit 2" /protein_id="NP_085584.1" /db_xref="GI:13449314" /db_xref="UniProt/TrEMBL:O05000" /translation="MKAEFVRILPHMFNLFLAVSPEIFIINATSILLIHGVVFSTSKK YDYPPLASNVGWLGLLSVLITLLLLAAGAPLLTIAHLFWNNLFRRDNFTYFCQIFLLL STAGTISMCFDSSDQERFDAFEFIVLIPLPTRGMLFMISAHDLIAMYLAIEPQSLCFY VIAASKRKSEFSTEAGSKYLILGAFSSGILLFGCSMIYGSTGATHFDQLAKILTGYEI TGARSSGIFMGILSIAVGFLFKITAVPFHMWAPDIYEGSPTPVTAFLSIAPKISISAN ILRVSIYGSYGATLQQIFFFCSIASMILGALAAMAQTKVKRPLAHSSIGHVGYIRTGF SCGTIEGIQSLLIGIFIYALMTMDAFAIVSALRQTRVKYIADLGALAKTNPISAITFS ITMFSYAGIPPLAGFCSKFYLFFAALGCGAYFLAPVGVVTSVIGRFYYIRLVKRMFFD TPRTWILYEPMDRNKSLLLAMTSFFITSSLLYPSPLFSVTHQMALSSYL" CDS complement(273..734) /note="orf153a" /codon_start=1 /protein_id="NP_085474.1" /db_xref="GI:13449291" /db_xref="UniProt/TrEMBL:P93275" /translation="MSLLFQQTVPLSHLHRSLDPPLCFRTHILLILLLLSRHLPGFTG SDCESADPSIVSAIAPGTATTSERDCPVRTAGSDPVPIGDSGTFFDVGTAAPELLSPN RHHMITRAKDGIRKPNPRYNLFTQKYTPSEPKTITSASQDGDKLCKKRCRH" promoter complement(2485..2493) /note="orf153a" /evidence=not_experimental gene complement(8848..11415) /gene="rrn26" /db_xref="GeneID:814566" rRNA complement(8848..11415) /gene="rrn26" /product="26S ribosomal RNA protein" /db_xref="GeneID:814566" CDS 11918..12241 /note="orf107a" /codon_start=1 /protein_id="NP_085475.1" /db_xref="GI:13449292" /db_xref="UniProt/TrEMBL:P93276" /translation="MFQFAKFSKSKERRLATELGYGFPIGDPWITDGISPWPFASESV LPSQCPGIHPMHSFRSCTQGTLNTTKISMKLTISDCGFEPLTEGFTVLHSTRATTCYH FLFNS" CDS complement(16844..17791) /note="orf315" /codon_start=1 /protein_id="NP_085476.1" 
/db_xref="GI:13449293" /db_xref="UniProt/TrEMBL:P93278" /translation="MTKREYNSQPEMLEGAKSIGAGAATIASAGAAIGIGNVFSSLIH SVARNPSLATTTVLVVTLTLLGGVAAFYLHSFRLKGPLKKIIYLFLVFFIAVGISLIR IKAIHLLGLALPLLVPPLVWNAIGGGGEALPSTGPNGASSYSEWFTYTSDLEDSASSG RTSSSVNQPIQREQAGPSNALPEPAASPVAQQQDHLDQPFGEGGEREARAQEHDRISA EVETITSACENLEAAMVRKAHILLHQRGVTLGDPEDVKRALQLALHDDWEHDIDDRKR HFTVLRRDFGTARCERWNPFIDELRGLGNRQVNARHYVD" CDS 16856..17251 /note="orf131" /codon_start=1 /protein_id="NP_085477.1" /db_xref="GI:13449294" /db_xref="UniProt/TrEMBL:P93277" /translation="MSGVYLTVPQAPELINERIPSFTASCSEVAPEHSEMPLTVIYIV LPVVMESQLESTLDILWIPESYSTLMQENMGFPYHGRLQILAGARDSLYLGRDAVVLL CPSLPFPTFTKWLIQVILLLSYWGSGGFG" gene complement(20571..333105) /gene="nad5" /db_xref="GeneID:814567" mRNA complement(join(20571..20717,21692..22086,190740..190761, 140724..141939,142769..142998)) /gene="nad5" /note="trans-splicing, RNA editing" /db_xref="GeneID:814567" CDS complement(join(20571..20717,21692..22086,190740..190761, 140724..141939,142769..142998)) /gene="nad5" /EC_number="1.6.99.3" /codon_start=1 /exception="trans-splicing, RNA editing" /product="NADH dehydrogenase subunit 5" /protein_id="NP_085478.1" /db_xref="GI:13449295" /db_xref="UniProt/Swiss-Prot:P29388" /db_xref="GeneID:814567" /translation="MYLLIVFLPLLGSSVAGFFGRFLGSEGSAIMTTTCVSFSSILSL IAFYEVAPGASACYLRIAPWISSEMFDASWGFLFDSPTVVMLIVVTSISSLVHLYSIS YMSEDPHSPRFMCYLSILTFFMPMLVTGDNSLQLFLGWEGVGLASYLLIHFWFTRLQA DKAATKAMLVNRVGDFGLALGISGRFTLFQTVDFSTIFARASAPRNSWISCNMRLNAI SLICILLLIGAVGKSAQIGSHTWSPDAMEGPTPVSASIHAATMVTAGVFMIARCSPLF EYPPTALIVITSAGATTSFLAATTGILQNDLKRVIAYSTCSQLGYMIFACGISNYSVS VFHLMNHAFFKALLFLSAGSVIHAMSDEQDMRKMGGLASSFPLTYAMMLIGSLSLIGF PFLTGFYSKDVILELAYTKYTISGNFAFWLGSVSVLFTSYYSFRLLFLTFLVPTNSFG RDISRCHDAPIPMAIPLILLALGSLFVGYLAKDMMIGLGTNFWANSPLVLPKNEILAE SEFAAPTITKLIPILFSTSGAFVAYNVNPVADQFQRAFQTSTFCNRLYSFFNKRWFFD QVLNDFLVRSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQSGFVYHY AFAMLLGSTLFVTFSRMWDSLSSWVDNRSSFILIVSSFYTKSSQE" exon complement(20571..20717) /gene="nad5" /number=5 
misc_feature 20623 /note="C to U RNA editing" misc_feature 20652 /note="C to U RNA editing" misc_feature 20663 /note="C to U RNA editing" misc_feature 20665 /note="C to U RNA editing" misc_feature 20686 /note="C to U RNA editing" intron complement(20718..21691) /gene="nad5" /number=4 exon complement(21692..22086) /gene="nad5" /number=4 misc_feature 21975 /note="C to U RNA editing" misc_feature 22005 /note="C to U RNA editing" misc_feature 22065 /note="C to U RNA editing" intron complement(22087..190739) /gene="nad5" /note="transpliced intron" /number=3 gene complement(23663..24235) /gene="nad9" /db_xref="GeneID:814568" CDS complement(23663..24235) /gene="nad9" /EC_number="1.6.99.3" /codon_start=1 /exception="RNA editing" /product="NADH dehydrogenase subunit 9" /protein_id="NP_085479.1" /db_xref="GI:13449296" /db_xref="UniProt/Swiss-Prot:Q95748" /db_xref="GeneID:814568" /translation="MDNQFIFKYSWETLPKKWVKKMERSEHGNRSDTNTDYLFQLLCF LKLHTYTRVQVSIDICGVDHPSRKRRFEVVYNLLSTRYNSRIRVQTSADEVTRISPVV SLFPSAGRWEREVWDMFGVSFINHPDLRRISTDYGFEGHPLRKDLPLSGYVQVRYDDP EKRVVSEPIEMTQEFRYFDFASPWEQRSDG" misc_feature 23797 /note="C to U RNA editing" misc_feature 23838 /note="C to U RNA editing" misc_feature 23908 /note="C to U RNA editing" misc_feature 23938 /note="C to U RNA editing" misc_feature 24046 /note="C to U RNA editing" misc_feature 24069 /note="C to U RNA editing" misc_feature 24144 /note="C to U RNA editing" misc_feature 24685 /note="C to U RNA editing" misc_feature 24693 /note="C to U RNA editing" misc_feature 24749 /note="C to U RNA editing" misc_feature 24801 /note="C to U RNA editing" misc_feature 25004 /note="C to U RNA editing" misc_feature 25075 /note="C to U RNA editing" gene complement(25076..25615) /gene="rpl16" /db_xref="GeneID:814569" CDS complement(25076..25615) /gene="rpl16" /codon_start=1 /exception="RNA editing" /product="ribosomal protein L16" /protein_id="NP_085480.1" /db_xref="GI:13449297" /db_xref="UniProt/Swiss-Prot:Q95747" /db_xref="GeneID:814569" 
/translation="MYLTIKSIMLLRKYLLVTESQVSKCGFHIVKKKGDVLYPKRTKY SKYRKGRCSRGCKPDGTKLGFGRYGTKSCKAGRLSYRAIEAARRAIIGHFHRAMSGQF RRNGKIWVRVFADLPITGKPTEVRMGRGKGNPTGWIARVSTGQIPFEMDGVSLANARQ AATLAAHKPCSSTKFVQWS" misc_feature 25104 /note="C to U RNA editing" misc_feature 25110 /note="C to U RNA editing" misc_feature 25176 /note="C to U RNA editing" misc_feature 25387 /note="C to U RNA editing" misc_feature 25407 /note="C to U RNA editing" gene complement(25482..28733) /gene="rps3" /db_xref="GeneID:814570" mRNA complement(join(25482..27077,28659..28733)) /gene="rps3" /db_xref="GeneID:814570" CDS complement(join(25482..27077,28659..28733)) /gene="rps3" /codon_start=1 /exception="RNA editing" /product="ribosomal protein S3" /protein_id="NP_085481.1" /db_xref="GI:13449298" /db_xref="UniProt/TrEMBL:Q95749" /db_xref="GeneID:814570" /translation="MARKGNPISVRLGKNRSSDSSRFSEYYYGKFVYQDVNLRSYFGS IRPPTRLTFGFRLGRCIILHFPKRTFIHFFLPRRPRRLKRREKTRPGKEKGRWWTTFG KAGPIECLHSSDDTEEERNEVRGRGARKRVESIRLDDRKKQNEIRGWPKKKQRYGYHD RLPSIKKNLSKSLRISGAFKHPKYAGVVNDIAFLIENDDSFKKTKLFKLFFQNKSRSD GPTSYLRTLPAVRPSLNFLVMQYFFNTKNQINFDPVVVLNHFVAPGAAEPSTMGRANA QGRSLQKRIRSRIAFFVESSTSEKKCLAEAKNRLTHFIRLANDLRFAGTTKTTISLFP FFGATFFFLRDGVGVYNNLDAREQLLNQLRVKCWNLVGKDKIMELIEKLKNLGGIEEL IKVIDMMIEIILRKRGIPYRYNSYFYEVKKMRSFLSNRTNTKTLIESVKIKSVYQSAS PIAQDISFQLKNKRRSFHSIFAKIVKEIPKGVEGIRICFSGRLKDAAEKAQTKCYKHR KTSRNVFNHKIDYAPAEVSTRYGISGVKVWISYSQKKGRRAISETYEI" exon complement(25482..27077) /gene="rps3" /number=2 misc_feature 25555 /note="C to U RNA editing" misc_feature 25573 /note="C to U RNA editing" misc_feature 25582 /note="C to U RNA editing" misc_feature 25619 /note="C to U RNA editing" misc_feature 25683 /note="C to U RNA editing" misc_feature 25801 /note="C to U RNA editing" misc_feature 25809 /note="C to U RNA editing" misc_feature 26266 /note="C to U RNA editing" misc_feature 26550 /note="C to U RNA editing" misc_feature 26638 /note="C to U RNA editing" misc_feature 26966 /note="C to U RNA editing" misc_feature 27027 /note="C to U RNA 
editing" intron complement(27078..28658) /gene="rps3" /number=1 exon complement(28659..28733) /gene="rps3" /number=1 misc_feature 28670 /note="C to U RNA editing" gene complement(28898..28970) /gene="tRNA-Lys" /db_xref="GeneID:814571" tRNA complement(28898..28970) /gene="tRNA-Lys" /product="tRNA-Lys" /db_xref="GeneID:814571" gene 30463..31083 /gene="ccb206" /db_xref="GeneID:814572" CDS 30463..31083 /gene="ccb206" /codon_start=1 /exception="RNA editing" /product="cytochrome c biogenesis orf206" /protein_id="NP_085482.1" /db_xref="GI:13449299" /db_xref="UniProt/TrEMBL:P93280" /db_xref="GeneID:814572" /translation="MRRLFLELYHKLIFSSTPITSFSSFLSYIVVTPLMLGFEKDFSC HSHLGPIRIPPLFPFPPAPFPRNEKEDGTLELYYLSTYCLPKILLLQLVGHRVIQISR VFCGFPMLQLSYQFGRSGMDRLNIPLGSLVLTLLCGIHSRSALGITSSSGWNSSQNPT TSPTLLPLTVSRTSIETEWFHVLSSIGYSSLFVSLFPISVSISLQD" misc_feature 30478 /gene="ccb206" /note="C to U RNA editing" misc_feature 30490 /gene="ccb206" /note="C to U RNA editing" misc_feature 30533 /gene="ccb206" /note="C to U RNA editing" misc_feature 30537 /gene="ccb206" /note="C to U RNA editing" misc_feature 30540 /gene="ccb206" /note="C to U RNA editing" misc_feature 30542 /gene="ccb206" /note="C to U RNA editing" misc_feature 30590 /gene="ccb206" /note="C to U RNA editing" misc_feature 30599 /gene="ccb206" /note="C to U RNA editing" misc_feature 30610 /gene="ccb206" /note="C to U RNA editing" misc_feature 30611 /gene="ccb206" /note="C to U RNA editing" misc_feature 30616 /gene="ccb206" /note="C to U RNA editing" misc_feature 30621 /gene="ccb206" /note="C to U RNA editing" misc_feature 30622 /gene="ccb206" /note="C to U RNA editing" misc_feature 30626 /gene="ccb206" /note="C to U RNA editing" misc_feature 30634 /gene="ccb206" /note="C to U RNA editing" misc_feature 30641 /gene="ccb206" /note="C to U RNA editing" misc_feature 30643 /gene="ccb206" /note="C to U RNA editing" misc_feature 30650 /gene="ccb206" /note="C to U RNA editing" misc_feature 30655 /gene="ccb206" /note="C to U RNA editing" 
misc_feature 30656 /gene="ccb206" /note="C to U RNA editing" misc_feature 30748 /gene="ccb206" /note="C to U RNA editing" misc_feature 30766 /gene="ccb206" /note="C to U RNA editing" misc_feature 30800 /gene="ccb206" /note="C to U RNA editing" misc_feature 30829 /gene="ccb206" /note="C to U RNA editing" misc_feature 30841 /gene="ccb206" /note="C to U RNA editing" misc_feature 30842 /gene="ccb206" /note="C to U RNA editing" misc_feature 30868 /gene="ccb206" /note="C to U RNA editing" misc_feature 30886 /gene="ccb206" /note="C to U RNA editing" misc_feature 30890 /gene="ccb206" /note="C to U RNA editing" misc_feature 30929 /gene="ccb206" /note="C to U RNA editing" misc_feature 30937 /gene="ccb206" /note="C to U RNA editing" misc_feature 30938 /gene="ccb206" /note="C to U RNA editing" misc_feature 30947 /gene="ccb206" /note="C to U RNA editing" misc_feature 30974 /gene="ccb206" /note="C to U RNA editing" misc_feature 30976 /gene="ccb206" /note="C to U RNA editing" misc_feature 31013 /gene="ccb206" /note="C to U RNA editing" misc_feature 31016 /gene="ccb206" /note="C to U RNA editing" misc_feature 31028 /gene="ccb206" /note="C to U RNA editing" misc_feature 31031 /gene="ccb206" /note="C to U RNA editing" CDS complement(32041..32472) /gene="nad5" /note="orf143" /codon_start=1 /protein_id="NP_085483.1" /db_xref="GI:13449300" /db_xref="UniProt/TrEMBL:P93281" /db_xref="GeneID:814567" /translation="MAGQCLMQEIALYELFFFSLLKTGSFGLSARMEIFFSKTAWNLI RQTYPKVDYAGIVRNKFNIPNPSHSIIAWMALNSGLQQIELADFIPTNTVTLCGLCML EDESAEHLFSAAMQGGFFLSSVRNVDMISLQPTFMMFANGL" CDS 34190..34555 /note="orf121a" /codon_start=1 /protein_id="NP_085484.1" /db_xref="GI:13449301" /db_xref="UniProt/TrEMBL:P93282" /translation="MASKIRKVTNQNMRINSSLSKSSTFSTRLRITDSYLSSPSVTEL APLTLTTGDDFTVTLSVTPTMNSLESQVICPRAYDCKERIPPNQHIVSLELTYHPASI EPTATGSPETRDPDPSAYA" CDS 34599..35102 /note="orf167" /codon_start=1 /protein_id="NP_085485.1" /db_xref="GI:13449302" /db_xref="UniProt/TrEMBL:P93283" 
/translation="MNQLDQYSQPMQHLILLWFWLLDLSPPPSFHLSVKSVDLSLFSL SPLFLLLSISSLIFSRVNKFGIRRVGYAMAPKPDPTVLPDLQEKKAILGTQIEMITQA MTTLESRVTDLQQESNDHRTWVREALDKLLKRDLGDENRPKPTTNKMIATGEQHKGEV STSLFHD" promoter 35638..35646 /note="orf116" /evidence=not_experimental CDS 35782..36132 /note="orf116" /codon_start=1 /protein_id="NP_085486.1" /db_xref="GI:13449303" /db_xref="UniProt/TrEMBL:P93284" /translation="MSRSSIGPELEVNSKPLKGPIICPIRTYYSKVPLELLFPTTDRR FYFLKSYVFCSANSVPLYLLLLTSALHFNSYILLFDFQLKSKLLAYKRRARCVAGLLK SMERYPESTVTAMI" gene complement(40502..42628) /gene="cox2" /db_xref="GeneID:814573" mRNA complement(join(40502..40585,41930..42628)) /gene="cox2" /db_xref="GeneID:814573" CDS complement(join(40502..40585,41930..42628)) /gene="cox2" /codon_start=1 /exception="RNA editing" /product="cytochrome c oxidase subunit 2" /protein_id="NP_085487.1" /db_xref="GI:13449304" /db_xref="UniProt/TrEMBL:P93285" /db_xref="GeneID:814573" /translation="MIVLKWLFLTISPCDAAEPWQLGSQDAATPIMQGIIDLHHDIFF FLILILVFVLWILVRALWHFHYKKNAIPQRIVHGTTIEILRTIFPSIISMFIAIPSFA LLYSMDEVVVDPAITIKAIGHQWYRTYEYSDYNSSDEQSLTFDSYMIPEEDLELGQSR LLEVDNRVVVPAKTHLRIIVTSADVPHSWAVPSSGVKCDAVPGRLNQISILVQREGVY YGQCSEICGTNHAFTSIVVEAVPRKDYGSRVSNQLIPQTGEA" exon complement(40502..40585) /gene="cox2" /number=2 misc_feature 40543 /note="C to U RNA editing" misc_feature 40564 /note="C to U RNA editing" intron complement(40586..41929) /gene="cox2" /number=1 exon complement(41930..42628) /gene="cox2" /number=1 misc_feature 41931 /note="C to U RNA editing" misc_feature 42048 /note="C to U RNA editing" misc_feature 42072 /note="C to U RNA editing" misc_feature 42153 /note="C to U RNA editing" misc_feature 42250 /note="C to U RNA editing" misc_feature 42351 /note="C to U RNA editing" misc_feature 42368 /note="C to U RNA editing" misc_feature 42376 /note="C to U RNA editing" misc_feature 42491 /note="C to U RNA editing" misc_feature 42558 /note="C to U RNA editing" misc_feature 42602 /note="C to U RNA editing" misc_feature 42604 /note="C to U RNA 
editing" misc_feature 42605 /note="C to U RNA editing" repeat_unit 44698..48894 /note="repeat I" promoter 46909..48917 /note="orf139a" /evidence=not_experimental CDS 48113..48532 /note="orf139a identical to orf139b" /codon_start=1 /protein_id="NP_085488.1" /db_xref="GI:13449305" /db_xref="UniProt/TrEMBL:P94024" /translation="MIQRTRNQSIMLSLPSNQSANHAILTFQPIGQSRYLLTFQPTPS IPLLQQYIISVPYLDAYSSICFPVMARIRSAKYCFFFFLVLFLNGIIATRGKAMLPTL PQKGAAFFPPKMPVPPSGPSKQHNSAPRSDFVQFFYM" gene complement(51293..53611) /gene="ccb452" /db_xref="GeneID:814574" mRNA complement(join(51293..51871,52832..53611)) /gene="ccb452" /db_xref="GeneID:814574" CDS complement(join(51293..51871,52832..53611)) /gene="ccb452" /codon_start=1 /exception="RNA editing" /product="cytochrome c biogenesis orf452" /protein_id="NP_085489.1" /db_xref="GI:13449306" /db_xref="UniProt/TrEMBL:P93286" /db_xref="GeneID:814574" /translation="MVQLHNFFFFIIFMVVPCGTAAPVLLKWFVSRDVPTGAPFSNGT IIPIPISSFPLLVYLHSRKIIRSMDGAKSGVLVRASRPILLPDIIGRSSSETRARKAL FFFVPVLHFRLLESKGDFSYLESFCGVLCLLFFRTFLFLARDRSAKRERARRRKGQTL RPNGNEQRRNDKMRCSGHPHLDLERRVEGFGPLAFPVPPELGGACVGGVPPEIGLEAL ALPRSRQLMAMAVGHDYYQKVPMKMNISHGGVCICMLGVLLSNTKKIQFTQRLPLGYE LHMGKERCCLRGLDHLHGPTFHSICGNLMIYKPSLTNDRLMFEHDESLHADLLLINFP ASYKNGKLEHFLHWWMKNRKHNNFWLTMFPEKRYFRERTSTAEVAIHTNLFTDLYASI GTGSSRTGGWYTTIMKLPFIFFIRIGFMLASLGGSPSLLRQLQKDKLRWNRESSVEFI IA" exon complement(51293..51871) /gene="ccb452" /number=2 misc_feature 51325 /note="C to U RNA editing" misc_feature 51372 /note="C to U RNA editing" misc_feature 51406 /note="C to U RNA editing" misc_feature 51437 /note="C to U RNA editing" misc_feature 51480 /note="C to U RNA editing" misc_feature 51727 /note="C to U RNA editing" intron complement(51872..52831) /gene="ccb452" /number=1 exon complement(52832..53611) /gene="ccb452" /number=1 misc_feature 53197 /note="C to U RNA editing" misc_feature 53206 /note="C to U RNA editing" misc_feature 53278 /note="C to U RNA editing" misc_feature 53437 /note="C to U RNA editing" misc_feature 
53452 /note="C to U RNA editing" misc_feature 53457 /note="C to U RNA editing" misc_feature 53466 /note="C to U RNA editing" misc_feature 53490 /note="C to U RNA editing" misc_feature 53509 /note="C to U RNA editing" misc_feature 53562 /note="C to U RNA editing" gene complement(53736..53809) /gene="tRNA-Gly" /db_xref="GeneID:814575" tRNA complement(53736..53809) /gene="tRNA-Gly" /product="tRNA-Gly" /db_xref="GeneID:814575" CDS complement(55767..56090) /gene="nad5" /note="orf107b" /codon_start=1 /protein_id="NP_085490.1" /db_xref="GI:13449307" /db_xref="UniProt/TrEMBL:P93287" /db_xref="GeneID:814567" /translation="MLNAFAFPQTNECFPAKRGFCANERTKCLNPKMPSKSMFGGSVS ENLFLSKIRIGLSFPLPLSEIKLQNQDFRLEGQMSSFDPFVDESKALVRRLGQKVKAK SFLCR" gene 57774..58331 /gene="rpl5" /db_xref="GeneID:814576" CDS 57774..58331 /gene="rpl5" /codon_start=1 /exception="RNA editing" /product="ribosomal protein L5" /protein_id="NP_085491.1" /db_xref="GI:13449308" /db_xref="UniProt/Swiss-Prot:P42793" /db_xref="GeneID:814576" /translation="MFPLNFHYEDVSRQDPLLKPNHANVMEVPGSCEIRVVPKAPYNF IIKNGKLAMEIPRGQKFIQTQRGSTGKSFRSNPFLGSNKDKGYVSDLARQSTLRGHGM SNFSVRISTVMSLLDFPVEIRKNSIQFSMETEFCEFSPELEDHFEIFEHIRGFNVTII TSANTQDETLPLWSGFLQKDEGETQ" misc_feature 57808 /gene="rpl5" /note="C to U RNA editing" misc_feature 57820 /gene="rpl5" /note="C to U RNA editing" misc_feature 57831 /gene="rpl5" /note="C to U RNA editing" misc_feature 57832 /gene="rpl5" /note="C to U RNA editing" misc_feature 57837 /gene="rpl5" /note="C to U RNA editing" misc_feature 57865 /gene="rpl5" /note="C to U RNA editing" misc_feature 57942 /gene="rpl5" /note="C to U RNA editing" misc_feature 58090 /gene="rpl5" /note="C to U RNA editing" misc_feature 58102 /gene="rpl5" /note="C to U RNA editing" misc_feature 58285 /gene="rpl5" /note="C to U RNA editing" misc_feature 58526 /note="C to U RNA editing" gene 60235..61416 /gene="cob" /db_xref="GeneID:814577" CDS 60235..61416 /gene="cob" /codon_start=1 /exception="RNA editing" /product="apocytochrome B" 
/protein_id="NP_085492.1" /db_xref="GI:13449309" /db_xref="UniProt/TrEMBL:P93289" /db_xref="GeneID:814577" /translation="MTIRNQRFSLLKQPISSTLNQHLVDYPTPSNLSYWWGFGPLAGI CLVIQIVTGVFLAMHYTPHVDLAFNSVEHIMRDVEGGWLLRYMHANGASMFLIVVYLH IFRGLYHASYSSPREFVWCLGVVIFLLMIVTAFIGYVLPWGQMSFWGATVITSLASAI PVVGDTIVTWLWGGFSVDNATLNRFFSLHHLLPFILVGASLLHLAALHQYGSNNPLGV HSEMDKIAFYPYFYVKDLVGWVAFAIFFSIWIFYAPNVLGHPDNYIPANPMSTPPHIV PEWYFLPIHAILRSIPDKAGGVAAIAPVFICLLALPFFKSMYVRSSSFRPIHQGMFWL LLADCLLLGWIGCQPVEAPFVTIGQISPLVFFLFFAITPILGRVGRGIPNSYTDETDH T" misc_feature 60520 /gene="cob" /note="C to U RNA editing" misc_feature 60559 /gene="cob" /note="C to U RNA editing" misc_feature 60802 /gene="cob" /note="C to U RNA editing" misc_feature 61087 /gene="cob" /note="C to U RNA editing" misc_feature 61142 /gene="cob" /note="C to U RNA editing" misc_feature 61216 /gene="cob" /note="C to U RNA editing" misc_feature 61318 /gene="cob" /note="C to U RNA editing" gene 62349..62433 /gene="tRNA-Ser" /db_xref="GeneID:814578" tRNA 62349..62433 /gene="tRNA-Ser" /product="tRNA-Ser" /db_xref="GeneID:814578" CDS complement(68918..69253) /gene="nad5" /note="orf111a" /codon_start=1 /protein_id="NP_085493.1" /db_xref="GI:13449310" /db_xref="UniProt/TrEMBL:P93290" /db_xref="GeneID:814567" /translation="MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVG QGLFYSATSDLQLKAFADSDWASCPDTRRSVTGFCSLVPLWFLGALRKSILSPGLLQR QNIEALHLL" gene 71349..71526 /gene="tRNA-fMet" /db_xref="GeneID:814579" promoter 71349..71357 /gene="tRNA-fMet" /evidence=not_experimental tRNA 71453..71526 /gene="tRNA-fMet" /product="tRNA-Met" /note="fMet" /db_xref="GeneID:814579" CDS 76132..76437 /note="orf101a" /codon_start=1 /protein_id="NP_085494.1" /db_xref="GI:13449311" /db_xref="UniProt/TrEMBL:P93291" /translation="MSRLELGKEAVALLTLYEEGVKSLAIVDTLRSARQEQYRKHRMP WFLLQTRLQVRVIESAQLGMLPMPIPELLKEAVAVQPLNLNGIAHSSSSINSFLLFE" gene complement(76642..77259) /gene="nad6" /db_xref="GeneID:814580" CDS complement(76642..77259) /gene="nad6" /EC_number="1.6.99.3" /codon_start=1 
/exception="RNA editing" /product="NADH dehydrogenase subunit 6" /protein_id="NP_085495.1" /db_xref="GI:13449312" /db_xref="UniProt/Swiss-Prot:Q01825" /db_xref="GeneID:814580" /translation="MILSVLSSPALVSGLMVARAKNPVHSVLFPIPVFRDTSGLLLLL GLDFFAMIFPVVHIGAIAVSFLFVVMMFHIQIAEIHEEVLRYLPVSGIIGLIFWWEMF FILDNESIPLLPTQRNTTSLRYTVYAGKVRSWTNLETLGNLLYTYYSVWFLVPSLILL VAMIGAIVLTMHRTTKVKRQDVFRRNAIDFRRTIMRRTTDPLTIY" misc_feature 76814 /note="C to U RNA editing" misc_feature 77069 /note="C to U RNA editing" misc_feature 77091 /note="C to U RNA editing" misc_feature 77099 /note="C to U RNA editing" misc_feature 77157 /note="C to U RNA editing" misc_feature 77165 /note="C to U RNA editing" misc_feature 77171 /note="C to U RNA editing" misc_feature 77172 /note="C to U RNA editing" misc_feature 77207 /note="C to U RNA editing" misc_feature 77234 /note="C to U RNA editing" misc_feature 77332 /note="C to U RNA editing" CDS complement(77819..78151) /gene="nad5" /note="orf110a" /codon_start=1 /protein_id="NP_085496.1" /db_xref="GI:13449313" /db_xref="UniProt/TrEMBL:P93292" /db_xref="GeneID:814567" /translation="MNNAAKRADCWFGAKNYGRAVYECLRGGLYFTKDDENVNSQPFM RWRDRFLFCAEAVYKAQAETGGIKGHYLNATAGTCEEMIKRAVFARELGVPIVMHDYL NRGIHRKY" mRNA complement(join(79740..<80132,81113..81297)) /gene="nad5" /label=nad2_mrna /db_xref="GeneID:814567" exon complement(79740..80132) /gene="nad2" /label=nad2_ex2 misc_feature 79760 /note="C to U RNA editing" misc_feature 79788 /note="C to U RNA editing" misc_feature 79790 /note="C to U RNA editing" misc_feature 79857 /note="C to U RNA editing" misc_feature 79877 /note="C to U RNA editing" misc_feature 79891 /note="C to U RNA editing" misc_feature 79918 /note="C to U RNA editing" misc_feature 79924 /note="C to U RNA editing" misc_feature 79929 /note="C to U RNA editing" misc_feature 79974 /note="C to U RNA editing" misc_feature 79977 /note="C to U RNA editing" intron complement(80133..81112) /gene="nad2" /number=1 exon complement(81113..>81297) /gene="nad2" /label=ex1 
misc_feature 81208 /note="C to U RNA editing" misc_feature 81209 /note="C to U RNA editing" misc_feature 81239 /note="C to U RNA editing" gene complement(82028..83116) /gene="rps4" /db_xref="GeneID:814581" CDS complement(82028..83116) /gene="rps4" /codon_start=1 /exception="RNA editing" /product="ribosomal protein S4" /protein_id="NP_085497.1" /db_xref="GI:13449315" /db_xref="UniProt/Swiss-Prot:Q31708" /db_xref="GeneID:814581" /translation="MWLLKKLIQRDIDLSPLRFQTCRLLSGNVRNRELTIIQRRILRR LRNRKRSIKKRKIYPKKYLTSYIQLQTTRKLPLFHGDLPITEMHRGTKRTSYIPFPLN PETRFDVIPLRLHFLETIPQARQPISHRRVCVNKGMVSITHFKLSHGDIISFQENNAI IRGEEIRRSFYKEISVEKIIGKLLHQPLRMWRRSKTEWFHLLKTKRGCRLLLKSRFLQ QLRSSMQEEDLERTKKFGSEKVCLGSSFAEHKRMKRNLLKSLFLSKRRKDKNLNLPTR TISPIVYNSSLSLYSNSTYCFASPHKLTMKRRIKRIELPTHYSEVNHRTPKAVVSYGP NIGHIPHDIRLKDPNLPLRSRNGRGQNI" misc_feature 82060 /note="C to U RNA editing" misc_feature 82065 /note="C to U RNA editing" misc_feature 82074 /note="C to U RNA editing" misc_feature 82075 /note="C to U RNA editing" misc_feature 82125 /note="C to U RNA editing" misc_feature 82150 /note="C to U RNA editing" misc_feature 82161 /note="C to U RNA editing" misc_feature 82593 /note="C to U RNA editing" misc_feature 82740 /note="C to U RNA editing" misc_feature 82774 /note="C to U RNA editing" misc_feature 82809 /note="C to U RNA editing" misc_feature 82818 /note="C to U RNA editing" misc_feature 82882 /note="C to U RNA editing" misc_feature 82888 /note="C to U RNA editing" misc_feature 83029 /note="C to U RNA editing" CDS complement(89617..90054) /gene="nad5" /note="orf145a" /codon_start=1 /protein_id="NP_085498.1" /db_xref="GI:13449316" /db_xref="UniProt/TrEMBL:P93293" /db_xref="GeneID:814567" /translation="MRKGERMLATSKEWERPRPVEASCSEGVLKVLKGCRTILKGNRH DSLYILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKKGFLDSSKVSSLK FCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWGAPSVPLSF" CDS complement(90883..91347) /gene="nad5" /note="orf154" /codon_start=1 /protein_id="NP_085499.1" /db_xref="GI:13449317" 
/db_xref="UniProt/TrEMBL:P93295" /db_xref="GeneID:814567" /translation="MALPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAW QKLCKSKEDDGGLGFRDLGWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMEC SVGTRPSYAWRSIIHGRELLSRGLLRTIGDGIHTKVWLDRWIMDETPLPPLN" CDS 91001..91384 /note="orf127" /codon_start=1 /protein_id="NP_085500.1" /db_xref="GI:13449318" /db_xref="UniProt/TrEMBL:P93294" /translation="MILLQAYEGLVPTLHSIIELCGKYRLRRSRDSSVCGWCIIRKDC FARRAWLNQPKSRKPKPPSSSLDLHNFCQATHEIFLLFSQELHQNSVIALVNFLHSSF ESRKHDIAYTGKAIAIDFSSTSFPP" promoter complement(92271..92279) /gene="nad5" /note="orf145a" /evidence=not_experimental gene 98939..103914 /gene="tRNA-Ser" /db_xref="GeneID:814582" promoter 98939..98947 /gene="tRNA-Ser" /evidence=not_experimental gene 103752..104295 /gene="tRNA-Tyr" /db_xref="GeneID:814583" promoter 103752..103760 /gene="tRNA-Tyr" /evidence=not_experimental repeat_unit 103805..104337 /note="repeat II" tRNA 103827..103914 /gene="tRNA-Ser" /product="tRNA-Ser" tRNA 104221..104295 /gene="tRNA-Tyr" /product="tRNA-Tyr" /db_xref="GeneID:814583" gene 104457..104531 /gene="tRNA-Pro" /db_xref="GeneID:814584" tRNA 104457..104531 /gene="tRNA-Pro" /product="tRNA-Pro" /db_xref="GeneID:814584" gene 104885..104995 /gene="tRNA-Cys" /db_xref="GeneID:814585" tRNA 104885..104995 /gene="tRNA-Cys" /product="tRNA-Cys" /db_xref="GeneID:814585" CDS complement(104954..105553) /gene="nad5" /note="orf199" /codon_start=1 /protein_id="NP_085501.1" /db_xref="GI:13449319" /db_xref="UniProt/TrEMBL:P93296" /db_xref="GeneID:814567" /translation="MVFQSFILGNLVYLCMKIINSVVVVGLYYGFLTTFSIGPSYLFL LRARVMDEGEEGTEKKVSATTGFIAGQLMMFISIYYAPLHLALGRPHTITVLALPYLL FHFFFWNNHKHFFDYGSTTRNEMRNLRIQCVFPNNLIFKLFNHLILPSSMLARLVNIY MFRCNNKMLFVTSSFVVCVRMLLVEWAFPLFQLFLVMKV" promoter complement(105653..105661) /gene="nad5" /note="orf199" /evidence=not_experimental gene 105887..105948 /gene="tRNA-Asn" /db_xref="GeneID:814586" tRNA 105887..105948 /gene="tRNA-Asn" /product="tRNA-Asn" /db_xref="GeneID:814586" gene 106797..106879 /gene="tRNA-Tyr" 
/db_xref="GeneID:814587" tRNA 106797..106879 /gene="tRNA-Tyr" /product="tRNA-Tyr" /db_xref="GeneID:814587" CDS complement(110930..111403) /gene="nad5" /note="orf157" /codon_start=1 /protein_id="NP_085502.1" /db_xref="GI:13449320" /db_xref="UniProt/TrEMBL:P93297" /db_xref="GeneID:814567" /translation="MGDLEGQDRPDPISTMVGPSGTGNLRLTSFQQVRRSILSQERRN PAPLATCTKKKLGRKEEPLLIPATINNYRRAIHLKNGARISLDVGLYFFRRARFGTLK QDMIYIIRHHRRLEIKVRFIALLIQACCRIVGYDYLFFYEVRNHLLLAALLIIIR" gene 111587..112907 /gene="atp6-1" /db_xref="GeneID:814588" promoter 111587..111595 /gene="atp6-1" /evidence=not_experimental CDS 111750..112907 /gene="atp6-1" /codon_start=1 /exception="RNA editing" /product="ATPase subunit 6" /protein_id="NP_085503.1" /db_xref="GI:13449321" /db_xref="UniProt/TrEMBL:P93298" /db_xref="GeneID:814588" /translation="MRRIFLFDENSLNSSSTIDTSSASTIDTSFASQCTNFSSGQASG TQDTHAGIFEDCPGLNPNDERVVELQCEIREKCEALTQDPEMGLILGEALHAESDNVP FLQSIADDLTQNGVSGEAFQEALNIVGQAAASPLDQFEIVPLIPMHIGNFYFSFTNPS LFMLLTLSFFLLLIHFVTKKGGGNLVPNAWQSLVELLYDFVLNLVKEQIGGLSGNVKQ MFFPCILVTFLFLLFCNLQGMIPYSFTVTSHFLITLALSFSIFIGITIVGFQRHGLHF FSFLLPAGVPLPLAPFLVLLELISYCFRALSLGIRLFANMMAGHSLVKILSGFAWTML CMNDIFYFIGALGPLFIVLALTGLELGVAILQAYVFTILICIYLNDAINLH" repeat_unit 112147..118736 /note="repeat III" misc_feature 112224 /gene="atp6-1" /note="C to U RNA editing" gene 112953..113039 /gene="tRNA-Ser" /db_xref="GeneID:814589" tRNA 112953..113039 /gene="tRNA-Ser" /product="tRNA-Ser" /db_xref="GeneID:814589" CDS 115431..115751 /note="orf106a identical to orf106g" /codon_start=1 /protein_id="NP_085504.1" /db_xref="GI:13449322" /db_xref="UniProt/TrEMBL:P93299" /translation="MLHRGRSCLTGLFPCYLLSNWLNSNLCWIPLKLVIPCFQLIVES YLLEFLLLLAISTCLLGEDSLIWLTLVVAHSKSRQSSSQEPLDTRMATRALLDQLRSD RRHN" CDS 116296..116754 /note="orf152a identical to orf152b" /codon_start=1 /protein_id="NP_085505.1" /db_xref="GI:13449323" /db_xref="UniProt/TrEMBL:P93300" /translation="MPTILSSKPAFNSLFSYHLIGLISNKLVTLAPDYTGTKKTTWGA 
RLHLQELRVQTKHRQIEPDIKNLPLQPIRYGSFRPVFHWIEKPCMLIGLCGLHSEVSF IANWPWGKPSEIGGCSIPCMLTGRGSELSYHIRASRRPLPGNRFHFQSFF" CDS complement(119381..119701) /gene="nad5" /note="orf106b" /codon_start=1 /protein_id="NP_085506.1" /db_xref="GI:13449324" /db_xref="UniProt/TrEMBL:P93301" /db_xref="GeneID:814567" /translation="MVVTAYPKSSAGMGVTVLPEYLKQSSYEAYSRPYSAFFLSGCTK QERSPLLARRLVDAWLSFHSILMINEEVSDWEQLSDHYTRRSLFKTIAFRNLQREEEY RPGG" gene complement(127021..127094) /gene="tRNA-Ile" /db_xref="GeneID:814590" tRNA complement(127021..127094) /gene="tRNA-Ile" /product="tRNA-Ile" /db_xref="GeneID:814590" CDS complement(127475..127843) /gene="nad5" /note="orf122a" /codon_start=1 /protein_id="NP_085507.1" /db_xref="GI:13449325" /db_xref="UniProt/TrEMBL:P93302" /db_xref="GeneID:814567" /translation="MILNLDTNIFNHGLSRHNILAFSQGFPIGLPCRNWIEVGLRLRL RLLLELAVGNFPQGFKIHLSGSFQAVRLALFSSFTSLRTDELLLIETRPSYLSSVQGL KYYVIFIDNYSQGSVGCSRN" misc_feature 129794 /note="C to U RNA editing" misc_feature 129825 /note="C to U RNA editing" gene 129909..130385 /gene="orfB" /db_xref="GeneID:814591" CDS 129909..130385 /gene="orfB" /codon_start=1 /protein_id="NP_085508.1" /db_xref="GI:13449326" /db_xref="UniProt/TrEMBL:P93303" /db_xref="GeneID:814591" /translation="MPQLDKFTYFSQFFWLCLFFFTFYIFICNDGDGVLGISRILKLR NQLLSHRGKTIRSKDPNSLEDLLRKGFSTGVSYMYASLFEVSQWCKAVDLLGKRRKIT LISCFGEISGSRGMERNILYNISKSSPSNTGRWITCRNCRNDIMLIHVVHGQGSIK" CDS 130817..131140 /note="orf107c" /codon_start=1 /protein_id="NP_085509.1" /db_xref="GI:13449327" /db_xref="UniProt/TrEMBL:P93304" /translation="MAVRCSKIQRTDGRPGHSLQPAKVSFVAGQPLGYYSSWPLFALS HHMVVWYAAEHVYPSSFFFQSKLPPSEVFAYPGMEVFNEYTLYHAWVDEALSGVSKME LYAKA" CDS 130968..131393 /note="orf141" /codon_start=1 /protein_id="NP_085510.1" /db_xref="GI:13449328" /db_xref="UniProt/TrEMBL:P93305" /translation="MRQNMSILPPSSFSRNFLLVRCSPIQVWKYSMNTLCTMHGWMKL YLEYQRWNCMLRLSASIELEVFMDVPIFIRTYYRPANMDDSFRYGIIFRMYDLAVQLQ WGNVMKLLLVTKTQVEKKKALNLNVSKTRDGRGGLNQQG" misc_feature 132032 /note="C to U RNA editing" gene 
132071..138153 /gene="nad7" /db_xref="GeneID:814592" mRNA join(132071..132213,133177..133245,134309..134775, 135829..136072,137892..138153) /gene="nad7" /db_xref="GeneID:814592" CDS join(132071..132213,133177..133245,134309..134775, 135829..136072,137892..138153) /gene="nad7" /EC_number="1.6.99.3" /codon_start=1 /exception="RNA editing" /product="NADH dehydrogenase subunit 7" /protein_id="NP_085511.1" /db_xref="GI:13449329" /db_xref="UniProt/Swiss-Prot:P93306" /db_xref="GeneID:814592" /translation="MTTRKRQIKNFTSNFGPQHPAAHGVSRLVLEMNGEVVERAEPHI GSLHRGTEKLIEYKTYLQALPYSDRSDYVSMMAQEHAHSSAVEKLLNCEVPLRAQYIR VLFREITRISNHSLALTTHAMDVGALTPFLWAFEEREKLLEFYERVSGARMHASFIRP GGVAQDLPLGLCRDIDSFTQQFASRIDELEEMSTGNRIWKQRLVDIGTVTAQQAKDWG FSGVMLRGPGVCWDSRRAAPYDVHDQSDLDVPVGTRGDRYDRYCIRIEEMRQSLRIIV QCLNQMPSGMIKADDRKLCPPSRCRMKLSMESSIHHFELYTEGFSVPASSTYTAVEAP KGEFGVFLVSNGSNRPYRRKIRAPGSAHSQGLDSMSKHHMPADVVTIIGTQDIVFGEV DR" exon 132071..132213 /gene="nad7" /number=1 misc_feature 132094 /gene="nad7" /note="C to U RNA editing" misc_feature 132108 /gene="nad7" /note="C to U RNA editing" misc_feature 132147 /gene="nad7" /note="C to U RNA editing" misc_feature 132207 /gene="nad7" /note="C to U RNA editing" intron 132214..133176 /gene="nad7" /number=1 exon 133177..133245 /gene="nad7" /number=2 misc_feature 133233 /gene="nad7" /note="C to U RNA editing" intron 133246..134308 /gene="nad7" /number=2 misc_feature 134300 /gene="nad7" /note="C to U RNA editing" exon 134309..134775 /gene="nad7" /number=3 misc_feature 134309 /gene="nad7" /note="C to U RNA editing" misc_feature 134340 /gene="nad7" /note="C to U RNA editing" misc_feature 134347 /gene="nad7" /note="C to U RNA editing" misc_feature 134412 /gene="nad7" /note="C to U RNA editing" misc_feature 134431 /gene="nad7" /note="C to U RNA editing" misc_feature 134440 /gene="nad7" /note="C to U RNA editing" misc_feature 134674 /gene="nad7" /note="C to U RNA editing" intron 134776..135828 /gene="nad7" /number=3 exon 135829..136072 /gene="nad7" /number=4 
misc_feature 135847 /gene="nad7" /note="C to U RNA editing" misc_feature 135873 /gene="nad7" /note="C to U RNA editing" misc_feature 135883 /gene="nad7" /note="C to U RNA editing" misc_feature 135888 /gene="nad7" /note="C to U RNA editing" misc_feature 135918 /gene="nad7" /note="C to U RNA editing" misc_feature 135938 /gene="nad7" /note="C to U RNA editing" misc_feature 135944 /gene="nad7" /note="C to U RNA editing" intron 136073..137891 /gene="nad7" /number=4 exon 137892..138153 /gene="nad7" /number=5 misc_feature 137931 /gene="nad7" /note="C to U RNA editing" misc_feature 138018 /gene="nad7" /note="C to U RNA editing" misc_feature 138025 /gene="nad7" /note="C to U RNA editing" misc_feature 138047 /gene="nad7" /note="C to U RNA editing" misc_feature 138056 /gene="nad7" /note="C to U RNA editing" misc_feature 138071 /gene="nad7" /note="C to U RNA editing" misc_feature 138092 /gene="nad7" /note="C to U RNA editing" misc_feature 138105 /gene="nad7" /note="C to U RNA editing" intron complement(140723..190762) /gene="nad5" /note="transpliced intron" /number=2 exon complement(140724..141939) /gene="nad5" /number=2 misc_feature 140895 /note="C to U RNA editing" misc_feature 141295 /note="C to U RNA editing" misc_feature 141335 /note="C to U RNA editing" misc_feature 141406 /note="C to U RNA editing" misc_feature 141445 /note="C to U RNA editing" misc_feature 141457 /note="C to U RNA editing" misc_feature 141494 /note="C to U RNA editing" misc_feature 141541 /note="C to U RNA editing" misc_feature 141561 /note="C to U RNA editing" misc_feature 141562 /note="C to U RNA editing" misc_feature 141572 /note="C to U RNA editing" misc_feature 141617 /note="C to U RNA editing" misc_feature 141622 /note="C to U RNA editing" misc_feature 141676 /note="C to U RNA editing" misc_feature 141772 /note="C to U RNA editing" misc_feature 141796 /note="C to U RNA editing" misc_feature 141812 /note="C to U RNA editing" misc_feature 141928 /note="C to U RNA editing" intron 
complement(141940..142768) /gene="nad5" /number=1 misc_feature 141949 /note="C to U RNA editing" misc_feature 141961 /note="C to U RNA editing" misc_feature 142139 /note="C to U RNA editing" exon complement(142769..142998) /gene="nad5" /number=1 misc_feature 142844 /note="C to U RNA editing" gene complement(143219..>318390) /gene="nad1" /db_xref="GeneID:814625" mRNA complement(join(143219..143477,146990..147048, 287917..288108,289003..289083,318004..>318390)) /gene="nad1" /label=nad1_mrna /db_xref="GeneID:814625" CDS complement(join(143219..143477,146990..147048, 287917..288108,289003..289083,318004..>318390)) /gene="nad1" /codon_start=1 /label=nad1_cds /product="NADH dehydrogenase subunit 1" /protein_id="NP_085565.2" /db_xref="GI:26557004" /db_xref="UniProt/TrEMBL:P92558" /db_xref="GeneID:814625" /translation="TYIAVPAEILGIILPLLLGVAFLVLAERKVMAFVQRRKGPDVVG SFGLLQPLADGSKLILKEPISPSSANFFLFRMAPVATFMLSLVARAVVPFDYGMVLSD PNIGLLYLFAISSLGVYGIIIAGRSSNSKYAFLGALRSAAQMVSYEVSIGLILITVLI CVGPRNSSEIVMAQKQIWSGIPLFPVLVMFLISRLAETNRAPFDLPEAEAESVAGYNV EYSSMGSALFFLGEYANMILMSGPCTLFFPGGWPPILDLPIFKKIPGSIWFSIKVLLF LFLYIWVRAAFPRYRYDQLMGLGRKVFLPLSLARVVPVSGLLVTFQWLP" exon complement(143219..143477) /gene="nad1" /usedin=Y08502:nad1_cds /usedin=Y08502:nad1_mrna /label=nad1_ex5 misc_feature 143260 /note="C to U RNA editing" misc_feature 143269 /note="C to U RNA editing" misc_feature 143299 /note="C to U RNA editing" misc_feature 143374 /note="C to U RNA editing" misc_feature 143400 /note="C to U RNA editing" misc_feature 143434 /note="C to U RNA editing" misc_feature 143442 /note="C to U RNA editing" misc_feature 143454 /note="C to U RNA editing" misc_feature 143472 /note="C to U RNA editing" intron complement(143478..146989) /gene="nad1" /number=4 misc_feature 143590 /note="C to U RNA editing" misc_feature 144105 /note="C to U RNA editing" gene complement(144294..146312) /gene="matR" /db_xref="GeneID:814593" CDS complement(144294..>146312) /gene="matR" /codon_start=1 /exception="RNA editing" 
/product="maturase" /protein_id="NP_085512.1" /db_xref="GI:13449330" /db_xref="UniProt/TrEMBL:P93307" /db_xref="GeneID:814593" /translation="GLKFRPLTVVLPIEKIMKEAIRMVLESIYDPEFPDTSHFRSGQG CHSVLRRIKEEWGISRWFLEFDIRKCFHTIDRHRLIQILKEEIDDPKFFYSIQKVFSA GRLVGVERGPYSVPHSVLLSALPGNIYLHKLDQEIGRIRQKYEIPIVQRVRSVLLRTG RRIDDQENPGEEASFNAPQDNRAIIVGSVKSMQRKAAFHSLVSSWHTPPTSTLRLRGD QKRPFVFPPSSALAVFLNKPSSLLCAAFLIEAAGLTPKAEFYGGERCNNNWAMRDLLK YCKRKGLLIELGGEAILVIRSERGLARKQAPLKTHYLIRICYARYADDLLLGIVGAVE LLIEIQKRIAHFLQSGLNLWVGSAGSTTIAARSTVEFLGTVIREVPPRTTPIQFLREL EKRLRVKHRIHITACHLRSAIHSKFRNLGDSIPIKQLTKGMSKTGSLQDGVQLAETLG TAGVRSPQVSVLWGTVKHIRQGSRGISFLHSSGRSNASSDVQQVVSRSGTHARKLSLY TPPGRKAAGEGGGHWAGSISSEFPIKIEAPIKKILRRLRDRGIISRRRPWPIHVACLT NVSDEDIVNWSAGIAISPLSYYRCRDNLYQVRTIVDHQIRWSAIFTLAHKHKSSAPNI ILKYSKDSNIVNQEGGKILAEFPNSIELGKLGPGQDLNKKEHSTTSLV" misc_feature 144418 /note="C to U RNA editing" misc_feature 144506 /note="C to U RNA editing" misc_feature 144542 /note="C to U RNA editing" misc_feature 144562 /note="C to U RNA editing" misc_feature 144583 /note="C to U RNA editing" misc_feature 145939 /note="C to U RNA editing" misc_feature 146029 /note="C to U RNA editing" misc_feature 146072 /note="C to U RNA editing" misc_feature 146233 /note="C to U RNA editing" exon complement(146990..147048) /gene="nad1" /usedin=Y08502:nad1_cds /usedin=Y08502:nad1_mrna /label=nad1_ex4 CDS complement(147282..147611) /gene="nad1" /note="orf109" /codon_start=1 /protein_id="NP_085513.1" /db_xref="GI:13449331" /db_xref="UniProt/TrEMBL:P93308" /db_xref="GeneID:814625" /translation="MALFFFLLLLISHREQLLLVQGHQMRDLLPTPRNRSNGRLPSPF SRVINPSTHLSIHKKALPRGERKLQDEYALDSRIHSRPDPLWNFRNLQKHSRKGLVMI FSKITRC" CDS 150783..151151 /note="orf102b" /codon_start=1 /protein_id="NP_085514.1" /db_xref="GI:13449332" /db_xref="UniProt/TrEMBL:P93309" /translation="MYRYEISCPFNLRSPAVPVSSKASSTSFIKTKALRISEVNRELS VPRVYREKSFTRRLNAPIFGSLFVDKESRFANPYSFTLNQGLTRGRGKQAKLAPDRRG KSVVTEVDYRTGVGENIVKD" CDS complement(151265..151747) /gene="nad1" /note="orf160" 
/codon_start=1 /protein_id="NP_085515.1" /db_xref="GI:13449333" /db_xref="UniProt/TrEMBL:P93310" /db_xref="GeneID:814625" /translation="MDLRLLHGVAYGHSWFGKWGYRFCSGSFGVEEHHYHRAIAFLTS ISLVDDITANFRENKANLNIGDIVRCYRDMSEIQLTTLQDLLRFMLTIKSRAPPIRIP IGKIEAPSVVLPSMKAYGTRACPQVKQCPKDKEKSVKCRKFALPLPYRLAFTSLSVVV " promoter complement(153106..153114) /gene="nad1" /note="orf160" /evidence=not_experimental gene 154744..157345 /gene="rpl2" /db_xref="GeneID:814594" mRNA join(154744..155660,157213..157345) /gene="rpl2" /db_xref="GeneID:814594" CDS join(154744..155660,157213..157345) /gene="rpl2" /codon_start=1 /exception="RNA editing" /product="ribosomal protein L2" /protein_id="NP_085516.1" /db_xref="GI:13449334" /db_xref="UniProt/TrEMBL:P93311" /db_xref="GeneID:814594" /translation="MRPGRARALRQFTLSTGKSAGRNSSGRITVFHRGGGSKRLLRRI DLKRSTSSMGIVESIEYDPNRSSQIAPVRWIKGGCQKKMNTIEKFAPPRKILEPTTNT ISGLFSFSFLPGKVDKRKVACFSPGLMAAYVVVGLPTGMPPLSSSKSAFASKGAGSTK TLVKDVFFSAFSSPKAKRETASLAFASSFGFPRIAVAGAKPAFFAPRMRQKVRGKSTF SLCEVQKGRTHSILWAHRIKGKAGLSWQSFRRQDTLGLVGAAGHKKSKPKTDQGNLPA KPIGERAKQLKALRGLRAKDGACKVDRAPVTYIIASHQLEAGKMVMNCDWSKPSTSSF LQSAQNDHPKPLFTV" exon 154744..155660 /gene="rpl2" /number=1 misc_feature 155454 /gene="rpl2" /note="C to U RNA editing" intron 155661..157212 /gene="rpl2" /number=1 exon 157213..157345 /gene="rpl2" /number=2 gene 157491..158351 /gene="orfX" /db_xref="GeneID:814595" CDS <157491..158351 /gene="orfX" /codon_start=1 /exception="RNA editing" /protein_id="NP_085517.1" /db_xref="GI:13449335" /db_xref="UniProt/TrEMBL:P93312" /db_xref="GeneID:814595" /translation="NPSLLALNYLYISYEFNFASETILGEVRIRSVRILIGLGLTWFT CYWFPEELISPLASPFLTLPFDSYFVCTQLTEAFSTFVATSSIACSYFVFPLISYQIW CFLIPSCYGEQRTKYNRFLHLSGSRFFLFLFLTPPRVVPNVWHFPYFVGATSTNSLMI KLQPKIYDHIMLTVRISFIPSVCSQVPVIVICLPEPRGLSLETFTNNRRFLMVFPLLT AALSTPPDIWCQIVARFLISLIIELAIFVASIVQVREEGWTSGMRESGSIEKKNKSSP PPRTWQSNYQ" misc_feature 157549 /gene="orfX" /note="C to U RNA editing" misc_feature 157634 /gene="orfX" /note="C to U RNA editing" 
misc_feature 157635 /gene="orfX" /note="C to U RNA editing" misc_feature 157651 /gene="orfX" /note="C to U RNA editing" misc_feature 157654 /gene="orfX" /note="C to U RNA editing" misc_feature 157663 /gene="orfX" /note="C to U RNA editing" misc_feature 157690 /gene="orfX" /note="C to U RNA editing" misc_feature 157851 /gene="orfX" /note="C to U RNA editing" misc_feature 157854 /gene="orfX" /note="C to U RNA editing" misc_feature 157867 /gene="orfX" /note="C to U RNA editing" misc_feature 157869 /gene="orfX" /note="C to U RNA editing" misc_feature 157897 /gene="orfX" /note="C to U RNA editing" misc_feature 157899 /gene="orfX" /note="C to U RNA editing" misc_feature 157902 /gene="orfX" /note="C to U RNA editing" misc_feature 157930 /gene="orfX" /note="C to U RNA editing" misc_feature 157964 /gene="orfX" /note="C to U RNA editing" misc_feature 157995 /gene="orfX" /note="C to U RNA editing" misc_feature 158028 /gene="orfX" /note="C to U RNA editing" misc_feature 158071 /gene="orfX" /note="C to U RNA editing" misc_feature 158133 /gene="orfX" /note="C to U RNA editing" misc_feature 158155 /gene="orfX" /note="C to U RNA editing" misc_feature 158183 /gene="orfX" /note="C to U RNA editing" misc_feature 158190 /gene="orfX" /note="C to U RNA editing" misc_feature 158236 /gene="orfX" /note="C to U RNA editing" gene 161693..169674 /gene="nad4" /db_xref="GeneID:814596" mRNA join(161693..162153,163625..164139,167221..167642, 169585..169674) /gene="nad4" /db_xref="GeneID:814596" CDS join(161693..162153,163625..164139,167221..167642, 169585..169674) /gene="nad4" /EC_number="1.6.99.3" /codon_start=1 /exception="RNA editing" /product="NADH dehydrogenase subunit 4" /protein_id="NP_085518.1" /db_xref="GI:13449336" /db_xref="UniProt/TrEMBL:P93313" /db_xref="GeneID:814596" /translation="MLEHFCECYSNLSGLILCPVLGSITLLFIPNSRIRPIRLIGLCA SLITFLYSPVPRIQFDSSTAKSQFVESLRWLPYENINFYLGIDGISLFFVILTTFLIP ICISVGWSGMRSYGKEYITAFLIREFLMIAVFRMLDLLLFYVFPESVPIPMFIIIGVW 
GSRQRKIKAAYQFFLYTLLGSLFMLLAILLILFQTGTTDLQISLTTEFSERRQIFLWI ASFASFAVKVPMVPVHIWLPEAHVEAPTAGSVILAGIPLKFGTHGFLRFSIPMFPEAT LCSTPFIYTLSAIAIIYTSLTTSRQIDLKKIIAYSSVAHMNLVTIGMFSPNIQGIGGS ILPMLSHGLVPSALFLCVGVLYDRHKTRLVRYYGGLVSTMPNLSTIFFSFTLANMSSP GTSSFIGEFLILVGAFQRNSLVATLAALGMILGAAYSLWLYNRVVSGNLKPDFLHKFS DSNGREVSIFIPFLVGLVRMGVHPKVFPDCMHTSVSNLVQHGKFH" exon 161693..162153 /gene="nad4" /number=1 misc_feature 161721 /gene="nad4" /note="C to U RNA editing" misc_feature 161766 /gene="nad4" /note="C to U RNA editing" misc_feature 161776 /gene="nad4" /note="C to U RNA editing" misc_feature 161799 /gene="nad4" /note="C to U RNA editing" misc_feature 161816 /gene="nad4" /note="C to U RNA editing" misc_feature 161850 /gene="nad4" /note="C to U RNA editing" misc_feature 161856 /gene="nad4" /note="C to U RNA editing" misc_feature 161858 /gene="nad4" /note="C to U RNA editing" misc_feature 161889 /gene="nad4" /note="C to U RNA editing" misc_feature 162009 /gene="nad4" /note="C to U RNA editing" misc_feature 162054 /gene="nad4" /note="C to U RNA editing" misc_feature 162068 /gene="nad4" /note="C to U RNA editing" misc_feature 162094 /gene="nad4" /note="C to U RNA editing" misc_feature 162095 /gene="nad4" /note="C to U RNA editing" misc_feature 162110 /gene="nad4" /note="C to U RNA editing" misc_feature 162141 /gene="nad4" /note="C to U RNA editing" intron 162154..163624 /gene="nad4" /number=1 exon 163625..164139 /gene="nad4" /number=2 misc_feature 163771 /gene="nad4" /note="C to U RNA editing" misc_feature 163822 /gene="nad4" /note="C to U RNA editing" misc_feature 163930 /gene="nad4" /note="C to U RNA editing" misc_feature 163947 /gene="nad4" /note="C to U RNA editing" misc_feature 163999 /gene="nad4" /note="C to U RNA editing" misc_feature 164059 /gene="nad4" /note="C to U RNA editing" intron 164140..167220 /gene="nad4" /number=2 exon 167221..167642 /gene="nad4" /number=3 misc_feature 167254 /gene="nad4" /note="C to U RNA editing" misc_feature 167277 /gene="nad4" /note="C to U RNA 
editing" misc_feature 167345 /gene="nad4" /note="C to U RNA editing" misc_feature 167373 /gene="nad4" /note="C to U RNA editing" misc_feature 167416 /gene="nad4" /note="C to U RNA editing" misc_feature 167599 /gene="nad4" /note="C to U RNA editing" misc_feature 167617 /gene="nad4" /note="C to U RNA editing" intron 167643..169584 /gene="nad4" /number=3 exon 169585..169674 /gene="nad4" /number=4 misc_feature 169591 /gene="nad4" /note="C to U RNA editing" misc_feature 169603 /gene="nad4" /note="C to U RNA editing" misc_feature 169619 /gene="nad4" /note="C to U RNA editing" CDS 169796..170737 /note="orf313" /codon_start=1 /protein_id="NP_085519.1" /db_xref="GI:13449337" /db_xref="UniProt/TrEMBL:P93314" /translation="MTIRNQRFSLLKQPISSTLNQHLVDYPTPSNLSYWWGFGPLAGT MILSVLSSPALVSGLMVARAKNLVHSVLFPIPIFFSINQLFHYFCRLPIIKHLATKCQ LLLFLISHFLLLLVLTKLVLDLGGYLFMDDLSRALSQFVPGFSGGLGGGSNTPPNPSG DFFLSSYQTSDPDYHDQRRGDSYFSSAPGVQETHRHASGSSTNLHLNLNDQSQDPIFL EVERLSLKCDKVKEKTILKTQSLLLERGYHIPDERDIERAINVVMTEHETIDIDRRRK RFYYLYSCLGKTGNKFWMELLETLADYNINIKSDSDN" CDS 174205..174525 /note="orf106c" /codon_start=1 /protein_id="NP_085520.1" /db_xref="GI:13449338" /db_xref="UniProt/TrEMBL:P93315" /translation="MTPCFINGKCAEQLTSTTAYKLLFLPPLTEHTTSLYPKQYPDKQ LVLLLDPRSGTRLTYYSQPFFIALSSPYSGSSLSAKFHFAPHLYAFLVVASITRSDIA YSVN" promoter 176194..176202 /evidence=not_experimental CDS complement(176542..177027) /gene="nad1" /note="orf161" /codon_start=1 /protein_id="NP_085521.1" /db_xref="GI:13449339" /db_xref="UniProt/TrEMBL:P93316" /db_xref="GeneID:814625" /translation="MLYGLRLYAFQEISFLDPWQLAAIFSGSCVLFISLEKRTLTGYM LTFILYSVLALFVSVWLSSAAGKAGIPIEGMVFLLFLIGGICFICLIQKIFQLTPNTV QALIPILFSALFFFLEELPALEGLPLLKWLKGLDLLLLLVGLLLLIFNENRQGGDGEG S" repeat_unit 178863..183059 /note="repeat I" promoter 181074..181082 /note="orf139b" /evidence=not_experimental CDS 182278..182697 /note="orf139b identical to orf139a" /codon_start=1 /protein_id="NP_085522.1" /db_xref="GI:13449340" /db_xref="UniProt/TrEMBL:P94024" 
/translation="MIQRTRNQSIMLSLPSNQSANHAILTFQPIGQSRYLLTFQPTPS IPLLQQYIISVPYLDAYSSICFPVMARIRSAKYCFFFFLVLFLNGIIATRGKAMLPTL PQKGAAFFPPKMPVPPSGPSKQHNSAPRSDFVQFFYM" CDS 183111..183443 /note="orf110b" /codon_start=1 /protein_id="NP_085523.1" /db_xref="GI:13449341" /db_xref="UniProt/TrEMBL:P93317" /translation="MPSPILPMLPISHLIGTEVRNLISVRTPNITMDQLKNGCCSILT QLETLLRSQSPSEMTIFQTLCDRCCGAEVANEATVECGKTMETTNLTSGGRYWPFHNG TNLSRISL" gene complement(188084..188662) /gene="orf25" /db_xref="GeneID:814597" CDS complement(188084..188662) /gene="orf25" /codon_start=1 /exception="RNA editing" /protein_id="NP_085524.1" /db_xref="GI:13449342" /db_xref="UniProt/Swiss-Prot:Q04613" /db_xref="GeneID:814597" /translation="MRLSITNMDGRKMLFAAILSICALSSKKISIYNEEMIVALCFIG FIIFSRKSLGTTFKVTLDGSLQAIQEESQQFPNPNEVVPPESNEQQRLLRISLRICGT VVESLPMARCAPKCEKTVQALLCRNLNVKSATLTNATSSRRIRFQDDLVTKFYTLVGK QFAYSCISKAERVEFIRESLVVLRMVRGGVFS" misc_feature 188247 /note="C to U RNA editing" misc_feature 188268 /note="C to U RNA editing" misc_feature 188412 /note="C to U RNA editing" misc_feature 188413 /note="C to U RNA editing" misc_feature 188415 /note="C to U RNA editing" misc_feature 188448 /note="C to U RNA editing" misc_feature 188525 /note="C to U RNA editing" misc_feature 188574 /note="C to U RNA editing" gene complement(188929..189231) /gene="nad4L" /db_xref="GeneID:814598" CDS complement(188929..189231) /gene="nad4L" /EC_number="1.6.99.3" /codon_start=1 /exception="RNA editing" /product="NADH dehydrogenase subunit 4L" /protein_id="NP_085525.1" /db_xref="GI:13449343" /db_xref="UniProt/Swiss-Prot:Q04614" /db_xref="GeneID:814598" /translation="MDLIKYFTFSMIISILGIRGILLNRRNIPIMSMPIESMLLAVNS NFLVFSVSSDDMMGQVFASLVPTVAAAESAIGLAIFVITFRVRGTIAVEFINSIQG" misc_feature 189035 /note="C to U RNA editing" misc_feature 189044 /note="C to U RNA editing" misc_feature 189074 /note="C to U RNA editing" misc_feature 189122 /note="C to U RNA editing" misc_feature 189132 /note="C to U RNA editing" misc_feature 189137 /note="C to U RNA editing" 
misc_feature 189146 /note="C to U RNA editing" misc_feature 189177 /note="C to U RNA editing" misc_feature 189191 /note="C to U RNA editing" CDS complement(190104..190553) /gene="nad1" /note="orf149" /codon_start=1 /protein_id="NP_085526.1" /db_xref="GI:13449344" /db_xref="UniProt/TrEMBL:P93318" /db_xref="GeneID:814625" /translation="MRPFGEAKFLSGHQTLQLMRKKALWSKGKRVRCHTPCLPKVPRG RARRSGATTREQSPHRQGDRRRPSQGTSRPTGKTGETREGNPIGSQRIHSTCSPTDFI FLILESGGKGSLFCNEKKKRSRFDSAQPNDTSNTNDLCLECVARSLF" exon complement(190740..190761) /gene="nad1" /number=3 CDS complement(191055..191882) /gene="nad1" /note="orf275" /codon_start=1 /protein_id="NP_085527.1" /db_xref="GI:13449345" /db_xref="UniProt/TrEMBL:P93319" /db_xref="GeneID:814625" /translation="MKKYKMVINIDMLRLFLPLLGGSVSGSLFGRFLGSEGSAIMITT CVSFCALVVFIFGLFYFRKKGPLKRILYLFLVGFVLSLIRIKVVYLLGGQALPLLDPI LMYAVGAGALLGPNGAESSATWEEDSFELDVLGESFSSSKTDMDSQVAEAPQTEEGEP SVNQVPQEAGASHRVGPYQDQGLATDRNGNPIDLNDSLPPSSLLYGEIESSASVRARD LELEKDIKRVQRLTRNFDNAEDPARRLEVAARLDPEVRELDQKWALFQEKDASGLGR" intron complement(200001..<287916) /gene="nad1" /note="transpliced intron" /number=3 CDS 201729..202097 /note="orf122c" /codon_start=1 /protein_id="NP_085528.1" /db_xref="GI:13449346" /db_xref="UniProt/TrEMBL:P92510" /translation="MKHASFCLSSRILLLAPCRYLGTLLLLLPYPCSTLRQFLFLLRS LFIRDVEWIPAGLSHHIPYFPLASPPLTVETLLIARLLLSIKQLSLPPAKTASLSASL DAKTKGRSLLSSCSYCYMPT" CDS complement(203588..204310) /gene="nad1" /note="orf240a" /codon_start=1 /protein_id="NP_085529.1" /db_xref="GI:13449347" /db_xref="UniProt/TrEMBL:P92511" /db_xref="GeneID:814625" /translation="MRSSVLRSLRGRLVINLESTRKLRLSRTNIVPGRKKGQKSIKSK NMARKGNPILVRLGKNRSSDSSRFSAEALLGCLYFFIYFVAPTLGPVLFLLRLIHFVW GLRLGLGNENFHFGVGPDGGATGLDLNQPPQEQQPTLGVNRAALDLNELPPVHLLYAE VEGPQSTKAQNDVMLAHLNQVQNLTRDLQTEPNIWRRQALIDILDWEVRSLQRHFRIF RQRDRLREVQRSWLREQLNRYR" misc_feature 204112 /note="C to U RNA editing" gene complement(204340..204412) /gene="tRNA-Lys" /db_xref="GeneID:814600" tRNA complement(204340..204412) /gene="tRNA-Lys" 
/product="tRNA-Lys" /db_xref="GeneID:814600" CDS complement(207553..207915) /gene="nad1" /note="orf120" /codon_start=1 /protein_id="NP_085530.1" /db_xref="GI:13449348" /db_xref="UniProt/TrEMBL:P92512" /db_xref="GeneID:814625" /translation="MNRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINF HVPDEVWFQSVPTYSYLRRFGCVAYIHCDEGKLKPRAKKGEEKGSYLINRIVSILYTI GIGKTSSPRKASHLGIKG" CDS 209499..209822 /note="orf107d" /codon_start=1 /protein_id="NP_085531.1" /db_xref="GI:13449349" /db_xref="UniProt/TrEMBL:P92513" /translation="MEDFGFTTALRSESVYIVFRFISSSSLSYNTLSSAFIRLDWIKT PRHRAASRALGIQSNGQICTACPHLSALFGFLSVASCFPSNKDDGLHTRIYFDLEFQP DSAVF" gene 217631..219077 /gene="cox3" /db_xref="GeneID:814601" promoter 217631..217639 /gene="cox3" /evidence=not_experimental CDS 218280..219077 /gene="cox3" /codon_start=1 /evidence=not_experimental /exception="RNA editing" /product="cytochrome c oxidase subunit 3" /protein_id="NP_085532.2" /db_xref="GI:26556997" /db_xref="UniProt/Swiss-Prot:P92514" /db_xref="GeneID:814601" /translation="MIESQRHSYHLVDPSPWPISGSLGALATTVGGVMYMHPFQGGAR LLSLGLIFILYTMFVWWRDVLRESTLEGHHTKVVQLGPRYGSILFIVSEVMFFFAFFW ASSHSSLAPAVEIGGIWPPKGIEVLDPWEIPFLNTPILPSSGAAVTWAHHAILAGKEK RAVYALVATVLLALVFTGFQGMEYYQAPFTISDSIYGSTFFLATGFHGFHVIIGTLFL IICGIRQYLGHLTKEHHVGFEAAAWYWHFVDVVWLFLFVSIYWWGGI" misc_feature 218391 /gene="cox3" /note="C to U RNA editing" misc_feature 218524 /gene="cox3" /note="C to U RNA editing" misc_feature 218536 /gene="cox3" /note="C to U RNA editing" misc_feature 218590 /gene="cox3" /note="C to U RNA editing" misc_feature 218593 /gene="cox3" /note="C to U RNA editing" misc_feature 218692 /gene="cox3" /note="C to U RNA editing" misc_feature 218701 /gene="cox3" /note="C to U RNA editing" misc_feature 218882 /gene="cox3" /note="C to U RNA editing" misc_feature 219126 /note="C to U RNA editing" misc_feature 219159 /note="C to U RNA editing" misc_feature 219207 /note="C to U RNA editing" misc_feature 219244 /note="C to U RNA editing" misc_feature 219263 /note="C to U RNA editing" CDS 
complement(220471..220773) /gene="nad1" /note="orf100a" /codon_start=1 /protein_id="NP_085533.1" /db_xref="GI:13449351" /db_xref="UniProt/TrEMBL:P92515" /db_xref="GeneID:814625" /translation="MLHISQEFLLPISMEHWRLLITSQVIKKIGIPWKVIFHAFQNHN RGILFLQIFEHIIDFLFGEAIIYRYVLLDPKSRQVIRDSILNSFLVLYNQIKLLEK" CDS 220830..221189 /note="orf119" /codon_start=1 /protein_id="NP_085534.1" /db_xref="GI:13449352" /db_xref="UniProt/TrEMBL:P92516" /translation="MHFKRGSEGHSLPLPCMHRSMQDISQHLKQWPRFVLQAGFYWPT TFKDAHGFVSSCDACQRKGNFTKRNEMPQHFILEVEVFDVWGIYFMKKTIFSWKPIHP NGGRLCLKMGGSSCEPH" CDS complement(221700..222029) /gene="nad1" /note="orf109b" /codon_start=1 /protein_id="NP_085535.1" /db_xref="GI:13449353" /db_xref="UniProt/TrEMBL:P92517" /db_xref="GeneID:814625" /translation="MLRSGKELEEVVRDDKEEEQVVVRKAKQIVNFPLLGMLSSARYG LRRWPFTVKCSPLTSQSTTAPYGFTSFMRKGKKHLDLNFPGNNLSLELRSSTWSFTLN SLGKIFL" CDS complement(222687..222989) /gene="nad1" /note="orf100b" /codon_start=1 /protein_id="NP_085536.1" /db_xref="GI:13449354" /db_xref="UniProt/TrEMBL:P92518" /db_xref="GeneID:814625" /translation="MGLSTHCQLVFSWKPMNTHMVSRAMEGSLKSYPFPSNINITSCV SIKLNDRNYLLYDEGRVRLSLLNLGERCDFVPFLSVELFKPLQSSGVYEKLHESNN" repeat_unit 227087..227619 /note="repeat II" gene 227109..227196 /gene="tRNA-Ser" /db_xref="GeneID:814602" tRNA 227109..227196 /gene="tRNA-Ser" /product="tRNA-Ser" /db_xref="GeneID:814602" gene 227502..227577 /gene="tRNA-Tyr" /db_xref="GeneID:814603" tRNA 227502..227577 /gene="tRNA-Tyr" /product="tRNA-Tyr" /db_xref="GeneID:814603" gene 227615..227686 /gene="tRNA-Glu" /db_xref="GeneID:814604" tRNA 227615..227686 /gene="tRNA-Glu" /product="tRNA-Glu" /db_xref="GeneID:814604" CDS complement(227709..228431) /gene="nad1" /note="orf240b" /codon_start=1 /protein_id="NP_085537.1" /db_xref="GI:13449355" /db_xref="UniProt/TrEMBL:P92519" /db_xref="GeneID:814625" /translation="MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLG IQIKTHPSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRS 
IVGALQYLTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNS KLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALAL TAAELTWSSASRSRDPSAMNTN" CDS complement(228573..229085) /gene="nad1" /note="orf170" /codon_start=1 /protein_id="NP_085538.1" /db_xref="GI:13449356" /db_xref="UniProt/TrEMBL:P92520" /db_xref="GeneID:814625" /translation="MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAM QEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEG IYFVETYSPVVRTATIRTILNVAQQLEVGQSINWMFKMHFSMGIFKKKFICINLLVLR ILFIHPMCVC" gene complement(231894..233042) /gene="ccb382" /db_xref="GeneID:814605" CDS complement(231894..233042) /gene="ccb382" /codon_start=1 /exception="RNA editing" /product="cytochrome c biogenesis orf382" /protein_id="NP_085539.2" /db_xref="GI:26556998" /db_xref="UniProt/TrEMBL:Q9T6H8" /db_xref="GeneID:814605" /translation="MSISIYEFFHYSLFPGLFVAFTYNKKQPPAFGAAPAFWCILLSF LGLSFCHIPNNLSNYNVLTANAPFFYQISGTWSNHEGSILLWCRIPNFYGFFLCYRGR PQSHNVLKQGGHRESLFFFFVSNFVKNSILSLPRYEQESGLKNQLYTPFVLRTLVDSE LRSRRNRTFDGPALFYAPLYPERKIKNPLDAWRSRGSREGKRTHPLLHLARDDKERAS SIDEQRIDGALGIALFFSPFLSASSDPFVRNFFVCTEPLAESNPVPQDPISAIHPPCI YAGDVASAEGFGLCRSKMMNGIVALHSPPMRKDAAEKNGTLLRSAGCVGSRITSELFT LKFKHVGAKCYPALLLRSNRSPLMLLRRRFFAFSSFWAGARSHSTKRY" misc_feature 232088 /note="C to U RNA editing" misc_feature 232237 /note="C to U RNA editing" misc_feature 232252 /note="C to U RNA editing" misc_feature 232264 /note="C to U RNA editing" misc_feature 232324 /note="C to U RNA editing" misc_feature 232333 /note="C to U RNA editing" misc_feature 232334 /note="C to U RNA editing" misc_feature 232462 /note="C to U RNA editing" misc_feature 232464 /note="C to U RNA editing" misc_feature 232559 /note="C to U RNA editing" misc_feature 232639 /note="C to U RNA editing" misc_feature 232665 /note="C to U RNA editing" misc_feature 232672 /note="C to U RNA editing" misc_feature 232703 /note="C to U RNA editing" misc_feature 232754 /note="C to U RNA editing" misc_feature 232774 /note="C to U RNA editing" misc_feature 232781 
/note="C to U RNA editing" misc_feature 232843 /note="C to U RNA editing" misc_feature 232886 /note="C to U RNA editing" misc_feature 232900 /note="C to U RNA editing" misc_feature 232939 /note="C to U RNA editing" misc_feature 232999 /note="C to U RNA editing" CDS complement(234538..234903) /gene="nad1" /note="orf121b" /codon_start=1 /protein_id="NP_085540.1" /db_xref="GI:13449358" /db_xref="UniProt/TrEMBL:P92521" /db_xref="GeneID:814625" /translation="MKLLVAPESIKTMTSCPAICPLILMDLSQLLNLPSTAKTTRVIP LLHPHLILLSFSLFQLHQPHPTHLLHPQPYTLILCPYICDQDSTCHHNENKAPCSFYP DQTVSGMVPFYVLGIGRAR" CDS 235338..235661 /note="orf107e" /codon_start=1 /protein_id="NP_085541.1" /db_xref="GI:13449359" /db_xref="UniProt/TrEMBL:P92522" /translation="MKTCLRCRKDYPQNVAGNMLSTLRKEQNQLTSVLTGIHILRRTR LKNWLGEMLEARIIQPSISPYSSPVLLVQKKDGGWPTARGLPSLLQAHGTRQVPNSRD RGTVG" CDS 235916..236392 /note="orf158" /codon_start=1 /protein_id="NP_085542.1" /db_xref="GI:13449360" /db_xref="UniProt/TrEMBL:P92523" /translation="MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGV SADPAKLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWT EMAALAFKALKGAVTTLPVLALPDLKLPFVTRVGKWNWSCFITREQACCVSQPRVF" CDS complement(236927..237481) /gene="nad1" /note="orf184" /codon_start=1 /protein_id="NP_085543.1" /db_xref="GI:13449361" /db_xref="UniProt/TrEMBL:P92525" /db_xref="GeneID:814625" /translation="MVSSSRIARSFTRYTSALQRHTIVTKIKQKFPCPRSRTQGQSRR SETHTISRRRSCRAIARSNLGRVSSVTLPWFSCPSPAVVALAKKPFRTARSSCPRSRK SSRCRKSLLSWTAHHLLDEFPKRRIWVFLSFRIGGSLHRRHAMEKNICSTRARIRLIS AISLLSKPIVEIVVGYPQSEDRRL" CDS 237053..237616 /note="orf187" /codon_start=1 /protein_id="NP_085544.1" /db_xref="GI:13449362" /db_xref="UniProt/TrEMBL:P92524" /translation="MTTMKRSADPEAEEDPDPSLGKFIEQVVSGPTQKGFPASGAFAT PRARRSRGPKRFLGKRNYRRARARKPGKRDRAHSSKVRSSDGSTRPSARYGMRFRSPT LPLCSRPRTWEFLLDLGYDSMSLKSARVASERPCDPGRANHVWEPGGLSPEQAQWDWV GKVYSRDRVLKLMGELHELVYRESGRR" CDS complement(239636..239956) /gene="nad1" /note="orf106d" /codon_start=1 /protein_id="NP_085545.1" /db_xref="GI:13449363" 
/db_xref="UniProt/TrEMBL:P92526" /db_xref="GeneID:814625" /translation="MHLERSVQSQLTESKEIARPYSLWGISLAQHSFKTSTRSTGKKR SKGSTSQDGKKQESLESRNDLGPTIVGLIRKILSYSSKKEFSNLTGLESGGSSPPFSL AVSK" gene complement(239988..240758) /gene="ccb256" /db_xref="GeneID:814606" CDS complement(239988..240758) /gene="ccb256" /codon_start=1 /exception="RNA editing" /product="cytochrome c biogenesis orf256" /protein_id="NP_085546.2" /db_xref="GI:26556999" /db_xref="UniProt/TrEMBL:P92527" /db_xref="GeneID:814606" /translation="MSVSLLQPSFLMSKTRSYAQILIGSWLFLTAMAIHLSLGVAPLD LQQGGNSRILYVHVPAARMSIIVYIATAINTFLFLLTKHPLYLRSSGTGIEMGAFFTL FTLVTGGFRGRPMWGTFWVWDARLTSVFISFLIYLGALRFQKLPVEPASISIRAGPID IPIIKSSVNWWNTSHQPGSISRSGTSIHVPMPIPILSNFANFPFSTRILFVLETRLPI PSFLESPITEEIEAREGIPKPSSLALFASMAEWLKRPT" misc_feature 240086 /note="C to U RNA editing" misc_feature 240103 /note="C to U RNA editing" misc_feature 240109 /note="C to U RNA editing" misc_feature 240135 /note="C to U RNA editing" misc_feature 240140 /note="C to U RNA editing" misc_feature 240141 /note="C to U RNA editing" misc_feature 240145 /note="C to U RNA editing" misc_feature 240151 /note="C to U RNA editing" misc_feature 240184 /note="C to U RNA editing" misc_feature 240191 /note="C to U RNA editing" misc_feature 240211 /note="C to U RNA editing" misc_feature 240238 /note="C to U RNA editing" misc_feature 240262 /note="C to U RNA editing" misc_feature 240282 /note="C to U RNA editing" misc_feature 240286 /note="C to U RNA editing" misc_feature 240292 /note="C to U RNA editing" misc_feature 240296 /note="C to U RNA editing" misc_feature 240301 /note="C to U RNA editing" misc_feature 240313 /note="C to U RNA editing" misc_feature 240338 /note="C to U RNA editing" misc_feature 240359 /note="C to U RNA editing" misc_feature 240364 /note="C to U RNA editing" misc_feature 240428 /note="C to U RNA editing" misc_feature 240497 /note="C to U RNA editing" misc_feature 240575 /note="C to U RNA editing" misc_feature 240580 /note="C to U RNA editing" misc_feature 
240626 /note="C to U RNA editing" misc_feature 240656 /note="C to U RNA editing" CDS complement(241306..241953) /gene="nad1" /note="orf215a" /codon_start=1 /protein_id="NP_085547.1" /db_xref="GI:13449365" /db_xref="UniProt/TrEMBL:P92528" /db_xref="GeneID:814625" /translation="MPTANQLIRHGREEKRRTDRTEVLVFGLLVTRIIRFVHSVLFPI PVFCSIKVLLDYFCSLPIIDKLSKKWQLIWFYVLSVILCKSLFAVGYLWMDDLSRAIS QFYPVVSGGLGGGNTPMPPTNPSEGGLLEGYYAHENEHSHDQQRGSPFWSKEYKESGS KRLFLNLEVEDQNTDTIGEQVKAESGKCEKIKAKIIAKTHELLVSEDTKFQIKTI" CDS complement(249272..249919) /gene="nad1" /note="orf215b" /codon_start=1 /protein_id="NP_085548.1" /db_xref="GI:13449366" /db_xref="UniProt/TrEMBL:P92529" /db_xref="GeneID:814625" /translation="MKMKSPLFRGPLVNSSTENPIHILTREKLRHQTSGTNSTEKVIL PNILHSYIKNLNLDFTSPYGSTGNNEVLSILRSRFKQSIFPSSGLKCLDTTGDFLIKN VLHKRYESVQKNISNALSSSINSRTAVFFCILFSITVLMEIAPGPLLKKPSLLFSDNL PNVLQYTRDVYVNHVCIIHKSLSPCECEEPLNRIIRDMFPQTTFDPLELQKPSPQ" gene complement(250080..250153) /gene="tRNA-Trp" /db_xref="GeneID:814607" tRNA complement(250080..250153) /gene="tRNA-Trp" /product="tRNA-Trp" /db_xref="GeneID:814607" CDS complement(251403..251897) /gene="nad1" /note="orf164" /codon_start=1 /protein_id="NP_085549.1" /db_xref="GI:13449367" /db_xref="UniProt/TrEMBL:P92530" /db_xref="GeneID:814625" /translation="MRKSADEMFIGVRRAPISSNVGGTSFYGGDEYCSYYQSNGGVAK EDDGSAKKGFRRTGKGKLTAEAVSEAINRAAQGLPFEVVYYPTAGWSDFVVKAEDVEA SMAIFWTPGTRVKMAMETEDSSRITWFQGIVFYTYQETGPWRGSPWNSFRYKIPFTIP LIFL" gene complement(254505..254571) /gene="tRNA-Gln" /db_xref="GeneID:814608" tRNA complement(254505..254571) /gene="tRNA-Gln" /product="tRNA-Gln" /db_xref="GeneID:814608" gene complement(256865..257476) /gene="ccb203" /db_xref="GeneID:814609" CDS complement(256865..>257476) /gene="ccb203" /codon_start=1 /exception="RNA editing" /product="cytochrome c biogenesis orf203" /protein_id="NP_085550.2" /db_xref="GI:26557000" /db_xref="UniProt/TrEMBL:Q9T6H7" /db_xref="GeneID:814609" /translation="VDTGREQAKRVVRNGKKETTTSPLCWTAGANTVVSDQDQEPIRI 
WILTCWWFLTVGILPGSWWAYHELGRGGWWFRDPVENASFMPRVLATARIHSVILPLL HSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWWFFLLMTGISMILF YQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN" misc_feature 257010 /note="C to U RNA editing" misc_feature 257086 /note="C to U RNA editing" misc_feature 257121 /note="C to U RNA editing" misc_feature 257133 /note="C to U RNA editing" misc_feature 257157 /note="C to U RNA editing" misc_feature 257176 /note="C to U RNA editing" misc_feature 257200 /note="C to U RNA editing" misc_feature 257218 /note="C to U RNA editing" misc_feature 257251 /note="C to U RNA editing" misc_feature 257269 /note="C to U RNA editing" misc_feature 257301 /note="C to U RNA editing" misc_feature 257412 /note="C to U RNA editing" CDS complement(258045..258398) /gene="nad1" /note="orf117" /codon_start=1 /protein_id="NP_085551.1" /db_xref="GI:13449369" /db_xref="UniProt/TrEMBL:P92531" /db_xref="GeneID:814625" /translation="MVASDSRPMRLRLRAELFLASFAVREESIRSKKEWTYISKYIKG ILKSRLSRREQSRWNIIDDTTSMAFFEEFASLNPVFHTFLFYGRRDGEDLSFHIVGFF RLSIRGYIFFLWESF" gene complement(260224..260601) /gene="rpsl2" /db_xref="GeneID:814610" CDS complement(260224..260601) /gene="rpsl2" /codon_start=1 /exception="RNA editing" /product="ribosomal protein L2" /protein_id="NP_085552.2" /db_xref="GI:26557001" /db_xref="UniProt/Swiss-Prot:P92532" /db_xref="GeneID:814610" /translation="MPTFNQLIRHGREEKRRTDRTRALDKCPQKTGVCPRVSTRTPKK PNSAPRKIAKVRLSNRHDIFAHIPGEGHNSQEHSTVLIRGGRVKDSPGVKSHCIRGVK DLMGIPGRRRGRSKYGAEKPKSI" misc_feature 260317 /note="C to U RNA editing" misc_feature 260318 /note="C to U RNA editing" misc_feature 260333 /note="C to U RNA editing" misc_feature 260381 /note="C to U RNA editing" misc_feature 260406 /note="C to U RNA editing" misc_feature 260456 /note="C to U RNA editing" misc_feature 260498 /note="C to U RNA editing" misc_feature 260518 /note="C to U RNA editing" gene complement(260647..261006) /gene="nad3" /db_xref="GeneID:814611" CDS complement(260647..261006) /gene="nad3" /EC_number="1.6.99.3" 
/codon_start=1 /exception="RNA editing" /product="NADH dehydrogenase subunit 3" /protein_id="NP_085553.2" /db_xref="GI:26557002" /db_xref="UniProt/Swiss-Prot:P92533" /db_xref="GeneID:814611" /translation="MMSEFAPISIYLVISLLVSLILLGVPFPFASNSSTYPEKLSAYE CGFDPSGDARSRFDIRFYLVSILFLIPDLEVTFFFPWAVPPNKIDLFGFWSMMAFLFI LTIGFLYEWKRGASDRE" misc_feature 260655 /note="C to U RNA editing" misc_feature 260660 /note="C to U RNA editing" misc_feature 260753 /note="C to U RNA editing" misc_feature 260757 /note="C to U RNA editing" misc_feature 260795 /note="C to U RNA editing" misc_feature 260796 /note="C to U RNA editing" misc_feature 260858 /note="C to U RNA editing" misc_feature 260924 /note="C to U RNA editing" misc_feature 260938 /note="C to U RNA editing" misc_feature 260943 /note="C to U RNA editing" gene complement(260963..261307) /gene="orf114" /db_xref="GeneID:814623" CDS complement(260963..261307) /gene="orf114" /codon_start=1 /exception="RNA editing" /protein_id="NP_085554.1" /db_xref="GI:13449372" /db_xref="UniProt/TrEMBL:P92534" /db_xref="GeneID:814623" /translation="MTQPAIGWRVGLGPSIIRGPLVGKSPWSVFMIYGRTSKKPGPSR TSFLVYKRKYSSRKAALGGTLSHKVCKPFGMGFCFFLYFSICRFFASKERENKVGCND VRICTNFYLFSD" misc_feature 260981 /note="C to U RNA editing" misc_feature 260999 /note="C to U RNA editing" gene complement(261105..261461) /gene="orf118" /db_xref="GeneID:814624" CDS complement(261105..261461) /gene="orf118" /codon_start=1 /protein_id="NP_085555.1" /db_xref="GI:13449373" /db_xref="UniProt/TrEMBL:P92535" /db_xref="GeneID:814624" /translation="MIGGDSVEAIERRLLAKYPEGSPSAEIIEMARIEAEDLFEIKAQ IIQRMALYDPTGDWMARGARALDNPRTTSGEESLERLYDIWKDLQETGPLSDEFSRLQ EKVFLKKGGPGGDPIA" CDS complement(261617..262078) /gene="nad1" /note="orf153b" /codon_start=1 /protein_id="NP_085556.1" /db_xref="GI:13449374" /db_xref="UniProt/TrEMBL:P92536" /db_xref="GeneID:814625" /translation="MKNMVRLLLPLLGALAGSFCARFLGSEGSAIMTTTRVSFSSILV VSFLFCFHFHSFRLKGPQKKIIYLFLVFSMGFVGSLIRIEVIHLVGGLALPVLGPLVL 
NAIGGQALPSTGPSGSGSSSMWEEDSFELGVLEESDSPPAGGPERKRGNPR" CDS complement(264113..264433) /gene="nad1" /note="orf106e" /codon_start=1 /protein_id="NP_085557.1" /db_xref="GI:13449375" /db_xref="UniProt/TrEMBL:P92537" /db_xref="GeneID:814625" /translation="MQQVFRREGINLYYYSNKTKKFSLDSWYLPQLHLLESKGNKKSK AATDQYFIHPSRTRQERDLTDRKHRPEQQQLQRRVTRWKKEVTTRSRPKETSSTHLPY HGSY" CDS complement(270238..270561) /gene="nad1" /note="orf107f" /codon_start=1 /protein_id="NP_085558.1" /db_xref="GI:13449376" /db_xref="UniProt/TrEMBL:P92538" /db_xref="GeneID:814625" /translation="MTERNASGRMNTKGRSIKETKKAMNEEVGPFTLFLVTLGADLIN ALQREGRLGLSHLGMAEHKRFENQVKIRKGREKTERKAVAPVRAREIKNKDSCFPHTH IGCEK" CDS complement(272944..273423) /gene="nad1" /note="orf159" /codon_start=1 /protein_id="NP_085559.1" /db_xref="GI:13449377" /db_xref="UniProt/TrEMBL:P92539" /db_xref="GeneID:814625" /translation="MAFNLFTTFTERLRLVSWRSVFEDTRSLRRFLIRIALVATGLVS KESAIVCHVLAGKVLRMYKTSRDPCLRIIVSLPLLSIYFRPDSIAKSNRKQRPWKEKI FHPDSRKMSCHCEPLPYQTRFDRQAPLQEYSIYPKSRKRLELDPQQNSTVVVQHDKF" gene 274899..275222 /gene="orf107g" /db_xref="GeneID:814599" CDS 274899..275222 /gene="orf107g" /codon_start=1 /product="ORF107g" /protein_id="NP_085560.1" /db_xref="GI:13449378" /db_xref="UniProt/TrEMBL:P92540" /db_xref="GeneID:814599" /translation="MEFFQLSRKIGCNITQSPIYLFLARKTNRCLCPAHTDFILAVVG PTTLLLSFEISYEKKVDKKNNFKTGIVVQSVRAPPCQGGSCGFEPRQSRPSHNCVLRP GLATK" gene 275103..275176 /gene="tRNA-Asp" /db_xref="GeneID:814612" tRNA 275103..275176 /gene="tRNA-Asp" /product="tRNA-Asp" /db_xref="GeneID:814612" gene 278649..279152 /gene="atp9" /db_xref="GeneID:814613" promoter 278649..278656 /gene="atp9" /evidence=not_experimental CDS 278895..279152 /gene="atp9" /codon_start=1 /exception="RNA editing" /product="ATPase subunit 9" /protein_id="NP_085561.2" /db_xref="GI:26557003" /db_xref="GeneID:814613" /translation="MTKREYNSQPEMLEGAKSIGAGAATIASAGAAIGIGNVFSSLIH SVARNPSLAKQSFGYAILGFALTEAIALFAPMMAFLILFVF" misc_feature 278947 /gene="atp9" /note="C to U RNA editing" 
misc_feature 278977 /gene="atp9" /note="C to U RNA editing" misc_feature 279061 /gene="atp9" /note="C to U RNA editing" misc_feature 279118 /gene="atp9" /note="C to U RNA editing" CDS 279463..280251 /note="orf262" /codon_start=1 /protein_id="NP_085562.1" /db_xref="GI:13449380" /db_xref="UniProt/TrEMBL:P92541" /translation="MYLLIVFLSMLSSSVAGFFGRFLGSESVSRFNLIIFLILLVFSI CLFRSLKQYLGKRMTQWCYLALVCQISLFLVLLRSHILAGFGTFSADVFTVFMGTFSV TGSSGGIVNHQDGASSEWFTYTSDMVEDSASSGRTSSSVNQPIPEEQAWEREARAQEH DRISAEVETITSACENLEAAMVRKAQILLHQRGVTLGDPEDVKRALQLALHDDWEHAI DDRKRHFTVLRRNFGTARCERWNPFIDELRGLGNHQVNARHYVD" CDS 281614..281931 /note="orf105a" /codon_start=1 /protein_id="NP_085563.1" /db_xref="GI:13449381" /db_xref="UniProt/TrEMBL:P92542" /translation="MQSPAMKRIKSSSHSRWDGSGSVNEMPFPSTIRLQGSFWECSTR RHMCYILRYLFRANGHRHFSYERLDCRNQTLRLPDHLYQPSRPHLLPHLSQLLLVRDS GYL" CDS complement(283034..283789) /gene="nad1" /note="orf251" /codon_start=1 /protein_id="NP_085564.1" /db_xref="GI:13449382" /db_xref="UniProt/TrEMBL:P92543" /db_xref="GeneID:814625" /translation="MSVLRRIPQDGTFHQEGPIHRLAKRRPRFIASFDLSAATDRWPV PVIYELMACLFGQTMASCIVNGALALNSCSLKSVTGRHDEVVFVAGQPLGYYGSWALF ALSHHAIVWLAALRAYPHQTRPFLDYALLGDDIVIADRSVAKEYRSLLDALQVDISDA KSIVSETGCLEFAKRFWVKIMSKDLSPVSAKAVLESYFLVGTQQLAYKYKLSPKTCLR LNKAGYRVLGQMDTTLRPYPGVLVGFRRYLVSF" exon complement(287917..288108) /gene="nad1" /label=nad1_ex3 misc_feature 287942 /note="C to U RNA editing" misc_feature 287997 /note="C to U RNA editing" misc_feature 288006 /note="C to U RNA editing" misc_feature 288031 /note="C to U RNA editing" misc_feature 288041 /note="C to U RNA editing" misc_feature 288077 /note="C to U RNA editing" misc_feature 288084 /note="C to U RNA editing" misc_feature 288085 /note="C to U RNA editing" misc_feature 288087 /note="C to U RNA editing" intron complement(288109..289004) /gene="nad1" /number=2 exon complement(289003..289083) /gene="nad1" /label=nad1_ex2 intron complement(289084..318003) /gene="nad1" /number=1 CDS complement(289197..289517) 
/gene="nad1" /note="orf106f" /codon_start=1 /protein_id="NP_085566.1" /db_xref="GI:13449384" /db_xref="UniProt/TrEMBL:P92544" /db_xref="GeneID:814625" /translation="MMVTALQILFSLIRYVTETIRSVSVLFSDSEDEPDDEASSSDKD VSDATLPARTTYSIVIFLAGSVRRDKREMLSISQKGTLSPAMLPRPAYRGIEREDRGG VQSK" repeat_unit 290991..297580 /note="repeat III" CDS complement(292973..293431) /gene="nad1" /note="orf152b identical to orf152a" /codon_start=1 /protein_id="NP_085567.1" /db_xref="GI:13449385" /db_xref="UniProt/TrEMBL:P92545" /db_xref="GeneID:814625" /translation="MPTILSSKPAFNSLFSYHLIGLISNKLVTLAPDYTGTKKTTWGA RLHLQELRVQTKHRQIEPDIKNLPLQPIRYGSFRPVFHWIEKPCMLIGLCGLHSEVSF IANWPWGKPSEIGGCSIPCMLTGRGSELSYHIRASRRPLPGNRFHFQSFF" CDS complement(293976..294296) /gene="nad1" /note="orf106g" /codon_start=1 /protein_id="NP_085568.1" /db_xref="GI:13449386" /db_xref="UniProt/TrEMBL:P92546" /db_xref="GeneID:814625" /translation="MLHRGRSCLTGLFPCYLLSNWLNSNLCWIPLKLVIPCFQLIVES YLLEFLLLLAISTCLLGEDSLIWLTLVVAHSKSRQSSSQEPLDTRMATRALLDQLRSD RRHN" gene complement(296688..296784) /gene="tRNA-Ser" /db_xref="GeneID:814614" tRNA complement(296688..296784) /gene="tRNA-Ser" /product="tRNA-Ser" /db_xref="GeneID:814614" gene complement(296820..298204) /gene="atp6-2" /db_xref="GeneID:814615" CDS complement(296820..297869) /gene="atp6-2" /codon_start=1 /product="ATPase subunit 6" /protein_id="NP_085569.1" /db_xref="GI:13449387" /db_xref="UniProt/TrEMBL:P92547" /db_xref="GeneID:814615" /translation="MERLTRLNHFLVNMRWDFYEGVIQAGYIRNLQRELDHTPAELLG SKLDLIFFRESLNLSTYVNNWYMQNLGVPGPVNFIEKYHDACFSNYMKLMEIPSPLDQ FEIVPLIPMHIGNFYFSFTNPSLFMLLTLSFFLLLIHFVTKKGGGNLVPNAWQSLVEL LYDFVLNLVKEQIGGLSGNVKQMFFPCILVTFLFLLFCNLQGMIPYSFTVTSHFLITL ALSFSIFIGITIVGFQRHGLHFFSFLLPAGVPLPLAPFLVLLELISYCFRALSLGIRL FANMMAGHSLVKILSGFAWTMLCMNDIFYFIGALGPLFIVLALTGLELGVAILQAYVF TILICIYLNDAINLH" promoter complement(298196..298204) /gene="atp6-2" /evidence=not_experimental CDS complement(300966..301301) /gene="nad1" /note="orf111b" /codon_start=1 /protein_id="NP_085570.1" 
/db_xref="GI:13449388" /db_xref="UniProt/TrEMBL:P92548" /db_xref="GeneID:814625" /translation="MMTLFTQEPSFSIKCLPTTPSKPHRSLLAARLLFLSISDFSCRC PKIKLSLKGYRFIYCVRVVPSPSSLAKAQSDRLNIGLIRRSLKLCCARDIRLELGQVL ISIPLLLFL" gene complement(302166..303689) /gene="atp1" /db_xref="GeneID:814616" CDS complement(302166..303689) /gene="atp1" /codon_start=1 /exception="RNA editing" /product="ATPase subunit 1" /protein_id="NP_085571.2" /db_xref="GI:26557005" /db_xref="UniProt/Swiss-Prot:P92549" /db_xref="GeneID:814616" /translation="MELSPRAAELTNLFESRIRNFYANFQVDEIGRVVSVGDGIAQVY GLNEIQAGEMVLFANGVKGMALNLENENVGIVVFGGDTAIKEGDLVKRTGSIVDVPAG KAMLGRVVDAMGVPIDGKGALSDHEQRRVEVKAPGILERKSVHEPMQTGLKAVDSLVP IGRGQRELLIGGRQTGKTTIAIDTILNQKQINSRATSESETMYCVYVAIGQKRSTVGQ LIQTLEEANALEYSILVAATASDPAPLQFLAPYSGCAMGEYFRDNGMHALIIYDDLSK QAVAYRQMSLLLRRPPGREAFPGDVFYLHSRLLERAAKRSDQTGAGSLTALPVIETQA GDVSAYIPTNVISITDGQICLETELFYRGIRPAINVGLSVSRVGSAAQLKAMKQVCGS SKLELAQYREVAAFAQFGSDLDAATQALLNRGARLTEVPKQPQYAPLPIEKQILVIYA AVNGFCDRMPLDRISQYEKAIPNSVKPELLQALKGGLTNERKMEPDAFLKERALALI" misc_feature 302206 /note="C to U RNA editing" misc_feature 302275 /note="C to U RNA editing" misc_feature 302398 /note="C to U RNA editing" misc_feature 302512 /note="C to U RNA editing" misc_feature 302580 /note="C to U RNA editing" CDS complement(303836..304720) /gene="nad1" /note="orf294" /codon_start=1 /protein_id="NP_085572.1" /db_xref="GI:13449390" /db_xref="UniProt/TrEMBL:P92550" /db_xref="GeneID:814625" /translation="MITRLFAQLVSLSIVTYWNDAIVATNFSWLFITFFVMTFTFRTF SRYFKKPIIWTLYFFLCLIAFLLLWAARIHINILFSFAFGDVYSFFMAGVFLFYGFGE LLPIGSDSDVGEASWVVNPATGASGSGGNGWTESAANDPAREVSLAPFPPQLTHPVPF PAEPGSPDPVSPPPPIASFYSRIERAESLHAGNIELAEDLQRIQEMERNLENERSPYR GRELAARIDWEVRELEGKVARNRAWDMVRDAQLDIWRQGLDQELVRQQENESRLEERR FQSHSTNSLFEADSSRDN" CDS complement(304779..305084) /gene="nad1" /note="orf101b" /codon_start=1 /protein_id="NP_085573.1" /db_xref="GI:13449391" /db_xref="UniProt/TrEMBL:P92551" /db_xref="GeneID:814625" 
/translation="MIHQINSINMEIILTDVARDALQEKIVSQLSILLRVYRDTNTSE SVTLPLSGVNLQEVAARSFFNNESIPWFFHLFRPCQSGHEKKPLCPQALELVLSFSS" CDS complement(305221..305562) /gene="nad1" /note="orf113" /codon_start=1 /protein_id="NP_085574.1" /db_xref="GI:13449392" /db_xref="UniProt/TrEMBL:P92552" /db_xref="GeneID:814625" /translation="MKRYATMLSEFTGVVPSTFLSKLFLKCEINLSIGRVQDRKEPYR QAKSAVRLHGLNNTQYCTGRIRFAARSIPQSPLVPFRLFPQFPTPSVRQNLTTLHFDT REEDRALVSSG" promoter complement(305586..305594) /gene="nad1" /note="orf113" /evidence=not_experimental CDS 308932..309369 /note="orf145b" /codon_start=1 /protein_id="NP_085575.1" /db_xref="GI:13449393" /db_xref="UniProt/TrEMBL:P92553" /translation="MWSYEGKCGFLLLSVYKEQVLDSYSPLTKENGISSNPRYIKRKF PFDSGFPFTRKLPAKVESFLCLPLFLSFLVANLILWLSFHSARVGHQKLSYHLLEWKA FPSSFRNKESKATCDLSSWSNPYFKRKAQIPFSFSRYLLKYLF" CDS complement(309338..309640) /gene="nad1" /note="orf100c" /codon_start=1 /protein_id="NP_085576.1" /db_xref="GI:13449394" /db_xref="UniProt/TrEMBL:P92554" /db_xref="GeneID:814625" /translation="MSPTLSTGNRDQRGSSRSYAFVSLASHHFTPQGKITITSSLTRK IDPGNFQGQDVVDFSPWVVLPDEDKVAFLISCRVTAKIRKSGLPQLFKTNTSRDIG" CDS 310514..310882 /note="orf102" /codon_start=1 /protein_id="NP_085577.1" /db_xref="GI:13449395" /db_xref="UniProt/TrEMBL:P92555" /translation="MGYGVCHFYLLFIINGAPQGLVTPSRGLRQGDPLSPYLFILCTE VLSGLCRRAQEQGRLPGIRVSNNSPRINHLLFADDTSSARWIPLAAQIWPIFFLSMRL FQGNPVNHPMSNLYFLGSLP" CDS complement(312545..313162) /gene="nad1" /note="orf205" /codon_start=1 /protein_id="NP_085578.1" /db_xref="GI:13449396" /db_xref="UniProt/TrEMBL:P92556" /db_xref="GeneID:814625" /translation="MQPDLTLLGKLRSTWASATVNVIHPISLCLSWFLGTIGCSSPLP LRCADLRILLLKKKEFCLLPLFYHLGIFQHLFYPIIPLLAFCFYAPRLVCPAASLEFQ RRYVVWILAVSRHIVFLENSYYIMLLHPHHLHHPHPPFLIFLFLILRKLRRNRSVKAQ RIMQRSCHRLLFAPVGNDSELSAPSAPSESVVPLRRFNRQSVSTV" misc_feature 314410 /note="C to U RNA editing" misc_feature 314421 /note="C to U RNA editing" gene 314627..315073 /gene="rps7" /db_xref="GeneID:814617" CDS 314627..315073 /gene="rps7" /codon_start=1 
/product="ribosomal protein S7" /protein_id="NP_085579.1" /db_xref="GI:13449397" /db_xref="UniProt/Swiss-Prot:P92557" /db_xref="GeneID:814617" /translation="MGGLDGEQKLLIKKLVNFRMKEGKRTRVRAIVYQTFHRPARTER DVIKLMVDAVENIKPICEVAKVGVAGTIYDVPGIVARDRQQTLAIRWILEAAFKRRIS YRISLEKCSFAEILDAYQKRGSARRKRENLHGLASTNRSFAHFRWW" exon complement(318004..>318390) /gene="nad1" /label=nad1_ex1 misc_feature 318015 /note="C to U RNA editing" misc_feature 318083 /note="C to U RNA editing" misc_feature 318084 /note="C to U RNA editing" misc_feature 318126 /note="C to U RNA editing" misc_feature 318224 /note="C to U RNA editing" misc_feature 318389 /note="C to U RNA editing" CDS complement(318588..319463) /gene="nad5" /note="orf291" /codon_start=1 /exception="RNA editing" /protein_id="NP_085580.1" /db_xref="GI:13449398" /db_xref="UniProt/TrEMBL:P92559" /db_xref="GeneID:814567" /translation="MIVLKWLFLTISPCDAAEPWQLGSQDAATPIMQGIIDLHHDIFF FLILILVFVLWILVRALWHFHYKKNAIPQRIVHGTTIEILRTIFPCFISIFIVEPSFA LALDDAAEALFPNTAPTPSNTSSSEDSFGLRVLSEPWPITRNLGLESSICNRIRLLEA ANSPFLLGKEKGQYWGEIQECLYNVSEQREYYRLLDFENRDLQIRERKHSCLEVFRGV LLRNPYLEERAAYSPQEAFFDFLNERRDALDISNPGSSPAEMDRLEILFLGEIERDLL RRGDESLYIKQLLGD" CDS 324259..324594 /note="orf111c" /codon_start=1 /protein_id="NP_085581.1" /db_xref="GI:13449399" /db_xref="UniProt/TrEMBL:P92560" /translation="MTFVPTDFLIRTPDDPAYFKDRLASPFSFRGCSKTTSTSSSIYS KKKASTATYFRVDPVPRGSQSSRVCEPKTKLIVYQPGNYQKKVKRQVTDPLSLMDKVK KRIDKTEIL" CDS 325300..325710 /note="orf136a" /codon_start=1 /protein_id="NP_085582.1" /db_xref="GI:13449400" /db_xref="UniProt/TrEMBL:P92561" /translation="MPRTELILNAAVILYTMIPPDAHSLGSEGRVVNGNWRDTSDVKE GSLPREVTKQVNGSLSSRTKQVNEFSKHTRFLVDISFSCCSLINRSLWESAQKDELSD SFGKALTTKPECLAVRETPRNFRRNLCLVIPSLN" CDS 326099..326509 /note="orf136b" /codon_start=1 /protein_id="NP_085583.1" /db_xref="GI:13449401" /db_xref="UniProt/TrEMBL:P92562" /translation="MLKRKLKPKRLQLPPQDVVFEGEAAMNEYTFYRNWVESWLQHIR SYYLLFIDGDPSLSKFFEIEICAHSWKRSTFDQQVFKFGLLWECVDIARSRTVYWQCA 
LGTGHIQEDKVSEATSPFTDDSCTNSCLSRMTGQ" gene complement(327890..333105) /gene="nad2" /db_xref="GeneID:814618" exon complement(327890..328078) /gene="nad2" /usedin=Y08501:nad2_cds /usedin=Y08501:nad2_mrna /label=nad2_ex5 misc_feature 327900 /note="C to U RNA editing" misc_feature 327954 /note="C to U RNA editing" misc_feature 327957 /note="C to U RNA editing" intron complement(328079..329734) /gene="nad2" /number=4 CDS complement(329082..329465) /gene="nad2" /note="orf107h" /codon_start=1 /protein_id="NP_085585.1" /db_xref="GI:13449402" /db_xref="UniProt/TrEMBL:P92563" /db_xref="GeneID:814618" /translation="MLPAGCWNDTSRDGPGFRKMKGPKVEIGGYKFPISLGAENESTS RCDTAFSFLVGKERRSPSEPNRPMKNKRRAKPNGEAHAEQARRKISVEEKQPSSFPSH PGPKAVQSFLAKSRIWGFLLRYLTI" exon complement(329735..330306) /gene="nad2" /usedin=Y08501:nad2_cds /usedin=Y08501:nad2_mrna /label=nad2_ex4 misc_feature 329737 /note="C to U RNA editing" misc_feature 329766 /note="C to U RNA editing" misc_feature 329767 /note="C to U RNA editing" misc_feature 329813 /note="C to U RNA editing" misc_feature 329886 /note="C to U RNA editing" misc_feature 329955 /note="C to U RNA editing" misc_feature 330051 /note="C to U RNA editing" misc_feature 330055 /note="C to U RNA editing" misc_feature 330085 /note="C to U RNA editing" misc_feature 330093 /note="C to U RNA editing" misc_feature 330204 /note="C to U RNA editing" misc_feature 330225 /note="C to U RNA editing" intron complement(330307..332944) /gene="nad2" /number=3 exon complement(332945..333105) /gene="nad2" /usedin=Y08501:nad2_cds /usedin=Y08501:nad2_mrna /label=nad2_ex3 misc_feature 332989 /note="C to U RNA editing" misc_feature 333042 /note="C to U RNA editing" misc_feature 333099 /note="C to U RNA editing" gene 337669..337741 /gene="tRNA-met" /db_xref="GeneID:814619" tRNA 337669..337741 /gene="tRNA-met" /product="tRNA-Met" /db_xref="GeneID:814619" CDS 346757..347194 /note="orf145c" /codon_start=1 /protein_id="NP_085586.1" /db_xref="GI:13449403" /db_xref="UniProt/TrEMBL:P92564" 
/translation="MTKREYNSQPEMKEEVLAYLLQLSASLVLPVAIWLIAAGQIFTC LRGYTISNYQEKVEEKLCSTLVDKISEKLADLFPVYGITPSRNAPFPTILEQLLATVS QEERLAYLSNMYNSLIEMGIDSPCFYPIVQTFLFLMGGGGGPA" gene complement(349830..351413) /gene="cox1" /db_xref="GeneID:814620" CDS complement(349830..351413) /gene="cox1" /codon_start=1 /product="cytochrome c oxidase subunit 1" /protein_id="NP_085587.1" /db_xref="GI:13449404" /db_xref="UniProt/Swiss-Prot:Q07063" /db_xref="GeneID:814620" /translation="MKNLVRWLFSTNHKDIGTLYFIFGAIAGVMGTCFSVLIRMELAR PGDQILGGNHQLYNVLITAHAFLMIFFMVMPAMIGGFGNWFVPILIGAPDMAFPRLNN ISFWLLPPSLLLLLSSALVEVGSGTGWTVYPPLSGITSHSGGAVDLAIFSLHLSGVSS ILGSINFITTIFNMRGPGMTMHRLPLFVWSVLVTAFLLLLSLPVLAGAITMLLTDRNF NTTFFDPAGGGDPILYQHLFWFFGHPEVYILILPGFGIISHIVSTFSGKPVFGYLGMV YAMISIGVLGFLVWAHHMFTVGLDVDTRAYFTAATMIIAVPTGIKIFSWIATMWGGSI QYKTPMLFAVGFIFLFTIGGLTGIVLANSGLDIALHDTYYVVAHFHYVLSMGAVFALF AGFYYWVGKIFGRTYPETLGQIHFWITFFGVNLTFFPMHFLGLSGMPRRIPDYPDAYA GWNALSSFGSYISVVGICCFFVVVTITLSSGNNKRCAPSPWALELNSTTLEWMVQSPP AFHTFGELPAIKETKSYVK" promoter 355212..355220 /evidence=not_experimental CDS complement(360717..361052) /note="orf111d" /codon_start=1 /protein_id="NP_085588.1" /db_xref="GI:13449405" /db_xref="UniProt/TrEMBL:P92565" /translation="MKISYFIRRGKKTSRRSHFIKMKKNIITTQLFKPDNAFIFFSGI HGSVNRATYKYKISKTFGRFLAHISCLICILSKRIFVLSFSVIGSFCHPSIVHFDCLL FFLDTTPCL" gene complement(361062..361179) /gene="rrn5" /db_xref="GeneID:814621" rRNA complement(361062..361179) /gene="rrn5" /product="5S ribosomal RNA" /db_xref="GeneID:814621" gene complement(361350..363284) /gene="rrn18" /db_xref="GeneID:814622" rRNA complement(361350..363284) /gene="rrn18" /product="18S ribosomal RNA" /db_xref="GeneID:814622" CDS 363725..364042 /note="orf105b" /codon_start=1 /protein_id="NP_085589.1" /db_xref="GI:13449406" /db_xref="UniProt/TrEMBL:P92566" /translation="MKYHFSSMEPWWKREFSFCIPAIYIKMASISLFQNSWLKMKHLP SCLFTQTTNTLGIYRKKKPNHSRDNPRINSNLSTNYAQAKSVERSRSNSLNSGPNPLE NAT" CDS complement(366086..366700) /note="orf204" /codon_start=1 
/protein_id="NP_085590.1" /db_xref="GI:13449407" /db_xref="UniProt/TrEMBL:P92567" /translation="MFGGRRRRLPQDGTFNQTQPFDRLVGSRHSFSFDLKSATDRWPL VFLFEVVQYLFDRSFASSVVNSAFACNIFEVPFVKLKRRFSQVCFVAGQPLGYHGSWP TFALSHHILVWWCAKQVHPGVRFTSYAVLGDDVVIADQEVAKVYESALGGLGVKISYQ KSLIPIQVLLSLLNASGLGNLVLKREIFPRESGVLSLMSVPSLA" ORIGIN 1 ggatccgttc gaaacaggtt agcctactat aatataagga ttggattcta ataagttcga 61 aacaggttag ccttagccta ctataggatt agatctttct tatcaaccta ctaacttctt 121 ccttgttggg atgagaaacc cttttgcaac caagcgtgct ttgagtttgt caagggaccc 181 atctgcattc agtttcactc tgaaaaccca tttacaaccg agaagattca tgtcaggtga 241 tgcgggaact aagtcccaag tgtgattctg tgttaatgcc gacatctctt cttgcatagc 301 ttgtctccat cctgggaggc agacgtaatg gtttttggtt cagagggagt gtatttttgt 361 gtaaacaggt tgtaacgagg attaggcttg cgaataccat cctttgcccg agtgatcata 421 tgatgtctat taggtgaaag tagctcagga gcagctgtcc caacatcaaa aaaggtaccg 481 ctgtcgccaa taggaacagg atctgagcct gccgtacgca caggacagtc tctttctgat 541 gtggtagcag ttccaggagc aatcgcagag acaattgaag gatctgcaga ttcgcaatca 601 gagcctgtga aaccgggaag atgtcgagat agcaggagaa gaattagcag tatgtgagtg 661 cggaagcaga gtggaggatc gagggacctg tgaaggtgtg ataaaggaac tgtctgttga 721 aacagaaggg acatatgaag gtttagaacc aagctgccaa gcacgaagca gagaacttgg 781 atacggtggg atgagatggt gataacagtc ctgaaaacga aagactagag ctcttgccca 841 actgacaggc ttcacaaaca gaaagatccc ttttattaat aactatagct ttactagtct 901 tgagttgttg gagaacttga ggatttggat gcccaagcct ataatgccaa gtgagctcac 961 tagctgcttg ttgtcttgta gagtagaaga cttatagatt aaaattctcc aacatataga 1021 tgtccttaca ccgttttcct ttgctcagca ggctccgtgt ttgcttgtcc tttatgcata 1081 cttcgttagc atcaaactca aagaagcatg gataaatcat cacaaagttt ggacacagag 1141 agaagagact tagttatgaa aggacaaaca agaacttcat ttaatggtaa actacctgat 1201 gagcttggta gattggtaga tccaacatga gtgatgggca agaaggctcc atccccaacc 1261 atcacactgt cggatcccac ataaggttgt gactgttgga ggtgatgagc agaattggta 1321 atgtgagagg aggcaccaga atctggaaac cattctgtac cagaggtatc cgtgatctga 1381 agagcagcaa gagcttgcgg aatatcactg ttctggaagg agttgtcata ttggttccag 
1441 cattttaggg ctgagtgacc agtttagcca cagatttggc aaacgggtcg agaagagaac 1501 tgatcagagg agtgggattg atgcaaacta gttgagggct tgatagcgtt gatgaaaccc 1561 tcttcctctt gtagagaatg agccgcgacc acggtagtta ccaaaccgag aataggactg 1621 tcatacgagt ataccagagg ctgaaggtat tgagtgatga tccttcctca accctaagct 1681 aggcgaatca ataacaaagc aagactactt tccatcagta cggggtacgc cataaacgaa 1741 gtcagaaggt tgtcaatcaa atatggtacg cgatgcttgt ccaaataaaa aacgatccct 1801 taaaagtgag ttagaaaggt aagagtcttg tacctagtag aacttctaat aactaataaa 1861 agctattttc tcagaagcga gagcgcacct ctcgcaacaa gtgctcgagt cactccgcac 1921 ctcgaaaaac aagtctcagg aagtgagtga gttatagtat tacaggagct aatagcttca 1981 gcagagaaat aagcacagga aatactggtg ggaatctcac ctccaagcct ttcatgggcc 2041 cgcgttgaaa tacccaacaa aaatccaaag atcattgata gcagaatgaa gaaactgaag 2101 atcctgctat aaagatctcc tcataatgta atgaaattat tataagaaat aaaatgaaag 2161 cgcagctaac attatccagg aacaaggaac aggtgatgga ttgagcagta ataaaggtaa 2221 tggttggaag ctgatggatg gccgagtaaa gaacccagat attagaaggg agaccgttac 2281 agtggttcaa ggtagctaaa ttcatgtttc gcgaatcata gaaaagaaag aagaagggat 2341 attggaaaaa agaagtcatg gatttagcaa tacaaagagg ttcagctgtt tccaaatacg 2401 gcggatctgg tgataaaggc caagggcgtg ctcttctatt aaaggaagaa cagaaacgaa 2461 aaagctctct ttgcgtacta tggatctctt acgtgcgata ttccttgctt tgactttgac 2521 gagcagcatc cagtacattc accccggctg ggcagtagcg ccaagaagaa cttcaagcta 2581 agagcgaaga gaaggtaatg atttcgctca gtagaaggtc acctacatgg atattaaggc 2641 tataagccgc aggtaagata tagttcatcg aaggggaatc ttgaatcaaa tgtcgattca 2701 tcttaattgt acgtcaattc taattcaatt tgttctaaac ttcctcgggg ctaaagcctc 2761 gtgattcaat tatggctctt tgctgctatt aaggtgatag tatctgctta gcccatagta 2821 ataagatggt agtgaccgct tatcctaagt cttccgctgg catgggtgta actgtccttc 2881 cagagtacct caaacagtcc tcttacgaag cctattcccg gccttactct gcattctttc 2941 tttctggttg tactaagcag gagcgttccc ccttactggc taggcggcta gtagatgctt 3001 ggctttcatt ccattccatt ctgatgataa atgaagaaag aggtgtcgga ctgggcatat 3061 aagcagctat cagaaatttc actcatgata ccaggcgcga ggagcctgtt taagaccata 3121 
gcttttcgaa acctgcaaag agaagaagag tatcgaccgg gaggttgaga catgactctt 3181 ttcggttaag tgaccctgta gaaaagcgtt gttgacatcc aattgacgaa gagtccagcc 3241 atggtataga gtaagactct cagacgaata gtgataggct taaccacagg gcttaatcta 3301 tcaaaagact ctctcccagg ccgctgattc ctgtctaaaa cacgagagaa ccttgccttg 3361 tgggtgtccg atgccgaccc cgttgtgctg tgcgtcttgt ctttcccaac gaaagtgatg 3421 ccgattcagc gtgtgtggcc tgatgcggtt cagcctacca attagtgggg caaggagata 3481 tggtttcctt aggcataagc atgtcatgta tatgtacaca aggggaatag caactaaagc 3541 tcaatagcac cttagtgaac tacaagggct ctaccgctcg ggtccctcaa ccgagagccc 3601 ctcctcgcta cgatccaccc tttctctagc cctctactca accgctttag ctggcccgct 3661 tctgacagaa gtagcagatt caatctgtga atcccgtcta gagctcctca cactgtccgc 3721 atcatcatca acggcaagcg gatcagattt taagagcttc aaaatacata caatacccta 3781 aggagtgagg gatgtacctt ttggaaaggg actgactgaa cggaatcaat gacttacatt 3841 aacattgctc ttaaaagaag tcgaattccc ttcattcaga gcataattaa tctagtcact 3901 gatttgagtt atatagttat tattcatgtt tgttgttcgt tggacctttt ttgaaaggct 3961 ttgttaatgc ttatcgagtt atgatataag aaagggggac gactgaggaa atgccatgct 4021 aaggcctatt tccctgctgt ttcaacttct ttctatagcg attggacaca agtttttctc 4081 aagttaatgc aactccgttt caattccata atttgactca aactacatgt caatccccca 4141 gtccactata ggggagagtt cactccagtt agggggtatt accagcggct tcgaccacat 4201 cctatcctct ctagctactg gcttctgtgg ggagggcaaa cttctatatt tccctgctag 4261 tgcactactt ccttttcttt cagaacagag acggagccct gctttaaatt gacagaggag 4321 ttattggacg ccgggtatat ccagccatcc aaagcacctt atgaagcccc agtgttgttc 4381 cagaagaagc gggagcctac gtctatgcat cgactaacct atatctaaac aaggttaccc 4441 tcaagaaaat aagtacctga ttctgcgcgt agaagaccta ttcgatccgt tgggagatgc 4501 aagcttcttc accaagttgg atctaaagcc ggggtactac caggtgcgca tcgccgaggg 4561 taatgaacca aagaggagta tactttatga atgggaaagc agtcatcaag ccagatgcgg 4621 atgaaattct aactgctgcg cttacgcctt ttgattagac aactcctagt gccaagctaa 4681 ctaactagga ttcgttcaaa ggattcgtga aaacagaaag aagagctagc gactctgacg 4741 aatctaatgt tgcccgcgtt gaggcgaaaa gaaaggcgga ttccctatat aagtgaattt 4801 gttcttcaat 
cgattcatct cggccttaga ctgctacggt tagctcttgc cccgagcacc 4861 tggactggct atctcgaaag aaatgctttc atatcatagc tgctgacttg gctttgaaac 4921 tataggagct gaaggaacgg ctactgattt ggacctaaat catttcatcc ggaagtgact 4981 gcactcgcct aagccaaaac caagctactt tcataaccac caggaaagcc tagctacttc 5041 ttcgttcctc accctgtccc ttacctctta ccgtaggaaa agccgttgat attcgtagat 5101 cgttcgttga ttcccttgct tttgggcgat gattcgattc ttctgggctt gttgccacta 5161 aagaaagagt tctgccttct agccaagcct ttgagaccaa gtcaagcttc ccagcagtca 5221 accaagaaag agtgtttctc ccctactgaa gaaatggaag tcagtctgct ttccttctta 5281 gctttgttca catctgctct ttccaggact ggctatctct caatgtattc acgcatctcg 5341 aaatagtgaa aagaagtagt ccctcctccc agagctgaaa tagctggaaa aaaacttcaa 5401 cagaagtaat gccgactctt ctacttcctg gtacttttga ataaggcaag gcctttacct 5461 attccctatc acaaccactt tcagtagagg aacttcctct ccttttagtc gagttatagc 5521 tttcgcttct tcagatcctg atccttcatc agaggagatc gctttacttt tttataagag 5581 ccattggcag gaccaagaaa ctcccattca aggcgccaat cattcataaa taattcttat 5641 atatgatatg caattcgatg attcaatttc ttgtggaaaa gactagcgca atggcttaat 5701 gcatttcatt ggcagtttgg tcctaacaaa ttatgcttct tgggagagga gaacttccac 5761 caagcaatat atttgagttg gctaacccaa tgattgagct agcttctaat ccatttcaat 5821 ctcatcttac acagcagaag ttgtggattc atacctggaa gcaggcaatg ccgccacatc 5881 tcccctgtga aagaaggtta cgcagctgat tcggttaatc acatgtcaga ggcatctatc 5941 taaatcgcac atgtcagaac aagacctagg gatccggtgg caccaggagc gggtagccga 6001 gttgagatca ttccgttaag acacaaccac ttttacagta aacatgcatc cgcccagcgc 6061 cccctcctta cgaatggtta cgggactaga gcgagtctct ttttctttgc aagaataggt 6121 ctgagcctga tgacattcct gctttccttg ccatgtccaa ctaaagtgaa agttaatcaa 6181 ctgcaaggag gaatcgacct ttttctacta ttcatttgcg ccggaaaacc ccttcatcat 6241 ttgccctgaa ctggtgaaag gaataggagg acttttccct cggtggaagt agggatttaa 6301 gcacagttgt gattccgctt tcatgtcttg gattgagctt tctcactctt tctatgcctc 6361 ttccttgtcg gcgtcctttg ttcgctactg ctttaaagaa tgggatgagg ctacctgatc 6421 gtcgagtcga cttccatcaa tcaattggat tggatcttat tatcccatag tcataaatct 6481 gattcttcac ccagtgggaa 
cagtcttcat ggtggtacca gtgcgtatat atgtagatga 6541 gttttggttc ccaccctcga ttttccatcc aatcttccta tcttcttttc cctgctttgg 6601 aagcattcga tctctcgtaa agcaaaccct cgtcagagaa agggggcggg aagcctgcgt 6661 gcgattcctg ggttggatat ccctggctct cttctctctg agtccccgct agctttcaag 6721 cgttcattca atctccctcc caacgcaagg aaaccgatcg atgcacttgg ctttaggttt 6781 cgatcgataa taatttatat tatatataaa aacgcttctc gctggagact cgggtggggc 6841 gcttgctctc cactggcggt tgaatgaaag acgtacccat aaagttgacc taagttggta 6901 tgcctcccat cgaactaaag caagaaatct tccttcgatg cgagtctatc aaatgtttag 6961 ccacatctgc tgcgccccgg tcagtggagt caatagcctt atcagctgtc cctcccttct 7021 tttaaaatat tacccaggat cttatcccag gccgctgatg aattggcaac gaggcgagcc 7081 ttgtaccgct caacactgcc atcaggtttc cgcttaaccc gataagaaga cccacaacaa 7141 tcttttgatg aggagatgat ggtactagtt cccgggtgcc atggagaatt aatgcatcaa 7201 attcagcaga catggatggt ggcactagga tctgcatacg aggcttttgg tgtgcggacg 7261 tatgtgacca atgcacttat gctgccttca ctcaaaggaa agcactttca aagaagcaag 7321 ggatgaaggg gaggtaagcc tgtcttaaaa tcgattaact aagctagctt aattcttgaa 7381 ctacttgaac tagaggagcg gtacgagcgg cttactacaa aagaaaggct atttgagatg 7441 ctttttccgg gccaaagtac tctgacccat gataccgatt caagctcact gagaacggat 7501 ttccctagtt ttaaattcaa ctacggacca acaaagattg attgttgtcc cgttgacagg 7561 aagatggagg gttgttctcg tactcgcttg gcggggtggg cattcttatt tgtttaagac 7621 ttctcacagc ttttaggcgc taggagggct tgatagcttt gatgtaaccg ttttggtcgt 7681 tgttggaatt gatgagccaa acaagccagt tctcagcctt ctatagtatc cgtttgaata 7741 cgagacaaag gaatctatag gcctatctct cactttctaa tactactatt gctagtgcta 7801 aggagctaga gtatactagt ctagaaaaga atcccacata gttggctgtc ttactctaat 7861 accaatgcga agaaactcat taattcattg atgaggaggg cgaagaaact ttcagcttta 7921 gagaatccag ttccagcgga agactaagta cggtagcaag tcaaatcaat ctcctccaac 7981 agaaggctct gacctgacca cagtcaatca gtctattgtt ctggttctag agagtctaac 8041 cgagaagaaa tcgtctcatc gctttgagcc tggtccaggt gtgggggaat gagtcctcca 8101 tctagtagcc cagcccttcc gacaagagca gttacgcctt ttcctaaagc tcttactaaa 8161 ccaccgccta gcgaccacgg aaagcatagt 
cagaagagct gaaacaatag ccatgtcata 8221 ttcctcggga attgcctttc gaactgaact accacgactc tcgttaactg catcggattc 8281 gtgaatcaat tgctgcagat tatctcattc aattgcagcg gctttagcag acatctcttc 8341 aatttgcgta gatagaagat ccggctagaa actccttttc attgccttct actttaagtt 8401 ttaaattaat ttctctatat ctaattccaa gtgagatgtc caaaatcaaa ggatggaggg 8461 taagaatcga cgaggaatct ataagataaa atcgttcatt caaaaggaag aagaagaaga 8521 atcggacgat tgaagactag aatcgctaat agtcagtcgg agtggattga gaaagaattg 8581 taggactaga tagctctttc cttttttgat gtgctttctt gcgggaaaga aaaggtattt 8641 agttagctag gctttgacgc ttgttctcgg attcctttgc ttgtatattg tccgaaatct 8701 ctatggcaag tagtaggacc ctcacactct gttcgttctc ctgtccctat ctctctctgt 8761 cacatcctat cccgttcaga atctttggga tcaagaactc tcttccattt ctctgctcgg 8821 aaagaaggtt ggcaagagat atttttaaat tagggtgcgg caactaaaga ggtatcgaga 8881 ctgaccgcaa ttgcaggttg acttttcttt tcattttgtt tatggcaagt gaaagtcgtt 8941 tcgtttagta cgagatggct tcacaacctc gcggtgcttc cacctctcgc ctatcgaagt 9001 tctgttctaa aacctgttcc gagaacttgt atagagaagg atttcccgct aagcagcagt 9061 tcttccatac caacttagct gcccggcgct gctattggca taacaaccgg tacaccatag 9121 gttggcccaa cccagtcctc tcgtactagg gttggctcct cgcagttctc cctttaacac 9181 caacggtaga taggaaccga actgtctcac gacgttctaa acccaactca cgtaccactt 9241 gaatcggcga acaaccgaac ccttgggacc ttcttcaacc ccaggatgtg atgagtcgac 9301 atcgaggtgc caaacgactc cgtcgataag agctcttggg agtcatcagc ctgttatccc 9361 cggcgtacct ttgatccgtt gagcgagagc ccttccacac gggactcccg gatcactatg 9421 gccgactttc gtctctgttc gaccagtcgg tctcacagtc aggcaggctt ataccattac 9481 gctcacgagc agaatcttag cttgagccta ccttcgcaca cctccgttac tctttaggag 9541 gcatccgccc cagataaact acccacctcg cagtgtcccg cctccccccg aattctcggt 9601 gcggcggtta ggcaccctta gacgaaagag tggtctttca ggattggtcc ttctatgtca 9661 cgacctccca cctatcctac acattcgatc aaggttgtca ctgcgaagct atagtgaagg 9721 tgcacggggt cttaccgtct agccgttggt actccgcatc ttcacggaga attcaatttc 9781 accgggtcca tgtcggagac agcggggcag tcgttacacc attcgtgcag gtcgctactt 9841 atgcgacaag gaatttcgct accttaggac agttagagtt 
actgccgccg tttaccgggg 9901 cttccattca aagcttataa cacttctcct tttgactttc cagcaccggg caggtgtcag 9961 actctataca tcgtgttacc acttagcaga gtcctgtgtt tttaataaac agtcgctacc 10021 ccctggtatg tgccgctttc ctaatcaaaa gataggagag caccccttct cccgaagtta 10081 cggggtcatt ttgccgagtt ccttcgacat ggttctctca agcgccctag tatactctac 10141 ttgttcacct gtgtcggttt ggggtacggt cagttcaccg ggaggatcgc cctcccaatt 10201 cgaagttttt tcctggaagt ttcaaccttc ttgactatga caagagtcgc gactataaac 10261 agactcgcga ctatggcagg gcggtacgct ctgctctctc gcgaccccta ctctaatcaa 10321 aagactaaag gcccctactg aaggtcgcca aactacgacg agactttcgc cttttgaagc 10381 gccagtagcg tagggcgacc gggccaggcc gagtcagaaa ggctttgatg actcaaggtt 10441 catattaggg aaaggagagt gaggggaaga gggctacgct gccctcggcc cgatcatcca 10501 attcgctcca agagagaggc atggttttgt agtcaaagca acttcgtcac tttcgtgtac 10561 ccatcggacg gcagcccttt cgggggttcc ttagggaccg attcactctg cgtagattga 10621 ctgaacgcag aaagccttcc actggcaggc gatcgtgttt ttcacaggat ttatcgttac 10681 tcatgtcagc attctcactt ctgatatctc caggtcttgt caccaaaaac cttccccgat 10741 tgacagaacg ttccgctact gacacttgaa aaagcagctt tcaaggtctc gtcgcttcgg 10801 tgaatcactt gagccctgat acattttcgg tgccatggag ctagaccagt gagctattac 10861 gctttcttca aaggatggct gcttccaagc ccacctcctg gttgtcatcg ctcgatcact 10921 tccttttcca ctaagtgatt gcttagggac cttagcgtac gatctgggct gtttccctct 10981 cgactttgga tcttagcacc ccaaaagtct gtctgtacaa acgagaacgg cctgtattcg 11041 gagtttccct ggggttggta aggcgaaatg gggccaccct agcccattga gtgctctacc 11101 tcgggccatc gacatcatac gctctactga aatagatttc gcggaaaacc agctatatcc 11161 gatcttggtt ggcctttcac ccctagccac aagtcatccc cgtattttgc cacatacgtg 11221 ggttcggtcc tccaaggcct gttagagctc tcttcaacct gctcatggct agatcgatcg 11281 gtttcgggtc aaataggaag aactagaaga ttccaccttt ggaaagcgcc tacacctaat 11341 ggcttaagcc gctcttccca tttcctcgct gacccatcat gcaaaaggta cgccgttaga 11401 gtgagtgcgc ttgactactc cttcgactga ttgttcgcat cggatctcag gttctctatt 11461 gcactcccgt catagggttc ttttcacctt tccctcacgg tacttgtacg ctatcggtca 11521 ttgaggaata cttaggctta 
gagggtggtc cccctttctc gcgtaaaagc gatcagaatt 11581 cgaacacgcc gcgttttact gggaaggatc gaaccatagg aacgaatcta cagggctatc 11641 accttctttg gccagatctt ccaacctttt cacaattaca gttcacagcg ccctttagga 11701 atcttcagtc aaggtacgaa gtaaactcga ctgaaagaag aggggctttc tggtttttcc 11761 atcatccaat ccacaataaa tcgaatgaaa cctggcgaaa aagaagtgaa cactttggaa 11821 cgaagcttcg tctttctttt tcttaaaatc ccaaatccgc tctcgctcgc cgctactaac 11881 ggggtctcgg ttgatttccc ttcctttagc tacttagatg tttcagttcg ctaagttttc 11941 aaagtccaaa gagcgcagac tagccacgga gcttggatac ggtttcccga tcggagatcc 12001 atggatcaca gacggtatct ccccatggcc tttcgcctct gaaagcgtcc ttccttctca 12061 atgcccgggc atccatccaa tgcattcttt tcgatcttgt actcagggta cactgaacac 12121 cacaaaaata tcgatgaaac taactataag tgattgcgga ttcgaaccgc tcacagaagg 12181 atttacagtc ctgcactcta ccagagctac tacctgttac cactttcttt tcaactcgta 12241 aaggcaaaga aaaaagggat ccgcctcgaa tcaaaacgtt ctttcttttc taaaaacgat 12301 ctttcttctc ttatgaaatt gatagtttgt gagagggatg caataactcg actgtgtaag 12361 gtccaccagg tccgtaggag agtcagtctt tctctaagca agctgttttc tggcctatac 12421 ggaaagagtt ttcaaggtct ttccctttcc ccgaagggaa gttaaagctt gcctgtaaag 12481 ggaagggaag cacagcaata acgccttttt ccttttttct cccgattaga cgaaagacgt 12541 gttggggctg tatccgaaac atttttgaac tgccggcccg acaaccaagc ctataaccgg 12601 caataagagg tcaagttgag acatacaaag gcgccagtga agagcttaaa atcagtacaa 12661 gggctgctat tgaagggatt aaatctccca atcaaggact gcttaggtcc cttctagtcc 12721 gtcctgtatt ttagtatgat cgacgtattg tattttacct atacgggcat atgttcgact 12781 tcctgatcaa gggcatccgt ccacaaatta ttttatgtaa atatagatat tccctacttt 12841 tcataccatg ggaaatccac tcggacgaag acatagatga taggattatt tgagagaaga 12901 gccgtgccct acctattcta caaaacctta tcctagtttt tactttattt tttttatata 12961 taagcattca tcaacggatt ctgcttccac tgccttagtg ctaattacaa actaattatc 13021 ctcatcccaa aaccccttac aaagaaaggt aggcgtgcaa taagagaaag cttaccttgc 13081 cccgagagag accataggat aaagtaagcg tgcaagagct agagaatccc ggggtatagt 13141 gagagacctg gtgcaagctt ttagagtatg gctatcagtc ccaagtttaa gctaatcact 13201 
cgtaaggtta tgtagtccgc ctatagccca gcctttaagc atcaacagaa gatccagcgg 13261 tatagtcgtt ctctgtatcc aaatgctaga gtaggagagg agtaagctaa ccttatccaa 13321 cctttgtatc cgtaaatccc tatatagcct accagcgttg tacccttttg cgagcaagtc 13381 agcctctatt tattaaaccc tttctagagt caccagctaa tcctaaaatc cctggtgcgt 13441 aagcacaagc taccctttag tagtcagcaa ccctcctaag ccttgttttt aggagtaacc 13501 tgggatcttt ttaagctcgt tataggaaga cgcgctatat gagtaatcta tcctagtaga 13561 cattatacta ggaagctcct gagtcaacct ctaatctata agccagtcag cgggatatat 13621 gaatatgaac ttctctaaga gaacagttca acgggtttaa ttgaattccg ttatctgatc 13681 agattagata ttatctaaga ctgtataata gagacctaac taactggatt gctagcagtg 13741 atcatacccg atgatgtgaa cccgattgaa tcagctcttg tgatcatatc ccatgctatc 13801 aatgggagtg aagtccgcct atccatcaac ataaaagtga aaggagcccg ctaactactg 13861 cacttcctaa tgtgatattg agttttgcct aaacttgccc gcccatactc ttaacttttc 13921 gcttaagctc tcagactaga gctattctca ctaaaccgaa aattataata tacccctctc 13981 tcctctctcc tcaatggcta aataccaaag ctaaccttaa atgggttaaa gaccgagcca 14041 gaacaggagc ttagcctcaa tcttagatag gagattaccc ataacaactc aactacgtcg 14101 agccttgtct tgaacaagcc aatcaacctc aatcctagga aatgatttca ggtgaagctc 14161 acccctcttg ttctcatctt aagggaaggc ccaggcgcag aaggaagact ctttgatcgg 14221 gtagttcctg ttctcagtca tcttaggtgc ggttggccaa ttaattactt aaatttatag 14281 cgcgagcccc actcatacaa gtaggggaaa actctagttc aactgttttt ttattcagat 14341 tcagaattgg ctgggcctta accgtagctt tagggccaaa agcctatcta tggccttttg 14401 gctgggggta gctatggcaa gggtagtttc ctcggaaaaa ggaggggaag ccccgacttt 14461 ccaataaggc ccgggggaat gaaaacactc aaccgatggg gggaggaatg aacagcagct 14521 ctaggggata aatgtggata tccagtggca actgagcagg agcagctcta cccgcacata 14581 gctaaatatc ccccttccac gccacccagt tggtgagctc cgcccttaag ggagcccatc 14641 cgtcggtcgg ccaatattgg atagtaaact agcgcacagg gtaactcgaa tatcaaatat 14701 aagaatagca gccccggtct caaaaacaag catggaaaca gcagtcaatc agaaagataa 14761 gttgaagtta gctttcagct gttcaaccac taaagactaa agtgggtgaa gcatagtttc 14821 gtccgtccct ctcgctttca gcgcgggata gagttatcag tacgtgcgac 
accaactctt 14881 gcgttttcac ttttttttta tacatatata gataaattcc taccatattt cgttacttta 14941 tcccgaaaaa ggggagctgc tgggctcctt cctgttttat tagcatcaaa cccactcaac 15001 aggatattcc ataagcactc cggaaggtat agtgtgaagg aaagtaggtc ccttacatga 15061 gagtttgcga acacgggggg cttatgctcc tgctgctcat atggaacagg ttggccccca 15121 aaagccccaa gggaacgagc gaaagggctg cttgccggaa cagcagatcg agcagcctat 15181 tatttattac atttctgccc caactattcc aaaaagggta aaaaatgggt ctaattctga 15241 gacaaaagta atttaacatc cctgattcgg aaactgctgc agtgggaact ctacttgcca 15301 ttcctccttc cgaagtgaaa gttttcattc tccccttggc ccagttgaaa agcctaaccc 15361 caggtgaagt accagattct taagtgaaag taaggaattg cccctgtctg attcgtaaag 15421 ctttcggaga aggaggggag atccgcggag gtagaattct gaatacgggt acgaagggtc 15481 atccagaagc gctggaaggg ctggaggaag gggacgaacc gcatcgattg agttatatcc 15541 ggcaggctaa cacccacgta cctagaaagg aaggggaaag tcttaccttt tggaaacata 15601 aggcgcatca gaaaggaatc cggatatatt tagatagcca gattcacgag ctgaggacta 15661 agtcgaagcg aaagtctgtg ccattgttag tgccaagacc ctagaatagt gaacaccaag 15721 tgaagagcca cctccgggaa cgtcgagaag ggggacgaac agctttgatt ggatccgatc 15781 tggtagacag agacgaagac aaggatgaat aatctgaaga aggctatgcc gaatccgagg 15841 gaagaagaag actacgaaga aggaagccga tgcctaggcc gaggctggtg cctaattcta 15901 tgcttgaagc ctaagctaga gctggagctg aagcctaatt ataagtttca tctaaagctg 15961 aagttagggc tggagcctga gttggtgttg atgtcctaga gtagcgaaac tacaagtgaa 16021 ggggcgctat ccggaaccct taaggtcggg gacaatctgc ttcgattggg ccttgcaggc 16081 agacaggcta acttgtggag acgaaatgga atcctataag aatgtcgagt cccgagggtc 16141 tgcatgcaag aggctgagct aacaagctgg ctgaatatat catggggatg cccgatttgc 16201 gattgagcta gccgataagt ataatctctc cggtctggca tgagccactc cattcctact 16261 tccactcaac aaaaaagata ggattcttgg gctctacctc cagagtgcgt acggctattt 16321 cagaccaaag ggggccaagg gggatcccat aaataataaa gtaagagaag taacccgggc 16381 tatgagctat ttcgtcgttt tcttgccgga gttcgctagc cgttcaatcc gatcttccct 16441 tcgcatggca gaagagcgcg gctcgagact actctttcaa ttaaaaatga aaggcaagca 16501 ggaatttaac ccactaatgg gccagctgca 
taatcacgta tgtaagcaag tgatcctctt 16561 ctttcgcagg aagcattact catttcttca gagttgaaga aaaggctacc gatttcccac 16621 tgactattat gtaagggctt gaccttggat gataaattca tggctaagtt agtccgtagt 16681 acgaagcttt caattcaagc accttttttc tatactaaag gccaaaggga agttgaagcc 16741 caacatcccg tggtcgggat ggacaccggt ctggcccaca cctacatgga cttcaaattg 16801 acttctgatc gtaactactt taaaaacctg tggttcctca cgcctagtcg acataatgtc 16861 tggcgtttac ctgacggttc cccaagcccc tgagctcatc aatgaacgga ttccatcttt 16921 cacagcgagc tgttccgaag tcgcgcctga gcacagtgaa atgcctctta cggtcatcta 16981 tatcgtgctc ccagtcgtca tggagagcca actggagagc acgcttgaca tcctctggat 17041 ccccgagagt tactccacgt tgatgcaaga gaatatgggc tttccgtacc atggccgcct 17101 ccaaattctc gcaggcgctc gtgatagtct ctacctcggc agagatgcgg tcgtgctctt 17161 gtgcccgagc ctccctttcc ccaccttcac caaatggttg atccaggtga tcctgctgtt 17221 gagctactgg ggaagcggcg ggttcgggta aggcattgga tggcccagcc tgttccctct 17281 gaatcggttg attcaccgac gaggaggtac gcccggaact ggccgaatcc tccaaatcag 17341 acgtgtaggt aaaccactct gaataggaag atgccccgtt agggcccgta gaaggaagtg 17401 cttcccctcc ccctcctatg gcattccaga ccaaaggagg taccaacagg ggcaaagcca 17461 gacccagtag gtggatggct ttgatccgta tcaaagatat tcccaccgcg atgaaaaaga 17521 ccaaaaagag atagataatc ttcttcagtg gccctttcaa ccggaaggaa tgcaaataaa 17581 aggcggcgac cccgccgagg agtgtaagtg taaccactaa aacagtggtg gtcgccaatg 17641 atggatttcg agccacagaa tggatcaaag aactgaatac gtttccaata ccgatagcag 17701 ctcccgctga agcaattgta gcagctccgg cacctattga ttttgcacct tctaacatct 17761 cgggttgaga attatactca cgctttgtca ttcacttatc ttatatctta ttgattcctc 17821 gtcgattctt accctcgttc ctttgatctt ggacatctca cttggaatgc attctttgcg 17881 atcttatacc cacggagtgg tagactgaag accaactgaa tctcgacaaa gagaagtaca 17941 agcaagtaga tcattcgacg tcagaggggc tgatgtgggt cgggctatct atcttgctat 18001 cttttttttt atacataccg tattattcta tcaatcggga agttggcctc actcaatagc 18061 gggaagttgg cctcactcaa tcaatagtcg gatacaatcg tggacaccgt agacctgctt 18121 cgcctagagt ggatcaggtc cttatacgta aggcgatccc gcgaagccgg tcaccggagg 18181 tgaagtcaac 
catcatttct tttcttacta aattcaacta tgtattctga ttaaagagct 18241 tattctcttg attcatgtcc accttgcccc tggtaactta aggatgagtt tcgaaattag 18301 gaaggattgg actgcagatt tagaaagaga atggggagcc ctcatcaatg aattaatgag 18361 tttcttcgcc ctctcttggt gatagtagct cttcatcagc ttatagtagg agctgagctc 18421 ttccgctagc ttactagctc tggtagtcag gctcttcggt caatagttct actggaaaca 18481 gcttcctgta tcactgcttt ttaatttttt tttaatgctt atattttctg taaagatctc 18541 gatgaccgct taattcatct ttcctttccc gctcctgaat gaaaagtggt tttttttatt 18601 cctacggtct tggcgtgatc tctgaaactg aaattcgact gaaaagttac caatagaata 18661 ggctttagag tagctctttc caaataggtc gagtagccat ttctagaaat agaatgaatt 18721 aaaaggagcc aaatagttgg gattcacacg cacagcctta tcctatgagt ctgcctatgc 18781 tacgcgcctg cctttgagag cctttaccgc tggttccctt catcggcgtc attgaacctc 18841 gttagcattt cgaaaagaat tggaggctca aaacgaatgg tcaaaggaat tgatgattcg 18901 tgtttagtct ccgatcttcg cctagtctta gaacctgcac tgtagagagg cgctgcgctg 18961 cctctttctg ccgctctcct cccagatgta ggtcttcccg gaagtgagaa cactgctgcc 19021 caacttataa ttcggtcact ccccgtcttc atcctagacg ccactggtac tattcatcta 19081 tactatatat ggatgactcc tgctttctcc ccaatggaat gatcaggacc ctaaagaagg 19141 atcgtctcac tccaactaga ataaagatga gctagttagc gctggagttt tcttgcttct 19201 tcgcctctgg atagtagttt gaatgaatct ttgatactgg gattaactct tgaaggaaag 19261 atcttccact cctatgcttt caagcgattt ggattccaag acgatagata ggccggttat 19321 gaaaaagagg gatcgagcag gacgaagtca acccggaaag aaagtcggat aatatgaaag 19381 gtcatccgtc tcattctaag cgtaaaatta atttttttca gagaaagaag agcccatcag 19441 ttctcactct tccttaccag tcagttcccg ggtctataac aaagcagtca cagtgtgtca 19501 tcttcttgcg tggaagtgag tcctccatta atggtatcaa tgcctgtccc cctctcatcc 19561 ctgtctattg ggcttcacgc tagcccccat ttcgatcaag aagaaacttc ccagtcccaa 19621 gtgagaaagt gagaatagtc agatgatccc atctcataga taattggctg tctcggaaag 19681 agaggaagtg atgaatatag aagatctagc tctcattcct acagaataga cgacatcaga 19741 aggaaagatc agtcgataga atagaatccg aggatttgct tggatttgtc cgcttcatct 19801 cccccggttt acgcttccag gttctcacat ttggttcttg gaaaagaaca attgaattcg 
19861 gatgtgattt gacttcttcc ctgatgatct tgtcggccct atctatcctg aaaaagaagc 19921 attccagaat caaaaagaag aaaggtctcg acgagcatta actagttgat gaggtctcaa 19981 cgagcgactg aaaggagagg agaaaaaagc tgctagaaag aaagtgagct acgatcggat 20041 aaggtatgcg ccttctatta ttgatgttgt cactgcctca acttaacctc agaataagat 20101 tgaacgtcaa cgagaagcgt cagtccaagc taaagaggaa tgatggagag caagaaaagt 20161 cctgtgctga ctctgtgttc taagctgcca tttcgatcaa aagtatactt atttttcctc 20221 ctactcgaga atgatggatc cggggaggca aggaagtagt agaaatgttg aagaaggaga 20281 cttaaaggca actcgagtaa aacgagaggt ccattaggga ctcgacttaa aggagaggta 20341 cggagtgact cgagcaaagc aagaggttta aataactcga gtgtaaacga gaggagacaa 20401 cccgctcttg aacttgaaac tatgcgcgtc cgtcttcgat ctagcaatct tggacaattg 20461 ttttgatgag ctgtcactaa actaaaaaaa gagagaagag aaagaactgg gaatgggcgc 20521 ctggttgctt gcttaccaag ccatcctggc aagctcctcc agttcgatta ttattcttga 20581 cttgactttg tataaaaact actcactatc aaaatgaaag acgatcgatt atctacccaa 20641 gaagatagag agtcccacat acgagaaaag gtcacaaata gagttgaacc aagtaacatt 20701 gcaaaggcat aatgatagta gggtcgggat atccccgccc tccgaaccgg acgtgaaggt 20761 ctcccctcat ccggctctcc gcaggggaat ctccactcac tgcttcccct aatatcctcc 20821 tttaccacat catgggggtt tacaggagat cccagaggca cgcacggaaa ggctgctata 20881 ccataccttt gactcaactc tactctagta gtcatagact cactagaata gtccgtccgg 20941 ctgctcttgc tgaaatacat tactcttcat tctcccgtgc tctagcaatt gcgctcccta 21001 gaccactacc atgtagttag gtagggacat ccgtaatgac gggaatctag gaatgaatgg 21061 ggatccctat caataagaat atgaagaata tctaattagt tacactacct ttctctcact 21121 caatagatct atctggtctg gatacggtac agtacaatac gagacgatgg aatgctatgg 21181 gatggatggt agagggatgc cagcgcccaa aagcgatgat tcacttgtcc ccttgtccat 21241 agggacctcg tggcatacaa ccgaaacgac tcccgctaga tagccgcccc tatctctctt 21301 tttacagcct cgtggacgga cgaaagaagg caagttacag aacggtgcag tgaaggctcg 21361 cgaagtagac agcaagcaac tgcttttcag ccccttctct attatattag taaagggaag 21421 gggactctat caggatctta cgaatctaaa gatctccaaa ttgaagaacg gactcttttg 21481 ctgcccccgg agcgtaagca cttcactcgc taggggatgg 
gattcattca cttgcattcc 21541 tgctagcact acaaaaagct cggtcttaac gcccttacta ctgctgtgca gcctttcctc 21601 gggttcgtag agtcgggttt cccgtttacc cacaacggag gagggccccc caccaggcag 21661 gcggccacgg gtcataacgc actcttcgca caacaaatcc actttgaagt tgacttattc 21721 gctcggccaa tcgtcggaat gtgtacgaga taccataagg gcccaatatc tcaatagcac 21781 ctttgtctaa agcttcgaat gagacttcat atccgaaacg caggaacgat ctgactagaa 21841 agtcattcaa aacttgatcg aagaaccagc gtttattgaa gaagctatag agtcgattac 21901 aaaaagtact agtttgaaag gctcgttgga attgatccgc tacgggattt acattatacg 21961 caacaaaagc acctgaagta ctaaacagaa taggtattag tttggtaatg gttggagcag 22021 caaactcgga ttcggcaaga atctcatttt ttggtagtac gaggggggaa ttggcccaaa 22081 aattggatag gggtcaagac gcagtcgggc gccggcggct tactcaagtg ccccccgaac 22141 cgcgcgaaat ggtcgcctat tacacggctc actaactctg cctggggtgg gggtacgtac 22201 ctatttatta ttcgtcgccg gtccggctta atggagaagg aaattcattc ttaatctaga 22261 gagttccctt ttgttaagcg catcgagcca aaagactttt cagtcagaaa attaggaagc 22321 atggactcac tctactcctc gcggcggtat tgccgcataa tggaatagat agcatctttt 22381 tccttttgag tttgaaaggg aaaaggaata tggttcccgc cattataata atggcgagca 22441 aggtatcgac tcgcgaaagg agctcacgat aaaaacccac cggattttct ttttccaata 22501 ttaaattgat cgtatggaaa gacgcctaca cggaacctat acgaaggagc aaacaagcaa 22561 cggtaactac tctactttac tttgatttga ctaggcaatt gattccccat ctcggtcaat 22621 caatcggcct ctaggactgt tgtttaaaga atctgacctt gacaaccgaa acgtgagcgg 22681 ggaagctgac tacaccttac cccttgactt cgatttccgc agtgctctat gagacgactg 22741 cgagcacata cctatggatc tatgtcaacc aaaagcaacc actccagcaa gtttaccaac 22801 cagcacagat cctcccaagc aacttccact gcgccaacta gaaaacttat gcgcttgtca 22861 attctttctt atcttactat atttctaatt acctaaataa aataataaat actgtagtta 22921 tagctaaagc tcgacttctt tcttcttttg gatctgcttt gccttgcttt gctaccgggc 22981 ttctagctgc ccttcctcca ccgttagctc agtagagcat tttctcctcc actcatgtca 23041 cggattcagt tcaaaaatga caacagtctc tagtttacaa tggctcagaa gtatggttgg 23101 agatggtgct tatcttccta tttctcatgt cggttcaaca actcttgcta ccacagaagg 23161 tagaatacct gttagtgatg 
tttcttcttg ttcctactaa aaaaagaatc cttgctctca 23221 gtctctcaat ttaccccttg gtcctaaacc atgctcagtc cttgattgtc agctgctaga 23281 acaagcacca gacacctttc aagaaagagc aagagcggcc gggaaagata aatcggatta 23341 gcctttcatc gatccccgaa gagaaacatt ttgttgttga atctgtgaga aaagcataag 23401 gataggtgag gatagattct ctagattaac tttatcgaac gaatgagaag cttagcaatt 23461 catctcggtt aaaggattct acggcgaggg tttaagcgca taaactgctt ggagaaggac 23521 ttctgacaag gcggctatct tgatagaagc aattctcttt tgaaacactt tcatagaagt 23581 cttcttacaa agcaaatagc atttcctata gcatttccta ttgatttgtc ccctggactg 23641 gtggacctat tctgattctg aattatccgt cgctacgctg ttcccaagga ctagcaaaat 23701 cgaaatagcg aaattcttgg gtcatctcaa tgggttcaga aaccacacgt ttctctggat 23761 catcatagcg tacttggaca tatccactca gaggaaggtc ttttcgtaat ggatgaccct 23821 cgaaaccata atcagttgat atacggcgta gatccggatg attgatgaaa gaaacaccaa 23881 acatatccca aacttctcgc tcccaccggc cggctgatgg aaatagactt actaccggag 23941 atattcgtgt tacttcgtct gcactggttt gtacacgaat gcgtgagtta taccgagtac 24001 tcagtaaatt atagaccact tcaaatcttc gttttcgaga gggatgatca actccgcaaa 24061 tatcgatcga aacttgaacc cttgtatagg tatgcaattt gagaaagcac aacaattgaa 24121 ataggtagtc cgtattggta tcagatctat tcccatgttc cgatctttcc atttttttga 24181 cccatttctt gggtaaagtc tcccaactat atttgaaaat gaattggtta tccataaaga 24241 taaagaaagc tttcttgtag ttccgcttct tgctcttcaa attctgaaag actcgtcgga 24301 aattgccggt gccggttggt ccaaccaaga aaggcatggg agtctttggg cattacttgg 24361 gtcgagttaa gtaaagactg gttacctata aagatccctt actttttttc gcaccggaca 24421 atctaatttg attgaaagcg aatgatcaaa ttcgtggatc gcctgccata ttggactttt 24481 tgtttttcca atacaggcga gcacgctttt gagatgccct agacgactcg catttgaaag 24541 gaagctgtgc agatcgctta agtaggctta aatacctatg cgaatatcgc tctcctcgat 24601 atttagatct cggagatccc tttcttttcc ttcaattaaa cttttgagag ttccttctcc 24661 atacatatcg agggctttat caatgattct ttggatcttt tgttcgcaaa agtggaaatg 24721 gcgctctgtg aatcgccact ccttagatga aatcagatct ttccacactt tatcttcatt 24781 tagttccaca gttctaaata gatcatctgg ctttctaaca acataacgga tgaagtcccc 24841 
cccacgggta aagtctgtgg aacttgcagc atagggacaa ttccttgtcc cacctcctgg 24901 attactccaa gaaaaggttc gaccattggc aacagttcgg attcgaacat tgtctatctc 24961 gtaatcattt gattccgtct gactcctcct actcctcctt tccgaaaaga ttatccttgg 25021 tccttacttt tgacataaga ttctcggccc gctttatccc cattaaccaa ttacgttacg 25081 accactgaac aaacttggtt gacgaacatg gtttatgagc cgctaatgta gcggcttgtc 25141 gagcatttgc caaactcaca ccatccattt caaatgggat ttgtcccgtg gacacacgag 25201 caatccaacc cgtaggattt ccttttcctc ttcccattct tacttctgtg ggtttcccgg 25261 taatagggag atccgcgaaa actcttaccc atatcttacc atttcttcgg aattgtccgc 25321 tcatagcacg atgaaagtgt ccgattatag cccgacgcgc tgcttcaatg gctcgatatg 25381 aaagacgacc agctttacaa cttttagtgc catatcttcc aaaacccagt tttgtaccat 25441 ccggtttgca acccctacta catctgcctt tacgatattt actatatttc gtacgtttcg 25501 gatatagcac gtctcccttt tttttgacta tatgaaatcc acactttgac acctgagatt 25561 ccgtaacgag tagatacttc cgcaggagca taatcgattt tatggttaaa tacattacga 25621 gaagtttttc tatgcttata gcatttagtt tgagcttttt ctgctgcgtc ttttaatcga 25681 ccggaaaaac atatacggat tccctccacc ccttttggaa tctccttcac tattttagca 25741 aaaatggaat gaaatgatct tcttttgttc ttcagttgaa aagagatgtc ttgagcaatc 25801 ggagaagcgc tttgataaac agattttatt ttgactgact caattaaggt cttagtgttt 25861 gttctattag acaagaaaga tcgcattttt ttgacttcgt aaaaataaga gttgtacctg 25921 tacggaattc ctctctttct cagtatgatc tctatcatca tatctattac ctttatcaat 25981 tcttctatcc cacctaggtt ttttaatttc tctatcaatt ccattatctt atccttaccc 26041 acgaggttcc aacatttgac ccttaattga ttgaggagtt gttcccgagc atcaaggtta 26101 ttatacaccc caaccccatc tcttagaaag aaaaaggtag caccgaagaa tggaaagagc 26161 gagatggtgg ttttagttgt tcccgcgaag cgaagatcat tcgccaagcg aatgaagtgg 26221 gtcaacctat ttttggcttc ggccaaacac tttttctcgc tggtcgagct ttctacaaaa 26281 aaagcgatac gagaacgtat tctcttctgt aagcttcttc cctgtgcatt cgctctcccc 26341 atcgtagatg gttcagcggc gcccggtgcc acgaaatgat tgagaactac gacggggtcg 26401 aaattaattt ggttctttgt attaaaaaag tattgcatga ccaaaaaatt caaggagggg 26461 cgcactgcag ggagggtccg tagataactc gtcgggccgt cggagcggga 
cttattttgg 26521 aaaaataact tgaataactt cgtttttttg aaggagtcgt cattttctat caggaacgct 26581 atgtcattta caaccccggc gtatttcgga tgcttgaagg ccccgctgat ccgaagtgat 26641 ttagaaagat tcttctttat cgagggtaat cggtcatggt atccatagcg ttgttttttt 26701 ttcggccaac cccggatttc gttttgcttt tttcggtcgt cgagcctgat cgactcgact 26761 cttttccttg ccccccggcc tctcacttcg tttcgttctt cttctgtatc gtcgcttgaa 26821 tgaagacact cgatcggccc ggctttcccg aatgtcgtcc accaccggcc cttctccttt 26881 ccgggtctag ttttttcacg tcgtttcagt cgtcgtggtc gacggggaag aaagaaatga 26941 atgaatgttc ttttaggaaa atgtagaata atacacctac cgagacgaaa gccgaaggtg 27001 agtctcgtag gtggacgtat cgaaccgaaa taagatctca gattgacatc ttgatacaca 27061 aatttaccat aataataaat agagtcaata gtgaccccgc cgtagaaaga gccgcttatt 27121 ttttgcttga taggggggct tccattcacc aacaaagact aaaagtgctc tccgaaccgt 27181 gctggatagt cacccatcac acggctctca aacccaacct gtggtggatc ccgggagaca 27241 aagtaaaagc gcttgatcct ttgccccata ctttgagatg ctcctccccg cgcagcacgc 27301 tgctcgtgag aggcttagct ttaagtttaa gctgctatcg ttcggttcgg ataagtcaag 27361 tccctttgat cggttcgctc aggtggtctt atcttcccta acgttcagag tcgttttggt 27421 ttgagaaagg agcaccggcc gagcgccctg aatgaataag aaatggacag gagagagaat 27481 caatgattct tattcaaccg agttgaagct agcaacatgt tgattacaca ccggaggtta 27541 gtaagaactg agggcaatgc cctcctaacc taagtaggct acccgcgctg cgaagcaact 27601 ggtacttgat tggggcgggg ggcggtttgg tttcgtgcaa ggccctcgca acaggaagag 27661 gcagttgtca ttaacctttc gcctttattg atataaaaca agcttactta ccggcaacaa 27721 gcaccttttt ttctcaaagt aaagtttctc gcttgtttct ttagaaagat tgcagcctta 27781 gcgaaacaac ttatcctttt ctacgaatct tccctttatt gagcaaaatt ctatctatat 27841 ttattcccgg aatattttct atttcaattc tatcatttgt ttgacagctt aaactaggct 27901 tcattttaga taggttttcg gtctcttaat caaaatcggt caggggtgcc gtcggaacgg 27961 tcggtggctg cttagtctca tccgagagtc taatctcttt gtactgcttt ttttcaaaga 28021 aaaaaaaaga agaaaggggt cgatttaggc tccttccctt ccactgcata gctacgctaa 28081 caggtactac gagccctctg tcccacacat ctaaccagct cgcgtggttc accggttcca 28141 ccgaaaactc tcatttgtta aaacggagca 
tagtgcgctt taggcgccga gcgagaaacc 28201 tctcttcttt cgtttcggtt tattcactat ctgaaactta gcacttgttt gatttttctt 28261 attgggcggc ggcgttcttt tttgaagtct atccgaaccg aattagcact tctcagatca 28321 gactaggcct cccccgcaga gctgggcgcc ggggacctgg atgcgctagc gatcggcgca 28381 tagggggggt tagttgcccg ggtttctcgg cctggtatcc tacacctcgc gttaatgata 28441 tctacattca actgtgcccc ggagacacgg tcgaagcacg cccatccaat gtgcttcttt 28501 cacgacatgc tctggttccg ggtgttttcc agtggtcttc tagcgttaga agaagtcgtg 28561 tccgtcccgg acatctgcaa cgtcctccca cgcttttttg aaaatatagg ataaactgaa 28621 ggaaaagact tacccattcg gtgactttcg cggtcgccct cactgaaccg acttgaatct 28681 gaactacgat tttttccaag tcttaccgaa atcggatttc cttttcgtgc catatttttt 28741 gactttatgg atttctgtcc cttttttctt cccggtacaa tatttgttct cgaaagtctg 28801 agtttccgtg tactctccaa atttatgacc aaccttcccc tcagtgatct tagaacgcta 28861 ctacgcatct ttactatata gtgtggtaag gtaggtttgg gtatagcagg acttgaacct 28921 gcgaccatta ggttaaaagc ccaatgctct accaactgag ctatacaccc aaataaagat 28981 gtagtagtca attaagattg gtgcggaaaa gagaagaggt ctcaacaagt attaactagt 29041 tgatgctgat cgaaccctca gttcgaagct cttcgccaac atgttacgag gctccagtct 29101 taggcgggtc accattactt gacttagctt agaaaggcga acatctgggc aagggaaagc 29161 gcagtgcgcc tggtcctttc gaaccagtct ttataaacct ggtgcaaatg gctttctttt 29221 tttatgatat accaatcaga atggaaggcc ctacgtacat gattgataga agcactacgt 29281 atggggggtc cacttgatgc gggagaggca tgagtgcagt tcgatcttgt ggtgtattcg 29341 tttgtagtga gtagggcctc tttctcgttg atcagttcgc ctccgctcta ttgaatggtc 29401 cccattccta cggcggtttt gtcaacgaat gcgtctcggg ccggattgac taacctaacc 29461 cttaaggtaa gggggccttg ccatagttgg gtgctctgct ttagtgtgat tgacgaaggc 29521 gctaggctag gtttggtaat accaaaaaca aattaaaaga gattggagtc gatagttggc 29581 aaggaacttt tttcccctcc cccccccaaa aaaaaaaagg agctgcggtc gaactctaat 29641 tctgaaaata agtcggggcc cttttaccaa gtctactaga tagaaatgat agagggccaa 29701 ccatggttgc gttgcacaag acggtgccgc ctcacctcga gttcctttct tcgtagtcaa 29761 atcagatgat aggcttcggc gatgttacct accaaataaa aaaaaggcct gctaggtgat 29821 cgaaatcgct 
caggtcagtg aggactcact cactcatcta cgtattacta ataatttttt 29881 ctatctactg aattaaagac ccgccacgcc gggctataag atacagctgt tgtactactt 29941 gactggtaag aaaaagctaa cgtaagaact ggaagagtct tgtatgtaca gtacccctct 30001 cttccgctat tggtggactg ttgaacagaa aggccgacag aagcaacctt tcttagcgcg 30061 cttttcaagc gcttagcttt tgctagcgac ggggcggcaa ggctataaag taagtaaagt 30121 tggaagccta agccaagaag gtacgaacga agtgacgggc ccctttagag ataggatagg 30181 gggctcttgt ggtagtaatt gcgaacatct ctctttttct cgtagtgtgc acagtttgct 30241 tgcatgccgc cttaggcaca tctctctttc ttagtttcat gcttcctaat gaacgaaagt 30301 cagtctttta gtaagcgtat ataagcagca gtttagtgta taagcaattc tttagtggct 30361 gtgcaccgaa aggtacgtac ggtggaggtt ggttgaagat gatgttacgc ggcgttcaga 30421 accagccttg aagtgaatga attagaaaga agaagtaagg aaatgagacg actttttctt 30481 gaactatatc ataaactgat cttctcctcc acaccaatca cgagtttttc ttcattcctc 30541 tcgtatatcg tcgtaacgcc cttaatgcta ggttttgaaa aagacttttc atgtcattcc 30601 catttaggtc cgattcggat ccctccgttg tttccttttc ctcctgcacc ttttcctcga 30661 aatgagaaag aagatggtac actcgaattg tattatttaa gtacttattg cttgccaaag 30721 atcctacttc tacaattggt gggtcaccgg gttattcaaa taagtcgtgt tttctgtggt 30781 tttcccatgt tacaactttc gtaccaattc ggtcgatccg gaatggatcg gttaaacatt 30841 ccattaggga gcctggtctt gactcttctg tgtggtattc attctcgttc ggctcttgga 30901 atcacatcca gcagtggttg gaacagctcg caaaatccaa ccacttcacc tactttattg 30961 cccctaaccg tttctcgtac ctctattgaa acagaatggt ttcatgttct ttcatcgatt 31021 ggttattcct ctctgttcgt atctcttttt ccaatttcgg tctcgattag tttacaagat 31081 taaatggcca attctcttct ggaccctcga ttctttagtt ttccgagaat gttgggccgg 31141 gtatgtaagc catgtatcta ggaggaatta gaaagaaggg cttttgtttt tttgcatcct 31201 gtgttgtatt ggtcttgcag ctatatatta gaataagaaa tattattctt gtcttatatt 31261 attatataat acataataat attagaaaga tggtacataa tatctcatac acgtacataa 31321 agaagttcct gcatatcctc tctaataatg taattaaggg ctaagggagg tttcttgggt 31381 acattaagac ataactcgac atcccgaaat caagtcggcg gctctattgc tctccctcca 31441 acaatgtgtg actctaaaaa acaaactgct ctggcatttt cagtgaagct cagcagcata 
31501 tttcgcgctt atggggcttg gagttaatca tcttgactgc atataatgag tcagactcta 31561 cccaaaggtt acgaatgcct attcgcgaag catctccccc gtatggcgga ttgcataaag 31621 ttccaccttc aaaatatctc tccactcaac cgtcttccgg aaagcccaaa gaggccgata 31681 aagtctcttg ctatcacaag ccgcaccgca gcagcctcta ttgctactga accgtctgtg 31741 ttcacttttg tttacggagg gttcgaagaa gagtaagatt cagctgagca gggtaaccat 31801 gcattcccaa aaacggagtt gagtgcaaaa agagtccata actctcagtt taatctgaga 31861 ggaaaccaaa tgcagtctca gttttacttt tcccttacaa ttctccaaaa ctacgttctg 31921 cttacagctt ggattccctt catggcgcct ggaatttctc tcacgcccat agtatatgac 31981 tgcagtaaag cagagcttta ggatcaggcc agtaaaggtc taattcttag tttttctatt 32041 ttataatcca ttggcaaaca tcatgaaggt tggttgcagg ctgatcatat ctacatttcg 32101 aactgagctg agaaaaaacc caccctgcat agctgcactg aagagatgtt cggcactttc 32161 atcttcaagc atgcacaggc cacaaagagt cacggtgttg gttgggataa aatctgctaa 32221 ctcgatctgc tgaagcccag agttgagggc catccaggcg ataatactgt ggcttgggtt 32281 ggggatatta aatttattcc gaacgatgcc tgcatagtcg accttaggat aggtctgcct 32341 aatgaggttc caggctgttt tggagaaaaa aatttccatc cgggctgaga gaccaaacga 32401 tcctgtcttc aacagggaga aaaaaaagag ttcatacagg gcaatctctt gcattagaca 32461 ctgtccagcc attctgcggc taggaggaag ggtccaatgc aatgcccatt ttttattaaa 32521 tcgcttacta gagcatgggt ccgggagacc tctttccctt ataacagtga cctcgtacct 32581 gtagctgatc gggccatttc taagccacgg acaatgccac agcttcgtgc tttcgccatt 32641 aaatctgaca ctttatagta ttatattagc cacatcttcc cttaccgaca ctcctccaag 32701 actaagatca gtcggatgga agcttagcca ccaaaataga atgtctccag agatatctca 32761 aattaaccca ttgagtccaa atggacccat tatcattaac tactctccag aaaagcttta 32821 acatggcagt cctattccag ccctcaattg ttcttaaacc caatccacct tcatccttcg 32881 gtctacattg gaccagctga tcatatggat cttataggtg ggttcggggc ctgaccaaaa 32941 gaaatcccga atgctcaatg aaatggattt ctatatcttc tccggaagcc tgaaagcgga 33001 cgaccaatag atgtgaaggc ttgcaagcac tgaggtctgc aagcctttca tgacagcagc 33061 cttcttttct tttccaattt aaaatttggc gattgaactt gtccaagagg gggagacagt 33121 cagagtgtct caatctggta gatactagga aaaaaaccag 
tatttgagag ggagctgcac 33181 ctcacgtaag ccaagagtac gaatgatttt tagttttact tttttttaac tggtggccga 33241 gaatgatagg gaacatttgg ctttattaat ccaaagaccc gcctagctgt gaaactaatt 33301 cagcacttgg ttgatgacca acgcattacg cacggagctc tccgcgaaaa tgagcacatc 33361 agcatatatt atatgagaaa gatagagttt atgaaatctt cttgaatggg ttatccgctt 33421 gttgtttact tccctgttaa aaagctggct gaatattcag ggtaaaaaga taggctctaa 33481 gaaacttgtc tcaatcaaac cgcggccccc cctgaagaag ctagatggag atccgttaat 33541 cataactgca aatgacacta tggagataca ttatctaatc tatcttacct aaattgaagg 33601 aaacttcaag gattcaaggg tattgaggac agcatcccaa ggtactgaca tgactcgact 33661 gaaaggagag gtgctttagc aactcgactt aaaaggagag gtgctttagc aactcgactt 33721 aaaaggagag gataagaaaa agccaacttg aagttcagtt acttgccttc cccattcttt 33781 cgaagttgct agcattctct ctcctttgct ccaagggaat ctgtaatagt aaaccacgca 33841 cagatgctag gaatgaagcc agccaatgcc ccttacatta caagaagatc gctagtagaa 33901 gaagataccc acacccacgg ccagagaata ggcgaatagg ggaaaaacat gaatagggct 33961 ttttgaaata aatagggttt gcgcaattcc atttctaata gaatgccgtc tttagcaaaa 34021 gcaaaggagg aatgatctac gaatgccggg catgatgtat tagtattacc cgagctaagt 34081 gcgctgtttt cagaacaaat gcacagtgaa tctacgaagt cttcccctta atgttaatgt 34141 cctaaagcag gagagagtta atcaacgggt agccctctgt caatctctga tggcatcaaa 34201 aatccgcaaa gtgacaaatc aaaatatgag aatcaacagc agtctcagta agagtagcac 34261 gttttcgacc aggcttagga taaccgattc ttatttatca agcccttctg taacagaact 34321 agcaccgctt actttgacaa caggggatga tttcacagtg actttatctg taacaccgac 34381 aatgaactca ctagaaagtc aagtcatctg tcccagagcc tatgattgca aagaacgcat 34441 tcctcctaac caacatatcg tttctctcga actaacatat catccagctt ctatcgaacc 34501 aacggctacg ggttctcccg aaacaagaga tcccgatcct tctgcatacg cataagccgt 34561 ttcattcgcc gaagcatcag tctcaagatc cgaatcctat gaatcaactg gatcaatatt 34621 cgcaaccgat gcaacatctg attcttttat ggttttggtt gctggacctt tctccgccac 34681 cgtcttttca tctatcagtt aaatcagttg acctttctct cttctctctc tctcctcttt 34741 ttctcctcct ttctatttct tctcttatct tttctagggt taacaaattt ggtatcagac 34801 gcgtaggtta cgcaatggca 
ccgaaaccag atcccactgt tcttcctgat cttcaggaga 34861 aaaaggcgat cctgggaacc cagatcgaaa tgataaccca agccatgaca acactggagt 34921 ctcgtgtgac agatctgcag caagaaagca acgaccatag gacttgggtc agggaagctc 34981 tagacaagtt gttgaagagg gatttgggag atgaaaacag gccgaaaccc actaccaaca 35041 agatgatagc aacaggcgag cagcacaagg gagaggtctc gacgagcctc ttccacgatt 35101 aacggttcct ccgagacaag ctcccaacgg aaagagctca cttaggcttg atagctttga 35161 tgtgagcgtt tgtccaattc aaagattcta agctctcaca gcactgattt ccgatggaaa 35221 ctacagacct tcatcacaga gaaacttcca ggaagatgga gggttctcgt actcgcttgg 35281 cgggggatga agtaaaccct ctttctgagc agctatcgag cctattagct tcttgcccgc 35341 atcccaaagt acgattcccc cggacaacca caatggttac tcaactaaga agcaagtcct 35401 tcaaagagac agctttactt accttttctc cctagatgga acaactcaac tccttactta 35461 gatttcaggg ggttaaggta gttgactttc tggctgataa atcaggtaac gaagcctaag 35521 atttagatga ctgattagat tactagtttg tttaactgag attccccgcc taaatcccat 35581 atcttaaagt aaagactata actcaacagc aagctgcctt tacttggctt caagtcccat 35641 aagagaagga agaatagcta gaagctagag agtagctgcc tagctacaaa ctcaaaagcc 35701 tggtgaaagg aaggcctttt ccttacttag cttagagtcc caaaaggtgt tataagctgg 35761 aagctaagaa gctagtcgtt tatgagtagg agttccatcg gtccagaact agaagtcaac 35821 tcaaaaccgc taaaaggacc tattatttgc cctattagga cctattattc taaagtacca 35881 ttggagctcc ttttcccgac aacagatcgg agattttact ttctcaagtc atacgtcttt 35941 tgttcagcca acagtgttcc gctctactta ttattactta ctagcgccct tcacttcaac 36001 tcatacatac tactttttga ctttcagctc aaatcaaagc tacttgctta taagagaaga 36061 gcaaggtgcg tagctggctt gttaaaaagc atggaaaggt atccagaaag cacagtaaca 36121 gctatgatat aggcaccctc tcgcctaata cccactacta aggattgaag tagcacaatc 36181 ggctatagaa ctccagatct atgaatagcc agagcctagc ttgattactg gaactaaaca 36241 gcaagctgct cctaccttac ttatcgataa ggagtacagg gactgagtcg acttctttta 36301 gttctctttc ccctagcagc ctttccttct tgcttagagc ttctattcct tttgcattgc 36361 ccatcattct actcaaaaga ggacgaaggc tactttcggg acattggttg cagcgatttg 36421 tagaccaata gcaaaagcag caacgattcg atgctctctt cctccaagag cgcattcaag 36481 
gtctaatgaa attcccggat cgagcattct tgggctaagt attcactgat tcaagatttc 36541 cggtccacgg taaccgatac ccctattaag cagcatacac attgagtcaa ctctgttcac 36601 tccgaggcca ttcaatcaag gagataagcg ctgatattcc gatctgccta attcgtcgag 36661 tcttccgtcg gcgaagtttt tcatttcaaa ataaagtgga aactgaggac tgggctaagt 36721 ctttaatttc ctctatgtga ggaattacat tcgccctata gatcaaggag agtttcagca 36781 tagatgaagt agtgccctgg tagctcttcc gagggctggt gctttccttc cctcccgggc 36841 gagaagggta ggtttttaat cctggctctc ttcttttatc tttttggggg ttctccagtt 36901 tcataattag aaatcttgag cagagcttgt gcagcacatt agttcactct ttggcaatcc 36961 tttgaatctt ctactagtga agcagcccta gcgtagcaat tttagttttc cttcctccac 37021 acactttcgg ttgataagtc aagcacataa aaggatactt cttttcggag ggatttatga 37081 tggatcacgg atgttccaac agatggatct atcgggaatg aagaaatatc gaacaggcct 37141 acgtctatcc aaatgctaag ccttgtcact agccatctat ggtatcctcg gtatgacgat 37201 gaatgtctct aacctagcta gagccctgtt gagatcctat tctctcgtat agtcagctac 37261 tatagaccct attccgccct ttcttttcta cattccttta ttgagccgaa tcaccatcat 37321 tatattatat attcattgtt ggtttggctg ctggtctagc gaagcaacct ccttctatag 37381 tttatttctg gctggcctgt cggagtccta ttattgtaga tgacaaggag ctatataatc 37441 cctcttcttt tccaagaaca acgacatgct cttagatctg gaaggctaga agagagtagc 37501 tcatgcccta gaagtccgat cgtataatat ccaaatgttt cccccggagt ggttcaagct 37561 tatgtgacca aagctggaaa gaaagattct tctaaacttt acccaaaccc aatgcctgac 37621 catcgatggt agagcggtca gtatgatgac ggagaggtcc tgtcttcaaa tcctaggcgg 37681 ggttccttag aatgatcata ctcacgaaga tggcatcgaa aagctaaggt aaaggcaaag 37741 ctactccacc ttcaattgta actagccggg catcagaaag agcacgaggg taatgcctgt 37801 aagacagaag cacttccatt ttatgggctc tccaactggc tcaaaataag gactgggagg 37861 caaggaaagg agaaaaaatt tctcactccc ggaagttttg gtaaggacgt caaaaatata 37921 cgtccgaggg ttcaattcag tgatcggatg cccccggact aaagtcaaga tttcatacat 37981 atcgaaaaca atagctagat actctgcggg acctctttga gaagctctca ctgttcatac 38041 attctttcta attaatcgtt ttaccgggaa tactatagtg taagattcat tcctttgttc 38101 acaacaattg cccgcaagat gtttctctct acgttggatg atcggagatc 
catggatcac 38161 tgacggtatc ttcccatggc taagcgtcta aaatacttta ctatatcgaa gatttctctc 38221 ctaaagtgtt ttccctttgt cagaaacaac ctcaagcttg gaatggatct tacactattg 38281 gtttaccggc ctacgcggat cgtacgcaat gtacctgacc tagattgcga ggcactactt 38341 gggccctagc ttctcgatgc caattccgta atgaggctct tcttcgatta gattgagatc 38401 tgattttctt ctcttacgat attatcgatt tctagattga ataggaatgg gtcagcacgg 38461 tgcagcggca acttaaatat atctaaacgg attgcgtggt acccccggac cgaaggaagg 38521 gcttcaaaaa tgaattgaat caatgaaatg ttgcttggtc ttcactgaga acttgagtaa 38581 ggagtagatt cttttggggt aactcgaaaa aggctttttt ctaggcaatc gaccactgca 38641 tctgcatttc acaaaaagat ggttccgctc gccacaagca agcccttccc tttacaggga 38701 agctttcacc cttccccatc gagtcgagta gcctcgaata actctcttct ccctctttac 38761 tttcgcgtac tcctctgtct ctttcttctg tcctttgtcc ttctcttccg agcagtagca 38821 gcagcggtag agtcggtaag tcaattgaaa gagatgctca tctgctctgg ttagctcggt 38881 agagtggcag gcgaaatccg tctctcttta tagatataga ataggcggct agtcacttct 38941 tactcgtgta gtgagaaaat ccttcttttt ttcatgataa tagtttttga ctctttattt 39001 tgaagcaagg tggatctata cgggaaccgg taagtagata gttagaatcg tattcttctt 39061 tcttcttgat tgtttgcacg tttgtatgag aataaagctc atctccttcg gtaagcacat 39121 atagggtatc gagccctcgg cgcttgagag gaccccgggg agctcttctg tgataggctc 39181 cgagttcctg actctactag gcaactaaga cattagatga gggagttgct tgtctgttta 39241 gcgctctgta gttcatagat tggatattga attctccgaa gatgagtctc cctcgagcct 39301 tacttccaag cactgcttct atagtcaaca aagaagtagg gtttcagact tcgtcatgag 39361 tttgaccttc tgttctaaga aagaatgcct gagcttctgt agcgcgaaga caagaactag 39421 tcaatttatt atctccaatc tggaataacc ttcttcctcc atccatacac tgacatcagt 39481 tactgaaaac cgagatatgg gcataaagaa aataaaactc tttgctttga atagctatcc 39541 cgtcgtactc aagggatgaa ggaagtgaag ttgagtttcc tgcctctatc ttctagttat 39601 gtaccttata gcccgcctag tccccctttt gggaggagta gtgccagtaa gggtaagact 39661 ctcatatgat cattcctagt ttccagccat cgattgagtt gcagtccttt gctcaaccag 39721 ttacacgcct gttgaattgg acttttttca aattgaccct tcaagcagtc ttacagccga 39781 gtctttcttt gtttagttca aagaggagta 
gagggtttgc tgggaattcc tggtggaaaa 39841 agatcgtgaa atggcaatgg cttttattag tcagatccgc ccttctcggt ggtcgttagc 39901 agctgctatt tgaaatccat tcccgctgct gtcgtcaacg gtagaactct tcgggcatcc 39961 gtccgttctt cttctgtctg aggcaaaggc gaatccctta tactgtatac ggtcgagctg 40021 gcttggctgg tacatcaagc atatcggcat attgcttgtt cggtgtggta cacatatctg 40081 tcaatgtcaa tcaagtaata gaattcattc ccactgtctg aagaaaacgt gaataattgc 40141 ctgtttatga gcttggaaga cccgttgaag tccccgaata aaggggaaaa ggctataagt 40201 aggccgtttg ctattgctag aagggctgct cgcctttata cggcttggct tcgctatcgc 40261 tcctatgtag tgatcggcct caaaagtggt attcctgcca tacaagccta ttttttctag 40321 tgctagaagc ttcgccagaa gcaagcaaga cgagggagcg gagcttcgta gacaaggctc 40381 gttacgtact tgactgacga gctgtctact aaacgagcgt tagagcgagc gagttaagcc 40441 tctaaagttt gatagctctt ccccctcccc cccctccctc accttactct taaatttccg 40501 cttaagcttc cccggtttgg gggattaatt gattggatac ccgagaacca taatctttcc 40561 taggaacagc ttctacgacg atagataggg gtcagctttc tttggcatct atgccccctg 40621 ccctccaaac agtatgggag cctttcagct cgtactgctc acactcctag atcttcacgg 40681 cacctcctcc accatagtgt gagctgctcc cagcggagaa aagcaaggcc tacttcctaa 40741 ttagctttca acaacgtcaa caacaccacg aaaaaagtaa acaatggttg cccactaatc 40801 tgatcatagg tgaaatccaa tcccttcgct tcgcgccagg ctttgaaacc gtaagtcagg 40861 ctcctttcgc ctctcctttc agtcgagtca cgtcagtacc tctcagaagc gagaaagcga 40921 gcagcaagct gaaaaaagga gaaagtggtt ttataaagca aaataagcta agggggctgg 40981 ctaggaatcg caagaattga gaagggtggg aaagacaggt tcggtaatgg ccggattagc 41041 aggaggaagg tcttgaagag cctgaaacaa agaaaggtgt acataaaaaa agaggctggt 41101 tatggccttt acttgatagg actcctttcc catcccggga agaggggata aaaaaaacct 41161 tgcggaagcc ctgcccaccc ttccggatcc ctcatattta tgattccagg cttcccggac 41221 tcgtaataga cggctaagaa caagaagagg gtcagtagtc tctgccgttg caggtccttc 41281 tccttccgct gagacggccc tctttttttg tttgttcacc gcggcacgaa atcgaaatca 41341 tgaagaagct ggaataactc agaaagagag tggcgcctag ccgttgagag cgtctattat 41401 ctttgtagag gaacagtacg atcttggact ggcccccttc gcatgaccta gaaagatcaa 41461 gaagtccatg 
ctactataag gcctctaaac tcctccctca ggacactatt gcccatgggg 41521 acggggtagc cccgacttcc ataggtcctt ggttcgacct cctaatgaga attgaggtcc 41581 ttgcgcgggc gtctcatccc taagacttgc ttgctctgta tggagtgccc tgtggttcct 41641 cgagtgccag ccgcagagag gaatgccatc aactagggcg ctattggcca ctaaccactc 41701 gctcgccagc cgctcgggct ccgcgtttca agttcgttat cctaaccgtc ccctctgctc 41761 caccgggtgc ctggcccttt tcttctatct tatctactac cttgctcctc ggctccctac 41821 agctccagcc gctcactgta atagcttgct tctcgggtgg ctcgcacccc gggtggtgcg 41881 gctgagccag agtggggctc aacagtcggc ctatgtttcc gggcgcacgc gtaaaggcat 41941 gattagttcc acaaatctca ctgcactgac catagtaaac tccttctcgt tgtaccaaaa 42001 tagagatttg atttaaacga ccaggtacag catcacattt gacacctgag gaaggtacag 42061 cccaactatg aggtacatca gcagatgtta caataatacg tagatgagtt ttggctggta 42121 caaccactct attgtccact tctaataaac gtgattgacc caattctaga tcttcttctg 42181 gaatcatata actgtcaaaa gtgagtgact gctcatcgga actgttatag tcagaatact 42241 cataagtccg ataccattga tgtccaatag ctttgatagt aatggctgga tctactacta 42301 cctcgtccat tgagtataag agagcaaatg atggtatagc aatgaacatc gagatgatac 42361 taggaaagat ggtccgaaga atctcgatag tagttccatg aacaatcctt tgcgggattg 42421 catttttttt atagtggaaa tgccataaag cgcgaaccaa gatccataat acgaaaacca 42481 aaatcagaat gaggaagaaa aagatatcgt gatgtaagtc tattattcct tgcattatag 42541 gtgtagctgc gtcttgagat cctaattgcc atggttccgc tgcatcacaa ggagaaattg 42601 tgaggaataa ccattttaga acaatcattt tcaaagcaaa ggttccttca ttttctgctc 42661 cccccaaaca aagagagact gattctgact ctcccaatta aggaagacgg aaatggctgg 42721 tgccggttgg tccaaccaag aaaaaagaga tgggaatttg gggcgtaaga ttcttcttct 42781 tcttacaata ttttgagtta gatgaacaga tcactctcct aaaagcagca gtcttcttat 42841 atacgaaacc aacatcctta taatactact aggccccacc acactgaatt atgaactttg 42901 cgcctccagg agggtcaagg cagaattcca ttcctatctc cttcgtctgg tcgagaaggg 42961 actctgactc ttctattact acagtacaga ggattagtcc cattttttcg tcacgatcga 43021 tccacgtccg ggcttagaaa gctagcttac taaggcgaag gaccgctttt tattgattgc 43081 acgagctaag aacagatcca caatctatta tctaatgaat attcattaga tagctaatta 
43141 tcctttgcct agctgcccat tgctagaact tccagcgcta aggtggttgg tgtggtaagg 43201 caaggcaact cctctttcct acgctaagcg caaaaggcac tcgaaggagt actgggacca 43261 accatcacta ccatagggtt atagtggtaa atcctgccac ctcagattct tattttacct 43321 tacgtgtcaa caagcaagtt gggatgctct cccttaacgt ggtagggctc tgtttcaggt 43381 cttgacgttg gccttattaa aaaaggcatc ctcgtcgcag caaagcccgt gtacaatgct 43441 caaaaaaaac tgagtcaaaa acgagacttt cacaggaata acacatcttt ctaaacaaca 43501 acgggttcta ataaattaag ccttgtcatg gctggttgag gttagaattt cataaagata 43561 aaggtgggta gagaaggccg cagtagattc ttccgaccga gtcccagtgg cagagtcttg 43621 aggcacgaat ccaacggcag gggaatcagc ggctgatcgc gattcatcag tcgcaattct 43681 cccagcagct atccatttta gttcaccagt ccattactga gctgtgattg cataggtaat 43741 agagcaagct ggagcggagg cagcagagat gaaggtgaga acaggggggg gtaggagtag 43801 ggcacctagt tagttacagt ttgcccgaag catcgttaga ggcataggca gtcagaaata 43861 aaaatcgtgg tcgatctttt tcaggctatc aaatcagcaa gggtacttga atagtcaagg 43921 ctactcgata aaggatggct ggcattcatc agtacgtgtg gatccgccgt aggggttggt 43981 tcagccactc gacgaccggc acttgccatt gaaaaaccgc ttaatcgtgc tttctattcc 44041 tatgaataaa agaagaagat aaggaagggt agtgatacca atattcactc aacgccaaag 44101 aaagcttcta aaacaaggtc aaaacttcac ggatttcata aggattaata gtcataggaa 44161 aggtacactc aaacgaagaa aagatctttt ttgcaaggtc accatagttc tatttgaaat 44221 tcgactgttt actacattac ttcctgaaag tgtgcccgtg ccctcagagg ggaaggggtt 44281 ttggaatagg ggtaggaagg acctttatgg gtataatcat gttcggtcct ccactcaaca 44341 agccactgga ttagcttccg gcggaatccg cttttgaagg ccttggcctt ccattgagac 44401 aagagatgat tcacgctgaa ctaggatgct tggaaattgc gtactttttt tctgacacct 44461 ggcacaaaga actcgagtgc cgcctgctac tacggtgtat tagagttcga gcaacagatc 44521 aaggttcgag ctaaagctaa ttccgagttt tcgcgggaag aagtacggtt tcggacagct 44581 catgatgaac tattgaatcc ttctccatcc gaatccgtct cagttccagg tattgaactg 44641 aaagatgact cgaccggagg aaaaagccac tcggggatca attttccaca tcagtctaag 44701 tggctttttg gttggtagca ggacggtaat tggcaacgaa gcagagattt attttaccca 44761 ttaaaaagta tatttcacca ttcctctcgt cgtccagaat 
gggtgactca aagaaaacgt 44821 ttcattcacg atgaggcaga attgacataa tataaccaag attggatggt gacatataat 44881 atagtaaagt aaggggttcc tccgcccgtc ttcgatacaa tcattgcgat gtgtcactca 44941 aggatagcta gccattcttc tctacccatg ccatgattta cagatgagct cgagaccatt 45001 attgaatata ttatgaacac aatttgccaa gagttggttg tgacaaaagt gattgggatg 45061 cccatctttg gtccagaagg ggttctctat gaaaagagtt tccacctttc tctactcgaa 45121 aagccctatc taaaagggct tgtctggatg aatgcagtgt cggaagccgt gatcacatag 45181 taacttccgc ccacagtgct attacgacgg cgggtcaccg ggagtgaagt aaactcggct 45241 cctgatgtag cattcattcg gaccattcga cgtttgattc tttttatcag ggataccgat 45301 gactctgtga gaggtgttct tcggccaagt ttcccatgac gggttacccg gttcaaggct 45361 tttctatata atgagaaata ctactttctt tctagcttaa gtgttcacgt aggtaaaata 45421 gcttctatag ctccatccaa tagtaatcaa cggagataga gtccagcggt tcaaccaacg 45481 cttctaagga gagcggggca agcaagaaag caggcaaagt cattgagcct attctattcc 45541 gaaagttcaa ctactggata aacaacgaaa gccgtcggca ttcttctcct actgtagctg 45601 ctacaattgc tttagcgcga gcagcaagga ggaggcagct cttactaaaa aagcaaaaag 45661 ggaagggcat gacagaaggg agatagacct cctataagca ttactctctt ttgaggtaac 45721 ttacttactt actctgatat gatgagttcc gtgggctagt aagataacta ttgaggtgag 45781 ggtccgaagg agatctttca ctatttatgc ttgtacagtt ttatgatagt tttgtaaggg 45841 aaagcaaatc ctgagaatgg aatgtaagtt agggaacagt agtcagacgc ggaccagcgc 45901 aggtgggcgc cacagctgtc ttcctccatg tgatatctga tatcacgcaa caggagattc 45961 ttcaccctta tttattgaca cgatttacaa aagaagaacc aacccggtcg tgctatatgt 46021 atttgggatc aaaaaggctc actgctgtat gatataaatc tttctgaatg agagttcatg 46081 agctacagga agagatcgag tctagaataa accttagaag tggcaaaaac gacttacttt 46141 gttgcaacgg gaactactcg ccccgggcac tggtgaaata gaatcttatg tgttgcagct 46201 acttatgctt tcccaactag aaatcaaacg gaacaaaacc gagtcttgct atttagagcc 46261 agggttcctc cgggggtcac aaccagagac ctaccagccc catgtaaacc actttcggaa 46321 gggacaaagc agttgaactc tttagaggaa gagtctaacc acaagtcaaa ctgtgaacat 46381 cagaaggatc gatatattgg taatagctct acacgagaga attgaaactc cacctgcaat 46441 tccgagtata ggagagtcca 
gtcaataata aagtcctggt tgcagcatcg ggtaaatggg 46501 atccaggtaa atggcttact gttcggtcaa catttcatga ttcgtaatta gtcaagctcc 46561 ggccggttcc tatgtggtga atagggatat atttagtatt aaagacatgc gagtgctccg 46621 ttcgtcagta agcgaaagag actgaaacct gggagaattc cgctctgtta agagagagaa 46681 ctaacgaaaa ttggagactg acggaaggaa aggggttcct cccttctcct ttaggggaga 46741 agcttggttc ctccttctcc gctgagaggg gagaagagct gggttcctct ttgaaaactc 46801 tttccgtata ggccagaaaa cagcttgctt agagaaagac tgactctcct acggacctgg 46861 tggaccttac agtcgagtta ttgcatcgat ctcacaaact atcaatttca taagagaaga 46921 aagatcgttt ttagatcatc aagtgaggac aggtagtagc tctggtagag cgagggactg 46981 aaaatccttc tgtctgcggt tcgaatccgg actcacttct agctctggca caagttcctc 47041 atccttagtt gtttgatgag cctgcgtagt attaggtagt tggttaggta aaggctgacc 47101 aagccgatga tgcttagccg gttagagcaa aggacttgaa atccttagag caaagccagg 47161 gacttgaaat cctttttgtg tcagctcttt ggaagtgcct tttcctttac ttcagtaagg 47221 agtctcaaaa taggaaatac ctagctagca aagtaagcaa taacggaagg taacgaattc 47281 tccgtttcga tattaacaat ccggatctag ggaaagtagg ttcccgcatc acgtaaattt 47341 catttctttt gggaaaataa tacataaacc ttttgagtca cagccacctc agcacaagtt 47401 actcgagtag cacagaagcc atcatcaata gcttgcttgc ctgggagctc aaccacgaat 47461 tctctgtctt ctaacctttc tatcacttga ggattatcta cgcaattcgt aaatgatcaa 47521 actagatagt cccttatctt actggccagt agtagaaggg gatcttgaat caaagagttc 47581 ctgtccaaca agcaaggaat gtcagtcata aataatgaaa tccatataac tctactccgg 47641 gttgtgagaa catcgtgcct gttgcgattg tgtggctttc cgcatggttg gagaccccct 47701 atgaacaaac aaaacagtag ggtggtgtag gagaaccccg actccctaat gcaagataga 47761 gctcttaggg tgcgtttcgt ctttgtcagg aagaaaaaag tggcaagagg agtgtatcca 47821 tacccgtacc gaagagatct tgggaaggca tctctataga tttagtagtt ggactaccaa 47881 agacccagac agcttcaagg ttaggtcttc agtgttaggt cagttcgagg tcagttgttc 47941 cctccgtact gttgctagag agctgctttc catctatctt cctacatgaa aggatctaag 48001 tctatccaaa taaaatagcc cagaaaatga cagccatcaa aaggcgcgca cccatatagc 48061 ctcggcctgt ccaaatgatg ttcagcggtc tctacccaag tagctgtggc ccatgatcca 48121 
aaggacccgc aaccagtcaa tcatgctatc cttaccttcc aaccaatcgg ccaatcacgc 48181 tatccttacc ttccaaccaa tcggccaatc acgctatctc cttacctttc aaccaacccc 48241 ttcgattccg cttctgcagc agtatataat ctcggtccct taccttgatg cctacagctc 48301 aatttgtttt ccagtgatgg caagaatccg atccgcgaaa tactgctttt tcttttttct 48361 tgtgttgttt ctgaatggca tcatagctac acgagggaaa gcgatgctgc ccactctgcc 48421 gcaaaagggg gccgctttct tcccccccaa aatgccagtt ccaccatcag ggcccagcaa 48481 gcagcataat tctgctcctc gatcggactt cgtgcaattc ttttatatgt aaataaggga 48541 ggggtctcga ccacattttt gagtaatagt aataggctaa ggcggattcg tctgcttcgc 48601 ttacgtagtc aaaacattct cgctggacac ctggtaagga atatttgtat atgcgggaaa 48661 gatactctca cctacagctc ccgttactag tagttccggt tagcccactt gcccgagcac 48721 actctcaact tgtagatgat ccaacagaaa aagcaagaga ccacgaaagc acgcatgaaa 48781 acagcccttt tgaaagcata cccaaggcac ccatccaatc tgaacttata gacatcaaga 48841 agaagatcta ttaacacgca tggctaccta tataacaagt tacctctgac ctcttctgaa 48901 agcatttgag acttctcgtg ctgaaaaaga taatttcatt gacctacggc acggaaggaa 48961 agcaccgctg ctagctcgcc ttgtccacgg aacttatcct gttgagaatc ctatcttgct 49021 tgactgcgct gggaagtctt tccttaaagc attgatttta cttggagtat ccgcttaagt 49081 gagaaaagga acttgattta gcttgaacct tgagccgtag aagaggacct ttcagaagga 49141 cggctggctt tagcccttga gcatggacga gagtgaacct atttaaaggt ttactgtatc 49201 gtttactggt gatgggaata tgtaatattg ttgtagaagt ttccagcctc acaggcttta 49261 gattgaacta tggaaccgag cgaagacctt tgagattgag attccccagt ccacttatga 49321 aatggatccg gattaaccgc tttcgactga gatgtaaagt caccttcgct tttgcttttc 49381 gtattctatg cggctttctc caactctaat ctcaggtttt tccttctcta tgaactctat 49441 agattgatcc acaaagagat tgtacctata cctacttcag tataagttta tcctgaaccc 49501 tatggcactt cgttgcttgc tccagtgagc tacctattta atgaaaggga tatttttatc 49561 ctttttcagt gtgtgtacac gcttgaaagc ggaaaagatg aactcagact aataaaagga 49621 aacggaactt agaaaacaaa acaactaagg atgggctctc cggtggatga aacgacccct 49681 aggagtccct tcgattgggt gagggcgaag caaatctaca atccttatgg gaggatggga 49741 ggcctaccac aaatatgggg attgaaaaca ggcaaacggt ctcatacgtc 
gtaaataagc 49801 attcttaatg tcaagttggt agataggcca tagctgtcgc caaccaagtc tattagaacc 49861 atcacagttt aaaatttagt aagcgaaggt ctctttatca tccttcccaa tcacttgcgt 49921 ctacccttta gctactaatg acgctttcta tctctcaata gaaccatcag gatgagactg 49981 gatcttatac attagagcct ataagccttt gtattaggtg gtaatgtccg aagtcttctt 50041 ggatttgagt gcaccaattt atttttgcat ggcctctttg gtctcattct ttttctgaga 50101 agaaggttca gagacagcaa gaagattgtt caaagaagct agatagtctg ctgggaatgt 50161 gtgtctatta tgcagaggaa ttgaaagaga atgtgaacag agaatcaaca gaaagtttcc 50221 cttcttcttt ggtaccgtag aagcttggcg atgtgaaaag atagaattct tgaacgagcg 50281 aagaagcaat cggggcagcg tatagcctta gtcacgcgca tccctttcct gcataaagat 50341 gagctagata gcacggcttt gaaagaaaga tatctgaggc aattttccat tcaccttttc 50401 ttgctggtca agatagtaag ccgagacttc tgtagtagct ttctgttcct tggataagga 50461 gtcagttgaa agtggtgaat aagacccgca aatccccctc taaatggttt tttcattttc 50521 aatctgtaaa ttcatgaagt caagaagtgt agccgaatca aatagttggc tctagtctaa 50581 ttgactactt ggaaagaaaa aagtgattta gtgaagtctc tttgatgcgc tacgaaggtc 50641 aagcaggaaa cagggaagcc ctttcgacct aagccaggct aataggttga gtgaccagga 50701 gagtagacca gagaaaggtt atgtaataga ggcaatagtg agccccgaag agtgcttttc 50761 tattcttccc cttattcgct acgtattcct ttcactggga tttttcatcg ctcctaatac 50821 tatgacaagg ctcgcttatg gctcgctccc gtggctctag tagtagtcca ttttattcag 50881 aattattctt tctttttgtt gatggtgcgt tggaagggac gaaaggagct agtgagcaat 50941 gacatccgcg cggggaatag gcttgcttct tcctagcgat cgggcggtcc tccctcttcc 51001 tgttcaccgc cggttcgccc actgctgctt tcatagatgt cgtgcgaagg gacagagcta 51061 atggatgcct attccgttct cctgctcctg ctccagctca agagttcaaa cggacagaag 51121 agagtcctgc taccgaaaaa aaagtgcata atattggatt caaacaaagg ggatttattc 51181 acgaatacga tttctattga ttgaaagctt tcctattact gatggcagac gggcgagacg 51241 gagaccgtcg aagaagtgcc ttgacgcgcc gcagccacta ctttgactcc ttttatgcaa 51301 ttatgaactc cacggaactt tctcgattcc aacgcaactt atccttttgg agctgacgta 51361 acaaactagg cgagcctccc aacgaagcca acataaatcc tatccgaata aaaaaaataa 51421 aaggcagttt cattatggtg gtataccaac 
ctcctgttct ggaacttcca gttccaatcg 51481 aagcatatag atccgtaaat agatttgtat gtatagccac ttcagccgtg ctcgtccttt 51541 ctcgaaagta tcttttttct gggaacatgg ttaaccaaaa attattatgt ttgcgattct 51601 tcatccacca atgcagaaaa tgctcaagtt ttccattctt atatgaggcc ggaaagttta 51661 ttagcaagag gtcggcatga agtgattcat catgctcaaa catgagccga tcgttcgtta 51721 gggacggttt atagatcatc aaattcccac aaatggaatg aaaagtgggt ccatgtaaat 51781 gatcgagacc tcgcaaacaa caacgttcct tccccatatg gagttcgtaa cccaaaggca 51841 atcgttgagt gaactgtatc ttctttgtgt tagttgacct aagccgcacc attactgctg 51901 gggtggggct cggcctccga accgtacgtg ggacgagttt ctgcctcata cagctcgggc 51961 cgaagaccgg gggaagttta ggagagatgg ggagacccag cagctgccgg tcggggcggg 52021 ggtaagcttg tgaagaagcg agcttatccc ccccaaaaaa tcctatagcg ctagcgcttc 52081 gcgttctttc tattttatct atctcatttc attccgggat aggcggctaa tactaatcta 52141 ataaagtgaa gtagtcgtcg tctgaccaat tgactcggac accagaccgc tcgtgcccgc 52201 ccattctgtc tcgccctaaa tggaatggct ctcttagtta cgctgcaccc cgacccgagt 52261 ccccacgtcc gctcttctcc gtccgcaacc taagaagttg gctttgccaa cacaacatta 52321 gggccgtccc cttcattcta tgctgacccc gcccggcccg gggctggctt tttgggaagc 52381 ccgttcccac cgcgctcatg gcccggctgg cctgccagcg gtagtgggaa ttatcccgtt 52441 ccctggtcaa agacttggtt ggatgcggga tctactccac gaggagcggt acggacgtag 52501 atgatatcat cacgacctct cttttcgtac cgctagggat gcttaacgcc acttcgccaa 52561 ctagcgttac ccgcgctttc gtgtctctca gtgtggtcag cactgggtgt ttccgagcag 52621 cgaagcttac acccattcgc attagttcat ccaaagttcc ttacccttat gcacgaattt 52681 tggaataagc catcttccta tcaaggttaa ggagtcaact gagcatctca gcggcgggat 52741 tgaatacccg gatcgaatca gagttcacgc cgcccgccct gaacaaatag gaggcgtggg 52801 ccacaggtcg cacataagcc gccgggtcgc acgacagaag aacacccaac atacagatgc 52861 acactcctcc atgtgaaata ttcatcttca ttggaacctt ttggtagtag tcgtgaccaa 52921 cagccatagc catcagctgt cggctccttg gtaaggcgag agcttcaagc ccgatttctg 52981 gtggcacgcc ccccacacaa gcaccaccca actcaggggg gacggggaaa gctagaggcc 53041 caaaaccttc gacccttctt tctaaatcta aatgggggtg cccggagcac ctcatcttgt 53101 catttcgtcg 
ttgctcattc ccgttcggcc gaagtgtttg gcctttcctt ctccgcgccc 53161 gctcacgctt cgctgaccta tcgcgtgcta aaaagaggaa agtacgaaag aatagtaaac 53221 agagcacacc gcagaaagat tctaaatatg agaagtcccc cttggattcg agaagacgga 53281 aatgaagaac gggaacgaaa aaaaataagg cctttctagc tctagtttca gatgagcttc 53341 tcccaattat gtctggtaat agaatagggc gacttgctct gaccaagact ccactttttg 53401 ctccgtccat ggagcgtatg attttccttg aatgtagata gaccaaaaga gggaatgaag 53461 agataggaat aggaattata gtaccattgg aaaaaggggc acccgtggga acatctctac 53521 tgacgaacca tttcaatagt acgggtgctg ccgtgccaca aggcacgacc ataaaaataa 53581 taaaaaagaa aaagttatgt agttggacca tctgctctat ccgttcgagt ttggcttctc 53641 taaagaagag aaggcgctaa aaatgaaaat gttcgcaagg cataccgaaa ggataccaat 53701 gaagccgacc attaatgact agttcgaaca ccaggagcgg aaggagggac ttgaaccctc 53761 aaccttagcc ttggcaaggc tatgctctac cattaagcta tttccgccag ctacggtagt 53821 ggcgaagcac tactgagcaa ttcacgtatt acttgatacg gacgacttcc gtttttccgc 53881 cggaccacgg ctcttgacct ccgatcgagc tacgagcacg agtaggtagg cggcatctgt 53941 cttttttttt ttgaagttga gtcatttcct ttcctaagac ggctcctctt attctacgat 54001 aatccaagca gcagctgccc gagtccgctc ggaaccagtc gattcctgag cctggactca 54061 aggtctctcc tgcctgcgag tttgctgcga atgctgcgag taaatacatg aacgagacga 54121 gtgagttagt ctccgagcga atccggcggg tctttcaaac atccaagttg ccacttttca 54181 gctacttgga atgcatgaga cccctgcatt taggaatctt aggttacctc cagccttctg 54241 aagcgagttg accttctgtt gaccttatat ctgtctacct tatataataa tgaatatgaa 54301 aggaaagcac taactcctga gcttgggcaa agttggctaa tcgatttaga ctattataat 54361 agaataataa atccgcgact tgggaaaccc tttcttcatt cgtttggcgg gaaaggagaa 54421 accatttctt cattcattgg cgagaaagca tttcgtgcct tcgtcctagg ctcggctact 54481 tctttctcaa ataagacaaa gattgcacta gaccttgagc ccgggaagac cgctactagc 54541 cctttactct attggattga tttggtcgag acccggtttc tccttcctag aaacctgaaa 54601 ctgcaatccg cgaacgaccg ctttcttcct tattagccta atatcttact ctcggactac 54661 atttgtctaa gagcagttga ggaatctgca actgaaagaa aggaatcatg gaatctgaac 54721 ccaaggttaa aaaaagggga aagaactacg gaaaaggttc tatcacgtta atcgtcaccc 
54781 acttacgctc ccaagccatc tctggaattg aacttcctcg gaacagctac aatgctttga 54841 attcctctgt gctttcacga acaagccgaa taaccttcta tttcggaaga ggcattctga 54901 taccattcac ccatagatga tattgagaat tctcaggttg aaagcccatc aacttaacgc 54961 tatctggggc cattcagaaa ttataatcca cgaaaggagc catcaccgga ctcgagccgg 55021 gagaaccttt gaacaatact ggctgccttt gctcgttgac gactagcctt tgttctttga 55081 ccgagatgtc gacctgatca cttttctcaa atttcagcaa caacaataag aaatagtaga 55141 gccttcaacg ggacattcga acgcaacaaa gggtttcttg accgcctttg aacgacaact 55201 cgataataga atagaagact ttcttcgtcg acttactcgc tcgcatgaca gcatttagtg 55261 gtctcttcac tggcttaggc tatattgggg aacgctcggc tacaagattg aaaagaccga 55321 tatgatagct ccacggccgg agccgatact tctctctcaa accattacct ttccccgcca 55381 cattcttttt tgacggagag acgattcaag aggatccagc taggctgaaa gcggaggatt 55441 ctttctttac ctgtgccccg gacccaagct cggctgaagg caaggaattc tttctagact 55501 gacacgcaag ctaggctgaa agcaacccgg gttacagcaa tggggattct ctcttctatt 55561 agaaatgata ttatattggt gactactccc ttcagattga agatttgaaa gataacacgc 55621 ccaccgccgg atactcttca cgtgctgata cctatattgg attggtgaac aaccctggag 55681 tttaggacat ttgaaactat agaaagacct tagccgcttt tatgatatag tcctgaagac 55741 ctaactctta gaaatgacct actaactcat cgacaaagaa aggacttcgc cttgaccttc 55801 tgacctagac gtcgaaccag tgccttggat tcatcaacaa agggatcaaa ggaactcatc 55861 tgtccctcca aacggaaatc ttggttttgt aatttgatct ctgacaaagg gagagggaaa 55921 gaaagaccaa tccgtatctt tgaaagaaaa agattctcag aaacgctacc accgaacatc 55981 gacttactcg gcatcttcgg attcaagcat ttcgttcgtt cgttggcgca aaagcctcgt 56041 ttggcgggaa agcattcatt cgtttgcggg aaagcaaaag cgttcagcat tgacttccta 56101 aagcgacttc ttagcgtgct tcaacagcgc ccagataatg ccttgaccgg agccggaggg 56161 agacctaaac agaagaacga cctcgcagtg gcaacatagc ccatcctcaa ctcatacctc 56221 agattttgaa ccatactttg agtcctcaaa ctttgagatg gactggacca acactcttct 56281 ttgagatgga attcccacca ctcttatttg acatgaactg taacgctact ctttcaaaga 56341 tacctaacaa ccttagctgg cattgatcac tcttccttcc caatccctcc atttagaatt 56401 gcaaacatcc caacttccag agctacctat gagccaaggg 
cggagggaga tatattagac 56461 tgatcaactt caaacaaaac gaccttcctg gactcgcttt catctccttc tttgaaacga 56521 ctcaaataat tgagtgggca ttgggagcct tatttaatgg tggattcctg agctatgaat 56581 gtctctgaaa tcgtgatgtc aaaaaggaat gatatgcgat tattaacttg cttccttccc 56641 tctaattttc atgcagtagt tgaatttttg cactcttacc tggatgcgat tttccgcctt 56701 gaatagatgg aggtgttttc caccttgaat tgagaatttt tgcatgctcg atgggatttc 56761 tgccttgaat tatgatacat ttattatgaa tttttgcatg ctcgatctga tttccttcgc 56821 tgtatggagt gcataggcct attctagcat ggggtttctc ggggtaaggc ttgtgctttt 56881 ccttccatat aagactagct atgtgctccc tcggtacgcc tgtattcctt tcgtggattg 56941 gaacctacgt actgttgtat tgtagcgtga tgttttccca ctgctttctg gcttgactca 57001 cataaagtac gtgcgtaact ggcactatca tgtaggtgcg aaatctgtct tctcctatgt 57061 cttgttctcc cagtttgatt cctatgtttc gaagtccagc taattagtga aggaaagtgg 57121 tatggatgtt tggagctcaa tccatcgaat tcactcgctt ctttgcgctg cgttgtagct 57181 cttagattat tgggaggagg tcatttcgaa tgttttgtaa tttggaatcg ttttttcttc 57241 aaggcacggc ttcttgaatc gtctttgtct tgtagaatgg ttgtcatctg gaatctaact 57301 aaatagttag ttaagaagac gcgtgctttg tttggttttg atctggaatt tcactataag 57361 ataagataag agtgaagtgc tttgtttggg caatctgcct ccctaaatct agtggaggtc 57421 ccggtcccgc cgctcctgtt tgtaatgggg ccgctttgtg ccaaatgtgt aagttgtttt 57481 ttcaatccgt tctaactgta ataagagaag agggaagtcc gtctaaagaa attagtgaag 57541 gaaagttata tggatgttca ttagttcagt aaagtgttag taatgttgct gggagtaacc 57601 ctttgatctg atctgtcttt gtacgtgtta tcttgcgagg ctgtcaggtt ggcttgtgag 57661 gatggggaga aagcgggccg agaatcttat gtcaaaagga ccaaggatga tcttttcgga 57721 aaggaggagt aggaggagtc agcttattct atagatactg taaaagccaa ctcatgtttc 57781 cactcaattt tcattacgaa gatgtatcac gtcaagatcc gttgctcaaa ccgaatcacg 57841 ccaacgttat ggaagttcct ggatcgtgtg aaataagagt agtaccaaag gcaccctata 57901 atttcataat aaaaaatgga aaattggcta tggagattcc gcgcggtcag aaattcatac 57961 agacacaaag gggttcgaca ggaaagtcct ttcgatctaa tccattcttg gggtcaaata 58021 aagacaaagg atatgtaagt gacctagcac gacaaagcac tctccgaggg catggaatgt 58081 ctaatttttc ggtcagaatc 
tcgacagtaa tgtctctatt agattttccg gtcgaaatac 58141 ggaaaaactc cattcaattc tcgatggaaa cggagttttg cgaattctcc ccggaactgg 58201 aagatcattt cgagatcttc gaacatattc gggggttcaa tgtgactatt atcacttcgg 58261 ccaacacaca agatgagact ttaccactgt ggagcggttt tttgcaaaaa gatgaggggg 58321 aaactcagta agatgtcgta gaagcaaaat agtagagatc acaaacgtag attgctcgcg 58381 gctaaatttg aattgagacg aaagctttat aaagcctttt gtaaagatcc cgatcttcct 58441 agtgatatgc gggacaaaca ttgttataag ttgtccaagt tgccaagaaa tagttccttt 58501 gcacgagtca gaaaccgatg tatttccacg gtcgccctcg ttctgtatct gagttctttc 58561 gaatttctcg tatcgttttt cgtggattag catctcgagg ttctttgatg ggcataaata 58621 aatcgtcttg gtagcaacca ccaaaccaat agaacaaaga aaggttagct ccgcagctgg 58681 tccacaagca aggtaagtaa gcccattacc agccggctcc ggaccgaaaa gtaacgtatt 58741 taaaaccctt atcttggatc ggagatgcga acggggcggg aatcgaagtg ggggacctct 58801 ctaccgcttg tgtctatttc ctgtcaagta tgctccccat acatagacta cgtacaggta 58861 gtactcttgg aaagaaagat ataatgcatg aacataacat taagttacga atgtaactcc 58921 cgaccactct tctaaatata ctaaggcgga gaactcttgt tcattggagc gccgtagtgc 58981 ggaggttctt cccatcatgg aagtccgagt tgggactgag ccttccgaat gataatgctt 59041 tgtttcgttg gaaaaaccaa cgcaaatctc atattgactt tctatcgccc tacttctaag 59101 gatagataga gagagttact ttatgaaatt atctcccttc taaagcagcg caagtcggcc 59161 cccccagaac aaagccccta ctcccgagag gtataaatga ctcgactaaa aggagaggta 59221 tttattactc gagcacttgt tgcgagaggt ccaacgtaat ttattactct tataaaagag 59281 ggaactcgac tgaaaggaga ggttgtttac atactcgact aaaaggagag gtccaaatgg 59341 acttctgtga attacagtga tccagtctca cggatatgga gcttcgccgg agatggcagg 59401 gcaaaacctg atggaccttt tttttttttt tctcaagagg tgatttcgag aatcaaccaa 59461 ccgacgagac taattcgagg atgtgttaaa agagagtcta accgccaagg caagtcccat 59521 ggataagccc cagcctccct ctcgtttcac tctcgttcct ctccttacag tcgagctcct 59581 ttgttccttc ggacctctcg cccaaatgaa atgggatgaa tccaatcaat aagcttattg 59641 attgattcag agcgcagcga agccaaattc aatcaaggca aaggggggct tacttttcct 59701 gacgctgagt catcctattc aaatttagct atgctaatgt aacaggaaaa gttttcacag 59761 
atgatatgga tcccaagaga tgagcgagaa cctccaattg cttaaggatc gcactccgct 59821 atcccgcttg gtggacgaga tcttctctcg ggtcatccat cctgggttac tgaagggttg 59881 tccgactgct cggtgaccga atcagagaag ttttgaccgc tttctcttct ctccagcact 59941 ctcggactga tcatccaatc catcttgctg cgacaaagca agcttaggaa tgaatctaag 60001 aaatttaggt ctctgcccgc ttgaaagatt cttctttcct tttcggtgaa agagggcaaa 60061 agtgtgtagg agaaagaatt ctaaaaacgt cgacgcttaa ttcgccccct ccatccttca 60121 aaagtaaaag aaaggctcaa atatcaatat tatatatata tttgaggata ttttagggcc 60181 ctagaacgca aaaaaaaggt gggtgaacaa gagttgtcac gataggaaag agaaatgact 60241 ataaggaacc aacgattctc tcttcttaaa caacctatat cctccacact taatcagcat 60301 ttagtagatt atccaacccc gagcaatctt agttattggt gggggttcgg tccgttagct 60361 ggtatttgtt tagtcattca gatagtgact ggcgtttttt tagctatgca ttacacacct 60421 catgtggatt tagctttcaa cagcgtagaa cacattatga gagatgttga agggggctgg 60481 ttgctccgtt atatgcatgc taatggggca agtatgtttc ttattgtggt ttaccttcat 60541 atttttcgtg gtctatatca tgcgagttat agcagtccta gggaatttgt ttggtgtctt 60601 ggagttgtaa tcttcctatt aatgattgtg acagctttta taggatatgt actaccttgg 60661 ggtcagatga gcttttgggg agctacagta attacaagct tagctagcgc catacctgta 60721 gtaggagata ccatagtgac ttggctttgg ggtggtttct ccgtggacaa tgccacctta 60781 aatcgttttt ttagtcttca tcatttactc ccctttattt tagtaggcgc cagtcttctt 60841 catctggccg cattgcatca atatggatca aataatccat tgggtgtaca ttctgagatg 60901 gataaaatag ctttttaccc ttatttttat gtcaaggatc tagttggttg ggtagctttt 60961 gctatctttt tttctatttg gattttttat gctcctaatg ttttgggaca tcccgacaat 61021 tatatacctg ctaatccgat gtccaccccg cctcatattg tgccggaatg gtatttccta 61081 ccgatccatg ccattcttcg tagtatacct gacaaagcgg gaggtgtagc cgcaatagca 61141 ccagttttta tatgtctctt ggctttacct ttttttaaaa gtatgtatgt gcgtagttca 61201 agttttcgac cgattcacca aggaatgttt tggttgcttt tggcggattg cttactacta 61261 ggttggatcg gatgtcaacc tgtggaggct ccatttgtta ctattggaca aatttctcct 61321 ttggttttct tcttgttctt tgccataacg cccattctgg gacgagttgg aagaggaatt 61381 cctaattctt acacggatga gactgatcac acctgatcag tgaaaaattc 
tgacaccaat 61441 catttacata ttacaccaag aattgacaag cagataagtt ttctagtttg ctatgttgat 61501 atagcttaga tagggaaaag ataactccac tatagagtag ggctgtactt caaaaatcaa 61561 aaagggtccc tctccccctt tttttattaa aaaatcaaaa aagaggcccc gccccccaag 61621 gcctaaggaa aaaacctctg ataaagaaag aaaaaatata gaactaacct ttacactata 61681 accattaagt caagttatta acaatcctat actaagagga gaattcggat cagctcgggc 61741 ggaagaggag aggcgaagaa gaagacaata agcactcaga ttggaccaac ctttcattta 61801 aggttcgtca ttatcttttc aatcctgact agaatcaggc ttctcttgaa aaaaggtaag 61861 gagttattcg atataatatg gtattaaccc tagcgcctaa gtcacccccg caaggtagga 61921 agaagataaa ggagaaagtg aacggaggaa aaaaaggcga taagcgaacc ggcgggcggt 61981 aaaaaccaca aagaatttat atagaaaaat aggtactacc aatatgaaat taagaaagac 62041 atatatacct gaaacaaaat tctgtaatga ctatgtgacc atgaagggag tagttgattc 62101 gcgttccaat tcattggcaa aatccggtat aaatataata atataacggg atcgtcgtct 62161 tgacaaagat gaatagaaag agtgcttttt ttttaatggt aacaattctt attgtttttt 62221 tttcttcgaa ggaagatctt tctttgacta aaaaggtttc catttttaat agattgtttg 62281 tggttgtacc tttactgcaa gaatataaat gactcgctat tcactcgagg tttctgggtc 62341 ataatgtagg agagatggcc gagtggttta aggcgtagca ttggaactgc tatgtaggct 62401 tttgtttacc gagggttcga atccctctct ttccgtacct ttacctaatt caccaacgtt 62461 accgaccgcg caataccaat cgagacctct aagacgaact aaagggatgt ctctaagcag 62521 ccaaggccaa gagcaagcag gagtggtcct atccgcccat tcatttttaa gtgacttata 62581 agacgtgaga gatactctaa agtcataacg gggaaggcca gagacttcgt tcaaatgggg 62641 ggaggttttc tcttcaataa aatgaaaggc aggtattttt atacgaaaat tgagaattca 62701 ataataaatg ttcacttgag attaggttcg cgaagaagaa gatcacaaat gagatcttga 62761 gcttgaagct tactctccag aatcgaaaga tctcttatcc ccgaggcgag gatggcttca 62821 taaacttttt ttgccgaagg aaagatagat aaggggtttc atccatattt ttctcccatt 62881 tacgtatcac gttcagagat tttaattagt tgtccgctct ttccgtacct ttacttaccg 62941 accaaatacc aagactactt acaacaggtg gtgaggatca gcacactgcg ggaatctaag 63001 gaattgattg ctgtgcatcg accaatctca gaaagggaac tataaaagaa ggtgaagatg 63061 caccgaggga tttatgaaga cgcatggtat 
gctgtggaga aaggggtgaa aggagtagaa 63121 gcggcagggc aagaaaagga tagtgataaa tggacagacg ctgagaagaa gaagtcgaaa 63181 gtataacagc aaagcactgt cagcaatcgc aagtgacaca ttcaagggtg aatcatccaa 63241 agatgcttgg gatatcctac agaagggttg acaagtgtaa agagatctcg tattgatatg 63301 ctgctggctt cacggaggaa agagagacca ttgaacaatt catctgtgct ttgccaatgc 63361 tgctcaaaat gaaagacaaa gacaaaagtc aaaaagcttc tgttctgtag cacactcccg 63421 gcgaagagcc cttcgcaagc catgaatgac acagataccc ttgcgtttga tgttgttgta 63481 ggtcacctac aagcgtatga aatggagact gagaagtccg agaaaaagca ttgctgcagc 63541 acacactaag acaacaagag agatattgaa gattcaatcg gcagcacact gcttgctaga 63601 aactttggca aagtagtaaa gaagaatgga caaagtagga aaaacccttt tacacagaac 63661 acaacaacaa atcgtgatcg tgtgtcaaag aagggagaac aacagtgtag ggatatggca 63721 aagctgagca ctgccttttt ttagagaaga gacttcaaat gtttttaatg caaaggttgc 63781 ttcgctaggg gcacacaaaa gctgaatgtg tgaatactcc cttattggtt tgcttggagc 63841 gagagcgaaa cagacgagga tgatgatgca gaggaagtct ctggattcgt agcatttgtg 63901 ataaagatga gacgcttcat tcggacaatt gctttgctag tgaaacaaat ttgctgcttt 63961 actcaagccc acaccctcaa ggggttactt cgtcacctcc gtcaaacccg cgtcaaagcc 64021 cctcgtcaaa gacctacgtc aaagtaggtt gcttcgccac ctccgtcaaa cctccgtcaa 64081 aggaggctgc gggaggctac ggggctgcgg gaggctgcgg gaggctacgg ggctgcggga 64141 ggctgcgggg gctacttcgc tgggagtgtg cgcttatgct gaagaaagaa agcaggtcca 64201 ggttcaaagc gtaaaacggt gaaaccaaag gcttcttcaa ctcgaagaaa aagcaggaac 64261 ggaaagaact ttacttcctg gtgaagcaat tttcaggata gaattctgcc agcttaggaa 64321 tctgaacata agcagtagag cgagggcata gattactctg gtcatgccac cccagcaatt 64381 ccaagaatta atgcaagcaa agcataggag tgtgatttgt cgtttacctt cttttgctct 64441 taggaataaa ggtaaaaaga aaagcagtgt gctggacagc acctttccta cgtttgaaaa 64501 cgaattgccg gtactccact tagacttctg cacctgacct agtggctgaa gaaggatctc 64561 cctcaagtga tttgctgctt cctccccgac gtaggtttga cggaggtatc agtctcgttg 64621 ctaaagcacc ttcggacctc ttttttcgtc tattctttgg ttaaagctaa aaatatttct 64681 tttcattttg gatcattcag tagctcagac tggaacttga gagctataac cttttttcct 64741 taatttcttc 
cgttgccgta tcttgcatat ataagatcga gatatctcta gtattcataa 64801 tccaggattt cgccatactc ttacacctta cccagatctt ataacttata ataagatcat 64861 ctaggctagg acgaggcaga ctaccatcaa taaaagagcg cttcagtcaa catatacaaa 64921 gtccagcata gttagatcca ttaagagttg agccagaccc ggatgatcac cgaagataat 64981 aagggctaga aacattatct ggtgaatctg agacaacgaa gctgcgatcg gcaacacaaa 65041 ccattggagt gtgctgggga ggatctgagc tgtagaaacc cttgatccac taggtaaggt 65101 tacttccaga gcggagtatg ctcaggtgaa actggtggtg atgccgatct cttctttctc 65161 gatttctagt tccgtttcgc cattgttgag gaaaaaaagt gatccgagac tcaacaaatc 65221 gaagagaaaa ggaaaacttt gttgattcag gaactgtgaa acggaaagtc tcaccagaaa 65281 gtcattgcac tctctttgaa gagggtcttc cctttgtaag caaaggtaat aaatgctcaa 65341 tgaggaaaga ttatgaattt attctttcat ttcggcttaa catctcttaa taattgtttc 65401 cctgttttta atacaaaatt tgaggtgaag ccttctctaa tgctacgtgt cttcatctta 65461 gcagatctct aaagcttgga gcgtatggta acggttagaa gtgggatttg agccgctgta 65521 ctgtggcaga gaatgacgtc gctgtacgtt catggcttca gctctggttt tgttttaagt 65581 gataaataag acagaggata ctttgggctt cggtttgtag acaaaactcg tcttgggctt 65641 atgtcttgta gtcataggag agctgagcct tgtgttggcc cagttttggt tcatttagtg 65701 aaacagtgtc gttttgagtt gtaaccggaa atatctcctt cttcctttga ctgtcgatct 65761 cggtgtgggt ggcttaacga gatgttaggg ctaatgttag tatgtagtga tgtgctgttg 65821 aaggtagata cactttatac cttcttcctg aagcagacga agcgagctgg aacgtacaag 65881 gaaagagaag atgcctccta atctacttat ggatgtcaga gttgatggtc gagcataggt 65941 ccattagcag atggaaagaa gattctgctt agcgacactg cagagtcaga tcggccccta 66001 ctaatgagaa ggccttgaag gaagtgaaga ttgaggcttt cagggtttag ttaagacttt 66061 gccggtgact cttttcctct cgcttcgtaa aaagaaaact cttccatact aagattaatg 66121 agattttccg gtatacttcc cagtcgacta tgtgaacagc acaaagcaaa gttttaccct 66181 ttagtgtcca atgatatcaa tggtatcttt ctgtttgtta ccaaacagga gggaagcttt 66241 atggtttcgt aggttttctt ctactttttc ttctacttta gggtttgata gaatcgtccg 66301 ctattacaga ctttgattat ctgtggggca gtgcgttaga cgaaggaagt agcaaggaag 66361 atgaagaccg gatattaaaa ggaatatttc acaaagccga agcattacaa gcatacataa 
66421 aagaggaagc ctaccgtaga cccccacaca agacaaagtt cactaataac atagtacgtg 66481 aacacaaaaa gacatagaaa tagaacaaaa ggagaacaaa gaaaagccat gcagtacttt 66541 gggtcgaccg gactccttat ttaaatatta gaagagagtg gttctagtct ccctggcgct 66601 ggagagcgtg ttcgatagta cacacaatta ggtagttccg ctcctgcagg agaccagaaa 66661 tcctttgttg caggccgcga atgcggtccc gtaagacttg catccttgca cccacaaatt 66721 ccgggtccag agcaggcggg tgctcccaga acaagcccag catttcctcc cagtggagct 66781 ttgcgttttc cgcggcttcg atttcttgtg gaatggtcgc cagcctgtta tagatctcca 66841 tatcatccat ctctcttaac tttcgacttc taagtgttct caaatataga cggaaagctg 66901 cttctattta taggcaggct agtcttctta tcttaactta atgagtcttc ttacgtaaca 66961 cgtttaatat ggaacaccct ttagtccgct gaagaattaa ttaagtacgc tgaagacttt 67021 aaaggcccca tccatcaaag agttcttttt ggcgggtatt gccacgcggc ttaataccga 67081 tcactccctc agaaagaaga ggcaataata gaacgcacag tagagagtac tcccccgcgt 67141 ttctcgcaaa gcccagagca ctccttactc gacagctcct gaccagagtc ctcgttggcc 67201 ctacttgcat tagctctgcc tgctcagagc tcgcttttct ttgtctattt ttgattggct 67261 gattcccagg tacataacag ccctcggcca atccttactc gacagctccg tcctttctta 67321 ggtgagctaa gaccgaagct agctctcttg aatttccccg gaaagacgga acctttccag 67381 ctcactctgt cagtccacta agacgcgtat agagtaaatg agcactagca ataacttttt 67441 cagtacaata agtccccttc tccacgacca cttttgtttt tgggctttct tggattcgtt 67501 tccccagtaa ttccattttg gactcggagg cttttccatt gggctcagta tcataattat 67561 cccaaaagcc ccctctcctc tagcttgctt ctttaaaaaa caaagcacac tgcttgttaa 67621 tgggcttata tggatcaaag cattagatag aatgctaagt ccaattcctc aatattcttc 67681 ttcttcagtg gcccaagcaa gatcatctat cagagaggat agggtccaag gtctgactct 67741 tagatatatc ccctactacg agttactccg tacctctgcc tattactgaa caattaccta 67801 atgagtgcgg gtagcataag acaaggtgag caaagagagg tctcaaaaag taaaaagacg 67861 ggttttactt tttgacacag ttttgccctc gtatctagaa aaggataatt tacggaaata 67921 gtaaaagata ggacctccta actttcctgc tacgagtgat taaagacggg gtttactcta 67981 aaaaaacgtc aaaatattgt acgggattcc attccccggt tggtttgttt aatcaaacaa 68041 gagaaagcac ctcttttcca aaggcagact cggctgtaaa 
cagaacagaa ttattactaa 68101 aaggatgtta atgacttatg gctcctgtct tacttttctg atagggaaac ggaaatccat 68161 aactagggat cctcctacgt ggacataaag ggattggaga cagagaaagt gattgtgcta 68221 atactataat ggttcgctag ttgctctccc ttgttctcta ggttcggtaa cctgggtaaa 68281 ggctgccttg cttcccgttg tggaatcttt tctaatcctt ttggccgctt ctttccactc 68341 ggaccatagg gaagtcagaa cggaaaccaa caaaggacaa cacaaacagt ctatctcggc 68401 aggagcagag cggctacgca aacccgtctg gctattcata ttccgacctg aggagtaaga 68461 ctctcgcagc tactcagggt attgagctac tactagggag gctccggagc tttaaccaaa 68521 ggagagaaaa gtacctacac cggtttatac aacaccggtt cacagactca actatacaat 68581 tcatcacaat cacagatgca actatacaat tcatcacatt acatcacaaa attccccggg 68641 atcaaataga gatcaaaccc atcttactta ccaagttgtg gaaaggagtt ggatgtaatg 68701 cctttgtgag aacatctgca atttgtagat cagttcggag agaagcttga atagacctgc 68761 aaccaacttc taacacacag tcgatctcta tatgtttagt acgctgaaaa caggatagca 68821 agatgtctgg cagcagtgtt atcacaaaag ggaggagcag aagaactcat ttgaatgcca 68881 agttcctgca agaaatttgt gagccaaacc atttcactta cagcagatgc aaggcttcga 68941 tattctgcct ctgcagaaga cctggagaca gtatgctctt tcttagagcg ccaagaaacc 69001 agagaggaac caatgaacag aagccagtta cagagcgccg tgtgtcggga caagaagccc 69061 agtcagaatc agcaaaagcc ttgagctgaa gatcagatgt agcagaataa aaaagaccct 69121 gcccaactgt ccctttcaca tagtgtaaaa ccttgtaaac tgcctgcatc tgggctgtac 69181 gagatgctga tgagaactgg cttaacctgt tgacagcaaa agtgagatct ggacgagtaa 69241 tagtcagata catcaacctg ccaataagcc tacgatacaa ctctgcatca accagatcac 69301 caccagaatc cttagacaga gattaggttc catagggata gaagaggtct cagcgagcca 69361 acaaacctgt ctagatccag agcatatttg cgttgacaaa ctgaaatacc ctctgtagac 69421 cgagcaattt caagcccaag gaagtaagag gaccgaggta tgaagttgca actgatcttc 69481 gagagtgtat cagctggaaa aatagcttct ctgcgccaag gaaatccgga tctctctaca 69541 tactatacga aaagcctctt tcttcacaaa gctggacctt cgttcgggct attggaaagt 69601 cagaattaca gaaggaaacg agcctaaaac cacttacttc tgttaccaga tatggcttta 69661 cgagttcctt gtcatgctga taagaggcca gtcgatgagg ctcttgaaca agttcccggc 69721 ggaagttagc cctataagct 
aaccgcctgc catgtctacc ccgaaggtat ctttggtgct 69781 gttgttccta aaagtctctt ctttaacttt gtttccaaca acttcccctg ctttgttgtt 69841 tcagaattga gaggcacgaa atagccccca aaaaaaggcg aaatccacta tttcatcaat 69901 caaattttcg ttcgagggtt gaaagcgcca gaacttttga agagttttcg ccccaggcct 69961 gtcatccggg ggatctcttt atgctcgtcc actcggggat gacccctact gatctctctc 70021 atcggatcga atcgaatcca ctcctgaact ccagaagcaa gaacggacgg atgcccgaag 70081 agttctaccg ttgacgacag cagcgggaat ggatttcaaa tagcagctgc taacgaccac 70141 cgagaagggc gagtcaagac ttccagtttt atgaaataga agatccagat cttggagcac 70201 ctaatcaacc agttgaggaa gaggaacgaa ggtactcagc ctcctggggc aagtgggagc 70261 gacacactga atgaaccaac tccaatggag caggatcaag aagaagtgaa gctacgtaaa 70321 agtgagcgtg gtagaatctc tcgtcgtcga tttgagattg agggagatcc ggggaatctt 70381 attctgtcac ccaaaacgac cactccttct gtcggaaaaa ggacttgccc tatattgaaa 70441 tgaaatcgaa acgaatggaa ccgcgacaga gcactcccta tcatcaggta gtgcgcgcca 70501 ttcagaacta tgatatcgtc ttctagtcta ttcggcaggt ccaagttgtt ttggtcttat 70561 aagctcaggt tcttgagttt tgggaatttc cttccattgg ctggttaaaa ttcaatacgg 70621 atggtgcttc caagaggaac cttttcttgc atggtgctgc aacgggacat tctggtaagt 70681 cgggtgctgg tggccttctc cgagactgtt caggaacatg gatctatggg tatacctgca 70741 aaatagcttt ctctacgagc ctacaagctt cactttggtt tagcttccaa ctaaggctaa 70801 gggagttgtt tccacgggat tgagtgaacg agctcatagc cctccacccg taccacaaaa 70861 gcgctaattt atataaaaaa gaccgggaaa acgtgaatct atcccgaaca tttcctaaaa 70921 tactggataa actgactcag atcggtctcc ctcaatcgag aggctcgctt gcttcacttc 70981 aacttcatag aagagggaag gagctatgca taggggtgat tccggtggag atggaatgaa 71041 tgcattgaca gttcggtctg atccgtctta ttcgtatatc ttattatggg atatttgcct 71101 taacggaatc attagaaggc tttgggcggg gaaacccaac tgcaagacca gaggctactg 71161 aaaggaagat actatctctt cttgagcttc cgattgacat accgagattg attcaatgaa 71221 agtccctagc gcaggtgaga cctaattcta tgtttacttt tcattcaaat cagcttcctg 71281 atgactaaag aagactttta tcctggcggt tgttggtcca acgactctac ttctttcttt 71341 tgaaatatcg taagagaaga aggttgacaa gaagaataat ttgtctcctg tgattgtagt 71401 
tcaatcggtc gtttgtctct agaagaaaga agcctacttg tagaagagaa gaagcggggt 71461 agaggaattg gtcaactcat caggctcatg acctgaagat tacaggttcg aatcctgtcc 71521 ccgcataaag aacaaacaaa caatagaaaa aataaaaaat gggatttgaa gagtcaaagt 71581 caaataaatg atcattcggt ggggactgct ccgccgacat tggaacgagg ccaaccgctg 71641 cttatcacaa cctctggtgc cggtggaggt ttgacctcgg cgccgatgag gattaactat 71701 gtgaggatta aggtgccgga cgtcatccgc aacgtaaaga tcgtggtgca cagtgtgtac 71761 ttgccatttc cacatattaa tcctgtggct gcggcttacg aagtatccta ggaggctcag 71821 aagtctcagg aggagatcag acgatggaag gaaggatttc gatcttcgat cttcgttacc 71881 attccaaagt taattattta gctttcgatt aagtgagtgt ttgtttgctc agactcagac 71941 taaaatgcga ctctatgcct tcccggtaga aaacgacctc tgtggcggga tttaccaggt 72001 caaattcaac gggaccctag cctgcagtcg tcaaccggtc cgcttccttg aatgagagta 72061 ggtcttcccg atccacgaat actgtctgtt gtcctttact tgtttagtgg catctggaat 72121 tgtctgtttc ggtattcact cttgtaaagg gccgcgaact aagctaagcg cttggcttgt 72181 cttcttatag ggtcccaacc gaccagtaca ttcaatatcg aaatccgtta ctgttacaga 72241 agagaattaa gataacgagg cttggagtat ctaaaactaa gtccagcgaa ctggaagtct 72301 cttcagactc cctcattgct tggctatcga agtggacttg tacctttttc gcgtgcgtcg 72361 acacgacaat gcttaccttt ctagctttta cccttcttct ccggaaggaa tagaaagaag 72421 acaactagaa taaggggaac tactctatat attacagcag gagaaattcc aacatgaaag 72481 aaagcaggcg gtgcacttaa atctaaatag ggtgagaaag cttggcttgg gattgggatt 72541 gttgactcat catgataggg catgaattgc gtatagataa ggtccagttc attaagttga 72601 gacagctgta ggctcaaacc aaactgacgt gtaaagagcc ctagttgaaa taaaccctgg 72661 tgaatctttt ggtgtcttag ggtaggagta agtggaggtt aggtaaccca gccagaggat 72721 tccgttagcg ggttgattcc gagctggtaa tcgaacaggt caccaatcgt cctcgaaagg 72781 gcggttcaag cgcatgagtt cagaggtcta tagatcgtga tctgaatata aagtggattc 72841 caggggcagc tgcttctata aacacaagaa aaggtcttgt ggacttacgg attgataact 72901 tgccagtaca cgctgagcct cgtcgatgag atgaggtacc tcatttagga tgactattgg 72961 aaacgcgcgg tgccgcgtta ggttagggag cggggctcat ttcatcgatc atttctgcac 73021 gagcaaagct caatagtaca tatagattca gatagcataa taaactaagt 
gaacatggct 73081 aaaattaaaa ttgataaaca agagcataag attgcctacg tcaccataaa actcgacaca 73141 gaaattcata agtaaaagcg gcgtaagtta ctaacatgcc acatgcgagg gacagaactg 73201 ccattgacag cctccagggc aggtagctcc cagctcggct tgcttctact accttatagg 73261 gtccccccca gttaggatct aacttgcgct ttcacttaag taaggaaagt cctattctaa 73321 aagaaaagta gatttctccc tagatgcttg gaaattttaa ttactccacc acttcgcccg 73381 gtgaacttct aaccatcctt ttctagagaa gaatagctat ctttcgtcag aattgatttc 73441 ttttataccc gagtgaattg aagcattcct ttatgtggga acgtccatct ctagcgtgga 73501 taagcctgaa ggtcctacta agcggtctaa ctcgactccc ctttctaggg agatgtggac 73561 gaaggctact tgagctcatt ctcacatttc ggggtaagag gctaggctag gaatagggta 73621 cagaggcgta gggggaatga aagaatttgc ttagtcataa tagtgctttc aaagtgaaag 73681 gatataaagt aattaattcc tctgcttgca agctacctat gattttcttc gtcaccaacc 73741 atttccgctg actcaagcat ggcatctttc ctcaagcccc tggcatttcc aggatcgata 73801 ggtgcatccc ctcctcaacc ctattttctt atacagccta tcagttattg aataagaaat 73861 aggcacctcg ctaaagcctg actatgccac cggtcgttct agcatagtca ctgagaccca 73921 acttgcgtac tcggattact tcaggcggcc cattggaaaa tatacttcta ccaggcctac 73981 tcaatcctat cttaggaagc ccgcatctct gtgtcaaacc tttttgaaaa ggctctactt 74041 gaattgaata atatttcttt attccgactt cgccaatgaa agagggtagt tgaatagacc 74101 gatatatcta gaaagtgtgg agtgagtgaa tggtggaaaa aaagtaactg gcaccgcccc 74161 ttgctggata gaatatcgag atgcacatgg gattggggtg ctctctaagt gaggggccgt 74221 ggaactatag cttagataga aaggtttttt aggccggaca aaggtctagg ttaagtccgg 74281 agagaaggct tgatgaacac cacctcctgg ttagtgattg ggcacatgct tcctttacgg 74341 cgtagaatcg gcttcttttt ttgacgtatc agaaagggcc tctcttctac cgcccgaagt 74401 ttctatttat gcaagtatct tcacttcaat accccagcct atacatttaa tgataaccga 74461 acccaagcaa ggagcttaaa tcttgtctat ttccttgaat ggtcgggttg gaaattcttt 74521 tatgtgccaa cgcaggccta tcacctaatc tccactgcga aatcctttgt tggaaggctt 74581 gctgatttac acctctctcc gccgcttctt aacagttgac tctgcgctgc gagattagca 74641 ctttcttagt tactaacaga tatatctttt ccgcgcgagc tagttcaaca gttcattctt 74701 gtaagccccc ttgtcagcca cactaagact 
tcgggtcatc attacctctg ttgaaggaat 74761 aagcccggct atttcagctc tgggaggaag gactactttt ttcaccgcga attgcctttg 74821 atgcaaaacc atataccgat cctgttaagt cttgaattga cttctgatgt cctggctgcc 74881 ccttaagcct ggaagaagag tgactcggga agaaatctcc tgctggagag gtatatccgg 74941 cccttctacg aagcctcaac tgcaactgag actgataata ggagcagtat ttttaaattg 75001 aattgctggt gccggttggt taaaggcatg gggactgctt tcaattggac agctaactaa 75061 caatcgattc cccataaaca aatagttatg aaacattgga ttggatgcct tttaaacaag 75121 gaaagtaaag acagctaagg taagtattcc ccacattgaa agatatctat ttctttcttc 75181 tgttcccttt gtctaataag atgggactgc aggaaagtaa gtggaaagta agaacacagt 75241 tgttagctta tagcggggaa gaaagaacta gctcgactga aaggagaggt ccaaggtaat 75301 ttattactct tataaaagag ggaactcgag catttattgc gagagggaaa gcttagagtt 75361 ccactccagg agaagtagct tatggtaggg attataggta ggctggctaa ggagttgctc 75421 ctggaaaggt agcagcttgg tcccttgatt gcggggtttc aaacctcata ccaagagaaa 75481 gattccaaac cagcatacgc aaagcaaaga gcggaaaggt gctttgtacc tcatttcgct 75541 aagcagcgag aattgtactg aagctctctt tcaaacgccc gccgatcgta ctacttcatt 75601 cttagtgtgc tgaccagata cccccaccct gagctaagag ccatagcaag agatatagcg 75661 ttggctgtag gcttagtgag ctcataaact ggcgaatcaa ttcatttgaa agagaaagtc 75721 gggtctagct gattcccgag ttgaccaatg atagataggt aggaatggca ggacctaaac 75781 gagctaggct tcgttagcca gttctccttt tagaagctat tcttaatgcc gctcgggctc 75841 ccttttttat gggattccat cgctcttgga acagatgata taagatctta ctaaaataaa 75901 aaaagtatga aaattaaata tacagatata ctctataccg gtatacagca ataacaatgt 75961 cagccaatta gcaaagatca agcgaactca ctcaagcgta ctacagcttt tccgtctttc 76021 ttccctaagc gcaatagccc cttgaataga taggatagaa ttagtgcgct tgaaagagat 76081 ttggcttagt gaaaattcct agtcttactc atctcagatg cgggattacc aatgtcacga 76141 ttggaattag gaaaggaagc tgtggcactt cttactttat atgaggaagg ggttaaatca 76201 ctagcaatag tagacacgct acgatctgct agacaggagc agtacaggaa gcatcgaatg 76261 ccatggttct tgttacagac taggctgcaa gtgagagtga tagaatcagc tcaactggga 76321 atgttgccga tgccgatacc ggagctgcta aaggaagcag tagcagtaca accattaaat 76381 ttgaatggaa 
ttgcacattc atcttcctct atcaattcat ttcttttgtt tgaataagct 76441 attaaggtgc ctaggacact agtagacacc cagaaggagt tgtaagggag aacggaaatt 76501 taagatcgga cgattttcaa cagacaagat cagatcgtag aatagagaag gatttgctgc 76561 cgctactgaa tcagagtcca cggtgcaaca atttctcatc tgctcacgaa ttggattcga 76621 accaatcaag ctacttgccc tttagtagat cgtgagtggg tcagtcgtcc tcctcattat 76681 agtcctccta aaatcaatag catttcgtcg gaatacatcc tgtcttttca ccttagtagt 76741 cctatgcata gtcagtacta tagccccaat catggctact aataaaataa gactaggaac 76801 caaaaaccag acagaatagt aggtataaag taaattgccc aatgtttcca aattagtcca 76861 acttcgtacc tttccggcat aaaccgtata tctaagagag gtcgtatttc tttgggttgg 76921 tagtaatgga atgctttcat tatctaaaat gaaaaacatt tcccaccaaa agatcagtcc 76981 aataatccca ctcactggta aatagcgcaa tacttcttcg tgaatctccg ctatttgaat 77041 atggaacatc ataacaacga aaaggaatga aacggctata gctcctatat gaactactgg 77101 gaagatcata gcgaagaagt cgagacctaa caaaagaagt aaacctgaag tgtcgcgaaa 77161 gactgggatg ggaaacaaaa cggaatgtac cggattttta gcacgtgcaa ccatcaaacc 77221 agagaccaaa gcagggctcg acaaaacaga aagtatcatg gtaccgtcgt ccttccctga 77281 aatggaactt tcatgctagt tcttgcttca gcatgaaagt cgatctatta cgaccagcgc 77341 ggttgttcct atttcatttt cttcttccaa gcccttcttt cttagaaagc actcactgag 77401 ttacttacgg aatctcaatg catccattta atgcattctt ttcgatcttg tacccacgga 77461 gcggtagact gaacaccaca aaaatatcga ttcaaaaaaa ggtacaggca actaaacctg 77521 tgaactcaga tagccttgtg gtatggtagc gagatccaat cttgagtgaa aaagataccg 77581 cggcttcgat ctttttcaac ataatcatcg cgcagtaaat caacaaagcc caaagttgac 77641 tctctgtctc cttcaagttt acctactact gtacccgcgt gaatatgatc tccaccagat 77701 agacgtaaag ctttagctag tacacggaag tgcaaaccat gattcttctg tctatcaata 77761 acagcatgca ttgcacggtg gatgtgaaga agtaggccat tatctcgaat gagccaaact 77821 agtatttgcg gtgaatcccc ctgtttaagt agtcatgcat tacgatagga actcccaatt 77881 ctctggcaaa tacagctctt ttgatcattt cttcgcatgt acccgcagta gcattcaaat 77941 aatgcccttt gattccacct gtttcagcct gtgctttata aacagcttcg gcacaaaata 78001 agaaacggtc tctccaacgc ataaatggtt gggagttcac attctcatca tctttggtaa 
78061 aataaagtcc accacgtaga cattcataaa ctgctctacc atagttcttc gcgccaaacc 78121 aacaatctgc acgcttagcc gcattgttca tcttgcctta accctacccg aaggaaggct 78181 atgaattgag ctaacgccct ctgtaacagc ttaaatgata taggaggggg ttgatcctcg 78241 atttgattct cgtgttgcga gtgtcgtgtc tcgcgtttcg aattcatttc gattgggaaa 78301 gtgagccgat ccagacctaa ttgatctggt ttctgagagc actgacaagg ctaaactcgc 78361 cggttgaaga gagcactgaa tcattcgaaa ggggaagcct cgactggaat ccctgctata 78421 aagggcagcg gaggaatacc tacagattgc tcgccttaac cttaagattg ctagaaaata 78481 tctaatgcag gagaataact aacaatttat gtggaatact atttcatttc gggggtaacg 78541 actttaaaac ttgcctatag gatcgcttga cctagaacct gtttacactg gtcctgctgt 78601 gaggaacgga ctttatcgag caaaaaaagc gattctatgc gcagacgtga aagctctatt 78661 gataggaaac attctagcca attttagagg aacgattccc ttgtctgaga accgccattc 78721 ccgcactatt cctccccaca ttgagaatct cgataactcc aaattaagaa gtgagcgaat 78781 cccaaagctc tctctttccc ccttttcacc tttaattggt gaaagtctaa aatctgatgt 78841 gaatagaaga aaaagaaaga agaaagatag gcggacgact ttactatagt ataggtcgaa 78901 aaaaccgtga gcagtaagca gcggatctcc ccttcttttg ggaaagcagg tggccgcgtg 78961 tgaattcttt caaggcaagg ggcggcctga tttcgacatt gttgtttcac tctgatccgg 79021 tcgaggtggt tttcgtttac cagcgcgtag gtggtctaag ttcctatgac cgagtctttc 79081 tggcccctgt gaacatcttt tcttaatgta ttaaagaggg cctctctaac cggtgaaaag 79141 tagtgggtgt ccactaacgg ccgttcccgg ctcggctccg ataacgaaat tccgggagga 79201 gtcggtagtt gggcactgga tcccttcgga cctggagaac gtgtgacgct gggtaggggt 79261 ttggtgaacc aactggtcgc tcctctagtt gaagtatcgg gccccttttc gttgcctagg 79321 ttacaccttc ggaatacccc agaagggaat ttatctctac ttccctaaat cttcacttga 79381 tgccacgccg actctccttt cgtcataagg cgtggaatta gaccaagggg aatctccata 79441 gaaacaggtc cctctgattt cgcgcgtagg agatcgagac acagcccaag ggctatggag 79501 aaagatgtag atcgatcgcc ctagatagac aactacatag agggtcttta caagtctata 79561 tattctttga tttcgttccc cccccccgta agatcaatat cacaactaat gaggtcttaa 79621 agtttcacat ctaagtaata cccgatccga tagtttacaa tataaataga tatttatcta 79681 tttaacaaca acattctaaa aaaagatatt aattgatatc 
ggtagttgtc cggtcgtacc 79741 caaacaataa tattccagag gaaaatgcac ctaagatcaa atatttcgag ccggcttccg 79801 tggaaaattc agactttctt tttgatgctg cgattacata aaaacataaa ctttgaggct 79861 caatagctaa atacatggca attaaatcat gagccgagat cataaagagc ataccgcgag 79921 taggaagtgg aattaataca atgaattcaa aagcatcaaa cctctcttgg tcggaagaat 79981 cgaaacacat cgaaatggta ccagccgtac ttaataatag aaagatttgg cagaaatatg 80041 taaaattgtc cctcctaaaa agattattcc agaataaatg ggcaatagtt aggagaggtg 80101 cgccagcggc gagcagaagc aaggttatta gatacctcag gaaagcccgg ccagaaccac 80161 acgtgcaagt ttccctgcat gtggctcgtc cgtgataact tcttcggatt tgcgtgaact 80221 agcagaccga tcactaagtg gaaaaaaccc tccagcataa gcgaaccttt tctccactgg 80281 ttaggaatgg gctaagagag gaagccaaac aaaccgacct attaagcaca gctaagctaa 80341 tatgcgccgg ggaaagccag gtgccggagg taagctttat tcggcccgga gcattgattc 80401 cccataacga ataggctagc tctatctttc aattgcctat cctgtgctcg agaaggtctc 80461 ccagttcctc ggcagcacct cggggtgcat ttagttgtct tacatgagac tcgggatatt 80521 cctcacccca tggcaccttt tgctcatcca ttccgcccgc ccgtacagac gcggcgccct 80581 ttttcattat catctaccgc cctttctttc ctcccggcta ttcacgcatg aagatcgtcc 80641 tatgtatgct ttacttcggt tgccggtgct ataggtagga gtacattatg gcaggatcag 80701 tcacccgggc aaaccaaccc ccctccaaga ggaattgtgc tatgcccccc cgatccctat 80761 agagcgaaag agctgcctgg tgatccctag gttgcagcac gtccggtcgt taactcctcg 80821 cgccgttgcc gccgtttcta ggaccgaccg acgcctcacc tgcactggta cctacgtagt 80881 gtcggtcgca cattcaacat agcgttcgga ctcatgtgcc ttccagaacc agggggtctt 80941 cgagcctgct gcgtacgatt agccagcctt gcggcggcaa aaggattaga cgtccccccc 81001 cccatgctac ggttcctgcg gtccgaatcc ggcgccgcat taggttaggg tagggggctg 81061 ggcgataata gttcctctgc atccccaagc gcgctgccct cctaggcgcg caacactaag 81121 taatccaagc caacccacat tactggctaa cggcggataa tcatatttct tagaggtact 81181 aaatacaact ccatgaatga gcaaaatgga ggttgcatta atgataaaga tctctgggga 81241 aaccgctaaa aaaagattga acatgtgtgg gaggatccga acgaattctg ctttcatttc 81301 ccccgtatgg agcactgagt cacttacgtt acggtcttca gcgggcaggc tgcactctag 81361 gcgacggcta ggaaggatcg 
ctagaccagc agccaaacca acaatgaata tataatataa 81421 tgatagtgat tcggctcaat aaaaaaatgt agaaaagaaa gggcggaata gcataattgt 81481 ccgatttatt cctcgatctt ctttgaagac ttactcctcc tttccgaaaa gatcgtcctt 81541 ggtccttttg acataagatt ctcggccgct caagagactc tctctcgtat atatatttat 81601 acgcaatatc ttgactccta ctacccctcc tttccgaaaa attgattgag tctttgtcta 81661 agcgagtttt ctaggccggg tgacctggcc acttctattg cttatctttt cattacatct 81721 tttgttctct agcggacttt ccgacttcaa ctggaccacc cacccggact aggaacccga 81781 atttatccgc ccatccccgt cgaagaaggg gcgacggcct tgattatgcc gggcatgctc 81841 cccgaacccc acccaattaa taagactaag taaaaaaaaa tggaaatagt gaatcaagat 81901 ctagatggat ccctatcttt ctctctatcc acgggatacc tatctagatg aaaaaatcca 81961 tctatgaatc gatctagact atcctctatc cggatagaga tgtccctatg agtgactacg 82021 ccgatcttta tatgttttgg ccacgtccgt ttctgctccg aagaggaagg tttggatctt 82081 ttaatcttat gtcgtgaggg atatgaccta tgttaggtcc ataagatacc acagcttttg 82141 gtgttctatg attaacctcc gaataatgag taggtagttc gatccttttg attcttctct 82201 tcatagtcaa cttatggggg gatgcgaagc aataggtcga attactatat aaagataaag 82261 aagagttgta aactatagga ctgattgttc gagtaggaag atttaggttt ttatccttcc 82321 ttctcttcga taagaatagg gattttaaca aattcctctt cattctcttg tgctcagcga 82381 aggaacttcc taagcatact ttttcggatc caaacttctt tgttctttct aagtcttctt 82441 cttgcataga agaacgcaac tgttgcaaaa accgggattt tagtagtagg cggcatcccc 82501 tcttagtttt gagtaggtgg aaccattcag ttttgcttct tctccacatt cttagcggtt 82561 gatgcagtaa tttgcctatg attttttcaa ctgaaatttc tttatagaaa gatctcctta 82621 tttcttcacc gcgtattatc gcgttatttt cttgaaaaga tattatatca ccgtgggaaa 82681 gtttaaaatg agtaatgctt accattcctt tattcacaca aacccttcga tgacttatcg 82741 gctgccttgc ttgaggaata gtttcaagaa aatggagacg aagcggaata acgtcaaatc 82801 ttgtttctgg attgagtgga aaagggatat atgaagttcg ttttgttcct ctgtgcatct 82861 ctgtgatggg taaatcccca tgaaaaaggg gcaactttcg tgtagtttgt aattgtatat 82921 aactggtaag atattttttc ggataaatct ttctcttctt aatagatctc ttcctgttcc 82981 tcaatcttcg gagaatgcgg cgttgtatta ttgtcagttc tctgttccga acatttcctg 83041 
aaagtagacg acaagtttga aatcttaatg gggacaaatc tatatctcgt tgaatcagtt 83101 ttttaagcag ccacattgcg tccctgggac tcgaacccaa ttctgcaaca acccctgttc 83161 ctgtcctttc tttgttttgt tgatctattc ctcttcacgt tgtcttttta aaaaaagact 83221 gaaagtatgg gctattgaca ccatggtcag aaagatattt taggactttt ctaagtcccg 83281 cttctacgat aaagtctttc cggttttaat aaagcgagcg ccactctgaa tccacgatcc 83341 gtaaggatct agctccgcca cattttgagt agcacttctt tattttttga aaaggccaca 83401 aaactctttt caaaaaataa atctacagtt tttggtagtt catttttttg tgaaagaaag 83461 aggcggaaag ctatgtcctg tacagtttct cctcccacct gagggcgaag tcatgactcg 83521 acctttagtt ttgtgactcg agtacttgtt gcgagaggta ctttagtaac tcgactaaaa 83581 ggagaggttg tttacatact tgagctttct gcgagaggga tttattactc gactaaaagg 83641 agagagagca aggtgaactc ttcccattgc taacattcca gggattccag aaccaactaa 83701 tctaatttcc taacctggga ttccttaacc aactaccaat ttaaagagta aaacacttcc 83761 ttgggataaa ctgattcaaa cagataggca gattgaaatg gaactttcta caaacaaaaa 83821 ctccttacct tacttggctt acctctccca ttcctctaat ccctaaccaa aacaaaccta 83881 atctcatcac ctcattgcta atattgctgg gattccctcc taaccaacta atctccttac 83941 ctcctattcc tgggattact gggattactg gttctcccat tcctaacatt cctaggaatt 84001 cgcaagacac cataggcaac atgggaatcc ttgatctact tgtttctcag attccctttg 84061 aggttaggag agtgagtcga ttcgacgagg ctagcttgta ggttagcctt tagtttccgt 84121 caaagaattg gattttcata cttccttccc ctttggtcga ttcaagtgga ctttatccta 84181 gatttgtcga tttatccctg gttgaggacc tttggatgcc agttttctag atgccagttt 84241 tctataagaa gggtcagcct taaaggaaaa aaaccaatcc caaggtaagg tctaaactga 84301 tctaatcggt aaagcccgta aaaagagctg agtttaacgg cgagaagctc ctttcaaagg 84361 aaggagtgac agaaaaggtc gggacagtgg ctgctaccgc attgacttcg taagatttgg 84421 atgtgttaag gctaacctcc caaaactcac taaatttacg ttgcttcaca aggtaattcc 84481 tatttgctta cttgttagag taaggaaagg agaagagctt caaggtagaa cctatgaacg 84541 tagatcgcat ataatgtgtc gatctcatca ccttaccaac cagggcctat gtatacgaaa 84601 ctggtgcacc atctgcaatc gctatatata cctgacctat cttctcgtgt atcacccttt 84661 ctttcttcct ttcctaacta tcactgaccg cgggggaaat gctgacagaa 
tgccaatact 84721 tgttcgatcg aaattctaac agttgttccg ggcgtttaat gtttgttctg gttcaagcta 84781 gaaaacggtc aagtgtgttt ctaatcaggg ttatctaaat tggaataagt actttttgat 84841 aactttgtct catgcggagc aaagcccttc ttagtcaagc ttagtccctt tggagtatat 84901 ttaaagggaa gtgcggatct ggagtgtttt gtttagacga gctatgcagc aagctgagtt 84961 ctatcgaaac aacgttttcc gggcatacga gaaaagagtc cagcccagca tatttatttg 85021 cgactttctt gatcccggaa atcttgtact cagagtcacg gataaagata aagtggacta 85081 ccctgtatga cctacagttc tttttagtgg ctttttcttg tcctggacca ttcagccctt 85141 tctcaaccaa gtataggtag aagccctctt ttccatacag ataaaggttc gacttccaag 85201 agaaaagata aggaagtcgg tattgatcgt ggaaatatct tagaaagtct tcaaaatcgc 85261 ttctcttcct tcattctcag taggctaccg tcctcggaac caatgcctaa gatttgcctt 85321 gcggtctgcc cttagtctaa agacttctat ccactacgat ggaatctgtt tagtggagag 85381 aaggcctagt taatctagca ctatggtgga atctaagtgt aagatattgt tccgcggttg 85441 gaaaactagc tcctatgcgc ctgtaggtag gacgttagaa aaagaatccg aattaggctt 85501 gcagacctac tattaggctt tcagtgtagt aacctagatg ctgaatatag actcccatct 85561 atagtaaaag agaagaaagt caggcggaca ctggcagctg caagtcagtc ggttggcttc 85621 cgtcacttct ataaaacgct gcccaacaga ctgaaatgac tatcttcggg atatgatgcc 85681 agagttggtg cttttgcggc atctagttca gtatcagagg gctgcgctgc ccttggctaa 85741 gaaggtttac ttctctgtca cttccgtact tctgtaactt ctctttagct tattaatgga 85801 attggttaga aagtctttcc ggctagaaga gagggattag tacacctcca acactgactt 85861 cttagtcgag tagttctctt tctcctgggc tgagacgtga cctcttctcc tcgttttcat 85921 tgggagggaa gatatagata caaaggtaag cacgaacgga agtattcgaa agagctattt 85981 agctgaaaac tctatagcta aattactcta tcatagaagc cattactgac tgaatctggg 86041 agggcaaaag gcttagtaat agtaagcttc tcaaccctgg taagggttta cttcgtccca 86101 ggccaggtcg ttgctcgttg cacggtaggc ggctttgagg aaaggtatga agtcactcgc 86161 cctttagctt tagaaagcag gagaggtgct ttagcaactc gactgaaagg agaggttgtg 86221 aacacaaact cgactgaaag gagaggttgt gaacacaaac tcgactgaaa ggagaggttg 86281 tgaacacaaa ctcgactgaa aggagaggtc caaggtaatt tattactctt ataaaagagg 86341 gaactcgact gaaaggagag ggagagggat 
ttgacttttt ctgtcaggcc aatttctttc 86401 tagtggtaag tgctttcagc cagttctagt ccttttctta tacgcttatt ggcgttagca 86461 cattcctctt tcttattgga tggagttgtt ttgccacctg gagttttaat ggaagtttga 86521 gtgcgtccta aaagccaatg cttctgcctt ttggaagtgt ctgcctatat ctgtcctccc 86581 catttaatca gtctctgtcc ttaccttcga cggtaacccc tttatagtac agcagtcccg 86641 tttccggcag tgccttcccc ccccctagtc tagaccgagt atcaatagtt gtagttcctg 86701 tccgttatgg acccttcgta tctttcaaat ccaatcaatg tgtggtgtgt atccaagggc 86761 aaatgcaatc tctgaatctg ttacagccat tgtagcacta ggtgcctacc cataagcagt 86821 agttgccctt gaatgtgcat caaatcctaa gacaagccta tctaatctca cttcttagtc 86881 taaagcactc aaatcccttc taaggagcca gccctttttg aatccctctc ttttcttcat 86941 gctgtgaaga aattttccct atccccctga ctggcgaggc ttcttctaaa cgagtggagt 87001 tcagttagac tgccccttct ttctcgtgct ctccgtccaa gcgagctaac tcctgatctc 87061 ctgttgtaaa gatctccaga ccaaggctat tcttaccagt tgacatccat cccgttgcat 87121 tcccagcggt tttagttcta ttggatgggg ttctatacga gcgttccata ccgtctcttg 87181 ggatcaaact gaattaagtt ggactccctt caagagatag atttctttcc acgataatat 87241 ctatattttc ttaagaaaga agactagaat atcattggga gtgcacttaa gccaattgga 87301 gtactaggcc ttctgttcca gtgacagtta tagaaaaaag cgaagccctc cactcataag 87361 taataagtaa tttcatccct taccagtccg tcctaccagc tcttctgtct tcctttttaa 87421 aaggagctaa cgatacgatg ttcaagcttt gccagaagcc ttttgcagag agaaagttcg 87481 aagatcagga tcaggattgg attgaaaata aatctcctca gcattcaatt caagtgggtc 87541 tatcgataca agtggtaaga ccgataattc agcatccaag acctggccga tgtctacttc 87601 aaattcaagt gccacttatc ttaacacaac tcggggctaa tcgactccca gcctatcccg 87661 acctggtcct ttcgaaccag tctttataaa cctggtgaaa gcaatcaagc aggacttacc 87721 tgctgggagc cccttccttc tattccgaaa cctagaagtt ttttttccag ctcttgagct 87781 gattgaatga ttttgttcac tgctaagcta atccttctgt cctactttac tctattatga 87841 gacctcaccc taggctaggg agctggttta agccatgccc ttatttgact cccaagaccg 87901 gatacatagc tcttctgagc gggtcaaatt caaatacaat gagtgcttag ggaaagtaag 87961 aaagaaagac ataaagcgga atccggtaat cctactgcta ctaagactac tccgtcttag 88021 ggctaggagt 
tcaagcgcgg cgtaacgggg gttgtcagcg taacagctgg tgctctaatc 88081 tgagctgaga aataggaact gcacttcgac ttgatttatt tatttactat attcccccct 88141 ctctaaccca agtcctagaa gtgaatgagg aatgaaagaa cttctgcttc cgcaatgaga 88201 cgctaaagtc cttaaatcaa ggaaatcaca atgcttctct tattcagtag ttcaatctta 88261 ctttcttacc ccgccgggga agcgttgaat gaatgtgtaa tgatgtccat caggggagat 88321 cttgaccgat ggccgcgcga tgtccgagtt ccttcgaatg ctttttccaa gccccttctt 88381 gcgctttacg aaattcgttg ttaaggccag tccttctctc accttcacag aagaaaagag 88441 attcttctga atggttgata tattcattcc agaggagatg ccggtgaaga ctaccagcca 88501 atgatggacc tgggcttact accgccaagt agcctattaa gtctcaaatt gagtactaaa 88561 gtcaaactac ttgtcctccc caaaagagca gctgaaagaa acccatggca aagcaacata 88621 cccgtttaag catagtcgct cgtctaacgt ttcgggacaa gaggagagga acgcctttga 88681 gatcaggatg ttctaccgct tttcctaggg aaggtatata ctactgatag tattccgctt 88741 tgtgaaagtg ccataaaagc tctcttcgcg tcaacgacat ccatctaacc atctaacgag 88801 atatgcttcg ggtaagattc cctgatctga aatttcgtta gtgagctggt tcggtctatc 88861 ccttcactta acctaaagcg tacattcgat agggaacttt gattatcaaa tgggttggtt 88921 tattagcgca ctcacgccaa gtgctaccgg ctcatcctgg tgaggcatcc caagcccctg 88981 taatatttcg cttttttcgt cctctcgctt tgtccattcc ttgatctacg aaaggtattc 89041 agtattagga tcagaggcct atgggatttc agtagccttt aatcaattac aaactcatca 89101 ctcttcttga aggttaactt cacatcttgt gtgtgttccc ttgatgtatc ttaagaccca 89161 tttcacagcc tgccaatgac tcctcaatgg cttactcata tatctgctaa tcactcctac 89221 tggataaggg taatctggtc ttgtaccaat cattgcgtac atcacacttc caacagcact 89281 tgaatatggg agtgatttca tatagtccat ttgttcctgc agttcttctt ctgtagctgt 89341 ttggaatctg aagtgaactc ccaaaggagg taggaaaggt ttggcacaac gctcatagga 89401 ctatcggtgt gcggagttca tctcttggtt agcaaggaaa gattcttcaa ccagacatct 89461 cccatttaat cggaacagaa gtaagaaatc taatatcagt acggaccagg ggaaaggtgt 89521 tacaattgta gcatcaagtg cactcaatcc ggttgttagt atttgttagg tcttccgtgt 89581 taggtcatct ataagactta gaaaatactg acactttcaa aaagataatg ggactgaagg 89641 tgctccccat aaatctgaat gcacataatc cagaggattc ttagtagtgt gctggcccgt 
89701 agagaaattc actctatgtg tctttccata gatgcaatct tcacaaaatt tgagacttga 89761 aaccttgctg ctatctagga aacctttctt gaccagtaac tccattcctc tttggctcat 89821 atgtgcaagc ctactgtgcc aaagccttgt ctcatcctta gctgtttctg caaggttaga 89881 ctctcctgtt tcaacagagc cttgaagaat gtaaagtgaa tcatgtctgt tccctttcag 89941 tattgttctg catcctttaa gaaccttcag aactccttca ctacaacttg cttcaacggg 90001 ccggggcctt tcccattctt tcgaagttgc tagcattctc tctcctttgc gcatgtgtgg 90061 gaataaattg actccatctg tctctgagaa tgggggaagg ccacactttg ttctcttctc 90121 tctctctagt ccaatgtcca attaaaccta accgatcaac cctttcagtg gtccaatacg 90181 cgcgctgtag ggcgctgaca cttgggtcaa cgatgattgt acctaaaatg agaagagagg 90241 agataaagaa agagagcgta ttgaaactct taggtgagga atttggaagt actttaaggc 90301 ttgacggaga agagagagat gagagggaag gatagcagtt accgagtagg agtagttgat 90361 cctcacatca aaatatgttt ataagaaagt ttcctcattc aagtgagaaa taggataaac 90421 caccgcccaa agttgcttag ccaagagtag tcctatatgg gctcgcgaag ccggtctccg 90481 gtgggaaaga tcctatctaa ctatttggga gaacttctct tcttttcttt gattgtgctt 90541 tgcttaacac aatcgaagtg ggatcatcgt ggagaaaaat atcttgcggt cagtctttcc 90601 atatcttctt ctttagctca ttcgcaacag tttcctccat tcctgcgatt tcctcaggat 90661 tcctcgcttg ttgtgatagc aatctataac cgctctgtac cgtgtacatg ccactcttag 90721 tgtaagccca cacattatca tccataagat ttgcccctgg tatcatcgag atgataaatg 90781 ccacattctg tggaggaaaa atattatgca acatatccag tttccacgac tgcctattca 90841 tgtcgtacaa gtcagacgaa gattaaggtc aatagatagg tttcaattca gtggaggtag 90901 aggtgtctca tccattatcc atctgtctaa ccaaaccttc gtgtgtatac cgtctccaat 90961 cgtccttagg agcccgcgag agaggagttc tctgccatga atgatacttc tccaagcata 91021 tgaaggtctt gtcccaacac tacactccat cattgagcta tgtgggaaat atcggctacg 91081 gagaagtcga gatagtagcg tatgtggttg gtgtataatc cggaaagatt gttttgcaag 91141 aagagcttgg ttaaaccaac ctaaatcccg aaaacccaaa ccaccatcat cttctttaga 91201 tttgcataat ttctgccaag ctacccatga gatcttcctt ttgttttcgc aagaactcca 91261 ccaaaattcc gtcatcgcac ttgttaactt cttgcatagc agctttgaaa gccggaagca 91321 cgacatcgca tacaccggta aggccattgc tatagatttc 
agcagcacct cctttccccc 91381 ctgagatcaa aattttgcaa accaaccatt aaggcgggat tgaagcttgt cccgaatgaa 91441 ggctaacaac ttcttatttt atctacaaca gcagaatcgc agcaataatc aacagataga 91501 aagaaataga ttagaggaga gggtaagggt ggatagggta tggacagccg tagaactcgc 91561 gaagcatgca ttagaagcag atcctactag ggtttgttaa cctactcggc ggggtaggag 91621 atagatatgt tcctcgtagg gatgacctag tcgaaggaaa gaagatgttc ccaaagagat 91681 gttgaggaag gatagcagta tacataatct taatgcaatc aaacggttat cctccttttt 91741 ataggtatcc aaactcggaa gatgttatga gactgagccg gctaccgtac ttcgtgctgg 91801 aacaggggtg actcctaaca atggatccac cacagtaggg gttatatcac aacttattac 91861 catccggaca aaggaaaagg gaatagtagt gaaatgacag gaacgacagc tttcactgct 91921 acggctattg gctttagcta tactatggca ttccctgtag ttcttggagt cactcaaact 91981 cgaactcctg ttacagctcg gctatagact acgcctctat taatcctttg ttcgtgatca 92041 ctcccagcaa tcacttcact ggctaacagt tagttagaaa gtgttccggc taaggcctat 92101 agaactcatt tcactccggt tatagctgaa agtggctttg ttcttatcgt tcagtggtca 92161 ctctttatag gtaagtaaag aagggattct attgtattcc gatactgcta cgagctgtct 92221 taactccact tacttctagg ccattagctc agtaagctca gtaactccat tctcttatgc 92281 cagccttacc tcaattccct tcttagcctc aacagctaaa gcatcccttt ctacagctcc 92341 tgaaacagcc gaagaaccac aggcctaagt aagtgttgtg ttcaaccttt ctataagtaa 92401 tatgggttgg ctggttggaa ggacagcttt atgtcgtaag taagtaaata gtcgtcaact 92461 atcgagaacc tgtctgaagt gttattccag cgcttcctag tctttacata gtctttttcc 92521 ctataatcaa gaaacgtctc tcaagtcgtg ttccaaccag atgactccat ctctctcttg 92581 gactgtttcc ttcatcctca gaataggtgt ctgagggtct tcatctccgc tggaatcttt 92641 acttgcaatt tgatcaagtc agctatccaa aatgagtttt cattcaagaa ccactttact 92701 ttgagctcgt agcggaaaga gcagattaac tttcagtttc tcatatggag tcccatatgc 92761 ttagccatca gctgagaaag attgatacca agtcggagca tatcccgttc gagggtccac 92821 ctattcggga gccagtacga gggtcaattc cagtttcgga tacctgcgcc tattgcatat 92881 ttagtatttg cagttcaccc cgctgttaga taggcagagt aagcataggc tgtggtataa 92941 atcttttgag tttgaagata ttttatcgac cggagcaaga tccattcata agggtttgga 93001 gatttctcaa tctaaaaaat 
caaatacgat tcaacagcgg gagagatcag cattgatagg 93061 acatggagat gtgggctcag aagggagtgg aatagagtca tttgatagct tagatgccgc 93121 cactagagag aggttcactt gcttaccttg ccgggtcacc aggagaggtt gtgcagaaca 93181 agaccaataa cccgatctct gctaaggagc aatacggatc ggataagaga ggatccatct 93241 tctcggggga gtctaaaaga agaccaccac ttccatacgg aaaagtgtag acgatgatcc 93301 gggacttttc gatctttttt ctgttaccta actcttcaat cactctttct tagggggcga 93361 caggattcga acctgcagtc ttcaggtcat gagcctgatg agttttcttc ctctacccag 93421 cggtcttccc cttctctttc tttcacaatt cccacctagg tcccccgctt tgttagcttg 93481 gccgggatag aagctgcgct tagtaagcgc cccttgtctt gtataggttg gagctatgaa 93541 gcttagcctt cttattagaa aaaggtatga aatgactcga ctaaaagagg aactttcctt 93601 tgtgaacatt tacttgcctt tgcctttcct tcccgcctta agaaaaagaa agcccggatc 93661 cggataagca gctatcggct tcatgccatc ttcattcatt tgtccaattc cgggcactgt 93721 aagaacctat tctctgattc aatcaaatat cccaaaccaa aggcctatga aagcccgctc 93781 aatagcaaga aggctttttc aacgattttc atgcttcctt cctcgattga tgaggccgtc 93841 ggggtgcctt ttgtgctttg cttgcctgtt agttttgcgt atcagttact agctttcctt 93901 ccctagcttg aagttcagtt cttcctttcc ttgggtcatt agtggcatta gtttgaaaat 93961 gaatatcaga tcatctgtcc aaattgatgg aggaagattc tttcacagcg cattcaattc 94021 ctaactattg tgacgtagcc cgatgtagtg tgactcgacc tttcttcttc ttttttctgg 94081 agagcttcaa tttagctgtc atccacggag aagcttcttg ggctctagcc tcatgtctca 94141 tattctttca tattctttct cgttggatat ggtatcaact tcaagactgc agacatgaac 94201 tcaagatccc accaatccct tcttcttgtt atatgcgacg acctaccgaa gacaagagat 94261 gattagctcc tttcttattg gattggagaa gatggctcgc tcacttcctt tgtgcacctc 94321 ccgctatcga tctcatttct tcccttctaa aagaaactga agaaagaagt agagtcgcct 94381 aggaagtaaa gaaagttgtg ttggttcttt ccctggttcc ttgttcccct gttgccctaa 94441 gagaaaggct gttgtgattc tacggtttag agcatttcca agataggcat tgttagcctt 94501 gcgcttgaac ctacaaggag gaagagatag caaagtctta tcttataggg ggcctcgaaa 94561 tcagaataag ttgaggggga gggaattcaa tgaatctgaa gcatcagata gcaacaccac 94621 aggataatcc attcctactt tagaaaaagg gaaggcgtga gggtaaggta aagtagaata 94681 
ctactcaaaa agctggtaaa agaccttggt tgatcgggta tcgaaggaat ccctttagtt 94741 tgaaactaat acccccttgg aaattgattt gatctggcta cgtatctcac ttcctcagca 94801 accggtaaaa agtagattca tcaagatagg aaggaaagac gcagtgctcc tgtggtgaaa 94861 ctgcaggata aagataggaa aagtggtgct tgtcttgaag gtgaggaagg gaagaggaag 94921 tgaatttgct ttgggtgtgg gttaagctct tccttatctt aggacgaatc agcagagaag 94981 ggccttcgct tcgaactcgg atgtttagga agaaagctgt ccacttgaag ctgaaacctt 95041 agacttcctc tcgggctatc tatgaattag aaagctaacc cttagatccg ggaagaggga 95101 actggcagta agggtattac acttcaagaa aagtggaact cactcgaata gagagagtct 95161 aggctcactc acagacccag ctgtctttgc tttgttcaca acgtcagccc gcaagaactc 95221 accctcgtgt gaaagatagc aagatgtggt tgatggaagt aggtatgtgt cataagttca 95281 ggcacagtgt cttctaggaa gaaactttgt gtaaggatct tttacagtaa agcaaggtat 95341 tgatcaaagt gacagtggtg agggtaacct tgtgaaaaaa aaaatggtta ggtgtgccct 95401 ttcctatggt gtctcttctg tgttagctag tataagtctt agtgcttttg ttttagtaag 95461 gtcaatagga gttagctggt agggcaatct ctcgtgcaat caggaataat ccttgaacct 95521 tccaccccgg agtttcttct ttcttcaaag ctttggattg aaaacctttg cttagcctta 95581 taaagtagcc aattttccaa ctataattag aatgaatgtg aaaagaagat gaaagaatcg 95641 tgtgatacta tagctagcta cgatccccta caagaggaat cactctactt accctttggc 95701 gggcttcgag gaccctactg actaattctt tatgcatgga tgcaggaaga gcctacgagt 95761 tcatttgtaa caatatgaaa gaagaacctt gcgtgcccga tcatgatata ggtacagctt 95821 ccagccagaa aggtgattgt gctagaaaga tggcagtgaa gatcatggat aaagagctag 95881 tcaagcagca tcctgatgat atccacgtac caactactgg cactgctagt gcttttgtgg 95941 attaacctac ttcccagggc cggcggtcag agagaaaaga tctctatgaa aagcaggaga 96001 cgggaaaata ggctcgctcg aattcgaatg ggttttaaga gttcccggta atcaataatg 96061 gcaatgcttt ccttagaaat ccaataagac aaaagggcgt aacggtaccc tacgaaatcg 96121 gcagatgatt gctttgattg gcctgctgta tcagtttatt cgactacttc ctccataccg 96181 tagtggactt ccttcctaga tagatgtagg agcagcggaa agggctgctt tagaagcttc 96241 aatagttccc tcatcaacgg atggtttagc cgccaggtga tagggatgtc aaatagaaaa 96301 ggcaagcagg cgtagaaggc cacgttcgat agctgaaaag tcaagctgcg 
gtagagggtg 96361 taaactaaat agttacagct gggcttggca attatctagc cgaagtggaa ggttaaacgg 96421 gataaaatcc tggtcattct atagttgcag gatcggtaaa tgaagccgga aagagtatct 96481 agagataaaa gtccaggcgt cgaagccaat taaacacttg gacaagatag tgaagggtct 96541 ggggagaagt aaattctaga gttcattact cggatgggga ctacagtaaa gaaatctatt 96601 ctagaggact aaatgtcaac aaactttagc gagagtacca agatcggact caacttccta 96661 ttccaaaggc aactttacta gcgtggttat gaatcaactt acaattcttg tacaggagga 96721 actgttaatg aaagacaact atagtagtaa tagctgtgaa ttccacttct gagttagagt 96781 tcaatccttt ccatctgaaa gtacgtactt cttatagctt gagaagagga aggatgcatt 96841 cgtcagcata ggaaggccca atcaaatacc ccatcatagt ataagttttc cctcttgatt 96901 gacttgaaca aaaggaaaga aagaaagatg cacaaacgaa acaggaaact acgaaggtct 96961 caggagcaat atcaactacc tagaccgacc tgctataaga attccataaa gaccttgcga 97021 agatacctgt actgaatgac tggatcgtac tagatagata gtcttagata gacttgaata 97081 cgcaattcct gccaatctcg cacttgacac gcaataaaca gcagtctatc ttcacagaga 97141 aggatggcaa ggggtgaaag gcttagacct tatctgctct aggctagttt cattctattc 97201 ttgttcagca tatttagttt caaaagatag aggtccggag gcattaatta gttgatgagg 97261 agaaaggcca gccacgaaac actcacccaa tcaaccttca aaggcaagta aggactctct 97321 caaactgaaa attgctgttc acattgcaag atctgtctca agatgttttg aaaatgagaa 97381 acagcagcag aataaatagt atactttatc gagaaaaggg aggaagtatc ttctttctct 97441 gacgtagacg ggcattccta tcgagattac ttccttagga ccgggaggtc gaagaatcta 97501 tatgctacct tactttcagg atggctttac gaaagattcc atttagctga agtaggacga 97561 agcggtatga gataagaggt ctcaacgagc cttcagcatc tgaaacggga gctttaacct 97621 gcatatcata cgggagtctt agtggagagc gttctctggt tatcatccct agttaaattt 97681 gaagtgacgg aggaacccct attcgataaa gaagggaacg gtgcaaataa agctatttcg 97741 agtcctctat caacaaccaa ccaaaccacc cgcttgtcgg gcttaaagcg aaactgagct 97801 tctcactaac aaaatcaatc gtaagcttca tgtccgagac tcggaaagag tggcgttaaa 97861 gagggcgtca gagcctctcg aagtctccga tctcacagat ggttaggctg caattccttc 97921 gcttgtgaag aagcttggat agttctcctt actcggacat tctattcatt cgaccaccgc 97981 ccatgcttaa agattgaagt tttgttttga 
cccggtaaaa aaaaagaacg ttgtaacgtc 98041 ggttaacgca gcctcacagg cagcctgaga agtttttgac ccctttacct atcttgggtg 98101 agctacatcc atacatagtc taagcctctg attgacagta ctagatacgg aatcttagtc 98161 ccaagagttc tgcggaagcg ggctaaaagg caatagttcg ctagggtggt tatcccaatt 98221 ctttgataag atcgctaccg aacagcccca aaagcctagt ctttagaagg aacgaacgga 98281 tttctttgtt tgttcgtgca atgaagaatc ttatctgaag aaggtaatac ggaaagaaaa 98341 aaactcaacc tgatgctact ggaatagaga agcaagcaaa gtgggtcggg ctggccaatt 98401 gaaaaaagcg gtatttcttc cgcgaagact aactaacggg gcgcactcaa tttctacctt 98461 ccccgagctt accaccaccc gtggtacaat agaatgaaaa cagagtaatg accaagggaa 98521 ttttctatta accaacggga gatcctcctt ttctcgctaa tctaggtctc caggttcccc 98581 agcacagact tcttgttagt ttaaactagt aaaatgggag tgaagctatg ctgttaaaca 98641 agacaaacca atcgcagatg tactatcatg cgaggaaggc cccccattat tgttgaaaaa 98701 ggactattcc caacacgtct ttcgtctaat cgggagaaaa aaggaaaaag gcgttattgc 98761 tgtgcttccc ttccctttac aggcaagctt taacttcctt cggggaaagg gaaagacctt 98821 gaaaactctt tccgtatagg ccagaaaaca gcttgcttag agaaagactg actctcctac 98881 ggacctggtg gaccttacac agtcgagtta ttgcatccct ctcacaaact atcaatttca 98941 taagagaaga aagatcgttt ttagaaaaga aagaacgttt tgattcgagg cggatccctt 99001 ttttctttgc ctttacgagt tgaaaagaaa gtggtaacag gtagtagctc tggtagagtg 99061 caggactgta aatccttctg tgagcggttc gaatccgcaa tcacttatag ttagtttcat 99121 cgatattttt gtggtgttca gtgtaccctg agtacaagat cgaagaggat cctcattccc 99181 tctttagtca atcccaatcg aaaaatcagc ggagccaatg gcttgatcat aatgctgcgg 99241 cggggtcaga gagccggcaa accgtttctg gttctagtag aaagggattt actgattcgg 99301 gttgaggatg acgagcaaaa gagtggagga aaagctgtta tatagttagt ctatttctat 99361 cttcctatct ttgtcatgtc agtcttattc ctatcttggt atctttgtca tgaggtatgc 99421 agcacactca accaaaatat atagaattgg atcatcaacg ggtgtgctaa agaaacgcct 99481 ctacgtaaac ggatgaacgg gatcatcata tagtgagctt cgtttgggat acttaaaccc 99541 acaaaaatta gctctttagc tgatatacga ccaccaacta tctggctatc gagaagatct 99601 aatatgcagg gaagtaattc aatagaataa cgcaaggttc gataaatcaa ttgatgcaaa 99661 agcagggaat 
tagaccactc catcctttat ttcacctagg agccatctta gcagcagggc 99721 tttcgactct cgaatgtcaa agaaggtcct ttttatcatc tgagctttct cacttgtgcc 99781 ctcagccctc aagacttcgc cagaccttga ttttcgattg aagataatcg tttcttcgct 99841 atcccgatac caaaatggga agatctccct tctgggttgg tagatgaatt gttctgggta 99901 acaggcggtg gacatcttcg ttgacagatc tgagttatcg acttggtcat atgccctaaa 99961 gttaccctaa gccctcgggg ctccatccga acctgtgact caaccttctc tttagcggag 100021 gagaatagtt tcgagtgagt gtgctcccga accaacgatt tgtatactct ttgccccagg 100081 taccaatacc catgtcttcc tgcttctttt gattctctta ctcttcaaaa ataaaagcaa 100141 agcacaccgc taaacaaaaa gctggtatta ccaaccttct cagaggttag gcgattcatt 100201 gagtcgatca gtcatctttt tcccaagaca actttattgt gtgtaggcgc acaaaggaag 100261 atcgtaaaat caggattact cgtccacttt catcgccttt tgttgatccg tgtcatcctt 100321 gctgcttcat ccaattattc tttattcctc ctctcacttg ggcgaaacac tagaaacggc 100381 cattaattta cggatcggtt atacaaggag catacctact aatacttcgg cgtgtgctgc 100441 attcgagtgg ttaacagcat acatctattc ccaaatcccc aacggagata gatagtatcc 100501 ttctgtgcta tcccctatat cctcgtagta gggttgaggg cagcgaagtc tcccgtttat 100561 cctgtctcct tcctcttggt tagtgcgtac tacttccaac gtatcttata tagagacatt 100621 ggaggagcct gatcttccat ctacgcagaa gagatgcttg agttcttcca tccgcgaagg 100681 cggtgtgctc aaataagtta gtttgttacg agactatcag gtccaccagg ataattcgag 100741 gtggatagcc ctgactgaca ggggactaag attaggaagc atccacaaga gcgaaagagc 100801 ccttttatgt ttgtttcgct aatcttatag attggaagtt tgcccagcac agagtagtag 100861 ggttatgctt tcgaaagtct tttcggagta taagggaggt ccatctttag agatctgaat 100921 gaaagttctg agcttaatgg attcccggaa ccccaggact ttagagagtc ccaaccctaa 100981 gaaggcttgc tcgggttgaa agatgtgatg cagcacagac agaggtgcga tagtaagtca 101041 attgatcagg agaggtggta gcttctggtc tctcaagctg acaatggtag tctagataga 101101 tataaagttc tatgctgttt gcttgctcaa ggctagaagc aggagtctgt tattgatagc 101161 tcccgttgcc attggctgtt ccgcccgtgg tctttcagat agacgttcaa aataaaattc 101221 ttttcatcct ctcctcctgg ctaccagtat atggagatgg tttgattctg agagttgagg 101281 aacaccaagc aagatttgat tgacatcatc tacggctttg 
cgccaatatg aattatgaat 101341 ataggttaag tgaaatctga caaccagcac actgcttgat ctcattagca gctccggcga 101401 agcagcacag accaaggtaa gcacaggata ggcagtatct ttatgaagtc tcgtaaggct 101461 cagatcttta atagaagtaa acttcaggtc tttgttctaa taaatagcca tcttcgtcct 101521 caatggcaag aaagaaaggc tgctaggagt aagtgggaag tgccagggta aagcccctga 101581 tggatcgatc aacggattct gcaatttcat ctctttcata agcgattaat tcttgtttag 101641 ccttagttgg gatttggact cagaagccga acacagttct tagatcgatt aaaagcctgc 101701 cgcttatcga cactcttttt ctaaggatct taggcagcgg ttgaagttac cagggatgga 101761 accataggtg ctttagcaac agcaactcga ctattagaac atgcgtacgt gaggagcaac 101821 tcgagtgaag aagcaagtta tggtgtattg tccaatgtgc cccgcttttc tccatgctga 101881 ggactattcg acttgtgagt cagattccga tttagtggag gaacttcctt tgaaggcacc 101941 cgagagccta tataagatta ccaatttcct cagctagatg atcaaagtct tggatgcagg 102001 atggctccaa tctgacagat gtggcaccaa aacctaaaga ttcctttact agggtttggt 102061 ggaatgatag cccaaagaag atttttggtg tagaaagtct caaattcagg atcttccgct 102121 gtacggattt cctgggaaac gaagtcatag gcacgtaggt tcagagccag accgatcact 102181 cacaatatcc ctgggcaagc caaagatttt gaccacatgc ttaaggaaca actccgcagc 102241 atcatggacg gtgcatgcat gttgcgctgc tatgaacaca gcatacttag aaaagcgatc 102301 caccactacc atcacggacc taatcttttt aggaatggaa cccacaaaca aggtcttgaa 102361 gagccttgtg ttacctaact aaagagctcc caaggccacc tgattcggag tagagcactc 102421 ctatttacac ttcgattccg cccactgatt cagatccagc taagccccgc aaccaaagag 102481 gagacacttg caccggggta atgctaccac acaggttttg aggcgaggcg ctgccctctt 102541 tcgggcaatc aactctttct tgtacctggc actagtcaaa ttgtatctcg gttcatggca 102601 tgccacaacg actctatcgg cttatagtca ttcctggcac tcggcgcatg tctttgatca 102661 accccggttc cgtcactcgt atctggttga accagtactt cctttccgcc gttcctaggg 102721 accacccact cccgttccat gagtgatccg agacagtacg cggagattaa tcggtctctt 102781 ttggttctca ttccaaaagt tgagcagcct gttagcataa aaatgcaatt acctccttct 102841 tatcgttctg ggtacgggcc tagtcctgag tgcgctgtct gtcttctcgg ctggcccaaa 102901 caacgggatt ggtcctgtgg gattagcacc aaaattcctc ggcatctcat cctcctgttg 102961 
ctggtcattt cctaaaggat cgaaggttca gccgacgcct caaagtggat ttgttgtgcg 103021 aagggtgcga ccgggattta gagttagaat agcggggaag acttgacctc aggcagcgtg 103081 aaaccagttt atttcattcg atgtagtggt ccgattgcct agtggaagcc ctgaggcact 103141 tgattcttgc gtacgctcgg ctcctctacc aatcaccaca gcgcgagatg cgcaagagtg 103201 acccatctct ccacaccaac tggaccatac aatacactct ttataagaaa aaaagatttg 103261 gaaccgaaag ataggaaata aatttacttc cagagcggaa gggaatactt aaccacgcct 103321 cgagtcccag tctcctagtg gtgagatggt aaagggcaat gtccggttct ggccaagaga 103381 tgtctttgct ccgtccccct tcttgttggt tggcacaagc ccttgtagta gtaagtcttt 103441 cctccggccg gatcggatgt tagagtcaat tccattgtag gtttgaccat taagtgaatg 103501 caaatatagg tagaatttga atcaacgacc taagagtctt cttttgccta tcgtagttac 103561 cttgatgtga taaatggtcc aactttcttg aaagatatca gatctttaaa gtcctgctca 103621 tatacatact aatatgatcg ctaccaggtc atcccactac acaccatcaa gcaaccacta 103681 ctgaagctat caactcaatt ctctcgttcg aatggggaga tagataaaag aaacgaaact 103741 ctataaaata tcataagaga agaagatttt aattccagct taaataagta agacttgact 103801 ctttaaaaaa ttccgatcaa caacttggag ggatggctga gtggcttaag gcattggttt 103861 gctaaatcga catacaagaa gattgtatca tgggttcgaa tcccatttcc tccggcgcgg 103921 aagtgaaacg ggcgggcgaa atgagaagag cactacttag tgactaggag cggggagccc 103981 gttgcgcgtt tttttgtttg accggcctat cttcataagt aagctcccta tggccgtcca 104041 gtccctgggc ggctctcggt tcttgagcat gttgggagat tagtcgtcaa ttgaaagagc 104101 tgctctaaag cttgacgaag aagttttccc tattaattag attagtaaag ggcttttccc 104161 ttactagtca agtggtaagg tagggcgctc ttcgatgaag aaaagaagag acttttggaa 104221 aagtggttca gctcagctgg ttagagcaaa ggactgtaaa tccttgtgtc agtggttcga 104281 atccacaacc acttctattc tcggagctga ggtatatgaa gaatggcctt ttggtccttt 104341 ggacacgtag ccgagagcga gccggatttt taaaattcaa gtgaaagaag gaagaaggca 104401 aaaagccgta tagtgcagga ggcagatgaa aaaaaaaaag caattacagt gagacgcgag 104461 gtgtagcgca gtctggtcag cgcatctgtt ttgggtacag agggccatag gttcgaatcc 104521 tgtcaccttg attaatgtct ttttttttgg aacctttgtt ttggagtttt cccaaaccta 104581 taccttactc atcttactaa 
gaaaatgggg cttatgtttt tcagccgcat cgcactgctg 104641 cataaacaat agatccgctg ggctggatga gcaaccttct atctggcctc tgtactagta 104701 gtagagttgc ttgctacttt caatcataaa acgaaaattg agcaaggcaa gggagaaaga 104761 agttgtcccc tcctctctgg taacccgccg ccggtcatat agagcgtccg cccgccccag 104821 catatgtata aaaaagagga agaaagaaca accgttttac tttggcacat gaggtggcgg 104881 gtttggctag gtaacataat ggaaatgtat cggactgcaa atcctgtaat gacggttcga 104941 ctccgtcctt ggcctacacc ttcatgacca gaaataactg gaacaaaggg aaagcccact 105001 caactagtaa catacggaca caaacaacaa aactacttgt tacaaataac atcttgttgt 105061 tgcatcgaaa catataaatg tttactaatc tggctaacat tgaacttggt aaaattaaat 105121 ggttgaataa tttaaaaatt agattattcg ggaatacaca ttgaatgcga agattacgca 105181 tttcatttct ggtagtagat ccataatcaa aaaagtgttt gtgattgttc cagaagaaga 105241 aatgaaacaa aagatacggt agagctagga cagttattgt atgaggtcta cccaatgcta 105301 aatgcagagg cgcataatag atcgatatga acatcatgag ctgtcccgca ataaaacctg 105361 ttgttgctga tactttcttc tcggttcctt cttctccttc gtccataacc cgagctcgga 105421 gaaggaagag ataagagggc cctatggaga atgtggtcag aaatccataa tagagtccga 105481 ccacaacgac cgaattgatt atcttcatgc ataagtatac tagattacct agtataaaag 105541 attgaaaaac catcacaaac ctcccctttt tcttttctat ttctggatta ttatatgatg 105601 attttgcaac tttccatata tagaaataga aagagataga ctagaaacga catctcttat 105661 gtcaatgaca ccaaagggat attaaatgaa tggaattggg atatggatag aatataatga 105721 aataaataga gccgctttga ggttccctat gaaatgaggc atggaacgga gccactacga 105781 agaagttccg ggggttacga aggaaacttc gagttcatat tggtcatgtg ggttgagaac 105841 gggaattgaa ctctataaga tctaatctcc cgttgttcct cagtagctca gtggtagagc 105901 ggtcggctgt taactgattg gtcgtaggtt caaatcctac ttggggagat ttgttagtta 105961 tcgcttttct gacctaacgg cccctgtcct tctcctttgt ttctaaacta gctgaatcgt 106021 gtcaaaagtg gaaagttgat tgttggtttt gattcctcac tctcgtatag gtgaacttgg 106081 ttcgttaaat tcttaggaag aaaagaagga tcaactggga atgaatggga agactggagt 106141 agtatctctt cattagtcgg aaggaatttg tctccactag cctcattcga aaaagaacag 106201 aatccgaaca aacaagaaaa gggtttaggt aggatagtgc 
gatgtagtcc aatggctaaa 106261 gctctgccag cttcttgtag actgaactct ctttaggctc ctagtggttt tcgggtggga 106321 tggttgaaat tttattcgcc actagtggca acgaagttct tggtaatttt aaagggggaa 106381 ggcttgcaga ccactcattt cattcattca gtgcctgcgg tgaggcgcga cccacaacaa 106441 aaacaaaggg ggaaagcttg cttactgtag ccctctttta gtagactagg cttggtaagc 106501 gtaactattt taagtaggct cgagaagggt agggctcgca gcttcgccgg aagcgacgaa 106561 ggggggctgg ggaaggccga cgactacatg agggggaagc tgtcgtagaa gctttgtccc 106621 ttgctttaca gagattgtta tgaattgact aaatgactag gattctcccg gaacgctagc 106681 taagctaaga aatagtatta gttcccttca atcaaatcaa taagtttatt ttttttattg 106741 attcagggaa aaaacctcct tcttagaacc cattgagggg gcctcggccc gggaagggga 106801 gagtggccga gtggtcaaaa gcggcagact gtaaatctgt tgaagttttt ctacgtaggt 106861 tcgaatcctg cctctcccac ttgttgtagt tttagcttgt atatgtgaaa aaacgagatt 106921 tctttcatag aataactctt tcacataaga aagtggaggc aggcttgggg gtacgactaa 106981 aatgattcca catgaaaagg gaccggcaat ccccctcttt tttaatgaag aagcgggcta 107041 gtccccggga atgcccgtta atcaagcaag ttggggaaaa aatcttacct attagacgga 107101 acacgaagag gtgaactgcc ggaacaggag gacttaaagc tgaggaaaca ggaagggcag 107161 ctaagggaac ttgcttcggt aggagtgtgc tgaagttggc ttcaaacaaa gagaaagaaa 107221 gtagtgtgct gaaggaaaga agaaagagag aaggttaaga agaggatact ggcttcgtaa 107281 gtaaagtatc tgaagatgag atagatcgga gaggtgagtt accagcgaaa ctattcactc 107341 tgatgtccat ttcctgcctt ttctttgtgc cgatagagga tttaatatgt agattccttt 107401 ccttattccc tttctggtaa ccataacacg tccctgcttt tttctgtcgt tgactgctag 107461 agcaagaggg gacgaccttt cctaccttat gatagactga actgctacgg cggctttagc 107521 tctactatgg cagaaggaat cgccaacttt catttcatat agctgtaaaa gttagcaacc 107581 ccatttctaa gggaaagagc aatctttttt tgaaaagttt catctacagt caaaagaaag 107641 caaacctcgg cagcacagaa gagagctact actattattg catgcgtgct cacgtgtaag 107701 agtgtgctaa ataaggaaac ttcggtctaa tcctcgggtg tgctatcaca aggaactcgt 107761 ccggtgtagg attgtagatt acctgtccta ttcagcacac tcctccatta cagcacagag 107821 gattagctcg ggttaggaaa acaagtagct ggtcaaagct atcggaaaag cagaggagtg 107881 
tgctgacttc aagcagtagc tagtcaacca acttctgctc aaccctagga acaaggaaag 107941 tatcagccga taactctgtt cacttggtgt gggattccta aaagtgtact attatcgcac 108001 gccaaagagt gtgctgaaaa gggaatagtt ggttaactac cgaagctaga aacaaatacc 108061 tgctaaaacc aagggaagca agagtaggaa ctacccaccc aggaatcagc attctaaact 108121 gcagcacaag cacaccagcg tgtatcaggg gtgtgctata aaagatcaca gctaaggttg 108181 agtccttacc ttcttattcc ttccgtagtt agcagaacat actagcctgg cttatccatt 108241 cctcgtattt gaggaaagta gggtttgaca tcccattctc ggatgcacga attatgcgtt 108301 taatcccacg caacaaactc ctctgtgctg ggttgtgatc ttaaaaagga gctttactgt 108361 atagtgttac agtaggaata cctactagat cgaggattcg gaacagtact atcctgttta 108421 tattgacatt cttctgttaa gatttcccgt ttgagaaacc ctttaagatc tgagtatctc 108481 ggaattgaaa ataggacttt atttgcatcc cttccatttc ataggaatac agctaaggat 108541 accatcagct cgcatactat ttacgggaaa cagaagtagg gcaaagagga aaagagcaac 108601 tccctctatt agaacaatat tctcgaatcg ttgccctttt attcatgaag aaagaaaaca 108661 catcggtcta tctaaagcaa caaaattcgt tctatctttc tttctctcct attaataaaa 108721 ctggacttct cctctgctcc tatcttagcg cgacggagaa gccagcactg agagaagacc 108781 tccctttcaa agtccatcgc ctgcacttga accggatacc tactatttat tttattgaac 108841 tagacgatca gcggactttt atatcatatc accaacaacg ggtttcgccc acagtacaga 108901 gatgatctca atcgcatcgg cataaaggtc cggattccaa ctttcaaaaa ggatttacgc 108961 gcgaattcaa ctatagacaa aggtccggat tccacttcaa aaggatttac gcgcgaattc 109021 aactatagac atatctcttt ggcgacccag ggacggggat tcatcaatcc cggggttcga 109081 gggagacttt agaaccccta tatatgagtc tcgaaaccat tatttataca tttctcgatg 109141 acagagatca tctttaagga gaataaacat tggtccttta tttaataacc ccgatattat 109201 caaaaccatg atttccttat ttctcgatgg cataggttcc agggaaggaa aacagtccgg 109261 tcctttttca ccccgaaatt cgagagtagc cgtataaacc aataccggca gaaccaccca 109321 aactcggctc aaagcaggaa caaagacatc tagaaagccc acatcacgtg gggacttggc 109381 cttggtggga cattcaaccg acatacacaa ccaaataata ataggtttta gccatgtatt 109441 ccgctacacc gccctattga tctacgcttt cgaagcaacc accggtacca ttttcacaca 109501 acagagccga cttacttatt 
acatttatag gacagggaac ccgtctcggt aggacttcca 109561 aatcctacct caagcaaccg acatctcttt acttacttga ccgctagcca ggggcttgaa 109621 accccttaca accttccttg cttgccgggc ttacatcgct atcatcgcta gccagggact 109681 ggaaatccct tacaatcaca tgcctgcgct cgtcgtgcct ttaatactta tttattaacc 109741 aaaccctcct tggccccaat tggtctaaga ctcaccaaca tcttacgatg agacatcggg 109801 ctacctggca agagtcaata aattggtgaa cctatgagag acgcttctaa ttcttttctt 109861 tctcctgaca ctcactatcc tacccgatcc tgctaacagg aaatttgatg aaataaggaa 109921 acttaggctg ttgttgaatc aacttagaat acaaccggta cccacgtcgc aactaaagca 109981 ctcgtagcat acgttccatc cacatccggc cagttgaagt gaagatcgag aaaatgttct 110041 tcaaagcttg ctaataacca aaaaagaatt ggactagtgt aagggacttc cttttcaagc 110101 tagcatttta gaccaatgca atcgagtgaa agtgcttcct atcatttgtc tgcttgaaac 110161 cttgcattga gtgtgcacct tgggaatact tgagtttgga ggtcttcaag cctttctatg 110221 ggaaaggttt ccaatacgta ttcttgccgt tgttcgagct taaaagcgga tgatgatcct 110281 aatcaagagt ttggtctagc ttttcgattc atttgagtct gtcctcgcaa agatatgaag 110341 ggcgctttca ggacggatac ggatacaggt gacccctttt tcccctttgc tttctttatt 110401 acttctcgta cgtgcccagt ctcaacgtca cttccaaaac cgcctttccc agatctatga 110461 aaggtgaaag gttgaaggtc cacaaaaggt aacataaatg aacaagttta ttttgtattc 110521 gagctgctga gggccggagc tcgtatggat acattacttc attgttctat gaggccttgc 110581 acaaagccca tactcggata aggatcgcga agtgcaaaga tccggtcttt gacaacccaa 110641 ccgtcgtaag gacaacaaaa cctatactta tgtgtgttcg acactatagg cgagaccagt 110701 tgtcagatga tcgccgagtg gcgttctgct tgattaggcg aatgcgagtg aggtatctaa 110761 tcccgttcgc taggagggaa actcaggaag agttcgggca gaaatagatg gtgaaaagcc 110821 ctttatatta tataataaag cgcgctcctt gcatgactcc atatcattca ttattaaagc 110881 gaaagcacaa cgtgtcaccc taaccttcaa atcgataaac gggaatgcgt tacctaataa 110941 taatgagcaa tgctgctaag agcaaatggt tcctaacctc ataaaaaaag agatagtcat 111001 atccgactat ccggcaacac gcttggataa gtaaggcgat gaaccgtacc tttatctcta 111061 ggcgcctatg atgtcttatt atgtagatca tgtcttgctt gagagttcca aacctcgccc 111121 ttctaaaaaa gtacaaaccc acgtctaggg atatacgagc 
gccattcttc agatggatcg 111181 ctcttcgata gttatttatt gtagccggga taagcagggg ttcctccttc cgtcctagtt 111241 tttttttagt acaagtagct aagggggctg ggttgcgtct ttcttgtgaa agaatacttc 111301 tgcgcacctg ctgaaaggaa gtcaaacgga gattaccggt tccggaggga ccaaccattg 111361 tacttatagg gtcaggtcta tcttgccctt ccaaatcacc catgtgtagc tccaccgcac 111421 gagtcagacc ttgtacgggc ttttccacag gcttgaggca gactctttcc ttattgggct 111481 tcaatccgtt tgtctttgcc tcctctactg agtcagtgac agaagtgcag cagccaataa 111541 tacgtatata agaagaggac tgcttacggg atcaaactat caatctcata agagaagaaa 111601 tctctatgcc ccctttttct tggttttctc ccatgctttt gttggtcaac aaccaaccac 111661 aactttctat agttcttcac tactcctaga ggcttgacgg agtgaagctg tctggaggga 111721 atcattttgt tgaaatcaat taatccaata tgcgacgaat ctttttgttt gatgaaaata 111781 gtcttaattc aagttccacc attgatacat cttctgcttc caccattgat acatcttttg 111841 cgagtcaatg cactaacttt tctagtggtc aagcgtccgg tactcaggat actcatgctg 111901 gtatttttga ggattgtccg ggccttaatc ctaacgatga gcgtgtagta gagctgcaat 111961 gtgagatacg cgagaagtgt gaggcattaa cgcaagatcc cgaaatgggc ttgattttgg 112021 gcgaagcttt acatgcggaa agcgacaatg tccctttttt gcagtccatt gctgatgatt 112081 taacccaaaa cggagtatcc ggggaagcct ttcaagaagc tctgaatata gtgggacagg 112141 cggcggcctc cccactggac caatttgaga ttgtcccatt gattcctatg catatcggaa 112201 acttctattt ctcattcaca aatccatctt tgttcatgct gctaactctg agttttttcc 112261 tacttctgat tcattttgtt actaaaaagg gaggaggaaa cttagtccca aatgcttggc 112321 aatccttggt agagcttctt tatgatttcg tgctgaacct ggtaaaggaa caaataggtg 112381 gtctttccgg gaatgtgaaa caaatgtttt tcccttgcat cttggtcact tttctttttt 112441 tgttattttg taatcttcag ggtatgatac cttatagctt cacagtgaca agtcattttc 112501 tcattacttt ggctctctca ttttcgattt ttattggcat tactatagtg ggatttcaaa 112561 gacatgggct tcattttttc agctttttat tacccgcagg agtcccactg ccgttagcac 112621 cttttttagt actccttgag ctaatttctt attgttttcg cgcattaagc ttaggaatac 112681 gtttatttgc taatatgatg gccggtcata gtttagtaaa gattttaagt gggttcgctt 112741 ggactatgct atgtatgaat gatattttct attttatagg ggctcttggt cctttattta 112801 
tagttcttgc attaaccggt ctggaattag gtgtagctat attacaagct tatgttttta 112861 cgatcttaat ctgtatttac ttgaatgatg ctataaatct ccattaaagt tcttctttct 112921 tttatttata tttataattg aacaaaagcg agggatggat gtctgagcgg ttgaaagagt 112981 cggtcttgaa aaccgaagta tttctaggaa taccgggggt tcgaatccct ctccatccgc 113041 gaagtcataa gttctctctt gccgcctgat aagaacgaat cggatcgact cgactgatat 113101 gatagatgga atgggtacct tgtgttatga ttttgttagg aactttgtct ccctttcgtt 113161 atcttctctt tttttcttgg tcggcaaggg ttagggcctt tctcgctggg cgagcgcatc 113221 cgattagtcc tttcctaagc cacttcccgt tcagttgctg aaaaaaaaat ggggataagc 113281 tatgcgacga cagagaaatc tgaatctact ttttgccagt tccacgaatt ctgcagataa 113341 cgacgttctg ccaggagtgc cagttctgga catgtttcct aatatatcag tagacaccat 113401 aattaaccac cccttctact gttcgggtac agtagctctc gcagaagaat tggtggatcc 113461 tattattatg gagaggtgga tagataggac gactagttgc tcgatcagga ccttagcttt 113521 attgcgagcc tagaagtctc tcttttttcg gaacagcctt taagtaaaat ccaacctaat 113581 atcattcaga gtaaaaggac tacggaacca aagagaacca cacttgcttc gctaggaatg 113641 aagcagcaca ccaatgggca aagggtatgg gcgacctatc tgctgcgcta gagaaagaaa 113701 gaaagtgctt cgggctgctt tgctgcgcta gaggaaggag actgataagg agaggagtga 113761 tccaagctca actcgatact ccactcatct ttctcgattc tattgacata atttggtttc 113821 ttctatatta gtgaaatcat tacatgacag tggatcttgc ttggtacttg tccgagggga 113881 gatccatact caactcatct ttctcaattc tactgacatg attacttgga attgattgtc 113941 gtgaactcaa tctctcatat aggcatagtt gaatatcgcg aagcaaaaac ccaaacttga 114001 atatcgcgaa gcaaaaggct acatcagacg atgtacaagt ttttcaaaaa tgatttgatc 114061 gattggtgtg tttcgatctt gttgattcac taggtaaatc acagcctcat ccttctatat 114121 tacctgttga tcgatcaaaa aaagcacctt acctcggatc aagaaaaaag aagaaaccga 114181 atctcggaaa acgacaaatc acacagttga acaaggtggg agtagctcgt gtgatctagt 114241 cgaaggtcca taggaaactg ataaggagag gggtcttcca tctcactatt ctagattcta 114301 ccgaattagt tggaattgaa ttcgtccaag attaaatgcc agttgaatgt cgactagtgt 114361 tgaaggtcca agctgtactc gactcataat tttgcttttc tataccttac tcgtgaaatc 114421 gatgatgatt acatgacagt 
ttaatgtcgc cttattgctt gatccttatc ttgtatccaa 114481 accatactca actcagaatc tactgacgat cgaccaccaa ctcatgatta caaaaaccct 114541 ctcttcgaaa acccaataat cataaaccaa ctcgtcgact ttggttgatc cttcacttgg 114601 ccaaaggtcc tagccgtact tccctccact cataatcacc aactctatca aatttttaga 114661 attgaaatgt ttgactatta caaaagtaca ttttccatag agtgcaggaa ccccatcccc 114721 tcacccctca tcatctagca cacccttggg gttcacatac ggttttaaaa agcgaagcaa 114781 ctcttcatac tcatggcgaa gcacgcaaca agccgaaccc tcaatcccag tgaggactac 114841 tccgttgggg gttacttcgc aaggacgccg catgtgcagt tcctcagtct tccttctcct 114901 ccatatgaag atttatccca tacggtgcta gcgaagcaag tcttccgagt tcatactgca 114961 tctcgaacga aggtcactca cagttccgaa ggtcactacg gtttccgagt cttattgttc 115021 ttcttcttcc acgtatgaca gataccctca accctcaggc ccacgggggg ttgctacgcc 115081 cagggctact ccgttggccc tgaagggtta cttcgttagg gctacttgga ggagttgacc 115141 aacccgaaca tggtgccttg atcttcagct ttccaataag aagatcattg atgcagcaca 115201 ctgagctaaa ttcttaaact atctttaccg gtgtgctgcc tcgcagagat tctcaaccta 115261 gtccgcgcat actctttccg tttctgtctt acagtctttc ttgttcttct tttccctata 115321 gttagaagat gagcccctgg ggtgccctgc ccttctttta cggctgatgg acattcaata 115381 gatttatctg ctgttgcctg atctacgacc ccctcaatct gtcgaagcaa atgctgcaca 115441 gagggagaag ttgcctgacc gggctattcc cttgttatct gttatccaac tggttgaata 115501 gtaatctgtg ctggatccct ctgaagctgg tcattccttg tttccagtta atagtggagt 115561 cttacttact tgaattccta ttactgttag ctattagcac ttgcttgtta ggtgaggact 115621 ctcttatctg gttaacctta gtggtagcac actccaagtc tagacaatct tcctcccaag 115681 aacctctcga taccagaatg gctaccaggg cgctactaga ccaactcaga tcagatcgtc 115741 ggcataatta attcttttta cttgcccagt agtggtctct gaagtttact actagtggta 115801 gtgcagaaga cctctttacc tacccggcta ctgacaggaa aggatgattt tagattcttt 115861 gcttgcccat acgccgatcc tgttagctga ctggtactac gaagatctat gccattcaaa 115921 ggagagatat gaatacacca actacagaca agaaagagac tctaagtcca acccatgtcg 115981 tgggcaggga gagacaggca ggcagctaac acagatatca gatattagac ttggcaacag 116041 acgctttgat atgtgttaca cagacgccct gtagacattg 
aatcggctca ctccgacttc 116101 gattgctaaa atgttgacgt gaatagtgaa tacgcttgct ggttgtgata tgttccctac 116161 tatactacta ttgaagacct tgacttttga tcttccttct aggagaattc atcaaacctt 116221 gcttatcctt ctgatgagca ttcctctaac taagttagac tgcctcttac gagcaggtaa 116281 cctaccttct aaaccatgcc cactatactt tcttctaagc cagcttttaa ctctttgttt 116341 tcgtatcatc tcattggact tatttcaaac aaactcgtta cactcgcacc tgattacacg 116401 ggaacgaaga aaactacatg gggagcacgc ttacacctgc aagaactaag ggtccagacc 116461 aagcacagac agatagaacc agacatcaaa aatcttcctt tgcagcctat cagatatgga 116521 agttttcgac ctgtctttca ctggattgaa aagccttgta tgttgatagg actatgtggg 116581 ctgcattctg aagtttcttt catcgcaaac tggccatggg gtaaaccctc tgagataggg 116641 ggctgctcaa tcccatgtat gttgacaggg agagggagcg aactttctta tcatatcaga 116701 gcaagccgaa gacctcttcc gggaaatcgg ttccacttcc aatctttttt ttgaagaaag 116761 aaaagtagca agagaatggt tcaagttcgg atcgagtaac ggtgattgaa gggatgagat 116821 cggaagctcc tgattcaaat agttaactgg gctgtggttg tcgagtctgc tttccctctc 116881 tatcaaagcc cttgcacctg ccttctcagt tcctcgtgct tagcgggact ccttttctaa 116941 ttagaaggag acagattagg tcaagttgct ccaggtatga aatcaatccc atgttggtcg 117001 taagagtggc aagccatgag gtaggtctta tggaaataaa taagcatctt cctcaagcaa 117061 ctccttcatc tgtggcggct gctcctccaa gacaactgcc cagcataact aaactcttat 117121 ctttgacttg actctctttt gttctatgcc ccttaggtag gagtcagatc tgttttgtag 117181 cttgtttcac ccttctttgt agcttgccgt ttggtgtgtt catccccgcc ctccctgaac 117241 ggagctgttg ctaaagcata actaatatct tagctgtttg gaacgtatag ctacccggtt 117301 cttcctttct ttagcggagg cgtaggtaac atatctatct atgagttgct gagaacgtct 117361 ccaaagcata aggagttgtt cttcaatttc aaccaagaag cacaggttcc cgcaaccctg 117421 acgccagaag agttgcaaga cgtggtctgg tcgaacagat tacgaccgct gacctttaca 117481 gatcaattat tttccatctt gaaagaacag ccggaaggta tgaagctcta gtccttcttt 117541 ctagccgtaa atcaagaact cgaaaggttt attactttgt ttaatcgctt gactagcttc 117601 gaaagagcaa tctcctcttc tagtcttacc acaaatcaat gctttcaaga aagtgctcag 117661 ttcctgacaa catggttccc ggtcgcgcat ccccatggag gtggatattt cacacattgg 117721 
ggtcatcagc cgtcgaacac agtcaagtta gcaaagattc gtacgaagga aaagaagaaa 117781 gatagatggt ttctaatggg aaggtagcag ctgtcatcat tatgatgatt caatccatat 117841 taatccctat ttcagaacta tgtattagtt agacgacgtc aacggaacaa gaactccttc 117901 ctttacaggc aagcggggta gctgctgtct cgtttaggaa cttgagttag tcgactaaga 117961 ggtagaaagt ggaaagaagc ggcctaaagg attagcgacg acccctctcg gcttaaagta 118021 aaggaagcgg cttgtcccat ccaatgccct acttttaagt actcggtatg ttcactcaat 118081 cagaagaagc tttcggaagt tcattggcct ctgcggagca tgaaaaggcc tgttttgaag 118141 ctccggggac ttattagccc gtcagcctta cttcttagga agaagagaaa tgtgcttgac 118201 ttcttcatct tataagatga cggcaacatc ctagataaga tctacgattc ctgcttatga 118261 atcatggcaa accggtggtg aaaacagtat tacagagtag gttggttaaa ttaaagcggg 118321 ttactttgat cttatcgtct agccgggcat gagagcccaa taaaaaagga ccgacgacga 118381 tccgagaagg acgaatcaat aaatcgactt cctttagagc agacagacta cgcgctaaca 118441 gctttagaaa ctttgctata ctaacaaagc aagcatagca aagtcaaagt ttgagttaga 118501 ccgggtatga ggcaaagcca aaggcagaag tggtttatcc aggtcggttg aatccccgaa 118561 ataagcacca attgctaccg tccaatcgct atcattatca tggttggagc atagatagca 118621 aaggtatggt tagaggtcaa atacacatat ctaccaggta ggtgaaacga attccatacc 118681 catatcgggt aatgggcctc caattacttc agcctaaaac gaagtctgag gtcgggcctt 118741 cctggacctg ccttaaaccc tccatcttcc tgtcaacggg actcctctct ctctgaggag 118801 gtccgtagag tctcactagg gatctccgtt ctcagtgagc gagaatcggt atggtcagag 118861 tactgaggcc tggtcaaagc atctcagcct ttcttttgta gtaagccgct cgtatcgctc 118921 ctctagttta gttaattaag ctagcttagt ttctcgattg aaagacaggc ttacctcccc 118981 ttcatccctt gcttctttga aagtgctttc ctgagagtgt cggcagagtg cattggtcac 119041 agacgtccgc acaccaaaag cctcgtatgc agatcctagt gccaccatct ctgctgaaga 119101 tgatgccttc tccatggcac ctgggaacta gtaccatcat ctcctcatca tcagagagtt 119161 gtgggtctta tcgggttaag gggatccctg atggcagtgt tgagcggtac aaggctcgcc 119221 tcgttgccca tcagcggcct gggagagagt cttttgatag attaagccct gtggttaagc 119281 ctatcactat tcgtctgaga gtcttactct ataccatggc tggactcttc gtcaattgga 119341 tgtctttcta cagggtcact 
taaccgaaaa gagtcatgtc tcaacctccc ggtcgatact 119401 cttcttctct ttgcaggttt cgaaaagcta tggtcttaaa caggctcctc ctggtatagt 119461 gatctgatag ctgctcccag tccgacacct cttcatttat catcagaatg gaatggaatg 119521 aaagccaagc atctactagc cgcctagcca gtaaggggga acgctcctgc ttagtacaac 119581 cagaaagaaa gaatgcagag taaggccggg aataggcttc gtaagaggac tgtttgaggt 119641 actctggaag gacagttaca cccatgccag cggaagactt aggataagcg gtcactacca 119701 tcttattact atgggctaag cagatactat caccttaata gcagcaaaga gcaatagtcc 119761 ttggactcga ggaaggatag aacagaagtc gtacgtacaa ttaagatgaa tcgatgatta 119821 ttccccttcg atgaactata tcttacctgc ggcttatagc ctatctaggt gaccgactac 119881 tgagctatct accttctctt tgctcttaga agttcttctt gttataacat taatggagaa 119941 gacaagttat tgtacgttag ttgaaatata gatagaaata caaacctttt cctgttctgt 120001 cctaggactt tctaacattt gaagcttacc gggatttgct cttccctacc tatgataaag 120061 gctgagaggg gacttcagca gtataatttc attccgcttt ggacacgcaa caaaaagtga 120121 gggaggggca aatatttcac cgaaaggtgg gagggggggc caatatcctg cagcccagct 120181 ggatcttccc atcattgatc gactaaagag aaagtcggcg gatatattgg ttcgaatcca 120241 attaatgatt acaatttaga aggacttcat tcaatcgatg tttaggatgg aacagactag 120301 tcgttcagta gtcttacaga gcggttactc aatggtacag tacagacgtc tttgatcctt 120361 cttggtacgg gagttgaggt tgaataagtc gttcagatag ccatacagac agacagaatc 120421 taatcttttg tgaaagaaaa aaggggatag ggagctggca atgtcggaga tcctacctca 120481 gaatctttct atttagaagt tagggataga aggtgagtac caactcgaaa agtatactca 120541 catccccatg tctttagcca accgtcttga ttgaatagat ttcctgatgt ctagttttcc 120601 tactgtggaa atcctattta gagtagactc acttcaatga aagctattac ttagtgggtc 120661 aaaaagattt actttcaaac gtcagcatct tatcctatta gggaaatagg ctaaactatt 120721 ggtgtgtaac ggaagaaggg gaagaaaata aatgatccta gctgtcgaat gcagcttgat 120781 cttaatcaga ggaatgaagg tctttgaatc gcttatctct cagattgaac tgaccgggga 120841 gtacctgagc tctttctctg tttaaatctg tattccttag ttctgttatt agctgttagg 120901 tgggggaagt ataattccct aaggtaagtt cttctgatct catatgtatg tatactcagg 120961 ttctactggg attctcgaaa gagctccttt tgttcttctc 
tttggtttgg agtgagagga 121021 agtatcggaa ggtgccttgt tcagcacact cctctgcttt gctttgcttt tccgagacca 121081 gctatttgcg atcgaagttg aatagttcct atgaagatgc gtaatcccat tccgttgtcg 121141 tccttcttct tcttgcattt attttaattg taaactgagc ttgtcgatca aactgtggct 121201 taggtcggac cgttccctaa attattcact tcctcacaac cgagcccttc gagtctttgt 121261 cctgaaacag ttccttcttc gcatctctca agtgagaagg atttggtctc atcttcttcc 121321 ttcttccttt tagtcgagac tcttcctctc cttttagtcg agactcttcc tctcctttta 121381 gtcgagactc ttcctggaaa gcctaaaccc tcactcttcc aagcggacat caatcccgga 121441 atagtgtaag taagaagaag accggttagg atcacactag tcctggctgg cctataactt 121501 ctcagaaaca aaaagaggct tgcaggtcta gtcgcatacc cctatcgtct tattgttcta 121561 ttcgttgcag tggtaaactc attgaatact caaatatggt ttagtcacac ccttagtgta 121621 cattctgatt caatagcaaa cagagcattc ctttgcttcc tgattccatc cctgaaacaa 121681 ttctatcggc gaatcgagga ctctcccaca gctttgctta ttcgtgcaag atccagcatt 121741 ccttcactta tcatatagat aacgatcgac agtcagagtg tgcacaacga gagatgaagt 121801 acgctgcaga ttcccgatag cagcatctct ctatatgtga tgagaagtca gttaagtcag 121861 tctacgttcc ggaccagaaa gcattgaata cggctgtcag gctagcaact gattctccgg 121921 taagatagat cagtcagtcc gcccgcattc aataataaaa taaatggcat tgagctaagt 121981 caagcttttt ccaagtgttg taaagaacgg gctacaacga aagtaaaaaa gatgaaaact 122041 tccgtcactg cgcaccactt tgggctctgg tccactttcg ctccgtttac gcgaattggt 122101 tgtatctgcc cgctaggaac gctggcatct gttgacaaca gtcccgctaa aggccgctgt 122161 ctcttctcgg caaggtaagt tagttaggaa tcatagcagt cggtgaagct ttttctttgt 122221 tatgcacaga tggttgggcg cagctgctaa ggaaaaagtg ctaaagcttt agctgctttg 122281 attgaccgaa gagcctgact accagagcta gtaagctagc ggaagagctc agctcctact 122341 ataagctgat aaagagctac tatcaccaag agagggcgaa gaaactcatt aattcattga 122401 tgagggctcc ccattctctt tctaaatctg cagtccaatc cttcctaatt tcgaaactca 122461 tccttaagtt accaggggca aggtggacat gaatcaagag aataagctct ttaatcagaa 122521 tacatagttg aatttagtaa gaaaagaaat gatggttgac tcattcaagt gataaatcgg 122581 atcaagaaaa caccgcccaa ataagcttcg ccatgagtgg atcaggtcct ttaaggcgat 122641 
cccgcgaagc cggtccccgg tgtctaagat cctatctgac tattgaggag agctgatctt 122701 cacgacgaac ttttgttgat agtgctttgc ttaacactaa gaagttcgat catccaacgt 122761 agagagaaac atcttgcggg cagttgttgt gaacaaagga atgaatcttt gggtttcctc 122821 tcaacaacag aattttgttc aggagtttca gggcaagaat gataagaaag aatgcctttt 122881 tatttaaaaa gagtgtgaaa tttgagttca tgagcattat cggctctaac agctttggct 122941 acccttttgt attgagtttc tatgagcttt agaaaccctg ggaagacagt gagaacctca 123001 ctttgacttt tgagaagata gacccaagta gcccttgaat gatcatccac tatggttcaa 123061 aaatatcgaa aaccttctgt tctgaggttt caactgaaaa agggcccaaa atgtcaatgt 123121 gaatcagatc aaaaggttca tcacaaatgt tattatgaga tttaaaaggt aagtgttttt 123181 gtttagccaa aggacagatt ttacaaatgg tctgattgac ctttttattc ttatgagtag 123241 ggagtcaatc tgaaaggata gcagttttag aagaagaagg atggcctagc ctgttatgcc 123301 aagtgtgaga atctgcaata acagaggcac taaaaaaagt agtacctgaa agagatggat 123361 cagaatccaa aacatagagg tttgatactt cacataaatt ttcattttta cgaattcgtc 123421 tttctaaatt caactattag catagcatta agtagtaaac attttacact aggagctgcg 123481 cccataccac acacatacat gcaaacataa caaatcaaac aaagacaaag aaaagctatt 123541 tctagtatac ttcttcactt cagataatcc aaccgagatg tgattgcgat gatgctacca 123601 gaaggaggtg atgtcttcgc gctacgatcg aaagatcaag aatcaaataa tgacttttcg 123661 aggaatcatg ttaacgaaac gaatccaaca ggcttgaccg catacctact cctgaaagga 123721 gtcaagctta ttacttttag ttactagtta tagttatccc gggtaagcat acttttgagt 123781 agtttcagtg tattcaactc ttatctatag aaagcgagcg gagtccagta gtgaaattac 123841 tttatttagt aaatgaatac agtgtctact aagaaaagtt agaggtaagg cgggaacagc 123901 ttccatcttt atctttatta agctggaaaa tagaaagccg gggctccagt ccagagattg 123961 gatcatctct acgttcattt ttaggaagta gggtacgtct aaggtaaggt ctgaagccga 124021 tgttcattgg ccttccgcgc ctatgatttc ttgaattcct cgcaggtcaa gcggctgaac 124081 tctttttatc tttttttttg tgtctatccg gtcttccact gttcttgcct tcttatctta 124141 agcagtcgct aggctaatgg gtatggtcca atcaatggaa cattcttctg aactgaagac 124201 ggaagcaagc gcatttagta ataaatttct tatacaggat ttctttcact attagattca 124261 gtattagaaa ggtctgggtg 
atcgcagtat attcggacct gtagttagct cagcttgggc 124321 agcaccccat ttttaggtac catcaacagc aagaccatac aaatgtcatc gcttattctt 124381 tcaacatttg cggagcgtct cactgaactg ggtcaaagaa ctaccttcct ggagctacct 124441 ccttacttag atcgttcgtt ccccaattca aactcgggca aagttgtcca ggtgggaaaa 124501 tatctaccac ctgaggtaaa acataaagct gtagacctgc ttcgtgagta caaagacgtc 124561 taatccttcg cctgtgagac atctacttag ttatcggtac tcggacaccc aatcccatcc 124621 ccgaaagaca gaaggaaaga caaagattac cattaggcga aagcaggtcc acaaggtgaa 124681 gactctctat cgtcagatcg ttggcagtct agttctgctt tctttggaat cgaagtttaa 124741 ggcattcaga catgggattt tttaagtgca ttcagcatca gtatctgctt tctttcctga 124801 tgtaattgca gagaagatgt agacaaaact tacttaattt aattcaaacc ctgtgcgaag 124861 gaactcagga atcacttccc cttcattgaa aacaacataa gaaggagtgt tctaaggcca 124921 atagacagaa agtgagtgct tgatgtagca tatctgctgt tcactatcaa ccataggcag 124981 actaagcaag gctcgatgga attgaggttc gatgtaattg acctgcctta tgttggactt 125041 cccaacttac ctagcggaag aaaagggcta ggacctataa atataaatat gtataggtca 125101 cttccaccaa caactcaagc tatttcttac tgggattctc cacaagtact agaaagtatg 125161 ggatctctac aacttgttcc acaaactcaa actatcggtc aaggaagcat acgcctctaa 125221 ctcggttcat tccctaggta cccattcttg ggttcttaca aggtaggctc gtgcccgggt 125281 tcactcgtga aagaaaggtt atagcggtaa agtaaagaaa atcctccctt cttgcttcct 125341 tagggtacgt cccctatcgc ctttcattcg gaaaagctcg agtcattatc cttcctgtac 125401 ttcaagtgag agctagagcg cttctttctc aattacatcg acccttcgca ccttggaata 125461 ttgaatagac acttttgaat cccccctttc gattagtagg ggattcctta gctgagcatc 125521 aatcccattc tttgcggata aagtacacta cccgcttgcc tgtaaccttc tgcttgcttg 125581 actacagtca ctcctattag gtcacgaaca tccttaactt aagagagcag gaagagttta 125641 ctatcatagc agcacataca gagtgccgga actacaaatg gactaatgct ttctctgctt 125701 tccctcttgg tttcgttcat ggaagatgcg gcatagatag aaagagagca ttccagacta 125761 ttctctagca tccgagtctg ggcatgaatg gaagactcct agctccctcc taccacactt 125821 aagtatctct tctttagcgg gcgcttcata atcaaaagat tgttccatac catatgaccc 125881 aggctttgtc acgagctgga ttcgaaccag cacttccgga 
tgagttacat ccgacgaatt 125941 cccacaacta tagtgatttc ccggttcctc attccattca tttcagccgg attgtacggc 126001 gaaactcgct aatagataga gaaattttct cagggaagaa gaacatctat ctccggtctt 126061 gtggaacttc tgtcgtcccg ttcattgtgc ttcctcggcc ctgctagcca tttgcttgct 126121 cgaacaactg gaagtccccc caccccattc tttgcctcat cctggggagc ggtttgttgc 126181 cattgaaatc ccttgttgaa ggggctggct taccaaacca acttaccaaa tcgactctcc 126241 ctctactctc tacctttctt ccctacaaac tcttaacctc aatccagggc taaggggact 126301 aacaactggc ttcccatcac taaccagcta acctccttag aaaaactcta tctcctgttt 126361 caagtaacct taatctaggg ctaggggaac tagggacaga tctaactccc attcctccaa 126421 tccttaacca actaccaatt aaaagattca aagctcactg agaacagatt tctatagtta 126481 taatttcaat tactgacctt tctcaccctc gctcaattca ttgataattt tggctcaatt 126541 cttccattag ttaaatggga aagaaaggct aagtaaggaa gtccgtagag tagcggagct 126601 aatttcaggg agaaataaat cactgaaaca cagtattcgc accttacttg attcattcct 126661 tccacccgaa gcccctgctg cctgcctggg atgagcgtcc tctgatctaa ataagagctc 126721 ctgtttgctt tctggcaatg agctagctta accctgattg cgacaggctt aatacagtca 126781 tgttggatct cctattcgtt ctgctaccag gaagagaaag aaagactgat tgcgacagct 126841 caaccggatg agacatctct tatttacaga actgtgccaa ccgtgccttc ctaccaactc 126901 cgccaacccc agacggggat attgatttag ccgcacctga acccgcaact gctgcctcta 126961 ctccgcctac ttcttgtgcc gacttcgact gccttatttc gggtgaagag taaggggtgg 127021 taggcttagt agggctcgaa cctacaatat aaccgttatg agcggtacgt ttcaaccaat 127081 taaactataa gcccctacgg atctctacat gcaatttgct cacttcggga agagcggacg 127141 caagcaaagg ggaggccttt gctctatctg cttcttcctc ttcccaggaa tgcccaatta 127201 gataaaataa tgattttaat attgtttata gatagataat cttatatatc tattatttat 127261 aataataaga aaataaagca agcggtcctt tcgcgactaa aactgccggt acgctttccg 127321 gctcgcaacg actcaaaacg ggcggggggc tctctatttg cttgcaatgc cggctgttcg 127381 cgtttagtta aaagtagaag tagagggcgc ccttttcccc acacttaaaa aaagtataaa 127441 agtctaaagt acataaaaag taagaaagaa aaacttagtt acgggaacat ccaacagaac 127501 cttgtgaata attgtcgatg aaaataacat agtacttcaa accttgaaca gaggagaggt 127561 
aactggggcg ggtctcaata agtaagagtt catccgttct caaggaagta aagcttgaga 127621 acagagcgag ccttacagct tgaaaggaac ctgaaagatg gatcttgaaa ccttggggaa 127681 agtttcctac ggcgagctcg agcagaaggc gcagccgaag gcgcagccct acttctatcc 127741 aattccggca aggtaatcca attggaaaac cctgactgaa cgcgaggatg ttatgtcgac 127801 ttaatccgtg gttgaatatg ttagtatcta gattgagtat cataccattt tccaaagaaa 127861 gatagtatgc taaaatatta tgctaatatg gtcatttggt atgctcttat agggatagag 127921 cactatgtaa gaggtctcga cgagcctcca ccatactaag ctaaggtcta ctagctatag 127981 tagctagatg ggaatgtagc ctagcagcta gtagctctct tcttatagct cggtagttgg 128041 atctttcata attcctttat ggttagagca cactgggata gaagaataga tgagtttgta 128101 tctttcttcc tacttttttt ttatcggaaa acagagctag aacaaggaac aagcaacact 128161 agcacttaaa ataaggtagg cggctaaaac aattgggcat ttccttgatg tccggaaatg 128221 gtatatctga aaacagaaga aagactttct tttgaaccaa aactaaagat gatccgatcc 128281 gtgcaccgct tttcttctct accctttaga atccaaggaa aaggcaaaga gaaagaggtt 128341 gcagagggca cctaaccttc caaagacaca gcagctctag cctaacttcg gattcgaaac 128401 catacttccc tgacttcctc tcaacaagaa gttaaggctg aaagtcctaa tcttcttatc 128461 ccgatgcctg gagtagcggg tcttagttcg gattagtacc ttttgagcaa ggtaagttaa 128521 gtttcggcag tgaaagaaag taaccaggat ccgcttaacc agttatggat gaaggtaggt 128581 ctactccaat ccctcagcca ataaacaata aagtaggcac atttccctcg gcaaagaaag 128641 tagtccctcg ctaaagagta gacttgagta tcttcgaagt caaagccgag gaataagact 128701 cagttcgggt agagcaacca tttatatttt cattattctt acccgtttac ataaatgaaa 128761 aataaaagtg aattaccacg gaaaatatcc acctaaaaaa atttctgggg ttacttggca 128821 tagtcctcta aagaccctat ttagatactc tttcttccga gttcgagata aggaggaaga 128881 atggggcgac agataaaatt aaataaagag caaaaatgct tgactttcat tttattaaat 128941 ataatggtag gtagggcttc ttttaggtca taaagggaat agtactttta gctcgttcat 129001 ctaattctgc tggtctatag gagtggctcc cagggctatg tgatagcacc caaaatacga 129061 cggcctgatc aagagagcca gaagctgggt ccgcccgccg gagcttcttt caatttccgg 129121 ggggcttagc ttgaaccact cccaagtgcc aagaaccggc gaatgaaggc tccgtggcgg 129181 catcggagct gccaataagc 
taatccgttc ccagtgagct attacgcgct ctttcaaggg 129241 tggctgcttc taggcaaacc tccttggctg tctctgcacc cctacctcct ttataactga 129301 gcggccattt gggggcctta gctggtgatc cgggctgttt ccctctccta gagaaagaag 129361 ttctctagtc actaaagtgc gtttcactct cgttcctctc gtttcactcg ggttgctaaa 129421 gcacctctcc tctgccaagc tctctgtctt ggttaatggt tatccaaaag gatatttctc 129481 atgttaggcg gtgtgagaca gggggataaa ggatcccccg ttgttatttt gtattgcgga 129541 agatgtctta tcacggggtc ttaccgggct taatcacctg ggcattcttt cttcaattac 129601 cgagacctgt acatacaaag atctaggcag ctcatcttct tctaacgaag tcagatcttt 129661 ttccatacca taacatatat agaatcgatt ttcttttctg atcgctagcc tgccgggccg 129721 cccccgcgat caaactatca atctcataag agaagaaatc tctatgcccc ctttttcttg 129781 gttttctccc atgcttttgt tggtcaacaa ccaaccacaa ctttctatag ttcttcacta 129841 ctcctagaga cttgacggag tgaagctgtc tggagggaat cattttgttg aaatcaatta 129901 atctaatcat gcctcaactg gataaattca cttatttttc acaattcttc tggttatgcc 129961 ttttcttctt tactttctat attttcatat gcaatgatgg agatggagta cttgggatca 130021 gcagaattct aaaactacgg aaccaactgc tttcacaccg ggggaagacc atccggagca 130081 aggaccccaa cagtttggaa gatctcttga gaaagggttt tagcactggt gtatcctata 130141 tgtacgctag tttattcgaa gtatcccaat ggtgtaaggc cgtcgactta ttgggaaaaa 130201 ggaggaaaat cactttgatc tcttgtttcg gagaaataag tggctcacga ggaatggaaa 130261 gaaacatatt atataatata tcgaagtcct ctccttcaaa tactggaagg tggatcactt 130321 gtaggaattg taggaatgac ataatgctaa tccatgttgt acatggccaa ggaagcataa 130381 aatgattctt tcattctata gatacctctg gtaggtaaag cactcgactg tgctttattg 130441 aaagttccca tcgcgggggc gaggatactt gccttcgcgg ttcgactttc ttttcaggct 130501 tgactcattc gctagcttct cctccccttt agagctcttt atgatgccca ctgagtaaga 130561 ttcggggact tccggcgcag aagctcattc tgaaccgcgg gaaccttcgt ctcaatcttc 130621 ataatcaatt tatgccatct taaaaaaatt ggaatcttgt taggtacccg tacgacagct 130681 ctaataatga gctaacgtac tcatctttag agtggtttgt gcgcaggctc gagaagttgc 130741 tgcccgcttc agaagacctt agggtgcctt cgattcctgg acgactttgt tcgtcgtgta 130801 caggggacgg gaagagatgg ctgttcgctg tagcaaaatt 
caacgaaccg acgggcgtcc 130861 cggacatagc cttcaacccg caaaggttag ctttgttgct ggacaaccac tagggtacta 130921 ttcttcctgg cccctcttcg ccctatcaca tcacatggtg gtgtggtatg cggcagaaca 130981 tgtctatcct tcctccttct tctttcagtc gaaacttcct cctagtgagg tgttcgccta 131041 tccaggtatg gaagtattca atgaatacac tctgtaccat gcatgggtgg atgaagcttt 131101 atctggagta tcaaagatgg aattgtatgc taaggcttag tgccagtata gagcttgaag 131161 tctttatgga tgtgccgatc tttatccgta catattaccg acctgcgaac atggatgatt 131221 cgttcaggta cgggataata tttaggatgt acgatttggc tgtccagtta cagtggggca 131281 atgtcatgaa attgctacta gtaactaaga ctcaagtgga gaaaaaaaaa gcactcaatt 131341 tgaatgtaag taaaacacgg gatggaagag gaggcctgaa ccaacaggga tgaaacagta 131401 tagattcccc tgggcgaagc aagtcaccga ttagtaacct aagaaagagt tgtcaacggg 131461 cgagtgtcct gctataaggt aaactcctac tatttcagct tcttttcccg tggtcgaaag 131521 ctaaatatct caagatgata ttttccaaac tctcgacaag cagcaaatag aaaattaaaa 131581 gaagattttg aaggatctga tttcatctaa ggagtggcgg ttaacagaag gccatttcaa 131641 cttatgctct aaaaagatag aaagcattgt tgataaagcc ctctctttat ataaagaggg 131701 ctttgcgcct tcggataaag gaagattccg atattcgcat aggtattgaa gcggggatat 131761 gaacagctag cttcccctca aatgggagcc gtttaagtaa tcttaaaaga gtcctcgact 131821 gtattggaaa aacaaaaagt tcaatatggc gggagatcca taattttatt gagagctttc 131881 aatcaaatta gattgtaaag tagtagtcct gtgtaaaaaa aaaagctggt ggggcggggt 131941 ccaagcaagc gtaataaggg gagggggact aggggtggaa gggtcgtcga aggagatgca 132001 tttctggtac aagtggtatt ggacaagatc tcagggaatc atctctttca gatttctgcc 132061 tttctttccc atgacgacta ggaaaaggca aatcaaaaat tttacttcga attttggacc 132121 tcaacatcct gctgctcatg gtgtttcacg attagtattg gaaatgaacg gagaagtggt 132181 ggaacgtgcg gaaccacata ttggatcact ccagtgcggc acgaagccgc tgacgccgag 132241 tcggctccta tgccgctagc tatgccctgc ttggtccccc ggcacggtgg aggttccgta 132301 gcgggtcatg agcaccgggc taaggggcga agtcactcga ctgaaaagag aggggcggtt 132361 gagcaactca agcgaaccgc cctaccttac tacaacatag ggacagaggg ggagaaggtt 132421 gtgaaggtgg cctcgttatc cacacctccg gtcggatgaa tggaggaccg cccgacccgg 132481 
gtttcatgag cgttggcggg tcctggagtg cctgtcaagg gcgctagcgc ataccccggg 132541 gtgatcatca tcacctgcac ctcacatctc ggcgtagtgg aacgtgtaac ccgcctgctg 132601 tctcattcaa ctacatttgt tactgtaatc tatagcctaa cagaaggcag cgtcgagggg 132661 ctttaggaac tcgactgaaa ggagaggaga taaattccca tacagccagc ggggaggatg 132721 gcactacagg caaagaccgt ctggcgaaaa cgccgcaggc gcgaagcgtg gtaggcctgc 132781 gccgggtgag catagggggg gaaagggatc ccggacggtg taagagccag gggaggccgg 132841 gtcatttgac ggaaatggaa ggcttttccc ctcttataga aagccctatg aagttaaggg 132901 aagtgaatga attcttggaa aaagaaggag cgagcctata tataaaatgt aagaaagtca 132961 attattcaat gaatagatga taaagtcaac cgtacgacag acagcgctgc ctacacgcga 133021 attagcttcc aaggtcgagc agtctcaatt tcactacagg atttgcgaat gaatgctggg 133081 ctgggccacc tcgaatggcg tgagcgcatg cggggagacc cgcacgtacg gtttttaggg 133141 ggatctggcc gaaagaccgg ccggcgccca cccgactaga gggactgaga aattaataga 133201 gtacaaaact tatcttcaag ctttacctta ttctgatcgt tcagagggcg atcgcggagt 133261 cactgaatga agtcctccgt ttctttcgga ggtgctgacc cgcagcgagg cagagatgac 133321 taagtgacat atggaatatg acgacaacaa cagcatgtcg tagaaggaga gaacaggtgg 133381 agctaacgac ccacgttgac taacgtatct acaactacat ccccgagcgg cagtcaaacg 133441 gaggcgtgaa tgcaagatgc cagcggaatg atcggccgga cagaggctag ggctgcttcc 133501 ttcccaccgc gtccttcctt gtgtatcgga gatataaagc gagtgcaccg gaaaagaacg 133561 ggaactgggt cgatctattg cgaagcatcc gaagcataac tgcacactca cacgatcttt 133621 gccgagagat aggagcattc ggtggaaccg gtgaactaca cttgcttctg gatagatgtg 133681 tgggacagag ggctcgtggt accttctgcc cacccttcct cctctgcttt gagaactgtg 133741 tgaacggaga gtgggcagaa gggaaggagg tcctcataca gagaaaatca tggaatgggt 133801 cgagatagat gacagcgcct ttttcctctc ttccttgccg ggagggcaat cttctcttat 133861 ggtcttcacc tcccgcccgg cctggaaatt gaatccagcc cccttctttc tgatccattc 133921 atttctgcaa gcccagagcg ttgcctccct tctattgcat aacctaaaaa gctataagca 133981 aagtaacaaa agcgcgctcc gcccggtgac taagaaagag gtttgcgcaa caattaaagt 134041 gataaggtcg agggaagtag ggctcctatt gaaaggcttt ccctccctca aaagaagact 134101 agctttcaat actagttctt 
acgttacgct gccatttttc caatattatt gaatagcatg 134161 gcctggggct aaagtaactc aagtgggaga gccgtgttat gggtgacctt attgcacggt 134221 tcagagagca cttgtgtatg tgatgcaagt gaacgtgtac gaaaaagctg tcgtaaagtt 134281 tcgtttttcg ttccgttttc gaccctatct atgtttctat gatggcccaa gaacacgctc 134341 attcttcagc tgtagagaaa cttttgaatt gcgaggtacc attacgagct caatatatac 134401 gagtgttatt ccgtgaaata actcgaattt caaatcattc acttgcttta actactcatg 134461 ctatggatgt gggagcatta actccgttcc tgtgggcttt tgaggagcgg gagaaattgt 134521 tggaattcta tgaaagagtc tcgggagcca ggatgcatgc cagtttcata cgaccaggtg 134581 gagtggcaca agatctgcct cttggcttat gtcgagatat tgattccttc acacaacaat 134641 ttgcttctcg tatcgatgaa ttagaagaga tgtcaaccgg caaccgtatc tggaaacaac 134701 gattagtgga tattggtact gtcactgcac agcaagcaaa ggattgggga ttcagtggtg 134761 taatgttaag aggtcgtgcg acatgaagac attgatagca atatggggga agttcccatc 134821 aggcaacaat ggttccgcct gactctactt aagcatgcat attatgtaag tgaagacttg 134881 gtgtgaagcc ttggagctta cgttagaaga gcaaaaggcc cggggctagg gtgagctggg 134941 gggggacagc gtaagtgagc gaatgtgtgt aagcccagtc aaagatgact gttctaagcg 135001 gggggagcca cccacctttg aatggtgttg gtcctacgga ccgtgaacgg atttcgcctc 135061 tggcctctgg gcacgtcgga accgcgtgag ttcaccgggg tggagcacgg tccgccaaaa 135121 ccggcataga ttaggtgcta ttgatggaac atggtaagcc tatctttctc catatggaag 135181 tgctgcgagc acttagagat gcgggtagag gaagcctcaa aaagcgaagg ccgagctgta 135241 ggtcacgtga cctgcaccga gttggtggct gactgggctt tttccttgat caaagcagat 135301 caactcgcct tctttcttgt tacccaaaac taaagttggt cgaatggttt ttttcctgcc 135361 ccggaacgtc gaatgaaata gggggccggg ttctctttct acaacccttt tgatatgata 135421 ggcctggcta cctttcccct atcccttatg attagggggc tgttaagccc aagaacaacc 135481 agtcttgtgg tggtacggaa ggaacggact ccgcgaacgt cccgcgcccc ggaaagaaag 135541 tctcaaccag aaccacattc cttttgcgtg cggatgtagc taagtgtctg actctattgg 135601 tcatagtttc ctgctgttgc ggctggtgct cgtttgcgcg cgcgtgaacc aactcaacaa 135661 agaaggaaag gatgcccggg gaggcatctg agaatgattc gagccgtatg aagggaaact 135721 ctcacgtaca gtttgttttt tttggggggg gcaggagccc 
gacagggtcc cccactgact 135781 tggcccgggc ctaagttaaa gtgaagtggt gggcctaccc atcccaacca ggggtatgct 135841 gggattcgcg aagagcagca ccttacgatg ttcatgacca atcggatctt gacgtaccag 135901 taggtaccag aggagatcgc tatgatcgtt actgtatccg tatcgaagag atgcgacaaa 135961 gtcttcggat cattgtgcaa tgtcttaatc aaatgcctag cggcatgatc aaagccgatg 136021 atcgtaagct atgtcctcca tcacgatgtc gaatgaaact atccatggaa tcgtgcgtcg 136081 tgtgaaacgt agatcatcgc cgttcttaac caagactcag gttaagctcc gtctcggaac 136141 cttgtgggtt aggagtaaag catcccgggg ttggcgcatc tcattgggcg tagagaagca 136201 ttgggaaccc caatttattt cttcggagcc gtttcttttc ccgtcccccc caccccggca 136261 tagcgcttcg cttccggttc ttcggaagaa tcaacttact tctaccttct tcattgatct 136321 gggggaaaag gaaccgtcta ccagttggga agctagacat caagtaagtg gcttgatgag 136381 gataactaag ctgacacgcc ggagttggct gctggcacaa cagggtggtg ccttaccgca 136441 ccgcaggcga acgcgcggta gcgttcgtgg tggtgcttca ggattccaat gtactgcgtc 136501 caagatcaga acgagcttgc cggcggacca ctgccgtccc attcttgagt gagctggagc 136561 gcagccatct tatccactga actagctaga agctatcgct tcgggtcgaa gcactaaaag 136621 aaaagcaccg ggaaacgcgg cggcatagga accacgggac ccccacccta ctagtaaagg 136681 gaaaacggaa gtgcgctcct gcgcaccagc tgaaaaaagc cctttcccct ttctctgata 136741 ataaggaaag cttcttagct ccaacctata caaggggttt ttatgtcctt tttataggtt 136801 gggttgttgg atacgggatc ctcgtagtag gctggaccaa catccagccg agagagggca 136861 gcctctagaa gcaacaggtt gggaaaccaa gagaacgctt cgccttttct tatcttcttt 136921 ctgccctagg agtagaagta gcacaaaaag agggattcgc attattgacc caatgataaa 136981 ccactaacac cttcctcgtt ggggctccgc gcactgggaa aacgcttgac gcgattggga 137041 aaccggccac tagttacaaa gctccaataa ggtatcgaga gggctatcac agtcaggtgc 137101 gaagaattac ccctatttgg aaagtaccct tcttcctatt tagggggtta aggcgagaaa 137161 tggcttgatg aatcgttccg ttcgccatgc accggcccca ttcacttgct tatcgtagag 137221 gctgtaagta cacagtgccc cacaactatc aatagtatag tggggttgaa agacgagagt 137281 gcccgccctt tcttttcttt caagtgggcc acttttttcc cgaacgcagt ccgggatcac 137341 cgtggccgtg tatatatata tatatatctt cgatgctgtc atttcgaaat gtccgcttca 137401 
atcgctcttt cacctccccc caaaaaaagc aaagttggct taacgagcgc agatgtgagg 137461 aagcgggagc aataaaacaa aaaaatatct ttcttgtcct tctacttaag gggcaaagag 137521 aagcgctttt gctactgaga aagcgaacgg tcagcgcgaa ggttcaagac ttttctgagc 137581 gttagcgaag ctagattctc atagcgaggc gcttcgagtt agcgaagcct gtagtagcgc 137641 cgaagcccta tgtgctataa tgctgagcca aggacactcc gccttattct tataagaagc 137701 agtcaactga gttctgaacg aattagatcc ttggtaaatg gctcaatcta tagatagaaa 137761 gccttatgat gggaaactac cacgttaggt ttggagagag atgggaccgg ttatataata 137821 gagggagcag atgcaagctt ttttctttca atagccggcc aaatgactac aggatcatcg 137881 gtctactcta cctcaattca ccatttcgaa ctttatacag aaggtttttc cgtaccagct 137941 tcttctacct ataccgcagt tgaagcacct aaaggagaat ttggtgtctt tctggtcagt 138001 aatggaagca atcgtcccta ccgtcgtaaa ataagagcac ccggctctgc ccattcacaa 138061 ggactcgatt ctatgtccaa acatcacatg ccagcagatg tggtcaccat cataggtact 138121 caagatattg tgtttggaga ggtggataga taggacgact agttgctcga tcaggacctt 138181 agctttattg cgagcccaga agtctctctt tttttcggcc ttcaggaaca gcctttaagt 138241 aaaatccaac ctaatataat gaatatcctt ctacatagaa gaaagacact ctaagatcct 138301 ttttcaaacc tgctcccatt tcgagtcaag agatagataa atagacacgt cccattgcac 138361 tgatcggggg cgttcgttgt atgttgaagc agagatgaat agggtggctg tgaagagagt 138421 gatggttgat cctgactcca cagtcaatct catacctatg tccactaggc attccaaaag 138481 aaaagattgg aggagcactt ctcgaggtat ataatttgag ttcgaattgt caaatcaatt 138541 ttctaggaat ggtatggtca gagtcaattg caacgttgga ccttttcaaa gtccgataga 138601 gttccaggtt gtttttgcac cgacgacaca ttatgctctt ctgggcagac tctagattca 138661 taaagaccaa gcggtcccat cgacatacca tcaatgtata aaaggtataa tcaaaggaaa 138721 ataagtactc gtaccagcgg tgagcactcc cttcgaaaga tctgaaatcc atttttgctg 138781 acgctataca ttattcggag tttgccgagg atggagagct ttcagggtcc agggagctgt 138841 tggcttactc gttgggagga cttattaatt aaggaagatt cagttcgtat ctatttatag 138901 aaaagagacc accggtatgc aaaagttgag aaaagccgtc tttagcctgt cactaagttg 138961 ccaatcgaca tgatctattt gctcacattc caggcgaagg tcataattcg ctgtttccta 139021 caggtactgg gacggaaaca 
tcatattccc gaaaaaaatg cgggacttcg gtagcgagga 139081 gggccctcac agcaaagccc attgccaagg ccagaccccc agccgatctt gatgatcaag 139141 aaagaaagat ggctcgactc cctactaggt ataccccaac cttagatagt agggccataa 139201 aaggtttccc taggaaaaaa gaaagagaat caaatatagc gagaacacat aaaagaatcg 139261 agattctgat gagaatgcaa ccactgcgat tagaaagggt tccgcgattg taattcacag 139321 tcatttcttc aagatgtttc tgccattccc aacaaatcat ctttagaaag ctgcagattt 139381 tccattcttt ttaatgcctt aaatacggat tttttataat catctgtctc agtactcatc 139441 tttttactta tttacttatc cttcctgaga cttttttcag tttatcgtag aaattagtcc 139501 acaaaactga taagaacgac acgagtttaa aaatttcgat tacgctgggc catgcctggc 139561 actttcattt tattgagtca taagattaat ttaatcaatt gcctcttatt ttaggcaaaa 139621 aaaacaatcc catttgttct ccgcagtcct tctgtgtgta gaacattggc cagctccatc 139681 taccaatggg tgcacctgac tgtctccagt tcagttagaa cagaaggccg ccagcttaca 139741 tagtatataa ttgtagcggg gggagctgac tgttgcctcc ctagttgaca aatgagccaa 139801 atataattta atcctatgtg ctctacatat tttctattta tttcttatct atcccattta 139861 gcgataagat actcgggtta tgtttggtga actggggccg tgttttatat cctccttccc 139921 cacgggaaaa cgtttgcaga tgcgatcatg aatcgaacca gatcgaaaca ttcagctgtc 139981 gacggacaaa ccttgccgaa acgtcgaccg caaacgaagg atggccaggc cccggttccc 140041 agagttatag tattccctat tcctcgccta ctcaaataag aactagttga ttctctttcg 140101 cctatcggcc ggccggcttg ttgcaacctt cgcatttcct gctgagatcc taagtctcca 140161 agtgggccct cttggccacc cgcgaccttg gctttttaaa gaatcccgct cctgagtcgt 140221 ggaacttgta gctcggcagt ccaccgggtg ggtttgttta tccttccagt ttcgagtgtc 140281 ttcttggata gttatagcgg cccataggcg cgagatgtac cttgtggggg ggggggcggc 140341 ggtcccctgg acatagtcgt ttcaggcagt ggccgtttag tccatggtcc attagatggg 140401 aggtgcaagg ccagaaaatt gaacacattg attccgctcg ttcccgtcct tcgcttcagg 140461 gcctgcccct cggtgtggtc agtactccat actgtcgggc agcgaagctt acacttgttc 140521 actaattatg acggttcgcc agggcctctt tcctcctccc ttttctgctc actcgtaggg 140581 gtccggaccc ccacaaaggg ggagggagtc gactgaacat ctcagccatt ggcggaaatt 140641 tcgcccgcat ccgatgtacg atcgtgtcgg gtgagcaaca 
gccgcttcgt cacagtactt 140701 acttatgggc taacgggtca cactttggcc aagtatccta caaagagact cccgagagcc 140761 agaagtatta aaggaatggc cataggaatg ggcgcatcat gacatcgtga gatgtctcgc 140821 ccgaacgaat tagttggtac tagaaatgtt agaaaaagta aacgaaaaga gtaataagaa 140881 gtgaaaagga cagagacact tcccaaccag aaagcaaagt tcccactgat ggtatactta 140941 gtgtaagcga gctctaagat cacatctttg gaataaaatc cagttagaaa aggaaatcca 141001 attagagata agctgcctat gagcatcatg gcataggtca aagggaagga ggaggcaagc 141061 ccccccatct tccgcatatc ttgctcatcc gacatggcat gaatcaccga accagcactc 141121 aggaatagta atgctttgaa aaaggcgtga ttcattaagt gaaagacgct aaccgaatag 141181 ttagagatgc cgcaagcaaa gatcatatag cctaattgac tacaagttga ataagcgatg 141241 accctcttta gatcgttctg taatattcca gtggttgccg caaggaatga cgtcgtagct 141301 cctgcagaag taataacaat caaagccgta ggtgggtatt caaataaagg ggagcacctt 141361 gctatcatga aaacgccagc tgttaccata gtagctgcat gaatcgaagc ggatactgga 141421 gtgggaccct ccatagcatc gggtgaccaa gtatgcgatc ctatctgcgc ggatttccca 141481 acagcaccaa taagaagtaa aatacaaata agacttatgg cattcaatct catattgcaa 141541 gaaatccaag aatttctggg ggcactagca cgagcaaaaa tggttgaaaa gtctactgtt 141601 tgaaagagag taaaacgacc cgaaatccca agagctaatc caaaatcacc tactcgattg 141661 acaagcatag cttttgtagc tgctttatct gcctgaagtc gtgtaaacca aaaatgaatt 141721 aacaaatatg aagcaagacc tactccctcc catcccagga ataattgaag agagttatct 141781 ccagtcacca acattggcat aaaaaaagta agaatggata aataacacat aaatcgaggg 141841 ctatgcggat cctcggacat atatgaaatg gaataaagat ggaccaagct acttatggat 141901 gtaaccacaa ttaacatcac tacggtcggg ctatcgaaca cagagtcaga agtgaattac 141961 gagtcggacc tatttgcgaa tcgagcgagc tccccttgca tgcaatgatg tggtggtgaa 142021 cctctcattc taattcagtg ctctccgaac cgtgcgggaa ggtttcccat cacacggctc 142081 accaacttga tcttccggcg ggaaccgtat gtccaaacag gcctagaaaa acaggtacga 142141 ttgccactaa agtgtacgaa aaaatccgtg cttaggcccc ttcttccctt ccaagagtca 142201 ccaccgcctt agtagtctca caaatagggc gtgcaggcct ccctattttt tgagtaggta 142261 attcactacc gaagcgcgaa gaaaaggctg gatcaagaaa agggggtact acgagccctc 142321 
tgccccacgc atctaaccag ctcgcgtggt tcaccggttc caccgactag accaaaaaga 142381 gttattcagt cgatacagag gtgcgcttga agcggggggt gtgctgtccc tattggctgg 142441 gcccttcccc ataaggcccc accgtcgggg cataagcgcc ctcttgctac ccatatgcag 142501 ggcgccgtct tagccttccc tgaccaggat cgctcccaca cctgtagcgt tcgtgatcgg 142561 cctactcaac tgtgtatcga ttgaaaggca ggaatttccc acaaccaaag ggagtggtta 142621 cgtccagtat gtcccccctt attcccgaca tgctatggtg ccccggggtg ggtaggagcg 142681 ggtcgagtcc gtatcgccgc ggagcaacag ccgcgtccgg atcagatcta tctactcggc 142741 aattcatccg gtgacttcac ggtcgccaaa gaagccccaa gaagcatcaa acatttccga 142801 tgagatccat ggagcaattc ttagatagca agcactagct cccggtgcga cttcataaaa 142861 agcaatcaaa gataagatcg aagagaatga aacgcacgta gtggtcatta tagcgcttcc 142921 ttctgatcct agaaaacgtc cgaaaaaacc tgctacggaa ctaccgagca ggggcaaaaa 142981 tacgataagt agatacataa tttaatttcg agtgtgatca aaaacctaaa atcaagacaa 143041 tgacagagcg gccagtgatt gagtgataga tttctcgacg tccggagaac gctcgaccga 143101 agaaatgagt aactaacaag gaagattttg ttccccaact tgcttgatta acgggcattt 143161 tcggggacta gcccgcttcc cattactcca agagggcaat tcctcgcaca taataaaatt 143221 aaggaagcca ttgaaaggtg actaaaagac cagaaacagg gactacccga gctaatgata 143281 gaggcaagaa cactttccgg ccaagtccca ttaattgatc ataacgatat cgtggaaatg 143341 ctgcacggac ccatatatat aggaacagaa agagaagaac cttgatacta aaccagatcg 143401 agcccgggat cttcttgaaa atgggaagat ctaggatagg cggccaacct cctggaaaga 143461 acaatgtgca tggaccaggt gagtagggat gcagctccgt ggaccgctcg tcgggcctga 143521 taggtggtgg tatcacaccc ttctcaaaga aaccgtacgt gacactctcg cgtcatacgg 143581 ctccgccccg gaatcatttt tatacctctc cttttagtcg agtttgtttc acaacctctc 143641 ccaacaagtg gtcgagttcc tttgtacctt tcctttgacc aaggggtcct cgaaccaacc 143701 tgtccttcct ttctattcct agtcaggcgt tttcattcat tcattgattg attcaaggta 143761 cgaagtgaca agtccaagcg atagcgtaaa agctagtcgc cggaagcgaa cttccgggcc 143821 gggaaggggc caaaaaacgt gagcgccccc cgcaatcttt ctaaagaaac aagctagaaa 143881 ctttactttg agaaatacct tattatatat atataattag taaaggcgtg ttagcctatc 143941 tagtagtaag gggccttttc 
ttgatcgtta gcgctttact aataacaagg acttttctcg 144001 cttgtttagt attgctttgg cttcgctgtc cgtatcttgc tggcgcggaa gctaccgcaa 144061 ctaaaagaaa atgaatgaag gaagaaggca ttagaaagac taccgaggca ttccgggccg 144121 actacaatac aagtcatgag cgatagcgaa gccaagccta tccggctttt ttatgtcaaa 144181 agccctaccc caaaactagc tctatcttat agcagacaac tcaggcaagt ctactcaact 144241 aatctcataa gtaaacgcct gttcgcatcg caactaatag aaaaaaacga ctactagact 144301 agactagtag ttgagtgttc cttcttgttc aggtcttgac cgggtccgag cttcccaagc 144361 tctatgctgt tggggaactc tgcaaggatc ttgccacctt cttgatttac aatatttgag 144421 tctttggagt acttgaggat tatattcggc gccgaggatt tgtgcttgtg ggctagggtg 144481 aatattgcag accagcgaat ctggtggtcg acaatcgttc ggacttgata aaggttgtcg 144541 cggcacctgt agtaggacag aggacttatc gcgatgcccg cggaccaatt tacgatgtct 144601 tcgtcgctga cgttcgtcaa acaggccacg tggattggcc agggtcttct tcggctaatg 144661 atacctcgat cccgaagcct tcggagtatc ttttttatag gtgcctctat ctttatgggg 144721 aattcgctgc tgatagatcc cgcccagtgt cctcctccct cccccgccgc cttccgaccc 144781 gggggagtat acaatgacaa cttacgggca tgagtgcccg atcgtgagac tacctgttga 144841 acgtccgatg acgcgttgct ccgacctgag ctatgcaaga acgagatccc ccttgatcct 144901 tgccggatgt gcttgacggt cccccataat acgctaactt ggggacttct gactccagct 144961 gttccaagag tctccgctag ttgaaccccg tcctgtagac tccctgtttt gctcatcccc 145021 ttcgtcagct gtttgatcgg gatactatca cctaggttcc taaactttga atggatggcg 145081 gagcgtaggt ggcaagcagt tatatggata cggtgcttta cccgtagacg cttttccagc 145141 tctcgcaaaa attgtatggg agtcgtcctc ggagggactt cccgaatgac cgtaccaagg 145201 aattctaccg tactccgtgc agctattgtt gttgatcctg cggagcctac ccaaaggttc 145261 aggccagatt gtaggaaatg ggcgatacgt ttttgtattt ctatgagaag ctctacggca 145321 cccacgattc ccagtagtaa gtcgtcggca tatcgcgcgt aacaaatcct tattaagtaa 145381 tgggttttta agggggcctg cttacgggcc aggcctctct ctgacctgat aactagtatc 145441 gcctccccgc ccagctctat cagcaggccc tttcttttgc aatacttaag aaggtctctc 145501 atggcccaat tattattaca gcgttctcca ccatagaatt cagccttcgg ggtcaacccg 145561 gcggcttcta tgaggaaggc ggcgcaaaga aggctcgagg 
gcttgttaag gaagacggca 145621 agggccgacg aaggggggaa aacgaaaggc cttttctggt cccccctgag ccggagggtg 145681 cttgtggggg gggtgtgcca cgacgaaaca agggaatgaa aggccgcttt gcgttgcatg 145741 ctcttaacgc tccccacaat gatggctctg ttgtcttggg gagcgttgaa gcttgcttct 145801 tctccagggt tttcttggtc atcaatacga cgacctgtcc ttaatagaac cgatctgact 145861 ctctgaacaa tcggaatttc gtacttctgt cggatcctcc ctatctcctg atcgagcttg 145921 tgtaggtaga tgttgcctgg tagggccgat agtagtacac tgtgtgggac ggagtaaggg 145981 cccctctcaa ctcctacgag tcgtccggcg gaaaatactt tctgaatgga gtaaaagaac 146041 ttgggatcgt cgatctcttc cttcaaaatt tggatgagtc gatgtcggtc gatggtgtga 146101 aaacacttcc tgatgtcgaa ttctaaaaac cagcgagaga ttccccactc ttctttgatc 146161 cgtcttagga ccgagtggca gccttgaccc gagcggaaat gcgatgtgtc tggaaactcg 146221 ggatcgtaaa tggattcgag taccattctg atcgcctctt tcatgatctt ttctataggt 146281 agaactactg tgagcggtct aaacttcaac ccttctttct ttcttcatat tgtaaagggg 146341 ggaaaagctc gttttttgct cccctttttt ataatcaaag gccccctcct gccgtcgttt 146401 cagtgactca tagggcttcc ctcagttcag tctttttggt tcttgaaaat ggtcgccacc 146461 tctcctcttc tttcagtcga tggggttcct ctccttttag tcgagtaatc aatacctctc 146521 ccaggaccgg aatgattatc cctgcccgat gggtctacat ccatccctga atgtcgtcgg 146581 gtactgttca cttccccgca ttcttgtaac cgtcacctgt aatccgcgcg ggtgtgtccg 146641 cacccccctg agtggacgag aaagaaagga tttctcggag caacccccca ggttccagac 146701 ccaggagtca actttcccgt atgagcattc ggtacatgta tcagtccgtg ggaagaggtg 146761 aaagggtcac cactactgag gatctccccc ccccctaatc ttagataggt cgtctgaggg 146821 ttcgccgcgg ttcattgctg tgcttacaca caaggctacc cttctccgaa agctacgcgg 146881 gaccacctac cactagtctt cggccggagg ggtttattgc acaaaaacgg ccgggacgca 146941 ggctcccgaa gagggaagcc caacgaatgt cagatgcaaa gttccgcacc tcattaagat 147001 catattggca tactctccca aaaaaaaaag agcagacccc attgaagacg agagtaaggt 147061 acaacaaggc catttctgtc caccgccctt ctcacggaac cgtacgtgga cgttaccgct 147121 catacagctc ccagccagca agcacagtta gccttcctct acaagaaatg gaaatgtgga 147181 tgaatcgaat gaaatcgagg aattcggttt ttcttttcag acatgataag taagagcatc 147241 
cctttcacaa aaactacgga tccccctcct atcctgccac ttcagcacct tgtgatcttt 147301 gagaagatca ttacgagccc tttcctagaa tgtttttgta gattccgaaa attccagagt 147361 ggatcaggac gagaatgaat ccgggaatcc agggcatact catcctggag cttccgctct 147421 cctctgggaa gggctttttt atggatagag agatgagtcg agggattgat tactcgacta 147481 aaaggagagg gtagcctccc gtttgaccga tttctcggag tcggaagaag atctctcatc 147541 tgatgaccct gaacaagaag gagctgctct cgatgtgaga tcagcagcaa aagaaagaag 147601 aaaagggcca tgatgatgat cctatgttcg ttttcgccct gcccctatga tgcgctccta 147661 gctcgcagag cataagtcag cgcaaaggag agatacatcc ttttaagact gtgaagagaa 147721 tccttggttt gacctacgga ctacgtggga aatacgagat ctaggcaagc cgctacatgt 147781 tctccccttg cccttgaacg atagaagaac tacttatctt ctcttagata gcagtcgatc 147841 ggttctatga tctatggtca atcaataggc ccacggatct attcttctat acgataaatt 147901 gccatctcgt agcaccacag cctgggggta agactagaag aggagatttg cttggtcttc 147961 tgtgcggaat ctgcttccta atcagtgtag ctcttgggtc agctcacagc taggacgtca 148021 tcagttctaa aggattcctc actagggaga ctctctcctt atgggcaagg gaggcagatt 148081 ttgacgggga gaaggattcg gaatatagtt acataagtcc ccttccctct cctcaatata 148141 gtttcactcg agtcacgtca gtacctctcc ttttagtcgc tcgtcgagac ctcatcaact 148201 agttaatacc tccggacctc tcctttatca gtcgagtttg tgttcacaac ctctcgagaa 148261 acaagttcga gagttacgta ctttcaatgc tctctctatt ccttgctaat caaggcctgc 148321 aagggaactg gttacgacta taggcgggac gacatgcttt tagcttccct gaaataggcg 148381 acttgctcgc tttggcagga agtatttttt ttgaagaatt gaacaaaaaa aactgaaatt 148441 tctgtaaatc attaaaacgg gactttcttt aaagaaaaat ggtctctcag gctggttcaa 148501 agccctattt ttagataata ggaaaaatcc catttcattc gggttttcta ttctgtcctt 148561 atccccactg agggaagtag ctaacctaac ttcatattat ggattagtaa ggatccactg 148621 ggtgggagga atcaaactca acgttgcgag ctcaaaaagt gatcctggaa ggcccacggg 148681 taagcaaaga ttctaattcg agattggcag agaaacggcc cgtgcgagca cgaaccaaaa 148741 gaaggtgcca agccgaagtg tacaaatctc ataggtctat atctgctcag tctctgttag 148801 cagcttcagt aggattctgg tcttttcttt ctttcctgcg agtgttgaag tggggaagtc 148861 cggatcaatc cgtcgaaaga 
gacgccatct ctcagtctaa atggaagtag accaagtagt 148921 ttcataggaa gaaatgtgaa agtaggggct gctaagcgcc cagacccatg ggaggtgagg 148981 tgagatagtc aatgaatggg gtggtagatt ctggcgcagt gtcaattgag aaagcgattg 149041 tgactctgac cttcttatcg gctgctggtc ttgaagtctc gctaactact tggatacctt 149101 ctctcattcc tcgggaacga aaaagcgaga gtacattctc ctttttagtg catttcagct 149161 gcgtcagcag tttattcttt cttattccct ttcccgtgtt cattttaaag tggtggatag 149221 gatgaactta ctggccgttg gtccttctct tctctttctt attagtaatc ttgttcaagt 149281 gagagggact gacatcttct tttcatttct ttcatatctc ccttgattct gtttcagttt 149341 gaagtaatag ggaggcttgc agaccttaca attgtctttg ttaaatactt ttatgaccga 149401 ctgacttgaa tatgcttgcc attggtcctt atcttatctt taagataatg gtaagagtgg 149461 taatgtaggg ctaggggtat agaatggaga gcctttgtag gccgataggt tgattgagtc 149521 agtacacttg acgtatgctt ctagaaagaa aaagaatgag attcagctct gatcaccacc 149581 accggatgca tttttcttga tcttctcgga ctccagtaaa gtttatgcgt tgggcctaga 149641 aagcggtaaa cgtcagcaat gagtgcttcg attgtattaa attcttcttc tcgattgggg 149701 aaagtccgtt acatgctaag acgcaccaga tctatgatct ctattttcta cagaatcgtc 149761 tagcgagaaa actctttcac taatttttct tcttggcact ttcggtcttc tgcagaatag 149821 tagcattgca tcgtcaattc acaattgatg tggggagagc ttgttcaaag aactcgagtc 149881 aacacgccac tctgcacctt gtgtgagcta gacgaagaaa gcacaaagta cttaccctac 149941 atatgcagga aagaaagtta actatggata ccatagctac tcgggcactt ggcgtcattc 150001 tcacccctaa tgcttctttc tattcggatt ctatttgaag gtttttctcc ggattatatt 150061 ttttttataa aatgaaaaaa gggggctttg ttccctcgta ctaatagctg aacagttgta 150121 gaattaatgt ggtcagcata acacaagtac tcttcgcccg ttgtggaatc atgcaaaaaa 150181 tagacgaaaa aagacgggtt ttgtcaactt tttgacactg tttagacatc tatctaacag 150241 ccagaaagat aaagcagctt tgataactaa gaaagcagtc tatgggatcc tttttagcaa 150301 gaatagcaat tggattagga tatagatggc atgtctcatt agccggcact ggctttcgat 150361 tcggggaaaa actagactaa tccgtatagg agaagtttaa gtggaagcag tggaggcaga 150421 agggcgaaaa tccccaccac tctccctgaa agtagctaat tcaaagcaga acgagaacag 150481 atggcttgca cgctctacct tttcatctct tgcaacatct 
catctgcaac tatcacttat 150541 cgatttcttg cacctatatc ttatgttggg tacagaaagg gatcaagaat agggtggatt 150601 caccaaaggg tattgacacc ccagggcaaa tcacacctcg acctgccttg ggcaaaaaag 150661 tctccgtgcg gatcacctag gggcaactcg taccggcttc taacaagtcg aagaagcact 150721 ctttccttca gtcaacccat ctccttcggt ttagtggaaa gaaaactctc ctagctgagt 150781 tgatgtaccg atatgaaatc tcctgcccct tcaacctaag gagtccggca gtacccgtct 150841 catcaaaagc cagcagtacc tctttcatca aaacaaaggc cctcagaata agcgaagtaa 150901 accgtgagtt aagcgtccct cgcgtttacc gtgagaagag tttcacgcgg cggctaaacg 150961 cacctatttt tggcagccta ttcgtagaca aagaaagccg attcgctaat ccttattcct 151021 ttactttgaa tcaaggtctt acccgaggaa gagggaagca agcaaagctt gccccggatc 151081 gtaggggaaa gagtgttgta accgaagtag actaccgaac gggtgtgggg gagaatatcg 151141 tcaaagattg agctaatccg aatccctctc acaacaagag ctaaccgtag cagtctaagg 151201 ccgagatgaa tcaattcgaa agctgttctt cacgtttatg atctgaaaga cccggacaga 151261 agagtcatac tactacggaa agggaagtga aagcaagcct gtaagggagg gggagggcaa 151321 acttccgaca tttcactgat ttctccttat ctttaggaca ttgctttact tgagggcatg 151381 ctcgtgtacc atacgctttc atcgatggaa gcacaacaga aggtgcttca atctttccta 151441 tgggtattct gatgggcggg gctcgggatt tgatagtaag catgaatcta aggagatctt 151501 gcagagtagt cagctggatc tcgctcatgt ctctatagca cctgactatg tcgcctatat 151561 ttagattggc tttattctct ctgaaattag cagttatatc atccaccaaa gatatggagg 151621 ttagaaacgc aattgccctg tgataatggt gttcttccac accaaagctc ccactgcaaa 151681 agcggtaacc ccatttacca aaccatgaat gaccataagc aactccatgc aatagcctaa 151741 gatccattga cctcttttgt gattcatcct ccagagtgat tttcctacaa aagcaacatt 151801 aaaatcgctc atcataagag ataaaccgaa cattaatcaa gttaacccaa acaaaccgac 151861 aagttatcaa atatcgtcag atttcaagcc aacccaaatt ggtttctcgc aagaaggata 151921 aacaggtcta tccgatttgt ttattttcca tcgaattact aacctagcat cgtttttttg 151981 gggtaaaacc taacttcgag ttttagcaat tttttagggc aaaacttacc gagtatgcag 152041 gttaacacaa aaacgatccc aaagatccat aatctctctc ccatgaaggt gcttggaccc 152101 accttcaatc cccttcgagc aagtgaccga acccgttcaa atggaccatt ccatgcaggt 152161 
ccgggtcagt ggatctgtgc cttacccgcg ggggctttct cattgagtca tttccgtagt 152221 cagggaggga tcgatatctt ggtccatggg cccattctca attagtcttc ctggactatg 152281 tttcttaccc acccgaacca tccctgctag cataaaaaca agttcgggca gtgagtcaca 152341 tgttggaatg tcaggatgtg gaattttttt attgagatta gattagtgat taattagtca 152401 cacctaccgt aacctatatt ggtaaggtta agcagctgac ttagcccccc gtcttcttaa 152461 ggaggtcttt ctcacaggac ggagataggt tatgcaatta tgacttattc aagaggtcct 152521 cccttcccta ctggagcgct aactcctatt tttttgtaga gaatccttgt gtagtgtatt 152581 ctactggtat agaatctttg tgcgctgact cctacgaata taccgaacta gtccatccat 152641 tttgattgat tcactgttcc tcgtccaatc cctctcgtat caatccttcg gatctgcctt 152701 tccctgggtt actttttgta gctgttgttt catcccccta tctaacgagg gaattgagct 152761 ggtgaatcac agcttatttc tcgagtgatc cctttgtaag tgtagctaaa gcaggggcga 152821 tagccccttt ttaaatatct ttcctatctt tcttacttcg tttaccagct tggaatatca 152881 ctgataaccc ttcatatgtc attcgttgtt atctgttgat ttccctgtgt ttgccgatca 152941 ctgatgtaat ggctgttccg tcctttcttt cctctgagtt ggcttgcaga ccaaggagga 153001 tcaacgtctc attgtattcc agttagtttc acagctaccc cttcccctac atcttctttc 153061 ttcttctctg tatctttcgc tgttggccag cttgaatcgt agttttctct tatgggaagg 153121 gcggacgaag ccactttcac tttggcaaac aggaaccagc caagtaaaaa agaaaggtga 153181 tgggaaattt ccaattagat cgagattctt ctttcttctt atggatagag gttaggttat 153241 ctttgccagc tcgatgccac aaaggtttaa aatggggatt ccttggtgcc gttcgcttga 153301 cttgaggtca gtcctttcaa ctagcctaga cccatcatat ggggagcttt aaatggtgcc 153361 tagcctttcg aaaggaagcc tgtctgtctg ttagtatgga ctatagaata gaattctttt 153421 atcactgcta agcaataagt ctcagctagg gcacaggaga actgaggtcg gggggggggt 153481 gaaataatat agattagcaa actcgttcga ttccagattg gcgagtcagg gggatcttat 153541 taccaaggat cgatcaggga taaagaaaaa aaaagggttc cggccgggct aaaggtttca 153601 aatcaaggtg taggtgggca tatccatgtt tgggcatttc acatgatgag tagagcgatg 153661 ggtcttcctc ttgcaatcga gagaggcaaa taggcggaaa tcggtacact tttagtacgg 153721 tggtatgcga gcagcaaaac agtatatgca atacagaaat acgagcaagc tcactaatca 153781 tagtacggta aataagcaca 
agcagcagca aggaaagaaa gccagctagc ccttatagtg 153841 cccgaagcga gatcggagta ggaagaccat gagcggaaag gaagaaagct tcaatgccaa 153901 gagcagatag gataagagct gctacggtgg gggaacctca cccagtcaat cctaggagta 153961 agccctttta ctaagaagta gaagttcaca tgcccatctt ttctcttgca tgcgccttcg 154021 ttcctcatta attcaaatca aagtaagcga catcgcatcg ggaagggata gggatagcgc 154081 atttgcttgg ttggcggctg gctagcttag ctctcgtcct gtagctgctg cttatcgacc 154141 ggcagaccgg ctaccagcag caagaattga atcttttgag ccttaagttc ttcttgcccc 154201 agtttcacta tctctttcac ttcttcgatt ccccttgaag tagggggttt cagtgggaac 154261 gagatcaatc aatgaagggt caagagaaag aagaaaagaa gactaatctg gatattcccg 154321 aagatgcttg acttcgagtt cttggagtga cagctttgta aacaagggtc ccttactttt 154381 gagtttcgaa aatgcatctt tctttctccc atgcttttca ttggtcaaca accaaaccaa 154441 ccactaattc ttccttcact actaatccag gaagaagtct tgtcttcttc tgttcggaat 154501 acattttttt caaaagaaag actcaaaaat aatataaata ataatatata gaatatagaa 154561 agaattgctt aaataactca gcgatctaaa atcatagtta cgatctacta aaagtaaagt 154621 tgagcacccg gaccagacag aggtggccga gaatcttatg tcaaaagtaa ggaccaagga 154681 tgatcttttc ggaaaggagg agtaggagga gtcagacgga atcaaatgat tacgagatag 154741 acaatgagac cagggagagc aagagcactt agacaattca ctttgagtac aggaaagtct 154801 gctggtagga attcctcagg gcgtattacg gtttttcacc gagggggtgg ctcgaagcga 154861 ttgctgcgaa gaattgatct gaaacgaagc acttcctcta tgggcattgt agagagtata 154921 gaatatgacc ctaatcgttc ttctcagatc gctccagtac gatggatcaa agggggctgc 154981 cagaaaaaaa tgaacacgat cgagaagttc gctccgccgc gcaagatcct cgaacctacc 155041 acgaacacca tcagcggcct cttttcgttc tctttcctgc ccgggaaggt ggataaaaga 155101 aaggtagctt gcttctctcc tggactgatg gccgcttatg tagtggtcgg ccttcctacc 155161 ggaatgcctc ctttgtcttc gtctaagagc gcctttgcta gtaagggcgc aggaagcaca 155221 aaaactttag tgaaggacgt cttcttctct gccttctcct ctccaaaggc caagagagag 155281 actgcatccc ttgccttcgc tagctctttt ggtttcccaa ggatagcggt agctggggca 155341 aagcccgctt tcttcgctcc gcgaatgaga cagaaagtga gaggaaaaag cacgttctct 155401 ctttgcgagg tccaaaaggg gagaacgcat agcattctct 
gggcacatag gatcaaaggt 155461 aaagcagggc tttcttggca gagttttagg cggcaagata ctttagggct tgttggagct 155521 gctgggcata aaaaatcgaa gccgaagacg gatcaaggta acttgcctgc caagccaata 155581 ggcgaaaggg cgaagcaact caaagctctc cggggtttga gggcgaagga tggagcgtgc 155641 aaagtcgatc gtgcacctgt cgtgtgaccc gttggtccta agcaatgtct tgcgcgaagc 155701 gacccaccta gaaagagctc tcctttatct gggggcacta aaatgaaact tcgatcagat 155761 gcgggtataa aatcccgccg ctgagatgtc cagcggattc ctgagccttg acgaaaggtc 155821 ggccaccttt tttttacgga gagcaaaagg cccggggcat agcaggatga accaatgtga 155881 atgagtgtaa gcttcgttgc ccgaacacga ttggtgctga ccacactagg tgctaccgcg 155941 gtagcaagag aggccaggca atgacaattg agaggttgtc actgaacatt tctagtcaca 156001 cgggaagaga ggtccaatgg caaggccata cgcccgtttg gctcctcgcg gagtatagct 156061 cacatccaaa tatcatatct gattggggaa cggggcaaca cccatgaagc tccgacggaa 156121 agggaaggcc tgccaggccg tatgcccatg ggtgcaggat tcttcgaaaa agcgcgggct 156181 gactcggaga cctgggacct tggcttagca acgaatgaat atttctcctc gagctttctc 156241 cgccagcggc ttatgtagtg atcggccagc tcgctaagct ttccttcttg tagtcggccc 156301 gtaatgcctc ccttcatttg cttgcctcct tccttacttt tcagagaagc attttacgac 156361 tataaggggc gctgttcacc tttggaaact tagctacacc ggtcacgata tcttgttgat 156421 attgattgag gactttcgct gactaaatcc ataaacctag aaagtcaccg tcactggtac 156481 ttttttgact cgataggtag gtattggtgg agcttgcgta atgtagttgt agttaaggtt 156541 gcattgaagt ctttcttttt tttgaagatc tactgaacaa aggcgaacgg ggttcccaag 156601 gcgggacgtc tggcagaatg cttggcctcc cgcgctggaa gcgagacccg aagggtgagc 156661 ttctggcggt tagcttctag aacttataat aggcattagg cattctgagc tggaaggagg 156721 caagcaaaag gccgaccact ctatcatttc ccatttctga tgggaaggcc gaccactaca 156781 cggactctat aacaagtcat gagcgatatc gaaaccaagc caaagcctat aggctttggg 156841 atgaaagcca gcccgacgaa ggcctatgat agagtaagaa aaaaagtacg ttaaagtgac 156901 gaagtaactt agccgtctac aaagggaaag gcgtcggtac ggagtcaccg tcagctgtgg 156961 atatagacta tactataagg aacggagtct taaactatgg accgagacta cactaaggaa 157021 caaggaagct tgactgagca aagaagtcaa ggaacgaagc tgcttctcta atagccccgt 157081 
tgaataggag ggcgaaggct ttaaaaaaaa gtttgattta gggagagggg gcttcaagtt 157141 cttaggaaga gccgtacgag gcagctcacg tacggttcgg gagccgagcc cctgcacagg 157201 ggcttaggtc aacacttata taatagccag tcatcaatta gaagcaggca aaatggtgat 157261 gaattgcgat tggtccaaac cttcgaccag ctccttcttg caatccgccc agaatgacca 157321 tcctaagccc ttattcactg tgtgaagaac ggggcagtcc gctggagtgt aggcttcttg 157381 gccacgcccc ctgcttatag atacgaaata cttgagataa attatcaaat aggaaattgc 157441 ataccattag ccgatatacg tttaggaaca tgggtacatg aaccctataa aatccttcac 157501 ttttagcttt gaattactta tatatatcct atgaatttaa tttcgcatcg gaaacgattc 157561 taggagaagt tcgaatccgt tccgttcgga tattgatcgg tcttggtttg acatggttta 157621 cgtgttactg gttcccggaa gagttaatat ctccattagc gtcacccttt cttaccctgc 157681 cttttgactc gtattttgtt tgtacacaat taacggaggc cttttcgaca tttgttgcaa 157741 cgtcttcaat agcatgctct tatttcgtct ttcctttaat aagttatcaa atttggtgct 157801 ttttgatccc cagttgctat ggagaacaaa ggacgaaata caatcgattc ctccatttaa 157861 gtggttctcg cttcttcttg ttcctgttcc taactcctcc ccgggtcgtt cccaatgttt 157921 ggcactttcc atacttcgtg ggtgcaacat caacaaattc gctcatgatc aagttacaac 157981 ctaagatcta tgaccatatt atgttaactg ttcgtatttc gttcattcca tcggtatgct 158041 cccaggtacc tgtaattgtg atctgtttgc cagaaccaag gggtctttct ttggaaacct 158101 tcacgaacaa tcgtcgtttt ttgatggttt ttccgcttct cacagctgct ctttccacac 158161 ctccggatat ctggtgccaa atcgtcgccc gtttccttat ttctttgata atagagttgg 158221 ctatttttgt ggcatcgatt gtacaagttc gtgaagaggg ctggacgagt ggaatgaggg 158281 agagcggctc gatcgagaaa aaaaataaga gtagcccccc ccctagaacc tggcaaagta 158341 actatcaatg aattccaata aagattataa cacacagagg actccttacc agcgggatga 158401 gtgatacatt cggtgagcgc cgtagttaag ttttccttat tcgttagcta tgctaaagga 158461 atggattgtt ctctttaggg agtgcagaat caactagggt taacctctct ttagcctatc 158521 tgtcctcatg ctagtcaatc tcaggtaatt ggtctatggc aaaggggttc ctctttcctt 158581 cacctgatgg caaatacaag aagagttcgg caacgcaaaa gcgggcttaa taatcaaata 158641 ataagatcga gttaagagaa ctcgtcccct aaactatcta gttaacggcg gcgtgatctt 158701 gctgtagggt cagagctgca 
ccgaagaaca gttattatcg actcgatggc tctgatccgc 158761 gtgaatcaga cggtgagtta ctggttcttg aatgggaatc ccgagcacca aagagcagag 158821 tgatcctctg ctcgagatag gaccgggtaa agggaaaaag acagaaaaga ccatagaata 158881 gcaataggaa tcgctgctat ccgacaactt tctccgtgga gcccgcagct aaaggttaac 158941 aaaagattgt agaacccatg gacttactat ctaaacgata cgagacgggt cagtcaaggg 159001 ccaccagagg cacaaagctg gcgaatatcc gtgagagcgg aagatataaa aggtataggt 159061 aagtaagctc ttttaagatc aagggcaata agacgtcgaa ggaagaatcc gcaatgcttt 159121 gacattcctt ttcaaatgag ttgtgcccag cccagttagt ccactactac agatagtaac 159181 tagcacagaa aaacaagatg gaatcaaata cggtgttaat taatttactg aagtgaagca 159241 aggaacgtag taactcgact ggttggagag aaactttgtt attagggttg aatttccagt 159301 cttcgggcct caacaccgga gcttatcctg aggcctaggt aggtaggtag tgacgtctct 159361 tgctgggtct ttcataagga atggtcctgt tagtcttaat ccagcgactg gccttgctcc 159421 gagccgatat ccgaagtatg gttcgctcgc tagttatatg cttaacacat gcatctgagg 159481 tcagaggtgc cgctaaggtg aacgctcagc ttcaactctg actatatatt atatataagt 159541 gcactgaagg ttaactatgt caatctacaa ctcaatgtga ttggcttcgc ccgggctcag 159601 tctctttcgg ccggtatgta gaatcgtcgg agcgagcaaa gcagcggagc gaagtgggct 159661 gtgtaatcat tttatttttt gacttttttt taatatataa gggtaagtaa ggagctgaaa 159721 acgagtcctt cggagggcga agaaactcat taattcattt tgatgagggc ccctcattca 159781 aatgttatgc ttagtgcttc cctcacattt tgagttgatt atttccaatt tcgagagtga 159841 gattgatccg accaagtagt agtgcggaag ccagtacata catggctagt ttccaggcaa 159901 gccatccata ttggcataac cttcttctat gccatctagc ttcatagccg atagtttccg 159961 ttgatcatta tcttcatcca tcggatcagc tccttctttt ctgatttttt tcaatgactg 160021 cttcttatgc ttgccctcgt tcttctctct tctcctagac ccacccccct tactgtctgt 160081 ttaggccccc tgccaggtac tccagtctca ttgcgtaccg tcgtctgcat cttcttgccc 160141 tgttttcata gttgtttgct tttcttatat ggattaggct tggttaaaag cgtaccgtca 160201 agcaagaaaa ggaaccgtaa actagcttta ccgagttggc ttcttcgtat gagccctagc 160261 agcatctatt ccttactctg ttttctgctt tcaagcccct aagtaggatc gtctaacgaa 160321 gtcaggtacg agggaatatg ttaatgcaaa agaggcgagg 
aggctggtga ccagagaagg 160381 tctgtgatgg aagcaattcc tggctttctg gtctgtgata aaagcaatct ctctccggtc 160441 ggttcgactg ttaatggaaa aatgaataga tccttaggaa gaaaaaggct cttttgctct 160501 tgtgtagaat cagttgttac agagaaggag cggttttcgt ttcgcaatgg aatcagaatt 160561 agctacgatc aattaaaata tcgtagtata gtaagagcca acgaagtagc tgtaacgtga 160621 acagctaaag ctccttacct tatatgggct gcaccgcgct gaatgatcaa atcccattta 160681 ttcttatttt tttacattac aactcccatg cctttccgtt ggtcaacaac caaccgtcga 160741 tttacctttt ccttcatttt gagaacaagt ctctcttggg gggagcagag catgcaaaat 160801 cgagcaatag atggatctta gaagaattcc actttgaacg gcacgactct ttcgattttt 160861 gcgctggcat ttgagttgtc tcccctcctc tttcaatcga cacactcgac gcagatagct 160921 ccggtggccc cggcttctgc ctctatcagg cttaggctgc ccccaccccc acagagccac 160981 agcatggagt agctgctccc gcgctacaac caatcaaaat tttatacgga tcgatatatg 161041 atgatgctaa accgagatag agaaagaggg cgaagaaact cattaattca tcgatgaggg 161101 ttgccgcgcc tccctttgtt ggctcttcga gacaaaaaca ctcataggaa tggtgctaat 161161 tcccatgttc cttctagtct cgtttggttt cgaaagcctc cccctccctg tctcttactc 161221 tttgaaagct gtcatcctgg atttattttt gaatggtttc tttgttctct tatttggatt 161281 gaaagaaaga caaaacttca ttttcttcga cgaatttcgg ctatgaccaa tgcaccaata 161341 gctgaatagg cgtaacaaag ctacctgaaa aggcaaggtg caccttggga ttgaaatcga 161401 cgaattgcgt tttgcccgag agatttattt agaaagatta tccatcggat aaagaatgca 161461 aaaatagaaa gaatataaat aaaggcgtat acacgggaag ggggcaccac tacttccagg 161521 gggagactag cctcattact tcttagtggg cgagaggtgc gcctaaccca cctacccact 161581 cataaatcac ggtgttcagc tgacttgaac tgatagaccc ttattgtatt ggaatttagc 161641 gcccatcttt tgactgttgt caacaaatct cttcaatgtt cgattctact ctatgttaga 161701 acatttctgt gaatgctatt ctaatctaag tggtcttatt ctgtgtcctg tgctaggaag 161761 cattactctt cttttcattc caaattcaag aatacgaccg atacgattaa ttggtctgtg 161821 tgcctctctt attacttttt tgtattctcc tgttcctcgg atacaattcg actcttctac 161881 ggccaaatct caatttgtgg aaagccttcg atggcttcct tatgaaaaca tcaattttta 161941 tttgggtata gacggtatct ctttattctt cgtgatattg accacatttc tgatccctat 162001 
ttgcatttca gtgggttggt ctggtatgag aagttatggg aaagagtata ttacagcatt 162061 tttaattcgt gaatttctaa tgatcgccgt gttccgcatg ctagatcttc tactattcta 162121 tgtttttccc gaaagcgtgc caatccctat gttgtgcgga gcggagtatc ttatattcgc 162181 tgggagaaag cttttcctct gcaggggcct tgtgcagtaa acccctacgg gcggtcgtcc 162241 gtcgtcgtaa agtagtcccc gcgaagcttt cgggaagagg ggtagtcttg tgtgtaagca 162301 tagcatttct ggtcgaaccc gcccaatcca actaagaaga accgaacctg acaaacacat 162361 ctttttcctt ttgggagggt actccgagta tagtgggtac ctcgtaggac ctcgacccgc 162421 ctactcgggt cttgtatgga tatgcaggaa ggggtgctcc taggtgtgtg taggggttgt 162481 gtttgttcgc gagaatggat tcctcgtcaa gtctgtttgg ggggtgtgga cacacttgcg 162541 cgaattcagg taacggctac aagggagaaa taaaaaggaa actgtacccg accagggatg 162601 gacgtaaact cgtaagctac cgaggttagg gataatcgtc caggtcttat tgtgaaacaa 162661 aaaagccgcc ccgccacagc aagcgggttg gttcctctgt cgtcgccgga tagctcttgg 162721 cgaggtactt ttgtcactcg actgaaagga gaggagacct taggataagt tgctaaaaca 162781 aaggggagta gaggatcgac ccgttcagta gaattccgaa gaaagactgt tgacagctgg 162841 tggagacatt tctttggccc ccggcaaatc aaaaggaaat gcgggcaggt ttaagctcgg 162901 cagagggttc aagaataggg tcctgccctt cagattctca gaataaaaaa atagttccaa 162961 acctttatgc atgcacctcc gtataagtgc tgcgtacaag ttccggccag gataattggg 163021 aaagatcaaa cccgaaagaa ccgctcacat cacagtagta gtagcgtaaa ggccgtaagt 163081 cgggtagcgg ccataccata aggtaagggg ctattacttt cacatctctc cttctgtcgt 163141 actataagaa agagagatcc gctgcgtgag caacccgact gtgcgttaca tgtgctctac 163201 aggccgaact ccatctttct tcttaacaag cccatttatc tttagatttt gaagacgggc 163261 gttgcgttcg gttcgaaagg tatggttttc agtatgtctc cagatagggc gccccactag 163321 tccggctagc tagtgagcgg ttctttcggg cgagaagcag gccgggccct acgggcgggg 163381 gcatctcccg caacgaaagc tgcatagttc gccaccaccc gaaaagtaaa agattagaga 163441 gtccagacta aaaatacatg catagatagt gatctaatga caagggccga cgacggaagc 163501 tcgggacgga gccgtatgat gcggaagtct cacgtacggt tccctgagaa gggagtggct 163561 acctactgga gcttcgacca agcacccccg gtcaattccg ctttggggcc accccttact 163621 ctaccattat tataggagta 
tggggttcga gacaaagaaa gatcaaggca gcatatcagt 163681 ttttccttta tactttactt ggatctcttt ttatgctatt agctattctg ttgattcttt 163741 tccaaacagg aaccaccgat ttacaaatat cattaaccac agaatttagt gagcggcgcc 163801 aaatctttct atggattgct tctttcgcct ctttcgccgt caaagtgcct atggtaccag 163861 ttcatatttg gttacctgaa gctcatgtag aggcacctac ggcaggatcc gtcatcttgg 163921 caggaattcc tttaaaattt ggaacccacg ggtttttaag attttcaata cccatgtttc 163981 ccgaagcgac actttgttct actcctttca tttatacttt aagcgcgatt gctataatat 164041 atacttcctt gaccacttca agacagatcg atctaaagaa gatcattgct tactcctcag 164101 tagcccatat gaatctggtg actattggta tgtttagtcg ggcggcggcc gttaggtcac 164161 ctattttgag ttatggacac acaaggccaa aacatgtgtg tcgggcgtgc gacccatcaa 164221 cctactagca atgggggaga aaacatagca tgtcgcaaca aaagcttgat tcgaggcgtc 164281 agcaaaacac tgccgtctgt tcccttcagt cccttagcgc cccgggacgg gagtggggga 164341 cggctctacg cgcaggcaac agcagcaccg gctccacgaa gtctgaatcg aatctttctg 164401 ttggctttcc caaattcatt cgtaaatcaa aatcaaaggg ctagaagcga gcgcttctag 164461 ctgcttcgcc tgcttcttct tattatggcg gccatgttgg cgtggcgaaa atgaacgaaa 164521 agcgagatga acgtgctatt ttcaaatcgg attgatagat tgatctgttc tgatagatct 164581 aaagagtaga aatagataga gaatagagag ggaatcaata ataaggtctt tgagcctatt 164641 tctatctatt gatgagacaa ctatctattc ttgatccatc agaaagaaat tatttatcca 164701 tcagaaagaa atcgatatgt atctgatgga gtcttctaca tcgtacgtag agcgcccaag 164761 cgctttttgg gccagctcag ttctcttatc catcggtcca atgcactggg ctcatctcat 164821 ggagggaaaa gccaaaatgt agttgtcttg ttgttgttcg ccgcctcgac gcattccctt 164881 ctctccccgg catcgtccca cacagaaaga aagagcgcgg agccccggcc cgagccgtag 164941 gtccgctaac gtaaagcgag gagttgagcc tgaactggcg aaccgaagtc actttcggaa 165001 ccatacttcc tacagctgct aacatgtgcc cagtcctgcg gaaaggcgca aacgaacgtg 165061 agctgctata ccggaatccc ccgctggcca tcggggaccg agtggtaagg ccatgatctg 165121 caggggaacg gatcactcat tcttccattg gggacaggtg cacgaacgac aactccaaac 165181 gtcacacatc cgccgcctac ttaccgttta ggtggcacca gcgagatcca gctaaggaaa 165241 aagagtgtcg cggctgctcc actccgccgc ggtctcatga 
acttcacttc gttgccttcc 165301 cgcgcgcaaa gcgaatgggc gctgtgctgt gggtcagttt cggggcgggg ggcgcagaga 165361 taccagaata aatgattcat ttgtttggat cgacgagctt tttcagcccc aaaactcaga 165421 atcaatggaa tgtctgtcca taaacatcta ttctatctgt atatataggg gatctctcta 165481 tacatatcaa agtcttttat ggcattgata tgatcgccta gctgtagccg catatcacag 165541 ctgcactcaa tatgcggatt tctgttggat cagatctttt ctttcgcttt gacggaagct 165601 ttttgaccta gcgaaaagca ctttcgcgca agcaaagtag aagctttgcc agaagcaaga 165661 cgaggaagtg gagctgtcgt ataagcggta gcttcccccg accgactaaa atacaagagt 165721 cgcggcctac tttgattgcg aaggggtttg gcaacaagca aacggctttc tatcatagtt 165781 gcaagggttc aaaaccttag ttcgctgctt ttcccagtac cagagaaggg cttatactgc 165841 tcgcctttgt ttggtttgat atattcattc agtcaaaata caaactacaa caaagtggaa 165901 gtggaagggc cgctatagaa gctagaactt gctttataag tcggcctaac caaagcgtca 165961 cgacaacaaa aaattatcct tatgaatgga atcttaagta gggggggctt tgaccgcccg 166021 cctaccaatc aaagaggcag agagtaaacg taagctcacc cgtaagctcg aagagagctt 166081 cccttcattc gcttcgcggg agccgcacaa gcacatagct ggaagtcaga gggcccatac 166141 tacctgccta accctttgtt ccgagggacc gtagatcgga aagcacccca tttatccaaa 166201 agagaaggga aggggcctat gtatttgcat gacccctgcg gatttgaccc tatcccggag 166261 ccaatcccct attggtcctg ccaccacgcc gcagaacgag agctcgtgtg gaaccttttc 166321 tttctggcgt aacagccggt ggaacgtaac aaaagattac ggtcgcctaa cattaacata 166381 agggcggggg gtacggtaaa ctcggccaaa atatgacacc cgaagggccc gaacgcacaa 166441 tcctatccca tccgagtccg agtttacccc ttgcacttcg gacagccgac cgtagcatca 166501 taaggaggac cccctttcga ggtaaaaaaa aaaaaaggta cggtacatat aggaggttgg 166561 actttctcaa cgtggtgtat agcacgaaaa acttttcgat acaagaaagg gccgttctca 166621 catgaaagaa gagaagaaat cctttcttct cttctttctc tttctcgaga aggaaagaaa 166681 gaggatgggg ggaggggggg aatggggccg gtgcccttct tttacggccg tcactcttat 166741 ttgtcagccg tgaggaacta ccggctcggt ctcggtggga aaggaaaggc ttgggcctac 166801 ctatcccgat aagacctcat aaaggaacgg cgggagtgat aggttccata ttgccgagct 166861 gaagggcaag acttttgtac gtgatcgtag tatgtgacgt cgtctcgtcc acgctgcatt 166921 
gaagagtacc tacgcactaa gttccggttc actgataagg aagatagagt tgggcggggg 166981 tctacgatgt gatactcaag tatatgaccc ggggagatac atgctaacta tgggtaggaa 167041 gcaggaaccc ttatgtaaat aatttcgggg gggggttaca gatctcttat actaccctcg 167101 atcgacagag cggaacgacc agaaaaataa gtgatgttag aaagccgtat gataggtggt 167161 aactatcttg tacggttcgg ggggtaatcg gcgtactccg ggagaaatct ttcgctctat 167221 cgaacataca gggaattgga ggtagcattc taccgatgtt aagtcatgga ctggttcctt 167281 cagccctttt tctatgtgtt ggtgttttat atgaccgaca taagactcga cttgttagat 167341 attacggagg tttagtgagc accatgccga atctctctac cattttcttt tcttttactt 167401 tggccaatat gagttcacct ggtactagca gctttatcgg ggaatttctc atcttagtag 167461 gagctttcca aagaaatagc ttagtagcca cattagcagc gcttgggatg attttaggtg 167521 cggcctattc cctttggcta tataatcgtg tggtttctgg aaatttaaaa cctgatttcc 167581 tccataaatt ctccgattca aatggcagag aagtttccat atttatacct tttcttgttg 167641 gaggggcgac cgtccgttga actaccaaag aaaaagggta aacctatgtg atcatgacat 167701 tgtaggtgct tgcgatggga cggatgcgac ttccctcagt tggtttgggt ggcatagccc 167761 gttgcataag tccccctttt tttgattcat tttttgagtc tttagggagc caaagcttta 167821 ctttactaat aaaggctcgc gcaggggcgc tcactttttt ttgctaagcc gtctctttct 167881 gggtgggacc gagagaaata aaggacagag ggcaaccatg catggtactt ctcgaccctg 167941 tctccgaggg acagttgaac gagcgactca tgaatgctgc cgggtcggac gagccaataa 168001 ctcgaacgcg ttcggtctgt tttttgagca agaatcacag cgttacctta ccttcaccat 168061 gatacggact ccaagttctt atggcagagc acgaggagat ttatcatcaa tattctaatg 168121 ggaatggaaa ccagaagcac gaccgaggtc ttcgtagtct aaaataataa aaccatcaat 168181 aacagtaatg taagcatgag actttttggt agtaccggtg aaccagatgg ccgcggcgat 168241 ggaatctggg acggaggact cgtagtatct ctctagatcc ggcaaaagcg aaagaccccc 168301 tagttccatt cgaatgaagg ctgactgctg agcccactct ggccccggcg ggcccccgtg 168361 gttgcgagcc ggagctgcca tagcttatgg ctagagcaat gggaggggcc tcagcagaga 168421 gaacagaaca gaaccaccgt aaggataacg agtgctccgc ccgtcaagcg ggcggcagga 168481 gcagcaggca agtacttggt aggccaacag tccagtgggc actcgacgaa agggggcaca 168541 cggagcaagt acgagaaatt 
ggccccgctc cgctttataa aaagcaagga cccactacgg 168601 gaggtcaaaa acccaggacc tatggaagtc ggggctcgtc cccggtcaat attggatcaa 168661 acaaaacaag caataggggc ccgtagcact gacctctttt tttattgatt caatataata 168721 ggggaaaaga tcgtacagtt ccctaccgag acaagagaca ctcttaacag atcctccgcg 168781 cgctgggcat acctcttccg tgcgtctttc tcgtggcagg aacagaacaa cagggaaaga 168841 aagacccggg gccgacctgc ccagggctcg agggcgagct ttatttaaga gagaatgggg 168901 agcgaatcga aaggcttccg ttttagttct tggtttggtt cgggctcctc tcgatctttt 168961 tcgtagtagg gaagggggaa ggagtctaaa tctatggaca ttaatgaacc atcattgatg 169021 gacgttgcac atgacacgat caattcgact cagggtccgg cgctaataga ggttgcttac 169081 tttcctagta gcgaaggaaa agggcagggc ttttttcgtg gtaatagtgg gcgggtctcc 169141 tttcgaagta aaggccttcg cattcctaat ccgccccacc aaccccggac ggcttagttt 169201 gtcccagctt ggtgaatcgc atccccgcgc aacgacatag tttgtgcgcc ctttaccgtt 169261 ctcgctcagt gtttgcaacg gctggggagg cagtcgtaga agcgaagtct atcgccacgc 169321 caaccatcaa atacgagatt gggccccttc tcaaagattt gatggaatgg cccacccaat 169381 agcgcttatg tcatatggga actcatggct ggaaacaatc cttatggttt tgatatccgg 169441 taggaataat aagaatcaaa gtccaggtag gttggtgagc ctagtgatag gagactatct 169501 agcttggttc ggagagcact tgttgggtta aaaacttttt ttgttgctaa atgttacagc 169561 ctaaatgctg aactattgac tctactcgtt cggatgggtg ttcaccccaa agtgttcccg 169621 gactgcatgc atacatccgt aagtaactta gtgcaacatg gcaaatttca ttgagaggaa 169681 tcagcaaaga aaagaaaaac gggtcaacat cttaatgtgt atttgaggat attttagggc 169741 cctagaacgc aaaaaaaagg tgggtgaaca agagttgtca cgataggaaa gagaaatgac 169801 tataaggaac caacggttct ctcttcttaa acaacctata tcctccacac ttaatcagca 169861 tttagtagat tatccaaccc cgagcaatct tagttattgg tgggggttcg gtccgttagc 169921 tggtaccatg atactttctg ttttgtcgag ccctgctttg gtctctggtt tgatggttgc 169981 acgtgctaaa aatctggtac attccgtttt gtttcccatc cctatctttt tttctatcaa 170041 tcaacttttc cattactttt gtaggcttcc tattataaag cacctggcaa caaaatgcca 170101 actccttctt tttttgattt ctcactttct gcttttgctg gttcttacca aattggtact 170161 tgatctgggc ggctatctct ttatggacga cctgagtcgt 
gccctttctc aattcgtccc 170221 cggtttttcc gggggattgg gagggggttc taatacgccg cccaacccct caggggattt 170281 ttttctctct tcttatcaaa cttcagaccc ggactatcac gatcagcgga ggggtgattc 170341 ctacttttcc tccgcgcccg gagtacagga aacccaccgg cacgcgtcgg gaagttccac 170401 aaaccttcac ttgaacctaa acgaccaaag ccaagatccc atttttttag aggttgagcg 170461 cctaagcttg aaatgcgata aagtgaaaga aaaaacgata ttaaagaccc agtctttatt 170521 gctagaaaga ggttaccata ttccggatga acgggatata gaacgagcaa taaatgttgt 170581 gatgactgaa catgaaacga tcgatataga tcggcgtagg aaacggttct actaccttta 170641 ctcgtgttta ggaaaaaccg gaaacaagtt ttggatggag ctactggaaa cgctggctga 170701 ctacaatata aatataaaaa gcgattccga caactaacct gccatttcag gttttttgtt 170761 aggtctcgac tttcttttca ggcttgactc attattttgc taggtatagg tcctctcctt 170821 caaatactgg aaggtggatc acttgtagga attgtaggaa tgacataatg ctaatccatg 170881 ttgtacatgg ccaaggaagc ataaaatgat tctttcattc tatagatacc tctggtaggt 170941 aaagcactcg actgtgcttt attgaaagtt cccatcgcgg gggcgaggat acttgccttc 171001 gcggttcgac tttcttttca ggcttgactc attcgctagc ttctcctccc ctttagagct 171061 ctttatgatg cccactgagt aagattcggg gacttccggc gcagaagctc attctgaacc 171121 gcgggaacct tcgtctcaat cttcataatc aatttatgca cttttttctt atgtgatttc 171181 atagttcgag gcgcttaatg aagaagcagc tatagttcct ctcgggctag aatgactgac 171241 ttggctttcg actgatatag tgcatgggtc tgtacttcag gctgaggttc ttggattgaa 171301 atcttatgat gttctgactg accttttccc tcctcgactt gagtagatag tcagggtttc 171361 agaatatgtt tgttggatgg gaatatggtg cttgatggtg ttgttgttca gacctcggct 171421 ttctcccttc ctttcgtata tgccatgcca gctgacaggg agatcttcgt tgttcagctt 171481 atattagagt cgttctacct tctatgggtc caagggaaat ggggcagttg ttccatcttc 171541 ccatgcctgg actaccgata gttccctctt cttcctctta ttgtttccct ctttcgtacg 171601 tacacacgtc gtcaagggat tgatgctaaa caaccaaagc tgccttggct gctgtgactg 171661 cctctccttg cctcctatcc tttaagtcga gtacttcgtt ccttttcagt cgagttatcg 171721 ctttcgcttc ttttgatcct gcagaggcag gacaaactcc cggcgagcaa ggtcaagatg 171781 ccgatgctgt cggtctgcag cctcctttag tcaacgaagc gaactggctg aactctatgt 171841 
cttttggctt cgtcttgctg aaagaaaaag agtgaggaaa acaaagcaac aaaagagagt 171901 ggcaacaaga agaatggaag gcttcccgtt actgaagtga acttctcggt aagtccggca 171961 ctctgcccca tataccttat tcgtagtatc gctcctatct cgatgaagct tctagcctac 172021 accggggttc ctctcgatac atcttgcaag ggatttctag ttgactgttg cctctccttc 172081 tccaccaccg aatccttacc ctcctcttct ctccggctag acatctgctt tggcttcacc 172141 ggctgactcg gatgtaagcg atgaactagt ctctcctaca gttaatcatt tctatggtgt 172201 gggtagcgag gttcttcttc tccattgaag aaatgagcag ccttgagacc caggagaata 172261 gcttcaacat agccttcacc atctgacccg gagcccttgc cgttcacttc actactacgt 172321 acgccaggat ctagccctta gctgcagaga ttcggataaa gctctgtgct ctctccaaca 172381 gcgtattatc ccgtagctac cgctctttcc tgccacttca aagtggaatt ctttctctgt 172441 gttgctttgc gaattgggat tgagtctcag ccaagaaaga taaggtgatc gaaaggtcaa 172501 atcagagatc tcctcgctta cccctgaagg agctgatcga ggaccaatgt cagaacactc 172561 gagttggcct tccttcccta cggatgagga gatgaaacag attatggttt agaagccgaa 172621 caaagagcat tggtgtgacc gttagcagcc cagtcgtgcc ctcccctgaa cactcggacg 172681 agagggtgag ggtgggtgcc tctctatgat tcgtctatct atgttcctat ctatcggaat 172741 gactcccacc gctgcttcaa tctatatctc ccggtaaagt gaaatcatca atctgaatca 172801 attgaatcct tcgatatcag tgaaatcctc catctatgat tgctccttca tttcctcaat 172861 cggaagagtc ttcttctctt ctgtcagttg atcagcatcc cttttccaag aacacctcga 172921 attcggccta ctctacttcc cggaatgaga cgggggtcga gtcataggag ctgcttctct 172981 ctgctattgg acttgcaaca gcggtctctc tataggaaag agatagctcg ttcactccgt 173041 ctctctctgc tttgccgacc agaaactttc cgtatcgtac ttaagagccc gaaagcacac 173101 tcctctgctt tgattccagg ttagggataa gaactcttca ctgttgaata gagagattca 173161 attgctagta agttattcaa tcgatacttc tgggcctgcc tacgctgatg aatccctcct 173221 ttcttcaggt agcatgagaa gatggaatct cagacttgca aagcagacct acgactcagg 173281 tctcgaagag ccttgccata gcgtacagag aaagcaagag gactggtagc tctagagtcc 173341 agggggtctt cacctttact tatagtcgag tgggcagcgt tgacggaagg agagtcaggt 173401 attcacttag ttgacgctat cgtccttcta atccatcttt ccgtttagct cggcattgct 173461 attgatctct atctctgctg 
aagcatgtta ttgatcggga atcgttcctt gaccatattg 173521 gtcttatcca tcctcctcta acccctacca acttcgatag cgatcactaa ggcaacctac 173581 atcctctatc ccttagcctc tctcaatgta tgcaagagtc tccctgcgag gagaaggaag 173641 gcaaacaaca gaaaggagag ggcaccagca agtgaaacac acatggaagt cagtctagca 173701 atctatccca gccaagaagc cgttaaaaag aagtcaggat acccgaggta agccaaccaa 173761 gccaatgatc caaccaagcc aatgatgtgg tttgatgcat cgatcttctt ccgaacaagg 173821 tagccaacca agaaagtatc tgatcaccca tccgatacga agacctttga tagtcaacct 173881 ctccagcttg atagtccctc tccagtcaat aaagaacctt cccttgcgaa ataaccttcc 173941 cttgccaggc agttggcttt gttccttggt aaagaagaga aagctaacgg gtaagcggat 174001 ggcaaagaag agaaagctag cctttgcttg ggtgaagatg tgaaagagga gaaatccaaa 174061 gaaagaagag aaagctgcta tagtttaagc taactcctcc gacagcgaaa agaaagagga 174121 gaaagctgcc agtactcact cagccgagga tccttacttc ctttcataga cgaaacgggg 174181 cttgaatcgg tatattggtt gagaatgact ccctgcttca ttaatgggaa atgcgcggag 174241 caactaacta gtaccactgc atacaagctt ctcttccttc ctccgttaac agaacatacg 174301 acttctctgt atccaaagca gtaccccgat aaacaactag tacttttact agaccccagg 174361 agcggaactc gattaacata ctacagccag cccttcttca ttgccctctc ttccccatat 174421 tctgggtcct ctttgtctgc taagttccac tttgctcctc atctctatgc attcttggtg 174481 gttgctagca taaccagatc tgacatagct tactcagtga actgactttc tgatcgaaga 174541 gagatcagaa ggtttaccag tttatattcc ttcctagaaa acgctttctc tgttgagata 174601 tcttctctcg acctctctcc ttgcgtagta gctgagtcat cagtttatct tgtagtgagg 174661 gattcacctc tggcttattc ttcgtgtcag gcttgttaga ccacttcttg tcttcctagt 174721 attcttgctg cttttactta ttctgagtga gagctcagtc ttatcttctt agccttctat 174781 ccgcatactt ccaacacaaa gagaaagaag gtaaatggcc tctggagagg taagatttgt 174841 agggaaaggt aagccaatac atcaaatttc tcactcacaa ctggcatgct accagcaaga 174901 atagaatctt ccattttcgg tgaatgctct ctcttcaaca acaggagttc taaataaagt 174961 aacgaaggag agctgcggaa ccgctagcag cacttgccaa cacttgtcag cagccaaggg 175021 gtcctcttct tgccagccaa tgactaacag atagatgttt gggattcaga cttgtatgca 175081 tcgggggaaa catggctccg cctaaggttc gaggaattcc 
tttgacctag tttgctgcgt 175141 ttccgttctt gcttgctcga cttatgactc acaactggtg gagctttgcc cttggcttgc 175201 tgtaaagagc ttggccaaat aggatggttc ctcactcttt aggggaagga acggtgcttc 175261 tcaaagtagg attggagaag tctcccccta acaggaaatg tacacaacgc tgtttgtggg 175321 acgggaactt cctggtgatt tgactttgtc gaagtctctc tagtgcttcc gatgtgataa 175381 tgttaagcat atagctagta acgcctttga agcaaggttg ccaccgaagt tcatcaaaaa 175441 aagacggggt attgcaactt ttgaaggctt ttcggctctc taaaataaga tcagatacta 175501 tacgaaatta gccttttcga ggacccaaaa atgggtcttt ctttactgat tcattaaaaa 175561 accacggctc tttaaaaaaa gaaagagttg aaaaagctga ttgcatagcc gttttattca 175621 cgtcgggcat tcactttagc agctttagcg agtaaaggca tatagctagt atagcagttg 175681 agtcggtgaa ctcggataag taagcttaca gtatccgtta tcttattcct tagggcaggc 175741 ctctatcttg ttccaatccc agtagcaaaa cgcgggctag gagtggcttg ttcctcactc 175801 gggggactgc ttgatccgct cctcccgtct atatttctta ccttgtcgct tcccgttctc 175861 catcgtactg atatgactgc tatggtagga gaatctcaag tgcttgcgat aaaggtcggc 175921 cccttccaag tcggaccttg gcttgtagcc tggacttctc atttgatgag gaggaaaagg 175981 ggcatgtaag tgtaaggatg aaacctcggg taggtgagat tggagcctat tattgtatgc 176041 gtgatggaat tccgatcttc cctctcttcc tctggagccc ttacctcgca ggggcctgac 176101 tttggggtgt ggttcttggc ctgtattatt tgtccgatcg cattatcttt tattccggtt 176161 catcatagtc tcgtatctct caaccatcta tatcataaga gaagccggtt ccaagactag 176221 tcgagatgcc ctattcagaa aggtagatga gagcttgcgg agaaggaaag ttttcatggt 176281 tggctgatcc atcgaatact gtcctttccc ctttttacca ttttttcgtc gagttgctga 176341 tacgttcttc tccagcaaca gcctacgtag caatcttcct cttttcaagg acaaggaaga 176401 aagcatcaga cacgtccgta gttctaggtc tttttgccca gttcctggag cactcttgtg 176461 gtttagtgtg ctaagtaatc tgggagccca ctgtcatttt attcgggctc agctcaatcg 176521 gcatcgaaac tcttaattct attatgaccc ttcgccatct cctccctgtc gattctcatt 176581 aaaaattaag agcaggagac ccacaagtag taaaagcaag tctagacctt ttagccattt 176641 caagagaggt agaccctcca gcgcaggaag ttcttctaaa aagaagaaga gtgcactgaa 176701 gaggatagga atcagggcct gtactgtatt gggagtgagt tgaaatatct tttgtataag 176761 
acatataaaa caaatcccac cgatcagaaa taaaaggaag accatccctt cgatcggtat 176821 tcccgctttg cccgccgcgg acgagagcca aacggataca aaaagggcga gaaccgaata 176881 aagtataaag gtcagcatat atcctgttag cgtgcgtttc tccagggaaa taaaaagaac 176941 gcacgatccc gagaatatag cggctagttg ccaggggtct aagaaagaga tctcctgaaa 177001 agcgtataac cttagtccat ataacatgag acaaagcaaa atgagaggta gaattccccc 177061 tcgtagagct acaggggatt ctcttcttcc tagacagaat gaaagcgcgg agtatcctaa 177121 aagctgcgaa gctaatagga acccccactc cgcgagttct ctatctagtc caaagaacag 177181 taaaaaaata aaaggcgaca agggagggga accccaagac cagaacaccg gagaaaatgc 177241 caagcaaccc tacttgggcg aaagccaacc ctcttaaaaa aagggtggaa agttggcata 177301 gggcgaatag tcctaggaga tatggcctta gtctcggatc tatagaaagt gtgcagtgag 177361 agatctttat aggtagagtt aagtaaagcc agtctttact taactcggcc caaacaattt 177421 tgattattca atataaaact gtcccatgct ttccgttggt caacaaccaa ccaaaccaca 177481 tattttcgtc tttccaaatt aggagagcaa ataagcaagt cctttccaac tagagctctg 177541 gcggtctttt aagctgttca actagttagg gcattagcct cagaggcagg ggtttgaagt 177601 caggaatttc ccactcccac tggacgacat gcggatagag aaatgcgact aagctagatt 177661 caggaacagc ttctattagt acacaatctt tttgaatata gatcagcagc ggatgttgcg 177721 acagtaactt caacatcgac aacagccttc agtcgtcgaa gtaaggaatt actaggaact 177781 gctttcattc ccacgatatg cgccaacagg gcttttagga agagtagtaa cagcaccagt 177841 aaccgctttc tttattccgg actttcagta aggaatgata gaaccgctga tacaaagtat 177901 agtgagaaga gctgatatgc ttaagcgtcg aaaggactgt ttgcttacgt gccggattgc 177961 ttttctaact agccagtata gttacagctg atatttctat atgaaagaaa gctttaataa 178021 tagatgcgat tgtgagcagg cttacttatc agtgctagct tgtatgcgct ttctattaca 178081 gcaacagctt ttatgcgaaa aacaagaaaa aggaaaactg cagctctgcg acataaccag 178141 tcttgcagaa ctggaagaac gaatttactc gacaaggaaa ccaagtcaaa aggttgcgcc 178201 tgacagccaa gcgccagttt ttgtaatctc tttccttcca aggagtcctt gtcttcttgc 178261 atcacggcct cgatctacgc aatgaaaacg tatttcctct aactgcccgt aaagtcagaa 178321 ttcaacgaag agcactcatt tatgatgtaa ctattatgta cctagaccgg ccttacccga 178381 tacattactt gcttcccata 
ggaagaaggt ctgcaagcct ccatatgatt gattctcgca 178441 atgaagtctt cccgttgatt ccacttcttt ctcctctgac tcagtctagc aaccccacct 178501 tggtctcgta gcccacttgt agccatcact accgcgggtc tttgctcata gtcacgaata 178561 agcttctcct aagtcctctc agcaattcaa agatgagact gactgacttc tattaagtaa 178621 tcggataaga ctgatgcact cacttttaag acttccccga tctcctaccc ggctaaaaac 178681 tagaagtcag cattctattc taatctccgg ccttcatgcc aggggttcct cctccgtacc 178741 ttttccttct gtttggcttt tccccagccc gtgagaaatt gataagccaa tagagtcaat 178801 cgtcaagtcg ggatatccaa taggtttttt tccaggtaag gaactagaat agattcatac 178861 gaaagtggct ttttggttgg tagcaggacg gtaattggca acgaagcaga gatttatttt 178921 acccattaaa aagtatattt caccattcct ctcgtcgtcc agaatgggtg actcaaagaa 178981 aacgtttcat tcacgatgag gcagaattga cataatataa ccaagattgg atggtgacat 179041 ataatatagt aaagtaaggg gttcctccgc ccgtcttcga tacaatcatt gcgatgtgtc 179101 actcaaggat agctagccat tcttctctac ccatgccatg atttacagat gagctcgaga 179161 ccattattga atatattatg aacacaattt gccaagagtt ggttgtgaca aaagtgattg 179221 ggatgcccat ctttggtcca gaaggggttc tctatgaaaa gagtttccac ctttctctac 179281 tcgaaaagcc ctatctaaaa gggcttgtct ggatgaatgc agtgtcggaa gccgtgatca 179341 catagtaact tccgcccaca gtgctattac gacggcgggt caccgggagt gaagtaaact 179401 cggctcctga tgtagcattc attcggacca ttcgacgttt gattcttttt atcagggata 179461 ccgatgactc tgtgagaggt gttcttcggc caagtttccc atgacgggtt acccggttca 179521 aggcttttct atataatgag aaatactact ttctttctag cttaagtgtt cacgtaggta 179581 aaatagcttc tatagctcca tccaatagta atcaacggag atagagtcca gcggttcaac 179641 caacgcttct aaggagagcg gggcaagcaa gaaagcaggc aaagtcattg agcctattct 179701 attccgaaag ttcaactact ggataaacaa cgaaagccgt cggcattctt ctcctactgt 179761 agctgctaca attgctttag cgcgagcagc aaggaggagg cagctcttac taaaaaagca 179821 aaaagggaag ggcatgacag aagggagata gacctcctat aagcattact ctcttttgag 179881 gtaacttact tacttactct gatatgatga gttccgtggg ctagtaagat aactattgag 179941 gtgagggtcc gaaggagatc tttcactatt tatgcttgta cagttttatg atagttttgt 180001 aagggaaagc aaatcctgag aatggaatgt aagttaggga 
acagtagtca gacgcggacc 180061 agcgcaggtg ggcgccacag ctgtcttcct ccatgtgata tctgatatca cgcaacagga 180121 gattcttcac ccttatttat tgacacgatt tacaaaagaa gaaccaaccc ggtcgtgcta 180181 tatgtatttg ggatcaaaaa ggctcactgc tgtatgatat aaatctttct gaatgagagt 180241 tcatgagcta caggaagaga tcgagtctag aataaacctt agaagtggca aaaacgactt 180301 actttgttgc aacgggaact actcgccccg ggcactggtg aaatagaatc ttatgtgttg 180361 cagctactta tgctttccca actagaaatc aaacggaaca aaaccgagtc ttgctattta 180421 gagccagggt tcctccgggg gtcacaacca gagacctacc agccccatgt aaaccacttt 180481 cggaagggac aaagcagttg aactctttag aggaagagtc taaccacaag tcaaactgtg 180541 aacatcagaa ggatcgatat attggtaata gctctacacg agagaattga aactccacct 180601 gcaattccga gtataggaga gtccagtcaa taataaagtc ctggttgcag catcgggtaa 180661 atgggatcca ggtaaatggc ttactgttcg gtcaacattt catgattcgt aattagtcaa 180721 gctccggccg gttcctatgt ggtgaatagg gatatattta gtattaaaga catgcgagtg 180781 ctccgttcgt cagtaagcga aagagactga aacctgggag aattccgctc tgttaagaga 180841 gagaactaac gaaaattgga gactgacgga aggaaagggg ttcctccctt ctcctttagg 180901 ggagaagctt ggttcctcct tctccgctga gaggggagaa gagctgggtt cctctttgaa 180961 aactctttcc gtataggcca gaaaacagct tgcttagaga aagactgact ctcctacgga 181021 cctggtggac cttacagtcg agttattgca tcgatctcac aaactatcaa tttcataaga 181081 gaagaaagat cgtttttaga tcatcaagtg aggacaggta gtagctctgg tagagcgagg 181141 gactgaaaat ccttctgtct gcggttcgaa tccggactca cttctagctc tggcacaagt 181201 tcctcatcct tagttgtttg atgagcctgc gtagtattag gtagttggtt aggtaaaggc 181261 tgaccaagcc gatgatgctt agccggttag agcaaaggac ttgaaatcct tagagcaaag 181321 ccagggactt gaaatccttt ttgtgtcagc tctttggaag tgccttttcc tttacttcag 181381 taaggagtct caaaatagga aatacctagc tagcaaagta agcaataacg gaaggtaacg 181441 aattctccgt ttcgatatta acaatccgga tctagggaaa gtaggttccc gcatcacgta 181501 aatttcattt cttttgggaa aataatacat aaaccttttg agtcacagcc acctcagcac 181561 aagttactcg agtagcacag aagccatcat caatagcttg cttgcctggg agctcaacca 181621 cgaattctct gtcttctaac ctttctatca cttgaggatt atctacgcaa ttcgtaaatg 181681 
atcaaactag atagtccctt atcttactgg ccagtagtag aaggggatct tgaatcaaag 181741 agttcctgtc caacaagcaa ggaatgtcag tcataaataa tgaaatccat ataactctac 181801 tccgggttgt gagaacatcg tgcctgttgc gattgtgtgg ctttccgcat ggttggagac 181861 cccctatgaa caaacaaaac agtagggtgg tgtaggagaa ccccgactcc ctaatgcaag 181921 atagagctct tagggtgcgt ttcgtctttg tcaggaagaa aaaagtggca agaggagtgt 181981 atccataccc gtaccgaaga gatcttggga aggcatctct atagatttag tagttggact 182041 accaaagacc cagacagctt caaggttagg tcttcagtgt taggtcagtt cgaggtcagt 182101 tgttccctcc gtactgttgc tagagagctg ctttccatct atcttcctac atgaaaggat 182161 ctaagtctat ccaaataaaa tagcccagaa aatgacagcc atcaaaaggc gcgcacccat 182221 atagcctcgg cctgtccaaa tgatgttcag cggtctctac ccaagtagct gtggcccatg 182281 atccaaagga cccgcaacca gtcaatcatg ctatccttac cttccaacca atcggccaat 182341 cacgctatcc ttaccttcca accaatcggc caatcacgct atctccttac ctttcaacca 182401 accccttcga ttccgcttct gcagcagtat ataatctcgg tcccttacct tgatgcctac 182461 agctcaattt gttttccagt gatggcaaga atccgatccg cgaaatactg ctttttcttt 182521 tttcttgtgt tgtttctgaa tggcatcata gctacacgag ggaaagcgat gctgcccact 182581 ctgccgcaaa agggggccgc tttcttcccc cccaaaatgc cagttccacc atcagggccc 182641 agcaagcagc ataattctgc tcctcgatcg gacttcgtgc aattctttta tatgtaaata 182701 agggaggggt ctcgaccaca tttttgagta atagtaatag gctaaggcgg attcgtctgc 182761 ttcgcttacg tagtcaaaac attctcgctg gacacctggt aaggaatatt tgtatatgcg 182821 ggaaagatac tctcacctac agctcccgtt actagtagtt ccggttagcc cacttgcccg 182881 agcacactct caacttgtag atgatccaac agaaaaagca agagaccacg aaagcacgca 182941 tgaaaacagc ccttttgaaa gcatacccaa ggcacccatc caatctgaac ttatagacat 183001 caagaagaag atctattaac acgcatggct acctatataa caagttacct ctgacctctg 183061 tctcacgacc gcaaatataa tctgtagctt catgccctga cctgaacctg atgccaagtc 183121 ccatcttgcc tatgttgccc atctcccatt taatcggaac ggaagtaaga aatctaatat 183181 ctgtacggac cccaaacatc acaatggatc aactgaaaaa tggctgttgt tctattctca 183241 ctcaactgga aactctcctc cgcagccaga gtccaagtga aatgactatc tttcaaacac 183301 tctgtgatag gtgctgtggt 
gctgaagttg cgaatgaagc gacggtagaa tgtggcaaga 183361 ctatggaaac tacgaacctt actagtggag gtaggtactg gccattccac aatggcacta 183421 acctttcccg gatcagcctt tgactccttc agctgaaatg caaaatctca agaacaaaaa 183481 aaggatcccc cggaggcttc ttcaaattga tatagagttt ctcttgttgc aacactatca 183541 ttaattggcg aagatgttcc atatgttcaa cggcattttt acgaaagacc aatatctcat 183601 caaaagatac caaaacaaac cgaccatggg cgcaagacct gcgtcattaa ttgcataaac 183661 gtacttggag cgttgcttcg accaaacggc atcatcatcc actgacccct tgctcccaag 183721 ccaagtccag gttttttttc catctctata tgagccacgt aatgatttgc ataggatcaa 183781 aatccagaga cacactttct atagcattaa caaaccggaa cccggactca agtcattcgg 183841 tcatagtaat gatttttaac aaagaccaat tgttaggatt ctttgcctcc ctatcacatg 183901 aacatcttaa tgagatagaa atgaatagcc atcaagaaag ggactctcga agattcccac 183961 tggaaacgga ataagaagga aaagcagaag gtccgaaggg cttgatagat ctcattattg 184021 gatagattat cctagtggtt ctcacttatt ttatggtact attatcctag tggttctcac 184081 ttattttatg gtacttttga accatttttt ttgaatacat agtagaaaaa aagtcatgca 184141 aaagagcgta gtgtaagaga aagagcctgt caagtaggtc tactcaatct tctatcaggc 184201 agtttcaaag tcatagatct ttcaaagtca tagatttcca agtcatagat tctgctggca 184261 gtttcaaagt catagattct tctcgaagaa aataagtcat agttcgcttg ataatcagag 184321 ttcacttcat agttatagat cacactttct tctaaacaga cttctatcaa tcctttggag 184381 aatcctaatc cacccgtctt ccccagtccc tgagcccagc ttgtagcaag ccaactgttt 184441 gactttgcta agagaagaga gagaagggaa agacagacgg cagaaagcct tagttgctat 184501 ggagtttgat tgcttcccct gggcctctgg cgtaaaagga ttcgcacctt tgctagcccg 184561 gtcccaatga accgagcggg gcgcacactg cagcttcctg cattcgtact tctgactagc 184621 tcccatccta ttcttttagt cagtctagct agtatcggat tcggaaaatg aaagtgagat 184681 ccttgtactg tggtactttg cctattcact catctccttt agtcgcaagg aggcaaccca 184741 aaacagcaga gccaaaccaa accctaagtc aagcacagtc acatcagcac agcactcgag 184801 accagtctag tacaggcgca gacattagag ttatagatcc acagcaaaag cctactttga 184861 aaccttggag tacttattcc aggttattta aaaggccaaa gtcgggtctt tctttcacct 184921 tgaataacta tcgataaata aagggctttg agaattaacc 
tcaccctttt tctaaggcga 184981 agtgggaaat gtaagataag gattccggaa gcattggaaa agcagcgaga cgggcctaca 185041 gtaaaaaaag gtaaactgaa gctacatcaa ctatcgaagc tgttgagtcg gcagaagaag 185101 tgaaatcatc tgcagtagcc aattaaatta ggaaccatca gactgatcaa tccctagtta 185161 aagctatggt tctttgtagg ctcttccccg gctagtcctc aagccatgcc agtcataata 185221 gaatctcttc cagccaaggc agttaaaaaa ggactagagt aaagggaggg gcacacaagg 185281 ctcgttccgt acttgactta cgagctcgcc tctatctcta acaaaggaaa aggattcaat 185341 ccagtcccaa gcgtacccgg ggctaacctg ttttaccgta tccttagagg tctgcccgtc 185401 ggtacctcat ttaattaagg cggcccttca accgcttttg gggtcttttt cgacttgtaa 185461 ttgcaaaagc tcatcgggag tacgtatcct ataaaagctg ttacaatcat taaaaagaag 185521 gtaaaccaca atgagaaacc tacttgtact agtaagttga ttaccatttg aagcaaaaag 185581 tgcattaaga gtcacgatta tgctcagggc gactgctcca gtcagtactg gtatggtctt 185641 agtggctctt gactatttgc atctttctat gagttggcta ctagagtaat caaagagtga 185701 cggattcagg aattacttgc tagagattgg acaagtgagg attatcggga gggtcgcaga 185761 tttgctcgcg aacaactatt cggatgggga acttcggagt gaaaattctt ctttacccac 185821 cccaactgcg atgcaggatc ttcttgtcgc ggacaaatgt ggaatagcat ggcctagatt 185881 gtgtgtgtat taggtagttc tctctctttt ataagaaaga actattatac aacatcccca 185941 ggcttgagtt ttcttctcaa ctctttctat ccaatagttt gctgctgctt tctttgattt 186001 accccacaag gaagaatcag gcttatccta ttatctttat gccaactgta tttaaatcta 186061 tcatcacatt atggttacac cacaccattc gaaagacgaa catcaagatc acatttcagg 186121 gactacaaga ttttgagatt ggcagcatta agtagaaagc attattgtag aagggccccc 186181 catccaatag atcaggggtc ttgccctata acctagtgaa tagggaagaa cagatcaaat 186241 atgtctagtt aaaatgtctt tcctttgcgg cggagtgaac taagcaccaa tcggaatgcc 186301 tagcgtgctt gcccctcttc cctctgcttc gacttaaaat gtgatctata cgctcttcga 186361 tatttttaag gagatctttt ggtcgtatcg tagaaaaaga aagatcaagg cggagtgaga 186421 tgtccctgtc ctgtaactat caatcacttg aataatcgaa gagagatggg atcctagggc 186481 agatgaaaga gatgcccgtt tcatgatcta ggtgccgttg attgattcta cttcttttcc 186541 ttctttctcg agatgttgtt ggtcttcaat ctactgatca taggtagaag agagagaaaa 186601 
ttcatcttat ataaggataa gttcgttctg ttgcacttct tcatgtcttg gttcactgtc 186661 catctgaaga taggaagtga ggggcgcacg gaaggaggta gcttcttttt cggtcttgtt 186721 tttcaacaag cggaataacc caatttgtat ctcggaaatg aatcccctat tgaattgaga 186781 tctttagtac agaaatcatt ttccattctt tttgtccttg gccgcattag caaaacattc 186841 ttatctgact ttggatccag accttcgaac cctggaacga tggaagctat cgaaaaacta 186901 acttcactcc atcttcccac ccattctgtc gatcaaatgg ttataatgac ccaaccctgg 186961 aatgaattag atgcataaag tggtagagat gtttgactga gatggatgtg tcacttgaag 187021 cttagagttc catgtgtcag tcaagcgaga gggctagatc aaggtggcaa gcggaaggaa 187081 gagggcatag agtcaccgac taaagcaagc caggaaaggt aattgcttac agacagacca 187141 gatttttgaa tagcagctta ctctcaaaca ccgtattccg ccaaaaccat ttactacgca 187201 aacaagaccg gcaactggtt gagctgatag gaccacagct gagattgact caagagcgtc 187261 tgtggctaaa agactagcag catattcttt cttcaattgc gacaggagct cattctttcg 187321 cagcttattc gtcgtttggg caaatggatg tgggatggag cttgaactaa gggacccggg 187381 gcgtagcctt gattcaaagt gtctaggtac caagagtaaa ggaaggaggg ctaactaata 187441 taataggggt aggggggcgc taacgagcca gaaagggccc ctttattagt aaggttgctt 187501 gcttgtcaca caggtcgtct ttggctcgtc tccttccgcg atacgcacct ggtagtatcc 187561 cgcccggcca tagatccaac ttcgagaagt atctcgcgcg ggcgaagaag tctgcaataa 187621 aagtggatac ttgtttttga tagttaggtt accttgttga gcgcccgata atcaatgcac 187681 aaggctcccg ctttttctgg aaaggggctc cccaaccttt tcccagcagg caatcaatat 187741 agggaaaaac tgccttggag gctggaatca aaataaggcc tccatgagtt cctgaaattt 187801 attcctgagt tcaaccaatc cccttaacta atagatgcta gttggggggc catccgctta 187861 acccatagcc ccagtctaaa tagttggggg tttggctcca gctccaactc gatcttgtgg 187921 tagactgccc tactagggcg acttggcaac taattcgtgg ggcatgatat cccaaaattc 187981 ctcaagtact tgctgcactt cctgagaaag aagtagtctt ggtattggat ccgctcttct 188041 gagagcatga acatcaatta gattcaattc ctcttcttta ttcttaagag aaaacccccc 188101 cccgaaccat tcttaagacc accaagctct ctcgaatgaa ttctactctt tctgctttcg 188161 agatacaaga gtaggcaaat tgcttcccca ctaaggtgta aaactttgtg actagatcgt 188221 cctgaaaacg gatgcgacgg 
gaagaagtgg catttgtaag tgttgctgac ttaacattta 188281 ggtttcggca taacaaagct tgcactgtct tttcgcactt aggcgcacag cgtgccattg 188341 gtaatgattc tactacggtg ccacaaattc gcaagctgat cctaagtaat cgttgttgtt 188401 cattggattc cggaggaact acttcgttag gattggggaa ttgctgcgat tcttcctgaa 188461 tagcctggag gctcccgtcg agagtcactt tgaaagtcgt acctaaactc ttacgactga 188521 atatgataaa gcctataaaa caaagagcta ctatcatttc ttcattatag attgagatct 188581 tcttcgaact taatgcacaa atagatagaa tagcagcaaa taacatcttt ctaccatcca 188641 tattcgtaat actcaatctc atttagaaag cttatcccca tttttttttc agcaactgaa 188701 cgggaagtgg cttaggaaag gactaatcgg atgcgctcgc ccagcgagaa aggccctgac 188761 cctgccgacc aagaaaaaaa gagaacgaaa ggagaagaga acttcgtatt ttccttattt 188821 gagaagaggt acaaagtgac tcgactgaaa gaagaggtcc gaaggaacaa aggagctcga 188881 ctgtaaggtc caccaggtct cgacgagcga ctgaaaggag aggaatggtt aaccttgaat 188941 gctattaata aattctacag caatagtacc tcggactcgg aaagttataa cgaaaatggc 189001 taacccaata gcggattccg cagctgccac cgttggaacc aatgaagcaa atacttgacc 189061 catcatatca tccgaagaaa cggaaaatac caaaaagttc gaattcacag ctaataacat 189121 tgattcaatt ggcattgaca taataggaat atttcgtcta ttaaggagga ttccccgaat 189181 acctaaaata gaaataatca tagaaaatgt gaaatatttg ataagatcca tttcgggaac 189241 gtggaatcta agataaattc aaatgttatt agatgacaag ctcgaccgaa cacctgaagt 189301 ccttgatatt taggttaggg agacacgcgc gcattcctga caacaggcac gggcgataat 189361 ccaggcaagc ttccccgcaa gcctcacaac agacacgtcc cagctgtccg accgtaggcc 189421 ccgctgctgc ggcaaggcgc ggcagccctc tgaccctgcg aaaaagagct tcccgtcggt 189481 tcaggcacca tttttttgaa taggtgggac ttcggtgata ggttgttttt gatcccatga 189541 gggatctagt gatttccccc gttcccccca agaggagatt ggttaggaac gggatgggca 189601 acttcagccg gtgctggcgc ctgcgtaagc ggctcaagag ccgctaaaga gaaccaatag 189661 acacgacagc tgggccagca ccagaagtgg atttactaaa tcccccttat ctatttctgg 189721 gtaaataagt tggtgactaa caacaaagac aaaaaaactt gctaccaaga agacgatgga 189781 taagagaaaa attagggtat tttctccttc cggtttgact ttagaaaaaa aaaaggaata 189841 atttacaccg actaagatca aagaaactag cagactaatc 
actaaaaagg cctttggttc 189901 aaattgttac atctatctta acttactcaa ttatctcgtg aactatctgc ttcaaaaaga 189961 gagttggttg aatgagactg aaaccttctt tcttcgagga gggcaccact tcgactgatg 190021 cgggcgcggt tccctcgtcg tgaaataagt aatgaaaggg aagacccgcc cctggtcaac 190081 tttattgttg cctgtatagc tttttagaac agagatctag caacgcattc taggcacaga 190141 tcattggtat tggatgtatc gttaggttgt gccgagtcaa atctgctccg tttttttttt 190201 tcgttgcaga aaagagatcc cttccctccg ctttctaaaa tgaggaatat gaagtccgtg 190261 gggctgcagg tactatggat cctctgactc ccaatcgggt tgccttccct ggtctcgccg 190321 gtcttacctg taggtcgtga tgtgccttga gatggccgtc tcctgtcccc ctgccggtgg 190381 ggactctgct cccgggttgt cgctccgctg cgtctggccc gtcctctagg cacctttgga 190441 aggcagggag tgtgacaacg tacacgcttc cctttactcc atagggcctt ctttctcatc 190501 agttgcaggg tttggtggcc cgaaagaaac ttggcttcgc caaaaggcct catctctgga 190561 agcccggctc gcgaggcgtc cctcccgctg tagggttcca aaactcgcct cgctcttggg 190621 acatgctatc tttcccctct ttattctcaa gtaaagggtg agtcccatgc gtcagtttgt 190681 ggatcgcggt ctcacgcact aatccctaca ggcgggtgcc cgtagtaggc cggccgccct 190741 acctaaacca atcatcatat cggtccctaa gccccattgc tggaaaggct cggcttcaaa 190801 accgtacgtg gggcttccgc ctcatacggc tcctctaagg atggaggtag gcccagccca 190861 ggcttgcgcg gttaaggttg ttgtacacct gccaatcaag taaaaaaaaa gaagagaacg 190921 aaagctttag ttttgggggg gttggttatt ccgtgtatgg atgatcacct tgagccacga 190981 atttagtaat gcaaggcggg cttctttctc ccttcccgcc gctaaatcct ccgaaaacgt 191041 ctccctccat tcggctatct gcctagcccg gaggcatcct tttcttgaaa gagtgcccac 191101 ttctgatcta gctcgcgcac ctccggatcc aggcgcgcgg ctacttccag acgacgggcg 191161 ggatcctccg cgttatcgaa gttccgcgtc aacctctgga cccgcttaat atctttctcc 191221 aattcgagat cccgcgctcg cacagaggcg gaactctcta tttctccgta taacaacgaa 191281 gaggggggga gcgaatcatt aagatctatc gggttaccat tcctatcagt agctaggccc 191341 tggtcctgat aggggcccac tctatgagat gccccagctt cctggggcac ttgatttacc 191401 gagggttccc cctcttccgt ttggggcgcc tccgccacct gggagtccat gtccgttttg 191461 gaggaggaga atgattctcc gagaacgtcc agttcaaagg aatcctcctc ccacgtggcg 191521 
gagctctcag ccccgttagg gcccagaagt gctcccgctc cgactgcata cataagaatg 191581 ggatctaaca ggggcaaagc ctgaccaccc agtagataga cgactttgat ccggatcaaa 191641 gatagcacga accccactaa gaagagatag agaatcctct tcagcggtcc tttcttacga 191701 aagtaaaaaa gtccaaatat aaataccact agcgcacaga atgaaacgca cgtagtgatc 191761 attatagcgc ttccttctga tcctagaaaa cgtccgaata aagaacctga tacggaacca 191821 ccgaggaggg gcaaaaatag tcgaagcata tcaatattaa taaccatttt atattttttc 191881 atttttgata ttatttgaga gtatttttga ataaataatt tacggcgagc tcttccgagg 191941 aatgcttaca atagaagtgg ttgtggattc gaaccactgc aagggtttac agtcctttgc 192001 tctaaccaac tgagctgaac cactttgttc tcaactgaat ctctttctct tatttcattc 192061 cactttgttt cgtttagtga tagttagaga gaaagatcac tccacaaagc agccttctga 192121 ttatatacgt attattctat caatcggaaa gtagagtctg gaatgagatg aatcttctta 192181 tatacgtatt tggataagca atcgataagc ctggtttgtc tgtctcttct tttggaacga 192241 gatttaactc tttcttcttt gggaactaaa agttgtttag ctttcattct tccttacggt 192301 accctttcag tccaactgag caaattagtc aaactttgta ctcgaaatgc tacttccctg 192361 agttttgttt caaatacttt cttacttcac tgggaatatg ctgttattga actgtgttct 192421 ttccttattc ccctggtttc accggttggc ctgtgcccgt actactcgta agctaaccac 192481 taacctaatc cgcatctatc atccggaaca tcttctgttg ttttctgaaa cttttttgta 192541 ttagcttgtt tcgttatcaa agggttcgtt tgtgtaaggc gtagttcacc ccctatagaa 192601 gtgagctggt taatcacagc ttacttcttg agtgatcccg gaggcagggc taaggcagag 192661 ggaaggtgta gttcgcttac ttctacctgg gagagaaagt cccctccttg agaagcggaa 192721 gaaggggaag gaagactaat gggatggtta aggcatagcc gaggccatta gtctgagtga 192781 gcggctcctg ttgctttgca gctgcggtat aactcctatc cctgtttgtt cgatgggaat 192841 aggtattact aagccagcta gtttcctacg gcgagctcga gcagaaggtc tgcaagccta 192901 acaatttacg tacctcggga gggaaatcag actcagtgga atctatttat ctaatcttac 192961 ttgaattgga gtagcaagcc cttcaactgg cacaacacaa gcaaaaggaa agaggcaagt 193021 ctgctatctt cgcctcgttg tctgccttga cttagtcttt ctttccgaac agaaatccgg 193081 gtttcatcgg aaagatcaca aaaattaatc cactgaggga cttttacgag aaggcttttt 193141 ttcttagcgt ttaatggcat 
atttatgagc cggttgtccc aactactata cgaacagtcc 193201 tctccattgc tctaggccaa ggctggcaaa tcaaacaact ttatgttaat aacaccttta 193261 acttatgaca ttttaatgaa tgaggtgcgg gcaacaggcg ggaataacaa gactctatgg 193321 cttaattcct tccccaccaa aacccttact cgggcctcga cagggtgaga gtagttggaa 193381 atgagcgtct cctttgtttg cgaggaggtt gcgttgcttt cgcgtgaaga ggacctgcag 193441 acggcataat ttgcttatgt taataaggat atgaggtggt gggatgccga atggtagtct 193501 tattctgtgg agtcgatcta agtcgagcta gccctagtat ggacggtttt gagtgccggg 193561 acctttacca attctgatct acgatagggg tattccagtg gtatgttaat ttgattttcc 193621 tccttcacac aggccttcca atgtagattc ttccaaagat atactgacta aaaggtattt 193681 atggattcgg attggaggtc ttgggaatga aagtcttata gaggagactg gcttcgtaag 193741 taaagtaaat taattttaga tagatagctc ggagaagctg gagaggcacg gagtcgagaa 193801 caacatctgt tctgtatact agtcgaccag agagaagccg acttatccga caaggctaaa 193861 aaagaatggt ttattgacca gtttccacgg actttcgata accggcttgt ggaggtgccc 193921 attatgtcgt tcgttatgat gcttcttgtg attgaggtgt ccggtgggtg gacgttccct 193981 tatgttcacc tctcgccctt cgcttaaaaa caaggatcga atggacgtgc tatttttgaa 194041 ctcttccagt gaaaaaaagc gatctatgca tttatcatag gcaaaaaaaa taaggtattg 194101 aataagctca gttctactca tatctatgtc aatttgatga aagcaattcc tacctttggt 194161 agctctcctt aagctcggaa gtcacggaac tctcttccta aactgaaact caaataggaa 194221 agtacactga ctgagttcag tcgtgccggt ctttcttatt ctgattttca gagagtgaag 194281 tagttctggg aaagagaaat agacttactt tccagtaaga taagagccaa ggaagagaag 194341 gtaacgagag ctagagcttg aatcaaacag gtcctaggag tcaagtcatg agcgtacgag 194401 tctagagtga atgtcttatt tgtttgaatg gtcctgttat tgctaggaag gtatggaaca 194461 cgggtaagct agccaacaca gggcaataca atagtacaag agagggagta agggatgggt 194521 caagtaagta aagcacgcat aggaataaag agagcttccc ctgccttcct cggcacacgc 194581 atataagata taaaggtacc ccgtgggcac acacacgcga gagttggaaa tgcatgattg 194641 gccgattcga taatatccgc ataatagagg gaactcaaga ttggatactc tatgtctttc 194701 tttccttctg agtatttgga gatatctgaa ctggatactc tatgtggaat ggacactcta 194761 tgcctttctt gactaatagg aatttctacg tgttaaacac 
tcaactggtt tcttccctgg 194821 attgttagct ggaatggctc aattcgaata tttacgtgtt aaacactcaa ctgtaatggc 194881 tactctatgt cttcctccta cttaactcag tggttagagt attgctttca tagggcacta 194941 gtgattggtt cgactccaat ccaatagtag gtaactgctt tccttctgtt gcttgtctga 195001 gactttccgc gtatatgata gaacgataga ttgatactct atgtcccttt ttgatcccta 195061 actaggaata gatgggtgac tgttaaccac gagactgagc tctttcacct tgttattcat 195121 agataggttt gttctgcttg cctggcttgc cttgattcct ttccttgctt gtattttatt 195181 gattggcttg gaataattgt ttgataactc ctccttctat cactggctta tatctaggga 195241 acaccggtag gcaaatacag gtgagtgacc acactccctg gcttaatgga cctgcttgta 195301 gtaaagagag aagtcttggg tcaaggaagt aaagaaagaa caggccaagc aagtaaggaa 195361 ccgggaaagc gagccaagca agccctcgag tattggagtt gggatgggcg cgtaagtgca 195421 catcaaatga gaggttgaat tccttccacg cgagagagct tttgaatcaa gctaagcaaa 195481 tcaagctaag caaatcaaag aaggggttcc ggtcaagcta attaagggag gcccaggttt 195541 tggttctcaa aagtcatgca cttaatagct tgcaaaagga aagcgagcct atatataagc 195601 aggtgatcct ttcttagaga gctcaagttc ctaaagctaa agagaaagcc aagcaagcca 195661 agccagatga tacgcgcatt tgaaccattg accgaaagga aatgcactcc caagcaatga 195721 gagaaggatt cgaactagac ttctcatata aagagaaaga tctacagcta caactccaaa 195781 tagactctta taggaaagga atcccagtag ctacttatct gataggctgg ctcatccaca 195841 gcccattcct actacttatc tattgatcgg aaactccttc acatgaggct tcttaggagc 195901 ccttctcttt cttccgttac tgtgctgcac gtattcgcag cagcggatgg gagatgaacc 195961 gagcaactaa ctagtaccgg ggaactccca ctccaagcct tgactatacc ccactcaaag 196021 ccttaactac aatcttatct tggaattcct tcctccgtta gcagaacata cgacttctct 196081 gtatccaaag caggggatta ccccgatcaa ctagtaccga ccccaggaac tcgattagca 196141 gaacatacgg cttctctatt agcagaacat acggcttctc tatatccaaa gcagcggatt 196201 gggaaaggag taaaccctag aaaccgaagc caatagccaa gacatgaata cacttttgtc 196261 gatccatctc acagacaaag aaagtaggct tatgtacggg acaccaagat cggtatgatg 196321 ccctggccct tacccattca cctggaccct accattcata cctaagcgga ataactagaa 196381 agactgccga ggctaactcc cggaatcgcc aacccgaatt cagctaacag ctgctaaccc 196441 
gccaacctac catcaaccta ccaataagaa tgaagctaac tgctaaagcc atgcccttct 196501 ctaacagcta aagctaagtc ctcaagaaag gggaatccta tggttaccta ggcatccggc 196561 tacctatcag gacgagaagt tacgtagaaa gggaagagtt ctagagattc ctccgatacc 196621 ttttagctgc ttcacgcaga gactccaagg cgttctaacg gaagaggaaa gagttgttta 196681 tgccttccct tcgcccggaa agagtttatg ccctctgtct tagggttcat tcaacagccc 196741 catctgatcg ttcatatgat tcagcacact cttccgtcta acatagtttc cttgttgctc 196801 ctcctcgaga tgcccggatg gatagaagac ttatagattg aataactaga ttccgcggat 196861 gtaatgtagc tgatagtagg agtcaacgta ggaaggggaa gatcctcacc caagcacacg 196921 ctagccttta tacgttagca acatccgcta gcctttctac tggagattcg catctcaaca 196981 tccgaataca ctgaactcct ctatcaggca tctcattcac atcattcctc tatcctggca 197041 gctggatatt ccatcactcc tctggcatct caacatccga atccccgagt tgacttccct 197101 tctatcagcg gaatagcatc ccttcggaac agcagacctt ctgccatttt ctttagtcct 197161 ccattccttc tttagtttgg taggattagc agagccttct atcagtttgg taggaacagg 197221 atcccatcta tcaggagcat ctcttctctc agttgaacag catcccattg atcttccttg 197281 ccatagcgag agatgtatcg acggacacta tagagtgaac tcggcgagag gggtccctaa 197341 ctccaggaga agagttgtca atgggcttag attaggaaag tgttgtcaat ggcttagatg 197401 aatcaaagca ttagcttcct cttaccaagc aagatcatat agggagggaa tacccactat 197461 ctatgggcgt aggcaataag tagctccctg tttgtgttca ggtaaagccg accctgcttc 197521 ctctccgttc tcaaagcagc aatacactca agcactttac tcgattgaac aaggaaattg 197581 tattaggtga agaggactca gattccggaa caaggaaacg gcttatgtct tccctatctg 197641 cagaagcagc aggaatgggg ctctgactca ccgaatacct tcatcatcca caagcaccca 197701 actgagggaa ctgttcttca acatacgaca atcttccttc ccaacaatca agctagtcct 197761 ttcttccgta cttttgttgc aagattcgat tggccgcttc tccttaccct ctcatattct 197821 gtgtttcata tgcctctctc ctccgctttg cttccttagg gcctcgctgc tttgaagtgg 197881 aagatgaata gaccgaggct ccggctactt taccaggacc aggagtctct gttaaccttg 197941 cagacctttc ccacctatag ttcgttgact gaggaacttt cctacctata gacttcacag 198001 ctacgacttg cttgctgata ctgatgagga gcaagaggac caatccaaac ctatagactt 198061 cactgctacg gctgctttag 
ctctactatg ggtcgggagt cctgctgatc tgtccttcct 198121 cgtcttctcc atgtcaagac atactggact tctcaatcag taagagagaa agggctaacc 198181 actaagtagg agaattgttc gggagatccc gctgtctctg aacttgctta cctagttgac 198241 aaaggagggc agttgctatt gctaaaccaa aggagaacac taagccttac cactaagagg 198301 agcaggtata aggggggaag gctaagatag cagggttgaa ggatcgttga gacatctgtt 198361 tattcttttt ttacattgaa cagggatttg accgactcgc aggcagcgct cccgaagcga 198421 gaaaggggat tggctcaccg gccggcgctg cgtcaacaag gcccttgggc gacattccag 198481 gtctctcgaa tgcctattca gagggatacg ggacagagca actcgaagtg aagtaaagtg 198541 ttctagttac accagttgga ttattaaagt caaaaacact gattccagta aggagttcaa 198601 atttctattc tcaggcccgg ctatgtgatc agattttctt tggctattgg ggcagaggac 198661 cggatgtcgc cttctcttac ttgctcaata gagcgaacag gggcatgcta cttttttttt 198721 ggtcttgtct catttcagtc ctcaaaaggg gagaagaggc gcgtctcacg ccgctttcct 198781 cacttctgtt tgttagtaaa ggttcgatct aattgaaata ttctatgacg ggatccccgt 198841 ctcctctttc cttccgatga atggattgac agcttacagt gccttgtaga aaagcattgc 198901 ctcattgttc attcactgat ctacgactga tatagggcct acctggctca tagagactcc 198961 gcctggaaaa tagagaatag ggccacatac acctttcttt tccacaaaaa tctctcctta 199021 agcgttgagg ggacaattga ttcgtaggac cattaactga tattttcact tcttgtttag 199081 caaatggctg gtagatcttc attgttctta ccggaggaat cataaatagt cgcccagtca 199141 acttcctgtc aaatcaagac cgaaaagagt tctctatatg gaacgtaaac tcatcttacc 199201 gggcgaaagg gcacataact gactcttcct tctttagctg aagcgaaagc attggacagc 199261 aggtcttgcc tgccccaatc aatacagaga tctcgtccca cactacttca tctttccttt 199321 actcctttta ggggtcaggt ccgcttaaca tcactggtgt agaccgaagc ctatactcac 199381 tcctccccat cagaacgact ctcagcttca gccattcgag tcagaaggga gagagggggt 199441 cgaagcgcga tgcttctgat ctgccgattc cgaggtcagg gatgatgatg gctgtggtga 199501 ctaggcctag tggcagctaa tccaactgta attgatacaa accttgactc tcgctatgac 199561 aaggaagctt tcaatcaaca gaagcaacaa ctatgcctta aacggagaat ataactattg 199621 ctcacttaag accgatttca agcaatcatc caagcagaaa tctcttcctt tcgataagcc 199681 ttgcctggcc ctgctttacc tgccccctca aaccaaaaag 
gcagctaagc cataccccaa 199741 aaccgactat cagatatatg ggtagagtca gagtacgagc tcaatatgga cactaccctt 199801 aaagaaatag gcttctccgt cggagacaga atggctcaga cctggagaca aacaggcgaa 199861 acttttctga taatgggtgg gaatgaagga gcaggcgtgg taaactgaga tcaggctcgt 199921 tctcacaacc agaatgagaa ctcctctcga agttattact tctcaacaca gaagctctga 199981 tccgagcgag actcttcttt catttcgata caactgtaaa caagagtagt gccggtactc 200041 attgatgttt ttgtaattct ttcagcgggg gttgataatg ctaataccag aggagagtag 200101 ttcccatagg tgatatcagt gaccaaaccg gaccgggagg agtacgggcg ggcttccttt 200161 cgagcggggt ggcgagagtg gtcaataaat ctcactttct agactacggt tacggtctgg 200221 atagtaagaa aggaaaagaa aggtagtaga tcgaagttta ccagagagat cttccggaga 200281 cttggtagta gtagcgagcc ttcccagatc gagaagcgag aaatcttccc ccattacaga 200341 ggcgcagctt gcctctgtct ttcccccata taccttattt taagtatcag actctcgaat 200401 gcgactgcac ctatcccttc ttcggtgcct cttactccga acgagtcacc cttgaatatt 200461 tcaattggtg tgctagtcaa ccagtcgtca tcccggaatc ccactgagtt ggtaagcaag 200521 cctgggcgtg gtaatatgcc cattgagggc caagcagctg catctgattc cgactcttct 200581 ttctgagact caaccatcag gagctcgtgt ttcatgggaa gcagtcaaac tctaacgttt 200641 cgcctagcgg agccgatagt acatctttct tcctatcccg ataatcgtac atctttccgg 200701 taggagcaaa gcagaggaag gtgctttatc gagtagttct attgggagga aacgaaagga 200761 aaaggtaaga gacccgggaa aggcggggac atccattggt gcatactctg aacgagagac 200821 ggtaggttca tcagaggaag aaagatcggc acttaggttc tctgaccaag catctatata 200881 tctctcggtt tcggcatctc cctcctactc atcagtctcc agagtggtag aggaatcagc 200941 ctcatcagtg ttagagtcat ccccctcccg agttgtagtt gaacagatgg aatcaggaac 201001 atctgtgtgt ggagggccca agtgatacgg agctcctttc tagcgctagt ctttgaatcc 201061 cgtccttcgg ctaaggttga tacagaacag agaagaggaa ccaatacgaa atcaaagtag 201121 tgagccggag agtccctttt ccgcccccaa ctctcgattg cccgacctgt tcacataagt 201181 cattctcaag ggtaatggga aagagagata gaatacccaa taggcatagg gtagctcctc 201241 tcgtacaagt gttcaggatg tagcgataga agactaaaag gataggatat cgaggagatc 201301 atagaagagt gatcctcgca tcaaaataca gggcagaagg aaaagaagag acatccagaa 201361 
actagggaca gaaacctgac atccatttcc attgaatagg agacagaagt gacacacatt 201421 caatccattg agaagccaaa gaaagtcctt caccctttac ccacccatct ctctcagcac 201481 agtgctgctt cttcccttgt tgcggaatac cacctctctc tccgaatctt tcatctccta 201541 cacttccgat acatcagaga cattcgcgcc tcgctcttac cgccactcaa ctcaggctta 201601 ttctcctact caaggtaaac ggcatctcca cccaactgcc aagccgatct ataagattcg 201661 cgtatagcct ctaccttccc cgagtgtacg cccgccaagc agcgcaaacg atccaattcg 201721 aataggccat gaaacatgcc tctttctgcc tctctagccg gattctgctc ttagccccat 201781 gtagatacct ggggacacta ctcttgcttc ttccctatcc gtgctccacc ctacgccagt 201841 tcctattcct actccggagc ctattcatac gcgatgtaga atggatccct gcaggtctga 201901 gtcatcatat cccttatttt ccgctagctt ctcctcccct gacagttgag actcttctga 201961 tagcaagact tcttctctcg ataaagcaac tttcattgcc tccggccaag actgcatccc 202021 tctcggcttc gctagatgct aaaaccaagg gaaggagttt gttatcttcc tgcagctact 202081 gttatatgcc tacgtaaaga tatgtgtccg gccccgcagg gggcttccca taggtgtgag 202141 cgcagggcgt taatgcagct actccgtcaa ccataggaaa cagtactctt atcccctatg 202201 tcttgtacct ttcagagctt cggctggtta ggtaaaagat gaataaaggt tagcttgctt 202261 gcttccgccc ctcactccgc ctcgtggcca cgttcatgac ttcatccctc cagctcaaac 202321 tacgacgata tggtgacctc tcccaccacg ggtctcttta tagcacctat tccgcacttc 202381 cggcgcagaa agaaagctcg aacccttcct tgcagaggag tattatcaag ctactgttac 202441 gtttgtgacg caaagacctg attcaactag accgttgaac aaagaatccc agctcgtgta 202501 aatatggcac aaagaaaaga cgacacctgc cccttcctcg ttctagcctt cctcgttctc 202561 agtcaacgaa tcggaatcgg gatgttgcca gtgcccaagc gaactttatt taagaaggaa 202621 atgagcgtag attgatctct gatgctagaa atgatttatt ctttcaagct agtctctgat 202681 cgtaaagaga tgaaattcca cctgctaatg ggatgttagg ataggtctat ctctttccct 202741 tctatcagtt tggtgagaat agcatccctt cttattaagc acacaaagca gggatgcaca 202801 tatatagtaa gttgatcacc ctgagtagtt tccttgccat agcgagagcg ttggactagc 202861 tgaacaggta tccgaactta aataaaaccc tgactctcca ccttcttcgt atgggaccga 202921 gtggtagatc aaattggaag tagaaatgtg aacgaaggtg aagattaagg cttggttgct 202981 attgggatgg gaactaggtg 
gaaccggccg atcgatggca atcactggtt cagtagcctg 203041 cagtcagtcg agcttacagg tgtcacatat agcagacttc tttctccgga ttggggcaag 203101 tgtagagggt cacggcctag ttactaattc ctttcctctt ccaacgctct ctttcttctg 203161 gacgaggtta ccctatctcg acttacttat gctatcggga atggtgttgc acccatagta 203221 gttgttctct tctcatcggt acctctcatc catctatttc ataagagaat aaaagttgag 203281 ccagttgaaa gactagtcga gaagccctat tcagaaaggt agatgagagc ttgcggagaa 203341 ggcaaagttc tcatttgtcc aatccaccga atactatcct ttccttcttt taccaatttt 203401 atcgtcgagt tgctgatacg tttgttgaac agcaacagcc tacatagccc gtctccctct 203461 tttcaaggaa gaaagcatca gacaaatctg tagttctagg tccttttgtc cagttcctga 203521 aagcaccatc gtggtttctt cactcaggtt tgctaagtag tctgggagcc cgactgtcat 203581 tttaaattta tcgataacga tttagctgct cgcgcaacca ggacctttgg acctccctga 203641 gtcgatctcg ctggcgaaag atacgaaaat ggcgctggag gcttctgacc tcccagtcca 203701 gtatatctat caacgcctga cgacgccaga tattaggctc agtctggagg tccctagtta 203761 agttttgtac ttgatttaga tgagctaaca ttacgtcatt ttgcgccttc gtagactggg 203821 ggccctcgac ctctgcgtac aataaatgta ctggaggaag ctcgttgaga tcgagagcag 203881 cacgattgac tcctagggtg ggctgctgct cttggggcgg ctggttgaga tcgagaccag 203941 tcgcgccgcc atcaggccca acgccaaaat gaaagttctc atttcccaat cccaacctca 204001 gcccccaaac aaaatgtatt aatcgcagta gaaacagaac aggccccaat gttggggcta 204061 cgaaatatat aaaaaagtat aaacacccta acagtgcctc ggcactgaac cgacttgaat 204121 ctgaactacg attttttcca agtcttacca aaatcggatt tccttttcgt gccatatttt 204181 ttgactttat ggatttctgt cccttttttc ttcccggtac aatatttgtt ctcgaaagtc 204241 tgagtttccg tgtactctcc aaatttatga ccaaccttcc cctcagtgat cttagaacgc 204301 tactacgcat ctttactata tagtgtggta aggtaggttt gggtatagca ggacttgaac 204361 ctgcgaccat taggttaaaa gcccaatgct ctaccaactg agctatacac ccaaataaag 204421 atgtagtagt caattaagat tggtgcggaa aagagaagag gtctcaacaa gtattaacta 204481 gttgatgctg atcgaaccct cagttcgaag ctcttcgcca acatgttacg aggctccagt 204541 cttaggcggg tcaccattac ttgacttagc ttagaaaggc gaacatctgg gcaagggaaa 204601 gcgcagtgcg cctggtcctt tcgaaccagt ctttataaac 
ctggtgcaaa tggctttcga 204661 tagttgcatc gtaaacgggc atccaataga ttctactcat aaagagatag atcagtagtt 204721 taggaatgta tgcttgcttt atttttttct ttctttcctt tcatcggaaa atcaacaacc 204781 taattcaact cacttctttt accgggcgat ggaagactag ggatatatca tttaatcagg 204841 aagggcatca ccagctacag tagttgcatt ctcaccagcc ttcatagcaa gatttcataa 204901 agagtattta cttaggaggg ctatagataa gattctaagg tagggttcta tataatctat 204961 ttatttcctg ttgaggatag tttaataatc tcatagcgca tcgacgtaga tcgcccagtt 205021 ggttctaggt ggggtctcga gtaagggtag ccgcactctc cttcttttat aacaaaaagc 205081 ggaggacctt ccctcttagc tcttaaggaa ttctgaaacc tatacacgtt gttaacgagc 205141 taacctaact aatatagata gaaaaacctg agcgaagtag ctaagggact aggtcggtta 205201 tttcagttcc gaggaaagag aggactagag ctaattctta taagctaatg tgcgatacgc 205261 gttacacgtg ttaagcatat acgcgtgtaa catgttcata tgtatgttca ttcacatgtt 205321 cacatcagtg gtagcttgaa caaacaacaa tttatccctt ctcacatagc atgtctcatt 205381 catattcaaa gcagactgtc ttcatgctca atcagtagca agtttacttc atccccgggg 205441 accgtccagt agaggagcgg aaacgactct gttatcttac attcctcatt cgcattagag 205501 cattagagta tgaatataag caagctttcc ttattccgaa atctaaaaaa aaaagagctc 205561 tagaagttca gtcggtgttc tctcatctct taattaagat atttcttaat cgagaatcgt 205621 gagactcccc aacctagtaa aggggcttga atcggtattt tgtttaggga gaaaccctac 205681 taaggaaagg atgagatata gcattaactc cacttgttgg tattcgttgg gacgcatgct 205741 tggaacggcc tctaaggggt tgttgtataa caagtgcata attgtccact agaacgtccg 205801 tgtattttat catcaacctg atgaatgaat ttcaagatat agggctttcc tattatcaca 205861 ggctgttcaa aaggatctcc tgttcttcca tcaaaaaatc tgctttttcc tggatactcg 205921 ggttcaaata cccatggatt cgctgtttgc ttactggctt catataatta agaaaacact 205981 agttttctcg aagcctcttg ttcatatctc tcatcaaaag gggctattcg ataatgtcta 206041 tctagcagac ttcccgctaa cccaagcgag cattcaaata tctgtcctac attcattcgt 206101 gagggtactc ctaatgggta tgggttgaag accatatcca cgggtcttcc cgcttaggaa 206161 gaattgcgag ccggagtttt gttttagtcc atacaatgcc tttctgagtc tactaactag 206221 atgtgaagaa actgatcacc ctggcgaagg tgtaaactaa acccgactct aaatctagaa 206281 
tcgttttcca ttcataaaag cattccccac atctcactgg aactgagtat caagggctac 206341 agtaagaaag acaagagaaa ggtaagatgc cgtggtgatc taagccgtca acgagcaatt 206401 tgtctcagtt gaatccaaag aaagagggcg aagaaactca ttaattcatt gatgagggct 206461 aggccgagag aaaataaata tgaaagggaa ggagatattc aggtgggtcg aatccataag 206521 gtcatgatcg agttaagctt gcgcttacgt tttacagcat tgagaacttc gaatcatcct 206581 ccacaaccct aacctcacaa taaaatcttt taaaattatc tttttagctg tgatttgaat 206641 gtctaaagtt tgtcttcaac ctccctggtt tcgatatggg ttcaaaactc ccggttaaga 206701 tcaaaagtag gagctaattc gtctatcaca ggtagtagct cacccgcctc tctttctcaa 206761 gtgagacaaa ggcattcaag atggactgct aaaagacata tcttaccttt actttacatc 206821 gcatttattc tttcagaacc ttactatcta ttgagcttga actactaaat ctctattatt 206881 acataccagg gtttactaca gtttcactgc cctgaaggga gctagccctg cttcttcttc 206941 agtgtttgct aactgcttct gtgcttaatc ccgcatgata agaggaggtt tctgaaacgg 207001 acacggactc aaaactccct gagtctatac ctggtgaact ccctggggct gagaagaaag 207061 cttataattc gattggaaca tttacttcga gggcactcat aactaaagta ctggggagcg 207121 gtttgttgcc cttgtaggta gtccttgttg gagagtcata gcttatctca attgagagtc 207181 ctgcgctgcg tcttatctag accgctgcca gccggattga aattccagtt gttggaagag 207241 ctactgcaac taaagcaaca gatactagaa ctattgaatt cacagctact tctgcaacta 207301 actactgaga ctcttacttg aatcatttat ggaactctcg tcggagcctg ttctttcttc 207361 tacgaagcct cctgtaaagg catcatttcc atccgtctca gtaaaggctt cagttgttga 207421 attacctgga gtttcctcaa cacccactac tagagaaagt gaagatggtt catttacaga 207481 ccctgtcact ctttcttatc ctgtagtttc atatggaatg gtttcagctt cctctactcc 207541 tactaggtct acttagcctt ttattcctag gtggcttgct ttcctggggc tgctagtctt 207601 tcctatccct atggtgtaaa gaatgcttac tatcctattt attaggtaac tccccttctc 207661 ctctcctttc ttagctctag gcttcagctt tccttcatca cagtgaatgt aagcaacaca 207721 cccaaatctc ctgagatatg aataagtagg gactgattga aaccatacct catctggaac 207781 atggaaattg attgctgtgg aagggtattt gttgatgatg tgcacagctg tgtttgcagc 207841 atctgccctg aaagtcttgg gaagtccaca ctcacaaagc atacttctga ccttttcaat 207901 gattgtgcgg ttcatccttt 
cagctacccc gttttgttga ggggtgtatg cacatgtctt 207961 acaatcccat ttctccgatt tacagatctt ccattaaggg atctaacatt aagggatcta 208021 acattgattt accattcctc ctatgaagtg aacaactaac caaaccgctc cgccccttgg 208081 cttacttggc ttcccttcgt ttcctaccat tcccaactaa cctactttcc tagttctttg 208141 attctgatat tcctattcct gggattacta gttctcacat tcctcttttc tttctggttt 208201 cccggatttc tcttcttcct actttctttg ctctcctggc tttctggatg tgagggttag 208261 cttaaaaaga actatctcta gctctcccag ctaagatgta cgatgtctga taccgattca 208321 agctcactca gagatatatg atttatgtgt tctatgggtt tgaaaacagg gatggtaggg 208381 gtttcttcca ttgaaaagaa agaaagctta tttaggacct ctgagaccgt ttagctctta 208441 ttcctctttg gaagttactt tgtgatggga ttgggtcttc tgtgagggat cattctatta 208501 aacgaataaa gtgaacgttt ccggacgttt tctggttggg agagcgtaca aagcaaggac 208561 tcggaggtcg gtgtagttag gatagtaata taacctatta cctattggta acctattgct 208621 gtacgtatcc cctataagat cttgtccagc atctttgtct agctaatctt agggactgta 208681 agcaaaggat gtcactatag ctaaccttcc caagtaacag tacctaattt gaccaatttt 208741 tattttataa gaaaacaagg aaagtcaact accctattcc caaaccatcg gtacaggaaa 208801 acaaggaaag tcaactaccc tattcccaat tcccaagcca accaaaccag ggaagctagg 208861 tccaaccatt acccttttcg agcagactca gaaaaagaac cattagttgg tcgctggtct 208921 tatggtaatg gtgactatct gcttcgtttc caatgcgtag tagttctcaa gtgtaggagg 208981 tgcttttagg ttttaccgtg ttttgagctt gccggtgttt tcactctttg aaatcataac 209041 aaaaagtgaa gagttgcgtt ttaccgggaa ttttggttgt aaactcaatt catacttaga 209101 aagccgatca aaaagagaca agcgacgcaa accccggtat tttggctatc gatgaatttg 209161 actcttagga atcatggaaa aatgatcaaa tttgctaggt tttggccatg gtggaacgca 209221 acatttctct ttccgatggg tccccaaaga gtgaaacagg gtttctaggt tttcccctag 209281 ggggagatgg aagattctac tttttcatgt catccaaaga gtcaaagtgc cttgctaggt 209341 tttgtccatg gtggagtgta aactttttgt tttctcatgt ggtcttacga agtcaaaagg 209401 gtttgcatgt ttttgccact atatgagatg gaagttcttc attccaacag aggtcttacg 209461 aagtaaaaca aggtttcatc aattcgtcgc tatatgagat ggaagatttt ggtttcacca 209521 cggccctacg aagtgaatct gtctatattg tcttccgctt 
catctcttca tcttccttat 209581 cttataatac actgtctagc gcctttattc gattagactg gattaagact cctcgtcacc 209641 gggcagcttc ccgagccctc ggaatccagt ccaatggtca gatctgcaca gcctgtccac 209701 atctctctgc tctcttcgga ttcctttctg ttgcgtcttg ctttccctct aacaaagatg 209761 acggactaca caccaggatc tattttgact tagaatttca gccggatagc gccgtgtttt 209821 agggttacga caagctgacc tctgccctaa tcaactgctt tatagccggt catagttata 209881 ccgggaaaga aactccaact ggaaatcaaa atggggggac ttacacttca actagatagg 209941 catctctttt aggaaagaaa atatctgata ataacctgcc tcaaaggata tgtcgtactc 210001 tccaatcccc tcactcagga aactatagcc acagataggc ttcattgcca tatcccgagg 210061 tagccttcca acaagaattg agctagatgc gcttggaaaa cagtgaacta ccactccaac 210121 tggcttgtcg ggccaggata aggtattata ttcaggcagg gcgtcttaca aaagagatat 210181 ataaaaaagg gctggccgta cttctgtttc gacgaatgga atcttagtta agagaacaag 210241 accgtagaca aagccttcca acataagtta ttcaaacctt tttcgagttg accatcaaga 210301 gcaaagcgtg gctttttgcc ccggggcact agggcctgat catctttatg gctataaatc 210361 tgccttcaca gtgatcagct ttttgatccg agtggaaaga ggagtcttcc cagatgtgat 210421 tagagaataa cccaaaagag gcatgtcggc atcagctgag gaaagggttg gctaggggtt 210481 agcaatcaag aaagcaatca aagacgatga aagaatcaat tagtactgcg ccctgatgca 210541 agaacggaga gtttctggtc tctaaaagag tcagacgacg actaataaga agccgataga 210601 agagctacgc cccttcttct ctgttctaga tagaatagga atcccagagc ctggcgaaga 210661 tgaataagag gaattttact agtagtgtta tggaagtttc agagcgagag aaggttccat 210721 atctgttttt tttctctaag ggatgagaga gtgagggagg tcagatctat ctatcaaagc 210781 ggaaagtcca gagccggctg tacagcactc ttccgccgag taccggataa aggtaaaggg 210841 ttagctttct agcaagagag tcaaagccct atccacagcc aagaagcaag ggctgtctaa 210901 agaccggatt ccctctctga atcacagggt attcactgga ctttttatca aactcgtctg 210961 agtaataagc caaggcaatc gggaaaagaa ctcaatgaaa aagggcctat tctcttaagg 211021 ctaacggatt cttagtcgag tctcgtccct ctcctttaaa gtagagtcag cacacttccc 211081 cccccccttc aaagaagctt acctgagttt agtctcgggg gactgatgac cttctattga 211141 aagagtagaa ggtagcaaat ggagattgag attgaaggtg gcccgacgag gaagtcttgc 211201 
ccattaaagt cgcttccggc aaccgcttat ccgtgggaaa gagaggatgg aacttgagaa 211261 ggcttcatgc cgattcggca actgattgac tattattcag gtacggatga ggctgggaag 211321 ggctgataag catcggctag tggaaccact cctgtccttc tttcgtatcc gaggtgggcg 211381 ctttacacaa tatggaaatc tagtacggtg aataaggtag gggctgatcg attaggcgaa 211441 ccagtccaac tagtcaaaaa aagttaaccc cttttgcttt agcaaagctt aggtccccgg 211501 gtccaatcgg cagaggctcg tcgagacctc gatttgctgc acagaggtga gagcccctct 211561 gatgcgagcc cttcccccct actatccacg taacaccaag ctcagaaagt gtacctgacg 211621 agggcttcct cccctccaag cctgtctact ctggaaaaaa aaataactcc catctctcag 211681 tgagtcccat caaatcaaca gccccgagtt gttggcctga cctgttgatg gaaggtccga 211741 gctgctttgt catccagagc ctgggggtcc cctcggttta ttaataggcc ccaatagtct 211801 tagaggaagt caaaggtcct cccaaaatac acagttgaat aaggtgagtg aagtgaaaga 211861 agagaaaaaa gcccgaggtt cgaaggctcg gactctcatc cggtcaccca agagggctct 211921 tctagaagca ctactatcta ctgttccttc tttagtgagt agacggtttt gaacggagaa 211981 tgcagaatag aatcatcaga ataatggata ttccatctag ctggggttgg ctgactctca 212041 ccaggtcagg gggagaccac agcagatgaa tgagggattc agtagcacgg acaccattcc 212101 caaggaatgg aaagaaggaa ccatcaacta gaagaacagg caaggaagag ccctctcagg 212161 tcgactcagg agaaggagct tacgatccac cccgtccacc ggaagaccga acccagagat 212221 ttattacctg tgctttgctc ggtacggaga taggagctcc gaacctgcag agacgagatc 212281 tctactccta gaagtttggg gcaccgtagc tcacctttca ctaaaaagaa gtgtagccag 212341 gttgctaaca tcggctaccc ctgaccttct ccttccgcat ctagatttat tcagagcgca 212401 caatcggacg ctaacctgga actgaccctt caactgacta ggcgcttggg aattctgaaa 212461 cccccccctg aaatgagact tcctccttgc ttgaaccttt gcgacgacta tggcaccttt 212521 tggtactcct ctgagacctt tggaaggggc acggactcca ctttagagtc tgattggaat 212581 tcgcattctc gataagagcc gctacgagat ctcaaaacag aaagtacctt atcgtagaag 212641 acgacttctt tctattctat tttgatcttg cagaatggaa ccctactctg agaggcaggg 212701 aagaccctct ccagagaatc cagagaggaa agaggaaggt tatttcaagc tgctttcagg 212761 cgtgcgccca cgtcgtcctg ccatcagaaa gagcctacca gtggtcccta ctggcacagt 212821 actggcccct cggtcaactt 
ttcatctgaa agaaggtagc ccttctcccc ttctgaacag 212881 aacatttcta agcttgaagc tctttcggat tagatctttt ttcactggat tcacctgagt 212941 agctcagtct gatttccaaa aacggatcga attcttggct cggcaaccct tattcattcg 213001 atttgtcccc tcagatattc taattgaatg aaattaggcc caacggcaaa gagtaggcga 213061 ggaaggaggg aggggttctt tttttctttt tcccctaaat agctttttag aagtgccagt 213121 ccgttcccgg gtaatagtca gaacgaagag ctagaatagg agttctagta gagtaagcgc 213181 ggacgtcttc ctactttcat ttgtcacaat aggttgaatc tctaaataaa aagcgagccg 213241 gcacatgggt tctttgccaa ccctagctct atccatagtg taggatgaga agagatcagc 213301 gacctgaatt caccatcctt ttctccttca ccacaaaata caaactgctt gtagaaggag 213361 gtgccttctc cctttctgag acttctttcc acttattagt gctggcagat cctctttatt 213421 gattgcccac cgaaagttcc ctctacctct aaatatcggg ttgtcatcgt ccttttccca 213481 gtttgcgcca ggagcctccg acagcctggc agactgcaaa atcattgcac ttccttctca 213541 cgatggatgg aaccaggaga gctaggcacg ggaagggaag ctaggctcac tcccgacaat 213601 tatatacctg cattccagac aatggtccaa ccctttgata ttcctaaggt aacagagtac 213661 ataggtgaga aaggagcgct tttctaagct cagcaagacc gaatggaatt caagtatatg 213721 gcactctaac tacgaaagga atgccaacaa gaaagctaat agggatagga ctactagctg 213781 ctagtataga aaccgaaaag aactcctaat agcattggtt gccgctctat ctattctgtt 213841 agttccatag ccgcgcttcc tactagtcgg ataggaacaa ctaactgcct agctacaaga 213901 agagagctac gaggaaagaa gagctagcag ctaggatagg aactctaaac tagctactag 213961 aggaagtcaa gctactaaca gagctcaaag cggataagaa atcggaagag acagacgcaa 214021 ggaagatcag aggcgttcct cggtgctatt gccctcttgt cctagcagcc gatcactcac 214081 taatagctac taccaggagg agaggaattt ttttcgaaca ggatcctttc cggtggattg 214141 gaatttgact cagatttgct taatccccaa gaagctcaat cctcggttaa tgacatagat 214201 gcgacccatt agtctttgct cagtcatgta taagatcata tcaaagatca tggtggccag 214261 actcaaaccg attctgcctt cccttgtgtc tcctacacag tctgcatttg tttcggaaag 214321 gcttatatct gacaatattt taatagccca tgagttggtt tacaatctac gtactcaccc 214381 ctcatttcat cagctactct cccctcattt cattggtcac tggcttagtt gcggaggaac 214441 ctagcttctt cagcgaagct ggaggaaccc ctatggctag 
cattgttctt gcttcactca 214501 cttacattcc cttcattcaa acttcttcta attcacattc acttacccaa cttctctggc 214561 ttagggtcag ggttactcac attcctagtt ataagattcc tagttctggg tctgggtaag 214621 ggaagggtct aggctcctaa cattccttcc caactcacca cttatctccc tagttcttga 214681 attcaaagtt ctgggattct ttgattctta gttctgagat tgttagttct gggactgaga 214741 gtgccaggct gcgacccatc tttgcaaagc aagagcgagg ccaagaagca gcagcactcg 214801 tacactagaa gtgagtatag ccttgccgga aagagattta gctttgactt tgcttagcag 214861 gctggctggt ttgtttggct tggctgtctt cgtagttgag cacactgatg taccagtttc 214921 aagggttaaa gttggaagct gcttcccttt tgtgcagcac acaggttcat ctcatatgta 214981 gctctttcta ccaatgtgct gcattaatcc ctagctatca agctgaaact tgaacagaaa 215041 ttgctctaaa cctgttcgct acaactatag gagtccaatc tactgaggtt cggcttctgt 215101 tgaatctcca gttcacttgc gatggaaagg cttcactcct agcgcagcaa atgaagcgca 215161 gcaattttaa gtagcttctc ctctagcaca gtgctgcatt tagtcgtcag cacacaccta 215221 gaacatctcc tttagacgca agcaaagacc tgtcttcgtc taaacgtcgt ccagcccttg 215281 aaaggatcct tgaagaaaac ccaattcagc acactcctcg tgaagcagcg ggtaatcatt 215341 caaatgctaa agataaaggg ccagctttag agcgacttga gctccctgtc gatgaggctg 215401 gtacctctaa atctaaagaa aagcggcctg ctttggaaag aatcgagacc caaccagtct 215461 tccagcacac tcctcctacg gtcttcgata ggcttggaga tcctgatctg ttaactttct 215521 ttcgtctatg agccgtctta tcttccatag cagtgtgctg gctttccact cctgcttcct 215581 tttagatcaa agttcgacaa gtgcacctct attgattaaa ggcgtgattc gtacatcttt 215641 gaatagtatg ccaccgatta tccaatcaat catttccatc ttttgctttt attgatcaca 215701 tatgtttatc tagtttaaat cagctggtaa tgcaggacaa taagcagaac tgtaaacatc 215761 cccttgtcat ctgagtaatc taagtcttcc tctttttaaa gcagcaattt aagttgagtt 215821 cctttggacc tcttctgaaa ccggttcttc ttcttttggt aatcagagga tttctctttt 215881 cccttttctt gatcttatgc ttttcctctc ttttcgcagg gatggacccc gggtaatatg 215941 agaaatcaga attctttttt tctgatgctt gcccttacgc atccctagcg catcaaagga 216001 aattcttgac tttatcttac acgtgcgtgc tgtcacgcat ctgccaatag taggagctag 216061 cacaaagctg ctaatcccgg gacctatccc ttgaaaggta cccttcgttt ttattggctc 216121 
ttcagtttta ggggtatcag cacctgaaaa gttatgaaac gtgcgattgc cagtaagcaa 216181 cttgaaaggt ttttacaagg acctacccta aatctgatac tttattgggt cggatgccca 216241 gctttcttgg atgggataga tacaacaggt gattgatctc ccatttctgt catccttaat 216301 ttcccaattc ctagcgcagc aactttgact tttcttgatc ttttgctttt ccttttcttt 216361 tcgcagggat ttatactttg acttgataga ggtaagggct cctgcccaat atgtcaacat 216421 gccaacacat gccacaccaa tatgagttga gttggccctg ataagtggtg ttcaatcttc 216481 ctacccttat atttcagagg tcctatcacc ttcatacccc tcgaaacgtg ggttttcgac 216541 tatagacatt tcgatgagca acctaaaagt agaagtttat tgtgggagat agatgaatct 216601 ccaattgata tcatctcgaa ttacgaagat tggttctttc ttttcattcc gaattagcag 216661 ggatatctcc caccttctac caacaagtcg attatcaact gacgagaaat ttgattgaag 216721 ctttcctctt tcctttcctc tgctgcttct tgccagggat ctaccaccaa gtcgatatga 216781 aacatgaaaa gtgtgatgtt ggttttcgtc gatcgatagt aatgtatata aaagcgatgg 216841 atttgagttg acgatcgaag gtgtaaggta gataatagcg attgttgagt tgagtacggc 216901 ttggatcaat cctcccctcg acaaggatca accaagaaat aatcgtcgat gattacaatt 216961 tcaacatatc atgtattgat aggattggtt gatttgattg atttgaagtg tgattcccct 217021 cgtatgaagc ggagaatcaa tgagtcccaa acccgacaat tgatcaaccc ttctcgtctt 217081 ttttcattga ttgattaatc gttttgaatc gattgtgttt ctggttttaa aatatagtca 217141 agtgttacaa catttgaatg ttgatgttga ggagaaagca ggagtgtgct gaggagctca 217201 gggttgcttc gctcacttaa ctggctcata ctacttgttc acagacggac aaagagatag 217261 ttgtacgttt gctgctcaga cgattaacaa agtgtttagt gcaaggaaaa aaagcatagt 217321 ttgatgagtg ccaagatgat tcggagagtg ttcagtgcag cacaaaggaa gaagatctct 217381 cttggcaggg ccggaaacag gtgcgcgcag ttgggaggat gccggggtgt gctagtaagc 217441 aaatgggaag ttgatccgat cttaagtagc ccaggatcca tcccagggga agatctatcg 217501 agtaaccagg aaagagatgg gtaggtagaa cagccagaaa gatagaaaga gctgttgctg 217561 gaaatcagag aaatagtcga gttcttccct ctcgtttttc ggacctctaa ctatcactaa 217621 aaaagaatct cataagagaa gaaatcaagt tgatagatca gttagttgag gcggatcctt 217681 tctttacctt tacgagcacg gaatcgccta ttgtcggagt cttgggaacg ggtcgagggg 217741 acggaacggg ccctccagaa 
cgagggggat ccaggccggc gtcgggaatt aaccgcgcgc 217801 ctggatcagg aaatacgaac cctccaacga caaatccatc tcggtcgaag agctgattcc 217861 attcgggatc ggcaaatcgc cgagtggagg ggaaggttta atacggaatt agcagggata 217921 gaggaggaaa gcgcacgccg tgcgttcctt aattggtgtc tccgtgtgct catccatgca 217981 cacgaggacc aacctcccca gaactaaatc cgttcgttcg ttctcttctt tcttttcata 218041 gatttaagtc tgtttgtgca tctttctttc tcccatgctt tccgtcggtc aaaccaacga 218101 ttctcttctc aaagtaatag agagatcctt ttctagttag aacttctatc aatgcaatta 218161 aagaaccatc ccttcctatt agtttgtcct gtcagataga aagaaaatta ggccccaaag 218221 ataaagagcc cggtgggggt gaagggtggg gtttacatat aaccgagaca aagtggttta 218281 tgattgaatc tcagaggcat tcttatcatt tggtagatcc aagtccatgg cctatttcgg 218341 gttcactcgg agctttggca accaccgtag gaggtgtgat gtacatgcac ccatttcaag 218401 ggggtgcaag acttctaagt ttgggcctca tatttatcct atataccatg ttcgtatggt 218461 ggcgcgatgt tctacgtgaa tccacgttgg aaggacatca taccaaagtc gtacaattag 218521 gacctcgata tggttctatt ctgttcatcg tatcggaggt tatgttcttt tttgcttttt 218581 tttgggcttc ttctcattct tctttggcac ctgcggtaga gatcggaggt atttggcccc 218641 caaaagggat tgaggtttta gatccttggg aaatcccttt tcttaatacc cctattctcc 218701 cttcatccgg agctgccgta acttgggctc atcatgctat actcgcgggg aaagaaaaac 218761 gagcagttta tgctttagta gctaccgttt tattggctct agtatttact ggctttcaag 218821 gaatggaata ttatcaagca cccttcacta tttcggatag tatttatggt tctacctttt 218881 tcttagcaac aggctttcat ggttttcatg tgattatagg tactcttttc ttgattatat 218941 gtggtattcg gcaatatctt ggtcatctga cgaaggagca tcacgttggc tttgaagcag 219001 ctgcatggta ctggcatttt gtagacgtag tttggttatt cctatttgtc tctatctatt 219061 ggtggggagg tatatgaagg aacgaaagag tggattacaa aatgaaagct cgaagacaaa 219121 gagaaccggg cttttccaaa gaattactgc agctttccca ctccctttga ttatcatata 219181 caataaagtc tcttccactt tcctaccaaa tctatcttta ttctggcaca taaatgaagg 219241 aatcgaagag attatggcag atcatgttca ccaagaaatg acccgaaatt agatcttggt 219301 ctatttgaga ttgttccttt taatcgtaat caaagatgtt ttcttgtttc ttgtttcttt 219361 tctgaacaaa ttgaagaacc taatggatcg aactcatcct 
acgaaagata tcggaacacc 219421 aaaaccagaa agaaagaaag tagagctaat tgcgagagtt cctcgttaaa agtgcaagct 219481 gtaattatac gtttcagaat cccacttaat aatttcctca aaaaattaaa tctgcttttc 219541 gtaagacgtg tctttagaat actgttcaaa acaagtaaaa aattcgttga ccaagcgaag 219601 attgaggtga tgacaagttt gcaaccacca aacttggtaa agggtattcg aagcttctta 219661 ggacattagg gttttataga agatttctca aggatttatc gaagattgct aggccactca 219721 cacaactctt atgcaaagat gtggcttttg tgtttgatga gaaatgcctt gaggcattct 219781 tgatgcttaa agaggctcta gtgaccgcgc ccatagtcca actggaattt accatttgag 219841 gtcatgtgcg atgcaagtga ctacgcggtt ggagccgttt tggggcaaag gaaggacagg 219901 aagttgagcg ccatctactt agtgccatag ggcgaagcgg gacatgttcg acctcacgtc 219961 taaccagcgt caggtcgaac gagcccccta gctctttctt gtttgaagac cggaacatta 220021 gcaaagatgg attaggaaca ttagcaagat ggattaggat ggctctgggg gtaagccccg 220081 taggaatact agcaaatcca atgttacatg tattttaggg caaggaaggg ctatctttac 220141 tctttagttt gagagggaga aggcttcctt ttccgcgtag gcagattagc aatccatagt 220201 cttactaaaa gagtcttgga cttagatttc ctcctcctcg aggatgatgg atccggggag 220261 ggggaaaaga tgaggctagt agaaccttgg acgacgcgca agtcaactat gccaccaccg 220321 cgcttacgct cctcgccata gttttcgcct ttgagaagtt taggtcggtc ttacttggtg 220381 ggctccaaag taatagtgca cacggaccac gcggctttga gatacttatt ggcaaagaag 220441 gacgcgaaac cgaggctact tagatggatt ctacttctcc aagagtttga tttggttata 220501 aaggacaaga aaggaattga gaatggagtc gcgaatcact tgtctagact tcgggtcgag 220561 gaggacatac cgatagatga tagcctcccc gaagagaaag tctattatgt gctcgaatat 220621 ttgaaggaag agtatcccgc ggttatgatt ttggaaagca tggaagatga ccttccatgg 220681 tatgccgatt tttttaatta cttggcttgt gatcaagagc ctccaatgtt ccatggatat 220741 aggaagaaga aattcttgcg agatgtgcaa cattattttt tggatgagcc atttttgtat 220801 aagagatgct ccgatggtct ttttaggaga tgcatttcaa aagaggaagt gaagggcatt 220861 ctctaccatt gccatgcatg catcggagta tgcaggacat ttcgcaacat ttaaaacagt 220921 ggccaaggtt tgttttacaa gcaggattct attggccaac cacgttcaag gatgctcatg 220981 ggtttgtttc ttcttgcgac gcttgccaaa ggaagggcaa ctttacaaag aggaatgaga 221041 
tgccacaaca ctttatcttg gaagtagaag tatttgatgt gtggggcatc tatttcatga 221101 aaaaaaccat cttctcatgg aaaccaatac atcctaatgg cggtagatta tgtttaaaaa 221161 tgggtggaag ctcttgcgag ccccactaat gacgccaagg ttgtgctaaa aatgtttaag 221221 aaagtgattt tcccaagatt cggtatacca cgagtggtta ttagtgatgg gggatctcac 221281 ttcatcaaca aagtttttga gaaccttttg aagaaaacat ggtgtgaagc ataaggttgc 221341 cactccttac catcctcaaa caagtggcca agtggaaatc tccaaccgat aaatcaaagc 221401 gagaagacgg ttgggaaaac aagaaaggat tggtccacca agcttgacga tgcgctatgg 221461 gcatatagga cggcatacaa gacacccata agaaccacac ctttcaatct tgtctatggg 221521 aagtcttgcc atttgccggt tgagattgag cacaaagctt tttgggctac aaaattgttg 221581 aactatgaca taaaaaccgc ttcggaaagg cggttggtcc aattgaacca cctcgacgaa 221641 ataaggcaag aggcctacga gaacacaaag atctacaagg agaggactaa ggcttggcat 221701 tataagaaga tcttaccaag ggagttcaag gtaaatgacc aagtgctgct cctcaactct 221761 agactcaagt tatttccagg aaagttgaga tctagatgtt ttttaccctt tcgcataaag 221821 gaagtgaaac cgtatggagc ggtagtgctt tgggatgtca atggtgaaca cttcacggtg 221881 aatggccagc gccttaagcc ataccttgct gatgagagca tgccaagcaa gggaaagttg 221941 acgatctgtt tagcctttct cacaacaact tgctcctctt ccttatcatc tctcacaact 222001 tcctcaagct cttttccact tctaagcatt atgtaaaatg ttcccttggg tttgcttccg 222061 gtttaccggg tagtgttccc atgggtctct ttgatgaaga ggcggtttaa gcaacttggt 222121 tgtcaagaga tttgacatga gaggccaaag tttaaaactt cccattaaga tcattgtaca 222181 tgttgtccat ctttgtgttc atctcaatgg tacttttctt ttgtccttca aggaattgtt 222241 gcaacattag cttcacctca ctatctggtg cagttggagc agaaaaagaa ctcccttgag 222301 cttggtaagg ggcttggtaa ggtgcttggt agggcgcttg gtaaggtggt ctttgttgga 222361 gcagtagccc ctctagcttg gaacccttgg ttgaaaggcc ttggttggaa tccttgacct 222421 ccaacatagt tcaactcttc ttgtggctcc tcataccctt caacaccaaa gtcttggtac 222481 ccttgatatc caccatactc ttcacaagag tttaccgtca tcttttcttt tggtcacgct 222541 tcaatatctt atccattttt gcattcagct cttggatggc atgatgagat tcatcattgt 222601 ttttccttgt agtccgatca taatccgagc catgactccc attgctttat gcgagattct 222661 ctaccaaagt ataagtatcg 
gcttcctcaa ttgtttgact catgaagctt ctcgtagaca 222721 ccagagctct ggagaggttt aaataactcg actgaaagga aaggaacgaa gtcacatctc 222781 tcgcccaagt tcagcagact caatcgcacc cttccttcgt cgtataaaag atagttccga 222841 tcgttgagtt taatggagac acaactagta atattgatat tggaaggaaa aggataagac 222901 ttaagagaac cctccatggc tcttgatacc atgtgagtgt tcatgggctt ccaactaaaa 222961 accaattggc aatgagtgga gaggcccatg tcctttatat actagtgtag gtcccttctc 223021 actttcaatt agggattcct aacaaaggga agtctggata cttgacacag gttgttcttt 223081 cggaagaaga gaatggtttg taagtttatc agatgctggt agtgaaacaa gtaaagatgg 223141 caaatgacac catgtcaaca gtaaagggaa taggaagtgt gagactgaga aatgaagatg 223201 gttctactgt tttgctgaca aaggtgagat atgtccccag ggtgagagca ttactaatcg 223261 gggaagtttt aactagctgg agagtaagga tctcttctcg aaagctatta tcaacgttca 223321 gcgattgaat taccttcagt aaatctaacc agctaagcta ccctttataa ttaagttaag 223381 aaagggcagt tctgtcgatt cccaatctcg aaactgagag tgtgcgaagt cacaagctga 223441 tgggctatat gtgaagttaa aaaacccagt taagaaagga aagatatagc tatcaagctg 223501 aaacttgaac agaaatttcc ctagctttta acctttctcg ttagtacact atctttcacc 223561 cttccacatc ttctctttat tattggatgg tttatggtca aaaagcttag gctgtgtatc 223621 ctcagctgtt gtctgttcag tattctcact ggcatctccg tctgtgataa catctactta 223681 tctcctttca ctctctcctt ctcctctcct tcttccaatc ccgtagagag gcctggaaaa 223741 atttattgaa aataacaaga cgattgacat cttaaataaa gacatcatgc cctagatgcg 223801 aaggttagta agagacccaa gtgaattctg cgaagataga agagcggact tctgaggaga 223861 ctggaagcct ataaataata ggaatggcga actggaacct catccttcaa agggttggtg 223921 tatatttgtc ctattcgtaa agaccccgac cttgcgtcag gaaaagtgga aaaaccccta 223981 agactctacg ggctagccgg gcctaaagaa gcttataaaa ttccacctat gtaatgactc 224041 gcattcaaca actaagagtc ttccttgctt tcgtcacaaa tcccatcgct cgaaagaaag 224101 agggtttgga gttagttaca agctggacta agactgaaat gaaggaacca gatggaggct 224161 gatgcaacgg ataggagtta tgctttttgg aaatgaggat tagcaatcat tcgacagttg 224221 ggatgccgct ttcactagta aaagatatcc acgcaaggaa agaaaatgcc ccgccggaaa 224281 gaaaagtaat taagttaaag agtgtcggtt tggggtgagg 
cttactcctc gaccaaaaga 224341 attgaaaaaa agcaagacta acaagtagag gtggtttttt tcctaagcca aagaagggtt 224401 tgattccacc tcaactgaaa gagagtcagt agcagaactt aactaagcag ctgcagagtt 224461 tcgctcagtc taatctaagt agctaggcta agagtcagac agtaggcttc cggtagggac 224521 agggatcggg tggcctgaat ggctcaacta attagtagag gaggaacttg agttactcga 224581 ccactcggaa gaggaacgaa ggtactcgac caaaagaaaa ggttagaacc tgctcggccg 224641 actaaagcta aagggatagg ggaaagaaaa ctgcttgaat aactagtggg ttactactcg 224701 atcgactaaa acagaagtca gggaaaaaaa ccaccatcat actgtctttt cccttaaaga 224761 aagtaaggca tcccaaggca gggaaggaat ctgaaccaag gaaggattct gctggagctg 224821 cagtctcgga gtttaaaact gacctttagg cactgacgtg actcttccac ttgttgatga 224881 ggagagggat tgattactcg actaaaagga gagggtagca gaataccatt aataaaaata 224941 tcccattagt gctgctcttt cctaagtaaa atacctaaca acgagaatca ggagggagct 225001 tctgggtagc atcaggagac ttacttacta gaaactaata aaatagtgaa ggttgtagag 225061 aagggttctt ataagataaa ggtacagtag ctcgtatcga agaatattcc cgagatgtga 225121 atctcactaa agcatttcaa ataaagagag gggccaagct taagaattcg ctttccagga 225181 cgtactcagg gcagggagac agggcgcaac aattgtgact ggccattttg atcgatgagg 225241 gaaggcaagg ccttaaccaa gagtcatcgc attcgtcgca ccaatcatag atagaacccg 225301 atccttggat agaatccata atcagaagtt atcggtaccc ttagcatctt ccctttgatt 225361 agaatgtgaa ggtccttggt tcgggcaagc agcgtattta gaagcctgct accgaaatgg 225421 aattcctcgg tcgataactt tcatgataaa gcagaattcc gttggaccaa ttaagtacct 225481 tctcgtttca gtcgagtcac tgagcgccca ccgggttgat tgaaagtgcc cctacggact 225541 gggacacctt cttccagagc attctcacca tctgtactag ccttgcatga aaaaaggatg 225601 aaccatgaag aattctccac tctatgaact catagcgctc ttgggatggg catgtgtctt 225661 gcagtaggta aaagttaaag aataaataag cacgagggag atgctaggct gatcaaaaga 225721 agaaggatta gcgagaagta tatctaactt tattctccaa ttgcaattga ctgattatag 225781 attttctata ttctttttct attttcctag cgaactagca gctcccccgc cctacacagg 225841 attggatact gaacctgatc agatagaccg atcggtgcgt aagatgatag tgacgcacta 225901 ggtttgattt gatctcgcta cctactatgt atactgttat agctcttctt ctacctctat 225961 
ctactgagga tattggagca ctaacccgca actttcatct acgctaacag ctctcttctc 226021 ttaatccgcc ttcgattatt cattagcgct ccgcgggtcc tcctgctact caaacagctt 226081 tctttaggaa atgccgacat ctactactaa tggcttgaca ttccagaaca ggaagtcatt 226141 tgaatgcccg aagtctgcat cttctattac atctccgaaa gagtgaaaag tccccttctt 226201 gcctgatgag ttcttctttc ttttcttgct cctcacagtc aagatcccgc aaattctgac 226261 tgacttgaaa ttaatagtcg ttttatctga tgatatgaga tgacgagcgt cagacagaga 226321 aagatcaaaa agtctagtca agaagcttaa cggtgaagat ggagaatgaa ttgaatcttt 226381 cacccacctc agtccaacta gcactcttgt gcctcatcga tctgccttct cactccagga 226441 gagccatttc gccattcaaa aagaactgta cacgcgcgta tggatcatcc atcagaatca 226501 atcagtcttt cagagtagtc atcaagcaag ctatagaagt acgtgtgagg aaagagaaag 226561 aagtgagttg agtgacttgt gaccatcctt ataatttcga attgcataaa aggagtcagt 226621 cactgacgcc gctcccagaa tcttaggttg atttcttttc gttttaggta aaataaccaa 226681 ggttctaaaa agctatgggg ggccctcata aatccaccac ttatcagact gcgaatgcta 226741 acagtatcga agcgcaccaa gagattcccg gtcgaacaaa tttcgtatag aaaaaaaaga 226801 tgtgttttgt cgagtttgct tcgtcctctt tttttttctt attttccttc ctttaggcgc 226861 ttgctctcaa aaaaaaggga aaggggtcaa ataaaaagct accttttatc aaggctaatt 226921 ccaacggact tcgcccagat cttcattcaa tgggacgacg tcagtgcatc tttctggatg 226981 atgaacgcct gtctcgctca tagatagagg aagagtacgc gcgctagcgc gctacggctt 227041 acgaagttga tcgggtcaga tagccccttc gggcaaccaa ccgcacaaaa aattccgatc 227101 aacaacttgg agggatggct gagtggctta aggcattggt ttgctaaatc gacatacaag 227161 aagattgtat catgggttcg aatcccattt cctccggcgc ggaagtgaaa cgggcgggcg 227221 aaatgagaag agcactactt agtgactagg agcggggagc ccgttgcgcg tttttttgtt 227281 tgaccggcct atcttcataa gtaagctccc tatggccgtc cagtccctgg gcggctctcg 227341 gttcttgagc atgttgggag attagtcgtc aattgaaaga gctgctctaa agcttgacga 227401 agaagttttc cctattaatt agattagtaa agggcttttc ccttactagt caagtggtaa 227461 ggtagggcgc tcttcgatga agaaaagaag agacttttgg aaaagtggtt cagctcagct 227521 ggttagagca aaggactgta aatccttgtg tcagtggttc gaatccacaa ccacttctat 227581 tctcggagct gaggtatatg 
aagaatggcc ttttggtccc tttcgtccag tggttaggac 227641 atcgtctttt catgtcgaag acacgggttc gattcccgta agggataggt actcattctc 227701 ggccgctttc agttagtgtt cattgctgag ggatcccgag atctcgaagc agatgaccaa 227761 gtgagctccg cagccgttag tgcaagggct cgatattccg tttccgtaga agaacgagac 227821 actgtaggct ggcgttttgc ggaccaggag ataatattgc atccgagaaa cgtacagaat 227881 ccagttgtag acctcctcgt ggatgtacat cctgcccagt cactatcaca gaaggcctga 227941 acattgagct tgctgttttt gtgaatatac agcccatgaa agatggtgcc tttaacatag 228001 cgaaggacac gcttcaggag atcaaaatca gcgagtgtag gctcatgcat tcgctgacag 228061 acaatgttca cggcataact tatgtctggt cgagtgagag ttaagtattg gagagcacct 228121 acaatgctgc gaaagtctga aggatccgga tacttagcag tggaaacaga ggagttgagc 228181 ttaaggggca gaggagtaga cataggtttg caatctaaca tccctgcatt gttcagaatc 228241 tgttcagcat acttggtttg ggatagaaac agacccgaag gatgagtttt gatctgaatt 228301 cccagaaaat agtggactgg tccaaggtct ttcattgaga aggtggagct gagctgaaag 228361 atcaacatat tgagcagagt gttggaactg cccgtgagaa gaatatcatc cacgtagaga 228421 agcagataca tgacatcaga attgttgtga tacgtgaaca gagaaggatc agctgagctg 228481 cagacaaacc caaactccag aagaaaatcg aaaatcgcta aacttgtcga accaagcacg 228541 aggcgcctgt ttcaatccgt agagagcctt tttcaacaga cacacatggg atggatgaat 228601 aggattcgta aaaccaggag attgatgcat ataaactttt tcttgaagat acccatggag 228661 aaatgcattt tgaacatcca attgattgat tggccaacct ctaactgttg tgcaacattc 228721 aagatagtac gaatagtggc tgtcctcaca accggactgt aggtctcaac aaagtaaata 228781 ccttcttctt gatgaaaccc tttagcaacc aagcgagcct ttaggcgatc aagagtacca 228841 tcagaatgta atttggtttt gaagacccat ttacaaccaa gaatattctg attaactgga 228901 ggtggaacaa gtatccaagt cttgttccga gagagagcat caagttcttc ctgcattgct 228961 tgacaccatc ctggatcctt aagggcaaaa ataacagatt ttggttcttt tttaatagtg 229021 gtagtaatag tgagactata cttgggattt agtttgttta tgccagcctt tgatctggta 229081 agcatggggt gagaggagtg gttttgaggt agtggaagaa ctgtagatga agaaagcggt 229141 atttgagagg gaagtagaag aacatggaga ggaaggtggg tctggttctc tgtttagtaa 229201 agaagaaggt gggttagggg aaggtggatt aggtcagttg 
gttagggatg tgaggaaagg 229261 aagggaaggt tgggccaggc aagctaggga agttgagaag gttgggctag ggaagttggg 229321 agttggtgtt tcccttgttc ccagttgcta gtgcctgtgt agcttgttcc cttgttccct 229381 tgttccctgt agctagtccc ctgttgttag gaatgggact aaagagctct aacagaagag 229441 cggcaagagc ttctctgaaa agacctgttc tcttatcgct ttgaacattc tcctgggttc 229501 ctagcccaaa gaaaaagaca aggctgagtt gccccttcta ccatctgagg tggaatacgg 229561 atcaagattt gctgagtttg ttctatggaa taggccaggc cctctcactc tatctcatgt 229621 cggagaagat tatgtatgaa tttgaatacc cacgaacgtc taagaggtca gctacttagt 229681 tgattttatt taaggtatct cttgatcgaa gggttcaagc cgctaaagcg gaaagatata 229741 tctcttcgcc tgaagcgttg tggatttagg cgtgtcacgc caaccttctg taagttcctg 229801 gataatcgtt cccggataat gcctgccgca gggactaacc ctatcttcgc gcatacttcc 229861 ttgtttgtct gggagttgaa tcagaagcat cgaatgaaag cttctttaca ggacctcttc 229921 tatttccctc cattcttctt cctagtgacc taggagttgg gttcgtgaca ttttttcttt 229981 ctgcttattc aagtagcccg gtctggtaag taaggacttt gctctcagta gaagtcaaaa 230041 agccttcaac aagaccacat ccgctattga atcagctgct atagacttag ctaaagaatg 230101 tcttgcacga gattcaatgt tatgtgcttg attgcgatca catttttctc gtagttgatg 230161 agctgccaac ctccggaatt gagggttgta ggtagagttt cctgcgttca acttggagtt 230221 ggaagagttg gatttggagt ttataaaagc tgttattagt tccgactccc cggtctaggc 230281 ctggtacttg ttagttctaa ggtatcagtg tcgctacttt ctgtaaaaaa tttcctgcga 230341 agggaagcgg gaaagacaag gtttcgaatc ttggtctaag aacttacact acactacgga 230401 ggcacgggcc aagagatcat aacagtcaaa tcctcaacct cacaagcaga caagctggtg 230461 gatcatctga tcttgaaata ggtcatccca gctatcccga tcttttctct ctttatttcg 230521 tcaggtatct gaatcatacg attcctcgga tgtagctttt cacattggac caaaacaagg 230581 cccggaaacc ccgctagctt tgttgataga aagttagctt cccgtgcttg actagtaggg 230641 ctaatgaacg acccttaact atggaatgtt acgcctttcc tcggttactc acaacgtaat 230701 caactccatt ggttattttc cccgggggaa cacctatagc cccaatagac tgatttagta 230761 ggtatgatgt attactagga gtctctaagt tcctgctttg gttgtccctg tccgggctaa 230821 gcggagttta gctttggccg ttaataagag cttccctcca ttccttcagt ccagtcacgc 230881 
ttttcttttc ggcttttgct tcagtctctt cttttcgtcc tttcaggtgg catttcgctt 230941 agtcttgtca ttccttccgt catgctactg tcatgtgcta taaattgatc ctcccgtcta 231001 gtgtttcctc tctcatgtta tacagaccgc tcctttaatc caatgaatag gcggcgaggg 231061 tgacaaaggc ttagaagcag gggcgggcag aagcagcatt aattctttcg cccgcgcgta 231121 catattctat tgagcgtata cgatcattga gctgatcacc ggtggcttcc tcacttgtga 231181 tgtgaagctg ctttccattt ctcatttttg aggatatcga tattctctta agagaaaggg 231241 tagccttttg tcacgagctg gactcgaacc agcgcgtctg gcaaaaagac catccagatt 231301 tcaaccttag actgatcgtg actttggtta ctcggttccc cacgcgaacc ccacacgcca 231361 tgggtacgtc cgaggtagat taactacgaa cttttttcaa aaaaagtgtg atatattttc 231421 ctcacttact agcaaagaac tggaccagga ttcgtccttg gctgcctacg tacccctttg 231481 ggaatcaagt aaatcgtagt tctcttaact tattgtcttc tttctaagtc ttgctttctt 231541 ctctctttac tattgcttct ttttcttttc tttctataaa atgttcctca atagcactga 231601 ttcttcatat atttttgtca aagcaattgg gttcacaaag cattattttc tcggtccatt 231661 ttccacctct ttcttttgtc gaaccatgct aagcaaagat gcacggttcc agtcttcttt 231721 ccctccccgt gtccactagc gctcctatcc tctccttttc agtcgagtca ctagggttcc 231781 tctcctttta gtcgagtcat tagggttcct ctccttttag tcgagtcatt agggttcctc 231841 tccttttagt cgagtcatta gggttcctct ccttttttta gttgagagtt acgtcagtac 231901 cttttagtcg agtggcttct cgctcctgcc cagaacgaag agaaggcgaa aaagcgccgc 231961 cgaagcagca tgagcgggct tctattgcta cgtaacaata gagcaggata gcattttgcg 232021 cccacatgtt tgaattttag ggtaaaaagc tcgcttgtta tacgggatcc gacgcatcca 232081 gcagagcgaa gcagcgttcc attcttttcg gcggcatcct tccgcattgg cggcgagtgg 232141 agtgccacaa tcccattcat catttttgat ctacataagc caaagccctc cgcactggcg 232201 acgtccccgg cataaatgca aggaggatgt atagctgata taggatcttg tggaacagga 232261 tttgattctg caagcggttc ggtacaaaca aagaaatttc gaacaaaagg atcggaactc 232321 gctgatagga aaggagagaa aaacaaagca atgccaagag ctccgtcaat tcgctgttca 232381 tcgatagacg aagctctctc tttatcatct cgtgccagat gcaacaaagg atgagtcctt 232441 tttccttctc gcgaaccacg ggagcgccaa gcgtccagag gatttttaat tttcctttca 232501 ggataaagcg gcgcataaaa 
aagggctggt ccgtcaaaag tccggttcct tcgcgaacga 232561 agttcagaat caacaagggt tcgtagaacg aagggagtgt acaactggtt tttcaaacca 232621 ctttcttgtt cgtaacgagg gagagataaa attgagttct tcacgaagtt cgagacaaaa 232681 aaaaaaaaaa gactttccct atggcctcct tgttttaaga cattatggct ctggggtcga 232741 ccccggtaac aaagaaaaaa tccataaaaa tttgggatcc gacaccataa taaaatacta 232801 ccctcatgat tagaccatgt tcctgagatt tgataaaaga aaggtgcatt agcggttaat 232861 acgttgtaat tggataagtt attaggaata tgacagaacg aaagaccaag gaaagaaaga 232921 agaatgcacc aaaatgcagg tgctgcacca aacgctggtg gttgtttctt gttgtaagtg 232981 aatgcaacga aaagacccgg aaataacgaa taatgaaaga attcatatat tgatattgac 233041 atttcgtgct catttcaaga tttctgcttc gttattccca tcatccggta accacaggat 233101 gatccacaag aaaggtggca ggattcgaac ctatggccgg cccaaccctg acctgttggg 233161 ttgggtggcc tgcttcgccc tcgtcgcctc tgtacccgaa acagatgcgc tgcgctaccc 233221 agcgcgtaac cttgtccccc tatccctctt ctgcttatgc cattaccaat cgcgggtaac 233281 ccccggaccg gccgccccta acctaataag aacgattatc cttatgacca aacaaggacc 233341 agcttacttt tcgagcgaga gtttcacgat cccgaccagc aacttgttgt gggagtaagg 233401 gcatccaagc ttgcccaacc tagtaaaggg gcttggagat agagggtttt ctggggggat 233461 ttggcttctt tggaaagctg agttggaggt gaagattgtt catcgccagt atgttcattt 233521 tagtgtttat cggaataatg cttttctgtc ctgggtaacg gcagtctatg ctagtcctaa 233581 tgctactttg cggcgtgctc tctgggagca gttgagtagc ttggcatcta tgatatctga 233641 tccttggcta ataggtgggg ggactttcat tccattctgt gtagttcaga tagccggggt 233701 ggctcagcta gtggtaatcg gccgtgtcgt ttgagttggt tccattcctc tagtctctat 233761 gacatgaagt tcgttggtcc gaagttgact tgggctcgca agggtttgct gaagcgttta 233821 gatagagcta cacttgagag tctttctatt acttccttgc ttacctgggc tcacttcgca 233881 acaaagagaa gttgtttatt tgcttgctta gctagaagat gccaacttcg ttcacttacg 233941 tctctcgttt acttgttagc tagccctatt cctcaccttt gctttgaggg tatctaaata 234001 aaaaaaatat acagtggtgt gctcgttaga ggatgctttt ttagttagct aagcgactac 234061 cttagttgaa ggtgggagga gagcagcaaa gtctcttatc tttccaaaga aggcggaaat 234121 cgagatgtga gaaaaagaat agctattcac tctccaaaga 
agccctcaaa attctttcta 234181 gcaaatgtga agagaggatt gttgacaatt caataaccac tgcaataccc tcttacggaa 234241 tgatctcggt gttctcgcta tgtcgtcaat ctcagtatac ggaaatggtc tgaccaagag 234301 gcatcgatct ggagtgggtc ttgcattgcc ggatgacagg agccctttac ctccactctt 234361 caattatatc cataccaggt gaacacagat gatcgacctt cagtgcctgg tcaaccatta 234421 aagaagagaa gaggagcgga acaatgagaa ggagttcgta gccctccaac cagaactaat 234481 caacccgcct cttcaacatc tccatttccc attccatctc tgggatccaa tgcttcttca 234541 ccgggcccgc ccgatcccca gaacgtaaaa tggaaccata ccggaaaccg tttgatcagg 234601 atagaaagag caaggggctt tgttttcgtt gtggtgacaa gtggaatcct ggtcacagat 234661 gtaagggcag agaattaaag tatatggctg tggatgaagg agatgagttg gatgaggctg 234721 atgaagctga aacagggaaa aagatagaag aatcagatga ggatgaagta aaggaattac 234781 tctggttgtc tttgcggtcg atggcaggtt taacaactga gagagatcca tgaggatgag 234841 aggacaaatt gcaggacaag aagtcattgt tttaattgat tcaggggcta caagcaattt 234901 catagcagag agtgtagcac atcggtgtgg tctgcaaatc acagaaactc gtggatttgg 234961 ggtttctatt ggaaatggtc aagtcgttcc aagtgccggg aaatgcagcg gagtagagct 235021 aacggtccaa gatgtgcaga tacgtgcgga cttcttcctg taaccatgga tatagtgctg 235081 ggttatgcgt ggttggctac tctgggtgac accagaatga attggggaag acacaccttg 235141 agtttccgta gggaagatca gtgggtcacc ttagcaggcg atccgtcact ggtgcgagct 235201 caaatctctc tcaattcggt agtcaaagag ggaaggaagg cttattatct cctagagctc 235261 accgcactgt tttgcaggag gaattaagac tcaaaatttt cctgaggatg ctcaagtgaa 235321 acagttgttg gaatggcatg aaaacgtgtt tgagatgccg caaggattac ccccaaaatg 235381 tggcagggaa catgctatca accttaagga aggagcagaa ccagttaaca tccgtcctta 235441 ccggtataca tatactcaga aggacgagat tgaaaaattg gttaggggag atgttagaag 235501 cacggatcat ccagccgagc atcagtccat attcaagtcc agttttgctc gtccaaaaga 235561 aggacggagg atggcctacg gcgcgtggat taccgagcct tttacaagct cacggtacca 235621 gacaggtacc caattcccgt gatcgaggaa ctgttggatg agttacaagg agcttctgta 235681 ttctccaaac tcgatctcaa atccgggtac caccaaatac gtgtacgagc cacagacgtg 235741 aagaaaactg cctttcgaac tcacgaaggg cactatgagt tcctggtcat gccatttggg 235801 
cttaccaacg caccagcaac attccaatcc gtgatgaacg atattttccg cagggaagtt 235861 tgtgttggta ttttttgatg acatactagt ctatagcaag ggcacggagg aacacatgaa 235921 tcacttggga atggtgctcc agatttggga acaacaccag ttttatgcaa acagaaagaa 235981 gtgtgctttt ggacaaccac agattgcata tttgggacat agacatatta tatcgggcga 236041 aggggtgtca gctgatccag ctaagttaga agcaatggtg ggttggccag aaccaaagaa 236101 cacaacagaa ttgcgaggct tcttgggctt gacggggtac tacaggagat tcgttaagaa 236161 ttatgggaaa attgtgagac cgttgaccga gttactcaaa aagaacagtt taaaatggac 236221 agaaatggca gcactagctt tcaaggctct caaaggagca gtgacgacgc ttcctgttct 236281 agccttgcca gatttgaagc taccttttgt gacccgcgtc gggaaatgga attggagctg 236341 ttttattaca agggaacagg cctgttgcgt atctcagcca agggttttct gaaaaagggg 236401 agcttgaaat cagtgtacga acgagagctc ttagcaatag tcttggcagt ccaacgttgg 236461 agacatcagg gcaacgtttc acgatacgca cagaccagaa gagtctaaag cacttgctgg 236521 aacaaaggtt tgtcaccaag gaacaacaga ggatgggcta ccaaactgtt aggcttggat 236581 tttctcattg agtataagcc gggaactgaa aataaggctg ctgatgcgtt gtcaagacga 236641 gtgcaacctg agcgtctgtt ggaacttgta ttcgcacctc ctccctcttt cgatgcagca 236701 gaattactat ctcaagttga agctgatccg gagttaagaa ctgtgttaca acaagcccga 236761 tagggaaaga acctggaatc tggttatacg gaaaaaagag gggctgctat gtaaagatgg 236821 aagagtggcg tctcgaatca gccaagcttt ctcccaattt cttctatatt ttcatctctt 236881 tctctattct accgagaaga tccatctcgg gtcttatcag atggtatcag agccttcggt 236941 cttctgactg tgggtatcca acgacgattt cgacaatggg tttggatagt aacgatatag 237001 cgctgatcag gcggattctc gctcgggtgg agcagatgtt tttttccatc gcatgacgac 237061 gatgaagaga tccgccgatc ctgaagctga ggaagaccca gatccgtcgc ttgggaaatt 237121 catcgagcag gtggtgagcg gtccaactca gaagggattt cctgcatctg gagcttttgc 237181 gactcctcgg gcaagacgat ctcgcggtcc gaaaaggttt cttggcaagc gcaactaccg 237241 cagggctcgg gcaagaaaac cagggaagcg tgacagagct cactcgtcca aggttagatc 237301 tagcgatggc tctacacgac cttctgcgag atatggtatg cgtttccgat ctcctacttt 237361 gcccttgtgt tctagacctc ggacatggga atttctgctt gatcttggtt acgatagtat 237421 gtcgctgaag agcgctcgtg 
tagcgagtga acgaccttgc gatcctggac gagctaacca 237481 tgtgtgggag ccgggtggat tatcgcctga acaagctcag tgggactggg tagggaaggt 237541 ttacagtcgt gatagagttc tgaagctaat gggagagctt catgaattgg tttacaggga 237601 atcaggaaga cggtaaggat cttagattgt caaggatctt cctcataaac acgtttccgc 237661 ttgcagctga aacaagcgga gaagatggga ccaacgagat cagccaccag gaagttaata 237721 agatctgatt tttggcatcc tttgatcctg gaagaacttg tgctgaaaag tgagttcgtc 237781 aaagagctct ggcgaggcat tagagctgtg aagggaaaga aggttaagaa gaggaagaaa 237841 cgggaaccta gcctgccagc tttggaaccc cagaattttc cctgtgttgg ggggttggag 237901 gtatgattag taggcttaat tataaagttc tcgttcaacc agaactagat aaagctacaa 237961 gggcagtctc tagaattact tacggattgg ttgctgttgg taaagctgat gggtatagtt 238021 ttgaggttgt ggattgatca agttaactcc actgacgaag ataagcttgt gagtttcatt 238081 ggatctggtg ggtgtcaaac acactttaag aagcagctag tagcaaggaa gttagcattg 238141 tttaatcccg aaaaattcat gaacctaaga ctacctcatg acaagggaat tctagtcttt 238201 gctggtcata gaattgctca aaccacattg tgagattgat gccattttga tagaaatcgt 238261 cggtatataa agagaaagtt tgcgatgtaa gatgtgctct tacgaaggaa aataggattg 238321 ctaacttttc catctatata aaatgaaaag caagtttttg tatcaggaaa gaggcaagga 238381 cagatccgcc gaagaaggat cggggttaga actagtctga agcttaaggc ttttgctatt 238441 caagaaggct ctccctgctg gctccatgta tgctatagct cctgaaagtg tgagtttggc 238501 ccctggatag agagaaaggt aaaataaaga ggagagtgat gtttcgtttc aagttcgaga 238561 tctcgtgtag gtatacatga gaacagagaa gtgtcctacg gattcagaac cggaaggtca 238621 gaaaaaagtg ggaatgacag tccggctacc ggaggagcta ttcaactatt atcaactatt 238681 gagtctctga gtggattttc caatagatgg gagcaaggat attctgtctg aagaaggacg 238741 cagaagttag tgcgtacgct tgatacatca aagtatgcct cccgcttaca catcagcagt 238801 cgttctaatt tttcaatcaa agtagtagct gtcgtattcc agaaggaagg gggattgata 238861 taagaagagt cagacttctt tttaagcagc aatcttttac agaaaagaga gtagttgtcg 238921 tcgatgtggg atagtccctg aagaatagta cccatgtcaa cggattgcag gcatagttca 238981 actcatagcc ccaatccacg ggtgacgata ttgtcattgg ggatctgtat gctttcaaaa 239041 gagtcgttag ctgatctagg agtcagtggt caataagtag 
gcaaaagact ctggggaagt 239101 aagcgaagct attgattggt tgagcccctc aacaaagcat gtccagcact tatcttaagt 239161 tcctttaagc agatctgtga gcaatcctag gcctctcttt cagtaatttc caaagagtat 239221 aggctcaatt aaaattattt atggtaggca gagcaatcgt agacccttac ctagctagtg 239281 acctggctct cttcttcata cttactgcac gcaggataag atctacttgt cctcctccct 239341 tccctttacc tttaaatgaa agctttcacc ctctccctca tctgcatcca aaccggagcg 239401 actttgttca atcgcttttg cacttttttc tctgcctggc aggcttttca tagttcacgg 239461 cccctgttga gtagggtcgg tataggtaag gcaatcgatt caaaagtaga ctgagagtgg 239521 aggtttagaa taacctaagg cgtactgtag aaggaaagaa aggtaagcaa tcccagaccc 239581 aatagggcta cctttcttcg actgagacct atctgcttca ttcccgagtt tcacttcact 239641 tggacacagc caaagaaaaa ggggggcttg atccaccgct ttcaagtcca gttaaattag 239701 agaactcctt tttagaagaa taggatagaa tcttccgaat cagtcctact atggttggac 239761 caaggtcatt tcgactctct agactttctt gcttctttcc atcctgagaa gtagatcctt 239821 tggacctttt ctttcctgtt gaacgagtgg atgttttgaa cgagtgttga gcgagtgaaa 239881 tgccccataa gctataaggg cgagctatct ctttagattc tgtgagctgc gattgaactg 239941 aacgttccaa gtgcatccag caggaatcga acctacttgc ctctttatta ggttgggcgc 240001 tttaaccatt cagccatgga tgcaaagaga gcgagtgaac taggttttgg tattccttct 240061 cgagcttcaa tttcttccgt tataggagat tcgagaaaag atggaatagg aagacgtgtt 240121 tccagaacaa acaagatacg ggttgagaag gggaagttag caaagttaga caagattgga 240181 atgggcatag gaacatgtat tgatgtacca gatcggctaa tgctcccagg ttgatgcgat 240241 gtattccacc agttgactga agactttatt attggtatat cgatcggtcc agcacgaatt 240301 gaaatagaag ccggttcgac aggaagcttt tgaaaacgca gtgcacccag gtaaataaga 240361 aacgagatga atacagaggt caaacgagca tcccacaccc aaaaggtccc ccacattggt 240421 cttccccgaa accccccagt aactaaggta aacaacgtaa aaaaagcacc catttctata 240481 ccggttccgg aagagcgaag ataaagggga tgttttgtta ataggaacaa gaaagtgttt 240541 atagccgtgg cgatataaac aataatactc atccgagccg caggaacatg tacatagaga 240601 atacgagaat ttccaccttg ttgaagatct agtggtgcta ccccaagact taaatgaata 240661 gccatcgctg ttaagaacaa ccaagaccca atgagaattt gcgcgtagct tctggtcttt 240721 
gacatcaaaa aagaaggttg taataacgaa acggacatgt gcaaattttg tcctagctag 240781 tggaacaaga aagacatgga tctataatac gcaatcaaag gatttccccc aacgaataat 240841 tccccctccc ctgctttgtt tgttttcgct ctgcttgtgc gtggcactcc ttgctggcgg 240901 agcggcgcat agcgaaaaaa agaaaaaagg aaactcacca ccaaaagatc agtccaataa 240961 tcccactcac tggtaaatag cgcaatactt cttcgtgaat ctccgctatt tgaatatgga 241021 acatcataac aacgaaaagg aatgaaacgg ctatagctcc tatatgaact accgggaaga 241081 tcatagcgaa gaagtcgaga cctaacaaaa gacgtaaacc tgaaatggcg gtttagtcgt 241141 cggaatcgct tttatttatg tcgtagtcag acagcaattc cagtagctcc atccaaaact 241201 tgtctccgtt ttttcctaac cgggaataaa gatagtagaa tcgtttctta cgccgatcga 241261 tatcgatcaa ctcatcgtca aacatcgcta catcgataat tctttttata tcgtttttat 241321 ctggaatttt gtatcttcgg atactaagag ttcatgtgtt tttgctatta tttttgcctt 241381 gattttttcg catttaccac tctctgcttt tacttgctcc ccgatggtat ctgtgttttg 241441 atcctctacc tcgaggttca agaaaaggcg ttttgagcca ctttccttgt actctttcga 241501 ccagaaaggt gaacccctct gctgatcgtg agagtgctca ttctcatgag cataatatcc 241561 ttctaaaaga ccaccctcag atggatttgt cgggggcatt ggcgtatttc ctcctcctag 241621 tcctccggat accaccgggt agaactgaga aatggcacga cttaagtcgt ccatccacaa 241681 gtatcctaca gcaaataaag atttacagag aataactgaa agaacataaa accaaatgag 241741 ttgccatttt ttggaaagtt tgtctataat aggaagcgaa caaaagtaat ctaaaaggac 241801 ttttatgctg caaaagactg ggatgggaaa caaaacagaa tgtacaaaac ggataatacg 241861 tgtaaccagc aaaccaaaga ccaaaacttc agtacggtcc gtgcgccgtt tttcttctct 241921 accatgacga atcaattgat tagctgtagg cattattctc tttcctttct tttccccccc 241981 attttgtgcc ctatcacttt actcccgatc cgaagcaccc cttttccatt catatagaaa 242041 tccaatcgtc aaaataaata aaaagagtcc atcatggacc aaaatctgcc cgagacaaag 242101 ctgagctgtc aaataagacc aacaagggat aagataggga ggctcctcta aagctggaac 242161 ctcttccaac cccagcacgg ctcttccccg tcgactcctc gtacgcgtat cacaggggca 242221 aacacgcctt caaaacatct ggatcataca aacttctcat tcagtggaaa aaaccttacc 242281 aagactagtt gcctcaacga attgatatgc ttttgctttt tgtgacaccg gcgcgaatgc 242341 tgaatacgga ggaatacaat 
gaaatcagaa caaagcaggt agaccaagct agcttacaga 242401 attcgaggaa ttaaaactca gaataagctc ttttattagc acgtttggta gccagaaagg 242461 aaagaggttc tcattcacag tgccttctcc ttctgtgatc actgatcaaa aggacattca 242521 ttcagtaagg gattgatcct caaagtggtt cttttagcac aatgaattac ttatcttata 242581 atgaattact tatcttatat agtagtgtaa ccttctcttc gtacaaaaaa gcctattctg 242641 aaagccgtct ttcaatgcct gtaccctagg gcgggctagg ttcaagggta cgcacctagt 242701 agctgcgagc ctaagattaa aggcaagttc ccttggttcg gcacgctttc actcagtatg 242761 ataggctacg gtcagattcg ctcggtctag gtatcacctt gatgcgcgta gttattgcct 242821 tgatcccttt cctttgctcg ggtcattcca ttgcctgaaa aagtgagcct ggatcttcgg 242881 agaaaggatt ttagagctct gtggagaaag tcttgaaata ttggatgctt tctttcgtgg 242941 ttaggcgggc atggcatcaa ctctgtatag gtgggttggc ccggccggcc aaagctagat 243001 atagatcggt tggcactgga tctgctggat agaggtagcg gcacagacag ctacggatgg 243061 atagaattac gtcagcttgt cgggtcagta gatatgccca gcatccattt tgactagaaa 243121 tgaaatggta cagagaagcg cagtgagcgg tagcaaccat tgactactcg ggccgggcct 243181 tgcagccaga aaatgggact cgatcagctc agcccattga ctcttttatg gggaactatt 243241 agagacctag gaaagtcaag atgccaccgg ggcgtagaaa gaaaaaggct tcttcggcat 243301 gatagctctg ctctgtcgga taatatgagc cccttatgga acagctcact ccggggtgat 243361 tctggtttag tcatagaata aagtaaatcc atgcgtttgt cttcgccttt tcggaagacc 243421 aagacagcta gacgatagat taaccaaagc cataaagaga agatcgatat cgcacagaaa 243481 tcctggatct atcaaaaaag cccgccaaag gtgtcctcac atgaaatcat gaaaaggatc 243541 cgagcccatc aaagactcta ctacatgaaa ggatctgaat ctatccaaat agctgtggcc 243601 catgatccaa gggacccgca accagtcaat catgctatcc ttaccttcca accaatcgga 243661 aaaaattacg ttcaattcag gatcgaatac atctttggct tatgagttat gcgttttctc 243721 ttcttggcac tttatgtctg ctagtgcata atcatgctta tggaaatagt ggaggttgag 243781 cagtttccct gatgaagaga actctccttc tctaaacaag cttgcctgtg ggcgtcgagt 243841 aagagcatct gggaattgat cttttacgag acagcccgga gttggtctcg tggctaggtc 243901 tttaaccgaa ccggttcgga atagaaagtc tcggtcaatg aagtggagtg ggctgaaaca 243961 gaaagttggg tattgtattt ttatgctcgc ttggccggaa 
tcccctatag tgcttttcgg 244021 gcgaagtgac ggattacttt ttgagtgact accctcagtc tttcagagcc tcgggataag 244081 ctgttcttgt ttcataagca aatgtggcaa cctggtggta tagtgagctc tctctctgga 244141 ttaagcttcg gtcattgata gagaggaatc acctagccta gcattagact cgactagcta 244201 gccttccttg ggcggattgc tttgaatagc gtagttttct tagctccggg gactgcctta 244261 tctacactat gagaaatgtt agatgcaacc attatagtat agatatattc ttagatgatt 244321 actatatgtg atttctttca tacagtctta acaaatatag agatcttcac cataggagac 244381 tagcttgaac taaatcctcc aaccttaaag cagtatgaga gaggacggat accagaaaga 244441 atcacctcgg agccgctatg ccactatgag aaacgttata ctatagatag acatgagatg 244501 atattctata aggaatacaa tgtttgaaaa gcaagaatag atagagttga aagaaaggcg 244561 ctcgtataga gccctcgact atgtatcgga catatggcag gttgggaaag cgcgcttgga 244621 aaagaaagac tatcctcgag gtaaatgtaa atcaatctat ctcgcattgc ccaatctgaa 244681 tgctaaaggc tcgtaacccc ctaaaaggaa actcaccacc aaaagaaaaa ggtaggttaa 244741 ctaacgaaac tcaaaactac attttaattc ttgctcaccg ataatccttt agaaaaaata 244801 ttatacagct gtatacagga ttattggtat agctgtccac tagccgaaaa tcggacacca 244861 aattattgta gactgaatta agtaattgca tagctgccgg ggatgccgga ttgcttgagt 244921 tccttcctat agaaaaagaa ttaattcagg agttccaggc aggaaaagta tacaaccata 244981 cttaaaaata ctgggcgcca ttcggaatag aaagaataga gatgttgggg gtggaaagat 245041 tccatggatg tcttatagag ccatatggga aagataagtc tgtgttcggt ctctgggcat 245101 cggtatcgtc ttatgggctt gtcgactcta gaacccgaaa agggaatacc aaaaatgaag 245161 ttaggggatt ccggcgcgga tgatctgtct tagagtctct tgctaagcca ctcggattcg 245221 ggtgttctca aacggtgaag actttaaggt accaggtctg cgagcctatt cagttcttgt 245281 tcttccgaca tcggatattg cccgactaaa acgacgtgtc ttgcaatcct aaccaacatc 245341 gcttcggtag ctcagtcagc atcaaaggac acatgggggt ctcgaaacga acgttttcga 245401 ttagtaacat ggggtcggtc gccccattcc ttcttaggta ccagcataat aggcagagac 245461 agagctacga ttccccaggg cgagggaaaa agactgacta gacccggctc gaggagattc 245521 aaagggcaga ccctcttcac aatcagcttt ttaagaagca agaaatgcca cttttggccc 245581 tcctgggtca tacctatggg gttggctctc acgtttattc gtattcaagg aaaggtgtca 245641 
atctagcgta agccgcaata ctagatcgat acgagatagc agtcaaaagc agcacccatt 245701 gaggaatgaa aaaaatccac cccctaacta gactatggaa gatcttctcg ttttggctaa 245761 acggatcttc cctgtactgt aacagggtca acaccgcctt ttgaggttgg tgtccgagct 245821 ttccgcccag ctttcttcga tcgtatgcca cgaactatgt cttagatacg taagggcccc 245881 tgtccaagtt tggctttctt agtacattgc ctttgtactg gaagctggat catacgaacc 245941 agcaatgacc tgaagctttc ttgcagcaag atgatcagtc cgagagtgct gggcattcat 246001 ctctagcttt tacaatggtt atttcattcg ccggatattc accaaaagaa agtgctcgct 246061 agccgtaagc ggaatgactt tcttgatatg gaccgatcga agcccttaag gatagcttac 246121 tatggtaaat agctagaccc cgaacctatt cccagtttcc gagtaactat cattggtcaa 246181 tttcctgttt cccatggtct atcattggtc cggtccttca gaggcaggag aagtacaaga 246241 tgcaagctca taaacgagac ggtaggatgg cgctaatgcc aggcgatcat tagacttcgg 246301 ttaatggaaa gtggaactta gaaatagctt cttaagaaag agatatatag tagtgcttcc 246361 ccctccttag aatggtaatg gttcgttctt cgccaggatc atctccaggt agtgctgctc 246421 cgtcttcgcc ttgaccttgt gctggccgct tttgtacagc atgaaggggg gaaggttccg 246481 tctctgtttt atcttttgct gcgaactttt ttttcctctg ttctctgtac agcacactgc 246541 tgctttttct cggtactcct ctgctttctc tagcacagtg ctgggttact ccttcctaac 246601 tctcctaact gttatcctcc tctgtgctct acctgaagca cactcctctc tctgtgctgg 246661 tatactccta actcttaata gtacgcacac acgtaactct taatagtacg cacacacgaa 246721 agtgaaagta aagggattgt ctggccctgc tacctagcca acatgagcac acccagctgt 246781 atttcctaga tttaggtttg attacttttc tcggtatagc agcgctacta cgcttcgctc 246841 acactctctc gaccaatgct gccttctcta acaactctca gggagactaa agaactgaag 246901 gcttcggagc ctagcctcta actcaggtga caggccagag aagaacttcc tagctagagt 246961 caactcgctt caggtgcctg ccaaatatac ctgcgggctt agttgaacct cctgatgtag 247021 accttaacag agtagctaaa ctgatccaat catcatccca gtgattaggc taactacatt 247081 cctttttttc gaaagagaaa gaagtcaaag gcgttcctct cctattagtc gagtagtagg 247141 cgttcctctc ctattagtcg agtagtaggc gttcctctcc gagtaagaga ctgagccggc 247201 taccttactt cgtctggaac aggaacagct aacaacagta cgaagagtat gaccgctgac 247261 cttctgtgga aagaaagaaa 
agatattgat tctacgggtt cctcctatga gataggagaa 247321 gctgggttcc tcctacctac ttgaacagat aactaactac acctactgga ctggctactc 247381 tgcttctaag ggcgcgtagc cagatacaag ggttaacaac agatacaata gttcccatcc 247441 gaacaaagag aggaggagat tattaactga cagcagaacg cagatcgtac ttagctacac 247501 atgataactc agccagggac tacggacaac agatgaactc ttcatgctac ttattcccag 247561 agaacaaaga aaggaagtgg aagctgcaca gcgggaagca gacacaaccc agaaacaata 247621 cctgcaagac ggagaggggg aacttagtga tataaagtgg tagacccctt ctcctatccc 247681 acgtatcaac gggaggactg cttcgcctac ttcacgccta ctcatacaca ttccgcccag 247741 aagaaggaag actcttttac cgcagcagca agggatcccg ggccccgcca tgcatgaaca 247801 ggggttcctc cactggaagc aaccaactgg ttattcatgg ctagtccgaa cgaagagaag 247861 ggagattctt tcacaaatat gccatgtcgc ttacgctcct cgcccagaat atacagctgc 247921 aagtgctggt tcagctacaa cggtaactag aggaagaagc cctagtatgc acgcacctta 247981 ggaacactct agctacattc acattattcc ctggaccgga acattcataa gatattgact 248041 aaagacagct gagcaacaag caacccttaa ccgctcacac ggcttacccg tcgtcgtcga 248101 caagctcgct tcccgggtaa tcctaacatg agctacactt catgacagaa ctcactggaa 248161 caaagaagag cagactagta agggaccaac agggaacttc tagacaatac ttacatattc 248221 cagcctatac taacacatat tctaggaaat gacaggactg gcaacaaccg aagtgccaac 248281 acccgataca ccgctagaga gaagtatcca ctagcagaga accacaggga agacatctag 248341 ctacttcttc cacattacag acgatactaa gatattgact cattccagta tgacaaccgg 248401 aggcaacaac tgaactaaga tattgactcc agtatgccag ctacaccttc tacactggaa 248461 gcaaggaagt aggaactagt caagacccta gggaaacatc tagcatcttc taccccttcc 248521 cggaaacttt cttaataagg tcaaatctgt gcagtgtaag cttgatctct ttgaattggg 248581 tattccccca acctgtcaaa caaagaaagc tagtataggc atgaaaacag cttgcttaga 248641 gagcttgcct tccccttaca ggcaagcttt aacttccctt cctaacgagc caacaagtta 248701 gagatgtaaa gacggttcct taccaggagt caatcccaca agaatcctag cttgattggt 248761 ttgatgtaag cgaatgcgag tacagggaaa cgagacagct cacaacaaag cttaggctca 248821 cttctgaccg aaagtccaat tcaatcattt aaagctcact gagcacttct ttccttagtt 248881 tgaggacgtt aatgagtttc aaaggataat agctgatcta 
gaaaacctga aagtctctat 248941 atcagacgag gatcaatact gatgtcctta ccaaaacata attcaattca attcaacagc 249001 aaaagaagga tagcttgatc ctcaatgcct ttgatcatct ttttctaggt agggaattgc 249061 ttatctcggt aagctaagaa tgttagaaac aaagaactag gacagaacgg gaaatggaga 249121 aggaggttag aatcaaagaa cagtaaatgg agaaggaggt taatgtgtat ttcattctat 249181 ctacatttta actaattgag tgtatccagt cttatccatt aatgtaatta caagaagaat 249241 agtaccaagc atgtaggtta tagttttcac tttactgggg tgaaggtttc tgtagttcaa 249301 gtgggtcaaa agtggtttgc ggaaacatat ctctaataat tcgattgaga ggctcctcgc 249361 actcacatgg acttaaactt ttgtgtatta tacaaacatg attcacatac acatctcgtg 249421 tatattgcaa tacatttggt aaattatctg aaaataataa tgaaggtttc ttcaaaagag 249481 gtccaggagc tatttccatt aacactgtta tactgaacag tatacaaaag aagactgcag 249541 tgcgagaatt tatggaggat gataatgcat ttgagatatt cttctgaaca ctttcatatc 249601 ttttatgtaa aacatttttg atgagaaaat caccagtagt atccaaacac tttaatccag 249661 atgatgggaa aatgctttgt ttaaacctac tacgaagtat gcttaatact tcattattac 249721 cagttgatcc atatggtgat gtgaaatcta agtttaggtt tttgatataa ctatgaagta 249781 tattaggtag tattactttc tctgttgagt ttgtaccact ggtctgatgt ctcaatttct 249841 ctcgcgttag gatatgaatt ggattttctg tggaactgtt gaccagtggt cctctaaata 249901 aaggagattt cattttcata tctaaattca ccaatatgta tagagcacca ctggtcagaa 249961 gtaagaaaga taatatatat gtctattggt taaatcaatt ctggcgagag atcaaattca 250021 gattactgtt caattatttg agttcagtct aattttcgac ctaacaagag caacggaatc 250081 acgctctgta ggatttgaac ctacgacatt gggttttgga gacccacgtt ctaccgaact 250141 gaactaagag cgctttctta tcacaattga taagactgta aagacgagga ttcttttttt 250201 ttttataacc ccaataaatt ttccacgcct atattatcat atataatatg agaaattgaa 250261 agattatcta tgtccaattt gaatcgatta ccaagccatg tcccctcatg ctatatgaga 250321 ctgaactctc agtttgtata tgtggaaaga gacctaagag aactgaaact gacttctaac 250381 ctaggctctt actctcttat tttgacttct ttccttcttt tcccgtagat ctcccaagaa 250441 aaagccaata tcctgagagt gatctataaa ttcccacgaa ggccaatcag catcactaaa 250501 acaagagcaa gttgagagga aggtgtcaag tgcagacaat tcagagaaga tatctcaaca 250561 
atcttttgaa agcgtttaat aggaaggaat agaaacttta atctaaacct tcttttctat 250621 ctaaggtctc atcagaaatt gtgaaagttc actgagtaag ctatgtcagg tctggttatg 250681 ctagcaacca ccaagaatgc tctagctacg atagagatga ggattagcag caaagtggaa 250741 cttacaacag cagagcagac aaagaggacc cagaagtgac tatgggagaa gggagaggtt 250801 ctttccatgc cagccttgac caatagatca gtaatgtttt gagaaagata gaagcagaag 250861 ttctactcac tttttgacca agattgagcc agatacttta tagaaaagat ggcatttttg 250921 atgcatacta tgtcatcaat atgaagagga ttggaccatg tgattctaag atcatcaaca 250981 tatactagaa tgtacagagc atgatcaccg ctcagaaaga gagagagaga aagagaccgt 251041 aatgtctgag agaaaggaga gattgaaaac ggaagcaagc ggagggatag gagaagagag 251101 aagagagatt gtagcgattt actgtatctg agtgcaacac aaacaatact aagaccaaga 251161 tcttccgact tctttttcca atgtatcttt gttgaagatg aatctcaaac aacaaaacta 251221 agggatatgg cattcatttg aaaatgaact aactaatact gcttccaatc aacatatttc 251281 gatttcaaac cacgaatccc cccaacacag caatccaaac ttgacacaac tggaacctca 251341 cattccaaac ccaaattagc attcgaataa ccagatttat atattttccc atcagctcaa 251401 catcagagga atattaaggg aatagtgaag gggatcttat acctgaagct gttccaagga 251461 gatccacgcc atggaccagt ttcctgatat gtatagaaga caatcccctg aaaccaagta 251521 atccgcgaag agtcctcagt ctccatggcc atcttcactc tagtaccagg agtccagaat 251581 atagccatgg aagcctcaac atcttctgcc ttcaccacaa agtccgacca tccagccgta 251641 ggataataaa ccacctcaaa cggcaaaccc tgagctgctc tattaatcgc ctcagagaca 251701 gcttcagccg tcaatttccc tttcccagtc ctcctaaacc ccttcttcgc acttccgtcg 251761 tcttccttag caacgcctcc attgctctga taataactac agtactcatc tccgccgtag 251821 aagcttgttc ctccgacgtt actagagatt ggtgctcgcc ttacaccaat aaacatctca 251881 tcagccgatt tcctcataaa tacaacagaa tcaccagcga ttagcttctt attgttcacg 251941 aacttgctcc atccggtggt taacaagtgc ctcctcggtg tggttcgact tgacgaagag 252001 cctttatgcc aacaacaact gctttcgaag cggagctttt tgagtgtttt ttgggctgct 252061 gaaagtttaa gcaccaacag gtgagagtgg aattttaaac tgactgtaga gaggtggcgg 252121 aggcaatcaa ccacaggtgt ggcctttgca tcggaacctc ctaagtcaaa ttgataatta 252181 tcctgccatt gaaggttggt 
ctgttaccgt cattgaaaaa gaagcgaaca gagcggcaac 252241 attgctcgga gtgtcgcaag agaccgaagt accagtccta tatatctagt gaggaacctc 252301 ggtggttggc tgctgttcta tttcataaag agaggaaacc cttcgaggtt ggccctagag 252361 tgaagttgtg aagttctcga acagaccctt ccaaaggtcc gcccggttca acttgagtac 252421 tccattcggg gtgatatata aaggctaaaa tcgtcagcag aatagcgagt tttgaagccg 252481 gttgaagggg cctctgagac aaggaagccc ttgcttgaaa gagtcaacgg agcaatggat 252541 tattttccgt tacttatttt ataataaagt ctgatgacca caacagggct tgttgaatgg 252601 caggcttgac tggcaatgga aaggagatgg ctcatccatc tcagatgatc ctcaatcaaa 252661 gactccttct agtggttcgc ttaaaagatc tgtactaaaa acatccctcc ctaccgggaa 252721 atcacaaaaa aagagtacgc gaaaaaagca cctttaggtg ccaggagtgc ggcgtccagc 252781 catgcttcga atcgagtacc ggcaccgatg tcctatttcg tctgattcaa caagcctcct 252841 atcccggacc aaaagaagtg gctggataac ctttcttccc ttctactttt agtagtcgat 252901 gttataccgg tactaaagag aagtaattga tagtgttcaa catccctatc aatgacttct 252961 aacggtcagt ctatagcgct tctagccagt ctgacttaat aaaagatctt ccagccaagc 253021 ctgtttcagt tgaaatagaa gtcagccttt ctctggtttt atatatccct gtgactggtc 253081 ttactaagag cagctactgg gtttgcttac tacagaccct tcattaccac tttatcgata 253141 aagggattga ctgctctagc tacaacggtc tcgctcctac tacttttacc tcttattaca 253201 taggataaac cgagaagact accgacaaac agcaggaatg aatgtgaccc acggctacaa 253261 acgaatgaat agaggggttt ggaactagaa ccgtatccca actcaaggcc ctagaaaggt 253321 agggcgatgc agtaggaaca gattacttgc tgcttgatac tgaaggaatt tagtgtactc 253381 atcctcacat acagatagag ttttatgctc agtagtggta gaaggattag gagggggaac 253441 atagtcagaa gtagctacat tcgcaaggtc taccaaatct caacaagaat tcactaggct 253501 taacgccctc ttcaccgaat caccatagtg agaacacttc atcctcatcc acttctccac 253561 tgccataaaa agagccttaa tctccttctc gggcaggcca taacaccttc tttccctcag 253621 ctaaacacca tataaagagt cactcctcga ggtaggtcag cgactggaag agtcgtttga 253681 aggcgatcag cagtcggtct gataaagaag tctgtatgac catctctgtc tgttctcctc 253741 atccccctat cataactcgg caaagcgtta gtgcctttcc agctagtcgt tccaacagga 253801 atgcttaaga taacgaacct tgaccaactt gttcaaaaac 
agaggtcggc ctcttcatcc 253861 ttcaagcttt gagtggaccg ggtagcgtca atcagagtgt gaatgaaagg tcaagggaac 253921 aagcaacgga ttgaacaacg ttagcgaaag ccgttgcgct aacgcgcatc cgttttcttg 253981 ctcgtcttcc ttgatctttt aatatggaag gaaggctaga atcttctttc cgctttcaaa 254041 tcttccgatt tcgatgagaa taaaaaagat caaagcctgc agggtcagaa taatggaatt 254101 agactagtta agttcagtta gcaattcaat cagccttagc taataaagcg aatatccttc 254161 cgcttcaacc caatcttact cgaagaacta gttatgtttg ccggaagtac gatcagtgtg 254221 aggggtagta taagtgatca aggggtggta actcatatta ctggaggaaa ctttgcacag 254281 agttccatta ctattaatgg gtggaattgc cacaacatgg gcgttcttct tagcaagaat 254341 tattgcagta ggataatggc taggaggatt ttaaaggcat tatggcatta agatttccaa 254401 ggcttagctc agggactgct gtcagtcgct tcccctcatc aactaattaa tgagagcttc 254461 cgtcgtcagc tcgttcttga cagatccgat cgggtgttca taatctggag taaaaggatt 254521 cgaacctttg catgccggta ccaaaaaccg gtgccttacc acttggctat actccatacg 254581 gcctctgagt tttggacggt aagggggggg agggggaagg gagaagcagg gcggggttgc 254641 gcgcgagccg tgcaagaacg cgggaatata gcaagtaagc agcaaggttc tttaggaccg 254701 actacaacaa gctcgcgact ctcatagaaa gaaagggtaa gctctctgct ctctatttgc 254761 tcgcaatgct atacctatca aagttggcga acgcttctgt aaccttagac tcgttacgct 254821 cttcgactga actcgtttgg ctccgcgtcc accgaaagtc ggtaaccttc gtcaccgtca 254881 gcatttggta ctctctcgat agcttcaata gttcactgaa agcctttggt gtaatgaacc 254941 gcaagccctt ctgaaagaaa gaataagaga aaggcttggt tgcgggaacc gacggtgacg 255001 aagcgggccg atgcggccgt ttccctttgt aaggtaggcc ttttgataac taattagagt 255061 tgacatgaaa tggatcacgg gaaaagacgt atccgatgaa tgaatgattc aatcctccca 255121 ctcacacggg ttcttctatt gaagggcttc ccccttcttc tatgtgaggt ggggaaaggt 255181 cagagcggaa cctagataga aacgcggagc gccgacccct tagccgatac agttaccatg 255241 cttaccagct cttaccattg ccgcttatag atgggaggta ggcgaagggt agcttgcttc 255301 tcttctccta ggctacctcc actccacatg gttattcgcg aagaaaaggc agcgagcggt 255361 tcttcgctta gtagagctac cggccggcgt acgacgaccg cttcttgttc tcgcccagcc 255421 gcttcttttg atttgattat tatttgaaat cctagactaa agaagaagct ccagaatccg 255481 
cgcagggcaa aatctgcagc aggagaagaa agagggtcca ggtcaatttg ataatgaagc 255541 gaaggtcccc cttactccct attccctctt aaagctttat aaagctgggc ggttggttaa 255601 gaactactga ctgtaaacta tagatagcag ttacagtcac tcaattgaaa tgttcgcctt 255661 tggaatgaag atagatgagc aggggagttt tcactcctac gtggtattct cttccaacaa 255721 gggttacggc taaaacaccg cttacttttc aaagacaaac tgctcctaaa ccaagcctga 255781 acacgtttcc agttgttgat cgacgcgtgt cgtgatcacc agcatagtcg gcgtcacaat 255841 agccaactat cttacactgt tctcctttct tatacagaag accatagtca agtcttcctt 255901 tcatgtacct caatattcgt cgaaagtgag gtttctttgg attttgcatg aatcgactaa 255961 ccactccaac tgcatatgcg atgtagggcc ttgtcagggt tagcgcttgt gctaagcgat 256021 aagaattcgt tctggaatgc cggcgcggac tgaaggatga tgtttattag ctagacaaat 256081 ccttcattcg aaagacgaca atcaaagcat aacgactcgt ggaacattca ttcctaaaag 256141 gtgtaaacct gtgagatcgg acaacccgta aaaggaagcc gctaggcatc aagccctatt 256201 atcttacacc tagggagggt ggccgttgtt gggtttttct caattagagt cgcaaggagt 256261 ttgctgctcg ttggtagtgg aatgctgact aaccgtctcc gtccccctat gatgagaaaa 256321 agctttccga gtggacgagg accgatcttc tctttcttta gtcttgctcg tcttatggca 256381 ttagcctttg cggattcctt aatccgcggg acagctgtct cggcacatgt caatccttat 256441 gtccccagcg agcaggtcaa ggtactctat gttgctgctt gcctttatct tccagtctgc 256501 caagcaagct tgtttattct atgttattga taaatcgcta tttcccccaa accgctagtt 256561 ttcatacgac tcagaactta taacgttgtt ctgtctccaa acagagagct tgtaggcctt 256621 attctttgga taaaagtagc gacgagccga ctactacgac cacatgcgca tctagcgcag 256681 tggcttgtca cttcgtacct tgaccatctt ccgaagttct aaataatcta ctgatcaaac 256741 gctgtagggg cggactgctc tacattcagc cacgccacag tgaccccccg aagcgatctg 256801 cctcattgca ggacgaaatc cggcagccaa ttgctggctc tgaataacca gcccagcaag 256861 aagttcaatt cttccataac ataacggggc ggggttgcgc gcgagccgag tgacgtagat 256921 gcacaagagt acttcgcgcc acaaccatct cttttttata cgttctacgg accgatgcct 256981 gctgcttcat ctggtagaaa agaatcatag atatgccggt cattagaagg aagaaccacc 257041 ataaaaagat tcctcgtgta tcatctgtag caaaactatg aacgggagct agcaatccgg 257101 accgtattga aaaggttcct 
gagacacagc atggaaaagt cacaatatta agaaacgagg 257161 tccaagaatg aagaaggggt aaaattacag aatgaatacg agctgtggct aatacccgag 257221 gcataaaaga agcattttct acgggatccc gaaaccacca gccaccccga cctaattcat 257281 gataagccca ccaacttcct ggcaagatgc ctacggttaa aaaccaccaa catgtcaaga 257341 tccaaattcg aattggttcc tggtcctgat cagagaccac tgtgttcgcg ccggcggtcc 257401 aacaaagagg cgaagtagtg gtctctttct ttccattacg aacgacacgc ttcgcctgct 257461 ccctccccgt gtccacgtcg tcgaagaaag ggtcttttgc cccgatttac cgtcgtaggg 257521 agcactcttc tgagccccac atgattcgcc gaaggatgag atcttgccta tgcagtagga 257581 ttaaaaggaa agttatccga acagtgagac ccgggaagac tactacttct gtatttttga 257641 taaatcaaag ggctatttcc tgattcattc cacatttcac attccccttg ggattagcaa 257701 aatcaaatac ctttttctcg ttatagctat cctttgcttg tgtaatgact cgaactttct 257761 catttatgaa gacttgaaat cgcatgcata agtgttggtc catcagagct tacagttgga 257821 gatgccgcaa agcaaagatc tggacgaaat gagagtcctt ccgcgttgga gctagggaac 257881 ctagggccag attgatcggt aactgaatca gttaaccaaa agatgctgac acgatcatta 257941 cttgcacaaa gataaaaaaa ttgagtactc tgcccttcca gtgctacctt tcttcgatgt 258001 gatttgattc tagaagatga gcttgctcgg atccttttcg gtagctaaaa gctttcccat 258061 aaaaagaata tgtaaccccg tatcgataga cgaaaaaacc caacaatgtg aaaagaaaga 258121 tcttctccgt ctctcctccc gtaaaataag aaagtatgaa aaactggatt aagagacgca 258181 aattcttcaa agaaagccat agaagttgta tcatctatta tattccagcg tgactgctca 258241 cgtcgggata atcttgattt tagaatacct ttaatatact ttgaaatata agtccactcc 258301 ttcttactgc gtatgctctc ttctctcact gcgaatgatg ccaaaaagag ttcagctcgc 258361 aagcggagtc tcattggccg tgaatcagaa gctaccatgt tctaacttgc acttagtcaa 258421 ttgggaagat taaccacaac aaatcgtaaa catcagatcg gacctttatt ttattagcgt 258481 tctttctttt agtttgaaca acaaagtccg ggccaacaat atcccaagtt aatctgtaaa 258541 agtcagctgg aaaaccatcc gagccagggg atttgtcgag cggcatgctg aaaagacaat 258601 ttggaatctc ctccgtagtt aacctgttgc agaagattcg cttgcatcta gttggagcat 258661 ctgatgaagt cgataagaga gcgaagggat gagatagagg ctcgagtgtg gttcaccggt 258721 gaaaactgaa acaggttgga gaagtaacga atggcttctt 
cttggatatc acttgttcga 258781 aatagatcca caccttgatc tgatgtaatt ctgcggattg cattagcaga ttttctggct 258841 cgcactctat gaagatcgcc gatgaaaaaa aggcatatcc actcgtagat cgggttggtt 258901 atatttatat aagggctatt ttgcatattt ttaatgttga taggcagtag ggtaaggaaa 258961 ggatcatcaa tctatgataa tttccgagtt gcaaaatcca aatacaacct gacttgaatc 259021 cttactggac tacttcatat gccttccgca cacaggacag aggaattcac tccacacacg 259081 agccaactcc cctaaaaggc ctttctttca tgcaattggc gacgccatcg ccttatatct 259141 ttcttgttaa gatcttaagc ttcctggacg tctacaaatc tgttaggcag gaaatgttta 259201 catcggggga agagttcatg ctatcatgtc gctaaaaagt gggctttttg cctctacctt 259261 tgaaagtagt gttaaaaagc cttacttcat tggttcgatc agtcttttcg tataaatagc 259321 attgggcgtt tagcaccggc cttctattca ttcccaccct gaagcagtta ttcggactaa 259381 gatcgtcatc gtaccctcaa ggcaagctcc tgaagtagaa gtgtgaggca agacgggctg 259441 ataaactacg aaaatcgttt tttgagggtg gaataaggca cgcggaagac ttaggagtcg 259501 gagatcatca aaaaataaat ccaatgcttg tcgtgggcaa tcctatagcc cctttcctcg 259561 ggatcggctt aggctggcta gccttttcta tctgcaaatc tgctaaccga ccgacggata 259621 ccactaactg actaagctag agaacaggag accaaaggtc ttcctgctta atcgagcttc 259681 ttttagaacg ggtaaaataa ggagctttcc ctccttccac tgtcactttt ctcattcttt 259741 tccctaaaaa aaattcataa gctaaagaag gaaataactt acaatgctac aaagaaggga 259801 acttcaaact attctcaacg aaaaagcctt tcaattagtc ggtctaacct tcctaacttg 259861 gcacacatat gacataaagc catcaaccaa gtcagccaac taccagcagt cgatgagagg 259921 gtaaggagga agcgatagcc tgctctcatt gatgatttcg aaccttctta ccctctcttg 259981 gatgaagcaa ccatataact gacgatatgg attgagtagc gagatcagtt acactagccc 260041 gggcatgcct taggcttcct ctgccacctc cacgggcgaa gtgagtcaaa tgcactcttt 260101 cttggctagt gtagtagact ccattatggt cattcgtaac ggctccatat agttttattt 260161 tggggcgtaa cgttgtgatt gttccaccga ctgaccgata ctagttccag aggcatcttc 260221 cattcatatc gatttgggtt tttctgcacc atattttgat ctgcctcttc ttcgacccgg 260281 aattcccatc aaatccttta ctcctcgaat acaatgggat tttacacctg gcgaatcttt 260341 cactctacct cctcttatta acaccgtaga atgttcctgc gaattatgac cttcgcccgg 260401 
gatgtgagca aatatatcat gtcgattgct caaccgtact ttggctatct tacgtggagc 260461 ggaattaggt tttttcggtg ttctcgttga aacacgcggg catactcctg ttttctgggg 260521 acatttatcc aaagctcgag tacggtccgt gcgccgtttt tcttctctac catgacgaat 260581 caattgatta aacgtgggca ttattctctt tcctttcttt ttccccccat tttgtgccct 260641 atcactttac tcccgatccg aagcacccct tttccattca tatagaaatc caatcgtcaa 260701 aataaataaa aaggccatca tggaccaaaa tccaaacaga tcaatcttgt tgggaggtac 260761 tgcccaagga aagaaaaagg ttacttccag atcagggatt aaaaataaaa ttgaaacaag 260821 ataaaatcgt atatcgaaac gacttctggc atcaccggaa ggatcgaaac cacattcgta 260881 ggccgacaat ttttctgggt aggtagaact attggaagca aatggaaaag gaacaccgag 260941 taggatcaaa gaaactagca gactaatcac taaatagata gaaattggtg caaattctga 261001 catcattaca gcccactttg ttttctcgct ccttgctggc gaagaagcgg catatcgaaa 261061 aataaagaaa gaagcaaaag cccatcccga aaggcttgca gaccttatgc gatagggtcc 261121 cccccagggc cgcctttctt gaggaatact ttctcttgta aacgagaaaa ctcgtccgag 261181 aggggcccgg tttcttggag gtccttccat atatcataaa gacgctccaa ggactcttcc 261241 ccactagtgg tcctcggatt atcgagggcc cgagccccac gcgccatcca atcgccggtt 261301 gggtcatata gagccatccg ttggatgatt tgggctttga tctcgaatag atcttcggcc 261361 tctattcggg ccatctctat gatctccgca gagggagagc cttcggggta tttcgccaga 261421 aggcggcgtt ctatcgcctc tacgctatcc cccccaatca tttcatccct cctgtatgga 261481 taggggacta ctggcccagc ttcctccccc ccgggaggga ctggattagc tggaagcgca 261541 ggcccagctt cctggggcac ttgatttacc gagggttccc cctcttccgt ttgggacggg 261601 aggggactgc cctgatttac cgagggttcc ccctcttccg ttcgggaccc cctgccgggg 261661 gggagtcgga ttcctcgaga actcccagtt caaaggaatc ttcctcccac atggaagaac 261721 tgcccgaccc ggaagggccg gtagaaggaa gtgcttgccc ccctattgca ttcaagacca 261781 aaggtcccaa cacgggcaaa gccagacccc ccactaggtg gatgacttcg atccggatca 261841 aagatcccac gaaccccatg gaaaagacca aaaagagata gataatcttt ttctgcggcc 261901 ctttcaaacg aaaggaatga aaatgaaagc aaaataggaa gcttacgact aagatcgaag 261961 agaatgaaac gcgcgtagtg gtcattatag cgcttccttc tgatcctaga aaacgggcgc 262021 aaaaactacc tgccaaagct 
ccgagcaggg gcaacaacag tcgaaccata tttttcataa 262081 atttgagatt ctgtagtttg cttcgttatt agcgtccgag gaacgcatag aagtggttgt 262141 ggaatcgaac cactaacaca aggatttaca gtcctttgct ctaaccatct gagctacctg 262201 tgaaccactt tgttctcaac ttatccaacg aatctctttc tcttattcaa ttccactttg 262261 tttagtttag tgatagttag aggttagaga gctagatcac tcctctaagc agccttctga 262321 ttatatacgt attattctat ctatctgtcc ctaccttata ctatccttcc tatctgccat 262381 tacttgtatt ctatcgatag gaatggttag ttagtgtgca tagctgcctt gtttctgtgt 262441 atgcacagca gttgtttctg tgtgaagcat gtatggaggt agtgaagaac ggcctgcgaa 262501 ctaccgtagc taagcggcac tagttcctgt tgcttgcagc ttcagtagaa ttcatatccc 262561 ttgttggtat gtgagtaagt tgttgcatga atggtcaggt aggtaagcgt agcagtattc 262621 ctgttcatgc atgtatccct gcaggttgtc ttccctatcg aatgagggaa gtgagctggt 262681 taatcgcagc ttacttctgg agtggtccct aggcgggtgt agctattgtt gcttgtagct 262741 gctgttaagt atctatcttt gtttgccgat cactgatgta atggctgttc cgtgctgtgc 262801 agcgcgttcc gaggtctgct ttcctttctt tgtcatttga ctatctctct acgttcgatg 262861 ggaacagttg tttcacgact aagtaagaaa ggttagcgaa gccgggttcc tatgtatgca 262921 tgatatttgt tccgaggtgg ttgttttctc ttcttttgga aattgcccgg ggtttaaata 262981 gggcaagaac atcaggaagg tcttttcgtt cggatacaga catggcaatc tacgtatatg 263041 aagtctcgat ttaagctgcc taagcataga tccaacgcgc tactcaagta ggcgggaaag 263101 aaatgccctt ctagcattgc cacttgcttc aagctgctct ttctataagc tatattccag 263161 tagtgattcg agaaggaatt catgagtgac tgaccatcca gcctttcctt cgtatgagac 263221 cggacatgcc agcgaatcga atagcgtagg aagtcaacaa gtcttaatgg tggaaagcta 263281 gaaagggaaa aagcattagt ttcaggttgt ttacatactc gactaaaagg aggaacccct 263341 cgggcgaata aagtatgtaa ctggcttaag aacaagaaag caagaaaaag gtatgaagtt 263401 ctaattgtac aatgctatta cttcatccgc tgtgaatagg ctgaaggaat tggatcttgt 263461 ctctttgaat ctgcttttaa ggcaggaagt cagaactata aactttctct ttagatatcc 263521 ccatacgttc ccttgtgtga aagagctgta gaataagtcc tagctatctg ataatagcag 263581 tcttgttgct atgggcttag cactagtctg caatactcaa ttcataccag tagtacggtt 263641 gctaacttgt ctttcaaagt tgccctggtg tatatcccgg 
agtacttcga ggaacgccag 263701 taggcctatt agtgtataga tccttcccgg gttgtttgct gtgtttcact gtttcttcct 263761 tgggaactgt ttgttttttc tcttctgtta tagtaagagt tctcactcct tcacctcatc 263821 ggtcaagtgg aagagtcacg tcagtcccga ggaacgcctt tccgctcatg gtcttcctac 263881 tccgatctcg cttcgggcac tataagggct agctggcttg ttgctagagc acagttaaga 263941 gtcactcctc tcctggttct acgtagcttg atgagataga ttgctggcgt agctgtgatt 264001 cctaagtaaa caagttgtag ggaaggagtg ccttccgggt tgtgcgtatg catctactga 264061 ggtaaggcac gattgagcgg ctagctgctc tgtcattact agagagtctt tcttagtatg 264121 agccgtgata gggaagatgt gtgctcgaag tctccttagg tcttgatcta gttgtgactt 264181 cttttttcca gcgagttacc cgtcgttgta gttgttgctg ttccgggcgg tgcttcctgt 264241 ctgtaagatc cctttcttgt ctcgtccggg aaggatgtat gaagtattgg tccgtagctg 264301 ccttgctttt cttgtttcct ttgctttcga gtaggtgcag ctgaggcagg taccaagagt 264361 caagagagaa cttctttgtt ttgtttgaat aatagtatag atttatacct tctctacgga 264421 atacctgttg catcctgtct ctcctgccaa gcgaatggat ttgtcatgca tttgttgcta 264481 actgccgaca caagctggct tgttaggaaa aaaacccttt tcttcttctt atgaatgttg 264541 cttactctgt ccactgcgat acaactgttg taagctctgt tttacagctt gtaatgggat 264601 ttctgctctc ttttcaatat atcccctctc tttttcctag gaaagcctct gttgaatcta 264661 tctctctccg ttcggaatgg gaactccaag ctgtttagct ttcattcttc cttccggtac 264721 cttttcacag tgcaactgag caaattagtc aaactttgta ctcggaatgt tactttcctt 264781 agttttgttt caaatatctg ccgtttcatc tgcttcagga aaggcagttg acttggtcga 264841 atcccgatta cgatcaaaag gaaaatcttg acttgcagtg gaataacggc tttgcggagg 264901 aagatcatca aaggctgttg ctgggagagg tccattagtc cggggtcgat tgcttccatc 264961 ctcaattgct tcttttgaag gttacatgcc tgctgttgaa agcctgggac gagacagagt 265021 tagtaggcgc ctccggccct gcggttgaag gtgatatacc aacaacatga gctagcatgg 265081 tttcgagtgc gataacatgg gatatggata tgttttcttc gatgcctagg caggctggga 265141 ttggaactgc tcgtgattag accttctata tgactgaact tccccgcaac aagatagaaa 265201 gaagagtata attcggagaa agatgaagat cgtccaatag tgatctacgg atagctaccg 265261 acgtgagagg cctagagtga agctctggca gagtggctcc cttgttaagt gtagggcccc 265321 
ttggctttgg acaactcacc aagccataga gactaaagaa agagctaaat aagtccttgc 265381 tgcaactgaa ggaacgaaag aaatctactt ccgaactgag cgctaaaggt acatacggag 265441 agacctatga tttccattct gtttaggcta tgaaaacctc cccggatgaa ggaatgtaat 265501 acattgaaga cttcgaactg cttacgtaat ggaaaaaggg gctagatggc ttacggggta 265561 tagggttttg aggaggaaaa tgactcactt tttgacagaa tcaaggtcct atcatttctt 265621 ctttctgagt gaaatcaaag ctagaccata aagggaagaa ggttggttac ctgctatact 265681 aagagtaagc gagctaacga ggcctaatcc tgggttcacg gcattccatt cctaccaatt 265741 ccatttcgag gggtagcagt agattagatc ctgttgattc aaaaccttgt tcgaactctc 265801 tttggcatct cacaaatgtt tcaaaatata tagtacagat tggatcctga tcaagctcaa 265861 gctaaaactt acctacaaga gaatccatac tgaatctatt atcatagatc ttgctttcgt 265921 cggatacgtc tctttctccc caatttggat ggatcctctc attctaattc atcgctacta 265981 actagaagaa gttcctaaaa atcgaaagct cccctctttt tctccatcca taaccaccag 266041 ttgtagcgga ggaagtagca gtcagtggct tctcccgggg cccttaccct tgctcttttc 266101 tttctcgggc gcagtaaaaa gatatacgag gttatagaga aatagacaaa gaagttcagc 266161 tgcaagagtg gatcctatcc cgccgtattt accagcggga ttcatccgtc taggcattcc 266221 tcggaggaac tcgtccaagc acatgatgtc tcctttatct gttttcagcc agcctaagcg 266281 ccctacttta cctacttgaa atgtcaattt tcctttcatc aaagccttcg tgttcatcga 266341 atactcgctt cactatgtgt agctcgctcc cctctcttga aagccgaacc ggcgccttct 266401 tttgataact gggttagggt taccgaaggc aatctggcag gcaaagtttc cccttttatt 266461 cggcagccac ctttgatgta actaaactag atagagtttg ggtcacgaat tcggatttac 266521 cgatccgagg gaaccaaatg aagcatcttt ttctccttcc gaaagaaaag aggaagactt 266581 attcccaccg aagtgcacta atagaaaacc cctaccagtc gaagtggaat gatgagctaa 266641 aagtcccttg tacgatgaag ctgagtcaca gaagcgggag aggtgtcttc tttgaagatc 266701 gtttcctatc gagtggataa aatcccaagc cttaagagcc cattaaccca tcttctgttc 266761 cgctagccag ctaagctaaa gcttgccatc gaccggttcg ttcggaagta acgcatgcgc 266821 tcacctatcc atcaaccaag acagaaggga atgaccaacc aagcgagcag agcccaacta 266881 ccaatcctat gttctcccaa ttctggaact ggtggcaatc taacgaattc gggaaagcaa 266941 gatgatcgac actggaaaag 
acgtcttggc gagaggtgct ttagcaactc gactgaaaag 267001 gagagggcga agtcatgact caagcacttg tctagagagg agagggagct tacagcattc 267061 tcaggcgcgg atgccctaga cgagagagtt attgagagag aaaagttagg taggtcgtca 267121 gtaagttagc gagatagagc gagactgttt gttaggtagc ttagcgagat agggagcagc 267181 tatggaacaa ggggtctgtt aaaacagaac taattaccct tttttttgtt tgttcaaggg 267241 gttcctcgga agagctcgca gtgaatgccc agtgcaatcc aataaaacaa cctaacagaa 267301 ccgctccgcc cctccatgta cagaaaagta aacccaaaga agttgttctg gacgagagag 267361 tgagattaga ctgcttaaac cgtagatagg cgttcctctc cctttagtcg actaactggc 267421 gttcctcggg aagtacacct taggtttaca aacctaacac agaataacga gacgaagtcg 267481 actttagaac ctattgatta gagcttggct gctcaagagc tacaaatgcg aggggatagc 267541 tgcaataaac ccatgtaagc acagggagaa gccggctaaa gcctacatac ctgttccctt 267601 aatgcaacct attcccaacg gaggagaaga ggaatactaa tgaaatgaca gaagctacag 267661 cgcggacttc gtgcttgcag gaacagcagc actactatgc aattaattag ctgccccatt 267721 catgcgccct ctaaacaacg aaagctacaa catcaacgaa gggaatgggt tcctccggca 267781 tctaggtaga accttccaga aaaacgggga gtagacccaa ggagcgaagc tacagccaag 267841 gaagaaatcc cgggcaggga catgaacaag atagatattc tacgcagcca agccccacag 267901 cacaggagag gtgtaggagt aggaagattc gctttttata gatggtcaaa gcgctcaaac 267961 ctgcttacag gatgaattag aaaaggagat agtaccaagt caagatacga aaagatgtgc 268021 tgcacgactc agttgaccac aaggggcata gtcaggaggg tgtaaaggac tggagattga 268081 tcacattatc cttcttccat tagctcctat tacatccttt gagataaagt catgacgggc 268141 cactgcctga attagggatt ttttctataa tgcccttatc acgatctagc cccttgtcta 268201 tatcatggtc tgttttcata taaccttact actgtatagc atgctaaata aagcgctgtg 268261 ggatcaatca ttgcattttt ccattcataa ccatcatata cgctctttca agaaaaggga 268321 tagaatgaca gtctatcagt cttaccagca tgcatacaaa agggagtgta gttgcctatc 268381 agctggttaa acagataaga ttctgatata ttatataagg aataaggatc ccatatcgat 268441 cgaatgaaga tggatgggtt gagataggtt cgttccaact taaggtaagg ccggctgtta 268501 gcagggatcg ttagctgtag aagtccattc agtagaggta gggcccggga ctcctttctt 268561 tattagacga gaaaagggtt ggcacaaggg gcgtagcgct 
gaatgaaaat agaaaagcat 268621 tccaaggggt tcttccaagt actcgactag acacaaaata aatgaatcct tacttcatca 268681 agaaaggaga cggagagcca cgaacctact ttcctgtgcc ggaggagaac cagaaagaag 268741 tcttgatcca tggccgggtg caggggaggt agggacggag aaaacgtgct tcatatacgc 268801 cagcggtcga agattggcga aagttggtct agtcctatac tatatataaa gcgagtggtg 268861 acatcgcaag ataaaagcat ttttttgaag tattattctc aaagcaagag attcaagtaa 268921 gaaccagcag aaccatcatc ggtcaactga ggttaggact actctgacac tcgcttattc 268981 ctctctaata ctttcttctg tctattcctt tctttcccgc tttcaagctg aaggaagaaa 269041 gtcaatcttt cgtagtcccg tccggatcaa tagattgggt cagatcaatt ccttttcatc 269101 cccgaggaac gccaattcct cgactaatag gtctgcaagc cttatctttt ctttttctgt 269161 tgtaaaccgg cctgttagga tcttctcgcc gtaactagta ataggcttat ctgccctttc 269221 attcgcatta tcgtgagcgg gtctggtgaa ctaccctttc actggttcta gctactattg 269281 gatttgaacc ctagcttagc actccctctt gttattgtta gggcgctagc gtcccgtcga 269341 atcttgattc tccttgacag accagcagac cttacatacg caatttatcg attaattaac 269401 gtgcaagaga aagcgatcca caagagggtt cgttcatcta acacttaccc actatcggct 269461 tcaagcaact atcgatgtag tctgataacc tattgtataa ggtgtttgac tatgtgtgag 269521 aatttattga caattttttt tcacatactc gatttaccta ggaagtcctg agagagcaac 269581 agcagctgga attggaccta agggctgaaa gcttcagact gtctcaatcc aggtaactct 269641 tactccatta cttacaaacg gattaaggta gctcagatct agctaggtta gacggtaaag 269701 aagtccctca agcattgaag aaaggaagaa agcctgtaaa ctaccttact aaactgcttt 269761 ctgactctct ctagccagta aactacctct ccttccagtg ttttttcctt tgtacctcca 269821 cctctctgaa gagtcgatcc cgacttggga agcaagggtc tcggagttca ggaagcagtt 269881 cagcaacaca caaagaaggg ttggagtgcc cgaactacta aaaaccaagc aaaggtataa 269941 tgaacctgtc atggaattca ctgcaaggtg gcgagctttt acttttgcct gtccccataa 270001 gtttacttag caagagctcc tcaagatgtg catgaacaac ttcaggcacg acttgtcgtc 270061 gatactcctg ccccaaacct ttcctttaag ggattcgacg acttgtgcac taaagactca 270121 cgatgtggaa gcacatctta gcaaacggcg gcgtcctacg aaggacttga agttcgttcc 270181 gttaccttat taagaaagta gacgaatcac tcactttctc tattaaaaaa agtagactta 270241 
tttttcacag cctatatgcg tatgcggaaa acaagagtcc ttatttttta tttctcttgc 270301 cctgacgggg gctaccgctt tacgttcagt tttctctcta ccttttctta ttttgacttg 270361 attttcaaat cttttatgct cggccatacc aagatgggaa agacctagcc ttccctccct 270421 ttgaagtgca tttatcagat cagctcccag agtaactaga aagagggtga aaggcccaac 270481 ttcttcattc atggcctttt ttgtttcttt gattgatctc cctttcgtat tcattcgacc 270541 tgaggcgttc ctctcagtca tacaaaaaag aaaggttatt tcatgctctg gttccgggtg 270601 ttttccagat gaaatgaaaa caaaaaggaa aagggtcaaa ccatatctta cttaataatc 270661 tggcttaaag gacggaacca ccgaggaata ccggttagca gctctagtag gatgtatact 270721 gtgctttgtt ccctgtctct ttctttcatc ctccgagtag ttctccaagc ctaataggaa 270781 catcgctagc tcggtcagtt tgtgtttctc ctgctttctt tccttctgct tccgaagaac 270841 tgttggtctc tttgtatcca ctgaacactc tcctaatctg tcttccttct cccgtactct 270901 tcattccgac taggtttact agcctaggag gataaccgtt tgaatttggt atataagagc 270961 tctactgcta tcctttcccc tatcatactc tctatcttcc ttctctcaat cgttcagatc 271021 tttgtttaca tcggtatatg taagggacat cttatgggtt cttctgtagt aatctttcga 271081 cagttccggt taaacttcta ctttcactcg gaacatcccg ggtgttggct tccctgtaat 271141 tccgtgtaag gaaggttagt aagtaagcta agccctgtca ggaagtaatc ccaggaatgt 271201 ttgttagttg ttagtaaggt aagccaggta agccatgtca gttggtaaga aagtaatccc 271261 ttccagagtt gtaaggttag taagcttttc ccgtttatag gaatgggttt ctaagcctag 271321 gaagtcaagg aagttgtgtt ggtctttccc ctggttcctt gttcccctgt tgcctttctc 271381 ttcatttcat tgacgggttg gagaaatgtt cacgacgtaa gtcagtggaa gcaaacgggc 271441 aatcggctgc ctaaaggacc ccatttgtgg ttgaaggcta agttcattct tcgatatata 271501 tcactccgct atgctagaag actgagcagc ttcagtttat taccaacctc gagttcaaca 271561 actattgggt aaagcctaaa ccttactcct ttccttctat cggaactggc aacagatctc 271621 tagatccgat gcaactagaa gtcagtcgat tccgtctgaa aatggtttca gtccaaacct 271681 tttttgtcga gtgaaactcc gcttcttaag agcttctttg gacagactca ctcccttact 271741 cgctgggttg gcgtatccaa ggtagcgcca gcagcaagcg taggggttac tacgtaagtg 271801 cgttcgttag cgcgttacgt tttctcgctt gctagacacg acaactaagg atagacgcgc 271861 tgggatagca agcaagtttg 
ttgaaagagg ggcagtagat tctagatctg ctgaaaacca 271921 ggttataggg ctatatcttt attccatggt agggctaggg gcagggtggt ttaaaataac 271981 tccttttact ttccctccta caattacaat gggaagggac gagacaaagc cttcttttaa 272041 actaaataaa gtaggattcc gctttcactg ttctcaaggt atctcctgac tcgaagatta 272101 gggcttcttc ccagccggag gcatccgaaa ctggagaatc atcaaccctc gcataaagaa 272161 acaactaaac tatacgcggc aggttggtca aaaagggaaa gaaaaaggtg acatacattc 272221 tattcgggga cataatagtg attcctatcg agcacactct gttatatcga tcccaatcaa 272281 tagtttttgg gggaagtcga tggtcccccc catgctttat gacatctttt ctcttctccg 272341 tagtgatcag tgtatgcacg tagcatcaca gtcaatcttt tatgcttggt tccggtcgcg 272401 ctcttcttct aactgcagga gcccgacagg gctgtcttac tctttttgtt gctaatgaat 272461 aacctttttc ttggtgattc atttccaagg gaataaaaga aagacctgtg actcattgtc 272521 tcaatcataa cttcaaacag aacctcactg tagagaggga gacccccgaa agcgtaagaa 272581 cgacgatttt tgagcctgag ggtgaggttc tccatcgacg agtgagaaaa gggcctggtc 272641 tcggaagaga tgcagctaac gtgagtaata gctgaacagt tgcctaatgc atgtggtcag 272701 catcagacaa ggttataaaa tacatcttta catcccggtc aaagtaacga aaaaagacga 272761 gttttagcaa ctttttcata cagttttgcc ctcttttata taaaaaacta acttacgcaa 272821 atagtaaatg agaggacctt agcacttttc gttatggagt gattcatcta aaattcccgg 272881 ggttttgtga atgaattcaa ctagaaaatc tcaattgcac gggattcgcg attcggtgta 272941 agttcaaaat ttatcgtgct ggacgaccac ggttgagttt tgttgtggat ccaactccaa 273001 gcgtttgcgg gattttggat agatggaata ctcttgcaac ggagcttgtc tgtcaaatct 273061 ggtctggtat ggtagcggtt cacaatggca gctcattttc ctggaatccg ggtgaaagat 273121 cttttctttc caaggtcttt gctttctatt cgacttagcg atggagtcgg gtcggaaata 273181 aatactaagt aaaggcagac ttacaataat acgcaaacaa gggtcccgcg atgtcttgta 273241 cattctaagt actttgcctg ctaagacgtg acaaacaatc gctgattctt tactaacgag 273301 cccagtagcc actaaggcga tccgaatgag gaaacgccgg agtgaccgag tgtcttcaaa 273361 aacactgcgc caagagacca agcggagcct ctcggtgaat gtggtaaaca aattgaaagc 273421 catatcaatc ggtataaaat gacaaacctt ttggtatgta ccacaaaaca ctaagaaaga 273481 ccccgccggt tctactctcc ttgacccgtg gacaaacgcc 
acgtgatgga cggcgcccgg 273541 ctctgacgaa gaaccagtaa gattaggtat tgcctaatcc gtaaggttgt cataccctta 273601 tcagctttct gaccaaacct ctgtgttgcc actttggtaa ggaaagtcca ccaaccgggc 273661 atagtacctc gttagcagga ccactattgc tccttttaaa gctttgtgct caagtgatct 273721 gggagggtgt caaacctccc ccagcccctt attcagggga tggtgccagg gagtttgaac 273781 tgttaacaaa agattttctc gatacaaaga agagccccac tcctagaata gttagccact 273841 cagtccacag attcggctta gtgaaaatag ccctagaaag atcaaggtgc ggaaggataa 273901 tttcctccgt taatgatagc tataccatag agttttccat caaggaaaga caatgccttc 273961 tcttctccat cctccaggta ccagggaaga actctatatt ccgggtccaa ggatgatggt 274021 tgcaacgccc cgcgcttcca ccaataagag gtgcggggga tggtgatatc ccttgaaaga 274081 cataactaac ttacaagact taaaatgaaa agtaaactaa cggaacaagg cgccggagga 274141 acccccttct aaaagtctaa gagctagaag gcggggtaca gcggttgctt cctgtaagat 274201 ttgaatagtc atggcctcat ggtcaagcca gaaaggaaag aaaaagtcta tatcttctgt 274261 cctctctatt actcgctcca ggtattacac tcgttgggct caaggtagca ttactaaacc 274321 tataaagagc gggatgctat tctaagagga atttactgta aaatagatat ctctttgctc 274381 ttcagccagt aagtctgaaa gcaagtcagt tgtaggccag acctgtcatc agtagttcta 274441 cttttgtaga ttgaagcttt tataaggcag gaatgggagc tgcaaaggca gagataaagc 274501 tgcgattgct cttgctcgac tcttcatggc acggttggcc gggaatgaga acaaaaggag 274561 taagaagaaa ctggctaaga ttcaattgac ctatatagca accagttcaa ctaaaggggc 274621 gtagcgggct taaggctgct agggactctg gggcgagaag aagcttttcc cgcattcctg 274681 ttgatgcaga gatcagacga gaaggcccat ctctttacta gaatccgatg tgatagaagg 274741 agcagttcta gcttaagcct aaggcctaag ttgatcctta cttaatagaa tagccataga 274801 gctcatcccc ggtgaaatag ttgatgattt cttgatggtt gactgctata tcttaattag 274861 agaatccgac tgggaacgaa tactttgagc tcttcttcat ggaatttttc cagctctcaa 274921 ggaagattgg atgtaatatt acacaatctc caatttatct ctttctcgct cggaaaacaa 274981 acaggtgtct ttgccccgct cacacagatt ttatcctggc ggttgttggt ccaacgactc 275041 tacttctttc ttttgaaatt tcgtatgaga agaaggttga caagaagaat aattttaaaa 275101 ctgggattgt agttcaatcg gtcagagcac cgccctgtca aggcggaagc tgcgggttcg 275161 
agccccgtca gtcccgacct agtcataatt gcgttttaag acctggtcta gcaacaaagt 275221 agaatcataa gcacactccc ttgtcacaac taaagagggg agcggcaggc ccttgataag 275281 acgcctttgc tttagcgcac ccaaccgtcc tggttgaaag gtcgacccac tctatcttta 275341 atggaatgtc gtctttggct ttacggaatg agactgaact gatctcatat cttcatctga 275401 ttcaattagc tcatccgaaa ggaagaacaa ttaacccacc tatatccatg tgattaggag 275461 acttaaaggc aactcgagtc aaacgagagg agacgtcccc agtgatcacg attcaatttg 275521 ttaatgaacc agattttttt gggaagaaaa gaaactggtc aaagcagtcc gggctgatct 275581 atgcacagtg gattcatcta ggggtctata tgcacgtgtg tgtgtggagg tagatttcaa 275641 attcaaaaag agttcccagt tttgtcttaa tgatgacatc cagagggtcg aatacgaagg 275701 cttgcatctt atatgttttt gagtgtgggg aatatggcca ccggatagag ctttgtccga 275761 agcaaaacca gactgatgta gaaaactcac cggcggcgaa ggaggttctc ccaaacccgg 275821 atccgcaagt cgggtcggaa caaaggcctt tcggaccttg gatgctacct cccaacagac 275881 gtcgcaagag gaccacgtgg tcagcctaag aatcagggac cacaagatag caaataccca 275941 aaaggagaag gaaggatgtt aacaaaccca gaaaggtcgt atcggtcagg gagccagaag 276001 cgagatgcgt gcgcacgggt gagaaatcaa ccgtatttca gcctagtgga aagaagacac 276061 tcagtgctag caggagtact ttctcggata cgatcccttt ttataggttt gctgccctgc 276121 gacacgtggc agaagaggac tcaaggacac atgccagatc tttgggtcgg tatttcttcc 276181 ttaagatctc aaccaaccac cctttggtgg ttttccacga ccgaaccata ggtcatgagg 276241 cattcaaaat ttgacatgca tcgaggtcaa tctttgcacc ttagatgtga tgagactatt 276301 tattttcaaa gcgacccgtt gaacccaaag cccagagcga acgaccttga caaaggtccc 276361 ataaagaccc ttttttgggg gtcttgtttc aaggtcttat tccttccata ggaggacacc 276421 ttggttccaa gatctctatg tctttaatgc cccttgggac gttgaaagga gtgagtactt 276481 ggaggacttt ggggtcggcc ggcctctgta ctaaatcata tttcggatta cggggaactc 276541 cctatctcct acgcctctca gtcaaaagag tgcgggttcc catcccgaat ctccttagct 276601 tagttgaaag tgaaaaacct ctctctatcg cccttccttc ttttagcagt tcatctcgag 276661 aatcttgaat gtttagcgaa tcgatcgtaa ttggtctgac cgctcaaggc ttatctcttc 276721 tattcccctt tctggtctcc ggtcattggt agaaagcctg taaatggaat ttatctcctc 276781 tcctttcagt cgagttccct 
cttttataag agtaataaat tacgttggac ctctccgatg 276841 gccattgctg gggcattcta tcaagttaga gctagggctt tctatcaagt tttcctgatc 276901 tctgagtccc taaagatgct acccataaag tactagcttt tgactgtaga catatcacag 276961 taagtcgata gtgtccttaa cagtaaactg ctgggtgccc cccaaaaaag gggcagaagc 277021 tgtgaacgat ggatactatt ggcgaactgg aacatatgtc ggctacagag aatatttcat 277081 tgtaatcgat gcgagaatat ccctttccca ccaatcttgc cttaaaccgt ggagaatatg 277141 ggagcaagcc tacatcgagc aagttgacat tgcgaaattg tgtgaaataa atcaaagatg 277201 gatctttttc acgcccgatt cgagatgctt ccttaagcgc cttagattct cacgaggagt 277261 attggtcatt gtaagtgatc catctagtgc atgaagggaa aggcttgatc tgctcattcc 277321 ccctctttct ttgaagcaga aaggcaaaga gctgcttctt caggacagga agagaccccc 277381 aatcctatga tacgatgaac gggaaagaaa ccgatcctcc tttatttgag tccctcgtgg 277441 aagaagaatt agctgatgta gaaggtttct ttcttatcct atccaccaca tcttttacat 277501 actcagacag actccagcgc ctatttcaag ctaaagagat cacaatagtc aattcgctac 277561 gcccctactc tttagactta gactcaccaa cttaaccaat ggcttagagg gcgaaggaca 277621 agaaagtcaa ttggcagaaa agattcgcta cagatcgatg atcaaccctc atttctcatt 277681 agactgcaac ccccactgaa acgaaaggag ctgtggttag cctgaacttc attctcaccc 277741 gttcccttca gattattagg ggaaagcaaa agtcgatccg ctcaacgcct tgctctcgag 277801 actctttcgc taacttctcc aatcaagttg agtgcttgga tctttcttca aaggaattct 277861 tccggagagt cgctccaact tagaatgggc ccaacgctcc tcaaccatag gagccgtgag 277921 agccctctaa taagagaagg agttgtacgc aacgcaggaa agcaagaaaa gaagaaaaag 277981 caattcatgg gggaagtcac agcatcaaaa ccaggagttc aagtagaaag gggtcggacg 278041 ggatcgcatg agtacgagag aaagcaattc atgggggaaa gaaaggacga tgccattcaa 278101 ttatgggact gattctttga cttgctttag ctttccattc tttggataat ggtctagccg 278161 agtaacttga tcttccttta ccatctatat ctgaatggac tttaccatcc atatctgaat 278221 tgaaaagaag attgaagtaa ggagcaggtt caagtcatag ttctcaccaa ctatagaaga 278281 atagggatcg actcgtgcta aagaagattg atagccaact aacatcttat ctgttcggaa 278341 cgaactccta tgtcttattg gtatgtgatt gaaatctatc ggatcctcct atgtcttatt 278401 ggtatgtgat acaagcatat taaaggaaag aatgcggaag 
gagattggaa aagctttcca 278461 taggcttggc cctagaggtt gtcattgagt tgaactagct cgccttcttt cctcgatcag 278521 aatacgaata agatctttac ttttgggata agtgaaatcg tatgtatcca tccatggtgt 278581 atctggtgct ctcgtatata agagaagggc agcatttatg agtaatcgat ctcacaaact 278641 atcaatttca taagagaaga cgaagacgga tcaaattgaa taatcgaaga gagatgggac 278701 cctagctacg agtcattccc tctgacgtcg aatgatctac ttgcttgtac ttctctttgt 278761 cgagattcag ttggtcttca gtctaccact ccgtgggtat aagatcgcaa agaatgcatt 278821 ccaagtgaga tgtccaagat caaaggaacg agggtaagaa tcgacgagga atcaataaga 278881 tataagataa gtgaatgaca aagcgtgagt ataattctca acccgagatg ttagaaggtg 278941 caaaatcaat aggtgccgga gctgctacaa ttgcttcagc gggagctgct atcggtattg 279001 gaaacgtatt cagttctttg attcattctg tggcgcgaaa tccatcattg gctaaacaat 279061 catttggtta tgccattttg ggctttgctc taaccgaagc tattgcattg tttgccccaa 279121 tgatggcctt tttgatctta ttcgtattct gatcgaagaa agaaggtttc cattcagtct 279181 cataaagcaa gcacctcttt cacataagaa agtggaggca ggcttgggga tacgatctaa 279241 aatgattcca aggaccgggc aatcgccctc ttttttaatg aagaagcggg ctagtccccg 279301 aaaatgcccg ttaatcaagc aagttgggga acaaaatctt ccttgttagt tactcatttc 279361 ttcggtcgag cgttctccgg acgtcgagaa atctatcact caatcactgg ccgctctgta 279421 attgtctgat tttaggtttt tgatcacact cgaaattaaa ttatgtatct acttatcgta 279481 tttttgtcca tgctcagtag ttccgtagca ggttttttcg gacgttttct aggatcagaa 279541 agcgtttccc gtttcaatct tataatcttc ttgattctat tggttttttc aatttgccta 279601 tttagatcct taaagcagta tttaggaaag aggatgacac aatggtgcta tctagccctt 279661 gtttgtcaaa tctccctctt tctcgttctt ctacgtagcc atatcttggc gggttttggt 279721 acattctccg cggatgtatt tactgtcttt atgggcacat tttcagttac cggttcatcg 279781 ggggggatag tgaatcacca agacggggcc tcgtctgagt ggttcacgta tacatccgat 279841 atggtcgaag attcggccag ttccgggcgt acctcctcgt cggtcaatca accgattcct 279901 gaggagcagg cttgggaaag ggaggctcgg gcacaagagc acgaccgcat ctctgccgag 279961 gtagagacta tcacgagcgc ctgcgagaat ttggaggcgg ccatggtacg gaaagcccaa 280021 attctcttgc atcaacgtgg agtaactctc ggggatccag aggatgtcaa gcgtgctctc 280081 
cagttggctc tacatgacga ctgggagcac gctatagatg accgtaagag gcatttcact 280141 gtgctcaggc gcaacttcgg aacagctcgc tgtgaaaggt ggaatccgtt cattgatgag 280201 ctcaggggct tggggaacca tcaggtgaat gcccggcatt acgtcgactg aggcataccg 280261 tgctgggatc ttcgactggg gtgaagtcgt aacaaggtag ccgtagggga acctgcggct 280321 ggattgaatc cttctataga aaagttgtta agcaaagact ggagaggccc ccggtcgaga 280381 tgtagtaagt aggtctccaa tactgggaga ctgaggagaa tgggagtttg tgggttgagg 280441 gtggcatgct ccccagggcc ctcttttttt tagataggta gggtaatgca tttcagtatg 280501 aatttgatct agcagtgtaa acctgagatt ttcaagagtt ggcttccgct tattttgatt 280561 attaagcgta acgagactaa aagagtcgtg aagcgagtcg gaaagaaaga ggcgaatcta 280621 caagagtctg attttttata tgttaacggg cggagattaa gaggtgggca agttggtagg 280681 ctccggagaa tagaatgcaa tggaggacct aacgcctagt aagacgggga aggagtgtta 280741 tgaaagggag gaagaggaaa agctactctt gagcctcctt gctcaaacct gccttcttcg 280801 ggaactaatg gatgtctatt ccgttctcca gctccatcag tccactagcc agtaccactt 280861 tcaagtaagt cttcagtact agtgtcagcg ggcaggtttg aagatgtctt tcctggtcta 280921 tctccagact tggccagaga gtaaaatatg cgttttcgca cagtaaggca ctttagactt 280981 tgatttccgc taaatcttca agataaaatg tctttgcctc agcttcacat atggcttacc 281041 atcctcttct agtaaggaag tccttcccgc ccttcctaac cttacccgta accacgactc 281101 cctttgatta tcataagtta atattcaaaa tgtgggtctt tctcgtcagg tctatgttct 281161 tgctttcaat tcaatctttc caatcgggtc tagaatgaca atttatccca gacaaacaag 281221 tctcctctag cttaaggagt agtccatagc taataatcta tggtactagc ttattccagt 281281 cattgagcac aacttccttt tctgtttaca gagaatagga taaacctcta ttccatccgg 281341 gcaaataaga tgctcgctca ctccactacg ccacttgctg tgcagtctcc aatacgccac 281401 ttgcagttcc acagagtgtt tttctaatct tcttgagtat aaagaggaat ctaagggaag 281461 acacttcatt gcgggaagca ccacacgcct gcttcctcct ctaagcaagc aatagccaga 281521 gctctgaagc tgaagtgagc caagctcaac aatcttgaat ggcttactca ctcaagcagt 281581 aaggcgactc cgcgggctca atcagaggtt cctatgcaaa gcccagctat gaagcgaata 281641 aagagcagtt cgcactctcg gtgggatggg tcgggttcgg tcaatgaaat gccttttcca 281701 agcacgatca ggcttcaggg 
tagcttctgg gagtgctcta cgagaaggca tatgtgttac 281761 atcttgagat atttgtttag ggcaaacggt caccgccatt tctcgtatga aaggctggac 281821 tgtcgcaatc aaactctccg cctgccggat catctctatc aaccgtctcg gccgcacctt 281881 cttccgcatc tgtctcaact gctccttgtg cgggactctg gatacctata aagctcctct 281941 gggcaatcgt attggttcca atcagcattc ctaacctccg tcttctcctg agcgtctgga 282001 ctgcactgag tgcctggact gaaaggattc tctttcgcca tttagaactt ccccgtggta 282061 gcatagcata gtagtgcaag tgaaagtaaa taatcaaaga cgttgaaaaa aaaaccttgt 282121 gatcgatgtg ggaccttaga taacaaagta gcgatagggc gcctaattct ggtcatttag 282181 agcatcttct cttgattcac tcagaaatta ccctttctct gaaaagcttc aactcaatca 282241 cttggacaga gggatttggc ctaccaactg caggctctgg tgggggaacg gtatccattg 282301 cctgctgccc catcccaata aaacgtgttt ctcaaccacc cctgtcgact tcgcacaagc 282361 caaccttgca ctaaccctaa aagaaaaatc cagtataaaa tcccgagcgg ggaaccacca 282421 agtgcagcta aagcagggcc gctaggtccg aagagtaact gcccttctga cttccctgtc 282481 cgggatctct tcactgaatg tggaggcgcg tagcgctgcg aaaaagaaag ggttgccttc 282541 catgccaagc cagcacatgg accggattgt tactcgctac gcgcctctgc atggtgagat 282601 gggaacaagt tgtcatgatg tatgcaggta agtgagcgat gaacgcctgt ctctgttgct 282661 gcatgtgctg tttgtttgtt tccgaggtgg tgtggtctgc ccccgaggaa tgtctaggag 282721 aagggttttc tgctgtgatt gcaccaagtg cagctgcggg ttagtatctc ttcctaggtg 282781 ggaaggggca cgacgaggaa cgaagtcaaa ccataccacg tactcacacc attgttcata 282841 caatgtctat aagcggtgtt ttctgcgata tgtgcttctg gctctccaca aaggaacatc 282901 tccaagggag ggggcgtgag ctgcttaagc ttcatcccat cccgaatcct tgcaattaac 282961 acaccacgaa gatagggact gagaggtaat cctcttccaa tccaccactc gagtggtaac 283021 cgatcaggac cgactaaaag cttactagat aacgcctgaa tccgactaaa acgccgggat 283081 agggccttag tgtagtgtcc atttgtccta gcacgcggta tccagcttta tttaacctta 283141 aacaagtctt aggacttaac ttgtacttat aggccaactg ctgagtacca acgaggaagt 283201 agctctcaag cacagctttc gcagatactg gggataagtc cttcgacatt atcttaaccc 283261 agaaccgctt ggcgaactca agacaacccg tctcagatac aatagacttg gcatcagata 283321 tatctacttg caaagcatca agtagagaac ggtactcctt 
agccacacta cgatcggcaa 283381 tgacgatgtc atcacctaac aaagcgtagt cgagaaatgg tctcgtctga tgaggatatg 283441 ctcgtaatgc tgccaaccac acaatagcat ggtgggacaa cgcgaataga gcccaggagc 283501 cgtaataacc caaaggttga cctgcaacaa acacaacttc gtcatgccgt ccagtaacgg 283561 acttcaaaga gcatgagttg agtgctaagg caccattgac aatgcacgat gccatcgttt 283621 gaccgaaaag gcacgccata agctcatata tgactggaac gggccaccga tcggtggcag 283681 cgctcaagtc aaaacttgct ataaatcttg ggcgtctctt tgccagacga tggataggac 283741 cttcttgatg gaaggtaccg tcttgtggaa tacgcctcag aacagacata ccccacacat 283801 gcacggggta taacagacgt tgtttaaacc agttaccgat tacaaacaaa cgccgctttc 283861 cggccaccct ctaaggactg agccaaacgt ccggaataaa taggttgaga agaacggtcg 283921 aagcccatgg cgagggaatc aaacaccaag ccaggccaag cctcataaaa cctcaaccct 283981 tcattagctg cccaggtact aaattgcgta tcaaacgggt acaaagtata cccaggttga 284041 aagagaatac ctggactgaa aataccatcc ggttctgaat gaatgattcg cagctggcgg 284101 gcaaaagccg cgatttccaa gaacatcgca gagaagaaac aagtctcctt ccgttcttta 284161 ggagcaggct gctcatctac ctgtaacctt tttagaacac atgtgcaact gaaggattcg 284221 gtttcggcgg acgcagtttt ttgagcttac gctcttcacg aagttgcctc cgtgcttcac 284281 ggcgcgcttc ctcctcttca gggctccgtt ggatcttatg ttgaacaagg ttacttaaat 284341 cattaggggt ggacttccag gacgggacaa actgaaatcc cttttctaag gggattgagt 284401 tcacccaggg gagatagcgt ctgaacaaat tttctacatt ttgtttaagt tcagaacaca 284461 cctctttaac ccgacccata tctttaatgg gagtcacaat ggtacggaat gtgctagcct 284521 ttataggctt cgccaattta ataatccgac acactgagaa ccaagaaaga tacaatcgga 284581 ccaactgatc tgcccgatca tcccgcactt tgatcaattg gcggtgatgg gtagggattc 284641 tagttggtat cccaagcaag ccaggaaggc agctgcgcgc tacggtcctt tgctgtgtta 284701 agcatgtgtt gctgagctaa ccatgtcaac agtatggaac cccgctcgta gagaccttaa 284761 aaagaataca agcaagtgta gctcactctc ccgaagggag cttagcaaca gtagttagtt 284821 agaaacagca gccgcggatc agtcggagaa gtcgtttcta ccatttcaat cggacaataa 284881 attccgagac ctttcaaaca gcgtattctc ggcacgaacc aggcccgctg ggagagtcaa 284941 ctttctgatt tccgaattga ttggaaaaaa ctcctttttg ggccagagac aggtacggaa 285001 
aattctcagg ggccgaagcg ccaagtagga gaggtcagag tcaatatctc gtccgatact 285061 cgtcctatcc tttctttaaa gtggaaacgg cgttaccgtc attgaaaaat aagcgaacag 285121 ggaacggcgg gagaaagact cggcattcag gccatgctag tacagtttgt tttcttagct 285181 acacttgctt ccatcaagct tccatcaaga gctcccaagg ccacctgatt cggagtagcc 285241 cgttcgtggc agagatttta attccagttg gacttggcca aatgccaatc ctcctagttg 285301 tggaagtggt tggaacggtt tcctattcct atcgagcagc tgagccagtg aaaagcagga 285361 cagggtcaca atcttccaaa ttttaaaagc aaataaaaaa aagtcggagc tggtaacgcc 285421 ttggtgagca aatcggcagg ttgttcttta gtagagatgt gttcggttgt aatgagtttg 285481 tcttgaaccg catcacgcac ctgatgacaa tcagattcaa tgtgcttcgt gcgctccatg 285541 aaacacagga ttggcagcaa tattcatagc agacttgcta tcacaaaatc acttcattgg 285601 ttcgtcatgt ttgataccaa acgagagaag taactccttg aaccatttaa gctcgcacaa 285661 agcaaacgcc atagatctat actcggcttc ggcggaagac cttgacactg ttggttgttt 285721 cctagttttc caagagatcg gagaactacc aagatagaac cgttcaagca ggacttgctt 285781 gcttcgcctg ttgattacct ctctcctttc agtcgagtta tttcaacctc tcctgaagcg 285841 tcttattcag atatagatat tacagattcg catacgaaag attggaatga ccggataact 285901 gcccaaagca ctactaaatg ggtgtcaaag aggaatgaaa tgggatgact gattgatgga 285961 tacgtggtaa ttttccactc ctgaacgact tggtactccc tctttctcta tggatcgagc 286021 atatgaggtg ccactctacc gcaccaatgc aggttatgaa attatagatg ctccccccgg 286081 tgtcttgcca gaagctcgtt ttctaaccaa aaaatggaag attttaaaac cccatccccg 286141 ggaaaaggac ctaaactcgt gatagaagcc gtctttctct agtaggaaat caaagacatc 286201 gggaaggctc gcgtattcct attcttgtta tgggagactg tccataccct atctcgactc 286261 cgttcacaag accataaagt aggggaaggc gggtttgatc ctatcatttg ttattttagt 286321 ttgaaatggc aacactaaag aagtgtggaa acagagattt agaaagcaga gttagagacc 286381 ccttacccag ggatgtgatt gggaaaagag ccattggaag gtgactaaaa gaccagaaac 286441 aggggctacc cgagctaatg atagaggcaa gaacactttc cggccaggcc tgactataac 286501 caaccttaca gatcccactc aaccttttac accgatgtat cgtactctct cgaccaggtc 286561 atcataagaa aatactgcta aatctttcca aagttataga aggagggaag gcgcgctaca 286621 actaaataac agccagctga 
aaaatgctag ctagttaggc tagcgcgcaa tggctttctc 286681 tgctctgata ggggcttgct tcttcaagct ctgcccaacc tatacaaggt gctgctcgct 286741 ttctctcagc cactagtggc ttatgtggtg gtcgacattc ctacgcgctc ctccttgccc 286801 cctacttcag aatccaaagg taaactttgg atgttgtcgt gacgctttgg ttacgaaggt 286861 taccggggtc taggtttatg gatggagtca gtcagcgaaa gtcctcaatc aatatcaaca 286921 agatgtcgtg accgcttaga cttggtggat tttgtcagaa aataaagttg gtttcggggg 286981 ttcattgatt agtacacctc tggtgctggt aaggtcggaa ccgtggtcgt ccgtctccgg 287041 tttgccggcc gataagatct ctgagattga tcagccagct gggttggttt ggctattcct 287101 ttctattgaa aaggagtcag ctgtctgcgc gagtcacctt cttttaggct ccgctgaacg 287161 acacggcatt ctcgttcaaa tataggccgg ccgtacttgt gatggttccc aaggatccaa 287221 tatgaccact taggtctacg ttgccgcgat atttttctat ggaagcgggc gggctgttag 287281 aagttcggcc cggttttttt ggtgtcgtag gggagggatt gacctttcta acgcatattc 287341 agtcgtaccg gcatggcccc actgatttgt tttcgatcgg agcatcaagc agcgcaaccc 287401 ggttctactt gactaagccc cgtgctcgtc caaggaggga gtttgcttta cccaacaact 287461 tcctttttga taacaaggca gaaacctctg acccgacatt caaaagtttc gaatgaagaa 287521 tgaagagtgc cctgccccag caagcaccta ctcaattcta aataaaaaag cgtgacttta 287581 ctaaacaagc aagaaaagcc ctttcgcact tcttagtaaa gcctaagcgc ccttgttgct 287641 aaaggtaagg ccggctttct tcgcatgctt gagcgcatta cgcattaata taatacatat 287701 atatcaagct ttccttgagt tttcaataag gggcatctat agtaaggggc cttttcaata 287761 agactcgcgc taggcgatca cgttttttgt cttcccaaaa atcgaatatt tttgagttgg 287821 taaagaccca cccctagttt cagtcagaat gagtccccgg gaccccggga cattggcttc 287881 cgccaacagt ggactattaa ggatcgcatc ccgcgcatat tctacattat agcctgcaac 287941 tgattcagct tccgcttctg ggagatcaaa cggagctcga ttagtttctg ctagacgaga 288001 aataaggaac ataaccaata cagggaacag gggaatacca gaccatatct gcttttgcgc 288061 catgacaatc tcactcgaat tacggggacc tacacatatt agtacagtat accgggggtg 288121 tccccggcca gaaccacacg tgcaagtttc cctgcaatgt ggctcgtccg tgctttccga 288181 ggcgctgcct gtgactcagc atgagagaag ggggcggaac tgcacactct cgtgttcaga 288241 gcattgtccg agtgagtagg cagcgcctac caagcaaagc 
tttcttgaag aggctgggct 288301 gcttcctttt cggcgcccgc ctttattcat ctatactaaa gccacacacg acccaatagg 288361 aacgatttca gcgcccctct ctagataaga agcaaaacgc gccatcacct acagcccttt 288421 cctctgccgg ggacttccat gaaatgaaaa cgggcggcat cgttgtatgc tcgacatttg 288481 ttgccctggt ttatatcccg gagtactcct atggccgatc tgtcacccaa cctacttaaa 288541 caacctaaag gcttggccct gtcccgcttt tagaattacc cttatggcac agcttagtca 288601 gttattctcg cagttcagat aagattcgga ccgccacttc tctgaaagca tggtgcttgc 288661 ctcctccctc cttggagctc gtccattcgt tgggtgatcc cttgctctca cgttcgagtg 288721 ttttctcgtc gtttaggccg gtgagtgaga tttctgctca ttgcagtcac ctccggggtt 288781 cttcgcacct ggatagtacc aggacccgtg tgccccggcc cttgatcaga taaagctgcc 288841 cgcccgaagc ccgacctatg acccacaact caaaatgagc cttgcgacga gacgcggaca 288901 ttacccatgc tgcggttccc tccggtccgt tcccgggttg ggttagggga acatccgagc 288961 gatgattgcc ttcgcggata caaacgcgct cacaaggcgc acaataagaa taagaccaat 289021 agatacttca taagagacca tttgagctgc agatcgtaat gctcctagaa aggcatattt 289081 cgaatataga ggagttcaag ttaccaacaa tcaacgagtt gatcataccc ttctatgaac 289141 cgtacgagca cgtcctctct cacaccccac cgcgcttacg gctctatacc ccaaccctac 289201 ttgctctgga cccccccccg gtcctccctt tctattcctc ggtaagcggg ccgtgggagc 289261 atggcgggag acaaagtccc cttttgactg atagaaagca tctctctttt gtctctcctg 289321 accgaacccg ccaaaaaaat cacaatagaa taagttgttc ttgccgggag agtagcgtct 289381 gagacatctt tatctgagga ggaggcttcg tcgtccggct catcctcgga atccgaaaat 289441 aaaactgaga cagaacggat tgtttcagtg acatatctaa tcagactgaa taggatttga 289501 agagctgtca ccatcattca attcacatca atttaagcag gcaggaaggg cgcggaagtt 289561 accgtttata gaatgcaagc aaggtagctt gcctgccaag ccgataggcg aaagggcgaa 289621 ctgatcgact tgcatgtgtt aagcatatag ctagcgttcc ttagtctcaa tcacagacct 289681 ttttccattt taggtgaacg agggttaccg gctctacgac cttgcaaggc actacagtag 289741 agctcttttg gcctgccact ttctcaatcg aaaatgtcaa ctgtgaagat tctcgcgttt 289801 ctttatgtaa tttcaggatt tcttttcgtt gatcggatcg agcacatagg aatatttccc 289861 taagtcaatt cattctcttt ggctgaagtc aatcttcatc aagacagctg accgagtcga 289921 
aacctactgc tcaagtctga tgaatgcccc gttcatgagc tggtaaagtc gagatcaatc 289981 aactgggatc ggagactgga cgtctatttc actatcccgc ttttgtctaa gaaagtaaga 290041 tacctatgcg ggatcattat gaacaggtta cgaggcctga agcggaagcg ctattgaact 290101 gtcgggggga cttggatcaa tcaaagacac tagtcggtaa acccgatctc ttacaggcac 290161 tggaattgga cgatatagtg gcggctatgc atcccgagga aaaatttggt actggcatcc 290221 ttgcttctgt ccccgttgcc tctgatagct atgcccccgc ctctcttttt gtctctgtga 290281 ctcgagaagc gagtctcgat ctcaaaaggg tacttgtatt aaaaaaatcg agatctttca 290341 ggtgctccca gatctggagc aggcgatggc tcttgaaatg ggctagggat gcctagaaaa 290401 ggtgaataat taggctattc tgctgctggt catggaaaaa aaatggatga aactgaaggg 290461 tctgcttcga cgctcccgac cttcaactcc gaattatgct atgcctttcc tttcggggag 290521 cttccgccgg ctcagatgct ttctttgcca cggatgcctt cggtttgagg tcagggactt 290581 tttttttccg gctggtggct gtgatgctag gctaaatgct gttagtgaga ctcggtaaac 290641 ccggtccatt gtaggtgcta cttcttgctc ttttgccccc atttctctgt gattggcttt 290701 agaaccggga aaaaagagtc aacaacaatc aatcagttat gcctctcctc ttcctgcctc 290761 tctagctggc ttgatcttag actccgctac cagagaagag cccttgtgtc cctagctaat 290821 ggatgcctct tccattagtg ctccttcctc tgatgtctcc gctcggtcaa ccgtctctgg 290881 aatttcttac tttttcggtt ctaaagccac tccgcccgcc atccagtgga tcttcacctg 290941 gctaggattc tgactctcga acgggggaag gcaacctatt agattgttga cccgacctca 291001 gacttcgttt taggctgaag taattggagg cccattaccc gatatgggta tggaattcgt 291061 ttcacctacc tggtagatat gtgtatttga cctctaacca tacctttgct atctatgctc 291121 caaccatgat aatgatagcg attggacggt agcaattggt gcttatttcg gggattcaac 291181 cgacctggat aaaccacttc tgcctttggc tttgcctcat acccggtcta actcaaactt 291241 tgactttgct atgcttgctt tgttagtata gcaaagtttc taaagctgtt agcgcgtagt 291301 ctgtctgctc taaaggaagt cgatttattg attcgtcctt ctcggatcgt cgtcggtcct 291361 tttttattgg gctctcatgc ccggctagac gataagatca aagtaacccg ctttaattta 291421 accaacctac tctgtaatac tgttttcacc accggtttgc catgattcat aagcaggaat 291481 cgtagatctt atctaggatg ttgccgtcat cttataagat gaagaagtca agcacatttc 291541 tcttcttcct aagaagtaag 
gctgacgggc taataagtcc ccggagcttc aaaacaggcc 291601 ttttcatgct ccgcagaggc caatgaactt ccgaaagctt cttctgattg agtgaacata 291661 ccgagtactt aaaagtaggg cattggatgg gacaagccgc ttcctttact ttaagccgag 291721 aggggtcgtc gctaatcctt taggccgctt ctttccactt tctacctctt agtcgactaa 291781 ctcaagttcc taaacgagac agcagctacc ccgcttgcct gtaaaggaag gagttcttgt 291841 tccgttgacg tcgtctaact aatacatagt tctgaaatag ggattaatat ggattgaatc 291901 atcataatga tgacagctgc taccttccca ttagaaacca tctatctttc ttcttttcct 291961 tcgtacgaat ctttgctaac ttgactgtgt tcgacggctg atgaccccaa tgtgtgaaat 292021 atccacctcc atggggatgc gcgaccggga accatgttgt caggaactga gcactttctt 292081 gaaagcattg atttgtggta agactagaag aggagattgc tctttcgaag ctagtcaagc 292141 gattaaacaa agtaataaac ctttcgagtt cttgatttac ggctagaaag aaggactaga 292201 gcttcatacc ttccggctgt tctttcaaga tggaaaataa ttgatctgta aaggtcagcg 292261 gtcgtaatct gttcgaccag accacgtctt gcaactcttc tggcgtcagg gttgcgggaa 292321 cctgtgcttc ttggttgaaa ttgaagaaca actccttatg ctttggagac gttctcagca 292381 actcatagat agatatgtta cctacgcctc cgctaaagaa aggaagaacc gggtagctat 292441 acgttccaaa cagctaagat attagttatg ctttagcaac agctccgttc agggagggcg 292501 gggatgaaca caccaaacgg caagctacaa agaagggtga aacaagctac aaaacagatc 292561 tgactcctac ctaaggggca tagaacaaaa gagagtcaag tcaaagataa gagtttagtt 292621 atgctgggca gttgtcttgg aggagcagcc gccacagatg aaggagttgc ttgaggaaga 292681 tgcttattta tttccataag acctacctca tggcttgcca ctcttacgac caacatggga 292741 ttgatttcat acctggagca acttgaccta atctgtctcc ttctaattag aaaaggagtc 292801 ccgctaagca cgaggaactg agaaggcagg tgcaagggct ttgatagaga gggaaagcag 292861 actcgacaac cacagcccag ttaactattt gaatcaggag cttccgatct catcccttca 292921 atcaccgtta ctcgatccga acttgaacca ttctcttgct acttttcttt cttcaaaaaa 292981 aagattggaa gtggaaccga tttcccggaa gaggtcttcg gcttgctctg atatgataag 293041 aaagttcgct ccctctccct gtcaacatac atgggattga gcagccccct atctcagagg 293101 gtttacccca tggccagttt gcgatgaaag aaacttcaga atgcagccca catagtccta 293161 tcaacataca aggcttttca atccagtgaa agacaggtcg 
aaaacttcca tatctgatag 293221 gctgcaaagg aagatttttg atgtctggtt ctatctgtct gtgcttggtc tggaccctta 293281 gttcttgcag gtgtaagcgt gctccccatg tagttttctt cgttcccgtg taatcaggtg 293341 cgagtgtaac gagtttgttt gaaataagtc caatgagatg atacgaaaac aaagagttaa 293401 aagctggctt agaagaaagt atagtgggca tggtttagaa ggtaggttac ctgctcgtaa 293461 gaggcagtct aacttagtta gaggaatgct catcagaagg ataagcaagg tttgatgaat 293521 tctcctagaa ggaagatcaa aagtcaaggt cttcaatagt agtatagtag ggaacatatc 293581 acaaccagca agcgtattca ctattcacgt caacatttta gcaatcgaag tcggagtgag 293641 ccgattcaat gtctacaggg cgtctgtgta acacatatca aagcgtctgt tgccaagtct 293701 aatatctgat atctgtgtta gctgcctgcc tgtctctccc tgcccacgac atgggttgga 293761 cttagagtct ctttcttgtc tgtagttggt gtattcatat ctctcctttg aatggcatag 293821 atcttcgtag taccagtcag ctaacaggat cggcgtatgg gcaagcaaag aatctaaaat 293881 catcctttcc tgtcagtagc cgggtaggta aagaggtctt ctgcactacc actagtagta 293941 aacttcagag accactactg ggcaagtaaa aagaattaat tatgccgacg atctgatctg 294001 agttggtcta gtagcgccct ggtagccatt ctggtatcga gaggttcttg ggaggaagat 294061 tgtctagact tggagtgtgc taccactaag gttaaccaga taagagagtc ctcacctaac 294121 aagcaagtgc taatagctaa cagtaatagg aattcaagta agtaagactc cactattaac 294181 tggaaacaag gaatgaccag cttcagaggg atccagcaca gattactatt caaccagttg 294241 gataacagat aacaagggaa tagcccggtc aggcaacttc tccctctgtg cagcatttgc 294301 ttcgacagat tgagggggtc gtagatcagg caacagcaga taaatctatt gaatgtccat 294361 cagccgtaaa agaagggcag ggcaccccag gggctcatct tctaactata gggaaaagaa 294421 gaacaagaaa gactgtaaga cagaaacgga aagagtatgc gcggactagg ttgagaatct 294481 ctgcgaggca gcacaccggt aaagatagtt taagaattta gctcagtgtg ctgcatcaat 294541 gatcttctta ttggaaagct gaagatcaag gcaccatgtt cgggttggtc aactcctcca 294601 agtagcccta acgaagtaac ccttcagggc caacggagta gccctgggcg tagcaacccc 294661 ccgtgggcct gagggttgag ggtatctgtc atacgtggaa gaagaagaac aataagactc 294721 ggaaaccgta gtgaccttcg gaactgtgag tgaccttcgt tcgagatgca gtatgaactc 294781 ggaagacttg cttcgctagc accgtatggg ataaatcttc atatggagga gaaggaagac 294841 
tgaggaactg cacatgcggc gtccttgcga agtaaccccc aacggagtag tcctcactgg 294901 gattgagggt tcggcttgtt gcgtgcttcg ccatgagtat gaagagttgc ttcgcttttt 294961 aaaaccgtat gtgaacccca agggtgtgct agatgatgag gggtgagggg atggggttcc 295021 tgcactctat ggaaaatgta cttttgtaat agtcaaacat ttcaattcta aaaatttgat 295081 agagttggtg attatgagtg gagggaagta cggctaggac ctttggccaa gtgaaggatc 295141 aaccaaagtc gacgagttgg tttatgatta ttgggttttc gaagagaggg tttttgtaat 295201 catgagttgg tggtcgatcg tcagtagatt ctgagttgag tatggtttgg atacaagata 295261 aggatcaagc aataaggcga cattaaactg tcatgtaatc atcatcgatt tcacgagtaa 295321 ggtatagaaa agcaaaatta tgagtcgagt acagcttgga ccttcaacac tagtcgacat 295381 tcaactggca tttaatcttg gacgaattca attccaacta attcggtaga atctagaata 295441 gtgagatgga agacccctct ccttatcagt ttcctatgga ccttcgacta gatcacacga 295501 gctactccca ccttgttcaa ctgtgtgatt tgtcgttttc cgagattcgg tttcttcttt 295561 tttcttgatc cgaggtaagg tgcttttttt gatcgatcaa caggtaatat agaaggatga 295621 ggctgtgatt tacctagtga atcaacaaga tcgaaacaca ccaatcgatc aaatcatttt 295681 tgaaaaactt gtacatcgtc tgatgtagcc ttttgcttcg cgatattcaa gtttgggttt 295741 ttgcttcgcg atattcaact atgcctatat gagagattga gttcacgaca atcaattcca 295801 agtaatcatg tcagtagaat tgagaaagat gagttgagta tggatctccc ctcggacaag 295861 taccaagcaa gatccactgt catgtaatga tttcactaat atagaagaaa ccaaattatg 295921 tcaatagaat cgagaaagat gagtggagta tcgagttgag cttggatcac tcctctcctt 295981 atcagtctcc ttcctctagc gcagcaaagc agcccgaagc actttctttc tttctctagc 296041 gcagcagata ggtcgcccat accctttgcc cattggtgtg ctgcttcatt cctagcgaag 296101 caagtgtggt tctctttggt tccgtagtcc ttttactctg aatgatatta ggttggattt 296161 tacttaaagg ctgttccgaa aaaagagaga cttctaggct cgcaataaag ctaaggtcct 296221 gatcgagcaa ctagtcgtcc tatctatcca cctctccata ataataggat ccaccaattc 296281 ttctgcgaga gctactgtac ccgaacagta gaaggggtgg ttaattatgg tgtctactga 296341 tatattagga aacatgtcca gaactggcac tcctggcaga acgtcgttat ctgcagaatt 296401 cgtggaactg gcaaaaagta gattcagatt tctctgtcgt cgcatagctt atccccattt 296461 ttttttcagc aactgaacgg 
gaagtggctt aggaaaggac taatcggatg cgctcgccca 296521 gcgagaaagg ccctaaccct tgccgaccaa gaaaaaaaga gaagataacg aaagggagac 296581 aaagttccta acaaaatcat aacacaaggt acccattcca tctatcatat cagtcgagtc 296641 gatccgattc gttcttatca ggcggcaaga gagaacttat gacttcgcgg atggagaggg 296701 attcgaaccc ccggtattcc tagaaatact tcggttttca agaccgactc tttcaaccgc 296761 tcagacatcc atccctcgct tttgttcaat tataaatata aataaaagaa agaagaactt 296821 taatggagat ttatagcatc attcaagtaa atacagatta agatcgtaaa aacataagct 296881 tgtaatatag ctacacctaa ttccagaccg gttaatgcaa gaactataaa taaaggacca 296941 agagccccta taaaatagaa aatatcattc atacatagca tagtccaagc gaacccactt 297001 aaaatcttta ctaaactatg accggccatc atattagcaa ataaacgtat tcctaagctt 297061 aatgcgcgaa aacaataaga aattagctca aggagtacta aaaaaggtgc taacggcagt 297121 gggactcctg cgggtaataa aaagctgaaa aaatgaagcc catgtctttg aaatcccact 297181 atagtaatgc caataaaaat cgaaaatgag agagccaaag taatgagaaa atgacttgtc 297241 actgtgaagc tataaggtat cataccctga agattacaaa ataacaaaaa aagaaaagtg 297301 accaagatgc aagggaaaaa catttgtttc acattcccgg aaagaccacc tatttgttcc 297361 tttaccaggt tcagcacgaa atcataaaga agctctacca aggattgcca agcatttggg 297421 actaagtttc ctcctccctt tttagtaaca aaatgaatca gaagtaggaa aaaactcaga 297481 gttagcagca tgaacaaaga tggatttgtg aatgagaaat agaagtttcc gatatgcata 297541 ggaatcaatg ggacaatctc aaattggtcc agtggggagg ggatttccat aagtttcata 297601 tagttagaaa aacaggcgtc gtgatatttt tcaataaaat tgacagggcc aggaacgccc 297661 aaattctgca tataccagtt attaacgtac gtagataaat taagcgattc cctaaaaaaa 297721 ataaggtcca gtttgctccc gaggagttcc gcgggagtat gatccaattc tcgctggagg 297781 ttcctaatat atccggcctg aatcactccc tcgtaaaaat cccagcgcat attaactaag 297841 aagtgattca accgggttaa cctctccata ataataggat ccaccaattc ttctgcgaga 297901 gctactgtac ccgaacacta gaaggggtgg ttaattatgg tgtctactta tatattagga 297961 aacatgtcca gaataggcac tcctggcaga acgtcgttag ttccggaatt cgtggaactg 298021 caaaaaagta gatttggctt ttgaggcatg attagattaa ttgatttcaa caaaatgatt 298081 ccctccagac agcttacgtc aagcctctag gagtaacaac 
tataaaaagt tgtggttggt 298141 tgttgaccaa caaaaacatg gtagaaaacc aagaacaggg ggcatagaga tttcttctct 298201 tatgagattg atagtttgat ggggggcccg gcaggctagc gatcagaaac ctaaaatcag 298261 acaatgatag agcgcccgta ccctacctga agtagcatcc cgccgatctt tcgaataaac 298321 aaaaggatca actaatacaa ggattgagaa atcttggcaa ctcaacagaa acgggattga 298381 ggaaacttga ctttattgaa attcgagcag cgaacccgat aaacacctca ataggatcgc 298441 taatactacc acttctagac ttggcacaga caacacttct tttctatata cttaattcaa 298501 gatgaaagca atgaatgatt gatccaacac gccaagccat acttttcatt cttattacta 298561 tacttaattt atctaggtga tcgacggaac acggagaggt gctttagtca ggacgagggg 298621 atttagacgg aactgagagg agaaccgaat gaatgcaagt gcaagagtgg aagattgatc 298681 ggactctcta cgaaagctgt ccttcttctc gcacaaacat atccgactcg tacgaaacaa 298741 aagattgaac cttcacccaa aaccgtacga agctgagtag caaagatgac agcacgccaa 298801 gcagccgtcg aaagaaagtt gacagatcag gagctgaaag actctactga tgattcccaa 298861 gccagctgta caaaacatag attacagatt aatcaggatg gaacacgaaa gaaagaatat 298921 actgaaatga tgattagcac aaaggcagtg ctcgtaccga aacaaatcta cagcactttg 298981 accaagcatt actcctccac tttctactga tgattagcac aaaggagcag tttggcatac 299041 aaaactgatt agcagcacaa aggatcagga gcagaagcag gatggagtct caatattgaa 299101 atgattagca cattgatcac gggatggagt caagtacgaa actccacact tcgcacacgc 299161 tctatggaaa caaaagccct cacacaactg atcagtagca catcactcaa gtacaaaact 299221 catactttca cattcttctt tattttagtt atcacattat cgcaaaagca cgaacgactg 299281 atcagcacaa tcgtacacag aagcactaat acacttatgc cccgaagcag gtcacaggtc 299341 cgaagcagga ctctccctcc cttccccgac tgataaaatg gacttaatta cgaaagaaag 299401 aaaaagaaat aagattccct acacaggtgc agtaaccatt ggagcattat cactatctgt 299461 actgaaccat aatccactga ccaaggaaag aaagattaca gcaccaagcc gtacgagaag 299521 ccaagacgct ttacgtcgat tagtcaaggt caaggtgccg atgaataaac aacacgctca 299581 tactccgccc taaaacccct gtaagcatac cgaaacccca acctcgagga aaaggtgcaa 299641 aggtgcttta tcaacaggac gaggaacatc tgaatgacat ggatcagcag caggcgcaga 299701 gtggaagatt gatcaggagc caggagctga agagtctact gatgaacaca aagattgata 299761 
aatagcatta ctcgtatgaa agaaacccaa gccaaaacaa aggaaataat aagattgatg 299821 gaaccaatca ttactcctcc actttcccgt ttactgaaat ggattagcac aaagatagca 299881 gtatgcaaag aatatactat actgattagc agcacaaaga tagcacggtg gagtaaacaa 299941 cgccaagcct acgcacaaca agaaggaaca gtgaagctgc caagcatcag cattaccata 300001 aatgtgagtt aattactgca aggaaagaga aggaataaaa gacaggtctt caaaggaaca 300061 atgatcggac gaaggtctcg aaatggcaca gtaacaaagg aacaactgtg ctggactcag 300121 cacctaccat aggtcaacgt ctttcgagaa taagtcctgg ggaagatgtc taggcataag 300181 cagcacccgg tcataagtaa acaaatgttg cttaaggagc caatacaaca caatcagagg 300241 actactttac gcttgagagg gccgatcagc tcataccaac aaaacaaaga agatggcaat 300301 gctgtggagt aaaccggcag taagtaaata cctgggaaag gctcggtagc acaccggagg 300361 caacaaagaa agataggtgc tttagcaact cgaccaagag gagaggaaga gacggacgat 300421 cggacaaagg aaaagtgctg gactcagaag aagactcgac taatcaaaac acttggaaac 300481 agctcgtaca aaacaaagag gcacttggga aggggaccac cggtctatgc ctctccttat 300541 agagtttcta cgaaccttgg ttatacggtc atagctattg ttcattggta attggatagc 300601 tggtaaagag ctgataaagg gaacggggaa cagcaacaaa agaccaaggg tcactgggcc 300661 acggagtagt cccttgaggg taatggttca aagaaaagaa taaaggaagc ggctgtactt 300721 accgagtcga gcaaacaaag atgtgctggg tagtcaactc ttccatatca gaaggaaccg 300781 atcaaagcat cggaagaata ggattctaga tgacctgtcc ttctactgct ctgctcgttt 300841 ggctgcttat cttgcggatt ctgattcaat tagatcaagg attgccaagc aaagcagcac 300901 caacctagct ctactctagc taatcaatca tggcctactt ctaactgtaa atgtgatcac 300961 cagcatcaca aaaacaacag aagcggtatc gaaataagca cctgtcccaa ctcaagcctg 301021 atatccctgg cacaacagag tttgagactt cgccggataa gcccgatatt gagcctatct 301081 gactgagcct tagccagaga tgaagggctt ggcaccaccc taacgcagta aatgaagcga 301141 tatcccttga gcgaaagctt tatttttggg catctacagg agaagtcgct tatggaaaga 301201 aacagaagcc tagctgctaa caaagaccta tgcggcttgc ttggggtagt gggcaggcat 301261 ttaatcgaga aagaaggctc ttgagtaaac agagtcatca tatgagactt ctgtcggagt 301321 gctgtggaaa gactttcgtc cgcataagca aagatgattg ctgctcaaga ctgggaggca 301381 agtaagggag aaagagagaa 
ggttgtgcgc taggagagtg ttgaagctat agatttattt 301441 agtagagact aggagtctct aacttactgt gctttggccg agggtgaaag tcgaaaagat 301501 atatcagagg gctgcccttt cttcttagag accgagtcaa ataaagctcc tccaaactga 301561 cctgatctat tgtttgagtc taaccgagaa gtctcatcgc tttgagcctg gtttaggcgc 301621 atacgcccat agagtagccc agcccgacca atcatcagtg tacgtggtcc aaggaaatca 301681 aggctttatg gccaatgaag aggtgggcta tgaaggaatg gaaaagcagg gttgtgcttg 301741 tgctgcataa cttacttcta cattcctcag attacccagg agtgtgctgg aatgagaaag 301801 tgaagtattc cttgggcgaa aaatagtcaa ctcttacttt gcttgggtag ggctttgcaa 301861 cacaccgata tccctttctt tctagggtgt atggttccta gacctgtaca catgtaatta 301921 taaggccaac tcttgaaaat ctcaggttta cactgctaga tcaaattcat actgaaatgc 301981 attaccctac ctatctaaaa aaaagagggc cctggggagc atgccaccct caacccacaa 302041 actcccattc tcctcagtct cccagtattg gagacctact tactacatct cgaccggggg 302101 cctctccagt ctttgcttaa caacttttct ttcttactga ttgaggattc ctaatgtgat 302161 gaaagctaaa ttaaagctaa agctctttct tttaagaaag catctggttc catttttctt 302221 tcgttagtta atccaccttt aagggcttgt agtaattcag gtttgacact atttggaatg 302281 gctttctcat attgagagat tctgtctagt ggcattcgat cacagaatcc attgacagct 302341 gcataaatga ctagtatttg tttttcaatt ggaagtggtg catattgtgg ttgtttcggt 302401 acttctgtca gccttgcacc tctattgagt aatgcctgag tcgcagcatc aaggtctgag 302461 ccaaattgag caaaggcggc cacttcgcga tattgtgcca attccagttt tgaactaccg 302521 catacttgtt tcatagcttt caactgagcg gcagacccga cgcgactgac agataagccg 302581 acgttaatag caggtctaat tccgcgataa aagagctctg tttccaaaca gatttgtcca 302641 tcagtaatgg agatcacatt ggtgggaata taggccgata cgtctccagc ttgtgtttca 302701 atgacgggta aggcggtcaa gctacctgca cctgtctggt ccgatcgttt agccgctctt 302761 tctaagagac gggaatgtaa atagaaaaca tcacctggga aagcctcacg gcctggtggt 302821 cggcgtaaca ataatgacat ttgtcgatat gccaccgcct gtttactaag atcatcatag 302881 attattaatg cgtgcattcc attatcgcgg aaatattccc ccatggcaca cccggaatat 302941 ggggccaaaa attgcagagg agcaggatcc gaagcggtgg ctgctacaag aatggaatat 303001 tccaaagcat tcgcttcttc aagagtttga attaattgtc 
ccacagtcga gcgtttctgt 303061 ccaatcgcta catagacaca atacattgtc tcactctcag aggtggccct tgagtttatt 303121 tgcttttggt ttaatatggt atcgatagca atagtcgttt ttccagtttg tcggccaccg 303181 attagaagtt ctcgttgacc acggcctata ggaaccaggc tatctaccgc ttttaaccct 303241 gtttgcatag gctcgtgcac tgatttacgt tcaagaatcc caggggcttt cacttcgaca 303301 cgtctttgct cgtgatcgct tagagcccct tttccatcaa taggtactcc catcgcgtcg 303361 accacacgcc ctagcatagc ctttcccgcg ggaacatcca caatagatcc agtgcgcttg 303421 acaagatctc cttcttttat agcggtatca ccaccaaaga caacaatccc gacattctca 303481 ttctcaagat tcaaggccat tcctttcaca ccgttggcaa aaagaaccat ttccccagct 303541 tgaatctcgt tcaatccata aacttgtgca atcccatctc caactgagac cactcgaccg 303601 atctcatcca cttgaaaatt cgcgtaaaag ttcctaattc gactttcgaa tagattcgtt 303661 agttccgcag ctctaggaga taattccata attccatttt ttaattaaga ggcaaggagg 303721 aataccgctt tagaaaaaaa aaaggaataa tttacaccga ctaagatcaa agaaactagc 303781 agactaatca ctaaaaaggc ctttggttca aattgttaca tctatcttaa cttactcaat 303841 tatctcgtga actatctgct tcaaaaagag agttggttga atgagactga aaccttcttt 303901 cttcgaggcg ggactcattc tcctgttgcc ttactaattc ctgatcaagc ccctgcctcc 303961 atatgtcgag ttgagcatct cgtaccatat cccaggcgcg attgcgcgcc acttttccct 304021 ccagctcgcg cacctcccaa tcgatgcggg cggctagttc ccgcccacgg tagggagacc 304081 tctcgttctc caggttacgt tccatttcct ggatccgttg tagatcctcc gccaattcta 304141 tatttccggc gtgtaaggac tcagcccttt ctattcggga gtagaacgag gctatggggg 304201 gagggggaga aaccgggtcc ggagagccag gttcggcagg aaagggcacg ggatgtgtaa 304261 gttgcggagg aaagggggcg agggaaactt cccttgccgg atcatttgca gcagactcag 304321 tccagccgtt ccccccacta ccgctcgccc ctgtagcagg gtttacgacc caacttgcct 304381 caccaacatc ggaatctgat ccgatgggca gcaattcccc gaaaccataa aacaagaaca 304441 cccctgccat gaagaagcta taaacatctc caaaagcaaa tgagaaaagg atatttatat 304501 ggattcgggc tgcccaaaga aggagaaagg ctattaaaca taaaaaaaaa tataaagtcc 304561 aaatgatcgg ttttttgaag tagcgcgaga aggtacgaaa ggtgaaagtc ataacaaaga 304621 aagttatgaa gagccaggaa aagtttgtcg caacaatagc gtcattccaa taagttacaa 304681 
ttgataacga taccagttgg gcgaacagcc gagtaatcat attagatttg aggattagag 304741 gcaaggagga ataccgcttt agaaaaaaaa agtctacctt atgaactgaa cgaaagcact 304801 aactccagag cttgtgggca aagtggtttt ttttcgtgcc ctgattgaca aggtctaaat 304861 agatgaaaaa accatggaat gctctcgttg ttaaaaaaac tacgagcggc gacttcttgg 304921 agatttactc cggataaagg taaggtcaca ctctcactag tattagtatc cctatatact 304981 ctcaacagaa ttgagagttg ggacacgatt ttttcttgaa gcgcatctcg cgctacatca 305041 gtcaggatta tttccatatt aatactattt atttgatgga tcatttggcc agacctaacg 305101 gctacaagaa taaaagtaca ggcagagcca aacccatcct agaaacgaaa tcagcaataa 305161 gttgatccat gagataagga gtatttgatt tgagttttct taaaagaaga ccgataactc 305221 ctatcccgaa gatacaagcg cccggtcctc ttctcttgtg tcgaagtgta gtgtggtgag 305281 gttttgcctc acagaaggag tgggaaattg gggaaaaagc cgaaacggaa ctaacggaga 305341 ttgaggtata ctccgtgctg cgaaacggat tcggccagta caatactgag tattattgag 305401 gccatgaaga cggacagcgc ttttagcctg cctatagggt tcttttcgat cttgtactct 305461 accaattgat aggtttattt cgcactttag aaaaagtttg gaaagaaaag tagacggaac 305521 gacacctgtg aactcggata gcattgtggc ataccttttc atctgaacta ggctactttc 305581 tttcttctct tatgaaattg atagtttcag atcattctta aaagaagctc tctcttatat 305641 acgataccac cagatgtgcc ccttccaccc tcgtccaatc ttcctaagtg ccagcctagc 305701 ttcataggaa gctgaacaat acgtccagct ccttaaggaa tgaatacttg ggcttggacc 305761 atcaacgggc gaagcaggtt tcctgtttgc atcacctttt ctccggtatt ttctttttga 305821 aactcaatct ttaaatcatt caattataaa tagcttatat aacgaaaaat tggaattcga 305881 ttgaaaactc tctcaactca attggaagat aacatagtaa gtagctttcg cccaaaaaag 305941 cttcgcctag agtggatcag gtccttatac gtaaggcgat cccgcgaagc cggtcaccgg 306001 agtgaagtca acttcctttt ctttttgagc agatgttgct tgcaaataaa aagataataa 306061 gtgttttctt attacccgtt tttttaagag aaaagattta attagctctt agatatccca 306121 agtaagcttt taggtcaaaa tagatatgat gatatcagac agcagtgaac ccccttatcc 306181 ccatctgcat ggacttggct agtagtgaag cacaaccact cttctttcca tagagagtgc 306241 taccctagag taagatgagc gaccccagtg gcaacgtggg ccccaaggaa aaacccttca 306301 ctagtcaacg aagtgcatca 
acggatgtag gcacgttctt gcttgatcat ttgctctcac 306361 tagaccgacc tatcataaac gacggcattt gaccaggtaa acataatggc taacggtgaa 306421 gagtcgttct caggcttgtt tccgagtaac attctgtgct tccggcttat gcaacgtggt 306481 tctttaagcc atcaccactc ctttctcttc ctctcttcca aggggttgga agaaaatctc 306541 ccttatcgat gtcggccgac tgagtttcag tgcccagcag ttgcaagcta gcaaacaagt 306601 gagatagaag catttgaagg aaaaagctat acgattgggc gaactctttc tttcttttca 306661 ttgctcgaat taccattccc aatttcttca ttcttttatt gagctacttc ggtcacctcc 306721 tgcattggct ctatcctatc tcaatggttg ttgatcagcg tgggggggaa caaaaaggtc 306781 cttacggaag aagagatagc tagaaagaaa aagggataga gtcgatagct agaaaggacg 306841 caaaagggat cgatagcgag aggtagagga ctgaccgctc tcccgcccac atcctccctt 306901 gcagcaccag cttcgcactt aggtgcacta aagggcctga acgagcacaa ttctaagagt 306961 tctcgactcg tggcaccact ttcgcagcag tctgcttcat ccgataggtc agccgatttc 307021 cttcagaatc catctttccc tcaaacttct tcaacacgac tggggaagag cgcgtcgaca 307081 ctctcctgac atcagtacct ctgctaaaaa agaagctctc atcttctgag accagacaat 307141 ttccttgtga tagcactctc tggtatggct gggttgcgca gaggagtttg aggaaggtag 307201 cttcttctac gcgggtacgc ctatgagaat aagttagcac actcctagcg aagcaagcta 307261 tctactcttc tgtgctgtcc tgtcctagcg aagaagcttt gagtgctcct taatagtaat 307321 agtaggagct ttcctctcca aaggtaatcc caggatctct ggcacaccta actctttcct 307381 ttagaatcct tattcttggc agttacatga ttagctggta ttctcgctcg tacagttaaa 307441 caaaggtgta ctgtgagcag cttttgttct aattcttggg tagacagttc cactagctcc 307501 tcctcagctt tagtcccatg ctttagtctt ccttttcctt ttcaggtttg gaagtataaa 307561 gtatcatagt ttcatagatg gctagggagt ttggtaaacc ttagttctta tagggaaggt 307621 gaaaccaact ctcccttttc ctttcctagc gaagcaagtt ctctctactt ctcggtattc 307681 ctctgctttc tctagcgcag tgctgggtta ttccttccta actgttatct gtatactcct 307741 agcgaagaag cccgataaac tagtattgca cacattccag cacacacccg tggatgaaga 307801 ttggtcccta tctcgtgcta aaatcaattg ggtgccacta tagtataaac gcgggcgaaa 307861 gaccctaaga gtcggtttac cccttcccag gagtgtgctc aaccaaccaa cttagtagat 307921 tagctgccca tcttcaccta acctagttga cttacttatc 
tgtgttctat cctctgtgct 307981 gggatcagat caaaggaact ctcctgaaaa ggatagccaa tttagcaact tcacaagggt 308041 gtgttgtaaa caagcgtagc cctgtgttgg caaaagggat gaactacgga aggtaagaga 308101 tcagattcgg aactagcgct gcaatcttct gataaaagaa agtataaatc cctgcttact 308161 tagcagactt agcttagttc ctatccatcc atggaactag aaacccatca gcattagcaa 308221 ctgcccaaaa gagtcaccct tcgcgctacc tatgaaatat aaggaacgtc agttagtcga 308281 ctaaagggat ccttagaccc ggactcccac gactattaag aaagcactgc tctttctttc 308341 ttcttttgca tttcctactg aggaatccat tgcaagagct cttcttcccc caaacttctt 308401 ttcttagaga ggcggccctt caccttaagg cccggttcca gtaaacatag atagcggggc 308461 atgtcatgga aagatatcga cgcaatacaa agctgctcga gctcgccgta ggaaactaac 308521 tggcttagtc atgcctattc ccacaaaaca aacagaggag agtctttctt catgcaagca 308581 aaggaacaag gcagataagc taactacctt ttctcatcat gcacattccc aacgaaccaa 308641 gaaagagact ctttaatcaa agctagctac aggggaatga tccggcaggc ggagagtttg 308701 attgcgacag tccagccttt catacgagaa atggcggtga ccgtttgccc taaacaaata 308761 tctcaagatg taacacatat gccttctcgt agagcacacc cagctcgtat ttcctagatt 308821 tgcttacttt tcaccatgct gagaaggcat ttttccaaaa aagatttgtc ttacgaaggg 308881 aaatgtggtt tcttactttt caccatgtgg agaaggcttt tttgattcca aatgtggtct 308941 tacgaaggga aatgtggttt cttacttttg tcggtatata aagagcaagt tttggattcc 309001 tacagtcctc ttacgaagga aaatgggatt tcatcaaatc ctcgctatat aaagaggaag 309061 tttccctttg actctggttt cccttttacg aggaagctac cagctaaagt ggaatccttt 309121 ctctgtctcc ctctctttct ttcctttcta gtagcaaact tgattctgtg gctttccttc 309181 cattccgcga gagtagggca ccaaaagctg tcctatcatt tattggagtg gaaggccttc 309241 ccatcctcat tccgaaacaa agagagtaag gccacttgtg acctatcttc ttggtccaac 309301 ccctacttta agaggaaagc acaaatcccc ttttcgttta gccgatatct cttgaagtat 309361 ttgttttgaa aagctgcggt aatccactct tccttatttt tgctgtaact cgacaagata 309421 tgagaaaggc gaccttgtct tcgtctggca acactaccca aggcgagaaa tcaactacgt 309481 cttggccttg gaagttcccg gggtctatct ttctagtaag gctggacgta atcgtaatct 309541 tcccttgtgg agtgaagtga tgggatgcta aagatacaaa agcgtaggat ctggaggaac 309601 
ccctttggtc tcgattccct gtagagagcg taggactcat gggatggtgt agtcgttaga 309661 cggccatgag tccgagtgat cggctagctg ggctgctgct tctgtcatta tataatatct 309721 cttcctctac tttaagtcga gttccgttgg tggggacagg agcaccggat acttttacgc 309781 agtaacaaga ggaagaagaa cgataaacag attcactgtg ttggaggatg aaaacggagt 309841 tgaagtcaag aagctcaaat agcgaacgtc atctcctcat actttaatga catcttcacg 309901 gcgagagctt cgggtaattt ccaactcgtc cgtgaagctc tttctccttg catttctgat 309961 gaaatgaact cagccctcac ggcttctcct agtgatgaag agattaaggc agcagtcctc 310021 tcaattcaaa taaggctcca ggccccgatg ggttttccgc caagttctac cataaatact 310081 gggatatcat tggaccagaa atttcaaaga agatcaaaga attcttcacc tcggcctcct 310141 ttcctcctcg ggttaacgaa actcatattt gtttaatccc aaaagtgaga gctccgaaga 310201 aagtctcaga ttatcgacct attgctctat gcaatgttta ctacaagatt gtagcaaaga 310261 tgttaacgaa gcgcctccaa ccgtatctgc ctttgattat ttcagagaat cagtctgctt 310321 ttgttccggg tagagtgatc tctgataatg tactgatcac ccatgagaca ctccattact 310381 taagaatatc gaaagctact gttagaggtt ccatggctgt gaaaacagat atgagcaaag 310441 catacgatcg catcgaatgg ggctttctaa gataagtttt gacttgcttt ggcttccata 310501 atcaatggat tgaatgggtt atggagtgtg tcacttctat ctcctattca ttatcaatgg 310561 agcccctcaa ggtttagtca ccccatctcg aggtcttcgt caaggagatc cactttcccc 310621 atacctcttt atcctatgta cggaggtgct ctcaggatta tgtcgaagag ctcaggaaca 310681 aggacgtttg cctggtatta gagtatcaaa caacagcccc cgtatcaacc atctcctctt 310741 tgcagatgac acttcttctg caagatggat cccactagct gctcaaatct ggccgatatt 310801 ctttctaagt atgaggctgt ttcagggcaa cccagtcaac cacccgatgt cgaaccttta 310861 ctttctcggc tcacttccat gatagtgcag aatcctaaaa gaagatcaat ctattgctac 310921 tgaaggtgag aaagaaagag tgatttgctt cttgcaaacc ctagtgagtt atctaccgcg 310981 tctgatatgc ttgctaagct cttgatgaac gaaactggat agaaagattc ctcttgatgt 311041 acgaaaacct tggattcctt actagttggt acttgatcca atgctacatc ggaaccggga 311101 tatcgatttc ccaagcctta ctcagagcga tgctcgatac gaggaggaaa gcaggtcttc 311161 tcactctctc tcatcttgaa atgcccctat ctcaataagg cagtgaaatt acattacatc 311221 aaagtctggg tctcgttatc 
cacccttttc cttttatcct ggcactgtgc ccctcagcac 311281 ttggacgatt gagtgatcat agggaaagta catatctaga tgcttctcat agggttgaag 311341 aaggtataac gaagctatgg tacatcagtt ccctacagcg gccgtcttag ccgaacccgg 311401 aaagaattaa aaaagacgag tgctcattat tctatttcat agccttccgg gagagaattc 311461 tttctcgatt cgtaccaaat tcctagggtg tgaggcgata gttttccacc catggcattg 311521 gcctacttac ttctccaatc tagatgtgcg tcttagtctg ctgtcaacgc gttatgggaa 311581 atcttctttg atcttcccag tagttctgtt accgctctgt gaggggaagg gcaggctgac 311641 gagtgactat tcatcatccg aaacaaggat ctttgaaata gtctcgtttc aagccaactc 311701 cagagctcta ttccataacc aggtgctgtg gcgagtgtgg tttccgatac ttgattgagt 311761 tcgattgctt gcccggaaag aagaattgtg ccatcgacgg ttgggagtca aataagagga 311821 agacaagttc tactgaaaga cttctggtca gacacaagaa aaggaatata ataataatag 311881 ttggtacggc caagagcact tgtaattgtt actagttgtc ttctctatat cttctccggc 311941 aaagaaacat ggaatagtcg tgcgaactct ggcggaatct agcactaaaa agagtggttt 312001 cattattggt agcaaatggt tatacttaat caagctgaaa tccgatggta gtatagatag 312061 gtataaagct cgtctgcttg ctcaaggcta taagcaggag tatggcatcg attatgaaga 312121 aagatttgct cccgttgcca aaatgactac tgttcgtacc ccgttggctg ttcgtacccc 312181 gttggctgtt ccgcccattc actataggtc tttacatcag atggacgttc aaaatgaatt 312241 tcttcatggt catcttaagg agacagttta tatgctatgc aacctcctcc tggctaccag 312301 gaagcggaaa aaactctcgt ttgtcgtctt cgtaaatctc tatatggatt gtgaaacaag 312361 tgcatggttt gataaatttc aaactatctg agagttgagg aacaccaagt atgatttgat 312421 tctagggcct ggagctcatc atctatggct ttgcgccaac atgcgagttc tgctgcctgc 312481 ttataggtag aagggacatg aacagaatga aaagtagtca aaaaagaaga tgataatacc 312541 catatcagac agtagataca gattgacgat tgaaacgtcg aagaggaact acagattcag 312601 atggtgcaga aggagcagag agttctgagt cattacctac tggggcaaaa agaaggcggt 312661 ggcacgaacg ttgcattatt cgctgggctt tgacagaccg attccgtcga agttttctca 312721 agatcaaaaa aaggaagata agaaatggag gatgtgggtg gtgcaagtgg tgtgggtgaa 312781 gaagcataat ataataagaa ttttccaaga agacaatgtg tcgggaaaca gctaggatcc 312841 agacgacata cctcctctgg aattccaatg aagcggccgg 
acaaacaaga cgaggggcat 312901 agaaacagaa agccaaaagt ggaatgattg gataaaagag atgctggaag atgccgagat 312961 ggtaaaacag tgggaggagg cagaattcct ttttctttag aagaagtatc ctcagatcag 313021 cacaccttag aggtagaggc gatgagcagc ctatggtgcc cagaaaccag ctcaaacata 313081 atgagatggg gtgaatcaca ttcactgtcg cggacgccca tgtgcttcgt aatttgccta 313141 aaagagtcaa gtctggctgc atacgataaa gtagccaaga aaacgatgtt gaattaatgg 313201 atcacatact tccgaacaaa agtaacagga gatccactta tccccccgat aggtttatat 313261 gccgcctggc ttagcaagtt catcttccct ttgagccccc tactacctaa tagggagaga 313321 cttcgctcca gtttcagttt ggttagtgct cagtcagtag taactagatc tcacggcagg 313381 tgccgcttaa tcagtctacc ggattccttc tgtctttatc acagattttt ttcttctcca 313441 gaagtacaga tcagtgctcc tacactacag ctcaaaacat actttgcatc gatcgattgg 313501 cttagcttat aagtaggaac ggaaccttct tttctcaact caaatgcttc gagtttaagt 313561 tagtgttcat tgcttactag taaattgata ttataagttt atcctaagtg gggagaagaa 313621 gataattgcg gatttcacac aaaataaatt ctaccaaagc attctccttt gaattagatc 313681 gctcgcgacc tatgtcaatc aaaggaatga atggttgaca ggcgatggat aaaaataagt 313741 ggaagaaaat gtgcgattac cacttcgatt ggactttatt ctttctttag ccataataga 313801 agctctgttc aagcggagga ataaaaatgc gttagtgact tcgatttgaa ggaaggtata 313861 cctaacggcc caagaagggg cagttcacca agaaggagcc aggggctagc gaagaagaag 313921 aatcaggggc ccctgattct ccgacttttg cagaagcaga attgaaagac tttttatttg 313981 ttgaaactcc tacaattgag gaggacctcg agttcgatag attgttcgag gagatctacg 314041 acagaatcct ttccattttt gaaagggaaa atcttattct cccccccccc ccaaatccac 314101 tttcttctat gatggatata gtcaatttcg tcctcgtgga cgattcagat atgtctgaac 314161 tcatatatat atatatatga tgacttatat ctaaatggaa tccagtctat atatttagaa 314221 gagtttgact ctacctacgt ccaggaaatg ctactttttt ttagcgtggg agggtaattc 314281 ttggagattg agggaggaca ggttggttcg aggacccctt ggtcaaagga aaggtacaaa 314341 ggaacttgac cacttgttgg gagaggttgt gaaacaaact cgactaaaag aagaggtata 314401 aaatgattcc ggggcggagc catatgacgc gagagtgtag actctggaac tcagggagca 314461 agaccctaaa gaaagttcaa gaagctatga agagtggtaa actctaaaag gaaagataga 314521 
aactggggag ttggctgata aagatggaca gtaacgattg cgtaatataa atttatcggc 314581 ctcgtcatcg aaagcggctt ccaattgctc ggaaattttc agctatatgg ggggcttgga 314641 tggtgagcaa aaactattga tcaagaagtt ggtcaatttt cgcatgaaag aaggtaaaag 314701 aacgagagtt cgtgctattg tttatcaaac ttttcatcgc ccagctcgaa ctgaacgcga 314761 tgtaatcaaa cttatggttg acgccgtaga gaatataaag cccatatgcg aagtagcaaa 314821 agtaggagta gcgggtacta tttatgatgt ccctgggatt gtagccaggg atcgtcaaca 314881 aaccttagct attcgttgga tccttgaagc agctttcaaa cgacgtataa gctataggat 314941 aagcttagag aaatgttcat ttgctgagat actggatgct taccaaaaga ggggaagtgc 315001 acgtaggaaa agggagaatc ttcatggact ggcttccacc aatcgaagtt tcgcgcattt 315061 cagatggtgg taaagtgaga ccacataaag agctcttcgt cattcagtca gattattaag 315121 taagatatgg tttgaccctt ttcctttttg ttttcatttt catccagaaa gccggccttc 315181 ctcatactcc tcccttcatt cattgagtta gaggaatcca taggaggccc acccgttatg 315241 cattgcatga aagaaccttt cttttttgta tgactgagag gaacgcctca ggtcgaatga 315301 atacgaaagg gagatcaatc caaagaaaca aaaaaggcca tgaatgaaga agttgggcct 315361 ttcaccctct ttctagttac tctgggagct gatctgataa atgcacttca aagggaggga 315421 aggctaggaa tctcgttttt gctcctcgta cacatcttcc tcagtccggg cattccgacc 315481 ggcggaatag gcattcatat aatccattta gccatctgag gcacctctat tggatgagtt 315541 gacctcacct ctttcattca gatctgcaga gacaactgga acagaagctt cactggaaga 315601 ggtagctaag cctgccgaat cttgggaatt caaatctgtt accgacctac gttgaaaata 315661 aaccagcagc taactgctga gtaagaaggg cagggtcggt actgtgtagg tggaacaaag 315721 taaggtcgag agtcgcgagt caagaaagaa atcttcttag caccaaagga ttataagtta 315781 gagttagacg atggttcaga tgttatgaag cacaagggtg caggtaaaga ctacgtcact 315841 gaagagtggt ttgaagaaca atacaacgat ccctgctcgg acagtaactg gtcgaaaagc 315901 ctctgctaaa gatcgtcttc ctagcaaaat gacattctcc taactgagag agtcaagggc 315961 agctcttacc ctgccaagcc aatctagatg tcttacgcat aacacaagct aagccgcttc 316021 cagtgaaagc agtagtgagg tagctcaata aacctagcaa acaccggcta gataagtggg 316081 gcaggctgct aactcagggg ggattctttg tcgatagaag gcattcgcga aagtcgccga 316141 tgaagagaag ctttactaaa 
agagggcttc tcattatgtg gatacaaagg cttcaagtgg 316201 tgcggttccc acaggaattg caaacacctg aagagagcgg gtccaacccg atacgatatc 316261 cttctttccc tagaaagaag tattcacata gtccaatcca ttacagcccg gcagtctaac 316321 aggcgtagat cacactggaa ctggagtagt ggaactagca cgacgaacaa tgctcggcac 316381 tctggtcaag tgagcccctc tcattctcta taatccctat agttgccctg tataagcatt 316441 cttaatatgc cctctcaggc taatgaaagc aaaaaaaagc agaaggccct tactacacta 316501 agcgctacga gagcccccta tacggaagga gactctattt ggtcaaagtc actttccaga 316561 agagatcaag tcgtttactt cttatttgtg actcttgcca acaattagtt ctgtaactgg 316621 cacttgactc aaaccgcttg tctaaatact ggaaaagagg gaattgattc aagattccct 316681 tgtgccctac aaccctcgag gcttttaagt ccatagcgcc cgattcgatc actctatcga 316741 atagaagtcc agggatgctt caatcgattc ttaacacctt gaaaactccc ttctttattt 316801 ttgatgagaa ggcaaatttt tagctttctc ggcttaacga ctctttcttc gaaccttttg 316861 gtatgtattg ccccataact atagatcaga tccaccagac gagaaactca attccgcact 316921 gctgctgagt cgtcggactt atccaccaag ggattcgtgg aatttctgtg aattgcgttg 316981 ggggaaatct accaaagcta ggcagataga tagactcctt tcccgctgct aagggagagc 317041 aagaaataaa ataaaattct tgttttttaa ttagcgcctc cttttgagta tgccgatact 317101 aagagtcctc tcactaccaa ccagcagaca tcatctggct gggtcttgcc ggtatcaaca 317161 acgagaaaaa aacaaccaaa tggaaacagc aactaactat gactcttggc ccagacaacg 317221 tcctaggcgt agggtagatc ggagcaagag ggaacgccta gacgacgttt ttattcctgg 317281 gctgctgtaa gtagatcgag aggtcgttcc gccccttgtc cccgaagatg gggtaatatg 317341 ttctcaaaaa atgttgcttc aatctgacct agctggcgag cgatcccagt tcgcggcaga 317401 gaacaagaca gcaagcgagc gtacctttgt tcgcttcttc tccaccaagc acggaagttt 317461 aaggataact gtaggtcggt ggctacctaa ggaaactgcg attcgatccg ccccccgaaa 317521 aggaggcatt tacctcatcc cgatcacaag gagaggttca ccatgatggg ggtacttata 317581 attttccttt ctagaaagaa tgaaatgcat aggggaccat ccttttgttg cggcatgctc 317641 ctcaatttac ggattcgcag ggggcggaac gacctactta gttgactgac aatgacaaag 317701 gcttgcgaaa ctaagtatgg cctttcaaat tgaatcttta gtagtctatc agtggtgcga 317761 agagaaacat atctttttat gacttctact tattgatccc 
ggcccggaaa agtgagcgac 317821 caggtttata aagactggtt cgaaaggcgc gtctagccct cttgatgaat gaaagaaagg 317881 gtttctgagc cctagccctg taagcccaca cctgtgtaga gtagatcgtt caacacctgc 317941 ggcacaaacc aatttttgac ctattggtcc tagtctcata ggcgaccgaa cggccgcccc 318001 ctaattacta gaccgacccg ctataataat tccataaaca cctagcgaag atatggcaaa 318061 caaataaagt agccctatgt tcggatctga caataccata ccataatcaa aaggtacaac 318121 ggcccgagcg accagactta acataaatgt agccactgga gccattctaa aaaggaaaaa 318181 attagcacta cttggtgaaa taggttcttt tagaatcaat ttcgaaccat ctgctagagg 318241 ttgtaacaat ccgaacgatc ccactacatc aggacccttt ctacgttgca caaaagccat 318301 tactttacgt tcagctagca ctaaaaaggc tactcctagt agaagtggta gaattattcc 318361 aagtatttca gctggaacag ctatgtacgt tttcgatttt ttatttactc atgatctggc 318421 ctggtcgacc caatcatgat attgaaggat gggacctttt ctcgaaaaac tcccgatacc 318481 gagaagagtt gaagatggtg caatattcca tctattctca atggaacgaa ccccctatca 318541 ctttactccc gaaataaagc accccttccc cttctccgcg aactatctca atccccaagt 318601 aattgtttaa tataaagaga ttcgtcgccc cgtctgagaa ggtcacgctc tatttcgcct 318661 agaaataaga tctctaggcg gtccatttcg gccgggctag atccggggtt cgaaatgtcc 318721 agcgcgtccc gcctctcgtt caagaagtcg aaaaaagcct cttgaggaga gtaggcggcc 318781 ctttcttcca aataaggatt cctcaaaagc actccccgaa agacttctaa acaggagtgc 318841 ttccgttccc ttatttggag atcccggttc tcgaaatcca agagtctata atactccctt 318901 tgttcggaaa cattataaag acattcttga atttcccccc aatactgccc tttctctttt 318961 ccaagcagaa aaggggaatt cgccgcctcg agcaggcgaa ttcgattaca aatcgaagac 319021 tccaacccga gatttcgggt aatgggccag ggttccgaaa gaaccctcag cccaaaagaa 319081 tcctccgacg aggaggtgtt ggaaggggtc ggggctgtgt tggggaagag agcctccgcc 319141 gcgtcgtcaa gggccaaagc aaaggatggt tcaacaatga atatcgaaat gaaacaagga 319201 aaaatggtcc gaagaatctc gatagtagtt ccatgaacaa ttctttgcgg gattgcattt 319261 tttttatagt ggaaatgcca taaagcgcga accaagatcc ataatacgaa aaccaaaatc 319321 agaatgagga agaaaaagat atcgtgatgt aagtctatta ttccttgcat tataggtgta 319381 gctgcgtctt gagatcctaa ttgccatggt tccgctgcat cacaaggaga aattgtgagg 319441 
aataaccatt ttagaacaat cattttcaaa gcaaaggttc cttcattttc tgctcccccc 319501 aaacaaagag agactgattc tgactctccc aattaaggaa gacggaaatg gctggtgccg 319561 gttggtccaa ccaagaaaaa agagatggga atttggggcg taagattcta tcaatcggga 319621 agttggcctc actcaatagt cggatacaat cgtggacacc gtagacctgc ttcgcgggat 319681 cgccttaaaa ggacttgatc cgcacatggc gaacggtatt ttggcagtgt tttcttaatt 319741 gatatctcta actggaatga taatctgatt atatacgtat tattccaagg aggatcaaca 319801 tcccattcca ttacaaatgg ttaaacagct atcctttacc atacatctct tcccgctacg 319861 cgcctcacag ccggtattgg cccttgctgt gctccccctg gaactgtagt tcacttcctt 319921 cctccgggag ctgcagctat cttctcaaca atcaactatc caatcttgtt gctaatgtgc 319981 aaaaagttga agcaggtgtt gcgaagctgc cctgctgctt tgattgcgta gctacctccc 320041 gcagggaaat gcagctaact ccttaaaaag agtaaacagt ttgttagtag gtatgagttg 320101 caaggaagtg ttgcgtagct gccttgctcc ttagtaaagt aagcacggga accctcccgc 320161 agggagctgt agctagagct ccttattaag aactcccttg cttgttttgc ctaggcatga 320221 gtagttcagt tcgaaaatgg atcctcaata aaaaaaatgg aatatctgtg cggatatgtt 320281 cagctttggg tggaaaggta aggaatttta ctccgactaa gattattgtc catggtgtaa 320341 cggacacagc caaaataact ctcgggtaca acatgctaca gatgcctgtg gatcgctggg 320401 aggcgtacat acaatgaatg ttattgatgc tactccttct taccattttt gactggggcg 320461 accccccatt gatcttggct tgcttctttg ttcttggcta ctgcttggtt aactattgcg 320521 aattatttaa tggtataaaa aatacttatt ccttctcgca tcgcgcccta gaggggccgc 320581 tcggacgaag aaagtcaggg atagagttcg agtgagacat caaccataga tttcggaact 320641 ttcgagatgc tttcttgcga aagccaacgg aatctgtaac tcaaccttct cacccatggg 320701 ggattcaatc atattcgatg aaaaatttca atcgagggca ccctcatttc cagagataaa 320761 atcaggctgc acagaagggg aaaagccctc ctactgagca agatcattaa tgaataagga 320821 agcacttaac ttaggacagc cggtctcact tcacgcagca tagcgccttg ctatttgcat 320881 tcatccttgt aacttaccga agtgaggaaa gaaagccaca ttggggatcg cgagtcggga 320941 gggtggcgaa gatattgtgt gtgtcagcgg agtctttccg gggttggttg aaagcatgag 321001 cgatgaactt ctcgacttcg atctttttct tttaagttat gggttgattg tgacctagta 321061 taggactgat ccaggtcaat 
tcaaaggcta agcccaagaa ttccaatagt tcgcttttta 321121 gttttagcgc tacttggata cttcaagaaa accatgaatg accattgtgc cttcataaag 321181 aggtacgtta gcggcgactt tctcatactt ttgctatatg tcgatgattg attgtgggcc 321241 atgaccgcac caagataacc gctctaaaga aagatttgaa caagtgcttt gccatgaaag 321301 tggggccagc aaaacattgg gatgaagatc actcgtgata gatcgagaaa cctgttatcc 321361 taagaacgat ctgttgggtg gagagattca atatacacaa ggcaaagcct acttgctgga 321421 catttaaaat taagctcgga acaaagtcca acaaatgaga aggaggaaat gaagactacc 321481 ccatatgcat ctgcagtagg cagtcttctg tatgccgtgg tgtgtacaag gccagatatc 321541 gcctatgcag ttgtgagtcg ctttctagcc aatccagact gcaaatggat cctccactat 321601 ttacgtaaaa gttgtctatg ctttggtaat gagaagcttg agctgatggg ctataccgat 321661 gtggtgtgct gccgacgaca aagattctag gaaataaact tcagggtata taactacctt 321721 tgcaggagga gctgtttcct ggcaatcaaa actacagctt tgttccacga ctgaagcaga 321781 attgctgcca cagaagcttg caaggagatg ttgtggatga agaacttctt gcttgagttg 321841 ggagtaaagc aagataaata tatctatgtg ataaccagag cgctattcac ttagcaaaga 321901 attcgacctt gcactcccgc tcaatcgata tacgacatcg ttggatccga gaagttctga 321961 aagatctcaa caaagtggag gagaactggt cagacgacct aggtgttgtc gattcccttt 322021 cctaatataa aaagagtttt cttgcattcc tattgtctaa ttcccatctt catcacacga 322081 aagaatattg ggtaaatttc ccaacaatgt ctggaacagc cactatcaaa cagggaatac 322141 tctcatctta gcacactcct gtcacatgca tttgagttct ccttcatctc gaaaaatcac 322201 tcgcgcactc agacttcact acagcttgag ttctgtaatc aggttttctg catctctgtt 322261 gtatctccgt ctgtaggtgt ttagcacact gctgtgagcc ccatatagat ctgtcctttt 322321 gatccatgct tgattacgtc aaccgtagag cgcatccaat ctgtcaaaat agtcgtagca 322381 gtaagaatca gtaattacag cgtggccttc ttttctttct gatacgtatg cttgctcctt 322441 ctgctttcct ttgacgaata cattccgtat ttgagacgcc ttgatatcct agaccttact 322501 ttctcactct tcttatttga ctgttcagaa tcttgtctag ttcttttgtt cctgagttca 322561 gcatccttat cagcttgagc tgcgtgtcta ggtctgtctg aagcttagct gatttcgatg 322621 tctgtcttta gcttctctat aagacttcgt agctctgtgc tgtgttctca ttttccagaa 322681 taaaattctc tttagggttt cacacatagc cctgtcctcc 
ttttcaacat ctttgccttc 322741 gacatcgacc tgatgaatca gactctgtgt cactttcttc tgttatccct gaccatgcaa 322801 cgaatcatgc ctaacgaatc ctcagattca tcttcatctg cattcactgt catcagcaca 322861 cagaagagag cgagtattcc caatccctca gcacaatgca cacacctaga acaacacctt 322921 gtccctttct cggttatccg gcacaacatc gtcactccat aacgattcgg agccaagctg 322981 gcatccgcaa acccaagttg atactctctc ttctagcctt ccagatctaa gagcatgtcg 323041 ttgttcttgg aaaagaagag ggattatata gctccttgtc atctacaata ataggactcc 323101 gacaggccag ccagaaataa actatagaag gaggttgctt cgctagagct gaagaagttt 323161 cgggctgaaa agctgcctca ccaggaagac ctgcttctta ctttctggtc gtatgcccga 323221 taggatttcc gtagtataaa ttgaaactcg ggtaccctta tcatttcact aatcaaaaga 323281 ctcaaggtct gtgctcttca cctgttcacc tgttctcttc acttgcttag cagcctcact 323341 tcgaaggtgt gctgcatact ccttcctgct tgctcttgag cacactcctt ttggagattc 323401 catacactac taagtaaaac gtcgtcaccc aagtggaaaa gaatcacttg aatccagaag 323461 atctagagtt ttgtttggag ctggcttgtg cagcacactc ccacccaact catcttcttt 323521 gggaactaaa agttgtttag ctgtcattct tccttccggt accctttcag tccaactgag 323581 caaattagtc aaactttgta ctcacaatga gagaggaacc tgtaggagct ccccataggt 323641 ctgaatgaac ataatccagc ttttcttcct ttgtatcatg ttgagctaaa tcaaaactaa 323701 ctctcttagc tctgccgtat atgcaatctt caaagatgct gaaggttgac actttctttc 323761 tgtcaaggaa tcctcttttt actaagagat ccatgttctt ctggctcatg tgcccaagcc 323821 tttgatgcca caagactgtg ttatcaactc tcttctccac tgatagagag tgtccagttt 323881 aaggtgtccc acgcaatatg tataaagtat ctagcctctt cctcaaagcc gatgcttctg 323941 actcaattgc gaaaaccatt ctaagactaa ctgccgtaat cccactaatg attgaatcga 324001 gagtcatcct tcttgctaaa ctcctccctt ctgtgaaact cactttaagt aagcagatga 324061 gcctcttacc cgtgcaacat cagaacatgc attctcgcga tctacagagt ccttatgggc 324121 ttgctcttga aaaagaagaa tgaaccttct tgttacagaa catacattcg aattcgttcc 324181 ttccccagca ctgactcttt cgatgctaca gtcggctcta ctgattagtc aactagtcaa 324241 atcgctttca catctcagat gacatttgta ccaacagact tcttgataag aactccagat 324301 gatccagctt attttaaaga taggttagcc agccccttct ccttcagggg atgcagcaag 324361 
actacgtcca cttcttcctc aatctactct aagaaaaagg caagtacagc tacttacttt 324421 agagtcgatc cagttcctcg ggggtctcaa tcaagtaggg tatgtgagcc taaaactaaa 324481 ctaatagtct atcaacctgg gaactatcaa aaaaaagtaa agagacaagt aaccgatcct 324541 ctttctttaa tggataaagt aaaaaaacga attgataaaa cagaaatact ctgaggggtc 324601 ttggattgta gagctgatca cagcagccag acagaagaca ataataatcc ccgaaactat 324661 aatccgccgg gcgaaaaaag tgcaagtgtg tgcaggtcaa gctcagggga agttgaatct 324721 agcggagggg tctttgcttt tgcttttcat gactcagcta ctattgaatc cgatcctagg 324781 gcattaaaag aagagtacga gttagcaggg ctgtctcacc ttgggcttgt tggccggcgg 324841 aataaccgca gtaacctcct tcacttcatg cctttgatag aacggagatg gtaattaggc 324901 tgctggtagt acaactaggc tctttctttg caagattctc agcatctacg gggagaatgt 324961 cccaaagccc cttttctcca attgcaattc cagagggata tccttttaga cgctctcccg 325021 gcattagcgg aataagagca gtgggacttg agctagtgag atagattcac tttcactatt 325081 tgaatgcccc tcttcgattt cgatgagcag aggattttcg gacattagct ttagcagacc 325141 atttgcggca tatactacta tttccatgaa gactataact gaagacttac tttcttatta 325201 ataaacttac actaggaata gccccgagaa cccatagttg cccgctacgc cccttctgct 325261 actttctttc taggtccaat aggctctttg tcggtcttga tgccgagaac tgagttaatc 325321 ctaaatgcag ccgtgatctt atatacgatg ataccaccag acgcgcattc cttgggatca 325381 gaaggaagag ttgtcaacgg gaactggcga gacacttctg acgttaaaga aggaagtctt 325441 ccccgggagg taactaaaca agttaatgga agtctgtctt cccgaactaa gcaagttaat 325501 gagttcagca agcacacaag gtttcttgta gacataagtt tcagttgttg ttccctaata 325561 aatagatctc tttgggaatc agcacagaaa gatgagctta gtgattcctt cgggaaagca 325621 cttacaacaa agccagaatg tttagctgtg agagaaaccc ctaggaactt tagaagaaat 325681 ctgtgcttag tgattccttc tttgaattga aatgcttact atagctaaga aatcatacgg 325741 gattacttgg accttaccac gttccctccc ctctcttagt atgggtcctg ccccgtacga 325801 gcaaatcgct tcaaaaggat ttgtctccaa tctctttaaa ggcactatta acttttcggt 325861 cttcacttgg aatctgtgca ctatgagtgc gttacaattg ttctttgaat actgtggcgc 325921 gcctttcagg tgctggtttc cgggtgaggg caaaccttca atcaaatgtg tccaagaaat 325981 ggactagatt gaggacaata 
tattctaaac ccccgtcatc gtctcaacta cctttagagt 326041 ggtggattgc tggagacaaa gtcccctctc tccatacata aagggaattc tagttgagat 326101 gctgaagaga aaattgaagc ctaaacgact tcaattgcct cctcaggatg tcgtctttga 326161 aggggaggca gctatgaatg agtatacttt ctatcgtaac tgggtggaat cctggttaca 326221 gcacatccgt tcatactacc ttcttttcat tgatggagat ccttctcttt caaagttctt 326281 tgagattgag atatgtgccc actcttggaa gcgttcgact ttcgaccaac aggtctttaa 326341 gtttggcctt ctatgggagt gtgtggatat tgctcgctct agaacagttt attggcaatg 326401 tgctttaggc acaggtcata tacaggaaga taaggtatcg gaggcaacaa gcccctttac 326461 cgatgacagt tgtaccaaca gctgtttatc aaggatgact ggacaataaa gaatgggaac 326521 tgggaaagtc actgctattg gcggatcttt gcctagaacc gaaactgatc tttagctagc 326581 cgaccacgtg ccagaagttt atactggcat tgcaataggt caagcaagtc tgctggcaac 326641 cctgtagcct attggcttct tcctatatcc ctattcgggc aaacttttga ttccattacc 326701 gagccaaaga agattcttcc cctagttcta cttatcacct tttcatactg ggcagcatct 326761 attgagagtg gattgacggt gttgagagat ttccagcata agcagtagtt gaaccagcat 326821 aatgaaaaag ctagaaaaag gggcaagtag gtcggcttca gccagaagag cttaaccgaa 326881 tttttccaat acaacttaag ctccaggaag aaagcaaagc cttgagggag tgagcttagt 326941 tacttttttt cggtcggttt gctaaagcac ctctcgggca ctaaacaaaa agtaatcaaa 327001 aagtaatact taataggtcc ctatccacct ttctaagaca acagatcgaa ctcctatact 327061 cccttcgttc gaagaccact cttgaagagc ttttccttac ctatgcccct tagccaatca 327121 agccggactt tccctttttt tcttgcctcg gtctttcttt ggtctatttc tgcgggtcac 327181 taaactaacc ttctctggtc cgctttggct attgaactaa tattcttccc ccccaggccc 327241 caagccagtc agtttgacca ggaatgcctg cgaacggtat aatcatgaat aagaagagca 327301 agcggtaaac gagtgcgaag ggagcgaagc gagtttttct ttttaaggaa gtctagtctc 327361 cttcaggcga gttgaacgaa cggtctactc aagctcctga gagcgagcgg aatagcctgg 327421 tctttctaat caaataagta gttaacccgc gattgacctt tatcttgtct acctctgtct 327481 cgcatgagaa atcgaattcg gggtgatcaa aactagccaa cgaactcagc tggcgagtgt 327541 gaggctatag atatagtcta taaaaaaaaa gaattgagct gacaaaaggg aagaaagagt 327601 tgttacgccg ccttcgacaa tcgtgaaacc aaccaacagg 
tgctttcttt tgttgtgtcg 327661 ctaaataagc aggcttccgt caagcgaggt cagccacaaa gggaaagaag aaagagtgag 327721 cgagaaagcg gtgagcttac tctttaagaa ttgattaccg ttgatggcgg ggggcggaaa 327781 cctttccctt tctagttgtg aattccgtat ttttgaaaaa aaaaaagtgc ataatattgg 327841 attcaaaccg acggcctacc ctttctttga accttgtcaa tgatcgaact taaagatatg 327901 aactgagtgc catttgatga gtaactgaga acaaaggaga ggggtatagc aaggatgaag 327961 taatgaaaaa ggaagtcatt gctagtagta acgacttatt acgatccatt ggttcatata 328021 gaatccatgt cctaggtgta tcaaaaaaca ttcttttcac taagcgtata taataaaata 328081 gagtgaaaaa gggtccaccc cgaaaccaag gggatactaa ctaacagctg agtcctctcc 328141 aaaccgcagg agatagttgc ccatcatacg gctcaccaac ttggcctcta tgggaggctc 328201 actccgggca ggttcggatc acttataata caaagctcgg agaaggaagg ggttaggtta 328261 ggaacgcagt aactcgaccc ctcatcaact aattaatgag accttatcct tgggagagcc 328321 ggaaaggcac tcgactaaaa ggttaaaaga tctcgactga aaaggagagg gcgaagtcat 328381 gactcgagca cttgttgcga gaggtccata ggaacaaagg agctcgactg taaggaaagg 328441 aacgagagtg aaacgagagg agaggtgagg ttttcaatat gattcttttt tcgtatttgt 328501 tcgatgagaa atgcgagtcc aggttttgtc cattttttct atccccctct atcaaacaaa 328561 atgatcaaaa aggaagttta ctggcttctt attctcgtct ttgatctctt ccatctctgc 328621 ctcgctcgtt gtcacctata ttgaagaaag aaaccctgta gagaatgaag aggggcctag 328681 gatcttcttc tcaacagtgc ttctcgaggc tccccccctg aataagtaag gccccgttag 328741 cctgggcgaa gatggggata aggaataagg attgaagccc ccttagctct gccaggcact 328801 gacaggggtt agctcggtaa atgtgtagag ccaagtgtag tatggtgtag tagtagtagg 328861 cacttctagg ccccttcccg gctactggat cactccagtg ctttgggtac tacggaccct 328921 ctgccatcca ttgcagcaga gccgtttcat gagcgggggg gctaggcgca gttctttgaa 328981 tcaaacgttg aatgaaatcg attctttttt agatatgaaa atggaaatcg gataggatag 329041 atggatggat ctatctttcc atttatatat tactaaagga tttatatagt taagtatcgt 329101 agcaagaagc cccaaatcct tgatttggcc aggaaagact gcactgcttt gggcccagga 329161 tgcgaaggga atgagctcgg ctgcttctcc tccacactga tttttctccg tgcctgctcc 329221 gcatgcgctt ccccattggg ctttgctctc ctcttattct tcattggacg gttcggttcg 329281 
gatggacttc gccgttcttt cccaactaaa aaagaaaagg ctgtatcaca tcgagatgtc 329341 gattcgtttt ccgcccccaa tgagatgggg aatttgtaac cccctatttc tactttgggg 329401 cccttcatct ttctgaatcc aggcccgtcc cggctcgtgt cgttccaaca accggcgggg 329461 agcatctcag tatacgatcg cgcgcagtaa ctgggagtcc tattacacct aaggcgaact 329521 tcaattcacc aaaccaaggt tcatctcgtg tagtgattgt ggactctact aaggatattg 329581 agtagacggt tgatgtatca gactcgaccc tatctttcgt agcatgcatt cccatcggtg 329641 tcgcaactga ttcggtaagc tacgtgtccg gtgcacggag aactgccttc ggtcctccaa 329701 acttacttat tcgtggcaac cttccggccg cccaacgacc tataacgcta gtcactactc 329761 ccactggggc tagaaagtaa gccccacaac ccaaagcggc gaagaacaaa tagaatttgc 329821 tacaaaagcc ggctaacggg ggtattcctg cgtatgagaa catagtaatg gagaaggtaa 329881 tagccgaaat aggattcgtt ttggctagag cgcccaaatc cgctatatat ttgacacggg 329941 tttgccgtaa tgctgaaact atggcgaatg catccatcgt cattaatgca taaataaaga 330001 taccaattag tagtgattga attccttcta tggttccaca tgagaaacca gtacgaatat 330061 aacctacatg tccaattgaa ctatgagcta gaggtctttt gactttcgtt tgggccatgg 330121 cggccagtgc tcctaagatc atagaagcaa tgctgcagaa aaagaagatt tgttgcaatg 330181 tagctccata ggaaccataa atagaaacac gtaaaatatt agcagaaata gagattttag 330241 gcgcaataga aaggaatgct gtaaccgggg tgggtgaacc ctcatagata tctggtgccc 330301 acatatatag agtcaaaggg aatatggagt ccgaggggga gggggttttc ttcggggctc 330361 gagagattga atagacccac aagtttgaaa ttcgccactg gggaattcac acgaaaggga 330421 acgaggaatt ctcaacgaaa aaagtgctct ctgaaccgaa cgtgaaagtt tttttccatc 330481 agacggctgt tcctgtaact ccactctcga cttggattat atatccaaaa aggtccgctc 330541 tcggccattc cacacgctgc ctagccgagg tgtggtttct gaagtggatt ctctcttttc 330601 tagtagatgc cgaacctgct gccccattga ctaagaagag gggcgggcgc agcagctaca 330661 tggacccctt cctttctgtt gttgccagcg ctttcccggg ccgagccgga atttttgatt 330721 attattctaa tgggcaggca aagccgcagg ctgctatccc aataggccag cgggcggttc 330781 gaggagaggc tccggcaatc atagcaccgc ggtcgaaaaa gcgtgctccc ttcagataac 330841 acgcaccgta gctatcctcg gccttcttcg ttttcgatgc aaaagaaagg aaatctctcg 330901 atctccgaaa gccttttcct 
taccccgccg catctccctc ccttcgtcga gcctttccgg 330961 ggttgttcct tccttatcag aacttggcgg gcattctttc cagccggggg atgggttttg 331021 tttgaacata ggctcaaaca taattttaag gtcctagcta cgccatgcgt tcaataccaa 331081 atctggaaag ctgcagaaat gacaaaaaga gggccgcttt cctatgttgc actgaaaaaa 331141 gaaggaccta agagcgtctt ctcttaggtt ttttcctccc gcgcccgccg ccgcctggcc 331201 cccgttagag ggaaggggaa gattagcaaa gcgaaaaaag acagaggaag ggggagcagc 331261 atcttattct cttcgcgaga acttccggat tgaagaagca ttcggaacgg gctccagcgt 331321 tcatttcatt ggcgggaaac gatccaatcc attcggggct tcggccaaaa acgaattcga 331381 cttgattcat agatagaatc aatgataaat aaacaaaaga tagatgttcg agatatcttt 331441 tctttctata aaaaaagagg aatagaatag acatcccttt ctttagatgt ctatctatcc 331501 ttccgatttt atatcgttat atctgctctc tcaatttttt tttagagaga gcagatagat 331561 cctatcccct atatcgaaca ctaattccta tctattgata ggaagatctt cgtctaatag 331621 cggactttgc ctttttttag gaatttctca tatccaaggc agcttaccac aagaacccca 331681 ccccatacaa ctagttgggg gggctgttcg ccttttgaat caaacaaaga ttgaagtagg 331741 taggggcttc atagctactt tcattctaaa ggaaagcgaa gaaccaatct ttagtcaata 331801 ggagccctac ttccctcgac ctcatcactt caattgttgc gcaaacctct ttcttagtca 331861 ccgggggagc gcacctttgt cacagtattt cgagctgttt gatagtgact tcttgcgttt 331921 ggtagggcga cgaaagggct acttccttca atctaggaaa cagccggaaa ctcgagaagg 331981 tcgcctttgg aacggttcgc cagtaaccaa agcctcactc cttccttttg attatgtcgt 332041 gacgctgact gaatccaatc tataaaaaaa cccggaaact caacaaagtg cgttcccttc 332101 gtcaagtagg taaagtaggc atacgaacca cttttgcttt gcggaataaa acaatgaatg 332161 aggcggaatg gagaaaggaa aggtctgcag gtatttatta ctcttataaa agaggaactc 332221 gagcttattg cgagaggtgc tttacgaact cgactaaaag gagaggggcg aagtcactcg 332281 agcacttgtc tagagaggtt gtgaacacaa actcgactga aaggagaggg acaagggcgg 332341 tcttgcttgg cgcgaaggct gctggtttgg gggtacggta ctaaaggtcc tcggacttcc 332401 aggcggtttt tattttgggc agctgttcac cgttggatct cgccaataca gccccctatg 332461 gtttttgtta ccgagatatc tttttttttc attgttccca gggatttttt gggtaatctg 332521 ctcccatgct gcaaacagtc aaatctgaac tcaaccttgt 
cgctcttctt tctttcctcg 332581 cgggctggaa gcacaccagc agcgtgcgtt gcgtgattct actgtgtttt ttgcacttga 332641 ctgggtggac agttgaccgg aagagaactt cgattcgccc ggccagcagg cgggcgacgt 332701 gcttatcttg tgtgacaaca ctacagagcg agtgactctt ctaggcgggc agctgtgtga 332761 caacactaca agggtttttt tcctcgtgta acatgctatg gtctcaatgc ccttacgagg 332821 tagtgatgag tttcactcgc tctaagcccg gcccgcgcgc ggttaggaag attggccgca 332881 atctgagcgg taccacccac cctaccctac cacctatagg cggccgtccg tcctacagcc 332941 gcccgaaaag gaactgcagt gatcttgaat aggaatccta cagcgataga cagaatcccc 333001 ataaaaatac cactagatcg agcaccagtg atttcgtatc cggtcaaaat cttggctaat 333061 tgatcgaagt gggtagctcc agtagaccca tagatcatgg aacaaatagg gtgggtaggc 333121 ccaaccacca cactacacgt atagacgcga accccccctc gttaccgtac gtgcgactct 333181 caccgcatac ggctcgcaca aagactccta aatccatccc gagccttttc ttcccacttc 333241 tcccttcagt cgagttacta aagtacctct ccaatcctcg atcaaacttg cccaggcgct 333301 attaaataaa tgggggtctt tccttcgcct atcgtatatt gtatgtatcg gcttggcttc 333361 gtccaaaact aaaacaagga ggcattccgg cgggctctag cgtgtaggaa ggccgaccat 333421 tacataagct aaaagactag gactataagc cagccggaag cgactcgctt ctattccccg 333481 ttgggattgg atatagatgg ggaatctatt gatcgtagta gttcgccaac ctctctaact 333541 aacatacgat caaattttcg gcacggccca aaaaaaagaa agagcttcgc tcggtgggcc 333601 ttcctacgct gacgaatgcc tcctttctct tctcttcagt ctacaacaag tgggagaggc 333661 aggattcgaa cctacgtaga aaaacttcaa cagatttaca gtctgccgct tttgaccact 333721 ctcccctttc ccttttctct taccttctgc cttctcccct tcccctagtt ccaatcgttt 333781 taggaaagtc ggcattcttt gtggaaagga cttataccat gaacggactc tttgccttgg 333841 gtcaatcagt tccttggctt actaatgacc acttcgagaa gaacctattt gaaaacagtc 333901 taatgccaca tctgactcaa cagctccttc ttccttttct tcttcataca agcccccttc 333961 aactatatgc aaccacttct tgaacaacct attcctattc atgcgatctt cagctcattc 334021 tatgccatct tacttatact attattaaac caagcttcaa ttgcacaagc cattctaagg 334081 aagattagac tggcatctgc ttagccctgt ctccctctct agctaattga ataccagctt 334141 tctcttatat tagtattata tgtcacttcc ttgtcctatt cttttagttt tagtatgcat 334201 
gggggtaagg agatattgat tacacagtag ttaggagctc atagtttcgt gcgcttacgg 334261 cgtgctcttt cttttatttt tatgacaaat ggccgccagt agacgacaat aggtgaggtg 334321 tccggtacgt gcttctaatg attattccag gctgagcact tcattcttat gcacccattt 334381 cacatgttag gtgtagctgg tgtattcggc ggctccctat ttagtgctat acatgggttc 334441 cttggtaact tctagtttga tcagggaaac catagaaaat gaatctgcta atcaaggtta 334501 cagattcggt caagaagaat aaacttacaa cattgtagct gctcatggtt attttggccg 334561 attgatcttc cgtttcaaca attctcgttc tttacatttc ttcttagcgg cttggccggt 334621 agtaggtatt tggtttactg ctttaggtat tagcactatg gctttcaacc taaatggttt 334681 aaatttcaac caatcagtag ttgatagtca agggcgtgtt attaatactt gggctgatat 334741 tattaaccgt gctaactttg gtatagaagt tatgcatgaa cgtaatgctc acaacttccc 334801 tctagaccta gctgctgttg aggctccatc taagagggaa cactgtaaca acaactagat 334861 cgggtctatt cccattcctt caccgaagtg gttgacgagc tcaaaagttc cttttttctt 334921 acttttcgtg gctgattaga tcactgaact tggttcactg acgctaagag aaggaaccat 334981 tggattattt atatatccct ccctttccac taggcttaaa tcggccagta aattcctggt 335041 ggaccaaata cagcaacctc acctttagcc atgcagaaga ggagattccc agactatgat 335101 gcctggcacc gatcaactct gaaagtcctg gattttttgt actctctctg aagaaactat 335161 tggatatgta tttagacttg caacagctca ggaggtctct ctttggctga tcatttcaat 335221 cattgttcta tcttctatct cccgtgactt ttctttgcta ataagctagc tctctgcaaa 335281 tgataagaaa ggtagaacta gggagataag agataagaga tttctttcca attccctttt 335341 ttggaggcca agcgagctaa atggtataaa aataactcgc cagtcaatgg gagtttgctt 335401 aggaaaataa gtgcaggtca ggttcggttc gttcctttca gcagtatcga ataagccaaa 335461 tgaaagtgag aaagcaggcc atctatccgt tgacttgccc cggagtgagc tgcatcgaca 335521 gaagagagaa gagagagtgg aagcggtaag tgaacttgac ttgaaagaga attcccactt 335581 gacgcccgaa cagcattttc agagggaaga tatcgaccat acatagggag agagctcaag 335641 ttagacggcg ccttagcgaa actttctcac tctacattca ttccctaatc acacatgact 335701 ttgaacgttg atttgacttc acatagcttt tcgtctcctt gatagctgga agagaacgtt 335761 gagaagctga tctgtctttc cgcgcagaat cctctttatc taaagtaaag agcagagttg 335821 cgatagagaa cctatttgct 
ttcttcgact cccttgcccc tgtccttgca gcaaattcaa 335881 agagttgtgt ggcattgact aactcactat tcctgcctga aggaaatgca tgagcgaatc 335941 tttcagttgc aattcccctt tccaaatctg aatcaaagaa cgggattaag gggaaataaa 336001 agtcggcgat gggactgatg ctctttcatt gcttcgagga tcacatttct tttcaggtca 336061 gaatctgatt gatgcgttgt aacatccact gactcaaaga actacccttt agtaaataaa 336121 tcccagggaa agtccattta tgttaaaatc tctagtgtag tggctggttc ctatacaact 336181 acatcataag aagaagccag gtctgaggcc agcgagctca agatcaggag ccaaagcgga 336241 ccgaagaagg cttcatctgc tccaagcccc taaagaagtg aagggacttg ctttcctaaa 336301 aaaaaatgaa atggaaagag ttcgtttctt cctcaaattg taaacatggg ttgtccaact 336361 ggaaatgata acagaatgca taggtcaaac gaaggagtcc caataagcgg atgaatatag 336421 tatacctatt tcctctatga gggagaaagc aaattgaaga acccgcggat atgtgcaaaa 336481 cttcaattct tcttagccaa ggctttccaa ctcttatagt agataatatc taaagcgata 336541 gagctggtgg atggaaatta aaatgtcctc cagaatgaaa gtgataaaat agataaagaa 336601 agggcgcaaa acgcagtcct ccaacggaat caatggcatt cctttcttac tagtccttac 336661 gaactctaca ttgcttacaa ggttgaatag cagaaagcct tagctttagc tacttcaatc 336721 tctcctttcc tcatattcga gcaaggaaag ctgtacaaag gaaaaggtat tctaaagact 336781 tgtttgcaaa aaaatgcaaa atctcttact tagaaaaagt acctgcagat tcctgcccaa 336841 aatcaactgc tattgaatct aagccaattg aatggaatca gatcatccgt ttctggtatg 336901 ttgggtgtca tcctatccag tactggatgc agtagaaggt gctctcgggc gaattcccgg 336961 ctaaaaagta cactgagtta ggcgagaatt gtctcgagat tgacagattg acttagggga 337021 attaggagtt gtttggaggt gggtaccacg gaagaggtag gttatccctg aaaaagggac 337081 tcggatagga atagagacta gaagctgaca gcaagctctt agtcttggct tcttcttcct 337141 ttattttatg gagacagggt tactaccata cttggagcac gaattcatat gtaagatgcc 337201 cagttggatc actaattcgt aaattgacaa cctctagaaa gggccttttt cccaaaagat 337261 tcctaattta atttcgtcga tattgaatac cttattgact ttgagactac taataaatag 337321 caacgaagct gactcgacca gtttacaact tcgaaagaag gagagcggag atctttcttt 337381 caatcgaagg gtcatgggtt ttttcatagg caccggactg aggtatgaaa tgaaggacga 337441 gtttcactcg cttcgggctc aagtttagat aaaagagtcc 
aatgagccca aatggaaatt 337501 cgatagaaaa ctaaagctga gaaagtcaca atccattgaa taagatcggc taaaggtgag 337561 agctcaaaaa agaaagaaaa gaagtcgctc acccgcctac gattgattgg agacttattc 337621 aataggtcct cctttagggg tctattttct ctgacttgac tcagcttgac ctacttgact 337681 cagcggttag agtatcgctt tcatacggcg agagtcattg gttcaaatcc aatagtaggt 337741 aaacccggcc gaaaccccag caagaaaacc tcctcaaaat atttccctgg caacctttta 337801 agggaacccg cccagaagct tcgatttgtc aaagctgcgc taaaagcctt gaattcaaaa 337861 gttgaaggtc gataaaagaa taagaaaggc catagaagaa ctagacgcca ttcaatccca 337921 gctatccttt ttccagttca atgaaaggct cgcagagcag aagagtatat tatcaaattg 337981 gatgagttga acggatacag aaaaggattg ggaccacaat atcaaagact tccacgctac 338041 actcacagaa agaagggcaa ggaagactat caatatcagt atcaagagaa taaaggatga 338101 gagtggaacc tgtggaagcc tacatcaaaa gaaaagacag gcggaactct actttattaa 338161 tctagtgggc acagctagca gagtggaggg cctctataat ataataggtt aggcttcttt 338221 cttctgggaa ttggcagttg ctttagcagc tgtgcatatc ttaactagtg ggcttagtcc 338281 gaaaagaata ttgcctgctc ttcgtagagc aactatgtca cttctgtcta ttagtgaaag 338341 gctaaggctc catccatccg aactttcgaa tgattcctct atcaaagtta tgaaataagt 338401 ttgttagaga aagctaggtt tccctaaaaa agggttttct cgctcagtgt catcgttccg 338461 gtgatcacct tcaggtcagg atactaaagt ggaatggatt gcatttcaat cttgcctata 338521 atgttgatta ttttccccag cgtatgtggt cttgaagagc ctttcagcgg cttaataaag 338581 gctggaattt acggaccgga ctactacgtg tcggttcctt ttcaatctaa aagagcaaaa 338641 gcctattctc ccacctacca gcagcctaat tccccctctt tccaatccta gctctacggt 338701 tgattcgaga aagaggaaga ggaccgaagg aacaacctaa gagaccaatc caaatccaaa 338761 cctcacctct tcgcttcttc aacaagaaag aagaagactc ccacctgtac gccgacgcag 338821 cactcgcggt tacttagcgg aattcacgga gcagtcacgg ataccaccat ctgggcagac 338881 tagaaattcg ctatcgcttt tctgtttttt aagattaggc taaaaaagtg gttagtagca 338941 ccttgacgtg ccatgatatt tactttgctt ctgtgaggcg gtaggctcga ttgttttcca 339001 tgtgatgata gaccctttcg cccataccgt gaccacgtga agcaggcgaa gagctttgtg 339061 tcaccatgtc aacttgtatg tgttcagcat atcatatatt cttttggata gttggacgga 339121 
agtcctatca acctacagga tctgacgctc aaaagtaccc acgcgtttcc tacgagtact 339181 tatcatggcg gtgtagcgcc cttggcaagc cctgcaaacg acaccactgg taccctaact 339241 tagtctaatc atagatctgt atgaagcact catttctgat tctaccaatt cccaagacgg 339301 aaagtacttt tgatcataca ttcgcacgtg ttctatggat gcagtggtcg atccatatgt 339361 cgtgcagagg caagggtatt caaatgtcat agttgagcca gctcatcagc atcaataaaa 339421 gaaccttttg tatatcaatt gcgtcatcac gactagcacg gttatcatag cttcaagatc 339481 tcatgcttcc atagctcagc tgctttctcc tttctctgca ctgtgccctt gcccagctta 339541 ctctggtgcc tctagggaag aaaagaggct ttaaaggttt tgctactata actataggag 339601 atgaactttt gcaattccaa tataggtctt acgaaaggaa atcgcgttgg taactttctc 339661 gctatgggag atggaatgga tttctcgata agcccactac tgcatgggat cggtatagat 339721 cgtgacgctc atctcaagat ctaaggagat gtaaatgtgt caaggaaatg gcatctgact 339781 ttaattccca ctattctata atagcactca atatggtgac gaaacaatgg tgaccaactc 339841 ccacaattga ggagcaagcc gggatccaag cccaagcagc caatgaaaaa gccacttgtc 339901 tctgtcaact tcatttcatt ttcgctctag gattgacatg caaagtaagg acttagatta 339961 gccgaggagc tgtggctgct ggactcgatt tgacttttgt taacgatcga ctaattaacc 340021 ctttatcaga tggaagcatg ggatagactt gtctgaaatg agctggtcat aataggaatt 340081 agtgtctgca ctgtctgctt tagcgtcgtg gtcccaccat ttggtggcgg aaagagagca 340141 tattcacgct tctatataag gacgagagcc ctggaccggg ttttgcatga cagcccatct 340201 cgcgacccta actaactttt gttctaaacg gggactccag tgcattctga agcaagtacg 340261 tatgctttta agccaaagga acgaaaactc ccgagcaaca ctatcggtag tcaaaggaag 340321 agagaaggtc ctacgatcat tcctaaacag aggagaggga gtagaaattg ctaaggattt 340381 tacagaaaaa tcacagatca aacttaacac agagaaaaat gatgtgactg tttatcggtt 340441 gagtatagag tcatctcgcg tagagtcatt tctaaaagta atgtagtcat atggtatcag 340501 ccagtagcac taaggagcat tctatagcat ggtgcatcgg ttgactatca agagtggtca 340561 agtccattga gaactatttc cgggtatagg agcatttcag ggtcaagatc caatcgaaga 340621 aagggcacag ggcatgccag tattccagtt cttagcttct tggcttgggc taatcacgag 340681 cctgggtcta gctaattggc tgcaggcacc ggtgatgaga ttgggcacct ttcgcgtttc 340741 aaattgaaag gattccagga 
ccagaatgaa ggtcgtcagt ccctcccgat cactgctctc 340801 gacgactcga cgaggtaagg tctcgagatg cgtgtccctc ttttcgtatt gattagcgcc 340861 cggacgatag tagtttgccc ttgctgtttc ttcgactgtt gctttgtcgc attgatgctt 340921 tccatcgctg ctttactccg cttgtcatct tgctcttgct gctccccatg tttgttcttt 340981 tgcattggta gctctttgta gtattccatg agttgggcta aaacgctacc ctccgttact 341041 ttgatgctag ataggtggga ttcgattgat tatgtgatgt tgccgctgct gctcctcgtc 341101 gcgatgaaca gaccaagtta ggctaacgca atcctagtca gaagacaaaa acttctcttt 341161 ttattcctat acttggacaa ataaggcgag tagaagaata gactgagatg cttcttcaaa 341221 acaaagaagt cacatcaata atgggactaa ccataaggga gttggtagta tcccattggg 341281 gggaagcagt agaatggcaa tcgcaaggtt tgacctcttt gtagctagcc tcccttctat 341341 gtgttagtga gcgcccattt agtggaaaag cgagggatca ggtggagtta tcccttgtag 341401 tcaaaacaca gtccgaaatc cgcctcattg ttttcgtcat cagcattgtc ttcggcatag 341461 ctttagccgt tagccctcca tctcgattcc ggaatttggt gcagacttgg agtgacttcg 341521 cagagggaag ggttgtggct ttgacagata aataggttgt cattagacaa atcggggagt 341581 taggcctggc acagctgaat ctagctaggg aactaagtag cagcgggttg cggcacaaaa 341641 gaagttgaca cttcagccat cggcacaatc caataatctc aagtcaactc agggtggcat 341701 agagtattag ccccttagct tttagtcgaa aagtttaggc ctgatgggaa tacagactac 341761 aaagtttgct cagcagcgga agcacttttt tttagttatc aaattagaga gtagtgccag 341821 gcaatgaagg cccggtttat gcgagcaaga tattcatacc tttcatacca tccgaaggct 341881 gaactaagag tgcatatgta ctatcttcta ttctttctat tctattggat tattggattt 341941 ctatcttcta tttcatttca acattccttc taattattgt tattggattt cagcaatcta 342001 agcagaaaag catgaagtta aggaaagcac ggaagaagaa aaggccaatg cgaagtagtt 342061 tgccaacctt tgactagtcc cattctatcg gttaaggagg ccaggaaaga tcctcaggta 342121 gaactcagtt tgggaacaaa aacttttccg agcccagcaa aataatcgca aggagcccag 342181 cttcaaacta atagcccagc tccataacag gcgagcagcc aacaaactat gacccgaggc 342241 aaggaacatt ccccatccag ctcagcccgg caagtgaaac ataaaacaca atttgcactc 342301 ttccagccga cttctcagca tagcaggacc aaccaatcag cacatttggc acttacgcca 342361 gcaaagcata tatgcaagtc aggctcaaga ttcacaccca 
aactaggctc agcagcataa 342421 agatgcccgt ggtatagact actcgactta gactaagaag gaaccgagaa agtaagccaa 342481 gcggcgtaga ccaataatca atgtagactt tgacacctaa caattgcact agattcaagg 342541 aaacctacca aggaacaatc acatcgggat tagcaagtca ggctccgcat tcgtactacc 342601 aagctaagca gaaagattac cggtaaatgg attcctttac ttagaagtcc actcacccga 342661 gcaaggaaga aaagcaatgc aaggaccaat ccaactaagg acaatcaaac aataccagca 342721 tgggagaaaa aggagactca tcacattcca tcagtcactc gagtcaaagc ataaagcaga 342781 gcaatcgtac tccactcaag aataatccca tcaaacaaaa atgggcgatt cacatttccc 342841 ttccctcgac ctggctctct tcaacttgtt tcgaccggct tctatattct gaatgtttca 342901 accaatttaa ctttcaccct tatcgttcta aatgatcttt ctctttgact tcgagtttca 342961 gtattcgtgt gacttatgct ccacttgatt agcgtagctg ccttagcttg ccccgacttg 343021 actgattggc tgggttcgct tcacctcgct cggcttagtt ggaggactac agaaagagaa 343081 gcatgacaag catgacgttc cttaattata tctatatata gttaactatc cgtatatcat 343141 agtaaggaac cgaaaagtag ttacccattc acgtacatcg gttgtgtcga atatttgtaa 343201 tcaatcggtc aaccagacat cacgtaacta ttcagatcca acttctagaa ataggtttca 343261 gtcagctcga tgaaatagac taatattgac catttatgtc agatccaata gcttgcctcg 343321 aatgccagca tgaaacctaa ttacgatagc ggtggcgaag acttgaagta atcacttttt 343381 cctggacaat cggccagcta aaggtcagac ccaataatag cataatttca cccatcagtg 343441 gttggtacca gtccgtaaaa gtccagcact taggtagcag ccccatcaac tgtacaatac 343501 cgacctaaca ggaccagtat ccaccataaa aaccgtagca taacaagcta ggttcccata 343561 gcatataaag gtagggccta gtctaataga ctgaagcact aatcacaagg aatcacaaat 343621 agcatcccat ggacaagtat ttcaccaaca tacagaggag cggaccaatt tcaacgagta 343681 gacaaatcca tttgaaaaga caactaaaga tttattaaat ttccggttaa cttatgatcc 343741 cggcctagct cgtcttttct tctataatag cttttggtca ccgctaatca tcaggtgacg 343801 ctggtcaaga tcagtactct tattaacttt ctgattggca ttagtataga atgggcgact 343861 aatataatag aagcatcaat ctaaaatcaa caatcacaag tatctacatg tttactcaat 343921 gcctggacca ccttaactac tatgtctagt cttcaaaatg cctggctacg ctataataaa 343981 ggcttgcaat gattgtgatc tcttattgta tttgatccta ttaaagtatc atgagaaagc 344041 
tacgaatgtt actatgcacg cataaaaaga agtgagtcga atatcttgag tgagaaatat 344101 ggtagcttta cgtgctagaa tggaccaata cattataaaa aaggatctct ctttgaaacc 344161 tttatagacc ccaggcttca aaccggacct tagaaccctc caagctcaag agtacttatc 344221 tagacatcgc aagtcaggct caacagtcgt actacaagct cagcagcaat atgaccatgg 344281 aatggagtca ttgagaagaa agggagatcc tacctccaat acgagaggga atcgtacgaa 344341 aggaacaaac tcaacttcaa gctcttagcc cagcagcaac aaactatggg attggtccga 344401 gacaaagaag acagaaacag cccggcccag taccaatatc atctaacgaa ttgtacgaga 344461 aagctaccaa tccagcccgg ctcagcaagg cgtggaaaat gaagcacatc gggtccacat 344521 gggacagcgc aagttagact caagattcgt accatccacg actgagtgaa gtaggacgaa 344581 agcaaccaat ctaactaaac tagatagaat aatgtttcaa ctaattaaac tataaacctt 344641 gttcgttctg aactatctcc tttcttcgac gttcactggg ttggcgtgac tggcttaaac 344701 tctctgaaac ctgctattat atcaaccgat gaactaacta tctcttacat atctctatat 344761 agaatagatt atctataatg tttgattctt agtctactgc ccgtttaatt cccttgactt 344821 ggccttactt cttttccttt cttcgcttgg gcgttcctat caaaagaatc tgtcatgcca 344881 aagacccaat ccatgaggta gtccaccgat ccagtaaacg atggaaatgc acaagtcctg 344941 aaagcataag accgaagtat gagaccgagt gacgagcaaa ttcagtattt ttgtttccca 345001 ggaaaaaagg aaaagggggg cgagagaagg aaaggggaca cagtggtgac aaggatgact 345061 caggtcagtc ttccacccat aagcaccatc aggaaaaaat aaaggaaggt agggacaagc 345121 ataagccacg tgagcttcat tctttcctat gcaatggacc tcactgggaa ggtaataaaa 345181 aataaaaaaa aaaagccatc cactctctca cagacaagct aagtgaggag gagccagatg 345241 aggagttggg atctatacag ctgatcaatg ccatggccaa atcaaaaggg ctcatccacg 345301 ttgaagtcgg aatcaatggg aagaacactc gggccatcat aaagaggtct caacgagcca 345361 ctactttctc tcagtagagg aggcaaagac aaacggattg aagcccaata aggaaagagt 345421 ctgcctcaag cctgtggaaa agcccgtaca aggtctgact cgtgcggtgg agctacacat 345481 gggtgattgg aagggcaaga tagacctgac cctataagta caatggttgg tccctccgga 345541 accggtaatc tccgtttgac ttcctttcag caggtgcgca gaagtattct ttcacaagtt 345601 tgacgcaagt cgtacctccc agccccctta gctacttgta ctaaaaaaaa ctaggacgga 345661 agttgctaat cagaggtggg 
cagggttcct ctccacactc tcaaccttaa ttatctttct 345721 cctctgtggc tatagcctcg ggttcgtatg gatgaatcaa ccgactgtgg aattttcttt 345781 gcctttgcat ctccgtcttt cgcgtgccat ggaaatttct ttgctttctt caacccttac 345841 ggacctttac tttattccta tctgtgtggt tactcctagt acgcactgtt taagtatccg 345901 gtttccttaa ggcctagatt caatcgtatt tttgatactt gggagatatc cgacgagtac 345961 ctgtttggca aaccaaggaa gctgtatact caacctctgt cccgaaagta gctttaaggt 346021 ctaatgaccc ggatcgagta cactgattca atctttccgg tcttaacacc tctactaagt 346081 aagcagctcc attcaatgac gatctgtcta aggagtcttc cgtcgtcgaa gtgattcatt 346141 tattctgtca gtgggctcat agatcaagga gtgtgcagca gcacattgtg ctaaaagccg 346201 gttatcgtgt ctgcctcttt cctctctagc cgcccttcag gctggtctat ccaaacaaaa 346261 agcagaaaaa gaaactctag ctagctaaca aacctttgct ttctctaaga ctccgggtga 346321 ttgtactggc taaccttttc agaggagaaa cgccgagggc atcaagagca gaataaaaaa 346381 caaggcatgt tcaatgagaa ggtatgggat aaagacccaa gggatattct aacgacagat 346441 acgttctttc gatacggaga aggggctcca aggttcgttg agaccaaacc aatgaaacga 346501 ttcggtagtt aacaatcttt cgattgagct gctatcttat tgaacgcaga gagtggctac 346561 tgcttcaaag ctttgaaaag gagctgccct tctctatccc gtactcctat tcttcttttc 346621 ctctaataat acaaagcatc tgtttgtagg taaacaggtc aagcactatc ttcagaacag 346681 tgagaagatg cctaaagcta agaataagaa cgagggtaag aatcgacgag gaatcaataa 346741 gatataagat aagtgaatga caaagcgtga gtataattct caacccgaga tgaaggagga 346801 ggtgttagcc tacttgctgc agctgtctgc gtcgctagtt ctgccggtag ccatttggct 346861 gattgctgcc ggacagattt ttacctgttt acgcggatat actatctcca attaccaaga 346921 aaaagtagag gagaagctct gctcgacttt agtagataag atttcagaga aactggccga 346981 tttatttccc gtatatggta tcacgccgtc aaggaacgca ccctttccta ctattctcga 347041 gcaattgctc gccacagttt cgcaagagga gcgcttggcc tacctctcca atatgtataa 347101 cagtttgata gaaatgggta tcgacagccc ctgtttttat cccatagttc aaacatttct 347161 ttttctaatg ggcggcggcg gcgggcccgc ttgacctcaa ttttccatac ccctagcccc 347221 cggatgtagt ccttgcttag tcttctagcg gcgaaccggg taaacaacgc gatccgaata 347281 aaaggaagtt cgctgggatt gtatttattc attttccaga 
ggtactggtt cgactccagt 347341 tcgcgactac cgtgatttca ttttatacga gcatcaaatc aaacctgtcg gctatttctg 347401 tttcatctct ttctaatctt cgttgcgtgc aaattcatat ttctcgcgga attctccatt 347461 ttatagatct cagcaaacaa atgtacaaag tggatcgcct agcaaacgtt gctgcctatg 347521 atatgccaac aggttcgaaa gaagctttct gggaagccat cgggaccagg ggatattctt 347581 ctcgaagacc gtgcttttga tctcgtcctc atctggcata gcagttagga aagcattgtc 347641 atcctcggat accactggag gaatcgattc caggagttcc ggatgggctt tcctggggct 347701 ggaggtgtac acaacttatc aaagaattgg aatatgtact aataaatcat gttggtcatc 347761 ttggatttcg tataaagagt ttttcgatct atgtatttga cccgagattt ttgcttccac 347821 atcctatatt gaagtcacaa cgcattatgg aggtcccttt tcagctgtaa tttatccctc 347881 tcccgatctc tttcttctcc tgctgttcgc ttcgcttctt tctttcgtta gcttcctttc 347941 ttttctcctg agaaaaagat tgtttgttgt ggcatgcatg actaagccac ctttcgatag 348001 cctcggactc tacattaatc tggttgggag taagacaggg ttagaagcag acaagaagga 348061 gcagaagagg atctttgtta tctgcttact ttatgtttga gtttgatctc tgacttgtct 348121 aactccatct gtactttagc attcactaat ccatcacttc cctatcttat ctatcttaaa 348181 tccaacttgg cagtttccct gaccacccat aagaaaaaaa aagatgatta aataggcata 348241 atctaaagga aataagttta ttagaataag aaagaaaaac aaaactaaag aaaggatttc 348301 ccaatctatc ccttgcacaa aagactactt agaacgctca taaaaaacac agaagaataa 348361 gtcatcgaag aagtaataat ggggtaggaa gaggtaaact aactttattg ctagtctttg 348421 tctttctagt gtcctcaatt gtccgcctct gaatggcttt gtcagagctt tctgtcgttc 348481 cgttccctca ggtcttgggg gagtagagct gcttctttca actcggagga acacacctca 348541 ttggagaact tgttgtagtt tcaggtatga ctaggaaatg caatatacgc taggtcaatc 348601 cattcctata aggccggtga gccgccgtta ggccgttaag gctaattaca tgaagggatc 348661 caagagaagg taggtatgga attggcttta gtgctctatt aggtaaaggg gtgctttagt 348721 agcctcccct cactctcctt gacccgtgga caaacgccac gtgatggacg gcgcccggct 348781 ctgacgaaga accagtaaga ttaggtattg cctaatccgt aaggttgtca tacccttatc 348841 agctttctga ccaatcctcc tatcctatgg gaatgctttc ctaaactctg attgctcttg 348901 ccgactcgga acgaaaagat agcgagggat tgatagattc ctcgttggat tgggagacca 348961 
actcggaaag aaagaaagag gcttggatgc tagggatagc ttgaagaacg agtaccggga 349021 gaaaggacct tcttttgagg aatgaggatt aggaagctta ctcgtagaaa attaagttgg 349081 cacctactaa accaagtgcc tgaaaagggg tagtgaaaaa ggagaaggca atctcggatt 349141 gttcagacac tgcctttggt tcattacccg ttgagaaggc agtagaagag gtagcattcg 349201 cagaccaaag caaagacggc tggggattct tcatagcata gcaagaagga agggcgtgct 349261 taatatataa ggactagtag aggcagtaga ggtccttcac catgaagatt gtatgctttc 349321 cttttttcac tcaacaatga aagctaaacg aaaccgattc caatctcttt tccattcaga 349381 aagagaatcg attatgtagc tcacagaagg ggtgatcaaa gaagttgact aagtgaatgg 349441 gaatccgata gctctaccag ccttggaaga gtagtcagag gcaattcgct aatatttaac 349501 cttttctatc atctttcttt cctcaagcat gaaacggagt tgagcggtag gcattccttt 349561 ccattacagt gggaaagcgt atgcgagtca tagagttctt ccagtaaggt ttgaggtcac 349621 aggtaccgat gcagtggaaa aggaaaacta ggatataaac aagacccatt tgtttcaagt 349681 ttagggcccc ctagtgttac aagctactgc ttgcgaagga aagaaagagt tgctaaagct 349741 ccatgaaaat agtctcctta agataaccat gcataaattt ctaaaagttc tgttaggttc 349801 ttagtagcag tcggcgacct tttcttcttt tacttcacat agcttttcgt ctccttgata 349861 gctggaagtt ctccaaaagt atgaaaagca ggaggacttt gtaccatcca ttccagtgta 349921 gttgaattca gttcaagagc ccaaggactc ggagcacatc ttttgttatt tccactgctt 349981 aaagtgattg ttacgaccac gaagaaacaa caaatcccaa ctacggatat ataagagcca 350041 aaactggaaa gggcattcca tccagcgtaa gcatccggat aatctggaat acgacgtggc 350101 atacctgaaa gccctaagaa atgcatagga aagaaggtca aattaacccc gaaaaaagtg 350161 atccaaaaat ggatttgacc taaagtttca gggtatgtcc gaccaaagat tttacccacc 350221 caatagtaaa atcctgcaaa taaagcaaaa acggctccca tagaaagtac ataatggaaa 350281 tgtgcaacca cataataagt atcatgtaga gcaatgtcta gccctgaatt tgccaggact 350341 attccagtga gtcctcctat ggtgaacaaa aagatgaatc ctacagcaaa taacatgggt 350401 gttttgtatt gtatcgaacc cccccacatg gtagcgatcc aactaaagat tttgattcca 350461 gtggggacag ctatgatcat ggtagctgcg gtgaagtagg cacgggtatc tacgtctaag 350521 cccacagtaa acatatgatg agcccaaaca agaaatccta agacaccaat actgatcatg 350581 gcataaacca tgcctagata 
cccgaagacc ggttttcccg aaaaagtcga aacgatatga 350641 cttatgatac cgaatccagg cagaatgaga atatacacct ctggatgacc gaagaaccaa 350701 aagagatgct ggtataaaat tgggtctccc cctccagcgg gatcaaaaaa ggttgtatta 350761 aagtttcgat cggttaataa catggtaatt gcccctgcca gtaccgggag tgataataaa 350821 agtaggaatg ctgtcactag aacggaccac acaaataggg gtaatctatg catagtcatt 350881 ccaggtccac gcatgttgaa gatagttgtt ataaaattga tagaacctaa aatggatgaa 350941 acaccagata gatgaagact aaaaattgct aaatcaactg ctcctccaga atgactggta 351001 ataccactta agggcggata gaccgtccac ccagtgccgc tacctacttc tactaaggct 351061 gagcttaata ggagcaagag acttggtggc aacaaccaga atgaaatatt atttaatcgt 351121 ggaaatgcca tgtcaggtgc acctatcaga atcggaacaa accaattacc aaatccacct 351181 atcatcgccg gcataaccat aaaaaagatc attaaaaaag catgagctgt tattaaaaca 351241 ttataaagtt gatgattccc accaagaatt tgatcgccgg gtcgtgctaa ttccatacga 351301 atcagtactg agaagcatgt gcccatcact ccagcaatgg caccgaaaat gaaatagaga 351361 gtccctatat ccttgtggtt tgtggagaac agccatcgaa ccagattttt cataaatttg 351421 agattctttc gtttattacc ttatcagaga ggggttaggt atttagtaac tctcgcgctt 351481 tcctggaacg ggttccatag tctacgagtt cttgagagtc ccgtggaacg gtcaaatttt 351541 tgctagtaaa ctagctcaag caggaaattc caacctagct gatggttttg ctaggaacgc 351601 tagctcatgg caggaaatca taccttgcct gattctcctg cgtaggagag cccaggccta 351661 ccattggatt cgaaccaatc agcatcgtcc tacaaaagat aatgctctca ccaattagag 351721 aagtaggaaa atacaacaaa cacatttcga cagaacataa ttattattat ataattgaat 351781 aaattctcag gaaaagagaa acagtggtgc aaactctact acaatagggt ttcacagatc 351841 aaaactttct tcctcgaatt gaagttttgg gagctacatt ctatattaag acaacccaga 351901 agccctagga ctatactcca acaagatcac ggaacggacg ttacgaccac tcaacaaagt 351961 ctcgacaata cacccaaata atcacaactt ttttcttcgc tacctttagt attcggataa 352021 acactcggac agtattgaca aatcgcgtgt gttaagcaaa tcacaagact tctcttttct 352081 ttgcctcatt tgaaacttct ttattaaacc tgacttcagt ctatcatttt aagaaaagga 352141 agactcttat taaacctgag ggaaagtata gagttatgac agaggccaca tttcatacac 352201 tttaggcagg tgaaacactt caaacaggaa acacttacag 
aggaacagga aacagctcac 352261 cttacagagt tacgcacaac agagaggcag cgcggggcag tcaaggaaca cattcggctt 352321 tcacatacaa ttactttact taccttaccc tacagtctta gcgaacctta cagatccttt 352381 tacaaacctg gaagcggggg aatacctatc aacgggggga cccatccgac tcttctatat 352441 atatgcccgg gggacacaaa gaacttcttt cgggatccca gtgcgagtca catcaatcag 352501 ctttctaaca accactcgta ctattttcac acaaaagatc cgccattgat cacttttcgg 352561 tcacaataca gacatttagc aacaggggag attagagact atcgaagtaa gcccgaccta 352621 ggctgaaacc agcagctaca accaagtcag ctgaaagcag tagctatatc taagctcagc 352681 tgaaaacagt gacacttcaa aagctagcgt tcccagttcc gatacaagac aaccaaggag 352741 gttcagaatt tagactatcg aaaggacctt tagccgctca tacgtaacgc ctcctcggtt 352801 acgccttccc tcgcacgggg cgccactact gagctacaaa caagctcggc agaagcagga 352861 ctaagtacgg ctgaaagcag gtataagact ttagacggag gagcccaagc taggctgaag 352921 gcagaccaag cggaggctga aggcatacat tcactcagag ctatacgcgt tacggcagct 352981 gctaccttcc ttcaaaggag ataccaaata gaggctgaaa acaaccaagt aggctgaaca 353041 gaacaatcta ggcagaagca gcagaccggg agtagatcct tagctctttg cccgggggcc 353101 ccatagaact tcttgagcgg aaactcgaaa cggggcggat ctacagaccc tctttcacgg 353161 tgacagtcca agtaccatca gacgggggtg caagtactca cattatcacg cggggcagag 353221 ggaccatctc atccagtcaa tacgggtcca ccacttcatc ctccccagcc atcgggtcag 353281 ccatgccaaa cagctaggac atctcaggac cggtcaacca gaggagatcc atcgccaaca 353341 ggggctccag tacaaacagt agggaggcgg atagaccatt tgtcagttcc aatagccaat 353401 cagtcaggct atttagcagt cgggacatcc ggaacattct actaaaaacc atacgataca 353461 gatttttatc tatttaaaac attggcatca tccgggagtg ggagtggtat acctcagact 353521 ttgaggacct gaaagcagca gaactagatt tggtggacct tcgtagcgga agaataagac 353581 tttcttgacc ttcgaagcaa cagttttacc tgacatccca gtggcagcat tctttgactt 353641 ggaggcccta agcgatcgtc ctacctcgaa aggagtggca gcagcggcag gaatacccaa 353701 gtcatagtct ttttcgggag agggctcggc atagtcttta tgcgacagta ccagcatgcc 353761 atagtcttta tgcgacagta ccagcatgcc atagtctttt atgcgacagt aacagcgtca 353821 tagtctttat tatgcggctt cagcagcacc cggttaggac ttcaaccaca gaagacccgt 353881 
ctttgccttc acgtcagaaa caacaacgca ttgaacttcg accgcttgac ttccaacaga 353941 agtggatgat tctttcgcct ttcttcgttg acctgacacg acattcaaaa tggagagtcc 354001 caatagtcca gaggtagcag cacgatcagg gaaccaaaat ggctgtctcg cgcttcgaag 354061 ccacagcatt tagagggccc ttggcacatc ttgcaatgac aacacgtctc accacccaaa 354121 cttggttttc gacggcgaat cagcaatgtg tgtcaagcaa ccaattctgc tcctaggcca 354181 gttctgatct taggaattcc ctgatccaga atatctgagc cggaaggtgg cgttcaaaga 354241 cttaaacgag tagtgggcca aagggctcta taatgatagt tcgccctcag cttttgtcca 354301 ggattggcat tctcaaagga gaaagatctg atcagggaag gattgatcga tcaagatcag 354361 gaagaggaga acgaggaaca attaacgaaa gaaggggaaa gggctaaagc atctgcttcc 354421 acttccgctg cagaagaagt tgggggaggt tacaaagccc ttatttcaac agactgttaa 354481 ccactttaat gcctttatcc ggctttgcaa caaagtgatc ttcgatcgca ctaaggcgca 354541 taagacagta cacttcaccg agattcacca agtaacccta cggaattgat gttgtgacac 354601 ttgaaacgca tgaaaggaat ctaggaacaa aacaaagagt tgcacccctt gcatttgaat 354661 gccggtatac tgctccgcca ctcagggcac actaccaaag gactccaatc ctcatcattg 354721 ggcaattcaa gacgcaagtg gtgtgtatca tctctagcct cctcggtgat gtttcatatg 354781 ggtcactaga atctccatct ggggccgaag aacgtacgat caaagacttc tgaagcgtcg 354841 tctaaaggct ccgatctaat aagttacgtg cgtgatgtcg atgagatgtc ggcaggagat 354901 gtcggcaaag aaagaggaag gcgcgcagaa ggtaggtaat agatctcata ccgaacgtga 354961 ccgagtccac ccaaccgtac acgatccttg agtcaagtat gcgaatgaaa aagaaggagt 355021 aattcgagca ggaagaagaa aggctgacgc tgcagcttca acttccgctg caaaagacag 355081 gggatggggg agtttcaaag ccattcttat aacggattac tgtaccttat atacctttat 355141 tggctttttc aactaaaagt gctccgctcc actaccactc ataaggttcc gccactacaa 355201 tagggcacag tcataagaga atcgagagct ttatcctggc tcaggcaaac ggaactacct 355261 taggtctcgc tgggaaccca gctcaggaac ctaaaccaca gtgctttctt aaggagttgc 355321 cgagtcctac ctcgctatca aagatactaa ctgaggaact cacctatgac ccaagggaag 355381 agcctagaag aaggtttctc acgttgatcg tcacccctgg gtctaagaga cccccaagct 355441 gtctcaccgg gatcgaatcg ggctcacctc ggatcaggaa cagctacaat atacgccgct 355501 taaacaagca gattaaagga 
aagagaccca tctatctaag gctaaaagaa gggaaagaag 355561 ccgtagctag tcgtagaagt cagtgaaggg gaacaagcaa gccgaaccgt aggcgagcaa 355621 agctttcatt caacagcaga ggaagacgta gaaagaccag caaaaagaat gaacacgtgc 355681 caagaaccca ttccattcac ggacgcgggg aaggaggaaa gtcaaggcac aagtttctct 355741 attaagaacc aaggccctac tgagctacat ttactcttta caccaagaac agctcagata 355801 tgcgaccccg gggactgggc tttggtcttc gaccgacggc tcctttttcg gcatatgaag 355861 caacaataaa taaaagagaa ctacagatga acgaagggaa gaggtagatc atacagttca 355921 aactcaaaca acaaagacat tctaacagtt gcctcgcttt catccttata agtggacaga 355981 ctcatccgca ttctccgctt tgaacagagt ttgcttcttc gttcgccttc cctcacacta 356041 ctgagctaca accaagctcg gcagaagcag gacagagatt taagaaccgt gaagtgagcc 356101 caaacaaggc agaaacagca gctgcaacca aataggctga aaacagtgcg gcagcactag 356161 ccgtccccag taacagcagc agtacgagca ttccttagga gggtcaaacg acggactacg 356221 tcctatcctc aggtgaatac ttttacttca gttggatact ttgatagttc actactacga 356281 taggggaata ctaggctagg gaatgggaac tacttgggaa gtcatctctt tacttaacga 356341 actaaccact cagggattca aatccacgat actcctacca tctgcttgat ccagttgaaa 356401 actttactcc gacttcacat cagcccttag ggtcaacaca gctcctctta tcctttactc 356461 tttctatgcc tggattccta tgcctgtttt acctcgagtt ggaattcagt gggagaaatt 356521 ccaacaaaat tcagtgggag aaattccaac aaaattcagt gggagaaatt ctccagaaaa 356581 cgaaaacggc gattccaact ctacaatcaa ggcagaacgc agggaggaga agagaataga 356641 ttatagacgg cggaccggac ccaagctagg ctgaaggcag gactctatta agtacggcta 356701 aaggcagact ctattaagta cgactgaagg caaaccaagt aagtgcggtt tcaaacaaag 356761 gtgaaggaaa ggtacgatcg gtggacaagg aatcggttga gatcgaaacc atattcgtag 356821 gacgcgtacc caatagcgtt tccaggggcc tcatcccggg ggcaagccgg ggcatcagta 356881 cgcgctaggc ctatcactcg gagcccgggc gatctactcc tactctttat ttagaggcgg 356941 gaaatcttac tcttcttgag cgggggatag accaatcttc ttcagcggga aatccaatcc 357001 ataccttggg ggaaatacat agcttggtag acaatccata gcttggtata caatcactct 357061 acttcaagtg ggctaactta ctcatctttt gggattggtc tttcttcaat gtataagtgg 357121 agtaaggtct ttttccccac cgaaagaaaa gaaggtttcc 
attcccattc cagtagtaag 357181 aaaagatcag aggcaccgtt gcctacttcg cgggatcgcc ttacgtagaa ggacttgtga 357241 tccactctcg gctaagtttc tatggtcggt cctctctcga ttaatagcta actggaaaga 357301 gaatcctctt ttatacttaa ttatcgatca agagggaagt tcctaacgtg ttcttaggaa 357361 cgaaggtctg tcgaaggcgg gcgaacgtgt ctttctatgg tacgctcggt acgccttgtt 357421 tctattgttc tccgatagct ctatcagtga ttatttaaag tagtgctttt tgtctgttac 357481 cggatatgta tagaagtttc cttatggtat tcctctccgg tgttgctgaa tacacgtcac 357541 cggttcgact cctatttata tttagtggtc catttattga ttgatgggcg aatggagctc 357601 gggatgctga aaatgtaaag gccaaggcct ctctcggttg ttgctttacc tgtcctatta 357661 gactagctag gccctccttc ggtatgtctt gttccttggc ttctccttgt tggaattcat 357721 ggttgattgt aacctaaggg ttgttatatc actgtccttc tgtgggggag aaagaggggg 357781 tgttgctagc tcggtaggca cacaggcgcg ggttccggtt gaagagtcaa attctccttc 357841 ttgtttttat tgtgagaatc tttctctata ctctatataa gatgccatct gttttcttta 357901 ctcgtttcct ccttagcttt gaatctatct gtgactgttc tccttcttct ctaaggtctg 357961 ttcctttgtt ccttctttgt gccttcattt gctttctttt gtttgactgt taacttaaag 358021 tatctacctt cttcctttgg tttcttgtag ttctatcagt tagttcttca atggttagct 358081 ctgcctgagc tggagttgtc cctccagcga gagttgcttt cttccttgct tgatcggtag 358141 gagttgccga gatagcaggt attagctgtt agccgtcgga ctcatttaaa ggagtcctat 358201 cgggctacac tagtctaagg gcagttgagt aaccgattcc tacgcttcct ccgcttacct 358261 ccgtcgctta actattaata cgtcgcttac ccgcctccgc ttacccgcct ccgtcgcttg 358321 ttatctgctt cccttagcta aaacgttagg cgatagtgaa agccttcact ctcttatatt 358381 tacgtacacg tgggaagaag acctagtaat tgagtgacag cattcctgga ccaattctat 358441 aaacagccaa tcaggcatat catggtacag gtgaaagaaa cttatccaat acagatctta 358501 tcacctgccc atcaacgctc aaccgtctgc tcctctcgtg cgtgtaggaa taatcaaaag 358561 aggagacagc atgcctatta ttaagaaccc aattctacta aaaacgataa caacgattcg 358621 aggttctcaa tcaaggcaga aagcaagaaa cccatccagc taggaggaaa gcaaatcaag 358681 gagaccaatc agacatctag attctatagt acgcgtatgt gtacgtaccc aggctaaaac 358741 agcaaatcaa acttggctat cgtttgagag caacctatcg ctaaacacaa accaatcctt 358801 
ttacacagtc cattgaaagg aagaaaacaa gggtggaaat tcatgttcct ttttacacag 358861 acctccaaaa ctcgaatagg ccaatcttac ttttcctcat tctagaggtt gcaagggaag 358921 aaaacaagat tggttcttta tgaaacctgg ttcaaagcaa cgaactccgc tctctcccgt 358981 gggcacattc ctattatagt aaacggggac ttctctcttt attcctatgc gtgctttact 359041 tgactcctct gaattcagaa caatgacttt taggctattt ttccccaccg gcaagaatga 359101 acttctatgt gttaagcaaa gaagaacaga agatctattc cagtagttag gtacaatctt 359161 ggacggccgg ggacctactt cgcgggaccg cctaaaagga cttggtccac acgcggcgaa 359221 gtttctttgg gcggcgtttt atcgattgat atctaaagtg actggaattc gaatcttctt 359281 atataaggaa ttatcgatta taaacttctt atataaggaa ttatcgatta taaacgagga 359341 tgaaagaagt taggaataaa aacccgatgt tgtgttctca tgcccggtcg aagtccaagg 359401 ccaacccggt ctgttgccaa tgagttgtga ggaagtccac cgtcaagtgc gggatgtgag 359461 gccacgtcaa ctgtgtatgt tgctgcgaag gtcaaccaag tcgaggaatg cgctgcggaa 359521 aaaataaaag tcactgagga tttgccttcg cggataccaa ttgcttggat ggatgcgttt 359581 gaaagcctcg agatgacttg cagaaaaaat gctttctagg tttgtcttgt gtctccgtaa 359641 caaatggtcc atttattgat taatgggcga acgacgggaa ttgaacccgc gcatggtgga 359701 ttcacaatcc actgccttga tccacttggc tacatccgcc cctaccctcc ccgcccttaa 359761 ttttgtttca ctgtcataaa acctattaaa caactcattt acatttgtat cttagcacca 359821 aactaaaatg aaattattct ctgcctaatc attaataatc attaatgaag gggcccgcct 359881 caccactcca cttccccttt ttatggaagc aaccaaactc accatgacaa gggccagcgc 359941 tcttacagcg aaaggctcct tttctctagc caaaaacatt atgccatcaa gccaagccag 360001 ctcccgaatg tctttaatcg agagaggtac agaatagtcg aagaaagtaa gcaactgcga 360061 caagaaagga cttataagat aacatggaaa atgagatttc acttctagtt atccgccaag 360121 gaaatgccgg aggaattcta tatatagctg agcgatgacc gtttcgttgt gcatcttgga 360181 tacagtaccg aaggtctgct ctactaaaac tttgtcataa tgagagcggg gatcatgaag 360241 attgaaacta ttcatgttgt tttctataaa ctccaaggct tctggtctta tggcccaggg 360301 tgattgagat caaattttga ttaggcgcgc atccggttcg tttaccaata aagtaaaaag 360361 tttattttgt aaatgtgctt tccgctccat tattataatg tcgaatagct cgttgtttag 360421 ggcactttga tagtgatgaa 
tgttgactgc gtggtccaaa taatctctga aaaaggtttc 360481 gtatttaccg aagtggaggc aatccatgaa ctaatttttg aggtttcgta tgcgcgaaaa 360541 gagtttcgct ctcaacttcg acgttctctg ccctatattg tccagccagg gatgtaaatc 360601 aagttggctg gttaaaaaca ttggtccagc ttaagcttga cgttgaatgg gaatttagag 360661 ttccaatcca gctagaagaa gcgttgttag aagcggcaaa agctatgaga ttcgaatcat 360721 aggcaaggag ttgtgtcaag aaaaaacaaa agacaatcga aatggacaat cgacggatgg 360781 caaaaagatc caataactga gaaagacaaa acaaagattc ttttcgataa aatacaaata 360841 agacaagata tatgagcaag aaatctgcca aaggtcttcg agattttgta tttatatgtc 360901 gctctattaa cagagccgtg tataccggaa aaaaaaataa aagcgttatc gggtttgaac 360961 aactgagttg ttataatgtt ctttttcatt ttaataaagt gactccgtcg cgatgttttt 361021 tttcctcttc ttatgaaata tgaaattttc attttatttc ttcaccgggc ttggaccatg 361081 tctcccgaac aatctcagta catatggcgc aagacgattc cacatatcga ggtcggaatg 361141 ggatcgggtg ttttcacgtc tcaccgtagt gcccggtttg tcttgatttc cgattgatga 361201 acaagaagga aaatggaact attggattta gttgtgactc gcccccggct acctgtccaa 361261 aggaccaaaa gcccgcccgc cgccagtggg tgggaaggaa gctagccccc tatcttggtt 361321 gggggaagag gaacgaagtc catcgcgaag gattcaatcc agccacaggt tcccctacgg 361381 ctaccttgtt acgacttcac cccagtcgaa gaccccaccg tggtatgcgc caataagacc 361441 accaaaagcc tttgtggcac tagtggtaca cagaagtcat gggtgatcat tggtccgatg 361501 cttcgggcga aaccaattcc cagggtgtga cgggcggtgt gtacagggcc cgggtacata 361561 ttcaccgcgg catgctgatc cgcgattact agcgattcca acttcatgtt cccgagttgc 361621 agagaacaat ccgaactgag gcaatctttc cggattcgct ccgccttaca gccttgcttc 361681 ccattgtaat tgccattgta gcacgtgtgt ggcccagccc ataagggcca tgcggacttg 361741 acgtcatccc caccttcctc cagtatatca ctggcagtcc ctcgtgagtg cggcacgcac 361801 ctttttgttt gtttcggagc cgttttggca gggcgtacta aacccactta cttcgtccca 361861 caccaccggg cggctcgcct gaatgccgag tctttctccg ccgccaactc gacgtcgtcg 361921 tcacctgggt caaaaacttg actttactaa acaagcgaga aaagcccttt cgcacttctt 361981 agtaaagcgc ggcgctagct gcaatcaaac tcaagcgcag actagaaagg gcttggaaag 362041 gcgccgtctc ccttcttact gagagcagag ctagttgctg 
tcactcaatt actaggtctg 362101 gcacgtcact cggctccttg gctcacttcg gttttcaagc ctttctcctt aggcgcatgt 362161 ctgagcaaca caagacgagg gtttcgctcg ttataggact tgaccaaaca tctcacgaca 362221 cgagctgacg acagccatgc agcacctgta tgaaagtgag taccatcccg ttaaggatag 362281 gttttgttgt tcatatgtca agggctggta aggttttgcg cgttgtatcg aattaaacca 362341 catgctccac cgcttgtgca ggcccccgtc aattcctttg agttttggtc ttgcgaccgt 362401 actccccagg cggagtgttc acgcgttagc tgagcccctg atctgcgtag accaagggcg 362461 aacactcatc gtttacggca tggactacca gggtatctaa tcccgttcgc tcccccatgc 362521 tttcgcaccc ccagcgtcgg tagggaccca gagagctgcc ttcgcttttg gcgttccttc 362581 gtagatctgt agatttcacc cctccacacg aaattccact ctcctctgtc tcactcaagt 362641 gaattggttt cgaaagcatt ccgccacttt ttggcgactt tcactttcaa cccgattcac 362701 cgcctacgtg ccctttacgc ccagtcattc cgaagaacac ttgccccccc cgttttaccg 362761 cggctgctgg cacggagtta gccggggctt cttcctcgag tcctgtcatg atcgcgcact 362821 cgacgaaaga gctttacaag cggcattgcc tttcttcact cacgcgatat tgctggatcg 362881 gctttcgccc attgtccaag attccccact gctgcccccc gtgggagtcc gggccgtgtc 362941 tcagtcccag tgtggctgat catccgaaaa gaccagctaa gcatcattgg cttggtcagc 363001 ctttacctaa ccaactacct aatactacgc aggctcatca aacagcgctt tttagctttc 363061 ttcaggattt ggcccgaact gttcggcaga ttcccacgcc ttacgcaccc gttcgccact 363121 ttgttctcaa ctcttcccgc ctcctgggcg agacaagcta ccttgagcta ggagcctctt 363181 ttccttctgc ctagctcccc gagaacaacg ttcgacttgc atgtgttaag catatagcta 363241 gcgttccttc tgagccagga tcaaactctt cttttgacta tgattgggcc ctgcagtggt 363301 agaacctcgt gaaccgggcg tactacttcc caaccttctg tggacctttc ttctcttatt 363361 caattccact ttgtttagtt tagtgatagt tagaggttag agaagagagc tagatcactc 363421 ctctaagcag ccttctgatt atatacgtat tattctatca atcgataagc aagggtaggg 363481 ttcctggatc ggtctcgacc agaagaccga gaggtacttt tgtcacgagc tggcttaacc 363541 cattccctta acaccaagcc tggataagca actgaaaagt caagtagaat ccctggcttg 363601 tgtaggcgag ctaaacacaa atctttcttt tcattttata tagatggaaa agtaagaaat 363661 cctattttcc ttcgtaagag cacatcttac atcgcaaact ttctctttat ataccgagga 363721 
tttgatgaaa taccatttta gctctatgga gccgtggtgg aaaagagaat tttcattttg 363781 tataccggcg atttatatca aaatggcatc aatctcactt tttcaaaata gttggctgaa 363841 gatgaaacat ctccctagtt gcctattcac ccagacaacc aacactttag gcatttacag 363901 gaaaaaaaaa cctaaccata gtcgtgacaa ccctagaatc aactcaaacc tttcgaccaa 363961 ctatgcccaa gctaagtcag ttgagagatc aagatccaat agcctcaact caggcccaaa 364021 cccattagag aatgccacgt agagaaccat cgggcacata aacttctata ttccacccga 364081 tgagactaat gctgtcactc gaaaccaagc tgctcaaact aatgctagct aggctgtgaa 364141 ctttgaaggg gtagatagca atccaagtct tatacccgaa atgctgcaaa taacttgaga 364201 agtgcctaga gagttcctac gaaaagatac tacatcatca tcgataagct gagctccacc 364261 cgcataagca tttccatgtt agagctgctt aaaactttgc ttgtgcgtag ggatgctttc 364321 gtgagagaag tgctactctc ggtttggatc aaaagaatat actgctgaac ctattgactg 364381 actactttat gcaactactc gctataaagc cttaaagcta gctctcacta acaaagccag 364441 tgaaatctct attagcccat agggtatact ctcaaccatt acttgccctt tactagattg 364501 tctgaaactt gtcttattgc tgctttgtct taggataacc accctgcaaa ggcatgggga 364561 aagctatagg catcagcaga aaggcaagca aaggcgtagg tagccaaatg gtctagtccc 364621 tgccagagaa caactgcagc tgagacgaga gctgacaagt atgtaacacc tacttccacc 364681 ccatcgggga gtgaaataga acatgaaacc gtaagctccc aagcagtggg aggagccctg 364741 ggctctgacc gcgtgcctgt tgaagaatga gccggcgact cataggcagt ggcttggtta 364801 agggaaccca ccggagccgt agcgaaaggc gagtcttcat agggcaattg tcactgctta 364861 tggacccgaa cctgggtgat ctatctatga ccaggatgaa gcttgggtga aactaagtgg 364921 aggtccgaac cgactgatgt tgaaaaatca gcggatgagt tgtggttagg ggtgaaatgc 364981 cactcgaacc cagagctagc tggttctccc cgaaatgcgt tgaggcgcag cagttgactg 365041 gacatctagg ggtaaagcac tgtttcggtg cgggccgcga gagcggtacc aaatcgaggc 365101 aaactctgaa tactagatat gacctcaaaa taacaggggt caaggtcggc cagtgagacg 365161 gtgggggata agcttcatcg tcgagaggtt gtttacatac tcgactaaaa ggagaggtcc 365221 aaaatcgact gataaaggag aggtccggag gtattaacta gttgatgagg tctcgacgag 365281 cgactgaaag gttcacctag tgctcgagag agaattgggt cgcgcacctt ctgaacccga 365341 ccaccaccta taagacgacg 
tcaagaaagc caactctcag gccaagagaa catacctcgc 365401 ctaatagact ggaggaggtg tcccgtaccc ttagaatcgt ctaataaaaa aaatgtgctt 365461 accttcatga gagctgacat tacttctttt gatatcgatg ccttagtcag gctaactata 365521 tgcgatgcgt caaacataaa atgaaattca tcctaccgaa accagaccag agaaaagagg 365581 atctccggag ctcaaccatc gaatcgggaa agcactcttt ccagcttaaa acagagcaca 365641 gaaaggacac tcaaagcagg tccgtctgta tttcattata gtagtgatcc ccttctcttg 365701 atcaattaga caagaacttt actttgttaa agacataaaa ggaggtccca ctatgactat 365761 tagcggttcc tagcaggcat gggagaagag caggaagaga aagaaaaaag agacaaagat 365821 agactagcca cttccttatt atacgaaaag tttttgattg atgagttctc accttctctc 365881 atggagtagg tagatgagac agaagggaaa aggagtgacc cctactatga ctcatctcag 365941 ttactcttca tcacaggaca agatacgagc agaatcagag gaaaatcgaa caactacaag 366001 ctaagatgaa aaatctaagt gaaagactaa ctgataggcc ctggctggac cgatggaatg 366061 tttcaagtaa agggctaact cttaactagg ctagcgaagg aactgacatt agggaaagaa 366121 ctcctgactc tcggggaaag atctctctct ttaatacgag attccctaac cctgaagcat 366181 ttagcaaact cagcaaaacc tgaatgggga tcaaggactt ttgataactt attttgaccc 366241 ccaatccacc caaagcagat tcatagactt tggcgacttc ctgatcggca atgacaacat 366301 catcaccgag taccgcgtac gatgtaaagc gtacaccagg atgcacctgt ttcgcacacc 366361 accacactaa tatatggtgt gatagcgcga aagtaggcca agaaccgtga tatcccaatg 366421 gctgccctgc cacaaagcat acttgagaga accttcgttt aagtttaaca aaaggcacct 366481 caaagatatt gcatgcaaat gcagaattaa ccacacttga ggcaaaggag cggtcaaata 366541 ggtactgcac cacctcaaac agaaagacta aaggccaacg atcagtggcc gactttaagt 366601 caaaggagaa agagtgccta ctgccaacca gtcgatcaaa aggctgtgtt tggttaaaag 366661 tcccatcttg ggggagacgt cttctccgcc ccccaaacat ggaaggggtg taacaaccta 366721 ttattaacgt agttcccaat ggcgaataga aggcttttgc cacctccctc aacaacctga 366781 cctagtcggc ccatcctccg aggtttttgt tctgctgata tgatagctct tttaaagaag 366841 aggtctcaac gagcctcctg aatttaattt catccataca ggagaatgca tcattcttat 366901 cgcagaatgg aaacaaaccg gatt // bug2473.fasta100644000766000024 423614006427353 16516 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data# fasta34 -Q -H -E 10 -m 9 -b 1 -d 1 -r 15/-10 -g -12 -n -U ../testfiles/NAC_miRNA.fasta ../databases/cotton/CGI8.fasta FASTA searches a protein or DNA sequence data bank version 3.4t25 Sept 2, 2005 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query library ../testfiles/NAC_miRNA.fasta vs ../databases/cotton/CGI8.fasta library searching ../databases/cotton/CGI8.fasta library 1>>>total:39860_L:12096_-3:12346_0:617_+3:14801 - 22 nt (forward-only) vs ../databases/cotton/CGI8.fasta library 43526801 residues in 55673 sequences Expectation_n fit: rho(ln(x))= 23.6457+/-0.000561; mu= -11.6380+/- 0.037 mean_var=607.0983+/-151.124, 0's: 0 Z-trim: 2 B-trim: 0 in 0/43 Lambda= 0.052053 FASTA (3.47 Mar 2004) function [optimized, 15/-10 matrix (15:-10)] ktup: 2 join: 213, opt: 198, open/ext: -12/-12, width: 16 Scan time: 2.700 The best scores are: opt bits E(55673) %_id %_sim bs alen an0 ax0 pn0 px0 an1 ax1 pn1 px1 gapq gapl fs BE054209 similar to PRF|NP_974632.1|42573071|N ( 510) [f] 286 25.6 19 0.762 0.952 286 21 1 21 1 22 471 491 1 510 0 0 0 >>>total:39860_L:12096_-3:12346_0:617_+3:14801, 22 nt vs ../databases/cotton/CGI8.fasta library >>BE054209 similar to PRF|NP_974632.1|42573071|NP_974632 pfkB-type carbohydrate kinase family pr (510 nt) initn: 286 init1: 286 opt: 286 Z-score: 111.0 bits: 25.6 E(): 19 banded Smith-Waterman score: 286; 76.190% identity (95.238% similar) in 21 nt overlap (1-21:471-491) 10 20 total: UUGGACAGAGUAAUCACGGUCG :::::..:::::: .:.:::: BE0542 GAACUNUCUCAGUCAAGUUUAUUAUCUGCAUUGGAUGGAGUAAAUAUGGUCUACUUUGAUGGAAGACAUC 450 460 470 480 490 500 510 22 residues in 1 query sequences 43526801 residues in 55673 library sequences Scomplib [34t25] start: Sat Mar 22 16:31:47 2008 done: Sat Mar 22 16:31:49 2008 Total Scan time: 2.700 Total Display time: 0.000 Function used was FASTA [version 3.4t25 Sept 2, 2005] U71225.gb.mac100644000766000024 616514006427353 16262 
0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataLOCUS U71225 1164 bp DNA linear VRT 27-NOV-2001 DEFINITION Desmognathus quadramaculatus 12S ribosomal RNA gene, partial sequence; tRNA-Val gene, complete sequence; and 16S ribosomal RNA gene, partial sequence, mitochondrial genes for mitochondrial RNAs. ACCESSION U71225 VERSION U71225.1 GI:2804359 KEYWORDS . SOURCE mitochondrion Desmognathus quadramaculatus (black-bellied salamander) ORGANISM Desmognathus quadramaculatus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia; Batrachia; Caudata; Salamandroidea; Plethodontidae; Desmognathinae; Desmognathus. REFERENCE 1 (bases 1 to 1164) AUTHORS Titus,T.A. and Larson,A. TITLE Molecular phylogenetics of Desmognathine salamanders (Caudata: Plethodontidae): A reevaluation of evolution in ecology, life history, and morphology JOURNAL Syst. Biol. 45, 451-472 (1996) REFERENCE 2 (bases 1 to 1164) AUTHORS Titus,T.A. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) Biology, University of Oregon, Eugene, OR 97403, USA FEATURES Location/Qualifiers source 1..1164 /organism="Desmognathus quadramaculatus" /organelle="mitochondrion" /mol_type="genomic DNA" /db_xref="taxon:52105" rRNA <1..638 /product="12S ribosomal RNA" tRNA 639..706 /product="tRNA-Val" rRNA 707..>1164 /product="16S ribosomal RNA" ORIGIN 1 ggcccaaagg gtagttttag gtgaaataaa atagaattta aaatttatct agtagttata 61 tataaacata aaatgtaaaa tcaaaaacga aagtcatact atataacctt gaatctacta 121 cagctgagaa acaaactagg attagatacc ctactatgct caactttaaa atggaccttc 181 ccgccagagc actacgagcc acagcttaaa actcaaagga cttggcggtg ctctacaccc 241 acctagagga gcctgttcta taatcgacac tccccgataa acctcaccac ctcttgctaa 301 tacagcctat ataccaccgc cctcagttca cccttcaaaa gaataatagt gaacaaaata 361 atttaaaata aaaaagtcag gtcaaggtgc agcaaatgaa gtggaaagaa atgggctaca 421 ttttttatag taaaaaatac ggaatattct atgaaataaa atataaagga ggatttagaa 481 gtaaaaagaa aaaagagtgt tctttttaaa ttggcaatag agcacgcaca caccgcccgt 541 caccctcttc aaaattaaat 
aaactaaata aatatataaa tttataagaa aaggtaagtc 601 gtaacatggt aagtctaccg gaaggtggcc ttggatatcg aagtatagct taaataaagc 661 attttgctta caccaaaaaa atatttgtta acccaaatta ccttaaattt taaatctatg 721 ctaaatataa aatactactt cctaatacac aaaacattat tatatgatag tacgggcgac 781 agaaaactta ttagcgcaat agaaaaagta ctgtaaagga aagatgaaat aaaattgaaa 841 taaaataaaa atataaaaga gcaaagatta taacttttac ctttagcata atggtctagc 901 cagtctatat taacataaag aattttagtt atataccccg aaaccaggcg agctacccta 961 aaacagcaat atatgagcga actcttctct gtggcaaaag agtgagaaga atttttggta 1021 gaggcgaaaa accaaacgag cccggatata gctggttact tgagaatgaa ttttagttca 1081 attaaaagca taaatattat aaaaacataa cgcttttatt ataattaatt gaggtacagc 1141 ccaattaata aaggaaacaa ccta // longnames.aln100644000766000024 2044314006427353 17076 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataCLUSTAL W (1.82) multiple sequence alignment Spar_21273 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA YOR262W -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Smik_Contig1103.1 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Skud_Contig1703.7 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Sbay_Contig635.43 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHSQVVNMDPA Scas_Contig692.20 -------------MPFAQIVIGPPGSGKSTYCN------GCSQFFNAIGRHGQIVNMDPA Sklu_Contig2277.4 -------------MPFGQIVIGPPGSGKSTYCN------GCSQFFNAVGRHAQVINMDPA AAL117C -------------MAYGQIVIGPPGSGKSTYCN------GCSQFFNAIGRHARIVNMDPA SPAC144.07c -------------MPFCQVVVGPPGSGKSTYCF------GMYQLLSAIGRSSIIVNLDPA MG06110.4 -----------------------------------------MGFLGAIGRACSVVNLDPA NCU09745.1 ----------------------------------MTSPLPVQQFMGAIGRQCSVVNLDPA FG05298.1 -------------MPFAQLVLGSPGCGKSTYCDGIQLTGQVHQFLGAIGRACSVVNLDPA AN2438.1 ------------------------------------------------------------ 183.m01790 MDDKELEIPVEHSTAFGQLVTGPPGAGKSTYCH------GLHQFLTAIGRPVHIINLDPA Spar_21273 NDALPYPCAVDIRDFITLEEIMQEQQLGPNGGLVYAVESLDKSIDLFILQIKSLV--EEE YOR262W 
NDALPYPCAVDIRDFITLEEIMQEQQLGPNGGLMYAVESLDNSIDLFILQIKSLV--EEE Smik_Contig1103.1 NDALPYPCAVDIRDFITLEEIMQEQQLGPNGGLMYAVESLDKSIDLFLLQIKSLV--EEE Skud_Contig1703.7 NDALPYPCAVDIRDFITLEEVMQEQQLGPNGGLMYAVESLDNSIDLFILQIKSLV--EEE Sbay_Contig635.43 NDALPYPCAVDIRDFITLEEIMKEQHLGPNGGLMYAVESLDKSIDLFILQIKSLV--EEE Scas_Contig692.20 NDALPYPCAVDIRDFVTLEEIMQEQQLGPNGGLMYAVESLDESIDLFILQIKSLV--QEE Sklu_Contig2277.4 NDSLPYPCAVDIRDFITLEEIMTEQQLGPNGGLMYALESLDKSIDLFVLQIKSLV--QDE AAL117C NDSLPYQCDVDIRDFITLEEIMNEQHLGPNGGLVYAFESVEHSLSLFALQIKTLV--KDE SPAC144.07c NDFIKYPCAIDIRKVLDVEMIQKDYDLGPNGALIYAMEAIEYHVEWLLKELK-----KHR MG06110.4 NDHTSYPCALDIRNLVTLEEIMGDDNLGPNGGILYAIEELEHNFEWLEDGLK-----ELG NCU09745.1 NDHTNYPCALDIRDLVTLEEIMADDKLGPNGGILYALEELENNMEWLENGLK-----ELG FG05298.1 NDHTNYPAALDIRSLIKLEEIMKDDKLGPNGGILYALEELEHNFEWLEEGLK-----EFS AN2438.1 ---------------------MSEDQLGPNGGVLYALEELEENFDFLEEGLK-----ELG 183.m01790 VPNPPYPCSINITELITLESVMEEYNLGPNGAMLYCIEFLEANFDWLVERLDEVLAEEGG : .*****.::*..* :: .. : :. : Spar_21273 KAYLVFDCPGQVELFTHHSSLFSIFKKMEKELDMRFCVVNLIDCFYMTSPSQYISILLLA YOR262W KAYLVFDCPGQVELFTHHSSLFNIFKKMEKELDIRFCVVNLIDCFYMTSPSQYISILLLA Smik_Contig1103.1 KAYLVFDCPGQVELFTHHSSLFNIFKKMEKELDMRFCVINLIDCFYMTSPSQYVSILLLA Skud_Contig1703.7 KAYLVFDCPGQVELFTHHSSLFSIFKKMEKELDMRFCVVNLIDCFYMTSPSQYVSILLLA Sbay_Contig635.43 KAYVVFDCPGQVELFTHHSSLFSIFKKLEKELDMRFCVVNLIDCFYMTSPSQYVSILLLA Scas_Contig692.20 KAYLVFDCPGQVELFTHHSSLFKIFKKLEKELDMRFCVVNLIDSFYITSPSQYVSILLLA Sklu_Contig2277.4 HAYVVFDCPGQVELFTHHSSLFRIFKKLERELDMRLCVVNLIDCFYITSPSQYVSILLLA AAL117C NAYLVFDCPGQVELFTHHSALSKIFQQLVRDLDLRVCVVNLMDSIYITSPSQYVSVLLLA SPAC144.07c DSYVIFDCPGQVELFTNHNSLQKIIKTLEKELDYRPVSVQLVDAYCCTNPSAYVSALLVC MG06110.4 DDYILFDCPGQVELYTHHNSLRNIFFKLQK-LGYRLVVVHLSDSICLTQPSLYISNLLLA NCU09745.1 EDYVLFDCPGQVELYTHHNSLRNIFYRLQK-LGYRLVVVHLSDCFCLTQPSLYISNVLLS FG05298.1 EDYILFDCPGQVELYTHHNSLRNIFYKLQK-IGFRLVSVHLSDSFCLTQPSLYVSNVLLS AN2438.1 EDYIIFDCPGQVEIFTHHSSLRNIFFKIQK-MGYRLIVLHLIDSYNLTLPSMYISSLILC 183.m01790 
NGYVVFDTPGQAELWTNHDSLKNVVEKLVK-MDYRLAAVHLSDAHYITDASKFISVVLLA . *::** ***.*::*:*.:* :. : : :. * ::* *. * .* ::* :::. Spar_21273 LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSSVLGKKYS YOR262W LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSSVLGKKYS Smik_Contig1103.1 LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSSVLGKKYN Skud_Contig1703.7 LRSMLMMDLPHINVFSKIDMLKSYGELPFRLDYYTEVQELDHLEPYIEKEGSSVLGKKYS Sbay_Contig635.43 LRSMLMMDLPHINVFSKIDKLKSYGELPFRLDYYTEVQDLDYLEPYIEKEGSGALGKRYS Scas_Contig692.20 LRSMLMMDLPQINVFSKIDMLKSYGELPFRLDYYTEVQDLDYLQPFIEKESSSVLGRRYS Sklu_Contig2277.4 LRSMLMMDLPHINVFSKIDLLKSYGELPFRLDYYTEVQELDYLKPHIDKEGSSVLGRKYS AAL117C LRSMLMMDLPHINVLSKIDMLSSYGDLPFRLDYYTEVQDLEYLQPHIEREHKGAKALRYR SPAC144.07c LKGMLQLDMPHVNILSKADLLCTYGTLPMKLDFFTEVQDLSYLAPLLDRDKR---LQRYS MG06110.4 LRAMLQMDLSHVNVLTKIDKVSSYDRLAFNLDFYTEVHDLSYLLPELEAENPSLRSEKFA NCU09745.1 LRAMLQMDLPHINVLTKIDKISSYDPLPFNLDYYTEVQDLRYLMPSLDAESPALKKGKFT FG05298.1 LRAMIQMDMPHINILSKIDKVADYDELPFNLDYYTDVDDLTYLTPHLETESPALRSEKFG AN2438.1 LRAMLQMDLPHLNVLTKIDNLSNYTSLPFNLDFYTEVQDLTYLLPHLEAESSRLSHEKFG 183.m01790 LRAMLQMEMPHLNVLSKIDLISTYGELPFDLSYYTEVQDLSYLLGSLDSDPR---TAKYH *:.*: :::.::*:::* * : * *.: *.::*:*.:* :* :: : :: Spar_21273 KLTETIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ YOR262W KLTETIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Smik_Contig1103.1 KLTDAIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Skud_Contig1703.7 KLTETIKELVSDFNLVSFEVLSVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Sbay_Contig635.43 KLTETISELVSDFNLVSFEVLAVDDKESMINLQGVIDKANGYIFGAS---EVGG------ Scas_Contig692.20 KLTETISELVSDFNLVSFEVLAVDDKQSMINLQSVVDKANGYIFGAS---EVGG------ Sklu_Contig2277.4 RLTETISELVSDFNLVSFEVLCVDDKQSMINLQSIVDKANGYIFGVS---EIGG------ AAL117C RLTEAIGEVVSDFNLVAFEVLCVDDKQSMINLQSAIDKANGYIFGAS---EVGG------ SPAC144.07c DLNKAICELVEDFNLVSFEVVAVENKASMLRVLRKIDQAGGYAYGST---EIGG------ MG06110.4 KLNRAVANLIEDFGLVRFEVLAVENKKSMMHLLRVLDRANGYVFGGA---EGAN------ NCU09745.1 
KLNEAVANMVEQFGLVSFEVLAVENKKSMMHLLRVIDRASGYVFGGA---EGTN------ FG05298.1 KLNEAIANLIESYGLVRYEVLAVENKKSMMHILRVIDRAGGYVFGSA---EGAN------ AN2438.1 ALNNAIITLIEEFGLVGFETLAVEDKKSMMNLLRAIDRASGYVFGPA---EGAN------ 183.m01790 KLNKALVELIEGFSLVGFQTLAVEDKESMLNIVRLVDKMTGYIFIPSGDLEGTNAINTQA *. :: ::. :.** ::.:.*::* **:.: :*: ** : : * . Spar_21273 --DTVWAEASREGA-LLANYDIQDRWIDNKEKYDKEEEEKRAALLKEQELQNKAVDVNEE YOR262W --DTVWAEASREGA-LIANYDIQDRWIDNKEKYDKEEEEKRTALLKEQELQNKAVDVNEE Smik_Contig1103.1 --DTVWAEASREGA-LLTSYDIQDRWIDNKEKYDKEEEEKRVILLKEQELQNKAVDVNED Skud_Contig1703.7 --DTVWAEASREGA-LLENYDIQDRWIDNKETYDKEEQEKRASLLKEQELQNKTVDVKEE Sbay_Contig635.43 --DTVWAEASREGA-LLASYDIQDRWIDNKEKYDKEEQEKRAAMVKEQELQNKEVNVDEE Scas_Contig692.20 --DTVWAEATREGA-MMVNYDIQDRWIDNKEKYDEEERKRQEEQAKEQNMQEKEVDVDNE Sklu_Contig2277.4 --DTVWAEATRQGS-AIANYDIQERWIDNKDMYDREEQEKREQLLKEEELQNKEVDVDKG AAL117C --DTVWAEATRQGT-AAIEYDIQDRWIDNKDFYDKEEEARRKKLLEEHELLEKEVDVNQD SPAC144.07c --DAVWVNAVRQGGDPLQGISPQERWIDKKEEYDK----------YEWELEQK--STMDE MG06110.4 --DTVWQVAMRNEGSLMGVQDIQERWIDNKEAYDEMEQREWEEQVKAQEAMAEADAAAAE NCU09745.1 --DTVWQVAMRNESSLPDALDIQERWIDSKEEYDEMERKEEEEQEKLRAEQARAAEEAGL FG05298.1 --DTVWSVAMRNESSMLGVQDIQERWIDQKVEYDQMEREAEEEQARIQEEQAMEMEQSQP AN2438.1 --DSVWQVAVREGMGSMDIRDIQERWIDAKDEYDELERRQREEEIKNHQQAATYQAGNED 183.m01790 LFGSAMSSAKLTGRAGGDVRDVQERWMDNKEAWDEWEKKEWKREAEIRAQMGTGIPEGMK .:. * . *:**:* * :*. 
Spar_21273 DEWENALKEWEEKQGTDFVR-------------- YOR262W DEWENALKEWEEKQGMDFVR-------------- Smik_Contig1103.1 DEWESALKEWEEKQGMDFVR-------------- Skud_Contig1703.7 DEWENALKEWEEKQDTEFVR-------------- Sbay_Contig635.43 DEWENALNDWEEKQGTDFVR-------------- Scas_Contig692.20 DEWEKALKDWEEKQGTGYVR-------------- Sklu_Contig2277.4 DEWENALKEWEEKQGMSYVK-------------- AAL117C DEWERAVKEWESQHSVNFVK-------------- SPAC144.07c DENEG----------------------------- MG06110.4 -EG-----DDDLMGGPGAR--------------- NCU09745.1 GDGSVPGVAPQFTSGSGIRVTLSLVAAFTKYSDL FG05298.1 PPAPTGGMDPDFGDMTVPKDSGIKVVRK------ AN2438.1 DDDDN---DYEFGRRMPVPDSGVKVMRK------ 183.m01790 GGED--------AESTGI---------------- biorecipe.nhx100644000766000024 2015414006427353 17076 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data((((GUITH:45.260595[&&NHX:S=Guillardia_theta_nucleomorph_only:T=55529],((((((SORBI:5.5511156[&&NHX:S=Sorghum_bicolor:T=4558],ORYSA:4.3949497[&&NHX:S=Oryza_sativa:T=4530]):8.209326[&&NHX: B=100],((POPTR:7.743716[&&NHX:S=Populus_trichocarpa:T=3694],VITVI:7.4273393[&&NHX:S=Vitis_vinifera:T=29760]):1.7993087[&&NHX:B=100],(ARALY:1.3226165[&&NHX:S=Arabidopsis_lyrata:T=59689],ARATH:1.2519382[&& NHX:S=Arabidopsis_thaliana:T=3702]):10.38364[&&NHX:B=100]):3.0587217[&&NHX:B=100]):17.755212[&&NHX:B=100],OSTLU:34.956257[&&NHX:S=Ostreococcus_lucimarinus:T=436017]):7.0083058[&&NHX:B=100],DICDI:36.79394 8[&&NHX:S=Dictyostelium_discoideum:T=44689]):0.37800671[&&NHX:B=69],(CRYPV:31.705985[&&NHX:S=Cryptosporidium_parvum:T=5807],PLAF7:34.478153[&&NHX:S=Plasmodium_falciparum:T=36329]):7.4079384[&&NHX:B=100]) :0.60828157[&&NHX:B=58],(GIALA:47.040426[&&NHX:S=Giardia_lamblia:T=184922],((LEIBR:4.818744[&&NHX:S=Leishmania_braziliensis:T=5660],(LEIIN:0.94650628[&&NHX:S=Leishmania_infantum:T=5671],LEIMA:1.9314021[& &NHX:S=Leishmania_major:T=5664]):3.7710558[&&NHX:B=100]):16.797883[&&NHX:B=100],TRYBB:19.911383[&&NHX:S=Trypanosoma_brucei:T=5702]):25.940805[&&NHX:B=100]):3.5294196[&&NHX:B=99]):0.73373836[&&NHX:B=52]): 
0.42838058[&&NHX:B=31],(MONBE:35.257509[&&NHX:S=Monosiga_brevicollis:T=81824],((PRIPA:18.086434[&&NHX:S=Pristionchus_pacificus:T=54126],(CAEJA:6.0045163[&&NHX:S=Caenorhabditis_japonica:T=281687],((CAEEL: 5.3180217[&&NHX:S=Caenorhabditis_elegans:T=6239],(CAERE:3.6291663[&&NHX:S=Caenorhabditis_remanei:T=31234],CAEBR:4.3000153[&&NHX:S=Caenorhabditis_briggsae:T=6238]):0.41116075[&&NHX:B=98]):0.13776817[&&NHX :B=72],CAEBE:4.2883189[&&NHX:S=Caenorhabditis_brenneri:T=135651]):1.9913751[&&NHX:B=100]):15.936087[&&NHX:B=100]):16.164887[&&NHX:B=100],((((LOTGI:23.922724[&&NHX:S=Lottia_gigantea:T=225164],(HELRO:26.34 055[&&NHX:S=Helobdella_robusta:T=6412],CAPI1:22.755934[&&NHX:S=Capitella_sp._I:T=73382]):0.85815597[&&NHX:B=89]):2.3825092[&&NHX:B=100],(NEMVE:26.477253[&&NHX:S=Nematostella_vectensis:T=45351],((CIOSA:7. 9216903[&&NHX:S=Ciona_savignyi:T=51511],CIOIN:10.00926[&&NHX:S=Ciona_intestinalis:T=7719]):21.59891[&&NHX:B=100],((((ORYLA:6.090087[&&NHX:S=Oryzias_latipes:T=8090],(GASAC:6.1152276[&&NHX:S=Gasterosteus_a culeatus:T=69293],(TAKRU:3.0333091[&&NHX:S=Takifugu_rubripes:T=31033],TETNG:3.3887518[&&NHX:S=Tetraodon_nigroviridis:T=99883]):3.199465[&&NHX:B=100]):0.42854645[&&NHX:B=100]):3.037336[&&NHX:B=100],DANRE: 8.0442086[&&NHX:S=Danio_rerio:T=7955]):3.80391[&&NHX:B=100],(XENTR:9.8212561[&&NHX:S=Xenopus_tropicalis:T=8364],((ORNAN:5.6204266[&&NHX:S=Ornithorhynchus_anatinus:T=9258],((MACEU:3.0333255[&&NHX:S=Macrop us_eugenii:T=9315],MONDO:2.435399[&&NHX:S=Monodelphis_domestica:T=13616]):2.833353[&&NHX:B=100],((RATNO:1.3842393[&&NHX:S=Rattus_norvegicus:T=10116],MOUSE:1.2347199[&&NHX:S=Mus_musculus:T=10090]):2.30283 32[&&NHX:B=100],((((OCHPR:2.9196125[&&NHX:S=Ochotona_princeps:T=9978],RABIT:2.0106071[&&NHX:S=Oryctolagus_cuniculus:T=9986]):0.89841649[&&NHX:B=100],((((SPETR:3.0357335[&&NHX:S=Spermophilus_tridecemlinea 
tus:T=43179],((MICMU:1.9748823[&&NHX:S=Microcebus_murinus:T=30608],OTOGA:2.6116392[&&NHX:S=Otolemur_garnettii:T=30611]):0.45473502[&&NHX:B=100],(TARSY:2.4175242[&&NHX:S=Tarsius_syrichta:T=9478],(CALJA:1. 4056291[&&NHX:S=Callithrix_jacchus:T=9483],((PONAB:0.49708744[&&NHX:S=Pongo_pygmaeus_abelii:T=9601],((HUMAN:0.18782408[&&NHX:S=Homo_sapiens:T=9606],PANTR:0.22094877[&&NHX:S=Pan_troglodytes:T=9598]):0.087 77922[&&NHX:B=100],GORGO:0.41787667[&&NHX:S=Gorilla_gorilla:T=9595]):0.15069262[&&NHX:B=100]):0.17405127[&&NHX:B=100],MACMU:0.81068563[&&NHX:S=Macaca_mulatta:T=9544]):0.40572836[&&NHX:B=100]):0.90012338[ &&NHX:B=100]):0.0382438[&&NHX:B=77]):0.16073256[&&NHX:B=100]):0.0346796[&&NHX:B=75],TUPGB:2.7339007[&&NHX:S=Tupaia_belangeri:T=37347]):0.13427474[&&NHX:B=99],(((PTEVA:2.4199647[&&NHX:S=Pteropus_vampyrus: T=132908],MYOLU:2.916452[&&NHX:S=Myotis_lucifugus:T=59463]):0.30536254[&&NHX:B=100],(((PIGXX:1.9356621[&&NHX:S=Sus_scrofa:T=9823],(TURTR:1.8706565[&&NHX:S=Tursiops_truncatus:T=9739],BOVIN:1.8671969[&&NHX :S=Bos_taurus:T=9913]):0.14506514[&&NHX:B=100]):0.10418683[&&NHX:B=100],LAMPA:1.5909325[&&NHX:S=Lama_guanicoe_pacos:T=30538]):0.35680951[&&NHX:B=100],(HORSE:1.7973879[&&NHX:S=Equus_caballus:T=9796],(FELC A:1.7499263[&&NHX:S=Felis_catus:T=9685],CANFA:1.4661618[&&NHX:S=Canis_familiaris:T=9615]):0.57865839[&&NHX:B=100]):0.04776824[&&NHX:B=96]):0.02540859[&&NHX:B=52]):0.17928021[&&NHX:B=100],ERIEU:4.2884946[ &&NHX:S=Erinaceus_europaeus:T=9365]):0.10507715[&&NHX:B=45]):0.09302481[&&NHX:B=41],((ECHTE:4.2107472[&&NHX:S=Echinops_telfairi:T=9371],(PROCA:2.9088459[&&NHX:S=Procavia_capensis:T=9813],LOXAF:1.7750296[ &&NHX:S=Loxodonta_africana:T=9785]):0.57104556[&&NHX:B=100]):0.3806632[&&NHX:B=73],(CHOHO:2.5356993[&&NHX:S=Choloepus_hoffmanni:T=9358],DASNO:3.0320132[&&NHX:S=Dasypus_novemcinctus:T=9361]):0.586947[&&NH 
X:B=100]):0.15651023[&&NHX:B=57]):0.13791254[&&NHX:B=51]):0.06585063[&&NHX:B=36],(CAVPO:3.3526167[&&NHX:S=Cavia_porcellus:T=10141],DIPOR:3.3549741[&&NHX:S=Dipodomys_ordii:T=10020]):0.11345984[&&NHX:B=70] ):0.05227836[&&NHX:B=31],SORAR:3.5427681[&&NHX:S=Sorex_araneus:T=42254]):0.19739776[&&NHX:B=81]):2.2484555[&&NHX:B=100]):0.8115488[&&NHX:B=100]):1.925033[&&NHX:B=100],(ANOCA:6.8789277[&&NHX:S=Anolis_caro linensis:T=28377],(TAEGU:3.0131789[&&NHX:S=Taeniopygia_guttata:T=59729],(MELGA:0.55164365[&&NHX:S=Meleagris_gallopavo:T=9103],CHICK:0.83790488[&&NHX:S=Gallus_gallus:T=9031]):1.9357516[&&NHX:B=100]):2.787 5104[&&NHX:B=100]):0.9202608[&&NHX:B=100]):2.0010926[&&NHX:B=100]):2.6931318[&&NHX:B=100]):11.078988[&&NHX:B=100],BRAFL:21.730013[&&NHX:S=Branchiostoma_floridae:T=7739]):1.9779545[&&NHX:B=100]):0.5263615 7[&&NHX:B=100]):0.55860669[&&NHX:B=100]):0.92419882[&&NHX:B=100],TRIAD:31.681815[&&NHX:S=Trichoplax_adhaerens:T=10228]):0.72889323[&&NHX:B=79],((DAPPU:25.747514[&&NHX:S=Daphnia_pulex:T=6669],(((AEDAE:9.7 762877[&&NHX:S=Aedes_aegypti:T=7159],ANOGA:9.6648339[&&NHX:S=Anopheles_gambiae:T=7165]):9.0087147[&&NHX:B=100],(DROWI:5.6411163[&&NHX:S=Drosophila_willistoni:T=7260],((DROPS:5.494109[&&NHX:S=Drosophila_p seudoobscura:T=46245],DROME:4.9847676[&&NHX:S=Drosophila_melanogaster:T=7227]):1.0096506[&&NHX:B=100],DROVI:6.2668086[&&NHX:S=Drosophila_virilis:T=7244]):0.11597573[&&NHX:B=67]):13.350809[&&NHX:B=100]):6 .4492757[&&NHX:B=100],(APIME:18.616225[&&NHX:S=Apis_mellifera:T=7460],PEDHC:22.359785[&&NHX:S=Pediculus_humanus_corporis:T=121224]):1.7086512[&&NHX:B=98]):3.3794766[&&NHX:B=100]):2.0656827[&&NHX:B=100],I XOSC:25.911987[&&NHX:S=Ixodes_scapularis:T=6945]):1.7730423[&&NHX:B=100]):3.3897979[&&NHX:B=100]):1.8522167[&&NHX:B=100]):2.6952513[&&NHX:B=100]):1.5835346[&&NHX:B=81],ENCCU:57.116454[&&NHX:S=Encephalito 
zoon_cuniculi:T=6035]):0.3079138[&&NHX:B=100],(((LACBI:24.773942[&&NHX:S=Laccaria_bicolor:T=29883],CRYNE:23.093254[&&NHX:S=Cryptococcus_neoformans:T=5207]):2.2382142[&&NHX:B=100],USTMA:22.093731[&&NHX:S= Ustilago_maydis:T=5270]):5.8884234[&&NHX:B=100],(SCHPO:33.58652[&&NHX:S=Schizosaccharomyces_pombe:T=4896],((YARLI:27.176259[&&NHX:S=Yarrowia_lipolytica:T=4952],(((LODEL:10.798514[&&NHX:S=Lodderomyces_elo ngisporus:T=36914],CANAL:10.930718[&&NHX:S=Candida_albicans:T=5476]):2.2819566[&&NHX:B=100],(PICST:11.62454[&&NHX:S=Pichia_stipitis:T=4924],DEBHA:12.486821[&&NHX:S=Debaryomyces_hansenii:T=4959]):1.548869 7[&&NHX:B=100]):9.7653977[&&NHX:B=100],((CANGA:12.853514[&&NHX:S=Candida_glabrata:T=5478],YEAST:12.995237[&&NHX:S=Saccharomyces_cerevisiae:T=4932]):2.7292937[&&NHX:B=100],(KLULA:15.769021[&&NHX:S=Kluyver omyces_lactis:T=28985],ASHGO:15.205039[&&NHX:S=Ashbya_gossypii:T=33169]):1.5031692[&&NHX:B=100]):8.385147[&&NHX:B=100]):5.2388175[&&NHX:B=100]):4.6698393[&&NHX:B=100],(((PHANO:17.796313[&&NHX:S=Phaeosphaeria_nodorum:T=13684],MYCGR:17.395458[&&NHX:S=Mycosphaerella_graminicola:T=54734]):2.2904277[&&NHX:B=100],(ASPFU:7.6395693[&&NHX:S=Aspergillus_fumigatus:T=5085],EMENI:7.6966689[&&NHX:S=Emericella_nidulans:T=162425]):10.679052[&&NHX:B=100]):0.21889651[&&NHX:B=62],(BOTFB:14.068847[&&NHX:S=Botrytis_cinerea:T=332648],(MAGGR:13.45956[&&NHX:S=Magnaporthe_grisea:T=148305],NEUCR:12.904786[&&NHX:S=Neurospora_crassa:T=5141]):3.3272798[&&NHX:B=100]):3.0611813[&&NHX:B=100]):11.080428[&&NHX:B=100]):0.54809742[&&NHX:B=99]):3.3723191[&&NHX:B=100]):5.8323612[&&NHX:B=100]):0; alnfile.fasta100644000766000024 44614006427353 17012 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data>BS1-fragment 7fab light chain variable region TISCTGSSSNIGAGNHVKWYQQLPG >BS2-fragment 2fb4 light chain variable region VTISCTGTSSNIGSITVNWYQQLPG >BS3-fragment 2fb4 heavy chain variable region LRLSCSSSGFIFSSYAMYWVRQAPG >BS4-fragment 7fab heavy chain variable region LSLTCTVSGTSFDDYYSTWVRQPPG 
test_data.axt100644000766000024 62014006427353 17040 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data0 chr19 3001012 3001075 chr11 70568380 70568443 - 3500 TCAGCTCATAAATCACCTCCTGCCACAAGCCTGGCCTGGTCCCAGGAGAGTGTCCAGGCTCAGA TCTGTTCATAAACCACCTGCCATGACAAGCCTGGCCTGTTCCCAAGACAATGTCCAGGCTCAGA 1 chr19 3008279 3008357 chr11 70573976 70574054 - 3900 CACAATCTTCACATTGAGATCCTGAGTTGCTGATCAGAATGGAAGGCTGAGCTAAGATGAGCGACGAGGCAATGTCACA CACAGTCTTCACATTGAGGTACCAAGTTGTGGATCAGAATGGAAAGCTAGGCTATGATGAGGGACAGTGCGCTGTCACAtestaln.selex100644000766000024 1133114006427353 17127 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data#=SQ HSFAU 1.00 - - 0..0:0 H.sapiens fau mRNA #=SQ HSFAU1 1.00 - - 0..0:0 H.sapiens fau 1 gene HSFAU ttcctctttctcgactccatcttcgcggtagctgggaccgccgttcagtc HSFAU1 ctaccattttccctctcgattctatatgtacactcgggacaagttctcct HSFAU gccaatatgcagctctttgtccgcgcccaggagctacacaccttcgaggt HSFAU1 gatcgaaaacggcaaaactaaggccccaagtaggaatgccttagttttcg HSFAU gaccggccaggaaacggtcgcccagatcaaggctcatgtagcctcactgg HSFAU1 gggttaacaatgattaacactgagcctcacacccacgcgatgccctcagc HSFAU agggcattgccccggaagatcaagtcgtgctcctggcaggcgcgcccctg HSFAU1 tcctcgctcagcgctctcaccaacagccgtagcccgcagccccgctggac HSFAU gaggatgaggccactctgggccagtgcggggtggaggccctgactaccct HSFAU1 accggttctccatccccgcagcgtagcccggaacatggtagctgccatct HSFAU ggaagtagcaggccgcatgcttggaggtaaagttcatggttccctggccc HSFAU1 ttacctgctacgccagccttctgtgcgcgcaactgtctggtcccgccccg HSFAU gtgctggaaaagtgagaggtcagactcctaaggtggccaaacaggagaag HSFAU1 tcctgcgcgagctgctgcccaggcaggttcgccggtgcgagcgtaaaggg HSFAU aagaagaagaagacaggtcgggctaagcggcggatgcagtacaaccggcg HSFAU1 gcggagctaggactgccttgggcggtacaaatagcagggaaccgcgcggt HSFAU ctttgtcaacgttgtgcccacctttggcaagaagaagggccccaatgcca HSFAU1 cgctcagcagtgacgtgacacgcagcccacggtctgtactgacgcgccct HSFAU actcttaagtcttttgtaattctggctttctctaataaaaaagccactta HSFAU1 cgcttcttcctctttctcgactccatcttcgcggtagctgggaccgccgt HSFAU gttcagtcaaaaaaaaaa HSFAU1 tcaggtaagaatggggccttggctggatccgaagggcttgtagcaggttg HSFAU HSFAU1 
gctgcggggtcagaaggcgcggggggaaccgaagaacggggcctgctccg HSFAU HSFAU1 tggccctgctccagtccctatccgaactccttgggaggcactggccttcc HSFAU HSFAU1 gcacgtgagccgccgcgaccaccatcccgtcgcgatcgtttctggaccgc HSFAU HSFAU1 tttccactcccaaatctcctttatcccagagcatttcttggcttctctta HSFAU HSFAU1 caagccgtcttttctttactcagtcgccaatatgcagctctttgtccgcg HSFAU HSFAU1 cccaggagctacacaccttcgaggtgaccggccaggaaacggtcgcccag HSFAU HSFAU1 atcaaggtaaggctgcttggtgcgccctgggttccattttcttgtgctct HSFAU HSFAU1 tcactctcgcggcccgagggaacgcttacgagccttatctttccctgtag HSFAU HSFAU1 gctcatgtagcctcactggagggcattgccccggaagatcaagtcgtgct HSFAU HSFAU1 cctggcaggcgcgcccctggaggatgaggccactctgggccagtgcgggg HSFAU HSFAU1 tggaggccctgactaccctggaagtagcaggccgcatgcttggaggtgag HSFAU HSFAU1 tgagagaggaatgttctttgaagtaccggtaagcgtctagtgagtgtggg HSFAU HSFAU1 gtgcatagtcctgacagctgagtgtcacacctatggtaatagagtacttc HSFAU HSFAU1 tcactgtcttcagttcagagtgattcttcctgtttacatccctcatgttg HSFAU HSFAU1 aacacagacgtccatgggagactgagccagagtgtagttgtatttcagtc HSFAU HSFAU1 acatcacgagatcctagtctggttatcagcttccacactaaaaattaggt HSFAU HSFAU1 cagaccaggccccaaagtgctctataaattagaagctggaagatcctgaa HSFAU HSFAU1 atgaaacttaagatttcaaggtcaaatatctgcaactttgttctcattac HSFAU HSFAU1 ctattgggcgcagcttctctttaaaggcttgaattgagaaaagaggggtt HSFAU HSFAU1 ctgctgggtggcaccttcttgctcttacctgctggtgccttcctttccca HSFAU HSFAU1 ctacaggtaaagtccatggttccctggcccgtgctggaaaagtgagaggt HSFAU HSFAU1 cagactcctaaggtgagtgagagtattagtggtcatggtgttaggacttt HSFAU HSFAU1 ttttcctttcacagctaaaccaagtccctgggctcttactcggtttgcct HSFAU HSFAU1 tctccctccctggagatgagcctgagggaagggatgctaggtgtggaaga HSFAU HSFAU1 caggaaccagggcctgattaaccttcccttctccaggtggccaaacagga HSFAU HSFAU1 gaagaagaagaagaagacaggtcgggctaagcggcggatgcagtacaacc HSFAU HSFAU1 ggcgctttgtcaacgttgtgcccacctttggcaagaagaagggccccaat HSFAU HSFAU1 gccaactcttaagtcttttgtaattctggctttctctaataaaaaagcca HSFAU HSFAU1 cttagttcagtcatcgcattgtttcatctttacttgcaaggcctcaggga HSFAU HSFAU1 gaggtgtgcttctcgg cysprot1b.msf100644000766000024 464514006427353 17042 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataPileUp MSF: 366 
Type: P Check: 8252 .. Name: CATL_HUMAN oo Len: 366 Check: 9087 Weight: 21.6 Name: CATL_RAT oo Len: 366 Check: 271 Weight: 20.7 Name: CATH_RAT oo Len: 366 Check: 8974 Weight: 27.9 Name: PAPA_CARPA oo Len: 366 Check: 9920 Weight: 29.7 // CATL_HUMAN ......MNPT LILAAFCLGI .......... ...ASATLTF DHSLEAQWTK CATL_RAT ......MTPL LLLAVLCLGT .......... ...ALATPKF DQTFNAQWHQ CATH_RAT ...MWTALPL LCAGAWLLSA G......... ...ATAELTV NAIEKFHFTS PAPA_CARPA MAMIPSISKL LFVAICLFVY MGLSFGDFSI VGYSQNDLTS TERLIQLFES CATL_HUMAN WKAMHNRLYG .MNEEGWRRA VWEKNMKMIE LHNQEYREGK HSFTMAMNAF CATL_RAT WKSTHRRLYG .TNEEEWRRA VWEKNMRMIQ LHNGEYSNGK HGFTMEMNAF CATH_RAT WMKQHQKTYS .SREYSHRLQ VFANNWRKIQ AHNQRN.... HTFKMGLNQF PAPA_CARPA WMLKHNKIYK NIDEKIYRFE IFKDNLKYID ETNKKN.... NSYWLGLNVF CATL_HUMAN GDMTSEEFRQ VMNGFQNRKP RKGKVFQEPL FY....EAPR SVDWREKG.Y CATL_RAT GDMTNEEFRQ IVNGYRHQKH KKGRLFQEPL ML....QIPK TVDWREKG.C CATH_RAT SDMSFAEIKH KYLWSEPQNC SATKSNYLRG TG....PYPS SMDWRKKGNV PAPA_CARPA ADMSNDEFKE KYTGSIAGNY TTTELSYEEV LNDGDVNIPE YVDWRQKG.A CATL_HUMAN VTPVKNQGQC GSCWAFSATG ALEGQMFRKT GRLISLSEQN LVDCSGPQGN CATL_RAT VTPVKNQGQC GSCWAFSASG CLEGQMFLKT GKLISLSEQN LVDCSHDQGN CATH_RAT VSPVKNQGAC GSCWTFSTTG ALESAVAIAS GKMMTLAEQQ LVDCAQNFNN PAPA_CARPA VTPVKNQGSC GSCWAFSAVV TIEGIIKIRT GNLNEYSEQE LLDCDRR..S CATL_HUMAN EGCNGGLMDY AFQYVQDNGG LDSEESYPYE ATEESCKYNP KYSVANDTGF CATL_RAT QGCNGGLMDF AFQYIKENGG LDSEESYPYE AKDGSCKYRA EYAVANDTGF CATH_RAT HGCQGGLPSQ AFEYILYNKG IMGEDSYPYI GKNGQCKFNP EKAVAFVKNV PAPA_CARPA YGCNGGYPWS ALQLVAQYG. IHYRNTYPYE GVQRYCRSRE KGPYAAKTDG CATL_HUMAN VDIPK.QEKA LMKAVATVGP ISVAIDAGHE SFLFYKEGIY FEPDCSS..E CATL_RAT VDIPQ.QEKA LMKAVATVGP ISVAMDASHP SLQFYSSGIY YEPNCSS..K CATH_RAT VNITLNDEAA MVEAVALYNP VSFAFEVT.E DFMMYKSGVY SSNSCHKTPD PAPA_CARPA VRQVQPYNEG ALLYSIANQP VSVVLEAAGK DFQLYRGGIF VGP.CGN... CATL_HUMAN DMDHGVLVVG YGFESTESDN NKYWLVKNSW GEEWGMGGYV KMAKDRR... CATL_RAT DLDHGVLVVG YGYEGTDSNK DKYWLVKNSW GKEWGMDGYI KIAKDRN... CATH_RAT KVNHAVLAVG YG....EQNG LLYWIVKNSW GSNWGNNGYF LIERGK.... 
PAPA_CARPA KVDHAVAAVG YG........ PNYILIKNSW GTGWGENGYI RIKRGTGNSY CATL_HUMAN NHCGIASAAS YPTV.. CATL_RAT NHCGLATAAS YPIVN. CATH_RAT NMCGLAACAS YPIPQV PAPA_CARPA GVCGLYTSSF YPVKN. codeml45b.mlc100644000766000024 11035014006427353 16707 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data 9 1425 Pdel181_DNA2 ATG TCA CCA CAA ACA GAG ACT AAA ACA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAG TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT CCT GCT TAT GTT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG GCT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATC AAA AGG GCT GTA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT ACT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATG CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT GTT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTG CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCC GGT GCC GTT GCT AAT CGA GTA GCT 
CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC CAA GCA ATG GAT ACG TTG Pfre186_DNA2 ATG TCA CCA CAA ACA GAG ACT AAA ACA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAG TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT CCT GCT TAT GTT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG GCT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATC AAA AGG GCT GTA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT ACT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATG CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT ATT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTG CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCC GGT GCC GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT 
ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC CAA GCA ATG GAT ACG TTG Pgra187_DNA2 ATG TCA CCA CAA ACA GAG ACT AAA GCA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAC TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT CCT GCT TAT ACT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG ACT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATG AAA AGG GCT ATA TTT GCC AGA GAA TTG GGA GTT CCT ATT GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT AGT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATA CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT GTT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTG CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCT GGT GCC GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATA 
AAA TTT GAA TTC GAA GCA ATG GAT ACG TTG Phet26_DNA21 ATG TCA CCA CAA ACA GAG ACT AAA GCA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAC TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT CCT GCT TAT GTT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG GCT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATC AAA AGG GCT GTA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT AGT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATA CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT GTT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTA CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCC GGT GCC GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC CAA GCA ATG GAT ACG TTG Pmex37_DNA21 ATG TCA CCA CAA ACA GAG ACT AAA 
GCA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAC TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT ACT GCT TAT GTT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG GCT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATC AAA AGG GCT GTA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT AGT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATG CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT ATT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTG CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCC GGT GCC GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC CAA GCA ATG GAT ACG TTG Ptre197_DNA2 ATG TCA CCA CAA ACA GAG ACT AAA GCA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAC TAT 
GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT CCT GCT TAT ACT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG ACT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATG AAA AGG GCT ATA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT AGT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATA CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT GTT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTG CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCT GGT GCT GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC GAA GCA ATG GAT ACG TTG WHR1_DNA225 ATG TCA CCA CAA ACA GAG ACT AAA GCG GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAG TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG 
GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCC GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT ACT GCT TAT GTT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG GCT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATG AAA AGG GCT GTA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT AGT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATA CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT GTT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTG CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCC GGT GCC GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC CAA GCA ATG GAT ACG TTG YALD273_DNA5 ATG TCA CCA CAA ACA GAG ACT AAA GCA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAC TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC 
GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT CCT GCT TAT GTT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG GCT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATC AAA AGG GCT GTA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT AGT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATA CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT GTT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTA CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCC GGT GCC GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC CAA GCA ATG GAT ACG TTG Pop_trich_ch ATG TCA CCA CAA ACA GAG ACT AAA GCA GGT GTT GGA TTC AAG GCT GGT GTT AAA GAT TAT AAA TTG ACT TAT TAT ACT CCT GAC TAT GAA ACC AAA GAT ACT GAT ATC TTG GCA GCA TTC CGA GTA ACT CCT CAA CCT GGA GTT CCG CCC GAG GAA GCA GGG GCC GCA GTA GCT GCT GAA TCT TCT ACT GGT ACA TGG ACA ACT GTG TGG ACC GAC GGG CTT ACC AGT CTT GAT CGT TAT AAG GGA CGA TGC TAC GAC ATC GAG CCC GTT GCT GGA GAA 
GAA AAT CAA TTT ATT GCT TAT GTA GCT TAC CCC TTA GAC CTT TTT GAA GAA GGT TCT GTT ACT AAC ATG TTT ACT TCC ATT GTG GGT AAT GTA TTT GGG TTC AAA GCC CTA CGC GCT CTA CGT CTG GAG GAT TTG CGA ATT CCT CCT GCT TAT GTT AAA ACT TTT CAA GGC CCA CCT CAT GGT ATC CAA GTT GAA AGA GAT AAA TTG AAC AAG TAT GGT CGC CCC CTA TTG GGC TGT ACT ATT AAA CCT AAA TTG GGG TTA TCC GCT AAG AAT TAC GGT AGA GCA GTT TAT GAA TGT CTA CGC GGT GGA CTT GAT TTT ACC AAA GAT GAT GAG AAC GTG AAC TCC CAA CCA TTT ATG CGT TGG AGA GAT CGT TTC TTA TTT TGT GCC GAA GCA CTT TAT AAA GCA CAG GCT GAA ACC GGT GAA ATC AAA GGG CAT TAT TTA AAC GCT ACT GCA GGT ACA TGC GAA GAA ATG ATC AAA AGG GCT GTA TTT GCC AGA GAA TTG GGA GTT CCT ATC GTA ATG CAT GAC TAC TTA ACA GGG GGA TTC ACC GCA AAC ACT AGT TTG GCT CAT TAT TGC CGA GAT AAT GGT TTA CTT CTT CAC ATC CAT CGC GCA ATG CAT GCA GTT ATT GAT AGA CAG AAA AAT CAT GGT ATA CAC TTT CGT GTA CTA GCT AAG GCA TTA CGT ATG TCT GGT GGA GAT CAT ATT CAC TCT GGT ACC GTA GTA GGT AAA CTT GAA GGG GAA AGA GAC ATA ACT TTG GGT TTT GTT GAT TTA CTG CGT GAT GAT TTT GTT GAA AAA GAT CGA AGC CGC GGT ATT TAT TTC ACT CAA GAT TGG GTC TCT CTA CCG GGT GTT CTA CCC GTG GCT TCG GGG GGT ATT CAC GTT TGG CAT ATG CCT GCT CTG ACC GAG ATC TTT GGA GAT GAT TCC GTA CTA CAA TTC GGT GGA GGA ACT TTA GGG CAC CCT TGG GGA AAT GCA CCC GGT GCC GTT GCT AAT CGA GTA GCT CTA GAA GCA TGT GTA CAA GCT CGT AAT GAG GGA CGT GAT CTT GCT CGT GAG GGT AAT GAA ATT ATC CGT GAA GCT AGC AAA TGG AGC CCT GAA CTA GCT GCT GCT TGT GAA GTA TGG AAG GAG ATT AAA TTT GAA TTC CAA GCA ATG GAT ACG TTG Printing out site pattern counts 9 225 P Pdel181_DNA2 AAA AAC AAG AAT ACA ACA ACC ACG ACT ACT AGA AGC AGG AGT ATA ATC ATC ATC ATG ATG ATT ATT CAA CAA CAC CAG CAT CCA CCC CCC CCG CCT CCT CGA CGC CGT CTA CTG CTG CTT GAA GAC GAG GAG GAT GCA GCC GCC GCT GCT GGA GGC GGG GGT GTA GTA GTC GTG GTT GTT GTT TAC TAT TCA TCC TCG TCT TCT TGC TGG TGT TTA TTC TTG TTT Pfre186_DNA2 ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... 
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... A.. ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... Pgra187_DNA2 ... ... ... ... ... G.. ... ... ... .G. ... ... ... ... ... ... ..G ..T ..A ... ..A ... ... G.. ... ... ... ... ... ..T ... ... ... ... ... ... ... ... ... ... ... ... ..C ... ... ... ... ... A.. ... ... ... ... ... A.. ... ... ... ... AC. ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... Phet26_DNA21 ... ... ... ... ... G.. ... ... ... .G. ... ... ... ... ... ... ... ... ..A ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ..A ... ... ... ... ..C ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... Pmex37_DNA21 ... ... ... ... ... G.. ... ... ... .G. ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... A.. ... ... ... ... ... ... ... ... ... ... ..C ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... A.. ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... Ptre197_DNA2 ... ... ... ... ... G.. ... ... ... .G. ... ... ... ... ... ... ..G ... ..A ... ... ... ... G.. ... ... ... ... ... ..T ... ... ... ... ... ... ... ... ... ... ... ... ..C ... ... ... ... ..T A.. ... ... ... ... ... A.. ... ... ... ... AC. ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... WHR1_DNA225 ... ... ... ... ... G.G ... ... ... .G. ... ... ... ... ... ... ..G ... ..A ... ... ... ... ... ... ... ... ... ... ... ... A.. ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ..C ... ... ... ... ... ... ... ... YALD273_DNA5 ... ... ... ... ... G.. ... ... ... .G. ... ... ... ... ... ... ... ... ..A ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ..A ... ... ... ... ..C ... ... ... ... ... ... ... ... ... ... ... ... ... 
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... Pop_trich_ch ... ... ... ... ... G.. ... ... ... .G. ... ... ... ... ... ... ... ... ..A ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ..A ... ... ... ... ..C ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... 18 6 6 9 5 1 8 1 16 1 6 3 1 1 1 7 1 1 1 9 1 9 9 1 5 2 9 3 5 1 2 1 10 6 5 11 9 1 3 9 24 5 1 9 22 15 4 1 1 23 13 2 9 23 1 12 1 4 1 1 13 4 13 1 4 1 1 5 3 8 5 9 8 10 14 CODONML (in paml version 4.5, December 2011) examples/small_taxon_set.nuc Model: One dN/dS ratio for branches Codon frequency model: F3x4 ns = 9 ls = 475 Codon usage in sequences -------------------------------------------------------------------------------------------------------------- Phe TTT 14 14 14 14 14 14 | Ser TCT 6 6 6 6 6 6 | Tyr TAT 13 13 13 13 13 13 | Cys TGT 5 5 5 5 5 5 TTC 8 8 8 8 8 8 | TCC 4 4 4 4 4 4 | TAC 4 4 4 4 4 4 | TGC 3 3 3 3 3 3 Leu TTA 9 9 9 9 9 9 | TCA 1 1 1 1 1 1 | *** TAA 0 0 0 0 0 0 | *** TGA 0 0 0 0 0 0 TTG 10 10 10 10 10 10 | TCG 1 1 1 1 1 1 | TAG 0 0 0 0 0 0 | Trp TGG 8 8 8 8 8 8 -------------------------------------------------------------------------------------------------------------- Leu CTT 9 9 9 9 9 9 | Pro CCT 11 11 12 11 10 12 | His CAT 9 9 9 9 9 9 | Arg CGT 11 11 11 11 11 11 CTC 0 0 0 0 0 0 | CCC 6 6 5 6 6 5 | CAC 5 5 5 5 5 5 | CGC 5 5 5 5 5 5 CTA 9 9 9 10 9 9 | CCA 3 3 3 3 3 3 | Gln CAA 10 10 9 10 10 9 | CGA 6 6 6 6 6 6 CTG 4 4 4 3 4 4 | CCG 2 2 2 2 2 2 | CAG 2 2 2 2 2 2 | CGG 0 0 0 0 0 0 -------------------------------------------------------------------------------------------------------------- Ile ATT 10 11 10 10 11 10 | Thr ACT 17 17 18 16 17 18 | Asn AAT 9 9 9 9 9 9 | Ser AGT 1 1 2 2 2 2 ATC 9 9 7 9 9 8 | ACC 8 8 8 8 8 8 | AAC 6 6 6 6 6 6 | AGC 3 3 3 3 3 3 ATA 1 1 4 2 1 3 | ACA 6 6 5 5 5 5 | Lys AAA 18 18 18 18 18 18 | Arg AGA 6 6 6 6 6 6 Met ATG 10 10 10 9 10 10 | ACG 1 1 
1 1 1 1 | AAG 6 6 6 6 6 6 | AGG 1 1 1 1 1 1 -------------------------------------------------------------------------------------------------------------- Val GTT 15 14 14 15 14 14 | Ala GCT 24 24 23 24 24 24 | Asp GAT 22 22 22 22 22 22 | Gly GGT 23 23 23 23 23 23 GTC 1 1 1 1 1 1 | GCC 5 5 5 5 5 4 | GAC 5 5 6 6 6 6 | GGC 2 2 2 2 2 2 GTA 13 13 12 13 13 12 | GCA 15 15 16 16 16 16 | Glu GAA 24 24 25 24 24 25 | GGA 13 13 13 13 13 13 GTG 4 4 4 4 4 4 | GCG 0 0 0 0 0 0 | GAG 10 10 9 9 9 9 | GGG 9 9 9 9 9 9 -------------------------------------------------------------------------------------------------------------- -------------------------------------------------------------------------- Phe TTT 14 14 14 | Ser TCT 5 6 6 | Tyr TAT 13 13 13 | Cys TGT 5 5 5 TTC 8 8 8 | TCC 5 4 4 | TAC 4 4 4 | TGC 3 3 3 Leu TTA 9 9 9 | TCA 1 1 1 | *** TAA 0 0 0 | *** TGA 0 0 0 TTG 10 10 10 | TCG 1 1 1 | TAG 0 0 0 | Trp TGG 8 8 8 -------------------------------------------------------------------------- Leu CTT 9 9 9 | Pro CCT 10 11 11 | His CAT 9 9 9 | Arg CGT 11 11 11 CTC 0 0 0 | CCC 6 6 6 | CAC 5 5 5 | CGC 5 5 5 CTA 9 10 10 | CCA 3 3 3 | Gln CAA 10 10 10 | CGA 6 6 6 CTG 4 3 3 | CCG 2 2 2 | CAG 2 2 2 | CGG 0 0 0 -------------------------------------------------------------------------- Ile ATT 10 10 10 | Thr ACT 17 16 16 | Asn AAT 9 9 9 | Ser AGT 2 2 2 ATC 8 9 9 | ACC 8 8 8 | AAC 6 6 6 | AGC 3 3 3 ATA 2 2 2 | ACA 5 5 5 | Lys AAA 18 18 18 | Arg AGA 6 6 6 Met ATG 10 9 9 | ACG 1 1 1 | AAG 6 6 6 | AGG 1 1 1 -------------------------------------------------------------------------- Val GTT 15 15 15 | Ala GCT 24 24 24 | Asp GAT 22 22 22 | Gly GGT 23 23 23 GTC 1 1 1 | GCC 5 5 5 | GAC 5 6 6 | GGC 2 2 2 GTA 13 13 13 | GCA 15 16 16 | Glu GAA 24 24 24 | GGA 13 13 13 GTG 4 4 4 | GCG 1 0 0 | GAG 10 9 9 | GGG 9 9 9 -------------------------------------------------------------------------- Codon position x base (3x4) table for each sequence. 
#1: Pdel181_DNA2 position 1: T:0.18105 C:0.19368 A:0.23579 G:0.38947 position 2: T:0.26526 C:0.23158 A:0.30105 G:0.20211 position 3: T:0.41895 C:0.15579 A:0.28211 G:0.14316 Average T:0.28842 C:0.19368 A:0.27298 G:0.24491 #2: Pfre186_DNA2 position 1: T:0.18105 C:0.19368 A:0.23789 G:0.38737 position 2: T:0.26526 C:0.23158 A:0.30105 G:0.20211 position 3: T:0.41895 C:0.15579 A:0.28211 G:0.14316 Average T:0.28842 C:0.19368 A:0.27368 G:0.24421 #3: Pgra187_DNA2 position 1: T:0.18105 C:0.19158 A:0.24000 G:0.38737 position 2: T:0.26316 C:0.23158 A:0.30105 G:0.20421 position 3: T:0.42105 C:0.15158 A:0.28632 G:0.14105 Average T:0.28842 C:0.19158 A:0.27579 G:0.24421 #4: Phet26_DNA21 position 1: T:0.18105 C:0.19368 A:0.23368 G:0.39158 position 2: T:0.26526 C:0.22947 A:0.30105 G:0.20421 position 3: T:0.41895 C:0.15789 A:0.28632 G:0.13684 Average T:0.28842 C:0.19368 A:0.27368 G:0.24421 #5: Pmex37_DNA21 position 1: T:0.18105 C:0.19158 A:0.23789 G:0.38947 position 2: T:0.26526 C:0.22947 A:0.30105 G:0.20421 position 3: T:0.41895 C:0.15789 A:0.28211 G:0.14105 Average T:0.28842 C:0.19298 A:0.27368 G:0.24491 #6: Ptre197_DNA2 position 1: T:0.18105 C:0.19158 A:0.24000 G:0.38737 position 2: T:0.26316 C:0.23158 A:0.30105 G:0.20421 position 3: T:0.42316 C:0.15158 A:0.28421 G:0.14105 Average T:0.28912 C:0.19158 A:0.27509 G:0.24421 #7: WHR1_DNA225 position 1: T:0.18105 C:0.19158 A:0.23579 G:0.39158 position 2: T:0.26526 C:0.22947 A:0.30105 G:0.20421 position 3: T:0.41684 C:0.15579 A:0.28211 G:0.14526 Average T:0.28772 C:0.19228 A:0.27298 G:0.24702 #8: YALD273_DNA5 position 1: T:0.18105 C:0.19368 A:0.23368 G:0.39158 position 2: T:0.26526 C:0.22947 A:0.30105 G:0.20421 position 3: T:0.41895 C:0.15789 A:0.28632 G:0.13684 Average T:0.28842 C:0.19368 A:0.27368 G:0.24421 #9: Pop_trich_ch position 1: T:0.18105 C:0.19368 A:0.23368 G:0.39158 position 2: T:0.26526 C:0.22947 A:0.30105 G:0.20421 position 3: T:0.41895 C:0.15789 A:0.28632 G:0.13684 Average T:0.28842 C:0.19368 A:0.27368 G:0.24421 Sums of 
codon usage counts ------------------------------------------------------------------------------ Phe F TTT 126 | Ser S TCT 53 | Tyr Y TAT 117 | Cys C TGT 45 TTC 72 | TCC 37 | TAC 36 | TGC 27 Leu L TTA 81 | TCA 9 | *** * TAA 0 | *** * TGA 0 TTG 90 | TCG 9 | TAG 0 | Trp W TGG 72 ------------------------------------------------------------------------------ Leu L CTT 81 | Pro P CCT 99 | His H CAT 81 | Arg R CGT 99 CTC 0 | CCC 52 | CAC 45 | CGC 45 CTA 84 | CCA 27 | Gln Q CAA 88 | CGA 54 CTG 33 | CCG 18 | CAG 18 | CGG 0 ------------------------------------------------------------------------------ Ile I ATT 92 | Thr T ACT 152 | Asn N AAT 81 | Ser S AGT 16 ATC 77 | ACC 72 | AAC 54 | AGC 27 ATA 18 | ACA 47 | Lys K AAA 162 | Arg R AGA 54 Met M ATG 87 | ACG 9 | AAG 54 | AGG 9 ------------------------------------------------------------------------------ Val V GTT 131 | Ala A GCT 215 | Asp D GAT 198 | Gly G GGT 207 GTC 9 | GCC 44 | GAC 51 | GGC 18 GTA 115 | GCA 141 | Glu E GAA 218 | GGA 117 GTG 36 | GCG 1 | GAG 84 | GGG 81 ------------------------------------------------------------------------------ Codon position x base (3x4) table, overall position 1: T:0.18105 C:0.19275 A:0.23649 G:0.38971 position 2: T:0.26480 C:0.23041 A:0.30105 G:0.20374 position 3: T:0.41942 C:0.15579 A:0.28421 G:0.14058 Average T:0.28842 C:0.19298 A:0.27392 G:0.24468 Codon frequencies under model, for use in evolver (TTT TTC TTA TTG ... 
GGG): 0.02080748 0.00772882 0.01409988 0.00697451 0.01810545 0.00672517 0.01226889 0.00606881 0.02365656 0.00878710 0.00000000 0.00000000 0.01601000 0.00594683 0.00000000 0.00536643 0.02215163 0.00822810 0.01501073 0.00742506 0.01927505 0.00715961 0.01306145 0.00646085 0.02518476 0.00935474 0.01706608 0.00844174 0.01704423 0.00633099 0.01154977 0.00571310 0.02717876 0.01009540 0.01841729 0.00911012 0.02364937 0.00878443 0.01602564 0.00792709 0.03090024 0.01147772 0.02093909 0.01035753 0.02091228 0.00776775 0.01417090 0.00700964 0.04478716 0.01663594 0.03034936 0.01501232 0.03897116 0.01447562 0.02640823 0.01306284 0.05091968 0.01891383 0.03450497 0.01706789 0.03446079 0.01280027 0.02335185 0.01155100 Nei & Gojobori 1986. dN/dS (dN, dS) (Note: This matrix is not used in later ML. analysis. Use runmode = -2 for ML pairwise comparison.) Pdel181_DNA2 Pfre186_DNA2aaaaaaaa-1.0300 (0.0009 0.0000) Pgra187_DNA2 1.0623 (0.0093 0.0088) 1.1684 (0.0102 0.0088) Phet26_DNA21 1.2775 (0.0037 0.0029) 1.5968 (0.0046 0.0029) 0.4758 (0.0056 0.0117) Pmex37_DNA21 -1.0000 (0.0046 0.0000)-1.0000 (0.0037 0.0000) 0.9534 (0.0084 0.0088) 0.9555 (0.0028 0.0029) Ptre197_DNA2 1.5965 (0.0093 0.0058) 1.7561 (0.0102 0.0058) 0.0000 (0.0000 0.0088) 0.6357 (0.0056 0.0088) 1.4328 (0.0084 0.0058) WHR1_DNA225 0.7964 (0.0046 0.0058) 0.9557 (0.0056 0.0058) 0.4430 (0.0065 0.0147) 0.3175 (0.0028 0.0088) 0.6354 (0.0037 0.0058) 0.5548 (0.0065 0.0117) YALD273_DNA5 1.2775 (0.0037 0.0029) 1.5968 (0.0046 0.0029) 0.4758 (0.0056 0.0117)-1.0000 (0.0000 0.0000) 0.9555 (0.0028 0.0029) 0.6357 (0.0056 0.0088) 0.3175 (0.0028 0.0088) Pop_trich_ch 1.2775 (0.0037 0.0029) 1.5968 (0.0046 0.0029) 0.4758 (0.0056 0.0117)-1.0000 (0.0000 0.0000) 0.9555 (0.0028 0.0029) 0.6357 (0.0056 0.0088) 0.3175 (0.0028 0.0088)-1.0000 (0.0000 0.0000) pairwise comparison, codon frequencies: F3x4. 2 (Pfre186_DNA2) ... 
1 (Pdel181_DNA2) lnL =-1891.391969 0.00214 999.00000 99.00000 t= 0.0021 S= 396.3 N= 1028.7 dN/dS= 99.0000 dN = 0.0010 dS = 0.0000 3 (Pgra187_DNA2) ... 1 (Pdel181_DNA2) lnL =-1968.029688 0.02808 3.26389 0.98370 t= 0.0281 S= 341.7 N= 1083.3 dN/dS= 0.9837 dN = 0.0093 dS = 0.0095 3 (Pgra187_DNA2) ... 2 (Pfre186_DNA2) lnL =-1972.711786 0.03025 3.71696 1.10191 t= 0.0302 S= 345.4 N= 1079.6 dN/dS= 1.1019 dN = 0.0103 dS = 0.0094 4 (Phet26_DNA21) ... 1 (Pdel181_DNA2) lnL =-1921.561815 0.01073 3.11375 1.18654 t= 0.0107 S= 342.5 N= 1082.5 dN/dS= 1.1865 dN = 0.0037 dS = 0.0031 4 (Phet26_DNA21) ... 2 (Pfre186_DNA2) lnL =-1927.106138 0.01287 4.25522 1.54964 t= 0.0129 S= 352.0 N= 1073.0 dN/dS= 1.5496 dN = 0.0047 dS = 0.0030 4 (Phet26_DNA21) ... 3 (Pgra187_DNA2) lnL =-1948.501568 0.02179 4.30601 0.45869 t= 0.0218 S= 349.8 N= 1075.2 dN/dS= 0.4587 dN = 0.0056 dS = 0.0123 5 (Pmex37_DNA21) ... 1 (Pdel181_DNA2) lnL =-1920.298777 0.01053 1.55733 99.00000 t= 0.0105 S= 318.9 N= 1106.1 dN/dS= 99.0000 dN = 0.0045 dS = 0.0000 5 (Pmex37_DNA21) ... 2 (Pfre186_DNA2) lnL =-1914.616800 0.00838 0.77696 99.00000 t= 0.0084 S= 296.5 N= 1128.5 dN/dS= 99.0000 dN = 0.0035 dS = 0.0000 5 (Pmex37_DNA21) ... 3 (Pgra187_DNA2) lnL =-1960.971487 0.02594 4.03558 0.91514 t= 0.0259 S= 348.1 N= 1076.9 dN/dS= 0.9151 dN = 0.0085 dS = 0.0092 5 (Pmex37_DNA21) ... 4 (Phet26_DNA21) lnL =-1913.389962 0.00862 6.08298 0.97563 t= 0.0086 S= 362.3 N= 1062.7 dN/dS= 0.9756 dN = 0.0029 dS = 0.0029 6 (Ptre197_DNA2) ... 1 (Pdel181_DNA2) lnL =-1961.098136 0.02585 4.25957 1.53161 t= 0.0258 S= 349.5 N= 1075.5 dN/dS= 1.5316 dN = 0.0094 dS = 0.0061 6 (Ptre197_DNA2) ... 2 (Pfre186_DNA2) lnL =-1965.694050 0.02802 4.83643 1.71212 t= 0.0280 S= 352.8 N= 1072.2 dN/dS= 1.7121 dN = 0.0104 dS = 0.0061 6 (Ptre197_DNA2) ... 3 (Pgra187_DNA2) lnL =-1901.130192 0.00676 4.05781 0.00100 t= 0.0068 S= 345.2 N= 1079.8 dN/dS= 0.0010 dN = 0.0000 dS = 0.0093 6 (Ptre197_DNA2) ... 
4 (Phet26_DNA21) lnL =-1941.618065 0.01954 6.78012 0.64571 t= 0.0195 S= 361.6 N= 1063.4 dN/dS= 0.6457 dN = 0.0057 dS = 0.0089 6 (Ptre197_DNA2) ... 5 (Pmex37_DNA21) lnL =-1953.852896 0.02372 5.63419 1.43296 t= 0.0237 S= 357.2 N= 1067.8 dN/dS= 1.4330 dN = 0.0086 dS = 0.0060 7 (WHR1_DNA225) ... 1 (Pdel181_DNA2) lnL =-1936.561781 0.01511 2.61161 0.72506 t= 0.0151 S= 337.6 N= 1087.4 dN/dS= 0.7251 dN = 0.0046 dS = 0.0064 7 (WHR1_DNA225) ... 2 (Pfre186_DNA2) lnL =-1941.975543 0.01725 3.35589 0.90347 t= 0.0172 S= 345.7 N= 1079.3 dN/dS= 0.9035 dN = 0.0056 dS = 0.0062 7 (WHR1_DNA225) ... 3 (Pgra187_DNA2) lnL =-1960.455637 0.02624 3.64066 0.41889 t= 0.0262 S= 345.9 N= 1079.1 dN/dS= 0.4189 dN = 0.0065 dS = 0.0156 7 (WHR1_DNA225) ... 4 (Phet26_DNA21) lnL =-1928.593448 0.01318 1.68562 0.26826 t= 0.0132 S= 322.2 N= 1102.8 dN/dS= 0.2683 dN = 0.0027 dS = 0.0101 7 (WHR1_DNA225) ... 5 (Pmex37_DNA21) lnL =-1928.718524 0.01300 3.82266 0.60956 t= 0.0130 S= 350.1 N= 1074.9 dN/dS= 0.6096 dN = 0.0037 dS = 0.0061 7 (WHR1_DNA225) ... 6 (Ptre197_DNA2) lnL =-1954.005346 0.02397 5.04975 0.54692 t= 0.0240 S= 355.1 N= 1069.9 dN/dS= 0.5469 dN = 0.0066 dS = 0.0121 8 (YALD273_DNA5) ... 1 (Pdel181_DNA2) lnL =-1921.561815 0.01073 3.11377 1.18654 t= 0.0107 S= 342.5 N= 1082.5 dN/dS= 1.1865 dN = 0.0037 dS = 0.0031 8 (YALD273_DNA5) ... 2 (Pfre186_DNA2) lnL =-1927.106138 0.01287 4.25516 1.54964 t= 0.0129 S= 352.0 N= 1073.0 dN/dS= 1.5496 dN = 0.0047 dS = 0.0030 8 (YALD273_DNA5) ... 3 (Pgra187_DNA2) lnL =-1948.501568 0.02179 4.30601 0.45869 t= 0.0218 S= 349.8 N= 1075.2 dN/dS= 0.4587 dN = 0.0056 dS = 0.0123 8 (YALD273_DNA5) ... 4 (Phet26_DNA21) lnL =-1881.846112 0.00001 8.33165 0.18598 t= 0.0000 S= 369.5 N= 1055.5 dN/dS= 0.1860 dN = 0.0000 dS = 0.0000 8 (YALD273_DNA5) ... 5 (Pmex37_DNA21) lnL =-1913.389962 0.00862 6.08296 0.97563 t= 0.0086 S= 362.3 N= 1062.7 dN/dS= 0.9756 dN = 0.0029 dS = 0.0029 8 (YALD273_DNA5) ... 
6 (Ptre197_DNA2) lnL =-1941.618065 0.01954 6.78014 0.64571 t= 0.0195 S= 361.6 N= 1063.4 dN/dS= 0.6457 dN = 0.0057 dS = 0.0089 8 (YALD273_DNA5) ... 7 (WHR1_DNA225) lnL =-1928.593448 0.01318 1.68561 0.26826 t= 0.0132 S= 322.2 N= 1102.8 dN/dS= 0.2683 dN = 0.0027 dS = 0.0101 9 (Pop_trich_ch) ... 1 (Pdel181_DNA2) lnL =-1921.561815 0.01073 3.11378 1.18654 t= 0.0107 S= 342.5 N= 1082.5 dN/dS= 1.1865 dN = 0.0037 dS = 0.0031 9 (Pop_trich_ch) ... 2 (Pfre186_DNA2) lnL =-1927.106138 0.01287 4.25517 1.54964 t= 0.0129 S= 352.0 N= 1073.0 dN/dS= 1.5496 dN = 0.0047 dS = 0.0030 9 (Pop_trich_ch) ... 3 (Pgra187_DNA2) lnL =-1948.501568 0.02179 4.30599 0.45869 t= 0.0218 S= 349.8 N= 1075.2 dN/dS= 0.4587 dN = 0.0056 dS = 0.0123 9 (Pop_trich_ch) ... 4 (Phet26_DNA21) lnL =-1881.845859 0.00001 0.40000 0.00100 t= 0.0000 S= 281.0 N= 1144.0 dN/dS= 0.0010 dN = 0.0000 dS = 0.0000 9 (Pop_trich_ch) ... 5 (Pmex37_DNA21) lnL =-1913.389962 0.00862 6.08299 0.97562 t= 0.0086 S= 362.3 N= 1062.7 dN/dS= 0.9756 dN = 0.0029 dS = 0.0029 9 (Pop_trich_ch) ... 6 (Ptre197_DNA2) lnL =-1941.618065 0.01954 6.78013 0.64571 t= 0.0195 S= 361.6 N= 1063.4 dN/dS= 0.6457 dN = 0.0057 dS = 0.0089 9 (Pop_trich_ch) ... 7 (WHR1_DNA225) lnL =-1928.593448 0.01318 1.68561 0.26826 t= 0.0132 S= 322.2 N= 1102.8 dN/dS= 0.2683 dN = 0.0027 dS = 0.0101 9 (Pop_trich_ch) ... 8 (YALD273_DNA5) lnL =-1881.846052 0.00001 0.40000 0.16519 t= 0.0000 S= 281.0 N= 1144.0 dN/dS= 0.1652 dN = 0.0000 dS = 0.0000 neighbor.dist100644000766000024 20114006427353 17027 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data 3 SINFRUP001 0.00000 0.16316 0.28603 SINFRUP002 0.16316 0.00000 0.27995 ENSP000002 0.28603 0.27995 0.00000 cysprot1a.msf100644000766000024 361314006427353 17033 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataPileUp MSF: 383 Type: P Check: 3696 .. 
Name: ALEU_HORVU oo Len: 383 Check: 9840 Weight: 34.4 Name: CATH_HUMAN oo Len: 383 Check: 7134 Weight: 30.1 Name: CYS1_DICDI oo Len: 383 Check: 6722 Weight: 35.4 // ALEU_HORVU MAHARVLLLA LAVLATAAVA VASSSSFADS NPIRPVTDRA ASTLESAVLG CATH_HUMAN ......MWAT LPLLCAGAWL LG........ VPVCGAAELS VNSLEK.... CYS1_DICDI .....MKVIL LFVLAVFTVF VS........ .......SRG IPPEEQ.... ALEU_HORVU ALGRTRHALR FARFAVRYGK SYESAAEVRR RFRIFSESLE EVRSTN.... CATH_HUMAN ........FH FKSWMSKHRK TYS.TEEYHH RLQTFASNWR KINAHN.... CYS1_DICDI ........SQ FLEFQDKFNK KYS.HEEYLE RFEIFKSNLG KIEELNLIAI ALEU_HORVU RKGLPYRLGI NRFSDMSWEE FQATRLG.AA QTCSATLAGN HLMRDAAA.. CATH_HUMAN NGNHTFKMAL NQFSDMSFAE IKHKYLWSEP QNCSATKS.. NYLRGTGP.. CYS1_DICDI NHKADTKFGV NKFADLSSDE FKNYYLNNKE AIFTDDLPVA DYLDDEFINS ALEU_HORVU LPETKDWRED G.IVSPVKNQ AHCGSCWTFS TTGALEAAYT QATGKNISLS CATH_HUMAN YPPSVDWRKK GNFVSPVKNQ GACGSCWTFS TTGALESAIA IATGKMLSLA CYS1_DICDI IPTAFDWRTR G.AVTPVKNQ GQCGSCWSFS TTGNVEGQHF ISQNKLVSLS ALEU_HORVU EQQLVDCAGG FNNF...... ..GCNGGLPS QAFEYIKYNG GIDTEESYPY CATH_HUMAN EQQLVDCAQD FNNY...... ..GCQGGLPS QAFEYILYNK GIMGEDTYPY CYS1_DICDI EQNLVDCDHE CMEYEGEEAC DEGCNGGLQP NAYNYIIKNG GIQTESSYPY ALEU_HORVU KGVNGVCHYK AENAAVQVLD SVNITLNAED ELKNAVGLVR PVSVAFQVID CATH_HUMAN QGKDGYCKFQ PGKAIGFVKD VANITIYDEE AMVEAVALYN PVSFAFEVTQ CYS1_DICDI TAETGTQCNF NSANIGAKIS NFTMIPKNET VMAGYIVSTG PLAIAADAVE ALEU_HORVU GFRQYKSGVY TSDHCGTTPD DVNHAVLAVG YGVEN..... GVPYWLIKNS CATH_HUMAN DFMMYRTGIY SSTSCHKTPD KVNHAVLAVG YGEKN..... GIPYWIVKNS CYS1_DICDI .WQFYIGGVF DIPCN...PN SLDHGILIVG YSAKNTIFRK NMPYWIVKNS ALEU_HORVU WGADWGDNGY FKMEMGKNMC AIATCASYPV VAA CATH_HUMAN WGPQWGMNGY FLIERGKNMC GLAACASYPI PLV CYS1_DICDI WGADWGEQGY IYLRRGKNTC GVSNFVSTSI I.. 
LOAD_Ccd1.dnd100644000766000024 145114006427353 16435 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data( ( ( A_aeolicus:0.27435, S_aureus:0.30536) :0.01589, ( M_leprae:0.08606, M_tubercul:0.06545) :0.32304) :0.02937, ( ( ( ( ( A_fulgidus:0.26385, T_maritima:0.22186) :0.04261, Synechocys:0.37630) :0.05068, C_jejuni:0.34050) :0.00557, ( M_jannasch:0.35330, M_thermoau:0.41813) :0.03001) :0.04323, ( ( A_tumefaci:0.19429, S_meliloti:0.17884) :0.13366, ( ( B_sp:0.23437, ( ( ( E_coli_A:0.10440, Y_pestis:0.10989) :0.04376, V_cholerae:0.16338) :0.01631, ( H_influenz:0.11186, P_multocid:0.10553) :0.07654) :0.02011) :0.09091, P_aerugino:0.26019) :0.04366) :0.02890) :0.00030, ( ( ( B_subtilis:0.27453, H_sp:0.29690) :0.01772, N_meningit:0.29801) :0.02281, ( C_crescent:0.33036, ( ( S_solfatar:0.06649, S_tokodaii:0.06208) :0.19814, ( T_acidophi:0.01448, T_volcaniu:0.01451) :0.31728) :0.01669) :0.01921) :0.00981); ECAPAH02.embl100644000766000024 74137414006427353 16400 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataID ECAPAH02 standard; DNA; PRO; 111408 BP. XX AC D10483; J01597; J01683; J01706; K01298; K01990; M10420; M10611; M12544; AC V00259; X04711; X54847; X54945; X55034; X56742; XX SV D10483.2 XX DT 06-AUG-1992 (Rel. 33, Created) DT 03-JUN-2002 (Rel. 72, Last updated, Version 62) XX DE Escherichia coli genomic DNA. (0.0 - 2.4 min). 
XX KW ant; antO; apaG; apaH; araA; araB; araC; araD; asmB; azi; brnP; caiA; caiB; KW caiC; caiD; caiE; caiF; caiT; carA; carB; chlG; KW Complete and shotgun sequencing; dapB; ddl; ddlB; dinA; divA; dnaJ; dnaK; KW envA; fixA; fixB; fixC; folA; fruR; ftsA; ftsI; ftsL; ftsQ; ftsW; ftsZ; KW gef; groP; hepA; htgA; htpY; ileS; ilvH; ilvI; imp; kefC; leuA; leuB; leuC; KW leuD; leuLP; lpxC; lspA; lueO; lytB; mog; mra; mraY; murC; murD; murE; KW murF; murG; murX; mutT; nhaA; nhaR; ostA; pbpB; pdxA; pea; polB; prlD; KW pyrA; rpsT; secA; sfiB; shl; sulB; surA; talB; tbpA; thrA; thrA1; thrA2; KW thrB; thrC; tmrA; trkC; yaaA; yaaC; yaaD; yaaF; yaaI; yaaJ; yaaT; yaaU; KW yaaV; yabB; yabC; yabE; yabF; yabH; yabI; yabJ; yabK; yabM; yabN; yabO; KW yabP; yabQ; yacA; yacG. XX OS Escherichia coli OC Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; OC Escherichia. XX RN [1] RP 1-111408 RA Mori H.; RT ; RL Submitted (18-FEB-1992) to the EMBL/GenBank/DDBJ databases. RL Hirotada Mori, Institute for Virus Research, Kyoto University, Genetics and RL Molecular Biology; 53 Shogoin Kawara-Machi, Sakyo-ku, Kyoto 606, Japan RL (E-mail:e52985@sakura.kudpc.kyoto-u.ac.jp, Tel:075-751-4042, RL Fax:075-761-5626) XX RN [2] RX MEDLINE; 79005683. RA Smith B.R., Schleif R.; RT "Nucleotide sequence of the L-arabinose regulatory region of Escherichia RT coli K12"; RL J. Biol. Chem. 253:6931-6933(1978). XX RN [3] RX MEDLINE; 78137003. RA Ohtsubo H., Ohtsubo E.; RT "Nucleotide sequence of an insertion element, IS1"; RL Proc. Natl. Acad. Sci. U.S.A. 75:615-619(1978). XX RN [4] RX MEDLINE; 79116194. RA Greenfield L., Boone T., Wilcox G.; RT "DNA sequence of the araBAD promoter in Escherichia coli B/r"; RL Proc. Natl. Acad. Sci. U.S.A. 75:4724-4728(1978). XX RN [5] RX MEDLINE; 79177885. RA Johnsrud L.; RT "DNA sequence of the transposable element IS1"; RL Mol. Gen. Genet. 169:213-218(1979). XX RN [6] RX MEDLINE; 81053692. RA Calvo J.M., Smith D.R.; RT "Nucleotide sequence of the E. 
coli gene coding for dihydrofolate RT reductase"; RL Nucleic Acids Res. 8:2255-2274(1980). XX RN [7] RX MEDLINE; 81124262. RA Miyada C.G., Horwitz A.H., Cass L.G., Timko J., Wilcox G.; RT "DNA sequence of the araC regulatory gene from Escherichia coli B/r"; RL Nucleic Acids Res. 8:5267-5274(1980). XX RN [8] RX MEDLINE; 81013881. RA Ogden S., Haggerty D., Stoner C.M., Kolodrubetz D., Schleif R.; RT "The Escherichia coli L-arabinose operon: binding sites of the regulatory RT proteins and a mechanism of positive and negative regulation"; RL Proc. Natl. Acad. Sci. U.S.A. 77:3346-3350(1980). XX RN [9] RX MEDLINE; 81077247. RA Katina M., Cossart P., Sibilli L., Saint-Girons I.; RT "Nucleotide sequence of the thrA gene of Escherichia coli"; RL Proc. Natl. Acad. Sci. U.S.A. 77:5730-5733(1980). XX RN [10] RX MEDLINE; 81264207. RA Mackie G.A.; RT "Nucleotide sequence of the gene for ribosomal protein S20 and its flanking RT regions"; RL J. Biol. Chem. 256:8177-8182(1981). XX RN [11] RX MEDLINE; 81150470. RA Cossart P., Katinka M., Yaniv M.; RT "Nucleotide sequence of the thrB gene of E. coli, and its two adjacent RT regions; the thrAB and thrBC junctions"; RL Nucleic Acids Res. 9:339-347(1981). XX RN [12] RX MEDLINE; 81199399. RA Lee N.L., Gielow W.O., Wallace R.G.; RT "Mechanism of araC autoregulation and the domains of two overlapping RT promoters, pC and pBAD, in the L-arabinose regulatory region of Escherichia RT coli"; RL Proc. Natl. Acad. Sci. U.S.A. 78:752-756(1981). XX RN [13] RX MEDLINE; 82216830. RA Stoner C.M., Schleif R.; RT "Is the amino acid but not nucleotide sequence of the Escherichia coli araC RT gene conserved?"; RL J. Mol. Biol. 154:649-652(1982). XX RN [14] RX MEDLINE; 83116968. RA Gilson E., Nikaido H., Hofnung M.; RT "Sequence of the malK gene in E.coli K12"; RL Nucleic Acids Res. 10:7449-7458(1982). XX RN [15] RX MEDLINE; 84069770. 
RA Parsot C., Cossart P., Saint-Girons I., Cohen G.N.; RT "Nucleotide sequence of thrC and of the transcription termination region of RT the threonine operon in Escherichia coli K12"; RL Nucleic Acids Res. 11:7331-7345(1983). XX RN [16] RX MEDLINE; 85054974. RA Bouvier J., Richaud C., Richaud F., Patte J., Stragier P.; RT "Nucleotide sequence and expression of the Escherichia coli dapB gene"; RL J. Biol. Chem. 259:14829-14834(1984). XX RN [17] RX MEDLINE; 84144800. RA Bardwell J.C., Craig E.A.; RT "Major heat shock gene of drosophila and the Escherichia coli heatinducible RT dnaK gene are homologous"; RL Proc. Natl. Acad. Sci. U.S.A. 81:848-852(1984). XX RN [18] RX MEDLINE; 84222028. RA Innis M.A., Tokunaga M., Williams M.E., Loranger J.M., Chang S., Chang S., RA Wu H.C.; RT "Nucleotide sequence of the Escherichia coli prolipoprotein signal RT peptidase (lsp) gene"; RL Proc. Natl. Acad. Sci. U.S.A. 81:3708-3712(1984). XX RN [19] RX MEDLINE; 84248073. RA Bouvier J., Patte J., Stragier P.; RT "Multiple regulatory signals in the control region of the Escherichia coli RT carAB operon"; RL Proc. Natl. Acad. Sci. U.S.A. 81:4139-4143(1984). XX RN [20] RX MEDLINE; 86030702. RA Chong P., Hui I., Loo T., Gillam S.; RT "Structural analysis of a new GC-specific insertion element IS186"; RL FEBS Lett. 192:47-52(1985). XX RN [21] RX MEDLINE; 85182715. RA Kamio Y., Lin C., Regue M., Wu H.C.; RT "Characterization of the ileX-lsp operon in Escherichia coli: RT Identification of an open reading frame upstream of the ileX gene and RT potential promoter(s) for the ileS-lsp operon"; RL J. Biol. Chem. 260:5616-5620(1985). XX RN [22] RX MEDLINE; 85295470. RA Friedberg D., Rosenthal E.R., Jones J.W., Calvo J.M.; RT "Characterization of the 3' end of the leucine operon of Salmonella RT typhimurium"; RL Mol. Gen. Genet. 199:486-494(1985). XX RN [23] RX MEDLINE; 85190560. RA Cowing D.W.; RT "Consensus sequence for Escherichia coli heatshock gene promoters"; RL Proc. Natl. Acad. Sci. U.S.A. 
82:2679-2683(1985). XX RN [24] RA Sekiguchi T., Ortega-Cesena J., Nosoh Y., Ohashi S., Tsuda K., Kanaya S.; RT "DNA and amino-acid sequences of 3-isopropylmalate dehydrogenase of RT Bacillus coagulans. Comparison with the enzymes of Saccharomyces cerevisiae RT and Thermus thermophilus"; RL Biochim. Biophys. Acta 867:36-44(1986). XX RN [25] RX MEDLINE; 87163495. RA Lee N., Gielow W., Martin R., Hamilton E., Fowler A.; RT "The organization of the araBAD operaon of Escherichia coli"; RL Gene 47:231-244(1986). XX RN [26] RX MEDLINE; 86111849. RA Ohki M., Tamura F., Nishimura S., Uchida H.; RT "Nucleotide sequence of the Escherichia coli dnaJ gene and purification of RT the gene product"; RL J. Biol. Chem. 261:1778-1781(1986). XX RN [27] RX MEDLINE; 87172305. RA Blanchin-Roland S., Blanquet S., Schmitter J., Fayat G.; RT "The gene for Escherichia coli diadenosine tetraphosphatase is located RT immediately clockwise to folA and forms an operon with ksgA"; RL Mol. Gen. Genet. 205:515-522(1986). XX RN [28] RX MEDLINE; 87016337. RA Mackie G.A.; RT "Structure of the DNA distal to the gene for ribosomal protein S20 in RT Escherichia coli K12: Presence of a strong terminator and an IS1 element"; RL Nucleic Acids Res. 14:6965-6981(1986). XX RN [29] RX MEDLINE; 86287338. RA Birnbaum M.J., Haspel H.C., Rosen O.M.; RT "Cloning and characterization of a cDNA encoding the rat brain RT glucose-transporter protein"; RL Proc. Natl. Acad. Sci. U.S.A. 83:5784-5788(1986). XX RN [30] RX MEDLINE; 87308226. RA Nonet M.L., Marvel C.C., Tolan D.R.; RT "The hisT-purF region of the Escherichia coli K-12 chromosome. RT Identification of additional genes of the hisT and purF operons"; RL J. Biol. Chem. 262:12209-12217(1987). XX RN [31] RX MEDLINE; 87146339. 
RA Gronger P., Manian S.S., Reilander H., O'Connell M., Priefer U.B., RA Puhler A.; RT "Organization and partial sequence of a DNA region of the Rhizobium RT leguminosarum symbiotic plasmid pRL6JI containing the genes fixABC, nifA, RT nifB and a novel operon reading frame"; RL Nucleic Acids Res. 15:31-49(1987). XX RN [32] RX MEDLINE; 89005119. RA Lozoya E., Hoffmann H., Douglas C., Schulz W., Scheel D., Hahlbrock K.; RT "Primary structures and catalytic properties of isoenzymes encoded by the RT two 4-coumarate:CoA ligase genes in parsley"; RL Eur. J. Biochem. 176:661-667(1988). XX RN [33] RX MEDLINE; 88273145. RA Karpel R., Olami Y., Taglicht D., Schuldiner S., Padan E.; RT "Sequencing of the gene ant which affects the Na+/H+ antiporter activity in RT Escherichia coli"; RL J. Biol. Chem. 263:10408-10414(1988). XX RN [34] RX MEDLINE; 88330879. RA Jaiswal A.K., McBride O.W., Adesnik M., Nebert D.W.; RT "Human dioxin-inducible cytosolic NAD(P)H:menadione oxidoreductase. cDNA RT sequence and localization of gene to chromosome 16"; RL J. Biol. Chem. 263:13572-13578(1988). XX RN [35] RX MEDLINE; 90032688. RA Minami-Ishii N., Taketani S., Osumi T., Hashimoto T.; RT "Molecular cloning and sequence analysis of the cDNA for rat mitochondrial RT enoyl-CoA hydratase. Structural and evolutionary relationships linked to RT the bifunctional enzyme of the peroxisomal beta-oxidation system"; RL Eur. J. Biochem. 185:73-78(1989). XX RN [36] RX MEDLINE; 89252065. RA Sato S., Nakada Y., Shiratsuchi A.; RT "IS421, a new insetion sequence in Escherichia coli"; RL FEBS Lett. 249:21-26(1989). XX RN [37] RX MEDLINE; 89359108. RA Roa B.B., Connolly D.M., Winkler M.E.; RT "Overlap between pdxA and ksgA in the complex pdxA-ksgA-apaG-apaH operon of RT Escherichia coli K-12"; RL J. Bacteriol. 171:4767-4777(1989). XX RN [38] RX MEDLINE; 89380240. 
RA Matsubara Y., Indo Y., Naito E., Ozasa H., Glassberg R., Vockley J., RA Ikeda Y., Kraus J., Tanaka K.; RT "Molecular cloning and nucleotide sequence of cDNAs encoding the precursors RT of rat long chain acyl-coenzyme A, short chain acyl-coenzyme A, and RT isovaleryl-coenzyme A dehydrogenases. Sequence homology of four enzymes of RT the acyl-CoA dehydrogenase famil"; RL J. Biol. Chem. 264:16321-16331(1989). XX RN [39] RA Stragier P.; RT ; RL Unpublished. XX RN [40] RA Ishino Y.; RT ; RL Unpublished. XX RN [41] RA Chen H., Sun Y., Stark T., Beattie W., Moses R.E.; RT "Nucleotide sequence and deletion analysis of the polB gene of E. coli"; RL DNA Cell Biol. 9:613-635(1990). XX RN [42] RX MEDLINE; 90235831. RA Schaaff I., Hohmann S., Zimmermann F.K.; RT "Molecular analysis of the structural gene for yeast transaldolase"; RL Eur. J. Biochem. 188:597-603(1990). XX RN [43] RX MEDLINE; 90130288. RA Angerer A., Gaisser S., Braun V.; RT "Nucleotide sequences of the sfuA, sfuB, and sfuC genes of Serratia RT marcescens suggest a periplasmic-binding-protein-dependent iron transport RT mechanism"; RL J. Bacteriol. 172:572-578(1990). XX RN [44] RX MEDLINE; 91072253. RA Mallonee D.H., White W.B., Hylemon P.B.; RT "Cloning and sequencing of a bile acid-inducible operon from Eubacterium RT sp. strain VPI 12708"; RL J. Bacteriol. 172:7011-7019(1990). XX RN [45] RX MEDLINE; 90206796. RA Ricca E., Calvo J.M.; RT "The nucleotide sequence of leuA from Salmonella typhimurium"; RL Nucleic Acids Res. 18:1290-1290(1990). XX RN [46] RX MEDLINE; 90272436. RA Rosenthal E.R., Calvo J.M.; RT "The nucleotide sequence of leuC from Salmonella typhimurium"; RL Nucleic Acids Res. 18:3072-3072(1990). XX RN [47] RX MEDLINE; 91088346. RA Chen H., Lawrence C.B., Bryan S.K., Moses R.E.; RT "Aphidicolin inhibits DNA polymerase II of Escherichia coli, an alpha-like RT DNA polymerase"; RL Nucleic Acids Res. 18:7185-7186(1990). 
XX RN [48] RA Ayala J.A.; RT "Regulation of transcription at 2-minute region of the genetic map of RT Escherichia coli"; RL Unpublished. XX RN [49] RX MEDLINE; 91192599. RA Umeda M., Ohtsubo E.; RT "Four types of IS1 with differences in nucleotide sequence reside in the RT Escherichia coli K-12 chromosome"; RL Gene 98:1-5(1991). XX RN [50] RX MEDLINE; 91203829. RA Arigoni F., Kaminski P.A., Hennecke H., Elmerich C.; RT "Nucleotide sequence of the fixABC region of Azorhizobium caulinodans RT ORS571: similarity of the fixB product with eukaryotic flavoproteins, RT characterization of fixX, and identification of nifW"; RL Mol. Gen. Genet. 225:514-520(1991). XX RN [51] RX MEDLINE; 91260444. RA Munro A.W., Ritchie G.Y., Lamb A.J., Douglas R.M., Booth I.R.; RT "The cloning and DNA sequence of the gene for the glutathione-regulated RT potassium-efflux system KefC of Escherichia coli"; RL Mol. Microbiol. 5:607-616(1991). XX RN [52] RX MEDLINE; 92048480. RA Poulsen L.K., Refn A., Molin S., Andersson P.; RT "Topographic analysis of the toxic Gef protein from Escherichia coli"; RL Mol. Microbiol. 5:1627-1637(1991). XX RN [53] RX MEDLINE; 92048481. RA Poulsen L.K., Refn A., Molin S., Andersson P.; RT "The gef gene from Escherichia coli is regulated at the level of RT translation"; RL Mol. Microbiol. 5:1639-1648(1991). XX RN [54] RX MEDLINE; 91187617. RA Bouvier J., Stragier P.; RT "Nucleotide sequence of the lsp-dapB interval in Escherichia coli"; RL Nucleic Acids Res. 19:180-180(1991). XX RN [55] RP 1-111401 RX MEDLINE; 92334977. RA Yura T., Mori H., Nagai H., Nagata T., Ishihama A., Fujita N., Isono K., RA Mizobuchi K., Nakata A.; RT "Systematic sequencing of the Escherichia coli genome: analysis of the RT 0-2.4min region"; RL Nucleic Acids Res. 20:3305-3308(1992). XX RN [56] RX MEDLINE; 94261430. 
RA Fujita N., Mori H., Yura T., Ishihama A.; RT "Systematic sequencing of the Escherichia coli genome: analysis of the RT 2.4-4.1 min (110,917-193,643 bp) region"; RL Nucleic Acids Res. 22:1637-1639(1994). XX DR SWISS-PROT; P00907; CARA_ECOLI. DR SWISS-PROT; P00968; CARB_ECOLI. DR SWISS-PROT; P09151; LEU1_ECOLI. DR SWISS-PROT; P28694; MOG_ECOLI. DR SWISS-PROT; P28695; YAAH_ECOLI. DR SWISS-PROT; P28696; YAAI_ECOLI. DR SWISS-PROT; P28697; HTGA_ECOLI. DR SWISS-PROT; P30127; LEU2_ECOLI. DR SWISS-PROT; P30143; YAAJ_ECOLI. DR SWISS-PROT; P31554; OSTA_ECOLI. DR SWISS-PROT; P31675; SETA_ECOLI. DR SWISS-PROT; P31680; DJLA_ECOLI. DR SWISS-PROT; P33595; YABN_ECOLI. DR SWISS-PROT; P39206; CAIE_ECOLI. DR SWISS-PROT; P39219; RLUA_ECOLI. DR SWISS-PROT; P39220; YABP_ECOLI. DR SWISS-PROT; P39221; YABQ_ECOLI. DR SWISS-PROT; P46145; YAAV_ECOLI. XX CC Submitted (18-FEB-1992) to DDBJ by: CC Hirotada Mori CC Institute for Virus Research CC Kyoto University CC Kawahara-cho Shogoin Sakyo-ku CC Kyoto 606 CC Phone: 075-751-4042 CC Fax: 075-761-5626 CC Sequence updated (21-DEC-1992) by: CC Hirotada Mori XX FH Key Location/Qualifiers FH FT source 1..111408 FT /db_xref="taxon:562" FT /note="Nucleotide position 1-111408 from the initiation FT site of ThrA (0 min.)." 
FT /sequenced_mol="DNA" FT /organism="Escherichia coli" FT /map="0-2.4 min" FT /strain="K-12" FT CDS 1..2463 FT /note="ORF_ID:o101#1" FT /note="similar to PIR Accession Number A00671" FT /transl_table=11 FT /gene="thrA" FT /gene="thrA1" FT /gene="thrA2" FT /product="ThrA bifunctional enzyme" FT /protein_id="BAB96579.1" FT /translation="MRVLKFGGTSLANAERFLRVADILESNARQGQVATVLSAPAKITN FT HLVAMIEKTISGQDALPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHV FT LHGISLLGQCPDSINAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLES FT TVDIAESTRRIAASRIPADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADC FT CEIWTDVNGVYTCDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCL FT IKNTGNPQAPGTLIGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMS FT RARISVVLITQSSSEYSISFCVPQSDCVRAERAMLEEFYLELKEGLLEPLAVAERLAII FT SVVGDGLRTLRGISAKFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQM FT LFNTDQVIEVFVIGVGGVGGALLEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLN FT LENWQEELAQAKEPFNLGRLIRLVKEYHLLNPVIVNCTSSQAVADQYADFLREGFHVVT FT PNKKANTSSMDYYHQLRYAAEKSRRKFLYDINVGAGLPVIENLQNLLNAGDELMKFSGI FT LSGSLSYIFGKLDEGMSFSEATRLAREMGYTEPDPRDDLSGMDVARKLLILARETGREL FT ELADIEIEPVLPAEFNAEGDVAAFMANLSQLDDLFAARVAKARDEGKVLRYVGNIDEDG FT VCRVKIAEVDGNDPLFKVKNGENALAFYSHYYQPLPLVLRGYGAGNDVTAAGVFADLLR FT TLSWKLGV" FT CDS 2465..3394 FT /note="ORF_ID:o101#2" FT /note="similar to PIR Accession Number A00658" FT /transl_table=11 FT /gene="thrB" FT /product="Homoserine kinase (EC 2.7.1.39)" FT /protein_id="BAB96580.1" FT /translation="MVKVYAPASSANMSVGFDVLGAAVTPVDGALLGDVVTVEAAQTFS FT LNNLGRFADKLPSEPRENIVYQCWERFCQELGKQIPVAMTLEKNMPIGSGLGSSACSVV FT AALMAMNEHCGKPLNDTRLLALMGELEGRISGSIHYDNVAPCFLGGMQLMIEENDIISQ FT QVQGLMSGCGCSRIRGLKSRRQKQGYLPAQYRRQDCIAHGRHLAGFIHACYSRQPELAA FT KLMKDVIAEPYRERLLPGFRQARQAVAEIGAVASGISGSGPTLFALCDKPETAQRVADW FT LGKNYLQNQEGFVHICRLDTAGARVLEN" FT CDS 3395..4681 FT /note="ORF_ID:o101#3" FT /note="similar to PIR Accession Number A01157" FT /transl_table=11 FT /gene="thrC" FT /product="Threonine synthase (EC 4.2.99.2)" FT /protein_id="BAB96581.1" FT 
/translation="MKLYNLKDHNEQVSFAQAVTQGLGKNQGLFFPHDLPEFSLTEIDE FT MLKLDFVTRSAKILSAFIGDEIPQEILEERVRAAFAFPAPVANVESDVGCLELFHGPTL FT AFKDFGGRFMAQMLTHIAGDKPVTILTATSGDTGAAVAHAFYGLPNVKVVILYPRGKIS FT PLQEKLFCTLGGNIETVAIDGDFDACQALVKQAFDDEELKVALGLNSANSINISRLLAQ FT ICYYFEAVAQLPQETRNQLVVSVPSGNFGDLTAGLLAKSLGLPVKRFIAATNVNDTVPR FT FLHDGQWSPKATQATLSNAMDVSQPNNWPRVEELFRRKIWQLKELGYAAVDDETTQQTM FT RELKELGYTSEPHAAVAYRALRDQLNPGEYGLFLGTAHPAKFKESVEAILGETLDLPKE FT LAERADLPLLSHNLPADFAALRKLMMNHQ" FT CDS 4895..5191 FT /note="ORF_ID:o101#4" FT /transl_table=11 FT /protein_id="BAB96582.1" FT /translation="MKKMQSIVLALSLVLVAPMAAQAAEITLVPSVKLQIGDRDNRGYY FT WDGGHWRDHGWWKQHYEWRGNRWHLHGPPPPPRHHKKAPHDHHGGHGPGKHHR" FT CDS complement(4971..5402) FT /note="ORF_ID:o101#5" FT /transl_table=11 FT /protein_id="BAB96583.1" FT /translation="MMKIPPAMANWCLNATSSVKFAVNAGCGVNALSCLQKHADSIYCR FT INVGLIRRASVASGAECRIVTRHLSFSGDVCQDHDRRDDHEELSYGGAAVAAVRVGASD FT CLAIHNVVSTSRGRASDLHPSNSHDYHDRLSVILLTGLT" FT CDS complement(5344..6120) FT /note="ORF_ID:o101#6" FT /note="similar to PIR Accession Number S40534" FT /transl_table=11 FT /gene="yaaA" FT /product="Hypothetical protein" FT /protein_id="BAB96584.1" FT /translation="MLILISPAKTLDYQSPLTTTRYTLPELLDNSQQLIHEARKLTPPQ FT ISTLMRISDKLAGINAARFHDWQPDFTPANARQAILAFKGDVYTGLQAETFSEDDFDFA FT QQHLRMLSGLYGVLRPLDLMQPYRLEMGIRLENARGKDLYQFWGDIITNKLNEALAAQG FT DNVVINLASDEYFKSVKPKKLNAEIIKPVFLDEKNGKFKIISFYAKKARGLMSRFIIEN FT RLTKPEQLTGFNSEGYFFDEDSSSNGELVFKRYEQR" FT CDS complement(6190..7620) FT /note="ORF_ID:o101#7" FT /note="similar to SwissProt Accession Number P30143" FT /transl_table=11 FT /gene="yaaJ" FT /product="Hypothetical 51.7 kd protein in thrC-talB FT intergenic region (ORF8)." 
FT /protein_id="BAB96585.1" FT /translation="MPDFFSFINSVLWGSVMIYLLFGAGCWFTFRTGFVQFRYIRQFGK FT SLKNSIHPQPGGLTSFQSLCTSLAARVGSGNLAGVALAITAGGPGAVFWMWVAAFIGMA FT TSFAECSLAQLYKERDVNGQFRGGPAWYMARGLGMRWMGVLFAVFLLIAYGIIFSGVQA FT NAVARALSFSFDFPPLVTGIILAVFTLLAITRGLHGVARLMQGFVPLMAIIWVLTSLVI FT CVMNIGQLPHVIWSIFESAFGWQEAAGGAAGYTLSQAITNGFQRSMFSNEAGMGSTPNA FT AAAAASWPPHPAAQGIVQMIGIFIDTLVICTASAMLILLAGNGTTYMPLEGIQLIQKAM FT RVLMGSWGAEFVTLVVILFAFSSIVANYIYAENNLFFLRLNNPKAIWCLRICTFATVIG FT GTLLSLPLMWQLADIIMACMAITNLTAILLLSPVVHTIASDYLRQRKLGVRPVFDPLRY FT PDIGRQLSPDAWDDVSQE" FT CDS 7899..8852 FT /note="ORF_ID:o101#8" FT /note="similar to PIR Accession Number S40535" FT /transl_table=11 FT /gene="talB" FT /product="Hypothetical protein" FT /protein_id="BAB96586.1" FT /translation="MTDKLTSLRQYTTVVADTGDIAAMKLYQPQDATTNPSLILNAAQI FT PEYRKLIDDAVAWAKQQSNDRAQQIVDATDKLAVNIGLEILKLVPGRISTEVDARLSYD FT TEASIAKAKRLIKLYNDAGISNDRILIKLASTWQGIRAAEQLEKEGINCNLTLLFSFAQ FT ARACAEAGVFLISPFVGRILDWYKANTDKKEYAPAEDPGVVSVSEIYQYYKEHGYETVV FT MGASFRNIGEILELAGCDRLTIAPALLKELAESEGAIERKLSYTGEVKARPARITESEF FT LWQHNQDPMAVDKLAEGIRKFAIDQEKLEKMIGDLL" FT CDS 9136..9552 FT /note="ORF_ID:o102#1" FT /note="similar to PIR Accession Number B56688" FT /transl_table=11 FT /gene="chlG" FT /gene="mog" FT /product="Molybdopterin biosynthesis Mog protein." 
FT /protein_id="BAB96587.1" FT /translation="MCELVDEMSCHLVLTTGGTGPARRDVTPDATLAVADREMPGFGEQ FT MRQISLHFVPTAILSRQVGVIRKQALILNLPGQPKSIKETLEGVKDAEGNVVVHGIFAS FT VPYCIQLLEGPYVETAPEVVAAFRPKSARRDVSE" FT CDS complement(9584..10150) FT /note="ORF_ID:o102#2" FT /note="similar to PIR Accession Number E56688" FT /transl_table=11 FT /product="hgtA 5'-region hypothetical protein 1" FT /protein_id="BAB96588.1" FT /translation="MGNTKLANPAPLGLMGFGMTTILLNLHNVGYFALDGIILAMGIFY FT GGIAQIFAGLLEYKKGNTFGLTAFTSYGSFWLTLVAILLMPKLGLTDAPNAQFLGVYLG FT LWGVFTVFMFFGTLKGARVLQFVFFSLTVLFALLAIGNIAGNAAIIHFAGWIGLICGAS FT AIYLAMGEVLNEQFGRTVLPIGESH" FT misc_feature 10479..10963 FT /note="ORF is interrupted by frame-shifting" FT /note="ORF_ID:o102#3" FT /note="similar to PIR Accession Number A40623" FT /gene="htgA" FT /gene="htpY" FT /product="Heat shock protein Y" FT misc_feature complement(11030..11435) FT /note="ORF is interrupted by frame-shifting" FT /note="ORF_ID:o102#4" FT /note="similar to PIR Accession Number C56688" FT /gene="yaaI" FT /product="dnaK 5'-region hypothetical protein 1" FT CDS 11812..13728 FT /note="ORF_ID:o102#5" FT /note="similar to PIR Accession Number A03311" FT /transl_table=11 FT /gene="dnaK" FT /product="DnaK protein" FT /protein_id="BAB96589.1" FT /translation="MGKIIGIDLGTTNSCVAIMDGTTPRVLENAEGDRTTPSIIAYTQD FT GETLVGQPAKRQAVTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPFKIIAADNGDAWVE FT VKGQKMAPPQISAEVLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGL FT EVKRIINEPTAAALAYGLDKGTGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATN FT GDTHLGGEDFDSRLINYLVEEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSAQQTDV FT NLPYITADATGPKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQDAGLSVSDIDDVILV FT GGQTRMPMVQKKVAEFFGKEPRKDVNPDEAVAIGAAVQGGVLTGDVKDVLLLDVTPLSL FT GIETMGGVMTTLIAKNTTIPTKHSQVFSTAEDNQSAVTIHVLQGERKRAADNKSLGQFN FT LDGINPAPRGMPQIEVTFDIDADGILHVSAKDKNSGKEQKITIKASSGLNEDEIQKMVR FT DAEANAEADRKFEELVQTRNQGDHLLHSTRKQVEEAGDKLPADDKTAIESALTALETAL FT KGEDKAAIEAKMQELAQVSQKLMEIAQQQHAQQQTAGADASANNAKDDDVVDAEFEEVK FT DKK" FT CDS 13817..14947 FT 
/note="ORF_ID:o102#6" FT /note="similar to SwissProt Accession Number P08622" FT /transl_table=11 FT /gene="dnaJ" FT /gene="groP" FT /product="DnaJ protein." FT /protein_id="BAB96590.1" FT /translation="MAKQDYYEILGVSKTAEEREIRKAYKRLAMKYHPDRNQGDKEAEA FT KFKEIKEAYEVLTDSQKRAAYDQYGHAAFEQGGMGGGGFGGGADFSDIFGDVFGDIFGG FT GRGRQRAARGADLRYNMELTLEEAVRGVTKEIRIPTLEECDVCHGSGAKPGTQPQTCPT FT CHGSGQVQMRQGFFAVQQTCPHCQGRGTLIKDPCNKCHGHGRVERSKTLSVKIPAGVDT FT GDRIRLAGEGEAGEHGAPAGDLYVQVQVKQHPIFEREGNNLYCEVPINFAMAALGGEIE FT VPTLDGRVKLKVPGETQTGKLFRMRGKGVKSVRGGAQGDLLCRVVVETPVGLNERQKQL FT LQELQESFGGPTGEHNSPRSKSFFDGVKKFFDDLTR" FT CDS complement(16400..16609) FT /note="ORF_ID:o102#7" FT /note="similar to PIR Accession Number S16473" FT /transl_table=11 FT /gene="gef" FT /product="Gef protein" FT /protein_id="BAB96591.1" FT /translation="MLNTCRVPLTDRKVKEKRAMKQHKAMIVALIVICITAVVAALVTR FT KDLCEVHIRTGQTEVAVFTAYESE" FT CDS 17138..18304 FT /note="ORF_ID:o102#8" FT /note="similar to SwissProt Accession Number P13738" FT /transl_table=11 FT /gene="ant" FT /gene="nhaA" FT /product="Na(+)/H(+) antiporter 1." FT /protein_id="BAB96592.1" FT /translation="MKHLHRFFSSDASGGIILIIAAILAMIMANSGATSGWYHDFLETP FT VQLRVGSLEINKNMLLWINDALMAVFFLLVGLEVKRELMQGSLASLRQAAFPVIAAIGG FT MIVPALLYLAFNYADPITREGWAIPAATDIAFALGVLALLGSRVPLALKIFLMALAIID FT DLGAIIIIALFYTNDLSMASLGVAAVAIAVLAVLNLCGARRTGVYILVGVVLWTAVLKS FT GVHATLAGVIVGFFIPLKEKHGRSPAKRLEHVLHPWVAYLILPLFAFANAGVSLQGVTL FT DGLTSILPLGIIAGLLIGKPLGISLFCWLALRLKLAHLPEGTTYQQIMVVGILCGIGFT FT MSIFIASLAFGSVDPELINWAKLGILVGSISSAVIGYSWLRVRLRPSV" FT misc_feature 18364..19283 FT /note="ORF is interrupted by frame-shifting" FT /note="ORF_ID:o103#1" FT /note="similar to SwissProt Accession Number P10087" FT /gene="antO" FT /gene="nhaR" FT /product="Transcriptional activator protein NhaR." 
FT CDS complement(20478..20741) FT /note="ORF_ID:o103#2" FT /note="similar to PIR Accession Number A30425" FT /transl_table=11 FT /gene="rpsT" FT /product="Ribosomal protein S20" FT /protein_id="BAB96593.1" FT /translation="MANIKSAKKRAIQSEKARKHNASRRSMMRTFIKKVYAAIEAGDKA FT AAQKAFNEMQPIVDRQAAKGLIHKNKAARHKANLTAQINKLA" FT CDS 21070..22008 FT /note="ORF_ID:o103#3" FT /note="similar to PIR Accession Number A22609" FT /transl_table=11 FT /gene="yaaC" FT /product="Hypothetical 35k protein (ileS-lsp operon)" FT /protein_id="BAB96594.1" FT /translation="MKLIRGIHNLSQAPQEGCVLTIGNFDGVHRGHRALLQGLQEEGRK FT RNLPVMVMLFEPQPLELFATDKAPARLTRLREKLRYLAECGVDYVLCVRFDRRFAALTA FT QNFVSDLLVKHLRVKFLAVGDDFPLALVVKAISCYYRKLAWNTASISPVRKLFAEVACA FT SAARLRQALADDNLALAESLLGHPFAISGRVVHGDELGRTIGFPTANVPPRRQVSPVKG FT VYAVEVLGLGEKPLPGVANIGTRPTVAGIRQQLEVHLLDVAMDLYGRHIQVVLRKKIRN FT EQRFASLDELKAQIARDELTAREFFGLTKPA" FT CDS 22051..24861 FT /note="ORF_ID:o103#4" FT /note="similar to PIR Accession Number S40549" FT /transl_table=11 FT /gene="ileS" FT /product="Isoleucine--tRNA ligase (EC 6.1.1.5)" FT /protein_id="BAB96595.1" FT /translation="MSDYKSTLNLPETGFPMRGDLAKREPGMLARWTDDDLYGIIRAAK FT KGKKTFILHDGPPYANGSIHIGHSVNKILKDIIVKSKGLSGYDSPYVPGWDCHGLPIEL FT KVEQEYGKPGEKFTAAEFRAKCREYAATQVDGQRKDFIRLGVLGDWSHPYLTMGLQNWK FT ANIIRALGKIIGNGHLHKGAKPVHWCVDCRSALAKRKLSITTKLLRPSTLLSRAVDQDA FT LKAKFAVSNVNGPISLVIWTTTPWTLPANRAISIAPDFDYALVQIDGQAVILAKDLVES FT VMQRIGVTDSRHGKRCGAGAAAFTHPFMGFDVPAILGDHVTLDAGTGAVHTAPGHGPDD FT YVIGQKYGLETANPVGPDGTYLPGTYPTLDGVNVFKANDIVAALLQEKGALLHVEKMQH FT SYPCCWRHKTPIIFRATPQWFVSMDQKGLRAQSLKEIKGVQWIPDWGQARIESMVANRP FT DWCISRQRTWGVPMSLFVHKDTEELHPRTLELMEEVAKRVEVDGIQAWWDLDAKEILGD FT EADQYVKVPDTLDVWFDSGSTHSSVVDVRPEFAGHAADMYLEGSDQHRGWFMSSLMIST FT AMKGKAPYRQVLTHGFTVDGQGRKMSKSIGNTVSPQDVMNKLGADILRLWVASTDYTGE FT MAVSDEILKRAADTYRRIRNTARFLLANLNGFDPAKDMVKRREMVVLDRWAVVVRKRHR FT KTSSRRTKHTISTKWYKRLMRFCSVEMGSFYLDIIKDRQYYAKGHSVARRSCQTALYHI FT AEALVRWMAPILSFTADEVWGYLPGEREKYVFTGEWYEGLFGLADSEAMNDAFWDELLK FT 
VRGEVNKVIEQARADKKVGGSLEAAVTLYAEPELSAKLTALGDELRFVLLTSGATVADY FT NDAPADAQQSEVLKGLKVALSKAEGEKCPRCWHYTQDVGKVAEHAEICGRCVSNVAGDG FT EKRKFA" FT CDS 24861..25355 FT /note="ORF_ID:o103#5" FT /note="similar to SwissProt Accession Number P00804" FT /transl_table=11 FT /gene="lspA" FT /product="Lipoprotein signal peptidase (EC 3.4.23.36) FT (Prolipoprotein signal peptidase) (Signal peptidase II) FT (Spase II)." FT /protein_id="BAB96596.1" FT /translation="MSQSICSTGLRWLWLVVVVLIIDLGSKYLILQNFALGDTVPLFPS FT LNLHYARNYGAAFSFLADSGGWQRWFFAGIAIGISVILAVMMYRSKATQKLNNIAYALI FT IGGALGNLFDRLWHGFVVDMIDFYVGDWHFATFNLADTAICVGAALIVLEGFLPSRAKK FT Q" FT CDS 25480..25929 FT /note="ORF_ID:o103#6" FT /note="similar to PIR Accession Number JE0402" FT /transl_table=11 FT /gene="yaaD" FT /product="Hypothetical 16.4K protein (lsp-dapB intergenic FT region)" FT /protein_id="BAB96597.1" FT /translation="MSESVQSNSAVLVHFTLKLDDGTTAESTRNNGKPALFRLGDASLS FT EGLEQHLLGLKVGDKTTFSLEPDAAFGVPSPDLIQYFSRREFMDAGEPEIGAIMLFTAM FT DGSEMPGVIREINGDSITVDFNHPLAGQTVHFDIEVLEIDPALEA" FT CDS 25931..26881 FT /note="ORF_ID:o103#7" FT /note="similar to PIR Accession Number JE0403" FT /transl_table=11 FT /gene="lytB" FT /product="Hypothetical 34.8k protein (lsp-dapB intergenic FT region)" FT /protein_id="BAB96598.1" FT /translation="MQILLANPRGFCAGVDRAISIVENALAIYGAPIYVRHEVVHNRYV FT VDSLRERGAIFIEQISEVPDGAILIFSAHGVSQAVRNEAKSRDLTVFDATCPLVTKVHM FT EVARASRRGEESILIGHAGHPEVEGTMGQYSNPEGGMYLVESPDDVWKLTVKNEEKLSF FT MTQTTLSVDDTSDVIDALRKRFPKIVGPRKDDICYATTNRQEAVRALAEQAEVVLVVGS FT KNSSNSNRLAELAQRMGKRAFLIDDAKDIQEEWVKEVKCVGVTAGASAPDILVQNVVAR FT LQQLGGGEAIPLEGREENIVFEVPKELRVDIREVD" FT CDS 26947..27861 FT /note="ORF_ID:o103#8" FT /note="similar to PIR Accession Number JE0404" FT /transl_table=11 FT /gene="yaaF" FT /product="Hypothetical 32.6k protein (lsp-dapB intergenic FT region)" FT /protein_id="BAB96599.1" FT /translation="MRLPIFLDTDPGIDDAVAIAAAIFAPELDLQLMTTVAGNVSVEKT FT TRNALQLLHFWNAEIPLAQGAAVPLVRAPRDAASVHGESGMAGYDFVEHNRKPLGIPAF FT 
LAIRDALMRAPEPVTLVAIGPLTNIALLLSQCPECKPYIRRLVIMGGSAGRGNCTPNAE FT FNIAADPEAAACVFRSGIEIVMCGLDVTNQAILTPDYLSTLPQLNRTGKMLHALFSHYR FT SGSMQSGLRMHDLCAIAWLVRPDLFTLKPCFVAVETQGEFTSGTTVVDIDGCLGKPANV FT QVALDLDVKGFQQWVAEVLALAS" FT CDS 28028..28849 FT /note="ORF_ID:o103#9" FT /note="similar to SwissProt Accession Number P04036" FT /transl_table=11 FT /gene="dapB" FT /product="Dihydrodipicolinate reductase (EC 1.3.1.26)." FT /protein_id="BAB96600.1" FT /translation="MHDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSLL FT GSDAGELAGAGKTGVTVQSSLDAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIGTT FT GFDEAGKQAIRDAAADIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHHRHK FT VDAPSGTALAMGEAIAHALDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVGEHTA FT MFADIGERLEITHKASSRMTFANGAVRSALWLSGKESGLFDMRDVLDLNNL" FT CDS 29305..30453 FT /db_xref="SWISS-PROT:P00907" FT /note="ORF_ID:o103#10" FT /note="similar to SwissProt Accession Number P00907" FT /transl_table=11 FT /gene="carA" FT /gene="pyrA" FT /product="Carbamoyl-phosphate synthase small chain (EC FT 6.3.5.5) (carbamoyl-phosphate synthetase glutamine chain)." 
FT /protein_id="BAB96601.1" FT /translation="MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTD FT PSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKR FT HNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVT FT TAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQ FT TSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGA FT KTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGI FT HRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK" FT CDS 30471..33692 FT /db_xref="SWISS-PROT:P00968" FT /note="ORF_ID:o103#11" FT /note="similar to PIR Accession Number A01198" FT /transl_table=11 FT /gene="carB" FT /product="Carbamoyl-phosphate synthase FT (glutamine-hydrolyzing) (EC 6.3.5.5) large chain" FT /protein_id="BAB96602.1" FT /translation="MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVI FT LVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELER FT QGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVG FT FPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVR FT DKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGG FT SNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDIT FT GGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKAL FT RGLEVGATGFDPKVSLDDPEALTKIRRELKDAGADRIWYIADAFRAGLSVDGVFNLTNI FT DRWFLVQIEELVRLEEKVAEVGITGLNADFLRQLKRKGFADARLAKLAGVREAEIRKLR FT DQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGI FT EFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKP FT KGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANAT FT VTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLD FT HFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQ FT VQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAG FT KSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAK FT AQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPR FT LVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGF FT ATAMALNADATEKVISVQEMHAQIK" FT CDS complement(33740..33919) FT /note="ORF_ID:o103#12" FT /note="similar to SwissProt 
Accession Number P46145" FT /transl_table=11 FT /gene="yaaV" FT /product="Hypothetical 6.8 kd protein in carB-caiE FT intergenic region." FT /protein_id="BAB96603.1" FT /translation="MTRFEAIKQGHIKIVDISIVCNFTVDKCELNPAYVIKNIDSPKDL FT LNGQKKTVLIREPY" FT misc_feature 33849..34351 FT /note="ORF is interrupted by frame-shifting" FT /note="ORF_ID:o103#13" FT /note="similar to PIR Accession Number JC6025" FT /gene="caiF" FT /product="Transcription activator caiF" FT CDS complement(34438..35049) FT /note="ORF_ID:o103#14" FT /note="similar to SwissProt Accession Number P39206" FT /transl_table=11 FT /gene="caiE" FT /product="Carnitine operon protein caiE." FT /protein_id="BAB96604.1" FT /translation="MERTLTTVSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIG FT PLASLRGDYGRLIVQAGANIQDGCIMHGYCDTDTIVGENGHIGHGAILHGCLIGRDALV FT GMNSVIMDGAVIGEESIVAAMSFVKAGFRGEKRQLLMGTPARAVRNVSDDELHWKRLNT FT KEYQDLVGRCHVSLHETQPLRQMEENRPRLQGTTDVTPKR" FT CDS complement(35034..35927) FT /note="ORF_ID:o103#15" FT /note="similar to PIR Accession Number S40557" FT /transl_table=11 FT /gene="caiD" FT /product="Hypothetical protein." FT /protein_id="BAB96605.1" FT /translation="MKQQGTTLPANNHTLKQYAFFAGMLSSLKKQKWRKGMSESLHLTR FT NGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKA FT AAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGAFELALAADFIVCADNASF FT ALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGAEEALRWGIVNRVVSQAELM FT DNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHSEDAIE FT GPLAFAEKRDPVWKGR" FT CDS complement(35928..37496) FT /note="ORF_ID:o104#1" FT /note="similar to PIR Accession Number S40558" FT /transl_table=11 FT /gene="caiC" FT /product="Hypothetical protein." 
FT /protein_id="BAB96606.1" FT /translation="MDRGAMDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSY FT LELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLLCE FT ESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKN FT QQPATLCYAPPLSTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVY FT LTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATVTECIPMMIRTL FT MVQPPSANDQQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGIIGDRPGD FT KRRWPSIGRVGFCYEAEIRDDHNRPLPAGEIGEICIKGIPGKTIFKEYFLNPQATAKVL FT EADGWLHTGDTGYRDEEDFFYFVDRRCNMIKRGGENVSCVELENIIAAHPKIQDIVVVG FT IKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDLPRNCSGKII FT RKNLK" FT CDS complement(37555..38772) FT /note="ORF_ID:o104#2" FT /note="similar to PIR Accession Number S40559" FT /transl_table=11 FT /gene="caiB" FT /product="L-carnitine dehydratase (EC 4.-.-.-)." FT /protein_id="BAB96607.1" FT /translation="MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIEN FT VAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFARR FT GITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPMPA FT FPYTADYFSGLTATTAALAALHKVRETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMC FT PRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLGTPEIPEGTQLI FT HRIECPYGPLVEEKLDAWLATHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESI FT TQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKG FT LAKVED" FT CDS complement(38901..40043) FT /note="ORF_ID:o105#1" FT /note="similar to PIR Accession Number S40560" FT /transl_table=11 FT /gene="caiA" FT /product="Hypothetical protein." 
FT /protein_id="BAB96608.1" FT /translation="MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKA FT LADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGTQE FT QIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRKNGKIYLNGSKCFITSSAYTP FT YIVVMARDGASPDKPVYTGWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDM FT FGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAARYANQRVQFGEAIGRFQLIQE FT KFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQVLGGVG FT IAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR" FT CDS complement(40075..41589) FT /note="ORF_ID:o105#2" FT /note="similar to PIR Accession Number S40561" FT /transl_table=11 FT /gene="caiT" FT /product="Hypothetical protein." FT /protein_id="BAB96609.1" FT /translation="MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFS FT YVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSAAV FT LFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYFFFV FT RKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFTMGTSLGLATPLVTECMQWL FT FGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASF FT IMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIFLARISR FT GRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWA FT ALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGI FT IGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD" FT CDS 42025..42831 FT /note="ORF_ID:o105#3" FT /note="similar to PIR Accession Number S40562" FT /transl_table=11 FT /gene="fixA" FT /product="FixA homolog." FT /protein_id="BAB96610.1" FT /translation="MNDVSFMISGDAMKIITCYKCVPDEQDIAVNNADGSLDFSKADAK FT ISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRDPDELIVVIDDQF FT EQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIIS FT LTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAAD FT IGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI" FT misc_feature 42810..43787 FT /note="ORF is interrupted by frame-shifting" FT /note="ORF_ID:o105#4" FT /note="similar to SwissProt Accession Number P31574" FT /gene="fixB" FT /product="FixB protein." 
FT CDS 43838..45124 FT /note="ORF_ID:o105#5" FT /note="similar to PIR Accession Number S40564" FT /transl_table=11 FT /gene="fixC" FT /product="FixC protein" FT /protein_id="BAB96611.1" FT /translation="MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKNV FT TGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRSYS FT VLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILADGVN FT SILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTDGLMG FT GGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEYSAHIM FT PEAGINMLPELVGDGVLIAGDAAGMCMNFGFTIRGMDLAIAAGEAAAKTVLSAMKSDDF FT SKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFTIDGSAPE FT LMRKKILRHGKKVGFINLIKDGMKGVTVL" FT CDS 45121..45408 FT /note="ORF_ID:o105#6" FT /note="similar to PIR Accession Number S40565" FT /transl_table=11 FT /gene="yaaT" FT /product="Hypothetical protein" FT /protein_id="BAB96612.1" FT /translation="MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQVLELLVKACPA FT GLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYS" FT misc_feature 45465..46050 FT /note="ORF is interrupted by frame-shifting" FT /note="ORF_ID:o105#7" FT /note="similar to SwissProt Accession Number P31679" FT /gene="yaaU" FT /product="Hypothetical 18.4 kd protein in fixC-kefC FT intergenic region (orf65)." 
FT CDS 46137..46799 FT /note="ORF_ID:o105#8" FT /note="similar to PIR Accession Number S40566" FT /transl_table=11 FT /gene="yabE" FT /product="Hypothetical protein" FT /protein_id="BAB96613.1" FT /translation="MKSSRSKPGFRDLFNRCHFPFVLFVAAIWTCQVIPMFAIYTFGPQ FT IVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLSTAGRRPLLIGSFAMMTLALAVLG FT LIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGTIVS FT TWALPIFIDNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQRMG" FT CDS 46907..47437 FT /note="ORF_ID:o105#9" FT /note="similar to PIR Accession Number S40567" FT /transl_table=11 FT /gene="yabF" FT /product="Hypothetical protein" FT /protein_id="BAB96614.1" FT /translation="MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNID FT IAAEQEALSRADLIVWQHPMQWYSIPPLLKLWINKVFSHGWAYGHGGTALHGKHLLWAV FT TTGGGESHFEIGAHPGFDGLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQARHY FT KQRLLEWQEAHHG" FT CDS 47430..49292 FT /note="ORF_ID:o105#10" FT /note="similar to SwissProt Accession Number P03819" FT /transl_table=11 FT /gene="kefC" FT /gene="trkC" FT /product="Glutathione-regulated potassium-efflux system FT protein KefC (K(+)/H(+) antiporter)." 
FT /protein_id="BAB96615.1" FT /translation="MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWG FT LRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGCGALQMVICGGLLGLF FT CMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAIPLV FT AMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAV FT ALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVG FT MSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQGSEFAF FT VVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQ FT PRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLES FT AGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERE FT TFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTKARAAVYKRTS FT AMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS" FT CDS 49482..49961 FT /note="ORF_ID:o105#11" FT /note="similar to SwissProt Accession Number P00379" FT /transl_table=11 FT /gene="folA" FT /gene="tmrA" FT /product="Dihydrofolate reductase type I (EC 1.5.1.3)." FT /protein_id="BAB96616.1" FT /translation="MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRH FT TWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFL FT PKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR" FT CDS complement(50039..50881) FT /note="ORF_ID:o105#12" FT /note="similar to PIR Accession Number A26221" FT /transl_table=11 FT /gene="apaH" FT /product="Bis(5'-nucleosyl)-tetraphosphatase (symmetrical) FT (EC 3.6.1.41)" FT /protein_id="BAB96617.1" FT /translation="MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGS FT LDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPLDRLTPLLEAPDADELLNWLRR FT QPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNW FT SPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYS FT IAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLAEAAAS" FT CDS complement(50888..51265) FT /note="ORF_ID:o106#1" FT /note="similar to PIR Accession Number A30273" FT /transl_table=11 FT /gene="apaG" FT /product="ApaG protein" FT /protein_id="BAB96618.1" FT /translation="MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQL FT 
LGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDE FT NGVPFSIDIPVFRLAVPTLIH" FT CDS complement(52087..53076) FT /note="ORF_ID:o106#2" FT /note="similar to PIR Accession Number JV0026" FT /transl_table=11 FT /gene="pdxA" FT /product="PdxA protein" FT /protein_id="BAB96619.1" FT /translation="MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTN FT RAAMLGLPLTLRPYSPNSPAQPQTAGTLTLLPVALRAPVTAGQLAVENGHYVVETLARA FT CDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRVALA FT TTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEE FT IDTIIPVLNELRAQGMKLNGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRG FT VNITLGLPFIRTSVDHGTALELAGRGKADVGSFITALNLAIKMIVNTQ" FT CDS complement(53076..53873) FT /note="ORF_ID:o106#3" FT /note="similar to PIR Accession Number PV0009" FT /transl_table=11 FT /product="Hypothetical protein 98 (pdx 5' region)" FT /protein_id="BAB96620.1" FT /translation="MGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARN FT GADFGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHI FT LKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAAAAK FT EFSQDPGSANQGGDLGWATPDIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRN FT VDKTDAAQKDRAYRMLMNRKFSEEAASWMQEQRASAYVKILSN" FT CDS complement(53840..54361) FT /note="ORF_ID:o106#4" FT /note="similar to PIR Accession Number S40574" FT /transl_table=11 FT /gene="surA" FT /product="Survival protein SurA precursor (peptidyl-prolyl FT cis-trans isomerase SurA) (EC 5.2.1.8) (PPiase) (rotamase FT C)." FT /protein_id="BAB96621.1" FT /translation="MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGL FT MQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMGQKMGVKISDEQLDQAIANIA FT KQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQKSNPWRSR FT WVTKTTPALS" FT CDS complement(54414..56768) FT /note="ORF_ID:o106#5" FT /note="similar to SwissProt Accession Number P31554" FT /transl_table=11 FT /gene="imp" FT /gene="ostA" FT /product="Organic solvent tolerance protein precursor." 
FT /protein_id="BAB96622.1" FT /translation="MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQG FT DTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNSRLQADEVQLHQKEAPGQPEPVRTV FT DALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGENRYT FT ILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGD FT KRRSGFLIPNAKYTTTNYFEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQ FT AGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGVMDQVWRFNVDYTKVSDPSYF FT NDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYY FT QNDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKLLATHY FT QQTNLDWYNSRNTTKLDESVNRVMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYV FT PYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTTGVTSRIYDDAAVERFN FT ISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNV FT ATSNSSIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGISQVGAVASWPIA FT DRWSIVGAYYYDTNANKQADSMLGVQYSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIG FT FNIELRGLSSNYGLGTQEMLRSNILPYQNTL" FT CDS 57023..57838 FT /note="ORF_ID:o106#6" FT /note="similar to SwissProt Accession Number P31680" FT /transl_table=11 FT /gene="yabH" FT /product="Hypothetical 30.6 kd protein in folA-hepA FT intergenic region (orf81)." FT /protein_id="BAB96623.1" FT /translation="MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWFA FT NQRERQALFFATTFEVMGHLTKSKGRVTEADIHIASQLMDRMNLHGASRTAAQNAFRVG FT KSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEELGI FT SRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDATTIK FT RAYRKLMSEHHPDKLVAKGLPPEMMEMAKQKAQEIQQAYELIKQQKGFK" FT CDS 58625..58783 FT /note="ORF_ID:o106#7" FT /note="similar to SwissProt Accession Number P39220" FT /transl_table=11 FT /gene="yabP" FT /product="Hypothetical 5.9 kd protein in surA-hepA FT intergenic region." FT /protein_id="BAB96624.1" FT /translation="MRDCYLGKKTMKGSNDILYERPGWNANLGVLPRTVLPRTVLTRTV FT LTWTVLP" FT CDS 58780..58938 FT /note="ORF_ID:o106#8" FT /note="similar to SwissProt Accession Number P39221" FT /transl_table=11 FT /gene="yabQ" FT /product="Hypothetical 5.7 kd protein in surA-hepA FT intergenic region." 
FT /protein_id="BAB96625.1" FT /translation="MNGATSLYDEVIIINKIPPKKIDTKGVATEEVATKKVLLNKLLTT FT QLLNEPE" FT misc_feature complement(59346..60009) FT /note="ORF is interrupted by frame-shifting" FT /note="ORF_ID:o106#9" FT /note="similar to SwissProt Accession Number P39219" FT /gene="yabO" FT /product="Hypothetical 24.9 kd protein in surA-hepA FT intergenic region." FT CDS 59985..60518 FT /note="ORF_ID:o106#10" FT /transl_table=11 FT /protein_id="BAB96626.1" FT /translation="MDCSFPSPFSAPLLMRYNDQTQGIQTPTCLIQAFHYLLTVALNGG FT QFVVTNVRVHCTQSFQTRQLGRQFFVGFVTRCINQCTGRFLDLRFTQLKDSVNILLHGV FT NQFAAGVTVNRVKLAVKGFKLYLRRQVVAVFIQQHTYRRRWQEAVELQLLRSLGFNHIN FT QFHQQRTYRQRFVF" FT CDS complement(60021..62927) FT /note="ORF_ID:o106#11" FT /note="similar to SwissProt Accession Number P23852" FT /transl_table=11 FT /gene="hepA" FT /product="Probable ATP-dependent helicase HepA." FT /protein_id="BAB96627.1" FT /translation="MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYAR FT SDSPVTRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREVFLDS FT KLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLNIAH FT DVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVEMLRR FT FNLRFALFDDERYAEAQHDAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDLLVVDE FT AHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFA FT QFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSA FT RQELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKVSGIMGARK FT SAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATAL FT QLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIGSEGRNFQFAS FT HMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHT FT CPTGRTIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSN FT GGEKAQALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPDFPG FT LSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSSTISLLKNKALPVG FT TLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQVEFETFNRQLNAVNRH FT TGSKLVNAVQQDVHAILQLGEAQIEKSARALIDAARNEADEKLSAELSRLEALRAVNPN FT IRDDELTAIESNRQQVMESLDQAGWRLDALRLIVVTHQ" FT CDS complement(63092..64741) FT 
/note="ORF_ID:o107#1" FT /note="dinA; polB" FT /note="similar to E. coli DNA polymerase II (EC 2.7.7.7) FT (Pol II): PIR Accession Number S15943" FT /transl_table=11 FT /protein_id="BAB96628.1" FT /translation="MPSRYRLPLRLGRDNSELEWREHGPAKTASFLPQAKGRLIIDGIE FT ALKSAFWDFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCEL FT VTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPH FT ASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTEGFLD FT AWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFYGVLGTTACRFFDPRL FT ASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEEEAAKIGRALVQHVN FT AWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKG FT LETVRTDWTPLAQQFQQELYLRIFRNEPYQEYVRETIDKLMAGELDARLVYRKRLRRPL FT SEYQRNVPPHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYEH FT YLTRQLQPVAEGILPFIEDNFATLMTGQLGLF" FT CDS complement(64698..65447) FT /note="ORF_ID:o107#2" FT /note="polB" FT /note="similar to E. coli DNA-directed DNA polymerase (EC FT 2.7.7.7) II: PIR Accession Number S15943" FT /transl_table=11 FT /protein_id="BAB96629.1" FT /translation="MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFI FT PADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVTVY FT EADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPLKWVSIDIETTRHG FT ELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVASRPQLLEKLNAWFANYDPDVIIG FT WNVVQFDLRMLQKHAEPLPSSAASWAR" FT CDS complement(65522..66217) FT /note="ORF_ID:o107#3" FT /note="similar to SwissProt Accession Number P08203" FT /transl_table=11 FT /gene="araD" FT /product="L-ribulose-5-phosphate 4-epimerase (EC 5.1.3.4)." 
FT /protein_id="BAB96630.1" FT /translation="MLEDLKRQVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSG FT VDYSVMTADDMVVVSIETGEVVEGTKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHATIW FT AQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDAAQM FT PGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKHYLRK FT HGAKAYYGQ" FT CDS complement(66502..68004) FT /note="ORF_ID:o107#4" FT /note="similar to SwissProt Accession Number P08202" FT /transl_table=11 FT /gene="araA" FT /product="L-arabinose isomerase (EC 5.3.1.4)." FT /protein_id="BAB96631.1" FT /translation="MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLP FT CKLVLKPLGTTPDEITAICRDANYDDPCAGLVVWLHTFSPAKMWINGLTMLNKPLLQFH FT TQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERIGSW FT MRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSIS FT DGDVNALVDEYESCYTMTPATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFTTTFED FT LHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFE FT KGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDL FT GDRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALN FT LNDMRQFAEMHDIEITVIDNDTRLPAFKDALRWNEVYYGFRR" FT CDS complement(68015..69715) FT /note="ORF_ID:o108#1" FT /note="similar to PIR Accession Number B29022" FT /transl_table=11 FT /gene="araB" FT /product="Ribulokinase (EC 2.7.1.16)" FT /protein_id="BAB96632.1" FT /translation="MAIAIGLDFGSDSVRALAVDCASGEEIATSVEWYPRWQKGQFCDA FT PNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAVVGIGVDSTGSTPAPIDADGNVLAL FT RPEFAENPNAMFVLWKDHTAVERSEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKILHV FT TRQDSAVAQSAASWIELCDWVPALLSGTTRPQDIRRGRCSAGHKSLWHESWGGLPPASF FT FDELDPILNRHLPSPLFTDTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCHMGAVG FT AGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQVDGSVVPGFIGLEAGQSAFGD FT IYAWFGRVLSWPLEQLAAQHPELKAQINASQKQLLPALTEAWAKNPSLDHLPVVLDWFN FT GRRSPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGIAVNNVMAL FT GGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPSAQQKMASA FT VEKTLQPRSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAAQAVATL" FT CDS 70054..70932 FT /note="ORF_ID:o108#2" FT /note="similar to PIR Accession Number A91473" 
FT /transl_table=11 FT /gene="araC" FT /product="Arabinose operon regulatory protein" FT /protein_id="BAB96633.1" FT /translation="MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGY FT ILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRAYW FT HEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLLLRR FT MEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQLGIS FT VLSWREDQRISQAKLLLSTTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEFRAGCE FT EKVNDVAVKLS" FT CDS 71066..71779 FT /note="ORF_ID:o108#3" FT /note="similar to PIR Accession Number S40581" FT /transl_table=11 FT /gene="yabI" FT /product="Hypothetical protein" FT /protein_id="BAB96634.1" FT /translation="MAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWLA FT GIIGCLMGDWISFWLGWRFKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFVGP FT TRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGILAGAAIDIPAGMQSGEFKWL FT LLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHP FT LMPVYIDILRKVVGV" FT CDS complement(71894..72592) FT /note="ORF_ID:o108#4" FT /note="similar to PIR Accession Number S40582" FT /transl_table=11 FT /gene="yabJ" FT /product="Hypothetical protein" FT /protein_id="BAB96635.1" FT /translation="MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLI FT AGFLTPASGSLTIDGVDHTTMPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGLKLNA FT VQQGKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDELFSALDPA FT LRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGMTNELLSGKA FT SASALLGITG" FT CDS complement(72576..74186) FT /note="ORF_ID:o108#5" FT /note="similar to PIR Accession Number S40583" FT /transl_table=11 FT /gene="yabK" FT /product="Hypothetical protein" FT /protein_id="BAB96636.1" FT /translation="MATRRQPLIPGWLIPGVSATTLVVAVALAAFLALWWNAPQDDWVA FT VWQDSYLWHVVRFSFWQAFLSALLSVIPAIFLARALYRRRFPGRLALLRLCAMTLILPV FT LVAVFGILSVYGRQGWLATLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQALE FT NIPGEQRQLAAQLGMRSWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQ FT ATTIELAIYQALSYDYDPARAAMLALLQMVCCLGLVLLSQRLSKAIAPGTTLLQGWRDP FT DDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTSLRIALA FT 
AGVLCVVLTMMLLWSSRELRARQKMLAGQVLEMSGMLILAMPGIVLATGFFLLLNNTIG FT LPQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRLKVVELRAL FT KRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIGSYRSQDGAVTTLILLLL FT CFLLFTVIEKLPGRNVKTD" FT CDS complement(74162..75133) FT /note="ORF_ID:o108#6" FT /note="similar to PIR Accession Number S40584" FT /transl_table=11 FT /gene="tbpA" FT /product="Hypothetical protein." FT /protein_id="BAB96637.1" FT /translation="MSAPAVAVTAPVFAKPVLTVYTYDSFAADWGPGPVVKKAFEADCN FT CELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSGVAADAVNV FT PGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRTSTPGLGL FT LLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHILEE FT KKDNYAAANFSEGHYLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVA FT NVTLPAGFEKLTKPATTLEFTPAEVAAQRQAWISEWQRAVSR" FT CDS complement(75308..76963) FT /note="ORF_ID:o108#7" FT /note="similar to SwissProt Accession Number P33595" FT /transl_table=11 FT /gene="yabN" FT /product="Hypothetical 63.9 kd protein in tbpA-leuD FT intergenic region (orf103)." FT /protein_id="BAB96638.1" FT /translation="MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLLN FT TMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQRAEDLLEQDRIDQLVQLVGDKATVR FT QMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEENGE FT LEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIADIVS FT PTPWTLDIHLTQPDRWLPLLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRNSTNQL FT KIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGPQGEEKEIESRLEEGCYYLLF FT DSRTHRGANQQVRDWASYVLSPTNLVYFAEEQYQQLWFPAYGLFPRWHHARTIKSEKPA FT GLESLTLTFCQDHSEHRVIAGIMQQILASHQVTLKIKEIDYDQWHTGEIESDIWLNSAN FT FTLPLDFSVFAHLCEVPLLQHCIPIDWQADAARWRNGEMNLANWCQQLVASKAMVPLLH FT HWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP" FT CDS 77285..78463 FT /note="ORF_ID:o108#8" FT /note="similar to SwissProt Accession Number P31675" FT /transl_table=11 FT /gene="yabM" FT /product="Hypothetical 42.7 kd protein in tbpA-leuD FT intergenic region (orf104)." 
FT /protein_id="BAB96639.1" FT /translation="MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREVG FT AQPFWIGLFYAVNAIAGIGVSLWLAKRSDSQGDRRKLIIFCCLMAIGNALLFAFNRHYL FT TLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPLAFM FT LALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVRMLFV FT ASTLMWTCNTMYIIDMPLWISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVKRYGKR FT RMMVIAVAAGVLFYTGLIFFNSRMALMTLQLFNAVFIGIVAGIGMLWFQDLMPGRAGAA FT TTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV" FT CDS complement(78512..79117) FT /note="ORF_ID:o108#9" FT /note="similar to PIR Accession Number S40585" FT /transl_table=11 FT /gene="leuD" FT /product="Isopropylmalate isomerase subunit" FT /protein_id="BAB96640.1" FT /translation="MAEKFIKHTGLVVPLDAANVDTDAIIPKQFLQKVTRTGFGAHLFN FT DWRFLDEKGQQPNPDFVLNFPQYQGASILLARENFGCGSSREHAPWALTDYGFKVVIAP FT SFADIFYGNSFNNQLLPVKLSDAEVDELFALVKANPGIHFDVDLEAQEVKAGEKTYRFT FT IDAFRRHCMMNGLDSIGLTLQHDDAIAAYEAKQPAFMN" FT CDS complement(79128..80528) FT /db_xref="SWISS-PROT:P30127" FT /note="ORF_ID:o109#1" FT /note="similar to PIR Accession Number S40586" FT /transl_table=11 FT /gene="leuC" FT /product="3-isopropylmalate dehydratase (EC 4.2.1.33) alpha FT chain" FT /protein_id="BAB96641.1" FT /translation="MAKTLYEKLFDAHVVYEAENETPLLYIDRHLVHEVTSPQAFDGLR FT AHGRPVRQPGKTFATMDHNVSTQTKGINACGEMARIQMQELIKNCKEFGVELYDLNHPY FT QGIVHVMGPEQGVTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVEHVLATQTLKQGRA FT KTMKIEVQGKAAPGITAKDIVLAIIGKTGSAGGTGHVVEFCGEAIRDLSMEGRMTLCNM FT AIEMGAKAGLVAPDETTFNYVKGRLHAPKGKDFDDAVAYWKTLQTDEGATFDTVVTLQA FT EEISPQVTWGTNPGQVISVNDNIPDPASFADPVERASAEKALAYMGLKPGIPLTEVAID FT KVFIGSCTNSRIEDLRAAAEIVKGRKVAPGVQALVVPGSGPVKAQAEAEGLDKIFIEAG FT FEWRLPGCSMCLAMNNDRLNPGERCASTSNRNFEGRQGRGGRTHLVSPAMAAAAAVTGH FT FADIRNIK" FT CDS complement(80531..81622) FT /note="ORF_ID:o109#2" FT /note="similar to SwissProt Accession Number P30125" FT /transl_table=11 FT /gene="leuB" FT /product="3-isopropylmalate dehydrogenase (EC 1.1.1.85) FT (beta-ipm dehydrogenase) (imdH) (3-ipm-dh)." 
FT /protein_id="BAB96642.1" FT /translation="MSKNYHIAVLPGDGIGPEVMTQALKVLDAVRNRFAMRITTSHYDV FT GGAAIDNHGQPLPPATVEGCEQADAVLFGSVGGPKWEHLPPDQQPERGALLPLRKHFKL FT FSNLRPAKLYQGLEAFCPLRADIAANGFDILCVRELTGGIYFGQPKGREGSGQYEKAFD FT TEVYHRFEIERIARIAFESARKRRHKVTSIDKANVLQSSILWREIVNEIATEYPDVELA FT HMYIDNATMQLIKDPSQFDVLLCSNLFGDILSDECAMITGSMGMLPSASLNEQGFGLYE FT PAGGSAPDIAGKNIANPIAQILSLALLLRYSLDADDAACAIERAINRALEEGIRTGDLA FT RGAAAVSTDEMGDIIARYVAEGV" FT CDS complement(81622..83193) FT /db_xref="SWISS-PROT:P09151" FT /note="ORF_ID:o109#3" FT /note="similar to PIR Accession Number S40588" FT /transl_table=11 FT /gene="leuA" FT /product="2-Isopropylmalate synthase" FT /protein_id="BAB96643.1" FT /translation="MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEV FT GFPVSSPGDFESVQTIARQVKNSRVCALARCVEKDIDVAAESLKVAEAFRIHTFIATSP FT MHIATKLRSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAINAGA FT TTINIPDTVGYTMPFEFAGIISGLYERVPSIGKAIISVHTHDDLGLAVGNSLAAVHAGA FT RQVEGAMNGIGERAGNCSLEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPI FT PANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIGLNQIQLNLTSRSGRAAVKHR FT MDEMGYKESEYNLDNLYDAFLKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQ FT SGSNDIATAAVKLACGEEVKAEAANGNGPVDAVYQAINRITEYNVELVKYSLTAKGHGK FT DALGQVDIVANYNGRRFHGVGLATDIVESSAKAMVHVLNNIWRAAEVEKELQRKAQHNE FT NNKETV" FT CDS complement(83286..83372) FT /note="ORF_ID:o109#4" FT /note="similar to PIR Accession Number A30376" FT /transl_table=11 FT /gene="leuLP" FT /product="LeuABCD leader peptide." FT /protein_id="BAB96644.1" FT /translation="MTHIVRFIGLLLLNASSLRGRRVSGIQH" FT CDS 83855..84904 FT /note="ORF_ID:o109#5" FT /note="similar to PIR Accession Number S40589" FT /transl_table=11 FT /gene="lueO" FT /product="LeuO protein." 
FT /protein_id="BAB96645.1" FT /translation="MTHSTAMDSVFIRTRIFMFSEFYSFCFFLFYMHDKSYSSGLFLCI FT PIRERELSVTVELSMPEVQTDHPETAELSKPQLRMVDLNLLTVFDAVMQEQNITRAAHV FT LGMSQPAVSNAVARLKVMFNDELFVRYGRGIQPTARAFQLFGSVRQALQLVQNELPGSG FT FEPASSERVFHLCVCSPLDSILTSQIYNHIEQIAPNIHVMFKSSLNQNTEHQLRYQETE FT FVISYEDFHRPEFTSVPLFKDEMVLVASKNHPTIKGPLLKHDVYNEQHAAVSLDRFASF FT SQPWYDTVDKQASIAYQGMAMMSVLSVVSQTHLVAIAPRWLAEEFAESLELQVLPLPLK FT QNSRTCYLS" FT CDS 85294..86994 FT /note="ORF_ID:o109#6" FT /note="similar to PIR Accession Number S14385" FT /transl_table=11 FT /gene="ilvI" FT /product="Acetolactate synthase (EC 4.1.3.18) III large FT chain." FT /protein_id="BAB96646.1" FT /translation="MEMLSGGEMVVRSLIDQGVKQVFGYPGGAVLDIYDALHTVGGIDH FT VLVRHEQAAVHMADGLARATGEVGVVLVTSGPGATNAITGIATAYMDSIPLVVLSGQVA FT TSLIGYDAFQECDMVGISRPVVKHSFLVKQTEDIPQVLKKAFWLAASGRPGPVVVDLPK FT DILNPANKLPYVWPESVSMRSYNPTTTGHKGQIKRALQSVVAVKKPVVYVGGGAITAGC FT HQQLKETVEALNLPVVCSLMGLGAFPATHRQVLGMLGMHGTYEANMTMHNADVIFAVGV FT RFDDRTTNNLAKYCPNATVLHIDIDPTSISKTVTADIPIVGDARQVLEQMLELLSQESA FT HQPLDEIRDWWQQIEQWRARQCLKYDTHSEKIKPQAVIETLWRLTKGDAYVTSDVGQHQ FT MFAALYYPFDKPRRWINSGGLGSMGFGLPAALGVKMAFPEETVVCVTGDGSIQMNIQEL FT STALQYELPVLVVNLNNRYLGMVKQWQDMIYSGRHSQSYMQSLPDFVRRGAYGHVGIQI FT SHPHGWKANLARRWNRCAIIAWCLLMLPSMAASTSTRCRFAGAEWMKCG" FT CDS 87017..87508 FT /note="ORF_ID:o109#7" FT /note="similar to SwissProt Accession Number P00894" FT /transl_table=11 FT /gene="ilvH" FT /gene="brnP" FT /product="Acetolactate synthase isozyme III small subunit FT (EC 4.1.3.18) (ahas-III) (acetohydroxy-acid synthase III FT small subunit) (als-III)." 
FT /protein_id="BAB96647.1" FT /translation="MRRILSVLLENESGALSRVIGLFSQRGYNIESLTVAPTDDPTLSR FT MTIQTVGDEKVLEQIEKQLHKLVDVLRVSELGQGAHVEREIMLVKIQASGYGRDEVKRN FT TEIFRGQIIDVTPSLYTVQLAGTSGKLSAFLASIRDVAKIVEVARSGVVGLSRGDKIMR FT " FT CDS 87688..88692 FT /note="ORF_ID:o109#8" FT /note="similar to PIR Accession Number JU0298" FT /transl_table=11 FT /gene="shl" FT /gene="fruR" FT /product="Pep-fructosephosphotransferase system repressor." FT /protein_id="BAB96648.1" FT /translation="MKLDEIARLAGVSRTTASYVINGKAKQYRVSDKTVEKVMAVVREH FT NYHPNAVAAGLRAGRTRSIGLVIPDLENTSYTRIANYLERQARQRGYQLLIACSEDQPD FT NEMRCIEHLLQRQVDAIIVSTSLPPEHPFYQRWANDPFPIVALDRALDREHFTSVVGAD FT QDDAEMLAEELRKFPAETVLYLGALPELSVSFLREQGFRTAWKDDPREVHFLYANSYER FT EAAAQLFEKWLETHPMPQALFTTSFALLQGVMDVTLRRDGKLPSDLAIATFGDNELLDF FT LQCPVLAVAQRHRDVAERVLEIVLASLDEPRKPKPGLTRIKRNLYRRGVLSRS" FT CDS 89294..89752 FT /note="ORF_ID:o109#9" FT /note="similar to PIR Accession Number S14388" FT /transl_table=11 FT /gene="yabB" FT /product="Hypothetical protein C." FT /protein_id="BAB96649.1" FT /translation="MFRGATLVNLDSKGRLSVPTRYREQLLENAAGQMVCTIDIYHPCL FT LLYPLPEWEIIEQKLSRLSSMNPVERRVQRLLLGHASECQMDGAGRLLIAPVLRQHAGL FT TKEVMLVGQFNKFELWDETTWHQQVKEDIDAEQLATGDLSERLQDLSL" FT CDS 89754..90695 FT /note="ORF_ID:o109#10" FT /note="similar to SwissProt Accession Number P18595" FT /transl_table=11 FT /gene="yabC" FT /product="Hypothetical 34.9 kd protein in fruR-ftsL FT intergenic region (orfB)." 
FT /protein_id="BAB96650.1" FT /translation="MMENYKHTTVLLDEAVNGLNIRPDGIYIDGTFGRGGHSRLILSQL FT GEEGRLLAIDRDPQAIAVAKTIDDPRFSIIHGPFSALGEYVAERDLIGKIDGILLDLGV FT SSPQLDDAERGFSFMRDGPLDMRMDPTRGQSAAEWLQTAEEADIAWVLKTYGEERFAKR FT IARAIVERNREQPMTRTKELAEVVAAATPVKDKFKHPATRTFQAVRIWVNSELEEIEQA FT LKSSLNVLAPGGRLSIISFHSLEDRIVKRFMRENSRGPQVPAGLPMTEEQLKKLGGRQL FT RALGKLMPGEEEVAENPRARSSVLRIAERTNA" FT CDS 90692..91024 FT /note="ORF_ID:o110#1" FT /note="similar to PIR Accession Number S14387" FT /transl_table=11 FT /gene="ftsL" FT /product="Cell division protein FtsL" FT /protein_id="BAB96651.1" FT /translation="MISRVTEALSKVKGSMGSHERHALPGVIGDDLLRFGKLPLCLFIC FT IILTAVTVVTTAHHTRLLTAQREQLVLERDALDIEWRNLILEENALGDHSRVERIATEK FT LQMQTC" FT CDS 91074..92840 FT /note="ORF_ID:o110#2" FT /note="similar to PIR Accession Number A93123" FT /transl_table=11 FT /gene="ftsI" FT /gene="pbpB" FT /product="Penicillin-binding protein 3 precursor." FT /protein_id="BAB96652.1" FT /translation="MKAAAKTQKPKRQEEHANFISWRFALLCGCILLALAFLLGRVAWL FT QVISPDMLVKEGDMRSLRVQQVSTSRGMITDRSGRPLAVSVPVKAIWADPKEVHDAGGI FT SVGDRWKALANALNIPLDQLSARINANPKGRFIYLARQVNPDMADYIKKLKLPGIHLRE FT ESRRYYPSGEVTAHLIGFTNVDSQGIEGVEKSFDKWLTGQPGERIVRKDRYGRVIEDIS FT STDSQAAHNLALSIDERLQALVYRELNNAVAFNKAESGSAVLVDVNTGEVLAMANSPSY FT NPNNLSGTPKEAMRNRTITDVFEPGSTVKPMVVMTALQRGVVRENSVLNTIPYRINGHE FT IKDVARYSELTLTGVLQKSSNVGVSKLALAMPSSALVDTYSRFGLGKATNLGLVGERSG FT LYPQKQRWSDIERATFSFGYGLMVTPLQLARVYATIGSYGIYRPLSITKVDPPVPGERV FT FPESIVRTVVHMMESVALPGGGGVKAAIKGYRIAIKTGTAKKVGPDGRYINKYIAYTAG FT VAPASQPRFALVVVINDPQAGKYYGGAVSAPVFGAIMGGVLRTMNIEPDALTTGDKNEF FT VINQGEGTGGRS" FT CDS 92827..94314 FT /note="ORF_ID:o110#3" FT /note="similar to PIR Accession Number S40595" FT /transl_table=11 FT /gene="murE" FT /product="UDP-N-acetylmuramoylalanyl-D-glutamate--2, FT 6-diaminopimelate ligase (EC 6.3.2.13) murE" FT /protein_id="BAB96653.1" FT /translation="MADRNLRDLLAPWVPDAPSRALREMTLDSRVAAAGDLFVAVVGHQ FT ADGRRYIPQAIAQGVAAIIAEAKDEATDGEIREMHGVPVIYLSQLNERLSALAGRFYHE FT 
PSDNLRLVGVTGTNGKTTTTQLLAQWSQLLGEISAVMGTVGNGLLGKVIPTENTTGSAV FT DVQHELAGLVDQGATFCAMEVSSHGLVQHRVAALKFAASVFTNLSRDHLDYHGDMEHYE FT AAKWLLYSEHHCGQAIINADDEVGRRWLAKLPDAVAVSMEDHINPNCHGRWLKATEVNY FT HDSGATIRFSSSWGDGEIESHLMGAFNVSNLLLALATLLALGYPLADLLKTAARLQPVC FT GRMEVFTAPGKPTVVVDYAHTPDALEKALQAARLHCAGKLWCVFGCGGDRDKGKRPLMG FT AIAEEFADVAVVTDDNPRTEEPRAIINDILAGMLDAGHAKVMEGRAEAVTCAVMQAKEN FT DVVLVAGKGHEDYQIVGNQRLDYSDRVTVARLLGVIA" FT CDS 94311..95669 FT /note="ORF_ID:o110#4" FT /note="similar to SwissProt Accession Number P11880" FT /transl_table=11 FT /gene="murF" FT /gene="mra" FT /product="UDP-n-acetylmuramoylalanyl-d-glutamyl-2, FT 6-diaminopimelate-d-alanyl-d-alanyl ligase (EC 6.3.2.15) FT (UDP-murnac-pentapeptide synthetase) FT (d-alanyl-d-alanine-adding enzyme)." FT /protein_id="BAB96654.1" FT /translation="MISVTLSQLTDILNGELQGADITLDAVTTDTRKLTPGCLFVALKG FT ERFDAHDFADQAKAGAAGALLVSRPLDIDLPQLIVKDTRLAFGELAAWVRQQVPARVVA FT LTGSSGKTSVKEMTAAILSQCGNTLYTAGNLNNDIGVPMTLLRLTPEYDYAVIELGANH FT QGEIAWTVSLTRPERALVNNLAAAHLEGFGSLAGVAKAKGEIFSGLPENGIAIMNADNN FT DWLNWQSVIGSRKVWRFSPNAANSDFTATNIHVTSHGTEFTLQTPTGSVDVLLPLPGRH FT NIANALAAAALSMSVGATLDAIKAGLANLKAVPGRLFPIQLAENQLLLDDSYNANVGSM FT TAAVQVLAEMPGYRVLVVGDMAELGAESEACHVQVGEAAKAAGIDRVLSVGKQSHAIST FT ASGVGEHFADKTALITRLKLLIAEQQVITILVKGSRSAAMEEVVRALQENGTC" FT CDS 95663..96745 FT /note="ORF_ID:o110#5" FT /note="similar to SwissProt Accession Number P15876" FT /transl_table=11 FT /gene="mraY" FT /gene="murX" FT /product="Phospho-n-acetylmuramoyl-pentapeptide-transferas FT e (EC 2.7.8.13)." 
FT /protein_id="BAB96655.1" FT /translation="MLVWLAEHLVKYYSGFNVFSYLTFRAIVSLLTALFISLWMGPRMI FT AHLQKLSFGQVVRNDGPESHFSKRGTPTMGGIMILTAIVISVLLWAYPSNPYVWCVLVV FT LVGYGVIGFVDDYRKVVRKDTKGLIARWKYFWMSVIALGVAFALYLAGKDTPATQLVVP FT FFKDVMPQLGLFYILLAYFVIVGTGNAVNLTDGLDGLAIMPTVFVAGGFALVAWATGNM FT NFASYLHIPYLRHAGELVIVCTAIVGAGLGFLWFNTYPAQVFMGDVGSLALGGALGIIA FT VLLRQEFLLVIMGGVFVVETLSVILQVGSFKLRGQRIFRMAPIHHHYELKGWPEPRVIV FT RFWIISLMLVLIGLATLKVR" FT CDS 96748..98064 FT /note="ORF_ID:o110#6" FT /note="similar to PIR Accession Number S08396" FT /transl_table=11 FT /gene="murD" FT /product="UDP-n-acetylmuramoylalanine-d-glutamate ligase FT (EC 6.3.2.9)." FT /protein_id="BAB96656.1" FT /translation="MADYQGKNVVIIGLGLTGLSCVDFFLARGVTPRVMDTRMTPPGLD FT KLPEAVERHTGSLNDEWLMAADLIVASPGIALAHPSLSAAADAGIEIVGDIELFCREAQ FT APIVAITGSNGKSTVTTLVGEMAKAAGVNVGVGGNIGLPALMLLDDECELYVLELSSFQ FT LETTSSLQAVAATILNVTEDHMDRYPFGLQQYRAAKLRIYENAKVCVVNADDALTMPIR FT GADERCVSFGVNMGDYHLNHQQGETWLRVKGEKVLNVKEMKLSGQHNYTNALAALALAD FT AAGLPRASSLKALTTFTGLPHRFEVVLEHNGVRWINDSKATNVGSTEAALNGLHVDGTL FT HLLLGGDGKSADFSPLARYLNGDNVRLYCFGRDGAQLAALRPEVAEQTETMEQAMRLLA FT PRVQPGDMVLLSPACASLDQFKNFEQRGNEFARLAKELG" FT CDS 98064..99308 FT /note="ORF_ID:o111#1" FT /note="similar to PIR Accession Number A32581" FT /transl_table=11 FT /gene="ftsW" FT /product="Cell division protein FtsW." FT /protein_id="BAB96657.1" FT /translation="MRLSLPRLKMPRLPGFSILVWISTALKGWVMGSREKDTDSLIMYD FT RTLLWLTFGLAAIGFIMVTSASMPIGQRLTNDPFFFAKRDGVYLILAFILAIITLRLPM FT EFWQRYSATMLLGSIILLMIVLVVGSSVKGASRWIDLGLLRIQPAELTKLSLFCYIANY FT LVRKGDEVRNNLRGFLKPMGVILVLAVLLLAQPDLGTVVVLFVTTLAMLFLAGAKLWQF FT IAIIGMGISAVVLLILAEPYRIRRVTAFWNPWEDPFGSGYQLTQSLMAFGRGELWGQGL FT GNSVQKLEYLPEAHTDFIFAIIGEELGYVGVVLALLMVFFVAFRAMSIGRKALEIDHRF FT SGFLACSIGIWFSFQALVNVGAAAGMLPTKGLTLPLISYGGSSLLIMSTAIMMLLRIDY FT ETRLEKAQAFVRGSR" FT CDS 99305..100372 FT /note="ORF_ID:o111#2" FT /note="similar to PIR Accession Number JQ0544" FT /transl_table=11 FT /gene="murG" FT /product="MurG protein." 
FT /protein_id="BAB96658.1" FT /translation="MSGQGKRLMVMAGGTGGHVFPGLAVAHHLMAQGWQVRWLGTADRM FT EADLVPKHGIEIDFIRISGLRGKGIKALIAAPLRIFNAWRQARAIMKAYKPDVVLGMGG FT YVSGPGGLAAWSLGIPVVLHEQNGIAGLTNKWLAKIATKVMQAFPGAFPNAEVVGNPVR FT TDVLALPLPQQRLAGREGPVRVLVVGGSQGARILNQTMPQVAAKLGDSVTIWHQSGKGS FT QQSVEQAYAEAGQPQHKVTEFIDDMAAAYAWADVVVCRSGALTVSEIAAAGLPALFVPF FT QHKDRQQYWNALPLEKAGAAKIIEQPQLSVDAVANTLAGWSRETLLTMAERARAASIPD FT ATERVANEVSRVARA" FT CDS 100426..101901 FT /note="ORF_ID:o111#3" FT /note="similar to PIR Accession Number JQ0545" FT /transl_table=11 FT /gene="murC" FT /product="UDP-n-acetylmuramate-alanine ligase (EC FT 6.3.2.8)." FT /protein_id="BAB96659.1" FT /translation="MNTQQLAKLRSIVPEMRRVRHIHFVGIGGAGMGGIAEVLANEGYQ FT ISGSDLAPNPVTQQLMNLGATIYFNHRPENVRDASVVVVSSAISADNPEIVAAHEARIP FT VIRRAEMLAELMRFRHGIAIAGTHGKTTTTAMVSSIYAEAGLDPTFVNGGLVKAAGVHA FT RLGHGRYLIAEADESDASFLHLQPMVAIVTNIEADHMDTYQGDFENLKQTFINFLHNLP FT FYGRAVMCVDDPVIRELLPRVGRQTTTYGFSEDADVRVEDYQQIGPQGHFTLLRQDKEP FT MRVTLNAPGRHNALNAAAAVAVATEEGIDDEAILRALESFQGTGRRFDFLGEFPLEPVN FT GKSGTAMLVDDYGHHPTEVDATIKAARAGWPDKNLVMLFQPHRFTRTRDLYDDFANVLT FT QVDTLLMLEVYPAGEAPIPGADSRSLCRTIRGRGKIDPILVPDPARVAEMLAPVLTGND FT LILVQGAGNIGKIARSLAEIKLKPQTPEEEQHD" FT CDS 101894..102814 FT /note="ORF_ID:o111#4" FT /note="similar to PIR Accession Number A30289" FT /transl_table=11 FT /gene="ddl" FT /gene="ddlB" FT /product="D-alanine-d-alanine ligase (EC 6.3.2.4) B." FT /protein_id="BAB96660.1" FT /translation="MTDKIAVLLGGTSAEREVSLNSGAAVLAGLREGGIDAYPVDPKEV FT DVTQLKSMGFQKVFIALHGRGGEDGTLQGMLELMGLPYTGSGVMASALSMDKLRSKLLW FT QGAGLPVAPWVALTRAEFEKGLSDKQLAEISALGLPVIVKPSREGSSVGMSKVVAENAL FT QDALRLAFQHDEEVLIEKWLSGPEFTVAILGEEILPSIRIQPSGTFYDYEAKYLSDETQ FT YFCPAGLEASQEANLQALVLKAWTTLGCKGWGRIDVMLDSDGQFYLLEANTSPGMTSHS FT LVPMAARQAGMSFSQLVVRILELAD" FT CDS 102816..103646 FT /note="ORF_ID:o111#5" FT /note="similar to PIR Accession Number S10852" FT /transl_table=11 FT /gene="ftsQ" FT /product="Cell division protein FtsQ." 
FT /protein_id="BAB96661.1" FT /translation="MSQAALNTRNSEEEVSSRRNNGTRLAGILFLLTVLTTVLVSGWVV FT LGWMEDAQRLPLSKLVLTGERHYTRNDDIRQSILALGEPGTFMTQDVNIIQTQIEQRLP FT WIKQVSVRKQWPDELKIHLVEYVPIARWNDQHMVDAEGNTFSVPPERTSKQVLPMLYGP FT EGSANEVLQGYREMGQMLAKDRFTLKEAAMTARRSWQLTLNNDIKLNLGRGDTMKRLAR FT FVELYPVLQQQAQTDGKRISYVDLRYDSGAAVGWAPLPPEESTQQQNQAQAEQQ" FT CDS 103643..104905 FT /note="ORF_ID:o111#6" FT /note="similar to SwissProt Accession Number P06137" FT /transl_table=11 FT /gene="ftsA" FT /gene="divA" FT /product="Cell division protein FtsA." FT /protein_id="BAB96662.1" FT /translation="MIKATDRKLVVGLEIGTAKVAALVGEVLPDGMVNIIGVGSCPSRG FT MDKGGVNDLESVVKCVQRAIDQAELMADCQISSVYLALSGKHISCQNEIGMVPISEEEV FT TQEDVENVVHTAKSVRVRDEHRVLHVIPQEYAIDYQEGIKNPVGLSGVRMQAKVHLITC FT HNDMAKNIVKAVERCGLKVDQLIFAGLASSYSVLTEDERELGVCVVDIGGGTMDIAVYT FT GGALRHTKVIPYAGNVVTSDIAYAFGTPPSDAEAIKVRHGCALGSIVGKDESVEVPSVG FT GRPPRSLQRQTLAEVIEPRYTELLNLVNEEILQLQEKLRQQGVKHHLAAGIVLTGGAAQ FT IEGLAACAQRVFHTQVRIGAPLNITGLTDYAQEPYYSTAVGLLHYGKESHLNGEAEVEK FT RVTASVGSWIKRLNSWLRKEF" FT CDS 104966..106117 FT /note="ORF_ID:o111#7" FT /note="similar to SwissProt Accession Number P06138" FT /transl_table=11 FT /gene="ftsZ" FT /gene="sfiB" FT /gene="sulB" FT /product="Cell division protein FtsZ." FT /protein_id="BAB96663.1" FT /translation="MFEPMELTNDAVIKVIGVGGGGGNAVEHMVRERIEGVEFFAVNTD FT AQALRKTAVGQTIQIGSGITKGLGAGANPEVGRNAADEDRDALRAALEGADMVFIAAGM FT GGGTGTGAAPVVAEVAKDLGILTVAVVTKPFNFEGKKRMAFAEQGITELSKHVNSLITI FT PNDKLLKVLGRGISLLDAFGAANDVLKGAVQGIAELITRPGLMNVDFADVRTVMSEMGH FT AMMGSGVASGEDRAEEAAEMAISSPLLEDIDLSGARGVLVNITAGFDLRLDEFETVGNT FT IRAFASDNATVVIGTSLDPDMNDELRVTVVATGIGMDKRPEITLVTNKQVQQPVMDRYQ FT QHGMAPLTQEQKPVAKVVNDNAPQTAKEPDYLDIPAFLRKQAD" FT CDS 106218..107135 FT /note="ORF_ID:o111#8" FT /note="similar to SwissProt Accession Number P07652" FT /transl_table=11 FT /gene="envA" FT /gene="asmB" FT /gene="lpxC" FT /product="Udp-3-o-[3-hydroxymyristoyl] n-acetylglucosamine FT deacetylase (EC 3.5.1.-) (EnvA protein)." 
FT /protein_id="BAB96664.1" FT /translation="MIKQRTLKRIVQATGVGLHTGKKVTLTLRPAPANTGVIYRRTDLN FT PPVDFPADAKSVRDTMLCTCLVNEHDVRISTVEHLNAALAGLGIDNIVIEVNAPEIPIM FT DGSAAPFVYLLLDAGIDELNCAKKFVRIKETVRVEDGDKWAEFKPYNGFSLDFTIDFNH FT PAIDSSNQRYAMNFSADAFMRQISRARTFGFMRDIEYLQSRGLCLGGSFDCAIVVDDYR FT VLNEDGLRFEDEFVRHKMLDAIGDLFMCGHNIIGAFTAYKSGHALNNKLLQAVLAKQEA FT WEYVTFQDDAELPLAFKAPSAVLA" FT CDS 107435..107878 FT /note="ORF_ID:o111#9" FT /note="similar to PIR Accession Number B28381" FT /transl_table=11 FT /gene="yacA" FT /product="Hypothetical 16k protein (eneA-secA intergenic FT region)." FT /protein_id="BAB96665.1" FT /translation="MVAASLGLPALSNAAEPNAPAKATTRNHEPSAKVNFGQLALLEAN FT TRRPNSNYSVDYWHQHAIRTVIRHLSFAMAPQTLPVAEESLPLQAQHLALLDTLSALLT FT QEGTPSEKGYRIDYAHFTPQAKFSTPVWISQAQGIRAGPQRLT" FT CDS 107940..110645 FT /note="ORF_ID:o111#10" FT /note="similar to SwissProt Accession Number P10408" FT /transl_table=11 FT /gene="secA" FT /gene="azi" FT /gene="pea" FT /gene="prlD" FT /product="Preprotein translocase SecA subunit." 
FT /protein_id="BAB96666.1" FT /translation="MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKG FT KTAEFRARLEKGEVLENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRT FT GEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPA FT PAKREAYAADITYGTNNEYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTP FT LIISGPAEDSSEMYKRVNKIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVL FT IEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHT FT GRTMQGRRWSDGLHQAVEAKEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAF FT EFSSIYKLDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTIS FT IEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGG FT SWQAEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSG FT RQGDAGSSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVE FT SRNFDIRKQLLEYDDVANDQRRAIYSQRNELLDVSDVSETINSIREDVFKATIDAYIPP FT QSLEEMWDIPGLQERLKNDFDLDLPIAEWLDKEPELHEETLRDGILAQSIEVYQRKEEV FT VGAEMMRHFEKGVMLQTLDSLWKEHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFA FT AMLESLKYEVISTLSKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALA FT AQTGERKVGRNDPCPCGSGKKYKQCHGRLQ" FT CDS 110705..111094 FT /note="ORF_ID:o111#11" FT /note="similar to PIR Accession Number A27890" FT /transl_table=11 FT /gene="mutT" FT /product="Mutator MutT (AT-GC transversion)." FT /protein_id="BAB96667.1" FT /translation="MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGET FT PEQAVVRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVERWEGEPWGKEGQPGEW FT MSLVGLNADDFPPANEPVIAKLKRL" FT CDS complement(111310..>111408) FT /note="ORF_ID:o111#12" FT /note="similar to SwissProt Accession Number P36681" FT /transl_table=11 FT /gene="yacG" FT /product="Hypothetical 5.8 kd protein in mutT-guaC FT intergenic region." 
FT /protein_id="BAB96668.1" FT /translation="LIDLGEWAAEEKRIPSSGDLSESDDWSEEPKQ" XX SQ Sequence 111408 BP; 26083 A; 28055 C; 30531 G; 26739 T; 0 other; atgcgagtgt tgaagttcgg cggtacatca ttggcaaatg cagaacgttt tctgcgtgtt 60 gccgatattc tggaaagcaa tgccaggcag gggcaggtgg ccaccgtcct ctctgcccct 120 gccaaaatca ccaaccacct ggtggcgatg attgaaaaaa ccattagcgg ccaggatgct 180 ttacccaata tcagcgatgc cgaacgtatt tttgccgaac ttttgacggg actcgccgcc 240 gcccagccgg ggttcccgtt ggcgcaattg aaaacattcg tcgatcagga atttgcccaa 300 ataaaacatg tcctgcatgg cattagtttg ttggggcagt gcccggatag catcaacgct 360 gcgctgattt gccgtggcga gaagatgtcg atcgccatta tggccggcgt attagaagcg 420 cgcggtcaca acgttactgt tatcgatccg gtcgaaaaac tgctggcagt ggggcattac 480 ctcgaatcca ccgtcgatat tgccgagtcc acccgccgta tagcggcaag ccgcattccg 540 gctgatcaca tggtgctgat ggcaggtttc accgccggta atgagaaagg cgaactggtg 600 gtgcttggac gcaacggttc cgactactct gctgcggtgc tggctgcctg tttacgcgcc 660 gattgttgcg agatttggac ggacgttaac ggggtctata cctgcgaccc gcgtcaggtg 720 cccgacgcga ggttgttgaa gtcgatgtcc taccaggaag cgatggagct ttcctacttc 780 ggcgctaaag ttcttcaccc ccgcaccatt acccccatcg cccagttcca gatcccttgc 840 ctgattaaaa ataccggaaa tcctcaagca ccaggtacgc tcattggtgc aagccgtgat 900 gaagacgaat taccggtcaa gggcatttcc aatctgaata acatggcaat gttcagcgtt 960 tctggtccgg ggatgaaagg gatggtcggc atggcggcgc gcgtctttgc agcgatgtca 1020 cgcgcccgta tttccgtggt gctgattacg caatcatctt ccgaatacag catcagtttc 1080 tgcgttccac aaagcgactg tgtgcgagct gaacgggcaa tgctggaaga gttctaccta 1140 gaactgaaag aaggcttact ggagccgctg gcagtggcgg aacggctggc cattatctcg 1200 gtggtaggtg atggtttgcg caccttgcgt gggatctcgg cgaaattctt tgccgcactg 1260 gcccgcgcca atatcaacat tgtcgccatt gctcagggat cttctgaacg ctcaatctct 1320 gtcgtggtaa ataacgatga tgcgaccact ggcgtgcgcg ttactcatca gatgctgttc 1380 aataccgatc aggttatcga agtgtttgtg attggcgtcg gtggcgttgg cggtgcgctg 1440 ctggagcaac tgaagcgtca gcaaagctgg ctgaagaata aacatatcga cttacgtgtc 1500 tgcggtgttg ccaactcgaa ggctctgctc accaatgtac atggccttaa tctggaaaac 1560 tggcaggagg 
aactggcgca agccaaagag ccgtttaatc tcgggcgctt aattcgcctc 1620 gtgaaagaat atcatctgct gaacccggtc attgttaact gcacttccag ccaggcagtg 1680 gcggatcaat atgccgactt cctgcgcgaa ggtttccacg ttgtcacgcc gaacaaaaag 1740 gccaacacct cgtcgatgga ttactaccat cagttgcgtt atgcggcgga aaaatcgcgg 1800 cgtaaattcc tctatgacat caacgttggg gctggattac cggttattga gaacctgcaa 1860 aatctgctca atgcaggtga tgaattgatg aaattctccg gcattctttc tggttcgctt 1920 tcttatatct tcggcaagtt agacgaaggc atgagtttct ccgaggcgac ccggctggcg 1980 cgggaaatgg gttataccga accggacccg cgagatgatc tttctggtat ggatgtggcg 2040 cgtaaactat tgattctcgc tcgtgaaacg ggacgtgaac tggagctggc ggatattgaa 2100 attgaacctg tgctgcccgc agagtttaac gccgagggtg atgttgccgc ttttatggcg 2160 aatctgtcac aactcgacga tctctttgcc gcgcgcgtgg cgaaggcccg tgatgaagga 2220 aaagttttgc gctatgttgg caatattgat gaagatggcg tctgccgcgt gaagattgcc 2280 gaagtggatg gtaatgatcc gctgttcaaa gtgaaaaatg gcgaaaacgc cctggccttc 2340 tatagccact attatcagcc gctgccgttg gtactgcgcg gatatggtgc gggcaatgac 2400 gttacagctg ccggtgtctt tgctgatctg ctacgtaccc tctcatggaa gttaggagtc 2460 tgacatggtt aaagtttatg ccccggcttc cagtgccaat atgagcgtcg ggtttgatgt 2520 gctcggggcg gcggtgacac ctgttgatgg tgcattgctc ggagatgtag tcacggttga 2580 ggcggcacag acattcagtc tcaacaacct cggacgcttt gccgataagc tgccgtcaga 2640 accacgggaa aatatcgttt atcagtgctg ggagcgtttt tgccaggaac tgggtaagca 2700 aattccagtg gcgatgaccc tggaaaagaa tatgccgatc ggttcgggct taggctccag 2760 tgcctgttcg gtggtcgcgg cgctgatggc gatgaatgaa cactgcggca agccgcttaa 2820 tgacactcgt ttgctggctt tgatgggcga gctggaaggc cgtatctccg gcagcattca 2880 ttacgacaac gtggcaccgt gttttctcgg tggtatgcag ttgatgatcg aagaaaacga 2940 catcatcagc cagcaagtgc agggtttgat gagtggctgt gggtgctcgc gtatccgggg 3000 attaaagtct cgacggcaga agcagggcta tttaccggcg cagtatcgcc gccaggattg 3060 cattgcgcac gggcgacatc tggcaggctt cattcacgcc tgctattccc gtcagcctga 3120 gcttgccgcg aagctgatga aagatgttat cgctgaaccc taccgtgaac ggttactgcc 3180 aggcttccgg caggcgcggc aggcggtcgc ggaaatcggc gcggtagcga gcggtatctc 3240 cggctccggc ccgaccttgt 
tcgctctgtg tgacaagccg gaaaccgccc agcgcgttgc 3300 cgactggttg ggtaagaact acctgcaaaa tcaggaaggt tttgttcata tttgccggct 3360 ggatacggcg ggcgcacgag tactggaaaa ctaaatgaaa ctctacaatc tgaaagatca 3420 caacgagcag gtcagctttg cgcaagccgt aacccagggg ttgggcaaaa atcaggggct 3480 gttttttccg cacgacctgc cggaattcag cctgactgaa attgatgaga tgctgaagct 3540 ggattttgtc acccgcagtg cgaagatcct ctcggcgttt attggtgatg aaatcccaca 3600 ggaaatcctg gaagagcgcg tgcgcgcggc gtttgccttc ccggctccgg tcgccaatgt 3660 tgaaagcgat gtcggttgtc tggaattgtt ccacgggcca acgctggcat ttaaagattt 3720 cggcggtcgc tttatggcac aaatgctgac ccatattgcg ggtgataagc cagtgaccat 3780 tctgaccgcg acctccggtg ataccggagc ggcagtggct catgctttct acggtttacc 3840 gaatgtgaaa gtggttatcc tctatccacg aggcaaaatc agtccactgc aagaaaaact 3900 gttctgtaca ttgggcggca atatcgaaac tgttgccatc gacggcgatt tcgatgcctg 3960 tcaggcgctg gtgaagcagg cgtttgatga tgaagaactg aaagtggcgc tagggttaaa 4020 ctcggctaac tcgattaaca tcagccgttt gctggcgcag atttgctact actttgaagc 4080 tgttgcgcag ctgccgcagg agacgcgcaa ccagctggtt gtctcggtgc caagcggaaa 4140 cttcggcgat ttgacggcgg gtctgctggc gaagtcactc ggtctgccgg tgaaacgttt 4200 tattgctgcg accaacgtga acgataccgt gccacgtttc ctgcacgacg gtcagtggtc 4260 acccaaagcg actcaggcga cgttatccaa cgcgatggac gtgagtcagc cgaacaactg 4320 gccgcgtgtg gaagagttgt tccgccgcaa aatctggcaa ctgaaagagc tgggttatgc 4380 agccgtggat gatgaaacca cgcaacagac aatgcgtgag ttaaaagaac tgggctacac 4440 ttcggagccg cacgctgccg tagcttatcg tgcgctgcgt gatcagttga atccaggcga 4500 atatggcttg ttcctcggca ccgcgcatcc ggcgaaattt aaagagagcg tggaagcgat 4560 tctcggtgaa acgttggatc tgccaaaaga gctggcagaa cgtgctgatt tacccttgct 4620 ttcacataat ctgcccgccg attttgctgc gttgcgtaaa ttgatgatga atcatcagta 4680 aaatctattc attatctcaa tcaggccggg tttgctttta tgcagcccgg cttttttatg 4740 aagaaattat ggagaaaaat gacagggaaa aaggagaaat tctcaataaa tgcggtaact 4800 tagagattag gattgcggag aataacaacc gccgttctca tcgagtaatc tccggatatc 4860 gacccataac gggcaatgat aaaaggagta acctgtgaaa aagatgcaat ctatcgtact 4920 cgcactttcc ctggttctgg tcgctcccat 
ggcagcacag gctgcggaaa ttacgttagt 4980 cccgtcagta aaattacaga taggcgatcg tgataatcgt ggctattact gggatggagg 5040 tcactggcgc gaccacggct ggtggaaaca acattatgaa tggcgaggca atcgctggca 5100 cctacacgga ccgccgccac cgccgcgcca ccataagaaa gctcctcatg atcatcacgg 5160 cggtcatggt cctggcaaac atcaccgcta aatgacaaat gccgggtaac aatccggcat 5220 tcagcgcctg atgcgacgct ggcgcgtctt atcaggccta cgttaattct gcaatatatt 5280 gaatctgcat gcttttgtag gcaggataag gcgttcacgc cgcatccggc attgactgca 5340 aacttaacgc tgctcgtagc gtttaaacac cagttcgcca ttgctggagg aatcttcatc 5400 aaagaagtaa ccttcgctat taaaaccagt cagttgctct ggtttggtca gccgattttc 5460 aataatgaaa cgactcatca gaccgcgtgc tttcttagcg tagaagctga tgatcttaaa 5520 tttgccgttc ttctcatcga ggaacaccgg cttgataatc tcggcattca atttcttcgg 5580 cttcaccgat ttaaaatact catctgacgc cagattaatc accacattat cgccttgtgc 5640 tgcgagcgcc tcgttcagct tgttggtgat gatatctccc cagaattgat acagatcttt 5700 ccctcgggca ttctcaagac ggatccccat ttccagacga taaggctgca ttaaatcgag 5760 cgggcggagt acgccataca agccggaaag cattcgcaaa tgctgttggg caaaatcgaa 5820 atcgtcttcg ctgaaggttt cggcctgcaa gccggtgtag acatcacctt taaacgccag 5880 aatcgcctgg cgggcattcg ccggcgtgaa atctggctgc cagtcatgaa agcgagcggc 5940 gttgataccc gccagtttgt cgctgatgcg catcagcgtg ctaatctgcg gaggcgtcag 6000 tttccgcgcc tcatggatca actgctggga attgtctaac agctccggca gcgtatagcg 6060 cgtggtggtc aacgggcttt ggtaatcaag cgttttcgca ggtgaaataa gaatcagcat 6120 atccagtcct tgcaggaaat ttatgccgac tttagcaaaa aatgagaatg agttgatcga 6180 tagttgtgat tactcctgcg aaacatcatc ccacgcgtcc ggagaaagct ggcgaccgat 6240 atccggataa cgcaatggat caaacaccgg gcgcacgccg agtttacgct ggcgtagata 6300 atcactggca atggtatgaa ccacaggcga gagcagtaaa atggcggtca aattggtaat 6360 agccatgcag gccattatga tatctgccag ttgccacatc agcggaaggc ttagcaaggt 6420 gccgccgatg accgttgcga aggtgcagat ccgcaaacac cagatcgctt tagggttgtt 6480 caggcgtaaa aagaagagat tgttttcggc ataaatgtag ttggcaacga tggagctgaa 6540 ggcaaacaga ataaccacaa gggtaacaaa ctcagcaccc caggaaccca ttagcacccg 6600 catcgccttc tggataagct gaataccttc cagcggcatg 
taggttgtgc cgttacccgc 6660 cagtaatatc agcatggcgc ttgccgtaca gatgaccagg gtgtcgataa aaatgccaat 6720 catctggaca atcccttgcg ctgccggatg cggaggccag gacgccgctg ccgctgccgc 6780 gtttggcgtc gaacccattc ccgcctcatt ggaaaacata ctgcgctgaa aaccgttagt 6840 aatcgcctgg cttaaggtat atcccgccgc gccgcctgcc gcttcctgcc agccaaaagc 6900 actctcaaaa atagaccaaa tgacgtgggg aagttgcccg atattcatta cgcaaattac 6960 caggctggtc agtacccaga ttatcgccat caacgggaca aagccctgca tgagccgggc 7020 gacgccatga agaccgcgag tgattgccag cagagtaaag acagcgagaa taatgcctgt 7080 caccagcggg ggaaaatcaa aagaaaaact cagggcgcgg gcaacggcgt tcgcttgaac 7140 tccgctgaaa attatgccat aggcgatgag caaaaagacg gcgaacagaa cgcccatcca 7200 gcgcatcccc agcccgcgcg ccatatacca tgccggtccg ccacgaaact gcccattgac 7260 gtcacgttct ttataaagtt gtgccagaga acattcggca aacgaggtcg ccatgccgat 7320 aaacgcggca acccacatcc aaaagacggc tccaggtcca ccggcggtaa tagccagcgc 7380 aacgccggcc aggttgccgc tacccacgcg cgccgcaaga ctggtacaca atgactgaaa 7440 tgaggttaaa ccgcctggct gtggatgaat gctattttta agacttttgc caaactggcg 7500 gatgtagcga aactgcacaa atccggtgcg aaaagtgaac caacaacctg cgccgaagag 7560 caggtaaatc attaccgatc cccaaaggac gctgttaatg aaggagaaaa aatctggcat 7620 gcatatccct cttattgccg gtcgcgatga ctttcctgtg taaacgttac caattgttta 7680 agaagtatat acgctacgag gtacttgata acttctgcgt agcatacatg aggttttgta 7740 taaaaatggc gggcgatatc aacgcagtgt cagaaatccg aaacagtctc gcctggcgat 7800 aaccgtcttg tcggcggttg cgctgacgtt gcgtcgtgat atcatcaggg cagaccggtt 7860 acatccccct aacaagctgt ttaaagagaa atactatcat gacggacaaa ttgacctccc 7920 ttcgtcagta caccaccgta gtggccgaca ctggggacat cgcggcaatg aagctgtatc 7980 aaccgcagga tgccacaacc aacccttctc tcattcttaa cgcagcgcag attccggaat 8040 accgtaagtt gattgatgat gctgtcgcct gggcgaaaca gcagagcaac gatcgcgcgc 8100 agcagatcgt ggacgcgacc gacaaactgg cagtaaatat tggtctggaa atcctgaaac 8160 tggttccggg ccgtatctca actgaagttg atgcgcgtct ttcctatgac accgaagcgt 8220 caattgcgaa agcaaaacgc ctgatcaaac tctacaacga tgctggtatt agcaacgatc 8280 gtattctgat caaactggct tctacctggc agggtatccg tgctgcagaa 
cagctggaaa 8340 aagaaggcat caactgtaac ctgaccctgc tgttctcctt cgctcaggct cgtgcttgtg 8400 cggaagcggg cgtgttcctg atctcgccgt ttgttggccg tattcttgac tggtacaaag 8460 cgaataccga taagaaagag tacgctccgg cagaagatcc gggcgtggtt tctgtatctg 8520 aaatctacca gtactacaaa gagcacggtt atgaaaccgt ggttatgggc gcaagcttcc 8580 gtaacatcgg cgaaattctg gaactggcag gctgcgaccg tctgaccatc gcaccggcac 8640 tgctgaaaga gctggcggag agcgaagggg ctatcgaacg taaactgtct tacaccggcg 8700 aagtgaaagc gcgtccggcg cgtatcactg agtccgagtt cctgtggcag cacaaccagg 8760 atccaatggc agtagataaa ctggcggaag gtatccgtaa gtttgctatt gaccaggaaa 8820 aactggaaaa aatgatcggc gatctgctgt aatcattctt agcgtgaccg ggaagtcggt 8880 cacgctacct cttctgaagc ctgtctgtca ctcccttcgc agtgtatcat tctgtttaac 8940 gagactgttt aaacggaaaa atcttgatga atactttacg tattggctta gtttccatct 9000 ctgatcgcgc atccagcggc gtttatcagg ataaaggcat ccctgcgctg gaagaatggc 9060 tgacatgcgc taaccacgcc gtttgaactg gaaacccgct taatccccga tgagcaggcg 9120 atcatcgagc aaacgttgtg tgagctggtg gatgaaatga gttgccatct ggtgctcacc 9180 acgggcggaa ctggcccggc gcgtcgtgac gtaacgcccg atgcgacgct ggcagtagcg 9240 gaccgcgaga tgcctggctt tggtgaacag atgcgccaga tcagcctgca ttttgtacca 9300 actgcgatcc tttcgcgtca ggtgggcgtg attcgcaaac aggcgctgat ccttaactta 9360 cccggtcagc cgaagtctat taaagagacg ctggaaggtg tgaaggacgc tgagggtaac 9420 gttgtggtac acggtatttt tgccagcgta ccgtactgca ttcagttgct ggaagggcca 9480 tacgttgaaa cggcaccgga agtggttgca gcattcagac cgaagagtgc aagacgcgac 9540 gttagcgaat aaaaaatacc cgagcggggg gatctcaaaa caattagtgg gattcaccaa 9600 tcggcagaac ggtgcgacca aactgctcgt tcagtacttc acccatcgcc agatagattg 9660 cgctggcacc gcagatcagc ccaatccagc cggcaaagtg gatgattgcg gcgttaccgg 9720 caatgttacc gatcgccagc agggcaaaca gcacggtcag gctaaagaaa acgaattgca 9780 gaacgcgtgc gcctttcagc gtgccgaaga acataaacac ggtaaatacg ccccacagac 9840 ccaggtagac accaaggaac tgtgcatttg gcgcatcggt cagacccagt ttcggcatca 9900 gcagaatcgc aaccagcgtc agccagaaag aaccgtaaga ggtgaatgcg gttaaaccga 9960 aagtgttgcc ttttttgtac tccagcagac cagcaaaaat ttgcgcgatg ccgccgtaga 
10020 aaatgcccat ggcaagaata ataccgtcca gagcgaaata acccacgttg tgcaggttaa 10080 gcagaatggt ggtcatgccg aagcccatca ggcccagcgg tgccggatta gccaacttag 10140 tgttgcccat aattcctcaa aaatcatcat cgaatgaatg gtgaaataat ttccctgaat 10200 aactgtagtg ttttcagggc gcggcataat aatcagccag tggggcagtg tctacgatct 10260 tttgagggga aaatgaaaat tttcccggtt tccggtatca gacctgagtg gcgctaacca 10320 tccggcgcag gcaggcgatt tgcagtacgg ctggaatcgt cacgcgatag cgtgcgtgac 10380 cgctttaacc ccatttagtg ccgcacctac aggcctccca gcccgcgccg cgcagcaaac 10440 catgcccaag tacgctcatt gctgcgtggg tgcgtaaaat gcgggtcagt tggctggaaa 10500 gcaaatgcga cacacctttt gccaataatt tgtctttcat cagcagcggc agcagctctt 10560 ccagctcatt caccctggca tcgaccgcgt gcagaaactc ctgcttatgt tcctcgtcca 10620 ttttcttcca ggtattacgc agaaattgtt ccagtaactg ttgctcaatt tcaaacgtag 10680 acatctcttt gtcggctttc agcttcaatc gctttgaaac atcgagcaaa atggcccgat 10740 acaatttacc gtgtccgcgc agtttgttgc gatactatcg ccaccaaaat gctgtaattc 10800 tccggcaatc agctgccagt tgcggcgatg ttgctcggga tgcccttcca tcgatttaaa 10860 cagttcgttg cgcatcagta cgctggagag gcgagttttg cctttttcat tatgggtgag 10920 caatcgggcg aaatttgcca actgttcctc actacaatgc tgaagaaaat ccagatctga 10980 atcattcagg taattaacat tcattttttg tggcttctat attctggcgt tagtcgtcgc 11040 cgataatttt cagcgtggcc atatccgatg agttcaccgt atgacccgaa aaggtgattt 11100 ttgagacgca gcgtttattg tcgttatcgc tcttaatgtt gatccagtca gtggtttgcc 11160 cttcttttat ttctgaagga atattcaggc tctgacctgg cgctgacggg cggctgtgaa 11220 ataaaccgat gcaccgctta actgtaaatc gccatggtcg gcagagagtt gtatgcgttt 11280 cacaatgcga caaacaggaa gtttcagcgc cagatcgttg gtttcgttac gcggcattgc 11340 aatgcgccga ggagtttatg gtcgtttgcc tgcgccgtgc agcacagcat caggctaatc 11400 gccaggctgg cggaaatcgt aaaaacggat ttcataagga ttctcttagt gggaagaggt 11460 agggggatga atacccacta gtttactgct gataaagaga agattcaggc acgtaatctt 11520 ttctttttat tacaattttt tgatgaatgc cttggctgcg attcattctt tatatgaata 11580 aaattgctgt caattttacg tcttgtcctg ccatatcgcg aaatttctgc gcaaaagcac 11640 aaaaaatttt tgcatctccc ccttgatgac gtggtttacg 
accccattta gtagtcaacc 11700 gcagtgagtg agtctgcaaa aaaatgaaat tgggcagttg aaaccagacg tttcgcccct 11760 attacagact cacaaccaca tgatgaccga atatatagtg gagacgttta gatgggtaaa 11820 ataattggta tcgacctggg tactaccaac tcttgtgtag cgattatgga tggcaccact 11880 cctcgcgtgc tggagaacgc cgaaggcgat cgcaccacgc cttctatcat tgcctatacc 11940 caggatggtg aaactctagt tggtcagccg gctaaacgtc aggcagtgac gaacccgcaa 12000 aacactctgt ttgcgattaa acgcctgatt ggtcgccgct tccaggacga agaagtacag 12060 cgtgatgttt ccatcatgcc gttcaaaatt attgctgctg ataacggcga cgcatgggtc 12120 gaagttaaag gccagaaaat ggcaccgccg cagatttctg ctgaagtgct gaaaaaaatg 12180 aagaaaaccg ctgaagatta cctgggtgaa ccggtaactg aagctgttat caccgtaccg 12240 gcatacttta acgatgctca gcgtcaggca accaaagacg caggccgtat cgctggtctg 12300 gaagtaaaac gtatcatcaa cgaaccgacc gcagctgcgc tggcttacgg tctggacaaa 12360 ggcactggca accgtactat cgcggtttat gacctgggtg gtggtacttt cgatatttct 12420 attatcgaaa tcgacgaagt tgacggcgaa aaaaccttcg aagttctggc aaccaacggt 12480 gatacccacc tggggggtga agacttcgac agccgtctga tcaactatct ggttgaagaa 12540 ttcaagaaag atcagggcat tgacctgcgc aacgatccgc tggcaatgca gcgcctgaaa 12600 gaagcggcag aaaaagcgaa aatcgaactg tcttccgctc agcagaccga cgttaacctg 12660 ccatacatca ctgcagacgc gaccggtccg aaacacatga acatcaaagt gactcgtgcg 12720 aaactggaaa gcctggttga agatctggta aaccgttcca ttgagccgct gaaagttgca 12780 ctgcaggacg ctggcctgtc cgtatctgat atcgacgacg ttatcctcgt tggtggtcag 12840 actcgtatgc caatggttca gaagaaagtt gctgagttct ttggtaaaga gccgcgtaaa 12900 gacgttaacc cggacgaagc tgtagcaatc ggtgctgctg ttcagggtgg tgttctgact 12960 ggtgacgtaa aagacgtact gctgctggac gttaccccgc tgtctctggg tatcgaaacc 13020 atgggcggtg tgatgacgac gctgatcgcg aaaaacacca ctatcccgac caagcacagc 13080 caggtgttct ctaccgctga agacaaccag tctgcggtaa ccatccatgt gctgcagggt 13140 gaacgtaaac gtgcggctga taacaaatct ctgggtcagt tcaacctaga tggtatcaac 13200 ccggcaccgc gcggcatgcc gcagatcgaa gttaccttcg atatcgatgc tgacggtatc 13260 ctgcacgttt ccgcgaaaga taaaaacagc ggtaaagagc agaagatcac catcaaggct 13320 tcttctggtc tgaacgaaga 
tgaaatccag aaaatggtac gcgacgcaga agctaacgcc 13380 gaagctgacc gtaagtttga agagctggta cagactcgca accagggcga ccatctgctg 13440 cacagcaccc gtaagcaggt tgaagaagca ggcgacaaac tgccggctga cgacaaaact 13500 gctatcgagt ctgcgctgac tgcactggaa actgctctga aaggtgaaga caaagccgct 13560 atcgaagcga aaatgcagga actggcacag gtttcccaga aactgatgga aatcgcccag 13620 cagcaacatg cccagcagca gactgccggt gctgatgctt ctgcaaacaa cgcgaaagat 13680 gacgatgttg tcgacgctga atttgaagaa gtcaaagaca aaaaataatc gccctataaa 13740 cgggtaatta tactgacacg ggcgaagggg aatttcctct ccgcccgtgc attcatctag 13800 gggcaattta aaaaagatgg ctaagcaaga ttattacgag attttaggcg tttccaaaac 13860 agcggaagag cgtgaaatca gaaaggccta caaacgcctg gccatgaaat accacccgga 13920 ccgtaaccag ggtgacaaag aggccgaggc gaaatttaaa gagatcaagg aagcttatga 13980 agttctgacc gactcgcaaa aacgtgcggc atacgatcag tatggtcatg ctgcgtttga 14040 gcaaggtggc atgggcggcg gcggttttgg cggcggcgca gacttcagcg atatttttgg 14100 tgacgttttc ggcgatattt ttggcggcgg acgtggtcgt caacgtgcgg cgcgcggtgc 14160 tgatttacgc tataacatgg agctcaccct cgaagaagct gtacgtggcg tgaccaaaga 14220 gatccgcatt ccgactctgg aagagtgtga cgtttgccac ggtagcggtg caaaaccagg 14280 tacacagccg cagacttgtc cgacctgtca tggttctggt caggtgcaga tgcgccaggg 14340 attcttcgct gtacagcaga cctgtccaca ctgtcagggc cgcggtacgc tgatcaaaga 14400 tccgtgcaac aaatgtcatg gtcatggtcg tgttgagcgc agcaaaacgc tgtccgttaa 14460 aatcccggca ggggtggaca ctggagaccg catccgtctt gcgggcgaag gtgaagcggg 14520 cgagcatggc gcaccggcag gcgatctgta cgttcaggtt caggttaaac agcacccgat 14580 tttcgagcgt gaaggcaaca acctgtattg cgaagtcccg atcaacttcg ctatggcggc 14640 gctgggtggc gaaatcgaag taccgaccct tgatggtcgc gtcaaactga aagtgcctgg 14700 cgaaacccag accggtaagc tattccgtat gcgcggtaaa ggcgtcaagt ctgtccgcgg 14760 tggcgcacag ggtgatttgc tgtgccgcgt tgtcgtcgaa acaccggtag gcctgaacga 14820 aaggcagaaa cagctgctgc aagagctgca agaaagcttc ggtggcccaa ccggcgagca 14880 caacagcccg cgctcaaaga gcttctttga tggtgtgaag aagttttttg acgacctgac 14940 ccgctaacct ccccaaaagc ctgcccgtgg gcaggcctgg gtaaaaatag ggtgcgttga 15000 
agatatgcga gcacctgtaa agtggcgggg atcactccca taagcgctaa cttaagggtt 15060 gtggtattac gcctgatatg atttaacgtg ccgatgaatt actctcacga taactggtca 15120 gcaattctgg cccatattgg taagcccgaa gaactggata cttcggcacg taatgccggg 15180 gctctaaccc gccgccgcga aattcgtgat gctgcaactc tgctacgtct ggggctggct 15240 tacggccccg gggggatgtc attacgtgaa gtcactgcat gggctcagct ccatgacgtt 15300 gcaacattat ctgacgtggc tctcctgaag cggctgcgga atgccgccga ctggtttggc 15360 atacttgccg cacaaacact tgctgtacgc gccgcagtta cgggttgtac aagcggaaag 15420 agattgcgtc ttgtcgatgg aacagcaatc agtgcgcccg ggggcggcag cgctgaatgg 15480 cgactacata tgggatatga tcctcatacc tgtcagttca ctgattttga gctaaccgac 15540 agcagagacg ctgaacggct ggaccgattt gcgcaaacgg cagacgagat acgcattgct 15600 gaccggggat tcggttcgcg tcccgaatgt atccgctcac ttgcttttgg agaagctgat 15660 tatatcgtcc gggttcactg gcgaggattg cgctggttaa ctgcagaagg aatgcgcttt 15720 gacatgatgg gttttctgcg cgggctggat tgcggtaaga acggtgaaac cactgtaatg 15780 ataggcaatt caggtaataa aaaagccgga gctccctttc cggcacgtct cattgccgta 15840 tcacttcctc ccgaaaaagc attaatcagt aaaacccgac tgctcagcga gaatcgtcga 15900 aaaggacgag tagttcaggc ggaaacgctg gaagcagcgg gccatgtgct attgctaaca 15960 tcattaccgg aagatgaata ttcagcagag caagtggctg attgttaccg tctgcgatgg 16020 caaattgaac tggcttttaa gcggctcaaa agtttgctgc acctggatgc tttgcgtgca 16080 aaggaacctg aactcgcgaa agcgtggata tttgctaatc tactcgccgc atttttaatt 16140 gacgacataa tccagccatc gctggatttc ccccccagaa gtgccggatc cgaaaagaag 16200 aactaactcg ttgtggagaa taacaaaaat ggtcatctgg agcttacagg tggccattcg 16260 tgggacagta tccctgacag cctacaaaac gcaattgaag aacgcgaggc atcgtcttaa 16320 cgaggcaccg aggcgtcgca ttcttcagat ggttcaaccc ttaagttagc gcttatggga 16380 tcactccccg ccgttgctct tactcggatt cgtaagccgt gaaaacagca acctccgtct 16440 ggccagttcg gatgtgaacc tcacagaggt cttttctcgt taccagcgcc gccactacgg 16500 cggtgataca gatgacgatc agggcgacaa tcatcgcctt atgctgcttc attgctctct 16560 tctccttgac cttacggtca gtaagaggca ctctacatgt gttcagcata taggaggcct 16620 cgggttgatg gtaaaatatc actcggggct tttctctatc tgccgttcag 
ctaatgcctg 16680 agacagacag cctcaagcac ccgccgctat tatatcgctc tctttaaccc attttgtttt 16740 atcgattcta atcctgaaga cgcctcgcat ttttgtggcg taatttttta atgatttaat 16800 tatttaactt taatttatct cttcatcgca attattgacg acaagctgga ttatttttga 16860 aatattggcc taacaagcat cgccgactga caacaaatta attattactt ttcctaatta 16920 atccctcagg aatcctcacc ttaagctatg attatctagg cttagggtca ctcgtgagcg 16980 cttacagccg tcaaaaacgc atctcaccgc tgatggcgca aattcttcaa tagctcgtaa 17040 aaaacgaatt attcctacac tataatctga ttttaacgat gattcgtgcg gggtaaaata 17100 gtaaaaacga tctattcacc tgaaagagaa ataaaaagtg aaacatctgc atcgattctt 17160 tagcagtgat gcctcgggag gcattattct tatcattgcc gctatcctgg cgatgattat 17220 ggccaacagc ggcgcaacca gtggatggta tcacgacttt ctggagacgc cggttcagct 17280 ccgggttggt tcactcgaaa tcaacaaaaa catgctgtta tggataaatg acgcgctgat 17340 ggcggtattt ttcctgttag tcggtctgga agttaaacgt gaactgatgc aaggatcgct 17400 agccagctta cgccaggccg catttccagt tatcgccgct attggtggga tgattgtgcc 17460 ggcattactc tatctggctt ttaactatgc cgatccgatt acccgcgaag ggtgggcgat 17520 cccggcggct actgacattg cttttgcact tggtgtactg gcgctgttgg gaagtcgtgt 17580 tccgttagcg ctgaagatct ttttgatggc tctggctatt atcgacgatc ttggggccat 17640 cattatcatc gcattgttct acactaatga cttatcgatg gcctctcttg gcgtcgcggc 17700 tgtagcaatt gcggtactcg cggtattgaa tctgtgtggt gcacgccgca cgggcgtcta 17760 tattcttgtt ggcgtggtgt tgtggactgc ggtgttgaaa tcgggggttc acgcaactct 17820 ggcgggggta attgtcggct tctttattcc tttgaaagag aagcatgggc gttctccagc 17880 gaagcgactg gagcatgtgt tgcacccgtg ggtggcgtat ctgattttgc cgctgtttgc 17940 atttgctaat gctggcgttt cactgcaagg cgtcacgctg gatggcttga cctccattct 18000 gccattgggg atcatcgctg gcttgctgat tggcaaaccg ctggggatta gtctgttctg 18060 ctggttggcg ctgcgtttga aactggcgca tctgcctgag ggaacgactt atcagcaaat 18120 tatggtggtg gggatcctgt gcggtatcgg ttttactatg tctatcttta ttgccagcct 18180 ggcctttggt agcgtagatc cagaactgat taactgggcg aaactcggta tcctggtcgg 18240 ttctatctct tcggcggtaa ttggatacag ctggttacgc gttcgtttgc gtccatcagt 18300 ttgacaggac ggtttaccgg ggagccataa 
acggctccct tttcattgtt atcagggaga 18360 gaaatgagca tgtctcatat caattacaac cacttgtatt acttctggca tgtctataaa 18420 gaaggttccg tggttggcgc agcggaggcg ctttatttaa ctccacaaac cattaccgga 18480 cagattcgag cgctggaaga cgccctgcaa gcgaaattat ttaaacgcaa ggggacgtgg 18540 tctcgaaccc agcgagctgg agaactggtc tatcgctatg ccgataaaat gttcacctta 18600 agccaggaaa tgctggatat tgtgaactat cgcaaagaat ccaatttatt gtttgacgtt 18660 ggcgtggctg atgcactttc caaacgcctg gtcagtagcg tacttaacgc cgcagtggta 18720 gaaggcgagc ccattcatct tcgctgcttc gaatccaccc acgaaatgct gctggagcaa 18780 ttaagtcagc ataaactgga gatgatcatt tctgactgtc cgatagactc tacgcagcag 18840 gaaggcctgt tctccgtgag aattggcgaa tgtggcgtga gtttctggtg tacaaatcca 18900 ccaccagaaa aaccgttccc ggcttgtctg gaagaacggc gacttttgat tcctgggcga 18960 cgttcaatgt tagggcgcaa attgcttaac tggtttaact cccagggatt aaacgtagaa 19020 atcctcggcg agtttgatga tgccgctttg atgaaagctt ggctgcaggt ccttttggtg 19080 gcgatgcaaa tgcaatcttc gttgccccaa cgctttatgc atatgacttt tatgccgata 19140 aaactgtcgt agaaattggt cgcgtcgaga atgtgatgga agagtaccat gctatttttg 19200 ctgagcggat gattcagcac ccggcggtac agcgaatctg caatacggat tattctgcgc 19260 tttttagtcc agcggtgcgt taatcggcag ctcccccaaa gttaaggtgg gggagataga 19320 ttagttgtac attaccacga ttttgactcg gctcattatt tgcccgcttg agacattgtt 19380 tccatatgta cgcgggcgaa taaatagagg aatctgatta cttccttcat ggggatgctg 19440 aaaagagtag taattgctgg taatgactcc aacttattga tagtgtttta tgttcagata 19500 atgcccgatg actttgtcat gcagctccac cgattttgag aacgacagcg acttccgtcc 19560 cagccgtgcc aggtgctgcc tcagattcag gttatgccgc tcaattcgct gcgtatatcg 19620 cttgctgatt acgtgcagct ttcccttcag gcgggattca tacagcggcc agccatccgt 19680 catccatatc accacgtcaa agggtgacag caggctcata agacgcccca gcgtcgccat 19740 agtgcgttca ccgaatacgt gcgcaacaac cgtcttccgg agactgtcat acgcgtaaaa 19800 cagccagcgc tggcgcgatt tagccccgac atagccccac tgttcgtcca tttccgcgca 19860 gacgatgacg tcactgcccg gctgtatgcg cgaggttacc gactgcggcc tgagtttttt 19920 aagtgacgta aaatcgtgtt gaggccaacg cccataatgc gggctgttgc ccggcatcca 19980 acgccattca 
tggccatatc aatgattttc tggtgcgtac cgggttgaga agcggtgtaa 20040 gtgaactgca gttgccatgt tttacggcag tgagagcaga gatagcgctg atgtccggcg 20100 gtgcttttgc cgttacgcac caccccgtca gtagctgaac aggagggaca gctgatagaa 20160 acagaagcca ctggagcacc tcaaaaacac catcatacac taaatcagta agttggcagc 20220 atcacctacc tcaatgtgta tcacaatatc catattcttt gtgggggagt ctggagattg 20280 agtagatatt cttgttcaga atgtatcagc cgatggttct acgattctta agccacgaag 20340 agttcagata gtacaacggc atgtctcttt tgactatctg gcaaccggca gtgtgttctc 20400 tcacgcatca caaaagcagc aggcataaaa aaacccgctt gcgcgggctt tttcacaaag 20460 cttcagcaaa ttggcgatta agccagtttg ttgatctgtg cagtcaggtt agccttatga 20520 cgtgcagctt tgtttttgtg gatcagacct ttagcagcct gacggtccac gatcggttgc 20580 atttcgttaa atgctttctg tgcagcagct ttgtcgccag cttcgatagc tgcgtatact 20640 ttcttgatga aagtacgcat catagagcga cggcttgcgt tgtgcttacg agccttttca 20700 gactgaatgg cgcgcttctt agctgatttg atattagcca aggtccaact cccaaatgtg 20760 ttctatatgg acaattcaaa ggccgaggaa tatgcccttt tagccttctt ttgtcaatgg 20820 atttgtgcaa ataagcgccg ttaatgtgcc ggcactcgtt acgtagtgat ggcgcaggat 20880 tctaccagct tgcggggtgt gaatacagct tttccgcgat aaaaattgca gcaggcggtc 20940 agtttcttcc cgtgatttgc gccatggcaa tgaaaagcca cttctttctg atttcggtac 21000 tcaatcgccg gttaaccttg accgctgtac aaggtctact cggacgattt tcactgtttt 21060 gagccagaca tgaagctgat acgcggcata cataatctca gccaggcccc gcaagaaggg 21120 tgtgtgctga ctattggtaa tttcgacggc gtgcatcgcg gtcatcgcgc gctgttacag 21180 ggcttgcagg aagaagggcg caagcgcaac ttaccggtga tggtgatgct ttttgaacct 21240 caaccactgg aactgtttgc taccgataaa gccccggcaa gactgacccg gctgcgggaa 21300 aaactgcgtt accttgcaga gtgtggcgtt gattacgtgc tgtgcgtgcg tttcgacagg 21360 cgtttcgcgg cgttaaccgc gcaaaatttc gtcagcgatc ttctggtgaa gcatttgcgc 21420 gtaaaatttc ttgccgtagg tgatgatttc cctttggcgc tggtcgtgaa ggcgatttct 21480 tgttattaca gaaagctggc atggaatacg gcttcgatat caccagtacg caaacttttt 21540 gcagaggtgg cgtgcgcatc agcagcacgg ctgcgtcagg cccttgcgga tgacaatctg 21600 gctctggcag agagtttact ggggcacccg tttgccatct ccgggcgtgt agtccacggt 
21660 gatgaattag ggcgcactat aggtttcccg acggcgaatg taccgccgcg ccgtcaggtt 21720 tccccggtga aaggggttta tgcggtagaa gtgctgggcc tcggtgaaaa gccgttaccc 21780 ggcgtggcaa acatcggaac acgcccaacg gttgccggta ttcgccagca gctggaagtg 21840 catttgttag atgttgcaat ggacctttac ggtcgccata tacaagtagt gctgcgtaaa 21900 aaaatacgca atgagcagcg atttgcgtcg ctggacgaac tgaaagcgca gattgcgcgt 21960 gatgaattaa ccgcccgcga attttttggg ctaacaaaac cggcttaagc ctgttatgta 22020 atcaaaccga aatacggaac cgagaatctg atgagtgact ataaatcaac cctgaatttg 22080 ccggaaacag ggttcccgat gcgtggcgat ctcgccaagc gcgaacccgg aatgctggcg 22140 cgttggactg atgatgatct gtacggcatc atccgtgcgg ctaaaaaagg caaaaaaacc 22200 ttcattctgc atgatggccc tccttatgcg aatggcagca ttcatattgg tcactcggtt 22260 aacaagattc tgaaagacat tatcgtgaag tccaaagggc tttccggtta tgactcgccg 22320 tatgtgcctg gctgggactg ccacggtctg ccgatcgagc tgaaagtcga gcaagaatac 22380 ggtaagccgg gtgagaaatt caccgccgcc gagttccgcg ccaagtgccg cgaatacgcg 22440 gcgacccagg ttgacggtca acgcaaagac tttatccgtc tgggcgtgct gggcgactgg 22500 tcgcacccgt acctgaccat gggacttcaa aactggaagg ccaacatcat ccgcgcgctg 22560 ggcaaaatca tcggcaacgg tcacctgcac aaaggcgcga agccagttca ctggtgcgtt 22620 gactgccgtt ctgcgctggc gaagcggaag ttgagtatta cgacaaaact tctccgtcca 22680 tcgacgttgc tttccagggc agtcgatcag gatgcactga aagcaaaatt tgccgtaagc 22740 aacgttaacg gcccaatctc gctggtaatc tggaccacca cgccgtggac tctgcctgcc 22800 aaccgcgcaa tctctattgc accagatttc gactatgcgc tggtgcagat cgacggtcag 22860 gccgtgattc tggcgaaaga tctggttgaa agcgtaatgc agcgtatcgg cgtgaccgat 22920 tctcggcacg gtaaaaggtg cggagctgga gccgctgcgt ttacccatcc gtttatgggc 22980 ttcgacgttc cggcaatcct cggcgatcac gttaccctgg atgccggtac cggtgccgtt 23040 cacaccgcgc ctggccacgg cccggacgac tatgtgatcg gtcagaaata cggcctggaa 23100 accgctaacc cggttggccc ggacggcact tatctgccgg gcacttatcc gacgctggat 23160 ggcgtgaacg tcttcaaagc gaacgacatc gtcgctgcgc tgctgcagga aaaaggcgcg 23220 ctgctgcacg ttgagaaaat gcagcacagc tatccgtgct gctggcgtca caaaacgccg 23280 atcatcttcc gcgcgacgcc gcagtggttc gtcagcatgg 
atcagaaagg tctgcgtgcg 23340 cagtcactga aagagatcaa aggcgtgcag tggatcccgg actggggcca ggcgcgtatc 23400 gagtcgatgg ttgctaaccg tcctgactgg tgtatctccc gtcagcgcac ctggggtgta 23460 ccgatgtcac tgttcgtgca caaagacacg gaagagctgc atccgcgtac ccttgaactg 23520 atggaagaag tggcaaaacg cgttgaagtc gatggcatcc aggcgtggtg ggatctcgat 23580 gcgaaagaga tcctcggcga cgaagctgat cagtacgtga aagtgccgga cacattggat 23640 gtatggtttg actccggatc tacccactct tctgttgttg acgtgcgtcc ggaatttgcc 23700 ggtcacgcag cggacatgta tctggaaggt tctgaccaac accgcggctg gttcatgtct 23760 tccctaatga tctccaccgc gatgaagggt aaagcgccgt atcgtcaggt actcacccac 23820 ggctttaccg tggatggtca gggccgcaag atgtctaaat ccatcggcaa taccgtttcg 23880 ccgcaggatg tgatgaacaa actgggcgcg gatattctgc gtctgtgggt ggcatcaacc 23940 gactacaccg gtgaaatggc cgtttctgac gagatcctga aacgtgctgc cgatacgtat 24000 cgtcgtatcc gtaacaccgc gcgcttcctg ctggcaaacc tgaacggttt tgatccagca 24060 aaagatatgg tgaaacggag agagatggtg gtactggatc gctgggccgt agttgtgcga 24120 aagcggcaca ggaagacatc ctcaaggcgt acgaagcata cgatttccac gaagtggtac 24180 aagcgtctga tgcgcttctg ctccgttgag atgggttcct tctacctcga catcatcaaa 24240 gaccgtcagt actacgccaa aggacacagt gtggcgcgtc gtagctgcca gactgcgcta 24300 tatcacatcg cagaagcgct ggtgcgctgg atggcaccaa tcctctcctt caccgctgat 24360 gaagtgtggg gctacctgcc gggcgaacgt gaaaaatacg tcttcaccgg tgagtggtac 24420 gaaggcctgt ttggcctggc agacagtgaa gcgatgaacg atgcgttctg ggacgagctg 24480 ttgaaagtgc gtggcgaagt gaacaaagtc attgagcaag cgcgtgccga caagaaagtg 24540 ggtggctcgc tggaagcggc agtaaccttg tatgcagaac cggaactgtc ggcgaaactg 24600 accgcgctgg gcgatgaatt acgatttgtc ctgttgacct ccggcgctac cgttgcagac 24660 tataacgacg cacctgctga tgctcagcag agcgaagtac tcaaagggct gaaagtcgcg 24720 ttgagtaaag ccgaaggtga gaagtgccca cgctgctggc actacaccca ggatgtcggc 24780 aaggtggcgg aacacgcaga aatctgcggc cgctgtgtca gcaacgtcgc cggtgacggt 24840 gaaaaacgta agtttgcctg atgagtcaat cgatctgttc aacagggcta cgctggctgt 24900 ggctggtggt agtcgtgctg attatcgatc tgggcagcaa atacctgatc ctccagaact 24960 ttgctctggg ggatacggtc 
ccgctgttcc cgtcgcttaa tctgcattat gcgcgtaact 25020 atggcgcggc gtttagtttc cttgccgata gcggcggctg gcagcgttgg ttctttgccg 25080 gtattgcgat tggtattagc gtgatcctgg cagtgatgat gtatcgctcg aaggccacgc 25140 agaagctaaa caatatcgct tacgcgctga ttattggcgg cgcgctgggc aacctgttcg 25200 accgcctgtg gcacggcttc gttgtcgata tgatcgactt ctacgtcggc gactggcact 25260 tcgccacctt caaccttgcc gatactgcca tctgtgtcgg tgcggcactg attgtgctgg 25320 aaggtttttt gccttctaga gcgaaaaaac aataataaac cctgccggat gcgatgctga 25380 cgcatcttat ccggcctaca gattgctgcg aaatcgtagg ccggataagg cgtttacgcc 25440 gcatccggca aaaatcctta aatataagag caaacctgca tgtctgaatc tgtacagagc 25500 aatagcgccg tcctggtgca cttcacgcta aaactcgacg atggcaccac cgccgagtct 25560 acccgcaaca acggtaaacc ggcgctgttc cgcctgggtg atgcttctct ttctgaaggg 25620 ctggagcaac acctgttggg gctgaaagtg ggcgataaaa ccaccttctc gttggagcca 25680 gatgcggcgt ttggcgtgcc gtcaccggac ctgattcagt acttctcccg ccgtgaattt 25740 atggatgcag gcgagccaga aattggcgca atcatgcttt ttaccgcaat ggatggcagt 25800 gagatgcctg gcgtgatccg cgaaattaac ggcgactcca ttaccgttga tttcaaccat 25860 ccgctggccg ggcagaccgt tcattttgat attgaagtgc tggaaatcga tccggcactg 25920 gaggcgtaac atgcagatcc tgttggccaa cccgcgtggt ttttgtgccg gggtagaccg 25980 cgctatcagc attgttgaaa acgcgctggc catttacggc gcaccgatat atgtccgtca 26040 cgaagtggta cataaccgct atgtggtcga tagcttgcgt gagcgtgggg ctatctttat 26100 tgagcagatt agcgaagtac cggacggcgc gatcctgatt ttctccgcac acggtgtttc 26160 tcaggcggta cgtaacgaag caaaaagtcg cgatttgacg gtgtttgatg ccacctgtcc 26220 gctggtgacc aaagtgcata tggaagtcgc ccgcgccagt cgccgtggcg aagaatctat 26280 tctcatcggt cacgccgggc acccggaagt ggaagggaca atgggccagt acagtaaccc 26340 ggaaggggga atgtatctgg tcgaatcgcc ggacgatgtg tggaaactga cggtcaaaaa 26400 cgaagagaag ctctccttta tgacccagac cacgctgtcg gtggatgaca cgtctgatgt 26460 gatcgacgcg ctgcgtaaac gcttcccgaa aattgtcggt ccgcgcaaag atgacatctg 26520 ctacgccacg actaaccgtc aggaagcggt acgcgccctg gcagaacagg cggaagttgt 26580 gttggtggtc ggttcgaaaa actcctccaa ctccaaccgt ctggcggagc tggcccagcg 26640 
tatgggcaaa cgcgcgtttt tgattgacga tgcgaaagac atccaggaag agtgggtgaa 26700 agaggttaaa tgcgtcggcg tgactgcggg cgcatcggct ccggatattc tggtgcagaa 26760 tgtggtggca cgtttgcagc agctgggcgg tggtgaagcc attccgctgg aaggccgtga 26820 agaaaacatt gttttcgaag tgccgaaaga gctgcgtgtc gatattcgtg aagtcgatta 26880 agtcattagc agcctaagtt atgcgaaaat gccggtcttg ttaccggcat tttttatgga 26940 gaaaacatgc gtttacctat cttcctcgat actgaccccg gcattgacga tgccgtcgcc 27000 attgccgccg cgatttttgc acccgaactc gacctgcaac tgatgaccac cgtcgcgggt 27060 aatgtctcgg ttgagaaaac tacccgcaat gccctgcaac tgctgcattt ctggaatgcg 27120 gagattccgc tcgcccaagg ggccgctgtg ccactggtac gcgcaccgcg tgatgcggca 27180 tctgtgcacg gcgaatcggg aatggctggc tacgactttg ttgagcacaa ccgaaagccg 27240 ctcgggatac cggcgtttct ggcgattcgg gatgccctga tgcgtgcacc agagcctgtt 27300 accctggtgg ccatcggccc gttaaccaat attgcgctgt tactttcaca atgcccggaa 27360 tgcaagccgt atattcgccg tctggtgatc atgggtggtt ctgccggacg cggcaactgt 27420 acgccaaacg ccgagtttaa tattgctgcc gatccagaag ctgctgcctg tgtcttccgc 27480 agtggtattg aaatcgtcat gtgcggtttg gatgtcacca atcaggcaat attaactcct 27540 gactatctct ctacactgcc gcagttaaac cgtaccggga aaatgcttca cgccctgttt 27600 agccactacc gtagcggcag tatgcaaagc ggcttgcgaa tgcacgatct ctgcgccatc 27660 gcctggctgg tgcgcccgga cctgttcact ctcaaaccct gttttgtggc agtggaaact 27720 cagggcgaat ttacctcagg cacgacggtg gttgatatcg acggttgcct gggcaagcca 27780 gccaatgtac aggtggcatt ggatctggat gtgaaaggct tccagcagtg ggtggctgag 27840 gtgctggctc tggcgtcgta acctgtcaca tgttattggc atgcagtcat tcatcgactc 27900 atgcctttca ctgatatccc tccctgttta tcattaattt ctaattatca gcgtttttgg 27960 ctggcggcgt agcgatgcgc tggttactct gaaaacggtc tatgcaaatt aacaaaagag 28020 aatagctatg catgatgcaa acatccgcgt tgccatcgcg ggagccgggg ggcgtatggg 28080 ccgccagttg attcaggcgg cgctggcatt agagggcgtg cagttgggcg ctgcgctgga 28140 gcgtgaagga tcttctttac tgggcagcga cgccggtgag ctggccggag ccgggaaaac 28200 aggcgttacc gtgcaaagca gcctcgatgc ggtaaaagat gattttgatg tgtttatcga 28260 ttttacccgt ccggaaggta cgctgaacca tctcgctttt tgtcgccagc 
atggcaaagg 28320 gatggtgatc ggcactacgg ggtttgacga agccggtaaa caagcaattc gtgacgccgc 28380 tgccgatatt gcgattgtct ttgcggccaa ttttagcgtt ggcgttaacg tcatgcttaa 28440 gctgctggag aaagcagcca aagtgatggg tgactacacc gatatcgaaa ttattgaagc 28500 acatcataga cataaagttg atgcgccgtc aggcaccgca ctggcaatgg gagaggcgat 28560 cgcccacgcc cttgataaag atctgaaaga ttgcgcggtc tacagtcgtg aaggccacac 28620 cggtgaacgt gtgcctggca ccattggttt tgccaccgtg cgtgcaggtg acatcgttgg 28680 tgaacatacc gcgatgtttg ccgatattgg cgagcgtctg gagatcaccc ataaggcgtc 28740 cagccgtatg acatttgcta acggcgcggt aagatcggct ttgtggttga gtggtaagga 28800 aagcggtctt tttgatatgc gagatgtact tgatctcaat aatttgtaac cacaaaatat 28860 ttgttatggt gcaaaaataa cacatttaat ttattgatta taaagggctt taatttttgg 28920 cccttttatt tttggtgtta tgtttttaaa ttgtctataa gtgccaaaaa ttacatgttt 28980 tgtcttctgt ttttgttgtt ttaatgtaaa ttttgaccat ttggtccact tttttctgct 29040 cgtttttatt tcatgcaatc ttcttgctgc gcaagcgttt tccagaacag gttagatgat 29100 ctttttgtcg cttaatgcct gtaaaacatg catgagccac aaaataatat aaaaaatccc 29160 gccattaagt tgacttttag cgcccatatc tccagaatgc cgccgtttgc cagaaattcg 29220 tcggtaagca gatttgcatt gatttacgtc atcattgtga attaatatgc aaataaagtg 29280 agtgaatatt ctctggaggg tgttttgatt aagtcagcgc tattggttct ggaagacgga 29340 acccagtttc acggtcgggc cataggggca acaggttcgg cggttgggga agtcgttttc 29400 aatacttcaa tgaccggtta tcaagaaatc ctcactgatc cttcctattc tcgtcaaatc 29460 gttactctta cttatcccca tattggcaat gtcggcacca atgacgccga tgaagaatct 29520 tctcaggtac atgcacaagg tctggtgatt cgcgacctgc cgctgattgc cagcaacttc 29580 cgtaataccg aagacctctc ttcttacctg aaacgccata acatcgtggc gattgccgat 29640 atcgataccc gtaagctgac gcgtttactg cgcgagaaag gcgcacagaa tggctgcatt 29700 atcgcgggcg ataacccgga tgcggcgctg gcgttagaaa aagcccgcgc gttcccaggt 29760 ctgaatggca tggatctggc aaaagaagtg accaccgcag aagcctatag ctggacacaa 29820 gggagctgga cgttgaccgg tggcctgcca gaagcgaaaa aagaagacga gctgccgttc 29880 cacgtcgtgg cttatgattt tggtgccaag cgcaacatcc tgcggatgct ggtggataga 29940 ggctgtcgcc tgaccatcgt tccggcgcaa 
acttctgcgg aagatgtgct gaaaatgaat 30000 ccagacggca tcttcctctc caacggtcct ggcgacccgg ccccgtgcga ttacgccatt 30060 accgccatcc agaaattcct cgaaaccgat attccggtat tcggcatctg tctcggtcat 30120 cagctgctgg cgctggcgag cggtgcgaag actgtcaaaa tgaaatttgg tcaccacggc 30180 ggcaaccatc cggttaaaga tgtggagaaa aacgtggtaa tgatcaccgc ccagaaccac 30240 ggttttgcgg tggacgaagc aacattacct gcaaacctgc gtgtcacgca taaatccctg 30300 ttcgacggta cgttacaggg cattcatcgc accgataaac cggcattcag cttccagggg 30360 caccctgaag ccagccctgg tccacacgac gccgcgccgt tgttcgacca ctttatcgag 30420 ttaattgagc agtaccgtaa aaccgctaag taatcaggag taaaagagcc atgccaaaac 30480 gtacagatat aaaaagtatc ctgattctgg gtgcgggccc gattgttatc ggtcaggcgt 30540 gtgagtttga ctactctggc gcgcaagcgt gtaaagccct gcgtgaagag ggttaccgcg 30600 tcattctggt gaactccaac ccggcgacca tcatgaccga cccggaaatg gctgatgcaa 30660 cctacatcga gccgattcac tgggaagttg tacgcaagat tattgaaaaa gagcgcccgg 30720 acgcggtgct gccaacgatg ggcggtcaga cggcgctgaa ctgcgcgctg gagctggaac 30780 gtcagggcgt gttggaagag ttcggtgtca ccatgattgg tgccactgcc gatgcgattg 30840 ataaagcaga agaccgccgt cgtttcgacg tagcgatgaa gaaaattggt ctggaaaccg 30900 cgcgttccgg tatcgcacac acgatggaag aagcgctggc ggttgccgct gacgtgggct 30960 tcccgtgcat tattcgccca tcctttacca tgggcggtag cggcggcggt atcgcttata 31020 accgtgaaga gtttgaagaa atttgcgccc gcggtctgga tctctctccg accaaagagt 31080 tgctgattga tgagtcgctg atcggctgga aagagtacga gatggaagtg gtgcgtgata 31140 aaaacgacaa ctgcatcatc gtctgctcta tcgaaaactt cgatgcgatg ggcatccaca 31200 ccggtgactc catcactgtc gcgccagccc aaacgctgac cgacaaagaa tatcaaatca 31260 tgcgtaacgc ctcgatggcg gtgctgcgtg aaatcggcgt tgaaaccggt ggttccaacg 31320 ttcagtttgc ggtgaacccg aaaaacggtc gtctgattgt tatcgaaatg aacccacgcg 31380 tgtcccgttc ttcggcgctg gcgtcgaaag cgaccggttt cccgattgct aaagtggcgg 31440 cgaaactggc ggtgggttac accctcgacg aactgatgaa cgacatcact ggcggacgta 31500 ctccggcctc cttcgagccg tccatcgact atgtggttac taaaattcct cgcttcaact 31560 tcgaaaaatt cgccggtgct aacgaccgtc tgaccactca gatgaaatcg gttggcgaag 31620 tgatggcgat 
tggtcgcacg cagcaggaat ccctgcaaaa agcgctgcgc ggcctggaag 31680 tcggtgcgac tggattcgac ccgaaagtga gcctggatga cccggaagcg ttaaccaaaa 31740 tccgtcgcga actgaaagac gcaggcgcag atcgtatctg gtacatcgcc gatgcgttcc 31800 gtgcgggcct gtctgtggac ggcgtcttca acctgaccaa cattgaccgc tggttcctgg 31860 tacagattga agagctggtg cgtctggaag agaaagtggc ggaagtgggc atcactggcc 31920 tgaacgctga cttcctgcgc cagctgaaac gcaaaggctt tgccgatgcg cgcttggcaa 31980 aactggcggg cgtacgcgaa gcggaaatcc gtaagctgcg tgaccagtat gacctgcacc 32040 cggtttataa gcgcgtggat acctgtgcgg cagagttcgc caccgacacc gcttacatgt 32100 actccactta tgaagaagag tgcgaagcga atccgtctac cgaccgtgaa aaaatcatgg 32160 tgcttggcgg cggcccgaac cgtatcggtc agggtatcga attcgactac tgttgcgtac 32220 acgcctcgct ggcgctgcgc gaagacggtt acgaaaccat tatggttaac tgtaacccgg 32280 aaaccgtctc caccgactac gacacttccg accgcctcta cttcgagccg gtaactctgg 32340 aagatgtgct ggaaatcgtg cgtatcgaga agccgaaagg cgttatcgtc cagtacggcg 32400 gtcagacccc gctgaaactg gcgcgcgcgc tggaagctgc tggcgtaccg gttatcggca 32460 ccagcccgga tgctatcgac cgtgcagaag accgtgaacg cttccagcat gcggttgagc 32520 gtctgaaact gaaacaaccg gcgaacgcca ccgttaccgc tattgaaatg gcggtagaga 32580 aggcgaaaga gattggctac ccgctggtgg tacgtccgtc ttacgttctc ggcggtcggg 32640 cgatggaaat cgtctatgac gaagctgacc tgcgtcgcta cttccagacg gcggtcagcg 32700 tgtctaacga tgcgccagtg ttgctggacc acttcctcga tgacgcggta gaagttgacg 32760 tggatgccat ctgcgacggc gaaatggtgc tgattggcgg catcatggag catattgagc 32820 aggcgggcgt gcactccggt gactccgcat gttctctgcc agcctacacc ttaagtcagg 32880 aaattcagga tgtgatgcgc cagcaggtgc agaaactggc cttcgaattg caggtgcgcg 32940 gcctgatgaa cgtgcagttt gcggtgaaaa acaacgaagt ctacctgatt gaagttaacc 33000 cgcgtgcggc gcgtaccgtt ccgttcgtct ccaaagccac cggcgtaccg ctggcaaaag 33060 tggcggcgcg cgtgatggct ggcaaatcgc tggctgagca gggcgtaacc aaagaagtta 33120 tcccgccgta ctactcggtg aaagaagtgg tgctgccgtt caataaattc ccgggcgttg 33180 acccgctgtt agggccagaa atgcgctcta ccggggaagt catgggcgtg ggccgcacct 33240 tcgctgaagc gtttgccaaa gcgcagctgg gcagcaactc caccatgaag aaacacggtc 
33300 gtgcgctgct ttccgtgcgc gaaggcgata aagaacgcgt ggtggacctg gcggcaaaac 33360 tgctgaaaca gggcttcgag ctggatgcga cccacggcac ggcgattgtg ctgggcgaag 33420 caggtatcaa cccgcgtctg gtaaacaagg tgcatgaagg ccgtccgcac attcaggacc 33480 gtatcaagaa tggcgaatat acctacatca tcaacaccac ctcaggccgt cgtgcgattg 33540 aagactcccg cgtgattcgt cgcagtgcgc tgcaatataa agtgcattac gacaccaccc 33600 tgaacggcgg ctttgccacc gcgatggcgc tgaatgccga tgcgactgaa aaagtaattt 33660 cggtgcagga aatgcacgca cagatcaaat aatagcgtgt catggcagat atttttcatc 33720 cgctaatttg atcgaataac taatacggtt ctctgatgag gaccgttttt ttttgcccat 33780 taagtaaatc ttttggggaa tcgatatttt tgatgacata agcaggattt agctcacact 33840 tatcgacggt gaagttgcat actatcgata tatccacaat tttaatatgg ccttgtttaa 33900 ttgcttcaaa acgagtcata gccagacttt taatttgtga aactggagtt cgtatgtgtg 33960 aaggatatgt tgaaaaacca ctctacttgt taatcgccga atggatgatg gctgaaaatc 34020 ggtgggtgat agcaatagag agatctctat tcatttcgat attgaacaca gcaaggcggt 34080 taataccctg acttatattc tgtcggaagt cacagaaata agctgcgaag ttaagatgat 34140 ccctaataag ctggaagggc ggggatgcca gtgtcagcga ctggttaaag tggtcgatat 34200 ccatgagcaa atttacgcgc gcctgcgcaa taacagtcgg gaaaaattag tcggtgtaag 34260 aaagacgccg cgtattcctg ccgttccgct cacggaactt aaccgcgagc agaagtggca 34320 gatgatgttg tcaaagagat tgcgtcgtta attttatctc gttgataccg ggcgtcctgc 34380 ttgaccagat gcgatgttgt agcatcttat ccagcaacca ggtcgcatcc ggcaagatca 34440 ccgtttaggc gtcacatccg tcgtcccctg caaacggggg cgattttcct ccatttgcct 34500 cagtggctgc gtttcatgta acgatacatg acagcgcccg acaagatcct gatactcttt 34560 ggtattcaac cgtttccagt gtaactcgtc gtcactaaca ttgcgtacag cgcgggctgg 34620 cgtacccatc aacaactggc gtttctcgcc gcgaaagccc gctttgacaa agctcatggc 34680 ggcaacaatg ctctcttcgc caatgaccgc gccatccata atcacgctgt tcatcccgac 34740 caatgcatcg cgaccaatca aacaaccatg caggatcgct ccgtgcccga tatggccgtt 34800 ttccccaacg atagtgtcag tgtcgcagta gccatgcata atgcagccat cctgaatatt 34860 ggctcccgct tgcacgatca accgcccgta gtcaccacgc agtgaggcga gtgggccgat 34920 gtagacaccg gctcccacaa tcacatcgcc aatcaagacg 
gcactgggat ggacaaacgc 34980 cgtcgggtga accaccggaa ttaacccctc aaaggcgtaa tagctcacgg ttgttaacgt 35040 cctttccaca ccggatcgcg cttctcggca aacgccagcg gcccttcaat ggcatcttcc 35100 gaatgcagaa ccgatggata gtgtttcaac acgccgctgc gaatatagcg atacgcttct 35160 tctaccggca tttcgctggt ggtgcggtag atctctttca gcgccgcaat cgccagcggg 35220 gcgctgttaa ccagctgctg agccagttcg cgggcgttat ccatcagttc cgcctggcta 35280 accacgcggt tgactatccc ccaacgcagc gcctcttctg cgcccattcg tctgccggtc 35340 atcaccattt cattgacgat ggcaggcggc aggatcttcg gcagacgcag cacaccgccg 35400 ctgtcaggaa cgatgcccag tttggcttcc ggcagggcga agctggcgtt atcggcacaa 35460 acaataaaat ctgccgccag cgccagttca aaggcgccgc caaaggcata gccgttcaca 35520 gctgcgataa ccggtttgtc gagattgaaa atttcggtta atcccgcaaa accacccgga 35580 ccaaagtcag catccggtgc ttcgccttct gctgccgctt ttaaatccca gcccgcggaa 35640 aagaacttct ctccggcacc ggtaataatg gcgacacgta attgcggatc gtcacggaaa 35700 tttagaaata cttcgcccat ttcaaagctg gtttttgcat caatagcatt cgcttttgga 35760 cgatcaaggg taatttccag aattgatcca ttgcgggtca gatgtaatga ttcactcatt 35820 ccttttctcc atttttgctt tttcagggac gacaacatcc ctgcaaaaaa tgcatattgt 35880 tttagagtgt gattattagc tggcagggta gttccctgct gtttcattta tttcagattc 35940 tttctaatta ttttccccga gcaattacgt ggcagatctt ttctgatctc cagataagag 36000 ggcactttaa atttcgccat attttgttcg cagaagcgga aaaattcctc ttcgctcaat 36060 gtttcacctt cattcagcac cacaaatgct ttgatggctt catcgcgaat cgaatcttta 36120 atacccacaa ccacgatgtc ctgaattttc gggtgcgcgg cgataatatt ttccagctcc 36180 acgcaggaga cattctcgcc gccacgttta atcatattgc agcggcgatc gacgaaataa 36240 aaaaagtcct cttcgtcgcg gtatccggta tcgccggtat gcagccagcc atcggcttcc 36300 agcactttcg cagtggcttg tgggttgaga aagtactctt tgaagatggt tttcccaggt 36360 atgcctttaa tgcagatttc accgatctca ccagccggga gcgggcgatt gtgatcgtcg 36420 cggatctccg cttcgtagca aaaccccacc cgaccaatcg acggccagcg tcgtttatcg 36480 ccaggacgat caccgataat gcccacaatg gtttccgtca tcccataaga cgtcagcaag 36540 cgaacgccga agcgttcaca aaacgcatct ttttcctgct ccgacaagtt gagataaaac 36600 atcacttccc gcaggcggtg 
ttgctgatcg ttcgctgaag gcggctgtac catcaacgta 36660 cggatcatca tcggaataca ttcggtaacg gtggcgcggt acttctgtac ctgtccccag 36720 aaggcgcggg cgctgtattt ctcgaccagc acaaaggtgg ccccggcaga aaacgccgcc 36780 atcgccgcag tacactggca atcgatatga aacgcaggca ttaccgtcag gtagacgtca 36840 tcgtcacgca gtgcacactg ccaggcggag taatatccag cgaagcgcag gttgtaatgg 36900 gtaatcacca cacctttcgg tcgggaggtg gtgccggagg tgaagagaat ttccgccgta 36960 tcgtcagtcg atagcggcgg tgcatagcac aaggtggcag gttgttgatt tttcagttga 37020 gtaaacgaac tcacgccatc atcagcggga agtgccacat ctgtcaggca aatgtgccgc 37080 aattgagtgg catcttcctg ctgaatctgt tgatacatag gatagaattg cgcactggtc 37140 accagcaggc acgcctggct attttgcagg atccacgcgc tttcctcgca caacaggcgg 37200 gcgttaatcg gcaccataat cgcgccaatt tttgccagcc cgaaccagca aaagataaat 37260 tccgggcagt tgtcgagatg tagtgcaacc ttgtcgcctt tgcgaatccc cagcgtataa 37320 aacaggtttg ccgtgcggtt aatctcctga tttaactcaa gataactata ccggttaacg 37380 actccgccgc tggattcaca aatcagcgcc gttttatgac cgtaaacgtc cgcaagatcg 37440 tcccacattt gacgtagatg ttgtccgcca atgatatcca ttgcacctct atccattttt 37500 gttcgtttgt tattgggcgg gcgctagtca ggcaagccga ctgacgccac gcgtttagtc 37560 ctcaactttg gccagacctt tgctgaccaa ctcctgaatg tcgttttcgc tgtagccgat 37620 atttttcaaa atggcagccg tgtccatgcc atgtgagggc attccgcgcc agatttgtcc 37680 ggggttattt ttgaatttcg gcatgatgtt cggccctttg caggtgcgac catccatcgt 37740 ttgccactga gtgattgatt cgcgagccac atactgtgga ttgctttcca gttccggtac 37800 ggtcagcact ttggcgcagg cgatattcag ttcagcaaag cgttctttta cttccgcgat 37860 ggtatgtgtc gccagccagg catcgagttt ctcttcaacc agtgggccgt aagggcattc 37920 gatacggtgg ataagctgag tgccttccgg gatttctggc gtgccaagca gatgtgcgag 37980 gccaatatct ttaaagcact cttcaatttg ggtaatgccc accagttcca tcacgatgta 38040 gccgtcggca catttataca gaccgcaacc ggcgtagtag ggatctttac ctttgctcat 38100 gcgcgggcac atttcgccgc cgttgaagta atccatcatg aagtactggc ccatacgcag 38160 catcacttca tacatggcga tgtcgatact ttcgccttta ccggtttcac gcactttatg 38220 cagtgctgcc agcgccgccg tggtggcggt caggccagaa aagtaatcgg cggtatacgg 38280 
gaaggcaggc attggctggt caacatcacc gttctgaatc aggtaaccac taaaggcctg 38340 ggcgatagtg ttataggccg gaagattggt gtactcctcg gtgccgtact gaccaaaacc 38400 ggacaggtga gcgataacca gtttcgggtt gtgctgccac agtacttcat cggtaatgcc 38460 acgacgggca aaggccggac ctttactggc ttcgatgaag atatcggtgg tttccattaa 38520 tttcagaaac gcttcgcggc cttcatcttt gaaaatattt aacgacagcg cgtgcaaatt 38580 gcggcgggag agttgcgggt agttcggttg aacgcgaatg gtgtcggccc aggcgacgtt 38640 ctcgatccag ataacttccg cgccccattc tgcgaacatt tgcccggcaa acggtccggc 38700 gatttcgata ccggagaaga caacgcgcaa tccggccaac ggcccgaatt tcggcatggg 38760 tagatgatcc attatttgct cctgaaaaat ttatgtagcg catgactgcc ggatgcggcg 38820 taaacgcttt atccggccta cattcgtgct cccgtaggcc tgataagacg catcagcgcg 38880 gcatcaggca gcgcacggac ttagcggtat tgcttcagca ccgcacgacc cagcgtcagg 38940 atctgcattt cgtcagatcc cccggagacg cggtctacac gcagatcacg ccagaagcgg 39000 ctgatgcggt ggttgcccgc aatcccgaca ccgcccagca cctgcattgc gctatccaca 39060 acttcaaatg ccgcattggc gcagaagtat ttgcacatcg ctgcatcgcc agaggtgatg 39120 gtgccgttgt ctgctttcca cgctgcttca tacagcatgt ttttcatgga gtttaatttg 39180 atcgccatgt gggcgaattt ttcctgaatc aactggaaac gaccaatagc ctcgccaaac 39240 tgcacgcgct gattggcgta gcgcgccgca tcttcaaagg cgcacatcgc cgtaccgtag 39300 ttggtgaggg ctaccaggaa acgttcatgg tcgaactctt ctttgacgcg gttaaagccg 39360 ttaccttccc gaccgaacat gtctttctcg tccagttcca cgtcgtcaaa ggtgatttca 39420 cagcagctat ccatacgcag accgagcttt tcaagtttgg tcactttgat gcccggtttg 39480 ctcatatcaa caaaccatcc ggtgtagaca ggtttgtccg gagaagcccc gtcgcgcgcc 39540 atcaccacga tgtacggggt gtaggcgctg ctggtaataa aacacttact accattaaga 39600 taaatcttac catttttacg ggtataagtc gttttcaggc tacccacgtc ggagcccgcg 39660 cccggttcgg taatcgctga gttccacatc tgcttaccgg tgccgcggaa agccataatt 39720 ttgtcgatct gctcttgtgt gccttcgcgc aggaaggtgt tgaacccgcc cggcaactgg 39780 tacagcacat aggttggtgc ccccagacgt cccagctcca tccacacggc ggcgagagta 39840 acaaaccccg cgtccagacc accgtgctct tcagggatca gcagactgtc gatacccata 39900 tccgccagtg ctttgacaaa acgttccggg tagacgctgt cacggtcgca 
ctcggcaaaa 39960 taggcctccc agttttcgct ggccatcagt tcgcggatac cggcgacaaa cagttcctgc 40020 tcatcattta aattaaaatc catctttcaa cctcttgata ttttgggggg ttaattaatc 40080 tttccagttc tgtttcgcgt ctttaataaa ggagagcgtc accataatgt tgacgaagaa 40140 cagcgggcat cctccggcga taatggcggt ttgaatcggt ttcaggccgc cgagcgccag 40200 cagaacaata ccgataatgc caaccagaat tgaccaaccg atacgcacca gcagaggtgg 40260 ttcttcacca tcgcgtactt cgcggcaagt ggacatcgcc agggtataag agcaggcgtt 40320 aaccagcgta acggtggcaa taaagcagag gatgaagaag ccccacatgg tggcggtgct 40380 gagtggcaga gcggcccagg tttcaatgat ggcgcgcgcc acaccgtact gttcgatcag 40440 atttggaatg ttgatgatgt ttttatctat caacagcaga gtgttactac cgagtacagt 40500 ccacaggatc caggttgacg ctgtcagccc cagcaccatg ccgaagcaca gttcacgcac 40560 agtacgacca cgggagatgc gggcgaggaa gatactcatc tggatagcat aaatcaccca 40620 ccatgcccag tagaacacgg tccagccctg cgggaagccg cctttagcga tgggatcggt 40680 atagaacaac atgcgcggca gatacatcag caacatcccc accgaatcgg tgaagtagtt 40740 catgatgaag ctggcaccgc tgacaatgaa cacccaaccc agcatcagga agctcaggta 40800 actacgcacg tcactggcga tacgtacccc tttttgcaga ccgcaagcga cgcaaatggc 40860 gttgaggata atccagcagg taatgatgat agcgtccagt tgcagggtat gcggaatgcc 40920 aaacaaccat tgcatacact cggtcaccag cggcgtggca aggcccagac tggtacccat 40980 cgtgaagatc aaggcgacga gatagaagtt gtcgacgata gtgccgaaca accctttggc 41040 gtgtttttca cctaccagcg gcaccagtgt cgagctgggg cgaatcactt ccattttgcg 41100 gacaaagaag aagtaagcga aggcgactga aaggaagctg taagtggccc acggcagagg 41160 tccccagtgg aacaagctgt aagccagccc caactctttc gcccctgtcg agttcggttc 41220 taagccaaac ggcggggtgg agatgtagta gtagatctca atcgatcccc agaacagtac 41280 ggcagcagac gtacaggagg cgaacatcat aaagatccaa ctggcggtgc taaattctgg 41340 cggttcgtta cctaaacgct ttttggcata cgggccaaac accagccaga accaaccgaa 41400 aagcatcacc accatatacc attcaaatgc ccatccccat acattggtga cgtaactgaa 41460 tacagcatta ataacgacat tcgctgcatc cagatctctg actgtaagcc aacaaagtat 41520 gccgacgatt attaacggcg gaaagaaaac cttcggttct attcccgttt ttctcttttc 41580 attcttcatg agttaattcc actgtgaaaa 
cgaatattta ttttgcgttc ccgtttgttt 41640 tatttttgtt aacatttaat ataattatta ttaacctcgt ggacgcgtta atggctaact 41700 cataatgggt attcaataag ctgtattctg tgattggtat cacatttttg tttcgggtga 41760 atagagggcg ttttttcgtt aattttgatt aataatcagt ttgttatgct ctgttgtgag 41820 taaaaaataa catctgactt tcaatattgg tgatccataa aacaatattg aaaatttctt 41880 tttgctacgc cgtgttttca atattggtga ggaacttaac aatattgaaa gttggattta 41940 tctgcgtgtg acattttcaa tattggtgat taaagtttta tttcaaaatt aaagggcgtg 42000 atatctgtaa ttaacaccac cgatatgaac gacgtttcct tcatgatttc tggagatgca 42060 atgaagatta ttacttgcta taagtgcgtg cctgatgaac aggatattgc ggtcaataat 42120 gctgatggtt cattagactt cagcaaagcc gatgccaaaa taagccaata cgatctcaac 42180 gctattgaag cggcttgcca gctaaagcaa caggcagcag aggcgcaggt gacagcctta 42240 agtgtgggcg gtaaagccct gaccaacgcc aaagggcgta aagatgtgct atcgcgcgac 42300 ccggatgaac tgattgtggt gattgatgac cagttcgagc aggcactgcc gcaacaaacg 42360 gcgagcgcac tggctgcagc cgcccagaaa gcaggctttg atctgatcct ctgtggcgat 42420 ggttcttccg acctttatgc ccagcaggtt ggtctgctgg tgggcgaaat cctcaatatt 42480 ccggcagtta acggcgtcag caaaattatc tccctgacgg cagataccct caccgttgag 42540 cgcgaactgg aagatgaaac cgaaacctta agcattccgc tgcctgcggt tgttgctgtt 42600 tccactgata tcaactcccc acaaattcct tcgatgaaag ccattctcgg cgcggcgaaa 42660 aagcccgtcc aggtatggtc ggcggcggat attggtttta acgcagaggc agcctggtca 42720 gaacaacagg ttgccgcgcc gaaacagcgc gaacgtcagc gcatcgtgat tgaaggcgac 42780 ggcgaagaac agatcgccgc atttgctgaa aatcttcgca aagtcattta accacagggg 42840 atgctacgaa cacattttct caagtctggg tattcagcga taccccttct cgtctgccgg 42900 aactgatgaa cggtgcgcag gctttagcta atcaaatcaa cacctttgtc ctcgatgatc 42960 ggcgacggtg tacaggcaat ccagctcggc gctaatcatg tctggaaatt aaacggcaaa 43020 ccggacgatc ggatgatcga agattacgcc ggtgtcatgg ctgacactat tcgccagcac 43080 ggcgcagacg gcctggtgct gctgccaaac acccgtcgcg gcaaattact ggcggcaaaa 43140 ctgggttatc gccttaaagc ggcggtgtct aacgatgcca gcaccgtcag cgtacaggac 43200 ggtaaagcga cagtgaaaca catggtttac ggtggtctgg cgattggcga agaacgcatt 43260 gccacgccgt 
atgcggtact gaccatcagc agcggcacgt tcgatgcggc tcagccagac 43320 gcgtcacgca ctgtgaaacg cacaccgtgg agtggcaggc tccggctgtg gcgattaccc 43380 gcacggcaac ccaggcgcgc cagagcaaca gcgtcgatct cgacaaagcc cgtctggtgg 43440 tcagcgtcgg tcgcggtatt ggcagcaaag agaacattgc gctggcagaa cagctttgca 43500 aggcgatagg tgcggagttg gcctgttctc gtccggtggc ggaaaacgaa aaatggatgg 43560 agcacgaacg ctatgtcggt atctccaacc tgatgctgaa acctgaactg tacctggcgg 43620 tggggatctc cgggcagatc cagcacatgg ttggcgctaa cgcgtcgcaa accattttcg 43680 ccatcaataa agataaaaat gcgccgatct tccagtacgc ggattacggc attgttggcg 43740 acgccgtgaa gatccttccg gcgctgaccg cagctttagc gcgttgatcc actctggcag 43800 ggctgcattt tggccctgcc gctgacaggg agctcttatg tccgaagata tctttgacgc 43860 catcatcgtc ggtgcagggc ttgctggttc ggttgccgca ctggtgctcg cccgcgaagg 43920 tgcgcaagtg ttagttatcg agcgtggcaa ttccgcaggt gccaagaacg tcaccggcgg 43980 gcgtctctat gcccacagtc tggaacacat tattcctggt ttcgccgact ccgcccccgt 44040 agaacgcctg atcacccatg aaaaactcgc gtttatgacg gaaaagtcag cgatgactat 44100 ggactactgc aatggtgacg aaacctcgcc atcccagcgt tcttactccg ttttgcgcag 44160 taaatttgat gcctggctga tggagcaggc cgaagaagcg ggcgcgcagt taattaccgg 44220 gatccgcgtc gataacctcg tacagcgcga tggcaaagtc gtcggtgtag aagccgatgg 44280 cgatgtgatt gaagcgaaaa cggtgatcct tgctgatggg gtgaactcca tccttgccga 44340 aaaattgggg atggcaaaac gcgtcaaacc gacggatgtg gcggttggcg tgaaggaact 44400 gatcgagtta ccgaagtcgg ttattgaaga ccgttttcag ttgcagggta atcagggggc 44460 ggcttgcctg tttgcgggat cacccaccga tggcctgatg ggcggcggct tcctttatac 44520 caatgaaaac accctgtcgc tggggctggt ttgtggtttg catcatctgc atgacgcgaa 44580 aaaatcggtg ccgcaaatgc tggaagattt caaacagcat ccggccgttg caccgctgat 44640 cgcgggcggc aagctggtgg aatattccgc tcacataatg ccggaagcag gcatcaacat 44700 gctgccggag ttggttggtg acggcgtatt gattgccggt gatgccgccg gaatgtgtat 44760 gaacttcggt tttaccattc gcggtatgga tctggcgatt gccgccgggg aagccgcagc 44820 aaaaaccgtg ctttcagcga tgaaaagcga cgatttcagt aagcaaaaac tggcggaata 44880 tcgtcagcat cttgagagtg gtccgctgcg cgatatgcgt atgtaccaga aactaccggc 
44940 gttccttgat aacccacgca tgtttagcgg ctacccggag ctggcggtgg gtgtggcgcg 45000 tgacctgttc accattgatg gcagcgcgcc ggaactgatg cgcaagaaaa tcctccgcca 45060 cggcaagaaa gtgggcttca tcaatctaat caaggatggc atgaaaggag tgaccgtttt 45120 atgacttctc ccgtcaatgt ggacgtcaaa ctgggcgtca ataaattcaa tgtcgatgaa 45180 gagcatccgc acattgttgt gaaggccgat gctgataaac aggtgctgga gctgctggtg 45240 aaagcgtgcc ccgcaggtct gtacaagaag caggatgacg gcagtgtgcg cttcgattac 45300 gccggatgtc tggagtgcgg cacctgtcgc attctggggc tggggtcggc gctggaacag 45360 tgggaatacc cgcgcggcac ctttggtgtg gagttccgtt acagctaatg ttgctttgat 45420 acgtaacgcc gcactgactc tcattgcaaa aaacatgaat agctatgcaa ccgtccagaa 45480 accttgacga tctcaaatgc ttgtcccact accgccgcat tttgctgtgg ggaagcggtg 45540 gtccgtttct gtatggttat gtactggtaa tgattggcgt ggcgctggag caactgacgc 45600 cggcgctgaa actggacgct gactggattg gcttgctggg cgcgggaacg ctcgccgggc 45660 tgttcgttgg cacatcgctg tttggttata tttccgataa agtcggacgg cgcaaaatgt 45720 tcctcattga tatcatcgcc atcggcgtga tatcggtggc gacgatgttt gtttcatccc 45780 ccgtcgaact gttggtgatg cgggtattta tcggcattgt catcggtgca gattatccca 45840 tcgccacctc aatgatcaac gagttctcca gtacccgtca gcgggcggtt ttccatcagc 45900 tttattgccg cgatgtggta tgttggcgcg acctgtgccg atctggtcgg ctactggctt 45960 tatgatgtgg aaggcggctg gcgctggatg ctgggtagcg cggcgatccc ctgtctgttg 46020 attttgattg gtcagattcg aactgcctga atctccccgc tggttattat gcaaagggcg 46080 agtaaaagag tgcgaggaga tgatgatcaa actgtttgga gaaccggtgg ctttcgatga 46140 agagcagccg cagcaaaccc ggttttcgcg atctgtttaa tcgctgccat tttccttttg 46200 ttctgtttgt tgccgccatc tggacctgcc aggtgatccc aatgttcgcc atttacacct 46260 ttggcccgca aattgttggt ttgttgggat tgggggttgg caaaaacgcg gcactaggaa 46320 atgtggtgat tagcctgttc tttatgctcg gctgtattcc gccgatgctg tggttaagca 46380 ctgccggacg gcgtccattg ttgattggca gctttgccat gatgacgctg gcgctggcgg 46440 ttttggggct aatcccggat atggggatct ggctggtagt gatggccttt gcggtgtatg 46500 cctttttctc tggcgggccg ggtaatttgc agtggctcta tcctaatgaa ctcttcccga 46560 cagatatccg cgcctctgcc gtgggcgtga ttatgtcctt 
aagtcgtatt ggcaccattg 46620 tttcgacctg ggcactaccg atctttatcg ataattacgg tatcagtaac acgatgctaa 46680 tgggggcggg tatctcgctg tttggcttgt tgatttccgt agcgtttgcc ccggagactc 46740 gagggatgtc actggcgcag accagcaata tgacgatccg cgggcagaga atggggtaaa 46800 ttgttcagat ttctctcttt tctgaatcaa tattattgac tataagccgc gtgaatatat 46860 gactacactt tgtgggaaaa caaaggcgta atcacgcggg ctacctatga ttcttataat 46920 ttatgcgcat ccgtatccgc atcattccca tgcgaataaa cggatgcttg aacaggcaag 46980 gacgctggaa ggcgtcgaaa ttcgctctct ttatcaactc tatcctgact tcaatatcga 47040 tattgccgcc gagcaggagg cgctgtctcg cgccgatctg atcgtctggc agcatccgat 47100 gcagtggtac agcattcctc cgctcctcaa actttggatc aataaagttt tctcccacgg 47160 ctgggcttac ggacatggcg gcacggcgct gcatggcaaa catttgctgt gggcggtgac 47220 gaccggcggc ggggaaagcc attttgaaat tggtgcgcat ccgggctttg atggtctgtc 47280 gcagccgcta caggcgacgg caatctactg cgggctgaac tggctgccac cgtttgccat 47340 gcactgcacc tttatttgtg acgacgaaac cctcgaaggg caggcgcgtc actataagca 47400 acgtctgctg gaatggcagg aggcccatca tggatagcca tacgctgatt caggcgctga 47460 tttatctcgg ttcggcagcg ctgattgtac ccattgcggt acgtcttggt ctgggatcgg 47520 tacttggcta cctgatcgcc ggctgcatta ttggcccgtg ggggctgcga ctggtgaccg 47580 atgccgaatc tattctgcac tttgccgaga ttggggtggt gctgatgctg tttattatcg 47640 gcctcgaact cgatccacaa aggctgtgga agctgcgtgc ggcagtgttc ggctgtggcg 47700 cattgcagat ggtgatttgc ggcggcctgc tggggctgtt ctgcatgtta cttgggctgc 47760 gctggcaggt cgcggaattg atcggcatga cgctggcgct ctcctctacg gcgattgcca 47820 tgcaggcgat gaatgaacgc aatctgatgg tgacgcaaat gggtcgcagt gcctttgcgg 47880 tgctgctgtt ccaggatatc gcggcgatcc cgctggtggc gatgattccg ctactggcaa 47940 cgagcagtgc ctcgacgacg atgggcgcat ttgctctctc ggcgttaaaa gtggcgggtg 48000 cgctggtgct ggtggtattg ctggggcgct atgtcacgcg tccggcgctg cgttttgtag 48060 cccgctctgg cttgcgggaa gtgtttagtg ccgtggcgtt attcctcgtg tttggctttg 48120 gtttgctgct ggaagaggtc ggcttgtcga tggcgatggg cgcgtttctg gcgggcgtac 48180 tgctggcaag ctcggaatac cgtcatgcgc tggagagcga tatcgaacca tttaaaggtt 48240 tgctgttggg gctgtttttc 
atcggtgttg gcatgtcgat agactttggc acgctgcttg 48300 aaaacccatt gcgcattgtc attttgctgc tcggtttcct catcatcaaa atcgccatgc 48360 tgtggctgat tgcccgaccg ttgcaagtgc caaataaaca gcgtcgttgg tttgcggtgt 48420 tgttagggca gggcagtgag tttgcctttg tggtatttgg cgcggcgcag atggcgaatg 48480 tgctggagcc ggagtgggcg aaatcgctga ccctggcggt ggcgctgtcg atggcagcaa 48540 cgccgattct gctggtgatc ctcaatcgcc ttgagcaatc ttctactgag gaagcgcgtg 48600 aagccgatga gatcgacgaa gaacagccgc gcgtgattat cgccggattc ggtcgttttg 48660 ggcagattac cggacgttta ctgctctcca gcggggtgaa aatggtggta ctcgatcacg 48720 atccggacca tatcgaaacc ttgcgtaaat ttggtatgaa agtgttttat ggcgatgcca 48780 cgcggatgga tttactggaa tctgccggag cggcgaaagc ggaagtgctg attaacgcca 48840 tcgacgatcc gcaaaccaac ctgcaactga cagagatggt gaaagaacat ttcccgcatt 48900 tgcagattat tgcccgcgcc cgcgatgtcg accactacat tcgtttgcgt caggcaggcg 48960 ttgaaaagcc ggagcgtgaa accttcgaag gtgcgctgaa aaccgggcgt ctggcactgg 49020 aaagtttagg tctggggccg tatgaagcgc gagaacgtgc cgatgtgttc cgccgcttta 49080 atattcagat ggtggaagag atggcaatgg ttgagaacga caccaaagcc cgcgcggcgg 49140 tctataaacg caccagcgcg atgttaagtg agatcattac cgaggaccgc gaacatctgt 49200 cattaattca acgacatggc tggcagggaa ccgaagaagg taaacatacc ggcaacatgg 49260 cggatgaacc ggaaacgaaa ccctcatcct aataaagagt gacgtaaatc acacttacag 49320 ctaactgttt gttttgtttc attgtaatgc ggcgagtcca gggagagagc gtggactcgc 49380 cagcagaata taaaattttc ctcaacatca tcctcgcacc agtcgacgac ggtttacgct 49440 ttacgtatag tggcgacaat tttttttatc gggaaatctc aatgatcagt ctgattgcgg 49500 cgttagcggt agatcgcgtt atcggcatgg aaaacgccat gccgtggaac ctgcctgccg 49560 atctcgcctg gtttaaacgc aacaccttaa ataaacccgt gattatgggc cgccatacct 49620 gggaatcaat cggtcgtccg ttgccaggac gcaaaaatat tatcctcagc agtcaaccgg 49680 gtacggacga tcgcgtaacg tgggtgaagt cggtggatga agccatcgcg gcgtgtggtg 49740 acgtaccaga aatcatggtg attggcggcg gtcgcgttta tgaacagttc ttgccaaaag 49800 cgcaaaaact gtatctgacg catatcgacg cagaagtgga aggcgacacc catttcccgg 49860 attacgagcc ggatgactgg gaatcggtat tcagcgaatt ccacgatgct gatgcgcaga 49920 
actctcacag ctattgcttt gagattctgg agcggcggta attttgtata gaatttacgg 49980 ctagcgccgg atgcgacgcc ggtcgcgtct tatccggcct tcctatatca ggctgtgttt 50040 aagacgccgc cgcttcggcc aaatccttat gccggttcga cggctggaca aaatactgtt 50100 tatcttccca gcgcaggcag gttaatgtac caccccagca gcagccggta tccagcgcgt 50160 atataccttc cggcgtacct ttgccctcca gcgatgccca gtgaccaaag gcgatgctgt 50220 attcttcagc gacagggcca ggaatcgcaa accacggttt cagtggggca ggggcctctt 50280 ccggcgattc tttgctgtac atatccagtt gaccgttcgg gaagcaaaaa cgcatacggg 50340 taaaagcgtt ggtgataaaa cgcagtcttc ccagcccccg caattccggt gaccagttat 50400 ttggcatatc gccgtacatg gcatcaagaa agaagggata ggagtcactc gatagcaccg 50460 cttctacatc gcgtgcgcac tctttggcgg tctgcagatc ccactgcggc gtgatccctg 50520 cgtgggccat caccagcttt ttctcttcgt cgatttgcag cagaggctgg cgccgcagcc 50580 agttaagcag ctcgtcggca tccggcgctt ccagcagcgg tgtcaggcga tctaacggtt 50640 tattgcggct gatcccggca aataccgcca gcagatgcag atcgtgattg cccagcacca 50700 gacgtacgct gtcgcctaag gatttcacat agcgcagaac atccagcgaa cccggcccgc 50760 gcgcgaccag atcgcccgtc agccagaggg tatctttccc aggggtaaat tctactttat 50820 gcagcaatgc gatcagttca tcgtaacaac catgaacgtc gccaataagg tatgtcgcca 50880 tattctttta atgaatgagt gtgggaacgg cgagtcggaa tacgggaatg tcgatgctga 50940 aagggacgcc attttcatcg atcatttcgt agtgaccctg catggtgccc agcggggttt 51000 caatgattgc accgctggtg tactggtact cttcgccagg cgcgataagt ggctggacgc 51060 caaccactcc ttcgccctgg acttcggttt cacggccatt gccattggtg atcagccagt 51120 aacgccccaa caactgcact ggcgctcgcc ccagattgcg tatggttacg gtataagcaa 51180 aaacgtaacg ttcattatca ggtgaagatt gagcctcaat gtagacgctt tgaacctgaa 51240 tacacactcg gggcgaattg atcatcgtta actctcctgc aaaggcgcgt tctccgccca 51300 gatagttcgc catctggcaa tattgcgcga cagagatatt ttccgctcgc atcgccgggt 51360 cgatccccat tcccgttaac acctcgacgc taaacaggtt gccgaggctg ttacgaatgg 51420 ttttacgacg ctggttaaag gcttcggtgg tgatgcggct caacacacga acatctttaa 51480 ccgggtgagg catcgttgca tgaggaacca ggcgcacgac ggcggaatcc actttgggtg 51540 gtggtgtaaa ggctgacggc ggtacttcca gtaccgggat cacattgcaa 
tagtattgcg 51600 ccatgacgct taatcgacca tacgctttgc tgttcggtcc tgcaaccaga cgattcacca 51660 cctctttttg caacataaag tgcatgtcgg caatggcatc agtatagcta aacagatgga 51720 acatcaacgg cgtggagatg ttataaggca ggttgccgaa aacacgcagc ggctgaccca 51780 ttttctcggc cagttcacca aagttaaagg tcatcgcatc ctgctgataa atcgtcagtt 51840 tcgggcctaa gaatggatgc gtttgcagac gtgccgccag atcgcggtca agttcgatga 51900 ccgtcagctg gtccagacgt tcgccgaccg gttcggtcaa tgccgccaga ccggggccga 51960 tttcgaccat cgcctggccc ttttgcgggt taatggcaga cacaatactg tcgatcacga 52020 actgatcgtt gagaaagttt tgcccgaagc gtttacgggc taagtggccc tggtggactc 52080 gattattcat tgggtgttaa caatcatttt gatggcgaga ttaagcgccg taataaaact 52140 gccgacatcg gctttgccac gtcccgccag ttcaagcgcg gtgccgtggt ccactgatgt 52200 gcgaataaag ggcaggccca gcgtaatgtt cacaccgcgc ccgaagccct ggtattttag 52260 cacgggaaga ccctgatcgt ggtacatcgc cagcacggcg tcggcgttat caagatattt 52320 cggctgaaac agggtatcgg caggcagcgg cccgttgagt ttcatcccct gcgcccgcag 52380 ctcattgagc accggaataa tggtgtctat ctcttccgta cccatatgac cgccttcgcc 52440 cgcgtgcgga ttcagcccgc agaccagaat gcgcggttcg gcaataccaa atttggtccg 52500 caaatcgtga tgcaaaatag caatcacttc gtgcaaaagt gcaggggtga tagcgtctgc 52560 gatatcgcgc agcggtaaat gcgtcgttgc cagcgccacg cgaagttctt cggtcgccag 52620 catcatcacc acctttttcg cctgcgaacg ctcttcgaaa aactcggtat gaccggtaaa 52680 aggaatgcca gcgtcgttaa taacgccttt atgcaccgga cctgtgatca gcgcggcaaa 52740 ttcgccgttc agacaaccat cgcacgctcg cgccagcgtt tccaccacat aatgcccatt 52800 ttcaaccgct aactgccccg cagtgacagg tgcacgtagc gcgacaggaa gtagcgttaa 52860 tgtgcccgca gtttgcggtt gtgcagggga gttgggggaa taagggcgga gggtgagcgg 52920 caaaccgagc atcgctgccc ggttggtaag gagagtggca tcggcacaaa caaccagttc 52980 gaccggccac tcacgctgtg caagctggac aactaagtcc gggccaatcc cggcgggctc 53040 gccgggagtg atcacaacac gttgggtttt aaccattagt tgctcaggat tttaacgtag 53100 gcgctggcac gttgttcctg catccagctt gctgcttctt ccgagaactt acggttcatc 53160 agcatgcggt atgcacgatc tttctgcgca gcgtcggttt tatcgacatt acgggtatcc 53220 agcagttcga ttaaatgcca gccgaatgaa 
gagtgaaccg gtgcactcat ttgacctttg 53280 ttcaggcgag tcagggcgtc acggaaggcc ggatcgaaaa tatctggtgt agcccagccg 53340 agatcgccgc cctggttagc agagcctgga tcctgagaga actctttcgc tgcggcagca 53400 aaagtcgttt taccactctt gatatcagca gcaatctgtt ccagtttcac acgggcctgt 53460 tcgtcagtca tgatcggcga cggtttcagc agaatatggc gagcatgaac ttcggtcacc 53520 gagatatttt tgctttcgcc gcgcaggtcg ttaactttca gaatatggaa gccaacgccg 53580 gaacgaatcg ggccaacaat gtcgcctttc ttcgcggtgc ttaatgcctg ggcgaagatc 53640 ccgggcaact cctgaatacg gccccagccc atctggccgc cgttcagcgc ctgctggtcg 53700 gcagaatgag caatcgccag cttaccgaaa tcagcgccgt tacgcgcctg atcgacaatg 53760 gcgcgcgcct ggctttccgc ttcgttcacc tgatcagagg tcgggttttc cggcagcggg 53820 atcaggatgt ggctcaggtt cagctcagtg ctggcgtcgt tttggttacc cacctgctgc 53880 gccagggatt cgacttctgc ggcaggatgg tgatgcgacg acgcacctcg ttgttacgca 53940 cttcagagat aatcatctct ttgcggatct ggttacgata ggtgttgtag ttcagtccat 54000 cgtaagccag acggctgcgc atctgatcca gcgtcatgtt gttctgtttc gcgatgttag 54060 caatcgcctg atccagctgc tcatcggaga ttttcactcc cattttctgc cccatctgca 54120 ggatgatttg atccatgatc aaacgttcca tgatttggtg gcgcagcgtc gcgtcatcag 54180 gaagttgctg ccttgcctga gcagcgttca gttttaccga ctgcattaat ccatcaacgt 54240 cgctttccag cacgacgccg ttattgacga cggctgcgac tttatcgact acctgggggg 54300 cagcgaaact ggtattcgcg atcatggcga taccgagaag cagcgttttc cagttcttca 54360 tactttttcc atttcaatta accgcactgc ggattacgtg gtaaatcaac aaatcacaaa 54420 gtgttttgat acggcagaat gttcgaacgc agcatctctt gcgtacccag accgtagttg 54480 gagctcaggc cgcgaagttc gatgttaaag ccgattgcgt tgtcatatac cgcatgttgt 54540 ttatcgttat cccaaccgtt cagcttccgc tcgtaaccga cgcgaattgc atagcagcag 54600 gagctgtatt gcacacctaa catagagtcg gcttgcttgt tagcattggt gtcgtagtag 54660 taggccccaa caatggacca acgatcggca attggccagc tggcgacagc acctacctgc 54720 gaaataccat tcttatattg ctcagcagtg gaatagtact taggcagcgt agcctgaata 54780 tattccgggc tggcgtaacg gtaattcagc tgtaccagac ggtcttcatc ccgacggtat 54840 tcaatgctgg agttactggt cgctacgtta tccagacgtg tatcgtactg aatcccgcca 54900 cgcaatcccc 
aacgctcgga gatacgccag taagtatcgc ctgcccacac cagtgaaccc 54960 gttttgtcgt cattctccca tgttatgttg tcatcgccag tgcgagactc cgtgaaatag 55020 tagatttgac caacggaaat attaaaacgt tcaacggcag catcatcata tatgcgagat 55080 gtgacaccgg tcgtcacctg gttagcggag gcaatacggt caagaccgcc gtaagtccgg 55140 tcccggaaca ggccagagta gtcagattgc agcagagacg agtcgtagtt atagatgtcg 55200 ctctgatcgc gatacggcac gtacaaatac tgcgcgcgcg gttccagcgt ttgggtataa 55260 cccggagcca gcatttccat atcgcgttca aagaccattt tgccgtcaac tttgaattgc 55320 ggcattacgc ggttaacgga ttcgtccagc ttggtcgtgt ttctggagtt ataccagtca 55380 agattggttt gctgataatg ggttgccagc aacttcgctt cggtattgat gctgccccag 55440 ttattagaga gcggcaaatt gatggtcggt tccaggtgaa cacgggttgc ttcaggcatg 55500 tcgtctctgg tgttaacaaa gtgcactgcc tggccgtaaa tacgcgtatc aaacggacca 55560 acatcattct ggtagtaatt aacgtctaac tgcggctctg ccgagtagct actggtgttc 55620 tgttcgctga aaacctggaa ctgcttggtt gaaacggtgg cattgaagtt ttgcaccgca 55680 tagccaacgc tgaatttttg cgttgcgtag ccgtcagtac tggaaccgta cttgttatcg 55740 aaatcattga agtagctagg atcgctgacc ttggtgtagt cgacgttgaa acgccacacc 55800 tgatccatga ccccggagtg gttccagtag aataaccaac gacgtgaact gtcatcgttc 55860 gggtgttcat cttcatagac tttatctgaa ggcagatagt ccagttccat caagccagcg 55920 cccgcctggg agaggtagcg gaattcgttc tcccacatga tgttgccacg acgatgcata 55980 taatgcggcg tgatggtggc atccatattt ggcgcgatgt tccagtaata tggcaggtag 56040 aactcaaagt agttggtggt ggtgtacttg gcgttcggga tcaagaaacc agagcgacgt 56100 ttgtcaccca ccggcaactg caaatagggg ctataaaaga tcggtaccgg acccacctta 56160 aagcgggcgt tccagatctc cgcaacttgt tcttcgcggt catgaataat ttcgctacct 56220 accacgctcc aggtgtcaga acccggcaga caggaggtaa agctaccgtt atccagaatg 56280 gtatagcggt tttcgccacg ttgtttcatc aggtccgctt taccgcgacc ctggcgaccc 56340 accatctggt aatcaccttc ccagacgttg gtatctttgg tgttcagatt cgcccagcct 56400 ttcggccctt tgaggatcac ctggttatcg tcgtaatgga cattaccgag cgcatcaacg 56460 gtacgtaccg gctccggttg tcctggtgcc tctttttgat ggagctgcac ttcgtcggcc 56520 tgcagacggc tgttaccctg catgatatcc acgctgccag taaacacggc gtcatccggg 
56580 tagtcccctt tcgcgtggtc agcattgata gtcacgggta agtcattggt atcgccctgt 56640 accagaggac ggtcatagct tggcacgccc aacatgcact gtgaggcgag gtcggctgcc 56700 agtccctgtt gactataaag ggcggtggca atcatggtgg ccaggagagt ggggatacgt 56760 tttttcatac gttgatttta ttgttccatc atcggtaacg ttgcgcgtga caaacggtca 56820 gagactaacg tactcgtcat ctctacgcta gtgttaatcc tgtccgaata gcgtcagtgg 56880 tgttaggcac ggcattgaat gacaggtatg ataatgcaaa ttataggcga tgtcccacaa 56940 ttgaccgcag ccggaaaacg gtaaaagcac ctttatattg tgggagatag ccctgatatc 57000 cgtgtgtcga tttggggaat atatgcagta ttggggaaaa atcattggcg tggccgtggc 57060 cttactgatg ggcggcggct tttggggcgt agtgttaggc ctgttaattg gccatatgtt 57120 tgataaagcc cgtagccgta aaatggcgtg gttcgccaac cagcgtgagc gtcaggcgct 57180 gttttttgcc accacttttg aagtgatggg gcatttaacc aaatccaaag gtcgcgtcac 57240 ggaggctgat attcatatcg ccagccagtt gatggaccga atgaatcttc atggcgcttc 57300 ccgtactgcg gcgcaaaatg cgttccgggt gggaaaatca gacaattacc cgctgcgcga 57360 aaagatgcgc cagtttcgca gtgtctgctt tggtcgtttt gacttaattc gtatgtttct 57420 ggagatccag attcaggcgg cgtttgctga tggttcactg cacccgaatg aacgggcggt 57480 gctgtatgtc attgcagaag aattagggat ctcccgcgct cagtttgacc agtttttgcg 57540 catgatgcag ggcggtgcac agtttggcgg cggttatcag cagcaaactg gcggtggtaa 57600 ctggcagcaa gcgcagcgtg gcccaacgct ggaagatgcc tgtaatgtgc tgggcgtgaa 57660 gccgacggat gatgcgacca ccatcaaacg tgcctaccgt aagctgatga gtgaacacca 57720 tcccgataag ctggtggcga aaggtttgcc gcctgagatg atggagatgg cgaagcagaa 57780 agcgcaggaa attcagcagg catatgagct gataaagcag cagaaagggt ttaaatgacc 57840 ctgtaaatga tgctgagtaa ctgcccacga ttaaaggtgg ccgccctggc ggtcacttct 57900 ttgagaaaag gcgtttactc agaatggtgg acaggctcaa tgcacggttt acgggagggg 57960 ttctgtaggt tttatcgcgt tgaccctgct taaggttgag agctttacga cgagcggaat 58020 tatattttta cgtcttaaaa ataaaaaaca catacctgaa tgagcgattt ttgaaagtat 58080 atttattcag aacgcgcatc atgagttttt aactcaatgc gaggctatta ccatgaaagt 58140 atcagttcca ggcatgccgg ttacactttt aaatatgagc aagaacgata tttataagat 58200 ggtgagcggg gacaagatgg acgtgaagat gaatatcttt 
caacgcttgt gggagacgtt 58260 acgccatctg ttctggtcag ataaacagac tgaggcttat aaacttctgt tcaatttcgt 58320 gaataaccag actggcaaca tcaacgcctc agaatacttt actggggcta tcaacgagaa 58380 tgagagagaa aagtttatca atagcctgga attattcaat aaacttaaaa catgcgcaaa 58440 aaatccggat gagttggtcg caaagggcaa tatgcgctgg gtcgcccaga ccttcgggga 58500 tatcgagtta agtgtcactt ttttcattga aaagaataag atatgtactc agacgttgca 58560 gctgcataag ggccaaggta acttgggcgt tgatcttaga aaggcttacc ttcccggcgt 58620 tgacatgagg gattgttacc ttggtaaaaa aacaatgaaa ggtagcaatg atatccttta 58680 tgagagacct gggtggaatg ctaacctggg cgtgctaccc cggacggtgc taccccggac 58740 ggtgctaacc cggacggtgc taacctggac ggtgctaccg tgaacggtgc tacctcctta 58800 tatgatgagg taattattat taataaaatc ccccccaaaa aaattgatac taaaggagtt 58860 gctactgaag aagttgctac taaaaaagta ctgctgaaca aattactgac aacgcaatta 58920 ttgaatgagc cagaataagc taaggttgaa ggggctggaa cgccccttca accttagcag 58980 tagcgtggga tgatttcaca attagaaaga cctgcatgat gagctagaga agaggctagt 59040 gacgcaaggc gtcgtgcagg acacggatca ccgagatggg catcgccaac cagactgcta 59100 attagcccat gaataacaat cagaaaggac cataacagac ccgttaaaat gaaatataag 59160 agacggtcaa cgggtgaaga aaaagttcaa aaattcgctg tggagcagga agggaattac 59220 cgaatggaaa gcgtagccac acgcaacaac tgaaagcagt ttggcagaaa caaaaaatcc 59280 ccggactcgg ggatttatgt acaagaggca gcccttagga tgagggtata aacgtacagg 59340 aaaggttaaa aatccgctgg cgctttaaac gtcatactat tgccatacgc cggatgggta 59400 atcgtcaaca tctctgcatg tagcaacaaa cgtggtgcca tcgctctcgc ttctggtgat 59460 gcataaaaac gatcgccgag aatcggatga cccagcgcca gcatatgcac acgcaattga 59520 tgcgaacgcc cggtaatcgg ttttaacacc actcttgccg tgttatccgc cgcatactcc 59580 accacttcat attccgtctg cgccaggttt acccgttttc gtaacagcac tttctgtttc 59640 gggcggtttg gccagtcgca aatcagcggc agatccacca gaccttctgc gggggatgga 59700 tgcccccaga cgcgggccac atactgcttt ttcggctcgc gctcgcgaga actggcgttt 59760 taactcccgc tccgcggctt tggtcagcgc cactacaatc acgccgctgg tagccatatc 59820 cagacgatgc accgattctg cctgcggata atcacgctga atgcgcgtca tcacgctgtc 59880 tttgtgctct tccagacgac 
ccggcactga caacaaaccg ctcggcttgt tgaccaccat 59940 aatatggtca tcctgataca ggataaccaa ccagggttcc tgcggtggat tgtagttttc 60000 catccccatt ttcggctccg ttactgatgc gttacaacga tcaaacgcag ggcatccaga 60060 cgccaacctg cctgatccag gctttccatt acctgctgac ggttgctctc aatggcggtc 60120 agttcgtcgt cacgaatgtt cgggttcact gcacgcagag cttccagacg agacagctcg 60180 gcagacagtt tttcgtcggc ttcgttacgc gctgcatcaa tcaatgcacg ggcagatttc 60240 tcgatctgcg cttcacccag ttgaaggata gcgtgaacat cctgctgcac ggcgttaacc 60300 agtttgctgc cggtgtgacg gttaaccgcg ttaagctggc ggttaaaggt ttcaaactct 60360 acctgcgccg ccaggttgtt gccgttttta tccagcagca tacgtaccgg cgtcggtggc 60420 aggaagcggt tgagctgcaa ctgcttcgga gcctgggctt caaccacata aatcagttcc 60480 accaacagcg tacctaccgg caacgctttg ttttttaaca gtgaaatcgt gctgctaccg 60540 gtatcgccag aaaggatcag atccagaccg ttgcggatca gcggatgctc ccaggtaata 60600 aactgtgcat cttcacgcgc cagcgccact tcacgatcaa aggtgatggt gatgccatct 60660 tccgacaggc cagggaagtc cggcaccagc atatgatcgg acggcgtcag cacgatcatg 60720 ttgtcgccgc gatcgtcctg attgataccg ataatatcga acaggttcat ggcgaaggcg 60780 atcaggttgg tatcgtcatc ctgctcttca atgctttctg ccagtgcctg ggctttttcg 60840 ccaccgttgg agtggatttc cagcaggcgg tcacgaccct gttccagctg tgctttcagc 60900 gcttcatgtt gctcgcggca gtttttgatc agatcgtcaa agccttcggt ttgatccggt 60960 gaagccagat agttaatcag atcgttgtat acgctatcgt aaatagtgcg tccggtcggg 61020 caggtgtgct caaatgcatc cagaccttcg tgataccagc gcaccagcac cgactgagcg 61080 gttttctcca gataaggcac atggatctga atatcgtgcg cctggccgat acgatccaga 61140 cgaccaatac gctgctccag tagatccggg ttgaatggca ggtcaaacat caccatgtgg 61200 ctggcgaact ggaagttacg tccttcagaa ccgatttctg agcacagcag tacctgtgcg 61260 ccggtgtctt cttcggcaaa ccaggcggca gcgcggtcac gttcgataat cgacatacct 61320 tcgtggaaca ccgcagcgcg aataccttca cgttcgcgca gtacctgctc cagttgcagc 61380 gcagtggcag ctttggcgca gatcaccagc actttctgag agcgatggct ggtcaggtag 61440 cccatcagcc actcaacgcg cggatcgaag ttccaccagg tggcgttatc accttcaaat 61500 tcctgataaa tacgctccgg gtagagcata tcgcgagcac gatcttccgc acttttacgt 61560 
gcgcccataa tgccggagac tttaatagcc gtctgatact gcgtcggtag cggcagctta 61620 atggtgtgca gctcgcgttt cgggaatcct ttcacaccgt tacgcgtgtt acggaacagc 61680 acgcggctgg tgccgtggcg atccatcagc atcgaaacca gctcctgacg ggcgctctgg 61740 gcatcttcgc tgtcgctgtt tgctgcctgc aacagcggct cgatatcctg ctcgccgatc 61800 atctcgccga gcatgttcag ttcgtcattg ctcagtttgt tacctgccag cagcatggca 61860 acggcgtccg caaccggacg ataatttttc tgctcttcaa cgaactgcgc aaaatcgtgg 61920 aaacggttcg ggtccagcag acgcagacgg gcgaagtggc tttccatccc cagctgttcc 61980 ggggtcgcgg tcagcagcag aacgcccggc acgtgctctg ccagttgttc aatggcctga 62040 tattcacggc ttggcgcatc ttcgctccac accaggtgat gcgcttcatc gaccaccagc 62100 aggtcccatt cggcttcaca gagatgttcc aggcgctgtt tgctacgacg ggcaaaatcc 62160 agcgagcaaa tcaccagctg ttcggtgtca aacgggttgt aagcatcgtg ctgagcttcg 62220 gcataacgct catcatcaaa tagcgcaaag cgcaggttga aacggcgcag catttctacc 62280 agccactgat gctgtaaggt ttccgggacg ataattagca cacgttcagc agcgccagag 62340 agcagttgct gatgcaggat catcccggct tcaatggttt tccctaaacc cacttcgtca 62400 gccagcagga cgcgcggcgc gtggcggcga ccaacatcat gagcgatgtt gagctgatgc 62460 gggatcaggc tggtacgctg accgcgcagg ccgctgtacg gcatacggaa ctgttcgctg 62520 gaatatttac gcgcgcgata acgcagcgca aagcggtcca tacggtcaat ctgcccggca 62580 aacagacggt cctgcggttt gctgaacacc agtttgctat caaggaaaac ttcacgcagg 62640 gctacgccgg actcttcagt atccaggcga gtaccgatat aggtcagcaa gccattttct 62700 tcttttactt cttcgacttg catctgccag ccgtcatggc tggtaatggt atcaccaggg 62760 ttgaacatca cgcgggtcac gggggaatca ctgcgtgcgt acagacggtt ttcaccagta 62820 gatgggaaaa gtaaagtgac agttcgcgca tccaccgcga caacggttcc aagtcccaat 62880 tcgctttctg tatcgctgat ccagcgttga ccaagtgtaa aaggcatatg tgttcggctc 62940 tatatcttta attgcaggca ataaccaccc gctaccgtgc ttatgaggta gtggtgttat 63000 tcaggtccag gaatggaaag ggcgctatgg tactggatgg caaagcattc gtcacgcatc 63060 aaaatggtat ctggcgaact cttttttttg ctcaaaatag cccaagttgc ccggtcataa 63120 gtgtagcaaa attatcctca ataaaaggga gtattccctc cgccacgggt tgtagctggc 63180 gggtcagata gtgttcgtaa tccagtggtg aacgttggta gtccagcggc 
tccgggccgt 63240 tggtggtcca tacgtactta atggtgccgc gattctgata ttgcaagggg cgaccacgct 63300 tttggttttc ttcatcggca aggcgagcgg cgcgtacatg aggcggcaca ttacgctgat 63360 actcgctcag cggacggcga aggcgtttac ggtaaaccag tcgcgcatcc agttcacccg 63420 ccatcagttt gtcgatggtt tcgcgtacat attcctgata tggctcgttg cggaagatgc 63480 gcaggtatag ctcctgctga aactgctggg ccagcggcgt ccagtcggtg cgcacggttt 63540 ccagcccttt aaacaccatc cgctgcttgt cgccctcctg aatcagtccg gcataacgct 63600 ttttactgcc ggtatcggct ccgcgaatgg ttggcatcag aaaacggcag aaatgggttt 63660 catactccag ttctaatgcg ctggtcagcc gttgtttttg cagcgtttcc gcccaccagg 63720 cgttaacgtg ctgcaccagt gcacgaccga ttttcgccgc ttcttcttcc gaatgtgcgc 63780 ctttcagcca gacaaacgtt gagtcggtat cgccgtagat aacgtcgtag ccctgtgctt 63840 caatcaacgc tttggtttgc cgcatgatct gatgaccacg catggtgatc gacgatgcca 63900 gccgcggatc gaagaagcgg caggcggtgg tgccgagcac gccataaaag gcattcatga 63960 tgattttcag cgcctgcgac agcggtttgt taccctggcg tttggcttca tcgcgcccgt 64020 gccagatgtt agtcacaatc tccggcaggc aatgtttttc tcgcgagaac caggcatcga 64080 gaaaaccttc ggtactgtgc tctggatcag gctgcgccat gccttccacc agcccgacgg 64140 gatcaatcag aaaggtgcgg atgatcgacg ggtacaggct tttatagtcc agcaccagca 64200 ctgaatcata aagccctggc cgtgaatcca tcacgtagcc gccagggctg gcgtgcggcg 64260 gcacttcgcc gagattaggc gcgacataac cagcgcgatg cattcgcgga aaatagagat 64320 gaccaaatgc cgccaccgaa ccgccgtgtc ggtccaccgg caggccgttc accgttgccc 64380 gttcgagtaa aaatggcatg atttcagttt tgtggaagat ctgcgtcacc agctcgcaat 64440 ctttcaggtt ataagttgcc agcgcaggtt tatcttcggc gaaacggcgg tcaatttcgt 64500 ccattcgatc ccacgggtta tcgatagatt ttccttcgcc taatagctcc tgagcgacag 64560 tttccagcga gaatgaagag aaatcccaga acgcggattt cagcgcctcg ataccgtcga 64620 taattagccg acctttagcc tggggcaaaa aagacgccgt ttttgcgggg ccgtgctcgc 64680 gccactccag ctcgctatta tcgcgcccaa gacgcagcgg aagacggtaa cggctcggca 64740 tgtttttgca gcattcgcag atcgaactgc accacgttcc aaccgatgat cacatcagga 64800 tcgtagttgg caaaccaggc gttgagtttt tccagcaact gcgggcggct ggcgacgtat 64860 tccagttcga aatcaagcga ggaggcgtcg 
ccattctccg gccccagcat ataaacgatg 64920 cgctgcccgc agccttccag gccgatgcag tacagctcac cgtggcgggt ggtttcaata 64980 tctatagaaa cccacttgag cggcggacga tagtcgggat gcggtttcag acgggcatta 65040 acgatagtgc cattgtgcat atcaccctcg acccacaccg gtgaggtgat aaaccgctcc 65100 atcagatagc gttctggcgg acgcacatcg gcctcgtaga cggtaacgcc accttcacgc 65160 aggcgctttt cgtaattcat caattggcga tgggcgcgac agtaaaggcc atacaccggc 65220 tggcggtgaa aatcctttaa cgccagcggt gtcaggcgaa agccttgttc accctgcaaa 65280 atatgctgag cgcggggaac ctgatcggcg ggaataaacg ccacggactc ttgcggtgca 65340 agcgtaacct gcaacggccc gttgtccgtc gccagccaga aggagacttc tgtcccttgc 65400 ggggtgtccc gccagtgtcg ggttaagata aaacctgcct gcgccacgct gaaaatccat 65460 caaaaaacca ggcttgagta tagcctggtt tcgtttgatt ggctgtggtt ttatacagtc 65520 attactgccc gtaatatgcc ttcgcgccat gcttacgcag atagtgttta tccagcagcg 65580 tttgctgcat atccggtaac tgcggcgcta actgacggca gaatatcccc atataagcga 65640 cctcttccag cacgatggcg ttatgcaccg catcttcggc atttttgccc catgcaaacg 65700 ggccgtggga atggaccaga acgccgggca tttgcgctgc atcgataccc tgtttttcaa 65760 aggtttctac gatgacgtta ccggtttccc actcatattc gccgttgatt tctgcgtcgg 65820 tcattttgcg ggtgcaggga atggtgccgt agaaatagtc ggcgtgggtg gtgccggttg 65880 ctggaatcga ctgacccgcc tgcgcccaga tggtggcgtg gcgcgagtgc gtatgcacaa 65940 tgccgccaat ggaggggaat gcctgataga gcagccggtg agttggcgtg tcggaggagg 66000 gctttttcgt accttcaacc acttcaccgg tttcgatgct aaccacgacc atatcgtcag 66060 cggtcatgac gctgtaatcg acgccggaag gtttgatcac aaagacgccg cgctcgcgat 66120 caacggcgct gacgttgccc catgtgagcg tgaccaggtt gtgttttggc agcgccaggt 66180 tggcttctaa tacctggcgt ttgagatctt ctaacatgtt gactccttcg tgccggatgc 66240 gctttgctta tccggcctac aaaatcgcag cgtgtaggcc tgataagacg cgccagcgtc 66300 gcatcaggcg ttgaatgccg gatgcgcttt gcttatccgg cctacaaaat cgcagcgcgt 66360 aggcctgata agacgcgcca gcgtcgcatc aggcgttgaa tgccggatgc gctttgctta 66420 tccggcctac aaaatcgcag cgtgtaggcc agataagacg cgtcagcgtc gcatcaggcg 66480 ttacataccg gatgcggcta cttagcgacg aaacccgtaa tacacttcgt tccagcgcag 66540 cgcgtcttta 
aacgctggca ggcgtgtgtc gttatcaatc accgtgattt caatgtcgtg 66600 catctcggcg aattggcgca tatcgttgag gttcagtgca tggctgaaga cggtatggtg 66660 cgcgccacca gcgaggatcc acgcttcgga agcagttggc agatccggtt gcgctttcca 66720 cagcgcattc gccaccggca gtttcggcag ggagtgcggt gttttcaccg tgtcgataca 66780 gttaaccagc agacggtaac gatcgccgag atcaatcagg ctggcgacaa tcgctggacc 66840 ggtttgggta ttgaagatca gtcgggcagg atcgtcctta ccaccaatac cgagatgctg 66900 aacgtcgagg atcggtttct cttctacggc aatcgacggg cagacttcca gcatatggga 66960 gccgagcacc aagtcattac ctttctcgaa gtgataggtg tagtcctcca taaaggaggt 67020 gccgccctgc agaccggttg acatcacctt catgatgcga agcagggcgg cggttttcca 67080 gtcgccttcg cccgcaaagc cgtaaccctg ctgcatcaga cgctgtacgg ccagacctgg 67140 aagctgtttc agaccgtgca aatcttcaaa ggtggtggtg aacgcgtgga agccaccttg 67200 ttccaggaaa cgcttcatcc ccagctcaat acgcgccgct tccagcacgt tctgtcgttt 67260 ttcgccgtgg atttgtgttg caggcgtcat ggtgtagcag ctttcgtact catcgaccag 67320 cgcgttaaca tcgccgtcgc tgatggagtt caccacctgc accagatcgc caaccgccca 67380 ggtattgacg gagaaaccga acttgatctg tgcggcaact ttatcaccat cggtgaccgc 67440 cacttcacgc atgttatcgc caaaacggca gactttcaga tgacgggtat cctgtttaga 67500 aaccgcctga cgcatccagg agccgatacg ctcatgggct tgtttatcct gccagtgacc 67560 ggtaacgacg gcatgttgct gacgcatacg cgcgccaatg aagccgaact cgcgaccgcc 67620 atgtgcagtc tggttcaggt tcataaagtc catatcgata ctgtcccacg gcagcgccgc 67680 gttgaactgg gtgtggaatt gcagcaacgg tttgttgagc atggtcaggc cgttgatcca 67740 cattttggcc ggggagaagg tgtgcagcca caccaccaga ccagcgcacg gatcgtcgta 67800 attcgcgtcg cggcaaatag cggtgatttc atccggcgtg gtgcccagcg gtttcaacac 67860 cagtttgcag ggcagtttcg cttccgtatt cagcgcatta acaacgtgct cggcatgttg 67920 ggtgacctga cgcagggttt ccgggccata cagatgctgg ctgccaatga caaaccacac 67980 ttcataatta tcaaaaatcg tcattatcgt gtccttatag agtcgcaacg gcctgggcag 68040 cctgtgccgg ggcggaagtt ggaagatagt gttgttcggc gctcatcgcc cattgctgat 68100 agcggcgata aagctgttca aagcgttgtg cctgttcgct gcgcggttgc agggttttct 68160 ctaccgcact ggccattttt tgctgggctg atgggatgtc tgcgtgcact ttcgcggcga 
68220 cggcagcaaa aatcgccgca ccgagcgcac agcactggtc agaggcaaca atttgcagcg 68280 ggcgattcag cacgtcgcag caggcctgca taatgacttg gtttttccgc gcgatgccgc 68340 ccagcgccat cacgttattg acggcgatcc cctgatcggt aaagcactcc atgattgcgc 68400 gtgcgccaaa ggcggtggca gcaatcaaac cgccgaacag cagcggagcg tcggtagcga 68460 ggttaagatc ggtaatcacc cctttcaggc gttggttagc gtttggcgag cgacgaccgt 68520 taaaccagtc gagcaccacc ggcaggtgat ccagagacgg atttttggcc catgcttcgg 68580 tcagcgccgg aagcagttgt ttctggctgg cgttgatttg cgctttcagt tccggatgct 68640 gggcggcaag ctgttccagc ggccagctga gtacgcgacc gaaccaggcg tagatatcac 68700 caaacgccga ttggcctgct tccagaccga taaatccagg caccacgctg ccatcaacct 68760 gaccgcaaat acctttaact gcccgctcgc caacgctctg tttgtcggca atcagaatgt 68820 cgcaggtgga agtaccgata acttttacca gtgcgttagg ctgtgcgcct gcgccaactg 68880 cgcccatatg gcagtcaaac gcgccgccgg aaatcaccac gctttcaggc aggccgagac 68940 gctgcgccca ttccgggcat aaggtgccca ccggaatatc ggcagtccag gtgtcagtga 69000 acagcgggga aggcaaatgg cgattgagga tcgggtccag ctcatcaaag aaactggctg 69060 gcggcaagcc gccccagctt tcgtgccaca gagatttatg cccggcgctg caacgtccgc 69120 gacgaatatc ctgcgggcgg gtggtaccgg aaagcagagc tggcacccag tcgcacagct 69180 caatccacga tgcggcagat tgcgccacgg cgctgtcctg gcgagtcaca tgcaggattt 69240 ttgcccagaa ccattcgctg gaataaatac cgccaatata gcgggagtag tcaacattgc 69300 ccggcgcgtg gcacaaacgg gtaatctctt cgcttctttc aaccgcagtg tggtctttcc 69360 acaatacgaa catcgcgttc gggttttcgg caaactccgg gcgcagcgcc agcacgttac 69420 cgtcggcatc aatcggtgcg ggcgtcgagc cggtactgtc aacgccaatc ccgaccacag 69480 ctgcgcgctg ttcgacgcta agctctgcaa gcacggtttt cagtgccgct tccattgact 69540 caatgtagtc acgcggatga tgacggaact ggttattcgg ggcatcacaa aattgccctt 69600 tttgccaacg gggataccac tctacgctgg tggcgatctc ttcaccgctg gcgcagtcca 69660 ccgccaaagc tcgcacagaa tcactgccaa aatcgaggcc aattgcaatc gccatcgttt 69720 cactccatcc aaaaaaacgg gtatggagaa acagtagaga gttgcgataa aaagcgtcag 69780 gtaggatccg ctaatcttat ggataaaaat gctatggcat agcaaagtgt gacgccgtgc 69840 aaataatcaa tgtggacttt tctgccgtga ttatagacac 
ttttgttacg cgtttttgtc 69900 atggctttgg tcccgctttg ttacagaatg cttttaataa gcggggttac cggttgggtt 69960 agcgagaaga gccagtaaaa gacgcagtga cggcaatgtc tgatgcaata tggacaattg 70020 gtttcttctc tgaatggtgg gagtatgaaa agtatggctg aagcgcaaaa tgatcccctg 70080 ctgccgggat actcgtttaa cgcccatctg gtggcgggtt taacgccgat tgaggccaac 70140 ggttatctcg atttttttat cgaccgaccg ctgggaatga aaggttatat tctcaatctc 70200 accattcgcg gtcagggggt ggtgaaaaat cagggacgag aatttgtctg ccgaccgggt 70260 gatattttgc tgttcccgcc aggagagatt catcactacg gtcgtcatcc ggaggctcgc 70320 gaatggtatc accagtgggt ttactttcgt ccgcgcgcct actggcatga atggcttaac 70380 tggccgtcaa tatttgccaa tacgggtttc tttcgcccgg atgaagcgca ccagccgcat 70440 ttcagcgacc tgtttgggca aatcattaac gccgggcaag gggaagggcg ctattcggag 70500 ctgctggcga taaatctgct tgagcaattg ttactgcggc gcatggaagc gattaacgag 70560 tcgctccatc cgccgatgga taatcgggta cgcgaggctt gtcagtacat cagcgatcac 70620 ctggcagaca gcaattttga tatcgccagc gtcgcacagc atgtttgctt gtcgccgtcg 70680 cgtctgtcac atcttttccg ccagcagtta gggattagcg tcttaagctg gcgcgaggac 70740 caacgtatca gccaggcgaa gctgcttttg agcactaccc ggatgcctat cgccaccgtc 70800 ggtcgcaatg ttggttttga cgatcaactc tatttctcgc gagtatttaa aaaatgcacc 70860 ggggccagcc cgagcgagtt tcgtgccggt tgtgaagaaa aagtgaatga tgtagccgtc 70920 aagttgtcat aattggtaac gaatcagaca attgacggct tgacggagta gcatagggtt 70980 tgcagaatcc ctgcttcgtc catttgacag gccacattat gcaagcattg cggaacactt 71040 tattacccaa ccaccgtgtt cattgatggc ggtggtgttg gtggcctttc tggagtcgct 71100 ggcgctggtc ggtttgattc tacccggtac ggtgctgatg gcggggctgg gagcgctgat 71160 tggcagcggc gagttaagtt tctggcacgc ctggctggca gggattattg gctgcttgat 71220 gggcgactgg atttctttct ggctgggttg gcgttttaaa aagccgttgc atcgctggtc 71280 atttctgaag aaaaacaaag cactacttga taaaactgaa catgcgttgc atcaacacag 71340 catgttcacc attctggtcg gtcgttttgt tggcccgacg cgtccgctgg tgccaatggt 71400 ggcgggaatg ctggatctgc cggtggctaa atttattacg ccgaatatta tcggctgcct 71460 gctgtggccg ccgttttact tcctgccagg gattctggcg ggcgcggcga tcgatattcc 71520 tgccggaatg cagagcggtg 
agtttaaatg gttgctgctg gcaacagcgg tgtttttgtg 71580 ggttggtggc tggctgtgct ggcggttatg gcgcagcggt aaagcgactg accgtttgag 71640 tcattatttg tcccgcggtc gtttgttgtg gctgacgccg ttgatttctg ccatcggcgt 71700 ggtggcgctg gtggtgttaa ttcgccaccc gttgatgccg gtgtatatcg atattttgcg 71760 taaagtggtt ggggtttagg agatagtctt gtgcgggttg cctgagcgcg acgcttgccg 71820 cgtcttatca ggcctacaaa acgcactacc cgtaggtcgg ataaggcgtt cacgccgcat 71880 ccgacagtgc atactaaccc gtaatcccca atagtgccga agcactcgcc ttaccgctca 71940 acaactcatt ggtcataccc tgccaggcga tgcgcccgtc ggcgactact accgagcgcg 72000 tggcgatccg cgccgcatct tccacgctgt gcgacaccat caatagcgtc attttttgct 72060 gctggcagct cgtgctcacc agcgtcaaca tctcctgacg taacgccgga tcgagcgcag 72120 agaacagttc atcgagcaat aaaatcggct gttcgcgtac cagacaacgc gctaacgcca 72180 ctcgctgtcg ctgaccgccg gaaagctcgc ccggtaaccg cgccattaaa ttatcaatcc 72240 ccatctggcg ggcgatagcg tgcattttcc cctgctgtac cgcgttcagt ttcaatcccg 72300 gatttagccc cagcccgatg ttctgtgcga ccgtcaggtg gctgaacagg ttgttctcct 72360 gaaacagcat cgacaccgga cggcgtgacg gcggcatagt tgtgtgatct acgccatcga 72420 tagtcagcga accgctggct ggcgtcagaa aaccggcgat caaattcagc agggtacttt 72480 tacccgcgcc gcttggcccg aggatcgcca cctgctcgcc gcgttccacc gttaagctaa 72540 aacgcatcgg caaatggtgg taaagccagg tgatatcagt cagttttaac atttcgcccc 72600 ggtagttttt caatcacggt aaacagcaga aaacagagca gcagcagaat taacgtggtg 72660 accgcaccgt cctggctgcg ataggagcca atttgctggt agagataaaa cggcagggtg 72720 cggaaatcat cgttaccgaa caacgccacc acgccaaaat caccaatcga cagcacgcat 72780 gcaaaggcca gcgcctgcgc cagtggacgt ttcagggcgc gcagctccac cacttttaag 72840 cgtgaccagc cttcaatccc cagcgactga cataacatgc tgtagcgggc ggtgatatcg 72900 cgcatcgggt tttccagcac tttcagcgca taagggatcg ccattaacgc attggtgaaa 72960 atcacaatgc cgtcagcaga ttgtggcagg ccgatagtgt tgttgagcag taaaaagaag 73020 ccggtagcca gcacaatccc cggcatggcg aggatcaaca tgccgctcat ctccagcacc 73080 tgacccgcca gcattttctg ccgcgcccgc agttcgcgac tgctccatag cagcatcatg 73140 gtcagcacta cgcacaatac acctgccgcc agcgcaatac gcaacgaggt ccacagcgcc 73200 
tgccacagca ccggttgtgc cagcacttcc ggcaactggc gatttacccc atcgacgatc 73260 accgccagta acggtggcag caacagcagc agcgccagca caattaacac cgtgtcgcaa 73320 atgcggctat gcagacgatc gtccgggtcg cgccagcctt gcagcagcgt ggtgccgggc 73380 gcaatggcct tactcaatcg ctgactcaac agcaccagcc cgaggcagca caccatctgg 73440 agcagcgcca gcattgccgc gcgggcagga tcgtagtcgt aactcagcgc ctgatagatt 73500 gccagctcga tagtggtcgc ctgcggaccg ccccccagcg atagcacggt ggcgaagctg 73560 gcgaaacaga gcataaagat aagcgcagca accggcggga tttgtcgccg taaccacggc 73620 cattcgacga agcggaaaaa atgccagcta cgcatcccaa gctgggcggc aagttgacgc 73680 tgttcgccgg ggatgttttc cagtgcctgg agtaataagc ggctcgccat cggcagatta 73740 aaaaacacat gggccagcaa aataccttgc aggccgtagg gcgaaaaggt ccactccaga 73800 ccgagcgatt ggcagagtgt tgccagccag ccctggcgac catagacgct aagaatgccg 73860 aaaacagcga ccaacaccgg gaggatcaag gtcattgcac acagacgcaa cagcgccagc 73920 cgacccggaa agcgcctgcg atagagcgcg cgggcgagga atatcgcggg tatgacagag 73980 agcagtgccg agagaaacgc ctgccagaag gagaagcgca ccacatgcca cagatagctg 74040 tcctgccaga ctgccaccca gtcatcctgc ggcgcgttcc accacagggc gagaaacgcc 74100 gccagcgcaa ccgctaccac cagcgtggtg gcgcttacac ctggaattaa ccagccggga 74160 attaacggct gacggcgcgt tgccattcgc taatccatgc ctgacgttgt gccgccactt 74220 cggctggcgt gaactccaac gtggttgcgg gtttggtcaa tttttcaaaa ccggcaggca 74280 gcgtgacgtt tgccaccgga tacatccagt tgccggttgg gatcgcattc tggaaagccg 74340 gagaaaccat aaactggagg aatttttgcg ccagctccgg ctgcttgctg gcagcggtgc 74400 gggcggcgac ttccacttgc agatagtgac cttcgctgaa gttcgcggcg gcgtagttat 74460 ctttcttctc ttcgagaatg tgataagccg gagaggtggt gtaactcagt accagatcgc 74520 tttcaccttt taaaaacagg ccgtaggctt cgctccagcc tttggtgacc gtgaccgttt 74580 tcttcgccag tttctgccag gcttgtgggg cgtcatcgcc atagactttt tgcatccata 74640 gcaacagacc cagccccggt gtactggtgc gcggatcctg ataaatcacc cgccagtttt 74700 gatcgctctc aaccagttct ttcaggcttt gtggcgggtt tttcagtttg ttcttgtcat 74760 aaacgaaggc gaagtagcca taatcaaacg gtacgaaagt gtcattattc cagccgccgg 74820 gaacgttaac ggcatccgct gccacaccgc ttttggcaaa cagtccggtt 
ttactggcgg 74880 cgtctaacag gttgttatcc agccccagca ccacatcggc tttactgttt ttgccttcca 74940 tccgtagacg gttgagaagc gaaacgccat cttccagcgc caccagtttc agttcgcaat 75000 tacagtcggc ttcaaaggct tttttaacca ccggaccagg cccccagtcg gcggcgaagg 75060 aatcgtaggt ataaacagtc agaacgggtt tagcgaaaac gggcgctgtc acagcaacag 75120 caggggcaga cattttttta acactttgca cctcaaaaaa gagtggcaaa ggacttgaga 75180 aggagcctca aatcccttcg ccggcgttat ccggatcagg ttcgacgggt attttctcag 75240 cgcacgcgta cgcgtggcac cccgttgaga acggcgttag tgtagtgatt ttgttatcaa 75300 ccagcaatca tggatccggt ggcgcaaacc acgctgattt aaaatcgaac cagccgaggg 75360 tattcatgcg caggccgcgc atactgcgtt gcccctgaat gatcagccag tggtgcaata 75420 atggcaccat cgctttgctg gcgaccagtt gctggcacca gttcgccaga ttcatctcgc 75480 cattgcgcca gcgagcagcg tcggcttgcc agtcaatggg aatgcaatgt tgtagcagtg 75540 gcacttcgca taaatgtgcg aaaacagaga agtccagcgg cagggtaaag ttggcgctgt 75600 ttagccagat atcactttcg atctctcctg tatgccactg atcgtagtcg atctctttga 75660 ttttcagcgt cacctggtga cttgccagaa tctgctgcat gatcccggca atcacccgat 75720 gctcactgtg atcctgacaa aaggttaggg tgaggctttc caggccagcc ggtttttcgc 75780 tctttatggt gcgggcatgg tgccaacggg ggaacagtcc ataagccggg aaccacagtt 75840 gctggtactg ttcctcagcg aaatagacca agttagttgg agaaagcaca tagcttgccc 75900 agtccctgac ttgctgattc gccccgcgat gggtgcggct gtcgaacagt aaatagtagc 75960 aaccttcctc caggcggctt tcaatctctt tttcctcgcc ctgtggacct tttagcatca 76020 gccctccggc tggctcgtcg gcaatttccg gcagaaccca gacgttaact tcgtcgatta 76080 atgcccggta accgaagaag tcatcgaatg cctgaatttt cagttgattg gtgctgttgc 76140 gaatcaccgc atacggaccg gtgccgatgg gatggctggc aaagttactg agggtttccc 76200 attcgcgcgg caggatcatc gccggaactt gccccagcag taacggtaac cagcggtccg 76260 gttgcgtgag atggatatcc agcgtccagg gcgtcggcga cacaatgtca gcaatatgcg 76320 aatagagcgg cagcgtattg attcgtttta aagaggcgat cacatcgtcc atttccagtt 76380 cacgaccatg gtgaaaatgg actcctggac gcaaaaagaa acgccagtga agcggtgaaa 76440 tttgctgcca gtggtgggcg atgtctgctt ccagttcccc attttcctca tttatgcgcg 76500 ttagcgaact gaagatttgc cgggcgatat 
gggtttcgga acggcgcaat gcgctgccag 76560 gtagcagatt acgcaacgga cgatagtaga gcacgcgcag gatgtgccgc ccctggcgga 76620 agctgcggcc cagatgagaa accagcattt gccgcacagt cgctttgtcg ccaaccaact 76680 gcaccagttg atcgatacga tcctgctcca gcaggtcttc cgcccgctgt tgctgaagcg 76740 ccagcccggt atagaggaat gtcagacgcg agcgtttacc gcgcccgact tccgcttccc 76800 acgtcagcca gccgcgatcc tgcatggtgt tgagcagggt gcgcatatga cgacgcgagc 76860 agctcaataa cgctgccagt tcgttgagcg ttgtgtcctg cgatttaccc tcgcagcatt 76920 gccacaggcg gatgaactgt tgttgcagac gagcagatgg cataaaaggg gaactcctgt 76980 gcaaaagaca gcaattttat tgtccctata ttaagtcaat aattcctaac gatgaagcaa 77040 gggggtgccc catgcgtcag ttttatcagc actattttac cgcgacagcg aagttgtgct 77100 ggttgcgttg gttaagcgtc ccacaacgat taaccatgct tgaaggactg atgcagtggg 77160 atgaccgcaa ttctgaaagt tgacttgcct gcatcatgtg tgactgagta ttggtgtaaa 77220 atcacccgcc agcagattat acctgctggt tttttttatt ctcgccgcgc taaaaaggga 77280 acgtatgatc tggataatga cgatggctcg ccgtatgaac ggtgtttacg cggcatttat 77340 gctggtcgct tttatgatgg gggtggccgg ggcgctacag gctcctacat tgagcttatt 77400 tctgagtcgt gaggttggcg cgcaaccttt ctggatcggc ctcttttatg cggtgaatgc 77460 tattgctggg atcggcgtaa gcctctggtt ggcaaaacgt tctgacagtc agggcgatcg 77520 gcgaaaactg attatatttt gctgtttgat ggctatcggc aatgcgctat tgtttgcatt 77580 taatcgtcat tatctgacgc ttatcacctg tggtgtgctt ctggcatctc tggccaatac 77640 ggcaatgcca cagttatttg ctctggcgcg ggaatatgcg gataactcgg cgcgagaagt 77700 ggtgatgttt agctcggtga tgcgtgcgca gctttctctg gcatgggtta tcggtccacc 77760 gttggccttt atgctggcgt tgaattacgg ctttacggtg atgttttcga ttgccgccgg 77820 gatattcaca ctcagtctgg tattgattgc atttatgctt ccgtctgtgg cgcgggtaga 77880 actgccgtcg gaaaatgctt tatcaatgca aggtggctgg caggatagta acgtacggat 77940 gttatttgtc gcctcgacgt taatgtggac ctgcaacacc atgtacatta ttgatatgcc 78000 gttgtggatc agtagcgagt taggattgcc agacaaactg gcgggtttcc tgatggggac 78060 ggcagctgga ctggaaatac cagcaatgat tctggctggc tactatgtca aacgttatgg 78120 taagcggcga atgatggtca tagcagtggc ggcaggagta ctgttttaca ccggattgat 78180 tttctttaat 
agccgtatgg cgttgatgac gctgcaactt tttaacgctg tatttatcgg 78240 cattgttgcg ggtattggga tgctatggtt tcaggattta atgcctggaa gagcgggggc 78300 agctaccacc ttatttacta acagtatttc taccggggta attctggctg gcgttattca 78360 gggagcaatt gcacaaagtt gggggcactt tgctgtctac tgggtaattg cggttatttc 78420 tgttgtcgca ttatttttaa ccgcaaaggt taaagacgtt tgatgacgtg gacgatagcg 78480 gaaagcccgg tcatttgacc gggcaagggg attaattcat aaacgcaggt tgttttgctt 78540 cataagcggc aatggcgtcg tcgtgctgca aggtaagccc aatactgtcc agaccgttca 78600 tcatgcagtg gcggcggaag gcatcgatgg taaagcgata ggttttctct cccgctttca 78660 cctcttgcgc ttccagatcc acgtcgaaat ggatccccgg attagctttc accagcgcaa 78720 acagttcgtc cacttctgca tcgcttaatt tcaccggcag cagctggttg ttaaagctat 78780 tgccgtagaa gatgtcagca aaactcggcg caatcaccac tttaaaaccg tagtcggtca 78840 atgcccaggg cgcgtgctca cgcgaagagc cacagccgaa gttttctcgt gccagcaaaa 78900 tggaagcgcc ctgatactgc gggaagttca gcacgaagtc cgggtttggc tgttggcctt 78960 tttcatccag aaaacgccag tcgttaaaca gatgcgcgcc aaaacccgta cgggtcactt 79020 tctgcaaaaa ctgtttcggg atgattgcat cggtatcgac attggcggca tccagcggaa 79080 ccaccaggcc tgtgtgtttg ataaatttct ctgccatggt gtgctcctta tttaatgttg 79140 cgaatgtcgg cgaaatgtcc ggtcacagca gcagcggcag ccattgccgg gctgaccaga 79200 tgcgtgcgcc cgccgcgccc ctggcggcct tcaaagttac ggttgctggt ggaggcacaa 79260 cgttcgcccg gattcagacg gtcgttgttc atcgccagac acattgagca gccaggcaag 79320 cgccattcaa aaccggcttc aataaagatt ttatccagac cttccgcttc cgcctgggct 79380 tttaccgggc cagagccggg aaccaccagt gcctgcacgc ctggcgcgac ttttcgccct 79440 ttgacgatct ccgctgccgc gcgtaaatct tcaatgcgcg agttggtaca ggaaccgata 79500 aacactttgt cgatagccac ttcggtcagc ggaatacccg gtttcagccc catataggcc 79560 agcgcttttt ctgccgacgc gcgttcaacc ggatcggcaa acgaagccgg atcgggaata 79620 ttgtcgttca cggaaatcac ctggccggga ttggtgcccc aggtgacctg cggtgaaatt 79680 tcttctgctt gcagagtgac aacggtatcg aaagttgcgc cttcgtcggt ttgcagggtt 79740 ttccagtagg caacggcgtc gtcgaaatct ttgcctttcg gcgcatgcag acggcctttg 79800 acatagttaa aggtggtttc gtccggtgca accagaccgg cttttgcgcc catttcgatt 
79860 gccatattgc acagggtcat acgaccttcc atgcttaaat cacggattgc ttcgccgcaa 79920 aactccacca catgcccggt gccgcctgcg ctaccggttt taccgataat tgccagcacg 79980 atatcttttg cggtaatgcc cggcgcggct ttgccctgga cttcaatttt catggttttt 80040 gcgcggccct gtttcagggt ttgcgttgcc agtacgtgtt caacttcgga agtgccgata 80100 ccaaaggcca gtgcgccaaa cgcgccgtgg gtggcggtat gcgagtcgcc gcagacaatg 80160 gtcatccccg gcaaggtgac gccctgttcc ggccccatta cgtggacgat cccctgatac 80220 gggtgattca ggtcatacag ttcgacgcca aattctttgc agtttttgat cagttcctgc 80280 atctggatac gcgccatttc accgcaggca ttaatgcctt tggtctgggt agagacgttg 80340 tgatccatgg tagcgaaggt tttgcccggc tgacgtaccg ggcgaccgtg ggcgcgcaga 80400 ccatcgaacg cctgcggtga ggtcacttca tgcaccaggt ggcggtcgat atataacagt 80460 ggggtttcgt tttcggcttc gtacacaacg tgagcgtcga acaatttttc gtataacgtc 80520 ttagccatga ttacacccct tctgctacat agcgggcaat gatatcgccc atttcatcgg 80580 tactaacggc ggcagcgcca cgggctaaat ccccggtgcg aatgccttct tctaatgcgc 80640 ggttaatggc gcgttcaatg gcgcaagccg catcatcggc atccaggctg taacgcagca 80700 gcagtgccag cgaaaggatt tgtgcaatcg ggttggcgat gtttttgcct gcgatatctg 80760 gtgccgagcc gcccgccggt tcatacagtc caaaaccttg ctcgttcagg ctggcggaag 80820 gcaacatccc catcgagcca gtgatcattg cgcactcgtc agacagaatg tcgccaaaca 80880 ggttggagca cagcagaacg tcaaactgtg atggatcttt aatcagctgc atggtggcgt 80940 tgtcgatgta catatgcgcc agttcgacat ccgggtattc cgtggcgatc tcgttaacga 81000 tctcccgcca taaaatagag gattgcagca cgttggcttt atcgatcgac gtcactttgt 81060 ggcgacgctt gcgagcagat tcaaacgcga tgcgggcgat acgttcgatc tcaaaacggt 81120 gatacacctc ggtatcaaag gctttttcat attgtccgct accttcgcgg ccttttggct 81180 gaccgaaata gatgccgccg gtcagttcgc gcacacacag gatgtcgaag ccgtttgcgg 81240 caatgtctgc acgcagcgga cagaatgctt ccagcccctg atacagtttt gccgggcgca 81300 ggttgctgaa taatttgaag tgcttacgca gaggcagcag cgcgccgcgt tctggttgct 81360 ggtctggtgg taaatgttcc cacttcgggc cgcctaccga gccaaacagc acggcatcgg 81420 cttgctcaca accttcaacc gtcgcaggcg gcagtggttg cccgtggtta tcaatggctg 81480 cgccgcctac atcgtaatgg ctggtggtga tgcgcatcgc 
aaagcggttg cgcacggcat 81540 ccagcacttt cagcgcctgg gtcatcactt ccggaccaat accgtccccc ggcaatacgg 81600 caatatggta attcttcgac atcacacggt ttccttgttg ttttcgttgt gttgagcttt 81660 gcgttgcaac tctttttcga cttctgcggc acgccagata ttgttcagaa cgtgcaccat 81720 ggctttggca gatgactcga caatatcggt agccaggccg acgccgtgga agcggcgacc 81780 gttgtagtta gcgacgatat ccacctgacc cagcgcatct ttaccgtggc ctttggcggt 81840 caggctgtat ttcaccagtt cgacgttata ttcagtgatg cggttaattg cctgatagac 81900 ggcatcgacc ggaccgttac cgttggcggc ttctgctttg acttcttcgc cacaggccag 81960 tttgacggcg gcggtggcga tatcgttaga gccagactgc acgctgaagt aatccagacg 82020 gaaatgctcc ggctcttctt gctgcttacc gatgaaggcc agcgcctcca gatcgtaatc 82080 aaacacctga ccttttttgt ccgccagctt caggaaagca tcgtacaaat tgtctaaatt 82140 atattcactt tctttatacc ccatctcatc catgcgatgt ttcaccgccg cacgccccga 82200 acgagaggtc agattcagct ggatttggtt cagaccaata gattctggtg tcatgatttc 82260 gtagttttcg cggtttttca gcacgccatc ctggtgtata ccggaggagt gtgcgaatgc 82320 gccgctgcca acaatggctt tgtttgccgg gatcggcata ttacaaatct ggctaactaa 82380 ctggctggtg cgccatatct cctggtgatt aatggcggtg tggacgttga gaatatcctt 82440 acgaactttg atcgccatga tgacttcttc cagggaacag tttccggcac gctcgccgat 82500 cccgttcatt gcgccttcca cctggcgtgc accggcatgt accgccgcca gtgagtttcc 82560 gaccgccagg cccaaatcgt cgtgggtatg tacggagata atggctttgc cgatgctagg 82620 cacgcgttca tacaggccgc tgatgattcc ggcgaactca aacggcatgg tgtagcccac 82680 ggtgtccgga atgttgatgg tggtggcacc ggcattaatc gccgcttcga ccactcgcgc 82740 cagatcggca atgggtgtac gcccggcatc ttcgcaagaa aattcaacat catcggtgta 82800 attacgggcg cgtttcacca tatagatagc gcgttcgatc acctcgtcca gcgtgctgcg 82860 cagcttggtg gcgatgtgca ttggcgaagt ggcaataaag gtatgaatac ggaaggcttc 82920 ggcgactttc agggattcgg ccgccacgtc gatatctttt tccacgcagc gagctaacgc 82980 acatacgcgg ctgtttttaa cctggcgggc gatggtttgc accgattcaa aatcgcccgg 83040 cgaagagacg gggaaaccga cttccatcac gtcaacaccc atacgctcaa gggccagcgc 83100 aatttgcagt ttttctttca cactcaagct tgcctgtaac gcctgttcac cgtcgcgcaa 83160 tgtggtatcg aaaataatga 
cttgctggct catggtttgg gtccttgtct cttttagagc 83220 gcctcgcttc gggcataaaa aaacccgcgc aatggcgcgg gttttttgtt tgactgcgtg 83280 ctggcttaat gctggatgcc gctcactcgt ctaccgcgca aagaagatgc gtttagtagt 83340 agtagaccga taaagcgaac gatgtgagtc attaaatcag ctccagatga atgcgatatg 83400 cttttagagt tactggatac aaaaacggat gtcaaccctg acgcaataaa aacgtcccgc 83460 cagcgtgagt tctgcatccg taaaattagc taattgtgct gcggtggtta aagtaagcga 83520 tattaatttc tgcttaacta ccgacgcttt tcatcggttg acatatttca gcataaattt 83580 ttgcatctaa tcaacgagga aaaaggggac aaaatgcacg cgttgcaaaa cctatcctga 83640 tgatttgtat tgaattatat gttttgcgat tttttttgat attgatttgg tgaatattat 83700 tgatcaatta atgttaagaa ttaatgcatt aaatatataa attaattatt aaataagcac 83760 atttaatcca ttttgtagat gattgagtat tcgcggtagt tatgattaga ttgttttcgc 83820 aacaaaaaca ttatggatta ttatgctgtg gtaaatgact cattccacgg caatggattc 83880 tgtttttatc agaacccgta tctttatgtt ttccgaattt tactcatttt gctttttctt 83940 attttatatg catgataaat catattcttc aggattattt ctctgcattc caataaggga 84000 aagggagtta agtgtgacag tggagttaag tatgccagag gtacaaacag atcatccaga 84060 gacggcggag ttaagcaaac cacagctacg catggtcgat ctcaacttat taaccgtttt 84120 cgatgccgtg atgcaggagc aaaacattac tcgtgccgct catgttctgg gaatgtcgca 84180 acctgcggtc agtaacgctg ttgcacgcct gaaggtgatg tttaatgacg agctttttgt 84240 tcgttatggc cgtggtattc aaccgactgc tcgcgcattt caactttttg gttcagttcg 84300 tcaggcattg caactagtac aaaatgaatt gcctggttca ggttttgaac ccgcgagcag 84360 tgaacgtgta tttcatcttt gtgtttgcag cccgttagac agcattctga cctcgcagat 84420 ttataatcac attgagcaga ttgcgccaaa tatacatgtt atgttcaagt cttcattaaa 84480 tcagaacact gaacatcagc tgcgttatca ggaaacggag tttgtgatta gttatgaaga 84540 cttccatcgt cctgaattta ccagcgtacc attatttaaa gatgaaatgg tgctggtagc 84600 cagcaaaaat catccaacaa ttaagggccc gttactgaaa catgatgttt ataacgaaca 84660 acatgcggcg gtttcgctcg atcgtttcgc gtcatttagt caaccttggt atgacacggt 84720 agataagcaa gccagtatcg cgtatcaggg catggcaatg atgagcgtac ttagcgtggt 84780 gtcgcaaacg catttggtcg ctattgcgcc gcgttggctg gctgaagagt tcgctgaatc 84840 
cttagaatta caggtattac cgctgccgtt aaaacaaaac agcagaacct gttatctctc 84900 ctagcatgaa gctgccgggc gcgataaagg ccatcagtgg atggaagagc aattagtctc 84960 aatttgcaaa cgctaactga ttgcagaata ggtcagacat gaatgtctgg tttattctgc 85020 attttttatt gaatgtagaa ttttattctg aatgtgtggg ctctctattt taggattaat 85080 taaaaaaata gagaaattgc tgtaagttgt gggattcagc cgatttatta tcaatttaat 85140 cctctgtaat ggaggatttt atcgtttctt ttcacctttc ctcctgttta ttcttattac 85200 cccgtgttta tgtctctggc tgccaattgc ttaagcaaga tcggacggtt aatgtgtttt 85260 acacattttt tccgtcaaac agtgaggcag gccatggaga tgttgtctgg aggcgagatg 85320 gtcgtccgat cgcttatcga tcagggcgtt aaacaagtat tcggttatcc cggaggcgca 85380 gtccttgata tttatgatgc attgcatacc gtgggtggta ttgatcatgt attagttcgt 85440 catgagcagg cggcggtgca tatggccgat ggcttggcgc gcgcgaccgg ggaagtcggc 85500 gtcgtgctgg taacgtcggg tccaggggcg accaatgcga ttactggcat cgccaccgct 85560 tatatggatt ccattccatt agttgtcctt tccgggcagg tagcgacctc gttgataggt 85620 tacgatgcct ttcaggagtg cgacatggtg gggatttcgc gacccgtggt taaacacagt 85680 tttctggtta agcaaacgga agacattccg caggtgctga aaaaggcttt ctggctggcg 85740 gcaagcggtc gcccaggacc agtagtcgtt gatttaccga aagatattct taatccggcg 85800 aacaaattac cctatgtctg gccggagtcg gtcagtatgc gttcttacaa tcccactact 85860 accggacata aagggcaaat taagcgtgct ctgcaaagcg tggtagcggt aaaaaaaccg 85920 gttgtctacg taggcggtgg ggcaatcacg gcgggctgcc atcagcagtt gaaagaaacg 85980 gtggaggcgt tgaatctgcc cgttgtttgc tcattgatgg ggctgggggc gtttccggca 86040 acgcatcgtc aggtactggg tatgctggga atgcacggta cctacgaagc caatatgacg 86100 atgcataacg cggatgtgat tttcgccgtc ggggtacgat ttgatgaccg aacgacgaac 86160 aatctggcaa agtactgccc aaatgccact gttctgcata tcgatattga tcctacttcc 86220 atttctaaaa ccgtgactgc ggatatcccg attgtggggg atgctcgcca agtcctcgaa 86280 caaatgcttg aactcttgtc gcaagaatcc gcccatcaac cactggatga gatccgcgac 86340 tggtggcagc aaattgaaca gtggcgcgct cgtcagtgcc tgaaatatga cactcacagt 86400 gaaaagatta aaccgcaggc ggtgatcgag actctttggc ggttgacgaa gggagacgct 86460 tacgtgacgt ccgatgtcgg gcagcaccag atgtttgctg cactttatta 
tccattcgac 86520 aaaccgcgtc gctggatcaa ttccggtggc ctcggcagca tgggttttgg tttacctgcg 86580 gcactgggcg tcaaaatggc gttcccagaa gaaaccgtgg tttgcgtcac tggcgacggc 86640 agtattcaga tgaacatcca ggaactgtct accgcgttgc aatacgagtt gcccgtactg 86700 gtggtgaatc tcaataaccg ctatctgggg atggtgaagc agtggcagga catgatctat 86760 tccggccgtc attcacaatc ttatatgcaa tcgctacccg atttcgtccg tcgcggagcc 86820 tatgggcatg tcgggatcca gatttctcat ccgcatggct ggaaagcaaa cttagcgagg 86880 cgctggaaca ggtgcgcaat aatcgcctgg tgtttgttga tgttaccgtc gatggcagcg 86940 agcacgtcta cccgatgcag attcgcgggg gcggaatgga tgaaatgtgg ttaagcaaaa 87000 cggagagaac ctgattatgc gccggatatt atcagtctta ctcgaaaatg aatcaggcgc 87060 gttatcccgc gtgattggcc ttttttccca gcgtggctac aacattgaaa gcctgaccgt 87120 tgcgccaacc gacgatccga cattatcgcg tatgaccatc cagaccgtgg gcgatgaaaa 87180 agtacttgag cagatcgaaa agcaattaca caaactggtc gatgtcttgc gcgtgagtga 87240 gttggggcag ggcgcgcatg ttgagcggga aatcatgctg gtgaaaattc aggccagcgg 87300 ttacgggcgt gacgaagtga aacgtaatac ggaaatattc cgtgggcaaa ttatcgatgt 87360 cacaccctcg ctttataccg ttcaattagc aggcaccagc ggtaagctta gtgcattttt 87420 agcatcgatt cgcgatgtgg cgaaaattgt ggaggttgct cgctctggtg tggtcggact 87480 ttcgcgcggc gataaaataa tgcgttgaga atgatctcaa tgcgcaattt acagcccaac 87540 atgtcacgtt gggctttttt tgcgaaatca gtgggaacct ggaataaaag cagttgccgc 87600 agttaatttt ctgcgcttag atgttaatga atttaaccca taccagtaca atggctatgg 87660 tttttacatt ttacgcaagg ggcaattgtg aaactggatg aaatcgctcg gctggcggga 87720 gtgtcgcgga ccactgcaag ctatgttatt aacggcaaag cgaagcaata ccgtgtgagc 87780 gacaaaaccg ttgaaaaagt catggctgtg gtgcgtgagc acaattacca cccgaacgcc 87840 gtggcagctg ggcttcgtgc tggacgcaca cgttctattg gtcttgtgat ccccgatctg 87900 gagaacacca gctatacccg catcgctaac tatcttgaac gccaggcgcg gcaacggggt 87960 tatcaactgc tgattgcctg ctcagaagat cagccagaca acgaaatgcg gtgcattgag 88020 caccttttac agcgtcaggt tgatgccatt attgtttcga cgtcgttgcc tcctgagcat 88080 cctttttatc aacgctgggc taacgacccg ttcccgattg tcgcgctgga ccgcgccctc 88140 gatcgtgaac acttcaccag cgtggttggt 
gccgatcagg atgatgccga aatgctggcg 88200 gaagagttac gtaagtttcc cgccgagacg gtgctttatc ttggtgcgct accggagctt 88260 tctgtcagct tcctgcgtga acaaggtttc cgtactgcct ggaaagatga tccgcgcgaa 88320 gtgcatttcc tgtatgccaa cagctatgag cgggaggcgg ctgcccagtt attcgaaaaa 88380 tggctggaaa cgcatccgat gccgcaggcg ctgttcacaa cgtcgtttgc gttgttgcaa 88440 ggagtgatgg atgtcacgct gcgtcgcgac ggcaaactgc cttctgacct ggcaattgcc 88500 acctttggcg ataacgaact gctcgacttc ttacagtgtc cggtgctggc agtggctcaa 88560 cgtcaccgcg atgtcgcaga gcgtgtgctg gagattgtcc tggcaagcct ggacgaaccg 88620 cgtaagccaa aacctggttt aacgcgcatt aaacgtaatc tctatcgccg cggcgtgctc 88680 agccgtagct aagccgcgaa caaaaatacg cgccaggtga atttccctct ggcgcgtaga 88740 gtacgggact ggacatcaat atgcttaaag taaataagac tattcctgac tattattgat 88800 aaatgctttt aaacccgccc gttaattaac tcaccagctg aaattcacaa taattaagtg 88860 atatcgacag cgcgtttttg cattattttg ttacatgcgg cgatgaattg ccgatttaac 88920 aaacactttt ctttgctttt gcgcaaaccc gctggcatca agcgccacac agacgtaaca 88980 aggactgtta accggggaag atatgtccta aaatgccgct cgcgtcgcaa actgacactt 89040 tatatttgct gtggaaaata gtgagtcatt ttaaaacggt gatgacgatg agggattttt 89100 tcttacagct attcataacg ttaatttgct tcgcacgttg gacgtaaaat aaacaacgct 89160 gatattagcc gtaaacatcg ggttttttac ctcggtatgc cttgtgactg gcttgacaag 89220 cttttcctca gctccgtaaa ctcctttcag tgggaaattg tggggcaaag tgggaataag 89280 gggtgaggct ggcatgttcc ggggagcaac gttagtcaat ctcgacagca aagggcgctt 89340 atcagtgcct acccgttatc gggaacagct gcttgagaac gctgccggtc aaatggtttg 89400 caccattgac atttatcacc cgtgcctgct gctttacccc ctgcctgaat gggaaattat 89460 cgagcaaaaa ttatcgcgtc tgtcgagcat gaacccggtt gagcgccgtg tgcagcgcct 89520 actgttaggt catgccagcg aatgtcagat ggatggcgca ggtcgattgt taatcgcgcc 89580 agtactgcgg caacatgccg ggctgacaaa agaagtgatg ctggttggac agttcaacaa 89640 gtttgagctg tgggatgaaa caacctggca tcaacaggtc aaggaagata tcgacgcaga 89700 gcagttggct accggagact tatcggagcg actgcaggac ttgtctctat aaaatgatgg 89760 aaaactataa acatactacg gtgctgctgg atgaagccgt taatggcctc aatatccgtc 89820 ctgatggcat 
ctacattgat gggacttttg gtcgcggtgg tcactcacgt ctgatcctct 89880 cgcagcttgg cgaagagggg cgtttgctgg cgatcgatcg cgacccgcag gctatcgccg 89940 ttgcgaagac tattgatgat ccgcgcttct ccatcatcca cggacctttc tccgcgctgg 90000 gcgaatacgt tgccgagcgc gatcttatcg gcaagatcga cggcattctc ctcgatcttg 90060 gcgtctcttc accgcaactt gatgatgctg aacgtggctt ttcctttatg cgcgatggtc 90120 cgctggacat gcgtatggac ccaacccgtg ggcagtcagc cgctgaatgg ctacaaaccg 90180 cagaagaagc cgatatcgcc tgggtattga aaacctatgg tgaagagcgt tttgccaaac 90240 gcattgcccg cgccattgtc gagcgtaacc gcgaacagcc gatgacccgc accaaagaac 90300 tggcggaagt cgtggctgct gcaacgccgg tgaaagataa gtttaaacat cccgcgaccc 90360 gtaccttcca ggcggtgcgc atttgggtaa acagtgaact ggaggagata gagcaggcgc 90420 taaaaagctc gctcaacgtg ctggccccgg gtgggcggct ttcgatcatc agcttccact 90480 cgctggaaga ccgtattgtg aaacgtttta tgcgtgaaaa cagccgcggt ccgcaagttc 90540 cggcagggtt accgatgact gaagagcagc tcaaaaaact gggtggccgt cagctgcgag 90600 cactaggcaa gttaatgccg ggcgaagaag aggtggctga gaaccctcgt gcccgtagtt 90660 cagttctgcg tattgcagag aggacgaatg catgatcagc agagtgacag aagctctaag 90720 caaagttaaa ggatcgatgg gaagccacga gcgccatgca ttgcctggtg ttatcggtga 90780 cgatcttttg cgatttggga agctgccact ctgcctgttc atttgcatta ttttgacggc 90840 ggtgactgtg gtaaccacgg cgcaccatac ccgtttactg accgctcagc gcgaacaact 90900 ggtgctggag cgagatgctt tagacattga atggcgcaac ctgatccttg aagagaatgc 90960 gctcggcgac catagccggg tggaaaggat cgccacggaa aagctgcaaa tgcagacatg 91020 ttgatccgtc acaagaaaat atcgtagtgc aaaaataagg ataaacgcga cgcatgaaag 91080 cagcggcgaa aacgcagaaa ccaaaacgtc aggaagaaca tgccaacttt atcagttggc 91140 gttttgcgtt gttatgcggc tgtattctcc tggcgctggc ttttctgctc ggacgcgtag 91200 cgtggttaca agttatctcc ccggatatgc tggtgaaaga gggcgacatg cgttctcttc 91260 gcgttcagca agtttccacc tcccgcggca tgattactga ccgttctggt cgcccgttag 91320 cggtgagcgt gccggtaaaa gcgatttggg ctgacccgaa agaagtgcat gacgctggcg 91380 gtatcagcgt cggtgaccgc tggaaggcgc tggctaacgc gctcaatatt ccgctggatc 91440 agctttcagc ccgcattaac gccaacccga aagggcgctt tatttatctg gcgcgtcagg 
91500 tgaaccctga catggcggac tacatcaaaa aactgaaact gccggggatt catctgcgtg 91560 aagagtctcg ccgttactat ccgtccggcg aagtgactgc tcacctcatc ggctttacta 91620 acgtcgatag tcaagggatt gagggcgttg agaagagttt cgataaatgg cttaccgggc 91680 agccgggtga gcgcattgtg cgtaaagacc gctatggtcg cgtaattgaa gatatttctt 91740 ctactgacag ccaggcagcg cacaacctgg cgctgagtat tgatgaacgc ctgcaggcgc 91800 tggtttatcg cgaactgaac aacgcggtgg cctttaacaa ggctgaatct ggtagcgccg 91860 tgctggtgga tgtcaacacc ggtgaagtgc tggcgatggc taacagcccg tcatacaacc 91920 ctaacaatct gagcggcacg ccgaaagagg cgatgcgtaa ccgtaccatc accgacgtgt 91980 ttgaaccggg ctcaacggtt aaaccgatgg tggtaatgac cgcgttgcaa cgtggcgtgg 92040 tgcgggaaaa ctcggtactc aataccattc cttatcgaat taacggccac gaaatcaaag 92100 acgtggcacg ctacagcgaa ttaaccctga ccggggtatt acagaagtcg agtaacgtcg 92160 gtgtttccaa gctggcgtta gcgatgccgt cctcagcgtt agtagatact tactcacgtt 92220 ttggactggg aaaagcgacc aatttggggt tggtcggaga acgcagtggc ttatatcctc 92280 aaaaacaacg gtggtctgac atagagaggg ccaccttctc tttcggctac gggctaatgg 92340 taacaccatt acagttagcg cgagtctacg caactatcgg cagctacggc atttatcgcc 92400 cactgtcgat taccaaagtt gaccccccgg ttcccggtga acgtgtcttc ccggaatcca 92460 ttgtccgcac tgtggtgcat atgatggaaa gcgtggcgct accaggcggc ggcggcgtga 92520 aggcggcgat taaaggctat cgtatcgcca ttaaaaccgg taccgcgaaa aaggtcgggc 92580 cggacggtcg ctacatcaat aaatatattg cttataccgc aggcgttgcg cctgcgagtc 92640 agccgcgctt cgcgctggtt gttgttatca acgatccgca ggcgggtaaa tactacggcg 92700 gcgccgtttc cgcgccggtc tttggtgcca tcatgggcgg cgtattgcgt accatgaaca 92760 tcgagccgga tgcgctgaca acgggcgata aaaatgaatt tgtgattaat caaggcgagg 92820 ggacaggtgg cagatcgtaa tttgcgcgac cttcttgctc cgtgggtgcc agacgcacct 92880 tcgcgagcac tgcgagagat gacactcgac agccgtgtgg ctgcggcggg cgatctcttt 92940 gtagctgtag taggtcatca ggcggacggg cgtcgatata tcccgcaggc gatagcgcaa 93000 ggtgtcgctg ccattattgc agaggcgaaa gatgaggcga ccgatggtga aatccgtgaa 93060 atgcacggcg taccggtcat ctatctcagc cagctcaacg agcgtttatc tgcactggcg 93120 ggccgctttt accatgaacc ctctgacaat ttacgtctcg 
tgggcgtaac gggcaccaac 93180 ggcaaaacca cgactaccca gctgttggcg cagtggagcc aactgcttgg cgaaatcagc 93240 gcggtaatgg gcaccgttgg taacggcctg ctggggaaag tgatcccgac agaaaataca 93300 accggttcgg cagtcgatgt tcagcatgag ctggcggggc tggtggatca gggcgcgacg 93360 ttttgcgcaa tggaagtttc ctcccacggg ctggtacagc accgtgtggc ggcattgaaa 93420 tttgcggcgt cggtctttac caacttaagc cgcgatcacc ttgattatca tggtgatatg 93480 gaacactacg aagccgcgaa atggctgctt tattctgagc atcattgcgg tcaggcgatt 93540 attaacgccg acgatgaagt gggccgccgc tggctggcaa aactgccgga cgcggttgcg 93600 gtatcaatgg aagatcatat taatccgaac tgtcacggac gctggttgaa agcgaccgaa 93660 gtgaactatc acgacagcgg tgcgacgatt cgctttagct caagttgggg cgatggcgaa 93720 attgaaagcc atctgatggg cgcttttaac gtcagcaacc tgctgctcgc gctggcgaca 93780 ctgttggcac tcggctatcc actggctgat ctgctgaaaa ccgccgcgcg tctgcaaccg 93840 gtttgcggac gtatggaagt gttcactgcg ccaggcaaac cgacggtggt ggtggattac 93900 gcgcatacgc cggatgcact ggaaaaagcc ttacaggcgg cgcgtctgca ctgtgcgggc 93960 aagctgtggt gtgtctttgg ctgtggtggc gatcgcgata aaggtaagcg tccactgatg 94020 ggcgcaattg ccgaagagtt tgctgacgtg gcggtggtga cggacgataa cccgcgtacc 94080 gaagaaccgc gtgccatcat caacgatatt ctggcgggaa tgttagatgc cggacatgcc 94140 aaagtgatgg aaggccgtgc tgaagcggtg acttgcgccg ttatgcaggc taaagagaat 94200 gatgtggtac tggtcgcggg caaaggccat gaagattacc agattgttgg caatcagcgt 94260 ctggactact ccgatcgcgt cacggtggcg cgtctgctgg gggtgattgc atgattagcg 94320 taacccttag ccaacttacc gacattctca acggtgaact gcaaggtgca gatatcaccc 94380 ttgatgctgt aaccactgat acccgaaaac tgacgccggg ctgcctgttt gttgccctga 94440 aaggcgaacg ttttgatgcc cacgattttg ccgaccaggc gaaagctggc gcggcaggcg 94500 cactactggt tagccgtccg ctggacatcg acctgccgca gttaatcgtc aaggatacgc 94560 gtctggcgtt tggtgaactg gctgcatggg ttcgccagca agttccggcg cgcgtggttg 94620 ctctgacggg gtcctccggc aaaacctccg ttaaagagat gacggcggcg attttaagcc 94680 agtgcggcaa cacgctttat acggcaggca atctcaacaa cgacatcggt gtaccgatga 94740 cgctgttgcg cttaacgccg gaatacgatt acgcagttat tgaacttggc gcgaaccatc 94800 agggcgaaat agcctggact 
gtgagtctga ctcgcccgga acgtgcgctg gtcaacaacc 94860 tggcagcggc gcatctggaa ggttttggct cgcttgcggg tgtcgcgaaa gcgaaaggtg 94920 aaatctttag cggcctgccg gaaaacggta tcgccattat gaacgccgac aacaacgact 94980 ggctgaactg gcagagcgta attggctcac gcaaagtgtg gcgtttctca cccaatgccg 95040 ccaacagcga tttcaccgcc accaatatcc atgtgacctc gcacggtacg gaatttaccc 95100 tacaaacccc aaccggtagc gtcgatgttc tgctgccgtt gccggggcgt cacaatattg 95160 cgaatgcgct ggcagccgct gcgctctcca tgtccgtggg cgcaacgctt gatgctatca 95220 aagcggggct ggcaaatctg aaagctgttc caggccgtct gttccccatc caactggcag 95280 aaaaccagtt gctgctcgac gactcctaca acgccaatgt cggttcaatg actgcagcag 95340 tccaggtact ggctgaaatg ccgggctacc gcgtgctggt ggtgggcgat atggcggaac 95400 tgggcgctga aagcgaagcc tgccatgtac aggtgggcga ggcggcaaaa gctgctggta 95460 ttgaccgcgt gttaagcgtg ggtaaacaaa gccatgctat cagcaccgcc agcggcgttg 95520 gcgaacattt tgctgataaa actgcgttaa ttacgcgtct taaattactg attgctgagc 95580 aacaggtaat tacgatttta gttaagggtt cacgtagtgc cgccatggaa gaggtagtac 95640 gcgctttaca ggagaatggg acatgttagt ttggctggcc gaacatttgg tcaaatatta 95700 ttccggcttt aacgtctttt cctatctgac gtttcgcgcc atcgtcagcc tgctgaccgc 95760 gctgttcatc tcattgtgga tgggcccgcg tatgattgct catttgcaaa aactttcctt 95820 tggtcaggtg gtgcgtaacg acggtcctga atcacacttc agcaagcgcg gtacgccgac 95880 catgggcggg attatgatcc tgacggcgat tgtgatctcc gtactgctgt gggcttaccc 95940 gtccaatccg tacgtctggt gcgtgttggt ggtgctggta ggttacggtg ttattggctt 96000 tgttgatgat tatcgcaaag tggtgcgtaa agacaccaaa gggttgatcg ctcgttggaa 96060 gtatttctgg atgtcggtca ttgcgctggg tgtcgccttc gccctgtacc ttgccggcaa 96120 agacacgccc gcaacgcagc tggtggtccc attctttaaa gatgtgatgc cgcagctggg 96180 gctgttctac attctgctgg cttacttcgt cattgtgggt actggcaacg cggtaaacct 96240 gaccgatggt ctcgacggcc tggcaattat gccgaccgta tttgtcgccg gtggttttgc 96300 gctggtggcg tgggcgaccg gcaatatgaa ctttgccagc tacttgcata taccgtatct 96360 gcgacacgcc ggggaactgg ttattgtctg taccgcgata gtcggggcag gactgggctt 96420 cctgtggttt aacacctatc cggcgcaggt ctttatgggc gatgtaggtt cgctggcgtt 96480 
aggtggtgcg ttaggcatta tcgccgtact gctacgtcag gaattcctgc tggtgattat 96540 ggggggcgtg ttcgtggtag aaacgctttc tgtcatcctg caggtcggct cctttaaact 96600 gcgcggacaa cgtattttcc gcatggcacc gattcatcac cactatgaac tgaaaggctg 96660 gccggaaccg cgcgtcattg tgcgtttctg gattatttcg ctgatgctgg ttctgattgg 96720 tctggcaacg ctgaaggtac gttaatcatg gctgattatc agggtaaaaa tgtcgtcatt 96780 atcggcctgg gcctcaccgg gctttcctgc gtggactttt tcctcgctcg cggtgtgacg 96840 ccgcgcgtta tggatacgcg tatgacaccg cctggcctgg ataaattacc cgaagccgta 96900 gaacgccaca cgggcagtct gaatgatgaa tggctgatgg cggcagatct gattgtcgcc 96960 agtcccggta ttgcactggc gcatccatcc ttaagcgctg ccgctgatgc cggaatcgaa 97020 atcgttggcg atatcgagct gttctgtcgc gaagcacaag caccgattgt ggcgattacc 97080 ggttctaacg gcaaaagcac ggtcaccacg ctagtgggtg aaatggcgaa agcggcgggg 97140 gttaacgttg gtgtgggtgg caatattggc ctgcctgcgt tgatgctact ggatgatgag 97200 tgtgaactgt acgtgctgga actgtcgagc ttccagctgg aaaccacctc cagcttacag 97260 gcggtagcag cgaccattct gaacgtgact gaagatcata tggatcgcta tccgtttggt 97320 ttacaacagt atcgtgcagc aaaactgcgc atttacgaaa acgcgaaagt ttgcgtggtt 97380 aatgctgatg atgccttaac aatgccgatt cgcggtgcgg atgaacgctg cgtcagcttt 97440 ggcgtcaaca tgggtgacta tcacctgaat catcagcagg gcgaaacctg gctgcgggtt 97500 aaaggcgaga aagtgctgaa tgtgaaagag atgaaacttt ccgggcagca taactacacc 97560 aatgcgctgg cggcgctggc gctggcagat gctgcagggt taccgcgtgc cagcagcctg 97620 aaagcgttaa ccacattcac tggtctgccg catcgctttg aagttgtgct ggagcataac 97680 ggcgtacgtt ggattaacga ttcgaaagcg accaacgtcg gcagtacgga agcggcgctg 97740 aatggcctgc acgtagacgg cacactgcat ttgttgctgg gtggcgatgg taaatcggcg 97800 gactttagcc cactggcgcg ttacctgaat ggcgataacg tacgtctgta ttgtttcggt 97860 cgtgacggcg cgcagctggc ggcgctacgc ccggaagtgg cagaacaaac cgaaactatg 97920 gaacaggcga tgcgcttgct ggctccgcgt gttcagccgg gcgatatggt tctgctctcc 97980 ccagcctgtg ccagccttga tcagttcaag aactttgaac aacgaggcaa tgagtttgcc 98040 cgtctggcga aggagttagg ttgatgcgtt tatctctccc tcgcctgaaa atgccgcgcc 98100 tgccaggatt cagtatcctg gtctggatct ccacggcgct aaagggctgg 
gtgatgggct 98160 cgcgggaaaa agataccgac agcctgatca tgtacgatcg caccttactg tggctgacct 98220 tcggcctcgc ggcgattggc tttatcatgg tgacctcggc gtcaatgccc atagggcaac 98280 gcttaaccaa cgatccgttc ttcttcgcga agcgtgatgg tgtctatctg attttggcgt 98340 ttattctggc gatcattacg ctgcgtctgc cgatggagtt ctggcaacgc tacagtgcca 98400 cgatgctgct cggatctatc atcctgctga tgatcgtcct ggtagtgggt agctcggtta 98460 aaggggcatc gcgttggatc gatctcggtt tgctgcgtat ccagcctgcg gagctgacaa 98520 aactgtcgct gttttgctat atcgccaact atctggtgcg taaaggcgac gaagtacgta 98580 ataacctgcg cggcttcctg aaaccgatgg gcgtgattct ggtgttggca gtgttactgc 98640 tggcacagcc agaccttggt acggtggtgg tgttgtttgt gactacgctg gcgatgttgt 98700 tcctggcggg agcgaaattg tggcagttca ttgccattat cggtatgggc atttcagcgg 98760 ttgtgttgct gatactcgcc gaaccgtacc gtatccgccg tgttaccgca ttctggaacc 98820 cgtgggaaga tccctttggc agcggctatc agttaacgca atcgctgatg gcgtttggtc 98880 gcggcgaact ttgggggcaa ggtttaggta actcggtaca aaaactggag tatctgccgg 98940 aagcgcacac tgactttatt ttcgccatta tcggcgaaga actggggtat gtcggtgtgg 99000 tgctggcact tttaatggta ttcttcgtcg cttttcgcgc gatgtcgatt ggccgtaaag 99060 cattagaaat tgaccaccgt ttttccggtt ttctcgcctg ttctattggc atctggttta 99120 gcttccaggc gctggttaac gtaggcgcgg cggcggggat gttaccgacc aaaggtctga 99180 cattgccgct gatcagttac ggtggttcga gcttactgat tatgtcgaca gccatcatga 99240 tgctgttgcg tattgattat gaaacgcgtc tggagaaagc gcaggcgttt gtacgaggtt 99300 cacgatgagt ggtcaaggaa agcgattaat ggtgatggca ggcggaaccg gtggacatgt 99360 attcccggga ctggcggttg cgcaccatct aatggctcag ggttggcaag ttcgctggct 99420 ggggactgcc gaccgtatgg aagcggactt agtgccaaaa catggcatcg aaattgattt 99480 cattcgtatc tctggtctgc gtggaaaagg tataaaagca ctgatagctg ccccgctgcg 99540 tatcttcaac gcctggcgtc aggcgcgggc gattatgaaa gcgtacaaac ctgacgtggt 99600 gctcggtatg ggaggctacg tgtcaggtcc aggtggtctg gccgcgtggt cgttaggcat 99660 tccggttgta cttcatgaac aaaacggtat tgcgggctta accaataaat ggctggcgaa 99720 gattgccacc aaagtgatgc aggcgtttcc aggtgctttc cctaatgcgg aagtagtggg 99780 taacccggtg cgtaccgatg tgttggcgct 
gccgttgccg cagcaacgtt tggctggacg 99840 tgaaggtccg gttcgtgtgc tggtagtggg tggttctcag ggcgcacgca ttcttaacca 99900 gacaatgccg caggttgctg cgaaactggg tgattcagtc actatctggc atcagagcgg 99960 caaaggttcg caacaatccg ttgaacaggc gtatgccgaa gcggggcaac cgcagcataa 100020 agtgacggaa tttattgatg atatggcggc ggcgtatgcg tgggcggatg tcgtcgtttg 100080 ccgctccggt gcgttaacgg tgagtgaaat cgccgcggca ggactaccgg cgttgtttgt 100140 gccgtttcaa cataaagacc gccagcaata ctggaatgcg ctaccgctgg aaaaagcggg 100200 cgcagccaaa attatcgagc agccacagct tagcgtggat gctgtcgcca acaccctggc 100260 cgggtggtcg cgagaaacct tattaaccat ggcagaacgc gcccgcgctg catccattcc 100320 ggatgccacc gagcgagtgg caaatgaagt gagccgggtt gcccgggcgt aattgtagcg 100380 atgccttttg catcgtatga atttaagaag ttaatggcgt aaagaatgaa tacacaacaa 100440 ttggcaaaac tgcgttccat cgtgcccgaa atgcgtcgcg ttcggcacat acattttgtc 100500 ggcattggtg gtgccggtat gggcggtatt gccgaagttc tggccaatga aggttatcag 100560 atcagtggtt ccgatttagc gccaaatccg gtcacgcagc agttaatgaa tctgggtgcg 100620 acgatttatt tcaaccatcg cccggaaaac gtacgtgatg ccagcgtggt cgttgtttcc 100680 agcgcgattt ctgccgataa cccggaaatt gtcgccgctc atgaagcgcg tattccggtg 100740 atccgtcgtg ccgaaatgct ggctgagtta atgcgttttc gtcatggcat cgccattgcc 100800 ggaacgcacg gcaaaacgac aaccaccgcg atggtttcca gcatctacgc agaagcgggg 100860 ctcgacccaa ccttcgttaa cggcgggctg gtaaaagcgg cgggggttca tgcgcgtttg 100920 gggcatggtc ggtacctgat tgccgaagca gatgagagtg atgcatcgtt cctgcatctg 100980 caaccgatgg tggcgattgt caccaatatc gaagccgacc acatggatac ctaccagggc 101040 gactttgaga atttaaaaca gacttttatt aattttctgc acaacctgcc gttttacggt 101100 cgtgcggtga tgtgtgttga tgatccggtg atccgcgaat tgttaccgcg agtggggcgt 101160 cagaccacga cttacggctt cagcgaagat gccgacgtgc gtgtagaaga ttatcagcag 101220 attggcccgc aggggcactt tacgctgctg cgccaggaca aagagccgat gcgcgtcacc 101280 ctgaatgcgc caggtcgtca taacgcgctg aacgccgcag ctgcggttgc ggttgctacg 101340 gaagagggca ttgacgacga ggctattttg cgggcgcttg aaagcttcca ggggactggt 101400 cgccgttttg atttcctcgg tgaattcccg ctggagccag tgaatggtaa 
aagcggtacg 101460 gcaatgctgg tcgatgacta cggccaccac ccgacggaag tggacgccac cattaaagcg 101520 gcgcgcgcag gctggccgga taaaaacctg gtaatgctgt ttcagccgca ccgttttacc 101580 cgtacgcgcg acctgtatga tgatttcgcc aatgtgctga cgcaggttga taccctgttg 101640 atgctggaag tgtatccggc tggcgaagcg ccaattccgg gagcggacag ccgttcgctg 101700 tgtcgcacaa ttcgtggacg tgggaaaatt gatcccattc tggtgccgga tccggcgcgg 101760 gtagccgaga tgctggcacc ggtattaacc ggtaacgacc tgattctcgt tcagggggct 101820 ggtaatattg gaaaaattgc ccgttcttta gctgaaatca aactgaagcc gcaaactccg 101880 gaggaagaac aacatgactg ataaaatcgc ggtcctgttg ggtgggacct ccgctgagcg 101940 ggaagtttct ctgaattctg gcgcagcggt gttagccgga ctgcgtgaag gcggtattga 102000 cgcgtatcct gtcgacccga aagaagtcga cgtgacgcaa ctgaagtcga tgggctttca 102060 gaaagtgttt atcgcgctac acggtcgcgg cggtgaagat ggtacgctgc aggggatgct 102120 cgagctgatg ggcttgcctt ataccggaag cggagtgatg gcatctgcgc tttcaatgga 102180 taaactacgc agcaaacttc tatggcaagg tgccggttta ccggtcgcgc cgtgggtagc 102240 gttaacccgc gcagagtttg aaaaaggcct gagcgataag cagttagcag aaatttctgc 102300 tctgggtttg ccggttatcg ttaagccgag ccgcgaaggt tccagtgtgg gaatgtcaaa 102360 agtagtagca gaaaatgctc tacaagatgc attaagattg gcatttcagc acgatgaaga 102420 agtattgatt gaaaaatggc taagtgggcc ggagttcacg gttgcgatac tcggtgaaga 102480 aattttaccg tcaatacgta ttcaaccgtc cggaaccttc tatgattatg aggcgaagta 102540 tctctctgat gagacacagt atttctgccc cgcaggtctg gaagcgtcac aagaggccaa 102600 tttgcaggca ttagtgctga aagcatggac gacgttaggt tgcaaaggat ggggacgtat 102660 tgacgttatg ctggacagcg atggacagtt ttatctgctg gaagccaata cctcaccggg 102720 tatgaccagc cacagcctgg tgccgatggc ggcacgtcag gcaggtatga gcttctcgca 102780 gttggtagta cgaattctgg aactggcgga ctaatatgtc gcaggctgct ctgaacacgc 102840 gaaacagcga agaagaggtt tcttctcgcc gcaataatgg aacgcgtctg gcggggatcc 102900 ttttcctgct gaccgtttta acgacagtgt tggtgagcgg ctgggtcgtg ttgggctgga 102960 tggaagatgc gcaacgcctg ccgctctcaa agctggtgtt gaccggtgaa cgccattaca 103020 cacgtaatga cgatatccgg cagtcgatcc tggcattggg tgagccgggt acctttatga 103080 cccaggatgt 
caacatcatc cagacgcaaa tagaacaacg cctgccgtgg attaagcagg 103140 tgagcgtcag aaagcagtgg cctgatgaat tgaagattca tctggttgaa tatgtgccga 103200 ttgcgcggtg gaatgatcaa catatggtag acgcggaagg aaataccttc agcgtgccgc 103260 cagaacgcac cagcaagcag gtgcttccaa tgctgtatgg cccggaaggc agcgccaatg 103320 aagtgttgca gggctatcgc gaaatggggc agatgctggc aaaggacaga tttactctga 103380 aggaagcggc gatgaccgcg cggcgttcct ggcagttgac gctgaataac gatattaagc 103440 tcaatcttgg ccggggcgat acgatgaaac gtttggctcg ctttgtagaa ctttatccgg 103500 ttttacagca gcaggcgcaa accgatggca aacggattag ctacgttgat ttgcgttatg 103560 actctggagc ggcagtaggc tgggcgccct tgccgccaga ggaatctact cagcaacaaa 103620 atcaggcaca ggcagaacaa caatgatcaa ggcgacggac agaaaactgg tagtaggact 103680 ggagattggt accgcgaagg ttgccgcttt agtaggggaa gttctgcccg acggtatggt 103740 caatatcatt ggcgtgggca gctgcccgtc gcgtggtatg gataaaggcg gggtgaacga 103800 cctcgaatcc gtggtcaagt gcgtacaacg cgccattgac caggcagaat tgatggcaga 103860 ttgtcagatc tcttcggtat atctggcgct ttctggtaag cacatcagct gccagaatga 103920 aattggtatg gtgcctattt ctgaagaaga agtgacgcaa gaagatgtgg aaaacgtcgt 103980 ccataccgcg aaatcggtgc gtgtgcgcga tgagcatcgt gtgctgcatg tgatcccgca 104040 agagtatgcg attgactatc aggaagggat caagaatccg gtaggacttt cgggcgtgcg 104100 gatgcaggca aaagtgcacc tgatcacatg tcacaacgat atggcgaaaa acatcgtcaa 104160 agcggttgaa cgttgtgggc tgaaagttga ccaactgata tttgccggac tggcatcaag 104220 ttattcggta ttgacggaag atgaacgtga actgggtgtc tgcgtcgtcg atatcggtgg 104280 tggtacaatg gatatcgccg tttataccgg tggggcattg cgccacacta aggtaattcc 104340 ttatgctggc aatgtcgtga ccagtgatat cgcttacgcc tttggcacgc cgccaagcga 104400 cgccgaagcg attaaagttc gccacggttg tgcgctgggt tccatcgttg gaaaagatga 104460 gagcgtggaa gtgccgagcg taggtggtcg tccgccacgg agtctgcaac gtcagacact 104520 ggcagaggtg atcgagccgc gctataccga gctgctcaac ctggtcaacg aagagatatt 104580 gcagttgcag gaaaagcttc gccaacaagg ggttaaacat cacctggcgg caggcattgt 104640 attaaccggt ggcgcagcgc agatcgaagg tcttgcagcc tgtgctcagc gcgtgtttca 104700 tacgcaagtg cgtatcggcg cgccgctgaa 
cattaccggt ttaacggatt atgctcagga 104760 gccgtattat tcgacggcgg tgggattgct tcactatggg aaagagtcac atcttaacgg 104820 tgaagctgaa gtagaaaaac gtgttacagc atcagttggc tcgtggatca agcgactcaa 104880 tagttggctg cgaaaagagt tttaattttt atgaggccga cgatgattac ggcctcaggc 104940 gacaggcaca aatcggagag aaactatgtt tgaaccaatg gaacttacca atgacgcggt 105000 gattaaagtc atcggcgtcg gcggcggcgg cggtaatgct gttgaacaca tggtgcgcga 105060 gcgcattgaa ggtgttgaat tcttcgcggt aaataccgat gcacaagcgc tgcgtaaaac 105120 agcggttgga cagacgattc aaatcggtag cggtatcacc aaaggactgg gcgctggcgc 105180 taatccagaa gttggccgca atgcggctga tgaggatcgc gatgcattgc gtgcggcgct 105240 ggaaggtgca gacatggtct ttattgctgc gggtatgggt ggtggtaccg gtacaggtgc 105300 ggcaccagtc gtcgctgaag tggcaaaaga tttgggtatc ctgaccgttg ctgtcgtcac 105360 taagcctttc aactttgaag gcaagaagcg tatggcattc gcggagcagg ggatcactga 105420 actgtccaag catgtgaact ctctgatcac tatcccgaac gacaaactgc tgaaagttct 105480 gggccgcggt atctccctgc tggatgcgtt tggcgcagcg aacgatgtac tgaaaggcgc 105540 tgtgcaaggt atcgctgaac tgattactcg tccgggtttg atgaacgtgg actttgcaga 105600 cgtacgcacc gtaatgtctg agatgggcca cgcaatgatg ggttctggcg tggcgagcgg 105660 tgaagaccgt gcggaagaag ctgctgaaat ggctatctct tctccgctgc tggaagatat 105720 cgacctgtct ggcgcgcgcg gcgtgctggt taacatcacg gcgggcttcg acctgcgtct 105780 ggatgagttc gaaacggtag gtaacaccat ccgtgcattt gcttccgaca acgcgactgt 105840 ggttatcggt acttctcttg acccggatat gaatgacgag ctgcgcgtaa ccgttgttgc 105900 gacaggtatc ggcatggaca aacgtcctga aatcactctg gtgaccaata agcaggttca 105960 gcagccagtg atggatcgct accagcagca tgggatggct ccgctgaccc aagagcagaa 106020 gccggttgct aaagtcgtga atgacaatgc gccgcaaact gcgaaagagc cggattatct 106080 ggatatccca gcattcctgc gtaagcaagc tgattaagaa ttgactggaa tttgggtttc 106140 gattctcttt gtgctaaact ggcccgccga atgtatagta cacttcggtt ggataggtaa 106200 tttggcgaga taatacgatg atcaaacaaa ggacacttaa acgtatcgtt caggcgacgg 106260 gtgtcggttt acataccggc aagaaagtca ccctgacgtt acgccctgcg ccggccaaca 106320 ccggggtcat ctatcgtcgc accgacttga atccaccggt agatttcccg 
gccgatgcca 106380 aatctgtgcg tgataccatg ctctgtacgt gtctggtcaa cgagcatgat gtacggattt 106440 caaccgtaga gcacctcaat gctgctctcg cgggcttggg catcgataac attgttatcg 106500 aagttaacgc gccggaaatc ccgatcatgg acggcagcgc cgctccgttt gtatacctgc 106560 tgcttgacgc cggtatcgac gagttgaact gcgccaaaaa atttgttcgc atcaaagaga 106620 ctgttcgtgt cgaagatggc gataagtggg ctgaatttaa gccgtacaat ggtttttcgc 106680 tggatttcac catcgatttt aaccatccgg ctattgattc cagcaaccag cgctatgcga 106740 tgaacttctc cgctgatgcg tttatgcgcc agatcagccg tgcgcgtacg ttcggtttca 106800 tgcgtgatat cgaatatctg cagtcccgtg gtttgtgcct gggcggcagc ttcgattgtg 106860 ccatcgttgt tgacgattat cgcgtactga acgaagacgg cctgcgtttt gaagacgaat 106920 ttgtgcgtca caaaatgctc gatgcgatcg gtgacttgtt catgtgtggt cacaatatta 106980 ttggtgcatt taccgcttat aaatccggtc atgcactgaa taacaaactg ctgcaggctg 107040 tcctggcgaa acaggaagcc tgggaatatg tgaccttcca ggacgacgca gaactgccgt 107100 tggccttcaa agcgccttca gctgtactgg cataacgaca tttatactgt cgtataaaat 107160 tcgactggca aatctggcac tctctccggc caggtgaacc agtcgttttt ttttgaattt 107220 tataagagct ataaaaaacg gtgcgaacgc tgttttctta agcacttttc cgcacaactt 107280 atcttcattc gtgctgtgga ctgcaggctt taatgataag atttgtgcgc taaatacgtt 107340 tgaatatgat cgggatggca ataacgtgag tggaatactg acgcgctggc gacagtttgg 107400 taaacgctac ttctggccgc atctcttatt agggatggtt gcggcgagtt taggtttgcc 107460 tgcgctcagc aacgccgccg aaccaaacgc gcccgcaaaa gcgacaaccc gcaaccacga 107520 gccttcagcc aaagttaact ttggtcaatt ggccttgctg gaagcgaaca cacgccgccc 107580 gaattcgaac tattccgttg attactggca tcaacatgcc attcgcacgg taatccgtca 107640 tctttctttc gcaatggcac cgcaaacact gcccgttgct gaagaatctt tgcctcttca 107700 ggcgcaacat cttgcattac tggatacgct cagcgcgctg ctgacccagg aaggcacgcc 107760 gtctgaaaag ggttatcgca ttgattatgc gcattttacc ccacaagcaa aattcagcac 107820 gcccgtctgg ataagccagg cgcaaggcat ccgtgctggc cctcaacgcc tcacctaaca 107880 acaataaacc tttacttcat tttattaact ccgcaacgcg gggcgtttga gattttatta 107940 tgctaatcaa attgttaact aaagttttcg gtagtcgtaa cgatcgcacc ctgcgccgga 108000 tgcgcaaagt 
ggtcaacatc atcaatgcca tggaaccgga gatggaaaaa ctctccgacg 108060 aagaactgaa agggaaaacc gcagagtttc gtgcacgtct ggaaaaaggc gaagtgctgg 108120 aaaatctgat cccggaagct ttcgccgtgg tacgtgaggc aagtaagcgc gtctttggta 108180 tgcgtcactt cgacgttcag ttactcggcg gtatggttct taacgaacgc tgcatcgccg 108240 aaatgcgtac cggtgaagga aaaaccctga ccgcaacgct gcctgcttac ctgaacgcac 108300 taaccggtaa aggcgtgcac gtagttaccg tcaacgacta cctggcgcaa cgtgacgccg 108360 aaaacaaccg tccgctgttt gaattccttg gcctgactgt cggtatcaac ctgccgggca 108420 tgccagcacc ggcaaagcgc gaagcttacg cagctgacat cacttacggt acgaacaacg 108480 aatacggctt tgactacctg cgcgacaaca tggcgttcag ccctgaagaa cgtgtacagc 108540 gtaaactgca ctatgcgctg gtggacgaag tggactccat cctgatcgat gaagcgcgta 108600 caccgctgat catttccggc ccggcagaag acagctcgga aatgtataaa cgcgtgaata 108660 aaattattcc gcacctgatc cgtcaggaaa aagaagactc cgaaaccttc cagggcgaag 108720 gccacttctc ggtggacgaa aaatctcgcc aggtgaacct gaccgaacgt ggtctggtgc 108780 tgattgaaga actgctggtg aaagagggca tcatggatga aggggagtct ctgtactctc 108840 cggccaacat catgctgatg caccacgtaa cggcggcgct gcgcgctcat gcgctgttta 108900 cccgtgacgt cgactacatc gttaaagatg gtgaagttat catcgttgac gaacacaccg 108960 gtcgtaccat gcagggccgt cgctggtccg atggtctgca ccaggctgtg gaagcgaaag 109020 aaggtgtgca gatccagaac gaaaaccaaa cgctggcttc gatcaccttc cagaactact 109080 tccgtctgta tgaaaaactg gcggggatga ccggtactgc tgataccgaa gctttcgaat 109140 ttagctcaat ctacaagctg gataccgtcg ttgttccgac caaccgtcca atgattcgta 109200 aagatctgcc ggacctggtc tacatgactg aagcggaaaa aattcaggcg atcattgaag 109260 atatcaaaga acgtactgcg aaaggccagc cggtgctggt gggtactatc tccatcgaaa 109320 aatcggagct ggtgtcaaac gaactgacca aagccggtat taagcacaac gtcctgaacg 109380 ccaaattcca cgccaacgaa gcggcgattg ttgctcaggc aggttatccg gctgcggtga 109440 ctatcgcgac caatatggcg ggtcgtggta cagatattgt gctcggtggt agctggcagg 109500 cagaagttgc cgcgctggaa aatccgaccg cagagcaaat tgaaaaaatt aaagccgact 109560 ggcaggtacg tcacgatgcg gtactggaag caggtggcct gcatatcatc ggtaccgagc 109620 gtcacgaatc ccgtcgtatc gataaccagt 
tgcgcggtcg ttctggtcgt cagggggatg 109680 ctggttcttc ccgtttctac ctgtcgatgg aagatgcgct gatgcgtatt tttgcttccg 109740 accgagtatc cggcatgatg cgtaaactgg gtatgaagcc aggcgaagcc attgaacacc 109800 cgtgggtgac taaagcgatt gccaacgccc agcgtaaagt tgaaagccgt aacttcgaca 109860 ttcgtaagca actgctggaa tatgatgacg tggctaacga tcagcgtcgc gccatttact 109920 cccagcgtaa cgaactgttg gatgtcagcg atgtgagcga aaccattaac agcattcgtg 109980 aagatgtgtt caaagcgacc attgatgcct acattccacc acagtcgctg gaagaaatgt 110040 gggatattcc ggggctgcag gaacgtctga agaacgattt cgacctcgat ttgccaattg 110100 ccgagtggct ggataaagaa ccagaactgc atgaagagac gctgcgtgac ggcattctgg 110160 cgcagtccat cgaagtgtat cagcgtaaag aagaagtggt tggtgctgag atgatgcgtc 110220 acttcgagaa aggcgtcatg ctgcaaacgc ttgactccct gtggaaagag cacctggcag 110280 cgatggacta tctgcgtcag ggtatccacc tgcgtggcta cgcacagaaa gatccgaagc 110340 aggaatacaa acgtgaatcg ttctccatgt ttgcagcgat gctggagtcg ttgaaatatg 110400 aagttatcag tacgctgagc aaagttcagg tacgtatgcc tgaagaggtt gaggagctgg 110460 aacaacagcg tcgtatggaa gccgagcgtt tagcgcaaat gcagcagctt agccatcagg 110520 atgacgactc tgcagccgca gctgcactgg cggcgcaaac cggagagcgc aaagtaggac 110580 gtaacgatcc ttgcccgtgc ggttctggta aaaaatacaa gcagtgccat ggccgcctgc 110640 aataaaagct aactgttgaa gtaaaaggcg caggattctg cgcctttttt ataggtttaa 110700 gacaatgaaa aagctgcaaa ttgcggtagg tattattcgc aacgagaaca atgaaatctt 110760 tataacgcgt cgcgcagcag atgcgcacat ggcgaataaa ctggagtttc ccggcggtaa 110820 aattgaaatg ggtgaaacgc cggaacaggc ggtggtgcgt gaacttcagg aagaagtcgg 110880 gattaccccc caacattttt cgctatttga aaaactggaa tatgaattcc cggacaggca 110940 tataacactg tggttttggc tggtcgaacg ctgggaaggg gagccgtggg gtaaagaagg 111000 gcaacccggt gagtggatgt cgctggtcgg tcttaatgcc gatgattttc cgccagccaa 111060 tgaaccggta attgcgaagc ttaaacgtct gtaggtcaga taaggcgttt tcgccgcatc 111120 cgacattcgc acacgatgcc tgatgcgacg ctggcgcgtc ttatcaggcc taaagggatt 111180 tctaactcat tgataaattt gtttttgtag gtcggataag gcgttcacgc cgcatccgac 111240 atttgcacaa gatgcctgat gcgacgctgt ccgcgtctta tcaggcctac 
gtgcggcatc 111300 agacaaatgt cactgctttg gttcttcgct ccagtcatcg ctttcggaaa gatcgccact 111360 gctggggatt cgtttttctt cagcagccca ttctccgagg tcgatcag 111408 // test.infernal100644000766000024 2021614006427353 17114 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data CPU time (band calc): 0.01u 0.00s 00:00:00.01 Elapsed: 00:00:00 sequence: gi|2239287|gb|U51115.1|BSU51115 hit 0 : 15589 15691 78.40 bits hit 1 : 15589 15691 78.40 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 A+ A+A+ AAAA A :CUC:UAUAAU: :GGGAAUAUGGCCC: :AGUUUCUACC:G 15589 CAUGAAAUCAAAACACGACCUCAUAUAAUCUUGGGAAUAUGGCCCAUAAGUUUCUACCCG 15648 <<<<_______>>>>>>>,,)))).)))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcG.aGuaAauauuaaauauuu 102 GCAACCGUAAAUUGCC:GACUA:G AG: AA + ++ +++++ 15649 GCAACCGUAAAUUGCCGGACUAUGcAGGGAAGUGAUCGAUAAA 15691 hit 2 : 13198 13294 5.41 bits hit 3 : 13198 13294 5.41 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 +A+ +U + AAAA AA : :U:::A AA:C::::G A UGGC::::G: UUU U :: 13198 UAUCUUCUUAAAAUAAGGAAUGAAAAAACCAGCUGCAC-UGGCAGCUGGUUUUUUUUGU- 13255 <<<<_______>>>>>>>,,)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 : C AA :U+++:: U:::A: : AUAU+ U +U 13256 ---UGCAAAAGAUAAAACCAUUUCAUACGAUAUACCGUCAUG 13294 hit 4 : 11655 11756 81.29 bits hit 5 : 11655 11756 81.29 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 A AAAU AAA+AA A+ : CGUAUAAU::CG:GAAUAUGGC:CG::AGU UCUACCA: 11655 AGAAAUCAAAUAAGAUGAAUUCGUAUAAUCGCGGGAAUAUGGCUCGCAAGUCUCUACCAA 11714 <<<<_______>>>>>>>,,)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 GC ACCGUAAAU GC:UGACUACG : AU+U +++ UUU 11715 GCUACCGUAAAUGGCUUGACUACGUAAACAUUUCUUUCGUUU 11756 hit 0 : 825 748 2.84 bits hit 1 : 825 748 2.84 bits 
:::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 +AAAA+ AA AAAA A: ::C:U:U A UG C+ A A UCUAC 15073 UAAAAACAACAAAAGGAGCUGCUUUUUA-----------UGCUCACACACGAUCUAC--- 15118 <<<<_______>>>>>>>,,)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 G AAA +AC:A:G:: :A ++UU AU+U+U 15119 ----AUGAAAA------AACAAAGCAUUACAUUUCCAUUUAU 15150 hit 2 : 4125 4023 1.38 bits hit 3 : 4125 4023 1.38 bits :::::::::::::::::((((((((,,,<<<<<<<_______.>>>.>>>>,,,,,,,,< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGG.ccc.gagaGUuUCUACC 58 AAA +AAA A A AA: :U:::AUAA :+:G:: AU U ::C :+: UUUC AC 11773 CAAACAAAACACUACAAAUUUGAAAUAAGCUGGUAUAUUUUCuUACaCGAAUUUUCGAC- 11831 <<<<<<_______>>>...>>>>,,)))))))):::::::::::::: 59 aGgcaaCCGUAAAuug...cCuGACUAcGaGuaAauauuaaauauuu 102 :AAC GUA AUU: + +GA U:::A: :A+ AU+A+AU+ + 11832 ---AAACUGUAUAUUUaugAUAGAGUUUCAAUUAUCAUAAUAUUCAG 11875 hit 4 : 8105 8018 1.70 bits hit 5 : 8105 8018 1.70 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 A+A A +A AA U:::AU A UAUGGC ++A+ GU UC A 7793 GAUACGGAUUACGGCAAAAGUGGCAUCA---------UAUGGCGUAAAGGUAUCAAG--- 7840 <<<<_______>>>>>>>,,)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 A CGUAA U+C ACU:::A AA+A +AAA+ +++ 7841 --AAUCGUAAUAUACGCUACUGUCAUCAAAAAGAAAAACAAA 7880 hit 6 : 11759 11658 9.64 bits hit 7 : 11759 11658 9.64 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 A AA+ AAA AAA+ : CGUA +::::: : UA GG: :::::GU AC: G 4139 AUCAAACGAAAGAAAUGUUUACGUA-GUCAAGCCAUUUACGGUAGCUUGGUAGAGACUUG 4197 <<<<_______>>>>>>>,,.)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGAC.UAcGaGuaAauauuaaauauuu 102 :: :CC UA : ::C : UACG : AU UUA++U +UU 4198 CGAGCCAUAUUCCCGCGAUUaUACGAAUUCAUCUUAUUUGAUU 4240 hit 8 : 13350 13253 0.79 bits hit 9 : 13350 13253 0.79 bits 
:::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 +AA UA AA A + :A +C:U: C::: AA :::G G C C:A 2548 UAAGGUACCGAACAGUCAAGUCAUU---UCCAUUUAAGUAAAGCAUGGCGGCACAUCUAC 2604 <<<<_______>>>>>>>,,)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 : :AC GUA AU: : U:A :A:G+ U: AU UU++ A+ + 2605 AUGACGGUAUAUCGUAUGA-AAUGGUUUUAUCUUUUGCAACA 2645 hit 10 : 13846 13740 6.35 bits hit 11 : 13846 13740 6.35 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,..... 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCU..... 55 A A+A+A A AA AA:A :::A A :C ::::A A G :::: G: UUCU 2052 CAGCAAAUACACAAGAACAACUAAAAACCCGUCAUAGAAAGCAUGAUGGCGUUCUccgaa 2111 ,,<<<<<<<_______>>>>>>>,,)))))))):::::::::::::: 56 ACCaGgcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 ACC::GC :CCG : GC::G CU::: U:AA+ +A++ ++ 2112 ACCGAGCGGCCGAUUCCAGCUUGCCUUUAACUGAAACCAAUUCCCAA 2158 hit 12 : 14215 14125 0.90 bits hit 13 : 14215 14125 0.90 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 AA AA AA A :+ :UAUA+U: :+++ A A GG : :AGU 1683 AAGCGGAAUCAACGACGCCGUAAUAUAUUCAGAAAUAAACGG---CCAAGUCA------- 1732 <<<<_______>>>>>>>,,)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 C::CCGUAAA::GC+ CUA: +: ++ U+ AA+++ + 1733 -CUGCCGUAAACAGCAGCGCUAUCGCAACUACUGCAAAUACA 1773 hit 14 : 15695 15594 7.88 bits hit 15 : 15695 15594 7.88 bits :::::::::::::::::((((((((,,.,<<<<<<<_______>>>>>>>,,,,,,,,<< 1 aAaaauaaAaaaaaaaauaCuCgUAUA.aucucgggAAUAUGGcccgagaGUuUCUACCa 59 A ++U+A A A ++ C :: AUA +:C::::: UA GG:::::G:GU AC:: 203 CAGUUUUAUCGAUCACUUCCCUGCAUAgUCCGGCAAUUUACGGUUGCCGGGUAGAAACUU 262 <<<<<_______>>>>>>>,,)))))))):::::::::::::: 60 GgcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 +::::CC UA :::: ::A U :: G + + U++++U +UU 263 AUGGGCCAUAUUCCCA-AGAUUAUAUGAGGUCGUGUUUUGAUU 304 sequence: gi|633168|emb|X83878.1| hit 0 : 168 267 79.36 bits hit 1 : 
168 267 79.36 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 + A A++A AA A AA:AC+C:UAUAAU::CG:G AUAUGGC:CG::AGUUUCUACC:G 168 UUACAAUAUAAUAGGAACACUCAUAUAAUCGCGUGGAUAUGGCACGCAAGUUUCUACCGG 227 <<<<_______>>>>>>>,,)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGACUAcGaGuaAauauuaaauauuu 102 CA CCGUAAA UG C:GACUA:G+GU:A A+U A+ 228 GCA-CCGUAAA-UGUCCGACUAUGGGUGAGCAAUGGAACCGC 267 hit 0 : 270 171 2.25 bits hit 1 : 270 171 2.25 bits :::::::::::::::::((((((((,,,<<<<<<<_______>>>>>>>,,,,,,,,<<< 1 aAaaauaaAaaaaaaaauaCuCgUAUAaucucgggAAUAUGGcccgagaGUuUCUACCaG 60 + + A +:AC C:UA +::: :: UA GG :: :::GU AC: G 2144 CGUGCGGUUCCAUUGCUCACCCAUA-GUCGGACAU-UUACGG-UGCCCGGUAGAAACUUG 2200 <<<<_______>>>>>>>,,.)))))))):::::::::::::: 61 gcaaCCGUAAAuugcCuGAC.UAcGaGuaAauauuaaauauuu 102 ::::CC UA ::::C : UA:G GU: + U+++AUAUU 2201 CGUGCCAUAUCCACGCGAUUaUAUGAGUGUUCCUAUUAUAUUG 2243 CPU time (search) : 56.03u 0.38s 00:00:56.41 Elapsed: 00:00:58 CYK memory : 0.66 MB nucmatrix.txt100644000766000024 35214006427353 17127 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data A C G T . N A 100 -150 -50 -150 0 -50 C -150 100 -150 -50 0 -50 G -50 -150 100 -150 0 -50 T -150 -50 -150 100 0 -50 . 0 0 0 0 0 0 N -50 -50 -50 -50 0 -50 -300 -25 crypto.sim4-4100644000766000024 220014006427353 16645 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data seq1 = cn416, 630 bp seq2 = Contig147.fa (>Contig147), 1086 bp >cn416 lastmodified=[2002-10-29 11:40:27] project=[B3501] length=[630] clusterid=[371] coverage=[1] crawid=[1] sequencecount=[2] longestsequence=[k9k11j2.f1] sequences=[k9k11j2.f1|k9k11j2.f1C] clone=[|k9k11j2] >Contig147 36-132 (191-286) 89% <- 133-191 (343-401) 93% 0 . : . : . : . : . : 36 AAAGCCATGCAATGCATCCCATGATACCAAAGATAAAATATCCAAAAATC |||||||||||||||||||-||||||||||||| || ||||||||||||| 191 AAAGCCATGCAATGCATCC ATGATACCAAAGACAATATATCCAAAAATC 50 . : . : . : . : . 
: 86 CATGACCGATCCTGATGCAATAACCAAGAGACCGTTTATCCCGGGAA | || |||| | ||||||||||||||||||| |||||||||||||<<< 240 TAAGATTGATCATCATGCAATAACCAAGAGACCCTTTATCCCGGGAACTA 100 . : . : . : . : . : 133 TGCCACAACGTAGTCCTTCTGTACATCCTGAACATTTCCATCAT ...<<<|||||||||||||||||| ||||||||||||||||||||||||| 290 ...CACTGCCACAACGTAGTCCTTTTGTACATCCTGAACATTTCCATCAT 150 . : . 177 CATCAACGAA AAGTA |||| |||||-||-|| 387 CATCTACGAACAA TA crypto.sim4-3100644000766000024 164214006427353 16655 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data seq1 = cn416, 630 bp seq2 = Contig147.fa (>Contig147), 1086 bp 36-132 (191-286) 89% <- 133-191 (343-401) 93% 0 . : . : . : . : . : 36 AAAGCCATGCAATGCATCCCATGATACCAAAGATAAAATATCCAAAAATC |||||||||||||||||||-||||||||||||| || ||||||||||||| 191 AAAGCCATGCAATGCATCC ATGATACCAAAGACAATATATCCAAAAATC 50 . : . : . : . : . : 86 CATGACCGATCCTGATGCAATAACCAAGAGACCGTTTATCCCGGGAA | || |||| | ||||||||||||||||||| |||||||||||||<<< 240 TAAGATTGATCATCATGCAATAACCAAGAGACCCTTTATCCCGGGAACTA 100 . : . : . : . : . : 133 TGCCACAACGTAGTCCTTCTGTACATCCTGAACATTTCCATCAT ...<<<|||||||||||||||||| ||||||||||||||||||||||||| 290 ...CACTGCCACAACGTAGTCCTTTTGTACATCCTGAACATTTCCATCAT 150 . : . 
177 CATCAACGAA AAGTA |||| |||||-||-|| 387 CATCTACGAACAA TA acefile.ace.1100755000766000024 123373414006427353 16675 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/dataAS 53 114 CO Contig1 796 1 1 U atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccggcaacaagcacaccacctgttgaaGCAGAAGAGAGTAAGAATAA GAAGCCACGATTGGCATAGAGTCAGCTGAGTGTCGCACCCACCACCGCAT TTACTTTTCTTTTTAAAATCTTTGTTTTCATTTGCTTTCTGTCTTTTTGT TTCCAACTTCGTATTTTAGACTCCATGTGGTGGTGCATGTTTTACTTATG AATCCAAGTTCTTCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAaggGGggxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxgggggacgggaaaaaccct ggggttccccaacttaatcgccttgcagaaaatccccttttccccagttg gggtaaaaccaaaaaggcccccaccgatcgcccttcccaacagttgccca ccctgaatggcaaaggggaccccccctgtaccggcccattaagcgcgggg ggtgtgggggttccccccagggggaccgttacatttgccagggccctagc gcccgctcctttggtttttttcccttcttttttcgccacgttcgccggtt ttccccgtcaagctttaaatggggggccccctatagggttccgatt BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 46 42 42 39 39 37 42 42 43 37 40 35 39 39 39 35 35 35 35 35 35 37 37 35 39 35 35 35 35 40 40 39 39 39 35 35 35 39 39 40 39 35 35 35 35 35 39 39 40 51 51 40 35 35 35 35 35 35 40 40 40 40 40 40 35 35 38 38 38 39 39 35 35 35 35 39 40 56 56 56 56 46 46 46 46 40 40 51 56 56 56 51 51 51 51 51 51 51 51 51 51 43 35 35 35 35 35 43 51 51 41 51 51 51 51 51 56 56 56 43 43 43 43 43 43 43 43 43 43 43 43 43 45 56 51 51 51 51 43 43 43 43 43 43 45 51 51 51 56 56 56 56 43 43 43 43 43 43 56 56 51 51 43 43 43 43 43 43 51 51 56 56 56 56 56 56 56 51 45 45 45 45 45 51 51 51 51 51 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 
56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 46 33 13 13 12 31 29 14 14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2261r U 1 BS 1 796 LL2261r RD LL2261r 796 0 6 atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccggcaacaagcacaccacctgttgaaGCAGAAGAGAGTAAGAATAA GAAGCCACGATTGGCATAGAGTCAGCTGAGTGTCGCACCCACCACCGCAT TTACTTTTCTTTTTAAAATCTTTGTTTTCATTTGCTTTCTGTCTTTTTGT TTCCAACTTCGTATTTTAGACTCCATGTGGTGGTGCATGTTTTACTTATG AATCCAAGTTCTTCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAaggGGggxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxgggggacgggaaaaaccct ggggttccccaacttaatcgccttgcagaaaatccccttttccccagttg gggtaaaaccaaaaaggcccccaccgatcgcccttcccaacagttgccca ccctgaatggcaaaggggaccccccctgtaccggcccattaagcgcgggg ggtgtgggggttccccccagggggaccgttacatttgccagggccctagc gcccgctcctttggtttttttcccttcttttttcgccacgttcgccggtt ttccccgtcaagctttaaatggggggccccctatagggttccgatt QA 81 571 46 796 DS CHROMAT_FILE: LL2261r PHD_FILE: LL2261r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:32 2000 RT{ LL2261r matchElsewhereLowQual phrap 710 758 000919:094547 } RT{ LL2261r matchElsewhereLowQual phrap 
664 695 000919:094547 } RT{ LL2261r matchElsewhereLowQual phrap 599 614 000919:094547 } RT{ LL2261r matchElsewhereLowQual phrap 572 589 000919:094547 } RT{ LL2261r matchElsewhereLowQual phrap 529 553 000919:094547 } RT{ LL2261r matchElsewhereLowQual phrap 492 525 000919:094547 } CO Contig2 787 1 1 U atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacgcg tccgcccacgcgtccgcaacaaccaataaacggtttATTTCGCTCAGCAC TCAACCGCAATGGCCGCCTCAACAATGGCTCTCTCCTCCCCTGCCTTCGC CGGAAAGGCCGTCAAGCTTTCCCCAGCAGCATCAGAAGTCCTTGGAAGCG GCCGTGTGACAATGAGGAAGACCGTAGCCAAGCCAAAGGGCCCATCAGGC AGCCCATGGTACGGATCCGAGAGAGTCAAGTACTTGGGCCCATTCTCCGG CGAGCCACCGAGCTACCTTACCGGAGAGTTCCCCGGAGACTACGGATGGG ACACCGCAGGTCTCTCAGCCGATCCCGAGACGTTCGCGAGGAACCGTGAG CTAGAAGTTATCCACTGCAGGTGGGCCATGCTCGGAGCCCTAGGCTGCGT CTTCCCGGAGCTGTTGGCCAGGAACGGAGTCAAGTTCGGAGAGGCGGTTT GGTTCAAGGCCGGTTCGCAGATCTTCAGCGAAGGAGGACTTGATTACTTG GGAAACCCTAGCTTGGTTCACGCTCAGAGCATCTTGgCTATTTGGGCTAC TCAAGTGATCTTGATGGGAGCCGTTGAAGGTTACAGAGTCGCAGGAGATG GGCCGTTGGGAGAGGCCGAGGACTTGCttTACCCAGGTggCAGCTTTGAC ccgTTGGGTTTGgctACTGACccagaggccTTCGCGGAGTTGAaggTGAA GgAGATCaagaaacggaGattggctatgttCtctAtg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 24 25 29 32 34 34 34 34 34 40 46 39 39 39 39 39 40 45 35 35 35 35 35 40 46 40 40 34 34 34 34 34 35 40 40 51 40 40 40 40 39 39 39 39 39 39 40 40 40 40 40 40 40 40 39 39 39 39 39 35 35 35 35 35 35 35 51 51 56 46 40 39 39 39 35 40 40 51 39 39 39 39 39 39 51 51 51 51 51 51 40 40 40 40 40 40 45 45 45 51 40 40 40 40 40 40 56 51 56 51 39 39 35 35 35 35 35 35 45 45 51 51 51 40 40 40 45 45 51 56 56 56 39 38 35 35 35 35 43 43 45 45 45 45 51 51 51 51 51 51 56 46 45 41 41 41 45 46 56 56 56 56 56 51 45 45 45 45 45 56 56 56 56 45 45 45 45 45 45 45 45 43 43 43 43 43 43 43 45 51 56 51 51 43 43 43 43 43 43 43 51 51 51 51 51 51 51 45 45 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 45 51 
51 45 45 51 51 43 43 51 43 43 36 36 36 36 36 36 43 43 43 51 56 56 43 43 43 43 43 43 43 43 43 43 45 51 51 51 51 51 51 56 51 51 51 51 51 51 51 45 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 56 56 45 45 45 45 45 45 51 45 45 45 45 45 45 51 51 51 51 51 51 51 45 45 45 45 40 40 43 46 46 51 51 51 45 45 45 45 40 40 45 45 45 40 45 45 45 45 45 51 51 56 45 40 40 40 40 40 40 45 45 51 51 51 51 45 45 45 45 45 45 45 51 51 51 45 45 45 45 45 45 51 45 45 45 45 45 51 45 45 45 45 45 45 45 45 45 45 45 51 51 46 46 43 42 42 46 48 56 56 56 56 51 51 51 51 51 45 56 56 42 46 46 51 37 37 37 37 37 40 46 51 51 56 56 56 45 45 45 37 37 37 40 38 40 40 40 40 40 40 45 42 37 37 37 40 40 40 51 56 51 56 46 46 46 40 40 40 44 40 40 40 40 34 32 25 29 32 32 32 34 34 34 33 37 35 48 40 40 40 48 48 48 48 34 26 19 25 27 27 29 39 48 46 39 31 35 35 35 35 35 37 40 40 40 51 42 51 56 56 56 56 56 56 56 56 56 48 46 44 32 32 26 26 25 29 29 29 29 21 21 21 29 32 40 48 40 40 29 29 27 27 25 25 34 37 40 40 40 32 34 32 32 32 25 25 25 29 29 48 40 40 27 25 22 25 27 29 25 22 22 25 25 22 24 29 29 19 19 23 27 27 27 32 25 22 22 25 19 18 24 25 27 29 25 32 20 25 21 21 12 15 19 21 21 29 29 37 34 34 22 22 15 19 18 20 20 29 27 27 25 18 19 14 13 8 8 8 11 15 24 27 27 25 25 20 22 29 29 29 25 28 25 19 19 15 24 24 20 20 22 19 25 26 26 29 24 19 9 10 10 9 9 9 10 18 19 22 19 15 19 18 12 17 14 13 13 16 11 16 23 17 18 14 21 18 0 AF LL2330r U 1 BS 1 787 LL2330r RD LL2330r 787 0 0 atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacgcg tccgcccacgcgtccgcaacaaccaataaacggtttATTTCGCTCAGCAC TCAACCGCAATGGCCGCCTCAACAATGGCTCTCTCCTCCCCTGCCTTCGC CGGAAAGGCCGTCAAGCTTTCCCCAGCAGCATCAGAAGTCCTTGGAAGCG GCCGTGTGACAATGAGGAAGACCGTAGCCAAGCCAAAGGGCCCATCAGGC AGCCCATGGTACGGATCCGAGAGAGTCAAGTACTTGGGCCCATTCTCCGG CGAGCCACCGAGCTACCTTACCGGAGAGTTCCCCGGAGACTACGGATGGG ACACCGCAGGTCTCTCAGCCGATCCCGAGACGTTCGCGAGGAACCGTGAG CTAGAAGTTATCCACTGCAGGTGGGCCATGCTCGGAGCCCTAGGCTGCGT CTTCCCGGAGCTGTTGGCCAGGAACGGAGTCAAGTTCGGAGAGGCGGTTT GGTTCAAGGCCGGTTCGCAGATCTTCAGCGAAGGAGGACTTGATTACTTG 
GGAAACCCTAGCTTGGTTCACGCTCAGAGCATCTTGgCTATTTGGGCTAC TCAAGTGATCTTGATGGGAGCCGTTGAAGGTTACAGAGTCGCAGGAGATG GGCCGTTGGGAGAGGCCGAGGACTTGCttTACCCAGGTggCAGCTTTGAC ccgTTGGGTTTGgctACTGACccagaggccTTCGCGGAGTTGAaggTGAA GgAGATCaagaaacggaGattggctatgttCtctAtg QA 81 786 44 787 DS CHROMAT_FILE: LL2330r PHD_FILE: LL2330r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:21 2000 CO Contig3 770 1 1 U atttcgagctcggtacccggggatcctctagagtcgacctgcaggcxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxGCCTGGGGTGCCTAATGAGTGAG CTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAA ACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGC GGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCG CTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAA TACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCA AAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTT TTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAA GTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCC CCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGG ATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGgCGCtTTCTCATAGCT CACGCTGTAggTATCTCAGttCGGTGTAggTCGTtCGCTCCAAGCTGGGC TGTgTGCACGAACCCCCCGt BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 41 45 45 45 51 51 56 36 36 36 36 36 36 41 41 46 56 56 42 46 42 38 36 43 36 36 36 43 43 36 36 40 40 43 41 41 41 43 43 43 43 43 43 43 43 43 43 43 43 36 36 36 36 39 43 36 36 36 36 36 36 43 46 42 42 42 46 46 46 46 42 42 41 41 41 45 45 45 45 45 43 43 39 40 40 36 38 43 43 43 43 40 43 41 41 41 45 43 43 41 45 39 43 36 39 38 36 36 43 43 43 43 43 43 36 40 41 41 41 38 38 43 39 43 43 46 43 43 43 43 43 43 43 56 56 51 43 43 43 43 43 43 43 43 43 36 36 36 36 36 36 43 43 43 43 43 
36 36 36 36 36 36 36 43 43 36 39 43 43 43 43 43 43 43 43 43 43 43 45 51 51 51 51 51 51 51 45 45 45 45 43 40 40 38 38 41 45 43 43 43 51 45 45 45 45 45 40 38 38 40 40 38 41 43 46 45 45 45 45 45 45 56 56 40 43 38 40 38 35 42 43 42 46 41 41 41 41 41 42 41 41 41 41 41 45 50 56 56 56 41 40 40 40 40 35 41 41 42 42 42 44 44 56 42 43 43 50 50 44 47 47 50 50 56 56 50 42 45 45 40 35 35 42 50 44 44 42 40 40 40 37 37 37 35 42 40 40 40 40 40 40 50 50 43 43 44 46 40 39 35 35 35 39 39 35 35 35 39 40 45 45 45 45 56 56 51 51 46 46 40 37 37 40 45 45 40 40 51 51 51 51 56 56 56 40 39 35 35 35 39 40 51 56 56 44 47 56 56 56 56 56 42 42 40 40 40 40 40 37 40 42 42 44 44 42 42 45 37 37 40 45 45 40 35 35 35 35 35 37 37 40 40 45 40 40 40 40 40 37 37 35 35 35 35 42 51 45 40 35 35 35 35 35 37 40 40 40 36 34 40 40 36 40 48 40 34 32 32 29 40 30 32 32 32 33 46 48 56 40 46 33 40 34 32 32 32 29 29 29 34 40 40 37 29 22 25 27 27 29 39 39 33 28 28 29 29 32 29 27 27 25 26 32 34 32 32 36 34 40 40 40 33 39 29 29 29 35 46 40 40 40 40 40 32 48 31 29 25 29 31 34 29 29 40 31 29 22 19 25 25 25 19 22 29 40 40 40 40 44 40 40 40 40 33 31 29 29 29 29 27 27 25 18 18 25 29 27 29 29 27 25 21 17 19 25 20 22 27 28 29 24 19 19 25 29 29 25 18 24 27 27 25 22 22 22 20 25 27 34 29 29 29 29 32 24 24 18 25 20 29 25 27 24 24 25 29 40 40 40 29 29 24 16 AF pgemr U 1 BS 1 770 pgemr RD pgemr 770 0 1 atttcgagctcggtacccggggatcctctagagtcgacctgcaggcxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxGCCTGGGGTGCCTAATGAGTGAG CTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAA ACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGC GGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCG CTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAA TACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCA AAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTT TTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAA GTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCC 
CCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGG ATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGgCGCtTTCTCATAGCT CACGCTGTAggTATCTCAGttCGGTGTAggTCGTtCGCTCCAAGCTGGGC TGTgTGCACGAACCCCCCGt QA 178 770 178 770 DS CHROMAT_FILE: pgemr PHD_FILE: pgemr.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:41 2000 RT{ pgemr matchElsewhereHighQual phrap 178 318 000919:094547 } CO Contig4 637 2 79 U tTtttCGCATAAA*TGAAAAATA*GTTTAAAACGgc*GAAAAATAAACAA TAATCAGGACTTCAACAATATTTATAAGAAAAAAAAAACCAAAAAAAATG AAGCAGCGACCAAAACATAATATAATATTTTTTGCTTAAGGACAAAAAAA CAGCATCACAAGAATCCTAAAGAGAAATAAAAGACACAGATAGAACTAAA GTACACAGAGACTCTTGACACATCCAGAAGGTTCCAAAGAACCACATAAC AACATATAGTTCCAATTCCCATATCAGCTAGGGAAAGAGACTTGTAAACA GACTCACCACACCACGAAGAAGGGAGCAGAGCCCTTAGAAGCACTTCCTG TGAACAATGGATGGACCTGACTCATCGTACTCTCCCTTTGAAATCCACAT CTGTTGGAAAGTGCTGAGGGATGCAAGGATAGATCCTCCAATCCAGACAC TGTATTTCCTCTCAGGAGGAGCCACCACCTTGATCTTCATGCTGCTAGGC GCAAGCGCCGTGATCTCCTTGCTCATACGATCAGCGATTCCTGGGAACAT GGTTGAACCACCACTGAGGACGATGTTTCCATACAGATCCTTCCTGATAT CGACATCACACTTCCCCACGCGTCCGCGGACGCGTGG BQ 0 27 18 18 19 24 25 27 27 25 22 22 25 32 32 42 47 44 42 31 29 27 29 27 41 47 43 44 44 44 36 31 13 18 25 25 55 55 55 61 55 55 69 62 67 65 65 74 66 69 69 69 75 72 72 71 69 55 50 38 35 33 36 28 43 47 55 80 86 88 82 82 82 77 77 84 84 78 82 90 90 88 75 65 62 60 64 83 87 88 90 90 80 71 71 77 77 69 69 66 64 66 61 64 70 70 90 90 90 90 90 90 82 81 82 74 74 72 72 74 74 74 74 77 88 83 90 90 90 90 90 81 79 77 75 75 71 88 88 87 86 90 90 90 90 90 81 75 75 75 72 72 82 88 83 83 80 90 87 82 79 77 79 82 84 85 90 90 90 90 90 90 90 90 88 88 90 90 90 90 90 88 88 88 90 90 90 90 90 88 86 86 86 86 80 77 82 82 82 82 90 90 90 90 90 90 90 85 85 82 78 82 86 86 90 82 82 82 87 86 90 90 90 90 90 90 90 90 90 84 84 89 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 85 85 77 77 83 90 90 90 90 90 90 90 90 90 90 90 90 90 87 81 80 76 77 86 86 90 90 90 90 90 90 90 90 88 88 71 71 67 76 76 87 88 89 86 86 80 80 77 80 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 84 
84 84 84 84 80 85 84 84 86 74 74 74 74 73 85 78 78 71 70 82 85 90 71 71 62 69 69 77 82 82 82 77 78 72 88 81 88 90 90 90 90 90 90 90 90 85 82 82 78 85 85 78 71 70 76 75 77 81 83 88 88 88 90 90 90 90 90 90 90 85 86 85 82 82 78 80 80 80 88 88 90 90 87 73 73 57 57 62 72 74 82 85 90 90 90 83 83 79 79 78 78 90 87 87 87 87 87 82 88 90 90 90 90 90 90 90 90 89 87 80 80 80 74 79 67 69 57 73 68 79 84 85 87 78 78 78 85 88 90 90 90 84 77 76 80 75 84 84 90 90 90 90 90 90 84 84 84 88 88 84 86 90 88 79 80 80 70 70 66 60 60 64 67 68 68 64 64 64 67 64 67 62 59 59 50 50 50 67 68 83 90 90 90 82 75 71 71 71 71 75 74 80 82 82 82 81 76 72 67 64 69 70 72 77 72 72 73 79 82 82 79 79 77 72 68 70 64 62 69 81 75 70 77 83 83 90 90 76 71 71 72 78 79 73 78 74 74 68 67 66 85 61 61 70 75 80 90 90 90 85 85 85 80 90 82 82 84 79 79 79 77 77 73 64 61 65 65 69 71 82 74 76 59 68 63 67 61 67 73 72 66 60 55 59 59 57 54 64 58 58 57 57 56 63 75 71 49 49 49 34 34 35 AF LL2260r C -108 AF LL2260f U 0 BS 1 9 LL2260r BS 10 10 LL2260f BS 11 16 LL2260r BS 17 21 LL2260f BS 22 26 LL2260r BS 27 35 LL2260f BS 36 54 LL2260r BS 55 55 LL2260f BS 56 61 LL2260r BS 62 68 LL2260f BS 69 69 LL2260r BS 70 71 LL2260f BS 72 73 LL2260r BS 74 76 LL2260f BS 77 81 LL2260r BS 82 90 LL2260f BS 91 93 LL2260r BS 94 98 LL2260f BS 99 100 LL2260r BS 101 109 LL2260f BS 110 115 LL2260r BS 116 126 LL2260f BS 127 129 LL2260r BS 130 131 LL2260f BS 132 143 LL2260r BS 144 144 LL2260f BS 145 146 LL2260r BS 147 149 LL2260f BS 150 160 LL2260r BS 161 164 LL2260f BS 165 165 LL2260r BS 166 168 LL2260f BS 169 172 LL2260r BS 173 175 LL2260f BS 176 218 LL2260r BS 219 221 LL2260f BS 222 223 LL2260r BS 224 224 LL2260f BS 225 228 LL2260r BS 229 234 LL2260f BS 235 242 LL2260r BS 243 248 LL2260f BS 249 258 LL2260r BS 259 262 LL2260f BS 263 273 LL2260r BS 274 274 LL2260f BS 275 282 LL2260r BS 283 284 LL2260f BS 285 322 LL2260r BS 323 323 LL2260f BS 324 338 LL2260r BS 339 339 LL2260f BS 340 352 LL2260r BS 353 353 LL2260f BS 354 390 LL2260r BS 391 394 LL2260f BS 
395 407 LL2260r BS 408 410 LL2260f BS 411 455 LL2260r BS 456 456 LL2260f BS 457 461 LL2260r BS 462 463 LL2260f BS 464 468 LL2260r BS 469 470 LL2260f BS 471 471 LL2260r BS 472 473 LL2260f BS 474 479 LL2260r BS 480 482 LL2260f BS 483 484 LL2260r BS 485 486 LL2260f BS 487 526 LL2260r BS 527 537 LL2260f BS 538 542 LL2260r BS 543 548 LL2260f BS 549 554 LL2260r BS 555 557 LL2260f BS 558 591 LL2260r BS 592 596 LL2260f BS 597 637 LL2260r RD LL2260r 791 0 0 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxctttttttttttttttttttttttttttttttttnn nntttnntntTtttCGCATAAA*TGAAAAATA*GTTTAAAAcggc*GAAA AATAAACAATAATCAGGACTTcagcaATATTTATAAGAAAAAAAAAACCA AAAAAAATGAAGCAGCGACCAAAACATAATATAATATTTTTTGCTTAAGG ACAAAAAAACAGCATCACAAGAATCCTAAAGAGAAATAAAAGACACAGAT AGAACTAAAGTACACAGAGACTCTTGACACATCCAGAAGGTTCCAAAGAA CCACATAACAACATATAGTTCCAATTCCCATATCAGCTAGGGAAAGAGAC TTGTAAACAGACTCACCACACCACGAAGAAGGGAGCAGAGCCCTTAGAAG CACTTCCTGTGAACAATGGATGGACCTGACTCATCGTACTCTCCCTTTGA AATCCACATCTGTTGGAAAGTGCTGAGGGATGCAAGGATAGATCCTCCAA TCCAGACACTGTATTTCCTCTCAGGAGGAGCCACCACCTTGATCTTCATG CTGCTAGGCGCAAGCGCCGTGATCTCCTTGCTCATACGATCAGCGATTCC TGGGAACATGGTTGAACCACCACTGAGGACGATGTTTCCATACAGATCCT TCCTGATATCGACATCACACTTCCCCACGCGTCCGCGGACGCGTGGxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxagt QA 111 746 110 746 DS CHROMAT_FILE: LL2260r PHD_FILE: LL2260r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:27 2000 RD LL2260f 792 0 0 nctttccgcATaaactgAAAAAtaggtTTAAAACGgcggaAAAATAAACA ATAATCAGGACTTCAACAATATTTATAAGAAAAAAAAAACCAAAAAAAAT GAAGCAGCGACCAAAACATAATATAATATTTTTTGCTTAAGGACAAAAAA ACAGCATCACAAGAATCCTAAAGAGAAATAAAAGACACAGATAGAACTAA AGTACACAGAGACTCTTGACACATCCAGAAGGTTCCAAAGAACCACATAA CAACATATAGTTCCAATTCCCATATCAGCTAGGGAAAGAGACTTGTAAAC AGACTCACCACACCACGAAGAAGGGAGCAGAGCCCTTAGAAGCACTTCCT GTGAACAATGGATGGACCTGACTCATCGTACTCTCCCTTTGAAATCCACA TCTGTTGGAAAGTGCTGAGGGATGCAAGGATAGATCCTCCAATCCAGACA CTGTATTTCCTCTCAGGAGGAGCCACCACCTTGATCTTCATGCTGCTAGG CGCAAGCGCCGTGATCTCCTTGCTCATACGATCAGCGATTCCTGGGAACA 
TGGTTGAACCACCACTGAGGACGATGTTTCCATACAGATCCTTCCTGATA TCGACATCACACTTCCCCACGCGTCCGCGGACGCGTgGxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx QA 40 638 3 638 DS CHROMAT_FILE: LL2260f PHD_FILE: LL2260f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:50 2000 CO Contig5 605 2 63 U CTtTTTTTTTTTTTTTTTTtttTtttttTTTTTTTTTTTTTTTTTTtttt ttTTTTTTTTTTTTTTTTTTTTTTTGAGGTTAACTTGATAGATTCAAATG GATATGAGAACTTTGATGAATCAAGACAGTGAGAGAGATGGGTGATAGAT ACATTGTTGAGTTTATACAAGAATGGACAGAAAAAAAAAAAGAAACATGT TCACCATTTCCAGAAAACATTGGAACCACACTTGAACTTGTCTTTGCCTT CGCATTCCAACGCCAAGTCTTCTGAGATAAATGGGACTTTCTTTTGCTTG GCAAGATCTTGGCAACCAGTGAAATTCTCAGGGAACTTGCAGCTTCCGAA TTGGACAGTAAACGCTCTCGCAAAGTTTGCTCCACTTGTTGCCAATCTCT TCTTATCATTAAGCTCTTTGTTGGCTTTGCTCTTCTCGAGGTATTCATCG AAGACGCCAGCATTAGCAGAAGCAGAAACAGCAGCAGAGGAGAAGAGTGT AGCGGCTAAGACAACCATTGCGGATCTTCGTCCATTAACATCATCACCAC CAACTCTTTGAGCTGCTTTGATCACCGGAAGCGTTTGTTTCTTCCGGACG CGTGG BQ 22 21 16 25 27 27 56 56 47 42 33 44 42 30 30 30 33 33 31 18 18 18 25 19 18 17 17 17 25 25 26 26 26 42 48 42 42 42 42 42 43 44 44 30 30 20 19 14 14 14 14 18 28 29 33 42 44 47 48 48 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 45 40 40 39 39 39 44 50 55 62 57 56 55 66 67 69 66 71 76 78 76 80 80 71 70 74 71 71 72 81 81 75 71 71 71 71 77 81 80 85 85 80 80 80 80 80 80 81 81 74 71 55 61 62 68 69 80 85 85 85 86 90 89 90 90 86 90 90 90 90 87 87 80 80 86 90 90 90 90 90 90 90 80 82 82 82 82 82 90 90 90 90 90 90 90 90 90 90 88 81 83 81 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 82 82 85 85 90 90 90 90 90 90 90 90 86 86 86 90 86 86 86 78 78 78 83 83 83 89 90 90 90 90 80 80 80 80 80 80 90 90 90 90 81 78 78 78 78 78 80 86 84 83 81 85 86 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 90 86 86 85 90 90 90 90 90 84 84 89 89 88 88 90 90 90 90 90 87 90 90 90 90 90 90 87 90 
90 90 90 89 90 89 83 79 86 79 79 79 79 83 83 83 88 90 90 90 90 90 90 90 90 86 86 83 81 86 79 86 81 86 86 86 86 86 86 86 86 86 86 86 84 90 90 81 86 90 90 90 90 90 90 90 87 87 90 90 89 90 89 89 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 84 88 83 83 86 90 90 90 90 90 88 82 80 80 80 82 88 90 90 90 90 90 90 90 90 90 90 90 90 90 85 80 80 80 85 90 90 86 85 85 86 87 83 70 70 70 70 73 74 80 90 86 86 86 81 90 85 85 77 77 71 73 71 84 71 82 90 90 85 75 75 75 75 75 75 77 77 76 88 90 90 85 85 85 70 70 62 59 56 66 63 52 52 52 57 56 64 58 70 58 61 69 80 86 86 90 88 88 72 77 79 90 90 90 90 90 90 83 83 77 69 65 65 75 57 57 61 64 64 67 73 80 85 90 90 90 85 85 90 90 90 90 84 84 86 86 80 80 80 82 84 82 82 80 79 90 90 79 79 82 85 80 80 76 85 83 80 79 70 57 56 62 58 57 57 53 60 60 62 62 62 59 60 64 68 69 76 76 80 83 49 49 49 34 33 32 AF LL2263r C -139 AF LL2263f U 71 BS 1 81 LL2263r BS 82 83 LL2263f BS 84 144 LL2263r BS 145 145 LL2263f BS 146 182 LL2263r BS 183 188 LL2263f BS 189 265 LL2263r BS 266 268 LL2263f BS 269 281 LL2263r BS 282 283 LL2263f BS 284 290 LL2263r BS 291 293 LL2263f BS 294 298 LL2263r BS 299 303 LL2263f BS 304 304 LL2263r BS 305 305 LL2263f BS 306 311 LL2263r BS 312 321 LL2263f BS 322 322 LL2263r BS 323 324 LL2263f BS 325 328 LL2263r BS 329 333 LL2263f BS 334 337 LL2263r BS 338 342 LL2263f BS 343 346 LL2263r BS 347 350 LL2263f BS 351 354 LL2263r BS 355 356 LL2263f BS 357 358 LL2263r BS 359 361 LL2263f BS 362 362 LL2263r BS 363 373 LL2263f BS 374 374 LL2263r BS 375 376 LL2263f BS 377 377 LL2263r BS 378 378 LL2263f BS 379 410 LL2263r BS 411 411 LL2263f BS 412 414 LL2263r BS 415 415 LL2263f BS 416 436 LL2263r BS 437 439 LL2263f BS 440 446 LL2263r BS 447 447 LL2263f BS 448 449 LL2263r BS 450 451 LL2263f BS 452 452 LL2263r BS 453 475 LL2263f BS 476 478 LL2263r BS 479 486 LL2263f BS 487 507 LL2263r BS 508 508 LL2263f BS 509 517 LL2263r BS 518 529 LL2263f BS 530 540 LL2263r BS 541 542 LL2263f BS 543 545 LL2263r BS 546 547 LL2263f BS 548 558 LL2263r BS 559 562 LL2263f BS 
563 570 LL2263r BS 571 572 LL2263f BS 573 605 LL2263r RD LL2263r 791 0 0 ttaagttgggtaacgccagggttttxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxCTtTTTTTTT TTTTTTTTTtttTtttttTTTTTTTTTTTTTTTTTTttttttTTTTTTTT TTTTTTTTTTTTTTTGAGGTTAACTTGATAGATTCAAATGGATATGAGAA CTTTGATGAATCAAGACAGTGAGAGAGATGGGTGATAGATACATTGTTGA GTTTATACAAGAATGGACAGAAAAAAAAAAAGAAACATGTTCACCATTTC CAGAAAACATTGGAACCACACTTGAACTTGTCTTTGCCTTCGCATTCCAA CGCCAAGTCTTCTGAGATAAATGGGACTTTCTTTTGCTTGGCAAGATCTT GGCAACCAGTGAAATTCTCAGGGAACTTGCAGCTTCCGAATTGGACAGTA AACGCTCTCGCAAAGTTTGCTCCACTTGTTGCCAATCTCTTCTTATCATT AAGCTCTTTGTTGGCTTTGCTCTTCTCGAGGTATTCATCGAAGACGCCAG CATTAGCAGAAGCAGAAACAGCAGCAGAGGAGAAGAGTGTAGCGGCTAAG ACAACCATTGCGGATCTTCGTCCATTAACATCATCACCACCAACTCTTTG AGCTGCTTTGATCACCGGAAGCGTTTGTTTCTTCCGGACGCGTGGxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxatt QA 141 745 141 745 DS CHROMAT_FILE: LL2263r PHD_FILE: LL2263r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:41 2000 RD LL2263f 788 0 3 nctaaaagGTTAACTTGATAGATTCAAATGGATATGAGAACTTTGATGAA TCAAGACAGTGAGAGAGATGGGTGATAGATACATTGTTGAGTTTATACAA GAATGGACAGAAAAAAAAAAAGAAACATGTTCACCATTTCCAGAAAACAT TGGAACCACACTTGAACTTGTCTTTGCCTTCGCATTCCAACGCCAAGTCT TCTGAGATAAATGGGACTTTCTTTTGCTTGGCAAGATCTTGGCAACCAGT GAAATTCTCAGGGAACTTGCAGCTTCCGAATTGGACAGTAAACGCTCTCG CAAAGTTTGCTCCACTTGTTGCCAATCTCTTCTTATCATTAAGCTCTTTG TTGGCTTTGCTCTTCTCGAGGTATTCATCGAAGACGCCAGCATTAGCAGA AGCAGAAACAGCAGCAGAGGAGAAGAGTGTAGCGGCTAAGACAACCATTG CGGATCTTCGTCCATTAACATCATCACCACCAACTCTTTGAGCTGCTTTG ATCACCGGAAGCGTTTGTTTCTTCCGGACGCGtgGxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxcgcatagc ctcgggtgcctaatgagcgagccacctcacattattgcgctgcgctcact gcccgtttccagccgggaaacccggcgtgccacctgca QA 14 535 7 535 DS CHROMAT_FILE: LL2263f PHD_FILE: LL2263f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:04 2000 RT{ LL2263f chimera phrap 698 788 000919:094547 
} RT{ LL2263f matchElsewhereLowQual phrap 735 754 000919:094547 } RT{ LL2263f matchElsewhereLowQual phrap 704 722 000919:094547 } CO Contig6 1092 2 38 U cttttatatttcaattgcataaccatatatttatgcacaccacgtacttg cgtatgtgcacaattgcataaaagacattaCAATCCAATTTTTCTCGCAA TTATATAAACGTACAAGAAGAAACATGATCATTGTACGTACGTCTTGTAC ATGGTATTATTTCAAGCATGACACTCGATTGGTGGGACAGGATACCTTGA CTTGTCGGTACAATAATCATATACCATATGGTTTACTCTAACCCAACGGT AACGTCGAGCTTCCACGGCGTTAAGTGACTGATAAGCGTAGCCTTCCCAC CAATTGTGAGGGTTCGATGGACAATTGGTGGGTCCAGGAACAGGACAGCC TTCGATGTCGAAATCTTTGTAATAAGCATAAAAAGGAGCTTTGCTCCAAT CAATTTTCTCTAATCCTCCACGTGTCGCCCAATCATCGGCTTCCCATAGT GTTGAGTAAACTCCCATGGGTTGTGATGTTGGGTAGGCTATGTTCTTGGC TTGGTTGTTTTTGTATTCTCTTATTGGTACATCGTCCACGTAAAAAACAA TGTGTTTGTGTGACCAAAGGATACTGTAAGTGTGGAAATCCAAAGATGGG TCGAACCAAAGATTAACCCTTTGTTCTCTATCTCCTTTGCCATGAGCAAA TATGTTTGTTTGCACCGAGTAAGGTTGGCCACTTCGGTTTCCCAAGAACT CAAAATCTAGTTCATCCCGTACGGTGTCTGTATCCGAGTTCATGTAGAAG GCGGTGACCGTACCGGCAGAGTCGCCGGGAATGAGTTTGATCTTCATGCT CACTTTTCCGAATAGATACTTTCTTTTGGAAGAGAATCCACATCCAGTGC TCTGGTCAAGGACAAGTTGGATAGCTTTTCCACCGTCGACTTGACGGATG TGAGATCCCGACCATGCGGCTTTGAAATCCTCGGCGAATGTCGTAGGCCG TGCTGAAATCTTGATGAACATTAGCGTACAAAGAGTGACAATGCAAAGAA AATGAGTTTTGGCCATCATGGTCGGCCTTTATAACGATGAAAAAAATATG TTCGAGTgactacttgtgacccacgcgtccgcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 39 39 51 40 40 40 46 46 46 51 51 51 51 51 39 39 39 39 35 35 35 35 35 35 35 39 39 35 35 35 35 35 35 40 40 45 45 40 40 51 51 51 51 51 40 40 40 40 40 40 40 40 40 45 40 40 39 39 38 38 38 38 39 39 51 39 39 40 38 38 43 45 43 43 40 35 43 43 43 43 43 45 51 51 51 51 51 51 51 51 51 43 43 43 43 43 43 43 45 56 56 56 56 56 43 43 43 43 43 43 51 43 43 43 43 43 43 36 36 36 36 36 36 43 43 43 43 43 43 43 43 43 43 51 51 45 45 45 45 45 45 45 45 45 45 45 45 45 51 51 56 43 43 43 43 43 43 43 43 45 45 45 45 45 43 43 43 43 43 45 43 43 43 43 43 43 51 45 45 43 43 43 43 43 43 43 43 43 43 43 
43 43 51 51 51 51 43 43 43 43 43 43 56 56 43 43 43 43 43 43 45 45 45 45 43 43 43 43 43 43 43 43 56 56 56 56 56 56 56 56 56 51 43 43 43 38 38 43 43 43 43 43 43 43 51 51 56 56 56 56 56 56 56 56 56 56 56 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 45 51 45 63 70 66 61 43 51 56 56 56 56 56 56 56 51 51 70 69 69 45 51 51 56 56 56 56 56 56 56 56 45 45 45 45 55 54 43 48 48 46 47 65 68 85 85 80 85 76 78 80 74 74 77 82 82 88 88 85 81 84 88 83 60 63 64 61 69 80 74 74 77 74 70 70 83 88 88 88 83 83 80 76 76 74 77 77 77 85 85 81 78 83 88 88 85 85 81 81 72 78 76 73 60 64 69 77 71 74 88 90 90 88 88 88 84 84 76 82 84 86 86 76 85 81 83 73 78 80 80 66 75 75 75 75 83 78 77 77 77 75 80 64 64 66 61 56 62 66 70 74 80 86 86 84 88 80 80 77 74 74 74 88 88 80 79 76 64 61 63 65 65 85 88 90 80 77 69 67 65 60 56 62 61 61 66 73 88 81 89 89 88 73 73 71 90 73 90 90 90 90 90 90 90 84 77 75 74 86 84 82 81 80 76 77 80 78 88 78 75 68 62 59 69 69 80 80 90 82 74 70 73 77 80 80 76 72 72 81 85 90 90 74 70 70 69 74 69 70 69 76 74 67 70 75 83 67 69 69 62 65 65 78 76 80 66 70 60 65 64 53 56 58 42 47 43 43 51 51 51 51 51 51 51 56 72 76 66 67 61 64 61 59 64 76 85 80 88 79 74 71 66 71 68 64 63 62 62 61 59 68 69 65 69 72 63 59 59 58 58 58 61 59 62 62 76 75 69 61 62 55 55 55 69 70 76 73 74 74 68 63 62 64 64 61 61 57 57 57 59 54 53 51 50 50 50 45 44 44 42 42 46 46 51 45 45 45 45 43 43 45 45 45 45 62 63 62 63 66 55 56 56 53 51 51 57 51 45 45 51 51 51 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 51 45 45 45 45 43 43 43 43 43 43 43 45 45 51 56 56 51 51 43 43 43 43 43 43 51 56 56 56 56 56 56 56 51 51 51 51 51 56 56 56 56 56 56 45 45 45 45 45 41 41 41 45 45 45 56 56 51 51 51 51 51 51 56 51 51 45 45 45 45 45 45 45 41 41 43 56 56 46 43 42 38 43 43 43 43 43 43 43 43 43 43 43 43 43 43 46 43 46 46 43 43 43 43 43 43 51 51 40 43 43 43 43 45 56 56 56 56 56 51 43 43 43 43 43 43 43 45 51 51 51 51 51 46 46 41 43 43 43 43 43 43 40 41 41 45 45 45 45 46 56 38 36 36 36 36 36 45 45 51 51 51 51 51 56 51 51 43 43 36 36 36 36 35 
35 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 56 56 51 46 46 43 43 43 46 46 51 40 38 35 35 35 35 35 35 38 46 45 45 39 39 38 38 38 38 40 40 46 37 37 37 40 40 40 51 51 51 51 51 51 45 45 45 45 45 40 40 40 40 40 40 40 40 56 40 40 40 39 35 35 35 35 35 39 40 40 40 39 39 39 35 35 35 35 35 34 35 35 35 56 56 56 51 51 51 46 51 51 40 40 35 35 35 35 35 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2264f U 1 AF LL2264r C 354 BS 1 357 LL2264f BS 358 361 LL2264r BS 362 372 LL2264f BS 373 375 LL2264r BS 376 390 LL2264f BS 391 399 LL2264r BS 400 461 LL2264f BS 462 462 LL2264r BS 463 486 LL2264f BS 487 491 LL2264r BS 492 497 LL2264f BS 498 500 LL2264r BS 501 501 LL2264f BS 502 504 LL2264r BS 505 506 LL2264f BS 507 510 LL2264r BS 511 511 LL2264f BS 512 515 LL2264r BS 516 517 LL2264f BS 518 523 LL2264r BS 524 525 LL2264f BS 526 527 LL2264r BS 528 530 LL2264f BS 531 531 LL2264r BS 532 532 LL2264f BS 533 541 LL2264r BS 542 544 LL2264f BS 545 550 LL2264r BS 551 551 LL2264f BS 552 622 LL2264r BS 623 628 LL2264f BS 629 629 LL2264r BS 630 630 LL2264f BS 631 640 LL2264r BS 641 711 LL2264f BS 712 729 LL2264r BS 730 742 LL2264f BS 743 1092 LL2264r RD LL2264f 762 0 0 cttttatatttcaattgcataaccatatatttatgcacaccacgtacttg cgtatgtgcacaattgcataaaagacattaCAATCCAATTTTTCTCGCAA TTATATAAACGTACAAGAAGAAACATGATCATTGTACGTACGTCTTGTAC ATGGTATTATTTCAAGCATGACACTCGATTGGTGGGACAGGATACCTTGA CTTGTCGGTACAATAATCATATACCATATGGTTTACTCTAACCCAACGGT AACGTCGAGCTTCCACGGCGTTAAGTGACTGATAAGCGTAGCCTTCCCAC CAATTGTGAGGGTTCGATGGACAATTGGTGGGTCCAGGAACAGGACAGCC TTCGATGTCGAAATCTTTGTAATAAGCATAAAAAGGAGCTTTGCTCCAAT CAATTTTCTCTAATCCTCCACGTGTCGCCCAATCATCGGCTTCCCATAGT GTTGAGTAAACTCCCATGGGTTGTGATGTTGGGTAGGCTATGTTCTTGGC TTGGTTGTTTTTGTATTCTCTTATTGGTACATCGTCCACGTAAAAAACAA TGTGTTTGTGTGACCAAAGGATACTGTAAGTGTGGAAATCCAAAGATGGG TCGAACCAAAGATTAACCCTTTGTTCTCTATCtnctTTGCCATGAGCAAA TATGTTTGTTTGCACCGAGTAAGGTTGGCCACTTCGGTTTCCCAAGAACT CAAAATCTAGTTCattccgtacggngtCTGTATCCGAGTTCATGtaaaag gccgtgACCGta QA 81 622 1 762 
DS CHROMAT_FILE: LL2264f PHD_FILE: LL2264f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:31 2000 RD LL2264r 784 0 0 gATGTCGAAATctntgTAATAAGCATanaaagGAGCTTTGCTCCAATCAA TTTTCTCTAATCCTCCACGTGTCGCCCAATCATCGGCTTCCCATAGTGTT GAGTAAACTCCCATGGGTTGTGATGTTGGGTAGGCTATGTTCTTGGCTTG GTTGTTTTTGTATTCTCTTATTGGTACATCGTCCACGTAAAAAACAATGT GTTTGTGTGACCAAAGGATACTGTAAGTGTGGAAATCCAAAGATGGGTCG AACCAAAGATTAACCCTTTGTTCTCTATCTCCTTTGCCATGAGCAAATAT GTTTGTTTGCACCGAGTAAGGTTGGCCACTTCGGTTTCCCAAGAACTCAA AATCTAGTTCATCCCGTACGGTGTCTGTATCCGAGTTCATGTAGAAGGCG GTGACCGTACCGGCAGAGTCGCCGGGAATGAGTTTGATCTTCATGCTCAC TTTTCCGAATAGATACTTTCTTTTGGAAGAGAATCCACATCCAGTGCTCT GGTCAAGGACAAGTTGGATAGCTTTTCCACCGTCGACTTGACGGATGTGA GATCCCGACCATGCGGCTTTGAAATCCTCGGCGAATGTCGTAGGCCGTGC TGAAATCTTGATGAACATTAGCGTACAAAGAGTGACAATGCAAAGAAAAT GAGTTTTGGCCATCATGGTCGGCCTTTATAACGATGAAAAAAATATGTTC GAGTgactacttgtgacccacgcgtccgcggacgcgtggxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxagt QA 47 704 1 739 DS CHROMAT_FILE: LL2264r PHD_FILE: LL2264r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:09 2000 CO Contig7 796 2 49 U ctttagggaaccatagagactaggatgttaaatagagaagcattcattat gaagagaccacatattatagagaagcccaaatcaaaccaaagcaaagtat tcaatggcagtttttcatacagacgataacatgctctttacttataaatc ttattcatatgtttctaaaagcttattacggtcctgaatctgccaccttt ttcaaaacttctattttgtttttaagcacgccctttataaaatgcgttta tgataataggaaagtggtgctcaaaatttggcgctttggaggcctatcct cagcttcagagactctTATTTGTCTGCCgtcCAAAtcagcgccATTCAAG gttctGaTGGCAtttctggaCcttcttgAgcagagttgtacgtcacaAAC CcaaaacCcttggaACGgccactgtcCCTGTCgtagaTGACTCTAGCCTC GACAACCTTTCCTTgcTCACCGAACAAACTCTCCAGAGCCATGTCATCGA CtccCCATGAGagactgcCGACATAAACACGgcttccTGAACCAGCAGCA GCAC*CACCACCATATCCAGAACCAGAGCTTCCAAAGCTGcttCTGGgaC ctctgGaGAAAGagtcttccctcttcgggggaggaggaccagcattcact ctnaagggtctaccatccaactcatagccaccgaattgctgcgcagctgc cctaa*cttcagagaccgacgaccatagcc*cgaacccgaaccccctgcc tcgc*cctgtccctttgtcatagatcacccca*ccatcccgaaatt BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 14 25 27 34 44 30 28 26 24 24 26 22 20 19 19 19 22 23 22 25 14 14 12 12 11 13 17 16 22 22 25 25 21 21 21 19 19 11 11 13 23 19 23 23 22 22 22 13 13 13 9 10 10 18 17 22 12 12 12 12 13 16 18 22 19 19 15 15 15 14 14 12 11 11 11 14 14 15 15 15 17 19 22 22 25 25 17 17 18 18 18 18 20 13 8 8 12 13 13 23 24 24 18 18 17 19 16 16 11 11 15 21 21 27 24 24 27 19 18 13 12 16 25 25 30 33 26 24 24 33 24 24 27 24 27 24 24 30 31 31 29 31 31 42 33 37 33 33 27 14 14 22 23 23 28 32 33 35 39 35 42 37 37 42 30 30 24 25 22 27 22 24 27 31 30 27 27 33 24 24 24 27 33 33 24 27 15 15 15 29 25 33 28 28 23 26 17 14 8 6 6 8 13 21 25 25 27 30 29 29 25 25 22 24 20 24 14 11 11 13 8 15 23 29 29 40 31 31 28 23 30 29 32 29 29 29 31 31 32 29 29 33 33 29 33 30 27 22 32 27 32 28 30 25 21 25 25 31 32 29 31 38 32 32 35 35 41 48 45 38 41 27 24 20 12 11 11 24 25 25 22 19 19 23 12 11 11 14 12 23 18 21 20 22 25 23 17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2266f U 1 AF LL2266r C 438 BS 1 568 LL2266f BS 569 569 LL2266r BS 570 575 LL2266f BS 576 587 LL2266r BS 588 588 LL2266f BS 589 589 LL2266r BS 590 613 LL2266f BS 614 616 LL2266r BS 617 626 LL2266f BS 627 631 LL2266r BS 632 635 LL2266f BS 
636 636 LL2266r BS 637 638 LL2266f BS 639 644 LL2266r BS 645 646 LL2266f BS 647 647 LL2266r BS 648 654 LL2266f BS 655 661 LL2266r BS 662 663 LL2266f BS 664 664 LL2266r BS 665 665 LL2266f BS 666 668 LL2266r BS 669 676 LL2266f BS 677 679 LL2266r BS 680 692 LL2266f BS 693 693 LL2266r BS 694 694 LL2266f BS 695 695 LL2266r BS 696 720 LL2266f BS 721 722 LL2266r BS 723 724 LL2266f BS 725 727 LL2266r BS 728 732 LL2266f BS 733 733 LL2266r BS 734 738 LL2266f BS 739 740 LL2266r BS 741 745 LL2266f BS 746 748 LL2266r BS 749 750 LL2266f BS 751 751 LL2266r BS 752 754 LL2266f BS 756 762 LL2266r BS 763 764 LL2266f BS 765 773 LL2266r BS 774 775 LL2266f BS 776 776 LL2266r BS 777 784 LL2266f BS 785 788 LL2266r BS 789 796 LL2266f RD LL2266f 796 0 0 ctttagggaaccatagagactaggatgttaaatagagaagcattcattat gaagagaccacatattatagagaagcccaaatcaaaccaaagcaaagtat tcaatggcagtttttcatacagacgataacatgctctttacttataaatc ttattcatatgtttctaaaagcttattacggtcctgaatctgccaccttt ttcaaaacttctattttgtttttaagcacgccctttataaaatgcgttta tgataataggaaagtggtgctcaaaatttggcgctttggaggcctatcct cagcttcagagactctTATTTGTCTGCCgtcCAAAtcagcgccATTCAAG gttctGaTGGCAtttctggaCcttcttgAgcagagttgtacgtcacaAAC CcaaaacCcttggaACGgccactgtcCCTGTCgtagaTGACTCTAGCCTC GACAACCTTTCCTTgcTCACCGAACAAACTCTCCAGAGCCATGTCATCGA CtccCCATGAGagactgcCGACATAAACACGgcttccTGAACCAGCAGCA GCAC*CACCACCATATCCAGAACCAGAGCTTCCAAAGCTGcttCTGGgaC ctctgGaGAAAGagtcttccctcttcggnggaggaggaccagcattcact ctnaagggtctaccatccaactcatagccaccgaattgctgcgcagctgc cctaa*cttcagagaccgacgaccatagcc*cgaacccgaaccccctgcc tcgc*cctgtccctttgtcatagatcacccca*ccatcccgaaatt QA 316 613 1 796 DS CHROMAT_FILE: LL2266f PHD_FILE: LL2266f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:40 2000 RD LL2266r 816 0 0 ttttttttgcttcgacaacctttcgttggtcgcgggacaggatcccgaga gcgggggctggaactccccatgggagggaccgggcagaagcccggcttcg ggagccagcaggagcagtcaccgccagatCCAgacccaGAGCTTCCAAAG CTgcgtggaggacctctgtagagagagtctacgctgttcgggggaggagg gccagcattcactagcaagggtctagcatccaggtgaacgccattgagnt gatgagcaga*gccctcagcatcggaggcagaagac*atagtcgcgaaac 
cgaaacctctgcttcgtgcctgtcccgttgtcatagagcacctcaaccat ctacacatctccggcgctctcgaagagctgagcgagctgagcactgtacg cgttaaaggcaaggtgaccgactcagagtctgaggtcaccatcgaagtgt tgctcccgcatgcggaggcggctcagcgtcatcggagataatgctgtcat cttcttcttcttattcgctgaagtcatctgtacccgcggccgatgcgcat aaccagaaacccggcggaggaagggaagatggagagggtaatggactggg agttgagtttgaagtagacgaaagaatggaggaggagagtgcaacctggg ttcggagatggagaaagggatgcatttacggcttgaaggtgcagagagcg agggcggaagctggagcagccatgtaggccgagaagagagcggacgcgtg ggtcgacccgggaattcccggaacggtaccgcaagttgacccgagggncg cccaacagcttatcct QA -1 -1 93 351 DS CHROMAT_FILE: LL2266r PHD_FILE: LL2266r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:18 2000 CO Contig8 875 2 40 U cttttgggatacagtacaaatgatctcatgacataaactgctggcaaaac atatgtagtctcaggaggaacaaaaacaaaAGACAAGTCAAAAACTCTTT ATGGACTTGTAGCAAGCTTGTTGAAGGCGGCATGAACCCTCTCTTCAGTG AACATTCTGTGATTCTCATAGAAATCTAATATCTCCATTGCGATATTCTT CACTATGTTCATGTCCGCAGAGAGCTCTTCAAACCATGTCTTTATGTCTT TCTCTTTGTACACAGAAGCAACGTAAATGCAAGCGATCGTGATGAGATGA GGCGGGTGTGTAAGGATCAGGTCCATTTTGTAAGTGTCGTTGACAAGACC CCAAGTTAAATGGGTCATGCTTGTGTCATTGAGTCCAGAGTCCTGCAAGT ACTCAGGGAGAGAACGGTATGGGTGGAAGACAACAAGATAGAAGTTCAAA GCTTCCAAGACCTTCATCTCCATCTCCAGAATATCCTTAATCTCATACCT AAACTTCTCATCAGCATACAATTTCTTGATGTAGAAGACAAGAATCTTGG CATGGACCACACTCTCCTCTGCTTTGCAAGCCAGATACAAGCAGGTGGGA GCAACAAGACGAGGCTCATACTCTGTCAAACTCTTCCTTGTGTAAACACG GCGCATATAGGTTACAGCAGTTGCAACAACTCTTTGCCTAATCTTAATGT GTTGTGCCAGCTTCGATATGTAGTTAGACATATGAAGCTTAATGAGTTTG AAATCTTCGAGAGAGATTCCTCTCTGAGCATCAAGAGGATGAACAACATT AATCTCTTCTGGGTCCTTAAGCTCTTTGTAGTGTGTTGATgtccagaaat tggaagccattttccggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 40 40 40 40 39 39 39 39 39 39 40 40 51 51 51 51 51 40 46 40 40 40 40 40 40 40 40 40 45 51 51 51 51 51 40 40 40 40 40 45 40 40 40 40 40 35 35 35 35 35 35 39 39 40 40 40 40 66 60 60 60 63 63 55 58 58 65 69 61 59 55 55 55 60 65 67 67 69 74 71 74 69 69 70 70 69 78 76 80 82 
85 85 80 70 67 61 61 74 74 75 77 71 75 72 77 77 88 65 60 66 68 70 70 66 74 69 69 69 67 68 68 75 69 74 72 72 80 80 80 74 74 85 80 73 61 58 61 61 68 83 85 85 83 83 83 74 72 65 67 69 78 76 90 81 85 85 90 90 90 90 85 85 85 80 90 90 90 90 85 83 83 83 83 84 84 83 77 84 90 90 85 85 85 85 90 85 85 80 80 78 78 74 74 72 82 90 83 83 83 80 83 68 72 75 75 75 79 85 85 85 85 90 90 90 90 90 90 90 90 90 76 78 78 82 76 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 86 90 90 90 90 90 90 90 90 90 90 90 90 90 83 83 88 83 83 83 90 83 83 90 90 90 90 90 90 90 90 90 90 86 86 90 90 90 86 90 90 90 87 89 87 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 82 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 88 85 90 90 90 90 90 90 90 90 90 90 90 90 90 86 90 90 90 90 90 90 90 90 90 90 90 90 89 90 90 90 90 90 90 90 90 90 85 87 90 87 87 85 85 90 89 89 85 90 85 85 90 90 90 90 90 90 90 90 86 86 90 90 90 90 90 88 88 88 90 90 90 90 90 87 90 90 86 86 87 90 90 90 90 90 90 90 90 90 90 90 88 90 90 90 90 90 90 90 90 90 90 89 84 81 85 85 85 80 84 90 89 73 75 80 90 84 83 78 77 77 77 78 85 86 90 90 90 82 80 72 66 75 72 71 72 75 71 71 80 70 76 82 82 79 76 66 66 69 78 72 76 71 66 66 76 85 66 66 66 75 70 62 60 60 60 63 64 67 75 64 66 64 76 76 77 77 77 76 76 79 75 74 75 80 61 64 58 58 58 60 58 59 64 61 63 66 58 58 43 43 43 51 51 51 51 43 43 43 43 43 45 51 51 68 73 70 80 74 51 43 43 43 43 43 43 56 51 51 51 45 45 51 51 51 43 45 43 43 43 43 43 43 65 62 65 48 42 42 43 43 43 55 58 51 45 45 45 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 56 56 56 56 56 51 51 51 51 51 45 45 45 45 45 45 45 45 45 45 45 45 45 51 51 51 51 51 51 51 43 43 43 45 45 43 43 43 43 43 43 45 43 43 43 43 43 43 43 43 43 43 45 56 56 51 51 43 43 43 43 43 43 51 51 51 43 43 43 43 43 43 51 51 51 51 51 56 51 40 40 40 40 40 40 45 45 45 45 45 45 51 51 51 51 51 40 40 39 35 35 35 35 35 39 51 51 51 51 51 51 39 39 39 39 39 39 40 40 40 56 39 39 39 39 40 40 51 51 51 51 51 51 51 40 40 40 40 40 40 40 35 35 35 35 45 45 51 51 51 51 45 40 40 40 40 40 46 46 51 
51 51 40 35 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2268f U 1 AF LL2268r C 136 BS 1 264 LL2268f BS 265 265 LL2268r BS 266 291 LL2268f BS 292 298 LL2268r BS 299 305 LL2268f BS 306 306 LL2268r BS 307 315 LL2268f BS 316 317 LL2268r BS 318 324 LL2268f BS 325 325 LL2268r BS 326 326 LL2268f BS 327 327 LL2268r BS 328 337 LL2268f BS 338 338 LL2268r BS 339 341 LL2268f BS 342 342 LL2268r BS 343 344 LL2268f BS 345 352 LL2268r BS 353 354 LL2268f BS 355 360 LL2268r BS 361 363 LL2268f BS 364 379 LL2268r BS 380 386 LL2268f BS 387 387 LL2268r BS 388 393 LL2268f BS 394 403 LL2268r BS 404 404 LL2268f BS 405 440 LL2268r BS 441 444 LL2268f BS 445 450 LL2268r BS 451 451 LL2268f BS 452 464 LL2268r BS 465 465 LL2268f BS 466 512 LL2268r BS 513 513 LL2268f BS 514 616 LL2268r BS 617 621 LL2268f BS 622 645 LL2268r BS 646 656 LL2268f BS 657 875 LL2268r RD LL2268f 764 0 0 cttttgggatacagtacaaatgatctcatgacataaactgctggcaaaac atatgtagtctcaggaggaacaaaaacaaaAGACAAGTCAAAAACTCTTT ATGGACTTGTAGCAAGCTTGTTGAAGGCGGCATGAACCCTCTCTTCAGTG AACATTCTGTGATTCTCATAGAAATCTAATATCTCCATTGCGATATTCTT CACTATGTTCATGTCCGCAGAGAGCTCTTCAAACCATGTCTTTATGTCTT TCTCTTTGTACACAGAAGCAACGTAAATGCAAGCGATCGTGATGAGATGA GGCGGGTGTGTAAGGATCAGGTCCATTTTGTAAGTGTCGTTGACAAGACC CCAAGTTAAATGGGTCATGCTTGTGTCATTGAGTCCAGAGTCCTGCAAGT ACTCAGGGAGAGAACGGTATGGGTGGAAGACAACAAGATAGAAGTTCAAA GCTTCCAAGACCTTCATCTCCATCTCCAGAATATCCTTAATCTCATACCT AAACTTCTCATCAGCATACAATTTCTTGATGTAGAAGACAAGAATCTTGG CATGGACCACACTCTCCTCTGCTTTGCAAGCCAGATACAAGCAGGTGGGA GcaccacgaccagGCTCATACTCTgCcaacctCTTCCTtgcgtAAACACG GCGCATATaggctaCAGcacgctgcccacctttcctccctatctcactgc gctgcgcccacccccccccgctccctcactccccatccccaccgcccccc ccatcccccatacg QA 81 601 1 669 DS CHROMAT_FILE: LL2268f PHD_FILE: LL2268f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:50 2000 RD LL2268r 785 0 0 acCCTCTCTTCAGTGAACATTCTGTGATTCTCATAGAAATCTAATATCTC CATTGCGATATTCTTCACTATGTTCATGTCCGCAGAGAGCTCTTCAAACC ATGTCTTTATGTCTTTCTCTTTGTACACAGAAGCAACGTAAATGCAAGCG 
ATCGTGATGAGATGAGGCGGGTGTGTAAGGATCAGGTCCATTTTGTAAGT GTCGTTGACAAGACCCCAAGTTAAATGGGTCATGCTTGTGTCATTGAGTC CAGAGTCCTGCAAGTACTCAGGGAGAGAACGGTATGGGTGGAAGACAACA AGATAGAAGTTCAAAGCTTCCAAGACCTTCATCTCCATCTCCAGAATATC CTTAATCTCATACCTAAACTTCTCATCAGCATACAATTTCTTGATGTAGA AGACAAGAATCTTGGCATGGACCACACTCTCCTCTGCTTTGCAAGCCAGA TACAAGCAGGTGGGAGCAACAAGACGAGGCTCATACTCTGTCAAACTCTT CCTTGTGTAAACACGGCGCATATAGGTTACAGCAGTTGCAACAACTCTTT GCCTAATCTTAATGTGTTGTGCCAGCTTCGATATGTAGTTAGACATATGA AGCTTAATGAGTTTGAAATCTTCGAGAGAGATTCCTCTCTGAGCATCAAG AGGATGAACAACATTAATCTCTTCTGGGTCCTTAAGCTCTTTGTAGTGTG TTGATgtccagaaattggaagccattttccggacgcgtggxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxatt QA 3 705 1 740 DS CHROMAT_FILE: LL2268r PHD_FILE: LL2268r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:27 2000 CO Contig9 1201 2 34 U ccacgcgtccgtacgataaaaaaaaaaaaagaacTTACCAACTAATAACT ACAGCATTAATAAATAACCTAAGCAACTCATTGCTTCGCATTATCTGGAT CCAAAAAATTAGTCAAGATATGATAGGCTGATCATATCATTGGAGCAACT GACTAAAAAAAAAGAATAAAGAAAGATGATTATTAAGTTATGAAAGGTAA TCAAAAAGTATGCGGATAAATGGAAGGATGAAAGAAAGAGCGAAAAAATT GAATATTAATGATATATGATTCCAATTTGGAAAATCTATGAGGTCACTGT AAGAAAAGCAATGTAATAAAGCATCAATACAGATTCATTCATAATAATTA GATAAATAATAATAATTAGATAAATCTGTTCCGAAAACAAAAAATTAAGA GCCTTGAGACAATCAAAACTGAGAAAATTGCCTCAAAAAAAAAAAGAAAT TCAAAATTTCATGTAAAAGCTCCATTGTAGAATTCAGGCCTAATGATTAA TCAAGAAGCGATGGGAACGACGGAACCCATGAATATATAGGATTCTAGTG AACAAGAAATCTTAGTAATTCATTGGACAGGATGGCGGAATAAACCAGAA ACTTTATTATCTATTCTGATTTTGATTCTGAGACCTCGGGGGATAAACAG CAAACTTAAATAGATATTGAAAGAGTAAATATTCGCCGGCGAAAAATTGG TTTTTTTTTTTCAAATAAAAACAGTAATAAAAGATGAAAAAAACAATGAA AAAAAAATAAGGATTTGTTATAATATTCTAACTCTAATAAAAATTACATT TGTAATGATGATATTACGTTATTTTTAAATAAATCGAAATAAAATTGATC TTTGATTCTATTTCAAAAAAGACATACACAAATTTAGAAGAGATAAGATG AAATAAAAAAAAAATACCATGATTAATAGGATTAATCATTAACTACATCT ATATCTTAATTAATCCTTTTATTCGCGAGGAGCTGGATGAGaaGAAACTC TCACGTCCAGTtCTGCAGTAGAGATGGAATTTCTCATTTAGAAAAAACCC ATCAACTATAACCCAAAAAGAACCAAATTTCGTAAACAACATCGAGGAaG aCTAaAAGGAATATcCTCTCgtgggAATCGTATttGttttGGCAGaTATG 
CTcttCaaACACttGAACCCGCTTGGAttAcAtctagacaaatagaagca g BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 40 40 40 40 40 37 37 37 40 40 40 46 40 39 39 39 39 39 40 56 56 46 46 46 46 46 46 46 40 40 40 39 39 39 40 40 40 40 46 51 40 40 40 40 40 40 40 39 35 39 39 35 35 35 35 35 39 39 35 35 35 51 51 45 51 46 46 46 46 46 46 56 56 35 35 35 35 35 35 40 51 51 51 51 45 45 45 45 45 45 45 43 43 43 43 43 43 51 51 51 51 51 51 51 56 56 56 56 51 51 51 43 35 35 35 35 35 35 35 35 35 43 43 56 56 56 56 56 56 56 51 51 51 51 51 51 56 56 56 56 56 56 56 51 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 45 51 51 51 51 51 51 51 51 51 51 51 56 51 51 45 43 36 36 36 36 36 43 51 51 51 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 43 43 43 43 43 43 51 56 56 56 56 56 56 56 56 51 45 45 45 45 45 45 51 51 51 56 51 51 51 51 45 45 45 45 45 51 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 56 56 56 56 43 36 36 36 36 36 45 56 51 51 51 51 45 51 51 51 51 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 51 56 56 56 56 56 56 56 56 56 56 51 45 45 45 45 45 45 51 51 51 51 56 51 51 51 51 43 45 45 45 45 45 56 56 56 56 56 56 56 56 45 45 45 45 45 43 56 56 56 56 56 56 56 43 43 42 46 43 51 56 56 56 56 45 45 45 45 45 41 43 51 51 51 56 51 45 43 43 43 43 51 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 46 56 56 56 50 53 53 56 52 61 56 56 56 66 66 66 61 61 61 61 55 55 55 55 56 61 66 50 50 50 50 50 50 53 53 52 52 66 55 55 55 50 50 50 66 66 66 50 49 49 45 45 45 55 55 50 61 61 61 61 66 66 66 56 56 56 52 52 52 52 52 52 52 66 66 52 52 52 52 52 52 52 54 56 54 52 50 52 52 56 56 55 50 50 50 50 50 50 50 50 50 61 61 61 61 61 50 53 53 53 53 53 66 66 53 45 45 45 45 45 53 61 61 53 53 53 53 53 53 55 53 45 45 45 45 50 53 53 53 53 53 53 53 53 53 53 55 56 55 66 55 55 66 66 66 66 61 61 61 61 61 66 53 53 53 53 53 53 61 61 61 66 66 61 61 61 61 61 55 55 55 55 55 55 53 53 53 53 53 53 61 
55 55 53 55 55 61 61 61 61 61 61 61 61 61 61 61 61 66 66 66 66 66 53 53 53 53 53 53 53 55 55 55 55 55 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 56 56 42 42 42 44 43 51 51 61 53 50 50 50 53 48 56 42 42 56 56 56 56 56 66 56 56 56 56 56 56 56 43 38 38 38 38 38 42 42 50 56 56 56 56 56 56 56 56 51 51 43 43 56 50 50 44 44 42 46 44 56 51 51 51 51 44 43 42 42 43 41 41 41 41 41 51 56 56 56 56 56 56 56 43 43 42 41 41 41 50 50 42 42 46 46 42 42 50 42 56 56 56 56 56 56 43 43 43 43 43 35 40 40 40 40 40 51 51 51 51 51 51 46 40 43 42 42 42 46 56 56 56 56 56 50 42 42 42 42 43 43 56 56 51 43 44 43 42 42 43 46 43 42 42 42 56 56 56 56 56 40 40 40 40 40 42 44 44 44 44 48 48 56 56 56 56 56 56 56 56 47 56 56 56 42 46 46 46 42 42 56 56 44 44 44 56 56 56 56 56 56 56 56 56 56 42 42 42 46 42 42 42 40 40 45 45 45 45 51 51 51 40 40 40 40 40 40 51 45 40 40 40 40 40 40 40 35 40 40 40 46 46 44 48 48 48 40 37 34 32 34 40 40 40 40 40 48 48 40 40 37 37 40 35 29 31 31 34 34 37 40 40 40 40 40 34 34 40 37 37 28 25 18 16 21 26 29 29 32 29 29 32 32 32 32 29 29 24 25 29 29 25 19 24 25 29 32 33 32 37 27 27 24 33 33 35 35 40 40 40 40 48 33 33 33 26 26 29 22 22 24 24 24 48 48 32 25 25 25 25 24 37 40 48 29 20 24 24 24 25 24 25 29 32 32 32 26 29 29 29 25 20 22 25 27 27 27 27 29 29 25 29 32 28 24 24 29 29 29 29 37 40 40 40 27 27 25 25 22 18 21 17 22 22 22 19 24 24 24 25 22 24 25 26 25 19 25 24 23 24 21 10 10 10 17 18 25 25 29 29 29 24 24 24 19 19 21 16 16 19 19 29 25 29 21 21 18 22 22 23 29 27 23 18 12 17 20 17 18 23 25 24 21 19 18 22 22 22 22 22 24 25 25 21 22 22 25 21 17 14 21 18 20 18 18 18 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2271r U -45 AF LL2276r U 409 BS 1 468 LL2271r BS 469 473 LL2276r BS 474 477 LL2271r BS 478 481 LL2276r BS 482 492 LL2271r BS 493 493 LL2276r BS 494 495 LL2271r BS 496 501 LL2276r BS 502 515 LL2271r BS 516 516 LL2276r BS 517 518 LL2271r BS 519 521 LL2276r BS 522 531 LL2271r BS 532 534 LL2276r BS 535 551 LL2271r BS 552 552 LL2276r BS 553 554 LL2271r BS 555 555 LL2276r BS 556 559 
LL2271r BS 560 603 LL2276r BS 604 604 LL2271r BS 605 615 LL2276r BS 616 616 LL2271r BS 617 617 LL2276r BS 618 618 LL2271r BS 619 620 LL2276r BS 621 621 LL2271r BS 622 623 LL2276r BS 624 624 LL2271r BS 625 717 LL2276r BS 718 725 LL2271r BS 726 732 LL2276r BS 733 733 LL2271r BS 734 1201 LL2276r RD LL2271r 785 0 0 actxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccac gcgtccgtacgataaaaaaaaaaaaagaacTTACCAACTAATAACTACAG CATTAATAAATAACCTAAGCAACTCATTGCTTCGCATTATCTGGATCCAA AAAATTAGTCAAGATATGATAGGCTGATCATATCATTGGAGCAACTGACT AAAAAAAAAGAATAAAGAAAGATGATTATTAAGTTATGAAAGGTAATCAA AAAGTATGCGGATAAATGGAAGGATGAAAGAAAGAGCGAAAAAATTGAAT ATTAATGATATATGATTCCAATTTGGAAAATCTATGAGGTCACTGTAAGA AAAGCAATGTAATAAAGCATCAATACAGATTCATTCATAATAATTAGATA AATAATAATAATTAGATAAATCTGTTCCGAAAACAAAAAATTAAGAGCCT TGAGACAATCAAAACTGAGAAAATTGCCTCAAAAAAAAAAAGAAATTCAA AATTTCATGTAAAAGCTCCATTGTAGAATTCAGGCCTAATGATTAATCAA GAAGCGATGGGAACGACGGAACCCATGAATATATAGGATTCTAGTGAACA AGAAATCTTAGTAATTCATTGGACAGGATGGCGGAATAAACCAGAAACTT TATTATCTATTCTGATTTTGATTCTGAGACCTCGGGGGATAAACAGCAAA CTTAAATAGATATTGAAAGAGTAAATATTCGCCGGCGAAAAATTGGTTTT TTTTTTTCAAATAAAAACAGTAATAAAAGATGAAA QA 81 756 47 785 DS CHROMAT_FILE: LL2271r PHD_FILE: LL2271r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:42 2000 RD LL2276r 793 0 0 atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccgAAAAGCTCCATTGTAGAATTCAGGCCTAATGATTAATCAAGAAG CGATGGGAACGACGGAACCCATGAATATATAGGATTCTAGTGAACAAGAA ATCTTAGTAATTCATTGGACAGGATGGCGGAATAAACCAGAAACTTTATT ATCTATTCTGATTTTGATTCTGAGACCTCGGGGGATAAACAGCAAACTTA AATAGATATTGAAAGAGTAAATATTCGCCGGCGAAAAATTGGTTTTTTTT TTTCAAATAAAAACAGTAATAAAAGATGAAAAAAACAATGAAAAAAAAAT AAGGATTTGTTATAATATTCTAACTCTAATAAAAATTACATTTGTAATGA TGATATTACGTTATTTTTAAATAAATCGAAATAAAATTGATCTTTGATTC TATTTCAAAAAAGACATACACAAATTTAGAAGAGATAAGATGAAATAAAA AAAAAATACCATGATTAATAGGATTAATCATTAACTACATCTATATCTTA ATTAATCCTTTTATTCGCGAGGAGCTGGATGAGaaGAAACTCTCACGTCC AGTtCTGCAGTAGAGATGGAATTTCTCATTTAGAAAAAACCCATCAACTA TAACCCAAAAAGAACCAAATTTCGTAAACAACATCGAGGAaGaCTAaAAG 
GAATATcCTCTCgtgggAATCGTATttGttttGGCAGaTATGCTcttCaa ACACttGAACCCGCTTGGAttAcAtctagacaaatagaagcag QA 57 777 57 793 DS CHROMAT_FILE: LL2276r PHD_FILE: LL2276r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:28 2000 CO Contig10 1053 2 32 U cattgttttcttcaagaatcaagatcctttacatgataagttccaagaag agggaacttgagcgccccccaccccccactCAAGAACCACAAAGTTACAA ACCACTAGAGCCTTACATATTAGTCTGTCTCTTTCCACACCATTAAACAA GCAAACAAGCAGTGTAGGAAAGATAAAGAGAAGTGGTTTTAGAAAGAAAA ATAAGTTGTTTACTTGCGGTGTGTGTATTGTTATAGTGGGGCAAAAGGCT CAGGGAGCAGCTCGAAACCTCCATTATCAAACATGAGTCCATGTTCTGAC GACCTATCATCAATGCTATGCAACCAGTTTGTGTTCTTCACGTCTTCTTT CAACAGAAACGCGTCATACCTTGAGCTCTCATACATATTCATATCGCTCG ACTTCCCCATCAGTCTTCTTCCCTCGTTTGTTTCGTTCCCGTTCTCTGCG CTGTTGCTGCTGATAGAATCCGAAAGGCTATTCACTGAACGGATTGGCTG TATCTTGGACGAGGCTTGTCTGATGCTCTGAGGGAACAACGTCGTGCTTG ATAGTGCGCAACCGTTACTCTTCCCGTTTCTTATGTCCATGTGCTTAATG GCCATATCAAGTGAACTTTTTGATAATGACCTCCCAAGCCCGTTGTTATT GTCCATCACAGTTGAAGAAGTCCTCACGGTTCTCCGTGAAGTTACCTCTG AAACATTTGAAATCCTTCTCGGCTCTGGTGCATATATGTTATGTTGCTGT CCATTACCACTCAAACGGCCTTTTCCTTGGCTCTCTATGAGTCTTCCTCT CGTCACAACAGGAGatgnnagtTCCTTCTCGTCATGGGACCTTTGGGTTC TGGACTTGCCTTTGCTGCAACAGGCCTAGACCTGCCAGCTGATATCGGTC TTCCCGGGAGACTTGTTCTGAGGTTAGGTGGTGCATCAAGAGAGAAGTCT GGTAGCACAATTGGCTGTTGTGTGTTGGTGGTGGTGGTGGTTCGGACTCT AGGTTGTCCAGGAGAGCTcggcctagacaatgaaggtgcgggcggacgcg tgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 35 35 37 37 37 35 35 37 37 40 40 42 42 37 37 37 37 37 39 43 37 37 35 35 32 32 32 32 35 42 42 35 35 35 35 35 35 37 35 28 35 35 28 28 33 37 35 36 36 35 35 35 46 35 40 40 40 38 35 42 42 37 37 40 40 40 40 40 40 40 40 43 45 43 56 43 45 41 41 41 41 41 35 35 40 40 37 37 43 56 56 37 37 38 43 43 43 56 56 56 50 50 44 44 44 50 50 51 46 46 42 42 42 56 56 56 56 56 56 43 43 41 41 45 45 42 42 44 46 42 38 38 38 38 38 35 35 35 32 35 35 50 42 42 42 41 41 41 41 35 38 38 40 40 40 35 35 44 40 40 40 40 40 43 56 42 38 31 31 31 35 35 46 
56 56 56 43 40 36 32 32 32 36 36 36 36 36 32 36 31 36 35 35 42 50 50 44 44 42 43 42 42 42 42 50 44 50 50 50 44 44 44 50 42 42 42 35 36 36 36 36 36 38 41 41 42 42 44 44 44 50 42 42 32 35 38 38 38 38 41 41 41 43 42 43 43 50 50 44 44 44 44 44 46 42 42 42 35 35 35 36 36 36 38 42 42 42 42 46 42 42 42 42 42 41 41 44 45 40 40 40 40 38 38 35 44 42 42 42 42 42 42 42 41 41 41 41 48 48 53 52 57 56 56 56 48 48 50 49 51 49 57 41 32 35 33 37 35 37 41 42 42 42 46 46 50 50 50 50 44 57 50 56 56 56 56 50 50 50 50 50 44 44 44 50 50 50 52 50 50 53 56 51 43 43 42 42 40 40 40 40 40 36 52 52 45 39 46 44 46 46 48 52 47 47 51 51 51 51 51 43 48 50 62 54 57 53 60 50 48 45 45 45 48 52 45 40 45 38 42 42 51 41 41 45 49 50 43 45 45 45 45 52 55 51 43 47 53 54 54 50 56 45 44 44 58 49 52 62 61 65 60 60 57 62 62 53 48 54 56 58 70 67 67 54 54 46 48 47 66 57 66 66 52 49 49 52 48 53 62 83 65 71 59 57 57 69 59 62 65 65 67 69 69 58 50 50 45 44 30 34 41 49 54 57 56 59 59 61 65 53 80 61 57 61 72 64 57 53 58 48 61 58 67 67 65 64 65 69 77 74 74 74 66 57 54 57 56 49 38 39 48 59 67 69 80 58 57 54 57 57 61 68 57 57 61 80 88 77 72 72 64 63 69 65 65 69 77 80 80 80 67 62 65 71 71 67 65 65 61 66 69 70 76 75 71 71 71 60 55 55 55 55 55 76 75 82 64 64 66 64 58 60 69 64 64 78 59 55 57 60 47 48 47 45 46 51 61 65 64 59 64 68 61 64 61 61 61 66 64 65 58 54 50 48 48 52 54 62 71 71 74 73 79 76 69 64 67 66 63 52 55 54 52 47 50 48 40 40 40 45 40 40 40 40 40 46 51 59 59 68 77 82 85 89 84 90 75 68 66 68 68 62 59 59 54 49 51 55 55 54 53 66 66 70 75 66 61 65 65 65 65 62 40 40 39 39 39 39 45 51 56 56 56 56 56 56 51 51 51 51 45 45 45 45 45 45 40 34 34 34 34 34 35 51 51 51 51 40 39 39 39 39 39 40 46 46 46 46 40 39 39 36 33 19 15 4 0 0 4 19 19 33 36 44 44 46 39 34 34 34 34 34 40 56 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 56 56 51 51 51 45 45 45 45 45 45 45 45 45 40 40 40 40 40 40 51 45 45 45 45 45 45 45 45 45 51 56 56 56 40 39 39 39 39 39 45 45 51 40 45 35 35 35 35 35 35 51 45 45 45 40 40 40 45 45 45 40 39 39 35 35 35 35 45 45 40 
40 40 40 40 40 45 45 40 40 39 39 39 39 39 39 45 45 45 45 45 45 45 45 45 45 45 40 40 40 35 35 35 35 35 35 39 40 56 40 40 40 40 40 40 45 51 51 51 51 51 51 51 51 51 51 40 40 40 40 40 40 40 40 40 40 40 40 40 46 40 40 40 40 40 45 51 51 51 51 46 40 40 35 35 40 40 40 40 51 51 51 51 42 42 37 40 40 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2272f U 1 AF LL2272r C 368 BS 1 377 LL2272f BS 378 392 LL2272r BS 393 410 LL2272f BS 411 411 LL2272r BS 412 427 LL2272f BS 428 432 LL2272r BS 433 443 LL2272f BS 444 539 LL2272r BS 540 554 LL2272f BS 555 557 LL2272r BS 558 563 LL2272f BS 564 564 LL2272r BS 565 572 LL2272f BS 573 573 LL2272r BS 574 574 LL2272f BS 575 575 LL2272r BS 576 576 LL2272f BS 577 577 LL2272r BS 578 589 LL2272f BS 590 594 LL2272r BS 595 599 LL2272f BS 600 612 LL2272r BS 613 616 LL2272f BS 617 618 LL2272r BS 619 621 LL2272f BS 622 627 LL2272r BS 628 628 LL2272f BS 629 671 LL2272r BS 672 717 LL2272f BS 718 728 LL2272r BS 729 763 LL2272f BS 764 1053 LL2272r RD LL2272f 779 0 0 cattgttttcttcaagaatcaagatcctttacatgataagttccaagaag agggaacttgagcgccccccaccccccactCAAGAACCACAAAGTTACAA ACCACTAGAGCCTTACATATTAGTCTGTCTCTTTCCACACCATTAAACAA GCAAACAAGCAGTGTAGGAAAGATAAAGAGAAGTGGTTTTAGAAAGAAAA ATAAGTTGTTTACTTGCGGTGTGTGTATTGTTATAGTGGGGCAAAAGGCT CAGGGAGCAGCTCGAAACCTCCATTATCAAACATGAGTCCATGTTCTGAC GACCTATCATCAATGCTATGCAACCAGTTTGTGTTCTTCACGTCTTCTTT CAACAGAAACGCGTCATACCTTGAGCTCTCATACATATTCATATCGCTCG ACTTCCCCATCAGTCTTCTTCCCTCGTTTGTTTCGTTCCCGTTCTCTGCG CTGTTGCTGCTGATAGAATCCGAAAGGCTATTCACTGAACGGATTGGCTG TATCTTGGACGAGGCTTGTCTGATGCTCTGAGGGAACAACGTCGTGCTTG ATAGTGCGCAACCGTTACTCTTCCCGTTTCTTATGTCCATGTGCTTAATG GCCATATCAAGTGAACTTTTTGATAATGACCTCCCAAGCCCGTTGTTATT GTCCATCACAGTTGAAGAAGTCCTCACGGTTCTCCGTGAAGTTACCTCTG AAACATTTGAAATCCTTCTcgcttcTGGTGCATATATGTTATGTTGCTGT CCATTACCACTCAAAcggncttTttcttg QA 81 671 1 779 DS CHROMAT_FILE: LL2272f PHD_FILE: LL2272f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:31 2000 RD LL2272r 731 0 0 
accttaagCTCTCATACATATTCATATcggtggact*cCCCATCAGTCTT ctcccctCGTTTGTTTCgtccccgTTCTCTGCGCTGTTGCTGCTGATAGA ATCCGAAAGGCTATTCACTGAACGGATTGGCTGTATCTTGGACGAGGCTT GTCTGATGCTCTGAGGGAACAACGTCGTGCTTGATAGTGCGCAACCGTTA CTCTTCCCGTTTCTTATGTCCATGTGCTTAATGGCCATATCAAGTGAACT TTTTGATAATGACCTCCCAAGCCCGTTGTTATTGTCCATCACAGTTGAAG AAGTCCTCACGGTTCTCCGTGAAGTTACCTCTGAAACATTTGAAATCCTT CTCGGCTCTGGTGCATATATGTTATGTTGCTGTCCATTACCACTCAAACG GCCTTTTCCTTGGCTCTCTATGAGTCTTCCTCTCGTCACAACAGGAGatg nnagtTCCTTCTCGTCATGGGACCTTTGGGTTCTGGACTTGCCTTTGCTG CAACAGGCCTAGACCTGCCAGCTGATATCGGTCTTCCCGGGAGACTTGTT CTGAGGTTAGGTGGTGCATCAAGAGAGAAGTCTGGTAGCACAATTGGCTG TTGTGTGTTGGTGGTGGTGGTGGTTCGGACTCTAGGTTGTCCAGGAGAGC Tcggcctagacaatgaaggtgcgggcggacgcgtggxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxagt QA 173 651 1 686 DS CHROMAT_FILE: LL2272r PHD_FILE: LL2272r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:09 2000 CO Contig11 735 2 105 U ccacgcgtccgcttttttttttttttttttttttTTTTTCTCATAAAACC TCTGTTTATatctTGAACAAAGTTACATTATAAGATGAGATGATGAGTGA TGAGAAAGGAGAAAAAGATTTCTAAGAGGAGGAGAAGGTATCGATGATGG TGGTGTGAAGGGGATCACTGAGGTGAGTAGCCCAATTGTTGAGTGGGCCT TTACCAGTAGCAGCGGCTTGAACAGCGAAGCCCAAGAAGGCGACCATGGC AAGACGAGCGTGCTTGATCTCAGCTAACTGAAGCTGAGCCATCTTCTCAG GGTCAGACGCTAAACCAAGCGGGTCGAAGAATTTGCCGCCAGGGTACAAA CGCTTCTCAGAGTCAAGCTCAGCGTTGCGCTGGAACTCGATGTAGCCGAT CACTAACACCTCGATCCATATCAACGTCGAGATGGAGAAGGGTAGTGGCT GCCCCAAGTAGGACGATCCATCCACTAGCTCCACCTTTCCGGCGTCTTGC CAAGTGACGCCGGTGAGCCAtTcGACGGagaGAGCGCCGAGAGTGGCGAG CATGGCCCACCGTCCGTGAATCAGCTCGCATTCTCTGAACCtCTGGATCC CGAACACCtCGCTGTACGGCTGAAACGGAGTcgaTtTGGGGGCCgCCTCt tcGGTACGGGttcCGATCACGtctccggctaagttcttcGccaGaTtCtg gtcTagtgaatctatatCgaaCTggagataCTcgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 56 56 56 56 47 35 35 35 35 35 34 46 35 40 40 37 48 50 50 50 44 42 42 34 25 4 4 4 13 20 26 42 42 42 44 43 44 44 47 44 45 45 45 45 45 45 50 50 44 45 45 45 45 45 45 45 45 45 47 50 50 47 47 47 47 47 47 50 47 47 47 47 47 52 53 50 50 47 52 52 52 52 47 47 52 47 47 47 47 47 47 47 52 54 53 61 
66 66 50 50 50 52 50 56 50 45 45 45 45 45 45 52 53 52 56 66 54 54 52 52 45 50 50 50 50 50 58 56 53 50 50 35 35 35 35 37 47 53 45 45 45 47 45 47 45 45 45 54 54 52 52 52 66 60 52 52 53 53 61 61 50 50 50 50 50 50 50 50 52 53 53 50 54 53 53 53 53 45 45 45 45 45 45 45 45 53 50 50 50 45 45 45 45 45 45 52 52 52 52 54 54 60 60 48 46 46 46 46 46 46 48 50 66 66 66 66 66 48 46 48 46 46 46 55 55 48 48 48 53 50 50 55 61 61 61 61 61 53 53 53 53 46 46 46 51 51 55 55 55 51 51 51 55 53 53 55 55 51 51 51 55 55 55 53 61 61 61 54 54 52 48 46 45 46 46 46 45 45 47 60 60 60 60 48 48 48 48 56 52 51 51 51 51 45 47 47 52 52 51 51 47 46 46 46 45 46 48 56 56 52 51 51 52 52 51 51 46 53 46 46 45 45 45 46 45 45 45 45 52 54 60 51 48 48 42 42 41 45 51 51 51 51 47 46 46 42 42 42 47 46 45 46 43 43 45 52 52 47 54 45 45 45 60 60 52 56 51 51 51 55 53 53 51 52 52 52 45 47 52 47 56 60 66 66 56 52 47 60 52 56 54 56 56 52 52 44 47 47 52 46 56 32 32 32 47 58 66 52 52 47 47 47 45 45 42 42 45 45 55 50 53 52 56 56 56 56 50 45 45 45 45 45 45 52 52 54 54 52 49 50 49 49 41 43 45 48 45 45 48 47 49 48 56 52 52 53 53 52 52 42 34 34 34 42 42 56 56 42 52 44 43 33 39 37 34 32 30 36 42 42 42 37 40 40 33 22 19 25 16 24 34 41 34 34 15 15 15 35 30 32 34 39 39 32 35 32 32 32 43 39 35 42 34 39 39 39 40 50 50 57 57 47 43 42 39 34 33 23 28 20 20 24 26 26 31 32 31 33 21 21 21 29 25 29 29 29 30 38 38 36 36 54 43 43 50 41 31 18 20 32 27 25 34 40 29 29 21 24 33 35 38 39 32 29 18 25 25 30 30 33 34 34 35 35 29 29 28 32 29 35 35 33 29 29 33 29 21 18 15 15 20 18 25 29 40 48 40 32 29 22 19 25 23 23 21 15 15 17 24 27 26 29 25 24 26 22 18 15 18 23 25 25 24 23 25 24 20 17 18 18 15 15 18 11 15 10 9 8 19 15 9 12 9 11 15 21 15 18 18 21 16 23 14 20 14 15 14 17 15 21 14 13 14 14 17 17 16 16 13 18 11 12 11 21 19 18 19 24 25 13 15 12 12 11 16 18 23 20 18 18 18 AF LL2273r U -45 AF LL2273f U 36 BS 1 50 LL2273r BS 51 63 LL2273f BS 64 64 LL2273r BS 65 65 LL2273f BS 66 68 LL2273r BS 69 69 LL2273f BS 70 70 LL2273r BS 71 72 LL2273f BS 73 73 LL2273r BS 
74 83 LL2273f BS 84 93 LL2273r BS 94 107 LL2273f BS 108 108 LL2273r BS 109 112 LL2273f BS 113 116 LL2273r BS 117 118 LL2273f BS 119 119 LL2273r BS 120 135 LL2273f BS 136 136 LL2273r BS 137 137 LL2273f BS 138 138 LL2273r BS 139 145 LL2273f BS 146 146 LL2273r BS 147 155 LL2273f BS 156 161 LL2273r BS 162 175 LL2273f BS 176 176 LL2273r BS 177 177 LL2273f BS 178 178 LL2273r BS 179 181 LL2273f BS 182 184 LL2273r BS 185 202 LL2273f BS 203 203 LL2273r BS 204 206 LL2273f BS 207 207 LL2273r BS 208 312 LL2273f BS 313 317 LL2273r BS 318 321 LL2273f BS 322 323 LL2273r BS 324 332 LL2273f BS 333 335 LL2273r BS 336 339 LL2273f BS 340 341 LL2273r BS 342 343 LL2273f BS 344 344 LL2273r BS 345 346 LL2273f BS 347 350 LL2273r BS 351 354 LL2273f BS 355 356 LL2273r BS 357 365 LL2273f BS 366 366 LL2273r BS 367 370 LL2273f BS 371 377 LL2273r BS 378 382 LL2273f BS 383 383 LL2273r BS 384 384 LL2273f BS 385 385 LL2273r BS 386 386 LL2273f BS 387 391 LL2273r BS 392 392 LL2273f BS 393 393 LL2273r BS 394 396 LL2273f BS 397 399 LL2273r BS 400 407 LL2273f BS 408 414 LL2273r BS 415 421 LL2273f BS 422 432 LL2273r BS 433 434 LL2273f BS 435 435 LL2273r BS 436 439 LL2273f BS 440 441 LL2273r BS 442 450 LL2273f BS 451 451 LL2273r BS 452 452 LL2273f BS 453 453 LL2273r BS 454 454 LL2273f BS 455 466 LL2273r BS 467 468 LL2273f BS 469 471 LL2273r BS 472 476 LL2273f BS 477 483 LL2273r BS 484 485 LL2273f BS 486 487 LL2273r BS 488 489 LL2273f BS 490 507 LL2273r BS 508 510 LL2273f BS 511 517 LL2273r BS 518 520 LL2273f BS 521 528 LL2273r BS 529 533 LL2273f BS 534 557 LL2273r BS 558 560 LL2273f BS 561 581 LL2273r BS 582 590 LL2273f BS 591 593 LL2273r BS 594 596 LL2273f BS 597 602 LL2273r BS 603 607 LL2273f BS 608 645 LL2273r BS 646 651 LL2273f BS 652 663 LL2273r BS 664 666 LL2273f BS 667 669 LL2273r BS 670 670 LL2273f BS 671 735 LL2273r RD LL2273r 781 0 0 aatxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccac gcgtccgcttttttttttttttttttttttTTTTTCTCATAAAACCTCTG TTTatat*tTGAACAAAGTTACATTATAAGATGAGATGATGAGTGATGAG 
AAAGGAGAAAAAGATTTCTAAGAGGAGGAGAAGGTATCGATGATGGTGGT GTGAAGGGGATcattGAGGTGAGTAGCCCAATTGTTGAGTGGGCCTTTAC CAGTAGCAGCGGCTTGAACAGCGAAGCCCAAGAAGGCGACCATGGCAAGA CGAGCGTGCTTGATCTCAGCTAACTGAAGCTGAGCCATCTTCTCAGGGTC AGACGCTAAACCAAGCGGGTCGAAGAATTTGCCGCCAGGGTACAAACGCT TCTCAGAGTCAAGCTCAGCGTTGCGCTGGAACTCGATGTAGCCGATCACT AACACCTCGATCCATATCAACGTCGAGATGGAGAAGGGTAGTGGCTGCCC CAAGTAGGACGATCCATCCACTAGCTCCACCTTTCCGGCGTCTTGCCAAG TGACGCCGGTGAGCCAtTcGACGGagaGAGCGCCGAGAGTGGCGAGCATG GCCCACCGTCCGTGAATCAGCTCGCATTCTCTGAACCtCtGGATCCCGAA cACCtCGCTGTACGGCTGAAACGGAGTcgaTtTGGGGGCCgcctcttcGG TACGGGttccGATCAcGtctccggctaagttcttcGccaGaTtCtggtcT agtgaatctatatCgaaCTggagataCTcgg QA 81 781 47 781 DS CHROMAT_FILE: LL2273r PHD_FILE: LL2273r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:14 2000 RD LL2273f 767 0 0 cttaaccggaaccccTCTGTTTATatcttGAACAAAGTTACATTATAAGA TGAGATGATGAGTGATGAGAAAGGAGAAAAAGATTTCTAAGAGGAGGAGA AGGTATCGATGATGGTGGTGTGAAGGGGATCACTGAGGTGAGTAGCCCAA TTGTTGAGTGGGCCTTTACCAGTAGCAGCGGCTTGAACAGCGAAGCCCAA GAAGGCGACCATGGCAAGACGAGCGTGCTTGATCTCAGCTAACTGAAGCT GAGCCATCTTCTCAGGGTCAGACGCTAAACCAAGCGGGTCGAAGAATTTG CCGCCAGGGTACAAACGCTTCTCAGAGTCAAGCTCAGCGTTGCGCTGGAA CTCGATGTAGCCGATCACTAACACCTCGATCCATATCAACGTCGAGATGG AGAAGGGTAGTGGCTGCCCCAAGTAGGACGATCCATCCACTAGCTCCACC TTTCCGGCgtttTgtcaAGTGACGCcggcGAGCCAtccgACGGagaGAGC GCCGAGAGTGGCGAGCATGGCCCACCgtncgcgaATCacctCGCATTCTC TGAACCtCTGGattccgAACACCtccctgcacGGCTGAAACGGAgttcat tcgcgggtcgCCTCttcGGTAcggcttcCGATCACgttctcggctccctc ctttcccacactctgccccactcactcacatcccattcgttcattttgcc cgcttcacctactttataggctcaatctgtctttttcccttatcgccctc tcccctctcttgttacc QA 28 526 14 637 DS CHROMAT_FILE: LL2273f PHD_FILE: LL2273f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:36 2000 CO Contig12 771 2 7 U agcgcacacccaccatatgatctttaagggaatgacttaacaactgttgg taatagttttgttgcttctcatataaCCCAATAGAAAAAAAAATTAATCA TCCCGCTTATGGAAAATTAACATGCTTTCAGTTTTATTATAAAAAGTTTC TAAGCATTAGCTCCAATGGGAACATTCTTGAGTCTTGATGTTCCGTAAAG GGTTTCTTCAAAGCGGATGATCTCGTTCTTTGAGCCATAAGCAACTTGAG 
TTCTATCGTCGAGGTTGACGATGGTCTTGTCGAGCACAGACTTGTATCCA TCACTGCTGAATCCACCAGCATTCTCAATGAGTAGGCCTAAAGGAGCCAC TTCGAACAAGAGTCTCAACTTTGCCTTAGCCGTAGGAGAAGTCACGTTTG TGAAGATTCCTTTCTCCTTCACAATAATCTGGTTAACGTCAGGAACCATT CCTCCTGTATATCTCAATGTGTACTTCTCTTTCACGTAGTAATCAATCAG CTTGCTGTATTCAGAATTGTCAAACGTGGCTCTCAAGTTTCCTGGTGAGA ACATTTTCCCTTCATTAATCTctgTGGTCTCCTTAACATGCTGCCATTTA CCTTCATCAAGA*AGCAAGAAC*TCATGAG*TTCC*TGGAAACCCTTTAA CAGCCAAAACATAAGTGgttcTTGGACCATagATTccCATGgCTGCAGcc ACttGATCTcctcccgTGACTccggTtaaCTTATCACCAGGCCAAACCGg ACgcgtGggCggacgcgtggc BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 40 40 39 37 35 35 35 37 37 40 46 51 56 56 56 56 40 40 37 40 40 35 35 35 35 35 35 35 35 35 35 35 35 35 39 51 51 51 51 35 35 35 35 35 35 37 35 39 35 35 35 35 40 37 37 37 35 35 35 38 38 38 38 38 39 40 40 56 56 50 42 42 40 40 40 40 40 40 42 37 37 40 35 35 35 35 35 43 40 51 51 56 56 51 51 51 51 51 43 41 45 45 45 51 51 45 45 45 45 45 43 43 43 42 42 38 43 43 40 41 45 56 56 56 56 56 40 38 43 43 36 43 43 51 51 45 45 45 45 45 45 56 56 51 51 45 43 43 38 38 38 41 41 51 56 56 56 46 46 43 43 42 46 56 56 56 43 43 43 43 43 43 56 51 45 43 43 43 43 43 43 45 45 45 51 51 51 56 56 43 43 43 43 43 43 43 51 56 56 56 51 51 45 45 45 45 45 43 43 43 43 43 43 43 43 43 43 45 51 51 51 51 51 51 51 43 43 43 43 43 43 43 43 43 56 56 46 43 43 43 43 43 51 51 51 51 45 45 45 45 45 51 56 56 56 56 51 45 45 45 45 45 45 51 51 56 56 56 51 51 43 43 43 43 43 43 43 51 51 51 56 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 56 56 56 50 50 46 46 51 51 51 51 51 51 46 46 50 50 50 56 56 45 45 45 45 45 51 56 56 56 56 56 56 51 45 45 45 45 41 41 56 46 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 56 56 56 45 45 51 43 43 43 51 51 51 56 56 56 51 45 45 45 45 40 40 51 51 51 51 51 56 56 56 56 56 45 45 45 45 45 45 56 56 50 50 50 45 45 40 45 40 42 56 56 56 56 56 51 51 51 51 51 51 56 56 56 56 48 56 44 44 42 43 45 45 45 37 37 37 37 37 37 40 56 56 56 56 56 
56 42 46 46 46 40 40 42 46 45 40 51 42 42 35 35 35 35 35 37 43 46 56 56 42 42 40 45 45 45 48 42 42 42 44 42 42 42 42 42 42 42 56 56 42 46 42 40 38 35 35 38 40 40 38 42 37 48 16 16 11 21 21 40 40 40 44 40 40 40 34 32 29 34 48 32 32 29 29 25 25 27 29 33 33 34 34 34 32 29 31 33 32 40 40 29 29 29 29 29 29 40 40 40 34 22 24 25 27 32 32 48 40 37 29 25 28 25 29 27 34 34 27 32 32 32 32 32 32 32 29 25 24 29 32 38 41 38 38 35 42 39 37 27 26 25 20 25 24 20 15 10 10 17 44 46 50 48 70 45 42 40 24 18 18 22 24 23 18 18 27 40 29 22 19 23 25 25 25 25 24 19 15 21 23 18 18 25 32 40 26 25 17 15 19 15 10 15 17 26 22 24 21 25 18 13 15 12 20 16 19 19 32 32 24 25 29 27 29 27 27 25 22 22 25 24 20 22 25 27 25 26 18 24 21 18 18 18 18 22 13 13 21 18 18 17 17 13 11 19 15 0 0 0 AF LL2274f U -3 AF LL2274r C 4 BS 1 652 LL2274f BS 653 660 LL2274r BS 661 661 LL2274f BS 662 663 LL2274r BS 664 671 LL2274f BS 672 679 LL2274r BS 680 771 LL2274f RD LL2274f 776 0 0 ctgnagcgcacacccaccatatgatctttaagggaatgacttaacaactg ttggtaatagttttgttgcttctcatataaCCCAATAGAAAAAAAAATTA ATCATCCCGCTTATGGAAAATTAACATGCTTTCAGTTTTATTATAAAAAG TTTCTAAGCATTAGCTCCAATGGGAACATTCTTGAGTCTTGATGTTCCGT AAAGGGTTTCTTCAAAGCGGATGATCTCGTTCTTTGAGCCATAAGCAACT TGAGTTCTATCGTCGAGGTTGACGATGGTCTTGTCGAGCACAGACTTGTA TCCATCACTGCTGAATCCACCAGCATTCTCAATGAGTAGGCCTAAAGGAG CCACTTCGAACAAGAGTCTCAACTTTGCCTTAGCCGTAGGAGAAGTCACG TTTGTGAAGATTCCTTTCTCCTTCACAATAATCTGGTTAACGTCAGGAAC CATTCCTCCTGTATATCTCAATGTGTACTTCTCTTTCACGTAGTAATCAA TCAGCTTGCTGTATTCAGAATTGTCAAACGTGGCTCTCAAGTTTCCTGGT GAGAACATTTTCCCTTCATTAATCTctgTGGTCTCCTTAACATGCTGCCA TTTACCTTCATCAAGA*AGCAAGAAC*TCATGAG*TTCC*TGGAAACCCT TTAACAGCCAAAACATAAGTGgttcTTgGACCATagATTccCATGgCTGC AGccACttGATCTcctcccgTGACTccggTtaaCTTATCACCAGGCCAAA CCGgACgcgtGggCggacgcgtggcn QA 81 772 5 775 DS CHROMAT_FILE: LL2274f PHD_FILE: LL2274f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:41 2000 RD LL2274r 866 0 0 atataactcttagaaacaacaattcagcagtacggacatggagaatcagc atgcgctcagtcggaccatacacagagtccaatcataggctgcaatgaga 
acagtatttagttgtgaagtgccgaacaagggattctcaaaacggacgat ctcgttcttgaagccataagcagcttgagctcgatcgtcgagtgtaacga cgctctcatcgagcgcataggtagtatccgacaccgaataagccaccacc actcgcaaaggaagccttaaaatagccgctgatacaaagctctccactat gcttcaccaggtagcatgaagcgcgcgctttgatgacagtcatcacgttc ctaccttgcgagcagagtatactactagagtgtatactgtcccagcgaga gcctatntccatacgctgtcactatgcacctaagcgtcgttacagtgctc aatttcccaggtagctaactcaatcacagcantgtagtaatcagatgatg tcggtcagtggctagctaaggttgtctgagagatgagcaattatctcctc atacatctagttgtattgtggtcgccatcaacattggagacatcatacat tcatcagaatagcaagaacgtcatgaggttccgtgtaatccgtttaACAG CCAAAACATAagaga*tcTTGGACCATacaatgccatgggggcaagcact *gctctcctccatgttgcaccggttaacaaaactaccagcccaatccatc cgtgtagccggaagcgtgctcgaacgcggagacgaacgcgcgggcggacg cgtgggtcgacccgggaatcccggaacggtaccgcaagtacccngagggn gnnnaaaaggttatac QA -1 -1 596 711 DS CHROMAT_FILE: LL2274r PHD_FILE: LL2274r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:19 2000 CO Contig13 319 2 1 U cttTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTAAATTGTAAACCAAAAAATGATATTACATTACAT GTGTTAACATATTTCTACTAGAAATGACATCACTAGAACTCGAATAGACT TAAAATTAAAGAAAAGATGAATGACTGTTATTTACCATACAACGCAAGCG CAAGAGAATTGTTTTCCATGTATTCATACACGAGTAGCAATTGATTCTTC TCGACACAACATCCATAAAGCTTGACAAGGTTTGGATGGTTCAACCCTGA GATCATTCCGGACGCGTGG BQ 15 15 15 36 46 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 51 45 45 45 45 45 45 45 56 71 71 71 71 71 71 71 71 66 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 83 82 83 78 83 90 85 90 90 90 90 85 84 84 84 84 84 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 83 83 83 83 83 83 90 85 90 90 90 90 90 84 90 90 85 85 80 80 80 80 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 80 80 86 86 86 86 90 84 84 84 85 85 85 90 82 70 70 70 70 70 70 69 69 69 69 72 75 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 78 78 78 82 82 82 83 83 83 90 83 83 83 83 78 78 78 78 78 78 78 82 83 83 83 89 89 83 90 
90 89 82 82 82 82 82 83 90 90 90 90 90 90 90 90 90 90 90 90 86 83 88 78 78 76 76 83 83 89 83 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 80 80 78 78 83 88 90 90 90 49 49 49 34 33 32 AF LL2275r C -406 AF LL2275f U 63 BS 1 319 LL2275r RD LL2275r 771 0 5 aaagggagcccccgatttagagcttgacggggaaagccggcgaacgtggc gagaaaggaagggaagaaagcgaaaggagcgggcgctagggcgctggcaa gtgtagcggtcacgctgcgggtaaccaccacacccgccgcgcttaatgcg ccgctacagggcgcgtcccattcgccattcagggtgcgcaactgttggga agggcgatcggtgcgggccttttcgctattacgccagctggcgaaagggg gatgtgctgcaaggcgattaagttgggtaacgccagggttttxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxcttTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTTTTTTTAAATTGTAAACCAAAAAATGATATTAC ATTACATGTGTTAACATATTTCTACTAGAAATGACATCACTAGAACTCGA ATAGACTTAAAATTAAAGAAAAGATGAATGACTGTTATTTACCATACAAC GCAAGCGCAAGAGAATTGTTTTCCATGTATTCATACACGAGTAGCAATTG ATTCTTCTCGACACAACATCCATAAAGCTTGACAAGGTTTGGATGGTTCA ACCCTGAGATCATTCCGGACGCGTGGxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxatt QA 1 726 408 726 DS CHROMAT_FILE: LL2275r PHD_FILE: LL2275r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:23 2000 RT{ LL2275r chimera phrap 1 292 000919:094547 } RT{ LL2275r matchElsewhereHighQual phrap 1 119 000919:094547 } RT{ LL2275r matchElsewhereHighQual phrap 121 183 000919:094547 } RT{ LL2275r matchElsewhereHighQual phrap 185 236 000919:094547 } RT{ LL2275r matchElsewhereHighQual phrap 222 292 000919:094547 } RD LL2275f 767 0 2 ctttaaggtgacaaCCAAAAAATGATATTACATTACATGTGTTAACATAT TTCTACTAGAAATGACATCACTAGAACTCGAATAGACTTAAAATTAAAGA AAAGATGAATGACTGTTATTTACCATACAACGCAAGCGCAAGAGAATTGT TTTCCATGTATTCATACACGAGTAGCAATTGATTCTTCTCGACACAACAT CCATAAAGCTTGACAAGGTTTGGATGGTTCAACCCTGAGATCATTCCGGA CGCGTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxGCCTGGGGTGCCTAATGAGTGAGCTAACTC 
ACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTC GTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGC GTATTGGGCGCCAGGGTGGTTTTTCTTTTCACCAGTGAGACGGGCAACAG CTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCcA CGCTGGTTTGCCCCAGCAGGCGAAAATCCTGTTTGATGGTGgttaACGGC GGGATATAACATGAGCTGTCTTCGGTATCGTCGTATCCCACTACCGAGAT ATCCGCACCAACGCGCA QA 15 767 13 257 DS CHROMAT_FILE: LL2275f PHD_FILE: LL2275f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:46 2000 RT{ LL2275f chimera phrap 421 767 000919:094547 } RT{ LL2275f matchElsewhereHighQual phrap 421 767 000919:094547 } CO Contig14 1056 2 32 U ccttttcttagaagcatatgataaaaccattgtagtatatcaagaagatg aagatggaagtttgttttcacagtgtcaatAGAAATTGTCATCAAACAAA GTCTACAAAGCTCTTTAATAAACTAGAAATGGCTTAAAGTGTAGGAGCTC TCTCAGCGGTTCCAGCTATGACGGTAAGCAAATTGTTTCCAAAAGGATCA CTGAGATGCTTTGAAAGGTTCTCAACAGGACCTTCTCCAGTAACATAAGC TTGGATGAAGAAACCGAGCATCGCAAACATGGCTAATCTCCCGTTCTTGA TCTCTTTAACCTTGAGGAGAGCTCCTTGTTCGGGGTCCTTAGCAAGTCCT AAAGGATCAAATGGACCTCCGGGGTGTAGCTTGTCCTCAAAATCCAATCC GTTGGTGATTCTGTAGTACTCGGCTCCACCGAGGAGAACAACCTCAGCAA CTACGGCGAGAACAAGGTTGATTGGGATGTTCTTGCCAAAGTAGTTCAAT GTGTTTCCATCAAGAAGCAGAGCACCAGTCTTGAACCAGACGGCTTCAGG ACCACAGTTAGCTCCGTATTTGTTTAAAGCTTCAGGGATGATGAAACCAG CTGCTCCCAACATAGCCCATCTCGCATGGATCAGCTCAAAGGCTTGGTAT TTAGCAAAGTTCTCAGGCTTCTTTCCAAGTCCAAATGGGTCATAGCCATA GTCTCCAGCAACTTCACCGTTTAAGTACTCTGGGATCTCTGATCTGTCCA AAAGACCATCCGGCAGGAAAACTCTCCTGTCAGGACCATACCACTTGGCG AGTTCATCGTTAGCAACAGAGACGGTTTTGGCCTTGGCAGGAGCTGGCTT TTTCTTGGAGAACAGAGCAACTGTCTTAAACGTAGCCGGGCTCGATGCCG ATGGAGCAGACGATCTTGACACTGCCTTGAAGTTAAGTGGCTTACCAAGC ATTTCCGAAACACCCATAGACGCCATCTTTGTACCCCAAAGTTTCAAGCT TTAACTCTGTTTCAAAACAAtgaaatgctaagtgaaaggaaggaacggac gcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 40 40 40 51 51 46 46 46 46 46 40 40 40 46 51 51 51 51 51 51 51 39 39 39 39 39 40 51 40 39 39 39 39 35 39 39 35 35 35 39 38 35 35 35 35 35 35 40 51 39 35 35 35 35 35 35 39 
39 51 43 43 43 43 43 43 43 35 35 35 35 35 35 43 43 43 43 43 43 35 35 37 43 43 43 35 35 35 35 35 35 43 43 43 43 43 43 43 43 43 43 43 56 56 56 56 38 38 43 43 46 46 56 56 56 56 51 51 43 36 36 36 36 36 43 51 43 35 35 35 35 35 43 56 56 56 56 56 51 45 45 45 45 45 45 51 51 51 43 43 43 43 43 43 45 45 45 45 45 51 56 56 56 51 45 45 45 45 43 43 45 45 45 51 56 51 51 51 51 51 51 51 45 43 43 43 43 45 45 56 43 36 36 36 36 36 36 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 45 45 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 45 45 51 51 56 56 56 56 71 58 58 58 58 67 65 67 78 75 73 80 80 80 77 67 69 69 60 60 60 66 69 76 74 76 69 69 69 60 60 63 69 70 85 85 85 85 90 88 88 88 85 81 71 64 69 64 64 69 73 77 83 82 81 83 83 81 81 70 72 72 69 74 72 80 75 75 80 80 88 88 88 88 85 81 81 88 88 77 81 89 79 74 74 85 88 83 88 85 85 85 90 90 78 80 85 88 78 76 85 81 85 90 85 79 77 90 90 90 79 79 90 85 80 80 80 80 75 75 76 76 76 80 80 80 83 83 83 77 63 63 70 85 77 74 72 72 72 75 72 89 90 90 90 90 84 84 81 79 79 82 88 88 90 90 90 90 90 90 76 74 74 76 77 67 71 75 78 78 86 80 75 75 75 79 72 84 90 90 90 90 90 90 90 90 85 80 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 83 78 83 77 77 76 83 88 88 90 90 77 72 72 72 72 88 88 88 90 84 84 86 88 88 90 90 90 90 90 90 90 86 80 84 79 79 79 75 79 76 88 82 85 90 90 90 90 90 90 88 90 90 90 86 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 90 86 74 79 80 74 72 83 83 81 78 78 90 85 85 74 75 75 75 75 79 86 90 90 90 80 69 69 69 67 70 83 85 90 90 90 83 77 77 72 72 81 81 85 90 83 83 69 65 64 66 74 69 85 83 72 72 66 61 56 60 55 59 56 64 65 74 90 83 80 75 76 80 71 75 80 67 71 75 73 67 61 61 66 68 68 74 90 90 90 85 80 80 75 80 75 83 83 70 68 65 65 68 70 69 76 83 72 80 70 67 72 68 65 67 69 78 74 74 77 81 77 76 72 66 66 62 60 60 64 68 68 72 68 68 64 69 74 73 74 90 90 90 90 90 90 90 90 76 77 67 71 66 71 78 78 82 66 71 65 59 64 73 66 69 62 69 60 66 66 52 53 53 59 51 51 51 51 51 51 56 43 43 43 43 43 43 45 45 45 51 51 56 56 56 56 56 56 56 56 43 43 43 43 43 43 43 43 
43 43 51 45 45 43 43 43 43 43 43 43 43 51 51 51 51 45 45 43 43 43 43 43 43 51 51 51 51 51 51 43 43 43 43 43 43 45 45 45 45 45 56 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 51 51 45 43 43 43 43 43 43 51 51 51 51 51 51 51 51 43 43 43 43 43 43 43 43 43 43 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 35 35 35 35 35 35 43 43 43 43 43 51 51 51 51 51 51 51 51 51 51 51 51 51 51 43 43 43 43 43 40 40 40 40 38 39 40 40 40 40 40 46 39 39 39 39 39 39 46 51 40 39 39 39 39 39 40 51 51 51 56 39 35 35 35 35 35 35 35 35 35 40 56 51 51 51 39 39 40 40 40 40 56 56 56 51 51 51 40 40 46 40 40 40 45 45 35 40 40 40 40 35 51 40 40 40 40 46 46 46 46 46 51 51 51 40 40 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2277f U 1 AF LL2277r C 314 BS 1 425 LL2277f BS 426 428 LL2277r BS 429 430 LL2277f BS 431 431 LL2277r BS 432 459 LL2277f BS 460 460 LL2277r BS 461 482 LL2277f BS 483 483 LL2277r BS 484 484 LL2277f BS 485 485 LL2277r BS 486 486 LL2277f BS 487 490 LL2277r BS 491 497 LL2277f BS 498 498 LL2277r BS 499 503 LL2277f BS 504 509 LL2277r BS 510 510 LL2277f BS 511 511 LL2277r BS 512 537 LL2277f BS 538 555 LL2277r BS 556 561 LL2277f BS 562 566 LL2277r BS 567 569 LL2277f BS 570 578 LL2277r BS 579 579 LL2277f BS 580 587 LL2277r BS 588 593 LL2277f BS 594 611 LL2277r BS 612 617 LL2277f BS 618 752 LL2277r BS 753 769 LL2277f BS 770 1056 LL2277r RD LL2277f 776 0 0 ccttttcttagaagcatatgataaaaccattgtagtatatcaagaagatg aagatggaagtttgttttcacagtgtcaatAGAAATTGTCATCAAACAAA GTCTACAAAGCTCTTTAATAAACTAGAAATGGCTTAAAGTGTAGGAGCTC TCTCAGCGGTTCCAGCTATGACGGTAAGCAAATTGTTTCCAAAAGGATCA CTGAGATGCTTTGAAAGGTTCTCAACAGGACCTTCTCCAGTAACATAAGC TTGGATGAAGAAACCGAGCATCGCAAACATGGCTAATCTCCCGTTCTTGA TCTCTTTAACCTTGAGGAGAGCTCCTTGTTCGGGGTCCTTAGCAAGTCCT AAAGGATCAAATGGACCTCCGGGGTGTAGCTTGTCCTCAAAATCCAATCC GTTGGTGATTCTGTAGTACTCGGCTCCACCGAGGAGAACAACCTCAGCAA CTACGGCGAGAACAAGGTTGATTGGGATGTTCTTGCCAAAGTAGTTCAAT GTGTTTCCATCAAGAAGCAGAGCACCAGTCTTGAACCAGACGGCTTCAGG ACCACAGTTAGCTCCGTATTTGTTTAAAGCTTCAGGGATGATGAAACCAG 
CTGCTCCCAACATAGCCCATCTCGCATGGATCAGCTCAAAGGCTTGGTAT TTAGCAAAGTTCTCAGGCTTCTTTCCAAGTCCAAATGGGTCATAGCCATA GTCTCCAGCAACTTCACCGTTTAAGTACTCTGGGATCTCTGATCTGTCCA AAAGACCATCCGGCAGGAAAACTCTC QA 81 752 1 776 DS CHROMAT_FILE: LL2277f PHD_FILE: LL2277f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:56 2000 RD LL2277r 787 0 0 nagGAGAGCTCCTTGTTCGGGGTCCTTAGCAAGTCCTAAAGGATCAAATG GACCTCCGGGGTGTAGCTTGTCCTCAAAATCCAATCCGTTGGTGATTCTG TAGTACTCGGCTCCACCGAGGAGAACAACCTCAGCAACTACGGCGAGAAC AAGGTTGATTGGGATGTTCTTGCCAAAGTAGTTCAATGTGTTTCCATCAA GAAGCAGAGCACCAGTCTTGAACCAGACGGCTTCAGGACCACAGTTAGCT CCGTATTTGTTTAAAGCTTCAGGGATGATGAAACCAGCTGCTCCCAACAT AGCCCATCTCGCATGGATCAGCTCAAAGGCTTGGTATTTAGCAAAGTTCT CAGGCTTCTTTCCAAGTCCAAATGGGTCATAGCCATAGTCTCCAGCAACT TCACCGTTTAAGTACTCTGGGATCTCTGATCTGTCCAAAAGACCATCCGG CAGGAAAACTCTCCTGTCAGGACCATACCACTTGGCGAGTTCATCGTTAG CAACAGAGACGGTTTTGGCCTTGGCAGGAGCTGGCTTTTTCTTGGAGAAC AGAGCAACTGTCTTAAACGTAGCCGGGCTCGATGCCGATGGAGCAGACGA TCTTGACACTGCCTTGAAGTTAAGTGGCTTACCAAGCATTTCCGAAACAC CCATAGACGCCATCTTTGTACCCCAAAGTTTCAAGCTTTAACTCTGTTTC AAAACAAtgaaatgctaagtgaaaggaaggaacggacgcgtggxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 6 707 2 743 DS CHROMAT_FILE: LL2277r PHD_FILE: LL2277r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:33 2000 CO Contig15 728 2 1 U taaccccgccccaaaggaagttaagaaagggaaaggggcggggctaggtc cctctcaagtgtgcggtttttttgacggaacaaggggggccctttgttaa ttccccctacaggaaaagggaattcgctcaaaggggtttacatttgttag caacgggtttttcccagggccttttggttaaaacgccggccgtgaaaaag agtatgtgggcacacaaaaaaaagtttggtaacccacgggccttgccagt aaattgggtgctattaaaaggcccgccccatttatttaggttggctttat aaaagatttatttttttgcattgtttcgcgtTTTttatTTtgtTtTTTTt ttatttttTttttTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTTCAGACTCTACCAAACGTAAAATAAAACCTGCT TGTAGCACAACTCCAGGCAACAACAACAAGCAAATTACAATCTCTAATAT ATTTGCCTAGTGACTGACAATAGGCAAATAGGGATAGAGACCCCTTGAAG ATGTAGTAGTAGCTAAGACTAAGATTTATGTGCTCTGTGATCAGGCCTTA ACCTCCTCTCCACCACCACCAGATCCTCCTCCAGCTTTCTCTCTCTCTTT CTCAGCATCCTCTTCTTCTCCTTCCTCAGCATCGGCTTCATCTGCTAGCT 
TTGTGAGCTCATCTTGGCGGACGCGTGG BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 14 16 25 23 24 16 12 12 12 22 33 12 11 11 20 19 29 29 33 44 19 16 11 11 11 19 12 17 17 24 19 15 15 15 21 28 48 48 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 51 51 56 56 56 56 71 75 82 75 72 72 77 75 75 77 70 70 70 74 76 70 71 71 74 78 90 90 77 80 72 74 77 77 77 83 83 80 80 90 90 89 89 82 82 82 76 83 83 85 85 85 90 90 90 90 90 90 90 90 89 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 82 82 83 83 89 83 85 85 85 85 85 85 82 82 82 83 83 83 88 90 82 80 80 80 80 80 82 82 82 85 82 82 82 85 90 88 88 90 90 90 90 90 90 90 90 85 80 80 78 78 78 78 78 78 78 78 78 78 86 86 86 86 86 80 80 80 80 89 82 88 90 78 78 78 73 73 78 75 73 78 78 80 86 89 88 88 81 81 86 83 76 76 75 75 75 75 90 90 90 90 90 90 90 86 86 86 90 90 90 90 90 90 90 88 88 85 85 85 85 85 85 90 90 90 90 90 84 84 80 80 87 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 77 76 76 76 76 76 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 82 82 72 76 76 71 71 71 72 72 78 82 86 83 82 82 76 75 75 75 77 77 78 78 78 82 81 81 79 74 89 67 66 66 69 75 75 75 75 75 75 70 70 76 75 44 44 44 29 29 34 AF LL2279r C 1 AF LL2279f U 412 BS 1 728 LL2279r RD LL2279r 772 0 0 
taaccccgccccaaaggaagttaagaaagggaaaggggcggggctaggtc cctctcaagtgtgcggtttttttgacggaacaaggggggccctttgttaa ttccccctacaggaaaagggaattcgctcaaaggggtttacatttgttag caacgggtttttcccagggccttttggttaaaacgccggccgtgaaaaag agtatgtgggcacacaaaaaaaagtttggtaacccacgggccttgccagt aaattgggtgctattaaaaggcccgccccatttatttaggttggctttat aaaagatttatttttttgcattgtttcgcgtTTTttatTTtgtTtTTTTt ttatttttTttttTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTTCAGACTCTACCAAACGTAAAATAAAACCTGCT TGTAGCACAACTCCAGGCAACAACAACAAGCAAATTACAATCTCTAATAT ATTTGCCTAGTGACTGACAATAGGCAAATAGGGATAGAGACCCCTTGAAG ATGTAGTAGTAGCTAAGACTAAGATTTATGTGCTCTGTGATCAGGCCTTA ACCTCCTCTCCACCACCACCAGATCCTCCTCCAGCTTTCTCTCTCTCTTT CTCAGCATCCTCTTCTTCTCCTTCCTCAGCATCGGCTTCATCTGCTAGCT TTGTGAGCTCATCTTGGCGGACGCGTGGxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxat QA 330 728 1 728 DS CHROMAT_FILE: LL2279r PHD_FILE: LL2279r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:42 2000 RD LL2279f 779 0 5 ncttttaagacaactACCAAACGTAAAATAAAACCTGCTTGTAGCACAAC TCCAGGCAACAACAACAAGCAAATTACAATCTCTAATATATTTGCCTAGT GACTGACAATAGGCAAATAGGGATAGAGACCCCTTGAAGATGTAGTAGTA GCTAAGACTAAGATTTATGTGCTCTGTGATCAGGCCTTAACCTCCTCTCC ACCACCACCAGATCCTCCTCCAGCTTTCTCTCTCTCTTTCTCAGCATCCT CTTCTTCTCCTTCCTCAGCATCGGCTTCATCTGCTAGCTTTGTGAGCTCA TCTTGGCGGACGCGTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxcgtcaaagcctggggtgcctaatgag cgagctaactcacattaattgcgctgcgctcactgcccgctttccagtcg ggacgcctgccgtgccacctgcattaacgcactccccatcgcgcggggag aggcggcttgcgcactgtccgccacggcggctctctttctcaccactgag acggtcgacacccgcccgccctcttccgtctcgtcccgtcacagccgctc cgtgccgcctcatcctcgatcgtcccggcgggctcatcgttctgcccgct cgcgcgctcgtcccccttttccgctcccc QA 17 545 14 317 DS CHROMAT_FILE: LL2279f PHD_FILE: LL2279f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:05 2000 RT{ LL2279f chimera phrap 481 654 000919:094547 } RT{ LL2279f matchElsewhereHighQual phrap 590 606 000919:094547 } RT{ LL2279f matchElsewhereHighQual 
phrap 525 553 000919:094547 } RT{ LL2279f matchElsewhereHighQual phrap 502 523 000919:094547 } RT{ LL2279f matchElsewhereHighQual phrap 482 500 000919:094547 } CO Contig16 440 2 50 U CCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTAAAGTAAAACCAACGAAAGACTCA*ACAGCTTG AAAACCCCATAAAAAAAAACAGAGTTTAGAGTTTCCAAAGAAGAAAACCA AATCATGGTCTTCTCTTTGCTTCTACAAACCTCAAATCATAATGTTTTAA GAAACTAAAGCCAAATCCATACCATACATACTTAGAAATGTCTTGCTCGC TCCCTTCCTCTGATCACTTCAACAACTACGCTTGTTTCCCTTCTTCAGGC TCGTCTTTTGACGCCTCCTTGATCTCATCACCACCCACCTCATCGTTGAG GTCAGAAGTCCAGAGGGTCAGATTGTCACGGAGAAGTTGCATTATCAGTG TACTGTCTTTGTATGATTCTTCTCCTAATCGGACGCGTGG BQ 27 42 47 50 50 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 41 45 43 43 43 43 43 43 56 56 46 43 51 43 46 47 82 75 75 65 56 56 56 56 51 51 66 66 58 77 79 84 77 79 90 90 90 90 90 79 79 79 79 85 90 90 90 90 90 90 90 90 86 80 80 80 80 82 90 90 90 90 90 90 90 90 90 90 90 86 82 90 90 90 90 90 86 82 90 90 90 90 90 90 90 90 90 80 80 87 80 85 89 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 82 82 82 85 90 90 90 90 90 90 86 86 86 86 90 90 86 86 86 78 71 75 76 76 76 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 90 88 88 88 88 90 90 90 90 90 88 85 88 78 90 86 78 78 78 78 78 78 86 90 90 90 90 90 90 88 88 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 86 86 86 79 86 88 90 90 90 90 90 90 90 90 90 90 90 90 87 86 86 88 79 77 77 79 79 86 90 90 90 86 78 78 78 78 78 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 83 83 90 84 84 84 84 84 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 85 90 86 86 82 90 90 90 90 90 82 82 90 90 90 90 85 85 86 84 77 74 76 76 76 76 90 90 90 90 90 90 78 76 81 85 83 82 77 79 80 80 81 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 76 76 90 80 90 90 90 58 58 58 43 51 56 AF LL2280r C -296 AF LL2280f U 66 BS 1 82 
LL2280r BS 83 87 LL2280f BS 88 139 LL2280r BS 140 144 LL2280f BS 145 157 LL2280r BS 158 159 LL2280f BS 160 160 LL2280r BS 161 161 LL2280f BS 162 162 LL2280r BS 163 166 LL2280f BS 167 184 LL2280r BS 185 185 LL2280f BS 186 191 LL2280r BS 192 195 LL2280f BS 196 208 LL2280r BS 209 212 LL2280f BS 213 215 LL2280r BS 216 217 LL2280f BS 218 220 LL2280r BS 221 228 LL2280f BS 229 229 LL2280r BS 230 230 LL2280f BS 231 234 LL2280r BS 235 242 LL2280f BS 243 257 LL2280r BS 258 263 LL2280f BS 264 276 LL2280r BS 277 287 LL2280f BS 288 288 LL2280r BS 289 295 LL2280f BS 296 299 LL2280r BS 300 316 LL2280f BS 317 319 LL2280r BS 320 328 LL2280f BS 329 337 LL2280r BS 338 361 LL2280f BS 362 364 LL2280r BS 365 371 LL2280f BS 372 372 LL2280r BS 373 378 LL2280f BS 379 382 LL2280r BS 383 386 LL2280f BS 387 394 LL2280r BS 395 404 LL2280f BS 405 405 LL2280r BS 406 409 LL2280f BS 410 410 LL2280r BS 411 431 LL2280f BS 432 434 LL2280r BS 435 440 LL2280f RD LL2280r 782 0 5 cacgctgcgcgtaaccaccacaccccccgcgcttaatgcgccgctacagg gcgcgtcccattcgccattcaggctgcgcaactgttgggaagggcgatcg gtgcgggccttttcgctattacgccaactggcgaaagggggatgtgctgc aaggcgattaagttgggtaacgccagggttttxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxCCT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTAAAGTAAAACCAACGAAAGACTCA*ACAGCTTGAAA ACCCCATAAAAAAAAACAGAGTTTAGAGTTTCCAAAGAAGAAAACCAAAT CATGGTCTTCTCTTTGCTTCTACAAACCTCAAATCATAATGTTTTAAGAA ACTAAAGCCAAATCCATACCATACATACTTAGAAATGTCTTGCTCGCTCC CTTCCTCTGATCACTTCAACAACTACGCTTGTTTCCCTTCTTCAGGCTCG TCTTTTGACGCCTCCTTGATCTCATCACCACCCACCTCATCGTTGAGGTC AGAAGTCCAGAGGGTCAGATTGTCACGGAGAAGTTGCATTATCAGTGTAC TGTCTTTGTATGATTCTTCTCCTAATCGGACGCGTGGxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxatt QA 94 737 298 737 DS CHROMAT_FILE: LL2280r PHD_FILE: LL2280r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:10 2000 RT{ LL2280r chimera phrap 1 182 000919:094547 } RT{ LL2280r matchElsewhereHighQual phrap 1 25 000919:094547 } RT{ LL2280r 
matchElsewhereHighQual phrap 27 110 000919:094547 } RT{ LL2280r matchElsewhereHighQual phrap 75 126 000919:094547 } RT{ LL2280r matchElsewhereHighQual phrap 128 182 000919:094547 } RD LL2280f 777 0 3 ctttaagtaacccaACGAAAGACTcanacAGCTTGAAAACCCCATAAAAA AAAACAGAGTTTAGAGTTTCCAAAGAAGAAAACCAAATCATGGTCTTCTC TTTGCTTCTACAAACCTCAAATCATAATGTTTTAAGAAACTAAAGCCAAA TCCATACCATACATACTTAGAAATGTCTTGCTCGCTCCCTTCCTCTGATC ACTTCAACAACTACGCTTGTTTCCCTTCTTCAGGCTCGTCTTTTGACGCC TCCTTGATCTCATCACCACCCACCTCATCGTTGAGGTCAGAAGTCCAGAG GGTCAGATTGTCACGGAGAAGTTGCATTATCAGTGTACTGTCTTTGTATG ATTCTTCTCCTAATCGGACGCGTGGxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxgcctggggtgcc taatgagtgagctaactcacattaattgcgttgcgctcactgcccgcttt ccagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcg cggggagaggcggcttgcgtattgggcgccagggtggtttttcttttcac cagtgagacgggcaacagctgattgcccttcaccgcctggccctgagaga gttgcagcaagcggtccacgctggttt QA 29 755 12 375 DS CHROMAT_FILE: LL2280f PHD_FILE: LL2280f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:32 2000 RT{ LL2280f chimera phrap 539 777 000919:094547 } RT{ LL2280f matchElsewhereHighQual phrap 665 777 000919:094547 } RT{ LL2280f matchElsewhereHighQual phrap 539 663 000919:094547 } CO Contig17 969 2 46 U cttttagaaacgaggttcggtttctgataaaatcatacaaagatacgaca aaggaagtttcacaaatatacaaacaaactCTCTTTCACATGATAATTAA CCTTTTAACTCCTTCACTTTCCGGGGACGAAGTTGGTGGCAAAGGCCCAT GCATTGTTGTTGACTGGATCAGCCAAATGGTCAGCAAGATTCTCCAACGG TCCCTTTCCGGTGACGATAGCCTGAACGAAGAATCCAAACATAGAGAACA TAGCCAACCTTCCGTTCTTGATCTCCTTCACCTTCAACTCCGCGAAAGCT TCTGGGTCGGTAGCAAGACCCAACGGGTCAAAGCTGCCTCCTGGGTAGAG CAAGTCCTCTGCTTCCCCCAATGGTCCTTCTCCGGCGACTCTGTAACCTT CAACAGCTCCCATGAGGATCACCTGAGTAGCCCAAATGGCTAAGATGCTC TGTGCGTGGACCAAGCTCGGGTTTCCCAAGTAGTCGAGCCCTCCTTCGCT GAAGATCTGTGAACCAGCCTTGAACCAAACAGCTTCTCCGAACTTCACTC CGTTCCTAGCCAAAAGCTCAGGGAAAACGCAGCCTAGGGCTCCAAGCATG GCCCATCTGCAGTGGATAACTTCTAGCTCACGGTTCCTGGCGAAGGTCTC 
GGGGTCAGCGGAAAGACCGGCGGTGTCCCATCCGTAGTCACCGGGGAACT CTCCTGTGAGGTAGCTTGGGGGCTCTCCGGAGAATGGACCCAAGTACTTG ACCCTGTCGGATCCGTACCATGGGCTGCCAGATGGACCGGTGGGTTTGAC GGTTTTTCTCATGGTGACACGGCCGGTTCCAAAGACCTCTGAGGCTCCCG GTGAGAGCTTCACGGCCTTTCCGGTAAAGGCAGGGGAGGACAGAGCCATG GTTGAGGCAGCCATTAGAAATTTTCTTGATCTTttttagtttttgtgttt tttttttcggacggcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 40 56 56 56 40 40 40 40 39 39 40 51 51 51 51 35 35 35 35 35 35 45 45 40 46 46 40 40 40 45 45 45 46 46 40 40 40 40 45 45 45 45 45 51 40 39 35 35 35 35 35 35 56 56 56 56 40 40 40 40 40 40 40 40 40 40 40 40 43 43 43 43 43 43 56 56 51 51 51 43 43 43 43 43 43 43 45 51 45 43 40 40 43 37 40 41 51 56 56 56 56 37 38 38 38 38 38 42 43 41 41 41 41 41 38 36 36 36 38 36 40 42 42 42 41 45 40 38 38 38 41 38 38 36 36 36 36 36 40 43 43 43 51 40 36 36 36 36 51 50 71 73 79 79 79 75 85 77 74 68 73 69 74 79 83 82 70 63 60 56 56 61 64 60 65 62 62 62 62 62 67 67 69 70 67 67 70 67 74 80 90 90 82 80 75 76 75 68 72 65 60 60 60 70 77 72 72 72 75 61 61 66 67 67 63 69 70 77 70 65 63 66 64 60 66 62 62 74 72 68 72 83 85 85 85 83 83 80 77 69 65 60 63 67 69 61 68 78 80 78 88 73 72 75 72 72 67 69 68 83 83 72 90 74 80 82 82 78 79 85 85 85 85 90 90 86 90 90 85 83 90 90 90 90 90 90 90 88 88 88 83 86 86 85 83 81 81 85 85 85 90 89 84 74 74 74 79 79 85 89 85 90 90 85 80 80 85 90 90 90 90 90 89 89 85 85 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 80 80 85 80 88 88 90 90 90 90 90 90 83 83 80 81 90 90 90 90 90 90 90 85 90 90 90 83 83 83 83 83 83 90 90 90 90 90 90 90 88 88 88 83 90 90 90 90 90 90 90 90 90 90 90 90 88 85 85 84 84 84 84 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 80 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 83 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 83 83 83 83 83 88 90 
90 90 90 90 88 86 86 86 90 86 90 90 90 90 90 88 83 80 80 82 90 90 90 90 90 87 87 87 87 90 87 90 90 90 90 90 90 90 90 87 90 87 85 85 85 85 78 79 79 85 79 83 83 83 79 71 70 71 72 74 68 67 67 72 83 83 83 85 90 90 89 87 76 68 68 68 64 68 75 90 72 80 68 70 63 65 62 72 72 72 70 75 68 67 68 76 90 74 74 72 72 83 83 74 72 68 70 72 70 64 68 77 90 80 74 74 80 80 76 77 64 72 74 72 63 61 65 62 68 66 70 66 60 71 71 66 60 60 70 72 68 66 66 64 64 71 69 68 66 66 66 71 74 76 79 81 76 69 63 72 72 75 75 74 70 67 70 70 67 73 64 76 71 69 68 65 65 68 70 65 63 67 66 64 69 70 81 81 70 74 74 62 62 59 53 53 58 51 51 51 56 56 51 51 43 43 43 43 43 43 51 51 51 51 45 45 35 35 35 35 43 43 56 56 56 56 56 56 43 43 43 43 43 43 43 43 43 43 43 43 35 35 35 35 39 40 56 56 45 45 45 45 45 51 56 46 40 40 40 40 40 40 40 39 45 35 35 35 35 35 35 39 39 39 39 40 51 40 40 39 39 39 39 40 46 51 45 45 40 35 35 35 39 39 39 39 39 39 39 39 40 40 40 40 40 40 51 40 39 39 39 39 39 35 39 39 39 39 39 40 40 35 35 32 32 33 33 39 39 46 46 46 46 46 46 51 51 51 51 51 51 40 40 37 40 40 40 46 46 51 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2281f U 1 AF LL2281r C 227 BS 1 317 LL2281f BS 318 318 LL2281r BS 319 360 LL2281f BS 361 365 LL2281r BS 366 408 LL2281f BS 409 424 LL2281r BS 425 441 LL2281f BS 442 448 LL2281r BS 449 449 LL2281f BS 450 450 LL2281r BS 451 458 LL2281f BS 459 468 LL2281r BS 469 471 LL2281f BS 472 475 LL2281r BS 476 476 LL2281f BS 477 477 LL2281r BS 478 500 LL2281f BS 501 536 LL2281r BS 537 540 LL2281f BS 541 549 LL2281r BS 550 554 LL2281f BS 555 555 LL2281r BS 556 558 LL2281f BS 559 560 LL2281r BS 561 561 LL2281f BS 562 564 LL2281r BS 565 568 LL2281f BS 569 572 LL2281r BS 573 573 LL2281f BS 574 581 LL2281r BS 582 582 LL2281f BS 583 593 LL2281r BS 594 599 LL2281f BS 600 612 LL2281r BS 613 613 LL2281f BS 614 623 LL2281r BS 624 624 LL2281f BS 625 651 LL2281r BS 652 653 LL2281f BS 654 654 LL2281r BS 655 656 LL2281f BS 657 662 LL2281r BS 663 663 LL2281f BS 664 776 LL2281r BS 777 780 
LL2281f BS 781 969 LL2281r RD LL2281f 785 0 0 cttttagaaacgaggttcggtttctgataaaatcatacaaagatacgaca aaggaagtttcacaaatatacaaacaaactCTCTTTCACATGATAATTAA CCTTTTAACTCCTTCACTTTCCGGGGACGAAGTTGGTGGCAAAGGCCCAT GCATTGTTGTTGACTGGATCAGCCAAATGGTCAGCAAGATTCTCCAACGG TCCCTTTCCGGTGACGATAGCCTGAACGAAGAATCCAAACATAGAGAACA TAGCCAACCTTCCGTTCTTGATCTCCTTCACCTTCAACTCCGCGAAAGCT TCTGGGTCGGTAGCAAGACCCAACGGGTCAAAGCTGCCTCCTGGGTAGAG CAAGTCCTCTGCTTCCCCCAATGGTCCTTCTCCGGCGACTCTGTAACCTT CAACAGCTCCCATGAGGATCACCTGAGTAGCCCAAATGGCTAAGATGCTC TGTGCGTGGACCAAGCTCGGGTTTCCCAAGTAGTCGAGCCCTCCTTCGCT GAAGATCTGTGAACCAGCCTTGAACCAAACAGCTTCTCCGAACTTCACTC CGTTCCTAGCCAAAAGCTCAGGGAAAACGCAGCCTAGGGCTCCAAGCATG GCCCATCTGCAGTGGATAACTTCTAGCTCACGGTTCCTGGCGAAGGTCTC GGGGTCAGCGGAAAGACCGGCGGTGTCCCATCCGTAGTCACCGGGGAACT CTCCTGTGAGGTAGCTTGGGGGCTCTCCGGAGAATGGACCCAAGTACTTG ACCCTGTCGGATCCGTACCATGGGCTGCCAGATgg QA 81 776 1 785 DS CHROMAT_FILE: LL2281f PHD_FILE: LL2281f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:36 2000 RD LL2281r 787 0 0 CGAAGAATCCAAACATAGAGAACATAGCCAACCTTCCGTTCTTGATCTCC TTCACCTTCAACTCCGCGAAAGCTTCTGGGTCGGTAGCAAGACCCAACGG GTCAAAGCTGCCTCCTGGGTAGAGCAAGTCCTCTGCTTCCCCCAATGGTC CTTCTCCGGCGACTCTGTAACCTTCAACAGCTCCCATGAGGATCACCTGA GTAGCCCAAATGGCTAAGATGCTCTGTGCGTGGACCAAGCTCGGGTTTCC CAAGTAGTCGAGCCCTCCTTCGCTGAAGATCTGTGAACCAGCCTTGAACC AAACAGCTTCTCCGAACTTCACTCCGTTCCTAGCCAAAAGCTCAGGGAAA ACGCAGCCTAGGGCTCCAAGCATGGCCCATCTGCAGTGGATAACTTCTAG CTCACGGTTCCTGGCGAAGGTCTCGGGGTCAGCGGAAAGACCGGCGGTGT CCCATCCGTAGTCACCGGGGAACTCTCCTGTGAGGTAGCTTGGGGGCTCT CCGGAGAATGGACCCAAGTACTTGACCCTGTCGGATCCGTACCATGGGCT GCCAGATGGACCGGTGGGTTTGACGGTTTTTCTCATGGTGACACGGCCGG TTCCAAAGACCTCTGAGGCTCCCGGTGAGAGCTTCACGGCCTTTCCGGTA AAGGCAGGGGAGGACAGAGCCATGGTTGAGGCAGCCATTAGAAATTTTCT TGATCTTttttagtttttgtgttttttttttcggacggcgtggxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 1 707 1 743 DS CHROMAT_FILE: LL2281r PHD_FILE: LL2281r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:15 2000 CO Contig18 491 2 1 U GCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTC 
ACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAA TCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCCAGGGTGGT TTTTCTTTTCACCAGTGAGACGGGCAACAGCTGATTGCCCTTCACCGCCT GGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCTGGTTTGCCCCAGCAGG CGAAAATCCTGTTTGATGGTGGTTAACGGCGGGATATAACATGAGCTGTC TTCGGTATCGTCGTATCCCACTACCGAGATATCCGCACCAACGCGCAGCC CGGACTCGGTAATGGCGCGCATTGCGCCCAGCGCCATCTGATCGTTGGCA ACCAGCATCGCAGTGGGAACGATGCCCTCATTCAGCATTTGCATGgtttg ttgaAAACCGGACATGGCActccaGTcgccttCccgttccg BQ 43 43 43 43 51 51 56 43 43 43 43 43 43 45 45 56 56 56 56 56 51 43 43 43 43 43 43 43 43 43 43 45 45 45 51 51 51 43 36 36 36 36 36 36 36 36 36 36 36 36 36 36 36 36 43 45 43 43 43 43 43 45 56 51 51 51 51 56 56 51 51 51 45 45 45 45 45 56 51 51 45 45 45 45 45 43 45 43 43 43 43 45 45 51 51 51 56 56 56 51 51 45 45 45 45 51 51 51 45 45 45 45 45 45 51 56 56 56 56 51 45 45 45 45 45 51 51 43 43 43 43 43 43 56 56 56 43 43 43 43 43 43 40 43 43 46 56 56 56 56 56 56 56 56 56 56 56 51 51 45 45 45 45 51 51 56 56 51 45 45 45 45 45 51 56 56 56 56 45 45 51 45 45 45 45 45 45 45 45 45 51 45 45 45 40 39 35 35 35 39 40 45 45 45 45 51 51 51 51 51 51 51 56 40 40 40 40 37 40 56 56 45 45 45 45 45 45 51 40 40 39 39 39 40 40 40 51 39 39 37 37 37 39 46 56 56 51 51 45 45 45 45 45 45 56 51 51 51 51 51 51 56 56 56 56 51 51 51 51 51 51 35 35 35 35 35 39 46 42 42 42 51 46 56 46 46 51 51 51 51 56 56 56 51 51 45 45 35 35 35 40 40 40 40 46 56 40 40 40 40 40 36 40 40 40 34 29 28 32 32 32 36 36 40 40 40 40 40 40 40 40 40 33 32 29 29 27 27 25 25 28 32 32 32 32 32 32 32 29 32 32 26 25 25 27 27 29 32 44 40 40 40 34 26 25 25 27 32 32 34 40 39 37 29 29 29 29 29 29 42 35 35 29 32 29 32 40 40 40 40 40 32 32 29 24 24 29 29 29 25 25 29 29 26 26 32 32 29 29 29 29 27 27 25 25 29 25 25 25 29 32 30 30 29 29 24 25 25 25 26 32 32 34 32 40 40 40 34 22 22 24 24 25 24 22 19 19 10 17 9 9 9 19 16 25 24 22 25 25 26 20 23 25 27 25 25 21 25 21 19 9 10 10 19 21 22 19 15 10 11 14 17 21 12 15 0 0 0 0 0 0 AF LL2283r C 1 AF LL2283f U -303 BS 1 491 LL2283f RD LL2283r 800 1 5 
aagggcgaaaaaccgtttatcagggcgatgggcccctacgtgaaccatcc ccctaatcaagttttttggggtcgaggtgccgtaaagcccttaattggaa ccctaaagggagccccccatttaaaactttacggggaaaaccggcgaacg tggcgaaaaaagaagggaaaaaaaccaaaagagcgggcgctaaggccctg gcaaatgtaacggtccccctgggggtaaccccccccccccccgcgcttta tgggccggttcagggggggttccctttgccatttagggtgggcaaatttt tggaaaggcgattggttcgggccttttttttttttccccaactggggaaa aggggatttttttcaaagcgatttaatttggtaaccccagggtttttxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxCCCCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTTTGGTGCAACAAACACAAAAGCATAATCTCTCA GATTAATAAATAATAACGAGCTTTAACTCCAATCTGCAGAGAACATTTCA AGCACATGGGAGAAGCTCTGCAGAACAAGAAAGATCATATCTTCCCGGAC gctggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxagt QA 265 752 -1 -1 DS CHROMAT_FILE: LL2283r PHD_FILE: LL2283r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:24 2000 WR{ LL2283r unaligned phrap 000919:094547 } RT{ LL2283r chimera phrap 1 397 000919:094547 } RT{ LL2283r matchElsewhereLowQual phrap 131 179 000919:094547 } RT{ LL2283r matchElsewhereLowQual phrap 194 225 000919:094547 } RT{ LL2283r matchElsewhereHighQual phrap 336 360 000919:094547 } RT{ LL2283r matchElsewhereHighQual phrap 383 397 000919:094547 } RD LL2283f 796 0 2 ngtgnngggaacAAACACAAAAGCATAATCTCTCAGATTAATAAATAATA ACGAGCTTTAACTCCAATCTGCAGAGAACATTTCAAGCACATGGGAGAAG CTCTGCAGAACAAGAAAGATCATATCTTCCCGGACGCGTGGxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGC GCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAA TGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCCAGGG TGGTTTTTCTTTTCACCAGTGAGACGGGCAACAGCTGATTGCCCTTCACC GCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCTGGTTTGCCCCAG CAGGCGAAAATCCTGTTTGATGGTGGTTAACGGCGGGATATAACATGAGC TGTCTTCGGTATCGTCGTATCCCACTACCGAGATATCCGCACCAACGCGC 
AGCCCGGACTCGGTAATGGCGCGCATTGCGCCCAGCGCCATCTGATCGTT GGCAACCAGCATCGCAGTGGGAACGATGCCCTCATTCAGCATTTGCATGg tttgttgaAAACCGGACATGGCActccaGTcgccttCccgttccgn QA 11 789 305 795 DS CHROMAT_FILE: LL2283f PHD_FILE: LL2283f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:46 2000 RT{ LL2283f chimera phrap 305 651 000919:094547 } RT{ LL2283f matchElsewhereHighQual phrap 305 512 000919:094547 } CO Contig19 911 2 62 U ctttaagggaacaagagttgaatataaagtcacaatatagcagacagaaa tcatgatcaaataaacaaatatttttttaaTACAGAAAACAATACAAAAA GACAGCTGGGCCAAGCTAATCGAGCATGTGTGTCTCTTCGCCATCATTAC CTGTCTCCGGCACAGCCGAGCTGGAGCCAACGGCGGAGATAATGGCATTC AAAGCACGAATCTGCATCTCCATCGCAACAATGTAATCAGTAGTTTCTTC TAAAACCACCGGTAATGGTTGTTTCCGGCAACCGGGGATTAACCGGCTCA GCAATTTCACTTTCCTCTGTACAGCTGTAAACCCCTTCCCCTTCAACCTC AAAACCGTTGCTCTCTTTTTCTTTAACCGGCTATTCCCGGTTATCGAAAC CACCGTGGGAGGAGGATTCGAAGTTCTTAACCGGTTATGTTTCCTAAACT TGAGCTTCACTTTAACGGCTTTAGATAGTATAGCTCGGCTCCACAGTGTT CTGCCTCTCGCCGTAACCGCCAGAGACCTGTAAGCGGCGTCACGTACGGC TATACCTCCGCCGCGAGGAGAGAGCTTACCAGATGCATCGGTGGCGGCGC TAATACGTAGTTCCCTTAGAGATTGAATGATGTTGGTCGAGTAGATTTGC TGCTGCTTCTCGGATCTCCATTTCTGCACCAAAGATTGAGACGAAGAATG CGACGGTGATGGTGACGTTGGTTTCTTCTTACGCTTACGCCGCGACTGAT CATAACTGGAGCCAGCCGTTGGATGATTTGTCAGTGGAGAAATATACTCC ATGTCAATTCCTCTGCTTCTCCGGTGACAGTACCTGAAGACAGAAACAGA GAAAGGAGATTAGCAAGGTGAGACACgtatgtggaggaagaggaaaatcc cggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 45 40 40 40 51 51 39 40 45 40 40 40 51 51 51 51 51 51 51 46 46 40 40 39 39 40 40 40 40 39 39 39 39 35 35 35 35 35 35 35 39 39 39 39 46 46 46 56 56 51 51 45 45 45 45 40 40 38 38 38 38 38 40 51 51 39 39 39 39 39 39 51 45 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 51 51 51 51 43 43 58 58 58 62 58 58 68 83 80 90 83 80 72 72 70 68 70 68 77 80 77 68 73 72 60 65 65 68 68 76 71 76 80 80 80 83 79 85 85 89 81 75 65 68 64 62 58 58 66 74 74 80 80 80 80 82 82 90 79 72 72 70 83 76 77 
74 78 75 69 72 72 72 69 67 74 74 85 80 72 69 74 77 77 81 80 78 78 83 80 76 85 83 83 75 72 69 66 63 63 58 58 72 83 90 90 79 85 70 70 70 66 67 61 69 72 83 83 75 72 72 74 76 79 81 90 90 83 85 89 89 90 89 83 90 90 90 90 90 83 85 80 80 88 83 83 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 80 80 80 88 89 90 90 90 90 85 88 88 88 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 90 90 89 90 90 90 90 90 90 90 90 90 90 87 87 89 85 85 85 90 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 90 90 90 90 90 90 90 90 88 88 88 90 90 86 86 90 90 90 90 90 90 90 90 88 83 88 81 83 83 85 85 85 85 85 90 90 90 86 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 83 82 82 78 78 78 90 90 90 90 90 90 90 90 90 90 90 88 88 83 83 80 80 80 83 90 88 88 83 83 83 88 90 88 88 80 90 90 90 90 90 90 90 90 90 83 83 88 88 90 90 90 90 90 90 90 90 85 86 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 78 78 78 78 76 76 74 78 78 86 86 82 80 79 77 82 75 83 83 72 74 80 80 88 83 74 74 77 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 83 80 80 83 83 83 88 83 85 90 90 80 84 80 80 80 78 78 78 78 76 76 82 83 83 83 83 77 83 83 83 76 78 78 83 90 90 90 83 83 75 75 75 80 90 90 90 90 90 90 82 80 79 74 72 70 70 70 72 72 81 85 76 72 72 72 72 72 74 74 68 78 90 90 90 90 77 77 72 72 72 68 65 65 79 74 74 74 80 85 90 90 80 73 61 68 66 66 68 65 61 64 64 64 65 65 65 64 65 65 74 80 80 83 90 90 90 80 80 80 80 80 72 67 68 70 70 64 61 60 58 58 62 58 61 62 78 71 71 51 45 45 43 43 43 43 43 43 43 43 43 43 45 45 45 45 45 40 40 40 40 40 40 56 56 51 40 40 40 40 40 40 51 51 51 51 51 51 51 51 51 51 51 51 40 45 40 39 39 39 40 39 39 39 39 39 39 46 46 46 40 40 40 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 40 39 39 39 39 39 51 39 39 39 39 35 40 37 51 45 40 40 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2286f U 1 AF LL2286r C 176 BS 1 
283 LL2286f BS 284 284 LL2286r BS 285 308 LL2286f BS 309 312 LL2286r BS 313 322 LL2286f BS 323 323 LL2286r BS 324 331 LL2286f BS 332 342 LL2286r BS 343 343 LL2286f BS 344 344 LL2286r BS 345 347 LL2286f BS 348 353 LL2286r BS 354 354 LL2286f BS 355 371 LL2286r BS 372 376 LL2286f BS 377 392 LL2286r BS 393 395 LL2286f BS 396 402 LL2286r BS 403 414 LL2286f BS 415 422 LL2286r BS 423 425 LL2286f BS 426 426 LL2286r BS 427 431 LL2286f BS 432 442 LL2286r BS 443 446 LL2286f BS 447 451 LL2286r BS 452 454 LL2286f BS 455 463 LL2286r BS 464 466 LL2286f BS 467 474 LL2286r BS 475 476 LL2286f BS 477 478 LL2286r BS 479 479 LL2286f BS 480 480 LL2286r BS 481 481 LL2286f BS 482 496 LL2286r BS 497 498 LL2286f BS 499 506 LL2286r BS 507 515 LL2286f BS 516 530 LL2286r BS 531 534 LL2286f BS 535 541 LL2286r BS 542 543 LL2286f BS 544 546 LL2286r BS 547 547 LL2286f BS 548 556 LL2286r BS 557 557 LL2286f BS 558 562 LL2286r BS 563 565 LL2286f BS 566 569 LL2286r BS 570 572 LL2286f BS 573 586 LL2286r BS 587 591 LL2286f BS 592 622 LL2286r BS 623 628 LL2286f BS 629 629 LL2286r BS 630 635 LL2286f BS 636 641 LL2286r BS 642 642 LL2286f BS 643 709 LL2286r BS 710 711 LL2286f BS 712 911 LL2286r RD LL2286f 776 0 0 ctttaagggaacaagagttgaatataaagtcacaatatagcagacagaaa tcatgatcaaataaacaaatatttttttaaTACAGAAAACAATACAAAAA GACAGCTGGGCCAAGCTAATCGAGCATGTGTGTCTCTTCGCCATCATTAC CTGTCTCCGGCACAGCCGAGCTGGAGCCAACGGCGGAGATAATGGCATTC AAAGCACGAATCTGCATCTCCATCGCAACAATGTAATCAGTAGTTTCTTC TAAAACCACCGGTAATGGTTGTTTCCGGCAACCGGGGATTAACCGGCTCA GCAATTTCACTTTCCTCTGTACAGCTGTAAACCCCTTCCCCTTCAACCTC AAAACCGTTGCTCTCTTTTTCTTTAACCGGCTATTCCCGGTTATCGAAAC CACCGTGGGAGGAGGATTCGAAGTTCTTAACCGGTTATGTTTCCTAAACT TGAGCTTCACTTTAACGGCTTTAGATAGTATAGCTCGGCTCCACAGTGTT CTGCCTCTCGCCGTAACCGCCAGAGACCTGTAAGCGGCGTCACGTACGGC TATACCTCCGCCGCGAGGAGAGAGCTTACCAGATGCATCGGTGGCGGCGC TAATACGTAGTTCCCTTAGAGATTGAATGATGTTGGTCGAGTAGATTTGC TGCTGCTTCTCGGATCTCCATTTCTGCACCAAAGATTGAGACGAAGAATG CGACGGTGATGGTGACGTTGGTTTCTTCTTACGCTTACGCCGCGACTGAT CATAACTGGAGCCAGCCGTTGGATga QA 81 774 1 776 
DS CHROMAT_FILE: LL2286f PHD_FILE: LL2286f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:01 2000 RD LL2286r 781 0 0 nccaACGGCGGAGATAATGGCATTCAAAGCACGAATCTGCATCTCCATCG CAACAATGTAATCAGTAGTTTCTTCTAAAACCACCGGTAATGGTTGTTTC CGGCAACCGGGGATTAACCGGCTCAGCAATTTCACTTTCCTCTGTACAGC TGTAAACCCCTTCCCCTTCAACCTCAAAACCGTTGCTCTCTTTTTCTTTA ACCGGCTATTCCCGGTTATCGAAACCACCGTGGGAGGAGGATTCGAAGTT CTTAACCGGTTATGTTTCCTAAACTTGAGCTTCACTTTAACGGCTTTAGA TAGTATAGCTCGGCTCCACAGTGTTCTGCCTCTCGCCGTAACCGCCAGAG ACCTGTAAGCGGCGTCACGTACGGCTATACCTCCGCCGCGAGGAGAGAGC TTACCAGATGCATCGGTGGCGGCGCTAATACGTAGTTCCCTTAGAGATTG AATGATGTTGGTCGAGTAGATTTGCTGCTGCTTCTCGGATCTCCATTTCT GCACCAAAGATTGAGACGAAGAATGCGACGGTGATGGTGACGTTGGTTTC TTCTTACGCTTACGCCGCGACTGATCATAACTGGAGCCAGCCGTTGGATG ATTTGTCAGTGGAGAAATATACTCCATGTCAATTCCTCTGCTTCTCCGGT GACAGTACCTGAAGACAGAAACAGAGAAAGGAGATTAGCAAGGTGAGACA Cgtatgtggaggaagaggaaaatcccggacgcgtggxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 6 701 2 736 DS CHROMAT_FILE: LL2286r PHD_FILE: LL2286r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:38 2000 CO Contig20 547 2 7 U cCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAT TTGAAGAAACCTTTATAAAAACGATTAAGGGGCTGAATGCGAGACAAAAA TATATTTCTGATTTGAATCAATGAAAAATATATAAATGGAAGATCTAAAA CATTTCTCACCAATGGCGCTATAACACAACAGAGAAGATTGAATTTAAAG GCATGACTGCTCTTGTCACTTCCAGCCTTTCTTTTATTATTGGTTTAAAT AC*GTGAAGGTGGATGAGCTCCATCACCTTGTCAACTTTATAGCTTTGCG ATCGCTGAAGCAGGCAAGTGTTTCCTTGGTGGAAGCTTAAGATCCTCCCA TCCCTCCTCTTCTTCCACTCTCTTGAAGTACTTATCTACCATGTTATCTT TCATATCCTCCAGTCTCCTTGGCTCCCACTTTGGGTTCCTATCTTTGTTT ATCAATATGGCTCTACCCCCCTCCACAAAATCTTTGCTTATATCTCCCTT CATCACATGTGACACCATTCTATTCTCACGGATAAGCGGACGCGTGG BQ 19 22 37 48 48 48 48 48 48 48 48 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 51 51 51 51 51 56 56 56 56 56 56 56 51 51 51 51 51 51 51 51 51 51 51 56 56 56 56 51 60 60 60 60 60 66 71 60 60 45 45 45 45 56 71 71 71 73 77 78 84 90 90 89 87 72 68 68 70 76 80 80 85 87 80 81 81 82 83 78 73 80 80 86 84 
84 90 85 85 90 90 89 89 66 66 66 70 78 72 74 72 72 72 79 76 81 74 74 74 74 80 86 89 84 71 71 71 79 75 84 81 74 66 74 61 62 60 62 62 62 58 60 60 64 68 78 89 88 88 71 71 71 56 56 56 56 56 56 60 60 60 77 83 82 77 75 73 73 70 66 75 77 71 67 73 85 85 80 80 75 74 74 75 75 78 81 85 87 84 82 79 78 80 75 70 73 73 78 90 90 83 74 73 80 71 71 71 75 75 81 72 71 66 66 66 66 66 66 66 66 61 43 43 43 43 58 58 58 71 72 68 66 67 73 60 60 58 58 58 68 74 83 86 84 84 84 66 66 58 43 43 43 43 43 71 71 71 71 76 76 76 71 69 62 62 51 51 51 57 58 65 72 71 67 67 67 65 70 78 76 90 73 70 70 66 58 58 58 43 43 43 43 43 45 60 60 51 51 51 51 51 55 62 58 58 60 60 51 51 51 51 56 56 71 71 77 77 77 89 89 90 90 84 84 78 78 78 79 79 79 89 84 90 74 74 71 79 74 78 75 74 77 77 82 74 67 58 58 62 70 74 74 84 77 74 74 81 81 84 90 78 80 66 70 74 82 82 84 87 84 78 78 79 90 87 86 74 80 72 58 50 35 35 35 35 43 71 66 60 58 58 58 58 66 66 86 86 86 86 86 86 89 73 59 59 55 55 55 84 84 90 86 86 84 90 75 70 64 65 65 67 73 73 86 83 72 72 69 63 55 55 55 64 67 75 81 81 90 90 89 89 72 66 69 76 78 80 86 86 86 86 86 82 82 65 61 61 76 71 80 88 87 79 79 74 79 79 88 88 74 74 74 74 74 77 84 76 64 54 55 54 63 67 84 84 73 69 69 68 77 70 68 64 63 63 66 66 69 67 54 54 54 39 39 39 39 39 35 35 45 45 49 49 49 34 33 32 AF LL2289r C -194 AF LL2289f U 57 BS 1 310 LL2289r BS 311 311 LL2289f BS 312 465 LL2289r BS 466 466 LL2289f BS 467 489 LL2289r BS 490 490 LL2289f BS 491 547 LL2289r RD LL2289r 788 0 2 gcgggcctcttcgctattacgccagctggcgaaagggggatgtgctgcaa ggcgattaagttgggtaacgccagggttttxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxcCTTT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTTGAA GAAACCTTTATAAAAACGATTAAGGGGCTGAATGCGAGACAAAAATATAT TTCTGATTTGAATCAATGAAAAATATATAAATGGAAGATCTAAAACATTT CTCACCAATGGCGCTATAACACAACAGAGAAGATTGAATTTAAAGGCATG ACTGCTCTTGTCACTTCCAGCCTTTCTTTTATTATTGGTTTAAATAC*GT GAAGGTGGATGAGCTCCATCACCTTGTCAACTTTATAGCTTTGCGATCGC 
TGAAGCAGGCAAGTGTTTCCTTGGTGGAAGCTTAAGATCCTCCCATCCCT CCTCTTCTTCCACTCTCTTGAAGTACTTATCTACCATGTTATCTTTCATA TCCTCCAGTCTCCTTGGCTCCCACTTTGGGTTCCTATCTTTGTTTATCAA TATGGCTCTACCCCCCTCCACAAAATCTTTGCTTATATCTCCCTTCATCA CATGTGACACCATTCTATTCTCACGGATAAGCGGACGCGTGGxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaatt QA 19 742 196 742 DS CHROMAT_FILE: LL2289r PHD_FILE: LL2289r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:15 2000 RT{ LL2289r chimera phrap 1 80 000919:094547 } RT{ LL2289r matchElsewhereHighQual phrap 1 80 000919:094547 } RD LL2289f 769 0 3 caacCTTTat*aaAaCgatgaagGGGCTGAATGccagACAAAAATATATT TCTGATTTGAATCAATGAAAAATATATAAATGGAAGATCTAAAACATTTC TCACCAATGGCGCTATAACACAACAGaaaAGATTGAATTTAAAGGCATGA CTGCTCTTGTCACTTCCAGCCTTTCTTTTATTATTGGTTTAAATacagtG AAGGTGGATGAGCTCCATCACCTTgccaACTTTATAGCTTTGCGATCGCT GAAGCAGGCAAGTgcttcCTTGGTGGAAGCTtaacaTCCTCCCATCCCTC CTCTTCTTCCACTCTCTTGAAGTACTTATCTACCATGTTATCTTTCATAT CCTCCagcctCCTTGGCTCCCACTTTGGGTTCCTATCTTTGTTTATCAAT ATGGCTCTACCCCCCTCCACAAAATCTTTGCTTATATCTCCCTTCATCAC ATGTGACACCATTCTATTCTCACGGATAaacgcaCGCGTGgxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxgcctggcgtgcctaatgagcgagctaactcacattaattgcgttgc gctcactgcccgctttccagtcggcaaacctgtccagccagctgcattaa tgaatcggccaacgcgcgg QA 23 720 2 491 DS CHROMAT_FILE: LL2289f PHD_FILE: LL2289f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:37 2000 RT{ LL2289f chimera phrap 655 769 000919:094547 } RT{ LL2289f matchElsewhereHighQual phrap 737 769 000919:094547 } RT{ LL2289f matchElsewhereHighQual phrap 675 724 000919:094547 } CO Contig21 766 2 29 U ccacgcgtccgtttttgatgtaatacAGTTTTTTCTGAATAAAATTTAAC ATTTATTCAAAAAAAAAAAAATAATTAGATTACAAGCTTAACAAATTTAT TATAAATCATCATACCTTGTTCATTTATAAATGAAAATCTGAAGAAAACA AAAAATGTTTGGTAAACACCAACATGAACTGTTGTCAGGTTGGTATAACT AATTGTGAAGCAGGAGAGGACTGGAGGAGCAGTAACCGAACTATGGAGAC TAGCAAGATCAACTGTATCAAGAAATAGAAACTGGACTTGCATGTTTTAC 
CTAACATGGCTCCACGGTTCGGTTCAGGATCTTCAGCTACAGAATGTACC TTCTTCGGATGTTCTTTGAGTAGCGTTGAGTTCAGAGAACTTCTGAGCAT CAAATGCTGGATTTCAGCTTCGATCTTTCGCTGGAAAACGCCTTCAAGTT CAGCTTCAATCTTCGCGTTTTCGAGTTCTTGGATCCTGGCTTCCTTTGCG TCGA*GATCAGCTCTCGCTTCCTCGAGTTTAACTTGGAGAAGTTTCACCT TCTGCTTCAAG*ATCAGTACCTGTGATCCTGAATTGTCTTCATCAATATT CGAATGTAGTGGTGGTATAGATTCCTTACTCagCTcCTGGAAACTtGTaa cCTCtTTccaaAGAGCttCcTGCaaAGCTgcaagaCAtCTGATAGaATCt gcaacATGATCCTtgtctgcctgataatggntgntcttcttcttatggct ctcatctccaccatct BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 51 61 66 66 66 66 66 50 50 50 45 45 47 47 50 50 61 56 56 56 56 56 50 61 50 50 50 50 49 45 49 49 50 50 61 61 66 66 66 66 66 66 57 52 52 48 47 45 45 45 45 45 45 50 52 48 48 55 55 47 47 47 47 52 52 66 61 61 50 56 50 47 47 50 50 50 50 56 56 50 50 50 50 48 52 55 55 55 55 50 66 52 56 56 55 55 50 50 50 50 50 50 55 55 55 55 55 61 61 61 61 61 61 61 61 61 66 66 54 66 66 53 53 53 66 66 66 66 66 66 66 66 66 61 61 61 66 66 66 66 61 61 55 55 55 55 55 55 61 55 55 55 55 53 53 61 51 55 55 55 55 55 66 66 55 55 55 55 55 53 53 53 53 53 53 53 61 61 61 61 61 61 66 66 66 66 66 61 61 61 53 53 53 53 53 53 53 61 61 61 61 61 61 61 61 55 55 53 53 53 53 53 53 53 53 53 55 61 66 53 53 53 53 53 53 53 55 61 61 61 61 61 55 53 53 53 53 53 53 66 66 66 61 61 61 61 61 61 61 61 61 55 53 53 53 53 53 53 53 53 53 53 53 53 53 53 66 66 61 61 55 53 53 53 53 53 53 61 61 61 56 53 53 53 53 53 53 43 43 43 51 56 56 66 61 61 61 61 61 61 61 66 66 66 61 61 61 55 53 53 53 53 53 53 51 56 56 56 51 55 53 53 53 50 61 61 66 66 66 66 66 66 66 66 66 66 66 66 66 56 56 56 51 51 40 40 40 40 40 40 51 56 51 51 46 46 42 42 42 46 45 45 45 45 55 55 61 61 66 66 66 66 56 66 56 56 56 55 55 55 55 55 51 56 56 56 56 56 45 45 45 45 45 40 51 51 56 56 56 56 40 35 35 35 45 45 46 56 56 61 51 51 51 51 51 51 51 51 56 51 51 51 51 51 45 45 40 40 40 40 45 51 56 56 51 40 40 37 37 37 40 40 40 40 40 40 40 46 56 56 56 48 48 48 48 48 48 48 48 48 29 32 32 32 32 36 36 40 39 34 29 29 29 29 35 42 42 42 42 44 44 37 40 40 37 34 29 32 
32 27 27 27 25 25 32 48 48 40 48 40 46 44 34 32 32 32 32 25 29 27 27 25 25 37 40 40 40 34 34 40 40 40 40 33 32 32 32 32 32 29 25 25 29 29 29 29 28 28 33 33 40 33 40 36 36 36 48 48 34 34 48 48 29 29 29 40 40 40 29 34 29 29 27 32 37 29 29 29 40 29 29 29 32 40 40 40 40 32 26 29 25 25 21 27 21 25 29 29 40 32 30 28 33 33 40 34 35 33 33 33 25 25 18 18 20 23 19 20 29 24 25 29 26 29 29 25 18 23 24 19 19 18 25 25 25 18 22 21 18 15 19 19 23 25 25 25 21 18 18 20 16 24 21 21 14 18 29 40 27 21 15 13 8 10 11 14 20 24 19 25 32 32 29 29 26 25 19 29 29 21 19 10 10 10 15 19 25 40 40 28 27 25 25 20 17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2291r U -43 AF LL2291f U 14 BS 1 65 LL2291r BS 66 77 LL2291f BS 78 79 LL2291r BS 80 80 LL2291f BS 81 81 LL2291r BS 82 84 LL2291f BS 85 86 LL2291r BS 87 93 LL2291f BS 94 95 LL2291r BS 96 107 LL2291f BS 108 110 LL2291r BS 111 112 LL2291f BS 113 145 LL2291r BS 146 148 LL2291f BS 149 150 LL2291r BS 151 151 LL2291f BS 152 166 LL2291r BS 167 167 LL2291f BS 168 391 LL2291r BS 392 399 LL2291f BS 400 400 LL2291r BS 401 401 LL2291f BS 402 404 LL2291r BS 405 409 LL2291f BS 410 431 LL2291r BS 432 433 LL2291f BS 434 436 LL2291r BS 437 437 LL2291f BS 438 766 LL2291r RD LL2291r 810 0 0 atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacgc gtccgtttttgatgtaatacAGTTTTTTCTGAATAAAATTTAACATTTAT TCAAAAAAAAAAAAATAATTAGATTACAAGCTTAACAAATTTATTATAAA TCATCATACCTTGTTCATTTATAAATGAAAATCTGAAGAAAACAAAAAAT GTTTGGTAAACACCAACATGAACTGTTGTCAGGTTGGTATAACTAATTGT GAAGCAGGAGAGGACTGGAGGAGCAGTAACCGAACTATGGAGACTAGCAA GATCAACTGTATCAAGAAATAGAAACTGGACTTGCATGTTTTACCTAACA TGGCTCCACGGTTCGGTTCAGGATCTTCAGCTACAGAATGTACCTTCTTC GGATGTTCTTTGAGTAGCGTTGAGTTCAGAGAACTTCTGAGCATCAAATG CTGGATTTCAGCTTCGATCTTTCGCTGGAAAACGCCTTCAAGTTCAGCTT CAATCTTCGCGTTTTCGAGTTCTTGGATCCTGGCTTCCTTTGCGTCGA*G ATCAGCTCTCGCTTCCTCGAGTTTAACTTGGAGAAGTTTCACCTTCTGCT TCAAG*ATCAGTACCTGTGATCCTGAATTGTCTTCATCAATATTCGAATG TAGTGGTGGTATAGATTCCTTACTCagCTcCTGGAAACTtGTaacCTCtT 
TccaaAGAGCttCcTGCaaAGCTgcaagaCAtCTGATAGaATCtgcaacA TGATCCTtgtctgcctgataatggntgntcttcttcttatggctctcatc tccaccatct QA 71 758 45 810 DS CHROMAT_FILE: LL2291r PHD_FILE: LL2291r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:24 2000 RD LL2291f 810 0 0 cctttatgtatctagTTTTTTCTGAATAAAATTTAACATTTATTCAAAAA AAAAAAAATAATTAGATTACAAGCTTAACAAATTTATTATAAATCATCAT ACCTTGTTCATTTATAAATGAAAATCTGAAGAAAACAAAAAATGTTTGGT AAACACCAACATGAACTGTTGTCAGGTTGGTATAACTAATTGTGAAGCAG GAGAGGACTGGAGGAGCAGTAACCGAACTATGGAGACTAGCAAGATCAAC TGTATCAAGAAATAGAAACTGGACTTGCATGTTTTACCTAACATGGCTCC ACggctcGGTTCAGGATCTTCAGCTACAgactgTACCTTCTTCGGATGTT CTTtgcgtactgccgagtttagAGAACTTCTGAGCATCAAATGCTGGATT tccgttcctatttTtcGCTGGAAAACGCCTTCAagctcAGCTtccttctt tccgtctccgagttcctggaccttggctttcttt*cgccctcgatcagtt ctcgctttctcgaccctaacttgcccaagccccaccttccgcctcaagca tcagcccctgcgattccgaattgtctccttccctccccgacagtcccgct cgtcctctctctctctcttctcctgcacttcctctcccccgtttcccgtc cccgcgcttccccgctcccgcctgtcgtcttcgcccctccctcctcgctc ttcccttcccttcccttcccttcctcctttgccgcctcccttctcgtgtc ttatccgccgatgcgctcccttctccattctccgcccggctcgcgccaat gcgcctaccc QA 15 354 14 576 DS CHROMAT_FILE: LL2291f PHD_FILE: LL2291f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:47 2000 CO Contig22 1048 2 38 U ttagtttctcattcttgatttggtacgcttgatgaagtgatattgtcata caaatcccatgtcccAAAACACTGAAAGATCTACAAACAGCTCTCTAACC TCATCAAACGTTAACTGTTCTTCGGTTTTCTCTCAAGTCTTTATTTGACT GGCCCTGTCCTTGTTCTTGTTCATCATCTCCATCTTCTTCCCCTTCCCCT TCTTCATCTTCATACACCACCATAACAACATTACCATCCTCATCTACTTC ATACTCTACATCATCCCCTTCTTCCTCTTCTTCAAACTCCTCTTCCTCTT CCTCATCATCATCAAAGTCGTCATCCTCGTCCATACTTGCGCGAACCGAG TGAAACTCGGGAGGGGGAGGGCAGTCTTCTGTAATAGCCTCATCACGTTT CATCACAAGATGCCTAATAACCCTCTCATCACCATCCAACATCCCTTTGA ACTCGTTCAAGTGTTTCGCTTCCATCTCAAAGTTCATCAGTATATAATGT GCATTCTCCGCCTTCTGTATCTTATACGCCAGTCTACGCATTCCCCAGTC ACTAAACCTCCACACTTTCCCCTTCTTCTCCTTCAGAAACTCTTGAACTT TCTCATTGACGCTTTTGACCTCCTCTGCATGTTTCTCATGAATCAAGTAA ACCACCTCATAGTGTCTCATTCTTTCTTCATTCAAGTCGCTCTGAAGCTG AAGATCAAGAAACTCGTATAGCTCCTTTTCTTCATCATCAGCGAACTCAG 
GGAGAAGATCACCTTCCTCATCAACCTTCTTCTCTTTGAGAAGAATGGAT TCGGGAAACGGACCGGTGGCTTCGTCAGGTCTAGCGGAGAAGCTGTGATT GTCATCTTTCTTGCTTTTCTTCTTCTTCGCGGCGGCGACCAAAGGAAGGG ACTTTGATCCTCTGCGGCCGAGACTAGGTATGAAGCACACGCGGGGAGAG TTGATGAACGAATCTCGGCCGTCTATTCTGGGACCGAGAGATACTAGAGA CGAAGACGAGTGcaggattgactccattggaggaatgcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 51 51 51 40 40 40 51 51 51 51 40 46 46 46 46 46 46 46 46 46 51 40 39 39 39 39 39 39 40 40 40 40 40 40 40 40 40 46 46 46 40 40 45 35 35 35 35 35 35 40 40 40 46 42 42 43 46 46 42 44 42 46 46 51 51 40 40 40 40 40 40 46 46 37 37 37 40 40 37 37 43 43 43 43 38 43 43 43 43 45 41 41 41 41 41 41 41 46 51 51 51 51 51 51 51 51 51 51 51 43 43 43 43 43 43 45 51 51 46 46 38 38 38 43 43 43 45 45 45 45 51 51 56 56 56 56 51 51 51 51 51 51 43 43 43 51 51 51 56 56 51 45 45 45 45 45 51 56 56 56 56 56 51 45 45 45 45 45 45 45 45 45 51 51 51 51 45 45 43 43 43 43 43 43 51 40 40 40 38 38 43 45 45 51 51 56 51 51 51 51 51 45 45 45 45 45 45 51 51 51 51 51 51 51 56 56 56 51 43 43 43 43 43 45 51 60 60 66 66 66 66 66 66 58 60 60 58 66 66 71 88 88 88 88 88 88 90 81 76 75 63 67 60 58 58 58 86 75 76 65 68 66 64 64 58 76 80 72 64 55 56 53 58 62 62 60 62 58 62 62 62 66 66 74 81 75 81 66 62 66 69 65 64 60 64 71 71 68 68 74 74 74 80 71 65 51 51 35 33 57 51 58 71 67 62 70 68 59 65 64 64 61 61 71 68 79 81 71 66 61 61 55 56 69 73 76 79 77 80 80 64 63 78 81 81 70 63 58 61 62 72 69 57 61 61 71 71 71 77 80 80 80 82 66 66 65 69 74 79 85 85 85 87 90 83 71 74 75 68 68 64 78 81 64 64 62 71 68 57 57 59 61 54 65 77 88 80 80 52 51 48 58 58 56 53 57 74 71 82 70 63 71 74 69 75 84 80 80 76 79 68 80 85 85 73 73 84 76 82 75 68 67 62 65 67 67 75 81 80 75 67 67 74 70 66 66 69 76 73 88 84 84 90 88 81 76 76 74 62 57 67 67 60 49 56 49 55 55 51 58 56 60 62 76 64 83 83 87 78 78 75 69 69 64 73 73 67 65 58 60 60 66 62 65 70 72 67 73 73 78 68 75 90 90 90 90 90 90 90 90 90 90 90 90 83 83 77 80 79 77 
73 74 81 83 83 80 83 90 80 80 80 85 85 86 88 88 88 90 90 90 90 78 75 77 77 72 74 77 71 71 66 67 66 69 76 80 83 81 78 66 83 80 81 81 81 76 79 84 81 83 83 80 80 75 75 71 75 69 56 54 54 54 54 66 66 76 67 65 64 73 65 61 55 57 59 50 49 49 54 55 69 74 79 81 79 79 79 73 69 67 66 65 65 71 66 56 45 45 45 45 45 45 45 45 45 56 75 66 65 59 66 67 72 75 71 71 73 73 56 56 56 56 56 56 56 56 43 43 43 54 56 59 69 67 60 55 55 66 66 71 70 70 66 66 64 64 67 69 71 74 74 57 47 46 46 50 49 36 43 45 45 45 45 45 45 45 45 45 45 51 51 51 51 56 56 56 56 56 56 56 56 51 51 51 51 63 56 56 56 56 56 56 51 51 51 51 51 51 56 56 51 51 51 51 51 51 51 51 51 51 51 56 56 45 45 45 43 43 43 45 43 43 45 45 45 45 45 43 43 43 43 43 43 56 45 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 43 43 43 43 45 45 56 51 51 51 51 45 45 51 51 51 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 45 43 43 43 35 34 34 34 34 34 34 35 35 43 51 56 56 43 43 43 43 43 43 43 43 43 43 43 51 51 51 51 51 51 51 51 35 35 35 35 35 35 43 43 43 43 43 43 45 45 45 45 45 45 40 46 39 35 35 35 35 35 35 34 34 34 34 34 35 40 40 35 35 35 39 39 39 40 40 40 40 39 39 39 39 39 39 40 35 35 35 35 35 35 35 35 35 35 39 39 39 40 46 46 46 46 51 45 45 45 45 45 45 51 40 40 40 40 40 40 40 40 40 51 51 51 51 51 51 46 46 42 35 35 35 35 35 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2292f U -14 AF LL2292r C 289 BS 1 319 LL2292f BS 320 322 LL2292r BS 323 373 LL2292f BS 374 375 LL2292r BS 376 424 LL2292f BS 425 428 LL2292r BS 429 452 LL2292f BS 453 458 LL2292r BS 459 464 LL2292f BS 465 471 LL2292r BS 472 475 LL2292f BS 476 481 LL2292r BS 482 484 LL2292f BS 485 486 LL2292r BS 487 490 LL2292f BS 491 494 LL2292r BS 495 495 LL2292f BS 496 506 LL2292r BS 507 518 LL2292f BS 519 519 LL2292r BS 520 529 LL2292f BS 530 536 LL2292r BS 537 539 LL2292f BS 540 544 LL2292r BS 545 550 LL2292f BS 551 552 LL2292r BS 553 558 LL2292f BS 559 568 LL2292r BS 569 569 LL2292f BS 570 652 LL2292r BS 653 681 LL2292f BS 682 692 LL2292r BS 693 
704 LL2292f BS 705 715 LL2292r BS 716 743 LL2292f BS 744 771 LL2292r BS 772 772 LL2292f BS 773 1048 LL2292r RD LL2292f 802 0 0 cttaagggaacctgnttagtttctcattcttgatttggtacgcttgatga agtgatattgtcatacaaatcccatgtcccAAAACACTGAAAGATCTACA AACAGCTCTCTAACCTCATCAAACGTTAACTGTTCTTCGGTTTTCTCTCA AGTCTTTATTTGACTGGCCCTGTCCTTGTTCTTGTTCATCATCTCCATCT TCTTCCCCTTCCCCTTCTTCATCTTCATACACCACCATAACAACATTACC ATCCTCATCTACTTCATACTCTACATCATCCCCTTCTTCCTCTTCTTCAA ACTCCTCTTCCTCTTCCTCATCATCATCAAAGTCGTCATCCTCGTCCATA CTTGCGCGAACCGAGTGAAACTCGGGAGGGGGAGGGCAGTCTTCTGTAAT AGCCTCATCACGTTTCATCACAAGATGCCTAATAACCCTCTCATCACCAT CCAACATCCCTTTGAACTCGTTCAAGTGTTTCGCTTCCATCTCAAAGTTC ATCAGTATATAATGTGCATTCTCCGCCTTCTGTATCTTATACGCCAGTCT ACGCATTCCCCAGTCACTAAACCTCCACACTTTCCCCTTCTTCTCCTTCA GAAACTCTTGAACTTTCTCATTGACGCTTTTGACCTCCTCTGCATGTTTC TCATGAATCAAGTAAACCACCTCATAGTGTCTCATTCTTTCTTCATTCAa gccGCTCTGAAGCTGAAGATCAAgacacTCGTATAGCTCCTTTTCTTCAT CATCAGCGAActtagGGAGAAGATCACctttctCATCAACCTTCTTCTct tg QA 81 667 16 801 DS CHROMAT_FILE: LL2292f PHD_FILE: LL2292f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:52 2000 RD LL2292r 804 0 0 cctCTTCCTCTTCCTCATCATCATCAAAGTCGTCATCCTCGTCCATACTT GCGCGAACCGAGTGAAACTCGGGAGGGGGAGGGCAGTCTTCTGTAATAGC CTCATCACGTTTCATCACAAGATGCCTAATAACCCTCTCATCACCATCCA ACATCCCTTTGAACTCGTTCAAGTGTTTCGCTTCCATCTCAAAGTTCATC AGTATATAATGTGCATTCTCCGCCTTCTGTATCTTATACGCCAGTCTACG CATTCCCCAGTCACTAAACCTCCACACTTTCCCCTTCTTCTCCTTCAGAA ACTCTTGAACTTTCTCATTGACGCTTTTGACCTCCTCTGCATGTTTCTCA TGAATCAAGTAAACCACCTCATAGTGTCTCATTCTTTCTTCATTCAAGTC GCTCTGAAGCTGAAGATCAAGAAACTCGTATAGCTCCTTTTCTTCATCAT CAGCGAACTCAGGGAGAAGATCACCTTCCTCATCAACCTTCTTCTCTTTG AGAAGAATGGATTCGGGAAACGGACCGGTGGCTTCGTCAGGTCTAGCGGA GAAGCTGTGATTGTCATCTTTCTTGCTTTTCTTCTTCTTCGCGGCGGCGA CCAAAGGAAGGGACTTTGATCCTCTGCGGCCGAGACTAGGTATGAAGCAC ACGCGGGGAGAGTTGATGAACGAATCTCGGCCGTCTATTCTGGGACCGAG AGATACTAGAGACGAAGACGAGTGcaggattgactccattggaggaatgc ggacgcgtggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxat QA 3 724 1 760 DS CHROMAT_FILE: LL2292r PHD_FILE: LL2292r.phd.1 CHEM: unknown DYE: 
unknown TIME: Tue Sep 19 09:45:29 2000 CO Contig23 1431 2 12 U cttttaagaacaaacactaataaaatattgaatcttctttttaacataac attccaatccaaagtatctcccttatgagaGCTCCAAGTGGAAAATAAAA AAATAAAACTGAATAAGGACTCTCATGTAGCTCATATTATCAAAATACAG CAACAATACAAATATATATATATATAGTAGATGATTAAGATCATTTTTTT AACTGGAAGTTAAAGACAAAGAAaccttcagaagATAacatgtaaCCACC AGATGATGAAGATACTATTTGTCTCCCTggatgagCTTaaactggaGAAT gtACCaGACAGATTCTTAAGCGTTGCAAGCGGAGACCCTGCATTTGTAAA GTATGACGCGAGGTTGATCtCCGAGTTGTTGTCGTATTTCTTCACGAAAG GATGTTCCATTAGTTCCCTAGCAGAGCTTCGACTGTTTGGGTCCTTTTGC AAACAAGTGGAGATGAATGAAGATAACTCAGGGGAGAAGCttTCTGAAGG AAGagttggTGGTGGTTGGTCCACGATTGCTTCCATCAACTCGAAAACAC TGCTCCATGTTTCCTCTTCATCTGGTGGCAAATATGGGAACTTCCCTGTT GCACATTCAAGCACTACTAATCCCAAGCTCCATATATCGCTTTTGTTACC GTACTTGTTccCAACGATCCTCTCTGGAGACATATAGTTGTAAGTCCCAA CAAATGTGtttgCTAAACCCGCGGTGTTAGTCATAACGGTACTCACACCG AAGTCAGTGATCTTGACCTCTCCTCTGTGGTTGACCAACAGATTCGATGG TTTCAAGTCACGGTGGATGATGTGCTTGTCATGATGAAGATAGATCAGTC CTTGAAGCACTTGCTTAAAGATGGTAGAAAGATAGGACTCAGGGATGGTT TTGACTGATTTGAGAAAATCTTCTAGAGATCCACCGTCCATGTACTCCAA AATCAGCGAGATTGCGCCATTATCGTAAAACGATTGGTACGAGGTAACAA GGTACGGACATTGTGACGACTGGTTTATTTTAAGCTCTTGTGCAATTGAC TTGCGAATCGCTTCATCGACGTTTAGTTGAATGACCTTTAAGGCGAAGAA TTGACCAGTCCATTTGTGTTGAACAAGCTGCACAACACCGCTGCTTCCTT TACCAATGACTTTAACCATGTCCAAATCGGATAAGCTCAACTCATCATCC GCTGGCTTAATCGGAGACAGAGCTTCAGGCTCAGACTGAGAGACGATTCG AACTCCGTCTTTGTTGACTCGCAGATCTCCATCCTTGAACGTACCGCTCT GCGTCAAGAATTTGGTGATGGATTGCTCGCCAGCAGGAGGGATTGAGAGC TTGAGATTATTGCTGAATCCACCCTTCTTCATTTCGTCTTCTTCTtcctt cttctcctcgtcgctttccccggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 35 35 35 32 29 32 33 26 30 29 36 36 42 32 29 29 32 32 32 47 47 39 39 35 35 35 35 42 35 35 35 35 35 32 35 35 35 35 35 35 35 35 32 35 35 35 32 32 32 29 29 32 29 29 35 35 35 35 35 35 29 35 35 30 35 35 35 26 26 30 30 30 30 29 29 29 29 31 31 32 35 35 35 35 37 37 37 37 37 37 37 37 37 35 32 32 31 31 29 27 27 35 42 42 
42 42 42 35 35 33 42 37 37 42 48 48 56 40 39 37 31 31 31 31 31 35 30 30 20 20 29 25 25 29 33 37 30 28 28 28 28 28 27 12 11 7 7 7 9 11 14 14 19 18 25 25 25 16 16 9 9 10 13 14 18 21 33 27 27 30 26 26 28 33 33 42 42 33 42 30 28 28 29 27 29 29 29 29 39 33 33 30 28 28 28 28 28 28 16 13 6 6 6 13 13 20 22 22 19 18 13 10 10 10 16 13 21 21 23 23 19 19 20 24 24 19 20 20 29 29 35 33 33 28 28 28 36 33 29 27 28 29 24 24 28 28 28 22 28 29 29 29 26 29 27 29 29 29 26 23 23 23 26 35 28 29 29 29 32 42 44 37 37 35 32 29 29 32 29 29 29 32 22 22 22 24 24 24 24 19 23 23 32 35 35 32 32 28 29 30 26 29 29 29 29 35 35 39 29 29 29 29 28 28 33 33 35 35 44 47 44 42 42 35 35 35 35 30 35 36 36 30 30 35 35 35 35 33 35 33 37 44 44 44 35 35 35 36 30 27 27 29 29 32 35 38 56 47 47 47 47 47 47 47 47 48 45 35 35 35 35 35 42 42 39 39 37 37 42 47 42 42 42 42 42 42 42 42 42 42 37 37 37 42 35 33 42 30 30 30 33 37 40 40 40 40 40 40 31 27 16 18 21 27 27 29 40 40 40 40 47 47 23 16 9 9 9 16 15 22 23 25 30 42 42 42 42 42 42 42 42 35 35 35 35 35 35 35 35 35 35 35 35 42 43 48 47 37 36 36 35 35 35 40 40 46 51 37 37 37 35 32 32 32 35 35 40 45 40 37 37 37 37 37 39 45 45 45 45 37 37 37 40 45 40 42 42 42 38 38 39 39 40 40 40 40 56 51 37 40 37 35 35 35 42 44 48 42 42 42 37 37 37 46 46 51 56 56 56 56 46 37 33 33 34 32 32 32 32 32 29 29 29 34 40 32 29 34 29 29 31 37 46 46 46 40 32 32 25 25 25 32 30 46 30 30 30 39 30 39 33 34 34 40 32 32 24 24 21 19 19 29 25 25 29 32 32 32 40 40 40 40 40 40 37 34 34 29 29 24 28 25 32 33 37 34 33 33 41 43 43 41 42 40 44 44 37 37 30 30 20 22 22 37 40 32 29 23 17 12 19 19 32 40 37 37 40 45 37 30 39 35 42 44 45 47 40 40 47 55 53 72 72 54 44 49 47 38 34 48 59 60 60 61 53 53 52 49 47 47 41 44 47 52 52 63 49 49 44 61 61 63 58 48 50 54 55 63 58 61 44 37 37 40 42 48 47 44 47 44 34 37 40 42 34 41 33 33 40 46 47 48 32 29 34 32 32 29 27 30 25 25 25 34 32 25 29 32 32 32 29 27 27 25 22 22 34 40 48 40 44 44 46 46 46 56 56 56 56 56 56 56 51 51 51 51 51 46 46 42 44 43 51 42 42 40 40 40 40 40 40 40 46 40 
40 34 40 40 40 48 34 34 48 32 32 32 32 39 39 39 31 35 35 35 35 34 40 40 44 44 40 48 40 32 32 32 32 32 32 36 44 48 48 46 40 39 39 39 35 39 40 40 37 40 40 51 51 51 51 51 51 51 51 51 56 51 51 51 51 45 45 45 40 40 40 45 45 56 56 56 56 56 56 45 45 45 45 45 45 51 45 45 45 45 45 45 51 56 56 56 56 56 45 39 39 39 39 39 40 56 56 45 39 39 39 39 39 39 40 45 51 40 39 39 39 39 39 45 45 51 51 51 45 45 45 45 45 45 45 45 56 51 45 45 45 45 51 51 56 45 45 45 45 45 45 51 51 51 51 56 56 56 56 56 51 51 51 56 51 51 51 51 51 51 51 56 56 56 56 56 45 45 45 45 51 51 56 56 56 56 56 51 51 51 51 51 51 51 51 51 43 43 43 43 43 43 45 45 45 45 45 45 56 56 56 56 56 51 45 45 45 43 43 43 43 43 43 45 56 56 56 56 56 56 56 56 56 56 56 51 45 45 45 45 45 45 56 56 56 51 51 51 56 56 45 45 45 45 45 45 56 56 56 56 56 56 56 51 51 51 51 51 51 46 43 43 43 43 43 43 43 43 43 51 56 56 46 43 43 43 43 43 45 45 45 51 56 56 56 56 56 56 51 51 56 51 51 51 56 56 51 45 45 45 45 45 56 43 43 43 43 43 43 56 56 45 45 45 45 45 45 43 43 43 43 43 43 56 56 56 56 56 56 56 56 56 56 43 43 43 43 43 43 45 45 45 45 45 45 43 43 43 43 43 43 45 45 45 43 43 43 43 43 45 56 51 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 45 45 45 45 43 43 43 43 43 43 43 51 43 43 43 43 43 43 43 51 51 51 43 43 43 43 36 31 31 31 31 31 35 43 43 51 56 56 56 56 56 56 56 51 51 51 43 43 43 43 43 43 43 43 43 43 43 43 36 35 35 35 35 43 43 45 45 45 45 51 56 51 51 51 51 51 51 51 45 45 45 45 45 43 35 35 35 35 35 35 35 35 35 35 39 51 51 51 51 51 51 40 40 40 40 40 40 45 40 40 40 40 40 40 39 35 35 35 35 35 40 40 40 40 40 46 40 51 51 51 51 51 51 51 51 46 46 46 46 46 40 35 35 35 35 35 35 45 44 40 40 40 40 46 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2293f U 1 AF LL2293r C 682 BS 1 685 LL2293f BS 686 706 LL2293r BS 707 711 LL2293f BS 712 717 LL2293r BS 718 718 LL2293f BS 719 720 LL2293r BS 721 721 LL2293f BS 722 728 LL2293r BS 729 737 LL2293f BS 738 747 LL2293r BS 748 749 LL2293f BS 750 1431 LL2293r RD LL2293f 794 0 0 
cttttaagaacaaacactaataaaatattgaatcttctttttaacataac attccaatccaaagtatctcccttatgagaGCTCCAAGTGGAAAATAAAA AAATAAAACTGAATAAGGACTCTCATGTAGCTCATATTATCAAAATACAG CAACAATACAAATATATATATATATAGTAGATGATTAAGATCATTTTTTT AACTGGAAGTTAAAGACAAAGAAaccttcagaagATAacatgtaaCCACC AGATGATGAAGATACTATTTGTCTCCCTggatgagCTTaaactggaGAAT gtACCaGACAGATTCTTAAGCGTTGCAAGCGGAGACCCTGCATTTGTAAA GTATGACGCGAGGTTGATCtCCGAGTTGTTGTCGTATTTCTTCACGAAAG GATGTTCCATTAGTTCCCTAGCAGAGCTTCGACTGTTTGGGTCCTTTTGC AAACAAGTGGAGATGAATGAAGATAACTCAGGGGAGAAGCttTCTGAAGG AAGagttggTGGTGGTTGGTCCACGATTGCTTCCATCAACTCGAAAACAC TGCTCCATGTTTCCTCTTCATCTGGTGGCAAATATGGGAACTTCCCTGTT GCACATTCAAGCACTACTAATCCCAAGCTCCATATATCGCTTTTGTTACC GTACTTGTTccCAACGATCCTCTCTGGAGACATATAGTTGTaAGTcCCAa canaTGTGtttgCTAAACCCGCGGTGTTAGTCATAACGGTACTCACACCG AAGTCAGTGATCTTGACCTCTCCTCTGTGGTTGACCAACAGATT QA 81 794 1 794 DS CHROMAT_FILE: LL2293f PHD_FILE: LL2293f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:56 2000 RD LL2293r 794 0 0 atATAGTTGTAAGTCCCAACAAATGTGtntgCTAAACCCGCGGTGTTAGT CATAACGGTACTCACACCGAAGTCAGTGATCTTGACCTCTCCTCTGTGGT TGACCAACAGATTCGATGGTTTCAAGTCACGGTGGATGATGTGCTTGTCA TGATGAAGATAGATCAGTCCTTGAAGCACTTGCTTAAAGATGGTAGAAAG ATAGGACTCAGGGATGGTTTTGACTGATTTGAGAAAATCTTCTAGAGATC CACCGTCCATGTACTCCAAAATCAGCGAGATTGCGCCATTATCGTAAAAC GATTGGTACGAGGTAACAAGGTACGGACATTGTGACGACTGGTTTATTTT AAGCTCTTGTGCAATTGACTTGCGAATCGCTTCATCGACGTTTAGTTGAA TGACCTTTAAGGCGAAGAATTGACCAGTCCATTTGTGTTGAACAAGCTGC ACAACACCGCTGCTTCCTTTACCAATGACTTTAACCATGTCCAAATCGGA TAAGCTCAACTCATCATCCGCTGGCTTAATCGGAGACAGAGCTTCAGGCT CAGACTGAGAGACGATTCGAACTCCGTCTTTGTTGACTCGCAGATCTCCA TCCTTGAACGTACCGCTCTGCGTCAAGAATTTGGTGATGGATTGCTCGCC AGCAGGAGGGATTGAGAGCTTGAGATTATTGCTGAATCCACCCTTCTTCA TTTCGTCTTCTTCTtccttcttctcctcgtcgctttccccggacgcgtgg xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaat QA 31 714 1 750 DS CHROMAT_FILE: LL2293r PHD_FILE: LL2293r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:33 2000 CO Contig24 1039 2 38 U agttactattacagatgtatgagacggccactaaaatgaagcagcttatT 
TTGTCAAATGTAAAAAAGAAAGTAGTCATGTGGAATGATATTTTTCGATT TGTTTTTTTTTTCTTTCTTATAAATGGCATTTTCAGATTCAGAAACTGGA GTCGTCAGTAGATGGACGCCTCAAGAAGTACCTCAGACTATTGGCAAGAA CCTGCTGAAGAGGCTTGATTAGGGCTTTGTTTCTGTAACTTACGTGGAAC TTAACCTTAGCCAACAAACCCTCAGTGTCGAATTCTCCAGATTCGACAGC GACGCTGATATCTGTAATGTTCTTCACTAGCTCAACTAATAGTCCAGGCC TATCCGCTGTTTCTATGTATAGTAAACTGCGGTCTGGTCCATCATCTTCA ATTTTTATATGCGTTGCTATGTCCACATCAACCTGTTCAGTTGGTGGAAG AACACCAAAAGCTGCTCCCATTGCCAACTGAGAACTTGATTCAGGATGAA ACTCGAGCAAATTGTTAATGACAGTGAGACGGATGGCCTCGAGCAACTCA GGGTCTTCTACTTTTCTTCCACTATCCGCTTTAGTAATGGCAAATTTGTT GTGCTTGCCAGAAGAATCGAGGTAGACATTAGCCTTGACAACATTCAGAC CCAAATTTTTGAGCGCATTCATAGTGTCAAGTAGAGCTCCTAGACGATCT CCAAATGTTACTTCAACAACGGTTGCATCAGGGTCAGAATCTTGGTCGAT TATGACCACTGGAGTTGGGACTTCGTCAGAATCCTGCTTACCATTCTCCA CAGCAGTAGCGTTTGCTGCTCGAGGCTTCAACCCCGATAATCTGAGTCGC TGAAGATTTATAATAGGCTTATCGACAAATCCAAACGTGATTGTGCTAGG AGGAAGACATAAAGCTCCAAGATCACGTCTGATTGCGGGAGAAGTAGAAG CATCAGTGAAGCAAAGAGCAGAACCAGAAGCAGAGGCCATAGCCACCATA GCtgacttgaacccgaagcaaaaaaaaacggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 40 40 39 35 35 37 37 37 51 51 51 51 51 51 51 51 51 51 51 51 40 35 35 35 35 35 37 37 37 40 56 56 56 56 46 40 40 40 40 40 40 40 51 40 35 35 35 35 35 40 51 51 51 51 51 56 56 56 56 56 56 56 56 51 51 51 46 46 46 46 46 40 40 37 37 37 40 40 40 40 56 51 51 51 51 51 51 51 45 45 40 40 40 35 35 35 35 35 40 46 43 43 43 43 43 43 51 51 51 51 51 43 56 43 43 43 43 43 43 43 43 43 43 43 51 51 45 45 51 40 43 43 45 45 43 43 36 43 43 43 40 41 41 45 45 45 41 41 45 45 45 43 43 43 43 43 43 43 56 56 56 45 45 45 41 41 41 43 43 43 43 43 43 43 43 43 43 43 51 51 51 51 51 51 43 45 43 43 43 43 43 43 43 36 36 40 45 43 43 43 43 43 43 45 45 45 45 51 51 51 51 45 45 45 45 45 45 45 43 43 43 43 45 45 56 56 38 43 43 43 43 43 51 51 51 51 51 51 51 51 51 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 56 41 41 44 46 43 46 56 56 56 50 56 56 56 56 51 43 43 43 43 43 51 45 45 45 43 43 43 43 43 43 45 56 56 51 51 51 
51 51 51 51 51 51 51 45 43 43 43 38 38 38 46 56 56 56 51 51 51 51 51 51 51 51 51 51 56 46 46 43 43 41 49 41 45 46 56 56 56 56 51 56 56 66 71 56 56 56 56 45 45 45 51 51 51 56 56 56 56 56 51 51 45 45 51 59 48 46 46 46 47 49 47 53 59 59 59 59 59 56 56 56 45 45 45 45 45 40 43 43 51 51 51 43 56 56 56 56 56 56 56 56 56 56 56 45 45 53 55 54 53 64 52 49 57 61 58 52 52 52 53 58 63 46 35 35 35 35 39 40 45 45 40 40 42 40 45 45 45 45 40 40 59 58 60 64 61 61 60 63 64 64 53 55 57 63 60 58 51 51 51 51 51 51 46 46 38 38 37 40 40 66 66 64 54 63 59 57 55 54 49 45 46 48 49 52 50 48 46 43 42 46 48 65 60 58 44 37 36 45 50 50 55 49 47 47 58 50 58 57 49 52 56 53 45 48 48 50 46 52 55 55 46 56 59 61 59 58 59 59 54 52 47 56 52 54 56 56 49 46 46 32 27 33 33 28 29 31 29 32 27 40 40 40 39 36 36 33 37 40 40 45 42 47 51 49 54 58 59 66 67 64 50 38 33 36 38 45 45 51 53 44 44 42 45 47 50 45 51 50 48 44 37 37 49 73 57 66 69 77 69 72 76 56 59 59 63 82 63 68 85 63 58 56 56 48 56 56 54 55 69 70 73 77 74 69 63 61 63 68 62 71 67 70 73 60 65 60 55 53 51 40 40 48 50 52 54 55 57 61 58 57 53 47 47 49 55 47 51 42 48 48 60 59 56 59 59 62 50 45 40 40 39 44 39 35 35 56 48 51 57 64 63 54 52 51 55 57 54 49 52 52 49 56 54 54 54 44 48 45 49 58 58 68 70 71 76 71 69 64 74 74 70 54 54 59 57 56 56 70 73 51 45 45 40 35 35 35 35 35 39 45 45 45 51 51 51 51 51 51 51 51 51 56 56 56 51 51 51 51 51 51 46 39 35 35 35 39 39 39 39 39 39 39 39 39 40 45 45 40 45 40 40 40 40 45 45 51 51 51 51 51 51 51 56 46 46 46 39 39 39 45 45 45 51 40 40 39 39 39 39 39 39 40 51 51 51 56 56 51 56 51 45 45 45 45 45 51 51 51 51 51 51 43 43 43 43 43 51 56 56 56 56 56 51 51 43 43 43 43 43 43 45 43 43 43 43 43 43 51 45 45 45 45 51 45 45 45 45 51 45 51 51 43 43 43 43 43 43 51 56 56 56 51 51 51 43 35 35 35 35 35 43 45 45 43 43 43 43 43 43 51 56 51 45 45 45 45 45 45 45 51 51 51 51 46 40 39 39 39 39 40 40 45 45 45 56 45 39 39 39 39 39 40 51 51 51 45 51 45 40 40 40 40 40 40 51 51 51 51 39 35 35 35 35 35 40 40 56 40 40 37 37 35 35 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2300f U -30 AF LL2300r C 344 BS 1 379 LL2300f BS 380 380 LL2300r BS 381 390 LL2300f BS 391 392 LL2300r BS 393 412 LL2300f BS 413 426 LL2300r BS 427 454 LL2300f BS 455 471 LL2300r BS 472 489 LL2300f BS 490 505 LL2300r BS 506 518 LL2300f BS 519 593 LL2300r BS 594 600 LL2300f BS 601 606 LL2300r BS 607 607 LL2300f BS 608 611 LL2300r BS 612 621 LL2300f BS 622 626 LL2300r BS 627 630 LL2300f BS 631 632 LL2300r BS 633 633 LL2300f BS 634 634 LL2300r BS 635 635 LL2300f BS 636 637 LL2300r BS 638 638 LL2300f BS 639 639 LL2300r BS 640 641 LL2300f BS 642 643 LL2300r BS 644 644 LL2300f BS 645 670 LL2300r BS 671 671 LL2300f BS 672 679 LL2300r BS 680 682 LL2300f BS 683 683 LL2300r BS 684 684 LL2300f BS 685 706 LL2300r BS 707 768 LL2300f BS 769 1039 LL2300r RD LL2300f 804 0 0 ncttttatgnatagcgttggaagtagagaanagttactattacagatgta tgagacggccactaaaatgaagcagcttatTTTGTCAAATGTAAAAAAGA AAGTAGTCATGTGGAATGATATTTTTCGATTTGTTTTTTTTTTCTTTCTT ATAAATGGCATTTTCAGATTCAGAAACTGGAGTCGTCAGTAGATGGACGC CTCAAGAAGTACCTCAGACTATTGGCAAGAACCTGCTGAAGAGGCTTGAT TAGGGCTTTGTTTCTGTAACTTACGTGGAACTTAACCTTAGCCAACAAAC CCTCAGTGTCGAATTCTCCAGATTCGACAGCGACGCTGATATCTGTAATG TTCTTCACTAGCTCAACTAATAGTCCAGGCCTATCCGCTGTTTCTATGTA TAGTAAACTGCGGTCTGGTCCATCATCTTCAATTTTTATATGCGTTGCTA TGTCCACATCAACCTGTTCAGTTGGTGGAAGAACACCAAAAGCTGCTCCC ATTGCCAACTGAGAACTTGATTCAGGATGAAACTCGAGCAAATTGTTAAT GACAGTGAGACGGATGGCCTCGAGCAACTCAGGGTCTTCTACTTTTCTTC CACTATCCGCTTTAGTAATGGCaAATTTGTTGTGCTTGCCAGAAGAATCG AGGTAGACATTAGCCTTGACAACATTCAGACCCAAATTTTTGAGCGCATT CATAGTGTCAAGTAGAGCTCCTAGACGATCTCCAAATGTTACTTCAACAA CGGTTGCATCAGGGTCAGAATCTTGGTCGATTATGACCACTGGAGTTGGG ACTT QA 81 737 32 804 DS CHROMAT_FILE: LL2300f PHD_FILE: LL2300f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:52 2000 RD LL2300r 739 0 1 ccaggcc*atc*gctggttcaagg*atagt*aacTGCGGTctg*tCCATC ATctccaatttt*aTATGCGTTGCTATGTCCACATcancctgtncagttg *tgaaagAACACCAAAAGCTGCTCCCATTGCcanctgaganctTGATTCA GGATGAAACTCGAGcanantgTTAATGACAGTGAGACGGATGGCCTCGAG 
CAACTCAGGGTCTTCTACTTTTCTTCCACTATCCGCTTTAGTAATGGCAA atntGTTGTGCTTGCCAGAAGAATCGAGGTAGACATTAGCCTTGACAACA TTCAGACCCAAATTTTTGAGCGCATTCATAGTGTCAAGTAGAGCTCCTAG ACGATCTCCAAATGTTACTTCAACAACGGTTGCATCAGGGTCAGAATCTT GGTCGATTATGACCACTGGAGTTGGGACTTCGTCAGAATCCTGCTTACCA TTCTCCACAGCAGTAGCGTTTGCTGCTCGAGGCTTCAACCCCGATAATCT GAGTCGCTGAAGATTTATAATAGGCTTATCGACAAATCCAAACGTGATTG TGCTAGGAGGAAGACATAAAGCTCCAAGATCACGTCTGATTGCGGGAGAA GTAGAAGCATCAGTGAAGCAAAGAGCAGAACCAGAAGCAGAGGCCATAGC CACCATAGCtgacttgaacccgaagcaaaaaaaaacggacgcgtggxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 255 659 1 696 DS CHROMAT_FILE: LL2300r PHD_FILE: LL2300r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:29 2000 RT{ LL2300r compression phrap 100 102 000919:094547 } CO Contig25 1296 2 20 U gatggaagataacttcaATATATTACAAACTTATTAAAAGGAGTAGCTAA TGCAAATTCAGCAGAGAAAATAAGTGATTTGACATGATAATAACAACCAT ACTCGAAAACAAATGTAATTAAAACACTTGCTTCATCTCCTTTCACAGCA TCTTCACCTCTTCAACATTGTTGTCTTCAGTAGGTGTAGCCTTTGACGAA CATACCCTCCTTAGCCTCTTCAGACTCACCTTCTCCGGTGTATTTTCCGA GCTGAGCCAACGAATTGGCCTTGGCTCGAGTCAAGAGAGTGGTCTGAGCT GCGTTCACGTTCTCAGCTCTGCCTCCCCATGTTTTCAGGCAAGTGTTCTG AAGAGCACGTGCGTAGGAGAAGGACACGTGCCACGGGTTTGGTGCCTGGT TCATCGCGTTCAGGTTCAACGTTGCCTCCAACTCAGACTGTCCTCCGGAC AAGAACATGATTCCAGGGACGGCAGGAGGGATTCTGTTGCGGAGGAGCTT AAGGGTGTAGGAGGCAACTTGTTCAGGAGTAGCTCTGTCTTTAGACTCGG CTCCTGGAGTCACCATGCTCGGCTTCAGGAGGATACCTTCAAACATGACA TTGTTCTGAGCTAGGTAAAAGAAAACCTCAGCCCACACCTTCTCTGCTAC TTCGTAAGTCCTGTCAATGTCGTGTTCTCCATCCAACAAGATCTCTGGCT CCACTATGGGAACCAATCCACTGTCTTGTGAAATGGCAGCGTATCGAGCA AGGCCCCAAGCAGCTTCTTTCACAGCGAGAGCAGACGGACCGTTGGGAAT GCTCACCACAGTACGCCATTTGGCGAAACGAGCACCCTGTTGGTAGTAAG CAGCGGTCCGAGAGGATAGACCGTCAAGTCCTTGGCACCATGACTCATTG TTAGATCCAACAAGTGGCACCAAACCCTTGTCGACTTTGATACCGGGGAC GATGTTCTGCTCGACTAGGACGTCGACCATTTTCTTGCCTTCGGTGGTAG ACTGATAGAGAGTCTCCTCGAACAGGATTGCACCGGAGATGTACTGTCCG AGACCTGGTGCAGAGACAAGCAGTGTCCGGTATGCTTGACGGTTTGCCTC AGTGTTCTCTAGCCCTATCGAGTCCAAACGCTTCCCGCATGTGGCGTTCG ACTCGTCCATCGCCAAGATTCCTCGTCCAGGAGACGCAATTGTTTTCGCT 
GTCTTAACGAGCTCATCGGCGTAGGAAGAAGCGGCGCGGACGGTAAGGTA GGTGGCGCGGTTGCGTAGGACGACGGCGGCGGAAGcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 36 37 40 39 34 34 34 34 34 35 35 35 35 35 35 35 39 39 39 39 40 51 46 46 46 46 46 40 34 34 34 34 34 35 40 40 40 46 51 46 46 46 46 46 40 40 40 40 40 40 46 46 56 37 37 40 40 40 40 46 40 35 39 39 39 39 39 40 40 40 40 39 39 39 39 39 45 51 40 40 40 40 40 40 45 40 40 40 40 40 51 51 46 46 46 51 46 46 40 40 35 35 35 35 35 35 51 40 40 40 35 35 35 35 35 35 39 40 40 40 40 40 40 40 40 40 46 40 40 40 45 45 45 45 45 43 43 43 43 43 43 45 43 43 43 43 46 43 51 51 51 56 56 56 51 43 45 45 45 45 45 43 43 43 43 43 40 51 56 43 43 43 43 43 43 56 43 36 36 36 36 36 43 43 43 43 43 43 43 43 43 43 43 45 45 45 45 45 45 45 45 51 51 51 43 43 43 43 43 43 43 45 45 45 51 56 51 51 51 51 56 41 51 51 51 51 51 51 43 43 43 43 43 43 43 43 43 43 45 45 45 43 43 43 45 45 45 56 43 43 43 43 43 43 45 43 43 43 43 43 43 45 51 56 56 51 51 51 51 51 51 51 51 56 45 45 45 45 45 45 45 36 36 36 36 36 36 43 43 43 43 43 43 43 45 45 45 43 43 43 43 43 43 43 43 43 43 45 45 51 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 56 56 51 43 43 43 43 43 45 56 51 45 43 43 43 43 43 43 43 43 43 43 45 51 51 56 51 56 56 51 51 51 43 43 43 43 43 43 45 43 43 43 43 43 51 56 56 56 56 51 45 45 45 45 45 56 51 56 56 56 45 43 43 43 43 43 43 45 56 56 56 56 56 56 51 43 40 40 40 40 40 45 45 45 45 45 51 40 45 45 45 45 40 45 45 45 45 40 43 43 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 56 51 45 40 40 40 40 40 45 56 56 56 56 56 56 51 51 51 51 51 51 51 45 40 40 40 40 40 45 56 56 45 45 37 37 37 40 45 45 51 51 45 51 51 51 51 51 46 46 42 42 37 40 40 40 40 40 56 56 56 56 56 56 51 51 51 40 40 40 45 45 45 56 56 56 46 46 46 37 48 51 59 59 65 63 51 54 54 49 42 42 44 47 44 47 52 50 49 55 59 58 71 68 68 66 65 69 57 60 60 62 60 67 55 50 52 52 48 50 54 51 51 51 46 46 50 44 46 53 72 75 75 71 61 59 58 58 64 55 55 55 66 66 64 64 54 50 43 48 41 42 40 40 40 42 47 25 25 26 23 25 25 44 44 47 61 57 51 46 43 55 55 52 43 
41 38 43 48 65 69 69 55 55 50 59 51 44 44 52 59 54 47 54 59 65 65 62 67 62 56 54 60 57 62 47 44 47 43 43 40 44 44 58 58 63 59 76 72 44 41 37 37 50 53 57 69 67 80 80 80 69 54 51 45 51 48 40 49 67 63 63 71 69 47 41 48 44 37 37 47 47 55 59 60 58 61 56 55 45 49 47 47 54 49 55 47 49 56 61 54 46 47 53 55 56 67 72 72 72 80 80 80 69 63 62 59 54 47 32 32 32 32 26 29 29 30 39 39 57 52 47 51 61 59 63 58 57 61 53 53 56 56 54 62 65 75 69 63 57 40 51 51 51 51 56 45 45 45 45 45 51 40 40 40 40 40 45 56 56 56 56 56 56 42 37 40 40 40 40 37 40 40 45 56 51 45 40 37 37 37 40 40 51 51 45 45 40 45 40 45 56 56 56 56 51 45 45 45 45 45 45 45 40 40 37 37 37 37 37 37 40 45 45 51 45 51 51 51 51 51 45 45 45 45 45 45 51 51 51 51 51 45 45 45 45 45 45 51 56 56 56 56 56 56 51 51 51 51 51 51 56 56 56 56 56 51 45 45 45 45 45 56 56 56 56 56 51 45 45 45 43 43 45 45 45 51 56 56 56 56 56 56 56 45 51 45 45 45 45 51 56 51 51 56 56 56 56 51 56 56 56 51 51 51 51 51 51 45 45 51 51 51 51 56 56 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 51 51 56 56 56 56 56 56 56 51 51 51 45 45 45 43 43 43 43 43 43 43 43 43 43 43 51 51 56 56 51 51 51 51 45 43 43 43 43 43 43 56 45 45 45 45 45 45 45 45 45 45 45 51 51 51 51 45 45 45 43 43 43 43 43 43 51 51 51 43 43 43 43 43 43 43 43 43 45 51 56 43 43 43 43 43 43 45 45 51 51 51 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 51 51 56 56 56 45 45 43 43 43 43 51 45 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 51 51 43 43 43 43 43 43 43 43 43 51 43 36 36 36 36 36 43 43 43 43 43 43 36 43 43 36 34 34 34 34 34 43 43 43 43 43 36 36 36 36 36 43 43 43 43 43 43 43 43 45 45 45 45 51 43 35 35 35 35 35 35 43 43 43 43 43 35 35 35 35 35 35 43 45 45 45 51 43 35 35 35 35 35 38 40 40 39 39 39 35 35 35 39 39 39 40 40 40 40 40 40 40 40 40 40 40 40 46 46 46 46 39 35 35 35 35 35 35 35 35 35 35 35 40 40 40 40 40 40 39 39 39 39 39 39 39 39 35 35 29 29 29 35 30 35 39 39 35 35 39 39 45 46 46 46 46 46 40 40 39 35 35 35 35 35 35 35 35 35 0 0 0 0 0 0 0 0 0 0 0 AF LL2302f U 1 AF LL2302r C 539 BS 1 
590 LL2302f BS 591 591 LL2302r BS 592 612 LL2302f BS 613 616 LL2302r BS 617 632 LL2302f BS 633 636 LL2302r BS 637 643 LL2302f BS 644 650 LL2302r BS 651 656 LL2302f BS 657 684 LL2302r BS 685 687 LL2302f BS 688 694 LL2302r BS 695 695 LL2302f BS 696 708 LL2302r BS 709 711 LL2302f BS 712 725 LL2302r BS 726 726 LL2302f BS 727 760 LL2302r BS 761 783 LL2302f BS 784 1296 LL2302r RD LL2302f 788 0 0 gatggaagataacttcaATATATTACAAACTTATTAAAAGGAGTAGCTAA TGCAAATTCAGCAGAGAAAATAAGTGATTTGACATGATAATAACAACCAT ACTCGAAAACAAATGTAATTAAAACACTTGCTTCATCTCCTTTCACAGCA TCTTCACCTCTTCAACATTGTTGTCTTCAGTAGGTGTAGCCTTTGACGAA CATACCCTCCTTAGCCTCTTCAGACTCACCTTCTCCGGTGTATTTTCCGA GCTGAGCCAACGAATTGGCCTTGGCTCGAGTCAAGAGAGTGGTCTGAGCT GCGTTCACGTTCTCAGCTCTGCCTCCCCATGTTTTCAGGCAAGTGTTCTG AAGAGCACGTGCGTAGGAGAAGGACACGTGCCACGGGTTTGGTGCCTGGT TCATCGCGTTCAGGTTCAACGTTGCCTCCAACTCAGACTGTCCTCCGGAC AAGAACATGATTCCAGGGACGGCAGGAGGGATTCTGTTGCGGAGGAGCTT AAGGGTGTAGGAGGCAACTTGTTCAGGAGTAGCTCTGTCTTTAGACTCGG CTCCTGGAGTCACCATGCTCGGCTTCAGGAGGATACCTTCAAACATGACA TTGTTCTGAGCTAGGTAAAAGAAAACCTCAGCCCACACCTTCTCTGCTAC TTCGTAAGTCCTGTCAATGTCGTGTTCTCCATCCAACAAGATCTCTGGCT CCACTATGGGAACCAATCCACTGTCTTGTGAAATGGCAGCGTATCGAGCA AGgcccncagCAGCTTCTTTCACAGCGAGAGCAGAcgg QA 18 752 1 788 DS CHROMAT_FILE: LL2302f PHD_FILE: LL2302f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:02 2000 RD LL2302r 801 0 0 ctTTAGACTCGGCTCCTGGAGTCACCATGCTCGGCTTCAGGAGGATACCT TCAAACATGACATTGTTCTGAGCTAGGTAAAAGanaacCTCAGCCCACAC CTTCTCTGCTACTTCGTAAGTCCTGTCAATGTCGTGTTCTCCATCCAACA AGATCTCTGGCTCCACTATGGGAACCAATCCACTGTCTTGTGAAATGGCA GCGTATCGAGCAAGGCCCCAAGCAGCTTCTTTCACAGCGAGAGCAGACGG ACCGTTGGGAATGCTCACCACAGTACGCCATTTGGCGAAACGAGCACCCT GTTGGTAGTAAGCAGCGGTCCGAGAGGATAGACCGTCAAGTCCTTGGCAC CATGACTCATTGTTAGATCCAACAAGTGGCACCAAACCCTTGTCGACTTT GATACCGGGGACGATGTTCTGCTCGACTAGGACGTCGACCATTTTCTTGC CTTCGGTGGTAGACTGATAGAGAGTCTCCTCGAACAGGATTGCACCGGAG ATGTACTGTCCGAGACCTGGTGCAGAGACAAGCAGTGTCCGGTATGCTTG ACGGTTTGCCTCAGTGTTCTCTAGCCCTATCGAGTCCAAACGCTTCCCGC 
ATGTGGCGTTCGACTCGTCCATCGCCAAGATTCCTCGTCCAGGAGACGCA ATTGTTTTCGCTGTCTTAACGAGCTCATCGGCGTAGGAAGAAGCGGCGCG GACGGTAAGGTAGGTGGCGCGGTTGCGTAGGACGACGGCGGCGGAAGcgg acgcgtggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxa t QA 6 747 1 758 DS CHROMAT_FILE: LL2302r PHD_FILE: LL2302r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:39 2000 CO Contig26 816 2 80 U cacaatagacttcattaataacaccacaacaaaataaacaatatcgagct ccaacctcacttttacACACAAGACAATCACAACAACAAGTACATAAACA AACTCAACAACAACTAAGGGCAGGGATAAAAGAACATGATATGGTGATTT TATTTAAAAAAAACGCTAGTCTCAGCTGCATTTTCAGTCTTGATACTCCA CTTCTTCCTCCTCCTCTTCTTCATACTCTCCTTCTTCATCTGCGGTTGCG TCTTGGTATTGCTGATACTCTGAGACCAGATCGTTCATGTTGCTCTCGGC TTCTGTAAACTCCATCTCGTCCATCCCTTCACCTGTGTACCAATGCAAGA AAGCTTTCCTCCTGAACATGGCTGTGAACTGCTCACTCACCCTCCTAAAC ATCTCTTGGATCGATGTGGAGTTCCCAATAAAGGTTGAGGCCATCGAGAG GCCTCGAGGCGCTATGTCACAGACGCTTGACTTCACGTTGTTCGGTATCC ATTCCACAAAGTAGGATGAGTTCTTGTTCTGCACGTTTATCATCTGTTCG TCCACTTCTTTTGTGCTCATTTTGCCACGGAACATTGCTGAGGCAGTCAA GTATCGTCCGTGACGCGGGTCTGCGGCACACATCATGTTCTTTGAATCCC ACATCTGTTGGGTGAGCTCAGGGACGGTGAGTGCGCGGTACTGCTGGGAG CCACGAGAGGTGAGAGGAGCGAAACCGACCATGAAGAAGTGGAGACGAGG GAAGGGGATGAGGTTCACTGCAAGCTTCCTcagatcagagttgagctgac cagggcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 51 51 51 46 51 40 40 46 48 50 40 40 40 40 51 51 56 56 56 56 51 51 40 40 40 65 67 73 68 63 59 59 59 70 60 70 40 40 40 40 51 51 56 56 56 40 35 50 50 50 57 61 55 55 61 64 62 85 65 75 69 65 66 66 76 90 70 70 76 83 83 79 83 83 77 83 77 67 67 67 77 72 69 65 65 66 61 61 61 66 73 73 85 82 80 61 53 56 62 58 60 64 72 68 76 76 83 89 89 75 70 78 78 89 89 90 90 90 88 88 82 88 90 90 90 85 86 83 83 83 77 70 69 83 83 69 69 84 77 77 90 83 75 77 77 77 77 85 90 90 88 90 90 90 90 90 90 87 87 85 83 90 75 77 77 83 88 90 90 90 90 89 89 87 75 75 70 75 75 69 63 70 70 70 72 82 79 85 85 90 90 90 89 90 90 90 90 89 89 89 83 83 83 83 79 75 68 75 75 72 75 90 85 75 79 79 82 84 84 90 90 90 90 89 90 83 83 
83 83 83 90 90 90 90 90 90 90 85 85 90 90 80 80 90 90 90 90 90 88 83 83 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 88 88 90 90 87 87 90 90 90 90 90 90 88 88 88 88 89 89 90 90 90 90 90 90 90 90 90 90 90 90 83 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 86 86 86 89 90 90 90 90 90 90 90 90 86 83 83 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 84 84 82 78 78 82 83 83 83 90 90 90 90 90 90 90 90 90 90 87 81 81 81 76 83 90 90 90 90 90 90 90 90 90 87 83 83 90 90 90 90 90 90 90 90 88 90 82 82 85 90 90 90 90 90 90 90 90 86 86 86 86 78 83 83 83 83 89 89 89 89 89 89 83 72 75 75 70 75 68 68 72 70 70 72 72 72 72 74 82 89 83 75 68 68 69 69 72 78 78 78 86 86 90 90 90 88 90 83 90 90 90 90 90 90 90 90 90 85 90 90 90 89 89 85 85 75 75 75 75 72 75 77 90 90 83 75 75 75 72 74 69 73 73 65 70 60 54 54 60 75 83 83 70 58 51 51 49 53 49 67 67 70 72 78 78 76 68 63 70 65 61 61 54 56 60 68 68 74 65 73 67 69 72 80 90 69 64 56 59 61 61 61 54 54 54 63 70 64 64 60 55 57 58 57 61 62 66 58 59 65 65 74 65 65 57 56 66 66 68 73 73 76 76 76 79 79 83 80 85 90 90 69 62 55 55 55 63 71 60 62 56 52 52 49 52 52 37 40 42 46 46 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2304f U 1 AF LL2304r C 67 BS 1 74 LL2304f BS 75 76 LL2304r BS 77 91 LL2304f BS 92 102 LL2304r BS 103 137 LL2304f BS 138 138 LL2304r BS 139 173 LL2304f BS 174 176 LL2304r BS 177 177 LL2304f BS 178 182 LL2304r BS 183 199 LL2304f BS 200 201 LL2304r BS 202 203 LL2304f BS 204 204 LL2304r BS 205 206 LL2304f BS 207 207 LL2304r BS 208 223 LL2304f BS 224 225 LL2304r BS 226 227 LL2304f BS 228 228 LL2304r BS 229 239 LL2304f BS 240 240 LL2304r BS 241 258 LL2304f BS 259 
266 LL2304r BS 267 277 LL2304f BS 278 278 LL2304r BS 279 289 LL2304f BS 290 291 LL2304r BS 292 296 LL2304f BS 297 303 LL2304r BS 304 309 LL2304f BS 310 315 LL2304r BS 316 320 LL2304f BS 321 323 LL2304r BS 324 324 LL2304f BS 325 334 LL2304r BS 335 335 LL2304f BS 336 343 LL2304r BS 344 345 LL2304f BS 346 349 LL2304r BS 350 350 LL2304f BS 351 357 LL2304r BS 358 363 LL2304f BS 364 366 LL2304r BS 367 373 LL2304f BS 374 375 LL2304r BS 376 376 LL2304f BS 377 393 LL2304r BS 394 396 LL2304f BS 397 402 LL2304r BS 403 407 LL2304f BS 408 417 LL2304r BS 418 423 LL2304f BS 424 426 LL2304r BS 427 439 LL2304f BS 440 487 LL2304r BS 488 489 LL2304f BS 490 490 LL2304r BS 491 496 LL2304f BS 497 508 LL2304r BS 509 513 LL2304f BS 514 516 LL2304r BS 517 528 LL2304f BS 529 544 LL2304r BS 545 552 LL2304f BS 553 558 LL2304r BS 559 559 LL2304f BS 560 560 LL2304r BS 561 562 LL2304f BS 563 595 LL2304r BS 596 601 LL2304f BS 602 618 LL2304r BS 619 619 LL2304f BS 620 642 LL2304r BS 643 646 LL2304f BS 647 661 LL2304r BS 662 663 LL2304f BS 664 678 LL2304r BS 679 680 LL2304f BS 681 816 LL2304r RD LL2304f 779 0 0 cacaatagacttcattaataacaccacaacaaaataaacaatatcgagct ccaacctcacttttacACACAAGACAATCACAACAACAAGTACATAAACA AACTCAACAACAACTAAGGGCAGGGATAAAAGAACATGATATGGTGATTT TATTTAAAAAAAACGCTAGTCTCAGCTGCATTTTCAGTCTTGATACTCCA CTTCTTCCTCCTCCTCTTCTTCATACTCTCCTTCTTCATCTGCGGTTGCG TCTTGGTATTGCTGATACTCTGAGACCAGATCGTTCATGTTGCTCTCGGC TTCTGTAAACTCCATCTCGTCCATCCCTTCACCTGTGTACCAATGCAAGA AAGCTTTCCTCCTGAACATGGCTGTGAACTGCTCACTCACCCTCCTAAAC ATCTCTTGGATCGATGTGGAGTTCCCAATAAAGGTTGAGGCCATCGAGAG GCCTCGAGGCGCTATGTCACAGACGCTTGACTTCACGTTGTTCGGTATCC ATTCCACAAAGTAGGATGAGTTCTTGTTCTGCACGTTTATCATCTGTTCG TCCACTTCTTTTGTGCTCATTTTGCCACGGAACATTGCTGAGGCAGTCAA GTATCGTCCGTGACGCGGGTCTGCGGCACACATCATGTTCTTTGAATCCC ACATCTGTTGGGTGAGCTCAGGGACGGTGAGTGCGCGGTACTGCTGGGAG CCACGAGAGGTGAGAGGAGCGAAACCGACCATGAAGAAGTGGAGACGAGG GAAGGGGATGAGGTTCACTGCAAGCTtcn QA 67 775 1 778 DS CHROMAT_FILE: LL2304f PHD_FILE: LL2304f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 
09:44:33 2000 RD LL2304r 794 0 0 acACAAGACAATCACAacancaAGTACATAAACAAACTcaccaACAACTA AGGGCAGGGATAAAAGAACATGATATGGTGATTTTATTTAAAAAAAACGC TAGTCTCAGCTGCATTTTCAGTCTTGATACTCCACTTCTTCCTCCTCCTC TTCTTCATACTCTCCTTCTTCATCTGCGGTTGCGTCTTGGTATTGCTGAT ACTCTGAGACCAGATCGTTCATGTTGCTCTCGGCTTCTGTAAACTCCATC TCGTCCATCCCTTCACCTGTGTACCAATGCAAGAAAGCTTTCCTCCTGAA CATGGCTGTGAACTGCTCACTCACCCTCCTAAACATCTCTTGGATCGATG TGGAGTTCCCAATAAAGGTTGAGGCCATCGAGAGGCCTCGAGGCGCTATG TCACAGACGCTTGACTTCACGTTGTTCGGTATCCATTCCACAAAGTAGGA TGAGTTCTTGTTCTGCACGTTTATCATCTGTTCGTCCACTTCTTTTGTGC TCATTTTGCCACGGAACATTGCTGAGGCAGTCAAGTATCGTCCGTGACGC GGGTCTGCGGCACACATCATGTTCTTTGAATCCCACATCTGTTGGGTGAG CTCAGGGACGGTGAGTGCGCGGTACTGCTGGGAGCCACGAGAGGTGAGAG GAGCGAAACCGACCATGAAGAAGTGGAGACGAGGGAAGGGGATGAGGTTC ACTGCAAGCTTCCTcagatcagagttgagctgaccagggcggacgcgtgg xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 48 714 1 750 DS CHROMAT_FILE: LL2304r PHD_FILE: LL2304r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:11 2000 CO Contig27 547 2 10 U CTTTTTTTTTTTTTTTTAAACGAAGAAGCAAAAACTCAAGTTAATCCAAA AAATATTCAGAATTATTACACGATAAAAGTTTTTTAGACAAAGATAGATT TTGTTCTATATGAAGTACAATCGATGAAGAATTTACTTTTAAGTGATCAC TGATCTAAGTACTTCACGGGAACAAGTAGAAGCTCACTTTTCTTGCTCAC AGTAAGCCCAGGAGCTTCCTCCATGTCGACATCTTCCACCACCATGCCTT CAGGTAACTCCCAGTCAAAATGATACAATAGACTCGCCAGACCAAACTCC ACCGAAGTCGTCCCCATGTATATTCCAGGACAGATTCTCCTACCACTCCC AAACGACAACAGCTCAAAG**TTTTGTCCCTTTGCATCAATGTTATTATC CATAAACCTCTCGGGGAAAAACATCTCTGGATCCTTCCAGGTATCCGGAT CACGCCCAATAGCCCAGACATTCACATGAAGACGTGTCTTGACGGGAATC GTGTAGCCGTCGATCTCAAACTCCGAGGTTACTTCGGACGCGTGGga BQ 46 56 56 56 56 56 56 56 56 56 56 56 56 56 46 40 40 40 40 40 45 45 45 45 39 39 35 35 35 39 40 56 56 56 56 56 51 45 45 39 39 39 40 40 40 56 56 56 56 56 56 51 51 51 51 60 60 60 60 70 78 74 73 74 74 80 83 86 75 77 79 79 74 80 90 90 82 80 72 72 72 78 75 86 83 85 85 90 85 85 85 88 83 76 71 66 60 60 71 71 71 56 56 56 56 56 56 56 56 56 56 45 45 45 45 45 45 51 66 66 66 66 75 69 73 86 86 71 71 71 56 51 45 45 45 45 60 66 79 71 70 74 67 70 60 72 72 77 73 68 
74 74 74 76 83 83 83 80 80 77 76 76 72 90 80 85 85 80 66 66 66 66 51 51 45 45 45 60 58 59 70 70 74 77 85 85 85 81 80 79 90 90 90 90 90 90 85 85 66 58 43 43 43 43 43 43 60 60 83 81 80 72 68 72 62 62 66 73 75 86 90 90 87 81 67 63 62 62 65 69 80 85 85 85 85 90 90 90 90 90 90 90 90 82 80 77 77 67 64 60 60 62 68 68 76 69 71 80 87 87 86 85 87 80 73 73 78 89 90 90 90 88 90 90 84 78 78 68 68 66 78 78 75 69 69 69 69 72 74 80 86 86 90 90 87 80 80 80 80 80 88 83 80 74 70 62 58 58 58 58 43 43 45 56 56 45 60 60 60 60 66 56 56 56 56 56 56 71 66 76 80 80 80 90 90 89 90 81 75 60 62 65 74 76 79 79 85 85 90 90 90 78 78 80 77 78 74 86 60 60 45 45 45 45 56 45 43 43 43 43 43 51 51 51 66 66 66 80 67 58 58 58 58 59 58 69 83 75 66 66 66 67 75 70 70 70 69 69 72 81 84 86 80 83 74 74 74 72 72 67 72 67 72 78 88 82 90 90 71 71 56 56 56 51 51 66 66 66 78 66 66 75 77 71 66 66 45 40 40 40 40 39 54 60 61 58 55 61 50 55 55 58 54 58 68 68 68 68 57 57 52 62 59 63 65 69 75 73 70 70 70 67 55 55 55 55 67 59 66 50 50 50 35 35 40 40 40 46 40 40 40 40 40 45 51 40 39 39 39 35 32 32 29 29 29 32 32 35 35 35 35 40 46 46 68 68 59 62 56 56 49 54 65 64 64 57 61 39 41 40 39 39 39 51 51 51 51 39 39 34 34 33 32 0 0 AF LL2305r C -196 AF LL2305f U 36 BS 1 262 LL2305r BS 263 263 LL2305f BS 264 516 LL2305r BS 517 529 LL2305f BS 530 530 LL2305r BS 531 531 LL2305f BS 532 539 LL2305r BS 540 541 LL2305f BS 542 545 LL2305r BS 546 547 LL2305f RD LL2305r 787 0 2 gtgcgggcctcttcgctattacgccagctggcgaaagggggatgtgctgc aaggcgattaagttgggtaacgccagggttttxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxCTT TTTTTTTTTTTTTTAAACGAAGAAGCAAAAACTCAAGTTAATCCAAAAAA TATTCAGAATTATTACACGATAAAAGTTTTTTAGACAAAGATAGATTTTG TTCTATATGAAGTACAATCGATGAAGAATTTACTTTTAAGTGATCACTGA TCTAAGTACTTCACGGGAACAAGTAGAAGCTCACTTTTCTTGCTCACAGT AAGCCCAGGAGCTTCCTCCATGTCGACATCTTCCACCACCATGCCTTCAG GTAACTCCCAGTCAAAATGATACAATAGACTCGCCAGACCAAACTCCACC GAAGTCGTCCCCATGTATATTCCAGGACAGATTCTCCTACCACTCCCAAA 
CGACAACAGCTCAAAG**TTTTGTCCCTTTGCATCAATGTTATTATCCAT AAACCTCTCGGGGAAAAACATCTCTGGATCCTTCCAGGTATCCGGATCAC GCCCAATAGCCCAGACATTCACATGAAGACGTGTCTTGACGGGAATCGTG TAGCCGTCGATCTCAAACTCCGAGGTTACTTCGGACGCGTGGxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaatt QA 13 742 198 742 DS CHROMAT_FILE: LL2305r PHD_FILE: LL2305r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:16 2000 RT{ LL2305r chimera phrap 1 82 000919:094547 } RT{ LL2305r matchElsewhereHighQual phrap 1 82 000919:094547 } RD LL2305f 768 0 2 tcatgttattccAAaAactaTTCAGAATTATTACACGATAAAAGTTTTTT AGACAAAGATAGATTTtgctctAtctGAagcaCAATCGATGAAGAATtca cTTTTAAGTGATCACTGATCTAAGTACTTCACGGGAACacgTAGAAGCTC ACTTTTCTTGCTCACAgcaaGCCCAGGAGCTTCCTCCATGTCGACATCTT CCACCACCATGCCTTCAGGTAACTCCCAGTCAAAATGATACAATAGACTC GCCAGACCAAACTCCACCGAAGTCGtcctcaTGTATAtttcaGGACAGAT TCTCCTACCACTCCCAAACGACAACagttcaaacatttttgTCCCTTTGC ATCAATGTTATTATCCATAAACCTCTCGGGGAAAAACatatcTGGATCCT TCCAggcATCCGGATCACGCCCAATAGCCCAGACATTCACATGAAGACGt gcctTGAcggcaatcgtgcaccCgttcatCTCAAACTCCGAGGTTACTta ggaCGCGtgGgaxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxtgcgaaatcgtcatcggctcacaatcgcacacaccatac tcacccgcacgtccaatactgtacagtctggggcgcctaatgagcgagct aactcacattaattcgcgctgcgtccactgcccgattttcacgccggaaa cctgttttgccacttgcg QA 20 449 1 512 DS CHROMAT_FILE: LL2305f PHD_FILE: LL2305f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:37 2000 RT{ LL2305f chimera phrap 678 762 000919:094547 } RT{ LL2305f matchElsewhereLowQual phrap 685 714 000919:094547 } CO Contig28 692 2 131 U ctttttagcgacTATAATATGTTTCAAACGAAAGGAAGACCCACAGAGGC AGAAACAAGCAGAAGAAACAAAGATTGGTGACTAGATTGGGTAACTTGCC ATACAAGTCAAACATATACATAAACGAGAGAGATATAGTTTGAATGAATC AGCCAAAGAAGCTTGGATCGTAGCCATTGCTAGAAGTGGCCAAAACATAG TAAGCAACGATGTGACCAATAGAACCCCAAGCAAGAACATCCACGATGTT GAATCCAACAGGGTCGTTCGATTTCAAGAGGCTTACGTACTCCTTGGCAC GATCATCTCCCGCTTCGAAGTGGGTCTTCCCGTTCTGCTCCGGCACCTGT TTAGCCACATTCTCTCTCTGGAAGTTGAAGAAGACGAACCGGCCTAGGAA GAGGGAGAGACCTGTGCTGAGGCTAATAACGACGGATGGGCTGAGCTCGG 
CTCTCACGACGGCGGAGGAGGAAGATCTTCTTCCGGAGGTGGTGGATAGC TTGGGTAAGGCAGATGAGGAGCCACCGAGGCGGAGAGGACGGAGGCCGTG GAATGAGATGGAGTTCGGGTGTTTCTGGGAGATTGTAGAGGAGAAAGTTG TGGGTGTGAGCAAAGCAGATGCGCTTGTCGCCATGTTATCTTCTTCTTCT GTGGAGAGTTTCAGTGCAACCCACGCGTCCGCGGACGCGTGG BQ 0 0 0 0 0 0 0 0 0 0 0 0 27 27 54 55 55 52 55 55 55 55 53 53 53 55 52 56 59 61 54 55 51 49 49 48 53 54 77 52 52 58 59 54 57 52 54 48 48 58 71 67 64 51 51 63 71 66 90 86 90 84 80 80 72 64 54 54 65 69 69 88 88 79 74 74 75 71 71 70 75 75 75 75 75 69 71 74 75 71 69 65 55 57 60 75 64 67 67 75 75 68 80 74 80 80 80 80 75 86 85 90 79 88 90 88 88 88 74 77 76 74 72 72 77 81 81 86 90 90 90 90 90 79 75 75 75 78 75 75 75 75 75 75 80 88 90 85 75 88 75 75 75 88 90 90 90 90 88 80 80 80 80 86 88 90 75 80 70 70 70 70 72 72 80 90 90 90 87 87 87 87 76 80 90 83 77 71 83 83 83 83 90 90 90 78 77 79 81 81 78 82 78 78 86 78 80 89 89 89 85 90 90 90 90 90 90 83 81 86 90 82 82 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 89 90 89 89 89 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 83 83 81 81 88 90 90 90 90 90 90 90 90 90 90 89 89 83 81 79 79 79 79 79 81 90 90 90 90 90 86 86 88 83 86 86 88 88 88 90 90 90 90 90 83 81 81 80 77 81 90 90 87 85 89 90 90 90 90 90 90 90 90 90 90 86 83 84 84 84 90 90 90 90 88 88 86 86 83 83 90 87 90 90 90 81 85 85 84 84 84 84 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 81 79 79 79 81 87 83 88 88 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 86 83 85 83 86 84 81 81 86 88 88 86 86 85 88 88 88 88 90 84 84 88 90 90 88 88 87 88 90 90 86 86 86 88 88 83 83 83 88 86 80 80 80 88 88 90 88 88 85 85 85 83 83 83 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 87 88 90 90 90 90 90 90 90 90 90 86 83 78 78 78 80 90 89 90 90 90 88 88 88 88 88 83 86 90 90 90 87 85 85 85 85 85 80 80 80 80 82 76 76 83 88 88 88 88 90 87 89 80 83 80 80 83 80 83 83 83 88 88 85 85 90 90 90 90 90 90 90 90 90 90 90 90 77 75 75 80 88 88 90 90 90 82 82 82 90 90 90 90 90 90 85 85 90 90 86 87 87 
79 75 90 90 80 80 65 69 88 77 73 73 83 80 80 88 88 88 80 80 75 90 75 72 75 75 75 73 77 79 79 65 70 68 72 72 72 75 75 74 63 63 63 63 64 68 90 65 62 73 78 78 78 86 86 86 86 86 86 86 90 77 74 71 74 66 80 83 90 90 90 80 80 74 69 67 64 71 60 62 63 63 85 86 72 67 67 62 62 62 59 45 45 53 53 55 63 69 69 49 49 49 34 33 32 AF LL2308r C -58 AF LL2308f U 1 BS 1 12 LL2308f BS 13 14 LL2308r BS 15 38 LL2308f BS 39 39 LL2308r BS 40 44 LL2308f BS 45 47 LL2308r BS 48 61 LL2308f BS 62 62 LL2308r BS 63 71 LL2308f BS 72 73 LL2308r BS 74 80 LL2308f BS 81 85 LL2308r BS 86 95 LL2308f BS 96 96 LL2308r BS 97 99 LL2308f BS 100 101 LL2308r BS 102 109 LL2308f BS 110 110 LL2308r BS 111 111 LL2308f BS 112 118 LL2308r BS 119 119 LL2308f BS 120 120 LL2308r BS 121 122 LL2308f BS 123 147 LL2308r BS 148 148 LL2308f BS 149 157 LL2308r BS 158 158 LL2308f BS 159 168 LL2308r BS 169 175 LL2308f BS 176 183 LL2308r BS 184 186 LL2308f BS 187 187 LL2308r BS 188 192 LL2308f BS 193 197 LL2308r BS 198 207 LL2308f BS 208 211 LL2308r BS 212 212 LL2308f BS 213 215 LL2308r BS 216 216 LL2308f BS 217 217 LL2308r BS 218 223 LL2308f BS 224 227 LL2308r BS 228 237 LL2308f BS 238 254 LL2308r BS 255 260 LL2308f BS 261 261 LL2308r BS 262 264 LL2308f BS 265 270 LL2308r BS 271 271 LL2308f BS 272 274 LL2308r BS 275 275 LL2308f BS 276 298 LL2308r BS 299 300 LL2308f BS 301 303 LL2308r BS 304 304 LL2308f BS 305 318 LL2308r BS 319 323 LL2308f BS 324 324 LL2308r BS 325 325 LL2308f BS 326 329 LL2308r BS 330 330 LL2308f BS 331 340 LL2308r BS 341 342 LL2308f BS 343 360 LL2308r BS 361 366 LL2308f BS 367 369 LL2308r BS 370 381 LL2308f BS 382 395 LL2308r BS 396 407 LL2308f BS 408 408 LL2308r BS 409 409 LL2308f BS 410 412 LL2308r BS 413 413 LL2308f BS 414 416 LL2308r BS 417 417 LL2308f BS 418 426 LL2308r BS 427 427 LL2308f BS 428 429 LL2308r BS 430 432 LL2308f BS 433 436 LL2308r BS 437 438 LL2308f BS 439 441 LL2308r BS 442 443 LL2308f BS 444 446 LL2308r BS 447 448 LL2308f BS 449 466 LL2308r BS 467 467 LL2308f BS 468 468 LL2308r BS 469 474 
LL2308f BS 475 475 LL2308r BS 476 476 LL2308f BS 477 483 LL2308r BS 484 484 LL2308f BS 485 486 LL2308r BS 487 496 LL2308f BS 497 502 LL2308r BS 503 511 LL2308f BS 512 512 LL2308r BS 513 516 LL2308f BS 517 527 LL2308r BS 528 529 LL2308f BS 530 530 LL2308r BS 531 536 LL2308f BS 537 546 LL2308r BS 547 548 LL2308f BS 549 550 LL2308r BS 551 574 LL2308f BS 575 576 LL2308r BS 577 580 LL2308f BS 581 582 LL2308r BS 583 588 LL2308f BS 589 595 LL2308r BS 596 596 LL2308f BS 597 600 LL2308r BS 601 618 LL2308f BS 619 619 LL2308r BS 620 620 LL2308f BS 621 621 LL2308r BS 622 624 LL2308f BS 625 625 LL2308r BS 626 627 LL2308f BS 628 649 LL2308r BS 650 650 LL2308f BS 651 652 LL2308r BS 653 658 LL2308f BS 659 663 LL2308r BS 664 665 LL2308f BS 666 674 LL2308r BS 675 677 LL2308f BS 678 691 LL2308r BS 692 692 LL2308f RD LL2308r 795 0 0 tnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn nnnntntnntnnaccacGAAATATAATATGTTTCAAACGAAAGGAAGACC CACAGAGGCAGAAACAAGCAGAAGAAACAAAGATTGGTGACTAGATTGGG TAACTTGCCATACAAGTCAAACATATACATAAACGAGAGAGATATAGTTT GAATGAATCAGCCAAAGAAGCTTGGATCGTAGCCATTGCTAGAAGTGGCC AAAACATAGTAAGCAACGATGTGACCAATAGAACCCCAAGCAAGAACATC CACGATGTTGAATCCAACAGGGTCGTTCGATTTCAAGAGGCTTACGTACT CCTTGGCACGATCATCTCCCGCTTCGAAGTGGGTCTTCCCGTTCTGCTCC GGCACCTGTTTAGCCACATTCTCTCTCTGGAAGTTGAAGAAGACGAACCG GCCTAGGAAGAGGGAGAGACCTGTGCTGAGGCTAATAACGACGGATGGGC TGAGCTCGGCTCTCACGACGGCGGAGGAGGAAGATCTTCTTCCGGAGGTG GTGGATAGCTTGGGTAAGGCAGATGAGGAGCCACCGAGGCGGAGAGGACG GAGGCCGTGGAATGAGATGGAGTTCGGGTGTTTCTGGGAGATTGTAGAGG AGAAAGTTGTGGGTGTGAGCAAAGCAGATGCGCTTGTCGCCATGTTATCT TCTTCTTCTGTGGAGAGTTTCAGTGCAACCCACGCGTCCGCGGACGCGTG Gxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 68 751 72 751 DS CHROMAT_FILE: LL2308r PHD_FILE: LL2308r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:29 2000 RD LL2308f 791 0 0 ctttttagcgactATAATATGTTTCAAACGAAAGGAAGACCCACAGAGGC AGAAACAAGCAGAAGAAACAAAGATTGGTGACTAGATTGGGTAACTTGCC ATACAAGTCAAACATATACATAAACGAGAGAGATATAGTTTGAATGAATC AGCCAAAGAAGCTTGGATCGTAGCCATTGCTAGAAGTGGCCAAAACATAG 
TAAGCAACGATGTGACCAATAGAACCCCAAGCAAGAACATCCACGATGTT GAATCCAACAGGGTCGTTCGATTTCAAGAGGCTTACGTACTCCTTGGCAC GATCATCTCCCGCTTCGAAGTGGGTCTTCCCGTTCTGCTCCGGCACCTGT TTAGCCACATTCTCTCTCTGGAAGTTGAAGAAGACGAACCGGCCTAGGAA GAGGGAGAGACCTGTGCTGAGGCTAATAACGACGGATGGGCTGAGCTCGG CTCTCACGACGGCGGAGGAGGAAGATCTTCTTCCGGAGGTGGTGGATAGC TTGGGTAAGGCAGATGAGGAGCCACCGAGGCGGAGAGGACGGAGGCCGTG GAATGAGATGGAGTTCGGGTGTTTCTGGGAGATTGTAGAGGAGAAAGTTG TGGGTGTGAGCAAAGCAGATGCGCTTGTCGCCATGTTATCTTCTTCTTCT GTGGAGAGTTTCAGTGCAACCCACGCGTCCGCGGACGCGTGGxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx QA 13 692 1 692 DS CHROMAT_FILE: LL2308f PHD_FILE: LL2308f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:52 2000 CO Contig29 904 2 54 U ctggaaatttagaaaatatgaccttaaatcataatcattcgttcgggaaa aaagaaatacattttactcaacaaaataaaTATAAATACATTTTACCCGA GCATAGTAAACATTTCAAAGATTGAGAAATCTTTGAAGACTCATCACTCA TATGATTCTACACCGTCAAGTACTTTAGAACTGAAGCACGCACCTTCTCC ATGTAATCTCCTGCTTCTTTCTGGTTTCCTCTGTATGCATTTGCTATGTT TTCATCAAGAAGTGTCAACAAAGACCTGTTGATCTCGTTTTTCTCCACCA TATCCAACAATGTTGCTTTTATGTCGGTTGAGGTTAATATCTTGGTTAAG CTATTTGTAGCGGTCATAAGCTCCTTTTGCATTTTGTCATAAGCCTCTAT TCCTTCTTCTAAGGCTTTTTGAAGTGATTCAAGCTCAATCAATCTGTCTT CATCTTCCGCGGTTTTTGTAACCGCAAACCGGATTTGCCCGATTTCAAGT CGAATTTGCGCAAAAAACTCGTCATTTAACCTTCCACGAAGCCTAGCGAT TTCAAACTCAATCTCCTGAGCTTCCGTATCAAGAAAGAACTCAATGAGCT CAGCGGAAGTGTCTGGTACAACTCTAGACTCTCTAAGAGCTTGACGACGC TCTCTCTCTTCACGGAGTTGTTTCTGAAAAGCCTCTTTAGCCTCGGAGTC TCTCTCTAGCCTGCGTTTGAACTCAAGGCGAGCGATATGACCAGTTTGAG CAGGACCTAAGATACCTTTCGGATCCCATTCGCAGGAGATTTTGCCCCTT CTTGTATTAAATCTCGGGATTGTACGGACGGTGTTAGCAAAGGCGATTCC TATTGAGAGCGAAGCCATtgttatttttatttttttgtgggcgcggacgc gtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 35 37 32 32 29 29 29 35 35 35 35 26 26 26 35 35 35 32 32 32 32 35 35 35 35 29 23 23 25 25 42 35 39 35 35 35 35 35 35 37 37 29 29 29 29 32 32 36 36 37 37 37 37 35 35 
35 35 32 35 35 35 39 35 35 32 32 35 32 35 35 37 37 40 40 40 35 35 35 35 35 37 35 29 30 29 29 23 31 29 35 47 47 56 49 39 41 35 40 33 45 48 60 60 54 57 54 53 53 49 59 55 55 50 59 59 67 66 65 58 58 64 77 77 79 79 71 61 68 57 59 58 50 55 51 54 47 35 35 37 35 37 48 51 52 59 52 58 67 75 75 73 90 73 75 75 82 82 86 82 71 64 64 66 72 72 61 67 74 90 90 76 81 85 89 85 81 83 90 84 58 61 54 65 65 73 71 70 66 79 82 81 79 79 79 80 75 75 79 81 81 90 88 87 78 78 78 78 83 86 90 90 90 86 86 89 90 89 79 80 83 90 90 90 90 90 90 82 77 86 84 87 87 90 83 79 79 83 80 83 90 86 90 80 86 73 73 67 67 67 72 73 77 66 62 68 73 79 82 90 90 86 75 73 73 67 67 73 73 70 70 75 80 86 89 86 90 77 75 72 72 73 75 80 86 86 88 90 90 86 86 90 90 90 90 90 90 88 85 90 90 90 90 90 90 90 90 90 75 77 80 75 82 82 81 81 90 90 90 90 90 89 90 90 90 90 90 90 86 86 86 88 86 87 90 90 90 90 90 90 90 90 87 80 78 80 85 90 90 90 89 89 86 84 86 86 90 90 90 90 90 90 88 88 80 85 85 85 87 90 90 90 90 90 90 90 88 80 78 78 79 79 79 90 85 88 89 79 72 69 69 76 76 83 78 78 78 78 86 90 90 88 90 88 90 90 90 90 86 86 88 90 90 90 90 90 89 89 90 82 75 75 86 84 87 89 86 72 72 72 74 75 80 89 90 90 90 85 83 83 78 78 76 78 86 86 83 83 83 86 86 90 90 88 90 90 80 83 79 76 76 72 73 72 76 65 69 69 74 74 80 90 90 90 90 86 88 90 89 90 89 90 90 90 88 88 88 87 87 80 80 66 70 62 67 70 76 85 89 90 90 90 90 90 88 90 84 85 80 81 85 85 85 90 78 71 73 71 73 80 80 78 75 72 72 72 78 78 85 70 70 65 65 65 76 68 61 61 70 75 85 90 90 80 80 80 79 74 69 69 63 65 65 69 70 74 65 68 68 69 69 76 88 78 85 84 82 72 68 66 68 58 60 66 66 66 69 69 84 84 81 80 81 83 77 77 84 89 80 80 81 70 64 60 66 66 76 74 68 72 83 83 70 74 60 60 58 69 69 68 72 72 85 81 75 77 89 76 76 65 65 61 74 68 73 65 70 73 76 73 73 72 72 72 72 65 62 66 72 72 69 70 76 83 83 76 68 61 62 64 66 66 71 71 77 72 74 88 88 85 85 72 65 63 59 58 58 58 64 71 74 74 68 58 60 60 66 66 60 58 58 58 58 58 66 66 51 46 43 43 43 43 43 43 51 51 56 56 51 43 35 35 35 35 35 38 45 45 45 45 45 51 40 40 40 40 40 40 56 45 45 45 
45 35 35 35 35 35 35 35 39 39 40 39 39 39 39 39 40 45 40 40 40 40 40 40 45 45 45 45 45 45 45 39 35 35 35 35 35 35 35 34 31 31 31 31 31 40 51 40 40 40 40 40 46 46 39 34 34 34 33 32 33 40 40 40 45 46 51 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2309f U 1 AF LL2309r C 168 BS 1 176 LL2309f BS 177 179 LL2309r BS 180 180 LL2309f BS 181 181 LL2309r BS 182 186 LL2309f BS 187 189 LL2309r BS 190 196 LL2309f BS 197 200 LL2309r BS 201 210 LL2309f BS 211 211 LL2309r BS 212 212 LL2309f BS 213 213 LL2309r BS 214 224 LL2309f BS 225 225 LL2309r BS 226 227 LL2309f BS 228 231 LL2309r BS 232 232 LL2309f BS 233 235 LL2309r BS 236 240 LL2309f BS 241 246 LL2309r BS 247 248 LL2309f BS 249 250 LL2309r BS 251 254 LL2309f BS 255 255 LL2309r BS 256 261 LL2309f BS 262 262 LL2309r BS 263 271 LL2309f BS 272 280 LL2309r BS 281 281 LL2309f BS 282 300 LL2309r BS 301 301 LL2309f BS 302 303 LL2309r BS 304 304 LL2309f BS 305 337 LL2309r BS 338 338 LL2309f BS 339 366 LL2309r BS 367 368 LL2309f BS 369 391 LL2309r BS 392 393 LL2309f BS 394 401 LL2309r BS 402 405 LL2309f BS 406 420 LL2309r BS 421 421 LL2309f BS 422 447 LL2309r BS 448 449 LL2309f BS 450 462 LL2309r BS 463 463 LL2309f BS 464 497 LL2309r BS 498 498 LL2309f BS 499 512 LL2309r BS 513 513 LL2309f BS 514 695 LL2309r BS 696 696 LL2309f BS 697 904 LL2309r RD LL2309f 768 0 0 ctggaaatttagaaaatatgaccttaaatcataatcattcgttcgggaaa aaagaaatacattttactcaacaaaataaaTATAAATACATTTTACCCGA GCATAGTAAACATTTCAAAGATTGAGAAATCTTTGAAGACTCATCACTCA TATGATTCTACACCGTCAAGTACTTTAGAACTGAAGCACGCACCTTCTCC ATGTAATCTCCTGCTTCTTTCTGGTTTCCTCTGTATGCATTTGCTATGTT TTCATCAAGAAGTGTCAACAAAGACCTGTTGATCTCGTTTTTCTCCACCA TATCCAACAATGTTGCTTTTATGTCGGTTGAGGTTAATATCTTGGTTAAG CTATTTGTAGCGGTCATAAGCTCCTTTTGCATTTTGTCATAAGCCTCTAT TCCTTCTTCTAAGGCTTTTTGAAGTGATTCAAGCTCAATCAATCTGTCTT CATCTTCCGCGGTTTTTGTAACCGCAAACCGGATTTGCCCGATTTCAAGT CGAATTTGCGCAAAAAACTCGTCATTTAACCTTCCACGAAGCCTAGCGAT TTCAAACTCAATCTCCTGAGCTTCCGTATCAAGAAAGAACTCAATGAGCT 
CAGCGGAAGTGTCTGGTACAACTCTAGACTCTCTAAGAGCTTGACGACGC TCTCTCTCTTCACGGAGTTGTTTCTGAAAAGCCTCTTTAGCCTCGGAGTC TCTCTCTAGCCTGCGTTTGAACTCAAGGCGAGCGATATGACCAGTTTGAG CAGGACCTAAGATACcTt QA 81 768 1 768 DS CHROMAT_FILE: LL2309f PHD_FILE: LL2309f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:57 2000 RD LL2309r 781 0 0 nagTACTTTAGAACTGAAGCACGCACCTTCTCCATGTAATCTCCTGCTTC tntcTGGTTTCCTCTGTATGCATTTGCTATGTTTTCATCAAGAAGTGTCA ACAAAGACCTGTTGATCTCGTTTTTCTCCACCATATCCAACAATGTTGCT TTTATGTCGGTTGAGGTTAATATCTTGGTTAAGCTATTTGTAGCGGTCAT AAGCTCCTTTTGCATTTTGTCATAAGCCTCTATTCCTTCTTCTAAGGCTT TTTGAAGTGATTCAAGCTCAATCAATCTGTCTTCATCTTCCGCGGTTTTT GTAACCGCAAACCGGATTTGCCCGATTTCAAGTCGAATTTGCGCAAAAAA CTCGTCATTTAACCTTCCACGAAGCCTAGCGATTTCAAACTCAATCTCCT GAGCTTCCGTATCAAGAAAGAACTCAATGAGCTCAGCGGAAGTGTCTGGT ACAACTCTAGACTCTCTAAGAGCTTGACGACGCTCTCTCTCTTCACGGAG TTGTTTCTGAAAAGCCTCTTTAGCCTCGGAGTCTCTCTCTAGCCTGCGTT TGAACTCAAGGCGAGCGATATGACCAGTTTGAGCAGGACCTAAGATACCT TTCGGATCCCATTCGCAGGAGATTTTGCCCCTTCTTGTATTAAATCTCGG GATTGTACGGACGGTGTTAGCAAAGGCGATTCCTATTGAGAGCGAAGCCA Ttgttatttttatttttttgtgggcgcggacgcgtggxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 3 701 2 737 DS CHROMAT_FILE: LL2309r PHD_FILE: LL2309r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:34 2000 CO Contig30 1421 2 24 U ttcatatattaaaggctTATTAAAAAGGAGTAGCTAATGCAAATTTAGTA GAGAAAGTAAATGATTTGACATGATAATAACAACCAAACTCAAACAAATG CAACCAAGACACTTGCTTCGTCTCCTTTCACAGCATCATGATCGCTTCTT CAACATTGTTGTCTTTAATAGGTGTAGCCTTTGACGAACATACCCTCCTT AGCCTCTTCAGACTCGCCTTCTCCGCTGTATTTTCCAAGCTGAGCCAACG AGTTGGCCTTGGCACGAGCCAAGAGAGTGGTCTGAGCAGCGTTCACGTTC TCAGCTCTGCCTCCCCATGTTTTCAGACAAGTGTTCTGGAGAGCACGTGC GTAGGAGAAGGACACGTGCCATGGGTTTGGTGCCTGGTTCATCGCGTTGA GGTTCAACGTTGCCTCCAACTCAGACTGTCCTCCAGACAAGAACATGATT CCGGGGACGGCAGGAGGGATTCTGTTGCGGAGGAGCTTAAGGGTGTAGGC GGCAACTTGTTCAGGAGTAGCTCTGTCTTTAGACTCGGCTCCAGGAGTCA CCATGCTCGGCTTCAGGAGGATACCTTCAAACATGACATTGTTCTGAGCA AGGTAAAAGAAAACCTCAGCCCAAACCTTCTCAGCTACTTCGTAAGTCCT GTCAATGTCGTGTTCTccGTCCAACAAGATCTCTGGCTCCACAATTGGGA 
CCAATCCACTGTCCTGTGAAATGGCAGCGTAGCGAGCAAGGCCCCAAGCA GCTTCTTTCACAGCGAGCGCAGACGGGCCGTTGGGAATGCTCACGACAGT ACGCCATttaGCGAAACGAGCACCCTGTTGGTAGTAAGCAGCAGTGCGAG AGGATAGACCATCGAGTCCTTGACACCATGACTCATTGTTAGAGCCAACA AGTGGCACCAAACCCTTGTCAACTTTGATACCGGGGACGATGTTCTGCTC GACGAGGACGTCGACCATTTTCTTTCCTTCGGTGGTAGACTGATAGAGAG TCTCCTCGAACAAGATTGCCCCGGATATGTACTGTCCGAGACCTGGTGCA GAGACAAGCAATGTCCTGTACGCTTGACGGTTTGCCTCAGTGTTCTCTAG CCCTATCGAGTCCAAACGCTTCCCGCAAGTCGCGTTCGACTCGTCCATCG CCAAGATTCCACGTCCCGGAGATGCAATTGTTTTCGCTGTCTTGACAAGC TCATCGGCGTAGGAAGAAGCGGCACGGACGGCGAGGGAGGTGGCGCGGTT GGGGAGGACGACGGATGCGGAAGAAGGCTGACGGAAGAGAACGCTTTGGC CCTTGACCCACTCAGATTTGTCCAACACTGGAGATGCCTTGAGGAGTGAG GTAGATGCCATTTTTTTTTGTTAATGTTATCTcTTtctcttctctctgcc tttggtttgtcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 15 25 25 29 29 27 27 31 46 40 40 40 40 37 37 34 29 29 29 29 29 35 46 40 40 40 40 40 40 40 38 38 40 40 40 40 46 40 40 37 37 37 37 37 37 46 40 46 40 40 40 35 35 35 37 40 46 46 40 35 35 35 35 35 40 51 35 35 35 35 35 35 37 40 37 37 40 40 40 40 40 40 40 40 40 37 37 37 37 37 37 40 46 35 35 35 39 35 35 40 40 37 40 37 37 40 40 37 37 46 46 46 46 40 40 40 40 40 40 40 40 40 40 40 38 34 34 34 34 34 35 35 37 40 40 51 40 43 51 46 46 44 43 40 38 35 35 35 35 35 43 43 43 43 43 43 43 43 43 43 43 43 43 43 51 43 35 35 35 35 35 36 38 43 43 43 43 43 43 43 41 41 43 45 41 41 41 45 41 45 45 45 45 40 36 36 43 43 43 43 43 40 43 42 56 56 43 43 43 36 36 36 43 46 51 51 51 51 51 51 51 43 43 45 45 45 45 56 56 51 43 43 43 43 43 45 56 51 45 45 45 45 45 45 41 43 38 43 38 38 38 40 43 44 42 41 41 41 41 41 41 42 56 41 41 41 43 45 45 56 51 43 43 43 43 43 43 36 36 36 36 36 43 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 56 56 56 56 56 51 51 51 51 43 43 43 43 43 43 51 51 45 45 45 45 45 45 46 46 41 41 43 36 36 36 36 36 43 43 43 38 38 56 56 56 56 56 56 51 51 51 43 43 43 43 43 43 45 45 45 51 56 56 56 56 56 56 56 51 51 41 45 45 45 51 51 45 45 45 43 43 43 43 43 43 43 45 56 56 56 56 51 51 51 43 43 43 45 43 43 45 45 41 41 43 51 45 45 45 45 45 41 41 41 
45 45 43 56 56 56 56 56 51 43 43 43 43 43 43 43 43 43 43 43 43 56 56 56 51 51 51 51 51 45 45 45 45 45 45 51 56 56 56 56 56 56 56 56 56 56 56 56 50 40 38 39 35 35 35 40 51 51 40 45 40 40 40 40 51 51 56 56 51 51 51 51 40 40 40 40 40 40 37 39 37 37 37 40 43 51 45 40 40 40 45 45 51 40 40 40 45 45 45 56 56 56 56 56 56 42 40 40 45 45 45 51 40 40 40 40 40 51 56 56 56 51 51 45 45 45 45 45 45 45 51 51 46 46 42 40 38 38 38 46 46 56 56 56 51 51 40 40 45 40 40 40 43 46 51 51 56 56 56 56 56 56 56 42 40 35 30 30 35 40 43 40 40 40 47 37 46 28 28 28 32 32 48 48 34 32 34 40 40 40 40 40 40 40 40 40 40 36 40 37 37 36 39 35 42 42 46 37 35 35 33 35 35 29 39 40 44 44 48 40 40 34 29 29 29 29 40 40 40 40 46 34 33 33 33 31 40 31 23 27 21 17 18 21 25 24 25 25 25 29 25 24 29 40 40 40 40 40 29 47 34 45 42 34 30 45 49 45 45 61 61 57 51 49 50 49 51 50 52 62 71 53 56 55 53 54 50 42 40 48 48 48 48 39 46 53 53 39 37 48 54 51 58 50 57 52 58 57 58 54 50 50 42 41 41 37 40 42 39 37 37 35 52 63 71 67 58 50 50 58 49 54 54 58 60 56 58 44 43 43 28 29 29 29 32 32 28 40 29 26 24 22 20 20 25 25 25 25 25 25 29 32 32 40 48 44 33 33 33 28 28 34 39 39 33 33 32 32 32 32 32 27 15 15 17 22 22 29 39 32 33 29 33 29 35 29 35 42 56 56 56 40 37 37 37 37 37 37 37 35 36 35 29 35 28 28 26 32 32 37 37 37 40 40 51 51 46 40 37 40 37 37 37 51 56 51 51 51 45 45 45 40 40 51 51 51 42 40 40 40 47 46 46 46 44 44 40 48 48 48 32 32 34 40 40 36 39 32 34 34 39 37 56 40 40 40 40 39 35 36 33 36 37 56 56 56 56 56 56 56 45 51 51 45 45 45 51 56 45 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 45 45 51 51 51 51 56 56 56 56 56 56 56 56 51 51 51 51 45 45 40 40 40 37 40 51 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 45 51 51 51 51 51 56 56 51 45 45 45 45 40 40 40 40 40 40 51 51 45 45 45 45 40 51 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 51 51 51 45 45 45 43 43 43 43 43 43 51 51 51 51 56 56 51 51 51 51 45 45 45 45 51 51 56 51 45 45 45 45 45 45 45 45 45 45 45 45 45 43 43 43 45 45 45 56 45 45 43 43 43 43 45 45 51 51 51 45 45 45 43 43 
43 51 45 45 45 43 43 43 43 43 45 56 56 51 51 43 43 43 43 43 43 43 56 43 43 43 43 43 43 51 46 43 43 43 43 36 36 36 36 36 36 43 43 43 43 43 45 45 43 36 36 36 36 36 43 43 43 43 43 43 43 43 43 43 43 43 45 51 43 43 43 43 43 43 43 45 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 51 56 56 45 45 45 45 45 45 56 45 45 43 43 43 43 43 43 43 51 45 45 45 43 36 36 36 36 36 36 36 40 40 51 51 51 43 43 43 43 43 43 51 51 51 43 43 43 43 43 43 43 43 43 43 43 43 51 51 43 43 43 43 43 43 43 43 43 43 43 43 43 34 35 35 35 35 35 43 51 45 45 43 43 43 43 43 43 45 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 43 43 40 40 40 40 51 51 51 51 51 51 56 40 40 40 40 40 40 45 45 45 45 51 51 51 51 51 56 56 51 51 40 40 40 40 40 40 45 51 51 51 51 51 51 51 45 40 40 40 40 40 40 40 40 40 40 40 40 40 46 46 46 46 51 51 51 56 51 51 51 51 51 40 39 39 39 39 39 39 40 37 29 30 24 30 19 30 30 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2311f U 1 AF LL2311r C 683 BS 1 685 LL2311f BS 686 690 LL2311r BS 691 696 LL2311f BS 697 697 LL2311r BS 698 699 LL2311f BS 700 700 LL2311r BS 701 710 LL2311f BS 711 718 LL2311r BS 719 719 LL2311f BS 720 720 LL2311r BS 721 725 LL2311f BS 726 726 LL2311r BS 727 727 LL2311f BS 728 728 LL2311r BS 729 731 LL2311f BS 732 732 LL2311r BS 733 733 LL2311f BS 734 747 LL2311r BS 748 751 LL2311f BS 752 760 LL2311r BS 761 761 LL2311f BS 762 763 LL2311r BS 764 764 LL2311f BS 765 1421 LL2311r RD LL2311f 767 0 0 ttcatatattaaaggctTATTAAAAAGGAGTAGCTAATGCAAATTTAGTA GAGAAAGTAAATGATTTGACATGATAATAACAACCAAACTCAAACAAATG CAACCAAGACACTTGCTTCGTCTCCTTTCACAGCATCATGATCGCTTCTT CAACATTGTTGTCTTTAATAGGTGTAGCCTTTGACGAACATACCCTCCTT AGCCTCTTCAGACTCGCCTTCTCCGCTGTATTTTCCAAGCTGAGCCAACG AGTTGGCCTTGGCACGAGCCAAGAGAGTGGTCTGAGCAGCGTTCACGTTC TCAGCTCTGCCTCCCCATGTTTTCAGACAAGTGTTCTGGAGAGCACGTGC GTAGGAGAAGGACACGTGCCATGGGTTTGGTGCCTGGTTCATCGCGTTGA GGTTCAACGTTGCCTCCAACTCAGACTGTCCTCCAGACAAGAACATGATT CCGGGGACGGCAGGAGGGATTCTGTTGCGGAGGAGCTTAAGGGTGTAGGC 
GGCAACTTGTTCAGGAGTAGCTCTGTCTTTAGACTCGGCTCCAGGAGTCA CCATGCTCGGCTTCAGGAGGATACCTTCAAACATGACATTGTTCTGAGCA AGGTAAAAGAAAACCTCAGCCCAAACCTTCTCAGCTACTTCGTAAGTCCT GTCAATGTCGTGTTCTccGTCCAACAAGATCTCTGGCTCCACAATTGGGA CCAATCCACTGTCCTGTGAAATGGCAGCGTAGCGAGCAAGGCCCCAAGCA GCTTCTTTCACAGCGAG QA 17 767 1 767 DS CHROMAT_FILE: LL2311f PHD_FILE: LL2311f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:07 2000 RD LL2311r 783 0 0 cTGGCTCCACAATTGGGACCAATCCACTGTCCTGTGAAATGGCAGCGTAG CGAGCAAGGCCCCAAGCAGCTTCTTTCACAGCGAGCGCAGACGGGCCGTT GGGAATGCTCACGACAGTACGCCATttaGCGAAACGAGCACCCTGTTGGT AGTAAGCAGCAGTGCGAGAGGATAGACCATCGAGTCCTTGACACCATGAC TCATTGTTAGAGCCAACAAGTGGCACCAAACCCTTGTCAACTTTGATACC GGGGACGATGTTCTGCTCGACGAGGACGTCGACCATTTTCTTTCCTTCGG TGGTAGACTGATAGAGAGTCTCCTCGAACAAGATTGCCCCGGATATGTAC TGTCCGAGACCTGGTGCAGAGACAAGCAATGTCCTGTACGCTTGACGGTT TGCCTCAGTGTTCTCTAGCCCTATCGAGTCCAAACGCTTCCCGCAAGTCG CGTTCGACTCGTCCATCGCCAAGATTCCACGTCCCGGAGATGCAATTGTT TTCGCTGTCTTGACAAGCTCATCGGCGTAGGAAGAAGCGGCACGGACGGC GAGGGAGGTGGCGCGGTTGGGGAGGACGACGGATGCGGAAGAAGGCTGAC GGAAGAGAACGCTTTGGCCCTTGACCCACTCAGATTTGTCCAACACTGGA GATGCCTTGAGGAGTGAGGTAGATGCCATTTTTTTTTGTTAATGTTATCT cTTtctcttctctctgcctttggtttgtcggacgcgtggxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 1 703 1 739 DS CHROMAT_FILE: LL2311r PHD_FILE: LL2311r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:44 2000 CO Contig31 788 2 46 U ccattctttctacttttacacattgaaaatacaacaatttcaataccata tacaatacacaaaaatataaaaatttcgttTGTATAAGTTCTACAGGGAC GAGCCCTAAATAAAAATCATACACTTATTGTACGTAAACACTTCTCGAAT GAAGCAATGTGACTTGAGAACAAAGGCAAATACGGAGAGAGACTTACGAA GGGGAAGTAGAAGGTGCAGTTGCAGGGTTAAATGCCTTCTGCTCAGCGCA GCTGAGCTCCTCGTTTGGCCCCGCGTCTTCCCTTCCAATACCCAAGAGGT *CGAGGAAGTACAAGTAGTGCGAGACTATGTTGCTCATTGGCTCAATGTC ACCTTGACCACATGTGTACTCACCGTACAAGACGTTCATCGTGGTACCAA AAGTCGGGCCACGTTTGGACAAAGTATCATTCTTTGTCGGCTTCCAGTTC CCAACAAAGATGTCGTGAGCTGACGGCTGAGCTTTCTTGATCGGTGTCAT CCATCTCCAGATTGCAGCTTGGAAGGCAAGCGTCGCGTTCTGCTCGATGT ACTCGGGGTGGTTCAAGAGATCGGCTTTTAGAGCATCCCCAGCTGCACCG 
TAGTTGAAGTTCCAGTAGATGGGTAAAGCACCACGTCCGTAGTACTCAGC TCCAGGGCTGCAAGGGTACTTGTATTTCCATGTCTCGTCACAGTAAGATT GCATTGGACTCATCTCCCTGTTATAGCACAGACCCCAAGCTAAAGGCCCT CCTGTCGCAACTCCGTATCCACaggagcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 37 37 35 35 35 35 35 35 45 35 37 37 35 35 35 51 40 35 35 35 35 35 35 35 40 42 42 44 35 37 37 40 40 40 56 37 35 35 35 35 35 35 35 35 35 35 35 37 37 35 35 35 35 35 35 35 40 40 43 37 37 37 40 40 40 43 40 40 37 37 40 40 40 40 40 40 37 44 42 42 37 40 40 37 40 37 42 42 42 42 42 46 43 43 43 43 43 43 40 40 36 36 36 36 36 40 42 42 44 36 34 32 32 32 28 36 36 36 36 43 46 56 56 56 56 43 42 42 42 42 42 50 50 38 38 38 40 43 43 40 40 43 43 43 43 56 43 43 43 43 43 38 38 38 38 46 47 49 45 43 46 44 44 36 41 36 36 35 36 35 35 38 35 47 38 38 38 38 38 38 41 41 41 45 50 38 38 43 40 43 43 43 43 38 38 36 38 41 45 45 35 41 41 45 41 42 43 43 41 45 45 45 45 46 42 42 42 46 40 43 43 43 43 43 56 56 56 43 38 38 46 56 52 63 71 72 56 48 49 47 48 49 47 45 44 45 60 58 58 53 54 51 54 53 54 53 57 60 53 57 56 56 58 56 68 74 63 60 56 55 58 58 55 61 50 58 58 58 58 58 69 79 76 72 71 61 66 57 57 55 55 58 58 59 58 55 58 59 59 52 55 61 59 59 65 69 66 63 66 61 61 53 69 71 81 78 71 71 71 71 71 71 71 75 74 71 55 55 55 60 62 64 70 64 70 62 64 67 71 72 74 74 69 80 76 80 67 69 65 60 77 77 71 71 65 83 69 69 69 69 62 66 66 71 76 90 85 85 90 73 68 61 61 65 72 77 90 90 86 86 78 76 78 73 68 68 69 76 83 69 77 66 72 57 57 55 58 61 63 70 70 72 78 74 81 82 90 90 78 76 76 82 68 68 69 79 82 82 90 88 86 82 82 82 76 74 72 68 66 66 75 75 69 73 71 71 72 72 75 64 74 58 60 50 54 56 72 66 74 69 69 64 71 67 67 67 72 70 73 84 86 75 75 73 75 81 83 90 87 87 85 85 85 90 87 83 69 63 63 67 67 64 60 75 73 80 79 79 78 85 77 90 81 81 84 80 63 60 54 54 54 66 66 68 76 78 78 65 68 67 75 64 64 73 68 69 69 68 64 67 63 83 55 59 65 65 69 69 69 69 69 69 69 72 72 66 71 77 90 88 72 69 69 64 64 69 69 70 
72 83 78 64 62 62 60 55 60 64 78 74 79 71 69 73 73 72 69 69 67 64 64 67 85 72 67 67 67 67 58 58 59 61 63 68 64 69 72 69 65 63 68 76 76 76 73 74 72 72 63 66 66 67 68 67 75 56 54 54 55 61 60 63 62 56 55 58 80 58 59 58 58 58 59 62 65 65 65 69 66 63 62 55 71 71 71 71 71 68 61 56 55 53 53 52 53 57 59 61 66 66 71 75 71 78 75 73 73 68 68 63 63 61 60 59 56 47 60 35 37 30 45 44 50 50 50 50 50 50 55 58 63 59 55 50 50 35 35 35 35 40 45 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2314f U 1 AF LL2314r C 62 BS 1 235 LL2314f BS 236 243 LL2314r BS 244 253 LL2314f BS 254 254 LL2314r BS 255 311 LL2314f BS 312 327 LL2314r BS 328 424 LL2314f BS 425 425 LL2314r BS 426 426 LL2314f BS 427 427 LL2314r BS 428 457 LL2314f BS 458 460 LL2314r BS 461 471 LL2314f BS 472 472 LL2314r BS 473 476 LL2314f BS 477 480 LL2314r BS 481 493 LL2314f BS 494 496 LL2314r BS 497 499 LL2314f BS 500 502 LL2314r BS 503 508 LL2314f BS 509 509 LL2314r BS 510 515 LL2314f BS 516 523 LL2314r BS 524 534 LL2314f BS 535 537 LL2314r BS 538 538 LL2314f BS 539 546 LL2314r BS 547 549 LL2314f BS 550 554 LL2314r BS 555 557 LL2314f BS 558 572 LL2314r BS 573 573 LL2314f BS 574 591 LL2314r BS 592 592 LL2314f BS 593 602 LL2314r BS 603 603 LL2314f BS 604 620 LL2314r BS 621 621 LL2314f BS 622 639 LL2314r BS 640 640 LL2314f BS 641 641 LL2314r BS 642 642 LL2314f BS 643 748 LL2314r BS 749 751 LL2314f BS 752 788 LL2314r RD LL2314f 773 0 0 ccattctttctacttttacacattgaaaatacaacaatttcaataccata tacaatacacaaaaatataaaaatttcgttTGTATAAGTTCTACAGGGAC GAGCCCTAAATAAAAATCATACACTTATTGTACGTAAACACTTCTCGAAT GAAGCAATGTGACTTGAGAACAAAGGCAAATACGGAGAGAGACTTACGAA GGGGAAGTAGAAGGTGCAGTTGCAGGGTTAAATGCCTTCTGCTCAGCGCA GCTGAGCTCCTCGTTTGGCCCCGCGTCTTCCCTTCCAATACCCAAGAGGT *CGAGGAAGTACAAGTAGTGCGAGACTATGTTGCTCATTGGCTCAATGTC ACCTTGACCACATGTGTACTCACCGTACAAGACGTTCATCGTGGTACCAA AAGTCGGGCCACGTTTGGACAAAGTATCATTCTTTGTCGGCTTCCAGTTC CCAACAAAGATGTCGTGAGCTGACGGCTGAGCTTTCTTGATCGGTGTCAT CCATCTCCAGATTGCAGCTTGGAAGGCAAGCGTCGCGTTCTGCTCGATGT ACTCGGGGTGGTTCAAGAGATCGGCTTTTAGAGCATCCCCAGCTGCACCG 
TAGTTGAAGTTCCAGTAGATGGGTAAAGCACCACGTCCGTAGTACTCAGC TCCAGGGCTGCAAGGGTACTTGTATTTCCATGTCTCGTCACAGTAAGATT GCATTGGACTCATCTCCCTGTTATAGCACAGACCCCAAGCTAAAGGCCCT CCTGTCGCAACTCCGTattcacn QA 81 766 1 772 DS CHROMAT_FILE: LL2314f PHD_FILE: LL2314f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:43 2000 RD LL2314r 772 0 2 aaaat*taaaaatt*cgtttgt*taagtt*tccaggc*cgagcc*taa*t aaaa*tcat*cact*attg*acgtaacc*ct*ctcgaatg*agcaatgt* acttaa*aacaaa*gcaaatccg*agaaagncttcng*a*gggaAGTaga agtt*cagt*gcagggt*aAATGCCTTCTGCTCAGcg*agCTGAGctc*t cgtttg*cCCCGCGTCTtcctttcca*tacc**agaggtncgagnaagTA CAAGTAGTGCGAGACTATGTTGCTCATTGGCTCAATGTCACCTTGACCAC ATGTGTACTCACCGTACAAGACGTTCATCGTGGTACCAAAAGTCGGGCCA CGTTTGGACAAAGTATCATTCTTTGTCGGCTTCCAGTTCCCAACAAAGAT GTCGTGAGCTGACGGCTGAGCTTTCTTGATCGGTGTCATCCATCTCCAGA TTGCAGCTTGGAAGGCAAGCGTCGCGTTCTGCTCGATGTACTCGGGGTGG TTCAAGAGATCGGCTTTTAGAGCATCCCCAGCTGCACCGTAGTTGAAGTT CCAGTAGATGGGTAAAGCACCACGTCCGTAGTACTCAGCTCCAGGGCTGC AAGGGTACTTGTATTTCCATGTCTCGTCACAGTAAGATTGCATTGGACTC ATCTCCCTGTTATAGCACAGACCCCAAGCTAAAGGCCCTCCTGTCGCAAC TCCGTATCCACaggagcggacgcgtggxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxatt QA 267 711 1 727 DS CHROMAT_FILE: LL2314r PHD_FILE: LL2314r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:21 2000 RT{ LL2314r compression phrap 206 208 000919:094547 } RT{ LL2314r compression phrap 154 156 000919:094547 } CO Contig32 1387 2 16 U ccatccccagctttgttaataatgcatttacatatacagaagattctaga gcagagtaacaatgtgagattggagatcatTCTTCATTTACTTTCTCCTA GAGAAATCATCCCATGGGTCTTCTTGACCAAAGATCTCGTCGATGATGTT GGAAGTTGGAAGAAAATCAATGTTCTTACTCCAACAATGATCAAATGAAT AAAAATGATCCTCCACAAGATCATTGCAATGGCTCCAGTGATTCCCAACA TCGGATTCCTCTCTGTTTACATTGGACTGGAAGTCATCGGACATGCACGA ATCTTCTTCTACAATCACCTCTTCTTCTTCATCATCTCTTGAGAGGCTTT TTCCGGTTTTATCAACTGGAGTTTCAGGACCATAAAACGTGTTGTCATAC TCTACTTGCCACTCATCATCGTCACTATCGATGTCCAGGCTTTTAGAACG ATTCTGAACAGCTCTTCTTCGCAGGATGAAGACGTTGAAATAATAGCTCA CAATCTCCTTCATGGTTCGTGAAGGAAACGCAGACTTTAGGTGTTTCCAG AAATCACGGTTCAGTGAAACAGGGTTGGAGTAAACAACCTCGTGGAATAG 
ATCTTCCTCGTCCTCGGTTAGTTTACCAGCGACTTCCTCTCCCATTTCAC CGAGACCTATGTTCAAACATCTTTCGTATCCGATAGTCTCAAACAAGCCT TCTCGGTTTTCAATGATGTGCTGCTGCACGCATCTGATGGAACCTTTATC CACGCAAATGCATTCTTTTCTTCCTTTACCAATTTCGAAGAACTCGGTTT CATGGTCAGGCATTGGAATCACGCACTTGCCGATCAGTTGTTCTTCGTTG TGATCAACAACATCACGGACTTCTTCCTTGACACAATCAGGAATATTAGC CTGGTAATCAGACCCGATTCGAACTTGTTTTCTAGGAGAACTCTGACACC AATCTGACTCGGGAGATTCGTGAGAAAGGGTAGACTGAGTCGTAGCATCC TCCTCAGATGAGATCCAGGAGAAAGGTGCCTTGGTCTCAGACCCCTTCTC AACACTATCCTCACATTCAAGCCCGTAGATATCACTCAGATCCTCTCCTA TGACAAGACCAGACTTCTCCAGTGAAACATGATATGGAACTCCTTCGTCT AGCTTGGCCAGCTTATTGCAATAACTAATCTCTCTTGCATGCTTCATGTT GAGCTCTTGCACATCCTCAGCTTCAAAAGTGCGCTTAACACCCATATTTT TTTGTACAATTGATTCGAAGATAGATCTGGAAGAAGGAAGAGATCTGAAC ACGAATTCAAGAAAAAATAAATTTGGGGGAAATTTCGTGCTTTTCgatct ggggaagacccggcccggattaaattcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 39 56 40 46 40 40 40 40 40 40 40 40 45 45 51 51 46 46 40 40 40 40 51 51 51 46 40 40 40 40 40 51 51 51 51 51 56 51 51 45 45 45 45 51 40 40 39 39 39 39 39 40 56 51 45 40 40 40 43 43 43 43 43 43 43 43 43 43 56 56 56 56 56 56 56 56 51 51 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 51 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 56 56 56 56 56 56 51 51 51 45 45 51 51 45 45 51 51 51 51 51 45 45 45 43 41 42 56 40 43 43 43 43 43 51 56 56 56 51 51 51 51 51 51 51 51 51 43 43 46 46 46 43 56 51 51 51 51 51 51 51 51 51 51 43 43 43 43 43 43 56 43 43 43 43 43 43 43 51 51 51 51 51 45 45 45 36 36 36 43 43 43 45 43 43 43 43 43 43 51 51 51 51 56 56 56 43 43 43 43 43 43 45 45 43 43 43 43 45 45 51 51 51 51 51 51 51 51 56 56 56 51 51 51 51 51 51 56 56 42 42 41 43 43 43 43 43 43 56 51 51 51 51 56 56 56 56 56 56 56 56 43 43 43 46 46 46 56 56 56 51 45 45 45 45 45 45 45 45 43 51 42 56 51 51 51 51 51 51 56 56 56 56 56 56 45 45 45 45 45 45 51 51 51 45 43 43 43 43 43 43 43 43 56 56 51 51 51 51 51 45 45 
45 45 45 45 45 51 51 51 56 56 56 56 56 45 45 51 46 45 41 44 42 42 41 45 45 45 45 45 51 56 56 51 51 51 51 43 43 43 43 43 40 56 50 50 40 40 45 45 45 45 51 51 56 56 56 56 45 45 45 40 40 40 46 56 51 51 40 40 45 45 40 40 40 39 39 35 46 43 45 46 51 51 51 51 51 51 56 56 56 51 56 56 43 51 43 43 42 51 51 56 56 51 40 40 37 37 40 37 45 40 45 40 40 46 42 42 42 43 44 40 40 45 45 45 40 40 42 56 56 56 45 45 45 45 38 38 42 46 46 56 56 44 56 56 56 56 56 56 56 47 47 47 56 56 40 37 37 37 37 40 56 51 51 51 51 43 43 42 42 42 56 42 42 42 37 37 34 40 40 40 40 40 40 40 40 40 29 29 34 37 40 40 25 20 20 25 27 27 29 29 25 29 29 29 29 29 29 29 27 27 25 22 22 25 32 29 29 29 29 32 32 32 34 34 44 49 52 58 56 57 50 51 48 56 63 67 69 47 47 50 42 40 41 64 66 66 69 60 65 64 65 65 56 56 50 44 40 50 48 53 65 69 54 57 56 57 52 55 55 44 41 35 50 42 42 44 47 50 53 56 52 51 49 50 50 58 61 69 69 69 69 69 78 72 70 73 69 64 62 62 62 62 52 59 57 62 62 69 75 88 88 80 80 58 42 42 37 37 40 44 46 47 47 60 61 55 44 49 43 46 38 44 47 51 51 53 51 47 47 49 63 63 63 49 49 63 25 25 29 32 29 32 24 29 24 32 34 40 32 32 32 34 34 31 40 40 40 46 37 40 37 37 37 37 37 37 40 37 34 34 29 29 32 34 32 32 32 32 40 25 22 22 25 32 32 32 32 29 34 31 36 40 39 31 31 31 39 39 40 32 32 32 32 25 25 34 40 32 32 32 34 34 39 40 40 46 46 46 51 40 46 42 42 42 42 42 42 46 56 51 51 40 45 45 45 51 51 56 56 56 56 56 56 56 56 44 44 44 42 42 46 43 51 51 56 51 51 51 51 46 40 35 35 35 39 39 45 51 40 40 40 40 37 37 51 51 51 56 56 56 56 56 45 40 40 40 40 40 40 56 56 56 56 56 56 56 56 56 44 56 56 56 56 56 56 56 42 46 46 51 43 43 42 43 43 51 56 56 56 56 51 51 51 51 51 51 51 51 51 56 56 56 45 40 40 40 40 40 51 56 56 56 56 45 45 45 45 45 45 45 51 45 45 45 45 45 43 43 43 43 43 43 41 51 51 51 51 51 56 51 51 51 45 45 45 51 51 51 56 56 56 56 56 56 56 56 56 56 56 45 45 45 45 45 51 51 45 43 43 43 43 43 43 51 51 51 56 56 56 56 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 51 56 56 56 56 56 51 51 51 51 51 45 51 45 45 45 45 45 45 56 51 51 51 51 45 45 45 45 45 
45 56 56 56 56 56 56 56 51 51 56 56 56 45 45 45 45 45 45 51 51 51 51 51 51 51 51 51 43 36 36 36 36 36 43 51 51 56 56 56 56 56 56 56 51 51 51 51 51 51 56 56 56 45 43 43 43 43 43 43 43 43 43 43 45 45 45 43 43 43 45 43 43 51 51 43 43 43 43 43 43 43 43 43 43 45 45 45 45 51 51 51 51 51 45 45 45 46 43 43 56 56 56 56 56 51 51 51 51 43 43 43 43 43 43 45 43 43 43 43 40 40 56 56 56 56 45 45 45 45 45 45 56 56 56 40 38 38 35 35 35 35 35 35 35 40 40 40 40 56 56 56 51 51 51 51 51 51 56 56 56 51 51 51 51 51 51 51 51 51 51 51 51 45 45 45 45 45 45 40 40 40 40 40 40 39 39 39 39 39 40 51 51 51 40 40 40 40 40 40 45 45 45 35 35 35 35 35 39 45 45 45 45 45 45 51 51 51 51 51 51 51 51 51 56 39 39 39 39 40 40 51 51 56 56 51 51 51 51 51 51 56 56 40 39 39 39 39 39 46 40 40 29 24 23 13 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2319f U 1 AF LL2319r C 645 BS 1 654 LL2319f BS 655 655 LL2319r BS 656 663 LL2319f BS 664 664 LL2319r BS 665 676 LL2319f BS 677 682 LL2319r BS 683 696 LL2319f BS 697 697 LL2319r BS 698 704 LL2319f BS 705 728 LL2319r BS 729 735 LL2319f BS 736 738 LL2319r BS 739 744 LL2319f BS 745 759 LL2319r BS 760 763 LL2319f BS 764 1387 LL2319r RD LL2319f 774 0 0 ccatccccagctttgttaataatgcatttacatatacagaagattctaga gcagagtaacaatgtgagattggagatcatTCTTCATTTACTTTCTCCTA GAGAAATCATCCCATGGGTCTTCTTGACCAAAGATCTCGTCGATGATGTT GGAAGTTGGAAGAAAATCAATGTTCTTACTCCAACAATGATCAAATGAAT AAAAATGATCCTCCACAAGATCATTGCAATGGCTCCAGTGATTCCCAACA TCGGATTCCTCTCTGTTTACATTGGACTGGAAGTCATCGGACATGCACGA ATCTTCTTCTACAATCACCTCTTCTTCTTCATCATCTCTTGAGAGGCTTT TTCCGGTTTTATCAACTGGAGTTTCAGGACCATAAAACGTGTTGTCATAC TCTACTTGCCACTCATCATCGTCACTATCGATGTCCAGGCTTTTAGAACG ATTCTGAACAGCTCTTCTTCGCAGGATGAAGACGTTGAAATAATAGCTCA CAATCTCCTTCATGGTTCGTGAAGGAAACGCAGACTTTAGGTGTTTCCAG AAATCACGGTTCAGTGAAACAGGGTTGGAGTAAACAACCTCGTGGAATAG ATCTTCCTCGTCCTCGGTTAGTTTACCAGCGACTTCCTCTCCCATTTCAC CGAGACCTATGTTCAAACATCTTTCGTATCCGATAGTCTCAAACAAGCCT TCTCGGTTTTCAATGATGTGCTGCTGCACGCATCTGATGGAACCTTTATC CACGCAAATGCATTCTTTTCTtcc QA 81 
770 1 774 DS CHROMAT_FILE: LL2319f PHD_FILE: LL2319f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:07 2000 RD LL2319r 787 0 0 tttcACCGAGACCTATGTTCAAACATCTTTCGTATCCGATAGTCTCAAAC AAGCCTTCTCGGTTTTCAATGATGTGCTGCTGCACGCATCTGATGGAACC TTTATCCACGCAAATGCATTCTTTTCTTCCTTTACCAATTTCGAAGAACT CGGTTTCATGGTCAGGCATTGGAATCACGCACTTGCCGATCAGTTGTTCT TCGTTGTGATCAACAACATCACGGACTTCTTCCTTGACACAATCAGGAAT ATTAGCCTGGTAATCAGACCCGATTCGAACTTGTTTTCTAGGAGAACTCT GACACCAATCTGACTCGGGAGATTCGTGAGAAAGGGTAGACTGAGTCGTA GCATCCTCCTCAGATGAGATCCAGGAGAAAGGTGCCTTGGTCTCAGACCC CTTCTCAACACTATCCTCACATTCAAGCCCGTAGATATCACTCAGATCCT CTCCTATGACAAGACCAGACTTCTCCAGTGAAACATGATATGGAACTCCT TCGTCTAGCTTGGCCAGCTTATTGCAATAACTAATCTCTCTTGCATGCTT CATGTTGAGCTCTTGCACATCCTCAGCTTCAAAAGTGCGCTTAACACCCA TATTTTTTTGTACAATTGATTCGAAGATAGATCTGGAAGAAGGAAGAGAT CTGAACACGAATTCAAGAAAAAATAAATTTGGGGGAAATTTCGTGCTTTT Cgatctggggaagacccggcccggattaaattcggacgcgtggxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 1 707 1 743 DS CHROMAT_FILE: LL2319r PHD_FILE: LL2319r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:44 2000 CO Contig33 558 2 92 U CTTTTTTTTTTTTTTTTTGAGAAAAGAATAAATACATTATTATGCTTTTC ATCTGCTCAGATAAGTGCAATGCAGAAGAAATTGCAACAATGTTGTCCAC TGGAAGAAAAGACGCAGAATAAAAAAATAATATATAAGCTTTAAAAAGAA AAACAACGAGAAGAGAAACTCAAAAACAATGTTTGATTGTACGATACAAA ATACTATTGCATTTTCGGTTATTTTATTATATAGATATTCATCGGAGGAA TTGTTTACGCAACCGTCGACCGCCGTCGGCCGTAGCTCTCTACGCCGTCT TACCTTGCCGTTTCTCTTCTTCGATCTTGGCCTTAATGACAGAGTAGAGT GCAACGCCAGCAATGGCTATTCCAGTTCCGATACCTGTCTGCGTCGATAT CTTGTTTCCGAAGATAACGATGGAGAATCCGATCACGAAGACACGTTTCA GAACGTTTCCAACAGCGTGAGTCAACGGTGCTACCCTCTCCAACGTATTA GTAGCCAACTGATTGTAGAGATGGTAAAACATTCCAACCCAGAAGAGCGG ACGCGTGG BQ 42 56 56 56 56 56 56 48 48 48 48 48 40 40 40 34 29 29 34 40 40 48 48 48 40 40 40 40 40 55 55 55 57 54 54 55 79 79 79 79 75 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 89 73 72 72 72 75 86 86 86 86 90 85 83 83 77 80 80 90 90 90 90 90 90 85 76 76 77 76 82 90 90 90 90 90 90 90 71 71 66 51 45 45 45 45 66 66 66 90 90 90 70 70 49 47 46 57 
62 74 86 77 77 77 77 77 77 90 90 90 90 86 86 86 86 88 88 88 80 80 80 80 90 80 86 86 88 90 90 90 90 90 90 90 90 90 90 90 86 85 85 85 85 86 90 90 90 90 90 90 90 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 90 90 90 90 90 90 78 78 78 78 78 78 90 90 90 90 90 90 90 90 90 90 90 86 90 90 88 88 90 90 78 78 82 81 78 78 90 90 90 90 90 90 90 88 90 90 90 85 85 90 85 85 90 90 90 90 90 88 88 88 88 88 86 90 90 90 90 90 90 90 90 90 79 78 78 78 78 78 79 88 88 88 88 86 86 83 86 86 86 79 79 79 88 86 86 86 86 86 86 88 86 86 86 86 90 90 90 82 79 79 79 79 86 79 81 87 86 90 90 90 90 88 90 86 86 86 86 86 86 86 90 90 90 90 90 90 90 90 90 90 90 89 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 90 87 89 85 85 85 83 83 83 86 86 86 86 86 86 86 86 86 83 81 81 83 83 84 83 83 83 83 90 90 90 90 90 90 85 85 89 89 88 84 90 90 88 88 90 90 88 86 83 83 73 73 78 90 90 90 90 90 90 90 90 90 90 90 83 85 80 83 78 80 85 90 90 83 83 88 85 85 83 90 89 90 90 90 90 83 90 90 90 90 90 90 90 90 90 85 82 87 87 75 74 74 76 74 74 80 80 80 80 73 78 78 79 74 74 90 86 86 86 75 76 84 84 84 76 72 76 81 81 81 82 79 80 80 80 71 71 70 72 72 75 72 80 90 90 90 90 90 88 78 78 72 72 72 72 77 77 72 70 70 77 74 79 73 76 76 76 84 84 84 86 82 82 87 69 66 63 74 74 79 83 79 82 82 87 90 90 90 90 90 88 90 90 90 90 90 90 90 90 88 85 85 82 66 66 66 68 66 80 77 49 50 54 35 46 56 AF LL2321r C -190 AF LL2321f U 15 BS 1 33 LL2321r BS 34 35 LL2321f BS 36 46 LL2321r BS 47 47 LL2321f BS 48 64 LL2321r BS 65 75 LL2321f BS 76 108 LL2321r BS 109 114 LL2321f BS 115 153 LL2321r BS 154 154 LL2321f BS 155 172 LL2321r BS 173 173 LL2321f BS 174 179 LL2321r BS 180 182 LL2321f BS 183 188 LL2321r BS 189 189 LL2321f BS 190 196 LL2321r BS 197 198 LL2321f BS 199 212 LL2321r BS 213 213 LL2321f BS 214 219 LL2321r BS 220 220 LL2321f BS 221 222 LL2321r BS 223 226 LL2321f BS 227 227 LL2321r BS 228 230 LL2321f BS 231 232 LL2321r BS 233 233 LL2321f BS 234 245 LL2321r BS 246 268 LL2321f BS 269 269 LL2321r BS 270 272 LL2321f BS 273 275 LL2321r BS 276 287 LL2321f 
BS 288 295 LL2321r BS 296 300 LL2321f BS 301 305 LL2321r BS 306 313 LL2321f BS 314 319 LL2321r BS 320 327 LL2321f BS 328 331 LL2321r BS 332 332 LL2321f BS 333 333 LL2321r BS 334 337 LL2321f BS 338 340 LL2321r BS 341 345 LL2321f BS 346 351 LL2321r BS 352 360 LL2321f BS 361 371 LL2321r BS 372 376 LL2321f BS 377 378 LL2321r BS 379 381 LL2321f BS 382 384 LL2321r BS 385 390 LL2321f BS 391 392 LL2321r BS 393 406 LL2321f BS 407 413 LL2321r BS 414 415 LL2321f BS 416 417 LL2321r BS 418 418 LL2321f BS 419 421 LL2321r BS 422 427 LL2321f BS 428 428 LL2321r BS 429 433 LL2321f BS 434 434 LL2321r BS 435 437 LL2321f BS 438 447 LL2321r BS 448 451 LL2321f BS 452 452 LL2321r BS 453 455 LL2321f BS 456 463 LL2321r BS 464 466 LL2321f BS 467 467 LL2321r BS 468 468 LL2321f BS 469 469 LL2321r BS 470 477 LL2321f BS 478 479 LL2321r BS 480 485 LL2321f BS 486 491 LL2321r BS 492 506 LL2321f BS 507 509 LL2321r BS 510 515 LL2321f BS 516 517 LL2321r BS 518 526 LL2321f BS 527 527 LL2321r BS 528 529 LL2321f BS 530 544 LL2321r BS 545 545 LL2321f BS 546 548 LL2321r BS 549 549 LL2321f BS 550 552 LL2321r BS 553 558 LL2321f RD LL2321r 793 0 2 ccctcttcgctattacgccagctggcgaaagggggatgtgctgcaaggcg attaagttgggtaacgccagggttttxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxCTTTTTTTT TTTTTTTTTGAGAAAAGAATAAATACATTATTATGCTTTTCATCTGCTCA GATAAGTGCAATGCAGAAGAAATTGCAACAATGTTGTCCACTGGAAGAAA AGACGCAGAATAAAAAAATAATATATAAGCTTTAAAAAGAAAAACAACGA GAAGAGAAACTCAAAAACAATGTTTGATTGTACGATACAAAATACTATTG CATTTTCGGTTATTTTATTATATAGATATTCATCGGAGGAATTGTTTACG CAACCGTCGACCGCCGTCGGCCGTAGCTCTCTACGCCGTCTTACCTTGCC GTTTCTCTTCTTCGATCTTGGCCTTAATGACAGAGTAGAGTGCAACGCCA GCAATGGCTATTCCAGTTCCGATACCTGTCTGCGTCGATATCTTGTTTCC GAAGATAACGATGGAGAATCCGATCACGAAGACACGTTTCAGAACGTTTC CAACAGCGTGAGTCAACGGTGCTACCCTCTCCAACGTATTAGTAGCCAAC TGATTGTAGAGATGGTAAAACATTCCAACCCAGAAGAGCGGACGCGTGGx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 7 749 192 749 DS CHROMAT_FILE: LL2321r PHD_FILE: LL2321r.phd.1 
CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:17 2000 RT{ LL2321r chimera phrap 2 76 000919:094547 } RT{ LL2321r matchElsewhereHighQual phrap 2 76 000919:094547 } RD LL2321f 794 0 3 cttttanggaaacatAAATACATTATTATGCTTTTCATCTGCTCAGATAA GTGCAATGCAGAAGAAATTGCAACAATGTTGTCcagtGGAAGAAAAGACG CAGAATAAAAAAATAATATATAAGCTTTAAAAAGAAAAACAACGAGAAGA GAAACTCAAAAACAATGTTTGATTGTACGATACAAAATACTATTGCATTT TCGGTTATTTTATTATATAGATATTCATCGGAGGAATTGTTTACGCAACC GTCGACCGCCGTCGGCCGTAGCTCTCTACGCCGTCTTACCTTGCCGTTTC TCTTCTTCGATCTTGGCCTTAATGACAGAGTAGAGTGCAACGCCAGCAAT GGCTATTCCAGTTCCGATACCTGTCTGCGTCGATATCTTGTTTCCGAAGA TAACGATGGAGAATCCGATCACGAAGACACGTTTCAGAACGTTTCCAACA GCGTGAGTCAACGGTGCTACCCTCTCCAACGTATTAGTAGCCAACTGATT GTAGAGATGGTAAAACATTCCAACCCAGAAGAGCGGACGCGTGGxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxgcctgtggtgcctactgagtgagctaactcacattaattgcgt tgcgctcactgccccgctttccagtcggaaacctgtcgtgccag QA 15 548 14 544 DS CHROMAT_FILE: LL2321f PHD_FILE: LL2321f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:38 2000 RT{ LL2321f chimera phrap 708 794 000919:094547 } RT{ LL2321f matchElsewhereHighQual phrap 779 794 000919:094547 } RT{ LL2321f matchElsewhereHighQual phrap 723 764 000919:094547 } CO Contig34 956 2 50 U cttggtaataagaatcacaacatgaacaaactcaggagagtacatgaaca tgaacaagtgagtacaggtaagaaggtacaACAAGCACAGACGCAAAGAA AAAACAATCTGAAACCAGTTTGGTAGTAACCACCTTTTTGAAACTTAAAA ATGGAACTTTTAAGATTCAGGGACGGCTTCTACAGCAGCAGCAGCAGCAG TAGCCGCAGCAGCGGAGCGTTTGATGGAAGCAGATTTGATGAGGTGATTG TAACTTCCTTTCTCCTTGAAAAGCTGAGCAAAGTGGTGTTTGCAGTACAA GATCCCTTCAAGAGCTGCGTAGTTGGATGGAGAGATAGGACAGCCTCCGT GCGAACACTTGAAGCAAGACTTGTGGTAACACTGGCTCTCCACAGTTACC TTCTCGATAGGGTACACTGTTTTACTGCAAGTAGCGCATTTATCTTGCGT TCCAGAGAACATTCCAGCAACTCGGCTAGGTGTTCTTGTCAGCTCAGGTG TTGGTTTGTCTGTTAATGGCTTTGCAGGTGATTGAAAGTTCTTGTTGAAG CTACCAGTCTCCTTGAAGAGCTGCTCAAAGTGAGGCTTACAGTACAACAC ACCTTCCATTGATGAATAGTTGCTCAGTTGAAGGGTGGATTTGCAGTGAG 
AGCACTTGAAGCAAGACTTGTGGAAAGGGACACCATCAGCTGAGAGAAGC TCCATAGGGTACACTGTCTTCTCACACGCCCTGCACTTCTGCTGTGTTCC TGTAAACGACATCGTTTTAGTCTTCCTTCTTCACGATCTGTCCTTCCTCT CCTCGCTCACTCTCCGTGGATCTGAACAGAAAATGCGGAATCAAAAAAGA AGAAGTTAGATGTGTAGAAGACGAAGTTAAAGATGAGATGTAATGATTAT TGAGAGCACGTTCgacacacttatgagacagatagtattctaaagcggac gcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 44 46 42 42 37 37 39 37 35 35 35 35 35 35 35 35 35 39 36 35 35 42 42 42 37 37 35 42 35 35 37 37 37 40 40 35 35 35 35 42 47 35 35 32 32 32 32 32 35 35 40 40 37 37 42 42 50 40 37 37 37 35 35 35 32 35 35 35 35 42 42 40 40 40 37 37 37 37 35 35 40 40 40 37 42 42 41 41 41 41 37 35 35 35 29 29 27 24 26 26 32 35 35 38 38 41 41 41 41 41 41 40 38 35 32 32 32 35 35 30 23 21 21 21 23 21 26 28 29 32 27 27 32 32 51 53 55 65 65 62 64 74 73 74 63 65 65 53 55 53 56 62 66 90 75 69 69 69 82 82 79 79 73 68 68 64 63 63 63 60 66 56 58 62 58 53 62 73 71 76 71 75 75 67 71 71 81 85 75 74 76 70 70 74 71 76 73 84 86 90 89 82 75 71 71 71 68 68 65 55 53 53 54 62 60 63 70 74 78 79 84 79 79 68 68 67 67 67 68 73 70 81 78 78 76 69 76 68 52 50 48 46 34 49 43 54 59 60 68 68 68 71 72 90 76 88 75 58 53 56 56 64 59 60 60 65 63 61 64 60 56 61 58 58 60 59 65 61 61 57 65 79 82 76 78 82 88 75 80 75 90 75 72 73 76 76 76 71 74 75 87 76 80 81 81 81 81 85 89 90 70 70 62 73 67 67 64 75 66 70 70 73 76 82 82 82 84 90 90 90 90 90 81 87 71 83 83 90 90 89 84 84 81 77 82 82 80 73 78 82 78 78 70 71 65 67 67 75 82 86 82 82 82 72 73 64 67 70 70 67 75 73 79 73 81 81 81 79 79 57 63 63 56 43 39 33 29 29 47 50 77 86 78 81 78 78 70 78 78 82 90 90 88 88 90 90 90 83 74 79 72 72 72 69 72 77 90 90 90 87 82 82 86 86 80 82 90 90 90 90 86 86 82 82 82 80 82 75 75 75 80 81 77 84 82 87 87 90 90 90 90 90 90 90 90 90 90 89 89 89 90 90 90 78 73 73 73 73 77 72 75 68 73 68 68 72 81 77 77 74 80 90 90 90 90 90 90 87 84 84 82 80 90 90 90 90 80 76 85 83 83 83 84 90 88 78 78 90 
75 65 71 70 72 72 90 90 90 90 90 87 87 84 90 90 90 90 90 90 90 90 90 90 90 90 90 79 79 78 73 73 75 90 89 72 74 62 62 62 67 89 89 90 78 78 67 67 67 84 89 90 76 75 75 72 75 69 73 58 62 62 71 75 68 67 70 69 70 68 67 72 76 65 67 65 68 70 70 72 72 69 69 69 68 63 59 67 59 58 55 65 67 67 81 81 85 68 70 77 80 73 75 77 71 71 66 61 61 61 58 63 60 69 74 74 72 68 60 61 61 66 67 72 85 75 76 67 67 67 76 76 76 69 69 74 70 66 68 66 62 68 68 70 66 64 64 60 71 71 71 71 71 71 73 66 66 66 66 66 68 69 82 72 72 68 58 65 68 62 62 58 52 52 37 40 40 43 43 43 43 43 43 43 46 56 56 56 56 56 56 56 56 45 45 45 45 45 45 40 40 40 40 40 46 56 56 56 56 56 56 56 51 35 35 35 35 35 35 51 45 45 45 45 39 35 35 35 35 35 46 56 51 51 51 51 51 51 51 51 51 51 51 51 35 34 34 34 34 34 35 37 37 40 40 40 56 56 51 51 51 51 51 40 40 40 39 39 39 40 40 40 45 40 40 46 46 46 46 46 51 40 40 45 45 45 45 40 40 40 40 40 40 40 40 40 40 40 40 51 39 40 39 39 39 39 39 39 39 40 40 46 46 46 40 40 40 40 40 35 34 29 29 29 29 14 14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2322f U 1 AF LL2322r C 213 BS 1 280 LL2322f BS 281 281 LL2322r BS 282 312 LL2322f BS 313 315 LL2322r BS 316 316 LL2322f BS 317 317 LL2322r BS 318 318 LL2322f BS 319 323 LL2322r BS 324 352 LL2322f BS 353 353 LL2322r BS 354 366 LL2322f BS 367 374 LL2322r BS 375 380 LL2322f BS 381 381 LL2322r BS 382 389 LL2322f BS 390 390 LL2322r BS 391 411 LL2322f BS 412 412 LL2322r BS 413 415 LL2322f BS 416 427 LL2322r BS 428 432 LL2322f BS 433 433 LL2322r BS 434 434 LL2322f BS 435 435 LL2322r BS 436 437 LL2322f BS 438 440 LL2322r BS 441 450 LL2322f BS 451 457 LL2322r BS 458 460 LL2322f BS 461 479 LL2322r BS 480 482 LL2322f BS 483 485 LL2322r BS 486 487 LL2322f BS 488 489 LL2322r BS 490 490 LL2322f BS 491 524 LL2322r BS 525 525 LL2322f BS 526 534 LL2322r BS 535 537 LL2322f BS 538 540 LL2322r BS 541 541 LL2322f BS 542 556 LL2322r BS 557 559 LL2322f BS 560 566 LL2322r BS 567 571 LL2322f BS 572 578 LL2322r BS 579 583 LL2322f BS 584 660 
LL2322r BS 661 661 LL2322f BS 662 956 LL2322r RD LL2322f 780 0 0 cttggtaataagaatcacaacatgaacaaactcaggagagtacatgaaca tgaacaagtgagtacaggtaagaaggtacaACAAGCACAGACGCAAAGAA AAAACAATCTGAAACCAGTTTGGTAGTAACCACCTTTTTGAAACTTAAAA ATGGAACTTTTAAGATTCAGGGACGGCTTCTACAGCAGCAGCAGCAGCAG TAGCCGCAGCAGCGGAGCGTTTGATGGAAGCAGATTTGATGAGGTGATTG TAACTTCCTTTCTCCTTGAAAAGCTGAGCAAAGTGGTGTTTGCAGTACAA GATCCCTTCAAGAGCTGCGTAGTTGGATGGAGAGATAGGACAGCCTCCGT GCGAACACTTGAAGCAAGACTTGTGGTAACACTGGCTCTCCACAGTTACC TTCTCGATAGGGTACACTGTTTTACTGCAAGTAGCGCATTTATCTTGCGT TCCAGAGAACATTCccgCAACTCGGCTAGGTGTTCTTGTCAGCTCAGGTG TTGGTTTGTCTGTTAATGGCTTTGCAGGTGATTGAAAGTTCTTGTTGAAG CTACCAGTCTCCTTGAAGAGCTGCTCAAAGTGAGGCTTACAGTACAACAC ACCTTCCATTGATGAATAGTTGCTCAGTTGAAGGGTGGATTTGCAGTGAG AGCACTTGAAGCAAGACTTGTGGAAAGGGACACCATCAGCTGAGAGAAGC TCCATAGGGTACACTGTCTTCTCACACGCCCTGCACTTCTGCTGTGTTCC TGTAAACGACATcgntctaGTCTTCCTTct QA 81 763 1 780 DS CHROMAT_FILE: LL2322f PHD_FILE: LL2322f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:43 2000 RD LL2322r 789 0 0 cgGAGCGTTTGATGGAAGCAGATTTGATGAGGTGATTGTAACTTCCTTTC TCCTTGAAAAGCTGAGCAAAGTGGTGTTTGCAGTACAAGATCCCTTCAAG AGCTGCGTAGTTGGATGGAGAGATAGGACAGCCTCCGTGCGAACACTTGA AGCAAGACTTGTGGTAACACTGGCTCTCCACAGTTACCTTCTCGATAGGG TACACTGTTTTACTGCAAGTAGCGCATTTATCTTGCGTTCCAGAGAACAT TCCAGCAACTCGGCTAGGTGTTCTTGTCAGCTCAGGTGTTGGTTTGTCTG TTAATGGCTTTGCAGGTGATTGAAAGTTCTTGTTGAAGCTACCAGTCTCC TTGAAGAGCTGCTCAAAGTGAGGCTTACAGTACAACACACCTTCCATTGA TGAATAGTTGCTCAGTTGAAGGGTGGATTTGCAGTGAGAGCACTTGAAGC AAGACTTGTGGAAAGGGACACCATCAGCTGAGAGAAGCTCCATAGGGTAC ACTGTCTTCTCACACGCCCTGCACTTCTGCTGTGTTCCTGTAAACGACAT CGTTTTAGTCTTCCTTCTTCACGATCTGTCCTTCCTCTCCTCGCTCACTC TCCGTGGATCTGAACAGAAAATGCGGAATCAAAAAAGAAGAAGTTAGATG TGTAGAAGACGAAGTTAAAGATGAGATGTAATGATTATTGAGAGCACGTT Cgacacacttatgagacagatagtattctaaagcggacgcgtggxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxag QA 1 709 1 744 DS CHROMAT_FILE: LL2322r PHD_FILE: LL2322r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:21 2000 CO Contig35 645 2 99 U 
acggccgGAGAAGATTTCATTTCTTTAAATAGCAAAGAGAAAACAca*cG ACAAGAGTTCCAAAAGCAAACTAGATAAGCCAAGACTGCGTGCTGTTGTT GCTAAACAAAGCTCTTATAACAAACGGCAGACAATATCTGTGACTTTCTT CTTCCCATAAAACCCCATTAATTTTGTCATAATTAACCACTACGTCCAGG TAATCGCCACGGGACTACCCACCACGTGTTTCCCATCTGACCATTCAATG TTCCCGAAGCTAGATCCAGACGCCTTAGAAGAGTCTACGGTAAACGTCAC CGTATACGATTTCTTCTCATTAACTTCTTTGAAATTCAAAACCGCCGGTG CAACCGAAATCTTGACTGCTGTACTCTCCGAAATGACTTTAACCTTGTAG GACCCAGCTCCTCCAACGCTCGTGACAGTACGCGTGTACTTATACGCTCC GGATCCGTCAACGTTAACGGCGAAGGACGGATAGTTTAAATCCGCGACTG AGTGCGTTTTGGAGGGATCGCAAGTGAAATTGCGTCTCGAAACGCTTCTG ATCTGCGACGACGTGTAGTTCAACGCGCAGAGGAAACCTAAGTAATCCAC CGTCGttAGATCGTAAACGAGTCCTGGATTGGTGCGGACGCGTGG BQ 0 0 0 0 0 15 18 25 29 29 25 34 37 48 34 34 33 30 35 35 41 54 61 51 51 45 48 51 52 55 50 47 49 59 57 69 71 63 63 49 44 44 49 41 23 16 19 19 39 39 50 60 53 64 64 64 60 65 75 80 85 73 71 71 66 61 61 55 56 60 64 64 69 69 66 41 41 25 22 26 29 48 40 46 50 50 64 64 69 64 67 75 83 83 74 80 80 69 64 60 62 62 59 50 55 72 80 80 85 85 85 85 74 69 69 65 76 66 66 58 75 58 55 54 55 51 50 60 52 57 57 88 88 90 90 88 86 90 84 84 78 79 68 67 69 72 85 80 85 90 90 90 86 84 79 74 80 72 78 72 88 88 90 90 90 83 80 80 80 83 83 90 90 90 88 90 85 89 89 89 82 72 72 68 68 68 89 90 90 83 75 75 75 75 75 83 83 83 75 75 73 72 63 67 65 67 65 67 77 83 76 75 75 75 75 75 76 89 90 90 90 76 76 83 80 80 83 90 90 90 90 90 90 90 90 88 87 83 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 85 83 88 88 90 90 90 90 90 90 90 79 79 78 79 79 88 88 90 90 90 90 90 90 90 88 83 88 78 78 78 80 73 78 87 80 80 80 80 80 80 80 83 88 81 83 81 76 76 76 81 81 88 90 90 90 90 90 88 88 90 85 90 90 83 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 78 78 71 71 79 79 70 70 73 75 72 75 84 88 88 88 88 90 90 90 90 90 90 90 88 80 80 80 84 78 75 75 81 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 82 90 86 78 78 80 85 86 88 90 90 88 88 88 88 90 90 90 90 90 83 83 83 80 80 80 80 83 80 80 89 89 90 90 88 83 82 78 78 78 78 78 82 82 85 85 85 85 83 83 83 83 85 83 83 
88 90 90 90 83 88 88 83 81 81 88 88 83 80 80 80 85 85 82 83 75 75 80 84 90 90 78 80 80 80 79 89 90 90 90 75 75 77 77 80 77 88 75 75 72 72 72 75 77 79 75 77 70 69 64 64 64 64 72 90 90 90 90 87 87 87 83 69 64 64 56 56 50 57 77 80 80 90 90 90 68 64 53 52 52 57 64 75 75 73 75 75 75 75 75 75 79 85 88 90 89 89 90 72 65 60 60 60 65 73 76 79 79 79 71 64 72 72 73 73 79 75 59 60 67 62 62 64 67 72 72 77 80 70 64 64 67 64 69 65 64 56 59 61 66 66 54 62 56 52 53 56 40 38 23 21 15 15 24 29 47 47 44 58 58 53 56 58 68 63 63 63 63 63 59 72 65 69 69 80 67 61 51 48 50 59 63 63 69 65 45 45 45 32 33 32 AF LL2323r C -108 AF LL2323f U 13 BS 1 14 LL2323r BS 15 19 LL2323f BS 20 21 LL2323r BS 22 23 LL2323f BS 24 30 LL2323r BS 31 36 LL2323f BS 37 39 LL2323r BS 40 78 LL2323f BS 79 85 LL2323r BS 86 92 LL2323f BS 93 95 LL2323r BS 96 132 LL2323f BS 133 143 LL2323r BS 144 154 LL2323f BS 155 155 LL2323r BS 156 176 LL2323f BS 177 180 LL2323r BS 181 187 LL2323f BS 188 190 LL2323r BS 191 205 LL2323f BS 206 208 LL2323r BS 209 211 LL2323f BS 212 224 LL2323r BS 225 231 LL2323f BS 232 239 LL2323r BS 240 241 LL2323f BS 242 247 LL2323r BS 248 250 LL2323f BS 251 255 LL2323r BS 256 259 LL2323f BS 260 264 LL2323r BS 265 273 LL2323f BS 274 280 LL2323r BS 281 282 LL2323f BS 283 283 LL2323r BS 284 284 LL2323f BS 285 285 LL2323r BS 286 289 LL2323f BS 290 292 LL2323r BS 293 300 LL2323f BS 301 311 LL2323r BS 312 314 LL2323f BS 315 315 LL2323r BS 316 317 LL2323f BS 318 318 LL2323r BS 319 324 LL2323f BS 325 327 LL2323r BS 328 338 LL2323f BS 339 340 LL2323r BS 341 345 LL2323f BS 346 347 LL2323r BS 348 354 LL2323f BS 355 373 LL2323r BS 374 374 LL2323f BS 375 376 LL2323r BS 377 384 LL2323f BS 385 387 LL2323r BS 388 400 LL2323f BS 401 401 LL2323r BS 402 402 LL2323f BS 403 403 LL2323r BS 404 409 LL2323f BS 410 424 LL2323r BS 425 429 LL2323f BS 430 449 LL2323r BS 450 453 LL2323f BS 454 454 LL2323r BS 455 456 LL2323f BS 457 459 LL2323r BS 460 461 LL2323f BS 462 468 LL2323r BS 469 469 LL2323f BS 470 471 LL2323r BS 472 473 
LL2323f BS 474 476 LL2323r BS 477 484 LL2323f BS 485 488 LL2323r BS 489 502 LL2323f BS 503 507 LL2323r BS 508 516 LL2323f BS 517 524 LL2323r BS 525 526 LL2323f BS 527 536 LL2323r BS 537 546 LL2323f BS 547 548 LL2323r BS 549 549 LL2323f BS 550 551 LL2323r BS 552 553 LL2323f BS 554 560 LL2323r BS 561 570 LL2323f BS 571 599 LL2323r BS 600 600 LL2323f BS 601 601 LL2323r BS 602 612 LL2323f BS 613 624 LL2323r BS 625 625 LL2323f BS 626 628 LL2323r BS 629 630 LL2323f BS 631 645 LL2323r RD LL2323r 799 0 0 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxctttttnnnnnnnnnnnnnnnn ntntntnnnacggccgGAGAAGATtTCATTTCTTTAAATAGCAAAGAGAA AACacaacGACAAGAGTTCCAAAAGCAAACTAGATAAGCCAAGACTGCGT GCTGTTGTTGCTAAACAAAGCTCTTATAACAAACGGCAGACAATATCTGT GACTTTCTTCTTCCCATAAAACCCCATTAATTTTGTCATAATTAACCACT ACGTCCAGGTAATCGCCACGGGACTACCCACCACGTGTTTCCCATCTGAC CATTCAATGTTCCCGAAGCTAGATCCAGACGCCTTAGAAGAGTCTACGGT AAACGTCACCGTATACGATTTCTTCTCATTAACTTCTTTGAAATTCAAAA CCGCCGGTGCAACCGAAATCTTGACTGCTGTACTCTCCGAAATGACTTTA ACCTTGTAGGACCCAGCTCCTCCAACGCTCGTGACAGTACGCGTGTACTT ATACGCTCCGGATCCGTCAACGTTAACGGCGAAGGACGGATAGTTTAAAT CCGCGACTGAGTGCGTTTTGGAGGGATCGCAAGTGAAATTGCGTCTCGAA ACGCTTCTGATCTGCGACGACGTGTAGTTCAACGCGCAGAGGAAACCTAA GTAATCCACCGTcgc*agATCGTAAACGAGTCCTGGATTGGTGCGGACGC GTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxagt QA 115 754 110 754 DS CHROMAT_FILE: LL2323r PHD_FILE: LL2323r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:25 2000 RD LL2323f 784 0 0 gaTTTCATTTCTTTAAATAGCAAAGAGAAAACAca*cGACAAGAGTTCCA AAAGCAAACTAGATAAgcgaagACTGCGTGCTGTTGTTGCTAAACAAAGC TCTTATAACAAACGGCAGACAATATCTGTGACTTTCTTCTTCCCATAAAA CCCCATTAATTTTGTCATAATTAACCACTACGTCCAGGTAATCGCCACGG GACTACCCACCACGTGTTTCCCATCTGACCATTCAATGTTCCCGAAGCTA GATCCAGACGCCTTAGAAGAGTCTACGGTAAACGTCACCGTATACGATTT CTTCTCATTAACTTCTTTGAAATTCAAAACCGCCGGTGCAACCGAAATCT TGACTGCTGTACTCTCCGAAATGACTTTAACCTTGTAGGACCCAGCTCCT CCAACGCTCGTGACAGTACGCGTGTACTTATACGCTCCGGATCCGTCAAC GTTAACGGCGAAGGACGGATAGTTTAAATCCGCGACTGAGTGCGTTTTGG 
AGGGATCGCAAGTGAAATTGCGTCTCGAAACGCTTCTGATCTGCGACGAC GTGTAGTTCAACGCGCAGAGGAAACCTAAGTAATCCACCGTCGttAGATC GTAAACGAGTCCTGGATTGGTGCGGACGCGTGGxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx QA 8 633 1 633 DS CHROMAT_FILE: LL2323f PHD_FILE: LL2323f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:49 2000 CO Contig36 742 2 113 U TTTTTTTttTTTTTTTTTTTTATCTATGAAATCGATaaacTATAACCAGA TACTTTGATCATCACTGTTTCATTGGATTTCCAATGAAGATCTCAACAGT GGAGTATACTTATTTAAGAAATAACAGTACAAGAAAAAAAGAGAAGGATA GTTTCATTTCCAAGCCGGAGATGCAAGAAACATGATGAGGATCAGTATCC CCAGTAGTCGTTACGGATTTCGTCATCAAACTTCTTTTTGAGTTCTGGGT GCTTCTCAAAGTACTCATCTGCAGTCATCGTGCTGAGCTTTTTGCTTATC TCCTGGACATCAACAATTTCTTTCTCCAGCCGTTCAGACTCCTTGAGCGA CTTCTGTTCTGCTTCTTTCAGCTCCACCAACAAAGCATCAAACTTAGGCT TGTATTCAGGGGTAACATTGTCCACGTACTTGGGAATCTCAACGCTGTCA TAAGCTTCCTTGTACAAGTCAACAATGCCGGATCCAATACCCTTTCTGTA GAAATCCCAATCAATAGGTTCAGGCTCCTGGCTGAACTTGGTCTGGAGCT GCGTGTTGACCTCGTCGAAAGCGCGACGGAGGTTAGAGAACTCTCTGCGA GCCTCATCGGTGACGAGGACCTTAGCCATCCCTTCCCAATCTATAGTCCT CGACGCTTTGAATGCCACATCCGCCACTTTCTTGCCTGCTCCGCTCATTT TTCTCGATTTCGCCCAAAACACACACTGATGCGGACGCGTGG BQ 32 32 32 32 32 22 23 17 15 23 27 32 26 26 22 22 22 22 22 20 20 21 27 24 24 25 29 25 25 25 25 27 27 27 27 24 19 17 19 16 23 21 32 34 34 34 37 42 47 54 54 62 58 62 55 59 63 62 62 75 66 66 62 66 61 67 60 66 62 65 74 64 55 55 53 53 59 64 61 69 65 77 66 69 66 69 71 62 66 77 77 79 72 78 66 55 58 70 59 66 61 66 64 67 71 71 69 71 69 69 69 69 64 64 64 60 60 60 85 67 64 64 55 50 50 61 61 54 52 52 66 88 88 88 88 90 90 90 90 82 82 90 90 90 90 76 66 62 69 69 69 69 67 75 75 75 69 67 66 66 72 69 69 69 80 74 74 75 76 88 69 77 77 75 81 81 88 88 80 77 80 85 88 90 90 90 88 80 86 86 80 73 71 76 71 76 73 69 72 77 75 88 69 69 69 66 69 69 68 68 69 66 72 75 82 82 89 80 78 71 71 71 71 75 84 90 90 87 78 90 90 90 88 88 90 90 90 90 90 90 90 82 82 81 90 90 81 86 78 76 76 79 90 90 90 90 90 90 82 90 82 78 78 90 90 90 90 90 
84 80 77 77 78 77 89 90 86 81 85 71 71 71 86 86 87 89 90 90 90 90 90 90 87 87 81 81 88 88 90 87 90 90 90 90 90 90 90 90 90 83 87 82 83 86 90 90 90 90 90 90 90 90 90 90 90 81 80 79 76 76 77 83 83 83 90 90 88 83 87 87 87 89 90 88 87 83 81 79 79 86 81 90 87 83 87 88 88 90 87 90 90 90 90 90 90 90 90 90 90 85 85 80 83 88 90 90 90 90 90 90 90 86 86 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 86 90 90 90 90 90 90 87 89 85 90 87 80 80 80 90 90 90 90 90 90 90 90 90 90 90 90 86 83 81 83 83 83 85 86 89 90 90 90 86 88 81 81 83 90 90 85 90 87 87 87 90 89 90 90 90 84 88 90 90 90 86 87 85 85 86 85 86 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 89 90 89 90 90 90 90 90 90 90 88 90 90 90 90 90 86 86 86 83 83 89 90 86 86 86 84 80 80 90 82 82 82 88 90 90 90 90 90 88 90 90 90 90 90 87 82 78 78 78 78 78 78 78 78 83 90 90 90 90 90 83 88 85 85 83 83 83 85 88 83 75 75 71 71 71 78 80 82 82 81 81 83 83 83 85 85 80 85 85 83 83 90 90 85 72 71 71 71 71 71 85 83 83 78 86 78 83 85 85 85 73 70 73 72 77 77 72 76 82 88 76 72 79 67 64 64 65 73 76 67 67 70 74 85 85 90 85 85 83 69 79 74 71 71 71 71 71 79 80 80 74 71 67 67 70 65 64 68 69 77 74 63 64 64 66 72 72 77 83 68 63 62 56 53 53 63 83 90 90 90 90 65 69 60 68 67 75 74 76 63 58 58 64 68 63 72 72 75 75 78 84 71 62 59 59 66 66 64 60 54 56 56 54 58 62 75 72 72 64 69 72 72 83 83 78 79 80 79 74 74 63 61 56 56 76 73 47 44 44 32 33 34 AF LL2325r C 1 AF LL2325f U 35 BS 1 38 LL2325r BS 39 39 LL2325f BS 40 41 LL2325r BS 42 42 LL2325f BS 43 43 LL2325r BS 44 46 LL2325f BS 47 51 LL2325r BS 52 59 LL2325f BS 60 60 LL2325r BS 61 89 LL2325f BS 90 92 LL2325r BS 93 118 LL2325f BS 119 119 LL2325r BS 120 131 LL2325f BS 132 135 LL2325r BS 136 137 LL2325f BS 138 139 LL2325r BS 140 141 LL2325f BS 142 145 LL2325r BS 146 153 LL2325f BS 154 156 LL2325r BS 157 171 LL2325f BS 172 178 LL2325r BS 179 179 LL2325f BS 180 180 LL2325r BS 181 181 LL2325f BS 182 187 LL2325r BS 188 188 LL2325f BS 189 190 LL2325r BS 191 193 LL2325f 
BS 194 194 LL2325r BS 195 208 LL2325f BS 209 217 LL2325r BS 218 227 LL2325f BS 228 237 LL2325r BS 238 244 LL2325f BS 245 246 LL2325r BS 247 248 LL2325f BS 249 258 LL2325r BS 259 261 LL2325f BS 262 270 LL2325r BS 271 272 LL2325f BS 273 275 LL2325r BS 276 276 LL2325f BS 277 277 LL2325r BS 278 279 LL2325f BS 280 311 LL2325r BS 312 315 LL2325f BS 316 325 LL2325r BS 326 330 LL2325f BS 331 344 LL2325r BS 345 345 LL2325f BS 346 349 LL2325r BS 350 350 LL2325f BS 351 367 LL2325r BS 368 369 LL2325f BS 370 383 LL2325r BS 384 388 LL2325f BS 389 409 LL2325r BS 410 411 LL2325f BS 412 413 LL2325r BS 414 415 LL2325f BS 416 423 LL2325r BS 424 424 LL2325f BS 425 449 LL2325r BS 450 454 LL2325f BS 455 460 LL2325r BS 461 469 LL2325f BS 470 470 LL2325r BS 471 471 LL2325f BS 472 478 LL2325r BS 479 479 LL2325f BS 480 480 LL2325r BS 481 481 LL2325f BS 482 492 LL2325r BS 493 494 LL2325f BS 495 495 LL2325r BS 496 499 LL2325f BS 500 500 LL2325r BS 501 501 LL2325f BS 502 502 LL2325r BS 503 504 LL2325f BS 505 506 LL2325r BS 507 509 LL2325f BS 510 510 LL2325r BS 511 513 LL2325f BS 514 515 LL2325r BS 516 518 LL2325f BS 519 520 LL2325r BS 521 521 LL2325f BS 522 528 LL2325r BS 529 529 LL2325f BS 530 537 LL2325r BS 538 548 LL2325f BS 549 561 LL2325r BS 562 562 LL2325f BS 563 568 LL2325r BS 569 569 LL2325f BS 570 570 LL2325r BS 571 572 LL2325f BS 573 591 LL2325r BS 592 593 LL2325f BS 594 594 LL2325r BS 595 600 LL2325f BS 601 621 LL2325r BS 622 623 LL2325f BS 624 640 LL2325r BS 641 650 LL2325f BS 651 700 LL2325r BS 701 707 LL2325f BS 708 726 LL2325r BS 727 730 LL2325f BS 731 742 LL2325r RD LL2325r 786 0 0 TTTTTTTttTTTTTTTTTTTTATCTATGAAATCGATaaacTATAACCAGA TACTTTGATCATCACTGTTTCATTGGATTTCCAATGAAGATCTCAACAGT GGAGTATACTTATTTAAGAAATAACAGTACAAGAAAAAAAGAGAAGGATA GTTTCATTTCCAAGCCGGAGATGCAAGAAACATGATGAGGATCAGTATCC CCAGTAGTCGTTACGGATTTCGTCATCAAACTTCTTTTTGAGTTCTGGGT GCTTCTCAAAGTACTCATCTGCAGTCATCGTGCTGAGCTTTTTGCTTATC TCCTGGACATCAACAATTTCTTTCTCCAGCCGTTCAGACTCCTTGAGCGA 
CTTCTGTTCTGCTTCTTTCAGCTCCACCAACAAAGCATCAAACTTAGGCT TGTATTCAGGGGTAACATTGTCCACGTACTTGGGAATCTCAACGCTGTCA TAAGCTTCCTTGTACAAGTCAACAATGCCGGATCCAATACCCTTTCTGTA GAAATCCCAATCAATAGGTTCAGGCTCCTGGCTGAACTTGGTCTGGAGCT GCGTGTTGACCTCGTCGAAAGCGCGACGGAGGTTAGAGAACTCTCTGCGA GCCTCATCGGTGACGAGGACCTTAGCCATCCCTTCCCAATCTATAGTCCT CGACGCTTTGAATGCCACATCCGCCACTTTCTTGCCTGCTCCGCTCATTT TTCTCGATTTCGCCCAAAACACACACTGATGCGGACGCGTGGxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 1 742 1 742 DS CHROMAT_FILE: LL2325r PHD_FILE: LL2325r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:35 2000 RD LL2325f 766 0 0 agtaacTATAACCAGATACTTTGATCATCACTGTTTCATTGGATTTCCAA TGAAGATCTCAACAGTGGAGTATACTTATTTAAGAAATAACAGTACAAGA AAAAAAGAGAAGGATAGTTTCATTTCCAAGCCGGAGATGCAAGAAACATG ATGAGGATCAGTATCCCCAGTAGTCGTTACGGATTTCGTCATCAAACTTC TTTTTGAGTTCTGGGTGCTTCTCAAAGTACTCATCTGCAGTCATCGTGCT GAGCTTTTTGCTTATCTCCTGGACATCAACAATTTCTTTCTCCAGCCGTT CAGACTCCTTGAGCGACTTCTGTTCTGCTTCTTTCAGCTCCACCAACAAA GCATCAAACTTAGGCTTGTATTCAGGGGTAACATTGTCCACGTACTTGGG AATCTCAACGCTGTCATAAGCTTCCTTGTACAAGTCAACAATGCCGGATC CAATACCCTTTCTGTAGAAATCCCAATCAATAGGTTCAGGCTCCTGGCTG AACTTGGTCTGGAGCTGCGTGTTGACCTCGTCGAAAGCGCGACGGAGGTT AGAGAACTCTCTGCGAGCCTCATCGGTGACGAGGACCTTAGCCATCCCTT CCCAATCTATAGTCCTCGACGCTTTGAATGCCACATCCGCCACTTTCTTG CCTGCTCCGCTCATTTTTCTCGATTTCGCCCAAAACACACACTGATGCGG ACGCGTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxx QA 9 708 4 708 DS CHROMAT_FILE: LL2325f PHD_FILE: LL2325f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:58 2000 CO Contig37 1093 2 34 U tcgaaattctttacaaaataacaaaaacctaaaccttaaaaccaacaatg gacattctctcgagctcgctTCACCTACATTACAAAAACATAAATATTTA CATAAAAAAATAAATCAAACCGGTTTACGCTAAACCTAAAAGAAACAAAA AAAAACCTAATCCTCCGCCTCCACCTCCTCTGTTCGCTTCCTCCGTCAAC CCCTCTGCCTCCTCCTCCATCAAGATGTAGCAATGTCGTCATGGCTACCA AGTCTATCAATTGTTCCATCTTTTAAATCGTACCACGGATTCTAACACCT CGTTTCTTGCTTGGTAGAATGCTCTCATCCTCACGATCCTCAAACGTCCT CCGTAGTTAATCTCCAACAATGAGTCAATAATAGCTTCTTGCTCCATTCT TTCTTGACAAGAGAAGACCAAATCTCGAACCATTGCTAGACCTTTGCTAG 
AATCCACCAATGAGAAGCTTCAACACTCAAAAAATGGTTCAGGAAAAACA CAAACTGATGAATCAAGTTCTCTTTTCACTGCTGAAGACCAGCTCGAGCA GAAGATCTGAATAGAGCTTAGAAGCAATGGACGCCATATGGGAGAGAAGC TCGGGATCTGAGCCGTCGTCCTCTTCCTCTGTCTTCTTCGTTTCCGCTAA TGGGATAGAAGCTCAGGATTCGCTGCCACCGCCTTTGACGCTGCTCCGCC TTGCCACCGGAGAAGAGAGAGAGAACGAGGTGTGACGGAGATGAGGAGAT GCGACGAACCCCTAACAAGAAGTCTCGAATGGAAACAAGTAAATctGAAC CCTATAATCAGTAGAAGAAGAAACAAATTCTCAGAAATCGACCTGTGGAT CAAGGCCTGTTCGAGACCCGTAACGATGACGACGAACTCAGACCGGTGGT GATGGAGACGAACTCATACCGGTGACGATGGCACAAACTCAGACCGGTCT CTTCTTCCTCATGCAATCTCTCATCTTTCTCTGTCTCATCTCTCTTCTCT CTGTCGTGTTTGAGAAGAGAAATGGAGAGGACAAAACACAATCTCCCTCC TTcccctaatgcaggaatcaagattaactttccggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 18 20 29 32 34 34 39 40 40 46 46 46 46 46 46 51 46 46 46 46 46 46 51 46 46 46 46 40 40 40 39 39 39 40 40 40 46 51 51 40 40 40 39 39 39 40 40 40 40 39 40 37 37 37 35 35 29 29 31 31 31 31 34 35 39 40 40 40 40 40 40 51 56 40 40 46 40 40 40 51 51 56 56 46 40 40 40 40 40 45 45 45 45 45 45 35 35 35 35 35 35 35 35 35 35 35 35 39 39 40 45 45 40 37 35 34 34 34 34 34 38 40 40 45 45 40 40 40 45 45 45 45 45 51 51 45 45 51 51 51 45 45 45 45 45 45 51 51 51 51 51 51 56 56 56 56 56 50 42 42 42 45 45 45 46 41 41 37 37 43 43 45 45 43 36 36 36 36 36 43 56 56 56 56 56 56 56 56 56 56 56 56 56 42 42 42 43 46 45 51 51 51 51 51 56 56 56 45 45 45 36 36 36 43 43 43 43 43 43 43 43 45 45 45 45 45 45 45 45 45 45 51 51 45 45 45 45 43 43 43 43 43 43 43 43 43 42 43 43 43 43 43 43 43 43 43 45 45 45 45 45 43 43 43 43 43 43 45 51 51 51 51 43 43 43 43 36 36 38 46 46 51 51 56 56 79 60 52 52 56 56 60 76 70 58 60 59 59 59 75 79 83 72 71 65 66 68 74 75 64 60 60 62 63 70 67 72 72 72 74 70 72 69 68 67 80 85 85 85 85 81 81 81 88 76 71 75 75 75 73 71 85 85 85 81 72 72 72 68 72 69 81 81 83 72 72 72 67 58 59 67 73 90 85 80 66 65 62 70 65 69 63 63 68 61 61 64 64 69 77 77 83 85 85 77 75 72 73 77 80 88 88 88 90 90 85 74 74 70 65 61 
66 72 72 67 72 66 70 70 82 90 90 90 90 86 77 71 60 51 51 66 72 65 67 81 87 82 77 79 79 81 79 90 82 88 83 86 82 82 90 90 90 83 90 90 86 86 76 85 71 79 90 90 90 90 85 90 77 88 87 90 90 83 90 90 86 88 88 88 90 90 84 80 80 80 80 72 74 80 85 88 90 90 90 90 90 90 90 90 90 90 88 90 88 90 85 90 80 74 80 71 72 68 77 80 90 90 90 90 90 72 67 67 67 67 78 86 90 90 90 90 90 90 90 90 90 90 87 90 90 90 90 90 90 90 90 90 90 90 83 75 75 75 81 71 67 68 66 69 66 66 78 90 88 88 88 82 81 85 77 88 68 66 69 88 90 90 79 78 78 63 60 57 64 62 71 63 65 67 80 80 90 90 90 90 85 85 85 74 74 73 77 75 75 83 80 81 80 85 74 77 74 74 67 64 60 57 57 56 60 58 56 56 60 65 82 83 72 72 67 72 72 72 68 74 68 60 56 59 60 65 84 80 77 73 71 76 81 79 71 82 81 81 78 73 78 78 81 81 88 88 90 77 71 59 60 65 73 69 67 67 82 85 75 68 68 63 61 62 71 82 82 88 88 81 73 73 78 82 83 72 72 72 65 63 67 61 53 56 63 79 64 63 64 60 56 41 45 46 56 45 45 45 45 45 45 51 51 51 51 45 45 46 46 42 42 41 43 43 43 43 43 51 46 46 15 15 24 33 33 50 46 46 45 45 45 42 42 38 36 43 43 43 43 43 56 56 51 45 45 45 45 41 42 50 56 56 51 51 51 51 51 45 45 45 45 45 43 45 43 41 41 41 43 43 43 43 43 43 56 56 51 51 51 51 51 51 43 43 43 43 43 43 43 43 40 40 40 43 43 43 43 38 38 41 45 45 45 51 51 45 43 43 43 43 43 43 51 45 45 45 41 50 50 41 41 41 41 41 41 41 45 46 51 45 45 45 43 43 43 43 43 43 51 51 51 51 43 40 40 39 39 39 39 38 38 39 39 39 35 38 38 38 38 38 45 51 56 56 51 45 45 40 37 37 35 35 35 35 35 35 51 51 51 51 45 51 45 45 45 45 35 35 35 35 35 35 56 56 40 40 40 40 40 40 51 51 51 51 45 45 40 40 40 35 35 35 35 35 39 56 51 51 51 51 51 51 51 51 56 56 51 40 35 35 35 35 35 39 51 51 51 51 51 46 51 51 51 51 51 51 39 39 39 39 40 40 51 51 46 40 40 40 40 40 51 51 51 51 51 51 46 46 40 40 48 48 40 40 40 40 24 24 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2326f U -19 AF LL2326r C 350 BS 1 354 LL2326f BS 355 368 LL2326r BS 369 520 LL2326f BS 521 524 LL2326r BS 525 527 LL2326f BS 528 536 LL2326r BS 537 540 LL2326f BS 541 
545 LL2326r BS 546 551 LL2326f BS 552 555 LL2326r BS 556 560 LL2326f BS 561 561 LL2326r BS 562 562 LL2326f BS 563 563 LL2326r BS 564 564 LL2326f BS 565 565 LL2326r BS 566 566 LL2326f BS 567 571 LL2326r BS 572 584 LL2326f BS 585 586 LL2326r BS 587 594 LL2326f BS 595 595 LL2326r BS 596 596 LL2326f BS 597 603 LL2326r BS 604 605 LL2326f BS 606 607 LL2326r BS 608 613 LL2326f BS 614 629 LL2326r BS 630 630 LL2326f BS 631 633 LL2326r BS 634 634 LL2326f BS 635 636 LL2326r BS 637 639 LL2326f BS 640 1093 LL2326r RD LL2326f 790 0 0 nctttaggggtacgatatantcgaaattctttacaaaataacaaaaacct aaaccttaaaaccaacaatggacattctctcgagctcgctTCACCTACAT TACAAAAACATAAATATTTACATAAAAAAATAAATCAAACCGGTTTACGC TAAACCTAAAAGAAACAAAAAAAAACCTAATCCTCCGCCTCCACCTCCTC TGTTCGCTTCCTCCGTCAACCCCTCTGCCTCCTCCTCCATCAAGATGTAG CAATGTCGTCATGGCTACCAAGTCTATCAATTGTTCCATCTTTTAAATCG TACCACGGATTCTAACACCTCGTTTCTTGCTTGGTAGAATGCTCTCATCC TCACGATCCTCAAACGTCCTCCGTAGTTAATCTCCAACAATGAGTCAATA ATAGCTTCTTGCTCCATTCTTTCTTGACAAGAGAAGACCAAATCTCGAAC CATTGCTAGACCTTTGCTAGAATCCACCAATGAGAAGCTTCAACACTCAA AAAATGGTTCAGGAAAAACACAAACTGATGAATCAAGTTCTCTTTTCACT GCTGAAGACCAGCTCGAGCAGAAGATCTGAATAGAGCTTAGAAGCAATGG ACGCCATATGGGAGAGAAGCTCGGGATCTGAGCCGTCGTCCTCTTCCTCT GTCTTCTTCGTTTCCGCTAATGGGATAGAAGCTCAGGATTCGCTGCCACC GCCTTTGACGCTGCTCCGCCTTGCCACCGGAGAAGAGAGAGAGAACGAGG TGTGACGGAGATGAGGAGATGCGACGAACCCCTAACAAga QA 81 785 21 790 DS CHROMAT_FILE: LL2326f PHD_FILE: LL2326f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:03 2000 RD LL2326r 789 0 0 tcCGTAGTTAATCTCCAACAATGAGTCAATAATAGCTTCTTGCTCCATTC TTTCTTGACAAGAGAAGACCAAATCTCGAACCATTGCTAGACCTTTGCTA GAATCCACCAATGAGAAGCTTCAACACTCAAAAAATGGTTCAGGAAAAAC ACAAACTGATGAATCAAGTTCTCTTTTCACTGCTGAAGACCAGCTCGAGC AGAAGATCTGAATAGAGCTTAGAAGCAATGGACGCCATATGGGAGAGAAG CTCGGGATCTGAGCCGTCGTCCTCTTCCTCTGTCTTCTTCGTTTCCGCTA ATGGGATAGAAGCTCAGGATTCGCTGCCACCGCCTTTGACGCTGCTCCGC CTTGCCACCGGAGAAGAGAGAGAGAACGAGGTGTGACGGAGATGAGGAGA TGCGACGAACCCCTAACAAGAAGTCTCGAATGGAAACAAGTAAATctGAA CCCTATAATCAGTAGAAGAAGAAACAAATTCTCAGAAATCGACCTGTGGA 
TCAAGGCCTGTTCGAGACCCGTAACGATGACGACGAACTCAGACCGGTGG TGATGGAGACGAACTCATACCGGTGACGATGGCACAAACTCAGACCGGTC TCTTCTTCCTCATGCAATCTCTCATCTTTCTCTGTCTCATCTCTCTTCTC TCTGTCGTGTTTGAGAAGAGAAATGGAGAGGACAAAACACAATCTCCCTC CTTcccctaatgcaggaatcaagattaactttccggacgcgtggxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaagt QA 20 709 1 744 DS CHROMAT_FILE: LL2326r PHD_FILE: LL2326r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:40 2000 CO Contig38 894 2 66 U cttttgacaaaccaaacataactctacaattatatgtactaaactgagta caacattagaaatgaatgtctcatgctataGATtTGGAATCAAGTTTGTT TATGGTTTATAAAAGAGAAAATAGCCTTTAAAGTGGCGAAACACAAACAA ACAATGTTTCGAGAAATGAGACGATTAATTTATGAAAAGTGGACAGCTGC AGAGTCTCTGCGACACAACCTATGTAGTGAGTTCCATGATGGCGGAAACA ATATCTCCATCATTGGCTTTGAGAGCCTTAGTGGCTTTGGCCTTCGAAAC ACCAGCCTGAGTCATCACGAGCTCAACATCCTTGGCTTCAACACCAGTCT CATCCACGTCATCATCTTCGTACTCCTCTTGTGCAACCGCGGCTGCTTCA GAAGCATCGGAGTTAGGGATCATAGAGGCAACGTCCGGCATCTTGAACCT CTGAGCAGCTTGAGCTTGTAGCTGAGAGCTCATATCATCAATCTTGGCCT CACCGAATATGACATAGGTCTCAGAGTTGGGACTCTTGAAGACATCCGGC TTCGAGATGACAAACAAAACATTCTTTGATCTCTTGATAGTCACTCTGCT AACATCAGTGACAGGTTTCATTCCAAGTTTCAGCATAGCTTTGCGGCTTT TCTTTTCGCTTCTGCTTTGTTTAGAGCTCTCATTGTCACCAGCTCCATCG ACGTTGTCATCGTCATCGTCATCGACGTCTTCGTCTCCGTCTTTCACATC CTCAACGACAACATCATCTTCCTTCTCGAGCTTCATTTGCTCTTTGATGG CTTCCTCAATCTTAACTTCTTCAGCAACAGGGCCTGGCATCTTCTTCTTC Aaccccctatcttcttccaacaaacccttctagcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 33 24 24 16 22 26 39 40 40 37 37 40 37 37 56 40 37 37 35 35 35 35 35 35 40 35 35 35 35 35 35 46 46 51 56 40 40 40 40 37 32 35 35 35 35 35 35 35 40 35 32 32 44 32 30 35 35 35 35 35 37 40 40 40 39 42 42 42 42 42 42 37 50 50 50 52 56 55 52 56 58 64 60 71 63 64 66 65 62 53 53 53 59 53 49 57 57 57 57 65 61 64 64 63 56 59 63 61 62 60 50 53 52 56 58 53 48 44 46 50 60 59 69 73 61 59 52 52 52 56 56 51 51 51 53 55 51 50 50 50 62 69 71 78 73 65 71 71 71 71 71 66 68 69 66 90 73 71 62 57 56 
56 56 56 56 53 57 57 57 61 62 85 74 72 66 65 59 69 70 78 78 71 60 60 57 57 63 65 64 60 67 69 74 59 58 56 56 57 62 62 79 71 64 61 61 64 58 58 59 54 50 48 50 50 56 62 54 51 56 67 67 60 69 64 64 53 54 66 71 73 76 78 76 84 82 78 73 71 73 73 75 75 72 82 78 81 81 81 64 59 59 65 67 66 70 61 68 76 70 72 70 70 72 68 69 70 88 72 75 82 86 88 90 90 78 79 70 70 67 70 88 90 87 90 90 90 90 82 82 81 72 72 82 80 78 90 86 87 82 82 76 79 72 73 72 69 69 69 62 59 65 56 66 69 70 82 76 90 90 90 89 74 85 72 81 79 88 90 90 82 82 86 81 82 83 88 87 90 90 90 90 90 90 90 88 78 78 78 78 75 75 81 86 86 86 86 81 81 81 87 90 90 90 87 87 82 83 81 88 84 90 87 89 79 81 81 80 80 86 86 90 90 90 90 90 90 81 75 85 70 74 74 85 90 90 90 90 90 86 86 86 86 90 90 90 90 90 90 90 90 88 90 90 90 85 80 78 81 85 85 82 82 82 90 90 90 90 90 90 90 90 90 90 90 90 85 85 85 86 84 83 90 89 85 85 80 80 80 90 90 90 88 88 88 82 85 87 87 87 80 90 79 84 90 81 88 90 86 86 83 86 86 88 90 90 90 83 88 90 90 90 90 90 86 86 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 85 85 85 80 87 90 90 77 76 74 74 74 70 72 78 90 82 82 87 85 85 71 80 83 85 85 90 85 90 90 90 90 90 90 90 90 85 85 83 83 83 83 83 76 76 76 83 90 83 83 72 68 58 73 63 61 61 64 71 82 90 90 90 90 87 85 68 65 70 70 68 65 65 75 80 77 81 75 63 68 67 59 59 67 67 69 68 75 75 74 80 74 77 77 72 87 65 71 71 76 72 74 70 64 60 56 65 65 58 62 64 67 65 62 60 58 58 66 66 72 72 72 72 73 68 64 64 72 75 63 59 60 68 80 74 63 67 67 63 60 63 60 57 60 63 59 52 50 50 50 54 56 60 59 78 71 77 79 76 76 72 66 66 69 69 54 54 58 62 54 54 60 66 60 64 65 65 71 78 81 85 77 72 72 76 70 66 63 60 57 59 61 54 50 55 66 66 70 55 55 40 46 46 46 51 51 51 40 40 40 40 40 40 40 40 40 40 40 40 40 40 45 34 34 32 32 35 35 42 46 46 46 39 33 33 33 33 33 34 51 51 40 40 40 40 40 39 39 39 39 39 40 51 56 56 56 44 46 33 33 30 24 24 14 19 16 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2327f U 1 AF LL2327r C 144 BS 1 194 LL2327f BS 195 197 LL2327r BS 198 235 
LL2327f BS 236 236 LL2327r BS 237 257 LL2327f BS 258 258 LL2327r BS 259 282 LL2327f BS 283 283 LL2327r BS 284 315 LL2327f BS 316 316 LL2327r BS 317 323 LL2327f BS 324 325 LL2327r BS 326 337 LL2327f BS 338 338 LL2327r BS 339 350 LL2327f BS 351 355 LL2327r BS 356 356 LL2327f BS 357 364 LL2327r BS 365 365 LL2327f BS 366 366 LL2327r BS 367 370 LL2327f BS 371 376 LL2327r BS 377 377 LL2327f BS 378 378 LL2327r BS 379 390 LL2327f BS 391 402 LL2327r BS 403 406 LL2327f BS 407 409 LL2327r BS 410 411 LL2327f BS 412 412 LL2327r BS 413 415 LL2327f BS 416 425 LL2327r BS 426 432 LL2327f BS 433 436 LL2327r BS 437 439 LL2327f BS 440 445 LL2327r BS 446 448 LL2327f BS 449 450 LL2327r BS 451 451 LL2327f BS 452 452 LL2327r BS 453 458 LL2327f BS 459 466 LL2327r BS 467 469 LL2327f BS 470 507 LL2327r BS 508 509 LL2327f BS 510 519 LL2327r BS 520 521 LL2327f BS 522 562 LL2327r BS 563 565 LL2327f BS 566 566 LL2327r BS 567 570 LL2327f BS 571 577 LL2327r BS 578 582 LL2327f BS 583 604 LL2327r BS 605 607 LL2327f BS 608 610 LL2327r BS 611 612 LL2327f BS 613 614 LL2327r BS 615 615 LL2327f BS 616 635 LL2327r BS 636 636 LL2327f BS 637 641 LL2327r BS 642 642 LL2327f BS 643 683 LL2327r BS 684 684 LL2327f BS 685 894 LL2327r RD LL2327f 792 0 0 cttttgacaaaccaaacataactctacaattatatgtactaaactgagta caacattagaaatgaatgtctcatgctataGATtTGGAATCAAGTTTGTT TATGGTTTATAAAAGAGAAAATAGCCTTTAAAGTGGCGAAACACAAACAA ACAATGTTTCGAGAAATGAGACGATTAATTTATGAAAAGTGGACAGCTGC AGAGTCTCTGCGACACAACCTATGTAGTGAGTTCCATGATGGCGGAAACA ATATCTCCATCATTGGCTTTGAGAGCCTTAGTGGCTTTGGCCTTCGAAAC ACCAGCCTGAGTCATCACGAGCTCAACATCCTTGGCTTCAACACCAGTCT CATCCACGTCATCATCTTCGTACTCCTCTTGTGCAACCGCGGCTGCTTCA GAAGCATCGGAGTTAGGGATCATAGAGGCAACGTCCGGCATCTTGAACCT CTGAGCAGCTTGAGCTTGTAGCTGAGAGCTCATATCATCAATCTTGGCCT CACCGAATATGACATAGGTCTCAGAGTTGGGACTCTTGAAGACATCCGGC TTCGAGATGACAAACAAAACATTCTTTGATCTCTTGATAGTCACTCTGCT AACATCAGTGACAGGTTTCATTCCAAGTTTCAGCATAGCTTTGCGGCTTT TCTTTTCGCTTCTGCTTTGTTTAGAGCTCTCATTGTCACCAGCTCCATCG ACGTTGTCATCGTCATCGTCATCGACGTCTTCGTCTCCGTCTTTCACATC 
CTCAACGACAACATCATCTTCCTTCTCGAGCTTCATTTGctt QA 81 788 1 791 DS CHROMAT_FILE: LL2327f PHD_FILE: LL2327f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:08 2000 RD LL2327r 795 0 0 caAACAAACAATGTTTCGAGAAATGAGACGATTAATTTATGAAAAGTGGA CAGCTGCAGAGTCTCTGCGACACAACCTATGTAGTGAGTTCCATGATGGC GGAAACAATATCTCCATCATTGGCTTTGAGAGCCTTAGTGGCTTTGGCCT TCGAAACACCAGCCTGAGTCATCACGAGCTCAACATCCTTGGCTTCAACA CCAGTCTCATCCACGTCATCATCTTCGTACTCCTCTTGTGCAACCGCGGC TGCTTCAGAAGCATCGGAGTTAGGGATCATAGAGGCAACGTCCGGCATCT TGAACCTCTGAGCAGCTTGAGCTTGTAGCTGAGAGCTCATATCATCAATC TTGGCCTCACCGAATATGACATAGGTCTCAGAGTTGGGACTCTTGAAGAC ATCCGGCTTCGAGATGACAAACAAAACATTCTTTGATCTCTTGATAGTCA CTCTGCTAACATCAGTGACAGGTTTCATTCCAAGTTTCAGCATAGCTTTG CGGCTTTTCTTTTCGCTTCTGCTTTGTTTAGAGCTCTCATTGTCACCAGC TCCATCGACGTTGTCATCGTCATCGTCATCGACGTCTTCGTCTCCGTCTT TCACATCCTCAACGACAACATCATCTTCCTTCTCGAGCTTCATTTGCTCT TTGATGGCTTCCTCAATCTTAACTTCTTCAGCAACAGGGCCTGGCATCTT CTTCTTCAaccccctatcttcttccaacaaacccttctagcggacgcgtg gxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaag QA 10 715 1 751 DS CHROMAT_FILE: LL2327r PHD_FILE: LL2327r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:44 2000 CO Contig39 566 2 76 U tTTTTTTTTtttaacCataacAGTTTCCTTGATTACATTAACTTCATGAG GTTTAGTATTACACAC*ACTGGTCTGAAACAAAGCTATCTCAGCCTTATT TATTTCATATAATTTAGCAGAAACATTTAGATTATTACCCTTTAAACAAG TGAAATGTTCAAGAAGAAGATTCTTTAAACTCTTGTAACTGCTTGAAGTT CATCCATGGCTTCACCCAAACTTTGGCTTCGAAGTTCTTAGCCTGGCCAC CTTCGTTTGCTTCAAGAGTTAAGTAGTACATGGTTCCAGCAACCACCTGT TCTCTTGCCTTTACAATCTTCTTGAACTCAAGAACCTTGTTCTCTCGGTT GTTATGTTCTTGAATAGCGAATCGAGCGAGACTCTCGATCTCTCCACTGT TTTGGTTTCCTCGGAGATCATGAACGCCTCCTAAAATCATCGTCTTCTCC GTGGTTTTCTCTATTGATCTGCAACTTCTTGGTTGAATTGTTCGACAGAG AAGGAGGGTTACGATCAAGAACGTAACAGATACGAAAGCTTTTGATTCCA TTTTCCGGACGCGTGG BQ 0 27 27 48 40 40 40 40 25 17 8 9 9 11 11 22 17 19 10 10 10 24 24 29 25 25 25 22 22 22 25 27 22 22 25 24 27 27 24 21 28 43 46 57 70 87 79 79 75 80 66 66 64 65 65 67 64 58 49 49 51 48 42 40 40 40 40 42 47 48 72 66 68 70 78 69 65 71 67 69 66 74 72 72 72 80 80 80 86 86 65 60 57 
59 54 56 60 60 71 72 79 85 79 90 90 90 77 77 80 77 75 74 68 68 66 68 68 80 82 88 82 88 79 88 86 82 84 79 85 80 80 80 80 86 80 84 80 80 90 72 72 75 81 81 86 82 86 85 85 85 85 90 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 86 90 86 80 90 90 90 90 90 90 90 77 77 76 90 90 89 89 89 84 90 90 90 90 90 90 90 90 90 90 90 88 80 80 80 80 86 90 90 90 90 90 90 90 80 80 66 61 64 74 70 88 90 90 90 90 88 90 88 88 83 90 86 86 88 88 88 90 90 90 90 90 90 90 90 90 88 90 90 90 90 86 86 90 90 90 90 90 90 90 90 90 86 86 86 86 86 86 86 86 86 86 88 90 90 90 90 90 90 90 90 90 90 90 90 87 90 90 90 90 90 86 86 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 90 88 88 88 88 88 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 81 75 75 75 75 75 85 90 85 82 82 82 82 82 78 78 80 86 80 78 82 83 83 83 88 90 90 90 90 90 90 90 83 83 83 90 90 90 90 90 90 83 83 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 82 78 78 78 78 78 83 88 88 90 90 90 90 90 90 90 86 86 86 90 90 90 90 90 90 90 85 84 80 85 85 90 90 90 90 90 90 90 90 90 89 84 90 82 82 90 81 81 75 75 68 69 83 87 79 73 88 88 72 69 69 69 72 72 80 90 90 79 75 69 75 75 75 85 90 90 90 90 90 90 90 90 90 90 90 86 86 75 72 71 74 74 74 72 68 67 67 64 67 67 67 67 61 71 70 77 77 80 77 74 77 77 77 87 86 86 86 86 80 84 84 72 72 67 71 61 61 46 40 39 39 39 39 39 51 66 66 61 47 47 47 32 33 46 AF LL2329r C -169 AF LL2329f U 29 BS 1 37 LL2329r BS 38 41 LL2329f BS 42 43 LL2329r BS 44 44 LL2329f BS 45 48 LL2329r BS 49 63 LL2329f BS 64 69 LL2329r BS 70 96 LL2329f BS 97 99 LL2329r BS 100 100 LL2329f BS 101 107 LL2329r BS 108 119 LL2329f BS 120 122 LL2329r BS 123 123 LL2329f BS 124 139 LL2329r BS 140 146 LL2329f BS 147 152 LL2329r BS 153 153 LL2329f BS 154 155 LL2329r BS 156 156 LL2329f BS 157 183 LL2329r BS 184 189 LL2329f BS 190 195 LL2329r BS 196 197 LL2329f BS 198 203 LL2329r BS 204 206 LL2329f BS 207 232 LL2329r BS 233 233 LL2329f BS 234 235 LL2329r BS 236 236 
LL2329f BS 237 237 LL2329r BS 238 248 LL2329f BS 249 251 LL2329r BS 252 258 LL2329f BS 259 263 LL2329r BS 264 278 LL2329f BS 279 280 LL2329r BS 281 292 LL2329f BS 293 296 LL2329r BS 297 302 LL2329f BS 303 303 LL2329r BS 304 304 LL2329f BS 305 316 LL2329r BS 317 324 LL2329f BS 325 330 LL2329r BS 331 331 LL2329f BS 332 336 LL2329r BS 337 350 LL2329f BS 351 356 LL2329r BS 357 370 LL2329f BS 371 388 LL2329r BS 389 389 LL2329f BS 390 396 LL2329r BS 397 405 LL2329f BS 406 407 LL2329r BS 408 408 LL2329f BS 409 409 LL2329r BS 410 422 LL2329f BS 423 429 LL2329r BS 430 472 LL2329f BS 473 478 LL2329r BS 479 480 LL2329f BS 481 486 LL2329r BS 487 487 LL2329f BS 488 489 LL2329r BS 490 491 LL2329f BS 492 492 LL2329r BS 493 495 LL2329f BS 496 496 LL2329r BS 497 507 LL2329f BS 508 524 LL2329r BS 525 532 LL2329f BS 533 540 LL2329r BS 541 543 LL2329f BS 544 564 LL2329r BS 565 566 LL2329f RD LL2329r 780 0 0 ttggtancgccagggttttxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxcttttttttttttttt ttttnttttttttnnttnnntTTTTTTTTtttaacCataacAGTTTCCTT GATTACAttaaCTTCATGAGGTTTAGTATTACACAC*ACTGGTCTGAAAC AAAGCTATCTCAGCCTTATTTATTTCATATAATTTAGCAGAAACATTTAG ATTATTACCCTTTAAACAAGTGAAATGTTCAAGAAGAAGATTCTTTAAAC TCTTGTAACTGCTTGAAGTTCATCCATGGCTTCACCCAAACTTTGGCTTC GAAGTTCTTAGCCTGGCCACCTTCGTTTGCTTCAAGAGTTAAGTAGTACA TGGTTCCAGCAACCACCTGTTCTCTTGCCTTTACAATCTTCTTGAACTCA AGAACCTTGTTCTCTCGGTTGTTATGTTCTTGAATAGCGAATCGAGCGAG ACTCTCGATCTCTCCACTGTTTTGGTTTCCTCGGAGATCATGAACGCCTC CTAAAATCATCGTCTTCTCCGTGGTTTTCTCTATTGATCTGCAACTTCTT GGTTGAATTGTTCGACAGAGAAGGAGGGTTACGATCAAGAACGTAACAGA TACGAAAGCTTTTGATTCCATTTTCCGGACGCGTGGxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 172 736 171 736 DS CHROMAT_FILE: LL2329r PHD_FILE: LL2329r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:17 2000 RD LL2329f 759 0 2 ttgat*aCATTAACTTCATGAGGTTTAGTATTACAcaccaCTGGTCTGAA ACAAAGCTATCTCAGCCTTATTTATTTCATATAATTTAGCAGAAACATTT AGATTATTACCCTTTAAACAAGTGAAATGTTCAAGAAGAAGATTCTTTAA 
ACTCTTGTAACTGCTTGAAGTTCATCCATGGCTTCACCCAAACTTTGGCT TCGAAGTTCTTAGCCTGGCCACCTTCGTTTGCTTCAAGAGTTAAGTAGTA CATGGTTCCAGCAACCACCTGTTCTCTTGCCTTTACAATCTTCTTGAACT CAAGAACCTTGTTCTCTCGGTTGTTATGTTCTTGAATAGCGAATCGAGCG AGACTCTCGATCTCTCCACTGTTTTGGTTTCCTCGGAGATCATGAACGCC TCCTAAAATCATCGTCTTCTCCGTGGTTTTCTCTATTGATCTGCAACTTC TTGGTTGAATTGTTCGACAGAGAAGGAGGGTTACGATCAAGAACGTAACA GATACGAAAGCTTTTGATTCCATttctcgGACGCGTGGxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xgcctggngtgcctaatgagtgagctaactcacattaattgcgttgcgct cactgcccg QA 14 538 1 538 DS CHROMAT_FILE: LL2329f PHD_FILE: LL2329f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:39 2000 RT{ LL2329f chimera phrap 702 759 000919:094547 } RT{ LL2329f matchElsewhereHighQual phrap 709 759 000919:094547 } CO Contig40 338 2 1 U CCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGAAA AAGAGGTTCGATTTCTGATGAATCATACAAAGATACGACAAAGGAAATTG CACAAATTTACATAGGAAAGACCACAAACAAACTTCTCTTTCTCTGGCTC TCACTCACATAAGCTTCACTTTCCGGGGACGAAGTTGGTGGCGAAGGCCC ATGCGTTGTTGTTGACTGGATCAGCCAAATGGTCGGCGAGATTCTCCAAC GGTCCCTTTCCAGTGACAATAGCCTGGACAAAGAATCCAAACATAGagcc caTAGCCAACCTTCCGTTCTTGATCTCCGGACGCGTGG BQ 20 27 27 50 50 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 51 43 43 58 58 58 77 79 79 79 90 86 90 77 77 82 82 82 85 85 85 86 86 86 86 90 86 80 80 80 74 72 72 72 78 78 80 90 90 90 90 90 90 90 90 90 90 90 88 90 90 90 90 90 90 90 90 90 90 90 86 85 85 90 90 90 60 66 66 66 51 51 51 51 56 56 56 71 71 71 71 71 71 90 90 90 90 90 90 90 90 90 90 85 85 85 85 90 90 90 90 90 90 90 90 85 85 83 83 80 80 90 85 90 90 90 90 90 90 85 85 85 85 80 79 75 71 71 69 68 68 74 87 79 90 90 72 70 67 67 67 67 71 75 82 82 85 82 77 70 70 70 70 70 80 90 83 86 86 85 85 75 86 86 86 86 77 82 76 74 74 74 74 74 77 90 90 90 90 75 75 69 69 69 69 69 69 74 74 84 90 90 88 90 88 75 75 74 74 74 74 80 86 90 
90 90 90 82 82 81 83 77 77 79 77 75 73 73 70 70 73 74 79 76 76 77 88 88 88 90 90 90 90 88 88 88 90 90 88 88 55 49 47 40 19 11 9 9 14 16 31 44 71 79 72 77 84 86 89 89 90 84 84 80 80 80 80 80 85 90 85 79 79 79 79 79 83 76 76 76 49 49 49 35 35 35 AF LL2331r C -410 AF LL2331f U 42 BS 1 338 LL2331r RD LL2331r 793 0 2 ccctaaagggagcccccgatttagagcttgacggggaaagccggcgaacg tggcgagaaaggaagggaagaaagcgaaaggagcgggcgctagggcgctg gcaagtgtagcggtcacgctgcgcgtaaccaccacacccgccgcgcttaa tgcgccgctacagggcgcgtcccattcgccattcaggctgcgcaactgtt gggaagggcgatcggtgcgggcctcttcgctattacgccagctggcgaaa gggggatgtgctgcaaggcgattaagttgggtaacgccagggttttxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTGAGAAAAAGAGGTTCGATTTCTGATGAATCATACAAAGATACGAC AAAGGAAATTGCACAAATTTACATAGGAAAGACCACAAACAAACTTCTCT TTCTCTGGCTCTCACTCACATAAGCTTCACTTTCCGGGGACGAAGTTGGT GGCGAAGGCCCATGCGTTGTTGTTGACTGGATCAGCCAAATGGTCGGCGA GATTCTCCAACGGTCCCTTTCCAGTGACAATAGCCTGGACAAAGAATCCA AACATAGagcccaTAGCCAACCTTCCGTTCTTGATCTCCGGACGCGTGGx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccagaat QA 142 749 412 749 DS CHROMAT_FILE: LL2331r PHD_FILE: LL2331r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:26 2000 RT{ LL2331r chimera phrap 1 296 000919:094547 } RT{ LL2331r matchElsewhereHighQual phrap 1 296 000919:094547 } RD LL2331f 788 0 3 cttttagaaaacagGTTCGATTTCTGATGAATCATACAAAGATACGACAA AGGAAATTGCACAAATTTACATAGGAAAGACcaggaaCAAACTTCTCTTT CTCTGGCTCTCACTCACATAAGCTTCACTTTCCGGGGACGAAGTTGGTGG CGAAGGCCCATGCGTTGTTGTTGACTGGATCAGCCAAATGGTCGGCGAGA TTCTCCAACGGTCCCTTTCCAGTGACAATAGCCTGGACAAAGAATCCAAA CATAGAGAACATAGCCAACCTTCCGTTCTTGATCTCCGGACGCGTGGxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxgcctggggtgcctaatgagtgagctaactcacattaattg cgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctg cattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcg 
ccagggtggtttttcttttcaccagtgagacgggcaacagctgattgccc ttcaccgcctggccctgagagagttgcagcaagcggtccacgctggtttg ccccgcaggcgaaaatcctgtttgatggtggttaacggcgggatataaca tgagctgtcttcggtatcgtcgtatcccactaccgaga QA 14 785 8 297 DS CHROMAT_FILE: LL2331f PHD_FILE: LL2331f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:49 2000 RT{ LL2331f chimera phrap 461 788 000919:094547 } RT{ LL2331f matchElsewhereHighQual phrap 705 788 000919:094547 } RT{ LL2331f matchElsewhereHighQual phrap 461 703 000919:094547 } CO Contig41 830 2 64 U ccacgcgtccgcggacgcgtgggttttttttttttTTTTTTTTTTTTATT TTTATTTTTTAGAAATTGAATTATTAAATTTAGTTTTGCTATAATCTTTA TATTTTCTGGATAATTAAATTCATCAATAAACTCTCATTTTTTAGTTTTG CTATAATCTTGATTATGAAAATAGCTCATACACACGACTCATGATCACAC AACTCACTCAGCTCTGGATCAGTCTGCCTGAACCGTGATTGGCTTTCCTC TTCCCATCGAGAATCCTCTGGTCCCATCAGGCATCCTTGGTCCCGGTGGT TGCTGTTTTGGCTACTCCATGCTGCTCATTGGATGGTTTGAAGGTGGTGT TCCAACTTGATGTTGATGGTGATGATTATGATGATTCCCACGCTACGCCC TTTTCTCTGCCCACCCGCTTTCTCATTCCCTTCATCCCTTGCTCCTCTCC TACTATCACTGCCTGCTCAGGCACATTTGTATCCCATTCTCCAGAGAACT CCTCAGCTTGTTTCTCAATTGGCTGTTGCTCAGATGTCGTGGCATCATCT TCTTCATGTTCTACTTCTACATCATGTCCTTTTCTTCCTCGCCCTTGAAC CTGTTTTGGTTCCTTTGTCTTGGTGCCTTGATCGAAACCTTGCAGAGGCA AGTGTAATTATCTCTTTTCTGAAGATTTTCAGGGAAACCACCAAAGATCC TCTCAATACAAGTGTGAATGTCTCTTTTCATAGACTTATTCCCAAATTGG AAGAaACCTAAATTcCCAAATCagaaaatCAAAATCCCTAAATTGAGATA ATACCCACGaAaTCaaACATGATGAagaat BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 37 30 30 30 30 33 44 48 56 40 40 37 37 40 40 40 40 40 40 40 40 56 40 39 39 29 29 29 46 46 40 40 40 40 35 35 35 35 35 35 35 35 45 49 45 45 45 45 45 53 45 45 45 45 45 45 45 45 50 50 52 50 50 49 55 50 50 52 52 66 56 61 61 53 53 53 50 55 50 50 50 50 50 50 61 61 61 61 61 66 50 40 40 40 35 35 37 47 47 50 56 55 61 66 66 61 56 56 52 52 52 54 45 43 44 44 44 44 45 53 50 50 66 50 50 50 50 50 53 53 53 53 66 66 61 55 55 53 53 47 51 51 50 56 56 56 61 49 44 44 50 50 53 51 51 51 66 66 66 66 60 52 56 56 61 49 48 53 53 53 48 49 49 49 48 50 50 48 48 48 66 66 66 61 56 53 52 
52 51 53 51 51 51 53 53 50 53 49 49 45 50 50 61 66 53 48 48 48 48 53 53 51 51 51 51 51 55 55 61 61 55 53 51 51 51 51 52 56 56 56 54 53 53 53 53 53 66 66 66 66 66 66 66 66 61 61 61 61 61 61 61 61 66 66 66 66 55 55 55 56 51 51 66 66 53 46 46 46 46 46 53 66 66 61 53 46 46 46 46 46 46 53 55 66 61 61 53 53 50 53 50 53 53 66 66 66 66 66 66 66 66 66 61 61 61 61 61 61 66 66 66 66 60 52 52 52 66 66 53 53 53 53 60 66 61 66 66 66 66 66 66 61 61 61 61 61 61 61 66 61 61 61 61 61 61 66 66 66 61 61 61 61 55 50 50 48 48 48 48 48 48 53 53 54 56 66 66 66 61 61 61 53 53 53 53 53 53 53 51 53 53 46 46 46 53 55 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 61 55 55 55 55 55 55 61 66 66 66 53 53 53 53 53 53 55 53 53 53 53 53 53 53 55 53 53 53 55 53 61 61 61 61 61 61 61 66 66 66 66 66 66 66 66 66 66 66 66 66 61 56 56 52 52 52 53 50 53 53 53 53 53 61 66 55 55 55 55 55 55 61 61 61 61 61 61 61 61 61 61 55 55 55 55 55 53 48 53 53 53 53 47 52 66 56 55 55 55 50 50 50 56 52 61 53 61 61 61 66 66 66 66 61 61 61 61 61 61 66 50 52 55 50 50 50 55 50 52 47 52 52 56 61 61 61 61 61 61 61 61 66 66 66 61 61 61 61 61 49 45 45 45 45 49 66 66 50 50 50 50 50 50 54 61 61 61 61 61 47 47 47 50 50 50 66 66 66 66 66 66 66 66 50 45 45 45 45 50 66 66 61 50 50 50 50 50 50 61 61 61 61 56 53 52 52 45 47 45 47 45 56 57 53 52 52 52 50 47 47 47 47 47 50 50 50 50 50 50 50 50 44 39 39 44 50 50 50 54 54 56 56 56 56 56 56 50 50 50 47 47 52 56 56 56 56 56 50 50 50 50 50 50 50 44 39 44 58 58 50 50 50 50 50 37 39 39 45 42 43 47 46 46 50 50 50 44 35 35 39 39 42 42 42 42 39 41 41 45 45 44 47 36 40 44 40 40 40 34 29 29 34 40 37 25 18 23 27 27 23 22 22 27 27 23 19 20 25 34 34 34 32 32 15 9 9 9 9 9 16 25 25 32 32 31 29 25 29 29 29 34 29 34 29 29 26 28 26 25 29 24 24 29 32 29 32 20 20 20 22 19 25 19 25 21 17 17 25 29 32 32 40 36 28 27 25 17 0 0 0 0 AF LL2336r U -44 AF LL2336f U 67 BS 1 92 LL2336r BS 93 95 LL2336f BS 96 96 LL2336r BS 97 98 LL2336f BS 99 99 LL2336r BS 100 105 LL2336f BS 106 106 LL2336r BS 107 108 LL2336f BS 109 135 
LL2336r BS 136 136 LL2336f BS 137 147 LL2336r BS 148 153 LL2336f BS 154 155 LL2336r BS 156 161 LL2336f BS 162 178 LL2336r BS 179 185 LL2336f BS 186 197 LL2336r BS 198 201 LL2336f BS 202 206 LL2336r BS 207 209 LL2336f BS 210 230 LL2336r BS 231 231 LL2336f BS 232 232 LL2336r BS 233 239 LL2336f BS 240 266 LL2336r BS 267 271 LL2336f BS 272 279 LL2336r BS 280 294 LL2336f BS 295 295 LL2336r BS 296 308 LL2336f BS 309 309 LL2336r BS 310 317 LL2336f BS 318 327 LL2336r BS 328 328 LL2336f BS 329 329 LL2336r BS 330 351 LL2336f BS 352 353 LL2336r BS 354 357 LL2336f BS 358 359 LL2336r BS 360 360 LL2336f BS 361 366 LL2336r BS 367 373 LL2336f BS 374 374 LL2336r BS 375 387 LL2336f BS 388 400 LL2336r BS 401 401 LL2336f BS 402 403 LL2336r BS 404 413 LL2336f BS 414 416 LL2336r BS 417 419 LL2336f BS 420 436 LL2336r BS 437 463 LL2336f BS 464 464 LL2336r BS 465 467 LL2336f BS 468 468 LL2336r BS 469 543 LL2336f BS 544 544 LL2336r BS 545 560 LL2336f BS 561 562 LL2336r BS 563 567 LL2336f BS 568 568 LL2336r BS 569 569 LL2336f BS 570 571 LL2336r BS 572 830 LL2336f RD LL2336r 790 0 0 actxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccgcggacgcgtgggttttttttttttTTTTTTTTTTTTATTTTTAT TTTTTAGAAATTGAATTATTAAATTTAGTTTTGCTATAATCTTTATATTT TCTGGATAATTAAATTCATCAATAAACTCTCATTTTTTAGTTTTGCTATA ATCTTGATTATGAAAATAGCTCATACACACGACTCATGATCACACAACTC ACTCAGCTCTGGATCAGTCTGCCTGAACCGTGATTGGCTTTCCTCTTCCC ATCGAGAATCCTCTGGTCCCATCAGGCATCCTTGGTCCCGGTGGTTGCTG TTTTGGCTACTCCATGCTGCTCATTGGATGGTTTGAAGGTGGTGTTCCAA CTTGATGTTGATGGTGATGATTATGATGATTCCCACGCTACGCCCTTTTC TCTGCCCACCCGCTTTCTCATTCCCTTCATCCCTTGCTCCTCTCCTACTA TCACTGCCTGCTCAGGCACATTTGTATCCCATTCTCCAGAGAACTCCTCA GCTTGTTTCTCAATTGGCTGTTGCTCAGATGTCGTGGCATCATCTTCTTC ATGTTCTACTTCTACATCATGTCCTTTTCTTCCTCGCCCTTGAACCTGTT TTGGTTCCTTTGTCTTGGTGCCTTGATCGAAACCTTGCAGAGGCAAGTGT AATTATCTCTTTTCTGAAGATTTTCAGGGAAACCACCAAAGATCCTCTCA ATACAAGTGTGAATGTCTCTTTTCATAGACTTATTCCcan QA 81 786 46 789 DS CHROMAT_FILE: LL2336r PHD_FILE: LL2336r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:13 
2000 RD LL2336f 765 0 0 cttgatactaAATTTAGTTTTGCTATAATCTTTATATTTTCTGGATAATT AAATTCATCAatgaaCTCTCATTTTTTAGTTTTGCTATAATCTTGATTAT GAAAATAGCTCATACACACGACTCATGATCACACAACTCACTCAGCTCTG GATCAGTCTGCCTGAACCGTGATTGGCTTTCCTCTTCCCATCGAGAATCC TCTGGTCCCATCAGGCATCCTTGGTCCCGGTGGTTGCTGTTTTGGCTACT CCATGCTGCTCATTGGATGGTTTGAAGGTGGTGTTCCAACTTGATGTTGA TGGTGATGATTATGATGATTCCCACGCTACGCCCTTTTCTCTGCCCACCC GCTTTCTCATTCCCTTCATCCCTTGCTCCTCTCCTACTATCACTGCCTGC TCAGGCACATTTGTATCCCATTCTCCAGAGAACTCCTCAGCTTGTTTCTC AATTGGCTGTTGCTCAGATGTCGTGGCATCATCTTCTTCATGTTCTACTT CTACATCATGTCCTTTTCTTCCTCGCCCTTGAACCTGTTTTGGTTCCTTT GTCTTGGTGCCTTGATCGAAACCTTGCAGAGGCAAGTGTAATTATCTCTT TTCTGAAGATTTTCAGGGAAACCACCAAAGATCCTCTCAATACAAGTGTG AATGTCTCTTTTCATAGACTTATTCCCAAATTGGAAGAaACCTAAATTcC CAAATCagaaaatCAAAATCCCTAAATTGAGATAATACCCACGaAaTCaa ACATGATGAagaatn QA 12 760 9 764 DS CHROMAT_FILE: LL2336f PHD_FILE: LL2336f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:34 2000 CO Contig42 1294 2 18 U ctttaagggaactgcttttagttaagtttattcgagtttgtaataccaaa gataaagcaaataccatgttcaagaggtaTATGAACAAACTCAAACACAA ACATCTAACATAGATTCGGACAAACAAACAAAAACGAAGTGTGTGTGTAA TAAGCTCTCTCAGACATAGCTTACTCAACTTATAGACGAAACTAAAAGAT ATGACATTTGACTCCACGCATCATTCTCCTTAATGCAGATAGGTTTCTAG GAATGTCTACTACACATAGCTTACATATAATATCTTTTAGCCTAGATCAG GGACAATGAAATGCTCTTCTTCATCATCCTCATCATCAACTCTGATCTCT AACCTCGGCTTCTTGTATGACACGCTGGACTTCATTGGTCTGTACGAATG AACATGACCTAAATGCGCAACAGAAAGCTCCGGAACTTGAGCTGCAGGAA GAGTCTCCTGATTAATCTGATCACTGAAGAAACCCATGTCTGAAAACCAC TCCAACTCCCCAAGATCAAGCTGTCCTTTCTGCTATACATTCAAACAAAA AAAAAAACAAGTCAATACAAAAAAAAAACAAAGCATTAAGGATCAAACCA ACACTTCAATCTCTACCTTATCGGTGAACTCAGGATCAGAGAAGTGGAAG AAATCATCAACAGCCCAAGGAAGAGGAGAAGCAGAAGAAGGCTGTTGCTG GCTTGGGATCTCCTTAGCTTTCTGTTGGTTGTTGGAAGGCTCGGACTGAT GATTCTTGTCTGTTTCTTTACTGCAACTACTTGAGCTCAGAGCCACTTTG ATCCCAGTGGCTAAGAACCTCTGGTGATTTGCAGATCTAGAGTTAGCCAC GTGGATAGATTCATCACAGTCCCTGCAAAGCAGAGCTCTATCCTCCACAC AGAAAATGAAAGCTGCCTTCTCTTGACAGATGTCGCAACGAGGGAACTTG GTGGAGAGAGAGTTGAGATGGAGGCGTTGGTGCTTGCTGGCGAGTTTGTT 
AGCTGCGTGAATCTCCACGTCGCATTTAGGACACAGAGCTGCTTCGTCGG CGCAACATATCACCGTCGCAGGAGCATTCTCACACACATCACACTGTATC TTCATCACAGTTCTTGATTCTTGGTTTAATAGATCAAGAACACTTGAATC TACCTTCTGAGAGTAGCCTAGCAAAAATATAGATGAAGAGAAAGTATTTT GAGACAGAAATCGAGAGATGTAAACAACAAAAGAGGAACAAGTAACAACT GTGTGGTGtggaggggaagagaggagagaagatcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 37 35 35 35 27 27 29 32 35 35 40 40 37 37 43 51 46 46 46 40 40 42 42 42 37 37 39 37 37 37 40 40 37 40 40 40 40 40 35 35 35 40 40 40 51 51 51 51 51 51 51 51 51 40 39 39 39 39 38 37 56 50 50 44 44 46 40 40 40 38 38 35 35 35 35 43 43 43 43 45 45 45 43 46 46 41 40 40 38 38 38 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 36 36 36 43 43 36 36 38 56 42 42 41 41 41 43 45 45 41 41 42 38 36 36 36 35 35 36 36 43 43 36 38 42 42 42 42 46 43 43 43 41 43 43 42 42 42 46 46 42 42 42 46 46 46 42 41 38 38 38 38 40 43 51 56 50 50 41 38 38 39 43 43 43 43 36 40 43 46 46 43 42 41 35 35 35 38 38 38 56 56 56 56 56 42 42 42 42 41 41 50 42 42 42 46 40 35 35 35 36 40 50 41 36 36 36 36 35 35 47 50 50 56 51 43 43 43 43 43 40 42 42 50 50 50 44 44 44 50 46 56 50 50 44 44 47 50 50 50 43 43 43 43 43 38 38 43 42 56 45 45 45 38 38 38 36 36 35 41 45 45 45 41 41 41 44 50 50 37 37 42 42 43 43 43 43 43 38 38 38 38 43 38 38 38 41 42 43 43 56 50 50 41 42 35 32 32 38 38 38 43 56 56 56 56 56 56 56 45 41 41 41 41 42 50 50 41 38 38 38 38 35 37 42 42 42 42 41 41 41 41 41 41 41 42 41 35 35 38 38 38 41 38 40 38 40 35 38 50 50 50 50 42 42 42 42 41 41 35 33 40 35 35 35 41 42 46 51 45 51 46 46 35 35 31 35 35 35 30 35 35 37 35 35 35 42 46 42 56 56 56 56 50 56 56 56 56 56 50 50 35 35 35 40 40 40 40 40 37 40 40 40 56 56 56 56 56 51 51 51 51 45 45 56 46 42 42 56 56 56 56 56 56 56 42 42 42 42 42 46 56 56 56 50 56 46 46 46 42 42 42 47 56 56 56 56 47 47 47 48 44 37 52 57 57 63 64 64 57 61 55 69 71 82 82 74 88 78 78 71 62 59 61 50 51 58 57 62 62 65 60 60 62 73 73 
84 69 73 62 74 65 64 53 53 59 70 62 54 43 42 42 56 54 63 65 80 69 69 67 67 69 71 71 82 73 67 47 42 40 46 50 60 75 69 63 66 69 74 80 72 72 74 74 71 53 48 54 50 40 43 58 53 52 59 60 71 73 73 65 54 49 34 34 40 54 48 58 61 61 67 50 41 44 54 50 50 54 46 49 59 64 56 55 55 40 48 48 48 46 46 55 55 55 55 55 55 71 71 71 66 55 55 60 60 68 63 68 70 70 76 90 83 69 61 57 50 50 47 47 51 51 61 63 65 61 61 58 55 55 58 59 58 62 59 51 48 41 41 40 47 47 55 59 64 63 61 61 61 64 71 67 69 70 69 58 53 55 72 52 54 59 61 62 61 61 67 61 61 61 61 71 68 71 54 52 52 52 52 53 66 68 69 66 61 60 68 60 60 60 45 56 40 40 45 45 45 45 56 56 51 51 51 51 51 51 56 51 45 40 40 40 40 45 40 40 51 45 45 51 40 40 45 40 40 40 40 40 40 40 45 45 40 37 37 37 40 40 51 51 56 56 45 45 45 40 40 40 40 40 40 45 40 40 40 40 40 51 51 51 51 51 51 56 56 56 56 56 56 51 40 40 40 40 40 45 51 56 56 56 45 40 40 40 40 40 40 45 45 45 45 46 46 43 43 56 56 56 56 56 56 51 51 51 51 51 56 56 56 56 56 56 43 43 43 43 43 43 56 56 51 51 51 51 51 45 45 45 51 45 45 56 56 51 51 45 43 43 43 43 43 45 51 51 51 51 51 51 51 51 51 51 45 45 45 45 45 51 56 56 56 56 51 45 45 45 45 45 56 56 56 56 56 51 51 45 43 43 43 43 43 43 45 45 45 45 45 51 51 51 51 43 43 43 43 43 43 43 43 43 45 45 43 43 43 43 43 43 36 36 36 36 36 43 45 45 51 51 51 45 43 43 43 43 43 43 43 43 51 51 51 51 51 51 51 56 51 45 43 43 43 43 43 43 45 45 43 43 43 43 43 36 35 35 35 35 35 36 43 51 51 51 51 56 56 51 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 45 45 45 45 51 51 56 56 56 56 56 56 56 51 43 43 43 43 43 51 51 51 51 51 51 56 56 56 56 56 56 43 43 43 43 43 43 45 45 45 51 45 43 43 43 43 43 45 51 51 45 45 51 51 51 51 56 56 56 56 51 51 51 45 45 45 43 43 35 35 35 35 35 35 43 35 35 35 35 35 38 56 45 45 40 40 40 40 40 40 45 51 51 51 51 51 51 40 40 39 39 39 39 46 46 46 40 40 40 40 51 40 40 40 40 40 40 51 51 51 51 51 51 51 40 39 39 39 39 39 39 40 40 40 37 40 46 56 56 56 51 51 51 51 51 45 40 40 40 40 40 40 37 51 51 40 37 46 40 36 35 35 35 35 31 0 
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2337f U 0 AF LL2337r C 553 BS 1 582 LL2337f BS 583 583 LL2337r BS 584 606 LL2337f BS 607 608 LL2337r BS 609 612 LL2337f BS 613 613 LL2337r BS 614 621 LL2337f BS 622 626 LL2337r BS 627 627 LL2337f BS 628 628 LL2337r BS 629 629 LL2337f BS 630 632 LL2337r BS 633 635 LL2337f BS 636 644 LL2337r BS 645 648 LL2337f BS 649 658 LL2337r BS 659 661 LL2337f BS 662 1294 LL2337r RD LL2337f 789 0 0 nctttaagggaactgcttttagttaagtttattcgagtttgtaataccaa agataaagcaaataccatgttcaagaggtaTATGAACAAACTCAAACACA AACATCTAACATAGATTCGGACAAACAAACAAAAACGAAGTGTGTGTGTA ATAAGCTCTCTCAGACATAGCTTACTCAACTTATAGACGAAACTAAAAGA TATGACATTTGACTCCACGCATCATTCTCCTTAATGCAGATAGGTTTCTA GGAATGTCTACTACACATAGCTTACATATAATATCTTTTAGCCTAGATCA GGGACAATGAAATGCTCTTCTTCATCATCCTCATCATCAACTCTGATCTC TAACCTCGGCTTCTTGTATGACACGCTGGACTTCATTGGTCTGTACGAAT GAACATGACCTAAATGCGCAACAGAAAGCTCCGGAACTTGAGCTGCAGGA AGAGTCTCCTGATTAATCTGATCACTGAAGAAACCCATGTCTGAAAACCA CTCCAACTCCCCAAGATCAAGCTGTCCTTTCTGCTATACATTCAAACAAA AAAAAAAACAAGTCAATACAAAAAAAAAACAAAGCATTAAGGATCAAACC AACACTTCAATCTCTACCTTATCGGTGAACTCAGGATCAGAGAAGTGGAA GAAATCATCAACAGCCCAAGGAAGAGGAGAAGcacaagAAGGCTGTTGCT GGCTTGGGATCTCCTTAGCTTTCTGTTGGTTGTTGGAAGGCTCGGACTGA TGATTCTTGTCTGTTTCTTTACTGCAACTACTTGAGCtc QA 81 787 2 789 DS CHROMAT_FILE: LL2337f PHD_FILE: LL2337f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:39 2000 RD LL2337r 786 0 0 aaaancaAGTCAATACAAAAAAAAAACAAAGCATTAAGGATCAAACCAAC ACTTCAATCTCTACCTTATCGGTGAACTCAGGATCAGAGAAGTGGAAGAA ATCATCAACAGCCCAAGGAAGAGGAGAAGCAGAAGAAGGCTGTTGCTGGC TTGGGATCTCCTTAGCTTTCTGTTGGTTGTTGGAAGGCTCGGACTGATGA TTCTTGTCTGTTTCTTTACTGCAACTACTTGAGCTCAGAGCCACTTTGAT CCCAGTGGCTAAGAACCTCTGGTGATTTGCAGATCTAGAGTTAGCCACGT GGATAGATTCATCACAGTCCCTGCAAAGCAGAGCTCTATCCTCCACACAG AAAATGAAAGCTGCCTTCTCTTGACAGATGTCGCAACGAGGGAACTTGGT GGAGAGAGAGTTGAGATGGAGGCGTTGGTGCTTGCTGGCGAGTTTGTTAG CTGCGTGAATCTCCACGTCGCATTTAGGACACAGAGCTGCTTCGTCGGCG CAACATATCACCGTCGCAGGAGCATTCTCACACACATCACACTGTATCTT 
CATCACAGTTCTTGATTCTTGGTTTAATAGATCAAGAACACTTGAATCTA CCTTCTGAGAGTAGCCTAGCAAAAATATAGATGAAGAGAAAGTATTTTGA GACAGAAATCGAGAGATGTAAACAACAAAAGAGGAACAAGTAACAACTGT GTGGTGtggaggggaagagaggagagaagatcggacgcgtggxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxat QA 7 706 1 742 DS CHROMAT_FILE: LL2337r PHD_FILE: LL2337r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:17 2000 CO Contig43 512 2 76 U tgAGCGAATTGTGTCTATGTCATTCACTTTCAAAAAATTATAGAACAACT CTTTCAAGAAGAAGAGAAAAAAATCATCTATTAACAAGGTACAAGTCTAT CTCTAATTAGATGAACAATATGATGGATAATCATCAATGAAGCCGAGAAG TAACATTGATGAAAGTATCTTCAGGACAAGGGATAGTTAAGCCACCCATT GGATGATCAAACCCAAACTCTTCTTCAGATCTGCTGAGGAGAGCTTGAAA TAAGGGCTGGCTCAAGTATGAGATTGGCACCACATATCTCTTCTTCTGGC TCTCTCCAACGTACACCGCAAGAAACCCTTTTGGTGGTGCCGCCATGGTT GCTCTTTTGCTTGTAGAAGCTGTTGCTGTTACGGAACGACCAAGAATCTT TTTTGCACCCAATAAACTTCTAACCAAAGCCATTTGTCTGAAATTTTAAA CTCTTTGGAAGAGATGTATGAAAAGATTCAAGTTATTGGATTGCTGCTTG CCGGACGCGTGG BQ 0 17 20 27 29 34 34 39 40 40 46 55 55 55 55 55 86 83 85 85 85 85 85 88 88 85 85 85 88 90 90 90 90 90 86 85 85 85 79 85 90 80 85 78 82 82 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 86 82 82 79 90 90 85 82 77 80 80 80 90 90 79 79 90 90 90 90 84 84 84 84 80 80 86 86 86 86 86 86 82 82 82 81 81 86 89 90 90 85 85 85 84 84 83 89 89 89 90 90 90 90 90 90 90 90 85 85 80 88 88 90 90 88 88 78 78 78 75 86 86 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 90 90 90 83 83 86 86 83 83 83 86 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 88 88 88 88 88 90 90 86 84 86 88 88 86 90 90 90 90 90 90 90 90 88 88 88 88 88 88 90 90 90 86 86 88 88 88 88 88 90 90 90 90 90 88 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 90 90 90 90 90 90 90 90 89 86 86 86 86 86 90 86 86 86 81 81 81 79 86 88 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 75 72 72 70 72 72 86 82 90 90 76 75 75 75 75 75 90 90 90 90 90 90 85 82 86 
90 90 85 89 80 80 76 80 85 83 83 83 83 83 83 88 82 82 78 78 78 78 78 78 78 78 78 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 86 90 90 90 90 90 90 84 84 90 90 74 74 90 85 79 80 90 90 84 90 90 90 90 90 90 88 90 90 90 79 84 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 66 66 51 51 51 56 56 51 51 60 60 89 89 90 86 86 86 90 86 90 90 90 90 85 85 81 81 81 86 81 81 85 81 83 83 86 90 90 83 52 55 55 40 51 56 AF LL2340r C -231 AF LL2340f U -2 BS 1 15 LL2340r BS 16 16 LL2340f BS 17 41 LL2340r BS 42 42 LL2340f BS 43 43 LL2340r BS 44 44 LL2340f BS 45 46 LL2340r BS 47 47 LL2340f BS 48 71 LL2340r BS 72 74 LL2340f BS 75 76 LL2340r BS 77 80 LL2340f BS 81 86 LL2340r BS 87 87 LL2340f BS 88 90 LL2340r BS 91 95 LL2340f BS 96 138 LL2340r BS 139 139 LL2340f BS 140 154 LL2340r BS 155 163 LL2340f BS 164 181 LL2340r BS 182 185 LL2340f BS 186 187 LL2340r BS 188 189 LL2340f BS 190 192 LL2340r BS 193 193 LL2340f BS 194 197 LL2340r BS 198 200 LL2340f BS 201 207 LL2340r BS 208 208 LL2340f BS 209 221 LL2340r BS 222 230 LL2340f BS 231 237 LL2340r BS 238 240 LL2340f BS 241 241 LL2340r BS 242 242 LL2340f BS 243 244 LL2340r BS 245 245 LL2340f BS 246 247 LL2340r BS 248 248 LL2340f BS 249 262 LL2340r BS 263 269 LL2340f BS 270 274 LL2340r BS 275 280 LL2340f BS 281 283 LL2340r BS 284 292 LL2340f BS 293 293 LL2340r BS 294 297 LL2340f BS 298 308 LL2340r BS 309 313 LL2340f BS 314 314 LL2340r BS 315 317 LL2340f BS 318 331 LL2340r BS 332 336 LL2340f BS 337 345 LL2340r BS 346 348 LL2340f BS 349 356 LL2340r BS 357 378 LL2340f BS 379 379 LL2340r BS 380 390 LL2340f BS 391 394 LL2340r BS 395 408 LL2340f BS 409 410 LL2340r BS 411 416 LL2340f BS 417 417 LL2340r BS 418 424 LL2340f BS 425 425 LL2340r BS 426 441 LL2340f BS 442 442 LL2340r BS 443 450 LL2340f BS 451 451 LL2340r BS 452 480 LL2340f BS 481 487 LL2340r BS 488 488 LL2340f BS 489 506 LL2340r BS 507 512 LL2340f RD LL2340r 789 0 2 cgatcggtgcgggcctcttcgctattacgccagctggcgaaagggggatg 
tgctgcaaggcgattaagttgggtaacgccagggttttxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxctttttttttttttttttttttttttttntgAGCGAATTGTGTCTAT GTCATTCACTTTCAAAAAATTATAGAACAACTCTTTCAAGAAGAAGAGAA AAAAATCATCTATTAACAAGGTACAAGTCTATCTCTAATTAGATGAACAA TATGATGGATAATCATCAATGAAGCCGAGAAGTAACATTGATGAAAGTAT CTTCAGGACAAGGGATAGTTAAGCCACCCATTGGATGATCAAACCCAAAC TCTTCTTCAGATCTGCTGAGGAGAGCTTGAAATAAGGGCTGGCTCAAGTA TGAGATTGGCACCACATATCTCTTCTTCTGGCTCTCTCCAACGTACACCG CAAGAAACCCTTTTGGTGGTGCCGCCATGGTTGCTCTTTTGCTTGTAGAA GCTGTTGCTGTTACGGAACGACCAAGAATCTTTTTTGCACCCAATAAACT TCTAACCAAAGCCATTTGTCTGAAATTTTAAACTCTTTGGAAGAGATGTA TGAaacgATTCAAGTTATTGGATTGCTGCTTGCCGGACGCGTGGxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaag QA 28 744 233 744 DS CHROMAT_FILE: LL2340r PHD_FILE: LL2340r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:31 2000 RT{ LL2340r chimera phrap 1 88 000919:094547 } RT{ LL2340r matchElsewhereHighQual phrap 1 88 000919:094547 } RD LL2340f 787 0 3 cttttagcgaactGTGTCTATGTCATTCACTTTCAAAAAATTATAGAACA ACTCTTTCAAGAAGAAGAGAAAAAAATCATCTATTAACAAGGTACAAGTC TATCTCTAATTAGATGAACAATATGATGGATAATCATCAATGAAGCCGAG AAGTAACATTGATGAAAGTATCTTCAGGACAAGGGATAGTTAAGCCACCC ATTGGATGATCAAACCCAAACTCTTCTTCAGATCTGCTGAGGAGAGCTTG AAATAAGGGCTGGCTCAAGTATGAGATTGGCACCACATATCTCTTCTTCT GGCTCTCTCCAACGTACACCGCAAGAAACCCTTTTGGTGGTGCCGCCATG GTTGCTCTTTTGCTTGTAGAAGCTGTTGCTGTTACGGAACGACCAAGAAT CTTTTTTGCACCCAATAAACTTCTAACCAAAGCCATTTGTCTGAAATTTT AAACTCTTTGGAAGAGATGTATGAAAAGATTCAAGTTATTGGATTGCTGC TTGCCGGACGCGTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxgcctggggtgcctaatgagtga gctaactcacattaattgcgttgcgctcactggcccgctttcagtcggga aacctgtcgtgccagctgcattaatgaatcggccacg QA 14 781 6 515 DS CHROMAT_FILE: LL2340f PHD_FILE: LL2340f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:54 2000 RT{ LL2340f chimera phrap 679 785 000919:094547 } RT{ LL2340f 
matchElsewhereHighQual phrap 743 785 000919:094547 } RT{ LL2340f matchElsewhereHighQual phrap 679 732 000919:094547 } CO Contig44 1230 2 30 U ctttatgggaaacaaaaaaatatttatatatcatagacctagtgactatt cccctgtcttctcgtctccgaatctctaataactaacatatcttcgtctc tctttgtactttgtaatatgagactctgacttgaacatccaatctaacac acaacatttaaaagccaatattgatttgccagtcttgtcttctctttctc atatatttggcatactatcattatagtcttgttcaactgtgtcattagcc tggtgagttggttgccgaacccttttctctttctctctcagctgctgctt catcctcagttttcttggcaaaccggctctcactatctccttgcccTTTt caTTTgcatgcttgtgtctTGACTCATGAagaTATGgctTtctGTTGATA AGTTTTCTTTctagctcggccttgtgcgcgtgctTTttctccgccttata attccgtggtacttgtTTcgcATTCACaTAAACAGGCTCTTGTGTCATGT CAAGCGGTAGTGCTGTTCTTTCACCAGGCATTCCAGGATATGGACGAAAC CCTTGCAATTGCTGATGTCCATATGCTCCCACCATTCCCCCATAATATGG ATCCTGATATGGATTTGGGACACAAGCAATGTAGTGTCCAACAAGCTCTG GTGGTTGTACAAGTTGCTGCTGATCATGCACGTTACCCACAGAAGCAGCA GGATCATTCCCTTCCACTCCATGGTTATCTGAAGGGGACGAAGTAGCAGC TTGTGTATCCTTCCAAGCGCCATCGTCTCCAGACGCTGAGTGAACATCGT TTGATTCTAAAGAGGAGGACTTTGAAAGAACCACAGAAGGTATCTCTTGA GGAACGACACCAAAAGAGTTATTTTTTTTCCACCATGGCTCCTCTGCATA CATCATTGGCTGCTCACCAGAATGGTGATTGACTTCCACTTCATTCCCAC TTCCCTTGGATTGCATCTCTTCAAAACAACACCAGAGGATATTCACGAGG GAGTTTTTTGCAATCGTCTTCGTGTTCCTATGATGATGACTTAAAGACCC CTGAAGTTATGTAATCGATGGAGCTCAGAGTGAGACAAAACAAAAAGAAT CAGAATTTGAACAGAGCCAGAGAAGAATTTGGAGAAGAGAGACGAATACG ATTGGGCAGAAGAAACCCTAATTTTTGGGGATTTTTGTttctcAAaattg ttttttctttgtatttttgcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 15 15 15 14 18 21 22 20 17 17 15 29 27 21 15 12 11 11 12 13 15 17 14 15 15 17 17 18 23 23 25 25 22 22 24 24 20 25 14 14 14 25 22 22 22 19 19 16 27 17 15 15 21 21 26 30 30 28 30 22 22 22 33 36 24 30 30 42 24 19 19 15 15 18 18 16 16 14 12 11 14 10 8 8 8 11 11 17 11 10 10 10 11 23 23 18 15 15 15 19 17 15 18 17 17 17 11 11 11 11 10 15 12 14 11 13 14 14 13 15 10 8 10 13 18 24 21 18 18 18 24 24 25 25 23 22 19 21 34 37 38 49 45 50 42 48 49 52 39 46 47 51 54 53 61 54 56 50 53 50 45 39 47 48 51 48 48 46 46 46 50 49 54 50 40 40 50 50 53 44 52 51 44 44 36 40 43 43 42 44 47 47 44 53 46 52 55 57 60 58 50 48 54 55 53 47 49 49 53 54 49 48 49 54 54 61 57 55 54 47 59 43 49 44 47 47 49 50 65 64 63 63 59 61 55 55 55 63 44 52 52 55 58 68 68 68 72 77 79 68 63 62 62 59 60 60 55 49 51 45 52 46 51 49 55 47 55 56 52 47 47 50 49 40 40 40 40 39 52 52 56 65 76 55 55 40 46 40 40 37 37 52 50 50 50 55 52 54 49 49 49 61 60 55 57 57 67 63 68 60 67 62 58 52 50 57 57 60 68 75 78 73 62 60 67 66 60 61 65 64 80 78 80 85 83 70 73 54 52 52 59 59 50 55 59 64 73 73 65 75 63 63 55 60 58 58 58 62 62 67 75 80 72 72 75 71 74 71 71 68 66 66 66 66 70 71 71 71 71 75 75 74 66 67 66 66 67 74 73 71 71 71 71 67 66 70 73 70 56 56 55 55 55 55 62 64 65 59 67 54 64 64 51 51 51 45 45 45 51 51 51 56 45 43 43 68 70 68 63 63 68 64 66 70 62 60 51 58 63 66 62 59 60 61 45 45 43 43 43 43 43 51 56 56 45 45 45 45 43 43 51 51 51 51 51 51 51 45 45 45 51 45 45 45 45 45 45 51 51 45 45 51 51 51 51 56 56 56 56 56 56 56 56 56 51 56 45 45 45 45 45 45 51 51 51 51 51 56 56 51 45 43 43 43 40 40 41 56 56 56 56 56 56 51 45 45 45 45 45 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 45 45 45 45 45 45 51 51 45 51 51 51 51 51 56 56 56 56 56 51 43 43 43 43 43 43 51 56 56 56 56 56 51 51 51 51 51 45 45 45 45 45 51 51 51 51 51 51 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 45 43 43 43 43 43 43 45 45 51 56 56 56 56 56 56 56 56 56 56 56 56 
45 43 43 43 43 43 43 45 56 56 56 56 51 51 51 51 51 51 45 43 43 43 43 43 46 56 51 51 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 51 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 51 51 45 45 45 51 51 51 56 56 56 56 45 45 45 43 43 43 51 43 43 43 43 43 43 43 43 43 43 43 43 43 43 43 45 45 43 43 43 43 43 43 43 45 45 45 51 51 51 51 51 51 51 56 51 51 51 43 43 43 43 43 43 45 51 51 43 40 40 40 40 40 39 39 39 39 39 39 40 51 51 51 51 51 51 51 51 40 40 46 46 46 46 51 51 51 51 51 51 51 40 40 40 40 40 39 39 39 39 39 39 39 46 40 39 39 39 39 39 51 51 51 51 51 51 51 35 35 35 35 35 40 51 56 56 56 56 56 56 56 56 51 51 42 40 36 39 24 16 13 11 10 12 24 24 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2343f U 0 AF LL2343r C 478 BS 1 477 LL2343f BS 478 479 LL2343r BS 480 484 LL2343f BS 485 485 LL2343r BS 486 486 LL2343f BS 487 489 LL2343r BS 490 491 LL2343f BS 492 500 LL2343r BS 501 505 LL2343f BS 506 512 LL2343r BS 513 514 LL2343f BS 515 515 LL2343r BS 516 516 LL2343f BS 517 519 LL2343r BS 520 531 LL2343f BS 532 535 LL2343r BS 536 556 LL2343f BS 557 557 LL2343r BS 558 558 LL2343f BS 559 581 LL2343r BS 582 582 LL2343f BS 583 591 LL2343r BS 592 600 LL2343f BS 601 619 LL2343r BS 620 622 LL2343f BS 623 750 LL2343r BS 751 753 LL2343f BS 754 766 LL2343r BS 767 785 LL2343f BS 786 1230 LL2343r RD LL2343f 809 0 0 nctttatgggaaacaaaaaaatatttatatatcatagacctagtgactat tcccctgtcttctcgtctccgaatctctaataactaacatatcttcgtct ctctttgtactttgtaatatgagactctgacttgaacatccaatctaaca cacaacatttaaaagccaatattgatttgccagtcttgtcttctctttct catatatttggcatactatcattatagtcttgttcaactgtgtcattagc ctggtgagttggttgccgaacccttttctctttctctctcagctgctgct tcatcctcagttttcttggcaaaccggctctcactatctccttgcccTTT tcaTTTgcatgcttgtgtctTGACTCATGAagaTATGgctTtctGTTGAT AAGTTTTCTTTctagctcggccttgtgcgcgtgctTTttctccgccttat aattccgtggtacttgtTTcgcATTCACatAAACAGGCTCTTGTGTCATG TCAAGCGGTAGTGCTGTTCTTTCACCAGGCATTCCAGGATATGGACGAAA CCCTTGCAATTGCTGATGTCCATATGCTCCCACCATTCCCCCATAATATG GATCCTGATATGGATTTGGGACACAAGCAacgtAGTGTCCAACAAGCTCT 
GGTGGTTGTACAAGTTGCTGCTGATCATGCACGTTACCCACAGAAGCAGC AGGATCATTCCCTTCCACTCCATGGTTATCTGAAGGGGACGAAGTAGCAG CTTGTGTattctTCCAAGCGCCATCGTCTCCAGACGCTGAgcgAACATCg cttgATTct QA 343 751 2 809 DS CHROMAT_FILE: LL2343f PHD_FILE: LL2343f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:08 2000 RD LL2343r 798 0 0 aTAAACAGGCTCTTGTGTCATGTCAAGCGGTAGTGCTGTTCTTTCACCAG GCATTCCAGGATATGGACGAAACCCTTGCAATTGCTGATGTCCATATGCT CCCACCATTCCCCCATAATATGGATCCTGATATGGATTTGGGACACAAGC AATGTAGTGTCCAACAAGCTCTGGTGGTTGTACAAGTTGCTGCTGATCAT GCACGTTACCCACAGAAGCAGCAGGATCATTCCCTTCCACTCCATGGTTA TCTGAAGGGGACGAAGTAGCAGCTTGTGTATCCTTCCAAGCGCCATCGTC TCCAGACGCTGAGTGAACATCGTTTGATTCTAAAGAGGAGGACTTTGAAA GAACCACAGAAGGTATCTCTTGAGGAACGACACCAAAAGAGTTATTTTTT TTCCACCATGGCTCCTCTGCATACATCATTGGCTGCTCACCAGAATGGTG ATTGACTTCCACTTCATTCCCACTTCCCTTGGATTGCATCTCTTCAAAAC AACACCAGAGGATATTCACGAGGGAGTTTTTTGCAATCGTCTTCGTGTTC CTATGATGATGACTTAAAGACCCCTGAAGTTATGTAATCGATGGAGCTCA GAGTGAGACAAAACAAAAAGAATCAGAATTTGAACAGAGCCAGAGAAGAA TTTGGAGAAGAGAGACGAATACGATTGGGCAGAAGAAACCCTAATTTTTG GGGATTTTTGTttctcAAaattgttttttctttgtatttttgcggacgcg tggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxtgtccagaatt QA 1 718 1 753 DS CHROMAT_FILE: LL2343r PHD_FILE: LL2343r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:45 2000 CO Contig45 867 2 66 U ctgtaaggcgtccatctacttcaacagacaaatgtattacaaaatagctt caagaaaatatttataacattttgaagattCCAAAATATTTTTTTTAAAT AGCAAAAAAGACAAATGCAGCAAAAGATTTAACAAGAAAAACTTCACAAA CTTTGATCCTGGCCAATCTCTTCCCCCCTTTCTCTATGTTCCTTGGGGAT GCAGCAGTAGCAGTCTTTGACTATTGGAAAGGTTGTTAGTTGGCGAAAAC GTTGAGCCTAACGCGAGCAGTGACATCAGGGTGAAGTTTGAGTTCGGCTA TGTATTCTCCCGTTTCACGGATCTCCGGAAGAGAGACAAGGCGCTTGTCT ATATCCCTTTGGAGCTGCGCCTTGATGATGTCAACAAGGTCTTGTGCTGT GACAGATCCAAATATTTGTTTGCCCTTACCGCCCTTGCGTTTAACCTTGA AAGCCCCAACGGTTTCGAATACGGTAGCCAATTGTAGCGCCTCTTCTTTC ACCCTTTGCTTTTCTGCCTCTATCCTTTCGTTTTCCATCTTCATTTCCTT GAGCAGAAGCGGAGTCATGAGCTGAGCCTTTCCAGTGGGCAAGAGGAAGT TTCTGAAGAAACCGGCTTTCACGTCCAGTAATTGCCCTTGCTTGCCCAAG TCCAACACATCCTCTTTCAAGATTACCTTGCGGAGTTTCTTGGCTTTCTT 
CTGGGCAACAACCTCCAAAGTAGACCTTCGCTGGGATACTTTTAGTGTTT CATTTCCGACGACATTGAAGGTGTGAGACGAACAAAGCGAGGAGCTCCAC GAGAGTGACAAAGACGAAGAAGCCATTggcGAAgcttattcagtcccacg cgtccgcggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 34 39 39 40 37 37 37 37 40 37 56 56 56 35 35 35 35 35 34 35 35 35 35 40 46 51 51 51 40 40 40 40 35 35 35 35 35 35 35 35 40 40 55 61 61 71 71 54 55 55 55 60 51 65 65 80 83 80 81 67 67 72 62 62 65 72 71 72 65 79 76 74 58 63 65 67 64 62 62 58 55 55 55 55 60 72 67 72 74 84 84 88 90 90 90 80 80 79 77 70 67 72 77 85 85 77 77 72 65 65 65 65 65 76 83 88 85 80 64 66 66 68 66 66 73 79 69 74 80 80 80 86 78 80 79 77 77 77 79 79 75 90 72 68 68 78 78 83 83 85 88 82 68 67 64 61 61 69 74 72 77 72 73 72 77 77 77 75 75 75 75 75 68 81 81 83 77 77 85 83 77 79 80 78 78 78 80 82 85 85 83 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 84 84 83 83 83 89 87 87 83 90 90 79 77 77 90 90 85 81 81 90 85 85 89 89 87 87 89 89 87 90 90 90 90 70 68 70 70 77 77 90 90 90 90 90 90 89 83 82 71 71 75 75 76 76 83 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 80 80 78 78 80 83 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 87 90 83 88 90 90 90 90 90 90 85 85 85 85 85 85 90 90 80 80 80 83 78 78 88 82 82 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 88 88 90 90 90 90 90 90 90 90 85 85 85 87 88 88 90 90 90 90 90 90 90 88 82 82 79 79 79 84 87 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 83 83 83 76 78 90 90 90 90 89 90 83 83 81 83 83 83 87 87 86 83 90 90 86 84 89 85 85 90 90 90 90 90 90 90 90 90 89 82 90 90 83 76 76 85 79 77 85 83 75 69 66 73 76 74 76 85 90 84 90 90 75 72 63 63 61 58 65 77 83 83 75 80 70 80 76 72 
78 80 89 89 85 85 80 84 72 70 70 82 75 83 83 68 65 73 76 78 83 83 75 85 85 85 85 90 90 90 89 88 83 76 76 85 73 73 74 75 80 75 78 78 76 72 70 70 83 83 72 71 76 80 77 72 63 63 68 63 63 70 74 75 83 90 83 68 68 65 65 61 61 79 72 64 67 72 66 63 68 68 57 67 67 67 59 53 52 59 64 72 69 83 83 83 90 85 80 72 72 70 75 75 78 79 77 75 75 68 61 55 55 55 55 60 72 74 66 62 62 58 60 61 61 62 65 69 83 90 83 68 63 60 55 55 52 52 52 54 40 40 40 40 40 40 45 43 40 35 35 35 35 35 35 35 35 35 39 39 39 46 45 45 45 45 45 45 35 35 35 35 37 37 46 46 42 42 42 46 46 46 46 40 40 40 40 40 40 40 29 21 15 15 15 21 21 46 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2345f U 1 AF LL2345r C 121 BS 1 169 LL2345f BS 170 173 LL2345r BS 174 221 LL2345f BS 222 222 LL2345r BS 223 272 LL2345f BS 273 276 LL2345r BS 277 297 LL2345f BS 298 300 LL2345r BS 301 306 LL2345f BS 307 308 LL2345r BS 309 314 LL2345f BS 315 325 LL2345r BS 326 331 LL2345f BS 332 338 LL2345r BS 339 342 LL2345f BS 343 346 LL2345r BS 347 347 LL2345f BS 348 359 LL2345r BS 360 366 LL2345f BS 367 367 LL2345r BS 368 370 LL2345f BS 371 377 LL2345r BS 378 391 LL2345f BS 392 400 LL2345r BS 401 406 LL2345f BS 407 416 LL2345r BS 417 417 LL2345f BS 418 418 LL2345r BS 419 419 LL2345f BS 420 427 LL2345r BS 428 445 LL2345f BS 446 448 LL2345r BS 449 457 LL2345f BS 458 475 LL2345r BS 476 478 LL2345f BS 479 482 LL2345r BS 483 486 LL2345f BS 487 493 LL2345r BS 494 496 LL2345f BS 497 519 LL2345r BS 520 521 LL2345f BS 522 525 LL2345r BS 526 532 LL2345f BS 533 549 LL2345r BS 550 557 LL2345f BS 558 561 LL2345r BS 562 566 LL2345f BS 567 567 LL2345r BS 568 569 LL2345f BS 570 575 LL2345r BS 576 577 LL2345f BS 578 581 LL2345r BS 582 584 LL2345f BS 585 586 LL2345r BS 587 592 LL2345f BS 593 594 LL2345r BS 595 595 LL2345f BS 596 596 LL2345r BS 597 599 LL2345f BS 600 637 LL2345r BS 638 639 LL2345f BS 640 677 LL2345r BS 678 679 LL2345f BS 680 731 LL2345r BS 732 732 LL2345f BS 733 867 LL2345r RD LL2345f 777 0 0 
ctgtaaggcgtccatctacttcaacagacaaatgtattacaaaatagctt caagaaaatatttataacattttgaagattCCAAAATATTTTTTTTAAAT AGCAAAAAAGACAAATGCAGCAAAAGATTTAACAAGAAAAACTTCACAAA CTTTGATCCTGGCCAATCTCTTCCCCCCTTTCTCTATGTTCCTTGGGGAT GCAGCAGTAGCAGTCTTTGACTATTGGAAAGGTTGTTAGTTGGCGAAAAC GTTGAGCCTAACGCGAGCAGTGACATCAGGGTGAAGTTTGAGTTCGGCTA TGTATTCTCCCGTTTCACGGATCTCCGGAAGAGAGACAAGGCGCTTGTCT ATATCCCTTTGGAGCTGCGCCTTGATGATGTCAACAAGGTCTTGTGCTGT GACAGATCCAAATATTTGTTTGCCCTTACCGCCCTTGCGTTTAACCTTGA AAGCCCCAACGGTTTCGAATACGGTAGCCAATTGTAGCGCCTCTTCTTTC ACCCTTTGCTTTTCTGCCTCTATCCTTTCGTTTTCCATCTTCATTTCCTT GAGCAGAAGCGGAGTCATGAGCTGAGCCTTTCCAGTGGGCAAGAGGAAGT TTCTGAAGAAACCGGCTTTCACGTCCAGTAATTGCCCTTGCTTGCCCAAG TCCAACACATCCTCTTTCAAGATTACCTTGCGGAGTTTCTTGGCTTTCTT CTGGGCAACAACCTCCAAAGTAGACCTTCGCTGGGATACTTTTAGTGTTT CATTTCCGACGACATTGAAGGTGTGAg QA 81 777 1 777 DS CHROMAT_FILE: LL2345f PHD_FILE: LL2345f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:39 2000 RD LL2345r 793 0 0 CAAAAGATTTAACAAGAAAAACTTCACAAACTTTGATCCTGGCCAATCTC TTCCCCCCTTTCTCTATGTTCCTTGGGGATGCAGCAGTAGCAGTCTTTGA CTATTGGAAAGGTTGTTAGTTGGCGAAAACGTTGAGCCTAACGCGAGCAG TGACATCAGGGTGAAGTTTGAGTTCGGCTATGTATTCTCCCGTTTCACGG ATCTCCGGAAGAGAGACAAGGCGCTTGTCTATATCCCTTTGGAGCTGCGC CTTGATGATGTCAACAAGGTCTTGTGCTGTGACAGATCCAAATATTTGTT TGCCCTTACCGCCCTTGCGTTTAACCTTGAAAGCCCCAACGGTTTCGAAT ACGGTAGCCAATTGTAGCGCCTCTTCTTTCACCCTTTGCTTTTCTGCCTC TATCCTTTCGTTTTCCATCTTCATTTCCTTGAGCAGAAGCGGAGTCATGA GCTGAGCCTTTCCAGTGGGCAAGAGGAAGTTTCTGAAGAAACCGGCTTTC ACGTCCAGTAATTGCCCTTGCTTGCCCAAGTCCAACACATCCTCTTTCAA GATTACCTTGCGGAGTTTCTTGGCTTTCTTCTGGGCAACAACCTCCAAAG TAGACCTTCGCTGGGATACTTTTAGTGTTTCATTTCCGACGACATTGAAG GTGTGAGACGAACAAAGCGAGGAGCTCCACGAGAGTGACAAAGACGAAGA AGCCATTggcGAAgcttattcagtcccacgcgtccgcggacgcgtggxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxatt QA 1 713 1 747 DS CHROMAT_FILE: LL2345r PHD_FILE: LL2345r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:18 2000 CO Contig46 1226 2 44 U cttttggtaacgaagaggctacagatagatgaagcttgccaagaaaaagc ttatacaatagataaacgtgtggactgacTCTGAATTTGCTGAGTTCTAC 
AGAGCGATTGACGCGGCAAGTGCTTTCATACATTaGcTgCTTTGAGGGAA AGAAAATCagaAATCTTGCTGgtgagaAATCTGGTTGTATCTTGTTCTCA GGGCTTTCCAACATCGAGATTGAAGTAACACCCGAGAGGACAAGCAGAGT TTTACAACCACCATTCTGCCCAAACAAAATATCAGTGTCCAATCTATCAC CCACCATGCATATCTGTGACTTTTCGATTCCAAATTTGTCTGCCAAATAG TCCATCATAAAAGTTGAGGGTTTTCCAACTACAAGAggttcACgttGACA GGATCCAACAAGAGCACCAACCATAGAGCCACCACCTGCCCATTCTTGAG CATCGGTAAGGTGGGTGACAGCATCTCGgttTgTAGCAATGAACAGACAG CCCGGGTTTTCGCGGATACAGAGAGTTCCATACTGAATTTTGTAGTAGTT GAAATAGCGGTCAAATCCAACCACCACAGCTCCCACATCATTATCATGCT CCATTAGAAACCCTGGCTTCAGTTCAATCTGTTTTTTACCATCATCCGGA CCTCCAAGATACTGGAAACCAGCAAGGTCGAGCTCCTTCAAGATACCCTC CTCACCAATCACATAGACCTTCTTATCTTTGGGGAAATTAATAGACTGCA AGTATGCAGCTGCAGCAAAAGAAGAAGCAAATATTTCCTCCTCGTTAACA TTCAGGCCAAGAGTCTCAAACTTTTTGCCATATTGTTTCCTAGATTTTGT TGAGTTGTTTGTCACAAAAACCAACCTCTTTCCCTTGGCACGGAGCATAT CAAGAGTCTCTGGAACTCCCTCAATCAATTTATCTCCCTTCCAAATGACT CCGTCACAGTCGAAGATAAAAGTTTCAACGGAATCGATGAGCTGATCGGC GTTCTCGAGCTGCTGTGCCGCCATAGCTCTCGGTGTCATATTGGGAGTCA TCACGCGGAGAGGCTTATGATCGATTCTGCGAATGATGCCACCGGAGAAG GAGGAAGAACGAAATCCTGAGAAAGCCTTGACGGAGAAGAAAGGTTTGGA GTTTGGTAAGAGAGAAACTGAGGAAGAAGAAACAagcgctgttctgttca gcatcttttcttctccggacgcgtgg BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 35 35 35 39 37 33 42 32 29 29 35 35 35 35 37 37 35 35 35 35 32 32 35 35 32 32 32 32 35 37 35 32 35 32 32 32 29 29 29 29 29 29 29 35 32 35 32 32 35 35 35 35 35 24 29 17 21 15 28 19 27 27 28 39 35 35 35 35 35 35 35 32 37 37 37 37 44 27 27 15 13 13 20 24 35 35 32 35 35 32 25 21 10 10 9 9 9 10 31 23 32 32 33 33 35 37 33 35 35 36 36 36 36 36 32 35 35 38 38 38 32 35 32 32 32 36 36 35 41 41 35 42 42 35 35 30 30 29 35 37 39 39 42 35 29 26 26 26 26 26 37 35 41 31 33 27 27 27 33 29 35 32 30 30 31 28 21 21 21 37 37 46 46 42 42 39 35 35 35 35 35 35 36 42 42 42 35 35 35 35 37 37 35 32 32 32 32 32 35 37 42 42 35 33 28 28 26 26 29 32 33 33 37 42 42 42 43 40 40 40 40 37 37 37 37 37 37 40 40 40 40 
40 35 32 32 32 32 29 35 35 35 35 35 42 36 35 33 30 36 42 44 44 44 44 44 44 35 38 36 35 35 35 35 35 35 32 37 30 33 26 28 33 42 42 42 47 47 47 47 47 47 42 42 37 37 42 42 34 34 34 34 34 35 35 34 34 34 33 33 33 33 33 29 32 33 33 23 19 15 10 8 17 20 23 18 19 19 23 25 37 40 40 37 37 37 37 42 37 33 42 30 30 30 30 30 42 42 37 42 32 29 29 35 35 35 33 35 35 42 35 38 38 38 38 38 42 42 42 42 42 44 42 44 44 44 44 44 48 44 47 35 33 35 33 33 29 26 26 26 33 31 23 22 22 23 36 33 40 40 40 35 35 29 29 24 34 34 39 27 12 9 9 21 14 29 35 40 52 64 60 50 54 41 41 41 58 46 50 50 66 61 62 62 66 66 63 69 69 67 59 55 56 56 55 50 50 58 58 61 62 66 73 75 67 60 59 55 58 62 62 66 75 80 68 55 59 59 44 44 44 37 42 33 30 29 35 42 44 49 44 58 58 58 59 52 52 43 52 52 61 55 54 46 51 58 71 71 72 74 69 69 69 67 63 77 72 72 66 71 73 80 78 86 83 88 85 77 77 79 80 88 73 73 62 65 64 79 72 72 80 63 54 58 56 51 51 46 52 45 47 54 55 58 52 50 44 47 52 48 51 46 50 50 48 48 56 55 61 64 63 63 58 55 55 58 53 54 54 54 61 61 69 60 60 68 70 58 73 43 40 46 46 46 44 40 41 54 65 68 73 67 67 55 40 40 37 40 37 45 48 49 43 47 58 53 49 44 44 44 54 57 63 59 48 47 40 40 44 47 47 58 59 58 58 66 70 75 73 71 69 73 72 68 74 83 75 80 65 64 70 78 75 80 71 70 52 50 47 55 47 47 47 50 49 48 57 70 70 72 75 75 71 72 77 77 74 71 55 56 56 58 53 53 56 61 56 60 42 40 40 45 45 45 45 45 42 56 53 51 48 48 52 53 57 37 37 40 45 51 56 56 56 56 56 45 40 40 40 37 40 51 51 51 51 51 45 51 51 51 51 45 40 40 40 40 40 40 51 51 51 56 56 51 51 51 51 51 51 56 56 45 40 40 40 40 40 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 51 51 51 56 56 51 51 51 51 40 40 45 45 45 45 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 56 56 56 56 56 56 56 56 56 56 56 56 51 51 51 45 45 45 45 45 45 56 56 56 56 56 45 43 43 43 43 43 43 51 51 51 56 56 56 56 56 56 51 51 51 51 51 51 51 51 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 41 43 41 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 51 51 45 45 45 51 43 43 43 43 43 43 43 56 45 
45 43 43 43 43 43 43 45 45 43 43 43 43 43 43 43 51 45 45 43 43 43 43 43 43 43 43 51 43 43 43 43 43 43 45 45 45 43 43 43 43 43 43 43 43 43 43 43 43 43 45 45 45 45 45 45 45 56 56 51 51 51 51 51 51 56 45 43 43 43 43 43 51 56 45 39 39 39 39 39 40 45 45 45 45 45 45 51 51 51 51 51 39 39 39 39 39 39 45 35 34 34 34 34 34 39 51 51 40 40 40 40 40 40 51 40 40 40 40 40 40 40 40 40 40 40 40 51 51 51 40 40 40 40 40 40 51 56 51 40 40 40 40 40 39 40 40 40 40 39 46 46 40 40 40 45 45 45 51 51 51 51 51 51 51 51 51 51 51 56 45 45 45 45 35 35 39 39 39 39 39 40 46 51 51 51 51 51 51 51 51 51 51 56 56 51 51 46 46 46 46 46 40 40 32 27 21 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 AF LL2347f U 0 AF LL2347r C 473 BS 1 474 LL2347f BS 475 479 LL2347r BS 480 482 LL2347f BS 483 485 LL2347r BS 486 521 LL2347f BS 522 522 LL2347r BS 523 523 LL2347f BS 524 524 LL2347r BS 525 530 LL2347f BS 531 532 LL2347r BS 533 544 LL2347f BS 545 549 LL2347r BS 550 555 LL2347f BS 556 556 LL2347r BS 557 561 LL2347f BS 562 562 LL2347r BS 563 563 LL2347f BS 564 564 LL2347r BS 565 572 LL2347f BS 573 576 LL2347r BS 577 579 LL2347f BS 580 589 LL2347r BS 590 594 LL2347f BS 595 602 LL2347r BS 603 606 LL2347f BS 607 610 LL2347r BS 611 611 LL2347f BS 612 612 LL2347r BS 613 613 LL2347f BS 614 635 LL2347r BS 636 639 LL2347f BS 640 646 LL2347r BS 647 647 LL2347f BS 648 650 LL2347r BS 651 652 LL2347f BS 653 657 LL2347r BS 658 659 LL2347f BS 660 667 LL2347r BS 668 668 LL2347f BS 669 740 LL2347r BS 741 746 LL2347f BS 747 755 LL2347r BS 756 763 LL2347f BS 764 1226 LL2347r RD LL2347f 795 0 0 ncttttggtaacgaagaggctacagatagatgaagcttgccaagaaaaag cttatacaatagataaacgtgtggactgacTCTGAATTTGCTGAGTTCTA CAGAGCGATTGACGCGGCAAGTGCTTTCATACATTaGcTgCTTTGAGGGA AAGAAAATCagaAATCTTGCTGgtgagaAATCTGGTTGTATCTTGTTCTC AGGGCTTTCCAACATCGAGATTGAAGTAACACCCGAGAGGACAAGCAGAG TTTTACAACCACCATTCTGCCCAAACAAAATATCAGTGTCCAATCTATCA CCCACCATGCATATCTGTGACTTTTCGATTCCAAATTTGTCTGCCAAATA GTCCATCATAAAAGTTGAGGGTTTTCCAACTACAAGAggttcACgttGAC 
AGGATCCAACAAGAGCACCAACCATAGAGCCACCACCTGCCCATTCTTGA GCATCGGTAAGGTGGGTGACAGCATCTcGgttTgTAGCAATGAACAGACA GCCCGGGTTTTCGCGGATACAGAGAGTTCCATACTGAATTTTGTAGTAGT TGAAATAGCGGTCAAATCCAACCACCACAGCTCCCACATCATTATCATGC TCCATTAGAAACCCTGGCTTCAGTTCAATCTGTTTTTTACCATCATCCGG ACCTCCAAGATACTGGAAACCAGCAAGGTCGAGCTCCTTCAAGATACCCT CCTCACCAATCACATAGACCTTCTTATCTTTGGGGAAATTAATAGACTGc angtATGCAGCTGCAGcacacgacgaagcacatattccctcctcg QA 81 741 2 795 DS CHROMAT_FILE: LL2347f PHD_FILE: LL2347f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:50 2000 RD LL2347r 799 0 0 atCTCGgtntgTAGCAATGAACAGACAGCCCGGGTTTTCGCGGATACAGA GAGTTCCATACTGAatttgntaGTAGTTGAAATAGCGGTCAAATCCAACC ACCACAGCTCCCACATCATTATCATGCTCCATTAGAAACCCTGGCTTCAG TTCAATCTGTTTTTTACCATCATCCGGACCTCCAAGATACTGGAAACCAG CAAGGTCGAGCTCCTTCAAGATACCCTCCTCACCAATCACATAGACCTTC TTATCTTTGGGGAAATTAATAGACTGCAAGTATGCAGCTGCAGCAAAAGA AGAAGCAAATATTTCCTCCTCGTTAACATTCAGGCCAAGAGTCTCAAACT TTTTGCCATATTGTTTCCTAGATTTTGTTGAGTTGTTTGTCACAAAAACC AACCTCTTTCCCTTGGCACGGAGCATATCAAGAGTCTCTGGAACTCCCTC AATCAATTTATCTCCCTTCCAAATGACTCCGTCACAGTCGAAGATAAAAG TTTCAACGGAATCGATGAGCTGATCGGCGTTCTCGAGCTGCTGTGCCGCC ATAGCTCTCGGTGTCATATTGGGAGTCATCACGCGGAGAGGCTTATGATC GATTCTGCGAATGATGCCACCGGAGAAGGAGGAAGAACGAAATCCTGAGA AAGCCTTGACGGAGAAGAAAGGTTTGGAGTTTGGTAAGAGAGAAACTGAG GAAGAAGAAACAagcgctgttctgttcagcatcttttcttctccggacgc gtggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxagt QA 11 719 1 754 DS CHROMAT_FILE: LL2347r PHD_FILE: LL2347r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:27 2000 CO Contig47 757 2 91 U tgaatttaaATCTCAAGTTGATGTCATGAAGTCACCAATATACATTCTCA GTTTACAATCACAAAGCTTAGTTTATGACTTTTTGGTATACAAAAAAACT ATATAAAAATGTCACTGAAGACCATACACAAGCAGAGCTCCAACAGCTCC AAGACCGGCCAATAAGCCAACAATAACACCGATCGGAGGCAGTCCTCCTC CTATGGCCTTCCCTGTGTTCACATCATACCTCGCAAACCTTTTCTTTGGT GCTATACATTGTGGACATACGTATGTATCCGGCTGTTCATCAAAAGGCTT TGGTAAAGTGTATATGAATCCACAATCAAGACAAATGTGAGTAGCTCTTG CCTTCTGAGTCTCGGTTAACTTTCTTCCAAACCGAGGAGGAGCCGGACGC TTATTTAGCTTCTTCACATCAACATCCGCACCACCTCTGCTGACGACAAA 
GTAGACAGAATCAGGTTTTGCCTGAATAGCTGTTTTAGTGAACCCCAACT TATCAGCAGGCCAAAGCTCATCTCCAAAGAAGCTGCTTGTGTACAGAACT TGATCTCCAGATTTAAGCCCAGCTTTTGCTGCGTTGCCACCTCCTTCCAC ACCAGTGATGACAACTCCACCACCTTGCTTCTGTCCAAGAGTCAGACCCA AAGGCTTGTCCACTTCAACCTCTATGGTCTTAGAAGCTGCTGTTCTCGCT TTGAC*TTCGAATCTTctcgaCAGGTTTAACCCACCGATCTTTCTGCGGA CGCGtgg BQ 0 0 0 0 0 0 0 0 14 20 26 31 31 29 24 29 25 39 39 45 46 44 47 57 59 60 62 58 47 51 52 52 44 52 47 49 53 75 72 71 63 63 54 52 52 55 55 52 60 60 58 55 55 52 52 57 64 52 55 55 55 40 38 38 42 46 55 52 52 54 64 61 64 57 61 67 66 77 66 64 71 70 69 64 65 67 67 71 70 71 64 68 71 76 76 71 71 69 72 67 67 75 75 66 69 69 66 71 73 61 64 68 63 66 75 75 75 72 74 67 65 65 67 67 65 69 66 69 67 62 61 69 71 77 71 64 59 59 57 54 54 57 53 59 64 61 61 58 60 58 64 75 71 68 67 64 57 53 50 47 54 67 67 65 65 60 60 60 60 60 71 66 64 62 57 57 61 60 63 72 72 83 83 75 72 69 64 69 69 68 71 61 72 64 67 64 61 58 56 50 54 60 70 64 66 66 66 62 62 79 86 81 90 84 86 86 86 84 84 84 90 90 90 90 72 74 75 75 71 71 66 64 64 64 70 72 77 80 80 80 90 90 90 84 84 90 88 80 69 69 69 71 71 71 71 74 57 57 57 42 42 42 42 42 44 59 63 71 66 66 67 67 72 79 79 79 84 69 74 79 70 72 70 70 76 76 87 87 90 90 90 90 90 90 90 90 90 85 74 73 78 73 75 75 81 72 75 75 78 78 90 90 90 85 82 82 82 82 89 88 69 65 65 77 77 80 85 83 79 76 79 82 82 85 90 90 90 81 79 79 79 72 63 61 55 60 67 67 83 75 72 66 68 61 53 52 69 69 79 76 50 54 42 42 37 35 35 35 50 50 57 87 87 90 90 90 90 90 90 90 84 84 84 79 79 85 90 82 80 88 75 73 71 74 74 76 73 76 70 77 77 87 87 78 81 83 71 74 73 81 84 90 90 90 90 90 86 88 84 90 90 90 90 90 87 80 75 72 73 70 75 75 70 72 79 69 77 75 77 77 79 72 72 74 67 71 73 75 71 69 63 55 50 50 56 74 60 65 65 65 56 58 58 53 53 71 71 78 85 85 77 83 81 81 77 68 66 71 53 53 53 43 43 43 38 38 38 53 53 57 78 84 86 90 90 88 88 75 75 72 68 81 61 56 54 54 68 70 74 78 75 75 71 73 73 62 63 54 73 63 72 68 65 65 65 83 72 74 64 56 54 56 54 55 62 61 56 56 56 58 61 69 73 74 68 75 78 78 82 82 86 86 86 90 90 75 60 57 60 60 67 69 78 
68 68 60 59 62 60 59 61 62 61 62 61 64 73 69 70 64 51 51 52 54 56 64 75 73 75 90 90 90 90 83 83 84 84 78 79 79 85 66 67 69 70 78 90 87 87 84 85 83 73 71 85 87 79 79 79 78 66 66 59 62 57 63 65 90 70 67 67 72 73 72 83 66 74 56 60 50 50 54 59 55 55 57 59 57 65 77 85 68 65 64 57 44 43 44 48 48 48 55 49 47 46 43 42 48 63 65 63 61 60 61 54 48 44 37 37 35 39 32 44 44 45 45 50 55 61 53 51 47 45 45 45 51 35 35 35 35 39 37 35 35 29 29 24 24 24 24 29 35 40 40 22 19 9 9 11 16 31 30 53 57 49 51 52 45 50 50 69 66 75 63 54 53 54 52 50 43 43 50 41 40 47 44 41 34 40 40 34 34 34 0 0 0 AF LL2349f U -1 AF LL2349r C 0 BS 1 17 LL2349r BS 18 19 LL2349f BS 20 25 LL2349r BS 26 37 LL2349f BS 38 38 LL2349r BS 39 77 LL2349f BS 78 78 LL2349r BS 79 98 LL2349f BS 99 99 LL2349r BS 100 101 LL2349f BS 102 103 LL2349r BS 104 114 LL2349f BS 115 119 LL2349r BS 120 147 LL2349f BS 148 148 LL2349r BS 149 149 LL2349f BS 150 150 LL2349r BS 151 151 LL2349f BS 152 154 LL2349r BS 155 179 LL2349f BS 180 185 LL2349r BS 186 190 LL2349f BS 191 191 LL2349r BS 192 192 LL2349f BS 193 194 LL2349r BS 195 210 LL2349f BS 211 211 LL2349r BS 212 217 LL2349f BS 218 224 LL2349r BS 225 225 LL2349f BS 226 226 LL2349r BS 227 234 LL2349f BS 235 241 LL2349r BS 242 247 LL2349f BS 248 286 LL2349r BS 287 288 LL2349f BS 289 304 LL2349r BS 305 305 LL2349f BS 306 314 LL2349r BS 315 318 LL2349f BS 319 320 LL2349r BS 321 323 LL2349f BS 324 337 LL2349r BS 338 342 LL2349f BS 343 352 LL2349r BS 353 353 LL2349f BS 354 354 LL2349r BS 355 362 LL2349f BS 363 369 LL2349r BS 370 370 LL2349f BS 371 388 LL2349r BS 389 390 LL2349f BS 391 399 LL2349r BS 400 401 LL2349f BS 402 403 LL2349r BS 404 405 LL2349f BS 406 408 LL2349r BS 409 409 LL2349f BS 410 412 LL2349r BS 413 413 LL2349f BS 414 414 LL2349r BS 415 415 LL2349f BS 416 416 LL2349r BS 417 417 LL2349f BS 418 419 LL2349r BS 420 423 LL2349f BS 424 427 LL2349r BS 428 432 LL2349f BS 433 441 LL2349r BS 442 446 LL2349f BS 447 467 LL2349r BS 468 468 LL2349f BS 469 492 LL2349r BS 493 493 LL2349f BS 494 
553 LL2349r BS 554 554 LL2349f BS 555 562 LL2349r BS 563 563 LL2349f BS 564 606 LL2349r BS 607 609 LL2349f BS 610 611 LL2349r BS 612 612 LL2349f BS 613 655 LL2349r BS 656 696 LL2349f BS 697 715 LL2349r BS 716 716 LL2349f BS 717 719 LL2349r BS 720 722 LL2349f BS 723 723 LL2349r BS 724 748 LL2349f BS 749 757 LL2349r RD LL2349f 797 0 0 cttttnagcaacccngcagTTGATGTCATGAAGTCACCAATATACATTCT CAGTTTACAATCACAAAGCTTAGTTTATGACTTTTTGGTATACAAAAAAA CTATATAAAAATGTCACTGAAGACCATACACAAGCAGAGCTCCAACAGCT CCAAGACCGGCCAATAAGCCAACAATAACACCGATCGGAGGCAGTCCTCC TCCTATGGCCTTCCCTGTGTTCACATCATACCTCGCAAACCTTTTCTTTG GTGCTATACATtgcggaCATACGTATGTATCCGGCTGTTCATCAAAAGGC TTTGGTAAAGTGTATATGAATCCACAATCAAGACAAATGTGAGTAGCTCT TGCCTTCTGAGTCTCGgctAACTTTCTTCCAAACCGAGGAGGAGCCGGAC GCTTATTTAGCTTCTTCACATCAACATCCGCACCACCTCTGCTGACGACA AAGTAGACAGAATCAGGTTTTGCCTGAATAGCTgttctaGTGAACCCCAA CTTATCAGCAGGCCAAAGCTCATCTCCAAAGAAGCTGCTTGTGTACAGAA CTTGATCTCCAGATTTAAGCCCAGCTTTTGCTGCGTTGCCACCTCCTTCC ACACCAGTGATGACAACTCCACCACCTTGCTTCTGTCCAAGAGTCAGACC CAAAGGCTTGTCCACTTCAACCTCTATGGTCTTAGAAGCTGCTGTTCTCG ctctgaccttccaatCTTcttgaCAGGTTTAACCCACCGATCTTTCTGCG GACGcgcggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxt QA 20 657 18 756 DS CHROMAT_FILE: LL2349f PHD_FILE: LL2349f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:59 2000 RD LL2349r 803 0 0 ntgaatttaaATCTCAAGTTGATGTCATGAAGTCACCAATATACATTCTC AGTTTACAATCACanaGCTTAGTTTATGACTTTTTGGTATACAAAAAAAC TATATAAAAATGTCACTGAAGACCATACACAAGCAGAGCTCCAACAGCTC CAAGACCGGCCAATAAGCCAACAATAACACCGATCGGAGGCAGTCCTCCT CCTATGGCCTTCCCTGTGTTCACATCATACCTCGCAAACCTTTTCTTTGG TGCTATACATTGTGGACATACGTATGTATCCGGCTGTTCATCAAAAGGCT TTGGTAAAGTGTATATGAATCCACAATCAAGACAAATGTGAGTAGCTCTT GCCTTCTGAGTCTCGGTTAACTTTCTTCCAAACCGAGGAGGAGCCGGACG CTTATTTAGCTTCTTCACATCAACATCCGCACCACCTCTGCTGACGACAA AGTAGACAGAATCAGGTTTTGCCTGAATAGCTGTTTTAGTGAACCCCAAC TTATCAGCAGGCCAAAGCTCATCTCCAAAGAAGCTGCTTGTGTACAGAAC TTGATCTCCAGATTTAAGCCCAGCTTTTGCTGCGTTGCCACCTCCTTCCA CACCAGTGATGACAACTCCACCACCTTGCTTCTGTCCAAGAGTCAGACCC 
AAAGGCTTGTCCACTTCAACCTCTATGGTCTTAGAAGCTGCTGTTCTCGC TTTGAC*TTCGAATCTTctcgaCAGGTTTAACCCACCGATCTTTCTGCGG ACGCGtggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx aag QA 10 755 2 758 DS CHROMAT_FILE: LL2349r PHD_FILE: LL2349r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:36 2000 CO Contig48 549 2 72 U CTTTTTTTTTTTTTTTTTAGAGAATCATTCACGTTCATTCCATAGAAAGA AGTACAAACTGACTCATCCGTCACACATGGCAGAGACAACTAGATATGAC GTCAACACAAACGAAAACCATTATCGGCTAAAAGACAAAAGCCTGCAGCC GTTATTACACCGTAAACAAAACCTAGAACATCATTAGGGTTACTTAATAT AATTAAACTCTGATTATCTCTGCCTCCTTAGAAAGGTTCCATTATTATTC ATTTACAGGTGCAAGGATCACACTTGCAGTCAGATCCACACTTGCATGCA TCGTTCTCAGCGACACCCTCTCCAGAAGCCTCGTACTGGTTCTTCATCGC CGGTGCAACGCCGAAGACAAAAGTCTCGGTCGTGGTCGACTCGCCGGAAA AGCCCAAGTCCGGGTACATTTTGCAACCTCCGCAACCGTTGCCGCACTTG CAGCCAGATCCACAACCACAGTTTCCACCACAGCAAGACATTTTCTCGAG AAAGACCGAAGAAATAGCAGAGAATGTTTATCAAATTGCGGACGCGTGG BQ 29 37 48 48 48 48 48 48 48 48 56 56 56 56 56 46 42 42 36 46 46 48 46 42 42 42 46 46 56 40 52 50 62 54 59 60 66 70 82 85 80 72 71 65 74 75 85 85 88 73 77 74 80 85 79 90 90 90 90 90 90 90 90 90 90 90 84 79 76 77 75 75 81 90 90 90 90 81 90 90 90 86 86 90 90 86 90 90 86 86 86 90 90 90 90 80 83 62 61 53 55 63 63 85 86 90 90 90 90 77 77 77 80 80 80 89 90 90 90 88 88 88 80 80 80 86 75 75 75 75 75 75 90 90 90 90 90 80 82 82 82 83 78 78 75 75 75 75 75 80 72 72 65 77 77 84 86 90 78 82 82 81 81 78 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 90 90 90 88 77 77 77 86 86 86 86 86 89 89 90 90 90 89 89 89 90 87 88 90 88 88 88 90 88 88 83 85 83 88 90 88 84 79 79 81 81 87 86 89 90 90 90 90 90 90 90 85 85 85 88 90 86 86 86 85 85 85 90 90 90 90 90 90 90 90 90 90 84 84 84 84 85 89 89 90 90 90 90 90 90 90 90 90 90 79 79 86 83 86 86 90 90 88 90 90 90 90 90 90 90 90 90 81 81 89 90 90 90 90 90 86 79 81 81 81 79 88 88 81 79 79 74 74 79 86 90 90 85 85 83 85 85 90 90 90 90 90 90 90 90 90 90 90 83 86 86 88 86 81 88 89 90 90 90 90 90 86 78 79 82 86 79 83 84 84 85 90 86 86 86 86 86 86 81 90 80 86 86 87 87 79 78 86 80 73 78 82 89 85 
84 59 59 59 36 36 36 36 36 36 58 58 79 79 81 84 84 84 85 85 90 90 85 86 86 90 86 77 76 76 76 76 77 85 85 85 90 90 90 90 90 90 88 78 78 75 75 75 90 90 90 79 79 85 85 85 74 82 82 81 75 70 72 72 75 75 75 75 79 72 72 72 77 77 72 89 89 90 90 90 90 90 90 90 90 90 90 90 90 84 82 82 81 82 85 80 90 90 90 90 90 80 75 75 75 79 82 81 90 90 90 90 90 83 78 81 75 77 86 86 88 88 88 88 82 80 82 55 60 40 42 30 30 30 42 33 40 50 54 54 66 67 84 90 84 82 82 82 75 75 75 75 75 77 90 90 90 78 71 71 71 71 71 75 75 50 50 49 34 42 42 AF LL2350r C -206 AF LL2350f U 17 BS 1 52 LL2350r BS 53 54 LL2350f BS 55 68 LL2350r BS 69 70 LL2350f BS 71 78 LL2350r BS 79 81 LL2350f BS 82 104 LL2350r BS 105 105 LL2350f BS 106 108 LL2350r BS 109 116 LL2350f BS 117 164 LL2350r BS 165 166 LL2350f BS 167 198 LL2350r BS 199 199 LL2350f BS 200 211 LL2350r BS 212 212 LL2350f BS 213 218 LL2350r BS 219 219 LL2350f BS 220 220 LL2350r BS 221 221 LL2350f BS 222 232 LL2350r BS 233 235 LL2350f BS 236 261 LL2350r BS 262 267 LL2350f BS 268 275 LL2350r BS 276 276 LL2350f BS 277 277 LL2350r BS 278 285 LL2350f BS 286 299 LL2350r BS 300 314 LL2350f BS 315 321 LL2350r BS 322 332 LL2350f BS 333 333 LL2350r BS 334 335 LL2350f BS 336 336 LL2350r BS 337 337 LL2350f BS 338 338 LL2350r BS 339 342 LL2350f BS 343 349 LL2350r BS 350 350 LL2350f BS 351 355 LL2350r BS 356 362 LL2350f BS 363 370 LL2350r BS 371 377 LL2350f BS 378 379 LL2350r BS 380 382 LL2350f BS 383 398 LL2350r BS 399 400 LL2350f BS 401 401 LL2350r BS 402 411 LL2350f BS 412 414 LL2350r BS 415 418 LL2350f BS 419 419 LL2350r BS 420 426 LL2350f BS 427 428 LL2350r BS 429 456 LL2350f BS 457 467 LL2350r BS 468 474 LL2350f BS 475 475 LL2350r BS 476 476 LL2350f BS 477 479 LL2350r BS 480 486 LL2350f BS 487 505 LL2350r BS 506 515 LL2350f BS 516 523 LL2350r BS 524 526 LL2350f BS 527 532 LL2350r BS 533 541 LL2350f BS 542 543 LL2350r BS 544 545 LL2350f BS 546 547 LL2350r BS 548 549 LL2350f RD LL2350r 801 0 2 agggcgatcggtgcgggcctcttcgctattacgccagctggcgaaagggg 
gatgtgctgcaaggcgattaagttgggtaacgccagggttttxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxCTTTTTTTTTTTTTTTTTAGAGAATCATTCACGTTCATTCCAT AGAAAGAAGTACAAACTGACTCATCCGTCACACATGGCAGAGACAACTAG ATATGACGTCAACACAAACGAAAACCATTATCGGCTAAAAGACAAAAGCC TGCAGCCGTTATTACACCGTAAACAAAACCTAGAACATCATTAGGGTTAC TTAATATAATTAAACTCTGATTATCTCTGCCTCCTTAGAAAGGTTCCATT ATTATTCATTTACAGGTGCAAGGATCACACTTGCAGTCAGATCCACACTT GCATGCATCGTTCTCAGCGACACCCTCTCCAGAAGCCTCGTACTGGTTCT TCATCGCCGGTGCAACGCCGAAGACAAAAGTCTCGGTCGTGGTCGACTCG CCGGAAAAGCCCAAGTCCGGGTACATTTTGCAACCTCCGCAACCGTTGCC GCACTTGCAGCCAGATCCACAACCACAGTTTCCACCACAGCAAGACATTT TCTCGAGAAAGACCgaacaaATAGCAGAGAATGTTTATCAAATTGCGGAC GCGTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxa t QA 45 756 208 756 DS CHROMAT_FILE: LL2350r PHD_FILE: LL2350r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:41 2000 RT{ LL2350r chimera phrap 1 92 000919:094547 } RT{ LL2350r matchElsewhereHighQual phrap 1 92 000919:094547 } RD LL2350f 798 0 2 ctttgagaaacttcACGTTCATTCCATAGAAAGAAGTACAAACTGACTCA TCCGTCACACATGGCAGAGACAACTAGATATGACGTCAACACAAACGAAA ACCATTATCGGCTAAAAGACAAAAGCCTGCAGCCGTTATTACACCGTAAA CAAAACCTAGAACATCATTAGGGTTACTTAATATAATTAAACTCTGATTA TCTCTGCCTCCTTAGAAAGGTTCCATTATTATTCATTTACAGGTGCAAGG ATCACACTTGCAGTCAGATCCACACTTGCATGCATCGTTCTCAGCGACAC CCTCTCCAGAAGCCTCGTACTGGTTCTTCATCGCCGGTGCAACGCCGAAG ACAAAAGTCTCGGTCGTggccgACTCGCCGGAAAAGCCCAAGTCCGGGTA CATTTTGCAACCTCCGCAACCGTTGCCGCACTTGCAGCCAGATCCACAAC CACAGTTTCCACCACAGCAAGACATTTTCTCGAGAAAGACCGAAGAAATA GCAGAGAATGTTTATCAAATTGCGGACGCGTGGxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxgcct gcggtgcctaatgagtgagctaactcacattaattgcgttgcgctcactg cccgctttccagtcgggaaacctgtcgtgccagctgcattactgaatn QA 14 751 12 533 DS CHROMAT_FILE: LL2350f PHD_FILE: LL2350f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:04 2000 RT{ LL2350f chimera phrap 697 797 
000919:094547 } RT{ LL2350f matchElsewhereHighQual phrap 703 791 000919:094547 } CO Contig49 516 2 56 U CCCCccttTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTAAGAAAGAAATTTTGGACATTTGTGGACATCGATT CAGTTTGGTTCTTTGGTTGGGATGGGGTTTTGGGTTAAAATGCCCATTCA CCAAGCGGAAGAACTTGCACAGATGCAATACACGTTGATAGACATCGAAT TCAAAAATAAACGCACTTCAAGAACAGCAACAGTCCGATTGAATTACACT CAAGTTTGCTCAGAGAGACAAGTAAACAACCATTCTTTACCCTATGGCGA CGGCTGCTGAGACTCCATCGGAATCCCAAGATCATTCCTCTTGAGAACAG GAAGCGCGACGAAAGATTTGGTCTTACTGATGGGTCTGCTCTTTTCAAGC CTCACAACGTCACCAAGCGGAAGAACTTGCACAGATGCAATACACGTTGA TAGACCGGACGCGTGG BQ 22 22 27 26 12 10 10 19 33 48 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 71 71 71 71 71 90 89 89 77 75 71 71 71 72 75 75 75 70 73 66 66 51 56 51 45 45 60 60 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 80 76 79 90 90 90 71 71 56 56 56 56 56 56 71 71 71 90 82 75 76 76 82 76 76 83 90 90 85 85 85 83 83 83 89 83 83 85 85 82 82 82 82 82 82 90 90 80 78 82 78 78 78 90 85 85 85 85 85 90 84 82 82 82 82 82 90 90 90 90 90 90 90 90 90 90 78 78 78 78 78 78 78 90 90 90 90 90 90 90 90 90 90 90 90 90 88 87 81 81 75 75 75 84 80 80 90 90 90 90 90 90 90 90 90 90 90 84 83 90 90 90 78 86 90 90 90 90 90 90 90 90 89 89 89 83 81 81 83 83 83 89 89 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 83 83 83 82 82 74 74 74 74 75 75 75 82 82 77 79 79 82 80 90 90 84 90 86 80 80 80 90 90 90 90 90 90 90 83 83 83 83 85 90 90 90 90 90 90 81 83 83 83 85 85 90 90 85 85 90 79 77 70 70 70 70 70 78 88 88 88 85 90 90 90 82 83 82 90 82 86 81 80 83 83 83 83 90 90 90 90 90 84 84 82 75 76 76 82 82 90 90 89 90 90 90 85 85 85 85 89 89 89 89 
83 82 82 82 82 82 90 90 90 76 76 76 76 76 76 76 76 74 56 56 41 40 40 40 41 56 56 85 85 81 81 81 89 89 90 90 90 83 83 83 83 83 86 90 90 90 85 81 90 90 90 86 80 80 83 55 55 56 42 56 56 AF LL2351r C -249 AF LL2351f U 112 BS 1 154 LL2351r BS 155 161 LL2351f BS 162 162 LL2351r BS 163 169 LL2351f BS 170 175 LL2351r BS 176 176 LL2351f BS 177 182 LL2351r BS 183 183 LL2351f BS 184 202 LL2351r BS 203 208 LL2351f BS 209 209 LL2351r BS 210 211 LL2351f BS 212 217 LL2351r BS 218 218 LL2351f BS 219 269 LL2351r BS 270 286 LL2351f BS 287 290 LL2351r BS 291 292 LL2351f BS 293 302 LL2351r BS 303 305 LL2351f BS 306 324 LL2351r BS 325 327 LL2351f BS 328 334 LL2351r BS 335 335 LL2351f BS 336 336 LL2351r BS 337 337 LL2351f BS 338 344 LL2351r BS 345 350 LL2351f BS 351 357 LL2351r BS 358 359 LL2351f BS 360 360 LL2351r BS 361 364 LL2351f BS 365 366 LL2351r BS 367 377 LL2351f BS 378 379 LL2351r BS 380 385 LL2351f BS 386 390 LL2351r BS 391 396 LL2351f BS 397 398 LL2351r BS 399 409 LL2351f BS 410 416 LL2351r BS 417 428 LL2351f BS 429 430 LL2351r BS 431 436 LL2351f BS 437 442 LL2351r BS 443 443 LL2351f BS 444 444 LL2351r BS 445 451 LL2351f BS 452 455 LL2351r BS 456 463 LL2351f BS 464 464 LL2351r BS 465 482 LL2351f BS 483 484 LL2351r BS 485 503 LL2351f BS 504 507 LL2351r BS 508 516 LL2351f RD LL2351r 812 0 4 gtacagggggcgtcccatttgccattcagggtgcgcaactgtgggaaggg cgatcggtgcgggccttttcggtattaccccaactggcgaaaaggggatg ttcttcaaggcgattaagttggggaaccccagggtttttxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx CCCCccttTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTTAAGAAAGAAATTTTGGACATTTGTGGACATCGATT CAGTTTGGTTCTTTGGTTGGGATGGGGTTTTGGGTTAAAATGCCCATTCA CCAAGCGGAAGAACTTGCACAGATGCAATACACGTTGATAGACATCGAAT TCAAAAATAAACGCACTTCAAGAACAGCAACAGTCCGATTGAATTACACT CAAGTTTGCTCAGAGAGACAAGTAAACAACCATTCTTTACCCTATGGCGA CGGCTGCTGAGACTCCATCGGAATCCCAAGATCATTCCTCTTGAGAACAG 
GAAGCGCGACGAAAGATTTGGTCTTACTGATGGGTCTGCTCTTTTCAAGC CTCACAACGTCACCAAGCGGAAGAActcgcACAGATGCAATACACGTTGA TAGACCGGACGCGTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxaagt QA 251 766 251 766 DS CHROMAT_FILE: LL2351r PHD_FILE: LL2351r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:46 2000 RT{ LL2351r chimera phrap 1 139 000919:094547 } RT{ LL2351r matchElsewhereLowQual phrap 18 33 000919:094547 } RT{ LL2351r matchElsewhereLowQual phrap 43 59 000919:094547 } RT{ LL2351r matchElsewhereHighQual phrap 106 139 000919:094547 } RD LL2351f 805 0 6 cctttatgaagacatTTTGGACATTTGTGGACATCGatccaGTTTGGTTC TTTGGTTGGGATGGGGTTTTGGGTTAAAATGcggatTCACCAAGCGGAAG AACTTGCACAGATGCAATACACGTTGATAGACATCGAATTCAAAAATAAA CGCACTTCAAGAACAGCAACAGTCCGATTGAATTACACTCAAGTTTGCTC AGAGAGACAAGTAAACAACCATTCTTTACCCTATGGCGACGGCTGCTGAG ACTCCATCGGAATCCCAAGATCATTCCTCTTGAGAACAGGAAGCGCGACG AAAGATTTGGTCTTACTGATGGGTCTGCTCTTTTCAAGCCTCACAACGTC ACCAAGCGGAAGAACTTGCACAGATGCAATACACGTTGATAGACCGGACG CGTGGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxgcctggggtgcctaatgagtgagctaactcac attaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgt gccagctgcattaatgaatcggccaacgcgcggngagaggcggtttgcgt attgcgcgccagggtggtttttcttttcccagtgagacgggcaacagctg attgcccttcaccgnctgcgcctgacagagctgcaccaagcggtccacgc tgcct QA 15 682 9 405 DS CHROMAT_FILE: LL2351f PHD_FILE: LL2351f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:09 2000 RT{ LL2351f chimera phrap 569 802 000919:094547 } RT{ LL2351f matchElsewhereHighQual phrap 787 802 000919:094547 } RT{ LL2351f matchElsewhereHighQual phrap 729 764 000919:094547 } RT{ LL2351f matchElsewhereHighQual phrap 706 727 000919:094547 } RT{ LL2351f matchElsewhereHighQual phrap 685 704 000919:094547 } RT{ LL2351f matchElsewhereHighQual phrap 569 683 000919:094547 } CO Contig50 984 3 112 U ccacgcgtccgattcattgcaacgaaatatttaagacatgtttTTTGAGT TAATTAACAGCTTCTATTTTTTTTGTTCTTGTTCTTTATGGACCCTAAAA 
TTAAAATAGAAGATTGGGGGTGAATCATAAATCCAAAGGAGGTTTCATGG CCAAAGGTAAAGATGTTCGAGTAACAATTATTTTGGAATGTACCAGTTGT GTTCGAAATGATATTAAGAAAGAATCGGCTGGAATTTCCAGATATATTAC TCAAAAGAATCGGCATAACACTCCTAGTCGATTGGAATTGAGAAAATTCT GTCCCTATTGTTATAAACATACAATTCATGGGGAAATTAAGAAATAGATA AAATTGAGTGCTTGTATGTCAAATTTTATTTTAAGAACAGGAATAATGAG AGTATCTACGTATTATTACATATATATAAATATAAACAAATAAAATAATA GAAAGAAATCAAATCCTATATTCTTAATTCTATATAGAAACTCTATCCTA TATAGAAATAGCAATCGTTTTTATTTTGATCCGATCAAAAATAGGATTTT ATAGGTAAGGAATAAAAAATTATGAATAAATCTAAGCGACCTTTTACTAA ATCCAAGCGATCTTTTCGTCGGCGTTTGCCCCCGATCCAATCGGGGGATC GAATTGATTATAGAAACATGAGTTTAATTAGTCGATTTATTAGTGAACAA GGAAAAATATTATCTAGACGGGTGAATAGAGTAACTTTAAAACAACAACG ATTAATTACTATTGCTATAAAACAAGCTCGTATTTTATCTTTGTTACCTT TTCTTAATAATCAGAAACAATTTGAAAGAAGTGAGTCGACCCCTAGAACT ACTAGCCTTAGAACCAGAAAAAAATAGACTTATTCTTCAATTGAATAACT AATCTGAAGGAATTAAAAAAGAGGTTAATATTTTGTTCGAca*aatCCA* ATCA*AGAATCA*AAATTTGATTGTtacGTctgt BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 27 29 32 34 33 34 30 35 34 34 34 34 34 34 34 40 40 40 40 37 40 37 40 40 40 56 56 56 56 40 40 37 37 37 40 51 51 42 42 42 51 37 40 35 35 35 39 51 51 40 40 40 40 40 40 46 35 35 35 35 35 35 35 35 35 35 35 37 40 40 51 56 56 56 56 56 42 42 42 46 46 51 43 43 51 56 51 43 43 43 51 51 51 51 56 56 56 56 56 37 37 37 37 37 37 40 40 40 40 45 51 56 51 51 51 51 51 51 56 56 56 50 40 37 40 40 40 40 42 42 42 40 42 40 40 40 40 40 45 56 56 56 56 56 56 56 51 51 51 45 45 45 42 42 42 42 42 46 45 43 43 43 43 43 74 81 81 82 81 76 67 62 64 60 60 67 61 61 63 66 58 58 56 51 42 42 44 51 58 73 78 65 66 67 71 66 71 66 67 73 70 73 65 60 58 62 62 67 78 78 88 86 86 86 80 74 80 80 80 82 77 75 71 70 79 71 78 80 90 71 71 80 83 77 79 80 80 84 77 81 79 83 87 89 84 85 90 90 90 90 90 90 90 90 86 86 88 89 82 82 81 90 90 90 90 90 77 77 77 81 81 86 88 90 90 89 87 87 90 86 90 89 90 90 90 90 90 90 90 90 90 90 87 86 80 84 86 85 90 90 90 90 90 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 
90 90 90 90 90 90 88 82 82 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 81 76 90 80 90 86 88 88 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 81 81 89 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 88 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 88 88 88 87 85 85 88 82 81 86 90 90 90 90 90 88 86 86 88 88 90 90 90 90 86 84 84 86 86 90 90 90 90 90 90 90 90 90 90 87 88 89 89 89 88 88 88 90 90 90 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 85 90 90 90 88 90 90 90 90 90 90 90 90 90 90 90 90 90 86 86 90 87 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 85 90 90 90 90 90 90 90 90 90 90 90 90 82 73 73 65 55 63 72 75 83 84 86 86 88 84 72 72 77 66 66 72 78 64 70 78 76 69 69 75 75 75 76 81 81 81 75 74 68 78 78 82 86 83 85 83 65 63 55 63 58 61 64 58 66 80 76 82 86 86 82 82 90 85 81 69 62 59 65 74 71 77 81 85 90 64 67 62 60 63 70 75 75 75 82 77 71 69 71 68 68 58 58 64 86 76 77 75 75 83 83 69 69 61 75 75 70 62 69 63 66 63 64 64 74 67 67 54 54 48 54 56 62 62 64 57 56 59 68 56 61 61 55 54 51 52 44 48 48 37 37 42 42 56 56 40 40 40 55 55 55 69 69 69 69 66 69 67 64 64 64 64 64 61 75 72 64 61 58 45 40 48 51 52 52 66 70 75 60 52 57 50 50 48 51 58 64 66 69 76 90 90 90 90 67 64 48 48 48 50 50 58 86 69 72 62 64 58 44 40 40 42 36 13 13 13 16 19 20 22 25 24 29 29 25 25 27 29 42 33 29 25 25 26 25 27 44 41 41 39 44 43 22 20 16 17 17 21 23 18 19 9 9 AF LL2342r U -44 AF LL2317r U 194 AF LL2342f C 204 BS 1 207 LL2342r BS 208 231 LL2342f BS 232 267 LL2342r BS 268 268 LL2317r BS 269 271 LL2342r BS 272 274 LL2317r BS 275 283 LL2342r BS 284 284 
LL2317r BS 285 296 LL2342r BS 297 297 LL2317r BS 298 315 LL2342r BS 316 316 LL2317r BS 317 340 LL2342r BS 341 341 LL2342f BS 342 342 LL2317r BS 343 352 LL2342r BS 353 353 LL2317r BS 354 362 LL2342r BS 363 363 LL2342f BS 364 364 LL2342r BS 365 366 LL2317r BS 367 371 LL2342r BS 372 377 LL2317r BS 378 379 LL2342r BS 380 384 LL2317r BS 385 390 LL2342r BS 391 400 LL2317r BS 401 410 LL2342r BS 411 412 LL2317r BS 413 413 LL2342r BS 414 414 LL2342f BS 415 424 LL2342r BS 425 438 LL2317r BS 439 442 LL2342r BS 443 463 LL2317r BS 464 468 LL2342r BS 469 470 LL2317r BS 471 472 LL2342r BS 473 485 LL2317r BS 486 487 LL2342r BS 488 494 LL2342f BS 495 497 LL2317r BS 498 501 LL2342r BS 502 513 LL2317r BS 514 516 LL2342r BS 517 520 LL2317r BS 521 526 LL2342f BS 527 549 LL2317r BS 550 551 LL2342f BS 552 590 LL2317r BS 591 592 LL2342r BS 593 627 LL2317r BS 628 628 LL2342f BS 629 674 LL2317r BS 675 677 LL2342f BS 678 685 LL2317r BS 686 691 LL2342f BS 692 706 LL2317r BS 707 710 LL2342f BS 711 715 LL2317r BS 716 716 LL2342f BS 717 751 LL2317r BS 752 752 LL2342f BS 753 754 LL2317r BS 755 766 LL2342f BS 767 775 LL2317r BS 776 776 LL2342f BS 777 777 LL2317r BS 778 778 LL2342f BS 779 779 LL2317r BS 780 780 LL2342f BS 781 784 LL2317r BS 785 789 LL2342f BS 790 792 LL2317r BS 793 795 LL2342f BS 796 796 LL2317r BS 797 798 LL2342f BS 799 802 LL2317r BS 803 808 LL2342f BS 809 811 LL2317r BS 812 812 LL2342f BS 813 813 LL2317r BS 814 822 LL2342f BS 823 828 LL2317r BS 829 831 LL2342f BS 832 838 LL2317r BS 839 841 LL2342f BS 842 842 LL2317r BS 843 852 LL2342f BS 853 853 LL2317r BS 854 854 LL2342f BS 855 859 LL2317r BS 860 872 LL2342f BS 873 878 LL2317r BS 879 891 LL2342f BS 892 893 LL2317r BS 894 894 LL2342f BS 895 895 LL2317r BS 896 904 LL2342f BS 905 905 LL2317r BS 906 917 LL2342f BS 918 919 LL2317r BS 920 938 LL2342f BS 939 944 LL2317r BS 945 947 LL2342f BS 948 958 LL2317r BS 959 960 LL2342f BS 961 966 LL2317r BS 967 976 LL2342f BS 977 977 LL2317r BS 978 978 LL2342f BS 979 984 LL2317r RD LL2342r 792 
0 0 actaxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccgattcattgcaacgaaatatttaagacatgtttTTTGAGTTAATT AACAGCTTCTATTTTTTTTGTTCTTGTTCTTTATGGACCCTAAAATTAAA ATAGAAGATTGGGGGTGAATCATAAATCCAAAGGAGGTTTCATGGCCAAA GGTAAAGATGTTCGAGTAACAATTATTTTGGAATGTACCAGTTGTGTTCG AAATGATATTAAGAAAGAATCGGCTGGAATTTCCAGATATATTACTCAAA AGAATCGGCATAACACTCCTAGTCGATTGGAATTGAGAAAATTCTGTCCC TATTGTTATAAACATACAATTCATGGGGAAATTAAGAAATAGATAAAATT GAGTGCTTGTATGTCAAATTTTATTTTAAGAACAGGAATAATGAGAGTAT CTACGTATTATTACATATATATAAATATAAACAAATAAAATAATAGAAAG AAATCAAATCCTATATTCTTAATTCTATATAGAAACTCTATCCTATATAG AAATAGCAATCGTTTTTATTTTGATCCGATCAAAAATAGGATTTTATAGG TAAGGAATAAAAAATTATGAATAAATCTAAGCGACCTTTTACTAAATCCA AGCGATCTTTTCGTCGGCGTTTGCCCCCGATCCAATCGGGGGATCGAATT GATTATAGAAACATGAGTTTAATTAGTCGATTTATTAGTGAACAAGGAAA AATATTATCTAGACGGGTGAATAGAGTAActnt*aaacaaca QA 81 778 46 792 DS CHROMAT_FILE: LL2342r PHD_FILE: LL2342r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:40 2000 RD LL2317r 791 0 0 aatxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccgCTCAAAAGAATCGGCATAACACTCCTAGTCGATTGGAATTGAGA AAATTCTGTCCCTATTGTTATAAACATACAATTCATGGGGAAATTAAGAA ATAGATAAAATTGAGTGCTTGTATGTCAAATTTTATTTTAAGAACAGGAA TAATGAGAGTATCTACGTATTATTACATATATATAAATATAAACAAATAA AATAATAGAAAGAAATCAAATCCTATATTCTTAATTCTATATAGAAACTC TATCCTATATAGAAATAGCAATCGTTTTTATTTTGATCCGATCAAAAATA GGATTTTATAGGTAAGGAATAAAAAATTATGAATAAATCTAAGCGACCTT TTACTAAATCCAAGCGATCTTTTCGTCGGCGTTTGCCCCCGATCCAATCG GGGGATCGAATTGATTATAGAAACATGAGTTTAATTAGTCGATTTATTAG TGAACAAGGAAAAATATTATCTAGACGGGTGAATAGAGTAACTTTAAAAC AACAACGATTAATTACTATTGCTATAAAACAAGCTCGTATTTTATCTTTG TTACCTTTTCTTAATAATCAGAAACAATTTGAAAGAAGTGAGTCGACCCC TAGAACTACTAGCCTTAGAACCAganaAAAATAGACTTATTCTTCAATTG AATAACTAATCTGAAGGAATTAAAAAAGAGGTTAATATTTTGTTCGAca* aatCCA*ATCA*AGAATCA*AAATTTGATtgttacGTctgt QA 57 780 57 791 DS CHROMAT_FILE: LL2317r PHD_FILE: LL2317r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:35 2000 RD LL2342f 782 0 0 cgAAATGATATTAAGAAAGAATCGGCTGGAATTTCCAGATATATTACTCA 
AAAGAATCGGCATAACACTCCTAGTCGATTGGAATTGAGAAAATTCTGTC CCTATTGTTATAAACATACAATTCATGGGGAAATTAAGAAATAGATAAAA TTGAGTGCTTGTATGTCAAATTTTATTTTAAGAACAGGAATAATGAGAGT ATCTACGTATTATTACATATATATAAATATAAACAAATAAAATAATAGAA AGAAATCAAATCCTATATTCTTAATTCTATATAGAAACTCTATCCTATAT AGAAATAGCAATCGTTTTTATTTTGATCCGATCAAAAATAGGATTTTATA GGTAAGGAATAAAAAATTATGAATAAATCTAAGCGACCTTTTACTAAATC CAAGCGATCTTTTCGTCGGCGTTTGCCCCCGATCCAATCGGGGGATCGAA TTGATTATAGAAACATGAGTTTAATTAGTCGATTTATTAGTGAACAAGGA AAAATATTATCTAGACGGGTGAATAGAGTAACTTTAAAACAACAACGATT AATTACTATTGCTATAAAACAAGCTCGTATTTTATCTTTGTTACCTTTTC TTAATAATCAGAAACAATTTGAAAGAAGTGAGTCGACCCCTAGAACTACT AGCCTTAGAACCAGAAAAAAATAGACTTATTCTTCAATTGAATAACTAAT CTGAAGGAATTAAAAAAGAGGTTAATATTTTGTTCGAcacaatCcacatc acagAATcacaaaTTTGATTGTtacgtctgtg QA 29 737 1 781 DS CHROMAT_FILE: LL2342f PHD_FILE: LL2342f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:04 2000 CO Contig51 797 4 178 U cgttTGGGATCCGTGTGGAAG*ACGATTACAACAGGTGTTGTCCTCTGAG GACATAAAATACACA*CCGAGATTCATCAACTCAT*TGCTGGAGTTAGCA TATCTACAATTGGGTGAAATGGGGAGCGATTTGCAGGCATTTGCTCGGCA TGCCGGTAGAGGTGTGGTCAATAAGAGCGACCTCATGCTATACCTGAGAA AGCAACCTGACCTACAGGAAAGAGTTACTCAAGAATAAGAATTTTCGTTT TAAAACCTAAGAGTCACTTTAAAATTTGTATACACTTATTTTTTTTATAA CTTATTTAATAATAAAAATCATAAATCATAAGAAATTCGCTTATTTAGAA GTGTCAACAACGTATCTACCAACGATTTGACCCTTTTCCATCTTTTCGTA AATTTCTGGCAAGGTAGACAAGCCGACAACCTTGATTGGAGACTTGACCA AACCTCTGGCGAAGAAGTCCAAAGCTCCACCGCGGTGGCGGCCGTTACTT ACTTAGAGCTCGACGTCTTACTTACTTAGCGGCCGCACTAGTAGATCTGA ATTCCCGGGGTCGACCCACCCTCTTTTTTTGGGTTTGGTGGGGTATCTTC ATCATCGAATAGATAGTTATATACATCATCCATTGTAGTGGTATTAAACA TCCCTGTAGTGATTCCAAACGCGTTATACGCAGTTTGGTCCGTCCAACCA GGTGACAGTGGTTTTGAATTATTACCATCATCAATTTtACTAGCCGTGAT TTCATTATTCATGAAGTTATCATGAACGTTAGAGGAGGCAatTggtt BQ 0 8 8 12 25 33 37 36 42 42 42 44 44 44 44 44 49 49 49 44 44 44 39 39 43 43 44 44 44 45 44 49 49 49 50 50 61 50 50 50 50 50 56 56 56 45 45 45 45 45 45 45 45 45 56 56 56 56 56 66 61 61 50 50 50 50 50 45 45 45 45 45 50 56 56 56 50 50 50 55 55 55 50 50 47 50 50 50 55 61 50 49 49 49 49 49 50 
56 56 56 56 56 56 50 50 61 61 61 66 66 50 56 56 56 56 56 66 66 66 66 56 49 45 45 45 45 45 45 45 45 45 45 45 50 50 50 50 49 45 45 45 45 45 48 53 53 53 53 49 49 49 49 53 53 61 61 61 66 66 66 66 66 61 61 61 61 61 61 55 55 55 53 53 53 53 53 53 53 53 53 55 53 53 53 53 53 53 53 53 53 53 53 61 61 66 66 55 55 61 53 53 53 55 55 53 53 53 53 53 53 53 53 53 53 66 66 66 66 66 66 61 61 53 53 53 53 53 53 66 66 61 61 61 61 61 61 66 66 66 66 61 61 61 61 61 55 55 55 61 61 55 61 61 61 61 61 61 61 66 66 66 53 53 53 53 53 53 66 66 66 66 66 61 61 61 61 55 61 61 53 53 53 53 53 53 66 66 66 66 66 66 66 66 66 66 53 53 53 53 53 53 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 55 53 53 53 53 53 53 53 61 61 61 61 66 66 66 66 66 66 66 66 66 66 66 61 61 61 61 61 61 66 61 61 61 61 61 61 61 61 61 61 66 66 61 55 55 55 55 55 66 66 66 61 61 61 55 55 55 66 66 66 66 61 61 61 61 66 66 66 66 66 66 66 66 66 66 66 66 61 61 61 55 55 55 55 55 55 66 66 61 61 61 61 61 61 66 66 66 61 61 66 66 66 61 66 61 55 55 55 55 55 61 55 55 55 55 61 61 66 66 66 66 66 61 55 55 55 55 55 66 66 66 66 66 66 66 66 66 61 61 61 61 53 53 55 55 55 55 61 61 61 61 55 55 53 53 53 53 61 61 53 53 53 53 53 55 55 55 55 55 55 66 55 55 55 55 50 50 50 50 55 55 61 56 55 55 55 55 55 61 61 61 61 55 55 50 50 47 47 47 47 45 45 49 50 50 55 66 66 66 66 66 66 66 66 66 61 61 61 61 61 61 66 66 61 61 61 55 55 50 55 52 55 50 50 50 55 56 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 66 56 61 55 55 56 52 52 52 61 61 61 61 61 61 61 61 61 61 66 66 66 66 66 61 50 50 47 50 50 50 66 66 66 66 66 66 56 56 56 43 50 50 50 50 50 50 58 58 42 39 39 35 39 39 39 44 44 58 58 47 50 50 58 58 50 50 50 58 58 46 39 39 39 34 34 39 42 42 42 42 50 39 41 41 42 39 39 42 42 39 40 44 44 44 58 42 44 39 35 42 39 39 35 39 39 42 42 42 42 42 50 50 50 50 50 50 42 47 35 35 35 35 39 39 39 39 30 33 37 37 39 39 44 44 44 44 39 42 50 50 44 39 32 34 27 22 17 24 37 37 36 38 42 40 34 34 34 42 39 35 35 37 37 42 42 42 42 39 42 50 50 39 39 35 37 
29 25 35 31 37 39 50 39 39 39 39 35 28 28 31 34 34 25 27 33 28 25 25 25 18 18 23 16 17 0 0 AF LL2269r U 1 AF LL2335r U 1 AF LL2312r U 1 AF LL2320r U 2 BS 1 1 LL2312r BS 2 4 LL2320r BS 5 5 LL2312r BS 6 12 LL2320r BS 13 15 LL2269r BS 16 25 LL2320r BS 26 26 LL2269r BS 27 27 LL2320r BS 28 29 LL2269r BS 30 31 LL2335r BS 32 35 LL2320r BS 36 37 LL2269r BS 38 38 LL2320r BS 39 45 LL2269r BS 46 60 LL2320r BS 61 61 LL2269r BS 62 75 LL2320r BS 76 78 LL2269r BS 79 81 LL2320r BS 82 90 LL2269r BS 91 93 LL2320r BS 94 99 LL2269r BS 100 100 LL2320r BS 101 101 LL2269r BS 102 102 LL2312r BS 103 108 LL2269r BS 109 124 LL2320r BS 125 125 LL2335r BS 126 146 LL2320r BS 147 147 LL2269r BS 148 151 LL2320r BS 152 155 LL2269r BS 156 157 LL2320r BS 158 160 LL2312r BS 161 196 LL2320r BS 197 199 LL2269r BS 200 205 LL2320r BS 206 210 LL2312r BS 211 223 LL2320r BS 224 225 LL2269r BS 226 231 LL2320r BS 232 233 LL2269r BS 234 239 LL2320r BS 240 241 LL2269r BS 242 243 LL2320r BS 244 249 LL2312r BS 250 261 LL2320r BS 262 264 LL2312r BS 265 280 LL2320r BS 281 282 LL2312r BS 283 288 LL2320r BS 289 289 LL2335r BS 290 293 LL2320r BS 294 298 LL2312r BS 299 304 LL2320r BS 305 308 LL2312r BS 309 309 LL2320r BS 310 315 LL2312r BS 316 330 LL2320r BS 331 335 LL2269r BS 336 358 LL2320r BS 359 360 LL2312r BS 361 364 LL2320r BS 365 365 LL2312r BS 366 384 LL2320r BS 385 386 LL2312r BS 387 395 LL2320r BS 396 396 LL2312r BS 397 411 LL2320r BS 412 412 LL2312r BS 413 422 LL2320r BS 423 423 LL2312r BS 424 429 LL2320r BS 430 431 LL2269r BS 432 434 LL2320r BS 435 437 LL2335r BS 438 440 LL2320r BS 441 446 LL2312r BS 447 450 LL2320r BS 451 455 LL2269r BS 456 457 LL2320r BS 458 458 LL2269r BS 459 472 LL2320r BS 473 476 LL2269r BS 477 478 LL2320r BS 479 480 LL2335r BS 481 485 LL2269r BS 486 490 LL2320r BS 491 492 LL2269r BS 493 494 LL2312r BS 495 499 LL2320r BS 500 503 LL2269r BS 504 516 LL2312r BS 517 517 LL2269r BS 518 518 LL2320r BS 519 529 LL2269r BS 530 538 LL2320r BS 539 545 LL2312r BS 546 557 LL2320r BS 558 558 LL2269r 
BS 559 561 LL2320r BS 562 565 LL2312r BS 566 566 LL2335r BS 567 568 LL2269r BS 569 569 LL2335r BS 570 571 LL2269r BS 572 574 LL2312r BS 575 589 LL2320r BS 590 593 LL2269r BS 594 595 LL2320r BS 596 596 LL2312r BS 597 601 LL2269r BS 602 602 LL2312r BS 603 604 LL2320r BS 605 613 LL2269r BS 614 617 LL2312r BS 618 621 LL2320r BS 622 627 LL2269r BS 628 637 LL2320r BS 638 639 LL2312r BS 640 640 LL2320r BS 641 641 LL2312r BS 642 646 LL2320r BS 647 648 LL2335r BS 649 651 LL2320r BS 652 653 LL2335r BS 654 654 LL2312r BS 655 661 LL2320r BS 662 669 LL2269r BS 670 673 LL2320r BS 674 676 LL2269r BS 677 678 LL2320r BS 679 680 LL2312r BS 681 681 LL2335r BS 682 682 LL2312r BS 683 686 LL2269r BS 687 687 LL2320r BS 688 688 LL2312r BS 689 690 LL2320r BS 691 691 LL2312r BS 692 692 LL2320r BS 693 702 LL2312r BS 703 703 LL2320r BS 704 704 LL2312r BS 705 705 LL2320r BS 706 707 LL2312r BS 708 709 LL2269r BS 710 716 LL2320r BS 717 718 LL2335r BS 719 728 LL2312r BS 729 731 LL2320r BS 732 732 LL2312r BS 733 741 LL2320r BS 742 742 LL2312r BS 743 744 LL2320r BS 745 745 LL2312r BS 746 747 LL2320r BS 748 753 LL2312r BS 754 754 LL2320r BS 755 758 LL2335r BS 759 759 LL2320r BS 760 763 LL2312r BS 764 764 LL2269r BS 765 765 LL2312r BS 766 766 LL2335r BS 767 767 LL2312r BS 768 768 LL2335r BS 769 769 LL2312r BS 770 770 LL2335r BS 771 771 LL2320r BS 772 774 LL2269r BS 775 777 LL2320r BS 778 778 LL2335r BS 779 784 LL2320r BS 785 787 LL2269r BS 788 790 LL2335r BS 791 792 LL2320r BS 793 797 LL2335r RD LL2269r 792 0 0 natatgGGATCCGTGTGGAAGAACGATTACAACAGGTGTTGTCCTCTGAG GACATAAAATACACA*CCGAGATTCATCAACTCAT*TGCTGGAGTTAGCA TATCTACAATTGGGTGAAATGGGGAGCGATTTGCAGGCATTTGCTCGGCA TGCCGGTAGAGGTGTGGTCAATAAGAGCGACCTCATGCTATACCTGAGAA AGCAACCTGACCTACAGGAAAGAGTTACTCAAGAATAAGAATTTTCGTTT TAAAACCTAAGAGTCACTTTAAAATTTGTATACACTTATTTTTTTTATAA CTTATTTAATAATAAAAATCATAAATCATAAGAAATTCGCTTATTTAGAA GTGTCAACAACGTATCTACCAACGATTTGACCCTTTTCCATCTTTTCGTA AATTTCTGGCAAGGTAGACAAGCCGACAACCTTGATTGGAGACTTGACCA 
AACCTCTGGCGAAGAAGTCCAAAGCTCCACCGCGGTGGCGGCCGTTACTT ACTTAGAGCTCGACGTCTTACTTACTTAGCGGCCGCACTAGTAGATCTGA ATTCCCGGGGTCGACCCACCCTCTTTTTTTGGGTTTGGTGGGGTATCTTC ATCATCGAATAGATAGTTATATACATCATCCATTGTAGTGGTATTAAACA TCCCTGTAGTGATTCCAAACGCGTTATACGCAGTTTGGTCCGTCCAACCA GGTGACAGTGGTTTTGAATTATTACCATCATCAaTtntaCTAGCCGTGAT TTCATTATTCATGAAGTTATCATGAACGTTAGAGGAGGCAat QA 6 792 5 792 DS CHROMAT_FILE: LL2269r PHD_FILE: LL2269r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:32 2000 RD LL2335r 797 0 0 catttGGGATCCGTGTGGAAG*ACGATTACAACAGGTGTTGTCCTCTGAG GACATAAAATACACA*CCGAGATTCATCAACTCatgtgCTGGAGTTAGCA TATCTACAATTGGGTGAAATGGGGAGCGATTTGCAGGCATTTGCTCGGCA TGCCGGTAGAGGTGTGGTCAATAAGAGCGACCTCATGCTATACCTGAGAA AGCAACCTGACCTACAGGAAAGAGTTACTCAAGAATAAGAATTTTCGTTT TAAAACCTAAGAGTCACTTTAAAATTTGTATACACTTATTTTTTTTATAA CTTATTTAATAATAAAAATCATAAATCATAAGAAATTCGCTTATTTAGAA GTGTCAACAACGTATCTACCAACGATTTGACCCTTTTCCATCTTTTCGTA AATTTCTGGCAAGGTAGACAAGCCGACAACCTTGATTGGAGACTTGACCA AACCTCTGGCGAAGAAGTCCAAAGCTCCACCGCGGTGGCGGCCGTTACTT ACTTAGAGCTCGACGTCTTACTTACTTAGCGGCCGCACTAGTAGATCTGA ATTCCCGGGGTCGACCCACCCTCTTTTTTTGGGTTTGGTGGGGTATCTTC ATCATCGAATAGATAGTTATATACATCATCCATTGTAGTGGTATTAAACA TCCCTGTAGTGATTCCAAACGCGTTATACGCAGTTTGGTCCGTCCAACCA GGTGACAGTGGTTTTGAATTATTACCATCATCAattctaCTAGCCGTGAT TTCATTATTCATGAAGTTATCATGAACGTTAGAGGAGGCAatTggtt QA 6 795 3 797 DS CHROMAT_FILE: LL2335r PHD_FILE: LL2335r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:45 2000 RD LL2312r 787 0 0 cgttTGGGATCCGTGTGGAAG*ACGATTACAACAGGTGTTGTCCTCTGAG GACATAAAATACAcanccgAGATTCATCAACTCAT*TGCTGGAGTTAGCA TATCTACAATTGGGTGAAATGGGGAGCGATTTGCAGGCATTTGCTCGGCA TGCCGGTAGAGGTGTGGTCAATAAGAGCGACCTCATGCTATACCTGAGAA AGCAACCTGACCTACAGGAAAGAGTTACTCAAGAATAAGAATTTTCGTTT TAAAACCTAAGAGTCACTTTAAAATTTGTATACACTTATTTTTTTTATAA CTTATTTAATAATAAAAATCATAAATCATAAGAAATTCGCTTATTTAGAA GTGTCAACAACGTATCTACCAACGATTTGACCCTTTTCCATCTTTTCGTA AATTTCTGGCAAGGTAGACAAGCCGACAACCTTGATTGGAGACTTGACCA AACCTCTGGCGAAGAAGTCCAAAGCTCCACCGCGGTGGCGGCCGTTACTT 
ACTTAGAGCTCGACGTCTTACTTACTTAGCGGCCGCACTAGTAGATCTGA ATTCCCGGGGTCGACCCACCCTCTTTTTTTGGGTTTGGTGGGGTATCTTC ATCATCGAATAGATAGTTATATACATCATCCATTGTAGTGGTATTAAACA TCCCTGTAGTGATTCCAAACGCGTTATACGCAGTTTGGTCCGTCCAACCA GGTGACAGTGGTTTTGAATTATTACCATCATCAaTtntaCTAGCCGTGAT TTCATTATTCATGAAGTTATCATGAACGTTAGAGGag QA 6 785 1 787 DS CHROMAT_FILE: LL2312r PHD_FILE: LL2312r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:11 2000 RD LL2320r 791 0 0 gtttGGGATCCGTGTGGAAG*ACGATTACAACAGGTGTTGTCCTCTGAGG ACATAAAATACACA*CCGAGATTCATCAACTCAT*TGCTGGAGTTAGCAT ATCTACAATTGGGTGAAATGGGGAGCGATTTGCAGGCATTTGCTCGGCAT GCCGGTAGAGGTGTGGTCAATAAGAGCGACCTCATGCTATACCTGAGAAA GCAACCTGACCTACAGGAAAGAGTTACTCAAGAATAAGAATTTTCGTTTT AAAACCTAAGAGTCACTTTAAAATTTGTATACACTTATTTTTTTTATAAC TTATTTAATAATAAAAATCATAAATCATAAGAAATTCGCTTATTTAGAAG TGTCAACAACGTATCTACCAACGATTTGACCCTTTTCCATCTTTTCGTAA ATTTCTGGCAAGGTAGACAAGCCGACAACCTTGATTGGAGACTTGACCAA ACCTCTGGCGAAGAAGTCCAAAGCTCCACCGCGGTGGCGGCCGTTACTTA CTTAGAGCTCGACGTCTTACTTACTTAGCGGCCGCACTAGTAGATCTGAA TTCCCGGGGTCGACCCACCCTCTTTTTTTGGGTTTGGTGGGGTATCTTCA TCATCGAATAGATAGTTATATACATCATCCATTGTAGTGGTATTAAACAT CCCTGTAGTGATTCCAAACGCGTTATACGCAGTTTGGTCCGTCCAACCAG GTGACAGTGGTTTTGAATTATTACCATCATCAATTTtACTAGCCGTGATT TCATTATTCATGAAGTTATCATGAACGTTAGAGGAGGCAat QA 4 791 1 791 DS CHROMAT_FILE: LL2320r PHD_FILE: LL2320r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:12 2000 CO Contig52 657 5 44 U tgaggaacggtagcttggaacagtacagcggggttaatgaagggtttcca cccTTCcTGtggtgcaggagccACATGAgaaggatgttgGggaAaGAAGA TaatgcAaATGAGAATGAaacAGAGAGAAAGAGATGAGAGTTTGGATGAC TAGTTCTAGATCGCGAGCGG*CCGCCCTTTTTTTTTTTTTTTTTTTTTAA CTTTGAAAAGCTTCATTTTGAAGGAAATAATGAACAAACTCACCATATTG GTCAGTTATTGACCTTATTACACGCAATTAGCAACATAAAAAGTTTTGGA TCTTATTGACGATCTTCTCTTCTTTGTTTTATGCCCTTTGCTCCACGTAT GACCCCGAGAGTCCTGAGGTTCCAAGAACGGTAACCTTTCCCCACTCTGA ACCGAGCCCGCTGCATCTCATCTTCAAGTGGATCATCTTTGTCCCCAACT CCATCTCAACCTTCTTCTTCAAGTAAACTTCTTCA*GGCCCAAACGCATC TAGGTAGGTAGTATACCTATGATAAGCAAGATGCAGCGCTTTGGTTATAT CTTGTGGTTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAggggcg ggccggt BQ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 18 18 20 20 22 18 23 20 18 11 14 15 19 16 17 10 10 10 17 12 16 24 21 27 21 27 21 18 14 14 8 10 16 16 10 13 18 18 22 15 15 15 22 19 29 25 25 25 25 25 18 19 17 17 12 20 15 29 29 40 40 40 32 29 25 21 21 10 12 12 31 31 48 40 40 40 27 25 25 37 40 40 40 40 40 34 32 32 32 32 25 25 25 29 29 40 40 40 40 29 43 50 56 71 71 65 81 81 83 80 80 80 76 74 73 80 75 76 68 69 70 70 77 85 85 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 86 77 76 75 75 75 81 78 74 77 77 90 85 81 81 75 75 87 81 84 84 77 81 70 70 70 71 72 90 88 78 81 76 76 76 71 72 75 81 75 86 81 84 77 77 70 70 75 74 88 87 90 90 84 90 76 73 75 80 76 76 76 79 72 72 73 61 61 73 73 82 79 80 73 73 61 56 56 61 61 70 73 78 76 76 79 79 75 78 78 90 90 90 90 90 90 90 88 88 85 88 90 90 90 90 90 80 78 80 83 82 82 72 75 75 75 78 81 80 90 90 90 90 90 83 89 89 90 90 90 90 90 75 75 75 84 84 81 82 76 80 90 76 75 75 75 81 84 78 78 73 73 75 76 78 80 76 78 70 71 72 70 61 61 59 67 67 65 65 67 74 74 86 86 86 82 88 90 86 77 76 70 75 76 79 74 74 73 75 74 80 80 89 89 90 90 84 80 80 65 51 51 57 56 55 66 60 66 58 55 55 60 58 60 60 51 51 53 59 59 50 57 57 70 70 64 61 61 67 58 64 64 62 62 66 77 82 75 57 56 56 41 41 41 41 50 65 65 71 79 63 51 58 58 53 55 61 66 57 57 57 57 57 61 61 65 65 59 59 62 63 78 90 81 78 57 58 42 43 43 51 51 46 46 56 59 59 50 50 56 43 42 43 43 51 45 43 43 40 55 55 56 55 57 57 66 66 59 59 60 69 73 59 59 55 60 68 72 62 61 60 55 59 56 73 73 73 67 66 68 71 75 66 71 60 63 56 61 54 54 53 54 52 53 55 58 65 52 52 62 56 52 37 42 43 36 36 21 21 21 37 29 35 35 28 29 32 48 48 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 44 44 56 47 47 56 44 56 44 44 44 44 47 47 47 56 56 56 56 56 56 56 56 56 56 47 47 47 44 48 56 42 56 56 56 56 56 56 56 56 56 56 56 56 56 56 56 47 56 47 48 48 48 44 40 40 28 26 19 18 15 0 0 0 0 0 0 0 0 0 0 AF LL2297f C -232 AF LL2296r C 1 
AF LL2332r U 93 AF LL2297r U 93 AF LL2259r U 95 BS 1 150 LL2296r BS 151 153 LL2332r BS 154 160 LL2297r BS 161 163 LL2297f BS 164 164 LL2332r BS 165 175 LL2297r BS 176 182 LL2297f BS 183 195 LL2297r BS 196 203 LL2297f BS 204 208 LL2297r BS 209 210 LL2297f BS 211 212 LL2297r BS 213 216 LL2297f BS 217 217 LL2297r BS 218 218 LL2297f BS 219 221 LL2297r BS 222 222 LL2297f BS 223 225 LL2297r BS 226 231 LL2297f BS 232 236 LL2297r BS 237 240 LL2297f BS 241 245 LL2297r BS 246 248 LL2297f BS 249 250 LL2297r BS 251 253 LL2297f BS 254 254 LL2297r BS 255 260 LL2297f BS 261 265 LL2297r BS 266 268 LL2297f BS 269 276 LL2297r BS 277 277 LL2297f BS 278 285 LL2297r BS 286 287 LL2297f BS 288 309 LL2297r BS 310 313 LL2297f BS 314 332 LL2297r BS 333 333 LL2297f BS 334 341 LL2297r BS 342 342 LL2297f BS 343 391 LL2297r BS 392 394 LL2297f BS 395 467 LL2297r BS 468 468 LL2297f BS 469 657 LL2297r RD LL2297f 797 0 2 tcagggccaggcggtgaagggcaatcagctgttgcccgtctcactggtga aaagaaaaaccaccctggcgcccaatacgcaaaccgcctctccccgcgcg ttggccgattcattaatgcagctggcacgacaggtttcccgactggaaag cgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggca ccccaggcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxCCACGCGTCCGCTAGTTTTAGATCGCGAG CGG*CCGCCCTTTTTTTTTTTTTTTTTTTTTAACTTTGAAAAGCTTCATT TTGAAGGAAATAATGAACAAACTCACCATATTGGTCAGTTATTGACCTTA TTACACGCAATTAGCAACATAAAAAGTTTTGGATCTTATTGACGATCTTC TCTTCTTTGTTTTATGCCCTTTGCTCCACGTATGACCCCGAGAGTCCTGA GGTTCCAAGAACGGTAACCTTTCCCCACTCTGAACCGAGCCCGCTGCATC TCATCTTCAAGTGGATCATctatGTCCCCAACTCCATCTCAACCTTCTTC TTCAAGtgagcTTCTtcgcGgcccataCGCATCTAGGTAGGTAGTATACC TATGATAAGCAAGATGCAGCGCTTTGGTTATATctggtggtgtaaan QA 15 781 383 791 DS CHROMAT_FILE: LL2297f PHD_FILE: LL2297f.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:44:37 2000 RT{ LL2297f chimera phrap 1 208 000919:094547 } RT{ LL2297f matchElsewhereHighQual phrap 1 208 000919:094547 } RD LL2296r 799 0 0 
tgaggaacggtagcttggaacagtacagcggggttaatgaagggtttcca cccTTCcTGtggtgcaggagccACATGAgaaggatgttgGggaAaGAAGA TaatgcAaATGAGAATGAaacAGAGAGAAAGAGATGAGAGTTTGGATGAC TCGtTCTAGATCGCGAGcgggcCGCCCTTTTTTTTTTTTTTTtAaaCAAA ACCCAAGGcaaCTAGGAAAATGCAATCAAATAGTACATCGTATCTCATTG GTTTATTATTCCATCTCGGGAATTTTATTACAAAAACAAAAGAAAAACAA GATTACCTTAACTTAAAATCATCGTCAAGGCAATAAAAACCGAACAATAT TACAGAATCGAAGAAACAACAACATAAACACACAAAACATATTTGCTTAT AAATATTATTGATTACGAAGAAATTATACGACATGTCTCCCGCAGAGGCT TTTCTCACCTTTAGGCGTCGGACGTGGAAGGAGCT*GTTGCATCGTCAGT CGTGGGAGGAGCTGTTGCATCGTCGGACGCAGGAGGAGCTGTGTCAGCGT CGGCTGTGGCATCCTCGGAGGCTGATTTATCCTTTGATGAGTTAAGTTTG TTGAACAATGGCTTAAACACGAACTCTACCGCCAACCATGCGACAGCTAA AGCTCCGACAACAACTGTCGTTGTTTGTTTCTTGCCTGAAGATTTCTCCA GTTTTGGAAAATGAATTTCtggtactttatccattgcttcttccggacgc gtggxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaat QA 52 719 1 193 DS CHROMAT_FILE: LL2296r PHD_FILE: LL2296r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:11 2000 RD LL2332r 790 0 0 atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxCCACGC GTCCGGACTAGTTCTAGATCGCGAGCGG*CCgcgcTTTTTTTTTTTTTTT CGAATCTTAGCGACAAAGGGCTGAATCTCAGTGGATCGTGGCAGCAAGGC CACTCTGCCACTTACAATACCCCGTCGCGTATTTAAGTCGTCTGCAAAGG ATTCTACCCGCCACTCGGTGGTAATTATAATTCAAGGCGGTCCGAACGGC GCTTCCACCGAACGGACTTAGCCAACGACACGTGCCTTTGGGAGCCGAAG CTCCTACTGAGGGTCGGCAATCGGGCGGCGGGCGCATGCGTCGCTTCTAG CCCGGATTCTGACTTAGAGGCGTTCAGTCATAATCCAGCGCAC*GGTAGC TTCGCGCCACTGGCTTTTCAACCAAGCGCGATGACCAATTGTGCGAATCA ACGGTTCCTCTCGTACTAGGTTGAATTACTATTGCGACGCGGGCATCAGT AGGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGCTCACGTTCCC TATTGgtGGGTGAACAATCCAACACTTGgTGAATTCTGCTTCACAATGAT AGGAAGAGCCGACATCGAAGGATCAAAAAGCAACGTCGCtaTGAACGCtt GGCTGCCACaaGCCAgttatcCCTgtggtAACTTTTCTGACAcCtCTAGC TTCAAATTCCGaaggTCTaAAGGATCGATAggccacgctttCAcggttCG TattcgtacTGaaaatcagaatcaaacGAGCtttTACcct QA 45 790 56 100 DS CHROMAT_FILE: LL2332r PHD_FILE: LL2332r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:31 2000 RD LL2297r 800 0 4 aatxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxcCAC 
GCGTCCGCTAGTTCTAGATCGCGAGCGG*CCGCCCTTTTTTTTTTTTTTT TTTTTTAACTTTGAAAAGCTTCATTTTGAAGGAAATAATGAACAAACTCA CCATATTGGTCAGTTATTGACCTTATTACACGCAATTAGCAACATAAAAA GTTTTGGATCTTATTGACGATCTTCTCTTCTTTGTTTTATGCCCTTTGCT CCACGTATGACCCCGAGAGTCCTGAGGTTCCAAGAACGGTAACCTTTCCC CACTCTGAACCGAGCCCGCTGCATCTCATCTTCAAGTGGATCATCTTTGT CCCCAACTCCATCTCAACCTTCTTCTTCAAGTAAACTTCTTCA*GGCCCA AACGCATCTAGGTAGGTAGTATACCTATGATAAGCAAGATGCAGCGCTTT GGTTATATCTTGTGGTTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAggggcgggccggtxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxaaaaaccctggcgttacccaacttaatcgc cttgaagaacatccccctttcgccagttggcgtaatagagaagaggcccg acccgatcgcccttcccaacagttggcgcaactgaatgggaaatggcccc QA 47 555 58 565 DS CHROMAT_FILE: LL2297r PHD_FILE: LL2297r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:15 2000 RT{ LL2297r chimera phrap 671 796 000919:094547 } RT{ LL2297r matchElsewhereHighQual phrap 753 775 000919:094547 } RT{ LL2297r matchElsewhereHighQual phrap 709 738 000919:094547 } RT{ LL2297r matchElsewhereHighQual phrap 672 704 000919:094547 } RD LL2259r 799 0 0 natcttxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxagctcgacccatg cGTCCGTAGTTCTAGATCGCGAGCGG*CCGCCcTTTTTTTTTTTTTtttg atcCAACAAATCAACTTGATCAGATCATTATAACAGTACAACCCTTAAAC AAGTGAGTTGCTACAACTGTTGGTAATAAGAATTACAGTTTCTTGTCTAA GAAAATGGCAATGTAAGGGAAGTAAGGACACTCGTATTCCCATCCGGTTC CAGAGTAACGAACGCAATGCAACCAAAGAGCCTCATCATGCTCCTCGTct TGTgggAACCagtgACAGGGAGAACTCATCAAATAAgagcgcatctAcaa TATACAGGGcatcCCATagCGAATGACCCgctTtatctaAC*TCAACACG gggatGATCTTgagaccaagattgtcagaagaatcgatgcacgtgagaga aagctcgccttcccagacgactatctgcattccagcgaagaccngagcat cgatccaatgcccacaacactgccctaagtctgctccccccagcggtctg acggaccatgacgaggccctatcggccgtattgtgtttcgtccgccagca ccccggatggcgcctccgcggcgcccccacctcccccgacaccgccatcc gccgcagccacggcaaccggcatcctcccaacagccgcgggcgtgagcga cgcaccccggagatcgggctccccncgccacccagatccgaccaagtttg gccccctcgtctgtcacgcattccatttgtgtctcgcgttctcttttct QA 50 412 57 99 DS 
CHROMAT_FILE: LL2259r PHD_FILE: LL2259r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:22 2000 CO Contig53 615 7 124 U AAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTC GCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACA GTTGCGCAGCCTGAATGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAA GCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGC GCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTT CGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCC GATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGAT GGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGAC GTTGGAGTCCACGTTCTTAATAGTGGACTCTTGTTCCAAACTGGAACAAC ACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGA TTTCGGCCTATTGgttaaAAAATGAGCTGATTTAACAAAAATTTAACGCG AATTTTAACaaAaTATTAACGCTTACAATTTaggtGGCACTTTTCgggga AATGTGcGCGgaacc BQ 56 56 51 56 51 61 61 55 53 53 53 53 45 45 55 55 55 55 55 55 53 53 53 53 56 55 55 51 51 51 55 55 60 60 53 56 61 61 61 61 66 66 66 66 66 66 66 61 55 53 55 53 53 50 53 55 55 55 53 53 53 53 53 61 61 53 53 53 53 53 53 56 61 61 61 55 46 49 46 46 46 53 53 53 55 53 53 53 53 53 53 58 55 55 61 66 66 66 61 61 61 55 53 53 53 53 53 53 53 53 53 53 66 66 66 61 53 53 53 53 53 53 66 66 61 53 53 53 53 53 53 53 53 53 66 61 61 53 53 53 53 53 53 53 53 53 55 61 61 53 53 53 53 53 53 53 53 53 53 56 66 66 66 66 66 66 66 61 61 53 53 46 46 46 46 46 47 53 53 53 53 53 55 55 53 53 53 53 53 53 53 53 53 53 53 55 55 55 55 55 55 55 55 55 61 55 53 53 53 53 53 53 53 53 53 53 55 66 66 61 53 53 53 53 53 61 66 66 66 66 66 66 66 66 66 66 66 66 66 66 61 55 55 55 55 55 55 55 55 53 53 53 53 53 53 55 55 53 53 53 61 61 61 66 66 66 66 66 55 55 66 61 61 61 66 66 61 66 66 66 66 61 55 55 55 55 55 55 61 61 55 55 55 55 55 55 55 55 55 55 55 61 55 61 61 61 61 50 50 55 55 50 50 55 55 55 55 55 55 55 55 55 50 50 50 50 50 50 61 66 66 66 61 55 55 55 55 55 61 55 55 55 55 55 55 61 61 66 66 66 61 61 61 61 55 55 55 55 55 61 66 66 61 55 55 55 50 50 47 49 47 47 47 50 66 66 66 66 61 55 50 50 50 50 52 54 58 61 50 50 47 47 47 50 66 66 56 50 50 50 43 43 50 54 58 50 50 44 36 34 35 37 37 
42 42 42 42 42 40 38 37 42 42 42 44 44 44 47 46 46 44 58 50 39 35 32 32 42 42 42 42 42 42 39 42 42 44 39 42 44 50 43 42 42 35 35 42 42 43 58 58 58 58 58 58 66 66 50 50 46 46 47 47 61 56 66 66 56 50 50 44 39 39 39 50 37 39 35 35 38 42 42 44 39 38 29 31 31 36 39 39 35 42 37 32 29 31 35 37 39 50 39 32 15 15 15 15 15 37 58 58 50 58 50 44 43 42 42 40 42 39 39 35 35 35 35 39 39 42 42 58 35 35 35 32 32 35 39 39 39 35 35 44 44 39 39 39 28 26 15 15 25 15 33 37 37 33 28 28 33 42 39 34 32 35 36 27 28 37 25 24 18 15 15 17 25 30 34 39 39 35 36 32 29 20 10 10 10 17 10 25 24 25 25 24 25 17 25 25 24 18 18 18 18 18 AF LL2257r U -747 AF LL2348r U -680 AF LL2306r U -518 AF LL2303r U -433 AF LL2318r U -175 AF LL2265r U -174 AF LL2316r U -174 BS 1 1 LL2316r BS 2 2 LL2306r BS 3 3 LL2316r BS 4 4 LL2303r BS 5 8 LL2306r BS 9 12 LL2265r BS 13 14 LL2316r BS 15 20 LL2306r BS 21 24 LL2316r BS 25 26 LL2303r BS 27 27 LL2306r BS 28 30 LL2303r BS 31 32 LL2306r BS 33 34 LL2303r BS 35 35 LL2316r BS 36 36 LL2306r BS 37 40 LL2316r BS 41 41 LL2303r BS 42 43 LL2316r BS 44 44 LL2265r BS 45 46 LL2316r BS 47 47 LL2265r BS 48 48 LL2316r BS 49 49 LL2303r BS 50 50 LL2316r BS 51 51 LL2303r BS 52 53 LL2316r BS 54 54 LL2303r BS 55 55 LL2316r BS 56 58 LL2306r BS 59 63 LL2316r BS 64 65 LL2306r BS 66 71 LL2316r BS 72 72 LL2306r BS 73 75 LL2265r BS 76 76 LL2303r BS 77 77 LL2316r BS 78 78 LL2306r BS 79 81 LL2316r BS 82 82 LL2265r BS 83 91 LL2316r BS 92 92 LL2306r BS 93 95 LL2316r BS 96 98 LL2303r BS 99 101 LL2265r BS 102 112 LL2316r BS 113 113 LL2303r BS 114 122 LL2316r BS 123 123 LL2265r BS 124 124 LL2316r BS 125 125 LL2306r BS 126 133 LL2265r BS 134 134 LL2316r BS 135 137 LL2265r BS 138 167 LL2316r BS 168 169 LL2265r BS 170 176 LL2316r BS 177 177 LL2303r BS 178 197 LL2316r BS 198 203 LL2265r BS 204 239 LL2316r BS 240 249 LL2265r BS 250 264 LL2316r BS 265 268 LL2265r BS 269 277 LL2316r BS 278 278 LL2265r BS 279 308 LL2316r BS 309 311 LL2265r BS 312 351 LL2316r BS 352 360 LL2265r BS 361 363 LL2316r BS 364 369 LL2265r 
BS 370 370 LL2316r BS 371 373 LL2265r BS 374 385 LL2316r BS 386 394 LL2265r BS 395 404 LL2316r BS 405 405 LL2265r BS 406 406 LL2316r BS 407 409 LL2265r BS 410 418 LL2316r BS 419 420 LL2318r BS 421 443 LL2316r BS 444 444 LL2318r BS 445 467 LL2316r BS 468 469 LL2265r BS 470 484 LL2316r BS 485 485 LL2265r BS 486 493 LL2316r BS 494 495 LL2265r BS 496 502 LL2316r BS 503 506 LL2265r BS 507 508 LL2316r BS 509 509 LL2265r BS 510 520 LL2316r BS 521 522 LL2265r BS 523 524 LL2316r BS 525 526 LL2265r BS 527 528 LL2316r BS 529 532 LL2265r BS 533 557 LL2316r BS 558 558 LL2265r BS 559 559 LL2316r BS 560 564 LL2265r BS 565 571 LL2316r BS 572 573 LL2265r BS 574 575 LL2316r BS 576 576 LL2265r BS 577 579 LL2316r BS 580 580 LL2265r BS 581 582 LL2316r BS 583 584 LL2318r BS 585 585 LL2316r BS 586 592 LL2265r BS 593 595 LL2316r BS 596 596 LL2265r BS 597 600 LL2316r BS 601 602 LL2318r BS 603 603 LL2265r BS 604 604 LL2318r BS 605 606 LL2265r BS 607 607 LL2316r BS 608 610 LL2318r BS 611 615 LL2316r RD LL2257r 788 0 1 aatxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccggtggtgcctacatcgttaggcaagCGGCTAAGAGTATTGTTGCC AGTGGGCTAGCGAGGCGTTGCATTGTGCAGGTCTCGTACGCTATTGGTGT CCCTGAGCCGTTGTCTGTGTTTGTGGACAGCTACGGAACTGGGAAGATAC CAGACAAGGAGATTCTTGAGATTGTGAAGGAGAGTTTTGACTTCAGGCCG GGGATGATTTCGATTAACTTGGATTTGAAGAGGGGTGGTAATGGTAGGTT CTTGAAGACTGCTGCTTATGGTCATTTCGGAAGGGACGATGCTGACTTCA CCTGGGAGGTTGTGAAGCCACTCAAGTCTAACAAGGTCCAAGCTTGAAGA AAACTTGAACTAGTTTAGTTCTCTCAGGTCTTGCCTCTGTTTCAACTAAA GAGATGGGTCTATTCTTTTTCCTTTTTCAATATAAGCAGTTGAGAATTTA TATTATTATTATTTATTATCATATGTTATTGTTGTGGTAAGAAAGTTAAA AGTCATATTTTGTATTTCTTGTTTCTTTTCAGAAGAGAATTTATATAAAA GATTATATTCAATATTAAAAAAAAAAAAAAAAGxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxaa AACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCAca QA 81 775 749 788 DS CHROMAT_FILE: LL2257r PHD_FILE: LL2257r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:13 2000 RT{ LL2257r matchElsewhereHighQual phrap 749 788 000919:094547 
} RD LL2348r 793 0 1 aatxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccgcaactgtcttggtgactcagaacaAAAATCTGAACAGACCTTCT GAGTATCTTCTTGATGGGAAAGTAACAGACAAGAATGATGTCTACTCTTT TGGGGTGATTCTCCTAGAACTTCTCCTGGGGAAGAGATCAGTGGAGAAAC CATCAACTGAACCAGAATCCGTTGTCACTTGGGCTGTACCAAAGCTGAGT GACAGAGCTAATCTGCCAAACATATTGGATCCTGCAATCAAAGGAACCAT GGATTTGAAGCATCTTTATCAGGTAGCAGCGGTTGCGGTGTTGTGTGTGC AGCCAGAGCCAAGTTACAGACCACTTATAACCGATGTTTTGCACTCACTC ATCCCTCTTCTACCACTAGAACTCGGCGGATCATTGCGAATTTTATAGAA ACACATTTTCATTTCCTTTATTGTAATCTTTTCATTTTGAATTGAATGTT GTTGGTGATCTTGTGCAAAATTCCACTTTTTATCTGTTTTTTTTTGGTAA AAAAAAAAAAAAAAAGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxAAAACCCTGGCGTTACCCA ACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCG AAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCct QA 81 793 682 793 DS CHROMAT_FILE: LL2348r PHD_FILE: LL2348r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:31 2000 RT{ LL2348r matchElsewhereHighQual phrap 682 793 000919:094547 } RD LL2306r 814 0 2 agggtaggttttggacngcctttcgtgactgccgcaccgtttccgggttc cacggtcgacccacgcgtccgctcactctcACTACCTTCTCTTCACCCgc aaATGGCGACCGCTCCTCTCTCCGGCTTCTTCCTCACCTCTCTTTCTCCT TCTCAGCCTTCTCTCCAAAAACAGACTCTTCGTTCTTCTCCCACCGTGGC TTGCCTTCCCTCATCCTCCTCTTCCTCCTCCTCCTCCTCCTCCTCTCGTT CCGTTCCAACACTTATCCGTAACGAGCCCGTTTTTGCCGCTCCTGCTCCT ATCATCACCCCTTACTGGAGCGAAGAGATGGGTAGCGAAGCATACGAAGA GGCCATTGAAGCTCTCAAGAAGCTTATTATCGAGAAGGAGGAGCTAAAGA CTGTTxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxAAAACCCTGGCGTTACCCAACTTAATCGCCT TGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCA CCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGGACGCG CCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGT GACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGctntCTTCC CTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCgg nggCTCCCTTTagg QA 81 742 520 814 DS CHROMAT_FILE: LL2306r PHD_FILE: LL2306r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:20 2000 RT{ 
LL2306r matchElsewhereHighQual phrap 745 800 000919:094547 } RT{ LL2306r matchElsewhereHighQual phrap 520 743 000919:094547 } RD LL2303r 793 0 1 atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxccacg cgtccgtagatcgcgaccggccgcccttttttttttttttaatatttgct cttttctctcatcttcttttagttatctcttcgtttacctcttcctcctt tatttaatgtaactttgctttcaatatcttatattaatgttctcagtgac ataaaaaatatattgtatgcatatgattatgcatgtacatgattgaaaaa acttaatggcttaagtgtttaaattgtaataactaaaaatatattttaat caaaaaaaaaaaaaaaaagxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxAAAACCCTGGCGTTAC CCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATA GCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAAT GGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGT GGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTC CTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGT CAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACG GCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACgt QA 81 777 435 793 DS CHROMAT_FILE: LL2303r PHD_FILE: LL2303r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:43 2000 RT{ LL2303r matchElsewhereHighQual phrap 435 730 000919:094547 } RD LL2318r 788 0 4 aatxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxAAAACCctgtcgTTACCCAACTTA ATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAG GCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATG GGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGtggctACGC GCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCT TTCTTCCCTTCCTTTCTCGCCAcggtCGCCGGCTTTCCCCgccAAGCTCT AAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGggaCCTCG ACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGtggtccATCGCCC TGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTAATAGT GGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTcgtgctATTC TTTTGATTTATAAGGGATTTTGCCGatctcggactATTGgtataaAAAtG AGCTGATTTAAcagatatTTAACGCGAattgtaacaacatATTAACGCTT ACAATTTaggtggCActgttcgtggaAATGtgcGCGga QA 177 671 177 788 DS 
CHROMAT_FILE: LL2318r PHD_FILE: LL2318r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:39 2000 RT{ LL2318r matchElsewhereLowQual phrap 443 472 000919:094547 } RT{ LL2318r matchElsewhereLowQual phrap 426 441 000919:094547 } RT{ LL2318r matchElsewhereHighQual phrap 350 424 000919:094547 } RT{ LL2318r matchElsewhereHighQual phrap 187 344 000919:094547 } RD LL2265r 783 0 1 atxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxAAAACCCTGGCGTTACCCAACTTAA TCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGG CCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGG GACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCG CAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTT TCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTA AATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGA CCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCT GATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTAATAGTG GACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCT TTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGgttaaAAAATGA GCTGATTTAACAAAAATTTAACGCGAATTTTAAcaaAaTATTAACGCTTA CAATTtangtGGCACTTTTcggngaAATGTGcg QA 176 754 176 783 DS CHROMAT_FILE: LL2265r PHD_FILE: LL2265r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:14 2000 RT{ LL2265r matchElsewhereHighQual phrap 176 471 000919:094547 } RD LL2316r 790 0 1 attxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxAAAACCCTGGCGTTACCCAACTTAA TCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGG CCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGG GACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCG CAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTT TCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTA AATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGA CCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCT 
GATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTAATAGTG GACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCT TTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGgttaaAAAATGA GCTGATTTAACAAAAATTTAACGCGAATTTTAaCaaaaTATTAACGCTTA CAATTTaggtggcACTTTTCggggaaaTGtgcgcggaacc QA 176 790 176 790 DS CHROMAT_FILE: LL2316r PHD_FILE: LL2316r.phd.1 CHEM: unknown DYE: unknown TIME: Tue Sep 19 09:45:30 2000 RT{ LL2316r matchElsewhereHighQual phrap 176 471 000919:094547 } WA{ phrap_params phrap 000919:094547 /usr/local/genome/bin/phrap Run_SRC3700_2000-07-06_44+45.fasta.screen -new_ace -view phrap version 0.990329 } test.gcgblast100644000766000024 733114006427353 17067 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data!!SEQUENCE_LIST 1.0 BLASTP 2.2.1 [Apr-13-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= /v0/people/staji002/test.gcg (146 letters) Database: pir 274,514 sequences; 93,460,074 total letters Searching. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .done Score E Sequences producing significant alignments: (bits) Value .. 
PIR2:S44629 Begin: 342 End: 470 !F22B7.10 protein - Caenorhabditis elegans 57 2e-08 PIR2:T21398 Begin: 40 End: 108 !hypothetical protein F26D2.1 - Caenorhabditis elegans 33 0.40 PIR1:WMBELM Begin: 307 End: 385 !membrane protein LMP-2A - human herpesvirus 4 32 0.53 \\End of List >PIR2:S44629 F22B7.10 protein - Caenorhabditis elegans Length = 628 Score = 57.0 bits (136), Expect = 2e-08 Identities = 38/135 (28%), Positives = 69/135 (50%), Gaps = 8/135 (5%) Query: 3 CAAEFDFMEKETPLRYTKTXXXXXXXXXXXXXXRKIISDMWGVLAKQQTHVRKHQFDHGE 62 C+AEFDF++ T + T + + +L + + ++GE Sbjct: 342 CSAEFDFIQYSTIEKLCGTLLIPLALISLVTFVFNFVKNT-NLLWRNSEEIG----ENGE 396 Query: 63 LVYHALQLLAYTALGILIMRLKLFLTPYMCVMASLICSRQLFGW--LFCKVHPGAIVFVI 120 ++Y+ +QL T + LIMRLKLF+TP++C++A+L + +L G + + A+V VI Sbjct: 397 ILYNVVQLCCSTVMAFLIMRLKLFMTPHLCIVAALFANSKLLGGDRISKTIRVSALVGVI 456 Query: 121 LAAMSIQGSANLQTQ 135 A + +G N++ Q Sbjct: 457 -AILFYRGIPNIRQQ 470 >PIR2:T21398 hypothetical protein F26D2.1 - Caenorhabditis elegans Length = 346 Score = 32.7 bits (73), Expect = 0.40 Identities = 20/71 (28%), Positives = 41/71 (57%), Gaps = 11/71 (15%) Query: 66 HALQLLAYTALGILIMRLKLFLTPYMCV---------MASLICSRQLFGWLFCKVHPGAI 116 + + ++A+ +LGI+ L++F+ PY+ V +++ I ++ L WLF + G + Sbjct: 40 YRIMIVAFASLGIIYSGLEVFIKPYLHVYNNCILYFSLSTWISAKPLLPWLFA-IWSG-M 97 Query: 117 VFVILAAMSIQ 127 V++A +SIQ Sbjct: 98 YLVVIAFISIQ 108 >PIR1:WMBELM membrane protein LMP-2A - human herpesvirus 4 Length = 497 Score = 32.3 bits (72), Expect = 0.53 Identities = 26/79 (32%), Positives = 38/79 (47%), Gaps = 4/79 (5%) Query: 67 ALQLLAYTALGILIMRLKLFLTPYMCVMASLICSR----QLFGWLFCKVHPGAIVFVILA 122 AL LLA LG L + L ++ LICS L L ++ A+ ++LA Sbjct: 307 ALALLASLILGTLNLTTMFLLMLLWTLVVLLICSSCSSCPLSKILLARLFLYALALLLLA 366 Query: 123 AMSIQGSANLQTQWKSTAS 141 + I G + LQT +KS +S Sbjct: 367 SALIAGGSILQTNFKSLSS 385 Database: pir Posted date: Jan 3, 2002 1:13 PM Number of letters in database: 93,460,074 Number of sequences in database: 274,514 Lambda K H 0.329 0.137 0.426 Gapped Lambda K H 
0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 21,034,208 Number of Sequences: 274514 Number of extensions: 620007 Number of successful extensions: 1381 Number of sequences better than 1.0: 3 Number of HSP's better than 1.0 without gapping: 1 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 1379 Number of HSP's gapped (non-prelim): 3 length of query: 146 length of database: 93,460,074 effective HSP length: 102 effective length of query: 44 effective length of database: 65,459,646 effective search space: 2880224424 effective search space used: 2880224424 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.9 bits) S2: 70 (31.6 bits) testaln.fastq100644000766000024 524214006427353 17111 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data@FC12044_91407_8_200_406_24 GTTAGCTCCCACCTTAAGATGTTTA +FC12044_91407_8_200_406_24 SXXTXXXXXXXXXTTSUXSSXKTMQ @FC12044_91407_8_200_720_610 CTCTGTGGCACCCCATCCCTCACTT +FC12044_91407_8_200_720_610 OXXXXXXXXXXXXXXXXXTSXQTXU @FC12044_91407_8_200_345_133 GATTTTTTAACAATAAACGTACATA +FC12044_91407_8_200_345_133 OQTOOSFORTFFFIIOFFFFFFFFF @FC12044_91407_8_200_106_131 GTTGCCCAGGCTCGTCTTGAACTCC +FC12044_91407_8_200_106_131 XXXXXXXXXXXXXXSXXXXISTXQS @FC12044_91407_8_200_916_471 TGATTGAAGGTAGGGTAGCATACTG +FC12044_91407_8_200_916_471 XXXXXXXXXXXXXXXUXXUSXXTXW @FC12044_91407_8_200_57_85 GCTCCAATAGCGCAGAGGAAACCTG +FC12044_91407_8_200_57_85 XFXMXSXXSXXXOSQROOSROFQIQ @FC12044_91407_8_200_10_437 GCTGCTTGGGAGGCTGAGGCAGGAG +FC12044_91407_8_200_10_437 USXSXXXXXXUXXXSXQXXUQXXKS @FC12044_91407_8_200_154_436 AGACCTTTGGATACAATGAACGACT +FC12044_91407_8_200_154_436 MKKMQTSRXMSQTOMRFOOIFFFFF @FC12044_91407_8_200_336_64 AGGGAATTTTAGAGGAGGGCTGCCG +FC12044_91407_8_200_336_64 STQMOSXSXSQXQXXKXXXKFXFFK @FC12044_91407_8_200_620_233 TCTCCATGTTGGTCAGGCTGGTCTC +FC12044_91407_8_200_620_233 XXXXXXXXXXXXXXXXXXXXXSXSW 
@FC12044_91407_8_200_902_349 TGAACGTCGAGACGCAAGGCCCGCC +FC12044_91407_8_200_902_349 XMXSSXMXXSXQSXTSQXFKSKTOF @FC12044_91407_8_200_40_618 CTGTCCCCACGGCGGGGGGGCCTGG +FC12044_91407_8_200_40_618 TXXXXSXXXXXXXXXXXXXRKFOXS @FC12044_91407_8_200_83_511 GATGTACTCTTACACCCAGACTTTG +FC12044_91407_8_200_83_511 SOXXXXXUXXXXXXQKQKKROOQSU @FC12044_91407_8_200_76_246 TCAAGGGTGGATCTTGGCTCCCAGT +FC12044_91407_8_200_76_246 XTXTUXXXXXRXXXTXXSUXSRFXQ @FC12044_91407_8_200_303_427 TTGCGACAGAGTTTTGCTCTTGTCC +FC12044_91407_8_200_303_427 XXQROXXXXIXFQXXXOIQSSXUFF @FC12044_91407_8_200_31_299 TCTGCTCCAGCTCCAAGACGCCGCC +FC12044_91407_8_200_31_299 XRXTSXXXRXXSXQQOXQTSQSXKQ @FC12044_91407_8_200_553_135 TACGGAGCCGCGGGCGGGAAAGGCG +FC12044_91407_8_200_553_135 XSQQXXXXXXXXXXSXXMFFQXTKU @FC12044_91407_8_200_139_74 CCTCCCAGGTTCAAGCGATTATCCT +FC12044_91407_8_200_139_74 RMXUSXTXXQXXQUXXXSQISISSO @FC12044_91407_8_200_108_33 GTCATGGCGGCCCGCGCGGGGAGCG +FC12044_91407_8_200_108_33 OOOSSXXSXXOMKMOFMKFOKFFFF @FC12044_91407_8_200_980_965 ACAGTGGGTTCTTAAAGAAGAGTCG +FC12044_91407_8_200_980_965 TOSSRXXXSSMSXMOMXIRXOXFFS @FC12044_91407_8_200_981_857 AACGAGGGGCGCGACTTGACCTTGG +FC12044_91407_8_200_981_857 RXMSSXXXXSXQXQXFSXQFQKMXS @FC12044_91407_8_200_8_865 TTTCCCACCCCAGGAAGCCTTGGAC +FC12044_91407_8_200_8_865 XXXFKOROMKOORMIMRIIKKORFF @FC12044_91407_8_200_292_484 TCAGCCTCCGTGCCCAGCCCACTCC +FC12044_91407_8_200_292_484 XQXOSXXXXXUXXXXIXXXXQTOXF @FC12044_91407_8_200_675_16 CTCGGGAGGCTGAGGCAGGGGGGTT +FC12044_91407_8_200_675_16 OXTXXXSXXQXXOXXKMXXMXOKQF @FC12044_91407_8_200_285_136 CCAAATCTTGAATTGTAGCTCCCCT +FC12044_91407_8_200_285_136 OSXOQXXXXXSXXUXXTXXXXTRMS gf-s71.needle100644000766000024 541214006427353 16560 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data######################################## # Program: needle # Rundate: Thu Jun 20 10:53:54 2002 # Report_file: /tmp/jason/water/1/gf.s71.44_Y50C1A.2.water ######################################## #======================================= # # Aligned_sequences: 2 
# 1: gf.s71.44 # 2: Y50C1A.2 # Matrix: EBLOSUM62 # Gap_penalty: 10.0 # Extend_penalty: 0.5 # # Length: 468 # Identity: 285/468 (60.9%) # Similarity: 339/468 (72.4%) # Gaps: 82/468 (17.5%) # Score: 1394.0 # # #======================================= gf.s71.44 1 MEDVTLHHFRWRKPVENKNGEIVYKTSETQTAEISRKDVECVANFQKSQE 50 ||||||..|.||||: ...||||||||||||.|.::||||||||||:::| Y50C1A.2 1 MEDVTLFQFTWRKPI-RLQGEIVYKTSETQTIETNKKDVECVANFQENKE 49 gf.s71.44 51 SQTDDFMQNGVGDGIKKEIRISKEVLGHIYDFLRDDSKVNYDRLLEFHKF 100 .|||. :.||||:.:||:|.||||||..:|||:||||||||||||||||| Y50C1A.2 50 VQTDS-VDNGVGENVKKDITISKEVLNLLYDFVRDDSKVNYDRLLEFHKF 98 gf.s71.44 101 DKVSLETVQKYHVETRNENIILMISNSSRKTLILFGGLSHETFCSHQARA 150 |||:|||||||||||||||||||||:|||||||||||:|||||||||||| Y50C1A.2 99 DKVALETVQKYHVETRNENIILMISSSSRKTLILFGGISHETFCSHQARA 148 gf.s71.44 151 VLCSSSTTSSLPLPVCAISAVFYSSTQFLLGDISGNISMWTKEKMIFENK 200 :||||||:.|:|||||||||||||||||:|||:||||||.:|:|:|||.| Y50C1A.2 149 LLCSSSTSFSIPLPVCAISAVFYSSTQFILGDVSGNISMCSKDKIIFEKK 198 gf.s71.44 201 VTDGSVTSLELCRYGLLSGSDDGNVILWKVEESKIEKIEGIKLTVSDLSR 250 :|||:||.||:||:||||||||||:|||::..|.:||:.|.||||||||| Y50C1A.2 199 ITDGAVTCLEMCRHGLLSGSDDGNIILWQIGTSGLEKLGGTKLTVSDLSR 248 gf.s71.44 251 KIRRSSTSNKPVAIVSMQV----SGDEVCVATETGGLYLLTLPTLESKPL 296 ||||||||||||||||||| ||:|.|||||||||||||||||:.||| Y50C1A.2 249 KIRRSSTSNKPVAIVSMQVYVWPSGEEACVATETGGLYLLTLPTLDYKPL 298 gf.s71.44 297 T-QSATSIFKILYEHPYIAVVYHTSNSAIFNSEGLVDEIPFVATLAVRCG 345 : |:||||.|||:|:.::||:|||||:|:||||||||||||||||||| Y50C1A.2 299 SHQTATSINKILFENQFVAVIYHTSNAAVFNSEGLVDEIPFVATLAVR-- 346 gf.s71.44 346 AYFIFSNQSRLIIWSMNTRSTVIDENLNCHS-ICSLSND----------- 383 .:|::: .|...|.|..|||.| ....:|: Y50C1A.2 347 --------PKLVLF--YTSVCVQDITLNCTSPFREFNNEYNPVIKFSKIR 386 gf.s71.44 384 ---TLQVLDGDFNLNSQSENSATSESENLRISDLQNLRMLKLQNLRTSEF 430 .|.|::| |..:|.:.|: Y50C1A.2 387 FSADLSVING-FRTSSPNSNN 406 gf.s71.44 431 QNFRTSESQYFKKDNGEL 448 Y50C1A.2 407 406 #--------------------------------------- #--------------------------------------- 
signalp.short100644000766000024 257514006427353 17123 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data# SignalP-NN euk predictions # SignalP-HMM euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? D ? # name ! Cmax pos ? Sprob ? BC1G_00001.1 0.074 23 N 0.014 33 N 0.067 2 N 0.023 N 0.018 N BC1G_00001.1 Q 0.000 0 N 0.000 N BC1G_00002.1 0.079 26 N 0.059 22 N 0.406 1 N 0.100 N 0.079 N BC1G_00002.1 Q 0.000 22 N 0.000 N BC1G_00003.1 0.934 22 Y 0.866 22 Y 0.991 13 Y 0.938 Y 0.902 Y BC1G_00003.1 S 0.973 22 Y 0.999 Y BC1G_00004.1 0.078 30 N 0.101 17 N 0.425 1 N 0.221 N 0.161 N BC1G_00004.1 Q 0.001 19 N 0.003 N BC1G_00005.1 0.085 18 N 0.078 18 N 0.683 3 N 0.188 N 0.133 N BC1G_00005.1 Q 0.007 22 N 0.008 N BC1G_00006.1 0.244 27 N 0.035 27 N 0.161 2 N 0.053 N 0.044 N BC1G_00006.1 Q 0.000 23 N 0.000 N BC1G_00007.1 0.257 22 N 0.176 22 N 0.620 2 N 0.258 N 0.217 N BC1G_00007.1 Q 0.228 22 N 0.240 N BC1G_00008.1 0.576 83 Y 0.383 83 Y 0.952 45 Y 0.489 Y 0.436 Y BC1G_00008.1 Q 0.061 22 N 0.222 N BC1G_00009.1 0.425 28 Y 0.153 28 N 0.448 4 N 0.140 N 0.147 N BC1G_00009.1 Q 0.000 20 N 0.000 N BC1G_00010.1 0.380 421 Y 0.057 15 N 0.362 4 N 0.146 N 0.102 N BC1G_00010.1 Q 0.000 0 N 0.000 N test-4.9.meme100644000766000024 51524514006427353 16604 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data******************************************************************************** MEME - Motif discovery tool ******************************************************************************** MEME version 4.9.0 (Release date: Wed Oct 3 11:07:26 EST 2012) For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.nbcr.net. This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.nbcr.net. 
******************************************************************************** ******************************************************************************** REFERENCE ******************************************************************************** If you use this program in your research, please cite: Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994. ******************************************************************************** ******************************************************************************** TRAINING SET ******************************************************************************** DATAFILE= multibasic-uniq.fa ALPHABET= ACDEFGHIKLMNPQRSTVWY Sequence name Weight Length Sequence name Weight Length ------------- ------ ------ ------------- ------ ------ A/chicken/Bangladesh/830 1.0000 21 A/chicken/Egypt/39825/20 1.0000 21 A/ck/Indonesia/072/10 1.0000 21 A/duck/Bangladesh/5749/1 1.0000 21 A/mallard/Crimea/245/200 1.0000 21 A/environment/Bangladesh 1.0000 21 A/chicken/Egypt/111945V/ 1.0000 21 A/avian/New_York/Sg00372 1.0000 21 A/chicken/Egypt/1117AF/2 1.0000 21 A/wild_bird_feces/Cheons 1.0000 21 A/chicken/India/241272/2 1.0000 21 A/duck/Jiangxi/80/2005 1.0000 21 A/chicken/Inhu/BPPVRII/2 1.0000 21 A/duck/Guangxi/668/2004 1.0000 21 A/whooper_swan/Hokkaido/ 1.0000 21 A/duck/Vietnam/1/2010 1.0000 21 A/avian/New_York/Sg00387 1.0000 21 A/chicken/Cambodia/LC/20 1.0000 21 A/mallard/Maryland/786/2 1.0000 21 A/chicken/Egypt/128s/201 1.0000 21 A/guinea_fowl/Yangon/834 1.0000 21 A/chicken/WestJava/SmiAc 1.0000 21 A/Indonesia/625/2006 1.0000 21 A/Egypt/N07460/2012 1.0000 21 A/duck/Victoria/26/1981| 1.0000 21 A/duck/Viet_Nam/TG2401/2 1.0000 21 A/chicken/Egypt/398252/2 1.0000 21 A/environment/Maryland/1 1.0000 21 A/duck/New_Zealand/41/19 
1.0000 21 A/heron/Cambodia/TM068/2 1.0000 21 A/chicken/Egypt/10117/20 1.0000 21 A/VietNam/HN31413/2008 1.0000 21 A/chicken/Egypt/398214/2 1.0000 21 A/chicken/Egypt/39823/20 1.0000 21 A/chicken/Belgium/150VB/ 1.0000 21 A/goose/Fujian/bb/2003 1.0000 21 A/Vietnam/UT30259/2004 1.0000 21 A/duck/Yunnan/47/2006 1.0000 21 A/chicken/Egypt/10259SF/ 1.0000 21 A/bird/Turkey/Unye_ist06 1.0000 21 A/chicken/Bangladesh/11r 1.0000 21 A/chicken/Vietnam/NCVD19 1.0000 21 A/swan/England/AV3142149 1.0000 21 A/chicken/Ibaraki/17/200 1.0000 21 A/chicken/Nepal/354/2010 1.0000 21 A/duck/Vietnam/NCVD366/2 1.0000 21 A/duck/Korea/GJ54/2004|G 1.0000 21 A/muscovy_duck/Vietnam/L 1.0000 21 A/duck/Egypt/1130AG/2011 1.0000 21 A/ck/Indonesia/091/10 1.0000 21 A/duck/Vietnam/9/2010 1.0000 21 A/duck/Hokkaido/Vac3/200 1.0000 21 A/Thailand/WRAIR1720H/20 1.0000 21 A/chicken/Cambodia/022LC 1.0000 21 A/spurwinged_goose/Niger 1.0000 21 A/pigeon/Egypt/SHAH5803/ 1.0000 21 A/whooper_swan/Mongolia/ 1.0000 21 A/chicken/Shan/2626/2007 1.0000 21 A/chicken/Italy/367/97|A 1.0000 21 A/turkey/Italy/1325/2005 1.0000 21 A/chicken/Egypt/1158SF/2 1.0000 21 A/chicken/Egypt/398220/2 1.0000 21 A/condor/Guangdong/139/2 1.0000 21 A/chicken/Shandong/A1/20 1.0000 21 A/chicken/Egypt/11764s/2 1.0000 21 A/swine/NorthSumatra/UT6 1.0000 21 A/o.bill_stork/Thailand/ 1.0000 21 A/chicken/Texas/2983132/ 1.0000 21 A/chicken/Bangladesh/11V 1.0000 21 A/chicken/Ibaraki/15/200 1.0000 21 A/chicken/Shandong/A10/2 1.0000 21 A/duck/Vietnam/OIE1287/2 1.0000 21 A/duck/Vietnam/NCVD1161/ 1.0000 21 A/duck/Ireland/113/1983| 1.0000 21 A/chicken/EastKalimantan 1.0000 21 A/duck/Vietnam/3/2010 1.0000 21 A/Muscovy_duck/Ca_Mau/11 1.0000 21 A/chicken/Turkey/Misinli 1.0000 21 A/chicken/Italy/9097/199 1.0000 21 A/mallard/Washington/456 1.0000 21 A/duck/Iran/11VIR53161/2 1.0000 21 A/Muscovy_Duck/Vietnam/1 1.0000 21 A/duck/Hunan/149/2005 1.0000 21 A/Egypt/4822NAMRU3/2009 1.0000 21 A/duck/Vietnam/NCVD1463/ 1.0000 21 A/poultry/Egypt/398256/2 1.0000 21 
A/turkey/Italy/1980|GQ24 1.0000 21 A/goose/Bangladesh/11VIR 1.0000 21 A/mallard/Washington/454 1.0000 21 A/chicken/Cambodia/013LC 1.0000 21 A/chicken/Banten/PdglKas 1.0000 21 A/chicken/Vietnam/NCVD03 1.0000 21 A/wigeon/Ohio/379/1988|C 1.0000 21 A/chicken/Magelang/BBVW6 1.0000 21 A/chicken/Lampung/BPPVRI 1.0000 21 A/chick/Pennsylvania/1/1 1.0000 21 A/chicken/Liaoning/A1/20 1.0000 21 A/wild_bird_feces/Cheons 1.0000 21 A/chicken/Sikkim/151466/ 1.0000 21 A/environment/ChangSha/2 1.0000 21 A/chicken/CentralJava/UT 1.0000 21 A/Vietnam/HN36250/2010 1.0000 21 A/chicken/Guiyang/821/20 1.0000 21 A/owstons_civet/VietNam/ 1.0000 21 A/chicken/Bangladesh/11r 1.0000 21 A/environment/New_York/1 1.0000 21 A/domestic_goose/Hong_Ko 1.0000 21 A/Hubei/1/2010 1.0000 21 A/Hunan/1/2009 1.0000 21 A/chicken/Egypt/1123AL/2 1.0000 21 A/wood_duck/MD/04623/200 1.0000 21 A/chicken/Shanxi/2/2006 1.0000 21 A/quail/Egypt/1171SG/201 1.0000 21 A/duck/France/080036/200 1.0000 21 A/chicken/Yangon/182/201 1.0000 21 A/chicken/Scotland/59|X0 1.0000 21 A/chicken/Sharkia/CAI41/ 1.0000 21 A/mallard/Sweden/21/2002 1.0000 21 A/environment/Bangladesh 1.0000 21 A/chicken/Hebei/A8/2009 1.0000 21 A/chicken/Egypt/11VIR445 1.0000 21 A/duck/Guangxi/13/2004 1.0000 21 A/duck/France/090043/200 1.0000 21 A/chicken/Denpasar/BBVD1 1.0000 21 A/Egypt/N6774/2011 1.0000 21 A/parrot/CA/6032/04|DQ25 1.0000 21 A/Cambodia/VN05103/2005 1.0000 21 A_GenBank/heron/Cambodia 1.0000 21 A/chicken/Bangladesh/11r 1.0000 21 A/chicken/Tabanan/BBVD14 1.0000 21 A/great_cormorant/Tibet/ 1.0000 21 A/Egypt/321NAMRU3/2007 1.0000 21 A/chicken/Egypt/10512AG/ 1.0000 21 AHAH5_[11734;11734] 1.0000 21 A/Egypt/9174NAMRU3/2009 1.0000 21 A/Indonesia/NIHRD12379/2 1.0000 21 A/chicken/Korea/ES/03 1.0000 21 A/avian/New_York/Sg00377 1.0000 21 A/chicken/Vietnam/NCVD09 1.0000 21 A/Chicken/TurkeyMus/09rs 1.0000 21 A/duck/Hong_Kong/312/197 1.0000 21 A/Vietnam/UT3030/2003 1.0000 21 A/crow/Bangladesh/11rs19 1.0000 21 A/duck/Primorie/2633/200 1.0000 21 
A/Indonesia/UT3006/2005 1.0000 21 A/duck/Qalubia/CAI11/201 1.0000 21 A/green_winged_teal/Dela 1.0000 21 A/chicken/Anhui/T5/2006 1.0000 21 A/chicken/Puebla/8623607 1.0000 21 A/duck/Guangxi/951/2005 1.0000 21 A/chicken/Egypt/1219s/20 1.0000 21 A/chicken/WestJava/SmiSu 1.0000 21 A/chicken/Cambodia/TLC2/ 1.0000 21 A/duck/France/05056a/200 1.0000 21 A/Shandong/1/2009 1.0000 21 A/chicken/Vietnam/NCVD40 1.0000 21 A/chicken/Queretaro/7653 1.0000 21 A/environment/Bangladesh 1.0000 21 A/barn_swallow/Hong_Kong 1.0000 21 A/duck/Primorie/2621/200 1.0000 21 A/chicken/BacLieuVietnam 1.0000 21 A_DISC/Cambodia/V0401301 1.0000 21 A/bar_headed_goose/Mongo 1.0000 21 A/chicken/Egypt/113Q/201 1.0000 21 A/quail/Thanatpin/2283/2 1.0000 21 A/chicken/Egypt/125s/201 1.0000 21 A/duck/Hong_Kong/698/197 1.0000 21 A/chicken/Egypt/209573/2 1.0000 21 A/chicken/Indonesia/SmiW 1.0000 21 A/turkey/Egypt/091QNLQP/ 1.0000 21 A/duck/Egypt/1053/2010 1.0000 21 A/Egypt/2786NAMRU3/2006 1.0000 21 A/mallard/Netherlands/3/ 1.0000 21 A/chicken/Bangladesh/11r 1.0000 21 A/chicken/Liaoning/23/20 1.0000 21 A/unknown/NY/98996/01|AY 1.0000 21 A/tern/South_Africa/1961 1.0000 21 A/chicken/Nepal/T1P/12 1.0000 21 A/marabou_stork/Cambodia 1.0000 21 A/chicken/Cambodia/67F1/ 1.0000 21 A/gadwall/California/442 1.0000 21 A/chicken/TanseMyanmar/S 1.0000 21 A/duck/Cao_Bang/43/2007 1.0000 21 A/chicken/Miyazaki/T10/2 1.0000 21 A/whitefaced_whistling_d 1.0000 21 A/muscovy_duck/Vietnam/N 1.0000 21 A/Canada_goose/Alaska//4 1.0000 21 A/chicken/Bangladesh/967 1.0000 21 A/chicken/Egypt/11VIR445 1.0000 21 A/Hong_Kong/7032/2012 1.0000 21 A/chicken/Vietnam/945/20 1.0000 21 A/ostrich/South_Africa/A 1.0000 21 A/wild_bird/Wisconsin/43 1.0000 21 A/spurwinged_goose/Niger 1.0000 21 A/duck/Vietnam/NCVD1026/ 1.0000 21 A/tern/South_Africa/1959 1.0000 21 A/muscovy_duck/Jakarta/S 1.0000 21 A/goose/Germany/R3160/09 1.0000 21 A/turkey/Ontario/7732/19 1.0000 21 A/chicken/Egypt/1090/201 1.0000 21 A/mallard/Netherlands/2/ 1.0000 21 
A/chicken/Vietnam/NCVD18 1.0000 21 A/chicken/Texas/1672804/ 1.0000 21 A/duck/Italy/775/2004|CY 1.0000 21 A/chicken/Bangladesh/FD( 1.0000 21 A/chicken/Vietnam/NCVD40 1.0000 21 A/swan/Hokkaido/67/1996| 1.0000 21 A/chicken/Bangladesh/152 1.0000 21 A/chicken/Bangladesh/11r 1.0000 21 A/chicken/Indonesia/Suka 1.0000 21 A/gull/Pennsylvania/4175 1.0000 21 A/turkey/England/N28/73| 1.0000 21 A/chicken/Badung/BBVD302 1.0000 21 A/chicken/Bangladesh/11r 1.0000 21 A/chicken/Yichang/lung1/ 1.0000 21 A/avian/Missouri/4655937 1.0000 21 A/turkey/TX/14082/1982_H 1.0000 21 A/Cambodia/W0526301/2012 1.0000 21 A/chicken/Vietnam/NCVD11 1.0000 21 A/duck/Egypt/SHZA6605/20 1.0000 21 A/chicken/Shandong/A5/20 1.0000 21 A/chicken/VA/40018/1984_ 1.0000 21 A/common_magpie/HongKong 1.0000 21 A/shearwater/Australia/7 1.0000 21 A/Chicken/TurkeyEdirne/0 1.0000 21 A/goose/Guiyang/337/2006 1.0000 21 A/chicken/Egypt/11VIR445 1.0000 21 A/environment/Thailand/I 1.0000 21 A/wild_bird_feces/Byeong 1.0000 21 A/duck/Vietnam/NCVD422/2 1.0000 21 A/chicken/Bhutan/4/10 1.0000 21 A/turkey/MN/40550/1987_H 1.0000 21 A/chicken/Egypt/1085/201 1.0000 21 A/duck/France/06436/2006 1.0000 21 A/HongKong/6841/2010 1.0000 21 A/chicken/WestBengal/239 1.0000 21 A/duck/Bac_Lieu/1213/200 1.0000 21 A/cinnamon_teal/Californ 1.0000 21 A/Muscovy_duck/France/07 1.0000 21 A/chicken/Vietnam/4/2010 1.0000 21 A/chicken/Vietnam/NCVD01 1.0000 21 A/chicken/Puebla/1458665 1.0000 21 A/Northern_shoveler/Utah 1.0000 21 A/tundra_swan/Alaska//48 1.0000 21 A/chicken/Sheny/0606/200 1.0000 21 A/turkey/England/N28/73| 1.0000 21 A/duck/Iran/VIR53161/201 1.0000 21 A/chicken/Yangon/1023/20 1.0000 21 A/chicken/Vietnam/5/2010 1.0000 21 A/chicken/Hebei/326/2005 1.0000 21 A/chicken/Bangladesh/150 1.0000 21 A/wild_bird/Minnesota/46 1.0000 21 A/chicken/India/81766/20 1.0000 21 ******************************************************************************** ******************************************************************************** COMMAND LINE SUMMARY 
******************************************************************************** This information can also be useful in the event you wish to report a problem with the MEME software. command: meme multibasic-uniq.fa -mod zoops -nmotifs 10 model: mod= zoops nmotifs= 10 evt= inf object function= E-value of product of p-values width: minw= 8 maxw= 21 minic= 0.00 width: wg= 11 ws= 1 endgaps= yes nsites: minsites= 2 maxsites= 253 wnsites= 0.8 theta: prob= 1 spmap= pam spfuzz= 120 global: substring= yes branching= no wbranch= no em: prior= megap b= 26565 maxiter= 50 distance= 1e-05 data: n= 5313 N= 253 sample: seed= 0 seqfrac= 1 Dirichlet mixture priors file: prior30.plib Letter frequencies in dataset: A 0.481 C 0.062 D 0.001 E 0.000 F 0.000 G 0.333 H 0.000 I 0.000 K 0.000 L 0.000 M 0.000 N 0.000 P 0.000 Q 0.000 R 0.001 S 0.000 T 0.122 V 0.000 W 0.000 Y 0.000 Background letter frequencies (from dataset with add-one prior applied): A 0.479 C 0.062 D 0.001 E 0.000 F 0.000 G 0.332 H 0.000 I 0.000 K 0.001 L 0.000 M 0.000 N 0.001 P 0.000 Q 0.000 R 0.001 S 0.000 T 0.122 V 0.000 W 0.000 Y 0.000 ******************************************************************************** ******************************************************************************** MOTIF 1 width = 21 sites = 47 llr = 971 E-value = 5.0e-239 ******************************************************************************** -------------------------------------------------------------------------------- Motif 1 Description -------------------------------------------------------------------------------- Simplified A 928298a29a:a::4::8::: pos.-specific C :::::::8::::::49:1::: probability D ::::::::::::::::::::: matrix E ::::::::::::::::::::: F ::::::::::::::::::::: G :82811::1:a:aa:::1::: H ::::::::::::::::::::: I ::::::::::::::::::::: K ::::::::::::::::::::: L ::::::::::::::::::::: M ::::::::::::::::::::: N ::::::::::::::::::::: P ::::::::::::::::::::: Q ::::::::::::::::::::: R ::::::::::::::::::::: S 
::::::::::::::::::::: T ::::::::::::::21a:9a9 V ::::::::::::::::::::: W ::::::::::::::::::::: Y ::::::::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (29.8 bits) 5.0 3.7 * 2.5 * ** *** 1.2 ** ** ** ******** *** 0.0 --------------------- Multilevel AGAGAAACAAGAGGACTATTT consensus A A C sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------------- AHAH5_[11734;11734] 1 3.05e-19 . CNAAKWGAARGAGADRDNRKR A/environment/New_York/1 1 7.24e-13 . AGAGAAACAAGAGGCCTATTT A/Northern_shoveler/Utah 1 1.11e-12 . AGAGAAACAAGAGGCCTCTTT A/gadwall/California/442 1 2.10e-12 . AGGGAAACAAGAGGCCTATTT A/wigeon/Ohio/379/1988|C 1 2.82e-12 . AGAGAAACAAGAGGCCTGTTT A/wild_bird/Minnesota/46 1 4.20e-12 . AGAGAAACAAGAGGTCTATTT A/green_winged_teal/Dela 1 5.44e-12 . AGAAAAACAAGAGGCCTATTT A/wood_duck/MD/04623/200 1 9.89e-12 . AGAGAAACAAGAGGACTATTT A/mallard/Washington/454 1 9.89e-12 . AAAGAAACAAGAGGCCTATTT A/tundra_swan/Alaska//48 1 1.29e-11 . AGAGGAACAAGAGGCCTATTT A/duck/Victoria/26/1981| 1 1.29e-11 . AGGGAGACAAGAGGCCTATTT A/cinnamon_teal/Californ 1 1.49e-11 . AAAGAAACAAGAGGCCTCTTT A/turkey/TX/14082/1982_H 1 1.88e-11 . AGAGAAACAAGAGGTCTGTTT A/avian/Missouri/4655937 1 1.88e-11 . AGAGAAACAAGAGGCTTATTT A/unknown/NY/98996/01|AY 1 1.88e-11 . GGAGAAACAAGAGGCCTATTT A/duck/Hong_Kong/698/197 1 2.08e-11 . AGAGAGACAAGAGGTCTATTT A/mallard/Sweden/21/2002 1 2.67e-11 . CGAGAAACAAGAGGACTATTT A/environment/Maryland/1 1 3.01e-11 . AGAGAAACAAGAGGCCCATTT A/chicken/Texas/1672804/ 1 3.63e-11 . AGAGAAAAAAGAGGCCTATTT A/goose/Germany/R3160/09 1 4.14e-11 . AGAGAGACAAGAGGACTATTT A/mallard/Netherlands/2/ 1 5.83e-11 . AGAAAAACAAGAGGACTATTT A/chicken/Ibaraki/17/200 1 5.83e-11 . 
AGGGAAACAAGAGGCTTATTT A/shearwater/Australia/7 1 6.48e-11 . AGGGAGACAAGAGGTCTATTT A/mallard/Crimea/245/200 1 6.48e-11 . AGAGAAACAAGAGGGCTATTT A/duck/France/080036/200 1 8.17e-11 . AAAGAAACAAGAGGACTATTT A/duck/Italy/775/2004|CY 1 9.22e-11 . AGAGATACAAGAGGACTATTT A/duck/Hokkaido/Vac3/200 1 1.27e-10 . AGAGAAACAAGAGGACTATTC A/parrot/CA/6032/04|DQ25 1 1.65e-10 . AAAGAAACAAGAGGCTTATTT A/chicken/Ibaraki/15/200 1 1.65e-10 . AGAGAAACAAGAGGATTATTT A/swan/Hokkaido/67/1996| 1 2.49e-10 . AGAGAAACAAGAGGACTACTT A/duck/France/090043/200 1 2.89e-10 . AGAGAGACGAGAGGACTATTT A/spurwinged_goose/Niger 1 2.89e-10 . AGAGAAAAAAGAGGACTATTT A/duck/France/05056a/200 1 4.65e-10 . AAAGAAACAAGAGGGCTATTT A/chicken/Texas/2983132/ 1 5.19e-10 . AGGAAAAAAAGAGGCCTATTT A/duck/New_Zealand/41/19 1 6.33e-10 . AGAGACACAAGGGGCCTGTTT A/chicken/Belgium/150VB/ 1 7.84e-10 . AAGGAGACAAGAGGACTATTT A/duck/France/06436/2006 1 8.65e-10 . AAAGAAACAAGAGGACTATTC A/Muscovy_duck/France/07 1 1.05e-09 . AAAGAAGCAAGAGGACTATTT A/gull/Pennsylvania/4175 1 1.05e-09 . AGAGAAACAAAAGGTCTGTTT A/chicken/Scotland/59|X0 1 5.46e-09 . AGGAAGAAAAGAGGTCTATTT A/swan/England/AV3142149 1 5.93e-09 . AGAAAAAAGAGAGGACTATTT A/duck/Hunan/149/2005 1 8.40e-09 . AGAAGAAAAAGAGGACTATTT A/bar_headed_goose/Mongo 1 2.05e-08 . AGAAGAAAAAGAGGACTGTTT A/chick/Pennsylvania/1/1 1 2.05e-08 . AGGAAAAAGAGAGGTCTGTTT A/whooper_swan/Mongolia/ 1 5.15e-08 . ATTGAAACTAGAGGATTATTT A/chicken/VA/40018/1984_ 1 8.00e-08 . AAGAAAAAGAGAGGTCTGTTT A/mallard/Maryland/786/2 1 3.02e-04 . 
GGAGCAATAGCAGGATTCATA -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- AHAH5_[11734;11734] 3.1e-19 [1] A/environment/New_York/1 7.2e-13 [1] A/Northern_shoveler/Utah 1.1e-12 [1] A/gadwall/California/442 2.1e-12 [1] A/wigeon/Ohio/379/1988|C 2.8e-12 [1] A/wild_bird/Minnesota/46 4.2e-12 [1] A/green_winged_teal/Dela 5.4e-12 [1] A/wood_duck/MD/04623/200 9.9e-12 [1] A/mallard/Washington/454 9.9e-12 [1] A/tundra_swan/Alaska//48 1.3e-11 [1] A/duck/Victoria/26/1981| 1.3e-11 [1] A/cinnamon_teal/Californ 1.5e-11 [1] A/turkey/TX/14082/1982_H 1.9e-11 [1] A/avian/Missouri/4655937 1.9e-11 [1] A/unknown/NY/98996/01|AY 1.9e-11 [1] A/duck/Hong_Kong/698/197 2.1e-11 [1] A/mallard/Sweden/21/2002 2.7e-11 [1] A/environment/Maryland/1 3e-11 [1] A/chicken/Texas/1672804/ 3.6e-11 [1] A/goose/Germany/R3160/09 4.1e-11 [1] A/mallard/Netherlands/2/ 5.8e-11 [1] A/chicken/Ibaraki/17/200 5.8e-11 [1] A/shearwater/Australia/7 6.5e-11 [1] A/mallard/Crimea/245/200 6.5e-11 [1] A/duck/France/080036/200 8.2e-11 [1] A/duck/Italy/775/2004|CY 9.2e-11 [1] A/duck/Hokkaido/Vac3/200 1.3e-10 [1] A/parrot/CA/6032/04|DQ25 1.7e-10 [1] A/chicken/Ibaraki/15/200 1.7e-10 [1] A/swan/Hokkaido/67/1996| 2.5e-10 [1] A/duck/France/090043/200 2.9e-10 [1] A/spurwinged_goose/Niger 2.9e-10 [1] A/duck/France/05056a/200 4.7e-10 [1] A/chicken/Texas/2983132/ 5.2e-10 [1] A/duck/New_Zealand/41/19 6.3e-10 [1] A/chicken/Belgium/150VB/ 7.8e-10 [1] A/duck/France/06436/2006 8.6e-10 [1] A/Muscovy_duck/France/07 1e-09 [1] A/gull/Pennsylvania/4175 1e-09 [1] A/chicken/Scotland/59|X0 5.5e-09 [1] A/swan/England/AV3142149 5.9e-09 [1] A/duck/Hunan/149/2005 8.4e-09 [1] A/bar_headed_goose/Mongo 2e-08 [1] A/chick/Pennsylvania/1/1 2e-08 [1] 
A/whooper_swan/Mongolia/ 5.1e-08 [1] A/chicken/VA/40018/1984_ 8e-08 [1] A/mallard/Maryland/786/2 0.0003 [1] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 1 width=21 seqs=47 AHAH5_[11734;11734] ( 1) CNAAKWGAARGAGADRDNRKR 1 A/environment/New_York/1 ( 1) AGAGAAACAAGAGGCCTATTT 1 A/Northern_shoveler/Utah ( 1) AGAGAAACAAGAGGCCTCTTT 1 A/gadwall/California/442 ( 1) AGGGAAACAAGAGGCCTATTT 1 A/wigeon/Ohio/379/1988|C ( 1) AGAGAAACAAGAGGCCTGTTT 1 A/wild_bird/Minnesota/46 ( 1) AGAGAAACAAGAGGTCTATTT 1 A/green_winged_teal/Dela ( 1) AGAAAAACAAGAGGCCTATTT 1 A/wood_duck/MD/04623/200 ( 1) AGAGAAACAAGAGGACTATTT 1 A/mallard/Washington/454 ( 1) AAAGAAACAAGAGGCCTATTT 1 A/tundra_swan/Alaska//48 ( 1) AGAGGAACAAGAGGCCTATTT 1 A/duck/Victoria/26/1981| ( 1) AGGGAGACAAGAGGCCTATTT 1 A/cinnamon_teal/Californ ( 1) AAAGAAACAAGAGGCCTCTTT 1 A/turkey/TX/14082/1982_H ( 1) AGAGAAACAAGAGGTCTGTTT 1 A/avian/Missouri/4655937 ( 1) AGAGAAACAAGAGGCTTATTT 1 A/unknown/NY/98996/01|AY ( 1) GGAGAAACAAGAGGCCTATTT 1 A/duck/Hong_Kong/698/197 ( 1) AGAGAGACAAGAGGTCTATTT 1 A/mallard/Sweden/21/2002 ( 1) CGAGAAACAAGAGGACTATTT 1 A/environment/Maryland/1 ( 1) AGAGAAACAAGAGGCCCATTT 1 A/chicken/Texas/1672804/ ( 1) AGAGAAAAAAGAGGCCTATTT 1 A/goose/Germany/R3160/09 ( 1) AGAGAGACAAGAGGACTATTT 1 A/mallard/Netherlands/2/ ( 1) AGAAAAACAAGAGGACTATTT 1 A/chicken/Ibaraki/17/200 ( 1) AGGGAAACAAGAGGCTTATTT 1 A/shearwater/Australia/7 ( 1) AGGGAGACAAGAGGTCTATTT 1 A/mallard/Crimea/245/200 ( 1) AGAGAAACAAGAGGGCTATTT 1 A/duck/France/080036/200 ( 1) AAAGAAACAAGAGGACTATTT 1 A/duck/Italy/775/2004|CY ( 1) AGAGATACAAGAGGACTATTT 1 A/duck/Hokkaido/Vac3/200 ( 1) AGAGAAACAAGAGGACTATTC 1 A/parrot/CA/6032/04|DQ25 ( 1) AAAGAAACAAGAGGCTTATTT 1 A/chicken/Ibaraki/15/200 ( 1) AGAGAAACAAGAGGATTATTT 1 A/swan/Hokkaido/67/1996| ( 1) 
AGAGAAACAAGAGGACTACTT 1 A/duck/France/090043/200 ( 1) AGAGAGACGAGAGGACTATTT 1 A/spurwinged_goose/Niger ( 1) AGAGAAAAAAGAGGACTATTT 1 A/duck/France/05056a/200 ( 1) AAAGAAACAAGAGGGCTATTT 1 A/chicken/Texas/2983132/ ( 1) AGGAAAAAAAGAGGCCTATTT 1 A/duck/New_Zealand/41/19 ( 1) AGAGACACAAGGGGCCTGTTT 1 A/chicken/Belgium/150VB/ ( 1) AAGGAGACAAGAGGACTATTT 1 A/duck/France/06436/2006 ( 1) AAAGAAACAAGAGGACTATTC 1 A/Muscovy_duck/France/07 ( 1) AAAGAAGCAAGAGGACTATTT 1 A/gull/Pennsylvania/4175 ( 1) AGAGAAACAAAAGGTCTGTTT 1 A/chicken/Scotland/59|X0 ( 1) AGGAAGAAAAGAGGTCTATTT 1 A/swan/England/AV3142149 ( 1) AGAAAAAAGAGAGGACTATTT 1 A/duck/Hunan/149/2005 ( 1) AGAAGAAAAAGAGGACTATTT 1 A/bar_headed_goose/Mongo ( 1) AGAAGAAAAAGAGGACTGTTT 1 A/chick/Pennsylvania/1/1 ( 1) AGGAAAAAGAGAGGTCTGTTT 1 A/whooper_swan/Mongolia/ ( 1) ATTGAAACTAGAGGATTATTT 1 A/chicken/VA/40018/1984_ ( 1) AAGAAAAAGAGAGGTCTGTTT 1 A/mallard/Maryland/786/2 ( 1) GGAGCAATAGCAGGATTCATA 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 21 n= 253 bayes= 3.90811 E= 5.0e-239 90 -56 -60 174 142 -291 71 203 -4 259 154 54 235 175 -78 517 -525 377 -128 -25 -167 -590 204 358 201 124 239 247 205 279 197 509 293 282 96 508 -273 342 7 107 69 -571 -100 118 89 -81 33 138 -66 196 96 34 230 146 -121 493 -238 319 -177 -72 -121 -641 -19 153 68 123 59 106 -9 161 58 61 191 129 -94 427 -609 255 -157 -60 83 -161 94 349 310 -252 214 386 505 442 329 162 273 302 74 629 -443 554 32 133 68 -158 64 317 279 -135 185 354 146 410 298 139 264 276 44 606 -243 522 559 103 98 -625 -131 77 52 -292 2 93 -101 157 47 -17 181 98 -159 434 -598 268 -202 -104 -115 359 -108 110 85 -668 28 137 -74 191 90 29 229 142 -126 489 -238 315 -182 -76 87 -557 -81 145 114 -196 51 169 -36 226 123 44 232 159 -101 504 
-238 347 -154 -50 96 -547 -53 178 143 -381 73 204 8 262 154 52 223 173 421 508 -536 377 -125 -23 -438 -209 245 398 231 148 276 278 248 305 226 301 312 313 137 513 -552 353 47 144 101 -633 -136 70 46 -385 -3 85 -108 150 39 -23 176 92 -165 427 -605 260 -206 -109 -965 -907 -110 29 -56 159 -32 -75 -117 17 -143 -80 40 -20 -205 127 -901 -11 -235 -158 -440 -623 223 375 209 151 254 254 225 283 202 277 289 290 115 487 -578 328 25 122 -26 251 481 124 87 -290 39 134 -62 192 93 49 232 149 -120 493 46 315 -180 -72 -681 378 -119 89 66 -770 18 119 -55 163 45 -27 152 93 418 377 -1 226 -201 -78 -679 -162 475 225 185 -776 161 323 128 289 245 216 224 268 11 592 292 386 -74 30 65 1 -57 166 131 -119 67 189 -14 246 142 523 236 172 -86 513 -526 365 -139 -34 -422 -164 71 261 227 -711 195 367 169 334 292 251 273 311 415 642 287 437 -36 66 -891 -794 -111 53 23 -922 11 119 524 117 38 15 80 88 -142 375 299 178 -207 -111 -420 -68 64 254 221 -707 188 359 160 327 284 243 270 304 416 635 284 431 -43 60 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 21 nsites= 47 E= 5.0e-239 0.914894 0.042553 0.000000 0.000000 0.000000 0.042553 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.191489 0.000000 0.000000 0.000000 0.000000 0.765957 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.787234 0.000000 0.000000 0.000000 0.000000 0.191489 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.212766 0.000000 0.000000 0.000000 0.000000 0.787234 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.893617 0.021277 0.000000 0.000000 0.000000 0.063830 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.787234 0.021277 0.000000 0.000000 0.000000 0.148936 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.021277 0.000000 0.957447 0.000000 0.000000 0.000000 0.000000 0.042553 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.212766 0.765957 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.893617 0.000000 0.000000 0.000000 0.000000 0.085106 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.957447 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.021277 0.000000 0.000000 0.000000 0.957447 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.978723 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.978723 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.404255 0.361702 0.021277 0.000000 0.000000 0.042553 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 
0.000000 0.170213 0.000000 0.000000 0.000000 0.000000 0.851064 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.127660 0.000000 0.000000 0.000000 0.000000 0.021277 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.957447 0.000000 0.000000 0.000000 0.765957 0.063830 0.000000 0.000000 0.000000 0.148936 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.936170 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.978723 0.000000 0.000000 0.000000 0.021277 0.042553 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021277 0.000000 0.914894 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 1 regular expression -------------------------------------------------------------------------------- AGA[GA]AAA[CA]AAGAGG[AC]CTATTT -------------------------------------------------------------------------------- Time 54.91 secs. 
******************************************************************************** ******************************************************************************** MOTIF 2 width = 21 sites = 93 llr = 1061 E-value = 4.5e-207 ******************************************************************************** -------------------------------------------------------------------------------- Motif 2 Description -------------------------------------------------------------------------------- Simplified A 8:9:a28189:aa19a89a75 pos.-specific C ::::::::1:::::::::::: probability D ::::::::::::::::::::: matrix E ::::::::::::::::::::: F ::::::::::::::::::::: G 291a:82911a::91:21:35 H ::::::::::::::::::::: I ::::::::::::::::::::: K ::::::::::::::::::::: L ::::::::::::::::::::: M ::::::::::::::::::::: N ::::::::::::::::::::: P ::::::::::::::::::::: Q ::::::::::::::::::::: R ::::::::::::::::::::: S ::::::::::::::::::::: T :1::::::::::::::::::: V ::::::::::::::::::::: W ::::::::::::::::::::: Y ::::::::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (16.5 bits) 5.0 3.7 2.5 1.2 * *** * ******* * 0.0 --------------------- Multilevel AGAGAGAGAAGAAGAAAAAAG consensus G AG GA sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------------- A/great_cormorant/Tibet/ 1 3.08e-09 . GGARAGAGAAGAAGAAAAAAG A/chicken/Shan/2626/2007 1 1.84e-08 . AGAGAGAGAAGAAGAAAAAAG A/duck/Vietnam/OIE1287/2 1 3.14e-08 . AGAGAGAGAAGAAGAAAAAGG A/muscovy_duck/Jakarta/S 1 5.58e-08 . AGAGAGAGCAGAAGAAAAAAG A/Vietnam/UT3030/2003 1 5.58e-08 . AGAGAGAGAAGAAGAAAAAAA A/chicken/Magelang/BBVW6 1 7.67e-08 . AGAGAGAGCAGAAGAAAAAGG A/chicken/Vietnam/NCVD19 1 7.67e-08 . 
AGAGAGAGAAGAAGAAAAAGA A/duck/Bac_Lieu/1213/200 1 9.16e-08 . AGAGAGGGAAGAAGAAAAAAG A/chicken/Sharkia/CAI41/ 1 1.04e-07 . GGAGAGAGAAGAAGAAAAAAG A/chicken/Tabanan/BBVD14 1 2.08e-07 . AGAGAGAGAAGAAGAAAGAAG A/duck/Jiangxi/80/2005 1 2.08e-07 . AGAGAAAGAAGAAGAAAAAAG A/chicken/Badung/BBVD302 1 2.17e-07 . ATAGAGAGAAGAAGAAAAAAG A/chicken/Banten/PdglKas 1 2.53e-07 . AGAGAGAGGAGAAGAAAAAAG A/o.bill_stork/Thailand/ 1 2.53e-07 . AGAGAGAGAAGACGAAAAAAG A/chicken/Bangladesh/150 1 3.10e-07 . GGAGAGAGAAGAAGAAAAAGA A/chicken/Yangon/182/201 1 3.10e-07 . AGAGAGAGAAGAAGAAGAAAA A/chicken/Shandong/A1/20 1 3.10e-07 . AGAGAAAGAAGAAGAAAAAGG A/duck/Qalubia/CAI11/201 1 3.89e-07 . GGAGAGGGAAGAAGAAAAAAG A/Indonesia/UT3006/2005 1 3.89e-07 . AGAGAAAGCAGAAGAAAAAAG A/goose/Fujian/bb/2003 1 3.89e-07 . AGAGAGAGAAGAAGGAAAAAG A/environment/Thailand/I 1 4.76e-07 . AGAGAGAGAGGAAGAAAAAAG A/chicken/EastKalimantan 1 4.76e-07 . AGAGAGAGTAGAAGAAAAAAG A/swine/NorthSumatra/UT6 1 4.76e-07 . AGGGAGAGAAGAAGAAAAAAG A/duck/Cao_Bang/43/2007 1 5.46e-07 . AGAGAAAGAAGAAGAAAAAGA A/muscovy_duck/Vietnam/N 1 6.23e-07 . AGAGAAGGAAGAAGAAAAAAG A/Indonesia/NIHRD12379/2 1 6.23e-07 . AGAGAGAGCAGAAGGAAAAAG A/Egypt/321NAMRU3/2007 1 6.23e-07 . GGAGAGAGAAGAAGAAGAAAG A/chicken/India/81766/20 1 7.12e-07 . GGAGAGAGAAGAAGAAAGAAG A/chicken/Bangladesh/967 1 7.12e-07 . GGAGAAAGAAGAAGAAAAAAG A/chicken/Nepal/T1P/12 1 7.12e-07 . AGAGAGAGGAGAAGAAAAAGA A/Thailand/WRAIR1720H/20 1 7.12e-07 . AGAGAGAAAAGAAGAAAAAAG A/goose/Guiyang/337/2006 1 7.95e-07 . AGAGAGAGAAGGAGAAAAAAG A/Muscovy_duck/Ca_Mau/11 1 7.95e-07 . AGAGAGGGGAGAAGAAAAAAG A/owstons_civet/VietNam/ 1 8.77e-07 . AGAGAGAGAAGAAGGAAAAGA A/chicken/Bangladesh/152 1 1.20e-06 . AGGGAGAGAAGAAGAAAAAGA A/chicken/Liaoning/23/20 1 1.20e-06 . GGAGAGAGAAGAAGGAAAAAG A/condor/Guangdong/139/2 1 1.20e-06 . AGAGGGAGAAGAAGAAAAAAG A/muscovy_duck/Vietnam/L 1 1.20e-06 . ATAGAGAGAAGAAGAAGAAAG A/chicken/Hebei/326/2005 1 1.32e-06 . AGAGAGGGAGGAAGAAAAAAG A/chicken/Inhu/BPPVRII/2 1 1.32e-06 . 
AGGGAGGGAAGAAGAAAAAAG A/chicken/WestBengal/239 1 1.45e-06 . GGAGAGAGGAGAAGAAAAAAA A/bird/Turkey/Unye_ist06 1 1.45e-06 . GGGGAGAGAAGAAGAAAAAAG A/chicken/Sheny/0606/200 1 1.67e-06 . AGAGAGGGAAGAAGGAAAAAA A/chicken/Miyazaki/T10/2 1 1.67e-06 . AGAGAGAAAAGAAGAAAAAGA A/duck/Guangxi/668/2004 1 1.67e-06 . AGAGAGATAAGAAGAAAAAAG A/Cambodia/W0526301/2012 1 1.92e-06 . AGAGAGGAAAGAAGAAAAAAG A/chicken/Egypt/11VIR445 1 1.92e-06 . GGAGAGGGAAGAAGAAAGAAG A/chicken/Egypt/113Q/201 1 1.92e-06 . GGAGAAGGAAGAAGAAAAAAG A/Hubei/1/2010 1 1.92e-06 . AGAGAGAGAAGGAGAAAAAGA A/duck/Vietnam/NCVD1463/ 1 1.92e-06 . ATAGAGAGAAGAAGAAGAAAA A/chicken/Shandong/A10/2 1 1.92e-06 . AGAGGGAGAAGAAGAAAAAAA A/chicken/Cambodia/67F1/ 1 2.10e-06 . AGAGAGGGAAGGAGAAAAAAG A/chicken/Liaoning/A1/20 1 2.10e-06 . AGAGAGGGAGGAAGAAAAAAA A/chicken/Egypt/1158SF/2 1 2.10e-06 . GGAGAGAAAAGAAGAAAAAAG A/Shandong/1/2009 1 2.34e-06 . AGAGAAAGAAGAAGAAGAAGA A/chicken/Egypt/1085/201 1 2.60e-06 . GGAGAGGGAAGAAGAAGAAAA A/chicken/Hebei/A8/2009 1 2.60e-06 . AGAGGGAGAAGAAGAAAAAGA A/chicken/CentralJava/UT 1 2.60e-06 . AGAGAGAGAAGAAAAAAAAGA A_DISC/Cambodia/V0401301 1 2.91e-06 . AGAGTGGGAAGAAGAAAAAAG A/chicken/Lampung/BPPVRI 1 2.91e-06 . AGAGAAAGCAGAAGGAAAAAG A/chicken/Vietnam/NCVD40 1 3.27e-06 . AGAGAAAGGAGAAGAAAAAGA A/duck/Vietnam/NCVD366/2 1 3.27e-06 . AGAGATGGAAGAAGAAAAAAG A/duck/Yunnan/47/2006 1 3.27e-06 . AGAAAGAGAAGAAGAAAAAAG A/chicken/Denpasar/BBVD1 1 3.61e-06 . AGAGAGAGAAGAAGAGAAAAG A/Hunan/1/2009 1 3.93e-06 . AGAGAAAGAAGAAGGAAAAGA A/chicken/Vietnam/NCVD09 1 4.40e-06 . ATAGAGGGAAGAAGAAGAAAA A/duck/Guangxi/951/2005 1 5.43e-06 . AGTGAAAGAAGAAGAAAAAGA A/environment/Bangladesh 1 5.95e-06 . AGGGAGAGGAGAAGAAAAAGA A/environment/ChangSha/2 1 5.95e-06 . AGAGAGAGAAGAGGAAAAAGA A/chicken/India/241272/2 1 5.95e-06 . GGAGAGAGGAGAAGAAAGAAA A/quail/Thanatpin/2283/2 1 6.62e-06 . AGAGAAAAAAGAAGAAAAAGA A/chicken/Sikkim/151466/ 1 6.62e-06 . GGTGAGAGAAGAAGAAAGAAG A/chicken/Bangladesh/830 1 6.62e-06 . 
GGAGAGAGAAGAAAAAAAAGA A/chicken/Shandong/A5/20 1 7.33e-06 . AGAGAGGGAGGAAGAAGAAAA A/domestic_goose/Hong_Ko 1 8.76e-06 . ATAGAGAGGAGAAGAAGAAAA A/chicken/Egypt/10512AG/ 1 9.60e-06 . GAAGAGGGAAGAAGAAAAAAG A/duck/Guangxi/13/2004 1 9.60e-06 . AGAGAAAGAAGAAAAAAAAGA A/Vietnam/UT30259/2004 1 9.60e-06 . AGAGAGAGAAGAAAAAAGAGA A/spurwinged_goose/Niger 1 1.16e-05 . AGAGAAAAAAGAAGAAAGAAG A/chicken/Vietnam/NCVD18 1 1.27e-05 . AAAGAAAGAAGAAGAAAAAGA A/duck/Vietnam/NCVD1026/ 1 1.27e-05 . ATAGAGAGAGGAAGAAGAAAA A/chicken/WestJava/SmiAc 1 1.27e-05 . AGAGAGAGCAGACGTAAAAAA A/Vietnam/HN36250/2010 1 1.51e-05 . AGAGAAAAAAGAAGAAGAAAA A/chicken/Vietnam/NCVD40 1 1.66e-05 . AGAGAAAGGAGAAGGAAAAGA A/duck/Vietnam/NCVD422/2 1 1.96e-05 . AGGGAAAGGAGAAGAAAAAGA A/barn_swallow/Hong_Kong 1 2.31e-05 . ATAGAAAGAAGAAGAAGGAAA A/ck/Indonesia/072/10 1 2.31e-05 . AAGGAGGGAAGAAGAAAAAAG A/chicken/Yangon/1023/20 1 2.51e-05 . AGAGAGAAAGGAAGAAGAAAA A/chicken/Vietnam/NCVD01 1 2.94e-05 . AGAGAGGGAGGAAGAAGGAGA A/Egypt/4822NAMRU3/2009 1 4.03e-05 . GGAAAGAGTAGAAGAAAAAAG A/chicken/Korea/ES/03 1 5.89e-05 . AGAGAGAAAAGAAAAAAGAGA A/duck/Viet_Nam/TG2401/2 1 6.33e-05 . AGAGAGAGAAGGAAAAAGAGA A/whitefaced_whistling_d 1 1.11e-04 . 
AAAGAAAAAAGAAGAAAGAAG -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/great_cormorant/Tibet/ 3.1e-09 [2] A/chicken/Shan/2626/2007 1.8e-08 [2] A/duck/Vietnam/OIE1287/2 3.1e-08 [2] A/muscovy_duck/Jakarta/S 5.6e-08 [2] A/Vietnam/UT3030/2003 5.6e-08 [2] A/chicken/Magelang/BBVW6 7.7e-08 [2] A/chicken/Vietnam/NCVD19 7.7e-08 [2] A/duck/Bac_Lieu/1213/200 9.2e-08 [2] A/chicken/Sharkia/CAI41/ 1e-07 [2] A/chicken/Tabanan/BBVD14 2.1e-07 [2] A/duck/Jiangxi/80/2005 2.1e-07 [2] A/chicken/Badung/BBVD302 2.2e-07 [2] A/chicken/Banten/PdglKas 2.5e-07 [2] A/o.bill_stork/Thailand/ 2.5e-07 [2] A/chicken/Bangladesh/150 3.1e-07 [2] A/chicken/Yangon/182/201 3.1e-07 [2] A/chicken/Shandong/A1/20 3.1e-07 [2] A/duck/Qalubia/CAI11/201 3.9e-07 [2] A/Indonesia/UT3006/2005 3.9e-07 [2] A/goose/Fujian/bb/2003 3.9e-07 [2] A/environment/Thailand/I 4.8e-07 [2] A/chicken/EastKalimantan 4.8e-07 [2] A/swine/NorthSumatra/UT6 4.8e-07 [2] A/duck/Cao_Bang/43/2007 5.5e-07 [2] A/muscovy_duck/Vietnam/N 6.2e-07 [2] A/Indonesia/NIHRD12379/2 6.2e-07 [2] A/Egypt/321NAMRU3/2007 6.2e-07 [2] A/chicken/India/81766/20 7.1e-07 [2] A/chicken/Bangladesh/967 7.1e-07 [2] A/chicken/Nepal/T1P/12 7.1e-07 [2] A/Thailand/WRAIR1720H/20 7.1e-07 [2] A/goose/Guiyang/337/2006 7.9e-07 [2] A/Muscovy_duck/Ca_Mau/11 7.9e-07 [2] A/owstons_civet/VietNam/ 8.8e-07 [2] A/chicken/Bangladesh/152 1.2e-06 [2] A/chicken/Liaoning/23/20 1.2e-06 [2] A/condor/Guangdong/139/2 1.2e-06 [2] A/muscovy_duck/Vietnam/L 1.2e-06 [2] A/chicken/Hebei/326/2005 1.3e-06 [2] A/chicken/Inhu/BPPVRII/2 1.3e-06 [2] A/chicken/WestBengal/239 1.5e-06 [2] A/bird/Turkey/Unye_ist06 1.5e-06 [2] A/chicken/Sheny/0606/200 1.7e-06 [2] A/chicken/Miyazaki/T10/2 1.7e-06 [2] 
A/duck/Guangxi/668/2004 1.7e-06 [2] A/Cambodia/W0526301/2012 1.9e-06 [2] A/chicken/Egypt/11VIR445 1.9e-06 [2] A/chicken/Egypt/113Q/201 1.9e-06 [2] A/Hubei/1/2010 1.9e-06 [2] A/duck/Vietnam/NCVD1463/ 1.9e-06 [2] A/chicken/Shandong/A10/2 1.9e-06 [2] A/chicken/Cambodia/67F1/ 2.1e-06 [2] A/chicken/Liaoning/A1/20 2.1e-06 [2] A/chicken/Egypt/1158SF/2 2.1e-06 [2] A/Shandong/1/2009 2.3e-06 [2] A/chicken/Egypt/1085/201 2.6e-06 [2] A/chicken/Hebei/A8/2009 2.6e-06 [2] A/chicken/CentralJava/UT 2.6e-06 [2] A_DISC/Cambodia/V0401301 2.9e-06 [2] A/chicken/Lampung/BPPVRI 2.9e-06 [2] A/chicken/Vietnam/NCVD40 3.3e-06 [2] A/duck/Vietnam/NCVD366/2 3.3e-06 [2] A/duck/Yunnan/47/2006 3.3e-06 [2] A/chicken/Denpasar/BBVD1 3.6e-06 [2] A/Hunan/1/2009 3.9e-06 [2] A/chicken/Vietnam/NCVD09 4.4e-06 [2] A/duck/Guangxi/951/2005 5.4e-06 [2] A/environment/Bangladesh 6e-06 [2] A/environment/ChangSha/2 6e-06 [2] A/chicken/India/241272/2 6e-06 [2] A/quail/Thanatpin/2283/2 6.6e-06 [2] A/chicken/Sikkim/151466/ 6.6e-06 [2] A/chicken/Bangladesh/830 6.6e-06 [2] A/chicken/Shandong/A5/20 7.3e-06 [2] A/domestic_goose/Hong_Ko 8.8e-06 [2] A/chicken/Egypt/10512AG/ 9.6e-06 [2] A/duck/Guangxi/13/2004 9.6e-06 [2] A/Vietnam/UT30259/2004 9.6e-06 [2] A/spurwinged_goose/Niger 1.2e-05 [2] A/chicken/Vietnam/NCVD18 1.3e-05 [2] A/duck/Vietnam/NCVD1026/ 1.3e-05 [2] A/chicken/WestJava/SmiAc 1.3e-05 [2] A/Vietnam/HN36250/2010 1.5e-05 [2] A/chicken/Vietnam/NCVD40 1.7e-05 [2] A/duck/Vietnam/NCVD422/2 2e-05 [2] A/barn_swallow/Hong_Kong 2.3e-05 [2] A/ck/Indonesia/072/10 2.3e-05 [2] A/chicken/Yangon/1023/20 2.5e-05 [2] A/chicken/Vietnam/NCVD01 2.9e-05 [2] A/Egypt/4822NAMRU3/2009 4e-05 [2] A/chicken/Korea/ES/03 5.9e-05 [2] A/duck/Viet_Nam/TG2401/2 6.3e-05 [2] A/whitefaced_whistling_d 0.00011 [2] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 in BLOCKS format 
-------------------------------------------------------------------------------- BL MOTIF 2 width=21 seqs=93 A/great_cormorant/Tibet/ ( 1) GGARAGAGAAGAAGAAAAAAG 1 A/chicken/Shan/2626/2007 ( 1) AGAGAGAGAAGAAGAAAAAAG 1 A/duck/Vietnam/OIE1287/2 ( 1) AGAGAGAGAAGAAGAAAAAGG 1 A/muscovy_duck/Jakarta/S ( 1) AGAGAGAGCAGAAGAAAAAAG 1 A/Vietnam/UT3030/2003 ( 1) AGAGAGAGAAGAAGAAAAAAA 1 A/chicken/Magelang/BBVW6 ( 1) AGAGAGAGCAGAAGAAAAAGG 1 A/chicken/Vietnam/NCVD19 ( 1) AGAGAGAGAAGAAGAAAAAGA 1 A/duck/Bac_Lieu/1213/200 ( 1) AGAGAGGGAAGAAGAAAAAAG 1 A/chicken/Sharkia/CAI41/ ( 1) GGAGAGAGAAGAAGAAAAAAG 1 A/chicken/Tabanan/BBVD14 ( 1) AGAGAGAGAAGAAGAAAGAAG 1 A/duck/Jiangxi/80/2005 ( 1) AGAGAAAGAAGAAGAAAAAAG 1 A/chicken/Badung/BBVD302 ( 1) ATAGAGAGAAGAAGAAAAAAG 1 A/chicken/Banten/PdglKas ( 1) AGAGAGAGGAGAAGAAAAAAG 1 A/o.bill_stork/Thailand/ ( 1) AGAGAGAGAAGACGAAAAAAG 1 A/chicken/Bangladesh/150 ( 1) GGAGAGAGAAGAAGAAAAAGA 1 A/chicken/Yangon/182/201 ( 1) AGAGAGAGAAGAAGAAGAAAA 1 A/chicken/Shandong/A1/20 ( 1) AGAGAAAGAAGAAGAAAAAGG 1 A/duck/Qalubia/CAI11/201 ( 1) GGAGAGGGAAGAAGAAAAAAG 1 A/Indonesia/UT3006/2005 ( 1) AGAGAAAGCAGAAGAAAAAAG 1 A/goose/Fujian/bb/2003 ( 1) AGAGAGAGAAGAAGGAAAAAG 1 A/environment/Thailand/I ( 1) AGAGAGAGAGGAAGAAAAAAG 1 A/chicken/EastKalimantan ( 1) AGAGAGAGTAGAAGAAAAAAG 1 A/swine/NorthSumatra/UT6 ( 1) AGGGAGAGAAGAAGAAAAAAG 1 A/duck/Cao_Bang/43/2007 ( 1) AGAGAAAGAAGAAGAAAAAGA 1 A/muscovy_duck/Vietnam/N ( 1) AGAGAAGGAAGAAGAAAAAAG 1 A/Indonesia/NIHRD12379/2 ( 1) AGAGAGAGCAGAAGGAAAAAG 1 A/Egypt/321NAMRU3/2007 ( 1) GGAGAGAGAAGAAGAAGAAAG 1 A/chicken/India/81766/20 ( 1) GGAGAGAGAAGAAGAAAGAAG 1 A/chicken/Bangladesh/967 ( 1) GGAGAAAGAAGAAGAAAAAAG 1 A/chicken/Nepal/T1P/12 ( 1) AGAGAGAGGAGAAGAAAAAGA 1 A/Thailand/WRAIR1720H/20 ( 1) AGAGAGAAAAGAAGAAAAAAG 1 A/goose/Guiyang/337/2006 ( 1) AGAGAGAGAAGGAGAAAAAAG 1 A/Muscovy_duck/Ca_Mau/11 ( 1) AGAGAGGGGAGAAGAAAAAAG 1 A/owstons_civet/VietNam/ ( 1) AGAGAGAGAAGAAGGAAAAGA 1 A/chicken/Bangladesh/152 ( 1) AGGGAGAGAAGAAGAAAAAGA 1 
A/chicken/Liaoning/23/20 ( 1) GGAGAGAGAAGAAGGAAAAAG 1 A/condor/Guangdong/139/2 ( 1) AGAGGGAGAAGAAGAAAAAAG 1 A/muscovy_duck/Vietnam/L ( 1) ATAGAGAGAAGAAGAAGAAAG 1 A/chicken/Hebei/326/2005 ( 1) AGAGAGGGAGGAAGAAAAAAG 1 A/chicken/Inhu/BPPVRII/2 ( 1) AGGGAGGGAAGAAGAAAAAAG 1 A/chicken/WestBengal/239 ( 1) GGAGAGAGGAGAAGAAAAAAA 1 A/bird/Turkey/Unye_ist06 ( 1) GGGGAGAGAAGAAGAAAAAAG 1 A/chicken/Sheny/0606/200 ( 1) AGAGAGGGAAGAAGGAAAAAA 1 A/chicken/Miyazaki/T10/2 ( 1) AGAGAGAAAAGAAGAAAAAGA 1 A/duck/Guangxi/668/2004 ( 1) AGAGAGATAAGAAGAAAAAAG 1 A/Cambodia/W0526301/2012 ( 1) AGAGAGGAAAGAAGAAAAAAG 1 A/chicken/Egypt/11VIR445 ( 1) GGAGAGGGAAGAAGAAAGAAG 1 A/chicken/Egypt/113Q/201 ( 1) GGAGAAGGAAGAAGAAAAAAG 1 A/Hubei/1/2010 ( 1) AGAGAGAGAAGGAGAAAAAGA 1 A/duck/Vietnam/NCVD1463/ ( 1) ATAGAGAGAAGAAGAAGAAAA 1 A/chicken/Shandong/A10/2 ( 1) AGAGGGAGAAGAAGAAAAAAA 1 A/chicken/Cambodia/67F1/ ( 1) AGAGAGGGAAGGAGAAAAAAG 1 A/chicken/Liaoning/A1/20 ( 1) AGAGAGGGAGGAAGAAAAAAA 1 A/chicken/Egypt/1158SF/2 ( 1) GGAGAGAAAAGAAGAAAAAAG 1 A/Shandong/1/2009 ( 1) AGAGAAAGAAGAAGAAGAAGA 1 A/chicken/Egypt/1085/201 ( 1) GGAGAGGGAAGAAGAAGAAAA 1 A/chicken/Hebei/A8/2009 ( 1) AGAGGGAGAAGAAGAAAAAGA 1 A/chicken/CentralJava/UT ( 1) AGAGAGAGAAGAAAAAAAAGA 1 A_DISC/Cambodia/V0401301 ( 1) AGAGTGGGAAGAAGAAAAAAG 1 A/chicken/Lampung/BPPVRI ( 1) AGAGAAAGCAGAAGGAAAAAG 1 A/chicken/Vietnam/NCVD40 ( 1) AGAGAAAGGAGAAGAAAAAGA 1 A/duck/Vietnam/NCVD366/2 ( 1) AGAGATGGAAGAAGAAAAAAG 1 A/duck/Yunnan/47/2006 ( 1) AGAAAGAGAAGAAGAAAAAAG 1 A/chicken/Denpasar/BBVD1 ( 1) AGAGAGAGAAGAAGAGAAAAG 1 A/Hunan/1/2009 ( 1) AGAGAAAGAAGAAGGAAAAGA 1 A/chicken/Vietnam/NCVD09 ( 1) ATAGAGGGAAGAAGAAGAAAA 1 A/duck/Guangxi/951/2005 ( 1) AGTGAAAGAAGAAGAAAAAGA 1 A/environment/Bangladesh ( 1) AGGGAGAGGAGAAGAAAAAGA 1 A/environment/ChangSha/2 ( 1) AGAGAGAGAAGAGGAAAAAGA 1 A/chicken/India/241272/2 ( 1) GGAGAGAGGAGAAGAAAGAAA 1 A/quail/Thanatpin/2283/2 ( 1) AGAGAAAAAAGAAGAAAAAGA 1 A/chicken/Sikkim/151466/ ( 1) GGTGAGAGAAGAAGAAAGAAG 1 A/chicken/Bangladesh/830 ( 1) 
GGAGAGAGAAGAAAAAAAAGA 1 A/chicken/Shandong/A5/20 ( 1) AGAGAGGGAGGAAGAAGAAAA 1 A/domestic_goose/Hong_Ko ( 1) ATAGAGAGGAGAAGAAGAAAA 1 A/chicken/Egypt/10512AG/ ( 1) GAAGAGGGAAGAAGAAAAAAG 1 A/duck/Guangxi/13/2004 ( 1) AGAGAAAGAAGAAAAAAAAGA 1 A/Vietnam/UT30259/2004 ( 1) AGAGAGAGAAGAAAAAAGAGA 1 A/spurwinged_goose/Niger ( 1) AGAGAAAAAAGAAGAAAGAAG 1 A/chicken/Vietnam/NCVD18 ( 1) AAAGAAAGAAGAAGAAAAAGA 1 A/duck/Vietnam/NCVD1026/ ( 1) ATAGAGAGAGGAAGAAGAAAA 1 A/chicken/WestJava/SmiAc ( 1) AGAGAGAGCAGACGTAAAAAA 1 A/Vietnam/HN36250/2010 ( 1) AGAGAAAAAAGAAGAAGAAAA 1 A/chicken/Vietnam/NCVD40 ( 1) AGAGAAAGGAGAAGGAAAAGA 1 A/duck/Vietnam/NCVD422/2 ( 1) AGGGAAAGGAGAAGAAAAAGA 1 A/barn_swallow/Hong_Kong ( 1) ATAGAAAGAAGAAGAAGGAAA 1 A/ck/Indonesia/072/10 ( 1) AAGGAGGGAAGAAGAAAAAAG 1 A/chicken/Yangon/1023/20 ( 1) AGAGAGAAAGGAAGAAGAAAA 1 A/chicken/Vietnam/NCVD01 ( 1) AGAGAGGGAGGAAGAAGGAGA 1 A/Egypt/4822NAMRU3/2009 ( 1) GGAAAGAGTAGAAGAAAAAAG 1 A/chicken/Korea/ES/03 ( 1) AGAGAGAAAAGAAAAAAGAGA 1 A/duck/Viet_Nam/TG2401/2 ( 1) AGAGAGAGAAGGAAAAAGAGA 1 A/whitefaced_whistling_d ( 1) AAAGAAAAAAGAAGAAAGAAG 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 21 n= 253 bayes= -2.7879 E= 4.5e-207 72 -749 -249 -50 -72 -70 -114 -39 -227 29 -84 -137 67 -22 -279 310 -719 136 -318 -223 -344 -673 49 210 76 140 95 123 55 161 75 116 181 151 -46 411 -81 242 -130 -31 90 -670 -203 16 -12 -213 -67 36 -169 94 -6 -67 132 46 -221 393 -243 218 -277 -173 -444 -670 187 339 169 151 216 216 189 242 164 241 247 252 316 445 -623 286 -13 84 98 -671 -204 15 -12 -331 -68 36 -169 94 -6 -68 131 45 -222 392 -334 217 -277 -173 -108 -673 -166 37 -7 119 -51 40 -141 96 -2 -46 134 53 -203 392 -335 216 -264 -160 70 -749 -249 -50 -72 -63 -114 -39 -227 29 -84 -137 67 -22 
-279 310 -719 136 -318 -223 -223 -672 58 219 82 140 103 129 64 166 80 124 185 157 -38 413 -345 245 -123 -24 74 5 -200 21 -6 -163 -64 43 -164 100 0 -63 136 50 -216 398 -243 224 -273 -168 94 -754 -252 -54 -75 -213 -117 -43 -231 25 -89 -142 63 -26 -282 305 -724 131 -320 -226 -1220 -1064 -309 -185 -204 159 -192 -270 -327 -131 -355 -312 -101 -197 -390 -105 -1086 -214 -381 -314 99 -759 -254 -57 -78 -293 -119 -48 -233 22 -93 -146 60 -30 -285 300 -729 126 -322 -228 100 -150 -207 12 -15 -476 -70 33 -172 91 -10 -72 128 42 -224 388 -643 213 -279 -175 -294 -754 33 188 41 148 74 81 36 119 29 90 133 115 -69 337 -716 174 -150 -55 90 -671 -204 15 -12 -194 -68 35 -170 93 -7 -67 132 45 -222 393 -334 217 -278 -173 104 -775 -262 -69 -89 -485 -127 -62 -244 10 -109 -160 48 -43 -296 283 -746 109 -329 -237 82 -750 -250 -51 -72 -114 -115 -40 -228 28 -85 -138 67 -23 -280 309 -720 135 -318 -224 85 -751 -250 -51 -73 -136 -115 -41 -228 27 -86 -139 66 -23 -280 309 -721 134 -319 -224 106 -975 -324 -187 -188 -1037 -192 -227 -336 -107 -299 -307 -74 -175 -389 40 -969 -109 -380 -309 52 -750 -249 -50 -72 -10 -115 -40 -228 28 -85 -138 67 -22 -279 310 -719 135 -318 -223 -6 -756 -252 -55 -77 69 -118 -45 -232 23 -91 -143 62 -28 -284 303 -726 129 -321 -227 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 21 nsites= 93 E= 4.5e-207 0.795699 0.000000 0.000000 0.000000 0.000000 0.204301 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.043011 0.000000 0.000000 0.000000 0.000000 0.881720 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.075269 0.000000 0.000000 0.000000 0.903226 0.000000 0.000000 0.000000 0.000000 
0.075269 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021505 0.000000 0.000000 0.000000 0.021505 0.000000 0.000000 0.000000 0.000000 0.967742 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.010753 0.000000 0.000000 0.000000 0.000000 0.000000 0.956989 0.000000 0.000000 0.000000 0.000000 0.032258 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.010753 0.000000 0.000000 0.000000 0.225806 0.000000 0.000000 0.000000 0.000000 0.763441 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.010753 0.000000 0.000000 0.000000 0.784946 0.000000 0.000000 0.000000 0.000000 0.215054 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.107527 0.000000 0.000000 0.000000 0.000000 0.881720 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.010753 0.000000 0.000000 0.000000 0.806452 0.064516 0.000000 0.000000 0.000000 0.107527 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.021505 0.000000 0.000000 0.000000 0.924731 0.000000 0.000000 0.000000 0.000000 0.075269 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.956989 0.000000 0.000000 0.000000 0.000000 0.043011 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.967742 0.021505 0.000000 0.000000 0.000000 0.010753 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.064516 0.000000 0.000000 0.000000 0.000000 0.935484 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.903226 0.000000 0.000000 0.000000 0.000000 0.086022 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.010753 0.000000 0.000000 0.000000 0.989247 0.000000 0.000000 0.000000 0.000000 0.010753 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.849462 0.000000 0.000000 0.000000 0.000000 0.150538 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.870968 0.000000 0.000000 0.000000 0.000000 0.129032 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.688172 0.000000 0.000000 0.000000 0.000000 0.311828 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.462366 0.000000 0.000000 0.000000 0.000000 0.537634 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 2 regular expression -------------------------------------------------------------------------------- [AG]GAGA[GA][AG]GAAGAAGAAAAA[AG][GA] -------------------------------------------------------------------------------- Time 56.02 secs. 
******************************************************************************** ******************************************************************************** MOTIF 3 width = 15 sites = 25 llr = 475 E-value = 2.5e-074 ******************************************************************************** -------------------------------------------------------------------------------- Motif 3 Description -------------------------------------------------------------------------------- Simplified A ::9:3::24::17:a pos.-specific C 8:::::3:::9::1: probability D ::::::::::::::: matrix E ::::::::::::::: F ::::::::::::::: G ::1::3786a::::: H ::::::::::::::: I ::::::::::::::: K ::::::::::::::: L ::::::::::::::: M ::::::::::::::: N ::::::::::::::: P ::::::::::::::: Q ::::::::::::::: R ::::::::::::::: S ::::::::::::::: T 2a:a77::::1939: V ::::::::::::::: W ::::::::::::::: Y ::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (27.4 bits) 5.0 3.7 * * 2.5 ** * ** * 1.2 ******** ****** 0.0 --------------- Multilevel CTATTTGGGGCTATA consensus AGCAA T sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------- A/chicken/Bangladesh/FD( 1 4.15e-11 . CTATTTGGGGCTATA GCAGGT A/tern/South_Africa/1959 4 4.15e-11 GGT CTATTTGGGGCTATA GCA A/mallard/Netherlands/3/ 4 4.15e-11 GGA CTATTTGGGGCTATA GCA A/duck/Primorie/2633/200 1 4.15e-11 . CTATTTGGGGCTATA GCAGGC A/wild_bird_feces/Cheons 7 4.15e-11 AGAGGA CTATTTGGGGCTATA A/chicken/Bangladesh/11r 1 8.33e-11 . CTATTTGGAGCTATA GCAGGT A/turkey/Egypt/091QNLQP/ 1 8.33e-11 . CTATTTGGAGCTATA GCAGGG A/duck/Hong_Kong/312/197 1 8.33e-11 . 
CTATTTGGAGCTATA GCAGGC A/chicken/Bangladesh/11r 4 8.33e-11 GGA CTATTTGGAGCTATA GCA A/chicken/Turkey/Misinli 7 8.33e-11 AGAGGA CTATTTGGAGCTATA A/wild_bird_feces/Byeong 4 6.63e-10 GGA CTATTTGGGGCCATA GCA A/ostrich/South_Africa/A 4 7.90e-10 GGA TTATTTGGGGCTATA GCA A/whooper_swan/Hokkaido/ 7 8.51e-10 AGAGGA CTGTTTGGAGCTATA A/chicken/Cambodia/TLC2/ 4 1.22e-09 GGA TTATTTGGAGCTATA GCA A/chicken/Cambodia/013LC 7 1.22e-09 AGGGGA TTATTTGGAGCTATA A/avian/New_York/Sg00387 4 3.23e-09 GGC CTATTTGGAGCAATA GCA A/avian/New_York/Sg00372 1 3.23e-09 . CTATTTGGAGCAATA GCAGGA A/chicken/Egypt/1219s/20 5 4.76e-09 GGAG CTATAGCAGGCTTTA TA A/chicken/Egypt/128s/201 2 4.76e-09 G CTATAGCAGGCTTTA TAGAG A/chicken/Bangladesh/11r 1 7.12e-09 . TTGTTTGGAGCTATA GCAGGG A/duck/Primorie/2621/200 2 1.97e-08 G CTATAGCAGGCTTCA TAGAA A/turkey/England/N28/73| 2 2.10e-08 G CTATAGCGGGTTTTA TAGAA A/chicken/Egypt/125s/201 5 5.87e-08 GGAG CTATAGCAGGTTTTA TA A/chicken/Egypt/398252/2 2 5.87e-08 G CTATAGCAGGTTTTA TAGAG A/chicken/Italy/9097/199 2 1.05e-07 G CCATAGCAGGCTTCA TAGAG -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/chicken/Bangladesh/FD( 4.2e-11 [3]_6 A/tern/South_Africa/1959 4.2e-11 3_[3]_3 A/mallard/Netherlands/3/ 4.2e-11 3_[3]_3 A/duck/Primorie/2633/200 4.2e-11 [3]_6 A/wild_bird_feces/Cheons 4.2e-11 6_[3] A/chicken/Bangladesh/11r 8.3e-11 [3]_6 A/turkey/Egypt/091QNLQP/ 8.3e-11 [3]_6 A/duck/Hong_Kong/312/197 8.3e-11 [3]_6 A/chicken/Bangladesh/11r 8.3e-11 3_[3]_3 A/chicken/Turkey/Misinli 8.3e-11 6_[3] A/wild_bird_feces/Byeong 6.6e-10 3_[3]_3 A/ostrich/South_Africa/A 7.9e-10 3_[3]_3 A/whooper_swan/Hokkaido/ 8.5e-10 6_[3] A/chicken/Cambodia/TLC2/ 1.2e-09 3_[3]_3 A/chicken/Cambodia/013LC 1.2e-09 6_[3] 
A/avian/New_York/Sg00387 3.2e-09 3_[3]_3 A/avian/New_York/Sg00372 3.2e-09 [3]_6 A/chicken/Egypt/1219s/20 4.8e-09 4_[3]_2 A/chicken/Egypt/128s/201 4.8e-09 1_[3]_5 A/chicken/Bangladesh/11r 7.1e-09 [3]_6 A/duck/Primorie/2621/200 2e-08 1_[3]_5 A/turkey/England/N28/73| 2.1e-08 1_[3]_5 A/chicken/Egypt/125s/201 5.9e-08 4_[3]_2 A/chicken/Egypt/398252/2 5.9e-08 1_[3]_5 A/chicken/Italy/9097/199 1.1e-07 1_[3]_5 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 3 width=15 seqs=25 A/chicken/Bangladesh/FD( ( 1) CTATTTGGGGCTATA 1 A/tern/South_Africa/1959 ( 4) CTATTTGGGGCTATA 1 A/mallard/Netherlands/3/ ( 4) CTATTTGGGGCTATA 1 A/duck/Primorie/2633/200 ( 1) CTATTTGGGGCTATA 1 A/wild_bird_feces/Cheons ( 7) CTATTTGGGGCTATA 1 A/chicken/Bangladesh/11r ( 1) CTATTTGGAGCTATA 1 A/turkey/Egypt/091QNLQP/ ( 1) CTATTTGGAGCTATA 1 A/duck/Hong_Kong/312/197 ( 1) CTATTTGGAGCTATA 1 A/chicken/Bangladesh/11r ( 4) CTATTTGGAGCTATA 1 A/chicken/Turkey/Misinli ( 7) CTATTTGGAGCTATA 1 A/wild_bird_feces/Byeong ( 4) CTATTTGGGGCCATA 1 A/ostrich/South_Africa/A ( 4) TTATTTGGGGCTATA 1 A/whooper_swan/Hokkaido/ ( 7) CTGTTTGGAGCTATA 1 A/chicken/Cambodia/TLC2/ ( 4) TTATTTGGAGCTATA 1 A/chicken/Cambodia/013LC ( 7) TTATTTGGAGCTATA 1 A/avian/New_York/Sg00387 ( 4) CTATTTGGAGCAATA 1 A/avian/New_York/Sg00372 ( 1) CTATTTGGAGCAATA 1 A/chicken/Egypt/1219s/20 ( 5) CTATAGCAGGCTTTA 1 A/chicken/Egypt/128s/201 ( 2) CTATAGCAGGCTTTA 1 A/chicken/Bangladesh/11r ( 1) TTGTTTGGAGCTATA 1 A/duck/Primorie/2621/200 ( 2) CTATAGCAGGCTTCA 1 A/turkey/England/N28/73| ( 2) CTATAGCGGGTTTTA 1 A/chicken/Egypt/125s/201 ( 5) CTATAGCAGGTTTTA 1 A/chicken/Egypt/398252/2 ( 2) CTATAGCAGGTTTTA 1 A/chicken/Italy/9097/199 ( 2) CCATAGCAGGCTTCA 1 // -------------------------------------------------------------------------------- 
-------------------------------------------------------------------------------- Motif 3 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 15 n= 1771 bayes= 7.27007 E= 2.5e-074 -645 376 -59 141 129 -726 76 206 -39 233 103 17 205 135 -98 412 25 289 -134 -19 -618 -75 62 244 220 -712 186 351 146 325 268 227 269 290 36 614 291 420 -37 67 89 -475 23 253 219 -208 148 280 77 338 228 123 294 244 -3 579 -466 451 -48 53 -910 -801 -108 32 30 -879 20 64 -91 115 -28 -45 112 53 -168 292 302 126 -174 -92 -78 -504 -6 202 174 -601 124 246 37 281 186 132 303 232 -31 573 250 397 -90 16 -650 -639 -37 120 81 -27 66 112 -41 174 50 50 201 131 -117 429 253 254 -150 -52 -712 215 -72 83 52 111 39 51 -76 136 -11 -16 167 82 -143 352 -673 189 -161 -72 -116 -537 135 300 189 118 194 232 141 278 184 208 304 257 47 538 -502 367 -28 71 -14 -524 -28 175 145 71 99 186 -6 247 142 93 286 200 -62 539 -492 363 -112 -10 -729 -729 104 252 116 157 152 135 103 189 78 150 202 178 0 355 -699 203 -65 23 -649 389 31 275 239 -749 174 337 71 349 240 92 235 218 11 399 -104 337 -71 60 -252 -76 84 282 251 -605 211 372 167 358 300 255 333 326 66 662 276 468 -14 90 55 -499 -21 194 166 -592 111 222 12 274 169 109 305 221 -44 564 115 394 -98 8 -618 25 44 227 204 -710 169 331 124 309 246 204 260 271 17 594 286 403 -52 53 104 -669 -98 78 66 -755 33 86 -85 164 19 -39 169 86 -147 374 -655 233 -159 -72 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 15 nsites= 25 E= 2.5e-074 0.000000 0.840000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.160000 0.000000 0.000000 0.000000 
0.000000 0.040000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.960000 0.000000 0.000000 0.000000 0.920000 0.000000 0.000000 0.000000 0.000000 0.080000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.280000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.720000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.280000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.720000 0.000000 0.000000 0.000000 0.000000 0.280000 0.000000 0.000000 0.000000 0.720000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.240000 0.000000 0.000000 0.000000 0.000000 0.760000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.440000 0.000000 0.000000 0.000000 0.000000 0.560000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.880000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.120000 0.000000 0.000000 0.000000 0.080000 0.040000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.880000 0.000000 0.000000 0.000000 0.720000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.280000 0.000000 0.000000 0.000000 0.000000 0.080000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.920000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 3 regular expression -------------------------------------------------------------------------------- CTAT[TA][TG][GC][GA][GA]GCT[AT]TA -------------------------------------------------------------------------------- Time 56.86 secs. ******************************************************************************** ******************************************************************************** MOTIF 4 width = 15 sites = 18 llr = 248 E-value = 1.6e-011 ******************************************************************************** -------------------------------------------------------------------------------- Motif 4 Description -------------------------------------------------------------------------------- Simplified A a:7:76:36::7::: pos.-specific C ::::::::1:::::: probability D ::::::::::::::: matrix E ::::::::::::::: F ::::::::::::::: G ::3a:4a71aa::aa H ::::::::::::::: I ::::::::::::::: K ::::::::::::::: L ::::::::::::::: M ::::::::::::::: N ::::::::::::::: P ::::::::::::::: Q ::::::::::::::: R ::::::::::::::: S ::::::::::::::: T :a::3:::3::3a:: V ::::::::::::::: W ::::::::::::::: Y ::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (19.9 bits) 5.0 3.7 2.5 * * 1.2 ** ** * ****** 0.0 --------------- Multilevel ATAGAAGGAGGATGG consensus G TG AT T sequence 
-------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 4 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------- A/VietNam/HN31413/2008 1 1.21e-07 . ATAGAGGGTGGATGG CAGGGA A/Muscovy_Duck/Vietnam/1 1 2.36e-07 . ATAGAGGGCGGATGG CAGGGA A/chicken/Vietnam/NCVD11 4 3.75e-07 TTT ATAGAAGGAGGATGG CAG A/environment/Bangladesh 1 3.75e-07 . ATAGAAGGAGGATGG CAGGGA A/chicken/Queretaro/7653 4 3.75e-07 TTC ATAGAAGGAGGATGG CAA A_GenBank/heron/Cambodia 7 3.75e-07 GGGTTT ATAGAAGGAGGATGG A/heron/Cambodia/TM068/2 1 3.75e-07 . ATAGAAGGAGGATGG CAGGGG A/Hong_Kong/7032/2012 4 4.39e-07 TTT ATAGAGGGAGGATGG CAG A/duck/Vietnam/NCVD1161/ 4 4.39e-07 TTT ATAGAGGGAGGATGG CAA A/pigeon/Egypt/SHAH5803/ 1 4.39e-07 . ATAGAGGGAGGATGG CAGGGA A/environment/Bangladesh 1 4.39e-07 . ATAGAGGGAGGATGG CAGGGG A/chicken/Egypt/39825/20 7 4.39e-07 GGTTTT ATAGAGGGAGGATGG A/chicken/TanseMyanmar/S 4 6.25e-07 GGA ATGGTAGATGGTTGG TAT A/Egypt/2786NAMRU3/2006 7 6.25e-07 CAGGGA ATGGTAGATGGTTGG A/chicken/BacLieuVietnam 4 6.25e-07 GGG ATGGTAGATGGTTGG TAT A/poultry/Egypt/398256/2 1 6.25e-07 . 
ATGGTAGATGGTTGG TATGGG A/chicken/Nepal/354/2010 4 7.69e-07 GGA ATGGTAGACGGTTGG TAT A/Indonesia/625/2006 4 1.47e-06 TTT ATAGAAGGGGGATGG CAG -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 4 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/VietNam/HN31413/2008 1.2e-07 [4]_6 A/Muscovy_Duck/Vietnam/1 2.4e-07 [4]_6 A/chicken/Vietnam/NCVD11 3.7e-07 3_[4]_3 A/environment/Bangladesh 3.7e-07 [4]_6 A/chicken/Queretaro/7653 3.7e-07 3_[4]_3 A_GenBank/heron/Cambodia 3.7e-07 6_[4] A/heron/Cambodia/TM068/2 3.7e-07 [4]_6 A/Hong_Kong/7032/2012 4.4e-07 3_[4]_3 A/duck/Vietnam/NCVD1161/ 4.4e-07 3_[4]_3 A/pigeon/Egypt/SHAH5803/ 4.4e-07 [4]_6 A/environment/Bangladesh 4.4e-07 [4]_6 A/chicken/Egypt/39825/20 4.4e-07 6_[4] A/chicken/TanseMyanmar/S 6.2e-07 3_[4]_3 A/Egypt/2786NAMRU3/2006 6.2e-07 6_[4] A/chicken/BacLieuVietnam 6.2e-07 3_[4]_3 A/poultry/Egypt/398256/2 6.2e-07 [4]_6 A/chicken/Nepal/354/2010 7.7e-07 3_[4]_3 A/Indonesia/625/2006 1.5e-06 3_[4]_3 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 4 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 4 width=15 seqs=18 A/VietNam/HN31413/2008 ( 1) ATAGAGGGTGGATGG 1 A/Muscovy_Duck/Vietnam/1 ( 1) ATAGAGGGCGGATGG 1 A/chicken/Vietnam/NCVD11 ( 4) ATAGAAGGAGGATGG 1 A/environment/Bangladesh ( 1) ATAGAAGGAGGATGG 1 A/chicken/Queretaro/7653 ( 4) ATAGAAGGAGGATGG 1 A_GenBank/heron/Cambodia ( 7) ATAGAAGGAGGATGG 1 A/heron/Cambodia/TM068/2 ( 1) ATAGAAGGAGGATGG 1 A/Hong_Kong/7032/2012 ( 4) ATAGAGGGAGGATGG 1 A/duck/Vietnam/NCVD1161/ ( 4) ATAGAGGGAGGATGG 1 A/pigeon/Egypt/SHAH5803/ ( 1) ATAGAGGGAGGATGG 1 A/environment/Bangladesh 
( 1) ATAGAGGGAGGATGG 1 A/chicken/Egypt/39825/20 ( 7) ATAGAGGGAGGATGG 1 A/chicken/TanseMyanmar/S ( 4) ATGGTAGATGGTTGG 1 A/Egypt/2786NAMRU3/2006 ( 7) ATGGTAGATGGTTGG 1 A/chicken/BacLieuVietnam ( 4) ATGGTAGATGGTTGG 1 A/poultry/Egypt/398256/2 ( 1) ATGGTAGATGGTTGG 1 A/chicken/Nepal/354/2010 ( 4) ATGGTAGACGGTTGG 1 A/Indonesia/625/2006 ( 4) ATAGAAGGGGGATGG 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 4 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 15 n= 1771 bayes= 5.89847 E= 1.6e-011 103 -568 -19 181 160 -667 110 204 14 270 138 48 237 174 -60 477 -563 356 -81 13 -802 -703 -25 128 119 -810 101 195 17 211 105 77 185 158 -75 435 301 255 -102 -11 54 -459 35 247 214 -33 160 263 66 322 216 154 340 263 3 602 -431 439 -48 55 -637 -650 192 341 193 154 233 222 192 266 167 242 277 262 86 446 -613 291 11 103 53 -448 35 253 222 -543 165 284 72 332 227 162 353 273 11 615 111 451 -45 62 31 -468 26 232 199 16 151 244 50 304 199 149 339 256 -8 595 -436 421 -61 44 -637 -650 192 341 193 154 233 222 192 266 167 242 277 262 86 446 -613 291 11 103 -101 -489 172 339 231 111 233 275 178 321 227 247 348 299 85 585 -454 414 10 110 18 75 47 272 242 -250 179 299 91 354 251 174 364 289 27 633 108 474 -27 78 -637 -650 192 341 193 154 233 222 192 266 167 242 277 262 86 446 -613 291 11 103 -637 -650 192 341 193 154 233 222 192 266 167 242 277 262 86 446 -613 291 11 103 53 -448 35 253 222 -543 165 284 72 332 227 162 353 273 11 615 111 451 -45 62 -802 -703 -25 128 119 -810 101 195 17 211 105 77 185 158 -75 435 301 255 -102 -11 -637 -650 192 341 193 154 233 222 192 266 167 242 277 262 86 446 -613 291 11 103 -637 -650 192 341 193 154 233 222 192 266 167 242 277 262 86 446 -613 291 11 103 -------------------------------------------------------------------------------- 
-------------------------------------------------------------------------------- Motif 4 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 15 nsites= 18 E= 1.6e-011 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.722222 0.000000 0.000000 0.000000 0.000000 0.277778 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.722222 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.277778 0.000000 0.000000 0.000000 0.611111 0.000000 0.000000 0.000000 0.000000 0.388889 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.277778 0.000000 0.000000 0.000000 0.000000 0.722222 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.555556 0.111111 0.000000 0.000000 0.000000 0.055556 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.277778 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.722222 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.277778 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 4 regular expression -------------------------------------------------------------------------------- AT[AG]G[AT][AG]G[GA][AT]GG[AT]TGG -------------------------------------------------------------------------------- Time 57.67 secs. 
******************************************************************************** ******************************************************************************** MOTIF 5 width = 14 sites = 12 llr = 196 E-value = 8.3e-010 ******************************************************************************** -------------------------------------------------------------------------------- Motif 5 Description -------------------------------------------------------------------------------- Simplified A :7a1a::81:61:: pos.-specific C a2::::713::::3 probability D ::::::::::::1: matrix E :::::::::::::: F :::::::::::::: G :::2:a3:773::: H :::::::::::::: I :::::::::::::: K :::::::::::::: L :::::::::::::: M :::::::::::::: N :::::::::::::: P :::::::::::::: Q :::::::::::::: R :::::::::::::: S :::::::::::::: T :2:8::11:32997 V :::::::::::::: W :::::::::::::: Y :::::::::1:::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (23.6 bits) 5.0 3.7 * * 2.5 * * *** 1.2 ********** *** 0.0 -------------- Multilevel CAATAGCAGGATTT consensus G CTG C sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 5 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- -------------- A/Chicken/TurkeyEdirne/0 2 1.90e-09 G CCATAGCAGGTTTT ATAGAG A/Canada_goose/Alaska//4 5 4.78e-09 GGAG CAATAGCAGGATTT ATA A/mallard/Washington/456 8 4.78e-09 TTTGGAG CAATAGCAGGATTT A/wild_bird/Wisconsin/43 2 5.66e-09 G CAATAGCAGGATTC ATAGAA A/avian/New_York/Sg00377 8 5.66e-09 TTTGGAG CAATAGCAGGATTC A/chicken/Egypt/398220/2 8 5.66e-09 TTTGGAG CTATAGCAGGTTTT A/chicken/Bangladesh/11r 8 1.10e-08 TTTGGAG CTATAGCAGGGTTT A/chicken/Puebla/1458665 5 1.20e-08 GGAG CAATAGCCGGATTC ATA A/duck/Bangladesh/5749/1 3 4.54e-08 CA CCATAGTAAYGADC AGGGG A/wild_bird_feces/Cheons 5 5.98e-07 GAAA CAAGAGGACTATTT 
GGG A/turkey/MN/40550/1987_H 5 9.56e-07 GAAA CAAGAGGACTGTTT GGA A/duck/Korea/GJ54/2004|G 5 2.69e-06 GAAA CAAAAGGTCTATTT GGG -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 5 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/Chicken/TurkeyEdirne/0 1.9e-09 1_[5]_6 A/Canada_goose/Alaska//4 4.8e-09 4_[5]_3 A/mallard/Washington/456 4.8e-09 7_[5] A/wild_bird/Wisconsin/43 5.7e-09 1_[5]_6 A/avian/New_York/Sg00377 5.7e-09 7_[5] A/chicken/Egypt/398220/2 5.7e-09 7_[5] A/chicken/Bangladesh/11r 1.1e-08 7_[5] A/chicken/Puebla/1458665 1.2e-08 4_[5]_3 A/duck/Bangladesh/5749/1 4.5e-08 2_[5]_5 A/wild_bird_feces/Cheons 6e-07 4_[5]_3 A/turkey/MN/40550/1987_H 9.6e-07 4_[5]_3 A/duck/Korea/GJ54/2004|G 2.7e-06 4_[5]_3 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 5 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 5 width=14 seqs=12 A/Chicken/TurkeyEdirne/0 ( 2) CCATAGCAGGTTTT 1 A/Canada_goose/Alaska//4 ( 5) CAATAGCAGGATTT 1 A/mallard/Washington/456 ( 8) CAATAGCAGGATTT 1 A/wild_bird/Wisconsin/43 ( 2) CAATAGCAGGATTC 1 A/avian/New_York/Sg00377 ( 8) CAATAGCAGGATTC 1 A/chicken/Egypt/398220/2 ( 8) CTATAGCAGGTTTT 1 A/chicken/Bangladesh/11r ( 8) CTATAGCAGGGTTT 1 A/chicken/Puebla/1458665 ( 5) CAATAGCCGGATTC 1 A/duck/Bangladesh/5749/1 ( 3) CCATAGTAAYGADC 1 A/wild_bird_feces/Cheons ( 5) CAAGAGGACTATTT 1 A/turkey/MN/40550/1987_H ( 5) CAAGAGGACTGTTT 1 A/duck/Korea/GJ54/2004|G ( 5) CAAAAGGTCTATTT 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 5 
position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 14 n= 2024 bayes= 8.98014 E= 8.3e-010 -651 396 86 340 298 -767 231 401 130 410 306 140 263 270 70 389 -504 373 -26 110 43 106 151 393 360 -455 278 435 220 486 375 250 410 376 132 711 14 598 85 189 98 -441 91 317 287 -554 215 351 148 408 285 160 320 290 58 604 -449 507 27 127 -228 -392 163 357 309 -117 274 404 213 417 341 307 417 383 120 714 249 528 42 152 98 -441 91 317 287 -554 215 351 148 408 285 160 320 290 58 604 -449 507 27 127 -557 -578 269 419 263 151 306 299 269 336 245 321 344 337 161 525 -538 368 81 175 -393 330 123 315 276 -52 231 336 137 384 277 241 407 332 68 665 -55 496 6 122 67 -43 230 487 448 -413 349 526 317 581 468 294 398 436 210 762 -126 692 169 270 -244 156 218 390 290 101 284 341 228 385 292 300 411 358 137 657 -381 485 61 164 -469 -485 325 472 396 112 371 373 318 414 311 390 397 404 201 615 21 458 143 701 23 -372 130 350 307 -57 248 367 170 422 317 244 415 353 94 689 31 539 36 144 -264 -405 206 397 360 -558 326 499 295 469 418 378 412 440 183 768 274 571 95 201 -553 -454 632 449 357 -627 355 498 321 461 418 418 387 454 193 762 276 552 96 211 -473 221 124 312 300 -568 248 427 192 414 323 271 368 347 93 670 238 511 31 151 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 5 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 14 nsites= 12 E= 8.3e-010 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.666667 0.166667 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.166667 
0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.083333 0.000000 0.000000 0.000000 0.000000 0.166667 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.750000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.666667 0.000000 0.000000 0.000000 0.250000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.083333 0.000000 0.000000 0.000000 0.833333 0.083333 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.083333 0.000000 0.000000 0.000000 0.083333 0.250000 0.000000 0.000000 0.000000 0.666667 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.666667 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.250000 0.000000 0.000000 0.083333 0.583333 0.000000 0.000000 0.000000 0.000000 0.250000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.166667 0.000000 0.000000 0.000000 0.083333 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.916667 0.000000 0.000000 0.000000 0.000000 0.000000 0.083333 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.916667 0.000000 0.000000 
0.000000 0.000000 0.333333 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.666667 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 5 regular expression -------------------------------------------------------------------------------- CAATAG[CG]A[GC][GT][AG]TT[TC] -------------------------------------------------------------------------------- Time 58.49 secs. ******************************************************************************** ******************************************************************************** MOTIF 6 width = 21 sites = 4 llr = 106 E-value = 2.1e-004 ******************************************************************************** -------------------------------------------------------------------------------- Motif 6 Description -------------------------------------------------------------------------------- Simplified A ::a::::::a:a:a:::8::a pos.-specific C :a:::3:::::::::::3::: probability D ::::::::::::::::::::: matrix E ::::::::::::::::::::: F ::::::::::::::::::::: G a::aa3::::::a:aaa:aa: H ::::::::::::::::::::: I ::::::::::::::::::::: K ::::::::::::::::::::: L ::::::::::::::::::::: M ::::::::::::::::::::: N ::::::::::::::::::::: P ::::::::::::::::::::: Q ::::::::::::::::::::: R ::::::::::::::::::::: S ::::::::::::::::::::: T :::::5aaa:a:::::::::: V ::::::::::::::::::::: W ::::::::::::::::::::: Y ::::::::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (38.2 bits) 5.0 3.7 * 2.5 * *** * 1.2 ********************* 0.0 --------------------- Multilevel GCAGGTTTTATAGAGGGAGGA consensus C C sequence G -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 6 sites sorted by position p-value 
-------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------------- A/chicken/Indonesia/SmiW 1 5.44e-12 . GCAGGTTTTATAGAGGGAGGA A/chicken/Egypt/10117/20 1 6.47e-12 . GCAGGTTTTATAGAGGGCGGA A/turkey/Italy/1980|GQ24 1 7.65e-12 . GCAGGCTTTATAGAGGGAGGA A/chicken/Egypt/398214/2 1 2.63e-11 . GCAGGGTTTATAGAGGGAGGA -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 6 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/chicken/Indonesia/SmiW 5.4e-12 [6] A/chicken/Egypt/10117/20 6.5e-12 [6] A/turkey/Italy/1980|GQ24 7.6e-12 [6] A/chicken/Egypt/398214/2 2.6e-11 [6] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 6 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 6 width=21 seqs=4 A/chicken/Indonesia/SmiW ( 1) GCAGGTTTTATAGAGGGAGGA 1 A/chicken/Egypt/10117/20 ( 1) GCAGGTTTTATAGAGGGCGGA 1 A/turkey/Italy/1980|GQ24 ( 1) GCAGGCTTTATAGAGGGAGGA 1 A/chicken/Egypt/398214/2 ( 1) GCAGGGTTTATAGAGGGAGGA 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 6 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 21 n= 253 bayes= 5.96 E= 2.1e-004 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -639 396 94 349 308 -756 240 412 140 420 315 150 272 279 79 401 -494 386 -17 121 73 -256 279 533 480 -384 392 565 359 613 494 341 456 479 249 
787 -278 716 202 312 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -292 122 347 562 501 -98 460 615 424 631 526 464 546 559 300 849 176 692 212 358 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 73 -256 279 533 480 -384 392 565 359 613 494 341 456 479 249 787 -278 716 202 312 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 73 -256 279 533 480 -384 392 565 359 613 494 341 456 479 249 787 -278 716 202 312 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 73 -256 279 533 480 -384 392 565 359 613 494 341 456 479 249 787 -278 716 202 312 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 55 34 281 536 499 -365 403 587 369 634 518 351 466 491 261 809 -257 740 218 326 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 73 -256 279 533 480 -384 392 565 359 613 494 341 456 479 249 787 -278 716 202 312 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 6 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 21 nsites= 4 E= 2.1e-004 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.250000 0.000000 0.000000 0.000000 0.250000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.500000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.750000 0.250000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 6 regular expression -------------------------------------------------------------------------------- GCAGG[TCG]TTTATAGAGGG[AC]GGA 
-------------------------------------------------------------------------------- Time 59.26 secs. ******************************************************************************** ******************************************************************************** MOTIF 7 width = 17 sites = 4 llr = 97 E-value = 6.2e-002 ******************************************************************************** -------------------------------------------------------------------------------- Motif 7 Description -------------------------------------------------------------------------------- Simplified A :a::8::5:::::a::: pos.-specific C :::::8:::::::::a: probability D ::::::::::::::::: matrix E ::::::::::::::::: F ::::::::::::::::: G a:aa:::5:::aa:a:: H ::::::::::::::::: I ::::::::::::::::: K ::::::::::::::::: L ::::::::::::::::: M ::::::::::::::::: N ::::::::::::::::: P ::::::::::::::::: Q ::::::::::::::::: R ::::::::::::::::: S ::::::::::::::::: T ::::33a:aaa:::::a V ::::::::::::::::: W ::::::::::::::::: Y ::::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (34.9 bits) 5.0 3.7 * 2.5 ** *** ** 1.2 ******* ********* 0.0 ----------------- Multilevel GAGGACTGTTTGGAGCT consensus TT A sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 7 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- ----------------- A/common_magpie/HongKong 5 1.62e-11 AAAA GAGGACTGTTTGGAGCT A/Chicken/TurkeyMus/09rs 5 3.22e-11 AAGA GAGGACTATTTGGAGCT A/ck/Indonesia/091/10 5 4.14e-11 AAGA GAGGTCTATTTGGAGCT A/chicken/Bangladesh/11r 5 6.12e-10 AAAA GAGGATTGTTTGGAGCT -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 7 block diagrams 
-------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/common_magpie/HongKong 1.6e-11 4_[7] A/Chicken/TurkeyMus/09rs 3.2e-11 4_[7] A/ck/Indonesia/091/10 4.1e-11 4_[7] A/chicken/Bangladesh/11r 6.1e-10 4_[7] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 7 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 7 width=17 seqs=4 A/common_magpie/HongKong ( 5) GAGGACTGTTTGGAGCT 1 A/Chicken/TurkeyMus/09rs ( 5) GAGGACTATTTGGAGCT 1 A/ck/Indonesia/091/10 ( 5) GAGGTCTATTTGGAGCT 1 A/chicken/Bangladesh/11r ( 5) GAGGATTGTTTGGAGCT 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 7 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 17 n= 1265 bayes= 8.30035 E= 6.2e-002 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 73 -256 279 533 480 -384 392 565 359 613 494 341 456 479 249 787 -278 716 202 312 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 48 -241 309 566 497 -365 422 588 396 632 516 380 496 519 279 809 -13 731 215 332 -593 393 110 361 324 -706 253 435 162 441 333 175 296 301 93 459 -298 429 0 142 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 -44 -306 340 542 425 57 409 492 371 539 433 413 506 491 259 761 -280 630 175 291 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 -458 
-371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 73 -256 279 533 480 -384 392 565 359 613 494 341 456 479 249 787 -278 716 202 312 -462 -489 361 513 350 143 394 392 362 423 337 415 428 429 251 619 -444 462 164 263 -639 396 94 349 308 -756 240 412 140 420 315 150 272 279 79 401 -494 386 -17 121 -458 -371 302 494 448 -550 414 590 392 561 497 458 466 520 270 831 272 643 180 300 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 7 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 17 nsites= 4 E= 6.2e-002 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.750000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.250000 0.000000 0.000000 0.000000 0.000000 0.750000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.250000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.500000 0.000000 0.000000 0.000000 0.000000 0.500000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 
-------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 7 regular expression -------------------------------------------------------------------------------- GAGG[AT][CT]T[GA]TTTGGAGCT -------------------------------------------------------------------------------- Time 59.96 secs. ******************************************************************************** ******************************************************************************** MOTIF 8 width = 11 sites = 5 llr = 88 E-value = 1.2e+000 ******************************************************************************** -------------------------------------------------------------------------------- Motif 8 Description -------------------------------------------------------------------------------- Simplified A ::::2:a:::: pos.-specific C :8:::::a:a: probability D ::::::::::: matrix E ::::::::::: F ::::::::::: G a:aa8:::a:: H ::::::::::: I ::::::::::: K ::::::::::: L ::::::::::: M ::::::::::: N ::::::::::: P ::::::::::: Q ::::::::::: R ::::::::::: S ::::::::::: T :2:::a::::a V ::::::::::: W ::::::::::: Y ::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (25.4 bits) 5.0 3.7 * * * 2.5 * * * ** 1.2 *********** 0.0 ----------- Multilevel GCGGGTACGCT consensus T A sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 8 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- ----------- A/chicken/Vietnam/5/2010 5 8.62e-09 GGAA GCGGGTACGCT GCAGAC A/chicken/Vietnam/4/2010 2 8.62e-09 A GCGGGTACGCT GCAGACAAA A/duck/Vietnam/1/2010 11 8.62e-09 GAGCAGGGAA GCGGGTACGCT A/duck/Vietnam/3/2010 8 3.40e-08 CAGGGAA GCGGATACGCT GCA A/chicken/Egypt/11VIR445 11 1.63e-07 
GAGCAGGGGA GTGGGTACGCT -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 8 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/chicken/Vietnam/5/2010 8.6e-09 4_[8]_6 A/chicken/Vietnam/4/2010 8.6e-09 1_[8]_9 A/duck/Vietnam/1/2010 8.6e-09 10_[8] A/duck/Vietnam/3/2010 3.4e-08 7_[8]_3 A/chicken/Egypt/11VIR445 1.6e-07 10_[8] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 8 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 8 width=11 seqs=5 A/chicken/Vietnam/5/2010 ( 5) GCGGGTACGCT 1 A/chicken/Vietnam/4/2010 ( 2) GCGGGTACGCT 1 A/duck/Vietnam/1/2010 ( 11) GCGGGTACGCT 1 A/duck/Vietnam/3/2010 ( 8) GCGGATACGCT 1 A/chicken/Egypt/11VIR445 ( 11) GTGGGTACGCT 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 8 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 11 n= 2783 bayes= 9.36986 E= 1.2e+000 -474 -500 350 502 339 144 383 380 351 412 326 404 418 417 241 608 -456 451 154 252 -614 394 99 353 313 -730 244 420 147 427 321 160 282 287 84 430 -344 405 -12 127 -474 -500 350 502 339 144 383 380 351 412 326 404 418 417 241 608 -456 451 154 252 -474 -500 350 502 339 144 383 380 351 412 326 404 418 417 241 608 -456 451 154 252 -292 -458 365 520 356 134 397 404 368 433 351 422 440 438 257 646 -413 486 166 265 -486 -396 271 458 420 -574 386 560 360 530 468 430 440 490 240 803 278 613 156 272 78 -274 255 506 459 -402 370 541 333 590 472 318 437 455 
226 768 -297 695 183 290 -641 396 93 347 307 -758 238 410 138 418 314 148 270 277 77 399 -496 383 -19 119 -474 -500 350 502 339 144 383 380 351 412 326 404 418 417 241 608 -456 451 154 252 -641 396 93 347 307 -758 238 410 138 418 314 148 270 277 77 399 -496 383 -19 119 -486 -396 271 458 420 -574 386 560 360 530 468 430 440 490 240 803 278 613 156 272 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 8 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 11 nsites= 5 E= 1.2e+000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.800000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.200000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.200000 0.000000 0.000000 0.000000 0.000000 0.800000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 8 regular expression -------------------------------------------------------------------------------- G[CT]GG[GA]TACGCT -------------------------------------------------------------------------------- Time 60.70 secs. 
******************************************************************************** ******************************************************************************** MOTIF 9 width = 21 sites = 11 llr = 162 E-value = 8.0e-003 ******************************************************************************** -------------------------------------------------------------------------------- Motif 9 Description -------------------------------------------------------------------------------- Simplified A :91a47a:aa:9aa9a92a:a pos.-specific C :::::1::::::::::::::: probability D ::::::::::::::::::::: matrix E ::::::::::::::::::::: F ::::::::::::::::::::: G a19:62:a::a1::1:18:a: H ::::::::::::::::::::: I ::::::::::::::::::::: K ::::::::::::::::::::: L ::::::::::::::::::::: M ::::::::::::::::::::: N ::::::::::::::::::::: P ::::::::::::::::::::: Q ::::::::::::::::::::: R ::::::::::::::::::::: S ::::::::::::::::::::: T ::::::::::::::::::::: V ::::::::::::::::::::: W ::::::::::::::::::::: Y ::::::::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (21.3 bits) 5.0 3.7 2.5 1.2 **** *************** 0.0 --------------------- Multilevel GAGAGAAGAAGAAAAAAGAGA consensus A sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 9 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------------- A/Egypt/9174NAMRU3/2009 1 3.52e-08 . GAGAGAAGAAGAAAAAAGAGA A/chicken/WestJava/SmiSu 1 4.74e-08 . GAGAGCAGAAGAAAAAAGAGA A/Egypt/N07460/2012 1 9.07e-08 . GAGAAAAGAAGAAAAAAGAGA A/chicken/Egypt/1123AL/2 1 1.20e-07 . GAGAGGAGAAGAAAAAAGAGA A/duck/Egypt/1130AG/2011 1 2.11e-07 . GAGAGAAGAAGGAAAAAGAGA A/chicken/Egypt/10259SF/ 1 2.11e-07 . GAGAGAAGAAGAAAAAGGAGA A/chicken/Egypt/1117AF/2 1 3.30e-07 . GAGAGAAGAAGAAAAAAAAGA A/Egypt/N6774/2011 1 5.02e-07 . 
GGGAAAAGAAGAAAAAAGAGA A/chicken/Egypt/1090/201 1 6.98e-07 . GAGAGGAGAAGAAAGAAGAGA A/chicken/Egypt/111945V/ 1 8.06e-07 . GAGAAAAGAAGAAAAAAAAGA A/quail/Egypt/1171SG/201 1 1.34e-06 . GAAAAAAGAAGAAAAAAGAGA -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 9 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/Egypt/9174NAMRU3/2009 3.5e-08 [9] A/chicken/WestJava/SmiSu 4.7e-08 [9] A/Egypt/N07460/2012 9.1e-08 [9] A/chicken/Egypt/1123AL/2 1.2e-07 [9] A/duck/Egypt/1130AG/2011 2.1e-07 [9] A/chicken/Egypt/10259SF/ 2.1e-07 [9] A/chicken/Egypt/1117AF/2 3.3e-07 [9] A/Egypt/N6774/2011 5e-07 [9] A/chicken/Egypt/1090/201 7e-07 [9] A/chicken/Egypt/111945V/ 8.1e-07 [9] A/quail/Egypt/1171SG/201 1.3e-06 [9] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 9 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 9 width=21 seqs=11 A/Egypt/9174NAMRU3/2009 ( 1) GAGAGAAGAAGAAAAAAGAGA 1 A/chicken/WestJava/SmiSu ( 1) GAGAGCAGAAGAAAAAAGAGA 1 A/Egypt/N07460/2012 ( 1) GAGAAAAGAAGAAAAAAGAGA 1 A/chicken/Egypt/1123AL/2 ( 1) GAGAGGAGAAGAAAAAAGAGA 1 A/duck/Egypt/1130AG/2011 ( 1) GAGAGAAGAAGGAAAAAGAGA 1 A/chicken/Egypt/10259SF/ ( 1) GAGAGAAGAAGAAAAAGGAGA 1 A/chicken/Egypt/1117AF/2 ( 1) GAGAGAAGAAGAAAAAAAAGA 1 A/Egypt/N6774/2011 ( 1) GGGAAAAGAAGAAAAAAGAGA 1 A/chicken/Egypt/1090/201 ( 1) GAGAGGAGAAGAAAGAAGAGA 1 A/chicken/Egypt/111945V/ ( 1) GAGAAAAGAAGAAAAAAAAGA 1 A/quail/Egypt/1171SG/201 ( 1) GAAAAAAGAAGAAAAAAGAGA 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 9 
position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 21 n= 253 bayes= -2.07973 E= 8.0e-003 -545 -567 281 432 275 150 317 311 282 347 257 333 355 349 173 537 -526 381 92 187 76 -302 209 459 418 -229 327 493 288 549 436 281 401 416 185 742 -324 660 142 244 -341 -504 342 495 327 140 372 373 344 401 322 397 407 409 233 607 -458 447 143 240 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 -65 -418 205 379 284 93 275 331 213 380 282 289 408 352 124 653 -382 481 51 155 56 -16 216 469 430 -157 336 506 298 561 449 288 409 426 195 754 -314 673 153 254 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 -545 -567 281 432 275 150 317 311 282 347 257 333 355 349 173 537 -526 381 92 187 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 -545 -567 281 432 275 150 317 311 282 347 257 333 355 349 173 537 -526 381 92 187 76 -302 209 459 418 -229 327 493 288 549 436 281 401 416 185 742 -324 660 142 244 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 76 -302 209 459 418 -229 327 493 288 549 436 281 401 416 185 742 -324 660 142 244 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 76 -302 209 459 418 -229 327 493 288 549 436 281 401 416 185 742 -324 660 142 244 -247 -487 327 482 321 134 361 367 330 397 316 385 407 401 221 615 -444 453 131 230 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 -545 -567 281 432 275 150 317 311 282 347 257 333 355 349 173 537 -526 381 92 187 96 -416 114 344 312 -532 236 379 174 435 313 182 337 313 82 628 -427 535 49 150 -------------------------------------------------------------------------------- 
-------------------------------------------------------------------------------- Motif 9 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 21 nsites= 11 E= 8.0e-003 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.909091 0.000000 0.000000 0.000000 0.000000 0.090909 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.090909 0.000000 0.000000 0.000000 0.000000 0.909091 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.363636 0.000000 0.000000 0.000000 0.000000 0.636364 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.727273 0.090909 0.000000 0.000000 0.000000 0.181818 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.909091 0.000000 0.000000 0.000000 0.000000 0.090909 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.909091 0.000000 0.000000 0.000000 0.000000 0.090909 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.909091 0.000000 0.000000 0.000000 0.000000 0.090909 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.181818 0.000000 0.000000 0.000000 0.000000 0.818182 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 9 regular expression -------------------------------------------------------------------------------- GAGA[GA]AAGAAGAAAAAAGAGA -------------------------------------------------------------------------------- Time 61.66 secs. ******************************************************************************** ******************************************************************************** MOTIF 10 width = 15 sites = 3 llr = 65 E-value = 5.1e+001 ******************************************************************************** -------------------------------------------------------------------------------- Motif 10 Description -------------------------------------------------------------------------------- Simplified A :::::::a:::7:a: pos.-specific C ::::::::::::::a probability D ::::::::::::::: matrix E ::::::::::::::: F ::::::::::::::: G aa::aa:::aa3::: H ::::::::::::::: I ::::::::::::::: K ::::::::::::::: L ::::::::::::::: M ::::::::::::::: N ::::::::::::::: P ::::::::::::::: Q ::::::::::::::: R ::::::::::::::: S ::::::::::::::: T ::aa::a:a:::a:: V ::::::::::::::: W ::::::::::::::: Y ::::::::::::::: bits 12.4 11.1 9.9 8.7 Relative 7.4 Entropy 6.2 (31.2 bits) 5.0 3.7 * 2.5 ** * * * * 1.2 *********** *** 0.0 --------------- Multilevel GGTTGGTATGGATAC consensus G sequence -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 10 sites sorted by position p-value -------------------------------------------------------------------------------- Sequence name Start P-value Site ------------- ----- --------- --------------- A/chicken/Anhui/T5/2006 1 6.50e-10 . 
GGTTGGTATGGATAC CATCAT A/chicken/Guiyang/821/20 7 6.50e-10 GTAGAT GGTTGGTATGGATAC A/chicken/Bangladesh/11V 7 1.21e-09 GTAGAT GGTTGGTATGGGTAC -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 10 block diagrams -------------------------------------------------------------------------------- SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/chicken/Anhui/T5/2006 6.5e-10 [10]_6 A/chicken/Guiyang/821/20 6.5e-10 6_[10] A/chicken/Bangladesh/11V 1.2e-09 6_[10] -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 10 in BLOCKS format -------------------------------------------------------------------------------- BL MOTIF 10 width=15 seqs=3 A/chicken/Anhui/T5/2006 ( 1) GGTTGGTATGGATAC 1 A/chicken/Guiyang/821/20 ( 7) GGTTGGTATGGATAC 1 A/chicken/Bangladesh/11V ( 7) GGTTGGTATGGGTAC 1 // -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 10 position-specific scoring matrix -------------------------------------------------------------------------------- log-odds matrix: alength= 20 w= 15 n= 1771 bayes= 8.86125 E= 5.1e+001 -450 -477 373 526 362 141 405 405 375 437 349 427 440 442 263 632 -430 476 174 275 -450 -477 373 526 362 141 405 405 375 437 349 427 440 442 263 632 -430 476 174 275 -429 -346 336 539 479 -521 446 622 430 598 526 486 497 553 303 861 262 674 206 334 -429 -346 336 539 479 -521 446 622 430 598 526 486 497 553 303 861 262 674 206 334 -450 -477 373 526 362 141 405 405 375 437 349 427 440 442 263 632 -430 476 174 275 -450 -477 373 526 362 141 405 405 375 437 349 427 440 442 263 632 -430 476 174 275 -429 -346 336 539 479 -521 446 622 430 598 526 486 497 553 303 861 262 674 206 334 67 
-239 308 567 502 -366 417 591 390 637 517 367 485 508 275 804 -258 737 221 337 -429 -346 336 539 479 -521 446 622 430 598 526 486 497 553 303 861 262 674 206 334 -450 -477 373 526 362 141 405 405 375 437 349 427 440 442 263 632 -430 476 174 275 -450 -477 373 526 362 141 405 405 375 437 349 427 440 442 263 632 -430 476 174 275 31 -240 343 583 496 -86 438 574 411 626 512 409 518 531 291 812 -243 725 221 340 -429 -346 336 539 479 -521 446 622 430 598 526 486 497 553 303 861 262 674 206 334 67 -239 308 567 502 -366 417 591 390 637 517 367 485 508 275 804 -258 737 221 337 -636 396 96 351 311 -752 242 415 143 423 318 152 274 281 81 405 -492 391 -15 125 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 10 position-specific probability matrix -------------------------------------------------------------------------------- letter-probability matrix: alength= 20 w= 15 nsites= 3 E= 5.1e+001 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 
0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.666667 0.000000 0.000000 0.000000 0.000000 0.333333 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Motif 10 regular expression -------------------------------------------------------------------------------- 
GGTTGGTATGG[AG]TAC -------------------------------------------------------------------------------- Time 62.43 secs. ******************************************************************************** ******************************************************************************** SUMMARY OF MOTIFS ******************************************************************************** -------------------------------------------------------------------------------- Combined block diagrams: non-overlapping sites with p-value < 0.0001 -------------------------------------------------------------------------------- SEQUENCE NAME COMBINED P-VALUE MOTIF DIAGRAM ------------- ---------------- ------------- A/chicken/Bangladesh/830 1.22e-03 [2(6.62e-06)] A/chicken/Egypt/39825/20 4.13e-03 6_[4(4.39e-07)] A/ck/Indonesia/072/10 3.16e-02 [2(2.31e-05)] A/duck/Bangladesh/5749/1 2.68e-07 2_[5(4.54e-08)]_5 A/mallard/Crimea/245/200 3.86e-09 [1(6.48e-11)] A/environment/Bangladesh 1.25e-03 [4(4.39e-07)]_6 A/chicken/Egypt/111945V/ 1.66e-03 [9(8.06e-07)] A/avian/New_York/Sg00372 7.47e-04 [3(3.23e-09)]_6 A/chicken/Egypt/1117AF/2 1.30e-03 [9(3.30e-07)] A/wild_bird_feces/Cheons 1.32e-02 4_[5(5.98e-07)]_3 A/chicken/India/241272/2 8.88e-03 [2(5.95e-06)] A/duck/Jiangxi/80/2005 2.00e-03 [2(2.08e-07)] A/chicken/Inhu/BPPVRII/2 5.07e-03 [2(1.32e-06)] A/duck/Guangxi/668/2004 3.93e-03 [2(1.67e-06)] A/whooper_swan/Hokkaido/ 5.48e-13 1_[7(1.62e-11)]_3 A/duck/Vietnam/1/2010 8.68e-04 10_[8(8.62e-09)] A/avian/New_York/Sg00387 1.90e-04 3_[3(3.23e-09)]_3 A/chicken/Cambodia/LC/20 3.04e-01 21 A/mallard/Maryland/786/2 2.27e-09 4_[5(5.66e-09)]_3 A/chicken/Egypt/128s/201 4.88e-08 1_[3(4.76e-09)]_5 A/guinea_fowl/Yangon/834 2.87e-02 21 A/chicken/WestJava/SmiAc 1.07e-02 [2(1.27e-05)] A/Indonesia/625/2006 1.65e-02 3_[4(1.47e-06)]_3 A/Egypt/N07460/2012 5.29e-04 [9(9.07e-08)] A/duck/Victoria/26/1981| 1.14e-09 [1(1.29e-11)] A/duck/Viet_Nam/TG2401/2 9.09e-03 [2(6.33e-05)] A/chicken/Egypt/398252/2 1.21e-08 1_[5(5.66e-09)]_6 
A/environment/Maryland/1 2.37e-08 [1(3.01e-11)] A/duck/New_Zealand/41/19 3.43e-08 [1(6.33e-10)] A/heron/Cambodia/TM068/2 3.39e-03 [4(3.75e-07)]_6 A/chicken/Egypt/10117/20 6.56e-08 [6(6.47e-12)] A/VietNam/HN31413/2008 3.48e-04 [4(1.21e-07)]_6 A/chicken/Egypt/398214/2 4.80e-07 [6(2.63e-11)] A/chicken/Egypt/39823/20 6.21e-02 21 A/chicken/Belgium/150VB/ 1.66e-08 [1(7.84e-10)] A/goose/Fujian/bb/2003 3.18e-03 [2(3.89e-07)] A/Vietnam/UT30259/2004 2.87e-03 [2(9.60e-06)] A/duck/Yunnan/47/2006 1.19e-02 [2(3.27e-06)] A/chicken/Egypt/10259SF/ 1.55e-03 [9(2.11e-07)] A/bird/Turkey/Unye_ist06 4.33e-03 [2(1.45e-06)] A/chicken/Bangladesh/11r 4.43e-08 4_[7(6.12e-10)] A/chicken/Vietnam/NCVD19 2.96e-04 [2(7.67e-08)] A/swan/England/AV3142149 8.58e-06 [1(5.93e-09)] A/chicken/Ibaraki/17/200 3.23e-08 [1(5.83e-11)] A/chicken/Nepal/354/2010 7.04e-03 3_[4(7.69e-07)]_3 A/duck/Vietnam/NCVD366/2 1.04e-02 [2(3.27e-06)] A/duck/Korea/GJ54/2004|G 8.70e-02 4_[5(2.69e-06)]_3 A/muscovy_duck/Vietnam/L 5.49e-03 [2(1.20e-06)] A/duck/Egypt/1130AG/2011 1.67e-03 [9(2.11e-07)] A/ck/Indonesia/091/10 6.76e-08 4_[7(4.14e-11)] A/duck/Vietnam/9/2010 8.80e-01 21 A/duck/Hokkaido/Vac3/200 6.46e-09 [1(1.27e-10)] A/Thailand/WRAIR1720H/20 7.09e-03 [2(7.12e-07)] A/chicken/Cambodia/022LC 5.25e-01 21 A/spurwinged_goose/Niger 1.87e-07 [1(2.89e-10)] A/pigeon/Egypt/SHAH5803/ 7.97e-04 [4(4.39e-07)]_6 A/whooper_swan/Mongolia/ 1.74e-06 [1(5.15e-08)] A/chicken/Shan/2626/2007 7.31e-04 [2(1.84e-08)] A/chicken/Italy/367/97|A 5.38e-01 21 A/turkey/Italy/1325/2005 6.19e-01 21 A/chicken/Egypt/1158SF/2 1.23e-02 [2(2.10e-06)] A/chicken/Egypt/398220/2 2.97e-05 7_[5(5.66e-09)] A/condor/Guangdong/139/2 9.86e-03 [2(1.20e-06)] A/chicken/Shandong/A1/20 1.24e-03 [2(3.10e-07)] A/chicken/Egypt/11764s/2 2.76e-01 21 A/swine/NorthSumatra/UT6 3.58e-03 [2(4.76e-07)] A/o.bill_stork/Thailand/ 2.66e-03 [2(2.53e-07)] A/chicken/Texas/2983132/ 1.39e-06 [1(5.19e-10)] A/chicken/Bangladesh/11V 1.24e-05 6_[10(1.21e-09)] A/chicken/Ibaraki/15/200 2.90e-08 
[1(1.65e-10)] A/chicken/Shandong/A10/2 5.58e-03 [2(1.92e-06)] A/duck/Vietnam/OIE1287/2 4.98e-04 [2(3.14e-08)] A/duck/Vietnam/NCVD1161/ 1.05e-02 3_[4(4.39e-07)]_3 A/duck/Ireland/113/1983| 7.45e-01 21 A/chicken/EastKalimantan 5.24e-03 [2(4.76e-07)] A/duck/Vietnam/3/2010 2.10e-03 7_[8(3.40e-08)]_3 A/Muscovy_duck/Ca_Mau/11 1.21e-02 [2(7.95e-07)] A/chicken/Turkey/Misinli 9.81e-14 1_[7(3.22e-11)]_3 A/chicken/Italy/9097/199 6.24e-07 1_[5(3.92e-08)]_6 A/mallard/Washington/456 7.87e-06 7_[5(4.78e-09)] A/duck/Iran/11VIR53161/2 6.67e-01 21 A/Muscovy_Duck/Vietnam/1 1.77e-04 [4(2.36e-07)]_6 A/duck/Hunan/149/2005 3.48e-06 [1(8.40e-09)] A/Egypt/4822NAMRU3/2009 2.66e-02 [2(4.03e-05)] A/duck/Vietnam/NCVD1463/ 3.24e-03 [2(1.92e-06)] A/poultry/Egypt/398256/2 1.80e-03 [4(6.25e-07)]_6 A/turkey/Italy/1980|GQ24 2.83e-08 [6(7.65e-12)] A/goose/Bangladesh/11VIR 2.97e-01 21 A/mallard/Washington/454 3.11e-09 [1(9.89e-12)] A/chicken/Cambodia/013LC 3.57e-11 6_[3(1.22e-09)] A/chicken/Banten/PdglKas 5.91e-03 [2(2.53e-07)] A/chicken/Vietnam/NCVD03 3.60e-01 21 A/wigeon/Ohio/379/1988|C 1.02e-09 [1(2.82e-12)] A/chicken/Magelang/BBVW6 1.62e-03 [2(7.67e-08)] A/chicken/Lampung/BPPVRI 1.43e-02 [2(2.91e-06)] A/chick/Pennsylvania/1/1 6.69e-05 [1(2.05e-08)] A/chicken/Liaoning/A1/20 3.40e-03 [2(2.10e-06)] A/wild_bird_feces/Cheons 1.67e-13 6_[3(4.15e-11)] A/chicken/Sikkim/151466/ 9.38e-03 [2(6.62e-06)] A/environment/ChangSha/2 1.87e-03 [2(5.95e-06)] A/chicken/CentralJava/UT 2.46e-03 [2(2.60e-06)] A/Vietnam/HN36250/2010 1.88e-02 [2(1.51e-05)] A/chicken/Guiyang/821/20 1.39e-06 6_[10(6.50e-10)] A/owstons_civet/VietNam/ 9.57e-04 [2(8.77e-07)] A/chicken/Bangladesh/11r 3.28e-06 3_[3(8.33e-11)]_3 A/environment/New_York/1 3.29e-10 [1(7.24e-13)] A/domestic_goose/Hong_Ko 1.50e-02 [2(8.76e-06)] A/Hubei/1/2010 3.24e-03 [2(1.92e-06)] A/Hunan/1/2009 1.76e-03 [2(3.93e-06)] A/chicken/Egypt/1123AL/2 8.11e-04 [9(1.20e-07)] A/wood_duck/MD/04623/200 8.07e-10 [1(9.89e-12)] A/chicken/Shanxi/2/2006 2.04e-02 21 
A/quail/Egypt/1171SG/201 1.40e-03 [9(1.34e-06)] A/duck/France/080036/200 7.54e-09 [1(8.17e-11)] A/chicken/Yangon/182/201 2.37e-03 [2(3.10e-07)] A/chicken/Scotland/59|X0 5.00e-06 [1(5.46e-09)] A/chicken/Sharkia/CAI41/ 1.52e-03 [2(1.04e-07)] A/mallard/Sweden/21/2002 2.50e-09 [1(2.67e-11)] A/environment/Bangladesh 7.64e-03 [2(5.95e-06)] A/chicken/Hebei/A8/2009 2.47e-03 [2(2.60e-06)] A/chicken/Egypt/11VIR445 1.66e-03 10_[8(1.63e-07)] A/duck/Guangxi/13/2004 3.67e-03 [2(9.60e-06)] A/duck/France/090043/200 4.42e-08 [1(2.89e-10)] A/chicken/Denpasar/BBVD1 2.21e-02 [2(3.61e-06)] A/Egypt/N6774/2011 6.32e-04 [9(5.02e-07)] A/parrot/CA/6032/04|DQ25 7.46e-08 [1(1.65e-10)] A/Cambodia/VN05103/2005 1.41e-01 21 A_GenBank/heron/Cambodia 2.73e-03 6_[4(3.75e-07)] A/chicken/Bangladesh/11r 6.14e-05 7_[5(1.10e-08)] A/chicken/Tabanan/BBVD14 1.77e-03 [2(2.08e-07)] A/great_cormorant/Tibet/ 5.06e-05 [2(3.08e-09)] A/Egypt/321NAMRU3/2007 5.79e-03 [2(6.23e-07)] A/chicken/Egypt/10512AG/ 3.04e-02 [2(9.60e-06)] AHAH5_[11734;11734] 4.21e-37 [1(3.05e-19)] A/Egypt/9174NAMRU3/2009 5.33e-04 [9(3.52e-08)] A/Indonesia/NIHRD12379/2 4.50e-03 [2(6.23e-07)] A/chicken/Korea/ES/03 1.77e-02 [2(5.89e-05)] A/avian/New_York/Sg00377 1.75e-05 7_[5(5.66e-09)] A/chicken/Vietnam/NCVD09 2.29e-03 [2(4.40e-06)] A/Chicken/TurkeyMus/09rs 5.71e-08 4_[7(3.22e-11)] A/duck/Hong_Kong/312/197 6.18e-05 [3(8.33e-11)]_6 A/Vietnam/UT3030/2003 6.22e-04 [2(5.58e-08)] A/crow/Bangladesh/11rs19 6.76e-02 21 A/duck/Primorie/2633/200 3.16e-05 [3(4.15e-11)]_6 A/Indonesia/UT3006/2005 7.23e-03 [2(3.89e-07)] A/duck/Qalubia/CAI11/201 5.33e-03 [2(3.89e-07)] A/green_winged_teal/Dela 2.40e-09 [1(5.44e-12)] A/chicken/Anhui/T5/2006 2.35e-05 [10(6.50e-10)]_6 A/chicken/Puebla/8623607 9.01e-01 21 A/duck/Guangxi/951/2005 4.60e-03 [2(5.43e-06)] A/chicken/Egypt/1219s/20 4.44e-11 4_[3(4.76e-09)]_2 A/chicken/WestJava/SmiSu 1.15e-03 [9(4.74e-08)] A/chicken/Cambodia/TLC2/ 1.63e-05 3_[3(1.22e-09)]_3 A/duck/France/05056a/200 1.22e-08 [1(4.65e-10)] A/Shandong/1/2009 
2.38e-03 [2(2.34e-06)] A/chicken/Vietnam/NCVD40 5.75e-03 [2(3.27e-06)] A/chicken/Queretaro/7653 1.33e-02 3_[4(3.75e-07)]_3 A/environment/Bangladesh 2.15e-03 [4(3.75e-07)]_6 A/barn_swallow/Hong_Kong 1.05e-02 [2(2.31e-05)] A/duck/Primorie/2621/200 9.31e-08 1_[3(1.97e-08)]_5 A/chicken/BacLieuVietnam 2.11e-03 3_[4(6.25e-07)]_3 A_DISC/Cambodia/V0401301 8.39e-03 [2(2.91e-06)] A/bar_headed_goose/Mongo 9.64e-06 [1(2.05e-08)] A/chicken/Egypt/113Q/201 1.22e-02 [2(1.92e-06)] A/quail/Thanatpin/2283/2 4.14e-03 [2(6.62e-06)] A/chicken/Egypt/125s/201 1.53e-11 4_[5(5.66e-09)]_3 A/duck/Hong_Kong/698/197 5.60e-09 [1(2.08e-11)] A/chicken/Egypt/209573/2 8.03e-01 21 A/chicken/Indonesia/SmiW 3.08e-08 [6(5.44e-12)] A/turkey/Egypt/091QNLQP/ 7.34e-05 [3(8.33e-11)]_6 A/duck/Egypt/1053/2010 2.54e-02 21 A/Egypt/2786NAMRU3/2006 8.02e-04 6_[4(6.25e-07)] A/mallard/Netherlands/3/ 1.90e-06 3_[3(4.15e-11)]_3 A/chicken/Bangladesh/11r 3.27e-05 [3(8.33e-11)]_6 A/chicken/Liaoning/23/20 4.61e-03 [2(1.20e-06)] A/unknown/NY/98996/01|AY 2.75e-09 [1(1.88e-11)] A/tern/South_Africa/1961 5.03e-01 21 A/chicken/Nepal/T1P/12 2.79e-03 [2(7.12e-07)] A/marabou_stork/Cambodia 2.36e-01 21 A/chicken/Cambodia/67F1/ 1.19e-02 [2(2.10e-06)] A/gadwall/California/442 6.07e-10 [1(2.10e-12)] A/chicken/TanseMyanmar/S 5.35e-03 3_[4(6.25e-07)]_3 A/duck/Cao_Bang/43/2007 5.78e-04 [2(5.46e-07)] A/chicken/Miyazaki/T10/2 2.84e-03 [2(1.67e-06)] A/whitefaced_whistling_d 7.41e-02 21 A/muscovy_duck/Vietnam/N 5.05e-03 [2(6.23e-07)] A/Canada_goose/Alaska//4 2.31e-09 4_[5(4.78e-09)]_3 A/chicken/Bangladesh/967 2.96e-03 [2(7.12e-07)] A/chicken/Egypt/11VIR445 6.80e-03 [2(1.92e-06)] A/Hong_Kong/7032/2012 1.26e-02 3_[4(4.39e-07)]_3 A/chicken/Vietnam/945/20 5.75e-01 21 A/ostrich/South_Africa/A 1.76e-05 3_[3(7.90e-10)]_3 A/wild_bird/Wisconsin/43 1.40e-06 1_[5(5.66e-09)]_6 A/spurwinged_goose/Niger 2.31e-02 [2(1.16e-05)] A/duck/Vietnam/NCVD1026/ 4.60e-03 [2(1.27e-05)] A/tern/South_Africa/1959 1.00e-06 3_[3(4.15e-11)]_3 A/muscovy_duck/Jakarta/S 
2.32e-03 [2(5.58e-08)] A/goose/Germany/R3160/09 2.01e-09 [1(4.14e-11)] A/turkey/Ontario/7732/19 7.44e-01 21 A/chicken/Egypt/1090/201 1.64e-03 [9(6.98e-07)] A/mallard/Netherlands/2/ 5.96e-09 [1(5.83e-11)] A/chicken/Vietnam/NCVD18 3.46e-03 [2(1.27e-05)] A/chicken/Texas/1672804/ 1.67e-07 [1(3.63e-11)] A/duck/Italy/775/2004|CY 1.24e-09 [1(9.22e-11)] A/chicken/Bangladesh/FD( 2.17e-05 [3(4.15e-11)]_6 A/chicken/Vietnam/NCVD40 1.17e-02 [2(1.66e-05)] A/swan/Hokkaido/67/1996| 1.62e-07 [1(2.49e-10)] A/chicken/Bangladesh/152 9.67e-04 [2(1.20e-06)] A/chicken/Bangladesh/11r 1.76e-01 21 A/chicken/Indonesia/Suka 2.79e-01 21 A/gull/Pennsylvania/4175 3.23e-07 [1(1.05e-09)] A/turkey/England/N28/73| 2.35e-08 1_[3(2.10e-08)]_5 A/chicken/Badung/BBVD302 1.81e-03 [2(2.17e-07)] A/chicken/Bangladesh/11r 2.34e-03 [3(7.12e-09)]_6 A/chicken/Yichang/lung1/ 8.73e-01 21 A/avian/Missouri/4655937 9.68e-09 [1(1.88e-11)] A/turkey/TX/14082/1982_H 4.49e-09 [1(1.88e-11)] A/Cambodia/W0526301/2012 1.11e-02 [2(1.92e-06)] A/chicken/Vietnam/NCVD11 1.10e-02 3_[4(3.75e-07)]_3 A/duck/Egypt/SHZA6605/20 6.64e-02 21 A/chicken/Shandong/A5/20 8.69e-03 [2(7.33e-06)] A/chicken/VA/40018/1984_ 1.38e-04 [1(8.00e-08)] A/common_magpie/HongKong 3.92e-09 4_[7(1.62e-11)] A/shearwater/Australia/7 5.83e-09 [1(6.48e-11)] A/Chicken/TurkeyEdirne/0 2.06e-08 1_[5(1.90e-09)]_6 A/goose/Guiyang/337/2006 7.92e-03 [2(7.95e-07)] A/chicken/Egypt/11VIR445 4.91e-01 21 A/environment/Thailand/I 3.12e-03 [2(4.76e-07)] A/wild_bird_feces/Byeong 9.09e-06 3_[3(6.63e-10)]_3 A/duck/Vietnam/NCVD422/2 1.28e-02 [2(1.96e-05)] A/chicken/Bhutan/4/10 9.80e-01 21 A/turkey/MN/40550/1987_H 5.51e-03 4_[5(9.56e-07)]_3 A/chicken/Egypt/1085/201 7.60e-03 [2(2.60e-06)] A/duck/France/06436/2006 4.96e-08 [1(8.65e-10)] A/HongKong/6841/2010 4.72e-01 21 A/chicken/WestBengal/239 6.43e-03 [2(1.45e-06)] A/duck/Bac_Lieu/1213/200 1.71e-03 [2(9.16e-08)] A/cinnamon_teal/Californ 2.44e-09 [1(1.49e-11)] A/Muscovy_duck/France/07 1.85e-08 [1(1.05e-09)] A/chicken/Vietnam/4/2010 
6.85e-04 1_[8(8.62e-09)]_9 A/chicken/Vietnam/NCVD01 1.05e-02 [2(2.94e-05)] A/chicken/Puebla/1458665 1.71e-08 4_[5(1.20e-08)]_3 A/Northern_shoveler/Utah 3.16e-10 [1(1.11e-12)] A/tundra_swan/Alaska//48 1.72e-09 [1(1.29e-11)] A/chicken/Sheny/0606/200 2.19e-03 [2(1.67e-06)] A/turkey/England/N28/73| 7.65e-01 21 A/duck/Iran/VIR53161/201 9.31e-01 21 A/chicken/Yangon/1023/20 3.00e-02 [2(2.51e-05)] A/chicken/Vietnam/5/2010 5.43e-04 4_[8(8.62e-09)]_6 A/chicken/Hebei/326/2005 4.20e-03 [2(1.32e-06)] A/chicken/Bangladesh/150 3.10e-04 [2(3.10e-07)] A/wild_bird/Minnesota/46 2.06e-09 [1(4.20e-12)] A/chicken/India/81766/20 2.22e-03 [2(7.12e-07)] -------------------------------------------------------------------------------- ******************************************************************************** ******************************************************************************** Stopped because nmotifs = 10 reached. ******************************************************************************** CPU: bioteam52 ******************************************************************************** bug2937.fasta100644000766000024 764614006427353 16533 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data /usr/local/fasta3/bin/fasta35 -O test_revcomp.fasta35 test_revcomp.fa clusters.fasta FASTA searches a protein or DNA sequence data bank version 35.04 Mar. 26, 2009 Please cite: W.R. Pearson & D.J. 
Lipman PNAS (1988) 85:2444-2448 Query: ILTV-miR1, 70 nt 1>>>ILTV-miR1 - 70 nt - 70 nt Library: clusters.fasta 3924656 residues in 179575 sequences opt E() < 20 8261 0:============================== 22 16 0:= one = represents 279 library sequences 24 5 0:= 26 38 4:* 28 262 41:* 30 712 247:*== 32 1499 956:===*== 34 4618 2593:=========*======= 36 5199 5325:===================* 38 7360 8800:=========================== * 40 9090 12276:================================= * 42 15194 15005:=====================================================*= 44 16710 16552:===========================================================* 46 14217 16859:=================================================== * 48 11294 16141:========================================= * 50 13398 14728:================================================= * 52 14142 12949:==============================================*==== 54 10605 11060:=======================================* 56 6654 9239:======================== * 58 7042 7585:========================== * 60 7276 6144:======================*==== 62 9388 4926:=================*================ 64 3357 3918:============= * 66 3240 3096:===========* 68 3655 2435:========*===== 70 1842 1909:======* 72 1001 1491:==== * 74 718 1163:=== * 76 747 905:===* 78 478 703:==* 80 334 546:=* 82 284 418:=* 84 196 331:=* 86 156 256:* 88 126 198:* inset = represents 2 library sequences 90 96 153:* 92 79 119:* :=======================================* 94 47 92:* :======================== * 96 40 71:* :==================== * 98 33 55:* :================= * 100 24 43:* :============ * 102 12 33:* :====== * 104 15 25:* :======== * 106 9 20:* :===== * 108 6 15:* :=== * 110 0 12:* : * 112 0 9:* : * 114 1 7:* := * 116 1 5:* := * 118 1 4:* :=* >120 94 3:* :=*====================================== 3924656 residues in 179575 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.7921+/-0.000406; mu= 7.9651+/- 0.013 mean_var=36.4967+/-12.281, 0's: 2734 Z-trim: 2761 B-trim: 0 in 0/14 Lambda= 
0.212299 statistics sampled from 60000 to 179478 sequences Kolmogorov-Smirnov statistic: 0.0718 (N=29) at 58 Algorithm: FASTA (3.5 Sept 2006) [optimized] Parameters: +5/-4 matrix (5:-4) ktup: 3 join: 60, opt: 45, open/ext: -12/-4, width: 16 The best scores are: opt bits E(179572) cluster_79238:1 ( 27) [r] 126 41.0 0.00012 >>cluster_79238:1 (27 nt) rev-comp initn: 125 init1: 125 opt: 126 Z-score: 208.3 bits: 41.0 E(): 0.00012 banded Smith-Waterman score: 126; 96.3% identity (96.3% similar) in 27 nt overlap (29-3:1-27) 60 50 40 30 20 10 ILTV-- TGATTGGGGAATGATTGGGAAGCTTGTGCCAATTCCATTCCTCTTTCTGTCTCCACCGC ::::::::::::::::::::::::: : cluste AATTCCATTCCTCTTTCTGTCTCCAAC 10 20 70 residues in 1 query sequences 3924656 residues in 179575 library sequences Scomplib [35.04] start: Tue Oct 27 08:58:20 2009 done: Tue Oct 27 08:58:25 2009 Total Scan time: 1.360 Total Display time: 0.000 Function used was FASTA [version 35.04 Mar. 26, 2009] sofa.ontology100644000766000024 761214006427353 17126 0ustar00cjfieldsstaff000000000000BioPerl-1.7.8/t/data!autogenerated-by: DAG-Edit version 1.316 !saved-by: suzi !date: Wed Feb 19 16:38:05 SGT 2003 !version: $Revision: 1.1 $ !type: % ISA Is a !type: < PARTOF Part of !Sequence_ontology_Lite_Version !This is only for comment; not for implementation !Comments to: song-devel@sourceforge.net $Sequence_Feature_Ontology ; SO:0000000 %sofa ; GO:0000001 %feature ; SO:20000000 %chromosome ; GO:0000005