PhyLoTA -- processing GenBank for molecular phylogenetics research

What you can do:
Offers a view of GenBank tailored for molecular phylogenetics.
Highlights:
  • The first release of the Browser is computed from 2.6 million sequences representing the taxonomically enriched subset of GenBank sequences for eukaryotes (excluding most genome survey sequences, ESTs, and other high-throughput data).
  • In addition to summarizing sequence diversity and species diversity across nodes in the NCBI taxonomy, it reports 87,000 potentially phylogenetically informative clusters of homologous sequences, which can be viewed or downloaded, along with provisional alignments and coarse phylogenetic trees. At each node in the NCBI hierarchy, the user can display a "data availability matrix" of all available sequences for entries in a subtaxa-by-clusters matrix.
  • This matrix provides a guidepost for subsequent assembly of multigene data sets or supertrees.
  • The database allows for comparison of results from previous GenBank releases, highlighting recent additions of either sequences or taxa to GenBank and letting investigators track progress on data availability worldwide.
  • Although the reported alignments and trees are extremely approximate, the database reports several statistics correlated with alignment quality to help users choose from alternative data sources.
Keywords:
  • Phylogeny
  • phylogenetics
  • GenBank
This record last updated: 11-26-2008
Report a missing or misdirected URL.

The Health Sciences Library System supports the Health Sciences at the University of Pittsburgh.

© 1996 - 2023 Health Sciences Library System, University of Pittsburgh. All rights reserved.