A Protein Classification Benchmark collection for machine learning
Use standard datasets to compare protein classification by different machine learning methods.
 
A fully automatic evolutionary classification of protein folds -- Dali Domain Dictionary version 3
Search for information on the numerical taxonomy of all known structures in the Protein Data Bank (PDB).
 
A rapid classification protocol for the CATH Domain Database to support structural genomics
Search for information on protein structures, domains, folds, and structural classifications.
 
ADDA -- a domain database with global coverage of the protein universe
Search for information on protein domain families.
 
Applications for protein sequence–function evolution data -- mRNA/protein expression analysis and coding SNP scoring tools
Conduct protein classification, expression analysis and SNP function screening.
 
BIOZON -- a hub of heterogeneous biological data
Search for a wide range of protein data and information through this unified biological database.
 
Berkeley Phylogenomics Group
Resources for structural phylogenomic analysis.
 
CD-Search -- Protein Domain Annotations on the Fly
Detect structural and functional domains in protein sequences.
 
CDD -- A curated Entrez database of conserved domain alignments
Identify conserved domains in a protein sequence.
 
CHOP -- Parsing proteins into structural domains
Chop proteins into domain-like fragments.
 
CluSTr -- The database of SWISS-PROT+TrEMBL protein clusters
Study protein classification based on an automatic classification of SWISS-PROT+TrEMBL proteins into groups of related proteins.
 
DAhunter -- Domain Architecture hunter
A retrieval tool for conserved protein domain architecture.
 
DIAL -- a web-based server for the automatic identification of structural domains in proteins
Automatically identify protein structural domains given the three-dimensional coordinates of a protein.
 
DOMAC -- a hybrid protein domain prediction server
Predict protein domains using a hybrid method.
 
DOUTfinder -- identification of distant domain outliers using subsignificant sequence similarity
Detect distantly related protein domains.
 
FISH -- family identification of sequence homologues using structure anchored hidden Markov models
Identify homologous protein domain sequences.
 
FunShift -- a database of function shift analysis on protein subfamilies
Search for information on function shifts within protein families.
 
GeneSpeed -- protein domain organization of the transcriptome
Study the PFAM protein domain content of the transcriptome (Unigene Database) for all expressed genes of Homo sapiens, Mus musculus, Drosophila melanogaster, and Caenorhabditis elegans.
 
GeneTrees -- a phylogenomics resource for prokaryotes
Search for pre-computed alignments and phylogenetic trees for all protein sequences from 325 fully sequenced and annotated prokaryote genomes.
 
Hits -- Access to databases of predicted protein sequences
Search and investigate protein motif sequences.
 
InterProScan -- protein domains identifier
Identify domains, patterns, motifs, families, and functional sites in protein (and DNA) sequences.
 
KOG -- Eukaryotic Orthologous Groups of proteins
Search Clusters of Orthologous Groups of proteins (COGs) for seven (nearly) complete eukaryotic genomes.
 
MINER -- software for phylogenetic motif identification
Identify phylogenetic motifs in protein sequences.
 
MulPSSM -- a database of multiple position-specific scoring matrices of protein domain families
Search for position-specific scoring matrices for a large number of sequence and structural families of protein domains.
 
MultiPhyl -- A high-throughput phylogenomics webserver using distributed computing
Upload multiple amino acid or nucleotide alignments and perform the tasks of ML model selection, tree searching, and bootstrapping.
 
MyHits -- An Interactive Resource for Analyzing Protein Sequences
An integrated service dedicated to the analysis of protein sequences.
 
NEWT -- a new taxonomy portal
Search taxonomy data for the complete set of species represented in SWISS-PROT, as well as those stored at the NCBI.
 
PANDIT -- an evolution-centric database of protein and associated nucleotide domains with inferred trees
Search for multiple sequence alignments and phylogenetic trees covering many common protein domains.
 
PANTHER -- A browsable database of gene products organized by biological function, using curated protein family and subfamily classification
Browse and search proteins based on their biological functions.
 
PHYML Online -- a web server for fast maximum likelihood-based phylogenetic inference
Calculate maximum likelihood phylogenies from DNA and protein sequences.
 
PIRSF -- Protein family classification system at the Protein Information Resource
Analyze protein phylogenetic profiles, reveal functional convergence and divergence, and identify relationships between homeomorphic families, domains and structural classes.
 
POWER -- PhylOgenetic WEb Repeater -- an integrated and user-optimized framework for biomolecular phylogenetic analysis
Perform user-friendly pipeline phylogenetic analysis of protein or DNA sequences.
 
PRECISE -- a Database of Predicted and Consensus Interaction Sites in Enzymes
Search for information on interactions between the amino acid residues of an enzyme and its ligands.
 
PRED-GPCR -- GPCR Recognition and Family Classification Server
Recognize and classify G-protein coupled receptors (GPCRs) at the family level.
 
PRINTS and its automatic supplement, prePRINTS -- A compendium of protein fingerprints
Search for protein fingerprints.
 
PRODOC -- a resource for the comparison of tethered protein domain architectures with in-built information on remotely related domain families
Search and compare functional domain assignments for proteins encoded in 192 complete genomes.
 
PairsDB -- atlas of protein sequence space
Explore protein sequences and their similarity relationships.
 
Pfam -- Protein Families Database
A comprehensive collection of protein domains and families, represented as multiple sequence alignments and as profile hidden Markov models.
 
Phylemon -- A suite of web tools for molecular evolution, phylogenetics and phylogenomics
Phylemon is a web server that integrates a selected suite of more than 20 different tools from the most popular stand-alone programs of phylogenetic and evolutionary analysis.
 
PhyloDome -- visualization of taxonomic distributions of domains occurring in eukaryote protein sequence sets
Visualize and analyze the phylogenetic distribution of one or more eukaryotic domains.
 
Phylocom -- software for the analysis of phylogenetic community structure and trait evolution
Calculate numerous metrics of phylogenetic community structure and trait similarity within communities.
 
Phylogeny.fr -- robust phylogenetic analysis for the non-specialist
A web service dedicated to reconstructing and analysing phylogenetic relationships between molecular sequences.
 
PipeAlign -- a new toolkit for protein family analysis
Use this protein family analysis tool to search for sequence homologues in protein and 3D structure databases and to define the hierarchical relationships within and between subfamilies.
 
ProDom and ProDom-CG -- Tools for protein domain analysis and whole genome comparisons
Search and analyze protein domain families.
 
ProtoMap -- Automatic classification of protein sequences and hierarchy of protein families
Classify protein sequences.
 
ProtoNet -- Hierarchical classification of the protein space
Classify proteins using automatic hierarchical clustering of the SWISS-PROT protein database.
 
S4 -- structure-based sequence alignments of SCOP superfamilies
Search for structure-based sequence alignments of domains in SCOP protein superfamilies.
 
SCOPEC -- Mapping of catalytic function to domain structure
Search for information on enzyme catalytic domains.
 
SEARCHPKS -- a program for detection and analysis of polyketide synthase domains
Detect and analyze polyketide synthase (PKS) domains in a polypeptide sequence.
 
SH3-Hunter -- discovery of SH3 domain interaction sites in proteins
Identify putative SH3 domain interaction sites on protein sequences.
 
SIMAP -- structuring the network of protein similarities
Search a pre-computed similarity matrix of protein sequences.
 
SMART 5 -- domains in the context of genomes and networks
Identify and annotate protein domains.
 
SUPFAM -- A database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families
Analyze and compare homologous protein families in a multiple sequence alignment database of either known or unknown structure.
 
SVM-Prot -- web-based support vector machine software for functional classification of a protein from its primary sequence
Classify a protein into a functional family from its primary sequence.
 
The COG database -- phylogenetic classification of proteins from complete genomes
Search for information on the phylogenetic classification of proteins encoded in complete genomes of different organisms.
 
The InterPro Database
Identify protein families, domains, patterns, motifs, and functional sites.
 
The MPI Bioinformatics Toolkit for protein sequence analysis
Conduct protein sequence and structure analysis using a suite of software tools.
 
The PredictProtein server
Predict protein structure and function based on protein sequence.
 
The SBASE domain sequence library -- Domain architecture prediction
Search for protein domain sequences, structures, functions, etc.
 
The SYSTERS Protein Family Database in 2005
Partition of the whole protein sequence space by a fully automatic procedure.
 
The TIGRFAMs database of protein families
Search for curated information on protein families based on hidden Markov models.
 
TreeDomViewer -- a tool for the visualization of phylogeny and protein domain structure
Visualize structure and phylogeny of protein domains.
 
Web-based toolkits for topology prediction of transmembrane helical proteins, fold recognition, structure and binding scoring, folding-kinetics analysis and comparative analysis of domain combinations
Use several Web servers for protein modeling and protein structure analysis.
 
iProClass -- An integrated database of protein family, function and structure information
Search for integrated and comprehensive information on family relationships and structural/functional features of proteins.
 

The Health Sciences Library System supports the Health Sciences at the University of Pittsburgh and the
UPMC | University of Pittsburgh Medical Center.

© 1996 - 2008 Health Sciences Library System, University of Pittsburgh. All rights reserved.