|
A Protein Classification Benchmark collection for machine learning
|
| Use standard datasets to compare protein classification by different machine learning methods.
|
| |
|
A fully automatic evolutionary classification of protein folds -- Dali Domain Dictionary version 3
|
| Search information on numerical taxonomy of all known structures in the Protein Data Bank (PDB).
|
| |
|
A rapid classification protocol for the CATH Domain Database to support structural genomics
|
| Search for information on protein structures, domains, folds, and structural classifications.
|
| |
|
ADDA -- a domain database with global coverage of the protein universe
|
| Search for information on protein domain family.
|
| |
|
Applications for protein sequence–function evolution data -- mRNA/protein expression analysis and coding SNP scoring tools
|
| Conduct protein classification, expression analysis and SNP function screening.
|
| |
|
BIOZON -- a hub of heterogeneous biological data
|
| Search for a wide range of protein data and information through this unified biological database.
|
| |
|
Berkeley Phylogenomics Group
|
| Resources for structural phylogenomic analysis.
|
| |
|
CD-Search -- Protein Domain Annotations on the Fly
|
| Detect structural and functional domains in protein sequences.
|
| |
|
CDD -- A curated Entrez database of conserved domain alignments
|
| Identify conserved domain in a protein sequence.
|
| |
|
CHOP -- Parsing proteins into structural domains
|
| Chop proteins into domain-like fragments.
|
| |
|
CluSTr -- The database of SWISS-PROT+TrEMBL protein clusters
|
| Study protein classification based on an automatic classification of SWISS-PROT+TrEMBL proteins into groups of related proteins.
|
| |
|
DAhunter -- Domain Architecture hunter
|
| A retrieval tool for conserved protein domain architecture.
|
| |
|
DIAL -- a web-based server for the automatic identification of structural domains in proteins
|
| Automatically identify protein structural domains given the three-dimensional coordinates of a protein.
|
| |
|
DOMAC -- a hybrid protein domain prediction server
|
| An accurate protein domain prediction server.
|
| |
|
DOUTfinder — identification of distant domain outliers using subsignificant sequence similarity
|
| Detect distantly related protein domains.
|
| |
|
FISH — family identification of sequence homologues using structure anchored hidden Markov models
|
| Identify homologous protein domain sequences.
|
| |
|
FunShift -- a database of function shift analysis on protein subfamilies
|
| Search for information on functional shift within protein family.
|
| |
|
GeneSpeed -- protein domain organization of the transcriptome
|
| Study the PFAM protein domain content of the transcriptome (Unigene Database) for all expressed genes of Homo sapiens, Mus musculus, Drosophila melanogaster, and Caenorhabditis elegans.
|
| |
|
GeneTrees -- a phylogenomics resource for prokaryotes
|
| Search for pre-computed alignments and phylogenetic trees for all protein sequences from 325 fully sequenced and annotated prokaryote genomes.
|
| |
|
Hits -- Access to databases of predicted protein sequences
|
| Search and investigate protein motif sequences.
|
| |
|
InterProScan -- protein domains identifier
|
| Identify protein family (and DNA) domains, patterns, motifs, protein families, and functional sites.
|
| |
|
KOG - Eukaryotic Orthologous Groups of proteins
|
| Search Clusters of Orthologous Groups of protein (COGs) for seven (nearly) complete eukaryotic genomes. |
| |
|
MINER -- software for phylogenetic motif identification
|
| Identify phylogenetic motifs in protein sequences. |
| |
|
MulPSSM -- a database of multiple position-specific scoring matrices of protein domain families
|
| Search for position-specific scoring matrices for a large number of sequence and structural families of protein domains.
|
| |
|
MultiPhyl -- A high-throughput phylogenomics webserver using distributed computing
|
| Upload multiple amino acid or nucleotide alignments and perform the tasks of ML model selection, tree searching, and bootstrapping.
|
| |
|
MyHits -- An Interactive Resource for Analyzing Protein Sequences
|
| An integrated service dedicated to the analysis of protein sequences.
|
| |
|
NEWT -- a new taxonomy portal
|
| Search taxonomy data for the complete set of species represented in SWISS-PROT, as well as those stored at the NCBI.
|
| |
|
PANDIT -- an evolution-centric database of protein and associated nucleotide domains with inferred trees
|
| Search for multiple sequence alignments and phylogenetic trees covering many common protein domains. |
| |
|
PANTHER -- A browsable database of gene products organized by biological function, using curated protein family and subfamily classification
|
| Browse and search proteins based on their biological functions. |
| |
|
PHYML Online--a web server for fast maximum likelihood-based phylogenetic inference
|
| Calculate maximum likelihood phylogenies from DNA and protein sequences.
|
| |
|
PIRSF -- Protein family classification system at the Protein Information Resource
|
| To analyze protein phylogenetic profiles, to reveal functional convergence and divergence, and to identify relationships between homeomorphic families, domains and structural classes.
|
| |
|
POWER -- PhylOgenetic WEb Repeater—an integrated and user-optimized framework for biomolecular phylogenetic analysis
|
| Perform user-friendly pipeline phylogenetic analysis of protein or DNA sequences. |
| |
|
PRECISE -- a Database of Predicted and Consensus Interaction Sites in Enzymes
|
| Search for information on interactions between the amino acid residues of an enzyme and its ligands.
|
| |
|
PRED-GPCR -- GPCR Recognition and Family Classification Server
|
| Recognize and classify G-protein coupled receptors (GPCRs) at the family level. |
| |
|
PRINTS and its automatic supplement, prePRINTS -- A compendium of protein fingerprints
|
| Search for protein fingerprints.
|
| |
|
PRODOC -- a resource for the comparison of tethered protein domain architectures with in-built information on remotely related domain families
|
| Search and compare functional domain assignments for proteins encoded in 192 complete genomes.
|
| |
|
PairsDB -- atlas of protein sequence space
|
| Explore protein sequences and their similarity relationships.
|
| |
|
Pfam -- Protein Families Database
|
| A comprehensive collection of protein domains and families, represented as multiple sequence alignments and as profile hidden Markov models.
|
| |
|
Phylemon -- A suite of web tools for molecular evolution, phylogenetics and phylogenomics
|
| Phylemon is a web server that integrates a selected suite of more than 20 different tools from the most popular stand-alone programs of phylogenetic and evolutionary analysis.
|
| |
|
PhyloDome —- visualization of taxonomic distributions of domains occurring in eukaryote protein sequence sets
|
| Visualize and analyze the phylogenetic distribution of one or more eukaryotic domains.
|
| |
|
Phylocom -- software for the analysis of phylogenetic community structure and trait evolution
|
| Calculates numerous metrics of phylogenetic community structure and trait similarity within communities.
|
| |
|
Phylogeny.fr -- robust phylogenetic analysis for the non-specialist
|
| A web service dedicated to reconstructing and analysing phylogenetic relationships between molecular sequences.
|
| |
|
PipeAlign -- a new toolkit for protein family analysis
|
| Use this protein family analysis tool to search for sequence homologues in protein and 3D structure databases, the definition of the hierarchical relationships within and between subfamilies.
|
| |
|
ProDom and ProDom-CG -- Tools for protein domain analysis and whole genome comparisons
|
| Search and analyze protein domain families.
|
| |
|
ProtoMap -- Automatic classification of protein sequences and hierarchy of protein families
|
| Classify protein sequences.
|
| |
|
ProtoNet -- Hierarchical classification of the protein space
|
| Classify proteins using automatic hierarchical clustering of the SWISS-PROT protein database.
|
| |
|
S4 -- structure-based sequence alignments of SCOP superfamilies
|
| Search for structure-based sequence alignments of domains in SCOP protein superfamilies.
|
| |
|
SCOPEC -- Mapping of catalytic function to domain structure
|
| Search for information on enzyme catalytic domains. |
| |
|
SEARCHPKS -- a program for detection and analysis of polyketide synthase domains
|
| Detect and analyze polyketide synthase (PKS) domains in a polypeptide sequence.
|
| |
|
SH3-Hunter -- discovery of SH3 domain interaction sites in proteins
|
| Identify putative SH3 domain interaction sites on protein sequences.
|
| |
|
SIMAP -- structuring the network of protein similarities
|
| Search for pre-computed similarity matrix of protein sequences.
|
| |
|
SMART 5 -- domains in the context of genomes and networks
|
| To identify and annotate protein domains.
|
| |
|
SUPFAM — A database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families
|
| Analyze and compare homologous protein families in a multiple sequence alignment database of either known or unknown structure.
|
| |
|
SVM-Prot -- web-based support vector machine software for functional classification of a protein from its primary sequence
|
| Classify a protein into functional family from its primary sequence.
|
| |
|
The COG database -- phylogenetic classification of proteins from complete genomes
|
| Search information on phylogenetic classification of proteins encoded in complete genomes of different organisms. |
| |
|
The InterPro Database
|
| Identify protein family, domains, patterns, motifs, protein families, and functional sites.
|
| |
|
The MPI Bioinformatics Toolkit for protein sequence analysis
|
| Conduct protein sequence and structure analysis using a suite of software tools. |
| |
|
The PredictProtein server
|
| Predict protein structure and function based on protein sequence. |
| |
|
The SBASE domain sequence library -- Domain architecture prediction
|
| Search for protein domain sequences, structures, functions,etc.
|
| |
|
The SYSTERS Protein Family Database in 2005
|
| Partition of the whole protein sequence space by a fully automatic procedure.
|
| |
|
The TIGRFAMs database of protein families
|
| Search for curated information of protein families based on Hidden Markov Models.
|
| |
|
TreeDomViewer -- a tool for the visualization of phylogeny and protein domain structure
|
| Visualize structure and phylogeny of protein domains.
|
| |
|
Web-based toolkits for topology prediction of transmembrane helical proteins, fold recognition, structure and binding scoring, folding-kinetics analysis and comparative analysis of domain combinations
|
| Use several Web servers to conduct various protein modeling and protein structure analysis. |
| |
|
iProClass -- An integrated database of protein family, function and structure information
|
| Search for integrated and comprehensive information on family relationships and structural/functional features of proteins.
|
| |