CleanEST -- the cleansed EST libraries database
What you can do:
Browse and download cleansed EST libraries from a database server that classifies the libraries in GenBank's dbEST (database of Expressed Sequence Tags) and removes contaminants.
Highlights:
- All dbEST libraries were classified according to species and sequencing center.
- Human EST libraries were classified by anatomical and pathological systems according to eVOC ontologies.
- For each dbEST library, two sets of cleansed sequences were provided: 'pre-cleansed' and 'user-cleansed'.
- To generate pre-cleansed sequences, the EST sequences in dbEST were cleaned by aligning them against well-known contamination sources: UniVec, Escherichia coli, mitochondria, and chloroplast (for plants).
- To provide user-cleansed sequences, an automatic user-cleansing pipeline was built, in which the sequences of a user-selected library are cleansed on the fly according to user-selected options.
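The alignment-based cleansing described above can be illustrated with a minimal sketch. It is not the CleanEST pipeline itself: it assumes the alignments were already produced by some aligner and exported in the 12-column BLAST tabular layout (query ID, subject ID, percent identity, alignment length, ...). The function names, the sample records, and the thresholds (`min_identity`, `min_length`) are hypothetical choices made for illustration only.

```python
import csv
import io


def contaminated_ids(blast_tab, min_identity=90.0, min_length=50):
    """Return IDs of ESTs with at least one qualifying contaminant hit.

    blast_tab is tab-separated text in the 12-column BLAST tabular layout.
    The thresholds are illustrative assumptions, not CleanEST's settings.
    """
    flagged = set()
    for row in csv.reader(io.StringIO(blast_tab), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        qseqid, pident, length = row[0], float(row[2]), int(row[3])
        if pident >= min_identity and length >= min_length:
            flagged.add(qseqid)
    return flagged


def cleanse(ests, flagged):
    """Keep only the ESTs whose IDs were not flagged as contaminated."""
    return {est_id: seq for est_id, seq in ests.items() if est_id not in flagged}


# Hypothetical library of three ESTs (sequences shortened for display).
ests = {"EST1": "ACGTACGT", "EST2": "GGCCGGCC", "EST3": "TTAATTAA"}

# Hypothetical hits against contaminant sources; EST3's hit is too short to count.
hits = (
    "EST2\tUniVec_vec1\t98.5\t120\t2\t0\t1\t120\t5\t124\t1e-50\t200\n"
    "EST3\tEcoli_chr\t95.0\t20\t1\t0\t1\t20\t100\t119\t0.01\t30\n"
)

flagged = contaminated_ids(hits)
clean = cleanse(ests, flagged)
print(sorted(clean))
```

Exposing the thresholds as parameters loosely mirrors the idea of user-selected cleansing options: stricter or looser values change which ESTs are kept.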
Keywords:
- expressed gene sequences
- ontology
- EST
- expressed sequence tag
Literature & Tutorials:
PubMed Link: CleanEST: a database of cleansed EST libraries
This record last updated: 01-23-2009