NLProt -- Extracting protein names and sequences from papers

What you can do:
Find protein-names and sequences in natural language-text.
Highlights:
  • It combines dictionary- and rule-based filtering with several support vector machines (SVMs) to tag protein names in PubMed abstracts.
  • Input can be PubMed/MEDLINE identifiers, authors, titles and journals, as well as collections of abstracts, or entire papers.
  • When considering partially tagged names as errors, NLProt still reached a precision of 75% at a recall of 76%.
Keywords:
  • literature mining tools
  • literature retrieval tools
  • knowledge discovery tools
  • text mining tools
  • literature studies tools
Literature & Tutorials:
PubMed Link: NLProt
This record last updated: 04-21-2014
Report a missing or misdirected URL.

The Health Sciences Library System supports the Health Sciences at the University of Pittsburgh.

© 1996 - 2023 Health Sciences Library System, University of Pittsburgh. All rights reserved.