Entrez Information Extraction
Next
Objective: Extract useful information from Entrez pages prepared by NCBI and tabulate extracted data into spreadsheets.
Public databases contain tremendous information about DNA and protein sequences. The Entrez page is one of the most important information sources. A typical Entrez page is shown below. It contains annotated sequence with detailed information about it. In general, an Entrez page is designed for viewing one sequence at a time, however it is sometimes necessary to examine large quantities of sequences such as sequences from a gene family. GeneLooper’s entrez information extraction allows you to extract important information from multiple Entrez pages and put them into spreadsheets. With the tabulated data, you can classify genes into groups based on the items extracted using simple sorting. This function gives you tremendous advantage in studying a large number of genes and closely following the frequently updated databases in NCBI.
A sample of Entrez page from NCBI:
Next
|
|
Single Sequence Utilities
High-Throughput Utilities
Sequence Formatting
Sequence Collection
Sequence Separation
Sequence Retrieving
Open Reading Frame Detection
Sequence Clustering
Multi-Sequence Similarity Search
Restriction Site Search
Translation and Reverse Complement
Hydrophobic Domain Detection
Batch Oligo Design
Entrez Information Extraction
|
|