Sequence Clustering
Objective:
Group sequences based on sequence similarities, tabulate the clustering
data and save all sequences of each cluster in a cluster folder
for later study.
Sequence clustering is an important method for studying multiple
sequences. It groups sequences based on their sequence similarities
for the purposes of identifying the members of a gene family, alternatively
spliced variants, fragmented ESTs from a single transcript or multiple
entries of single transcript to avoid redundant work. The majority
of clustering programs use Unix or Linux as a platform and require
a high-processing computing power and professionals to prepare data
sets and generate clustering reports for field-researchers. In the
cases of clustering millions of sequences, there is no other alternative
method. However, when the total sequences to be clustered are fewer
than 50,000, such as the whole human mRNAs or more often one gene
family, those large computer-based programs become unnecessary and
sometimes inconvenient. GeneLooper’s Sequence Clustering function
is designed for effectively clustering small data sets. It is easy
to set up and requires no programming knowledge at all.
Features:
1. Cluster sequences in flexible data sets with adjustable clustering
parameters.
2. Sequences of each cluster can be saved in a common folder for
later use.
3. Detect the ORFs of all sequences.
4. Instant data reporting.
5. A clustering process can be interrupted and resumed.
|
|
Single Sequence Utilities
High-Throughput Utilities
Sequence Formatting
Sequence Collection
Sequence Separation
Sequence Retrieving
Open Reading Frame Detection
Sequence Clustering
Multi-Sequence Similarity Search
Restriction Site Search
Translation and Reverse Complement
Hydrophobic Domain Detection
Batch Oligo Design
Entrez Information Extraction
|
|