|
|
|||||||
| Genome Browser | BLAST | dictyMart | Stock Center | Research Tools | Help | Links | Contact Us | |
dictyBase Help: BLAST Searches
DescriptionBLAST stands for Basic Local Alignment Search Tool and was developed by Altschul et al. (1990). It is a very fast search algorithm that is used to search protein or DNA databases for sequence similarity. A fairly complete on-line guide to BLAST searching can be found at the NCBI BLAST Help Manual. BLAST searches offered by dictyBase allow users to compare any query sequence to D. discoideum sequence data sets. To search any other (non-Dicty) data sets, NCBI BLAST can be used. Using BLASTdictyBase offers these five BLAST programs to accommodate different types of searches:
DatabasesCoding Sequences (CDS)These are the coding sequences of the best quality sequence available for a given gene. In case a gene has a curated gene model, the database contains this sequence. Genes that are not yet curated are represented by the gene prediction of the Sequencing Center. In addition, if a gene that is in GenBank has not been mapped to the genome, the sequence from GenBank is contained in the database. In case a gene has more than one transcript, all transcripts are represented. Genomic SequencesThis database contains the genomic sequences of all genes as described above in the Coding Sequence section. The genomic sequence in general is described as the gene sequence containing all exons and intron plus 1,000 base pairs at each end 5' and 3'. Note that this can mean, as in Dictyostelium gene density is quite high, that the genomic sequence of one gene overlaps with its neighbor, resulting in two partial hits in a blast search. Note also that 1,000 base pairs are only present when available, which might not be the case at the end of a contig or for a non-mapped GenBank record. Protein Sequences:This is the protein translationof the DNA "coding sequences (CDS)".EST SequencesThis database contains EST sequences from the Japanese Sequencing Project as obtained from GenBank, and additional EST sequences contributed by H. Urushihara by direct submission to dictyBase. Full Chomosomes 1,2,3,4,5,6,MThe entries in this database are the full length chromosomes in dictyBase. In addition to chromosomes 1, 2, 3, 4, 5, 6, and M (mitochondrial), this includes 'floating contigs' which are long stretches of DNA that have been sequenced but have not been fit into an assembly yet. These contigs are in two large arbitrary concatemers, 2F and 3 F, from chromosome 2 and 3, respectively. Options
Blast at NCBIIn addition there is a button 'BLAST at NCBI' that links out directly to NCBI BLAST with the protein sequence pasted into the query window.
Accessing the BLAST Search PageBLAST can be accessed by selecting the hypertext link on the menu bar at the top of all dictyBase WWW pages or through a link in the "Associated Sequences" section on each gene page.Sequences for a BLAST search can be submitted by typing or pasting a sequence into the Query Sequence window. When the BLAST page is accessed from the "Associated Sequences" section on a gene page, the sequence of that locus is pasted automatically into the window.
Associated Glossary Terms:
|
![]() | ||||
![]() |
Return to Home | Supported by NIH (NIGMS and NHGRI) | Send a Message to the dictyBase Curators | ![]() |
![]() | ||||