top of page

NCBI BLAST Database Searching

You're currently learning a lecture from the course: 

... 

Prerequisite Terminologies

In order to have thorough understanding of the main topic, you should have the basic concept of the following terms:

Primary Biological Sequences.
GenBank.
NCBI.
Python and R.
FASTA format.
[watch our video on FASTA format]
Non-redundant (nr/nt) database.

Duration: 

Transcription

By:

Hamsa Ehsan

Introduction:

In bioinformatics, Basic Local Alignment Search Tool (BLAST) is an algorithm and program for comparing primary biological sequence information, such as the amino-acid sequences of proteins or the nucleotides of DNA and/or RNA sequences. Query is searched against the entire database of GenBank or the entire datasets which have been incorporated in NCBI.


Steps:

  • You can access BLAST whether from web-browser, or through installing the command line BLAST to your computer or through programming languages such as Python and R.

  • Go to the BLAST web-browser.

  • Here you will see four kind of Blast which you can use.     


BLAST Type                                          Query Type                                         Database type

Nucleotide BLAST                             Nucleotide                                        Nucleotide

Protein BLAST                                     Protein                                               Protein

BLASTX                                                Translated Nucleotide                    Protein

TBLASTN                                             Protein                                               Translated Nucleotide


Nucleotide BLAST

  • A page will be opened, where you can customize your search according to your analysis.

  • In the ‘Enter Query Search’ box, you can put accession number or copy the FASTA sequence of your nucleotide query.

  • Beside this box, you can see ‘Query subrange’ box, from where you can subset your query and tell from what nucleotide to what nucleotide you want to search.

  • You can also upload the sequence file from your computer.

  • In ‘Job Title’, you can give a descriptive title for your BLAST search.

  • In ‘Database’, you can select any kind of database you want to search against; For nucleotide sequences, non-redundant (nr/nt) database is a good option.

  • In ‘Organism’ (which is optional), you can select the organism or organisms you want to search against or you can exclude those organisms from your search.

  • All of the next parameters are optional, which won’t matter until you want to include them according to your own analysis.

  • In ‘Program Selection’ section, you can select the option which is better for your own analysis.

  • Click on the ‘BLAST’.

  • The BLAST analysis will be opened, where you can analyze your query with many other aligned nucleotide sequences.

TBLASTN

  • From Blast web-browser, go to ‘tblastn’.

  • Put in the accession number or FASTA sequence in the query box.

  • Select nucleotide non-redundant(nr/nt) database from the ‘Database’.

  • BLAST it.

  • The analysis page is same as any other kind of BLAST (for example Nucleotide Blast).

  • You can analyze your query with aligned mRNA sequences.

  • You can see that ‘Percentage Identity’ is lower as compared to Nucleotide BLAST, it’s because of Codon Degeneracy.

  • You can also visualize your sequences in many kinds of alignments.


Summary:

In this video, we learnt about BLAST and how we can use it to compare different biological sequences to different databases.

File(s) Section

If a particular file is required for this video, and was discussed in the lecture, you can download it by clicking the button below.

Useful Data
bottom of page