blast kese kam krta h?

 **BLAST (Basic Local Alignment Search Tool)** ek powerful bioinformatics tool hai jo sequences ke beech mein similarity dhoondne mein madad karta hai. Chaliye main aapko BLAST ke kaam karne ka tareeka samjhaata hoon:


1. **Query Sequence Aur Target Sequence**:

    - Har BLAST run mein ek **query sequence** hoti hai, jo aapke paas available hai. Yeh woh sequence hai jiske similarity ko hum dhoondna chahte hain.

    - **Target sequence** ya **database** woh set of sequences hota hai jisse hum compare karte hain. Yeh database mein kai tarah ki sequences hote hain, jaise nucleotide sequences (blastn), protein sequences (blastp), ya unka combination (tblastx).


2. **Heuristic Approach**:

    - BLAST ek **heuristic method** hai, matlab yeh dynamic programming algorithm hai jo tezi se kaam karta hai. Lekin iski sensitivity thodi kam hoti hai.

    - Heuristic approach se BLAST database mein similarities dhoondta hai. Isse BLAST tezi se results deta hai.


3. **Algorithm Steps**:

    - BLAST kaam karte waqt kuch steps follow karta hai:

        - **Word Matching**: BLAST query sequence ko database mein matching words (short subsequences) ke sath compare karta hai.

        - **Scoring System**: Har matching word ko score diya jaata hai. Score us word ki similarity ko measure karta hai.

        - **Seed Extension**: High-scoring words ko extend kiya jaata hai, jisse longer alignments milte hain.

        - **Gaps and Extensions**: BLAST gaps aur extensions ko bhi consider karta hai.

        - **Statistical Significance**: BLAST results ko statistical significance ke hisab se rank karta hai.


4. **Different Types of BLAST**:

    - **blastn**: Nucleotide sequence ko nucleotide database ke sath compare karta hai.

    - **blastp**: Protein sequence ko protein database ke sath compare karta hai.

    - **blastx**: Nucleotide sequence ko protein database ke sath compare karta hai (nucleotide ko protein mein convert karke).

    - **tblastn**: Protein sequence ko nucleotide database ke sath compare karta hai (protein ko nucleotide mein convert karke).

    - **tblastx**: Nucleotide sequence ko nucleotide database ke sath protein level par compare karta hai (donon ko protein mein convert karke).


5. **Special Kinds of BLAST**:

    - **Megablast**: blastn ka ek variant hai jisme long sequences ko align kiya ja sakta hai. Yeh greedy algorithm hai aur gaps induce karta hai, lekin similar sequences ko avoid nahi karta.


BLAST se aap functional aur evolutionary relationships dhoond sakte hain aur gene families ke members ko identify kar sakte hain¹². 😊



Q. mujhe matching words wali baat smjh nhi ai? koi example se smjha do?


Bilkul! Main aapko ek simple example ke zariye samjhata hoon.


**Example**:

Sochiye aapke paas do sequences hain:


1. **Query Sequence**: `ATCGGCTA`

2. **Target Sequence**: `ATCGGCTG`


Ab hum dekhte hain ki in dono sequences mein kitni matching words (subsequences) hain:


1. **Matching Words**:

    - Hum sequence ko chhote chhote words mein divide karte hain. Yahan, har 3 nucleotides (ATC, GGC, TGA) ek word hoga.

    - Ab dekhte hain ki kitne matching words hain:

        - `ATC` dono sequences mein same hai.

        - `GGC` bhi same hai.

        - Lekin `TGA` different hai.


2. **Score Calculation**:

    - Har matching word ko score diya jaata hai. Yeh score us word ki similarity ko measure karta hai.

    - Jaise ki `ATC` ka score 3 hoga (kyunki 3 nucleotides same hain).

    - `GGC` ka bhi score 3 hoga.

    - `TGA` ka score 0 hoga (kyunki yeh different hai).


3. **Total Score**:

    - Total score `ATC` + `GGC` = 6 hoga.


Is tarah se BLAST matching words ko compare karta hai aur unka score nikalta hai. Agar aapke query sequence aur target sequence mein zyada matching words honge, to unka similarity score bhi zyada hoga. 😊


Comments

Popular posts from this blog

Blast or Fasta mn farq

Asking AI: Clustal W