Quantcast
Channel: Post Feed
Viewing all articles
Browse latest Browse all 41826

data from SRA against blast databases method.

$
0
0

Hello.

I want to find a specific gene inside SRA files and I am wondering if i am using a correct way to align those sequences because its the first time that i am "playing" with something like that.

So, here is what i've done till now.

Download .sra files from NCBI and using fastq-dump program , convert .sra files into .fastq using :

fastq --split-spot 

Then those .fastq files convert them into .fasta using :

awk 'NR % 4 == 1 || NR % 4 == 2' myfile.fastq | sed -e 's/@/>/' > myfile.fasta

Finally, convert .fasta into blast readable database using :

makeblastdb -in myfile.fasta -dbtype nucl -out myNewdb

Now having ready the Database i can run a blast using :

blastn -query query.fa -db myNewdb -task blastn -dust no -outfmt 7 -num_alignments 2 -num_descriptions 2 nucl, prot

So, is this approach right/correct? Or i have to try it somehow different ?

Thanks in advance.


Viewing all articles
Browse latest Browse all 41826

Trending Articles