Hi,
I have a protein sequence which I first do a blastp against NR database without any limitations. It returns a possible protein family and an organism in the output with a very low E-value (something like 3E-143). This all works fine, but I am told that there's a very high possibility that the sequence comes from a nematode. So I redo blastp now limiting the taxa to nematodes. I get just one decent hit, with the follwing header
ENA|CL652565|CL652565.1 PRI0115a_H01 - PRI0115a.B21 (762) Mixed stage fosmid library of P. pacificus var. California Pristionchus pacificus genomic, genomic survey sequence.
I don't understand this header completely (what does PRI0115a_H01 - PRI0115a.B21 (762) mean?), but from what I can tell it is not showing me the protein family, just the name of the organism. My question is, should the protein family differ from the first case (I tried all reading frames), it shouldn't right? So why is it not shown this time? So can I deduce the organism from my second query and the protein family from the first? I am kinda new to bio and bioinformatics and have been directly put into all this, so don't mind if I have gotten something completely wrong.