Hi, I have a really large BLAST XML file - something like 30gb in size.
I'd like to reduce it so I can run through it quicker with Biopython.
Is there a way to reduce the file by keeping something like the top 25 hits based on bitscore for each query.
Preferably I'd like to do with Python/Biopython.
Thanks