Dear all,
I have blast result in tabulated format. Each of the query hits multiple subject sequences. From all of those hits, I want to assign that subject sequence which covers the query sequence the most. Does any of you have a way to achieve this. I would really appreciate your ideas and resources.
EDIT: I should have been much clear in my question. I have the following file. The best hit is uti_contig0036, but still "uti_contig7141" covers the query more than "uti_contig0036".
contig996
uti_contig0036
80.4
903
110
34
7491
8352
2709010
2709886
1.00E-175
625
contig996
uti_contig0036
85.28
197
20
3
7793
7986
6423228
6423418
3.00E-46
195
contig996
uti_contig0036
76.76
327
46
15
7868
8168
6388503
6388181
2.00E-34
156
contig996
uti_contig0036
88.51
87
9
1
8203
8288
855009
855095
6.00E-19
104
contig996
uti_contig0036
93.44
61
3
1
8228
8288
3695407
3695348
2.00E-14
89.8
contig996
uti_contig0036
93.75
48
3
0
23516
23563
2295958
2295911
2.00E-09
73.1
contig996
uti_contig7141
80.22
819
97
35
7481
8280
2616628
2617400
1.00E-154
555
contig996
uti_contig7141
81.26
619
66
20
7721
8320
468207
467620
1.00E-124
455
contig996
uti_contig7141
...
↧