Protein Number of BLASTphits Numberof BLASTphits chosenfor MSA QS Coverage threshold Number ofhits pooled forMSA§ NumberofHMM
hits
Highest E-value (HMM search)
Total Hitsfor
further analysis
CategoryI              
CjPglC 1205 60 95 NA 1212 150 9.2E-35
GsWsaP 1476 80 90 NA 1684 557 3.0E-12
HpWecA 1080 26 80 NA 1498 521 1.1E-12
PaWbpL 1381 38 85 NA 1502 596 1.1E-15
PaWsfP 1411 38 90 NA 1645 502 7.1E-69
SeWbaP 1207 133 90 NA 1211 1152 1.1E-20
SpWchA 1206 28 85 NA 1606 513 4.6E-62
YeWbcO 1355 44 85 NA 1506 583 1.9E-19
EcWecA 976 49 95 118 1495 531 2.5E-13
KpWecA 764 67 90
YeWecA 879 117 90
NgPglB 1204 54 95 56 1211 197 3.3E-34
NmPglB 1212 51 95
NmPglB2 1212 55 95
CategoryII              
NmPglB2 129 32 85 NA 6110 569 0.064
NmPglC 1867 99 95 NA 4586 1411 2.9E-31
NmPglD 1660 71 95 NA 3767 357 1.0E-122
NmPglB 2480 54 85 105 6654 231 2.8E-20
NgPglB 523 64 95
CategoryIII              
NgPglA 2824 77 95 NA 9162 2703 0.079
NmPglH 234 26 70 NA 7788 1005 4.3E-05
NmPglG 2290 52 95 NA 9652 4139 0.1
NmPglE 1253 37 45 NA 7011 367 4.4E-05
CategoryIV              
SeWzx 43 19 70 NA 296 129 0.083
EcWzm 488 96 95 NA 2150 571 5.7E-06
PaWzx 62 25 65 NA 672 360 0.1
BfWzx 295 48 85 NA 1561 604 0.077
NsPglF¶ 52 NA NA NA NA 11 NA
EcWzm 278 46 95 NA 1190 472 0.071
PbaWzm 348 50 88 NA 1823 559 0.005
PbaWzt 49642 82 85 NA 16352 160 6.0E-24
EcWzt_I 30738 62 90 65 16575 125 1.8E-24
EcWzt_II 30625 47 90
CategoryV              
PaPilO¶ 21 NA NA NA NA 3 NA
PaWaaL¶ 55 NA NA NA NA 5 NA
HpWaaL¶ 22 NA NA NA NA 7 NA
PgWaaL¶ 18 NA NA NA NA 2 NA
HpWaaL-G¶ 42 NA NA NA NA 7 NA
AaWaaL 109 26 65# 26 670 145 0.046
NmPglL 63 24 65# 24 477 49 3.9E-38
NmOTase 63 24 65#
NA denotes not applicable §Hits were grouped together when query sequence identity is ≥ 75%
BLAST hits with query coverage ≥ 70% selected directly for final analysis and HMM profiles were not generated
#Only query coverage was taken in these cases
Table 1: Number of hits obtained from BLAST and HMM searches .