BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= TBN1_____ (277 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_D1HBQ0 Whole genome shotgun sequence of line PN40024, s... 316 4e-85 UniRef50_Q9SXA6 Bifunctional nuclease bfn1 n=20 Tax=Magnoliophyt... 310 3e-83 UniRef50_B9HYZ1 Predicted protein n=20 Tax=Spermatophyta RepID=B... 300 4e-80 UniRef50_Q9LGA5 Os01g0128200 protein n=12 Tax=Magnoliophyta RepI... 297 2e-79 UniRef50_Q8LA68 Endonuclease, putative n=13 Tax=Embryophyta RepI... 296 6e-79 UniRef50_C3VEY2 Bifunctional nuclease n=2 Tax=rosids RepID=C3VEY... 291 2e-77 UniRef50_A5A339 Endonuclease n=1 Tax=Glycine max RepID=A5A339_SOYBN 273 4e-72 UniRef50_A9U2Y4 Predicted protein n=1 Tax=Physcomitrella patens ... 255 9e-67 UniRef50_B6H0E5 Pc12g06500 protein n=2 Tax=Penicillium RepID=B6H... 242 8e-63 UniRef50_B8MCF5 Nuclease PA3, putative n=2 Tax=Leotiomyceta RepI... 242 9e-63 UniRef50_B0DXE1 Predicted protein n=4 Tax=Agaricales RepID=B0DXE... 238 2e-61 UniRef50_D1Z5H6 Whole genome shotgun sequence assembly, scaffold... 236 5e-61 UniRef50_P24021 Nuclease S1 n=6 Tax=Leotiomyceta RepID=NUS1_ASPOR 234 2e-60 UniRef50_A0M3W8 S1/P1 endonuclease family protein n=6 Tax=Bacter... 233 6e-60 UniRef50_C5K479 Nuclease PA3, putative n=5 Tax=Perkinsus marinus... 233 6e-60 UniRef50_O65424 Putative bifunctional nuclease n=2 Tax=Arabidops... 229 1e-58 UniRef50_A7ETG5 Putative uncharacterized protein n=1 Tax=Sclerot... 227 4e-58 UniRef50_Q0CD39 Predicted protein n=2 Tax=Aspergillus RepID=Q0CD... 225 9e-58 UniRef50_C5K482 Nuclease PA3, putative n=2 Tax=Perkinsus marinus... 224 3e-57 UniRef50_B7FP92 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 223 5e-57 UniRef50_Q7S8Q5 Predicted protein n=1 Tax=Neurospora crassa RepI... 223 6e-57 UniRef50_B0MYD6 Putative uncharacterized protein n=1 Tax=Alistip... 222 1e-56 UniRef50_D1ZIR6 Whole genome shotgun sequence assembly, scaffold... 221 2e-56 UniRef50_A3XR21 Putative S1/P1 Nuclease n=1 Tax=Leeuwenhoekiella... 221 2e-56 UniRef50_Q2SFD4 Probable endonuclease n=1 Tax=Hahella chejuensis... 219 1e-55 UniRef50_B8NJ54 Nuclease S1, putative n=1 Tax=Aspergillus flavus... 216 9e-55 UniRef50_Q3IBZ8 Putative S1/P1 Nuclease n=2 Tax=Alteromonadales ... 214 3e-54 UniRef50_Q4DEV4 Class I nuclease-like protein, putative n=2 Tax=... 214 3e-54 UniRef50_C5KMC3 Nuclease PA3, putative n=1 Tax=Perkinsus marinus... 212 9e-54 UniRef50_Q0AMT2 S1/P1 nuclease n=1 Tax=Maricaulis maris MCS10 Re... 211 2e-53 UniRef50_B2W4S8 Nuclease PA3 n=2 Tax=Pleosporineae RepID=B2W4S8_... 211 2e-53 UniRef50_C7PH62 S1/P1 nuclease n=1 Tax=Chitinophaga pinensis DSM... 211 2e-53 UniRef50_A6EB04 Putative S1/P1 Nuclease n=1 Tax=Pedobacter sp. B... 211 3e-53 UniRef50_P24504 Nuclease PA3 n=2 Tax=Penicillium RepID=NUP3_PENSQ 211 3e-53 UniRef50_C8WD33 S1/P1 nuclease n=5 Tax=Alphaproteobacteria RepID... 210 6e-53 UniRef50_Q15ZB2 S1/P1 nuclease n=4 Tax=Alteromonadales RepID=Q15... 209 9e-53 UniRef50_C6X5W4 S1/P1 endonuclease family protein n=3 Tax=Bacter... 208 2e-52 UniRef50_A4BZ60 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriacea... 208 2e-52 UniRef50_C6XYC1 S1/P1 nuclease n=1 Tax=Pedobacter heparinus DSM ... 208 2e-52 UniRef50_Q5FP59 Nuclease S1 n=1 Tax=Gluconobacter oxydans RepID=... 208 2e-52 UniRef50_A4C4V1 Putative S1/P1 Nuclease n=1 Tax=Pseudoalteromona... 204 2e-51 UniRef50_Q3BPV9 Endonuclease S1 n=15 Tax=Bacteria RepID=Q3BPV9_X... 203 5e-51 UniRef50_C5PWU6 S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritiv... 202 9e-51 UniRef50_C5K8A7 Nuclease S1, putative n=7 Tax=Perkinsus marinus ... 201 2e-50 UniRef50_A4HI96 p1/s1 nuclease n=10 Tax=Leishmania RepID=A4HI96_... 201 2e-50 UniRef50_C9ZQW0 Single strand-specific nuclease, putative n=6 Ta... 201 2e-50 UniRef50_C6XIU0 S1/P1 nuclease n=1 Tax=Hirschia baltica ATCC 498... 200 4e-50 UniRef50_Q1YUT9 Probable endonuclease n=1 Tax=gamma proteobacter... 200 7e-50 UniRef50_A2QX99 Contig An11c0270, complete genome n=1 Tax=Asperg... 198 2e-49 UniRef50_B0DTT7 Predicted protein n=2 Tax=Laccaria bicolor S238N... 198 2e-49 UniRef50_Q01U80 S1/P1 nuclease n=1 Tax=Candidatus Solibacter usi... 196 7e-49 UniRef50_Q989R8 Endonuclease n=1 Tax=Mesorhizobium loti RepID=Q9... 196 8e-49 UniRef50_Q7P202 Probable endonuclease n=1 Tax=Chromobacterium vi... 193 4e-48 UniRef50_A6GGE9 Probable endonuclease n=1 Tax=Plesiocystis pacif... 193 4e-48 UniRef50_B8KH31 S1/P1 nuclease n=1 Tax=gamma proteobacterium NOR... 193 5e-48 UniRef50_A3HUK9 Putative S1/P1 Nuclease n=1 Tax=Algoriphagus sp.... 193 7e-48 UniRef50_A0BLJ0 Chromosome undetermined scaffold_114, whole geno... 193 8e-48 UniRef50_A8HTU7 Endonuclease n=1 Tax=Azorhizobium caulinodans OR... 192 1e-47 UniRef50_A2EEH7 Putative uncharacterized protein n=1 Tax=Trichom... 192 1e-47 UniRef50_A2ELH6 Class I nuclease, putative n=1 Tax=Trichomonas v... 191 2e-47 UniRef50_Q4QGQ3 3'-nucleotidase/nuclease, putative n=3 Tax=Leish... 191 2e-47 UniRef50_Q25267 3'-nucleotidase/nuclease n=7 Tax=Trypanosomatida... 191 3e-47 UniRef50_B9XJ21 S1/P1 nuclease n=1 Tax=bacterium Ellin514 RepID=... 190 4e-47 UniRef50_A4CQ68 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriales... 190 4e-47 UniRef50_A2ECC5 Class I nuclease, putative n=2 Tax=Trichomonas v... 190 5e-47 UniRef50_B0DTT9 Predicted protein n=2 Tax=Agaricales RepID=B0DTT... 188 1e-46 UniRef50_Q04SY8 Nuclease S1 n=4 Tax=Leptospira RepID=Q04SY8_LEPBJ 187 4e-46 UniRef50_Q7RSD2 3'-nucleotidase/nuclease n=8 Tax=Plasmodium RepI... 186 5e-46 UniRef50_B0T6T3 S1/P1 nuclease n=1 Tax=Caulobacter sp. K31 RepID... 186 8e-46 UniRef50_C5LHN6 ATP-dependent RNA helicase, putative n=4 Tax=Per... 185 1e-45 UniRef50_Q236I5 S1/P1 Nuclease n=2 Tax=Tetrahymena thermophila S... 183 4e-45 UniRef50_Q1N3Y8 Probable endonuclease n=1 Tax=Bermanella marisru... 183 4e-45 UniRef50_A2G6P9 Putative uncharacterized protein n=1 Tax=Trichom... 183 5e-45 UniRef50_C5LN34 S1/P1nuclease, putative n=7 Tax=Perkinsus marinu... 183 6e-45 UniRef50_Q2N7X6 Endonuclease n=3 Tax=Erythrobacter RepID=Q2N7X6_... 181 3e-44 UniRef50_A4A822 Nuclease S1 n=1 Tax=Congregibacter litoralis KT7... 181 3e-44 UniRef50_A2E6R1 Putative uncharacterized protein n=2 Tax=Trichom... 179 8e-44 UniRef50_B6DTM7 Single-strand-specific nuclease n=2 Tax=Bodo sal... 178 2e-43 UniRef50_A7H7R9 S1/P1 nuclease n=1 Tax=Anaeromyxobacter sp. Fw10... 176 5e-43 UniRef50_Q5ZV70 3'-nucleotidase/nuclease n=6 Tax=Legionella RepI... 176 6e-43 UniRef50_Q23AG7 Putative uncharacterized protein n=2 Tax=Tetrahy... 176 7e-43 UniRef50_B9XA25 Putative uncharacterized protein n=1 Tax=bacteri... 176 9e-43 UniRef50_UPI0001745ECB hypothetical protein VspiD_30620 n=1 Tax=... 175 2e-42 UniRef50_A2F450 Putative uncharacterized protein n=1 Tax=Trichom... 174 4e-42 UniRef50_O68530 Endonuclease S1 homolog n=1 Tax=Mesorhizobium lo... 172 1e-41 UniRef50_A9UZI8 Predicted protein n=1 Tax=Monosiga brevicollis R... 172 1e-41 UniRef50_UPI00006CE90A hypothetical protein TTHERM_00559790 n=1 ... 170 5e-41 UniRef50_Q4PFZ0 Putative uncharacterized protein n=1 Tax=Ustilag... 169 7e-41 UniRef50_O65425 Putative bifunctional nuclease n=1 Tax=Arabidops... 169 8e-41 UniRef50_A2E030 Putative uncharacterized protein n=1 Tax=Trichom... 169 1e-40 UniRef50_UPI000150A357 hypothetical protein TTHERM_00515230 n=1 ... 168 1e-40 UniRef50_Q8ILX4 p1/s1 nuclease, putative n=4 Tax=Plasmodium RepI... 168 3e-40 UniRef50_Q560K3 Putative uncharacterized protein n=2 Tax=Filobas... 167 4e-40 UniRef50_B3LAP6 Putative uncharacterized protein n=1 Tax=Plasmod... 165 2e-39 UniRef50_A2FAR0 Putative uncharacterized protein n=1 Tax=Trichom... 164 3e-39 UniRef50_D0NJT7 Putative uncharacterized protein n=1 Tax=Phytoph... 163 5e-39 UniRef50_A0Z194 Endonuclease S1 n=1 Tax=marine gamma proteobacte... 163 6e-39 UniRef50_C5LKE6 Putative uncharacterized protein n=2 Tax=Perkins... 160 6e-38 UniRef50_D2QW83 Putative uncharacterized protein n=1 Tax=Pirellu... 156 5e-37 UniRef50_B8P2Q4 Predicted protein n=2 Tax=Postia placenta Mad-69... 156 6e-37 UniRef50_B6ABV1 Putative uncharacterized protein n=1 Tax=Cryptos... 156 1e-36 UniRef50_A6C3P1 Putative uncharacterized protein n=1 Tax=Plancto... 155 1e-36 UniRef50_Q47K45 Putative uncharacterized protein n=1 Tax=Dechlor... 154 4e-36 UniRef50_B2JAU7 Putative uncharacterized protein n=1 Tax=Nostoc ... 152 1e-35 UniRef50_A3FPP7 S1/P1nuclease, putative n=2 Tax=Cryptosporidium ... 151 3e-35 UniRef50_C9SGH7 Nuclease PA3 n=1 Tax=Verticillium albo-atrum VaM... 150 5e-35 UniRef50_B6KFB6 Putative uncharacterized protein n=3 Tax=Toxopla... 149 8e-35 UniRef50_B8KWM0 Putative uncharacterized protein n=1 Tax=gamma p... 148 2e-34 UniRef50_B6KF36 Putative uncharacterized protein n=3 Tax=Toxopla... 148 2e-34 UniRef50_C7RIT3 Putative uncharacterized protein n=1 Tax=Candida... 148 3e-34 UniRef50_A4BF01 Probable endonuclease n=1 Tax=Reinekea blandensi... 143 1e-32 UniRef50_D0Y4Z6 Phospholipase C/P1 nuclease domain-containing pr... 141 2e-32 UniRef50_A8PCL3 Putative uncharacterized protein n=1 Tax=Coprino... 141 3e-32 UniRef50_B4CYG7 Putative uncharacterized protein n=1 Tax=Chthoni... 138 2e-31 UniRef50_B3L390 S1/p1nuclease, putative n=8 Tax=Plasmodium RepID... 136 7e-31 UniRef50_A4YRX0 Putative uncharacterized protein n=2 Tax=Bradyrh... 135 2e-30 UniRef50_Q6LI73 Hypothetical endonuclease n=2 Tax=Photobacterium... 131 4e-29 UniRef50_C5KYE5 S1/P1nuclease, putative n=6 Tax=Perkinsus marinu... 126 7e-28 UniRef50_A2DKF6 Putative uncharacterized protein n=1 Tax=Trichom... 126 8e-28 UniRef50_Q4UCH4 Bifunctional nuclease, putative n=2 Tax=Theileri... 123 8e-27 UniRef50_A4KXI8 Putative S1/P1 nuclease n=2 Tax=Ascovirus RepID=... 122 1e-26 UniRef50_A2DRT9 Putative uncharacterized protein n=1 Tax=Trichom... 118 2e-25 UniRef50_C5BI21 Putative uncharacterized protein n=1 Tax=Teredin... 115 1e-24 UniRef50_Q8XRE8 Putative signal peptide protein n=1 Tax=Ralstoni... 115 2e-24 UniRef50_A2FG69 Putative uncharacterized protein n=1 Tax=Trichom... 114 5e-24 UniRef50_D2LJW8 Putative uncharacterized protein n=1 Tax=Rhodomi... 112 1e-23 UniRef50_C9YFD1 Putative uncharacterized protein n=2 Tax=Curviba... 112 1e-23 UniRef50_B0DTT4 Predicted protein n=1 Tax=Laccaria bicolor S238N... 112 2e-23 UniRef50_Q0E526 29.6 kDa S1/P1 nuclease n=1 Tax=Spodoptera frugi... 109 1e-22 UniRef50_A5FFX0 Putative uncharacterized protein n=1 Tax=Flavoba... 105 2e-21 UniRef50_A2EIL3 Putative uncharacterized protein n=1 Tax=Trichom... 102 1e-20 UniRef50_A2F5A5 Putative uncharacterized protein n=2 Tax=Trichom... 100 8e-20 UniRef50_Q11TZ7 Putative uncharacterized protein n=1 Tax=Cytopha... 99 2e-19 UniRef50_B9EZB3 Putative uncharacterized protein n=1 Tax=Oryza s... 96 1e-18 UniRef50_C0A652 Putative uncharacterized protein n=1 Tax=Opituta... 95 3e-18 UniRef50_C7PNU1 Putative uncharacterized protein n=1 Tax=Chitino... 94 5e-18 UniRef50_D2QFB3 Putative uncharacterized protein n=1 Tax=Spiroso... 93 7e-18 UniRef50_D0NJT6 Putative uncharacterized protein n=1 Tax=Phytoph... 92 3e-17 UniRef50_Q21JG1 Putative uncharacterized protein n=1 Tax=Sacchar... 91 3e-17 UniRef50_C7J139 Os04g0636400 protein n=2 Tax=Oryza sativa RepID=... 87 6e-16 UniRef50_A2G9R8 Putative uncharacterized protein n=1 Tax=Trichom... 86 1e-15 UniRef50_A2FZN6 Putative uncharacterized protein n=1 Tax=Trichom... 84 5e-15 UniRef50_C6VWZ8 Putative uncharacterized protein n=1 Tax=Dyadoba... 81 6e-14 UniRef50_Q4Q7F8 Class i nuclease-like protein n=4 Tax=Leishmania... 80 9e-14 UniRef50_A7ARD9 S1/P1 nuclease, putative n=1 Tax=Babesia bovis R... 80 9e-14 UniRef50_D1ZW87 Whole genome shotgun sequence assembly, contig_8... 79 1e-13 UniRef50_B6KMV3 Putative uncharacterized protein n=3 Tax=Toxopla... 78 2e-13 UniRef50_A6E734 Putative uncharacterized protein n=1 Tax=Pedobac... 75 2e-12 UniRef50_B3EUC7 Putative uncharacterized protein n=2 Tax=Bactero... 74 4e-12 UniRef50_C5PTL3 Putative uncharacterized protein n=1 Tax=Sphingo... 74 4e-12 UniRef50_B9TFK5 Putative uncharacterized protein (Fragment) n=1 ... 74 5e-12 UniRef50_C2G3H0 Possible S1/P1 Nuclease n=1 Tax=Sphingobacterium... 74 5e-12 UniRef50_C6Y3Y4 Putative uncharacterized protein n=1 Tax=Pedobac... 73 1e-11 UniRef50_A2G9R9 Putative uncharacterized protein n=1 Tax=Trichom... 71 6e-11 UniRef50_C2FVU8 Putative uncharacterized protein n=1 Tax=Sphingo... 69 1e-10 UniRef50_C9YFD0 Putative uncharacterized protein n=2 Tax=Curviba... 68 5e-10 UniRef50_C5SFS5 Putative uncharacterized protein n=1 Tax=Asticca... 62 2e-08 UniRef50_B1ZQR9 Putative uncharacterized protein n=2 Tax=Verruco... 61 5e-08 UniRef50_B1MDJ0 Putative uncharacterized protein n=1 Tax=Mycobac... 61 7e-08 UniRef50_B5YKD8 Putative uncharacterized protein n=1 Tax=Thermod... 60 7e-08 UniRef50_C5GNE5 Predicted protein n=1 Tax=Ajellomyces dermatitid... 56 1e-06 UniRef50_C1F7J9 Putative uncharacterized protein n=1 Tax=Acidoba... 56 2e-06 UniRef50_C5JC63 Predicted protein n=1 Tax=Ajellomyces dermatitid... 54 5e-06 UniRef50_A3HWS6 Putative uncharacterized protein n=1 Tax=Algorip... 54 5e-06 UniRef50_Q028C4 Putative uncharacterized protein n=1 Tax=Candida... 49 2e-04 UniRef50_UPI00016C48C1 hypothetical protein GobsU_04989 n=1 Tax=... 48 3e-04 UniRef50_P59026 Phospholipase C n=6 Tax=Clostridium RepID=PHLC_C... 41 0.060 >UniRef50_D1HBQ0 Whole genome shotgun sequence of line PN40024, scaffold_301.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1HBQ0_VITVI Length = 332 Score = 316 bits (810), Expect = 4e-85, Method: Composition-based stats. Identities = 146/272 (53%), Positives = 199/272 (73%), Gaps = 3/272 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W KEGH C+IA+G L+++A AVK LLP+Y GDL+A+C W D++RH + ++W+ PLH Sbjct: 25 WGKEGHYAVCKIAEGFLSEDALGAVKALLPDYAEGDLAAVCSWADEIRHNFHWRWSGPLH 84 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH-YREGTSDRRYNMTEALLF 119 ++DTPD CN++Y RDCHD G KD+CV GAI N+T QL+ Y S+ RYN+TEAL+F Sbjct: 85 YVDTPDYRCNYEYCRDCHDFRGHKDICVTGAIYNYTKQLTSGYHNSGSEIRYNLTEALMF 144 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSHF+GD+HQP+HVGFT D GGN+I +RW+R K+NLHH+WD II +A K YY D+ ++ Sbjct: 145 LSHFIGDVHQPLHVGFTGDEGGNTIIVRWYRRKTNLHHIWDNMIIDSALKTYYNSDLAIM 204 Query: 180 EEDIEGNFTDGIWSDDLASWREC-GNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + I+ N T WS D++SW+ C + +C N +A+ESI++ACK+ Y+ G TL DDY Sbjct: 205 IQAIQRNITGD-WSFDISSWKNCASDDTACPNLYASESISLACKFAYRNATPGSTLGDDY 263 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 F SRLPIV KR+AQGGIRLA LN +F + + Sbjct: 264 FLSRLPIVEKRLAQGGIRLAATLNRIFASQPK 295 >UniRef50_Q9SXA6 Bifunctional nuclease bfn1 n=20 Tax=Magnoliophyta RepID=Q9SXA6_ARATH Length = 305 Score = 310 bits (794), Expect = 3e-83, Method: Composition-based stats. Identities = 201/277 (72%), Positives = 241/277 (87%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WSKEGH++TCRIAQ LL AH V+ LLP+YV GDLSALCVWPDQ+RHWYKY+WTS LH Sbjct: 29 WSKEGHILTCRIAQNLLEAGPAHVVENLLPDYVKGDLSALCVWPDQIRHWYKYRWTSHLH 88 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +IDTPD+AC+++Y RDCHDQHG+KDMCV GAIQNFT+QL HY EGTSDRRYNMTEALLFL Sbjct: 89 YIDTPDQACSYEYSRDCHDQHGLKDMCVDGAIQNFTSQLQHYGEGTSDRRYNMTEALLFL 148 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 SHFMGDIHQPMHVGFTSD GGN+IDLRW++HKSNLHHVWDREIILTA K+ Y K+++LL+ Sbjct: 149 SHFMGDIHQPMHVGFTSDEGGNTIDLRWYKHKSNLHHVWDREIILTALKENYDKNLDLLQ 208 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 ED+E N T+G+W DDL+SW EC ++ +C +K+A+ESI +ACKWGYKGV++GETLS++YFN Sbjct: 209 EDLEKNITNGLWHDDLSSWTECNDLIACPHKYASESIKLACKWGYKGVKSGETLSEEYFN 268 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 +RLPIVMKR+ QGG+RLAM+LN VF V AT Sbjct: 269 TRLPIVMKRIVQGGVRLAMILNRVFSDDHAIAGVAAT 305 >UniRef50_B9HYZ1 Predicted protein n=20 Tax=Spermatophyta RepID=B9HYZ1_POPTR Length = 297 Score = 300 bits (767), Expect = 4e-80, Method: Composition-based stats. Identities = 145/269 (53%), Positives = 185/269 (68%), Gaps = 5/269 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W KEGH TC+IA+G L EA AVK LLPE GDL+ +C WPD++R + Y W+S LH Sbjct: 25 WGKEGHYATCKIAEGYLTAEALAAVKELLPESAEGDLANVCSWPDEIR--FHYHWSSALH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH-YREGTSDRRYNMTEALLF 119 ++DTPD CN++Y RDCHD G KD CV GAI N+T QL Y+ S+ YN+TEAL+F Sbjct: 83 YVDTPDFRCNYEYFRDCHDSSGRKDRCVTGAIYNYTNQLLSLYQNSNSESNYNLTEALMF 142 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSHF+GD+HQP+HVGF D GGN+I + W+R KSNLHHVWD II +A K +Y+ D+ + Sbjct: 143 LSHFIGDVHQPLHVGFLGDLGGNTIQVHWYRRKSNLHHVWDNMIIESALKTFYSSDLATM 202 Query: 180 EEDIEGNFTDGIWSDDLASWREC-GNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 I+ N T+ WS+ W C N C N +A+ESI++ACK+ YK G TL DDY Sbjct: 203 IRAIQNNITEN-WSNQQPLWEHCAHNHTVCPNPYASESISLACKFAYKNASPGSTLEDDY 261 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 F SRLP+V KR+AQGGIRLA LN +F + Sbjct: 262 FLSRLPVVEKRLAQGGIRLAATLNRIFAS 290 >UniRef50_Q9LGA5 Os01g0128200 protein n=12 Tax=Magnoliophyta RepID=Q9LGA5_ORYSJ Length = 308 Score = 297 bits (761), Expect = 2e-79, Method: Composition-based stats. Identities = 140/276 (50%), Positives = 193/276 (69%), Gaps = 7/276 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K+GH++ C+IA+ L+++AA AV+ LLPE G+LS +C W D+VR + Y W+ PLH Sbjct: 34 WGKQGHIIVCKIAEKYLSEKAAAAVEELLPESAGGELSTVCPWADEVR--FHYYWSRPLH 91 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 + +TP CNF Y RDCH+ + MCV GAI N+T QL Y G S YN+TE+L+FL Sbjct: 92 YANTPQ-VCNFKYSRDCHNSRHQQGMCVVGAINNYTDQL--YSYGDSKSSYNLTESLMFL 148 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 +HF+GD+HQP+HVGF D GGN+I + W+R K NLHHVWD II TA KD+Y + ++ + Sbjct: 149 AHFVGDVHQPLHVGFEEDEGGNTIKVHWYRRKENLHHVWDNSIIETAMKDFYNRSLDTMV 208 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVF-SCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 E ++ N TDG WS+D++ W CGN +C N +A ESI+++C + YK VE TL DDYF Sbjct: 209 EALKMNLTDG-WSEDISHWENCGNKKETCANDYAIESIHLSCNYAYKDVEQDITLGDDYF 267 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVV 275 SR PIV KR+AQ GIRLA++LN +FG + + +V+ Sbjct: 268 YSRYPIVEKRLAQAGIRLALILNRIFGEDKPDGNVI 303 >UniRef50_Q8LA68 Endonuclease, putative n=13 Tax=Embryophyta RepID=Q8LA68_ARATH Length = 296 Score = 296 bits (757), Expect = 6e-79, Method: Composition-based stats. Identities = 130/273 (47%), Positives = 184/273 (67%), Gaps = 4/273 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY-VNGDLSALCVWPDQVRHWYKYKWTSPL 59 W K+GH C++A+G D+ AVK LLPE G L+ C WPD+++ +++WTS L Sbjct: 21 WGKDGHYTVCKLAEGFFEDDTIAAVKKLLPESVDGGGLADFCSWPDEIKKLSQWQWTSTL 80 Query: 60 HFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR-RYNMTEALL 118 H+++TP+ CN++Y RDCHD H +D CV GAI N+T QL E + + YN+TEALL Sbjct: 81 HYVNTPEYRCNYEYCRDCHDTHKHRDWCVTGAIFNYTNQLMSASENSQNIVHYNLTEALL 140 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 FLSH+MGD+HQP+H GF D GGN+I + W+ +KSNLHHVWD II +A + YY + Sbjct: 141 FLSHYMGDVHQPLHTGFLGDLGGNTIIVNWYHNKSNLHHVWDNMIIDSALETYYNSSLPH 200 Query: 179 LEEDIEGNFTDGIWSDDLASWRECG-NVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 + + ++ +G WS+D+ SW+ C + +C N +A+ESI++ACK+ Y+ G TL D+ Sbjct: 201 MIQALQAKLKNG-WSNDVPSWKSCHFHQKACPNLYASESIDLACKYAYRNATPGTTLGDE 259 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 YF SRLP+V KR+AQGGIRLA LN +F A + Sbjct: 260 YFLSRLPVVEKRLAQGGIRLAATLNRIFSAKPK 292 >UniRef50_C3VEY2 Bifunctional nuclease n=2 Tax=rosids RepID=C3VEY2_CUCSA Length = 311 Score = 291 bits (744), Expect = 2e-77, Method: Composition-based stats. Identities = 161/258 (62%), Positives = 199/258 (77%), Gaps = 1/258 (0%) Query: 15 GLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDYE 74 LL EAA AV+ LLPE G+LSA+CVWPDQ+R KY+W SPLH+ +TP +C+F Y+ Sbjct: 50 ELLIPEAAEAVQDLLPESAGGNLSAMCVWPDQIRLQSKYRWASPLHYANTP-DSCSFVYK 108 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 RDCH+ G DMCVAGAI+NFTTQL+ YR D +N+TEALLFLSHF+GDIHQP+HVG Sbjct: 109 RDCHNDAGQPDMCVAGAIRNFTTQLTTYRTQGFDSPHNLTEALLFLSHFVGDIHQPLHVG 168 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 F SDAGGN+I++RWFR KSNLHHVWDR+IIL A DYY KD LL +++ N T GIWS+ Sbjct: 169 FESDAGGNTIEVRWFRRKSNLHHVWDRDIILEALGDYYDKDGGLLLDELNRNLTQGIWSN 228 Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 D++ W C V SCVN++A ES +ACKW Y+GVEAG TLS++Y++SRLPIVM+R+AQGG Sbjct: 229 DVSEWERCSTVNSCVNRWADESTGLACKWAYEGVEAGITLSEEYYDSRLPIVMERLAQGG 288 Query: 255 IRLAMLLNNVFGASQQED 272 +RLAMLLN VF Sbjct: 289 VRLAMLLNRVFAEDATRG 306 >UniRef50_A5A339 Endonuclease n=1 Tax=Glycine max RepID=A5A339_SOYBN Length = 297 Score = 273 bits (698), Expect = 4e-72, Method: Composition-based stats. Identities = 128/270 (47%), Positives = 168/270 (62%), Gaps = 6/270 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GHV+ C+IAQ L++ AA AVK LLP DLS C W D V H Y W S LH Sbjct: 27 WGDDGHVIVCKIAQARLSEAAAEAVKKLLPISAGNDLSTKCSWADHVHHI--YPWASALH 84 Query: 61 FIDTPDKACNFDYERDCHD-QHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 + +TP+ C++ RDC D + G+K CV AI N+TTQL Y + RYN+T++L F Sbjct: 85 YANTPEALCSYKNSRDCVDYKKGIKGRCVVAAINNYTTQLLEYG-SDTKSRYNLTQSLFF 143 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 SHFMGDIHQP+H GF SD GGN+I +RW++ K NLHH+WD I+LT +Y D++ Sbjct: 144 PSHFMGDIHQPLHCGFLSDNGGNAITVRWYKRKQNLHHIWDSTILLTEVDKFYDSDMDEF 203 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNV-FSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + ++ N T +W+D + W CG+ C +A+ES ACKW YK G L+DDY Sbjct: 204 IDALQQNITK-VWADQVEEWENCGDKDLPCPATYASESTIDACKWAYKDATEGSVLNDDY 262 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 F SRLPIV R+AQ G+RLA +LN VF Sbjct: 263 FLSRLPIVNMRLAQAGVRLAAILNRVFEKK 292 >UniRef50_A9U2Y4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U2Y4_PHYPA Length = 284 Score = 255 bits (652), Expect = 9e-67, Method: Composition-based stats. Identities = 117/272 (43%), Positives = 167/272 (61%), Gaps = 11/272 (4%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH +TC IA+ LL + A+ LLP+ NG+L+ LC WPD VR KYKWT LH Sbjct: 23 WGADGHRVTCLIAEPLLYEPTKQAIAALLPKSANGNLADLCTWPDDVRWMDKYKWTRELH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++TP+ C +DY RDCHD G ++C++GAI NFT L + T +R +L Sbjct: 83 WVNTPNHVCKYDYNRDCHDHMGTPNVCISGAINNFTHILWN---HTRNRNMKNGRGILLC 139 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 ++P+H GF SD GGN+I + W+ +S+LHHVWD EI+ A K+ + D ++ Sbjct: 140 C------YEPLHTGFRSDQGGNNISVYWYHRRSDLHHVWDTEIVSKALKENHNSDPEIMA 193 Query: 181 EDIEGNFTDGIWSDDLASWRECGN-VFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 + I N TD W+ ++ +W C N SC + +ATESIN+ACKW Y G G L D+Y+ Sbjct: 194 DSILNNATDN-WASEVDAWGICHNRKLSCPDTYATESINLACKWAYSGAAPGTALGDEYY 252 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 SRLP V R+AQGG+RLA +LN++F + + Sbjct: 253 TSRLPTVELRLAQGGVRLAAILNSIFDPNAPQ 284 >UniRef50_B6H0E5 Pc12g06500 protein n=2 Tax=Penicillium RepID=B6H0E5_PENCW Length = 344 Score = 242 bits (618), Expect = 8e-63, Method: Composition-based stats. Identities = 81/281 (28%), Positives = 127/281 (45%), Gaps = 19/281 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +AQ ++ EAA + +L + + L+ + W D+ R KW++PLH Sbjct: 21 WGALGHATVAYVAQHYISSEAASWAQGILNDTSSSYLANVASWADKYRLTDDGKWSAPLH 80 Query: 61 FIDT---PDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +ID P K+CN DYERDC D+ C A+ N+T++ R T EAL Sbjct: 81 YIDAMDDPPKSCNVDYERDCGDE-----GCSVSAVANYTSRAGDGRLSTDHT----AEAL 131 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL HF+GDI QP+H + GGN ID+ + + NLH WD + D Sbjct: 132 RFLVHFIGDITQPLH-DENYEVGGNGIDVTFDGYDDNLHSDWDTYMPGKLVGGSSLTDAQ 190 Query: 178 LLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKGVEAG--- 231 + + G + + SW E + + ++A+++ C A Sbjct: 191 GWADSLVDEINSGTYKEQAKSWIEGDTISDAVTTATRWASDANAFVCTVVMPDGAAALQT 250 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 L Y+NS + + +VA+GG RLA +N ++ +D Sbjct: 251 GDLYPTYYNSAIGTIEMQVAKGGYRLANWINLIYEQKVAKD 291 >UniRef50_B8MCF5 Nuclease PA3, putative n=2 Tax=Leotiomyceta RepID=B8MCF5_TALSN Length = 363 Score = 242 bits (617), Expect = 9e-63, Method: Composition-based stats. Identities = 84/281 (29%), Positives = 127/281 (45%), Gaps = 20/281 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ L+D A K +L + + L+ + W D R KW++PLH Sbjct: 47 WGTLGHATVAYIAQNYLDDATATWAKGVLGDTSDSYLANIASWADSYRSTSAGKWSAPLH 106 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FI D+P +CN DYERDC C AI N+T ++ R ++ EAL Sbjct: 107 FIDAEDSPPTSCNVDYERDCGS-----SGCSVSAIANYTQRVGDGRLSKANT----AEAL 157 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAKDYYAKDI 176 FL HF+GD+ QP+H D GGN I + + + S NLH WD I D Sbjct: 158 KFLVHFLGDVTQPLH-DEALDRGGNEITVTFDGYDSDNLHSDWDTYIPQKLVGGSTLSDA 216 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNV---FSCVNKFATESINIACKWGYKGVEAG-- 231 ++ G + A+W + ++ + +A+++ C A Sbjct: 217 QTWANELISQIDSGSYKSVAANWIKGDDISDPITSATTWASDANAFVCSVVMPNGVAALQ 276 Query: 232 -ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 L DY+NS +P + ++A+GG RLA LN+++ A + Sbjct: 277 QGDLYPDYYNSVIPTIELQIAKGGYRLANWLNSIYSAHIAK 317 >UniRef50_B0DXE1 Predicted protein n=4 Tax=Agaricales RepID=B0DXE1_LACBS Length = 317 Score = 238 bits (606), Expect = 2e-61, Method: Composition-based stats. Identities = 82/305 (26%), Positives = 120/305 (39%), Gaps = 48/305 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW--TSP 58 W +GH+ A L A V+ L + L W D VR Y W ++P Sbjct: 20 WGADGHMAVGYTAMQFLAPNALSFVQNSLGSSYSRSLGPAATWADTVRSQAAYSWCASAP 79 Query: 59 LHFID---TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 HF+D P +C+ RDC C+ AI N+TT++ + R+ E Sbjct: 80 FHFVDAEDNPPTSCSVSETRDCGS-----GNCILTAIANYTTRVVQTSLSATQRQ----E 130 Query: 116 ALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKD 175 AL FL HF+GDI QP+HV GGN I ++ +NLH +WD II K Y Sbjct: 131 ALKFLDHFLGDITQPLHV-EALKVGGNDITVKCNGSSTNLHALWDTGIIEGFLKAQYGNS 189 Query: 176 INLLEEDIEGNFTDGIWSDDLASWRECGNV----------------------------FS 207 + + G ++ ASW C + Sbjct: 190 VTTWANSLATRIKTGNFASSKASWIACSDPSAPLSQKRSIQDDIDEFLAARSTAAITPLK 249 Query: 208 CVNKFATESINIACKWGYKGVEAGETL----SDDYFNSRLPIVMKRVAQGGIRLAMLLNN 263 C +A +S C + + G G+ L + Y PI+ +++A+G RLA LN Sbjct: 250 CPLVWAQDSNTFDCSYVF-GFTTGKDLCSGGTSSYAAGAQPIIEEQIAKGAYRLAAWLNV 308 Query: 264 VFGAS 268 +F S Sbjct: 309 LFDGS 313 >UniRef50_D1Z5H6 Whole genome shotgun sequence assembly, scaffold_4 n=10 Tax=Sordariomycetes RepID=D1Z5H6_SORMA Length = 336 Score = 236 bits (602), Expect = 5e-61, Method: Composition-based stats. Identities = 76/290 (26%), Positives = 124/290 (42%), Gaps = 26/290 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH+ +A +++ A + LL L+ + W D +R+ +WT PLH Sbjct: 21 WGGFGHITVAYLASNFVSNTTAAYFQTLLRNDTTDYLANVATWADSIRYTKWGRWTGPLH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D+P +C YERDC + CV AIQN+T+++ +R +A Sbjct: 81 YIDAKDSPPHSCGIVYERDCK-----PEGCVVSAIQNYTSRVLDQSLHVVER----AQAA 131 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILT-------AAKD 170 F+ HF+GDIHQP+H + GGN I + + + NLHHVWD I Sbjct: 132 KFVIHFVGDIHQPLHTEDV-EKGGNGISVFFDDKRFNLHHVWDSSIAEKIVTHKKHGVGR 190 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVN---KFATESINIACKWGYKG 227 E + +G + + + W + + S ++A E C Sbjct: 191 RPFPAAKKWAEQLAEEIREGQYKANSSEWVKGLELKSASEIALEWAVEGNAHVCTVVLPE 250 Query: 228 VE---AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 + L YF + P+V ++A+ G RLA L+ V A + +++ Sbjct: 251 GPEAIRDQELGGAYFEAAAPVVELQIAKAGYRLAAWLDLVVTAISKNETI 300 >UniRef50_P24021 Nuclease S1 n=6 Tax=Leotiomyceta RepID=NUS1_ASPOR Length = 287 Score = 234 bits (597), Expect = 2e-60, Method: Composition-based stats. Identities = 73/276 (26%), Positives = 115/276 (41%), Gaps = 20/276 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ + + +L + L+ + W D ++ +++ P H Sbjct: 21 WGNLGHETVAYIAQSFVASSTESFCQNILGDDSTSYLANVATWADTYKYTDAGEFSKPYH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FI D P ++C DY+RDC C AIQN+T L G+ AL Sbjct: 81 FIDAQDNPPQSCGVDYDRDCGSA-----GCSISAIQNYTNILLESPNGSEALN-----AL 130 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F+ H +GDIHQP+H +AGGN ID+ + +NLHH+WD + AA Y Sbjct: 131 KFVVHIIGDIHQPLH-DENLEAGGNGIDVTYDGETTNLHHIWDTNMPEEAAGGYSLSVAK 189 Query: 178 LLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKGVE---AG 231 + + G +S SW + + S +A ++ C Sbjct: 190 TYADLLTERIKTGTYSSKKDSWTDGIDIKDPVSTSMIWAADANTYVCSTVLDDGLAYINS 249 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 LS +Y++ P+ + +A+ G RLA L+ + Sbjct: 250 TDLSGEYYDKSQPVFEELIAKAGYRLAAWLDLIASQ 285 >UniRef50_A0M3W8 S1/P1 endonuclease family protein n=6 Tax=Bacteroidetes RepID=A0M3W8_GRAFK Length = 260 Score = 233 bits (593), Expect = 6e-60, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 121/266 (45%), Gaps = 29/266 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K GH T IA+ L+++A +A+ LL + L+ + + D ++ +Y+ P H Sbjct: 24 WGKTGHRATAEIAETHLSNKAKNAIDGLLGGHG---LAFVANYADDIKSDPEYREFGPWH 80 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + ++ K + AI+ L +++ L L Sbjct: 81 YVNIDPENKKY------IEEEANKSGDLVQAIKKCVEVLKDQNSSRDEKQ----FYLKML 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP H G D GGN I +RWF SN+H VWD ++I Y +N Sbjct: 131 VHFVGDLHQPFHTGHAEDKGGNDIQVRWFNEGSNIHRVWDSDMINFYQMSYTELALN--T 188 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 +D+ N I L W ES +A Y GV+ GE L Y Sbjct: 189 KDLSKNQIKAIEKGKLLDWVY-------------ESRAMAEDL-YTGVDNGEKLGYSYMY 234 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +P V++++ +GGIRLA +LN+++ Sbjct: 235 KNMPTVLEQLQKGGIRLAKILNDIYS 260 >UniRef50_C5K479 Nuclease PA3, putative n=5 Tax=Perkinsus marinus ATCC 50983 RepID=C5K479_9ALVE Length = 337 Score = 233 bits (593), Expect = 6e-60, Method: Composition-based stats. Identities = 90/291 (30%), Positives = 147/291 (50%), Gaps = 32/291 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY---KYKWTS 57 W +GH + ++ Q + E A+ ++ + V +S W D+V++ ++KW+S Sbjct: 19 WGHDGHAVVAQLGQERIKKETQEALDAIMGKGVP--MSNYSSWADEVKYGPDGNEWKWSS 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 LH+ DTPD C+FDY RDC + D CVAGA++N++ ++ R+ EAL Sbjct: 77 SLHYADTPD--CHFDYARDCKN-----DYCVAGALKNYSRRVVDESLPLEQRQ----EAL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAAKDYYAK 174 F+ HF+GD HQP+H+G D GGN I + + +NLH WD ++I Sbjct: 126 KFIVHFVGDAHQPLHIGKPEDLGGNKIAVHLGFGEKPSTNLHSTWDSKLIYELEDQSDPI 185 Query: 175 DINLL----EEDIEGNF-TDGIWSDDLASWRECGNVF---SCVNKFATESINIACKWGYK 226 D E+ + G ++D++ W E + CV+ + +ES AC + Y+ Sbjct: 186 DGEPSWMITEDAVSDELDKGGKYADEIDDWIEDCEKYGLDVCVDSWLSESSKTACDYSYR 245 Query: 227 GVEAG-----ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 V + L DY+N+R+ +V +++A+GG+RL LLN VF A Sbjct: 246 HVNGSLIVDHDFLPMDYYNNRIEVVKEQLAKGGVRLTWLLNTVFAAQDATP 296 >UniRef50_O65424 Putative bifunctional nuclease n=2 Tax=Arabidopsis thaliana RepID=O65424_ARATH Length = 362 Score = 229 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 108/258 (41%), Positives = 153/258 (59%), Gaps = 38/258 (14%) Query: 14 QGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDY 73 + ++ AVK LLPE NG+L+A+C WPD+++ +++WTS LHF DTPD CN++Y Sbjct: 138 KSYFEEDTVVAVKKLLPESANGELAAVCSWPDEIKKLPQWRWTSALHFADTPDYKCNYEY 197 Query: 74 ERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV 133 +N+TEAL+FLSH+MGDIHQP+H Sbjct: 198 ------------------------------------SHNLTEALMFLSHYMGDIHQPLHE 221 Query: 134 GFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWS 193 GF D GGN I + W+ ++NLH VWD II +A + YY + + +++ +G WS Sbjct: 222 GFIGDLGGNKIKVHWYNQETNLHRVWDDMIIESALETYYNSSLPRMIHELQAKLKNG-WS 280 Query: 194 DDLASWRECG-NVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 +D+ SW C N +C N +A+ESI++ACK+ Y+ AG TL D YF SRLP+V KR+AQ Sbjct: 281 NDVPSWESCQLNQTACPNPYASESIDLACKYAYRNATAGTTLGDYYFVSRLPVVEKRLAQ 340 Query: 253 GGIRLAMLLNNVFGASQQ 270 GGIRLA LN +F A ++ Sbjct: 341 GGIRLAGTLNRIFSAKRK 358 Score = 93.7 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 59/175 (33%), Positives = 77/175 (44%), Gaps = 35/175 (20%) Query: 97 TQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLH 156 +S + YN+TEAL+FLSHF+GDIHQP+HVGF D GGN+I +RW+R K+NLH Sbjct: 2 QLMSASENSDTIVHYNLTEALMFLSHFIGDIHQPLHVGFLGDEGGNTITVRWYRRKTNLH 61 Query: 157 H----------------------------VWDREIILTAAKDYYAKDINLLEEDIEGNFT 188 H VWD II +A K YY K + L+ E ++ N T Sbjct: 62 HVSVCYRMLKEKVIFPDWINYSYDLPMMKVWDNMIIESALKTYYNKSLPLMIEALQANLT 121 Query: 189 DGIWSDDLASWRE-------CGNVFSCVNKFATESINIACKWGYKGVEAGETLSD 236 I S WR + V K ES N + + L Sbjct: 122 MTISSLGYPLWRRDLRKSYFEEDTVVAVKKLLPESANGELAAVCSWPDEIKKLPQ 176 >UniRef50_A7ETG5 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7ETG5_SCLS1 Length = 283 Score = 227 bits (578), Expect = 4e-58, Method: Composition-based stats. Identities = 76/278 (27%), Positives = 114/278 (41%), Gaps = 23/278 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A + + +MLL L+ + W D R + Sbjct: 21 WGTLGHQTVAYVATNFVAESTRDYFQMLLRNDTGSYLAGVATWADSYRLAALLRLFQR-- 78 Query: 61 FIDTPDKA-CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 F +T A C + RDC ++ CV GAI NFT+QL + RY+ A F Sbjct: 79 FFNTEINAACGVKFARDCGEE-----GCVVGAILNFTSQLLDP----NVSRYHKYIAAKF 129 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 +GDIHQP+H + GGN+I + + ++NLH WD I Y D Sbjct: 130 ----VGDIHQPLHA-ENINIGGNTIKVTFNGKETNLHSFWDTAIPEELVGGYSMADAQEW 184 Query: 180 EEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKG---VEAGET 233 + GI+ SW E G+ + +A +S C V G+ Sbjct: 185 ANVLTTAIKTGIYKSQAKSWLEDMNIGDPLTTALGWAKDSNAFICTTVIPDGAEVLQGKE 244 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 LS +Y+ S +P+V +VA+ G RLA L+ + + E Sbjct: 245 LSGEYYESGIPVVELQVARAGYRLAAWLDMIVRGIKTE 282 >UniRef50_Q0CD39 Predicted protein n=2 Tax=Aspergillus RepID=Q0CD39_ASPTN Length = 300 Score = 225 bits (574), Expect = 9e-58, Method: Composition-based stats. Identities = 72/287 (25%), Positives = 127/287 (44%), Gaps = 26/287 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A+ L ++ + + LLP N D+S W D+ + +Y T P H Sbjct: 21 WGDVGHRTVAYVAENYLTEDGSKFLDNLLPFSNNFDISDAATWADEQKR--RYPKTKPWH 78 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++D D ++ D C+ A++ T+Q+S Y +N TEA+LFL Sbjct: 79 YVDIKDDP--VHHKCDISSLDCPNGDCIISAMEAMTSQVSEYS-------FNRTEAVLFL 129 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA--------AKDYY 172 HF GD+H P+HV GGN ID+ + NLH +WD ++ D Sbjct: 130 VHFFGDLHMPLHV-EGLCRGGNEIDVSFNGRNDNLHSIWDTDMPHKINGIKHSLKHNDEK 188 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK---GVE 229 + ++ I+ N + + C ++ATES ++ C +K Sbjct: 189 TASLKWAKDLIQKNLHR---PATVTECNDVTQPQKCFKQWATESNHLNCAVVFKRGLQYL 245 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVA 276 + L+ DY+ +P++ +++ + G+RLA +N++ + + VA Sbjct: 246 TTQDLAGDYYEDAVPVIEEQIFKAGVRLATWINSIAEKQHAKAAFVA 292 >UniRef50_C5K482 Nuclease PA3, putative n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5K482_9ALVE Length = 328 Score = 224 bits (570), Expect = 3e-57, Method: Composition-based stats. Identities = 86/283 (30%), Positives = 145/283 (51%), Gaps = 32/283 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY---KYKWTS 57 W +GH + ++ Q +N E A+ ++ + V + W D V++ ++KW+S Sbjct: 19 WGHDGHAVVAQLGQERINKETQEAIDAIMGKGVP--MYNYSSWADDVKYGPDGNEWKWSS 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLH+ DTPD C+FDY RDC + D CVAGA++N++ ++ R+ EAL Sbjct: 77 PLHYADTPD--CHFDYARDCKN-----DYCVAGALKNYSRRVVDESLPLEQRQ----EAL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREII-LTAAKDYYA 173 F+ HF+GD HQP+H G D GGN ID+ +NLH WD ++ + + A Sbjct: 126 KFIVHFVGDAHQPLHAGNPKDRGGNKIDVSLGFARHQHTNLHSTWDSALLYEFQGRGHRA 185 Query: 174 KDINLL---EEDIEGNF-TDGIWSDDLASWRECGNVF---SCVNKFATESINIACKWGYK 226 + E+ I+ G ++ D+ W E + +C+ K+ E+ AC++ YK Sbjct: 186 RGAPYWTVTEDAIDDELDKGGRYAGDVDDWVEDCEKYGYDACIEKWVDETAKAACEYSYK 245 Query: 227 GVEAGETLSDD-----YFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + + +D Y++ R+ + +++A+ GIRL LLNN+ Sbjct: 246 HMNGSRVVDNDYLPMKYYDGRIEVAKEQLAKAGIRLTWLLNNL 288 >UniRef50_B7FP92 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FP92_PHATR Length = 308 Score = 223 bits (568), Expect = 5e-57, Method: Composition-based stats. Identities = 96/308 (31%), Positives = 148/308 (48%), Gaps = 43/308 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------LSALCVWPDQVRHWYKY 53 W KEGH + +A LL++++ AV+ +L + D L + W D VR ++Y Sbjct: 6 WGKEGHEVVGNLAWKLLSEQSQSAVRNILQDVPIPDNCTACSPLGQVADWADTVRRTHEY 65 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTS---DRR 110 W+ PLH++D C F+YERDC + D+CVAGA+ N+T L +R + Sbjct: 66 FWSGPLHYVDISQDECRFEYERDCAN-----DICVAGAVVNYTRHLQKFRRDETREYGDE 120 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRW---------------------F 149 + ++L+FL+HF+GD+HQP+HV +SD GGNSI + + Sbjct: 121 LLVRDSLMFLTHFVGDLHQPLHVSRSSDRGGNSIHVVYSPGNADTAPKDGRLGYLRAGRH 180 Query: 150 RHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGN--VFS 207 H NLH VWD II T K Y + L E+ + + + W C N + Sbjct: 181 HHVDNLHAVWDTGIIETCVKLNYKESRVLWEKVLYERIIQAQGTGEWDVWTSCPNGAQQT 240 Query: 208 CVNKFATESINIACKWGYKGVEA-----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLN 262 CV++++ +S+ A W Y+ V+ G LS Y+ +RLP V ++ RLA L Sbjct: 241 CVSEWSEQSLEYALIWAYRNVDGTAIGDGTHLSHAYYETRLPFVEHQLTVAAARLATTLE 300 Query: 263 NVFGASQQ 270 F + Sbjct: 301 ISFTQNVA 308 >UniRef50_Q7S8Q5 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S8Q5_NEUCR Length = 306 Score = 223 bits (567), Expect = 6e-57, Method: Composition-based stats. Identities = 68/290 (23%), Positives = 115/290 (39%), Gaps = 32/290 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH-WYKYKWTSPL 59 W K GH +AQ L V+ +L + + + W D R+ W++ L Sbjct: 20 WGKLGHATVASVAQQYLTPNTVKQVQTILGDNSTSYMGNIASWADSFRYESAANAWSAGL 79 Query: 60 HFID----TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 HF++ P ++C+ DC + CV AI N+T ++ + + Sbjct: 80 HFVNGHDGPPPESCHLVLPEDCP-----PEGCVVSAIGNYTERVQMKNITADQK----AQ 130 Query: 116 ALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK------ 169 AL F+ HF+GDI QP+H + G N+I + + +K+NLH WD I Sbjct: 131 ALKFIVHFLGDIAQPLHTEGFGE-GANNITVTFQGYKTNLHAAWDTSIPNAMLGISPPTS 189 Query: 170 --DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS------CVNKFATESINIAC 221 + + D ++ G + D+ W +V + +A + C Sbjct: 190 AANITSADFLGWANNLAAKINQGQYRKDVRRWLRYHSVATRKASERAAAAWAQDGNEEVC 249 Query: 222 KWGYK---GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + K G + DY+ +V + + +GGIRLA LN +F Sbjct: 250 HYVMKVPGNQLNGTEIGGDYYKGATEVVERSIIKGGIRLAGWLNLIFDNR 299 >UniRef50_B0MYD6 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MYD6_9BACT Length = 257 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. Identities = 73/266 (27%), Positives = 109/266 (40%), Gaps = 29/266 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH + IA+ L EAA + +L + W D H +Y +T+ H Sbjct: 21 WGPKGHDVVAYIAECNLTPEAAEKIDKILG---GASMVYWANWLDSASHTPEYAYTATWH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 + + + D + AI +L + + + L L Sbjct: 78 YANVDEGF-------TYETMTKNPDGDIVEAIDRIVAELKGGQLDPAQEQL----YLKML 126 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMH G SD GGNS+ +R+F +SNLH VWD + A K Y + N L Sbjct: 127 VHLVGDLHQPMHTGHLSDRGGNSVPVRFFGRESNLHAVWDSSLPEAAHKWSYTEWQNQL- 185 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + I S W E N C+ Y G LS DY Sbjct: 186 DRLTEEEVARIQSGTPLDWFEESNAI--------------CREIYVATPEGSDLSYDYIA 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 P++ +++ +GG RLA LLN ++G Sbjct: 232 KYAPVIERQLLRGGHRLAGLLNEIYG 257 >UniRef50_D1ZIR6 Whole genome shotgun sequence assembly, scaffold_39 n=1 Tax=Sordaria macrospora RepID=D1ZIR6_SORMA Length = 309 Score = 221 bits (563), Expect = 2e-56, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 116/294 (39%), Gaps = 36/294 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K GH +AQ L V+ +L + + + W D R+ W+S LH Sbjct: 19 WGKLGHATVASVAQQYLTPNTVKQVQAILGDKSTTYMGNIASWADSFRYEEGNAWSSGLH 78 Query: 61 FID----TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 F++ P ++C+ DC + CV AI N+T ++ + R T+A Sbjct: 79 FVNGHDAPPPESCHLILPEDCP-----PEGCVVSAIGNYTERVQNKELAAEQR----TQA 129 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK------- 169 L F+ HF+GDI QP+H + G N++ + + +K+NLH WD I T Sbjct: 130 LKFIIHFLGDIAQPLHTEAFGE-GANNVTVFFDGYKTNLHAAWDTSIPNTMLGISPPTSA 188 Query: 170 -DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC-------VNKFATESINIAC 221 + D ++ G + D+ W + + +A + C Sbjct: 189 ANITNADFLGWANNLAAKINQGSYRRDVRRWLRNHRLPANRKGAERAAAAWAQDGNEEVC 248 Query: 222 KWGYK---GVEAGETLSD----DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + K G + DY+ +V + + +GGIRLA LN +F Sbjct: 249 HYVMKIPGNQLNGTEIGAGAGGDYYKGAAEVVERSIIKGGIRLAGWLNLIFDKR 302 >UniRef50_A3XR21 Putative S1/P1 Nuclease n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XR21_9FLAO Length = 263 Score = 221 bits (563), Expect = 2e-56, Method: Composition-based stats. Identities = 65/266 (24%), Positives = 109/266 (40%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH T IA L A++ LL + L + + D+++ + +Y+ S H Sbjct: 28 WGSKGHRATAAIAVKYLKPRTKKAIEKLLG---DETLVTVSTYGDEIKSYEEYRKYSSWH 84 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + + I ++ ++R L L Sbjct: 85 YVNIAPGLS-------YAEADKNEYGDLVQGINTCKEVITSEDATIEEKR----FYLKML 133 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP+H+G D GGN +RWF + +NLH +WD ++I + Y N Sbjct: 134 VHFIGDLHQPLHLGHAEDKGGNDFQVRWFNNGTNLHSLWDSKLIESYGMSYSELATN--F 191 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + I DL W G + + + Y E GE LS Y Sbjct: 192 GQVSKKQFKEISKGDLMDWVSEGQILA--------------EKVYDSAEIGEKLSYRYQA 237 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +V +++ +GG+RLA LLN +F Sbjct: 238 DYNQMVQEQLQKGGVRLAALLNELFD 263 >UniRef50_Q2SFD4 Probable endonuclease n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFD4_HAHCH Length = 304 Score = 219 bits (557), Expect = 1e-55, Method: Composition-based stats. Identities = 67/271 (24%), Positives = 111/271 (40%), Gaps = 20/271 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH + C +A L+ A V+ LL + + C+WPDQVR ++K T H Sbjct: 50 WGELGHRVVCDVAWKELSPVARDQVQKLLQQAGKRTFAEACLWPDQVRSEKEFKHTGSYH 109 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ A +C + CV A+ + L + +AL+F+ Sbjct: 110 YVNVERAAKRVSTAENCESK-----GCVLTALNAYAEALKGE--PRQGYQATPAQALMFI 162 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GDIHQP+HV + D GGN + + ++NLH +WD I + + K + Sbjct: 163 GHFIGDIHQPLHVSYADDRGGNKVVYKVAGEETNLHRLWDVNIPESGLPRDWRKAGKKVR 222 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 G + + +A ES+ I K G S Sbjct: 223 GKHRGETVTALSLQE-------------AEAWANESLAITRKVYESLPPQGSEWSKKDLA 269 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 P+ R+ Q G+RL +LN + ++Q + Sbjct: 270 REYPVAEMRLYQAGVRLGAVLNQLLASNQDQ 300 >UniRef50_B8NJ54 Nuclease S1, putative n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8NJ54_ASPFN Length = 320 Score = 216 bits (549), Expect = 9e-55, Method: Composition-based stats. Identities = 71/305 (23%), Positives = 112/305 (36%), Gaps = 43/305 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ + + +L + L+ + W D ++ +++ P H Sbjct: 21 WGNLGHETVAYIAQSFVASPTESFCQDILGDDSTSYLANVATWADTYKYTDAGEFSKPYH 80 Query: 61 FID---TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLS----------------- 100 FID P ++C DY+RDC C AIQN+ + Sbjct: 81 FIDAQDNPPQSCGVDYDRDCGSA-----GCSISAIQNYVSYFRVYNNIGCSSYLDQYSPG 135 Query: 101 -----------HYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF 149 R S R +S +GD HQP+H +AGGN ID+ + Sbjct: 136 ISQWLGGVECPEIRGSCSSRPLTGLIRFPNMSQIIGDTHQPLH-DENLEAGGNGIDVTYD 194 Query: 150 RHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWREC---GNVF 206 +NLHH+WD + AA Y + + G +S SW E + Sbjct: 195 GETTNLHHIWDTNMPEEAAGGYSLSVAKTYADLLTERIKTGTYSSKKDSWTEGIDIKDPV 254 Query: 207 SCVNKFATESINIACKWGYKGVE---AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNN 263 S +A ++ C LS +Y++ P+ + +A+ G RLA L+ Sbjct: 255 STSMIWAADANTYVCSTVLDDGLAYINSTDLSGEYYDKSQPVFEELIAKAGYRLAAWLDL 314 Query: 264 VFGAS 268 + S Sbjct: 315 IASQS 319 >UniRef50_Q3IBZ8 Putative S1/P1 Nuclease n=2 Tax=Alteromonadales RepID=Q3IBZ8_PSEHT Length = 288 Score = 214 bits (544), Expect = 3e-54, Method: Composition-based stats. Identities = 74/271 (27%), Positives = 115/271 (42%), Gaps = 30/271 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W + GH + +IA+ L++ LLP N L+ + WPD++R W +S Sbjct: 27 WGQNGHRIIAKIAESHLSETTKT---KLLPLLNNESLAQVSTWPDEMRSAPGEFWQRKSS 83 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+T ++ V + I L + ++ +L Sbjct: 84 RWHYINTSANKPISLNHSHTKNKESVT--NILEGIHYSIKVLQDEQSSLDAKQ----FSL 137 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL H +GD HQP H G D GGN+I ++ F ++NLH +WD ++I Y Sbjct: 138 RFLVHLVGDSHQPFHAGRADDRGGNNIKVKHFGQETNLHSLWDSKLIEGENLSY------ 191 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 F D I +++ E + S + ES N+A K +S Sbjct: 192 -------TEFADFINTNNQTLISE--YLTSTPTSWLVESNNLAESIYNKNETN---ISYS 239 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 Y +PI+ R+ QGGIRLA LLN++F S Sbjct: 240 YIFDHMPIIKTRLQQGGIRLAGLLNSLFDES 270 >UniRef50_Q4DEV4 Class I nuclease-like protein, putative n=2 Tax=Trypanosoma cruzi RepID=Q4DEV4_TRYCR Length = 333 Score = 214 bits (544), Expect = 3e-54, Method: Composition-based stats. Identities = 63/284 (22%), Positives = 103/284 (36%), Gaps = 28/284 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDE-------AAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKY 53 W GH++ IA+ L+ E AA + P D W D ++ Sbjct: 28 WWCNGHMLVNEIARRRLHPEVALIVEEAAVNLSASGPFPHTTDFVESGCWADDIKKL-GL 86 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 H+IDTP N + +++ + +K L Y M Sbjct: 87 FVMEDWHYIDTPYNPQNINIKKNPVNTENLKT---------VIESLKRTLMKQDLVPYIM 137 Query: 114 TEALLFLSHFMGDIHQPMHVGF-------TSDAGGNSIDLRWFRHKSNLHHVWDREIILT 166 + A++ ++HF+GDIHQP+H D GGN+ + LH +WD Sbjct: 138 SFAIVNIAHFLGDIHQPLHAVELFSPEYPHGDRGGNAETVIVHGKMMALHSLWDSIC--Q 195 Query: 167 AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + ++ F D + D + + + A ES +IA + Y Sbjct: 196 GDVKNPRRPLDRWHYAKLREFADRLE--DTYKFPAEVKNETNTTQMAMESYDIAVQVAYP 253 Query: 227 GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 G G ++D+Y RV G RLA +LN + +Q+ Sbjct: 254 GFVDGAKITDEYLEKCRAAAESRVVLAGYRLANVLNQLLDKTQK 297 >UniRef50_C5KMC3 Nuclease PA3, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KMC3_9ALVE Length = 367 Score = 212 bits (540), Expect = 9e-54, Method: Composition-based stats. Identities = 81/291 (27%), Positives = 137/291 (47%), Gaps = 43/291 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH---WYKYKWTS 57 W +GH + +A ++ +A V ++ E L+ W D + + ++ W+ Sbjct: 19 WGPDGHAVVAELADTRMSSKARKWVYDIMGEGYR--LATSASWADSILYGNNSGEWSWSK 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLH+ + C F Y RDC + ++CVAGAI+N+T QL++ R+ +A+ Sbjct: 77 PLHYANV--DDCEFVYARDCPN-----NVCVAGAIKNYTAQLTNTSLTKEQRQ----DAV 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAAKDYY-- 172 FL HFMGD+H+P++ G +D GGN+I + K+NLH VW ++I + Y Sbjct: 126 KFLVHFMGDVHEPLNAGRYTDLGGNTISVAINFADYEKTNLHKVWGEKLIDEYEGELYPG 185 Query: 173 --------------AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF---SCVNKFATE 215 +E G + G ++ + SW+ CVN+ E Sbjct: 186 PYIQQDADYNKDRTQYWSVSADEIGRGLASGGKYAGKVPSWKSKCESLGIDVCVNEMVQE 245 Query: 216 SINIACKWGYKGVEAGETLSDD-----YFNSRLPIVMKRVAQGGIRLAMLL 261 S +AC Y V+ + +DD Y+ SR+ V +++A+G +RLA +L Sbjct: 246 SATLACNQAYVNVDGSQIGNDDGLLMGYYTSRIETVKEQLAKGAVRLAWVL 296 >UniRef50_Q0AMT2 S1/P1 nuclease n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMT2_MARMM Length = 299 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 87/283 (30%), Positives = 132/283 (46%), Gaps = 23/283 (8%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-LSALCVWPDQVRHWYKYKWTSPLH 60 +GH + C +A L+DE + L+ + D +C W D VR ++ T+P H Sbjct: 27 GPDGHRIVCDLAWRYLSDETRTEIDRLVAQDPEFDHFRDVCSWADDVRGS-THRHTAPWH 85 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +I+ + D E DC + D C+ AI DR EAL FL Sbjct: 86 YINQTRDDPHVDAE-DCAE-----DGCITSAIDLHAGIFVDRSRSDEDRL----EALKFL 135 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRH-KSNLHHVWDREIILTAAKDYYAKDINLL 179 +H+MGDIHQP+HV D GGN I++ W ++NLH VWD EI+L DY A+ + Sbjct: 136 AHWMGDIHQPLHVSIEGDRGGNDINVLWRGERRTNLHRVWDSEILL----DYMAETWPYI 191 Query: 180 EE-DIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK--WGYKGVEAGETL-- 234 ++ D D + +D + + +A ES +I + Y A E + Sbjct: 192 DDGDRWAQLADQLAADIPLNGISVYTPLA-PVDWAQESHDIVRSRGFAYYWARAEEMIEP 250 Query: 235 SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 D Y++ LP+ ++R+ QGG+RLA LLN + Q + T Sbjct: 251 GDAYYDRNLPVSLQRLKQGGVRLAGLLNQLVEERQLSGTGAVT 293 >UniRef50_B2W4S8 Nuclease PA3 n=2 Tax=Pleosporineae RepID=B2W4S8_PYRTR Length = 312 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 71/284 (25%), Positives = 115/284 (40%), Gaps = 22/284 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ + H +A+ + + +L NG + W D H + ++ H Sbjct: 19 WNTDVHNQIGFMAETFFTPQTTLILAKILEPKYNGSVGRAAAWADGYAHTSEGHFSYQWH 78 Query: 61 FIDTPDK---ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSD------RRY 111 +IDT D +C+ DY RDC K CV AI N T L D Sbjct: 79 WIDTHDNQPESCHLDYVRDCA-----KGGCVVSAIANQTGILRECITQVQDGKLAGGTNL 133 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILT---AA 168 + AL +++HF+GDIHQP+H + GGN+ + + H + LH VWD I A+ Sbjct: 134 TCSYALKWVAHFLGDIHQPLHASGRA-VGGNTYKVVFGNHSTQLHAVWDGFIPYYAAEAS 192 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGY 225 + + ++ D+ + W C C +A ES C + Y Sbjct: 193 HPFSNQSLDPFFADLVTRIRKDQFYSAPYMWLSCTNPSTPIDCATAWARESNKWDCDYVY 252 Query: 226 KGVEAGETL-SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 V+ L ++ Y +PIV ++++ +RL LN + S Sbjct: 253 SRVQNDTDLGTNGYAAGAVPIVELQISKAALRLGTWLNKLVEGS 296 >UniRef50_C7PH62 S1/P1 nuclease n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PH62_CHIPD Length = 266 Score = 211 bits (536), Expect = 2e-53, Method: Composition-based stats. Identities = 71/267 (26%), Positives = 109/267 (40%), Gaps = 27/267 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH--WYKYKWTSP 58 W GH + IA L +A A+ LL ++ + WPD ++ +KY TSP Sbjct: 24 WGVTGHRVVAEIASRHLTPQARKAIIALLGP---QSMAMVANWPDFIKSDTTHKYDHTSP 80 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H++D P + D + + + +L +D+ AL Sbjct: 81 WHYLDFPANVDRVHF--DEVLKEHTTGENLYAQTEALIKKLKDPATSKADK----VFALT 134 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 FL H +GD+HQP+H+G D GGN I + WF +SNLH VWD ++I Y L Sbjct: 135 FLIHMIGDMHQPLHIGRDEDQGGNKIPVMWFDKQSNLHRVWDEQLIEFQQLSYTEYTQAL 194 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + + S +A W N S Y A + LS Y Sbjct: 195 --DTASAAEVRKLQSGSIADWMYDSNQLS--------------NKVYALTHANDKLSYRY 238 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVF 265 + + ++ +GG+RLA LLN ++ Sbjct: 239 NYWFIADLNGQLLKGGLRLAALLNQIY 265 >UniRef50_A6EB04 Putative S1/P1 Nuclease n=1 Tax=Pedobacter sp. BAL39 RepID=A6EB04_9SPHI Length = 250 Score = 211 bits (536), Expect = 3e-53, Method: Composition-based stats. Identities = 67/265 (25%), Positives = 106/265 (40%), Gaps = 26/265 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + +IA+ L+ +A VK +L N L+ W D ++ Y + H Sbjct: 11 WGMLGHRIVGQIAEAHLSKKALKGVKGVLG---NETLAMASNWGDFIKSDTSYNYLYNWH 67 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F++ P D + + V++ V + L + A+ L Sbjct: 68 FVNLP---AGLDKQGVFNVLDKVQEPNVYNKVPEMVAILKDNNSSAEQK----VFAMRML 120 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD++QPMH D GGN + + WF KSNLH VWD +I Y Sbjct: 121 VHLIGDLNQPMHTARKDDLGGNKVAVTWFGEKSNLHRVWDEGLIEYQQLSYTE------- 173 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 D + LASW + S AC Y + + LS Y Sbjct: 174 ---YAKAIDYPSTAQLASWNGLSL-----RDYVYGSYE-ACNQIYAKTKGDDKLSYQYNF 224 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVF 265 + L ++ +++ +GGI LA +LN ++ Sbjct: 225 NFLKLLNEQLLKGGICLANVLNEIY 249 >UniRef50_P24504 Nuclease PA3 n=2 Tax=Penicillium RepID=NUP3_PENSQ Length = 270 Score = 211 bits (536), Expect = 3e-53, Method: Composition-based stats. Identities = 76/276 (27%), Positives = 124/276 (44%), Gaps = 19/276 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +AQ ++ EAA + +L + L+++ W D+ R KW++ LH Sbjct: 1 WGALGHATVAYVAQHYVSPEAASWAQGILGSSSSSYLASIASWADEYRLTSAGKWSASLH 60 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FI D P CN DYERDC C AI N+T ++S + N EAL Sbjct: 61 FIDAEDNPPTNCNVDYERDCGS-----SGCSISAIANYTQRVSDSSLSSE----NHAEAL 111 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL HF+GD+ QP+H GGN I++ + + NLH WD + + D Sbjct: 112 RFLVHFIGDMTQPLH-DEAYAVGGNKINVTFDGYHDNLHSDWDTYMPQKLIGGHALSDAE 170 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNV---FSCVNKFATESINIACKWGYKGVEAG--- 231 + + N G ++ W + N+ + ++A+++ + C A Sbjct: 171 SWAKTLVQNIESGNYTAQATGWIKGDNISEPITTATRWASDANALVCTVVMPHGAAALQT 230 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 L Y++S + + ++A+GG RLA +N + G+ Sbjct: 231 GDLYPTYYDSVIDTIELQIAKGGYRLANWINEIHGS 266 >UniRef50_C8WD33 S1/P1 nuclease n=5 Tax=Alphaproteobacteria RepID=C8WD33_ZYMMN Length = 319 Score = 210 bits (533), Expect = 6e-53, Method: Composition-based stats. Identities = 67/288 (23%), Positives = 108/288 (37%), Gaps = 38/288 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRHWYKYKWT 56 W EGH +A + V +L + D + W D+ R T Sbjct: 33 WGMEGHEAIAALAWKYMTPTTRKKVNAILAMDHDRLTEPDFMSRATWADKWRSAGHG-ET 91 Query: 57 SPLHFIDTPDK------ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRR 110 P HF+D AC R ++G CV + F +LS + DR Sbjct: 92 EPWHFVDIEIDNPNLVTACAAASNRSNPMKNGGAQPCVVSQLDRFERELSSKQTSDQDRV 151 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAK 169 AL ++ HF+GD+HQP+H D GGN + + +S NLH WD Sbjct: 152 L----ALKYVLHFVGDLHQPLHAADHDDRGGNCVKVSINNARSLNLHSYWDT-------- 199 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 Y K+I+ + + + I +D SW V ++A ES + ++ Y Sbjct: 200 -YVVKEIDPDPQHLADSLKKEISPEDKKSW-----VLGDSKQWAMESFQLGKRYAYSFNP 253 Query: 230 --------AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 L Y ++ + ++ + G+RLA +LN+ + Sbjct: 254 PAGCDATRPPIPLPAGYDSAARKVAASQLKKAGVRLAYILNHRLRSIP 301 >UniRef50_Q15ZB2 S1/P1 nuclease n=4 Tax=Alteromonadales RepID=Q15ZB2_PSEA6 Length = 256 Score = 209 bits (532), Expect = 9e-53, Method: Composition-based stats. Identities = 73/268 (27%), Positives = 112/268 (41%), Gaps = 35/268 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W + GH +T IAQ L +A A+ LLP DL+ +PD++R W Sbjct: 20 WGQIGHRVTGAIAQQHLTPQAQAAISALLP---TEDLAEASTYPDEMRSSPDDFWQKKAG 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ P D + A++ FT L+ + ++++ AL Sbjct: 77 PFHYVTIPKGQ-------TYADVGAPEQGDGVSALKMFTANLTSSQTSKAEKQL----AL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F+ H +GD+HQP+H G +D GGN + +F SNLH VWD E++ Y Sbjct: 126 RFIVHIIGDLHQPLHAGNGTDRGGNDFKVNFFWQDSNLHRVWDSELLDQRQLSYTEWT-- 183 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 I + D+ W + + ES+ I Y E T+S D Sbjct: 184 -------AILNRKISAQDINDW-----NTTDPKVWIAESVKI-RDEIYPSQE---TISWD 227 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 Y LP +R+ GIR+A LN ++ Sbjct: 228 YLYHHLPQAKQRLKMAGIRIAAYLNEIY 255 >UniRef50_C6X5W4 S1/P1 endonuclease family protein n=3 Tax=Bacteroidetes RepID=C6X5W4_FLAB3 Length = 263 Score = 208 bits (529), Expect = 2e-52, Method: Composition-based stats. Identities = 63/266 (23%), Positives = 109/266 (40%), Gaps = 28/266 (10%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW--TSPL 59 GH + IA+ L+++A +K ++ N L+ WPD ++ W T Sbjct: 24 GVTGHRVVAEIAENHLSNKARKNLKKIIG---NQKLAYWANWPDAIKSDTTGVWKQTDTW 80 Query: 60 HFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 H+++ + D + + I+ + Q+ + DR AL F Sbjct: 81 HYVNI---SPQADLKSFSDSLQAQTGPNLYTQIKTLSAQIKDKKTSAKDRE----IALRF 133 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 L H +GD QPMHVG D GGN+I L++F +NLH +WD +++ Y + Sbjct: 134 LIHLVGDSSQPMHVGRAGDLGGNTIKLKFFGENTNLHSLWDSKLVDFQKYSYEE--FAKV 191 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 + I S L W ++ + Y A ++ S DY Sbjct: 192 LDVKSKEEVRAIQSGTLEEWFYDSHLKA--------------NNIYANTVADKSYSYDYN 237 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVF 265 P++ +++ GG+RLA +LN++ Sbjct: 238 YKYAPLLERQLLYGGLRLAKILNDIL 263 >UniRef50_A4BZ60 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriaceae RepID=A4BZ60_9FLAO Length = 260 Score = 208 bits (528), Expect = 2e-52, Method: Composition-based stats. Identities = 65/266 (24%), Positives = 105/266 (39%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH T IA+ LN A + LL L+ + + D+++ Y + H Sbjct: 25 WGQNGHRATGEIAESHLNKRAKRKIDKLL---NGQSLAFVSTYADEIKSDKAYSEYASWH 81 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + + I L + + + L L Sbjct: 82 YVNM-------NLDETYATAAKNTKGDLITGINTCIAVLKDKSSSSE----DKSFHLKML 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMH+G D GGNS+ + WF +SNLH VWD ++I Y + Sbjct: 131 IHLVGDLHQPMHIGRKEDKGGNSVKVEWFGKRSNLHAVWDTKMIEGWNMSYLE--LAESA 188 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + I + L W I+ K Y V+A + +S Y Sbjct: 189 KKVSKEQIAAIEAGTLLDWVAE--------------IHEVTKKVYNSVDANKGISYRYSY 234 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 IV ++ GGIRLA +LN++F Sbjct: 235 DHFDIVRDQLQIGGIRLAKILNDIFS 260 >UniRef50_C6XYC1 S1/P1 nuclease n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XYC1_PEDHD Length = 268 Score = 208 bits (528), Expect = 2e-52, Method: Composition-based stats. Identities = 70/265 (26%), Positives = 111/265 (41%), Gaps = 26/265 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + +IA+G L+++A +K +L N L+ W D ++ Y + H Sbjct: 29 WGMLGHRIVGQIAEGYLSNKAKKGIKDVLG---NESLAMASNWGDFIKSDPAYDYLYNWH 85 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F++ P D + V I L + + ++R A+ L Sbjct: 86 FVNLP---AGLDKQGVFDQLDKETSPNVYNKIPEMAAVLKNRQSTAEEKRL----AMRLL 138 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD++QPMH D GGN + + WF KSNLH VWD +I Y Sbjct: 139 IHLVGDLNQPMHTARKEDLGGNKVFVTWFGEKSNLHRVWDEGLIEYQQLSYTE------- 191 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 N + +D L SWR F S AC Y ++ E LS Y Sbjct: 192 ---YANAINYPSNDQLNSWRNNSLK-----DFVYGSYQ-ACNRIYADIKPEERLSYKYNF 242 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVF 265 + ++ +++ +GGI LA +LN+++ Sbjct: 243 EFVGLLNEQLLKGGICLANMLNDIY 267 >UniRef50_Q5FP59 Nuclease S1 n=1 Tax=Gluconobacter oxydans RepID=Q5FP59_GLUOX Length = 300 Score = 208 bits (528), Expect = 2e-52, Method: Composition-based stats. Identities = 77/282 (27%), Positives = 113/282 (40%), Gaps = 26/282 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSP-- 58 W GH + IAQ L +A A LL + L + WPD + H K K +P Sbjct: 25 WGPYGHAIVADIAQERLTPQAQKAATALLALENHQTLDQVASWPDTIGHVPKKKGGAPET 84 Query: 59 --LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 H++D +D RDC D +CV + L+ DR A Sbjct: 85 LKWHYVDIDVSHPAYDQARDCPDH-----VCVVEKLPEEIKILADTHASAQDRL----TA 135 Query: 117 LLFLSHFMGDIHQPMHVG-FTSDAGGNSIDLRWFRHK----SNLHHVWDREIIL---TAA 168 L ++ H +GDIHQP+H D GGN+I L +F NLH +WD +I Sbjct: 136 LKWVVHLVGDIHQPLHAAERNKDMGGNAIRLTYFGDNANGHMNLHSLWDEGVIDHEADLH 195 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASW---RECGNVFSCVNKFATESINIACKWGY 225 + + I D+ W + +V++ +A ES ++A Y Sbjct: 196 VGPFYSIDASRAKKEADRLGALITPDETKYWVQDLDGDDVYNATVDWADESHSLARSVAY 255 Query: 226 KGVEA--GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 + A G + DY PI+ R+ Q G+RLA +LN Sbjct: 256 GALPANKGADIGKDYTALTWPIMELRLEQAGVRLAAVLNTAL 297 >UniRef50_A4C4V1 Putative S1/P1 Nuclease n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C4V1_9GAMM Length = 290 Score = 204 bits (519), Expect = 2e-51, Method: Composition-based stats. Identities = 67/271 (24%), Positives = 107/271 (39%), Gaps = 29/271 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSP-- 58 W++ GH + +IA+ L D+ A+ LL L + W D++R W Sbjct: 28 WAQNGHRVVGQIAENHLTDKTKMAIAHLLEGDK---LPEVTTWADEMRSDPSKFWKKESV 84 Query: 59 -LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+ ++A +F R + AI L + +R Sbjct: 85 IWHYINI-NEAEDFKPNRYRITATKGEVTDAYSAILKSIAVLQSEQTSLDKKR----FYF 139 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL+H +GDIHQPMHVG D GGN + +++F +NLH +WD++++ + Sbjct: 140 RFLTHVVGDIHQPMHVGRKDDRGGNDVKVKYFNKDTNLHSLWDKDLLEGENLSFSEYAY- 198 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 + + + + ES +IA K S Sbjct: 199 -FIDTTNKELISQYLASE-------------PKDWVLESFHIAKKLYE---VDDGNFSYS 241 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 Y + + R+ QGGIRLA LLN +F S Sbjct: 242 YVYEQKNTMNTRLLQGGIRLAGLLNAIFDPS 272 >UniRef50_Q3BPV9 Endonuclease S1 n=15 Tax=Bacteria RepID=Q3BPV9_XANC5 Length = 318 Score = 203 bits (516), Expect = 5e-51, Method: Composition-based stats. Identities = 64/258 (24%), Positives = 99/258 (38%), Gaps = 27/258 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK--YKWTSP 58 W +GH + RIA+ L+ +A V LL + L + W D++R K + P Sbjct: 74 WGPQGHRLVARIAETELSPQARTQVAQLLAGEPDPTLHGVATWADELREHDPDLGKRSGP 133 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+++ + C + RDC D CV A+ L+ + RR +AL Sbjct: 134 WHYVNLGEHDCTYSPPRDCPD-----GNCVIAALDQQAALLADRTQPLDVRR----QALK 184 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 F+ HF+GDIHQPMH G+ D GGN L+ SNLH +WD ++ A L Sbjct: 185 FVVHFVGDIHQPMHAGYAHDKGGNDFQLQIDGKGSNLHALWDSGMLNDRHLSDDAYLQRL 244 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 L + + A + + + + L Y Sbjct: 245 LALPAATAGSAALPPPAAAWAQASCKIAITPGVY----------------PSAHVLPATY 288 Query: 239 FNSRLPIVMKRVAQGGIR 256 + PI ++ G R Sbjct: 289 IATYRPIAETQLRIAGDR 306 >UniRef50_C5PWU6 S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PWU6_9SPHI Length = 262 Score = 202 bits (514), Expect = 9e-51, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 111/266 (41%), Gaps = 26/266 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + IA+ L ++A + L+ + L+ WPD V+ + + TSP H Sbjct: 23 WGMTGHRVVTEIAERHLTNKAKKNIAKLIGK---QHLAYWANWPDFVKSDHAFDETSPFH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +I+T N E+ D + + + L +G ++ M + L FL Sbjct: 80 YINTE---GNLTKEQFATALQQSPDNNIYKQLIRLSADLKAKDKGLTE----MQQNLYFL 132 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H MGD HQPMHVG +D GGN I++ WF N+H VWD ++ Y + Sbjct: 133 IHLMGDAHQPMHVGRPADLGGNKIEVMWFGKPDNIHRVWDSNLVDYEKYSYTE--YANVL 190 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + D ASW +I YK VE LS Y Sbjct: 191 DIHTRQENQRLTDGDFASWLYDT--------------HIVANKIYKDVEQNSNLSYRYIY 236 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +V + +GG+RLA +LN +FG Sbjct: 237 DNKYVVEDALLKGGLRLAKVLNEIFG 262 >UniRef50_C5K8A7 Nuclease S1, putative n=7 Tax=Perkinsus marinus ATCC 50983 RepID=C5K8A7_9ALVE Length = 366 Score = 201 bits (512), Expect = 2e-50, Method: Composition-based stats. Identities = 97/298 (32%), Positives = 139/298 (46%), Gaps = 46/298 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY---KYKWTS 57 W +GH L ND A AV +L E V ++ WPD V H +++W+S Sbjct: 18 WGPDGHATVADAGNKLFNDNANEAVAEILGEGVR--MADYASWPDSVLHGPDSSEWEWSS 75 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 LHF D C+F Y RDC D D CV G I+N+T Q++ R+ AL Sbjct: 76 GLHFADVE--QCHFIYSRDCKD-----DYCVVGGIKNYTRQVADTSLPIEQRQV----AL 124 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRW---FRHKSNLHHVWDREIILTAA-----K 169 FL HFMGDIHQP+HVG SD GGN+I + LHH WD ++I + Sbjct: 125 KFLMHFMGDIHQPLHVGRHSDYGGNTIKVDMKFANYEYGALHHAWDEKMIDQSQASQYDG 184 Query: 170 DYYAKDIN--------------LLEEDIEGNFTDGIWSDDLASWRECGNVF---SCVNKF 212 +Y +D N + + + G + D + W CVN Sbjct: 185 EYIQQDANYSTPLAERETFWGITVSDIMTELAEGGAFHDRVPMWLADCETNGLDECVNTM 244 Query: 213 ATESINIACKWGYKGVEA-----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 A ES IAC Y+ ++ G+ LS DY++ R+ IV +++A+G +R A ++N+ F Sbjct: 245 AEESAIIACADAYRHLDGDEIEYGDVLSMDYYDDRIKIVKEQLAKGAVRFAWIMNHAF 302 >UniRef50_A4HI96 p1/s1 nuclease n=10 Tax=Leishmania RepID=A4HI96_LEIBR Length = 328 Score = 201 bits (511), Expect = 2e-50, Method: Composition-based stats. Identities = 60/292 (20%), Positives = 105/292 (35%), Gaps = 39/292 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDLSALCVWPDQVRHWYKY 53 W GH++ IA+ L+ ++ + P ++ D+ WPD V+ W + Sbjct: 31 WGCTGHMVLAEIARRQLDPSNEKKIQAMAMKFKESGPFLLSPDMIQAACWPDDVKRWGQ- 89 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRR--Y 111 S H+ + A+ + L ++ R Y Sbjct: 90 DAMSTWHYYAMQYNPDGINIT------------DSVEAVNAVSVSLDMITSLSNVRSPLY 137 Query: 112 NMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREI- 163 + A ++L H +GD+HQP+H D GGN + +R LH WD Sbjct: 138 MLNFAWVYLVHLIGDLHQPLHAVSRYSEKYPHGDRGGNLVWVRVQTKMLRLHAFWDNICT 197 Query: 164 --ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC 221 + + + D+ + E + +S DL + V + A ES A Sbjct: 198 ATPVLYRRPLSSTDLLAISETADRLLKTYSFSSDLKT-------MQDVQRMANESYAFAV 250 Query: 222 KWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 Y + G TLS Y + + + R+ GG RL +LN + +++ Sbjct: 251 NSSYADMIPGTTLSAAYISRCVEVAESRLTLGGYRLGYILNKLLSDIDVDEN 302 >UniRef50_C9ZQW0 Single strand-specific nuclease, putative n=6 Tax=Trypanosoma brucei RepID=C9ZQW0_TRYBG Length = 326 Score = 201 bits (511), Expect = 2e-50, Method: Composition-based stats. Identities = 63/282 (22%), Positives = 102/282 (36%), Gaps = 29/282 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDLSALCVWPDQVRHWYKY 53 W+ GH++ IA+ L+ + VK P D WPD ++ Y Sbjct: 27 WAAFGHMVVAEIAKRNLDADVLEKVKQYTQHLSESGPFPKIPDFVQSACWPDDLKS-YDL 85 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 + H+ F+ + + + I + + LS++ Y Sbjct: 86 GVMNGWHYTANVYSRDGFELKE-----PLQQKSNIVSVIDSLSATLSYHETPL----YVR 136 Query: 114 TEALLFLSHFMGDIHQPMHVGF-------TSDAGGNSIDLRWFRHKSNLHHVWDREIILT 166 + AL L H GDIHQP+H T D GGN + +R + LH WD + Sbjct: 137 SFALAHLIHHYGDIHQPLHTTSQVSSEYKTGDLGGNLVHVRVRNTTTKLHSFWDDICRPS 196 Query: 167 AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + + + +F D + SW + + E +A + Y Sbjct: 197 ISMK---RPLEEKHYAKVRSFADRLVETYDVSW--EHRRQTNATIMSMEGFELAKEIAYA 251 Query: 227 GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 GV G LS Y + + +R+ G RLA LNN+ G+ Sbjct: 252 GVVNGSQLSSQYVDRCVETAEQRMTLAGYRLATHLNNILGSK 293 >UniRef50_C6XIU0 S1/P1 nuclease n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XIU0_HIRBI Length = 264 Score = 200 bits (508), Expect = 4e-50, Method: Composition-based stats. Identities = 70/270 (25%), Positives = 117/270 (43%), Gaps = 33/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK---YKWTS 57 W K GH +T IA+G L+D+A AV+ +L D++ + WPD +R + Sbjct: 25 WGKLGHRVTGEIAEGYLSDQAKVAVEAILG---VEDMAEVSTWPDYMRSSDDEFFKREAF 81 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLHF+ PD E+ + K ++ F L + + R AL Sbjct: 82 PLHFVTVPD-------EQTYAEAGAPKQGDAFTGLERFKAVLQNNESSAEELRL----AL 130 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 + + H + D+HQP+HVG D GGN +++ + SNLH +WD +++ Y + Sbjct: 131 IMVIHIVSDLHQPLHVGKGDDWGGNKVEIMFKGEASNLHEIWDEKLVQDEELSYTE-MAH 189 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 L+ + ++ + + + ES I Y + LS Sbjct: 190 WLDRKMTPELAQEWYN-------------ADPSVWIAESKEI-RPSIYPK-DGETDLSWQ 234 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y P++ +R++Q G+RLA LN +FG Sbjct: 235 YIYDHRPVMRQRLSQSGVRLAAYLNEIFGE 264 >UniRef50_Q1YUT9 Probable endonuclease n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YUT9_9GAMM Length = 281 Score = 200 bits (507), Expect = 7e-50, Method: Composition-based stats. Identities = 70/271 (25%), Positives = 109/271 (40%), Gaps = 34/271 (12%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 +GH + IA+ L+ + A + + L+ L +WPDQ+R K+ T H+ Sbjct: 20 GADGHRIIVSIAEKHLSKKTAAELTQISG---GTALTELALWPDQIRGQQKWSHTKSWHY 76 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 I+ D V A++ QL + + RR EAL F Sbjct: 77 INIKDH-------ERFSGLRRSPKGDVLSALKESYKQLKDPKTESQQRR----EALAFFV 125 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWFR--HKSNLHHVWDREIILTAAKDYYAKDINLL 179 H GDIHQP+HVG SD GGN + ++W + NLH VWD +I Sbjct: 126 HLAGDIHQPLHVGRYSDLGGNRVSIKWLGSNKRRNLHWVWDTGLIKDEQLGV-------- 177 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINI---ACKWGYKGVEAGETLSD 236 D + + +W+ +A ES + ++G + T+ Sbjct: 178 --DQYSALINKTTAQQRYNWQSDS-----FLDWAMESKVLRAQVYEFGQPVQKGPVTIDQ 230 Query: 237 DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y N P++ KR+ G+RLA LN +F + Sbjct: 231 QYINRTKPLLKKRLLMAGVRLAGCLNRLFDS 261 >UniRef50_A2QX99 Contig An11c0270, complete genome n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QX99_ASPNC Length = 309 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 113/294 (38%), Gaps = 36/294 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A+ L ++ V LL N D+S W D ++ K T PLH Sbjct: 21 WGDVGHRAIAYLAEKYLTVAGSNLVNELLANDKNYDISDAATWADTIKW--KRPLTRPLH 78 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D P K+C Y DC + C+ + N T Q++ + ++ EAL Sbjct: 79 YINPDDEPPKSCFVSYPHDCP-----PEGCIISQMANMTRQINDRHANMTQQK----EAL 129 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFR--------HKSNLHHVWDREIILT--- 166 +FL H GD+HQP+HV + GGN I + + + NLH VWD I Sbjct: 130 MFLIHLFGDLHQPLHVTGVA-RGGNDIHVCFDGKNHCNNDTKRWNLHSVWDTAIPHKING 188 Query: 167 ----AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 + + + C+ ++ATES + C Sbjct: 189 IKHNLKHNPERLASAKWADRLHEE---NKLRPADTECANTQEPLECIMQWATESNQLNCD 245 Query: 223 WGYKGVEAG---ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 + K L Y+ PIV ++ + +RLA ++ + ++ D+ Sbjct: 246 FVMKKGLQWLEKTDLGVKYYEVAAPIVDDQIFKAAVRLAAWISALAEDREEADN 299 >UniRef50_B0DTT7 Predicted protein n=2 Tax=Laccaria bicolor S238N-H82 RepID=B0DTT7_LACBS Length = 357 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 75/338 (22%), Positives = 123/338 (36%), Gaps = 71/338 (21%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRHWYK 52 W GH + IAQ L+ + ++ P ++ + W D R+ Sbjct: 23 WGFAGHEIVATIAQIYLHPTVLPTLCTIIDFSSTNFSPPDSTCHIAPIATWAD--RYKSN 80 Query: 53 YKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSD 108 W++ LHFI D P +C F + G K + V ++N T L + Sbjct: 81 MTWSAQLHFIGALDDHPPSSCAFPGKNGWA---GTKRVNVLDGMKNVTALLQGW-VKGET 136 Query: 109 RRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 EAL FL HF GD HQPMH+ + GGN + + + ++NLH VWD +I A Sbjct: 137 SDDAANEALKFLIHFFGDAHQPMHMT-GRERGGNQVKVAFGGKETNLHGVWDDSLITKAI 195 Query: 169 KDYYAK-----DINLLEEDIEGNFTD------------GIWSDDLASWRECGNVFS---- 207 +E+ + G+ D W+D++ W C +V Sbjct: 196 STIPQNYTLPLPYPEIEQALRGSSYDPYIRRIIWEGIVQRWADEIPGWLSCPDVVKRTSV 255 Query: 208 -----------------------CVNKFATESINIACKWGYKGVEAGETLS------DDY 238 C ++ + ++ C + + L Y Sbjct: 256 DSQVALGLGGTTGIEILPDNDVLCPYHWSRPTHDLLCDGVWPKEDDNPQLPLLELDTPAY 315 Query: 239 --FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 + +V K++A GG+RLA +LN +F Q + Sbjct: 316 SGMIGQRWLVEKQLALGGLRLAGILNYIFVNQGQRGAF 353 >UniRef50_Q01U80 S1/P1 nuclease n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01U80_SOLUE Length = 261 Score = 196 bits (498), Expect = 7e-49, Method: Composition-based stats. Identities = 71/270 (26%), Positives = 109/270 (40%), Gaps = 33/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W EGH + R+A L AA V +L L+++ W D VR + P H Sbjct: 19 WGPEGHSLIARLAAARLTPAAAAKVAEILG--PGNTLASISSWADSVRRARA--ESGPWH 74 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++D P + D ERDC K CV I++F L + R+ EAL+F+ Sbjct: 75 YVDIPINKPHLDMERDCP-----KGDCVIAKIEDFEKVLVNPAATPVQRK----EALMFI 125 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP+H D GGN + L +F SNLH VWD ++ E Sbjct: 126 VHFVGDMHQPLHCSDNKDKGGNDVKLEFFGRPSNLHSVWDSGLLGRM----------GAE 175 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGE-----TLS 235 + + + + + V +A + A K Y + + Sbjct: 176 DALFATLNRDLTPKRARKFEKG-----TVENWADQIHKAAQKTTYGRLPKSTAGVPPKID 230 Query: 236 DDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 Y + ++ + +GG RLA +LN Sbjct: 231 AHYEHEADELIRIELEKGGARLAKVLNATL 260 >UniRef50_Q989R8 Endonuclease n=1 Tax=Mesorhizobium loti RepID=Q989R8_RHILO Length = 278 Score = 196 bits (497), Expect = 8e-49, Method: Composition-based stats. Identities = 68/280 (24%), Positives = 116/280 (41%), Gaps = 36/280 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W EGH + IAQ L+ A VK +L V ++++ W D VR+ + + H Sbjct: 21 WGPEGHSIVAEIAQRRLSSTALMEVKRILGGEVA--MASVASWADDVRYAI-HPESYNWH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F+D P +D C V+ C I +++ + R ++L +L Sbjct: 78 FVDIPLADSKYDPVSQCA--ANVQGDCAIAEIDRAEHEITCATDPLQRR-----DSLRYL 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNS--IDLRWFR--------HKSNLHHVWDREIILTAAKD 170 H +GD+HQP H + G N+ + +++ NLH VWD II Sbjct: 131 IHIVGDLHQPFHTV-ADNTGENALAVTVKFGGLIKSPPKTPADNLHAVWDSTIIKQTTYA 189 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA 230 + G++ D + +D L E +A E+ +A + G+ Sbjct: 190 W-------------GSYVDRLETDWLLKHPEASETL-DPVAWALEAHTLAQEMA-AGITN 234 Query: 231 GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 G L +DY+ LP+V +++ + G+RLA +LN + Sbjct: 235 GANLDNDYYAKALPVVDEQLGRAGLRLAAVLNRWLATAPA 274 >UniRef50_Q7P202 Probable endonuclease n=1 Tax=Chromobacterium violaceum RepID=Q7P202_CHRVO Length = 274 Score = 193 bits (491), Expect = 4e-48, Method: Composition-based stats. Identities = 69/269 (25%), Positives = 109/269 (40%), Gaps = 22/269 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHW--YKYKWTSP 58 W +EGH +T IAQ LL+ +A VK L+P N D + L ++ DQ + + Sbjct: 23 WGQEGHRITGYIAQQLLSSKAKAEVKKLIP---NADFAQLALYMDQHKQELKQTLPGSDQ 79 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+ D P C+ E +C D C A I + L+ +DR +AL Sbjct: 80 WHYNDEPV--CSGVTEDECPD-----GNCAANQIDRYRKVLADRGAAKADR----AQALT 128 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK--SNLHHVWDREIILTAAKDYYAKDI 176 FL H +GDIHQP+H D GGN ++ SNLH VWD ++ K Sbjct: 129 FLIHMVGDIHQPLHAADNLDRGGNDFKVQLPGSSKISNLHSVWDTALVQQELNGADEKSW 188 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSD 236 + G + W N ++ + + +A L + Sbjct: 189 AAADLQRYQRNVSGWQGGGVMDWVHESNQYARADVYG----PLAGFSCGASPSTPVYLDN 244 Query: 237 DYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 Y + +V +++A+ G R+A ++N Sbjct: 245 TYLRAGGLLVDQQLAKAGARIAAVINQAL 273 >UniRef50_A6GGE9 Probable endonuclease n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GGE9_9DELT Length = 285 Score = 193 bits (491), Expect = 4e-48, Method: Composition-based stats. Identities = 70/283 (24%), Positives = 110/283 (38%), Gaps = 34/283 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNG---DLSALCVWPD-QVRHWYKYKWT 56 W +GH + IA+ L+ V+ LL L+ +W D + R ++ + Sbjct: 20 WHDDGHRIVGEIAERNLSPATRAKVRALLQGSDGKGDGSLATASIWADHEARESPEFAFA 79 Query: 57 SPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 + H+++ + C ++ G C+A A+ + L R EA Sbjct: 80 ASSHYVNLDGPTSPRELHAQCLERAG----CLATAVPYYADILRSEGASEDQR----AEA 131 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSID------LRWFRHKSNLHHVWDREIILTAAKD 170 L FL HF+GD HQP+H G D GGN ID +NLH WD ++ A + Sbjct: 132 LRFLVHFVGDAHQPLHAGRRGDRGGNDIDRLTIPGYTAKGETTNLHAAWDGALVALALTE 191 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA 230 + GI +D A W + + ES A Y V+ Sbjct: 192 RGVDW-----KAYAVALDAGIDADARARWVGG-----TIYDWLEESRRFAAAEAYLHVDG 241 Query: 231 ------GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 G+TL D++ +R++Q G+RLA LL +F Sbjct: 242 LTPVRSGDTLGADWYRRNSSTAEQRLSQAGVRLAALLEAIFED 284 >UniRef50_B8KH31 S1/P1 nuclease n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KH31_9GAMM Length = 323 Score = 193 bits (490), Expect = 5e-48, Method: Composition-based stats. Identities = 61/271 (22%), Positives = 93/271 (34%), Gaps = 36/271 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W GH + ++A L V+ LL + L W D++R W Sbjct: 58 WGAMGHEIAAQLADPYLTAHTRQQVEALLGKD---TLKTASTWADRMRSDPAPFWQEEAG 114 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ P R D A A+ F L ++ AL Sbjct: 115 PYHYVTIPRG-------RQYADVGPPPQGDAASALTQFARDLRSPSVSLERKQL----AL 163 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F H + D+ QP+HVG D GGN + +R F SNLH VWDR++ + A+ Sbjct: 164 RFAIHIIQDLQQPLHVGNGLDRGGNDVPVRIFGETSNLHSVWDRQMFESTARTQAQWLDY 223 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 ++ T + + ES + ++ Sbjct: 224 FKASELLRRPTQN---------------DADPQVWIAESAKLRETLY----PVPASIDTR 264 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 Y LP R+A GIR A LN ++ + Sbjct: 265 YIRRELPRAEARLALAGIRTAAWLNAIYDDN 295 >UniRef50_A3HUK9 Putative S1/P1 Nuclease n=1 Tax=Algoriphagus sp. PR1 RepID=A3HUK9_9SPHI Length = 257 Score = 193 bits (489), Expect = 7e-48, Method: Composition-based stats. Identities = 57/266 (21%), Positives = 103/266 (38%), Gaps = 31/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH + +A L A V+ +L + W D+++ +Y + H Sbjct: 23 WGQIGHYLIGYMAGQQLKRSARKNVERVL---YPMSIGRSGTWMDEIKSDKRYDYAYSWH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++ + + + AI +L ++ E L L Sbjct: 80 YLTSKHG--------EYDPHLQEEGGDAYEAINRIKEELKSGNLNPTE----EAEKLKML 127 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H + DIHQP+HVG D GGN + L +F SNLH VWD +I + Y + Sbjct: 128 IHMVEDIHQPLHVGTGEDRGGNDVKLEYFWQSSNLHSVWDSGMIDRWSMSYTE-----IG 182 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 +++ T + + + E+++ A YK + LS +Y Sbjct: 183 DELMRRLTPEMEDQYRE---------GSMEDWLQEAVD-ARPLVYK-IPENRKLSYNYDY 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 + P++ +R+ +RLA +L ++G Sbjct: 232 AVRPLLEERLIAASVRLAQILEEIYG 257 >UniRef50_A0BLJ0 Chromosome undetermined scaffold_114, whole genome shotgun sequence n=6 Tax=Paramecium tetraurelia RepID=A0BLJ0_PARTE Length = 712 Score = 193 bits (489), Expect = 8e-48, Method: Composition-based stats. Identities = 55/293 (18%), Positives = 101/293 (34%), Gaps = 28/293 (9%) Query: 1 WSKEGHVMTCRIAQGLLN---DEAAHAVKML------LPEYVNGDLSALCVWPDQVRHWY 51 W + GH+MT +IA+ L + L L + + + VW D ++ Sbjct: 422 WWEVGHMMTAQIAKNYLRDNRPDVLAWADSLVQDFNSLTDGKSNTFAEAAVWLDDIKETG 481 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 S H+ D P + +++ AI L++ + + Sbjct: 482 TEFLFS-WHYTDRPINPDGLL----IKIEDESRNINSIYAINQAVAVLTNSKTSRNRHTV 536 Query: 112 NMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRW-FRHKSNLHHVWDREI 163 + L L H +GDIHQP+H DAGGN ++++ N H WD Sbjct: 537 FKAQMLRVLLHVIGDIHQPLHDTSLYNNSYPDGDAGGNFLNIQLQNGTLMNFHSFWDSGA 596 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWR---ECGNVFSCVNKFATESINIA 220 + A + + + + D D + + + + + A Sbjct: 597 LTFAPNNSFLARPLSQSD---SEYLDKWSKDLMKKFPISKYSNYDMTNPSVWTYLGFRQA 653 Query: 221 CKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 ++ Y V A + S DY + + + GG RL L ++ Q ++ Sbjct: 654 QQFVYPMVAASNSYSSDYEKQAIAFCEENLIVGGYRLGSKLIEIYDQILQNEA 706 >UniRef50_A8HTU7 Endonuclease n=1 Tax=Azorhizobium caulinodans ORS 571 RepID=A8HTU7_AZOC5 Length = 282 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. Identities = 65/276 (23%), Positives = 109/276 (39%), Gaps = 37/276 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W ++GH + IAQ L A V LLP L+++ W D VR + T H Sbjct: 26 WGEDGHAIVAEIAQRRLTPTGAALVASLLP--KGASLASVASWADDVR--PDHPETRRWH 81 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++ P A +D RDC + + C+ AI+ + E T+AL L Sbjct: 82 YVGIPMGAATYDPLRDCPSR--PEGDCIVAAIERARLDMHCAPEPA-----ARTDALKLL 134 Query: 121 SHFMGDIHQPMHVGFTSDAGG-NSIDLRWFRH-----------KSNLHHVWDREIILTAA 168 H MGD+HQPMH G + L W +N+H +WD ++ A+ Sbjct: 135 VHLMGDLHQPMHAIAADHLGTRRKVLLNWAGQACTHDCEAPPPTTNMHVLWDTTLVRKAS 194 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 + G + D + + L +A+E+ + Y V Sbjct: 195 LSW-------------GGYVDRLEAGWLKEADAAAVAAGTPADWASETHGVGLAM-YALV 240 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 ++ Y+ + LP++ +++ + G+RLA +N Sbjct: 241 PPDNVINTTYYRAALPVLDQQLGKAGLRLAHEINAA 276 >UniRef50_A2EEH7 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EEH7_TRIVA Length = 328 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. Identities = 48/280 (17%), Positives = 99/280 (35%), Gaps = 24/280 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQVRHWYKYKWTSP 58 W H R+A+ L+ E + +L + + W D ++ + Sbjct: 14 WWGAPHYTVARLAETRLSPEQLKYINDILETWTSEKAVFHDTANWHDDIK-AANVAIMAN 72 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P + +++ D + A ++ + + ++ + Sbjct: 73 WHFRNQPIFSSDYE-----GDFSYPTTYNITDASKDCINTIMSET---TTSQWILGFCFR 124 Query: 119 FLSHFMGDIHQPMH-------VGFTSDAGGNS--IDLRWFRHKSNLHHVWDREIILTAAK 169 LSHF+ D H P+H D GGNS + + + N+H +WD + Sbjct: 125 TLSHFVADAHCPVHSAGRWSKAFPDGDRGGNSQAVVCTYGQPCRNMHMLWDSACLDFQIW 184 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 D+ ++ E N T+ + + ++ + + + E+ A K+ Y + Sbjct: 185 PLSKNDV----DEYEKNLTNLLNNYQPKTYLPETYQSTDPDVWENEAYRYASKYVYGNLP 240 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 T +D Y + ++ G RL +L F A + Sbjct: 241 DDFTANDTYIKEGANAAKQLISAAGYRLGEVLLKFFEARK 280 >UniRef50_A2ELH6 Class I nuclease, putative n=1 Tax=Trichomonas vaginalis RepID=A2ELH6_TRIVA Length = 315 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 58/279 (20%), Positives = 97/279 (34%), Gaps = 27/279 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN--GDLSALCVWPDQVRHWYKYKWTSP 58 WS E H + R+AQ +L + + +L + + DL + W D +R Sbjct: 5 WSGEPHQLIARVAQTMLTKKQRKWIDEMLFLWPSEAQDLITVSNWEDTIRSDIDDILMQ- 63 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P + ++ + + AI + + T+ + Sbjct: 64 WHFENKPYIEPEYTPKK------VTRTFNITNAID---DAMKSILDPTTTSFWTFGFYFR 114 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNSIDLR--WFRHKSNLHHVWDREIILTAAK 169 L HF+GD H P+H DAGGN I L S LH +WD + Sbjct: 115 ALIHFVGDSHCPVHSIAYYSDKYPKGDAGGNFIKLNCSISYFCSTLHKLWDSACLNFQHN 174 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 Y A + E++I + + E S + + ES A + Y + Sbjct: 175 KYVAPTLEDFEKNITR-----MMNAYPLKILEEHPSLS-PHDWIDESYKTAIDYAYTPLV 228 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + ++D Y + R+ G RL M+ F Sbjct: 229 DWKNINDTYLANGAEAAEYRITLAGYRLGMVFKQFFKER 267 >UniRef50_Q4QGQ3 3'-nucleotidase/nuclease, putative n=3 Tax=Leishmania RepID=Q4QGQ3_LEIMA Length = 381 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 63/288 (21%), Positives = 101/288 (35%), Gaps = 31/288 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVK-------MLLPEYVNGDLSALCVWPDQVRHWYKY 53 W +GH+ IA+ L + V+ + P + ++ L W D ++ Y Sbjct: 29 WWDKGHMCIAEIARRNLKPDVQAKVQACANALNKIGPFPKSTNIVELGPWADDLKSMGLY 88 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 S HFIDT + + V+ + VA I L + + Sbjct: 89 -TMSTWHFIDTIYNPQDVK-----VTINPVEIVNVASVIP----MLISAITSPTATSDII 138 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWF---RHKSNLHHVWDREI 163 ++ L HF+GDIH P+H D GGN + LH WD Sbjct: 139 ITSVANLIHFVGDIHMPLHSADLFSPEYPLGDLGGNKQIVIVNETAGTSMKLHAFWDSMC 198 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 ++ + ++ F D + S+ E + + A ES +A K Sbjct: 199 --EGPQNNAVRPLDKDAYAELSAFVDNLVKSH--SFTEEQMMMTNSTIMAAESYELAVKN 254 Query: 224 GYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G+ G LS+ Y + + RV G RLA +LN + Sbjct: 255 VYPGISDGTVLSESYKANGKILAAGRVTLAGYRLATILNTALAGVSLD 302 >UniRef50_Q25267 3'-nucleotidase/nuclease n=7 Tax=Trypanosomatidae RepID=Q25267_LEIDO Length = 477 Score = 191 bits (484), Expect = 3e-47, Method: Composition-based stats. Identities = 71/287 (24%), Positives = 119/287 (41%), Gaps = 26/287 (9%) Query: 1 WSKEGHVMTCRIAQGL----LNDEAAHAVKML---LPEYVNGDLSALCVWPDQVRHWYKY 53 W +GH+ IA+ L ++A A K+L P + D+ W D ++ Sbjct: 126 WWSKGHMSVALIAKRHMGASLVEKAELAAKVLSFSGPYPKSPDMVQTAPWADDIK-TIGL 184 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 K S H+I TP + E D V+ + VA I L E + + Sbjct: 185 KTLSTWHYITTPY----YTDEDFTLDVSPVQTVNVASVIP----MLQTAIEKPTANSDVI 236 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSN--LHHVWDREII 164 ++L L HFMGDIHQP+H SD GGN + + LH WD + Sbjct: 237 VQSLALLLHFMGDIHQPLHNVNLFSNQYPESDLGGNKQLVVIDSKGTKMLLHAYWDS-MA 295 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWG 224 + + + ++ + D NF D + + ++ + + + E+ ++A K+ Sbjct: 296 EGKSGEDVPRPLSEADYDDLNNFADYLEATYASTLTDKEKNLVDTTEISKETFDLALKYA 355 Query: 225 YKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G + G TLS++Y + I ++V G RLA +LN + + Sbjct: 356 YPGADNGATLSNEYKTNAKKISERQVLLAGYRLAKMLNTTLKSVSMD 402 >UniRef50_B9XJ21 S1/P1 nuclease n=1 Tax=bacterium Ellin514 RepID=B9XJ21_9BACT Length = 377 Score = 190 bits (482), Expect = 4e-47, Method: Composition-based stats. Identities = 69/284 (24%), Positives = 104/284 (36%), Gaps = 35/284 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLP------EYVNGDLSALCVWPDQVRHWYKYK 54 W EGH++ +I L+ L+ N W D + Sbjct: 44 WDAEGHMVVAQIGYNHLDPAVKAKCDALISVALTNVSSQNNTFVTAACWADDNKAALG-- 101 Query: 55 WTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMT 114 T+ H+ID P F + + V AI+ L T+ + + Sbjct: 102 -TAIWHYIDLP-----FSLDGTPTNGVAPASTNVVFAIRQCVATLQS----TNATQIDQA 151 Query: 115 EALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA 167 +L +L HF+GDI QP+H DAGGNS L + +NLH +WD Sbjct: 152 ISLRYLIHFVGDIQQPLHASTAVSASSPGGDAGGNSFSL--SGYWNNLHSLWDAG----- 204 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNV--FSCVNKFATESINIACKWGY 225 Y I+ + DG S ++ N+ +A ES +A Y Sbjct: 205 -GGYLTNSISRPLTAGGQSIIDGKVSAIEVAYPFTSNIGVIPNPMDWANESWGLAQNVAY 263 Query: 226 KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 G+ T S Y + +R++QGG RLA LLN ++ S Sbjct: 264 AGLTRSSTPSVGYLTTVQNTTQQRMSQGGHRLANLLNTIYSTSP 307 >UniRef50_A4CQ68 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriales RepID=A4CQ68_9FLAO Length = 257 Score = 190 bits (482), Expect = 4e-47, Method: Composition-based stats. Identities = 61/247 (24%), Positives = 93/247 (37%), Gaps = 30/247 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH +A+ L+ A AV LL L+ + + D ++ Y+ SP H Sbjct: 22 WGRTGHRAIGEVAEAHLSRRARKAVSRLL---EGESLAKVSTFGDDIKSDTTYRSFSPWH 78 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P + + D + I++ L L L Sbjct: 79 YVNLPPETP-------YGEITPNPDGDILQGIEHCIRVLKDPASPRDQ----QVFYLKLL 127 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMHVG D GGN I L++F +NLH +WD ++I Y L Sbjct: 128 VHLVGDLHQPMHVGRPEDRGGNDIQLQYFDKGTNLHRLWDSDMIEDYGMSYTE-----LA 182 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 E + I V ++A +S ++A Y VE GE L Y Sbjct: 183 ETLPPATRREI----------RVIQSGSVLEWAGQSQSLA-NRVYASVENGEKLYYRYRY 231 Query: 241 SRLPIVM 247 V Sbjct: 232 LWWDSVE 238 >UniRef50_A2ECC5 Class I nuclease, putative n=2 Tax=Trichomonas vaginalis RepID=A2ECC5_TRIVA Length = 319 Score = 190 bits (482), Expect = 5e-47, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 99/286 (34%), Gaps = 27/286 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--PEYVNGDLSALCVWPDQVRHWYKYKWTSP 58 W H+M RIA+ LL + ++ +L ++ ++ W D ++ Y Sbjct: 12 WWGHAHMMIGRIAESLLTSKEKKKIEAVLRYGQHPIQTITEATTWQDDLKGTYSLSVMET 71 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF+D P + + + + + L + + L Sbjct: 72 WHFLDHPIN-------KGKNTSIPPPTYNITTYMDSAYRALKDKT---TTDPWVWAFHLR 121 Query: 119 FLSHFMGDIHQPMH-------VGFTSDAGGNSI--DLRWFRHKSNLHHVWDREIILTAAK 169 L HF+GD+H P H + T D GGN + +N+H +WD + Sbjct: 122 SLIHFVGDVHTPHHNVALFNDLFPTGDHGGNLYILNCNLGSGCNNIHFLWDSAGFYFPMR 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC--VNKFATESINIACKWGYKG 227 + I ++ + N T I + + + ES +A +GY Sbjct: 182 NPV---IPKYRDEFQKNATKLINELPQSHYTSQNMDVKTFHPEVWHNESYEVAYNFGYNT 238 Query: 228 VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 G S DYF + +R+A G RL L V G E + Sbjct: 239 TMYGW-PSKDYFTTVQTQSKERIAISGYRLGYFLKEVVGNIPVEPT 283 >UniRef50_B0DTT9 Predicted protein n=2 Tax=Agaricales RepID=B0DTT9_LACBS Length = 375 Score = 188 bits (478), Expect = 1e-46, Method: Composition-based stats. Identities = 84/354 (23%), Positives = 125/354 (35%), Gaps = 96/354 (27%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------LSALCVWPDQVRHWYKY 53 W GH + IAQ L+ + +L + L+ + W D++R +K Sbjct: 22 WGAAGHEIIATIAQMYLHPSILPTICDILNFSEDETQPEQPCHLAPISTWADKLR--FKM 79 Query: 54 KWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR 109 +W++ LH++ D P + C F ER G + V AI+N T L + G + Sbjct: 80 RWSAALHYVGSLDDHPSQTCLFPGERGWA---GTRGGNVLDAIKNVTGLLEDWTRGEAGD 136 Query: 110 RYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK 169 EAL FL HFMGD+H P+H+ D GGNS + W ++NLH +WD +I A + Sbjct: 137 -ATANEALKFLVHFMGDLHMPLHLT-GRDRGGNSDRVLWSGRQTNLHSLWDGLLIAKAIR 194 Query: 170 DYY-----AKDINLLEEDIEGNFTD------------GIWSDDLASWRECGNVFS----- 207 +E + G D W DD+ W C Sbjct: 195 TVPRNYSRPLPYPDVEHALRGTIYDSYIRRIMWEGVFQKWKDDVPEWFSCPETTPPPPAR 254 Query: 208 ---------------------------CVNKFATESINIACKWGYKGVE-------AGET 233 C +A + C + G Sbjct: 255 GWQQVVMSLKRLAGKQGVEIGPDTDVLCPYHWAKPIHALNCDIVWPKELDEPPYGGGGSK 314 Query: 234 LSDDYFNSRLP----------------------IVMKRVAQGGIRLAMLLNNVF 265 +D+ R P +V K +AQGGIRLA +LN +F Sbjct: 315 FADEDVAGRPPKPHPPLLELDTPKYAGVIEDTMVVEKLLAQGGIRLAGILNYLF 368 >UniRef50_Q04SY8 Nuclease S1 n=4 Tax=Leptospira RepID=Q04SY8_LEPBJ Length = 295 Score = 187 bits (474), Expect = 4e-46, Method: Composition-based stats. Identities = 72/295 (24%), Positives = 116/295 (39%), Gaps = 50/295 (16%) Query: 1 WSKEGHVMTCRIAQGLL-NDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---- 55 W +GH IAQ LL N +A + +L L + PD++R + K Sbjct: 26 WGHQGHKTIGIIAQHLLVNSKAFEEINNILG---GLTLEEISTCPDELRVFQSEKKPMSS 82 Query: 56 --------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH 101 T HFIDTP N +E K CV I ++ L+ Sbjct: 83 VCNQIFTNPEPPTNTGSWHFIDTPISQFNPTHEDI---VKACKSSCVLTEIDRWSNVLAD 139 Query: 102 YREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG-FTSDAGGNSIDLRWFRHKSNLHHVWD 160 T+ +AL F+ HF+GDIHQP+HV D GGN + +R R+K+NLH WD Sbjct: 140 ----TTQTNAKRLQALSFVVHFIGDIHQPLHVAERNHDLGGNKVKVRIGRYKTNLHSFWD 195 Query: 161 REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA 220 ++ + + + I L + + + + A Sbjct: 196 TNLVNYISTNPISTTILLKSDV----------------AFAQTEAQTTPETWVLQGFQFA 239 Query: 221 CKWGYKGVEAGET----LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G+ +S+ Y + +P+V ++A G+RL+ L +F +S ++ Sbjct: 240 RNVAYDGIPIDYASVVRISNAYIQNAIPVVKHQLASAGVRLSQHLARIFSSSNKQ 294 >UniRef50_Q7RSD2 3'-nucleotidase/nuclease n=8 Tax=Plasmodium RepID=Q7RSD2_PLAYO Length = 328 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 53/298 (17%), Positives = 103/298 (34%), Gaps = 27/298 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRH------- 49 WS EGH++ IA L+D + + Y + VWPD +++ Sbjct: 24 WSDEGHMLISAIAYEGLDDREKKILTQIFQNYKEDNDFNNHIYAAVWPDHIKYYEHPVDT 83 Query: 50 ---WYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 H+I+ P N D + + + D + + + F ++ Sbjct: 84 TKRMDGISIMDRWHYINVPYNPTNIDLDMYHKEYYKDTDNSLTISRKIFQDLKLMEKKNN 143 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHVGF-------TSDAGGNSIDLRWFRHKSNLHHVW 159 ++ L + H GD+HQP+H D GG +I++ + LHH+ Sbjct: 144 YGSYFSYNFQLRYFIHVFGDMHQPLHTATFFNKHFIKGDFGGTAINVNYNNRTEKLHHLC 203 Query: 160 D------REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 D + +A + D L + ++ + + G + A Sbjct: 204 DCVFHARDKKWPSATVEEVTNDARTLMNTYPPEYFGNRLNNGMDEYEYLGYIVEDSYAQA 263 Query: 214 TESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 + I A + TL++ Y + ++ +++A GG RL L + + Sbjct: 264 IDHIYYAFPFESLNRHTAYTLTNAYVINLKKVLNEQIALGGYRLTRYLKTIIANVPDD 321 >UniRef50_B0T6T3 S1/P1 nuclease n=1 Tax=Caulobacter sp. K31 RepID=B0T6T3_CAUSK Length = 287 Score = 186 bits (471), Expect = 8e-46, Method: Composition-based stats. Identities = 71/285 (24%), Positives = 110/285 (38%), Gaps = 39/285 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRHWYKYKWT 56 W + GH + +IA+G L +AA AV LL + DL+A W D R ++ T Sbjct: 23 WGRTGHAVVAQIARGYLTPKAAAAVDALLAADTDALTPPDLAARASWADAWRKD--HRQT 80 Query: 57 SPLHFIDTPDKA------CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRR 110 + HF+D C G + C+ G + F +L+ + ++R Sbjct: 81 TEWHFVDVELDHPDLAGACFGFPASATPASAGPEKDCIVGRLNAFEAELADPKTDAAERL 140 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAK 169 A F+ HF+GD+HQP+H D GGN I L ++ NLH WD + Sbjct: 141 L----AFKFVLHFVGDLHQPLHAADNQDRGGNCIPLALGGPRTVNLHSYWDTVAVEAIEA 196 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGY---- 225 D + + + I + +W + +A ES +A Y Sbjct: 197 DP---------DKLAAKLSAQITPAERKAWEKG-----DAKTWAMESFALAKSTVYTIGS 242 Query: 226 ----KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 A L Y S V ++ + G+RLA+ LN G Sbjct: 243 KPGCASDTAPVPLPAGYNQSAQAAVALQLKKAGVRLALELNRALG 287 >UniRef50_C5LHN6 ATP-dependent RNA helicase, putative n=4 Tax=Perkinsus marinus ATCC 50983 RepID=C5LHN6_9ALVE Length = 1614 Score = 185 bits (469), Expect = 1e-45, Method: Composition-based stats. Identities = 77/300 (25%), Positives = 122/300 (40%), Gaps = 58/300 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W ++GH + IAQ +++D V L D+ + W D+ H +Y+WT+PLH Sbjct: 22 WGEDGHSIVAAIAQRIVSDRVIEGVNETLGR--GQDMIGVACWADKASHSAQYRWTAPLH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F+DTP K C YERDC D D CV GAI N+T + ++R A+ + Sbjct: 80 FVDTPTKQCQMVYERDCRD-----DFCVIGAIYNYTNRAISKSVSRAERE----FAMKLV 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILT-------------- 166 + P H + S LH VWD +IL Sbjct: 131 TTDFAPP-GPRH-----------------KVSSKLHQVWDSGLILQDEFELRVQRRREHR 172 Query: 167 -------AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF---SCVNKFATES 216 + + L E G ++ W C A ES Sbjct: 173 KIPPHPPYRHKFEERWHELFEHLWTKLSKGGEYAKHREEWLAPCRQNGLQECTKTMAEES 232 Query: 217 INIACKWGY-----KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 + +AC Y + + G+ L +YF +R P++ +++A+GG+RLA +L +FG+++ Sbjct: 233 LAVACTAAYHDEYRRWIADGDVLDRNYFLTRNPLMEEQLAKGGVRLAWVLQQMFGSNRHR 292 >UniRef50_Q236I5 S1/P1 Nuclease n=2 Tax=Tetrahymena thermophila SB210 RepID=Q236I5_TETTH Length = 330 Score = 183 bits (465), Expect = 4e-45, Method: Composition-based stats. Identities = 53/290 (18%), Positives = 103/290 (35%), Gaps = 31/290 (10%) Query: 1 WSKEGHVMTCRIAQG---------LLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY 51 W GH++T +A+ L E + L + + W D ++ Sbjct: 19 WWDGGHMITVEVAKQEILARDPALYLKIEKYVTILNPLCDARSQTFVQAASWADDIKDPA 78 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE----GTS 107 W HF + P D + A++ +L Sbjct: 79 MNFW-DKWHFFNKPINEEGLYVVLD----QDSLNNNSINALKRCIQELQKNNTTPINNPD 133 Query: 108 DRRYNMTEALLFLSHFMGDIHQPMH---------VGFTSDAGGNSIDL-RWFRHKSNLHH 157 + + +L H +GD+HQP+H D GGN ++ LH+ Sbjct: 134 NISVQQAIMMRYLIHIVGDMHQPLHNTNLFNYTFSTNQGDLGGNKENVILLNGTSMVLHY 193 Query: 158 VWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESI 217 +D + A +++ ++ +E +F + S+ + +A ES Sbjct: 194 YFDSGALRLAD---FSRPLSQEQEQQVTDFAASFRAQYPRSFFNERVNITLPEMWAQESY 250 Query: 218 NIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 IA + Y ++ ++ ++ N + ++ +++A GG RLA LL +VF Sbjct: 251 EIAVRDIYPYLKLTNKVTPEWDNLQYEMIKQQIALGGYRLADLLTSVFNP 300 >UniRef50_Q1N3Y8 Probable endonuclease n=1 Tax=Bermanella marisrubri RepID=Q1N3Y8_9GAMM Length = 226 Score = 183 bits (465), Expect = 4e-45, Method: Composition-based stats. Identities = 53/258 (20%), Positives = 103/258 (39%), Gaps = 32/258 (12%) Query: 8 MTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDK 67 M A L A H ++ +L VW D ++ ++ PLH+++ P Sbjct: 1 MVAAAAWPQLTPYAKHQIESILGFG-REKFVNASVWADHIKSDQRFNHLKPLHYVNLPKG 59 Query: 68 ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDI 127 + + +RDC + C+ AI +F+ S A+ L H + DI Sbjct: 60 STQYKQQRDCPE-----GQCIVQAIYDFSE------YARSGSEREQAMAVRMLIHLIADI 108 Query: 128 HQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNF 187 HQP+H G+ D GGN ++++ + +LH +WD +++ +++ LL++ + Sbjct: 109 HQPLHAGYKEDRGGNWFEVKYQDYTLSLHKLWDHQLVERFHENWQQGSTELLKDMPKATL 168 Query: 188 TDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVM 247 K+A S + + Y+ + +S+ Y + Sbjct: 169 YS-------------------PEKWAEISHALVERSVYE-TQENRLVSEAYLEMADDVTH 208 Query: 248 KRVAQGGIRLAMLLNNVF 265 +++ RLAM LN ++ Sbjct: 209 RQLQLASWRLAMWLNQLW 226 >UniRef50_A2G6P9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G6P9_TRIVA Length = 348 Score = 183 bits (464), Expect = 5e-45, Method: Composition-based stats. Identities = 59/296 (19%), Positives = 103/296 (34%), Gaps = 34/296 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNG--DLSALCVWPDQVR-HWYKYKWTS 57 W E H+ RIA+ ++ + + +L + + + + W D++ + + Sbjct: 12 WWNEPHMAVVRIAERMITKQQKDWMNVLFSMWPSEADTMVSASTWHDEIPENSAQVSIMK 71 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 HF D P A F+YE V + + L + T+ Y Sbjct: 72 NWHFADKPILAPGFEYEYQ-------PTYNVTSVVSDSMNALFN---PTTKSLYAYHFLF 121 Query: 118 LFLSHFMGDIHQPMHVG-------FTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAA 168 L HF+GDIH P H D GGN I+ ++ LH +WD ++ Sbjct: 122 RNLVHFIGDIHTPCHTAAYYSPKFEEGDRGGNSLKINCKYGEPCKQLHKMWDSGVLNFQH 181 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 D N L ++ E N I S + E+ ++A + Y + Sbjct: 182 M---YLDTNELLDEFEHNI-SHIMQMHPESSLPTVKSL-NAYLWFNETYDVAVNYAYGML 236 Query: 229 EA-------GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 + L +Y + ++ + G RLA ++ F ED + T Sbjct: 237 KDLNNSELDKYDLMPNYISKGAMAAEIQIVKAGYRLAYVIQEFFKVHSPEDPRIFT 292 >UniRef50_C5LN34 S1/P1nuclease, putative n=7 Tax=Perkinsus marinus ATCC 50983 RepID=C5LN34_9ALVE Length = 401 Score = 183 bits (464), Expect = 6e-45, Method: Composition-based stats. Identities = 58/293 (19%), Positives = 114/293 (38%), Gaps = 35/293 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH +A L+ A++ +K LL D W + W++ LH Sbjct: 29 WDIDGHEAVGMVAMSALDSRASNQLKRLLQ---GKDAVEDAGWAH--KAESSIPWSTRLH 83 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR----------- 109 F+ P+ N ++ + C+ A++ F Q S + Sbjct: 84 FLSQPEPFSNTLVV---NEITCPQGQCLLEALKLFYDQAKGDTSKISQKDRLMMSSARLP 140 Query: 110 -RYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTA 167 + +A+ FL + +GD+HQP+H GF +D G ++ + +L+ +WD EII Sbjct: 141 VQVTDADAVRFLINLIGDMHQPLHEGFQTDDFGKQTIVKLPGGSTLSLYELWDHEIIQET 200 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKG 227 K++ + N ++ D W+E + + K+ ++ A K+ Y Sbjct: 201 IKNHPQFWWSGWTHIQRAN--PDTYNADKKLWQENNK--AALEKWCNDNAEFANKFIYTN 256 Query: 228 VEAGETLS----------DDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 + E L ++++++ G R A++LN++ +S Sbjct: 257 PLSNERLPIGSGSPINVDAAVLEKWRQLLIQQILLAGSRTAIVLNDILESSAA 309 >UniRef50_Q2N7X6 Endonuclease n=3 Tax=Erythrobacter RepID=Q2N7X6_ERYLH Length = 276 Score = 181 bits (458), Expect = 3e-44, Method: Composition-based stats. Identities = 64/290 (22%), Positives = 106/290 (36%), Gaps = 48/290 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRHW-Y 51 W H +T IA+ + + A++ L PE L VWPD VR + Sbjct: 8 WGFFAHTVTGDIAEANIRPDTRAAMQRLFRAEGLLGTPECELKTLQDATVWPDCVRRMRW 67 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 ++ T+ H+ TP ++ ++C C+ I L+ + R Sbjct: 68 RWGHTAAWHYRTTPICEP-YEPWKNCPG-----GNCILAQIDRNQRILADESLPANVRL- 120 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF-RHKSNLHHVWDREIILTAAKD 170 +AL F+ HF+GD+H P+H G D GGN + + NLH +WD + A Sbjct: 121 ---QALAFMVHFVGDVHMPLHSGDKDDRGGNDRETDYGIAPGLNLHWIWDGPLAERAITS 177 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKG--- 227 + GI +D + ES I+ + Y Sbjct: 178 ARPSLVRRYSAAERAELAGGISAD-----------------WGRESWAISRDFVYPNAFD 220 Query: 228 --------VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 + L+ + + +P+ +RV Q G+R+A LL+ F Sbjct: 221 TDAVCETDLPGETALTQEDIVAAIPVSQRRVTQAGLRIARLLDEAFAPGP 270 >UniRef50_A4A822 Nuclease S1 n=1 Tax=Congregibacter litoralis KT71 RepID=A4A822_9GAMM Length = 293 Score = 181 bits (458), Expect = 3e-44, Method: Composition-based stats. Identities = 61/270 (22%), Positives = 90/270 (33%), Gaps = 36/270 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W GH + +A L+ A + LL + L++ W D++R W Sbjct: 19 WGAMGHELAGTLAAPYLSANARAQIDALL---KDETLASASTWADRMRGDPDPFWQEEAG 75 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ PD A+Q F L T +R AL Sbjct: 76 PYHYVTVPDGQS-------YTQVGAPPQGDGYTALQQFRKDLRDPTTPTRRKRL----AL 124 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F H + D+ QP+HVG D GGN I + SNLH VWDR++ + + Sbjct: 125 RFALHIVQDLQQPLHVGNGRDRGGNQIRVAINGETSNLHSVWDRQLFESTGRSKETWLDY 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 D+ E S + ES + + Sbjct: 185 FRRGDLLR---------------EPNPADSDPLLWIRESAALRETLY----PVPTAIDRA 225 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y +LP +R+A +R A LN F Sbjct: 226 YIKQQLPRAEQRLALSAVRTAAWLNATFDG 255 >UniRef50_A2E6R1 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2E6R1_TRIVA Length = 330 Score = 179 bits (454), Expect = 8e-44, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 91/275 (33%), Gaps = 26/275 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY--VNGDLSALCVWPDQVRHWYKYKWTSP 58 W H + I+Q L + + +L D+ + WPD + Y K + Sbjct: 12 WWGHSHTIIAHISQNQLTHKQISNINRILSSSGFETTDIEKISSWPDDLIE-YNLKSMAE 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+ D P + D + V I + L T+ + + Sbjct: 71 WHYADKP-----YVPYEDFNFIKPPPTYNVTTYINDAWETLHD---PTTTDLWAWAFHIR 122 Query: 119 FLSHFMGDIHQPMHVGF-------TSDAGGNSI--DLRWFRHKSNLHHVWDREIILTAAK 169 L H++GDIH P H D GGN + W N+H +WD + Sbjct: 123 NLIHYVGDIHTPHHNIARFTVYHQNGDMGGNLYRLNCTWGDACKNIHFLWDSCALAFPIA 182 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 D + D+ N + ++ ++ ES IA GY + Sbjct: 183 DITN---PIYASDLAKN--SSLIEEEFPMSSFENMTSVDPRAWSLESYAIASTLGYA-LP 236 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + S DY + +R+A G RL +L + Sbjct: 237 SYSEPSQDYLYNARQAGKRRIAMAGYRLGYMLKEL 271 >UniRef50_B6DTM7 Single-strand-specific nuclease n=2 Tax=Bodo saltans RepID=B6DTM7_9EUGL Length = 360 Score = 178 bits (451), Expect = 2e-43, Method: Composition-based stats. Identities = 56/291 (19%), Positives = 98/291 (33%), Gaps = 28/291 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-----LPEYVNGDLSALCVWPDQVRHWYKYKW 55 W GH++T IAQ LL + + ++ WPD ++ + Sbjct: 77 WGCAGHMITAEIAQQLLPTNVRRYFTDISAYQQMYYPRITSMTEASCWPDDMKSYTSQYS 136 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 + HF + N C V+ + A+ N QL+ T Sbjct: 137 S--WHFYNVCLLRANGT-NLTCPVWTSVETGQMPTAVANARAQLAMGSNLTHAES---AF 190 Query: 116 ALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 L FL H +GD HQP+H+ D GGN + ++NLH D L Sbjct: 191 WLAFLVHLVGDFHQPLHIATLFNPMFPKGDQGGNRFYIYVNNSRTNLHAFHDDLAWLLPR 250 Query: 169 KDYYAKDINLLEED--IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + + + +D + ++ + ++ N + + + E Y Sbjct: 251 DGFPQRPLAEYPDDVSMIEGLSESLILLQKFAYPSQPN-VTNTSVWIEEGFETGVNISYT 309 Query: 227 GVEAGE-------TLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 + LSD Y ++ ++A GG RLA +L ++ Sbjct: 310 LPNGQDLQFNQHFNLSDTYVTRLRSMLQNKLALGGRRLARILMEIYDEVHA 360 >UniRef50_A7H7R9 S1/P1 nuclease n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7R9_ANADF Length = 285 Score = 176 bits (447), Expect = 5e-43, Method: Composition-based stats. Identities = 60/279 (21%), Positives = 100/279 (35%), Gaps = 35/279 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WS+ GH + IA+ L A V+ +L + + + W D R T H Sbjct: 28 WSEPGHRIVAAIAEERLGPSARRLVREVLGATPMSN-ADVAGWADAQRD----PATRAWH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P A FD RDC ++ CV A++ +L +A +L Sbjct: 83 YVNIPL-AAAFDPARDCP-----REACVVAALERAIAELRDGEGAAR-----RADAFRWL 131 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAAKDYYAKDIN 177 H + D+HQP+H G D GGN + R H VWD++++ + Sbjct: 132 VHLVADVHQPLHAGDGRDRGGNDLPTRRERARGQPRPFHRVWDQDVLGPILR-------R 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGET---- 233 I + A W ++A ES +A + Sbjct: 185 RGTVAAARALARDIGPAEAARWAARP----SPAEWADESHALARALYAELGPLPRDGRIV 240 Query: 234 -LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 L +Y + + ++ + G+RLA LL + A Sbjct: 241 LLPREYADRQRARTELQLQKAGVRLAALLERIAAARAVR 279 >UniRef50_Q5ZV70 3'-nucleotidase/nuclease n=6 Tax=Legionella RepID=Q5ZV70_LEGPH Length = 285 Score = 176 bits (447), Expect = 6e-43, Method: Composition-based stats. Identities = 65/282 (23%), Positives = 100/282 (35%), Gaps = 38/282 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQVRHWYKYKW 55 W+ GH + +IA L ++ + L + W D +R W Sbjct: 28 WNAIGHQLVAQIAYDNLTPQSRR-MCDLYSHSKSKTSSNVNFVKSASWLDSIRAHD-VHW 85 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 LH+ID P + D + + D+ I LS + +D++ Sbjct: 86 FDALHYIDIP-------FSMDETELPVLTDINALWGINQAIAVLSSKKASIADKKL---- 134 Query: 116 ALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 +L L H +GDIHQP+H D GGN L +NLH WD + Sbjct: 135 SLRILVHLVGDIHQPLHTVTKISKKLPKGDLGGNLFQLAKNPIGNNLHQYWDNGGGILIG 194 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 +D + + N + WS AS ++ S +A YK V Sbjct: 195 QDKFFQIKNK------ARQLEKKWSCQSAS------KEKNPQQWINASHQLALTKVYK-V 241 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 A + Y + I K++ G RLA LLNN+ + Sbjct: 242 SAHQVPGKQYQLNTQNITEKQILLAGCRLAYLLNNIAEGKNK 283 >UniRef50_Q23AG7 Putative uncharacterized protein n=2 Tax=Tetrahymena thermophila RepID=Q23AG7_TETTH Length = 630 Score = 176 bits (446), Expect = 7e-43, Method: Composition-based stats. Identities = 57/298 (19%), Positives = 98/298 (32%), Gaps = 38/298 (12%) Query: 5 GHVMTCRIAQGLLNDEAAHAVK---MLLPEYVNGDLSAL--------CVWPDQVR-HWYK 52 H++ IA+ L L Y + + VW D ++ + Sbjct: 24 PHMLVLAIAKKELMKNDMEVYNITAKYLDTYSTQGVDTVSTTTYEENAVWADDIKVYGDA 83 Query: 53 YKWTSPLHFI-DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K H+I + N + A N L++ + + Sbjct: 84 QKAMEMWHYIGNKDSNPQNLTPLKKDPMAD---SENALNAYNNIVKVLTNEKFVGQMTEF 140 Query: 112 NMTEALLFLSHFMGDIHQPMHVG-------------FTSDAGGNSIDLRWFR-----HKS 153 + L L H +GDIH P H G F D GGN + ++ K+ Sbjct: 141 KVNM-LKMLVHIVGDIHMPHHTGSFYNATYKNDKGEFWGDLGGNRQMINFYTSTGEMKKT 199 Query: 154 NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 N+H +D + + +N + D I + N + +A Sbjct: 200 NIHFYFDSSCFFYTWTNRLVRPLNETFKIYFQRELDRIVAQYPKESLNIDN-TKTFSDWA 258 Query: 214 TESINIACKWGYKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 ES N+A Y + + + DD++NS ++ KR+ G RLA L +F + Sbjct: 259 DESWNLALNNVYPFLLSKNEIHYGDDFYNSSFDMIQKRIVTAGYRLAYTLQKLFTPEK 316 >UniRef50_B9XA25 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XA25_9BACT Length = 309 Score = 176 bits (445), Expect = 9e-43, Method: Composition-based stats. Identities = 56/296 (18%), Positives = 92/296 (31%), Gaps = 44/296 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN-------------GDLS-------AL 40 WS GH++ A L + V +L + + DLS Sbjct: 24 WSGAGHMVIAAEAYHELPERTRSKVDEILKAHPDYAKWVATHSKEKFADLSLSEYVFLRA 83 Query: 41 CVWPDQVRHW----YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFT 96 WPD++R + H++D P K F E + I Sbjct: 84 SKWPDEIRRAKGQGSRSYDHPHWHYVDYPLKPTKFPLE-----PGPSPKDDLLYGIAQCE 138 Query: 97 TQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWF 149 L + ++ L +L H +GD+HQP+H D GGN ++ Sbjct: 139 KNLCDSKASPEEK----AVYLSYLIHLVGDVHQPLHCCSLVNETYPNGDKGGNDFYVKPG 194 Query: 150 RHKSNLHHVWDREIILTAA-KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC 208 LH WD + ++ + I LL + + + + W G + Sbjct: 195 NKGIKLHSFWDGLLGTSSKPQTQIYYAIELLHDHPRKSLPELAKATTPKDWSLEGRQIAI 254 Query: 209 VNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + IN C + L +Y + R A G RLA + + Sbjct: 255 DKAYLRADINGGCGTSEQNA---CELPSNYTKEAKAVAENRAALAGYRLADEIQML 307 >UniRef50_UPI0001745ECB hypothetical protein VspiD_30620 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ECB Length = 323 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 55/314 (17%), Positives = 92/314 (29%), Gaps = 55/314 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL------------------PEYVNGDLSALCV 42 W GH++ +A L+ + LL + Sbjct: 24 WWGTGHMVVTSVAWRQLSQQEQEQAHALLKAHPKYNDWMSSYPADVPGLSKGLYAAMAAS 83 Query: 43 -WPDQVRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH 101 W D +R H++D P +F + V I+ ++ Sbjct: 84 LWADDIRDKNNPATHPEWHYVDYPLVPPHFP-----KEPAPNPTNDVLVGIKECERVIAS 138 Query: 102 YREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG---------FTSDAGGNSIDLRWFRHK 152 T ++ E + +L H +GD+HQP+H D GGNS +R + Sbjct: 139 PTTSTQEK----GEMVSWLIHLVGDVHQPLHCASLTNDDFPAPEGDRGGNSAFVRPDKQS 194 Query: 153 S--NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVN 210 NLH VWD ++ D E + + ++ Sbjct: 195 KAINLHMVWDSQLGGARV-----ADAGSSREALNKAIL--LETEHPRVAAAELQKSPSPE 247 Query: 211 KFATESINIACKWGYKGVE---------AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 ++ E +A + Y L + Y I +RV G RLA +L Sbjct: 248 SWSLEGRELAIQEAYLHGNLRYAVGKQLNAPVLPEGYTKKARAISERRVTLAGYRLADML 307 Query: 262 NNVFGASQQEDSVV 275 + S E Sbjct: 308 KRLLAVSTAEPERA 321 >UniRef50_A2F450 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2F450_TRIVA Length = 329 Score = 174 bits (440), Expect = 4e-42, Method: Composition-based stats. Identities = 49/281 (17%), Positives = 97/281 (34%), Gaps = 26/281 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--PEYVNGDLSALCVWPDQVRHWYKYKWTSP 58 W H + IA + + ++ L ++ + + VW D ++ Y S Sbjct: 11 WWGHAHSLIASIAMKDFSSKERKILEKFLEYGQHKRATIEEVAVWQDDLKGAYDLGIMSS 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF P + + + + L++ + + + L Sbjct: 71 WHFTPRPLIKDGYTATLQ------PVTYNITSYMNSAWNSLTN---PATTDPWIIAFHLR 121 Query: 119 FLSHFMGDIHQPMHV-------GFTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAAK 169 L HF+ D+H P H D GGN I + N+H +WD + Sbjct: 122 SLIHFVADVHTPHHNVGYYSQETPDGDKGGNLYQIICNYGSACMNIHFLWDSACLALPLG 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGY-KGV 228 + I ++ N T + + A K++ ES + ++GY + Sbjct: 182 NP---LIPKYLDEFSENVTKIMKNHQKAK--MGDLETIDFMKWSNESYDTVKQYGYSPAI 236 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 E ++D Y + + + RV+ G RL+ +L ++ + Sbjct: 237 ERYGEVTDQYLKTCQSVALNRVSLAGYRLSTVLRQIYNEKK 277 >UniRef50_O68530 Endonuclease S1 homolog n=1 Tax=Mesorhizobium loti RepID=O68530_RHILO Length = 309 Score = 172 bits (436), Expect = 1e-41, Method: Composition-based stats. Identities = 72/296 (24%), Positives = 112/296 (37%), Gaps = 43/296 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD------LSALCVWPDQVRHWYKYK 54 W +EGH IAQ L A+ V+ LL ++ ++++ W D R +K Sbjct: 22 WGQEGHAAVAEIAQHRLTSSASDVVQRLLRAHLGLTGQQVVSMASIASWADDYR-ADGHK 80 Query: 55 WTSPLHFIDTPDKA--------CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 TS HF+D P + ++D RDC D C+ A+ LS + Sbjct: 81 DTSNWHFVDIPLASLPGGSSATTDYDAIRDCAD-DATYGSCLLKALPAQEAILSDATKDD 139 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHV-----GFTSDAGGNSIDLRWF-----------R 150 R +AL F+ H GD+ QP+H G D GGN++ + + R Sbjct: 140 ESR----WKALAFVIHLTGDLAQPLHCVQRVDGSQKDQGGNTLTVTFNVTRPAPDNSTFR 195 Query: 151 HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWR-ECGNVFSCV 209 + H VWD ++I D+ E+ + D + D W EC Sbjct: 196 DFTTFHSVWDTDLITFKYYDW-GLAAAEAEKLLPTLAADLLADDTPEKWLAECHRQAEAA 254 Query: 210 NKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 + + G+ L YF P+V +++A GG+ LA LN Sbjct: 255 YQALPAGTPLKSDIGHP-----VILDQAYFEKFHPVVTQQLALGGLHLAAELNEAL 305 >UniRef50_A9UZI8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UZI8_MONBE Length = 179 Score = 172 bits (435), Expect = 1e-41, Method: Composition-based stats. Identities = 71/156 (45%), Positives = 93/156 (59%), Gaps = 4/156 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH T IA+ LL ++AA V +L N + ++ W D VR + W++PLH Sbjct: 26 WGPIGHQTTAAIAETLLTEKAATTVAQIL---DNASMVSVSTWADDVRSTSAWAWSAPLH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 FIDTPD+ C+FDY RDC + G D CVAGAI N+T QL + EAL F+ Sbjct: 83 FIDTPDRVCSFDYSRDCQN-DGRPDFCVAGAIVNYTRQLELAVAQGRLQDETTQEALKFV 141 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLH 156 HF+GDIHQP+HV FTSD GGN +++ +F NLH Sbjct: 142 IHFLGDIHQPLHVSFTSDEGGNLVNVTFFGEPENLH 177 >UniRef50_UPI00006CE90A hypothetical protein TTHERM_00559790 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CE90A Length = 482 Score = 170 bits (430), Expect = 5e-41, Method: Composition-based stats. Identities = 59/303 (19%), Positives = 99/303 (32%), Gaps = 39/303 (12%) Query: 5 GHVMTCRIAQGLLNDEAAHAVK---MLLPEYVNGDLSAL--------CVWPDQVR-HWYK 52 H++ IA+ L K L + + + VW D ++ + Sbjct: 24 PHMLILGIAKRELMKNDQEIYKITAKYLDTFSASGIETISTTSYEENAVWGDDIKTYGDA 83 Query: 53 YKWTSPLHFI-DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K HFI + N +D A N + + Sbjct: 84 QKAMGMWHFIGNKDSNPENLTLVKD----PMADSENALNAYDNIVKTFKNKSFIGKITEF 139 Query: 112 NMTEALLFLSHFMGDIHQPMHVGF-------------TSDAGGNSIDLRWFR-----HKS 153 + L L H +GDIH P H G D GGN ++++ + Sbjct: 140 KI-MMLKMLVHLVGDIHMPHHTGSYYNSTIVGPNKEIWGDRGGNRQKIKFYTSTGKKEST 198 Query: 154 NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 ++H +D K + +N + D I + N + N +A Sbjct: 199 DIHFYFDSSCFYYNWKSRLQRPLNDTFKAYFEAELDRIMTQYPKETLNINNAQT-FNDWA 257 Query: 214 TESINIACKWGYKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 ES NIA Y + + D ++NS ++ KR+ G RLA L N+F A + + Sbjct: 258 EESWNIALTEVYPFLLKNNEIRFGDAFYNSSFDMIQKRIVIAGYRLAYTLQNMFAAEKGK 317 Query: 272 DSV 274 + Sbjct: 318 IDL 320 >UniRef50_Q4PFZ0 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PFZ0_USTMA Length = 397 Score = 169 bits (429), Expect = 7e-41, Method: Composition-based stats. Identities = 64/369 (17%), Positives = 118/369 (31%), Gaps = 109/369 (29%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----------------GDLSALCVWP 44 W GH + IAQ L+ + +LP Y L+ L WP Sbjct: 35 WGIAGHQIVATIAQTQLHPLVREQLCTILPNYTRYPSHWPTSEDSKPRTHCHLAVLAGWP 94 Query: 45 DQVRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE 104 D +R +Y W+ LH+++ D + + V ++ N+T+++ Sbjct: 95 DTIRS--RYPWSGQLHYVNPVDDHP--PSQCLYGETGWTSPNNVLTSMVNYTSRVV---- 146 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREII 164 ++ + AL F+ H GD HQP+H+ + GGN + + + K+ LH VWD +I Sbjct: 147 --TETGWQRDMALRFMVHLFGDAHQPLHLTGRA-RGGNDVWVHFEGRKARLHTVWDTLLI 203 Query: 165 LTAAKDYYAKDINLLEEDIEGNFT--------------------------DGIWSDDLAS 198 ++ L IE D W + + Sbjct: 204 DKQIRELSNYTTRLPSGRIESALVGARYDPLIRFILKEGLGQPASRGQEGDAWWKQESSG 263 Query: 199 WRECGNVFS--------------------------------CVNKFATESINIACKWGYK 226 W C S C ++ ++ C + + Sbjct: 264 WPACQGQRSEIGALTQEYEGQLALSSISEDPHRVDNTVLPICPYEWTRPMHSLVCTYAFA 323 Query: 227 GVEAGETLS----------------------DDYF--NSRLPIVMKRVAQGGIRLAMLLN 262 + +Y R ++ K++A+ G+RLA +LN Sbjct: 324 APVPAWEPAPPPGQGEPEPSPTPVPEPELDVPEYVGRIERDKVIHKQLAKAGLRLAAVLN 383 Query: 263 NVFGASQQE 271 + ++ + Sbjct: 384 TLLLPAEVD 392 >UniRef50_O65425 Putative bifunctional nuclease n=1 Tax=Arabidopsis thaliana RepID=O65425_ARATH Length = 454 Score = 169 bits (428), Expect = 8e-41, Method: Composition-based stats. Identities = 74/145 (51%), Positives = 100/145 (68%), Gaps = 2/145 (1%) Query: 14 QGLLNDEAAHAVKMLLPEY-VNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFD 72 +G D+ AVK LLPE G L+ C WPD+++ +++WTS LH+++TP+ CN++ Sbjct: 2 KGFFEDDTIAAVKKLLPESVDGGGLADFCSWPDEIKKLSQWQWTSTLHYVNTPEYRCNYE 61 Query: 73 YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR-RYNMTEALLFLSHFMGDIHQPM 131 Y RDCHD H KD CV GAI N+T QL E + + YN+TEALLFLSH+MGD+HQP+ Sbjct: 62 YCRDCHDTHKHKDWCVTGAIFNYTNQLMSASENSQNIVHYNLTEALLFLSHYMGDVHQPL 121 Query: 132 HVGFTSDAGGNSIDLRWFRHKSNLH 156 H GF D GGN+I + W+ +KSNLH Sbjct: 122 HTGFLGDLGGNTIIVNWYHNKSNLH 146 >UniRef50_A2E030 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2E030_TRIVA Length = 372 Score = 169 bits (427), Expect = 1e-40, Method: Composition-based stats. Identities = 46/282 (16%), Positives = 96/282 (34%), Gaps = 36/282 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQV-----RHWYKY 53 W H M R++ L D + +L + + + W D++ R Sbjct: 12 WWNGPHEMVARVSWNDLTDRQQKIIYKILLTWPDEQKLFTNCGSWLDEIAAKYNRGTDLI 71 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 P HF+D P D + ++ + A+ + + T+ + + Sbjct: 72 SHFKPWHFVDFPL----IDGCENFEEKDTPFVYNITSALNHIISSFLD---PTTKSLWAI 124 Query: 114 TEALLFLSHFMGDIHQPMHVGFTS---------DAGGNSIDLRWFRHKSNLHHVWDREII 164 + L H + D+H P+H D G N L + NLH +WD + Sbjct: 125 NFDIRMLLHLVADVHTPVHCIDRYTPSSGTCKADHGANFFSLSLSINGKNLHSLWDSAVY 184 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWG 224 + + + L + + + + ++ V +A S IA ++ Sbjct: 185 AYPTGSFSEEMVQKLIFEYKDKIPEDSYVQNM-----------NVTAWALHSYEIAKEYV 233 Query: 225 YKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 Y G++ + + +D Y P ++ R+A +++ Sbjct: 234 YNGLKLNQYVGENDAYVTRAQPQAKAQIILASKRMAYIIDQF 275 >UniRef50_UPI000150A357 hypothetical protein TTHERM_00515230 n=1 Tax=Tetrahymena thermophila RepID=UPI000150A357 Length = 389 Score = 168 bits (426), Expect = 1e-40, Method: Composition-based stats. Identities = 50/291 (17%), Positives = 97/291 (33%), Gaps = 28/291 (9%) Query: 3 KEGHVMTCRIAQGLL---NDEAAHAVKMLLPEYVNGD------LSALCVWPDQVRHWYK- 52 H++ IA+ L + E + ++ +W D +++WYK Sbjct: 26 DLPHMLILGIAKETLIEKDPEIIQIAEKYFDQFEEPHQKGQVQFEEHSIWSDDIKYWYKS 85 Query: 53 -YKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K+ H+ID N+ + ++ + A L + Sbjct: 86 SVKYWDTWHYIDQIYNPSNYPID---VNKQKDSNSNAQVAFNQIKETLKNKNLNGKITVM 142 Query: 112 NMTEALLFLSHFMGDIHQPMHVGF-------TSDAGGNSIDLRW-FRHKSNLHHVWDREI 163 L L H +GDIHQP+H D GGN ++ K+NLH +D Sbjct: 143 KHIF-LKHLVHLVGDIHQPLHTVSFYSYQFQNGDLGGNKQMVQLSDNRKNNLHFYFDSGA 201 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 +D + N D + + + +++ ES I+ + Sbjct: 202 FYYTFEDRIHRPFNESFIDYFEEEIARLIKLYPREELKINDEDIQFDQWVKESYMISIEQ 261 Query: 224 GYKGVEAG-----ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 Y ++ ++D+ + K++ + G RLA +L + + Sbjct: 262 IYSQIDLTGNQKINKITDENHRKNQELCQKQIVKAGYRLANILVDFLKDEK 312 >UniRef50_Q8ILX4 p1/s1 nuclease, putative n=4 Tax=Plasmodium RepID=Q8ILX4_PLAF7 Length = 320 Score = 168 bits (424), Expect = 3e-40, Method: Composition-based stats. Identities = 49/303 (16%), Positives = 96/303 (31%), Gaps = 34/303 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDL---SALCVWPDQV---------- 47 WS E H++ IA LND + + + +W D++ Sbjct: 19 WSDEPHMLISYIAYINLNDGEKEILNRIFQNGNDAIFDNPITASIWADKIKPNNHKRTFH 78 Query: 48 ----RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 R + H++ Y H + G +++ L R Sbjct: 79 SSNFRRNELLDIFNEWHYVQLNYNPMKI-YIAPYHLRAHKGKHNAMGILKHIYRILIEVR 137 Query: 104 EG-TSDRRYNMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNL 155 + Y+ L F H D+HQP+H D GG I + + + L Sbjct: 138 QKMGHGTYYSYNFYLRFFIHIFSDLHQPLHAINFFNSNYPNGDRGGTDISVNYKGSINKL 197 Query: 156 HHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATE 215 H++ D I T K + ++ +E D + + ++ A E Sbjct: 198 HYLCDN-IFKTRKKQWPNINMTNIERDARYLMSTYPPESFGNKLFLPHDKIKYIDDIAHE 256 Query: 216 SINIACKWGYKGVEAG-------ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 S +IA + Y +++ + + ++ ++ G RL+ L ++ Sbjct: 257 SHDIAVQNIYSFFPLTDLKRSEQYSINQHFVINTKKLLNSQMVLAGYRLSAYLKDIIANI 316 Query: 269 QQE 271 + Sbjct: 317 PPD 319 >UniRef50_Q560K3 Putative uncharacterized protein n=2 Tax=Filobasidiella neoformans RepID=Q560K3_CRYNE Length = 393 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 66/223 (29%), Positives = 90/223 (40%), Gaps = 33/223 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH M IAQ L + +LPE N L+ + W D VR+ +Y+ T+P+H Sbjct: 20 WGAAGHEMVATIAQIHLFPSTRAKLCSILPEEANCHLAPVAAWADIVRN--RYRGTAPMH 77 Query: 61 FI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 +I D P C F +D+ V AIQNFT + + G Sbjct: 78 YINARNDHPQDHCEFGQH-----GWQNEDVNVITAIQNFTRLIMDGKGGKDVD-----IP 127 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 L FL HF+GD HQP+H+ D GGN + + NLH VWD II ++ Sbjct: 128 LRFLVHFIGDSHQPLHLA-GRDKGGNGAKFLFEGRERNLHSVWDSGIITKNIRELSNYTS 186 Query: 177 NLLEEDIEGNFTDG----------------IWSDDLASWRECG 203 L + IE W D++ SW C Sbjct: 187 PLPSKHIERCLPGAIFDPYVRWIVWEGIRLWWRDEVDSWISCP 229 Score = 52.1 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 17/70 (24%), Positives = 27/70 (38%), Gaps = 9/70 (12%) Query: 207 SCVNKFATESINIACKWG----YKGVEAGETL---SDDYF--NSRLPIVMKRVAQGGIRL 257 SC + + + C Y G + +D+Y R I+ K +A G+RL Sbjct: 311 SCPYHWISPIHQLNCDIVWPSKYTGQPNEPLIELDTDEYLGEIGRQKILEKMIAMAGLRL 370 Query: 258 AMLLNNVFGA 267 A +LN Sbjct: 371 AKVLNEALAE 380 >UniRef50_B3LAP6 Putative uncharacterized protein n=1 Tax=Plasmodium knowlesi strain H RepID=B3LAP6_PLAKH Length = 331 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 50/300 (16%), Positives = 100/300 (33%), Gaps = 30/300 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQV--------- 47 WS EGH++ IA L D+ ++ + Y D VW D + Sbjct: 24 WSDEGHLLISAIAYEGLTDDEKFVLQTIFKNYKEDNDFNDPVTAAVWADHIKPIDYHYTT 83 Query: 48 --RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQN-FTTQLSHYRE 104 R + + H+ P N ++ K +++ FT+ + ++ Sbjct: 84 KVRRIGGLELMNKWHYTSNPYNPTNIPLNE-YRKKYYQKTDNALSVLKSIFTSLKNMNKQ 142 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHH 157 ++ L + H GDIH+P+HV D G I++++ + LH+ Sbjct: 143 ENHGTFFSYNFNLRYFIHIFGDIHEPLHVVEFFNKHFPEGDNGATLINIKYNNNVEKLHY 202 Query: 158 VWD------REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNK 211 + D T+ ++ N L + + +DL+ + + Sbjct: 203 LCDCVFHTRSRRWPTSGMKEMLEEGNALMKMYPPEYFGDRLKNDLSDLEYLDFIVNDSYT 262 Query: 212 FATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 A I + L + + ++ +++A GG RL L + + Sbjct: 263 KAVNDIYSNFPHDTLNSKTPYVLDNSAVDKLKKMLNEQIALGGYRLRRYLKIMIENVPDD 322 >UniRef50_A2FAR0 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FAR0_TRIVA Length = 326 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 46/280 (16%), Positives = 93/280 (33%), Gaps = 27/280 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQVRHWYKYKWTSP 58 W E H R+A+ +L+ + +L + + W D ++ P Sbjct: 12 WWGEPHYFIARLAESMLSASEVKYLNRVLATWESEKAVFHDTGNWHDDLK-PIGMPLMVP 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P N++ V ++ LS + ++ + + Sbjct: 71 WHFRNQPVVDPNYNL------VTYPVTYNVTQVNKDC---LSAIYDTSTTSMWILGFCFR 121 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNS--IDLRWFRHKSNLHHVWDREIILTAAK 169 L+HF+ D H P+H D G LH VWD + Sbjct: 122 SLAHFVADAHCPVHASCYFSADYPNGDGGATKEKFVCPVDEVCDKLHFVWDSGSLNFQTW 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS-CVNKFATESINIACKWGYKGV 228 + E ++ +W++ S +++ +++ ++A ++ Y Sbjct: 182 PIPESLVKEAEYNL-----SHLWTNYPPEKHYSSTYNSIDPDQWQSDAYDVAKEYVYGLY 236 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + G ++ +YFN P K ++ RL +L F Sbjct: 237 QFGHNVTGEYFNKTQPPAAKLISVAAYRLGKVLQTFFHKR 276 >UniRef50_D0NJT7 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NJT7_PHYIN Length = 343 Score = 163 bits (413), Expect = 5e-39, Method: Composition-based stats. Identities = 68/312 (21%), Positives = 114/312 (36%), Gaps = 48/312 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYV-----NGDLSALCVWPDQVRHW----- 50 W GH++ +A+ L+++ ++ +L ++ G+++ VW D ++ Sbjct: 27 WWDNGHMLVGEVAKQLMSEADVVTIESVLSKWNEDFPNTGEITTSAVWMDLIKCTSVSSY 86 Query: 51 ------YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE 104 S H+ID P +E D +D A L + Sbjct: 87 CQSPLAPSITSMSDWHYIDLPVNINGDKWEYKDADLSLFEDTMGGDAASVIEGALRSLK- 145 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT-------SDAGGNSIDLRWFRHKSNLHH 157 T+ + + H GD+HQP+H D GGNS SNLH Sbjct: 146 -TTKSSWAANLFIRNFIHIFGDLHQPLHTVAGVSEAFTEGDGGGNSEYFASPCAFSNLHA 204 Query: 158 VWDREIILTAAKDYYAKDINLLEEDIEGNFTDGI-----WSDDLASWRECGNVF------ 206 VWD L + ++ A +I+ + ++ N TD I SD L + + Sbjct: 205 VWDAAGGLYSLNNW-ALNIDDFKSTLQSNATDLIALLLNISDTLDFSQYENTTYNELYTA 263 Query: 207 ----SCVNKFATESINIACKWGYKGVEAGET-------LSDDYFNSRLPIVMKRVAQGGI 255 S + + E+ + A Y G++ T S Y I KR+A GG Sbjct: 264 LVTNSALREVILETYSYADTVVYSGLDLNATSSGKYPCPSSSYLTLAGEISQKRIAIGGS 323 Query: 256 RLAMLLNNVFGA 267 RLA++L + Sbjct: 324 RLAIILKHFAAQ 335 >UniRef50_A0Z194 Endonuclease S1 n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z194_9GAMM Length = 275 Score = 163 bits (412), Expect = 6e-39, Method: Composition-based stats. Identities = 63/282 (22%), Positives = 108/282 (38%), Gaps = 48/282 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH C A + A+ LL L LC W D+++ + T H Sbjct: 30 WWDDGHQQVCEQAVAQVQPATLAAIADLLDAP----LGELCSWADEIKG--QRPETRQWH 83 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P + + + A+ +L H EALL++ Sbjct: 84 YLNAPPD------TLSIGNAPRPEGGDIIAALNEQIHRLKHAPTN------QRREALLWV 131 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWF----------RHKSNLHHVWDREIILTAAKD 170 H +GD+HQP+H+G+ SD GGN+ L R + ++H VWD I+ + Sbjct: 132 GHLIGDLHQPLHLGYASDLGGNTYRLELPEELALQLNEKRERVSMHAVWDGLILRYQDQP 191 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINI--ACKWGYKGV 228 A +E + N I + +A E++++ K Y+ Sbjct: 192 SVAATATPIERPLLLNPEVEIIA------------------WADETLSVLNDAKVHYRHG 233 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 +TL+ Y S V ++ + RLA LL+ F S++ Sbjct: 234 TRLQTLTSQYLISNRSAVDLQIRRAATRLAALLDWAFSQSKR 275 >UniRef50_C5LKE6 Putative uncharacterized protein n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5LKE6_9ALVE Length = 342 Score = 160 bits (404), Expect = 6e-38, Method: Composition-based stats. Identities = 65/291 (22%), Positives = 119/291 (40%), Gaps = 41/291 (14%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 + H + +A L D+ + ++L LS W + W + L Sbjct: 17 GSDFHAVVVELADLRLADKTRQELSIMLGNDYR--LSTTANWA----ARLNFPWLADL-- 68 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 + CNF Y RDC + C+AG+I N+T ++ T +R EA+ FL Sbjct: 69 STAYNDHCNFSYARDCTN----NGRCLAGSIWNYTNRMIDPYLSTKERS----EAVKFLV 120 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWF-RHKSN--LHHVWDREIILTAA-----KDYYA 173 H + D H P+ G +SD GG I++ SN L W +I+ Y Sbjct: 121 HLVADAHLPLSAGRSSDQGGKKINVHINFADFSNVDLSKAWREKILDEMQGALYPGKYVQ 180 Query: 174 KDINLLEEDIE---------GNFTDGIWSDDLASWRECGNV---FSCVNKFATESINIAC 221 +D N ++ G D ++ + SW +C++ E+ ++AC Sbjct: 181 QDSNSSSHRMKFWRVTSNSIGADLDQKYAGMVPSWLAECTQHGINACIDMILNEAADLAC 240 Query: 222 KWGYKG-----VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 + Y+ ++ + LS +Y+ SR+ ++ +++A+ RL +++ F Sbjct: 241 RIAYRNMDGRDIQNNDDLSREYYTSRIGMLREQLAKAATRLGWIMDEAFKN 291 >UniRef50_D2QW83 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QW83_9PLAN Length = 338 Score = 156 bits (395), Expect = 5e-37, Method: Composition-based stats. Identities = 61/318 (19%), Positives = 104/318 (32%), Gaps = 58/318 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----------------------L 37 W+ +GH + IA L E A+ +L ++ Sbjct: 28 WNAKGHRLVAAIAYRSLTPEDRDALIEILKQHPRFAADFERQMPDVVKSGTKDQQQEWLF 87 Query: 38 SALCVWPDQVR----HWYKYKWTSPLHFIDTPDKACNFDYER----------DCHDQHGV 83 VWPD +R H+I+ P + + V Sbjct: 88 GHAAVWPDYIRGFKGEESDKYHRPTWHYINWPHYLSDAEAAELAMPPMVNRHLDPAMTPV 147 Query: 84 KDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF-------- 135 + + +I +Q + +R + +L H MGD+HQPMH Sbjct: 148 LEQNLMQSIARLRSQFVDSKYSAEER----AVMICWLLHTMGDLHQPMHGASLFCKPLFV 203 Query: 136 TSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAK--DINLLEEDIEGNFTDGIWS 193 D GGNSI R NLH VWD + + + + L ++ T S Sbjct: 204 QGDRGGNSILTRQSG---NLHAVWDNALGNDDSFREVNRHATLLLATPEMTKIGTASQAS 260 Query: 194 DDLASWRECGNVFSCVNKF--ATESINIACKWGYKGVE--AGETLSDDYFNSRLPIVMKR 249 + +W E + + + + A S K V+ L++DY + + +R Sbjct: 261 IEQKTWLEESHALAVEHVYDQAVLSHVRVQMLTAKNVDDFPPLMLNEDYLRNSSKVSERR 320 Query: 250 VAQGGIRLAMLLNNVFGA 267 + G R+A +L + Sbjct: 321 SVEAGYRIAAVLRQLLHP 338 >UniRef50_B8P2Q4 Predicted protein n=2 Tax=Postia placenta Mad-698-R RepID=B8P2Q4_POSPM Length = 753 Score = 156 bits (395), Expect = 6e-37, Method: Composition-based stats. Identities = 58/212 (27%), Positives = 89/212 (41%), Gaps = 28/212 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPE-------------YVNGDLSALCVWPDQV 47 W GH + IAQ L+ + +L Y L+ + W D+V Sbjct: 323 WGAAGHEIVATIAQIHLDPSVLPVLCDILYPPSSSSHKASTSSAYPPCHLAPIAAWADRV 382 Query: 48 RHWYKYKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 R Y+WT+PLH++ D P +C F G ++ V A+ N T Q++ Sbjct: 383 RGSPAYRWTAPLHYVGAVDDAPADSCAFPGPNGWA---GRHNINVLAAVSNKTGQVA-AF 438 Query: 104 EGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREI 163 + EAL +L HFMGD+H P+H+ + GGN + + SNLH VWD + Sbjct: 439 LSGEAGLHEGEEALKYLVHFMGDMHMPLHLT-GKERGGNGAKVTFDGRVSNLHSVWDNLL 497 Query: 164 ILTAAK------DYYAKDINLLEEDIEGNFTD 189 I A + + D+ +E + G D Sbjct: 498 IAQALRTVPPNYTWPLPDMRGVEAHLRGAIYD 529 >UniRef50_B6ABV1 Putative uncharacterized protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6ABV1_9CRYT Length = 433 Score = 156 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 111/312 (35%), Gaps = 49/312 (15%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVW--------PDQVRHWYKY 53 +GH A L H +K L+ D+ + W P + ++Y Sbjct: 22 DADGHSAIAMTAMSGLKGNTLHQLKRLM---NGKDIVDISAWGERVSQKHPSTMPFHFQY 78 Query: 54 KWTSPLHFI--------------DTPDKACNFDYERDCHDQH-----GVKDMCVAGAIQN 94 + + LHF D + ++ C++ C+ I++ Sbjct: 79 QDMNELHFDKFLPESAPQMFGLGDGTRSFSHTYSDKYCNEVGASAECKETGHCLVPMIKH 138 Query: 95 FTTQLSHYREG----TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID----L 146 ++L + ++++ FL + +GD+HQP+H GFT G + Sbjct: 139 LYSRLIGLDRNKISYPEGIQLTDSDSVKFLVNLIGDLHQPLHFGFTESNAGRDFHGHLII 198 Query: 147 RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF 206 +L +W++ +I + I+ + W+E G Sbjct: 199 NGTEETISLFEIWEKGLIQKLKIEKPQFWYGGWTHVFA---IRDIFDKETILWKERG--I 253 Query: 207 SCVNKFATESINIACKWGYKGVEAGETLSDDYF------NSRLPIVMKRVAQGGIRLAML 260 ++ +A ESI I C + E L++++ + I+ R+ G RL+++ Sbjct: 254 DIIDDWARESIQIMCSALFIHPLNQEKLTNNFNIDPLLEFAWFEILRSRLLIAGARLSIV 313 Query: 261 LNNVFGASQQED 272 LN++ + ++ Sbjct: 314 LNDILKYREGKE 325 >UniRef50_A6C3P1 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3P1_9PLAN Length = 330 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. Identities = 61/312 (19%), Positives = 98/312 (31%), Gaps = 58/312 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----------------------L 37 W+ GH + IA L E A+ LL ++ + Sbjct: 24 WNYAGHRVIASIAWDQLTPETQAAMIALLKQHPRFEQDFQSRMPEVILKASPAVQDRWLF 83 Query: 38 SALCVWPDQVRH----WYKYKWTSPLHFIDTPDKACN-----------FDYERDCHDQHG 82 WPD R + H+I+ P + + Sbjct: 84 MRAATWPDIARSFKEADREKYHHGTWHYINQPIYLDTASELSLSSKLPVNTAKSIRQGDD 143 Query: 83 VKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG-------- 134 + A++ Q+ +D+ AL ++ H GD HQP+H Sbjct: 144 PLQFNILQALEYNVAQMKDPAVSEADK----ALALCWIMHLTGDSHQPLHSSALFSKGSF 199 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 D GGNSI + KSNLH WD + + L D + Sbjct: 200 PEGDRGGNSIRI----GKSNLHAQWDGLLGNSFKDSEIVSQAVGLARDPALKQLGEQATK 255 Query: 195 DL--ASWRECGNVFSCVNKFATESINIACKWGYKGVE--AGETLSDDYFNSRLPIVMKRV 250 +L A W + + + + + A + E + L Y+ + I +KR Sbjct: 256 NLNYADWIDESHALAKSAGYTQLILAAAKQNDSPQNEFLKLKDLPAAYYRTAGAIAVKRA 315 Query: 251 AQGGIRLAMLLN 262 AQ G RLA ++N Sbjct: 316 AQSGWRLAAVIN 327 >UniRef50_Q47K45 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47K45_DECAR Length = 301 Score = 154 bits (388), Expect = 4e-36, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 104/312 (33%), Gaps = 70/312 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--------------LSALCVWPDQ 46 W+ GH + IA L+ A+ L + + + + WPD Sbjct: 20 WNAAGHRLVAVIAWQQLSPATRDAISAALAHHPDHERWVEKARSREGIAVFAEASTWPDD 79 Query: 47 VRHWYKYKW------------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCV 88 +R+ + H++D V+D + Sbjct: 80 IRNDPRLYDEDREPPTPAVPGLPETARHKRWHYVDLD-------------ATGKVRDGEL 126 Query: 89 AGAIQNFTTQLSHY-REGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDL- 146 I+ + L + + + AL +L H + DIHQP+HVG D GGN +++ Sbjct: 127 DRQIERLSQLLQAKGSSPGTRKSEQIAYALPWLLHLVADIHQPLHVGQHGDEGGNKVEIE 186 Query: 147 RWFRHK---SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECG 203 F + S+LH WD + N LE++ Sbjct: 187 NPFNKRLPFSSLHLYWDDLPGPPWLRG------NRLEKNAGRLLDS-----------YPK 229 Query: 204 NVFSCVNKFATESINIACKWGYKGVEAG--ETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 V V + ES + Y V +S+D+ ++ I +R+ + G RL LL Sbjct: 230 PVQGNVALWRDESHQL-LAAAYPKVSGSLLPIISEDFQDNARQIANRRIVEAGYRLGHLL 288 Query: 262 NNVFGASQQEDS 273 ++F ++ Sbjct: 289 ESIFRERVSRET 300 >UniRef50_B2JAU7 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2JAU7_NOSP7 Length = 332 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 57/307 (18%), Positives = 98/307 (31%), Gaps = 54/307 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKM-------------------------LLPEYVNG 35 W+K GH+++ IA L + + PE N Sbjct: 41 WNKSGHMVSGAIAYSELKQSNQQNLDKVVAILKEHPEYSKFEQQWNSLNQSNISPEDKNL 100 Query: 36 DLSALCV-WPDQVRHWYKYKWTSPLHFIDTPDKA--CNFDYERDCHDQHGVKDMCVAGAI 92 L W D+ R ++ H+I+ P + + R+ D+ + I Sbjct: 101 YLFMWAAKWADEARDNPEFNH-PTWHYINFPYQPGRASNSIPREIPDEENI--------I 151 Query: 93 QNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG---------FTSDAGGNS 143 F L + S+ A+ +L H +GD+HQP+H D GG Sbjct: 152 FAFQKNLDVVKSNASNSD--KAVAICWLFHLIGDVHQPLHTTKLITNQYPQPEGDRGGTR 209 Query: 144 --IDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE 201 I ++ +LH WD I+ + L + N + +W Sbjct: 210 FYIRVKPNSQTISLHKFWDDLILGSERFQAVRNAATSLRSSYQRNKLPELRETKFNNWA- 268 Query: 202 CGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 ++ G G+ L +Y + I +R++ G RLA +L Sbjct: 269 ---KLESFRIAKQDAYLNGKLSGSSDKNDGKLLPANYAATAKQIAQRRMSLAGYRLADVL 325 Query: 262 NNVFGAS 268 N + G Sbjct: 326 NQLLGQR 332 >UniRef50_A3FPP7 S1/P1nuclease, putative n=2 Tax=Cryptosporidium RepID=A3FPP7_CRYPV Length = 416 Score = 151 bits (380), Expect = 3e-35, Method: Composition-based stats. Identities = 55/296 (18%), Positives = 107/296 (36%), Gaps = 35/296 (11%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 EGH L + + ++ L+ D+ + W + R K+ T P HF Sbjct: 24 DAEGHSAIGMTTISGLQNNFSQKLRRLM---NGKDIVDISGWGE--RVSKKHPSTLPFHF 78 Query: 62 IDTP--DKACNFDYERDCHDQ--------HGVKDMCVAGAIQNFTTQLSHYREG-----T 106 D N + D ++ C+ I++ +L Sbjct: 79 QGQSKGDYFKNGELGNDFKEKFILKSDSNCKHTGHCLVPMIKHLYYRLIGDNSKFKINYP 138 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID----LRWFRHKSNLHHVWDRE 162 + ++++ FL + +GD+HQPMH GF D G I + + +L +W+ Sbjct: 139 EGIQLTDSDSIKFLINLIGDLHQPMHFGFIEDGLGREIKGMMSINGTNERLSLFEIWESG 198 Query: 163 IILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 I + + I+ +L W+E G +N +A E+ I Sbjct: 199 IARKLKTEKPQFWFGGWTHILA---IRDIFDKELLLWKERG--IEMINDWAKENFEIVTN 253 Query: 223 WGYKGVEAGETLSDDYF------NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 Y + + + D++ + L I R+ G RL+++LN++ + ++ Sbjct: 254 EIYFHPISKQPIIDNFNVDVTLEFAWLEIFRSRILIAGARLSIILNDILKLREGKE 309 >UniRef50_C9SGH7 Nuclease PA3 n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SGH7_VERA1 Length = 303 Score = 150 bits (378), Expect = 5e-35, Method: Composition-based stats. Identities = 47/282 (16%), Positives = 85/282 (30%), Gaps = 23/282 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ + H A+ L+ A + +L L + W D R + + T+ H Sbjct: 21 WNTDIHQQIGFAAEKFLSPAAKAILSEILEPESGASLGRIGAWADAHRGTPEGRHTTTWH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D P CN Y RDC C+ A+ N T L D + Sbjct: 81 WINPADQPPSFCNVHYNRDCTS-----GGCIVSALANETQILKSCIRSVKDASLSAAPTP 135 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 + V D + +S + + I Sbjct: 136 RAPTPPT--------VFPVVDREEEKF-VYLTPARSGTAPL--STCSAANVTGFPNTTIQ 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKGVEAGETL 234 D+ + W C +C ++A ++ C + + L Sbjct: 185 PFFSDMVDRIRADTYFVPTRDWLSCTDPSTPLACPLEWARDANQWNCDYAFSQNTNASDL 244 Query: 235 -SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVV 275 + Y PI ++A+ +R+A N + + ++ VV Sbjct: 245 RTSGYAEGAWPIAELQIAKAVLRIATWFNKLADCNFKDREVV 286 >UniRef50_B6KFB6 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KFB6_TOXGO Length = 439 Score = 149 bits (376), Expect = 8e-35, Method: Composition-based stats. Identities = 56/324 (17%), Positives = 107/324 (33%), Gaps = 66/324 (20%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDT 64 H L+ A A+K LL DL+ + W R KY T+ LHF+ Sbjct: 32 AHEAVSMTTLSGLSTSANQALKKLL---NGKDLADVAGWAH--RVSDKYPDTARLHFMSQ 86 Query: 65 PDKACNFDYERDC---HDQHGVKDMCVAGAIQNFTTQLSHYREG---------------- 105 P D VK C+ A+ F L + Sbjct: 87 PTCPSKPLRTDDIILDKSFCEVKGNCLLEALTYFFFHLVDPDQNKVEQTNPDVITTTNFV 146 Query: 106 -TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK----SNLHHVWD 160 D + +A+ ++ + +GD+HQP+H+G D G +++ + + L++ + Sbjct: 147 FPHDIKTTDADAVKYIINLVGDMHQPLHMGSADDDYGRRAVVQYSDGEQMRLTTLYNFLE 206 Query: 161 REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA 220 ++ K + N G + + + + +++A E+ + Sbjct: 207 AGLVDKTVKQRQYFWFSGWTHV---NSVKGAYDSEKSLFATNKEKM--FSEWAKENRAVL 261 Query: 221 CKWGYKGVEA------------GETLSDDYFNSRLP--------------------IVMK 248 C Y V G D+Y + L ++ K Sbjct: 262 CNEVYPHVRKTGKDARAAANALGSDAVDEYAKAVLDGSSDVPLFEIDAAAEFALFQVLKK 321 Query: 249 RVAQGGIRLAMLLNNVFGASQQED 272 R+ G R+A+++N + + +D Sbjct: 322 RILLAGARVAIVMNYILQVRESKD 345 >UniRef50_B8KWM0 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KWM0_9GAMM Length = 271 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 58/264 (21%), Positives = 93/264 (35%), Gaps = 43/264 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH C A + + LL N ALC WPD+++ T+P H Sbjct: 22 WWDLGHAAICDAALEYVKPGTRLEIDRLLATRDNRGFGALCSWPDEIKTDQ--PTTAPWH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P D + + + +LS R EALL++ Sbjct: 80 YLNVPVGTT------DIATAPRPAEGDILAVLTEQQARLSQANTDIHAR----AEALLWV 129 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFR----------HKSNLHHVWDREIILTAAKD 170 +H +GD+HQP+HV + D GG+S L+ R ++ +H +WD + L A Sbjct: 130 AHLVGDLHQPLHVAYAEDRGGSSYRLQVPREIRALLGERYEETGMHQIWDGYLPLYARYS 189 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK--WGYKGV 228 + L+ E ++A ES+ I Y Sbjct: 190 GGSGLKQLVIEQ-------------------SAEAGGTPLEWAQESLTIMNNPGTAYLYG 230 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQ 252 L + Y I +KR+ Q Sbjct: 231 YRITILDEAYLAKNYRIALKRMKQ 254 >UniRef50_B6KF36 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KF36_TOXGO Length = 397 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 54/358 (15%), Positives = 92/358 (25%), Gaps = 96/358 (26%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQV-------- 47 W H++ IA+ ++ A V +L + + VW D + Sbjct: 25 WHSGPHMIVAAIARSEMSALAQIKVDYILGLWRGQYPDHATMERASVWLDDINGKGPPYE 84 Query: 48 ---RHWYKYKWTSPLHFIDTPDKA------------------------------------ 68 R + K +H ++ P Sbjct: 85 KPSRRFDFLKIFQFMHGVNIPYNPEGIQLQGLDALLPLYERSAEFLLDMAWDGLKATTPT 144 Query: 69 --------CNFDYERDCHDQHGVKDMCVAGAIQNF------------------TTQLSHY 102 C+ + V A NF ++Q+S Sbjct: 145 TEKLEDPFCSVPPPVSSFSLASYSEGTVNAANGNFLEVSHPDEYRRNTGVSARSSQVSTD 204 Query: 103 REGTSDRRYNMTEALLFLSHFMGDIHQPMH-------VGFTSDAGGNSID-LRWFRHKSN 154 E ++ L + H + DIHQP+H D G I + +N Sbjct: 205 AESPVGTVLSLNFYLRMVIHLVADIHQPLHSLLAFSPAFPHGDRFGTKISMVLPNGEDTN 264 Query: 155 LHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFAT 214 LH WD + + D + EE D L S + + A Sbjct: 265 LHAFWDGAGSVYTKRRGEFTDEEIAEEARRIKL--EFPKDSLESHLKPELLAPNFRNMAE 322 Query: 215 ESINIACKWGYKG--------VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 ES + Y+ + + Y +++A G RL L + Sbjct: 323 ESHRLGAALAYREFNFRTFRPADLPYVPTHTYLADVRLACRRQIAIAGYRLGYALEEL 380 >UniRef50_C7RIT3 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RIT3_9PROT Length = 320 Score = 148 bits (372), Expect = 3e-34, Method: Composition-based stats. Identities = 61/314 (19%), Positives = 97/314 (30%), Gaps = 73/314 (23%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD---------------LSALCVWPDQVRH 49 GH ++ IA ++ AV LL ++ + + WPD +R Sbjct: 33 GHRISAMIAWESMDAGTKSAVGQLLRQHPDYERWQARAHGGDPELTAFLEASTWPDDIRK 92 Query: 50 WYKYKWTS------------------PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGA 91 ++ T H++D P G AG Sbjct: 93 DRRFYTTGREEPTATLPGFPDMERRLHWHYVDRPVNP-------------GAGTGPAAGV 139 Query: 92 IQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT------SDAGGNSID 145 I L+ AL +L H +GD HQP+H SD GGN + Sbjct: 140 IDRQLAVLARIVGDRQATMAERAYALPWLIHLVGDAHQPLHAASRYGPDGQSDNGGNLVS 199 Query: 146 -LRWFRHK---SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE 201 + F + +LH WD +D + Sbjct: 200 IVNPFAARYTSMSLHRYWDDLPGPPWLRDGRLASAARSLAAL----------------HR 243 Query: 202 CGNVFSCVNKFATESINIACKWGY-KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAML 260 ++ ES +A + Y G +A T+S + L I +RVA+ G RLA L Sbjct: 244 PPTSPGTPEQWLDESWRLARERVYPPGDDAVPTISATFHEDALAIAGRRVAEAGYRLADL 303 Query: 261 LNNVFGASQQEDSV 274 L + + + + Sbjct: 304 LQRLLHSGPRREDR 317 >UniRef50_A4BF01 Probable endonuclease n=1 Tax=Reinekea blandensis MED297 RepID=A4BF01_9GAMM Length = 262 Score = 143 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 55/271 (20%), Positives = 96/271 (35%), Gaps = 28/271 (10%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALC--VWPDQVRHWYKYKWTSPLHFI 62 GH M ++ L D A ++ L E + ++ + V D R + K PL Sbjct: 9 GHTMVAQLMVPFLKDGARSELERLYGEDWSREIVSRAAMVQADLNR--PQNKSMIPLQLT 66 Query: 63 DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 F ++ C + + C GA+ L +D+R +A ++L H Sbjct: 67 LFEQGDETFQPDKHCPN-----NRCSVGAVLESREVLLRSSFSDADKR----QATIYLMH 117 Query: 123 FMGDIHQPMHVGFTSDAGGNSIDLRWFR-HKSNLHHVWDREIILTAAKDYYAKDINLLEE 181 + +H P++ G D GG I L+ NL +W+ ++ K ++ Sbjct: 118 YALQMHIPVNSGLKRDDGGRKIYLKDDDLQPVNLAWIWNHDLYRQMDKRWFT-------- 169 Query: 182 DIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNS 241 I D +W E N +A E+ IA Y G S + Sbjct: 170 -YAQELYRDIEKVDPQAWVESMN----PADWALEAHEIAEAEVYPLAAEGRY-SAQLKRA 223 Query: 242 RLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 ++ +++ + R A L N +F D Sbjct: 224 GTAVLEEQLKKAAYRTASLFNEMFPPEDAPD 254 >UniRef50_D0Y4Z6 Phospholipase C/P1 nuclease domain-containing protein n=1 Tax=Caulobacter segnis ATCC 21756 RepID=D0Y4Z6_9CAUL Length = 307 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 57/314 (18%), Positives = 93/314 (29%), Gaps = 77/314 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAA--------------HAVKMLLPEYVNG-DLSALCVWPD 45 W+ GH+M +A + +A VK + E + WPD Sbjct: 23 WNGRGHMMVAAVAWEEMTPKAKARAAALLRKNPNYGDWVKGVPVELADKVAFMNAATWPD 82 Query: 46 QVRHWYKYKWTSP-------------------LHFIDTPDKACNFDYERDCHDQHGVKDM 86 +R ++ P HF + + D + Sbjct: 83 DIRSTHQDDGYDPTVPQADDNVGYSDPYVHAYWHFTNI-------AFSIDATPVPPPPAV 135 Query: 87 CVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF-------TSDA 139 I+ F+ L+ S + L++++H +GD+HQPMH D Sbjct: 136 NAIERIKLFSATLA-----PSGDDDVQSYDLVWVAHLVGDMHQPMHATSRYSQAKKRGDN 190 Query: 140 GGNSIDLRWFRHKS---NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDL 196 GGN + + LH WD + ++D + +D L Sbjct: 191 GGNGVFVCKTGQCDKGQKLHQFWDYGVG-------SSQDYASVIAA----------ADKL 233 Query: 197 ASWRECGNVFSCVNKFATESINIACKWGYKGV----EAGETLSDDYFNSRLPIVMKRVAQ 252 + + ES +A Y + L+ Y +VA Sbjct: 234 PKAPAAQRAIGDPDAWLQESYQLARTKAYVDPIGPAKGPYVLTTRYRVEAGQTCEAQVAL 293 Query: 253 GGIRLAMLLNNVFG 266 G RLA LLN G Sbjct: 294 AGARLADLLNARLG 307 >UniRef50_A8PCL3 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8PCL3_COPC7 Length = 484 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 50/227 (22%), Positives = 83/227 (36%), Gaps = 52/227 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN-----------GDLSALCVWPDQVRH 49 W GH + IAQ L+ + LL V+ LS++ W D + Sbjct: 27 WGAAGHEIVATIAQIHLHPSVLPTICALLDIDVDASDDTSSLRAKCHLSSIATWAD--KE 84 Query: 50 WYKYKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG 105 K +W++ +H++ D P + C F + G + + V A +N T L+ + Sbjct: 85 KMKIRWSAAMHYVGAVDDFPRERCEFPGPKGWA---GTRSINVLDATKNVTRILAEWGGV 141 Query: 106 TSDRRYNMT-------------------------------EALLFLSHFMGDIHQPMHVG 134 + ++ EA FL HF+GD+HQP+H+ Sbjct: 142 DENEFSLVSPVTSYVPPYGSRSQVPGKRVKQLPVPGPLQEEAFKFLVHFVGDMHQPLHLT 201 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEE 181 + GGN I + + +NLH WD I + L + Sbjct: 202 GRA-RGGNGIKIHFGTRTTNLHSAWDTMIPTKLIRTVPRNYTRPLPD 247 >UniRef50_B4CYG7 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYG7_9BACT Length = 346 Score = 138 bits (348), Expect = 2e-31, Method: Composition-based stats. Identities = 54/334 (16%), Positives = 96/334 (28%), Gaps = 76/334 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------------LSALCVWPDQV 47 W GH +A L A + ++ +L +PD + Sbjct: 22 WDTPGHEQIADMAYTRLTPAAKNKIREILQHGDPRYVPANNGDDTLRDAFRRASSFPDVI 81 Query: 48 RHW-------------------------------YKYKWTSPLHFIDTPDKACNFDYERD 76 R +Y H+ DTP Sbjct: 82 RDPGASTVFDDAYVDRMNLTFQPDVSPQQLAKPKSEYIRCKTWHYYDTPIH-------YS 134 Query: 77 CHDQHGVKDMCVAGAIQNFTTQLSHYREGTSD-RRYNMTEALLFLSHFMGDIHQPMHVGF 135 + + A T QL+ + + + L ++ H GD+HQP+H Sbjct: 135 TSHAPKIYESNALVAYNYATAQLAKLKNSAAGADLRDAAWWLCWIEHLTGDLHQPLHCTS 194 Query: 136 T------SDAGGNSIDL--RWFRHKS-----NLHHVWDREIILTAAKDYYAKDINLLEED 182 D GGN++++ W NLH WD I A A+ + Sbjct: 195 NYAHNHRGDIGGNAVNIIAPWDGASGALHAVNLHSYWDEGIDHAAGGHRSARQDLTPADA 254 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA---------GET 233 + TD ++ + V + + +A Y+ A G Sbjct: 255 M--EVTDAWLRNNQLKPGDSDAADLNVAHWIAQGAALADAHVYQETNAAGQTQEIIDGTN 312 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 ++ Y ++ + + + RLA +LN +F Sbjct: 313 VTPQYTTDQIDVCEHQAVRAAYRLAAVLNGIFQP 346 >UniRef50_B3L390 S1/p1nuclease, putative n=8 Tax=Plasmodium RepID=B3L390_PLAKH Length = 417 Score = 136 bits (342), Expect = 7e-31, Method: Composition-based stats. Identities = 55/319 (17%), Positives = 108/319 (33%), Gaps = 61/319 (19%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 S EGH +A L E + +K LL D+ + W V K K +HF Sbjct: 34 SGEGHEAIGMVAMSGLKSEQLYELKKLL---SGKDIVDIGKWGHLV--HEKIKGAESMHF 88 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG---------------- 105 + + C + C D++G+ C+ +I++F +L+ + Sbjct: 89 -NLQNHDCKRAVFK-CEDENGL---CLINSIKHFYVKLAGGKPTDHTTGQSTNQSTGQAT 143 Query: 106 -------------------TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDL 146 + + +AL +L + D+HQP+ + + D GG I + Sbjct: 144 EEHALNSAPPEAKDIPFKYPQNIAFTDADALKYLVSLIADMHQPLRIAYRYDNGGKDIKV 203 Query: 147 ----RWFRHKSNLHHVWDREIILTAAKDYYAKDINLL--------EEDIEGNFTDGIWSD 194 + ++NL + E+I K Y + E + + Sbjct: 204 IHHDDYKTVRTNLFDYMESELINKMIKRYQSAWYGGWTHINRLLDEHKKDEKLFSEKGIN 263 Query: 195 DLASWRECGNVFSCVNKFATE--SINIACKWGYKGVEAGETLSDDYFNSRL--PIVMKRV 250 + W E C + + + K + + + Y ++ + Sbjct: 264 AIDIWGEQIINEFCSEFYLNSYVTNFMVEKKDELHFDTSKEIEITYDLEFHLERLLKVNI 323 Query: 251 AQGGIRLAMLLNNVFGASQ 269 + G R+A+LLN++F + Sbjct: 324 LRAGSRIAILLNSLFANRK 342 >UniRef50_A4YRX0 Putative uncharacterized protein n=2 Tax=Bradyrhizobium RepID=A4YRX0_BRASO Length = 312 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 77/297 (25%), Gaps = 71/297 (23%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL---------------PEYVNGDLSALCVWPD 45 W EGH+ +A L+ LL + WPD Sbjct: 22 WWDEGHMQIAYLAYKKLSPTVRDRADALLKLNPDYASWIAGAPQGQEKLYAFVHAATWPD 81 Query: 46 QVRHWYKY-------------------KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDM 86 ++ Y K T H+ D D + Sbjct: 82 DIKMKPDYYDDQVGDSTAKQLVPYGHLKHTY-WHYKD----------ALFSVDDTPLPRP 130 Query: 87 CVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV---------GFTS 137 A+ ++ + + +L + H +GD+HQP+H Sbjct: 131 DAVDAVSQLKLMIAKLPANSDATEPLRSYSLSWTIHLVGDLHQPLHAIARYSAALPDKGG 190 Query: 138 DAGGNSIDL-RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDL 196 D GGN + NLH WD Y + + D G + Sbjct: 191 DRGGNEEQVIAANGETQNLHAYWDG-----IFGGYSTVFGAMFDADQRGGLST------- 238 Query: 197 ASWRECGNVFSCVNKFATESINIACKWGYKGV----EAGETLSDDYFNSRLPIVMKR 249 + +A ES ++A Y + L+ +Y + K+ Sbjct: 239 VTADPGKAQIVDPATWAQESFDLAKSVAYAAPIRTDKQPVELTREYETNARDTARKQ 295 >UniRef50_Q6LI73 Hypothetical endonuclease n=2 Tax=Photobacterium profundum RepID=Q6LI73_PHOPR Length = 305 Score = 131 bits (328), Expect = 4e-29, Method: Composition-based stats. Identities = 69/311 (22%), Positives = 109/311 (35%), Gaps = 77/311 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDL---------SALCVWP 44 W+ +GHV +IA L+ A V +L +PE + + + L + P Sbjct: 29 WNYQGHVTVAQIAYQNLDTTARTQVDVLAAKAYQSMPEDIQQKMDSFEGASQFAKLAMVP 88 Query: 45 DQVRHWY-------------------KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKD 85 D +R K T H+I+ + C D Sbjct: 89 DLIRKIPAEDIWAQMGETIPASLNQWDEKETGAWHYINQ-----AYPATSQC-------D 136 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS-------- 137 I+ + L + +++F+SH GD HQPMH S Sbjct: 137 FIHVPNIKLVASYLFDDFKQNPQ-----AASMMFMSHVAGDSHQPMHSISQSLSKNVCVT 191 Query: 138 DAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 D G N L + +LHH+WD + L +IN D++ + + Sbjct: 192 DLGANKHTLDV--PQKDLHHLWDSGMGLLG----TEHNINDFATDLQLAYPSTTMTL--- 242 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 + VN + TES +A +GY V S+ Y+N +V +R+ Q G RL Sbjct: 243 ------GKTADVNLWVTESYQLA-DFGYS-VAIDAKPSESYYNKGTELVKQRLTQAGYRL 294 Query: 258 AMLLNNVFGAS 268 A LN+ Sbjct: 295 ADELNSALAKK 305 >UniRef50_C5KYE5 S1/P1nuclease, putative n=6 Tax=Perkinsus marinus ATCC 50983 RepID=C5KYE5_9ALVE Length = 357 Score = 126 bits (317), Expect = 7e-28, Method: Composition-based stats. Identities = 48/275 (17%), Positives = 96/275 (34%), Gaps = 19/275 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K+ H +L+ + LL +S + + Y T H Sbjct: 21 WDKDIHERIGEAVSRVLSYRDIEDLNKLLKGQSIPYMSR---YAHDKLQYANYDRTVENH 77 Query: 61 FIDTPDK-ACNFD-YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 + C FD D + + + T + + + Sbjct: 78 YETQLRDWQCTFDVNNPDKYAESQGLYRSIHDIFGRVTHASKSGEDHGIAKDMTEPVQIS 137 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRW-FRHKSNLHHVWDREIILTAAKDYYAKDIN 177 +L + D+HQP+H GF +D G I +++ +NL+ W+R+I +AA + Sbjct: 138 WLLGLVQDLHQPLHTGFGADDHGRRISVQYHDDPSTNLYDFWERDIS-SAANLETQLVLK 196 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK--------GVE 229 +++ DG + L + + ++ ES+ ++C Y V Sbjct: 197 AYNAELDKLVQDGGYGIQLVNKIYSKG----IAEWIAESMEMSCSDIYSVIAGGRGREVP 252 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + DD + + K+V + R A++L+ + Sbjct: 253 RMYQIDDDVYAKWRDLATKQVVKAAARSAVVLHGI 287 >UniRef50_A2DKF6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DKF6_TRIVA Length = 323 Score = 126 bits (316), Expect = 8e-28, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 75/235 (31%), Gaps = 25/235 (10%) Query: 38 SALCVWPDQV-RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFT 96 + + W V R + +K + HF P F D + I N Sbjct: 53 AKVGAWMSYVERPPFNFKGFNHWHFTRQPYVPKEFGQIPSQIDNDNL--------ISNVM 104 Query: 97 TQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF-------TSDAGGNSIDLRWF 149 +G++ R + + ++ L + DIH P+HV D G ++ + Sbjct: 105 EMSDDIYKGSTKRSWPLAFSMKILFAGVCDIHTPLHVSEYFSSEFPNGDQNGRLYEVVYK 164 Query: 150 RHKSNLHHVWDREII--LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS 207 K+NL V++ Y N +++ + D + S E + Sbjct: 165 GQKTNLFDVYETGCGLDENLQVTYDESFWNDVKDLADNLLEDFKFVSKKFSRTEITAQNA 224 Query: 208 CVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLN 262 ++ + Y V+ G L+ + N + RL +LN Sbjct: 225 TTYQYTVD-------KIYSLVKPGGELTTEMINECQSHTRDMMRLAAERLVYILN 272 >UniRef50_Q4UCH4 Bifunctional nuclease, putative n=2 Tax=Theileria RepID=Q4UCH4_THEAN Length = 391 Score = 123 bits (308), Expect = 8e-27, Method: Composition-based stats. Identities = 49/311 (15%), Positives = 98/311 (31%), Gaps = 61/311 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W++ A + +KMLL DL W D+V + + PLH Sbjct: 22 WNELCREAIESTAMSAITYMRLRRLKMLL---KGEDLVDYTWWADEV--LKRIPESLPLH 76 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG--------------- 105 + PDK N ++ C + ++C+ I+ F L + Sbjct: 77 YQYQPDKKSN-NFNFTCSN-----NLCLMAGIKYFFAVLMNSGYPVGTSNTQKFDIPPLG 130 Query: 106 -TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREII 164 +++ ++ + +L + D+H P+H+ FT +I + VW+ I Sbjct: 131 YPRKIKFSPSDCIKYLVVLLSDLHHPLHLDFTQPDSIATIPVDLSDFP-----VWEN-IS 184 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE---------------CGNVFSCV 209 + + L+ + + + SW C Sbjct: 185 VQTLNTKRPLYGDFLKHIYMPKYIEVNENAWYGSWTHVSTLGLRYSTELDLFNNKTVECF 244 Query: 210 NKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMK-----------RVAQGGIRLA 258 +A E+ ++ E LSD + + ++ G R+A Sbjct: 245 EVWAAETASLNNTIF--DKEDFVYLSDTVRTKAIRFTERLDSKLGFLMRLQIVMAGARVA 302 Query: 259 MLLNNVFGASQ 269 ++LN + + Sbjct: 303 IVLNYILSHRE 313 >UniRef50_A4KXI8 Putative S1/P1 nuclease n=2 Tax=Ascovirus RepID=A4KXI8_HVAVE Length = 277 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 49/274 (17%), Positives = 86/274 (31%), Gaps = 46/274 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W++ GH + +A+ + + + + L + PD + + LH Sbjct: 33 WAQNGHRVCAAVARAHIAP---ALLNHIESNLLKATLDEVSNDPDNIDVERR-----HLH 84 Query: 61 ---FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 ++DTP D C+ A Sbjct: 85 WVNYVDTPSDGAQNVSSYLTSDCQIDNRECIVSA-------------------------- 118 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAKDYYAKDI 176 H++ D+HQP+HV + A + + WF + LH VWD E+ Y + Sbjct: 119 ---VHYICDLHQPLHVIPATYANQSFARVLWFHGFNYTLHQVWD-ELPEQLHLSYESHAK 174 Query: 177 NLLEEDIEGNFTDGIWSD-DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETL- 234 L+ I + + W + + + E + E G + Sbjct: 175 WLVRHHISPEMYVAMVKQTTVDKWIDSRVAAYEIARKLNE--KLVKCHTENNSERGRYIC 232 Query: 235 SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + + S P V +A GG+RLA L F Sbjct: 233 NLKFVFSARPTVDSSLASGGVRLAGYLKQSFKNK 266 >UniRef50_A2DRT9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DRT9_TRIVA Length = 300 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 45/256 (17%), Positives = 75/256 (29%), Gaps = 28/256 (10%) Query: 22 AHAVKMLLPE--YVNGDLSALCVWPDQV-RHWYKYKWTSPLHFIDTPDKACNFDYERDCH 78 + + +S W R + + HF P N E Sbjct: 18 QKKLNSVFQNAGDDFTRVSQAAAWLYYAERPPFNIPSFNHWHFYSQPINPNNLSIE-THI 76 Query: 79 DQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF--- 135 D +KD NF + R G R + + M DI+ P+HV Sbjct: 77 DVDNLKD--------NFDSIRKSVRGGKVSRTWPFAFLMKLYLTGMCDIYSPLHVSELFN 128 Query: 136 ----TSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGI 191 D G +++ + +L+ +W+ Y+ ++ ED Sbjct: 129 EQFPNGDRNGRDFYVKYNGNFISLYDLWETGCG------YFDSQVDFTSEDDWKKIDKLT 182 Query: 192 WSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVA 251 LA E V + + N Y G+ G +S +Y + V Sbjct: 183 NELSLAFTSEDWPSTLSVTQVIEGNYNYTRDTVYNGLVNGSEVSKEYITTCQNYAQDIVI 242 Query: 252 QGGIRLA---MLLNNV 264 G R+A LN + Sbjct: 243 LAGKRIATDLANLNII 258 >UniRef50_C5BI21 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BI21_TERTT Length = 343 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 51/311 (16%), Positives = 86/311 (27%), Gaps = 75/311 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAV--------------KMLLPEY--VNGDLSALCVWP 44 WS GH + A L+ A LP+ L WP Sbjct: 64 WSYSGHAVILGSALSQLDPTARKEAFTQIEYLYNRASGNSRFLPKSCLSQKSLCFFASWP 123 Query: 45 DQVRHWYKYKWT-------------------SPLHFIDTPDKACNFDYERDCHDQHGVKD 85 D+ R + + HF + + + C + + Sbjct: 124 DRERDKTLGELYRMVGAEVPAVLKGLTSSEIASWHFTNQVFNLNDRKFSAACELRDRGQL 183 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV------GFTSDA 139 V ++ L +H + D HQP+H G D Sbjct: 184 YDVLPQLE--------SALIRELSIAQRAVTLALWTHLLADAHQPLHNLTGSLEGCAHDF 235 Query: 140 GGNSIDL--RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 GGN + + R + + +LH +WD L D + + + D Sbjct: 236 GGNGLCVVKRRNKCERSLHQLWDSGAGLFDKPDMIS--PLGVADARSPTAVDY------- 286 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 ES+ +A + +E S+ Y + + R Q R+ Sbjct: 287 ------------RVIQNESLALASEVYAPNLELS---SNAYITTVRRLSRIRAQQAAQRI 331 Query: 258 AMLLNNVFGAS 268 A+LL + G Sbjct: 332 ALLLKELTGNK 342 >UniRef50_Q8XRE8 Putative signal peptide protein n=1 Tax=Ralstonia solanacearum RepID=Q8XRE8_RALSO Length = 337 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 60/320 (18%), Positives = 106/320 (33%), Gaps = 66/320 (20%) Query: 2 SKEGHVMTCRIAQGLLN-DEAAHAVKMLLPEYVNGDLSALCVWPDQVRH----------- 49 +GH +A L+ A V+ +L L VW D + Sbjct: 27 GPDGHQTVGELADSLIAGTNAESQVQNILGM----TLEQASVWADCAKGVTRTQSGKFVY 82 Query: 50 --WYKYKWTSP---------------------------------LHFIDTPDKACNFDYE 74 Y P H+ D + + Sbjct: 83 QGAGHYPECKPFETTTGKSAMVAFVKRNWSGCHPAADEEVCHKQYHYTDVALQRGQYQQ- 141 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 G D + AI+ +L + + EALL LSH++GDIHQP+HV Sbjct: 142 ----GLVGTSDHDIVAAIRAAIIKLQGGTTPSPIDFASKREALLLLSHYVGDIHQPLHVS 197 Query: 135 F-TSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWS 193 DA G+ +D + I+ K ++ D + G+ + Sbjct: 198 AVYLDAQGHVVDPDQGTFDPQTKTIGGNSILDAGKKLHFEWDQVPAALKPDQLGVSGV-A 256 Query: 194 DDLASWRECGNVFSCVNKFATESINIACK----WGYKGVEAGE----TLSDDYFNSRLPI 245 + A G++ S ++AT++++ A + +A + TL +Y + R + Sbjct: 257 EARAIPLTSGDIISWPAQWATDTMHSAAPAFSGTAFSAEDASKHWQVTLPANYVSERETV 316 Query: 246 VMKRVAQGGIRLAMLLNNVF 265 ++ + G RLA LL ++ Sbjct: 317 QRAQLIKAGARLAQLLQAIW 336 >UniRef50_A2FG69 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FG69_TRIVA Length = 339 Score = 114 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 82/249 (32%), Gaps = 27/249 (10%) Query: 32 YVNGDLSALCVWPDQV-RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAG 90 + +LS L W + V R + K + HF P + +Y + + + D+ Sbjct: 31 DLAKNLSKLSTWMNYVERPPFNLKCFNHWHFSREPFTLESRNYIPQYNGKDNLVDVLKES 90 Query: 91 AIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNS 143 A + F + ++ L L + DIH MH D G Sbjct: 91 ATKIF--------FLIPSSPFILSTHLKVLFAGVPDIHATMHTQEFFSNDFPDGDRNGQV 142 Query: 144 IDLRWFRHKSNLHHVWDREI-ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWREC 202 + + ++L V + + + +++D ++ + +S Sbjct: 143 FYVMYNGTNTSLFDVLESGCGLDSQKHATFSRDFWEDVRKLKVELFKSWETPTFSS---- 198 Query: 203 GNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLN 262 S V E+ Y + G+T+SD++ +++ + A +L Sbjct: 199 --TDSVVEAAKIENREYTKATIYSKLRPGDTISDEFITECQTRTKQQILKS----AEILY 252 Query: 263 NVFGASQQE 271 ++ +E Sbjct: 253 HITENKMKE 261 >UniRef50_D2LJW8 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LJW8_RHOVA Length = 200 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 48/174 (27%), Positives = 67/174 (38%), Gaps = 29/174 (16%) Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 L L+HFMGDIHQPMHV F D GGN I +S LH WD +I Sbjct: 25 LKTLTHFMGDIHQPMHVSFEDDKGGNLISASGLCGRS-LHAAWDSCLIEKTLG------- 76 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINI--------------ACK 222 + I + I S D + W V +A E+ I C+ Sbjct: 77 -FDSDTIATSLEAEITSGDRSRWLAGDIGPKAVASWANETFTITTRPEVGYCERASDGCR 135 Query: 223 W------GYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 + + G + + + Y + P V R+ G+RL +LN+V Q Sbjct: 136 YSAYQPEYHGGAQKVVVVDEHYLSVNAPFVRDRIKAAGVRLGAVLNSVLMPDQS 189 >UniRef50_C9YFD1 Putative uncharacterized protein n=2 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YFD1_9BURK Length = 117 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 43/112 (38%), Positives = 54/112 (48%), Gaps = 10/112 (8%) Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 ++ P CN+ ERDC D CV AI L T AL ++ Sbjct: 1 MNFPRGDCNYQQERDCPD-----GKCVIAAIDRQIEVLR-----TPGDDEKRLTALKYVV 50 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYA 173 HF+GDIHQP+H GF D GGNS L+ F SNLH VWD +I + +D Sbjct: 51 HFIGDIHQPLHAGFGDDRGGNSYQLQAFMRGSNLHAVWDTGLIKSLKQDNEQ 102 >UniRef50_B0DTT4 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0DTT4_LACBS Length = 242 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 48/245 (19%), Positives = 79/245 (32%), Gaps = 67/245 (27%) Query: 92 IQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRH 151 ++N T L + EAL FL HF GD HQPMH+ + GGN + + + Sbjct: 1 MKNVTALLQGW-VKGETSDDAANEALKFLIHFFGDAHQPMHMT-GRERGGNQVKVAFGGK 58 Query: 152 KSNLHHVWDREIILTAAKDYYAK-----DINLLEEDIEGNFTD------------GIWSD 194 ++ WD +I +E+ + G D W+D Sbjct: 59 QTT----WDDSLITKVISTIPQNYTLPLPYPEIEQALRGASYDPYIRRIIWEGILQKWAD 114 Query: 195 DLASWRECGNVFS---------------------------CVNKFATESINIACKWG--- 224 ++ W C + C +A S ++ C Sbjct: 115 EIPGWLSCPDAVKRTFVDSQIALGLEGTTGIEILPDNDVLCPYHWARPSHDLLCDGVWLK 174 Query: 225 ------YKGVEAGETLS------DDY--FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 Y+ + Y + +V K++A GG+RLA L N +F Q Sbjct: 175 EVDEPPYRRTDDNPHPPLLELETPAYSGMIGQRWLVEKQLALGGLRLAGLFNYIFADQGQ 234 Query: 271 EDSVV 275 + + Sbjct: 235 RGAFI 239 >UniRef50_Q0E526 29.6 kDa S1/P1 nuclease n=1 Tax=Spodoptera frugiperda ascovirus 1a RepID=Q0E526_SFAVA Length = 261 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 92/275 (33%), Gaps = 49/275 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ GH + +A+ L+ V+ + L + D+ + + +H Sbjct: 24 WALTGHRVCANVARRLIPSPILKHVET--EVLDHETLDGVSNVADE-----TPRSLAAMH 76 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ N R L + + + + Sbjct: 77 YVNY-----NVTPTRS------------------ARKVLEYTENNMTSTYRWDAAFITNV 113 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRW-FRHKSNLHHVWDR--EIILTAAKDYYAKDIN 177 H + D+HQP+HV +D + +W + LH +WD ++ L + Y +N Sbjct: 114 VHLLCDLHQPLHVVPYADVPSTFTETQWVNGQNTTLHTIWDTLPDLRLLSHHIYAEWLVN 173 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESIN-IACKWGYKGVEAGETL-- 234 L+ + + D W ++A ++ + AG L Sbjct: 174 KLKANTYALLFEQ---DRPHKWL-------DSRRYAYDAAKRLNDNLARCHTNAGSKLLI 223 Query: 235 ---SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 + + +S +V + + GG+RLA + +++ Sbjct: 224 NSCNYRFVDSARALVDESLLYGGVRLAAYITSLYS 258 >UniRef50_A5FFX0 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FFX0_FLAJ1 Length = 332 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 78/270 (28%), Gaps = 35/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + A L + + ++ PD ++ YK P H Sbjct: 25 WGNVGHERINKAAVMALPKQLQ-----IFFYNHIDFITQEASVPDIRKYALNYKEEGPRH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKD-------MCVAGAIQNFTTQLSHYREGTSDRRYNM 113 + D + Y + + D + I++ +L+ + + Sbjct: 80 YFDMENFGAADTYPQTLEEAKQKYDAKFLSDNGILPWYIEDMMAKLTKAFKEKNRAEILF 139 Query: 114 TEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYA 173 A L H++GD H P+H D + +H +W+ + K+Y Sbjct: 140 LAAD--LGHYVGDAHMPLHTSANHDG--------QLTDQKGIHSLWESRLPELFVKNY-- 187 Query: 174 KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESIN------IACKWGYKG 227 +N+ E + IW + + T + A K Sbjct: 188 -KLNVPEAQYYTDVHKAIWDMINDTHSFAQPLLDIDKSLRTATPQDKVFKLDAEGKVLKS 246 Query: 228 VEAGETLSDDYFNSRLP----IVMKRVAQG 253 SD+Y +V ++ + Sbjct: 247 KYNTAVFSDEYAKKLHEQLNGMVETQMRKA 276 >UniRef50_A2EIL3 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EIL3_TRIVA Length = 310 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 71/225 (31%), Gaps = 28/225 (12%) Query: 40 LCVWPDQVRHWYKY-KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQ 98 W +V + K + F+ TP + +Y R+ D + + G I N Sbjct: 52 AGGWLARVEYAPTNTKCFNHWRFVQTPINGSD-NYHRNKDDLTVQLNGLLGGLINNTI-- 108 Query: 99 LSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF--------TSDAGGNSIDLRWFR 150 ++ A S + P+H D G +++ Sbjct: 109 ---------TDKWAYNFAFKVASALFFEAFSPLHTSELFDNDRFKDGDDSGKKYMIKYQG 159 Query: 151 HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVN 210 ++ +L WD + E +F + L R NV Sbjct: 160 NEMSLLDFWDSGCGRYTRQT-------PYTETQWTDFYKNVDYMLLKFPRPSCNVNITWQ 212 Query: 211 KFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGI 255 +++N+ Y+G++ + LS +Y + + I +R+A Sbjct: 213 MAVNDTLNVTNTVVYQGIKYSQELSKEYIDKCIEITDERLACAAY 257 >UniRef50_A2F5A5 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2F5A5_TRIVA Length = 343 Score = 99.9 bits (247), Expect = 8e-20, Method: Composition-based stats. Identities = 28/257 (10%), Positives = 71/257 (27%), Gaps = 35/257 (13%) Query: 15 GLLNDEAAHAVKMLLPEYVNGDLSA---LCVW-PDQVRHWYKYKWTSPLHFIDTPDKACN 70 L ++ ++ ++ + W + + A Sbjct: 26 RKLGNKGISKLQKVIDM-TGEKMERPSLAGSWLASLLHAPSNTNCFDHWRYSQKNINAI- 83 Query: 71 FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQP 130 E C ++ ++ C + +GT + + D P Sbjct: 84 PHPEHHCINKDDLE--CTLDKLN------KTIMKGTLNGPWPYNFGFKVFLTLYMDSFDP 135 Query: 131 MHVGFT--------SDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEED 182 +HV D G ++++ +LH W+ K + + E+ Sbjct: 136 VHVTEYFDNDTFIDGDDNGKKFNIKFKGKNMSLHDFWETGCGRYVLKTPFNGNGWKEIEE 195 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFA---TESINIACKWGY--KGVEAGETLSDD 237 + + C + +A +S N++ + Y ++ L ++ Sbjct: 196 TTTRLYKRLNDSKF--------ITPCPSDYAGAINQSFNLSKEIVYNLSMIQKDNDLPEE 247 Query: 238 YFNSRLPIVMKRVAQGG 254 Y + + +R+ Q Sbjct: 248 YIKTCYELTDQRILQAA 264 >UniRef50_Q11TZ7 Putative uncharacterized protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11TZ7_CYTH3 Length = 318 Score = 98.7 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 41/267 (15%), Positives = 84/267 (31%), Gaps = 35/267 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H ++A L K + ++ V PD+ R+ + +P H Sbjct: 24 WGFFAHKEINKMAVFTLPHPLMSFYKRHIDF-----ITEQAVNPDKRRYIVSGE--APKH 76 Query: 61 FIDTPDKACNFDYER-DCHDQHGVKDMCVAGA-------IQNFTTQLSHYREGTSDRRYN 112 ++D + + R D + + A + T +L+ + + Sbjct: 77 YMDIEYYSDSILIVRPDWNTAQAIYPEDSLHAHGILPWNLVRLTYRLTDAFKHRDAKSIL 136 Query: 113 MTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYY 172 A L H++GD+H P+H + + +H +W+ + + DY Sbjct: 137 KLSAD--LGHYVGDLHVPLHTTKNYNG--------QLTGQQGIHGLWESRLPELFSADY- 185 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA-- 230 + L + + +W S R C V + + + Y+ Sbjct: 186 --NYYLGTANYVTDIKKVVWESMTES-RACVAQVLAVELKLQQQMKADKIFSYEDRNGQT 242 Query: 231 ----GETLSDDYFNSRLPIVMKRVAQG 253 S+ Y + +V KR+ Sbjct: 243 VRVYSYDFSNAYHKALEDMVQKRMRAA 269 >UniRef50_B9EZB3 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=B9EZB3_ORYSJ Length = 170 Score = 96.0 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 48/119 (40%), Positives = 69/119 (57%), Gaps = 8/119 (6%) Query: 73 YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH 132 RDCH+ + MCV GAI N+T QL Y G S YN+TE+L+FL+HF+GD+HQP+H Sbjct: 28 PRRDCHNSRHQQGMCVVGAINNYTDQL--YSYGDSKSSYNLTESLMFLAHFVGDVHQPLH 85 Query: 133 VGFTSDAGGNSIDLRWFRHKSNLH-----HVWDREIILTAAKDYYAKDINLLEEDIEGN 186 VGF D GGN+I + + +S +H D E +T DY+ ++E+ + Sbjct: 86 VGFEEDEGGNTIKVHCYAIES-IHLSCNYAYKDVEQDITLGDDYFYSRYPIVEKRLAQA 143 Score = 92.2 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 36/76 (47%), Positives = 51/76 (67%) Query: 200 RECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAM 259 + G V+ +A ESI+++C + YK VE TL DDYF SR PIV KR+AQ GIRLA+ Sbjct: 90 EDEGGNTIKVHCYAIESIHLSCNYAYKDVEQDITLGDDYFYSRYPIVEKRLAQAGIRLAL 149 Query: 260 LLNNVFGASQQEDSVV 275 +LN +FG + + +V+ Sbjct: 150 ILNRIFGEDKPDGNVI 165 >UniRef50_C0A652 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A652_9BACT Length = 348 Score = 94.9 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 55/306 (17%), Positives = 93/306 (30%), Gaps = 56/306 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W EGH + ++A L E V+ ++ L PD+ R+ + Sbjct: 23 WDYEGHRIVNQLALAALPPEFPAFVRE---AANAERIAFLSGEPDRWRNVEDGPLRHAQT 79 Query: 58 PLHFIDTPD---------------------------KACNFDYERDCHDQHGVKD--MCV 88 P HF D + + D+ +D + Sbjct: 80 PDHFFDIEYLVEGGLPLAKLSEFRQVFAVQLAEARAARPSAYPKSGSKDKDRTRDLVGFL 139 Query: 89 AGAIQNFTTQLSHY------------REGTSDRRYNMTEALLFLSHFMGDIHQPMHVG-- 134 AI ++ E ++ R N+ + L H++GD QP+H Sbjct: 140 PWAITENYGRVKSAFTYLKAYEALGTPEEVANARANVVYQMGLLGHYVGDGAQPLHTTKH 199 Query: 135 FTSDAG--GNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 F AG G++ + R F + LH D I A + D +G Sbjct: 200 FNGWAGEAGSAANPRGFTTRRTLHSWIDGGYIAAARITVADLLPRAFKADPLTLSGEGRG 259 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 +D VF + Y+ +AGE + + +R+ + Sbjct: 260 GNDARR----DPVFEAALAYLVRQHEQVIPL-YELEKAGELNAPPATRKGRAFIEQRLQE 314 Query: 253 GGIRLA 258 GG LA Sbjct: 315 GGRMLA 320 >UniRef50_C7PNU1 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNU1_CHIPD Length = 313 Score = 94.1 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 38/272 (13%), Positives = 70/272 (25%), Gaps = 43/272 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L E + + LS D+ R+ P H Sbjct: 20 WGFFAHQRINRLAVFSLPPEML-----VFYKPNIEYLSTHATDADKRRYI--IPEEGPRH 72 Query: 61 FIDTPDKACN-------------FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTS 107 +ID Y D +G+ + + T Sbjct: 73 YIDIDHYGQAPFAALPRSWEEALLKYTADTLQTYGILPWYLTQMLSRLTQAFKDKDPDRI 132 Query: 108 DRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA 167 R + H+ GD H P+H + + +H +W+ I Sbjct: 133 MRLSAD------IGHYAGDAHVPLHACSNHNG--------QRTGQQGIHGLWESRIPELM 178 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKG 227 A + + + W L S V K ++ K+ Y+ Sbjct: 179 ADKTFQ--YLSAKAYYIKDINAYTWQIVLESAAAADTVLQQ-EKLVSDRFPSGRKFAYEK 235 Query: 228 VEA------GETLSDDYFNSRLPIVMKRVAQG 253 + Y + ++ +R++ Sbjct: 236 RNGKLIRNYATAYAKAYHGALGDMIERRMSAA 267 >UniRef50_D2QFB3 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QFB3_9SPHI Length = 354 Score = 93.3 bits (230), Expect = 7e-18, Method: Composition-based stats. Identities = 50/276 (18%), Positives = 85/276 (30%), Gaps = 56/276 (20%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-LPEYVNGDLSALCVWPDQVRHWYKYKWTSPL 59 W H R+A L V M+ + LS V PD+ R+ + +P Sbjct: 52 WGFFAHQQINRLAVFTLP------VDMIPFFKKHINFLSDNAVNPDKRRYAVVGE--APR 103 Query: 60 HFIDTPDKACNF--DYERDCHDQHGVKDMC-------VAGAIQNFTTQLSHYREGTSDRR 110 HFID R + V IQ QL+ + + RR Sbjct: 104 HFIDLDAYPDTTSATLPRYYKEATDRYGEDSLALHGLVPWQIQLTKYQLTEAFKQRNVRR 163 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSN---LHHVWDREIILTA 167 A L H++ D + P+H + +N +H W+ + Sbjct: 164 ILRVAAD--LGHYIADANVPLHTTRN-----------YNGQLTNQQGIHGFWESRLPELF 210 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC------VNKFATESINIAC 221 + +Y D + I+S A+WR N + + + TE + Sbjct: 211 SANY----------DFLTGQAEYIYSPQKAAWRAVFNANAALDSVLHIERQLTEQVGETR 260 Query: 222 KWGYKGVEA------GETLSDDYFNSRLPIVMKRVA 251 K+G++ S Y V +++ Sbjct: 261 KYGFEERNGITAKVYSADFSQQYHERLHGQVERQMR 296 >UniRef50_D0NJT6 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NJT6_PHYIN Length = 269 Score = 91.8 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 83/293 (28%), Gaps = 83/293 (28%) Query: 14 QGLLNDEAAHAVKMLLPEY-----VNGDLSALCVWPDQVRHW-----------YKYKWTS 57 + +L++ ++ +L + G+++ VW D V+ S Sbjct: 11 RNVLDEADVTTIESILSRWDEDFPNTGEITTTAVWMDIVKCTAESSTCLTPASPSITSIS 70 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+ P +E D A + + Sbjct: 71 DWHYINLPLHINGDKWEDKDTDLTLRSTQSRVSARPSLS--------------------- 109 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYA---- 173 D GGNS SN H VWD L + + Sbjct: 110 --------------------DGGGNSETFTSPCVFSNPHAVWDAAGGLYSLNKWSLNIDS 149 Query: 174 ------------KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC 221 + ++++I + + ++L + V + A E+ N A Sbjct: 150 FRPTLENASELIALLPSVQDNITFSQYVNVTYNELNTALVTNQVL---REVALETYNFAN 206 Query: 222 KWGYKGVEAGET-------LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y ++ T S Y I KR+A G RLA++L + Sbjct: 207 TIVYSNLDLNATSSGTYPCPSASYLAMVGEISQKRIAIAGSRLAVVLKHFAAQ 259 >UniRef50_Q21JG1 Putative uncharacterized protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21JG1_SACD2 Length = 321 Score = 91.4 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 74/247 (29%), Gaps = 60/247 (24%) Query: 43 WPDQVRH-------------------WYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGV 83 WPD VR YK TS H+ + + N C+ ++ Sbjct: 100 WPDLVRSQKLSVLFKAVGATTPADLAAYKNYTTSTWHYHNVFYDSNN-KLLLSCNKKNRG 158 Query: 84 KDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT------S 137 K A++ + A F H +GD HQP+H Sbjct: 159 KLYSALSALE--------SSLQSDLSISQQAIAFAFYVHLVGDAHQPLHNVSRANKHCEH 210 Query: 138 DAGGNSIDLRWFRHKSNL--HHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDD 195 D GGN+ L+ K +L H WD L A + DI ++ Sbjct: 211 DRGGNTYCLKKKGAKCSLNAHQFWD----LAAFNPVESIDIQPVKHK------------- 253 Query: 196 LASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGI 255 CG + + E+ + K + + Y ++ I R+ Sbjct: 254 ----AACGTSPAWGSYLLAEAKELVVNLYPKNDDFN---NAKYRSNAKSIAKSRIEMAAS 306 Query: 256 RLAMLLN 262 R A ++ Sbjct: 307 RTAQIMK 313 >UniRef50_C7J139 Os04g0636400 protein n=2 Tax=Oryza sativa RepID=C7J139_ORYSJ Length = 141 Score = 87.1 bits (214), Expect = 6e-16, Method: Composition-based stats. Identities = 51/64 (79%), Positives = 56/64 (87%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WSKEGH++TCRIAQ LL AAHAV+ LL E +GDLSALCVWPDQVRHWYKY+WTSPLH Sbjct: 30 WSKEGHMLTCRIAQDLLEPAAAHAVRNLLTEEADGDLSALCVWPDQVRHWYKYRWTSPLH 89 Query: 61 FIDT 64 FIDT Sbjct: 90 FIDT 93 >UniRef50_A2G9R8 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G9R8_TRIVA Length = 181 Score = 86.0 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 46/137 (33%), Gaps = 8/137 (5%) Query: 135 FTSDAGGNS--IDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 D GGN I+ + +++H WD ++ A T I Sbjct: 2 PNGDRGGNLYHINCPYGAACNHIHFFWDAIVLNYMLMKPTASLYRNEFIKNVTRLTKEIT 61 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 L + ++ ES+ A K+GY + + Y+ RVA Sbjct: 62 ESSLNL-----DKTVDPMAWSMESLEYAKKYGYS-TPINDAPNASYYEIVRKYGSIRVAM 115 Query: 253 GGIRLAMLLNNVFGASQ 269 G RL LL+++ + Sbjct: 116 AGHRLGYLLDSLLDKAP 132 >UniRef50_A2FZN6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FZN6_TRIVA Length = 232 Score = 84.1 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 54/166 (32%), Gaps = 16/166 (9%) Query: 100 SHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT-------SDAGGNSIDLRWFRHK 152 T + + A + P ++ D G ++ + K Sbjct: 7 KSLFPQTIQGAWPINVAWKSYFGLFLEAFNPTNIANYYSNNHTEGDNNGKDFEIFYKGRK 66 Query: 153 SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKF 212 +N+H W K + ++ + ++ D+ + +N Sbjct: 67 TNIHDFWGSLCGRLTGKYPFNSNVWSDIDK---------YAHDITLVYRNVTHYQNINDI 117 Query: 213 ATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLA 258 T+S NIA Y GV GE LSD+Y + K++A LA Sbjct: 118 LTQSYNIAKDVVYVGVNEGEILSDEYVEKCYDVTSKQLASAAFSLA 163 >UniRef50_C6VWZ8 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VWZ8_DYAFD Length = 341 Score = 80.6 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 45/266 (16%), Positives = 78/266 (29%), Gaps = 36/266 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L E + + L+ V PD+ R+ + + H Sbjct: 42 WGFWAHKRINRLAVFRLPMEMQ-----VFYKKHIDYLTENAVNPDKRRYAVVGE--AERH 94 Query: 61 FIDTPDKACNFDYERDCH---------DQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID + H + K V +Q +QL+ + R Sbjct: 95 FIDLDVYGDSALAVLPKHWQAAVNKVGEDSLRKHGIVPWHVQIAASQLTSAFREKNAARI 154 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + +H W+ + A+ Y Sbjct: 155 LRMSAD--LGHYIADAHVPLHTTRNYNG--------QLTGQDGIHGFWESRLPEIYAEQY 204 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA- 230 + IW AS + K TE+ K+ ++ Sbjct: 205 DMWLGP---AAYREDIAHDIWQAVEASH-SGSDSVLAFEKQLTEAFKPDKKYAFELRNNI 260 Query: 231 -----GETLSDDYFNSRLPIVMKRVA 251 S+ Y + V +R+ Sbjct: 261 LTRMHSRDFSEKYHRALAGQVERRMR 286 >UniRef50_Q4Q7F8 Class i nuclease-like protein n=4 Tax=Leishmania major RepID=Q4Q7F8_LEIMA Length = 180 Score = 79.8 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 32/79 (40%) Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 + ++ E V ES A Y GV G TLSD Y + R+ GG Sbjct: 88 ETYTFPEALRTLVDVVAIHEESHMFAVNTSYPGVTPGATLSDAYLARCKRVAEARLTLGG 147 Query: 255 IRLAMLLNNVFGASQQEDS 273 RL LLN + + +++ Sbjct: 148 YRLGYLLNELLPSIPVDEA 166 >UniRef50_A7ARD9 S1/P1 nuclease, putative n=1 Tax=Babesia bovis RepID=A7ARD9_BABBO Length = 393 Score = 79.8 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 90/304 (29%), Gaps = 52/304 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W A + + +K++L + DL W D+VR + ++ LH Sbjct: 23 WDDITREAIESTAMSAITFDRLRRMKVILRGH---DLVDYTWWSDEVR--KRIPESATLH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG--------------- 105 D+ C ++ C + +C+ + F +L Sbjct: 78 RQLQNDETC-LTFDSTCPN-----GLCLIQGSKFFFAKLMSSGYSIVSQPIKFELPLFRY 131 Query: 106 TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIIL 165 D + ++ L +L + D+H P +V + W+ + Sbjct: 132 PKDVTFTPSDCLKYLVVLLSDMHYPFNVDLAEPHSLAHRKVDLSGFPM-----WE-ALSK 185 Query: 166 TAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE---------------CGNVFSCVN 210 + + + ++ SW N + Sbjct: 186 EKLGHAKPSFEDFIMKVYMPHYIQTNEESWYGSWTNVEVLGSRYKVEQETFNRNTWDNFE 245 Query: 211 KFATESINIACK-----WGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 +A+E+ N+ C + + LSD + + ++ G R+A++LN + Sbjct: 246 IWASETANLHCNGLVTKSDFSKDKQTIKLSDALLDRIGNTIKFQIVLAGARVAVVLNYIL 305 Query: 266 GASQ 269 + Sbjct: 306 SHRE 309 >UniRef50_D1ZW87 Whole genome shotgun sequence assembly, contig_886 (Fragment) n=2 Tax=cellular organisms RepID=D1ZW87_SORMA Length = 159 Score = 79.1 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 40/123 (32%), Gaps = 15/123 (12%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRH-WYK 52 + GH IA+ + E A+ +L P + VW D V+ + Sbjct: 42 WEYGHQSVATIARLNVRSETRAAIDRILRHQALLETPTCPARTIEEASVWADCVKPLGER 101 Query: 53 YKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 + + H+ + FD + C D CV+ I+ L + ++ Sbjct: 102 FSYAYSWHYQNVDVCRP-FDLKAACKD-----GNCVSAQIERDVKLLKDPKVPMREKVLA 155 Query: 113 MTE 115 + Sbjct: 156 LAF 158 >UniRef50_B6KMV3 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KMV3_TOXGO Length = 632 Score = 78.3 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 36/127 (28%), Gaps = 18/127 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQVRHWYKYKW 55 W EGH++ +A+ L E ++ +L E+ L VW D V ++ Sbjct: 27 WHDEGHMLVAAVAKEYLKPETVEKIEYILSEWSPQYPTTSTLETAAVWLDHVACSMPGRY 86 Query: 56 ------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 P H+ N E Q + + L + Sbjct: 87 CRGFLGLDDIRIFKPWHYTSNVFNPQNLTLEPLYEVQPYPQTGSS-WILLKSYESLRNCT 145 Query: 104 EGTSDRR 110 + + Sbjct: 146 GDSRASQ 152 Score = 61.7 bits (148), Expect = 2e-08, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 53/172 (30%), Gaps = 51/172 (29%) Query: 123 FMGDIHQPMHVGF-------TSDAGGNSIDLRWFR------------------------- 150 GD HQP+H D GGN+I + R Sbjct: 276 IYGDAHQPLHATETYSKAFPNGDFGGNNISIVLPRSEKMLENYPSTPEEFPEVGAEAHRG 335 Query: 151 ----HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF 206 H+ +LH WD + +Y D++ L+++ + ++ D F Sbjct: 336 SGVPHRQSLHSQWDGAFGQYNSL-FYEVDLDELKKEAQRLV--RLYPVD----EHAKRTF 388 Query: 207 SCVNKFATESINIACKWGYKGVE--------AGETLSDDYFNSRLPIVMKRV 250 + + + ES +A + E S +Y + K++ Sbjct: 389 ADFHGISIESSMLARSHVFSEFEWSTFSASSLPYHPSVEYIEKSKKVCEKQI 440 >UniRef50_A6E734 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6E734_9SPHI Length = 271 Score = 75.2 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 79/261 (30%), Gaps = 35/261 (13%) Query: 7 VMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPD 66 + +A L + + L V PD+ R+ + H++D Sbjct: 1 MRINELAVFTLPEGMYT-----FYKQNRRYLRDHAVDPDKRRYADT--SEAARHYLDVEH 53 Query: 67 KA-CNFDYERDCHDQHGVKD-------MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 C R D + IQ +L + + + A Sbjct: 54 YEVCIDSIPRKYPDAVKKYGLKKMNQSGILPWQIQQSYYKLVRAFQQRDSAKILIYSA-- 111 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 +L H++ D P+H D + +H W+ + ++DY + L Sbjct: 112 YLGHYLSDAQVPLHTTANHDG--------QLSGQQGIHAFWESRLPELFSEDY---NFLL 160 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA------GE 232 + + + W + +V ++ S I K+GY + E Sbjct: 161 GKAQYISDPLEEAWKMVSKTHLLVDSVLQ-LDSVLNSSFPIYRKYGYSKRKNKVVKQHTE 219 Query: 233 TLSDDYFNSRLPIVMKRVAQG 253 S Y +S +V +++ + Sbjct: 220 GYSRLYHDSMKHMVERQMREA 240 >UniRef50_B3EUC7 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=B3EUC7_AMOA5 Length = 317 Score = 74.4 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 37/261 (14%), Positives = 81/261 (31%), Gaps = 32/261 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R A L K L ++ V PD+ R+ + + + H Sbjct: 22 WGFAAHKHINRCAVFTLPPAMFTFYKYYLG-----YITENAVNPDKRRYVLEGE--ASRH 74 Query: 61 FIDTPDKACN--FDYERDCHDQHGVKDMC-------VAGAIQNFTTQLSHYREGTSDRRY 111 +ID N +D V IQ+ +L++ + Sbjct: 75 YIDLDYYGDNALDKLPKDWAQATHKYSQDTLLAHGIVPWHIQHMQHRLTNAFRNKDIAQI 134 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 + + H++ D + P+H + + +H +W+ + ++Y Sbjct: 135 LKLSSD--IGHYIADANVPLHTTQNYNG--------QLTGQDGIHGLWETRLPELFKEEY 184 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGY--KGVE 229 N + W + + N+ + +++ N K+ Y +G Sbjct: 185 NFFLGN---ATYVKDPQQRAWKAIIQAHATVPNLLKLEKE-LSQNFNTLHKFSYEKRGAS 240 Query: 230 AGETLSDDYFNSRLPIVMKRV 250 + S+ Y + ++ +V Sbjct: 241 LKKVYSEAYARAYHDLLQGQV 261 >UniRef50_C5PTL3 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PTL3_9SPHI Length = 315 Score = 74.4 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 77/275 (28%), Gaps = 36/275 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H + R A L E + + +++ V D + Y SP H Sbjct: 20 WGFYAHKLINRNAVFTLPTEL-----AVFYKQNIDEITEKAVDAD--KRCYIDSAESPRH 72 Query: 61 FIDTPDKACN---------FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID N + + ++ + + V I +L + Sbjct: 73 FIDLDAYDTNTLDTLPVHWYRAKEKIEEKRLLSNGIVPWQIYITYQKLVKAFIARDKIKI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + + +H W+ + A Y Sbjct: 133 IRHSAD--LGHYVADAHVPLHTTKNYNG--------QYTDQIGIHAFWESRLPEMFATHY 182 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 + + W+ S V + + + K Y Sbjct: 183 KLTAG---KAQFITDPAALGWAIVYESAPLADTVLRIEKELSVR-FPASQKKTYLTRNNV 238 Query: 232 ETLSD------DYFNSRLPIVMKRVAQGGIRLAML 260 L+ Y + +V R+ Q R+ L Sbjct: 239 LVLTYSDAYAKAYHEALNGMVEVRMRQAIHRIGSL 273 >UniRef50_B9TFK5 Putative uncharacterized protein (Fragment) n=1 Tax=Ricinus communis RepID=B9TFK5_RICCO Length = 228 Score = 74.1 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 73/235 (31%), Gaps = 64/235 (27%) Query: 47 VRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 V + S H+ D P + +++ G D + ++ L T Sbjct: 2 VAYTTANPKHSEYHYTDVPFQLAHYEDH-----GVGTTDHDIVQTLKQCIAVLQGKGNAT 56 Query: 107 SD-RRYNMTEALLFLSHFMGDIHQPMHVGFTS----------------------DAGGNS 143 ++ + +ALL L+H GDI QP+HVG GGN+ Sbjct: 57 TNPHNFTPRQALLMLTHLTGDIAQPLHVGEGYVGKNGGFVVPTQKQLDDKEAFATQGGNN 116 Query: 144 I---DLRWFRHKSNL------------------------HHVWDREIILTAAKDYYAKDI 176 + D++ S L H WD ++ A + A+ Sbjct: 117 LQLDDIKLTAKSSELIPAAAPDDSKPAAPARTPQATRAFHSYWDTTVVNYAFRRIGARTP 176 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 + + + G+ + +A +++ +A K Y V G Sbjct: 177 EQFA--------QMVSAGNPVVAPNSGDPVTWPYAWADQTLVVA-KLAYADVVPG 222 >UniRef50_C2G3H0 Possible S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G3H0_9SPHI Length = 100 Score = 74.1 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 32/70 (45%), Gaps = 3/70 (4%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + IA+ L ++A + L+ + L+ WPD V+ + + TSP H Sbjct: 23 WGMTGHRVVTEIAERHLTNKAKKNIAKLIGK---QHLAYWANWPDFVKSDHAFDETSPFH 79 Query: 61 FIDTPDKACN 70 +I+T Sbjct: 80 YINTEGNLTK 89 >UniRef50_C6Y3Y4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y3Y4_PEDHD Length = 285 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 38/267 (14%), Positives = 78/267 (29%), Gaps = 35/267 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H+ R+A L + + LS V PD+ R+ + + H Sbjct: 20 WGFYAHIRINRLAVFTLP----AGLNR-FYKANISYLSDHAVDPDKRRYADTAE--AARH 72 Query: 61 FIDTPDKACNFD-YERDCHDQHGV-------KDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 ++D + D R + ++ + IQ +L H Sbjct: 73 YLDVELYEAHIDSIPRKWEEAVKRYGLVRLNQNGILPWQIQKSYYKLVHALRDRD--SLK 130 Query: 113 MTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYY 172 + +L H++ D H P+H + ++ +H W+ + AK Y Sbjct: 131 ILIYSAYLGHYLADAHVPLHTTQNHNG--------QLSNQLGIHAFWESRLPELFAKKY- 181 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA-- 230 + + + N W + + V + K+ + Sbjct: 182 --NYVVGQAIYIENPLKEAWKIITHTHKMVDTVLT-FEARLNARFPAHRKYSFSERNNQV 238 Query: 231 GETLSDDYFNSRLP----IVMKRVAQG 253 G S Y + +V +++ Sbjct: 239 GRQYSLAYSKAFHDGMNHMVERQMRAA 265 >UniRef50_A2G9R9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G9R9_TRIVA Length = 115 Score = 70.6 bits (171), Expect = 6e-11, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 32/108 (29%), Gaps = 7/108 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY--VNGDLSALCVWPDQVRHWYKYKWTSP 58 W H+ IA G L+ + + + L+ + W D ++ YK+ Sbjct: 12 WWAHAHMAITEIALGHLSSKKINKLYELINRDGLPFQSVVDSSAWQDDLKDTYKFHAIGD 71 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 HF D P + V + + L+ + Sbjct: 72 WHFSDNPIY-----MNKTIPAIIPNPSYNVTSFLYDALDTLNDPTTTS 114 >UniRef50_C2FVU8 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FVU8_9SPHI Length = 238 Score = 69.0 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 50/180 (27%), Gaps = 26/180 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H + R A L E + + ++ V D + Y SP H Sbjct: 20 WGFYAHKLINRNAVFTLPTEL-----AVFYKQNIDQITEKAVDAD--KRCYIDSAESPRH 72 Query: 61 FIDTPDKACNFDYE---------RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID N + + + V I +L + Sbjct: 73 FIDLDAYDTNTLDTLPVHWSRAKEKIEQKRLLSNGIVPWQIYITYQKLVKAFIARDKTKI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + + +H W+ + A Y Sbjct: 133 IRHSAD--LGHYVADAHVPLHTTKNYNG--------QYTDQIGIHAFWESRLPEMFAPQY 182 >UniRef50_C9YFD0 Putative uncharacterized protein n=2 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YFD0_9BURK Length = 79 Score = 67.5 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 13/51 (25%), Positives = 23/51 (45%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY 51 W +GH + +A+ L+ A V LL + L+++ W D+ R Sbjct: 26 WGSDGHKIVAMLAEAQLSPAARKEVDRLLAQEPGATLASISTWADEHRSPA 76 >UniRef50_C5SFS5 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SFS5_9CAUL Length = 339 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 46/185 (24%), Gaps = 43/185 (23%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPL- 59 W GH + A L ++ GD+ PD + K Sbjct: 24 WGPTGHRIVGEEAARALPAYMPEFLR---SAQGVGDIGFYSNEPDAWKGAGKVHDFERDS 80 Query: 60 -HFIDTPDKACNFDYERDCHDQHGVKDMCVA----------------GAIQNFTTQLSH- 101 HFID D R D I + + Sbjct: 81 AHFIDLDDDGKTLAGVRLQEVPQSRSDFDALLRSKNVMPWKSGYLNYALIDAWQQVVKDF 140 Query: 102 ----------YREGTSDRRYNMTEALL-----------FLSHFMGDIHQPMHVGFTSDAG 140 E R+ + EA+ LSH++GD QP+H+ + Sbjct: 141 AYWRGMTYLEAHESDPKRKAWLKEAIRRREALTLRDIGILSHYVGDSSQPLHLSIHYNGW 200 Query: 141 GNSID 145 G Sbjct: 201 GKEYP 205 >UniRef50_B1ZQR9 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B1ZQR9_OPITP Length = 349 Score = 60.6 bits (145), Expect = 5e-08, Method: Composition-based stats. Identities = 37/308 (12%), Positives = 71/308 (23%), Gaps = 63/308 (20%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKY--KWTSP 58 W GH + + A L + V+ ++ L PD+ R+ K + Sbjct: 26 WDYTGHRIVNQAALASLPADFPEFVRA---PAAAERIAFLAGEPDRWRNVPDLPIKHANG 82 Query: 59 L-HFIDTPD----------------------------KACNFDYERDCHDQHGVKDMC-- 87 L H+ D F + ++ Sbjct: 83 LDHYCDLEHLAGAGVDPRTVSSLRFEFALTFAAGRAAHPEKFPPIDPAKNADRSREWAGF 142 Query: 88 VAGAIQNFTTQLSHYRE-------------GTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 A + +L + R N+ + + H +GD+ QP+H Sbjct: 143 APWAAAEYYGKLKSAFSYLKAYQEHGGTPVEIENARANILYLMGVMGHVVGDLAQPLHTT 202 Query: 135 --FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 G N + + +H D +I + Sbjct: 203 MHHHGWVGEN---PHGYSTWTGIHAWLDGGLIAQTGVTAGEVCAQVRPAHAL-------- 251 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 VF V +A + + +++ Sbjct: 252 -SVQPRADGRDPVFVQVMDYALAQNARVEPLYQLEKAGKLAPEAADLSEARTFICEQLQV 310 Query: 253 GGIRLAML 260 GG L + Sbjct: 311 GGEMLGSI 318 >UniRef50_B1MDJ0 Putative uncharacterized protein n=1 Tax=Mycobacterium abscessus ATCC 19977 RepID=B1MDJ0_MYCA9 Length = 728 Score = 60.6 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 41/254 (16%), Positives = 74/254 (29%), Gaps = 79/254 (31%) Query: 1 WSKEGHVMTCR---------------------------------IAQGLLNDEAAHAVKM 27 W + GH IAQ L EA Sbjct: 376 WGQTGHYSIATFTLDAIRSPNLKTLMQANLDAISFSLSELDPKSIAQRL--KEARSNPDG 433 Query: 28 LLPEYVNGDLSALCVW---PDQV-----RHWYKYKWTSPL---HFIDTPDKACNFDYERD 76 ++P DL VW P++V H Y+ P H+ D + + RD Sbjct: 434 IIPLADVPDL----VWKNLPNKVVGGRDDHMVGYRSQGPEHPCHYADIDEPGPDGSIVRD 489 Query: 77 ----------------CHDQHGVKDMC----VAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 +D+ G + + + F + + + ++ Sbjct: 490 LCLQDIANLTVTKWQQFYDERGHRTPDKRGLLPFRVWQFYDAMVGFAKSRQVDQFVCAAG 549 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 L L+H++GD QP+H + +D + +H ++ ++I A+ A Sbjct: 550 L--LAHYVGDASQPLHGSYLADG-------YPDGTGAGVHSCYESKMIDRYARQLVAAIP 600 Query: 177 NLLEEDIEGNFTDG 190 L + D Sbjct: 601 ADLATLGDLELIDD 614 >UniRef50_B5YKD8 Putative uncharacterized protein n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YKD8_THEYD Length = 262 Score = 60.2 bits (144), Expect = 7e-08, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 38/127 (29%), Gaps = 16/127 (12%) Query: 27 MLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFID-------TPDKACNFDYERDCHD 79 + + + PD +R Y +P H+ D TP+ F + Sbjct: 32 AYIAKKAGIRIPEAACMPDIIR-DENYDLLAPFHYHDASPDTVVTPEYIDKFGIKEAFLL 90 Query: 80 QH--------GVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPM 131 + I ++ D L+ ++H++GD+ QP+ Sbjct: 91 VDGKNFRISVPHPAGVLYWKIVQIYEKMKSLDRTKPDNVLAYEYYLVSIAHYIGDLSQPL 150 Query: 132 HVGFTSD 138 H D Sbjct: 151 HNFPYGD 157 >UniRef50_C5GNE5 Predicted protein n=1 Tax=Ajellomyces dermatitidis ER-3 RepID=C5GNE5_AJEDR Length = 380 Score = 55.9 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 9/72 (12%) Query: 8 MTCRIAQGLLNDEAAHA-------VKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 + IA LL+ A +K ++ +G + W D+ + K + H Sbjct: 86 VIPLIA--LLSPSAQAWGTKTNRIIKHIVEPQYDGSIGRAAAWADECGRTDEGKDSPTWH 143 Query: 61 FIDTPDKACNFD 72 +I+ D ++ Sbjct: 144 YINPADNPPAYE 155 >UniRef50_C1F7J9 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7J9_ACIC5 Length = 319 Score = 55.6 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 41/287 (14%), Positives = 82/287 (28%), Gaps = 60/287 (20%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W K+GH M +A L ++ +++ L PD+ R + + + Sbjct: 29 WGKDGHKMINHLAVTSLPPSIPAFLR---SPAAVDEITYLGPEPDRWRSPAEPELDAMQA 85 Query: 58 PLHFIDT-------PDKACNFDY------------------ERDCHDQHGVKDMCVAGAI 92 P H+ID P + Y + V + Sbjct: 86 PDHYIDMELADRIAPLPRERYQYIAKLYAYIEAHPDQAREMQPTHIGFQPYISEEVWERL 145 Query: 93 Q---NFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG--FTSDAGGNSIDLR 147 + QL + T + + +L H++ D QP+H + G N Sbjct: 146 KSAMRDYRQLKAAGKDTMPVQQAIIFYAGWLGHYVADGSQPLHTTIEYNGWVGPN---PN 202 Query: 148 WFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS 207 + ++H ++ E + + E + I + W + Sbjct: 203 HYTTSHHIHSQFESEFVHDNMTN--------AEVRQYMKPVEPIGDEWTQYWDYLNTTHA 254 Query: 208 CVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 V+ E + + G++G +R+A G Sbjct: 255 DVD----EVYQLWNEHGFEGKGT---------AESRKFTAERLAAGA 288 >UniRef50_C5JC63 Predicted protein n=1 Tax=Ajellomyces dermatitidis SLH14081 RepID=C5JC63_AJEDS Length = 303 Score = 54.0 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 17/98 (17%), Positives = 35/98 (35%), Gaps = 11/98 (11%) Query: 8 MTCRIAQGLLNDEAAHA-------VKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 + IA LL+ A +K ++ +G + W D+ + K + H Sbjct: 86 VIPLIA--LLSPSAQAWGTKTNRIIKHIVEPQYDGSIGRAAAWADECGRTDEGKDSPTWH 143 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQ 98 +I+ P R + V + C G++ + + Sbjct: 144 YIN-PADNAGTKNGR-VLNGLPVVNGCAEGSVADVEDE 179 >UniRef50_A3HWS6 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HWS6_9SPHI Length = 280 Score = 54.0 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 34/256 (13%), Positives = 79/256 (30%), Gaps = 36/256 (14%) Query: 12 IAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACN- 70 +A L E + ++ V PD+ R+ + + H+ID + N Sbjct: 1 MAIYSLPPELIA-----FYKPHIQFITEKAVNPDRRRYAVIGE--AEKHYIDLDEYGENP 53 Query: 71 --------FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 ++ ++ K+ + L+ E +++ A L H Sbjct: 54 LDILPIYWYEAVEKFSEEELRKNGIGPWSAYLTFLNLTEAFESKNEKAILRLSAD--LGH 111 Query: 123 FMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEED 182 ++ D++ P+H + + +H W+ I + A + + Sbjct: 112 YLADLNVPLHTTKNYNG--------QLTGQEGIHGFWESRIPESQANRFELWVG---TAE 160 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA------GETLSD 236 IW + +V + K T + K+ Y+ + E + Sbjct: 161 YISQPQQAIWDAVAQAHAMVDSVLT-FEKELTSNFPQDQKYSYEQRNSLTVRVYSEEFTQ 219 Query: 237 DYFNSRLPIVMKRVAQ 252 Y + V +++ + Sbjct: 220 QYAEALDHQVDRQMRK 235 >UniRef50_Q028C4 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q028C4_SOLUE Length = 352 Score = 48.6 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 55/180 (30%), Gaps = 20/180 (11%) Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 R+ + + + L+ + ++ + ++ H++ D QP+H Sbjct: 148 RNVSGPEEANRVNIGSIYAAISPTLADRAQVQQMLANDIAFYMGWVGHYVADAAQPLHNS 207 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 D + D + + N+H ++ + + +D++ E D +W Sbjct: 208 IHHDGW-SGADPKGYTRDPNIHGRFESQYLDLIGVT--EEDVDKYMRK-EPRLLDNVWKA 263 Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 L E + Y+ G + +V KR+A G Sbjct: 264 VLDHSLEARGFT---------------EEVYRLDLRGA-FTKKDDAEARELVCKRLAAGA 307 >UniRef50_UPI00016C48C1 hypothetical protein GobsU_04989 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C48C1 Length = 288 Score = 47.9 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 43/158 (27%), Gaps = 24/158 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH---WYKYKWTS 57 W GH A L D L+ PD+ ++ + + Sbjct: 26 WWSGGHETVAAAAAARLPDGVPE-----FFRNGGKHLAHFSGDPDRWKNREMTFLRRAEE 80 Query: 58 PLHFIDTPDKACNFDYERDCHD----------QHGVKDMCVAGAIQNFTTQLSHYREGTS 107 HF+D D +D + K + AI + +L+ Sbjct: 81 GNHFLDLEDLDGKKYPATHRYDGLKMVYGELKKEPNKVGTLPYAIVEYYEKLTVGFYDHR 140 Query: 108 DRRYNMTEALLFLS------HFMGDIHQPMHVGFTSDA 139 + + + L H+ GD P+H D Sbjct: 141 KAPKDTSVPMKCLVYGGTLAHYTGDAAMPLHTTRDFDG 178 >UniRef50_P59026 Phospholipase C n=6 Tax=Clostridium RepID=PHLC_CLOHA Length = 399 Score = 40.5 bits (93), Expect = 0.060, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 67/227 (29%), Gaps = 21/227 (9%) Query: 6 HVMTCRIAQGLLNDEAAHA----VK---MLLPEYVNGDLSALCVWPDQVRHWYKYKWTSP 58 H + A +L ++ VK +L E L +PD + K Sbjct: 38 HALIVTQAVEILKNDVISTSPLSVKENFKIL-ESNLKKLQRGSTYPD---YDPKAYALYQ 93 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF D PD NF + + +G+ + ++ + + + L Sbjct: 94 DHFWD-PDTDNNFTKDSKWYLAYGI-NETGESQLRKLFALAKDEWKKGNYEQATWL--LG 149 Query: 119 FLSHFMGDIHQPMH---VGFTSDAGGNSIDLRWFRHKSN--LHHVWDREIILTAAKDYYA 173 H+ GD H P H V AG + K + LH + Sbjct: 150 QGLHYFGDFHTPYHPSNVTAVDSAGHTKFETYVEGKKDSYKLHTAGANSVKEFYPTTLQN 209 Query: 174 KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA 220 +++ + + + A + ATE+++ Sbjct: 210 TNLDNWITEYSRGWAKKAKNMYYAHATMSHSW-KDWEIAATETMHNV 255 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_D1HBQ0 Whole genome shotgun sequence of line PN40024, s... 296 6e-79 UniRef50_Q9SXA6 Bifunctional nuclease bfn1 n=20 Tax=Magnoliophyt... 289 9e-77 UniRef50_B9HYZ1 Predicted protein n=20 Tax=Spermatophyta RepID=B... 278 1e-73 UniRef50_Q8LA68 Endonuclease, putative n=13 Tax=Embryophyta RepI... 278 1e-73 UniRef50_Q9LGA5 Os01g0128200 protein n=12 Tax=Magnoliophyta RepI... 277 4e-73 UniRef50_C3VEY2 Bifunctional nuclease n=2 Tax=rosids RepID=C3VEY... 272 7e-72 UniRef50_A5A339 Endonuclease n=1 Tax=Glycine max RepID=A5A339_SOYBN 254 2e-66 UniRef50_A9U2Y4 Predicted protein n=1 Tax=Physcomitrella patens ... 239 8e-62 UniRef50_B8MCF5 Nuclease PA3, putative n=2 Tax=Leotiomyceta RepI... 234 2e-60 UniRef50_B6H0E5 Pc12g06500 protein n=2 Tax=Penicillium RepID=B6H... 233 4e-60 UniRef50_A0M3W8 S1/P1 endonuclease family protein n=6 Tax=Bacter... 227 3e-58 UniRef50_P24021 Nuclease S1 n=6 Tax=Leotiomyceta RepID=NUS1_ASPOR 226 8e-58 UniRef50_B0DXE1 Predicted protein n=4 Tax=Agaricales RepID=B0DXE... 224 2e-57 UniRef50_D1Z5H6 Whole genome shotgun sequence assembly, scaffold... 223 4e-57 UniRef50_C5K479 Nuclease PA3, putative n=5 Tax=Perkinsus marinus... 222 1e-56 UniRef50_Q0CD39 Predicted protein n=2 Tax=Aspergillus RepID=Q0CD... 217 3e-55 UniRef50_O65424 Putative bifunctional nuclease n=2 Tax=Arabidops... 216 8e-55 UniRef50_B0MYD6 Putative uncharacterized protein n=1 Tax=Alistip... 215 1e-54 UniRef50_A7ETG5 Putative uncharacterized protein n=1 Tax=Sclerot... 214 2e-54 UniRef50_C5K482 Nuclease PA3, putative n=2 Tax=Perkinsus marinus... 213 4e-54 UniRef50_A3XR21 Putative S1/P1 Nuclease n=1 Tax=Leeuwenhoekiella... 213 6e-54 UniRef50_Q7S8Q5 Predicted protein n=1 Tax=Neurospora crassa RepI... 213 7e-54 UniRef50_Q2SFD4 Probable endonuclease n=1 Tax=Hahella chejuensis... 212 1e-53 UniRef50_D1ZIR6 Whole genome shotgun sequence assembly, scaffold... 211 2e-53 UniRef50_B7FP92 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 211 2e-53 UniRef50_Q3IBZ8 Putative S1/P1 Nuclease n=2 Tax=Alteromonadales ... 208 2e-52 UniRef50_B8NJ54 Nuclease S1, putative n=1 Tax=Aspergillus flavus... 207 3e-52 UniRef50_A6EB04 Putative S1/P1 Nuclease n=1 Tax=Pedobacter sp. B... 206 9e-52 UniRef50_Q0AMT2 S1/P1 nuclease n=1 Tax=Maricaulis maris MCS10 Re... 205 1e-51 UniRef50_C7PH62 S1/P1 nuclease n=1 Tax=Chitinophaga pinensis DSM... 204 3e-51 UniRef50_P24504 Nuclease PA3 n=2 Tax=Penicillium RepID=NUP3_PENSQ 203 4e-51 UniRef50_Q4DEV4 Class I nuclease-like protein, putative n=2 Tax=... 203 5e-51 UniRef50_B2W4S8 Nuclease PA3 n=2 Tax=Pleosporineae RepID=B2W4S8_... 203 5e-51 UniRef50_C8WD33 S1/P1 nuclease n=5 Tax=Alphaproteobacteria RepID... 202 9e-51 UniRef50_C6XYC1 S1/P1 nuclease n=1 Tax=Pedobacter heparinus DSM ... 202 1e-50 UniRef50_A4BZ60 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriacea... 202 1e-50 UniRef50_Q15ZB2 S1/P1 nuclease n=4 Tax=Alteromonadales RepID=Q15... 201 1e-50 UniRef50_C6X5W4 S1/P1 endonuclease family protein n=3 Tax=Bacter... 201 2e-50 UniRef50_C5KMC3 Nuclease PA3, putative n=1 Tax=Perkinsus marinus... 200 4e-50 UniRef50_Q5FP59 Nuclease S1 n=1 Tax=Gluconobacter oxydans RepID=... 200 6e-50 UniRef50_A4C4V1 Putative S1/P1 Nuclease n=1 Tax=Pseudoalteromona... 199 1e-49 UniRef50_A4HI96 p1/s1 nuclease n=10 Tax=Leishmania RepID=A4HI96_... 196 4e-49 UniRef50_B0DTT7 Predicted protein n=2 Tax=Laccaria bicolor S238N... 196 5e-49 UniRef50_Q3BPV9 Endonuclease S1 n=15 Tax=Bacteria RepID=Q3BPV9_X... 194 3e-48 UniRef50_A2QX99 Contig An11c0270, complete genome n=1 Tax=Asperg... 194 3e-48 UniRef50_C5PWU6 S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritiv... 194 3e-48 UniRef50_C6XIU0 S1/P1 nuclease n=1 Tax=Hirschia baltica ATCC 498... 193 4e-48 UniRef50_C9ZQW0 Single strand-specific nuclease, putative n=6 Ta... 193 4e-48 UniRef50_Q1YUT9 Probable endonuclease n=1 Tax=gamma proteobacter... 193 5e-48 UniRef50_Q01U80 S1/P1 nuclease n=1 Tax=Candidatus Solibacter usi... 191 3e-47 UniRef50_A0BLJ0 Chromosome undetermined scaffold_114, whole geno... 190 4e-47 UniRef50_C5K8A7 Nuclease S1, putative n=7 Tax=Perkinsus marinus ... 190 4e-47 UniRef50_B9XJ21 S1/P1 nuclease n=1 Tax=bacterium Ellin514 RepID=... 188 2e-46 UniRef50_Q7P202 Probable endonuclease n=1 Tax=Chromobacterium vi... 188 3e-46 UniRef50_A6GGE9 Probable endonuclease n=1 Tax=Plesiocystis pacif... 186 5e-46 UniRef50_B8KH31 S1/P1 nuclease n=1 Tax=gamma proteobacterium NOR... 186 5e-46 UniRef50_A2EEH7 Putative uncharacterized protein n=1 Tax=Trichom... 186 5e-46 UniRef50_Q4QGQ3 3'-nucleotidase/nuclease, putative n=3 Tax=Leish... 186 7e-46 UniRef50_A2ECC5 Class I nuclease, putative n=2 Tax=Trichomonas v... 186 7e-46 UniRef50_Q989R8 Endonuclease n=1 Tax=Mesorhizobium loti RepID=Q9... 186 9e-46 UniRef50_A3HUK9 Putative S1/P1 Nuclease n=1 Tax=Algoriphagus sp.... 186 1e-45 UniRef50_A4CQ68 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriales... 186 1e-45 UniRef50_A2ELH6 Class I nuclease, putative n=1 Tax=Trichomonas v... 185 1e-45 UniRef50_Q25267 3'-nucleotidase/nuclease n=7 Tax=Trypanosomatida... 184 3e-45 UniRef50_B0DTT9 Predicted protein n=2 Tax=Agaricales RepID=B0DTT... 184 3e-45 UniRef50_Q7RSD2 3'-nucleotidase/nuclease n=8 Tax=Plasmodium RepI... 183 7e-45 UniRef50_C5LHN6 ATP-dependent RNA helicase, putative n=4 Tax=Per... 183 8e-45 UniRef50_Q04SY8 Nuclease S1 n=4 Tax=Leptospira RepID=Q04SY8_LEPBJ 182 1e-44 UniRef50_C5LN34 S1/P1nuclease, putative n=7 Tax=Perkinsus marinu... 181 2e-44 UniRef50_A2G6P9 Putative uncharacterized protein n=1 Tax=Trichom... 180 3e-44 UniRef50_B6DTM7 Single-strand-specific nuclease n=2 Tax=Bodo sal... 179 6e-44 UniRef50_A2F450 Putative uncharacterized protein n=1 Tax=Trichom... 179 8e-44 UniRef50_Q236I5 S1/P1 Nuclease n=2 Tax=Tetrahymena thermophila S... 179 1e-43 UniRef50_A8HTU7 Endonuclease n=1 Tax=Azorhizobium caulinodans OR... 178 1e-43 UniRef50_B0T6T3 S1/P1 nuclease n=1 Tax=Caulobacter sp. K31 RepID... 176 6e-43 UniRef50_Q23AG7 Putative uncharacterized protein n=2 Tax=Tetrahy... 176 7e-43 UniRef50_Q1N3Y8 Probable endonuclease n=1 Tax=Bermanella marisru... 176 9e-43 UniRef50_A4A822 Nuclease S1 n=1 Tax=Congregibacter litoralis KT7... 174 2e-42 UniRef50_A2E6R1 Putative uncharacterized protein n=2 Tax=Trichom... 174 2e-42 UniRef50_Q2N7X6 Endonuclease n=3 Tax=Erythrobacter RepID=Q2N7X6_... 173 4e-42 UniRef50_A7H7R9 S1/P1 nuclease n=1 Tax=Anaeromyxobacter sp. Fw10... 173 5e-42 UniRef50_B9XA25 Putative uncharacterized protein n=1 Tax=bacteri... 172 1e-41 UniRef50_UPI0001745ECB hypothetical protein VspiD_30620 n=1 Tax=... 172 1e-41 UniRef50_Q5ZV70 3'-nucleotidase/nuclease n=6 Tax=Legionella RepI... 171 2e-41 UniRef50_A2E030 Putative uncharacterized protein n=1 Tax=Trichom... 168 2e-40 UniRef50_UPI000150A357 hypothetical protein TTHERM_00515230 n=1 ... 168 2e-40 UniRef50_UPI00006CE90A hypothetical protein TTHERM_00559790 n=1 ... 168 2e-40 UniRef50_O68530 Endonuclease S1 homolog n=1 Tax=Mesorhizobium lo... 165 1e-39 UniRef50_A9UZI8 Predicted protein n=1 Tax=Monosiga brevicollis R... 164 3e-39 UniRef50_Q8ILX4 p1/s1 nuclease, putative n=4 Tax=Plasmodium RepI... 163 5e-39 UniRef50_B3LAP6 Putative uncharacterized protein n=1 Tax=Plasmod... 163 9e-39 UniRef50_Q4PFZ0 Putative uncharacterized protein n=1 Tax=Ustilag... 162 1e-38 UniRef50_O65425 Putative bifunctional nuclease n=1 Tax=Arabidops... 161 2e-38 UniRef50_A2FAR0 Putative uncharacterized protein n=1 Tax=Trichom... 161 3e-38 UniRef50_D0NJT7 Putative uncharacterized protein n=1 Tax=Phytoph... 160 5e-38 UniRef50_B6ABV1 Putative uncharacterized protein n=1 Tax=Cryptos... 159 9e-38 UniRef50_Q560K3 Putative uncharacterized protein n=2 Tax=Filobas... 159 1e-37 UniRef50_A3FPP7 S1/P1nuclease, putative n=2 Tax=Cryptosporidium ... 156 5e-37 UniRef50_A0Z194 Endonuclease S1 n=1 Tax=marine gamma proteobacte... 156 8e-37 UniRef50_B8P2Q4 Predicted protein n=2 Tax=Postia placenta Mad-69... 156 9e-37 UniRef50_C9SGH7 Nuclease PA3 n=1 Tax=Verticillium albo-atrum VaM... 154 3e-36 UniRef50_D2QW83 Putative uncharacterized protein n=1 Tax=Pirellu... 153 5e-36 UniRef50_B2JAU7 Putative uncharacterized protein n=1 Tax=Nostoc ... 152 1e-35 UniRef50_A6C3P1 Putative uncharacterized protein n=1 Tax=Plancto... 151 2e-35 UniRef50_C5LKE6 Putative uncharacterized protein n=2 Tax=Perkins... 151 2e-35 UniRef50_Q47K45 Putative uncharacterized protein n=1 Tax=Dechlor... 150 4e-35 UniRef50_B6KFB6 Putative uncharacterized protein n=3 Tax=Toxopla... 150 6e-35 UniRef50_C7RIT3 Putative uncharacterized protein n=1 Tax=Candida... 144 3e-33 UniRef50_B8KWM0 Putative uncharacterized protein n=1 Tax=gamma p... 144 4e-33 UniRef50_B6KF36 Putative uncharacterized protein n=3 Tax=Toxopla... 144 4e-33 UniRef50_A8PCL3 Putative uncharacterized protein n=1 Tax=Coprino... 137 4e-31 UniRef50_B4CYG7 Putative uncharacterized protein n=1 Tax=Chthoni... 137 5e-31 UniRef50_A4BF01 Probable endonuclease n=1 Tax=Reinekea blandensi... 136 6e-31 UniRef50_D0Y4Z6 Phospholipase C/P1 nuclease domain-containing pr... 136 6e-31 UniRef50_B3L390 S1/p1nuclease, putative n=8 Tax=Plasmodium RepID... 134 3e-30 UniRef50_Q4UCH4 Bifunctional nuclease, putative n=2 Tax=Theileri... 134 4e-30 UniRef50_A4YRX0 Putative uncharacterized protein n=2 Tax=Bradyrh... 131 3e-29 UniRef50_A2DKF6 Putative uncharacterized protein n=1 Tax=Trichom... 126 6e-28 UniRef50_C5KYE5 S1/P1nuclease, putative n=6 Tax=Perkinsus marinu... 125 1e-27 UniRef50_A5FFX0 Putative uncharacterized protein n=1 Tax=Flavoba... 123 5e-27 UniRef50_Q6LI73 Hypothetical endonuclease n=2 Tax=Photobacterium... 123 5e-27 UniRef50_A2DRT9 Putative uncharacterized protein n=1 Tax=Trichom... 123 6e-27 UniRef50_Q11TZ7 Putative uncharacterized protein n=1 Tax=Cytopha... 117 5e-25 UniRef50_A4KXI8 Putative S1/P1 nuclease n=2 Tax=Ascovirus RepID=... 117 5e-25 UniRef50_C5PTL3 Putative uncharacterized protein n=1 Tax=Sphingo... 116 6e-25 UniRef50_A7ARD9 S1/P1 nuclease, putative n=1 Tax=Babesia bovis R... 116 1e-24 UniRef50_C5BI21 Putative uncharacterized protein n=1 Tax=Teredin... 115 1e-24 UniRef50_C6VWZ8 Putative uncharacterized protein n=1 Tax=Dyadoba... 115 2e-24 UniRef50_A2FG69 Putative uncharacterized protein n=1 Tax=Trichom... 115 2e-24 UniRef50_Q8XRE8 Putative signal peptide protein n=1 Tax=Ralstoni... 111 3e-23 UniRef50_C7PNU1 Putative uncharacterized protein n=1 Tax=Chitino... 111 4e-23 UniRef50_A6E734 Putative uncharacterized protein n=1 Tax=Pedobac... 110 6e-23 UniRef50_D2QFB3 Putative uncharacterized protein n=1 Tax=Spiroso... 110 6e-23 UniRef50_B0DTT4 Predicted protein n=1 Tax=Laccaria bicolor S238N... 110 6e-23 UniRef50_D2LJW8 Putative uncharacterized protein n=1 Tax=Rhodomi... 109 8e-23 UniRef50_C6Y3Y4 Putative uncharacterized protein n=1 Tax=Pedobac... 109 9e-23 UniRef50_A2F5A5 Putative uncharacterized protein n=2 Tax=Trichom... 107 3e-22 UniRef50_C9YFD1 Putative uncharacterized protein n=2 Tax=Curviba... 106 7e-22 UniRef50_B3EUC7 Putative uncharacterized protein n=2 Tax=Bactero... 106 8e-22 UniRef50_A2EIL3 Putative uncharacterized protein n=1 Tax=Trichom... 106 1e-21 UniRef50_Q0E526 29.6 kDa S1/P1 nuclease n=1 Tax=Spodoptera frugi... 104 3e-21 UniRef50_C0A652 Putative uncharacterized protein n=1 Tax=Opituta... 102 1e-20 UniRef50_C2FVU8 Putative uncharacterized protein n=1 Tax=Sphingo... 97 5e-19 UniRef50_A3HWS6 Putative uncharacterized protein n=1 Tax=Algorip... 93 1e-17 UniRef50_Q21JG1 Putative uncharacterized protein n=1 Tax=Sacchar... 92 2e-17 UniRef50_B1ZQR9 Putative uncharacterized protein n=2 Tax=Verruco... 91 3e-17 UniRef50_A2FZN6 Putative uncharacterized protein n=1 Tax=Trichom... 91 5e-17 UniRef50_D0NJT6 Putative uncharacterized protein n=1 Tax=Phytoph... 91 5e-17 UniRef50_B9EZB3 Putative uncharacterized protein n=1 Tax=Oryza s... 88 3e-16 UniRef50_A2G9R8 Putative uncharacterized protein n=1 Tax=Trichom... 86 1e-15 UniRef50_C7J139 Os04g0636400 protein n=2 Tax=Oryza sativa RepID=... 83 8e-15 UniRef50_B6KMV3 Putative uncharacterized protein n=3 Tax=Toxopla... 83 1e-14 UniRef50_C1F7J9 Putative uncharacterized protein n=1 Tax=Acidoba... 81 5e-14 UniRef50_D1ZW87 Whole genome shotgun sequence assembly, contig_8... 77 6e-13 UniRef50_Q4Q7F8 Class i nuclease-like protein n=4 Tax=Leishmania... 76 1e-12 UniRef50_B9TFK5 Putative uncharacterized protein (Fragment) n=1 ... 75 3e-12 UniRef50_C2G3H0 Possible S1/P1 Nuclease n=1 Tax=Sphingobacterium... 72 2e-11 UniRef50_A2G9R9 Putative uncharacterized protein n=1 Tax=Trichom... 71 6e-11 UniRef50_C5SFS5 Putative uncharacterized protein n=1 Tax=Asticca... 67 8e-10 UniRef50_UPI00016C48C1 hypothetical protein GobsU_04989 n=1 Tax=... 64 5e-09 UniRef50_C9YFD0 Putative uncharacterized protein n=2 Tax=Curviba... 64 5e-09 UniRef50_B5YKD8 Putative uncharacterized protein n=1 Tax=Thermod... 64 6e-09 UniRef50_Q028C4 Putative uncharacterized protein n=1 Tax=Candida... 63 1e-08 UniRef50_C5GNE5 Predicted protein n=1 Tax=Ajellomyces dermatitid... 60 8e-08 UniRef50_B1MDJ0 Putative uncharacterized protein n=1 Tax=Mycobac... 60 1e-07 UniRef50_C5JC63 Predicted protein n=1 Tax=Ajellomyces dermatitid... 56 2e-06 Sequences not found previously or not previously below threshold: UniRef50_B4WCT7 Putative uncharacterized protein n=1 Tax=Brevund... 64 6e-09 UniRef50_D0XMV2 Putative uncharacterized protein n=1 Tax=Brevund... 60 1e-07 UniRef50_B0T3S4 Putative uncharacterized protein n=5 Tax=Cauloba... 58 2e-07 UniRef50_Q6MQM4 Putative uncharacterized protein n=1 Tax=Bdellov... 58 3e-07 UniRef50_B0RM73 Exported putative nuclease n=3 Tax=Xanthomonas c... 48 3e-04 UniRef50_C8X622 Putative uncharacterized protein n=1 Tax=Nakamur... 44 0.004 UniRef50_B5YDN6 Putative uncharacterized protein n=1 Tax=Dictyog... 44 0.004 UniRef50_P59026 Phospholipase C n=6 Tax=Clostridium RepID=PHLC_C... 43 0.012 >UniRef50_D1HBQ0 Whole genome shotgun sequence of line PN40024, scaffold_301.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1HBQ0_VITVI Length = 332 Score = 296 bits (757), Expect = 6e-79, Method: Composition-based stats. Identities = 146/272 (53%), Positives = 199/272 (73%), Gaps = 3/272 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W KEGH C+IA+G L+++A AVK LLP+Y GDL+A+C W D++RH + ++W+ PLH Sbjct: 25 WGKEGHYAVCKIAEGFLSEDALGAVKALLPDYAEGDLAAVCSWADEIRHNFHWRWSGPLH 84 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH-YREGTSDRRYNMTEALLF 119 ++DTPD CN++Y RDCHD G KD+CV GAI N+T QL+ Y S+ RYN+TEAL+F Sbjct: 85 YVDTPDYRCNYEYCRDCHDFRGHKDICVTGAIYNYTKQLTSGYHNSGSEIRYNLTEALMF 144 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSHF+GD+HQP+HVGFT D GGN+I +RW+R K+NLHH+WD II +A K YY D+ ++ Sbjct: 145 LSHFIGDVHQPLHVGFTGDEGGNTIIVRWYRRKTNLHHIWDNMIIDSALKTYYNSDLAIM 204 Query: 180 EEDIEGNFTDGIWSDDLASWREC-GNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + I+ N T WS D++SW+ C + +C N +A+ESI++ACK+ Y+ G TL DDY Sbjct: 205 IQAIQRNITGD-WSFDISSWKNCASDDTACPNLYASESISLACKFAYRNATPGSTLGDDY 263 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 F SRLPIV KR+AQGGIRLA LN +F + + Sbjct: 264 FLSRLPIVEKRLAQGGIRLAATLNRIFASQPK 295 >UniRef50_Q9SXA6 Bifunctional nuclease bfn1 n=20 Tax=Magnoliophyta RepID=Q9SXA6_ARATH Length = 305 Score = 289 bits (738), Expect = 9e-77, Method: Composition-based stats. Identities = 201/277 (72%), Positives = 241/277 (87%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WSKEGH++TCRIAQ LL AH V+ LLP+YV GDLSALCVWPDQ+RHWYKY+WTS LH Sbjct: 29 WSKEGHILTCRIAQNLLEAGPAHVVENLLPDYVKGDLSALCVWPDQIRHWYKYRWTSHLH 88 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +IDTPD+AC+++Y RDCHDQHG+KDMCV GAIQNFT+QL HY EGTSDRRYNMTEALLFL Sbjct: 89 YIDTPDQACSYEYSRDCHDQHGLKDMCVDGAIQNFTSQLQHYGEGTSDRRYNMTEALLFL 148 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 SHFMGDIHQPMHVGFTSD GGN+IDLRW++HKSNLHHVWDREIILTA K+ Y K+++LL+ Sbjct: 149 SHFMGDIHQPMHVGFTSDEGGNTIDLRWYKHKSNLHHVWDREIILTALKENYDKNLDLLQ 208 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 ED+E N T+G+W DDL+SW EC ++ +C +K+A+ESI +ACKWGYKGV++GETLS++YFN Sbjct: 209 EDLEKNITNGLWHDDLSSWTECNDLIACPHKYASESIKLACKWGYKGVKSGETLSEEYFN 268 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 +RLPIVMKR+ QGG+RLAM+LN VF V AT Sbjct: 269 TRLPIVMKRIVQGGVRLAMILNRVFSDDHAIAGVAAT 305 >UniRef50_B9HYZ1 Predicted protein n=20 Tax=Spermatophyta RepID=B9HYZ1_POPTR Length = 297 Score = 278 bits (712), Expect = 1e-73, Method: Composition-based stats. Identities = 145/269 (53%), Positives = 185/269 (68%), Gaps = 5/269 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W KEGH TC+IA+G L EA AVK LLPE GDL+ +C WPD++R + Y W+S LH Sbjct: 25 WGKEGHYATCKIAEGYLTAEALAAVKELLPESAEGDLANVCSWPDEIR--FHYHWSSALH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH-YREGTSDRRYNMTEALLF 119 ++DTPD CN++Y RDCHD G KD CV GAI N+T QL Y+ S+ YN+TEAL+F Sbjct: 83 YVDTPDFRCNYEYFRDCHDSSGRKDRCVTGAIYNYTNQLLSLYQNSNSESNYNLTEALMF 142 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSHF+GD+HQP+HVGF D GGN+I + W+R KSNLHHVWD II +A K +Y+ D+ + Sbjct: 143 LSHFIGDVHQPLHVGFLGDLGGNTIQVHWYRRKSNLHHVWDNMIIESALKTFYSSDLATM 202 Query: 180 EEDIEGNFTDGIWSDDLASWREC-GNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 I+ N T+ WS+ W C N C N +A+ESI++ACK+ YK G TL DDY Sbjct: 203 IRAIQNNITEN-WSNQQPLWEHCAHNHTVCPNPYASESISLACKFAYKNASPGSTLEDDY 261 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 F SRLP+V KR+AQGGIRLA LN +F + Sbjct: 262 FLSRLPVVEKRLAQGGIRLAATLNRIFAS 290 >UniRef50_Q8LA68 Endonuclease, putative n=13 Tax=Embryophyta RepID=Q8LA68_ARATH Length = 296 Score = 278 bits (712), Expect = 1e-73, Method: Composition-based stats. Identities = 130/273 (47%), Positives = 184/273 (67%), Gaps = 4/273 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY-VNGDLSALCVWPDQVRHWYKYKWTSPL 59 W K+GH C++A+G D+ AVK LLPE G L+ C WPD+++ +++WTS L Sbjct: 21 WGKDGHYTVCKLAEGFFEDDTIAAVKKLLPESVDGGGLADFCSWPDEIKKLSQWQWTSTL 80 Query: 60 HFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR-RYNMTEALL 118 H+++TP+ CN++Y RDCHD H +D CV GAI N+T QL E + + YN+TEALL Sbjct: 81 HYVNTPEYRCNYEYCRDCHDTHKHRDWCVTGAIFNYTNQLMSASENSQNIVHYNLTEALL 140 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 FLSH+MGD+HQP+H GF D GGN+I + W+ +KSNLHHVWD II +A + YY + Sbjct: 141 FLSHYMGDVHQPLHTGFLGDLGGNTIIVNWYHNKSNLHHVWDNMIIDSALETYYNSSLPH 200 Query: 179 LEEDIEGNFTDGIWSDDLASWRECG-NVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 + + ++ +G WS+D+ SW+ C + +C N +A+ESI++ACK+ Y+ G TL D+ Sbjct: 201 MIQALQAKLKNG-WSNDVPSWKSCHFHQKACPNLYASESIDLACKYAYRNATPGTTLGDE 259 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 YF SRLP+V KR+AQGGIRLA LN +F A + Sbjct: 260 YFLSRLPVVEKRLAQGGIRLAATLNRIFSAKPK 292 >UniRef50_Q9LGA5 Os01g0128200 protein n=12 Tax=Magnoliophyta RepID=Q9LGA5_ORYSJ Length = 308 Score = 277 bits (707), Expect = 4e-73, Method: Composition-based stats. Identities = 140/276 (50%), Positives = 194/276 (70%), Gaps = 7/276 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K+GH++ C+IA+ L+++AA AV+ LLPE G+LS +C W D+VR + Y W+ PLH Sbjct: 34 WGKQGHIIVCKIAEKYLSEKAAAAVEELLPESAGGELSTVCPWADEVR--FHYYWSRPLH 91 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 + +TP + CNF Y RDCH+ + MCV GAI N+T QL Y G S YN+TE+L+FL Sbjct: 92 YANTP-QVCNFKYSRDCHNSRHQQGMCVVGAINNYTDQL--YSYGDSKSSYNLTESLMFL 148 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 +HF+GD+HQP+HVGF D GGN+I + W+R K NLHHVWD II TA KD+Y + ++ + Sbjct: 149 AHFVGDVHQPLHVGFEEDEGGNTIKVHWYRRKENLHHVWDNSIIETAMKDFYNRSLDTMV 208 Query: 181 EDIEGNFTDGIWSDDLASWRECGN-VFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 E ++ N TDG WS+D++ W CGN +C N +A ESI+++C + YK VE TL DDYF Sbjct: 209 EALKMNLTDG-WSEDISHWENCGNKKETCANDYAIESIHLSCNYAYKDVEQDITLGDDYF 267 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVV 275 SR PIV KR+AQ GIRLA++LN +FG + + +V+ Sbjct: 268 YSRYPIVEKRLAQAGIRLALILNRIFGEDKPDGNVI 303 >UniRef50_C3VEY2 Bifunctional nuclease n=2 Tax=rosids RepID=C3VEY2_CUCSA Length = 311 Score = 272 bits (696), Expect = 7e-72, Method: Composition-based stats. Identities = 161/258 (62%), Positives = 199/258 (77%), Gaps = 1/258 (0%) Query: 15 GLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDYE 74 LL EAA AV+ LLPE G+LSA+CVWPDQ+R KY+W SPLH+ +TP +C+F Y+ Sbjct: 50 ELLIPEAAEAVQDLLPESAGGNLSAMCVWPDQIRLQSKYRWASPLHYANTP-DSCSFVYK 108 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 RDCH+ G DMCVAGAI+NFTTQL+ YR D +N+TEALLFLSHF+GDIHQP+HVG Sbjct: 109 RDCHNDAGQPDMCVAGAIRNFTTQLTTYRTQGFDSPHNLTEALLFLSHFVGDIHQPLHVG 168 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 F SDAGGN+I++RWFR KSNLHHVWDR+IIL A DYY KD LL +++ N T GIWS+ Sbjct: 169 FESDAGGNTIEVRWFRRKSNLHHVWDRDIILEALGDYYDKDGGLLLDELNRNLTQGIWSN 228 Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 D++ W C V SCVN++A ES +ACKW Y+GVEAG TLS++Y++SRLPIVM+R+AQGG Sbjct: 229 DVSEWERCSTVNSCVNRWADESTGLACKWAYEGVEAGITLSEEYYDSRLPIVMERLAQGG 288 Query: 255 IRLAMLLNNVFGASQQED 272 +RLAMLLN VF Sbjct: 289 VRLAMLLNRVFAEDATRG 306 >UniRef50_A5A339 Endonuclease n=1 Tax=Glycine max RepID=A5A339_SOYBN Length = 297 Score = 254 bits (649), Expect = 2e-66, Method: Composition-based stats. Identities = 128/270 (47%), Positives = 168/270 (62%), Gaps = 6/270 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GHV+ C+IAQ L++ AA AVK LLP DLS C W D V H Y W S LH Sbjct: 27 WGDDGHVIVCKIAQARLSEAAAEAVKKLLPISAGNDLSTKCSWADHVHHI--YPWASALH 84 Query: 61 FIDTPDKACNFDYERDCHD-QHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 + +TP+ C++ RDC D + G+K CV AI N+TTQL Y + RYN+T++L F Sbjct: 85 YANTPEALCSYKNSRDCVDYKKGIKGRCVVAAINNYTTQLLEY-GSDTKSRYNLTQSLFF 143 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 SHFMGDIHQP+H GF SD GGN+I +RW++ K NLHH+WD I+LT +Y D++ Sbjct: 144 PSHFMGDIHQPLHCGFLSDNGGNAITVRWYKRKQNLHHIWDSTILLTEVDKFYDSDMDEF 203 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNV-FSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + ++ N T +W+D + W CG+ C +A+ES ACKW YK G L+DDY Sbjct: 204 IDALQQNITK-VWADQVEEWENCGDKDLPCPATYASESTIDACKWAYKDATEGSVLNDDY 262 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 F SRLPIV R+AQ G+RLA +LN VF Sbjct: 263 FLSRLPIVNMRLAQAGVRLAAILNRVFEKK 292 >UniRef50_A9U2Y4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U2Y4_PHYPA Length = 284 Score = 239 bits (610), Expect = 8e-62, Method: Composition-based stats. Identities = 117/272 (43%), Positives = 167/272 (61%), Gaps = 11/272 (4%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH +TC IA+ LL + A+ LLP+ NG+L+ LC WPD VR KYKWT LH Sbjct: 23 WGADGHRVTCLIAEPLLYEPTKQAIAALLPKSANGNLADLCTWPDDVRWMDKYKWTRELH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++TP+ C +DY RDCHD G ++C++GAI NFT L + T +R +L Sbjct: 83 WVNTPNHVCKYDYNRDCHDHMGTPNVCISGAINNFTHILWN---HTRNRNMKNGRGILLC 139 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 ++P+H GF SD GGN+I + W+ +S+LHHVWD EI+ A K+ + D ++ Sbjct: 140 C------YEPLHTGFRSDQGGNNISVYWYHRRSDLHHVWDTEIVSKALKENHNSDPEIMA 193 Query: 181 EDIEGNFTDGIWSDDLASWRECGN-VFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 + I N TD W+ ++ +W C N SC + +ATESIN+ACKW Y G G L D+Y+ Sbjct: 194 DSILNNATDN-WASEVDAWGICHNRKLSCPDTYATESINLACKWAYSGAAPGTALGDEYY 252 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 SRLP V R+AQGG+RLA +LN++F + + Sbjct: 253 TSRLPTVELRLAQGGVRLAAILNSIFDPNAPQ 284 >UniRef50_B8MCF5 Nuclease PA3, putative n=2 Tax=Leotiomyceta RepID=B8MCF5_TALSN Length = 363 Score = 234 bits (597), Expect = 2e-60, Method: Composition-based stats. Identities = 84/284 (29%), Positives = 126/284 (44%), Gaps = 20/284 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ L+D A K +L + + L+ + W D R KW++PLH Sbjct: 47 WGTLGHATVAYIAQNYLDDATATWAKGVLGDTSDSYLANIASWADSYRSTSAGKWSAPLH 106 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FI D+P +CN DYERDC C AI N+T ++ R ++ EAL Sbjct: 107 FIDAEDSPPTSCNVDYERDC-----GSSGCSVSAIANYTQRVGDGRLSKANT----AEAL 157 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAKDYYAKDI 176 FL HF+GD+ QP+H D GGN I + + + S NLH WD I D Sbjct: 158 KFLVHFLGDVTQPLH-DEALDRGGNEITVTFDGYDSDNLHSDWDTYIPQKLVGGSTLSDA 216 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECG---NVFSCVNKFATESINIACKWGYKGVEAG-- 231 ++ G + A+W + + + +A+++ C A Sbjct: 217 QTWANELISQIDSGSYKSVAANWIKGDDISDPITSATTWASDANAFVCSVVMPNGVAALQ 276 Query: 232 -ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 L DY+NS +P + ++A+GG RLA LN+++ A + Sbjct: 277 QGDLYPDYYNSVIPTIELQIAKGGYRLANWLNSIYSAHIAKRKR 320 >UniRef50_B6H0E5 Pc12g06500 protein n=2 Tax=Penicillium RepID=B6H0E5_PENCW Length = 344 Score = 233 bits (595), Expect = 4e-60, Method: Composition-based stats. Identities = 80/281 (28%), Positives = 126/281 (44%), Gaps = 19/281 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +AQ ++ EAA + +L + + L+ + W D+ R KW++PLH Sbjct: 21 WGALGHATVAYVAQHYISSEAASWAQGILNDTSSSYLANVASWADKYRLTDDGKWSAPLH 80 Query: 61 FIDT---PDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +ID P K+CN DYERDC D + C A+ N+T++ R T EAL Sbjct: 81 YIDAMDDPPKSCNVDYERDCGD-----EGCSVSAVANYTSRAGDGRLSTDHT----AEAL 131 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL HF+GDI QP+H + GGN ID+ + + NLH WD + D Sbjct: 132 RFLVHFIGDITQPLH-DENYEVGGNGIDVTFDGYDDNLHSDWDTYMPGKLVGGSSLTDAQ 190 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECG---NVFSCVNKFATESINIACKWGYKG---VEAG 231 + + G + + SW E + + ++A+++ C Sbjct: 191 GWADSLVDEINSGTYKEQAKSWIEGDTISDAVTTATRWASDANAFVCTVVMPDGAAALQT 250 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 L Y+NS + + +VA+GG RLA +N ++ +D Sbjct: 251 GDLYPTYYNSAIGTIEMQVAKGGYRLANWINLIYEQKVAKD 291 >UniRef50_A0M3W8 S1/P1 endonuclease family protein n=6 Tax=Bacteroidetes RepID=A0M3W8_GRAFK Length = 260 Score = 227 bits (578), Expect = 3e-58, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 121/266 (45%), Gaps = 29/266 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K GH T IA+ L+++A +A+ LL + L+ + + D ++ +Y+ P H Sbjct: 24 WGKTGHRATAEIAETHLSNKAKNAIDGLLGGHG---LAFVANYADDIKSDPEYREFGPWH 80 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + ++ K + AI+ L +++ L L Sbjct: 81 YVNIDPE------NKKYIEEEANKSGDLVQAIKKCVEVLKDQNSSRDEKQ----FYLKML 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP H G D GGN I +RWF SN+H VWD ++I Y +N Sbjct: 131 VHFVGDLHQPFHTGHAEDKGGNDIQVRWFNEGSNIHRVWDSDMINFYQMSYTELALN--T 188 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 +D+ N I L W ES +A Y GV+ GE L Y Sbjct: 189 KDLSKNQIKAIEKGKLLDWVY-------------ESRAMAEDL-YTGVDNGEKLGYSYMY 234 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +P V++++ +GGIRLA +LN+++ Sbjct: 235 KNMPTVLEQLQKGGIRLAKILNDIYS 260 >UniRef50_P24021 Nuclease S1 n=6 Tax=Leotiomyceta RepID=NUS1_ASPOR Length = 287 Score = 226 bits (575), Expect = 8e-58, Method: Composition-based stats. Identities = 73/276 (26%), Positives = 115/276 (41%), Gaps = 20/276 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ + + +L + L+ + W D ++ +++ P H Sbjct: 21 WGNLGHETVAYIAQSFVASSTESFCQNILGDDSTSYLANVATWADTYKYTDAGEFSKPYH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FI D P ++C DY+RDC C AIQN+T L G+ AL Sbjct: 81 FIDAQDNPPQSCGVDYDRDCGSA-----GCSISAIQNYTNILLESPNGSEALN-----AL 130 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F+ H +GDIHQP+H +AGGN ID+ + +NLHH+WD + AA Y Sbjct: 131 KFVVHIIGDIHQPLH-DENLEAGGNGIDVTYDGETTNLHHIWDTNMPEEAAGGYSLSVAK 189 Query: 178 LLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKGVE---AG 231 + + G +S SW + + S +A ++ C Sbjct: 190 TYADLLTERIKTGTYSSKKDSWTDGIDIKDPVSTSMIWAADANTYVCSTVLDDGLAYINS 249 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 LS +Y++ P+ + +A+ G RLA L+ + Sbjct: 250 TDLSGEYYDKSQPVFEELIAKAGYRLAAWLDLIASQ 285 >UniRef50_B0DXE1 Predicted protein n=4 Tax=Agaricales RepID=B0DXE1_LACBS Length = 317 Score = 224 bits (571), Expect = 2e-57, Method: Composition-based stats. Identities = 82/305 (26%), Positives = 120/305 (39%), Gaps = 48/305 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW--TSP 58 W +GH+ A L A V+ L + L W D VR Y W ++P Sbjct: 20 WGADGHMAVGYTAMQFLAPNALSFVQNSLGSSYSRSLGPAATWADTVRSQAAYSWCASAP 79 Query: 59 LHFID---TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 HF+D P +C+ RDC C+ AI N+TT++ + R+ E Sbjct: 80 FHFVDAEDNPPTSCSVSETRDC-----GSGNCILTAIANYTTRVVQTSLSATQRQ----E 130 Query: 116 ALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKD 175 AL FL HF+GDI QP+HV GGN I ++ +NLH +WD II K Y Sbjct: 131 ALKFLDHFLGDITQPLHV-EALKVGGNDITVKCNGSSTNLHALWDTGIIEGFLKAQYGNS 189 Query: 176 INLLEEDIEGNFTDGIWSDDLASWRECGNV----------------------------FS 207 + + G ++ ASW C + Sbjct: 190 VTTWANSLATRIKTGNFASSKASWIACSDPSAPLSQKRSIQDDIDEFLAARSTAAITPLK 249 Query: 208 CVNKFATESINIACKWGYKGVEAGETL----SDDYFNSRLPIVMKRVAQGGIRLAMLLNN 263 C +A +S C + + G G+ L + Y PI+ +++A+G RLA LN Sbjct: 250 CPLVWAQDSNTFDCSYVF-GFTTGKDLCSGGTSSYAAGAQPIIEEQIAKGAYRLAAWLNV 308 Query: 264 VFGAS 268 +F S Sbjct: 309 LFDGS 313 >UniRef50_D1Z5H6 Whole genome shotgun sequence assembly, scaffold_4 n=10 Tax=Sordariomycetes RepID=D1Z5H6_SORMA Length = 336 Score = 223 bits (569), Expect = 4e-57, Method: Composition-based stats. Identities = 76/290 (26%), Positives = 123/290 (42%), Gaps = 26/290 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH+ +A +++ A + LL L+ + W D +R+ +WT PLH Sbjct: 21 WGGFGHITVAYLASNFVSNTTAAYFQTLLRNDTTDYLANVATWADSIRYTKWGRWTGPLH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D+P +C YERDC + CV AIQN+T+++ +R +A Sbjct: 81 YIDAKDSPPHSCGIVYERDCK-----PEGCVVSAIQNYTSRVLDQSLHVVER----AQAA 131 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA-------AKD 170 F+ HF+GDIHQP+H + GGN I + + + NLHHVWD I Sbjct: 132 KFVIHFVGDIHQPLHT-EDVEKGGNGISVFFDDKRFNLHHVWDSSIAEKIVTHKKHGVGR 190 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASW---RECGNVFSCVNKFATESINIACKWGYKG 227 E + +G + + + W E + ++A E C Sbjct: 191 RPFPAAKKWAEQLAEEIREGQYKANSSEWVKGLELKSASEIALEWAVEGNAHVCTVVLPE 250 Query: 228 VE---AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 + L YF + P+V ++A+ G RLA L+ V A + +++ Sbjct: 251 GPEAIRDQELGGAYFEAAAPVVELQIAKAGYRLAAWLDLVVTAISKNETI 300 >UniRef50_C5K479 Nuclease PA3, putative n=5 Tax=Perkinsus marinus ATCC 50983 RepID=C5K479_9ALVE Length = 337 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. Identities = 90/291 (30%), Positives = 146/291 (50%), Gaps = 32/291 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK---YKWTS 57 W +GH + ++ Q + E A+ ++ + V +S W D+V++ +KW+S Sbjct: 19 WGHDGHAVVAQLGQERIKKETQEALDAIMGKGVP--MSNYSSWADEVKYGPDGNEWKWSS 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 LH+ DTPD C+FDY RDC + D CVAGA++N++ ++ R+ EAL Sbjct: 77 SLHYADTPD--CHFDYARDCKN-----DYCVAGALKNYSRRVVDESLPLEQRQ----EAL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAAKDYYAK 174 F+ HF+GD HQP+H+G D GGN I + + +NLH WD ++I Sbjct: 126 KFIVHFVGDAHQPLHIGKPEDLGGNKIAVHLGFGEKPSTNLHSTWDSKLIYELEDQSDPI 185 Query: 175 DINLL----EEDIEGNF-TDGIWSDDLASWRECGNVFS---CVNKFATESINIACKWGYK 226 D E+ + G ++D++ W E + CV+ + +ES AC + Y+ Sbjct: 186 DGEPSWMITEDAVSDELDKGGKYADEIDDWIEDCEKYGLDVCVDSWLSESSKTACDYSYR 245 Query: 227 GVEAG-----ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 V + L DY+N+R+ +V +++A+GG+RL LLN VF A Sbjct: 246 HVNGSLIVDHDFLPMDYYNNRIEVVKEQLAKGGVRLTWLLNTVFAAQDATP 296 >UniRef50_Q0CD39 Predicted protein n=2 Tax=Aspergillus RepID=Q0CD39_ASPTN Length = 300 Score = 217 bits (553), Expect = 3e-55, Method: Composition-based stats. Identities = 72/287 (25%), Positives = 127/287 (44%), Gaps = 26/287 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A+ L ++ + + LLP N D+S W D+ + +Y T P H Sbjct: 21 WGDVGHRTVAYVAENYLTEDGSKFLDNLLPFSNNFDISDAATWADEQKR--RYPKTKPWH 78 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++D D ++ D C+ A++ T+Q+S Y +N TEA+LFL Sbjct: 79 YVDIKDDP--VHHKCDISSLDCPNGDCIISAMEAMTSQVSEYS-------FNRTEAVLFL 129 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA--------AKDYY 172 HF GD+H P+HV GGN ID+ + NLH +WD ++ D Sbjct: 130 VHFFGDLHMPLHV-EGLCRGGNEIDVSFNGRNDNLHSIWDTDMPHKINGIKHSLKHNDEK 188 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK---GVE 229 + ++ I+ N + + C ++ATES ++ C +K Sbjct: 189 TASLKWAKDLIQKNLHR---PATVTECNDVTQPQKCFKQWATESNHLNCAVVFKRGLQYL 245 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVA 276 + L+ DY+ +P++ +++ + G+RLA +N++ + + VA Sbjct: 246 TTQDLAGDYYEDAVPVIEEQIFKAGVRLATWINSIAEKQHAKAAFVA 292 >UniRef50_O65424 Putative bifunctional nuclease n=2 Tax=Arabidopsis thaliana RepID=O65424_ARATH Length = 362 Score = 216 bits (549), Expect = 8e-55, Method: Composition-based stats. Identities = 108/258 (41%), Positives = 153/258 (59%), Gaps = 38/258 (14%) Query: 14 QGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDY 73 + ++ AVK LLPE NG+L+A+C WPD+++ +++WTS LHF DTPD CN++Y Sbjct: 138 KSYFEEDTVVAVKKLLPESANGELAAVCSWPDEIKKLPQWRWTSALHFADTPDYKCNYEY 197 Query: 74 ERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV 133 +N+TEAL+FLSH+MGDIHQP+H Sbjct: 198 ------------------------------------SHNLTEALMFLSHYMGDIHQPLHE 221 Query: 134 GFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWS 193 GF D GGN I + W+ ++NLH VWD II +A + YY + + +++ +G WS Sbjct: 222 GFIGDLGGNKIKVHWYNQETNLHRVWDDMIIESALETYYNSSLPRMIHELQAKLKNG-WS 280 Query: 194 DDLASWRECG-NVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 +D+ SW C N +C N +A+ESI++ACK+ Y+ AG TL D YF SRLP+V KR+AQ Sbjct: 281 NDVPSWESCQLNQTACPNPYASESIDLACKYAYRNATAGTTLGDYYFVSRLPVVEKRLAQ 340 Query: 253 GGIRLAMLLNNVFGASQQ 270 GGIRLA LN +F A ++ Sbjct: 341 GGIRLAGTLNRIFSAKRK 358 Score = 87.5 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 58/165 (35%), Positives = 75/165 (45%), Gaps = 33/165 (20%) Query: 96 TTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNL 155 +S + YN+TEAL+FLSHF+GDIHQP+HVGF D GGN+I +RW+R K+NL Sbjct: 1 MQLMSASENSDTIVHYNLTEALMFLSHFIGDIHQPLHVGFLGDEGGNTITVRWYRRKTNL 60 Query: 156 HH----------------------------VWDREIILTAAKDYYAKDINLLEEDIEGNF 187 HH VWD II +A K YY K + L+ E ++ N Sbjct: 61 HHVSVCYRMLKEKVIFPDWINYSYDLPMMKVWDNMIIESALKTYYNKSLPLMIEALQANL 120 Query: 188 TDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGE 232 T I S WR + E +A K GE Sbjct: 121 TMTISSLGYPLWRRDLR-----KSYFEEDTVVAVKKLLPESANGE 160 >UniRef50_B0MYD6 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MYD6_9BACT Length = 257 Score = 215 bits (548), Expect = 1e-54, Method: Composition-based stats. Identities = 73/266 (27%), Positives = 109/266 (40%), Gaps = 29/266 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH + IA+ L EAA + +L + W D H +Y +T+ H Sbjct: 21 WGPKGHDVVAYIAECNLTPEAAEKIDKILG---GASMVYWANWLDSASHTPEYAYTATWH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 + + + D + AI +L + + + L L Sbjct: 78 YANVDEGF-------TYETMTKNPDGDIVEAIDRIVAELKGGQLDPAQEQL----YLKML 126 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMH G SD GGNS+ +R+F +SNLH VWD + A K Y + N L Sbjct: 127 VHLVGDLHQPMHTGHLSDRGGNSVPVRFFGRESNLHAVWDSSLPEAAHKWSYTEWQNQL- 185 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + I S W E N C+ Y G LS DY Sbjct: 186 DRLTEEEVARIQSGTPLDWFEESNAI--------------CREIYVATPEGSDLSYDYIA 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 P++ +++ +GG RLA LLN ++G Sbjct: 232 KYAPVIERQLLRGGHRLAGLLNEIYG 257 >UniRef50_A7ETG5 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7ETG5_SCLS1 Length = 283 Score = 214 bits (545), Expect = 2e-54, Method: Composition-based stats. Identities = 72/278 (25%), Positives = 109/278 (39%), Gaps = 23/278 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A + + +MLL L+ + W D R + Sbjct: 21 WGTLGHQTVAYVATNFVAESTRDYFQMLLRNDTGSYLAGVATWADSYRLAALLRLFQR-- 78 Query: 61 FIDTPDKA-CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 F +T A C + RDC + + CV GAI NFT+QL + Sbjct: 79 FFNTEINAACGVKFARDCGE-----EGCVVGAILNFTSQLLDPNVSRYHKYIAAKF---- 129 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 +GDIHQP+H + GGN+I + + ++NLH WD I Y D Sbjct: 130 ----VGDIHQPLHA-ENINIGGNTIKVTFNGKETNLHSFWDTAIPEELVGGYSMADAQEW 184 Query: 180 EEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKG---VEAGET 233 + GI+ SW E G+ + +A +S C V G+ Sbjct: 185 ANVLTTAIKTGIYKSQAKSWLEDMNIGDPLTTALGWAKDSNAFICTTVIPDGAEVLQGKE 244 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 LS +Y+ S +P+V +VA+ G RLA L+ + + E Sbjct: 245 LSGEYYESGIPVVELQVARAGYRLAAWLDMIVRGIKTE 282 >UniRef50_C5K482 Nuclease PA3, putative n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5K482_9ALVE Length = 328 Score = 213 bits (543), Expect = 4e-54, Method: Composition-based stats. Identities = 86/283 (30%), Positives = 144/283 (50%), Gaps = 32/283 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK---YKWTS 57 W +GH + ++ Q +N E A+ ++ + V + W D V++ +KW+S Sbjct: 19 WGHDGHAVVAQLGQERINKETQEAIDAIMGKGVP--MYNYSSWADDVKYGPDGNEWKWSS 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLH+ DTPD C+FDY RDC + D CVAGA++N++ ++ R+ EAL Sbjct: 77 PLHYADTPD--CHFDYARDCKN-----DYCVAGALKNYSRRVVDESLPLEQRQ----EAL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREII-LTAAKDYYA 173 F+ HF+GD HQP+H G D GGN ID+ +NLH WD ++ + + A Sbjct: 126 KFIVHFVGDAHQPLHAGNPKDRGGNKIDVSLGFARHQHTNLHSTWDSALLYEFQGRGHRA 185 Query: 174 KDINLL---EEDIEGNF-TDGIWSDDLASWRECGNVF---SCVNKFATESINIACKWGYK 226 + E+ I+ G ++ D+ W E + +C+ K+ E+ AC++ YK Sbjct: 186 RGAPYWTVTEDAIDDELDKGGRYAGDVDDWVEDCEKYGYDACIEKWVDETAKAACEYSYK 245 Query: 227 GVEAGETLSDD-----YFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + + +D Y++ R+ + +++A+ GIRL LLNN+ Sbjct: 246 HMNGSRVVDNDYLPMKYYDGRIEVAKEQLAKAGIRLTWLLNNL 288 >UniRef50_A3XR21 Putative S1/P1 Nuclease n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XR21_9FLAO Length = 263 Score = 213 bits (542), Expect = 6e-54, Method: Composition-based stats. Identities = 65/266 (24%), Positives = 109/266 (40%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH T IA L A++ LL + L + + D+++ + +Y+ S H Sbjct: 28 WGSKGHRATAAIAVKYLKPRTKKAIEKLLG---DETLVTVSTYGDEIKSYEEYRKYSSWH 84 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + + I ++ ++R L L Sbjct: 85 YVNI-------APGLSYAEADKNEYGDLVQGINTCKEVITSEDATIEEKR----FYLKML 133 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP+H+G D GGN +RWF + +NLH +WD ++I + Y N Sbjct: 134 VHFIGDLHQPLHLGHAEDKGGNDFQVRWFNNGTNLHSLWDSKLIESYGMSYSELATN--F 191 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + I DL W G + + + Y E GE LS Y Sbjct: 192 GQVSKKQFKEISKGDLMDWVSEGQILA--------------EKVYDSAEIGEKLSYRYQA 237 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +V +++ +GG+RLA LLN +F Sbjct: 238 DYNQMVQEQLQKGGVRLAALLNELFD 263 >UniRef50_Q7S8Q5 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S8Q5_NEUCR Length = 306 Score = 213 bits (541), Expect = 7e-54, Method: Composition-based stats. Identities = 68/290 (23%), Positives = 115/290 (39%), Gaps = 32/290 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH-WYKYKWTSPL 59 W K GH +AQ L V+ +L + + + W D R+ W++ L Sbjct: 20 WGKLGHATVASVAQQYLTPNTVKQVQTILGDNSTSYMGNIASWADSFRYESAANAWSAGL 79 Query: 60 HFID----TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 HF++ P ++C+ DC + CV AI N+T ++ + + Sbjct: 80 HFVNGHDGPPPESCHLVLPEDC-----PPEGCVVSAIGNYTERVQMKNITADQK----AQ 130 Query: 116 ALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK------ 169 AL F+ HF+GDI QP+H + G N+I + + +K+NLH WD I Sbjct: 131 ALKFIVHFLGDIAQPLHTEGFGE-GANNITVTFQGYKTNLHAAWDTSIPNAMLGISPPTS 189 Query: 170 --DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS------CVNKFATESINIAC 221 + + D ++ G + D+ W +V + +A + C Sbjct: 190 AANITSADFLGWANNLAAKINQGQYRKDVRRWLRYHSVATRKASERAAAAWAQDGNEEVC 249 Query: 222 KWGYK---GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + K G + DY+ +V + + +GGIRLA LN +F Sbjct: 250 HYVMKVPGNQLNGTEIGGDYYKGATEVVERSIIKGGIRLAGWLNLIFDNR 299 >UniRef50_Q2SFD4 Probable endonuclease n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFD4_HAHCH Length = 304 Score = 212 bits (539), Expect = 1e-53, Method: Composition-based stats. Identities = 68/271 (25%), Positives = 111/271 (40%), Gaps = 20/271 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH + C +A L+ A V+ LL + + C+WPDQVR ++K T H Sbjct: 50 WGELGHRVVCDVAWKELSPVARDQVQKLLQQAGKRTFAEACLWPDQVRSEKEFKHTGSYH 109 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ A +C + CV A+ + L E + +AL+F+ Sbjct: 110 YVNVERAAKRVSTAENCESK-----GCVLTALNAYAEALKG--EPRQGYQATPAQALMFI 162 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GDIHQP+HV + D GGN + + ++NLH +WD I + + K + Sbjct: 163 GHFIGDIHQPLHVSYADDRGGNKVVYKVAGEETNLHRLWDVNIPESGLPRDWRKAGKKVR 222 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 G + +A ES+ I K G S Sbjct: 223 GKHRGETVTAL-------------SLQEAEAWANESLAITRKVYESLPPQGSEWSKKDLA 269 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 P+ R+ Q G+RL +LN + ++Q + Sbjct: 270 REYPVAEMRLYQAGVRLGAVLNQLLASNQDQ 300 >UniRef50_D1ZIR6 Whole genome shotgun sequence assembly, scaffold_39 n=1 Tax=Sordaria macrospora RepID=D1ZIR6_SORMA Length = 309 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 116/294 (39%), Gaps = 36/294 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K GH +AQ L V+ +L + + + W D R+ W+S LH Sbjct: 19 WGKLGHATVASVAQQYLTPNTVKQVQAILGDKSTTYMGNIASWADSFRYEEGNAWSSGLH 78 Query: 61 FID----TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 F++ P ++C+ DC + CV AI N+T ++ + R T+A Sbjct: 79 FVNGHDAPPPESCHLILPEDC-----PPEGCVVSAIGNYTERVQNKELAAEQR----TQA 129 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK------- 169 L F+ HF+GDI QP+H + G N++ + + +K+NLH WD I T Sbjct: 130 LKFIIHFLGDIAQPLHTEAFGE-GANNVTVFFDGYKTNLHAAWDTSIPNTMLGISPPTSA 188 Query: 170 -DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC-------VNKFATESINIAC 221 + D ++ G + D+ W + + +A + C Sbjct: 189 ANITNADFLGWANNLAAKINQGSYRRDVRRWLRNHRLPANRKGAERAAAAWAQDGNEEVC 248 Query: 222 KWGYK---GVEAGETLSD----DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + K G + DY+ +V + + +GGIRLA LN +F Sbjct: 249 HYVMKIPGNQLNGTEIGAGAGGDYYKGAAEVVERSIIKGGIRLAGWLNLIFDKR 302 >UniRef50_B7FP92 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FP92_PHATR Length = 308 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 95/308 (30%), Positives = 147/308 (47%), Gaps = 43/308 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------LSALCVWPDQVRHWYKY 53 W KEGH + +A LL++++ AV+ +L + D L + W D VR ++Y Sbjct: 6 WGKEGHEVVGNLAWKLLSEQSQSAVRNILQDVPIPDNCTACSPLGQVADWADTVRRTHEY 65 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTS---DRR 110 W+ PLH++D C F+YERDC + D+CVAGA+ N+T L +R + Sbjct: 66 FWSGPLHYVDISQDECRFEYERDCAN-----DICVAGAVVNYTRHLQKFRRDETREYGDE 120 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK------------------ 152 + ++L+FL+HF+GD+HQP+HV +SD GGNSI + + Sbjct: 121 LLVRDSLMFLTHFVGDLHQPLHVSRSSDRGGNSIHVVYSPGNADTAPKDGRLGYLRAGRH 180 Query: 153 ---SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGN--VFS 207 NLH VWD II T K Y + L E+ + + + W C N + Sbjct: 181 HHVDNLHAVWDTGIIETCVKLNYKESRVLWEKVLYERIIQAQGTGEWDVWTSCPNGAQQT 240 Query: 208 CVNKFATESINIACKWGYKGVEA-----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLN 262 CV++++ +S+ A W Y+ V+ G LS Y+ +RLP V ++ RLA L Sbjct: 241 CVSEWSEQSLEYALIWAYRNVDGTAIGDGTHLSHAYYETRLPFVEHQLTVAAARLATTLE 300 Query: 263 NVFGASQQ 270 F + Sbjct: 301 ISFTQNVA 308 >UniRef50_Q3IBZ8 Putative S1/P1 Nuclease n=2 Tax=Alteromonadales RepID=Q3IBZ8_PSEHT Length = 288 Score = 208 bits (529), Expect = 2e-52, Method: Composition-based stats. Identities = 72/271 (26%), Positives = 114/271 (42%), Gaps = 30/271 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W + GH + +IA+ L++ LLP N L+ + WPD++R W +S Sbjct: 27 WGQNGHRIIAKIAESHLSETTKT---KLLPLLNNESLAQVSTWPDEMRSAPGEFWQRKSS 83 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+T ++ V + I L + ++ +L Sbjct: 84 RWHYINTSANKPISLNHSHTKNKESVT--NILEGIHYSIKVLQDEQSSLDAKQ----FSL 137 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL H +GD HQP H G D GGN+I ++ F ++NLH +WD ++I Y Sbjct: 138 RFLVHLVGDSHQPFHAGRADDRGGNNIKVKHFGQETNLHSLWDSKLIEGENLSY------ 191 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 F D I +++ + S + ES N+A + +S Sbjct: 192 -------TEFADFINTNNQT--LISEYLTSTPTSWLVESNNLAESIY---NKNETNISYS 239 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 Y +PI+ R+ QGGIRLA LLN++F S Sbjct: 240 YIFDHMPIIKTRLQQGGIRLAGLLNSLFDES 270 >UniRef50_B8NJ54 Nuclease S1, putative n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8NJ54_ASPFN Length = 320 Score = 207 bits (527), Expect = 3e-52, Method: Composition-based stats. Identities = 71/305 (23%), Positives = 112/305 (36%), Gaps = 43/305 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ + + +L + L+ + W D ++ +++ P H Sbjct: 21 WGNLGHETVAYIAQSFVASPTESFCQDILGDDSTSYLANVATWADTYKYTDAGEFSKPYH 80 Query: 61 FID---TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLS----------------- 100 FID P ++C DY+RDC C AIQN+ + Sbjct: 81 FIDAQDNPPQSCGVDYDRDCGSA-----GCSISAIQNYVSYFRVYNNIGCSSYLDQYSPG 135 Query: 101 -----------HYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF 149 R S R +S +GD HQP+H +AGGN ID+ + Sbjct: 136 ISQWLGGVECPEIRGSCSSRPLTGLIRFPNMSQIIGDTHQPLH-DENLEAGGNGIDVTYD 194 Query: 150 RHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWREC---GNVF 206 +NLHH+WD + AA Y + + G +S SW E + Sbjct: 195 GETTNLHHIWDTNMPEEAAGGYSLSVAKTYADLLTERIKTGTYSSKKDSWTEGIDIKDPV 254 Query: 207 SCVNKFATESINIACKWGYKGVE---AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNN 263 S +A ++ C LS +Y++ P+ + +A+ G RLA L+ Sbjct: 255 STSMIWAADANTYVCSTVLDDGLAYINSTDLSGEYYDKSQPVFEELIAKAGYRLAAWLDL 314 Query: 264 VFGAS 268 + S Sbjct: 315 IASQS 319 >UniRef50_A6EB04 Putative S1/P1 Nuclease n=1 Tax=Pedobacter sp. BAL39 RepID=A6EB04_9SPHI Length = 250 Score = 206 bits (523), Expect = 9e-52, Method: Composition-based stats. Identities = 66/266 (24%), Positives = 105/266 (39%), Gaps = 26/266 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + +IA+ L+ +A VK +L L+ W D ++ Y + H Sbjct: 11 WGMLGHRIVGQIAEAHLSKKALKGVKGVLGN---ETLAMASNWGDFIKSDTSYNYLYNWH 67 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F++ P D + + V++ V + L + A+ L Sbjct: 68 FVNLP---AGLDKQGVFNVLDKVQEPNVYNKVPEMVAILKDNNSSAEQK----VFAMRML 120 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD++QPMH D GGN + + WF KSNLH VWD +I Y Sbjct: 121 VHLIGDLNQPMHTARKDDLGGNKVAVTWFGEKSNLHRVWDEGLIEYQQLSYTE------- 173 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 D + LASW + S AC Y + + LS Y Sbjct: 174 ---YAKAIDYPSTAQLASWNGLSL-----RDYVYGSYE-ACNQIYAKTKGDDKLSYQYNF 224 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 + L ++ +++ +GGI LA +LN ++ Sbjct: 225 NFLKLLNEQLLKGGICLANVLNEIYK 250 >UniRef50_Q0AMT2 S1/P1 nuclease n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMT2_MARMM Length = 299 Score = 205 bits (522), Expect = 1e-51, Method: Composition-based stats. Identities = 82/282 (29%), Positives = 125/282 (44%), Gaps = 21/282 (7%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-LSALCVWPDQVRHWYKYKWTSPLH 60 +GH + C +A L+DE + L+ + D +C W D VR ++ T+P H Sbjct: 27 GPDGHRIVCDLAWRYLSDETRTEIDRLVAQDPEFDHFRDVCSWADDVRGS-THRHTAPWH 85 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +I+ + D E DC + D C+ AI DR EAL FL Sbjct: 86 YINQTRDDPHVDAE-DCAE-----DGCITSAIDLHAGIFVDRSRSDEDR----LEALKFL 135 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRH-KSNLHHVWDREIILTAAKDYYAKDINLL 179 +H+MGDIHQP+HV D GGN I++ W ++NLH VWD EI+L + Sbjct: 136 AHWMGDIHQPLHVSIEGDRGGNDINVLWRGERRTNLHRVWDSEILLDYM---AETWPYID 192 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK----WGYKGVEAGETLS 235 + D D + +D + + +A ES +I + + E Sbjct: 193 DGDRWAQLADQLAADIPLNGISVYTPLA-PVDWAQESHDIVRSRGFAYYWARAEEMIEPG 251 Query: 236 DDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 D Y++ LP+ ++R+ QGG+RLA LLN + Q + T Sbjct: 252 DAYYDRNLPVSLQRLKQGGVRLAGLLNQLVEERQLSGTGAVT 293 >UniRef50_C7PH62 S1/P1 nuclease n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PH62_CHIPD Length = 266 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 71/268 (26%), Positives = 109/268 (40%), Gaps = 27/268 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH--WYKYKWTSP 58 W GH + IA L +A A+ LL ++ + WPD ++ +KY TSP Sbjct: 24 WGVTGHRVVAEIASRHLTPQARKAIIALLGP---QSMAMVANWPDFIKSDTTHKYDHTSP 80 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H++D P + D + + + +L +D+ AL Sbjct: 81 WHYLDFPANVDRVHF--DEVLKEHTTGENLYAQTEALIKKLKDPATSKADK----VFALT 134 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 FL H +GD+HQP+H+G D GGN I + WF +SNLH VWD ++I Y L Sbjct: 135 FLIHMIGDMHQPLHIGRDEDQGGNKIPVMWFDKQSNLHRVWDEQLIEFQQLSYTEYTQAL 194 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + + S +A W N S Y A + LS Y Sbjct: 195 --DTASAAEVRKLQSGSIADWMYDSNQLS--------------NKVYALTHANDKLSYRY 238 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 + + ++ +GG+RLA LLN ++ Sbjct: 239 NYWFIADLNGQLLKGGLRLAALLNQIYK 266 >UniRef50_P24504 Nuclease PA3 n=2 Tax=Penicillium RepID=NUP3_PENSQ Length = 270 Score = 203 bits (517), Expect = 4e-51, Method: Composition-based stats. Identities = 74/280 (26%), Positives = 123/280 (43%), Gaps = 19/280 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +AQ ++ EAA + +L + L+++ W D+ R KW++ LH Sbjct: 1 WGALGHATVAYVAQHYVSPEAASWAQGILGSSSSSYLASIASWADEYRLTSAGKWSASLH 60 Query: 61 FIDTPDKA---CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FID D CN DYERDC C AI N+T ++S + + EAL Sbjct: 61 FIDAEDNPPTNCNVDYERDC-----GSSGCSISAIANYTQRVSDSSLSSEN----HAEAL 111 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL HF+GD+ QP+H GGN I++ + + NLH WD + + D Sbjct: 112 RFLVHFIGDMTQPLH-DEAYAVGGNKINVTFDGYHDNLHSDWDTYMPQKLIGGHALSDAE 170 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGN---VFSCVNKFATESINIACKWGYKG---VEAG 231 + + N G ++ W + N + ++A+++ + C Sbjct: 171 SWAKTLVQNIESGNYTAQATGWIKGDNISEPITTATRWASDANALVCTVVMPHGAAALQT 230 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 L Y++S + + ++A+GG RLA +N + G+ + Sbjct: 231 GDLYPTYYDSVIDTIELQIAKGGYRLANWINEIHGSEIAK 270 >UniRef50_Q4DEV4 Class I nuclease-like protein, putative n=2 Tax=Trypanosoma cruzi RepID=Q4DEV4_TRYCR Length = 333 Score = 203 bits (517), Expect = 5e-51, Method: Composition-based stats. Identities = 62/284 (21%), Positives = 103/284 (36%), Gaps = 28/284 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDE-------AAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKY 53 W GH++ IA+ L+ E AA + P D W D ++ Sbjct: 28 WWCNGHMLVNEIARRRLHPEVALIVEEAAVNLSASGPFPHTTDFVESGCWADDIK-KLGL 86 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 H+IDTP N + +++ + ++ L Y M Sbjct: 87 FVMEDWHYIDTPYNPQNINIKKNPVNTEN---------LKTVIESLKRTLMKQDLVPYIM 137 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILT 166 + A++ ++HF+GDIHQP+H D GGN+ + LH +WD Sbjct: 138 SFAIVNIAHFLGDIHQPLHAVELFSPEYPHGDRGGNAETVIVHGKMMALHSLWDSIC--Q 195 Query: 167 AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + ++ F D + D + + + A ES +IA + Y Sbjct: 196 GDVKNPRRPLDRWHYAKLREFADRLE--DTYKFPAEVKNETNTTQMAMESYDIAVQVAYP 253 Query: 227 GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 G G ++D+Y RV G RLA +LN + +Q+ Sbjct: 254 GFVDGAKITDEYLEKCRAAAESRVVLAGYRLANVLNQLLDKTQK 297 >UniRef50_B2W4S8 Nuclease PA3 n=2 Tax=Pleosporineae RepID=B2W4S8_PYRTR Length = 312 Score = 203 bits (516), Expect = 5e-51, Method: Composition-based stats. Identities = 71/284 (25%), Positives = 115/284 (40%), Gaps = 22/284 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ + H +A+ + + +L NG + W D H + ++ H Sbjct: 19 WNTDVHNQIGFMAETFFTPQTTLILAKILEPKYNGSVGRAAAWADGYAHTSEGHFSYQWH 78 Query: 61 FIDTPDK---ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSD------RRY 111 +IDT D +C+ DY RDC K CV AI N T L D Sbjct: 79 WIDTHDNQPESCHLDYVRDCA-----KGGCVVSAIANQTGILRECITQVQDGKLAGGTNL 133 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILT---AA 168 + AL +++HF+GDIHQP+H + GGN+ + + H + LH VWD I A+ Sbjct: 134 TCSYALKWVAHFLGDIHQPLHASGRA-VGGNTYKVVFGNHSTQLHAVWDGFIPYYAAEAS 192 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGY 225 + + ++ D+ + W C C +A ES C + Y Sbjct: 193 HPFSNQSLDPFFADLVTRIRKDQFYSAPYMWLSCTNPSTPIDCATAWARESNKWDCDYVY 252 Query: 226 KGVEAGETL-SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 V+ L ++ Y +PIV ++++ +RL LN + S Sbjct: 253 SRVQNDTDLGTNGYAAGAVPIVELQISKAALRLGTWLNKLVEGS 296 >UniRef50_C8WD33 S1/P1 nuclease n=5 Tax=Alphaproteobacteria RepID=C8WD33_ZYMMN Length = 319 Score = 202 bits (514), Expect = 9e-51, Method: Composition-based stats. Identities = 66/288 (22%), Positives = 107/288 (37%), Gaps = 38/288 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRHWYKYKWT 56 W EGH +A + V +L + D + W D+ R T Sbjct: 33 WGMEGHEAIAALAWKYMTPTTRKKVNAILAMDHDRLTEPDFMSRATWADKWRSAGHG-ET 91 Query: 57 SPLHFIDTPDKACNFD------YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRR 110 P HF+D N R ++G CV + F +LS + DR Sbjct: 92 EPWHFVDIEIDNPNLVTACAAASNRSNPMKNGGAQPCVVSQLDRFERELSSKQTSDQDRV 151 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAK 169 AL ++ HF+GD+HQP+H D GGN + + +S NLH WD Sbjct: 152 L----ALKYVLHFVGDLHQPLHAADHDDRGGNCVKVSINNARSLNLHSYWDT-------- 199 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 Y K+I+ + + + I +D SW V ++A ES + ++ Y Sbjct: 200 -YVVKEIDPDPQHLADSLKKEISPEDKKSW-----VLGDSKQWAMESFQLGKRYAYSFNP 253 Query: 230 --------AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 L Y ++ + ++ + G+RLA +LN+ + Sbjct: 254 PAGCDATRPPIPLPAGYDSAARKVAASQLKKAGVRLAYILNHRLRSIP 301 >UniRef50_C6XYC1 S1/P1 nuclease n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XYC1_PEDHD Length = 268 Score = 202 bits (513), Expect = 1e-50, Method: Composition-based stats. Identities = 69/266 (25%), Positives = 110/266 (41%), Gaps = 26/266 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + +IA+G L+++A +K +L L+ W D ++ Y + H Sbjct: 29 WGMLGHRIVGQIAEGYLSNKAKKGIKDVLGN---ESLAMASNWGDFIKSDPAYDYLYNWH 85 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F++ P D + V I L + + ++R A+ L Sbjct: 86 FVNLP---AGLDKQGVFDQLDKETSPNVYNKIPEMAAVLKNRQSTAEEKRL----AMRLL 138 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD++QPMH D GGN + + WF KSNLH VWD +I Y Sbjct: 139 IHLVGDLNQPMHTARKEDLGGNKVFVTWFGEKSNLHRVWDEGLIEYQQLSYTE------- 191 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 N + +D L SWR F S AC Y ++ E LS Y Sbjct: 192 ---YANAINYPSNDQLNSWRNNSL-----KDFVYGSYQ-ACNRIYADIKPEERLSYKYNF 242 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 + ++ +++ +GGI LA +LN+++ Sbjct: 243 EFVGLLNEQLLKGGICLANMLNDIYK 268 >UniRef50_A4BZ60 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriaceae RepID=A4BZ60_9FLAO Length = 260 Score = 202 bits (513), Expect = 1e-50, Method: Composition-based stats. Identities = 65/266 (24%), Positives = 104/266 (39%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH T IA+ LN A + LL L+ + + D+++ Y + H Sbjct: 25 WGQNGHRATGEIAESHLNKRAKRKIDKLL---NGQSLAFVSTYADEIKSDKAYSEYASWH 81 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + + I L + + L L Sbjct: 82 YVNM-------NLDETYATAAKNTKGDLITGINTCIAVLKDKSS----SSEDKSFHLKML 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMH+G D GGNS+ + WF +SNLH VWD ++I Y + Sbjct: 131 IHLVGDLHQPMHIGRKEDKGGNSVKVEWFGKRSNLHAVWDTKMIEGWNMSYLE--LAESA 188 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + I + L W I+ K Y V+A + +S Y Sbjct: 189 KKVSKEQIAAIEAGTLLDWVAE--------------IHEVTKKVYNSVDANKGISYRYSY 234 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 IV ++ GGIRLA +LN++F Sbjct: 235 DHFDIVRDQLQIGGIRLAKILNDIFS 260 >UniRef50_Q15ZB2 S1/P1 nuclease n=4 Tax=Alteromonadales RepID=Q15ZB2_PSEA6 Length = 256 Score = 201 bits (512), Expect = 1e-50, Method: Composition-based stats. Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 35/269 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W + GH +T IAQ L +A A+ LLP DL+ +PD++R W Sbjct: 20 WGQIGHRVTGAIAQQHLTPQAQAAISALLP---TEDLAEASTYPDEMRSSPDDFWQKKAG 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ P D + A++ FT L+ + ++++ AL Sbjct: 77 PFHYVTIPKGQ-------TYADVGAPEQGDGVSALKMFTANLTSSQTSKAEKQL----AL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F+ H +GD+HQP+H G +D GGN + +F SNLH VWD E++ Y Sbjct: 126 RFIVHIIGDLHQPLHAGNGTDRGGNDFKVNFFWQDSNLHRVWDSELLDQRQLSYTEWTA- 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 I + D+ W + + ES+ I Y E T+S D Sbjct: 185 --------ILNRKISAQDINDW-----NTTDPKVWIAESVKI-RDEIYPSQE---TISWD 227 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 Y LP +R+ GIR+A LN ++ Sbjct: 228 YLYHHLPQAKQRLKMAGIRIAAYLNEIYK 256 >UniRef50_C6X5W4 S1/P1 endonuclease family protein n=3 Tax=Bacteroidetes RepID=C6X5W4_FLAB3 Length = 263 Score = 201 bits (512), Expect = 2e-50, Method: Composition-based stats. Identities = 62/266 (23%), Positives = 108/266 (40%), Gaps = 28/266 (10%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW--TSPL 59 GH + IA+ L+++A +K ++ L+ WPD ++ W T Sbjct: 24 GVTGHRVVAEIAENHLSNKARKNLKKIIGNQK---LAYWANWPDAIKSDTTGVWKQTDTW 80 Query: 60 HFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 H+++ + D + + I+ + Q+ + DR AL F Sbjct: 81 HYVNI---SPQADLKSFSDSLQAQTGPNLYTQIKTLSAQIKDKKTSAKDREI----ALRF 133 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 L H +GD QPMHVG D GGN+I L++F +NLH +WD +++ Y + Sbjct: 134 LIHLVGDSSQPMHVGRAGDLGGNTIKLKFFGENTNLHSLWDSKLVDFQKYSYEE--FAKV 191 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 + I S L W ++ + Y A ++ S DY Sbjct: 192 LDVKSKEEVRAIQSGTLEEWFYDSHLKA--------------NNIYANTVADKSYSYDYN 237 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVF 265 P++ +++ GG+RLA +LN++ Sbjct: 238 YKYAPLLERQLLYGGLRLAKILNDIL 263 >UniRef50_C5KMC3 Nuclease PA3, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KMC3_9ALVE Length = 367 Score = 200 bits (509), Expect = 4e-50, Method: Composition-based stats. Identities = 83/291 (28%), Positives = 138/291 (47%), Gaps = 43/291 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH---WYKYKWTS 57 W +GH + +A ++ +A V ++ E L+ W D + + ++ W+ Sbjct: 19 WGPDGHAVVAELADTRMSSKARKWVYDIMGE--GYRLATSASWADSILYGNNSGEWSWSK 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLH+ + D C F Y RDC + ++CVAGAI+N+T QL++ R+ +A+ Sbjct: 77 PLHYANVDD--CEFVYARDCPN-----NVCVAGAIKNYTAQLTNTSLTKEQRQ----DAV 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAA------ 168 FL HFMGD+H+P++ G +D GGN+I + K+NLH VW ++I Sbjct: 126 KFLVHFMGDVHEPLNAGRYTDLGGNTISVAINFADYEKTNLHKVWGEKLIDEYEGELYPG 185 Query: 169 ----------KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS---CVNKFATE 215 KD +E G + G ++ + SW+ CVN+ E Sbjct: 186 PYIQQDADYNKDRTQYWSVSADEIGRGLASGGKYAGKVPSWKSKCESLGIDVCVNEMVQE 245 Query: 216 SINIACKWGYKGVEAGETLSDD-----YFNSRLPIVMKRVAQGGIRLAMLL 261 S +AC Y V+ + +DD Y+ SR+ V +++A+G +RLA +L Sbjct: 246 SATLACNQAYVNVDGSQIGNDDGLLMGYYTSRIETVKEQLAKGAVRLAWVL 296 >UniRef50_Q5FP59 Nuclease S1 n=1 Tax=Gluconobacter oxydans RepID=Q5FP59_GLUOX Length = 300 Score = 200 bits (507), Expect = 6e-50, Method: Composition-based stats. Identities = 77/282 (27%), Positives = 113/282 (40%), Gaps = 26/282 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSP-- 58 W GH + IAQ L +A A LL + L + WPD + H K K +P Sbjct: 25 WGPYGHAIVADIAQERLTPQAQKAATALLALENHQTLDQVASWPDTIGHVPKKKGGAPET 84 Query: 59 --LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 H++D +D RDC D +CV + L+ DR A Sbjct: 85 LKWHYVDIDVSHPAYDQARDCPDH-----VCVVEKLPEEIKILADTHASAQDR----LTA 135 Query: 117 LLFLSHFMGDIHQPMHVGF-TSDAGGNSIDLRWFRHK----SNLHHVWDREIIL---TAA 168 L ++ H +GDIHQP+H D GGN+I L +F NLH +WD +I Sbjct: 136 LKWVVHLVGDIHQPLHAAERNKDMGGNAIRLTYFGDNANGHMNLHSLWDEGVIDHEADLH 195 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASW---RECGNVFSCVNKFATESINIACKWGY 225 + + I D+ W + +V++ +A ES ++A Y Sbjct: 196 VGPFYSIDASRAKKEADRLGALITPDETKYWVQDLDGDDVYNATVDWADESHSLARSVAY 255 Query: 226 KGVEA--GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 + A G + DY PI+ R+ Q G+RLA +LN Sbjct: 256 GALPANKGADIGKDYTALTWPIMELRLEQAGVRLAAVLNTAL 297 >UniRef50_A4C4V1 Putative S1/P1 Nuclease n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C4V1_9GAMM Length = 290 Score = 199 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 67/271 (24%), Positives = 107/271 (39%), Gaps = 29/271 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSP-- 58 W++ GH + +IA+ L D+ A+ LL L + W D++R W Sbjct: 28 WAQNGHRVVGQIAENHLTDKTKMAIAHLLEGDK---LPEVTTWADEMRSDPSKFWKKESV 84 Query: 59 -LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+ ++A +F R + AI L + +R Sbjct: 85 IWHYINI-NEAEDFKPNRYRITATKGEVTDAYSAILKSIAVLQSEQTSLDKKR----FYF 139 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL+H +GDIHQPMHVG D GGN + +++F +NLH +WD++++ + Sbjct: 140 RFLTHVVGDIHQPMHVGRKDDRGGNDVKVKYFNKDTNLHSLWDKDLLEGENLSFSEYAY- 198 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 + + + + ES +IA K S Sbjct: 199 -FIDTTNKELISQYLASE-------------PKDWVLESFHIAKKLY---EVDDGNFSYS 241 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 Y + + R+ QGGIRLA LLN +F S Sbjct: 242 YVYEQKNTMNTRLLQGGIRLAGLLNAIFDPS 272 >UniRef50_A4HI96 p1/s1 nuclease n=10 Tax=Leishmania RepID=A4HI96_LEIBR Length = 328 Score = 196 bits (499), Expect = 4e-49, Method: Composition-based stats. Identities = 62/290 (21%), Positives = 107/290 (36%), Gaps = 35/290 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDLSALCVWPDQVRHWYKY 53 W GH++ IA+ L+ ++ + P ++ D+ WPD V+ W + Sbjct: 31 WGCTGHMVLAEIARRQLDPSNEKKIQAMAMKFKESGPFLLSPDMIQAACWPDDVKRWGQ- 89 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 S H+ + V+ + + T LS+ R Y + Sbjct: 90 DAMSTWHYYAMQYNPDGINIT------DSVEAVNAVSVSLDMITSLSNVRSP----LYML 139 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREI--- 163 A ++L H +GD+HQP+H D GGN + +R LH WD Sbjct: 140 NFAWVYLVHLIGDLHQPLHAVSRYSEKYPHGDRGGNLVWVRVQTKMLRLHAFWDNICTAT 199 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 + + + D+ + E + +S DL + V + A ES A Sbjct: 200 PVLYRRPLSSTDLLAISETADRLLKTYSFSSDLKT-------MQDVQRMANESYAFAVNS 252 Query: 224 GYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 Y + G TLS Y + + + R+ GG RL +LN + +++ Sbjct: 253 SYADMIPGTTLSAAYISRCVEVAESRLTLGGYRLGYILNKLLSDIDVDEN 302 >UniRef50_B0DTT7 Predicted protein n=2 Tax=Laccaria bicolor S238N-H82 RepID=B0DTT7_LACBS Length = 357 Score = 196 bits (499), Expect = 5e-49, Method: Composition-based stats. Identities = 75/338 (22%), Positives = 124/338 (36%), Gaps = 71/338 (21%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRHWYK 52 W GH + IAQ L+ + ++ P ++ + W D R+ Sbjct: 23 WGFAGHEIVATIAQIYLHPTVLPTLCTIIDFSSTNFSPPDSTCHIAPIATWAD--RYKSN 80 Query: 53 YKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSD 108 W++ LHFI D P +C F + G K + V ++N T L + Sbjct: 81 MTWSAQLHFIGALDDHPPSSCAFPGKNGWA---GTKRVNVLDGMKNVTALLQGW-VKGET 136 Query: 109 RRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 EAL FL HF GD HQPMH+ + GGN + + + ++NLH VWD +I A Sbjct: 137 SDDAANEALKFLIHFFGDAHQPMHMT-GRERGGNQVKVAFGGKETNLHGVWDDSLITKAI 195 Query: 169 KDYYAK-----DINLLEEDIEGNFTD------------GIWSDDLASWRECGNVFS---- 207 +E+ + G+ D W+D++ W C +V Sbjct: 196 STIPQNYTLPLPYPEIEQALRGSSYDPYIRRIIWEGIVQRWADEIPGWLSCPDVVKRTSV 255 Query: 208 -----------------------CVNKFATESINIACKWGYKGVEAGETL------SDDY 238 C ++ + ++ C + + L + Y Sbjct: 256 DSQVALGLGGTTGIEILPDNDVLCPYHWSRPTHDLLCDGVWPKEDDNPQLPLLELDTPAY 315 Query: 239 --FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 + +V K++A GG+RLA +LN +F Q + Sbjct: 316 SGMIGQRWLVEKQLALGGLRLAGILNYIFVNQGQRGAF 353 >UniRef50_Q3BPV9 Endonuclease S1 n=15 Tax=Bacteria RepID=Q3BPV9_XANC5 Length = 318 Score = 194 bits (493), Expect = 3e-48, Method: Composition-based stats. Identities = 64/258 (24%), Positives = 99/258 (38%), Gaps = 27/258 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY--KYKWTSP 58 W +GH + RIA+ L+ +A V LL + L + W D++R K + P Sbjct: 74 WGPQGHRLVARIAETELSPQARTQVAQLLAGEPDPTLHGVATWADELREHDPDLGKRSGP 133 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+++ + C + RDC D CV A+ L+ + RR +AL Sbjct: 134 WHYVNLGEHDCTYSPPRDCPD-----GNCVIAALDQQAALLADRTQPLDVRR----QALK 184 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 F+ HF+GDIHQPMH G+ D GGN L+ SNLH +WD ++ A L Sbjct: 185 FVVHFVGDIHQPMHAGYAHDKGGNDFQLQIDGKGSNLHALWDSGMLNDRHLSDDAYLQRL 244 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 L + + A + + + + L Y Sbjct: 245 LALPAATAGSAALPPPAAAWAQASCKIAITPGVY----------------PSAHVLPATY 288 Query: 239 FNSRLPIVMKRVAQGGIR 256 + PI ++ G R Sbjct: 289 IATYRPIAETQLRIAGDR 306 >UniRef50_A2QX99 Contig An11c0270, complete genome n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QX99_ASPNC Length = 309 Score = 194 bits (493), Expect = 3e-48, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 113/294 (38%), Gaps = 36/294 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A+ L ++ V LL N D+S W D ++ K T PLH Sbjct: 21 WGDVGHRAIAYLAEKYLTVAGSNLVNELLANDKNYDISDAATWADTIKW--KRPLTRPLH 78 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D P K+C Y DC + C+ + N T Q++ + ++ EAL Sbjct: 79 YINPDDEPPKSCFVSYPHDC-----PPEGCIISQMANMTRQINDRHANMTQQK----EAL 129 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFR--------HKSNLHHVWDREIILT--- 166 +FL H GD+HQP+HV + GGN I + + + NLH VWD I Sbjct: 130 MFLIHLFGDLHQPLHVTGVA-RGGNDIHVCFDGKNHCNNDTKRWNLHSVWDTAIPHKING 188 Query: 167 ----AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 + + + C+ ++ATES + C Sbjct: 189 IKHNLKHNPERLASAKWADRLHEE---NKLRPADTECANTQEPLECIMQWATESNQLNCD 245 Query: 223 WGYKGVEAG---ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 + K L Y+ PIV ++ + +RLA ++ + ++ D+ Sbjct: 246 FVMKKGLQWLEKTDLGVKYYEVAAPIVDDQIFKAAVRLAAWISALAEDREEADN 299 >UniRef50_C5PWU6 S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PWU6_9SPHI Length = 262 Score = 194 bits (492), Expect = 3e-48, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 111/266 (41%), Gaps = 26/266 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + IA+ L ++A + L+ + L+ WPD V+ + + TSP H Sbjct: 23 WGMTGHRVVTEIAERHLTNKAKKNIAKLIGK---QHLAYWANWPDFVKSDHAFDETSPFH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +I+T N E+ D + + + L +G ++ + N L FL Sbjct: 80 YINTEG---NLTKEQFATALQQSPDNNIYKQLIRLSADLKAKDKGLTEMQQN----LYFL 132 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H MGD HQPMHVG +D GGN I++ WF N+H VWD ++ Y + Sbjct: 133 IHLMGDAHQPMHVGRPADLGGNKIEVMWFGKPDNIHRVWDSNLVDYEKYSYTE--YANVL 190 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + D ASW +I YK VE LS Y Sbjct: 191 DIHTRQENQRLTDGDFASWLYDT--------------HIVANKIYKDVEQNSNLSYRYIY 236 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +V + +GG+RLA +LN +FG Sbjct: 237 DNKYVVEDALLKGGLRLAKVLNEIFG 262 >UniRef50_C6XIU0 S1/P1 nuclease n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XIU0_HIRBI Length = 264 Score = 193 bits (491), Expect = 4e-48, Method: Composition-based stats. Identities = 70/270 (25%), Positives = 117/270 (43%), Gaps = 33/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK---YKWTS 57 W K GH +T IA+G L+D+A AV+ +L D++ + WPD +R + Sbjct: 25 WGKLGHRVTGEIAEGYLSDQAKVAVEAILG---VEDMAEVSTWPDYMRSSDDEFFKREAF 81 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLHF+ PD E+ + K ++ F L + + R AL Sbjct: 82 PLHFVTVPD-------EQTYAEAGAPKQGDAFTGLERFKAVLQNNESSAEELRL----AL 130 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 + + H + D+HQP+HVG D GGN +++ + SNLH +WD +++ Y + Sbjct: 131 IMVIHIVSDLHQPLHVGKGDDWGGNKVEIMFKGEASNLHEIWDEKLVQDEELSYTE-MAH 189 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 L+ + ++ + + + ES I Y + LS Sbjct: 190 WLDRKMTPELAQEWYN-------------ADPSVWIAESKEI-RPSIYPK-DGETDLSWQ 234 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y P++ +R++Q G+RLA LN +FG Sbjct: 235 YIYDHRPVMRQRLSQSGVRLAAYLNEIFGE 264 >UniRef50_C9ZQW0 Single strand-specific nuclease, putative n=6 Tax=Trypanosoma brucei RepID=C9ZQW0_TRYBG Length = 326 Score = 193 bits (491), Expect = 4e-48, Method: Composition-based stats. Identities = 63/282 (22%), Positives = 102/282 (36%), Gaps = 29/282 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDLSALCVWPDQVRHWYKY 53 W+ GH++ IA+ L+ + VK P D WPD ++ Y Sbjct: 27 WAAFGHMVVAEIAKRNLDADVLEKVKQYTQHLSESGPFPKIPDFVQSACWPDDLKS-YDL 85 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 + H+ F+ + + + I + + LS++ Y Sbjct: 86 GVMNGWHYTANVYSRDGFELKE-----PLQQKSNIVSVIDSLSATLSYHETP----LYVR 136 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILT 166 + AL L H GDIHQP+H T D GGN + +R + LH WD + Sbjct: 137 SFALAHLIHHYGDIHQPLHTTSQVSSEYKTGDLGGNLVHVRVRNTTTKLHSFWDDICRPS 196 Query: 167 AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + + + +F D + SW + + E +A + Y Sbjct: 197 ISMK---RPLEEKHYAKVRSFADRLVETYDVSW--EHRRQTNATIMSMEGFELAKEIAYA 251 Query: 227 GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 GV G LS Y + + +R+ G RLA LNN+ G+ Sbjct: 252 GVVNGSQLSSQYVDRCVETAEQRMTLAGYRLATHLNNILGSK 293 >UniRef50_Q1YUT9 Probable endonuclease n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YUT9_9GAMM Length = 281 Score = 193 bits (491), Expect = 5e-48, Method: Composition-based stats. Identities = 70/271 (25%), Positives = 109/271 (40%), Gaps = 34/271 (12%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 +GH + IA+ L+ + A + + L+ L +WPDQ+R K+ T H+ Sbjct: 20 GADGHRIIVSIAEKHLSKKTAAELTQI---SGGTALTELALWPDQIRGQQKWSHTKSWHY 76 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 I+ D V A++ QL + + RR EAL F Sbjct: 77 INIKDH-------ERFSGLRRSPKGDVLSALKESYKQLKDPKTESQQRR----EALAFFV 125 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWFR--HKSNLHHVWDREIILTAAKDYYAKDINLL 179 H GDIHQP+HVG SD GGN + ++W + NLH VWD +I Sbjct: 126 HLAGDIHQPLHVGRYSDLGGNRVSIKWLGSNKRRNLHWVWDTGLIKDEQLGV-------- 177 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINI---ACKWGYKGVEAGETLSD 236 D + + +W+ +A ES + ++G + T+ Sbjct: 178 --DQYSALINKTTAQQRYNWQSDS-----FLDWAMESKVLRAQVYEFGQPVQKGPVTIDQ 230 Query: 237 DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y N P++ KR+ G+RLA LN +F + Sbjct: 231 QYINRTKPLLKKRLLMAGVRLAGCLNRLFDS 261 >UniRef50_Q01U80 S1/P1 nuclease n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01U80_SOLUE Length = 261 Score = 191 bits (484), Expect = 3e-47, Method: Composition-based stats. Identities = 71/270 (26%), Positives = 109/270 (40%), Gaps = 33/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W EGH + R+A L AA V +L L+++ W D VR + P H Sbjct: 19 WGPEGHSLIARLAAARLTPAAAAKVAEILG--PGNTLASISSWADSVRRARA--ESGPWH 74 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++D P + D ERDC K CV I++F L + R+ EAL+F+ Sbjct: 75 YVDIPINKPHLDMERDC-----PKGDCVIAKIEDFEKVLVNPAATPVQRK----EALMFI 125 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP+H D GGN + L +F SNLH VWD ++ E Sbjct: 126 VHFVGDMHQPLHCSDNKDKGGNDVKLEFFGRPSNLHSVWDSGLLGRM----------GAE 175 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGE-----TLS 235 + + + + + V +A + A K Y + + Sbjct: 176 DALFATLNRDLTPKRARKFEKG-----TVENWADQIHKAAQKTTYGRLPKSTAGVPPKID 230 Query: 236 DDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 Y + ++ + +GG RLA +LN Sbjct: 231 AHYEHEADELIRIELEKGGARLAKVLNATL 260 >UniRef50_A0BLJ0 Chromosome undetermined scaffold_114, whole genome shotgun sequence n=6 Tax=Paramecium tetraurelia RepID=A0BLJ0_PARTE Length = 712 Score = 190 bits (482), Expect = 4e-47, Method: Composition-based stats. Identities = 55/291 (18%), Positives = 106/291 (36%), Gaps = 24/291 (8%) Query: 1 WSKEGHVMTCRIAQGLLN---DEAAHAVKML------LPEYVNGDLSALCVWPDQVRHWY 51 W + GH+MT +IA+ L + L L + + + VW D ++ Sbjct: 422 WWEVGHMMTAQIAKNYLRDNRPDVLAWADSLVQDFNSLTDGKSNTFAEAAVWLDDIKETG 481 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 S H+ D P + +++ AI L++ + + Sbjct: 482 TEFLFS-WHYTDRPINPDGLLI----KIEDESRNINSIYAINQAVAVLTNSKTSRNRHTV 536 Query: 112 NMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLR-WFRHKSNLHHVWDREI 163 + L L H +GDIHQP+H DAGGN ++++ N H WD Sbjct: 537 FKAQMLRVLLHVIGDIHQPLHDTSLYNNSYPDGDAGGNFLNIQLQNGTLMNFHSFWDSGA 596 Query: 164 ILTAAKDYY-AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 + A + + A+ ++ + + ++ + S + + + + A + Sbjct: 597 LTFAPNNSFLARPLSQSDSEYLDKWSKDLMKKFPIS-KYSNYDMTNPSVWTYLGFRQAQQ 655 Query: 223 WGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 + Y V A + S DY + + + GG RL L ++ Q ++ Sbjct: 656 FVYPMVAASNSYSSDYEKQAIAFCEENLIVGGYRLGSKLIEIYDQILQNEA 706 >UniRef50_C5K8A7 Nuclease S1, putative n=7 Tax=Perkinsus marinus ATCC 50983 RepID=C5K8A7_9ALVE Length = 366 Score = 190 bits (482), Expect = 4e-47, Method: Composition-based stats. Identities = 96/298 (32%), Positives = 137/298 (45%), Gaps = 46/298 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK---YKWTS 57 W +GH L ND A AV +L E ++ WPD V H ++W+S Sbjct: 18 WGPDGHATVADAGNKLFNDNANEAVAEILGE--GVRMADYASWPDSVLHGPDSSEWEWSS 75 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 LHF D C+F Y RDC D D CV G I+N+T Q++ R+ AL Sbjct: 76 GLHFADVE--QCHFIYSRDCKD-----DYCVVGGIKNYTRQVADTSLPIEQRQ----VAL 124 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRW---FRHKSNLHHVWDREIILTAA-----K 169 FL HFMGDIHQP+HVG SD GGN+I + LHH WD ++I + Sbjct: 125 KFLMHFMGDIHQPLHVGRHSDYGGNTIKVDMKFANYEYGALHHAWDEKMIDQSQASQYDG 184 Query: 170 DYYAKDIN--------------LLEEDIEGNFTDGIWSDDLASWRECGNVF---SCVNKF 212 +Y +D N + + + G + D + W CVN Sbjct: 185 EYIQQDANYSTPLAERETFWGITVSDIMTELAEGGAFHDRVPMWLADCETNGLDECVNTM 244 Query: 213 ATESINIACKWGYKGVEA-----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 A ES IAC Y+ ++ G+ LS DY++ R+ IV +++A+G +R A ++N+ F Sbjct: 245 AEESAIIACADAYRHLDGDEIEYGDVLSMDYYDDRIKIVKEQLAKGAVRFAWIMNHAF 302 >UniRef50_B9XJ21 S1/P1 nuclease n=1 Tax=bacterium Ellin514 RepID=B9XJ21_9BACT Length = 377 Score = 188 bits (476), Expect = 2e-46, Method: Composition-based stats. Identities = 69/284 (24%), Positives = 103/284 (36%), Gaps = 35/284 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLP------EYVNGDLSALCVWPDQVRHWYKYK 54 W EGH++ +I L+ L+ N W D + Sbjct: 44 WDAEGHMVVAQIGYNHLDPAVKAKCDALISVALTNVSSQNNTFVTAACWADDNKAALG-- 101 Query: 55 WTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMT 114 T+ H+ID P F + + V AI+ L T+ + + Sbjct: 102 -TAIWHYIDLP-----FSLDGTPTNGVAPASTNVVFAIRQCVATLQ----STNATQIDQA 151 Query: 115 EALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA 167 +L +L HF+GDI QP+H DAGGNS + +NLH +WD Sbjct: 152 ISLRYLIHFVGDIQQPLHASTAVSASSPGGDAGGNSFS--LSGYWNNLHSLWDAG----- 204 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGN--VFSCVNKFATESINIACKWGY 225 Y I+ + DG S ++ N V +A ES +A Y Sbjct: 205 -GGYLTNSISRPLTAGGQSIIDGKVSAIEVAYPFTSNIGVIPNPMDWANESWGLAQNVAY 263 Query: 226 KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 G+ T S Y + +R++QGG RLA LLN ++ S Sbjct: 264 AGLTRSSTPSVGYLTTVQNTTQQRMSQGGHRLANLLNTIYSTSP 307 >UniRef50_Q7P202 Probable endonuclease n=1 Tax=Chromobacterium violaceum RepID=Q7P202_CHRVO Length = 274 Score = 188 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 68/269 (25%), Positives = 108/269 (40%), Gaps = 22/269 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH--WYKYKWTSP 58 W +EGH +T IAQ LL+ +A VK L+P D + L ++ DQ + + Sbjct: 23 WGQEGHRITGYIAQQLLSSKAKAEVKKLIPNA---DFAQLALYMDQHKQELKQTLPGSDQ 79 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+ D P C+ E +C D C A I + L+ +DR +AL Sbjct: 80 WHYNDEPV--CSGVTEDECPD-----GNCAANQIDRYRKVLADRGAAKADR----AQALT 128 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK--SNLHHVWDREIILTAAKDYYAKDI 176 FL H +GDIHQP+H D GGN ++ SNLH VWD ++ K Sbjct: 129 FLIHMVGDIHQPLHAADNLDRGGNDFKVQLPGSSKISNLHSVWDTALVQQELNGADEKSW 188 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSD 236 + G + W N ++ + + +A L + Sbjct: 189 AAADLQRYQRNVSGWQGGGVMDWVHESNQYARADVYG----PLAGFSCGASPSTPVYLDN 244 Query: 237 DYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 Y + +V +++A+ G R+A ++N Sbjct: 245 TYLRAGGLLVDQQLAKAGARIAAVINQAL 273 >UniRef50_A6GGE9 Probable endonuclease n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GGE9_9DELT Length = 285 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 69/283 (24%), Positives = 109/283 (38%), Gaps = 34/283 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNG---DLSALCVWPD-QVRHWYKYKWT 56 W +GH + IA+ L+ V+ LL L+ +W D + R ++ + Sbjct: 20 WHDDGHRIVGEIAERNLSPATRAKVRALLQGSDGKGDGSLATASIWADHEARESPEFAFA 79 Query: 57 SPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 + H+++ + C ++ C+A A+ + L R EA Sbjct: 80 ASSHYVNLDGPTSPRELHAQCLERA----GCLATAVPYYADILRSEGASEDQR----AEA 131 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSID------LRWFRHKSNLHHVWDREIILTAAKD 170 L FL HF+GD HQP+H G D GGN ID +NLH WD ++ A + Sbjct: 132 LRFLVHFVGDAHQPLHAGRRGDRGGNDIDRLTIPGYTAKGETTNLHAAWDGALVALALTE 191 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA 230 + GI +D A W + + ES A Y V+ Sbjct: 192 RGVDW-----KAYAVALDAGIDADARARWVGG-----TIYDWLEESRRFAAAEAYLHVDG 241 Query: 231 ------GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 G+TL D++ +R++Q G+RLA LL +F Sbjct: 242 LTPVRSGDTLGADWYRRNSSTAEQRLSQAGVRLAALLEAIFED 284 >UniRef50_B8KH31 S1/P1 nuclease n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KH31_9GAMM Length = 323 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 61/276 (22%), Positives = 93/276 (33%), Gaps = 36/276 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W GH + ++A L V+ LL + L W D++R W Sbjct: 58 WGAMGHEIAAQLADPYLTAHTRQQVEALLGKD---TLKTASTWADRMRSDPAPFWQEEAG 114 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ P R D A A+ F L ++ AL Sbjct: 115 PYHYVTIPRG-------RQYADVGPPPQGDAASALTQFARDLRSPSVSLERKQL----AL 163 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F H + D+ QP+HVG D GGN + +R F SNLH VWDR++ + A+ Sbjct: 164 RFAIHIIQDLQQPLHVGNGLDRGGNDVPVRIFGETSNLHSVWDRQMFESTARTQAQWLDY 223 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 ++ T + + ES + ++ Sbjct: 224 FKASELLRRPTQN---------------DADPQVWIAESAKLRETLY----PVPASIDTR 264 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 Y LP R+A GIR A LN ++ + Sbjct: 265 YIRRELPRAEARLALAGIRTAAWLNAIYDDNATPGE 300 >UniRef50_A2EEH7 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EEH7_TRIVA Length = 328 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 47/280 (16%), Positives = 98/280 (35%), Gaps = 24/280 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQVRHWYKYKWTSP 58 W H R+A+ L+ E + +L + + W D ++ + Sbjct: 14 WWGAPHYTVARLAETRLSPEQLKYINDILETWTSEKAVFHDTANWHDDIK-AANVAIMAN 72 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P + +++ D + A ++ + T+ ++ + Sbjct: 73 WHFRNQPIFSSDYE-----GDFSYPTTYNITDASKDCINTIMSE---TTTSQWILGFCFR 124 Query: 119 FLSHFMGDIHQPMH-------VGFTSDAGGNSIDL--RWFRHKSNLHHVWDREIILTAAK 169 LSHF+ D H P+H D GGNS + + + N+H +WD + Sbjct: 125 TLSHFVADAHCPVHSAGRWSKAFPDGDRGGNSQAVVCTYGQPCRNMHMLWDSACLDFQIW 184 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 D++ E+++ + ++ + + + E+ A K+ Y + Sbjct: 185 PLSKNDVDEYEKNLTNLLNNY----QPKTYLPETYQSTDPDVWENEAYRYASKYVYGNLP 240 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 T +D Y + ++ G RL +L F A + Sbjct: 241 DDFTANDTYIKEGANAAKQLISAAGYRLGEVLLKFFEARK 280 >UniRef50_Q4QGQ3 3'-nucleotidase/nuclease, putative n=3 Tax=Leishmania RepID=Q4QGQ3_LEIMA Length = 381 Score = 186 bits (472), Expect = 7e-46, Method: Composition-based stats. Identities = 63/288 (21%), Positives = 100/288 (34%), Gaps = 31/288 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAV-------KMLLPEYVNGDLSALCVWPDQVRHWYKY 53 W +GH+ IA+ L + V + P + ++ L W D ++ Y Sbjct: 29 WWDKGHMCIAEIARRNLKPDVQAKVQACANALNKIGPFPKSTNIVELGPWADDLKSMGLY 88 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 S HFIDT + + V+ + VA I L + + Sbjct: 89 -TMSTWHFIDTIYNPQDVK-----VTINPVEIVNVASVIP----MLISAITSPTATSDII 138 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWF---RHKSNLHHVWDREI 163 ++ L HF+GDIH P+H D GGN + LH WD Sbjct: 139 ITSVANLIHFVGDIHMPLHSADLFSPEYPLGDLGGNKQIVIVNETAGTSMKLHAFWDSMC 198 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 ++ + ++ F D + S+ E + + A ES +A K Sbjct: 199 --EGPQNNAVRPLDKDAYAELSAFVDNLVKSH--SFTEEQMMMTNSTIMAAESYELAVKN 254 Query: 224 GYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G+ G LS+ Y + + RV G RLA +LN + Sbjct: 255 VYPGISDGTVLSESYKANGKILAAGRVTLAGYRLATILNTALAGVSLD 302 >UniRef50_A2ECC5 Class I nuclease, putative n=2 Tax=Trichomonas vaginalis RepID=A2ECC5_TRIVA Length = 319 Score = 186 bits (472), Expect = 7e-46, Method: Composition-based stats. Identities = 59/286 (20%), Positives = 100/286 (34%), Gaps = 27/286 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--PEYVNGDLSALCVWPDQVRHWYKYKWTSP 58 W H+M RIA+ LL + ++ +L ++ ++ W D ++ Y Sbjct: 12 WWGHAHMMIGRIAESLLTSKEKKKIEAVLRYGQHPIQTITEATTWQDDLKGTYSLSVMET 71 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF+D P + + + + + L T+ + L Sbjct: 72 WHFLDHPIN-------KGKNTSIPPPTYNITTYMDSAYRALKD---KTTTDPWVWAFHLR 121 Query: 119 FLSHFMGDIHQPMH-------VGFTSDAGGNSI--DLRWFRHKSNLHHVWDREIILTAAK 169 L HF+GD+H P H + T D GGN + +N+H +WD + Sbjct: 122 SLIHFVGDVHTPHHNVALFNDLFPTGDHGGNLYILNCNLGSGCNNIHFLWDSAGFYFPMR 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC--VNKFATESINIACKWGYKG 227 + I ++ + N T I + + + ES +A +GY Sbjct: 182 NPV---IPKYRDEFQKNATKLINELPQSHYTSQNMDVKTFHPEVWHNESYEVAYNFGYNT 238 Query: 228 VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 G S DYF + +R+A G RL L V G E + Sbjct: 239 TMYGW-PSKDYFTTVQTQSKERIAISGYRLGYFLKEVVGNIPVEPT 283 >UniRef50_Q989R8 Endonuclease n=1 Tax=Mesorhizobium loti RepID=Q989R8_RHILO Length = 278 Score = 186 bits (471), Expect = 9e-46, Method: Composition-based stats. Identities = 68/280 (24%), Positives = 116/280 (41%), Gaps = 36/280 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W EGH + IAQ L+ A VK +L V ++++ W D VR+ + + H Sbjct: 21 WGPEGHSIVAEIAQRRLSSTALMEVKRILGGEVA--MASVASWADDVRYAI-HPESYNWH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F+D P +D C V+ C I +++ + R ++L +L Sbjct: 78 FVDIPLADSKYDPVSQCA--ANVQGDCAIAEIDRAEHEITCATDPLQRR-----DSLRYL 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNS--IDLRWFR--------HKSNLHHVWDREIILTAAKD 170 H +GD+HQP H + G N+ + +++ NLH VWD II Sbjct: 131 IHIVGDLHQPFHTV-ADNTGENALAVTVKFGGLIKSPPKTPADNLHAVWDSTIIKQTTYA 189 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA 230 + G++ D + +D L E +A E+ +A + G+ Sbjct: 190 W-------------GSYVDRLETDWLLKHPEASETL-DPVAWALEAHTLAQEMA-AGITN 234 Query: 231 GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 G L +DY+ LP+V +++ + G+RLA +LN + Sbjct: 235 GANLDNDYYAKALPVVDEQLGRAGLRLAAVLNRWLATAPA 274 >UniRef50_A3HUK9 Putative S1/P1 Nuclease n=1 Tax=Algoriphagus sp. PR1 RepID=A3HUK9_9SPHI Length = 257 Score = 186 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 57/266 (21%), Positives = 104/266 (39%), Gaps = 31/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH + +A L A V+ +L + W D+++ +Y + H Sbjct: 23 WGQIGHYLIGYMAGQQLKRSARKNVERVLYP---MSIGRSGTWMDEIKSDKRYDYAYSWH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++ + + + + AI +L ++ E L L Sbjct: 80 YLTSKHGEYDPHLQE--------EGGDAYEAINRIKEELKSGNLNPTE----EAEKLKML 127 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H + DIHQP+HVG D GGN + L +F SNLH VWD +I + Y + Sbjct: 128 IHMVEDIHQPLHVGTGEDRGGNDVKLEYFWQSSNLHSVWDSGMIDRWSMSYTE-----IG 182 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 +++ T + + + E+++ A YK + LS +Y Sbjct: 183 DELMRRLTPEMEDQYRE---------GSMEDWLQEAVD-ARPLVYK-IPENRKLSYNYDY 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 + P++ +R+ +RLA +L ++G Sbjct: 232 AVRPLLEERLIAASVRLAQILEEIYG 257 >UniRef50_A4CQ68 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriales RepID=A4CQ68_9FLAO Length = 257 Score = 186 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 69/266 (25%), Positives = 109/266 (40%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH +A+ L+ A AV LL L+ + + D ++ Y+ SP H Sbjct: 22 WGRTGHRAIGEVAEAHLSRRARKAVSRLL---EGESLAKVSTFGDDIKSDTTYRSFSPWH 78 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P + + D + I++ L L L Sbjct: 79 YVNLPPETP-------YGEITPNPDGDILQGIEHCIRVLKDPASPRDQ----QVFYLKLL 127 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMHVG D GGN I L++F +NLH +WD ++I Y L Sbjct: 128 VHLVGDLHQPMHVGRPEDRGGNDIQLQYFDKGTNLHRLWDSDMIEDYGMSYTE-----LA 182 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 E + I V ++A +S ++A Y VE GE L Y Sbjct: 183 ETLPPATRREI----------RVIQSGSVLEWAGQSQSLA-NRVYASVENGEKLYYRYRY 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 V +++ GG+RLA +LN+++G Sbjct: 232 LWWDSVERQLLLGGLRLAAVLNDIYG 257 >UniRef50_A2ELH6 Class I nuclease, putative n=1 Tax=Trichomonas vaginalis RepID=A2ELH6_TRIVA Length = 315 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. Identities = 58/279 (20%), Positives = 98/279 (35%), Gaps = 27/279 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN--GDLSALCVWPDQVRHWYKYKWTSP 58 WS E H + R+AQ +L + + +L + + DL + W D +R Sbjct: 5 WSGEPHQLIARVAQTMLTKKQRKWIDEMLFLWPSEAQDLITVSNWEDTIRSDIDDILMQ- 63 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P + ++ + + AI + + + T+ + Sbjct: 64 WHFENKPYIEPEYTPKK------VTRTFNITNAIDDA---MKSILDPTTTSFWTFGFYFR 114 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNSIDLR--WFRHKSNLHHVWDREIILTAAK 169 L HF+GD H P+H DAGGN I L S LH +WD + Sbjct: 115 ALIHFVGDSHCPVHSIAYYSDKYPKGDAGGNFIKLNCSISYFCSTLHKLWDSACLNFQHN 174 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 Y A + E++I + + E S + + ES A + Y + Sbjct: 175 KYVAPTLEDFEKNITR-----MMNAYPLKILEEHPSLS-PHDWIDESYKTAIDYAYTPLV 228 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + ++D Y + R+ G RL M+ F Sbjct: 229 DWKNINDTYLANGAEAAEYRITLAGYRLGMVFKQFFKER 267 >UniRef50_Q25267 3'-nucleotidase/nuclease n=7 Tax=Trypanosomatidae RepID=Q25267_LEIDO Length = 477 Score = 184 bits (467), Expect = 3e-45, Method: Composition-based stats. Identities = 67/287 (23%), Positives = 115/287 (40%), Gaps = 26/287 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVK---MLL----PEYVNGDLSALCVWPDQVRHWYKY 53 W +GH+ IA+ + + +L P + D+ W D ++ Sbjct: 126 WWSKGHMSVALIAKRHMGASLVEKAELAAKVLSFSGPYPKSPDMVQTAPWADDIK-TIGL 184 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 K S H+I TP + E D V+ + VA I L E + + Sbjct: 185 KTLSTWHYITTPY----YTDEDFTLDVSPVQTVNVASVIP----MLQTAIEKPTANSDVI 236 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSN--LHHVWDREII 164 ++L L HFMGDIHQP+H SD GGN + + LH WD + Sbjct: 237 VQSLALLLHFMGDIHQPLHNVNLFSNQYPESDLGGNKQLVVIDSKGTKMLLHAYWDS-MA 295 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWG 224 + + + ++ + D NF D + + ++ + + + E+ ++A K+ Sbjct: 296 EGKSGEDVPRPLSEADYDDLNNFADYLEATYASTLTDKEKNLVDTTEISKETFDLALKYA 355 Query: 225 YKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G + G TLS++Y + I ++V G RLA +LN + + Sbjct: 356 YPGADNGATLSNEYKTNAKKISERQVLLAGYRLAKMLNTTLKSVSMD 402 >UniRef50_B0DTT9 Predicted protein n=2 Tax=Agaricales RepID=B0DTT9_LACBS Length = 375 Score = 184 bits (466), Expect = 3e-45, Method: Composition-based stats. Identities = 83/360 (23%), Positives = 123/360 (34%), Gaps = 96/360 (26%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------LSALCVWPDQVRHWYKY 53 W GH + IAQ L+ + +L + L+ + W D++R +K Sbjct: 22 WGAAGHEIIATIAQMYLHPSILPTICDILNFSEDETQPEQPCHLAPISTWADKLR--FKM 79 Query: 54 KWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR 109 +W++ LH++ D P + C F ER G + V AI+N T L + Sbjct: 80 RWSAALHYVGSLDDHPSQTCLFPGERGWA---GTRGGNVLDAIKNVTGLLEDWTR-GEAG 135 Query: 110 RYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK 169 EAL FL HFMGD+H P+H+ D GGNS + W ++NLH +WD +I A + Sbjct: 136 DATANEALKFLVHFMGDLHMPLHLT-GRDRGGNSDRVLWSGRQTNLHSLWDGLLIAKAIR 194 Query: 170 DYY-----AKDINLLEEDIEGNFTD------------GIWSDDLASWRECGNVFS----- 207 +E + G D W DD+ W C Sbjct: 195 TVPRNYSRPLPYPDVEHALRGTIYDSYIRRIMWEGVFQKWKDDVPEWFSCPETTPPPPAR 254 Query: 208 ---------------------------CVNKFATESINIACKWGYKGVE-------AGET 233 C +A + C + G Sbjct: 255 GWQQVVMSLKRLAGKQGVEIGPDTDVLCPYHWAKPIHALNCDIVWPKELDEPPYGGGGSK 314 Query: 234 LSDDYFNSRLP----------------------IVMKRVAQGGIRLAMLLNNVFGASQQE 271 +D+ R P +V K +AQGGIRLA +LN +F Sbjct: 315 FADEDVAGRPPKPHPPLLELDTPKYAGVIEDTMVVEKLLAQGGIRLAGILNYLFLEEAAR 374 >UniRef50_Q7RSD2 3'-nucleotidase/nuclease n=8 Tax=Plasmodium RepID=Q7RSD2_PLAYO Length = 328 Score = 183 bits (463), Expect = 7e-45, Method: Composition-based stats. Identities = 53/298 (17%), Positives = 103/298 (34%), Gaps = 27/298 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRH------- 49 WS EGH++ IA L+D + + Y + VWPD +++ Sbjct: 24 WSDEGHMLISAIAYEGLDDREKKILTQIFQNYKEDNDFNNHIYAAVWPDHIKYYEHPVDT 83 Query: 50 ---WYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 H+I+ P N D + + + D + + + F ++ Sbjct: 84 TKRMDGISIMDRWHYINVPYNPTNIDLDMYHKEYYKDTDNSLTISRKIFQDLKLMEKKNN 143 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVW 159 ++ L + H GD+HQP+H D GG +I++ + LHH+ Sbjct: 144 YGSYFSYNFQLRYFIHVFGDMHQPLHTATFFNKHFIKGDFGGTAINVNYNNRTEKLHHLC 203 Query: 160 D------REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 D + +A + D L + ++ + + G + A Sbjct: 204 DCVFHARDKKWPSATVEEVTNDARTLMNTYPPEYFGNRLNNGMDEYEYLGYIVEDSYAQA 263 Query: 214 TESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 + I A + TL++ Y + ++ +++A GG RL L + + Sbjct: 264 IDHIYYAFPFESLNRHTAYTLTNAYVINLKKVLNEQIALGGYRLTRYLKTIIANVPDD 321 >UniRef50_C5LHN6 ATP-dependent RNA helicase, putative n=4 Tax=Perkinsus marinus ATCC 50983 RepID=C5LHN6_9ALVE Length = 1614 Score = 183 bits (463), Expect = 8e-45, Method: Composition-based stats. Identities = 77/300 (25%), Positives = 122/300 (40%), Gaps = 58/300 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W ++GH + IAQ +++D V L D+ + W D+ H +Y+WT+PLH Sbjct: 22 WGEDGHSIVAAIAQRIVSDRVIEGVNETLGR--GQDMIGVACWADKASHSAQYRWTAPLH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F+DTP K C YERDC D D CV GAI N+T + ++R A+ + Sbjct: 80 FVDTPTKQCQMVYERDCRD-----DFCVIGAIYNYTNRAISKSVSRAERE----FAMKLV 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILT-------------- 166 + P H + S LH VWD +IL Sbjct: 131 TTDFAPP-GPRH-----------------KVSSKLHQVWDSGLILQDEFELRVQRRREHR 172 Query: 167 -------AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF---SCVNKFATES 216 + + L E G ++ W C A ES Sbjct: 173 KIPPHPPYRHKFEERWHELFEHLWTKLSKGGEYAKHREEWLAPCRQNGLQECTKTMAEES 232 Query: 217 INIACKWGY-----KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 + +AC Y + + G+ L +YF +R P++ +++A+GG+RLA +L +FG+++ Sbjct: 233 LAVACTAAYHDEYRRWIADGDVLDRNYFLTRNPLMEEQLAKGGVRLAWVLQQMFGSNRHR 292 >UniRef50_Q04SY8 Nuclease S1 n=4 Tax=Leptospira RepID=Q04SY8_LEPBJ Length = 295 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. Identities = 72/295 (24%), Positives = 116/295 (39%), Gaps = 50/295 (16%) Query: 1 WSKEGHVMTCRIAQGLL-NDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---- 55 W +GH IAQ LL N +A + +L L + PD++R + K Sbjct: 26 WGHQGHKTIGIIAQHLLVNSKAFEEINNILG---GLTLEEISTCPDELRVFQSEKKPMSS 82 Query: 56 --------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH 101 T HFIDTP N +E K CV I ++ L+ Sbjct: 83 VCNQIFTNPEPPTNTGSWHFIDTPISQFNPTHEDI---VKACKSSCVLTEIDRWSNVLAD 139 Query: 102 YREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF-TSDAGGNSIDLRWFRHKSNLHHVWD 160 T+ +AL F+ HF+GDIHQP+HV D GGN + +R R+K+NLH WD Sbjct: 140 ----TTQTNAKRLQALSFVVHFIGDIHQPLHVAERNHDLGGNKVKVRIGRYKTNLHSFWD 195 Query: 161 REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA 220 ++ + + + I L + + + + A Sbjct: 196 TNLVNYISTNPISTTILLKSDV----------------AFAQTEAQTTPETWVLQGFQFA 239 Query: 221 CKWGYKGVEAGET----LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G+ +S+ Y + +P+V ++A G+RL+ L +F +S ++ Sbjct: 240 RNVAYDGIPIDYASVVRISNAYIQNAIPVVKHQLASAGVRLSQHLARIFSSSNKQ 294 >UniRef50_C5LN34 S1/P1nuclease, putative n=7 Tax=Perkinsus marinus ATCC 50983 RepID=C5LN34_9ALVE Length = 401 Score = 181 bits (460), Expect = 2e-44, Method: Composition-based stats. Identities = 58/295 (19%), Positives = 114/295 (38%), Gaps = 35/295 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH +A L+ A++ +K LL D W + W++ LH Sbjct: 29 WDIDGHEAVGMVAMSALDSRASNQLKRLLQ---GKDAVEDAGWAH--KAESSIPWSTRLH 83 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR----------- 109 F+ P+ N ++ + C+ A++ F Q S + Sbjct: 84 FLSQPEPFSNTLVV---NEITCPQGQCLLEALKLFYDQAKGDTSKISQKDRLMMSSARLP 140 Query: 110 -RYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTA 167 + +A+ FL + +GD+HQP+H GF +D G ++ + +L+ +WD EII Sbjct: 141 VQVTDADAVRFLINLIGDMHQPLHEGFQTDDFGKQTIVKLPGGSTLSLYELWDHEIIQET 200 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKG 227 K++ + N ++ D W+E + + K+ ++ A K+ Y Sbjct: 201 IKNHPQFWWSGWTHIQRANP--DTYNADKKLWQENNK--AALEKWCNDNAEFANKFIYTN 256 Query: 228 VEAGETLS----------DDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 + E L ++++++ G R A++LN++ +S Sbjct: 257 PLSNERLPIGSGSPINVDAAVLEKWRQLLIQQILLAGSRTAIVLNDILESSAAPG 311 >UniRef50_A2G6P9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G6P9_TRIVA Length = 348 Score = 180 bits (457), Expect = 3e-44, Method: Composition-based stats. Identities = 59/296 (19%), Positives = 102/296 (34%), Gaps = 34/296 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNG--DLSALCVWPDQVR-HWYKYKWTS 57 W E H+ RIA+ ++ + + +L + + + + W D++ + + Sbjct: 12 WWNEPHMAVVRIAERMITKQQKDWMNVLFSMWPSEADTMVSASTWHDEIPENSAQVSIMK 71 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 HF D P A F+YE V + + L T+ Y Sbjct: 72 NWHFADKPILAPGFEYEYQ-------PTYNVTSVVSDSMNAL---FNPTTKSLYAYHFLF 121 Query: 118 LFLSHFMGDIHQPMHV-------GFTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAA 168 L HF+GDIH P H D GGN I+ ++ LH +WD ++ Sbjct: 122 RNLVHFIGDIHTPCHTAAYYSPKFEEGDRGGNSLKINCKYGEPCKQLHKMWDSGVLNFQH 181 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 D N L ++ E N I S + E+ ++A + Y + Sbjct: 182 M---YLDTNELLDEFEHNI-SHIMQMHPESSLPTVKSL-NAYLWFNETYDVAVNYAYGML 236 Query: 229 EA-------GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 + L +Y + ++ + G RLA ++ F ED + T Sbjct: 237 KDLNNSELDKYDLMPNYISKGAMAAEIQIVKAGYRLAYVIQEFFKVHSPEDPRIFT 292 >UniRef50_B6DTM7 Single-strand-specific nuclease n=2 Tax=Bodo saltans RepID=B6DTM7_9EUGL Length = 360 Score = 179 bits (455), Expect = 6e-44, Method: Composition-based stats. Identities = 57/291 (19%), Positives = 98/291 (33%), Gaps = 28/291 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-----LPEYVNGDLSALCVWPDQVRHWYKYKW 55 W GH++T IAQ LL + + ++ WPD ++ + Sbjct: 77 WGCAGHMITAEIAQQLLPTNVRRYFTDISAYQQMYYPRITSMTEASCWPDDMKSYTSQYS 136 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 + HF + N C V+ + A+ N QL+ T Sbjct: 137 S--WHFYNVCLLRANGT-NLTCPVWTSVETGQMPTAVANARAQLAMGSNLTHAES---AF 190 Query: 116 ALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 L FL H +GD HQP+H+ D GGN + ++NLH D L Sbjct: 191 WLAFLVHLVGDFHQPLHIATLFNPMFPKGDQGGNRFYIYVNNSRTNLHAFHDDLAWLLPR 250 Query: 169 KDYYAKDINLLEED--IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + + + +D + ++ + ++ NV + + E Y Sbjct: 251 DGFPQRPLAEYPDDVSMIEGLSESLILLQKFAYPSQPNVTNTS-VWIEEGFETGVNISYT 309 Query: 227 GVEAGE-------TLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 + LSD Y ++ ++A GG RLA +L ++ Sbjct: 310 LPNGQDLQFNQHFNLSDTYVTRLRSMLQNKLALGGRRLARILMEIYDEVHA 360 >UniRef50_A2F450 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2F450_TRIVA Length = 329 Score = 179 bits (454), Expect = 8e-44, Method: Composition-based stats. Identities = 49/281 (17%), Positives = 97/281 (34%), Gaps = 26/281 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--PEYVNGDLSALCVWPDQVRHWYKYKWTSP 58 W H + IA + + ++ L ++ + + VW D ++ Y S Sbjct: 11 WWGHAHSLIASIAMKDFSSKERKILEKFLEYGQHKRATIEEVAVWQDDLKGAYDLGIMSS 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF P + + + + L++ + + + L Sbjct: 71 WHFTPRPLIKDGYTATLQ------PVTYNITSYMNSAWNSLTN---PATTDPWIIAFHLR 121 Query: 119 FLSHFMGDIHQPMHV-------GFTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAAK 169 L HF+ D+H P H D GGN I + N+H +WD + Sbjct: 122 SLIHFVADVHTPHHNVGYYSQETPDGDKGGNLYQIICNYGSACMNIHFLWDSACLALPLG 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGY-KGV 228 + I ++ N T + + A K++ ES + ++GY + Sbjct: 182 NP---LIPKYLDEFSENVTKIMKNHQKAK--MGDLETIDFMKWSNESYDTVKQYGYSPAI 236 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 E ++D Y + + + RV+ G RL+ +L ++ + Sbjct: 237 ERYGEVTDQYLKTCQSVALNRVSLAGYRLSTVLRQIYNEKK 277 >UniRef50_Q236I5 S1/P1 Nuclease n=2 Tax=Tetrahymena thermophila SB210 RepID=Q236I5_TETTH Length = 330 Score = 179 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 53/290 (18%), Positives = 103/290 (35%), Gaps = 31/290 (10%) Query: 1 WSKEGHVMTCRIAQG---------LLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY 51 W GH++T +A+ L E + L + + W D ++ Sbjct: 19 WWDGGHMITVEVAKQEILARDPALYLKIEKYVTILNPLCDARSQTFVQAASWADDIKDPA 78 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE----GTS 107 W HF + P D + A++ +L Sbjct: 79 MNFW-DKWHFFNKPINEEGLYVVLD----QDSLNNNSINALKRCIQELQKNNTTPINNPD 133 Query: 108 DRRYNMTEALLFLSHFMGDIHQPMH---------VGFTSDAGGNSIDL-RWFRHKSNLHH 157 + + +L H +GD+HQP+H D GGN ++ LH+ Sbjct: 134 NISVQQAIMMRYLIHIVGDMHQPLHNTNLFNYTFSTNQGDLGGNKENVILLNGTSMVLHY 193 Query: 158 VWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESI 217 +D + A +++ ++ +E +F + S+ + +A ES Sbjct: 194 YFDSGALRLAD---FSRPLSQEQEQQVTDFAASFRAQYPRSFFNERVNITLPEMWAQESY 250 Query: 218 NIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 IA + Y ++ ++ ++ N + ++ +++A GG RLA LL +VF Sbjct: 251 EIAVRDIYPYLKLTNKVTPEWDNLQYEMIKQQIALGGYRLADLLTSVFNP 300 >UniRef50_A8HTU7 Endonuclease n=1 Tax=Azorhizobium caulinodans ORS 571 RepID=A8HTU7_AZOC5 Length = 282 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. Identities = 65/276 (23%), Positives = 109/276 (39%), Gaps = 37/276 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W ++GH + IAQ L A V LLP+ L+++ W D VR + T H Sbjct: 26 WGEDGHAIVAEIAQRRLTPTGAALVASLLPK--GASLASVASWADDVR--PDHPETRRWH 81 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++ P A +D RDC + C+ AI+ + E T+AL L Sbjct: 82 YVGIPMGAATYDPLRDCPS--RPEGDCIVAAIERARLDMHCAPEPA-----ARTDALKLL 134 Query: 121 SHFMGDIHQPMHVGFTSDAGG-NSIDLRWFRH-----------KSNLHHVWDREIILTAA 168 H MGD+HQPMH G + L W +N+H +WD ++ A+ Sbjct: 135 VHLMGDLHQPMHAIAADHLGTRRKVLLNWAGQACTHDCEAPPPTTNMHVLWDTTLVRKAS 194 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 + G + D + + L +A+E+ + Y V Sbjct: 195 LSW-------------GGYVDRLEAGWLKEADAAAVAAGTPADWASETHGVGLAM-YALV 240 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 ++ Y+ + LP++ +++ + G+RLA +N Sbjct: 241 PPDNVINTTYYRAALPVLDQQLGKAGLRLAHEINAA 276 >UniRef50_B0T6T3 S1/P1 nuclease n=1 Tax=Caulobacter sp. K31 RepID=B0T6T3_CAUSK Length = 287 Score = 176 bits (447), Expect = 6e-43, Method: Composition-based stats. Identities = 71/285 (24%), Positives = 110/285 (38%), Gaps = 39/285 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRHWYKYKWT 56 W + GH + +IA+G L +AA AV LL + DL+A W D R ++ T Sbjct: 23 WGRTGHAVVAQIARGYLTPKAAAAVDALLAADTDALTPPDLAARASWADAWRK--DHRQT 80 Query: 57 SPLHFIDTPDKA------CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRR 110 + HF+D C G + C+ G + F +L+ + ++R Sbjct: 81 TEWHFVDVELDHPDLAGACFGFPASATPASAGPEKDCIVGRLNAFEAELADPKTDAAERL 140 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAK 169 A F+ HF+GD+HQP+H D GGN I L ++ NLH WD + Sbjct: 141 L----AFKFVLHFVGDLHQPLHAADNQDRGGNCIPLALGGPRTVNLHSYWDTVAVEAIEA 196 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGY---- 225 D + + + I + +W + +A ES +A Y Sbjct: 197 DP---------DKLAAKLSAQITPAERKAWEKG-----DAKTWAMESFALAKSTVYTIGS 242 Query: 226 ----KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 A L Y S V ++ + G+RLA+ LN G Sbjct: 243 KPGCASDTAPVPLPAGYNQSAQAAVALQLKKAGVRLALELNRALG 287 >UniRef50_Q23AG7 Putative uncharacterized protein n=2 Tax=Tetrahymena thermophila RepID=Q23AG7_TETTH Length = 630 Score = 176 bits (446), Expect = 7e-43, Method: Composition-based stats. Identities = 57/298 (19%), Positives = 98/298 (32%), Gaps = 38/298 (12%) Query: 5 GHVMTCRIAQGLLNDEAAHAVK---MLLPEYVNGDLSAL--------CVWPDQVR-HWYK 52 H++ IA+ L L Y + + VW D ++ + Sbjct: 24 PHMLVLAIAKKELMKNDMEVYNITAKYLDTYSTQGVDTVSTTTYEENAVWADDIKVYGDA 83 Query: 53 YKWTSPLHFI-DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K H+I + N + A N L++ + + Sbjct: 84 QKAMEMWHYIGNKDSNPQNLTPLKKDPMAD---SENALNAYNNIVKVLTNEKFVGQMTEF 140 Query: 112 NMTEALLFLSHFMGDIHQPMHVG-------------FTSDAGGNSIDLRWFR-----HKS 153 + L L H +GDIH P H G F D GGN + ++ K+ Sbjct: 141 KVNM-LKMLVHIVGDIHMPHHTGSFYNATYKNDKGEFWGDLGGNRQMINFYTSTGEMKKT 199 Query: 154 NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 N+H +D + + +N + D I + N + +A Sbjct: 200 NIHFYFDSSCFFYTWTNRLVRPLNETFKIYFQRELDRIVAQYPKESLNIDNT-KTFSDWA 258 Query: 214 TESINIACKWGYKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 ES N+A Y + + + DD++NS ++ KR+ G RLA L +F + Sbjct: 259 DESWNLALNNVYPFLLSKNEIHYGDDFYNSSFDMIQKRIVTAGYRLAYTLQKLFTPEK 316 >UniRef50_Q1N3Y8 Probable endonuclease n=1 Tax=Bermanella marisrubri RepID=Q1N3Y8_9GAMM Length = 226 Score = 176 bits (445), Expect = 9e-43, Method: Composition-based stats. Identities = 54/258 (20%), Positives = 103/258 (39%), Gaps = 32/258 (12%) Query: 8 MTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDK 67 M A L A H ++ +L VW D ++ ++ PLH+++ P Sbjct: 1 MVAAAAWPQLTPYAKHQIESILGFG-REKFVNASVWADHIKSDQRFNHLKPLHYVNLPKG 59 Query: 68 ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDI 127 + + +RDC + C+ AI +F+ S A+ L H + DI Sbjct: 60 STQYKQQRDCPE-----GQCIVQAIYDFSE------YARSGSEREQAMAVRMLIHLIADI 108 Query: 128 HQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNF 187 HQP+H G+ D GGN ++++ + +LH +WD +++ +++ LL++ + Sbjct: 109 HQPLHAGYKEDRGGNWFEVKYQDYTLSLHKLWDHQLVERFHENWQQGSTELLKDMPKATL 168 Query: 188 TDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVM 247 K+A S + + Y+ E +S+ Y + Sbjct: 169 YS-------------------PEKWAEISHALVERSVYETQEN-RLVSEAYLEMADDVTH 208 Query: 248 KRVAQGGIRLAMLLNNVF 265 +++ RLAM LN ++ Sbjct: 209 RQLQLASWRLAMWLNQLW 226 >UniRef50_A4A822 Nuclease S1 n=1 Tax=Congregibacter litoralis KT71 RepID=A4A822_9GAMM Length = 293 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 60/269 (22%), Positives = 89/269 (33%), Gaps = 36/269 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W GH + +A L+ A + LL + L++ W D++R W Sbjct: 19 WGAMGHELAGTLAAPYLSANARAQIDALL---KDETLASASTWADRMRGDPDPFWQEEAG 75 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ PD A+Q F L T +R AL Sbjct: 76 PYHYVTVPDGQS-------YTQVGAPPQGDGYTALQQFRKDLRDPTTPTRRKRL----AL 124 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F H + D+ QP+HVG D GGN I + SNLH VWDR++ + + Sbjct: 125 RFALHIVQDLQQPLHVGNGRDRGGNQIRVAINGETSNLHSVWDRQLFESTGRSKETWLDY 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 D+ S + ES + + Sbjct: 185 FRRGDLLREP---------------NPADSDPLLWIRESAALRETLY----PVPTAIDRA 225 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 Y +LP +R+A +R A LN F Sbjct: 226 YIKQQLPRAEQRLALSAVRTAAWLNATFD 254 >UniRef50_A2E6R1 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2E6R1_TRIVA Length = 330 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 91/275 (33%), Gaps = 26/275 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY--VNGDLSALCVWPDQVRHWYKYKWTSP 58 W H + I+Q L + + +L D+ + WPD + Y K + Sbjct: 12 WWGHSHTIIAHISQNQLTHKQISNINRILSSSGFETTDIEKISSWPDDLI-EYNLKSMAE 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+ D P + D + V I + L T+ + + Sbjct: 71 WHYADKP-----YVPYEDFNFIKPPPTYNVTTYINDAWETLHD---PTTTDLWAWAFHIR 122 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNSI--DLRWFRHKSNLHHVWDREIILTAAK 169 L H++GDIH P H D GGN + W N+H +WD + Sbjct: 123 NLIHYVGDIHTPHHNIARFTVYHQNGDMGGNLYRLNCTWGDACKNIHFLWDSCALAFPIA 182 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 D + D+ N + ++ ++ ES IA GY + Sbjct: 183 DITN---PIYASDLAKN--SSLIEEEFPMSSFENMTSVDPRAWSLESYAIASTLGYA-LP 236 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + S DY + +R+A G RL +L + Sbjct: 237 SYSEPSQDYLYNARQAGKRRIAMAGYRLGYMLKEL 271 >UniRef50_Q2N7X6 Endonuclease n=3 Tax=Erythrobacter RepID=Q2N7X6_ERYLH Length = 276 Score = 173 bits (439), Expect = 4e-42, Method: Composition-based stats. Identities = 64/290 (22%), Positives = 106/290 (36%), Gaps = 48/290 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRHW-Y 51 W H +T IA+ + + A++ L PE L VWPD VR + Sbjct: 8 WGFFAHTVTGDIAEANIRPDTRAAMQRLFRAEGLLGTPECELKTLQDATVWPDCVRRMRW 67 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 ++ T+ H+ TP ++ ++C C+ I L+ + R Sbjct: 68 RWGHTAAWHYRTTPICEP-YEPWKNCPG-----GNCILAQIDRNQRILADESLPANVR-- 119 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF-RHKSNLHHVWDREIILTAAKD 170 +AL F+ HF+GD+H P+H G D GGN + + NLH +WD + A Sbjct: 120 --LQALAFMVHFVGDVHMPLHSGDKDDRGGNDRETDYGIAPGLNLHWIWDGPLAERAITS 177 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKG--- 227 + GI +D + ES I+ + Y Sbjct: 178 ARPSLVRRYSAAERAELAGGISAD-----------------WGRESWAISRDFVYPNAFD 220 Query: 228 --------VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 + L+ + + +P+ +RV Q G+R+A LL+ F Sbjct: 221 TDAVCETDLPGETALTQEDIVAAIPVSQRRVTQAGLRIARLLDEAFAPGP 270 >UniRef50_A7H7R9 S1/P1 nuclease n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7R9_ANADF Length = 285 Score = 173 bits (438), Expect = 5e-42, Method: Composition-based stats. Identities = 60/279 (21%), Positives = 100/279 (35%), Gaps = 35/279 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WS+ GH + IA+ L A V+ +L + + + W D R T H Sbjct: 28 WSEPGHRIVAAIAEERLGPSARRLVREVLGATPMSN-ADVAGWADAQRD----PATRAWH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P A FD RDC ++ CV A++ +L +A +L Sbjct: 83 YVNIPL-AAAFDPARDC-----PREACVVAALERAIAELRDGEGAAR-----RADAFRWL 131 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAAKDYYAKDIN 177 H + D+HQP+H G D GGN + R H VWD++++ + Sbjct: 132 VHLVADVHQPLHAGDGRDRGGNDLPTRRERARGQPRPFHRVWDQDVLGPILRRRGTV--- 188 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGET---- 233 I + A W ++A ES +A + Sbjct: 189 ----AAARALARDIGPAEAARWAARP----SPAEWADESHALARALYAELGPLPRDGRIV 240 Query: 234 -LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 L +Y + + ++ + G+RLA LL + A Sbjct: 241 LLPREYADRQRARTELQLQKAGVRLAALLERIAAARAVR 279 >UniRef50_B9XA25 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XA25_9BACT Length = 309 Score = 172 bits (436), Expect = 1e-41, Method: Composition-based stats. Identities = 56/298 (18%), Positives = 92/298 (30%), Gaps = 44/298 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN-------------GDLS-------AL 40 WS GH++ A L + V +L + + DLS Sbjct: 24 WSGAGHMVIAAEAYHELPERTRSKVDEILKAHPDYAKWVATHSKEKFADLSLSEYVFLRA 83 Query: 41 CVWPDQVRHW----YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFT 96 WPD++R + H++D P K F E + I Sbjct: 84 SKWPDEIRRAKGQGSRSYDHPHWHYVDYPLKPTKFPLEP-----GPSPKDDLLYGIAQCE 138 Query: 97 TQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH-------VGFTSDAGGNSIDLRWF 149 L + ++ L +L H +GD+HQP+H D GGN ++ Sbjct: 139 KNLCDSKASPEEK----AVYLSYLIHLVGDVHQPLHCCSLVNETYPNGDKGGNDFYVKPG 194 Query: 150 RHKSNLHHVWDREIILTAA-KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC 208 LH WD + ++ + I LL + + + + W G + Sbjct: 195 NKGIKLHSFWDGLLGTSSKPQTQIYYAIELLHDHPRKSLPELAKATTPKDWSLEGRQIAI 254 Query: 209 VNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 + IN C + L +Y + R A G RLA + + Sbjct: 255 DKAYLRADINGGCGTSEQNA---CELPSNYTKEAKAVAENRAALAGYRLADEIQMLIK 309 >UniRef50_UPI0001745ECB hypothetical protein VspiD_30620 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ECB Length = 323 Score = 172 bits (435), Expect = 1e-41, Method: Composition-based stats. Identities = 55/314 (17%), Positives = 92/314 (29%), Gaps = 55/314 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL------------------PEYVNGDLSALCV 42 W GH++ +A L+ + LL + Sbjct: 24 WWGTGHMVVTSVAWRQLSQQEQEQAHALLKAHPKYNDWMSSYPADVPGLSKGLYAAMAAS 83 Query: 43 -WPDQVRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH 101 W D +R H++D P +F + V I+ ++ Sbjct: 84 LWADDIRDKNNPATHPEWHYVDYPLVPPHFP-----KEPAPNPTNDVLVGIKECERVIAS 138 Query: 102 YREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG---------FTSDAGGNSIDLRWFRHK 152 T ++ E + +L H +GD+HQP+H D GGNS +R + Sbjct: 139 PTTSTQEK----GEMVSWLIHLVGDVHQPLHCASLTNDDFPAPEGDRGGNSAFVRPDKQS 194 Query: 153 S--NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVN 210 NLH VWD ++ D E + + ++ Sbjct: 195 KAINLHMVWDSQL-----GGARVADAGSSREALNKAIL--LETEHPRVAAAELQKSPSPE 247 Query: 211 KFATESINIACKWGYKGVE---------AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 ++ E +A + Y L + Y I +RV G RLA +L Sbjct: 248 SWSLEGRELAIQEAYLHGNLRYAVGKQLNAPVLPEGYTKKARAISERRVTLAGYRLADML 307 Query: 262 NNVFGASQQEDSVV 275 + S E Sbjct: 308 KRLLAVSTAEPERA 321 >UniRef50_Q5ZV70 3'-nucleotidase/nuclease n=6 Tax=Legionella RepID=Q5ZV70_LEGPH Length = 285 Score = 171 bits (434), Expect = 2e-41, Method: Composition-based stats. Identities = 65/282 (23%), Positives = 100/282 (35%), Gaps = 38/282 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQVRHWYKYKW 55 W+ GH + +IA L ++ + L + W D +R W Sbjct: 28 WNAIGHQLVAQIAYDNLTPQSRR-MCDLYSHSKSKTSSNVNFVKSASWLDSIRAHD-VHW 85 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 LH+ID P + D + + D+ I LS + +D++ Sbjct: 86 FDALHYIDIP-------FSMDETELPVLTDINALWGINQAIAVLSSKKASIADKKL---- 134 Query: 116 ALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 +L L H +GDIHQP+H D GGN L +NLH WD + Sbjct: 135 SLRILVHLVGDIHQPLHTVTKISKKLPKGDLGGNLFQLAKNPIGNNLHQYWDNGGGILIG 194 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 +D + + N + WS AS ++ S +A YK V Sbjct: 195 QDKFFQIKNK------ARQLEKKWSCQSAS------KEKNPQQWINASHQLALTKVYK-V 241 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 A + Y + I K++ G RLA LLNN+ + Sbjct: 242 SAHQVPGKQYQLNTQNITEKQILLAGCRLAYLLNNIAEGKNK 283 >UniRef50_A2E030 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2E030_TRIVA Length = 372 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 46/285 (16%), Positives = 96/285 (33%), Gaps = 36/285 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQV-----RHWYKY 53 W H M R++ L D + +L + + + W D++ R Sbjct: 12 WWNGPHEMVARVSWNDLTDRQQKIIYKILLTWPDEQKLFTNCGSWLDEIAAKYNRGTDLI 71 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 P HF+D P D + ++ + A+ + + T+ + + Sbjct: 72 SHFKPWHFVDFPL----IDGCENFEEKDTPFVYNITSALNHIISSFLD---PTTKSLWAI 124 Query: 114 TEALLFLSHFMGDIHQPMHVGFTS---------DAGGNSIDLRWFRHKSNLHHVWDREII 164 + L H + D+H P+H D G N L + NLH +WD + Sbjct: 125 NFDIRMLLHLVADVHTPVHCIDRYTPSSGTCKADHGANFFSLSLSINGKNLHSLWDSAVY 184 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWG 224 + + + L + + + + ++ V +A S IA ++ Sbjct: 185 AYPTGSFSEEMVQKLIFEYKDKIPEDSYVQNM-----------NVTAWALHSYEIAKEYV 233 Query: 225 YKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y G++ + + +D Y P ++ R+A +++ Sbjct: 234 YNGLKLNQYVGENDAYVTRAQPQAKAQIILASKRMAYIIDQFVKK 278 >UniRef50_UPI000150A357 hypothetical protein TTHERM_00515230 n=1 Tax=Tetrahymena thermophila RepID=UPI000150A357 Length = 389 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 50/291 (17%), Positives = 97/291 (33%), Gaps = 28/291 (9%) Query: 3 KEGHVMTCRIAQGLL---NDEAAHAVKMLLPEYVNGD------LSALCVWPDQVRHWYK- 52 H++ IA+ L + E + ++ +W D +++WYK Sbjct: 26 DLPHMLILGIAKETLIEKDPEIIQIAEKYFDQFEEPHQKGQVQFEEHSIWSDDIKYWYKS 85 Query: 53 -YKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K+ H+ID N+ + ++ + A L + Sbjct: 86 SVKYWDTWHYIDQIYNPSNYPID---VNKQKDSNSNAQVAFNQIKETLKNKNLNGKITVM 142 Query: 112 NMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRW-FRHKSNLHHVWDREI 163 L L H +GDIHQP+H D GGN ++ K+NLH +D Sbjct: 143 KHIF-LKHLVHLVGDIHQPLHTVSFYSYQFQNGDLGGNKQMVQLSDNRKNNLHFYFDSGA 201 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 +D + N D + + + +++ ES I+ + Sbjct: 202 FYYTFEDRIHRPFNESFIDYFEEEIARLIKLYPREELKINDEDIQFDQWVKESYMISIEQ 261 Query: 224 GYKGVEAG-----ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 Y ++ ++D+ + K++ + G RLA +L + + Sbjct: 262 IYSQIDLTGNQKINKITDENHRKNQELCQKQIVKAGYRLANILVDFLKDEK 312 >UniRef50_UPI00006CE90A hypothetical protein TTHERM_00559790 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CE90A Length = 482 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 59/303 (19%), Positives = 98/303 (32%), Gaps = 39/303 (12%) Query: 5 GHVMTCRIAQGLLNDEAAHAVK---MLLPEYVNGDLSAL--------CVWPDQVR-HWYK 52 H++ IA+ L K L + + + VW D ++ + Sbjct: 24 PHMLILGIAKRELMKNDQEIYKITAKYLDTFSASGIETISTTSYEENAVWGDDIKTYGDA 83 Query: 53 YKWTSPLHFI-DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K HFI + N +D A N + + Sbjct: 84 QKAMGMWHFIGNKDSNPENLTLVKD----PMADSENALNAYDNIVKTFKNKSFIGKITEF 139 Query: 112 NMTEALLFLSHFMGDIHQPMHVGF-------------TSDAGGNSIDLRWFR-----HKS 153 + L L H +GDIH P H G D GGN ++++ + Sbjct: 140 KI-MMLKMLVHLVGDIHMPHHTGSYYNSTIVGPNKEIWGDRGGNRQKIKFYTSTGKKEST 198 Query: 154 NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 ++H +D K + +N + D I + N N +A Sbjct: 199 DIHFYFDSSCFYYNWKSRLQRPLNDTFKAYFEAELDRIMTQYPKETLNINNA-QTFNDWA 257 Query: 214 TESINIACKWGYKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 ES NIA Y + + D ++NS ++ KR+ G RLA L N+F A + + Sbjct: 258 EESWNIALTEVYPFLLKNNEIRFGDAFYNSSFDMIQKRIVIAGYRLAYTLQNMFAAEKGK 317 Query: 272 DSV 274 + Sbjct: 318 IDL 320 >UniRef50_O68530 Endonuclease S1 homolog n=1 Tax=Mesorhizobium loti RepID=O68530_RHILO Length = 309 Score = 165 bits (417), Expect = 1e-39, Method: Composition-based stats. Identities = 72/300 (24%), Positives = 113/300 (37%), Gaps = 43/300 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD------LSALCVWPDQVRHWYKYK 54 W +EGH IAQ L A+ V+ LL ++ ++++ W D R +K Sbjct: 22 WGQEGHAAVAEIAQHRLTSSASDVVQRLLRAHLGLTGQQVVSMASIASWADDYR-ADGHK 80 Query: 55 WTSPLHFIDTPDKA--------CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 TS HF+D P + ++D RDC D C+ A+ LS + Sbjct: 81 DTSNWHFVDIPLASLPGGSSATTDYDAIRDCAD-DATYGSCLLKALPAQEAILSDATKDD 139 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHV-----GFTSDAGGNSIDLRWF-----------R 150 R +AL F+ H GD+ QP+H G D GGN++ + + R Sbjct: 140 ESR----WKALAFVIHLTGDLAQPLHCVQRVDGSQKDQGGNTLTVTFNVTRPAPDNSTFR 195 Query: 151 HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWR-ECGNVFSCV 209 + H VWD ++I D+ E+ + D + D W EC Sbjct: 196 DFTTFHSVWDTDLITFKYYDWGLA-AAEAEKLLPTLAADLLADDTPEKWLAECHRQAEAA 254 Query: 210 NKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 + + G+ L YF P+V +++A GG+ LA LN + Sbjct: 255 YQALPAGTPLKSDIGHP-----VILDQAYFEKFHPVVTQQLALGGLHLAAELNEALKGGK 309 >UniRef50_A9UZI8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UZI8_MONBE Length = 179 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 71/156 (45%), Positives = 93/156 (59%), Gaps = 4/156 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH T IA+ LL ++AA V +L N + ++ W D VR + W++PLH Sbjct: 26 WGPIGHQTTAAIAETLLTEKAATTVAQIL---DNASMVSVSTWADDVRSTSAWAWSAPLH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 FIDTPD+ C+FDY RDC + G D CVAGAI N+T QL + EAL F+ Sbjct: 83 FIDTPDRVCSFDYSRDCQN-DGRPDFCVAGAIVNYTRQLELAVAQGRLQDETTQEALKFV 141 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLH 156 HF+GDIHQP+HV FTSD GGN +++ +F NLH Sbjct: 142 IHFLGDIHQPLHVSFTSDEGGNLVNVTFFGEPENLH 177 >UniRef50_Q8ILX4 p1/s1 nuclease, putative n=4 Tax=Plasmodium RepID=Q8ILX4_PLAF7 Length = 320 Score = 163 bits (413), Expect = 5e-39, Method: Composition-based stats. Identities = 49/303 (16%), Positives = 96/303 (31%), Gaps = 34/303 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDL---SALCVWPDQV---------- 47 WS E H++ IA LND + + + +W D++ Sbjct: 19 WSDEPHMLISYIAYINLNDGEKEILNRIFQNGNDAIFDNPITASIWADKIKPNNHKRTFH 78 Query: 48 ----RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 R + H++ Y H + G +++ L R Sbjct: 79 SSNFRRNELLDIFNEWHYVQLNYNPMKI-YIAPYHLRAHKGKHNAMGILKHIYRILIEVR 137 Query: 104 EG-TSDRRYNMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNL 155 + Y+ L F H D+HQP+H D GG I + + + L Sbjct: 138 QKMGHGTYYSYNFYLRFFIHIFSDLHQPLHAINFFNSNYPNGDRGGTDISVNYKGSINKL 197 Query: 156 HHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATE 215 H++ D I T K + ++ +E D + + ++ A E Sbjct: 198 HYLCDN-IFKTRKKQWPNINMTNIERDARYLMSTYPPESFGNKLFLPHDKIKYIDDIAHE 256 Query: 216 SINIACKWGYKGVEAG-------ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 S +IA + Y +++ + + ++ ++ G RL+ L ++ Sbjct: 257 SHDIAVQNIYSFFPLTDLKRSEQYSINQHFVINTKKLLNSQMVLAGYRLSAYLKDIIANI 316 Query: 269 QQE 271 + Sbjct: 317 PPD 319 >UniRef50_B3LAP6 Putative uncharacterized protein n=1 Tax=Plasmodium knowlesi strain H RepID=B3LAP6_PLAKH Length = 331 Score = 163 bits (411), Expect = 9e-39, Method: Composition-based stats. Identities = 50/300 (16%), Positives = 99/300 (33%), Gaps = 30/300 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQV--------- 47 WS EGH++ IA L D+ ++ + Y D VW D + Sbjct: 24 WSDEGHLLISAIAYEGLTDDEKFVLQTIFKNYKEDNDFNDPVTAAVWADHIKPIDYHYTT 83 Query: 48 --RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH-YRE 104 R + + H+ P N ++ K +++ T L + ++ Sbjct: 84 KVRRIGGLELMNKWHYTSNPYNPTNIPLNE-YRKKYYQKTDNALSVLKSIFTSLKNMNKQ 142 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHH 157 ++ L + H GDIH+P+HV D G I++++ + LH+ Sbjct: 143 ENHGTFFSYNFNLRYFIHIFGDIHEPLHVVEFFNKHFPEGDNGATLINIKYNNNVEKLHY 202 Query: 158 VWD------REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNK 211 + D T+ ++ N L + + +DL+ + + Sbjct: 203 LCDCVFHTRSRRWPTSGMKEMLEEGNALMKMYPPEYFGDRLKNDLSDLEYLDFIVNDSYT 262 Query: 212 FATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 A I + L + + ++ +++A GG RL L + + Sbjct: 263 KAVNDIYSNFPHDTLNSKTPYVLDNSAVDKLKKMLNEQIALGGYRLRRYLKIMIENVPDD 322 >UniRef50_Q4PFZ0 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PFZ0_USTMA Length = 397 Score = 162 bits (410), Expect = 1e-38, Method: Composition-based stats. Identities = 64/374 (17%), Positives = 119/374 (31%), Gaps = 109/374 (29%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----------------GDLSALCVWP 44 W GH + IAQ L+ + +LP Y L+ L WP Sbjct: 35 WGIAGHQIVATIAQTQLHPLVREQLCTILPNYTRYPSHWPTSEDSKPRTHCHLAVLAGWP 94 Query: 45 DQVRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE 104 D +R +Y W+ LH+++ + + + V ++ N+T+++ Sbjct: 95 DTIRS--RYPWSGQLHYVN--PVDDHPPSQCLYGETGWTSPNNVLTSMVNYTSRVV---- 146 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREII 164 ++ + AL F+ H GD HQP+H+ + GGN + + + K+ LH VWD +I Sbjct: 147 --TETGWQRDMALRFMVHLFGDAHQPLHLTGRA-RGGNDVWVHFEGRKARLHTVWDTLLI 203 Query: 165 LTAAKDYYAKDINLLEEDIEGNFT--------------------------DGIWSDDLAS 198 ++ L IE D W + + Sbjct: 204 DKQIRELSNYTTRLPSGRIESALVGARYDPLIRFILKEGLGQPASRGQEGDAWWKQESSG 263 Query: 199 WRECGNVFS--------------------------------CVNKFATESINIACKWGYK 226 W C S C ++ ++ C + + Sbjct: 264 WPACQGQRSEIGALTQEYEGQLALSSISEDPHRVDNTVLPICPYEWTRPMHSLVCTYAFA 323 Query: 227 GVEAGETLS----------------------DDYF--NSRLPIVMKRVAQGGIRLAMLLN 262 + +Y R ++ K++A+ G+RLA +LN Sbjct: 324 APVPAWEPAPPPGQGEPEPSPTPVPEPELDVPEYVGRIERDKVIHKQLAKAGLRLAAVLN 383 Query: 263 NVFGASQQEDSVVA 276 + ++ + A Sbjct: 384 TLLLPAEVDSLRSA 397 >UniRef50_O65425 Putative bifunctional nuclease n=1 Tax=Arabidopsis thaliana RepID=O65425_ARATH Length = 454 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. Identities = 73/143 (51%), Positives = 98/143 (68%), Gaps = 2/143 (1%) Query: 16 LLNDEAAHAVKMLLPEY-VNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDYE 74 D+ AVK LLPE G L+ C WPD+++ +++WTS LH+++TP+ CN++Y Sbjct: 4 FFEDDTIAAVKKLLPESVDGGGLADFCSWPDEIKKLSQWQWTSTLHYVNTPEYRCNYEYC 63 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR-RYNMTEALLFLSHFMGDIHQPMHV 133 RDCHD H KD CV GAI N+T QL E + + YN+TEALLFLSH+MGD+HQP+H Sbjct: 64 RDCHDTHKHKDWCVTGAIFNYTNQLMSASENSQNIVHYNLTEALLFLSHYMGDVHQPLHT 123 Query: 134 GFTSDAGGNSIDLRWFRHKSNLH 156 GF D GGN+I + W+ +KSNLH Sbjct: 124 GFLGDLGGNTIIVNWYHNKSNLH 146 >UniRef50_A2FAR0 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FAR0_TRIVA Length = 326 Score = 161 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 92/280 (32%), Gaps = 27/280 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQVRHWYKYKWTSP 58 W E H R+A+ +L+ + +L + + W D ++ P Sbjct: 12 WWGEPHYFIARLAESMLSASEVKYLNRVLATWESEKAVFHDTGNWHDDLK-PIGMPLMVP 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P N++ V ++ LS + ++ + + Sbjct: 71 WHFRNQPVVDPNYNL------VTYPVTYNVTQVNKDC---LSAIYDTSTTSMWILGFCFR 121 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNS--IDLRWFRHKSNLHHVWDREIILTAAK 169 L+HF+ D H P+H D G LH VWD + Sbjct: 122 SLAHFVADAHCPVHASCYFSADYPNGDGGATKEKFVCPVDEVCDKLHFVWDSGSLNFQTW 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLAS-WRECGNVFSCVNKFATESINIACKWGYKGV 228 + E ++ +W++ +++ +++ ++A ++ Y Sbjct: 182 PIPESLVKEAEYNL-----SHLWTNYPPEKHYSSTYNSIDPDQWQSDAYDVAKEYVYGLY 236 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + G ++ +YFN P K ++ RL +L F Sbjct: 237 QFGHNVTGEYFNKTQPPAAKLISVAAYRLGKVLQTFFHKR 276 >UniRef50_D0NJT7 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NJT7_PHYIN Length = 343 Score = 160 bits (404), Expect = 5e-38, Method: Composition-based stats. Identities = 68/312 (21%), Positives = 113/312 (36%), Gaps = 48/312 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY-----VNGDLSALCVWPDQVRHW----- 50 W GH++ +A+ L+++ ++ +L ++ G+++ VW D ++ Sbjct: 27 WWDNGHMLVGEVAKQLMSEADVVTIESVLSKWNEDFPNTGEITTSAVWMDLIKCTSVSSY 86 Query: 51 ------YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE 104 S H+ID P +E D +D A L Sbjct: 87 CQSPLAPSITSMSDWHYIDLPVNINGDKWEYKDADLSLFEDTMGGDAASVIEGALRS--L 144 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT-------SDAGGNSIDLRWFRHKSNLHH 157 T+ + + H GD+HQP+H D GGNS SNLH Sbjct: 145 KTTKSSWAANLFIRNFIHIFGDLHQPLHTVAGVSEAFTEGDGGGNSEYFASPCAFSNLHA 204 Query: 158 VWDREIILTAAKDYYAKDINLLEEDIEGNFTDGI-----WSDDLASWRECGNVF------ 206 VWD L + ++ A +I+ + ++ N TD I SD L + + Sbjct: 205 VWDAAGGLYSLNNW-ALNIDDFKSTLQSNATDLIALLLNISDTLDFSQYENTTYNELYTA 263 Query: 207 ----SCVNKFATESINIACKWGYKGVEAGET-------LSDDYFNSRLPIVMKRVAQGGI 255 S + + E+ + A Y G++ T S Y I KR+A GG Sbjct: 264 LVTNSALREVILETYSYADTVVYSGLDLNATSSGKYPCPSSSYLTLAGEISQKRIAIGGS 323 Query: 256 RLAMLLNNVFGA 267 RLA++L + Sbjct: 324 RLAIILKHFAAQ 335 >UniRef50_B6ABV1 Putative uncharacterized protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6ABV1_9CRYT Length = 433 Score = 159 bits (402), Expect = 9e-38, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 111/312 (35%), Gaps = 49/312 (15%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVW--------PDQVRHWYKY 53 +GH A L H +K L+ D+ + W P + ++Y Sbjct: 22 DADGHSAIAMTAMSGLKGNTLHQLKRLM---NGKDIVDISAWGERVSQKHPSTMPFHFQY 78 Query: 54 KWTSPLHFI--------------DTPDKACNFDYERDCHDQH-----GVKDMCVAGAIQN 94 + + LHF D + ++ C++ C+ I++ Sbjct: 79 QDMNELHFDKFLPESAPQMFGLGDGTRSFSHTYSDKYCNEVGASAECKETGHCLVPMIKH 138 Query: 95 FTTQLSHYREG----TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID----L 146 ++L + ++++ FL + +GD+HQP+H GFT G + Sbjct: 139 LYSRLIGLDRNKISYPEGIQLTDSDSVKFLVNLIGDLHQPLHFGFTESNAGRDFHGHLII 198 Query: 147 RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF 206 +L +W++ +I + I+ + W+E G Sbjct: 199 NGTEETISLFEIWEKGLIQKLKIEKPQFWYGGWTHVFA---IRDIFDKETILWKERG--I 253 Query: 207 SCVNKFATESINIACKWGYKGVEAGETLSDDYF------NSRLPIVMKRVAQGGIRLAML 260 ++ +A ESI I C + E L++++ + I+ R+ G RL+++ Sbjct: 254 DIIDDWARESIQIMCSALFIHPLNQEKLTNNFNIDPLLEFAWFEILRSRLLIAGARLSIV 313 Query: 261 LNNVFGASQQED 272 LN++ + ++ Sbjct: 314 LNDILKYREGKE 325 >UniRef50_Q560K3 Putative uncharacterized protein n=2 Tax=Filobasidiella neoformans RepID=Q560K3_CRYNE Length = 393 Score = 159 bits (401), Expect = 1e-37, Method: Composition-based stats. Identities = 66/223 (29%), Positives = 90/223 (40%), Gaps = 33/223 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH M IAQ L + +LPE N L+ + W D VR+ +Y+ T+P+H Sbjct: 20 WGAAGHEMVATIAQIHLFPSTRAKLCSILPEEANCHLAPVAAWADIVRN--RYRGTAPMH 77 Query: 61 FI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 +I D P C F +D+ V AIQNFT + + G Sbjct: 78 YINARNDHPQDHCEFGQH-----GWQNEDVNVITAIQNFTRLIMDGKGGKDVD-----IP 127 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 L FL HF+GD HQP+H+ D GGN + + NLH VWD II ++ Sbjct: 128 LRFLVHFIGDSHQPLHLA-GRDKGGNGAKFLFEGRERNLHSVWDSGIITKNIRELSNYTS 186 Query: 177 NLLEEDIEG----------------NFTDGIWSDDLASWRECG 203 L + IE W D++ SW C Sbjct: 187 PLPSKHIERCLPGAIFDPYVRWIVWEGIRLWWRDEVDSWISCP 229 Score = 51.7 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 16/71 (22%), Positives = 27/71 (38%), Gaps = 9/71 (12%) Query: 207 SCVNKFATESINIACKWGYKGVEAGETL-------SDDYFNS--RLPIVMKRVAQGGIRL 257 SC + + + C + G+ +D+Y R I+ K +A G+RL Sbjct: 311 SCPYHWISPIHQLNCDIVWPSKYTGQPNEPLIELDTDEYLGEIGRQKILEKMIAMAGLRL 370 Query: 258 AMLLNNVFGAS 268 A +LN Sbjct: 371 AKVLNEALAEE 381 >UniRef50_A3FPP7 S1/P1nuclease, putative n=2 Tax=Cryptosporidium RepID=A3FPP7_CRYPV Length = 416 Score = 156 bits (395), Expect = 5e-37, Method: Composition-based stats. Identities = 55/296 (18%), Positives = 107/296 (36%), Gaps = 35/296 (11%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 EGH L + + ++ L+ D+ + W + R K+ T P HF Sbjct: 24 DAEGHSAIGMTTISGLQNNFSQKLRRLM---NGKDIVDISGWGE--RVSKKHPSTLPFHF 78 Query: 62 IDTP--DKACNFDYERDCHDQ--------HGVKDMCVAGAIQNFTTQLSHYREG-----T 106 D N + D ++ C+ I++ +L Sbjct: 79 QGQSKGDYFKNGELGNDFKEKFILKSDSNCKHTGHCLVPMIKHLYYRLIGDNSKFKINYP 138 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID----LRWFRHKSNLHHVWDRE 162 + ++++ FL + +GD+HQPMH GF D G I + + +L +W+ Sbjct: 139 EGIQLTDSDSIKFLINLIGDLHQPMHFGFIEDGLGREIKGMMSINGTNERLSLFEIWESG 198 Query: 163 IILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 I + + I+ +L W+E G +N +A E+ I Sbjct: 199 IARKLKTEKPQFWFGGWTHILA---IRDIFDKELLLWKERG--IEMINDWAKENFEIVTN 253 Query: 223 WGYKGVEAGETLSDDYF------NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 Y + + + D++ + L I R+ G RL+++LN++ + ++ Sbjct: 254 EIYFHPISKQPIIDNFNVDVTLEFAWLEIFRSRILIAGARLSIILNDILKLREGKE 309 >UniRef50_A0Z194 Endonuclease S1 n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z194_9GAMM Length = 275 Score = 156 bits (394), Expect = 8e-37, Method: Composition-based stats. Identities = 63/282 (22%), Positives = 108/282 (38%), Gaps = 48/282 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH C A + A+ LL L LC W D+++ + T H Sbjct: 30 WWDDGHQQVCEQAVAQVQPATLAAIADLLDAP----LGELCSWADEIKG--QRPETRQWH 83 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P + + + A+ +L H EALL++ Sbjct: 84 YLNAPPD------TLSIGNAPRPEGGDIIAALNEQIHRLKHAPTN------QRREALLWV 131 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWF----------RHKSNLHHVWDREIILTAAKD 170 H +GD+HQP+H+G+ SD GGN+ L R + ++H VWD I+ + Sbjct: 132 GHLIGDLHQPLHLGYASDLGGNTYRLELPEELALQLNEKRERVSMHAVWDGLILRYQDQP 191 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC--KWGYKGV 228 A +E + N I + +A E++++ K Y+ Sbjct: 192 SVAATATPIERPLLLNPEVEIIA------------------WADETLSVLNDAKVHYRHG 233 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 +TL+ Y S V ++ + RLA LL+ F S++ Sbjct: 234 TRLQTLTSQYLISNRSAVDLQIRRAATRLAALLDWAFSQSKR 275 >UniRef50_B8P2Q4 Predicted protein n=2 Tax=Postia placenta Mad-698-R RepID=B8P2Q4_POSPM Length = 753 Score = 156 bits (393), Expect = 9e-37, Method: Composition-based stats. Identities = 60/240 (25%), Positives = 94/240 (39%), Gaps = 31/240 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPE-------------YVNGDLSALCVWPDQV 47 W GH + IAQ L+ + +L Y L+ + W D+V Sbjct: 323 WGAAGHEIVATIAQIHLDPSVLPVLCDILYPPSSSSHKASTSSAYPPCHLAPIAAWADRV 382 Query: 48 RHWYKYKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 R Y+WT+PLH++ D P +C F G ++ V A+ N T Q++ Sbjct: 383 RGSPAYRWTAPLHYVGAVDDAPADSCAFPGPNGWA---GRHNINVLAAVSNKTGQVA-AF 438 Query: 104 EGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREI 163 + EAL +L HFMGD+H P+H+ + GGN + + SNLH VWD + Sbjct: 439 LSGEAGLHEGEEALKYLVHFMGDMHMPLHLT-GKERGGNGAKVTFDGRVSNLHSVWDNLL 497 Query: 164 ILTAAKDYYAKDINLLEEDI--EGNFTDGIWSDDLASWRECG-------NVFSCVNKFAT 214 I A + L + E + I+ + G F+ V ++ Sbjct: 498 IAQALRTVPPNYTWPLPDMRGVEAHLRGAIYDPYIRRIIYEGFGTDAVAGRFTDVEEWLD 557 >UniRef50_C9SGH7 Nuclease PA3 n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SGH7_VERA1 Length = 303 Score = 154 bits (389), Expect = 3e-36, Method: Composition-based stats. Identities = 47/282 (16%), Positives = 85/282 (30%), Gaps = 23/282 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ + H A+ L+ A + +L L + W D R + + T+ H Sbjct: 21 WNTDIHQQIGFAAEKFLSPAAKAILSEILEPESGASLGRIGAWADAHRGTPEGRHTTTWH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D P CN Y RDC C+ A+ N T L D + Sbjct: 81 WINPADQPPSFCNVHYNRDC-----TSGGCIVSALANETQILKSCIRSVKDASLSAAPTP 135 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 + V D + +S + + I Sbjct: 136 RAPTPPT--------VFPVVDREEEKF-VYLTPARSGTAPL--STCSAANVTGFPNTTIQ 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKGVEAGETL 234 D+ + W C +C ++A ++ C + + L Sbjct: 185 PFFSDMVDRIRADTYFVPTRDWLSCTDPSTPLACPLEWARDANQWNCDYAFSQNTNASDL 244 Query: 235 -SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVV 275 + Y PI ++A+ +R+A N + + ++ VV Sbjct: 245 RTSGYAEGAWPIAELQIAKAVLRIATWFNKLADCNFKDREVV 286 >UniRef50_D2QW83 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QW83_9PLAN Length = 338 Score = 153 bits (387), Expect = 5e-36, Method: Composition-based stats. Identities = 61/318 (19%), Positives = 104/318 (32%), Gaps = 58/318 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----------------------L 37 W+ +GH + IA L E A+ +L ++ Sbjct: 28 WNAKGHRLVAAIAYRSLTPEDRDALIEILKQHPRFAADFERQMPDVVKSGTKDQQQEWLF 87 Query: 38 SALCVWPDQVR----HWYKYKWTSPLHFIDTPDKACNFDYER----------DCHDQHGV 83 VWPD +R H+I+ P + + V Sbjct: 88 GHAAVWPDYIRGFKGEESDKYHRPTWHYINWPHYLSDAEAAELAMPPMVNRHLDPAMTPV 147 Query: 84 KDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF-------- 135 + + +I +Q + +R + +L H MGD+HQPMH Sbjct: 148 LEQNLMQSIARLRSQFVDSKYSAEER----AVMICWLLHTMGDLHQPMHGASLFCKPLFV 203 Query: 136 TSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAK--DINLLEEDIEGNFTDGIWS 193 D GGNSI R NLH VWD + + + + L ++ T S Sbjct: 204 QGDRGGNSILTRQSG---NLHAVWDNALGNDDSFREVNRHATLLLATPEMTKIGTASQAS 260 Query: 194 DDLASWRECGNVFSCVNKF--ATESINIACKWGYKGVE--AGETLSDDYFNSRLPIVMKR 249 + +W E + + + + A S K V+ L++DY + + +R Sbjct: 261 IEQKTWLEESHALAVEHVYDQAVLSHVRVQMLTAKNVDDFPPLMLNEDYLRNSSKVSERR 320 Query: 250 VAQGGIRLAMLLNNVFGA 267 + G R+A +L + Sbjct: 321 SVEAGYRIAAVLRQLLHP 338 >UniRef50_B2JAU7 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2JAU7_NOSP7 Length = 332 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 55/307 (17%), Positives = 96/307 (31%), Gaps = 54/307 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKM-------------------------LLPEYVNG 35 W+K GH+++ IA L + + PE N Sbjct: 41 WNKSGHMVSGAIAYSELKQSNQQNLDKVVAILKEHPEYSKFEQQWNSLNQSNISPEDKNL 100 Query: 36 DLSALCV-WPDQVRHWYKYKWTSPLHFIDTPDKA--CNFDYERDCHDQHGVKDMCVAGAI 92 L W D+ R ++ H+I+ P + + R+ D+ + A Sbjct: 101 YLFMWAAKWADEARDNPEFNH-PTWHYINFPYQPGRASNSIPREIPDEE-----NIIFAF 154 Query: 93 QNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG---------FTSDAGGNS 143 Q + + A+ +L H +GD+HQP+H D GG Sbjct: 155 QKNLDVVKSNASNSDK-----AVAICWLFHLIGDVHQPLHTTKLITNQYPQPEGDRGGTR 209 Query: 144 --IDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE 201 I ++ +LH WD I+ + L + N + +W Sbjct: 210 FYIRVKPNSQTISLHKFWDDLILGSERFQAVRNAATSLRSSYQRNKLPELRETKFNNWA- 268 Query: 202 CGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 ++ G G+ L +Y + I +R++ G RLA +L Sbjct: 269 ---KLESFRIAKQDAYLNGKLSGSSDKNDGKLLPANYAATAKQIAQRRMSLAGYRLADVL 325 Query: 262 NNVFGAS 268 N + G Sbjct: 326 NQLLGQR 332 >UniRef50_A6C3P1 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3P1_9PLAN Length = 330 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 61/312 (19%), Positives = 98/312 (31%), Gaps = 58/312 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----------------------L 37 W+ GH + IA L E A+ LL ++ + Sbjct: 24 WNYAGHRVIASIAWDQLTPETQAAMIALLKQHPRFEQDFQSRMPEVILKASPAVQDRWLF 83 Query: 38 SALCVWPDQVRH----WYKYKWTSPLHFIDTPDKACN-----------FDYERDCHDQHG 82 WPD R + H+I+ P + + Sbjct: 84 MRAATWPDIARSFKEADREKYHHGTWHYINQPIYLDTASELSLSSKLPVNTAKSIRQGDD 143 Query: 83 VKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH--------VG 134 + A++ Q+ +D+ AL ++ H GD HQP+H Sbjct: 144 PLQFNILQALEYNVAQMKDPAVSEADK----ALALCWIMHLTGDSHQPLHSSALFSKGSF 199 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 D GGNSI + KSNLH WD + + L D + Sbjct: 200 PEGDRGGNSIRI----GKSNLHAQWDGLLGNSFKDSEIVSQAVGLARDPALKQLGEQATK 255 Query: 195 DL--ASWRECGNVFSCVNKFATESINIACKWGYKGVE--AGETLSDDYFNSRLPIVMKRV 250 +L A W + + + + + A + E + L Y+ + I +KR Sbjct: 256 NLNYADWIDESHALAKSAGYTQLILAAAKQNDSPQNEFLKLKDLPAAYYRTAGAIAVKRA 315 Query: 251 AQGGIRLAMLLN 262 AQ G RLA ++N Sbjct: 316 AQSGWRLAAVIN 327 >UniRef50_C5LKE6 Putative uncharacterized protein n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5LKE6_9ALVE Length = 342 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 65/291 (22%), Positives = 122/291 (41%), Gaps = 41/291 (14%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 + H + +A L D+ + ++L LS W ++ + W + L Sbjct: 17 GSDFHAVVVELADLRLADKTRQELSIMLGNDYR--LSTTANWAARL----NFPWLADL-- 68 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 + CNF Y RDC + C+AG+I N+T ++ T +R +EA+ FL Sbjct: 69 STAYNDHCNFSYARDCTN----NGRCLAGSIWNYTNRMIDPYLSTKER----SEAVKFLV 120 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWF-RHKSN--LHHVWDREIILTAA-----KDYYA 173 H + D H P+ G +SD GG I++ SN L W +I+ Y Sbjct: 121 HLVADAHLPLSAGRSSDQGGKKINVHINFADFSNVDLSKAWREKILDEMQGALYPGKYVQ 180 Query: 174 KDINLLEEDIE---------GNFTDGIWSDDLASWRECGNV---FSCVNKFATESINIAC 221 +D N ++ G D ++ + SW +C++ E+ ++AC Sbjct: 181 QDSNSSSHRMKFWRVTSNSIGADLDQKYAGMVPSWLAECTQHGINACIDMILNEAADLAC 240 Query: 222 KWGYKGVE-----AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 + Y+ ++ + LS +Y+ SR+ ++ +++A+ RL +++ F Sbjct: 241 RIAYRNMDGRDIQNNDDLSREYYTSRIGMLREQLAKAATRLGWIMDEAFKN 291 >UniRef50_Q47K45 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47K45_DECAR Length = 301 Score = 150 bits (379), Expect = 4e-35, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 104/312 (33%), Gaps = 70/312 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--------------LSALCVWPDQ 46 W+ GH + IA L+ A+ L + + + + WPD Sbjct: 20 WNAAGHRLVAVIAWQQLSPATRDAISAALAHHPDHERWVEKARSREGIAVFAEASTWPDD 79 Query: 47 VRHWYKYKW------------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCV 88 +R+ + H++D V+D + Sbjct: 80 IRNDPRLYDEDREPPTPAVPGLPETARHKRWHYVDLD-------------ATGKVRDGEL 126 Query: 89 AGAIQNFTTQLS-HYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDL- 146 I+ + L + + + AL +L H + DIHQP+HVG D GGN +++ Sbjct: 127 DRQIERLSQLLQAKGSSPGTRKSEQIAYALPWLLHLVADIHQPLHVGQHGDEGGNKVEIE 186 Query: 147 RWFRHK---SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECG 203 F + S+LH WD + N LE++ Sbjct: 187 NPFNKRLPFSSLHLYWDDLPGPPWLRG------NRLEKNAGRLLDS-----------YPK 229 Query: 204 NVFSCVNKFATESINIACKWGYKGVEAG--ETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 V V + ES + Y V +S+D+ ++ I +R+ + G RL LL Sbjct: 230 PVQGNVALWRDESHQLLAA-AYPKVSGSLLPIISEDFQDNARQIANRRIVEAGYRLGHLL 288 Query: 262 NNVFGASQQEDS 273 ++F ++ Sbjct: 289 ESIFRERVSRET 300 >UniRef50_B6KFB6 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KFB6_TOXGO Length = 439 Score = 150 bits (378), Expect = 6e-35, Method: Composition-based stats. Identities = 56/324 (17%), Positives = 107/324 (33%), Gaps = 66/324 (20%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDT 64 H L+ A A+K LL DL+ + W R KY T+ LHF+ Sbjct: 32 AHEAVSMTTLSGLSTSANQALKKLL---NGKDLADVAGWAH--RVSDKYPDTARLHFMSQ 86 Query: 65 PDKACNFDYERDC---HDQHGVKDMCVAGAIQNFTTQLSHYREG---------------- 105 P D VK C+ A+ F L + Sbjct: 87 PTCPSKPLRTDDIILDKSFCEVKGNCLLEALTYFFFHLVDPDQNKVEQTNPDVITTTNFV 146 Query: 106 -TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK----SNLHHVWD 160 D + +A+ ++ + +GD+HQP+H+G D G +++ + + L++ + Sbjct: 147 FPHDIKTTDADAVKYIINLVGDMHQPLHMGSADDDYGRRAVVQYSDGEQMRLTTLYNFLE 206 Query: 161 REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA 220 ++ K + N G + + + + +++A E+ + Sbjct: 207 AGLVDKTVKQRQYFWFSGWTHV---NSVKGAYDSEKSLFATNKEKM--FSEWAKENRAVL 261 Query: 221 CKWGYKGVEA------------GETLSDDYFNSRLP--------------------IVMK 248 C Y V G D+Y + L ++ K Sbjct: 262 CNEVYPHVRKTGKDARAAANALGSDAVDEYAKAVLDGSSDVPLFEIDAAAEFALFQVLKK 321 Query: 249 RVAQGGIRLAMLLNNVFGASQQED 272 R+ G R+A+++N + + +D Sbjct: 322 RILLAGARVAIVMNYILQVRESKD 345 >UniRef50_C7RIT3 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RIT3_9PROT Length = 320 Score = 144 bits (362), Expect = 3e-33, Method: Composition-based stats. Identities = 60/314 (19%), Positives = 96/314 (30%), Gaps = 73/314 (23%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD---------------LSALCVWPDQVRH 49 GH ++ IA ++ AV LL ++ + + WPD +R Sbjct: 33 GHRISAMIAWESMDAGTKSAVGQLLRQHPDYERWQARAHGGDPELTAFLEASTWPDDIRK 92 Query: 50 WYKYKWTS------------------PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGA 91 ++ T H++D P G AG Sbjct: 93 DRRFYTTGREEPTATLPGFPDMERRLHWHYVDRPVNP-------------GAGTGPAAGV 139 Query: 92 IQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS------DAGGNSID 145 I L+ AL +L H +GD HQP+H D GGN + Sbjct: 140 IDRQLAVLARIVGDRQATMAERAYALPWLIHLVGDAHQPLHAASRYGPDGQSDNGGNLVS 199 Query: 146 -LRWFRHK---SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE 201 + F + +LH WD +D + Sbjct: 200 IVNPFAARYTSMSLHRYWDDLPGPPWLRDGRLASAARSLAAL----------------HR 243 Query: 202 CGNVFSCVNKFATESINIACKWGY-KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAML 260 ++ ES +A + Y G +A T+S + L I +RVA+ G RLA L Sbjct: 244 PPTSPGTPEQWLDESWRLARERVYPPGDDAVPTISATFHEDALAIAGRRVAEAGYRLADL 303 Query: 261 LNNVFGASQQEDSV 274 L + + + + Sbjct: 304 LQRLLHSGPRREDR 317 >UniRef50_B8KWM0 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KWM0_9GAMM Length = 271 Score = 144 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 58/264 (21%), Positives = 93/264 (35%), Gaps = 43/264 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH C A + + LL N ALC WPD+++ T+P H Sbjct: 22 WWDLGHAAICDAALEYVKPGTRLEIDRLLATRDNRGFGALCSWPDEIK--TDQPTTAPWH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P D + + + +LS R EALL++ Sbjct: 80 YLNVPVGTT------DIATAPRPAEGDILAVLTEQQARLSQANTDIHAR----AEALLWV 129 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFR----------HKSNLHHVWDREIILTAAKD 170 +H +GD+HQP+HV + D GG+S L+ R ++ +H +WD + L A Sbjct: 130 AHLVGDLHQPLHVAYAEDRGGSSYRLQVPREIRALLGERYEETGMHQIWDGYLPLYARYS 189 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK--WGYKGV 228 + L+ E ++A ES+ I Y Sbjct: 190 GGSGLKQLVIEQ-------------------SAEAGGTPLEWAQESLTIMNNPGTAYLYG 230 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQ 252 L + Y I +KR+ Q Sbjct: 231 YRITILDEAYLAKNYRIALKRMKQ 254 >UniRef50_B6KF36 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KF36_TOXGO Length = 397 Score = 144 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 54/358 (15%), Positives = 92/358 (25%), Gaps = 96/358 (26%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQV-------- 47 W H++ IA+ ++ A V +L + + VW D + Sbjct: 25 WHSGPHMIVAAIARSEMSALAQIKVDYILGLWRGQYPDHATMERASVWLDDINGKGPPYE 84 Query: 48 ---RHWYKYKWTSPLHFIDTPDKA------------------------------------ 68 R + K +H ++ P Sbjct: 85 KPSRRFDFLKIFQFMHGVNIPYNPEGIQLQGLDALLPLYERSAEFLLDMAWDGLKATTPT 144 Query: 69 --------CNFDYERDCHDQHGVKDMCVAGAIQNF------------------TTQLSHY 102 C+ + V A NF ++Q+S Sbjct: 145 TEKLEDPFCSVPPPVSSFSLASYSEGTVNAANGNFLEVSHPDEYRRNTGVSARSSQVSTD 204 Query: 103 REGTSDRRYNMTEALLFLSHFMGDIHQPMH-------VGFTSDAGGNSID-LRWFRHKSN 154 E ++ L + H + DIHQP+H D G I + +N Sbjct: 205 AESPVGTVLSLNFYLRMVIHLVADIHQPLHSLLAFSPAFPHGDRFGTKISMVLPNGEDTN 264 Query: 155 LHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFAT 214 LH WD + + D + EE D L S + + A Sbjct: 265 LHAFWDGAGSVYTKRRGEFTDEEIAEEA--RRIKLEFPKDSLESHLKPELLAPNFRNMAE 322 Query: 215 ESINIACKWGYKG--------VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 ES + Y+ + + Y +++A G RL L + Sbjct: 323 ESHRLGAALAYREFNFRTFRPADLPYVPTHTYLADVRLACRRQIAIAGYRLGYALEEL 380 >UniRef50_A8PCL3 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8PCL3_COPC7 Length = 484 Score = 137 bits (345), Expect = 4e-31, Method: Composition-based stats. Identities = 50/227 (22%), Positives = 83/227 (36%), Gaps = 52/227 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN-----------GDLSALCVWPDQVRH 49 W GH + IAQ L+ + LL V+ LS++ W D + Sbjct: 27 WGAAGHEIVATIAQIHLHPSVLPTICALLDIDVDASDDTSSLRAKCHLSSIATWAD--KE 84 Query: 50 WYKYKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG 105 K +W++ +H++ D P + C F + G + + V A +N T L+ + Sbjct: 85 KMKIRWSAAMHYVGAVDDFPRERCEFPGPKGWA---GTRSINVLDATKNVTRILAEWGGV 141 Query: 106 TSDRRYNMT-------------------------------EALLFLSHFMGDIHQPMHVG 134 + ++ EA FL HF+GD+HQP+H+ Sbjct: 142 DENEFSLVSPVTSYVPPYGSRSQVPGKRVKQLPVPGPLQEEAFKFLVHFVGDMHQPLHLT 201 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEE 181 + GGN I + + +NLH WD I + L + Sbjct: 202 GRA-RGGNGIKIHFGTRTTNLHSAWDTMIPTKLIRTVPRNYTRPLPD 247 >UniRef50_B4CYG7 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYG7_9BACT Length = 346 Score = 137 bits (344), Expect = 5e-31, Method: Composition-based stats. Identities = 53/334 (15%), Positives = 95/334 (28%), Gaps = 76/334 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------------LSALCVWPDQV 47 W GH +A L A + ++ +L +PD + Sbjct: 22 WDTPGHEQIADMAYTRLTPAAKNKIREILQHGDPRYVPANNGDDTLRDAFRRASSFPDVI 81 Query: 48 RHW-------------------------------YKYKWTSPLHFIDTPDKACNFDYERD 76 R +Y H+ DTP Sbjct: 82 RDPGASTVFDDAYVDRMNLTFQPDVSPQQLAKPKSEYIRCKTWHYYDTPIH-------YS 134 Query: 77 CHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY-NMTEALLFLSHFMGDIHQPMHVGF 135 + + A T QL+ + + + L ++ H GD+HQP+H Sbjct: 135 TSHAPKIYESNALVAYNYATAQLAKLKNSAAGADLRDAAWWLCWIEHLTGDLHQPLHCTS 194 Query: 136 T------SDAGGNSIDL--RWFRHK-----SNLHHVWDREIILTAAKDYYAKDINLLEED 182 D GGN++++ W NLH WD I A A+ + Sbjct: 195 NYAHNHRGDIGGNAVNIIAPWDGASGALHAVNLHSYWDEGIDHAAGGHRSARQDLTPADA 254 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE---------AGET 233 + TD ++ + V + + +A Y+ G Sbjct: 255 M--EVTDAWLRNNQLKPGDSDAADLNVAHWIAQGAALADAHVYQETNAAGQTQEIIDGTN 312 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 ++ Y ++ + + + RLA +LN +F Sbjct: 313 VTPQYTTDQIDVCEHQAVRAAYRLAAVLNGIFQP 346 >UniRef50_A4BF01 Probable endonuclease n=1 Tax=Reinekea blandensis MED297 RepID=A4BF01_9GAMM Length = 262 Score = 136 bits (343), Expect = 6e-31, Method: Composition-based stats. Identities = 55/271 (20%), Positives = 96/271 (35%), Gaps = 28/271 (10%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALC--VWPDQVRHWYKYKWTSPLHFI 62 GH M ++ L D A ++ L E + ++ + V D R + K PL Sbjct: 9 GHTMVAQLMVPFLKDGARSELERLYGEDWSREIVSRAAMVQADLNR--PQNKSMIPLQLT 66 Query: 63 DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 F ++ C + + C GA+ L +D+R +A ++L H Sbjct: 67 LFEQGDETFQPDKHCPN-----NRCSVGAVLESREVLLRSSFSDADKR----QATIYLMH 117 Query: 123 FMGDIHQPMHVGFTSDAGGNSIDLRWFR-HKSNLHHVWDREIILTAAKDYYAKDINLLEE 181 + +H P++ G D GG I L+ NL +W+ ++ K ++ Sbjct: 118 YALQMHIPVNSGLKRDDGGRKIYLKDDDLQPVNLAWIWNHDLYRQMDKRWF--------- 168 Query: 182 DIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNS 241 I D +W E N +A E+ IA Y G S + Sbjct: 169 TYAQELYRDIEKVDPQAWVESMN----PADWALEAHEIAEAEVYPLAAEGR-YSAQLKRA 223 Query: 242 RLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 ++ +++ + R A L N +F D Sbjct: 224 GTAVLEEQLKKAAYRTASLFNEMFPPEDAPD 254 >UniRef50_D0Y4Z6 Phospholipase C/P1 nuclease domain-containing protein n=1 Tax=Caulobacter segnis ATCC 21756 RepID=D0Y4Z6_9CAUL Length = 307 Score = 136 bits (343), Expect = 6e-31, Method: Composition-based stats. Identities = 57/314 (18%), Positives = 93/314 (29%), Gaps = 77/314 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAA--------------HAVKMLLPEYVNG-DLSALCVWPD 45 W+ GH+M +A + +A VK + E + WPD Sbjct: 23 WNGRGHMMVAAVAWEEMTPKAKARAAALLRKNPNYGDWVKGVPVELADKVAFMNAATWPD 82 Query: 46 QVRHWYKYKWTSP-------------------LHFIDTPDKACNFDYERDCHDQHGVKDM 86 +R ++ P HF + + D + Sbjct: 83 DIRSTHQDDGYDPTVPQADDNVGYSDPYVHAYWHFTN-------IAFSIDATPVPPPPAV 135 Query: 87 CVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG-------FTSDA 139 I+ F+ L+ S + L++++H +GD+HQPMH D Sbjct: 136 NAIERIKLFSATLA-----PSGDDDVQSYDLVWVAHLVGDMHQPMHATSRYSQAKKRGDN 190 Query: 140 GGNSIDLRWFRHKS---NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDL 196 GGN + + LH WD + ++D + +D L Sbjct: 191 GGNGVFVCKTGQCDKGQKLHQFWDYGVG-------SSQDYASVIAA----------ADKL 233 Query: 197 ASWRECGNVFSCVNKFATESINIACKWGYKGV----EAGETLSDDYFNSRLPIVMKRVAQ 252 + + ES +A Y + L+ Y +VA Sbjct: 234 PKAPAAQRAIGDPDAWLQESYQLARTKAYVDPIGPAKGPYVLTTRYRVEAGQTCEAQVAL 293 Query: 253 GGIRLAMLLNNVFG 266 G RLA LLN G Sbjct: 294 AGARLADLLNARLG 307 >UniRef50_B3L390 S1/p1nuclease, putative n=8 Tax=Plasmodium RepID=B3L390_PLAKH Length = 417 Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats. Identities = 55/319 (17%), Positives = 107/319 (33%), Gaps = 61/319 (19%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 S EGH +A L E + +K LL D+ + W V K K +HF Sbjct: 34 SGEGHEAIGMVAMSGLKSEQLYELKKLL---SGKDIVDIGKWGHLV--HEKIKGAESMHF 88 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG---------------- 105 + + C C D++G+ C+ +I++F +L+ + Sbjct: 89 -NLQNHDCKR-AVFKCEDENGL---CLINSIKHFYVKLAGGKPTDHTTGQSTNQSTGQAT 143 Query: 106 -------------------TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDL 146 + + +AL +L + D+HQP+ + + D GG I + Sbjct: 144 EEHALNSAPPEAKDIPFKYPQNIAFTDADALKYLVSLIADMHQPLRIAYRYDNGGKDIKV 203 Query: 147 ----RWFRHKSNLHHVWDREIILTAAKDYYAKDINLL--------EEDIEGNFTDGIWSD 194 + ++NL + E+I K Y + E + + Sbjct: 204 IHHDDYKTVRTNLFDYMESELINKMIKRYQSAWYGGWTHINRLLDEHKKDEKLFSEKGIN 263 Query: 195 DLASWRECGNVFSCVNKFATE--SINIACKWGYKGVEAGETLSDDYFNSRL--PIVMKRV 250 + W E C + + + K + + + Y ++ + Sbjct: 264 AIDIWGEQIINEFCSEFYLNSYVTNFMVEKKDELHFDTSKEIEITYDLEFHLERLLKVNI 323 Query: 251 AQGGIRLAMLLNNVFGASQ 269 + G R+A+LLN++F + Sbjct: 324 LRAGSRIAILLNSLFANRK 342 >UniRef50_Q4UCH4 Bifunctional nuclease, putative n=2 Tax=Theileria RepID=Q4UCH4_THEAN Length = 391 Score = 134 bits (336), Expect = 4e-30, Method: Composition-based stats. Identities = 49/311 (15%), Positives = 98/311 (31%), Gaps = 61/311 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W++ A + +KMLL DL W D+V + + PLH Sbjct: 22 WNELCREAIESTAMSAITYMRLRRLKMLL---KGEDLVDYTWWADEV--LKRIPESLPLH 76 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG--------------- 105 + PDK N ++ C + ++C+ I+ F L + Sbjct: 77 YQYQPDKKSN-NFNFTCSN-----NLCLMAGIKYFFAVLMNSGYPVGTSNTQKFDIPPLG 130 Query: 106 -TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREII 164 +++ ++ + +L + D+H P+H+ FT +I + VW+ I Sbjct: 131 YPRKIKFSPSDCIKYLVVLLSDLHHPLHLDFTQPDSIATIPVDLSDFP-----VWEN-IS 184 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE---------------CGNVFSCV 209 + + L+ + + + SW C Sbjct: 185 VQTLNTKRPLYGDFLKHIYMPKYIEVNENAWYGSWTHVSTLGLRYSTELDLFNNKTVECF 244 Query: 210 NKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMK-----------RVAQGGIRLA 258 +A E+ ++ E LSD + + ++ G R+A Sbjct: 245 EVWAAETASLNNTIF--DKEDFVYLSDTVRTKAIRFTERLDSKLGFLMRLQIVMAGARVA 302 Query: 259 MLLNNVFGASQ 269 ++LN + + Sbjct: 303 IVLNYILSHRE 313 >UniRef50_A4YRX0 Putative uncharacterized protein n=2 Tax=Bradyrhizobium RepID=A4YRX0_BRASO Length = 312 Score = 131 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 44/296 (14%), Positives = 76/296 (25%), Gaps = 69/296 (23%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL---------------PEYVNGDLSALCVWPD 45 W EGH+ +A L+ LL + WPD Sbjct: 22 WWDEGHMQIAYLAYKKLSPTVRDRADALLKLNPDYASWIAGAPQGQEKLYAFVHAATWPD 81 Query: 46 QVRHWYKYKW------------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMC 87 ++ Y + H+ D D + Sbjct: 82 DIKMKPDYYDDQVGDSTAKQLVPYGHLKHTYWHYKD----------ALFSVDDTPLPRPD 131 Query: 88 VAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV---------GFTSD 138 A+ ++ + + +L + H +GD+HQP+H D Sbjct: 132 AVDAVSQLKLMIAKLPANSDATEPLRSYSLSWTIHLVGDLHQPLHAIARYSAALPDKGGD 191 Query: 139 AGGNSIDL-RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 GGN + NLH WD Y + + D G + Sbjct: 192 RGGNEEQVIAANGETQNLHAYWDG-----IFGGYSTVFGAMFDADQRGGLS-------TV 239 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGV----EAGETLSDDYFNSRLPIVMKR 249 + +A ES ++A Y + L+ +Y + K+ Sbjct: 240 TADPGKAQIVDPATWAQESFDLAKSVAYAAPIRTDKQPVELTREYETNARDTARKQ 295 >UniRef50_A2DKF6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DKF6_TRIVA Length = 323 Score = 126 bits (317), Expect = 6e-28, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 81/269 (30%), Gaps = 29/269 (10%) Query: 8 MTCRIAQGLLNDEAAHAVKMLLPEYVNGDL----SALCVWPDQV-RHWYKYKWTSPLHFI 62 +I + + + + DL + + W V R + +K + HF Sbjct: 19 TVSQIVLDKMGKAYTANLSSVFLAAGDTDLVSHPAKVGAWMSYVERPPFNFKGFNHWHFT 78 Query: 63 DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 P F D + I N +G++ R + + ++ L Sbjct: 79 RQPYVPKEFGQIPSQIDNDNL--------ISNVMEMSDDIYKGSTKRSWPLAFSMKILFA 130 Query: 123 FMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVWDREII--LTAAKDYYA 173 + DIH P+HV D G ++ + K+NL V++ Y Sbjct: 131 GVCDIHTPLHVSEYFSSEFPNGDQNGRLYEVVYKGQKTNLFDVYETGCGLDENLQVTYDE 190 Query: 174 KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGET 233 N +++ + D + S E + + Y V+ G Sbjct: 191 SFWNDVKDLADNLLEDFKFVSKKFSRTEITAQNAT-------TYQYTVDKIYSLVKPGGE 243 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLN 262 L+ + N + RL +LN Sbjct: 244 LTTEMINECQSHTRDMMRLAAERLVYILN 272 >UniRef50_C5KYE5 S1/P1nuclease, putative n=6 Tax=Perkinsus marinus ATCC 50983 RepID=C5KYE5_9ALVE Length = 357 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 48/275 (17%), Positives = 96/275 (34%), Gaps = 19/275 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K+ H +L+ + LL +S + + Y T H Sbjct: 21 WDKDIHERIGEAVSRVLSYRDIEDLNKLLKGQSIPYMSR---YAHDKLQYANYDRTVENH 77 Query: 61 FIDTPDK-ACNFD-YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 + C FD D + + + T + + + Sbjct: 78 YETQLRDWQCTFDVNNPDKYAESQGLYRSIHDIFGRVTHASKSGEDHGIAKDMTEPVQIS 137 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRW-FRHKSNLHHVWDREIILTAAKDYYAKDIN 177 +L + D+HQP+H GF +D G I +++ +NL+ W+R+I +AA + Sbjct: 138 WLLGLVQDLHQPLHTGFGADDHGRRISVQYHDDPSTNLYDFWERDIS-SAANLETQLVLK 196 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK--------GVE 229 +++ DG + L + + ++ ES+ ++C Y V Sbjct: 197 AYNAELDKLVQDGGYGIQLVNKIYSKG----IAEWIAESMEMSCSDIYSVIAGGRGREVP 252 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + DD + + K+V + R A++L+ + Sbjct: 253 RMYQIDDDVYAKWRDLATKQVVKAAARSAVVLHGI 287 >UniRef50_A5FFX0 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FFX0_FLAJ1 Length = 332 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 78/270 (28%), Gaps = 35/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + A L + + ++ PD ++ YK P H Sbjct: 25 WGNVGHERINKAAVMALPKQLQ-----IFFYNHIDFITQEASVPDIRKYALNYKEEGPRH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKD-------MCVAGAIQNFTTQLSHYREGTSDRRYNM 113 + D + Y + + D + I++ +L+ + + Sbjct: 80 YFDMENFGAADTYPQTLEEAKQKYDAKFLSDNGILPWYIEDMMAKLTKAFKEKNRAEILF 139 Query: 114 TEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYA 173 A L H++GD H P+H D + +H +W+ + K+Y Sbjct: 140 LAAD--LGHYVGDAHMPLHTSANHDG--------QLTDQKGIHSLWESRLPELFVKNYK- 188 Query: 174 KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESIN------IACKWGYKG 227 +N+ E + IW + + T + A K Sbjct: 189 --LNVPEAQYYTDVHKAIWDMINDTHSFAQPLLDIDKSLRTATPQDKVFKLDAEGKVLKS 246 Query: 228 VEAGETLSDDYFNSRLP----IVMKRVAQG 253 SD+Y +V ++ + Sbjct: 247 KYNTAVFSDEYAKKLHEQLNGMVETQMRKA 276 >UniRef50_Q6LI73 Hypothetical endonuclease n=2 Tax=Photobacterium profundum RepID=Q6LI73_PHOPR Length = 305 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 70/311 (22%), Positives = 109/311 (35%), Gaps = 77/311 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDL---------SALCVWP 44 W+ +GHV +IA L+ A V +L +PE + + + L + P Sbjct: 29 WNYQGHVTVAQIAYQNLDTTARTQVDVLAAKAYQSMPEDIQQKMDSFEGASQFAKLAMVP 88 Query: 45 DQVRHWY-------------------KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKD 85 D +R K T H+I+ + C D Sbjct: 89 DLIRKIPAEDIWAQMGETIPASLNQWDEKETGAWHYINQ-----AYPATSQC-------D 136 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS-------- 137 I+ + L + +++F+SH GD HQPMH S Sbjct: 137 FIHVPNIKLVASYLFDDFKQNPQ-----AASMMFMSHVAGDSHQPMHSISQSLSKNVCVT 191 Query: 138 DAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 D G N L + +LHH+WD + L +IN D++ + Sbjct: 192 DLGANKHTLDV--PQKDLHHLWDSGMGLLG----TEHNINDFATDLQLAY---------P 236 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 S + VN + TES +A +GY V S+ Y+N +V +R+ Q G RL Sbjct: 237 STTMTLGKTADVNLWVTESYQLA-DFGYS-VAIDAKPSESYYNKGTELVKQRLTQAGYRL 294 Query: 258 AMLLNNVFGAS 268 A LN+ Sbjct: 295 ADELNSALAKK 305 >UniRef50_A2DRT9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DRT9_TRIVA Length = 300 Score = 123 bits (309), Expect = 6e-27, Method: Composition-based stats. Identities = 45/271 (16%), Positives = 77/271 (28%), Gaps = 28/271 (10%) Query: 7 VMTCRIAQGLLNDEAAHAVKMLLPE--YVNGDLSALCVWPDQV-RHWYKYKWTSPLHFID 63 + + + + +S W R + + HF Sbjct: 3 AIAGEVGLEQFGFSLQKKLNSVFQNAGDDFTRVSQAAAWLYYAERPPFNIPSFNHWHFYS 62 Query: 64 TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHF 123 P N E D +KD NF + R G R + + Sbjct: 63 QPINPNNLSIE-THIDVDNLKD--------NFDSIRKSVRGGKVSRTWPFAFLMKLYLTG 113 Query: 124 MGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 M DI+ P+HV D G +++ + +L+ +W+ Y+ + Sbjct: 114 MCDIYSPLHVSELFNEQFPNGDRNGRDFYVKYNGNFISLYDLWETGCG------YFDSQV 167 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSD 236 + ED LA E V + + N Y G+ G +S Sbjct: 168 DFTSEDDWKKIDKLTNELSLAFTSEDWPSTLSVTQVIEGNYNYTRDTVYNGLVNGSEVSK 227 Query: 237 DYFNSRLPIVMKRVAQGGIRLA---MLLNNV 264 +Y + V G R+A LN + Sbjct: 228 EYITTCQNYAQDIVILAGKRIATDLANLNII 258 >UniRef50_Q11TZ7 Putative uncharacterized protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11TZ7_CYTH3 Length = 318 Score = 117 bits (292), Expect = 5e-25, Method: Composition-based stats. Identities = 39/267 (14%), Positives = 81/267 (30%), Gaps = 35/267 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H ++A L K ++ V PD+ R+ + +P H Sbjct: 24 WGFFAHKEINKMAVFTLPHPLMSFYKR-----HIDFITEQAVNPDKRRYIVSGE--APKH 76 Query: 61 FIDTPDKACNFDYERD--------CHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 ++D + + R + + + T +L+ + + Sbjct: 77 YMDIEYYSDSILIVRPDWNTAQAIYPEDSLHAHGILPWNLVRLTYRLTDAFKHRDAKSIL 136 Query: 113 MTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYY 172 A L H++GD+H P+H + + +H +W+ + + DY Sbjct: 137 KLSAD--LGHYVGDLHVPLHTTKNYNG--------QLTGQQGIHGLWESRLPELFSADY- 185 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA-- 230 + L + + +W S R C V + + + Y+ Sbjct: 186 --NYYLGTANYVTDIKKVVWESMTES-RACVAQVLAVELKLQQQMKADKIFSYEDRNGQT 242 Query: 231 ----GETLSDDYFNSRLPIVMKRVAQG 253 S+ Y + +V KR+ Sbjct: 243 VRVYSYDFSNAYHKALEDMVQKRMRAA 269 >UniRef50_A4KXI8 Putative S1/P1 nuclease n=2 Tax=Ascovirus RepID=A4KXI8_HVAVE Length = 277 Score = 117 bits (292), Expect = 5e-25, Method: Composition-based stats. Identities = 49/274 (17%), Positives = 86/274 (31%), Gaps = 46/274 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W++ GH + +A+ + + + + L + PD + + LH Sbjct: 33 WAQNGHRVCAAVARAHIAP---ALLNHIESNLLKATLDEVSNDPDNIDVERR-----HLH 84 Query: 61 ---FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 ++DTP D C+ A Sbjct: 85 WVNYVDTPSDGAQNVSSYLTSDCQIDNRECIVSA-------------------------- 118 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAKDYYAKDI 176 H++ D+HQP+HV + A + + WF + LH VWD E+ Y + Sbjct: 119 ---VHYICDLHQPLHVIPATYANQSFARVLWFHGFNYTLHQVWD-ELPEQLHLSYESHAK 174 Query: 177 NLLEEDIEGNFTDGIWSD-DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETL- 234 L+ I + + W + + + E + E G + Sbjct: 175 WLVRHHISPEMYVAMVKQTTVDKWIDSRVAAYEIARKLNE--KLVKCHTENNSERGRYIC 232 Query: 235 SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + + S P V +A GG+RLA L F Sbjct: 233 NLKFVFSARPTVDSSLASGGVRLAGYLKQSFKNK 266 >UniRef50_C5PTL3 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PTL3_9SPHI Length = 315 Score = 116 bits (291), Expect = 6e-25, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 77/275 (28%), Gaps = 36/275 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H + R A L E + + +++ V D + Y SP H Sbjct: 20 WGFYAHKLINRNAVFTLPTEL-----AVFYKQNIDEITEKAVDAD--KRCYIDSAESPRH 72 Query: 61 FIDTPDKACN---------FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID N + + ++ + + V I +L + Sbjct: 73 FIDLDAYDTNTLDTLPVHWYRAKEKIEEKRLLSNGIVPWQIYITYQKLVKAFIARDKIKI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + + +H W+ + A Y Sbjct: 133 IRHSAD--LGHYVADAHVPLHTTKNYNG--------QYTDQIGIHAFWESRLPEMFATHY 182 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 + + W+ S V + + + K Y Sbjct: 183 KLTAG---KAQFITDPAALGWAIVYESAPLADTVLRIEKELSVR-FPASQKKTYLTRNNV 238 Query: 232 ETLSD------DYFNSRLPIVMKRVAQGGIRLAML 260 L+ Y + +V R+ Q R+ L Sbjct: 239 LVLTYSDAYAKAYHEALNGMVEVRMRQAIHRIGSL 273 >UniRef50_A7ARD9 S1/P1 nuclease, putative n=1 Tax=Babesia bovis RepID=A7ARD9_BABBO Length = 393 Score = 116 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 90/304 (29%), Gaps = 52/304 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W A + + +K++L + DL W D+VR + ++ LH Sbjct: 23 WDDITREAIESTAMSAITFDRLRRMKVILRGH---DLVDYTWWSDEVRK--RIPESATLH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG--------------- 105 D+ C ++ C + +C+ + F +L Sbjct: 78 RQLQNDETC-LTFDSTCPN-----GLCLIQGSKFFFAKLMSSGYSIVSQPIKFELPLFRY 131 Query: 106 TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIIL 165 D + ++ L +L + D+H P +V + W+ + Sbjct: 132 PKDVTFTPSDCLKYLVVLLSDMHYPFNVDLAEPHSLAHRKVDLSGFPM-----WE-ALSK 185 Query: 166 TAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE---------------CGNVFSCVN 210 + + + ++ SW N + Sbjct: 186 EKLGHAKPSFEDFIMKVYMPHYIQTNEESWYGSWTNVEVLGSRYKVEQETFNRNTWDNFE 245 Query: 211 KFATESINIACK-----WGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 +A+E+ N+ C + + LSD + + ++ G R+A++LN + Sbjct: 246 IWASETANLHCNGLVTKSDFSKDKQTIKLSDALLDRIGNTIKFQIVLAGARVAVVLNYIL 305 Query: 266 GASQ 269 + Sbjct: 306 SHRE 309 >UniRef50_C5BI21 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BI21_TERTT Length = 343 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 83/311 (26%), Gaps = 75/311 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAV--------------KMLLPEY--VNGDLSALCVWP 44 WS GH + A L+ A LP+ L WP Sbjct: 64 WSYSGHAVILGSALSQLDPTARKEAFTQIEYLYNRASGNSRFLPKSCLSQKSLCFFASWP 123 Query: 45 DQVRHWYKYKWT-------------------SPLHFIDTPDKACNFDYERDCHDQHGVKD 85 D+ R + + HF + + + C + + Sbjct: 124 DRERDKTLGELYRMVGAEVPAVLKGLTSSEIASWHFTNQVFNLNDRKFSAACELRDRGQL 183 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV------GFTSDA 139 V ++ L +H + D HQP+H G D Sbjct: 184 YDVLPQLE--------SALIRELSIAQRAVTLALWTHLLADAHQPLHNLTGSLEGCAHDF 235 Query: 140 GGNSIDL--RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 GGN + + R + + +LH +WD L D + Sbjct: 236 GGNGLCVVKRRNKCERSLHQLWDSGAGLFDKPDMISPLGVADAR---------------- 279 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 ES+ +A + +E S+ Y + + R Q R+ Sbjct: 280 -----SPTAVDYRVIQNESLALASEVYAPNLELS---SNAYITTVRRLSRIRAQQAAQRI 331 Query: 258 AMLLNNVFGAS 268 A+LL + G Sbjct: 332 ALLLKELTGNK 342 >UniRef50_C6VWZ8 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VWZ8_DYAFD Length = 341 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 46/266 (17%), Positives = 80/266 (30%), Gaps = 36/266 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L E + + L+ V PD+ R+ + + H Sbjct: 42 WGFWAHKRINRLAVFRLPMEMQ-----VFYKKHIDYLTENAVNPDKRRYAVVGE--AERH 94 Query: 61 FIDTPDKACNFDYERDCH---------DQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID + H + K V +Q +QL+ + R Sbjct: 95 FIDLDVYGDSALAVLPKHWQAAVNKVGEDSLRKHGIVPWHVQIAASQLTSAFREKNAARI 154 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + +H W+ + A+ Y Sbjct: 155 LRMSAD--LGHYIADAHVPLHTTRNYNG--------QLTGQDGIHGFWESRLPEIYAEQY 204 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA- 230 + IW AS +V + K TE+ K+ ++ Sbjct: 205 DMWLGP---AAYREDIAHDIWQAVEASHSGSDSVLA-FEKQLTEAFKPDKKYAFELRNNI 260 Query: 231 -----GETLSDDYFNSRLPIVMKRVA 251 S+ Y + V +R+ Sbjct: 261 LTRMHSRDFSEKYHRALAGQVERRMR 286 >UniRef50_A2FG69 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FG69_TRIVA Length = 339 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 81/246 (32%), Gaps = 27/246 (10%) Query: 35 GDLSALCVWPDQV-RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQ 93 +LS L W + V R + K + HF P + +Y + + + D+ A + Sbjct: 34 KNLSKLSTWMNYVERPPFNLKCFNHWHFSREPFTLESRNYIPQYNGKDNLVDVLKESATK 93 Query: 94 NFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDL 146 F + ++ L L + DIH MH D G + Sbjct: 94 IFFLI--------PSSPFILSTHLKVLFAGVPDIHATMHTQEFFSNDFPDGDRNGQVFYV 145 Query: 147 RWFRHKSNLHHVWDREI-ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNV 205 + ++L V + + + +++D ++ + +S Sbjct: 146 MYNGTNTSLFDVLESGCGLDSQKHATFSRDFWEDVRKLKVELFKSWETPTFSS------T 199 Query: 206 FSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 S V E+ Y + G+T+SD++ +++ + A +L ++ Sbjct: 200 DSVVEAAKIENREYTKATIYSKLRPGDTISDEFITECQTRTKQQILKS----AEILYHIT 255 Query: 266 GASQQE 271 +E Sbjct: 256 ENKMKE 261 >UniRef50_Q8XRE8 Putative signal peptide protein n=1 Tax=Ralstonia solanacearum RepID=Q8XRE8_RALSO Length = 337 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. Identities = 60/320 (18%), Positives = 107/320 (33%), Gaps = 66/320 (20%) Query: 2 SKEGHVMTCRIAQGLLN-DEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK-------- 52 +GH +A L+ A V+ +L L VW D + + Sbjct: 27 GPDGHQTVGELADSLIAGTNAESQVQNILGM----TLEQASVWADCAKGVTRTQSGKFVY 82 Query: 53 -----YKWTSP---------------------------------LHFIDTPDKACNFDYE 74 Y P H+ D + + Sbjct: 83 QGAGHYPECKPFETTTGKSAMVAFVKRNWSGCHPAADEEVCHKQYHYTDVALQRGQYQQ- 141 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 G D + AI+ +L + + EALL LSH++GDIHQP+HV Sbjct: 142 ----GLVGTSDHDIVAAIRAAIIKLQGGTTPSPIDFASKREALLLLSHYVGDIHQPLHVS 197 Query: 135 FTS-DAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWS 193 DA G+ +D + I+ K ++ D + G+ + Sbjct: 198 AVYLDAQGHVVDPDQGTFDPQTKTIGGNSILDAGKKLHFEWDQVPAALKPDQLGVSGV-A 256 Query: 194 DDLASWRECGNVFSCVNKFATESINIACK----WGYKGVEAGE----TLSDDYFNSRLPI 245 + A G++ S ++AT++++ A + +A + TL +Y + R + Sbjct: 257 EARAIPLTSGDIISWPAQWATDTMHSAAPAFSGTAFSAEDASKHWQVTLPANYVSERETV 316 Query: 246 VMKRVAQGGIRLAMLLNNVF 265 ++ + G RLA LL ++ Sbjct: 317 QRAQLIKAGARLAQLLQAIW 336 >UniRef50_C7PNU1 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNU1_CHIPD Length = 313 Score = 111 bits (276), Expect = 4e-23, Method: Composition-based stats. Identities = 36/268 (13%), Positives = 71/268 (26%), Gaps = 35/268 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L E + + LS D+ R+ P H Sbjct: 20 WGFFAHQRINRLAVFSLPPEML-----VFYKPNIEYLSTHATDADKRRYI--IPEEGPRH 72 Query: 61 FIDTPDKACNF---------DYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 +ID + + + ++L+ + R Sbjct: 73 YIDIDHYGQAPFAALPRSWEEALLKYTADTLQTYGILPWYLTQMLSRLTQAFKDKDPDRI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A + H+ GD H P+H + + +H +W+ I A Sbjct: 133 MRLSAD--IGHYAGDAHVPLHACSNHNG--------QRTGQQGIHGLWESRIPELMADKT 182 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA- 230 + + + W L S V K ++ K+ Y+ Sbjct: 183 FQ--YLSAKAYYIKDINAYTWQIVLESAAAADTVLQQ-EKLVSDRFPSGRKFAYEKRNGK 239 Query: 231 -----GETLSDDYFNSRLPIVMKRVAQG 253 + Y + ++ +R++ Sbjct: 240 LIRNYATAYAKAYHGALGDMIERRMSAA 267 >UniRef50_A6E734 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6E734_9SPHI Length = 271 Score = 110 bits (274), Expect = 6e-23, Method: Composition-based stats. Identities = 38/261 (14%), Positives = 78/261 (29%), Gaps = 35/261 (13%) Query: 7 VMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPD 66 + +A L + + L V PD+ R+ + H++D Sbjct: 1 MRINELAVFTLPEGMYT-----FYKQNRRYLRDHAVDPDKRRYADT--SEAARHYLDVEH 53 Query: 67 KA-CNFDYERDCHDQHGVKD-------MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 C R D + IQ +L + + + A Sbjct: 54 YEVCIDSIPRKYPDAVKKYGLKKMNQSGILPWQIQQSYYKLVRAFQQRDSAKILIYSA-- 111 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 +L H++ D P+H D + +H W+ + ++DY + L Sbjct: 112 YLGHYLSDAQVPLHTTANHDG--------QLSGQQGIHAFWESRLPELFSEDY---NFLL 160 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGET----- 233 + + + W + +V ++ S I K+GY + Sbjct: 161 GKAQYISDPLEEAWKMVSKTHLLVDSVLQ-LDSVLNSSFPIYRKYGYSKRKNKVVKQHTE 219 Query: 234 -LSDDYFNSRLPIVMKRVAQG 253 S Y +S +V +++ + Sbjct: 220 GYSRLYHDSMKHMVERQMREA 240 >UniRef50_D2QFB3 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QFB3_9SPHI Length = 354 Score = 110 bits (274), Expect = 6e-23, Method: Composition-based stats. Identities = 48/272 (17%), Positives = 83/272 (30%), Gaps = 48/272 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L + K LS V PD+ R+ + +P H Sbjct: 52 WGFFAHQQINRLAVFTLPVDMIPFFKK-----HINFLSDNAVNPDKRRYAVVGE--APRH 104 Query: 61 FIDTPDKACNF--DYERDCHDQHGVKDMC-------VAGAIQNFTTQLSHYREGTSDRRY 111 FID R + V IQ QL+ + + RR Sbjct: 105 FIDLDAYPDTTSATLPRYYKEATDRYGEDSLALHGLVPWQIQLTKYQLTEAFKQRNVRRI 164 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D + P+H + ++ +H W+ + + +Y Sbjct: 165 LRVAAD--LGHYIADANVPLHTTRNYNG--------QLTNQQGIHGFWESRLPELFSANY 214 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC------VNKFATESINIACKWGY 225 D + I+S A+WR N + + + TE + K+G+ Sbjct: 215 ----------DFLTGQAEYIYSPQKAAWRAVFNANAALDSVLHIERQLTEQVGETRKYGF 264 Query: 226 KGVEA------GETLSDDYFNSRLPIVMKRVA 251 + S Y V +++ Sbjct: 265 EERNGITAKVYSADFSQQYHERLHGQVERQMR 296 >UniRef50_B0DTT4 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0DTT4_LACBS Length = 242 Score = 110 bits (274), Expect = 6e-23, Method: Composition-based stats. Identities = 48/245 (19%), Positives = 80/245 (32%), Gaps = 67/245 (27%) Query: 92 IQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRH 151 ++N T L + EAL FL HF GD HQPMH+ + GGN + + + Sbjct: 1 MKNVTALLQGW-VKGETSDDAANEALKFLIHFFGDAHQPMHMT-GRERGGNQVKVAFGGK 58 Query: 152 KSNLHHVWDREIILTAAKDYYAK-----DINLLEEDIEGNFTD------------GIWSD 194 ++ WD +I +E+ + G D W+D Sbjct: 59 QTT----WDDSLITKVISTIPQNYTLPLPYPEIEQALRGASYDPYIRRIIWEGILQKWAD 114 Query: 195 DLASWRECGNVFS---------------------------CVNKFATESINIACKWG--- 224 ++ W C + C +A S ++ C Sbjct: 115 EIPGWLSCPDAVKRTFVDSQIALGLEGTTGIEILPDNDVLCPYHWARPSHDLLCDGVWLK 174 Query: 225 ------YKGVEAGETL------SDDY--FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 Y+ + + Y + +V K++A GG+RLA L N +F Q Sbjct: 175 EVDEPPYRRTDDNPHPPLLELETPAYSGMIGQRWLVEKQLALGGLRLAGLFNYIFADQGQ 234 Query: 271 EDSVV 275 + + Sbjct: 235 RGAFI 239 >UniRef50_D2LJW8 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LJW8_RHOVA Length = 200 Score = 109 bits (273), Expect = 8e-23, Method: Composition-based stats. Identities = 47/174 (27%), Positives = 64/174 (36%), Gaps = 29/174 (16%) Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 L L+HFMGDIHQPMHV F D GGN I +S LH WD +I Sbjct: 25 LKTLTHFMGDIHQPMHVSFEDDKGGNLISASGLCGRS-LHAAWDSCLIEKTLG------- 76 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA---------------- 220 + I + I S D + W V +A E+ I Sbjct: 77 -FDSDTIATSLEAEITSGDRSRWLAGDIGPKAVASWANETFTITTRPEVGYCERASDGCR 135 Query: 221 ----CKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 + G + + + Y + P V R+ G+RL +LN+V Q Sbjct: 136 YSAYQPEYHGGAQKVVVVDEHYLSVNAPFVRDRIKAAGVRLGAVLNSVLMPDQS 189 >UniRef50_C6Y3Y4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y3Y4_PEDHD Length = 285 Score = 109 bits (273), Expect = 9e-23, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 78/274 (28%), Gaps = 35/274 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H+ R+A L + LS V PD+ R+ + H Sbjct: 20 WGFYAHIRINRLAVFTLPAGLNR-----FYKANISYLSDHAVDPDKRRYADT--AEAARH 72 Query: 61 FIDTPDKACNFD-YERDCHDQHGV-------KDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 ++D + D R + ++ + IQ +L H + Sbjct: 73 YLDVELYEAHIDSIPRKWEEAVKRYGLVRLNQNGILPWQIQKSYYKLVHALRDRDSLKIL 132 Query: 113 MTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYY 172 + A +L H++ D H P+H + ++ +H W+ + AK Y Sbjct: 133 IYSA--YLGHYLADAHVPLHTTQNHNG--------QLSNQLGIHAFWESRLPELFAKKY- 181 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA-- 230 + + + N W + + V K+ + Sbjct: 182 --NYVVGQAIYIENPLKEAWKIITHTHKMVDTVL-TFEARLNARFPAHRKYSFSERNNQV 238 Query: 231 GETLSDDYFNSRLP----IVMKRVAQGGIRLAML 260 G S Y + +V +++ + Sbjct: 239 GRQYSLAYSKAFHDGMNHMVERQMRAAIHSIGSY 272 >UniRef50_A2F5A5 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2F5A5_TRIVA Length = 343 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 28/257 (10%), Positives = 71/257 (27%), Gaps = 35/257 (13%) Query: 15 GLLNDEAAHAVKMLLPEYVNGDLSA---LCVWPDQVRHWYKY-KWTSPLHFIDTPDKACN 70 L ++ ++ ++ + W + H + Sbjct: 26 RKLGNKGISKLQKVIDM-TGEKMERPSLAGSWLASLLHAPSNTNCFDHWRYSQKNIN-AI 83 Query: 71 FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQP 130 E C ++ ++ C + +GT + + D P Sbjct: 84 PHPEHHCINKDDLE--CTLDKLN------KTIMKGTLNGPWPYNFGFKVFLTLYMDSFDP 135 Query: 131 MHVGFT--------SDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEED 182 +HV D G ++++ +LH W+ K + + E+ Sbjct: 136 VHVTEYFDNDTFIDGDDNGKKFNIKFKGKNMSLHDFWETGCGRYVLKTPFNGNGWKEIEE 195 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFA---TESINIACKWGY--KGVEAGETLSDD 237 + + C + +A +S N++ + Y ++ L ++ Sbjct: 196 TTTRLYKRLNDSKF--------ITPCPSDYAGAINQSFNLSKEIVYNLSMIQKDNDLPEE 247 Query: 238 YFNSRLPIVMKRVAQGG 254 Y + + +R+ Q Sbjct: 248 YIKTCYELTDQRILQAA 264 >UniRef50_C9YFD1 Putative uncharacterized protein n=2 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YFD1_9BURK Length = 117 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 43/112 (38%), Positives = 54/112 (48%), Gaps = 10/112 (8%) Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 ++ P CN+ ERDC D CV AI L T AL ++ Sbjct: 1 MNFPRGDCNYQQERDCPD-----GKCVIAAIDRQIEVLR-----TPGDDEKRLTALKYVV 50 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYA 173 HF+GDIHQP+H GF D GGNS L+ F SNLH VWD +I + +D Sbjct: 51 HFIGDIHQPLHAGFGDDRGGNSYQLQAFMRGSNLHAVWDTGLIKSLKQDNEQ 102 >UniRef50_B3EUC7 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=B3EUC7_AMOA5 Length = 317 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 37/261 (14%), Positives = 81/261 (31%), Gaps = 32/261 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R A L K L ++ V PD+ R+ + + + H Sbjct: 22 WGFAAHKHINRCAVFTLPPAMFTFYKYYLG-----YITENAVNPDKRRYVLEGE--ASRH 74 Query: 61 FIDTPDKACN--FDYERDCHDQHGVKD-------MCVAGAIQNFTTQLSHYREGTSDRRY 111 +ID N +D V IQ+ +L++ + Sbjct: 75 YIDLDYYGDNALDKLPKDWAQATHKYSQDTLLAHGIVPWHIQHMQHRLTNAFRNKDIAQI 134 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 + + H++ D + P+H + + +H +W+ + ++Y Sbjct: 135 LKLSSD--IGHYIADANVPLHTTQNYNG--------QLTGQDGIHGLWETRLPELFKEEY 184 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 N + W + + N+ + +++ N K+ Y+ A Sbjct: 185 NFFLGN---ATYVKDPQQRAWKAIIQAHATVPNLLKLEKE-LSQNFNTLHKFSYEKRGAS 240 Query: 232 --ETLSDDYFNSRLPIVMKRV 250 + S+ Y + ++ +V Sbjct: 241 LKKVYSEAYARAYHDLLQGQV 261 >UniRef50_A2EIL3 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EIL3_TRIVA Length = 310 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 71/225 (31%), Gaps = 28/225 (12%) Query: 40 LCVWPDQVRHWYKY-KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQ 98 W +V + K + F+ TP + +Y R+ D + + G I N Sbjct: 52 AGGWLARVEYAPTNTKCFNHWRFVQTPINGSD-NYHRNKDDLTVQLNGLLGGLINNTI-- 108 Query: 99 LSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF--------TSDAGGNSIDLRWFR 150 ++ A S + P+H D G +++ Sbjct: 109 ---------TDKWAYNFAFKVASALFFEAFSPLHTSELFDNDRFKDGDDSGKKYMIKYQG 159 Query: 151 HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVN 210 ++ +L WD + E +F + L R NV Sbjct: 160 NEMSLLDFWDSGCGRYTRQT-------PYTETQWTDFYKNVDYMLLKFPRPSCNVNITWQ 212 Query: 211 KFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGI 255 +++N+ Y+G++ + LS +Y + + I +R+A Sbjct: 213 MAVNDTLNVTNTVVYQGIKYSQELSKEYIDKCIEITDERLACAAY 257 >UniRef50_Q0E526 29.6 kDa S1/P1 nuclease n=1 Tax=Spodoptera frugiperda ascovirus 1a RepID=Q0E526_SFAVA Length = 261 Score = 104 bits (260), Expect = 3e-21, Method: Composition-based stats. Identities = 40/275 (14%), Positives = 92/275 (33%), Gaps = 49/275 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ GH + +A+ L+ V+ + L + D+ + + +H Sbjct: 24 WALTGHRVCANVARRLIPSPILKHVET--EVLDHETLDGVSNVADE-----TPRSLAAMH 76 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ ++ L + + + + Sbjct: 77 YVNYNVTPT-----------------------RSARKVLEYTENNMTSTYRWDAAFITNV 113 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRW-FRHKSNLHHVWDR--EIILTAAKDYYAKDIN 177 H + D+HQP+HV +D + +W + LH +WD ++ L + Y +N Sbjct: 114 VHLLCDLHQPLHVVPYADVPSTFTETQWVNGQNTTLHTIWDTLPDLRLLSHHIYAEWLVN 173 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESIN-IACKWGYKGVEAGETL-- 234 L+ + + D W ++A ++ + AG L Sbjct: 174 KLKANTYALLFEQ---DRPHKWL-------DSRRYAYDAAKRLNDNLARCHTNAGSKLLI 223 Query: 235 ---SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 + + +S +V + + GG+RLA + +++ Sbjct: 224 NSCNYRFVDSARALVDESLLYGGVRLAAYITSLYS 258 >UniRef50_C0A652 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A652_9BACT Length = 348 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 51/306 (16%), Positives = 90/306 (29%), Gaps = 56/306 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W EGH + ++A L E V+ ++ L PD+ R+ + Sbjct: 23 WDYEGHRIVNQLALAALPPEFPAFVRE---AANAERIAFLSGEPDRWRNVEDGPLRHAQT 79 Query: 58 PLHFIDTPD---------------------------KACNFDYERDCHDQHGVKD--MCV 88 P HF D + + D+ +D + Sbjct: 80 PDHFFDIEYLVEGGLPLAKLSEFRQVFAVQLAEARAARPSAYPKSGSKDKDRTRDLVGFL 139 Query: 89 AGAIQNFTTQLSHYRE------------GTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT 136 AI ++ ++ R N+ + L H++GD QP+H Sbjct: 140 PWAITENYGRVKSAFTYLKAYEALGTPEEVANARANVVYQMGLLGHYVGDGAQPLHTTKH 199 Query: 137 SDAG----GNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 + G++ + R F + LH D I A + D +G Sbjct: 200 FNGWAGEAGSAANPRGFTTRRTLHSWIDGGYIAAARITVADLLPRAFKADPLTLSGEGRG 259 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 +D VF + Y+ +AGE + + +R+ + Sbjct: 260 GNDARR----DPVFEAALAYLVRQHEQVIPL-YELEKAGELNAPPATRKGRAFIEQRLQE 314 Query: 253 GGIRLA 258 GG LA Sbjct: 315 GGRMLA 320 >UniRef50_C2FVU8 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FVU8_9SPHI Length = 238 Score = 97.2 bits (240), Expect = 5e-19, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 57/215 (26%), Gaps = 29/215 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H + R A L E + + ++ V D + Y SP H Sbjct: 20 WGFYAHKLINRNAVFTLPTEL-----AVFYKQNIDQITEKAVDAD--KRCYIDSAESPRH 72 Query: 61 FIDTPDKACNFD---------YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID N + + + + V I +L + Sbjct: 73 FIDLDAYDTNTLDTLPVHWSRAKEKIEQKRLLSNGIVPWQIYITYQKLVKAFIARDKTKI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + + +H W+ + A Y Sbjct: 133 IRHSAD--LGHYVADAHVPLHTTKNYNG--------QYTDQIGIHAFWESRLPEMFAPQY 182 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF 206 + + W+ S V Sbjct: 183 KLTTG---KAQFITDPAALGWAIVYESAPLADTVL 214 >UniRef50_A3HWS6 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HWS6_9SPHI Length = 280 Score = 92.9 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 35/262 (13%), Positives = 80/262 (30%), Gaps = 36/262 (13%) Query: 12 IAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACN- 70 +A L E + ++ V PD+ R+ + + H+ID + N Sbjct: 1 MAIYSLPPELIA-----FYKPHIQFITEKAVNPDRRRYAVIGE--AEKHYIDLDEYGENP 53 Query: 71 --------FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 ++ ++ K+ + L+ E +++ A L H Sbjct: 54 LDILPIYWYEAVEKFSEEELRKNGIGPWSAYLTFLNLTEAFESKNEKAILRLSAD--LGH 111 Query: 123 FMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEED 182 ++ D++ P+H + + +H W+ I + A + + Sbjct: 112 YLADLNVPLHTTKNYNG--------QLTGQEGIHGFWESRIPESQANRFELWVG---TAE 160 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA------GETLSD 236 IW + +V K T + K+ Y+ + E + Sbjct: 161 YISQPQQAIWDAVAQAHAMVDSVL-TFEKELTSNFPQDQKYSYEQRNSLTVRVYSEEFTQ 219 Query: 237 DYFNSRLPIVMKRVAQGGIRLA 258 Y + V +++ + +A Sbjct: 220 QYAEALDHQVDRQMRKSIKMIA 241 >UniRef50_Q21JG1 Putative uncharacterized protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21JG1_SACD2 Length = 321 Score = 92.2 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 73/247 (29%), Gaps = 60/247 (24%) Query: 43 WPDQVRH-------------------WYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGV 83 WPD VR YK TS H+ + N C+ ++ Sbjct: 100 WPDLVRSQKLSVLFKAVGATTPADLAAYKNYTTSTWHYHNV-FYDSNNKLLLSCNKKNRG 158 Query: 84 KDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT------S 137 K A++ + A F H +GD HQP+H Sbjct: 159 KLYSALSALE--------SSLQSDLSISQQAIAFAFYVHLVGDAHQPLHNVSRANKHCEH 210 Query: 138 DAGGNSIDLRWFRHKSNL--HHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDD 195 D GGN+ L+ K +L H WD L A + DI ++ Sbjct: 211 DRGGNTYCLKKKGAKCSLNAHQFWD----LAAFNPVESIDIQPVKHK------------- 253 Query: 196 LASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGI 255 CG + + E+ + K + + Y ++ I R+ Sbjct: 254 ----AACGTSPAWGSYLLAEAKELVVNLYPKNDDFN---NAKYRSNAKSIAKSRIEMAAS 306 Query: 256 RLAMLLN 262 R A ++ Sbjct: 307 RTAQIMK 313 >UniRef50_B1ZQR9 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B1ZQR9_OPITP Length = 349 Score = 91.4 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 36/308 (11%), Positives = 70/308 (22%), Gaps = 63/308 (20%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYK--WTSP 58 W GH + + A L + V+ ++ L PD+ R+ + Sbjct: 26 WDYTGHRIVNQAALASLPADFPEFVRA---PAAAERIAFLAGEPDRWRNVPDLPIKHANG 82 Query: 59 L-HFIDTPD----------------------------KACNFDYERDCHDQHGVKD--MC 87 L H+ D F + ++ Sbjct: 83 LDHYCDLEHLAGAGVDPRTVSSLRFEFALTFAAGRAAHPEKFPPIDPAKNADRSREWAGF 142 Query: 88 VAGAIQNFTTQLSHYRE-------------GTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 A + +L + R N+ + + H +GD+ QP+H Sbjct: 143 APWAAAEYYGKLKSAFSYLKAYQEHGGTPVEIENARANILYLMGVMGHVVGDLAQPLHTT 202 Query: 135 --FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 G N + + +H D +I + Sbjct: 203 MHHHGWVGEN---PHGYSTWTGIHAWLDGGLIAQTGVTAGEVCAQVRPAHAL-------- 251 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 VF V +A + + +++ Sbjct: 252 -SVQPRADGRDPVFVQVMDYALAQNARVEPLYQLEKAGKLAPEAADLSEARTFICEQLQV 310 Query: 253 GGIRLAML 260 GG L + Sbjct: 311 GGEMLGSI 318 >UniRef50_A2FZN6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FZN6_TRIVA Length = 232 Score = 90.6 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 54/166 (32%), Gaps = 16/166 (9%) Query: 100 SHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT-------SDAGGNSIDLRWFRHK 152 T + + A + P ++ D G ++ + K Sbjct: 7 KSLFPQTIQGAWPINVAWKSYFGLFLEAFNPTNIANYYSNNHTEGDNNGKDFEIFYKGRK 66 Query: 153 SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKF 212 +N+H W K + ++ + ++ D+ + +N Sbjct: 67 TNIHDFWGSLCGRLTGKYPFNSNVWSDIDK---------YAHDITLVYRNVTHYQNINDI 117 Query: 213 ATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLA 258 T+S NIA Y GV GE LSD+Y + K++A LA Sbjct: 118 LTQSYNIAKDVVYVGVNEGEILSDEYVEKCYDVTSKQLASAAFSLA 163 >UniRef50_D0NJT6 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NJT6_PHYIN Length = 269 Score = 90.6 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 83/293 (28%), Gaps = 83/293 (28%) Query: 14 QGLLNDEAAHAVKMLLPEY-----VNGDLSALCVWPDQVRHW-----------YKYKWTS 57 + +L++ ++ +L + G+++ VW D V+ S Sbjct: 11 RNVLDEADVTTIESILSRWDEDFPNTGEITTTAVWMDIVKCTAESSTCLTPASPSITSIS 70 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+ P +E D A + + Sbjct: 71 DWHYINLPLHINGDKWEDKDTDLTLRSTQSRVSARPSLS--------------------- 109 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAK--- 174 D GGNS SN H VWD L + + Sbjct: 110 --------------------DGGGNSETFTSPCVFSNPHAVWDAAGGLYSLNKWSLNIDS 149 Query: 175 -------------DINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC 221 + ++++I + + ++L + V + A E+ N A Sbjct: 150 FRPTLENASELIALLPSVQDNITFSQYVNVTYNELNTALVTNQVL---REVALETYNFAN 206 Query: 222 KWGYKGVEAGET-------LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y ++ T S Y I KR+A G RLA++L + Sbjct: 207 TIVYSNLDLNATSSGTYPCPSASYLAMVGEISQKRIAIAGSRLAVVLKHFAAQ 259 >UniRef50_B9EZB3 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=B9EZB3_ORYSJ Length = 170 Score = 88.3 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 48/122 (39%), Positives = 69/122 (56%), Gaps = 8/122 (6%) Query: 73 YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH 132 RDCH+ + MCV GAI N+T QL Y G S YN+TE+L+FL+HF+GD+HQP+H Sbjct: 28 PRRDCHNSRHQQGMCVVGAINNYTDQL--YSYGDSKSSYNLTESLMFLAHFVGDVHQPLH 85 Query: 133 VGFTSDAGGNSIDLRWFRHKSNLH-----HVWDREIILTAAKDYYAKDINLLEEDIEGNF 187 VGF D GGN+I + + +S +H D E +T DY+ ++E+ + Sbjct: 86 VGFEEDEGGNTIKVHCYAIES-IHLSCNYAYKDVEQDITLGDDYFYSRYPIVEKRLAQAG 144 Query: 188 TD 189 Sbjct: 145 IR 146 Score = 85.6 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 36/76 (47%), Positives = 51/76 (67%) Query: 200 RECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAM 259 + G V+ +A ESI+++C + YK VE TL DDYF SR PIV KR+AQ GIRLA+ Sbjct: 90 EDEGGNTIKVHCYAIESIHLSCNYAYKDVEQDITLGDDYFYSRYPIVEKRLAQAGIRLAL 149 Query: 260 LLNNVFGASQQEDSVV 275 +LN +FG + + +V+ Sbjct: 150 ILNRIFGEDKPDGNVI 165 >UniRef50_A2G9R8 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G9R8_TRIVA Length = 181 Score = 86.0 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 46/137 (33%), Gaps = 8/137 (5%) Query: 135 FTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 D GGN I+ + +++H WD ++ A T I Sbjct: 2 PNGDRGGNLYHINCPYGAACNHIHFFWDAIVLNYMLMKPTASLYRNEFIKNVTRLTKEIT 61 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 L + ++ ES+ A K+GY + + Y+ RVA Sbjct: 62 ESSLNL-----DKTVDPMAWSMESLEYAKKYGYS-TPINDAPNASYYEIVRKYGSIRVAM 115 Query: 253 GGIRLAMLLNNVFGASQ 269 G RL LL+++ + Sbjct: 116 AGHRLGYLLDSLLDKAP 132 >UniRef50_C7J139 Os04g0636400 protein n=2 Tax=Oryza sativa RepID=C7J139_ORYSJ Length = 141 Score = 83.3 bits (204), Expect = 8e-15, Method: Composition-based stats. Identities = 51/64 (79%), Positives = 56/64 (87%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WSKEGH++TCRIAQ LL AAHAV+ LL E +GDLSALCVWPDQVRHWYKY+WTSPLH Sbjct: 30 WSKEGHMLTCRIAQDLLEPAAAHAVRNLLTEEADGDLSALCVWPDQVRHWYKYRWTSPLH 89 Query: 61 FIDT 64 FIDT Sbjct: 90 FIDT 93 >UniRef50_B6KMV3 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KMV3_TOXGO Length = 632 Score = 82.5 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 35/127 (27%), Gaps = 18/127 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQVRHW----- 50 W EGH++ +A+ L E ++ +L E+ L VW D V Sbjct: 27 WHDEGHMLVAAVAKEYLKPETVEKIEYILSEWSPQYPTTSTLETAAVWLDHVACSMPGRY 86 Query: 51 -------YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 + P H+ N E Q + + L + Sbjct: 87 CRGFLGLDDIRIFKPWHYTSNVFNPQNLTLEPLYEVQPYPQTGSS-WILLKSYESLRNCT 145 Query: 104 EGTSDRR 110 + + Sbjct: 146 GDSRASQ 152 Score = 60.6 bits (145), Expect = 5e-08, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 53/172 (30%), Gaps = 51/172 (29%) Query: 123 FMGDIHQPMHVG-------FTSDAGGNSIDLRWFR------------------------- 150 GD HQP+H D GGN+I + R Sbjct: 276 IYGDAHQPLHATETYSKAFPNGDFGGNNISIVLPRSEKMLENYPSTPEEFPEVGAEAHRG 335 Query: 151 ----HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF 206 H+ +LH WD + +Y D++ L+++ + ++ D F Sbjct: 336 SGVPHRQSLHSQWDGAFGQYNSL-FYEVDLDELKKEAQRLV--RLYPVD----EHAKRTF 388 Query: 207 SCVNKFATESINIACKWGYKGVE--------AGETLSDDYFNSRLPIVMKRV 250 + + + ES +A + E S +Y + K++ Sbjct: 389 ADFHGISIESSMLARSHVFSEFEWSTFSASSLPYHPSVEYIEKSKKVCEKQI 440 >UniRef50_C1F7J9 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7J9_ACIC5 Length = 319 Score = 80.6 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 41/287 (14%), Positives = 83/287 (28%), Gaps = 60/287 (20%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W K+GH M +A L ++ +++ L PD+ R + + + Sbjct: 29 WGKDGHKMINHLAVTSLPPSIPAFLR---SPAAVDEITYLGPEPDRWRSPAEPELDAMQA 85 Query: 58 PLHFIDT-------PDKACNFDY------------------ERDCHDQHGVKDMCVAGAI 92 P H+ID P + Y + V + Sbjct: 86 PDHYIDMELADRIAPLPRERYQYIAKLYAYIEAHPDQAREMQPTHIGFQPYISEEVWERL 145 Query: 93 QNF---TTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG--FTSDAGGNSIDLR 147 ++ QL + T + + +L H++ D QP+H + G N Sbjct: 146 KSAMRDYRQLKAAGKDTMPVQQAIIFYAGWLGHYVADGSQPLHTTIEYNGWVGPN---PN 202 Query: 148 WFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS 207 + ++H ++ E + + E + I + W + Sbjct: 203 HYTTSHHIHSQFESEFVHDNMTN--------AEVRQYMKPVEPIGDEWTQYWDYLNTTHA 254 Query: 208 CVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 V+ E + + G++G +R+A G Sbjct: 255 DVD----EVYQLWNEHGFEGKGT---------AESRKFTAERLAAGA 288 >UniRef50_D1ZW87 Whole genome shotgun sequence assembly, contig_886 (Fragment) n=2 Tax=cellular organisms RepID=D1ZW87_SORMA Length = 159 Score = 77.1 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 40/123 (32%), Gaps = 15/123 (12%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRH-WYK 52 + GH IA+ + E A+ +L P + VW D V+ + Sbjct: 42 WEYGHQSVATIARLNVRSETRAAIDRILRHQALLETPTCPARTIEEASVWADCVKPLGER 101 Query: 53 YKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 + + H+ + FD + C D CV+ I+ L + ++ Sbjct: 102 FSYAYSWHYQNVDVCRP-FDLKAACKD-----GNCVSAQIERDVKLLKDPKVPMREKVLA 155 Query: 113 MTE 115 + Sbjct: 156 LAF 158 >UniRef50_Q4Q7F8 Class i nuclease-like protein n=4 Tax=Leishmania major RepID=Q4Q7F8_LEIMA Length = 180 Score = 76.4 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 32/79 (40%) Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 + ++ E V ES A Y GV G TLSD Y + R+ GG Sbjct: 88 ETYTFPEALRTLVDVVAIHEESHMFAVNTSYPGVTPGATLSDAYLARCKRVAEARLTLGG 147 Query: 255 IRLAMLLNNVFGASQQEDS 273 RL LLN + + +++ Sbjct: 148 YRLGYLLNELLPSIPVDEA 166 >UniRef50_B9TFK5 Putative uncharacterized protein (Fragment) n=1 Tax=Ricinus communis RepID=B9TFK5_RICCO Length = 228 Score = 74.8 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 73/235 (31%), Gaps = 64/235 (27%) Query: 47 VRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 V + S H+ D P + +++ G D + ++ L T Sbjct: 2 VAYTTANPKHSEYHYTDVPFQLAHYEDH-----GVGTTDHDIVQTLKQCIAVLQGKGNAT 56 Query: 107 SD-RRYNMTEALLFLSHFMGDIHQPMHVGFTS----------------------DAGGNS 143 ++ + +ALL L+H GDI QP+HVG GGN+ Sbjct: 57 TNPHNFTPRQALLMLTHLTGDIAQPLHVGEGYVGKNGGFVVPTQKQLDDKEAFATQGGNN 116 Query: 144 I---DLRWFRHKSNL------------------------HHVWDREIILTAAKDYYAKDI 176 + D++ S L H WD ++ A + A+ Sbjct: 117 LQLDDIKLTAKSSELIPAAAPDDSKPAAPARTPQATRAFHSYWDTTVVNYAFRRIGARTP 176 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 + + + G+ + +A +++ +A K Y V G Sbjct: 177 EQFA--------QMVSAGNPVVAPNSGDPVTWPYAWADQTLVVA-KLAYADVVPG 222 >UniRef50_C2G3H0 Possible S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G3H0_9SPHI Length = 100 Score = 72.1 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 32/70 (45%), Gaps = 3/70 (4%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + IA+ L ++A + L+ + L+ WPD V+ + + TSP H Sbjct: 23 WGMTGHRVVTEIAERHLTNKAKKNIAKLIGK---QHLAYWANWPDFVKSDHAFDETSPFH 79 Query: 61 FIDTPDKACN 70 +I+T Sbjct: 80 YINTEGNLTK 89 >UniRef50_A2G9R9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G9R9_TRIVA Length = 115 Score = 70.6 bits (171), Expect = 6e-11, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 32/108 (29%), Gaps = 7/108 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY--VNGDLSALCVWPDQVRHWYKYKWTSP 58 W H+ IA G L+ + + + L+ + W D ++ YK+ Sbjct: 12 WWAHAHMAITEIALGHLSSKKINKLYELINRDGLPFQSVVDSSAWQDDLKDTYKFHAIGD 71 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 HF D P + V + + L+ + Sbjct: 72 WHFSDNPIY-----MNKTIPAIIPNPSYNVTSFLYDALDTLNDPTTTS 114 >UniRef50_C5SFS5 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SFS5_9CAUL Length = 339 Score = 66.7 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 60/247 (24%), Gaps = 43/247 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPL- 59 W GH + A L ++ GD+ PD + K Sbjct: 24 WGPTGHRIVGEEAARALPAYMPEFLR---SAQGVGDIGFYSNEPDAWKGAGKVHDFERDS 80 Query: 60 -HFIDTPDKACNFDYERDCHDQHGVKDMCVA----------------GAIQNFTTQLSH- 101 HFID D R D I + + Sbjct: 81 AHFIDLDDDGKTLAGVRLQEVPQSRSDFDALLRSKNVMPWKSGYLNYALIDAWQQVVKDF 140 Query: 102 ----------YREGTSDRRYNMTEALL-----------FLSHFMGDIHQPMHVGFTSDAG 140 E R+ + EA+ LSH++GD QP+H+ + Sbjct: 141 AYWRGMTYLEAHESDPKRKAWLKEAIRRREALTLRDIGILSHYVGDSSQPLHLSIHYNGW 200 Query: 141 GNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWR 200 G +H + + + + L E + +W+ Sbjct: 201 GKEYPNPQTFTLEPIHGPLESAFVSANINNEDVRAAMLASEPCTLAVERCFDAKLERNWK 260 Query: 201 ECGNVFS 207 ++ Sbjct: 261 YVTPLYE 267 >UniRef50_UPI00016C48C1 hypothetical protein GobsU_04989 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C48C1 Length = 288 Score = 64.0 bits (154), Expect = 5e-09, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 43/158 (27%), Gaps = 24/158 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH---WYKYKWTS 57 W GH A L D L+ PD+ ++ + + Sbjct: 26 WWSGGHETVAAAAAARLPDGVPE-----FFRNGGKHLAHFSGDPDRWKNREMTFLRRAEE 80 Query: 58 PLHFIDTPDKACNFDYERDCHD----------QHGVKDMCVAGAIQNFTTQLSHYREGTS 107 HF+D D +D + K + AI + +L+ Sbjct: 81 GNHFLDLEDLDGKKYPATHRYDGLKMVYGELKKEPNKVGTLPYAIVEYYEKLTVGFYDHR 140 Query: 108 DRRYNMTEALLFLS------HFMGDIHQPMHVGFTSDA 139 + + + L H+ GD P+H D Sbjct: 141 KAPKDTSVPMKCLVYGGTLAHYTGDAAMPLHTTRDFDG 178 >UniRef50_C9YFD0 Putative uncharacterized protein n=2 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YFD0_9BURK Length = 79 Score = 64.0 bits (154), Expect = 5e-09, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 23/52 (44%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK 52 W +GH + +A+ L+ A V LL + L+++ W D+ R Sbjct: 26 WGSDGHKIVAMLAEAQLSPAARKEVDRLLAQEPGATLASISTWADEHRSPAT 77 >UniRef50_B4WCT7 Putative uncharacterized protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4WCT7_9CAUL Length = 338 Score = 64.0 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 51/204 (25%), Gaps = 45/204 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHW--YKYKWTSP 58 W GH + A L + +K D+ L PD+ + + Sbjct: 23 WGNTGHRLIGIAAMRALPADMPGFLKT---PGAIADVGELAREPDRWKGAGQPHDRERDT 79 Query: 59 LHFIDT-----------------PDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQL-- 99 HFID P +D + A+ + L Sbjct: 80 AHFIDLDDAGHVFDRRGMPLAELPRLKSEYDAALTKAGLDVDDAGYLPYAMIDAWQNLGR 139 Query: 100 -----------SHYREGTSDRRYNMTEALL----------FLSHFMGDIHQPMHVGFTSD 138 + + + L + H++GD QP H + Sbjct: 140 DFAYWRVLNAAERRETNMERQAWYRADRLRREALILRDIGVMGHYVGDGSQPHHTTIHYN 199 Query: 139 AGGNSIDLRWFRHKSNLHHVWDRE 162 G + F + H +++ Sbjct: 200 GWGEFPNPEGFTNSRQTHALFEGA 223 >UniRef50_B5YKD8 Putative uncharacterized protein n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YKD8_THEYD Length = 262 Score = 63.7 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 38/127 (29%), Gaps = 16/127 (12%) Query: 27 MLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFID-------TPDKACNFDYERDCHD 79 + + + PD +R Y +P H+ D TP+ F + Sbjct: 32 AYIAKKAGIRIPEAACMPDIIR-DENYDLLAPFHYHDASPDTVVTPEYIDKFGIKEAFLL 90 Query: 80 QH--------GVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPM 131 + I ++ D L+ ++H++GD+ QP+ Sbjct: 91 VDGKNFRISVPHPAGVLYWKIVQIYEKMKSLDRTKPDNVLAYEYYLVSIAHYIGDLSQPL 150 Query: 132 HVGFTSD 138 H D Sbjct: 151 HNFPYGD 157 >UniRef50_Q028C4 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q028C4_SOLUE Length = 352 Score = 62.9 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 38/311 (12%), Positives = 78/311 (25%), Gaps = 82/311 (26%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYK---WTS 57 W GH + A + + ++ L + G L + PD R + Sbjct: 22 WGVRGHTVANLAALEGITQDGPAFLR--LQKAYIGHLGTI---PDTWRSPSEPYLRISED 76 Query: 58 PLH--------FIDTPDKA----------------------------------------- 68 H FI P + Sbjct: 77 ANHGWYTEGFDFIPNPPHSRTEFTLRVYDEYLKNKSKDPERAKLLNIRYTGLQAYSIIEG 136 Query: 69 -----CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHF 123 R+ + + + L+ + ++ + ++ H+ Sbjct: 137 YERMKAGMRLYRNVSGPEEANRVNIGSIYAAISPTLADRAQVQQMLANDIAFYMGWVGHY 196 Query: 124 MGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDI 183 + D QP+H D + D + + N+H ++ + + +D++ Sbjct: 197 VADAAQPLHNSIHHDGW-SGADPKGYTRDPNIHGRFESQYLDLIGVT--EEDVDKYMRK- 252 Query: 184 EGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRL 243 E D +W L E + Y+ G + Sbjct: 253 EPRLLDNVWKAVLDHSLEARGFT---------------EEVYRLDLRGA-FTKKDDAEAR 296 Query: 244 PIVMKRVAQGG 254 +V KR+A G Sbjct: 297 ELVCKRLAAGA 307 >UniRef50_C5GNE5 Predicted protein n=1 Tax=Ajellomyces dermatitidis ER-3 RepID=C5GNE5_AJEDR Length = 380 Score = 60.2 bits (144), Expect = 8e-08, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 35/114 (30%), Gaps = 9/114 (7%) Query: 8 MTCRIAQGLLNDEAAHA-------VKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 + IA LL+ A +K ++ +G + W D+ + K + H Sbjct: 86 VIPLIA--LLSPSAQAWGTKTNRIIKHIVEPQYDGSIGRAAAWADECGRTDEGKDSPTWH 143 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMT 114 +I+ D ++ + + L++ + N T Sbjct: 144 YINPADNPPAYETFTTTGTALSRDALSKPLQMPQSRLSLAYMPSNLENSNMNRT 197 >UniRef50_B1MDJ0 Putative uncharacterized protein n=1 Tax=Mycobacterium abscessus ATCC 19977 RepID=B1MDJ0_MYCA9 Length = 728 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 41/254 (16%), Positives = 74/254 (29%), Gaps = 79/254 (31%) Query: 1 WSKEGHVMTCR---------------------------------IAQGLLNDEAAHAVKM 27 W + GH IAQ L EA Sbjct: 376 WGQTGHYSIATFTLDAIRSPNLKTLMQANLDAISFSLSELDPKSIAQRL--KEARSNPDG 433 Query: 28 LLPEYVNGDLSALCVW---PDQV-----RHWYKYKWTSPL---HFIDTPDKACNFDYERD 76 ++P DL VW P++V H Y+ P H+ D + + RD Sbjct: 434 IIPLADVPDL----VWKNLPNKVVGGRDDHMVGYRSQGPEHPCHYADIDEPGPDGSIVRD 489 Query: 77 ----------------CHDQHGVKDMC----VAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 +D+ G + + + F + + + ++ Sbjct: 490 LCLQDIANLTVTKWQQFYDERGHRTPDKRGLLPFRVWQFYDAMVGFAKSRQVDQFVCAAG 549 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 L L+H++GD QP+H + +D + +H ++ ++I A+ A Sbjct: 550 L--LAHYVGDASQPLHGSYLADG-------YPDGTGAGVHSCYESKMIDRYARQLVAAIP 600 Query: 177 NLLEEDIEGNFTDG 190 L + D Sbjct: 601 ADLATLGDLELIDD 614 >UniRef50_D0XMV2 Putative uncharacterized protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XMV2_9CAUL Length = 348 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 59/211 (27%), Gaps = 45/211 (21%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHW--YKYKWTSP 58 W GH A L DE ++ ++ L PD+ + + Sbjct: 28 WGSTGHRTIGVAAVRALPDELPAFLRT---PGAAAEIGELSREPDRTKGAGQPHDRERDT 84 Query: 59 LHFIDTPDK-----------------ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLS- 100 HF+D D +D + + AI + QL+ Sbjct: 85 AHFVDLDDDGHVMNASGPTLSQLPELKSQYDAQLAAAGIAVNDAGYLPYAIMDGFQQLAR 144 Query: 101 --------HYREGTSDRRYNMTEA--------------LLFLSHFMGDIHQPMHVGFTSD 138 + E + +LSH++GD QP H+ + Sbjct: 145 DFATWRVLNAAEAREADPAKRAWYREDRLRREALILRDMGYLSHYVGDGSQPHHMSIHYN 204 Query: 139 AGGNSIDLRWFRHKSNLHHVWDREIILTAAK 169 G+ + F + + H ++ I + Sbjct: 205 GWGDYPNPEGFTNARSTHGAFEGAFIRRNLR 235 >UniRef50_B0T3S4 Putative uncharacterized protein n=5 Tax=Caulobacteraceae RepID=B0T3S4_CAUSK Length = 348 Score = 58.3 bits (139), Expect = 2e-07, Method: Composition-based stats. Identities = 37/299 (12%), Positives = 78/299 (26%), Gaps = 66/299 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPL- 59 W GH M +A + ++ + PD+ + K T Sbjct: 32 WGAWGHRMVGVVAAESFPSDIPAFLRT---PEAVAAIGEYAREPDRWKGSGKIHDTDRDA 88 Query: 60 -HFIDTPDKACNFDYERDCHDQHGVKDMC-----------------VAGAIQNFTTQLSH 101 HF+D D+ + + + + +I + QL Sbjct: 89 AHFLDVDDEGKMYGGPKFSVETLPPTRADYEKALAAVGHDSWNAGYLPYSIIDGYQQLVK 148 Query: 102 YRE---------GTSDRRYNMTEA--------------LLFLSHFMGDIHQPMHVGFTSD 138 T + L +H++GD QP+H+ + Sbjct: 149 DFTYWRILQTVTKTEKDKIRKAYYVADLKRREELLVRDLGVWAHYVGDASQPLHLSVHYN 208 Query: 139 AGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLAS 198 G+ + + H ++ ++ A + + A Sbjct: 209 GWGDYPNPNGYTQSKATHGNFEGPLVKAVAVN------------------ADVEKLVPAY 250 Query: 199 WRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 ++ + + T ++ E G +D V +R+A G +L Sbjct: 251 KDCGCSIETRTVSYLTTTVGFVEPLYKLEKEGGLVATDP---RAKAFVDERLAAGATQL 306 >UniRef50_Q6MQM4 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MQM4_BDEBA Length = 356 Score = 58.3 bits (139), Expect = 3e-07, Method: Composition-based stats. Identities = 42/315 (13%), Positives = 84/315 (26%), Gaps = 60/315 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPD-QVRH--WYKYKWTS 57 W GH CR+A L+ + ++ + LC PD + K + Sbjct: 19 WGGRGHDTICRVATFLVKEPGLKEYM----QHKPQMMGHLCNMPDFYWKSLGGDAAKLGN 74 Query: 58 PLHFIDTPD---------------------KACNFDYERDCHDQHGVKDMCVAGAIQNFT 96 HFID K F + + F Sbjct: 75 STHFIDIEVIGLDVKDITVDYKQLMTDFTGKPNKFKNDGSTIKSIPQEFGSSWWRADQFM 134 Query: 97 TQL-------------------SHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS 137 + + Y+M ++ + HF+GD QP H Sbjct: 135 RHIAGLKEDFAKAKAPTSFKEEQDNELPYNKLAYDMVVSMGLMGHFVGDNCQPFHTTADY 194 Query: 138 DAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 D + +H ++ +++ D + F + + Sbjct: 195 DG--------YAAGHGGIHAYFEDQVVGQFDGDLDYLVLKAARGMKNPEFLKPKTAIEKM 246 Query: 198 SWRECGNVFSCVNKFATESI----NIACKWGYKGVEAGETLSDDY-FNSRLPIVMKRVAQ 252 + + + + + G + A E F P+++ +A+ Sbjct: 247 KVLSVISNKEIPKILKMDPVIKKSTLVKEKGMELKTAAERQPASVAFKKMKPMIVTEMAR 306 Query: 253 GGIRLAMLLNNVFGA 267 G + LA L + + + Sbjct: 307 GAVLLAALWDEAYAS 321 >UniRef50_C5JC63 Predicted protein n=1 Tax=Ajellomyces dermatitidis SLH14081 RepID=C5JC63_AJEDS Length = 303 Score = 55.6 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 34/97 (35%), Gaps = 11/97 (11%) Query: 8 MTCRIAQGLLNDEAAHA-------VKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 + IA LL+ A +K ++ +G + W D+ + K + H Sbjct: 86 VIPLIA--LLSPSAQAWGTKTNRIIKHIVEPQYDGSIGRAAAWADECGRTDEGKDSPTWH 143 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTT 97 +I+ P R + V + C G++ + Sbjct: 144 YIN-PADNAGTKNGR-VLNGLPVVNGCAEGSVADVED 178 >UniRef50_B0RM73 Exported putative nuclease n=3 Tax=Xanthomonas campestris pv. campestris RepID=B0RM73_XANCB Length = 342 Score = 48.3 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 50/206 (24%), Gaps = 41/206 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYK---WTS 57 W K H A L D+ +K ++ V PD R + Sbjct: 24 WGKRAHAAIDTAAIQALPDDGPVFLKR-----HVQVIADGAVLPDGWRSESEPFLKIEED 78 Query: 58 PLH--------FIDTP-----------DKACNFDYERDCHDQHG-------------VKD 85 P H F+ P RD + Sbjct: 79 PNHGWFREQFAFMQNPPRSRYAFVLALYDEQRRLALRDPAAAERMNVRWAGTLPYAATEG 138 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID 145 A L E T + + + +H++GD QP H D + Sbjct: 139 YERIVATMRQIRALRAAGEDTRELERTCAFLVSWFAHYIGDGAQPQHDSIHHDGWQ-GAN 197 Query: 146 LRWFRHKSNLHHVWDREIILTAAKDY 171 + +H ++ + + A Sbjct: 198 PHGYSIDPKVHGKFESDYVDKIALTP 223 >UniRef50_C8X622 Putative uncharacterized protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8X622_NAKMY Length = 765 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 44/333 (13%), Positives = 86/333 (25%), Gaps = 72/333 (21%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAV-----KMLLPEYVNGD------------------- 36 W K GH +A + + + Sbjct: 414 WGKTGHYTLATVACAQVVTPTLRTLMAANQDRISFPAAGLSPGDIDQATKDAKQHGGFVP 473 Query: 37 LSALC--VWPDQVRHWYKYKWTSPL-------HF--IDTPDKACNFDYERDC-------- 77 L+ + +W + + TSP H+ ID P A + C Sbjct: 474 LADVADVIWKNLAGQVRGGRDTSPRTGPEHPTHYADIDEPRPADHLTLRALCMQDPANVA 533 Query: 78 -----------HDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGD 126 +Q + + F + RY L ++H++GD Sbjct: 534 VGVWQAFYDALGEQASRDRGLLPFRVWQFYDAMLDALAQDDLVRYLAAAGL--MAHYVGD 591 Query: 127 IHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKD---INLLEEDI 183 QP+H +D +H ++ +I A D A + L Sbjct: 592 ACQPLHGSTLADG-------LPDGTGKGVHSAYESAMIDHHAADILAALLGRLQDLAAHP 644 Query: 184 EGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRL 243 G + + ++ + + ++ ++ Sbjct: 645 LPPVASGQQAAVATVALMDRTATAIPPV------DLVNAYAATPGGQSKAVTGKLWDRFG 698 Query: 244 PIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVA 276 P + +A G LAML ++ + Q + A Sbjct: 699 PATVSVLADGARTLAMLWDSAWTQGQGDTRFTA 731 >UniRef50_B5YDN6 Putative uncharacterized protein n=1 Tax=Dictyoglomus thermophilum H-6-12 RepID=B5YDN6_DICT6 Length = 250 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 37/266 (13%), Positives = 75/266 (28%), Gaps = 75/266 (28%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WS + H + A L E + L E + G ++ PD++ Sbjct: 29 WSAKTHQKIAKEALYSLPKEYQRKLSPYLDELLEGSVA-----PDRI------------- 70 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY-NMTEALLF 119 + D + + ++ + + L + + + L Sbjct: 71 YKDFNNHVFHVHGDKGKGPEEVREKY------------LEIISLIQEGKSWRLVAFQLGV 118 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSH++ D++ P+H D Y KD +++ Sbjct: 119 LSHYIADLNNPLHT--------------------------DSSKREDEFHSKYEKDADVI 152 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 I+ + +SI A ++ YK +E LS D F Sbjct: 153 NPKIKSELI----------------YIKYPASYILDSIFSANRF-YKDIEKAY-LSGDKF 194 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVF 265 I +++ + + A + Sbjct: 195 RDVSKITQEQIDKAALDTASYFYSAL 220 >UniRef50_P59026 Phospholipase C n=6 Tax=Clostridium RepID=PHLC_CLOHA Length = 399 Score = 42.9 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 66/233 (28%), Gaps = 23/233 (9%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHA----VKMLLP--EYVNGDLSALCVWPDQVRHWYKYKW 55 H + A +L ++ VK E L +PD + K Sbjct: 34 GTGTHALIVTQAVEILKNDVISTSPLSVKENFKILESNLKKLQRGSTYPD---YDPKAYA 90 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH--YREGTSDRRYNM 113 HF D PD NF + + +G+ + ++ + + + Sbjct: 91 LYQDHFWD-PDTDNNFTKDSKWYLAYGI-NETGESQLRKLFALAKDEWKKGNYEQATWLL 148 Query: 114 TEALLFLSHFMGDIHQPMH---VGFTSDAGGNSIDLRWFRHKSN--LHHVWDREIILTAA 168 + L H+ GD H P H V AG + K + LH + Sbjct: 149 GQGL----HYFGDFHTPYHPSNVTAVDSAGHTKFETYVEGKKDSYKLHTAGANSVKEFYP 204 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC 221 +++ + + + A + ATE+++ Sbjct: 205 TTLQNTNLDNWITEYSRGWAKKAKNMYYAHATMSH-SWKDWEIAATETMHNVQ 256 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_D1HBQ0 Whole genome shotgun sequence of line PN40024, s... 294 2e-78 UniRef50_Q9SXA6 Bifunctional nuclease bfn1 n=20 Tax=Magnoliophyt... 285 1e-75 UniRef50_B9HYZ1 Predicted protein n=20 Tax=Spermatophyta RepID=B... 279 7e-74 UniRef50_Q8LA68 Endonuclease, putative n=13 Tax=Embryophyta RepI... 276 6e-73 UniRef50_Q9LGA5 Os01g0128200 protein n=12 Tax=Magnoliophyta RepI... 275 8e-73 UniRef50_C3VEY2 Bifunctional nuclease n=2 Tax=rosids RepID=C3VEY... 272 1e-71 UniRef50_A5A339 Endonuclease n=1 Tax=Glycine max RepID=A5A339_SOYBN 252 1e-65 UniRef50_A9U2Y4 Predicted protein n=1 Tax=Physcomitrella patens ... 236 7e-61 UniRef50_B8MCF5 Nuclease PA3, putative n=2 Tax=Leotiomyceta RepI... 234 3e-60 UniRef50_B6H0E5 Pc12g06500 protein n=2 Tax=Penicillium RepID=B6H... 231 2e-59 UniRef50_A0M3W8 S1/P1 endonuclease family protein n=6 Tax=Bacter... 226 9e-58 UniRef50_P24021 Nuclease S1 n=6 Tax=Leotiomyceta RepID=NUS1_ASPOR 225 2e-57 UniRef50_D1Z5H6 Whole genome shotgun sequence assembly, scaffold... 223 4e-57 UniRef50_B0DXE1 Predicted protein n=4 Tax=Agaricales RepID=B0DXE... 223 7e-57 UniRef50_C5K479 Nuclease PA3, putative n=5 Tax=Perkinsus marinus... 221 2e-56 UniRef50_Q0CD39 Predicted protein n=2 Tax=Aspergillus RepID=Q0CD... 218 2e-55 UniRef50_A7ETG5 Putative uncharacterized protein n=1 Tax=Sclerot... 215 1e-54 UniRef50_B0MYD6 Putative uncharacterized protein n=1 Tax=Alistip... 214 3e-54 UniRef50_O65424 Putative bifunctional nuclease n=2 Tax=Arabidops... 214 3e-54 UniRef50_A3XR21 Putative S1/P1 Nuclease n=1 Tax=Leeuwenhoekiella... 211 1e-53 UniRef50_Q7S8Q5 Predicted protein n=1 Tax=Neurospora crassa RepI... 211 1e-53 UniRef50_Q2SFD4 Probable endonuclease n=1 Tax=Hahella chejuensis... 211 2e-53 UniRef50_C5K482 Nuclease PA3, putative n=2 Tax=Perkinsus marinus... 211 2e-53 UniRef50_D1ZIR6 Whole genome shotgun sequence assembly, scaffold... 210 6e-53 UniRef50_B7FP92 Predicted protein (Fragment) n=1 Tax=Phaeodactyl... 209 8e-53 UniRef50_Q3IBZ8 Putative S1/P1 Nuclease n=2 Tax=Alteromonadales ... 208 2e-52 UniRef50_B8NJ54 Nuclease S1, putative n=1 Tax=Aspergillus flavus... 206 9e-52 UniRef50_Q0AMT2 S1/P1 nuclease n=1 Tax=Maricaulis maris MCS10 Re... 205 1e-51 UniRef50_P24504 Nuclease PA3 n=2 Tax=Penicillium RepID=NUP3_PENSQ 205 2e-51 UniRef50_A6EB04 Putative S1/P1 Nuclease n=1 Tax=Pedobacter sp. B... 205 2e-51 UniRef50_Q4DEV4 Class I nuclease-like protein, putative n=2 Tax=... 203 5e-51 UniRef50_C7PH62 S1/P1 nuclease n=1 Tax=Chitinophaga pinensis DSM... 203 5e-51 UniRef50_B2W4S8 Nuclease PA3 n=2 Tax=Pleosporineae RepID=B2W4S8_... 202 1e-50 UniRef50_Q15ZB2 S1/P1 nuclease n=4 Tax=Alteromonadales RepID=Q15... 201 1e-50 UniRef50_C5KMC3 Nuclease PA3, putative n=1 Tax=Perkinsus marinus... 201 3e-50 UniRef50_A4BZ60 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriacea... 200 3e-50 UniRef50_C6X5W4 S1/P1 endonuclease family protein n=3 Tax=Bacter... 200 3e-50 UniRef50_C8WD33 S1/P1 nuclease n=5 Tax=Alphaproteobacteria RepID... 200 5e-50 UniRef50_C6XYC1 S1/P1 nuclease n=1 Tax=Pedobacter heparinus DSM ... 200 6e-50 UniRef50_Q5FP59 Nuclease S1 n=1 Tax=Gluconobacter oxydans RepID=... 199 1e-49 UniRef50_A4HI96 p1/s1 nuclease n=10 Tax=Leishmania RepID=A4HI96_... 198 2e-49 UniRef50_A4C4V1 Putative S1/P1 Nuclease n=1 Tax=Pseudoalteromona... 198 2e-49 UniRef50_B0DTT7 Predicted protein n=2 Tax=Laccaria bicolor S238N... 195 2e-48 UniRef50_A2QX99 Contig An11c0270, complete genome n=1 Tax=Asperg... 193 5e-48 UniRef50_C5PWU6 S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritiv... 193 6e-48 UniRef50_Q3BPV9 Endonuclease S1 n=15 Tax=Bacteria RepID=Q3BPV9_X... 193 7e-48 UniRef50_C6XIU0 S1/P1 nuclease n=1 Tax=Hirschia baltica ATCC 498... 193 7e-48 UniRef50_C9ZQW0 Single strand-specific nuclease, putative n=6 Ta... 193 7e-48 UniRef50_Q1YUT9 Probable endonuclease n=1 Tax=gamma proteobacter... 193 8e-48 UniRef50_A0BLJ0 Chromosome undetermined scaffold_114, whole geno... 192 1e-47 UniRef50_C5K8A7 Nuclease S1, putative n=7 Tax=Perkinsus marinus ... 191 2e-47 UniRef50_Q01U80 S1/P1 nuclease n=1 Tax=Candidatus Solibacter usi... 190 4e-47 UniRef50_A3HUK9 Putative S1/P1 Nuclease n=1 Tax=Algoriphagus sp.... 187 3e-46 UniRef50_A2ECC5 Class I nuclease, putative n=2 Tax=Trichomonas v... 187 5e-46 UniRef50_A6GGE9 Probable endonuclease n=1 Tax=Plesiocystis pacif... 186 5e-46 UniRef50_A2EEH7 Putative uncharacterized protein n=1 Tax=Trichom... 186 5e-46 UniRef50_B8KH31 S1/P1 nuclease n=1 Tax=gamma proteobacterium NOR... 186 6e-46 UniRef50_Q4QGQ3 3'-nucleotidase/nuclease, putative n=3 Tax=Leish... 186 6e-46 UniRef50_B9XJ21 S1/P1 nuclease n=1 Tax=bacterium Ellin514 RepID=... 186 7e-46 UniRef50_Q25267 3'-nucleotidase/nuclease n=7 Tax=Trypanosomatida... 186 8e-46 UniRef50_Q7P202 Probable endonuclease n=1 Tax=Chromobacterium vi... 185 1e-45 UniRef50_A4CQ68 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriales... 185 1e-45 UniRef50_A2ELH6 Class I nuclease, putative n=1 Tax=Trichomonas v... 184 2e-45 UniRef50_Q989R8 Endonuclease n=1 Tax=Mesorhizobium loti RepID=Q9... 184 3e-45 UniRef50_B0DTT9 Predicted protein n=2 Tax=Agaricales RepID=B0DTT... 182 1e-44 UniRef50_C5LN34 S1/P1nuclease, putative n=7 Tax=Perkinsus marinu... 181 2e-44 UniRef50_Q7RSD2 3'-nucleotidase/nuclease n=8 Tax=Plasmodium RepI... 181 3e-44 UniRef50_Q236I5 S1/P1 Nuclease n=2 Tax=Tetrahymena thermophila S... 181 3e-44 UniRef50_C5LHN6 ATP-dependent RNA helicase, putative n=4 Tax=Per... 181 3e-44 UniRef50_B6DTM7 Single-strand-specific nuclease n=2 Tax=Bodo sal... 180 4e-44 UniRef50_A2F450 Putative uncharacterized protein n=1 Tax=Trichom... 180 4e-44 UniRef50_Q04SY8 Nuclease S1 n=4 Tax=Leptospira RepID=Q04SY8_LEPBJ 180 5e-44 UniRef50_A2G6P9 Putative uncharacterized protein n=1 Tax=Trichom... 179 9e-44 UniRef50_A8HTU7 Endonuclease n=1 Tax=Azorhizobium caulinodans OR... 178 3e-43 UniRef50_Q23AG7 Putative uncharacterized protein n=2 Tax=Tetrahy... 174 2e-42 UniRef50_B0T6T3 S1/P1 nuclease n=1 Tax=Caulobacter sp. K31 RepID... 174 2e-42 UniRef50_A2E6R1 Putative uncharacterized protein n=2 Tax=Trichom... 174 3e-42 UniRef50_A7H7R9 S1/P1 nuclease n=1 Tax=Anaeromyxobacter sp. Fw10... 174 3e-42 UniRef50_A4A822 Nuclease S1 n=1 Tax=Congregibacter litoralis KT7... 173 4e-42 UniRef50_UPI0001745ECB hypothetical protein VspiD_30620 n=1 Tax=... 173 6e-42 UniRef50_Q1N3Y8 Probable endonuclease n=1 Tax=Bermanella marisru... 173 6e-42 UniRef50_Q2N7X6 Endonuclease n=3 Tax=Erythrobacter RepID=Q2N7X6_... 173 7e-42 UniRef50_B9XA25 Putative uncharacterized protein n=1 Tax=bacteri... 171 3e-41 UniRef50_Q5ZV70 3'-nucleotidase/nuclease n=6 Tax=Legionella RepI... 169 8e-41 UniRef50_A2E030 Putative uncharacterized protein n=1 Tax=Trichom... 168 2e-40 UniRef50_UPI000150A357 hypothetical protein TTHERM_00515230 n=1 ... 168 3e-40 UniRef50_UPI00006CE90A hypothetical protein TTHERM_00559790 n=1 ... 167 4e-40 UniRef50_O68530 Endonuclease S1 homolog n=1 Tax=Mesorhizobium lo... 166 9e-40 UniRef50_A9UZI8 Predicted protein n=1 Tax=Monosiga brevicollis R... 164 3e-39 UniRef50_Q4PFZ0 Putative uncharacterized protein n=1 Tax=Ustilag... 163 5e-39 UniRef50_O65425 Putative bifunctional nuclease n=1 Tax=Arabidops... 162 1e-38 UniRef50_B3LAP6 Putative uncharacterized protein n=1 Tax=Plasmod... 161 2e-38 UniRef50_D0NJT7 Putative uncharacterized protein n=1 Tax=Phytoph... 160 4e-38 UniRef50_Q8ILX4 p1/s1 nuclease, putative n=4 Tax=Plasmodium RepI... 160 4e-38 UniRef50_A2FAR0 Putative uncharacterized protein n=1 Tax=Trichom... 160 5e-38 UniRef50_A0Z194 Endonuclease S1 n=1 Tax=marine gamma proteobacte... 159 8e-38 UniRef50_Q560K3 Putative uncharacterized protein n=2 Tax=Filobas... 157 3e-37 UniRef50_B6ABV1 Putative uncharacterized protein n=1 Tax=Cryptos... 157 3e-37 UniRef50_B8P2Q4 Predicted protein n=2 Tax=Postia placenta Mad-69... 156 7e-37 UniRef50_A3FPP7 S1/P1nuclease, putative n=2 Tax=Cryptosporidium ... 155 2e-36 UniRef50_D2QW83 Putative uncharacterized protein n=1 Tax=Pirellu... 153 5e-36 UniRef50_B2JAU7 Putative uncharacterized protein n=1 Tax=Nostoc ... 153 9e-36 UniRef50_Q47K45 Putative uncharacterized protein n=1 Tax=Dechlor... 151 3e-35 UniRef50_C9SGH7 Nuclease PA3 n=1 Tax=Verticillium albo-atrum VaM... 150 5e-35 UniRef50_A6C3P1 Putative uncharacterized protein n=1 Tax=Plancto... 150 5e-35 UniRef50_C5LKE6 Putative uncharacterized protein n=2 Tax=Perkins... 150 6e-35 UniRef50_B6KFB6 Putative uncharacterized protein n=3 Tax=Toxopla... 149 1e-34 UniRef50_B6KF36 Putative uncharacterized protein n=3 Tax=Toxopla... 144 3e-33 UniRef50_C7RIT3 Putative uncharacterized protein n=1 Tax=Candida... 144 3e-33 UniRef50_B8KWM0 Putative uncharacterized protein n=1 Tax=gamma p... 143 9e-33 UniRef50_D0Y4Z6 Phospholipase C/P1 nuclease domain-containing pr... 137 4e-31 UniRef50_A8PCL3 Putative uncharacterized protein n=1 Tax=Coprino... 137 5e-31 UniRef50_B3L390 S1/p1nuclease, putative n=8 Tax=Plasmodium RepID... 136 1e-30 UniRef50_A4BF01 Probable endonuclease n=1 Tax=Reinekea blandensi... 135 2e-30 UniRef50_B4CYG7 Putative uncharacterized protein n=1 Tax=Chthoni... 134 2e-30 UniRef50_A4YRX0 Putative uncharacterized protein n=2 Tax=Bradyrh... 132 1e-29 UniRef50_Q4UCH4 Bifunctional nuclease, putative n=2 Tax=Theileri... 131 2e-29 UniRef50_A2DKF6 Putative uncharacterized protein n=1 Tax=Trichom... 129 1e-28 UniRef50_A5FFX0 Putative uncharacterized protein n=1 Tax=Flavoba... 128 2e-28 UniRef50_C5PTL3 Putative uncharacterized protein n=1 Tax=Sphingo... 127 4e-28 UniRef50_A2DRT9 Putative uncharacterized protein n=1 Tax=Trichom... 126 9e-28 UniRef50_C5KYE5 S1/P1nuclease, putative n=6 Tax=Perkinsus marinu... 123 6e-27 UniRef50_Q11TZ7 Putative uncharacterized protein n=1 Tax=Cytopha... 123 6e-27 UniRef50_Q6LI73 Hypothetical endonuclease n=2 Tax=Photobacterium... 122 1e-26 UniRef50_C6VWZ8 Putative uncharacterized protein n=1 Tax=Dyadoba... 119 1e-25 UniRef50_C7PNU1 Putative uncharacterized protein n=1 Tax=Chitino... 118 2e-25 UniRef50_C5BI21 Putative uncharacterized protein n=1 Tax=Teredin... 118 2e-25 UniRef50_C6Y3Y4 Putative uncharacterized protein n=1 Tax=Pedobac... 117 6e-25 UniRef50_A4KXI8 Putative S1/P1 nuclease n=2 Tax=Ascovirus RepID=... 116 8e-25 UniRef50_A6E734 Putative uncharacterized protein n=1 Tax=Pedobac... 115 1e-24 UniRef50_A7ARD9 S1/P1 nuclease, putative n=1 Tax=Babesia bovis R... 114 3e-24 UniRef50_D2QFB3 Putative uncharacterized protein n=1 Tax=Spiroso... 114 4e-24 UniRef50_A2FG69 Putative uncharacterized protein n=1 Tax=Trichom... 113 5e-24 UniRef50_C0A652 Putative uncharacterized protein n=1 Tax=Opituta... 112 1e-23 UniRef50_D2LJW8 Putative uncharacterized protein n=1 Tax=Rhodomi... 112 1e-23 UniRef50_Q8XRE8 Putative signal peptide protein n=1 Tax=Ralstoni... 111 3e-23 UniRef50_B3EUC7 Putative uncharacterized protein n=2 Tax=Bactero... 110 6e-23 UniRef50_A2EIL3 Putative uncharacterized protein n=1 Tax=Trichom... 107 3e-22 UniRef50_A2F5A5 Putative uncharacterized protein n=2 Tax=Trichom... 107 4e-22 UniRef50_C9YFD1 Putative uncharacterized protein n=2 Tax=Curviba... 107 5e-22 UniRef50_B0DTT4 Predicted protein n=1 Tax=Laccaria bicolor S238N... 106 6e-22 UniRef50_Q0E526 29.6 kDa S1/P1 nuclease n=1 Tax=Spodoptera frugi... 104 4e-21 UniRef50_C2FVU8 Putative uncharacterized protein n=1 Tax=Sphingo... 103 8e-21 UniRef50_A3HWS6 Putative uncharacterized protein n=1 Tax=Algorip... 102 1e-20 UniRef50_B1ZQR9 Putative uncharacterized protein n=2 Tax=Verruco... 101 4e-20 UniRef50_B0T3S4 Putative uncharacterized protein n=5 Tax=Cauloba... 94 4e-18 UniRef50_Q21JG1 Putative uncharacterized protein n=1 Tax=Sacchar... 93 1e-17 UniRef50_D0NJT6 Putative uncharacterized protein n=1 Tax=Phytoph... 91 3e-17 UniRef50_Q6MQM4 Putative uncharacterized protein n=1 Tax=Bdellov... 91 5e-17 UniRef50_A2FZN6 Putative uncharacterized protein n=1 Tax=Trichom... 91 5e-17 UniRef50_B9EZB3 Putative uncharacterized protein n=1 Tax=Oryza s... 89 1e-16 UniRef50_C1F7J9 Putative uncharacterized protein n=1 Tax=Acidoba... 89 2e-16 UniRef50_B4WCT7 Putative uncharacterized protein n=1 Tax=Brevund... 89 2e-16 UniRef50_C5SFS5 Putative uncharacterized protein n=1 Tax=Asticca... 88 2e-16 UniRef50_D0XMV2 Putative uncharacterized protein n=1 Tax=Brevund... 88 4e-16 UniRef50_A2G9R8 Putative uncharacterized protein n=1 Tax=Trichom... 85 2e-15 UniRef50_C7J139 Os04g0636400 protein n=2 Tax=Oryza sativa RepID=... 84 6e-15 UniRef50_B6KMV3 Putative uncharacterized protein n=3 Tax=Toxopla... 83 1e-14 UniRef50_Q028C4 Putative uncharacterized protein n=1 Tax=Candida... 77 9e-13 UniRef50_D1ZW87 Whole genome shotgun sequence assembly, contig_8... 76 1e-12 UniRef50_Q4Q7F8 Class i nuclease-like protein n=4 Tax=Leishmania... 76 1e-12 UniRef50_B9TFK5 Putative uncharacterized protein (Fragment) n=1 ... 74 5e-12 UniRef50_C2G3H0 Possible S1/P1 Nuclease n=1 Tax=Sphingobacterium... 72 3e-11 UniRef50_B0RM73 Exported putative nuclease n=3 Tax=Xanthomonas c... 70 6e-11 UniRef50_A2G9R9 Putative uncharacterized protein n=1 Tax=Trichom... 70 9e-11 UniRef50_UPI00016C48C1 hypothetical protein GobsU_04989 n=1 Tax=... 69 2e-10 UniRef50_C5GNE5 Predicted protein n=1 Tax=Ajellomyces dermatitid... 66 2e-09 UniRef50_B5YKD8 Putative uncharacterized protein n=1 Tax=Thermod... 64 6e-09 UniRef50_C9YFD0 Putative uncharacterized protein n=2 Tax=Curviba... 63 9e-09 UniRef50_B1MDJ0 Putative uncharacterized protein n=1 Tax=Mycobac... 60 1e-07 UniRef50_C5JC63 Predicted protein n=1 Tax=Ajellomyces dermatitid... 55 3e-06 Sequences not found previously or not previously below threshold: UniRef50_C8X622 Putative uncharacterized protein n=1 Tax=Nakamur... 52 2e-05 UniRef50_B5YDN6 Putative uncharacterized protein n=1 Tax=Dictyog... 45 0.003 UniRef50_Q97KA0 Phospholipase C related protein n=2 Tax=Clostrid... 44 0.005 UniRef50_P59026 Phospholipase C n=6 Tax=Clostridium RepID=PHLC_C... 43 0.010 UniRef50_B8E180 Putative uncharacterized protein n=1 Tax=Dictyog... 42 0.017 UniRef50_B9XJQ5 Putative uncharacterized protein n=1 Tax=bacteri... 42 0.028 UniRef50_Q01YQ6 Putative uncharacterized protein n=1 Tax=Candida... 41 0.068 >UniRef50_D1HBQ0 Whole genome shotgun sequence of line PN40024, scaffold_301.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1HBQ0_VITVI Length = 332 Score = 294 bits (753), Expect = 2e-78, Method: Composition-based stats. Identities = 146/272 (53%), Positives = 199/272 (73%), Gaps = 3/272 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W KEGH C+IA+G L+++A AVK LLP+Y GDL+A+C W D++RH + ++W+ PLH Sbjct: 25 WGKEGHYAVCKIAEGFLSEDALGAVKALLPDYAEGDLAAVCSWADEIRHNFHWRWSGPLH 84 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH-YREGTSDRRYNMTEALLF 119 ++DTPD CN++Y RDCHD G KD+CV GAI N+T QL+ Y S+ RYN+TEAL+F Sbjct: 85 YVDTPDYRCNYEYCRDCHDFRGHKDICVTGAIYNYTKQLTSGYHNSGSEIRYNLTEALMF 144 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSHF+GD+HQP+HVGFT D GGN+I +RW+R K+NLHH+WD II +A K YY D+ ++ Sbjct: 145 LSHFIGDVHQPLHVGFTGDEGGNTIIVRWYRRKTNLHHIWDNMIIDSALKTYYNSDLAIM 204 Query: 180 EEDIEGNFTDGIWSDDLASWREC-GNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + I+ N T WS D++SW+ C + +C N +A+ESI++ACK+ Y+ G TL DDY Sbjct: 205 IQAIQRNITGD-WSFDISSWKNCASDDTACPNLYASESISLACKFAYRNATPGSTLGDDY 263 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 F SRLPIV KR+AQGGIRLA LN +F + + Sbjct: 264 FLSRLPIVEKRLAQGGIRLAATLNRIFASQPK 295 >UniRef50_Q9SXA6 Bifunctional nuclease bfn1 n=20 Tax=Magnoliophyta RepID=Q9SXA6_ARATH Length = 305 Score = 285 bits (729), Expect = 1e-75, Method: Composition-based stats. Identities = 201/277 (72%), Positives = 241/277 (87%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WSKEGH++TCRIAQ LL AH V+ LLP+YV GDLSALCVWPDQ+RHWYKY+WTS LH Sbjct: 29 WSKEGHILTCRIAQNLLEAGPAHVVENLLPDYVKGDLSALCVWPDQIRHWYKYRWTSHLH 88 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +IDTPD+AC+++Y RDCHDQHG+KDMCV GAIQNFT+QL HY EGTSDRRYNMTEALLFL Sbjct: 89 YIDTPDQACSYEYSRDCHDQHGLKDMCVDGAIQNFTSQLQHYGEGTSDRRYNMTEALLFL 148 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 SHFMGDIHQPMHVGFTSD GGN+IDLRW++HKSNLHHVWDREIILTA K+ Y K+++LL+ Sbjct: 149 SHFMGDIHQPMHVGFTSDEGGNTIDLRWYKHKSNLHHVWDREIILTALKENYDKNLDLLQ 208 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 ED+E N T+G+W DDL+SW EC ++ +C +K+A+ESI +ACKWGYKGV++GETLS++YFN Sbjct: 209 EDLEKNITNGLWHDDLSSWTECNDLIACPHKYASESIKLACKWGYKGVKSGETLSEEYFN 268 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 +RLPIVMKR+ QGG+RLAM+LN VF V AT Sbjct: 269 TRLPIVMKRIVQGGVRLAMILNRVFSDDHAIAGVAAT 305 >UniRef50_B9HYZ1 Predicted protein n=20 Tax=Spermatophyta RepID=B9HYZ1_POPTR Length = 297 Score = 279 bits (714), Expect = 7e-74, Method: Composition-based stats. Identities = 141/268 (52%), Positives = 181/268 (67%), Gaps = 3/268 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W KEGH TC+IA+G L EA AVK LLPE GDL+ +C WPD++R + Y W+S LH Sbjct: 25 WGKEGHYATCKIAEGYLTAEALAAVKELLPESAEGDLANVCSWPDEIR--FHYHWSSALH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH-YREGTSDRRYNMTEALLF 119 ++DTPD CN++Y RDCHD G KD CV GAI N+T QL Y+ S+ YN+TEAL+F Sbjct: 83 YVDTPDFRCNYEYFRDCHDSSGRKDRCVTGAIYNYTNQLLSLYQNSNSESNYNLTEALMF 142 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSHF+GD+HQP+HVGF D GGN+I + W+R KSNLHHVWD II +A K +Y+ D+ + Sbjct: 143 LSHFIGDVHQPLHVGFLGDLGGNTIQVHWYRRKSNLHHVWDNMIIESALKTFYSSDLATM 202 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 I+ N T+ + N C N +A+ESI++ACK+ YK G TL DDYF Sbjct: 203 IRAIQNNITENWSNQQPLWEHCAHNHTVCPNPYASESISLACKFAYKNASPGSTLEDDYF 262 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 SRLP+V KR+AQGGIRLA LN +F + Sbjct: 263 LSRLPVVEKRLAQGGIRLAATLNRIFAS 290 >UniRef50_Q8LA68 Endonuclease, putative n=13 Tax=Embryophyta RepID=Q8LA68_ARATH Length = 296 Score = 276 bits (705), Expect = 6e-73, Method: Composition-based stats. Identities = 130/273 (47%), Positives = 184/273 (67%), Gaps = 4/273 (1%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY-VNGDLSALCVWPDQVRHWYKYKWTSPL 59 W K+GH C++A+G D+ AVK LLPE G L+ C WPD+++ +++WTS L Sbjct: 21 WGKDGHYTVCKLAEGFFEDDTIAAVKKLLPESVDGGGLADFCSWPDEIKKLSQWQWTSTL 80 Query: 60 HFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR-RYNMTEALL 118 H+++TP+ CN++Y RDCHD H +D CV GAI N+T QL E + + YN+TEALL Sbjct: 81 HYVNTPEYRCNYEYCRDCHDTHKHRDWCVTGAIFNYTNQLMSASENSQNIVHYNLTEALL 140 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 FLSH+MGD+HQP+H GF D GGN+I + W+ +KSNLHHVWD II +A + YY + Sbjct: 141 FLSHYMGDVHQPLHTGFLGDLGGNTIIVNWYHNKSNLHHVWDNMIIDSALETYYNSSLPH 200 Query: 179 LEEDIEGNFTDGIWSDDLASWRECG-NVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 + + ++ +G WS+D+ SW+ C + +C N +A+ESI++ACK+ Y+ G TL D+ Sbjct: 201 MIQALQAKLKNG-WSNDVPSWKSCHFHQKACPNLYASESIDLACKYAYRNATPGTTLGDE 259 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 YF SRLP+V KR+AQGGIRLA LN +F A + Sbjct: 260 YFLSRLPVVEKRLAQGGIRLAATLNRIFSAKPK 292 >UniRef50_Q9LGA5 Os01g0128200 protein n=12 Tax=Magnoliophyta RepID=Q9LGA5_ORYSJ Length = 308 Score = 275 bits (704), Expect = 8e-73, Method: Composition-based stats. Identities = 140/276 (50%), Positives = 194/276 (70%), Gaps = 7/276 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K+GH++ C+IA+ L+++AA AV+ LLPE G+LS +C W D+VR + Y W+ PLH Sbjct: 34 WGKQGHIIVCKIAEKYLSEKAAAAVEELLPESAGGELSTVCPWADEVR--FHYYWSRPLH 91 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 + +TP + CNF Y RDCH+ + MCV GAI N+T QL Y G S YN+TE+L+FL Sbjct: 92 YANTP-QVCNFKYSRDCHNSRHQQGMCVVGAINNYTDQL--YSYGDSKSSYNLTESLMFL 148 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 +HF+GD+HQP+HVGF D GGN+I + W+R K NLHHVWD II TA KD+Y + ++ + Sbjct: 149 AHFVGDVHQPLHVGFEEDEGGNTIKVHWYRRKENLHHVWDNSIIETAMKDFYNRSLDTMV 208 Query: 181 EDIEGNFTDGIWSDDLASWRECGN-VFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 E ++ N TDG WS+D++ W CGN +C N +A ESI+++C + YK VE TL DDYF Sbjct: 209 EALKMNLTDG-WSEDISHWENCGNKKETCANDYAIESIHLSCNYAYKDVEQDITLGDDYF 267 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVV 275 SR PIV KR+AQ GIRLA++LN +FG + + +V+ Sbjct: 268 YSRYPIVEKRLAQAGIRLALILNRIFGEDKPDGNVI 303 >UniRef50_C3VEY2 Bifunctional nuclease n=2 Tax=rosids RepID=C3VEY2_CUCSA Length = 311 Score = 272 bits (694), Expect = 1e-71, Method: Composition-based stats. Identities = 162/258 (62%), Positives = 200/258 (77%), Gaps = 1/258 (0%) Query: 15 GLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDYE 74 LL EAA AV+ LLPE G+LSA+CVWPDQ+R KY+W SPLH+ +TPD +C+F Y+ Sbjct: 50 ELLIPEAAEAVQDLLPESAGGNLSAMCVWPDQIRLQSKYRWASPLHYANTPD-SCSFVYK 108 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 RDCH+ G DMCVAGAI+NFTTQL+ YR D +N+TEALLFLSHF+GDIHQP+HVG Sbjct: 109 RDCHNDAGQPDMCVAGAIRNFTTQLTTYRTQGFDSPHNLTEALLFLSHFVGDIHQPLHVG 168 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 F SDAGGN+I++RWFR KSNLHHVWDR+IIL A DYY KD LL +++ N T GIWS+ Sbjct: 169 FESDAGGNTIEVRWFRRKSNLHHVWDRDIILEALGDYYDKDGGLLLDELNRNLTQGIWSN 228 Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 D++ W C V SCVN++A ES +ACKW Y+GVEAG TLS++Y++SRLPIVM+R+AQGG Sbjct: 229 DVSEWERCSTVNSCVNRWADESTGLACKWAYEGVEAGITLSEEYYDSRLPIVMERLAQGG 288 Query: 255 IRLAMLLNNVFGASQQED 272 +RLAMLLN VF Sbjct: 289 VRLAMLLNRVFAEDATRG 306 >UniRef50_A5A339 Endonuclease n=1 Tax=Glycine max RepID=A5A339_SOYBN Length = 297 Score = 252 bits (643), Expect = 1e-65, Method: Composition-based stats. Identities = 128/270 (47%), Positives = 168/270 (62%), Gaps = 6/270 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GHV+ C+IAQ L++ AA AVK LLP DLS C W D V H Y W S LH Sbjct: 27 WGDDGHVIVCKIAQARLSEAAAEAVKKLLPISAGNDLSTKCSWADHVHHI--YPWASALH 84 Query: 61 FIDTPDKACNFDYERDCHD-QHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 + +TP+ C++ RDC D + G+K CV AI N+TTQL Y + RYN+T++L F Sbjct: 85 YANTPEALCSYKNSRDCVDYKKGIKGRCVVAAINNYTTQLLEY-GSDTKSRYNLTQSLFF 143 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 SHFMGDIHQP+H GF SD GGN+I +RW++ K NLHH+WD I+LT +Y D++ Sbjct: 144 PSHFMGDIHQPLHCGFLSDNGGNAITVRWYKRKQNLHHIWDSTILLTEVDKFYDSDMDEF 203 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNV-FSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + ++ N T +W+D + W CG+ C +A+ES ACKW YK G L+DDY Sbjct: 204 IDALQQNITK-VWADQVEEWENCGDKDLPCPATYASESTIDACKWAYKDATEGSVLNDDY 262 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 F SRLPIV R+AQ G+RLA +LN VF Sbjct: 263 FLSRLPIVNMRLAQAGVRLAAILNRVFEKK 292 >UniRef50_A9U2Y4 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U2Y4_PHYPA Length = 284 Score = 236 bits (601), Expect = 7e-61, Method: Composition-based stats. Identities = 117/272 (43%), Positives = 167/272 (61%), Gaps = 11/272 (4%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH +TC IA+ LL + A+ LLP+ NG+L+ LC WPD VR KYKWT LH Sbjct: 23 WGADGHRVTCLIAEPLLYEPTKQAIAALLPKSANGNLADLCTWPDDVRWMDKYKWTRELH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++TP+ C +DY RDCHD G ++C++GAI NFT L + T +R +L Sbjct: 83 WVNTPNHVCKYDYNRDCHDHMGTPNVCISGAINNFTHILWN---HTRNRNMKNGRGILLC 139 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 ++P+H GF SD GGN+I + W+ +S+LHHVWD EI+ A K+ + D ++ Sbjct: 140 C------YEPLHTGFRSDQGGNNISVYWYHRRSDLHHVWDTEIVSKALKENHNSDPEIMA 193 Query: 181 EDIEGNFTDGIWSDDLASWRECGN-VFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 + I N TD W+ ++ +W C N SC + +ATESIN+ACKW Y G G L D+Y+ Sbjct: 194 DSILNNATDN-WASEVDAWGICHNRKLSCPDTYATESINLACKWAYSGAAPGTALGDEYY 252 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 SRLP V R+AQGG+RLA +LN++F + + Sbjct: 253 TSRLPTVELRLAQGGVRLAAILNSIFDPNAPQ 284 >UniRef50_B8MCF5 Nuclease PA3, putative n=2 Tax=Leotiomyceta RepID=B8MCF5_TALSN Length = 363 Score = 234 bits (596), Expect = 3e-60, Method: Composition-based stats. Identities = 84/284 (29%), Positives = 126/284 (44%), Gaps = 20/284 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ L+D A K +L + + L+ + W D R KW++PLH Sbjct: 47 WGTLGHATVAYIAQNYLDDATATWAKGVLGDTSDSYLANIASWADSYRSTSAGKWSAPLH 106 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FI D+P +CN DYERDC C AI N+T ++ R ++ EAL Sbjct: 107 FIDAEDSPPTSCNVDYERDCG-----SSGCSVSAIANYTQRVGDGRLSKANT----AEAL 157 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAKDYYAKDI 176 FL HF+GD+ QP+H D GGN I + + + S NLH WD I D Sbjct: 158 KFLVHFLGDVTQPLH-DEALDRGGNEITVTFDGYDSDNLHSDWDTYIPQKLVGGSTLSDA 216 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECG---NVFSCVNKFATESINIACKWGYKGVEAG-- 231 ++ G + A+W + + + +A+++ C A Sbjct: 217 QTWANELISQIDSGSYKSVAANWIKGDDISDPITSATTWASDANAFVCSVVMPNGVAALQ 276 Query: 232 -ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 L DY+NS +P + ++A+GG RLA LN+++ A + Sbjct: 277 QGDLYPDYYNSVIPTIELQIAKGGYRLANWLNSIYSAHIAKRKR 320 >UniRef50_B6H0E5 Pc12g06500 protein n=2 Tax=Penicillium RepID=B6H0E5_PENCW Length = 344 Score = 231 bits (590), Expect = 2e-59, Method: Composition-based stats. Identities = 80/281 (28%), Positives = 126/281 (44%), Gaps = 19/281 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +AQ ++ EAA + +L + + L+ + W D+ R KW++PLH Sbjct: 21 WGALGHATVAYVAQHYISSEAASWAQGILNDTSSSYLANVASWADKYRLTDDGKWSAPLH 80 Query: 61 FIDT---PDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +ID P K+CN DYERDC D + C A+ N+T++ R T EAL Sbjct: 81 YIDAMDDPPKSCNVDYERDCGD-----EGCSVSAVANYTSRAGDGRLSTDHT----AEAL 131 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL HF+GDI QP+H + GGN ID+ + + NLH WD + D Sbjct: 132 RFLVHFIGDITQPLH-DENYEVGGNGIDVTFDGYDDNLHSDWDTYMPGKLVGGSSLTDAQ 190 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECG---NVFSCVNKFATESINIACKWGYKG---VEAG 231 + + G + + SW E + + ++A+++ C Sbjct: 191 GWADSLVDEINSGTYKEQAKSWIEGDTISDAVTTATRWASDANAFVCTVVMPDGAAALQT 250 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 L Y+NS + + +VA+GG RLA +N ++ +D Sbjct: 251 GDLYPTYYNSAIGTIEMQVAKGGYRLANWINLIYEQKVAKD 291 >UniRef50_A0M3W8 S1/P1 endonuclease family protein n=6 Tax=Bacteroidetes RepID=A0M3W8_GRAFK Length = 260 Score = 226 bits (575), Expect = 9e-58, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 121/266 (45%), Gaps = 29/266 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K GH T IA+ L+++A +A+ LL + L+ + + D ++ +Y+ P H Sbjct: 24 WGKTGHRATAEIAETHLSNKAKNAIDGLLGGHG---LAFVANYADDIKSDPEYREFGPWH 80 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + ++ K + AI+ L +++ L L Sbjct: 81 YVNIDPE------NKKYIEEEANKSGDLVQAIKKCVEVLKDQNSSRDEKQ----FYLKML 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP H G D GGN I +RWF SN+H VWD ++I Y +N Sbjct: 131 VHFVGDLHQPFHTGHAEDKGGNDIQVRWFNEGSNIHRVWDSDMINFYQMSYTELALN--T 188 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 +D+ N I L W ES +A Y GV+ GE L Y Sbjct: 189 KDLSKNQIKAIEKGKLLDWVY-------------ESRAMAEDL-YTGVDNGEKLGYSYMY 234 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +P V++++ +GGIRLA +LN+++ Sbjct: 235 KNMPTVLEQLQKGGIRLAKILNDIYS 260 >UniRef50_P24021 Nuclease S1 n=6 Tax=Leotiomyceta RepID=NUS1_ASPOR Length = 287 Score = 225 bits (572), Expect = 2e-57, Method: Composition-based stats. Identities = 73/276 (26%), Positives = 115/276 (41%), Gaps = 20/276 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ + + +L + L+ + W D ++ +++ P H Sbjct: 21 WGNLGHETVAYIAQSFVASSTESFCQNILGDDSTSYLANVATWADTYKYTDAGEFSKPYH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FI D P ++C DY+RDC C AIQN+T L G+ AL Sbjct: 81 FIDAQDNPPQSCGVDYDRDCGSA-----GCSISAIQNYTNILLESPNGSEALN-----AL 130 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F+ H +GDIHQP+H +AGGN ID+ + +NLHH+WD + AA Y Sbjct: 131 KFVVHIIGDIHQPLH-DENLEAGGNGIDVTYDGETTNLHHIWDTNMPEEAAGGYSLSVAK 189 Query: 178 LLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKGVE---AG 231 + + G +S SW + + S +A ++ C Sbjct: 190 TYADLLTERIKTGTYSSKKDSWTDGIDIKDPVSTSMIWAADANTYVCSTVLDDGLAYINS 249 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 LS +Y++ P+ + +A+ G RLA L+ + Sbjct: 250 TDLSGEYYDKSQPVFEELIAKAGYRLAAWLDLIASQ 285 >UniRef50_D1Z5H6 Whole genome shotgun sequence assembly, scaffold_4 n=10 Tax=Sordariomycetes RepID=D1Z5H6_SORMA Length = 336 Score = 223 bits (569), Expect = 4e-57, Method: Composition-based stats. Identities = 76/291 (26%), Positives = 123/291 (42%), Gaps = 26/291 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH+ +A +++ A + LL L+ + W D +R+ +WT PLH Sbjct: 21 WGGFGHITVAYLASNFVSNTTAAYFQTLLRNDTTDYLANVATWADSIRYTKWGRWTGPLH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D+P +C YERDC + CV AIQN+T+++ +R +A Sbjct: 81 YIDAKDSPPHSCGIVYERDCK-----PEGCVVSAIQNYTSRVLDQSLHVVER----AQAA 131 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA-------AKD 170 F+ HF+GDIHQP+H + GGN I + + + NLHHVWD I Sbjct: 132 KFVIHFVGDIHQPLHT-EDVEKGGNGISVFFDDKRFNLHHVWDSSIAEKIVTHKKHGVGR 190 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASW---RECGNVFSCVNKFATESINIACKWGYKG 227 E + +G + + + W E + ++A E C Sbjct: 191 RPFPAAKKWAEQLAEEIREGQYKANSSEWVKGLELKSASEIALEWAVEGNAHVCTVVLPE 250 Query: 228 VE---AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVV 275 + L YF + P+V ++A+ G RLA L+ V A + +++ Sbjct: 251 GPEAIRDQELGGAYFEAAAPVVELQIAKAGYRLAAWLDLVVTAISKNETIS 301 >UniRef50_B0DXE1 Predicted protein n=4 Tax=Agaricales RepID=B0DXE1_LACBS Length = 317 Score = 223 bits (567), Expect = 7e-57, Method: Composition-based stats. Identities = 82/305 (26%), Positives = 120/305 (39%), Gaps = 48/305 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW--TSP 58 W +GH+ A L A V+ L + L W D VR Y W ++P Sbjct: 20 WGADGHMAVGYTAMQFLAPNALSFVQNSLGSSYSRSLGPAATWADTVRSQAAYSWCASAP 79 Query: 59 LHFI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 HF+ D P +C+ RDC C+ AI N+TT++ + R+ E Sbjct: 80 FHFVDAEDNPPTSCSVSETRDCG-----SGNCILTAIANYTTRVVQTSLSATQRQ----E 130 Query: 116 ALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKD 175 AL FL HF+GDI QP+HV GGN I ++ +NLH +WD II K Y Sbjct: 131 ALKFLDHFLGDITQPLHV-EALKVGGNDITVKCNGSSTNLHALWDTGIIEGFLKAQYGNS 189 Query: 176 INLLEEDIEGNFTDGIWSDDLASWRECGN----------------------------VFS 207 + + G ++ ASW C + Sbjct: 190 VTTWANSLATRIKTGNFASSKASWIACSDPSAPLSQKRSIQDDIDEFLAARSTAAITPLK 249 Query: 208 CVNKFATESINIACKWGYKGVEAGETL----SDDYFNSRLPIVMKRVAQGGIRLAMLLNN 263 C +A +S C + + G G+ L + Y PI+ +++A+G RLA LN Sbjct: 250 CPLVWAQDSNTFDCSYVF-GFTTGKDLCSGGTSSYAAGAQPIIEEQIAKGAYRLAAWLNV 308 Query: 264 VFGAS 268 +F S Sbjct: 309 LFDGS 313 >UniRef50_C5K479 Nuclease PA3, putative n=5 Tax=Perkinsus marinus ATCC 50983 RepID=C5K479_9ALVE Length = 337 Score = 221 bits (563), Expect = 2e-56, Method: Composition-based stats. Identities = 90/291 (30%), Positives = 147/291 (50%), Gaps = 32/291 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY---KYKWTS 57 W +GH + ++ Q + E A+ ++ + V +S W D+V++ ++KW+S Sbjct: 19 WGHDGHAVVAQLGQERIKKETQEALDAIMGKGVP--MSNYSSWADEVKYGPDGNEWKWSS 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 LH+ DTPD C+FDY RDC + D CVAGA++N++ ++ R+ EAL Sbjct: 77 SLHYADTPD--CHFDYARDCKN-----DYCVAGALKNYSRRVVDESLPLEQRQ----EAL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAAKDYYAK 174 F+ HF+GD HQP+H+G D GGN I + + +NLH WD ++I Sbjct: 126 KFIVHFVGDAHQPLHIGKPEDLGGNKIAVHLGFGEKPSTNLHSTWDSKLIYELEDQSDPI 185 Query: 175 DINLL----EEDIEGNFTD-GIWSDDLASWRECGNVFS---CVNKFATESINIACKWGYK 226 D E+ + G ++D++ W E + CV+ + +ES AC + Y+ Sbjct: 186 DGEPSWMITEDAVSDELDKGGKYADEIDDWIEDCEKYGLDVCVDSWLSESSKTACDYSYR 245 Query: 227 GVEAG-----ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 V + L DY+N+R+ +V +++A+GG+RL LLN VF A Sbjct: 246 HVNGSLIVDHDFLPMDYYNNRIEVVKEQLAKGGVRLTWLLNTVFAAQDATP 296 >UniRef50_Q0CD39 Predicted protein n=2 Tax=Aspergillus RepID=Q0CD39_ASPTN Length = 300 Score = 218 bits (554), Expect = 2e-55, Method: Composition-based stats. Identities = 72/287 (25%), Positives = 127/287 (44%), Gaps = 26/287 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A+ L ++ + + LLP N D+S W D+ + +Y T P H Sbjct: 21 WGDVGHRTVAYVAENYLTEDGSKFLDNLLPFSNNFDISDAATWADEQKR--RYPKTKPWH 78 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++D D ++ D C+ A++ T+Q+S Y +N TEA+LFL Sbjct: 79 YVDIKDDP--VHHKCDISSLDCPNGDCIISAMEAMTSQVSEYS-------FNRTEAVLFL 129 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA--------AKDYY 172 HF GD+H P+HV GGN ID+ + NLH +WD ++ D Sbjct: 130 VHFFGDLHMPLHV-EGLCRGGNEIDVSFNGRNDNLHSIWDTDMPHKINGIKHSLKHNDEK 188 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK---GVE 229 + ++ I+ N + + C ++ATES ++ C +K Sbjct: 189 TASLKWAKDLIQKNLHR---PATVTECNDVTQPQKCFKQWATESNHLNCAVVFKRGLQYL 245 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVA 276 + L+ DY+ +P++ +++ + G+RLA +N++ + + VA Sbjct: 246 TTQDLAGDYYEDAVPVIEEQIFKAGVRLATWINSIAEKQHAKAAFVA 292 >UniRef50_A7ETG5 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7ETG5_SCLS1 Length = 283 Score = 215 bits (547), Expect = 1e-54, Method: Composition-based stats. Identities = 72/278 (25%), Positives = 108/278 (38%), Gaps = 23/278 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A + + +MLL L+ + W D R + Sbjct: 21 WGTLGHQTVAYVATNFVAESTRDYFQMLLRNDTGSYLAGVATWADSYRLAALLRLFQR-- 78 Query: 61 FIDTPDKA-CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 F +T A C + RDC + + CV GAI NFT+QL + Sbjct: 79 FFNTEINAACGVKFARDCGE-----EGCVVGAILNFTSQLLDPNVSRYHKYIAAKF---- 129 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 +GDIHQP+H + GGN+I + + ++NLH WD I Y D Sbjct: 130 ----VGDIHQPLHA-ENINIGGNTIKVTFNGKETNLHSFWDTAIPEELVGGYSMADAQEW 184 Query: 180 EEDIEGNFTDGIWSDDLASWRECGN---VFSCVNKFATESINIACKWGYKG---VEAGET 233 + GI+ SW E N + +A +S C V G+ Sbjct: 185 ANVLTTAIKTGIYKSQAKSWLEDMNIGDPLTTALGWAKDSNAFICTTVIPDGAEVLQGKE 244 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 LS +Y+ S +P+V +VA+ G RLA L+ + + E Sbjct: 245 LSGEYYESGIPVVELQVARAGYRLAAWLDMIVRGIKTE 282 >UniRef50_B0MYD6 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MYD6_9BACT Length = 257 Score = 214 bits (545), Expect = 3e-54, Method: Composition-based stats. Identities = 73/266 (27%), Positives = 109/266 (40%), Gaps = 29/266 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH + IA+ L EAA + +L + W D H +Y +T+ H Sbjct: 21 WGPKGHDVVAYIAECNLTPEAAEKIDKILG---GASMVYWANWLDSASHTPEYAYTATWH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 + + + D + AI +L + + + L L Sbjct: 78 YANVDEGF-------TYETMTKNPDGDIVEAIDRIVAELKGGQLDPAQEQL----YLKML 126 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMH G SD GGNS+ +R+F +SNLH VWD + A K Y + N L Sbjct: 127 VHLVGDLHQPMHTGHLSDRGGNSVPVRFFGRESNLHAVWDSSLPEAAHKWSYTEWQNQL- 185 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + I S W E N C+ Y G LS DY Sbjct: 186 DRLTEEEVARIQSGTPLDWFEESNAI--------------CREIYVATPEGSDLSYDYIA 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 P++ +++ +GG RLA LLN ++G Sbjct: 232 KYAPVIERQLLRGGHRLAGLLNEIYG 257 >UniRef50_O65424 Putative bifunctional nuclease n=2 Tax=Arabidopsis thaliana RepID=O65424_ARATH Length = 362 Score = 214 bits (544), Expect = 3e-54, Method: Composition-based stats. Identities = 108/258 (41%), Positives = 153/258 (59%), Gaps = 38/258 (14%) Query: 14 QGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDY 73 + ++ AVK LLPE NG+L+A+C WPD+++ +++WTS LHF DTPD CN++Y Sbjct: 138 KSYFEEDTVVAVKKLLPESANGELAAVCSWPDEIKKLPQWRWTSALHFADTPDYKCNYEY 197 Query: 74 ERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV 133 +N+TEAL+FLSH+MGDIHQP+H Sbjct: 198 ------------------------------------SHNLTEALMFLSHYMGDIHQPLHE 221 Query: 134 GFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWS 193 GF D GGN I + W+ ++NLH VWD II +A + YY + + +++ +G WS Sbjct: 222 GFIGDLGGNKIKVHWYNQETNLHRVWDDMIIESALETYYNSSLPRMIHELQAKLKNG-WS 280 Query: 194 DDLASWRECG-NVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 +D+ SW C N +C N +A+ESI++ACK+ Y+ AG TL D YF SRLP+V KR+AQ Sbjct: 281 NDVPSWESCQLNQTACPNPYASESIDLACKYAYRNATAGTTLGDYYFVSRLPVVEKRLAQ 340 Query: 253 GGIRLAMLLNNVFGASQQ 270 GGIRLA LN +F A ++ Sbjct: 341 GGIRLAGTLNRIFSAKRK 358 Score = 87.2 bits (214), Expect = 5e-16, Method: Composition-based stats. Identities = 60/164 (36%), Positives = 76/164 (46%), Gaps = 34/164 (20%) Query: 98 QLSHYREGTSDR-RYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLH 156 QL E + YN+TEAL+FLSHF+GDIHQP+HVGF D GGN+I +RW+R K+NLH Sbjct: 2 QLMSASENSDTIVHYNLTEALMFLSHFIGDIHQPLHVGFLGDEGGNTITVRWYRRKTNLH 61 Query: 157 H----------------------------VWDREIILTAAKDYYAKDINLLEEDIEGNFT 188 H VWD II +A K YY K + L+ E ++ N T Sbjct: 62 HVSVCYRMLKEKVIFPDWINYSYDLPMMKVWDNMIIESALKTYYNKSLPLMIEALQANLT 121 Query: 189 DGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGE 232 I S WR + E +A K GE Sbjct: 122 MTISSLGYPLWRRDLR-----KSYFEEDTVVAVKKLLPESANGE 160 >UniRef50_A3XR21 Putative S1/P1 Nuclease n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XR21_9FLAO Length = 263 Score = 211 bits (538), Expect = 1e-53, Method: Composition-based stats. Identities = 65/266 (24%), Positives = 109/266 (40%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH T IA L A++ LL + L + + D+++ + +Y+ S H Sbjct: 28 WGSKGHRATAAIAVKYLKPRTKKAIEKLLG---DETLVTVSTYGDEIKSYEEYRKYSSWH 84 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + + I ++ ++R L L Sbjct: 85 YVN-------IAPGLSYAEADKNEYGDLVQGINTCKEVITSEDATIEEKR----FYLKML 133 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP+H+G D GGN +RWF + +NLH +WD ++I + Y N Sbjct: 134 VHFIGDLHQPLHLGHAEDKGGNDFQVRWFNNGTNLHSLWDSKLIESYGMSYSELATN--F 191 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + I DL W G + + + Y E GE LS Y Sbjct: 192 GQVSKKQFKEISKGDLMDWVSEGQILA--------------EKVYDSAEIGEKLSYRYQA 237 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +V +++ +GG+RLA LLN +F Sbjct: 238 DYNQMVQEQLQKGGVRLAALLNELFD 263 >UniRef50_Q7S8Q5 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S8Q5_NEUCR Length = 306 Score = 211 bits (538), Expect = 1e-53, Method: Composition-based stats. Identities = 68/290 (23%), Positives = 114/290 (39%), Gaps = 32/290 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH-WYKYKWTSPL 59 W K GH +AQ L V+ +L + + + W D R+ W++ L Sbjct: 20 WGKLGHATVASVAQQYLTPNTVKQVQTILGDNSTSYMGNIASWADSFRYESAANAWSAGL 79 Query: 60 HFIDT----PDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 HF++ P ++C+ DC + CV AI N+T ++ + + Sbjct: 80 HFVNGHDGPPPESCHLVLPEDC-----PPEGCVVSAIGNYTERVQMKNITADQK----AQ 130 Query: 116 ALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK------ 169 AL F+ HF+GDI QP+H + G N+I + + +K+NLH WD I Sbjct: 131 ALKFIVHFLGDIAQPLHTEGFGE-GANNITVTFQGYKTNLHAAWDTSIPNAMLGISPPTS 189 Query: 170 --DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF------SCVNKFATESINIAC 221 + + D ++ G + D+ W +V +A + C Sbjct: 190 AANITSADFLGWANNLAAKINQGQYRKDVRRWLRYHSVATRKASERAAAAWAQDGNEEVC 249 Query: 222 KWGYK---GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + K G + DY+ +V + + +GGIRLA LN +F Sbjct: 250 HYVMKVPGNQLNGTEIGGDYYKGATEVVERSIIKGGIRLAGWLNLIFDNR 299 >UniRef50_Q2SFD4 Probable endonuclease n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFD4_HAHCH Length = 304 Score = 211 bits (536), Expect = 2e-53, Method: Composition-based stats. Identities = 68/271 (25%), Positives = 111/271 (40%), Gaps = 20/271 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH + C +A L+ A V+ LL + + C+WPDQVR ++K T H Sbjct: 50 WGELGHRVVCDVAWKELSPVARDQVQKLLQQAGKRTFAEACLWPDQVRSEKEFKHTGSYH 109 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ A +C + CV A+ + L E + +AL+F+ Sbjct: 110 YVNVERAAKRVSTAENCESK-----GCVLTALNAYAEALKG--EPRQGYQATPAQALMFI 162 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GDIHQP+HV + D GGN + + ++NLH +WD I + + K + Sbjct: 163 GHFIGDIHQPLHVSYADDRGGNKVVYKVAGEETNLHRLWDVNIPESGLPRDWRKAGKKVR 222 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 G + +A ES+ I K G S Sbjct: 223 GKHRGETVTAL-------------SLQEAEAWANESLAITRKVYESLPPQGSEWSKKDLA 269 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 P+ R+ Q G+RL +LN + ++Q + Sbjct: 270 REYPVAEMRLYQAGVRLGAVLNQLLASNQDQ 300 >UniRef50_C5K482 Nuclease PA3, putative n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5K482_9ALVE Length = 328 Score = 211 bits (536), Expect = 2e-53, Method: Composition-based stats. Identities = 86/283 (30%), Positives = 145/283 (51%), Gaps = 32/283 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY---KYKWTS 57 W +GH + ++ Q +N E A+ ++ + V + W D V++ ++KW+S Sbjct: 19 WGHDGHAVVAQLGQERINKETQEAIDAIMGKGVP--MYNYSSWADDVKYGPDGNEWKWSS 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLH+ DTPD C+FDY RDC + D CVAGA++N++ ++ R+ EAL Sbjct: 77 PLHYADTPD--CHFDYARDCKN-----DYCVAGALKNYSRRVVDESLPLEQRQ----EAL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREII-LTAAKDYYA 173 F+ HF+GD HQP+H G D GGN ID+ +NLH WD ++ + + A Sbjct: 126 KFIVHFVGDAHQPLHAGNPKDRGGNKIDVSLGFARHQHTNLHSTWDSALLYEFQGRGHRA 185 Query: 174 KDINLL---EEDIEGNFTD-GIWSDDLASWRECGNVF---SCVNKFATESINIACKWGYK 226 + E+ I+ G ++ D+ W E + +C+ K+ E+ AC++ YK Sbjct: 186 RGAPYWTVTEDAIDDELDKGGRYAGDVDDWVEDCEKYGYDACIEKWVDETAKAACEYSYK 245 Query: 227 GVEAGETLSDD-----YFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + + +D Y++ R+ + +++A+ GIRL LLNN+ Sbjct: 246 HMNGSRVVDNDYLPMKYYDGRIEVAKEQLAKAGIRLTWLLNNL 288 >UniRef50_D1ZIR6 Whole genome shotgun sequence assembly, scaffold_39 n=1 Tax=Sordaria macrospora RepID=D1ZIR6_SORMA Length = 309 Score = 210 bits (533), Expect = 6e-53, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 116/294 (39%), Gaps = 36/294 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K GH +AQ L V+ +L + + + W D R+ W+S LH Sbjct: 19 WGKLGHATVASVAQQYLTPNTVKQVQAILGDKSTTYMGNIASWADSFRYEEGNAWSSGLH 78 Query: 61 FIDT----PDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 F++ P ++C+ DC + CV AI N+T ++ + R T+A Sbjct: 79 FVNGHDAPPPESCHLILPEDC-----PPEGCVVSAIGNYTERVQNKELAAEQR----TQA 129 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK------- 169 L F+ HF+GDI QP+H + G N++ + + +K+NLH WD I T Sbjct: 130 LKFIIHFLGDIAQPLHTEAFGE-GANNVTVFFDGYKTNLHAAWDTSIPNTMLGISPPTSA 188 Query: 170 -DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC-------VNKFATESINIAC 221 + D ++ G + D+ W + + +A + C Sbjct: 189 ANITNADFLGWANNLAAKINQGSYRRDVRRWLRNHRLPANRKGAERAAAAWAQDGNEEVC 248 Query: 222 KWGYK---GVEAGETLSD----DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + K G + DY+ +V + + +GGIRLA LN +F Sbjct: 249 HYVMKIPGNQLNGTEIGAGAGGDYYKGAAEVVERSIIKGGIRLAGWLNLIFDKR 302 >UniRef50_B7FP92 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FP92_PHATR Length = 308 Score = 209 bits (532), Expect = 8e-53, Method: Composition-based stats. Identities = 95/308 (30%), Positives = 147/308 (47%), Gaps = 43/308 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------LSALCVWPDQVRHWYKY 53 W KEGH + +A LL++++ AV+ +L + D L + W D VR ++Y Sbjct: 6 WGKEGHEVVGNLAWKLLSEQSQSAVRNILQDVPIPDNCTACSPLGQVADWADTVRRTHEY 65 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTS---DRR 110 W+ PLH++D C F+YERDC + D+CVAGA+ N+T L +R + Sbjct: 66 FWSGPLHYVDISQDECRFEYERDCAN-----DICVAGAVVNYTRHLQKFRRDETREYGDE 120 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK------------------ 152 + ++L+FL+HF+GD+HQP+HV +SD GGNSI + + Sbjct: 121 LLVRDSLMFLTHFVGDLHQPLHVSRSSDRGGNSIHVVYSPGNADTAPKDGRLGYLRAGRH 180 Query: 153 ---SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGN--VFS 207 NLH VWD II T K Y + L E+ + + + W C N + Sbjct: 181 HHVDNLHAVWDTGIIETCVKLNYKESRVLWEKVLYERIIQAQGTGEWDVWTSCPNGAQQT 240 Query: 208 CVNKFATESINIACKWGYKGVEA-----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLN 262 CV++++ +S+ A W Y+ V+ G LS Y+ +RLP V ++ RLA L Sbjct: 241 CVSEWSEQSLEYALIWAYRNVDGTAIGDGTHLSHAYYETRLPFVEHQLTVAAARLATTLE 300 Query: 263 NVFGASQQ 270 F + Sbjct: 301 ISFTQNVA 308 >UniRef50_Q3IBZ8 Putative S1/P1 Nuclease n=2 Tax=Alteromonadales RepID=Q3IBZ8_PSEHT Length = 288 Score = 208 bits (529), Expect = 2e-52, Method: Composition-based stats. Identities = 72/274 (26%), Positives = 114/274 (41%), Gaps = 30/274 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W + GH + +IA+ L++ LLP N L+ + WPD++R W +S Sbjct: 27 WGQNGHRIIAKIAESHLSETTKT---KLLPLLNNESLAQVSTWPDEMRSAPGEFWQRKSS 83 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+T ++ V + I L + ++ +L Sbjct: 84 RWHYINTSANKPISLNHSHTKNKESVT--NILEGIHYSIKVLQDEQSSLDAKQ----FSL 137 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL H +GD HQP H G D GGN+I ++ F ++NLH +WD ++I Y Sbjct: 138 RFLVHLVGDSHQPFHAGRADDRGGNNIKVKHFGQETNLHSLWDSKLIEGENLSY------ 191 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 F D I +++ + S + ES N+A + +S Sbjct: 192 -------TEFADFINTNNQT--LISEYLTSTPTSWLVESNNLAESIY---NKNETNISYS 239 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y +PI+ R+ QGGIRLA LLN++F S Sbjct: 240 YIFDHMPIIKTRLQQGGIRLAGLLNSLFDESATP 273 >UniRef50_B8NJ54 Nuclease S1, putative n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8NJ54_ASPFN Length = 320 Score = 206 bits (523), Expect = 9e-52, Method: Composition-based stats. Identities = 71/306 (23%), Positives = 112/306 (36%), Gaps = 43/306 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH IAQ + + +L + L+ + W D ++ +++ P H Sbjct: 21 WGNLGHETVAYIAQSFVASPTESFCQDILGDDSTSYLANVATWADTYKYTDAGEFSKPYH 80 Query: 61 FID---TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLS----------------- 100 FID P ++C DY+RDC C AIQN+ + Sbjct: 81 FIDAQDNPPQSCGVDYDRDCGSA-----GCSISAIQNYVSYFRVYNNIGCSSYLDQYSPG 135 Query: 101 -----------HYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF 149 R S R +S +GD HQP+H +AGGN ID+ + Sbjct: 136 ISQWLGGVECPEIRGSCSSRPLTGLIRFPNMSQIIGDTHQPLH-DENLEAGGNGIDVTYD 194 Query: 150 RHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWREC---GNVF 206 +NLHH+WD + AA Y + + G +S SW E + Sbjct: 195 GETTNLHHIWDTNMPEEAAGGYSLSVAKTYADLLTERIKTGTYSSKKDSWTEGIDIKDPV 254 Query: 207 SCVNKFATESINIACKWGYKGVE---AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNN 263 S +A ++ C LS +Y++ P+ + +A+ G RLA L+ Sbjct: 255 STSMIWAADANTYVCSTVLDDGLAYINSTDLSGEYYDKSQPVFEELIAKAGYRLAAWLDL 314 Query: 264 VFGASQ 269 + S Sbjct: 315 IASQSA 320 >UniRef50_Q0AMT2 S1/P1 nuclease n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMT2_MARMM Length = 299 Score = 205 bits (521), Expect = 1e-51, Method: Composition-based stats. Identities = 81/282 (28%), Positives = 123/282 (43%), Gaps = 21/282 (7%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN-GDLSALCVWPDQVRHWYKYKWTSPLH 60 +GH + C +A L+DE + L+ + +C W D VR ++ T+P H Sbjct: 27 GPDGHRIVCDLAWRYLSDETRTEIDRLVAQDPEFDHFRDVCSWADDVRGS-THRHTAPWH 85 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +I+ + D E DC + D C+ AI DR EAL FL Sbjct: 86 YINQTRDDPHVDAE-DCAE-----DGCITSAIDLHAGIFVDRSRSDEDR----LEALKFL 135 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRH-KSNLHHVWDREIILTAAKDYYAKDINLL 179 +H+MGDIHQP+HV D GGN I++ W ++NLH VWD EI+L + Sbjct: 136 AHWMGDIHQPLHVSIEGDRGGNDINVLWRGERRTNLHRVWDSEILLDYM---AETWPYID 192 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK----WGYKGVEAGETLS 235 + D D + +D + +A ES +I + + E Sbjct: 193 DGDRWAQLADQLAADIPLNGISVYTPL-APVDWAQESHDIVRSRGFAYYWARAEEMIEPG 251 Query: 236 DDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 D Y++ LP+ ++R+ QGG+RLA LLN + Q + T Sbjct: 252 DAYYDRNLPVSLQRLKQGGVRLAGLLNQLVEERQLSGTGAVT 293 >UniRef50_P24504 Nuclease PA3 n=2 Tax=Penicillium RepID=NUP3_PENSQ Length = 270 Score = 205 bits (520), Expect = 2e-51, Method: Composition-based stats. Identities = 74/280 (26%), Positives = 123/280 (43%), Gaps = 19/280 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +AQ ++ EAA + +L + L+++ W D+ R KW++ LH Sbjct: 1 WGALGHATVAYVAQHYVSPEAASWAQGILGSSSSSYLASIASWADEYRLTSAGKWSASLH 60 Query: 61 FIDTPDKA---CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 FID D CN DYERDC C AI N+T ++S + + EAL Sbjct: 61 FIDAEDNPPTNCNVDYERDCG-----SSGCSISAIANYTQRVSDSSLSSEN----HAEAL 111 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL HF+GD+ QP+H GGN I++ + + NLH WD + + D Sbjct: 112 RFLVHFIGDMTQPLH-DEAYAVGGNKINVTFDGYHDNLHSDWDTYMPQKLIGGHALSDAE 170 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGN---VFSCVNKFATESINIACKWGYKG---VEAG 231 + + N G ++ W + N + ++A+++ + C Sbjct: 171 SWAKTLVQNIESGNYTAQATGWIKGDNISEPITTATRWASDANALVCTVVMPHGAAALQT 230 Query: 232 ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 L Y++S + + ++A+GG RLA +N + G+ + Sbjct: 231 GDLYPTYYDSVIDTIELQIAKGGYRLANWINEIHGSEIAK 270 >UniRef50_A6EB04 Putative S1/P1 Nuclease n=1 Tax=Pedobacter sp. BAL39 RepID=A6EB04_9SPHI Length = 250 Score = 205 bits (520), Expect = 2e-51, Method: Composition-based stats. Identities = 66/266 (24%), Positives = 106/266 (39%), Gaps = 26/266 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + +IA+ L+ +A VK +L L+ W D ++ Y + H Sbjct: 11 WGMLGHRIVGQIAEAHLSKKALKGVKGVLGN---ETLAMASNWGDFIKSDTSYNYLYNWH 67 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F++ P D + + V++ V + L + A+ L Sbjct: 68 FVNLP---AGLDKQGVFNVLDKVQEPNVYNKVPEMVAILKDNNSSAEQK----VFAMRML 120 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD++QPMH D GGN + + WF KSNLH VWD +I Y Sbjct: 121 VHLIGDLNQPMHTARKDDLGGNKVAVTWFGEKSNLHRVWDEGLIEYQQLSYTE------- 173 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 D + LASW + + S AC Y + + LS Y Sbjct: 174 ---YAKAIDYPSTAQLASW-----NGLSLRDYVYGSYE-ACNQIYAKTKGDDKLSYQYNF 224 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 + L ++ +++ +GGI LA +LN ++ Sbjct: 225 NFLKLLNEQLLKGGICLANVLNEIYK 250 >UniRef50_Q4DEV4 Class I nuclease-like protein, putative n=2 Tax=Trypanosoma cruzi RepID=Q4DEV4_TRYCR Length = 333 Score = 203 bits (517), Expect = 5e-51, Method: Composition-based stats. Identities = 61/284 (21%), Positives = 102/284 (35%), Gaps = 28/284 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDE-------AAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKY 53 W GH++ IA+ L+ E AA + P D W D ++ Sbjct: 28 WWCNGHMLVNEIARRRLHPEVALIVEEAAVNLSASGPFPHTTDFVESGCWADDIK-KLGL 86 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 H+IDTP N + +++ + ++ L Y M Sbjct: 87 FVMEDWHYIDTPYNPQNINIKKNPVNTEN---------LKTVIESLKRTLMKQDLVPYIM 137 Query: 114 TEALLFLSHFMGDIHQPMHVGF-------TSDAGGNSIDLRWFRHKSNLHHVWDREIILT 166 + A++ ++HF+GDIHQP+H D GGN+ + LH +WD Sbjct: 138 SFAIVNIAHFLGDIHQPLHAVELFSPEYPHGDRGGNAETVIVHGKMMALHSLWDSIC--Q 195 Query: 167 AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + ++ F D + + + + A ES +IA + Y Sbjct: 196 GDVKNPRRPLDRWHYAKLREFADRLEDTY--KFPAEVKNETNTTQMAMESYDIAVQVAYP 253 Query: 227 GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 G G ++D+Y RV G RLA +LN + +Q+ Sbjct: 254 GFVDGAKITDEYLEKCRAAAESRVVLAGYRLANVLNQLLDKTQK 297 >UniRef50_C7PH62 S1/P1 nuclease n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PH62_CHIPD Length = 266 Score = 203 bits (516), Expect = 5e-51, Method: Composition-based stats. Identities = 70/268 (26%), Positives = 109/268 (40%), Gaps = 27/268 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH--WYKYKWTSP 58 W GH + IA L +A A+ LL ++ + WPD ++ +KY TSP Sbjct: 24 WGVTGHRVVAEIASRHLTPQARKAIIALLGP---QSMAMVANWPDFIKSDTTHKYDHTSP 80 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H++D P ++ + + + +L +D+ AL Sbjct: 81 WHYLDFPANVDRVHFDE--VLKEHTTGENLYAQTEALIKKLKDPATSKADK----VFALT 134 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 FL H +GD+HQP+H+G D GGN I + WF +SNLH VWD ++I Y L Sbjct: 135 FLIHMIGDMHQPLHIGRDEDQGGNKIPVMWFDKQSNLHRVWDEQLIEFQQLSYTEYTQAL 194 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 + + S +A W N S Y A + LS Y Sbjct: 195 --DTASAAEVRKLQSGSIADWMYDSNQLS--------------NKVYALTHANDKLSYRY 238 Query: 239 FNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 + + ++ +GG+RLA LLN ++ Sbjct: 239 NYWFIADLNGQLLKGGLRLAALLNQIYK 266 >UniRef50_B2W4S8 Nuclease PA3 n=2 Tax=Pleosporineae RepID=B2W4S8_PYRTR Length = 312 Score = 202 bits (513), Expect = 1e-50, Method: Composition-based stats. Identities = 73/284 (25%), Positives = 116/284 (40%), Gaps = 22/284 (7%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ + H +A+ + + +L NG + W D H + ++ H Sbjct: 19 WNTDVHNQIGFMAETFFTPQTTLILAKILEPKYNGSVGRAAAWADGYAHTSEGHFSYQWH 78 Query: 61 FIDTPDK---ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSD------RRY 111 +IDT D +C+ DY RDC K CV AI N T L D Sbjct: 79 WIDTHDNQPESCHLDYVRDCA-----KGGCVVSAIANQTGILRECITQVQDGKLAGGTNL 133 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA--- 168 + AL +++HF+GDIHQP+H + GGN+ + + H + LH VWD I AA Sbjct: 134 TCSYALKWVAHFLGDIHQPLHASGRA-VGGNTYKVVFGNHSTQLHAVWDGFIPYYAAEAS 192 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGN---VFSCVNKFATESINIACKWGY 225 + + ++ D+ + W C N C +A ES C + Y Sbjct: 193 HPFSNQSLDPFFADLVTRIRKDQFYSAPYMWLSCTNPSTPIDCATAWARESNKWDCDYVY 252 Query: 226 KGVEAGETL-SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 V+ L ++ Y +PIV ++++ +RL LN + S Sbjct: 253 SRVQNDTDLGTNGYAAGAVPIVELQISKAALRLGTWLNKLVEGS 296 >UniRef50_Q15ZB2 S1/P1 nuclease n=4 Tax=Alteromonadales RepID=Q15ZB2_PSEA6 Length = 256 Score = 201 bits (512), Expect = 1e-50, Method: Composition-based stats. Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 35/269 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W + GH +T IAQ L +A A+ LLP DL+ +PD++R W Sbjct: 20 WGQIGHRVTGAIAQQHLTPQAQAAISALLP---TEDLAEASTYPDEMRSSPDDFWQKKAG 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ P D + A++ FT L+ + ++++ AL Sbjct: 77 PFHYVTIPKGQ-------TYADVGAPEQGDGVSALKMFTANLTSSQTSKAEKQL----AL 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F+ H +GD+HQP+H G +D GGN + +F SNLH VWD E++ Y Sbjct: 126 RFIVHIIGDLHQPLHAGNGTDRGGNDFKVNFFWQDSNLHRVWDSELLDQRQLSYTEWTA- 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 I + D+ W + + ES+ I Y E T+S D Sbjct: 185 --------ILNRKISAQDINDW-----NTTDPKVWIAESVKI-RDEIYPSQE---TISWD 227 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 Y LP +R+ GIR+A LN ++ Sbjct: 228 YLYHHLPQAKQRLKMAGIRIAAYLNEIYK 256 >UniRef50_C5KMC3 Nuclease PA3, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KMC3_9ALVE Length = 367 Score = 201 bits (510), Expect = 3e-50, Method: Composition-based stats. Identities = 83/291 (28%), Positives = 138/291 (47%), Gaps = 43/291 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH---WYKYKWTS 57 W +GH + +A ++ +A V ++ E L+ W D + + ++ W+ Sbjct: 19 WGPDGHAVVAELADTRMSSKARKWVYDIMGE--GYRLATSASWADSILYGNNSGEWSWSK 76 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLH+ + D C F Y RDC + ++CVAGAI+N+T QL++ R+ +A+ Sbjct: 77 PLHYANVDD--CEFVYARDCPN-----NVCVAGAIKNYTAQLTNTSLTKEQRQ----DAV 125 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAA------ 168 FL HFMGD+H+P++ G +D GGN+I + K+NLH VW ++I Sbjct: 126 KFLVHFMGDVHEPLNAGRYTDLGGNTISVAINFADYEKTNLHKVWGEKLIDEYEGELYPG 185 Query: 169 ----------KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS---CVNKFATE 215 KD +E G + G ++ + SW+ CVN+ E Sbjct: 186 PYIQQDADYNKDRTQYWSVSADEIGRGLASGGKYAGKVPSWKSKCESLGIDVCVNEMVQE 245 Query: 216 SINIACKWGYKGVEAGETLSDD-----YFNSRLPIVMKRVAQGGIRLAMLL 261 S +AC Y V+ + +DD Y+ SR+ V +++A+G +RLA +L Sbjct: 246 SATLACNQAYVNVDGSQIGNDDGLLMGYYTSRIETVKEQLAKGAVRLAWVL 296 >UniRef50_A4BZ60 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriaceae RepID=A4BZ60_9FLAO Length = 260 Score = 200 bits (509), Expect = 3e-50, Method: Composition-based stats. Identities = 65/266 (24%), Positives = 104/266 (39%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH T IA+ LN A + LL L+ + + D+++ Y + H Sbjct: 25 WGQNGHRATGEIAESHLNKRAKRKIDKLL---NGQSLAFVSTYADEIKSDKAYSEYASWH 81 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + + + I L + + L L Sbjct: 82 YVNM-------NLDETYATAAKNTKGDLITGINTCIAVLKDKSS----SSEDKSFHLKML 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMH+G D GGNS+ + WF +SNLH VWD ++I Y + Sbjct: 131 IHLVGDLHQPMHIGRKEDKGGNSVKVEWFGKRSNLHAVWDTKMIEGWNMSYLE--LAESA 188 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + I + L W I+ K Y V+A + +S Y Sbjct: 189 KKVSKEQIAAIEAGTLLDWVAE--------------IHEVTKKVYNSVDANKGISYRYSY 234 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 IV ++ GGIRLA +LN++F Sbjct: 235 DHFDIVRDQLQIGGIRLAKILNDIFS 260 >UniRef50_C6X5W4 S1/P1 endonuclease family protein n=3 Tax=Bacteroidetes RepID=C6X5W4_FLAB3 Length = 263 Score = 200 bits (509), Expect = 3e-50, Method: Composition-based stats. Identities = 62/266 (23%), Positives = 107/266 (40%), Gaps = 28/266 (10%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW--TSPL 59 GH + IA+ L+++A +K ++ L+ WPD ++ W T Sbjct: 24 GVTGHRVVAEIAENHLSNKARKNLKKIIGNQK---LAYWANWPDAIKSDTTGVWKQTDTW 80 Query: 60 HFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLF 119 H+++ + D + + I+ + Q+ + DR AL F Sbjct: 81 HYVNI---SPQADLKSFSDSLQAQTGPNLYTQIKTLSAQIKDKKTSAKDREI----ALRF 133 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 L H +GD QPMHVG D GGN+I L++F +NLH +WD +++ Y + Sbjct: 134 LIHLVGDSSQPMHVGRAGDLGGNTIKLKFFGENTNLHSLWDSKLVDFQKYSYEE--FAKV 191 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 + I S L W ++ Y A ++ S DY Sbjct: 192 LDVKSKEEVRAIQSGTLEEWFYDS--------------HLKANNIYANTVADKSYSYDYN 237 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVF 265 P++ +++ GG+RLA +LN++ Sbjct: 238 YKYAPLLERQLLYGGLRLAKILNDIL 263 >UniRef50_C8WD33 S1/P1 nuclease n=5 Tax=Alphaproteobacteria RepID=C8WD33_ZYMMN Length = 319 Score = 200 bits (508), Expect = 5e-50, Method: Composition-based stats. Identities = 66/288 (22%), Positives = 107/288 (37%), Gaps = 38/288 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRHWYKYKWT 56 W EGH +A + V +L + D + W D+ R T Sbjct: 33 WGMEGHEAIAALAWKYMTPTTRKKVNAILAMDHDRLTEPDFMSRATWADKWRSAGHG-ET 91 Query: 57 SPLHFIDTPDKACNFD------YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRR 110 P HF+D N R ++G CV + F +LS + DR Sbjct: 92 EPWHFVDIEIDNPNLVTACAAASNRSNPMKNGGAQPCVVSQLDRFERELSSKQTSDQDRV 151 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAK 169 AL ++ HF+GD+HQP+H D GGN + + +S NLH WD Sbjct: 152 L----ALKYVLHFVGDLHQPLHAADHDDRGGNCVKVSINNARSLNLHSYWDT-------- 199 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 Y K+I+ + + + I +D SW V ++A ES + ++ Y Sbjct: 200 -YVVKEIDPDPQHLADSLKKEISPEDKKSW-----VLGDSKQWAMESFQLGKRYAYSFNP 253 Query: 230 --------AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 L Y ++ + ++ + G+RLA +LN+ + Sbjct: 254 PAGCDATRPPIPLPAGYDSAARKVAASQLKKAGVRLAYILNHRLRSIP 301 >UniRef50_C6XYC1 S1/P1 nuclease n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XYC1_PEDHD Length = 268 Score = 200 bits (507), Expect = 6e-50, Method: Composition-based stats. Identities = 67/266 (25%), Positives = 109/266 (40%), Gaps = 26/266 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + +IA+G L+++A +K +L L+ W D ++ Y + H Sbjct: 29 WGMLGHRIVGQIAEGYLSNKAKKGIKDVLGN---ESLAMASNWGDFIKSDPAYDYLYNWH 85 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F++ P D + V I L + + ++R A+ L Sbjct: 86 FVNLP---AGLDKQGVFDQLDKETSPNVYNKIPEMAAVLKNRQSTAEEKRL----AMRLL 138 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD++QPMH D GGN + + WF KSNLH VWD +I Y Sbjct: 139 IHLVGDLNQPMHTARKEDLGGNKVFVTWFGEKSNLHRVWDEGLIEYQQLSYTE------- 191 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 N + +D L SWR + AC Y ++ E LS Y Sbjct: 192 ---YANAINYPSNDQLNSWRNNSLKDFVYGSYQ------ACNRIYADIKPEERLSYKYNF 242 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 + ++ +++ +GGI LA +LN+++ Sbjct: 243 EFVGLLNEQLLKGGICLANMLNDIYK 268 >UniRef50_Q5FP59 Nuclease S1 n=1 Tax=Gluconobacter oxydans RepID=Q5FP59_GLUOX Length = 300 Score = 199 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 77/282 (27%), Positives = 113/282 (40%), Gaps = 26/282 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSP-- 58 W GH + IAQ L +A A LL + L + WPD + H K K +P Sbjct: 25 WGPYGHAIVADIAQERLTPQAQKAATALLALENHQTLDQVASWPDTIGHVPKKKGGAPET 84 Query: 59 --LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 H++D +D RDC D +CV + L+ DR A Sbjct: 85 LKWHYVDIDVSHPAYDQARDCPD-----HVCVVEKLPEEIKILADTHASAQDR----LTA 135 Query: 117 LLFLSHFMGDIHQPMHVGF-TSDAGGNSIDLRWFR----HKSNLHHVWDREIIL---TAA 168 L ++ H +GDIHQP+H D GGN+I L +F NLH +WD +I Sbjct: 136 LKWVVHLVGDIHQPLHAAERNKDMGGNAIRLTYFGDNANGHMNLHSLWDEGVIDHEADLH 195 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASW---RECGNVFSCVNKFATESINIACKWGY 225 + + I D+ W + +V++ +A ES ++A Y Sbjct: 196 VGPFYSIDASRAKKEADRLGALITPDETKYWVQDLDGDDVYNATVDWADESHSLARSVAY 255 Query: 226 KGVEA--GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 + A G + DY PI+ R+ Q G+RLA +LN Sbjct: 256 GALPANKGADIGKDYTALTWPIMELRLEQAGVRLAAVLNTAL 297 >UniRef50_A4HI96 p1/s1 nuclease n=10 Tax=Leishmania RepID=A4HI96_LEIBR Length = 328 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 62/290 (21%), Positives = 106/290 (36%), Gaps = 35/290 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDLSALCVWPDQVRHWYKY 53 W GH++ IA+ L+ ++ + P ++ D+ WPD V+ W + Sbjct: 31 WGCTGHMVLAEIARRQLDPSNEKKIQAMAMKFKESGPFLLSPDMIQAACWPDDVKRWGQ- 89 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 S H+ + V+ + + T LS+ R Y + Sbjct: 90 DAMSTWHYYAMQYNPDGINI------TDSVEAVNAVSVSLDMITSLSNVRSPL----YML 139 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREI--- 163 A ++L H +GD+HQP+H D GGN + +R LH WD Sbjct: 140 NFAWVYLVHLIGDLHQPLHAVSRYSEKYPHGDRGGNLVWVRVQTKMLRLHAFWDNICTAT 199 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 + + + D+ + E + +S DL V + A ES A Sbjct: 200 PVLYRRPLSSTDLLAISETADRLLKTYSFSSDLK-------TMQDVQRMANESYAFAVNS 252 Query: 224 GYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 Y + G TLS Y + + + R+ GG RL +LN + +++ Sbjct: 253 SYADMIPGTTLSAAYISRCVEVAESRLTLGGYRLGYILNKLLSDIDVDEN 302 >UniRef50_A4C4V1 Putative S1/P1 Nuclease n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C4V1_9GAMM Length = 290 Score = 198 bits (502), Expect = 2e-49, Method: Composition-based stats. Identities = 67/272 (24%), Positives = 107/272 (39%), Gaps = 29/272 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSP-- 58 W++ GH + +IA+ L D+ A+ LL L + W D++R W Sbjct: 28 WAQNGHRVVGQIAENHLTDKTKMAIAHLLEGDK---LPEVTTWADEMRSDPSKFWKKESV 84 Query: 59 -LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+ ++A +F R + AI L + +R Sbjct: 85 IWHYINI-NEAEDFKPNRYRITATKGEVTDAYSAILKSIAVLQSEQTSLDKKR----FYF 139 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 FL+H +GDIHQPMHVG D GGN + +++F +NLH +WD++++ + Sbjct: 140 RFLTHVVGDIHQPMHVGRKDDRGGNDVKVKYFNKDTNLHSLWDKDLLEGENLSFSEYAY- 198 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 + + + + ES +IA K S Sbjct: 199 -FIDTTNKELISQYLASE-------------PKDWVLESFHIAKKLY---EVDDGNFSYS 241 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 Y + + R+ QGGIRLA LLN +F S Sbjct: 242 YVYEQKNTMNTRLLQGGIRLAGLLNAIFDPSA 273 >UniRef50_B0DTT7 Predicted protein n=2 Tax=Laccaria bicolor S238N-H82 RepID=B0DTT7_LACBS Length = 357 Score = 195 bits (494), Expect = 2e-48, Method: Composition-based stats. Identities = 73/338 (21%), Positives = 119/338 (35%), Gaps = 71/338 (21%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRHWYK 52 W GH + IAQ L+ + ++ P ++ + W D R+ Sbjct: 23 WGFAGHEIVATIAQIYLHPTVLPTLCTIIDFSSTNFSPPDSTCHIAPIATWAD--RYKSN 80 Query: 53 YKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSD 108 W++ LHFI D P +C F + G K + V ++N T L + Sbjct: 81 MTWSAQLHFIGALDDHPPSSCAFPGKNGWA---GTKRVNVLDGMKNVTALLQGW-VKGET 136 Query: 109 RRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 EAL FL HF GD HQPMH+ + GGN + + + ++NLH VWD +I A Sbjct: 137 SDDAANEALKFLIHFFGDAHQPMHMT-GRERGGNQVKVAFGGKETNLHGVWDDSLITKAI 195 Query: 169 KDYYA-----------------KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFS---- 207 + I W+D++ W C +V Sbjct: 196 STIPQNYTLPLPYPEIEQALRGSSYDPYIRRIIWEGIVQRWADEIPGWLSCPDVVKRTSV 255 Query: 208 -----------------------CVNKFATESINIACKWGYKGVEAGETL------SDDY 238 C ++ + ++ C + + L + Y Sbjct: 256 DSQVALGLGGTTGIEILPDNDVLCPYHWSRPTHDLLCDGVWPKEDDNPQLPLLELDTPAY 315 Query: 239 --FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 + +V K++A GG+RLA +LN +F Q + Sbjct: 316 SGMIGQRWLVEKQLALGGLRLAGILNYIFVNQGQRGAF 353 >UniRef50_A2QX99 Contig An11c0270, complete genome n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2QX99_ASPNC Length = 309 Score = 193 bits (490), Expect = 5e-48, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 113/294 (38%), Gaps = 36/294 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH +A+ L ++ V LL N D+S W D ++ K T PLH Sbjct: 21 WGDVGHRAIAYLAEKYLTVAGSNLVNELLANDKNYDISDAATWADTIKW--KRPLTRPLH 78 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D P K+C Y DC + C+ + N T Q++ + ++ EAL Sbjct: 79 YINPDDEPPKSCFVSYPHDC-----PPEGCIISQMANMTRQINDRHANMTQQK----EAL 129 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFR--------HKSNLHHVWDREIILTA-- 167 +FL H GD+HQP+HV + GGN I + + + NLH VWD I Sbjct: 130 MFLIHLFGDLHQPLHVTGVA-RGGNDIHVCFDGKNHCNNDTKRWNLHSVWDTAIPHKING 188 Query: 168 -----AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 + + + C+ ++ATES + C Sbjct: 189 IKHNLKHNPERLASAKWADRLHEE---NKLRPADTECANTQEPLECIMQWATESNQLNCD 245 Query: 223 WGYKGVEAG---ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 + K L Y+ PIV ++ + +RLA ++ + ++ D+ Sbjct: 246 FVMKKGLQWLEKTDLGVKYYEVAAPIVDDQIFKAAVRLAAWISALAEDREEADN 299 >UniRef50_C5PWU6 S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PWU6_9SPHI Length = 262 Score = 193 bits (490), Expect = 6e-48, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 111/266 (41%), Gaps = 26/266 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + IA+ L ++A + L+ + L+ WPD V+ + + TSP H Sbjct: 23 WGMTGHRVVTEIAERHLTNKAKKNIAKLIGK---QHLAYWANWPDFVKSDHAFDETSPFH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +I+T N E+ D + + + L +G ++ + N L FL Sbjct: 80 YINTEG---NLTKEQFATALQQSPDNNIYKQLIRLSADLKAKDKGLTEMQQN----LYFL 132 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H MGD HQPMHVG +D GGN I++ WF N+H VWD ++ Y + Sbjct: 133 IHLMGDAHQPMHVGRPADLGGNKIEVMWFGKPDNIHRVWDSNLVDYEKYSYTE--YANVL 190 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 + + D ASW +I YK VE LS Y Sbjct: 191 DIHTRQENQRLTDGDFASWLYDT--------------HIVANKIYKDVEQNSNLSYRYIY 236 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 +V + +GG+RLA +LN +FG Sbjct: 237 DNKYVVEDALLKGGLRLAKVLNEIFG 262 >UniRef50_Q3BPV9 Endonuclease S1 n=15 Tax=Bacteria RepID=Q3BPV9_XANC5 Length = 318 Score = 193 bits (489), Expect = 7e-48, Method: Composition-based stats. Identities = 64/258 (24%), Positives = 99/258 (38%), Gaps = 27/258 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVR--HWYKYKWTSP 58 W +GH + RIA+ L+ +A V LL + L + W D++R K + P Sbjct: 74 WGPQGHRLVARIAETELSPQARTQVAQLLAGEPDPTLHGVATWADELREHDPDLGKRSGP 133 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+++ + C + RDC D CV A+ L+ + RR +AL Sbjct: 134 WHYVNLGEHDCTYSPPRDCPD-----GNCVIAALDQQAALLADRTQPLDVRR----QALK 184 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 F+ HF+GDIHQPMH G+ D GGN L+ SNLH +WD ++ A L Sbjct: 185 FVVHFVGDIHQPMHAGYAHDKGGNDFQLQIDGKGSNLHALWDSGMLNDRHLSDDAYLQRL 244 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY 238 L + + A + + + + L Y Sbjct: 245 LALPAATAGSAALPPPAAAWAQASCKIAITPGVY----------------PSAHVLPATY 288 Query: 239 FNSRLPIVMKRVAQGGIR 256 + PI ++ G R Sbjct: 289 IATYRPIAETQLRIAGDR 306 >UniRef50_C6XIU0 S1/P1 nuclease n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XIU0_HIRBI Length = 264 Score = 193 bits (489), Expect = 7e-48, Method: Composition-based stats. Identities = 70/270 (25%), Positives = 116/270 (42%), Gaps = 33/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK---YKWTS 57 W K GH +T IA+G L+D+A AV+ +L D++ + WPD +R + Sbjct: 25 WGKLGHRVTGEIAEGYLSDQAKVAVEAILG---VEDMAEVSTWPDYMRSSDDEFFKREAF 81 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 PLHF+ PD E+ + K ++ F L + + R AL Sbjct: 82 PLHFVTVPD-------EQTYAEAGAPKQGDAFTGLERFKAVLQNNESSAEELRL----AL 130 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 + + H + D+HQP+HVG D GGN +++ + SNLH +WD +++ Y + Sbjct: 131 IMVIHIVSDLHQPLHVGKGDDWGGNKVEIMFKGEASNLHEIWDEKLVQDEELSYTE-MAH 189 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 L+ + ++ + + ES I Y + LS Sbjct: 190 WLDRKMTPELAQEWYNA-------------DPSVWIAESKEI-RPSIYPK-DGETDLSWQ 234 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y P++ +R++Q G+RLA LN +FG Sbjct: 235 YIYDHRPVMRQRLSQSGVRLAAYLNEIFGE 264 >UniRef50_C9ZQW0 Single strand-specific nuclease, putative n=6 Tax=Trypanosoma brucei RepID=C9ZQW0_TRYBG Length = 326 Score = 193 bits (489), Expect = 7e-48, Method: Composition-based stats. Identities = 63/282 (22%), Positives = 102/282 (36%), Gaps = 29/282 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDLSALCVWPDQVRHWYKY 53 W+ GH++ IA+ L+ + VK P D WPD ++ Y Sbjct: 27 WAAFGHMVVAEIAKRNLDADVLEKVKQYTQHLSESGPFPKIPDFVQSACWPDDLKS-YDL 85 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 + H+ F+ + + + I + + LS++ Y Sbjct: 86 GVMNGWHYTANVYSRDGFELKE-----PLQQKSNIVSVIDSLSATLSYHETPL----YVR 136 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILT 166 + AL L H GDIHQP+H T D GGN + +R + LH WD + Sbjct: 137 SFALAHLIHHYGDIHQPLHTTSQVSSEYKTGDLGGNLVHVRVRNTTTKLHSFWDDICRPS 196 Query: 167 AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + + + +F D + SW + + E +A + Y Sbjct: 197 ISMK---RPLEEKHYAKVRSFADRLVETYDVSW--EHRRQTNATIMSMEGFELAKEIAYA 251 Query: 227 GVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 GV G LS Y + + +R+ G RLA LNN+ G+ Sbjct: 252 GVVNGSQLSSQYVDRCVETAEQRMTLAGYRLATHLNNILGSK 293 >UniRef50_Q1YUT9 Probable endonuclease n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YUT9_9GAMM Length = 281 Score = 193 bits (489), Expect = 8e-48, Method: Composition-based stats. Identities = 69/271 (25%), Positives = 108/271 (39%), Gaps = 34/271 (12%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 +GH + IA+ L+ + A + + L+ L +WPDQ+R K+ T H+ Sbjct: 20 GADGHRIIVSIAEKHLSKKTAAELTQI---SGGTALTELALWPDQIRGQQKWSHTKSWHY 76 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 I+ D V A++ QL + + RR EAL F Sbjct: 77 INIKDH-------ERFSGLRRSPKGDVLSALKESYKQLKDPKTESQQRR----EALAFFV 125 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWFR--HKSNLHHVWDREIILTAAKDYYAKDINLL 179 H GDIHQP+HVG SD GGN + ++W + NLH VWD +I Sbjct: 126 HLAGDIHQPLHVGRYSDLGGNRVSIKWLGSNKRRNLHWVWDTGLIKDEQLGVDQYSA--- 182 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINI---ACKWGYKGVEAGETLSD 236 + + +W+ +A ES + ++G + T+ Sbjct: 183 -------LINKTTAQQRYNWQSDS-----FLDWAMESKVLRAQVYEFGQPVQKGPVTIDQ 230 Query: 237 DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y N P++ KR+ G+RLA LN +F + Sbjct: 231 QYINRTKPLLKKRLLMAGVRLAGCLNRLFDS 261 >UniRef50_A0BLJ0 Chromosome undetermined scaffold_114, whole genome shotgun sequence n=6 Tax=Paramecium tetraurelia RepID=A0BLJ0_PARTE Length = 712 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. Identities = 55/291 (18%), Positives = 107/291 (36%), Gaps = 24/291 (8%) Query: 1 WSKEGHVMTCRIAQGLLN---DEAAHAVKML------LPEYVNGDLSALCVWPDQVRHWY 51 W + GH+MT +IA+ L + L L + + + VW D ++ Sbjct: 422 WWEVGHMMTAQIAKNYLRDNRPDVLAWADSLVQDFNSLTDGKSNTFAEAAVWLDDIKETG 481 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 S H+ D P + + +++ AI L++ + + Sbjct: 482 TEFLFS-WHYTDRPINPDGLLIKI----EDESRNINSIYAINQAVAVLTNSKTSRNRHTV 536 Query: 112 NMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLR-WFRHKSNLHHVWDREI 163 + L L H +GDIHQP+H DAGGN ++++ N H WD Sbjct: 537 FKAQMLRVLLHVIGDIHQPLHDTSLYNNSYPDGDAGGNFLNIQLQNGTLMNFHSFWDSGA 596 Query: 164 ILTAAKDYY-AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 + A + + A+ ++ + + ++ + S + + + + A + Sbjct: 597 LTFAPNNSFLARPLSQSDSEYLDKWSKDLMKKFPIS-KYSNYDMTNPSVWTYLGFRQAQQ 655 Query: 223 WGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 + Y V A + S DY + + + GG RL L ++ Q ++ Sbjct: 656 FVYPMVAASNSYSSDYEKQAIAFCEENLIVGGYRLGSKLIEIYDQILQNEA 706 >UniRef50_C5K8A7 Nuclease S1, putative n=7 Tax=Perkinsus marinus ATCC 50983 RepID=C5K8A7_9ALVE Length = 366 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 96/298 (32%), Positives = 138/298 (46%), Gaps = 46/298 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY---KYKWTS 57 W +GH L ND A AV +L E ++ WPD V H +++W+S Sbjct: 18 WGPDGHATVADAGNKLFNDNANEAVAEILGE--GVRMADYASWPDSVLHGPDSSEWEWSS 75 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 LHF D C+F Y RDC D D CV G I+N+T Q++ R+ AL Sbjct: 76 GLHFADVE--QCHFIYSRDCKD-----DYCVVGGIKNYTRQVADTSLPIEQRQ----VAL 124 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRW---FRHKSNLHHVWDREIILTAA-----K 169 FL HFMGDIHQP+HVG SD GGN+I + LHH WD ++I + Sbjct: 125 KFLMHFMGDIHQPLHVGRHSDYGGNTIKVDMKFANYEYGALHHAWDEKMIDQSQASQYDG 184 Query: 170 DYYAKDIN--------------LLEEDIEGNFTDGIWSDDLASWRECGNVF---SCVNKF 212 +Y +D N + + + G + D + W CVN Sbjct: 185 EYIQQDANYSTPLAERETFWGITVSDIMTELAEGGAFHDRVPMWLADCETNGLDECVNTM 244 Query: 213 ATESINIACKWGYKGVEA-----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 A ES IAC Y+ ++ G+ LS DY++ R+ IV +++A+G +R A ++N+ F Sbjct: 245 AEESAIIACADAYRHLDGDEIEYGDVLSMDYYDDRIKIVKEQLAKGAVRFAWIMNHAF 302 >UniRef50_Q01U80 S1/P1 nuclease n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01U80_SOLUE Length = 261 Score = 190 bits (483), Expect = 4e-47, Method: Composition-based stats. Identities = 71/270 (26%), Positives = 109/270 (40%), Gaps = 33/270 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W EGH + R+A L AA V +L L+++ W D VR + P H Sbjct: 19 WGPEGHSLIARLAAARLTPAAAAKVAEILG--PGNTLASISSWADSVRRARA--ESGPWH 74 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++D P + D ERDC K CV I++F L + R+ EAL+F+ Sbjct: 75 YVDIPINKPHLDMERDC-----PKGDCVIAKIEDFEKVLVNPAATPVQRK----EALMFI 125 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 HF+GD+HQP+H D GGN + L +F SNLH VWD ++ E Sbjct: 126 VHFVGDMHQPLHCSDNKDKGGNDVKLEFFGRPSNLHSVWDSGLLGRM----------GAE 175 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGE-----TLS 235 + + + + + V +A + A K Y + + Sbjct: 176 DALFATLNRDLTPKRARKFEKG-----TVENWADQIHKAAQKTTYGRLPKSTAGVPPKID 230 Query: 236 DDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 Y + ++ + +GG RLA +LN Sbjct: 231 AHYEHEADELIRIELEKGGARLAKVLNATL 260 >UniRef50_A3HUK9 Putative S1/P1 Nuclease n=1 Tax=Algoriphagus sp. PR1 RepID=A3HUK9_9SPHI Length = 257 Score = 187 bits (475), Expect = 3e-46, Method: Composition-based stats. Identities = 57/266 (21%), Positives = 104/266 (39%), Gaps = 31/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH + +A L A V+ +L + W D+++ +Y + H Sbjct: 23 WGQIGHYLIGYMAGQQLKRSARKNVERVLYP---MSIGRSGTWMDEIKSDKRYDYAYSWH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++ + + + + AI +L ++ E L L Sbjct: 80 YLTSKHGEYDPHLQE--------EGGDAYEAINRIKEELKSGNLNPTEE----AEKLKML 127 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H + DIHQP+HVG D GGN + L +F SNLH VWD +I + Y + Sbjct: 128 IHMVEDIHQPLHVGTGEDRGGNDVKLEYFWQSSNLHSVWDSGMIDRWSMSYTE-----IG 182 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 +++ T + + + E+++ A YK + LS +Y Sbjct: 183 DELMRRLTPEMEDQYRE---------GSMEDWLQEAVD-ARPLVYK-IPENRKLSYNYDY 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 + P++ +R+ +RLA +L ++G Sbjct: 232 AVRPLLEERLIAASVRLAQILEEIYG 257 >UniRef50_A2ECC5 Class I nuclease, putative n=2 Tax=Trichomonas vaginalis RepID=A2ECC5_TRIVA Length = 319 Score = 187 bits (474), Expect = 5e-46, Method: Composition-based stats. Identities = 59/286 (20%), Positives = 100/286 (34%), Gaps = 27/286 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--PEYVNGDLSALCVWPDQVRHWYKYKWTSP 58 W H+M RIA+ LL + ++ +L ++ ++ W D ++ Y Sbjct: 12 WWGHAHMMIGRIAESLLTSKEKKKIEAVLRYGQHPIQTITEATTWQDDLKGTYSLSVMET 71 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF+D P + + + + + L T+ + L Sbjct: 72 WHFLDHPIN-------KGKNTSIPPPTYNITTYMDSAYRALKD---KTTTDPWVWAFHLR 121 Query: 119 FLSHFMGDIHQPMH-------VGFTSDAGGNSI--DLRWFRHKSNLHHVWDREIILTAAK 169 L HF+GD+H P H + T D GGN + +N+H +WD + Sbjct: 122 SLIHFVGDVHTPHHNVALFNDLFPTGDHGGNLYILNCNLGSGCNNIHFLWDSAGFYFPMR 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC--VNKFATESINIACKWGYKG 227 + I ++ + N T I + + + ES +A +GY Sbjct: 182 NPV---IPKYRDEFQKNATKLINELPQSHYTSQNMDVKTFHPEVWHNESYEVAYNFGYNT 238 Query: 228 VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 G S DYF + +R+A G RL L V G E + Sbjct: 239 TMYGW-PSKDYFTTVQTQSKERIAISGYRLGYFLKEVVGNIPVEPT 283 >UniRef50_A6GGE9 Probable endonuclease n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GGE9_9DELT Length = 285 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 69/283 (24%), Positives = 109/283 (38%), Gaps = 34/283 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNG---DLSALCVWPD-QVRHWYKYKWT 56 W +GH + IA+ L+ V+ LL L+ +W D + R ++ + Sbjct: 20 WHDDGHRIVGEIAERNLSPATRAKVRALLQGSDGKGDGSLATASIWADHEARESPEFAFA 79 Query: 57 SPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 + H+++ + C + + C+A A+ + L R EA Sbjct: 80 ASSHYVNLDGPTSPRELHAQCLE----RAGCLATAVPYYADILRSEGASEDQR----AEA 131 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSID------LRWFRHKSNLHHVWDREIILTAAKD 170 L FL HF+GD HQP+H G D GGN ID +NLH WD ++ A + Sbjct: 132 LRFLVHFVGDAHQPLHAGRRGDRGGNDIDRLTIPGYTAKGETTNLHAAWDGALVALALTE 191 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA 230 + GI +D A W + + ES A Y V+ Sbjct: 192 RGVDW-----KAYAVALDAGIDADARARWVGG-----TIYDWLEESRRFAAAEAYLHVDG 241 Query: 231 ------GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 G+TL D++ +R++Q G+RLA LL +F Sbjct: 242 LTPVRSGDTLGADWYRRNSSTAEQRLSQAGVRLAALLEAIFED 284 >UniRef50_A2EEH7 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EEH7_TRIVA Length = 328 Score = 186 bits (473), Expect = 5e-46, Method: Composition-based stats. Identities = 47/280 (16%), Positives = 98/280 (35%), Gaps = 24/280 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQVRHWYKYKWTSP 58 W H R+A+ L+ E + +L + + W D ++ + Sbjct: 14 WWGAPHYTVARLAETRLSPEQLKYINDILETWTSEKAVFHDTANWHDDIK-AANVAIMAN 72 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P + +++ D + A ++ + T+ ++ + Sbjct: 73 WHFRNQPIFSSDYE-----GDFSYPTTYNITDASKDCINTIMSE---TTTSQWILGFCFR 124 Query: 119 FLSHFMGDIHQPMH-------VGFTSDAGGNSIDL--RWFRHKSNLHHVWDREIILTAAK 169 LSHF+ D H P+H D GGNS + + + N+H +WD + Sbjct: 125 TLSHFVADAHCPVHSAGRWSKAFPDGDRGGNSQAVVCTYGQPCRNMHMLWDSACLDFQIW 184 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 D++ E+++ + ++ + + + E+ A K+ Y + Sbjct: 185 PLSKNDVDEYEKNLTNLLNNY----QPKTYLPETYQSTDPDVWENEAYRYASKYVYGNLP 240 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 T +D Y + ++ G RL +L F A + Sbjct: 241 DDFTANDTYIKEGANAAKQLISAAGYRLGEVLLKFFEARK 280 >UniRef50_B8KH31 S1/P1 nuclease n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KH31_9GAMM Length = 323 Score = 186 bits (472), Expect = 6e-46, Method: Composition-based stats. Identities = 61/276 (22%), Positives = 93/276 (33%), Gaps = 36/276 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W GH + ++A L V+ LL + L W D++R W Sbjct: 58 WGAMGHEIAAQLADPYLTAHTRQQVEALLGKD---TLKTASTWADRMRSDPAPFWQEEAG 114 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ P R D A A+ F L ++ AL Sbjct: 115 PYHYVTIPRG-------RQYADVGPPPQGDAASALTQFARDLRSPSVSLERKQL----AL 163 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F H + D+ QP+HVG D GGN + +R F SNLH VWDR++ + A+ Sbjct: 164 RFAIHIIQDLQQPLHVGNGLDRGGNDVPVRIFGETSNLHSVWDRQMFESTARTQAQWLDY 223 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 ++ T + + ES + ++ Sbjct: 224 FKASELLRRPTQ---------------NDADPQVWIAESAKLRETLY----PVPASIDTR 264 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDS 273 Y LP R+A GIR A LN ++ + Sbjct: 265 YIRRELPRAEARLALAGIRTAAWLNAIYDDNATPGE 300 >UniRef50_Q4QGQ3 3'-nucleotidase/nuclease, putative n=3 Tax=Leishmania RepID=Q4QGQ3_LEIMA Length = 381 Score = 186 bits (472), Expect = 6e-46, Method: Composition-based stats. Identities = 62/288 (21%), Positives = 99/288 (34%), Gaps = 31/288 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAV-------KMLLPEYVNGDLSALCVWPDQVRHWYKY 53 W +GH+ IA+ L + V + P + ++ L W D ++ Sbjct: 29 WWDKGHMCIAEIARRNLKPDVQAKVQACANALNKIGPFPKSTNIVELGPWADDLKSM-GL 87 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 S HFIDT + + V+ + VA I L + + Sbjct: 88 YTMSTWHFIDTIYNPQDVK-----VTINPVEIVNVASVIP----MLISAITSPTATSDII 138 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWF---RHKSNLHHVWDREI 163 ++ L HF+GDIH P+H D GGN + LH WD Sbjct: 139 ITSVANLIHFVGDIHMPLHSADLFSPEYPLGDLGGNKQIVIVNETAGTSMKLHAFWDSMC 198 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 ++ + ++ F D + S+ E + + A ES +A K Sbjct: 199 --EGPQNNAVRPLDKDAYAELSAFVDNLVKSH--SFTEEQMMMTNSTIMAAESYELAVKN 254 Query: 224 GYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G+ G LS+ Y + + RV G RLA +LN + Sbjct: 255 VYPGISDGTVLSESYKANGKILAAGRVTLAGYRLATILNTALAGVSLD 302 >UniRef50_B9XJ21 S1/P1 nuclease n=1 Tax=bacterium Ellin514 RepID=B9XJ21_9BACT Length = 377 Score = 186 bits (472), Expect = 7e-46, Method: Composition-based stats. Identities = 70/284 (24%), Positives = 104/284 (36%), Gaps = 35/284 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLP------EYVNGDLSALCVWPDQVRHWYKYK 54 W EGH++ +I L+ L+ N W D + Sbjct: 44 WDAEGHMVVAQIGYNHLDPAVKAKCDALISVALTNVSSQNNTFVTAACWADDNKAALG-- 101 Query: 55 WTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMT 114 T+ H+ID P F + + V AI+ L T+ + + Sbjct: 102 -TAIWHYIDLP-----FSLDGTPTNGVAPASTNVVFAIRQCVATLQ----STNATQIDQA 151 Query: 115 EALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTA 167 +L +L HF+GDI QP+H DAGGNS L + +NLH +WD Sbjct: 152 ISLRYLIHFVGDIQQPLHASTAVSASSPGGDAGGNSFSL--SGYWNNLHSLWDAG----- 204 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGN--VFSCVNKFATESINIACKWGY 225 Y I+ + DG S ++ N V +A ES +A Y Sbjct: 205 -GGYLTNSISRPLTAGGQSIIDGKVSAIEVAYPFTSNIGVIPNPMDWANESWGLAQNVAY 263 Query: 226 KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 G+ T S Y + +R++QGG RLA LLN ++ S Sbjct: 264 AGLTRSSTPSVGYLTTVQNTTQQRMSQGGHRLANLLNTIYSTSP 307 >UniRef50_Q25267 3'-nucleotidase/nuclease n=7 Tax=Trypanosomatidae RepID=Q25267_LEIDO Length = 477 Score = 186 bits (471), Expect = 8e-46, Method: Composition-based stats. Identities = 67/287 (23%), Positives = 115/287 (40%), Gaps = 26/287 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVK---MLL----PEYVNGDLSALCVWPDQVRHWYKY 53 W +GH+ IA+ + + +L P + D+ W D ++ Sbjct: 126 WWSKGHMSVALIAKRHMGASLVEKAELAAKVLSFSGPYPKSPDMVQTAPWADDIK-TIGL 184 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 K S H+I TP + E D V+ + VA I L E + + Sbjct: 185 KTLSTWHYITTPY----YTDEDFTLDVSPVQTVNVASVIP----MLQTAIEKPTANSDVI 236 Query: 114 TEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSN--LHHVWDREII 164 ++L L HFMGDIHQP+H SD GGN + + LH WD + Sbjct: 237 VQSLALLLHFMGDIHQPLHNVNLFSNQYPESDLGGNKQLVVIDSKGTKMLLHAYWDS-MA 295 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWG 224 + + + ++ + D NF D + + ++ + + + E+ ++A K+ Sbjct: 296 EGKSGEDVPRPLSEADYDDLNNFADYLEATYASTLTDKEKNLVDTTEISKETFDLALKYA 355 Query: 225 YKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G + G TLS++Y + I ++V G RLA +LN + + Sbjct: 356 YPGADNGATLSNEYKTNAKKISERQVLLAGYRLAKMLNTTLKSVSMD 402 >UniRef50_Q7P202 Probable endonuclease n=1 Tax=Chromobacterium violaceum RepID=Q7P202_CHRVO Length = 274 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. Identities = 68/269 (25%), Positives = 108/269 (40%), Gaps = 22/269 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH--WYKYKWTSP 58 W +EGH +T IAQ LL+ +A VK L+P D + L ++ DQ + + Sbjct: 23 WGQEGHRITGYIAQQLLSSKAKAEVKKLIPNA---DFAQLALYMDQHKQELKQTLPGSDQ 79 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+ D P C+ E +C D C A I + L+ +DR +AL Sbjct: 80 WHYNDEPV--CSGVTEDECPD-----GNCAANQIDRYRKVLADRGAAKADR----AQALT 128 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK--SNLHHVWDREIILTAAKDYYAKDI 176 FL H +GDIHQP+H D GGN ++ SNLH VWD ++ K Sbjct: 129 FLIHMVGDIHQPLHAADNLDRGGNDFKVQLPGSSKISNLHSVWDTALVQQELNGADEKSW 188 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSD 236 + G + W N ++ + + +A L + Sbjct: 189 AAADLQRYQRNVSGWQGGGVMDWVHESNQYARADVYG----PLAGFSCGASPSTPVYLDN 244 Query: 237 DYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 Y + +V +++A+ G R+A ++N Sbjct: 245 TYLRAGGLLVDQQLAKAGARIAAVINQAL 273 >UniRef50_A4CQ68 Putative S1/P1 Nuclease n=2 Tax=Flavobacteriales RepID=A4CQ68_9FLAO Length = 257 Score = 185 bits (469), Expect = 1e-45, Method: Composition-based stats. Identities = 69/266 (25%), Positives = 110/266 (41%), Gaps = 30/266 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W + GH +A+ L+ A AV LL L+ + + D ++ Y+ SP H Sbjct: 22 WGRTGHRAIGEVAEAHLSRRARKAVSRLL---EGESLAKVSTFGDDIKSDTTYRSFSPWH 78 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P + + D + I++ L + L L Sbjct: 79 YVNLPPETP-------YGEITPNPDGDILQGIEHCIRVLKDPASPRDQQ----VFYLKLL 127 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLE 180 H +GD+HQPMHVG D GGN I L++F +NLH +WD ++I Y L Sbjct: 128 VHLVGDLHQPMHVGRPEDRGGNDIQLQYFDKGTNLHRLWDSDMIEDYGMSYTE-----LA 182 Query: 181 EDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFN 240 E + I V ++A +S ++A Y VE GE L Y Sbjct: 183 ETLPPATRREI----------RVIQSGSVLEWAGQSQSLA-NRVYASVENGEKLYYRYRY 231 Query: 241 SRLPIVMKRVAQGGIRLAMLLNNVFG 266 V +++ GG+RLA +LN+++G Sbjct: 232 LWWDSVERQLLLGGLRLAAVLNDIYG 257 >UniRef50_A2ELH6 Class I nuclease, putative n=1 Tax=Trichomonas vaginalis RepID=A2ELH6_TRIVA Length = 315 Score = 184 bits (467), Expect = 2e-45, Method: Composition-based stats. Identities = 58/279 (20%), Positives = 97/279 (34%), Gaps = 27/279 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN--GDLSALCVWPDQVRHWYKYKWTSP 58 WS E H + R+AQ +L + + +L + + DL + W D +R Sbjct: 5 WSGEPHQLIARVAQTMLTKKQRKWIDEMLFLWPSEAQDLITVSNWEDTIRSDIDDILMQ- 63 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P + ++ + + AI + + T+ + Sbjct: 64 WHFENKPYIEPEYTPKK------VTRTFNITNAIDDAMKSILD---PTTTSFWTFGFYFR 114 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNSIDLR--WFRHKSNLHHVWDREIILTAAK 169 L HF+GD H P+H DAGGN I L S LH +WD + Sbjct: 115 ALIHFVGDSHCPVHSIAYYSDKYPKGDAGGNFIKLNCSISYFCSTLHKLWDSACLNFQHN 174 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 Y A + E++I + + E S + + ES A + Y + Sbjct: 175 KYVAPTLEDFEKNITR-----MMNAYPLKILEEHPSLS-PHDWIDESYKTAIDYAYTPLV 228 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + ++D Y + R+ G RL M+ F Sbjct: 229 DWKNINDTYLANGAEAAEYRITLAGYRLGMVFKQFFKER 267 >UniRef50_Q989R8 Endonuclease n=1 Tax=Mesorhizobium loti RepID=Q989R8_RHILO Length = 278 Score = 184 bits (466), Expect = 3e-45, Method: Composition-based stats. Identities = 68/280 (24%), Positives = 115/280 (41%), Gaps = 36/280 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W EGH + IAQ L+ A VK +L V ++++ W D VR+ + H Sbjct: 21 WGPEGHSIVAEIAQRRLSSTALMEVKRILGGEVA--MASVASWADDVRYAIH-PESYNWH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F+D P +D C V+ C I +++ + R ++L +L Sbjct: 78 FVDIPLADSKYDPVSQCA--ANVQGDCAIAEIDRAEHEITCATDPLQRR-----DSLRYL 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNS--IDLRWFR--------HKSNLHHVWDREIILTAAKD 170 H +GD+HQP H + G N+ + +++ NLH VWD II Sbjct: 131 IHIVGDLHQPFHTV-ADNTGENALAVTVKFGGLIKSPPKTPADNLHAVWDSTIIKQTTYA 189 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA 230 + G++ D + +D L E +A E+ +A + G+ Sbjct: 190 W-------------GSYVDRLETDWLLKHPEASETL-DPVAWALEAHTLAQEMA-AGITN 234 Query: 231 GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 G L +DY+ LP+V +++ + G+RLA +LN + Sbjct: 235 GANLDNDYYAKALPVVDEQLGRAGLRLAAVLNRWLATAPA 274 >UniRef50_B0DTT9 Predicted protein n=2 Tax=Agaricales RepID=B0DTT9_LACBS Length = 375 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. Identities = 83/360 (23%), Positives = 123/360 (34%), Gaps = 96/360 (26%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNG-------DLSALCVWPDQVRHWYKY 53 W GH + IAQ L+ + +L + L+ + W D++R +K Sbjct: 22 WGAAGHEIIATIAQMYLHPSILPTICDILNFSEDETQPEQPCHLAPISTWADKLR--FKM 79 Query: 54 KWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR 109 +W++ LH++ D P + C F ER G + V AI+N T L + Sbjct: 80 RWSAALHYVGSLDDHPSQTCLFPGERGWA---GTRGGNVLDAIKNVTGLLEDWTR-GEAG 135 Query: 110 RYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAK 169 EAL FL HFMGD+H P+H+ D GGNS + W ++NLH +WD +I A + Sbjct: 136 DATANEALKFLVHFMGDLHMPLHLT-GRDRGGNSDRVLWSGRQTNLHSLWDGLLIAKAIR 194 Query: 170 DYYAK-----DINLLEEDIEGNFTD------------GIWSDDLASWRECGNVFS----- 207 +E + G D W DD+ W C Sbjct: 195 TVPRNYSRPLPYPDVEHALRGTIYDSYIRRIMWEGVFQKWKDDVPEWFSCPETTPPPPAR 254 Query: 208 ---------------------------CVNKFATESINIACKWGYKGVE-------AGET 233 C +A + C + G Sbjct: 255 GWQQVVMSLKRLAGKQGVEIGPDTDVLCPYHWAKPIHALNCDIVWPKELDEPPYGGGGSK 314 Query: 234 LSDDYFNSRLP----------------------IVMKRVAQGGIRLAMLLNNVFGASQQE 271 +D+ R P +V K +AQGGIRLA +LN +F Sbjct: 315 FADEDVAGRPPKPHPPLLELDTPKYAGVIEDTMVVEKLLAQGGIRLAGILNYLFLEEAAR 374 >UniRef50_C5LN34 S1/P1nuclease, putative n=7 Tax=Perkinsus marinus ATCC 50983 RepID=C5LN34_9ALVE Length = 401 Score = 181 bits (459), Expect = 2e-44, Method: Composition-based stats. Identities = 58/300 (19%), Positives = 113/300 (37%), Gaps = 35/300 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH +A L+ A++ +K LL D W + W++ LH Sbjct: 29 WDIDGHEAVGMVAMSALDSRASNQLKRLLQ---GKDAVEDAGWAH--KAESSIPWSTRLH 83 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR----------- 109 F+ P+ N ++ + C+ A++ F Q S + Sbjct: 84 FLSQPEPFSNTLVV---NEITCPQGQCLLEALKLFYDQAKGDTSKISQKDRLMMSSARLP 140 Query: 110 -RYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTA 167 + +A+ FL + +GD+HQP+H GF +D G ++ + +L+ +WD EII Sbjct: 141 VQVTDADAVRFLINLIGDMHQPLHEGFQTDDFGKQTIVKLPGGSTLSLYELWDHEIIQET 200 Query: 168 AKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKG 227 K++ + N ++ D W+E K+ ++ A K+ Y Sbjct: 201 IKNHPQFWWSGWTHIQRANP--DTYNADKKLWQENNKAAL--EKWCNDNAEFANKFIYTN 256 Query: 228 VEAGETLS----------DDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 + E L ++++++ G R A++LN++ +S + Sbjct: 257 PLSNERLPIGSGSPINVDAAVLEKWRQLLIQQILLAGSRTAIVLNDILESSAAPGLRSGS 316 >UniRef50_Q7RSD2 3'-nucleotidase/nuclease n=8 Tax=Plasmodium RepID=Q7RSD2_PLAYO Length = 328 Score = 181 bits (458), Expect = 3e-44, Method: Composition-based stats. Identities = 53/298 (17%), Positives = 103/298 (34%), Gaps = 27/298 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRH------- 49 WS EGH++ IA L+D + + Y + VWPD +++ Sbjct: 24 WSDEGHMLISAIAYEGLDDREKKILTQIFQNYKEDNDFNNHIYAAVWPDHIKYYEHPVDT 83 Query: 50 ---WYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 H+I+ P N D + + + D + + + F ++ Sbjct: 84 TKRMDGISIMDRWHYINVPYNPTNIDLDMYHKEYYKDTDNSLTISRKIFQDLKLMEKKNN 143 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVW 159 ++ L + H GD+HQP+H D GG +I++ + LHH+ Sbjct: 144 YGSYFSYNFQLRYFIHVFGDMHQPLHTATFFNKHFIKGDFGGTAINVNYNNRTEKLHHLC 203 Query: 160 D------REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 D + +A + D L + ++ + + G + A Sbjct: 204 DCVFHARDKKWPSATVEEVTNDARTLMNTYPPEYFGNRLNNGMDEYEYLGYIVEDSYAQA 263 Query: 214 TESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 + I A + TL++ Y + ++ +++A GG RL L + + Sbjct: 264 IDHIYYAFPFESLNRHTAYTLTNAYVINLKKVLNEQIALGGYRLTRYLKTIIANVPDD 321 >UniRef50_Q236I5 S1/P1 Nuclease n=2 Tax=Tetrahymena thermophila SB210 RepID=Q236I5_TETTH Length = 330 Score = 181 bits (458), Expect = 3e-44, Method: Composition-based stats. Identities = 53/290 (18%), Positives = 103/290 (35%), Gaps = 31/290 (10%) Query: 1 WSKEGHVMTCRIAQG---------LLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWY 51 W GH++T +A+ L E + L + + W D ++ Sbjct: 19 WWDGGHMITVEVAKQEILARDPALYLKIEKYVTILNPLCDARSQTFVQAASWADDIKDPA 78 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE----GTS 107 W HF + P D + A++ +L Sbjct: 79 MNFW-DKWHFFNKPINEEGLYVVLD----QDSLNNNSINALKRCIQELQKNNTTPINNPD 133 Query: 108 DRRYNMTEALLFLSHFMGDIHQPMH---------VGFTSDAGGNSIDL-RWFRHKSNLHH 157 + + +L H +GD+HQP+H D GGN ++ LH+ Sbjct: 134 NISVQQAIMMRYLIHIVGDMHQPLHNTNLFNYTFSTNQGDLGGNKENVILLNGTSMVLHY 193 Query: 158 VWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESI 217 +D + A +++ ++ +E +F + S+ + +A ES Sbjct: 194 YFDSGALRLAD---FSRPLSQEQEQQVTDFAASFRAQYPRSFFNERVNITLPEMWAQESY 250 Query: 218 NIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 IA + Y ++ ++ ++ N + ++ +++A GG RLA LL +VF Sbjct: 251 EIAVRDIYPYLKLTNKVTPEWDNLQYEMIKQQIALGGYRLADLLTSVFNP 300 >UniRef50_C5LHN6 ATP-dependent RNA helicase, putative n=4 Tax=Perkinsus marinus ATCC 50983 RepID=C5LHN6_9ALVE Length = 1614 Score = 181 bits (458), Expect = 3e-44, Method: Composition-based stats. Identities = 77/300 (25%), Positives = 122/300 (40%), Gaps = 58/300 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W ++GH + IAQ +++D V L D+ + W D+ H +Y+WT+PLH Sbjct: 22 WGEDGHSIVAAIAQRIVSDRVIEGVNETLGR--GQDMIGVACWADKASHSAQYRWTAPLH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 F+DTP K C YERDC D D CV GAI N+T + ++R A+ + Sbjct: 80 FVDTPTKQCQMVYERDCRD-----DFCVIGAIYNYTNRAISKSVSRAERE----FAMKLV 130 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILT-------------- 166 + P H + S LH VWD +IL Sbjct: 131 TTDFAPP-GPRH-----------------KVSSKLHQVWDSGLILQDEFELRVQRRREHR 172 Query: 167 -------AAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF---SCVNKFATES 216 + + L E G ++ W C A ES Sbjct: 173 KIPPHPPYRHKFEERWHELFEHLWTKLSKGGEYAKHREEWLAPCRQNGLQECTKTMAEES 232 Query: 217 INIACKWGY-----KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 + +AC Y + + G+ L +YF +R P++ +++A+GG+RLA +L +FG+++ Sbjct: 233 LAVACTAAYHDEYRRWIADGDVLDRNYFLTRNPLMEEQLAKGGVRLAWVLQQMFGSNRHR 292 >UniRef50_B6DTM7 Single-strand-specific nuclease n=2 Tax=Bodo saltans RepID=B6DTM7_9EUGL Length = 360 Score = 180 bits (457), Expect = 4e-44, Method: Composition-based stats. Identities = 57/291 (19%), Positives = 98/291 (33%), Gaps = 28/291 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-----LPEYVNGDLSALCVWPDQVRHWYKYKW 55 W GH++T IAQ LL + + ++ WPD ++ + Sbjct: 77 WGCAGHMITAEIAQQLLPTNVRRYFTDISAYQQMYYPRITSMTEASCWPDDMKSYTSQYS 136 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 + HF + N C V+ + A+ N QL+ T Sbjct: 137 S--WHFYNVCLLRANGT-NLTCPVWTSVETGQMPTAVANARAQLAMGSNLTHAES---AF 190 Query: 116 ALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 L FL H +GD HQP+H+ D GGN + ++NLH D L Sbjct: 191 WLAFLVHLVGDFHQPLHIATLFNPMFPKGDQGGNRFYIYVNNSRTNLHAFHDDLAWLLPR 250 Query: 169 KDYYAKDINLLEED--IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK 226 + + + +D + ++ + ++ NV + + E Y Sbjct: 251 DGFPQRPLAEYPDDVSMIEGLSESLILLQKFAYPSQPNVTNTS-VWIEEGFETGVNISYT 309 Query: 227 GVEAGE-------TLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 + LSD Y ++ ++A GG RLA +L ++ Sbjct: 310 LPNGQDLQFNQHFNLSDTYVTRLRSMLQNKLALGGRRLARILMEIYDEVHA 360 >UniRef50_A2F450 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2F450_TRIVA Length = 329 Score = 180 bits (457), Expect = 4e-44, Method: Composition-based stats. Identities = 49/281 (17%), Positives = 97/281 (34%), Gaps = 26/281 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--PEYVNGDLSALCVWPDQVRHWYKYKWTSP 58 W H + IA + + ++ L ++ + + VW D ++ Y S Sbjct: 11 WWGHAHSLIASIAMKDFSSKERKILEKFLEYGQHKRATIEEVAVWQDDLKGAYDLGIMSS 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF P + + + + L++ + + + L Sbjct: 71 WHFTPRPLIKDGYTATLQ------PVTYNITSYMNSAWNSLTN---PATTDPWIIAFHLR 121 Query: 119 FLSHFMGDIHQPMHV-------GFTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAAK 169 L HF+ D+H P H D GGN I + N+H +WD + Sbjct: 122 SLIHFVADVHTPHHNVGYYSQETPDGDKGGNLYQIICNYGSACMNIHFLWDSACLALPLG 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGY-KGV 228 + I ++ N T + + A K++ ES + ++GY + Sbjct: 182 NP---LIPKYLDEFSENVTKIMKNHQKAK--MGDLETIDFMKWSNESYDTVKQYGYSPAI 236 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 E ++D Y + + + RV+ G RL+ +L ++ + Sbjct: 237 ERYGEVTDQYLKTCQSVALNRVSLAGYRLSTVLRQIYNEKK 277 >UniRef50_Q04SY8 Nuclease S1 n=4 Tax=Leptospira RepID=Q04SY8_LEPBJ Length = 295 Score = 180 bits (456), Expect = 5e-44, Method: Composition-based stats. Identities = 72/295 (24%), Positives = 116/295 (39%), Gaps = 50/295 (16%) Query: 1 WSKEGHVMTCRIAQGLL-NDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---- 55 W +GH IAQ LL N +A + +L L + PD++R + K Sbjct: 26 WGHQGHKTIGIIAQHLLVNSKAFEEINNILG---GLTLEEISTCPDELRVFQSEKKPMSS 82 Query: 56 --------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH 101 T HFIDTP N +E K CV I ++ L+ Sbjct: 83 VCNQIFTNPEPPTNTGSWHFIDTPISQFNPTHEDI---VKACKSSCVLTEIDRWSNVLAD 139 Query: 102 YREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF-TSDAGGNSIDLRWFRHKSNLHHVWD 160 T+ +AL F+ HF+GDIHQP+HV D GGN + +R R+K+NLH WD Sbjct: 140 ----TTQTNAKRLQALSFVVHFIGDIHQPLHVAERNHDLGGNKVKVRIGRYKTNLHSFWD 195 Query: 161 REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA 220 ++ + + + I L + + + + A Sbjct: 196 TNLVNYISTNPISTTILLKSDV----------------AFAQTEAQTTPETWVLQGFQFA 239 Query: 221 CKWGYKGVEAGE----TLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 Y G+ +S+ Y + +P+V ++A G+RL+ L +F +S ++ Sbjct: 240 RNVAYDGIPIDYASVVRISNAYIQNAIPVVKHQLASAGVRLSQHLARIFSSSNKQ 294 >UniRef50_A2G6P9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G6P9_TRIVA Length = 348 Score = 179 bits (454), Expect = 9e-44, Method: Composition-based stats. Identities = 59/296 (19%), Positives = 102/296 (34%), Gaps = 34/296 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNG--DLSALCVWPDQVR-HWYKYKWTS 57 W E H+ RIA+ ++ + + +L + + + + W D++ + + Sbjct: 12 WWNEPHMAVVRIAERMITKQQKDWMNVLFSMWPSEADTMVSASTWHDEIPENSAQVSIMK 71 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 HF D P A F+YE V + + L T+ Y Sbjct: 72 NWHFADKPILAPGFEYEY-------QPTYNVTSVVSDSMNAL---FNPTTKSLYAYHFLF 121 Query: 118 LFLSHFMGDIHQPMHV-------GFTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAA 168 L HF+GDIH P H D GGN I+ ++ LH +WD ++ Sbjct: 122 RNLVHFIGDIHTPCHTAAYYSPKFEEGDRGGNSLKINCKYGEPCKQLHKMWDSGVLNFQH 181 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 D N L ++ E N I S + E+ ++A + Y + Sbjct: 182 M---YLDTNELLDEFEHNI-SHIMQMHPESSLPTVKSL-NAYLWFNETYDVAVNYAYGML 236 Query: 229 EA-------GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVAT 277 + L +Y + ++ + G RLA ++ F ED + T Sbjct: 237 KDLNNSELDKYDLMPNYISKGAMAAEIQIVKAGYRLAYVIQEFFKVHSPEDPRIFT 292 >UniRef50_A8HTU7 Endonuclease n=1 Tax=Azorhizobium caulinodans ORS 571 RepID=A8HTU7_AZOC5 Length = 282 Score = 178 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 65/276 (23%), Positives = 108/276 (39%), Gaps = 37/276 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W ++GH + IAQ L A V LLP L+++ W D VR + T H Sbjct: 26 WGEDGHAIVAEIAQRRLTPTGAALVASLLP--KGASLASVASWADDVR--PDHPETRRWH 81 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 ++ P A +D RDC + C+ AI+ + E T+AL L Sbjct: 82 YVGIPMGAATYDPLRDCP--SRPEGDCIVAAIERARLDMHCAPEPA-----ARTDALKLL 134 Query: 121 SHFMGDIHQPMHVGFTSDAGG-NSIDLRWFRH-----------KSNLHHVWDREIILTAA 168 H MGD+HQPMH G + L W +N+H +WD ++ A+ Sbjct: 135 VHLMGDLHQPMHAIAADHLGTRRKVLLNWAGQACTHDCEAPPPTTNMHVLWDTTLVRKAS 194 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 + G + D + + L +A+E+ + Y V Sbjct: 195 LSW-------------GGYVDRLEAGWLKEADAAAVAAGTPADWASETHGVGLAM-YALV 240 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 ++ Y+ + LP++ +++ + G+RLA +N Sbjct: 241 PPDNVINTTYYRAALPVLDQQLGKAGLRLAHEINAA 276 >UniRef50_Q23AG7 Putative uncharacterized protein n=2 Tax=Tetrahymena thermophila RepID=Q23AG7_TETTH Length = 630 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 57/298 (19%), Positives = 98/298 (32%), Gaps = 38/298 (12%) Query: 5 GHVMTCRIAQGLLNDEAAHAVK---MLLPEYVNGDLSAL--------CVWPDQVR-HWYK 52 H++ IA+ L L Y + + VW D ++ + Sbjct: 24 PHMLVLAIAKKELMKNDMEVYNITAKYLDTYSTQGVDTVSTTTYEENAVWADDIKVYGDA 83 Query: 53 YKWTSPLHFI-DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K H+I + N + A N L++ + + Sbjct: 84 QKAMEMWHYIGNKDSNPQNLTPLKKDPMAD---SENALNAYNNIVKVLTNEKFVGQMTEF 140 Query: 112 NMTEALLFLSHFMGDIHQPMHVG-------------FTSDAGGNSIDLRWFR-----HKS 153 + L L H +GDIH P H G F D GGN + ++ K+ Sbjct: 141 KVNM-LKMLVHIVGDIHMPHHTGSFYNATYKNDKGEFWGDLGGNRQMINFYTSTGEMKKT 199 Query: 154 NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 N+H +D + + +N + D I + N + +A Sbjct: 200 NIHFYFDSSCFFYTWTNRLVRPLNETFKIYFQRELDRIVAQYPKESLNIDNT-KTFSDWA 258 Query: 214 TESINIACKWGYKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 ES N+A Y + + + DD++NS ++ KR+ G RLA L +F + Sbjct: 259 DESWNLALNNVYPFLLSKNEIHYGDDFYNSSFDMIQKRIVTAGYRLAYTLQKLFTPEK 316 >UniRef50_B0T6T3 S1/P1 nuclease n=1 Tax=Caulobacter sp. K31 RepID=B0T6T3_CAUSK Length = 287 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 70/285 (24%), Positives = 110/285 (38%), Gaps = 39/285 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQVRHWYKYKWT 56 W + GH + +IA+G L +AA AV LL + DL+A W D R ++ T Sbjct: 23 WGRTGHAVVAQIARGYLTPKAAAAVDALLAADTDALTPPDLAARASWADAWR--KDHRQT 80 Query: 57 SPLHFIDTPDKACNFD------YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRR 110 + HF+D + G + C+ G + F +L+ + ++R Sbjct: 81 TEWHFVDVELDHPDLAGACFGFPASATPASAGPEKDCIVGRLNAFEAELADPKTDAAERL 140 Query: 111 YNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKS-NLHHVWDREIILTAAK 169 A F+ HF+GD+HQP+H D GGN I L ++ NLH WD + Sbjct: 141 L----AFKFVLHFVGDLHQPLHAADNQDRGGNCIPLALGGPRTVNLHSYWDTVAVEA--- 193 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGY---- 225 I + + + I + +W + +A ES +A Y Sbjct: 194 ------IEADPDKLAAKLSAQITPAERKAWEKG-----DAKTWAMESFALAKSTVYTIGS 242 Query: 226 ----KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 A L Y S V ++ + G+RLA+ LN G Sbjct: 243 KPGCASDTAPVPLPAGYNQSAQAAVALQLKKAGVRLALELNRALG 287 >UniRef50_A2E6R1 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2E6R1_TRIVA Length = 330 Score = 174 bits (441), Expect = 3e-42, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 91/275 (33%), Gaps = 26/275 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY--VNGDLSALCVWPDQVRHWYKYKWTSP 58 W H + I+Q L + + +L D+ + WPD + Y K + Sbjct: 12 WWGHSHTIIAHISQNQLTHKQISNINRILSSSGFETTDIEKISSWPDDLI-EYNLKSMAE 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 H+ D P + D + V I + L T+ + + Sbjct: 71 WHYADKP-----YVPYEDFNFIKPPPTYNVTTYINDAWETLHD---PTTTDLWAWAFHIR 122 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNSI--DLRWFRHKSNLHHVWDREIILTAAK 169 L H++GDIH P H D GGN + W N+H +WD + Sbjct: 123 NLIHYVGDIHTPHHNIARFTVYHQNGDMGGNLYRLNCTWGDACKNIHFLWDSCALAFPIA 182 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE 229 D + D+ N + ++ ++ ES IA GY + Sbjct: 183 DITN---PIYASDLAKN--SSLIEEEFPMSSFENMTSVDPRAWSLESYAIASTLGYA-LP 236 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + S DY + +R+A G RL +L + Sbjct: 237 SYSEPSQDYLYNARQAGKRRIAMAGYRLGYMLKEL 271 >UniRef50_A7H7R9 S1/P1 nuclease n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7R9_ANADF Length = 285 Score = 174 bits (440), Expect = 3e-42, Method: Composition-based stats. Identities = 60/279 (21%), Positives = 101/279 (36%), Gaps = 35/279 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WS+ GH + IA+ L A V+ +L + + + W D R T H Sbjct: 28 WSEPGHRIVAAIAEERLGPSARRLVREVLGATPMSN-ADVAGWADAQRD----PATRAWH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P A FD RDC ++ CV A++ +L +A +L Sbjct: 83 YVNIPL-AAAFDPARDC-----PREACVVAALERAIAELRDGEGAAR-----RADAFRWL 131 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWF---RHKSNLHHVWDREIILTAAKDYYAKDIN 177 H + D+HQP+H G D GGN + R H VWD++++ + Sbjct: 132 VHLVADVHQPLHAGDGRDRGGNDLPTRRERARGQPRPFHRVWDQDVLGPILRRRGTVAA- 190 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGET---- 233 I + A W + ++A ES +A + Sbjct: 191 ------ARALARDIGPAEAARWAARPS----PAEWADESHALARALYAELGPLPRDGRIV 240 Query: 234 -LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 L +Y + + ++ + G+RLA LL + A Sbjct: 241 LLPREYADRQRARTELQLQKAGVRLAALLERIAAARAVR 279 >UniRef50_A4A822 Nuclease S1 n=1 Tax=Congregibacter litoralis KT71 RepID=A4A822_9GAMM Length = 293 Score = 173 bits (439), Expect = 4e-42, Method: Composition-based stats. Identities = 60/269 (22%), Positives = 89/269 (33%), Gaps = 36/269 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W GH + +A L+ A + LL + L++ W D++R W Sbjct: 19 WGAMGHELAGTLAAPYLSANARAQIDALL---KDETLASASTWADRMRGDPDPFWQEEAG 75 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 P H++ PD A+Q F L T +R AL Sbjct: 76 PYHYVTVPDGQS-------YTQVGAPPQGDGYTALQQFRKDLRDPTTPTRRKRL----AL 124 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 F H + D+ QP+HVG D GGN I + SNLH VWDR++ + + Sbjct: 125 RFALHIVQDLQQPLHVGNGRDRGGNQIRVAINGETSNLHSVWDRQLFESTGRSKETWLDY 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDD 237 D+ S + ES + + Sbjct: 185 FRRGDLLREP---------------NPADSDPLLWIRESAALRETLY----PVPTAIDRA 225 Query: 238 YFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 Y +LP +R+A +R A LN F Sbjct: 226 YIKQQLPRAEQRLALSAVRTAAWLNATFD 254 >UniRef50_UPI0001745ECB hypothetical protein VspiD_30620 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ECB Length = 323 Score = 173 bits (438), Expect = 6e-42, Method: Composition-based stats. Identities = 55/315 (17%), Positives = 93/315 (29%), Gaps = 55/315 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL------------------PEYVNGDLSALCV 42 W GH++ +A L+ + LL + Sbjct: 24 WWGTGHMVVTSVAWRQLSQQEQEQAHALLKAHPKYNDWMSSYPADVPGLSKGLYAAMAAS 83 Query: 43 -WPDQVRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH 101 W D +R H++D P +F + V I+ ++ Sbjct: 84 LWADDIRDKNNPATHPEWHYVDYPLVPPHFP-----KEPAPNPTNDVLVGIKECERVIAS 138 Query: 102 YREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG---------FTSDAGGNSIDLRWFRHK 152 T ++ E + +L H +GD+HQP+H D GGNS +R + Sbjct: 139 PTTSTQEK----GEMVSWLIHLVGDVHQPLHCASLTNDDFPAPEGDRGGNSAFVRPDKQS 194 Query: 153 S--NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVN 210 NLH VWD ++ D E + + ++ Sbjct: 195 KAINLHMVWDSQL-----GGARVADAGSSREALNKAIL--LETEHPRVAAAELQKSPSPE 247 Query: 211 KFATESINIACKWGYKGVE---------AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 ++ E +A + Y L + Y I +RV G RLA +L Sbjct: 248 SWSLEGRELAIQEAYLHGNLRYAVGKQLNAPVLPEGYTKKARAISERRVTLAGYRLADML 307 Query: 262 NNVFGASQQEDSVVA 276 + S E + Sbjct: 308 KRLLAVSTAEPERAS 322 >UniRef50_Q1N3Y8 Probable endonuclease n=1 Tax=Bermanella marisrubri RepID=Q1N3Y8_9GAMM Length = 226 Score = 173 bits (438), Expect = 6e-42, Method: Composition-based stats. Identities = 54/258 (20%), Positives = 103/258 (39%), Gaps = 32/258 (12%) Query: 8 MTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDK 67 M A L A H ++ +L VW D ++ ++ PLH+++ P Sbjct: 1 MVAAAAWPQLTPYAKHQIESILGFG-REKFVNASVWADHIKSDQRFNHLKPLHYVNLPKG 59 Query: 68 ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDI 127 + + +RDC + C+ AI +F+ S A+ L H + DI Sbjct: 60 STQYKQQRDCPE-----GQCIVQAIYDFSE------YARSGSEREQAMAVRMLIHLIADI 108 Query: 128 HQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNF 187 HQP+H G+ D GGN ++++ + +LH +WD +++ +++ LL++ + Sbjct: 109 HQPLHAGYKEDRGGNWFEVKYQDYTLSLHKLWDHQLVERFHENWQQGSTELLKDMPKA-- 166 Query: 188 TDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVM 247 K+A S + + Y+ E +S+ Y + Sbjct: 167 -----------------TLYSPEKWAEISHALVERSVYETQEN-RLVSEAYLEMADDVTH 208 Query: 248 KRVAQGGIRLAMLLNNVF 265 +++ RLAM LN ++ Sbjct: 209 RQLQLASWRLAMWLNQLW 226 >UniRef50_Q2N7X6 Endonuclease n=3 Tax=Erythrobacter RepID=Q2N7X6_ERYLH Length = 276 Score = 173 bits (437), Expect = 7e-42, Method: Composition-based stats. Identities = 63/290 (21%), Positives = 105/290 (36%), Gaps = 48/290 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRHW-Y 51 W H +T IA+ + + A++ L PE L VWPD VR + Sbjct: 8 WGFFAHTVTGDIAEANIRPDTRAAMQRLFRAEGLLGTPECELKTLQDATVWPDCVRRMRW 67 Query: 52 KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 ++ T+ H+ TP ++ ++C C+ I L+ + Sbjct: 68 RWGHTAAWHYRTTPICEP-YEPWKNCPG-----GNCILAQIDRNQRILADESLPAN---- 117 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF-RHKSNLHHVWDREIILTAAKD 170 +AL F+ HF+GD+H P+H G D GGN + + NLH +WD + A Sbjct: 118 VRLQALAFMVHFVGDVHMPLHSGDKDDRGGNDRETDYGIAPGLNLHWIWDGPLAERAITS 177 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKG--- 227 + GI +D + ES I+ + Y Sbjct: 178 ARPSLVRRYSAAERAELAGGISAD-----------------WGRESWAISRDFVYPNAFD 220 Query: 228 --------VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 + L+ + + +P+ +RV Q G+R+A LL+ F Sbjct: 221 TDAVCETDLPGETALTQEDIVAAIPVSQRRVTQAGLRIARLLDEAFAPGP 270 >UniRef50_B9XA25 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XA25_9BACT Length = 309 Score = 171 bits (432), Expect = 3e-41, Method: Composition-based stats. Identities = 56/298 (18%), Positives = 92/298 (30%), Gaps = 44/298 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN-------------GDLS-------AL 40 WS GH++ A L + V +L + + DLS Sbjct: 24 WSGAGHMVIAAEAYHELPERTRSKVDEILKAHPDYAKWVATHSKEKFADLSLSEYVFLRA 83 Query: 41 CVWPDQVRHW----YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFT 96 WPD++R + H++D P K F E + I Sbjct: 84 SKWPDEIRRAKGQGSRSYDHPHWHYVDYPLKPTKFPLEP-----GPSPKDDLLYGIAQCE 138 Query: 97 TQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH-------VGFTSDAGGNSIDLRWF 149 L + ++ L +L H +GD+HQP+H D GGN ++ Sbjct: 139 KNLCDSKASPEEK----AVYLSYLIHLVGDVHQPLHCCSLVNETYPNGDKGGNDFYVKPG 194 Query: 150 RHKSNLHHVWDREIILTAA-KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC 208 LH WD + ++ + I LL + + + + W G + Sbjct: 195 NKGIKLHSFWDGLLGTSSKPQTQIYYAIELLHDHPRKSLPELAKATTPKDWSLEGRQIAI 254 Query: 209 VNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 + IN C + L +Y + R A G RLA + + Sbjct: 255 DKAYLRADINGGCGTSEQN---ACELPSNYTKEAKAVAENRAALAGYRLADEIQMLIK 309 >UniRef50_Q5ZV70 3'-nucleotidase/nuclease n=6 Tax=Legionella RepID=Q5ZV70_LEGPH Length = 285 Score = 169 bits (428), Expect = 8e-41, Method: Composition-based stats. Identities = 65/282 (23%), Positives = 101/282 (35%), Gaps = 38/282 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQVRHWYKYKW 55 W+ GH + +IA L ++ + L + W D +R + W Sbjct: 28 WNAIGHQLVAQIAYDNLTPQSRR-MCDLYSHSKSKTSSNVNFVKSASWLDSIR-AHDVHW 85 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTE 115 LH+ID P + D + + D+ I LS + +D++ Sbjct: 86 FDALHYIDIP-------FSMDETELPVLTDINALWGINQAIAVLSSKKASIADKKL---- 134 Query: 116 ALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAA 168 +L L H +GDIHQP+H D GGN L +NLH WD + Sbjct: 135 SLRILVHLVGDIHQPLHTVTKISKKLPKGDLGGNLFQLAKNPIGNNLHQYWDNGGGILIG 194 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGV 228 +D + + N + WS AS ++ S +A YK V Sbjct: 195 QDKFFQIKNKA------RQLEKKWSCQSAS------KEKNPQQWINASHQLALTKVYK-V 241 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 A + Y + I K++ G RLA LLNN+ + Sbjct: 242 SAHQVPGKQYQLNTQNITEKQILLAGCRLAYLLNNIAEGKNK 283 >UniRef50_A2E030 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2E030_TRIVA Length = 372 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 46/285 (16%), Positives = 94/285 (32%), Gaps = 36/285 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQV-----RHWYKY 53 W H M R++ L D + +L + + + W D++ R Sbjct: 12 WWNGPHEMVARVSWNDLTDRQQKIIYKILLTWPDEQKLFTNCGSWLDEIAAKYNRGTDLI 71 Query: 54 KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNM 113 P HF+D P D + ++ + A+ + + T+ + + Sbjct: 72 SHFKPWHFVDFPL----IDGCENFEEKDTPFVYNITSALNHIISSFLD---PTTKSLWAI 124 Query: 114 TEALLFLSHFMGDIHQPMHVGFTS---------DAGGNSIDLRWFRHKSNLHHVWDREII 164 + L H + D+H P+H D G N L + NLH +WD + Sbjct: 125 NFDIRMLLHLVADVHTPVHCIDRYTPSSGTCKADHGANFFSLSLSINGKNLHSLWDSAVY 184 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWG 224 + + + L + + + + V +A S IA ++ Sbjct: 185 AYPTGSFSEEMVQKLIFEYKDKIPEDSY-----------VQNMNVTAWALHSYEIAKEYV 233 Query: 225 YKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y G++ + + +D Y P ++ R+A +++ Sbjct: 234 YNGLKLNQYVGENDAYVTRAQPQAKAQIILASKRMAYIIDQFVKK 278 >UniRef50_UPI000150A357 hypothetical protein TTHERM_00515230 n=1 Tax=Tetrahymena thermophila RepID=UPI000150A357 Length = 389 Score = 168 bits (424), Expect = 3e-40, Method: Composition-based stats. Identities = 50/291 (17%), Positives = 97/291 (33%), Gaps = 28/291 (9%) Query: 3 KEGHVMTCRIAQGLL---NDEAAHAVKMLLPEYVNGD------LSALCVWPDQVRHWYK- 52 H++ IA+ L + E + ++ +W D +++WYK Sbjct: 26 DLPHMLILGIAKETLIEKDPEIIQIAEKYFDQFEEPHQKGQVQFEEHSIWSDDIKYWYKS 85 Query: 53 -YKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K+ H+ID N+ + ++ + A L + Sbjct: 86 SVKYWDTWHYIDQIYNPSNYPID---VNKQKDSNSNAQVAFNQIKETLKNKNLNGKITVM 142 Query: 112 NMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRW-FRHKSNLHHVWDREI 163 L L H +GDIHQP+H D GGN ++ K+NLH +D Sbjct: 143 KHIF-LKHLVHLVGDIHQPLHTVSFYSYQFQNGDLGGNKQMVQLSDNRKNNLHFYFDSGA 201 Query: 164 ILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKW 223 +D + N D + + + +++ ES I+ + Sbjct: 202 FYYTFEDRIHRPFNESFIDYFEEEIARLIKLYPREELKINDEDIQFDQWVKESYMISIEQ 261 Query: 224 GYKGVEAG-----ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 Y ++ ++D+ + K++ + G RLA +L + + Sbjct: 262 IYSQIDLTGNQKINKITDENHRKNQELCQKQIVKAGYRLANILVDFLKDEK 312 >UniRef50_UPI00006CE90A hypothetical protein TTHERM_00559790 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CE90A Length = 482 Score = 167 bits (422), Expect = 4e-40, Method: Composition-based stats. Identities = 59/304 (19%), Positives = 96/304 (31%), Gaps = 39/304 (12%) Query: 5 GHVMTCRIAQGLLNDEAAHAVK------MLLPEYVNGDLS-----ALCVWPDQVR-HWYK 52 H++ IA+ L K +S VW D ++ + Sbjct: 24 PHMLILGIAKRELMKNDQEIYKITAKYLDTFSASGIETISTTSYEENAVWGDDIKTYGDA 83 Query: 53 YKWTSPLHFI-DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 K HFI + N +D A N + + Sbjct: 84 QKAMGMWHFIGNKDSNPENLTLVKD----PMADSENALNAYDNIVKTFKNKSFIGKITEF 139 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTS-------------DAGGNSIDLRWFR-----HKS 153 + L L H +GDIH P H G D GGN ++++ + Sbjct: 140 KI-MMLKMLVHLVGDIHMPHHTGSYYNSTIVGPNKEIWGDRGGNRQKIKFYTSTGKKEST 198 Query: 154 NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFA 213 ++H +D K + +N + D I + N N +A Sbjct: 199 DIHFYFDSSCFYYNWKSRLQRPLNDTFKAYFEAELDRIMTQYPKETLNINNA-QTFNDWA 257 Query: 214 TESINIACKWGYKGVEAGETL--SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 ES NIA Y + + D ++NS ++ KR+ G RLA L N+F A + + Sbjct: 258 EESWNIALTEVYPFLLKNNEIRFGDAFYNSSFDMIQKRIVIAGYRLAYTLQNMFAAEKGK 317 Query: 272 DSVV 275 + Sbjct: 318 IDLS 321 >UniRef50_O68530 Endonuclease S1 homolog n=1 Tax=Mesorhizobium loti RepID=O68530_RHILO Length = 309 Score = 166 bits (419), Expect = 9e-40, Method: Composition-based stats. Identities = 72/300 (24%), Positives = 113/300 (37%), Gaps = 43/300 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD------LSALCVWPDQVRHWYKYK 54 W +EGH IAQ L A+ V+ LL ++ ++++ W D R +K Sbjct: 22 WGQEGHAAVAEIAQHRLTSSASDVVQRLLRAHLGLTGQQVVSMASIASWADDYR-ADGHK 80 Query: 55 WTSPLHFIDTPDKA--------CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 TS HF+D P + ++D RDC D C+ A+ LS + Sbjct: 81 DTSNWHFVDIPLASLPGGSSATTDYDAIRDCAD-DATYGSCLLKALPAQEAILSDATKDD 139 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHV-----GFTSDAGGNSIDLRWF-----------R 150 R +AL F+ H GD+ QP+H G D GGN++ + + R Sbjct: 140 ESR----WKALAFVIHLTGDLAQPLHCVQRVDGSQKDQGGNTLTVTFNVTRPAPDNSTFR 195 Query: 151 HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWR-ECGNVFSCV 209 + H VWD ++I D+ E+ + D + D W EC Sbjct: 196 DFTTFHSVWDTDLITFKYYDWG-LAAAEAEKLLPTLAADLLADDTPEKWLAECHRQAEAA 254 Query: 210 NKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQ 269 + + G+ L YF P+V +++A GG+ LA LN + Sbjct: 255 YQALPAGTPLKSDIGHP-----VILDQAYFEKFHPVVTQQLALGGLHLAAELNEALKGGK 309 >UniRef50_A9UZI8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UZI8_MONBE Length = 179 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 70/156 (44%), Positives = 92/156 (58%), Gaps = 4/156 (2%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH T IA+ LL ++AA V +L + ++ W D VR + W++PLH Sbjct: 26 WGPIGHQTTAAIAETLLTEKAATTVAQILDNA---SMVSVSTWADDVRSTSAWAWSAPLH 82 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 FIDTPD+ C+FDY RDC + G D CVAGAI N+T QL + EAL F+ Sbjct: 83 FIDTPDRVCSFDYSRDCQN-DGRPDFCVAGAIVNYTRQLELAVAQGRLQDETTQEALKFV 141 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLH 156 HF+GDIHQP+HV FTSD GGN +++ +F NLH Sbjct: 142 IHFLGDIHQPLHVSFTSDEGGNLVNVTFFGEPENLH 177 >UniRef50_Q4PFZ0 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PFZ0_USTMA Length = 397 Score = 163 bits (413), Expect = 5e-39, Method: Composition-based stats. Identities = 63/374 (16%), Positives = 118/374 (31%), Gaps = 109/374 (29%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----------------GDLSALCVWP 44 W GH + IAQ L+ + +LP Y L+ L WP Sbjct: 35 WGIAGHQIVATIAQTQLHPLVREQLCTILPNYTRYPSHWPTSEDSKPRTHCHLAVLAGWP 94 Query: 45 DQVRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE 104 D +R +Y W+ LH+++ + + + V ++ N+T+++ Sbjct: 95 DTIRS--RYPWSGQLHYVN--PVDDHPPSQCLYGETGWTSPNNVLTSMVNYTSRVV---- 146 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREII 164 ++ + AL F+ H GD HQP+H+ + GGN + + + K+ LH VWD +I Sbjct: 147 --TETGWQRDMALRFMVHLFGDAHQPLHLTGRA-RGGNDVWVHFEGRKARLHTVWDTLLI 203 Query: 165 LTAAKDYYAKDINLLEEDIEGNFT--------------------------DGIWSDDLAS 198 ++ L IE D W + + Sbjct: 204 DKQIRELSNYTTRLPSGRIESALVGARYDPLIRFILKEGLGQPASRGQEGDAWWKQESSG 263 Query: 199 WRECGNV--------------------------------FSCVNKFATESINIACKWGYK 226 W C C ++ ++ C + + Sbjct: 264 WPACQGQRSEIGALTQEYEGQLALSSISEDPHRVDNTVLPICPYEWTRPMHSLVCTYAFA 323 Query: 227 GVEAGETLS----------------------DDYF--NSRLPIVMKRVAQGGIRLAMLLN 262 + +Y R ++ K++A+ G+RLA +LN Sbjct: 324 APVPAWEPAPPPGQGEPEPSPTPVPEPELDVPEYVGRIERDKVIHKQLAKAGLRLAAVLN 383 Query: 263 NVFGASQQEDSVVA 276 + ++ + A Sbjct: 384 TLLLPAEVDSLRSA 397 >UniRef50_O65425 Putative bifunctional nuclease n=1 Tax=Arabidopsis thaliana RepID=O65425_ARATH Length = 454 Score = 162 bits (409), Expect = 1e-38, Method: Composition-based stats. Identities = 73/143 (51%), Positives = 98/143 (68%), Gaps = 2/143 (1%) Query: 16 LLNDEAAHAVKMLLPEY-VNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACNFDYE 74 D+ AVK LLPE G L+ C WPD+++ +++WTS LH+++TP+ CN++Y Sbjct: 4 FFEDDTIAAVKKLLPESVDGGGLADFCSWPDEIKKLSQWQWTSTLHYVNTPEYRCNYEYC 63 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDR-RYNMTEALLFLSHFMGDIHQPMHV 133 RDCHD H KD CV GAI N+T QL E + + YN+TEALLFLSH+MGD+HQP+H Sbjct: 64 RDCHDTHKHKDWCVTGAIFNYTNQLMSASENSQNIVHYNLTEALLFLSHYMGDVHQPLHT 123 Query: 134 GFTSDAGGNSIDLRWFRHKSNLH 156 GF D GGN+I + W+ +KSNLH Sbjct: 124 GFLGDLGGNTIIVNWYHNKSNLH 146 >UniRef50_B3LAP6 Putative uncharacterized protein n=1 Tax=Plasmodium knowlesi strain H RepID=B3LAP6_PLAKH Length = 331 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. Identities = 50/300 (16%), Positives = 98/300 (32%), Gaps = 30/300 (10%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN----GDLSALCVWPDQV--------- 47 WS EGH++ IA L D+ ++ + Y D VW D + Sbjct: 24 WSDEGHLLISAIAYEGLTDDEKFVLQTIFKNYKEDNDFNDPVTAAVWADHIKPIDYHYTT 83 Query: 48 --RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG 105 R + + H+ P N ++ K +++ T L + + Sbjct: 84 KVRRIGGLELMNKWHYTSNPYNPTNIPLNE-YRKKYYQKTDNALSVLKSIFTSLKNMNKQ 142 Query: 106 -TSDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHH 157 ++ L + H GDIH+P+HV D G I++++ + LH+ Sbjct: 143 ENHGTFFSYNFNLRYFIHIFGDIHEPLHVVEFFNKHFPEGDNGATLINIKYNNNVEKLHY 202 Query: 158 VWD------REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNK 211 + D T+ ++ N L + + +DL+ + + Sbjct: 203 LCDCVFHTRSRRWPTSGMKEMLEEGNALMKMYPPEYFGDRLKNDLSDLEYLDFIVNDSYT 262 Query: 212 FATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 A I + L + + ++ +++A GG RL L + + Sbjct: 263 KAVNDIYSNFPHDTLNSKTPYVLDNSAVDKLKKMLNEQIALGGYRLRRYLKIMIENVPDD 322 >UniRef50_D0NJT7 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NJT7_PHYIN Length = 343 Score = 160 bits (405), Expect = 4e-38, Method: Composition-based stats. Identities = 67/312 (21%), Positives = 112/312 (35%), Gaps = 48/312 (15%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY-----VNGDLSALCVWPDQVRHW----- 50 W GH++ +A+ L+++ ++ +L ++ G+++ VW D ++ Sbjct: 27 WWDNGHMLVGEVAKQLMSEADVVTIESVLSKWNEDFPNTGEITTSAVWMDLIKCTSVSSY 86 Query: 51 ------YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYRE 104 S H+ID P +E D +D A L Sbjct: 87 CQSPLAPSITSMSDWHYIDLPVNINGDKWEYKDADLSLFEDTMGGDAASVIEGALRS--L 144 Query: 105 GTSDRRYNMTEALLFLSHFMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHH 157 T+ + + H GD+HQP+H D GGNS SNLH Sbjct: 145 KTTKSSWAANLFIRNFIHIFGDLHQPLHTVAGVSEAFTEGDGGGNSEYFASPCAFSNLHA 204 Query: 158 VWDREIILTAAKDYYAKDINLLEEDIEGNFTDGI-----WSDDLASWRECGNVFSCVNKF 212 VWD L + ++ A +I+ + ++ N TD I SD L + ++ + Sbjct: 205 VWDAAGGLYSLNNW-ALNIDDFKSTLQSNATDLIALLLNISDTLDFSQYENTTYNELYTA 263 Query: 213 AT----------ESINIACKWGYKGVEAGET-------LSDDYFNSRLPIVMKRVAQGGI 255 E+ + A Y G++ T S Y I KR+A GG Sbjct: 264 LVTNSALREVILETYSYADTVVYSGLDLNATSSGKYPCPSSSYLTLAGEISQKRIAIGGS 323 Query: 256 RLAMLLNNVFGA 267 RLA++L + Sbjct: 324 RLAIILKHFAAQ 335 >UniRef50_Q8ILX4 p1/s1 nuclease, putative n=4 Tax=Plasmodium RepID=Q8ILX4_PLAF7 Length = 320 Score = 160 bits (405), Expect = 4e-38, Method: Composition-based stats. Identities = 49/303 (16%), Positives = 96/303 (31%), Gaps = 34/303 (11%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDL---SALCVWPDQV---------- 47 WS E H++ IA LND + + + +W D++ Sbjct: 19 WSDEPHMLISYIAYINLNDGEKEILNRIFQNGNDAIFDNPITASIWADKIKPNNHKRTFH 78 Query: 48 ----RHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 R + H++ Y H + G +++ L R Sbjct: 79 SSNFRRNELLDIFNEWHYVQLNYNPMKI-YIAPYHLRAHKGKHNAMGILKHIYRILIEVR 137 Query: 104 EG-TSDRRYNMTEALLFLSHFMGDIHQPMHVG-------FTSDAGGNSIDLRWFRHKSNL 155 + Y+ L F H D+HQP+H D GG I + + + L Sbjct: 138 QKMGHGTYYSYNFYLRFFIHIFSDLHQPLHAINFFNSNYPNGDRGGTDISVNYKGSINKL 197 Query: 156 HHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATE 215 H++ D I T K + ++ +E D + + ++ A E Sbjct: 198 HYLCDN-IFKTRKKQWPNINMTNIERDARYLMSTYPPESFGNKLFLPHDKIKYIDDIAHE 256 Query: 216 SINIACKWGYKGVEAG-------ETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 S +IA + Y +++ + + ++ ++ G RL+ L ++ Sbjct: 257 SHDIAVQNIYSFFPLTDLKRSEQYSINQHFVINTKKLLNSQMVLAGYRLSAYLKDIIANI 316 Query: 269 QQE 271 + Sbjct: 317 PPD 319 >UniRef50_A2FAR0 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FAR0_TRIVA Length = 326 Score = 160 bits (404), Expect = 5e-38, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 92/280 (32%), Gaps = 27/280 (9%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--LSALCVWPDQVRHWYKYKWTSP 58 W E H R+A+ +L+ + +L + + W D ++ P Sbjct: 12 WWGEPHYFIARLAESMLSASEVKYLNRVLATWESEKAVFHDTGNWHDDLK-PIGMPLMVP 70 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 HF + P N++ V ++ LS + ++ + + Sbjct: 71 WHFRNQPVVDPNYNL------VTYPVTYNVTQVNKDC---LSAIYDTSTTSMWILGFCFR 121 Query: 119 FLSHFMGDIHQPMHVG-------FTSDAGGNS--IDLRWFRHKSNLHHVWDREIILTAAK 169 L+HF+ D H P+H D G LH VWD + Sbjct: 122 SLAHFVADAHCPVHASCYFSADYPNGDGGATKEKFVCPVDEVCDKLHFVWDSGSLNFQTW 181 Query: 170 DYYAKDINLLEEDIEGNFTDGIWSDDLAS-WRECGNVFSCVNKFATESINIACKWGYKGV 228 + E ++ +W++ +++ +++ ++A ++ Y Sbjct: 182 PIPESLVKEAEYNL-----SHLWTNYPPEKHYSSTYNSIDPDQWQSDAYDVAKEYVYGLY 236 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + G ++ +YFN P K ++ RL +L F Sbjct: 237 QFGHNVTGEYFNKTQPPAAKLISVAAYRLGKVLQTFFHKR 276 >UniRef50_A0Z194 Endonuclease S1 n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z194_9GAMM Length = 275 Score = 159 bits (403), Expect = 8e-38, Method: Composition-based stats. Identities = 63/282 (22%), Positives = 108/282 (38%), Gaps = 48/282 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W +GH C A + A+ LL L LC W D+++ + T H Sbjct: 30 WWDDGHQQVCEQAVAQVQPATLAAIADLLDAP----LGELCSWADEIKG--QRPETRQWH 83 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P + + + A+ +L H EALL++ Sbjct: 84 YLNAPPDT------LSIGNAPRPEGGDIIAALNEQIHRLKHAPTN------QRREALLWV 131 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWF----------RHKSNLHHVWDREIILTAAKD 170 H +GD+HQP+H+G+ SD GGN+ L R + ++H VWD I+ + Sbjct: 132 GHLIGDLHQPLHLGYASDLGGNTYRLELPEELALQLNEKRERVSMHAVWDGLILRYQDQP 191 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC--KWGYKGV 228 A +E + N I + +A E++++ K Y+ Sbjct: 192 SVAATATPIERPLLLNPEVEIIA------------------WADETLSVLNDAKVHYRHG 233 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 +TL+ Y S V ++ + RLA LL+ F S++ Sbjct: 234 TRLQTLTSQYLISNRSAVDLQIRRAATRLAALLDWAFSQSKR 275 >UniRef50_Q560K3 Putative uncharacterized protein n=2 Tax=Filobasidiella neoformans RepID=Q560K3_CRYNE Length = 393 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 66/223 (29%), Positives = 90/223 (40%), Gaps = 33/223 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH M IAQ L + +LPE N L+ + W D VR+ +Y+ T+P+H Sbjct: 20 WGAAGHEMVATIAQIHLFPSTRAKLCSILPEEANCHLAPVAAWADIVRN--RYRGTAPMH 77 Query: 61 FI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEA 116 +I D P C F +D+ V AIQNFT + + G Sbjct: 78 YINARNDHPQDHCEFGQH-----GWQNEDVNVITAIQNFTRLIMDGKGGKDVD-----IP 127 Query: 117 LLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 L FL HF+GD HQP+H+ D GGN + + NLH VWD II ++ Sbjct: 128 LRFLVHFIGDSHQPLHLA-GRDKGGNGAKFLFEGRERNLHSVWDSGIITKNIRELSNYTS 186 Query: 177 NLLEEDIEG----------------NFTDGIWSDDLASWRECG 203 L + IE W D++ SW C Sbjct: 187 PLPSKHIERCLPGAIFDPYVRWIVWEGIRLWWRDEVDSWISCP 229 Score = 52.5 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 27/72 (37%), Gaps = 9/72 (12%) Query: 206 FSCVNKFATESINIACKWGYKGVEAGETL-------SDDYFNS--RLPIVMKRVAQGGIR 256 SC + + + C + G+ +D+Y R I+ K +A G+R Sbjct: 310 PSCPYHWISPIHQLNCDIVWPSKYTGQPNEPLIELDTDEYLGEIGRQKILEKMIAMAGLR 369 Query: 257 LAMLLNNVFGAS 268 LA +LN Sbjct: 370 LAKVLNEALAEE 381 >UniRef50_B6ABV1 Putative uncharacterized protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6ABV1_9CRYT Length = 433 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 111/312 (35%), Gaps = 49/312 (15%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVW--------PDQVRHWYKY 53 +GH A L H +K L+ D+ + W P + ++Y Sbjct: 22 DADGHSAIAMTAMSGLKGNTLHQLKRLM---NGKDIVDISAWGERVSQKHPSTMPFHFQY 78 Query: 54 KWTSPLHFI--------------DTPDKACNFDYERDCHDQ-----HGVKDMCVAGAIQN 94 + + LHF D + ++ C++ C+ I++ Sbjct: 79 QDMNELHFDKFLPESAPQMFGLGDGTRSFSHTYSDKYCNEVGASAECKETGHCLVPMIKH 138 Query: 95 FTTQLSHYREG----TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID----L 146 ++L + ++++ FL + +GD+HQP+H GFT G + Sbjct: 139 LYSRLIGLDRNKISYPEGIQLTDSDSVKFLVNLIGDLHQPLHFGFTESNAGRDFHGHLII 198 Query: 147 RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF 206 +L +W++ +I + I+ + W+E G Sbjct: 199 NGTEETISLFEIWEKGLIQKLKIEKPQFWYGGWTHVFA---IRDIFDKETILWKERG--I 253 Query: 207 SCVNKFATESINIACKWGYKGVEAGETLSDDYF------NSRLPIVMKRVAQGGIRLAML 260 ++ +A ESI I C + E L++++ + I+ R+ G RL+++ Sbjct: 254 DIIDDWARESIQIMCSALFIHPLNQEKLTNNFNIDPLLEFAWFEILRSRLLIAGARLSIV 313 Query: 261 LNNVFGASQQED 272 LN++ + ++ Sbjct: 314 LNDILKYREGKE 325 >UniRef50_B8P2Q4 Predicted protein n=2 Tax=Postia placenta Mad-698-R RepID=B8P2Q4_POSPM Length = 753 Score = 156 bits (394), Expect = 7e-37, Method: Composition-based stats. Identities = 60/240 (25%), Positives = 95/240 (39%), Gaps = 31/240 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPE-------------YVNGDLSALCVWPDQV 47 W GH + IAQ L+ + +L Y L+ + W D+V Sbjct: 323 WGAAGHEIVATIAQIHLDPSVLPVLCDILYPPSSSSHKASTSSAYPPCHLAPIAAWADRV 382 Query: 48 RHWYKYKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 R Y+WT+PLH++ D P +C F G ++ V A+ N T Q++ Sbjct: 383 RGSPAYRWTAPLHYVGAVDDAPADSCAFPGPNGWA---GRHNINVLAAVSNKTGQVA-AF 438 Query: 104 EGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREI 163 + EAL +L HFMGD+H P+H+ + GGN + + SNLH VWD + Sbjct: 439 LSGEAGLHEGEEALKYLVHFMGDMHMPLHLT-GKERGGNGAKVTFDGRVSNLHSVWDNLL 497 Query: 164 ILTAAKDYYAKDINLLEED--IEGNFTDGIWSDDLASWRECG-------NVFSCVNKFAT 214 I A + L + +E + I+ + G F+ V ++ Sbjct: 498 IAQALRTVPPNYTWPLPDMRGVEAHLRGAIYDPYIRRIIYEGFGTDAVAGRFTDVEEWLD 557 >UniRef50_A3FPP7 S1/P1nuclease, putative n=2 Tax=Cryptosporidium RepID=A3FPP7_CRYPV Length = 416 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 55/296 (18%), Positives = 107/296 (36%), Gaps = 35/296 (11%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 EGH L + + ++ L+ D+ + W + R K+ T P HF Sbjct: 24 DAEGHSAIGMTTISGLQNNFSQKLRRLM---NGKDIVDISGWGE--RVSKKHPSTLPFHF 78 Query: 62 IDTP--DKACNFDYERDCHDQ--------HGVKDMCVAGAIQNFTTQLSHYREG-----T 106 D N + D ++ C+ I++ +L Sbjct: 79 QGQSKGDYFKNGELGNDFKEKFILKSDSNCKHTGHCLVPMIKHLYYRLIGDNSKFKINYP 138 Query: 107 SDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID----LRWFRHKSNLHHVWDRE 162 + ++++ FL + +GD+HQPMH GF D G I + + +L +W+ Sbjct: 139 EGIQLTDSDSIKFLINLIGDLHQPMHFGFIEDGLGREIKGMMSINGTNERLSLFEIWESG 198 Query: 163 IILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK 222 I + + I+ +L W+E G +N +A E+ I Sbjct: 199 IARKLKTEKPQFWFGGWTHILA---IRDIFDKELLLWKERG--IEMINDWAKENFEIVTN 253 Query: 223 WGYKGVEAGETLSDDYF------NSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 Y + + + D++ + L I R+ G RL+++LN++ + ++ Sbjct: 254 EIYFHPISKQPIIDNFNVDVTLEFAWLEIFRSRILIAGARLSIILNDILKLREGKE 309 >UniRef50_D2QW83 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QW83_9PLAN Length = 338 Score = 153 bits (387), Expect = 5e-36, Method: Composition-based stats. Identities = 61/318 (19%), Positives = 105/318 (33%), Gaps = 58/318 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----------------------L 37 W+ +GH + IA L E A+ +L ++ Sbjct: 28 WNAKGHRLVAAIAYRSLTPEDRDALIEILKQHPRFAADFERQMPDVVKSGTKDQQQEWLF 87 Query: 38 SALCVWPDQVRH----WYKYKWTSPLHFIDTPDKACNFDYER----------DCHDQHGV 83 VWPD +R H+I+ P + + V Sbjct: 88 GHAAVWPDYIRGFKGEESDKYHRPTWHYINWPHYLSDAEAAELAMPPMVNRHLDPAMTPV 147 Query: 84 KDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH--------VGF 135 + + +I +Q + +R + +L H MGD+HQPMH + Sbjct: 148 LEQNLMQSIARLRSQFVDSKYSAEER----AVMICWLLHTMGDLHQPMHGASLFCKPLFV 203 Query: 136 TSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAK--DINLLEEDIEGNFTDGIWS 193 D GGNSI R NLH VWD + + + + L ++ T S Sbjct: 204 QGDRGGNSILTRQSG---NLHAVWDNALGNDDSFREVNRHATLLLATPEMTKIGTASQAS 260 Query: 194 DDLASWRECGNVFSCVNKF--ATESINIACKWGYKGVE--AGETLSDDYFNSRLPIVMKR 249 + +W E + + + + A S K V+ L++DY + + +R Sbjct: 261 IEQKTWLEESHALAVEHVYDQAVLSHVRVQMLTAKNVDDFPPLMLNEDYLRNSSKVSERR 320 Query: 250 VAQGGIRLAMLLNNVFGA 267 + G R+A +L + Sbjct: 321 SVEAGYRIAAVLRQLLHP 338 >UniRef50_B2JAU7 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2JAU7_NOSP7 Length = 332 Score = 153 bits (385), Expect = 9e-36, Method: Composition-based stats. Identities = 53/305 (17%), Positives = 94/305 (30%), Gaps = 50/305 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKM-------------------------LLPEYVNG 35 W+K GH+++ IA L + + PE N Sbjct: 41 WNKSGHMVSGAIAYSELKQSNQQNLDKVVAILKEHPEYSKFEQQWNSLNQSNISPEDKNL 100 Query: 36 DLSALCV-WPDQVRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQN 94 L W D+ R ++ H+I+ P + + + + + A Q Sbjct: 101 YLFMWAAKWADEARDNPEFNH-PTWHYINFPYQPGR---ASNSIPREIPDEENIIFAFQK 156 Query: 95 FTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG---------FTSDAGGNS-- 143 + + A+ +L H +GD+HQP+H D GG Sbjct: 157 NLDVVKSNASNSDK-----AVAICWLFHLIGDVHQPLHTTKLITNQYPQPEGDRGGTRFY 211 Query: 144 IDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECG 203 I ++ +LH WD I+ + L + N + +W Sbjct: 212 IRVKPNSQTISLHKFWDDLILGSERFQAVRNAATSLRSSYQRNKLPELRETKFNNWA--- 268 Query: 204 NVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNN 263 ++ G G+ L +Y + I +R++ G RLA +LN Sbjct: 269 -KLESFRIAKQDAYLNGKLSGSSDKNDGKLLPANYAATAKQIAQRRMSLAGYRLADVLNQ 327 Query: 264 VFGAS 268 + G Sbjct: 328 LLGQR 332 >UniRef50_Q47K45 Putative uncharacterized protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47K45_DECAR Length = 301 Score = 151 bits (380), Expect = 3e-35, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 104/312 (33%), Gaps = 70/312 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD--------------LSALCVWPDQ 46 W+ GH + IA L+ A+ L + + + + WPD Sbjct: 20 WNAAGHRLVAVIAWQQLSPATRDAISAALAHHPDHERWVEKARSREGIAVFAEASTWPDD 79 Query: 47 VRHWYKYKW------------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCV 88 +R+ + H++D V+D + Sbjct: 80 IRNDPRLYDEDREPPTPAVPGLPETARHKRWHYVDLD-------------ATGKVRDGEL 126 Query: 89 AGAIQNFTTQLS-HYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDL- 146 I+ + L + + + AL +L H + DIHQP+HVG D GGN +++ Sbjct: 127 DRQIERLSQLLQAKGSSPGTRKSEQIAYALPWLLHLVADIHQPLHVGQHGDEGGNKVEIE 186 Query: 147 RWFRHK---SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECG 203 F + S+LH WD + N LE++ Sbjct: 187 NPFNKRLPFSSLHLYWDDLPGPPWLRG------NRLEKNAGRLLDS-----------YPK 229 Query: 204 NVFSCVNKFATESINIACKWGYKGVEAG--ETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 V V + ES + Y V +S+D+ ++ I +R+ + G RL LL Sbjct: 230 PVQGNVALWRDESHQLLAA-AYPKVSGSLLPIISEDFQDNARQIANRRIVEAGYRLGHLL 288 Query: 262 NNVFGASQQEDS 273 ++F ++ Sbjct: 289 ESIFRERVSRET 300 >UniRef50_C9SGH7 Nuclease PA3 n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SGH7_VERA1 Length = 303 Score = 150 bits (379), Expect = 5e-35, Method: Composition-based stats. Identities = 47/282 (16%), Positives = 85/282 (30%), Gaps = 23/282 (8%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ + H A+ L+ A + +L L + W D R + + T+ H Sbjct: 21 WNTDIHQQIGFAAEKFLSPAAKAILSEILEPESGASLGRIGAWADAHRGTPEGRHTTTWH 80 Query: 61 FI---DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 +I D P CN Y RDC C+ A+ N T L D + Sbjct: 81 WINPADQPPSFCNVHYNRDC-----TSGGCIVSALANETQILKSCIRSVKDASLSAAPTP 135 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDIN 177 + V D + +S + + I Sbjct: 136 RAPTPPT--------VFPVVDREEEKF-VYLTPARSGTAPL--STCSAANVTGFPNTTIQ 184 Query: 178 LLEEDIEGNFTDGIWSDDLASWREC---GNVFSCVNKFATESINIACKWGYKGVEAGETL 234 D+ + W C +C ++A ++ C + + L Sbjct: 185 PFFSDMVDRIRADTYFVPTRDWLSCTDPSTPLACPLEWARDANQWNCDYAFSQNTNASDL 244 Query: 235 -SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVV 275 + Y PI ++A+ +R+A N + + ++ VV Sbjct: 245 RTSGYAEGAWPIAELQIAKAVLRIATWFNKLADCNFKDREVV 286 >UniRef50_A6C3P1 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3P1_9PLAN Length = 330 Score = 150 bits (378), Expect = 5e-35, Method: Composition-based stats. Identities = 61/312 (19%), Positives = 98/312 (31%), Gaps = 58/312 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----------------------L 37 W+ GH + IA L E A+ LL ++ + Sbjct: 24 WNYAGHRVIASIAWDQLTPETQAAMIALLKQHPRFEQDFQSRMPEVILKASPAVQDRWLF 83 Query: 38 SALCVWPDQVRH----WYKYKWTSPLHFIDTPDKACN-----------FDYERDCHDQHG 82 WPD R + H+I+ P + + Sbjct: 84 MRAATWPDIARSFKEADREKYHHGTWHYINQPIYLDTASELSLSSKLPVNTAKSIRQGDD 143 Query: 83 VKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH--------VG 134 + A++ Q+ +D+ AL ++ H GD HQP+H Sbjct: 144 PLQFNILQALEYNVAQMKDPAVSEADK----ALALCWIMHLTGDSHQPLHSSALFSKGSF 199 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 D GGNSI + KSNLH WD + + L D + Sbjct: 200 PEGDRGGNSIRI----GKSNLHAQWDGLLGNSFKDSEIVSQAVGLARDPALKQLGEQATK 255 Query: 195 DL--ASWRECGNVFSCVNKFATESINIACKWGYKGVE--AGETLSDDYFNSRLPIVMKRV 250 +L A W + + + + + A + E + L Y+ + I +KR Sbjct: 256 NLNYADWIDESHALAKSAGYTQLILAAAKQNDSPQNEFLKLKDLPAAYYRTAGAIAVKRA 315 Query: 251 AQGGIRLAMLLN 262 AQ G RLA ++N Sbjct: 316 AQSGWRLAAVIN 327 >UniRef50_C5LKE6 Putative uncharacterized protein n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5LKE6_9ALVE Length = 342 Score = 150 bits (378), Expect = 6e-35, Method: Composition-based stats. Identities = 65/291 (22%), Positives = 122/291 (41%), Gaps = 41/291 (14%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 + H + +A L D+ + ++L LS W ++ + W + L Sbjct: 17 GSDFHAVVVELADLRLADKTRQELSIMLGNDY--RLSTTANWAARL----NFPWLADL-- 68 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 + CNF Y RDC + C+AG+I N+T ++ T +R +EA+ FL Sbjct: 69 STAYNDHCNFSYARDCTN----NGRCLAGSIWNYTNRMIDPYLSTKER----SEAVKFLV 120 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWF-RHKSN--LHHVWDREIILTAA-----KDYYA 173 H + D H P+ G +SD GG I++ SN L W +I+ Y Sbjct: 121 HLVADAHLPLSAGRSSDQGGKKINVHINFADFSNVDLSKAWREKILDEMQGALYPGKYVQ 180 Query: 174 KDINLLEEDIE---------GNFTDGIWSDDLASWRECGNV---FSCVNKFATESINIAC 221 +D N ++ G D ++ + SW +C++ E+ ++AC Sbjct: 181 QDSNSSSHRMKFWRVTSNSIGADLDQKYAGMVPSWLAECTQHGINACIDMILNEAADLAC 240 Query: 222 KWGYKGVE-----AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 + Y+ ++ + LS +Y+ SR+ ++ +++A+ RL +++ F Sbjct: 241 RIAYRNMDGRDIQNNDDLSREYYTSRIGMLREQLAKAATRLGWIMDEAFKN 291 >UniRef50_B6KFB6 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KFB6_TOXGO Length = 439 Score = 149 bits (375), Expect = 1e-34, Method: Composition-based stats. Identities = 57/324 (17%), Positives = 107/324 (33%), Gaps = 66/324 (20%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDT 64 H L+ A A+K LL DL+ + W R KY T+ LHF+ Sbjct: 32 AHEAVSMTTLSGLSTSANQALKKLL---NGKDLADVAGWAH--RVSDKYPDTARLHFMSQ 86 Query: 65 PDKACNFDYERDC---HDQHGVKDMCVAGAIQNFTTQLSHYREG---------------- 105 P D VK C+ A+ F L + Sbjct: 87 PTCPSKPLRTDDIILDKSFCEVKGNCLLEALTYFFFHLVDPDQNKVEQTNPDVITTTNFV 146 Query: 106 -TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHK----SNLHHVWD 160 D + +A+ ++ + +GD+HQP+H+G D G +++ + + L++ + Sbjct: 147 FPHDIKTTDADAVKYIINLVGDMHQPLHMGSADDDYGRRAVVQYSDGEQMRLTTLYNFLE 206 Query: 161 REIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIA 220 ++ K + N G + + + N +++A E+ + Sbjct: 207 AGLVDKTVKQRQYFWFSGWTHV---NSVKGAYDSEKS--LFATNKEKMFSEWAKENRAVL 261 Query: 221 CKWGYKGVEA------------GETLSDDYFNSRLP--------------------IVMK 248 C Y V G D+Y + L ++ K Sbjct: 262 CNEVYPHVRKTGKDARAAANALGSDAVDEYAKAVLDGSSDVPLFEIDAAAEFALFQVLKK 321 Query: 249 RVAQGGIRLAMLLNNVFGASQQED 272 R+ G R+A+++N + + +D Sbjct: 322 RILLAGARVAIVMNYILQVRESKD 345 >UniRef50_B6KF36 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KF36_TOXGO Length = 397 Score = 144 bits (363), Expect = 3e-33, Method: Composition-based stats. Identities = 54/366 (14%), Positives = 93/366 (25%), Gaps = 96/366 (26%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQV-------- 47 W H++ IA+ ++ A V +L + + VW D + Sbjct: 25 WHSGPHMIVAAIARSEMSALAQIKVDYILGLWRGQYPDHATMERASVWLDDINGKGPPYE 84 Query: 48 ---RHWYKYKWTSPLHFIDTPDKA------------------------------------ 68 R + K +H ++ P Sbjct: 85 KPSRRFDFLKIFQFMHGVNIPYNPEGIQLQGLDALLPLYERSAEFLLDMAWDGLKATTPT 144 Query: 69 --------CNFDYERDCHDQHGVKDMCVAGAIQNF------------------TTQLSHY 102 C+ + V A NF ++Q+S Sbjct: 145 TEKLEDPFCSVPPPVSSFSLASYSEGTVNAANGNFLEVSHPDEYRRNTGVSARSSQVSTD 204 Query: 103 REGTSDRRYNMTEALLFLSHFMGDIHQPMH-------VGFTSDAGGNSID-LRWFRHKSN 154 E ++ L + H + DIHQP+H D G I + +N Sbjct: 205 AESPVGTVLSLNFYLRMVIHLVADIHQPLHSLLAFSPAFPHGDRFGTKISMVLPNGEDTN 264 Query: 155 LHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFAT 214 LH WD + + D + EE D L S + + A Sbjct: 265 LHAFWDGAGSVYTKRRGEFTDEEIAEEA--RRIKLEFPKDSLESHLKPELLAPNFRNMAE 322 Query: 215 ESINIACKWGYKG--------VEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 ES + Y+ + + Y +++A G RL L + Sbjct: 323 ESHRLGAALAYREFNFRTFRPADLPYVPTHTYLADVRLACRRQIAIAGYRLGYALEELSA 382 Query: 267 ASQQED 272 + Sbjct: 383 YLPVPE 388 >UniRef50_C7RIT3 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RIT3_9PROT Length = 320 Score = 144 bits (363), Expect = 3e-33, Method: Composition-based stats. Identities = 60/314 (19%), Positives = 95/314 (30%), Gaps = 73/314 (23%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD---------------LSALCVWPDQVRH 49 GH ++ IA ++ AV LL ++ + + WPD +R Sbjct: 33 GHRISAMIAWESMDAGTKSAVGQLLRQHPDYERWQARAHGGDPELTAFLEASTWPDDIRK 92 Query: 50 WYKYKWTS------------------PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGA 91 ++ T H++D P G AG Sbjct: 93 DRRFYTTGREEPTATLPGFPDMERRLHWHYVDRPVNP-------------GAGTGPAAGV 139 Query: 92 IQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS------DAGGNSID 145 I L+ AL +L H +GD HQP+H D GGN + Sbjct: 140 IDRQLAVLARIVGDRQATMAERAYALPWLIHLVGDAHQPLHAASRYGPDGQSDNGGNLVS 199 Query: 146 -LRWFR---HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRE 201 + F +LH WD +D + Sbjct: 200 IVNPFAARYTSMSLHRYWDDLPGPPWLRDGRLASAARSLAALHR---------------- 243 Query: 202 CGNVFSCVNKFATESINIACKWGY-KGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAML 260 ++ ES +A + Y G +A T+S + L I +RVA+ G RLA L Sbjct: 244 PPTSPGTPEQWLDESWRLARERVYPPGDDAVPTISATFHEDALAIAGRRVAEAGYRLADL 303 Query: 261 LNNVFGASQQEDSV 274 L + + + + Sbjct: 304 LQRLLHSGPRREDR 317 >UniRef50_B8KWM0 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KWM0_9GAMM Length = 271 Score = 143 bits (359), Expect = 9e-33, Method: Composition-based stats. Identities = 58/264 (21%), Positives = 93/264 (35%), Gaps = 43/264 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH C A + + LL N ALC WPD+++ T+P H Sbjct: 22 WWDLGHAAICDAALEYVKPGTRLEIDRLLATRDNRGFGALCSWPDEIK--TDQPTTAPWH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ P D + + + +LS R EALL++ Sbjct: 80 YLNVPVGTT------DIATAPRPAEGDILAVLTEQQARLSQANTDIHAR----AEALLWV 129 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRWFR----------HKSNLHHVWDREIILTAAKD 170 +H +GD+HQP+HV + D GG+S L+ R ++ +H +WD + L A Sbjct: 130 AHLVGDLHQPLHVAYAEDRGGSSYRLQVPREIRALLGERYEETGMHQIWDGYLPLYARYS 189 Query: 171 YYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACK--WGYKGV 228 + L+ E ++A ES+ I Y Sbjct: 190 GGSGLKQLVIEQSAEAG-------------------GTPLEWAQESLTIMNNPGTAYLYG 230 Query: 229 EAGETLSDDYFNSRLPIVMKRVAQ 252 L + Y I +KR+ Q Sbjct: 231 YRITILDEAYLAKNYRIALKRMKQ 254 >UniRef50_D0Y4Z6 Phospholipase C/P1 nuclease domain-containing protein n=1 Tax=Caulobacter segnis ATCC 21756 RepID=D0Y4Z6_9CAUL Length = 307 Score = 137 bits (345), Expect = 4e-31, Method: Composition-based stats. Identities = 57/314 (18%), Positives = 93/314 (29%), Gaps = 77/314 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAA--------------HAVKMLLPEYVNG-DLSALCVWPD 45 W+ GH+M +A + +A VK + E + WPD Sbjct: 23 WNGRGHMMVAAVAWEEMTPKAKARAAALLRKNPNYGDWVKGVPVELADKVAFMNAATWPD 82 Query: 46 QVRHWYKYKWTSP-------------------LHFIDTPDKACNFDYERDCHDQHGVKDM 86 +R ++ P HF + + D + Sbjct: 83 DIRSTHQDDGYDPTVPQADDNVGYSDPYVHAYWHFTN-------IAFSIDATPVPPPPAV 135 Query: 87 CVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS-------DA 139 I+ F+ L+ S + L++++H +GD+HQPMH D Sbjct: 136 NAIERIKLFSATLA-----PSGDDDVQSYDLVWVAHLVGDMHQPMHATSRYSQAKKRGDN 190 Query: 140 GGNSIDLRWFRHKS---NLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDL 196 GGN + + LH WD + ++D + +D L Sbjct: 191 GGNGVFVCKTGQCDKGQKLHQFWDYGVG-------SSQDYASVIAA----------ADKL 233 Query: 197 ASWRECGNVFSCVNKFATESINIACKWGYKGV----EAGETLSDDYFNSRLPIVMKRVAQ 252 + + ES +A Y + L+ Y +VA Sbjct: 234 PKAPAAQRAIGDPDAWLQESYQLARTKAYVDPIGPAKGPYVLTTRYRVEAGQTCEAQVAL 293 Query: 253 GGIRLAMLLNNVFG 266 G RLA LLN G Sbjct: 294 AGARLADLLNARLG 307 >UniRef50_A8PCL3 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8PCL3_COPC7 Length = 484 Score = 137 bits (344), Expect = 5e-31, Method: Composition-based stats. Identities = 50/227 (22%), Positives = 83/227 (36%), Gaps = 52/227 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVN-----------GDLSALCVWPDQVRH 49 W GH + IAQ L+ + LL V+ LS++ W D + Sbjct: 27 WGAAGHEIVATIAQIHLHPSVLPTICALLDIDVDASDDTSSLRAKCHLSSIATWAD--KE 84 Query: 50 WYKYKWTSPLHFI----DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG 105 K +W++ +H++ D P + C F + G + + V A +N T L+ + Sbjct: 85 KMKIRWSAAMHYVGAVDDFPRERCEFPGPKGWA---GTRSINVLDATKNVTRILAEWGGV 141 Query: 106 TSDRRYNMT-------------------------------EALLFLSHFMGDIHQPMHVG 134 + ++ EA FL HF+GD+HQP+H+ Sbjct: 142 DENEFSLVSPVTSYVPPYGSRSQVPGKRVKQLPVPGPLQEEAFKFLVHFVGDMHQPLHLT 201 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEE 181 + GGN I + + +NLH WD I + L + Sbjct: 202 GRA-RGGNGIKIHFGTRTTNLHSAWDTMIPTKLIRTVPRNYTRPLPD 247 >UniRef50_B3L390 S1/p1nuclease, putative n=8 Tax=Plasmodium RepID=B3L390_PLAKH Length = 417 Score = 136 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 55/319 (17%), Positives = 107/319 (33%), Gaps = 61/319 (19%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHF 61 S EGH +A L E + +K LL D+ + W V K K +HF Sbjct: 34 SGEGHEAIGMVAMSGLKSEQLYELKKLL---SGKDIVDIGKWGHLV--HEKIKGAESMHF 88 Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG---------------- 105 + + C C D++G+ C+ +I++F +L+ + Sbjct: 89 -NLQNHDCKR-AVFKCEDENGL---CLINSIKHFYVKLAGGKPTDHTTGQSTNQSTGQAT 143 Query: 106 -------------------TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDL 146 + + +AL +L + D+HQP+ + + D GG I + Sbjct: 144 EEHALNSAPPEAKDIPFKYPQNIAFTDADALKYLVSLIADMHQPLRIAYRYDNGGKDIKV 203 Query: 147 ----RWFRHKSNLHHVWDREIILTAAKDYYAKDINLL--------EEDIEGNFTDGIWSD 194 + ++NL + E+I K Y + E + + Sbjct: 204 IHHDDYKTVRTNLFDYMESELINKMIKRYQSAWYGGWTHINRLLDEHKKDEKLFSEKGIN 263 Query: 195 DLASWRECGNVFSCVNKFATE--SINIACKWGYKGVEAGETLSDDYFNSRL--PIVMKRV 250 + W E C + + + K + + + Y ++ + Sbjct: 264 AIDIWGEQIINEFCSEFYLNSYVTNFMVEKKDELHFDTSKEIEITYDLEFHLERLLKVNI 323 Query: 251 AQGGIRLAMLLNNVFGASQ 269 + G R+A+LLN++F + Sbjct: 324 LRAGSRIAILLNSLFANRK 342 >UniRef50_A4BF01 Probable endonuclease n=1 Tax=Reinekea blandensis MED297 RepID=A4BF01_9GAMM Length = 262 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 55/271 (20%), Positives = 96/271 (35%), Gaps = 28/271 (10%) Query: 5 GHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALC--VWPDQVRHWYKYKWTSPLHFI 62 GH M ++ L D A ++ L E + ++ + V D R + K PL Sbjct: 9 GHTMVAQLMVPFLKDGARSELERLYGEDWSREIVSRAAMVQADLNR--PQNKSMIPLQLT 66 Query: 63 DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 F ++ C + + C GA+ L +D+R +A ++L H Sbjct: 67 LFEQGDETFQPDKHCPN-----NRCSVGAVLESREVLLRSSFSDADKR----QATIYLMH 117 Query: 123 FMGDIHQPMHVGFTSDAGGNSIDLRWFR-HKSNLHHVWDREIILTAAKDYYAKDINLLEE 181 + +H P++ G D GG I L+ NL +W+ ++ K ++ Sbjct: 118 YALQMHIPVNSGLKRDDGGRKIYLKDDDLQPVNLAWIWNHDLYRQMDKRWF--------- 168 Query: 182 DIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNS 241 I D +W E N +A E+ IA Y G S + Sbjct: 169 TYAQELYRDIEKVDPQAWVESMN----PADWALEAHEIAEAEVYPLAAEGR-YSAQLKRA 223 Query: 242 RLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 ++ +++ + R A L N +F D Sbjct: 224 GTAVLEEQLKKAAYRTASLFNEMFPPEDAPD 254 >UniRef50_B4CYG7 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYG7_9BACT Length = 346 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 53/334 (15%), Positives = 94/334 (28%), Gaps = 76/334 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-------------LSALCVWPDQV 47 W GH +A L A + ++ +L +PD + Sbjct: 22 WDTPGHEQIADMAYTRLTPAAKNKIREILQHGDPRYVPANNGDDTLRDAFRRASSFPDVI 81 Query: 48 RHW-------------------------------YKYKWTSPLHFIDTPDKACNFDYERD 76 R +Y H+ DTP + Sbjct: 82 RDPGASTVFDDAYVDRMNLTFQPDVSPQQLAKPKSEYIRCKTWHYYDTPIHYSTSHAPKI 141 Query: 77 CHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY-NMTEALLFLSHFMGDIHQPMHVGF 135 A T QL+ + + + L ++ H GD+HQP+H Sbjct: 142 YES-------NALVAYNYATAQLAKLKNSAAGADLRDAAWWLCWIEHLTGDLHQPLHCTS 194 Query: 136 TS------DAGGNSIDL--RWFRHK-----SNLHHVWDREIILTAAKDYYAKDINLLEED 182 D GGN++++ W NLH WD I A A+ + Sbjct: 195 NYAHNHRGDIGGNAVNIIAPWDGASGALHAVNLHSYWDEGIDHAAGGHRSARQDLTPADA 254 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVE---------AGET 233 + TD ++ + V + + +A Y+ G Sbjct: 255 M--EVTDAWLRNNQLKPGDSDAADLNVAHWIAQGAALADAHVYQETNAAGQTQEIIDGTN 312 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 ++ Y ++ + + + RLA +LN +F Sbjct: 313 VTPQYTTDQIDVCEHQAVRAAYRLAAVLNGIFQP 346 >UniRef50_A4YRX0 Putative uncharacterized protein n=2 Tax=Bradyrhizobium RepID=A4YRX0_BRASO Length = 312 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 44/296 (14%), Positives = 76/296 (25%), Gaps = 69/296 (23%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLL---------------PEYVNGDLSALCVWPD 45 W EGH+ +A L+ LL + WPD Sbjct: 22 WWDEGHMQIAYLAYKKLSPTVRDRADALLKLNPDYASWIAGAPQGQEKLYAFVHAATWPD 81 Query: 46 QVRHWYKYKW------------------TSPLHFIDTPDKACNFDYERDCHDQHGVKDMC 87 ++ Y + H+ D D + Sbjct: 82 DIKMKPDYYDDQVGDSTAKQLVPYGHLKHTYWHYKD----------ALFSVDDTPLPRPD 131 Query: 88 VAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV---------GFTSD 138 A+ ++ + + +L + H +GD+HQP+H D Sbjct: 132 AVDAVSQLKLMIAKLPANSDATEPLRSYSLSWTIHLVGDLHQPLHAIARYSAALPDKGGD 191 Query: 139 AGGNSIDL-RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 GGN + NLH WD Y + + D G + Sbjct: 192 RGGNEEQVIAANGETQNLHAYWDG-----IFGGYSTVFGAMFDADQRGGLS-------TV 239 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGV----EAGETLSDDYFNSRLPIVMKR 249 + +A ES ++A Y + L+ +Y + K+ Sbjct: 240 TADPGKAQIVDPATWAQESFDLAKSVAYAAPIRTDKQPVELTREYETNARDTARKQ 295 >UniRef50_Q4UCH4 Bifunctional nuclease, putative n=2 Tax=Theileria RepID=Q4UCH4_THEAN Length = 391 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 49/311 (15%), Positives = 98/311 (31%), Gaps = 61/311 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W++ A + +KMLL DL W D+V + + PLH Sbjct: 22 WNELCREAIESTAMSAITYMRLRRLKMLL---KGEDLVDYTWWADEV--LKRIPESLPLH 76 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG--------------- 105 + PDK N ++ C + ++C+ I+ F L + Sbjct: 77 YQYQPDKKSN-NFNFTCSN-----NLCLMAGIKYFFAVLMNSGYPVGTSNTQKFDIPPLG 130 Query: 106 -TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREII 164 +++ ++ + +L + D+H P+H+ FT +I + VW+ I Sbjct: 131 YPRKIKFSPSDCIKYLVVLLSDLHHPLHLDFTQPDSIATIPVDLSDFP-----VWEN-IS 184 Query: 165 LTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGN---------------VFSCV 209 + + L+ + + + SW C Sbjct: 185 VQTLNTKRPLYGDFLKHIYMPKYIEVNENAWYGSWTHVSTLGLRYSTELDLFNNKTVECF 244 Query: 210 NKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMK-----------RVAQGGIRLA 258 +A E+ ++ E LSD + + ++ G R+A Sbjct: 245 EVWAAETASLNNTIF--DKEDFVYLSDTVRTKAIRFTERLDSKLGFLMRLQIVMAGARVA 302 Query: 259 MLLNNVFGASQ 269 ++LN + + Sbjct: 303 IVLNYILSHRE 313 >UniRef50_A2DKF6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DKF6_TRIVA Length = 323 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 81/269 (30%), Gaps = 29/269 (10%) Query: 8 MTCRIAQGLLNDEAAHAVKMLLPEYVNGDL----SALCVWPDQV-RHWYKYKWTSPLHFI 62 +I + + + + DL + + W V R + +K + HF Sbjct: 19 TVSQIVLDKMGKAYTANLSSVFLAAGDTDLVSHPAKVGAWMSYVERPPFNFKGFNHWHFT 78 Query: 63 DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 P F D + I N +G++ R + + ++ L Sbjct: 79 RQPYVPKEFGQIPSQIDNDNL--------ISNVMEMSDDIYKGSTKRSWPLAFSMKILFA 130 Query: 123 FMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVWDREII--LTAAKDYYA 173 + DIH P+HV D G ++ + K+NL V++ Y Sbjct: 131 GVCDIHTPLHVSEYFSSEFPNGDQNGRLYEVVYKGQKTNLFDVYETGCGLDENLQVTYDE 190 Query: 174 KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGET 233 N +++ + D + S E + + Y V+ G Sbjct: 191 SFWNDVKDLADNLLEDFKFVSKKFSRTEITAQNAT-------TYQYTVDKIYSLVKPGGE 243 Query: 234 LSDDYFNSRLPIVMKRVAQGGIRLAMLLN 262 L+ + N + RL +LN Sbjct: 244 LTTEMINECQSHTRDMMRLAAERLVYILN 272 >UniRef50_A5FFX0 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FFX0_FLAJ1 Length = 332 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 42/289 (14%), Positives = 83/289 (28%), Gaps = 35/289 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + A L + + ++ PD ++ YK P H Sbjct: 25 WGNVGHERINKAAVMALPKQLQ-----IFFYNHIDFITQEASVPDIRKYALNYKEEGPRH 79 Query: 61 FIDTPDKACNFDYERDCHDQHGVKD-------MCVAGAIQNFTTQLSHYREGTSDRRYNM 113 + D + Y + + D + I++ +L+ + + Sbjct: 80 YFDMENFGAADTYPQTLEEAKQKYDAKFLSDNGILPWYIEDMMAKLTKAFKEKNRAEILF 139 Query: 114 TEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYA 173 A L H++GD H P+H D + +H +W+ + K+Y Sbjct: 140 LAAD--LGHYVGDAHMPLHTSANHDG--------QLTDQKGIHSLWESRLPELFVKNYK- 188 Query: 174 KDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESIN------IACKWGYKG 227 +N+ E + IW + + T + A K Sbjct: 189 --LNVPEAQYYTDVHKAIWDMINDTHSFAQPLLDIDKSLRTATPQDKVFKLDAEGKVLKS 246 Query: 228 VEAGETLSDDYFNSRLP----IVMKRVAQGGIRLAMLLNNVFGASQQED 272 SD+Y +V ++ + A + + + D Sbjct: 247 KYNTAVFSDEYAKKLHEQLNGMVETQMRKAITATASFWYTAWVNAGKPD 295 >UniRef50_C5PTL3 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PTL3_9SPHI Length = 315 Score = 127 bits (319), Expect = 4e-28, Method: Composition-based stats. Identities = 44/289 (15%), Positives = 82/289 (28%), Gaps = 36/289 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H + R A L E + + +++ V D + Y SP H Sbjct: 20 WGFYAHKLINRNAVFTLPTEL-----AVFYKQNIDEITEKAVDAD--KRCYIDSAESPRH 72 Query: 61 FIDTPDKACN---------FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID N + + ++ + + V I +L + Sbjct: 73 FIDLDAYDTNTLDTLPVHWYRAKEKIEEKRLLSNGIVPWQIYITYQKLVKAFIARDKIKI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + + +H W+ + A Y Sbjct: 133 IRHSAD--LGHYVADAHVPLHTTKNYNG--------QYTDQIGIHAFWESRLPEMFATHY 182 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 + + W+ S V + + + K Y Sbjct: 183 KLTAG---KAQFITDPAALGWAIVYESAPLADTVLRIEKELSVR-FPASQKKTYLTRNNV 238 Query: 232 ETLSD------DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSV 274 L+ Y + +V R+ Q R+ L + + + Q D Sbjct: 239 LVLTYSDAYAKAYHEALNGMVEVRMRQAIHRIGSLWYSAWIEAGQPDLR 287 >UniRef50_A2DRT9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DRT9_TRIVA Length = 300 Score = 126 bits (316), Expect = 9e-28, Method: Composition-based stats. Identities = 45/271 (16%), Positives = 77/271 (28%), Gaps = 28/271 (10%) Query: 7 VMTCRIAQGLLNDEAAHAVKMLLPE--YVNGDLSALCVWPDQV-RHWYKYKWTSPLHFID 63 + + + + +S W R + + HF Sbjct: 3 AIAGEVGLEQFGFSLQKKLNSVFQNAGDDFTRVSQAAAWLYYAERPPFNIPSFNHWHFYS 62 Query: 64 TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHF 123 P N E D +KD NF + R G R + + Sbjct: 63 QPINPNNLSIE-THIDVDNLKD--------NFDSIRKSVRGGKVSRTWPFAFLMKLYLTG 113 Query: 124 MGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDI 176 M DI+ P+HV D G +++ + +L+ +W+ Y+ + Sbjct: 114 MCDIYSPLHVSELFNEQFPNGDRNGRDFYVKYNGNFISLYDLWETGCG------YFDSQV 167 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSD 236 + ED LA E V + + N Y G+ G +S Sbjct: 168 DFTSEDDWKKIDKLTNELSLAFTSEDWPSTLSVTQVIEGNYNYTRDTVYNGLVNGSEVSK 227 Query: 237 DYFNSRLPIVMKRVAQGGIRLA---MLLNNV 264 +Y + V G R+A LN + Sbjct: 228 EYITTCQNYAQDIVILAGKRIATDLANLNII 258 >UniRef50_C5KYE5 S1/P1nuclease, putative n=6 Tax=Perkinsus marinus ATCC 50983 RepID=C5KYE5_9ALVE Length = 357 Score = 123 bits (309), Expect = 6e-27, Method: Composition-based stats. Identities = 48/275 (17%), Positives = 95/275 (34%), Gaps = 19/275 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W K+ H +L+ + LL +S + Y T H Sbjct: 21 WDKDIHERIGEAVSRVLSYRDIEDLNKLLKGQSIPYMSRYA---HDKLQYANYDRTVENH 77 Query: 61 FIDTPDK-ACNFD-YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 + C FD D + + + T + + + Sbjct: 78 YETQLRDWQCTFDVNNPDKYAESQGLYRSIHDIFGRVTHASKSGEDHGIAKDMTEPVQIS 137 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRW-FRHKSNLHHVWDREIILTAAKDYYAKDIN 177 +L + D+HQP+H GF +D G I +++ +NL+ W+R+I +AA + Sbjct: 138 WLLGLVQDLHQPLHTGFGADDHGRRISVQYHDDPSTNLYDFWERDIS-SAANLETQLVLK 196 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYK--------GVE 229 +++ DG + L + + ++ ES+ ++C Y V Sbjct: 197 AYNAELDKLVQDGGYGIQLVNKIYSKG----IAEWIAESMEMSCSDIYSVIAGGRGREVP 252 Query: 230 AGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNV 264 + DD + + K+V + R A++L+ + Sbjct: 253 RMYQIDDDVYAKWRDLATKQVVKAAARSAVVLHGI 287 >UniRef50_Q11TZ7 Putative uncharacterized protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11TZ7_CYTH3 Length = 318 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 40/290 (13%), Positives = 84/290 (28%), Gaps = 35/290 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H ++A L K ++ V PD+ R+ + +P H Sbjct: 24 WGFFAHKEINKMAVFTLPHPLMSFYKR-----HIDFITEQAVNPDKRRYIVSGE--APKH 76 Query: 61 FIDTPDKACNFDYERD--------CHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 ++D + + R + + + T +L+ + + Sbjct: 77 YMDIEYYSDSILIVRPDWNTAQAIYPEDSLHAHGILPWNLVRLTYRLTDAFKHRDAKSIL 136 Query: 113 MTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYY 172 A L H++GD+H P+H + + +H +W+ + + DY Sbjct: 137 KLSAD--LGHYVGDLHVPLHTTKNYNG--------QLTGQQGIHGLWESRLPELFSADYN 186 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA-- 230 + + +W S R C V + + + Y+ Sbjct: 187 YYLG---TANYVTDIKKVVWESMTES-RACVAQVLAVELKLQQQMKADKIFSYEDRNGQT 242 Query: 231 ----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVA 276 S+ Y + +V KR+ L + + + D Sbjct: 243 VRVYSYDFSNAYHKALEDMVQKRMRAAIKCTGDLWYTCWVNAGRPDPAAF 292 >UniRef50_Q6LI73 Hypothetical endonuclease n=2 Tax=Photobacterium profundum RepID=Q6LI73_PHOPR Length = 305 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 70/311 (22%), Positives = 109/311 (35%), Gaps = 77/311 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKML-------LPEYVNGDL---------SALCVWP 44 W+ +GHV +IA L+ A V +L +PE + + + L + P Sbjct: 29 WNYQGHVTVAQIAYQNLDTTARTQVDVLAAKAYQSMPEDIQQKMDSFEGASQFAKLAMVP 88 Query: 45 DQVRHWY-------------------KYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKD 85 D +R K T H+I+ + C D Sbjct: 89 DLIRKIPAEDIWAQMGETIPASLNQWDEKETGAWHYINQ-----AYPATSQC-------D 136 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS-------- 137 I+ + L + +++F+SH GD HQPMH S Sbjct: 137 FIHVPNIKLVASYLFDDFKQNPQ-----AASMMFMSHVAGDSHQPMHSISQSLSKNVCVT 191 Query: 138 DAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 D G N L + +LHH+WD + L +IN D++ + Sbjct: 192 DLGANKHTLDV--PQKDLHHLWDSGMGLLG----TEHNINDFATDLQLAY---------P 236 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 S + VN + TES +A +GY V S+ Y+N +V +R+ Q G RL Sbjct: 237 STTMTLGKTADVNLWVTESYQLA-DFGYS-VAIDAKPSESYYNKGTELVKQRLTQAGYRL 294 Query: 258 AMLLNNVFGAS 268 A LN+ Sbjct: 295 ADELNSALAKK 305 >UniRef50_C6VWZ8 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VWZ8_DYAFD Length = 341 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 46/276 (16%), Positives = 81/276 (29%), Gaps = 36/276 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L E + + L+ V PD+ R+ + + H Sbjct: 42 WGFWAHKRINRLAVFRLPMEMQ-----VFYKKHIDYLTENAVNPDKRRYAVVGE--AERH 94 Query: 61 FIDTPDKACNFDYERDCH---------DQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID + H + K V +Q +QL+ + R Sbjct: 95 FIDLDVYGDSALAVLPKHWQAAVNKVGEDSLRKHGIVPWHVQIAASQLTSAFREKNAARI 154 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + +H W+ + A+ Y Sbjct: 155 LRMSAD--LGHYIADAHVPLHTTRNYNG--------QLTGQDGIHGFWESRLPEIYAEQY 204 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA- 230 + IW AS +V K TE+ K+ ++ Sbjct: 205 DMWLGP---AAYREDIAHDIWQAVEASHSGSDSVL-AFEKQLTEAFKPDKKYAFELRNNI 260 Query: 231 -----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLL 261 S+ Y + V +R+ + + Sbjct: 261 LTRMHSRDFSEKYHRALAGQVERRMRASVQMVGDVW 296 >UniRef50_C7PNU1 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNU1_CHIPD Length = 313 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 37/280 (13%), Positives = 73/280 (26%), Gaps = 35/280 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L E + + LS D+ R+ P H Sbjct: 20 WGFFAHQRINRLAVFSLPPEML-----VFYKPNIEYLSTHATDADKRRYI--IPEEGPRH 72 Query: 61 FIDTPDKACNF---------DYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 +ID + + + ++L+ + R Sbjct: 73 YIDIDHYGQAPFAALPRSWEEALLKYTADTLQTYGILPWYLTQMLSRLTQAFKDKDPDRI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A + H+ GD H P+H + + +H +W+ I A Sbjct: 133 MRLSAD--IGHYAGDAHVPLHACSNHNG--------QRTGQQGIHGLWESRIPELMADKT 182 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA- 230 + + + W L S V K ++ K+ Y+ Sbjct: 183 FQ--YLSAKAYYIKDINAYTWQIVLESAAAADTVLQQ-EKLVSDRFPSGRKFAYEKRNGK 239 Query: 231 -----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 + Y + ++ +R++ A + Sbjct: 240 LIRNYATAYAKAYHGALGDMIERRMSAAISATANYWYTAW 279 >UniRef50_C5BI21 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BI21_TERTT Length = 343 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 84/311 (27%), Gaps = 75/311 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAV--------------KMLLPEY--VNGDLSALCVWP 44 WS GH + A L+ A LP+ L WP Sbjct: 64 WSYSGHAVILGSALSQLDPTARKEAFTQIEYLYNRASGNSRFLPKSCLSQKSLCFFASWP 123 Query: 45 DQVRHWYKYKWT-------------------SPLHFIDTPDKACNFDYERDCHDQHGVKD 85 D+ R + + HF + + + C + + Sbjct: 124 DRERDKTLGELYRMVGAEVPAVLKGLTSSEIASWHFTNQVFNLNDRKFSAACELRDRGQL 183 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHV------GFTSDA 139 V +++ L +H + D HQP+H G D Sbjct: 184 YDVLPQLESA--------LIRELSIAQRAVTLALWTHLLADAHQPLHNLTGSLEGCAHDF 235 Query: 140 GGNSIDL--RWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 GGN + + R + + +LH +WD L D + Sbjct: 236 GGNGLCVVKRRNKCERSLHQLWDSGAGLFDKPDMISPLGVADAR---------------- 279 Query: 198 SWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 ES+ +A + +E S+ Y + + R Q R+ Sbjct: 280 -----SPTAVDYRVIQNESLALASEVYAPNLELS---SNAYITTVRRLSRIRAQQAAQRI 331 Query: 258 AMLLNNVFGAS 268 A+LL + G Sbjct: 332 ALLLKELTGNK 342 >UniRef50_C6Y3Y4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y3Y4_PEDHD Length = 285 Score = 117 bits (292), Expect = 6e-25, Method: Composition-based stats. Identities = 38/279 (13%), Positives = 79/279 (28%), Gaps = 35/279 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H+ R+A L K LS V PD+ R+ + H Sbjct: 20 WGFYAHIRINRLAVFTLPAGLNRFYKA-----NISYLSDHAVDPDKRRYADT--AEAARH 72 Query: 61 FIDTPDKACNFD-YERDCHDQHGV-------KDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 ++D + D R + ++ + IQ +L H + Sbjct: 73 YLDVELYEAHIDSIPRKWEEAVKRYGLVRLNQNGILPWQIQKSYYKLVHALRDRDSLKIL 132 Query: 113 MTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYY 172 + A +L H++ D H P+H + ++ +H W+ + AK Y Sbjct: 133 IYSA--YLGHYLADAHVPLHTTQNHNG--------QLSNQLGIHAFWESRLPELFAKKY- 181 Query: 173 AKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA-- 230 + + + N W + + V K+ + Sbjct: 182 --NYVVGQAIYIENPLKEAWKIITHTHKMVDTVL-TFEARLNARFPAHRKYSFSERNNQV 238 Query: 231 ----GETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 S + + +V +++ + + + Sbjct: 239 GRQYSLAYSKAFHDGMNHMVERQMRAAIHSIGSYWYSAW 277 >UniRef50_A4KXI8 Putative S1/P1 nuclease n=2 Tax=Ascovirus RepID=A4KXI8_HVAVE Length = 277 Score = 116 bits (290), Expect = 8e-25, Method: Composition-based stats. Identities = 49/274 (17%), Positives = 85/274 (31%), Gaps = 46/274 (16%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W++ GH + +A+ + + + + L + PD + + LH Sbjct: 33 WAQNGHRVCAAVARAHIAP---ALLNHIESNLLKATLDEVSNDPDNIDVERR-----HLH 84 Query: 61 ---FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 ++DTP D C+ A Sbjct: 85 WVNYVDTPSDGAQNVSSYLTSDCQIDNRECIVSA-------------------------- 118 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF-RHKSNLHHVWDREIILTAAKDYYAKDI 176 H++ D+HQP+HV + A + + WF LH VWD E+ Y + Sbjct: 119 ---VHYICDLHQPLHVIPATYANQSFARVLWFHGFNYTLHQVWD-ELPEQLHLSYESHAK 174 Query: 177 NLLEEDIEGNFTDGIWSD-DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETL- 234 L+ I + + W + + + E + E G + Sbjct: 175 WLVRHHISPEMYVAMVKQTTVDKWIDSRVAAYEIARKLNE--KLVKCHTENNSERGRYIC 232 Query: 235 SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGAS 268 + + S P V +A GG+RLA L F Sbjct: 233 NLKFVFSARPTVDSSLASGGVRLAGYLKQSFKNK 266 >UniRef50_A6E734 Putative uncharacterized protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6E734_9SPHI Length = 271 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 82/273 (30%), Gaps = 35/273 (12%) Query: 7 VMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPD 66 + +A L + + L V PD+ R+ + H++D Sbjct: 1 MRINELAVFTLPEGMYT-----FYKQNRRYLRDHAVDPDKRRYADT--SEAARHYLDVEH 53 Query: 67 KA-CNFDYERDCHDQHG-------VKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALL 118 C R D + + IQ +L + + + A Sbjct: 54 YEVCIDSIPRKYPDAVKKYGLKKMNQSGILPWQIQQSYYKLVRAFQQRDSAKILIYSA-- 111 Query: 119 FLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINL 178 +L H++ D P+H D + +H W+ + ++DY + L Sbjct: 112 YLGHYLSDAQVPLHTTANHDG--------QLSGQQGIHAFWESRLPELFSEDY---NFLL 160 Query: 179 LEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGET----- 233 + + + W + +V ++ S I K+GY + Sbjct: 161 GKAQYISDPLEEAWKMVSKTHLLVDSVLQ-LDSVLNSSFPIYRKYGYSKRKNKVVKQHTE 219 Query: 234 -LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 S Y +S +V +++ + + + + Sbjct: 220 GYSRLYHDSMKHMVERQMREAIRKTGAYWYSAW 252 >UniRef50_A7ARD9 S1/P1 nuclease, putative n=1 Tax=Babesia bovis RepID=A7ARD9_BABBO Length = 393 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 90/304 (29%), Gaps = 52/304 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W A + + +K++L + DL W D+VR + ++ LH Sbjct: 23 WDDITREAIESTAMSAITFDRLRRMKVILRGH---DLVDYTWWSDEVRK--RIPESATLH 77 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREG--------------- 105 D+ C ++ C + +C+ + F +L Sbjct: 78 RQLQNDETC-LTFDSTCPN-----GLCLIQGSKFFFAKLMSSGYSIVSQPIKFELPLFRY 131 Query: 106 TSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIIL 165 D + ++ L +L + D+H P +V + W+ + Sbjct: 132 PKDVTFTPSDCLKYLVVLLSDMHYPFNVDLAEPHSLAHRKVDLSGFPM-----WE-ALSK 185 Query: 166 TAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECG---------------NVFSCVN 210 + + + ++ SW N + Sbjct: 186 EKLGHAKPSFEDFIMKVYMPHYIQTNEESWYGSWTNVEVLGSRYKVEQETFNRNTWDNFE 245 Query: 211 KFATESINIACK-----WGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVF 265 +A+E+ N+ C + + LSD + + ++ G R+A++LN + Sbjct: 246 IWASETANLHCNGLVTKSDFSKDKQTIKLSDALLDRIGNTIKFQIVLAGARVAVVLNYIL 305 Query: 266 GASQ 269 + Sbjct: 306 SHRE 309 >UniRef50_D2QFB3 Putative uncharacterized protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QFB3_9SPHI Length = 354 Score = 114 bits (285), Expect = 4e-24, Method: Composition-based stats. Identities = 48/272 (17%), Positives = 83/272 (30%), Gaps = 48/272 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R+A L + K LS V PD+ R+ + +P H Sbjct: 52 WGFFAHQQINRLAVFTLPVDMIPFFKK-----HINFLSDNAVNPDKRRYAVVGE--APRH 104 Query: 61 FIDTPDKACNF--DYERDCHDQHGVKDMC-------VAGAIQNFTTQLSHYREGTSDRRY 111 FID R + V IQ QL+ + + RR Sbjct: 105 FIDLDAYPDTTSATLPRYYKEATDRYGEDSLALHGLVPWQIQLTKYQLTEAFKQRNVRRI 164 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D + P+H + ++ +H W+ + + +Y Sbjct: 165 LRVAAD--LGHYIADANVPLHTTRNYNG--------QLTNQQGIHGFWESRLPELFSANY 214 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC------VNKFATESINIACKWGY 225 D + I+S A+WR N + + + TE + K+G+ Sbjct: 215 ----------DFLTGQAEYIYSPQKAAWRAVFNANAALDSVLHIERQLTEQVGETRKYGF 264 Query: 226 KGVEA------GETLSDDYFNSRLPIVMKRVA 251 + S Y V +++ Sbjct: 265 EERNGITAKVYSADFSQQYHERLHGQVERQMR 296 >UniRef50_A2FG69 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FG69_TRIVA Length = 339 Score = 113 bits (283), Expect = 5e-24, Method: Composition-based stats. Identities = 38/277 (13%), Positives = 85/277 (30%), Gaps = 31/277 (11%) Query: 8 MTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSA----LCVWPDQV-RHWYKYKWTSPLHFI 62 + + E + + DL+ L W + V R + K + HF Sbjct: 3 TVSNMVMDKIELEYKQKLARTFLRSADYDLAKNLSKLSTWMNYVERPPFNLKCFNHWHFS 62 Query: 63 DTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 P + +Y + + + D+ A + F + ++ L L Sbjct: 63 REPFTLESRNYIPQYNGKDNLVDVLKESATKIFFLI--------PSSPFILSTHLKVLFA 114 Query: 123 FMGDIHQPMHV-------GFTSDAGGNSIDLRWFRHKSNLHHVWDREI-ILTAAKDYYAK 174 + DIH MH D G + + ++L V + + + +++ Sbjct: 115 GVPDIHATMHTQEFFSNDFPDGDRNGQVFYVMYNGTNTSLFDVLESGCGLDSQKHATFSR 174 Query: 175 DINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETL 234 D ++ + +S V E+ Y + G+T+ Sbjct: 175 DFWEDVRKLKVELFKSWETPTFSSTDSV------VEAAKIENREYTKATIYSKLRPGDTI 228 Query: 235 SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQE 271 SD++ +++ + A +L ++ +E Sbjct: 229 SDEFITECQTRTKQQILKS----AEILYHITENKMKE 261 >UniRef50_C0A652 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A652_9BACT Length = 348 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 53/319 (16%), Positives = 95/319 (29%), Gaps = 56/319 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W EGH + ++A L E V+ ++ L PD+ R+ + Sbjct: 23 WDYEGHRIVNQLALAALPPEFPAFVRE---AANAERIAFLSGEPDRWRNVEDGPLRHAQT 79 Query: 58 PLHFIDTPD---------------------------KACNFDYERDCHDQHGVKD--MCV 88 P HF D + + D+ +D + Sbjct: 80 PDHFFDIEYLVEGGLPLAKLSEFRQVFAVQLAEARAARPSAYPKSGSKDKDRTRDLVGFL 139 Query: 89 AGAIQNFTTQLSHYRE------------GTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT 136 AI ++ ++ R N+ + L H++GD QP+H Sbjct: 140 PWAITENYGRVKSAFTYLKAYEALGTPEEVANARANVVYQMGLLGHYVGDGAQPLHTTKH 199 Query: 137 SDAG----GNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 + G++ + R F + LH D I A + D +G Sbjct: 200 FNGWAGEAGSAANPRGFTTRRTLHSWIDGGYIAAARITVADLLPRAFKADPLTLSGEGRG 259 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 +D VF + Y+ +AGE + + +R+ + Sbjct: 260 GNDARR----DPVFEAALAYLVRQHEQVIPL-YELEKAGELNAPPATRKGRAFIEQRLQE 314 Query: 253 GGIRLAMLLNNVFGASQQE 271 GG LA L + + Q+ Sbjct: 315 GGRMLACLWITAWREAGQD 333 >UniRef50_D2LJW8 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LJW8_RHOVA Length = 200 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 51/203 (25%), Positives = 73/203 (35%), Gaps = 34/203 (16%) Query: 89 AGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRW 148 A+ + L+ ++ T R L L+HFMGDIHQPMHV F D GGN I Sbjct: 2 VSAVLDDMRDLAFAQDVTEQLRL-----LKTLTHFMGDIHQPMHVSFEDDKGGNLISASG 56 Query: 149 FRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSC 208 +S LH WD +I + I + I S D + W Sbjct: 57 LCGRS-LHAAWDSCLIEKTLG--------FDSDTIATSLEAEITSGDRSRWLAGDIGPKA 107 Query: 209 VNKFATESINIA--------------------CKWGYKGVEAGETLSDDYFNSRLPIVMK 248 V +A E+ I + G + + + Y + P V Sbjct: 108 VASWANETFTITTRPEVGYCERASDGCRYSAYQPEYHGGAQKVVVVDEHYLSVNAPFVRD 167 Query: 249 RVAQGGIRLAMLLNNVFGASQQE 271 R+ G+RL +LN+V Q Sbjct: 168 RIKAAGVRLGAVLNSVLMPDQSP 190 >UniRef50_Q8XRE8 Putative signal peptide protein n=1 Tax=Ralstonia solanacearum RepID=Q8XRE8_RALSO Length = 337 Score = 111 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 60/320 (18%), Positives = 106/320 (33%), Gaps = 66/320 (20%) Query: 2 SKEGHVMTCRIAQGLLN-DEAAHAVKMLLPEYVNGDLSALCVWPDQVRH----------- 49 +GH +A L+ A V+ +L L VW D + Sbjct: 27 GPDGHQTVGELADSLIAGTNAESQVQNILGM----TLEQASVWADCAKGVTRTQSGKFVY 82 Query: 50 --WYKYKWTSP---------------------------------LHFIDTPDKACNFDYE 74 Y P H+ D + + Sbjct: 83 QGAGHYPECKPFETTTGKSAMVAFVKRNWSGCHPAADEEVCHKQYHYTDVALQRGQYQQ- 141 Query: 75 RDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 G D + AI+ +L + + EALL LSH++GDIHQP+HV Sbjct: 142 ----GLVGTSDHDIVAAIRAAIIKLQGGTTPSPIDFASKREALLLLSHYVGDIHQPLHVS 197 Query: 135 FTS-DAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWS 193 DA G+ +D + I+ K ++ D + G+ + Sbjct: 198 AVYLDAQGHVVDPDQGTFDPQTKTIGGNSILDAGKKLHFEWDQVPAALKPDQLGVSGV-A 256 Query: 194 DDLASWRECGNVFSCVNKFATESINIACK----WGYKGVEAGE----TLSDDYFNSRLPI 245 + A G++ S ++AT++++ A + +A + TL +Y + R + Sbjct: 257 EARAIPLTSGDIISWPAQWATDTMHSAAPAFSGTAFSAEDASKHWQVTLPANYVSERETV 316 Query: 246 VMKRVAQGGIRLAMLLNNVF 265 ++ + G RLA LL ++ Sbjct: 317 QRAQLIKAGARLAQLLQAIW 336 >UniRef50_B3EUC7 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=B3EUC7_AMOA5 Length = 317 Score = 110 bits (274), Expect = 6e-23, Method: Composition-based stats. Identities = 37/261 (14%), Positives = 81/261 (31%), Gaps = 32/261 (12%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R A L K L ++ V PD+ R+ + + + H Sbjct: 22 WGFAAHKHINRCAVFTLPPAMFTFYKYYLG-----YITENAVNPDKRRYVLEGE--ASRH 74 Query: 61 FIDTPDKACN--FDYERDCHDQHGVKD-------MCVAGAIQNFTTQLSHYREGTSDRRY 111 +ID N +D V IQ+ +L++ + Sbjct: 75 YIDLDYYGDNALDKLPKDWAQATHKYSQDTLLAHGIVPWHIQHMQHRLTNAFRNKDIAQI 134 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 + + H++ D + P+H + + +H +W+ + ++Y Sbjct: 135 LKLSSD--IGHYIADANVPLHTTQNYNG--------QLTGQDGIHGLWETRLPELFKEEY 184 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 N + W + + N+ + +++ N K+ Y+ A Sbjct: 185 NFFLGN---ATYVKDPQQRAWKAIIQAHATVPNLLKLEKE-LSQNFNTLHKFSYEKRGAS 240 Query: 232 --ETLSDDYFNSRLPIVMKRV 250 + S+ Y + ++ +V Sbjct: 241 LKKVYSEAYARAYHDLLQGQV 261 >UniRef50_A2EIL3 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EIL3_TRIVA Length = 310 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 71/225 (31%), Gaps = 28/225 (12%) Query: 40 LCVWPDQVRHWYKY-KWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQ 98 W +V + K + F+ TP + +Y R+ D + + G I N Sbjct: 52 AGGWLARVEYAPTNTKCFNHWRFVQTPINGSD-NYHRNKDDLTVQLNGLLGGLINNTI-- 108 Query: 99 LSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGF--------TSDAGGNSIDLRWFR 150 ++ A S + P+H D G +++ Sbjct: 109 ---------TDKWAYNFAFKVASALFFEAFSPLHTSELFDNDRFKDGDDSGKKYMIKYQG 159 Query: 151 HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVN 210 ++ +L WD + E +F + L R NV Sbjct: 160 NEMSLLDFWDSGCGRYTRQT-------PYTETQWTDFYKNVDYMLLKFPRPSCNVNITWQ 212 Query: 211 KFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGI 255 +++N+ Y+G++ + LS +Y + + I +R+A Sbjct: 213 MAVNDTLNVTNTVVYQGIKYSQELSKEYIDKCIEITDERLACAAY 257 >UniRef50_A2F5A5 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2F5A5_TRIVA Length = 343 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 28/257 (10%), Positives = 71/257 (27%), Gaps = 35/257 (13%) Query: 15 GLLNDEAAHAVKMLLPEYVNGDLSA---LCVWPDQVRHWYKY-KWTSPLHFIDTPDKACN 70 L ++ ++ ++ + W + H + Sbjct: 26 RKLGNKGISKLQKVIDM-TGEKMERPSLAGSWLASLLHAPSNTNCFDHWRYSQKNIN-AI 83 Query: 71 FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQP 130 E C ++ ++ C + +GT + + D P Sbjct: 84 PHPEHHCINKDDLE--CTLDKLN------KTIMKGTLNGPWPYNFGFKVFLTLYMDSFDP 135 Query: 131 MHVGF--------TSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEED 182 +HV D G ++++ +LH W+ K + + E+ Sbjct: 136 VHVTEYFDNDTFIDGDDNGKKFNIKFKGKNMSLHDFWETGCGRYVLKTPFNGNGWKEIEE 195 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFA---TESINIACKWGY--KGVEAGETLSDD 237 + + C + +A +S N++ + Y ++ L ++ Sbjct: 196 TTTRLYKRLNDSKF--------ITPCPSDYAGAINQSFNLSKEIVYNLSMIQKDNDLPEE 247 Query: 238 YFNSRLPIVMKRVAQGG 254 Y + + +R+ Q Sbjct: 248 YIKTCYELTDQRILQAA 264 >UniRef50_C9YFD1 Putative uncharacterized protein n=2 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YFD1_9BURK Length = 117 Score = 107 bits (266), Expect = 5e-22, Method: Composition-based stats. Identities = 43/112 (38%), Positives = 54/112 (48%), Gaps = 10/112 (8%) Query: 62 IDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLS 121 ++ P CN+ ERDC D CV AI L T AL ++ Sbjct: 1 MNFPRGDCNYQQERDCPD-----GKCVIAAIDRQIEVLR-----TPGDDEKRLTALKYVV 50 Query: 122 HFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYA 173 HF+GDIHQP+H GF D GGNS L+ F SNLH VWD +I + +D Sbjct: 51 HFIGDIHQPLHAGFGDDRGGNSYQLQAFMRGSNLHAVWDTGLIKSLKQDNEQ 102 >UniRef50_B0DTT4 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0DTT4_LACBS Length = 242 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 48/245 (19%), Positives = 80/245 (32%), Gaps = 67/245 (27%) Query: 92 IQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRH 151 ++N T L + EAL FL HF GD HQPMH+ + GGN + + + Sbjct: 1 MKNVTALLQGW-VKGETSDDAANEALKFLIHFFGDAHQPMHMT-GRERGGNQVKVAFGGK 58 Query: 152 KSNLHHVWDREIILTAAKDYYAK-----DINLLEEDIEGNFTD------------GIWSD 194 ++ WD +I +E+ + G D W+D Sbjct: 59 QTT----WDDSLITKVISTIPQNYTLPLPYPEIEQALRGASYDPYIRRIIWEGILQKWAD 114 Query: 195 DLASWRECGNVFS---------------------------CVNKFATESINIACKWG--- 224 ++ W C + C +A S ++ C Sbjct: 115 EIPGWLSCPDAVKRTFVDSQIALGLEGTTGIEILPDNDVLCPYHWARPSHDLLCDGVWLK 174 Query: 225 ------YKGVEAGETL------SDDY--FNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQ 270 Y+ + + Y + +V K++A GG+RLA L N +F Q Sbjct: 175 EVDEPPYRRTDDNPHPPLLELETPAYSGMIGQRWLVEKQLALGGLRLAGLFNYIFADQGQ 234 Query: 271 EDSVV 275 + + Sbjct: 235 RGAFI 239 >UniRef50_Q0E526 29.6 kDa S1/P1 nuclease n=1 Tax=Spodoptera frugiperda ascovirus 1a RepID=Q0E526_SFAVA Length = 261 Score = 104 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 88/274 (32%), Gaps = 47/274 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W+ GH + +A+ L+ V+ + L + D+ + + +H Sbjct: 24 WALTGHRVCANVARRLIPSPILKHVET--EVLDHETLDGVSNVADE-----TPRSLAAMH 76 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFL 120 +++ + L + + + + Sbjct: 77 YVNYNVTPTR-----------------------SARKVLEYTENNMTSTYRWDAAFITNV 113 Query: 121 SHFMGDIHQPMHVGFTSDAGGNSIDLRW-FRHKSNLHHVWDR--EIILTAAKDYYAKDIN 177 H + D+HQP+HV +D + +W + LH +WD ++ L + Y +N Sbjct: 114 VHLLCDLHQPLHVVPYADVPSTFTETQWVNGQNTTLHTIWDTLPDLRLLSHHIYAEWLVN 173 Query: 178 LLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETL--- 234 L+ + + D W + K + AG L Sbjct: 174 KLKANTYALLFEQ---DRPHKWLDSRRYAYDAAKRLND------NLARCHTNAGSKLLIN 224 Query: 235 --SDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFG 266 + + +S +V + + GG+RLA + +++ Sbjct: 225 SCNYRFVDSARALVDESLLYGGVRLAAYITSLYS 258 >UniRef50_C2FVU8 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FVU8_9SPHI Length = 238 Score = 103 bits (256), Expect = 8e-21, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 58/220 (26%), Gaps = 29/220 (13%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H + R A L E + + ++ V D + Y SP H Sbjct: 20 WGFYAHKLINRNAVFTLPTEL-----AVFYKQNIDQITEKAVDAD--KRCYIDSAESPRH 72 Query: 61 FIDTPDKACNF---------DYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 FID N + + + + V I +L + Sbjct: 73 FIDLDAYDTNTLDTLPVHWSRAKEKIEQKRLLSNGIVPWQIYITYQKLVKAFIARDKTKI 132 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 A L H++ D H P+H + + + +H W+ + A Y Sbjct: 133 IRHSAD--LGHYVADAHVPLHTTKNYNG--------QYTDQIGIHAFWESRLPEMFAPQY 182 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNK 211 + + W+ S V + Sbjct: 183 KLTTG---KAQFITDPAALGWAIVYESAPLADTVLRIEKE 219 >UniRef50_A3HWS6 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HWS6_9SPHI Length = 280 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 37/276 (13%), Positives = 84/276 (30%), Gaps = 36/276 (13%) Query: 12 IAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTPDKACN- 70 +A L E + ++ V PD+ R+ + + H+ID + N Sbjct: 1 MAIYSLPPELIA-----FYKPHIQFITEKAVNPDRRRYAVIGE--AEKHYIDLDEYGENP 53 Query: 71 --------FDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 ++ ++ K+ + L+ E +++ A L H Sbjct: 54 LDILPIYWYEAVEKFSEEELRKNGIGPWSAYLTFLNLTEAFESKNEKAILRLSAD--LGH 111 Query: 123 FMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEED 182 ++ D++ P+H + + +H W+ I + A + + Sbjct: 112 YLADLNVPLHTTKNYNG--------QLTGQEGIHGFWESRIPESQANRFELWVG---TAE 160 Query: 183 IEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEA------GETLSD 236 IW + +V K T + K+ Y+ + E + Sbjct: 161 YISQPQQAIWDAVAQAHAMVDSVL-TFEKELTSNFPQDQKYSYEQRNSLTVRVYSEEFTQ 219 Query: 237 DYFNSRLPIVMKRVAQGGIRLAMLLNNVFGASQQED 272 Y + V +++ + +A + + Q D Sbjct: 220 QYAEALDHQVDRQMRKSIKMIADFWYTAWVNAGQPD 255 >UniRef50_B1ZQR9 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B1ZQR9_OPITP Length = 349 Score = 101 bits (250), Expect = 4e-20, Method: Composition-based stats. Identities = 35/317 (11%), Positives = 73/317 (23%), Gaps = 59/317 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK--YKWTSP 58 W GH + + A L + V+ ++ L PD+ R+ K + Sbjct: 26 WDYTGHRIVNQAALASLPADFPEFVRA---PAAAERIAFLAGEPDRWRNVPDLPIKHANG 82 Query: 59 L-HFIDTPD----------------------------KACNFDYERDCHDQHGVKD--MC 87 L H+ D F + ++ Sbjct: 83 LDHYCDLEHLAGAGVDPRTVSSLRFEFALTFAAGRAAHPEKFPPIDPAKNADRSREWAGF 142 Query: 88 VAGAIQNFTTQLSHYRE-------------GTSDRRYNMTEALLFLSHFMGDIHQPMHVG 134 A + +L + R N+ + + H +GD+ QP+H Sbjct: 143 APWAAAEYYGKLKSAFSYLKAYQEHGGTPVEIENARANILYLMGVMGHVVGDLAQPLHTT 202 Query: 135 FTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSD 194 + + + +H D +I + Sbjct: 203 MHHHGW-VGENPHGYSTWTGIHAWLDGGLIAQTGVTAGEVCAQVRPAHAL---------S 252 Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 VF V +A + + +++ GG Sbjct: 253 VQPRADGRDPVFVQVMDYALAQNARVEPLYQLEKAGKLAPEAADLSEARTFICEQLQVGG 312 Query: 255 IRLAMLLNNVFGASQQE 271 L + + + + Sbjct: 313 EMLGSIWLTAWRNTIPD 329 >UniRef50_B0T3S4 Putative uncharacterized protein n=5 Tax=Caulobacteraceae RepID=B0T3S4_CAUSK Length = 348 Score = 94.1 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 37/299 (12%), Positives = 78/299 (26%), Gaps = 66/299 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPL- 59 W GH M +A + ++ + PD+ + K T Sbjct: 32 WGAWGHRMVGVVAAESFPSDIPAFLRT---PEAVAAIGEYAREPDRWKGSGKIHDTDRDA 88 Query: 60 -HFIDTPDKACNFDYERDCHDQHGVKD-----------------MCVAGAIQNFTTQLSH 101 HF+D D+ + + + + +I + QL Sbjct: 89 AHFLDVDDEGKMYGGPKFSVETLPPTRADYEKALAAVGHDSWNAGYLPYSIIDGYQQLVK 148 Query: 102 YRE---------GTSDRRYNMTEA--------------LLFLSHFMGDIHQPMHVGFTSD 138 T + L +H++GD QP+H+ + Sbjct: 149 DFTYWRILQTVTKTEKDKIRKAYYVADLKRREELLVRDLGVWAHYVGDASQPLHLSVHYN 208 Query: 139 AGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLAS 198 G+ + + H ++ ++ A + + A Sbjct: 209 GWGDYPNPNGYTQSKATHGNFEGPLVKAVAVN------------------ADVEKLVPAY 250 Query: 199 WRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 ++ + + T ++ E G +D V +R+A G +L Sbjct: 251 KDCGCSIETRTVSYLTTTVGFVEPLYKLEKEGGLVATDP---RAKAFVDERLAAGATQL 306 >UniRef50_Q21JG1 Putative uncharacterized protein n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21JG1_SACD2 Length = 321 Score = 92.9 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 46/247 (18%), Positives = 73/247 (29%), Gaps = 60/247 (24%) Query: 43 WPDQVRH-------------------WYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGV 83 WPD VR YK TS H+ + N C+ ++ Sbjct: 100 WPDLVRSQKLSVLFKAVGATTPADLAAYKNYTTSTWHYHNV-FYDSNNKLLLSCNKKNRG 158 Query: 84 KDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFT------S 137 K A++ + A F H +GD HQP+H Sbjct: 159 KLYSALSALE--------SSLQSDLSISQQAIAFAFYVHLVGDAHQPLHNVSRANKHCEH 210 Query: 138 DAGGNSIDLRWFRHKSNL--HHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDD 195 D GGN+ L+ K +L H WD L A + DI ++ Sbjct: 211 DRGGNTYCLKKKGAKCSLNAHQFWD----LAAFNPVESIDIQPVKHK------------- 253 Query: 196 LASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGI 255 CG + + E+ + K + + Y ++ I R+ Sbjct: 254 ----AACGTSPAWGSYLLAEAKELVVNLYPKNDDFN---NAKYRSNAKSIAKSRIEMAAS 306 Query: 256 RLAMLLN 262 R A ++ Sbjct: 307 RTAQIMK 313 >UniRef50_D0NJT6 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NJT6_PHYIN Length = 269 Score = 91.4 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 47/293 (16%), Positives = 83/293 (28%), Gaps = 83/293 (28%) Query: 14 QGLLNDEAAHAVKMLLPEY-----VNGDLSALCVWPDQVRHW-----------YKYKWTS 57 + +L++ ++ +L + G+++ VW D V+ S Sbjct: 11 RNVLDEADVTTIESILSRWDEDFPNTGEITTTAVWMDIVKCTAESSTCLTPASPSITSIS 70 Query: 58 PLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEAL 117 H+I+ P +E D A + + Sbjct: 71 DWHYINLPLHINGDKWEDKDTDLTLRSTQSRVSARPSLS--------------------- 109 Query: 118 LFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAK--- 174 D GGNS SN H VWD L + + Sbjct: 110 --------------------DGGGNSETFTSPCVFSNPHAVWDAAGGLYSLNKWSLNIDS 149 Query: 175 -------------DINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC 221 + ++++I + + ++L + V V A E+ N A Sbjct: 150 FRPTLENASELIALLPSVQDNITFSQYVNVTYNELNTALVTNQVLREV---ALETYNFAN 206 Query: 222 KWGYKGVEAGET-------LSDDYFNSRLPIVMKRVAQGGIRLAMLLNNVFGA 267 Y ++ T S Y I KR+A G RLA++L + Sbjct: 207 TIVYSNLDLNATSSGTYPCPSASYLAMVGEISQKRIAIAGSRLAVVLKHFAAQ 259 >UniRef50_Q6MQM4 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MQM4_BDEBA Length = 356 Score = 90.6 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 86/319 (26%), Gaps = 60/319 (18%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPD-QVRH--WYKYKWTS 57 W GH CR+A L+ + ++ + LC PD + K + Sbjct: 19 WGGRGHDTICRVATFLVKEPGLKEYM----QHKPQMMGHLCNMPDFYWKSLGGDAAKLGN 74 Query: 58 PLHFIDTPD---------------------KACNFDYERDCHDQHGVKDMCVAGAIQNFT 96 HFID K F + + F Sbjct: 75 STHFIDIEVIGLDVKDITVDYKQLMTDFTGKPNKFKNDGSTIKSIPQEFGSSWWRADQFM 134 Query: 97 TQL-------------------SHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS 137 + + Y+M ++ + HF+GD QP H Sbjct: 135 RHIAGLKEDFAKAKAPTSFKEEQDNELPYNKLAYDMVVSMGLMGHFVGDNCQPFHTTADY 194 Query: 138 DAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLA 197 D + +H ++ +++ D + F + + Sbjct: 195 DG--------YAAGHGGIHAYFEDQVVGQFDGDLDYLVLKAARGMKNPEFLKPKTAIEKM 246 Query: 198 SWRECGNVFSCVNKFATESI----NIACKWGYKGVEAGETLSDDY-FNSRLPIVMKRVAQ 252 + + + + + G + A E F P+++ +A+ Sbjct: 247 KVLSVISNKEIPKILKMDPVIKKSTLVKEKGMELKTAAERQPASVAFKKMKPMIVTEMAR 306 Query: 253 GGIRLAMLLNNVFGASQQE 271 G + LA L + + ++ + Sbjct: 307 GAVLLAALWDEAYASAGKP 325 >UniRef50_A2FZN6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FZN6_TRIVA Length = 232 Score = 90.6 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 54/166 (32%), Gaps = 16/166 (9%) Query: 100 SHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTS-------DAGGNSIDLRWFRHK 152 T + + A + P ++ D G ++ + K Sbjct: 7 KSLFPQTIQGAWPINVAWKSYFGLFLEAFNPTNIANYYSNNHTEGDNNGKDFEIFYKGRK 66 Query: 153 SNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKF 212 +N+H W K + ++ + ++ D+ + +N Sbjct: 67 TNIHDFWGSLCGRLTGKYPFNSNVWSDIDK---------YAHDITLVYRNVTHYQNINDI 117 Query: 213 ATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLA 258 T+S NIA Y GV GE LSD+Y + K++A LA Sbjct: 118 LTQSYNIAKDVVYVGVNEGEILSDEYVEKCYDVTSKQLASAAFSLA 163 >UniRef50_B9EZB3 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=B9EZB3_ORYSJ Length = 170 Score = 89.5 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 48/122 (39%), Positives = 69/122 (56%), Gaps = 8/122 (6%) Query: 73 YERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMH 132 RDCH+ + MCV GAI N+T QL Y G S YN+TE+L+FL+HF+GD+HQP+H Sbjct: 28 PRRDCHNSRHQQGMCVVGAINNYTDQL--YSYGDSKSSYNLTESLMFLAHFVGDVHQPLH 85 Query: 133 VGFTSDAGGNSIDLRWFRHKSNLH-----HVWDREIILTAAKDYYAKDINLLEEDIEGNF 187 VGF D GGN+I + + +S +H D E +T DY+ ++E+ + Sbjct: 86 VGFEEDEGGNTIKVHCYAIES-IHLSCNYAYKDVEQDITLGDDYFYSRYPIVEKRLAQAG 144 Query: 188 TD 189 Sbjct: 145 IR 146 Score = 86.0 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 36/76 (47%), Positives = 51/76 (67%) Query: 200 RECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRLAM 259 + G V+ +A ESI+++C + YK VE TL DDYF SR PIV KR+AQ GIRLA+ Sbjct: 90 EDEGGNTIKVHCYAIESIHLSCNYAYKDVEQDITLGDDYFYSRYPIVEKRLAQAGIRLAL 149 Query: 260 LLNNVFGASQQEDSVV 275 +LN +FG + + +V+ Sbjct: 150 ILNRIFGEDKPDGNVI 165 >UniRef50_C1F7J9 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7J9_ACIC5 Length = 319 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 36/285 (12%), Positives = 80/285 (28%), Gaps = 56/285 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKW---TS 57 W K+GH M +A L ++ +++ L PD+ R + + + Sbjct: 29 WGKDGHKMINHLAVTSLPPSIPAFLR---SPAAVDEITYLGPEPDRWRSPAEPELDAMQA 85 Query: 58 PLHFIDTP-------------------------DKACNFDYERDCHDQHGVKDMCVAGAI 92 P H+ID + + V + Sbjct: 86 PDHYIDMELADRIAPLPRERYQYIAKLYAYIEAHPDQAREMQPTHIGFQPYISEEVWERL 145 Query: 93 QNF---TTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWF 149 ++ QL + T + + +L H++ D QP+H + + + Sbjct: 146 KSAMRDYRQLKAAGKDTMPVQQAIIFYAGWLGHYVADGSQPLHTTIEYNGW-VGPNPNHY 204 Query: 150 RHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCV 209 ++H ++ E + + + + I + W + V Sbjct: 205 TTSHHIHSQFESEFVHDNMTNAEVRQY--------MKPVEPIGDEWTQYWDYLNTTHADV 256 Query: 210 NKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 + E + + G++G +R+A G Sbjct: 257 D----EVYQLWNEHGFEGKGT---------AESRKFTAERLAAGA 288 >UniRef50_B4WCT7 Putative uncharacterized protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4WCT7_9CAUL Length = 338 Score = 89.1 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 40/299 (13%), Positives = 76/299 (25%), Gaps = 66/299 (22%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHW--YKYKWTSP 58 W GH + A L + +K D+ L PD+ + + Sbjct: 23 WGNTGHRLIGIAAMRALPADMPGFLKT---PGAIADVGELAREPDRWKGAGQPHDRERDT 79 Query: 59 LHFIDT-----------------PDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQL-- 99 HFID P +D + A+ + L Sbjct: 80 AHFIDLDDAGHVFDRRGMPLAELPRLKSEYDAALTKAGLDVDDAGYLPYAMIDAWQNLGR 139 Query: 100 -----------SHYREGTSDRRYNMTEALL----------FLSHFMGDIHQPMHVGFTSD 138 + + + L + H++GD QP H + Sbjct: 140 DFAYWRVLNAAERRETNMERQAWYRADRLRREALILRDIGVMGHYVGDGSQPHHTTIHYN 199 Query: 139 AGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLAS 198 G + F + H +++ A+ D + + A+ Sbjct: 200 GWGEFPNPEGFTNSRQTHALFEGAFTNRVAR------------------LDAVEAAMPAA 241 Query: 199 WRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIRL 257 + +V + + T ++ + SD V +R+A G L Sbjct: 242 DLDGFDVKARTVSYLTTTLGTVIPFYRLEKAGAFRDSDP---RGAAFVNERLAAGAAEL 297 >UniRef50_C5SFS5 Putative uncharacterized protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SFS5_9CAUL Length = 339 Score = 88.3 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 66/247 (26%), Gaps = 43/247 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWT--SP 58 W GH + A L ++ GD+ PD + K Sbjct: 24 WGPTGHRIVGEEAARALPAYMPEFLR---SAQGVGDIGFYSNEPDAWKGAGKVHDFERDS 80 Query: 59 LHFID---------------TPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLS--- 100 HFID P +FD + K + A+ + Q+ Sbjct: 81 AHFIDLDDDGKTLAGVRLQEVPQSRSDFDALLRSKNVMPWKSGYLNYALIDAWQQVVKDF 140 Query: 101 ---------HYREGTSDRRYNMTEALL-----------FLSHFMGDIHQPMHVGFTSDAG 140 E R+ + EA+ LSH++GD QP+H+ + Sbjct: 141 AYWRGMTYLEAHESDPKRKAWLKEAIRRREALTLRDIGILSHYVGDSSQPLHLSIHYNGW 200 Query: 141 GNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWR 200 G +H + + + + L E + +W+ Sbjct: 201 GKEYPNPQTFTLEPIHGPLESAFVSANINNEDVRAAMLASEPCTLAVERCFDAKLERNWK 260 Query: 201 ECGNVFS 207 ++ Sbjct: 261 YVTPLYE 267 >UniRef50_D0XMV2 Putative uncharacterized protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XMV2_9CAUL Length = 348 Score = 87.5 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 61/211 (28%), Gaps = 45/211 (21%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHW--YKYKWTSP 58 W GH A L DE ++ ++ L PD+ + + Sbjct: 28 WGSTGHRTIGVAAVRALPDELPAFLRT---PGAAAEIGELSREPDRTKGAGQPHDRERDT 84 Query: 59 LHFIDTPDK-----------------ACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH 101 HF+D D +D + + AI + QL+ Sbjct: 85 AHFVDLDDDGHVMNASGPTLSQLPELKSQYDAQLAAAGIAVNDAGYLPYAIMDGFQQLAR 144 Query: 102 YRE-------------GTSDRRYNMTEALL----------FLSHFMGDIHQPMHVGFTSD 138 + R + + L +LSH++GD QP H+ + Sbjct: 145 DFATWRVLNAAEAREADPAKRAWYREDRLRREALILRDMGYLSHYVGDGSQPHHMSIHYN 204 Query: 139 AGGNSIDLRWFRHKSNLHHVWDREIILTAAK 169 G+ + F + + H ++ I + Sbjct: 205 GWGDYPNPEGFTNARSTHGAFEGAFIRRNLR 235 >UniRef50_A2G9R8 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G9R8_TRIVA Length = 181 Score = 85.2 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 46/137 (33%), Gaps = 8/137 (5%) Query: 135 FTSDAGGN--SIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIW 192 D GGN I+ + +++H WD ++ A T I Sbjct: 2 PNGDRGGNLYHINCPYGAACNHIHFFWDAIVLNYMLMKPTASLYRNEFIKNVTRLTKEIT 61 Query: 193 SDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQ 252 L + ++ ES+ A K+GY + + Y+ RVA Sbjct: 62 ESSLNL-----DKTVDPMAWSMESLEYAKKYGYS-TPINDAPNASYYEIVRKYGSIRVAM 115 Query: 253 GGIRLAMLLNNVFGASQ 269 G RL LL+++ + Sbjct: 116 AGHRLGYLLDSLLDKAP 132 >UniRef50_C7J139 Os04g0636400 protein n=2 Tax=Oryza sativa RepID=C7J139_ORYSJ Length = 141 Score = 83.7 bits (205), Expect = 6e-15, Method: Composition-based stats. Identities = 51/64 (79%), Positives = 56/64 (87%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WSKEGH++TCRIAQ LL AAHAV+ LL E +GDLSALCVWPDQVRHWYKY+WTSPLH Sbjct: 30 WSKEGHMLTCRIAQDLLEPAAAHAVRNLLTEEADGDLSALCVWPDQVRHWYKYRWTSPLH 89 Query: 61 FIDT 64 FIDT Sbjct: 90 FIDT 93 >UniRef50_B6KMV3 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KMV3_TOXGO Length = 632 Score = 82.5 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 37/127 (29%), Gaps = 18/127 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGD-----LSALCVWPDQV------RH 49 W EGH++ +A+ L E ++ +L E+ L VW D V R+ Sbjct: 27 WHDEGHMLVAAVAKEYLKPETVEKIEYILSEWSPQYPTTSTLETAAVWLDHVACSMPGRY 86 Query: 50 W------YKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYR 103 + P H+ N E Q + + L + Sbjct: 87 CRGFLGLDDIRIFKPWHYTSNVFNPQNLTLEPLYEVQPYPQTGSS-WILLKSYESLRNCT 145 Query: 104 EGTSDRR 110 + + Sbjct: 146 GDSRASQ 152 Score = 61.3 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 53/172 (30%), Gaps = 51/172 (29%) Query: 123 FMGDIHQPMHV-------GFTSDAGGNSIDLRWFR------------------------- 150 GD HQP+H D GGN+I + R Sbjct: 276 IYGDAHQPLHATETYSKAFPNGDFGGNNISIVLPRSEKMLENYPSTPEEFPEVGAEAHRG 335 Query: 151 ----HKSNLHHVWDREIILTAAKDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVF 206 H+ +LH WD + +Y D++ L+++ + ++ D F Sbjct: 336 SGVPHRQSLHSQWDGAFGQYNSL-FYEVDLDELKKEAQRLV--RLYPVD----EHAKRTF 388 Query: 207 SCVNKFATESINIACKWGYKGVE--------AGETLSDDYFNSRLPIVMKRV 250 + + + ES +A + E S +Y + K++ Sbjct: 389 ADFHGISIESSMLARSHVFSEFEWSTFSASSLPYHPSVEYIEKSKKVCEKQI 440 >UniRef50_Q028C4 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q028C4_SOLUE Length = 352 Score = 76.8 bits (187), Expect = 9e-13, Method: Composition-based stats. Identities = 38/311 (12%), Positives = 78/311 (25%), Gaps = 82/311 (26%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYK---WTS 57 W GH + A + + ++ L + G L + PD R + Sbjct: 22 WGVRGHTVANLAALEGITQDGPAFLR--LQKAYIGHLGTI---PDTWRSPSEPYLRISED 76 Query: 58 PLH--------FIDTPDKA----------------------------------------- 68 H FI P + Sbjct: 77 ANHGWYTEGFDFIPNPPHSRTEFTLRVYDEYLKNKSKDPERAKLLNIRYTGLQAYSIIEG 136 Query: 69 -----CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHF 123 R+ + + + L+ + ++ + ++ H+ Sbjct: 137 YERMKAGMRLYRNVSGPEEANRVNIGSIYAAISPTLADRAQVQQMLANDIAFYMGWVGHY 196 Query: 124 MGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEEDI 183 + D QP+H D + D + + N+H ++ + + +D++ Sbjct: 197 VADAAQPLHNSIHHDGW-SGADPKGYTRDPNIHGRFESQYLDLIGVT--EEDVDKYMRK- 252 Query: 184 EGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRL 243 E D +W L E + Y+ G + Sbjct: 253 EPRLLDNVWKAVLDHSLEARGFT---------------EEVYRLDLRGA-FTKKDDAEAR 296 Query: 244 PIVMKRVAQGG 254 +V KR+A G Sbjct: 297 ELVCKRLAAGA 307 >UniRef50_D1ZW87 Whole genome shotgun sequence assembly, contig_886 (Fragment) n=2 Tax=cellular organisms RepID=D1ZW87_SORMA Length = 159 Score = 76.0 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 40/123 (32%), Gaps = 15/123 (12%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHAVKMLL--------PEYVNGDLSALCVWPDQVRH-WYK 52 + GH IA+ + E A+ +L P + VW D V+ + Sbjct: 42 WEYGHQSVATIARLNVRSETRAAIDRILRHQALLETPTCPARTIEEASVWADCVKPLGER 101 Query: 53 YKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYN 112 + + H+ + FD + C D CV+ I+ L + ++ Sbjct: 102 FSYAYSWHYQNVDVCRP-FDLKAACKD-----GNCVSAQIERDVKLLKDPKVPMREKVLA 155 Query: 113 MTE 115 + Sbjct: 156 LAF 158 >UniRef50_Q4Q7F8 Class i nuclease-like protein n=4 Tax=Leishmania major RepID=Q4Q7F8_LEIMA Length = 180 Score = 76.0 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 32/79 (40%) Query: 195 DLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGG 254 + ++ E V ES A Y GV G TLSD Y + R+ GG Sbjct: 88 ETYTFPEALRTLVDVVAIHEESHMFAVNTSYPGVTPGATLSDAYLARCKRVAEARLTLGG 147 Query: 255 IRLAMLLNNVFGASQQEDS 273 RL LLN + + +++ Sbjct: 148 YRLGYLLNELLPSIPVDEA 166 >UniRef50_B9TFK5 Putative uncharacterized protein (Fragment) n=1 Tax=Ricinus communis RepID=B9TFK5_RICCO Length = 228 Score = 74.1 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 74/235 (31%), Gaps = 64/235 (27%) Query: 47 VRHWYKYKWTSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 V + S H+ D P + +++ G D + ++ L T Sbjct: 2 VAYTTANPKHSEYHYTDVPFQLAHYEDH-----GVGTTDHDIVQTLKQCIAVLQGKGNAT 56 Query: 107 SD-RRYNMTEALLFLSHFMGDIHQPMHVGFTS----------------------DAGGNS 143 ++ + +ALL L+H GDI QP+HVG GGN+ Sbjct: 57 TNPHNFTPRQALLMLTHLTGDIAQPLHVGEGYVGKNGGFVVPTQKQLDDKEAFATQGGNN 116 Query: 144 I---DLRWFRHKSNL------------------------HHVWDREIILTAAKDYYAKDI 176 + D++ S L H WD ++ A + A+ Sbjct: 117 LQLDDIKLTAKSSELIPAAAPDDSKPAAPARTPQATRAFHSYWDTTVVNYAFRRIGARTP 176 Query: 177 NLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAG 231 + + + + G+ + +A +++ +A K Y V G Sbjct: 177 EQFAQM--------VSAGNPVVAPNSGDPVTWPYAWADQTLVVA-KLAYADVVPG 222 >UniRef50_C2G3H0 Possible S1/P1 Nuclease n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2G3H0_9SPHI Length = 100 Score = 71.7 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 3/67 (4%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W GH + IA+ L ++A + L+ + L+ WPD V+ + + TSP H Sbjct: 23 WGMTGHRVVTEIAERHLTNKAKKNIAKLIGK---QHLAYWANWPDFVKSDHAFDETSPFH 79 Query: 61 FIDTPDK 67 +I+T Sbjct: 80 YINTEGN 86 >UniRef50_B0RM73 Exported putative nuclease n=3 Tax=Xanthomonas campestris pv. campestris RepID=B0RM73_XANCB Length = 342 Score = 70.2 bits (170), Expect = 6e-11, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 50/206 (24%), Gaps = 41/206 (19%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYK---WTS 57 W K H A L D+ +K ++ V PD R + Sbjct: 24 WGKRAHAAIDTAAIQALPDDGPVFLKR-----HVQVIADGAVLPDGWRSESEPFLKIEED 78 Query: 58 PLH--------FIDTP-----------DKACNFDYERDCHDQHG-------------VKD 85 P H F+ P RD + Sbjct: 79 PNHGWFREQFAFMQNPPRSRYAFVLALYDEQRRLALRDPAAAERMNVRWAGTLPYAATEG 138 Query: 86 MCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSID 145 A L E T + + + +H++GD QP H D + Sbjct: 139 YERIVATMRQIRALRAAGEDTRELERTCAFLVSWFAHYIGDGAQPQHDSIHHDGWQ-GAN 197 Query: 146 LRWFRHKSNLHHVWDREIILTAAKDY 171 + +H ++ + + A Sbjct: 198 PHGYSIDPKVHGKFESDYVDKIALTP 223 >UniRef50_A2G9R9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2G9R9_TRIVA Length = 115 Score = 69.8 bits (169), Expect = 9e-11, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 32/108 (29%), Gaps = 7/108 (6%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEY--VNGDLSALCVWPDQVRHWYKYKWTSP 58 W H+ IA G L+ + + + L+ + W D ++ YK+ Sbjct: 12 WWAHAHMAITEIALGHLSSKKINKLYELINRDGLPFQSVVDSSAWQDDLKDTYKFHAIGD 71 Query: 59 LHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGT 106 HF D P + V + + L+ + Sbjct: 72 WHFSDNPIY-----MNKTIPAIIPNPSYNVTSFLYDALDTLNDPTTTS 114 >UniRef50_UPI00016C48C1 hypothetical protein GobsU_04989 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C48C1 Length = 288 Score = 68.7 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 50/192 (26%), Gaps = 28/192 (14%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRH---WYKYKWTS 57 W GH A L D + L+ PD+ ++ + + Sbjct: 26 WWSGGHETVAAAAAARLPDGVPEFFRN-----GGKHLAHFSGDPDRWKNREMTFLRRAEE 80 Query: 58 PLHFIDTPDKACNFDYERDCHD----------QHGVKDMCVAGAIQNFTTQLSHYREGTS 107 HF+D D +D + K + AI + +L+ Sbjct: 81 GNHFLDLEDLDGKKYPATHRYDGLKMVYGELKKEPNKVGTLPYAIVEYYEKLTVGFYDHR 140 Query: 108 DRRYNMTEALLFLS------HFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDR 161 + + + L H+ GD P+H D G N + +H D Sbjct: 141 KAPKDTSVPMKCLVYGGTLAHYTGDAAMPLHTTRDFD-GRN--QPDGTVKQKGIHAKVD- 196 Query: 162 EIILTAAKDYYA 173 Sbjct: 197 GFPEKNKITPEE 208 >UniRef50_C5GNE5 Predicted protein n=1 Tax=Ajellomyces dermatitidis ER-3 RepID=C5GNE5_AJEDR Length = 380 Score = 65.6 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 35/114 (30%), Gaps = 9/114 (7%) Query: 8 MTCRIAQGLLNDEAAHA-------VKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 + IA LL+ A +K ++ +G + W D+ + K + H Sbjct: 86 VIPLIA--LLSPSAQAWGTKTNRIIKHIVEPQYDGSIGRAAAWADECGRTDEGKDSPTWH 143 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMT 114 +I+ D ++ + + L++ + N T Sbjct: 144 YINPADNPPAYETFTTTGTALSRDALSKPLQMPQSRLSLAYMPSNLENSNMNRT 197 >UniRef50_B5YKD8 Putative uncharacterized protein n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YKD8_THEYD Length = 262 Score = 63.7 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 38/127 (29%), Gaps = 16/127 (12%) Query: 27 MLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFID-------TPDKACNFDYERDCHD 79 + + + PD +R Y +P H+ D TP+ F + Sbjct: 32 AYIAKKAGIRIPEAACMPDIIRDE-NYDLLAPFHYHDASPDTVVTPEYIDKFGIKEAFLL 90 Query: 80 QH--------GVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGDIHQPM 131 + I ++ D L+ ++H++GD+ QP+ Sbjct: 91 VDGKNFRISVPHPAGVLYWKIVQIYEKMKSLDRTKPDNVLAYEYYLVSIAHYIGDLSQPL 150 Query: 132 HVGFTSD 138 H D Sbjct: 151 HNFPYGD 157 >UniRef50_C9YFD0 Putative uncharacterized protein n=2 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YFD0_9BURK Length = 79 Score = 63.3 bits (152), Expect = 9e-09, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 23/52 (44%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYK 52 W +GH + +A+ L+ A V LL + L+++ W D+ R Sbjct: 26 WGSDGHKIVAMLAEAQLSPAARKEVDRLLAQEPGATLASISTWADEHRSPAT 77 >UniRef50_B1MDJ0 Putative uncharacterized protein n=1 Tax=Mycobacterium abscessus ATCC 19977 RepID=B1MDJ0_MYCA9 Length = 728 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 65/248 (26%), Gaps = 67/248 (27%) Query: 1 WSKEGHVMTC-----RIAQGLLNDEAAHAVKML---LPEYVNGDLS----ALCVWPDQV- 47 W + GH I L + + L E ++ PD + Sbjct: 376 WGQTGHYSIATFTLDAIRSPNLKTLMQANLDAISFSLSELDPKSIAQRLKEARSNPDGII 435 Query: 48 ----------------------RHWYKYKWTSPL---HFIDTPDKACNFDYERD------ 76 H Y+ P H+ D + + RD Sbjct: 436 PLADVPDLVWKNLPNKVVGGRDDHMVGYRSQGPEHPCHYADIDEPGPDGSIVRDLCLQDI 495 Query: 77 --------------CHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSH 122 + K + + F + + + ++ L L+H Sbjct: 496 ANLTVTKWQQFYDERGHRTPDKRGLLPFRVWQFYDAMVGFAKSRQVDQFVCAAGL--LAH 553 Query: 123 FMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLLEED 182 ++GD QP+H + +D + +H ++ ++I A+ A L Sbjct: 554 YVGDASQPLHGSYLADG-------YPDGTGAGVHSCYESKMIDRYARQLVAAIPADLATL 606 Query: 183 IEGNFTDG 190 + D Sbjct: 607 GDLELIDD 614 >UniRef50_C5JC63 Predicted protein n=1 Tax=Ajellomyces dermatitidis SLH14081 RepID=C5JC63_AJEDS Length = 303 Score = 54.8 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 34/97 (35%), Gaps = 11/97 (11%) Query: 8 MTCRIAQGLLNDEAAHA-------VKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 + IA LL+ A +K ++ +G + W D+ + K + H Sbjct: 86 VIPLIA--LLSPSAQAWGTKTNRIIKHIVEPQYDGSIGRAAAWADECGRTDEGKDSPTWH 143 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTT 97 +I+ P R + V + C G++ + Sbjct: 144 YIN-PADNAGTKNGR-VLNGLPVVNGCAEGSVADVED 178 >UniRef50_C8X622 Putative uncharacterized protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8X622_NAKMY Length = 765 Score = 51.7 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 44/333 (13%), Positives = 86/333 (25%), Gaps = 72/333 (21%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAV-----KMLLPEYVNGD------------------- 36 W K GH +A + + + Sbjct: 414 WGKTGHYTLATVACAQVVTPTLRTLMAANQDRISFPAAGLSPGDIDQATKDAKQHGGFVP 473 Query: 37 LSALC--VWPDQVRHWYKYKWTSPL-------HF--IDTPDKACNFDYERDC-------- 77 L+ + +W + + TSP H+ ID P A + C Sbjct: 474 LADVADVIWKNLAGQVRGGRDTSPRTGPEHPTHYADIDEPRPADHLTLRALCMQDPANVA 533 Query: 78 -----------HDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMGD 126 +Q + + F + RY L ++H++GD Sbjct: 534 VGVWQAFYDALGEQASRDRGLLPFRVWQFYDAMLDALAQDDLVRYLAAAGL--MAHYVGD 591 Query: 127 IHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKD---INLLEEDI 183 QP+H +D +H ++ +I A D A + L Sbjct: 592 ACQPLHGSTLADG-------LPDGTGKGVHSAYESAMIDHHAADILAALLGRLQDLAAHP 644 Query: 184 EGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRL 243 G + + ++ + + ++ ++ Sbjct: 645 LPPVASGQQAAVATVALMDRTATAIPPV------DLVNAYAATPGGQSKAVTGKLWDRFG 698 Query: 244 PIVMKRVAQGGIRLAMLLNNVFGASQQEDSVVA 276 P + +A G LAML ++ + Q + A Sbjct: 699 PATVSVLADGARTLAMLWDSAWTQGQGDTRFTA 731 >UniRef50_B5YDN6 Putative uncharacterized protein n=1 Tax=Dictyoglomus thermophilum H-6-12 RepID=B5YDN6_DICT6 Length = 250 Score = 44.8 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 38/266 (14%), Positives = 74/266 (27%), Gaps = 75/266 (28%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WS + H + A L E + L E L V PD++ Sbjct: 29 WSAKTHQKIAKEALYSLPKEYQRKLSPYLDE-----LLEGSVAPDRI------------- 70 Query: 61 FIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY-NMTEALLF 119 + D + + ++ + + L + + + L Sbjct: 71 YKDFNNHVFHVHGDKGKGPEEVREKY------------LEIISLIQEGKSWRLVAFQLGV 118 Query: 120 LSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDYYAKDINLL 179 LSH++ D++ P+H D Y KD +++ Sbjct: 119 LSHYIADLNNPLHT--------------------------DSSKREDEFHSKYEKDADVI 152 Query: 180 EEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYF 239 I+ + + +SI A ++ YK +E LS D F Sbjct: 153 NPKIKSELIYIKY----------------PASYILDSIFSANRF-YKDIEKAY-LSGDKF 194 Query: 240 NSRLPIVMKRVAQGGIRLAMLLNNVF 265 I +++ + + A + Sbjct: 195 RDVSKITQEQIDKAALDTASYFYSAL 220 >UniRef50_Q97KA0 Phospholipase C related protein n=2 Tax=Clostridium RepID=Q97KA0_CLOAB Length = 245 Score = 44.0 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 32/245 (13%), Positives = 62/245 (25%), Gaps = 40/245 (16%) Query: 6 HVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLHFIDTP 65 H +A +L ++ ++ VW DQ +K T+ HF D Sbjct: 35 HKYINSLAVEILKNKRKAKQYKFFS-DNIEAINEGTVWADQ-----DFKSTN--HFFDFE 86 Query: 66 DKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRYNMTEALLFLSHFMG 125 + + + Q + +Y ++ L H + Sbjct: 87 KGRGLYGFS------------NLVDEAQKYYNMSLNYLRAGDKKKSL--FYLGAACHIIQ 132 Query: 126 DIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY-YAKDINLLEEDIE 184 D P HV N H ++ II Y + K ++ D Sbjct: 133 DSTVPQHV---------------NNRLLNSHRNFEMWIIQKFLSGYRFMKADEIIRSDNT 177 Query: 185 GNFTDGIWSDDLASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDY--FNSR 242 ++ + +C + ++ S I C+ + Y Sbjct: 178 RDYIRKNAKVANKIYNKCFTIKDKEARYDAISNYIICQAQMSTAGLMMDYYEKYEVIYKN 237 Query: 243 LPIVM 247 Sbjct: 238 HSTSE 242 >UniRef50_P59026 Phospholipase C n=6 Tax=Clostridium RepID=PHLC_CLOHA Length = 399 Score = 43.2 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 67/233 (28%), Gaps = 23/233 (9%) Query: 2 SKEGHVMTCRIAQGLLNDEAAHA----VKMLLP--EYVNGDLSALCVWPDQVRHWYKYKW 55 H + A +L ++ VK E L +PD + K Sbjct: 34 GTGTHALIVTQAVEILKNDVISTSPLSVKENFKILESNLKKLQRGSTYPD---YDPKAYA 90 Query: 56 TSPLHFIDTPDKACNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSH--YREGTSDRRYNM 113 HF D PD NF + + +G+ + ++ + + + Sbjct: 91 LYQDHFWD-PDTDNNFTKDSKWYLAYGINET-GESQLRKLFALAKDEWKKGNYEQATWLL 148 Query: 114 TEALLFLSHFMGDIHQPMH---VGFTSDAGGNSIDLRWFRHKSN--LHHVWDREIILTAA 168 + L H+ GD H P H V AG + K + LH + Sbjct: 149 GQGL----HYFGDFHTPYHPSNVTAVDSAGHTKFETYVEGKKDSYKLHTAGANSVKEFYP 204 Query: 169 KDYYAKDINLLEEDIEGNFTDGIWSDDLASWRECGNVFSCVNKFATESINIAC 221 +++ + + + A + + ATE+++ Sbjct: 205 TTLQNTNLDNWITEYSRGWAKKAKNMYYAHATMSHS-WKDWEIAATETMHNVQ 256 >UniRef50_B8E180 Putative uncharacterized protein n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8E180_DICTD Length = 260 Score = 42.5 bits (98), Expect = 0.017, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 44/136 (32%), Gaps = 33/136 (24%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 WS + H + A L E + L E L V PD+V Sbjct: 39 WSAKTHQKIVKEALYSLPKEYQRKLVPYLDE-----LLEGSVAPDKV------------- 80 Query: 61 FIDTPDKACNFDYERDC-HDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY-NMTEALL 118 + D + + + ++ K + IQ + + + L Sbjct: 81 YRDFDNHIFHVHGNKGRGPEKVREKYYEIISLIQ-------------ERKPWRLIAFQLG 127 Query: 119 FLSHFMGDIHQPMHVG 134 LSH++ D++QP+H Sbjct: 128 VLSHYIADLNQPLHTD 143 >UniRef50_B9XJQ5 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XJQ5_9BACT Length = 86 Score = 41.7 bits (96), Expect = 0.028, Method: Composition-based stats. Identities = 14/66 (21%), Positives = 21/66 (31%) Query: 197 ASWRECGNVFSCVNKFATESINIACKWGYKGVEAGETLSDDYFNSRLPIVMKRVAQGGIR 256 A W E + + G E +L DY + + +R A G R Sbjct: 17 AEWSEFTGLERSRAVALDKGYLQGELKGSTSPERAHSLPGDYTKNAKAVAERRAALAGYR 76 Query: 257 LAMLLN 262 LA + Sbjct: 77 LADEIQ 82 >UniRef50_Q01YQ6 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01YQ6_SOLUE Length = 252 Score = 40.5 bits (93), Expect = 0.068, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 53/207 (25%), Gaps = 36/207 (17%) Query: 1 WSKEGHVMTCRIAQGLLNDEAAHAVKMLLPEYVNGDLSALCVWPDQVRHWYKYKWTSPLH 60 W H R A L A V L PE + C++PD+ Y+ + Sbjct: 19 WDTTPHQKITRAALDSLP---AKFVNRLGPESKLL-IDLYCIYPDR------YQEMAEFG 68 Query: 61 FIDTPDKA---------CNFDYERDCHDQHGVKDMCVAGAIQNFTTQLSHYREGTSDRRY 111 F C H G + + F +S+ E Sbjct: 69 FTRKSAGPQDASEIAVYCVRPDGEAIHGATGDWETDAGSLVYLFERIVSNLAEHRPREAA 128 Query: 112 NMTEALLFLSHFMGDIHQPMHVGFTSDAGGNSIDLRWFRHKSNLHHVWDREIILTAAKDY 171 LSHF+ D P H H V +R + ++ Sbjct: 129 RFA---GVLSHFIADSLSPPHSAPDEHR--------------EFHAVIERSVPDFTLRNR 171 Query: 172 YAKDINLLEEDIEGNFTDGIWSDDLAS 198 + D +++ A+ Sbjct: 172 APRLAADHLLPAAKGSFDQLYAAAAAN 198 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.124 0.357 Lambda K H 0.267 0.0377 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,313,067,110 Number of Sequences: 3077464 Number of extensions: 37084649 Number of successful extensions: 148367 Number of sequences better than 1.0e-01: 179 Number of HSP's better than 0.1 without gapping: 420 Number of HSP's successfully gapped in prelim test: 102 Number of HSP's that attempted gapping in prelim test: 145917 Number of HSP's gapped (non-prelim): 616 length of query: 277 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 150 effective length of database: 649,558,428 effective search space: 97433764200 effective search space used: 97433764200 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 92 (40.2 bits)