Skip to main content

Table 2 Exemplary threshold distribution. The values are for (38, 20)-minimizers, 2 errors, and 1 million reads of length 250. Shown are the distribution \(\#x\) (\(\%x\)) of the number (percentage) of minimizers reaching from \(x=14\) to \(x=35\) and the threshold t(x) using the probabilistic model from [12]. The threshold t(x) incorporates the correction term \(c_p\). On the right, the probability \(b_p(x,a)\) of having a false-positive answers from the IBF with \(p=0.05\) is shown

From: Hierarchical Interleaved Bloom Filter: enabling ultrafast, approximate sequence queries

x

#x

%x

t(x)

\(c_{0.05}\)

\(b_{0.05}(x,1)\)

\(b_{0.05}(x,2)\)

\(b_{0.05}(x,3)\)

14

6

<0.1

5

1

35.9

12.3

2.6

15

214

<0.1

6

1

36.6

13.5

3.1

16

2059

0.2

6

1

37.1

14.6

3.6

17

11,081

1.1

8

2

37.4

15.8

4.1

18

36,651

3.5

9

2

37.6

16.8

4.7

19

83,748

8.0

9

2

37.7

17.9

5.3

20

139,864

13.3

10

2

37.7

18.9

6.0

21

179,962

17.2

11

2

37.6

19.8

6.6

22

185,842

17.7

12

2

37.5

20.7

7.3

23

158,032

15.1

12

2

37.2

21.5

7.9

24

113,696

10.8

13

2

36.9

22.3

8.6

25

70,089

6.7

14

2

36.5

23.1

9.3

26

37,540

3.6

15

2

36.1

23.7

10.0

27

18,040

1.7

15

2

35.6

24.3

10.7

28

7535

0.7

16

2

35.0

24.9

11.4

29

2790

0.3

17

2

34.5

25.4

12.0

30

961

<0.1

18

2

33.9

25.9

12.7

31

343

<0.1

18

2

33.3

26.3

13.4

32

88

<0.1

19

2

32.6

26.6

14.0

33

28

<0.1

20

2

32.0

26.9

14.6

34

5

<0.1

22

3

31.3

27.2

15.3

35

2

<0.1

23

3

30.6

27.4

15.8