Examples of Tables parsed with line-based algorithm
| Diameter of zone of inhibition, mm. | |||||||
| Test bacteria | Hg(II) cystine, µg/disc | Cd(II) cystine, µg/disc | Ni(II) cystine, µg/disc | Kanamycin, µg/disc | |||
| 50 | 80 | 50 | 80 | 100 | 200 | 30 | |
| Staphytococus aureus | 17 | 28 | 22 | 35 | 11 | 17 | 19 |
| Bacillus subtilis | 18 | 31 | 28 | 39 | 12 | 15 | 24 |
| Shigella dysonteriae | 23 | 34 | 25 | 34 | 10 | 19 | 23 |
| Salmonella typhi | 21 | 32 | 24 | 36 | 5 | 11 | 17 |
| Shigella flexneriae | 25 | 31 | 21 | 28 | 15 | 20 | 25 |
| Streptococcus -β- | - | - | - | - | 7 | 12 | 21 |
| haemolyticus | |||||||
| (-) Expt. was not performed.. | |||||||
With line-based approach the last row is divided into 2 ("Streptococcus -β-" and "haemolyticus"). Caption "(-) Expt. was not performed.." was included into the table content by Grobid. It can be improved by training on more data.
| parameter | level | deaths/p-y | crude rate /1,000 p-y | unadjusted rate ratio (95% CI) | adjusted rate ratio (95% CI) |
| Moduane | 77/10,004 | 7.7 | ref | ref | |
| Manthedin g | 262/30,02 5 | 8.7 | 1.13 (0.88-1.46) | 1.44 (1.12-1.86)* | |
| Maselaphal eng | 20/1,868 | 10.7 | 1.39 (0.86-2.26) | 1.30 (0.84-2.03) | |
| Madiga | 266/36,14 5 | 7.4 | 0.96 (0.74-1.23) | 1.18 (0.92-1.52) | |
| village | Ntsima | 14/1,066 | 13.1 | 1.71 (0.97-3.01) | 1.97 (1.12-3.45)* |
| Maphoto | 36/2,497 | 14.4 | 1.87 (1.27-2.77)* | 1.68 (1.13-2.51)* | |
| Ga-Tjale | 44/5,097 | 8.6 | 1.12 (0.78-1.62) | 1.21 (0.83-1.75) | |
| Sefateng | 78/9,490 | 8.2 | 1.07 (0.78-1.46) | 1.13 (0.83-1.55) | |
| 1996-1999 | 221/32,60 3 | 6.8 | ref | ref | |
| period | 2000-2003 | 261/31,88 1 | 8.2 | 1.21 (1.01-1.45)* | 1.17 (0.98-1.40) |
| 2004-2007 | 315/31,70 9 | 9.9 | 1.47 (1.23-1.74)* | 1.35 (1.14-1.60)* | |
| sex | male female | 408/46,52 8 389/49,66 6 | 8.8 7.8 | ref 0.89 (0.78-1.03) | ref 0.66 (0.57-0.76)* |
| 15-49 yrs | 294/48,82 1 | 6.0 | ref | ref | |
| < 5 yrs | 66/10,496 | 6.3 | 1.04 (0.80-1.36) | 1.06 (0.81-1.38) | |
| age group | 5-14 yrs | 13/23,104 | 0.6 | 0.09 (0.05-0.16)* | 0.09 (0.05-0.16)* |
| 50-64 yrs | 119/8,023 | 14.8 | 2.46 (2.00-3.05)* | 2.53 (2.04-3.13)* | |
| 65 yrs & over | 305/5,749 | 53.1 | 8.81 (7.54-10.3)* | 9.71 (8.26-11.4)* | |
| * |
algorithm fails to recognize additional rows if the content of a cell spans on 2 or more rows. See the row that starts with "sex" - the content of the rows "male" and "female" is merged.
| Class | Hygiene level | ||
| Poor (%) | Good (%) | Total | |
| I | 34 (27.42%) | 90 (72.58%) | 124 |
| II | 30 (32.61%) | 62 (67.39%) | 92 |
| III | 56 (44.44%) | 70 (55.56%) | 126 |
| IV | 82 (51.25%) | 78 (48.75%) | 160 |
| Total | 202 (40.23%) | 300 (49.76%) | 502 |
| 2 ÷ = 19.95, df=3, p<0.001 | |||