bitextor
Bitextor generates translation memories from multilingual websites
Changes: - GHA scripts: - Minor fixes. - New workflow that builds a Docker image and runs `run-tests-min.sh` using the built image. - Docker GHA script can now be configured...
**Describe the bug** When using either DIC or NDA as the documentAligner, the custom_translate rule still executes and crashes. I believe this is because the `zcat /data/shards/en/182/1/sentences.gz | eval "$filter_command"`...
**Is your feature request related to a problem? Please describe.** GPUs are usually not on the same machine as the web crawlers. CPU takes time and is too...
Hi guys! Wondering if you have a list of other projects or corpora that have been created with the Bitextor/Paracrawl software. I can only think of JParacrawl but I suspect...
This got out of hand, too many small changes. Sorry in advance, I know I should have split all these changes into different branches :/ Real "neural"-related changes: * Automatic...
If we assume that collisions aren't that much of a problem because most entries are just 1 or 2 anyway, could we get away with a truncated fixed size hashtable?...
Hi, is there a release of Paracrawl anywhere with Bitextor granularity "Document" instead of sentences? If not, what is the easiest way to reproduce those? Cheers.
New Bicleaner AI full models are hosted and managed by HuggingFace utilities and cache. Downloading logic is explained [here](https://github.com/bitextor/bicleaner-ai#download-a-model) and at least right now, download a full model to a...
Bumps [requests](https://github.com/psf/requests) from 2.31.0 to 2.32.0. Release notes Sourced from requests's releases. v2.32.0 2.32.0 (2024-05-20) 🐍 PYCON US 2024 EDITION 🐍 Security Fixed an issue where setting verify=False on the...
Bumps [scikit-learn](https://github.com/scikit-learn/scikit-learn) from 1.1.3 to 1.5.0. Release notes Sourced from scikit-learn's releases. Scikit-learn 1.5.0 We're happy to announce the 1.5.0 release. You can read the release highlights under https://scikit-learn.org/stable/auto_examples/release_highlights/plot_release_highlights_1_5_0.html and...