lpla - Overview

Skip to content

Navigation Menu

Sign in

Appearance settings

Pinned Loading

  1. Bitextor generates translation memories from multilingual websites

    Python 302 42

  2. Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

    Python 160 22

  3. Bicleaner fork that uses neural networks

    Python 40 4

  4. Extracts plain text, language identification and more metadata from WARC records

    C++ 23 6