BATch provides command-line access to run large TERMite jobs in parallel on multi-core CPUs.
The main use-case for BATch is the processing of millions of documents such as the entire Medline database, or large numbers of patent or internal documents. For instance, on a standard 4-core CPU PC, the entire Medline database can be annotated with over 20 key life science dictionaries in under 5 hours.
BATch works across file systems such as Hadoop, enabling very large scale document processing.
Get in touch with us to find out how we can transform your data.
© SciBite Limited / Registered in England & Wales No. 07778456