TERMite - SciBite



TERMite: Text analysis engine

Release the potential of your data by unlocking vital information from scientific text with our named entity recognition and extraction engine.

Is the data you need tied up inside complex collections of electronic documents? Sifting through unconsolidated, ambiguous data can be incredibly time-consuming, and there’s no guarantee that you’ll find what you’re looking for. TERMite gives you the ultra-fast capability to extract vital data with ease by tagging, annotating and organizing your unstructured content, turning it into rich, machine-readable data.

Why choose TERMite?

Fast from day one

Start rapidly processing millions of documents in minutes without the need for any pre-indexing or complex set-up procedures.

Powerful

Indexes at up to one million words per second, with the ability to scan billions of documents and handle large-scale document processing on systems like Hadoop.

Accurate

Precisely tag and link scientific terms within unstructured scientific text using SciBite’s VOCabs containing 20M+ synonyms across 80+ science topics.

Overview

TERMite enables scientists and researchers to scan millions of publications, patents, reports and any other document type to uncover the information they need most.

By pairing powerful technology with our hand-curated VOCabs, TERMite recognizes and extracts relevant terms in scientific text. This makes it easier than ever before to find vital facts, key entities and important textual content.


How TERMite can help you

Data-mine millions of documents with ease

TERMite gives professionals the ability to locate important information by identifying critical mentions and relationships across a wide range of documents, from literature, patents and grants to your internal documents.

Increase the accuracy of internal search tools

Your existing search portals can be greatly enhanced with TERMite’s ability to find key entities more accurately. Not only does this help to deliver higher levels of performance and productivity, but it can also increase the overall satisfaction of your team.

Ideal for a range of roles

TERMite gives anyone who produces scientific text, or who supplies IT systems that hold such text (ELNs, project management tools, industry databases), the opportunity to enrich their content for improved search and navigation.

Integrated, flexible and simple

Designed to fit seamlessly into your existing analysis workflow, TERMite allows users to run one-off analyses or complete routine analyses as part of their existing processes. With constant updates, you’ll also have access to the latest product upgrades and expanded infrastructure support – one less thing to worry about.

TERMite 6.6.3 updates

This release introduces a range of new vocabularies, updates to existing vocabularies, and important bug fixes, all designed to enhance your semantic annotation, term disambiguation, and data integration workflows.

What's New

New vocabularies:
- MONDO (Pack=Clinical): A comprehensive disease ontology based on the Mondo Disease Ontology, harmonizing disease definitions across susceptibility, injury, and disease branches
- EMTREE_PERSON (Pack=Emtree): Focuses on named groups of persons, including age and sex categories, derived from the 2024.03 version of Emtree
- UNIPROTMOUSE (Pack=GenPhen): A new vocabulary of mouse proteins sourced from UniProt
- TAXPATH (Pack=Core): Human pathogen taxonomy, replacing the outdated PATHOGEN vocabulary with expanded coverage (>23,000 entities)
- TAXVERT (Pack=Core): Human vertebrate taxonomy from NCBI Taxon, version 2024-07-03

Key vocabulary updates:
- Significant updates to HGNC, ChEMBL, Cellosaurus, MedDRA, and more, with new concepts, re-organized branches, and improved mappings to public ontologies
- Major expansion of the PKPD vocabulary, nearly doubling its size, with new branches and terms
- Updated content for EMTREE branches, including procedures, healthcare, devices, organisms, and diseases, with hundreds of new concepts, including SARS-CoV-2 variants and other recent classifications

Bug fixes & security enhancements:
- Resolved issues related to TERMite batch processing and server validation errors
- Addressed the latest CVEs to ensure platform security and stability

Deprecated vocabulary:
- The PATHOGEN vocabulary has been deprecated and replaced by the new TAXPATH

Additional notes:
- The new Meddevice pack has been introduced for medical device vocabularies, starting with the MEDDEVICE vocabulary, with plans for future expansion
- The Help menu now includes a direct link to SciBite Academy for customer training

Take a closer look at TERMite

The latest version of TERMite can make your research smarter and faster with up to 23 NER machine learning models and one-click integration with CENtree that enables editing of publicly available ontologies. For more details on what TERMite can do for you, please read our datasheet. For details on our support framework, visit our Service Level Agreement page.


Relevant resources, events and news

https://scibite.com/knowledge-hub/resources/semantic-analytics-integrated-approach/

Resource: Semantic analytics: Integrated approach for pharmacovigilance teams to achieve awareness [Whitepaper]

SciBite provides a resource-effective solution to the challenges faced by Pharmacovigilance teams by unlocking the potential of unstructured biomedical content.

TERMite

https://scibite.com/knowledge-hub/resources/scibite-rdf-a-natural-semantic-fit/

Resource: SciBite and resource description framework (RDF): A natural (semantic) fit [Whitepaper]

RDF’s primary role is to give meaning to data. SciBite enables unstructured documents to be converted to RDF, aiding in subsequent analysis and mining.

TERMite

Resource: TERMite datasheet

Download our TERMite datasheet. TERMite (TERM identification, tagging & extraction) is an ultra-fast named entity recognition and extraction engine.

TERMite

https://scibite.com/knowledge-hub/resources/scibite-product-roadmap-2025-q3/

Resource: SciBite product roadmap 2025 Q3

Explore SciBite’s product roadmap for Q3 2025! Discover updates, AI-based features, and improved interoperability for our data solutions built by scientists, for scientists.

CENtree SciBite Search TERMite

Ready to talk to us?

Our experts are ready and waiting to talk to you about your business and your challenges. Once we get to know you, we’ll provide specialist advice on the best ways to save you time, money and hassle while improving the quality of your outcomes.