Please enter your details to get this file.


Data Cleansing to Unlock the Potential of Bioassay Data

Ontology management

The Business Challenge:

A global pharmaceutical company recognised the potential of the huge volumes of bioassay data that they had generated, but struggled to gain insights from this valuable resource. A lack of standardisation across their data repositories, including LIMS and other bioassay databases, had resulted in the different ways to describe the same thing, for example ‘mouse’, ’mice’, ‘Mus musculus’ and ‘m. musculus’, making it hard to collate data for a particular species. This was compounded by the fact that some database fields were sparsely populated fields while others contained useful information buried in long assay descriptions.

The SciBite Solution:

We enriched our species, gene and bioassay vocabularies with customer-specific terms and synonyms to ensure all relevant information would be recognised. We then analysed the assay names from the legacy database and extracted the different entities within each one. Each entity was extracted and mapped to a single, standard vocabulary term to normalise the data.

Extraction of Cell Line, Drug, Species and Target entities within the unstructured titles of a selection of assays

Figure: Extraction of Cell Line, Drug, Species and Target entities within the unstructured titles of a selection of assays. The resulting semantic index enables connections to be made between bioassays

Key Business Benefits:

  • Assays are consistently and unambiguously tagged with key metadata
  • Enables the wealth of information in bioassay databases to be unlocked and exploited

Find out more about how our Ontology Services can benefit your business.

Learn more

Related articles

  1. Are Ontologies relevant in a Machine Learning-centric world?

    SciBite reflects on discussions from the Pistoia Artificial Intelligence / Machine Learning workshop and annual conference in Boston, MA.

    Read
  2. High Performance Ontology Engineering

    One of the key aims of SciBite is to help our customers work with public ontologies in text mining applications. While these ontologies are very valuable resources, they are often built for the purpose of data organisation, not text mining.

    Read

How could the SciBite semantic platform help you?

Get in touch with us to find out how we can transform your data

Contact us