The Business Challenge:
A global pharmaceutical company recognised the potential of the huge volumes of bioassay data that they had generated, but struggled to gain insights from this valuable resource. A lack of standardisation across their data repositories, including LIMS and other bioassay databases, had resulted in many different ways of describing the same thing, for example ‘mouse’, ‘mice’, ‘Mus musculus’ and ‘m. musculus’, making it hard to collate data for a particular species. This was compounded by the fact that some database fields were sparsely populated, while others contained useful information buried in long assay descriptions.
The SciBite Solution:
We enriched our species, gene and bioassay vocabularies with customer-specific terms and synonyms to ensure all relevant information would be recognised. We then analysed the assay names from the legacy database, extracted the different entities within each one, and mapped each entity to a single, standard vocabulary term to normalise the data.
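To make the idea of mapping synonyms to a single standard term concrete, here is a minimal Python sketch of dictionary-based normalisation. It is an illustration only and not SciBite's implementation: the `SYNONYMS` table, the function names and the sample assay names are all hypothetical, and a production vocabulary would be far larger and handle ambiguity, which this sketch does not.

```python
import re

# Hypothetical synonym table: preferred term -> known variants.
# A real vocabulary would be much larger and curated.
SYNONYMS = {
    "Mus musculus": ["mouse", "mice", "mus musculus", "m. musculus"],
    "Homo sapiens": ["human", "humans", "homo sapiens", "h. sapiens"],
}


def build_matcher(synonyms):
    """Return a compiled pattern and a variant -> preferred-term lookup."""
    lookup = {}
    for preferred, variants in synonyms.items():
        for variant in variants:
            lookup[variant.lower()] = preferred
    # Try longer variants first so multi-word names are not split up.
    ordered = sorted(lookup, key=len, reverse=True)
    alternation = "|".join(re.escape(v) for v in ordered)
    # The lookarounds stop matches inside longer words, e.g. "humanised".
    pattern = re.compile(r"(?<!\w)(?:" + alternation + r")(?!\w)", re.IGNORECASE)
    return pattern, lookup


def extract_species(text, pattern, lookup):
    """Return the distinct standard species terms mentioned in text."""
    found = []
    for match in pattern.finditer(text):
        preferred = lookup[match.group(0).lower()]
        if preferred not in found:
            found.append(preferred)
    return found


PATTERN, LOOKUP = build_matcher(SYNONYMS)

if __name__ == "__main__":
    # Hypothetical assay names of the kind found in a legacy database.
    for name in ["Cytotoxicity assay, mice, liver",
                 "Binding assay in M. musculus hepatocytes"]:
        print(name, "->", extract_species(name, PATTERN, LOOKUP))
```

Once every variant resolves to the same standard term, records can be grouped by species with a simple equality check rather than by guessing at spellings.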
Key Business Benefits:
Find out more about how our Ontology Services can benefit your business.
© SciBite Limited / Registered in England & Wales No. 07778456