Harnessing ontologies for pharma: Dr Jane Lomax on the synergy of AI and scientific expertise
2024๋ 2์ 20์ผ
์ ์: Ann-Marie Roche

Dr Jane Lomax, Head of Ontologies for SciBite at Elsevier, talks about how her team of semantic AI experts are helping R&D professionals find the information they need fast.
SciBiteโs Head of Ontologies talks about how her team is encoding scientific knowledge into software for the benefit of pharma and the public.
As Head of Ontologies for SciBite, Dr Jane Lomaxย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ has an ambitious goal: to encode scientific knowledge into software so researchers can find and connect the information they need when they need it.
โI believe ontologies have a fundamental role in leveraging the power of large language models โ for the benefit of pharma and the public at large,โ she says.
Based in Cambridge, UK, SciBite joined Elsevier in 2020. Their experts combine semantic AI with text analytics and data enrichment tools to help R&D professionals make faster, more effective decisions.
โWe get people and machines to use the same language to talk about scientific things,โ Jane explains. To do this, they take unstructured content and turns it into ordered machine-readable data for scientific discovery in the life sciences. โAnd this involves working with our expert scientific curators to encode their expertise into our software.โ
With a PhD in Population Genetics and Parasitology and over 20 years of experience with FAIRย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ data and ontologies โ โbasically, ontologies are a codification of scientific facts as we understand themโ โ Jane is a champion of those creating ontologies. โThese are the people doing the foundational work, and theyโre doing it on a shoestring,โ she says.ย
She also believes ontologies offer a methodology to leverage the power of new AI technologies such as large language models (LLMs): โWhile LLMs can bring in their natural language and summarizing skills, ontologies can provide the backbone of scientific knowledge that the LLM can use, as well as making the output explainable and reproducible.โ
Itโs time for a chat!
โHead of Ontologiesโ โ Is that a new cutting-edge title like โPrompt Engineerโ?
Ontologies have been around for much longer than LLMs, but they fit into our AI age. I lead a team of experts in building and using ontologies, which are representations โ or models โ that provide a picture of the world so we can talk about things in this world and how they relate to each other.ย
Webinar
Watch The perils, pitfalls and promise of generative AI for R&Dย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ with SciBiteโs Jane Lomax and other experts.
And what are the big problems ontologies are currently solving?ย ย
On a fundamental level, ontologies provide an agreed-upon and structured understanding of scientific language. At SciBite, we use these ontologies to help scientists extract knowledge from scientific literature. After all, thereโs just so much text in modern science. Not only are the numbers of published papers increasing, but so are the ways we can generate data. There are even whole new types of science arising. You can't possibly process it all yourself. So what we do is provide the means for scientists to be able to condense the process and ask specific questions related to their specialty.
But ontologies also help with other barriers, such as dealing with all text ambiguities. Thereโs the famous hedgehog example. A fruit fly gene was named after Sonic the Hedgehogย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ โ because the fruit fly community is hilarious when it names genes. But itโs called โhedgehogโ for short. And obviously, there's also a creature called the hedgehog. And then thereโs the actual Sonic the Hedgehog. So when looking across a wide array of documents, how do you know which one is being talked about? In such a case, you have to put rules into the software that disambiguate so you know what type of hedgehog youโre dealing with.
Disambiguating Sonic the Hedgehog sounds like fun. But it also sounds like painstaking work. How are these ontologies created and decided upon?
They arise from specialized, usually academic, communities that are basically doing it with very little support and without license restrictions. Itโs actually where I started my career.ย ย
Ontologies are only valuable if theyโre available to everyone; because theyโre (based on) a standard, all databases can talk in the same language and be interoperable. In other words, ontologies have to be open to be useful. And SciBite has built our whole business on top of these ontologies. Without them, SciBite wouldn't exist. Weโve taken these public ontologies and added our special sauce to make them more accessible and easier to use. And AI plays into that.
How exactly do ontologies link with AI?
This is a whole new application for ontologies. While AI technologies are super powerful, the output must still be verified as truth. Ontologies represent the truth as agreed upon by humans: that something is this type of thing, and it relates to these other types of things. So if you can feed that into your AI, you get the best of both worlds.
The AI still does the hard bits โ while having our underlying truths built into it. And this need for verification has only become more relevant with these emerging generative AIs. Ontologies can provide the control at the most important step of the literature review, and a consistency in information retrieval, ensuring we return the same documents each time via an explainable process.ย
โWhile AI technologies are super powerful, the output must still be verified as truth. Ontologies represent the truth as agreed upon by humans: that something is this type of thing, and it relates to these other types of things. So if you can feed that into your AI, you get the best of both worlds.โ

JL
Jane Lomax, PhD
Elsevier์ Head of Ontologies, SciBite
What initially spurred your passion for ontologies?
I got lucky. I started in parasite population genetics. It was great, but wet lab work is also prolonged and unpredictable. I was beginning to suspect such a life wasnโt for me. And actually, what I enjoyed most during my PhD was the analysis โ the bioinformatics part. So after my PhD, I started to look around in this area. And it was the time when the first bio-ontologies were just created, with Gene Ontologyย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ being the first one. And thatโs the one I ended up working for.ย
No one knew what an ontology was in the life sciences โ they often still donโt. But itโs much more widely known because there are a lot now. But at the time, no one knew what it was. So it was a shot in the dark, and it started as something tiny but then basically revolutionized the world of science. That was a great thing to be involved with.ย
Ontologies are all about classifying things and organizing the world; thatโs always been an instinct of mine. So it just meshed nicely. It also involved computer scientists, biologists, philosophers โ all of us coming together and trying to figure this out. It was a really exciting field โ and still is 20 years later.
And now, with SciBite, you are applying ontologies to bring even more order to Elsevierโs mass of scientific data.
Like Gene Ontology, SciBite also started out tiny. And we grew it into a global business with nearly 100 people that Elsevier then acquired so we can grow further and serve our customers better. Weโve got a fantastic team that works really well together. And weโre able to help our customers, mostly pharma, with a solution thanks to our skills and software. So thatโs super satisfying: Weโve gotten good at extracting understanding from data.
And now, as part of Elsevier, you can extract understanding at scale.
Thatโs precisely what we are in the process of doing: applying that across the whole of Elsevierโs suites.
And now after this perfect marriage between SciBite and Elsevier, you are bringing in a third party: large language models. Doesnโt that make things complicated?
It potentially makes everything much easier. These new technologies allow you to ask scientific questions in natural language. In turn, the LLMs will translate that into something structured and be able to request that across all these different data sources. And then they come back with something scientists can understand, complete with references. So itโs no longer a black box but a kind of explainable AI solution. You can go back to the research papers and check.ย
So this is very exciting: explainable AI with SciBite tools across Elsevierโs mass data. [S3] But yes, weโre still figuring it out.ย
โThis is very exciting: explainable AI with SciBite tools across Elsevierโs mass data.โย

JL
Jane Lomax, PhD
Elsevier์ Head of Ontologies, SciBite
And how are you coping with the speed of developments and the surrounding hype?ย
Itโs all moving super-fast and everyone is trying to find their own way. A job like prompt engineer didnโt exist a year ago. At a conference recently, someone said LLMs had the shortest hype cycle ever, especially in the life sciences, because everyone said, โOh my god, these things are amazing.โ And then, almost immediately, they changed their tune to: โWe canโt use this.โย
But I do believe ontologies will play a key part in harnessing the power of LLMs. Meanwhile, the whole community is still just feeling its way. But there will be big changes in the next couple of years. And weโre going to move fast and figure out how we fit in. It's really an exciting time to be part of this community.
Are there other aspects to LLMs that excite you?
I think the democratization of these technologies is going to be key. Before, there was this barrier: You had to be able to write in Python to access this very rich set of tools. Thatโs all changed now. People across different disciplines will now also be able to access these really powerful technologies, which is a huge democratization. And the impact on education is only just being felt. My son is doing his GCSEs (high school diploma exams) now and creating sample exam questions using ChatGPT. Heโs going to grow up with this all being normal and just part of the tools heโs able to use. So I think itโs a game changer.ย
And how do you see SciBite evolving with these rapidly evolving times?ย
I think we're going to continue to be pioneers and innovators in this field. I think what we do well is being able to prototype and iterate on new tech very fast. So I see us as a sort of the innovation skunkworks of Elsevier. We can further supercharge some of Elsevierโs products. And more of our stuff will be used at scale, taking away more of the tedious work. Weโve also got some new products coming out that deal with a big hurdle in the ontology world: mapping between ontologies. So again, I am really excited.
Is there anything that would make your job easier in terms of accelerating R&D for the life sciences? Is there something people can do?
Support, fund and recognize all of these ontologies that underpin all of this cool stuff weโre doing. And if you donโt have money, provide feedback: Use your expertise with those sources. They all have public trackers where you can make suggestions in terms of what needs to be fixed or added. It just makes the products better.
Become part of this virtuous circle!
๊ธฐ์ฌ์

AR
Ann-Marie Roche
Senior Director of Customer Engagement Marketing
Elsevier
Ann-Marie Roche ๋ ์ฝ์ด๋ณด๊ธฐ