10 tips when unlocking AI for R&D
2023๋ 11์ 30์ผ
์ ์: Ann-Marie Roche

ยฉ istock.com/metamorworks
The first edition of our new webinar series explores the perils and pitfalls of generative AI for R&D. Four experts talk about how to take advantage of new technology without losing sight of the big picture.
Elsevierโs four-part webinar series AI in innovation: Unlocking R&D with data-driven AI outlines the issues that can derail your AI projects and how to prepare yourself for these innovations โ and those around the corner. In the first edition, a panel of AI and data experts explore the perils, pitfalls and promise of generative AI for R&Dย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ.
Moderated by Elsevierโs Commercial Director for Corporate Markets Zen Jelenjeย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ, the panel consisted of Elsevierโs VP of Data Science Life Sciences Mark Sheehanย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ and two experts from Elsevierโs SciBite: Director of Data Science & Professional Services Dr Joe Mullenย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ and Head of Ontologies Dr Jane Lomaxย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ.ย
With Elsevierโs history of providing enriched and curated scientific data in AI-driven solutions such as Reaxys and Embase, this episode focuses on the questions our scientists, data scientists and computational chemists get from customers about AI and, more recently, large language models (LLMs).
As Zen explains: โThese arenโt simple questions, and we definitely donโt have all the answers. But today, we have a diverse team from Elsevier and SciBite to explore some of these topics.โ

Zen Jelenje
Do watch the whole episodeย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ โ a lot was covered. Meanwhile, here are some tips from the panel for navigating these changing times.
Tip #1: Get your data in order.
Itโs easy to get distracted by all the noise and hype around LLMs, particularly ChapGPT. But to take advantage of any AI technology, you need to start with your data.
โYour data need to be well organized, well-structured and FAIRย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ โ meeting the principles of Findability, Accessibility, Interoperability and Reusability,โ Joe says. โOnly then will you be ready and flexible enough to quickly and seamlessly latch onto the best solution for the problem you want to solve.โ (see Tip #2).

Joe Mullen, PhD
Tip #2: Donโt rush to a โsolution.โ Start by asking, โWhatโs the specific problem I want to solve?โ
โYou've got to remain focused on identifying what the problems are, and only then look at the ever-evolving solutions to solve those problems,โ Joe says.ย
โInstead of thinking of it as whether to invest in AI,โ adds Zen, "you need to ask the question, โHow does this improve my research?โโ
Tip #3: Donโt consider LLMs as an all-in solution โ especially for life sciences. (However, LLMs can still be part of the solution.)
At the end of the day, scientific progress is built on providence, transparency and reproducibility. And LLMs like ChatGPT are simply not built for that โ for now anyway. Currently, much of Elsevierโs work is built on ontologies. โThese use language to create a model of a domain,โ Jane explains. โIt's a codification of what humans understand about a particular domain โ facts as we now understand them. And I think that's always going to be something that's necessary and useful.
โLLMs, on the other hand, are probabilistic models that are really powerful at generating and understanding human language,โ Jane adds. โTheyโre amazing, and we use them internally.โ But unfortunately, LLMs also hallucinate, and the information is not properly sourced. So in the longer-term, many hope โto have an LLM with an ontology-based factual backbone โ and then youโll have something truly powerful,โ she says.
โI also think that LLMs can bring value to one of our main aims at SciBite,โ says Joe. โAnd thatโs supporting data democratization โ improving the access and interpretation of data. But LLMs wonโt be able to supply this by themselves due to their limitations.โย

Jane Lomax, PhD
Tip #4: Donโt underestimate scaling.
โOne piece of advice: donโt underestimate the difficulty in being able to scale these types of technologies to production,โ says Jane. โWhen we started with this three years ago, we ended up having to take a step back and first build the infrastructure and invest in the skills. We learned a lot through that process, but it was quite a learning curve. So, if you're investing in this, donโt overlook this. Come chat with us.โย
Tip #5: Think operationalization.
โNew technology brings new holistic cost considerations,โ Joe says. โThere are costs associated with rolling out some of these larger models: monetary costs, time costs, disk and carbon footprint costs, and so on and so forth.โย
Tip #6: Get your hands dirty (while failing fast, learning fast and moving on).
โI read a McKinsey report the other day about whether you want to be a taker, a shaper or a maker in the AI space,โ Mark says. โAre you going to wait until itโs fully cooked? Nothing wrong with that. And it can depend on the industry or your companyโs appetite for risk and investment.โ But for Elsevier, the road was clear: jump in now.ย
โAnd definitely having the right team in place is important,โ adds Zen. โAnd since some of the questions we try to address in scientific research are really specific to the domain, it's also harder to wait for somebody else to do it for us.โย ย
โItโs actually very fulfilling to bring the team together to work on new innovations using the latest technologies,โ Mark says. โBut itโs important to acknowledge there will be bumps on the road on that digital transformation journey. There will be mistakes and there will be failures. But itโs also incredibly rewarding when you get it right. You need to learn from your mistakes, pick yourself up and move forward.โ
โAnd this is what weโve been doing for the last 12 to 18 months in terms of GenAI, specifically LLMs. Weโre getting our subject matter experts, our data scientists and our data analysts together to really get their hands dirty and ask, โWhat can I do now that I couldn't do yesterday?โ It's like you're building your muscles up in this space. Youโre learning as you go.โย

Mark Sheehan
Tip #7: Think modularity.
โOur enrichment pipelines continue to become more automated and feature more of the latest AI technologies as we iterate,โ says Mark. โAnd certainly, it's not the case that as soon as a new technology comes in, we throw out what we had before. It works well that we have a mix of rule-based technologies and machine learning technologies. And now we're exploring the latest Gen AI technologies. These can all be complementary.โ
โWe always try to find a way to integrate all these different pipelines, datasets and capabilities into what my team calls a Lego set,โ Mark adds. โIt's a great way to approach things in a modular and flexible way without getting too obsessed about the latest or greatest technologies.โ
Tip #8: Stay on top of whatโs happening.
It might be simpler to wait for others to fail and then adopt. But here you risk being left behind โ and losing any competitive edge. As Joe points out: โAround 10 years ago, AI was beating humans at Space Invadersย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ. Around five years ago, AI got better at Goย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ. Just a few weeks ago, AI started beating humans in real-time drone racingย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ. AI is evolving at such a pace, you need to keep yourself skilled up and aware of what's going on around you.
โAnd again, this is about getting your hands dirty. Reading a few articles and blogs isnโt enough. But itโs a difficult balance: keeping on top of things without getting sucked in, while just trying to identify those problems you want to solve.โ
Tip #9: Keep humans in the loop.ย
As the panel discussed, Subject Matter Experts (SMEs) remain essential to validate the output of any AI algorithm โ and more so when it comes to LLMs. For instance, these SMEs can be deployed as prompt engineers to ask the right questions to the LLMs so the resulting output is easier to validate.ย ย
โPrompt engineering is actually a skill that we should all have some appreciation and understanding of,โ Joe says. โIt's not as straightforward as some people might expect. You need to be able to relay your understanding of the world to an LLM โฆ and this comes back again to the real importance of SMEs when applying it to scientific domains that really require someย expertise.โ
Tip #10: While waiting on regulatory decisions, aim to be responsible.
โIf you ask me about the regulatory environment today, this webinar would be out of date in a month or so,โ Mark says. And indeed: watch this space. But meanwhile, you should aim to be responsible. โRegulations are all about governments coming in saying we need to manage this space because weโre concerned about the future. But it could start with responsible AI where the actual practitioners go โHow can we be responsible and ethical about how we approach this?โ And at Elsevier, weโve really tried to bake this into our daily work from the start withย our Responsible AI principlesย ์ ํญ/์ฐฝ์์ ์ด๊ธฐ.โ
For the full iceberg of insight, watch the webinar. And in the meantime, donโt forget to get your data in order (see tip #1)!
๊ธฐ์ฌ์

AR
Ann-Marie Roche
Senior Director of Customer Engagement Marketing
Elsevier
Ann-Marie Roche ๋ ์ฝ์ด๋ณด๊ธฐ