Large language models (LLMs) often generate responses with factual inaccuracies due to limited internal parametric knowledge. Retrieval-Augmented Generation (RAG) addresses some of these issues by adding external information to the prompt, but indiscriminate retrieval can lead to irrelevant or inaccurate responses. Furthermore, multilingual scenarios exacerbate these limitations, especially for minoritarian languages and niche domains like Space.
To adapt LLMs to Space and create a multilingual assistant for Space stakeholders, SpaceWise proposes two main innovations. First, based on algorithms such as SELF-RAG, SpaceWise enables an arbitrary LLM to assess the relevance of the retrieved information and to decide whether to add such information to the prompt based on such assessment. Additionally, SpaceWise integrates external translation services in the LLM based on innovative concepts such as Toolformer and ReAct, addressing accessibility barriers for non-English speakers.
The diagram depectis the high level components in SpaceWise
SpaceWise is the natural continuation of our efforts to leverage LLM capabilities in the Space domain. Previously, we developed SpaceQA, an open-domain extractive question answering (QA) system. SpaceQA utilizes language models to identify specific text fragments in a document collection that answer user queries. The system integrates a retriever component that identifies potential text passages where the answer could be found, placing it within the category of retrieval-augmented language models. However, SpaceQA lacks generation capabilities, and is limited to pointing out where the answer to the question lies in the text.
SpaceWise takes a step forward from the SpaceQA approach and embraces the power of generative language models. SpaceWise learns when to rely on its parametric knowledge acquired during pretraining and when it is better to utilize a retrieval system, depending on the user's query.
We also worked with generative AI systems in the Space domain. SpaceQQuiz employs sequence to sequence language models trained to generate questions and applied to the quality management domain in space operations.
SpaceWise is a generative AI system that goes beyond generating questions and transcends language barriers. It is a fully-fledged conversational model capable of engaging in chat and following instructions. Moreover, SpaceWise can handle multiple languages using its parametric knowledge and, when necessary, resorting to machine translation tools.
Recently, we collaborated with ESA to streamline access to information about early designs of space missions leveraging LLM. To this end we develop the QA system SpaceQA relying on a autoencoder model. SpaceQA has successfully enabled users to access information using questions in natural language that was previously available only in lengthy feasibility studies in PDF format.
High level components in SpaceQA
A demo of SpaceQA is available here (user/pass: sigir/s1g1r2022!): https://esatde.expertcustomers.ai/SpaceQA/
We also developed a generative AI system to help the Quality Management department in ESA to generate automatically assesment material for evaluating training sessions. SpaceQQuiz leverages auto-regressive models like T5 and BART to generate questions, and a RoBERTa model to extract answers for such questions, thus verifying their suitability.
High level components in SpaceQquiz
A demo of SpaceQquiz is available here (user/pass: s demoINLG/demoINLG2022!): https://esatde.expertcustomers.ai/SpaceQQuiz/
Contributions are welcome, and they are greatly appreciated! Every little bit helps, and credit will always be given.
To contribute, have a look at Contributing <./CONTRIBUTING.rst>
__
SpaceWise is released under the Apache 2.0 license
At Expert.ai we turn language into data so humans can make better decisions.