Fast and reliable internal information using AI Document Explorer
A financial institution
- Customer case
- Data projects
- Data Engineering
Financial institutions need to process large amounts of documentation. For this particular institution, an internal team facilitates this by, for example, creating summaries using text analysis and natural language processing (NLP). They make these available to the various business units. To conduct audits more efficiently, they wanted to develop a question-and-answer model to get the right information to them faster. When ChatGPT was launched, they asked us to create a proof of concept.
Approach
Setting up a large language model within a proprietary environment with proprietary data is a relatively new field. To ensure privacy and set up a secure environment, we used the AI Document Explorer. This is a private instance of a GPT model according to the Retrieval Augemented Generation framework that we connected to the financial institution's existing infrastructure.
We set up the data processing process as follows:
- Documents from SharePoint are retrieved within the Azure environment.
- Data from images, web pages, PDFs, PowerPoints, Word documents and Excel files are converted to readable text.
- This output is split into small pieces of text and indexed for each piece. The semantic value per piece of text is also retrieved by a smart algorithm and stored with it.
We also built a chat app to talk to the data. When a user asks a question within this app, the following steps are taken:
- We convert the query into keywords that are used to return a top 10 indexed pieces of text that are semantically most similar to the keywords.
- We send the returned pieces of text to the ChatGPT model along with a system prompt (instructions) and the original query.
- We return the model's answer and the pieces of text used to the user.
Because ChatGPT passes along the pieces of text used, the user can see which source the answer came from and check if it is correct.
Result
Instead of having to search by themselves for answers to ad hoc questions and explanations of specific processes to be followed, employees of the financial institution can now ask questions about some 100,000 pieces of text of documentation. They get 'human' answers with source references and citations. The output looks as follows:
Information is available faster and reliability is always verified by an employee. Thus, work is done more efficiently and the quality of work remains undiminished.
Want to know more?
Do you also want to safely search your confidential documents with the AI Document Explorer? Joachim would be glad to discuss the possibilities with you.
Business Manager+31(0)20 308 43 90+31(0)6 23 59 83 71joachim.vanbiemen@digital-power.com
Receive data insights, use cases and behind-the-scenes peeks once a month?
Sign up for our email list and stay 'up to data':