Boosting RAG-based intelligent document assistants using entity extraction, SQL querying, and agents with Amazon Bedrock
AWS Machine Learning Blog
DECEMBER 6, 2023
When answering a new question in real time, the input question is converted to an embedding, which is used to search for and extract the most similar chunks of documents using a similarity metric, such as cosine similarity, and an approximate nearest neighbors algorithm. The search precision can also be improved with metadata filtering.
Let's personalize your content