Retrieval Augmented Generation

Nasdanika RAG is a retrieval augmented generation framework for Java. Its goal is to give Java developers the means to assemble RAG designs that match the task at hand.

The framework takes a capability and architecture/solution building block approach. Architecture building blocks provide one or more capabilities; solution building blocks implement one or more architecture building blocks.

One definition of software development is “the process of incremental binding of decisions to make them executable”. This framework follows that paradigm: it defines high-level generic abstractions, then progressively binds generic parameters, and finally binds concrete implementations.
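The progressive binding described above can be illustrated with a minimal Java sketch. All type names here (KeyExtractor, TextKeyExtractor, WordCountKeyExtractor) are hypothetical and chosen for illustration; they are not the framework's actual API.

```java
class ProgressiveBinding {

    // Step 1: a fully generic abstraction. Nothing is decided yet
    // about the data item type D or the key type K.
    interface KeyExtractor<D, K> {
        K extract(D dataItem);
    }

    // Step 2: bind one generic parameter. Data items are now text,
    // the key type is still open.
    interface TextKeyExtractor<K> extends KeyExtractor<String, K> {
    }

    // Step 3: bind the remaining parameter and a concrete implementation.
    // The "key" is a toy one: the number of words in the text.
    static class WordCountKeyExtractor implements TextKeyExtractor<Integer> {
        @Override
        public Integer extract(String text) {
            String trimmed = text.trim();
            return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
        }
    }

    public static void main(String[] args) {
        KeyExtractor<String, Integer> extractor = new WordCountKeyExtractor();
        System.out.println(extractor.extract("retrieval augmented generation")); // prints 3
    }
}
```

Code written against the most generic interface keeps working regardless of which decisions are bound later, which is what allows alternative solution building blocks to be swapped in.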

The diagram above depicts the architecture building blocks. The diagram below shows how the building blocks interact during the loading and retrieval/generation phases.

For each architecture building block there may be multiple solution building blocks. A combination of compatible solution building blocks is called a “design” or “embodiment”. Solution building blocks for the same architecture building block may be chained and composed to produce new solution building blocks.

The following sections provide brief overviews of the architecture building blocks. Click on the diagram elements or use the left navigation or links to navigate to the blocks’ pages with more detailed information and a list of solution building blocks.

Data Sources

Sources of data. There might be multiple solution building blocks with two primary design dimensions:

Data item format - PDF, HTML, JSON, diagram, Ecore model, PNG, JPEG …
Data storage interface - file system, zip archive, (REST) API, …

Data items can be converted from one format to another. As such, data sources can be chained and composed.
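As a rough sketch of what chaining and composing data sources could look like, the example below defines a hypothetical DataSource interface with a map method (format conversion) and a concat method (combining sources). The interface and its method names are assumptions made for illustration, not the framework's API, and the tag-stripping regex is only a toy stand-in for a real HTML-to-text conversion.

```java
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class DataSourceComposition {

    /** Hypothetical data source: supplies data items of type T. */
    interface DataSource<T> {
        Stream<T> items();

        /** Chaining: converts every item to another format. */
        default <R> DataSource<R> map(Function<? super T, ? extends R> converter) {
            return () -> items().<R>map(converter);
        }

        /** Composition: items of this source followed by items of the other. */
        default DataSource<T> concat(DataSource<? extends T> other) {
            return () -> Stream.concat(items(), other.items());
        }
    }

    static List<String> demo() {
        DataSource<String> html = () -> Stream.of("<p>Loader</p>", "<p>Store</p>");
        DataSource<String> plainText = () -> Stream.of("Retriever");
        // Convert the HTML items to text, then combine them with the plain-text source.
        DataSource<String> all = html
                .map(item -> item.replaceAll("<[^>]+>", ""))
                .concat(plainText);
        return all.items().collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints [Loader, Store, Retriever]
    }
}
```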

Loader

Reads data items from data sources and adds them to the repository.

Repository

Item Builder

Creates store items from data items. For example, it breaks down a document into sections and paragraphs. It computes embeddings (keys) and an item identifier/locator for each item, and adds the items to the store to make them discoverable by the retriever.
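A minimal sketch of an item builder follows, assuming plain-text documents whose paragraphs are separated by blank lines. The StoreItem record, the build method, and the "documentId#pN" locator format are hypothetical names introduced for illustration; the key extractor is passed in as a function so that any key type can be plugged in. The sketch uses records, so it assumes Java 16 or later.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

class ItemBuilderSketch {

    /** Hypothetical store item: a key plus a locator pointing back into the source document. */
    record StoreItem<K>(K key, String locator, String text) {}

    /** Splits a document into paragraphs and creates one store item per non-empty paragraph. */
    static <K> List<StoreItem<K>> build(String documentId, String document, Function<String, K> keyExtractor) {
        List<StoreItem<K>> items = new ArrayList<>();
        // Paragraphs are separated by one or more blank lines.
        String[] paragraphs = document.split("\\R\\s*\\R");
        for (int i = 0; i < paragraphs.length; i++) {
            String paragraph = paragraphs[i].trim();
            if (!paragraph.isEmpty()) {
                K key = keyExtractor.apply(paragraph);
                items.add(new StoreItem<>(key, documentId + "#p" + i, paragraph));
            }
        }
        return items;
    }
}
```

For example, `ItemBuilderSketch.build("doc1", text, String::length)` would use the paragraph length as a toy key; a real design would pass an embedding function instead.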

Key Extractor

Takes a data item or a fragment of it (say, a paragraph from a text document) and computes a key. The key is something that identifies the data item. Some key types may support similarity computation.

Examples of keys:

Text:
- MD5/SHA digest - exact matches
- Vector embeddings - cosine similarity
- Bags of words - overlap similarity
Person:
- Postal address - distance similarity
- DNA - shared DNA
- Marital and social network relationships - distance
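The text key types listed above can be sketched as follows. This is an illustration under stated assumptions rather than the framework's implementation: the class and method names are hypothetical, the bag of words is simplified to a set of lower-cased words, and Jaccard similarity is used as one possible overlap measure.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;
import java.util.stream.Collectors;

class KeySimilaritySketch {

    /** Digest key: supports exact matches only (equal text yields an equal key). */
    static String sha256(String text) {
        try {
            byte[] digest = MessageDigest.getInstance("SHA-256").digest(text.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e); // SHA-256 is required on every Java platform
        }
    }

    /** Bag-of-words key, simplified here to the set of lower-cased words. */
    static Set<String> words(String text) {
        return Arrays.stream(text.toLowerCase().split("\\W+"))
                .filter(word -> !word.isEmpty())
                .collect(Collectors.toSet());
    }

    /** Overlap (Jaccard) similarity: shared words divided by all distinct words. */
    static double overlap(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 1.0;
        }
        Set<String> shared = new HashSet<>(a);
        shared.retainAll(b);
        Set<String> all = new HashSet<>(a);
        all.addAll(b);
        return (double) shared.size() / all.size();
    }

    /** Cosine similarity between two embedding vectors of equal length. */
    static double cosine(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("Vectors differ in length");
        }
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0 || normB == 0) {
            return 0; // similarity with a zero vector is treated as 0 in this sketch
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }
}
```

Digest keys answer only "is this the same text?", while the other two key types return a graded score that a retriever can sort by.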

Store

Contains associations between item keys (e.g. vector embeddings) and item identifiers. Data items can be stored by value or by reference. The store can be indexed.

Retriever

Computes a query key and retrieves matching items from the store, sorted by similarity.
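The following sketch combines a simple in-memory store with a retriever that ranks items by cosine similarity to the query key. Everything in it is an assumption made for illustration: the class, record, and method names are hypothetical, and toyEmbedding (word counts over a fixed four-word vocabulary) merely stands in for a real embedding model. It uses records, so it assumes Java 16 or later.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;

class RetrieverSketch {

    /** Store entry: a key (embedding) associated with an item identifier. */
    record Entry(double[] key, String itemId) {}

    /** Retrieval result: an item identifier with its similarity to the query. */
    record Match(String itemId, double similarity) {}

    private final List<Entry> store = new ArrayList<>();
    private final Function<String, double[]> keyExtractor;

    RetrieverSketch(Function<String, double[]> keyExtractor) {
        this.keyExtractor = keyExtractor;
    }

    /** Adds an item to the store by reference (only its key and identifier are kept). */
    void add(String itemId, String text) {
        store.add(new Entry(keyExtractor.apply(text), itemId));
    }

    /** Computes the query key and returns the best matches, most similar first. */
    List<Match> retrieve(String query, int limit) {
        double[] queryKey = keyExtractor.apply(query);
        return store.stream()
                .map(entry -> new Match(entry.itemId(), cosine(queryKey, entry.key())))
                .sorted(Comparator.comparingDouble(Match::similarity).reversed())
                .limit(limit)
                .collect(Collectors.toList());
    }

    static double cosine(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return normA == 0 || normB == 0 ? 0 : dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    /** Toy embedding: counts of a few vocabulary words. A stand-in for a real model. */
    static double[] toyEmbedding(String text) {
        String[] vocabulary = {"store", "key", "loader", "data"};
        double[] vector = new double[vocabulary.length];
        for (String word : text.toLowerCase().split("\\W+")) {
            for (int i = 0; i < vocabulary.length; i++) {
                if (vocabulary[i].equals(word)) {
                    vector[i]++;
                }
            }
        }
        return vector;
    }
}
```

This linear scan compares the query against every entry; an indexed store would replace the stream over all entries with an index lookup.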

Requestor

Takes a request and passes it to the query engine. It then uses the request and the query results to formulate a request to the generator. For example, if the generator is an LLM, the requestor creates prompts or chains of thought from the query results. Finally, it passes the request, the query results, and the response from the generator to the responder.
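The requestor flow described above might be sketched like this. The QueryEngine, Generator, and Responder interfaces, their method signatures, and the prompt wording are all hypothetical and introduced only to show the order of the calls; they are not the framework's API.

```java
import java.util.List;

class RequestorSketch {

    interface QueryEngine {
        List<String> query(String request);
    }

    interface Generator {
        String generate(String prompt);
    }

    interface Responder {
        String respond(String request, List<String> queryResults, String generated);
    }

    static String handle(String request, QueryEngine queryEngine, Generator generator, Responder responder) {
        // 1. Pass the request to the query engine.
        List<String> results = queryEngine.query(request);

        // 2. Formulate the generator request (here, an LLM-style prompt) from the request and the results.
        StringBuilder prompt = new StringBuilder("Answer the question using only the context below.\n\nContext:\n");
        for (String result : results) {
            prompt.append("- ").append(result).append('\n');
        }
        prompt.append("\nQuestion: ").append(request);

        // 3. Call the generator.
        String generated = generator.generate(prompt.toString());

        // 4. Hand the request, the query results, and the generated response to the responder.
        return responder.respond(request, results, generated);
    }
}
```

Because each collaborator is a single-method interface, stubs can be supplied as lambdas, which makes the flow easy to exercise without a real model.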

Model documentation

Roadmap

Java Core module
Ecore model
Processors - ecore, doc, factories
Drawio mapping
Eclipse tree editor
Eclipse Sirius designer
AI models - Anthropic Claude, Orca-2, …
Caching implementations for query engine, key extractor, and generator:
- Memory-sensitive cache based on Apache Commons Pool
- Hazelcast-based cache. Can be used in conjunction with the Hazelcast Docker image
- Other implementations as needed, e.g. RDBMS -> H2, …
Async/reactive implementation
User feedback collection - ranking of answers and providing the right answer (similarity can then be used to choose the best of the available answers)
Benchmarking
