# TnsAI Framework — Full Documentation > Concatenated MDX of every docs page on tnsai.dev. > Generated from `content/docs/**/*.mdx` at build time. > Page boundaries marked by `---` and a `## URL` heading. --- # Advanced Agent Features URL: https://tnsai.dev/docs/agents/advanced Description: Beyond the basic Agent lifecycle (create, chat, stop), TnsAI.Core provides specialized subsystems for cognitive support, streaming, ensemble execution, hierarchy management, and chat orchestration. These are internal components extracted from the Agent class for cohesion; most are accessed through the Agent public API rather than directly. import { Callout } from 'fumadocs-ui/components/callout' ## AgentCognitiveSupport `com.tnsai.agents.AgentCognitiveSupport` holds planning, reasoning, evaluation, environment perception, and feedback collection for an Agent. It is created internally during agent construction and accessed through `AgentCapabilities`. ### Evaluation Hooks Eval hooks fire at lifecycle points (before/after chat, before/after tool call, on error, on agent stop) and record metrics into an `EvalContext`. ```java // Add an evaluation hook agent.addEvalHook(myHook); // Enable/disable evaluation agent.setEvalEnabled(true); // Record goal completion agent.recordGoalCompletion("goal-1", true, Map.of("steps", 3)); ``` Key methods on `AgentCognitiveSupport`: | Method | Description | | -------------------------------------------------------------------------------------- | --------------------------------------------------- | | `addEvalHook(EvalHook hook)` | Register an eval hook | | `removeEvalHook(EvalHook hook)` | Unregister an eval hook | | `clearEvalContext()` | Create a fresh `EvalContext` with new session ID | | `setEvalEnabled(boolean)` | Enable or disable all eval hooks | | `isEvalEnabled()` | Check if evaluation is active | | `recordGoalCompletion(String goalId, boolean success, Map details)` | Record a goal outcome | | `fireOnBeforeChat(String message)` | Fire pre-chat eval event | | `fireOnAfterChat(String result, long latencyMs)` | Fire post-chat eval event | | `fireOnBeforeToolCall(String toolName, Map arguments)` | Fire pre-tool eval event | | `fireOnAfterToolCall(String toolName, Object result, boolean success, long latencyMs)` | Fire post-tool eval event | | `fireOnAgentStop(String status)` | Fire agent stop event and complete the eval context | ### Planning (BDI) Planning uses a `PlannerHandle` (SPI, provided by tnsai-intelligence) for GOAP/HTN planning. ```java // Set a planner agent.setPlannerHandle(myPlanner); // Plan for all goals List steps = agent.plan(); // Plan for a specific goal List steps = agent.planForGoal("deliverPackage"); // Execute a plan PlannerHandle.PlanResult result = agent.executePlan(steps); ``` Key methods on `AgentCognitiveSupport`: | Method | Signature | | --------------------- | ---------------------------------------------------------------- | | `plan` | `List plan(List roles)` | | `planForGoal` | `List planForGoal(String goalName, List roles)` | | `executePlan` | `PlanResult executePlan(List steps, List roles)` | | `extractCurrentState` | `Map extractCurrentState(List roles)` | | `setPlannerHandle` | `void setPlannerHandle(PlannerHandle handle)` | | `getPlannerHandle` | `Optional getPlannerHandle()` | | `isPlanningEnabled` | `boolean isPlanningEnabled()` | ### Reasoning Strategy Reasoning strategies (ReAct, Tree-of-Thought, etc.) are pluggable via `ReasoningStrategyHandle`. ```java agent.setReasoningStrategy(reactStrategy); // After chat, inspect reasoning Optional result = agent.getLastReasoningResult(); ``` When reasoning is enabled, `AgentOrchestrator.chat()` routes through the strategy instead of direct LLM invocation. ### Environment Perception Agents can perceive and act on environments (BDI perception-to-belief pipeline). ```java agent.setEnvironment(myEnvironment); Percept percept = agent.perceive(); ActionOutcome outcome = agent.actOnEnvironment("moveForward", Map.of("speed", 5)); ``` Environment changes trigger `onEnvironmentChangeWithRoles`, which checks for unsatisfied goals if a planner is configured. ### Feedback Collection `FeedbackCollector` (SPI, provided by tnsai-intelligence) records tool outcomes for preference learning. ```java agent.setFeedbackCollector(myCollector); FeedbackCollector collector = agent.getFeedbackCollector(); ``` Tool outcomes are recorded automatically via `recordToolOutcome(toolName, result, success, latencyMs)`. ## AgentEnsembleExecutor `com.tnsai.agents.ensemble.AgentEnsembleExecutor` provides multi-LLM ensemble operations accessed via `agent.getEnsembleExecutor()`. ### Patterns | Method | Signature | Description | | ------------------ | ---------------------------------------------------------------------------------------------------------- | ------------------------------------------ | | `parallelChat` | `ParallelResults parallelChat(String message, List llms)` | Fan out to all LLMs, collect all responses | | `raceChat` | `Optional raceChat(String message, List llms, long timeoutSeconds)` | First successful response wins | | `consensusChat` | `Optional consensusChat(String message, List llms, Function scorer)` | Score responses, pick highest | | `majorityVoteChat` | `Optional majorityVoteChat(String message, List llms)` | Most common response pattern | | `ensembleChat` | `Optional ensembleChat(String message, List llms, LLMClient aggregator)` | Synthesize responses with aggregator LLM | ```java // Fan-out to multiple LLMs ParallelLLM.ParallelResults results = agent.getEnsembleExecutor() .parallelChat("Explain quantum computing", List.of(gpt4, claude, gemini)); // Race: first response wins Optional fastest = agent.getEnsembleExecutor() .raceChat("Quick question", List.of(gpt4, claude), 10); // Consensus with scoring Optional best = agent.getEnsembleExecutor() .consensusChat("Review this code", llms, response -> scoreQuality(response)); // Ensemble synthesis Optional synthesized = agent.getEnsembleExecutor() .ensembleChat("Complex analysis", List.of(gpt4, claude), aggregatorLLM); ``` ## AgentHierarchyManager `com.tnsai.agents.hierarchy.AgentHierarchyManager` manages parent-child relationships between agents, including bidirectional consistency and task delegation/escalation. ### Hierarchy Setup ```java Agent supervisor = new SupervisorAgent(); Agent developer = new DeveloperAgent(); Agent tester = new TesterAgent(); // Bidirectional parent-child links supervisor.addChild(developer); supervisor.addChild(tester); // developer.getParent() == supervisor // supervisor.getChildren() contains both // Add multiple at once supervisor.addChildren(developer, tester, designer); ``` ### Task Delegation and Escalation ```java // Supervisor delegates to child supervisor.delegateToChild(developer.id(), "writeCode", Map.of( "task", "Implement authentication", "language", "Java" )); // Developer escalates to supervisor developer.escalateToParent("requestApproval", Map.of( "reason", "Production deployment requires sign-off" )); ``` Both methods use `AgentCommunicationManager` to send `TaskMessageType.REQUEST` messages. `delegateToChild` throws `NoSuchElementException` if the child is not found; `escalateToParent` throws `IllegalStateException` if no parent is set. ### HierarchyContext Interface The manager uses a `HierarchyContext` interface (implemented by `Agent`) to access internals: ```java public interface HierarchyContext { String getAgentId(); Agent getAgentRef(); Agent getParentRef(); void setParentRef(Agent parent); List getChildrenSnapshot(); boolean hasChild(Agent child); boolean addChildDirect(Agent child); boolean removeChildDirect(Agent child); Agent findChildById(String childId); AgentCommunicationManager getCommunicationManager(); } ``` ## AgentStreamingSupport `com.tnsai.agents.streaming.AgentStreamingSupport` handles streaming and event-based chat operations. ### Token Streaming ```java // Stream tokens as a Java Stream Stream tokens = agent.streamChat("Tell me about Java"); tokens.forEach(System.out::print); ``` ### Streaming with Tool Calls The `streamChatWithTools` method combines streaming with multi-turn tool execution (up to 10 iterations): ```java agent.streamChatWithTools("Search for TnsAI docs", chunk -> { if (chunk.isContent()) { System.out.print(chunk.getContent()); } else if (chunk.isToolCall()) { System.out.println("Tool: " + chunk.getToolCall().get().getName()); } }); ``` Flow: stream LLM response -\\> if tool calls received, execute them -\\> send results back to LLM -\\> repeat until final text or max iterations (10). ### Event-Based Chat (AG-UI) ```java // With direct event consumer agent.chatWithEvents("Hello", event -> { if (event instanceof TextDeltaEvent delta) { System.out.print(delta.getText()); } else if (event instanceof ToolCallStartEvent tc) { System.out.println("Calling: " + tc.getToolName()); } }); // With session-based publisher agent.chatWithEvents("Hello", sessionId); ``` Events emitted: `RunStartEvent`, `StatusEvent`, `TextDeltaEvent`, `ToolCallStartEvent`, `ToolCallEndEvent`, `ErrorEvent`, `RunEndEvent`. ## AgentChatOrchestrator `com.tnsai.agents.chat.AgentChatOrchestrator` handles LLM invocation, response processing, structured output, and RAG augmentation. ### LLM Invocation ```java // Full control Object response = chatOrchestrator.invokeLLM(message, useHistory, useTools, trace); // Process response (handles text and tool calls) String result = chatOrchestrator.processLLMResponse(response, useHistory); ``` ### Structured Output (Guardrails Pattern) ```java // With custom parser MyOutput output = agent.chatWithStructure( "Extract the key points", new MyOutputParser(), 3 // maxRetries for correction ); // With format detection (JSON, YAML, TOML) MyRecord record = agent.chatWithFormat("Generate config", MyRecord.class, 2); // With explicit format MyRecord record = agent.chatWithFormat("Generate config", MyRecord.class, OutputFormat.YAML, 2); ``` The orchestrator detects output format from `@OutputFormatSpec` on the target class or agent class, defaulting to JSON. ### RAG (Knowledge Base Augmentation) When a `KnowledgeBase` is set on the agent, the orchestrator automatically augments user messages with relevant knowledge results before sending to the LLM: ```java agent.setKnowledgeBase(myKnowledgeBase); agent.setKnowledgeBaseTopK(5); // Messages are now automatically augmented with knowledge context agent.chat("What is our refund policy?"); ``` ## AgentOrchestrator `com.tnsai.agents.orchestration.AgentOrchestrator` is the top-level coordinator that wires together `AgentToolExecutor`, `AgentChatOrchestrator`, `AgentStreamingSupport`, and `AgentEnsembleExecutor`. It implements the context interfaces for all three sub-delegates so they can access agent internals without circular dependencies. ### Key Public Methods | Method | Description | | ----------------------------------------------------------------------------- | --------------------------------------------------------------- | | `chat(String message, boolean useHistory, boolean useTools)` | Main chat entry point with eval hooks and context graph tracing | | `streamChat(String message)` | Token-by-token streaming | | `streamChatWithTools(String message, Consumer handler)` | Streaming with tool calling loop | | `chatWithEvents(String message, Consumer consumer)` | Event-based chat | | `chatWithEvents(String message, String sessionId)` | Event-based chat with session publisher | | `executeActionPublic(String name, Map params)` | Execute a named action | | `executeActionTyped(ActionRequest request)` | Execute with typed request/response | | `executeActionOnRole(String roleId, String name, Map params)` | Execute on a specific role | | `setToolCallFilter(ToolCallFilter filter)` | Set permission control for tool calls | | `setToolCallListener(ToolCallListener listener)` | Set progress callbacks | | `setKnowledgeBase(KnowledgeBase kb)` | Set RAG knowledge base | | `shutdown()` | End context conversations | ### Context Graph Integration When context graph is enabled, the orchestrator automatically: - Starts/ends context conversations around chat sessions - Creates snapshots at conversation boundaries - Records `DecisionTrace` entries for each chat (success or failure) ## AgentCapabilities `com.tnsai.agents.capabilities.AgentCapabilities` is the capability facade that bridges cognitive support, resilience, variants, and context graphs. Agent delegates to this class for all capability operations. ### Variant Selection ```java agent.setVariant(AgentVariant.HIGH); AgentVariant current = agent.getVariant(); // Auto-resolve based on task AgentVariant resolved = agent.resolveVariant("Complex refactoring task"); // Custom selector agent.setVariantSelector(mySelector); ``` ### Context Graph ```java agent.enableContextGraph(snapshotStore, decisionTraceStore); Optional cm = agent.getContextManager(); boolean enabled = agent.isContextGraphEnabled(); ``` ### Resilience ```java AgentHealthState health = agent.getHealthState(); ResilienceExecutor executor = agent.getResilienceExecutor(); RetryPolicy policy = agent.getDefaultRetryPolicy(); DeadLetterQueue dlq = agent.getDeadLetterQueue(); agent.clearRecoveryState(); ``` ## Related Documentation - [Streaming](/docs/agents/behavior/streaming) -- streaming basics and ChatChunk - [Tools](/docs/capabilities/tools/registration) -- tool registration and the Tool interface - [Roles](/docs/agents/fundamentals/roles) -- role-based action discovery - [Events](/docs/agents/fundamentals/events) -- TnsAI event system - [Resilience](/docs/agents/reliability/resilience) -- retry, circuit breaker, recovery - [Variants](/docs/agents/behavior/variants) -- AgentVariant cost/quality tiers --- # Behavior URL: https://tnsai.dev/docs/agents/behavior Description: How your agent talks, streams, and remembers. import { Callout } from 'fumadocs-ui/components/callout' ## Pages - [Prompt Strategies](/docs/agents/behavior/prompt-strategies) — System prompt templates, role composition, example injection. - [Output Parsing](/docs/agents/behavior/output-parsing) — Structured output with `chatWithFormat`, retries, validation. - [Streaming](/docs/agents/behavior/streaming) — `streamChatWithTools`, chunk types, event flow. - [Variants](/docs/agents/behavior/variants) — Quality, speed, cost tiers on a single agent. - [Memory](/docs/agents/behavior/memory) — Conversation history, window management, summarization. --- # Memory URL: https://tnsai.dev/docs/agents/behavior/memory Description: TnsAI.Core provides a pluggable memory system for agent conversation history. The MemoryStore interface defines storage, retrieval, pruning, and search operations. Four implementations cover different persistence and sharing requirements. The AgentBuilder.memoryStore() method wires a store into an agent. import { Callout } from 'fumadocs-ui/components/callout' ## MemoryStore Interface `MemoryStore` is the contract that all memory implementations must follow. It defines how an agent saves, retrieves, prunes, and searches its conversation history. You pick an implementation based on your persistence needs (in-memory for development, file-based for production, shared for multi-agent setups). ```java public interface MemoryStore { void init(String agentId); void addMessage(String role, String content); void addMessage(Map message); List> getHistory(); List> getRecentHistory(int limit); void clear(); void prune(int maxTokens); default List search(String query, int limit); } ``` | Method | Description | | ----------------------------------------- | ------------------------------------------------------------------------------------------------------------------------ | | `init(String agentId)` | Initializes the store for a specific agent. Triggers history loading for persistent stores. | | `addMessage(String role, String content)` | Adds a simple message with role (`user`, `assistant`, `system`) and content. | | `addMessage(Map message)` | Adds a structured message (e.g., with tool calls, metadata). The map is defensively copied. | | `getHistory()` | Returns the full conversation history as an unmodifiable list. | | `getRecentHistory(int limit)` | Returns the last N messages. | | `clear()` | Removes all messages. | | `prune(int maxTokens)` | Removes oldest messages until the estimated token count is within the limit. System prompts are preserved when possible. | | `search(String query, int limit)` | Semantic search over history. Default returns empty list. `AbstractMemoryStore` provides a TF-IDF implementation. | ## AbstractMemoryStore If you want to create your own memory store, extend `AbstractMemoryStore` instead of implementing `MemoryStore` from scratch. It handles thread safety, pruning, and search out of the box -- you only need to define how history is loaded and what happens after it changes. All built-in implementations extend `AbstractMemoryStore`, which provides: - **Thread safety** via `ReentrantLock` (Virtual Thread friendly, avoids carrier thread pinning) - **LinkedList** backing for O(1) removal during pruning - **TF-IDF search** using log-normalized term frequency and smooth inverse document frequency - **Extension hooks**: `onHistoryChanged()` and `loadHistory()` Subclasses only need to override two methods: ```java // Called after any modification (addMessage, clear, prune) protected void onHistoryChanged() { } // Called during init() to load persisted history protected void loadHistory() { } ``` Additional utility method: ```java // Returns current history size public int size(); ``` ## InMemoryStore The simplest memory store -- it keeps conversation history in a plain list in memory. This is the default when you do not configure any memory store on your agent. It is fast and requires zero setup, but all data is lost when the process exits. ```java public class InMemoryStore extends AbstractMemoryStore { public InMemoryStore(); } ``` All functionality is inherited from `AbstractMemoryStore` with no persistence hooks. Suitable for: - Development and testing - Short-lived conversations - Scenarios where persistence is not required This is the default store used by `AgentBuilder` when no store is explicitly configured. ## FileMemoryStore When you need conversations to survive restarts, use `FileMemoryStore`. It writes each agent's history to a JSON file on disk after every change and reloads it automatically when the agent initializes. ```java public class FileMemoryStore extends AbstractMemoryStore { public FileMemoryStore(); // uses default dir: .tnsai/memory/ public FileMemoryStore(String storageDirPath); public Path getAgentFile(); // path to agent's JSON file } ``` Each agent's history is stored in a separate file: `{storageDirPath}/{agentId}.json`. Suitable for: - Production environments requiring conversation persistence - Long-running agents that need to resume conversations - Multi-session scenarios Example: ```java FileMemoryStore store = new FileMemoryStore("./data/memory"); store.init("agent-1"); store.addMessage("user", "Hello"); // Automatically saved to ./data/memory/agent-1.json // On restart, history is loaded from file FileMemoryStore restored = new FileMemoryStore("./data/memory"); restored.init("agent-1"); List> history = restored.getHistory(); // Contains previous messages ``` ## SharedMemoryStore In multi-agent systems, you often want several agents to read and write the same conversation history. `SharedMemoryStore` wraps any other `MemoryStore` and adds an access-control layer so only authorized agents can interact with the shared memory. ```java public class SharedMemoryStore implements MemoryStore { public SharedMemoryStore(MemoryStore delegate, String namespace); public void grantAccess(String agentId); public void revokeAccess(String agentId); public boolean hasAccess(String agentId); public String getNamespace(); } ``` Thread safety uses `ReentrantReadWriteLock` for high-throughput concurrent reads with exclusive writes. Only the first authorized agent to call `init()` initializes the underlying store. Calling `init()` with an unauthorized agent ID throws `IllegalStateException`. Example: ```java MemoryStore base = new InMemoryStore(); SharedMemoryStore shared = new SharedMemoryStore(base, "team-chat"); shared.grantAccess("agent-a"); shared.grantAccess("agent-b"); // First agent initializes the shared memory shared.init("agent-a"); // Both agents can read and write shared.addMessage("assistant", "Hello from agent-a"); List> history = shared.getHistory(); // visible to all authorized agents ``` ## SharedMemoryRegistry When you have many shared memory namespaces across your application, `SharedMemoryRegistry` acts as a central lookup. It ensures each namespace has exactly one `SharedMemoryStore` instance, creating one on demand if it does not exist yet. The framework uses this internally when processing `@Memory(shared = true, shareWith = {...})` annotations. ```java public final class SharedMemoryRegistry { public static SharedMemoryRegistry getInstance(); public SharedMemoryStore getOrCreate(String namespace); public SharedMemoryStore getOrCreate(String namespace, MemoryStore baseStore); public Optional find(String namespace); public boolean remove(String namespace); public void clear(); public int size(); } ``` `getOrCreate(String namespace)` creates a `SharedMemoryStore` wrapping an `InMemoryStore` if the namespace does not exist. Use the two-argument overload to provide a custom base store (e.g., `FileMemoryStore` for persistence). Example: ```java SharedMemoryRegistry registry = SharedMemoryRegistry.getInstance(); // Get or create a shared namespace SharedMemoryStore shared = registry.getOrCreate("project-alpha"); shared.grantAccess("researcher"); shared.grantAccess("writer"); // With custom persistence SharedMemoryStore persistent = registry.getOrCreate( "persistent-namespace", new FileMemoryStore("./shared-memory") ); // Look up existing namespace Optional found = registry.find("project-alpha"); // Clean up registry.remove("project-alpha"); ``` ## MemoryStoreFactory If you use the `@MemorySpec` annotation to configure memory declaratively, `MemoryStoreFactory` is what turns that annotation into an actual `MemoryStore` instance at runtime. It reads the annotation's fields and automatically wraps the base store with decorators for capacity limits, token budgets, or summarization as needed. ```java public final class MemoryStoreFactory { public static MemoryStore create(MemorySpec config); public static MemoryStore createDefault(); // returns new InMemoryStore() } ``` Supported persistence types: `IN_MEMORY`, `FILE`, `REDIS` (via SPI), `DATABASE` (via SPI). The factory automatically wraps the base store with decorators based on configuration: - **CapacityAwareMemoryStore** -- enforces message count limits with configurable prune strategy - **SummarizingMemoryStore** -- summarizes old messages instead of deleting them when capacity is exceeded - **TokenAwareMemoryStore** -- enforces token limits by pruning after each message ## Integration with AgentBuilder The most common way to configure memory is through the `AgentBuilder`. Call `.memoryStore()` to plug in any `MemoryStore` implementation. If you skip this step, the agent defaults to `InMemoryStore` (volatile, no persistence). ```java // Default (in-memory, volatile) Agent agent = AgentBuilder.create() .model("claude-sonnet-4") .build(); // Persistent file storage Agent agent = AgentBuilder.create() .model("claude-sonnet-4") .memoryStore(new FileMemoryStore("./data/memory")) .build(); // Shared memory between agents SharedMemoryStore shared = SharedMemoryRegistry.getInstance() .getOrCreate("team-namespace"); shared.grantAccess("agent-1"); shared.grantAccess("agent-2"); Agent agent1 = AgentBuilder.create() .model("claude-sonnet-4") .memoryStore(shared) .build(); Agent agent2 = AgentBuilder.create() .model("gpt-4o") .memoryStore(shared) .build(); ``` ## Code Examples These examples demonstrate typical memory usage patterns, from basic conversation persistence to semantic search and token-aware pruning. ### Conversation with History Management This example creates an agent with file-based memory so conversations survive restarts. You can also access the memory store directly to inspect recent history. ```java Agent agent = AgentBuilder.create() .model("claude-sonnet-4") .memoryStore(new FileMemoryStore("./memory")) .build(); // Conversation persists across restarts agent.chat("What is the capital of France?"); agent.chat("What about Germany?"); // Access history directly MemoryStore memory = agent.getMemoryStore(); List> recent = memory.getRecentHistory(5); ``` ### Semantic Search over History The built-in TF-IDF search lets you find past messages by meaning rather than scrolling through the full history. This is useful for agents that need to recall earlier context from a long conversation. ```java MemoryStore store = new InMemoryStore(); store.init("search-agent"); store.addMessage("user", "Tell me about Java generics"); store.addMessage("assistant", "Java generics provide compile-time type safety..."); store.addMessage("user", "How do I create a REST API?"); store.addMessage("assistant", "You can use Spring Boot or Javalin..."); // TF-IDF search finds relevant messages List results = store.search("generics type safety", 2); // Returns messages about Java generics ranked by relevance ``` ### Token-Aware Pruning LLMs have a maximum context window size. When your conversation history grows too large, call `prune()` to remove the oldest messages until the estimated token count fits within the model's limit. System prompts are preserved when possible. ```java MemoryStore store = new InMemoryStore(); store.init("pruning-agent"); // Add many messages for (int i = 0; i < 1000; i++) { store.addMessage("user", "Message " + i + " with some content"); store.addMessage("assistant", "Response to message " + i); } // Prune to fit context window store.prune(4096); // Oldest messages are removed, keeping history within token limit ``` ## Advanced Memory The `com.tnsai.memory.advanced` package provides production-grade memory capabilities for agents that need more than simple history management. These include vector-based semantic search, keyword-based BM25 indexing, hybrid retrieval that combines both, skill persistence, importance-based pruning, staleness detection, and full session serialization. ### VectorMemoryStore When you need to find past messages based on meaning (not just keywords), `VectorMemoryStore` converts each message into an embedding vector and uses cosine similarity to find the closest matches. This powers true semantic retrieval over conversation history. ```java VectorMemoryStore vectorStore = VectorMemoryStore.builder() .embeddingClient(embeddingClient) .dimensions(1536) .build(); vectorStore.init("agent-1"); vectorStore.addMessage("user", "The deployment uses Kubernetes with 3 replicas"); // Semantic search -- finds relevant messages even with different wording List results = vectorStore.search("container orchestration setup", 5); ``` ### BM25Index While vector search excels at finding semantically similar content, sometimes you need exact keyword matching. `BM25Index` uses the BM25 scoring algorithm (the same approach behind traditional search engines) to rank documents by how well they match specific terms. ```java BM25Index index = new BM25Index(); index.addDocument("doc-1", "Kubernetes deployment configuration"); index.addDocument("doc-2", "Database connection pooling settings"); List results = index.search("Kubernetes", 5); ``` ### HybridMemoryRetriever For the best retrieval quality, combine both approaches. `HybridMemoryRetriever` runs a vector search and a BM25 keyword search in parallel, then merges the results using Reciprocal Rank Fusion (RRF). This captures both semantic meaning and exact keyword matches in a single query. ```java HybridMemoryRetriever retriever = HybridMemoryRetriever.builder() .vectorStore(vectorStore) .bm25Index(bm25Index) .vectorWeight(0.6) .keywordWeight(0.4) .fusionK(60) .build(); List results = retriever.search("deployment configuration", 10); ``` RRF fusion merges the ranked lists from both retrieval methods, giving high-quality results that capture both semantic meaning and exact keyword matches. ### SkillMemoryStore Agents can learn reusable procedures and remember them across sessions. `SkillMemoryStore` saves these skills as Markdown files on disk, making them human-readable, easy to version-control with Git, and editable by hand if needed. ```java SkillMemoryStore skillStore = new SkillMemoryStore(".tnsai/skills/"); skillStore.saveSkill("deploy-k8s", SkillEntry.builder() .name("deploy-k8s") .description("Deploy application to Kubernetes cluster") .steps(List.of("Build Docker image", "Push to registry", "Apply manifests")) .tags(Set.of("devops", "kubernetes")) .build()); Optional skill = skillStore.loadSkill("deploy-k8s"); List devopsSkills = skillStore.findByTag("devops"); ``` ### MemoryImportanceScorer When memory grows too large and needs pruning, not all entries are equally valuable. `MemoryImportanceScorer` assigns an importance score to each entry based on recency, access frequency, semantic distinctiveness, and user-marked importance, so the pruning process can keep the most valuable memories. ```java MemoryImportanceScorer scorer = new MemoryImportanceScorer(); double importance = scorer.score(memoryEntry); // Factors: recency, access frequency, semantic distinctiveness, user-marked importance ``` ### StalenessDetector Over time, memory accumulates near-duplicate or outdated entries that waste context window space. `StalenessDetector` uses Jaccard similarity to find entries that are too similar to each other, and age thresholds to flag entries that are too old, so you can clean them up. ```java StalenessDetector detector = StalenessDetector.builder() .similarityThreshold(0.85) // entries above 85% similarity are stale .maxAge(Duration.ofDays(30)) // entries older than 30 days are candidates .build(); List staleIds = detector.detectStale(memoryEntries); ``` ### SessionSerializer / SessionRestorer If you need to save and restore an agent's complete session state (not just conversation history, but all runtime state), these utilities serialize the session to bytes and restore it later. This is useful for long-running agents that need to survive process restarts without losing their place. ```java // Save session state SessionSerializer serializer = new SessionSerializer(); byte[] data = serializer.serialize(agent.getSession()); Files.write(Path.of("session-backup.bin"), data); // Restore session state SessionRestorer restorer = new SessionRestorer(); AgentSession restored = restorer.restore(Files.readAllBytes(Path.of("session-backup.bin"))); agent.restoreSession(restored); ``` --- # Output Parsing & Serialization URL: https://tnsai.dev/docs/agents/behavior/output-parsing Description: TnsAI provides type-safe output parsing for converting raw LLM responses into structured Java objects, and a multi-format serialization system for producing structured output. import { Callout } from 'fumadocs-ui/components/callout' ## OutputParser\ Interface LLMs return free-form text, but your application usually needs structured Java objects. The `OutputParser` interface defines the contract for converting raw LLM text into a typed object of your choice. It also provides prompt instructions you can include in your LLM call so the model knows what format to produce. | Method | Description | | ----------------------------------------------- | ---------------------------------------------------------- | | `parse(String)` | Returns a `ParseResult` (success or error) | | `parseOrThrow(String)` | Returns `T` or throws `ParseException` | | `parseOptional(String)` | Returns `Optional` | | `getTargetType()` | Returns the `Class` this parser produces | | `getFormatInstruction()` | Prompt text guiding the LLM to produce the expected format | | `getSchemaDescription()` | Schema string (e.g. JSON Schema) for the target type | | `getErrorCorrectionPrompt(failedOutput, error)` | Generates a retry prompt when parsing fails | ```java OutputParser parser = JsonOutputParser.forType(WeatherResponse.class); ParseResult result = parser.parse(llmOutput); if (result.isSuccess()) { WeatherResponse weather = result.get(); } ``` ## JsonOutputParser\ The most commonly used parser. It extracts JSON from LLM responses -- even when the model wraps JSON in markdown code blocks or surrounds it with explanatory text -- and deserializes it into your Java class using Jackson. **Features:** - Extracts JSON from ` ```json ` code blocks - Falls back to raw `{...}` or `[...]` detection - Supports Java Records and POJOs - Field validation via `OutputValidator` - Auto-generates schema descriptions from target type reflection ### Factory method The simplest way to create a `JsonOutputParser` is with the static `forType()` factory. It sets up sensible Jackson defaults and auto-generates schema descriptions from your target class. ```java JsonOutputParser parser = JsonOutputParser.forType(Person.class); ``` ### Builder For more control, use the builder to supply a custom Jackson `ObjectMapper`, enable strict mode (which rejects unknown JSON properties), or plug in a custom validator. ```java JsonOutputParser parser = JsonOutputParser.builder(Person.class) .objectMapper(customMapper) // custom Jackson ObjectMapper .strictMode(true) // fail on unknown properties .validator(customValidator) // custom OutputValidator .build(); ``` ### Parsing LLM output with embedded JSON This example demonstrates the parser's ability to extract a JSON block from an LLM response that includes surrounding explanatory text. The parser automatically locates the JSON within the markdown code fence and deserializes it. ````java record Person(String name, int age) {} JsonOutputParser parser = JsonOutputParser.forType(Person.class); ParseResult result = parser.parse(""" Here's the person data: ```json {"name": "John", "age": 30} ``` """); Person person = result.get(); // Person[name=John, age=30] ```` Default ObjectMapper settings: - `FAIL_ON_UNKNOWN_PROPERTIES = false` - `ACCEPT_SINGLE_VALUE_AS_ARRAY = true` - `INDENT_OUTPUT = true` - `NON_NULL` property inclusion ## RetryableParser\ LLMs occasionally produce malformed output -- missing fields, broken JSON, or wrong structure. `RetryableParser` wraps any parser and handles this automatically: when parsing fails, it sends the LLM an error-correction prompt explaining what went wrong and asks it to try again, up to a configurable number of retries. ### Wrapping a parser To add retry behavior, wrap your existing parser with `RetryableParser.wrap()`. You can optionally configure the maximum number of retries (default is 3). ```java JsonOutputParser baseParser = JsonOutputParser.forType(Person.class); RetryableParser parser = RetryableParser.wrap(baseParser) .maxRetries(3) // default is 3 .build(); ``` ### Manual retry flow If you want to control the retry loop yourself (for example, to use a different LLM for corrections), you can get the correction prompt and send it manually. ```java ParseResult result = parser.parse(llmOutput); if (result.isFailure()) { String correctionPrompt = parser.getCorrectionPrompt(llmOutput, result.getError()); // Send correctionPrompt to LLM, then parse the new response } ``` ### Automatic retry with LLM function The easiest approach: pass `parseWithRetry` a function that calls your LLM. It will automatically loop -- sending correction prompts and re-parsing -- up to `maxRetries` times until parsing succeeds or retries are exhausted. ```java ParseResult result = parser.parseWithRetry(initialOutput, prompt -> { return llmClient.chat(prompt); // your LLM call }); if (result.isSuccess()) { Person person = result.get(); } ``` ### Attempt tracking The retryable parser records every attempt so you can inspect what happened during the retry loop -- useful for debugging and monitoring parse success rates. ```java parser.getAttemptCount(); // total attempts made parser.getAttempts(); // List (output, success, error) parser.getLastAttempt(); // most recent attempt parser.clearAttempts(); // reset history ``` ## ParseResult\ Every parser returns a `ParseResult` instead of throwing exceptions or returning null. It is a monadic result type (similar to Rust's `Result` or Scala's `Either`) that always tells you whether parsing succeeded or failed, and gives you safe access to the value or the error message. ### Construction You create `ParseResult` instances through static factory methods rather than constructors. Parsers return these automatically, but you can also create them yourself for testing or custom parsing logic. | Factory method | Description | | ------------------------------------------------------------------- | ------------------------------ | | `ParseResult.success(value, rawOutput, parseTimeMs)` | Successful parse with timing | | `ParseResult.success(value)` | Successful parse (shorthand) | | `ParseResult.failure(error, rawOutput)` | Failed parse | | `ParseResult.validationFailure(error, validationErrors, rawOutput)` | Failed validation with details | ### Querying These methods let you inspect the result, extract the parsed value, or get error details without risking null pointer exceptions. ```java result.isSuccess(); // true if parsed result.isFailure(); // true if error result.get(); // value or throws IllegalStateException result.getOrElse(defaultVal); // value or fallback result.getError(); // error message (null on success) result.getValidationErrors(); // List validation details result.getRawOutput(); // original LLM text result.getParseTimeMs(); // parse duration in ms result.toOptional(); // Optional ``` ### Transformations Like `Optional` or `Stream`, `ParseResult` supports `map` and `flatMap` so you can transform the parsed value without unwrapping it first. Failures pass through unchanged. ```java // Map to a different type ParseResult nameResult = result.map(User::name); // FlatMap to another ParseResult ParseResult

addr = result.flatMap(user -> parseAddress(user.addressJson())); ``` ### Side effects Use these methods to run an action only when the result is a success or a failure, without needing an `if` statement. This keeps your code concise and readable. ```java // Conditional actions result.ifSuccess(user -> save(user)); result.ifFailure(error -> log.warn(error)); // Handle both cases result.ifSuccessOrElse( user -> System.out.println("Parsed: " + user), error -> System.err.println("Error: " + error) ); ``` ## OutputSerializer Interface While `OutputParser` converts LLM text into Java objects, `OutputSerializer` does the reverse: it converts Java objects into structured text formats (JSON, YAML, etc.) and back again. This is useful when you need to produce output in a specific format, or when re-serializing a parsed result into a different format for downstream consumers. | Method | Description | | ---------------------------------------------- | ------------------------------------------------ | | `getFormat()` | The `OutputFormat` this serializer handles | | `serialize(data, prettyPrint)` | Serialize an object to string | | `serialize(data)` | Serialize with pretty print (default) | | `serializeList(items, itemClass, prettyPrint)` | Serialize a typed list | | `deserialize(data, targetClass)` | Deserialize string to object | | `deserializeList(data, itemClass)` | Deserialize string to typed list | | `getFormatInstructions(targetClass)` | LLM prompt instructions for single-object output | | `getListFormatInstructions(itemClass)` | LLM prompt instructions for list output | | `supportsType(dataClass)` | Check if format supports a data structure | ### OutputFormat enum TnsAI supports five output formats. JSON and YAML are standard, while TOON and TONL are custom token-optimized formats that significantly reduce token usage when sending structured data to LLMs. | Format | Extension | MIME Type | Nesting | Token Efficiency | | ------ | --------- | -------------------- | ------- | --------------------- | | `JSON` | `.json` | `application/json` | Yes | Baseline | | `YAML` | `.yaml` | `application/x-yaml` | Yes | \~10-15% fewer tokens | | `TOON` | `.toon` | `text/x-toon` | Yes | \~40% fewer tokens | | `TONL` | `.tonl` | `text/x-tonl` | Yes | \~32-50% fewer tokens | | `TEXT` | `.txt` | `text/plain` | No | N/A | ### Implementations Each format has a dedicated serializer class. You rarely need to use these directly -- the `OutputSerializerRegistry` provides a simpler API for accessing them. - **JsonOutputSerializer** -- Jackson-based JSON with configurable pretty printing. - **YamlOutputSerializer** -- Zero-dependency YAML with multi-line string support and flow-style compact lists. - **ToonOutputSerializer** -- Token-Optimized Object Notation for uniform arrays. - **TonlOutputSerializer** -- Token-Optimized Notation Language with schema support. - **TextOutputSerializer** -- Plain `toString()` serialization. Deserialization limited to `String` and basic primitives. ### OutputSerializerRegistry The registry is a one-stop shop for serialization. It holds all built-in serializers and provides convenience methods so you do not need to look up or instantiate serializers yourself. You can also register custom serializers here. ```java OutputSerializerRegistry registry = OutputSerializerRegistry.getInstance(); // Get a specific serializer OutputSerializer jsonSerializer = registry.getSerializer(OutputFormat.JSON); String json = jsonSerializer.serialize(myObject); // Convenience methods String yaml = registry.serialize(myObject, OutputFormat.YAML); Person person = registry.deserialize(jsonString, OutputFormat.JSON, Person.class); // LLM prompt instructions String instructions = registry.getFormatInstructions(OutputFormat.JSON, Person.class); // Register a custom serializer registry.register(OutputFormat.JSON, new CustomJsonSerializer()); // Reset to defaults registry.reset(); ``` ## Full Example This end-to-end example shows the complete workflow: defining an output type as a Java record, creating a parser with retry support, including format instructions in the prompt, parsing the LLM response with automatic correction, handling the result, and re-serializing to a different format. ```java // 1. Define output type record AnalysisResult(String summary, List issues, double score) {} // 2. Create parser with retry JsonOutputParser baseParser = JsonOutputParser.forType(AnalysisResult.class); RetryableParser parser = RetryableParser.wrap(baseParser) .maxRetries(2) .build(); // 3. Include format instructions in the prompt String prompt = "Analyze this code.\n\n" + baseParser.getFormatInstruction(); // 4. Parse LLM response with automatic retry String llmResponse = llmClient.chat(prompt); ParseResult result = parser.parseWithRetry(llmResponse, llmClient::chat); // 5. Handle result result.ifSuccessOrElse( analysis -> { System.out.println("Score: " + analysis.score()); analysis.issues().forEach(issue -> System.out.println("- " + issue)); }, error -> System.err.println("Parse failed after retries: " + error) ); // 6. Re-serialize to a different format if (result.isSuccess()) { OutputSerializerRegistry registry = OutputSerializerRegistry.getInstance(); String yaml = registry.serialize(result.get(), OutputFormat.YAML); System.out.println(yaml); } ``` --- # Prompt Strategies URL: https://tnsai.dev/docs/agents/behavior/prompt-strategies Description: TnsAI includes a prompt enhancement system that applies proven prompting techniques to improve LLM response quality. The system is built around the PromptStrategy enum, PromptEnhancer builder, and EnhancedPrompt output. import { Callout } from 'fumadocs-ui/components/callout' **Package:** `com.tnsai.prompt.strategy` ## PromptStrategy Enum Prompt strategies are research-backed techniques that improve LLM response quality by structuring how the model approaches a problem. Instead of writing complex system prompts by hand, you pick one or more strategies and the framework generates the right instructions automatically. Twelve predefined strategies are available, based on techniques from OpenAI, Anthropic, and Google AI research. | Strategy | Description | Multi-Pass | Post-Processing | | ----------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | :--------: | :-------------: | | `CHAIN_OF_THOUGHT` | Step-by-step reasoning before final answer. Best for math, logic, multi-step analysis. | No | No | | `CHAIN_OF_VERIFICATION` | Self-verification with generated questions. Initial answer, then verify, then refine. Reported 60% to 92% accuracy improvement on complex queries. | Yes | Yes | | `CONFIDENCE_WEIGHTED` | Includes confidence score (0--100%), key assumptions, and alternatives when confidence is below threshold. | No | No | | `STRUCTURED_THINKING` | Four-phase protocol: UNDERSTAND, ANALYZE, STRATEGIZE, EXECUTE. | No | No | | `MULTI_PERSPECTIVE` | Examines problem from Technical, Business, User Experience, and Risk perspectives, then synthesizes a balanced recommendation. | No | No | | `CONSTRAINT_FIRST` | Separates hard constraints (must satisfy) from soft preferences (nice to have) before proceeding. | No | No | | `ITERATIVE_REFINEMENT` | Multi-pass generation: Draft, Critique, Refine, Review. | Yes | Yes | | `CONTEXT_BOUNDARIES` | Clear separation of Context, Focus, Task, and Constraints. Explicitly flags insufficient information. | No | No | | `FEW_SHOT_EXAMPLES` | Learns from positive and negative examples to guide response format and quality. | No | No | | `META_PROMPTING` | AI designs the optimal prompt for the task first, then responds to it. | Yes | No | | `SIX_PART_ANATOMY` | Comprehensive structure: Role, Objective, Request, Process, Output, Stop Condition. | No | No | | `ATOM_OF_THOUGHT` | Decomposes problems into independent atoms solved in parallel, then synthesizes. Unlike CoT, errors are isolated per atom. +30--40% accuracy on complex reasoning, +20--30% token usage. Best for 70B+ parameter models. | Yes | No | ### Strategy Methods Each strategy enum value provides these methods to retrieve its system instruction text and to check whether it requires multiple LLM passes or post-processing. | Method | Return Type | Description | | -------------------------- | ----------- | --------------------------------------------------------------------------------------------------- | | `getSystemInstruction()` | `String` | The full system instruction text injected for this strategy | | `getDescription()` | `String` | Short human-readable description | | `requiresPostProcessing()` | `boolean` | `true` for `CHAIN_OF_VERIFICATION` and `ITERATIVE_REFINEMENT` | | `isMultiPass()` | `boolean` | `true` for `CHAIN_OF_VERIFICATION`, `ITERATIVE_REFINEMENT`, `META_PROMPTING`, and `ATOM_OF_THOUGHT` | ## PromptEnhancer Builder `PromptEnhancer` is where you assemble your prompting configuration. It is a fluent builder that lets you combine multiple strategies, set a role and objective, define constraints, provide few-shot examples, and choose an output format. When you call `.enhance()`, it compiles everything into an `EnhancedPrompt` ready to send to the LLM. ### Quick Construction If you only need a single strategy with no extra configuration, use the shorthand factory method. ```java // Single strategy PromptEnhancer enhancer = PromptEnhancer.withStrategy(PromptStrategy.CHAIN_OF_THOUGHT); EnhancedPrompt prompt = enhancer.enhance("Solve: 2x + 5 = 15"); ``` ### Full Builder API For more control, use the builder. You can combine multiple strategies, set a role and objective, add constraints and preferences, provide positive and negative examples, define process steps, and choose an output format. ```java PromptEnhancer enhancer = PromptEnhancer.builder() .strategy(PromptStrategy.CHAIN_OF_THOUGHT) .strategy(PromptStrategy.CONFIDENCE_WEIGHTED) .role("Expert mathematician") .objective("Solve algebra problems accurately") .constraint("Show all work") .constraint("Verify answer by substitution") .constraints(List.of("Use standard notation", "Simplify")) .softPreference("Explain in simple terms") .softPreferences(List.of("Use examples", "Keep it concise")) .positiveExample("2x = 10", "x = 5", "Correct division by 2") .positiveExample("3x + 1 = 7", "x = 2") .negativeExample("2x = 10", "x = 10", "Forgot to divide") .processStep("Parse the equation") .processStep("Isolate the variable") .process(List.of("Simplify", "Verify")) .outputFormat(OutputFormat.STRUCTURED) .stopCondition("Stop after verification pass") .verificationQuestions(5) // CoVe: number of verification questions .refinementPasses(3) // Iterative Refinement: number of passes .confidenceThreshold(0.7f) // Confidence Weighted: threshold for alternatives .build(); ``` ### Builder Methods The complete list of builder methods. All setter methods are chainable and can be called in any order. | Method | Parameter | Description | | ----------------------------------------- | -------------------------- | -------------------------------------------------------------- | | `strategy(PromptStrategy)` | strategy | Adds a prompting strategy (chainable, multiple allowed) | | `role(String)` | role | Sets the role/persona (e.g., "Expert researcher") | | `objective(String)` | objective | Sets the high-level goal | | `constraint(String)` | constraint | Adds a single hard constraint | | `constraints(List)` | constraints | Adds multiple hard constraints | | `softPreference(String)` | preference | Adds a single soft preference | | `softPreferences(List)` | preferences | Adds multiple soft preferences | | `positiveExample(String, String)` | input, output | Adds a good example | | `positiveExample(String, String, String)` | input, output, explanation | Adds a good example with reasoning | | `negativeExample(String, String, String)` | input, badOutput, whyBad | Adds a bad example with explanation | | `processStep(String)` | step | Adds a single numbered process step | | `process(List)` | steps | Adds multiple process steps | | `outputFormat(OutputFormat)` | format | Sets the response output format | | `stopCondition(String)` | condition | Sets the completion/stop condition | | `verificationQuestions(int)` | count | Number of verification questions (for `CHAIN_OF_VERIFICATION`) | | `refinementPasses(int)` | passes | Number of refinement passes (for `ITERATIVE_REFINEMENT`) | | `confidenceThreshold(float)` | threshold | Threshold for showing alternatives (for `CONFIDENCE_WEIGHTED`) | ### OutputFormat Enum This enum tells the LLM what format to use in its response. The framework appends the corresponding instruction to the system prompt so the model produces output in the desired structure. `PromptEnhancer.OutputFormat` controls the response format instruction. | Value | Instruction | | ------------------ | --------------------------------------------------- | | `TEXT` | Provide your response as plain text. | | `JSON` | Provide your response as valid JSON. | | `MARKDOWN` | Format your response using Markdown. | | `MARKDOWN_TABLE` | Format your response as a Markdown table. | | `BULLET_POINTS` | Use bullet points for your response. | | `NUMBERED_LIST` | Use a numbered list for your response. | | `STRUCTURED` | Use clear sections with headers for your response. | | `CODE` | Provide code with comments explaining each section. | | `COMPARISON_TABLE` | Create a comparison table with pros/cons. | ### Example Record When using the `FEW_SHOT_EXAMPLES` strategy, you teach the LLM by showing it input/output pairs. The `Example` record holds one such pair, with an optional explanation of why the output is correct (or incorrect for negative examples). ```java // record Example(String input, String output, String explanation) new PromptEnhancer.Example("2x = 10", "x = 5", "Divided both sides by 2"); new PromptEnhancer.Example("2x = 10", "x = 5"); // explanation is optional (null) ``` ### Enhancer Instance Methods Once built, the `PromptEnhancer` instance provides these methods. The main one is `enhance()`, which takes a user message and returns a fully assembled `EnhancedPrompt`. | Method | Return Type | Description | | ----------------------------- | ---------------------- | ------------------------------------------------------------------ | | `enhance(String userMessage)` | `EnhancedPrompt` | Applies all configured strategies and produces the enhanced prompt | | `getStrategies()` | `List` | Returns the configured strategies | | `requiresMultiPass()` | `boolean` | `true` if any strategy is multi-pass | ## EnhancedPrompt `EnhancedPrompt` is what you get after calling `enhance()`. It holds the assembled system prompt (with all strategy instructions baked in), the original user message, and metadata about which strategies are active. Pass it directly to your `LLMClient` to make the enhanced call. ### Methods These methods let you access the prompt content and check which capabilities the enhanced prompt expects from the LLM response. | Method | Return Type | Description | | ----------------------------- | ------------------------ | --------------------------------------------------------------------------------------------------- | | `getSystemPrompt()` | `String` | The full system prompt with all strategy instructions | | `getUserMessage()` | `String` | The original user message | | `getSystemPromptOptional()` | `Optional` | System prompt wrapped in `Optional` for LLMClient convenience | | `getStrategies()` | `List` | List of applied strategies | | `getOutputFormat()` | `Optional` | The output format, if specified | | `requiresMultiPass()` | `boolean` | `true` if any applied strategy is multi-pass | | `requiresPostProcessing()` | `boolean` | `true` if any strategy needs post-processing | | `expectsConfidenceScore()` | `boolean` | `true` if `CONFIDENCE_WEIGHTED` is applied | | `expectsStructuredThinking()` | `boolean` | `true` if `STRUCTURED_THINKING` is applied | | `expectsVerification()` | `boolean` | `true` if `CHAIN_OF_VERIFICATION` is applied | | `getCombinedPrompt()` | `String` | System prompt + user message in a single string (for models without separate system prompt support) | | `getEstimatedOverhead()` | `int` | Estimated additional tokens from enhancement (\~4 chars/token) | ### Using EnhancedPrompt with LLMClient Here is how to pass the enhanced prompt to your LLM client. Most providers accept a separate system prompt; for those that do not, use `getCombinedPrompt()` to get a single string. ```java EnhancedPrompt prompt = enhancer.enhance("What causes inflation?"); // With separate system prompt support ChatResponse response = client.chat( prompt.getUserMessage(), prompt.getSystemPromptOptional(), Optional.empty(), Optional.empty() ); // Without system prompt support String combined = prompt.getCombinedPrompt(); ``` ## Integration with AgentBuilder You can wire prompt strategies directly into an agent through the `AgentBuilder`, so every message the agent processes is automatically enhanced. There are three approaches: adding individual strategies, adding a list of strategies, or providing a fully configured `PromptEnhancer`. ```java // Add individual strategies Agent agent = AgentBuilder.create() .llm(llmClient) .role(myRole) .promptStrategy(PromptStrategy.CHAIN_OF_THOUGHT) .promptStrategy(PromptStrategy.CONFIDENCE_WEIGHTED) .build(); // Add multiple strategies at once Agent agent = AgentBuilder.create() .llm(llmClient) .promptStrategies(List.of( PromptStrategy.STRUCTURED_THINKING, PromptStrategy.MULTI_PERSPECTIVE )) .build(); // Use a fully configured PromptEnhancer PromptEnhancer enhancer = PromptEnhancer.builder() .role("Expert researcher") .objective("Provide accurate information") .strategy(PromptStrategy.CHAIN_OF_VERIFICATION) .constraint("Always cite sources") .positiveExample("Question", "Good answer", "Why it's good") .build(); Agent agent = AgentBuilder.create() .llm(llmClient) .promptEnhancer(enhancer) .build(); ``` | AgentBuilder Method | Description | | ----------------------------------------- | -------------------------------- | | `.promptStrategy(PromptStrategy)` | Adds a single strategy | | `.promptStrategies(List)` | Adds multiple strategies at once | | `.promptEnhancer(PromptEnhancer)` | Sets a fully configured enhancer | ## Code Examples These examples show how to apply different strategies to real-world use cases. Each one demonstrates a different prompting technique suited to the task at hand. ### Chain-of-Thought for Math Chain-of-Thought prompting asks the model to show its step-by-step reasoning before giving a final answer. This significantly improves accuracy on math, logic, and multi-step analysis tasks. ```java PromptEnhancer enhancer = PromptEnhancer.builder() .strategy(PromptStrategy.CHAIN_OF_THOUGHT) .role("Mathematics tutor") .outputFormat(OutputFormat.STRUCTURED) .build(); EnhancedPrompt prompt = enhancer.enhance( "A train travels 120km in 2 hours. It then travels 180km in 3 hours. " + "What is its average speed for the entire journey?" ); ``` ### Chain-of-Verification for Fact Checking Chain-of-Verification (CoVe) makes the model generate an initial answer, then create verification questions to check its own claims, and finally refine the answer based on what it finds. This is a multi-pass strategy that dramatically reduces factual errors. ```java PromptEnhancer enhancer = PromptEnhancer.builder() .strategy(PromptStrategy.CHAIN_OF_VERIFICATION) .verificationQuestions(5) .constraint("Each claim must be independently verifiable") .build(); EnhancedPrompt prompt = enhancer.enhance("What were the causes of World War I?"); if (prompt.requiresMultiPass()) { // Handle multi-pass verification flow } ``` ### Atom-of-Thought for Complex Reasoning Atom-of-Thought decomposes a complex problem into independent "atoms" that can be solved in parallel, then synthesizes the results. Unlike Chain-of-Thought, errors in one atom do not cascade to others. This works best with large models (70B+ parameters) and is combined here with confidence scoring. ```java PromptEnhancer enhancer = PromptEnhancer.builder() .strategy(PromptStrategy.ATOM_OF_THOUGHT) .strategy(PromptStrategy.CONFIDENCE_WEIGHTED) .confidenceThreshold(0.7f) .objective("Analyze system architecture tradeoffs") .build(); EnhancedPrompt prompt = enhancer.enhance( "Compare microservices vs monolith for a 10-person startup " + "building a real-time analytics platform" ); // Estimated overhead: prompt.getEstimatedOverhead() tokens ``` ### Few-Shot with Examples Few-shot prompting teaches the model by example. You provide a few input/output pairs (both good and bad), and the model learns the expected format and quality from them. This is especially effective for classification, formatting, and style-matching tasks. ```java PromptEnhancer enhancer = PromptEnhancer.builder() .strategy(PromptStrategy.FEW_SHOT_EXAMPLES) .positiveExample( "The food was great", "Sentiment: POSITIVE (0.95)", "Clear positive language" ) .positiveExample( "Terrible service, never again", "Sentiment: NEGATIVE (0.98)", "Strong negative indicators" ) .negativeExample( "The food was great", "positive", "Missing confidence score and proper format" ) .outputFormat(OutputFormat.TEXT) .build(); EnhancedPrompt prompt = enhancer.enhance("The product works but could be better"); ``` --- # Streaming URL: https://tnsai.dev/docs/agents/behavior/streaming Description: TnsAI supports three streaming modes for real-time token delivery from LLM providers. import { Callout } from 'fumadocs-ui/components/callout' ## Token Streaming Returns text tokens as they are generated — simplest mode: ```java Stream tokens = agent.streamChat("Explain relativity"); tokens.forEach(System.out::print); ``` ## ChatChunk Streaming Returns typed chunks with metadata (token counts, finish reason, tool calls): ```java llmClient.streamChatWithSpec(request).forEach(chunk -> { switch (chunk.getType()) { case START -> System.out.println("Stream started: " + chunk.getModel()); case CONTENT -> System.out.print(chunk.getContent()); case TOOL_CALL -> handleToolCall(chunk.getToolCall().orElseThrow()); case DONE -> System.out.println("\nTokens: " + chunk.getTokenCount()); case ERROR -> System.err.println("Error: " + chunk.getContent()); } }); ``` ### Chunk Types Each `ChatChunk` has a type that tells you what kind of data it carries. Your code should handle each type to respond appropriately as the stream progresses. | Type | Description | | ----------- | ------------------------------------- | | `START` | Stream initialization with model info | | `CONTENT` | Text content delta | | `TOOL_CALL` | Tool/function invocation request | | `DONE` | Stream complete with finish reason | | `ERROR` | Error occurred during streaming | ### Finish Reasons When a stream ends, the `DONE` chunk includes a finish reason that explains why the LLM stopped generating. This helps you decide what to do next -- for example, if the reason is `TOOL_CALLS`, you need to execute the requested tool and feed the result back. | Reason | Description | | ---------------- | ----------------------- | | `STOP` | Natural completion | | `LENGTH` | Max tokens reached | | `TOOL_CALLS` | LLM wants to call tools | | `CONTENT_FILTER` | Content was filtered | | `ERROR` | Error during generation | ## Handler-Based Streaming Callback pattern with full tool-call loop — ideal for UI integration: ```java llmClient.streamChatWithHandler(request, chunk -> { if (chunk.isContent()) { System.out.print(chunk.getContent()); } else if (chunk.isToolCall()) { // Framework handles tool execution automatically } else if (chunk.isDone()) { System.out.println("\nFinish reason: " + chunk.getFinishReason()); } }); ``` ## Convenience Methods `ChatChunk` provides static factory methods so you can create chunks without calling constructors directly. These are useful when you build custom streaming pipelines or write tests that simulate LLM output. ```java // ChatChunk factory methods ChatChunk.start(model, requestId); ChatChunk.content("Hello", tokenCount, index); ChatChunk.content("Hello"); ChatChunk.toolCall(toolCallObject); ChatChunk.done(FinishReason.STOP, totalTokens); ChatChunk.error("Something went wrong"); ``` ## Which Mode to Use? TnsAI offers three streaming modes at different levels of abstraction. Pick the simplest one that meets your needs. | Mode | Use When | | -------------------- | --------------------------------------------------- | | **Token Stream** | Simple text display, CLI output | | **ChatChunk Stream** | Need metadata (tokens, model), manual tool handling | | **Handler-Based** | UI integration, automatic tool execution loop | ## Async Execution The `AsyncAgent` interface (`com.tnsai.agents.async.AsyncAgent`) provides non-blocking chat operations with multiple consumption patterns. ### Methods `AsyncAgent` exposes several ways to consume responses. Choose based on whether you need simple text, typed events, or reactive backpressure control. | Method | Return Type | Description | | ----------------------------- | --------------------------- | ----------------------------------------------------------- | | `chatAsync(message)` | `CompletableFuture` | Async chat, completes with full response | | `chatAsync(message, options)` | `CompletableFuture` | Async chat with `ChatOptions` | | `chatStream(message)` | `Stream` | Streaming tokens as a Java Stream | | `chatEventStream(message)` | `Stream` | Typed event stream (tokens, tool calls, etc.) | | `chatPublisher(message)` | `Flow.Publisher` | Reactive Streams publisher for backpressure-aware consumers | | `cancel()` | `void` | Cancels any ongoing async operation | | `isProcessing()` | `boolean` | True if an async operation is in progress | | `getProgress()` | `double` | Execution progress (0.0 - 1.0) | ### CompletableFuture The simplest async pattern. `chatAsync` returns a `CompletableFuture` that completes with the full response string once the LLM finishes generating. Use this when you do not need to show partial results to the user. ```java AsyncAgent agent = new MyAsyncAgent(); agent.chatAsync("Tell me about Java") .thenAccept(response -> System.out.println(response)) .exceptionally(e -> { e.printStackTrace(); return null; }); ``` ### Token Stream Returns a `Stream` that emits each text token as it arrives. This lets you print tokens to the console (or a UI) incrementally instead of waiting for the full response. ```java agent.chatStream("Tell me a story") .forEach(token -> System.out.print(token)); ``` ### Typed Event Stream `ChatEvent` subtypes distinguish tokens from tool calls and other events: ```java agent.chatEventStream("Complex task") .forEach(event -> { if (event instanceof ChatEvent.Token t) { System.out.print(t.content()); } else if (event instanceof ChatEvent.ToolCall tc) { System.out.println("Calling tool: " + tc.toolName()); } }); ``` ### Reactive Publisher For backpressure-aware consumers using `java.util.concurrent.Flow`: ```java agent.chatPublisher("Generate a report") .subscribe(new Flow.Subscriber<>() { private Flow.Subscription subscription; @Override public void onSubscribe(Flow.Subscription s) { this.subscription = s; s.request(1); } @Override public void onNext(ChatEvent event) { process(event); subscription.request(1); } @Override public void onError(Throwable t) { t.printStackTrace(); } @Override public void onComplete() { System.out.println("Done"); } }); ``` ### Cancellation You can cancel a running async operation at any time. This is useful for timeout handling or when the user navigates away from a page before the response finishes. ```java CompletableFuture future = agent.chatAsync("Long running task"); // Cancel if still running if (agent.isProcessing()) { agent.cancel(); } ``` --- # Agent Variants URL: https://tnsai.dev/docs/agents/behavior/variants Description: Agent variants let you trade off between response quality, execution speed, and token cost. A single agent can switch variants at runtime -- per task or per action. import { Callout } from 'fumadocs-ui/components/callout' ## AgentVariant Enum The four variant tiers represent different quality/speed/cost tradeoffs. Pick the one that matches your task, or use `AUTO` to let the framework decide at runtime based on task complexity. Defined in `com.tnsai.enums.AgentVariant`. Four tiers: | Variant | Quality | Speed | Cost | Best For | | -------- | -------------- | ------------ | ------------ | -------------------------------------------------- | | `HIGH` | Max (1.0) | Slow (0.3) | High (1.0) | Complex refactoring, security review, architecture | | `MEDIUM` | Balanced (0.7) | Normal (0.6) | Medium (0.5) | Regular development, feature implementation | | `MINI` | Basic (0.4) | Fast (1.0) | Low (0.2) | Quick fixes, typo corrections, simple queries | | `AUTO` | Adaptive | Adaptive | Optimal | Production environments with varied workloads | ### Helper methods These convenience methods let you check what a variant prioritizes without comparing enum values directly. ```java variant.isQualityFocused(); // true for HIGH, MEDIUM variant.isSpeedFocused(); // true for MINI variant.isCostOptimized(); // true for MINI, AUTO ``` ### Task-based suggestion If you are not sure which variant to use, `forTask()` analyzes the task description and suggests one based on keyword matching. This is a simple heuristic -- for smarter auto-selection, use `VariantManager` with auto mode enabled. ```java AgentVariant.forTask("Refactor the auth system"); // HIGH AgentVariant.forTask("Fix a typo in README"); // MINI AgentVariant.forTask("Implement login page"); // MEDIUM ``` Keywords that push toward HIGH: `refactor`, `architect`, `complex`, `critical`, `security`, `review`. Keywords that push toward MINI: `typo`, `fix`, `simple`, `quick`, `minor`, `small`. ## VariantSpec Each variant tier has a `VariantSpec` that defines its concrete settings: which LLM model to use, token limits, available tools, timeout, retry count, and temperature. You can use the built-in specs or build a custom one for your specific models and requirements. ### Predefined specs The built-in specs for each tier ship with sensible defaults for model selection, token limits, and timeouts. These are what you get when you use a variant without customization. | | HIGH | MEDIUM | MINI | | --------------------- | ------------------------- | ---------------------------- | ------------------------------------ | | **Preferred model** | `claude-opus-4` | `claude-sonnet-4` | `claude-haiku-3` | | **Fallback models** | `gpt-4`, `gemini-1.5-pro` | `gpt-4o`, `gemini-1.5-flash` | `gpt-4o-mini`, `gemini-1.5-flash-8b` | | **Max input tokens** | 128,000 | 64,000 | 32,000 | | **Max output tokens** | 16,384 | 8,192 | 4,096 | | **Tool set** | ALL | STANDARD | MINIMAL | | **Timeout** | 10 min | 5 min | 2 min | | **Max retries** | 3 | 2 | 1 | | **Temperature** | 0.7 | 0.5 | 0.3 | | **Streaming** | Yes | Yes | No | `AUTO` defaults to the MEDIUM spec and adjusts dynamically at runtime. ### Using predefined specs Retrieve the built-in spec for a variant tier with `VariantSpec.forVariant()` and query its settings. ```java VariantSpec highSpec = VariantSpec.forVariant(AgentVariant.HIGH); String model = highSpec.getPreferredModel(); // "claude-opus-4" int inputTokens = highSpec.getMaxInputTokens(); // 128000 Duration timeout = highSpec.getTimeout(); // PT10M ``` ### Building a custom spec When the built-in specs do not match your environment (different models, different limits), build a custom one. Custom specs are immutable -- once built, they cannot be changed. ```java VariantSpec custom = VariantSpec.builder() .variant(AgentVariant.HIGH) .preferredModel("claude-opus-4") .fallbackModels(List.of("gpt-4", "gemini-1.5-pro")) .maxInputTokens(128000) .maxOutputTokens(16384) .toolSet(VariantSpec.ToolSet.ALL) .timeout(Duration.ofMinutes(10)) .maxRetries(3) .temperature(0.7) .enableStreaming(true) .addSetting("customKey", "value") .build(); ``` ### Model resolution When the preferred model is unavailable (API outage, not provisioned), `getEffectiveModel` automatically falls back to the next available model from the fallback list. ```java Set available = Set.of("gpt-4o", "claude-haiku-3"); String model = highSpec.getEffectiveModel(available); // "gpt-4o" (fallback) ``` ### Immutable copies Since specs are immutable, changing a field returns a new `VariantSpec` instance. The original is not modified. ```java VariantSpec modified = spec.withVariant(AgentVariant.MEDIUM); VariantSpec remodeled = spec.withModel("gpt-4-turbo"); ``` ### ToolSet levels The `ToolSet` controls which tools are available to the agent in a given variant. Lower tiers restrict tool access to reduce cost and latency. | Level | Description | | ---------- | -------------------- | | `ALL` | All available tools | | `STANDARD` | Most common tools | | `MINIMAL` | Essential tools only | | `NONE` | No tools | ## VariantManager The `VariantManager` handles variant switching at runtime. It can operate in manual mode (you choose the variant) or auto mode (it analyzes each task and picks the best tier). It also tracks usage statistics per variant, so you can see how often each tier is used and how well it performs. ### Creating a manager Create a `VariantManager` with a default variant. If not specified, it defaults to `MEDIUM`. ```java VariantManager manager = new VariantManager(); // defaults to MEDIUM VariantManager manager = new VariantManager(AgentVariant.HIGH); // explicit initial ``` ### Manual switching Explicitly set the variant when you know what quality level the next task needs. ```java manager.setVariant(AgentVariant.HIGH); AgentVariant current = manager.getCurrentVariant(); // HIGH VariantSpec spec = manager.getCurrentSpec(); // VariantSpec for HIGH ``` ### Auto mode In auto mode, the manager analyzes each task description and automatically switches to the most appropriate variant. This is ideal for production environments where tasks vary in complexity. ```java manager.setAutoMode(true); // Analyzes task complexity (0-10 score) and switches automatically AgentVariant suggested = manager.suggestVariant("Refactor the authentication system"); // suggested = HIGH, manager now using HIGH spec ``` Complexity scoring adds/subtracts from a base score of 5. Score `>= 7` returns HIGH, score `<= 3` returns MINI, otherwise MEDIUM. Task length is also considered (`>200` chars adds 1, `<50` chars subtracts 1). ### Custom specs per variant Override the default spec for any variant tier. This is useful when you want to use a different model or different token limits for a specific tier in your environment. ```java VariantSpec custom = VariantSpec.builder() .variant(AgentVariant.HIGH) .preferredModel("my-custom-model") .maxInputTokens(200000) .build(); manager.setVariantSpec(AgentVariant.HIGH, custom); ``` ### Change listeners Register callbacks to be notified whenever the variant changes, whether manually or through auto mode. This is useful for logging, metrics, or adjusting other system behavior based on the active variant. ```java // Register VariantManager.Registration reg = manager.onVariantChange(event -> { System.out.printf("Variant: %s -> %s (reason: %s)%n", event.previous(), event.current(), event.reason()); }); // Unregister reg.unregister(); ``` The `VariantChangeEvent` record contains `previous`, `current`, and `reason` (either `"manual"` or `"auto:"`). ### Usage statistics The manager tracks task count, success rate, and timing per variant. Use this data to understand your cost distribution and identify variants that are underperforming. ```java manager.recordTask(AgentVariant.HIGH, 1500, true); VariantManager.VariantStats stats = manager.getStats(AgentVariant.HIGH); stats.getTaskCount(); // total tasks stats.getSuccessRate(); // 0.0 - 1.0 stats.getAverageDurationMs(); // average task time stats.getMinDurationMs(); stats.getMaxDurationMs(); // All stats Map all = manager.getAllStats(); ``` ## @Variant Annotation Some actions always need a specific quality level regardless of the agent's current setting -- a security audit should always use HIGH, while a text formatter can always use MINI. The `@Variant` annotation locks an action method to a specific variant tier. The framework temporarily switches to that variant for the duration of the action, then restores the previous one. | Attribute | Type | Default | Description | | ------------- | -------------- | ---------- | -------------------------------------------------- | | `value` | `AgentVariant` | (required) | Variant to use | | `reason` | `String` | `""` | Documentation for variant choice | | `recordStats` | `boolean` | `true` | Track usage statistics | | `fallback` | `AgentVariant` | `MEDIUM` | Fallback if primary variant's model is unavailable | ```java // Force HIGH for security-critical action @ActionSpec(type = ActionType.LLM, description = "Security analysis") @Variant(AgentVariant.HIGH) public String analyzeSecurityRisks(String code) { return "Analyze security: " + code; } // Use MINI for a quick utility @ActionSpec(type = ActionType.LOCAL, description = "Format text") @Variant(AgentVariant.MINI) public String formatText(String text) { return text.trim(); } // Let the framework auto-select based on input @ActionSpec(type = ActionType.LLM, description = "Code review") @Variant(value = AgentVariant.AUTO, reason = "Complexity varies by input size") public String reviewCode(String code) { return "Review: " + code; } ``` Actions without `@Variant` use the agent's current variant. The annotation only affects the specific method it decorates. ## Full Example This end-to-end example shows how to set up an agent with auto variant selection, log all variant switches, run tasks of varying complexity, override the variant at runtime, and check usage statistics afterward. ```java // Configure agent with variant support VariantManager variantManager = new VariantManager(AgentVariant.AUTO); variantManager.setAutoMode(true); // Log all variant switches variantManager.onVariantChange(event -> log.info("Variant {} -> {} ({})", event.previous(), event.current(), event.reason())); Agent agent = AgentBuilder.create() .withVariant(AgentVariant.AUTO) .llm(new OpenAIClient()) .build(); // Simple task -- framework auto-selects MINI agent.chat("Fix the typo in line 42"); // Complex task -- framework auto-selects HIGH agent.chat("Refactor the authentication module for OAuth2 support"); // Override at runtime agent.setVariant(AgentVariant.HIGH); agent.chat("Critical security audit of payment processing"); // Check statistics VariantManager.VariantStats highStats = variantManager.getStats(AgentVariant.HIGH); System.out.printf("HIGH: %d tasks, %.0f%% success, avg %dms%n", highStats.getTaskCount(), highStats.getSuccessRate() * 100, highStats.getAverageDurationMs()); ``` --- # Action System URL: https://tnsai.dev/docs/agents/fundamentals/action-system Description: The action system is the execution backbone of TnsAI. When an LLM decides to call a function, or an agent needs to perform work, the request flows through ActionExecutor, which routes it to the appropriate executor based on the action's ActionType. import { Callout } from 'fumadocs-ui/components/callout' ## Architecture Overview ``` Agent.executeAction(name, params) | v ActionExecutor.execute(action, role, params, context) | +-- 1. Validate inputs +-- 2. Check @ApprovalRequired +-- 3. Route by ActionType: | LOCAL -> Reflection invocation on Role | WEB_SERVICE -> WebServiceExecutor (HTTP) | LLM -> LLMRoleExecutor (single-shot LLM call) | MCP_TOOL -> McpToolExecutor (MCP protocol) +-- 4. Return result or wrap in ActionExecutionException ``` Tool dispatch — when the LLM emits a tool call during an `LLM` action — is handled separately by the agent's `ToolMethodDispatcher`, which is built from every POJO and dynamic tool registered with `AgentBuilder`. ## ActionType Enum `com.tnsai.enums.ActionType` defines the four execution methods: | Value | Executor | Description | | ------------- | -------------------- | -------------------------------------------------------------------------- | | `LOCAL` | Reflection | Direct Java method invocation on the Role | | `WEB_SERVICE` | `WebServiceExecutor` | HTTP REST API calls | | `LLM` | `LLMRoleExecutor` | Single-shot LLM call; tool dispatch via the agent's `ToolMethodDispatcher` | | `MCP_TOOL` | `McpToolExecutor` | Model Context Protocol server calls | ## ActionExecutor `com.tnsai.actions.ActionExecutor` is the central, thread-safe dispatcher. ### Construction ```java // Built by AgentBuilder using the agent's tool registry ActionExecutor executor = new ActionExecutor(toolMethodDispatcher); ``` Default executors are registered automatically: `WebServiceExecutor` for `WEB_SERVICE`, `LLMRoleExecutor` for `LLM`, and `McpToolExecutor` for `MCP_TOOL` (if tnsai-mcp is on classpath). The `ToolMethodDispatcher` is built from `AgentBuilder.builtInTools(...)`, `.toolPojos(...)`, and `.dynamicTool(...)` registrations. ### Execution Flow 1. **Validation** -- null checks on action, role, parameters 2. **Approval check** -- if the action method has `@ApprovalRequired`, verifies `_approval_token` is present in parameters 3. **Routing** -- `LOCAL` actions are invoked via reflection; others delegate to `TypedActionExecutor` 4. **Error wrapping** -- all exceptions become `ActionExecutionException` with a category (`PARAMETER`, `INVOCATION`, `NETWORK`, `UNKNOWN`) ### Approval-Required Actions ```java // In a Role class @ApprovalRequired @ActionSpec(description = "Deploy to production") public String deploy(String target) { ... } // Calling with approval token Map params = Map.of( "target", "production", "_approval_token", approvalService.getToken() ); agent.executeAction("deploy", params); ``` If the token is missing, `ApprovalRequiredException` is thrown. ## TypedActionExecutor Interface `com.tnsai.actions.executors.TypedActionExecutor` is the extension point for custom execution strategies. ```java public interface TypedActionExecutor { Object execute( ActionMetadata action, Role role, Map parameters, Map context ); } ``` The `context` map provides runtime data: `"llm"` (LLMClient), `"agent"` (Agent reference), `"mcpToolName"` (for MCP routing). ## Executor Types ### WebServiceExecutor Handles `WEB_SERVICE` actions by making HTTP calls using OkHttp. Features: - URL template variables: `{city}`, `{id}` in endpoints are replaced from parameters - Parameter types: `QUERY` (URL params), `PATH` (URL path segments), `BODY` (JSON payload) - Authentication: `BEARER` and `BASIC` via environment variables - Custom headers via `@Header` annotations - Configurable per-action timeout ```java @ActionSpec( type = ActionType.WEB_SERVICE, endpoint = "https://api.weather.com/v1/forecast/{city}", httpMethod = HttpMethod.GET, paramType = ParamType.QUERY, auth = AuthType.BEARER, authToken = "WEATHER_API_KEY", timeout = 5000 ) @Header(key = "Accept", value = "application/json") public Object getWeather(String city) { return null; } ``` ### LLMRoleExecutor Handles `LLM` actions with a single-shot LLM call. The action's prompt comes from the method body's return value (the convention is to return a String describing what the LLM should do); the executor sends that prompt to the agent's LLM client and returns the raw response. Tool dispatch is **not** done by this executor. If the LLM emits a tool call in its response, dispatch flows through the agent's `ToolMethodDispatcher`, which is built once from the `AgentBuilder.builtInTools(...)` / `.toolPojos(...)` / `.dynamicTool(...)` registrations and shared across every `LLM` action. Per-action overrides: | `@ActionSpec` field | Effect | | ------------------- | -------------------------------------------------------------------------------------------------------- | | `llmSystemPrompt` | Prepended as the system message of the chat request, overriding the LLM client's default for this action | | `llmTemperature` | When `>= 0`, sets the chat temperature; `-1.0f` (default) means "fall back to the LLM client's default" | ### McpToolExecutor Handles `MCP_TOOL` actions by connecting to remote MCP servers. Features: - Automatic tool discovery via `tools/list` - Connection caching per endpoint - API key support via environment variables - Reflection-based MCP client creation (no compile-time dependency on tnsai-mcp) ```java @ActionSpec( type = ActionType.MCP_TOOL, serverUrl = "https://mcp.api.coingecko.com/mcp", description = "Access cryptocurrency data" ) public String cryptoData(String query) { return null; } ``` Key methods: | Method | Description | | -------------------------------------------------- | ---------------------------------- | | `discoverTools(String endpoint, String apiKeyEnv)` | Discover tools from MCP server | | `getEndpointForTool(String toolName)` | Find which endpoint handles a tool | | `close()` | Disconnect all cached MCP clients | ## ActionContract `com.tnsai.actions.contracts.ActionContract` provides optional pre/post condition validation for roles. ```java public interface ActionContract { default void validatePreConditions(ActionMetadata action, Map parameters) throws ValidationException { } default void validatePostConditions(ActionMetadata action, Object result) throws ValidationException { } default void validateInvariants() throws ValidationException { } } ``` A role implements this interface to enforce contracts: ```java public class OrderRole extends Role implements ActionContract { @Override public void validatePreConditions(ActionMetadata action, Map params) throws ValidationException { if ("placeOrder".equals(action.getName())) { if (!params.containsKey("items")) { throw new ValidationException("items parameter required"); } } } } ``` ## TypeConverter `com.tnsai.actions.TypeConverter` handles automatic parameter type conversion in action invocations. ### Supported Conversions | Source | Target Types | | --------------------- | ------------------------------------------------- | | `String` | `int`, `long`, `double`, `float`, `boolean`, Enum | | `Number` | `int`, `long`, `double`, `float` | | `Map` | Any POJO/record (via Jackson) | ```java // String to int Object result = TypeConverter.convert("42", int.class); // 42 // Number to int Object result = TypeConverter.convert(42L, int.class); // 42 // Map to POJO record UserDto(String name, int age) {} Map params = Map.of("name", "Alice", "age", 30); UserDto user = TypeConverter.convertMapToPojo(params, UserDto.class); ``` Utility methods: | Method | Description | | -------------------------------------------- | ------------------------------------------- | | `convert(Object value, Class targetType)` | Convert a value to the target type | | `convertMapToPojo(Map, Class)` | Convert a map to a POJO via Jackson | | `isPrimitiveOrWrapper(Class)` | Check if type is primitive or wrapper | | `isSimpleType(Class)` | Check if type does not need POJO conversion | Enum conversion is case-insensitive: `TypeConverter.convert("get", HttpMethod.class)` matches `HttpMethod.GET`. ## ActionRequest and ActionResponse Typed records that replace raw `Map` and untyped `Object` returns. ### ActionRequest `com.tnsai.actions.model.ActionRequest` -- immutable request record. ```java // With parameters ActionRequest request = ActionRequest.of("searchWeb", Map.of( "query", "Java frameworks", "maxResults", 10 )); // Without parameters ActionRequest request = ActionRequest.of("getStatus"); ``` Fields: `actionName` (required, non-blank), `parameters` (never null, defensively copied). ### ActionResponse `com.tnsai.actions.model.ActionResponse` -- immutable response record. ```java // Success ActionResponse response = ActionResponse.success(resultData); // Failure ActionResponse response = ActionResponse.failure("Connection timeout"); // Failure with partial result ActionResponse response = ActionResponse.failure("Partial data received", partialData); ``` Fields: `value` (result object), `success` (boolean), `error` (String, null on success). ### Usage with Agent ```java ActionRequest request = ActionRequest.of("searchWeb", Map.of("query", "TnsAI")); ActionResponse response = agent.executeAction(request); if (response.success()) { Object data = response.value(); } else { logger.error("Failed: {}", response.error()); } ``` ## Related Documentation - [Roles](/docs/agents/fundamentals/roles) -- defining roles with `@ActionSpec` annotations - [Capabilities](/docs/agents/fundamentals/capabilities) -- reusable body-less `@ActionSpec` contracts via `@Capability` interfaces - [Tools](/docs/capabilities/tools/registration) -- registering POJO toolkits the LLM can call - [Advanced Tools](/docs/capabilities/tools/registration-advanced) -- filters, listeners, and dispatcher introspection - [SPI Reference](/docs/reference/spi) -- SPI interfaces for cross-module extensibility --- # Capabilities URL: https://tnsai.dev/docs/agents/fundamentals/capabilities Description: Capabilities are reusable, body-less action contracts. A @Capability interface carries one or more @ActionSpec-annotated methods that describe what the capability does; the framework dispatches the call at runtime. A role gains the capability by implements-ing the interface — without writing any method bodies. import { Callout } from 'fumadocs-ui/components/callout' This page covers the pattern, composition, override rules, validation, and migration from the legacy `return null;` style. *Available from 0.3.1. The legacy pattern (concrete methods with `return null;` bodies) was removed in 0.5.0.* ## The Problem Before capabilities, every dispatched `@ActionSpec` method required a `return null;` body: ```java @ActionSpec( type = ActionType.LLM, description = "Summarise the text in one short paragraph", llmSystemPrompt = "You are concise.", llmTemperature = 0.2f ) public String summarize(String text) { return null; // body unused — LLMRoleExecutor handles dispatch } ``` The body is dead code: [`ActionExecutor`](/docs/agents/fundamentals/action-system) skips method invocation entirely for `LLM`/`MCP_TOOL`/`WEB_SERVICE` actions when there is no `ActionResult` parameter. Yet the method declaration forces three problems: - **Misleading signature** — static analysis marks the method as "always returns null"; downstream `@NonNull` checks inferred from this are bogus. - **Silent failure on bypass** — anyone who invokes the method outside the dispatch path (direct `role.summarize(x)`, test code, misconfigured filter) gets `null`. This looks identical to a genuine LLM failure and makes debugging painful. - **Duplicated specification** — every role that wants the same capability copy-pastes the `@ActionSpec` annotation. Drift is inevitable. ## The Pattern Move the method into a `@Capability` interface. The interface's `default` body throws `Actions.dispatchedByFramework()` — a loud, framework-owned marker that never executes in the normal dispatch path: ```java @Capability public interface Summarizer { @ActionSpec( type = ActionType.LLM, description = "Summarise the text in one short paragraph", llmSystemPrompt = "You are concise.", llmTemperature = 0.2f ) default String summarize(String text) { throw Actions.dispatchedByFramework(); } } ``` A role picks up the capability by implementing the interface — with no method bodies of its own: ```java @RoleIdentity(name = "Editor", goal = "Produce clean, readable articles") public class EditorRole extends Role implements Summarizer { // State, lifecycle, and other non-capability methods live here. // No `summarize` body — inherited from Summarizer. } ``` When the LLM calls `summarize`, `ActionExecutor` discovers the method through the capability interface and routes dispatch through the `LLM` executor — exactly as it would for a concrete `@ActionSpec` method. The default `throw` body is never executed in the normal path; it only fires if someone bypasses dispatch and invokes the method directly, producing a clear `DispatchedByFrameworkException` instead of a silent `null`. ## Composition A role can implement any number of capability interfaces. Each contributes its actions to the role: ```java @Capability public interface Translator { @ActionSpec(type = ActionType.LLM, description = "Translate the text to the target language") default String translate(String text, String targetLanguage) { throw Actions.dispatchedByFramework(); } } @Capability public interface Classifier { @ActionSpec(type = ActionType.LLM, description = "Classify input as POSITIVE / NEGATIVE / NEUTRAL") default String classifySentiment(String text) { throw Actions.dispatchedByFramework(); } } @RoleIdentity(name = "Assistant", goal = "Handle ad-hoc requests") public class AssistantRole extends Role implements Summarizer, Translator, Classifier { // Three capabilities, zero bodies. All actions ready for LLM dispatch. } ``` `ActionDiscovery` walks the role's interface chain — including super-interfaces of capabilities (`MultilingualSummarizer extends Summarizer`) — so every capability contributes its methods regardless of whether it arrives via direct implementation or transitive extension. ## Override — Role Declaration Wins If a role declares the same signature as a capability's default method, the role's version is what ends up in the discovered action list. Use this to drop dispatch entirely for a specific role and provide a deterministic local implementation: ```java public class StrictEditor extends Role implements Summarizer { // Replaces Summarizer.summarize's LLM dispatch with a deterministic local impl @Override @ActionSpec(type = ActionType.LOCAL, description = "Truncate-first summarise (deterministic)") public String summarize(String text) { return text.length() <= 60 ? text : text.substring(0, 60) + "..."; } } ``` Discovery sees `summarize` declared on the concrete class first, records its signature, and then skips the capability's default when walking the interface chain. Exactly one `summarize` action ends up in the role's metadata, and its type is `LOCAL` rather than `LLM`. ## Validation Two rules are enforced at action-discovery time; violations throw `IllegalStateException` with a message naming the offending interface and method. ### Capability methods must be `default` An abstract method on a `@Capability` interface would force every adopting role to write a body — defeating the point of the annotation. The error message points at the correct body: ```text @Capability interface com.example.Summarizer method summarize must be a default method. Abstract capability methods force every adopting role to write a body, defeating the purpose of the annotation. Use `default { throw Actions.dispatchedByFramework(); }` as the body. ``` ### Capability methods must not declare an `ActionResult` parameter Capabilities are pure dispatch. Post-processing (reading the LLM's raw response, running it through custom logic before returning) belongs on the concrete role class where per-role logic makes sense. The validator rejects the mixed case: ```text @Capability interface com.example.Summarizer method summarize must not declare an ActionResult parameter. Capabilities are pure dispatch — move post-processing (ActionResult-based) to a concrete method on the role class itself. ``` If you want post-processing, keep that specific method off the capability interface and declare it directly on the role (the legacy pattern). Capability-dispatched and role-owned methods coexist on the same role without interference. ## Migration To migrate a legacy role: 1. For each dispatched `@ActionSpec` method (LLM / MCP / WEB\_SERVICE) whose body is `return null;`, extract it to a `@Capability` interface. 2. Give the interface method a `default { throw Actions.dispatchedByFramework(); }` body. 3. On the role class, remove the original method entirely and add `implements YourCapability`. 4. Methods that have a meaningful body — typically because they declare an `ActionResult` parameter for post-processing — stay on the role class unchanged. ### Before ```java @RoleIdentity(name = "Editor", goal = "Produce clean articles") public class EditorRole extends Role { @ActionSpec(type = ActionType.LLM, description = "Summarise the text", llmSystemPrompt = "You are concise.", llmTemperature = 0.2f) public String summarize(String text) { return null; } @ActionSpec(type = ActionType.LLM, description = "Translate to the target language") public String translate(String text, String targetLanguage) { return null; } } ``` ### After ```java @Capability public interface Summarizer { @ActionSpec(type = ActionType.LLM, description = "Summarise the text", llmSystemPrompt = "You are concise.", llmTemperature = 0.2f) default String summarize(String text) { throw Actions.dispatchedByFramework(); } } @Capability public interface Translator { @ActionSpec(type = ActionType.LLM, description = "Translate to the target language") default String translate(String text, String targetLanguage) { throw Actions.dispatchedByFramework(); } } @RoleIdentity(name = "Editor", goal = "Produce clean articles") public class EditorRole extends Role implements Summarizer, Translator { // No capability bodies. } ``` The first time `Summarizer` is used by another role, the duplication problem is already solved — update the prompt once, every adopter gets the change. ## When Not to Use Capabilities - **`ActionType.LOCAL` methods** — these have real bodies that the framework invokes via reflection. They are not "framework-dispatched" and do not have the `return null;` problem. Leave them on the concrete role class. - **Methods that declare an `ActionResult` parameter** — they are validated-out of capability interfaces by design (see above). Keep them on the role. - **One-off methods used by a single role** with no prospect of reuse — extracting a capability interface for one consumer adds indirection without benefit. Capabilities shine when two or more roles share the same `@ActionSpec`. ## Related - [Action System](/docs/agents/fundamentals/action-system) — routing, `ActionType` enum, executor types. - [Roles](/docs/agents/fundamentals/roles) — role identity, responsibilities, lifecycle. ## Implementation References - `com.tnsai.capabilities.Capability` — the marker annotation (`@Target(TYPE)`, `@Retention(RUNTIME)`). - `com.tnsai.actions.Actions.dispatchedByFramework()` — helper returning `DispatchedByFrameworkException` (subtype of `UnsupportedOperationException`). - `com.tnsai.actions.ActionDiscovery` — two-pass discovery: role class first, then the capability interface chain. --- # Event System URL: https://tnsai.dev/docs/agents/fundamentals/events Description: The event system provides full observability into the agent lifecycle. Events use a sealed interface hierarchy with 20+ event types, enabling type-safe pattern matching. import { Callout } from 'fumadocs-ui/components/callout' ## Subscribing to Events To listen to what your agent is doing, pass an event callback to `chatWithEvents`. The callback receives every event the agent fires during the run, and you can use Java's pattern matching to handle only the ones you care about. ```java String response = agent.chatWithEvents("Do research on AI", event -> { switch (event) { case RunStartEvent e -> log.info("Agent run started"); case ActionStartEvent e -> log.info("Action: {}", e.actionName()); case ToolCallStartEvent e -> log.info("Tool: {}", e.toolName()); case ToolCallEndEvent e -> log.info("Result: {}", e.result()); case ErrorEvent e -> log.error("Error: {}", e.message()); case RunEndEvent e -> log.info("Run completed"); default -> {} // Other events } }); ``` ## Event Types Events are grouped into categories based on what part of the agent they relate to. Each event carries contextual data you can inspect in your handler. ### Lifecycle Events These events tell you when the agent starts and stops processing, and when it transitions between states. Use them for logging, timing, or coordinating external systems. | Event | When | | ------------------------ | --------------------------------- | | `RunStartEvent` | Agent begins processing a message | | `RunEndEvent` | Agent finishes processing | | `AgentStateChangedEvent` | Agent state transitions | ### Action Events Actions are the high-level steps an agent takes (for example, "search the web" or "write a file"). These events let you track when each action starts and finishes. | Event | When | | ------------------ | -------------------------- | | `ActionStartEvent` | Action execution begins | | `ActionEndEvent` | Action execution completes | ### Tool Events Tools are the concrete functions an agent can call (like an HTTP client or a file reader). These events fire each time a tool is invoked, so you can monitor tool usage, measure latency, or build dashboards. | Event | When | | -------------------- | ------------------------- | | `ToolCallStartEvent` | Tool invocation begins | | `ToolCallEndEvent` | Tool invocation completes | ### Communication Events These events cover messages flowing to and from the agent, as well as any custom events you emit yourself using the `@EventEmitter` annotation. | Event | When | | ------------------- | ---------------------------------------- | | `MessageEvent` | Agent sends or receives a message | | `EventEmitterEvent` | Custom event emitted via `@EventEmitter` | ### Error Events When something goes wrong during a run, these events let you react immediately. `ErrorEvent` signals a hard failure, while `WarningEvent` signals a recoverable issue the agent can continue past. | Event | When | | -------------- | --------------------------------- | | `ErrorEvent` | An error occurs during processing | | `WarningEvent` | A non-fatal issue is detected | ## Event Publisher If you need to fire your own events from inside agent roles or custom logic, grab the publisher from the agent and call `publish`. Any registered handler will receive your custom event just like a built-in one. ```java TnsAIEventPublisher publisher = agent.getEventPublisher(); publisher.publish(new CustomEvent("data")); ``` ## Event Handler Registration Instead of handling all events in one big switch block, you can register a handler for a single event type. This is useful for focused concerns like metrics collection or audit logging. ```java EventHandlerRegistry registry = agent.getEventHandlerRegistry(); registry.register(ToolCallStartEvent.class, event -> { metrics.increment("tool.calls"); }); ``` ## Annotation-Based Handlers For the cleanest approach, annotate a method with `@EventHandler` and TnsAI will wire it up automatically. No manual registry calls needed -- just declare the event type you want and write your logic. ```java @EventHandler(ToolCallEndEvent.class) public void onToolComplete(ToolCallEndEvent event) { log.info("Tool {} took {}ms", event.toolName(), event.duration()); } ``` --- # Fundamentals URL: https://tnsai.dev/docs/agents/fundamentals Description: The core moving parts of a single agent. This page covers the Agent class itself — construction, chat, memory, lifecycle. See the other pages in this section for Roles, the Action System, Capabilities, and Events. import { Callout } from 'fumadocs-ui/components/callout' An `Agent` is the top-level orchestrator in TnsAI. It owns an LLM client, one or more roles, a memory store, and an event system. Agents handle the full chat loop: receiving a message, consulting their roles for available actions, calling the LLM, executing tool calls, and returning a response. ## Quick Start The fastest way to create an agent is with `AgentBuilder`: ```java Agent agent = AgentBuilder.create() .llm(LLMClientFactory.create("openai", "gpt-4o", 0.7f)) .role(RoleBuilder.create() .name("Assistant") .goal("Help users with their questions") .build()) .build(); String response = agent.chat("What is BDI architecture?"); ``` For more control, extend the `Agent` class directly: ```java @AgentSpec(name = "ResearchAgent", description = "Conducts research") public class ResearchAgent extends Agent { @Override protected LLMClient getLLM() { return LLMClientFactory.create("anthropic", "claude-sonnet-4-20250514", 0.7f); } @Override protected List getRoles() { return List.of(Role.create(ResearchRole.class)); } } ``` ## Creating Agents There are two ways to create an agent: programmatically with `AgentBuilder`, or declaratively by extending the `Agent` class and using annotations. Use the builder when you want quick, inline setup. Use annotations when you want a reusable agent class with its configuration baked in. ### With AgentBuilder (programmatic) `AgentBuilder` lets you configure an agent in a single fluent chain. This is the best approach for simple agents or when you want to assemble an agent dynamically at runtime. ```java Agent agent = AgentBuilder.create() .id("agent-001") .llm(new OpenAIClient("gpt-4o")) .role(myRole) .roles(List.of(role1, role2)) .builtInTools(BuiltInTool.WEB_SEARCH_TOOLS, BuiltInTool.UTILITY_TOOLS) .toolPojos(new MyDomainTools()) .memoryStore(new InMemoryStore()) .maxContextTokens(8192) .build(); ``` ### With Annotations (declarative) If you prefer a class-per-agent design, extend `Agent` and use `@AgentSpec` and `@LLMSpec` annotations. This keeps configuration next to the code and makes agents easy to discover in your project. ```java @AgentSpec(name = "Analyst", description = "Data analysis agent") @LLMSpec(provider = "openai", model = "gpt-4o", temperature = 0.3f) public class AnalystAgent extends Agent { @Override protected List getRoles() { return List.of(Role.create(AnalystRole.class)); } } ``` ## Chat Methods Once you have an agent, you interact with it through chat methods. TnsAI provides several variants depending on whether you need conversation history, streaming output, or visibility into tool calls happening inside the agent loop. ```java // Simple chat — single turn, uses conversation history String response = agent.chat("Explain quantum computing"); // Chat without history String response = agent.chat("Translate this to French", false); // Streaming — returns tokens as they arrive Stream tokens = agent.streamChat("Write a poem about Java"); tokens.forEach(System.out::print); // Event-driven chat — full visibility into the agent loop String response = agent.chatWithEvents("Research AI safety", event -> { switch (event) { case ToolCallStartEvent e -> System.out.println("Calling: " + e.toolName()); case ToolCallEndEvent e -> System.out.println("Result: " + e.result()); case ErrorEvent e -> System.err.println("Error: " + e.message()); default -> {} } }); ``` ## Memory Management Agents automatically track conversation history so the LLM has context across turns. You can also inspect, modify, or prune this history directly when you need to manage token usage or reset a conversation. ```java // Get conversation history List> history = agent.getConversationHistory(); // Clear all history agent.clearHistory(); // Add a message manually agent.addToHistory("user", "Remember this context"); // Prune memory to fit within a token limit (removes oldest messages first) agent.getMemoryStore().prune(4096); ``` ## Lifecycle Agents have a start/stop lifecycle. Call `start()` to initialize the agent and `shutdown()` to release its resources. You can check whether an agent is active or inspect its health state at any time. ```java agent.start(); boolean running = agent.isRunning(); AgentHealthState health = agent.getHealthState(); agent.shutdown(); ``` ## Configuration Summary This table lists every property you can set on an agent through `AgentBuilder`. Only `llm` and at least one `role` are required; everything else has sensible defaults. | Property | Builder Method | Default | Description | | ----------------- | --------------------------------- | ---------------- | ---------------------------------------------------------------- | | ID | `.id(String)` | Auto-generated | Unique agent identifier | | LLM | `.llm(LLMClient)` | Optional | Language model client (omit for traditional agents) | | Roles | `.role(Role)` | Required | Agent roles | | Built-in toolkits | `.builtInTools(BuiltInTool...)` | Empty | Shipped POJO toolkits from `tnsai-tools` | | Custom toolkits | `.toolPojos(Object...)` | Empty | Your own POJOs with `@Tool` methods | | Runtime tools | `.dynamicTool(DynamicToolMethod)` | Empty | Tools whose identity is only known at runtime (e.g. MCP proxies) | | Memory | `.memoryStore(MemoryStore)` | `InMemoryStore` | Conversation memory | | Context limit | `.maxContextTokens(int)` | Provider default | Max context window | | Knowledge base | `.knowledgeBase(KnowledgeBase)` | None | RAG knowledge source | | Prompt strategy | `.promptStrategy(PromptStrategy)` | Default | Prompt enhancement | | Reasoning | `.reasoningStrategy(String)` | None | Reasoning strategy name | ## SPI Extension Points The Core module defines SPI interfaces that other modules implement. Extensions are discovered automatically via `ServiceLoader`: | SPI Interface | Purpose | Implementing Module | | ---------------------- | ---------------------------------- | ------------------- | | `MessageBroker` | Agent communication routing | Coordination | | `ResilienceStrategy` | Resilience pattern implementations | Quality | | `CognitiveModel` | Cognitive processing models | Intelligence | | `CheckpointerFactory` | State checkpointing | Custom | | `CheckpointerProvider` | Checkpoint storage backends | Custom | Register an SPI implementation by adding a file to `META-INF/services/`: ``` # META-INF/services/com.tnsai.spi.MessageBroker com.example.MyCustomMessageBroker ``` ## Next in this Section - [Roles](/docs/agents/fundamentals/roles) — Bundling capabilities into `Role` classes. - [Action System](/docs/agents/fundamentals/action-system) — `@ActionSpec` routing and executor types. - [Capabilities](/docs/agents/fundamentals/capabilities) — Reusable body-less action contracts via `@Capability` interfaces. - [Events](/docs/agents/fundamentals/events) — The agent's lifecycle event bus. --- # Roles URL: https://tnsai.dev/docs/agents/fundamentals/roles Description: A Role defines what an agent can do. Each role has an identity (name, goal, domain), a set of responsibilities, and discoverable actions. Roles generate the system prompt that instructs the LLM. Actions are methods annotated with @ActionSpec — they are discovered at runtime via reflection and routed to one of four executor types. import { Callout } from 'fumadocs-ui/components/callout' ## Creating Roles There are two ways to create a role: programmatically with `RoleBuilder`, or declaratively with annotations. Pick whichever style fits your project -- they produce the same result. ### With RoleBuilder (programmatic) Use `RoleBuilder` when you want to define a role inline -- for example in tests, scripts, or when the role configuration is loaded dynamically at runtime. ```java Role role = RoleBuilder.create() .name("Researcher") .goal("Find and synthesize information from academic sources") .domain("academic-research") .duty("Search papers", "Users need access to recent research") .duty("Summarize findings") .mustNever("Fabricate citations", "Academic integrity") .mustAlways("Include source references", "Traceability") .llm(new AnthropicClient("claude-sonnet-4-20250514")) .build(); ``` ### With Annotations (declarative) Use `@RoleSpec` when you want the role definition to live directly on the class. This is the preferred approach for production roles because everything -- name, capabilities, LLM config -- is visible at a glance. ```java @RoleSpec( name = "Researcher", description = "Finds and synthesizes academic information", beliefs = {"research_context", "available_sources"}, desires = {"find_papers", "synthesize_findings"}, intentions = {"search_database", "analyze_paper"}, capabilities = {"search", "analysis", "summarization"}, domains = {"academic", "research"}, responsibilities = { @Responsibility( name = "Paper Search", description = "Search academic databases", actions = {"searchPapers", "filterResults"} ) }, llm = @LLMConfig(provider = "anthropic", model = "claude-sonnet-4-20250514") ) public class ResearchRole extends Role { @Override public RoleIdentity getIdentity() { return new RoleIdentity("Researcher", "Find papers", "academic"); } @Override public List getResponsibilities() { return List.of( new CoreDuty("Search papers", "Find relevant research"), new CoreDuty("Analyze findings", "Extract key insights") ); } } ``` ## Actions An `Action` is a method annotated with `@ActionSpec` on a Role class. Actions are routed to one of four executor types based on their `ActionType`: | Type | Executor | Description | | ------------- | --------------------- | -------------------------------------------------------------------------- | | `LOCAL` | `TypedActionExecutor` | Direct method invocation via reflection | | `WEB_SERVICE` | `WebServiceExecutor` | HTTP REST API calls | | `LLM` | `LLMRoleExecutor` | Single-shot LLM call; tool dispatch via the agent's `ToolMethodDispatcher` | | `MCP_TOOL` | `McpToolExecutor` | Model Context Protocol tools | ### Defining Actions Annotate any method on your Role class with `@ActionSpec` to expose it as an action. The `type` field tells the framework which executor handles the call. ```java @ActionSpec( name = "searchPapers", description = "Search for academic papers on a topic", type = ActionType.LLM ) public String searchPapers(@LLMParam("The search query") String query) { // Implementation } ``` ## ActionResult When an action delegates to an external system (HTTP call, LLM tool, MCP tool), the framework executes the call and makes the raw result available as an `ActionResult`. There are two usage patterns: ### Pure Delegate (Abstract, No Body) If the method has no body (abstract or the framework handles it entirely), the framework executes the action and returns the result directly. No `ActionResult` parameter is needed: ```java @ActionSpec( type = ActionType.WEB_SERVICE, endpoint = "https://api.example.com/users/{id}" ) public abstract Object getUser(String id); ``` ### Post-Process with ActionResult Add an `ActionResult` parameter to receive the raw execution result and transform it before returning: ```java @ActionSpec( type = ActionType.WEB_SERVICE, endpoint = "https://api.example.com/data/{id}" ) public Object getData(String id, ActionResult result) { // Return as-is return result; // Or extract a field Map json = result.asMap(); return json.get("name"); } ``` ### ActionResult API `ActionResult` wraps the raw value returned by the external system and provides convenience methods for common conversions like JSON parsing and type deserialization. | Method | Return type | Description | | -------------- | --------------------- | --------------------------------------------------------- | | `getValue()` | `Object` | Raw result value | | `asString()` | `String` | Value as String (JSON-serialized if not already a String) | | `asMap()` | `Map` | Value as Map (parsed from JSON if needed) | | `asList()` | `List