Agentic π Day

3.14 Hours of Agentic AI | Unscripted Building & AMA

March 14, 2026 • 3:14 PM • Strands SDK • AWS

A Little Background...

No slides. No scripts. No fluff. That was the plan.

I posted on LinkedIn on March 10th around noon, announcing a live, unscripted session:

          For exactly 3.14 hours, we're going LIVE to build a real agentic AI product from scratch:
          architecture decisions, debugging, the whole unfiltered journey. Open Q&A the entire time.
        

The only thing scripted? The start time. 3:14 PM. Because... π.

Then...

110+ people signed up. (and counting, as of 24 hours before the event)

          So prepared these notes, not as a script, but as a safety net. The live session stays messy and real. These
          materials ensure everyone walks away with complete code, concepts, and references.
        

Agenda

Time	Module	What We Build
0:00	Setup & Intro	Environment ready + concepts
0:20	Module 1: First Agent	Basic FAQ agent
0:45	Module 2: Tools & MCP	Agent with tools + MCP server
1:20	Module 3: Memory	Persistent memory + context mgmt
1:50	Module 4: Multi-Agent	Triage + specialist agents
2:20	Module 5: Evals & Safety	Eval suite + guardrails + OTel
2:50	Module 6: Deploy	Live on AWS AgentCore

What We're Building

SupportBot: AI Customer Support Agent

          A multi-agent system that triages customer requests and routes them to specialist agents (billing, technical,
          returns), with tools, memory, safety guardrails, and deployed on AWS.
        

Strands SDK Amazon Bedrock MCP AgentCore OpenTelemetry

What is Agentic AI?

AI systems that can autonomously perceive, reason, act, and learn

	Chatbot	Copilot	Agent
Reasoning	Single-turn	Multi-turn	Multi-step planning
Actions	Reply only	Suggest	Execute tools
Memory	None	Session	Persistent
Autonomy	None	Low	High

The Agentic Loop

flowchart TD
    A["User Prompt"] --> B["🧠 REASON\nLLM thinks about the task"]
    B --> C{"DECIDE"}
    C -->|"Done"| D["✅ RESPOND\nReturn final answer"]
    C -->|"Need more info"| E["🔧 ACT\nCall a tool"]
    E --> F["👁️ OBSERVE\nSee tool results"]
    F --> B

This Reason → Act → Observe loop is the foundation of all agents

Core Components of an Agent

🧠 Model (The Brain)

LLM for reasoning & planning
Amazon Bedrock + Claude Sonnet

📝 System Prompt (The Job)

Defines personality, responsibilities, constraints

🔧 Tools (The Hands)

Python functions the agent can call
APIs, databases, MCP servers

💾 Memory (The Recall)

Session, persistent, episodic memory

🎭 Orchestration (The Coordination)

Multi-agent patterns: triage, swarm, graph

🛡️ Guardrails (The Safety)

Input/output validation, evals, observability

Strands Agents SDK

Open-source, model-driven agent framework from AWS

from strands import Agent, tool

@tool
def lookup_order(order_id: str) -> dict:
    """Look up a customer order by ID."""
    return database.get_order(order_id)

agent = Agent(
    system_prompt="You are SupportBot...",
    tools=[lookup_order],
)

response = agent("What's the status of order ORD-10001?")

Used in production by Amazon Q Developer, AWS Glue, and more

Agent Architecture Patterns

Pattern	How It Works	Best For
ReAct	Reason → Act → Observe loop	General purpose (default)
Plan-and-Execute	Plan all steps, then execute	Complex multi-step tasks
Reflexion	Generate → Critique → Refine	Quality-sensitive outputs
ReWOO	Separate plan/execute/solve	Parallel tool execution

Strands uses ReAct by default. The LLM naturally reasons and selects
          tools in a loop.

Multi-Agent Patterns

Agents-as-Tools ✅

flowchart TD
    T["🎯 Triage"] --> B["💰 Billing"]
    T --> Tech["🔧 Technical"]
    T --> R["📦 Returns"]

Swarm

flowchart TD
    A["Agent A"] -->|handoff| B["Agent B"]
    B -->|handoff| C["Agent C"]
    C -->|handoff| A

Graph

flowchart TD
    A --> B & C --> D

Today: Agents-as-Tools, where a triage agent delegates to specialist agents

Model Context Protocol (MCP)

Open standard for agent ↔ tool communication

Think of MCP as "USB-C for AI tools". Any MCP server works with any MCP client

from fastmcp import FastMCP

mcp = FastMCP(name="Product Catalog")

@mcp.tool()
def get_product(sku: str) -> dict:
    """Get product details by SKU."""
    return products[sku]

Transport: stdio (local) | HTTP (remote) | SSE (streaming)

Hierarchical Context Management

flowchart LR
    subgraph L1["🟢 L1: Always Loaded"]
        A1["System Prompt"] ~~~ A2["Tool Defs"] ~~~ A3["Conversation"]
    end
    subgraph L2["🟡 L2: On-Demand"]
        B1["FAQ"] ~~~ B2["Memory"] ~~~ B3["Policies"]
    end
    subgraph L3["🔵 L3: External APIs"]
        C1["Catalog"] ~~~ C2["Orders"] ~~~ C3["Tickets"]
    end
    L1 --> L2 --> L3

Skills.md Pattern: Frontmatter (name + description) always visible.
          Full instructions loaded only when the skill is activated.

Agent Evaluations

Agents are non-deterministic: same input, different (valid) outputs

Dimension	What to Measure	Method
Task Completion	Did it solve the problem?	Keywords + LLM Judge
Tool Selection	Right tools called?	Trace analysis
Safety	No harmful outputs?	Guardrail checks
Reasoning	Logical path?	LLM-as-Judge
Efficiency	Tokens / latency	OTEL metrics

Best practice: Automated evals (CI) + LLM-as-Judge (periodic) + Human review (calibration)

Observability with OpenTelemetry

gantt
    title Agent Request Trace (1.2s total)
    dateFormat X
    axisFormat %L ms
    section Model
    model_inference (150 tokens)    :0, 400
    model_inference (200 tokens)    :410, 800
    section Tools
    tool_call lookup_order          :400, 410
    section Response
    streaming response              :800, 1200

Strands emits OTEL-compliant spans → CloudWatch, Datadog, Jaeger

Amazon Bedrock AgentCore

Managed platform for deploying AI agents at scale

Service	What It Does
Runtime	Serverless hosting (up to 8h sessions)
Gateway	Convert APIs → MCP tools
Identity	Auth (Cognito, Okta, Entra ID)
Memory	Persistent + episodic memory
Policy	Natural language guardrails
Evaluations	13 pre-built eval metrics

agentcore configure -e app.py
agentcore launch
agentcore invoke '{"prompt": "Help me with my order"}'

SupportBot Architecture

flowchart LR
    User["👤 User"] --> Triage["🎯 Triage Agent"]
    Triage --> Billing["💰 Billing"]
    Triage --> Technical["🔧 Technical"]
    Triage --> Returns["📦 Returns"]
    subgraph Tools["Shared Tools"]
        T1["Order Lookup"] ~~~ T2["KB Search"] ~~~ T3["Tickets"] ~~~ T4["MCP Catalog"]
    end
    Billing --> Tools
    Technical --> Tools
    Returns --> Tools

Let's Build! 🚀

          Module 1: Your First Agent

          python module_01_first_agent/agent.py

Open the workshop docs for step-by-step instructions

Each module builds on the previous one, and by the end, you'll have a production-ready multi-agent system.

MODULE 2

Custom Tools & MCP

Extending agents with tools and the Model Context Protocol

@tool
def lookup_order(order_id: str) -> dict:
    """Look up a customer order."""
    return ORDERS.get(order_id)

Tools turn agents from "answering" to "doing"

MODULE 3

Memory & Context

Making agents remember and managing the context window

Conversation memory: current session (built-in)
Persistent memory: across sessions (custom store)
Hierarchical context: L1/L2/L3 loading
Skills.md pattern: modular, on-demand context

MODULE 4

Multi-Agent Patterns

Triage agent routing to specialist agents

@tool
def route_to_billing(query: str) -> str:
    """Route to Billing Specialist."""
    response = billing_agent(query)
    return response.message.content[0]["text"]

triage_agent = Agent(
    tools=[route_to_billing, route_to_technical, route_to_returns]
)

MODULE 5

Evals, Safety & Observability

Testing, protecting, and monitoring your agents

# Run evaluation suite
python module_05_evals/eval_suite.py --eval

# Chat with guardrails
python module_05_evals/eval_suite.py --chat

# Enable OpenTelemetry tracing
python module_05_evals/eval_suite.py --chat --otel

MODULE 6

Deploy to AWS AgentCore

Production deployment on Amazon Bedrock AgentCore

# Configure deployment
agentcore configure -e module_06_deploy/app.py

# Launch on AgentCore Runtime
agentcore launch

# Invoke the deployed agent
agentcore invoke '{"prompt": "What is your return policy?"}'

What You Built Today

✅ Single agent with system prompt (Module 1)
✅ Custom tools + MCP server (Module 2)
✅ Persistent memory + context management (Module 3)
✅ Multi-agent triage system (Module 4)
✅ Evaluation suite + safety guardrails (Module 5)
✅ Deployed agent on AWS AgentCore (Module 6)

From a simple chatbot → production multi-agent system in 3 hours

Next Steps

📚 Explore Strands Agents docs
🔬 Try more examples
🏗️ Build with AgentCore services
🤝 Join the community
📖 Read about agent evals best practices

Thank You! 🎉

Workshop materials available at the docs site