Caylent Catalysts™
Generative AI Strategy
Accelerate your generative AI initiatives with ideation sessions for use case prioritization, foundation model selection, and an assessment of your data landscape and organizational readiness.
Learn how to build and deploy a secure, scalable RAG chatbot using Amazon Bedrock AgentCore Runtime, Terraform, and managed AWS services.
As AI rapidly moves from experimentation to production, teams face an increasing number of architectural decisions that directly shape long-term outcomes. While most decisions are reversible "two-way doors" open to iteration, your infrastructure choice is a one-way door that defines how far your application can scale, how securely it operates, and how reliably it serves users. Enter Amazon Bedrock AgentCore, AWS's new foundation for building and deploying intelligent agents at production scale.
Before diving in, it’s worth clarifying an important distinction: Amazon Bedrock AgentCore is not responsible for your agent’s orchestration or business logic. Instead, it provides the managed runtime infrastructure that executes that logic – handling scaling, isolation, networking, and security so you can focus on how your agent thinks and behaves.
In this hands-on guide, we'll deploy an AI agent to AgentCore Runtime to orchestrate a Retrieval-Augmented Generation (RAG) chatbot workflow with user authentication and streamed responses. This example illustrates the core architectural concepts involved in building and operating AI agents on AWS.
Along the way, we'll also explore the trade-offs involved in architectural decisions for production-level chatbots.
Amazon Bedrock AgentCore is an agentic platform for building, deploying, and operating AI agents securely at scale across any framework, model, or protocol, with no infrastructure management required. Its modular services include Runtime, Memory, Gateway, Identity, Browser, and Observability, which you can use together or independently for your agent workloads.
At the heart of this platform is AgentCore Runtime, a secure, serverless execution environment purpose-built to host and scale AI agents and tools without requiring the provisioning or tuning of compute resources.
AgentCore Runtime’s serverless model lets you focus on your agent’s logic instead of infrastructure. There is no need to configure auto-scaling groups, monitor CPU or memory metrics, or reserve capacity in advance. The service automatically scales based on load, provides session isolation and extended runtimes, and abstracts away the undifferentiated heavy lifting of agent hosting.
You also pay only for the active resources you consume. Idle time spent waiting for large language model responses or external context retrieval is not counted toward the final cost. Compared with services that charge for pre-allocated resources, such as Amazon EC2 or Amazon ECS, this model can significantly reduce overall compute costs for agent-based applications.
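To make the billing difference concrete, here's a back-of-envelope sketch. The hourly rate, session count, and per-session active time below are made-up numbers for illustration, not actual AWS pricing:

```python
# Hypothetical comparison of always-on compute vs. pay-for-active-time billing.
# The rate and utilization figures are illustrative assumptions, not AWS prices.
def monthly_compute_cost(rate_per_hour: float, billed_hours: float) -> float:
    return rate_per_hour * billed_hours

HOURS_PER_MONTH = 730
provisioned = monthly_compute_cost(0.05, HOURS_PER_MONTH)  # instance billed 24/7

# Suppose the agent handles 10,000 sessions/month, each consuming ~6 seconds of
# active compute; time spent waiting on LLM responses is not billed.
active_hours = 10_000 * 6 / 3600
serverless = monthly_compute_cost(0.05, active_hours)

print(f"always-on: ${provisioned:.2f}/month, active-only: ${serverless:.2f}/month")
```

The gap widens as the fraction of each session spent waiting on model responses grows.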
Beyond AgentCore Runtime, Amazon Bedrock AgentCore offers several other services that can facilitate AI agent development and integrate with AgentCore Runtime, such as Memory, Gateway, Identity, Browser, and Observability.
If you are using Amazon Cognito User Pools to authenticate users, you can integrate JWT Token Authentication to secure your application. Learn more about Amazon Bedrock AgentCore here.
Production RAG systems must support reliable semantic search, frequent document updates, and secure access to knowledge sources, while keeping operational complexity low. The vector store should scale with growing data volumes without requiring ongoing infrastructure management.
For this implementation, we're using Amazon Bedrock Knowledge Base with Amazon S3 Vectors as the vector store. This combination provides managed ingestion and semantic retrieval while keeping operational overhead low.
To perform the type of semantic search required for RAG, we first use a specialized model to compute vector embeddings for the documents we want to retrieve, and then store those embeddings in a vector database. These embeddings allow user queries to be compared based on meaning rather than exact keyword matches.
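The core retrieval idea can be sketched in a few lines. The toy 3-dimensional vectors below stand in for real 1,024-dimensional Titan embeddings, and cosine similarity is used only because it is easy to illustrate (the S3 Vectors index in this guide is actually configured with a euclidean distance metric):

```python
# Minimal sketch of embedding-based retrieval: rank stored documents by
# similarity to a query vector. Vectors and the "pretend" query embedding
# are made up for illustration.
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

store = {
    "refund policy": [0.9, 0.1, 0.0],
    "office hours":  [0.1, 0.8, 0.2],
}
query = [0.8, 0.2, 0.1]  # pretend embedding of "how do I get my money back?"

best = max(store, key=lambda doc: cosine_similarity(store[doc], query))
print(best)  # the semantically closest document, despite no keyword overlap
```

The point is that "money back" can match "refund policy" by meaning, which keyword search would miss.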
Amazon Titan Text Embeddings is AWS’s native embedding model family designed for high-quality semantic retrieval across a wide range of text workloads. For this implementation, we’re using Amazon Titan Text Embeddings V2 due to its improved retrieval performance, larger token input size of up to 8,192 tokens, and lower cost compared to alternatives such as Cohere’s embedding models.
Amazon Bedrock Knowledge Base offers multiple chunking strategies, including standard, hierarchical, semantic, and multimodal.
For RAG applications, semantic chunking is typically the best fit. Instead of splitting documents based on layout or fixed token counts, it groups content by meaning, which improves retrieval accuracy and helps ensure the model receives context that is actually relevant to the user’s question. This deeper semantic alignment is especially important for conversational workloads, and it’s why AWS recommends semantic chunking as the default approach for RAG-based chatbots.
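The breakpoint idea behind semantic chunking can be illustrated with a simplified sketch. Real implementations compare sentence embeddings against a percentile threshold (as in the `breakpoint_percentile_threshold` setting used later in this guide); here word overlap (Jaccard) stands in for embedding similarity so the example runs anywhere:

```python
# Simplified illustration of semantic chunking: start a new chunk whenever the
# similarity between adjacent sentences drops below a breakpoint threshold.
# Jaccard word overlap is a stand-in for real embedding similarity.
def jaccard(a: str, b: str) -> float:
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / len(wa | wb)

def semantic_chunks(sentences, threshold=0.15):
    chunks, current = [], [sentences[0]]
    for prev, sent in zip(sentences, sentences[1:]):
        if jaccard(prev, sent) < threshold:  # semantic "breakpoint"
            chunks.append(current)
            current = []
        current.append(sent)
    chunks.append(current)
    return chunks

doc = [
    "Our refund policy covers all purchases",
    "Refund requests for purchases take five days",
    "The office is open on weekdays",
]
print(semantic_chunks(doc))  # the two refund sentences group; office hours split off
```

Fixed-size chunking could easily cut between the two refund sentences; grouping by meaning keeps related context together for retrieval.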
Anthropic Claude Haiku 4.5 is a lightweight, high-performance foundation model designed for low-latency, cost-efficient conversational and agentic workloads. It strikes a strong balance between speed, reasoning capability, and operational cost, making it well-suited for production chatbot deployments.
We're using Anthropic Claude Haiku 4.5 as the foundation model because it delivers near-frontier performance at a fraction of the cost of larger models like Sonnet. It performs particularly well for chatbots that require fast responses, reliable reasoning, and consistent use of agentic tools. To learn more, read our deep dive into Claude Haiku 4.5.
Future Considerations: As LLMs evolve, you may need to update your model to leverage improved reasoning, higher-quality training data, and better tool-use capabilities.
To deploy your AI agent, you'll need an AWS account with access to the relevant Amazon Bedrock models, plus Terraform, Docker, and the AWS CLI installed and configured.
Clone the repository: https://github.com/caylent/agentcore-blog.
This tutorial creates resources in us-east-1 to leverage Bedrock Data Automation.
The main entry point for the agent invocation is agent/app.py:
@app.entrypoint
def invoke_agent(payload):
    ...
    try:
        for chunk, metadata in agent.stream(
            initial_state,
            stream_mode="messages",
        ):
            if metadata.get("langgraph_node") == "generate_answer":
                yield from __process_stream_chunk(chunk)
    except Exception as exc:
        app.logger.error("Streaming agent response failed")
        yield {
            "type": "error",
            "text": "Something went wrong while streaming the response.",
            "error_details": str(exc),
        }
        return
The agent accepts a payload with user input and conversation history:
{
  "prompt": "Hello!",
  "conversation_history": []
}
Agent orchestration logic defines how the agent reasons about user input, selects tools, and determines when to retrieve external knowledge. This logic is implemented entirely within the application code and is not managed by Amazon Bedrock AgentCore. AgentCore provides the execution environment for the agent, but the orchestration flow remains your responsibility.
In this implementation, the agent decides whether to call the knowledge base retriever tool or respond directly. This decision is guided by system prompts defined in agent/prompts.py:
If retrieval is needed, the knowledge base retriever tool (agent/RetrieverTool.py) is invoked with an LLM-generated query, and the retrieved documents are added as context for the final response.

The orchestration graph is defined in agent/RetrieverAgent.py:
class RetrieverAgent:
    ...
    def get_agent_graph(self):
        workflow = StateGraph(AgentState)
        workflow.add_node("generate_query", self.__generate_query)
        workflow.add_node("retrieve", ToolNode([knowledge_base_retriever]))
        workflow.add_node("generate_answer", self.__generate_answer)
        workflow.add_edge(START, "generate_query")
        workflow.add_conditional_edges(
            "generate_query",
            tools_condition,
            {
                "tools": "retrieve",
                END: "generate_answer",
            },
        )
        workflow.add_edge("retrieve", "generate_answer")
        workflow.add_edge("generate_answer", END)
        return workflow.compile()
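The conditional edge is the interesting part: `tools_condition` inspects the model's latest message and routes to the retrieval node only if the model requested a tool call. The sketch below is a simplified stand-in for LangGraph's helper (not the real import), using plain dicts to show the routing logic:

```python
# Simplified stand-in for LangGraph's tools_condition, for illustration only:
# route to the "tools" node if the last message requested a tool call,
# otherwise fall through to answer generation.
END = "__end__"

def tools_condition(state: dict) -> str:
    last_message = state["messages"][-1]
    return "tools" if last_message.get("tool_calls") else END

wants_retrieval = {"messages": [
    {"content": "", "tool_calls": [{"name": "knowledge_base_retriever"}]}
]}
direct_answer = {"messages": [{"content": "Hello!", "tool_calls": []}]}

print(tools_condition(wants_retrieval))  # tools
print(tools_condition(direct_answer))    # __end__
```

In the real graph, "tools" maps to the `retrieve` node and `END` maps to `generate_answer`, so greetings skip retrieval entirely.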
Step 1: Initialize Configuration
cp infra/example.tfvars infra/terraform.tfvars
Fill out the values in terraform.tfvars:
# infra/terraform.tfvars
region = "us-east-1"
profile = "" # AWS profile name from 'aws configure sso', leave empty for env variables
tags = {
  project = "agentcore-test"
}
ecr_repository_name = "" # unique name for the ecr repository
...
Step 2: Initialize Terraform
cd infra
terraform init

This initializes:
# infra/providers.tf
terraform {
  required_version = ">= 1.14.3"
  required_providers {
    aws = {
      source  = "hashicorp/aws"
      version = ">= 6.28.0"
    }
    awscc = {
      source  = "hashicorp/awscc"
      version = ">= 1.68.0"
    }
  }
}
The repository includes two sample documents at kb/. Upload these to an Amazon S3 bucket, then update terraform.tfvars:
# infra/terraform.tfvars
...
data_source_bucket_arn = "arn:aws:s3:::<name-of-your-s3-bucket>"
Now you are ready to deploy the infrastructure to AWS:
terraform plan # Verify configuration
terraform apply # Approve deployment
Defines the knowledge base, data source, and Amazon S3 vector index:
...
resource "aws_s3vectors_vector_bucket" "vector_bucket" {
  vector_bucket_name = "agentcore-test-vector-bucket"
}

resource "aws_s3vectors_index" "vector_index" {
  index_name         = "agentcore-test-vector-index"
  vector_bucket_name = aws_s3vectors_vector_bucket.vector_bucket.vector_bucket_name
  data_type          = "float32"
  dimension          = 1024
  distance_metric    = "euclidean"

  metadata_configuration {
    non_filterable_metadata_keys = [
      "AMAZON_BEDROCK_TEXT",
      "AMAZON_BEDROCK_METADATA"
    ]
  }
}

resource "aws_s3_bucket" "multimodal_output_bucket" {
  bucket        = "agentcore-test-multimodal-output-bucket"
  force_destroy = true
}

resource "aws_bedrockagent_knowledge_base" "knowledge_base" {
  name        = "agentcore-test-knowledge-base"
  description = "Test knowledge base for AgentCore"
  role_arn    = aws_iam_role.bedrock_kb_role.arn

  knowledge_base_configuration {
    type = "VECTOR"
    vector_knowledge_base_configuration {
      embedding_model_arn = "arn:aws:bedrock:${var.region}::foundation-model/amazon.titan-embed-text-v2:0"
      embedding_model_configuration {
        bedrock_embedding_model_configuration {
          dimensions          = 1024
          embedding_data_type = "FLOAT32"
        }
      }
      supplemental_data_storage_configuration {
        storage_location {
          type = "S3"
          s3_location {
            uri = "s3://${aws_s3_bucket.multimodal_output_bucket.bucket}"
          }
        }
      }
    }
  }

  storage_configuration {
    type = "S3_VECTORS"
    s3_vectors_configuration {
      index_arn = aws_s3vectors_index.vector_index.index_arn
    }
  }
}

resource "awscc_bedrock_data_source" "s3_data_source" {
  knowledge_base_id = aws_bedrockagent_knowledge_base.knowledge_base.id
  name              = "agentcore-test-s3-data-source"
  description       = "Data source for the Amazon Bedrock Knowledge Base: agentcore-test-knowledge-base from S3 with semantic chunking"

  data_source_configuration = {
    s3_configuration = {
      bucket_arn = var.data_source_bucket_arn
    }
    type = "S3"
  }

  vector_ingestion_configuration = {
    chunking_configuration = {
      chunking_strategy = "SEMANTIC"
      semantic_chunking_configuration = {
        breakpoint_percentile_threshold = 95
        buffer_size                     = 0 # either 0 or 1
        max_tokens                      = 300
      }
    }
    parsing_configuration = {
      parsing_strategy = "BEDROCK_DATA_AUTOMATION"
      bedrock_data_automation_configuration = {
        parsing_modality = "MULTIMODAL"
      }
    }
  }
}
Note: The data source uses the awscc provider to support Amazon Bedrock Data Automation for multimodal parsing.
Creates an Amazon Cognito user pool and client for Amazon Bedrock AgentCore Runtime authorization.
First, create an Amazon ECR repository and push an initial image:
resource "aws_ecr_repository" "agentcore_runtime_agent_code_ecr_repository" {
  name         = "agentcore-test-runtime-agent-code-ecr-repository"
  force_delete = true
}

resource "null_resource" "push_initial_image" {
  depends_on = [aws_ecr_repository.agentcore_runtime_agent_code_ecr_repository]

  triggers = {
    repository_url = aws_ecr_repository.agentcore_runtime_agent_code_ecr_repository.repository_url
    region         = var.region
  }

  provisioner "local-exec" {
    command = <<-EOT
      # Check if image exists, if not push alpine:latest as placeholder
      ...
    EOT
  }
}
...
Then define the AgentCore Runtime:
...
resource "aws_bedrockagentcore_agent_runtime" "agentcore_runtime" {
  agent_runtime_name = "agentcore_test_runtime"
  description        = "Agentcore runtime for the agentcore-test application"
  role_arn           = aws_iam_role.agentcore_runtime_role.arn

  protocol_configuration {
    server_protocol = "HTTP"
  }

  environment_variables = {
    BEDROCK_KNOWLEDGE_BASE_ID = aws_bedrockagent_knowledge_base.knowledge_base.id
  }

  authorizer_configuration {
    custom_jwt_authorizer {
      discovery_url   = "https://cognito-idp.${var.region}.amazonaws.com/${aws_cognito_user_pool.userpool.id}/.well-known/openid-configuration"
      allowed_clients = [aws_cognito_user_pool_client.userpool_client.id]
    }
  }

  agent_runtime_artifact {
    container_configuration {
      container_uri = "${aws_ecr_repository.agentcore_runtime_agent_code_ecr_repository.repository_url}:latest"
    }
  }

  network_configuration {
    network_mode = "PUBLIC"
  }

  depends_on = [null_resource.push_initial_image]
}
Key configuration points:
- The runtime references the latest Amazon ECR image tag

Create a Cognito user:
1. Navigate to the created user pool in the AWS Console
2. Go to Users under User Management
3. Create a user with the highlighted options and enter the email address of the user to invite
4. The user receives an email with the subject "Your temporary password" from "no-reply", containing a username and temporary password

Sync the knowledge base:
1. Navigate to the created knowledge base
2. Select the data source and choose Sync to ingest the documents
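The console Sync step can also be scripted. StartIngestionJob is a real Amazon Bedrock (bedrock-agent) API, but the IDs below are placeholders, and the boto3 call itself requires valid AWS credentials; only the request-building helper runs without them:

```python
# Scripted alternative to the console Sync button. The knowledge base and
# data source IDs are placeholders; substitute the values from your deployment.
def ingestion_request(kb_id: str, ds_id: str) -> dict:
    # Parameters for the bedrock-agent start_ingestion_job call.
    return {"knowledgeBaseId": kb_id, "dataSourceId": ds_id}

def sync_knowledge_base(kb_id: str, ds_id: str, region: str = "us-east-1") -> str:
    import boto3  # deferred so the sketch can be read without boto3 installed
    client = boto3.client("bedrock-agent", region_name=region)
    job = client.start_ingestion_job(**ingestion_request(kb_id, ds_id))
    return job["ingestionJob"]["ingestionJobId"]

print(ingestion_request("<knowledge-base-id>", "<data-source-id>"))
```

This is handy for CI pipelines that re-ingest documents after uploading new content to the source bucket.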
Run the provided script to upload agent code to ECR (scripts/upload-agent-to-ecr.sh):
cd ../ # go back to root of project if necessary
cp scripts/env.agent.template scripts/.env.agent # make sure the ECR repository matches the Terraform output
scripts/upload-agent-to-ecr.sh

Next, authenticate with Amazon Cognito using the temporary password:

aws cognito-idp initiate-auth \
  --auth-flow USER_PASSWORD_AUTH \
  --client-id <user-pool-client-id> \
  --auth-parameters USERNAME=<email>,PASSWORD=<tempPassword>
Response:
{
  "ChallengeName": "NEW_PASSWORD_REQUIRED",
  "Session": "AYABeMpHy...",
  "ChallengeParameters": {
    "USER_ID_FOR_SRP": "...",
    "requiredAttributes": "[]",
    "userAttributes": "{\"email_verified\":\"true\",\"email\":\"...\"}"
  }
}
Copy the Session value and update the password:
aws cognito-idp respond-to-auth-challenge \
  --region us-east-1 \
  --client-id <user-pool-client-id> \
  --challenge-name NEW_PASSWORD_REQUIRED \
  --session "<SESSION_FROM_PREVIOUS_CALL>" \
  --challenge-responses \
  USERNAME=<email>,NEW_PASSWORD=<newPassword>
Response includes AccessToken:
{
  "ChallengeParameters": {},
  "AuthenticationResult": {
    "AccessToken": "eyJra...",
    "ExpiresIn": 86400,
    "TokenType": "Bearer",
    "RefreshToken": "eyJjd...",
    "IdToken": "eyJra..."
  }
}
To reauthenticate, just run the initiate-auth shell command again with the new password.
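If you script this flow instead of running it by hand, a small helper can pull the token out of the challenge response (the sample dict mirrors the response shown above; the helper name is illustrative):

```python
# Illustrative helper for scripted auth: extract the Bearer token from a
# Cognito respond-to-auth-challenge response, failing loudly if the
# NEW_PASSWORD_REQUIRED challenge was not completed.
def extract_access_token(auth_response: dict) -> str:
    result = auth_response.get("AuthenticationResult")
    if not result:
        raise ValueError(f"challenge not complete: {auth_response.get('ChallengeName')}")
    return result["AccessToken"]

sample = {
    "ChallengeParameters": {},
    "AuthenticationResult": {
        "AccessToken": "eyJra...",
        "ExpiresIn": 86400,
        "TokenType": "Bearer",
    },
}
print(extract_access_token(sample))  # eyJra...
```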
Copy the AccessToken for authenticating our requests to AgentCore Runtime. We can now make a request to the AgentCore Runtime:
curl -X POST \
"https://bedrock-agentcore.us-east-1.amazonaws.com/runtimes/arn%3Aaws%3Abedrock-agentcore%3Aus-east-1%3A<ACCOUNT_ID>%3Aruntime%2F<AGENTCORE_RUNTIME_ID>/invocations?qualifier=DEFAULT" \
-H "Authorization: Bearer <TOKEN>" \
-H "Content-Type: application/json" \
-H "X-Amzn-Bedrock-AgentCore-Runtime-Session-Id: session-5123123141231555555555555555555555214124215552" \
-d '{"prompt": "Hello", "conversation_history": []}'
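The long session header value in the request above is not arbitrary: the runtime requires a session ID of at least 33 characters. A small helper can generate a compliant, unique one (concatenating two UUIDs is just one convenient approach; any sufficiently long unique string works):

```python
# Generate a session ID satisfying the 33-character minimum required by the
# X-Amzn-Bedrock-AgentCore-Runtime-Session-Id header.
import uuid

def new_session_id() -> str:
    return f"session-{uuid.uuid4().hex}{uuid.uuid4().hex}"

sid = new_session_id()
print(len(sid))  # 72: "session-" plus two 32-character hex strings
```

Reusing the same ID keeps requests on the same session; generating a fresh one starts a new, isolated session.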
Important Notes:
X-Amzn-Bedrock-AgentCore-Runtime-Session-Id must be at least 33 characters, since it identifies the session (each unique session gets its own dedicated microVM).

Streamed Response:
data: {"type": "text", "text": "Hey"}
data: {"type": "text", "text": " there! "}
data: {"type": "text", "text": "👋 Welcome"}
data: {"type": "text", "text": "!"}
data: {"type": "text", "text": " How"}
data: {"type": "text", "text": " can I help you today?"}
data: {"type": "text", "text": " Feel"}
data: {"type": "text", "text": " free to ask me anything –"}
data: {"type": "text", "text": " I"}
data: {"type": "text", "text": "'m here to assist!"}
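A client consuming this stream reassembles the reply by parsing each `data:` line as JSON and concatenating the `text` fields. A minimal sketch of that client-side logic:

```python
# Reassemble a streamed reply from SSE-style lines of the form
# 'data: {"type": "text", "text": "..."}'.
import json

def assemble(sse_lines):
    parts = []
    for line in sse_lines:
        if line.startswith("data: "):
            event = json.loads(line[len("data: "):])
            if event.get("type") == "text":
                parts.append(event["text"])
    return "".join(parts)

stream = [
    'data: {"type": "text", "text": "Hey"}',
    'data: {"type": "text", "text": " there!"}',
]
print(assemble(stream))  # Hey there!
```

The same loop also gives the client a natural place to handle `"type": "error"` events emitted by the agent's exception handler.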
Now let's ask a question about content in the knowledge base: Drylab News and the date of their next AGM.
curl -X POST \
"https://bedrock-agentcore.us-east-1.amazonaws.com/runtimes/arn%3Aaws%3Abedrock-agentcore%3Aus-east-1%3A<ACCOUNT_ID>%3Aruntime%2F<AGENTCORE_RUNTIME_ID>/invocations?qualifier=DEFAULT" \
-H "Authorization: Bearer <TOKEN>" \
-H "Content-Type: application/json" \
-H "X-Amzn-Bedrock-AgentCore-Runtime-Session-Id: session-5123123141231555555555555555555555214124215552" \
-d '{"prompt": "When is the next AGM for Drylab news?", "conversation_history": []}'
Response:
data: {"type": "text", "text": "According"}
data: {"type": "text", "text": " to the"}
data: {"type": "text", "text": " Drylab News"}
data: {"type": "text", "text": " newsletter"}
data: {"type": "text", "text": " from"}
data: {"type": "text", "text": " May"}
data: {"type": "text", "text": " 2017"}
data: {"type": "text", "text": ", the next Annual"}
data: {"type": "text", "text": " General Meeting (AGM) was"}
data: {"type": "text", "text": " scheduled for **June"}
data: {"type": "text", "text": " 16"}
data: {"type": "text", "text": "th"}
data: {"type": "text", "text": " at"}
data: {"type": "text", "text": " 15"}
data: {"type": "text", "text": ":00**"}
data: {"type": "text", "text": " (3:00 PM)."}
data: {"type": "text", "text": " An"}
data: {"type": "text", "text": " invitation"}
data: {"type": "text", "text": " was to"}
data: {"type": "text", "text": " be distribute"}
data: {"type": "text", "text": "d to all owners"}
data: {"type": "text", "text": " in"}
data: {"type": "text", "text": " advance"}
data: {"type": "text", "text": "."}
The agent successfully retrieves and synthesizes information from the knowledge base!
Let’s also test it on the image file added to the data source:
curl -X POST \
"https://bedrock-agentcore.us-east-1.amazonaws.com/runtimes/arn%3Aaws%3Abedrock-agentcore%3Aus-east-1%3A<ACCOUNT_ID>%3Aruntime%2F<AGENTCORE_RUNTIME_ID>/invocations?qualifier=DEFAULT" \
-H "Authorization: Bearer <TOKEN>" \
-H "Content-Type: application/json" \
-H "X-Amzn-Bedrock-AgentCore-Runtime-Session-Id: session-5123123141231555555555555555555555214124215552" \
-d '{"prompt": "In one sentence tell me about the method of the placebo effect experiment", "conversation_history": []}'
Response:
data: {"type": "text", "text": "The experiment"}
data: {"type": "text", "text": " teste"}
data: {"type": "text", "text": "d the placebo effect by having"}
data: {"type": "text", "text": " Par"}
data: {"type": "text", "text": "kinson's Disease"}
data: {"type": "text", "text": " patients receive treatments"}
data: {"type": "text", "text": " describe"}
data: {"type": "text", "text": "d as co"}
data: {"type": "text", "text": "sting $100"}
data: {"type": "text", "text": " an"}
data: {"type": "text", "text": "d then"}
data: {"type": "text", "text": " $"}
data: {"type": "text", "text": "1"}
data: {"type": "text", "text": "500"}
data: {"type": "text", "text": ","}
data: {"type": "text", "text": " measuring"}
data: {"type": "text", "text": " changes in their motor"}
data: {"type": "text", "text": " function"}
data: {"type": "text", "text": " after"}
data: {"type": "text", "text": " each"}
data: {"type": "text", "text": " administration"}
The agent successfully extracts and summarizes information from image-based documents, demonstrating Amazon Bedrock Data Automation's multimodal capabilities.
And that's it! You now have a fully functional RAG application with streaming responses, secured by Amazon Cognito.
To destroy all resources created during this hands-on guide:

cd infra
terraform destroy # approve with 'yes'
When deploying the application for the first time, you may encounter a few common issues related to AWS configuration, permissions, or service access. The following troubleshooting tips address the most common issues encountered during setup.
Addressing these issues typically resolves the majority of deployment errors and allows the agent to start successfully.
You've successfully built and deployed a RAG application using Amazon Bedrock AgentCore Runtime. This implementation demonstrates several key capabilities commonly used in real-world agent architectures.
To take this further, consider extending the implementation with other AgentCore services, such as Memory for conversation persistence, Gateway for tool integration, or Observability for monitoring.
Amazon Bedrock AgentCore provides a flexible foundation for building and scaling AI agents. By combining serverless execution, managed AI services, and infrastructure-as-code practices, this approach offers a practical reference for teams looking to move beyond simple prototypes as requirements evolve.
Building a production-ready agentic application requires more than standing up infrastructure; it demands the right architectural decisions, security posture, and cost controls from day one. Caylent brings deep AWS expertise and hands-on experience with Amazon Bedrock AgentCore to help organizations design, build, and scale secure AI applications with confidence. From RAG architectures and multimodal knowledge bases to authentication, observability, and infrastructure as code, our teams partner with you to move beyond POCs and deliver AI systems that are ready for real users, real traffic, and real business impact. Get in touch today to get started.
Kevin is a Cloud Software Architect in the Cloud Native Applications practice at Caylent. He has built many solutions using TypeScript, Python, and Java, and has in-depth experience with building serverless applications on AWS. Having previously worked at Amazon, Kevin has an in-depth understanding of AWS technologies and closely works within the Leadership Principles. He enjoys building and rebuilding applications in the AWS ecosystem and helping clients build cloud-native applications.