AWS Docs RAG — Aaron Altergott

Ask anything about AWS.

A full-stack retrieval-augmented generation (RAG) system that answers questions about AWS from the official AWS documentation. Each question is embedded with Titan Embeddings v2, matched against a Pinecone vector index, and answered by Claude Sonnet 4.6 via Amazon Bedrock. The frontend is served from S3 through CloudFront with Origin Access Control (OAC).

▪ Stack

LLM	Claude Sonnet 4.6 (Bedrock)
Embeddings	Titan Embeddings v2
Vector DB	Pinecone (free tier)
Compute	Lambda + API Gateway (REST)
Frontend	S3 + CloudFront (OAC)
Region	us-east-2

▪ Request Flow

User → CloudFront → API Gateway → Lambda
  ├── Embed question   (Titan v2 via Bedrock)
  ├── Search Pinecone  (top-5 chunks)
  ├── Build prompt     (context injection)
  ├── Generate answer  (Claude via Bedrock)
  └── Return           (response + sources)
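The Lambda steps in the flow above can be sketched roughly as follows. This is a minimal illustration, not the project's actual handler: the Bedrock and Pinecone calls are passed in as plain callables (`embed`, `search`, `generate`), and the chunk shape (`text`, `url`), the prompt wording, and the function names are all assumptions made for the example.

```python
from typing import Callable, Dict, List, Sequence

TOP_K = 5  # the flow above retrieves the top-5 chunks

# Hypothetical chunk shape: {"text": "...", "url": "..."}
Chunk = Dict[str, str]


def build_prompt(question: str, chunks: Sequence[Chunk]) -> str:
    """Context injection: place the retrieved chunks ahead of the question."""
    context = "\n\n".join(
        f"[{i}] {c['text']}\n(source: {c['url']})"
        for i, c in enumerate(chunks, start=1)
    )
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )


def answer_question(
    question: str,
    embed: Callable[[str], List[float]],               # e.g. Titan v2 via Bedrock
    search: Callable[[List[float], int], List[Chunk]],  # e.g. a Pinecone query
    generate: Callable[[str], str],                     # e.g. Claude via Bedrock
) -> dict:
    """Run embed -> search -> build prompt -> generate -> return."""
    vector = embed(question)
    chunks = search(vector, TOP_K)
    prompt = build_prompt(question, chunks)
    answer = generate(prompt)
    # De-duplicate source URLs while keeping retrieval order.
    sources = list(dict.fromkeys(c["url"] for c in chunks))
    return {"response": answer, "sources": sources}
```

Injecting the three service calls keeps the orchestration testable without AWS credentials or a live index; in a deployed handler each callable would wrap the corresponding client.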

▪ Security

OAC blocks all direct S3 access: the bucket serves only requests signed by CloudFront. Lambda pins Access-Control-Allow-Origin to the CloudFront domain, so browsers let only pages served from that origin read API responses. Input is validated and length-capped before embedding; source URLs are checked for an http:// or https:// scheme before rendering.
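The input, URL, and CORS checks described above might look something like the sketch below. The 500-character cap, the choice to reject over-long input rather than truncate it, the function names, and the placeholder CloudFront domain are all assumptions for illustration; the source does not state the real values.

```python
from urllib.parse import urlparse

MAX_QUESTION_CHARS = 500  # assumed cap; the real limit is not stated
ALLOWED_ORIGIN = "https://d111111abcdef8.cloudfront.net"  # placeholder domain


def validate_question(raw: str) -> str:
    """Trim the question and reject empty or over-long input before embedding."""
    if not isinstance(raw, str):
        raise ValueError("question must be a string")
    question = raw.strip()
    if not question:
        raise ValueError("question is empty")
    if len(question) > MAX_QUESTION_CHARS:
        raise ValueError("question is too long")
    return question


def is_safe_source_url(url: str) -> bool:
    """Accept only absolute http(s) URLs, so schemes like javascript: never render."""
    try:
        parsed = urlparse(url)
    except ValueError:  # urlparse raises on some malformed hosts
        return False
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def cors_headers() -> dict:
    """Pin the allowed origin to one domain instead of using a wildcard."""
    return {
        "Access-Control-Allow-Origin": ALLOWED_ORIGIN,
        "Content-Type": "application/json",
    }
```

For example, `is_safe_source_url("javascript:alert(1)")` returns `False` because the scheme is not http or https, while a normal documentation link passes.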