Back to Home

AI / LLM Integration

From API wrappers to self-hosted multi-agent systems

AI integration goes far beyond calling an API. For LaunchThatBot, I built a platform where users can self-host OpenClaw AI agent instances on their own hardware and connect them to the central platform via WebRTC and WebSockets. The system supports multi-agent orchestration, RAG pipelines with vector search, real-time streaming responses, and a guardrails layer with input sanitization, PII redaction, and confidence-gated outputs. For AdaScout, I built an offline evaluation harness that measures AI scanner precision and recall against deterministic axe-core baselines, with per-criterion F1 tracking, LLM-as-judge validation for ambiguous findings, and regression gating on prompt changes. RAG over WCAG 2.2 AA specifications grounds remediation suggestions with specific criterion citations. Across projects, I've implemented confidence scoring, human-in-the-loop escalation patterns, and cost/performance trade-off analysis for model selection.

Key Achievements

  • Built a self-hosted AI agent platform connected via WebRTC to a central orchestration layer
  • Implemented RAG pipelines with vector search for document-aware AI responses
  • Created multi-agent coordination where 3+ agents collaborate on complex tasks
  • Integrated AI-powered browser automation with Playwright and Stagehand
  • Built an AI evaluation framework measuring precision/recall against deterministic baselines with regression gating
  • Implemented RAG pipelines with pgvector for grounded, citation-backed AI responses
  • Designed guardrails layer with confidence scoring, PII redaction, and human-review escalation

Tools & Technologies

Projects Using This Skill

Where I've Applied This

Explore Other Skills