Engineering

Browse all engineering posts from the Transactional team.

8 posts
Why Your AI Chatbot Forgets Everything (And How to Fix It)
Engineering

Why Your AI Chatbot Forgets Everything (And How to Fix It)

Architecture of persistent user memory using vector search for LLM applications. Embedding strategies, retrieval patterns, memory decay, and per-user scoping.

Transactional Team
Feb 26, 202611 min read
How Semantic Caching Can Cut Your LLM Costs by 60%
Engineering

How Semantic Caching Can Cut Your LLM Costs by 60%

A practical guide to implementing semantic caching with vector embeddings to reduce LLM API costs. Covers architecture, similarity thresholds, cache invalidation, and production considerations.

Transactional Team
Feb 24, 20269 min read
MCP Security: What Developers Need to Know
Engineering

MCP Security: What Developers Need to Know

Security analysis of the Model Context Protocol ecosystem. Authentication gaps, tool poisoning risks, excessive permissions, and a security checklist for developers adopting MCP servers.

Transactional Team
Feb 11, 202612 min read
Prompt Injection Nearly Broke Production AI. These Patterns Can Save You.
Engineering

Prompt Injection Nearly Broke Production AI. These Patterns Can Save You.

Prompt injection incident analysis and proven defense patterns: input sanitization, output validation, system prompt hardening, sandwich defense, and canary tokens.

Transactional Team
Feb 7, 202611 min read
15 Years of Shipping Software Taught Me Email Deliverability is an Infrastructure Problem
Engineering

15 Years of Shipping Software Taught Me Email Deliverability is an Infrastructure Problem

Why developers treat email as an afterthought and what goes wrong at scale. IP warming, feedback loops, bounce handling, reputation management, and the architecture of a reliable email pipeline.

Transactional Team
Feb 5, 202610 min read
We Were Flying Blind on LLM Costs Until We Started Tracing Every Token
Engineering

We Were Flying Blind on LLM Costs Until We Started Tracing Every Token

How we built token-level tracing to gain visibility into LLM costs, latency, and performance across providers. Architecture of the observability pipeline and the cost surprises we caught.

Transactional Team
Jan 30, 202610 min read
Traditional APM Cannot Track AI Errors. Here is What We Built Instead.
Engineering

Traditional APM Cannot Track AI Errors. Here is What We Built Instead.

Why Sentry and Datadog fail for AI-specific errors like hallucinations, context overflows, and model degradation. Architecture of an AI-native error tracking system.

Transactional Team
Jan 28, 202610 min read
We Built an AI Gateway That Routes Across 13 LLM Providers. Here is How.
Engineering

We Built an AI Gateway That Routes Across 13 LLM Providers. Here is How.

Architecture deep-dive into building a unified LLM proxy that routes requests across OpenAI, Anthropic, Google, Mistral, and more with load balancing, failover, and schema normalization.

Transactional Team
Jan 22, 202612 min read

YOUR AGENTS DESERVE
REAL INFRASTRUCTURE.

START BUILDING AGENTS THAT DO REAL WORK.

Deploy Your First Agent