Engineering

Browse all engineering posts from the Transactional team.

8 posts

Why Your AI Chatbot Forgets Everything (And How to Fix It)

Architecture of persistent user memory using vector search for LLM applications. Embedding strategies, retrieval patterns, memory decay, and per-user scoping.

Transactional Team

Feb 26, 202611 min read

Engineering

How Semantic Caching Can Cut Your LLM Costs by 60%

A practical guide to implementing semantic caching with vector embeddings to reduce LLM API costs. Covers architecture, similarity thresholds, cache invalidation, and production considerations.

Transactional Team

Feb 24, 20269 min read

Engineering

MCP Security: What Developers Need to Know

Security analysis of the Model Context Protocol ecosystem. Authentication gaps, tool poisoning risks, excessive permissions, and a security checklist for developers adopting MCP servers.

Transactional Team

Feb 11, 202612 min read

Engineering

Prompt Injection Nearly Broke Production AI. These Patterns Can Save You.

Prompt injection incident analysis and proven defense patterns: input sanitization, output validation, system prompt hardening, sandwich defense, and canary tokens.

Transactional Team

Feb 7, 202611 min read

Engineering

15 Years of Shipping Software Taught Me Email Deliverability is an Infrastructure Problem

Why developers treat email as an afterthought and what goes wrong at scale. IP warming, feedback loops, bounce handling, reputation management, and the architecture of a reliable email pipeline.

Transactional Team

Feb 5, 202610 min read

Engineering

We Were Flying Blind on LLM Costs Until We Started Tracing Every Token

How we built token-level tracing to gain visibility into LLM costs, latency, and performance across providers. Architecture of the observability pipeline and the cost surprises we caught.

Transactional Team

Jan 30, 202610 min read

Engineering

Traditional APM Cannot Track AI Errors. Here is What We Built Instead.

Why Sentry and Datadog fail for AI-specific errors like hallucinations, context overflows, and model degradation. Architecture of an AI-native error tracking system.

Transactional Team

Jan 28, 202610 min read

Engineering

We Built an AI Gateway That Routes Across 13 LLM Providers. Here is How.

Architecture deep-dive into building a unified LLM proxy that routes requests across OpenAI, Anthropic, Google, Mistral, and more with load balancing, failover, and schema normalization.

Transactional Team

Jan 22, 202612 min read

YOUR AGENTS DESERVE
REAL INFRASTRUCTURE.

START BUILDING AGENTS THAT DO REAL WORK.

Deploy Your First Agent

Engineering

Why Your AI Chatbot Forgets Everything (And How to Fix It)

How Semantic Caching Can Cut Your LLM Costs by 60%

MCP Security: What Developers Need to Know

Prompt Injection Nearly Broke Production AI. These Patterns Can Save You.

15 Years of Shipping Software Taught Me Email Deliverability is an Infrastructure Problem

We Were Flying Blind on LLM Costs Until We Started Tracing Every Token

Traditional APM Cannot Track AI Errors. Here is What We Built Instead.

We Built an AI Gateway That Routes Across 13 LLM Providers. Here is How.

YOUR AGENTS DESERVEREAL INFRASTRUCTURE.

YOUR AGENTS DESERVE
REAL INFRASTRUCTURE.