Posts

Public writing from Bolt Foundry

Posts remain live as published. Older entries may reflect earlier terminology or historical framing rather than the current public Gambit story.

Latest published post

Gambit: an open source agent harness for reliable agents, assistants, and workflows

Dan Sisco

Gambit is our open-source agent harness for people who need production-friendly planning, grading, and test agents without rebuilding orchestration from scratch.

agents
open-source
gambit

Context engineering is the way

Dan Sisco

Context engineering is the new term for what we've been working on at Bolt Foundry: systematically optimizing LLM performance through structured samples, graders, and proper information hierarchy.

context engineering
evals
reliability

Evals from scratch: Building LLM evals with aibff from Markdown and TOML

Dan Sisco

We built a reliable eval system from Markdown, TOML, and a command-line tool that adapts when you change prompts, and we demonstrate it by creating graders for an AI-powered sports newsletter.

evals
labs
reliability

From inconsistent outputs to perfect reliability in under an hour

Dan Sisco

How Velvet increased the reliability of its citation XML output to 100% in under an hour using LLM attention management principles.

case-study
reliability
llm

5 things about LLM prompts we think everyone should know

Dan Sisco

Most teams are building LLM prompts wrong. Here are 5 essential concepts for building reliable LLM applications.

llm
prompts
reliability