Archive
All past issues.
Issue #5
Your format constraints are silently breaking your prompts
Issue #5 of sumocat — sharp insights from this week's AI research for builders.
Issue #4
Adding format constraints to your prompts is silently breaking your model -- and your evals are missing it
Issue #4 of sumocat — sharp insights from this week's AI research for builders.
Issue #3
Adding a single formatting rule to your prompt can silently kill half your response quality
Issue #3 of sumocat — sharp insights from this week's AI research for builders.
Issue #2
Your Format Constraints Are Silently Wrecking Response Quality
Issue #2 of sumocat — sharp insights from this week's AI research for builders.
Issue #1
Nobody Is Testing AI Systems Properly
Voice agents fail under real speech, LLM judges can't be trusted, and your eval suite has blind spots you don't know about. Seven papers that prove QA for AI is broken.