Caveman: The Dumbest Idea That Actually Works
Token compression is usually framed as a semantic problem: summarize the conversation, extract the key points, maintain a rolling context window through distillation. Caveman treats it as a typographic problem. It replaces verbose English with abbreviations before the text ever reaches the model. “Without” becomes “w/o”. “Configuration” becomes “cfg”. “…



