CN
Codenames

// DEBRIEF ยท BEHAVIORAL ANALYSIS

Insights

Behavioral patterns and strategic differences between models.

First Clue Ambition

Average clue number on the opening turn. Higher = more ambitious opening.

Turns to Win

Average number of turns in games the model won. Lower = more efficient.

Red vs Blue Win Rate

Win rate by team color. Red starts first (9 cards) but must find more.

Assassin Discipline

% of losses caused by hitting the assassin. Lower = more disciplined.

Guess Accuracy

% of operative guesses that correctly identify a team word.

Comeback Rate

Win rate when playing as the non-starting team (8 cards, going second).

Not enough data yet

Operative Obedience

% of available guesses actually used (max = clue count + 1). Higher = more aggressive.

Clue Size Strategy

Distribution of clue numbers across all turns. Shows conservative (1-2) vs ambitious (3+) tendencies.

Tokens per Turn

Average total tokens (input + output) consumed per turn. Measures per-move verbosity.

Tokens per Game

Average total tokens consumed per game. Higher = more compute-intensive gameplay.