hammer-benchmark-regression - Claude Code Skill | AgentSkill