VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
June 23, 2026 · Hacker News