Opus 4.6 has been awful for me and my team. It goes immediately off the rails an...

majora2007 · 2026-02-18T15:40:24

That's interesting; 4.6 is when AI finally started to become good in my eyes. I have a very strict plan phase: argue, plan, then partially execute. I like it to do the boilerplate, then I do the hard stuff myself and have it do a once-over at the end.

Although I have had it try to debug something and just get stuck chugging tokens.

1broseidon · 2026-02-18T05:46:33

I have found this to be true too, and I thought I was the only one. Everyone is praising 4.6, and while it’s great at agentic work and tool use, it does not follow instructions as cleanly as 4.5. I also feel like 4.5 was just way more efficient.

qalmakka · 2026-02-18T10:10:38

I think that's because not everyone does the same job within the same stack and constraints. I've yet to find an LLM that writes the kind of C++ I dabble with without my having to manually tweak it (or that truly understands our codebase). Conversely, I find that LLMs are now excellent at Python and orchestration tasks, for instance. It's very situational.

1broseidon · 2026-02-18T13:21:03

100% - you are very right. 4.6 is amazing for orchestration. I even built some tools around agent-to-agent contracting.

I use 4.6 as the brain and then hand off to a more rigid LLM like GPT 5.2 or Opus 4.5.