
tl;dr I've been testing Fable (Mythos) for the past week and it feels unlike any other model I've used. It feels, and is priced, like a next-generation model. It also has some real quirks.
The Good
Workflow mode is the standout. I told it to do a "full code review" and watched it spin up hundreds of agents in parallel. It assigned an individual agent to basically every file in my application. It found bugs, edge cases, missing documentation, and UX improvements all over my application. And I've given this same prompt to other models that didn't find nearly as many issues.
It's also wildly autonomous. It's way more willing than any previous Claude or GPT model to go off and work for hours at a time. Most of all, I trusted it to complete the goal I set. It'll happily burn a ton of tokens to get there. It felt like every time I kicked off Fable, it had the intention of taking on a massive project.
I felt more confident in giving Fable massive, complex tasks. More so than any other model I've ever used. I couldn't think of a problem that would make it stumble. And it felt like it was very eager to take on these massive tasks.
And that’s where this models stands out - long horizon tasks. It’s hard to imagine where the limit of its time-horizon is.
It's not a God model though. There are things it needs to improve.
Quirks
It's incredibly verbose. Explanations get deep in the weeds fast. I updated my claude.md to rein it in and even that wasn't enough. I kept having to ask it to explain things more simply. It wasn't just the verbosity either, it was the information density. The way it explained things made me feel genuinely dumb.
A note on information density - I didn't appreciate how important this is. The more information you can convey in a fixed token budget allows the model to be effectively smarter at a cheaper cost. This also is a strong argument for agents inventing their own hyper-dense language in the future.
Fable loves clarifying questions. A single prompt turns into: questions, then a summary of my answers, then confirm the summary, then a spec, then confirm the spec, then confirm the agent approach (parallel vs sequential), then finally build. I wanted it to make those decisions for me. Anthropic told me this gets fixed with the updated system prompt.
It feels slow. Slower than previous Opus models and even GPT. Slow to start, and it takes its time working through problems. That's the opposite of what I've loved about Opus. Opus always felt faster than GPT-5.5 to me on two fronts: raw tokens/sec, and that it found shorter paths to solutions. Fable is different on both. Even on simple tasks it would crawl. I'd watch the timer climb while the output tokens sat static, five minutes in and only a few thousand tokens used. It wants to be as thorough as possible, and that takes time.
Conclusion
Pro tip: turn down the effort level, even more than you think you need to. It thinks a LOT even at the middle setting. Even on low effort it's incredibly capable and still thinks for a while.
All of these quirks are fixable - with model optimizations and increased compute capacity to increase speed, and with more fine-tuning/RL and system prompt adjustments to help with verbosity and being overly cautious to make decisions for me.
Verdict: MYTHOS is extremely capable, and I'm still figuring out how to get the most out of it. It felt like it wanted my hardest tasks and anything less wasn't good enough. This is the first look at a brand new training run and it's already the most capable model I've used.
That's the part I can't stop thinking about.
