
Last updated: 19 March 2026
If you've opened Microsoft 365 Copilot recently and noticed a dropdown in the top-right corner of Copilot Chat, you're not imagining things. Microsoft has been quietly rolling out model choice, and most users now have access to multiple AI models and response modes.
The problem is that none of this is explained particularly well inside the product. So if you've been wondering what the difference is between the default experience and GPT-5.4, what Auto, Quick Response and Think Deeper actually do, where GPT-5.3 fits in, or why there's now a Claude option appearing, this post breaks it all down in plain English.
As of late 2025, GPT-5 is the standard model that powers Microsoft 365 Copilot for all licensed users. It became the default in a phased rollout between October and November 2025, and by now it's mandatory across web, desktop and mobile. You can't opt out of it, and you don't need to do anything to enable it.
GPT-5 introduced something Microsoft calls dynamic model routing. Behind the scenes, Copilot looks at your prompt and automatically decides which variant of the model to use. Simple questions get routed to a fast, high-throughput model that responds quickly. More complex prompts get sent to a deeper reasoning model that takes more time but produces more thoughtful, structured answers.
For most everyday use, GPT-5 with its automatic routing does a solid job. If you're summarising emails, drafting messages, asking quick questions or getting meeting recaps, the default experience handles it well without you needing to think about models at all.
Alongside the model selector, Microsoft has introduced a response mode selector that gives you direct control over how much reasoning Copilot applies before answering. You'll find this near the prompt area in Copilot Chat, and there are three options:
Your mode selection persists across chats, so once you've set it, it stays until you change it.
This is where it gets interesting. In addition to the default GPT-5 experience, Microsoft has made OpenAI's newer models available as optional upgrades you can select from the model dropdown in the top-right corner of Copilot Chat.
Rolling out since December 2025, GPT-5.2 delivers better instruction following, improved maths and coding performance and stronger multi-step reasoning. It also handles significantly longer context, meaning it's better at maintaining coherence across long chat threads or large documents.
When paired with the Think Deeper mode, GPT-5.2 is a powerhouse. On OpenAI's GDPval benchmark (which measures AI performance across real-world professional tasks), GPT-5.2 Thinking achieved a 70.9% win or draw rate against human experts, compared to GPT-5's 38.8%. If you require genuine reasoning rather than pattern matching, selecting GPT-5.2 with Think Deeper is worth the extra wait.
GPT-5.2 now appears in the model picker as both GPT-5.2 Quick Response and GPT-5.2 Think Deeper, and is considered a legacy option now that newer models are available.
On 3 March 2026, Microsoft began rolling out GPT-5.3 Instant. It appears in the model picker as GPT-5.3 Quick Response.
GPT-5.3 replaced GPT-5.2 Instant as the everyday fast model. It delivers more accurate responses, stronger writing and better web synthesis, meaning it does a better job of combining search results with its own knowledge rather than just dumping links at you.
Real-world testing shows its quality is highly dependent on the task:
As of mid-March 2026, Microsoft has started rolling out GPT-5.4 Thinking to Microsoft 365 Copilot users. It appears in the model picker as GPT-5.4 Think Deeper and is now the most capable reasoning model available in Copilot.
GPT-5.4 is OpenAI's latest frontier model. It combines the coding capabilities of GPT-5.3-Codex with improved reasoning, agentic workflows and a one-million-token context window. In practical terms, this means it can hold significantly more information in its working memory when reasoning through complex tasks.
Where GPT-5.3 focuses on speed and everyday quality, GPT-5.4 focuses on depth. Microsoft describes it as better at working through complex, multi-step tasks with more clarity and consistency, producing stronger first drafts without the usual back-and-forth. It is also more token-efficient than GPT-5.2 when reasoning, meaning it solves problems using fewer internal tokens while delivering better results.
Use GPT-5.4 Think Deeper when you need Copilot to go deep: analysing lengthy contracts, building detailed financial models, comparing strategic options across multiple documents or working through technical problems that require sustained reasoning. For quick everyday tasks, the default Auto mode or GPT-5.3 Quick Response will still be faster and more practical.
If GPT-5.4 is the most capable model available, why wouldn't you just leave it selected permanently? There are two main reasons:
Save the manual model selection for the work that genuinely benefits from it: strategy documents, complex analysis, detailed reports and multi-step problem solving.
Microsoft has also started offering Anthropic's Claude models (Claude Sonnet 4.5 and Claude Opus 4.5) within Copilot Studio and the Researcher agent.
As of January 2026, Anthropic has been onboarded as a Microsoft subprocessor, meaning Claude usage is covered under Microsoft's Product Terms and Data Protection Addendum (DPA). Admins can manage this from the Microsoft 365 admin centre under Copilot > Settings > Data access > AI providers operating as Microsoft subprocessors.
Here's the important detail for UK and EU organisations: Anthropic's models are currently excluded from the EU Data Boundary and in-country processing commitments. All Claude processing happens on Anthropic's infrastructure in the United States. Your data leaves Microsoft's managed environment, gets processed in the US and then returns. Because of this, Microsoft has toggled Claude to Off by default for organisations in the EU, EFTA and the UK.
If your organisation has strict GDPR or data residency requirements, you need to think carefully before enabling Claude. The DPA covers the data protection, but the physical processing location introduces international data transfer considerations that your compliance team must assess. You can opt-in, but it requires an active decision by an admin.
Here's the simple version of how it all fits together:
The key takeaway is that you now have real choices about how Copilot works for you. Model access varies depending on whether you are on Copilot Basic or Premium, so it is worth knowing which tier you are on. Do not assume "newer" means "better" for every prompt. The default Auto experience is fantastic, but knowing exactly when to reach for GPT-5.4 Think Deeper or GPT-5.3 Quick Response can make a meaningful difference to your output quality.
For accounting firms and law firms in particular, understanding which model handles which task type is a practical skill that directly affects your bottom line.
If you'd like help getting your team trained on these new features through our Microsoft Copilot training, or need guidance on the admin settings and compliance implications, book a consultation with us.
Book a free consultation to talk through where you are and where you want to be. No pressure, no hard sell. Just an honest conversation.