Why My Cursor Remote SSH Connection Kept Dropping and How I Fixed It
June 19, 2025How I Fixed Cursor IDE’s Unbearable Interface Lag on Linux with a Simple Script
June 19, 2025I was optimizing my AI workflows in Cursor IDE with Gemini 2.5 Flash when I hit a confusing roadblock. Different sources claimed conflicting context window sizes for MAX mode. This inconsistency could’ve tanked my project efficiency, so I rolled up my sleeves to find the real answer.
The Problem I Encountered
Google’s official documentation stated Gemini 2.5 Flash had a 128k token context window, even with MAX mode enabled.
But when I opened Cursor IDE’s model selector, it clearly displayed 1M tokens. Which was true? This mismatch made cost planning impossible for my coding projects.
My Investigation Process
First, I tested both modes in Cursor IDE. Strangely, both showed 1M tokens after a recent update.
That seemed wrong. Non-MAX mode should have lower capacity to keep costs down. I remembered earlier screenshots showing 128k for non-MAX.
The conflicting signals made me wonder: Was this a bug? Or had something changed?
The Breakthrough Discovery
After checking with reliable sources, I got confirmation: the documentation was outdated.
Here’s the reality: Gemini 2.5 Flash gives 1M tokens only in MAX mode. Non-MAX stays at 128k. The docs simply hadn’t been updated to reflect this.
Actionable Steps for Smooth Sailing
- Trust the IDE display: Cursor’s model selector shows real-time context sizes – ignore stale documentation
- Use MAX mode strategically: Activate only for massive files or complex queries to avoid token overuse
- Stick to non-MAX for routine tasks: Perfect for quick code completions where 128k suffices
Verifying directly in Cursor IDE became my new habit. It’s prevented budget surprises and kept my AI coding flowing smoothly.