5 min read

The Cost of Using AI Is Moving From Dollars to Discipline

I hit my AI session limit mid-thought, already paying for the hundred-dollar plan, with no way forward but to pay more. Paying would have fixed that night, not the habit. The cost of using AI well is shifting from dollars to discipline, and here are the three habits that kept me building.

For a few weeks I had been mapping out a roadmap for AI agents, some for my personal portfolio, some to automate the growth work behind Neural Gains Weekly. This was strategic, reasoning-heavy work, all of it in the regular chat app, the same place most people sit down to solve problems of their own. I was working through one problem after another, and the work just flowed. Plan a piece, get a response, sharpen it, move to the next. The rhythm of solving problems with AI was pure bliss.

Then I hit the session limit. The tool locked me out, the screen told me to wait, and the work stopped cold. The ground came out from underneath me, mid-thought, with nowhere to put the momentum. I was already paying for the $100-a-month plan, and the only way to keep going right then was to pay even more.

Why Your Sessions Run Out Faster Even as Tokens Get Cheaper

The session limit is a time problem, not a money problem. When you have ample time, these limits are easy to dismiss. You hit the ceiling, you shrug, you pick the work back up tomorrow. But the last six weeks for me have been work, travel, and presentations stacked back to back, and time was the one thing I did not have. When time is compressed, the wait stops being a footnote and becomes the obstacle, and you need another game plan.

The strange part is that this squeeze is happening while the raw cost of AI keeps falling. Epoch AI, a research group that tracks these prices, found that in recent years the cost to run a model at a given level of capability has been falling by a median of about 50 times a year, and for some tasks by as much as 900 times. If you have been following the token mechanics, that trend is no surprise. On a per-token basis, AI has never been cheaper.

So why does the same work run out my session faster than it did six months ago? Because the cheap part was never the constraint. The era of effectively subsidized, all-you-can-use access for at-home power users is quietly ending, and the limits are where you feel it first.

You could start to see this trend take shape in May, when the squeeze stopped being a feeling and showed up in the open. Anthropic doubled Claude Code's five-hour rate limits for its paid plans, removed the peak-hour slowdowns there, and raised Opus rate limits on the API, all of it announced alongside a SpaceX compute deal of more than 300 megawatts and over 220,000 GPUs. Those increases went to Claude Code and the Opus API, while the regular chat session limit was not on the list. The relief went where the revenue is.

These companies are not villains. They are businesses making historic bets on compute, and bets that size have to be paid back. The new capacity flows to the products and customers funding the buildout, which is exactly what you would expect any business to do. Seeing it that clearly is what turns the frustration into a plan.

Do Not Burn Your Best Model on Your Smallest Tasks

Once I started treating the limit as a budget rather than something to fight, the first move was obvious. Stop running my most powerful model on work that does not need it.

Most of what I do in a week is light work, like drafting a newsletter intro, running a meta-tag pass, or cleaning up formatting. It runs fine on a faster, lighter model and leaves my premium capacity for the problems that actually need deep reasoning. For a long time I left the strongest model selected for everything, the way you leave a light on in a room you have already walked out of, and that default drains a session faster than anything else.

The makers of these models say the same thing. Anthropic's own guidance notes that the heaviest model uses far more of your usage per turn, and advises switching up to it only when a task needs it.

The fix takes about three seconds. Before I start anything, I ask whether it really needs the top model. If not, I drop a tier and save the heavy lifting for later.

Walk Into Your Hardest Work With a Full Tank

My second habit is about how much usage I have left when the hard work starts.

Despite the name, the five-hour window is not really five hours of work. Your usage is capped by tokens, and a token-heavy session can burn through that cap in an hour, which means your time at the keyboard can run out long before you are done thinking. So when I know a session will be intensive, I do not walk into it having already spent half my budget on small stuff.

I start it with a full tank and give my heaviest sessions a clean start.

That one change has done more for my output than any prompt trick. The deep work gets the room it needs, and I rarely hit the wall in the middle of the problem I care most about solving.

When You Hit the Wall, Route Around It

Prevention only takes you so far. Some nights you do everything right and still run out, which brings me back to that night with the agent plan.

Rather than pay to push through, I copied my context into ChatGPT, the plan so far and the open questions, and kept building there.

It was not seamless. I had to reorient the new model and rebuild a little of where I was, but within minutes the momentum came back. I finished that night without spending an extra dollar.

You almost certainly have access to more than one capable model already, between the free tiers of ChatGPT, Claude, and Gemini and whatever plan you pay for. What you build in one can move to another in a couple of minutes, so when one tool cuts you off, take the work elsewhere instead of paying to break back in.

Stay Deliberate, Keep Building

Through some of the busiest weeks I have had, three habits kept me building at full capability from home, without paying a cent more. The cost of using AI well is shifting from dollars to discipline, and discipline is the one part of this you fully control.

None of these habits are clever, which is the point. Anyone can pick them up. If you are feeling the same squeeze, start with the one that fits your week.

Discipline beats spend, and from a home setup that is most of the game. The math changes when the money is a company's instead of your own, which is exactly where the next Founder's Corner is headed. For now, keep experimenting and keep building.

Enjoy this? Get it in your inbox every Tuesday.

Practical AI workflows. No hype. No spam. Just receipts.

Subscribe Free

Before you go...

Get one practical AI workflow in your inbox every Tuesday. Free. No spam. Just receipts.

Subscribe Free