Cherry Place AI · Field Notes
ISSUE No. 01 FILED Q2 2026 PAGES 1 OF 1
A FORENSIC ACCOUNTING

The hidden costs of not deploying Claude at your consultancy.

The hardest cost to see is the one your firm has already accepted. It does not show up on the P&L. It shows up as not enough hours in the day. Six line items below, sourced and totaled.

§ Preamble
part one

ChatGPT seats charged to the corporate card. Junior associates burning Saturday mornings re-formatting decks they could draft in fifteen minutes. Proposals that take a week because nobody trusts the AI to write them, so nobody set up the AI to write them.

None of this shows up on the P&L. Partners read it as a staffing problem. It is not a staffing problem. It is an architecture problem. And every quarter you defer the architecture, the cost compounds against you while it compounds for the firm across the street.

§ The Ledger · 6 Entries
part two
Entry 01

The friction tax, paid every working day.

Open a chat. Paste in the brand voice. Paste in the client background. Paste in last quarter's positioning doc. Now you can ask your question. Then copy the answer back into Notion, edit it, paste into Slack to share, switch to the CRM, copy the relevant fields, drag a file from Drive. Repeat fifty times a day.

This is the same hours bleeding out two ways at once. Context recreation at the start of each session, then tool switching to move the answer into the surface area where the work actually lives.

Microsoft's 2024 Work Trend Index found knowledge workers spend roughly 57% of their time on email, meetings, and chat. Most "AI usage" today layers on top of that 57%, not in place of it.

A real deployment loads context once (knowledge base, voice, methodology, past engagements) and operates inside the tools your team already uses (Slack, Drive, CRM, Notion) instead of next to them. Both costs collapse together because they were always the same architecture problem.

Entry 02

The overnight hours, surrendered.

Your team logs off at 7pm. Claude does not have to. Firms that have built the architecture run agents overnight: research synthesis from yesterday's discovery calls, first drafts of tomorrow's status reports, CRM updates, Tuesday's deck in skeleton form, Slack triage from channels that hit overnight.

By 6am their senior people walk into a desk that is already two hours ahead. The proposal that would have started Monday morning is half-written by Sunday night. The client research that would have eaten Tuesday is on the partner's screen by Tuesday breakfast.

Your firm starts each morning from zero. Same talent, same tools, fewer hours of useful output produced. Not because your people are slower. Because the other firm bought back the night.

Entry 03

The proposal cycle, still measured in days.

The Dell'Acqua et al. study at BCG put 758 consultants through 18 consulting tasks. The AI-augmented group finished 25% faster with 40% higher quality on tasks within the AI's frontier. That study used a chat window. It did not use skills, memory, or integrations. The performance ceiling is meaningfully higher with real architecture underneath.

Translate to a single workflow your firm runs constantly. Proposal drafting. Pulling case studies, customizing scope, formatting, getting pricing right. The Dell'Acqua frontier covers most of those steps. With skills that know your formats, memory of your wins, and integrations into the drive folder where the case studies live, the cycle compresses from days to hours.

In the BCG study, consultants without AI augmentation tried to use the AI for tasks outside its frontier and produced lower-quality output. The performance asymmetry is now between firms that have deployed AI architecture and firms that just have seats.

The cost of a slow proposal is not the labor inside the proposal. It is the deals that close to the firm that responded first.

Entry 04

The institutional brain, leaking quarterly.

Every consultancy talks about its methodology. Most consultancies cannot find it. It lives in the heads of three senior partners, in a 2019 deck someone made for a workshop, in client onboarding emails nobody indexed, in the implicit choices a great associate makes that the firm has never written down.

When a senior partner leaves, you do not just lose a biller. You lose the chunk of firm IP they were quietly carrying. New consultants ramp slowly because there is no central brain to ramp them against. Junior consultants invent the wheel because nobody told them where the wheel lived.

A real Claude deployment includes a knowledge base your firm's brain actually lives in. Indexed playbooks, decks, past engagements, voice samples, methodology docs, won proposals, lost proposals with notes on why. Every consultant gets a senior partner on demand. The firm stops paying tuition every time someone joins or leaves.

Entry 05

The partner hours, eaten by drafting.

Ask any partner what they wish they had more time for. The answer is some combination of: client development, strategic work, firm-building, recruiting senior talent, thinking. Ask them what they actually spend their evenings on. The answer is some combination of: tweaking slides, rewriting an associate's draft, formatting a proposal, polishing an email a junior should have nailed the first time, drafting the QBR they could not delegate.

The math here is brutal. A founding partner doing $400/hr work in client development creates pipeline. The same partner doing $50/hr work polishing a deck creates exhaustion.

Claude built well does the polish layer at high quality, in the partner's voice, against the firm's frameworks. The partner reviews and ships. Their evenings move from formatting to firm-building.

Entry 06

The positioning gap, present tense.

This is no longer a forecast. Procurement teams at mid-market and enterprise buyers are running AI-delivery questions today. "Describe your firm's AI deployment." "What workflows are AI-augmented?" "What is your AI governance posture?"

Firms that cannot answer credibly do not necessarily lose the deal. They get marked down. The firms that can answer move up the shortlist. The Microsoft Work Trend Index reported that 66% of leaders say they would not hire someone without AI skills, and that posture is now flowing into how those same leaders evaluate their service-provider shortlists.

The gap is not about who has Claude. Almost everyone has Claude. It is about who actually has it deployed in a way they can defend on a buyer call without reaching for a slide.

Reconciliation Note

The total below excludes Entries 02, 04, and 06 by design.

Those three are capacity, ramp, and positioning effects that compound against revenue downstream rather than as direct dollar costs. Stacking them onto the dollar-modeled lines would inflate the headline number and double-count economic exposure.

Entry 01 has been de-duplicated to combine context-recreation and tool-switching into a single hours estimate, since the underlying minutes overlap. The result is a tighter, more defensible floor.

§ The Tally
part three

Summed against a 25-person firm, $275/hr blended.

Conservative side of every assumption. Direct dollar lines only. Capacity, ramp, and positioning effects tracked separately above.

01 Friction tax · combined context + tool-switching, de-duped $297,000 / yr Modeled · MS WTI '24
02 Overnight hours surrendered · capacity gap +47% capacity Tracked separately
03 Proposal cycle drag · 25% faster / 40% higher quality unrealized $308,000 / yr Modeled · BCG/HBS '23
04 Institutional brain leakage · ramp + retention drag ~50% slower ramp Tracked separately
05 Partner hours eaten by drafting · 10 hrs/wk × 4 partners $210,000 / yr Direct labor only
06 Positioning gap · RFP shortlist exposure Lost pipeline Tracked separately
Annual floor · directly modeled in dollars
$815K

The number above only captures Entries 01, 03, and 05. Entries 02, 04, and 06 represent capacity, ramp, and positioning effects that compound downstream and resist clean dollar-modeling. The full BCG/HBS-anchored model on the calculator at cherryplaceai.com/roi extends this to fourteen workflows and shows the 5-year compounded position.

§ The Path Forward
part four

The path from here to deployed.

If the floor above looks like a number you would say yes to recovering, the path is three phases. Discovery first, then build, then a measurement window before you take the keys.

Phase 01 · Discovery
The audit and the architecture brief.
Your tools, workflows, knowledge base. Where the friction tax actually lives. Which three workflows would compound fastest if Claude carried them. What ships first vs. what waits.
Phase 02 · Build
Knowledge base, skills, integrations, agents.
Your firm's brain indexed. Skills for proposal, research, status. Integrations into Gmail, Drive, Slack, CRM. Multiple agents running, including overnight. Your team learns the one workflow that runs everything.
Phase 03 · Run
Instrument, measure, hand the keys over.
Heavy on instrumentation. Recovered hours measured against the model. If the gap is more than 30% in either direction, scope adjusts. After that, you own it. Documented, runbook-shipped, no vendor lock-in.
§ Methodology & Sources
part five

Where the numbers come from.

  • External · Peer-reviewed
    Dell'Acqua et al., 2023. 758 BCG consultants. GPT-4 augmented consultants completed 12% more tasks, 25% faster, with 40% higher quality on tasks within the AI's capability frontier. Strongest published benchmark for consulting-specific AI productivity. Anchors Entry 03.
  • External · Survey
    31,000 knowledge workers across 31 countries. 57% of working time on email, meetings, chat. 66% of leaders would not hire without AI skills. Anchors Entries 01 and 06.
  • Operator-side · clearly labeled
    Cherry Place AI deployment data (operator)
    Operator-modeled estimates, drawn from architecture work and live deployment instrumentation. Clearly distinguished above with the rust-tagged (operator) markers. Used for capacity, ramp, and overnight-execution figures (Entries 02, 04). Not blended into the same authority tier as BCG/HBS.

The math, in one block.

recovery = consultants × billable hrs/yr (1456) × workflow time-share × AI offload depth × billable rate × 0.85 realization × scenario capture (0.6× / 1.0× / 1.4×)
workflow time-share
% of consultant-week the workflow consumes, benchmarked to BCG/HBS task taxonomy.
AI offload depth
% of the workflow Claude can credibly carry given current capability and integration architecture.
realization
0.85 haircut. Recovered hours are not all immediately billable, the conservative discount holds the model honest.
scenario capture
Conservative · Standard · Aggressive bands. 0.6× / 1.0× / 1.4×. The figure tally above runs the Conservative band.
de-duplication
Entry 01 combines context-recreation and tool-switching into a single hours estimate to avoid double-counting overlapping minutes. The combined floor (6–12 hrs/consultant/wk) sits below the sum of the two original ranges.
§ The Next Step
part six

Walk this through against your firm.

Thirty minutes on screen. Your tools, your stack, your three highest-leverage workflows. We will scope what would actually ship and what it would change. No deck, no demo loop.

Book the 30-min walkthrough
If after thirty minutes it isn't a fit, you've still got that time with someone who builds these for a living.