How Massively Decomposed Agentic Processes achieve 75% cost reduction with intelligent caching.
Massively Decomposed Agentic Processes (MDAPs) represent a breakthrough in AI system architecture. Instead of relying on a single LLM to solve complex problems, MDAPs break tasks into millions of tiny subtasks, each handled by specialized "microagents."
Recent research (arxiv.org/abs/2511.09030) demonstrated the first MDAP system to solve a 1-million-step task with zero errors.
The Challenge: While MDAPs solve the accuracy problem, they create a massive cost problem. Running even a single task can cost tens of thousands of dollars.
Several properties of MDAPs make them unusually cache-friendly:
- Repeated patterns: Microagents are specialized and focused, so the same subtask patterns recur thousands of times across different workflows.
- Redundant voting: When 3-5 agents vote on the same decision, they make identical LLM calls, which are ideal candidates for immediate cache hits.
- Deterministic operations: Each microagent performs a well-defined atomic operation. Same input → same output, every time.
- Cheap retries: When error correction reruns a failed step, it makes the exact same LLM call, so caching eliminates retry costs.
A Fortune 500 company uses an MDAP system to optimize global supply-chain logistics in real time. The system processes more than 1M decision points per optimization run.
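At that scale the savings arithmetic is simple. A back-of-the-envelope cost model, assuming $0.03 per LLM call and the 75% hit rate implied by the headline (both figures illustrative):

```javascript
// Cost of one optimization run: only cache misses incur LLM spend.
function runCost(steps, costPerCall, hitRate) {
  const misses = steps * (1 - hitRate);
  return misses * costPerCall;
}

const baseline = runCost(1_000_000, 0.03, 0);    // no cache: ~$30,000 per run
const cached   = runCost(1_000_000, 0.03, 0.75); // 75% hits: ~$7,500 per run
```

The cost reduction tracks the hit rate directly, which is why workloads with heavy repetition (voting, retries, recurring subtask patterns) benefit the most.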
// Microagent with caching
class CachedMicroagent {
  constructor(agentType) {
    this.agentType = agentType; // used to namespace cache keys per agent type
  }

  async execute(subtask) {
    // Check the cache first
    const cached = await agentcache.get({
      provider: 'openai',
      model: 'gpt-4',
      messages: subtask.prompt,
      namespace: `mdap-${this.agentType}`
    });

    if (cached.hit) {
      return cached.response; // ~45ms, $0 cost
    }

    // Cache miss: call the LLM
    const response = await openai.chat.completions.create({
      model: 'gpt-4',
      messages: subtask.prompt
    });

    // Store for future use
    await agentcache.set({
      provider: 'openai',
      model: 'gpt-4',
      messages: subtask.prompt,
      response: response.choices[0].message,
      namespace: `mdap-${this.agentType}`
    });

    return response.choices[0].message; // same shape as the cached path
  }
}
// Multi-agent voting with cache
async function multiAgentVote(subtask, agents) {
  const votes = await Promise.all(
    agents.map(agent => agent.execute(subtask))
  );
  // Once one agent's response is cached, the remaining agents hit the cache:
  // 3 agents × 1M steps ≈ 2M cached calls at zero cost
  return consensus(votes);
}
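One caveat: with `Promise.all`, all agents fire before any response has been written to the cache, so identical calls can still miss simultaneously on the very first run. A common remedy is request coalescing ("single-flight"), where concurrent callers share one in-flight promise per key. A sketch (the `singleFlight` helper is illustrative, not part of any named API):

```javascript
// Map from cache key to the in-flight promise for that key.
const inflight = new Map();

// Run fn() at most once per key at a time; concurrent callers with the
// same key await the same promise instead of issuing duplicate LLM calls.
async function singleFlight(key, fn) {
  if (inflight.has(key)) {
    return inflight.get(key); // join the existing in-flight call
  }
  const p = fn().finally(() => inflight.delete(key));
  inflight.set(key, p);
  return p;
}
```

Wrapping each agent's LLM call in `singleFlight(cacheKey, callLLM)` means that even on a cold cache, a 3-agent vote issues a single real request.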
Key Insight: By namespacing cache keys by agent type, we ensure specialized agents benefit from their own cache while allowing cross-workflow reuse.
Join the organizations building million-step AI workflows without million-dollar bills.