Blog — Tiny Lab

You open ChatGPT on your phone, paste in a bit of context, get an answer, then paste it back into whatever you were doing. It works. It also resets every time. That is calculator mode.

Ask a question, get an answer, lose the context. Repeat all day.

Calculator mode shows up everywhere. Email rewrite. Calendar planning. Meeting summaries. A better paragraph. A faster response. It is personal productivity, and it is real value.

It still leaves a team with the same old problem. Work gets done, yet the team does not get sharper together.

We have been running a different experiment.

We run OpenClaw inside Slack, in shared channels, where our team already works. The bot can see the same conversations we see. It holds context across weeks. It operates across many channels with different jobs, and it behaves differently in each one because the work is different in each one.

The big insight has been simple. Where the AI lives matters more than which model you picked.

A small moment that keeps repeating

A few weeks ago, we hit one of those questions that quietly burns time.

What did we decide on pricing again?

You can feel the usual pattern immediately. Someone remembers part of it. Someone else remembers it differently. People scroll. The discussion restarts. Even when the team lands in the same place, it still paid the tax.

In our shared channel, the bot pulled up the earlier decision, the reasoning behind it, and the math that supported it. It took seconds. The conversation ended. Work continued.

That moment sounds small. It changes how a team moves. Decisions become artifacts the team can reuse. Calculator mode cannot give you that, because calculator mode forgets.

What changes when the bot works in public

Private AI gives you a private win. It helps you produce.

Team AI creates a shared surface. It helps the team coordinate.

That sounds like semantics until you look at what actually breaks teams.

Execution breaks less often than coordination. People ship good work in parallel, then discover they were building different versions of the future. The fix shows up as meetings, rewrites, and a slow bleed of momentum.

A shared channel changes the default. Work stays visible while it is still soft. A teammate can steer it before it hardens. The bot can surface context before the team re-litigates. A decision can keep its why attached, instead of turning into folklore.

Over time, three effects show up again and again.

One: Shared understanding forms faster

The simplest way to describe it is that everyone reads the same thread.

When the bot posts research or a recommendation in a shared channel, the whole team sees the same inputs and the same conclusion. There are fewer moments where five people carry five slightly different interpretations of reality. That reduces a lot of invisible friction.

It also changes what people ask. Instead of asking for answers, people ask for better questions. The team starts steering together.

Two: Context compounds instead of evaporating

A lot of work gets repeated because the reason disappeared.

Teams remember that a decision was made. They forget why it was made, what constraint forced it and the tradeoff that was accepted. Then the discussion restarts from scratch.

A shared channel keeps the why near the decision. A team bot can pull it forward later because it was present when the decision happened.

That becomes team memory. The kind that works when nobody remembers to document anything.

Three: Coordination gets better without extra meetings

A personal assistant aims at completion. A team agent can aim at coordination.

It can notice when two threads are converging on the same problem. Surface a missing dependency early. Ask the one clarifying question that prevents a week of rework.

This is the least glamorous part of the story, and it is where the leverage lives. Visibility enables small corrections. Small corrections prevent big corrections.

Why inbox productivity feels different

Email and calendars are individual systems. Even when multiple people are involved, the unit of work still revolves around a person.

Slack channels revolve around shared attention.

When you put AI into email, it makes the person faster at email. The team still has to coordinate elsewhere, and the why still gets trapped in private threads.

When you put AI into a shared Slack channel, it can act like a team member. It can participate where decisions are being made. Remember those decisions because it saw them happen. Answer status questions without being briefed again.

All because the work did not happen in a separate window. The location changes what the AI can be.

Calculator mode versus team member mode

Calculator mode has a shape. Separate window and thus separate memory. You start over every time.

Team member mode has a different shape. Where the team works. Same threads the team reads. Context the team shares.

Once you feel it, the distinction gets obvious. In calculator mode, you keep paying a tax. In team member mode, the tax drops.

This is why we have been treating this like a lab. We try a behavior. Watch how people use it, look for the failure mode, then change one variable and run it again.

Some changes matter a lot. Response time matters because conversational rhythm matters. Memory matters because teams revisit decisions. Visibility matters because coordination requires shared attention.

The goal is not a bot that sounds smart. We want bots that make a team work better together.

The real point

AI is getting better at tasks. That is obvious.

The bigger shift is where the work happens.

If your AI lives in a separate window, it will keep behaving like a calculator. Useful, fast, and forgetful.

Put the AI where your team actually works, and it can become something else. A shared brain that keeps decisions attached to their reasons. A coordination aid that reduces rework. A teammate that makes the work visible enough to steer.

That is the experiment we are running. And it keeps paying off in small moments that compound.

Pair programming has 40 years of research proving it works. Teams completed tasks 40% faster with defect rates one thousandth of solo programmers. The findings were consistent across dozens of studies.

Almost nobody does it.

The reason is obvious. Pair programming doubles headcount for every line of code. In a world where developer time is expensive and managers optimize for shipping velocity, putting two engineers on the same problem feels wasteful.

But what if the second pair wasn't expensive? What if collaborative coding didn't require doubling your engineering team?

That's what happened when AI entered the picture. Except the collaboration pattern extends far beyond code.

My friend Smit Patel and I were on a call when he noticed what I was doing. I was working with an AI across multiple Slack channels, building research, drafting content, analyzing patterns. We were collaborating. Iterating. Building things neither of us could build alone.

"That sounds like pair prompting," he said. "Never heard of that before."

The term stuck because it named something real. Pair programming worked because of distributed cognition, spreading the mental load across two people with different strengths. Pair prompting works the same way, except one participant has perfect recall and infinite patience, while the other has judgment and taste.

Pair prompting. It's pair programming for everyone.

What Pair Programming Taught Us

40 years of pair programming research has consistent, striking findings.

A 1998 field study by John Nosek tracked experienced programmers working on challenging problems in their own environments. Every single team outperformed individual programmers. Teams completed tasks 40% faster and enjoyed the process more.

Laurie Williams' dissertation research in 2000 found pairs took 15% more developer hours but produced code with 15% fewer bugs. The quality gain more than compensated for the time investment.

The most dramatic finding came from a 1996 management study by Robert Jensen. Pairs produced 175 lines per person-month versus an individual average of 77. Their error rate was three orders of magnitude lower. One thousandth the defect rate.

The Mechanics of Collaborative Cognition

Pair programming works because of distributed cognition. The way thinking spreads across people and tools rather than staying locked in individual heads.

When you're programming alone, you hold the problem, the solution approach, the implementation details, and the edge cases all in your head at once. Context switches destroy that fragile state. A Slack notification, a meeting, even getting coffee. Each one dumps your mental model and forces a rebuild.

Pairs distribute the cognitive load. One person holds the strategic view while the other handles tactical execution. Driver focuses on syntax and immediate implementation. Navigator thinks about design, edge cases, and what comes next. They switch fluidly as the problem demands.

The documented benefits show up in predictable patterns.

Design quality improves when the navigator catches architectural problems before they ship.
Bugs get caught at creation time instead of in QA or production.
Knowledge transfer happens automatically without formal documentation.
Pairs reduce staffing risk because knowledge lives in two heads instead of one.
People enjoy pair programming more than solo work — statistically significant satisfaction gains.

When AI Joins the Pair

This is where pair prompting splits from pair programming.

Traditional pairs split into driver and navigator. Human-AI pairs don't work that way. The human isn't writing code while AI reviews architecture. The human prompts, the AI generates, and both iterate on output neither fully controls.

The collaboration happens at the boundary between intent and execution.

In one channel, we built a full social media strategy across multiple threads. Voice analysis from fifty plus posts, a content calendar, a story bank, headline iteration, and an outreach database. The AI pulled patterns from the writing, then generated options we refined until they felt right.

In another channel, a founder needed research on 106 specific companies and 170 plus founders. The AI built structured data, identified patterns, found contact information, and summarized in ways a human would spend days constructing. The founder directed the research and validated findings.

AI handles the cognitive load that crushes humans. Remembering every detail from six months ago, cross-referencing fifteen sources simultaneously, maintaining perfect consistency across parallel workstreams.

What We've Learned Running Pair Prompting in Practice

Context is the shared workspace. In pair programming, both people see the same code on the same screen. In pair prompting, the Slack channel becomes the shared workspace. Every message builds context both participants can reference. The AI remembers every detail. The human tracks the strategic thread.

Prompting is collaborative, not transactional. Early AI usage treated prompts like search queries. Type a question, get an answer, move on. Pair prompting treats prompts like design conversations. The value isn't in the first output. It's in the collaborative refinement.

Sub-agent spawning distributes work across time. When the AI faces a complex research task, it spawns sub-agents to handle parallel workstreams. Research 30 companies while analyzing competitor positioning while building a contact database. Each sub-agent reports back when done. The main instance synthesizes.

The humans set the quality bar. AI will generate until you stop it. The human decides when output is good enough, when it needs another iteration, when the approach is wrong entirely. The AI has no internal quality metric. It produces probable next tokens. The human provides the termination condition.

What This Means for How We Work

In pair programming, the unit of work is the pair. You need two engineers. The collaboration happens in real time. The output is code.

In pair prompting, the unit of work is the human directing AI execution across multiple workstreams. You need one person and AI assistance. The collaboration can be async. The output is anything the AI can generate and the human can evaluate.

This shifts the constraint from "do I have enough people" to "do I have enough judgment."

A person can build an entire social media strategy without hiring a content strategist because the AI handles execution while they handle direction and quality control. The work that required a team now requires one person with clear taste and an AI that can generate at scale.

The three-way collaboration creates capabilities that were not available before because the distribution of cognitive load and execution capacity shifts what one person can accomplish.

The Infrastructure Question

We built this on OpenClaw because we needed specific capabilities commercial AI doesn't provide.

Every conversation builds on previous conversations. The AI remembers what we decided six threads ago. Social media strategy, company research, content work. It all accumulates as searchable, referenceable context that persists across channels.

Multiple AI instances have different skills and different context. They don't interfere with each other. Running specialized instances in parallel used to require separate accounts and context juggling. Now it's just separate channels.

We control the interaction model instead of adapting to what ChatGPT or Claude thinks we need. Slack channels beat chat interfaces. Voice diagnostics run before publishing. Memory systems persist across sessions. The workflow shapes the work.

The commercial AI products optimize for one-shot questions from millions of users. Pair prompting needs persistent collaboration with a small number of humans doing complex work over time. The infrastructure requirements point in opposite directions.

Put the bot where the work happens.