PACT: Optimizing Communication in Multi-Agent Systems

Emma Carter

June 8, 2026

original

Multi-agent systems built on large language models often struggle with token waste and context window overflow due to free-form text communication. A new paper introduces PACT, a protocol that compresses agent outputs into concise action-state records. This approach significantly reduces inference costs while preserving critical information, making it suitable for various MAS topologies and offering a pragmatic solution to a common bottleneck.

Multi-agent systems (MAS) are increasingly powered by large language models, yet a subtle but significant bottleneck often gets overlooked: how these agents actually talk to each other. Developers typically default to letting agents communicate in free-form natural language. While this sounds flexible, it can quickly devour token budgets and shared context windows. A recent arXiv paper dives deep into this problem, offering a systematic analysis and proposing a solution called PACT.

The Hidden Costs of Free-Form Agent Chat

The research team began by dissecting five common inter-agent communication strategies. These ranged from passing full outputs to using only key phrases or periodic summaries. They then tested these strategies across two typical MAS topologies: serial and parallel. Interestingly, no single strategy emerged as universally superior. However, a clear pattern did surface: strategies that preserved more action-centric information consistently helped downstream agents perform their tasks more efficiently.

The core issue lies in the inherent verbosity of natural language. When Agent A tells Agent B, "I've just finished analyzing the user's input and determined the main intent is to query the weather," a good chunk of that message is redundant. If every agent communicates with such inefficiency, the context window rapidly fills with unnecessary data, and inference costs skyrocket.

PACT: From Chat to Structured State Updates

The paper introduces PACT (Protocolized Action-state Communication and Transmission), essentially a structured compression scheme. It reframes inter-agent communication as an update to a shared system state. Each agent's output is 'projected' into a compact action-state record before being added to the shared history. This record typically includes what the current agent did and how the system state changed as a result.

Think of it like a version control system committing only changes, rather than copying the entire file every time. PACT doesn't just truncate or summarize; it specifically retains the most critical parts for downstream computation, discarding descriptive language that doesn't impact decision-making. Experiments showed that PACT consistently reduced token consumption and response latency across various MAS configurations, all while maintaining task success rates.

Practical Implications for Developers

This research highlights a previously underestimated lever in multi-agent system optimization: the communication protocol itself. Many developers meticulously fine-tune prompts and models but rarely scrutinize what's actually being transmitted between agents. PACT offers a reusable framework – you don't necessarily need to adopt its exact format, but the principle of "compressing irrelevant information while preserving action core" is broadly applicable.

For teams building complex agent chains: Expect potential token cost reductions of 30% to 50%, significantly mitigating context window overflow issues.
For researchers: PACT provides a quantifiable framework for analyzing MAS communication efficiency, opening doors for future exploration into more adaptive compression strategies.
For independent developers: Even without implementing the full protocol, simply adding a "send only actions" filter to agent outputs can yield noticeable improvements.

Limitations and Future Directions

The authors acknowledge that PACT might over-compress in tasks heavily reliant on nuanced natural language understanding, such as open-ended role-playing, potentially losing stylistic or emotional information. Furthermore, automatically generating the optimal action-state record format still requires system-specific tuning. However, this work undeniably brings the question of "what agents should say to each other" to the forefront – it's not an afterthought.

If you're building multi-agent applications, a good starting point might be to log the raw output of each agent during a conversation and see how much of it truly gets used by subsequent agents. The answer might surprise you.

multi-agent systemscommunication protocolPACTlarge language modelsinference costaction-state communicationcontext windowMAS efficiencyLLM optimization

Comments

No comments yet

Be the first to comment

Explore More

Similar Tools

Completo AI

Completo AI is a next-generation productivity tool that leverages AI to automatically analyze project goals and generate structured task lists. Aimed at project managers, freelancers, and small teams, it seeks to eliminate the tedious manual steps of task breakdown, boosting planning efficiency significantly. It's designed to streamline the initial project setup, allowing users to move from concept to actionable plan in seconds.

WeiClaw

WeiClaw is a smart hardware device that connects to Agent-enabled PCs, intelligently managing sleep and wake cycles. By monitoring Agent status and taking over message channels, it automates energy saving and remote management, allowing PCs to sleep when idle and wake on demand. Ideal for individuals and teams looking to cut power consumption and extend hardware lifespan.

BotIntelli

BotIntelli is an enterprise-grade AI operating system that empowers teams to build, deploy, and manage intelligent agent workflows without writing a single line of code. It unifies access to over 20 major large language models like GPT-4, Claude, and Gemini, streamlining complex business process automation. It's ideal for organizations looking to lower the barrier to AI adoption and boost operational efficiency.

Nodey

Nodey is an iOS companion app for n8n, bringing workflow management to your iPhone. It allows real-time monitoring of workflow status, AI-powered diagnostics for failures, natural language workflow creation, and unique NFC/geofence triggers. It's a lightweight mobile tool designed for existing n8n users.

Valkoma AI

Valkoma AI is a versatile platform built on Google Gemini, integrating an AI website generator, chatbot builder, social media post generator, prompt optimization auditor, and autonomous AI agent system builder. It allows users to create 3D immersive websites with a single sentence, deploy customer service bots rapidly, automate LinkedIn and Instagram content, and design automated workflows—all without coding. It's designed to boost efficiency for entrepreneurs and marketing teams.

AutomationMart

AutomationMart is a marketplace offering over 500 pre-built workflow templates for Make.com, n8n, and Zapier. Designed for non-technical users, these ready-to-use blueprints eliminate the need for complex configuration, allowing for rapid automation setup. It's a pragmatic solution for anyone looking to quickly deploy automated processes without starting from scratch.

Open-source Alternatives

agent-device: CLI for AI Agent Mobile Control

agent-device is an open-source command-line tool that empowers AI agents to directly control iOS and Android devices via a CLI interface. Built with TypeScript, it supports essential operations like taps, swipes, and text input, making it easy to integrate into automation workflows. It's ideal for developers and testers who need AI to interact with real mobile devices.

agent-sandbox: Kubernetes-Native AI Agent Management

agent-sandbox is an open-source project from Kubernetes SIG, designed to manage isolated, stateful, and singleton AI agent runtimes. Developed in Go, it offers declarative APIs and CRDs, simplifying agent deployment and operations. It's ideal for AI applications requiring long-running, persistent state, and has garnered over 3100 stars on GitHub.

Omnigent: Unify Your AI Agents with a Meta-Framework

Omnigent is an open-source meta-layer framework that lets you seamlessly switch or combine AI agents like Claude Code, Codex, and Pi without rewriting integration code. It offers policy control, sandbox isolation, and cross-device real-time collaboration. This Python project, boasting 2562 stars, is ideal for development teams needing multi-agent coordination and streamlined AI workflows.

agent-squad: Orchestrate Multiple AI Agents with Swift

agent-squad is an open-source Swift framework designed for managing multiple AI agents and complex conversational flows. It offers a flexible architecture for orchestrating multi-agent collaboration, task distribution, and dialogue management, making it ideal for building intelligent assistants, customer service systems, and automated workflows.

mindshub: Swap AI Models Without Rewriting Code

mindshub, an open-source model hub from MindsDB, lets you hot-swap AI models like GPT, Llama, or custom-trained solutions without touching your core business logic. It provides a unified interface, making model switching as simple as changing a configuration line. For teams prioritizing flexibility and future-proofing their AI applications, mindshub offers a pragmatic solution to a common development headache.

Activepieces: Open-Source AI Workflow Automation

Activepieces is an open-source workflow automation platform designed for AI agents and intelligent workflows. It integrates with over 400 Model Context Protocol (MCP) servers, allowing for visual orchestration of AI-driven processes. Built with TypeScript, it empowers developers and teams to quickly build sophisticated automations, significantly lowering the barrier to entry for AI application development.