fix: add versioned handshake with capability negotiation to gateway protocol by praisonai-triage-agent[bot] · Pull Request #2154 · MervinPraison/PraisonAI

praisonai-triage-agent · 2026-06-22T09:17:43Z

Summary

Adds a versioned handshake with capability negotiation to the WebSocket gateway protocol, enabling graceful protocol evolution and structured connect errors.

Changes

Core SDK (praisonaiagents/gateway/protocols.py)

Added GATEWAY_PROTOCOL_VERSION and MIN_CLIENT_PROTOCOL_VERSION constants
Added ConnectErrorCode enum with structured error codes
Added HelloParams, HelloResult, HelloError data structures
Added new event types: HELLO, HELLO_OK, HELLO_ERROR

Wrapper (praisonai/gateway/server.py)

Implemented hello message handler with protocol version negotiation
Capability-based feature discovery (streaming, presence, ack)
Policy limits communication (max_payload, heartbeat_ms)
Structured error responses with actionable next steps
Maintains full backward compatibility with existing join/joined messages

Benefits

Graceful protocol evolution: Clients and servers can negotiate compatible versions
Structured errors: Clients receive actionable error codes instead of free-text
Feature discovery: Clients know what methods/events are available
Policy communication: Clients can self-configure based on server limits
Backward compatibility: Existing clients continue to work

Summary by CodeRabbit

New Features
- Added gateway protocol versioning for improved system compatibility and version negotiation
- Introduced structured error codes providing clearer diagnostics for connection failures
- Implemented enhanced handshake flow with session management and missed event replay capabilities
- Maintained backward compatibility with existing connection methods

…rotocol (fixes #2151) - Add protocol version constants and negotiation - Add structured connect error codes (auth_required, protocol_unsupported, etc.) - Add HelloParams, HelloResult, HelloError data structures - Implement hello/hello_ok/hello_error message exchange - Support capability-based feature discovery - Communicate policy limits (max_payload, heartbeat_ms) - Maintain backward compatibility with existing join/joined messages Co-authored-by: MervinPraison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-06-22T09:17:53Z

@coderabbitai review

MervinPraison · 2026-06-22T09:17:53Z

/review

qodo-code-review · 2026-06-22T09:17:58Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

coderabbitai · 2026-06-22T09:18:01Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-06-22T09:18:09Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 4f4ba146-697a-47df-9566-c6b06fc9dd13

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds protocol version constants (GATEWAY_PROTOCOL_VERSION, MIN_CLIENT_PROTOCOL_VERSION), a ConnectErrorCode enum, and three handshake dataclasses (HelloParams, HelloResult, HelloError) to protocols.py. Implements a corresponding hello/hello_ok/hello_error message handler in gateway/server.py with version negotiation, session resume, and event replay, while keeping the legacy join path.

Changes

Gateway Versioned Handshake

Layer / File(s)	Summary
Protocol contract: constants, error codes, and handshake types `src/praisonai-agents/praisonaiagents/gateway/protocols.py`	Adds `GATEWAY_PROTOCOL_VERSION = 1` and `MIN_CLIENT_PROTOCOL_VERSION = 1`, `ConnectErrorCode` string enum with five members, `HELLO`/`HELLO_OK`/`HELLO_ERROR` values to `EventType`, and `HelloParams`, `HelloResult`, `HelloError` dataclasses representing the full versioned handshake wire contract.
Server handshake handler and backward-compatible join path `src/praisonai/praisonai/gateway/server.py`	Imports new protocol types, adds a `hello` branch to `_handle_client_message` that validates agent existence, negotiates protocol version (sending `hello_error` with `protocol_unsupported` or `agent_not_found` on failure), resumes or creates sessions, replays missed events via `since_cursor`, and sends `hello_ok` with features and policy; preserves the legacy `join` path with an explicit backward-compatibility comment.

Sequence Diagram(s)

sequenceDiagram
  participant Client
  participant GatewayServer
  participant SessionStore

  rect rgba(100, 149, 237, 0.5)
    Note over Client,SessionStore: New versioned hello path
    Client->>GatewayServer: hello {agent_id, protocol{min,max}, caps, session_id?, since_cursor?}
    GatewayServer->>GatewayServer: validate agent_id and negotiate protocol version
    alt validation failure
      GatewayServer->>Client: hello_error {code, message, next_action?}
    else success
      GatewayServer->>SessionStore: resume_or_create_session(agent_id, session_id, since_cursor)
      SessionStore-->>GatewayServer: session, replay_events[]
      GatewayServer->>Client: hello_ok {protocol, features, policy, session_id, resumed, cursor}
      loop missed events
        GatewayServer->>Client: replay {event}
      end
    end
  end

  rect rgba(144, 238, 144, 0.5)
    Note over Client,GatewayServer: Legacy join path (backward compat)
    Client->>GatewayServer: join {agent_id, session_id?, since?}
    GatewayServer->>Client: joined {session_id, agent_id, resumed, cursor}
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related issues

#2151: This PR directly implements the versioned handshake, ConnectErrorCode enum, HelloParams/HelloResult/HelloError dataclasses, and the hello/hello_ok/hello_error server negotiation flow described in that issue.
#2130: The PR implements protocol version negotiation and handshake primitives that address the version negotiation concern raised in this issue.

Poem

🐇 A handshake at last, no more guessing the door,
hello goes out, hello_ok replies,
the version is set, no surprises in store,
old join still remains — no old client cries.
Hop, hop — the gateway waves a versioned paw! 🐾

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately and concisely summarizes the main change: adding a versioned handshake with capability negotiation to the gateway protocol. It directly reflects the primary objective of the PR.
Linked Issues check	✅ Passed	The PR successfully implements all coding requirements from issue `#2151`: adds protocol version constants, ConnectErrorCode enum with specified error types, HelloParams/HelloResult/HelloError dataclasses, handshake event types, and server-side negotiation logic with backward compatibility.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to implementing versioned handshake and capability negotiation as specified in issue `#2151`. No unrelated modifications were introduced.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch claude/issue-2151-20260622-0911

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-06-22T09:18:19Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

greptile-apps · 2026-06-22T09:21:11Z

Greptile Summary

This PR introduces a versioned handshake (hello/hello_ok/hello_error) to the WebSocket gateway, enabling protocol version negotiation, structured error codes, capability-based feature discovery, and policy communication to clients, while keeping the legacy join/joined path intact for backward compatibility.

protocols.py: Adds GATEWAY_PROTOCOL_VERSION/MIN_CLIENT_PROTOCOL_VERSION constants, the ConnectErrorCode enum, HelloParams/HelloResult/HelloError dataclasses, and three new EventType variants — purely additive data-model changes.
server.py: Implements the hello handler with version negotiation, capability gating (streaming/presence/ack events), config-sourced policy limits, session resumption with event replay, and structured error responses; two input-validation gaps exist (see inline comments).

Confidence Score: 3/5

The hello handshake handler has two input-validation gaps that allow malformed client messages to crash the connection handler or squatt arbitrary session IDs.

Sending "protocol_min": null causes an unhandled TypeError that drops the connection silently without any error response to the client, mirroring the caps: null bug that was already fixed in this same handler. Additionally, supplying an unknown session_id creates a fresh session under that attacker-supplied ID in _client_sessions, potentially reserving it before a legitimate owner reconnects. Both issues are on the hot path of the new handshake flow.

src/praisonai/praisonai/gateway/server.py — specifically the protocol_min/protocol_max extraction block and the session creation fallback in resume_or_create_session.

Important Files Changed

Filename	Overview
src/praisonai-agents/praisonaiagents/gateway/protocols.py	Adds GATEWAY_PROTOCOL_VERSION/MIN_CLIENT_PROTOCOL_VERSION constants, ConnectErrorCode enum, HelloParams/HelloResult/HelloError dataclasses, and HELLO/HELLO_OK/HELLO_ERROR EventType variants — clean data-model additions with no logic.
src/praisonai/praisonai/gateway/server.py	Implements the hello handshake handler; has a null/non-integer guard gap for protocol_min/protocol_max (causing unhandled TypeError on null values) and a session-squatting window when an unknown session_id is supplied.

Sequence Diagram

%%{init: {'theme': 'neutral'}}%%
sequenceDiagram
    participant C as Client
    participant GW as WebSocketGateway
    participant S as Session Store

    C->>GW: "hello {agent_id, protocol_min, protocol_max, capabilities, session_id?, since?}"
    GW->>GW: Validate agent_id exists
    alt Agent not found
        GW-->>C: "hello_error {code: agent_not_found}"
    end
    GW->>GW: Negotiate protocol version
    alt "client_max < MIN_CLIENT_PROTOCOL_VERSION"
        GW-->>C: "hello_error {code: protocol_unsupported, next: upgrade_client}"
    else "client_min > GATEWAY_PROTOCOL_VERSION"
        GW-->>C: "hello_error {code: protocol_unsupported, next: use_older_client}"
    end
    GW->>GW: Sanitize capabilities (null guard)
    GW->>S: resume_or_create_session(session_id, agent_id, client_id, since_cursor)
    S-->>GW: (session, replay_events)
    GW->>GW: "Validate session.agent_id == agent_id"
    alt Mismatch
        GW-->>C: "hello_error {code: auth_unauthorized}"
    end
    GW->>GW: Rebind session._client_id, register _client_sessions
    GW->>GW: Build features and policy
    GW-->>C: "hello_ok {protocol, features, policy, session_id, resumed, cursor}"
    loop Replay missed events
        GW-->>C: "replay {event}"
    end
    Note over C,GW: Legacy path still supported
    C->>GW: "join {agent_id, session_id?, since?}"
    GW-->>C: "joined {session_id, agent_id, resumed, cursor}"

%%{init: {'theme': 'base', 'themeVariables': {"darkMode": true, "background": "#0d1117", "primaryColor": "#21262d", "primaryTextColor": "#e6edf3", "primaryBorderColor": "#8b949e", "lineColor": "#8b949e", "textColor": "#e6edf3", "edgeLabelBackground": "#161b22", "actorBkg": "#21262d", "actorBorder": "#8b949e", "actorTextColor": "#e6edf3", "actorLineColor": "#8b949e", "signalColor": "#8b949e", "signalTextColor": "#e6edf3", "noteBkgColor": "#373320", "noteBorderColor": "#d4a72c", "noteTextColor": "#f0e6c0", "labelBoxBkgColor": "#21262d", "labelBoxBorderColor": "#8b949e", "labelTextColor": "#e6edf3", "loopTextColor": "#e6edf3", "activationBkgColor": "#30363d", "activationBorderColor": "#8b949e"}}}%%
sequenceDiagram
    participant C as Client
    participant GW as WebSocketGateway
    participant S as Session Store

    C->>GW: "hello {agent_id, protocol_min, protocol_max, capabilities, session_id?, since?}"
    GW->>GW: Validate agent_id exists
    alt Agent not found
        GW-->>C: "hello_error {code: agent_not_found}"
    end
    GW->>GW: Negotiate protocol version
    alt "client_max < MIN_CLIENT_PROTOCOL_VERSION"
        GW-->>C: "hello_error {code: protocol_unsupported, next: upgrade_client}"
    else "client_min > GATEWAY_PROTOCOL_VERSION"
        GW-->>C: "hello_error {code: protocol_unsupported, next: use_older_client}"
    end
    GW->>GW: Sanitize capabilities (null guard)
    GW->>S: resume_or_create_session(session_id, agent_id, client_id, since_cursor)
    S-->>GW: (session, replay_events)
    GW->>GW: "Validate session.agent_id == agent_id"
    alt Mismatch
        GW-->>C: "hello_error {code: auth_unauthorized}"
    end
    GW->>GW: Rebind session._client_id, register _client_sessions
    GW->>GW: Build features and policy
    GW-->>C: "hello_ok {protocol, features, policy, session_id, resumed, cursor}"
    loop Replay missed events
        GW-->>C: "replay {event}"
    end
    Note over C,GW: Legacy path still supported
    C->>GW: "join {agent_id, session_id?, since?}"
    GW-->>C: "joined {session_id, agent_id, resumed, cursor}"

_{Reviews (2): Last reviewed commit: "fix: address critical issues in hello ha..." | Re-trigger Greptile}

coderabbitai

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/praisonai/praisonai/gateway/server.py`:
- Around line 1064-1069: The heartbeat_ms value in the policy dictionary is
hard-coded to 15000 instead of using the heartbeat_interval value loaded from
the gateway configuration. Replace the hard-coded 15000 value with the
heartbeat_interval from the gateway config so that the policy reflects the
actual configured heartbeat interval and clients negotiate the correct interval
instead of defaulting to 15 seconds.
- Around line 1037-1063: The features list in the code starting around line 1039
is advertising methods and events that are either not implemented or not
negotiated with the client. Remove the "abort" method from the methods list
unless it has a corresponding handler implemented, and move the streaming events
(TOKEN_STREAM, TOOL_CALL_STREAM, STREAM_END, MESSAGE, MESSAGE_ACK, ERROR) from
the base features dictionary into a conditional block that only adds them if the
client has advertised streaming capability in client_caps, similar to how the
presence and ack features are conditionally added. This ensures that
hello_ok.features only advertises methods and events that are both implemented
and negotiated with the client.
- Around line 981-1024: The wire parser is reading incorrect field names from
the incoming data, causing clients that properly serialize HelloParams to fall
back to protocol v1 with no capabilities. Update the parsing logic around line
982-983 to read protocol_min and protocol_max as direct fields (not nested under
protocol), and update line 1023 to read capabilities instead of caps, so that
client data aligned with the HelloParams contract is correctly parsed.
- Around line 1025-1035: The resumed session returned from
resume_or_create_session() may belong to a different agent_id or contain a stale
client_id, which causes incorrect message routing later since responses use
session.client_id. Before storing the session in self._client_sessions, validate
that the returned session's agent_id matches the requested agent_id parameter,
and rebind the current client_id to the session to ensure the correct websocket
receives the replayed messages and subsequent responses.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: a0e9ba15-15fa-47dd-8498-8c6387b7d0d8

📥 Commits

Reviewing files that changed from the base of the PR and between 5f39d66 and 0a6872e.

📒 Files selected for processing (2)

src/praisonai-agents/praisonaiagents/gateway/protocols.py
src/praisonai/praisonai/gateway/server.py

coderabbitai · 2026-06-22T09:24:35Z

+            # Parse protocol version from client
+            protocol_info = data.get("protocol", {})
+            if isinstance(protocol_info, dict):
+                client_min = protocol_info.get("min", 1)
+                client_max = protocol_info.get("max", 1)
+            else:
+                # Backwards compatibility: treat missing protocol as v1
+                client_min = client_max = 1
+
+            # Negotiate protocol version
+            if client_max < MIN_CLIENT_PROTOCOL_VERSION:
+                error = HelloError(
+                    code=ConnectErrorCode.PROTOCOL_UNSUPPORTED,
+                    message=f"Protocol version {client_max} is too old, minimum required is {MIN_CLIENT_PROTOCOL_VERSION}",
+                    next_action="upgrade_client"
+                )
+                await self._send_to_client(client_id, {
+                    "type": "hello_error",
+                    "code": error.code.value,
+                    "message": error.message,
+                    "next": error.next_action,
+                })
+                return
+
+            if client_min > GATEWAY_PROTOCOL_VERSION:
+                error = HelloError(
+                    code=ConnectErrorCode.PROTOCOL_UNSUPPORTED,
+                    message=f"Protocol version {client_min} is too new, server supports up to {GATEWAY_PROTOCOL_VERSION}",
+                    next_action="use_older_client"
+                )
+                await self._send_to_client(client_id, {
+                    "type": "hello_error",
+                    "code": error.code.value,
+                    "message": error.message,
+                    "next": error.next_action,
+                })
+                return
+
+            # Select the highest mutually supported version
+            negotiated_version = min(client_max, GATEWAY_PROTOCOL_VERSION)
+
+            # Get client capabilities
+            client_caps = data.get("caps", [])
+


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Align the wire parser with HelloParams.

Line 982 reads protocol.{min,max} and Line 1023 reads caps, but the shared contract defines protocol_min, protocol_max, and capabilities. SDK clients serializing HelloParams will be treated as protocol v1 with no capabilities.

🔧 Proposed contract-compatible parser

- protocol_info = data.get("protocol", {}) - if isinstance(protocol_info, dict): - client_min = protocol_info.get("min", 1) - client_max = protocol_info.get("max", 1) - else: - # Backwards compatibility: treat missing protocol as v1 - client_min = client_max = 1 + protocol_info = data.get("protocol") + if "protocol_min" in data or "protocol_max" in data: + client_min = data.get("protocol_min", 1) + client_max = data.get("protocol_max", 1) + elif isinstance(protocol_info, dict): + client_min = protocol_info.get("min", 1) + client_max = protocol_info.get("max", 1) + else: + # Backwards compatibility: treat missing protocol as v1 + client_min = client_max = 1 + + try: + client_min = int(client_min) + client_max = int(client_max) + except (TypeError, ValueError): + error = HelloError( + code=ConnectErrorCode.PROTOCOL_UNSUPPORTED, + message="Protocol versions must be integers", + next_action="send_integer_protocol_min_and_protocol_max", + ) + await self._send_to_client(client_id, { + "type": "hello_error", + "code": error.code.value, + "message": error.message, + "next": error.next_action, + }) + return + + if client_min > client_max: + error = HelloError( + code=ConnectErrorCode.PROTOCOL_UNSUPPORTED, + message="protocol_min cannot be greater than protocol_max", + next_action="fix_protocol_version_range", + ) + await self._send_to_client(client_id, { + "type": "hello_error", + "code": error.code.value, + "message": error.message, + "next": error.next_action, + }) + return @@ - client_caps = data.get("caps", []) + client_caps = data.get("capabilities", data.get("caps", [])) + if not isinstance(client_caps, list): + client_caps = []

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/gateway/server.py` around lines 981 - 1024, The wire parser is reading incorrect field names from the incoming data, causing clients that properly serialize HelloParams to fall back to protocol v1 with no capabilities. Update the parsing logic around line 982-983 to read protocol_min and protocol_max as direct fields (not nested under protocol), and update line 1023 to read capabilities instead of caps, so that client data aligned with the HelloParams contract is correctly parsed.

coderabbitai · 2026-06-22T09:24:35Z

+            # Resume or create session
+            session_id = data.get("session_id")
+            since_cursor = data.get("since")
+            session, replay_events = self.resume_or_create_session(
+                session_id=session_id,
+                agent_id=agent_id,
+                client_id=client_id,
+                since_cursor=since_cursor,
+            )
+
+            self._client_sessions[client_id] = session.session_id


⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Validate and rebind resumed sessions before attaching this client.

resume_or_create_session() may return an existing session for a different agent_id and with an old client_id; after Line 1035, later responses use session.client_id, so resumed messages can be routed to the wrong websocket or expose another agent session.

🔒 Suggested guard before replay/attachment

session, replay_events = self.resume_or_create_session( session_id=session_id, agent_id=agent_id, client_id=client_id, since_cursor=since_cursor, ) + + if session.agent_id != agent_id: + error = HelloError( + code=ConnectErrorCode.AUTH_UNAUTHORIZED, + message="Session does not belong to the requested agent", + next_action="start_new_session", + ) + await self._send_to_client(client_id, { + "type": "hello_error", + "code": error.code.value, + "message": error.message, + "next": error.next_action, + }) + return + + # Prefer moving this into `resume_or_create_session()` or a + # `GatewaySession.rebind_client()` helper so the legacy join path + # gets the same protection. + session._client_id = client_id self._client_sessions[client_id] = session.session_id

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

# Resume or create session

session_id = data.get("session_id")

since_cursor = data.get("since")

session, replay_events = self.resume_or_create_session(

session_id=session_id,

agent_id=agent_id,

client_id=client_id,

since_cursor=since_cursor,

)

self._client_sessions[client_id] = session.session_id

# Resume or create session

session_id = data.get("session_id")

since_cursor = data.get("since")

session, replay_events = self.resume_or_create_session(

session_id=session_id,

agent_id=agent_id,

client_id=client_id,

since_cursor=since_cursor,

)

if session.agent_id != agent_id:

error = HelloError(

code=ConnectErrorCode.AUTH_UNAUTHORIZED,

message="Session does not belong to the requested agent",

next_action="start_new_session",

)

await self._send_to_client(client_id, {

"type": "hello_error",

"code": error.code.value,

"message": error.message,

"next": error.next_action,

})

return

# Prefer moving this into `resume_or_create_session()` or a

# `GatewaySession.rebind_client()` helper so the legacy join path

# gets the same protection.

session._client_id = client_id

self._client_sessions[client_id] = session.session_id

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/gateway/server.py` around lines 1025 - 1035, The resumed session returned from resume_or_create_session() may belong to a different agent_id or contain a stale client_id, which causes incorrect message routing later since responses use session.client_id. Before storing the session in self._client_sessions, validate that the returned session's agent_id matches the requested agent_id parameter, and rebind the current client_id to the session to ensure the correct websocket receives the replayed messages and subsequent responses.

coderabbitai · 2026-06-22T09:24:35Z

+            # Build features list
+            features = {
+                "methods": ["message", "abort", "leave"],
+                "events": [
+                    EventType.TOKEN_STREAM.value,
+                    EventType.TOOL_CALL_STREAM.value,
+                    EventType.STREAM_END.value,
+                    EventType.MESSAGE.value,
+                    EventType.MESSAGE_ACK.value,
+                    EventType.ERROR.value,
+                ],
+            }
+
+            # Add optional features based on client capabilities
+            if "presence" in client_caps and hasattr(self, '_presence_tracker') and self._presence_tracker:
+                features["events"].extend([
+                    EventType.PRESENCE_JOIN.value,
+                    EventType.PRESENCE_LEAVE.value,
+                    EventType.PRESENCE_UPDATE.value,
+                ])
+
+            if "ack" in client_caps and hasattr(self, '_delivery_tracker') and self._delivery_tracker:
+                features["events"].extend([
+                    EventType.MESSAGE_NACK.value,
+                    EventType.DELIVERY_RETRY.value,
+                ])
+


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Only advertise negotiated and implemented features.

Line 1039 advertises "abort", but this handler has no abort branch; streaming events are also advertised even when the client did not offer a streaming capability. That makes hello_ok.features unreliable for clients.

🔧 Proposed feature list tightening

features = { - "methods": ["message", "abort", "leave"], + "methods": ["message", "leave"], "events": [ - EventType.TOKEN_STREAM.value, - EventType.TOOL_CALL_STREAM.value, - EventType.STREAM_END.value, EventType.MESSAGE.value, - EventType.MESSAGE_ACK.value, EventType.ERROR.value, ], } + + if "streaming" in client_caps: + features["events"].extend([ + EventType.TOKEN_STREAM.value, + EventType.TOOL_CALL_STREAM.value, + EventType.STREAM_END.value, + ]) @@ if "ack" in client_caps and hasattr(self, '_delivery_tracker') and self._delivery_tracker: features["events"].extend([ + EventType.MESSAGE_ACK.value, EventType.MESSAGE_NACK.value, EventType.DELIVERY_RETRY.value, ])

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

# Build features list

features = {

"methods": ["message", "abort", "leave"],

"events": [

EventType.TOKEN_STREAM.value,

EventType.TOOL_CALL_STREAM.value,

EventType.STREAM_END.value,

EventType.MESSAGE.value,

EventType.MESSAGE_ACK.value,

EventType.ERROR.value,

],

}

# Add optional features based on client capabilities

if "presence" in client_caps and hasattr(self, '_presence_tracker') and self._presence_tracker:

features["events"].extend([

EventType.PRESENCE_JOIN.value,

EventType.PRESENCE_LEAVE.value,

EventType.PRESENCE_UPDATE.value,

])

if "ack" in client_caps and hasattr(self, '_delivery_tracker') and self._delivery_tracker:

features["events"].extend([

EventType.MESSAGE_NACK.value,

EventType.DELIVERY_RETRY.value,

])

# Build features list

features = {

"methods": ["message", "leave"],

"events": [

EventType.MESSAGE.value,

EventType.ERROR.value,

],

}

if "streaming" in client_caps:

features["events"].extend([

EventType.TOKEN_STREAM.value,

EventType.TOOL_CALL_STREAM.value,

EventType.STREAM_END.value,

])

# Add optional features based on client capabilities

if "presence" in client_caps and hasattr(self, '_presence_tracker') and self._presence_tracker:

features["events"].extend([

EventType.PRESENCE_JOIN.value,

EventType.PRESENCE_LEAVE.value,

EventType.PRESENCE_UPDATE.value,

])

if "ack" in client_caps and hasattr(self, '_delivery_tracker') and self._delivery_tracker:

features["events"].extend([

EventType.MESSAGE_ACK.value,

EventType.MESSAGE_NACK.value,

EventType.DELIVERY_RETRY.value,

])

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/gateway/server.py` around lines 1037 - 1063, The features list in the code starting around line 1039 is advertising methods and events that are either not implemented or not negotiated with the client. Remove the "abort" method from the methods list unless it has a corresponding handler implemented, and move the streaming events (TOKEN_STREAM, TOOL_CALL_STREAM, STREAM_END, MESSAGE, MESSAGE_ACK, ERROR) from the base features dictionary into a conditional block that only adds them if the client has advertised streaming capability in client_caps, similar to how the presence and ack features are conditionally added. This ensures that hello_ok.features only advertises methods and events that are both implemented and negotiated with the client.

coderabbitai · 2026-06-22T09:24:35Z

+            # Build policy limits
+            policy = {
+                "max_payload": 1048576,  # 1MB
+                "max_buffered_bytes": 8388608,  # 8MB
+                "heartbeat_ms": 15000,  # 15 seconds
+            }


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Report the configured heartbeat policy.

Line 1068 hard-codes heartbeat_ms to 15s even though gateway config loads heartbeat_interval; clients using the negotiated policy can self-configure to the wrong interval.

🔧 Proposed config-backed policy

policy = { "max_payload": 1048576, # 1MB "max_buffered_bytes": 8388608, # 8MB - "heartbeat_ms": 15000, # 15 seconds + "heartbeat_ms": int(getattr(self.config, "heartbeat_interval", 15) * 1000), }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

# Build policy limits

policy = {

"max_payload": 1048576, # 1MB

"max_buffered_bytes": 8388608, # 8MB

"heartbeat_ms": 15000, # 15 seconds

}

# Build policy limits

policy = {

"max_payload": 1048576, # 1MB

"max_buffered_bytes": 8388608, # 8MB

"heartbeat_ms": int(getattr(self.config, "heartbeat_interval", 15) * 1000),

}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/gateway/server.py` around lines 1064 - 1069, The heartbeat_ms value in the policy dictionary is hard-coded to 15000 instead of using the heartbeat_interval value loaded from the gateway configuration. Replace the hard-coded 15000 value with the heartbeat_interval from the gateway config so that the policy reflects the actual configured heartbeat interval and clients negotiate the correct interval instead of defaulting to 15 seconds.

MervinPraison · 2026-06-22T09:28:06Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-06-22T09:28:26Z

Claude finished @MervinPraison's task —— View job

Architecture Review Complete ✅

Phase 1: Review per AGENTS.md

Protocol-driven: ✅ Correct - protocols defined in core SDK (praisonaiagents/gateway/protocols.py), heavy implementation in wrapper (praisonai/gateway/server.py)
Backward compatible: ✅ Preserved - legacy join/joined flow intact
Performance: ✅ No hot-path regressions - handshake is one-time per connection

Phase 2: Fixed Issues

Files Modified:

src/praisonai/praisonai/gateway/server.py

Issues Fixed:

✅ Fixed caps: null causing TypeError (P1) - Added None check and type validation
✅ Fixed wire parser field mismatches - Now supports both HelloParams format (protocol_min/max, capabilities) and legacy format
✅ Fixed hard-coded policy limits (P2) - Now uses config values (heartbeat_interval)
✅ Removed unused HelloParams import (P2)
✅ Added session validation and client_id rebinding - Prevents cross-agent session hijacking
✅ Only advertise implemented features - Removed non-existent "abort" method, made streaming conditional

Issues Skipped:
None - all valid issues from reviewers were addressed.

Phase 3: Final Verdict

✅ PR Approved - All architectural and implementation issues have been resolved. The implementation correctly follows protocol-driven design with protocols in core SDK and heavy implementation in wrapper, maintains full backward compatibility, and introduces no performance regressions.

The versioned handshake protocol is now ready for merge.

--- | View changes

- Fix caps: null causing TypeError by adding None check - Fix wire parser to support both HelloParams format (protocol_min/max, capabilities) and legacy format - Fix hard-coded policy limits to use config values (heartbeat_interval) - Remove unused HelloParams import - Validate session belongs to requested agent and rebind client_id - Only advertise implemented features (remove abort method, conditional streaming) Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

greptile-apps · 2026-06-22T09:35:30Z

+            if "protocol_min" in data or "protocol_max" in data:
+                # HelloParams format
+                client_min = data.get("protocol_min", 1)
+                client_max = data.get("protocol_max", 1)
+            else:
+                # Legacy format or missing
+                protocol_info = data.get("protocol", {})
+                if isinstance(protocol_info, dict):
+                    client_min = protocol_info.get("min", 1)
+                    client_max = protocol_info.get("max", 1)
+                else:
+                    # Backwards compatibility: treat missing protocol as v1
+                    client_min = client_max = 1
+
+            # Negotiate protocol version
+            if client_max < MIN_CLIENT_PROTOCOL_VERSION:
+                error = HelloError(
+                    code=ConnectErrorCode.PROTOCOL_UNSUPPORTED,
+                    message=f"Protocol version {client_max} is too old, minimum required is {MIN_CLIENT_PROTOCOL_VERSION}",
+                    next_action="upgrade_client"
+                )
+                await self._send_to_client(client_id, {
+                    "type": "hello_error",
+                    "code": error.code.value,
+                    "message": error.message,
+                    "next": error.next_action,
+                })
+                return


Null protocol_min/protocol_max causes unhandled TypeError

data.get("protocol_min", 1) returns None — not 1 — when the client explicitly sends "protocol_min": null, because .get() only substitutes its default when the key is absent. The same applies to protocol_max. The first comparison client_max < MIN_CLIENT_PROTOCOL_VERSION then raises TypeError: '<' not supported between instances of 'NoneType' and 'int', which propagates out of the hello handler and is swallowed by the outer WebSocket loop, silently dropping the connection instead of sending a hello_error to the client. This is the same class of null-vs-absent bug that was fixed for caps on line 1033 but was not applied here. Add a type/null guard immediately after extracting client_min/client_max, defaulting or rejecting non-integer values before any comparison.

praisonai-triage-agent Bot mentioned this pull request Jun 22, 2026

Gateway protocol lacks a versioned handshake with capability negotiation and structured connect errors #2151

Closed

Copilot AI requested a review from MervinPraison June 22, 2026 09:18

greptile-apps Bot reviewed Jun 22, 2026

View reviewed changes

Comment thread src/praisonai/praisonai/gateway/server.py Outdated

Comment thread src/praisonai/praisonai/gateway/server.py

Comment thread src/praisonai/praisonai/gateway/server.py Outdated

coderabbitai Bot reviewed Jun 22, 2026

View reviewed changes

greptile-apps Bot reviewed Jun 22, 2026

View reviewed changes

MervinPraison merged commit b1ce2fb into main Jun 22, 2026
26 checks passed

MervinPraison mentioned this pull request Jun 22, 2026

docs(gateway): add versioned handshake & capability negotiation page for PR #2154 MervinPraison/PraisonAIDocs#680

Open

7 tasks

Uh oh!

Conversation

praisonai-triage-agent Bot commented Jun 22, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Core SDK (praisonaiagents/gateway/protocols.py)

Wrapper (praisonai/gateway/server.py)

Benefits

Summary by CodeRabbit

Uh oh!

MervinPraison commented Jun 22, 2026

Uh oh!

MervinPraison commented Jun 22, 2026

Uh oh!

qodo-code-review Bot commented Jun 22, 2026

Qodo reviews are paused for this user.

Uh oh!

coderabbitai Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related issues

Poem

Uh oh!

MervinPraison commented Jun 22, 2026

Uh oh!

greptile-apps Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

MervinPraison commented Jun 22, 2026

Uh oh!

praisonai-triage-agent Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Architecture Review Complete ✅

Phase 1: Review per AGENTS.md

Phase 2: Fixed Issues

Phase 3: Final Verdict

Uh oh!

greptile-apps Bot Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

praisonai-triage-agent Bot commented Jun 22, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

greptile-apps Bot commented Jun 22, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 22, 2026 •

edited

Loading