privsep: Ensure we recv for real after a successful recv MSG_PEEK by rsmarples · Pull Request #556 · NetworkConfiguration/dhcpcd

rsmarples · 2025-11-17T22:51:45Z

Adjust the code flow so that the same errors would be caught after the final recv.
This ensures we read what is really meant for us and not something silly.

Should fix #555.

Adjust the code flow so that the same errors would be caught after the final recv. This ensures we read what is really meant for us and not something silly. Should fix #555.

coderabbitai · 2025-11-17T22:51:54Z

Important

Review skipped

This PR was authored by the user configured for CodeRabbit reviews. By default, CodeRabbit skips reviewing PRs authored by this user. It's recommended to use a dedicated user account to post CodeRabbit review feedback.

To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

Walkthrough

Reworked privsep IPC I/O to a two-phase recvmsg flow (peek to size, then mandatory consume), introduced optional heap allocation for per-message payloads (psr_mallocdata/psr_data), removed in-place mdata realloc/copy paths and the per-process data pointer/free callback fields and cleanup.

Changes

Cohort / File(s)	Summary
Privsep root I/O and buffer handling `src/privsep-root.c`	Replaced single MSG_PEEK read with explicit `msghdr`/`iovec` + `recvmsg(MSG_PEEK
Process struct and cleanup behavior `src/privsep.h`, `src/privsep.c`	Removed `psp_data` and `psp_freedata` fields from `struct ps_process`; removed runtime cleanup that invoked `psp_freedata` or freed `psp_data` in `ps_freeprocess`, so per-process data/free callbacks are no longer used during process teardown.

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant Root
    participant KernelSocket

    Client->>KernelSocket: sendmsg(error+payload)
    Root->>KernelSocket: recvmsg(MSG_PEEK|MSG_WAITALL)  // peek header + lengths
    KernelSocket-->>Root: header + total_len (peeked)
    alt allocate payload (psr_mallocdata)
        Root->>Root: malloc(psr_data, total_len)
        Root->>KernelSocket: recvmsg(MSG_WAITALL)  // consume and read payload into psr_data
        KernelSocket-->>Root: payload (consumed)
    else use preallocated buffer
        Root->>KernelSocket: recvmsg(MSG_WAITALL)  // consume into existing buffer
        KernelSocket-->>Root: payload (consumed)
    end
    Root->>Client: process/error callback invoked with read data

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

privsep: enforce message boundaries with MSG_EOR on our messages #533 — Similar privsep IPC changes: switching to msghdr/recvmsg with MSG_WAITALL/MSG_EOR and modified iovec/msg handling; likely closely related to the recvmsg/iovec/error-handling changes in this PR.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title clearly describes the main change: ensuring a real recv follows a successful MSG_PEEK to validate data.
Description check	✅ Passed	The description explains that the code now performs an actual recv after MSG_PEEK and re-checks errors, aiming to fix issue `#555`.
Linked Issues check	✅ Passed	The PR implements a two-phase recv approach (MSG_PEEK followed by actual recv) to ensure proper error detection, directly addressing issue `#555`'s buffer space error.
Out of Scope Changes check	✅ Passed	All changes focus on the recv/MSG_PEEK flow in privileged separation code; removed psp_data cleanup and related fields are directly tied to refactoring data handling within scope.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 385ff24 and f4446d5.

📒 Files selected for processing (1)

src/privsep-root.c (2 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (14)

GitHub Check: netbsd (--disable-ipv6)
GitHub Check: netbsd (--disable-ipv6, -DSMALL)
GitHub Check: netbsd (-DSMALL)
GitHub Check: netbsd (--disable-dhcp6, -DSMALL)
GitHub Check: netbsd (--disable-dhcp6)
GitHub Check: netbsd (--disable-ipv4ll)
GitHub Check: netbsd (--disable-ipv4ll, -DSMALL)
GitHub Check: netbsd (--disable-ipv4)
GitHub Check: netbsd (--disable-arp, -DSMALL)
GitHub Check: netbsd (--disable-ipv4, -DSMALL)
GitHub Check: netbsd (--disable-arp)
GitHub Check: netbsd
GitHub Check: freebsd
GitHub Check: openbsd

src/privsep-root.c

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (1)

src/privsep-root.c (1)
129-138: Critical: Fix MSG_WAITALL hang by using incoming message size.

Lines 134 and 137 set iov_len from the context buffer sizes (psr_ctx->psr_mdatalen, psr_ctx->psr_datalen), which can be larger than the actual incoming message size psr_error->psr_datalen. When MSG_WAITALL is used with an iov_len larger than available data, recvmsg blocks indefinitely waiting for bytes that will never arrive, wedging the privileged proxy and causing cascading failures like "No buffer space available."

This is the same critical issue identified in the past review and must be fixed.

Apply this diff to clamp iov_len to the actual incoming message size:
 	if (psr_error->psr_datalen != 0) {
 		if (psr_ctx->psr_usemdata) {
 			iov[1].iov_base = psr_ctx->psr_mdata;
-			/* psr_mdatalen could be smaller then psr_datalen
-			 * if the above malloc failed. */
-			iov[1].iov_len =
-			    MIN(psr_ctx->psr_mdatalen, psr_ctx->psr_datalen);
+			/* Clamp to actual incoming size to avoid MSG_WAITALL hang.
+			 * If buffer is too small, MSG_TRUNC will catch it. */
+			iov[1].iov_len = psr_error->psr_datalen;
+			if (iov[1].iov_len > psr_ctx->psr_mdatalen)
+				iov[1].iov_len = psr_ctx->psr_mdatalen;
 		} else {
 			iov[1].iov_base = psr_ctx->psr_data;
-			iov[1].iov_len = psr_ctx->psr_datalen;
+			iov[1].iov_len = psr_error->psr_datalen;
+			if (iov[1].iov_len > psr_ctx->psr_datalen)
+				iov[1].iov_len = psr_ctx->psr_datalen;
 		}
 	}

🧹 Nitpick comments (1)

src/privsep-root.c (1)
102-111: Consider removing MSG_WAITALL from the MSG_PEEK operation.

Using MSG_WAITALL with MSG_PEEK may cause unnecessary blocking when the message header hasn't fully arrived yet. Since this is just a peek operation to determine buffer size, and you're checking for short reads immediately after (line 110), MSG_PEEK alone should suffice.

Apply this diff:
-	len = recvmsg(fd, &msg, MSG_PEEK | MSG_WAITALL);
+	len = recvmsg(fd, &msg, MSG_PEEK);

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f4446d5 and 834a322.

📒 Files selected for processing (1)

src/privsep-root.c (2 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (14)

GitHub Check: netbsd (--disable-dhcp6, -DSMALL)
GitHub Check: netbsd (--disable-ipv6, -DSMALL)
GitHub Check: netbsd (--disable-dhcp6)
GitHub Check: netbsd (--disable-ipv6)
GitHub Check: netbsd (--disable-ipv4ll)
GitHub Check: netbsd (--disable-ipv4ll, -DSMALL)
GitHub Check: netbsd (--disable-arp)
GitHub Check: netbsd (-DSMALL)
GitHub Check: netbsd (--disable-ipv4)
GitHub Check: netbsd (--disable-arp, -DSMALL)
GitHub Check: netbsd
GitHub Check: netbsd (--disable-ipv4, -DSMALL)
GitHub Check: freebsd
GitHub Check: openbsd

🔇 Additional comments (2)

src/privsep-root.c (2)

113-127: LGTM! Proper handling of allocation failure.

The allocation failure handling is correct: when realloc fails, the code leaves the smaller buffer in place and relies on MSG_TRUNC detection in the subsequent recvmsg to signal ENOBUFS. This ensures the message is properly discarded from the queue.

141-151: LGTM! Comprehensive validation of received message.

The final receive path correctly:

Centralizes the actual message consumption for both early failures and normal flow

Validates minimum header size

Detects truncation via MSG_TRUNC and returns ENOBUFS

Ensures exact size match between received data and expected header + payload

Allows us to distinguish from a shortened header. Log the result and expected sizes to try and diagnose.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (2)

src/privsep-root.c (2)
118-140: Dynamic buffer growth and iov_len capping avoid hangs and handle ENOBUFS cleanly

The realloc of psr_ctx->psr_mdata and the subsequent setup:

Preserve the old buffer if realloc fails, so the pointer and size remain valid.

Use iov[1].iov_base from either psr_mdata or the caller’s buffer as before.

Critically, cap iov[1].iov_len to psr_error->psr_datalen, so the total requested length is never larger than the actual message size, eliminating the previous MSG_WAITALL hang scenario when the context buffer was larger than the incoming payload.

In the “realloc failed” case, psr_mdatalen < psr_error->psr_datalen, which will force a truncated read in the final recvmsg, correctly surfacing ENOBUFS while draining the queue.

Only tiny nit: the new comment still says “failed to malloc” even though realloc is used here.
-       /* If we failed to malloc then psr_mdatalen will be smaller
+       /* If we failed to realloc then psr_mdatalen will be smaller
142-157: Final recvmsg validation is robust; comment could be tightened to match new invariants

The final recvmsg(fd, &msg, MSG_WAITALL) plus:

Short‑header check (len < sizeof(*psr_error)),

Explicit MSG_TRUNC → ENOBUFS mapping,

Exact size check against sizeof(*psr_error) + psr_error->psr_datalen with EBADMSG and logging,

gives a solid guard against partial or malformed records and guarantees the caller sees consistent errors while the message is fully consumed from the queue. With the new iov_len capping above, the “receive buffers are bigger than the actual message” case cannot actually occur anymore, so the comment at Line 143 is now slightly misleading; consider updating it to say explicitly that the total requested length is bounded by the announced psr_datalen, which is what prevents stalling with MSG_WAITALL.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 595c305 and 36d3dad.

📒 Files selected for processing (1)

src/privsep-root.c (2 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (14)

GitHub Check: netbsd (--disable-dhcp6, -DSMALL)
GitHub Check: netbsd (--disable-ipv4ll, -DSMALL)
GitHub Check: netbsd (--disable-dhcp6)
GitHub Check: netbsd (--disable-ipv6, -DSMALL)
GitHub Check: netbsd (--disable-ipv4ll)
GitHub Check: netbsd (--disable-ipv6)
GitHub Check: netbsd (-DSMALL)
GitHub Check: netbsd
GitHub Check: netbsd (--disable-ipv4, -DSMALL)
GitHub Check: netbsd (--disable-arp, -DSMALL)
GitHub Check: netbsd (--disable-arp)
GitHub Check: netbsd (--disable-ipv4)
GitHub Check: freebsd
GitHub Check: openbsd

🔇 Additional comments (1)

src/privsep-root.c (1)

89-112: Two-phase recvmsg with MSG_PEEK looks correct and keeps the queue in sync

The introduction of struct msghdr msg and the initial recvmsg(fd, &msg, MSG_PEEK | MSG_WAITALL) followed by an unconditional non‑peek recvmsg (via goto recv when needed) ensures you (a) size buffers based on the header, and (b) always actually consume the message from the SOCK_SEQPACKET queue. This addresses the prior risk of acting only on peeked data without draining the record, without introducing obvious new failure modes.

perkelix · 2025-12-03T10:19:51Z

Does this one seem ready to merge too?

rsmarples · 2025-12-03T10:22:21Z

Does this one seem ready to merge too?

I would like some feedback on it first as I still don't know if it fixes people issues as I cannot replicate the problem.

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/privsep-root.c`:
- Around line 130-144: The error path after the second recvmsg can leak memory
when psr_mallocdata succeeded: ensure pc->psr_data is freed (or set to NULL)
before invoking PSR_ERROR in the recv handling block; modify the recv error
branches in the recv/psr_error handling so that before any PSR_ERROR call you
either free(pc->psr_data) or set pc->psr_data = NULL (so ps_root_mreaderror and
callers like ps_root_getifaddrs won't receive a dangling pointer), or factor out
a single cleanup label that clears pc->psr_data and then jumps to the PSR_ERROR
invocation to avoid repetition.

src/privsep-root.c

privsep: Ensure we recv for real after a successful recv MSG_PEEK

f4446d5

Adjust the code flow so that the same errors would be caught after the final recv. This ensures we read what is really meant for us and not something silly. Should fix #555.

coderabbitai bot reviewed Nov 17, 2025

View reviewed changes

src/privsep-root.c Outdated Show resolved Hide resolved

Improve commentary for prior

834a322

coderabbitai bot reviewed Nov 18, 2025

View reviewed changes

rsmarples added 2 commits November 18, 2025 07:42

More minor comments.

595c305

Return EBADMSG on recvmsg len mismatch

36d3dad

Allows us to distinguish from a shortened header. Log the result and expected sizes to try and diagnose.

coderabbitai bot reviewed Nov 30, 2025

View reviewed changes

rsmarples added 3 commits November 30, 2025 09:50

Improve comments.

a3ce698

Don't work out buffers on a truncated header.

cd75b08

Move the vars for reading onto the stack and off the context

16a0bfb

perkelix mentioned this pull request Dec 3, 2025

dhcpcd core dumps on FreeBSD 15-RELEASE #565

Open

rsmarples mentioned this pull request Dec 3, 2025

regression 10.3.0 fails to acquire DHCPv4 #564

Open

perkelix mentioned this pull request Jan 3, 2026

SIOCSPFXFLUSH_IN6: No buffer space available after updating to 10.3.0 (FreeBSD 14.3) #555

Closed

Move message to debug

1a8e965

coderabbitai bot reviewed Feb 17, 2026

View reviewed changes

src/privsep-root.c Show resolved Hide resolved

Free data on error

d1bf99f

rsmarples merged commit 661cdbd into master Feb 17, 2026
3 of 17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

privsep: Ensure we recv for real after a successful recv MSG_PEEK#556

privsep: Ensure we recv for real after a successful recv MSG_PEEK#556
rsmarples merged 9 commits intomasterfrom
privsep-recvmsg

rsmarples commented Nov 17, 2025

Uh oh!

coderabbitai bot commented Nov 17, 2025 •

edited

Loading

Review skipped

❌ Failed checks (1 warning)

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

perkelix commented Dec 3, 2025

Uh oh!

rsmarples commented Dec 3, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

rsmarples commented Nov 17, 2025

Uh oh!

coderabbitai bot commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

❌ Failed checks (1 warning)

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

perkelix commented Dec 3, 2025

Uh oh!

rsmarples commented Dec 3, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coderabbitai bot commented Nov 17, 2025 •

edited

Loading