IPC Implementation Rewrite #114
Conversation
Thanks for this, I'll take a look later today.

LGTM.

sorry for the delay, looking at this now. Time works differently where I'm from 😅
Writing is good overall, but there are a couple of design issues I see:
- The kernel is accessing the userspace heap (see the distinction you made about `kmalloc()` vs `malloc()`)? This is a bad idea, as the kernel now relies on user code to 'do the right thing' and provide a working `malloc()` to allocate the recipient's message buffers. Or alternatively the kernel has to be involved in userspace heap management (the user program might not even have a heap, or may have multiple, or some per-thread caches - how do you deal with that on the kernel side?). This is a tricky one to deal with; the common approach is that userspace allocates the memory and informs the kernel of where the buffer is located as part of a system call. This is where a dedicated `ipc_receive()` function is handy, because the recipient can pass the buffer they wish the message to be copied into (see the sketch after this list). You could also have a big block of memory attached to the endpoint and have that act as shared memory between the kernel and the receiving process (this would need some synchronization if the receiving process isn't blocking on reads).
- Similar to 1, this version of message passing combines IPC concepts and userspace/kernel interactions into one chapter. In a real implementation there may be some overlap, but I think it muddies the waters when it comes to explaining something new. It would be better to break this down into smaller ideas: how to move messages between two address spaces (all of this happens in the kernel, and is what the original message passing example did), and then the API presented to userspace (not previously covered).
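A minimal sketch of the receive side under that approach, assuming a hypothetical `ipc_receive()` syscall whose kernel-side handler looks something like this (all names here are illustrative, not a fixed API):

```c
#include <stddef.h>
#include <string.h>

// Kernel-side handler for a hypothetical ipc_receive() syscall.
// `user_buffer` was allocated by the recipient in userspace and passed in
// via the syscall, so the kernel never calls the user's malloc(); it only
// writes into memory the caller already owns.
// Returns the message length, or a negative error code.
static long ipc_receive_handler(void *user_buffer, size_t buffer_len,
                                const void *pending_msg, size_t pending_len)
{
    if (pending_msg == NULL)
        return -1;              // nothing queued; a real kernel might block
    if (pending_len > buffer_len)
        return -2;              // caller's buffer is too small
    // NOTE: a real kernel must first verify that user_buffer is a valid,
    // writable userspace range belonging to the calling process.
    memcpy(user_buffer, pending_msg, pending_len);
    return (long)pending_len;
}
```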
- _Process 1_ wants to receive incoming messages on an endpoint, so it calls a function telling the kernel to create an endpoint in our IPC manager. This function will set up and return a block of (userspace) memory containing a message queue. We'll call this function `create_endpoint()`.
- _Process 2_ wants to send a message sometime later, so it allocates a buffer and writes some data there.
- _Process 2_ now calls a function to tell the kernel it wants to send this buffer as a message to an endpoint. We'll call this function `ipc_send()`.
- Inside `ipc_send()` the buffer is copied into kernel memory. In our example, we'll use the heap for this memory. We can then switch to process 1's address space and copy the buffer on the heap into the queue.
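A rough sketch of that double-copy send path, with hypothetical helpers `kmalloc()`, `switch_address_space()`, and `queue_append()` standing in for kernel facilities covered elsewhere (only `create_endpoint()` and `ipc_send()` are named above):

```c
#include <stddef.h>
#include <string.h>

typedef struct ipc_endpoint ipc_endpoint_t;             // defined later on

extern void *kmalloc(size_t len);                       // kernel heap alloc
extern void switch_address_space(ipc_endpoint_t *ep);   // enter owner's space
extern void queue_append(ipc_endpoint_t *ep, const void *buf, size_t len);

// Copy 1: sender's buffer -> kernel heap.
// Copy 2: kernel heap -> the endpoint's queue, performed after switching
// to process 1's address space so the queue memory is addressable.
// (The kernel heap is assumed to be mapped in every address space.)
int ipc_send(ipc_endpoint_t *ep, const void *buffer, size_t len)
{
    void *kbuf = kmalloc(len);
    if (kbuf == NULL)
        return -1;
    memcpy(kbuf, buffer, len);      // copy 1: user -> kernel
    switch_address_space(ep);       // switch to the receiver's address space
    queue_append(ep, kbuf, len);    // copy 2: kernel -> receiver's queue
    return 0;
}
```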
> copy the buffer on the heap into the queue.

is a little ambiguous, I think relying less on context would clean this up:

> copy the kernel's buffer into the endpoint's queue.
```c
    uintptr_t next_message;
} ipc_message_t;

typedef struct ipc_message_queue {
```
Is this type needed? Since endpoints and queues have a one-to-one relationship I don't think this is necessary. Instead, embedding the list head in `ipc_endpoint` should be fine.
```c
    void* msg_buffer;
    size_t msg_length;
    ipc_message_queue_t* queue;
    uint64_t owner_pid;
```
I'd prefer `tid_t` over `uint64_t` here, it describes intent better. `uint64_t` may not always be a convenient type depending on the architecture.
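Putting the two suggestions together (embedding the queue head in the endpoint, and using a dedicated id type), the endpoint could look something like this sketch; `tid_t` and the field names are illustrative:

```c
#include <stddef.h>
#include <stdint.h>

typedef uint32_t tid_t;   // illustrative; use your kernel's thread-id type

typedef struct ipc_message {
    void *msg_buffer;
    size_t msg_length;
    struct ipc_message *next_message;
} ipc_message_t;

// The list head lives directly in the endpoint, so no separate
// ipc_message_queue type is needed.
typedef struct ipc_endpoint {
    ipc_message_t *msg_head;
    ipc_message_t *msg_tail;
    tid_t owner_tid;
} ipc_endpoint_t;
```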
- What happens if there unread messages when destroying an endpoint? How do you handle them?
- Who is allowed to remove an endpoint?
- What happens if there are unread messages when destroying an endpoint? How do you handle them?
- Who is allowed to remove an endpoint? (`owner_pid` would be useful here)
what about:

> (hint: `owner_pid` is useful here)
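To make those two questions concrete, endpoint teardown might look like this sketch (it reuses the illustrative `ipc_endpoint_t` above and assumes a `kfree()`; the policy choices shown are just one option):

```c
#include <stddef.h>

extern void kfree(void *ptr);

// Two policy decisions are embedded here: only the owner may destroy the
// endpoint, and any unread messages are drained and freed rather than
// delivered or leaked.
int destroy_endpoint(ipc_endpoint_t *ep, tid_t caller)
{
    if (ep->owner_tid != caller)
        return -1;                          // only the owner may remove it

    ipc_message_t *msg = ep->msg_head;
    while (msg != NULL) {
        ipc_message_t *next = msg->next_message;
        kfree(msg->msg_buffer);             // free the copied message data
        kfree(msg);                         // free the message struct itself
        msg = next;
    }
    kfree(ep);
    return 0;
}
```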
In theory this works, but we've overlooked one huge issue: what if there's already a message at the endpoint? You should handle this, and there are a couple of ways to go about it:

```c
// Add the message to the queue
```
extra space after `Add`
```c
// Left for the reader to do, trivial linked list appending
```
this reads like a note I'd leave for myself rather than a hint. What about:

> // append message struct to the endpoint's queue, implementation is left as an exercise for the reader.
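For readers who do want it spelled out, the append itself is small; a sketch using the illustrative `ipc_endpoint_t` from earlier (a real kernel would hold the endpoint's lock around this):

```c
#include <stddef.h>

// Append a message to the tail of the endpoint's singly linked queue.
static void append_message(ipc_endpoint_t *ep, ipc_message_t *msg)
{
    msg->next_message = NULL;
    if (ep->msg_tail == NULL)
        ep->msg_head = msg;                 // queue was empty
    else
        ep->msg_tail->next_message = msg;
    ep->msg_tail = msg;
}
```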
- We've described a double-copy implementation here, but you might want to try a single-copy implementation. Single-copy implementations *can* be faster, but they require extra logic. For example the kernel will need to access the recipient's address space from the sender's address space, how do you manage this? If you have all of physical memory mapped somewhere (like an identity map, or direct map (HHDM)) you could use this, otherwise you will need some way to access this memory.
- A process waiting on an endpoint (to either send or receive a message) could be waiting quite a while in some circumstances. This is time the cpu could be doing work instead of blocking and spinning on a lock. A simple optimization would be to put the thread to sleep, and have it be woken up whenever the endpoint is updated: a new message is sent, or the current message is read.
- In this example we've allowed for messages of any size to be sent to an endpoint, but you may want to set a maximum message size for each endpoint when creating it. This makes it easier to receive messages as you know the maximum possible size the message can be, and can allocate a buffer without checking the size of the message. This might seem silly, but when receiving a message from userspace the program has to make a system call each time it wants the kernel to do something. Having a maximum size allows for one-less system call. Enforcing a maximum size for messages also has security benefits.
- We've described a double-copy implementation here, but you might want to try a single-copy implementation. Single-copy implementations *can* be faster, but they require extra logic. For example, the kernel will need to access the recipient's address space from the sender's address space, how do you manage this? If you have all of the physical memory mapped somewhere (like an identity map, or direct map (HHDM)) you could use this, otherwise, you will need some way to access this memory.
> , otherwise,

Too many commas, the second one can be omitted I think.
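As a sketch of the single-copy idea discussed above, assuming a higher-half direct map (HHDM) and a hypothetical page-table-walking helper (`virt_to_phys()` and `hhdm_base` are placeholders for your kernel's equivalents):

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

// Translate a virtual address through another process's page tables.
extern uintptr_t virt_to_phys(void *page_tables, uintptr_t vaddr);
extern const uintptr_t hhdm_base;   // where all physical memory is mapped

// Single-copy send: from the sender's address space, write straight into
// the recipient's buffer through the direct map, with no kernel bounce
// buffer. Simplification: assumes the destination range doesn't cross a
// page boundary (a real version must split the copy per page).
void single_copy_send(void *dest_page_tables, uintptr_t dest_vaddr,
                      const void *src, size_t len)
{
    uintptr_t phys = virt_to_phys(dest_page_tables, dest_vaddr);
    memcpy((void *)(hhdm_base + phys), src, len);
}
```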
- A process waiting on an endpoint (to either send or receive a message) could be waiting quite a while in some circumstances. This is a time when the cpu could be doing work instead of blocking and spinning on a lock. A simple optimization would be to put the thread to sleep, and have it be woken up whenever the endpoint is updated: a new message is sent, or the current message is read.
- In this example we've allowed for messages of any size to be sent to an endpoint, but you may want to set a maximum message size for each endpoint when creating it. This makes it easier to receive messages as you know the maximum possible size the message can be, and can allocate a buffer without checking the size of the message. This might seem silly, but when receiving a message from userspace the program has to make a system call each time it wants the kernel to do something. Having a maximum size allows for one less system call. Enforcing a maximum size for messages also has security benefits.
the point about saving a system call is irrelevant for this implementation, since the message contents and metadata are already in the recipient's address space. There's no system call in the example you give for receiving messages.
The rest is good.
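The sleep/wake optimization from the bullets above could be sketched like this, with hypothetical scheduler hooks (`block_on()` and `wake_all()` stand in for whatever primitives your scheduler provides; `ipc_send()` would call `wake_all()` after appending a message):

```c
#include <stddef.h>

extern void block_on(void *wait_queue);    // sleep until woken
extern void wake_all(void *wait_queue);    // called by the sender side

// Blocking receive: sleep instead of spinning while the queue is empty,
// then pop the head message. Uses the illustrative ipc_endpoint_t above.
ipc_message_t *blocking_receive(ipc_endpoint_t *ep, void *wait_queue)
{
    while (ep->msg_head == NULL)
        block_on(wait_queue);               // yield the cpu while waiting

    ipc_message_t *msg = ep->msg_head;      // pop the head of the queue
    ep->msg_head = msg->next_message;
    if (ep->msg_head == NULL)
        ep->msg_tail = NULL;
    return msg;
}
```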
resolves #113
As discussed in Issue #113, the current IPC message passing tutorial may be hard to follow for some users, and the overall design could be reworked to be better.
This PR proposes an improved design:
(This IPC mechanism was a combination of the current one in the book and my implementation in Max OS.)