IPC Implementation Rewrite #114

Open · wants to merge 4 commits into master

Conversation

maxtyson123 (Contributor)

resolves #113

As discussed in issue #113, the current IPC message passing tutorial may be hard for some users to follow, and the overall design could be reworked for the better.

This PR proposes an improved design:

  • Requires only 2 syscalls instead of 3
  • Prevents a userspace program from dictating when the buffer can be freed (see 36.1 How It Works - final bullet point)
  • Allows for multiple messages
  • More freedom for the process that owns the endpoint to handle messages as it sees fit for its use case

(This IPC mechanism is a combination of the current one in the book and my implementation in Max OS.)

@DeanoBurrito (Member)

Thanks for this, I'll take a look later today.

@dreamos82 (Member)

LGTM.
Let's wait for @DeanoBurrito's review.

@DeanoBurrito (Member)

Sorry for the delay, looking at this now. Time works differently where I'm from 😅

@DeanoBurrito (Member) left a comment

Writing is good overall, but there are a couple of design issues I see:

  1. The kernel is accessing the userspace heap (see the distinction you made about kmalloc() vs malloc()). This is a bad idea, as the kernel now relies on user code to 'do the right thing' and provide a working malloc() to allocate the recipient's message buffers. Alternatively, the kernel has to be involved in userspace heap management (the user program might not even have a heap, or may have multiple, or some per-thread caches - how do you deal with that on the kernel side?).

This is a tricky one to deal with. The common approach is that userspace allocates the memory and informs the kernel of where the buffer is located as part of a system call. This is where a dedicated ipc_receive() function is handy, because the recipient can pass in the buffer they wish the message to be copied into (see the sketch below). You could also have a big block of memory attached to the endpoint and have it act as shared memory between the kernel and the receiving process (this would need some synchronization if the receiving process isn't blocking on reads).
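
To illustrate that first approach, here's a minimal sketch where the recipient owns the buffer and the kernel only ever copies into it. All names here (`ipc_receive`, `endpoint_id`) are hypothetical, not an API from the book or this PR:

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical syscall wrapper: userspace declares the buffer, the kernel
   copies the pending message into it and returns the message length
   (or a negative error). */
long ipc_receive(uint64_t endpoint_id, void* buffer, size_t buffer_len);

void example_recipient(uint64_t endpoint_id) {
    uint8_t buffer[256];    /* recipient-owned storage, no kernel-side malloc() */
    long length = ipc_receive(endpoint_id, buffer, sizeof(buffer));
    if (length >= 0) {
        /* `length` bytes of the message are now in `buffer` */
    }
}
```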

  2. Similar to 1, this version of message passing combines IPC concepts and userspace/kernel interactions into one chapter. In a real implementation there may be some overlap, but I think it muddies the waters when it comes to explaining something new. It would be better to break this down into smaller ideas: how to move messages between two address spaces (all of this happens in the kernel, which is what the original message passing example did), and then the API presented to userspace (not previously covered).

- _Process 1_ wants to receive incoming messages on an endpoint, so it calls a function telling the kernel to create an endpoint in our IPC manager. This function will setup and return a block of (userspace) memory containing a message queue. We'll call this function `create_endpoint()`.
- _Process 2_ wants to send a message sometime later, so it allocates a buffer and writes some data there.
- _Process 2_ now calls a function to tell the kernel it wants to send this buffer as a message to an endpoint. We'll call this function `ipc_send()`.
- Inside `ipc_send()` the buffer is copied into kernel memory. In our example, we'll use the heap for this memory. We can then switch to process 1's address space and copy the buffer on the heap into the queue.
@DeanoBurrito (Member)

"copy the buffer on the heap into the queue."
is a little ambiguous; I think relying less on context would clean this up:
"copy the kernel's buffer into the endpoint's queue."

uintptr_t next_message;
} ipc_message_t;

typedef struct ipc_message_queue{
@DeanoBurrito (Member)

Is this type needed? Since endpoints and queues have a one-to-one relationship I don't think it is. Instead, embedding the list head in ipc_endpoint should be fine.

void* msg_buffer;
size_t msg_length;
ipc_message_queue_t* queue;
uint64_t owner_pid;
@DeanoBurrito (Member)

I'd prefer tid_t over uint64_t here; it describes intent better, and uint64_t may not always be a convenient type depending on the architecture.
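
Putting the last two suggestions together, a revised layout might look like the sketch below. The field names are taken from the snippets above, while tid_t's definition and the pointer-based list link are assumptions made for illustration:

```c
#include <stddef.h>
#include <stdint.h>

typedef uint64_t tid_t;   /* width is an assumption; pick per architecture */

typedef struct ipc_message {
    void* data;
    size_t length;
    struct ipc_message* next_message;   /* the PR's version uses uintptr_t */
} ipc_message_t;

typedef struct ipc_endpoint {
    void* msg_buffer;
    size_t msg_length;
    ipc_message_t* queue_head;   /* list head embedded directly, replacing
                                    the separate ipc_message_queue_t */
    tid_t owner_pid;             /* describes intent better than uint64_t */
} ipc_endpoint_t;
```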

- What happens if there unread messages when destroying an endpoint? How do you handle them?
- Who is allowed to remove an endpoint?
- What happens if there are unread messages when destroying an endpoint? How do you handle them?
- Who is allowed to remove an endpoint? (`owner_pid` would be useful here)
@DeanoBurrito (Member)

What about:

(hint: owner_pid is useful here)


In theory this works, but we've overlooked one huge issue: what if there's already a message at the endpoint? You should handle this, and there's a couple of ways to go about it:
// Add  the message to the queue
@DeanoBurrito (Member)

Extra space after "Add".


In theory this works, but we've overlooked one huge issue: what if there's already a message at the endpoint? You should handle this, and there's a couple of ways to go about it:
// Add  the message to the queue
// Left for the reader to do, trivial linked list appending
@DeanoBurrito (Member)

This reads like a note I'd leave for myself rather than a hint. What about:

// append message struct to the endpoint's queue, implementation is left as an exercise for the reader.

- We've described a double-copy implementation here, but you might want to try a single-copy implementation. Single-copy implementations *can* be faster, but they require extra logic. For example the kernel will need to access the recipient's address space from the sender's address space, how do you manage this? If you have all of physical memory mapped somewhere (like an identity map, or direct map (HHDM)) you could use this, otherwise you will need some way to access this memory.
- A process waiting on an endpoint (to either send or receive a message) could be waiting quite a while in some circumstances. This is time the cpu could be doing work instead of blocking and spinning on a lock. A simple optimization would be to put the thread to sleep, and have it be woken up whenever the endpoint is updated: a new message is sent, or the current message is read.
- In this example we've allowed for messages of any size to be sent to an endpoint, but you may want to set a maximum message size for each endpoint when creating it. This makes it easier to receive messages as you know the maximum possible size the message can be, and can allocate a buffer without checking the size of the message. This might seem silly, but when receiving a message from userspace the program has to make a system call each time it wants the kernel to do something. Having a maximum size allows for one-less system call. Enforcing a maximum size for messages also has security benefits.
- We've described a double-copy implementation here, but you might want to try a single-copy implementation. Single-copy implementations *can* be faster, but they require extra logic. For example, the kernel will need to access the recipient's address space from the sender's address space, how do you manage this? If you have all of the physical memory mapped somewhere (like an identity map, or direct map (HHDM)) you could use this, otherwise, you will need some way to access this memory.
@DeanoBurrito (Member)

, otherwise,
Too many commas; the second one can be omitted, I think.

- In this example we've allowed for messages of any size to be sent to an endpoint, but you may want to set a maximum message size for each endpoint when creating it. This makes it easier to receive messages as you know the maximum possible size the message can be, and can allocate a buffer without checking the size of the message. This might seem silly, but when receiving a message from userspace the program has to make a system call each time it wants the kernel to do something. Having a maximum size allows for one-less system call. Enforcing a maximum size for messages also has security benefits.
- We've described a double-copy implementation here, but you might want to try a single-copy implementation. Single-copy implementations *can* be faster, but they require extra logic. For example, the kernel will need to access the recipient's address space from the sender's address space, how do you manage this? If you have all of the physical memory mapped somewhere (like an identity map, or direct map (HHDM)) you could use this, otherwise, you will need some way to access this memory.
- A process waiting on an endpoint (to either send or receive a message) could be waiting quite a while in some circumstances. This is a time when the cpu could be doing work instead of blocking and spinning on a lock. A simple optimization would be to put the thread to sleep, and have it be woken up whenever the endpoint is updated: a new message is sent, or the current message is read.
- In this example we've allowed for messages of any size to be sent to an endpoint, but you may want to set a maximum message size for each endpoint when creating it. This makes it easier to receive messages as you know the maximum possible size the message can be, and can allocate a buffer without checking the size of the message. This might seem silly, but when receiving a message from userspace the program has to make a system call each time it wants the kernel to do something. Having a maximum size allows for one less system call. Enforcing a maximum size for messages also has security benefits.
@DeanoBurrito (Member)

The point about saving a system call is irrelevant for this implementation, since the message contents and metadata are already in the recipient's address space. There's no system call in the example you give for receiving messages.
The rest is good.

@dreamos82 (Member)

@maxtyson123 any news on this PR? Do you want to make the changes requested?

@IAmTheNerdNextDoor (Contributor)

> @maxtyson123 any news on this PR? Do you want to make the changes requested?

and he never responded...

@maxtyson123 (Contributor, Author) commented Apr 27, 2025

Sorry, I didn't see the earlier messages. I'm quite busy right now, but I can begin working on it when I have spare time.

@dreamos82 (Member) commented Apr 27, 2025

> Quite busy right now but I can begin working on it when I have spare time.

Perfect, thanks! :) Just let us know in case you don't have time at all to finish it and want someone else to take it over.

@maxtyson123 (Contributor, Author)

Just to be clear before I begin working on this:

  1. Now that you mention it, I do agree that it wasn't the best practice, albeit for different reasons. In Max OS each process has its own Memory Manager that handles that process's memory (note: this may change when I work on actually implementing threads). I now realise that my suggested method "locks" the reader into my design of managing memory in the kernel in that way. How would you suggest this is fixed - going back to the read() design, or something new?
  2. It should be broken up into two sections. One subsection of the "Userspace" section for resource management, detailing HOW to move information between address spaces. Note: I haven't finished my filesystem implementation (what I'm working on for Max OS now), so if we want to address files in there as well (which I think may be a good idea) I will need to finish that first. Separately, in the IPC section, the code should be changed to build on top of this API.

P.S. Is there a better place to talk with easier back-and-forth? That way we can keep the public GitHub discussion to what is actually relevant to the PR, and also have a channel for smaller things.

@DeanoBurrito (Member)

Hey Max, all good - life is like that sometimes. Glad to see you're still interested in this :)
We have a discord server we hang out in (I'm a little absent myself these days, but I do eventually get around to responding to messages). Here's the invite: https://discord.gg/X5YmgDKW

As for your questions:

  1. It's pretty standard to have a 'memory manager' (in this case it's your VirtualMemoryManager class) per address space, which is usually linked one-to-one with a process. A number of threads would share the address space of the process, so that all makes sense. I'm not sure about the purpose of your MemoryManager class though: it looks like it allocates memory for a heap in the address space it's attached to (so far so good), but never maps the pages as user-accessible. So you have a kernel heap in the higher half and then one per address space in the lower half, if I understand correctly? This sounds like it's breaking the 'lower half is for userspace' idea. My suggestion is to remove the lower-half kernel heaps, keep kernel stuff in the higher half, and then, as you mentioned, go back to the read() design - or something similar where userspace passes the buffer to the kernel.

The other big benefit to the read() approach is that the user thread can block until there's a new message - or rather, you can see it as read() only returning when there's a new message to be processed (ignoring edge cases where the read times out or is cancelled). If the user thread only wants to poll, you can support that too, and in that case it's like the section you wrote, where it checks for a new message and, if there's none, carries on with other work. A sketch of both modes follows below.
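
To make that concrete, here's a minimal sketch of both modes from the user thread's side. It reuses the hypothetical ipc_receive() from earlier with an added flags argument; IPC_NONBLOCK and the return conventions are assumptions, not an API from the book:

```c
#include <stddef.h>
#include <stdint.h>

#define IPC_NONBLOCK 1  /* hypothetical flag: return instead of sleeping */

/* Hypothetical syscall wrapper: with no flags set, the calling thread is
   put to sleep until a message arrives; returns the message length, or a
   negative value if nothing was available in non-blocking mode. */
long ipc_receive(uint64_t endpoint_id, void* buffer, size_t buffer_len, int flags);

void blocking_consumer(uint64_t endpoint_id) {
    uint8_t buffer[256];
    for (;;) {
        /* only returns once there's a message to process */
        long length = ipc_receive(endpoint_id, buffer, sizeof(buffer), 0);
        if (length >= 0) {
            /* handle `length` bytes in `buffer` */
        }
    }
}

void polling_consumer(uint64_t endpoint_id) {
    uint8_t buffer[256];
    long length = ipc_receive(endpoint_id, buffer, sizeof(buffer), IPC_NONBLOCK);
    if (length < 0) {
        /* no message pending: carry on with other work */
    }
}
```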

  2. Yeah, that division sounds good. It'd be good to focus on providing general theory first for exposing kernel resources to userspace (via handles/file descriptors/objects/your preferred terminology), and how you might go about moving data in and out of those resources. That framework could then easily be applied to files, IPC and other things (see the sketch below).
    Personally I'd like to keep the focus of the userspace chapter on dealing with userspace, so I wouldn't object to some small examples of how this might be used - but I think it's better to keep file-access-related stuff in the VFS chapter.
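
Purely as an illustration of that framing (none of these names come from the book), the same handle-plus-ops surface could front an IPC endpoint now and a file later:

```c
#include <stddef.h>
#include <stdint.h>

typedef int64_t handle_t;   /* opaque userspace-visible resource id */

/* One ops table per resource kind lets files, IPC endpoints, etc.
   share a single read/write interface. */
typedef struct resource_ops {
    long (*read)(void* resource, void* buffer, size_t length);
    long (*write)(void* resource, const void* buffer, size_t length);
} resource_ops_t;

typedef struct resource {
    const resource_ops_t* ops;
    void* data;              /* e.g. an ipc_endpoint_t* or file state */
} resource_t;

/* Assumed helper: resolves a handle via the calling process's handle table. */
extern resource_t* lookup_handle(handle_t handle);

/* Kernel side of a read-style syscall: resolve the handle, then dispatch. */
long sys_read(handle_t handle, void* user_buffer, size_t length) {
    resource_t* res = lookup_handle(handle);
    if (res == NULL || res->ops->read == NULL)
        return -1;
    return res->ops->read(res->data, user_buffer, length);
}
```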

Successfully merging this pull request may close these issues: IPC Message Passing.

5 participants