Sun did try multi-thread fork semantics, but that didn't help.
UNIX "fork" started as a hack. The reason UNIX originally used a fork/exec approach to program launch was to conserve memory on the PDP-11. "fork" originally worked by swapping the process out to disk. Then, at the moment there was a good copy both in memory and on disk, the process table entry was duplicated, with one copy pointing to the swapped-out process image and the other pointing to the in-memory copy. The regular swapping system took it from there.
Then, as machines got bigger, the Berkeley BSD crowd, rather than simply introducing a "run" primitive, hacked on a number of variants of "fork" to make program launch more efficient. That's what we mostly have today. Plan 9 supported more variants in a more rational way; you could choose whether to share or copy code, data, stack, and I think file opens. The Plan 9 paper says that most of the variants proved useful. But that approach ended with Plan 9. UCLA Locus just had a "run" primitive; Locus ran on multiple machines without shared memory, so "fork" wasn't too helpful there.
Threads came long after "fork" in the UNIX world. (Threads originated with UNIVAC 1108 Exec 8, first released in 1967 and known today as OS 2200, where they were called "activities".) Exec 8 had "run", and an activity could fork off another activity, but there was no process-level fork. Activities were available both inside and outside the OS kernel, decades before UNIX.
That's why UNIX thread semantics have always been troublesome, especially where signals are involved. They were added on, not designed in.
Not all combinations are allowed. In this specific case, if you specify CLONE_SIGHAND you must also specify CLONE_VM (so the processes share a virtual memory space and are essentially threads).
For that reason, clone(2) always felt like an overgenerality -- an attempt to decompose something into orthogonal parts, most combinations of which actually don't make sense.
Even when you have a more formal "run" primitive, you still want a lot of inheritance: file handles (for console programs and redirection), security identity and authorizations, and the current working directory.
There's also the matter of API complexity. If you want to configure the context in which a program runs, there are roughly two different API approaches you can take. You can deeply parameterize the 'run' command, or you can configure the "current" process and then hand that off to the new process. But you usually need the second API (configuring the current process) anyway.
So I don't see fork(), per se, as a particularly crufty API. Slightly more configurability, along the lines of clone() or Plan 9's rfork(), would be better.
Multi-threaded signals are pretty dreadful, though. In a multi-threaded system, signals should be delivered on a separate thread, not via an existing thread getting hijacked.
That is basically the recommended way to handle signals in a pthreaded program - have one dedicated signal-handling thread that calls `sigwait()` in a loop, and block all signals in the signal masks of the other threads.
"fork()" didn't start as a hack. There is no documentation to suggest that one return from fork() was a copy now on disk while the other remained in RAM.
1906: Call "xswap" (4368) to copy the data segments into the disk swap area. Because the second parameter is zero, the main memory area will not be released.
1907: Mark the new process as "swapped out".
1908: Return the current process to its normal state.
From what I can tell, that code is in a conditional which checks whether the new process can fit in main memory. If it can, it jumps to line 1913.
Anyhow, there's a difference between the fork interface being a hack and the fork implementation being a hack. Unix is a cornucopia of implementation hacks. That doesn't mean the interfaces weren't deliberately and thoughtfully designed.
Much like C, what makes Unix unique and still relevant is that the deliberate design took into account practical implementation considerations. Unix and C are most elegant from an engineer's perspective. It's an interesting balance of interface complexity and implementation complexity. This is why some people claim that the Unix design philosophy epitomizes "worse is better".
plan9 rfork() never shares the stack segment. The other segments can be shared or copied depending on the RFMEM flag. There's no special voodoo needed for "thread" local storage; it's just memory reserved at the top of the stack.