[EROS-Arch] Re: design change -- IPC semantics
Charles Landau
clandau@macslab.com
Mon, 06 Nov 2000 14:25:18 -0800
shapj@us.ibm.com wrote:
> I'm contemplating some changes to the fine print of the capability invocation
> rules. The changes are as follows:
>
> ** 1. It is currently the case that the data string transfer is "all or
> nothing". If an N byte buffer was provided and K bytes were received, then at
> the end of the trap instruction bytes 0..K-1 contain the string sent by the
> invoker, and bytes K..N-1 are unchanged.
>
> REVISION: The semantic change is that the application may no longer assume that
> bytes K..N-1 are unchanged at the end of the IPC trap.
>
> Issue 1: It is now detectable that partially completed IPCs may have occurred.
> Note however, that there is already a trust relationship between all possible
> senders and the invokee by virtue of the fact that they held capabilities to the
> invokee.
Issue 2: The sender can't assume that if the invocation hasn't occurred, no data
was sent. If the sender changes his mind about sending (for example, by stopping
and redirecting the process with a debugger), he can't assume the receiver didn't
get any data.
>
>
> ** 2. It is currently the case that the entire sent string will be checked for
> sender readability, regardless of the number of bytes received by the recipient.
> That is, if the invoker transmits 8192 bytes and the invokee accepts only 16,
> all 8192 bytes must be readable to the invoker.
>
> REVISION: The bytes actually moved will be tested for validity. It is undefined
> whether additional bytes will be tested for validity.
>
> Issue 1: This exposes the size of the receive buffer. The sender and the
> recipient are already in legitemate communication, thus this covert channel is
> not important. If this is a concern, the sender
you mean receiver
> may prevent this detection by
> always receiving MSG_LIMIT bytes (currently 64k).
>
> Issue 2: This alters the reproducibility of the sender fault behavior. The
> sender fault behavior is already influenced by where page boundaries fall.
>
> ** RATIONALE: The current semantics requires that both receive buffer and send
> buffer be probed for validity before initiating the string transfer. This
> approximately triples the cost of the transfer. Under the revised architecture,
> the interdomain transfer path can simply copy the bits optimistically.
That is, when invoking a start or resume key, or in any other case where no side
effects occur before delivery of the string. (The side effect of partial delivery
of the string was covered in item #1.)
> If the
> pages on one side or the other are invalid, this will induce a page fault, and
> the page fault handler will do the appropriate thing (either invoke segment
> keeper on sender side or truncate the transfer on receiver side).
>
> The catch is that sender A can get halfway through, page fault, and then sender
> B can get in while sender A is producing the page. This motivates change #2.
>
> Note that no new process states are introduced -- an interrupted transfer is
> restarted from the beginning.
>
...
>
> **7. It is presently the case that the invoker enters the WAITING/AVAILABLE
> state (according to invocation type) prior to the execution of the capability
> operation. For example, during a CALL on a kernel key, a resume key is actually
> synthesized and the process actually does enter the waiting state briefly. This
> is done for the sole purpose of correctly supporting the semantics of a process
> that performs a RETURN on a START capability to itself (a perverse case, by the
> way, that really should never occur). On a multiprocessor, this state change is
> observable.
>
> REVISION: It is now guaranteed that the correct *final* state of the invoker
> will be set before or instantaneously with setting the state of the invokee. In
> the case where the invoker and invokee are the same, it is no longer guaranteed
> that the temporary process run state change will ever be observable.
Presumably it was never guaranteed that any temporary state was observable, since
it wasn't guaranteed that you would run during the period when the temporary state
was in effect.
> In
> particular, a CALL on a kernel capability need no longer observably alter the
> process state to WAITING. and an invocation on a gate capability need not
> observably reflect the change in the invoker process state before the invokee is
> made runnable [i.e. is placed on the run queue; the delay from "becoming
> runnable" to actually getting a CPU is potentially arbitrary, and isn't what I
> am talking about here.]
>
> In particular, this allows the case of RETURN to START key to yourself to be
> handled as a special case by updating the invoker state early in an out-of-line
> path, where the general case of RETURN still does the invoker state update late
> in the path.
Having a RETURN to START key to yourself work does seem like the right thing to do,
if, as you've suggested, it isn't too hard to implement. By the way, it isn't too
hard to do this; SEND to a START key to an unsuspecting server process, passing the
same START key as the last key parameter. If the server process isn't careful, it
will, when it's done, RETURN to that last key.
>
>
> CONSEQUENCE:
>
> Unless I've missed something, the following keys still need to manipulate string
> buffers after these changes:
>
> process key
> misc key (returner, range key)
> gate keys (start/resume)
> red segment keys (gate key indirection)
> red node keys (gate key indirection)
> device keys (data buffer xmit/rcv)
>
> Jonathan S. Shapiro, Ph. D.
> IBM T.J. Watson Research Center
> Email: shapj@us.ibm.com
> Phone: +1 914 784 7085 (Tieline: 863)
> Fax: +1 914 784 7595