[E-Lang] remote comms: Timeouts and Connection Failure

Mark S. Miller markm@caplet.com
Fri, 06 Apr 2001 13:43:17 -0700


At 10:48 AM Friday 4/6/01, Tyler Close wrote:
>At 10:28 AM 4/6/01 -0700, Mark S. Miller wrote:
>>[...] would that be good enough for a deal?
>
>Since you seem so thin on time lately, I think I'd need some sort of 
>guaranteed minimum expenditure of MarkM hours.

Fair enough.  Since the resulting parallelism could very significantly 
shorten the delay to E 1.0, I consider this high priority.  I'll negotiate a 
quantitative answer with you privately, as soon as I get some feedback from 
some current customers of E, and the time-urgency of their need for a 
revived CapTP.


>For the implementation I have in mind, one in N checkpoints would be an "all 
>BLOBs" checkpoint, whose runtime would obviously be linear with the number 
>of BLOBs in the Vat. All checkpoints N - 1 would only write out the diff to 
>the previous checkpoint and so should be just as fast as in Lock.

However amnesia turns out, this sounds very promising. If someone wants a 
large scalable system serving paying customers at high performance and 
without occasional long delay, I have no problem saying to them "buy a Lock 
license and upgrade to Lock".  In fact, I think this kind of split is one of 
the better open source revenue models.

However, the cheezy system has to be adequate for smaller scale and amateur 
use, so it depends on the length and frequency of these delays.  What 
penalty results from making N larger? In any case, I suspect your occasional 
delays are about the same length as the delays I would have subjected them 
to at every checkpoint, so yours is probably a strict win.


>So, you could forget about amnesia if you're willing to suffer block-up once 
>every N. We could try to put some thought into how to schedule this Nth 
>checkpoint so that it was as palatable as possible. Maybe N is of variable 
>length and is only done if the Vat has been idle for some preset amount of 
>time.

Depending on the tradeoffs of N, and the resulting frequency and length of 
these delays, I suspect this is all something we can live with.

 From a dinner conversation last night, it sounds like Dean may have some 
further reasons for wanting amnesia support, having to do with recovery from 
larger disasters, which I'll leave to Dean to explain.  But it sound like a 
*normal* crash and revive may no longer have to result in a rollback visible 
to other vats.  It's at least promising.


>>In any case, this offer makes me very hopeful.  On to persistence!!
>
>Not so fast buster, lets finish up collections first! ;)
>
>Something done is progress. Something half done is not.

;) 

Once I do start in on persistence, my first task will be to try to resolve 
the Hydro question.  But reviving CapTP may have to come first.


        Cheers,
        --MarkM