Bug #1112195 “Mir client API lacks simple synchronous interface”

Daniel van Vugt (vanvugt) on 2013-02-01

Daniel van Vugt (vanvugt) wrote on 2013-02-01:

#1

To clarify, I suggest a simple split. Functions that are presently asynchronous should have two versions. For example:
mir_connect()
mir_async_connect()

Unless someone can think of an elegant way to keep them as one function?

Michi Henning (michihenning) wrote on 2013-02-01:

#2

There is a generic underlying mechanism that can be used. I implemented this for Ice. The API was inspired in part by the .NET API for asynchronous methods. With templates, it's possible to transparently implement the synchronous API in terms of the asynchronous one (with essentially zero overhead).

Basically, the API consists of begin_<method> and end_<method> calls. I start the call by calling begin_<method>() and, when I'm ready to collect the results, I call end_<method>(). If, at the time I make the end_ call, the invocation has completed, the end_ method returns immediately; otherwise, it blocks until the result is available.

This mechanism makes it possible to construct synchronous calls on top of asynchronous calls very easily and elegantly. There are hooks that can be used to type-safely move state between the invoking end and the responding end, and the end_ method can easily be generated by a template to invoke a callback.
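
A minimal C++ sketch of the begin_/end_ pattern described above, using std::future as the result handle. The names begin_compute, end_compute, compute and the AsyncResult alias are hypothetical; they are not taken from Ice or Mir, and the sketch is not meant to reflect how Ice implements the pattern internally.

```cpp
#include <cassert>
#include <future>

// Hypothetical result handle; a plain std::future stands in for it here.
using AsyncResult = std::future<int>;

// Starts the work on another thread and returns immediately.
AsyncResult begin_compute(int x)
{
    return std::async(std::launch::async, [x] { return x * 2; });
}

// Returns at once if the work has finished; otherwise blocks until it has.
int end_compute(AsyncResult& r)
{
    assert(r.valid()); // a handle can only be completed once
    return r.get();
}

// The synchronous version is simply begin_ followed by end_.
int compute(int x)
{
    AsyncResult r = begin_compute(x);
    return end_compute(r);
}
```

A caller that wants overlap calls begin_compute(), does other work, and calls end_compute() when it needs the value; a caller that does not care calls compute().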

My recommendation would be to *not* use callbacks as the default. It is much harder to implement synchronous behavior in terms of callbacks than it is to implement asynchronous behaviour in terms of a completion method. An API along the lines of .NET keeps the flexibility, causes minimum overhead, and doesn't force an awkward interaction model on the application.

http://www.zeroc.com/articles/csharp-async.pdf

Michi Henning (michihenning) wrote on 2013-02-01:

#3

Sorry, the more appropriate link is probably the one below, which shows the C++ API:

http://www.zeroc.com/articles/cpp-async.pdf

Daniel van Vugt (vanvugt) wrote on 2013-02-04:

#4

Nice, Michi. An "end" function for each operation would certainly avoid the need for callbacks even in the asynchronous case. And each operation's "end" function can return different data types, unlike the generic mir_wait_for().

I'm guessing it would look like:
    op = mir_begin_something(...);
    ...
    result = mir_end_something(op, ...);

That would be a great way to reduce pain and avoid callbacks in the client API. I still suspect the simple synchronous case requires an extra simple version though:
    result = mir_something(...);

Daniel van Vugt (vanvugt) wrote on 2013-02-04:

#5

Of course, one thing you can't do with the "end" approach is wait on multiple things at once, as you could with something like:
    void mir_wait_for_all(MirWaitHandle **wait_handle);

This is one of Windows' strengths: it generalizes the Event concept so you can wait on different types of events. The same applies on Unix/Linux if everything you're waiting on is a select()-able file descriptor.

Daniel van Vugt (vanvugt) wrote on 2013-02-04:

#6

Sorry, I was wrong. If you use MirWaitHandle as the "op" type above then you could implement wait-for-many functionality.
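
One way this could look, sketched in C++ under loud assumptions: WaitHandle here is a hypothetical stand-in for MirWaitHandle (whose real definition is not shown in this thread), and Op, wait_for_all, begin_count, begin_name, end_count and end_name are all invented for illustration. Each typed "op" is also a wait handle, so a generic wait-for-many works alongside typed end_ functions.

```cpp
#include <cassert>
#include <future>
#include <initializer_list>
#include <string>
#include <utility>

// Hypothetical stand-in for MirWaitHandle: it only knows how to block.
struct WaitHandle {
    virtual ~WaitHandle() = default;
    virtual void wait() = 0;
};

// A typed operation is a wait handle that also carries its eventual result.
template <typename T>
struct Op : WaitHandle {
    std::future<T> result;
    explicit Op(std::future<T> f) : result(std::move(f)) {}
    void wait() override { result.wait(); }
};

// Generic wait-for-many: works on any mix of operation types.
void wait_for_all(std::initializer_list<WaitHandle*> handles)
{
    for (WaitHandle* h : handles) h->wait();
}

Op<int> begin_count()
{
    return Op<int>(std::async(std::launch::async, [] { return 3; }));
}

Op<std::string> begin_name()
{
    return Op<std::string>(
        std::async(std::launch::async, [] { return std::string("mir"); }));
}

// Each end_ function returns its own result type.
int end_count(Op<int>& op)
{
    assert(op.result.valid());
    return op.result.get();
}

std::string end_name(Op<std::string>& op)
{
    assert(op.result.valid());
    return op.result.get();
}

std::string demo()
{
    Op<int> a = begin_count();
    Op<std::string> b = begin_name();
    wait_for_all({&a, &b}); // blocks until both have finished
    return end_name(b) + std::to_string(end_count(a));
}
```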

Daniel van Vugt (vanvugt) wrote on 2013-02-04:

#7

There is a simple way to combine all these approaches and do away with callbacks...

    Something result;
    MirWaitHandle *w = mir_something(..., &result);
    ...
    mir_wait_for(w);
    use_result(result);
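
A rough C++ sketch of the shape proposed above, assuming a wait handle that wraps a std::future<void>. The WaitHandle struct, something() and wait_for() are hypothetical names standing in for MirWaitHandle, mir_something() and mir_wait_for(); the real Mir signatures are not given in this thread.

```cpp
#include <cassert>
#include <future>

// Hypothetical stand-in for MirWaitHandle.
struct WaitHandle {
    std::future<void> done;
};

// Starts the operation; the result is written through the out-parameter
// before the handle becomes ready. The caller must keep *result alive
// until wait_for() has returned.
WaitHandle something(int input, int* result)
{
    return WaitHandle{
        std::async(std::launch::async, [input, result] { *result = input + 1; })};
}

// Blocks until the operation behind the handle has completed.
void wait_for(WaitHandle& w)
{
    assert(w.done.valid());
    w.done.wait();
}

int demo()
{
    int result = 0;
    WaitHandle w = something(41, &result);
    // ... other work could happen here ...
    wait_for(w);
    return result; // safe to read only after wait_for() returns
}
```

The trade-off in this design is that the caller owns the result storage, so no callback is needed, but the result must not be read (or destroyed) before the wait completes.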

Daniel van Vugt (vanvugt) wrote on 2013-02-04:

#8

Bumped to High. No matter how you look at it, the client API needs more work to be simpler and more digestible.

Changed in mir:
importance:	Medium → High

Michi Henning (michihenning) wrote on 2013-02-04:

#9

With the begin/end approach, waiting for multiple things is easy. For example, I can fire off five async calls and wait for all of them to complete like this:

a = begin_a();
b = begin_b();
c = begin_c();
a2 = begin_a();
b2 = begin_b();

This calls a() and b() twice (just to illustrate that this is possible), and c() once. Now I can wait like this:

a_r = end_a(a);
b_r = end_b(b);
c_r = end_c(c);
a2_r = end_a(a2);
b2_r = end_b(b2);

The last end_ call completes only once all the preceding ones have completed. (I can write the end_ calls in any order; the effect is the same.)

Implementing a synchronous call on top of the async version is trivial. (Illustrated here with an int return type.)

int
someCall()
{
    return end_someCall(begin_someCall());
}

If you want callbacks, you allow a functor to be passed as the first parameter of the begin_ call. The implementation of the overloaded begin_ method stashes the AsyncResultPtr (which is a smart pointer) into a queue or some such, together with the functor. A separate thread (or thread pool) then iterates over the queue and calls the functor, whose implementation calls the end_ method. When the end_ call completes, the functor calls the callback. Once the callback completes, the thread re-joins the thread pool.
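
A simplified C++ sketch of the callback overload. It assumes a std::shared_future as the result handle and uses one helper task per call in place of the queue and thread pool described above; AsyncResult here is a hypothetical alias, not Ice's AsyncResultPtr, and the returned std::future<void> is an addition for this sketch so the caller can tell when the callback has run.

```cpp
#include <cassert>
#include <functional>
#include <future>

// Hypothetical result handle (copyable, so it can be handed to a helper).
using AsyncResult = std::shared_future<int>;

AsyncResult begin_someCall()
{
    return std::async(std::launch::async, [] { return 7; }).share();
}

int end_someCall(const AsyncResult& r)
{
    assert(r.valid());
    return r.get(); // blocks until the result is available
}

// Overload taking a functor: a helper task blocks in end_someCall() and
// then passes the result to the callback. The returned future becomes
// ready once the callback has returned. Note that a future obtained from
// std::async blocks in its destructor, so discarding it makes the call
// effectively synchronous.
std::future<void> begin_someCall(std::function<void(int)> callback)
{
    AsyncResult r = begin_someCall();
    return std::async(std::launch::async,
                      [r, callback] { callback(end_someCall(r)); });
}

int demo()
{
    int seen = 0;
    std::future<void> done = begin_someCall([&seen](int v) { seen = v; });
    done.wait(); // wait for the callback to finish before reading 'seen'
    return seen;
}
```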

In a nutshell, that's all there is to it. In Ice, I implemented much of this by generating the C++ API on the fly from the interface definitions. Internally, it was done mostly with templates, so there wasn't actually that much code to generate. (Or, rather, I got the compiler to do much of the dirty work for me.)

There are a number of nice aspects about the begin_/end_ approach:

It's stateless, so I can have as many outstanding async calls as are needed without having to jump through hoops.

I can very easily fire off a call, do some more work, and then synchronously wait for call completion when it suits me, *without* having to write a callback. (I can just call the end_ method wherever it's convenient.)

If I want a callback, that's easily arranged for. Moreover, by using functors, there is no need to derive from a common base. (I really dislike designs that force the API client to derive from something I provide.)

I can have one thread start the call and have another thread complete the call, so there is no threading policy imposed on the application. All I need to do is pass the AsyncResult from the calling thread to the completion thread.

It's easy to move application state between the calling end and the completion end.

Everything can easily be made as type-safe as I like, to the point where the async call is just as type-safe as a synchronous call.

I don't have to make things type-safe. A single method can be used to process results from an arbitrary number of method invocations, if that's what I want, by providing the method name as a string.
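
Of the properties listed above, the thread hand-off is perhaps the easiest to illustrate. A minimal C++ sketch, assuming std::future as the result handle (AsyncResult, begin_someCall and end_someCall are hypothetical definitions written for this example): one thread starts the call, and the handle is moved to a second thread that completes it.

```cpp
#include <cassert>
#include <future>
#include <thread>
#include <utility>

// Hypothetical result handle.
using AsyncResult = std::future<int>;

AsyncResult begin_someCall()
{
    return std::async(std::launch::async, [] { return 5; });
}

int end_someCall(AsyncResult& r)
{
    assert(r.valid());
    return r.get();
}

int demo()
{
    // The calling thread starts the invocation...
    AsyncResult r = begin_someCall();

    // ...and hands the result handle to a different thread to complete it.
    int value = 0;
    std::thread completer(
        [&value](AsyncResult handed_over) { value = end_someCall(handed_over); },
        std::move(r));

    completer.join(); // after join(), 'value' is safe to read
    return value;
}
```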

Providing all of this for Mir is probably overkill. Much of what's in this machinery arose from Ice, which naturally has a type-unsafe dispatch mechanism beneath the covers (the Ice run-time), with a type-safe veneer layered on top by generating C++ code from interface definitions.

For MIR, it probably would be sufficient to stick with a hard...
