Re: [g-a-devel] [Accessibility] Re: [Accessibility-atspi] D-Bus AT-SPI - The way forward

From: Havoc Pennington <hp redhat com>
To: Rob Taylor <rob taylor codethink co uk>
Cc: gnome-accessibility-devel <gnome-accessibility-devel gnome org>, "dbus lists freedesktop org" <dbus lists freedesktop org>, accessibility-atspi-linux-foundation <accessibility-atspi lists linux-foundation org>, Mark Doffman <mark doffman codethink co uk>, kde-accessibility <kde-accessibility kde org>, accessibility-linux-foundation <accessibility lists linux-foundation org>, michael meeks novell com
Subject: Re: [g-a-devel] [Accessibility] Re: [Accessibility-atspi] D-Bus AT-SPI - The way forward
Date: Wed, 12 Dec 2007 16:08:09 -0500

Hi,

Rob Taylor wrote:

CC'ing the D-Bus mailing list as there's lots of interesting stuff here.

OK, this is about 400 topics in one email, cross-posted to 5 mailinglists ;-)

I do think the dbus list could be really helpful on a lot of this, andthat in general people extending or having trouble with dbus have beenmuch too slow to mail the list.

Can I propose that if people are seriously working on, or seriouslyhaving a problem with, any of these issues that they post a single-topicthread to the dbus list per-issue.

The vast majority of these issues have been discussed before, and thereis frequently even a "plan of record" and we just lack someoneimplementing it. A few of these issues are already solved, even, andsimply asking might save someone some trouble.

I'll comment in a random way on a few of the points in here below, but Idread a thread with many topics on 5 different lists, so can I ask thatfurther followup be single-topic, change the subject line, and drop thenon-dbus lists? (assuming the followup is about dbus generally and not a11y)



OK, some quick comments on specific points.

Performance
===

D-Bus performance is already well-known and pretty well understood; Igot the same results as the AtSpiDbusInvestigation page in 2004:

http://lists.freedesktop.org/pipermail/dbus/2004-November/001779.html
and updated information and suggestions related to this here:
http://lists.freedesktop.org/archives/dbus/2007-October/008822.html

The basic fact is that for a round-trip blocking method call there is aconstant-factor overhead of around 3x vs. raw sockets. (ORBit is not alot slower than raw sockets, iirc.) If you use the bus daemon, thisinherently doubles, since there are double the number of "hops." Lookingat the causes of this overhead, it is difficult to change fundamentally;I'm sure with effort you could get it down to 2x, and perhaps a bitbetter by writing a libdbus replacement with less flexibility. But youwon't get to 1x.

The constant factor falls out of a number of design decisions, some ofthem in the protocol, and others in libdbus. This was intentional. Myopinion is that these decisions were largely correct, but opinions vary.(Some of the design decisions are easy to explore changing, such aswhether to validate the data; see the archive links above.)

Given that fact, I don't think there is much more to say. There is noway application design should vary based on whether 2x or 3x wasmeasured. Unless you intend to hack on libdbus itself and want to drilldown into hotspots to fix them, what you as an app developer should havein your mind is "there's a single-digit constant-factor overhead vs. rawsockets" and "avoid round trips!!!!"

For AT-SPI I bet the bottom line is that you guys have got to reduce theamount of traffic and round trips, probably by changing or extending theAPI.

If you need something raw-socket-like, then stick to CORBA, or use D-Busfor discovery only then set up a custom socket channel, or whatever.There is no reason to drop CORBA when it is suitable. The reason fordbus is that (in my opinion) CORBA was not suitable for many desktopuse-cases. That does not mean CORBA is not suitable for AT-SPI.


Richer Introspection Data
===

Re: struct names in the introspection, etc. I feel sure there are oldthreads on this but am too lazy to dig them up.

Other than digging those up and posting the archive links, I guess thefirst step would be to write up the motivation and proposal on the list.In general it seems like a reasonable idea.


Interface Repository
===

We have certainly discussed on the list simply installing XML files sostatic languages can access them at compile time. Not sure of the statusof this, but it should amount to a spec patch, maybe even we alreadyhave a patch. There are definitely old threads and it's worth doing.

A runtime IR service, I'm not sure what it would be for. The dbusapproach is that each app provides its own introspection data.

This has pros and cons vs. keeping a global repository of types, but I'mskeptical that doing it *both* the central IR way and the introspectionway at the same time makes sense.


IDL vs. XML
===

Remember that writing the XML files by hand is not intended. It wouldnot be needed if we had reasonable tools.

The original vision, which I still think would be best, is that youimplement the object in some language. The XML is then generated byscanning that language - pulling docs from the language's native inlinedocs, pulling interfaces from the language's native interfaces, etc.This extracted XML can then be installed for use by static languages,for example. Definitely this is how the GLib bindings were intended towork; we did NOT want people to have to write an XML file then generatecode from it. (For the "server side" or object implementation, that is.)

A CORBA-like IDL fits into this same concept. The idea is that if youwanted to hand-write your IDL, maybe a nice CORBA-style syntax would bepreferred. No problem. You use libIDL to write a little tool to convertyour nice syntax to the XML format.

In other words, the XML format is a lingua franca. That's part of whyXML is used, because you can write quick tools and scripts in Python orPerl or whatever that manipulate the XML.



Passing Types With Objects/Structs
===

The major point here that I agree with: as Michael says, theintrospection calls probably add up to a lot of round trips, especiallyfor dynamic languages.

However, there are surely some good ways to optimize that which arebackward compatible and *simple* - i.e. that do not imply that dbus hasa global-across-all-processes type system or type repository, because itdoesn't and imo shouldn't.


<background digression>

A principle of the dbus design is that dbus is NOT a type system, it's amarshaling system.

That was part of the whole point of dbus vs. CORBA: you are supposed touse the type system *of your programming language* or *of your componentsystem*. dbus is a *marshaling* mechanism, not a universe of types. Fordbus, "struct A { int }" and "struct B { int }" are the same thing.

Unlike at least some of the old theories about how to use CORBA, dbus isNOT intended to be a component system; it is NOT intended to be a way todefine cross-language objects or types. It *can* be used to *remote* acomponent. See D-Bus FAQ for more on IPC vs. components.

The out-of-band introspection data is intended to provide optional hintsfor how to generate a language binding. But it's also intended that theintrospection data can be ignored; you can just treat the dbus messagesas raw structured data.

</background digression>

So, how would I approach the introspection round trips and/or bandwidth?I would think some combination of standard strategies, such as batchcalls to introspect multiple objects at once. Another possible approachwould be to have an implicit type repository *per application* -something like an extended Introspect() call that lets you specify anintrospection context, and in an introspection context the app wouldsend you each interface only once, and refer to it by reference thesecond and subsequent times.

Probably step 1 would be to profile the bottlenecks for the dynamicbindings that use introspection, with some common apps they might beintrospecting. We should not add a bunch of complexity on performancespeculation, only on performance data.

For the solution, I think it's important to keep the layering that theintrospection data is an *optional hint* that can be used to interpretdbus messages.


Object Paths
===

> In terms of attaching objects to a connection, it'd be really nice to
> have the attach method take not only a object path, but also a
> possible
> function for parsing the remaining components of a path whose prefix
> matches the given object path.

If I understand this correctly, this is already allowed. You canregister a handler for an entire subtree of the object path namespace.The intended usage of that is to allow you to do your own path tohandler mapping, e.g. the example I usually give is that you couldregister to handle "/documents" and then do your own interpretation of"/documents/0", "/documents/1", etc.


Object References
===

There has been past discussion, possibly worth digging up, about somestandard format for a complete "IOR" - which would include a serveraddress, optionally a bus name, and an object path. i.e. the info toallow you to create a DBusConnection and then create an object proxy.


The "shared connections" feature is intended to support this.

i.e. if you resolved this "IOR" you would have to get the DBusConnectionfrom the server address, then create your proxy. Without sharedconnections, you would create a DBusConnection per proxy, which would beabsurdly, horrifyingly inefficient.

Anyway, I think shared connections are the only hard part about thisfeature, and that part is already implemented.

I think the reason there's no "IOR" feature so far is because very fewpeople have needed it. It's rare to want to pass an object referencethat is "location independent" (not known to be on some specific bus orprovided by some other specific predefined program). When you're talkingto another program, and thus getting an object reference, you wouldnormally already know what bus that other program is on.

But, if someone thinks this feature through and codes it, it makessense, as I said the shared connections feature is already there andintended to support it.


Binary Introspection Data
===

I don't see how this is worth the enormous pain of reimplementing allkinds of stuff. It is not even clearly better than XML; there are plentyof contexts where XML is more convenient. And there is no provenperformance problem, or proof that binary would be dramatically better,or at least nobody has posted the proof where I've seen it.

I don't see a massive deprecation and reimplementation effort spanningquite a few projects, justified purely by subjective aesthetics.

In any case, I bet performance would be better addressed via themechanisms discussed earlier - batching up the introspection data, orallowing it to be passed "by reference" when the same app gets the sameinterface a second time. Certainly that seems like the thing to try first.


Perspective
===

Let me say again. I know it's a lot of fun to screw with componentsystems and type systems and IPC systems. (Obviously I've done itmyself.) However, we should not delude ourselves that this is especially*worthwhile* in most cases.

Where are apps having the most trouble, doing the most things wrong,etc.? Arcane improvements to the IPC system are not the answer.

Most of the problems are on a higher level. e.g. lack of convenience APIfor stuff like this:

http://svn.mugshot.org/dumbhippo/trunk/client/linux/src/hippo-dbus-helper.h

Or in GNOME, we aren't even using dbus for the baseline, simplefunctionality it already provides; e.g. there's still no single-instancesupport in gtk. Why would we be adding all kinds of new stuff to dbus,when we're still sucking at using the functionality we have?

Let's remember that DCOP was implemented in a very short period of time,and was dead simple - MUCH less complex than dbus is - and people usedit heavily and successfully for lots of real functionality.


Havoc

Follow-Ups:
- [g-a-devel] ATSPI over D-Bus (was Re: [Accessibility] Re: [Accessibility-atspi] D-Bus AT-SPI - The way forward)
  - From: Rob Taylor

References:
- [g-a-devel] D-Bus AT-SPI - The way forward
  - From: Mark Doffman
- Re: [g-a-devel] [Accessibility-atspi] D-Bus AT-SPI - The way forward
  - From: Michael Meeks
- Re: [g-a-devel] [Accessibility-atspi] D-Bus AT-SPI - The way forward
  - From: Mark Doffman
- Re: [g-a-devel] [Accessibility] Re: [Accessibility-atspi] D-Bus AT-SPI - The way forward
  - From: Rob Taylor

[Date Prev][Date Next] [Thread Prev][Thread Next] [Thread Index] [Date Index] [Author Index]