Re: [cuo-wg] An Evolved Approach to CDSI / Executable English

To:	bhaugh@xxxxxxx, "common upper ontology working group" <cuo-wg@xxxxxxxxxxxxxx>, "Schoening, James R C-E LCMC CIO/G6" <James.Schoening@xxxxxxxxxxx>
From:	"Adrian Walker" <adriandwalker@xxxxxxxxx>
Date:	Thu, 23 Nov 2006 11:45:25 -0500
Message-id:	<1e89d6a40611230845q5f1443c8i8605e1fb330842f4@xxxxxxxxxxxxxx>

Hi Brian --

Many thanks for your note.

You wrote...

...using natural language to communicate with the users/SMEs certainly seems essential. Previously, this has been accomplished via conversation between the techie modelers and the SMEs. But, restricted [emphasis added] English interfaces, like yours and others, offer the promise of more direct and less ambiguous communication between SMEs and the formal information model/ontology tools.

Agreed, English interfaces offer a lot of promise.   However, the Executable English interface supported by the Internet Business Logic system [1] is actually an unrestricted one.   The technology is different from that of CLCE [2] and similar systems. The vocabulary in [1] is open, and so -- to a very large extent -- is the syntax. There is no need for external dictionary or grammar construction or maintenance, yet the English semantics are strict. Since English is at best a moving target, this approach has many practical advantages. For example, it is not necessary to train authors or users to use a restricted subset of English.

In [1], the unrestricted vocabulary and largely open syntax, together with strict English semantics, is supported via a trade off. The idea is to keep the natural language component as simple as possible, while supporting SMEs and end users as they write executable content into a browser, using their own words and phrases. I'll be glad to discuss off-list how this works. Hopefully, folks who would like to evaluate whether the trade off is useful for their purposes will go hands on on the system to see for themselves that it works.

You wrote also...

I've been wondering if you have done a comparison of your Executable English compared to the Common Logic Controlled English (CLCE) form, especially with respect to expressive power, where CLCE seems quite strong.

Yes. Here's an informal comparison.

Since CLCE [2] maps to FOL, I believe it lacks:

A. Recursion    (needed for reasoning over hierarchies -- e.g. transitive over [3]),

B. Aggregation     (find the rolled up available funding from all departments [4])
                                 (find the total oil supply available [5])

C. Closed world negation    (if Adrian is not in the list of employees of IDA, then Adrian does not work for IDA).

[1] supports these. It also supports English explanations of reasoning that involves A, B, and C.

CLCE [2] supports open world negation, which is currently considered to be advantageous for the Semantic Web. However, in practice, most databases are used via closed world negation.

[1] does not support open world negation, but could be extended to do so, if a really strong practical motivating example could be found.

I hope this is helpful, and not too much technical detail.   Do you think there is any way of boiling this kind of discussion down so that it would be of interest to Jim for the paper?

                                                       Thanks, -- Adrian

[1] Internet Business Logic, online at www.reengineeringllc.com. Shared use is free.

[2] Common Logic Controlled English (CLCE). http://www.jfsowa.com/clce/specs.htm
[3] www.reengineeringllc.com/demo_agents/TransitiveOver1.agent

[4] www.reengineeringllc.com/demo_agents/FundManagement1.agent

[5] www.reengineeringllc.com/demo_agents/Oil-IndustrySupplyChain1Sqlmysql.agent

On 11/22/06, Haugh, Brian <bhaugh@xxxxxxx> wrote:

Adrian ,

I can see how you might get the impression that the user was left out. But, as I described the C2IEDM/GH5 effort, the users are ideally involved from the outset, although those users best be domain experts as well. There is the further question of how the translations or ontologies might evolve over time, which was not discussed. In any case, a strategy for involving the users and/or updating the results seems independent of whether you take a designed or an evolutionary approach. So, I wouldn't say it combines them, whether or not you use a machine executable form of a natural language.

But, using natural language to communicate with the users/SMEs certainly seems essential. Previously, this has been accomplished via conversation between the techie modelers and the SMEs. But, restricted English interfaces, like yours and others, offer the promise of more direct and less ambiguous communication between SMEs and the formal information model/ontology tools.

Incidentally, I've been wondering if you have done a comparison of your Executable English compared to the Common Logic Controlled English (CLCE) form, especially with respect to expressive power, where CLCE seems quite strong.

Brian

__________________________________________
Brian A. Haugh, Ph.D.
Institute for Defense Analyses
Information Technology and Systems Division Phone: (703) 845-6678
4850 Mark Center Drive                       Fax: (703) 845-6848
Alexandria, VA 22311-1882                    Email: bhaugh@xxxxxxx

From: cuo-wg-bounces@xxxxxxxxxxxxxx [mailto: cuo-wg-bounces@xxxxxxxxxxxxxx] On Behalf Of Adrian Walker
Sent: Wednesday, November 22, 2006 10:39 PM
To: bcox@xxxxxxxxxxxxxxx; common upper ontology working group
Subject: Re: [cuo-wg] An Evolved Approach to CDSI?

Brian, Brad --

One concern with the hi-tech and the low-tech approaches that you describe is that they both seem to be essentially "waterfall" in nature. That is, the experts hard code how things shall work, and the users have to live with the results. Even if the real world situation changes!

Of course that's a simplification. The expert-user-expert loop has to be closed somehow. But the result can be brittle operationally, and expensive to maintain.

If, on the other hand, we do the best we can to support the writing of executable knowledge at a level that users can read and -- with suitable permissions -- edit, then the results can hopefully be more agile and less expensive to maintain.   If the expert and user communities overlap in this way, the feedback loop can be very short. Also, knowledge flows in from the users at the cutting edge -- rather like Wikipedia and Web 2.0, only they are mainly for non-executable content.

To put it another way, we could combine the advantages of the designed and evolutionary approaches.

This is one of the possible advantages of the Wiki-like aspect of Executable English.

Just a late night thought.... What do you think?

                                                        Cheers,   -- Adrian

Internet Business Logic (R)
Executable open vocabulary English
Online at www.reengineeringllc.com
                                Shared use is free

Adrian Walker
Reengineering
Phone: USA 860 830 2085

On 11/22/06, Brad Cox, Ph.D. <bcox@xxxxxxxxxxxxxxx> wrote:

> "Pipelines," also known as "stovepipes," are not known to promote
> effective interoperability across systems, much less across domains.

We seem to have miscommunicated. Pipeline doesn't mean stovepipe, certainly
not in my domain. It means a linear chain of nodes, linearly connected by
interfaces. I never proposed pipelining as a solution, but as a concrete
example to show a lower bound (N-1) that is far smaller than the upper bound
(N*(N-1)). Real systems fall somewhere between.

The rest of your argument seems to hinge on that misunderstanding. Any system,
however connected, will have M interfaces between its nodes. The interfaces do
the mappings. Each translation is *NOT* likely to lose or distort information;
each translation works exactly the same way, day in, day out. The internet
would never work if bits degrade along the way.

The question is whether those mappings are derived from some explicit upper
ontology that understands all domains of interest, versus independently for
each interface by pairs of domain experts on each side of each interface.

The first approach relies on high-tech ontology specification and translator
derivation tools. The second one relies on human intelligence (domain experts)
feeding extremely low-tech tools (XSLT or slightly better).

Take your choice. My bet is on humans over AI every time.

--
Work: Brad Cox, Ph.D; Binary Group; Mail bcox@xxxxxxxxxxxxxxx
Home: 703 361 4751; Chat brdjcx@aim; Web http://virtualschool.edu

---------- Original Message -----------
From: "Haugh, Brian" <bhaugh@xxxxxxx>
To: "common upper ontology working group" < cuo-wg@xxxxxxxxxxxxxx>
Sent: Wed, 22 Nov 2006 18:38:34 -0500
Subject: Re: [cuo-wg] An Evolved Approach to CDSI?

> Perhaps some replies to issues raised by Brad Cox (in quotes) may help
> clarify some of the real problems faced by CDSI, which his proposal for
> an "evolved approach" does not adequately address.
>
> But, those who already feel that they understand the "N^2 problem" and
> its ramifications for Brad's proposal and CDSI may want to skip this.
> __________
> "I don't feel I understand what people mean by the term "N^2 problem....
> for N machines in a linear pipeline, the number of interfaces is N-1."
>
> "Pipelines," also known as "stovepipes," are not known to promote
> effective interoperability across systems, much less across domains.
> Translation pipelines, however, are even worse than traditional
> stovepipe systems, which may share some common internal representations.
> A translation "pipeline" would face the results you get from the
> children's game of "telephone", only worse. Each translation is likely
> to lose or distort some aspect of information obtained from a source
> with different semantic foundations, so by the end the results may be
> unrecognizable.
>
> While we may all agree that you won't have the worst N^2 case in every
> context, you really don't want a lot of linear pipelines translating
> between systems/domains. In addition, capability needs are driving
> requirements for direct interoperability between more and more
> systems/domains. Hence, the N^2 upper bound is a real concern.
> ____________
> "The approach doesn't much depend on what standard (language) is used."
>
> While the definition of the ("designed") approach is indedpendent of the
> language used; I expect all would agree that any success achieved by an
> application of this approach will be strongly dependent on the
> expressive power of the language, as well as its semantic foundations,
> breadth of adoption, adequacy of support tools, etc.
> ______________
> [In] "the evolved approach ...groups...address the problem in much the
> same way we solve inteoperability with natural languages; by using
> dictionaries and related tools, using interpreters, etc."
>
> While unclear, it sounds like the proposed "evolved approach" involves
> point-to-point translations between systems/domains and a competitive
> marketplace for achieving the best translations. If so, it seems to me
> that any faith in such an approach underestimates:
>
> 1) the number of point-to-point translation components required
>
> 2) the difficulty of doing translation between systems/domains with no
> common semantic foundation (you still need the "domain experts" as in
> the "designed approach")
>
> 3) the exhorbitant costs of supporting a competitive market place in
> point-to-point translation components. The market for customized
> components to translate between indivdual systems or even domains does
> not seem adequate to support an effective free market. Providing
> incentives to government employees (or contractors) to produce multiple
> competing translations doesn't sound real cost-effective either. Someone
> has to pay the labor for all those translations (Order K*N^2, where K is
> the number of competing solutions for each pairwise translation).
> General-purpose tools for dictionaries and translation cannot do the
> translations themselves, even if there were a viable market for them.
> __________________
> "But mainly because people just don't solve ontology differences that
> way in
>  the real (non-IT) world. They just buy a dictionary, or hire a
> translator.  Problem solved."
>
> This analogy with natural language translation by humans doesn't hold up
> too well for a number of obvious reasons. We don't want to use humans to
> do all real-time translations manually due to costs in labor, time, and
> money. Even existing translations of (& information extraction from)
> human readable natural language text is moving towards automation,
> (e.g., TIDES, ACE, AFE) due to volume and cost issues.
>
> So, I think we all recognize the needs for machines to do the
> cross-domain translations and to use the results for tasks like
> discovery and analysis. But, as acknowledged, machines are not as smart
> a people, so we can't just hand them a dictionary :-). To effectively
> translate and analyze information from multiple disparate sources,
> computers need software translation components that are grounded in
> formal semantics supporting automated inference. But, an "evolved
> approach" dependent on funding a competitive market in machine-readable
> translation components ("dictionaries") for all pariwise unrelated
> languages/systems/domains (which need to interoperate) doesn't sound
> real cost-effective (Order (K*N^2) is a real cost issue).
> __________________
>
> While the "evolved approach" that's been described has long been
> considered a non-starter by many, other concepts of "evolution" and
> "bottom-up" development in semantic interoperability might well have
> something to offer to the CDSI problem. There has been work on
> automating the extraction of ontology elements from text, as well as the
> evolution of ontologies over time to reflect changing usage. The work by
> John Sowa in this area, already cited in this forum, is one example of
> interest. Such approaches have evolved well beyond the simplistic "hire
> a translator" proposal. Still, they seem to need some more evolution
> before they are ready for prime time applications.
>
> And, while the "designed approach" has been dismissed, it has a track
> record of success in promoting data interoperability amongst disparate
> systems. The C2IEDM/GH5/JC3IEDM, which others have mentioned, is one
> good example wherein a multi-national group of domain experts
> collaborated to produce a common information model that serves as the
> "lingua franca" for operational interoperability in Command and Control
> across very different systems maintained by different governments. While
> such efforts have been unfairly disparaged as too "top-down", when
> successful they actually rely heavily on bottom-up inputs from Subject
> Matter Experts working closely with IT folks to get the information
> models to support user needs.
>
> That is not to say that the specific approach taken by the C2IEDM/GH5 is
> adequate for CDSI, as it falls short in its semantic content (being
> inadequate for machine "understanding") and it covers only one domain
> (Command and Control). Still, some variant of the generic "designed
> approach" might be adapted for CDSI by the use of a common upper
> ontology and/or a small set of upper/middle ontologies. But, I.
>
> Brian
> __________________________________________
> Brian A. Haugh, Ph.D.
> Institute for Defense Analyses
> Information Technology and Systems Division  Phone: (703) 845-6678
> 4850 Mark Center Drive                       Fax: (703) 845-6848
> Alexandria, VA 22311-1882                    Email: bhaugh@xxxxxxx
>
> > -----Original Message-----
> > From: cuo-wg-bounces@xxxxxxxxxxxxxx
> [mailto: cuo-wg-bounces@xxxxxxxxxxxxxx]
> > On Behalf Of Brad Cox, Ph.D.
> > Sent: Monday, November 20, 2006 3:38 PM
> > To: rick@xxxxxxxxxxxxxx; common upper ontology working group
> > Subject: Re: [cuo-wg] White Paper
> >
> > Thanks for the encouraging note, Richard. I'd backed off, convinced
> I'd
> > wasn't
> > being heard. But buoyed by your note, I'll take one more shot at
> > explaining
> > what I've been trying to get across.
> >
> > One of the things that's confusing me is I don't feel I understand
> what
> > people
> > mean by the term "N^2 problem". I'm guessing that's shorthand for
> costs
> > increaases as limitOf(N*(N-1)) as N -> infinity = N^2. Fair enough;
> its
> > shorter.
> >
> > But that applies if all N machines are to be connected to all N-1
> others.
> > Actually cost increases as the number of *interfaces*. N^2 is just an
> > upper
> > bound on that. But why concentrate on the upper bound when interfaces
> > could be
> > counted as easily, without the concern over whether upper bounds are
> > realistic? For example, for N machines in a linear pipeline, the
> number of
> > interfaces is N-1, hardly N^2 or even N*(N-1).
> >
> > So rephrasing the problem as one of semantic interoperability between
> M
> > interfaces where M is larger than we might like but still far less
> than
> > N*(N-1). I been trying to point out that there are two ways of
> approaching
> > that problem. I've called them the designed approach and the evolved
> > approach.
> >
> > In the designed approach, a (small) community of experts uses high
> > technology
> > ontology tools to build a generalized solution (upper ontology) that
> can
> > generate the mappings needed to make any given interface
> interoperable.
> > The
> > approach doesn't much depend on what standard (language) is used. I
> used
> > OWL
> > as my example because that's what I'm most familiar with. Structured
> > English,
> > structured french, or plain ol' Java/Cobol/Haskel would do about as
> well,
> > albeit with varying readibility. What's important here is that the
> > approach is
> > centrally planned, largely confined to an expert community, although
> > hopefully
> > with at least some support by domain experts with conflicting demands
> on
> > their
> > time.
> >
> > The evolved approach is entirely different and more bottom-up. M
> > interfaces
> > imply there are M  groups of individuals that care about making each
> > specific
> > interface (call it M(i)) interoperate. Those M groups are empowered
> > (governance?) to address the problem in much the same way we solve
> > inteoperability with natural languages; by using dictionaries and
> related
> > tools, using interpreters, etc. Dictionaries and interpreters are
> evolved
> > systems. Externally these are commercial products that compete with
> each
> > other
> > in a competive system (free markets). But I could well imagine that
> domain
> > experts within govt might produce translation dictionaries that might
> > compete
> > in a similar way, if govt could find a way to incentive them to focus
> on
> > the
> > problem over other pressing uses of their time.
> >
> > Point is, I could well see how the second (evolved) approach could
> "solve"
> > the interoperability problem" as I've stated it. I've much less
> confidence
> > (approaching zero) that the designed approach (as I defined it) ever
> > could.
> > This is partially because AI technology just isn't very smart, and
> > partially
> > because you still need domain experts and don't have a way to
> incentive
> > them
> > to contribute, since you've counted too heavily on high technology as
> the
> > sole
> > solution.
> >
> > But mainly because people just don't solve ontology differences that
> way
> > in
> > the real (non-IT) world. They just buy a dictionary, or hire a
> translator.
> > Problem solved.
> >
> > --
> > Work: Brad Cox, Ph.D; Binary Group; Mail bcox@xxxxxxxxxxxxxxx
> > Home: 703 361 4751; Chat brdjcx@aim; Web http://virtualschool.edu
> >
> >
>
>  _________________________________________________________________
> Message Archives: http://colab.cim3.net/forum/cuo-wg/
> Subscribe/Unsubscribe/Config: http://colab.cim3.net/mailman/listinfo/cuo-wg/
> To Post: mailto: cuo-wg@xxxxxxxxxxxxxx
> Community Portal: http://colab.cim3.net/
> Shared Files: http://colab.cim3.net/file/work/SICoP/cuo-wg/
> Community Wiki:
http://colab.cim3.net/cgi-bin/wiki.pl?SICoP/CommonUpperOntologyWG
------- End of Original Message -------

_________________________________________________________________
Message Archives: http://colab.cim3.net/forum/cuo-wg/
Subscribe/Unsubscribe/Config: http://colab.cim3.net/mailman/listinfo/cuo-wg/
To Post: mailto:cuo-wg@xxxxxxxxxxxxxx
Community Portal: http://colab.cim3.net/
Shared Files: http://colab.cim3.net/file/work/SICoP/cuo-wg/
Community Wiki: http://colab.cim3.net/cgi-bin/wiki.pl?SICoP/CommonUpperOntologyWG

_________________________________________________________________
Message Archives: http://colab.cim3.net/forum/cuo-wg/
Subscribe/Unsubscribe/Config: http://colab.cim3.net/mailman/listinfo/cuo-wg/
To Post: mailto:cuo-wg@xxxxxxxxxxxxxx
Community Portal: http://colab.cim3.net/
Shared Files: http://colab.cim3.net/file/work/SICoP/cuo-wg/
Community Wiki: http://colab.cim3.net/cgi-bin/wiki.pl?SICoP/CommonUpperOntologyWG

 _________________________________________________________________
Message Archives: http://colab.cim3.net/forum/cuo-wg/
Subscribe/Unsubscribe/Config: http://colab.cim3.net/mailman/listinfo/cuo-wg/
To Post: mailto:cuo-wg@xxxxxxxxxxxxxx
Community Portal: http://colab.cim3.net/
Shared Files: http://colab.cim3.net/file/work/SICoP/cuo-wg/
Community Wiki: 
http://colab.cim3.net/cgi-bin/wiki.pl?SICoP/CommonUpperOntologyWG    (01)

<Prev in Thread]	Current Thread	[Next in Thread>
Re: [cuo-wg] An Evolved Approach to CDSI / Executable English, Adrian Walker <= Re: [cuo-wg] An Evolved Approach to CDSI / Executable English, Haugh, Brian

Previous by Date:	[cuo-wg] Interoperability, John Flynn
Next by Date:	Re: [cuo-wg] An Evolved Approach to CDSI?, Brad Cox, Ph.D.
Previous by Thread:	[cuo-wg] Pay Hayes is correct, James R. Schoening
Next by Thread:	Re: [cuo-wg] An Evolved Approach to CDSI / Executable English, Haugh, Brian
Indexes:	[Date] [Thread] [Top] [All Lists]