zlacker

Great read. Surprised to read Wolfram never actually got to use CYC. Anyone here who has and can talk about its capabilities?

replies(4): >>stakha+87 >>gumby+bO >>nvm0n2+7R >>lispm+Ih1

>>Chaita+(OP)
I briefly looked into it many moons ago when I was a Ph.D. student working in the area of computational semantics in 2006-10. This was already well past the hayday of CYC though.

The first stumbling block was that CYC wasn't openly available. Their research group was very insular, and they were very protective of their IP, hoping to pay for their work through licensing deals and industry- or academic collaborations that could funnel money their way.

They had a subset called "OpenCYC" though, which they released more publicly in the hope of drawing more attention. I tried using that, but soon got frustrated with the software. The representation was in a CYC-specific language called "CycL" and the inference engine was CYC-specific as well and based on a weird description logic specifically invented for CYC. So you couldn't just hook up a first-order theorem prover or anything like that. And "description logic" is a polite term for what their software did. It seemed mostly designed as a workaround to the fact that open-ended inferencing of the kind they spoke of to motivate their work would have depended way too frequently on factoids of common sense knowledge that were missing from the knowledge base. I got frustrated with that software very quickly and eventually gave up.

This was a period of AI-winter, and people doing AI were very afraid to even use the term "AI" to describe what they were doing. People were instead saying they were doing "pattern processing with images" or "audio signal processing" or "natural language processing" or "automated theorem proving" or whatever. Any mention of "AI" made you look naive. But Lenat's group called their stuff "AI" and stuck to their guns, even at a time when that seemed a bit politically inept.

From what I gathered through hearsay, CYC were also doing things like taking a grant from the defense department, and suddenly a major proportion of the facts in the ontology were about military helicopters. But they still kept beating the drum about how they were codifying "common sense" knowledge, and, if only they could get enough "common sense" knowledge in there, they would break through a resistance level at some point, where they could have the AI program itself, i.e. use the existing facts to derive more facts by reading and understanding plain text.

replies(2): >>zozbot+Q7 >>Michae+Yn

>>stakha+87
Doesn't description logic mostly boil down to multi-modal logic, which ought to be representable as a fragment of FOL (w/ quantifiers ranging over "possible worlds")?

Description logic isn't just found in Cyc, either; Semantic Web standards are based on it, for similar reasons - it's key to making general inference computationally tractable.

replies(1): >>stakha+Aa

>>zozbot+Q7
I'm not trying to be dismissive of description logics. (And I'm not dismissive of Lenat and his work, either). A lot of things can fall under that umbrella term. The history of description logic may in fact be just as old as post-syllogism first-order predicate calculus (the syllogism is, of course, far older, dating back to Aristotle). In the Principia Mathematica there's a quantifier that basically means "the", which is incidentally also the most common word in the English language, and that can be thought of as a description logic too. But the perspective of a Mathematician on this is very different from that of an AI systems "practitioner", and CYC seemed to belong more to the latter tradition.

>>stakha+87
That's fascinating to read, thanks for sharing.

Did it ever do something genuinely surprising? That seemed beyond the state-of-the-art at the time?

replies(1): >>stakha+hx

>>Michae+Yn
One of the people from Cyc gave a talk at the research group I was in once and mentioned an idea that kind of stuck with me.

...sorry, it takes some building-up to this: At the time, a lot of work in NLP was focused on building parsers that were trying to draw constituency trees from sentences, or extract syntactic dependency structures, but do so in a way that completely abstracted away from semantics, or looked at semantics as an extension of syntax, but not venturing into the territory of inference and common sense. So, a sentence like "Green ideas sleep furiously" (to borrow from Chomsky's example), was just as good as a research object to someone doing that kind of research as a sentence that actually makes sense and is comprised of words of the same lexical categories, like "Absolute power corrupts absolutely". -- I suspect, that line of research is still going strong, so the past tense may not be quite appropriate here. I'm using it, because I have been so out of the loop since leaving academia.

The major problem these folk are facing is an exploding combinatorial space of ambiguity at the grammatical level ("I saw a man with a telescope" can be bracketed "I saw (a man) with a telescope" or "I saw a (man with a telescope)") and the semantic level ("Every man loves a woman" can mean "For every man M there exists a woman W, such that M loves W" or it can mean "There exists a woman W, such that for every man M it is true that M loves W"). Even if you could completely solve the parsing problem, the ambiguity problem would remain.

Now this guy from the Cyc group said: Forget about parsing. If you give me the words that are in the sentence and you're not even giving me any clue about how the words were used in the sentence, I can already look into my ontology and tell you how the ontology would be most likely to connect the words.

Now, the sentence "The cat chased the dog" obviously means something different from "The dog chased the cat" despite using the same words. But in most text genres, you're likely to only encounter sentences that are saying things that are commonly held as true. So if you have an ontology that tells you what's commonly held as true, that gives you a statistical prior that enables you to understand language. In fact, you probably can't hope to understand language without it, and it's probably the key to "disambiguation".

This thought kind of flipped my worldview upside down. I had always kind of thought of it as this "pipelined architecture" where you first need to parse the text, before it even makes sense to think about how to solve the problems of what to do with the output from that parser. But that was unnecessarily limiting. You can look at the problem as a joint-decoding problem, and it may very well be the case that the lion's share of entropy comes from elsewhere, and it may be foolish to go around trying to build parsers, if you haven't yet hooked up your system to the information source that provides the lion's share of entropy, namely common-sense knowledge.

Now, I don't think that Cyc had gotten particularly close to solving that problem either, and, in fact, it was a bit uncharacteristic for a "Cycler" to talk about statistical priors at all, as their work hadn't even gotten into the territory of collecting those kinds of statistics. But, as a theoretical point, I thought it was very valid.

>>Chaita+(OP)
Some of us who worked on Cyc commented in an earlier post about Doug's decease.

>>Chaita+(OP)
I played with OpenCyc once. It was quite hard to use because you had to learn things like CycL and I couldn't get their natural language processing module to work.

The knowledge base was impressively huge but it also took a lot of work to learn because at the lower levels it was extremely abstract. A lot of the assertions in the KB were establishing very low level stuff that only made sense if you were really into abstract logic or philosophy.

They made bold claims on their website for what it could do, but I could never reproduce them. There was supposedly a more advanced version called ResearchCyc though, which I didn't have access to.

replies(1): >>creer+Yg1

>>nvm0n2+7R
That was exactly my reaction to it: it seemed to require sooooo much background knowledge about the entire system to do anything. And because you were warned about issues with consistency it seemed you were warned about just fudging some things. That it was a quick way to an application that couldn't work. The learning curve seemed daunting.

>>Chaita+(OP)
Wolfram is able to write it in such a way that somehow it is mostly about him. :-(

There is some overlap between Cyc and his Alpha. Cyc was supposed to provide a lot of common sense knowledge, which would be reusable. When Expert Systems were a thing, one of the limiting factor were said to be limited amount of broader knowledge of the world. Knowledge a human learns by experience, interacting with the world. This would involve a lot of facts about the world and also about all kinds of exceptions (Example: a mother typically is older than its child, unless the child was adopted and the mother is younger). Cyc knows a lot of 'facts' and also many ways of logic reasoning plus many logic 'reasoning rules'.

Wolfram Alpha has a lot of knowledge about facts, often in some form of maths or somewhat structured data.

replies(1): >>dang+Ij1

>>lispm+Ih1
Ok, but let's avoid doing the mirror image thing where we make the thread about Wolfram doing that.

https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

replies(1): >>lispm+em1

>>dang+Ij1
Well, it's a disappointing and shallow read, because the topic of the usefulness of combining Cyc and Alpha would have been interesting.

replies(1): >>dang+Kj2

>>lispm+em1
Wolfram writes good historical articles. One just needs to put on some glasses that filter out the annoyance part of the spectrum.