If this seems like a bad idea, it's because it is.
And you know what? It should act as a warning to anybody thinking seriously about building large-scale semantic systems like the semantic web.
Cyc is pretty much useless (or at least the utility it provides is not exactly setting the world on fire). Until somebody can figure out how to employ an already very mature semantic graph like Cyc and do something(s) truly useful with it, don't waste anybody's time advocating the semantic web.
More imporantly, Cyc is a highly controlled semantic graph, with all of its information carefully curated. Imagine how terrible the results of semantic inference engines will be in the wild-west of the web!
I think the CYC project should be used only as an example of spending a huge amount of time and money on a flawed structure as opposed to a condemnation of semantic approaches.
(In mathematics you don't technically need very many axioms or inference rules, although you do need a large body of heuristics and hints if you want a proof system which confines itself to proving inferences that are actually useful in some sense of the word, rather than proving a combinatorial explosion of trivial theorems. Dealing with the body of elementary arithmetic problems, however, wouldn't necessarily be particularly intractable -- last time I checked software proof systems can already deal with proving theorems in areas of mathematics like this and a fair bit further.)
Douglas Lenat: http://en.wikipedia.org/wiki/Douglas_Lenat Eurisco: http://en.wikipedia.org/wiki/Eurisko
OpenCyc's purpose is to spread the use of the Cyc ontology in the Semantic web and Linked Data.
Given that human knowledge encompasses more than one axiomatic system, it would be foolish to endow a system designed to replicate human knowledge with an immutable set of axioms.
Please watch this presentation of Richard Feynman on the nature of maths and physics: http://www.feynmanphysicslectures.com/relation-of-mathematic...
Ideas that Lenat promoted with Eurisko and Cyc have made him successful by most criteria. But it would be better had he published a proven finished product, including it's innards. My feeling is that, in publishing, one should show the code or it never happened.
The internal mental furnishings of most human beings are seemingly full of these, however.
I think Cyc is a joke, too, but your argument doesn't hold.
I am however, trying to draw an analogy between Cyc and an open Semantic Web. If Cyc has issues like the ones you describe, these issues will only be magnified ad infinitum in a completely open graph where there are effectively no controls.
http://sw.opencyc.org/2009/04/07/concept/en/Game
"A specialization of DevisedStructuredActivity. Each instance of Game is an abstraction of a game that is played according to a semi-rigid set of rules. Each instance includes both the rules (see GameRulesFn) and a specification of any physical components required for play (instances of GameBoard, Ball, etc.). Neither Events of playing games (instances of PlayingAGame) nor any physical components required for play (e.g. GameBoards) are instances of Game."
Or do they mean something else, and if so, how would I find that something else (what formats/names) in their downloads?
(The highly contextual ones do pretty well in the market, though.)