My bet, judging mostly from my failed attempts at playing with OpenCyc around 2009, is that Cyc has always been too closed and too complex to tinker with. That doesn't play nicely with academic work. When people finish their PhDs and start working for OpenAI, they simply don't have Cyc in their toolbox.
[1] https://www.sciencedirect.com/science/article/pii/S089360802...
Now it's clear that knowledge graphs are far inferior to deep neural nets, but even so, few people can explain the _root_ reason why.
I don't think Lenat's bet was a waste. I think it was sensible based on the information at the time.
The decision to research it largely in secret, closed source, was, I think, a mistake.
No. It depends. In general, two technologies can’t be assessed independently of the application.
_The_ (one) root reason? Ok, I’ll bite.
But you need to define your claim. What application?
[1] https://voidfarer.livejournal.com/623.html
You can label it "bad idea" but you can't bring LLMs back in time.
In fact, if you have a graph and a path-weighting model (an RNN, a TDCNN, or a Transformer), you can use beam search to evaluate paths through the graph.
There isn't any class of problems deep nets can't handle. Will they always be the most efficient or best-performing solution? No, but a deep-net solution will be possible.
And we don't call them hallucinations, but GOFAI mispredicts plenty.
However, I think they have a good excuse for 'Why didn't it ever have the impact that LLMs are having now?': lack of data and lack of compute.
And it's the same excuse that neural networks themselves have: back in those days, we just didn't have enough data, and we didn't have enough compute, even if we had the data.
(Of course, we learned in the meantime that neural networks benefit a lot from extra data and extra compute. Whether that can be brought to bear on Cyc-style symbolic approaches is another question.)
Cyc also has the equivalent of hallucinations, when its definitions don't cleanly apply to the real world.
Cyc was able to produce an impact; I keep pointing to MathCraft [1], which, in 2017, did not have a rival in neural AI.
[1] https://www.width.ai/post/what-is-beam-search
It is even possible to have a 3-gram model output better text predictions if you combine it with beam search.
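For the curious, here is a minimal sketch (my own, not from the linked article) of what that looks like: beam search over a toy graph with a pluggable path-scoring function standing in for the n-gram/RNN/Transformer model.

```python
import heapq

# Toy sketch: beam search over a small graph. The "model" here just sums
# made-up edge weights; an n-gram LM, RNN, or Transformer score could be
# dropped in as the path-scoring function instead.
GRAPH = {
    "a": [("b", 0.6), ("c", 0.4)],
    "b": [("d", 0.9)],
    "c": [("d", 0.3), ("e", 0.7)],
    "d": [],
    "e": [],
}

def beam_search(start, beam_width=2, max_steps=3):
    beams = [(0.0, [start])]                       # (path score, path)
    for _ in range(max_steps):
        candidates = []
        for score, path in beams:
            successors = GRAPH[path[-1]]
            if not successors:                     # dead end: carry the path forward
                candidates.append((score, path))
                continue
            for node, weight in successors:
                candidates.append((score + weight, path + [node]))
        # keep only the beam_width highest-scoring partial paths
        beams = heapq.nlargest(beam_width, candidates, key=lambda c: c[0])
    return beams

print(beam_search("a"))   # [(1.5, ['a', 'b', 'd']), (1.1, ['a', 'c', 'e'])]
```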
I'll tell you how I know /
I read it in the paper /
Fifteen years ago -
(John Prine)
If that is so, then symbolic AI does not scale easily, because you cannot feed inconsistent information into it. Compare this to how humans and LLMs learn: both have no problem with inconsistent information. Yet, statistically speaking, humans can easily produce "useful" information.
FWIW, KGs don't have to be brittle. Or, at least, they don't have to be as brittle as they've historically been. There are approaches (like PROWL [1]) to making graphs probabilistic, so that they assert subjective beliefs about statements instead of absolute statements. The strength of those beliefs can then increase or decrease in response to new evidence (per Bayes' theorem). Probably the biggest problem with this stuff is that it tends to be crazy computationally expensive.
Still, there's always the chance of an algorithmic breakthrough or just hardware improvements bringing some of this stuff into the realm of the practical.
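As a rough illustration of the idea (my own toy example, not PROWL's actual machinery), a statement in the graph can be held as a degree of belief and nudged by Bayes' theorem as evidence arrives:

```python
# Minimal sketch: a KG statement held as a degree of belief and updated with
# Bayes' theorem, instead of being asserted as an absolute fact.
def bayes_update(prior, p_evidence_given_true, p_evidence_given_false):
    """Posterior P(statement | evidence) via Bayes' theorem."""
    numerator = p_evidence_given_true * prior
    denominator = numerator + p_evidence_given_false * (1.0 - prior)
    return numerator / denominator

# Hypothetical triple ("Tweety", "canFly", true) with a prior belief of 0.9.
belief = 0.9
# Evidence that is unlikely if the statement is true (0.2) and likely if false (0.8).
belief = bayes_update(belief, 0.2, 0.8)
print(round(belief, 3))   # ~0.692: the belief weakens rather than flipping to a hard "false"
```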
Google has its own Knowledge Graph, with billions of daily views, which is a wider but shallower version of Cyc. It is unclear whether LLMs' user-facing impact has surpassed that project's.
"Mathematical framework and rules of paraconsistent logic have been proposed as the activation function of an artificial neuron in order to build a neural network"
What is obvious to you is not obvious to others. I recommend explaining and clarifying if you care about persuasion.
You've overstated/exaggerated the claim. A narrower version of the claim is more interesting and more informative. History is almost never as simple as you imply.
This assumes that all classes of problems reduce to functions which can be approximated, right, per the universal approximation theorems?
Even for cases where the UAT applies (which is not everywhere, as I show next), your caveat understates the case. There are dramatically better and worse algorithms for differing problems.
But I think a lot of people (including the comment above) misunderstand or misapply the UATs. Think about the assumptions! UATs assume a fixed length input, do they not? This breaks a correspondence with many classes of algorithms.*
## Example
Let's make a DNN that sorts a list of numbers, shall we? But we can't cheat and only have it do pairwise comparisons -- that is not the full sorting problem. We have to input the list of numbers and output the list of sorted numbers. At run-time. With a variable-length list of inputs.
So no single DNN will do! For every input length, we would need a different DNN, would we not? Training this collection of DNNs will be a whole lot of fun! It will make Bitcoin mining look like a poster-child of energy conservation. /s
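To make the fixed-input-dimension point concrete, here is a minimal sketch (mine, assuming PyTorch, which the comment never mentions): the network's shape is baked in at construction time, so each list length gets its own model.

```python
import torch
import torch.nn as nn

# Sketch: a plain MLP whose input and output dimensions are fixed when it is
# built. "Sorting" lists of a different length would require a separately
# constructed (and separately trained) network.
def make_fixed_length_sorter(n: int) -> nn.Module:
    return nn.Sequential(
        nn.Linear(n, 4 * n),
        nn.ReLU(),
        nn.Linear(4 * n, n),   # n outputs, hopefully the sorted list after training
    )

sorter_5 = make_fixed_length_sorter(5)    # only ever accepts length-5 inputs
sorter_6 = make_fixed_length_sorter(6)    # an entirely different network
print(sorter_5(torch.rand(1, 5)).shape)   # torch.Size([1, 5]) -- untrained output
```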
* Or am I wrong? Is there a theoretical result I don't know about?
I don't have time to fully refute this claim, but it is very problematic.
1. Even a very narrow framing of how neural networks deal with inconsistent training data would perhaps warrant a paper, if not a Ph.D. thesis. Maybe this has already been done? Here is the problem statement: given a DNN with a given topology, trained with SGD and a given error function, what happens when you present flatly contradictory training examples? What happens when the contradiction doesn't emerge until deeper levels of the network? Can we detect this? How? (A toy sketch follows this list.)
2. Do we really _want_ systems that passively tolerate inconsistent information? When I think of an ideal learning agent, I want one that would engage in the learning process and seek to resolve any apparent contradictions. I haven't actively researched this area, but I'm confident that some have, if only because Tom Mitchell at CMU emphasizes different learning paradigms in his well-known ML book. So hopefully enough people reading that think "yeah, the usual training methods for NNs aren't really that interesting ... we can do better."
3. Just because humans 'tolerate' inconsistent information in some cases doesn't mean they do so well, as compared to ideal Bayesian agents.
4. There are "GOFAI" algorithms for probabilistic reasoning that are in many cases better than DNNs.
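To make point 1 above concrete, here is a toy sketch (mine, assuming PyTorch) of the simplest possible case: the same input presented with two flatly contradictory targets under MSE and SGD.

```python
import torch
import torch.nn as nn

# Toy sketch: train a one-parameter model on the same input labeled two
# contradictory ways and see what MSE + SGD converge to.
torch.manual_seed(0)
model = nn.Linear(1, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()

x = torch.tensor([[1.0], [1.0]])   # identical inputs
y = torch.tensor([[0.0], [1.0]])   # flatly contradictory targets

for _ in range(500):
    opt.zero_grad()
    loss_fn(model(x), y).backward()
    opt.step()

print(model(torch.tensor([[1.0]])).item())   # ~0.5: the contradiction is silently averaged
```

The network quietly averages the contradiction rather than flagging it, which is exactly the kind of passive tolerance point 2 argues we should want to do better than.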
The grand goal of AI is a general learner that can at least tackle any kind of problem we care about. Are DNNs the best-performing solution for every problem? No, and I agree on that. But they are applicable to a far wider range of problems. There is no question which is the better general-learner paradigm.
>* Or am I wrong? Is there a theoretical result I don't know about?
Thankfully, we don't need to get into theoretical arguments. Go ask GPT-4 to sort an arbitrary list of numbers. Change the length and try again.
Pardon the cliffhanger style.
I have begun crafting an explanation, but I'm not sure when it will be ready.
But when you recognize that thinking predates symbolic language, and start thinking about what thinking needs, you get closer to the answer.