1. kracke+(OP) 2025-04-09 06:06:19
>I would be interested in reading a paper that does a good job of explaining what a parameter ends up representing in an LLM model.

https://distill.pub/2020/circuits/ https://transformer-circuits.pub/2025/attribution-graphs/bio...

replies(1): >>ChuckM+6t1
2. ChuckM+6t1 2025-04-09 17:43:53
>>kracke+(OP)
That's an interesting paper and worth reading. I'm not sure it answers my question, but I did learn some things from it that I had not considered.

This was the quote that resonated with me :-)

"... the discoveries we highlight here only capture a small fraction of the mechanisms of the model."

It sometimes feels a bit like cellular biology papers that discuss DNA: the descriptions of the enzymes and proteins involved are insightful, but the mechanism that drives the reaction remains opaque.
