zlacker

[parent] [thread] 31 comments
1. rlili+(OP)[view] [source] 2025-12-06 15:18:48
Makes me wonder if decompilation could eventually become so trivial that everything would become de-facto open source.
replies(11): >>Xmd5a+11 >>js8+ma >>Viking+oa >>Aeolun+xc >>jasonj+bd >>DrNosf+ze >>tcdent+Ze >>johnfn+aC >>anabis+MO >>jonhoh+qQ >>TheAce+6b1
2. Xmd5a+11[view] [source] 2025-12-06 15:26:24
>>rlili+(OP)
This deserves a discussion
replies(2): >>ronsor+M2 >>stevem+Aa
◧◩
3. ronsor+M2[view] [source] [discussion] 2025-12-06 15:37:05
>>Xmd5a+11
I've used LLMs to help with decompilation since the original release of GPT-4. They're excellent at recognizing the purpose of functions and refactoring IDA or Ghidra pseudo-C into readable code.
replies(2): >>galang+m3 >>eurode+K4
◧◩◪
4. galang+m3[view] [source] [discussion] 2025-12-06 15:42:19
>>ronsor+M2
How does it do on things that were originally written in assembly?
replies(1): >>saagar+j5
◧◩◪
5. eurode+K4[view] [source] [discussion] 2025-12-06 15:54:43
>>ronsor+M2
Someone please try this on an original (early 1980s) IBM-PC BIOS.
replies(2): >>mh-+my >>tadfis+1N
◧◩◪◨
6. saagar+j5[view] [source] [discussion] 2025-12-06 15:58:31
>>galang+m3
This is typically easier because the code was written for humans already.
7. js8+ma[view] [source] 2025-12-06 16:36:50
>>rlili+(OP)
Yes, I believe it will. What I predict will happen is that most commercial software will be hosted and provided through "trusted" platforms with limited access, making reverse engineering impossible.
8. Viking+oa[view] [source] 2025-12-06 16:36:58
>>rlili+(OP)
I wonder when you're never going to run expensive software on your own CPU.

It'll either all be in the cloud, so you never run the code...

Or it'll be on a chip, in a hermetically sealed usb drive, that you plug in to your computer.

◧◩
9. stevem+Aa[view] [source] [discussion] 2025-12-06 16:39:31
>>Xmd5a+11
We're very far away from this.
10. Aeolun+xc[view] [source] 2025-12-06 16:55:01
>>rlili+(OP)
When the decompilation like that is trivial, so is recreation without decompilation. It implies the LLM know exactly how thins work.
11. jasonj+bd[view] [source] 2025-12-06 17:01:00
>>rlili+(OP)
It would be "source available", if anything, not "open source".

> An open-source license is a type of license for computer software and other products that allows the source code, blueprint or design to be used, modified or shared (with or without modification) under defined terms and conditions.

https://en.wikipedia.org/wiki/Open_source

Companies have been really abusing what open source means- claiming something is "open source" cause they share the code and then having a license that says you can't use any part of it in any way.

Similarly if you ever use that software or depending on where you downloaded it from, you might have agreed not to decompile or read the source code. Using that code is a gamble.

replies(4): >>sa1+Gd >>DrNosf+go >>yieldc+Mr >>mkatx+wA
◧◩
12. sa1+Gd[view] [source] [discussion] 2025-12-06 17:04:44
>>jasonj+bd
But clean room reverse engineered code can have its own license, no?
replies(3): >>simonw+2f >>vunder+ty >>comex+I91
13. DrNosf+ze[view] [source] 2025-12-06 17:12:02
>>rlili+(OP)
This day will arrive.

And it will be great for retro game preservation.

Having more integrated tools and tutorials on this would be awesome.

14. tcdent+Ze[view] [source] 2025-12-06 17:14:24
>>rlili+(OP)
That's definitely a possible future abstraction and one are about the future of technology I'm excited about.

First we get to tackle all of the small ideas and side projects we haven't had time to prioritize.

Then, we start taking ownership of all of the software systems that we interact with on a daily basis; hacking in modifications and reverse engineering protocols to suit our needs.

Finally our own interaction with software becomes entirely boutique: operating systems, firmware, user interfaces that we have directed ourselves to suit our individual tastes.

◧◩◪
15. simonw+2f[view] [source] [discussion] 2025-12-06 17:15:01
>>sa1+Gd
Yeah, I think it can. I'm reminded of the thing in the 80s when Compaq reverse engineered and reimplemented the IBM BIOS by having one team decompile it and write a spec which they handed to a separate team who built a new implementation based on the spec.

I expect that for games the more important piece will be the art assets - like how the Quake game engine was open source but you still needed to buy a copy of the game in order to use the textures.

◧◩
16. DrNosf+go[view] [source] [discussion] 2025-12-06 18:30:48
>>jasonj+bd
But, for example, isn't Cannonball (SEGA Outrun source port) open source?

https://github.com/djyt/cannonball

replies(1): >>jasonj+Xs
◧◩
17. yieldc+Mr[view] [source] [discussion] 2025-12-06 18:56:27
>>jasonj+bd
Open source never meant free to begin with and was never software specific, that’s a colloquialism and I’d love to say “language evolves” in favor of the software community’s use but open source is used in other still similar contexts, specifically legal and public policy ones

FOSS specifically means/meant free and open source software, the free and software words are there for a reason

so we don’t need another distinction like “source available” that people need to understand to convey an already shared concept

yes, companies abuse their community’s interest in something by blending open source legal term as a marketing term

replies(2): >>jasonj+tt >>virapt+ID
◧◩◪
18. jasonj+Xs[view] [source] [discussion] 2025-12-06 19:06:19
>>DrNosf+go
No it is not. There is no license in that repository.

Relevant: https://github.com/orgs/community/discussions/82431

> When you make a creative work (which includes code), the work is under exclusive copyright by default. Unless you include a license that specifies otherwise, nobody else can copy, distribute, or modify your work without being at risk of take-downs, shake-downs, or litigation. Once the work has other contributors (each a copyright holder), “nobody” starts including you.

https://choosealicense.com/no-permission/

replies(1): >>seba_d+2e1
◧◩◪
19. jasonj+tt[view] [source] [discussion] 2025-12-06 19:10:07
>>yieldc+Mr
Whether or not something is "free" is a separate matter and subject to how the software is licensed. If there is no license it is, by definition "source available", not open source. "source available" is not some new distinction I'm making up.

See my other comment: >>46175760

◧◩◪◨
20. mh-+my[view] [source] [discussion] 2025-12-06 19:50:21
>>eurode+K4
Got a bin?
◧◩◪
21. vunder+ty[view] [source] [discussion] 2025-12-06 19:51:12
>>sa1+Gd
In fact, the story of how Atari tried to circumvent the lockout chip on the original NES is a good example of this.

They had gotten surprisingly close to a complete decompilation, but then they tried to request a copy of the source code from the copyright office citing that they needed it as a result of ongoing unrelated litigation with Nintendo.

Later on this killed them in court.

◧◩
22. mkatx+wA[view] [source] [discussion] 2025-12-06 20:08:05
>>jasonj+bd
So instead of reverse engineering.. an llm/agent/whatever could simply produce custom apps for everyone, simply implementing the features an individual might want. A more viable path?
23. johnfn+aC[view] [source] 2025-12-06 20:24:54
>>rlili+(OP)
Surely then people start using LLMs to obfuscate compiled source to the point that another LLM can’t deobfuscate it. I imagine it’s always easier to make something messy than clean. Something like a rule of thermodynamics or something :)

Though, that’s only for actively developer software. I can imagine a great future where all retro games are now source available.

replies(1): >>tuhgde+lD
◧◩
24. tuhgde+lD[view] [source] [discussion] 2025-12-06 20:35:39
>>johnfn+aC
But on the other hand, at the current speed of LLM progression, a game that might have been obfuscated with the help of Opus 4.5 might in two years be decompiled within hours by Opus 6.5.
◧◩◪
25. virapt+ID[view] [source] [discussion] 2025-12-06 20:40:59
>>yieldc+Mr
This is not a space for "language evolves". Open source has very specific definitions and the distinctions there matter for legal purposes https://opensource.org/licenses
replies(1): >>yieldc+UE
◧◩◪◨
26. yieldc+UE[view] [source] [discussion] 2025-12-06 20:51:35
>>virapt+ID
the software community is the one trying to evolve the language in favor of this software license specific use case
◧◩◪◨
27. tadfis+1N[view] [source] [discussion] 2025-12-06 22:08:46
>>eurode+K4
I don't believe that was written in a compiled language, so any old 8086 disassembler should suffice. I would love to see what comments an LLM adds to the assembly code, though.
28. anabis+MO[view] [source] 2025-12-06 22:24:23
>>rlili+(OP)
Would some sparks fly when easy decompile of MSOffice and Photoshop are available, I wonder.
29. jonhoh+qQ[view] [source] 2025-12-06 22:37:46
>>rlili+(OP)
That runs into copyright issues. As someone who does a reasonable amount of decompilation, I wouldn’t ever use an LLM. It falls too close to mechanical transformation territory which is not protected, fair use.

Obviously others aren’t concerned or don’t live in jurisdictions where that would be an issue.

◧◩◪
30. comex+I91[view] [source] [discussion] 2025-12-07 01:31:29
>>sa1+Gd
If we're talking about actual clean-room reverse engineering where only the overall design or spec is copied and not the specific code, then yes. In this process, one person would decompile the original and turn it into a human-readable spec, and another person would write their own implementation. But the decompiled code itself is never distributed.

That's very different from the decompilation projects being discussed here, which do distribute the decompiled code.

These decompilation projects do involve some creative choices, which means that the decompilation would likely be considered a derivative work, containing copyrightable elements from both the authors of the original binary and the authors of the decompilation project. This is similar to a human translation of a literary work. A derivative work does have its own copyright, but distributing a derivative work requires permission from the copyright holders of both the original and the derivative. So a decompilation project technically can set their own license, and thereby add additional restrictions, but they can't overwrite the original license. If there is no original license, the default is that you can't distribute at all.

31. TheAce+6b1[view] [source] 2025-12-07 01:53:36
>>rlili+(OP)
If progress continues, someday it'll be possible to generate the source code for any binary and make a native port to any other platform. Some companies might be upset, but it'll be a huge boon for game and software preservation.
◧◩◪◨
32. seba_d+2e1[view] [source] [discussion] 2025-12-07 02:36:46
>>jasonj+Xs
There is a license: https://github.com/djyt/cannonball/blob/master/docs/license....

...but it's very clearly not an open source license.

[go to top]