So there are multiple sites&groups that pirate video games especially on PC. I was wondering if there are places on the internet where you find source code for games especially the highly modifiable ones like Half Life 2/Portal and Skyrim. Or groups that crack into the source code of games (or even software in general), not only for PC maybe PS, XBox or mobile too, and share it. I just wanted to see some code samples of games or their engines, maybe I get hooked into video game design. Shout out to Valve for sharing a lot about the creation of Half Life 1
Well, source code is not sth that you “crack”, you can only reverse engineer it (I think it was done with Doom, also OpenRA) or steal it from the company’s servers. The use for it is also rather niche, so the risk vs gains ratio is not attractive enough to feed dedicated websites. You can also look at fully open source games like 0AD and check out what they did!
Edit: I stand corrected (thrice); Doom was indeed open-sourced, not reverse engineered. Thanks for pointing out!
Doom was open sourced later. An example of a game that got reverse engineered just fine is Super Mario 64
Or the time the people who made doom reverse engineered super mario 3 for PC, then turned it into commander keen after Nintendo was unimpressed.
It wasn’t quite reverse engineered. They found a hacky way to bypass hardware limitations and basically duplicated the game.
Also Minecraft was reverse engineered.
though mojang did eventually publish name maps
Yeah but that was more of a “if we can’t beat 'em, join 'em” thing and I believe some, if not most of the modders don’t even use the official mappings and prefer the cracked version
I need this to happen to Gran Turismo 4. Series has gone so far downhill since that entry…
No Doom was made open source, same with most of the old Id games.
I was once at a talk by someone in that company and he straight up said that open sourcing it was a mistake. I assume because that meant they couldn’t sell us a thousand versions of it like Skyrim.
No word of whether its ongoing popularity was at all caused by open sourcing it.
The engine is Open Source, not the game itself. You still need to buy it.
The game and the engine are both open source. The game’s assets just aren’t freely available, so you still need an official WAD or an asset replacement pack like Freedoom.
Doom is open source, at least the original ones were
There’s also Diablo with Devilution.
Everything is open source if you can read assembly
Machine code tells you what it’s doing. Source code tells you why.
Only if they wrote why in it though, plenty of people (unfortunately myself included) fail or forget to add meaningful comments or they let their comments go stale when making changes by forgetting to update them (I do it a lot too), and some people also use horrible function names that don’t make any sense.
So it only really applies to source code intended to be released where care was made to ensure it would be readable, it might not apply for source code never intended to be public, such as stolen, leaked, or posthumously released. In this case the only real benefit is that it can be recompiled on different architectures provided there isn’t a dependency issue preventing that.
Source code without comments is still way easier to read than machine code.
And there are very likely comments anyways.
Multiple Devs will work on the same code for years and of course they need or at least appreciate comments.
But even if you can intuitively understand what the machine code is you’ll still need to convert it by hand back into something more portable, or to the machine code of another platform you might want to run it on. There’s not really an easy automated way to do that, even when playing dirty.
That’s the hard part, getting it into a different form, such as from x86 to ARM or from 6502 RISC to x86.
More powerful decompilers would definitely be a very useful application of AI!
Far Cry https://github.com/manfromafar/far-cry-source/tree/main/SourceCode
Super Mario 64 https://github.com/n64decomp/sm64
Zelda Ocarina of Time
https://github.com/zeldaret/ootThank you for sharing!
I mean, if you want to see some games’ source code you don’t have to rely on piracy. As other people have already said, there are open source games, some developers of older games have officially released the source code (notably VVVVVV, doom, and also quake iirc), some devs have released important part of their source code (e g the entire inputs handling code of Celeste).
Additionally, the vast majority of all Unreal Engine games’ engine code, including huge AAAs like Fortnite, is in Unreal Engine (duh), which is
open sourcesource-available.UE is source available, not open source.
From what I understand: During the compilation of the code it becomes unreadable to humans, needing to be reverse engineered, which is entails insane amounts of work. So, unless there is a leak or the game isn’t fully compiled like (I think) Unity games, it will be unlikely to find source code.
Yes and no.
VERY oversimplifying but:
The basic flow of compilation is (human readable-ish) source code becomes symbols which the compiler then applies optimizations to before converting into machine code.
With no optimizations, it is a pretty trivial process to revert back. You lose the names of variables and functions (unless they were added as annotations by the compiler, which they often are for many reasons) but you get the logic.
With optimizations, things get messier. An example that is pretty popular and useful to understand this is https://en.wikipedia.org/wiki/Fast_inverse_square_root The coder likely wrote
invsqrt(foo)
, expecting this to map to a series of hardware functions in the floating point unit. Instead, they see that it became a series of integer operations and bit shifts and get very very confused. Its like that, except imagine an entire 5000 line file becomes two instructions.That said, compilers are stupid and most people (rightfully) use libraries to a large portion. So the above example would actually not happen because they would just see the instructions to do a function call labeled “fast_invsqrt” and not care. Similarly, basic heuristics (or, if you are trying to get funding, machine learning and neural nets) can be applied to figure out what was done in heavily optimized blocks of code. So the resulting “source code” might be something like
int foo(int bar) { int a = 4; // Loop to do some fancy ass math shenanigans to a return a + 5; }
Which results in more or less human readable code again. Also, a lot of compilers will actually embed basically comments in the assembly/binary unless someone remembers to actively disable that. And nobody ever does.
Because source code really doesn’t matter all that much. If anyone ever did a deep dive of the Source Engine and Unreal Engine (of the day. I think that was 3?)… they were actually a lot more similar than not. Yeah, there was a lot of infrastructure and even some fundamental differences (been a minute, but I want to say Source is still additive because of its quake origins whereas UE3 was still subtractive geometry?). But… function influences form and they have most of the same functionality.
The issue is more, like with DRM, those “week one sales” as it were. Data is sparse for a lot of reasons, but Securom for Mass Effect 1 PC is generally treated as a massive win because warez groups could not properly crack it for a week or so which led to a lot of pc players just buying the game because we needed hot blue lady action NOW. Same with the reason so many games still use denuvo.
And same with engine and game features. Unity and Unreal basically were in an arms race for years (and Cryengine basically sucked on the barrel of their own gun) where one would add one feature and the other would add the same one a few months later. Having access to the source code potentially means someone can accelerate that a lot…
Except they won’t. Because this shit is cancer. Any dev who does anything even remotely similar to how Wolverine jacks off to his friend’s wife in a game is opening themselves up to a LOT of lawsuits and investigations. Its why almost every emulator group will go scorched earth on anyone who even acknowledges looking at the nintendo leaks. And it is why someone quitting their job at Microsoft to go work at Google and offer to provide a Teams feature gets blacklisted and reported immediately. Companies don’t fuck around with that.
Which leads to the real reason these are so detrimental to gaming and software. People… put questionable comments in their code all the time. Sometimes it is
///TODO: Implement this in Q2 2024 for release 1.5
. Sometimes it is//Hey, Fuck you Fred. I finally did this. Stop your fucking *removed* and stop whining to John that you couldn't do anything until I did this for you
. And sometimes you get bullshit like Bungie (?) using racial slurs as codenames for a lot of the skins and the like…And it also makes it a lot easier to make a cracked binary if you know what the code was before the DRM was applied.
So decompiling code is actually a lot more viable than you would expect. But also entirely pointless.
In this video MattKC tries to start decompiling Lego Island. It kind of explains it as well and acts as an example
Correct. However, this looks like a direction that language processing and GPTs would excel in and to my knowledge, there already have been some ML addons made for certain RE tools, so this might become more easy in the future. If that is necessarily a good thing is a different story…
ayo gta 5 source code leaked yesterday you might want to check it out
LMAO, I’ve got to be one of God’s favorites
The mentions of unreleased DLC was interesting to read about.
Theres also jak and daxter, i believe it was called openGOAL?
openGOAL
seems interesting
I think I need to store this page on this discussion: https://en.wikipedia.org/wiki/List_of_commercial_video_games_with_available_source_code