Lua 5 decoder – LuaDec (.LUA bytecode bestanden)
LuaDec is a decompiler for the Lua language. It takes compiled Lua bytecodes and attempts to produce equivalent Lua source code on standard output. It targets Lua 5.0.2.
The decompiler performs two passes. The first pass is considered a “preliminary pass”. Only two actions are performed: addresses of backward jumps are gathered for while constructs, and variables referenced by CLOSE instructions are taken note of in order to identify the point where explicit do blocks should be opened.
The actual decompilation takes place in the second, “main pass”. In this pass, the instructions are symbolically interpreted. As functions are traversed recursively, the symbolic content of registers are kept track of, as well as a few additional data structures: two register stacks, one for “variable registers” (registers currently allocated for variables) one for “temporary registers” (registers holding temporary values); and a list of boolean conditions.
Instructions like ADD and MOVE combine the string representation of symbols and move them around in registers. Emission of actual statements is delayed until the temporaries stack is empty, so that constructs like a,b=b,a are processed correctly.
The list of boolean conditions accumulate pairs of relational operators (or TESTs and jumps). The translation of a list into a boolean expression is done in three stages. In the first stage, jump addresses are checked to identify “then” and “else” addresses and to break the list into smaller lists in the case of nested if statements. In the second stage, the relations between the jumps is analyzed serially to devise a parenthization scheme. In the third scheme, taking into account the parenthization and an ‘inversion’ flag (used to tell “jump-if-true” from “jump-if-false” expressions), the expression is printed, recursively.
Two forms of “backpatching” are used in the main processing pass: boolean conditions for while constructs are inserted in the marked addresses (as noted in the first pass), and do blocks are added as necessary, according to the liveness information of local variables.
LuaDec, in its current form, is not a complete decompiler. It does succeed in decompiling most of the Lua constructs, so it could be used as a tool to aid in retrieving lost sources.
The situations where LuaDec “get it wrong” are usually related to deciding whether a sequence of relational constructs including TEST operators are part of an assignment or an if construct. Also, the “single pass” nature of the symbolic interpreter fails at some corner cases where there simply is not enough information in the sequence of operator/jump pairs to identify what are the “then” and “else” addresses.
To try out LuaDec:
bin/luadec luac.out > newlife.lua
LuaDec includes a -d flag, which displays the progress of the symbolic interpretation: for each bytecode, the variable stacks and the code generated so far are shown.
In its default operation mode, LuaDec recursively processes the program functions, starting from the outmost chunk (“main”). When LuaDec detects a decompilation error (by doing sanity checks on its internal data structures), Luadec outputs the portion of the function decompiled so far, and resumes decompilation in the outer closure.
There is an alternative operation mode, set with the -f flag, where all functions are decompiled separately. This does not generate a .lua file structured like the original program, but it is useful in the cases where a decompilation error makes a closure declaration “unreachable” in the default operation mode. This allows a larger portion of the sources to be restored in the event of errors.