Movfuscator – A single-instruction C compiler

erlehmann_ · on Aug 27, 2016

Previously on Battlehacker Newslactica: https://news.ycombinator.com/item?id=10021259

projectdp · on Aug 27, 2016

Saw this recently after reviewing some Defcon 23 videos. The author goes into detail about how it works and some other fun stuff regarding anti-reverse-engineering.

2015 DEFCON 23 - Chris Domas - Repsych: Psychological Warfare in Reverse Engineering https://www.youtube.com/watch?v=HlUe0TUHOIc

And the paper it builds on, Stephen Dolan's "mov is Turing-complete": https://www.cl.cam.ac.uk/~sd601/papers/mov.pdf

kersny · on Aug 27, 2016

See also: Demovfuscator

https://kirschju.re/demov

_c_ · on Aug 28, 2016

Since we are joking: assuming that the MOV instruction exists on many CPUs, could the input for this compiler be considered a needed "portable assembly language"?

http://cr.yp.to/qhasm/20050129-portable.txt

maxpert · on Aug 27, 2016

I wonder what happens to the performance of the same code.

easuter · on Aug 27, 2016

For large applications performance will undoubtedly nosedive.

maxpert · on Aug 27, 2016

Yep, one might just write some key verification code in it but compile the rest of the app with GCC.

_nalply · on Aug 27, 2016

Some slides here: https://recon.cx/2015/slides/recon2015-14-christopher-domas-...

LightMachine · on Aug 27, 2016

Not sure I understand how that is possible. How would you implement a boolean "and" with only "mov"? If you can only move stuff around, how do you read and compare things?

__s · on Aug 27, 2016

If you look at Turing machines, they're pretty minimal: just a state machine reading/writing state on a tape.

Repo includes a good set of slides: https://github.com/xoreaxeaxeax/movfuscator/blob/master/slid...

Indirect memory accesses serve for conditional execution

Here's the macro for boolean and: https://github.com/xoreaxeaxeax/movfuscator/blob/master/poc/...
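To make the "indirect memory accesses" point concrete, here is a minimal C sketch of the idea. It is not the linked macro, and the names `mov_eq` and `mov_and` are made up for illustration. Each array store or load stands in for one indexed mov. The equality trick is the one described in the "mov is Turing-complete" paper: write 0 at address x, write 1 at address y, then read address x back.

```c
#include <stdint.h>

/* Hypothetical helper: returns 1 if x == y, else 0, using only array
   stores and loads (the work an indexed mov can do). There are no
   comparison operators and no branches. */
static uint8_t mov_eq(uint8_t x, uint8_t y)
{
    static uint8_t scratch[256]; /* one cell per possible byte value */

    scratch[x] = 0;    /* mov byte [scratch + x], 0 */
    scratch[y] = 1;    /* mov byte [scratch + y], 1 (hits the same cell only if y == x) */
    return scratch[x]; /* mov al, [scratch + x] */
}

/* Boolean AND as two chained lookups: a picks a row, b picks a cell. */
static const uint8_t and_row0[2] = {0, 0};
static const uint8_t and_row1[2] = {0, 1};
static const uint8_t *const and_rows[2] = {and_row0, and_row1};

/* Hypothetical helper: a and b must each be 0 or 1. */
static uint8_t mov_and(uint8_t a, uint8_t b)
{
    const uint8_t *row = and_rows[a]; /* load the row pointer selected by a */
    return row[b];                    /* load the cell selected by b */
}
```

So `mov_eq(3, 3)` yields 1 and `mov_eq(3, 4)` yields 0, and the "compare" is nothing more than two writes that either collide or don't.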

liquidzoot · on Aug 27, 2016

You should be careful with this, you'll wear a hole in your instruction set.

sonthonax · on Aug 27, 2016

Dumb question, could that actually happen? Could you actually use a particular set of transistors so much with this that they break?

slacka · on Aug 27, 2016

This wouldn't do any noticeable damage. Modern CPUs have excellent thermal management. As far as wear goes, a hot spot on the chip would in theory slightly shorten the CPU's life span.

If you expanded your question to other hardware in the computer, then yes, you can easily cause damage. BIOSes can be flashed to make the system unbootable or to overclock/stress components. Back in the bad old days of Linux, you could easily damage your monitor with the wrong xorg.conf settings.

Your question got me thinking what’s the MTBF of modern CPUs? My google-fu failed me finding any reliable source of this, but I’m sure it’s long, 10+ years.

woliveirajr · on Aug 27, 2016

> Back in the bad old days of Linux, you could easily damage your monitor with the wrong xorg.conf settings

You could also damage a floppy drive by making it repeatedly read/write a few sectors outside the usual limits. Been there, done that.

But after so many discussions on online forums claiming that it was impossible to cause physical damage using software (other than by overwriting firmware), I gave up and kept this (and the asm code) deep inside my heart.

And bringing it up still gives me chills that those discussions will return right now...

koytch · on Aug 28, 2016

Sounds close to

http://www.catb.org/jargon/html/W/walking-drives.html

userbinator · on Aug 27, 2016

> Your question got me thinking what’s the MTBF of modern CPUs? My google-fu failed me finding any reliable source of this, but I’m sure it’s long, 10+ years.

Probably decreasing, and soon not much longer than warranty period... the transistors have gotten so small that they're on the threshold of barely working even in normal operation.

As for older CPUs, they could definitely last many decades because of the lower stresses of larger process sizes, and they were designed with much higher margins.

CyberDildonics · on Aug 27, 2016

Do you have anything to back this up?

versteegen · on Aug 27, 2016

According to the paper linked in another comment (https://news.ycombinator.com/item?id=12373015), the high-k dielectric nodes used at 45nm and below apparently show ~5x worse NBTI ageing than non-high-k 45nm PMOS gates, which determines the tolerances that are selected to provide X years of life.

(IANAEE)

stcredzero · on Aug 27, 2016

> Back in the bad old days of Linux, you could easily damage your monitor with the wrong xorg.conf settings.

Back when a certain kind of line printer was commonplace (has a circulating ribbon with the typeface repeated, and n hammers in a line going across the entire width) programmers could sabotage the printer by printing the pattern on the ribbon. This would cause all of the hammers to fire at once, which the machine wasn't designed to withstand.

I've also heard of monitors being broken by having the speaker output the resonant frequency of the glass cover. However, I can't vouch for this one.

rbobby · on Aug 28, 2016

Sounds I'll never forget:

* Modem

* Dot matrix printer

* DEC Line printer

avar · on Aug 28, 2016

    > Back in the bad old days of Linux, you could
    > easily damage your monitor with the wrong
    > xorg.conf settings.

Nit: Back in the old days of Linux there was no xorg.conf, it was called XF86Config

rbobby · on Aug 28, 2016

I could never, and I mean never, get an XF86Config to work. Totally turned me off of Linux.

ComodoHacker · on Aug 28, 2016

>what’s the MTBF of modern CPUs? My google-fu failed me finding any reliable source of this, but I’m sure it’s long, 10+ years.

It's so long that probably nobody bothers to measure it.

detaro · on Aug 27, 2016

Yes, but you probably need something more sophisticated than just running the same instruction type over and over again: https://dl.acm.org/citation.cfm?id=2744295.2724718

versteegen · on Aug 27, 2016

Wow, thanks, that's a fascinating paper! Direct link to pdf: [1].

So it turns out that if a transistor is kept on continuously its threshold voltage gradually increases (Negative-Bias Temperature-Instability (NBTI)), increasing the switching delay. This attack targets transistors along the critical path, increasing the path's delay until it exceeds the allowed tolerance (guardband). Turning the transistor off "heals" it; as a workaround they suggest periodically executing certain nop instructions to ensure critical path transistors spend at least 0.05% of their time turned off. They perform simulations using models of 45nm high-k PMOS transistors to produce their results. A good quote about processor reliability:

   Guardbanding  is  the  current  industrial  practice  to  cope  with  transistor  aging  and
   voltage droops [Agarwal et al. 2007]. It entails slowing down the clock frequency (i.e.,
   adding timing margin during design) based on the worst degradation the transistors
   might experience during their lifetime. The guardbands ensure that enough current
   passes through the processor to keep it above the threshold voltage and in turn ensure
   that the processor functionality is intact for an average period of 5 to 7 years [Tiwari
   and Torrellas 2008]. However, inserting wide guardbands degrades performance and
   increases energy consumption. Hence, processor design companies usually have small
   guardbands, typically 10% [Agarwal et al. 2007]. However, the MAGIC-based attack
   can deteriorate the critical path by 11% and cause erroneous results in 1 month.

This also explains why overclocking a CPU may be a bad idea, although they also show that random instructions don't come close to the worst case ageing.

[1] https://www.researchgate.net/profile/Naghmeh_Karimi/publicat...

amelius · on Aug 28, 2016

Well, I think this can be answered by considering that even under normal conditions there are transistors that are used as much as in your hypothetical scenario. For example, the instruction decoding logic is invoked for every instruction. Since all logic transistors are the same (afaik), I don't think that using one type of instruction would significantly reduce the lifetime of your CPU.

TazeTSchnitzel · on Aug 27, 2016

Probably not for MOV, given x86 uses it a heck of a lot anyway, so surely processors are already equipped to handle it.

mappu · on Aug 28, 2016

You should also regularly rotate your CPU in its socket, to ensure all the cores wear evenly.

koytch · on Aug 28, 2016

Uhm, not really, but you might make 20 smaller ones.

http://x86.renejeschke.de/html/file_module_x86_id_176.html

qwertyuiop924 · on Aug 27, 2016

That is hilarious.

aub3bhat · on Aug 27, 2016

Could this be used for creating a ROP gadget that overcomes ASLR on 64-bit machines?

l_zzie · on Aug 28, 2016

How does it get past aslr? You still need to find addresses of the movs, don't you?

amelius · on Aug 28, 2016

But MOV is not really a single instruction. I would be more impressed by a single opcode compiler.

tbodt · on Aug 27, 2016

How do we do arithmetic if we can only do movs?

I know! Lookup tables!
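A rough C sketch of the lookup-table idea, with the usual caveat that it only illustrates the principle and is not taken from the movfuscator source. The names `add_table` and `add4` are made up. The additions in the `ROW` macro are constant expressions folded at compile time, so at run time the "add" is a single table read. A real mov-only encoding would also have to express the index computation through mov's addressing modes or further tables.

```c
#include <stdint.h>

/* Row a of the table holds a+0 .. a+15. These sums are computed by the
   compiler, not at run time. */
#define ROW(a) { (a)+0, (a)+1, (a)+2,  (a)+3,  (a)+4,  (a)+5,  (a)+6,  (a)+7, \
                 (a)+8, (a)+9, (a)+10, (a)+11, (a)+12, (a)+13, (a)+14, (a)+15 }

static const uint8_t add_table[16][16] = {
    ROW(0), ROW(1), ROW(2),  ROW(3),  ROW(4),  ROW(5),  ROW(6),  ROW(7),
    ROW(8), ROW(9), ROW(10), ROW(11), ROW(12), ROW(13), ROW(14), ROW(15)
};

/* Hypothetical helper: sum of two 4-bit values (each 0..15), result 0..30.
   The operands are used purely as indices; no add is executed here. */
static uint8_t add4(uint8_t a, uint8_t b)
{
    return add_table[a][b];
}
```

Wider additions can be assembled from small tables like this one plus extra tables for splitting out the carry, which is also why the generated code gets so large.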

isuckatcoding · on Aug 27, 2016

ELI5 please. What does this mean for a developer?

bluedino · on Aug 27, 2016

Nothing - it's simply an intellectual exercise. It uses the MOV instruction exclusively to create a working program.

Further reading - http://www.cl.cam.ac.uk/%7Esd601/papers/mov.pdf

0x0 · on Aug 27, 2016

It might find some use in DRM, malware, and other exploits as it would make reverse engineering/decompiling/analysis somewhat more challenging.

haberman · on Aug 27, 2016

Someone made something very clever, but it has no practical usefulness whatsoever. It is an interesting intellectual exercise. It's also very impressive that they could pull this off.

SilasX · on Aug 27, 2016

I would think it has some practical use as research in OISC (one-instruction set computing) processors, which are like the RISC model (do a smaller set of instructions so you can do them faster) on steroids.

lucb1e · on Aug 27, 2016

No practical usefulness? It seems rather great for obfuscation, be it for evil (viruses) or good (license key verification -- which is deemed 'good' merely because they're legal, not because they're not a pain in the arse).

normalhuman · on Aug 27, 2016

Not really. Creating an algorithm for recovering the jumps and the intent of the various MOV patterns would be no more work than it was to write this. Particularly easier because one has access to the obfuscator's code, but I don't think it would be a major hurdle even without the source code.

lucb1e · on Aug 28, 2016

Same could be said for most binaries: they're just compilations (usually with open source or freely available compilers) of C/C++ code. Shouldn't be too hard to reverse once you got all the patterns worked out.

I see your point though. I'm not very experienced on this and I'm sure some patterns can easily be recovered, but until someone goes through the effort it's still a considerable effort compared to being able to read the program normally, and even when someone does it's questionable whether the original can be recovered with some simple 1:1 translation.

jakub_h · on Aug 27, 2016

You can stop worrying about branchless code and have it generated automatically! ;)
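For anyone wondering how code can be "branchless" when the source has an if: one common trick, and as far as I understand the talk roughly what movfuscator does, is to always execute the store and let the condition pick whether it lands on the real variable or on a throwaway scratch location. A minimal sketch under that assumption follows; `store_if` and `pick` are hypothetical names, not anything from the project.

```c
#include <stdint.h>

/* Hypothetical helper: performs "if (cond) *dst = value;" without a jump.
   The store always executes; cond (0 or 1) only selects where it lands. */
static void store_if(uint32_t cond, int *dst, int value)
{
    int scratch;                        /* throwaway destination */
    int *target[2] = { &scratch, dst }; /* [0] = discard, [1] = real */

    *target[cond] = value;              /* same store either way */
}

/* Hypothetical helper: returns if_true when cond is 1, if_false when 0. */
static int pick(uint32_t cond, int if_true, int if_false)
{
    int result = if_false;
    store_if(cond, &result, if_true);
    return result;
}
```

Both "arms" of the conditional run every time; the untaken one just writes somewhere nobody reads.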

danjoc · on Aug 27, 2016

And for the minimalist, the zero-instruction C compiler

https://github.com/jbangert/trapcc

2opdude · on Aug 28, 2016

The MOV mnemonic is more like a family of instructions, isn't it?

Not knowing anything about GPU programming, isn't it similar to Movfuscator in some respects? Both branches are taken and run simultaneously?

toolslive · on Aug 27, 2016

just what I need for my next virus

mighty_atomic_c · on Aug 27, 2016

Seems like it would give a large performance penalty. I don't get it.

detaro · on Aug 27, 2016

It's a joke/demonstration that it is possible, not something you're supposed to use.

posterboy · on Aug 27, 2016

Obfuscation is surely used in malware.

One instruction set computers (OISC) are more than a joke, I suppose, but I didn't dig far into theoretical computer science and can't say what's important about them.

I read a comment the other day that speculated neurons would be akin to massively parallel single-instruction computers.

stcredzero · on Aug 27, 2016

> One instruction set computers (OISC) are more than a joke, I suppose, but I didn't dig far into theoretical computer science and can't say, what's important about them.

They're for highly parallel programmable SIMD number crunching. The OISC would allow for easily fabricating a whole heaping bunch of ALUs.

hulahoof · on Aug 27, 2016

There is a good talk that accompanies the code

caretStick · on Aug 28, 2016

source?

hulahoof · on Aug 28, 2016

youtube.com/watch?v=HlUe0TUHOIc

He starts talking about it at about 7 minutes in, but the whole thing is a good watch. Could only find the YT link, sorry.