ctf/2016-09-09-asis-final/one_bad_son at master · p4-team/ctf

History

Name		Name	Last commit message	Last commit date
parent directory ..
One_Bad_Son		One_Bad_Son
README.md		README.md
flag.png		flag.png
scr1.png		scr1.png

README.md

One Bad Son (Forensics, 127p)

tl;dr get data from bson, concact flag png

Supplied files:

One_Bad_Son

It turns out that windbg formats binary data pretty well (cat/hexump/xxd output wasn't so clear):

After some googling, the data turned out to be a BSON, a format that's used mainly in MongoDB.

Our first guess was to try to load the dump into mongo using a recovery tool, but that turned out to be pointless.

Taking a step back, we've decided to convert the file to a standard JSON structured file using a python script. The file itself was broken and the parser would not work on it. We used a debugger to see how the library is parsing bson files and we noticed that it first tries to read 4 big endian byte number as record length. In our case there was only a single 0x0 byte instead. Therefore we checked how long is a single record (0x72 bytes) and then attached the missing 3 bytes in the beginning of data so that the data starts with 0x72 0x00 0x00 0x00 (big endian!). With this we could make a script to dump all the data:

import base64
import codecs
import bson

with codecs.open("./One_Bad_Son", "rb") as input_file:
    data = input_file.read()
    data = '\x72\x00\x00' + data
    loaded = bson.decode_all(data)
    with codecs.open("out.txt", "w") as output_file:
        output_file.write("[\n")
        for d in loaded:
            output_file.write(repr(d)+"\n")
        output_file.write("]\n")

This is how a single row looked like:

{u'raw': 100000000000000L, u'len': 2, u'dat': u'iVA=', u'crc': u'c0f36009', u'fname': u'flag', u'mtime': datetime.datetime(48652, 6, 24, 12, 40, 10, 460000), u'_id': u'0262404638'}

In reality we could have just as well written a parser by hand, since the structure was rather clear, but it turned out to be faster this way.

What's interesting now, is that the base64 decoded dat data from first 2 rows (when data sorted by raw field) creates PNG string, so there's a png image encoded in that json!

However some values from the raw field are duplicated so we had to make sure we use each chunk only once.

We wrote a script to decode all unique flag rows sorted by their raw field:

used_id = set()
ordered = sorted(loaded, key=lambda di: int(di['raw']))

flag_data = []
for d in ordered:
    id = d['raw']
    if id not in used_id and d['fname'] == 'flag':
        base = d['dat']
        decoded = base64.b64decode(base)
        flag_data.append(decoded)
        used_id.add(id)
with codecs.open("out.png", "wb") as output_file:
    output_file.write("".join(flag_data))

And the output is:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

one_bad_son

one_bad_son

README.md

One Bad Son (Forensics, 127p)

Files

one_bad_son

Directory actions

More options

Directory actions

More options

Latest commit

History

one_bad_son

Folders and files

parent directory

README.md

One Bad Son (Forensics, 127p)