Add ability to manually choose unpacker #17

Masrepus · 2019-05-06T18:56:06Z

Use cases:

no packer recognized -> manually select one
wrong packer recognized -> override selection

Calvonator · 2020-11-13T09:09:38Z

Is anyone still interested in this feature?

Masrepus · 2020-11-13T09:17:27Z

Yes, we are still interested in this, but unfortunately we are somewhat short on time due to university tasks, so we are currently not as actively developing new features as we would like to. If you want to propose a PR by any chance, you are of course welcome to do so and we'll take the time to review and merge it

Calvonator · 2020-11-17T00:23:46Z

Cool sounds good, I'll see what I can do for you 👍

Masrepus · 2020-11-17T07:11:17Z

Cool, thanks!

Calvonator · 2021-02-05T09:49:36Z

@Masrepus Hi just wanted some quick clarification on the following use case.

wrong packer recognized -> override selection

Does this use case mean to give the user the option to manually override the selected unpacker in the case that the indentifypacker() function selects the wrong unpacker?

Masrepus · 2021-02-05T09:51:43Z

Yes I think that makes sense like that 👍🏼

Calvonator · 2021-02-17T06:49:05Z

@Masrepus Hi just one more clarification for the following use case.
wrong packer recognized -> override selection

Should the user be asked whether they would like to manually override the unpacker each time they begin to unpack a sample or only when the sample is first given to unipacker?

Masrepus · 2021-02-17T07:12:22Z

Hm I think asking every time would be fine. Otherwise we would need to identify the last used unpacker for a sample from history data that might be corrupted, so that needs additional error handling etc. And I guess that might not be such a predominant use case to justify the extra work. But if you think that it might indeed be nice to include it, of course feel free to do so.

Masrepus · 2021-02-17T07:15:14Z

And with asking the user, do you mean that every time a sample is opened, the user needs to explicitly accept/override the unpacker? I think it would be preferable to just use the recognized unpacker and if the user wants to change it, they can do so manually

Calvonator · 2021-02-17T07:27:00Z

Yeah I agree that it would be preferable that it not be asked every time. Unsure on best way to go about this though, do you think making it an option ID such as M so that when M is entered they can change the unpacker of an already listed sample.
Excuse the crudely drawn example below:
https://gyazo.com/b82d8a571a979f4e093dc082ff69d673

Masrepus · 2021-02-17T07:36:58Z

Actually we currently use the name of the last unpacker only for information purposes. The unpacker is identified anew every time a sample is loaded. I would create a new shell command for this, e.g. unpacker <name>. You can create such a shell command by adding a method do_unpacker(self, args) to shell.py, or analogously for a different name. The docstring for such a method is then automatically used as a help text when the user calls help <command_name>

Calvonator · 2021-02-23T11:36:13Z

Haven't worked with making shell commands before so sorry for the silly question. Is the command name just the same as the function name or is that defined elsewhere?
Also I'm a bit confused about when this command is used. Will this command be used after the initial option ID is entered and a sample is chosen?

Masrepus · 2021-02-23T12:18:38Z

Ah I see, sorry I should have clarified that! Currently, the unipacker shell is defined in shell.py. One example would be the log command, which is defined here:

unipacker/unipacker/shell.py

Lines 720 to 742 in de508c0

 def do_log(self, args): 

 """Set logging level 

 Usage: log [OPTIONS] 

 Options: 

  i Log every instruction that is executed 

  r Log memory READ access 

  w Log memory WRITE access 

  s Log system API calls 

  a Log everything""" 

 if args == "a": 

 args = "irsw" 

 print("Log level:") 

 self.engine.log_mem_read = any(x in args for x in ["r", "read"]) 

 print(f"[{'x' if self.engine.log_mem_read else ' '}] mem read") 

 self.engine.log_mem_write = any(x in args for x in ["w", "write"]) 

 print(f"[{'x' if self.engine.log_mem_write else ' '}] mem write") 

 self.engine.log_instr = any(x in args for x in ["i", "instr"]) 

 print(f"[{'x' if self.engine.log_instr else ' '}] instructions") 

 self.engine.log_apicalls = any(x in args for x in ["s", "sys"]) 

 print(f"[{'x' if self.engine.log_apicalls else ' '}] API calls")

As you can see, the method name is do_log, as the shell framework that we use automatically registers methods that start with do_ as commands, using the portion after the underscore as the command name.

Each such function needs to have the arguments self and args, where - as can be seen in the log example - the shell framework passes you the whole string that the user wrote after the command name. As an example, if the user executes log instr, the framework calls do_log and sets the parameter args to "instr". Inside this method, you can then inspect the argument, and perform any operation that you like.

After you are done with whatever the command should do, you just return from the method as usual, and the shell framework makes sure that the user can enter the next command.

In your case, let's suppose you want to provide a command called unpacker that takes one argument which specifies the name of the unpacker to use. You would then need to create a do_unpacker method inside the Shell class of shell.py. This method checks the value of args whether it is the name of a known unpacker. If it isn't, just print some error message and return. Otherwise, you can then modify the unpacker of self.sample to be the one the user requested.

Regarding the usage of the command, the workflow we imagine should be somewhat like this:

The user starts unipacker and selects a sample to be loaded
Once loading is done, the user is now inside the unipacker shell, which is waiting for him to perform commands (e.g. start emulation, configure logging etc)
At this point, the user will probably check which unpacker has been detected, and maybe decide that a different one should be used
They invoke e.g. unpacker upx, which switches the current unpacker to the UPX implementation
If they decide that they actually didn't want to use UPX, they can switch again
At some point, they start emulation by invoking the r command. After the start of the emulation, the chosen unpacker is no longer changeable, as this might lead to problems when it is changed mid-emulation. The do_unpacker method can simply check the self.started field, and if it is true, the command just prints a warning message that changing the unpacker is not supported once emulation is started, and simply exit

Calvonator · 2021-02-24T04:37:04Z

Awesome thank you so much for this reply, I really appreciate the effort you have put in to help me with this. This is my first open source contribution and I'm glad I've had you to guide me through.

I think I should be able to complete this now with the info you've given, hopefully haha.

Masrepus · 2021-02-24T06:49:25Z

No problem, if you have any further questions don't hesitate to ask them.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ability to manually choose unpacker #17

Add ability to manually choose unpacker #17

Masrepus commented May 6, 2019

Calvonator commented Nov 13, 2020

Masrepus commented Nov 13, 2020

Calvonator commented Nov 17, 2020

Masrepus commented Nov 17, 2020

Calvonator commented Feb 5, 2021

Masrepus commented Feb 5, 2021

Calvonator commented Feb 17, 2021 •

edited

Masrepus commented Feb 17, 2021

Masrepus commented Feb 17, 2021

Calvonator commented Feb 17, 2021

Masrepus commented Feb 17, 2021

Calvonator commented Feb 23, 2021

Masrepus commented Feb 23, 2021

Calvonator commented Feb 24, 2021

Masrepus commented Feb 24, 2021

Add ability to manually choose unpacker #17

Add ability to manually choose unpacker #17

Comments

Masrepus commented May 6, 2019

Calvonator commented Nov 13, 2020

Masrepus commented Nov 13, 2020

Calvonator commented Nov 17, 2020

Masrepus commented Nov 17, 2020

Calvonator commented Feb 5, 2021

Masrepus commented Feb 5, 2021

Calvonator commented Feb 17, 2021 • edited

Masrepus commented Feb 17, 2021

Masrepus commented Feb 17, 2021

Calvonator commented Feb 17, 2021

Masrepus commented Feb 17, 2021

Calvonator commented Feb 23, 2021

Masrepus commented Feb 23, 2021

Calvonator commented Feb 24, 2021

Masrepus commented Feb 24, 2021

Calvonator commented Feb 17, 2021 •

edited