Katana: A Userland Toolchain-Oriented Hotpatching System
Table of Contents
1 Introduction
Katana aims to provide a hot-patching system for userland. Further it aims to work with existing toolchains and formats so as to be easy to use and to hopefully pave the way for incorporating patching as a standard part of the toolchain. Because of this aim, Katana operates at the object level rather than requiring any access to the source code itself. This has the added bonus of making it, in theory, language agnostic (although no work has been done to test it with anything besides programs written in C). A diagram of software lifecycle with hotpatching is shown below

This document is intended to provide a users guide to Katana, insight into its inner workings, and discussion of its flaws and plans for the future. As the software is not complete, making use of Katana without understanding the inner workings and technical shortcomings is not recommended. Nevertheless, the only sections of this document necessary for "Users' Guide" purposes are "What Katana Does", "What Katana Does Not Do (Yet)", and most importantly "How to Use Katana".
This document is a work in progress. It is not a polished guide yet.
2 Other Systems
There are other hotpatching systems in existence. The curious are invited to explore Ginseng and Polus. Both of these systems parse the source code, which adds significant complexity to them and results in significant programmer annotation of the code to give hints to the systems. Ginseng uses complicated type-wrappers when patching variables which does not fit cleanly with existing executables and has some impact on the performance of the software. Ginseng is considerably more mature than Katana, however. Neither system is production ready, but Ginseng is probably closer than Katana at the moment.
The system most like Katana in many ways is KSplice, and the curious reader is definitely invited to investigate. KSplice patches the kernel and not userland, does not attempt to patch variables, and creates patches as kernel modules rather than working towards a general ELF-based patch format.
3 What Katana Does
- Runs on x86 and x86-64
- Generates patches for simple programs
- Applies simple patches
4 What Katana Does Not Do (Yet)
- Patch any major programs: it has not yet been demonstrated on anything more than toy examples
- Provide any method to handle opaque data it cannot patch (void*, situations where which action a user would prefer is unclear, etc)
- Patch previously patched processes
- Provide robust operation
- Run on any architectures other than x86 and x86-64
- Tested on any operating system besides GNU/Linux
- Allow for calls in patched code to previously unused functions
- Work for programs which actually make use of some of the large code model features of the x86-64 ABI.
- And much more
See Roadmap for more things which are not complete
5 What Katana May Never Do
- Work on any binary formats besides ELF
6 How to Use Katana
Katana is intended to be used in two stages. The first stage generates a patch object from two different versions of an treee. By an object tree, we mean the set of object files (.o files) and the executable binary they comprise. Katana works completely at the object level, so the source code itself is not strictly required, although all objects must be compiled with debugging information. This step may be done by the software vendor. In the second stage, the patch is applied to a running process. The original source trees are not necessary during patch application, as the patch object contains all information necessary to patch the in-memory process at the object level. It is also possible to view the contents of a patch object in a human-readable way for the purposes of sanity-checking, determining what changes the patch makes, etc.
6.1 Preparing a Package for Patching Support
Katana aims to be much less invasive than other hot-patching system
and require minimal work to be used with any project. It does,
however, have some requirements.
6.1.1 Source Code Practices
Katana does not look at the source code, therefore unlike several other hotpatching systems, it does not require any annotation in the source code. There are, however, some best practices to follow.
-
Avoid the use of
void*
at least for global variables (since Katana does not currently patch local variables, preferring to wait until any functions using changed variables are no longer on the stack). Since it is typeless and opaque, it is very hard to analyze and patch. -
Avoid unnamed types. i.e., instead of
typedef struct {...} Foo;
usetypedef struct Foo_ {...} Foo;
. - Avoid accessing structure members by offsets instead of by the member names. As long as you keep all the code where you do this up to date, it should not be a problem, but katana cannot detect when you do this.
6.1.2 Compilation/Linking
Required CFLAGS:
- -g
Recommended CFLAGS:
- -ffunction-sections
- -fdata-sections
Recommended LDFLAGS:
- –emit-relocs
6.2 To Generate a Patch
Let the location of your project be /project. You must have two versions of your software available: the version identical to the running software which must be hotpatched, call it v0, and the version to which you wish to hotpatch the running software, call it v1. Let foo be the name of your program. Then /project/v0/foo must exist and /project/v0 must also contain (possibly in subdirectories) all of the object files which contributed to /project/v0/foo. The source code itself is immaterial, as Katana does not parse it. Similarly, /project/v1/foo must exist and /project/v1 contain all of the object files contributing to /project/v1/foo. Katana is then invoked as
katana -g [-o OUTPUT_FILE] /project/v0 /project/v1 foo
or more formally
katana -g [-o OUTUT_FILE] OLD_OBJECTS_DIR NEW_OBJECTS_DIR EXECUTABLE_NAME
If -o OUTPUT_FILE
is not specified, the output file will be OLD_OBJECTS_DIR/EXECUTABLE_NAME.po
6.3 To Apply a Patch
The process to be patched is running with a pid of PID. It can be patched from its current version to a more recent version by the Patch Object (PO) file PATCH. Katana is then invoked as
katana -p [-s] PATCH PID
If all goes well, the patcher will run, print out some status messages, and leave your program in better state than it found it. The optional -s flag tells Katana to stop the target program after patching it and detaching from it. This is mostly of use for debugging Katana.
6.4 To View a Patch
One of the goals of Katana and its Patch Object (PO) format is to increase the transparency of patches: a user about to apply a patch should know what it will do. This goal is not yet fully realized, but it is possible to view some information about a patch with
katana -l PATCH
6.5 See Also
the katana manpage (once it's written, which it is not yet)
7 Patch Object Format
This section of the document is not yet written. It will provide a description and specification of the PO format used by Katana
8 Patch Generation Process
This section of the document is not yet written. It will provide a description of the internal process that Katana uses to generate a patch. Understanding it is not necessary for using Katana.
9 Patch Application Process
This section of the document is not yet written. It will provide a description of the internal process that Katana uses to apply a patch. Understanding it is not necessary for using Katana.
10 Roadmap
This section is highly incomplete. Future goals include
- Better interaction with the heap and dynamically allocated variables
- Better interaction with void*
- More efficient use of .rodata
- Patching already patched processes
- Patch composition
- Patch safety checking: make sure a patch actually corresponds to the process it's being applied to
- Storing warnings from generation inside a patch
11 Credits and Licensing
Katana is under development at Dartmouth College and Copyright 2010 Dartmouth College. It may be distributed under the terms of the GNU General Public License with attribution to Dartmouth College as specified in the file COPYING distributed with Katana. This document is Copyright 2010 Dartmouth College and may be distributed under the terms of the GNU Free Documentation License as found in the file FDL which should have been distributed with this documentation. If it was not, it may be found at http://www.gnu.org/licenses/fdl.txt.
Katana is being written by James Oakley and was designed by Sergey Bratus, Ashwin Ramaswamy, James Oakley, Michael Locasto, and Sean Smith.
Date: 2010-06-22 19:27:22 PDT
HTML generated by org-mode 6.33x in emacs 23