You could do the same thing CPython does: compile common patterns of code into specialized macro-instructions to reduce the interpreter's penalty.