Use the small thread-local cache for mterp invokes.

This speeds up non-quickened interpreter by 2% (measured on golem).

Test: ./art/test.py -b -r --interpreter --host
Change-Id: I6b00db1b2da7fda4cb0a34beb62d3857ae3d72df
2 files changed