diff options
| author | 2012-11-08 07:54:03 -0800 | |
|---|---|---|
| committer | 2012-11-08 07:54:03 -0800 | |
| commit | a4a970ab4eb5188d5aa2a078dfe4fe336ccbfc4e (patch) | |
| tree | 884d703782a6a32677518a1814ccc633b4b51ef3 /src/compiler/codegen | |
| parent | 07131ca93c301e5cbd6a8702d6af777e1662fe65 (diff) | |
Fix overlap bug
There is currently no rule against a dex code pattern that
overlaps the result of a long operation with the source operands.
For example:
ADD_LONG v1, v9, v0
is: (v1,v2) = (v9, v10) + (v0, v1)
However, dx appears to never generate such a pattern. This masked
a JIT (and later Art) bug in the handling of long arithmetic code
generation. The above code expands into a pair of 32-bit add
operations and naively kills the high-order word of the 2nd
operation while computing the low-order word of the result.
The fix is simple - detect this case and use an intermediate
result temporary.
Change-Id: I127f5a51925d5b4249a7a9710cce587dfe0939ad
Diffstat (limited to 'src/compiler/codegen')
| -rw-r--r-- | src/compiler/codegen/GenCommon.cc | 8 |
1 files changed, 4 insertions, 4 deletions
diff --git a/src/compiler/codegen/GenCommon.cc b/src/compiler/codegen/GenCommon.cc index cc3685c6f6..591e12c2d9 100644 --- a/src/compiler/codegen/GenCommon.cc +++ b/src/compiler/codegen/GenCommon.cc @@ -1576,11 +1576,11 @@ void genLong3Addr(CompilationUnit* cUnit, OpKind firstOp, rlSrc2 = loadValueWide(cUnit, rlSrc2, kCoreReg); rlResult = oatEvalLoc(cUnit, rlDest, kCoreReg, true); // The longs may overlap - use intermediate temp if so - if (rlResult.lowReg == rlSrc1.highReg) { + if ((rlResult.lowReg == rlSrc1.highReg) || (rlResult.lowReg == rlSrc2.highReg)){ int tReg = oatAllocTemp(cUnit); - opRegCopy(cUnit, tReg, rlSrc1.highReg); - opRegRegReg(cUnit, firstOp, rlResult.lowReg, rlSrc1.lowReg, rlSrc2.lowReg); - opRegRegReg(cUnit, secondOp, rlResult.highReg, tReg, rlSrc2.highReg); + opRegRegReg(cUnit, firstOp, tReg, rlSrc1.lowReg, rlSrc2.lowReg); + opRegRegReg(cUnit, secondOp, rlResult.highReg, rlSrc1.highReg, rlSrc2.highReg); + opRegCopy(cUnit, rlResult.lowReg, tReg); oatFreeTemp(cUnit, tReg); } else { opRegRegReg(cUnit, firstOp, rlResult.lowReg, rlSrc1.lowReg, rlSrc2.lowReg); |