; Disable machine CSE to stress the different paths of the algorithm.
; Otherwise, we always fall in the simple case, i.e., only one definition.
; RUN: llc < %s -mtriple=arm64-apple-ios7.0 -disable-machine-cse -aarch64-stress-promote-const -mcpu=cyclone | FileCheck -check-prefix=PROMOTED %s
; The REGULAR run just checks that the inputs passed to promote const expose
; the appropriate patterns.
; RUN: llc < %s -mtriple=arm64-apple-ios7.0 -disable-machine-cse -aarch64-promote-const=false -mcpu=cyclone | FileCheck -check-prefix=REGULAR %s

%struct.uint8x16x4_t = type { [4 x <16 x i8>] }

; Constant is a structure
define %struct.uint8x16x4_t @test1() {
; PROMOTED-LABEL: test1:
; Promote constant has created a big constant for the whole structure
; PROMOTED: adrp [[PAGEADDR:x[0-9]+]], __PromotedConst@PAGE
; PROMOTED: add [[BASEADDR:x[0-9]+]], [[PAGEADDR]], __PromotedConst@PAGEOFF
; Destination registers are defined by the ABI
; PROMOTED-NEXT: ldp q0, q1, {{\[}}[[BASEADDR]]]
; PROMOTED-NEXT: ldp q2, q3, {{\[}}[[BASEADDR]], #32]
; PROMOTED-NEXT: ret

; REGULAR-LABEL: test1:
; Regular access is quite bad: it performs 4 loads, one for each chunk of
; the structure
; REGULAR: adrp [[PAGEADDR:x[0-9]+]], [[CSTLABEL:lCP.*]]@PAGE
; Destination registers are defined by the ABI
; REGULAR: ldr q0, {{\[}}[[PAGEADDR]], [[CSTLABEL]]@PAGEOFF]
; REGULAR: adrp [[PAGEADDR:x[0-9]+]], [[CSTLABEL:lCP.*]]@PAGE
; REGULAR: ldr q1, {{\[}}[[PAGEADDR]], [[CSTLABEL]]@PAGEOFF]
; REGULAR: adrp [[PAGEADDR2:x[0-9]+]], [[CSTLABEL2:lCP.*]]@PAGE
; REGULAR: ldr q2, {{\[}}[[PAGEADDR2]], [[CSTLABEL2]]@PAGEOFF]
; REGULAR: adrp [[PAGEADDR3:x[0-9]+]], [[CSTLABEL3:lCP.*]]@PAGE
; REGULAR: ldr q3, {{\[}}[[PAGEADDR3]], [[CSTLABEL3]]@PAGEOFF]
; REGULAR-NEXT: ret
entry:
  ret %struct.uint8x16x4_t { [4 x <16 x i8>] [<16 x i8> <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>, <16 x i8> <i8 32, i8 124, i8 121, i8 120, i8 8, i8 117, i8 -56, i8 113, i8 -76, i8 110, i8 -53, i8 107, i8 7, i8 105, i8 103, i8 102>, <16 x i8> <i8 -24, i8 99, i8 -121, i8 97, i8 66, i8 95, i8 24, i8 93, i8 6, i8 91, i8 12, i8 89, i8 39, i8 87, i8 86, i8 85>, <16 x i8> <i8 -104, i8 83, i8 -20, i8 81, i8 81, i8 80, i8 -59, i8 78, i8 73, i8 77, i8 -37, i8 75, i8 122, i8 74, i8 37, i8 73>] }
}

; Two different uses of the same constant in the same basic block
define <16 x i8> @test2(<16 x i8> %arg) {
entry:
; PROMOTED-LABEL: test2:
; In stress mode, constant vectors are promoted
; PROMOTED: adrp [[PAGEADDR:x[0-9]+]], [[CSTV1:__PromotedConst.[0-9]+]]@PAGE
; PROMOTED: ldr q[[REGNUM:[0-9]+]], {{\[}}[[PAGEADDR]], [[CSTV1]]@PAGEOFF]
; Destination register is defined by the ABI
; PROMOTED-NEXT: add.16b v0, v0, v[[REGNUM]]
; PROMOTED-NEXT: mla.16b v0, v0, v[[REGNUM]]
; PROMOTED-NEXT: ret

; REGULAR-LABEL: test2:
; Regular access is strictly the same as promoted access.
; The difference is that the address (and thus the space in memory) is not
; shared between constants
; REGULAR: adrp [[PAGEADDR:x[0-9]+]], [[CSTLABEL:lCP.*]]@PAGE
; REGULAR: ldr q[[REGNUM:[0-9]+]], {{\[}}[[PAGEADDR]], [[CSTLABEL]]@PAGEOFF]
; Destination register is defined by the ABI
; REGULAR-NEXT: add.16b v0, v0, v[[REGNUM]]
; REGULAR-NEXT: mla.16b v0, v0, v[[REGNUM]]
; REGULAR-NEXT: ret
  %add.i = add <16 x i8> %arg, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  %mul.i = mul <16 x i8> %add.i, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  %add.i9 = add <16 x i8> %add.i, %mul.i
  ret <16 x i8> %add.i9
}

; Two different uses of the same constant in two different basic blocks,
; one dominates the other
define <16 x i8> @test3(<16 x i8> %arg, i32 %path) {
; PROMOTED-LABEL: test3:
; In stress mode, constant vectors are promoted
; Since the constant is the same as in the previous function,
; the same address must be used
; PROMOTED: ldr
; PROMOTED: ldr
; PROMOTED-NOT: ldr
; PROMOTED: ret

; REGULAR-LABEL: test3:
; REGULAR: ldr
; REGULAR: ldr
; REGULAR-NOT: ldr
; REGULAR: ret
entry:
  %add.i = add <16 x i8> %arg, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  %tobool = icmp eq i32 %path, 0
  br i1 %tobool, label %if.else, label %if.then

if.then:                                          ; preds = %entry
  %mul.i13 = mul <16 x i8> %add.i, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  br label %if.end

if.else:                                          ; preds = %entry
  %mul.i = mul <16 x i8> %add.i, <i8 -24, i8 99, i8 -121, i8 97, i8 66, i8 95, i8 24, i8 93, i8 6, i8 91, i8 12, i8 89, i8 39, i8 87, i8 86, i8 85>
  br label %if.end

if.end:                                           ; preds = %if.else, %if.then
  %ret2.0 = phi <16 x i8> [ %mul.i13, %if.then ], [ %mul.i, %if.else ]
  %add.i12 = add <16 x i8> %add.i, %ret2.0
  ret <16 x i8> %add.i12
}

; Two different uses of the same constant in two different basic blocks,
; neither dominates the other
define <16 x i8> @test4(<16 x i8> %arg, i32 %path) {
; PROMOTED-LABEL: test4:
; In stress mode, constant vectors are promoted
; Since the constant is the same as in the previous function,
; the same address must be used
; PROMOTED: ldr
; PROMOTED-NOT: ldr
; PROMOTED: ret

; REGULAR-LABEL: test4:
; REGULAR: ldr
; REGULAR-NOT: ldr
; REGULAR: ret
entry:
  %add.i = add <16 x i8> %arg, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  %tobool = icmp eq i32 %path, 0
  br i1 %tobool, label %if.end, label %if.then

if.then:                                          ; preds = %entry
  %mul.i = mul <16 x i8> %add.i, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  br label %if.end

if.end:                                           ; preds = %entry, %if.then
  %ret.0 = phi <16 x i8> [ %mul.i, %if.then ], [ %add.i, %entry ]
  ret <16 x i8> %ret.0
}

; Two different uses of the same constant in two different basic blocks,
; one is in a phi.
define <16 x i8> @test5(<16 x i8> %arg, i32 %path) {
; PROMOTED-LABEL: test5:
; In stress mode, constant vectors are promoted
; Since the constant is the same as in the previous function,
; the same address must be used
; PROMOTED: ldr
; PROMOTED-NOT: ldr
; PROMOTED: ret

; REGULAR-LABEL: test5:
; REGULAR: ldr
; REGULAR: ret
entry:
  %tobool = icmp eq i32 %path, 0
  br i1 %tobool, label %if.end, label %if.then

if.then:                                          ; preds = %entry
  %add.i = add <16 x i8> %arg, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  %mul.i26 = mul <16 x i8> %add.i, <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>
  br label %if.end

if.end:                                           ; preds = %entry, %if.then
  %ret.0 = phi <16 x i8> [ %mul.i26, %if.then ], [ <i8 -40, i8 -93, i8 -118, i8 -99, i8 -75, i8 -105, i8 74, i8 -110, i8 62, i8 -115, i8 -119, i8 -120, i8 34, i8 -124, i8 0, i8 -128>, %entry ]
  %mul.i25 = mul <16 x i8> %ret.0, %ret.0
  %mul.i24 = mul <16 x i8> %mul.i25, %mul.i25
  %mul.i23 = mul <16 x i8> %mul.i24, %mul.i24
  %mul.i = mul <16 x i8> %mul.i23, %mul.i23
  ret <16 x i8> %mul.i
}

define void @accessBig(i64* %storage) {
; PROMOTED-LABEL: accessBig:
; PROMOTED: adrp
; PROMOTED: ret
  %addr = bitcast i64* %storage to <1 x i80>*
  store <1 x i80> <i80 483673642326615442599424>, <1 x i80>* %addr
  ret void
}

define void @asmStatement() {
; PROMOTED-LABEL: asmStatement:
; PROMOTED-NOT: adrp
; PROMOTED: ret
  call void asm sideeffect "bfxil w0, w0, $0, $1", "i,i"(i32 28, i32 4)
  ret void
}