; RUN: opt < %s -analyze -scalar-evolution | FileCheck %s

target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

@A = weak global [1000 x i32] zeroinitializer, align 32

; The resulting predicate is i16 {0,+,1} <nssw>, meaning
; that the resulting backedge expression will be valid for:
;   (1 + (-1 smax %M)) <= MAX_INT16
;
; At the limit condition for M (MAX_INT16 - 1) we have in the
; last iteration:
;    i0 <- MAX_INT16
;    i0.ext <- MAX_INT16
;
; and therefore no wrapping happened for i0 or i0.ext
; throughout the execution of the loop. The resulting predicated
; backedge taken count is correct.

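; Worked example for @test1 (a sketch; the values of %M are illustrative
; assumptions chosen here and are not verified by this test):
;   %M = 5:  1 + (-1 smax 5)  = 1 + 5    = 6
;            %i.0 takes the values 0..5 in %bb, so the backedge is taken
;            6 times before 6 <= 5 fails.
;   %M = -3: 1 + (-1 smax -3) = 1 + (-1) = 0
;            0 <= -3 is false, so the loop exits on the first test and the
;            backedge is never taken.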
; CHECK: Classifying expressions for: @test1
; CHECK: %i.0.ext = sext i16 %i.0 to i32
; CHECK-NEXT:  -->  (sext i16 {0,+,1}<%bb3> to i32)
; CHECK:      Loop %bb3: Unpredictable backedge-taken count.
; CHECK-NEXT: Loop %bb3: Unpredictable max backedge-taken count.
; CHECK-NEXT: Loop %bb3: Predicated backedge-taken count is (1 + (-1 smax %M))
; CHECK-NEXT: Predicates:
; CHECK-NEXT:    {0,+,1}<%bb3> Added Flags: <nssw>
define void @test1(i32 %N, i32 %M) {
entry:
        br label %bb3

bb:             ; preds = %bb3
        %tmp = getelementptr [1000 x i32], [1000 x i32]* @A, i32 0, i16 %i.0          ; <i32*> [#uses=1]
        store i32 123, i32* %tmp
        %tmp2 = add i16 %i.0, 1         ; <i16> [#uses=1]
        br label %bb3

bb3:            ; preds = %bb, %entry
        %i.0 = phi i16 [ 0, %entry ], [ %tmp2, %bb ]            ; <i16> [#uses=3]
        %i.0.ext = sext i16 %i.0 to i32
        %tmp3 = icmp sle i32 %i.0.ext, %M          ; <i1> [#uses=1]
        br i1 %tmp3, label %bb, label %bb5

bb5:            ; preds = %bb3
        br label %return

return:         ; preds = %bb5
        ret void
}

; The predicated backedge taken count is:
;    (2 + (sext i16 %Start to i32) + ((-2 + (-1 * (sext i16 %Start to i32)))
;                                     smax (-1 + (-1 * %M)))
;    )
;
; There are two cases, depending on which operand of the smax is selected:
;
; * -1 + (-1 * %M) <= (-2 + (-1 * (sext i16 %Start to i32)))
;   or: %M >= 1 + (sext i16 %Start to i32)
;
;   The predicated backedge taken count is 0.
;   From the IR, this is correct since we will bail out at the
;   first iteration.
;
; * -1 + (-1 * %M) > (-2 + (-1 * (sext i16 %Start to i32)))
;   or: %M < 1 + (sext i16 %Start to i32)
;
;   The predicated backedge taken count is 1 + (sext i16 %Start to i32) - %M
;
; If %M >= MIN_INT16 + 1, this predicated backedge taken count would be correct
; (even without predicates). However, for %M < MIN_INT16 this would be an
; infinite loop. In these cases, the {%Start,+,-1} <nssw> predicate would be
; false, as the final value of the {%Start,+,-1} expression (%M - 1) would not
; be representable as an i16.

; There is also a limit case here where the value of %M is MIN_INT16. In this
; case we still have an infinite loop, since icmp sge %i.0.ext, MIN_INT16 will
; always return true.

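; Worked example for @test2 (a sketch; the values of %Start and %M are
; illustrative assumptions chosen here and are not verified by this test):
;   %Start = 3, %M = 1:
;     2 + 3 + ((-2 - 3) smax (-1 - 1)) = 5 + (-5 smax -2) = 5 + (-2) = 3
;     %i.0 takes the values 3, 2, 1 in %bb, so the backedge is taken 3 times
;     before 0 >= 1 fails.
;   %Start = 3, %M = 7:
;     2 + 3 + ((-2 - 3) smax (-1 - 7)) = 5 + (-5 smax -8) = 5 + (-5) = 0
;     3 >= 7 is false, so the loop exits on the first test.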
; CHECK: Classifying expressions for: @test2

; CHECK:      %i.0.ext = sext i16 %i.0 to i32
; CHECK-NEXT:    -->  (sext i16 {%Start,+,-1}<%bb3> to i32)
; CHECK:       Loop %bb3: Unpredictable backedge-taken count.
; CHECK-NEXT:  Loop %bb3: Unpredictable max backedge-taken count.
; CHECK-NEXT:  Loop %bb3: Predicated backedge-taken count is (2 + (sext i16 %Start to i32) + ((-2 + (-1 * (sext i16 %Start to i32))) smax (-1 + (-1 * %M))))
; CHECK-NEXT:  Predicates:
; CHECK-NEXT:    {%Start,+,-1}<%bb3> Added Flags: <nssw>

define void @test2(i32 %N, i32 %M, i16 %Start) {
entry:
        br label %bb3

bb:             ; preds = %bb3
        %tmp = getelementptr [1000 x i32], [1000 x i32]* @A, i32 0, i16 %i.0          ; <i32*> [#uses=1]
        store i32 123, i32* %tmp
        %tmp2 = sub i16 %i.0, 1         ; <i16> [#uses=1]
        br label %bb3

bb3:            ; preds = %bb, %entry
        %i.0 = phi i16 [ %Start, %entry ], [ %tmp2, %bb ]            ; <i16> [#uses=3]
        %i.0.ext = sext i16 %i.0 to i32
        %tmp3 = icmp sge i32 %i.0.ext, %M          ; <i1> [#uses=1]
        br i1 %tmp3, label %bb, label %bb5

bb5:            ; preds = %bb3
        br label %return

return:         ; preds = %bb5
        ret void
}