#
6ee06c7a |
| 28-Feb-2022 |
Steve Gou <[email protected]> |
bpu: bring bpu control signals into use (#1477)
|
#
c7fabd05 |
| 28-Jan-2022 |
Steve Gou <[email protected]> |
parameters: reduce ghr length and make it calculated using a formula (#1442)
* parameters: reduce ghr length and make it calculated using a formula
* bpu: add error checking for ghist ptr, suppor
parameters: reduce ghr length and make it calculated using a formula (#1442)
* parameters: reduce ghr length and make it calculated using a formula
* bpu: add error checking for ghist ptr, support hist lengths that are not power of 2
show more ...
|
#
62e6338e |
| 22-Jan-2022 |
Lingrui98 <[email protected]> |
bpu: handle fall through error at stage 3
|
#
a60a2901 |
| 22-Jan-2022 |
Lingrui98 <[email protected]> |
bpu,ftq: remove oversize logic
|
#
ced16aa1 |
| 19-Jan-2022 |
Lingrui98 <[email protected]> |
bpu: add more detailed s3 redirect perf counters
|
#
67402d75 |
| 17-Jan-2022 |
Lingrui98 <[email protected]> |
bpu: read oldest bits one stage ahead
|
#
7bb9fc10 |
| 14-Jan-2022 |
Lingrui98 <[email protected]> |
ftq: cut redirect path from toIfuReq.valid
|
#
53bac374 |
| 11-Jan-2022 |
Lingrui98 <[email protected]> |
bpu: add one cycle on direct
|
#
85670bac |
| 09-Jan-2022 |
Lingrui98 <[email protected]> |
ras: should not push or pop when s3_redirect
|
#
03c81005 |
| 07-Jan-2022 |
Lingrui98 <[email protected]> |
ittage: use result in stage3
* remove base table and use ftb results as base pred * add corrsponding redirect logic in bpu
|
#
4813e060 |
| 07-Jan-2022 |
Lingrui98 <[email protected]> |
tage: improve performance and reduce area
* split entries into by numBr and use bits in pc to hash between them * use shorter tags for each table * make perfEvents a general interface for branch pre
tage: improve performance and reduce area
* split entries into by numBr and use bits in pc to hash between them * use shorter tags for each table * make perfEvents a general interface for branch predictor components in order to remove casting operation in composer
show more ...
|
#
cb4f77ce |
| 31-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* move statisical corrector to stage 3 * add recover path in stage 3 for ras in case stage 2 falsely push or pop * let stage 2 has the highest physical priority in bpu * le
bpu: timing optimizations
* move statisical corrector to stage 3 * add recover path in stage 3 for ras in case stage 2 falsely push or pop * let stage 2 has the highest physical priority in bpu * left ras broken for the next commit to fix
show more ...
|
#
edc18578 |
| 30-Dec-2021 |
Lingrui98 <[email protected]> |
ubtb: timing and performance optimizations
* timing: use single ported SRAMs, invalidating read responses on write * performance: -- shortening history length to accelerate training -- use a predict
ubtb: timing and performance optimizations
* timing: use single ported SRAMs, invalidating read responses on write * performance: -- shortening history length to accelerate training -- use a predictor to reduce s2_redirects on FTB not hit
show more ...
|
#
1c91abb6 |
| 26-Dec-2021 |
Steve Gou <[email protected]> |
bpu: fix an error when two stages has different 'oversize' bit
|
#
86d9c530 |
| 23-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: fix fallThruAddr on fallThruError, implement ghist diff mechanism
|
#
b438d51d |
| 18-Dec-2021 |
Lingrui98 <[email protected]> |
ubtb: use folded history class instead of seperately managing a ghr
|
#
b37e4b45 |
| 16-Dec-2021 |
Lingrui98 <[email protected]> |
ubtb: refactor prediction mechanism(temp commit)
|
#
c49b0e7f |
| 14-Dec-2021 |
Lingrui98 <[email protected]> |
Merge remote-tracking branch 'origin/change-fallThrough' into ubtb-refactor
|
#
b30c10d6 |
| 14-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* use parallel mux to select provider and altprovider for TAGE and ITTAGE * reduce logics on SC prediction * calculate higher bits of targets at stage 1 for ftb * reduce lo
bpu: timing optimizations
* use parallel mux to select provider and altprovider for TAGE and ITTAGE * reduce logics on SC prediction * calculate higher bits of targets at stage 1 for ftb * reduce logics for RAS and ITTAGE prediction assignment
show more ...
|
#
2a3050c2 |
| 14-Dec-2021 |
Jay <[email protected]> |
Optimize IFU and PreDecode timing (#1347)
* ICache: add ReplacePipe for Probe & Release
* remove ProbeUnit
* Probe & Release enter ReplacePipe
* fix bugs when running Linux on MinimalConfi
Optimize IFU and PreDecode timing (#1347)
* ICache: add ReplacePipe for Probe & Release
* remove ProbeUnit
* Probe & Release enter ReplacePipe
* fix bugs when running Linux on MinimalConfig
* TODO: set conflict for ReplacePipe
* ICache: fix ReplacePipe invalid write bug
* chores: code clean up
* IFU: optimize timing
* PreDecode: separate into 2 module for timing optimization
* IBuffer: add enqEnable to replace valid for timing
* IFU/ITLB: optimize timing
* IFU: calculate cut_ptr in f1
* TLB: send req in f1 and wait resp in f2
* ICacheMainPipe: add tlb miss logic in s0
* Optimize IFU timing
* IFU: fix lastHalfRVI bug
* IFU: fix performance bug
* IFU: optimize MMIO commit timing
* IFU: optmize trigger timing and add frontendTrigger
* fix compile error
* IFU: fix mmio stuck bug
show more ...
|
#
1ca0e4f3 |
| 10-Dec-2021 |
Yinan Xu <[email protected]> |
core: refactor hardware performance counters (#1335)
This commit optimizes the coding style and timing for hardware
performance counters.
By default, performance counters are RegNext(RegNext(_)).
|
#
3e52bed1 |
| 08-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: remove stage 3
|
#
1bc6e9c8 |
| 02-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: remove unuseful 'pred_cycle' signal in meta SRAM
|
#
570faa6c |
| 02-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* let the hit signal of each stage be used at last
|
#
ab890bfe |
| 26-Nov-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* use one hot muxes for ftb read resp * generate branch history shift one hot vec for history update src sel and update for all possible shift values
|