#
bf6aaf09 |
| 05-Jul-2022 |
Lingrui98 <[email protected]> |
ubtb: split the selection logic of validArray
and improve parameterizaton of fromMicroBTBEntry
|
#
9658ce50 |
| 25-Mar-2022 |
LinJiawei <[email protected]> |
Bump chisel to 3.5.0
|
#
c7fabd05 |
| 28-Jan-2022 |
Steve Gou <[email protected]> |
parameters: reduce ghr length and make it calculated using a formula (#1442)
* parameters: reduce ghr length and make it calculated using a formula
* bpu: add error checking for ghist ptr, suppor
parameters: reduce ghr length and make it calculated using a formula (#1442)
* parameters: reduce ghr length and make it calculated using a formula
* bpu: add error checking for ghist ptr, support hist lengths that are not power of 2
show more ...
|
#
f4ebc4b2 |
| 23-Jan-2022 |
Lingrui98 <[email protected]> |
ftb,ftq: add a bit indicating there is an rvi call at the last 2 byte for ras to push the right address
|
#
a60a2901 |
| 22-Jan-2022 |
Lingrui98 <[email protected]> |
bpu,ftq: remove oversize logic
|
#
67402d75 |
| 17-Jan-2022 |
Lingrui98 <[email protected]> |
bpu: read oldest bits one stage ahead
|
#
cb4f77ce |
| 31-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* move statisical corrector to stage 3 * add recover path in stage 3 for ras in case stage 2 falsely push or pop * let stage 2 has the highest physical priority in bpu * le
bpu: timing optimizations
* move statisical corrector to stage 3 * add recover path in stage 3 for ras in case stage 2 falsely push or pop * let stage 2 has the highest physical priority in bpu * left ras broken for the next commit to fix
show more ...
|
#
edc18578 |
| 30-Dec-2021 |
Lingrui98 <[email protected]> |
ubtb: timing and performance optimizations
* timing: use single ported SRAMs, invalidating read responses on write * performance: -- shortening history length to accelerate training -- use a predict
ubtb: timing and performance optimizations
* timing: use single ported SRAMs, invalidating read responses on write * performance: -- shortening history length to accelerate training -- use a predictor to reduce s2_redirects on FTB not hit
show more ...
|
#
cb4b23c0 |
| 23-Dec-2021 |
Lingrui98 <[email protected]> |
Merge remote-tracking branch 'origin/master' into bpu-timing
|
#
86d9c530 |
| 23-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: fix fallThruAddr on fallThruError, implement ghist diff mechanism
|
#
d3854a00 |
| 22-Dec-2021 |
Lingrui98 <[email protected]> |
pred: use a same Mux1H both for hit and not hit conditions
|
#
7052722f |
| 21-Dec-2021 |
Jay <[email protected]> |
Add simple instruction prefetch for L2 (directed by branch prediction) (#1374)
* Add Naive Instruction Prefetch
* Add instruction prefetch module in ICache
* send Hint to L2 (prefetched data s
Add simple instruction prefetch for L2 (directed by branch prediction) (#1374)
* Add Naive Instruction Prefetch
* Add instruction prefetch module in ICache
* send Hint to L2 (prefetched data stores in L2)
* Ftq: add prefetchPtr and prefetch interface
* Fix IPrefetch PMP Port preempting problem
* Fix merge conflict
show more ...
|
#
b438d51d |
| 18-Dec-2021 |
Lingrui98 <[email protected]> |
ubtb: use folded history class instead of seperately managing a ghr
|
#
b37e4b45 |
| 16-Dec-2021 |
Lingrui98 <[email protected]> |
ubtb: refactor prediction mechanism(temp commit)
|
#
64db0884 |
| 16-Dec-2021 |
Lingrui98 <[email protected]> |
Merge remote-tracking branch 'origin/change-fallThrough' into ubtb-refactor
|
#
6ce52296 |
| 15-Dec-2021 |
JinYue <[email protected]> |
FetchRequestBundle: signal rename
- target -> nextStartAddr - nextlineStart = cachelineAlign(startAddr) + 64
|
#
c49b0e7f |
| 14-Dec-2021 |
Lingrui98 <[email protected]> |
Merge remote-tracking branch 'origin/change-fallThrough' into ubtb-refactor
|
#
34a88126 |
| 14-Dec-2021 |
JinYue <[email protected]> |
IFU: delete fallThrough address
|
#
b30c10d6 |
| 14-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* use parallel mux to select provider and altprovider for TAGE and ITTAGE * reduce logics on SC prediction * calculate higher bits of targets at stage 1 for ftb * reduce lo
bpu: timing optimizations
* use parallel mux to select provider and altprovider for TAGE and ITTAGE * reduce logics on SC prediction * calculate higher bits of targets at stage 1 for ftb * reduce logics for RAS and ITTAGE prediction assignment
show more ...
|
#
2a3050c2 |
| 14-Dec-2021 |
Jay <[email protected]> |
Optimize IFU and PreDecode timing (#1347)
* ICache: add ReplacePipe for Probe & Release
* remove ProbeUnit
* Probe & Release enter ReplacePipe
* fix bugs when running Linux on MinimalConfi
Optimize IFU and PreDecode timing (#1347)
* ICache: add ReplacePipe for Probe & Release
* remove ProbeUnit
* Probe & Release enter ReplacePipe
* fix bugs when running Linux on MinimalConfig
* TODO: set conflict for ReplacePipe
* ICache: fix ReplacePipe invalid write bug
* chores: code clean up
* IFU: optimize timing
* PreDecode: separate into 2 module for timing optimization
* IBuffer: add enqEnable to replace valid for timing
* IFU/ITLB: optimize timing
* IFU: calculate cut_ptr in f1
* TLB: send req in f1 and wait resp in f2
* ICacheMainPipe: add tlb miss logic in s0
* Optimize IFU timing
* IFU: fix lastHalfRVI bug
* IFU: fix performance bug
* IFU: optimize MMIO commit timing
* IFU: optmize trigger timing and add frontendTrigger
* fix compile error
* IFU: fix mmio stuck bug
show more ...
|
#
3e52bed1 |
| 08-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: remove stage 3
|
#
a229ab6c |
| 03-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* let ubtb store full targets and fall through addresses * add some fields in BranchPrediction so that ifu requests can be solely derived from it
|
#
570faa6c |
| 02-Dec-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* let the hit signal of each stage be used at last
|
#
e69babf9 |
| 27-Nov-2021 |
Lingrui98 <[email protected]> |
Merge remote-tracking branch 'origin/master' into bpu-timing
|
#
ab890bfe |
| 26-Nov-2021 |
Lingrui98 <[email protected]> |
bpu: timing optimizations
* use one hot muxes for ftb read resp * generate branch history shift one hot vec for history update src sel and update for all possible shift values
|