Mesa 22.2.0 Release Notes / 2022-09-21
======================================

Mesa 22.2.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 22.2.1.

Mesa 22.2.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.

Mesa 22.2.0 implements the Vulkan 1.3 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.

SHA256 checksum
---------------

::

    b1f9c8fd08f2cae3adf83355bef4d2398e8025f44947332880f2d0066bdafa8c  mesa-22.2.0.tar.xz



New features
------------

- WGL_ARB_create_context_robustness

- d3d12 ARB_robust_buffer_access_behavior

- VK_EXT_robustness2 for lavapipe

- VK_EXT_image_2d_view_of_3d on RADV

- zink and d3d12 GL_EXT_memory_object_win32 and GL_EXT_semaphore_win32 support

- vertexAttributeInstanceRateZeroDivisor support for lavapipe

- panfrost Valhall support (conformant OpenGL ES 3.1 on Mali-G57)

- VK_EXT_primitives_generated_query on RADV

- VK_EXT_non_seamless_cube_map on RADV, ANV, lavapipe

- VK_EXT_border_color_swizzle on lavapipe, ANV, turnip, RADV

- VK_EXT_shader_module_identifier on RADV

- variablePointers on lavapipe

- VK_EXT_multisampled_render_to_single_sampled on lavapipe

- VK_EXT_shader_subgroup_vote/ballot on lavapipe

- EGL_KHR_context_flush_control for all drivers

- GLX_ARB_context_flush_control for all drivers

- VK_EXT_attachment_feedback_loop_layout on RADV


Bug fixes
---------

- turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill"
- [radeonsi] Graphical corruption in Valve Source-based games
- Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO
- nouveau: corruption on NVAC after switching to the NIR backend
- zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip
- Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built
- Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi)
- turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635
- regression from !17949
- r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled
- r600/sfn: register allocation fails for a number of piglits and deqp tests
- DG2 corruption in gnome desktop from F36
- radeonsi: error allocating sparse buffers.. bug?
- [BISECTED/radeonsi] Flickering triangles with wayfire cube
- ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED
- turnip: gfxbench aztec ruins crashes on zink
- glsl: textureGatherOffset param broken
- v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c
- radeonsi: driver crashes with kwin-effects-cube
- [Turnip] Final Fantasy X: Incorrect hair rendering
- turnip: gfxbench car chase crash on zink
- [r300] - osu! - Shader optimizations cause black and glitchy rendering
- [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available
- anv: invalid returns from vkCreateDescriptorPool
- [Bisected][RadeonSI] Blender crashes on startup
- turnip: depth/stencil regressions
- llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages
- Poor performance/utilization with Splitgate
- freedreno / mesa 22.1.3 crashes Firefox Nightly
- [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3
- dzn: Build failure when it's the only driver built - 22.2-devel regression
- d3d12: Unresolved external symbols from Version.lib
- turnip: Geometry shader never invoked if there is no EmitVertex and gl_Position is not written
- Trine 4 causing GPU HANG: ecode 12:1:849f7804, in vkr-ring-7
- turnip: GPU fault on clear
- intel_nullhw.c:179:28: error: passing 'const void \*const' to parameter of type 'void \*' discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers]
- macOS undefined symbol _glapi_tls_Context build error
- Build on macOS regressed
- Regression: virgl: The switch to NTT breaks "Total War: Warhammer"
- macOS's LIST_ENTRY macro conflicts with ours
- src/util/disk_cache_os.c:234:21: error: too many arguments provided to function-like macro invocation
- [RadeonSI] Teamfortress 2 crashing frequently with mesa 22.1.1
- Unity game segfaults often
- Alien Isolation segfaulting in radeonsi_dri.so starting with Mesa 22.1.0
- turnip: dEQP-VK.subgroups.ballot_broadcast.framebuffer.* has failures on older deqp-vk
- !17062 broke XTS xts5/Xlib9/XDrawImageString(16) with Xephyr using glamor on llvmpipe
- DG2 full BAR broken by low BAR?
- crocus: SSBOs regressed
- Homerun Clash game triggers assert in compiler
- radv: line rasterization mode affects multisampling
- turnip: caselist fail
- vtn: mediump_16bit_alu and compatibility
- radv: dynamic vertex input failure
- navi21\\radv\\amdgpu Halo Infinite screen corruption and then hang
- ci/freedreno: a618_vk 1/7 timeouts
- Windows OpenGL32.dll without shared-glapi currently broken
- anv: KHR-GL46.tessellation_shader.single.xfb_captures_data_from_correct_stage fails on TGL
- perfetto: crash on context teardown / driver unload
- panfrost shader classification is wrong in some cases
- kopper: the latest awfulness exposed by piglit glx-multi-window-single-context
- android: r600 building error after commit 79ca456b
- Error compiling gallium-nine on i686 using musl libc
- [anv][regression][bisected][bsw,gen9atom] dEQP-VK.memory.mapping.dedicated_alloc failing on bsw and gen9atom
- anv: GTF-GL46.gtf32.GL3Tests.packed_pixels.packed_pixels_pbo failure
- r600: Tesseract does not render properly with the new NIR backend
- radv: some zink cts fails
- anv: ICL hiz issue
- [RADV, bisected] addrlib changes cause Navi CTS regressions
- r300: Segfault in shaders/closed/steam/kerbal-space-program/1017.shader_test
- Get rid of debug_assert()
- nine: docs?
- d3d12: Gallium d3d12 backend performance is slow
- [RADV][bisected] after commit 124848bf9efa1cc5d8a3cdfc654e0c03d57540e3 the games which use vulkan API not starting
- freedreno: skqp instability from GPU hangs
- ci/a618: intermittent network failure
- turnip: EXT_primitives_generated_query broken
- Godot crashes with RX5500m
- intel: Some Alan Wake shader fails validation due to bad register usage in EOT message
- r600/sfn: Add support for fmulz
- radv: failure in image.sample_texture.*_compressed_format*
- radeonsi: Hang on RX 5500 XT on latest mesa git
- blob_align assert triggers
- radv: Enabling support for instance extensions inside vkCreateInstance does not work
- zink: rendering artifacts in Portal 2
- a630: AmnesiaTDD trace gpu faults
- freedreno build issue - FAILED: src/freedreno/decode/crashdec.txt
- segfaults under X11 since the Ubuntu update to 22
- anv: renderdoc explosion on icl
- turnip: EXT_depth_clip_enable broken
- lava: test result parsing fail
- turnip: LRZ fast clear
- [ANV] GTA IV flickering
- [TGL] 25-100% perf impact on SynMark2 with 2GB dynamic pool size
- [Crocus] - Minecraft with Forge and JourneyMap - Graphical glitches
- intel - Minecraft char bar is broken after Mesa upgrade
- [wine] RAGE: texture problems
- "Unsupported modifier, resource creation failed." on stderr
- [RADV/ACO] Star Citizen Flickeriing Lights
- anv: query regressions
- ir3: Infinite loop in scheduler when splitting
- ci/lava: "Listened to connection" spam
- ci: docs build fail
- turnip: 3d copy/clear/... ops should not be counted by occlusion query
- anv: sampleLocationsEnable ignored in VkPipelineSampleLocationsStateCreateInfoEXT
- turnip: wrong border color for Z24S8
- aco_tests fail with llvm-15
- Clang 14 issues compiling src/util/u_cpu_detect.c
- eglExportDMABUFImageQueryMESA crashes on amdgpu gallium/drivers/radeonsi
- crocus: GL_SELECT regressed on hsw
- Random texture flickering in Minecraft
- panfrost: Broken interaction between CRC and tile size
- panfrost: allow_forward_to_be_killed is too conservative
- APU downgraded to OpenGL 3.0 since UBO size change in radeonsi
- [anv, bisected, regression] Pipeline statistics query segfaults
- radv/NIR: Suboptimal codegen for AccessChain on buffer device addresses
- Regression - vah264dec + glimagesink causes green taint on entire video with mesa>=22.0.0
- va: AMD: Wrong buffer size of P010 Luma plane
- 625b352f14b broke glcts on Navi1x
- Use marketing_name for the Vulkan device string in RADV
- Gallium driver crashes due to fd double close in iris driver
- radv: Khronos ray tracing sample issues
- r300: copy propagate does not correctly account for control flow
- r300: Amnesia the Dark Descent slowdown since NIR loop unrolling
- r300: some unused uniforms are not removed with the NIR varying linker (compared to old glsl varying linker)
- Mesa currently FTBFS on x32 architecture
- Runescape randomly crashes. malloc_consolidate(): unaligned fastbin chunk
- tu: viewport assert
- d3d12: point coord regression due to NIR IO changes
- nir_lower_phis_to_regs_block: Assertion \`src->src.is_ssa' failed.
- Redream emulator showing artifacts
- Build failure on sparc
- r600 video playback broken
- r300: hundreds of tests fail since "clear color buffers using color from a constant buffer"
- turnip: shader viewport handling
- Signal 6 (Aborted) when opening GTK3 applications
- radv: Conditional rendering on compute queue
- anv: line failure
- 32 bit build failure
- d3d12 build regression: d3d12video.h: error: expected ‘;’ at end of member declaration
- panfrost midgard - major issues with MelonDS emulator - not able to trace properly too
- Intel/anv: The cloak is flickering in Batman:Arkham Knight
- regression: "NIR based varying linker" breaks some xwayland/glamor applications with etnaviv
- pvr: build failure on archs with unsigned long != uint64_t
- turnip: dEQP-GLES31.functional.copy_image.non_compressed.viewclass_32_bits.* fails with angle
- Nheko misrendering on RK3399 (Mali T860)
- intel_pps_driver.cc:308:74: error: too many arguments to function ‘void intel_perf_query_result_accumulate_fields(intel_perf_query_result*, const intel_perf_query_info*, const void*, const void*, bool)
- radv: no nir validation in radv_link_shaders
- r300: deqp regressions from "glsl: enable the use of the nir based varying linker"
- [crocus] glmark2 -b jellyfish rendering issues
- radeonsi-stoney: Failing jobs during a CI bug
- iris: skqp gles_atlastext flake
- turnip: use-after-free in shader compilation
- anv: EXT_primitives_generated_query fails
- r300: vs predicate logic fails in vs-loop-complex-unroll-with-else-break.shader_test
- [Intel/Vulkan] White blinking artifacts in Wolfenstein Youngblood
- radv: Indirect unaligned dispatch
- [radv] DCC causes artifacts in Senran Kagura Shinovi Versus
- Radeon RX 5700XT crash when using prusa-slicer
- glcts proxy_textures_invalid_samples tests assert on iris
- turnip: gfxbench mh3.1 offscreen crash with ANGLE
- dEQP-GLES31.functional.shaders.builtin_functions.integer.[i|u]mulextended.* fail
- Radv leaks a descriptor set layout
- radeonsi: KHR_debug thread safety
- [radeon, rv370] Running piglit shaders\@glsl-vs-raytrace-bug26691 test causes hard lockup & reboot
- clover: Assertion \`NumContainedTys && "Attempting to get element type of opaque pointer"' failed
- wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes causes memory leak in Apex Legends
- v3d: port v3dv workaround for GFXH-1918
- freedreno: fragile location packing can break cull/clip distance
- turnip: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.generated_args.* failures
- Panfrost: wrong textures and lighting in "Thief" games in wine (not happening when using Software render)
- [Bisected] r600: Big performance regression on some games
- turnip: report corresponding drmFormatModifierPlaneCount for planar format
- piglit ext_image_dma_buf_import.ext_image_dma_buf_import* regression
- Huge amount of anon_inode:sync_file file descriptor created
- enable warnings for GNU extensions we don't want
- Grid Autosport via proton/vulkan: glitches on the grass on amdgpu, fine on amdgpu-pro
- NIR validation failed after nir_opt_if
- Etnaviv: The triangle is lost during linking together triangle strips with degenerate triangles
- intel: integer_mad_hi / integer_mad_sat / integer_mul_hi produce invalid results
- Aperture Desk Job rendering artifacts Intel TGL
- intel: integer_sub_sat produces stack corruption / OpISubBorrow compilation issue
- r300/rv530: "Black rendering" of Baldur's Gate Dark Alliance in Xbox emulator CXBX-R under wine with Gallium Nine and RADEON_DEBUG=use_tgsi
- radv/aco: xfb bug
- '../src/amd/common/ac_surface.c:1575: void ac_copy_dcc_equation(const struct radeon_info \*, ADDR2_COMPUTE_DCCINFO_OUTPUT \*, struct gfx9_meta_equation \*): assertion "dcc->equation.gfx9.num_bits <= ARRAY_SIZE(equation->u.gfx9.bit)" failed'
- Occasional flicker corruption in Rage 2, e.g. after loading, with ACO on RX 5700 XT
- Vulkan::Calling vkWaitForFences Timeout
- turnip: bad gmem rendering with PUBG mobile on angle
- Intel (CHT) - Uplink text rendering bugged out in Mesa 22.0
- gen9atom gpu hang on dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1
- bad memory managment on panfrost RK3399 - cannot alocate more ram - fury unleashed
- radv,aco: Regression with global atomics with negative offsets


Changes
-------

Adam Jackson (32):

- docs: Note EGL enum allocation for EGL_EXT_present_opaque
- docs: Update supported drivers for 22.x
- glx/kopper: Wire up a way for SwapBuffers to fail
- wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes
- kopper: Grow a swap interval API
- glx/kopper: Enable GLX_EXT_swap_control etc.
- egl/kopper: Hook up eglSwapInterval
- wsi/x11: Hook up KHR_incremental_present
- dri: Inline dri2InvalidateDrawable into its one caller
- dri: Move driCopySubBufferExtension into drisw
- dri: move driQueryRendererIntegerCommon near its one caller
- dri: Move driC{reate,oncat}Configs near their only caller
- dri: Move dri{Get,Index}ConfigAttrib near their vtable
- dri: Remove what's left of utils.[ch]
- glx/dri2: Require DRI2 >= 1.3 for working buffer invalidation
- dri2: Require a loader with working buffer invalidation
- zink: Print the VkResult if vkCreateInstance fails
- glx/dri: Fix DRI drawable release at MakeCurrent time
- mesa: Stub out GL_APPLE_object_purgeable
- glx: Remove the remains of GLX_MESA_multithread_makecurrent
- gallium/dri: Remove the memory of GLX_MESA_multithread_makecurrent
- glx: Remove some can't-happen checks from protocolDestroyDrawable
- glx: Remove unused __glXSetupForCommand from __glXIsDirect
- glx: Remove excess screen parameter from CreateContext
- glx/dri2: Stop tracking the protocol version in the display state
- glx: Fix return values from __glXGetDrawableAttribute
- glx: Fix drawable refcounting for naked Windows
- kopper: Implement {EGL,GLX}_EXT_buffer_age
- wsi/x11: Synchronously check for error from xcb_present_pixmap
- mesa: Remove Purgeable fields from texture/buffer/renderbuffer objects
- llvmpipe: Indentation fix
- gallium/dri2: Enable {GLX_ARB,EGL_KHR}_context_flush_control

Alejandro Piñeiro (22):

- v3dv/pipeline: fix small comment typo
- v3dv/pipeline_cache: adds check to skip searching for a entry
- v3dv/pipeline_cache: add on disk cache hit stats
- v3dv/pipeline: include pipeline layout on the pipeline sha1
- v3dv/bo: unmap any bo that goes to the bo cache
- v3dv/format: no need for GetPhysicalDeviceFormatProperties
- v3dv/format: Add support for VK_KHR_format_feature_flags2
- nir: get res binding using component 0, instead of asssumig an uint
- v3dv/pipeline: call nir_lower_explicit_io after first nir optimization loop
- v3dv/pipeline: expand nir_optimize, drop st_nir_opts
- v3dv/device: add comment on depthClamp feature
- broadcom: add no_merge_jobs debug option
- broadcom: document which debug options are specific for a given driver
- broadcom/compiler: wrap nir_convert_to_lcssa with NIR_PASS_V
- broadcom/compiler: use NIR_PASS for nir_lower_vars_to_ssa at v3d_optimize_nir
- nir: call nir_metadata_preserve at nir_remove_unused_io_vars
- v3dv/pipeline: use nir_shader_instructions_pass when lowering layout
- broadcom/compiler: removed unused function
- broadcom/compiler: make several passes to return a progress
- v3d/v3dv: use NIR_PASS(_
- vc4/v3d: stop adding NORAST when SHADERDB debug option is used
- vc4/v3d: restore calling debug_get_option_vc4/v3d_debug

Alessandro Astone (5):

- kopper: Only include XCB headers on supported platforms
- Android.mk: Cleanup mesa3d-lib
- Android.mk: Make mesa3d-lib work with absolute path meson outputs
- Android.mk: Generate the dummy source in local-generated-sources-dir
- Android.mk: Intermediate output paths may already be absolute

Alexander Kanavin (1):

- swrast_kms: use swkmsDRI2Extension instead of driDRI2Extension

Alexey Bozhenko (2):

- intel/compiler: fix singleton pointer coverity warning
- spirv: fix OpBranchConditional when both branches are the same

Alyssa Rosenzweig (454):

- panfrost: Add 48-bit and 96-bit tiling routines
- panfrost: Remove bpp check for tiling
- panfrost: Only handle 2D in should_linear_convert
- panfrost: Implement tiled 3D resource access
- panfrost: Only tile for 2D dimensions
- panfrost: Allow tiling all texture targets
- panfrost: Assert alignment in tiling routine
- panfrost: Document arguments to tiling routines
- panfrost: Unit test u-interleaved tiling routines
- docs/panfrost: Document u-interleaved tiling
- panfrost: Rework dirty tracking for Valhall
- panfrost: Dirty track batch masks
- panfrost: Rework PAN_GPU_ID mechanism
- docs/panfrost: Document drm-shim usage
- meson: Ignore unused variables in release builds
- pan/bi: Simplify bi_propagate_pass_flags
- panvk: Respect line width
- panvk: Consider primitive restart in index buffer walks
- pan/bi: Implement fquantize2f16
- panvk: Handle empty shaders gracefully
- panvk: Lower shared memory
- util: Add a generic worklist implementation
- nir: Use u_worklist to back nir_block_worklist
- pan/bi: Implement pack_uvec[24]_to_uint
- panfrost: Add texture features enum to v9.xml
- pan/blit: Compile blit shaders without IDVS
- pan/blit: Generalize texture alignment
- pan/blit: Prepare for Valhall port
- pan/blit: Support v9 data structures
- panfrost: Don't allocate storage for PSIZ on Valhall
- panfrost: Compile libpanfrost for v9
- panfrost: Add helper to emit UBOs
- panfrost: Port uniform/UBO logic to Valhall
- panfrost: Don't fix up alpha test on Bifrost
- panfrost: Use common state emit for compute jobs
- panfrost: Adapt compute job emit for Valhall
- panfrost: Specialize rasterizer state for Valhall
- panfrost: Simplify attribute format expression
- panfrost: Specialize vertex elements for Valhall
- panfrost: Specialize ZSA state for Valhall
- panfrost: Specialize shader descriptors for Valhall
- panfrost: Use dirty flags to emit v9 descriptors
- panfrost: Generate Valhall Malloc IDVS jobs
- panfrost: Compile for v9
- panfrost: Kick off v9 support in the driver
- panfrost: Advertise all textures in drm-shim
- panfrost: Split pan_layout.c from pan_texture.c
- panfrost: Always use 64-bit SD with strides
- panfrost: Add afbc_superblock_{size, width, height} helpers
- panfrost: Use panfrost_afbc_superblock_width
- panfrost: Don't pretend to support multiplane AFBC
- panfrost: Extract panfrost_afbc_is_wide helper
- panfrost: Use pan_block_size in layout calculation
- panfrost: Unify panfrost_block_size paths
- panfrost: Unify paths through image_layout_init
- panfrost: Remove unused dev argument
- panfrost: Unit test block size queries
- panfrost: Simplify how image_layout_init is called
- panfrost: Add a test for pan_image_layout_init
- asahi: Workaround broken GLSL compiler
- agx: Implement some shader-db stats
- agx: Note that RA proceeds in dominance-order
- agx: Track write registers more accurately
- agx: Match order for designated initializers
- agx: Use correct types for some IR enums
- agx: Wrap compiler header in extern "C"
- agx: Add unit test infrastructure
- agx: Use pseudo ops for mov/not/and/xor/or
- agx: Implement simple copyprop
- agx: Dynamically allocate agx_instr->src
- agx: Add p_split pseudoinstruction
- agx: Add phi pseudo instruction
- agx: Add a hash table for vector extracts
- agx: Add helper to emit combines
- agx: Add helper to emit splits
- agx: Use a transfer graph for parallel copies
- agx: Lower p_split after RA
- agx: Optimize p_split(kill) specially
- agx: Emit splits for intrinsics
- agx: Use extract helper for tex internally
- agx: Remove else optimization
- agx: Use a dynarray for predecessors
- agx: Add predecessor index helper
- agx: Mark the logical ends of blocks
- agx: Add agx_after_block_logical helper
- agx: Introduce worklist infrastructure
- agx: Adapt liveness analysis for SSA
- agx: Make DCE dumber
- agx: Translate phi nodes
- agx: Copy prop into phi nodes
- agx: Don't lower phis in NIR
- agx: Lower phi nodes to parallel copies
- agx: Split up RA from post-RA lowering
- agx: Always use hash table for extracts
- agx: Unit test parallel copy lowering
- agx: Remove identity moves
- agx: Add validation pass
- agx: Update RA comment
- agx: Remove has_liveness
- agx: Remove nir_register support
- agx: Lower mediump flat shading
- panvk: Ignore point size for !points
- panvk: Use correct point size limits
- panfrost: Only flip point sprites for GL
- panvk: Streamline no shader RSD case
- panvk: Gate rasterization on !discard, not fs req
- panvk: Emit fragment RSDs even with no shader
- panfrost: Remove ancient unused code
- panfrost: Call tgsi_to_nir earlier
- panfrost: Don't subclass pipe_compute_state
- panfrost: Use s->info.stage instead of tgsi stage
- panfrost: Simplify shader key architecture
- panfrost: Refactor variant selection code
- panfrost: Refactor variant rebind code
- panfrost: Lower point sprites on Bifrost
- panfrost: Lower user clip planes
- panfrost: Flip point coords in hardware
- pan/va: Generalize message? check for asm
- pan/va: Add missing TABLE (SFU) instructions
- pan/va: Mark NOT instructions as deprecated
- pan/va: Add VAR_TEX_BUF_GRADIENT instruction
- pan/va: Add non-IDVS VAR_TEX instructions
- panvk: Lower global vars to local
- panvk: Allocate stack for each thread
- pan/bi: Handle texture offset + index
- panvk: Lower image deref atomics
- panvk: Only write depth when depth test enabled
- pan/decode: Print row strides, not line strides
- panvk: Remove unused layout structs
- panvk: Report row_stride in GetImageSubresourceLayout
- panfrost: Add helpers to work with legacy strides
- panfrost: Use row stride for explicit layouts
- panfrost: Rename away from bytes_per_pixel
- panfrost: Use row stride to calculate slice size
- panfrost: Adapt get_param for row strides
- panfrost: Use row_stride even for linear resources
- lima,panfrost: Use row stride for tiling routines
- panfrost: Remove line_stride
- panfrost: Unify row stride and AFBC row stride
- panfrost: Unit test stride calculations
- panfrost: Unit test "from legacy" helper
- pan/bi: Rename bi_block->name to bi_block->index
- pan/bi: Count blocks
- pan/bi: Add u_worklist wrapper macros
- pan/bi: Use bi_worklist in analyze_helper_requirements
- pan/bi: Use bi_worklist for liveness
- pan/bi: Use bi_worklist for post-RA liveness
- pan/bi: Use worklist for scoreboard analysis
- pan/bi: Use a dynarray for predecessors
- nir: Don't set writes_memory for reading XFB
- panfrost: Relax image check
- pan/va: Serialize memory stores
- pan/bi: Align spilled registers on Valhall
- pan/bi: Add option to test spilling
- pan/bi: Assert that blend shaders may not spill
- panfrost: Correct XML for TLS
- panfrost: Use emit_tls
- panfrost: Use packed TLS on Valhall
- panfrost/ci: Smoke test spilling
- panfrost: Don't use VARYING_SLOT_TEX0 internally
- panvk: Don't use VARYING_SLOT_TEX0 internally
- pan/bi: Rework varying linking on Valhall
- panfrost: Increase the maximum tiler heap size
- mesa/st: Fix building tests on macOS
- panvk: Stub pipeline cache using the common code
- panvk: Conform viewport code to Vulkan spec
- panvk: Call nir_opt_trivial_continues
- pan/va: Add some whitespace to Valhall disassembly
- pan/va: Add whitespace after disassembled branches
- agx: Restore Valve copyright header
- pan/bi: Don't analyze td on Valhall
- pan/bi: Emit LEA_TEX on Valhall
- pan/bi: Support image loads on Valhall
- pan/bi: Preserve AXCHG.i32 destination
- pan/bi: Handle shared atomic exchange on Valhall
- pan/bi: Handle shared/scratch on Valhall
- pan/bi: Support atomics on Valhall
- pan/bi: Ensure the end NOP isn't eliminated
- asahi: Fix hangs waiting on the notification queue
- pan/bi: Update bi_count_write_registers for Valhall
- pan/va: Mark more source sizes
- pan/va: Lower split 64-bit sources
- pan/va: Use 64-bit lowering for texturing
- pan/bi: +JUMP can't read same-cycle temp
- pan/bi: Lower phis to scalar
- pan/bi: Use value-based interference with LCRA
- pan/bi: Add source/destination counts
- pan/bi: Add collect and split instructions
- pan/bi: Add bi_before_block cursor
- pan/bi: Lower split/collect before RA
- pan/bi: Fix mov and pack_32_2x16
- pan/bi: Expand MAX_DESTS to 4
- pan/bi: Create COLLECT during isel
- pan/bi: Lift split/collect cache from AGX
- pan/bi: Simplfy BLEND emit
- pan/bi: Emit collect and split
- pan/va: Use split for 64-bit lowering
- pan/va: Don't use bi_word in FAU unit test
- pan/bi: Scalarize bi_lower_swizzle
- pan/bi: Scalarize bi_opt_cse
- pan/bi: Scalarize modifier propagation
- pan/bi: Scalarize copyprop
- pan/bi: Remove bi_word and bi_word_node
- pan/bi: Simplify register precolouring in the IR
- pan/bi: Remove liveness metadata tracking
- pan/bi: Don't propagate discard
- pan/bi: Optimize split of collect
- pan/bi: See through splits for var_tex fusion
- pan/bi: Validate preload constraints are satisfied
- pan/bi: Validate vector widths
- pan/va: Handle 2-src blend in lower_split_src
- pan/va: Remove DISCARD.f32 destination
- pan/va: Use ^ instead of ` to indicate last-use
- asahi: Run lower_blend for all fragment shaders
- asahi: Use z24_in_z32f lowering
- asahi: Validate depth/stencil formats
- asahi: Don't store to unbound render targets
- asahi: Handle uncompressed Z32F depth buffers
- asahi: Pass depth/stencil clear values to firmware
- asahi: Flush on partial clears
- asahi: Handle 3rd deflake address
- asahi: Account for every byte of the cmdbuf
- asahi: Identify more depth pipeline fields
- asahi: Dump "unknown 4a" structure
- asahi: Add XML used for visibility tests
- asahi: Relax Draw "command" check
- asahi: Note unknown field
- asahi: Identify depth bias array
- asahi: Identify "set depth bias index" field
- asahi: Identify depth bias enable bit
- asahi: Add a depth bias structure
- asahi: Implement polygon offset
- asahi: Identify partial render pipeline
- asahi: Sync some names with Project Zero
- asahi: Add segment list pretty printer
- asahi: Split unknown field in segment list header
- asahi: Validate segment list length
- asahi: Compute segment list length
- asahi: Align segment list unknowns with Metal
- asahi: Decode resource allocation responses
- asahi: Decode resource allocation requests
- asahi: Identify suballocation mechanism
- asahi: Identify suballocated size
- asahi: Drop some unknowns
- asahi: Sync some more unknown fields
- asahi: Improve render target dumping
- asahi: Fix pipeline decoding
- asahi: Handle RECT textures as 2D
- asahi: Allocate a larger segment list
- asahi: Guard against encoder overflows
- asahi: Add wrap library
- docs/asahi: Add initial driver docs
- panfrost: Simplify the kmsro create path
- panfrost: Don't allocate SHARED as scanout
- pan/bi: Recoalesce tied operands after spilling
- pan/bi: Schedule for pressure pre-RA
- pan/bi: Allow CSEing LEA_BUF_IMM
- panfrost: Fix crash with GALLIUM_HUD
- panfrost: Extract afbc_stride_blocks helper
- panfrost: Extract pan_afbc_row_stride helper
- panfrost: Fix is_wide return type
- panfrost: Add pan_afbc_tile_size helper
- panfrost: Support tiled AFBC in stride helpers
- panfrost: Align layouts to tiles of superblocks
- panfrost: Add AFBC stride unit tests
- panfrost: Add 3D texture layout unit test
- panfrost: Calculate header_size based on row_stride
- panfrost: Add a tiled 16x16 layout unit test
- panfrost: Add 1x1 layout unit tests
- panfrost: Fix AFBC flags on v6
- panfrost: Put comment in correct #ifdef
- panfrost: Handle AFBC Tiled
- panfrost: Add helper checking tiled AFBC support
- panfrost: Gate tiled AFBC on GPUs that support it
- panfrost: Advertise 16x16 tiled AFBC
- panfrost: Use tiled AFBC textures
- pan/bi: Fuse result types
- pan/bi: Test CMP result_type optimization
- panfrost: Allow FPK on Valhall
- panfrost: Set overdraw_alpha[01] flags
- panfrost: Set allow_rotating_primitives
- pan/bi: Don't allow spilling coverage mask writes
- pan/bi: Add VAR_TEX fusing unit test
- pan/va: Fill in missing src_flat16 enum
- pan/bi: Make LD_VAR w=format instead of w=vecsize
- pan/bi: Model Valhall source formats
- pan/bi: Interpolate varyings at 16-bit
- pan/bi: Extend bi_scoreboard_state for finer tracking
- pan/bi: Add slot to bi_instr
- pan/bi: Preserve flow control for non-psiz variant
- pan/bi: Add ASSERT_SHADER_EQUAL macro
- pan/bi: Add a trivial ctx->inputs for unit tests
- pan/bi: Extract bit_block helper
- pan/bi: Export bi_block_add_successor
- pan/bi: Export helper termination analysis
- pan/bi: Print flow control on instructions
- pan/va: Add pass to insert flow control
- pan/va: Do scoreboard analysis
- pan/va: Terminate helper threads
- pan/va: Add flow control merging pass
- pan/va: Assign slots roundrobin
- pan/va: Respect assigned slots
- pan/va: Unit test flow control insertion
- pan/va: Unit test flow control merging
- pan/bi: Use flow control lowering on Valhall
- pan/mdg: Scalarize with 64-bit sources
- pan/mdg: Remove uppercase write masks
- pan/mdg: Use "<<" instead of "lsl"
- pan/mdg: Only print masked components of swizzle
- pan/mdg: Only print 2 sources for ALU
- pan/mdg: Only print 1 source for moves
- pan/mdg: Distinguish SSA vs reg when printing IR
- pan/mdg: Print outmods when printing IR
- panfrost: Fix XML for AFBC header on v9
- panfrost: Pass arch instead of dev into afbc_format
- panfrost: Add pan_afbc_compression_mode on Valhall
- panfrost: Support rendering to AFBC on Valhall
- panfrost: Handle texturing from AFBC on Valhall
- panfrost: Allow 2D AFBC on Valhall
- panvk: Use vk_image_subresource_*_count for clears
- pan/bi: Respect swizzles in nir_op_pack_64_2x32_split
- pan/bi: Allow vec6 for collects
- pan/bi: Align 64-bit register sources
- pan/bi: Rework Valhall register alignment
- pan/bi: Fix clper_xor on Mali-G31
- panfrost/ci: Mark draw_buffers_indexed.* as flakes
- panfrost: Use C11 static_assert for enums
- nir: Add transform feedback system values
- nir: Export nir_io_add_intrinsic_xfb_info
- pan/bi: Add transform feedback lowering pass
- pan/bi: Handle transform feedback intrinsics
- panfrost: Don't allow vertex shaders to have side effects
- panfrost: Wire up transfrom feedback sysvals
- panfrost: Create transform feedback shaders
- panfrost: Launch transform feedback shaders
- pan/va: Do not insert NOPs into empty shaders
- panfrost: Extract panfrost_get_index_buffer helper
- panfrost: Don't calculate min/max indices on v9
- pan/bi: Extract MUX to CSEL optimization
- pan/va: Implement more lanes
- pan/va: Lower MUX to CSEL where possible
- pan/va: Add MUX lowering tests
- pan/bi: Implement b2i with MUX
- pan/va: Record which instructions are signed
- pan/va: Try negating small constants when lowering
- pan/va: Try widening small constants
- pan/va: Unit test constant lowering pass
- panfrost: Enable Mali-G57
- docs: Document Mali-G57 conformance
- panfrost: Fix inverted indirect dispatch condition
- pan/bi: Implement load_push_constant
- pan/bi: Don't reorder push with no_ubo_to_push
- panfrost: Use push constants for indirect dispatch
- panfrost: Use push constants for indirect draws
- panvk: Use push constants for clear
- panvk: Use push constants for copy shaders
- panfrost: Deduplicate indirect dispatch structs
- panfrost: Remove redundant first_tag access
- panfrost: Remove is_64b assignments
- panfrost: Add panfrost_query_l2_slices helper
- pan/perf: Fix performance counters on G57
- mesa,gallium: Make point coord origin a CAP
- asahi,agx: Don't set PIPE_CAP_POINT_COORD_ORIGIN_UPPER_LEFT
- panvk: Drop support for Midgard
- panfrost: Allow pixels using discard to be killed
- panfrost: Inline pan_fbd_has_zs_crc_ext
- panfrost: Disable CRC at <16x16 tile sizes
- panvk: Don't specialize clear shaders for RT
- panvk: Remove unused pushmaps
- panvk: Clear Z/S attachments without a shader
- panvk: Fix stencil clears of combined ZS images
- panvk: Simplify depth clear preload condition
- panvk: Use common CmdBeginRenderPass
- panfrost: Disable QUADS on Valhall
- panfrost: Fix batch state changes on Valhall
- panfrost: Add tristate_is_defined helper
- panfrost: Implement provoking vertices on Valhall
- pan/va: Move VA_NUM_GENERAL_SLOTS to common
- pan/va: Simplify insert flow tests
- pan/va: Workaround quirk of barrier handling
- pan/va: Unit test barrier handling
- pan/bi: Mark bi_postra_liveness_ins as MUST_CHECK
- pan/bi: Constify bi_is_staging_src argument
- pan/va: Add header guards to valhall_enums.h
- pan/va: Move va_flow_is_wait_or_none to common
- pan/va: Include BLEND for va_swap_12
- pan/va: Use validate_register_pair for BLEND pack
- pan/va: Mark last register reads
- pan/va: Unit test va_mark_last
- pan/bi: Fix LD_BUFFER.i16 definition
- pan/bi: Align accesses with packed TLS
- pan/va: Lower FADD_RSCALE.f32 to FMA_RSCALE.f32
- pan/va: Implement sample positions FAU packing
- pan/va: Rename RSCALE to LDEXP
- pan/bi: Remove FRSCALE from IR
- pan/bi: Model MKVEC.v2i8
- pan/bi: Constant fold MKVEC.v2i8
- pan/va: Pack MKVEC.v2i8 byte lanes
- pan/va: Replace MKVEC.v4i8 with MKVEC.v2i8
- pan/bi: Tune lower_vars_to_scratch
- panfrost: Enable more FP16 caps on Valhall
- panfrost: Enable CAP_INDIRECT_TEMP_ADDR on Valhall
- panfrost: Bump ESSL_FEATURE_LEVEL on Valhall
- v3d: Drop workaround for u_blitter bug
- agx: Handle loop { if { loop { ..
} } } 749- r600g: Remove streamout-based buffer copy path 750- u_blitter: Remove util_blitter_copy_buffer 751- lima,panfrost: Do not ralloc contexts against the screen 752- pan/va: Handle terminal barriers 753- ttn: Set nir->info.separate_shader 754- pan/bi: Fix unpack_32_2x16 definition 755- pan/va: Add missing <roundmode/> to V2F32_TO_V2F16 756- pan/bi: Implement f2f16{_rtz, _rtne} 757- panfrost: Separate core ID range from core count 758- panfrost: Move bifrost_lanes_per_warp to common 759- panfrost: Fix PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS 760- panfrost: Fix PIPE_COMPUTE_CAP_SUBGROUP_SIZE 761- panfrost/ci: Disable 0ad trace on T860 762- panfrost: Clear with a quad to avoid flushing 763- panfrost/ci: Disable T760 jobs 764- nir/opt_shrink_vectors: Round to supported vec size 765- panfrost: Revert provoking vertex assertion 766- panfrost: Only key points to point coord origin 767- pan/va: Dump unencodable instructions 768- pan/va: Use invalid_instruction in more places 769- pan/va: Print instructions with pack assert fails 770- pan/va: Handle BIFROST_MESA_DEBUG=nosb 771- panfrost: Make pan_select_max_tile_size O(1) 772- panfrost: Use implementation-specific tile size 773- panfrost: Protect pandecode by a mutex 774- pan/bi: Require ATEST coverage mask input in R60 775- pan/va: Stall after ATEST 776- panfrost: Fix shader_modifies_coverage on Valhall 777- panfrost: Add decoupled early-ZS helpers 778- panfrost: Unit test early-ZS helpers 779- panfrost: Add zsa->zs_always_passes flag 780- panfrost: Use early-ZS helpers 781- util: Enable list_assert in debugoptimized builds 782- util: Gate simple_mtx_assert_locked on !NDEBUG 783- agx: Port minifloat tests to GTest 784- pan/bi: Assert that we use the correct CLPER 785- pan/bi: Rename CLPER_V6.i32 to CLPER_OLD.i32 786- pan/decode: Fix overrun decoding planes 787- nir/lower_idiv: Be less creative about signs 788- agx: Fix packing of samplers in texture instrs 789- panfrost: Don't segfault on unknown models 790- pan/bi: Don't 
reorder image loads across stores 791- pan/bi: Don't allow ATEST to take a temporary 792- pan/mdg: Print 3 sources for CSEL 793- pan/bi: Fix dual texturing with uniforms 794- pan/bi: Fix out-of-bounds write in va_lower_split_64bit 795- pan/bi: Consider all dests in helper_block_update 796- agx: Fix float copyprop of neg(neg) case 797- panfrost: Respect buffer offset for OpenCL 798 799Andres Calderon Jaramillo (2): 800 801- nir: Account for YUV range. 802- mesa/st: Account for YUV color space and range. 803 804Andres Gomez (5): 805 806- radv/ci: update vkd3d-proton results for AMD's Kabini 807- ci: install a more recent version of Wine in the test-vk image 808- ci: install gfxinfo-mupuf in the test-vk image 809- ci: move b2c artifacts to its own directory 810- ci: add wrapper script for Valve's traces runner 811 812Anuj Phogat (2): 813 814- include/uapi: Update drm_fourcc.h from kernel 815- isl,iris: Add I915_FORMAT_MOD_4_TILED support for XeHP 816 817Arvind Yadav (7): 818 819- nir: Add a lowering pass for polygon and line smoothing 820- radeonsi: Add nir_lower_poly_line_smooth pass for polygon and line smoothing 821- nir: add load_point_coord_maybe_flipped intrinsics for point smoothing 822- nir: Add a lowering pass for point smoothing 823- ac/llvm : Adding Number of all interpolated inputs in ac_shader_abi 824- ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes 825- radeonsi: Enable nir_lower_point_smooth lowering pass for point smoothing 826 827Autumn on Tape (5): 828 829- gallivm: add subgroup shuffle support 830- lavapipe: enable subgroup shuffle operations 831- gallivm: use shufflevector for shuffles when index is constant data 832- gallivm: use VPERMPS (x86/AVX2) for 32-bit 8-element shuffles 833- lavapipe: enable subgroup quad operations 834 835Axel Davy (6): 836 837- frontend/nine: Skip invalid swvp calls 838- frontend/nine: Fix buffer tracking out of bounds 839- frontend/nine: Fix ATOC handling 840- frontend/nine: Fix cso restore bug 
- frontend/nine: Fix shader multi-use crash
- frontend/nine: Fix ff position_t fallback when w = 0

Bas Nieuwenhuizen (39):

- vulkan/wsi/x11: Ensure we have the required number of images for acquire.
- radv: always enable NGG on GFX11
- radv: Mark all formats as DCC compatible with each other on gfx11.
- radv: gfx11 register changes.
- radv: Adjust for new SWIZZLE_ENABLE.
- radv: Add gfx11 DCC fast clear support.
- radv: Disable transform feedback for gfx11.
- radv: update the initialization of SGPR0/1 registers for HS and GS on GFX11
- radv: Deal with derefs from opaque types in function parameters.
- radv: Add performance counter reg write.
- radv: move ac_perfcounters to physical_device.
- radv: Add pstate locking for perfcounters.
- radv: Rename SPM functions.
- radv: Make inhibiting clockgating not static.
- radv: Add helper BO for perf counters.
- radv: Add perf counter lock/unlock commandbuffers.
- radv: Add a list of performance counters.
- radv: Implement support for querying performance counters.
- radv: Implement listing performance counters.
- radv: Expose VK_KHR_performance_query.
- radv: Add ability to override the build id for the cache.
- radv: Add a 32bit memory type.
- radv: Skip setting empty index buffers to avoid hang
- radv: Expose function to write vertex descriptors for dgc.
- radv: Always store stride in the vbo descriptor.
- radv: Require 32bit memory for indirect buffers.
- radv: Expose helper for base pa_su_sc_mode_cntl.
- radv: Make radv_get_vgt_index_size non-static.
- radv: Add helper to write scissors.
- radv: Add flushing for DGC.
- radv: Add DGC meta shader.
- radv: Implement DGC generated command layout structure.
- radv: Implement DGC cmdbuffer generation.
- radv: Implement CmdExecuteGeneratedCommandsNV.
- radv: Add stub for vkCmdBindPipelineShaderGroupNV.
- radv: Expose VK_NV_device_generated_commands.
- vulkan/wsi: Take max extent into consideration for modifier selection.
- amd/common: Don't rely on DCN support checks with modifiers.
- amd/common: Disable DCC retile modifiers on RDNA1

Ben Skeggs (3):

- nv50/ir: fix OUT.FINAL on ampere
- nv50/ir: fix scope of memory ops on ampere
- nv50/ir: fix ALD/AST where address comes from AL2P

Benjamin Cheng (5):

- radv: fix memory leak of descriptor set layout
- radv: refactor image binding into struct
- radv: consolidate descriptor set buffer counts
- radv: implement disjoint memory for multiplane images
- radv: expose disjoint image support

Boris Brezillon (159):

- dzn: Skip binding entries with zero descriptors
- dzn: Make sure sampler heaps don't contain more than 2048 samplers
- microsoft/compiler: Add a dunmmy SV_SampleIndex when needed
- microsoft/spirv_to_dxil: Allow forcing per-sample shading
- dzn: Add support for sampleRateShading
- dzn: Pass the right type to CreateCommandList() in the reset path
- dzn: refactor error-handling
- ci/windows: Add a variable to globally disable jobs using windows runners
- dzn: Use core helpers to fill physical device features/properties
- dzn: Report actual device limits
- d3d12: Move the D3D12ResourceState code to the gallium driver
- ci/windows: Split the windows-related tests
- dzn: Add support for EXT_vertex_attribute_divisor
- dzn: Rename a bunch of dzn_descriptor_set helpers
- dzn: Pass layout objects to various dzn_descriptor_set_ptr_xxx helpers
- dzn: Split the write desc helpers in two halves
- dzn: Add support for KHR_descriptor_update_template
- microsoft/spirv_to_dxil: Add a draw_id field to vertex_runtime_data
- dzn: Fix exec_buf_stride calculation
- dzn: Simplify the triangle_fan_rewrite_index shader
- dzn: Update the draw_id sysval when issuing indirect draws
- dzn: Advertise KHR_shader_draw_parameters
- dzn: Pass an image view to dzn_cmd_buffer_clear_attachment()
- dzn: Set MAX_RTS to D3D12_SIMULTANEOUS_RENDER_TARGET_COUNT
- dzn: Move all render-related states to a substruct
- dzn: Inline dzn_cmd_buffer_clear_attachments()
- dzn: Add support for dynamic rendering
- dzn: Get rid of the render pass logic
- dzn: Allow multiDrawIndirect and drawIndirectFirstInstance
- dzn: Add KHR_draw_indirect_count support
- dzn: Add missing VKAPI_{ATTR,CALL} specifiers to BeginCommandBuffer()
- dzn: Pass the dzn_event pointer to _mesa_hash_table_insert()
- dzn: Fix the STATIC_ASSERT() in dzn_meta_blits_get_context()
- vulkan/wsi: Make wsi_win32_surface_get_capabilities() return the current extent
- dzn: Fix dzn_cmd_buffer_collect_queries()
- dzn: Use the vk_sync_binary abstraction
- spirv: Fix windows build
- dzn: Use the correct ICD path on Win32
- microsoft/spirv_to_dxil: Fix push_constants type declaration
- microsoft/spirv_to_dxil: Fix the push_constant UBO size calculation
- dzn: Emit missing transition barriers for rendering attachments
- ci/windows: Bump spirv-samples commit-ID
- microsoft/spirv2dxil: Add a debug option
- microsoft/compiler: Make sure all SV_Position components are written
- microsoft/spirv_to_dxil: Extract NIR passes out of spirv_to_dxil()
- microsoft/spirv_to_dxil: Constify the descriptor binding mapping fields
- microsoft/compiler: Make input_mappings immune to driver_location holes
- microsoft/spirv_to_dxil: Remove dead IO vars at the end of dxil_spirv_nir_passes()
- microsoft/spirv_to_dxil: Add a linking helper
- microsoft/compiler: Allow vulkan users to handle varyings linking
- microsoft/spirv_to_dxil: vectorize IOs
- dzn: Fix maxVertexInputAttributes limit
- dzn: Handle varying linking explicitly
- dzn: Fix unitialized var access in dzn_meta_compile_shader()
- dzn: Fix vertex input handling
- microsoft/spirv_to_dxil: Let spirv_to_dxil() handle the linking
- microsoft/compiler: Drop the auto_link parameter passed to nir_to_dxil()
- dzn: Switch to ID3D12Device2
- dzn: Drop an unused argument passed to dzn_graphics_pipeline_translate_vi()
- dzn: Use CreatePipelineState()
- dzn: Enable depth-bounds testing
- microsoft/compiler: Lower fquantize2f16
- nir: Fix flat new_var assignment in create_new_io_vars()
- microsoft/compiler: Pick a type that matches interpolation mode for structs
- microsoft/compiler: Truncate function names when needed
- vulkan/util: Fill VkPhysicalDeviceIDProperties::deviceNodeMask
- spirv: Add a dst/src type to image deref loads/stores coming from OpAtomic{Load,Store}
- microsoft/compiler: Images are no longer reprensented by uniform vars
- microsoft/compiler: Set typed_uav_load_additional_formats when appropriate
- microsoft/compiler: s/assert()/unreachable()/ in emit_intrinsic()
- microsoft/compiler: Support load_sample_id_no_per_sample
- dzn: Decorrelate external image info and external image properties
- dzn: Return empty external properties
- dzn: Add a dummy GetDeviceGroupPeerMemoryFeatures()
- dzn: Add a dummy GetImageSparseMemoryRequirements2()
- dzn: Add dummy {Create,Destroy}SamplerYcbcrConversion() implementations
- dzn: Check that no export is requested in the memory allocation path
- dzn: Lower alignment requirements when allocating buffers or single-sample images
- dzn: Add a dzn_desc_type_has_sampler() helper
- dzn: Implement GetDescriptorSetLayoutSupport()
- dzn: Vulkan 1.1 wants maxPerSetDescriptors >= 1024
- dzn: Prepare support for pipeline variants
- dzn: Fix primitiveRestart support
- dzn: Fix triangle-fan emulation
- dzn: Support dynamic depth testing parameters properly
- dzn: Support dynamic line width
- ci/windows: Allow skipping deqp tests if we have to
- ci/windows: Add dEQP-VK.dynamic_state.* to the test list
- ci/windows: Skip dEQP-VK.api.command_buffers.record_many_draws*
- d3d12: Keep access masks up-to-date when removing vars in d3d12_disable_multisampling()
- microsoft/compiler: Move d3d12_fix_io_uint_type() to dxil_nir.c
- microsoft/compiler: Fix Layer type
- microsoft/compiler: Support gl_Layer
- dzn: Disable rasterization if the last geometry stage doesn't write the position
- dzn: Enable support for geometry shaders
- ci/dzn: Move to a .toml description
- dzn: Fix the 2DMSArray case in dzn_image_get_dsv_desc()
- dzn: Fix CmdPushConstants()
- dzn: Clamp depthBiasConstantFactor when doing the float -> int conversion
- dzn: Transition resource to RENDER_TARGET/DEPTH_WRITE before clears
- microsoft/compiler: Fix emit_ubo_var()
- ci/dzn: Copy testlog.{css,xsl} to the result dir
- nir/serialize: Put dest last in packed_instr::tex
- dzn: Advertise anisotropic filtering support
- dzn: Advertise shaderImageGatherExtended support
- dzn: Enable dynamic indexing on all kind of descriptors
- dzn: Enable shader{Clip,Cull}Distance
- dzn: Enable the depthClamp feature
- nir/serialize: Support texop >= 16
- nir/serialize: Silence integer-overflow false positive
- vulkan: Add a vk_pipeline_hash_shader_stage() helper
- anv: Use vk_pipeline_hash_shader_stage()
- v3dv: Fix nir_shader leaks in v3dv_meta_{clear,copy}()
- v3dv: Use vk_pipeline_hash_shader_stage()
- tu: Use vk_pipeline_hash_shader_stage()
- radv: Kill unused fs_m local var in radv_create_shaders()
- radv: Use vk_pipeline_hash_shader_stage()
- dzn: Hook-up device-lost detection
- dzn: Fix indentation
- dzn: Initialize UUIDs
- dzn: Drop dzn_pipeline_cache.c
- dzn: Add missing D3D12_CACHED_PIPELINE_STATE to MAX_COMPUTE_PIPELINE_STATE_STREAM_SIZE
- dzn: Fix assertion in d3d12_pipeline_state_stream_new_desc()
- dzn: Use d3d12_compute_pipeline_state_stream_new_desc() in the compute path
- dzn: Pass the maximum stream size to d3d12_pipeline_state_stream_new_desc()
- dzn: Drop useless while(link_mask != 0)
- dzn: Drop unused allocator passed to dzn_graphics_pipeline_create()
- dzn: Let dzn_pipeline_init() initialize the root signature in the stream
- dzn: Fix potential nir_shader leak
- dzn: Use vk_to_mesa_shader_stage()
- dzn: Save a few indentation levels in graphics_pipeline_compile_shaders()
- dzn: Don't delegate binding translation to dxil_spirv_nir_passes()
- dzn: Hash pipeline layout stages
- dzn: Add a generic cached blob wrapper
- dzn: Drop unneeded goto statement in dzn_compute_pipeline_create()
- dzn: Move the compute shader compilation logic to a sub-function
- dzn: Cache NIR shaders
- dzn: Caculate a binding translation hash
- dzn: Cache DXIL shaders
- dzn: Cache pipeline info
- ci/dzn: Add pipeline cache tests to the include list
- dzn: Cache and pack transition barriers
- dzn: Don't call dzn_cmd_buffer_queue_transition_barriers() prematurely
- dzn: Rename dzn_cmd_buffer_queue_image_subres_range_transition()
- dzn: Introduce dzn_cmd_buffer_queue_image_range_state_transition()
- dzn: Allow shaders to read from depth/stencil attachments
- dzn: Fix copies of block-based formats
- dzn: Fix unused var warnings
- dzn: Fix 'statement with no effect' warning
- dzn: Fix printf() format in dxgi_get_factory() error message
- dzn: Fix {back,front}_wr_uses_ref assignment
- dzn: Fix -Wswitch warnings
- dzn: Use the vk_descriptor_set_layout base struct
- dzn: Use the vk_pipeline_layout base struct
- dzn: Mark transition barriers as executed when we execute barriers
- dzn: Query D3D12_FEATURE_D3D12_OPTIONS3 features
- dzn: Provide a helper to check if 2 formats are compatible
- dzn: Support native image copies when formats are compatible
- dzn: Take VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT into account

Boyuan Zhang (9):

- radeonsi/vcn: add vcn 4.0 encode support
- radeonsi/vcn: add vcn 4.0 encode fw interface version
- radeonsi/vcn: add encode context ib for vcn 4.0
- radeonsi/vcn: add session init ib for vcn 4.0
- radeonsi/vcn: enable vcn 4.0 encode for gfx11 asic
- radeonsi/vcn: use calculated max hierarchy depth for hevc enc
- vl: add cabac init idc for h264 encode
- frontends/va: get entropy mode info for h264 encode
- radeonsi/vcn/enc: enable cabac entropy mode

Brian Paul (101):

- gallivm: assorted clean-ups in lp_bld_nir_aos.c
- gallivm: whitespace fixes in lp_bld_tgsi.h
- gallivm: asst. clean-ups in lp_bld_sample.h
- llvmpipe: asst. clean-ups in lp_linear_fastpath.c
- llvmpipe: asst. clean-ups in lp_setup.c
- llvmpipe: asst. clean-ups in lp_state_setup.c
- gallivm: asst. clean-ups in lp_bld_type.[ch]
- gallivm: cleanup lp_bld_arit.c
- gallivm: clean-up in lp_bld_nir.h
- gallivm: add const qualifier in lp_bld_nir_soa.c
- llvmpipe: asst. clean-up in lp_state_derived.c
- llvmpipe: asst. clean-ups in lp_rast_linear.c
- llvmpipe: fix comment typo in lp_linear_fastpath.c
- llvmpipe: asst. clean-ups in lp_rast.c
- llvmpipe: asst. clean-ups in lp_linear.c
- llvmpipe: asst. clean-ups in lp_scene.h
- llvmpipe: asst. clean-up and add comments in lp_state_fs.h
- gallivm: clean-up in lp_bld_sample.c
- llvmpipe: asst. clean-up in lp_setup_context.h
- gallivm: asst. clean-ups in lp_bld_sample_soa.c
- gallivm: clean-ups in lp_bld_tgsi.[ch]
- llvmpipe: asst. clean-ups in lp_state_fs.c
- llvmpipe: minor clean-ups in lp_jit.h
- llvmpipe: asst. clean-ups in lp_state_fs_linear_llvm.c
- llvmpipe: asst. clean-ups in lp_state_fs_linear.c
- llvmpipe: asst. clean-ups in lp_setup_tri.c, lp_setup_rect.c
- llvmpipe: whitespace clean-ups in lp_rast.h
- llvmpipe: minor whitespace/formatting cleanups in lp_state_setup.h
- llvmpipe: minor clean-ups in lp_state_fs_analysis.c
- llvmpipe: misc clean-up in lp_rast_tri.c
- llvmpipe: asst. clean-ups in lp_bld_blend_aos.c
- llvmpipe: asst. clean-ups in lp_rast_tri_tmp.h
- llvmpipe: asst. clean-ups in lp_rast_priv.h
- gallivm: minor clean-ups in lp_bld_sample_soa.c
- llvmpipe: misc clean-ups in lp_bld_interp.c
- gallivm: asst. clean-ups in lp_bld_nir.c
- llvmpipe: fix comment typo in lp_bld_interp.h
- llvmpipe: asst. clean-ups in lp_linear_sampler.c
- llvmpipe: asst. clean-ups in lp_tex_sample.c
- llvmpipe: asst. clean-ups in lp_scene.c
- llvmpipe: minor clean-up in lp_state_blend.c
- llvmpipe: use enum blend types instead of unsigned
- llvmpipe: s/unsigned/enum pipe_compare_func/
- gallivm: asst. clean-ups in lp_bld_logic.
- llvmpipe: asst. clean-ups in lp_state_fs.c
- draw: s/unsigned/enum pipe_prim_type/
- draw: asst. clean-up in draw_context.h
- draw: asst. clean-up in draw_gs.h
- draw: asst. clean-up in draw_pipe.c
- draw: asst. clean-up in draw_pipe_validate.c
- draw: asst. clean-up in draw_private.h
- draw: asst. clean-up in draw_pt.h
- draw: asst. clean-up in draw_pt.c
- draw: asst. clean-up in draw_pt_fetch_shader_emit.c
- draw: asst. clean-up in draw_pt_fetch_shader_pipeline.c
- draw: asst. clean-up in draw_pt_util.c
- draw: asst. clean-up in draw_pt_vsplit.c
- draw: asst. clean-up in draw_vbuf.h
- gallivm: use enum types instead of unsigned in lp_bld_sample.[ch]
- llvmpipe: add minor comments in lp_rast.h, lp_setup-rect.c
- llvmpipe: remove lp_rast_cmd_arg::state field
- gallivm: s/unsigned/enum pipe_swizzle/
- llvmpipe: add simple assertion in generate_fragment()
- util/bitset: add BITSET_SIZE()
- lavapipe: rework code to compute textures_used, samplers_used
- llvmpipe: minor code re-org in lp_state_fs_analysis.c
- nir: add const qualifiers, move some decls in nir_to_tgsi_info.c
- llvmpipe: replace if/then with switch in llvmpipe_nir_fn_is_linear_compat()
- gallivm: s/0/LP_BLD_TEX_MODIFIER_NONE/
- llvmpipe: replace GET_A0() macro w/ inline function
- llvmpipe: add missing tex_info->texture_unit assignment
- llvmpipe: fix invalid memory used in lp_fs_linear_run
- llvmpipe: fix texcoord analysis in llvmpipe_nir_fn_is_linear_compat()
- llvmpipe: replace LP_RAST_OP\_ #defines with enum type
- gallivm: increase LP_MAX_TGSI_SHADER_IMAGES from 16 to 32
- lavapipe: fix logicop, independent blend enable/disable
- llvmpipe: don't allow texture/resource swizzles on linear path
- llvmpipe: initialize a local var to fix compiler warning in release build
- lavapipe: fix incorrect sv[] array size
- llvmpipe: fix texcoord analysis regression
- llvmpipe: asst. clean-up in lp_bld_depth.c
- llvmpipe: minor white-space clean-up in lp_setup.h
- llvmpipe: asst. clean-ups in lp_setup_rect.c
- gallivm: asst. clean-ups in lp_bld_const.c
- llvmpipe: asst. clean-ups in lp_linear_interp.c
- llvmpipe: minor optimization in lp_linear_init_interp()
- llvmpipe: asst. clean-ups in lp_state_setup.c
- llvmpipe: remove unneeded casts
- llvmpipe: replace 'unsigned' with enum types
- gallivm: asst. clean-ups in lp_bld_swizzle.c
- gallivm: simplify shuffling in emit_store_reg()
- gallivm: fix incorrect memset() in SOA emit_load_const()
- gallivm: rename 'var' param in emit_store_reg(), emit_store_var().
- llvmpipe: make opaque field a 1-bit field
- llvmpipe: minor whitespace clean-up in lp_state_fs.h
- llvmpipe: pass frontfacing param to jit_function() call
- llvmpipe: asst. clean-up in lp_rast_linear_fallback.c
- llvmpipe: remove unneeded casts in lp_rast_linear_fallback.c
- llvmpipe: consolidate code in lp_rast_linear_fallback.c
- llvmpipe: tighten up some code in lp_rast_linear_fallback.c
- llvmpipe: add some const qualifiers in lp_rast_linear_fallback.c

Caio Oliveira (1):

- intel/compiler: In XeHP prefer <1;1,0> regions before compacting

Chad Versace (2):

- venus: Refactor vn_fix_graphics_pipeline_create_info
- venus: Don't encode ignored pTessellationState

Charlie Turner (4):

- ci: fix shellcheck violations in the test-vk building script
- ci: add Linux build of apitrace
- ci: upgrade gfxreconstruct build to v0.9.10
- ci: upgrade to DXVK v1.10.1

Charmaine Lee (20):

- gallium/util: Increase the debug_flush map depth to 64
- draw: fix double free of NIR IR
- svga: fix shader IR type passed to draw create shader function
- svga/nir: enable PIPE_CAP_TGSI_TEXCOORD
- svga: enable PIPE_CAP_TGSI_TEXCOORD for vgpu10 and up
- svga/tgsi: add translation for DTRUNC
- svga/nir: enable lower double option
- svga: workaround for unexpected double swizzle
- svga: fix clip distance shadow copy writemask
- svga: add a helper function for common shader creation
- svga: create svga_shader_info
- svga: add svga_compile_shader helper function
- svga: add need_texcoord_semantic to tgsi_add_point_sprite & tgsi_add_aa_point
- svga: fix aa point
- svga: fix reference to NULL offset for atomic counter
- svga: fix shared memory index
- svga: fix mksstats build
- mesa/st: fix reference to nir->info after nir_to_tgsi
- svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context
- svga: fix invalid component access of domain location

Chia-I Wu (61):

- turnip: fix drm modifier support with planar formats
- util: Keep quiet NaNs quiet when converting to half float.
- anv: advertise rectangularLines only for Gen10+
- turnip: let modifier takes precedence over TU_DEBUG=noubwc
- freedreno/fdperf: make refresh rate configurable
- freedreno/fdperf: support dumping counters
- turnip: fix sampledImageIntegerSampleCounts
- turnip: emit VPC_SO_DISABLE in xfb begin/end
- ir3: fix mem_ctx for ir3_disasm_info::nir
- turnip: fix off-by-one in border color bitset
- turnip: fix tu6_pack_border_color for z24
- turnip: disable UBWC for SNORM formats
- turnip: if-checks autotune debug macros
- turnip: base pipeline draw costs on colorWriteMask
- turnip: include stencil test for drawcall costs
- turnip: estimate render pass costs
- turnip: consider render pass costs in autotune
- ci/turnip: add a link to VkPhysicalDeviceMemoryProperties failure
- turnip: force sample interpolations for sample shading
- turnip: advertise VK_EXT_border_color_swizzle
- freedreno/fdl: use XYZW swap for PIPE_FORMAT_X24S8_UINT
- vulkan/device_select: remove get_pdevice_proc_addr
- freedreno/drm-shim: add a660
- turnip: fix an assertion with drm-shim
- ir3: update ir3_const_state comment
- ir3: copy req_local_mem for MESA_SHADER_KERNEL
- ir3: fix output_loc size
- ir3: remove unused patch_vertices_in
- ir3: increment constoff right after it is assigned
- ir3: fix tess param allocation
- ir3: set UL flag before ir3_lower_subgroups
- turnip: remove shared_consts from tu_compiled_shaders
- turnip: shared_consts and push_consts are mutually exclusive
- turnip: add tu_common.h as the common header
- turnip: remove includes that are already in tu_common.h
- turnip: add tu_drm.h
- turnip: add tu_suballoc.h
- turnip: update tu_cs.h
- turnip: add tu_query.h
- turnip: add tu_image.h
- turnip: add tu_formats.h
- turnip: update tu_descriptor_set.h
- turnip: add tu_shader.h
- turnip: add tu_pipeline.h
- turnip: add tu_clear_blit.h
- turnip: add tu_dynamic_rendering.h
- turnip: add tu_lrz.h
- turnip: add tu_pass.h
- turnip: add tu_wsi.h
- turnip: update tu_autotune.h
- turnip: add tu_device.h
- turnip: add tu_cmd_buffer.h
- turnip: add tu_android.h
- turnip: update tu_util.h
- turnip: move away from tu_private.h
- turnip: remove tu_private.h
- turnip: remove headers from libtu_files
- turnip: use SPDX-License-Identifier
- turnip: fix a use-after-free in autotune
- turnip: lower the queue priority to 1
- ir3: fix predicate splitting in scheduler

Christian Gmeiner (12):

- isaspec: Extend split_bits(..) to accept a bitsize
- isaspec: Handle patterns bigger then 64 bit
- nir: Fix unused-variable compile warnings
- freedreno: Wait for async compile jobs to finish
- ir3: Create at least one thread for async shader compile
- gtest: Fix maybe-uninitialized compiler warning
- r600: Switch to unreachable(..)
1290- svga: Clear query before usage 1291- ci: Remove -Wno-error=maybe-uninitialized 1292- etnaviv: factor out shader screen related init/deint 1293- etnaviv: async shader compile 1294- etnaviv: Move nir_shader_compiler_options to compiler 1295 1296Chuansheng Liu (1): 1297 1298- iris,anv: correct the max thread number for DG2+ 1299 1300Clément Guérin (1): 1301 1302- venus: implement VK_EXT_physical_device_drm 1303 1304Connor Abbott (69): 1305 1306- tu: Delete unused tu_clear_blit GS handling 1307- ir3, freedreno, tu: Make ir3_shader_variant store stream_output 1308- ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant 1309- ir3: Use ir3_shader_variant::type more often 1310- ir3: Allocate disasm_info under variant 1311- ir3: Add ir3_shader_variant::compiler 1312- tu: Keep original blit shaders separately 1313- ir3: Remove ir3_shader_variant::shader 1314- ir3: Add functions to serialize variants 1315- ir3: Support disabling the pipeline cache 1316- ir3: Add ir3_shader_create_variant() 1317- tu: Hash pipeline layout contents 1318- tu: Rewrite cache UUID based on radv 1319- tu: Include turnip debug flags in pipeline cache UUID 1320- tu: Implement pipeline caching with shared Vulkan cache 1321- tu: Zero-initialize compute driver key 1322- tu: Implement VK_EXT_pipeline_creation_feedback 1323- tu: Implement VK_EXT_pipeline_creation_cache_control 1324- tu: Convert to sync2 entrypoints 1325- tu: Expose VK_KHR_synchronization2 1326- tu: Implement VK_EXT_image_2d_view_of_3d 1327- tu: Enable shaderTessellationAndGeometryPointSize 1328- tu: Enable VK_KHR_swapchain_mutable_format 1329- tu: Fix partial-write LRZ determination 1330- tu: Implement extendedDynamicState2LogicOp 1331- tu: Implement VK_EXT_color_write_enable 1332- tu: Enable uniform texture buffers with NPOT formats 1333- freedreno/a6xx: Support more 3-component formats 1334- tu: Fix 3d GMEM store swizzle in texture descriptor 1335- tu: Add missing WFI to the 3d GMEM store path 1336- tu: Save/restore 
\*_BIN_CONTROL in 3d GMEM store path 1337- ir3/sched: Fix could_sched() determination 1338- tu: Fix linemode for tessellation with isolines 1339- ir3: Fix vectorizer condition for SSBOs 1340- freedreno: Document a650-specific CP_REG_WRITE flag 1341- freedreno/a6xx: Document border color/swizzle interaction 1342- freedreno/a6xx: Fix border color swizzling 1343- freedreno/a6xx: Use fdl format swizzle 1344- freedreno/fdl: Note border color constraints 1345- freedreno/a6xx: Fix border color with Z24UINT_S8UINT 1346- tu: Fix stencil border color with has_z24uint_s8uint 1347- mesa/st: Fix border color type for stencil sampling 1348- nv50/ir/ra: Fix copying compound for moves 1349- freedreno/a6xx: VPC_SO_NCOMP is actually VPC_SO_BUFFER_STRIDE 1350- tu: Use incoherent CCU write for buffer accesses 1351- tu: Support resolving D24S8 -> S8 1352- tu: Fix resolving d32s8 into s8 on fast path 1353- tu/lrz: Do not use framebuffer when inheriting LRZ 1354- tu: Treat CP_WAIT_FOR_ME as a cache invalidate 1355- tu: Rewrite tess factor emission 1356- tu: Move TU_DONT_CARE_AS_LOAD into attachment_set_ops() 1357- tu: Disable GMEM for multiview inside tu_render_pass_gmem_config 1358- tu: Move tu_render_pass definition up 1359- tu: Remove usage of RenderPassBeginInfo 1360- tu: Initial support for dynamic rendering 1361- tu: Remove has_subpass_predication 1362- tu: Update more state with secondaries 1363- tu: Split out some state into a separate struct 1364- util/trace: Fix u_trace_*_iterator with no tracepoints 1365- tu: Add support for suspending and resuming renderpasses 1366- tu: Use common vk_image struct 1367- tu: Use common vk_image_view base struct 1368- tu: Add debug option to use emulated renderpass support 1369- tu: Restore formatting of tu_clear_blit.c 1370- tu: Fix sysmem depth attachment clear flushing 1371- ir3/spill: Fix extracting from a vector at the end of a block 1372- tu/lrz: Fix multiple subpass case with secondaries 1373- tu/lrz: Fix multiple depth attachment case 
with secondaries 1374- tu: Don't preload variable-count descriptors 1375 1376Constantine Shablya (1): 1377 1378- anv: set image_read_without_format NIR option on Vulkan 1.3 1379 1380Corentin Noël (5): 1381 1382- mesa: Make sure to fallback to handling the original choose texture format 1383- virgl/ci: Update virglrenderer 1384- virgl/ci: Update virglrenderer 1385- virgl: Add support for passing the supported number of components from virgl 1386- virgl: Only propagate the uniform numbers if the numbers are actually right 1387 1388Cristian Ciocaltea (4): 1389 1390- virgl/ci: Rename virgl-lava-* jobs to virgl-iris-* 1391- virgl/ci: Update checksums for some virgl iris traces 1392- ci: Add CPU frequency adjustment capability 1393- ci: Limit Intel CPU scaling frequency for performance tests 1394 1395Daniel Schürmann (36): 1396 1397- nir/lower_alu_to_scalar: don't set the nir_builder cursor 1398- aco: remove 'max_waves' and use 'num_waves' to adjust for LDS and workgroup size 1399- aco: split num_waves adjustment into separate function 1400- aco: adjust num_waves for LDS before scheduling 1401- aco/optimizer: prevent any overflow between SGPR and const offset on MUBUF 1402- aco/ra: fix live-range splits of phi definitions 1403- aco: don't re-create vectors for load_barycentric_* intrinsics 1404- aco/ra: improve split_vector register assignment if the operand is not killed 1405- aco/optimizer: re-combine and copy-propagate p_create_vector(p_split_vector) 1406- radv: move two NIR passes out of tight optimization loop 1407- radv: call nir_lower_vars_to_ssa later 1408- aco: fix spilling of phis without temp operands 1409- nir/opt_vectorize: add callback for max vectorization width 1410- nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback 1411- nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions 1412- aco: correctly validate v_fma_mixhi_f16 register assignment 1413- radv: don't lower vectorized instructions to 32bit 1414- radv: 
use callback for nir_lower_to_scalar 1415- radv: vectorize nir_op_fdiv & nir_op_flrp & nir_op_ineg 1416- radv: vectorize nir_op_fabs 1417- radv: fix swizzles after nir_opt_algebraic_late 1418- radv: reorder some NIR passes 1419- nir/opt_shrink_vectors: fix re-using of components for vecN 1420- radv/shader_info: fix load_frag_coord and load_sample_pos read masks 1421- aco: fix packed 16bit fneg/fsat optimization 1422- aco: avoid unnecessary copies in emit_wqm() 1423- aco: Avoid live-range splits in Exact mode 1424- aco: fix assertion in insert_exec_mask 1425- spirv: handle phis decorated with RelaxedPrecision 1426- radv: Only create noop-fs for graphics pipelines 1427- radv: create RT traversal as separate shader 1428- radv/rt: use derefs for the traversal stack 1429- radv/rt: use stage ID as handle for general and closestHit shaders 1430- radv/rt: Don't load ClosestHit SBT on every hit, but only once after traversal 1431- radv/rt: fix nir_builder cursor in lower_rt_instructions() 1432- radv/rt: simplify lower_rt_instructions() 1433 1434Daniel Stone (10): 1435 1436- dzn/ci: Don't spam conformance warnings 1437- CI: Disable Windows jobs 1438- ci: Allow lima jobs to be globally disabled 1439- ci: Also disable Windows container builds when down 1440- ci/windows: Eliminate duplicate script definition 1441- ci/zink: Add glx-tfp to skips 1442- ci/lava: Sleep before, not after, API calls 1443- ci: freedreno farm is down 1444- CI: Disable Collabora lab 1445- Revert "CI: Disable Collabora lab" 1446 1447Danylo Piliaiev (53): 1448 1449- turnip: Try harder to keep LRZ valid and fix a few edge cases 1450- freedreno/a6xx: Add UNK fields to CP_REG_TEST and CP_COND_REG_EXEC 1451- turnip: Skip load/stores for tiles with no geometry 1452- turnip: Add debug option to print gmem load/store skip stats 1453- turnip: Fix tu_debug_flags values clashing 1454- tu: Implement VK_EXT_image_view_min_lod 1455- tu: Do not flush ccu in clear/blits during renderpass 1456- pps: Open writable renderer 
node in DrmDevice::create 1457- tu: Fix indices of drm_msm_gem_submit_cmd when filling them 1458- freedreno/a6xx: Add EARLYPREAMBLE flag to all a6xx_sp_xs_ctrl_reg0 1459- util/u_trace: Add json output 1460- docs/u_trace: document u_trace usage 1461- ir3: handle gl_Layer and gl_ViewportIndex when there is TES + GS 1462- ir3: Force late-z if FS has global store/atomic 1463- tu: Fix D32S8 resolves into D32 or S8 in sysmem mode 1464- spirv: Workaround for RelaxedPrecision on OpLogical* in 3DMark 1465- turnip: Allow nested CP_COND_REG_EXEC 1466- turnip: Fix out-of-bounds builder->shader_iova access 1467- tu/autotune: Prevent division by zero 1468- turnip: copy disasm strings for pipeline executables 1469- tu: Disable sample counting for 3d blits during occlusion query 1470- tu: Reset xfb_used at the end of a renderpass 1471- meson: Define _GNU_SOURCE for android host system 1472- meson/pps: Check if libdrm exists to compile pps 1473- meson/tu: Don't compile libdrm paths if KGSL is selected 1474- tu: Do not expose storage image/buffer features for PACK16 formats 1475- tu: Use hw binning or sysmem with QUERY_TYPE_PRIMITIVES_GENERATED 1476- tu: Don't count 3d blits in QUERY_TYPE_PRIMITIVES_GENERATED 1477- freedreno: Document rest of GRAS_LRZ_CNTL, clarify UNK_25 event 1478- freedreno: document GRAS_UNKNOWN_810A 1479- tu: Overhaul LRZ, implement on-GPU dir tracking and LRZ fast-clear 1480- ir3: Use NIR's info.writes_memory to detect when to force late-z 1481- tu,freedreno: Refactored START/STOP events for pipeline stats 1482- tu: Fix prim gen query and pipeline stats query interaction 1483- zink: re-enable EXT_primitives_generated_query for Turnip 1484- util/u_trace: Fix iteration over config_control 1485- turnip: Refactor tracepoints generation to reduce duplication 1486- turnip: Add TU_GPU_TRACEPOINT envvar to toggle tracepoints 1487- freedreno: Add the rest of tracepoints with start/end to perfetto 1488- freedreno: Refactor tracepoints generation to reduce 
duplication 1489- freedreno: Add FD_GPU_TRACEPOINT envvar to toggle tracepoints 1490- docs/u_trace: Add documentation for \*_GPU_TRACEPOINT envvars 1491- turnip: Disable LRZ fast-clear for gen1 and gen2 1492- tu: Do not dereference descriptorSetLayout in push descriptors tmpl 1493- zink/turnip: Add comments to already triaged failures for tu baseline 1494- turnip: Simple breadcrumbs implementation to debug hangs 1495- tu: WFI after PC_TESSFACTOR_ADDR update 1496- freedreno: WFI after PC_TESSFACTOR_ADDR update 1497- ir3/ra: Always insert interval for precolored inputs 1498- ir3: Never remove GS_HEADER_IR3 sysval input 1499- tu: Disable LRZ write when alpha-to-coverage is enabled 1500- freedreno: Disable LRZ write when alpha-to-coverage is enabled 1501- ir3: Prevent reordering movmsk with kill 1502 1503Dave Airlie (82): 1504 1505- clc: add simple llvm initialise API 1506- clc: initialise one more llvm stage 1507- u_blitter/stencil: take dstbox x/y into accounts for dst fb width 1508- util/stencil: fix stencil fallback blit shader texture types. 1509- llvmpipe: add user memory resources to the debug global list. 1510- gallium/vl: wrap codec support checks in a common function. 1511- gallium/omx: add video codec supported hook for decode paths. 1512- meson: add a video codec support option 1513- gallium/vl: respect the video codecs configure in meson 1514- vulkan/wsi: keep allocate queue families in image, just don't fill them 1515- gallivm: move to new pass manager to handle coroutines change. 1516- radv: precalculate hs offchip parameters. 1517- radv: precalculate tess ring sizes/offsets. 1518- radv/ac: introduce a new common function for hs calcs. 1519- ac: port radeonsi tess factor calculations to common code. 1520- radeonsi: port tess ring calcs to the common helper. 1521- zink: avoid extra submits for empty wait cmdbufs. 1522- zink: don't finish barriers cmd buffer if not used. 1523- radv: cache the slab bo pointer in the pipeline. 
1524- radv: only dirty line stipple when necessary. 1525- radv: optimised command buffer reset of vertex bindings. 1526- aco: move info pointer to a copy. 1527- aco: move to a minimal aco shader info struct. 1528- aco/info: reduce the gs ring info to what is needed. 1529- aco: remove radv specific streamout info 1530- aco: convert vs and so info over to aco structs. 1531- aco/info: add some more fields. 1532- aco/radv: convert to aco shader info at the radv level. 1533- aco: remove radv vs prolog key from aco internals. 1534- aco: drop unused radv include 1535- ac/radv: drop info pointer from the ac and radv shader structs 1536- aco/radv: drop radv_nir_compiler_options from aco. 1537- llvmpipe: align scratch size to 64-bit size. 1538- meson: add build-id to pipe libraries 1539- vl: fix codec checks to disable properly 1540- lavapipe: fix depth bias offset flag enables. 1541- llvmpipe: flush resources for kms swrast path. 1542- llvmpipe: handle timespec overflow on fence waits. 1543- nouveau: move codegen into a standalone library. 1544- nouveau: move codegen to a common higher level directory. 1545- nouveau/codegen: drop all ubytes from codegen. 1546- nouveau/codegen: drop gallium headers from the interface. 1547- llvmpipe: disable alpha test branch if multisample is enabled. 1548- egl/x11: split large put image requests to avoid server destroy 1549- egl/x11: add missing put_image cookie cleanups 1550- wsi/x11: add xcb_put_image support for larger transfers. 1551- glx/drisw: use xcb instead of X to query connection 1552- draw/cliptest: add support for guard band + full Z 1553- zink: workaround depth sampler border color when z24 is z32 1554- aco: refactor the radv binary builder out of the core aco fn. 1555- aco/radv: provide a callback from aco shader building to build binary 1556- aco/radv: provide a vs prolog callback from aco to radv. 1557- aco: drop radv_shader.h include 1558- radv: add a dynamic vertex format cache. 
1559- lavapipe: drop unreachable pNext checks. 1560- lavapipe: enable variablePointers 1561- kms/dri: add mutex lock around map/unmap 1562- crocus: fail query begin if upload allocation fails. 1563- lavapipe: state latest conformance tests passed 1564- llvmpipe: fix aniso cube map arrays. 1565- crocus: fix leak in query code. 1566- llvmpipe: make last_fence a screen/rast object not a context one. 1567- llvmpipe: keep context list and use to track resource usage. 1568- gallivm: use progress from subgroup lowering. 1569- gallivm: fix casting around read invocation. 1570- llvmpipe: export ARB_shader_ballot 1571- lavapipe: expose some subgroup extensions 1572- zink: fixup subgroup vote/ballot enables. 1573- zink/llvmpipe/ci: skip broken CTS ballot test. 1574- util: add reallocarray wrapper for win32 1575- llvmpipe: Allocate scene tiles dynamically 1576- gallium: add a rasterizer state bit for unrestricted depth values. 1577- mesa/st: disable unrestricted fragment depth values for GL/GLES 1578- util/pack_color: clamp depth values outside range for unorm formats. 1579- llvmpipe/fs: handle unrestricted depth values. 1580- lavapipe: enable VK_EXT_depth_range_unrestricted. 1581- util/reallocarray: add errno.h include. 1582- lavapipe: enable VK_FORMAT_R5G5B5A1_UNORM_PACK16 1583- gallivm: fix printf hook for cached shaders. 1584- draw: don't touch info values that aren't valid. 1585- nir_to_tgsi_info: drop const_buffers_declared 1586- llvmpipe: finish rendering before flushing frontbuffer resources. 
1587 1588David Heidelberg (58): 1589 1590- ci: skqp: patch skqp to report also GL results 1591- ci: skqp: upgrade to skqp based on Android CTS 11.0 (r7) 1592- ci: skqp: update URL 1593- ci: intel: Merge anv and iris into src/intel/ci 1594- ci/iris: Enable SKQP on Tiger Lake boards 1595- ci: traces: switch to Valve trimmed traces 1596- ci: traces: re-enable disabled traces which are now trimmed traces 1597- ci: traces: virgl: reenable previously crashing Godot trace 1598- ci: uprev piglit 2022-05-10 1599- ci/iris: skqp: add default GLES rendertests for TGL 1600- ci/iris: skqp: remove flaking atlastext for TGL 1601- ci/virgl: fix checksum for valve traces which run on iris 1602- ci/freedreno: enable ROR and Nheko traces 1603- ci/i915: add entries for RoR and Nheko traces 1604- ci/intel: add RoR and Nheko traces and reenable most of Valve traces 1605- ci/crocus: add RoR and Nheko traces 1606- ci/llvmpipe: add RoR and Nheko traces 1607- ci/radeonsi: add RoR and Nheko traces 1608- ci/virgl: traces: add RoR and Nheko traces 1609- ci/panfrost: add RoR and Nheko traces 1610- ci/iris: skqp: remove flaking atlastext for TGL (gl version) 1611- ci: traces: temporarily disable nheko trace 1612- ci/virgl: traces: temporarily disable nheko trace 1613- ci: uprev piglit 2022-06-09 1614- ci/etnaviv: sort fails.txt 1615- ci/etnaviv: add recent fails and remove 3 unexpected passes 1616- ci/etnaviv: disable tests which takes too long 1617- ci/etnaviv: add recent flakes for gc2000 1618- ci/etnaviv: gc2000: drop flakes from fails 1619- ci/freedreno: add Blender, Warzone2100, Freedoom and Unvanquished traces 1620- ci/freedreno: fix A530 glmark2\@ideas:speed=10000 trace 1621- ci/llvmpipe: add Blender, Warzone2100, Freedoom and Unvanquished traces 1622- ci/iris: add Blender, Warzone2100, Freedoom and Unvanquished traces 1623- ci/virgl: add Blender, Warzone2100, Freedoom and Unvanquished traces 1624- ci/virgl: iris: disable Counter Strike 1.6 trace flaking 1625- ci/crocus: add Blender, 
Warzone2100, Freedoom and Unvanquished traces 1626- ci/panfrost: add Blender, Warzone2100, Freedoom and Unvanquished traces 1627- ci/iris: disable Blender Diorama trace (incompatible with perf. traces) 1628- ci/freedreno: update restricted traces to the brotli compressed version 1629- ci/freedreno: smuggle myself into restricted traces access list 1630- ci/freedreno: disable non-existing trace 1631- ci: Uprev apitrace to 790380e05854 1632- ci: traces: switch to brotli compressed traces 1633- ci/freedreno: add more restricted traces 1634- ci/freedreno: temporary disable AmnesiaTDD 1635- ci/freedreno: disable Stellaris trace 1636- ci/freedreno: disable SpecOps trace, each run flaky 1637- Revert "ci/freedreno: temporary disable AmnesiaTDD" 1638- ci/traces: piglit, be more verbose 1639- docs: briefly document Gallium Nine 1640- ci: add script for manage Mesa CI 1641- ci: Allow disabling the whole of the Igalia farm 1642- ci: move apitrace installation outside of x86_test-vk 1643- ci: wine-apitrace bump to 11.1 1644- ci/freedreno: 3 pixel change in Raven restricted trace 1645- nine: char can be unsigned on non-x86 platforms 1646- mailmap: fix order of emails for Axel Davy 1647- ci: set reasonable amount of wget retries for each download 1648 1649Dawn Han (6): 1650 1651- Add iub features and properties passthrough 1652- Refactor the descriptor enums to be extensible 1653- Add iub type to vn_descriptor_type Verified the pNext VkWriteDescriptorSet is not NULL on iub type 1654- Add the iub binding count tracking 1655- Refactor and add template support for iub 1656- venus: enabled iub in physical device 1657 1658Denis Pauk (1): 1659 1660- panvk: Return VK_ERROR_INCOMPATIBLE_DRIVER for Midgard 1661 1662Dmitry Osipenko (1): 1663 1664- virgl: Fix unmapping of blob resources 1665 1666Dylan Baker (74): 1667 1668- VERSION: bump to 22.2-devel for next cycle 1669- docs: truncate new_features.txt 1670- docs: add release notes for 22.0.2 1671- docs: Add sha256sums for 22.0.2 1672- 
docs: update calendar and link releases notes for 22.0.2 1673- docs: update calendar for 22.1.0-rc1 1674- docs: update calendar for 22.1.0-rc2 1675- winsys/radeon: remove unused simple_list.h 1676- r300: remove unused simple_list.h 1677- gallium/rbug: replace simple_list.h with list.h 1678- gallium/driver_trace: remove simple_list.h 1679- util/list.h: add a function to move an item in a list 1680- gallium/util/u_cache.c: remove trailing whitespace 1681- gallium/util/u_cache: replace simple_list.h with list.h 1682- gallium/draw: remove trailing whitespace 1683- gallium/draw: use list.h instead of simple_list.h 1684- llvmpipe: cleanup trailing whitespace 1685- llvmpipe: replace uses of simple_list.h with list.h 1686- util: remove simple_list 1687- docs: Extend calendar entries for 22.0 by 1 releases. 1688- docs: update calendar and link releases notes for 22.0.3 1689- docs: Extend calendar entries for 22.1 by 1 releases. 1690- docs: update calendar for 22.1.0-rc3 1691- docs: update calendar for 22.1.0-rc4 1692- docs: add release notes for 22.0.3 1693- docs: Add sha256sum for mesa 20.0.3 1694- docs: update calendar and link releases notes for 22.1.0 1695- docs: add release notes for 22.1.0 1696- relnotes: Add sha256sum and fix minor formatting issues 1697- docs: Add calendar entries for 22.1 release. 1698- docs: add release notes for 22.0.4 1699- docs: add sha256sum to 22.0.4 notes 1700- docs: Extend calendar entries for 22.0 by 1 releases. 
1701- docs: update calendar and link releases notes for 22.0.4 1702- docs: add release notes for 22.1.1 1703- docs: Add sha256sums for 22.1.1 1704- docs: update calendar and link releases notes for 22.1.1 1705- docs: add release notes for 22.0.5 1706- docs: update sha256 for 22.0.5 1707- docs: update calendar and link releases notes for 22.0.5 1708- docs: add release notes for 21.3.9 1709- docs: Add sha256 sums for 21.3.9 1710- docs: link releases notes for 21.3.9 1711- docs: add release notes for 22.1.2 1712- docs: add sha256sum to 22.1.2 notes 1713- docs: update calendar and link releases notes for 22.1.2 1714- docs: drop 22.1.4 calendar entry. 1715- docs: Add calendar entries for 22.2 release candidates. 1716- docs: Extend calendar entries for 22.1 by 3 releases. 1717- docs: add release notes for 22.1.3 1718- docs: add sha256sum for mesa 22.1.3 1719- docs: update calendar and link releases notes for 22.1.3 1720- docs: Bump 22.2 branchpoint by two weeks 1721- anv: add gfx version 12.5 flushes to CCS path 1722- iris|anv: gfx version 12.5 data cache flush is not a workaround 1723- VERSION: bump 22.3.0-devel 1724- VERSION: bump for 22.2.0-rc1 1725- .pick_status.json: Update to 8e6bdb2ed31f27c28d258859ec48e8e67f51ff41 1726- .pick_status.json: Update to 0a0205f04552ab21971c431a5f8bb7486d664422 1727- .pick_status.json: Update to 1a3b086b06c6c1ff63a4abe3752ca0a7a854dc77 1728- .pick_status.json: Update to c67e60ae8fefaeadbe06bf0cbd8ad2ee267a6c0e 1729- .pick_status.json: Update to 70891edd9728e7dd73115171fdb03622ce6450ba 1730- VERSION: update to 22.2.0 1731- Revert "VERSION: update to 22.2.0" 1732- VERSION: bump to 22.2.0-rc2 1733- .pick_status.json: Update to a3bf0da1cbd4b10043c80bf44609a3024b5fcc36 1734- .pick_status.json: Update to 24b9ad7cd5ebc7cfa5d03cf0f243ea4841c971b9 1735- .pick_status.json: Update to 74fc367127ccf945f4c649dd6ddff955c802e36e 1736- .pick_status.json: Mark 11ab6087797f805cf158048915c67945613c9a72 as denominated 1737- VERSION: bump to 22.2.0-rc3 1738- 
.pick_status.json: Update to 0c6fbfca0c91ef012e8ab767a317c07f1f6dc5e6 1739- .pick_status.json: Update to 8eac45b27446cd9b9eaeb147af97fff1e09832cb 1740- .pick_status.json: Update to baf24dea943202b3a92cad0c9f9648597040955a 1741- .pick_status.json: Update to 3d4c36a3bcc51ed441b2667d92291bea30ef7449 1742 1743Eli Schwartz (1): 1744 1745- meson: add various generated header dependencies as order-only deps 1746 1747Ella Stanforth (1): 1748 1749- v3dv: Implement VK_KHR_performance_query 1750 1751Emil Velikov (1): 1752 1753- c11: reinstate the original license and authorship 1754 1755Emma Anholt (232): 1756 1757- nouveau/nir: Don't try to emit OP_FMA pre-nvc0. 1758- Revert "ci: Disable Google's lab" 1759- ci/virgl: Add piglit and GL4.3 testing to the virpipe path. 1760- nir: Add lowering for fround_even on r300. 1761- nir: Skip fround_even on already-integral values. 1762- nir: Avoid generating extra ftruncs for array handling. 1763- turnip: Add TU_DEBUG=layout for dumping image layouts. 1764- freedreno/ir3: Make sched nodes before adding deps. 1765- turnip: Add nir_opt_conditional_discard. 1766- freedreno/ir3: Call nir_opt_find_array_copies(). 1767- turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags. 1768- turnip: Don't disable LRZ in subpasses after the first in the easy case. 1769- nir_to_tgsi: Add a workaround for virglrenderer TG4. 1770- nir_to_tgsi: Fix assertion failures handling 64-bit vec3/vec4 ssa undefs. 1771- nir_to_tgsi: Don't forget to split 64-bit store_per_vertex_output. 1772- nir_to_tgsi: Avoid swizzling from undefined channels in load_output. 1773- nir_to_tgsi: Allocate the primid sysval to num_inputs, not num_outputs. 1774- tgsi: Emit ureg HW_ATOMIC decls in range order. 1775- tgsi_to_nir: Emit load_ubo_vec4 instead of load_ubo on non-integer HW. 1776- nine: Disable optional use of TTN when MUL_ZERO_WINS is available. 1777- r600: Fix up some mis-indentation of blocks. 1778- ci/r600: Manual run updates. 
1779- r600: Fix reading back from a temp array immediately after writing on RV770. 1780- r600/sb: Avoid causing an exception when getting the reciprocal of 0u. 1781- r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled. 1782- nouveau/nv50: Print the number of loops in shader-db output. 1783- ci/nouveau: Add expectations files for GM206. 1784- ci/nouveau: Add nv92 xfails. 1785- nouveau/nir: Disable bitfield ops pre-nvc0. 1786- nouveau/nv50: Enable mesa/st alpha test lowering on nv50 with NIR. 1787- nouveau/nv50: Set the primid sysval flag if it's in the sysval list, too. 1788- nouveau/nir: Fix the inverted sense of usesSampleMaskIn. 1789- nouveau/nir: Fix edgeflag input detection. 1790- nouveau/nir: Set the input for vertex/instance ID like TGSI does. 1791- ci/dzn: Cut the runtime by running 1/3 of the enabled tests. 1792- nir_to_tgsi: Make vec_to_movs avoid unsupported coalescing for 64-bit. 1793- nir_to_tgsi: Lower FS input array indexing since we don't declare input arrays. 1794- tgsi: Extend array lengths when merging usage_masks. 1795- virgl: Switch to nir-to-tgsi by default. 1796- nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. 1797- nir_to_tgsi: Drop the txl(lod=0)->tex nir_lower_tex() fixup. 1798- nouveau/nir: Move FS output stores to the end of the last block. 1799- ci/nouveau: Add MESA_GLES_VERSION_OVERRIDE=3.1 baseline state. 1800- nouveau/nir: Add support for pre-GF100 images and ssbos. 1801- nouveau/nir: Put the UBO offset indirect into the address reg. 1802- nouveau: Enable the NIR backend by default. 1803- util: Add some unit tests of the half-float conversions. 1804- nir: Eliminate out-of-bounds read/writes in local lowering. 1805- nir: Don't assert on tg4 offset range. 1806- svga: Add support for requesting NIR and translating to TGSI. 1807- ci/iris: Demote APL deqp to manual-only for now. 1808- ci/iris: Add a bunch of APL and KBL flakes recently. 
1809- mesa/st: Always generate NIR from GLSL, and use nir_to_tgsi for TGSI drivers. 1810- mesa/st: Remove st_glsl_to_tgsi. 1811- mesa/st: Remove TGSI shader program variant handling. 1812- mesa/st: Drop the attrib remapping in st_prepare_vertex_program(). 1813- mesa/st: Remove TGSI-only shader lowering code. 1814- tgsi_scan: Remove unused input_array_last/output_array_last. 1815- glsl: Remove UBO reference lowering. 1816- mesa: Remove unused Mesa IR PROGRAM_* register files. 1817- gallium/tgsi: Remove tgsi_emulate. 1818- mesa/st: Remove now unused TGSI paths from disk cache. 1819- glsl: Retire the non-NIR GLSL linking paths. 1820- gallium: Remove now-unused shader caps. 1821- glsl: Remove the unused lower_if_to_cond_assign. 1822- glsl: Drop the dead MOD_TO_FLOOR path. 1823- mesa/st: Clean up a bit of st_prepare_vertex_program(). 1824- compiler/glsl: Remove the dead parts of build_program_resource_list(). 1825- glsl: Remove EmitNoLoops and the associated lower_jumps(lower_break=true) code. 1826- glsl: Remove unused lower_variable_index_to_cond_assign. 1827- svga: Set lower_bitops for vpu9 screen. 1828- glsl: Stop lowering ir_quadop_vector. 1829- nouveau: disable fences when running under drm-shim. 1830- Revert "ci: remove nouveau from shader-db runs" 1831- ci/iris: Cut the glk-deqp test coverage in half. 1832- glsl: Make all drivers take the GLSLOptimizeConservatively path. 1833- turnip: Add a TU_DEBUG=perf debug option. 1834- ci/crocus: Manual CI updates after CI was down for a bit. 1835- ci/crocus: Merge the piglit runs with the deqp runs. 1836- i915g/ci: Add depth-clear-precision-check xfails like everyone else. 1837- ci/crosvm: Simplify the CID setup. 1838- ci/crosvm: Rename VSOCK_TEMP_DIR -> VM_TEMP_DIR. 1839- ci/crosvm: Terminate the previous crosvm after a deqp-runner timeout. 1840- freedreno/ir3: Fix 16-bit bit_count. 1841- spirv: Use nir_vec_scalars() to simplify matrix transpose. 1842- spirv_to_nir: Cast RelaxedPrecision ALU op dests to mediump. 
1843- turnip: Make RelaxedPrecision-decorated ALU ops 16-bit. 1844- freedreno/ir3: Add support for 16-bit nir_texop_lod. 1845- freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit. 1846- mesa/st: Only scalarize for doubles lowering if we're lowering doubles. 1847- glsl,nir: Move i/umulExtended lowering to NIR. 1848- panfrost: always print the bad ALU op if we're failing to translate. 1849- ci/turnip: Add missing xfails for a618 full VK run 1850- nir_lower_mediump: Drop assertion about not containing movs. 1851- freedreno/ir3: Enable load/store vectorization for SSBO access, too. 1852- freedreno/ir3: Make the types of tex coord padding match. 1853- freedreno/ir3: Fix validation of half-precision image store values. 1854- freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts. 1855- i915g: Ignore the new SEPARABLE_PROGRAM property. 1856- ci/i915: Merge the piglit and deqp runs. 1857- ci/i915: Update manual piglit job expectations. 1858- turnip: Don't bother creating tile_load/store_cs for sysmem rendering. 1859- turnip: Refactor a bit of tu6_emit_tile_select(). 1860- turnip: Move binning decisions from FB usage time to FB creation time. 1861- turnip: Use fb->binning_possible to decide on conditional tile load/stores. 1862- ci/turnip: Add a little forced touch-testing of XFB with no binning requested. 1863- turnip: Refactor a bit of repeated code for subpass setup. 1864- turnip: Include 3d-based CmdClearAttachments() in binning visibility. 1865- turnip: Refactor a bit of subpass attachment processing. 1866- turnip: Allow load/store skipping in vkCmdClearAttachments(). 1867- turnip: Move tile loads back into the draw CS. 1868- nouveau/nv30: Make sure fsat is lowered in the VS. 1869- gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE 1870- glsl: Remove stale lower_instructions comments. 1871- glsl: Move exp/log-to-exp2/log2 lowering to glsl-to-NIR. 1872- tgsi_exec: Fix inf/nan handling for divide by zero. 
1873- glsl: Drop the div-to-mul-rcp lowering for floats. 1874- nir: Fix idiv lowering on !NativeIntegers when lower_fdiv is also set. 1875- glsl: Drop INT_DIV_TO_MUL_RCP lowering. 1876- ci/crocus: Drop g41's xfail for rgba_half_float_oes. 1877- ci/iris: Disable skqp until it can be stabilized. 1878- turnip: Reverse the order of walking pipes or tiles on odd rows. 1879- mesa/st: Abort the linking on driver link failure. 1880- .gitignore: Qualify the path for the ignored build directory. 1881- nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning. 1882- gallium: Rename MUL_ZERO_WINS to LEGACY_MATH_RULES. 1883- nir_to_tgsi: Set LEGACY_MATH_RULES cap for use_legacy_math_rules shaders. 1884- r600/sfn: Implement 0*x=0 behavior for use_legacy_math_rules. 1885- nouveau/nir: Implement mul_zero_wins behavior for use_legacy_math_rules. 1886- tgsi_to_nir: Set use_legacy_math_rules for TGSI_PROPERTY_LEGACY_MATH_RULES. 1887- gallium: Rename PIPE_CAP_TGSI_LEGACY_MATH_RULES to drop "TGSI" 1888- Revert "nine: Disable optional use of TTN when MUL_ZERO_WINS is available." 1889- iris: Enable PIPE_CAP_LEGACY_MATH_RULES. 1890- crocus: Enable PIPE_CAP_LEGACY_MATH_RULES. 1891- ci/dzn: Skip a test that intermittently times out at a minute. 1892- ci/iris: Disable blender-demo-cube_diorama on APL. 1893- ci: Terminate capture-devcoredump before tarring up artifacts. 1894- turnip: Document some fields about resolves. 1895- ci/bare-metal: Apply autopep8 to our python scripts. 1896- ci/bare-metal: Add per-boot-stage timeouts for fastboot and poe. 1897- ci/bare-metal: Get rid of servo's serial feed threads. 1898- ci/bare-metal: Add handling of netboot firmwares for servo boards. 1899- ci/bare-metal: Add test phase timeouts to all boards. 1900- ci: Remove the unused .test-manual 1901- ci/vc4: Merge quick_shader in with deqp-gles 1902- ci/vc4: Turn on deqp-egl testing by default. 1903- vc4: Propagate txf_ms's dest_type to the lowered txf. 
1904- freedreno/ir3: Disable image/ssbo 16-bit conversion folding pre-a6xx. 1905- freedreno/a5xx: Set the buffer bit appropriately in XS_CTRL_REG0. 1906- ci/freedreno: Turn a530 back on by default and update expectations. 1907- ci/bare-metal: Remove "stage: test" from .baremetal-test. 1908- ci/bare-metal: Consolidate needs declarations in .baremetal-test-\*. 1909- ci/bare-metal: Collapse artifacts wget by default. 1910- ci/traces: GC unused code for DXVK trace replay. 1911- ci/traces: Drop ZINK_USE_LAVAPIPE ICD override. 1912- ci/freedreno: Add vulkan+gl integration testing in piglit. 1913- ci/turnip: Test traces on turnip using zink. 1914- ci/turnip: Trim the a630 VK run a bit. 1915- ci: Use "!references" to manage scheduled pipeline rules. 1916- ci: Pull out farm disables to a reused rule. 1917- ci: Pull common zink frontend rules to a shared rule list. 1918- ci/freedreno: Use !references to clean up restricted traces rules. 1919- ci: Simplify vulkan rules using !references. 1920- ci: Split core GL from core VK-or-GL rules. 1921- ci/freedreno: Filter when we run test jobs for VK or GL changes. 1922- ci/zink: Reuse anv-rules. 1923- ci/zink: Simplify lavapipe rules setup, and clarify what the rules are for. 1924- ci: Remove the trailing "when: never"s from rules. 1925- ir3: Retire the cp postsched pass now that we do RA in SSA. 1926- ci: Uprev deqp to 1.3.3.0. 1927- ci/llvmpipe: Skip ssbo.layout.random.all_shared_buffer.36 1928- ci: Make the retry policy default for all jobs. 1929- ci: Rename .ci-run-policy rules to .build-rules. 1930- ci: Drop .build-rules from container jobs. 1931- ci: Remove .build-rules from core test job definitions. 1932- ci: Fix non-freedreno performance jobs running during Marge merges. 1933- ci: Remove GIT_STRATEGY from .test-manual-mr. 1934- mesa/st: Disable OES_texture_3D for PIPE_CAP_MAX_TEXTURE_3D_LEVELS==0. 1935- vc4: Disable OES_texture_3D being exposed. 1936- vc4: Move previous existing 3D xfails up to the group of 3d xfails. 
1937- vc4: Add notes on the remaining dEQP failures. 1938- turnip: Account for additional_cs_reserve_size for both bin and render. 1939- turnip: Fix up per-stage additional size accounting. 1940- turnip: Fix the reservation for vertex inputs. 1941- turnip: Refactor vertex input setup a little. 1942- ir3: Make sure to pass the interp_mode through in our load_bary lowering. 1943- nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3. 1944- freedreno: Rename the "SIZE" regs for interpolateAtOffset to "CENTERRHW" 1945- ir3: Clarify what's happening in the interpolateAtOffset() math. 1946- ir3: Use non-persp interpolation when appropriate for interpolateAtOffset. 1947- zink: Do the timestamp-to-ns math in a double to have better precision. 1948- mesa: Fix the error check for VertexAttrib*. 1949- vc4: Work around a HW bug with 2-vert line loops. 1950- vc4: Add links to test bug reports. 1951- kopper: Fix the return value of kopperSetSwapInterval(). 1952- kopper: Use the swap interval that was set at swapchain creation time. 1953- glx: Unify dri2/dri3 vblank_mode logic. 1954- kopper: Respect the vblank_mode env var. 1955- egl: Just include driconf.h instead of redefining its values. 1956- loader: Just include driconf.h instead of redefining its values. 1957- ci/freedreno: Mark an occasional flake pass that happens on a530. 1958- ir3: Fix the no-emitted-vertex condition emission in geom lowering. 1959- mesa/arbprog: Move the GLSLFragCoordIsSysVal handling to prog_to_nir. 1960- mesa/arbprog: Use nir_lower_io_to_temporaries. 1961- mesa/arbprog: Stop doing optimization in the ARB program IR. 1962- ci/crocus: Update portal 2 trace shas for the recent fix. 1963- ci/crocus: Disable the blender trace. 1964- ci/crocus: Drop xfails for the recent image external fix. 1965- freedreno/ir3: Enable core NIR's 16-bit ALU optimizations. 1966- intel/fs: Simplify brw_barycentric_mode() args. 1967- ci/bare-metal: Add timeouts to the shell commands called in fastboot. 
- ci/freedreno: Switch a630 to manual/disabled for lab maintenance.
- glsl: Use the same NIR path for shared mem lowering as SPIRV does.
- glsl: Remove optimize_swizzles.
- ci/turnip: Add a bit of spilling-vs-ballot testing on a618.
- ci/freedreno: Add some more known flakes for a630 from our IRC logs.
- ci/turnip: Bump up the a630 full run timeout.
- ci/turnip: Add a couple of missing a630 fails.
- Revert "ci/freedreno: Switch a630 to manual/disabled for lab maintenance."
- nir: Split usub_sat lowering flag from uadd_sat.
- ci/crocus: Fix the commented YML for blender-demo-cube_diorama.
- gallivm: Refactor a bit of UBO/SSBO range checking.
- gallivm: Optimize single-invocation SSBO stores.
- ci/virgl: Make the VM-based virgl tests report to #virgl-ci, too.
- nir: Add an opt pass for phis after if choosing between true/false.
- glsl: Remove opt_conditional_discard().
- nir: Use nir_foreach_phi_src consistently.
- ci/freedreno: Update a630 s8 resolve xfails.
- turnip: Split the tiling config into separate layouts based on CCU usage.
- turnip: Use the GMEM CCU space for attachments when the stores won't.
- zink: Make sure that we keep the existing ici pNext chain on inserts.
- spirv: Mark phis as mediump instead of directly lowering them to 16 bit.

Enrico Galli (6):

- microsoft/spirv_to_dxil: Add pass to lower dynamic accesses on ubo[1]
- microsoft/compiler: Fix for arrays of 1 element
- microsoft/compiler: Add support for unnamed ubos
- microsoft/compiler: Fix when using a shadow sampler more than once
- microsoft/spirv_to_dxil: Fix discard semantics
- microsoft/spirv_to_dxil: Fix discard to demote + return pass

Eric Engestrom (36):

- meson: replace deprecated dep.get_pkgconfig_variable(...) with dep.get_variable(pkgconfig : ...)
- docs: show and link to the new macOS CI
- broadcom/rpi4-skips: drop duplicated lines
- mailmap: update my email to Igalia
- vulkan: add script to detect & replace identifiers that have been renamed
- vulkan/runtime: use updated tokens from vk.xml
- vulkan: use updated tokens from vk.xml
- nir: use updated tokens from vk.xml
- radv: use updated tokens from vk.xml
- v3dv: use updated tokens from vk.xml
- turnip: use updated tokens from vk.xml
- zink: use updated tokens from vk.xml
- lavapipe: use updated tokens from vk.xml
- pvr: use updated tokens from vk.xml
- anv: use updated tokens from vk.xml
- dzn: use updated tokens from vk.xml
- panvk: use updated tokens from vk.xml
- venus: use updated tokens from vk.xml
- zink: fix portability_subset usage after rename from EXTX to KHR
- zink: drop unused VkPhysicalDevicePortabilitySubsetPropertiesKHR
- zink: add missing guards around \`have_{ext}`
- docs/isl: fix typos & formatting
- vk/util: handful of pythonic cleanups
- vk/util: simplify extensions gen code
- util/list: rename LIST_ENTRY() to list_entry()
- nouveau: use existing list_first_entry() macro instead of re-writing it
- docs/features.txt: mark VK_ANDROID_native_buffer as supported on turnip
- docs/features.txt: mark VK_ANDROID_native_buffer as supported on v3dv
- docs/ci: fix Emma's name
- turnip: expose support for VK_EXT_acquire_drm_display
- ci: add \`--print-errorlogs` to \`meson test`
- bin/gen_release_notes.py: bump advertised vulkan version to 1.3
- vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties
- wsi/x11: fix memleak in wsi_x11_connection_create()
- meson: replace manual compiler flags with meson arguments
- broadcom: fix dependencies in static_library() calls

Erico Nunes (11):

- ci: put lima farm back online
- docs/ci: fix literalinclude of caching setup files
- lima/ppir: optimize branch comparison when possible
- lima/ppir: merge branch nodes in simple cases
- egl: Fix DETECT_OS macro usage
- ci: update linux kernel to v5.17
- ci: update docs for linux kernel uprevs
- lima/ci: update piglit ci expectations
- lima/ci: reenable lima-mali450-piglit-gpu:arm64
- lima/ci: Don't skip piglit opengl 1 tests
- ci: Revert "CI: Lima farm is offline"

Erik Faye-Lund (157):

- gallium: rename continue shader-cap
- gallium: rename dround shader-cap
- gallium: rename dfracexp/dldexp shader-cap
- gallium: rename ldexp shader-cap
- vulkan: explicitly cast object-type enum
- meson: turn on -Wno-unused-function project-wide
- nir: introduce and use nir_component_mask
- dzn: fixup indent
- dzn: remove all usage of ComPtr<T>
- dzn: always use ID3D12Device1
- dzn: always use ID3D12GraphicsCommandList1
- dzn: use ID3D10Blob instead of ID3DBlob
- dzn: pass IDXGIAdapter1 to d3d12_create_device
- dzn: port to d3d12 c-api
- dzn: add a bunch of missing struct-keywords
- dzn: drop auto usage
- dzn: drop using references
- dzn: no more reinterpret_cast
- dzn: use define instead of constexpr
- dzn: use c-style for-statement
- dzn: use c-style initialization
- dzn: c-style casts
- dzn: do not set unused default member initializer
- dzn: port code to plain c
- dzn: merge util sources
- dzn: remove needless defines
- dzn: remove some needless casts
- dzn: clean up pointer syntax
- dzn: add space before += operator
- dzn: drop needless enum-casts
- dzn: fixup indent
- dzn: remove stale cpp_args
- nir/lower_int64: do not try to clamp floats to int-range
- vulkan: do not depend on alignof(void)
- lavapipe: quiet non-conformant warning on ci
- panvk: quiet non-conformant warning on ci
- vulkan: use c_msvc_compat_args for shared code
- vulkan: drop empty vulkan_wsi_args
- microsoft/clc: fixup indentation
- microsoft/compiler: fixup indentation
- meson: remove unused defines
- meson: deprecate specifying osmesa-bits
- gallium/xlib: fix stale comment
- mesa: add missing error-path
- util: fix test on msvc
- lavapipe: tolerate NULL UBO and SSBO descriptors
- lavapipe: tolerate NULL image view descriptors
- lavapipe: tolerate NULL image-buffer view descriptors
- lavapipe: tolerate NULL sampler-buffer view descriptors
- lavapipe: add support for null-descriptors from EXT_robustness2
- zink: pass screen to extension getter macro
- zink: initialize dispatch-table before queue-init
- zink: call vk-functions through dispatch-table
- zink: pass screen to zink_create_instance
- zink: dynamically load a few functions
- zink: use run-time linking to loader
- dzn: reset correct list
- vulkan/wsi: unbreak win32-support
- d3d12: add missing cmath includes
- tests/graw: rename shaders from .sh to .txt
- dzn: add a debug-flag to wait for the debugger
- dzn: add debug option to redirect stdout/stderr
- editorconfig: remove scons-ism
- editorconfig: remove html-rule
- editorconfig: remove m4-rule
- editorconfig: remove pl-rule
- radeonsi: port amdgcn_glslc build to meson
- freedreno: remove stale makefile
- intel: remove stale makefile
- meson: modernize win_flex stdint.h logic
- c99_compat.h: remove inline and __func__ shims
- util: add missing c99_compat.h includes
- util: remove needless c99_compat.h includes
- egl: remove needless c99_compat.h includes
- gallium/util: remove needless c99_compat.h includes
- mapi: remove needless c99_compat.h includes
- gallium: remove needless c99_compat.h includes
- include: remove needless c99_compat.h includes
- intel/isl: remove needless c99_compat.h includes
- vulkan/device_select: remove needless c99_compat.h includes
- include: drop c99_math.h
- include: drop c11_compat.h
- pvr: do not use c_msvc_compat_args
- turnip: do not do STATIC_ASSERT on a variable
- freedreno: do not use variable in STATIC_ASSERT
- freedreno: do not do STATIC_ASSERT on variables
- iris: do not do STATIC_ASSERT on variables
- freedreno: degrade STATIC_ASSERT to assert
- v3dv: do not do STATIC_ASSERT on variables
- util: add IS_POT macro
- intel/compiler: use macro for power-of-two check
- broadcom/compiler: use macro for power-of-two check
- util: use static_assert directly
- util: implement STATIC_ASSERT using c++11 / c11 primitives
- meson: add back -Werror=vla to msvc compat flags
- ci/windows: add back build-error detection
- d3d12: move cubemap-lowering to common-code
- microsoft/compiler: mark image-functions as such
- microsoft/compiler: make sampler-lowering optional
- microsoft/spirv_to_dxil: lower cube-images to 2d arrays
- dzn: enable binding_model tests
- dzn: correct assert-condition
- dzn: handle stencil-attachment-optimal layout
- dzn: use a null-rtv to handle no-attachment
- dzn: enable two more exts
- util: sort includes
- llvmpipe: assume c99
- c99_compat.h: assume c99 support
- util: use c11 alignof instead of our own
- glcpp: remove outdated msvc hack
- vulkan/util: assume c11
- util: use c11 alignas instead of rolling our own
- gallium: remove stale comment
- gallium: reorganize includes
- llvmpipe: use c11 alignas instead of PIPE_ALIGN_VAR
- gallium: use c11 alignas instead of PIPE_ALIGN_VAR
- gallium: use c++11 alignas instead of PIPE_ALIGN_VAR
- gallium: remove repeated check
- gallium: remove unused macros
- gallium: clean up PIPE_ALIGN_STACK-definition
- docs/zink: correct requirement for tbos
- dzn: correct order of src_subres and dst_subres args
- docs: set language to english
- dzn: fill in driver name and info
- dzn: fill in float-control details
- dzn: fill in non-uniform-indexing props
- dzn: fill in bindless props
- dzn: fill in depth/stencil resolve props
- dzn: fill in minmax props
- dzn: fill misc props
- dzn: expose VK_KHR_driver_properties
- zink: remove pointless test
- zink: remove pointless assert
- zink: drop pointless comment
- llvmpipe: merge all rasterizer-bind functions
- llvmpipe: consolidate legacy_points and multisample state
- llvmpipe: clean up bresenham implementation
- docs/zink: correct name of extension
- docs/zink: document required VK_KHR_swapchain_mutable_format
- docs/zink: document required VK_EXT_border_color_swizzle
- zink: warn about missing image2DViewOf3D feature
- zink: fix EXT_color_write_enable check
- gallium/hud: do not use texture-rect for font
- llvmpipe: remove unused function
- zink: clean up extension emitting
- zink: set dynamic indexing caps
- zink: more accurately set {Sampled,Image}1D caps
- zink: more accurately set {Sampled,Image}Buffer caps
- zink: more accurately set ImageQuery cap
- docs: fixup link to virgl docs
- zink: add have_D32_SFLOAT_S8_UINT boolean
- zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking
- zink: type_main -> type_void_func
- zink: add spirv_builder_function_call
- zink: wrap discard in a function
- zink: clamp miplodbias when creating sampler
- docs/zink: document rgtc requirement

Fabrice Fontaine (1):

- src/util/futex.h: fix build on 32-bit architectures using 64-bit time_t

Feng Jiang (1):

- virgl: do not share virgl_screen between different drm_files

Filip Gawin (5):

- r300: Print warning when stubbing derivatives
- r300: keep negation if w is an inline constant
- r300: don't check for unitialized reads when rewriting register
- r300: prefer old not native swizzle in constant folding
- r300: don't read from output transform_r300_vertex_SEQ/SNE

Francisco Jerez (2):

- intel/dev: Compute pixel pipe information based on geometry topology DRM query.
- intel/fs/xehp+: Emit scheduling fence for all NIR barriers on platforms with LSC.

Frank Binns (3):

- pvr: replace p_compiler.h boolean with stdbool
- pvr: shorten error to err in label names
- pvr: Add TI AM62 as a supported device.

Georg Lehmann (74):

- aco: Remove some old comments in aco_opcodes.py.
- aco: Reuse previous -1 result in find_msb to avoid using VOP3.
- nir/lower_mediump: Add a new pass to fold 16bit image load/store.
- aco: Implement image_store d16.
- aco: Implement image_load d16.
- radv: Use nir_fold_16bit_image_load_store_conversions.
- radv: Run copy_prop and dce after folding 16bit sampling/load/store.
- aco: Remove D16 zero components from image stores.
- meson: Tell glslang to be quiet.
- radeonsi: Use nir_fold_16bit_image_load_store_conversions.
- wsi/x11: Don't leak xcb_get_geometry_reply_t.
- nir/opt_algebraic: Fix mask in shift by constant combining.
- nir/opt_algebraic: Simple xor/ishr optimizations.
- radv: Don't retile read-only images.
- nir/opt_algebraic: Optimize Doom Eternal's word extract by LSB.
- aco: Implement uclz.
- aco: Implement usub_sat.
- aco: Implement isub_sat.
- ac/llvm: Implement usub_sat and isub_sat.
- ac/llvm: Implement uclz.
- nir: Add optional lowering for mul_32x16.
- radv: Lower mul_32x16.
- radv: Lower hadd/hradd.
- radv: Lower 8bit isub_sat/usub_sat.
- radv: Enable VK_INTEL_shader_integer_functions2.
- radv, aco: Packed usub_sat/isub_sat.
- zink: Use VK_USE_64_BIT_PTR_DEFINES to check for 64bit platforms.
- radv, aco: Lower txf offset in NIR.
- nir/lower_tex: Add lower_array_layer_round_even option.
- radv, aco: Round texture array layer in NIR.
- vulkan: Spec update to 1.3.217.
- radv: Implement VK_EXT_non_seamless_cube_map.
- aco: Print r128/a16 MIMG bits separately.
- aco: Remove r128_a16 MIMG builder option.
- anv: Implement VK_EXT_non_seamless_cube_map.
- radv: Remove redundant memset in radv_descriptor_set_create.
- radv: Check descriptor pool entry count before allocating a new set.
- radv: Don't check if we need to copy immutable samplers for non push templates.
- aco/ra: Static assert that changing instruction type to VOP2 is valid.
- aco/ra: Move mac encoding optimization to its own function.
- aco/ir: Pad SOP2 and SOPC to the same size as SOPK.
- aco/ra: Optimize some SOP2 instructions with literal to SOPK.
- aco/optimizer: Convert s_add_u32 with literals to s_add_i32 if carry is not used.
- aco/optimizer: Optimize SOPC with literal to SOPK.
- r600/sfn: Add missing std::array include.
- aco: Fix swapping sources in SOPC -> SOPK optimization.
- aco/assembler: Fix s_bitreplicate_b64_b32 on GFX9.
- nir_to_tgsi: Lower uadd_sat/usub_sat.
- zink: Lower uadd_sat/usub_sat.
- r600: Lower uadd_sat/usub_sat.
- ir3: Implement [iu]sub_sat.
- nir/lower_io: Fix array length of buffers larger than INT32_MAX.
- nir/opt_algebraic: Optimize [ui](add|sub)_sat with 0.
- radv,aco: Don't lower and vectorize 16bit iabs.
- radv: Remove ineg from lower_bit_size_callback callback.
- radv, aco: Don't lower 16bit isign.
- ir3: Only run 16bit tex NIR passes on a5xx+.
- ir3: Lower alu to scalar if nir_legalize_16bit_sampler_srcs made progress.
- nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass.
- nir/lower_mediump: Add an option to only fold if all tex sources can be folded.
- ir3: Stop using nir_legalize_16bit_sampler_srcs.
- aco: Add G16 opcodes.
- aco: Support 16bit sources for texture ops.
- radv: Fold 16bit tex sources.
- aco: Add a note about G16 without A16 to README-ISA.md.
- aco/ir: Generalize (un)ordered_swapped.
- aco/ir: Add integer get_cmp_info.
- aco/ir: Fix swapped nle.
- aco/ir: Add vcmpx opcode to get_cmp_info.
- aco/ir: Add v_cmp_class to get_cmp_info.
- aco/ir: Add swapped opcode for v_cmp_u/v_cmp_o.
- radeonsi: Stop using nir_legalize_16bit_sampler_srcs.
- nir/fold_16bit_tex_image: Default to only_fold_all.
- aco: Force tex operand to have the correct sub dword size before packing.

Gert Wollny (85):

- virgl: Extend integer write out output fix to all non-move integers ops
- r600/sfn: Fix store_shared_r600 write masks
- ntt: Add option to not optimized register allocation
- nir: Add pass to split 64 bit vec3 and vec4 variable access and phis
- ntt: Split 64 bit vec3 and vec4 local variables and phis
- virgl: Clear all indirect flags when writing to a temporary
- nir: Don't optimize to 64 bit fsub if the driver doesn't support it
- ntt: remove dead input variables before lowering FS IO
- r600/sb: Don't optimize float GT and GE
- r600: tune nir options
- r600: Add support for TGSI_OPCODE_ATOMIMIN and IMAX
- virgl: don't move input gl_SampleMaskIn to a temp
- r600: Allow eight bit, three channel formats for vertex buffers
- nir: honor lower_double options for ffloor and ffract
- r600/sb: Don't create three source ops with all kcache values
- r600/sb: Fall back to un-optimized shader if scheduling fails
- virgl: Don't let ntt optimize the register allocation
- tgsi: Don't bother setting the TGSI_PROPERTY_SEPARABLE_PROGRAM
- virgl: Submit the TGSI_PROPERTY_SEPARABLE_PROGRAM when the host understands it
- r600: Add number of ALU groups to statistics
- r600: Make sure that LDS instructions only use bank swizzle 012
- r600: Update nir options
- r600/sfn: rewrite NIR backend
- r600: enable sb by default also for NIR
- r600: Cleanup nir options
- r600/sfn: support nir_op_mulz and legazy math rules
- r600/sfn: Add missing copyright information
- r600/sfn: Don't use assert to test success of operation
- r600: limit loops when trying to merge alu groups
- r600/sfn: Never consider an op with register dest as dead
- r600/sfn: Add support for fdph
- r600/sfn: lower CLIPVERTEX to clip planes
- r600/sfn: emulate pmr::monotonic_buffer_resource if needed
- r600/sfn: clean up some commented out code
- r600/sfn: fix storing shader output array locations
- r600/sfn: lower undef to zero
- r600/sfn: count LDS queue pop reads separately in assembler
- r600/sfn: Fix the kcache failure handling
- r600/sfn: Fix emmision of LDS instructions
- r600/sfn: Fix used of local shared write mask
- r600/sfn: Fix streamout with non-zero component write
- r600: don't dump shader info to files on debug
- r600/sfn: make sure the memory pool is released after shader translation
- r600/sfn: lower the right shader variant
- r600: Delete possible old NIR variant when translating from TGSI
- r600: don't clone the nir shader, just use it
- r600ß/sfn: Don't fall through with VS outputs
- r600/dri-tweak: Disable ARB_shader_image_load_store for "Tomb Raider 2013"
- r600/sb: fall back to unoptimized shader if RA fails
- r600: Don't set TES_EVAL UCP dirty when it is not supported
- virgL: lower fneg and fabs
- virgl: honor host caps in shader cash sha evaluation
- r600/sfn: Factor out fragement shader class for EG
- r600/sfn: Support pre-EG style FS input
- r600/sfn: Prepare StreamOut instruction for pre EG opcodes
- r600/sfn: Handle slot differences between pre-EG and EG
- r600/sfn: Handle chipclass with kcache allocation
- r600/sfn: Handle buffer size query on pre-EG
- r600/sfn: Handle pre-EG GS input fetch
- r600/sfn: Handle primitive input handling for pre-EG
- r600/sfn: Handle non-compressed MS texel fetch
- r600/sfn: Set trans flag for flt_to_int on pre-EG
- r600/sfn: Handle pre-EG buffer fetch
- r600/sfn: Disable FP64 for pre-CEDAR
- r600/sfn: Advertise support for fmulz
- r600/sfn: Enable NIR for pre RG hardware
- r600/sfn: remove leftover debug output
- r600/sfn: Handle R600 class sin/cos
- r600: increase possible stack size in binary code
- r600/sfn: Schedule shift instruction on R600 in t-slot
- r600/sfn: Add GS thread fix just like the TGSI code path
- r600/sfn: Sort FS inputs to make interpolated values come first
- r600/sfn: Fix color outputs when color0 writes all
- r600: Force NOPs when loading AR on R600 class hardware
- r600/sfn: Handle R600 scratch read
- r600: Don't use SB with R600 style scratch reads
- r600/sfn: Handle color0 writes all on R700 like on EG
- r600/sfn: Don't tag mem-ring and stream instructions as exports
- r600/sfn: Don't schedule GDS instructions early
- r600/sfn: Don't scan the whole block for ready instructions
- r600/sfn: Use a heuristic to keep SSBO setup and store close
- r600: Fix reporting TGSI IR support
- r600/sfn: Use a low number for unused target register
- virgl: when reading back wait first, then do the transfer
- r600/sfn: override register ID when it doesn't matter

Glenn Kennard (1):

- nv30: Fix non-scissored clears after a scissor has been set

Greg Depoire--Ferrer (5):

- util/queue: add missing space to comment in util_queue_destroy
- zink: check for error when initializing util_queue
- lavapipe: check for error when initializing lvp_queue
- util/queue: handle thread cration failure in util_queue_adjust_num_threads
- zink: fix crash on dlclose when zink_internal_create_screen failed

Guilherme Gallo (48):

- ci/lava: Encapsulate job data in a class
- ci/lava: Filter log lines from LAVA return
- ci/lava: Return test-suite result as exit code
- ci/lava: Fix LAVA job validation
- ci/lava: Set lava-signal to kmsg
- ci/lava: Always validate the lava job
- ci/lava: Use lava-test-case to run custom scripts in LAVA
- ci/lava: Trap init-stage2.sh background processes
- ci/lava: Fix shebang in deqp-runner.sh
- ci/lava: Parse all test cases from 0_mesa suite
- ci/lava: Let LAVA job submitter run without JWT file
- ci/lava: Reduce LAVA boot phase timeout to 3 minutes
- ci/lava: Cancel the job if the script is interrupted
- ci/lava: Improve exception handling
- ci/lava: Retry when data fetching log RPC call is corrupted
- ci: Fix init-stage2 exit code
- ci: Update trace after CI fix
- ci: Fix tests expectations
- ci/lava: Make job submitter parse the job result
- ci/lava: Fix colored LAVA outputs
- ci/lava: Fix Gitlab Section markers
- ci/lava: Add support for more complex color codes
- ci/lava: Filter out undesired messages
- ci/lava: Improve result parsing regex
- ci/lava: Create Gitlab log sections handler
- ci/lava: Create LogFollower and move logging methods
- ci/lava: Follow job execution via LogFollower
- ci/lava: heartbeat: don't consider kernel message logs
- ci/lava: Wrap job info into a collapsed section
- ci/lava: Wrap job definition dump into a collapsed section
- ci/lava: Add an integration test for LAVA jobs
- ci/lava: Don't print LAVA debug messages
- ci/lava: Highlight kernel messages in bold
- ci/lava: Highlight job retrying message
- ci/lava: Stop printing after the result line
- ci/lava: Flexibilize section marker regexes
- ci/lava: Rename console color names
- ci/lava: Update license header
- ci/lava: Split lava_log into modules
- ci/lava: Detect R8152 issues preemptively and retry
- ci/lava: Make hung job status yellow
- ci/lava: Color red for fatal and yellow for warning
- ci/lava: Add \`slow` pytest marker
- ci/lava: Add canceled job status
- ci/lava: Add delay before calling lava-test-case
- ci/lava: Increase boot timeout
- ci/lava: Only parse result within testcase section
- ci/lava: Remove MINIO_RESULTS_UPLOAD if requested

Hans-Kristian Arntzen (4):

- radv: Fix RTPSO hashing of pGroups.
- vulkan: Update Vulkan XML and headers to 1.3.219.
- vulkan: Add common code for VK_EXT_shader_module_identifier.
- radv: Implement VK_EXT_shader_module_identifier.

Hoe Hao Cheng (4):

- zink/codegen: remember the fields in feats/props structs of extensions
- zink/codegen: do not include compilation structs with extension structs
- zink/codegen: do not automatically consider extensions promoted
- zink/codegen: use the updated identifiers

Hyunjun Ko (15):

- radv: fix assertion for the count of xfb output
- turnip: add an assertion for max descriptor set count.
- ir3: handle intrinsic_load_draw_id when scanning driver constants
- turnip: clamp to 1 for negative bottom-right of viewport.
- turnip: fix wrong size for lrz cs when lrz_track_quirk is enabled.
- freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE
- turnip: keep the depth_clip_disable state at the pipeline builder.
- zink: remove the workaround for depth_clip_enable on turnip
- turnip: Remove an unnecessary assert.
- turnip: expose VK_EXT_shader_module_identifier
- ir3: change maximum size of const files.
- ir3: handle shared consts.
- turnip: clean up unused parameters for user consts.
- turnip: Change to use shared consts for PushConstants
- tu: increase maxPushConstantsSize to 256.

Iago Toral Quiroga (79):

- v3dv: ignore no-op barriers
- v3dv: drop obsolete comment
- v3dv: don't emit final noop job if we don't have anything to signal
- v3dv: document Vulkan requirements for signaling operations
- v3dv: ignore barriers for image layout transitions from undefined layouts
- v3dv: drop default sampler states if not used
- v3dv: flag BCL barriers for all-graphics and all-commands stages
- v3dv: skip binning sync if binning shaders don't access external resources
- v3dv: use wait stage for wait semaphores to decide about binning syncs
- v3dv: check input attachment usage as sampled usage
- v3dv: implement VK_EXT_separate_stencil_usage
- v3dv: trivially implement VK_KHR_separate_depth_stencil_layouts
- docs/features: flag VK_KHR_timeline_semaphore as implemented
- v3dv: expose separateDepthStencilLayouts
- v3dv: implement vkGetPipelineExecutableInternalRepresentationsKHR
- v3dv: implement vkGetPipelineExecutablePropertiesKHR
- v3d/compiler: add more stats to prog_data
- v3dv: implement vkGetPipelineExecutableStatisticsKHR
- v3dv: expose VK_KHR_pipeline_executable_properties
- v3dv: don't leak variant QPU when pipeline compile fails
- v3dv: drop unsused struct
- v3dv: expose VK_KHR_shader_float_controls
- v3dv: expose VK_KHR_spirv_1_4
- v3dv: expose VK_KHR_vulkan_memory_model
- v3dv: fix format usage checks when extended usage flag is set
- broadcom/compiler: don't promote early fragment tests when writing sample mask
- v3dv: consume barriers at the right stages
- v3dv: only clear BCL barrier state if we don't have pending graphics barriers
- v3dv: use an explicit struct type to track barrier state
- v3dv: track sources of barriers
- v3dv: limit sync for barriers to hw queues selected by source mask
- v3dv: don't be too aggressive disabling early Z
- v3dv: use the global RCL EZ disable if we don't have any EZ draws in the job
- broadcom/compiler: disable flags optimization for loop conditions
- v3dv: merge pending secondary barrier state into primary command buffers
- v3dv: handle barriers at the end of a command buffer
- v3dv: fix leak
- broadcom/compiler: fix postponed TMU spills with multiple writes
- broadcom/compiler: don't predicate postponed spills
- v3dv: fix pool descriptor count for inline uniform buffers
- v3dv: fix merge jobs
- nir: fix documentation for uadd_carry and usub_borry opcodes
- v3dv: don't lower uadd_carry and usub_borrow
- v3dv: remove obsolete comment
- nir/serialize: fix missing divergence info after deserialization
- broadcom/compiler: don't try to hide TMU latency at QPU scheduling
- broadcom/compiler: don't over-estimate latency of TMU instructions
- v3dv: fix comment in texel buffer shader copy path
- v3dv: remove unused lowering for nir_intrinsic_load_layer_id
- v3dv: fix comment for point_sprite_mask filed in shader key
- v3dv: clean up get_internal_type_bpp_for_image_aspects
- v3d,v3dv: stop hardcoding various image limits
- v3d: stop using a smaller texture limit in OpenGL
- v3d,v3dv: stop copying and pasting the translate_swizzle helper
- v3dv: only save/restore push constant data for meta operations if needed
- v3dv: move push constant data to the command buffer state
- v3dv: limit upload of indirect push constant data
- v3dv: don't allocate MAX_PUSH_CONSTANTS_SIZE bytes for the push constants UBO
- v3dv: stop tracking push constant buffer references
- v3dv: allocate more push constant buffers if needed
- v3dv: add a dirty state for pending push constants UBO updates
- v3dv: explain why we clear certain state after a draw call
- v3dv: remove duplicate condition
- nir: add nir_address_format_2x32bit_global
- broadcom/compiler: implement 2x32 global intrinsics
- broadcom/compiler: track if a shader uses global intrinsics
- v3dv: implement VK_KHR_buffer_device_address
- v3dv: handle VK_STRUCTURE_TYPE_DRM_FORMAT_MODIFIER_PROPERTIES_LIST_2_EXT
- v3dv: remove assert that poolSizeCount must be larger than 0
- v3dv: fix offset reported by vkGetImageSubresourceLayout
- v3dv: remove unnecesary FIXME comment
- v3d,v3dv: add a common v3d_hw_prim_type helper
- v3dv: only check binning sync for semaphores for the first CL job
- v3dv: explicitly set sampler min/max filter properties to false
- v3dv: fill in Vulkan 1.2 missing device properties
- v3dv: enable missing Vulkan 1.2 features for supported extensions
- v3dv: enable missing mandatory Vulkan 1.2 features
- broadcom/ci: update list of slow tests
- v3dv: expose Vulkan 1.2

Ian Romanick (26):

- glsl: Fix mixed tabs and spaces in lower_mat_op_to_vec.cpp
- glsl: Add flag to disable part of do_vec_index_to_cond_assign
- nir: Use nir_vector_extract to generate code for ir_binop_vector_extract
- glsl: Delete lower_extracts code
- anv: Remove FS executables when applying the null FS optimization
- intel/fs: Add missing synchronization for WaW dependency
- nir: i32csel opcodes should compare with integer zero
- nir/algebraic: Fix NaN-unsafe fcsel patterns
- nir: Add and use algebraic property "is selection"
- nir/range_analysis: Teach range analysis about fdot opcodes
- intel/compiler: Rename vec4 state URB opcodes to have VEC4\_ prefix
- intel/eu: Validate some aspects of URB messages
- intel/compiler: Move logical-send lowering to a separate file
- intel/fs: Add _LOGICAL versions of URB messages
- intel/fs: Lower URB messages to SEND
- intel/fs: Remove non-_LOGICAL URB messages
- intel/vec4: Set lower_usub_sat
- spirv: Fix array length of buffers larger than INT32_MAX.
- intel/fs: Make logical URB write instructions more like other logical instructions
- intel/fs: Make logical URB read instructions more like other logical instructions
- intel/fs: Don't pass flags to lower_urb_read_logical_send or lower_urb_write_logical_send
- intel/fs: Eliminate "masked" and "per slot offset" URB messages
- intel/fs: Use canonical form for "work around" tags
- nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident
- radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering
- glsl: Remove lower_offset_arrays pass

Icecream95 (45):

- clc: Use stringstream for printing spirv errors
- panfrost: Stop overallocating compressed textures
- panfrost: Enable NIR lowering of half float packing
- panfrost: Fix pack_32_2x16 implementation
- pan/bi: Use texture index instead of sampler for message preloading
- pan/va: Improvements to LEA_TEX instructions
- pan/va: Add more transcendental operations
- pan/va: Add SEG_ADD/SEG_SUB operation
- pan/va: Add absneg modifier to V2F32_TO_V2F16
- pan/va: Improve texture instructions
- nir/lower_tex: Copy more fields in lower_tex_to_txd and friends
- pan/mdg: Keep min_bound at 16 when alignment requires it
- pan/mdg: Use MAX2 to set min_alignment
- pan/mdg: Fix mask usage when filling before a spill
- pan/mdg: Fix disassembly of store instructions
- pan/mdg: Return the instruction from mir_insert_instruction_*_scheduled
- pan/mdg: Fix multiple spilt writes in the same bundle
- panfrost: Copy blend constant into variant even when reusing it
- panfrost: Increase the limit for blend shader variants
- panfrost: Move patched_s out of the pan_blitter_views struct
- nir: Add store_combined_output_pan BASE back
- pan/bi: Read base for combined stores
- pan/mdg: Read base for combined stores
- panfrost: Only write depth / stencil once if MRT is used
- pan/bi: Reverse linear constraint bits
- pan/bi: Add nodearray datastructure
- pan/bi: Use nodearrays for linear constraints
- panfrost: Constant stencil value tracking
- panfrost: Stop using sparse_array for batch BOs
- panfrost: Use a macro for checking for a shared bind type
- panfrost: Remove sync arguments from panfrost_batch_submit
- panfrost: Enable QUAD_STRIP and POLYGON on v6
- pan/va: Use the _safe iterator when adding blend shader calls
- pan/decode: fflush buffers after dumping and before aborts
- pan/decode: Use tag bits for resource entry count
- pan/decode: Change indent when decoding resources
- panfrost: Only emit images when they are present
- panfrost: Emit the correct number of attributes
- panfrost: Clear the GENERAL varying buffer field if unused
- panfrost: Correctly calculate prefetch suppresion varying index
- panfrost: Add a debug option for checking overflows on pool uploads
- panfrost: Don't unbind recently bound streamout targets
- panfrost: Allow NULL streamout targets
- nine: Make vdecl_index_map always signed
- nine: Only enable thread_submit by default on x86

Igor Torrente (2):

- venus: Add support to VK_KHR_maintenance4 extension
- venus: Use maintenance4 to get max_size_buffer

Ikshwaku Chauhan (1):

- Revert "radeon: add EFC support to only VCN2.0 devices"

Illia Abernikhin (1):

- vulkan/runtime: repair thread safety in method vk_queue_start_submit_thread()

Indrajit Das (5):

- radeonsi/gfx11: VRS changes
- radeonsi: save the fs constant buffer to the util blitter context
- gallium/u_blitter: clear color buffers using color from a constant buffer
- mesa/st: add nir shader to clear color buffers using constant value
- mesa/st: clear color buffers using color from a constant buffer

Italo Nicola (3):

- mesa/st: call pipe->link_shader even if ir loaded from cache when linking
- tgsi: Add SEPARABLE_PROGRAM property
- virgl: overpropagate precise flags

Iván Briano (6):

- vulkan/wsi/wayland: Fix double free on error condition
- anv: re-alloc push constants after secondary command buffers
- anv: do not get rid of empty/useless fragment shaders
- anv: assert inheritance_info is not NULL
- anv: emit scissors when the pipeline changes
- anv: pipelineStageCreationFeedbackCount is allowed to be 0

James Zhu (5):

- amd: update headers to support decode software ring
- radeonsi/gfx11: update codec support for gfx11
- radeonsi/vcn: add decode software ring support for gfx11
- radeonsi/vcn: add jpeg decode support for gfx11
- amd/common: some ASICs with gfx9 use compute rings for render

Jan Beich (1):

- vulkan/wsi: treat EBADF as missing DMA_BUF_IOCTL_{EXPORT,IMPORT}_SYNC_FILE

Jan Palus (1):

- pvr: use UINT64_C for 64-bit constant

Faith Ekstrand (312):

- clc: Only initialize LLVM once
- clc: Rework logging a bit
- clc: Declare LLVMContexts on the stack
- intel/compiler: Set lower_fisnormal
- nir,microsoft: Move scale_fdiv into a common NIR pass
- nir/opcodes: fisfinite32 should return bool32
- util: Remove util_cpu_detect
- util/blob: Add align helpers
- util/set: Respect found in search_or_add_pre_hashed
- vulkan: Add a common VkPipelineCache implementation
- vulkan/pipeline_cache: Add helpers for storing NIR in the cache
- anv: Rename a fail label in CreateDevice
- anv: Switch to the new common pipeline cache
- anv: Clean up pipeline cache helpers a bit
- nir: Lower all bit sizes of usub_borrow
- vulkan: Set signals[i].stageMask = ALL_COMMANDS for QueueSubmit2 wrapping
- vulkan: Use ALL_COMMANDS_BIT for waits/signals instead of ~0
- spirv: Handle Op*MulExtended for non-32-bit types
- 
vulkan/log: Allow but warn if called with a NULL object 2731- vulkan/log: Allow but warn for client-invisible objects 2732- nir/constant_folding: Break TXB folding into a helper function 2733- nir: Constant fold sampler/texture offsets 2734- panvk: Take buffer offsets into account in BindVertexBuffers 2735- panvk: Eliminate unused vertex attributes 2736- nir/deref: Add an alu-of-cast optimization 2737- panvk: Add address/range helpers for panvk_buffer 2738- vulkan/wsi: Close file descriptors in wsi_destroy_image 2739- vulkan/wsi: Only use a single fd per wsi_image 2740- vulkan/wsi: Hang on to file descriptors 2741- vulkan/wsi: Reset the image fence right before vkQueueSubmit 2742- vulkan/wsi: Set the right stage flags for semaphore waits 2743- vulkan/wsi: Add signal_fence/semaphore_for_image helpers 2744- nir/cf: Return a cursor from nir_cf_extract as well 2745- nir: Fix constant folding for non-32-bit ifind_msb and clz 2746- nir: Set image_buffers and msaa_images in lower_samplers_as_deref 2747- nir: Stop assuming shader_info::textures_used is 32-bit 2748- shader_info: Make images_used a bitset 2749- nir: Gather samplers_used separately from textures 2750- util/bitset: Support larger ranges in BITSET_TEST/CLEAR_RANGE 2751- shader_info: Bump the number of images and textures supported 2752- gallium/u_threaded_context: Use PIPE_MAX_SHADER_SAMPLER_VIEWS for sampler_buffers 2753- mesa/st: Clamp MaxImageUniforms to MAX_IMAGE_UNIFORMS 2754- gallium: Bump PIPE_MAX_SHADER_IMAGES to 64 2755- panvk: Call nir_lower_io_arrays_to_elements_no_indirects 2756- nir/lower_blend: Be more explicit about deref assumptions 2757- nir/lower_blend: Stop passing the whole options object around 2758- nir/lower_blend: Support SNORM and integer formats for logic ops 2759- nir/builder: Add a nir_trim_vector helper 2760- nir/builder: Add a nir_resize_vector helper 2761- midgard: Handle FB fetch from non-vec4 output variables. 
- nir/lower_blend: Expand or shrink output variables as needed
- panvk: Advertise support for logicOp
- iris: Set BindingTableEntryCount for compute shaders
- nir: Mark negative re-distribution on fadd as imprecise
- clover: Set images/samplers_used when lowering images
- lavapipe: Set shader_info::samplers_used
- gallium: Set shader_info::samplers_used in pstipple_fs
- ttn: Set shader_info::samplers_used
- mesa/st: Set samplers_used in lower_tex_src_plane
- gallium/draw: Properly handle nr_samplers != nr_sampler_views in keys
- llvmpipe: Fill out samplers even if nr_samplers = 0
- lavapipe: Set images_used in lvp_lower_pipeline_layout
- ntt: Don't gather samplers_declared twice
- panvk: Move CreateDescriptorSetLayout to per-arch
- panvk: Break descriptor lowering into its own file
- panvk: Add a buffer to each descriptor set
- panfrost,panvk: Make fixed_sysval_ubo < 0 mean compiler-assigned
- panfrost: Add some sanity checking for sysvals
- bifrost,midgard: Allow providing a fixed sysval layout
- panvk: Get rid of the per-pipeline sysvals BO
- panvk: Use a flat sysvals struct
- panvk: Stop calling lower_uniforms_to_ubo
- panvk: Put the sysval and push const UBOs at fixed indices
- panvk: Interleave UBOs with multiple descriptor sets
- panvk: Put SSBO addresses in the descriptor buffer
- vulkan: Start collecting enabled features in vk_device
- panvk: Enable robustBufferAccess
- panvk: Advertise VK_KHR_variable_pointers
- panvk: Add an elems field to panvk_buffer_view
- panvk: Pass bind layouts to texture and image descriptor helpers
- panvk: Implement texture/image queries
- FIXUP: Use 16-bit things for texture sizes
- panvk: Re-arrange descriptor set functions
- panvk: Rewrite the write portion of vkUpdateDescriptorSets
- panvk: Set immutable samplers properly up-front
- panvk: Implement descriptor copies properly
- panvk: Drop panvk_descriptor
- panvk: Round FillBuffer sizes down to a multiple of 4
- nir: Preserve metadata if remove_dead_derefs makes no progress
- nir: Add a var set version of lower_indirect_derefs
- mesa/st: Use nir_shader_instructions_pass for st_nir_lower_builtin
- mesa/st: Check deref modes in lower_builtin_instr
- mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin
- nir: Put the builder first in lower_input_attachments helpers
- nir: Use nir_shader_instructions_pass in nir_lower_input_attachments
- bifrost: Run nir_lower_global_vars_to_local before nir_lower_vars_to_scratch
- panvk: So more nir_lower_tex before descriptor lowering
- panvk: Lower blending after lower_var_copies
- v3dv: Add a create_image_view helper for internal views
- vulkan: Only be clever about vk_image_view::view_format for normal views
- v3dv: Don't use color aspects for depth/stencil images
- v3dv: Drop the region temporary from blit_shader
- vulkan,v3dv: Add a driver_internal flag to vk_image_view_init/create
- radv: Only use PLANE_0 in meta when actually needed
- radv: Use vk_image as the base for radv_image
- radv: Use vk_image_view as the base for radv_image_view
- glsl: Drop this != NULL assertions
- vulkan/pipeline_cache: Implement deserialize for raw objects
- v3dv: Loosen an assert in copy_buffer_to_image_shader
- vulkan: Fall back to raw data objects when deserializing if ops == NULL
- lavapipe: Use the correct ICD path on Win32
- vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper
- radv: Add a sqtt entrypoint for CmdBindVertexBuffers2
- radv: Use the common CmdBindVertexBuffers wrapper
- lavapipe: Use the common BindVertexBuffers wrapper
- nir/algebraic: Add two more pack/unpack rules
- intel/fs: Drop fs_visitor::emit_alpha_to_coverage_workaround()
- intel/fs: Copy color_outputs_valid into wm_prog_data
- anv: Drop alpha_to_coverage from the NULL FS optimization
- anv: Handle the null FS optimization after compiling shaders
- anv: Don't disable the fragment shader if XFB is enabled
- intel: Only set VectorMaskEnable when needed
- vulkan: Add a base struct for buffers
- anv: Use the base vk_buffer struct
- panvk: Use the vk_buffer base struct
- vulkan/runtime: Add min_lod to vk_image_view
- intel/isl: Add isl_view::min_lod_clamp for IVB+
- anv: Implement VK_EXT_image_view_min_lod
- isl: Add some asserts about multisampled surfaces
- nir: Handle register sources in lower_phis_to_regs_block
- nir: Add a correctness note for nir_lower_phis_to_regs_block
- nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info
- nir: Add a nir_xfb_info to nir_shader
- anv: Use nir_shader_gather_xfb_info
- lavapipe: Use nir_shader_gather_xfb_info
- turnip: Use nir_gather_xfb_info
- radv: Use nir_gather_xfb_info
- nir: Drop nir_shader_get_xfb_info
- glsl/nir: Populate nir_shader::xfb_info after linking varyings
- glsl/nir: Stop leaking varyings_info
- glsl/nir: Stash the xfb_info in the nir_shader when linking XFB
- st,nir: Use nir_shader::xfb_info in nir_lower_io_passes
- nir/glsl: Use rzalloc for nir_xfb_info
- nir: xfb_buffer_info::stride is in bytes
- ttn: Populate the images/textures/samplers_used fields in shader_info
- vulkan,anv: Move the image offset/extent sanitize helpers to common code
- radv: Use the common image offset/extent sanitize helpers
- vulkan,radv: Move vk_format_get_plane_format to common code
- vulkan/format: Add a vk_format_get_aspect_format helper
- vulkan: Add a helper for image<->buffer copies
- anv: Use the common image<->buffer copy helper
- vulkan,radv: Steal some image offset/extent helpers from radv
- radv: Use vk_image_buffer_copy_layout
- panvk: Use vk_image_buffer_copy_layout
- lavapipe: Use vk_image_buffer_copy_layout
- nir/deref: Break out a helper for compare_deref_paths
- nir/deref: Use an index in compare_deref_paths
- nir/deref: Make compare_deref_paths take a stop callback
- nir/deref: Re-arrange variable checks in compare_deref_paths
- nir: Increase nir_variable_data::mode to 16 bits
- nir/vars_tests: Use nir_var_mem_global instead of ssbo
- nir/deref: Handle SSBO array bindings specially
- nir/deref: Handle RESTRICT for SSBO deref bindings
- compiler/types: Don't place members in the previous substruct's end padding
- anv/wsi: Stop resetting semaphores
- vulkan/wsi: Fix a signal_semaphore_with_memory check
- wsi: Always signal semaphores and fences in wsi_common_acquire_next_image
- panvk: Use the common AcquireNextImage implementation
- v3dv: Use the common AcquireNextImage implementation
- radv: Drop create_sync_for_memory
- lavapipe: Use the common AcquireNextImage implementation
- pvr: Use the common AcquireNextImage implementation
- turnip: Use the common AcquireNextImage implementation
- turnip: Use the common QueuePresent implementation
- turnip: Set supported_sync_types before wsi_init
- vulkan/wsi: Use a STACK_ARRAY for stage_flags
- vulkan/wsi: Reset fences earlier in wsi_common_queue_present
- vulkan/wsi: Re-arrange QueueSubmit code in wsi_common_queue_present
- vulkan/wsi: Set memory ownership after signaling fences and semaphores
- vulkan/wsi: Signal semaphores and fences from the dma-buf
- intel/compiler: Use NIR_PASS(_, ...)
- anv: Use NIR_PASS(_, ...)
- anv: Properly clamp attachment layer counts
- vulkan/wsi: Use HAVE_LIBDRM to detect DRM instead of !_WIN32
- vulkan: Add some border color helpers
- panvk: Use the new border color helpers
- lavapipe: Use the new border color helper
- turnip: Use the new border color helpers
- vulkan/nir: Make spirv_data const in vk_spirv_to_nir
- vulkan: Add a vk_pipeline_shader_stage_to_nir helper
- gallium: Add a u_default_clear_buffer helper
- iris: Use u_default_clear_buffer
- panfrost: Use u_default_clear_buffer
- nir/opt_memcpy: lower copies to/from tightly packed types
- nir: Add an options parameter to deref_instr_has_complex_use
- nir/opt_memcpy: Add another case for function_temp
- vulkan/render_pass: Add a better helper for render pass inheritance
- anv: Use CmdBeginRendering for resumes in BeginCommandBuffer when possible
- vulkan/render_pass: Use a special layout for self-dependencies
- vulkan/render_pass: Pass sample locations to barriers
- vulkan/render_pass: Allow for mixed sample counts
- vulkan/render_pass: Support VkAttachmentSampleCountInfoAMD
- vulkan: Fix a comment
- vulkan: Add a base struct for descriptor set layouts
- vulkan: Add a common implementation of pipeline layouts
- lavapipe: Use the vk_descriptor_set_layout base struct
- lavapipe: Use the vk_pipeline_layout base struct
- panvk: Use the vk_descriptor_set_layout base struct
- panvk: Use the vk_pipeline_layout base struct
- vulkan: Depend on vk_pipeline_layout in vk_cmd_enqueue
- anv: Replace an assert() with unreachable()
- util: Re-indent util_sign_extend, comment, and add asserts
- util: Use shifts in util_sign_extend
- util: Use util_sign_extend in fast_idiv_by_const
- mesa/bptc: Use util_sign_extend
- util,nir: Move mask_sign_extend from opt_load_store_vectorize to util
- isl: Use util_sign_extend
- intel/decoder: Use util_mask_sign_extend()
- panfrost,asahi: Use util_sign_extend for unpacking
- nir: Use util_mask_sign_extend when serializing constants
- vulkan/wsi: Pass the size to MapMemory in the SW path
- vulkan/wsi/wayland: Use a single memcpy in the SW path
- vulkan/wsi: Move select_memory_type to common and rework it a bit
- vulkan/wsi: Add a helper for creating CPU images
- vulkan/wsi/x11: Use wsi_configure_cpu_image
- vulkan/wsi/wayland: Use wsi_configure_cpu_image
- vulkan/wsi: Delete SW support from configure_native_image
- vulkan/wsi: Persistently map CPU images
- vulkan/wsi: Compute stride and size in configure_buffer_image
- vulkan/wsi: Align buffer image strides to optimalBufferCopyRowPitchAlignment
- vulkan/wsi: Support tiled CPU images
- vulkan/wsi/x11: Don't leak shm_reply if we don't have dri3 or present
- vulkan/wsi/x11: Only use MIT_SHM if the device supports EXT_external_memory_host
- vulkan/wsi/wayland: Use host pointer import when available
- dozen: Increase optimalBufferCopy*Alignment
- dozen: Stop asking for semaphore/fence signaling
- vulkan/wsi/wayland: Only memcpy if the swapchain is actually software
- vulkan/wsi: Fix structure chaining in wsi_create_buffer_image_mem
- shader_info: Move subgroup_size out of cs and make it an enum
- vulkan: Re-order pipeline hashing
- vulkan: Constify vk_spirv_version
- spirv,vulkan: Set shader_info::subgroup_size
- mesa,glsl,ttn: Set subgroup_size to UNIFORM
- anv: Drop unnecessary parameters to anv_pipeline_compile_cs
- anv: Put a VkPipelineShaderStageCreateInfo* in anv_pipeline_stage
- anv: Use vk_pipeline_shader_stage_to_nir
- vulkan: Add a vk_pipeline_shader_stage_is_null() helper
- anv: Use vk_pipeline_shader_stage_is_null()
- intel,anv,iris,crocus: Drop subgroup size from the shader key
- anv: Implement VK_EXT_shader_module_identifier
- anv: Init supported extensions before WSI
- vulkan/wsi/wayland: Add a BUFFER_TYPE flag
- vulkan/wsi: Add debug variables to force the SW and PRIME buffer blit paths
- intel/blorp: Set uses_sample_shading for MSAA blit shaders
- radv: Set uses_sample_shading for copy shaders
- intel/fs: Use shader_info::fs::uses_sample_shading
- nir: Add a pass for lowering shaders to single-sampled
- intel/fs: Use nir_lower_single_sampled
- glsl/nir: Set uses_sample_shading from glsl_to_nir
- spirv/nir: Set uses_sample_shading from spirv_to_nir
- nir/gather_info: Stop gathering uses_sample_shading
- glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue
- intel/fs: Simplify persample_dispatch
- intel/fs,anv: Rework handling of coarse and sample shading
- panfrost: Simplify sample_shading
- anv: Stop compacting surface state tables
- vulkan: Detect pNext chain loops in vk_foreach_struct()
- anv: Don't use the wrong ARRAY_SIZE
- Revert "vulkan: Detect pNext chain loops in vk_foreach_struct()"
- vulkan: Add a vk_limits.h file for runtime limits
- vulkan: Add an enum for all dynamic graphics states
- vulkan: Add standard sample locations
- vulkan: Add data structures to store all graphics state
- util: Constify __bitset_test_range
- vulkan: Add a common data structure for dynamic states
- vulkan: Add a common implementation of vkCmdSet*
- vulkan: Copy the depth/stencil state optimization code from ANV
- Use vk_foreach_struct_const where needed
- vulkan: Detect pNext chain loops in vk_foreach_struct() (v2)
- anv: Add an instance multiplier to anv_pipeline
- anv: Handle vertex buffer sizes in anv_CmdBindVertexBuffers2
- anv: Stop recording sample locations per-sample-count
- anv: Convert to using vk_graphics_pipeline_state
- anv: Rework setting primitive topology
- anv: Stop merging DEPTH_STENCIL state
- anv: Switch to using common dynamic state tracking
- vulkan/graphics_state: Improve the depth/stencil optimization code
- anv: Do depth/stencil optimization for dynamic depth/stencil
- vulkan: Call lower_clip_cull_distance_arrays in vk_spirv_to_nir
- vulkan/nir: Don't remove dead XFB outputs
- vulkan: Call gather_xfb_info in vk_spirv_to_nir
- lavapipe: Use more Vulkan NIR helpers
- vulkan: Allow up to 12 pointers in multialloc
- vulkan: Record shader stages in vk_graphics_pipeline_state
- vulkan: Multisample state isn't always included in fragment shader state
- vulkan: Depth/stencil isn't fragment output state but multisample is
- vulkan: Add a fully_dynamic_state_groups() helper
- vulkan: Fix pipeline libraries with dynamic-only VI or FSR state
- vulkan: Input assembly and depth/stencil can also be fully dynamic
- vulkan: Don't assert VkGraphicsPipelineLibraryCreateInfoEXT::Flags == 0
- zink: Mark depth/stencil visual tests as flakes
- lavapipe: Use shader_info::uses_sample_shading
- lavapipe: Use the vk_graphics_pipeline_state
- vulkan: Append subpass structures to VkRenderingInfo last
- vulkan: Include self-dep info in rendering continues
- intel/eu: Don't throw validation errors on float MOV_INDIRECT
- lima: Lower undefs to zero and run DCE after from_ssa
- nir/from_ssa: Ignore undef sources
- vulkan: Always populate vk_render_pass_state::render_pass
- vulkan: Improve the docs for vk_subpass_info
- radv: Use both aspects for depth/stencil blit destinations
- vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors

Jason Volk (4):

- r600: Fix userspace pointer support for evergreen compute.
- r600: Improve compute memory pool performance; reduce fragmentation.
- r600: Elide downloads for discarded and immutable compute memories.
- radeon: Support shared memory user pointers.

Jeffrey Knockel (1):

- egl: dereference XCB drawable pointers

Jesse Natalie (158):

- d3d12: Don't block DISPLAY based on format for non-Windows
- wgl: Implement WGL_ARB_create_context_robustness
- d3d12: Hook up robustness extensions
- d3d12: Plug context leaks
- d3d12: Plug screen leaks
- d3d12: Attempt screen reset during context create
- d3d12: Add a unit test for context reset recovery
- microsoft/compiler: Unload DXIL validator library \*after* calling Release()
- d3d12: Correctly key off of polygon stipple enable cap
- mesa: Ensure ARB programs end in a newline
- bin: Add a script for filtering/sorting refcount logs
- u_debug_refcnt: Don't loop for initial refcounts if the initial value is huge
- u_debug_symbol: Use correct sizeof for DbgHelp
- u_debug_symbol: Allow re-init of DbgHelp to work around already-initialized issues
- u_debug_stack: Lock around stack dumps to prevent interleaving
- u_primconvert: Refactor to remove recursion from util_primconvert_draw_vbo
- u_primconvert: Handle take_index_buffer_ownership
- d3d12: Use a pipe_reference in d3d12_bo
- d3d12: Fix leaks in map with do-not-wait
- d3d12: Fix a couple over-releases from incorrect take-ownership flag
- d3d12: Destroy empty residency bo set during early-return
- d3d12: Include windows.h on Windows before dxgicommon.h
- d3d12: Don't use VLAs
- d3d12: MSVC warning around operator precedence causing uint32_t==bool
- dzn: Add new parameter to vk_image_view_init call
- d3d12: Fix forward decl for sw_winsys
- microsoft/compiler: Fixup sampler derefs in tex instrs that don't \*need* samplers
- nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*)
- d3d12: Re-add missed bitmasks to shader key comparison
- CI/d3d12: Add skips and update baselines
- CI/windows: Install Vulkan SDK from LunarG directly instead of Chocolatey
- CI/windows: Delete comment for installing vulkan-runtime
- CI/windows: Disable chocolatey progress printing to shrink log size
- CI/windows: Disable LLVM CMake developer warnings to shrink log size
- CI/windows: Move --fraction for deqp-runner to run command line
- dzn: Update fails/flakes
- CI: Re-enable Windows builds
- nir: Consider PNTC to be a varying
- wgl, d3d12: Handle front buffer writes for double-buffered MSAA surfaces backed by swapchains
- meson: Use a C99 STDC_VERSION for flex
- d3d12: Report PIPE_CAP_ACCELERATED as 0 on WARP
- d3d12: Properly set HS input control point count
- dzn: Init sync types before wsi
- mesa: Use C11 alignas for cross-platform SSE support
- CI: Lima farm is offline
- dzn: Skip another test that times out occasionally
- util/disk_cache: Implement disk_cache_get_function_identifier for Windows
- microsoft/clc: Enable tests that pass on server 2022
- microsoft/clc: Fix test double free in the case of compilation failure
- microsoft/clc: Remove dead image vars
- microsoft/clc: Add a unit test for unused image kernel args
- vulkan_entrypoints_gen: For Windows ARM64EC, prefix symbols with '#'
- dzn: Support Vulkan loader interface v5
- gallium/util: Move u_dl and u_pointer to src/util
- dzn: Annotate sometimes-unused variables.
- dzn: Define COBJMACROS before any includes
- dzn: Use os_time_sleep instead of Win32 Sleep
- dzn: Use u_dl to load D3D12
- dzn: Add an eventfd sync implementation
- dzn: Fix maybe-uninitialized warning
- dzn: Skip dxil validator and some Windows-only debug flags for Linux
- dzn: Use IUnknown instead of IDXGIAdapter1 as the stored adapter
- dzn: Don't hash adapter LUID as part of device UUID
- dzn: Use a custom adapter desc instead of DXGI adapter desc
- dzn: Move DXGI code to a separate file and only build it on Windows
- dzn: Add a DXCore enumeration path
- dzn: Add ABI helpers for D3D12 functions returning structs
- vulkan/wsi/win32: Use the new helpers and persistent map
- dzn: Fix winsys reporting
- dzn: Only support high/normal queue priorities
- dzn: Add for condition to break nested loop
- dzn: Re-order includes in dzn_nir to prevent TRUE redefinition
- dzn: Only add Werror flags that the compiler supports
- dzn: Disable Wunused-value for C++ files
- dzn: Fix comparison order for format enums
- dzn: Re-order dzn_physical_device_desc so = { 0 } doesn't initialize a struct
- dzn: Use fabs on floating point input
- CI: Add dzn to debian vulkan/clang/release builds
- mesa: Support D3D11/D3D12 memory imports
- mapi: Add more EXT_external_objects_win32 functions/enums
- gallium: Add a 'name' field to winsys_handle
- mesa: Implement ImportMemoryWin32NameEXT
- gallium: Add 'name' field to Win32 semaphore import
- driver_noop: Remove infinite recursion from create_fence_win32
- gallium: Add a new fence type with a pipe cap to indicate it can be imported
- mesa: Support importing D3D12 fences as timeline semaphores
- mesa: Implement ImportSemaphoreWin32NameEXT
- gallium, mesa: Support setting timeline semaphore values
- d3d12: Store the rest of the device IDs in the screen
- d3d12: Compute UUIDs required by external objects extension
- d3d12: Add pipe getters for Win32 and base external objects device matching
- d3d12: Support B4G4R4A4 format
- d3d12: Get adapter LUID after device creation
- d3d12: PIPE_BIND_SHARED doesn't mean linear and is always on opened resources
- d3d12: Support creating memory objects
- d3d12: Implement resource_from_memobj
- d3d12: Support opening resources and memobj by name
- d3d12: Support importing fences / timeline semaphores
- d3d12: Implement server signal/wait
- d3d12: Implement fence opening and value setting
- d3d12: Fix up resource import validation
- docs: Update features.txt and new_features.txt
- u_atomic: Implement p_atomic_xchg for Windows
- u_atomic: Fix MSVC p_atomic_add_return
- u_atomic: Add p_atomic_fetch_add which returns the old value
- zink: Use p_atomic_fetch_add
- simple_mtx: Replace GCC sync intrinsics with u_atomic ops
- Fix static glapi on Windows
- dzn: Missed ABI fixes for GetCustomHeapProperties
- dzn: Remove the cast when the SDK version is high enough
- d3d12: Add helpers to build with correct ABI for MinGW
- microsoft/clc: Add helpers to build with correct ABI for MinGW
- subprojects: Point DirectX-Headers at a specific tag
- CI: Update DirectX-Headers dependency for MinGW/Debian
- util: Add a Win32 futex impl
- d3d12: Support clip halfz
- d3d12: Add a blank d3d12_resource_state.h/cpp
- d3d12: Move desired resource state to new files
- d3d12: Move current resource state to new files
- d3d12: Give bos a unique identifier to be used for state tracking
- d3d12: Add a list of contexts alive for the current screen
- d3d12: Hold lock when removing resources from residency list
- d3d12: Notify contexts about deletion of bos
- d3d12: Add a context state tracking structure
- d3d12: Treat depth/stencil as planar for plane count helper
- d3d12: Create/free context state entries
- d3d12: Track a global resource state for non-simultaneous-access resources
- d3d12: Record a state fixup command list when necessary
- d3d12: Swap the remainder of state tracking to new method
- d3d12: Rename bind invalidate options to transition flags
- d3d12: Extract core barrier logic
- d3d12: Optimize transition_subresource_states that covers a whole resource
- d3d12: Add a transition flag indicating that state accumulation is needed
- d3d12: Remove state tracking implementation details from header
- nir_lower_io_to_scalar: Support arrayed (per-vertex) I/O
- microsoft/compiler: Add a max shader model option
- microsoft/compiler: Add a max validator version
- microsoft/compiler: Emit metadata based on long-lived signature data
- microsoft/compiler: Split signature processing into two parts
- microsoft/compiler: Ensure 4-component position writes via NIR
- microsoft/compiler: Lower I/O to scalar
- microsoft/compiler: Support DXIL validator version 1.5
- microsoft/compiler: Compute correct usage masks
- microsoft/compiler: Correctly compute dynamic indexing I/O masks
- microsoft/compiler: DXIL validator 1.6 counts UAVs differently
- microsoft/compiler: Set the barycentrics flag for attribute_at_vertex
- microsoft/compiler: DXIL validator 1.6 uses a new PSV struct version
- microsoft/compiler: DXIL validator 1.6 uses a new PSV resource struct
- microsoft/compiler: Support DXIL validator 1.6
- microsoft/compiler: Support DXIL validator 1.7
- microsoft/compiler: Add a getter for a validator version from DXIL.dll
- microsoft/compiler: Blacklist DXIL validator 1.6 from 20348 SDK
- d3d12: Retrieve validator version
- dzn: Retrieve validator version
- egl/wgl: Delete unused variables/code
- egl/wgl: Fix some awkward sizeof formatting
- microsoft/compiler: Discard shouldn't be marked readnone
- microsoft/compiler: Fix PSV struct when numthreads is 0

Jiang Feng (1):

- virgl: Set res->maybe_busy to true when creating resources

Jonathan Gray (2):

- intel/dev: sync ADL-S pci ids with linux
- intel/dev: add RPL-S pci ids from drm-intel-next

Jonathan Marek (2):

- freedreno/registers: add a7xx registers for drm/msm kernel driver
- freedreno/registers: add missing varset="chip" for new enum values

Jonathan Weinstein (1):

- lavapipe: support instance rate zero for VK_EXT_vertex_attribute_divisor

Jordan Justen (48):

- intel/dev: Add device info for RPL-P
- vulkan/wsi/x11: Use atomic_int with sent_image_count
- drm-uapi/i915_drm.h: Update from drm-next (2022-04-28)
- intel/dev: Read hwconfig from i915
- intel_dev_info: Add --hwconfig command line parameter
- iris: Fix assertion meant to only target the clear-color stride
- nir/divergence: handle more \*_intel intrinsics
- intel/dev: Add INTEL_PLATFORM_DG2_G12
- intel/dev: Add DG2 G12 PCI IDs
- intel/dev: Enable first set of DG2 PCI IDs
- intel: Fix build of mi_builder_tests by including c99_compat.h
- intel: Build mi_builder_test whenever build-tests is set
- drm-uapi/i915_drm.h: Update from drm-next (2022-05-06)
- intel/gem: Add support for I915_ENGINE_CLASS_COMPUTE
- anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state()
- anv: Move STATE_BASE_ADDRESS programming into init_common_queue_state()
- anv, iris: Add support for I915_ENGINE_CLASS_COMPUTE
- anv, iris: Enable compute engine with INTEL_COMPUTE_CLASS=1
- intel/dev: Add devinfo::mem to store i915 regions information
- anv: Use memory info from devinfo
- iris/bufmgr: Use memory info from devinfo
- intel/tools: Print memory info in intel_dev_info
- vulkan/wsi: Disable dma-buf sync file if ENOSYS is returned
- intel/compiler: Don't create vec4 reg-set for gen8+
- intel/dev: Add vram.unmappable.size region info
- intel/tools: Print unmappable region info in intel_dev_info
- intel/dev: Add intel_vram_all_mappable()
- iris/resource: Assert that DG2 CCS buffers don't also try to set BO_ALLOC_SMEM
- iris/resource: Avoid mapping when not needed in iris_resource_init_aux_buf()
- iris/bufmgr: Add all_vram_mappable which is currently always true
- iris/bufmgr: Set mmap_mode to IRIS_MMAP_NONE for lmem in small-BAR mode
- iris: Set clear_color_unknown if the bo is not mappable
- anv/allocator: Add assert and TODO comment for future small BAR uapi
- iris/bufmgr: Add assert and TODO comment for future small BAR uapi
- drm-uapi/i915_drm.h: Update from drm-next (2022-07-22)
- iris/bufmgr: Add I915_GEM_CREATE_EXT_FLAG_NEEDS_CPU_ACCESS for vram mappable buffers
- intel/dev: Use i915 region probed_cpu_visible_size when non-zero
- intel/dev: Determine the amount of free vram using small BAR uapi
- intel/dev: Enable remaining DG2 and ATS-M device IDs
- anv: Fix PHYSICAL_DEVICE_MEMORY_BUDGET_PROPERTIES with large BAR
- intel/dev: Fill in system memory info when using INTEL_DEVID_OVERRIDE
- iris: Drop extra file-descriptor dup in iris_drm_screen_create()
- intel/pci_ids: Update ADL-S strings
- intel/pci_ids: Add 0x468b ADL-S PCI-id
- intel/pci_ids: Update ATS-M device names
- intel/pci_ids: Update DG2 device names
- intel/pci_ids: Add dg2 0x5698 pci-id
- intel/pci_ids: Drop non-upstream dg2 pci-ids

Jose Maria Casanova Crespo (1):

- v3dv/ci: increase fraction to 10 on v3dv ci jobs.

Josh Billingsley (1):

- driconf: add SD Gundam G Generation Cross Rays

Joshua Ashton (1):

- aco: Use movk for AddressHi bits in vertex prolog

José Fonseca (6):

- trace: Fix framebuffer state serialization.
- lavapipe: Prevent mapping buffers beyond their size.
- trace: Dump NIR.
- trace: Parse character data more efficiently.
- trace: Bring state dump up to speed.
- trace: Allow to control nir dumping via an environment variable.

Juan A. Suarez Romero (37):

- ci/v3dv: remove fixed test
- v3d: report the correct unsupported blit format
- v3d: do not tile 1D textures
- v3d: use surface format defined on pipe_blit
- gallium: add hook on getting canonical format
- v3d: define our own canonical supported formats
- v3d: enable GL_ARB_copy_image extension
- gallium: move get_canonical_format hook to pipe_screen
- v3dv: duplicate key on hashtable insert
- v3dv: store device_id on device init
- v3d/simulator: add support for AMD cards
- mesa: unref syncobj after wait_sync
- v3d: enable early Z/S clears
- v3d: disable early-Z on odd frame dimensions
- v3d: allow TFU blitting for single layer textures
- v3d: add hook to get on-disk shader cache
- v3d: add builtin support for white/black clamp-to-border
- v3dv: check BO mapping result
- docs: update extensions for V3D and VC4
- ci: bump VK-GL-CTS to 1.3.2.0
- v3d: fix blending for mixed RT formats
- v3d: expose GL_ARB_draw_buffers_blend
- v3d/ci: Add traces
- v3d: use function to initialize refcount
- v3d: save only required states in blitter
- v3d: release all color buffers on context destroy
- v3d: add ARB_polygon_offset_clamp extension support
- v3dv/ci: Update expected results
- v3d/ci: Add flake test
- v3d/v3dv/ci: update expected results
- v3d/ci: (partially) revert expected list
- v3d/ci: update expected list
- v3dv/ci: Update expected list
- v3d/vc4/ci: Fix typo in skipped list
- v3d/ci: enable ARB_texture_cube_map piglit tests
- vc4: properly restore vc4 debug option
- vc4: store tex sampler in proper register

Kai Wasserbäch (1):

- fix(gallivm): Replace LLVMConstF* with LLVMBuild* methods.

Karmjit Mahil (26):

- pvr: Fix clang-format errors caused by vk outarrays.
- pvr: Add PVR_WORKGROUP_DIMENSIONS define.
- pvr: Add initial implementation of vkCmdDispatch().
- pvr: Add compute update shared kernel in vkCmdDispatch().
- pvr: Add compute update kernel in vkCmdDispatch().
- pvr: Update pvrsrvkm to fw 1.17 .
- pvr: Add csb enum helper funciton header.
- pvr: Write immutable descriptor words in vkAllocateDescriptorSets().
- pvr: Fix incorrect samples to cr_isp_aa_mode conversion.
- pvr: Change compute hard coding infrastructure.
- pvr: Remove unused output_selects from vertex shader state.
- pvr: Add graphics pipeline hard coding infrastructure.
- pvr: Enable 33.15.11.3 (AXE-1-16M).
- pvr: Extend hard coding infra to allow per stage per device coding.
- pvr: Fix physical device limits.
- pvr: Handle vdm degen_cull_enable.
- pvr: Fix off by 1 error in buffer_id for ubo pds program.
- pvr: Rename loop iterator variable.
- pvr: Update pvrsrv build version for fixed size fw.
- pvr: Rename uniform pds program to descriptor pds program.
- pvr: Add static consts support in descriptor pds program.
- pvr: Add static consts to hard coding infra.
- pvr: Add initial vkCmdPipelineBarrier skeleton.
- pvr: Add IDF/WDF program for compute pipeline barrier.
- pvr: Add compute pipeline barrier handling.
- pvr: Fix cdm shared reg usage reported to fw.

Karol Herbst (20):

- nir: add 16 and 64 bit fisnormal lowering
- vtn: clamp SpvOpImageQuerySize dest to 32 bit
- nv50/nir: align tlsspace to 0x10
- llvmpipe: PIPE_COMPUTE_CAP_GRID_DIMENSION is uint64_t
- llvmpipe/fence: make the fence id counter atomic
- iris/cs: take buffer offsets into account for CL
- nir: add a nir_remove_non_entrypoints helper
- ntt: make use of new samplers_used field
- clc: parse localSize and localSizeHint
- clc: drop parsingComplete check
- clc: speed up compilation by not relying on opencl-c.h
- clc: fix compiler features_macro CTS Test
- clc: disable opaque pointers until they are supported
- nv50/ir: recoginse Ampere
- nir/gather_info: allow to run it before inlining
- nir/gce: pin call instructions
- nouveau: use the contexts pushbuf and client where possible
- nouveau/mm: make code thread safe
- nv50/ir: fix OP_UNION resolving when used for vector values
- nv50: properly flush the TSC cache on 3D

Ken Johnson (2):

- d3d12: Reduce cost of shader key comparisons
- mesa/main: Correct alloca() use without #include "c99_alloca.h"

Kenneth Graunke (41):

- anv: Fix INTEL_DEBUG=bat on XeHP
- intel/decoder: Fix binding table pointer decoding with large offsets
- iris: Add FLUSH_HDC to PIPE_CONTROL_CACHE_FLUSH_BITS
- Revert "st/mesa: Transcode ASTC to BC7 (BPTC) where possible"
- intel: Drop Wa_1409226450 (stall before instruction cache invalidation)
- intel: Fix analysis invalidation in eliminate_find_live_channel
- intel/compiler: Move spill/fill tracking to the register allocator
- mesa: Split tx_compress_dxtn into per-format functions
- mesa: Avoid temp images in _mesa_texstore_rgb_dxt1 for GL_RGBA source
- mesa: Require temp images in _mesa_texstore_*_dxt* with SkipImages
- mesa: Avoid temp images in _mesa_texstore_*_dxt* for stride = 0
- isl: Add an isl_format_supports_typed_atomics() helper.
- intel/compiler: Fix uncompaction of signed word immediates on Tigerlake
- intel: Set a more useful fake devinfo->gtt_size in no-hw mode
- intel/compiler: Fix A64 header construction with a uniform address
- iris: Update comment about 2GB dynamic state range
- iris: Use PIPE_* defines rather than ones from main/config.h
- crocus: Use PIPE_* defines rather than ones from main/config.h
- intel/compiler: Stop including src/mesa/main/config.h
- intel/compiler: Fix brw_gfx_ver_enum.h to be a proper header file
- intel/compiler: Split 3DPRIM_* defines out to a separate header.
- intel/tools: Stop malloc'ing device info in i965_disasm
- intel/compiler: Move opcode_desc handling to a separate header
- intel/compiler: Introduce a new brw_isa_info structure
- intel/compiler: Remove use of thread_local for opcode tables
- intel/compiler: Convert brw_eu.cpp back to brw_eu.c
- intel/compiler: Handle split-sends in EOT high-register pinning case
- intel/fs: Opportunistically split SEND message payloads
- intel/compiler: Avoid copy propagating large registers into EOT messages
- intel/compiler: Don't set SBID on EOT send messages
- mesa: Mark render-to-texture as unsafe if there's no pipe_resource
- intel/eu: Handle compaction when inserting validation errors
- intel/eu: Fix XeHP register region validation for hstride == 0
- intel/eu: Clarify spec citations for XeHP region restrictions
- intel/eu: Mark header present in URB memory fences on XeHP
- intel/compiler: Lower FIND_[LAST\_]LIVE_CHANNEL in IR on Gfx8+
- intel/compiler: Delete unused Gfx8+ code in brw_find_live_channel()
- iris: Fix PIPE_CAP_UMA
- iris: Use linear for exported resources if we can't convey tiling
- iris: Delete unused iris_screen::aperture_bytes field
- crocus: Fix memory leaks on iris_resource_create failure paths

Konrad Dybcio (1):

- freedreno: Enable A619

Konstantin Seurer (106):

- radv: Fix ray queries with !15854
- radv: Do not discard hits with t=tmax
- radv: Enable rt primitive culling for spirv2nir
- ci: Always install glslangValidator for build jobs
- radv: Use VAs in radv_meta_buffer.c
- radv: Make fill_buffer_shader non-static
- radv: Format radv_acceleration_structure.c
- radv: Add the fuchsia radix sort
- radv: Build accaleration structures using LBVH
- radv: Clean up the accel-struct build shaders
- radv: Fix lowering ignore_ray_intersection
- vulkan: Add a shader module clone helper
- radv: Copy shader modules to avoid use after free
- radv/radix_sort: Add missing entry points
- radv/radix_sort: Make variable names consistent
- radv: Use the entire morton code as sort key
- ac/nir: Do not set cursor in try_extract_additions
- radv: Move RADV_RT_STAGE_BITS to radv_private.h
- radv: Treat rt stages like compute stages
- radv: Use RADV_RT_STAGE_BITS more often
- nir: Add a ray launch size addr intrinsic
- radv: Allow indirect dispatches without bo
- radv: Indirect unaligned dispatches
- radv,aco: Use ray_launch_size_addr
- radv: Implement vkCmdTraceRaysIndirectKHR
- radv: Advertise indirect ray tracing
- radv: Allow radv_fill_buffer to work with VAs only
- radv: Use radv_fill_buffer for accel struct builds
- Revert "radv: Make fill_buffer_shader non-static"
- radv: Add and use radv_bvh_node_type
- radv: Replace nir_vector_extract with nir_channel
- radv/ci: Skip slow/crashing ray tracing tests
- radv: Enable ray queries by default
- radv: Add fail/flake lists for navi22
- radv: Fix handling of primitiveOffset
- radv: Ignore transformOffset if transformData is 0
- vulkan: Use the correct enum in vk_buffer
- radv: Use vk_buffer
- radv: Add and use radv_cp_dma_wait_for_stages
- radv: Some acceleration structure cleanups
- radv: Require an alignment of 64 for accel structs
- radv: Fix the accel struct size calculation
- vulkan: Spec update to 1.3.216
- spirv: Add plumbing for ray_cull_mask
- radv: ray cull mask
- radv: Add accel struct queries for maintenance1
- radv: ACCESS_2_SHADER_BINDING_TABLE_READ_BIT_KHR
- radv: PIPELINE_STAGE_2_ACCELERATION_STRUCTURE_COPY
- nir: Handle ray_launch_size_addr in opt_preamble
- radv: vkCmdTraceRaysIndirect2KHR
- radv: Advertise VK_KHR_ray_tracing_maintenance1
- nir,spirv: Preserve inbounds access information
- nir/nir_lower_io: Optimize 32-bit inbounds access
- radv: Move some rt intrinsics to the top
- radv: Introduce an enum for the any hit status
- radv: Fix the t comparison for triangles
- radv: Refactor lower_rt_derefs
- nir: Add a nir_test_mask helper
- radv: Use nir_test_mask instead of i2b(iand)
- nir: Use nir_test_mask instead of i2b(iand)
- d3d12: Use nir_test_mask instead of i2b(iand)
- intel: Use nir_test_mask instead of i2b(iand)
- microsoft: Use nir_test_mask instead of i2b(iand)
- dozen: Use nir_test_mask instead of i2b(iand)
- radv: Use two bools for ahit_status
- radv: Skip pipeline_no_null_shaders_flag tests
- radv: Fix acceleration structure size queries
- radv: Use a global address for ray_launch_size
- radv: Use a global address for sbt_base
- radv: Only set rt stack size for dynamic stacks
- vulkan: Common vk_format_get_component_bits
- nir: Add a common gen_rect_vertices implementation
- radv: Use nir_gen_rect_vertices
- v3dv: Use nir_gen_rect_vertices
- radeonsi: Set vertex_id_zero_based
- aco: Remove dead nir_intrinsic_load_vertex_id case
- ac/llvm: Remove load_vertex_id handling
- util/format: Use an explicit length for the descs
- util/format: Assert that formats are valid
- radv: Remove format desc null checks
- radv: Use desc->format
- v3dv: Remove format desc null asserts
- turnip: Remove format desc null assert
- gallivm: Remove format desc null checks
- tgsi: Remove format desc null check
- util: Remove format desc null check
- agx: Remove format desc null check
- etnaviv: Remove format desc null check
- llvmpipe: Remove format desc null checks
- panfrost: Remove format desc null check
- r300: Remove format desc null check
- r600: Remove format desc null checks
- radeonsi: Remove format desc null checks
- softpipe: Remove format desc null check
- virgl: Remove format desc null checks
- pvr: Remove format desc null check
- util/format: Remove format desc null checks
- gallium/tests: Remove format desc null checks
- radv: Add meta saving/restoring for predicating
- radv: Use RADV_META_SUSPEND_PREDICATING
- radv: Simplify the meta init fail path
- radv: Fix out of bounds access in RTPSO creation
- vulkan: Fix descriptor set layout allocation scope
- vulkan: Handle descriptor set layout alloc fails
- radv: Use vk_descriptor_set_layout
- radv: Fix stack size calculation with stage ids

Kristian Høgsberg (1):

- util/u_debug: Use 'initialized' instead of 'first'

Kyle Russell (1):

- etnaviv: don't halve max_instructions for GC400

Leandro Ribeiro (1):

- Revert "egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities()"

Lionel Landwerlin (120):

- anv: use shadow surface for stencil input attachment on gfx7
- intel: fix URB programming for GT1s
- intel/fs: require memory fence commit bit on Gfx9
- anv: improve INTEL_DEBUG for submit
- anv/intel: add a new debug flag for stalling after every draw/dispatch
- anv: implement DEBUG_SYNC
- anv: allow getting the address of the beginning of the batch
- intel/fs: tidy up lower of ray queries
- ci/iris: mark another test as flaky
- anv: implement VK_EXT_primitives_generated_query
- nir/divergence: handle load_global_block_intel
- anv: fix INTEL_DEBUG=sync
- intel: fixup number of threads per EU on XeHP
- anv: remove unused enum
- anv: fix acceleration structure descriptor template writes
- anv: skip acceleration structure in binding table emission
- anv: reemit 3DSTATE_STREAMOUT after memcpy
- anv: fix clflush usage on utrace copy batch
- anv: limit clflush usage
- vulkan/runtime: 0-out pipeline cache object
- iris: use new kernel uAPI to compute video memory
- anv: fix invalid utrace memcpy l3 config on gfx < 11
- anv: reset all dynamic state after secondary execution
- anv: add missing logic op set in pipeline dyn state
- Revert "anv: fix dynamic state emission"
- anv: rework sample location
- anv: never emit 3DSTATE_CPS in the pipeline batch
- anv: don't emit 3DSTATE_STREAMOUT in pipeline batch
- anv: don't emit 3DSTATE_WM in pipeline batch
- anv: don't emit 3DSTATE_BLEND_STATE_POINTERS in pipeline batch
- anv: don't emit 3DSTATE_DEPTH_BOUNDS in pipeline batch
- anv: don't emit 3DSTATE_VF_TOPOLOGY in pipeline batch
- anv: remove static_state_mask
- docs: explain state emission in Anv
- intel: fix stall debug option
- nir/cf: return cursor after insertion of cf_list
- nir/lower_shader_calls: don't use nop instructions as cursors
- nir/lower_shader_calls: don't insert code after break/continue
- nir/lower_shader_calls: put inserted instructions into a dummy block
- anv: fix primitives generated queries values
- intel/perf: add support for new opcodes in code generation
- intel/perf: store a copy of devinfo
- intel/perf: add support for dualsubslice count variable
- intel/perf: add support for GtSlice/GtSliceXDualsubsliceY variables
- intel/perf: stop overriding oa_format
- intel/perf: add OA A counter type
- intel/perf: add support new variable counting the number of EUs in slice0-3
- intel/perf: add max vfuncs
- intel/perf: add new layout for Gfx12.5 products
- intel/perf: disable sseu setting on Gfx12.5+
- intel/perf: deal with OA reports timestamp values on DG2
- intel/ds: fix compilation
- u_trace/anv/iris: drop cs argument for recording traces
- intel/disasm: add missing handling of <1;1,0>
- util/debug: add an enable parsing helper
- util/perf: add options to enable/disable tracepoints
- anv: move internal RT shaders to the internal cache
- nir/divergence: handle load_scratch_base_ptr
- nir/divergence: handle load_ray_num_dss_rt_stacks_intel
- anv: don't expose EXT_border_color_swizzle on gfx7
- intel/validator: validate dst/src types against devinfo support
- anv: fix index buffer emission
- vulkan/wsi: fix crash with debug names on swapchain
- intel/dev: add a helper to update memory info
- intel/dev: add a fallback when memory regions are not available
- anv: limit RT writes to number of color outputs
- anv: disable perf queries on non RCS engines
- anv: make apply_pipeline_layout/compute_push_layout visible to NIR debug
- anv: silence border color swizzle debug message
- clc: add new feature options for intel_clc
- intel/clc: enable fp16 & subgroups for GRL
- anv: disable non uniform indexing of UBOs
- anv: remove device features handling now in runtime
- anv: use the right helper to invalidate memory
- intel/fs: make sure memory writes have landed for thread dispatch
- intel/nir: temporarly disable opt_uniform_atomics for RT/CL
- anv/utrace: use a bo pool for utrace buffers
- intel/fs: ray query fix for global address
- nir/serialize: restore ray query variables
- isl: add new helper for format component compatibility
- anv: track if images can be fast clear with non-zero color
- anv: move gfx12+ CCS_E clear color restriction
- anv: deal with isl format swizzles for buffer views
- anv: validate image view lowered storage formats for storage
- anv: add a comment about handling buffer view swizzles on gfx7
- intel: protect against empty invalidate ranges
- anv: skip flush/invalidate faster
- spirv: switch to uint64 for rayquery internal type
- anv: ensure tile flush before streamout writes
- intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it.
- anv: remove local computation of dynamic states
- anv: simplify dynamic buffer count in pipeline layout
- anv: break up anv_pipeline_compile_graphics()
- anv: split graphics nir loading
- anv: rename internal function for consistency
- anv: move CreateGraphicsPipelines to common code
- anv: move CreateComputePipelines to common code
- anv: move CreateRayTracingPipelines to common code
- anv: reorder & document fields of anv_graphics_pipeline
- anv: name non dynamic state fields correctly
- anv: fix primitive topology dynamic state emission on gfx7
- anv: move restart index to gfx state
- anv: remove pipeline setting dynamic state
- intel/dev: deal with i915 unallocated_size on smem
- anv: allocate RT scratch in local memory
- anv: make use of the new smallbar uAPI
- anv: decode init batch with INTEL_DEBUG=bat
- intel/dev: move verification function to a header
- intel/tools/drm-shim: fixup eu_stride for topology
- intel/tools/stub: fixup parsing of --platform=
- intel/dev: add a test verifying that device override works
- intel/dev: remove INTEL_DEVID_OVERRIDE
- anv: fixup PIPE_CONTROL restriction on gfx8
- anv: don't return incorrect error code for vkCreateDescriptorPool
- anv: fix GetPipelineExecutableStatistics for ray tracing pipelines
- anv: fixup assertions on lowered storage formats
- anv: fix assert in memory budget code when extension is not supported
- intel/fs: fix load_scratch intrinsic
- intel/fs: fixup scratch load/store handling on Gfx12.5+
- intel/fs: fixup SEND validation check on overlapping src0/src1

Liviu Prodea (7):

- dzn: Fix incompatible pointer type error affecting MSYS2 MINGW32
- ci/MSVC: LLD is not really used anywhere
- ci/MinGW: LLD is not really used anywhere
- d3d12/dzn/spirv2dxil: Require version library
- Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it
- meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover
- meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly

Louis-Francis Ratté-Boulianne (4):

- d3d12: Move d3d12_nir_lower_vs_vertex_conversion() to a common place
- microsoft/compiler: Add support for more scaled formats
- dzn: Allow some non-native formats to be used as vertex inputs
- dzn: Enable dEQP-VK.pipeline.vertex_input.* test cases

Lucas Stach (37):

- etnaviv: update headers from rnndb
- etnaviv: set VIVS_GL_VERTEX_ELEMENT_CONFIG depending on prim type
- etnaviv: don't supertile textures if supertiling is disabled via debug option
- etnaviv: fill all minor GPU features from the kernel
- etnaviv: update headers from rnndb
- etnaviv: correct bits per tile and clear value for HALTI5
- etnaviv: use new PE pipe address states on >= HALTI0
- etnaviv: use feature flag to determine which RS states to use
- etnaviv: use feature bit for one const src per instuction limitation
- etnaviv: clean up tiling setup in etna_compile_rs_state
- etnaviv: add support for big tile RS states
- etnaviv: properly set additional DEC400 compression states
- etnaviv: use feature bit to check for big tile support
- etnaviv: add tile size helper
- etnaviv: reinstate accidentially commented assert
- etnaviv: improve single buffer setup
- etnaviv: add linear PE support
- etnaviv: switch depth mode when linear PE is used
- etnaviv: expose real map buffer alignment
- etnaviv: allow mapped buffers during execution
- etnaviv: drm: always use hash to track BO index
- etnaviv: drm: don't cache mmap offset
- etnaviv: drm: make etna_bo_map thread safe
- etnaviv: drm: rename etna_drm_table_lock
- etnaviv: add function to get resource status
- etnaviv: rework resource status tracking (again)
- etnaviv: drop scanout assumption for resources allocated via modifier interface
- kmsro: add 'imx-lcdif' driver support
- etnaviv: move dummy BOs to screen
- etnaviv: add texture descriptor suballocator
- etnaviv: tex_desc: make error handling more consistent
- etnaviv: tex_desc: remove descriptor patch TODO comment
- etnaviv: fix use after free in async shader compile
- etnaviv: remove etna_align_up() function
- etnaviv: move checking for MC2.0 for TS into screen init
- etnaviv: use linear PE rendering only on properly aligned surfaces
- etnaviv: add debug option to disable linear PE feature

M Henning (7):

- nouveau/nir: Allow up to 6 nested joins
- nouveau: Emit cctl to flush L1 cache for atomics
- nouveau: Skip cctl for atomic counters in tgsi
- nouveau/nir: Split fewer 64-bit loads
- ttn: Make ttn_optimize_nir match gl_nir_opts
- nv50: Fix tlsSpace alignment
- nouveau: Drop C++03 compat code

Manas Chaudhary (2):

- panvk: Add PANVK_DEBUG_DUMP dump mappings
- panvk: Allow PAN_BO_INVISIBLE to be mapped with dump flag

Manuel Stoeckl (4):

- dri_interface: add XBGR16161616 format
- gbm: add GBM_FORMAT_[AX]BGR16161616
- dri: fix incorrect fourcc for R16G16B16A16_UNORM format
- dri: add [ax]bgr16161616 to format lookup tables

Marcin Ślusarz (40):

- anv: invalidate all metadata in anv_nir_lower_ubo_loads
- intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader
- intel/compiler: invalidate metadata in brw_nir_initialize_mue
- compiler: add VARYING_SLOT_CULL_PRIMITIVE
- intel/compiler: handle VARYING_SLOT_CULL_PRIMITIVE in mesh
- intel/compiler: handle gl_Viewport and gl_Layer in FS URB setup
- anv: update task/mesh distribution with the recommended values
- intel/compiler: print name of the unhandled intrinsic
- anv: disable streamout before emitting mesh shading state
- intel/compiler: implement primitive shading rate for mesh
- anv: handle primitive shading rate for mesh
- intel/tools: add support for GEM_CREATE_EXT in intel_dump_gpu
- intel/tools: clean up mmio regs definitions
- intel/tools: drop wrappers around mmio regs macros
- intel/tools/aubinator: add support for 2 "new" subopcodes
- intel/tools/aubinator: list all platforms in help message
- intel/tools: add macros for gfx12+ variant of VCSUNIT0
- nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis
- nir: add load_task_payload intrinsic to nir_divergence_analysis
- intel/tools: fix 32-bit build
- anv: remove redundant code calculating dynamic states mask
- anv: remove invalid copy/pasted comment
- anv: mask out not applicable state flags when setting up mesh pipeline
- intel/dev: fixup max_cs_workgroup_threads after hwconfig is read
- nir/opt_load_store_vectorize: handle task payloads
- intel/compiler: vectorize task payload loads/stores
- nir/lower_task_shader: insert barrier before/after shared memory read/write
- intel/compiler: assert that base is 0 for [load|store]_shared intrins
- intel: fix typos found by codespell
- intel/common: allocate space for at least one task urb
- intel/compiler: adjust task payload offsets as late as possible
- anv: disable injection of primitive shading rate for mesh
- intel/compiler: print shaders after nir_remove_unused_varyings
- nir: convert unused mesh outputs to shared memory
- intel/compiler: emit URB fences for TASK/MESH
- intel/compiler: there are 4 types of fences on gfx >= 12.5
- intel/compiler: insert URB fence before task/mesh termination
- intel/compiler: use NIR_PASS more
- intel/compiler: fix mesh urb write regression
- anv: disable task redistribution

Marek Olšák (311):

- gallium: move drivers/radeon/* into drivers/radeonsi/
- gallium: move radeon_winsys.h into gallium/include/winsys/
- gallium/winsys: move {amdgpu,radeon_drm}_public.h contents into radeon_winsys.h
- gallium/ddebug: implement pipe_vertex_state callbacks
- nir: fix an uninitialized variable valgrind warning in nir_group_loads
- ac/surface: fix an addrlib race condition on gfx9
- radeonsi: set more state bits for disabled binning
- radeonsi: don't compile monolithic PS for dual source blending
- radeonsi: add an SQTT workaround for chips with disabled RBs
- radeonsi: set PA_SC_EDGERULE according to bottom_edge_rule state
- radeonsi: initialize line stipple config registers
- radeonsi: fix ALPHA_IS_ON_MSB determination
- radeonsi: handle an out-of-bounds sampler/image index better
- radeonsi: don't treat skipped GS copy shader compilation as successful
- radeonsi: dump shaders after si_shader_binary_upload to fix printed LDS stat
- radeonsi: ignore AMD_DEBUG=nodcc if the texture is imported
- radeonsi: run nir_io_add_const_offset_to_base for TES/TCS as late optimizations
- radeonsi: scalarize IO instructions
- radeonsi: add pipe_stream_output_info into si_shader_context
- radeonsi: use nir_shader_get_entrypoint in si_nir_scan_shader
- radeonsi: apply key.ge.opt.kill_{outputs,pointsize,clipdistance} in NIR
- radeonsi: gather pipe_stream_output_info from NIR intrinsics
- radeonsi: move most "info" fields from si_shader_selector into si_shader_info
- radeonsi: move si_shader_info::stage into si_shader_selector
- ac: add ac_nir_optimize_outputs, a NIR version of ac_optimize_vs_outputs
- radeonsi: assign param export indices before compilation
- ac/llvm: remove LLVM pass ac_optimize_vs_outputs
- amd: document chips
- ac/surface/tests: generalize and extend gfx10 tests
- ac: add more non-shadowed registers to the lists
- ac/llvm: add AC_WAIT_EXP for ac_build_waitcnt
- ac/llvm: remove immoffset parameter from ac_build_tbuffer_store
- ac/llvm: remove immoffset parameter from ac_build_tbuffer_load
- ac/llvm: remove inst_offset parameter from ac_build_buffer_store_dword
- ac/llvm: use soffset correctly in ac_build_buffer_load
- ac/llvm: remove inst_offset parameter from ac_build_buffer_load
- ac/llvm: remove immoffset parameter from ac_build_tbuffer_load_byte/short
- ac/llvm: remove unused code from ac_llvm_build.c/h
- ac/llvm: don't use tbuffer_store as a fallback for swizzled stores
- ac/llvm: rename tbuffer -> buffer where tbuffer instructions are not generated
- ac,radeonsi: rework how scratch_waves is used and move it to ac_gpu_info.c
- ac/gpu_info: remove old and unused fields from radeon_info
- ac/llvm: remove unused LLVM helpers
- ac/llvm: don't create the target machine if the LLVM processor is unsupported
- radeonsi: remove an obsolete comment about LLVM 14
- radeonsi: fail to create pipe_screen if LLVM doesn't support the GPU
- winsys/amdgpu: fix a mutex deadlock when we fail to create pipe_screen
- radeonsi/ci: add polaris11 baseline
- util: add pipe formats R10G10B10X2_SINT, B10G10R10X2_SINT, G16R16_SINT
- util: change util_format_snorm8_to_sint8 -> snorm_to_sint for radeonsi
- ac/gpu_info: rework how num_se is derived
- ac: remove really_needs_scratch, parameter from ac_parse_shader_binary_config
- ac/llvm: set the correct cache policy for sparse buffer loads
- ac/llvm: don't set ac_dlc in visit_intrinsic
- ac: support GR channel order in ac_choose_spi_color_formats
- ac,radeonsi: report SCALED formats as unsupported by samplers and color buffers
- radeonsi: always print the lower-case GPU name for radeonsi-run-tests.py
- radeonsi: cleanup local includes
- radeonsi: align the tessellation ring address to 2MB
- radeonsi: remove unused si_cs_shader_state::uses_scratch
- radeonsi: simplify setting COMPUTE_PGM_RSRC2.SCRATCH_EN
- radeonsi: code style fix
- radeonsi: add si_can_use_compute_blit
- radeonsi: don't report R64_*INT as a sampler format because it doesn't work
- radeonsi: don't allow broken compute image copies into A8R8_UNORM
- radeonsi: add color buffer support for 2_10_10_10 pipe formats
- radeonsi: add color buffer support for R9G9B9E5_FLOAT on gfx10.3
- radeonsi: implement compute_copy_image between 1D_ARRAY and other texture types
- radeonsi: cosmetic changes in si_compute_copy_image
- radeonsi: assert format support in si_compute_copy_image
- radeonsi: don't expose formats that CB supports, but not shaders
- radeonsi: rename si_test_blit -> si_test_image_copy_region
- radeonsi/test_image_copy: remove whole image copies
- radeonsi/test_image_copy: remove linear and power-of-two size testing
- radeonsi/test_image_copy: remove special SDMA codepaths
- radeonsi/test_image_copy: simplify texture size generation
- radeonsi/test_image_copy: separate generating a random pipe_resource
- radeonsi/test_image_copy: separate printing pipe_resource into a function
- radeonsi/test_image_copy: test all interesting texture targets
- radeonsi/test_image_copy: test copying a block-compressed format
- radeonsi/test_image_copy: test mipmaps
- radeonsi/test_image_copy: test all formats
- radeonsi/ci: update sienna_cichlid results
- radeonsi: handle compressed formats in si_compute_copy_image
- radeonsi: improve the subsampled copy_image calculation
- radeonsi: remove compressed and subsampled gfx copy from resource_copy_region
- radeonsi: remove si_create_sampler_view_custom and related code
- radeonsi: remove si_create_surface_custom
- mesa: consider the sample count when choosing a texture format
- frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers
- amd/registers: hardcode GC base offsets in the json generator
- amd/registers: add gfx11 to the json generator
- amd/registers: add gfx11.json
- amd/registers: add gfx11-rsrc.json
- amd: enable gfx11 in header generator, fix drivers with renamed gfx6-10 defs
- amd: add gfx11 to packet definitions
- amd: import gfx11 addrlib
- amd: add Mesa-only addrlib changes
- ac: don't align VGPRs to 8 or 16 for gfx11
- ac: implement ac_get_tbuffer_format for gfx11
- ac: scratch buffer register changes for gfx11
- ac: implement register shadowing for gfx11
- ac/llvm: export mrt0 instead of null on gfx11
- ac/llvm: don't set DLC on gfx11 because it means something else there
- ac/llvm: update pknorm and waitcnt for gfx11
- ac/llvm: don't set GLC for stores on gfx11
- ac/gpu_info: set cu_mask correctly for gfx11
- ac/surface: gfx11 changes
- ac/surface: define gfx11 modifiers
- ac/surface: add gfx11 support to modifiers tests
- radeonsi/gfx11: increase the hw screen offset alignment
- radeonsi/gfx11: always allow DCC stores
- radeonsi/gfx11: expect packed threadID VGPRs
- radeonsi/gfx11: enable NGG-only draw paths
- radeonsi/gfx11: enable arbitrary DCC format reinterpretation
- radeonsi/gfx11: use the new TCS WaveID SGPR to compute vs_rel_patch_id
- radeonsi/gfx11: program inst_pref_size for graphics
- radeonsi/gfx11: buffer descriptor changes
- radeonsi/gfx11: add CB deltas
- ac,radeonsi/gfx11: remove FMASK loads
- ac,radeonsi/gfx11: set SWIZZLE_ENABLE correctly
- radeonsi/gfx11: don't set non-existent SPI_SHADER_USER_DATA_VS_x
- radeonsi/gfx11: don't set COMPR for exports, use 0x3 channel mask instead
- radeonsi/gfx11: implement attributes through memory
- radeonsi/gfx11: scattered register deltas
- radeonsi/gfx11: don't set non-existent CP_COHER_START_DELAY
- radeonsi/gfx11: emit SQ_NON_EVENT for tessellation at the end of IBs
- radeonsi/gfx11: don't use FLUSH_AND_INV_DB_META
- radeonsi/gfx11: TF_RING_SIZE changed to a per-SE size
- radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order
- radeonsi/gfx11: update the initialization of SGPR0/1 registers for HS and GS
- radeonsi/gfx11: change LDS allocation granularity for PS
- radeonsi/gfx11: don't count the non-existent scratch_byte_offset SGPR
- radeonsi/gfx11: resolve MSAA using u_blitter
- radeonsi/gfx11: mark streamout as unimplemented for now
- radeonsi/gfx11: limit CP DMA to max 32KB sizes
- radeonsi: inline si_cp_dma_prefetch in si_draw_vbo for lower overhead
- radeonsi/gfx11: add a workaround for CB perf counters
- amd: add chip identification for gfx1100-1103
- radeonsi: remove tautologies when setting CU_EN
- amd: rename ring_type --> amd_ip_type and match the kernel enum values
- amd: rename RING_* enums to AMD_IP_*
- amd: replace num_rings[type] with ip[type].num_queues
- ac/gpu_info: simplify HW IP querying
- ac/gpu_info: remove dead GDS query code
- ac/gpu_info: print all IP versions reported by the kernel
- ac/gpu_info: fix incorrect IP versions reported by the kernel
- radeonsi: report 10_10_10_2 scaled formats as unsupported on gfx6-9 as well
- radeonsi: report more missing/broken texture formats as unsupported on gfx6-9
- ac/llvm: implement memory_barrier_tcs_patch properly to fix LLVM 15
- radeonsi: wait before s_barrier in TCS epilog to fix LLVM 15
- ac/llvm: inline emit_membar
- ac/llvm: replace LLVMBuildFence with ac_build_waitcnt(VSTORE)
- ac/llvm: add gl_shader_stage parameter into
ac_build_s_barrier 3958- ac/llvm: move the gfx6 optimization for TCS barriers into ac_build_s_barrier 3959- ac/llvm: fix the remaining s_barriers for LLVM 15 3960- radeonsi: add si_get_tcs_epilog_key 3961- ac/llvm: skip s_barrier if tess patches don't cross a wave boundary 3962- radeonsi: inline si_nir_build_llvm 3963- ac/llvm: remove ac_shader_abi::emit_outputs 3964- radeonsi: cleanups getting rid of the ambigous "epilogue" word 3965- radeonsi: get rid of the ambiguous "prologue" word 3966- radeonsi: try to group stage-specific code in si_llvm_translate_nir 3967- ac: replace 5 ac_shader_abi::load_* callbacks with 1 intrinsic_load callback 3968- amd: change chip_class naming to "enum amd_gfx_level gfx_level" 3969- radeonsi: rename RADEON_FLAG_UNCACHED -> RADEON_FLAG_GL2_BYPASS 3970- winsys/amdgpu-radeon: rework how we describe heaps 3971- winsys/amdgpu: handle RADEON_FLAG_ENCRYPTED as a heap modifier 3972- radeonsi: use PIPE_RESOURCE_FLAG_UNMAPPABLE and DRIVER_INTERNAL more 3973- winsys/amdgpu-radeon: clean up code related to heaps 3974- include: update amdgpu_drm.h 3975- radeonsi: use the new flag AMDGPU_GEM_CREATE_DISCARDABLE 3976- radeonsi: use AMDGPU_VM_PAGE_NOALLOC to disable MALL (infinity cache) 3977- radeonsi: fix a crash when failing to create a context 3978- radeonsi: print an error when failing to create a context 3979- radeonsi/gfx11: set BIG_PAGE for the attribute ring 3980- ac/gpu_info: silence a valgrind warning due to amdgpu_query_hw_ip_info 3981- radeonsi: remove GFX9_MERGED_NUM_USER_SGPR definition 3982- ac/gfx11: fix the scratch buffer 3983- radeonsi/gfx11: fix alpha-to-coverage with stencil or samplemask export 3984- radeonsi/gfx11: don't insert shader code for GS_PIPELINE_STATS_EMU 3985- radeonsi/gfx11: fix the value of VGT_GS_OUT_PRIM_TYPE at the beginning of IBs 3986- Revert "radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order" 3987- radeonsi/gfx11: fix VM faults due to the attribute ring 3988- radeonsi/gfx11: optimize 
attribute stores 3989- radeonsi: merge all preamble states into one 3990- radeonsi: move CS preamble emission into the winsys 3991- winsys/amdgpu: initialize IB_PREAMBLE in advance 3992- radeonsi: fix resource_copy_region with ETC formats (e.g. for Stoney) 3993- Revert "frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers" 3994- Revert "mesa: consider the sample count when choosing a texture format" 3995- amd: rename fishes to Navi21, Navi22, Navi23, Navi24, and Rembrandt 3996- r300,r600,svga: save the FS constant buffer for u_blitter to fix a regression 3997- st/mesa: remove unused GENERIC input and output from the clear VS 3998- ac/gpu_info: always retile DCC on gfx10 and newer chips 3999- mesa: clamp UINT values greater than INT_MAX for glGetInteger 4000- mesa: change GL_MAX_COMPUTE_WORK_GROUP_COUNT to UINT 4001- mesa: change GL_MAX_UNIFORM_BLOCK_SIZE to UINT 4002- mesa: change GL_MAX_TEXTURE_BUFFER_SIZE to UINT 4003- mesa: change GL_MAX_SHADER_STORAGE_BLOCK_SIZE to UINT 4004- st/mesa: limit MaxComputeWorkGroupCount to INT_MAX - 1 due to failing tests 4005- st/mesa: round down MaxUniformBlockSize to a multiple of 4 for piglit 4006- gallium: rename PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE to \*_BUFFER0_* 4007- gallium: add PIPE_CAP_MAX_CONSTANT_BUFFER_SIZE_UINT for UBOs 4008- gallium: rename PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE->MAX_TEXEL_BUFFER_ELEMENTS_UINT 4009- gallium: rename PIPE_CAP_MAX_SHADER_BUFFER_SIZE -> \*_UINT 4010- radeonsi: increase the max compute LDS size to 64KB for gfx7+ 4011- radeonsi: report correct maximum compute grid sizes 4012- radeonsi: clamp against MAX_TEXEL_BUFFER_ELEMENTS correctly 4013- radeonsi: change max TBO/SSBO sizes again and rework max alloc size 4014- radeonsi: compute PIPE_CAP_MAX_TEXEL_BUFFER_ELEMENTS_UINT correctly 4015- radeonsi: set the max UBO size same as the max SSBO size 4016- winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+ 4017- radeonsi: print LDS size in bytes 4018- 
radeonsi: don't use info.gs.invocations if it's not GS 4019- radeonsi: fix uninitialized wait_mem_scratch_tmz 4020- radeonsi: remove unused dword from wait_mem_scratch 4021- radeonsi: fix polygon stippling without color and Z outputs (v2) 4022- radeonsi: restructure PS no-export fixups 4023- radeonsi: determine DB_SHADER_CONTROL in si_shader_ps 4024- radeonsi: set INTERPOLATE_COMP_Z to 0 to work around an EQAA bug 4025- radeonsi: add BREAK_BATCH at the beginning of IBs 4026- radeonsi: simplify how pipeline statistic offsets are computed 4027- radeonsi: rework how VS_STATE_BITS are set for VS, TES, and GS 4028- radeonsi: rename and regroup VS_STATE definitions 4029- radeonsi: move GS_STATE bits to the end to make space at the beginning 4030- radeonsi: rework how vs_state_bits is set and unpacked 4031- radeonsi: fix an NGG streamout hang with monolithic shaders 4032- radeonsi: fix a crash in gfx10_sh_query_get_result_resource 4033- radeonsi: unconditionally enable the streamout overflow query with NGG 4034- radeonsi: inline gfx10_emit_streamout_begin/end 4035- radeonsi: fix NGG streamout hang by allocating GDS in the right place 4036- radeonsi: remove streamout code from shaders if no streamout buffers are bound 4037- winsys/amdgpu: add a kernel GDS management workaround retrying on -ENOMEM 4038- radeonsi: allocate GDS only once per process 4039- radeonsi: allocate only 1 GDS OA counter for gfx10 NGG streamout 4040- ac,radeonsi: don't export null from PS if it has no effect on gfx10+ 4041- ac/gpu_info: clamp gart_size_kb and vram_size_kb to fix buggy kernel driver 4042- gallium/u_threaded: fix buffer busyness tracking 4043- ac/llvm: fix tcs_wave_id unpacking on gfx11 4044- amd/gfx11: add PixelWaitSync packet fields 4045- radeonsi/gfx11: compile monolithic PS if it writes memory 4046- radeonsi/gfx11: allocate more space for pipeline statistics 4047- radeonsi/gfx11: synchronize correctly before setting SPI_ATTRIBUTE_RING_* 4048- radeonsi/gfx11: don't use memory for 
waiting for cache flushes 4049- radeonsi/gfx11: don't allocate unused wait_mem_scratch 4050- radeonsi/gfx11: drop the ES vertex count requirement 4051- radeonsi/gfx11: add missing register shadowing code 4052- radeonsi/gfx11: rework GDS streamout code to single-lane and enable streamout 4053- amd: update addrlib - trivial changes 4054- ac/surface: expose all 64K_R_X and 256K_R_X modifiers on gfx11 4055- radeonsi: fix random PS wave size 4056- util: remove our redefinition of assert 4057- Change all debug_assert calls to assert 4058- util: remove now unused debug_assert 4059- ttn: set dest_type for TXQ 4060- gallium: add pipe_blit_info::dst_sample, update u_blitter to write that sample 4061- ac/gpu_info: rename info fields to num_cu, memory_bus_width, memory_freq_mhz 4062- ac/gpu_info: remove tabs 4063- ac/gpu_info: rework and extend device info to be more informative 4064- ac/gpu_info: use hw_ip::ip_discovery_version to set IP versions 4065- ac/gpu_info: use the kernel-reported GFX IP version to set gfx_level 4066- ac/gpu_info: remove vram_size and gtt_size in favor of \*_kb variants 4067- ac/gpu_info: remove redundant uvd_decode 4068- ac/gpu_info: remove redundant jpeg_decode 4069- ac/gpu_info: remove redundant uvd_encode 4070- ac/gpu_info: remove redundant vce_encode 4071- ac/gpu_info: remove redundant vcn_encode 4072- amd: require amdgpu DRM 3.2.0 from April 2016 4073- gallium/radeon: require radeon DRM 2.45.0 from April 2016 4074- radeonsi: remove workarounds for radeon DRM < 2.45.0 4075- ac/gpu_info: use drm_amdgpu_device_info instead of amdgpu_gpu_info 4076- ac/gpu_info: remove amdgpu_gpu_info parameter from ac_query_gpu_info 4077- ac/gpu_info: remove unused has_unaligned_shader_loads 4078- gallium/radeon: require radeon DRM 2.50.0 (kernel 4.12) from July 2017 4079- ac/gpu_info: require amdgpu DRM 3.15.0 (kernel 4.12) from July 2017 4080- util/u_threaded: fix a CPU storage crash due to an invalid codepath taken 4081- glthread: don't ignore 
glPushAttrib/glPopAttrib when tracking GL_CULL_FACE 4082- glthread: don't sync on IsEnabled(GL_DEPTH_TEST) by tracking it in glthread 4083- util/format: add util_format_rgbx_to_rgba helper 4084- amd/addrlib: fix 3D texture allocation failures on gfx11 4085- ac/surface: don't forbid 256KB swizzle modes on smaller gfx11 chips 4086- radeonsi: unify VGT_TESS_DISTRIBUTION programming 4087- radeonsi: move SI_MAX_VRAM_MAP_SIZE to si_debug_options.h 4088- radeonsi: call pipe->blit instead of util_blitter_blit after MSAA resolving 4089- radeonsi: make si_launch_grid_internal static 4090- radeonsi: add common helper si_launch_grid_internal_images that is more robust 4091- radeonsi: remove compute-based DCC decompression because it's broken 4092- radeonsi: don't do image stores with RGBX, L, LA, I, and SRGB formats 4093- radeonsi: follow shader_info.float_controls_execution_mode (mostly) 4094- radeonsi: add need_fmask_expand parameter into si_decompress_subresource 4095- radeonsi: check for 16-bit hw support instead of relying on options.fp16 4096- radeonsi: move compute-related code from si_blit.c to si_compute_blit.c 4097- radeonsi: fold async_copy into the preceding conditional in si_blit 4098- radeonsi: cosmetic changes around do_hardware_msaa_resolve 4099- radeonsi/gfx11: use correct VGT_TESS_DISTRIBUTION settings 4100- radeonsi/gfx11: set SAMPLE_MASK_TRACKER_WATERMARK = 15 and clean up 4101- radeonsi/gfx11: use better PRIM_GRP_SIZE_GFX11 setting 4102- radeonsi/gfx11: skip code in si_update_shaders that has no effect 4103- radeonsi/gfx11: rename si_calc_inst_pref_size -> si_get_shader_prefetch_size 4104- radeonsi/gfx11: enable shader prefetch except for initial chip revisions 4105- radeonsi/gfx11: use a better workaround for the export conflict bug 4106- radeonsi: make various blit functions non-static 4107- radeonsi: allow texture_map to upload only 1 sample for MSAA instead of all 4108- radeonsi: add a randomized blit test 4109- radeonsi: move the no-AA small prim 
precision cull constant into an SGPR 4110- radeonsi: move small prim precision computation out of si_emit_cull_state 4111- radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 4112- glthread: unbind framebuffers in glDeleteFramebuffers 4113- glthread: call _mesa_glthread_DeleteBuffers unconditionally 4114- ac/surface: disallow 256KB swizzle modes on gfx11 APUs 4115 4116Marek Vasut (3): 4117 4118- dri: Check buffer height and avoid overflow 4119- etnaviv: Rename etna_emit_tex() args 4120- etnaviv: Implement TXD 4121 4122Mark Collins (1): 4123 4124- tu: Clamp priority in DRM submitqueue creation 4125 4126Mark Janes (11): 4127 4128- intel: parse intel_measure environment without side effects 4129- intel: relax assertion in INTEL_MEASURE 4130- anv: fix performance bug in INTEL_MEASURE 4131- intel: provide a callback to clean up after intel_measure_gather 4132- iris: provide a callback to INTEL_MEASURE to clean up snapshots 4133- util: define helpful macros for compiler diagnostic features 4134- iris: pad all structures used in a shader key 4135- iris: reorder to minimize padding 4136- intel/compiler: pad all data structures used by shader cache keys 4137- intel/compiler: reorder shader cache keys to minimize padding 4138- intel/dev: drop warning for unhandled hwconfig keys 4139 4140Martin Roukala (né Peres) (27): 4141 4142- ci/b2c: fix the generation of the IMAGE_UNDER_TEST variable 4143- ci/b2c: bump the kernel to 5.17.1 4144- ci/b2c: update b2c to enable monitoring support 4145- radv/ci: add a Van Gogh dEQP runner 4146- ci: add ACO_DEBUG to the list of variables to pass down for testing 4147- ci/radv: remove amdgpu.gpu_recovery=0 from the kernel cmdline 4148- radv/ci: reboot the DUTs when hitting a ring timeout 4149- radv/ci: reduce the execution timeout 4150- radv/ci: re-enable fossils testing 4151- radv/ci: enable the experimental support for mesh shaders 4152- radv/ci: update the list of flakes 4153- radv/ci: allow polaris10 to retry more times 
- radv/ci: bump the timeout of vangogh/renoir
- radv/ci: update the vkcts expectations for the CI machines
- radv/ci: remove an expected fail from all families
- radv/ci: try to guess failure expectations for Fiji/Raven/Navi14/Navi22
- ci/valve: do not extract the install tarball on the runner
- ci/valve: uprev boot2container
- Revert "ci/valve: uprev boot2container"
- radv/ci: test vkd3d on kabini
- ci/radv: move a test from the fail list to the flake list
- ci/radv: add a missing flake
- ci/radv: enable vkcts testing on kabini
- ci/x86: update to llvm 13
- radv/ci: test the llvm backend on navi21
- radv/ci: allow specifying a warning pattern
- radv/ci: do not reboot on soft-recovered hangs, just warn

Matt Coster (20):

- pvr: csbgen: Formatting pass (PEP-8 plus other minor changes)
- pvr: csbgen: Add a missing @staticmethod decorator
- pvr: csbgen: Make some loops more pythonic
- pvr: csbgen: Cleanup imports
- pvr: csbgen: Remove "dim" functionality from Group
- pvr: csbgen: Remove unused function parameters
- pvr: csbgen: Add typing information
- pvr: csbgen: Isolate "main" function with __name__ check
- pvr: csbgen: Replace map() and filter() with list comprehension
- pvr: csbgen: Fix "local variable may be undefined" lints
- pvr: csbgen: Add \*_unpack() functions for all generated struct types
- pvr: debug: Implement PVR_DEBUG variable
- pvr: csbgen: Make all generated enums unambiguous
- pvr: Add a const qualifier to inline pvr_csb_*() helpers
- pvr: Remove pvr_cmd_struct() macro
- pvr: Make pvr_cmd_pack() macro clearly internal
- pvr: Split out unioned structs from struct pvr_sub_cmd
- pvr: Implicitly assert that the correct sub-command type is present
- pvr: Split pvr_dev_addr_t into a separate header
- pvr: Add helper macros for creating pvr_dev_addr_t instances

Matt Turner (3):

- mesa: Drop dead #include "sparc/sparc.h"
- freedreno/ir3: Move the texture array coord fixup to nir
- freedreno/ir3: Pass 16-bit sampler coordinates when possible.

Matti Hamalainen (7):

- gallium/tools: fixes to option handling
- gallium/tools: add option for ignoring junk calls in trace dumper
- gallium/tools: reimplement tracediff completely in Python
- pytracediff: add per-line difference highlighting for blocks
- pytracediff: make -M ("method only") option print arguments for differing calls
- pytracediff: change how 'junk' calls are handled
- pytracediff: implement pager ('less') invocation internally

Max Kellermann (2):

- gallium/u_threaded: add missing reference counts for draw_multi slots
- gallium/u_threaded: fix offset calculation for draw_multi slots

Michael Olbrich (1):

- crocus: export GEM handle with RDWR access rights

Michael Skorokhodov (2):

- glsl: Fix ir_quadop_vector validation
- anv: Update line range

Michel Zou (7):

- gallium: fix unused symbols warnings
- meson: call run_command with check=true
- vulkan/wsi: fix missing unistd include
- zink: fix pointer size conversion warning
- lavapipe: allow null handleTypes
- meson: drop dladdr check on win32
- util/xmlconfig: fix -Wmaybe-uninitialized warning

Mihai Preda (28):

- gallium: add opaque pointers shim for LLVM < 8.0
- gallium: use LLVM opaque pointers in draw_llvm.c
- gallium/llvmpipe: use LLVM opaque pointers in lp_bld_interp.c
- gallium/llvmpipe: use LLVM opaque pointers in lp_bld_depth.c
- gallivm: use LLVM opaque pointers in lp_bld_arit.c
- gallivm: use LLVM opaque pointers in lp_bld_assert.c
- gallivm: use LLVM opaque pointers in lp_bld_format_aos.c
- gallivm: use LLVM opaque pointers in lp_bld_conv.c
- gallivm: use LLVM opaque pointers in lp_bld_coro.c
- gallivm: use LLVM opaque pointers in lp_bld_printf.c
- gallivm: use LLVM opaque pointers in lp_bld_flow.c
- gallivm: use LLVM opaque pointers in lp_bld_intr.c
- gallivm: use LLVM opaque pointers in lp_bld_format_s3tc.c
- gallivm: use LLVM opaque pointers in lp_bld_gather.c
- gallivm: use LLVM opaque pointers in lp_bld_nir_soa.c
- gallivm: use LLVM opaque pointers in lp_bld_sample.c
- gallivm: use LLVM opaque pointers in lp_bld_sample_soa.c
- gallivm: use LLVM opaque pointers in lp_bld_struct.c
- gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c
- gallivm: use LLVM opaque pointers in lp_bld_tgsi_aos.c
- gallivm: LLVM opaque pointers small changes
- gallium: refactor a channel loop in draw_llvm.c
- Revert "gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c"
- gallium: LLVM-15 contexts use non-opaque pointers
- gallivm: fix a few llvm non-opaque pointers
- gallivm: refactor a bit the cache access in view of LLVM opaque pointers
- gallivm: add lp_build_struct_get() variants that take the LLVM type
- gallivm: push LLVM version guard into assert

Mike Blumenkrantz (634):

- nir/lower_tex: fix rect queries with lower_rect set
- mesa/st: set normalized coords for RECT samplers if rects are unsupported
- zink: unset PIPE_CAP_TEXRECT
- zink: radv baseline fix
- zink: nv ci update
- zink: clamp max shader images to 32
- zink: remove tcs patch slot map
- zink: disable EXT_extended_dynamic_state2 on AMDPRO
- zink: add baseline for amdpro
- zink: reject resource creation if format features don't match attachment
- zink: use mixed zs renderpass for depth read/write
- zink: unconditionally set line width on rasterizer state change
- zink: support restart with PIPE_PRIM_LINES_ADJACENCY if ext is available
- zink: fix extended restart prim types without dynamic state2
- zink: do not create fences at all if timeline semaphores are supported
- zink: require timeline semaphores
- zink: remove batch lock
- zink: rename zink_query::batch_id
- zink: remove driver-based max_fences throttling
- zink: hook up VK_KHR_create_renderpass2
- zink: only use VK_DEPENDENCY_BY_REGION_BIT if sync2 is available
- zink: require renderpass2
- zink: require KHR_imageless_framebuffer
- zink: delete all non-imageless framebuffer code
- zink: remove framebuffer indirection
- zink: rename imageless framebuffer functions
- zink: remove the worst part of basic framebuffer support
- zink: remove ZINK_NO_TIMELINES
- zink: make a kopper debug print into an error
- zink: rework zink_kopper_update() assert
- zink: always bind gfx pipeline at the top of draw
- zink: store min required stride values on the vertex state
- zink: selectively disable dynamic vertex stride
- nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims
- lavapipe: run nir_fold_16bit_sampler_conversions
- zink: set VK_QUERY_RESULT_WAIT_BIT when copying to qbo
- zink: NV_linear_color_attachment
- zink: nv ci update
- zink: fix synchronization when drawing from streamout
- zink: fix xfb counter buffer barriers
- zink: remove xfb_barrier flag
- zink: handle device-local unsynchronized maps
- util/draw: fix map size of indirect buffer in util_draw_indirect_read
- util/draw: handle draw_count=0 when reading indirect parameters
- util/draw: fix indirect draw count readback
- gallium: add PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE
- zink: unset PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE
- zink: move the kopper present fence to the displaytarget object
- wgl: support GL 4.6
- zink: make unsupported blit formats an error
- zink: make update_framebuffer_state() public
- zink: rework clear_depth_stencil hook
- zink: add a stencil blit fallback path
- zink: fix tcs control barriers for use without vk memory model
- zink: fix semantics mask for compute control barriers
- zink: hook up VK_KHR_vulkan_memory_model
- zink: use vulkan memory model shader semantics for tcs barriers
- zink: always use vk mem model for shaders if available
- zink: fix memory_barrier intrinsic
- zink: fix stride=0 for dynamic state vertex stride checking
- zink: add synchronization for buffer clears
- mesa/st: clamp GL_RENDERBUFFER to GL_TEXTURE_2D for sparse queries
- aux/trace: make get_sparse_texture_virtual_page_size useful
- glsl/nir: set new_style_shadow for sparse tex ops as necessary
- zink: fix group memory barrier emission
- vulkan: bump layer api versions to current vk header version
- vk/cache: unbreak vk_pipeline_cache_create with flags
- kopper: add DISPLAY_TARGET bind for depth buffer
- zink: fix/improve swapchain surface info updating
- zink: fix up swapchain depth buffer geometry during fb update
- drisw: remove dead code
- kopper: always fetch and store drawable info
- kopper: move drawable geometry updating up in function
- kopper: store whether screen has dmabuf support
- kopper: copy a bunch of code for texture_from_pixmap
- zink: add supported present modes to kopper displaytarget
- zink: add a util function for finding swapchain resource
- zink: flag swapchains when updating fails
- zink: handle dead swapchains in acquire
- zink: unset image layout when killing swapchain
- zink: rework swapchain object check for acquires
- zink: fix return for zink_kopper_acquire
- zink: handle swapchain acquire failures more directly
- zink: flag swapchain resources as swapchains
- zink/kopper: add a mechanism for checking swapchain status
- zink: radv ci updates
- ac: remove gallivm header include
- kopper: add some error logging for pixmap->texture failure
- kopper: check for modifiers to use modifier functionality
- zink/kopper: add a function for determining if running on software
- kopper: check whether zink is using sw
- kopper: rename a confusing variable
- kopper: implement texture-from-pixmap for software (non-dmabuf)
- zink: fix dmabuf plane returns
- zink: export fd info for all 2d images
- zink: ignore KMS handle types
- zink: remove drm_fd
- llvmpipe: add a ci flake
- llvmpipe: disable statistic queries during draw if queries are disabled
- llvmpipe: remove misleading query comment
- llvmpipe: disable compute statistics queries if queries are disabled
- wgl: always set alpha on kopper windows
- zink: hook up VK_EXT_primitives_generated_query
- zink: pass query object to get_num_queries()
- zink: pass query object to get_num_query_pools()
- zink: pass query object to get_num_results()
- zink: pass screen param to convert_query_type()
- zink: add and use a function to detected emulated primgen queries
- zink: add a flag to zink_query to trigger rasterizer discard workaround
- zink: use VK_EXT_primitives_generated_query when available
- wgl: don't auto-load zink before software drivers
- lavapipe: lower quad_broadcast intrinsics
- zink: add a ci flake
- kopper: invalidate drawables when resizing textures in place
- kopper: pass the current context to dri_flush
- util/blitter: fix sampler restore with 0 saved samplers
- radv: emit fewer framebuffer registers
- zink: restore conditional ordering for query begin/end
- nir/gather_info: flag fbfetch on subpass image loads
- llvmpipe: never infer early zs tests when fbfetch is active
- zink: split renderpasses for TextureBarrierNV() usage
- zink: scale depth bias by factor of 2
- zink: pass sparse backing page offset to binding function
- zink: pass sparse bind bo offset through for texture binds
- zink: set all usage flags when querying sparse features
- zink: fix multisample conditional in sparse image query
- zink: bump number of image binds that can be batched to 50
- zink: semi-handle 1D sparse texture rewrites for drivers that don't support them
- zink: decouple descriptor templates from layouts
- zink: flatten push descriptor template into normal template array
- zink: implement indirect buffer indexing
- zink: delete nir_lower_dynamic_bo_access
- lavapipe: enqueue pipeline destruction
- zink: fix null buffer/surface formats
- zink: add more image usage for null surfaces
- zink: use descriptor surfaces for notemplates ref updating
- lavapipe: handle null samplerviews
- gallivm: fix ssbo oob reads
- lavapipe: fix CmdPushDescriptorSetWithTemplateKHR with refcounting
- st/draw_feedback: set constant buffer stride
- gallivm/draw: fix oob ubo reads
- llvmpipe: always set ssbo data pointers for draw
- gallivm: fix oob txf swizzling
- lavapipe: set robustBufferAccess2 and robustImageAccess2
- lavapipe: ci updates
- docs: EXT_robustness2 for lavapipe
- zink: ci updates
- kopper: remove unused zink include
- zink: remove unused headers/struct members/linkage
- zink: outdent some code
- zink: add a flag to indicate whether a resource is exportable
- zink: break out resource bind-adding into separate function for reuse
- zink: introduce copy contexts
- zink: create a copy context for the screen on init
- Revert "zink: export fd info for all 2d images"
- zink: always check for fb rebinds when starting renderpass
- zink: unset pipe_resource::next pointers when creating resource copies
- zink: add a bind flag to indicate a resource is exported as a dmabuf
- zink: fix conditional for modifier usage
- zink: force memory exports if dmabuf bind is specified
- zink: add a LINEAR drm modifier if rebinding to add dmabuf export
- zink: rebind resources for export as needed
- zink: create images with modifiers any time there is an import handle
- zink: fix up sparse texture sampling for shadow samplers
- zink: clamp out min_lod operands for explicit lod ops
- zink: fix sparse binding for arrayed textures
- zink: set sparse flag in cubemap lowering
- zink: remove misleading sparse comment
- zink: fix sparse texture depth calcs for arrayed textures
- zink: remove implicit query resets
- zink: remove refs from shader images
- zink: reuse local res pointer in set_shader_images
- zink: simplify dumb update flagging in set_shader_images
- zink: read shader image r/w usage from incoming data struct
- zink: copy incoming shader image struct after doing updates
- zink: stop leaking shader image surfaces
- zink: fix 3d shader image miplevel calc for depth
- zink: directly reuse surface ivci when rebinding
- gallium/noop: implement pipe_screen::create_fence_win32
- aux/trace: screen::create_fence_win32
- zink: add win32 guards for device extensions with win32 in name
- lavapipe: ignore depth/stencil states if attachment is not present
- lavapipe: fix pipeline handling for dynamic render zs state with pipeline library
- spirv: fix barrier scope assert
- zink: manually validate shaders in debug builds
- zink: fix init with MESA_SHADER_CACHE_DISABLE enabled
- zink: use copy context to eliminate dependency on EXT_calibrated_timestamps
- zink: move get_timestamp to screen
- zink: add tu baseline for a630
- zink: add an anv-tgl ci job
- zink: never create a sw context unless sw is requested
- zink: only reject non-blittable resource creation if it has no other binds
- zink: add separate error message for push descriptor set alloc fail
- zink: add extra validation for resource creation
- zink: remove input attachment usage when pruning surface usage
- zink: add extended usage and retry if resource creation fails
- zink: fix surface/bufferview cache comparisons
- zink: force render target usage for sampler-only resources
- zink: clamp 1D_ARRAY sparse textures to 2D as needed
- zink: handle PIPE_BUFFER sparse texture queries
- zink: fix non-dynamic vertex stride update flagging
- zink: add c++ guards for zink_screen.h
- zink: add ZINK_DEBUG=sync
- zink: update radv baseline
- zink: rename a fb surface variables
- zink: always use zink_batch_rp to init renderpass
- zink: use zink_batch_no_rp to end renderpasses
- zink: move renderpass handling to zink_render_pass.c
- zink: use global image rebind counter for dmabuf export
- zink: remove fb rebind check during renderpass begin
- zink: use local variable for getting rt barrier info
- zink: break out sample location updating to separate function
- zink: break up swapchain renderpass update to util function
- zink: remove renderpass state flag for swapchain
- zink: pass rt attrib to zink_render_pass_attachment_get_barrier_info()
- zink: fix indentation
- zink: break out color rt attrib analysis into util function
- zink: break out zs rt attrib analysis into util function
- zink: track whether there's any transient attachments bound
- zink: lift some renderpass mechanics up a level in the api
- zink: split out scissor flagging from framebuffer updating
- zink: use dynamic rendering (most of the time)
- turnip: fix assert for max xfb outputs
- zink: only warn once for SRGB fb without KHR_swapchain_mutable_format
- zink: clamp dynamic render layerCount to 1
- zink: create all descriptor ref templates at once
- zink: add screen-based indexing to descriptor sets
- zink: use descriptor indices in compiler
- zink: use descriptor set index indirection in program init
- zink: use descriptor set index indirection in cache state updates
- zink: flag descriptor sets as changed before updating hashes
- zink: add a "compact" descriptor mode
- zink: use XOR for descriptor hash accumulation
- zink: radv ci updates
- zink: use a zink_render_pass_pipeline_state bit for fbfetch
- zink: add renderpass bits for color/depth r/w
- zink: update radv piglit fails
- vtn: assert that vector shuffle indices are in-bounds
- vtn: assert that composite members have the same bit size as the result
- zink: require draw params ext/cap in ntv if drawid is used
- zink: emit spirv cap for subgroup vote ops
- zink: use correct scope for subgroup vote ops
- zink: fix subgroup vote detection
- zink: lower subgroup width to 1 for unsupported subgroup vote stages
- zink: remove first_frame stalling
- zink: drop wideLines requirement
- zink: drop largePoints requirement
- st/pbo_compute: do pbo readback directly to the buffer object if it exists
- st/pbo_compute: use different calc for non-3d compute buffer sizing
- st/pbo_compute: make compute download conditional in shader slightly more readable
- st/pbo_compute: fix z coords for compute pbos
- zink: only infer modifiers if winsys handle has a stride
- zink: drop requirement for 6 descriptorsets to use bindless
- zink: store the real (non-compact) binding usage for programs
- zink: fix change flagging for compact descriptor cache
- zink: flag all cached descriptors as needing update on program change
- zink: add a #define for descriptor compaction
- zink: make swapchain kill message more descriptive
- st/texture: use base teximage for compute blits
- st/texture: allow compute blits for (some) non-finalized textures
- lavapipe: fix dynamic patchControlPoints
- mesa: throw a log warning any time a fallback texture is used
- zink: fix radv piglit fails
- zink: force queries to start/end out of renderpass
- zink: tu ci updates
- zink: reorder pipeline dynamic state1 substruct
- zink: improve packing for pipeline dynamic state1 info
- zink: break out dynamic state1 pipeline info to struct
- zink: use memcmp for comparing dynamic state1 info
- zink: use dynamic cull mode
- zink: break out pipeline dynamic state2 into struct
- zink: use dynamic patch vertices if available
- zink: expand patch_vertices pipeline key bitsize
- zink: fix framebuffer attachment usage asserts for dummy attachments
- aux/trace: fix sampler view dumping
- aux/trace: implement pipe_screen::is_compute_copy_faster
- zink: disable compute pbos on turnip
- egl/wayland: manually swap backbuffer when using zink
- egl/wayland: skip buffer creation on zink
- kopper: use get_drawable_info path for non-x11 drawables
- ci: disable unit tests
- zink: fix dmabuf plane layout struct scoping
- zink: represent plane offsets using offset from plane 0 vs size of plane
- zink: rename a variable
- zink: handle aux plane imports
- Revert "zink: remove drm_fd"
- zink: add back kms handling
- mesa: improve relocation problem message
- mesa/st: bump param reservation to 20
- format_utils: properly parenthesize macro params
- d3d12: skip time-elapsed piglit tests in ci
- virgl: add some ci flakes
- mesa/st: bump param reservation to 28
- mesa: add statevar for atomic counter offsets
- mesa: conditionally set constants dirty for atomic counter binds
- st/glsl_to_nir: call st_set_prog_affected_state_flags() as late as possible
- mesa: handle atomic counter lowering for drivers with big ssbo offset aligns
- zink: remove ANV depth clip control workaround
- zink: disable EXT_primitives_generated_query on turnip
- zink: track invalidation for image resources
- zink: flag renderpass for change if image resource changes valid state
- zink: rename renderpass attrib value
- zink: split out a dynamic render ternary
- zink: hook up surface invalidation to LOAD_OP_DONT_CARE
- zink: invalidate blit dsts if fully covered
- zink: remove buffer valid
range tracking from blit 4572- mesa/streaming-memcpy: add memcpy fallback 4573- streaming-memcpy: move to src/util/ and compile unconditionally 4574- mesa/st: use util_streaming_load_memcpy for compute pbo copying 4575- zink: break out of zs mixed layout update loop when work is done 4576- zink: only update layout when doing mixed zs attachment renderpass check 4577- aux/trace: add pipe_context::link_shader 4578- aux/tc: handle link_shader 4579- zink: simplify some compiler bo code 4580- zink: fix buffer descriptor hashing 4581- zink: always create descriptor templates 4582- zink: flag optimization pass after inlining uniforms 4583- zink: break out nir alu typing in ntv 4584- zink: run nir_lower_alu_to_scalar in optimizer loop for 64bit lowering 4585- zink: always scalarize pack/unpack alu ops 4586- zink: don't lower pack/unpack intrinsics 4587- zink: run more int64 passes during optimization loop if int64 isn't available 4588- zink: fix ntv partial stores 4589- zink: flag all shader i/o variables as !is_xfb 4590- zink: fix variable locations in manual xfb emission 4591- zink: if a variable is flagged as being xfb, treat it as manual xfb output 4592- zink: improve/expand manual xfb emission 4593- zink: implement nir_op_unpack_64_2x32 in ntv 4594- zink: rewrite 64bit shader i/o as 32bit 4595- zink: scalarize when rewriting explicit 64bit io 4596- zink: avoid adding more 64bit alus during bo rewriting 4597- zink: fix 32bit bo rewriting 4598- zink: update tu a630 baseline 4599- zink: add new flakes 4600- zink: fix feature requirement 4601- zink: add feature req for border colors 4602- zink: more requirement docs 4603- zink: rewrite atomic ssbo intrinsics as atomic derefs 4604- zink: run copy_prop_vars during optimization 4605- zink: warn on missing customBorderColorWithoutFormat 4606- zink: more lavapipe glcts fails 4607- zink: cap driver inlining using ssa allocation limit 4608- zink: use separate pointer for swapchain presents 4609- zink: keep a count of async 
presents occuring for a given swapchain 4610- zink: defer old swapchain destruction 4611- zink: add implicit sync workaround for non-mesa drivers 4612- zink: unset res->swapchain upon killing a swapchain 4613- zink: fix up KILL to a more sensible log message 4614- lavapipe: VK_EXT_border_color_swizzle 4615- anv: VK_EXT_border_color_swizzle 4616- zink: fix dynamic stride conditional in pipeline creation 4617- zink: fix generated tcs update path 4618- zink: use the patch_vertices value from the shader key, not the context 4619- zink: unset generated tcs pointer from tes on deletion 4620- zink: fix generated tcs deletion 4621- zink: only add necessary binds during dmabuf export 4622- zink: avoid uninit values in renderpass state 4623- zink: don't print VK_EXT_shader_atomic_float warning for nir_intrinsic_image_deref_atomic_exchange 4624- zink: hook up VK_EXT_non_seamless_cube_map 4625- zink: use VK_EXT_non_seamless_cube_map when available 4626- zink: rename nonseamless struct members to emulate_nonseamless 4627- zink: skip some emulated cube code if the extension is present 4628- zink: fix cubemap lowering for array samplers 4629- zink: fix cubemap lowering bit size 4630- zink: clamp dynamic renderArea to framebuffer size 4631- zink: check for pending clears to determine write status of zs attachments 4632- zink: add a function for getting the minimum framebuffer layers 4633- zink: clamp dynamic fb layers better 4634- zink: clamp renderpass layers better 4635- zink: anv ci updates 4636- zink: use uint64_t for batch_id 4637- zink: remove thread_queue 4638- zink: use a dynarray for semaphore wait flags 4639- zink: add batch api for adding a wait semaphore 4640- zink: add semaphore handling for sparse binds 4641- zink: detect sparse-only binding queue 4642- zink: always bind on the sparse queue 4643- zink: only enable vtn debugging if validation is active 4644- zink: tu ci baseline update 4645- zink: radv baseline updates 4646- drisw: probe kms if fd is passed 4647- 
kmsro: add error message on drm ioctl failure 4648- zink: rename a variable 4649- zink: unify actual map calls in buffer mapping 4650- zink: use only the extents for buffer unmap flushes 4651- zink: fix buffer transfer flushes with offsets 4652- zink: store and reuse memory heap flags for buffer placement 4653- zink: move BAR allocation demotion up the stack 4654- zink: be a little selective about BAR fallback memory type 4655- zink: hook up VK_EXT_border_color_swizzle 4656- zink: use VK_EXT_border_color_swizzle 4657- zink: print runtime warning on missing EXT_border_color_swizzle 4658- mesa/st: add PIPE_CAP_DITHERING 4659- zink: unset PIPE_CAP_DITHERING 4660- mesa/st: increment num_uniforms for hw accel select shaders 4661- zink: move draw-time barrier generation down a little 4662- zink: track vertex buffer bind counts on resources 4663- zink: rework buffer barrier generation 4664- zink: track image binds 4665- zink: add a #define for vk shader bits 4666- zink: collect gfx stages for all bindings during barrier generation 4667- zink: don't short-circuit gfx stage finding during barrier generation 4668- zink: relax zink_resource_buffer_needs_barrier checks 4669- zink: ci updates 4670- glx/drisw: store the flush extension to the screen 4671- glx/drisw: invalidate drawables upon binding context if flush extension exists 4672- zink: fix dual_src_blend driconf workaround 4673- nir/lower_point_size: apply point size clamping 4674- mesa: break out PointSizeIsOne setting to util function 4675- mesa: rename PointSizeIsOne -> PointSizeIsSet 4676- mesa: skip pointsize exports if pointsize attenuation is enabled 4677- mesa: enforce pointsize exports if pointsize is being clamped 4678- mesa: explicitly disallow multiple pointsize exports from generating 4679- zink: ci updates 4680- zink: add flag to indicate if a resource is a dmabuf 4681- zink: flag dmabufs for foreign queue transition on flush_resource call 4682- lavapipe: zero out blend info if blend isn't enabled 4683- 
lavapipe: copy more pNexts for pipeline creation 4684- lavapipe: fix renderpass info handling during pipeline creation 4685- lavapipe: add a pipeline library assert 4686- lavapipe: skip post-copy pNext checking during pipeline creation for composites 4687- zink: use the bigger of the variable type and interface type for bo sizing 4688- zink: track ssbo bind counts 4689- zink: track overall samplerview bind counts 4690- zink: fix image bind counting 4691- zink: track gfx/compute descriptor barrier info 4692- zink: use tracked barrier info for generated barriers 4693- zink: remove another zink/tu fail 4694- zink: fix-ish depth clipping without VK_EXT_depth_clip_enable 4695- zink: add a turnip driver workaround for EXT_depth_clip_enable 4696- zink: remove swizzle from fbfetch lowering 4697- lavapipe: always set point_tri_clip 4698- zink: fix kopper_acquire return value 4699- zink: track whether current swapchain has data 4700- zink: move to AoS for storing swapchain struct data 4701- zink: move 'acquired' flag onto swapchain struct 4702- zink: use better determination for handling swapchain acquire submits 4703- zink: remove broken kopper assert 4704- zink: remove unnecessary code in kopper readback acquire 4705- zink: improve no-op acquire detection 4706- zink: flag batch as having work during frontbuffer flush 4707- zink: ensure pending present flushes are handled during frontbuffer flush 4708- zink: delete zink_resource_object::acquire 4709- zink: no-op pixmap frontbuffer flush calls 4710- zink: acquire swapchain images on image map 4711- zink: use kopper_displaytarget type directly 4712- zink: delete zink_resource_object::dt_has_data 4713- zink: set PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY for device-local resources 4714- lavapipe: break out part of shader optimization into function 4715- lavapipe: streamline xfb shader setup 4716- nir/types: fix glsl_matrix_type_is_row_major() assert 4717- zink: store context flags 4718- zink: don't increment screen->num_contexts for 
copy context 4719- zink: disable turnip traces temporarily 4720- lavapipe: don't remove xfb outputs 4721- zink: enforce viewport depth clamping 4722- zink: size uniform_0 ubo accurately 4723- zink: further improve bo sizing 4724- zink: force unsized array in bo creation using the last block member type 4725- zink: move/refactor bo_vars struct creation in compiler 4726- zink: handle bitsizes in get_bo_vars() analysis 4727- zink: don't run nir_lower_uniforms_to_ubo repeatedly 4728- zink: pass zink_shader to optimize_nir during final optimize pass 4729- zink: add an optimizer pass to enforce bo bounds 4730- zink: use std430 packing by default 4731- zink: use a std430 interface for bo types 4732- mesa: remove incomplete texture warning 4733- mesa/st: make get_sampler_view_format() public 4734- mesa/st: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO 4735- zink: init driver workarounds earlier in screen creation 4736- zink: disable customBorderColorWithoutFormat on turnip 4737- zink: don't warn for missing customBorderColorWithoutFormat on turnip 4738- zink: support PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO 4739- zink: flush pending clears for fb texture barriers 4740- mesa: fix SignalSemaphoreEXT behavior 4741- lavapipe: don't overwrite entire VkFormatProperties3 struct 4742- lavapipe: don't crash on null xfb buffer pointer 4743- zink: reject swizzled format blits 4744- lavapipe: support inlined shader spirv for compute 4745- zink: allow multiple tex components for depth tg4 4746- zink: always use 32bit sample ops 4747- zink: add a compiler pass to match up tex op dest types 4748- zink: use sampler_bind_count to simplify some code 4749- zink: break out samplerview layout reset code 4750- zink: always update sampler descriptor layouts on fb surface unbind 4751- zink: always use 32bit floats for so output types 4752- zink: handle bare matrix types in xfb emission 4753- zink: split xfb block emission from array/matrix handling 4754- zink: fix xfb array 
inlining 4755- zink: use right glsl length getter for ntv partial stores 4756- zink: bitcast extracted streamout components to uint before creating uvec 4757- zink: add a compiler pass to split xfb block outputs 4758- zink: fix xfb emit check in compiler 4759- mesa: move is_wrap_gl_clamp() to samplerobj.h and deduplicate 4760- mesa: track which sampler wrap params use GL_CLAMP 4761- mesa: update GL_CLAMP emulation when binding/unbinding textures 4762- zink: remove bogus range tracking from texture clear 4763- zink: fix transient attachment rp assert 4764- zink: remove format check from clear texture 4765- zink: fix coverage check for texture clears 4766- zink: stop using u_blitter for texture clears 4767- zink: improve zink_clear_depth_stencil check for current attachment 4768- zink: massively simplify zink_clear_depth_stencil 4769- zink: remove non-renderpass clear path from zink_clear_texture 4770- zink: remove u_blitter usage from zink_clear_render_target 4771- zink: always use storeOp=STORE for depth renderpass 4772- zink: track a bitmask of fb attachments with mismatched layer counts 4773- zink: add explicit (awful) handling for fb layer mismatch clears 4774- zink: clamp color clear values based on format 4775- zink: lift clearing on fb state change up a level 4776- zink: remove out-of-renderpass clears 4777- zink: delete zink_fb_clear_util_unpack_clear_color 4778- zink: delete srgb tracking for clears 4779- zink: simplify zink_framebuffer_clear_data union 4780- zink: inject a 0,0,0,1 clear for RGBX formats 4781- zink: break out tc/trace context unwrapping 4782- zink: nv baseline update 4783- zink: tu a630 baseline update 4784- mesa/st: add implicit zeroing of clipdistance array 4785- zink: move descriptor_mode to be a global 4786- zink: delete ZINK_DESCRIPTOR_MODE_NOFALLBACK 4787- zink: add an explicit ZINK_DESCRIPTOR_MODE_CACHED 4788- zink: default to lazy/templated descriptors 4789- zink: handle max_vertices=0 in geometry shader 4790- zink: handle null 
samplerview in get_imageview_for_binding() 4791- zink: fix crashing on bo allocation failure 4792- zink: improve oom error message 4793- zink: call bind_last_vertex_stage() when binding vs 4794- zink: fix viewport count tracking 4795- zink: use gfx_barrier for synchronization when binding buffer descriptors 4796- vulkan/wsi: fix multiple acquires for sw without mit-shm 4797- llvmpipe: ignore multisample state for alpha_to_one 4798- zink: don't try to start a renderpass before CmdClearAttachments 4799- zink: don't flag renderpass change on depth write change as often 4800- zink: flag renderpass change if renderpass clears change 4801- zink: do dynamic render implicit clear updates on rp_changed 4802- zink: unflag ctx->rp_changed after dynamic render update 4803- zink: remove redundant renderpass update check for clears 4804- zink: add an assert to verify that deferred clears aren't added in renderpass 4805- zink: split out rp_changed to be more granular for dynamic render 4806- zink: break out renderpass attachment load op calc 4807- zink: break out zs renderpass layout calc 4808- zink: break out color rt layout calc 4809- zink: reuse rt layout functions to refactor zink_render_pass_attachment_get_barrier_info() 4810- zink: add a bits union for zink_render_pass_state comparisons 4811- zink: improve granularity of renderpass switching 4812- zink: allow no-op renderpass updates in zink_batch_rp() 4813- zink: move layout-setting for dynamic render attachments after no-op case 4814- zink: handle null samplerview/image rebinds more gracefully 4815- zink: scale number of shader cache-get threads 4816- zink: invoke descriptor_program_deinit for programs on context destroy 4817- vulkan/wsi: return VK_SUBOPTIMAL_KHR for sw/x11 on window resize 4818- zink: add env var to abort on device-lost if no reset callback is set 4819- util/vbuf: handle multidraws 4820- anv: disable dev.i915.perf_stream_paranoid=0 warning 4821- lavapipe: propagate xfb info for pipeline library 
rasterization stages 4822- zink: remove deqp fails for lavapipe 4823- zink: only flag fbfetch as rp update if fbfetch wasn't already configured 4824- zink: assert !rp_changed after renderpass has started 4825- zink: avoid crashing when bindless texture updates aren't used 4826- lavapipe: EXT_non_seamless_cube_map 4827- lavapipe: accurately set image/ssbo access based on shader usage 4828- lavapipe: implement extreme uniform inlining 4829- lavapipe: remove some skips 4830- zink: add a ci flake 4831- lavapipe: move vertex input dynamic states into vertex input block 4832- lavapipe: make binding/layout helper functions static inline 4833- lavapipe: use correct offset calculation for flagging ssbo/image write access 4834- lavapipe: use 64bit masks for shader access 4835- lavapipe: set image access on pipeline bind, not descriptor bind 4836- llvmpipe: remove weird clamping for format support with sample counts 4837- llvmpipe: allow resource_copy_region to perform replicate operations 4838- lavapipe: fix stencil clears 4839- llvmpipe: add a shortcut for blitter resolves 4840- vulkan: hook up VK_EXT_multisampled_render_to_single_sampled 4841- lavapipe: VK_EXT_multisampled_render_to_single_sampled 4842- lavapipe: advertise VK_EXT_multisampled_render_to_single_sampled 4843- util/blitter: respect PIPE_TEXTURE_RECT 4844- lavapipe: fix shader access stage conversion from mesa -> pipe 4845- zink: stop ORing unordered barrier access 4846- zink: convert ZINK_DEBUG to enum 4847- zink: add doc for ZINK_DEBUG=compact 4848- zink: add ZINK_DEBUG=noreorder 4849- zink: zink_resource::unordered_barrier -> unordered_exec 4850- zink: explicitly unset unordered_exec when resource can't be unordered 4851- zink: rework unordered_exec handling 4852- zink: add a second resource param to get_cmdbuf() for unordered promotion 4853- zink: promote image/buffer copy operations to unordered 4854- zink: move buffer/image functions down in file 4855- zink: expand unordered_exec 4856- zink: make 
get_cmdbuf() public 4857- zink: enable unordered blit/resolve ops 4858- virgl: more ci flakes 4859- d3dadapter: fix sw pipe loading 4860- nine: set view_mask=0 4861- zink: fix gfx program cache pruning with generated tcs 4862- zink: don't flag lazy push constant set dirty on batch change 4863- zink: combine loops for lazy descriptor program deinit 4864- zink: handle mutable swapchain images with dmabuf 4865- zink: store VkFormatFeatureFlags on creation 4866- zink: use modifier feature flags during surface creation when necessary 4867- zink: init cache_put program fence on program creation 4868- zink: force flush clears on fb change if fb geometry changes 4869- zink: force a new framebuffer for clear_depth_stencil if the clear region is big 4870- zink: drop mode_changed check from linewidth/depthbias draw updates 4871- zink: handle unscaled depth bias from nine 4872- zink: handle !half_pixel_center 4873- zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE 4874- nine: check return on resource_get_handle 4875- zink: demote dmabuf tiling to linear if modifiers aren't supported 4876- zink: use modifier_aspect to check for modifier plane in zink_resource_get_param 4877- zink: don't add modifiers if EXT_image_drm_format_modifier isn't present 4878- zink: add all format modifiers when adding for dmabuf export 4879- zink: don't fixup sparse texops 4880- vulkan: Update the XML and headers to 1.3.224 4881- radv: fix return type for meta resolve shaders 4882- nir/validate: clamp unsized tex dests to 32bit 4883- mesa: fix blending when using luminance/intensity emulation 4884- mesa: require render target bind for A/L/I in format selection 4885- vk/render_pass: don't deref null resolve attachments 4886- zink: handle nir_intrinsic_sparse_residency_code_and mechanics 4887- zink: ignore nir_texop_lod for tex dest matching 4888- tu: fix invalid free on alloc failure 4889- zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache() 4890- zink: don't emit entrypoints for 
function temp variables 4891- zink: check the variable mode before taking samplemask path in ntv 4892- zink: explicitly use unsigned types for bit shifts 4893- zink: only add srgb mutable for images with modifiers 4894- zink: don't emit illegal interpolation 4895- llvmpipe: don't assume pipe_context is always available in flush_frontbuffer 4896- zink: flag all assigned output slots as mapped 4897- zink: handle split acquire/present 4898- st_pbo/compute: fix 1D_ARRAY offsets 4899- st_pbo/compute: fix 1D coord dimension by pre-trimming vectors 4900 4901Mike Lothian (1): 4902 4903- util: Fix invalid usage of alignas in u_cpu_detect.c 4904 4905Mykhailo Skorokhodov (3): 4906 4907- anv: Use sampleLocationsEnable for sample locations 4908- iris: Move Wa_1806527549 and enable by default 4909- anv: Move Wa_1806527549 and enable by default 4910 4911Nagappa Koppad, Basanagouda (1): 4912 4913- iris:Duplicate DRM fd internally instead of reuse. 4914 4915Nanley Chery (11): 4916 4917- iris: Return a 64B stride for clear color plane 4918- intel/isl: Add a score for I915_FORMAT_MOD_4_TILED 4919- isl,iris: Add DG2 CCS modifier support for XeHP 4920- mesa: Simplify UNORM8 conversions for sRGB ASTC textures 4921- mesa: Follow ASTC decode mode extension for RGBA8 output 4922- iris: Don't leak surface states for compressed resources 4923- iris: Don't leak compressed resources in iris_create_surface 4924- iris: Use fill_surface_states for compressed resources 4925- anv: Make the D16 reg mode single-sampled 4926- iris: Make the D16 reg mode single-sampled 4927- iris: Dedent enum iris_depth_reg_mode 4928 4929Nicolas Caramelli (1): 4930 4931- egl: Fix EGL_EXT_platform_xcb name string to match the registry 4932 4933Panagiotis Apostolou (1): 4934 4935- util: Don't block SIGSEGV for new threads 4936 4937Patrick Lerda (1): 4938 4939- panfrost: Fix unwanted valgrind message related to restart_index 4940 4941Paulo Zanoni (4): 4942 4943- iris: fix race condition during busy tracking 4944- 
intel/compiler: compute int64_options based on devinfo->has_64bit_int 4945- intel/compiler: split handling of 64 bit floats and ints 4946- intel/compiler: call ordered_unit() only once at update_inst_scoreboard() 4947 4948Pavel Asyutchenko (7): 4949 4950- gallium: add PIPE_CAP_FBFETCH_ZS and expose extension 4951- glsl: add language support for GL_ARM_shader_framebuffer_fetch_depth_stencil 4952- nir_to_tgsi: Don't count ZS fbfetch vars as outputs 4953- llvmpipe: enable per-sample shading when FB fetch is used 4954- llvmpipe: simplify early/late zs tests selection 4955- llvmpipe: implement FB fetch for depth/stencil 4956- llvmpipe: enable PIPE_CAP_FBFETCH_ZS 4957 4958Pavel Ondračka (13): 4959 4960- r300: set PIPE_BIND_CONSTANT_BUFFER for const_uploader 4961- r300: skip draws instead of using a dummy vertex shader 4962- r300: fix vertex shader control flow in loops 4963- r300: deduplicate common NIR options 4964- r300: be less agresive with copy propagate in loops 4965- r300: don't try to use inline constants instead of constant swizzles 4966- r300: Update list of RV515 dEQP failures and add some flakes 4967- r300: merge simple movs with constant swizzles together 4968- r300: expose PIPE_CAP_LEGACY_MATH_RULES 4969- r300: only run merge_movs pass on R500 4970- r300: Keep rc_rename_regs() from overflowing RC_REGISTER_MAX_INDEX 4971- r300: disable PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF 4972- r300: fix variables detection for paired ALU and TEX instructions in different branches 4973 4974Pierre-Eric Pelloux-Prayer (81): 4975 4976- radeonsi: make si_pc_emit_shaders non static 4977- radeonsi: add helpers to deal with spm counters 4978- radeonsi: add spm counters setup code 4979- radeonsi/sqtt: setup spm bo 4980- radeonsi/sqtt: enable spm counters on gfx10+ 4981- radeonsi: make sure profile_peak is used before using sqtt 4982- ac/spm: setup write broadcasting correctly 4983- ac/surface: adjust gfx9.pitch[*] based on surf->blk_w 4984- radeonsi: remove commutative_blend_add 
option 4985- radeonsi: drop assume_no_z_fights option 4986- drirc: remove i965 entry 4987- gallium: add PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER 4988- radeonsi: enable PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER 4989- radeonsi: fix VS kill_outputs handling 4990- radeonsi: don't always strip NIR debug infos 4991- radeonsi: generate unique shader name in si_get_nir_shader 4992- amd: update gfx10_format_table.py for gfx11 4993- radeonsi/gfx11: add assert in legacy vs path 4994- radeonsi/gfx11: register changes 4995- radeonsi/gfx11: image descriptor changes 4996- radeonsi/tests: allow empty line and comments in csv files 4997- radeonsi/tests: add a heuristic to pick the baseline 4998- radeonsi/tests: reformat with black 4999- u_threaded: clear non-async debug callback correctly 5000- radeonsi: flush VGT streamout like PAL 5001- radeonsi: implement pipeline stats workaround 5002- gallium: add a union to access queries counters 5003- gallium: always return the correct pipeline stats index 5004- gallium: deduplicate target -> stats index code 5005- radeonsi: deduplicate query offsets 5006- radeonsi: store the pipeline stats index 5007- radeonsi: implement GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB in shaders 5008- radeonsi/test: update sienna_cichlid results 5009- radeonsi/query: use the qbo correct size 5010- radeonsi/ngg: reuse the pipeline stats buffer when using atomics 5011- radeonsi: fix gs_invocation query with NGG 5012- radeonsi: don't use wave32 for GE on gfx10 if culling is used 5013- amd: fix ac_build_mbcnt_add in wave32 mode 5014- util/u_queue: rework UTIL_QUEUE_INIT_SCALE_THREADS to scale faster 5015- radeonsi: remove unused params from si_shader_select_with_key 5016- radeonsi: scale the number of shader compiler threads 5017- radeonsi: store shader variants in an array 5018- glx: set max values for pbuffer width / height 5019- radeonsi: fix glTexBuffer max size handling 5020- radeonsi: wait for PS idle in si_set_framebuffer_state 5021- radeonsi/tests: add a --slow option 5022- 
radeonsi/tests: use a smaller tests-per-group value 5023- radeonsi: don't use sel->nir in si_check_blend_dst_sampler_noop 5024- ac: export LLVM c++ ResetCommandLineParser method 5025- ac: reset LLVM command line parser 5026- ac/llvm: add use_waterfall_for_divergent_tex_samplers option 5027- radeonsi: enable use_waterfall_for_divergent_tex_samplers 5028- st/mesa: use mutex in st_get_texture_sampler_view_from_stobj 5029- gallium: update winsys_handle documentation 5030- gallium: add size attribute to winsys_handle 5031- va/surface: set the correct size in vaExportSurfaceHandle 5032- radeonsi: set size in si_texture_get_handle 5033- tradeonsi: fix preamble state producing incorrect packets 5034- radeonsi: prevent recursion in si_decompress_dcc 5035- radeonsi: add helper to use si_screen::aux_context 5036- radeonsi: use helpers to access si_screen::aux_context 5037- Revert "winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+" 5038- dri: store internal_format when known 5039- mesa/st: don't guess the internal format if it's known 5040- util/blob: use memcpy in read functions 5041- mesa: skip draws with invalid indices offset 5042- vulkan/wsi: define pWaitDstStageMask in the blit submission 5043- ac: use LLVMContextSetOpaquePointers if available 5044- radeonsi: use LLVMBuildLoad2 where possible 5045- ac: add per output is_16bit flag to ac_shader_abi 5046- ac/llvm: handle opaque pointers in visit_store_output 5047- ac/llvm: use LLVMBuildLoad2 in visit_load 5048- radeonsi: use LLVMBuildLoad2 in llvm PS 5049- radeonsi: use LLVMBuildLoad2 for inter-stage outputs loads 5050- radeonsi: check last_dirty_buf_counter and dirty_tex_counter 5051- amdgpu/bo: update uses_secure_bos when importing buffers 5052- nir: add a nir_opt_if_options enum 5053- radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot 5054- radeonsi: use nir_opt_large_constants earlier 5055- mesa: avoid reading back textures from VRAM 5056- radeonsi: invalidate L2 when using dcc 
stores 5057 5058Qiang Yu (63): 5059 5060- radeonsi: lower nir_intrinsic_sparse_residency_code_and 5061- mesa/program: fix nir output reg overflow 5062- mesa/vbo: remove unused vbo_context->binding 5063- nir/builder: add load/store array variable helper functions 5064- mesa: add hardware accelerated select constant 5065- mesa: add _mesa_bufferobj_get_subdata 5066- mesa: refine name stack code to prepare for hw select 5067- mesa: add hw select name stack code path 5068- mesa/vbo: enclose none-vertex functions with HW_SELECT_MODE 5069- mapi: add api setup header for hw select mode 5070- mesa: set CurrentServerDispatch too when glBegin/End 5071- mesa: add HWSelectModeBeginEnd dispatch table 5072- mesa: pass select result buffer offset as attribute/varying 5073- mesa/st: implement hardware accelerated GL_SELECT 5074- zink: reset PIPE_CAP_ACCELERATED when cpu soft rendering 5075- virgl: return -1 for PIPE_CAP_ACCELERATED 5076- mesa: enable HardwareAcceleratedSelect 5077- ac/llvm: get back nir_intrinsic_load_tess_rel_patch_id_amd 5078- nir: add nir_intrinsic_load_lshs_vertex_stride_amd 5079- ac/nir: use nir_intrinsic_load_lshs_vertex_stride_amd 5080- radeonsi: implement load_lshs_vertex_stride abi 5081- radeonsi: add tcs_vgpr_only_inputs parameter to si_get_nir_shader 5082- ac/nir: skip gl_Layer/gl_ViewportIndex write for LS 5083- radeonsi: replace llvm ls/hs interface lds ops with nir lowered ones 5084- gallium: disable hardware select for crocus 5085- ac/llvm: conditionally check wave id in gs sendmsg 5086- radv: no need to do gs_alloc_req for newer chips in ngg vs/tes 5087- radeonsi: deserialize nir binary in si_check_blend_dst_sampler_noop 5088- radeonsi: use si_shader as parameter in si_get_nir_shader 5089- radeonsi: support multi stage shader state creation in nir shaderlib 5090- radeonsi: add si_create_passthrough_tcs 5091- radeonsi: replace llvm based fixed tcs with nir 5092- radeonsi: implement nir_intrinsic_load_tcs_num_patches_amd 5093- ac/nir: remove unused 
parameter in tes input lower 5094- ac/nir: add nir_intrinsic_load_hs_out_patch_data_offset_amd 5095- ac/nir: use nir_intrinsic_load_hs_out_patch_data_offset_amd in tess lower 5096- ac/nir: add pass_tessfactors_by_reg param to hs output lower 5097- ac/nir: add wave_size parameter to ac_nir_lower_hs_outputs_to_mem 5098- radeonsi: preload tess offchip ring for tcs 5099- radeonsi: implement nir_intrinsic_load_ring_tess_offchip(_offset)_amd 5100- radeonsi: implement nir_intrinsic_load_tess_rel_patch_id_amd for both tcs and tes 5101- ac/llvm: handle write mask for nir_intrinsic_store_buffer_amd 5102- ac/nir: add no_input_lds_space param to hs output lower 5103- radeonsi: replace llvm tcs output with nir lower pass 5104- radeonsi: enable PIPE_CAP_GLSL_TESS_LEVELS_AS_INPUTS 5105- radeonsi: set uses_vmem_load_other for nir_intrinsic_load_buffer_amd 5106- radeonsi: replace llvm tes input load with nir lowering 5107- ac/llvm: fix code format alignment in visit_load_local_invocation_index 5108- radeonsi: implement two esgs ring nir intrinsic 5109- ac/nir: skip gl_ViewportIndex and gl_Layer write in ES 5110- radeonsi: set lds for gs/es to handle nir shared memory load/store 5111- ac/nir: remove unused param num_reserved_es_outputs from gs input lower 5112- ac/nir: change es output lower param to esgs_itemsize 5113- radeonsi: replace llvm es output with nir lowering 5114- ac/nir: add triangle_strip_adjacency_fix to gs input lower 5115- ac/llvm: get back nir_intrinsic_load_gs_vertex_offset_amd 5116- radeonsi: replace llvm gs input handle with nir lowering 5117- ac/nir/ngg: fix nogs culling scratch size 5118- ac/nir/ngg: add a barrier before prim id export 5119- ac/nir/ngg: Decouple primitive ID store and primitive export. 
- nir/lower_gs_intrinsics: fix primitive count for points
- radeonsi: fix tcs_out_lds_offsets arg alignment
- winsys/amdgpu: fix non-page-aligned sparse buffer creation

Rajnesh Kanwal (29):

- pvr: Remove double error reporting in error path.
- pvr: Change phase_rate_change type to bool from uint32_t.
- pvr: Setup common nop shader.
- pvr: Add services winsys transfer context support.
- pvr: Add support to create transfer context and setup required shaders.
- pvr: Add vk_sync support and remove service winsys syncobjs interface.
- pvr: Fix argument comments in render job submission.
- pvr: Pass device info struct in pds generator.
- pvr: Add support for R32G32B32A32_UINT, R32_UINT and R8_UINT formats.
- pvr: Allow signal_sync pointer to be NULL in job submission.
- pvr: Remove vk_format_is_pure_integer and use common helper.
- pvr: Add transfer 3d parameter heap support.
- pvr: Fix ISP_RENDER_MODE_TYPE enum in rogue_cr.xml
- pvr: Restrict argument to dev_info where possible.
- pvr: Fix UTIL_ARCH_BIG_ENDIAN check in vk_format.h.
- pvr: Change ALIGN macro to ALIGN_ATTR.
- pvr: Add services winsys transfer cmd submit interface.
- pvr: Move transfer logic to pvr_job_transfer.[ch]
- pvr: Remove clang-format off comment from vk_format.h
- pvr: Implement vkCmdDraw API.
- pvr: Init and store default sampler in pvr_device.
- pvr: Use common vk_buffer implementation.
- pvr: Implement vkCreateBufferView and vkDestroyBufferView.
- pvr: Move binding related checks in common code.
- pvr: Add support for sampler descriptor.
- pvr: Add support for combined image and sampler descriptor.
- pvr: Add support for image descriptor.
- pvr: Add support for storage and uniform texel descriptor.
- pvr: Add support for input attachment descriptor.

Renato Pereyra (4):

- venus: add support for vk_xwayland_wait_ready
- venus: Add driconf option to enable implicit fencing
- Revert "wsi/x11: Don't leak xcb_get_geometry_reply_t."
- Revert "wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes"

Rhys Perry (94):

- aco: fix disassembly of SMEM with both SGPR and constant offset
- ac/nir: properly handle large global access constant offsets
- nir/gather_info: fix system_value_read for rt/mesh system values
- nir/builder: add nir_{ine,ibfe,ubfe}_imm() helper
- radv: remove unnecessary align_mul/offset initialization
- radv: initialize meta shader options earlier
- radv: use radv_meta_build_nir_vs_generate_vertices() more
- radv: use nir_op_imm helpers
- radv: use inot(a) instead of ine(a, true)/ieq(a, false)
- ac/nir: fix 64-bit NGG GS output stores
- radv: always use nir_lower_io_lower_64bit_to_32
- nir/copy_prop_vars: fix non-vector shader call payloads
- radv: fix clearing of TRUNC_COORD with tg4 and immutable samplers
- radv: consider radix sort shaders to be internal/meta
- aco/ra: fix vgpr_limit
- aco: ensure that definitions fixed to operands have matching regclasses
- radv: add radv_has_shader_buffer_float_minmax
- radv: allow LBVH on GFX6+
- radv: remove subtractions in address calculations
- radv: use SMEM for a few load_global
- aco: split load_sbt_amd result
- aco: skip s_barrier if TCS patches are within subgroup
- ac/nir: skip s_barrier if TCS patches are within subgroup
- aco: fix cmpswap global atomic definition on GFX6
- nir: allow NIR_PASS(_, )
- nir: print file when validation fails
- nir: call nir_metadata_preserve in nir_io_add_const_offset_to_base
- nir: call nir_metadata_preserve in nir_vectorize_tess_levels
- nir: call nir_metadata_preserve in nir_lower_memory_model
- radv: call nir_metadata_preserve in various lowering passes
- radv: add missing NIR_PASS() and switch from NIR_PASS_V()
- radv: validate shaders after linking passes
- aco: fix p_constaddr with a non-zero offset
- aco/tests: add test for p_constaddr with a non-zero offset
- aco: only add/subtract low bits of program addresses
- aco/ra: fix usage of invalidated iterator
- aco: use p_parallelcopy for uniform reduction with zero source
- aco: clarify a portion of do_pack_2x16
- aco/tests: disable regalloc.subdword_alloc.reuse_16bit_operands on GFX11
- aco/tests: fix gfx11 variants printed as gfx12
- aco: disable sdwa on gfx11
- aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00
- aco/tests: update for GFX11's removal of SDWA
- radv,aco: disable shader cache when ACO debug options are used
- aco: fix SMEM load_global_amd with non-zero offset
- aco: fix SMEM load_global with VGPR address and non-zero offset
- aco: consider fma with multiplication by power-of-two unfused
- ac: use ResetAllOptionOccurrences instead of ResetCommandLineParser
- gitlab: ask for useful titles
- gitlab: ask that reporters don't include long logs in descriptions
- nir/deref: stop assuming coherent accesses of different SSBOs may alias
- aco: update SMEM offset workaround for LLVM 15
- aco: don't skip VS->TCS barrier if TCS output vertices doesn't match input
- aco: cleanup force-waitcnt output
- aco/ra: update register file when updating phi definition
- radv/ci: re-enable vega10 fossils testing
- aco: add and use constantValue16()
- aco: don't accept med3 opcodes in get_minmax_info()
- aco: include _e64 variants of 16-bit min/max in minmax optimizations
- aco: ignore precise flag when optimizing integer clamps
- aco: fix single-alignbyte do_pack_2x16() path with fp inline constants
- aco: fix propagate_constants_vop3p with integer vop3p and 16-bit constants
- aco: don't use opsel to fold constants into dot accumulation sources
- aco: fix redirect combine in propagate_constants_vop3p() with negatives
- aco: try sign-extending or shifting constants in propagate_constants_vop3p
- aco: don't use 32-bit fp inline constants for fp16 vop3p literals
- aco/tests: add vop3p constant combine tests
- nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd
- nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b))
- nir: allow 16-bit fsin_amd/fcos_amd
- radv,aco,ac/llvm: use nir_op_f{sin,cos}_amd
- aco: make flat access latency match mtbuf/mubuf/mimg
- aco: include flat-like in vmem clause statistics
- aco: make FLAT_instruction::offset signed
- aco: improve support for scratch_* instructions
- aco: combine additions and constants into scratch load/store
- aco: handle subtractions in parse_base_offset
- aco: refactor VGPR spill/reload lowering
- aco: avoid WAW hazard with BVH MIMG and other VMEM
- aco: include scratch/global in VMEM WAW optimization
- aco: treat flat-like as vmem in some scheduling heuristics
- aco: initialize scratch base registers on GFX9-GFX10.3
- aco: use scratch_* for scratch load/store on GFX9+
- aco: use scratch_* for VGPR spill/reload on GFX9+
- radv: fix vbo_bound_mask indexing
- radv: don't update misaligned_mask in CmdBindVertexBuffers2
- aco: remove unnecessary exp_cnt increments
- aco: fix LdsBranchVmemWARHazard with 2+ branch chains
- aco: fix hash statistic
- radv: remove claimed support for sRGB vertex buffer formats
- radv: fix 16-bit support in radv_lower_vs_input
- aco: fix 16-bit VS inputs
- aco: don't expand vec3 VS input load to vec4 on GFX6
- aco: add SCC clobber in build_cube_select

Ricardo Garcia (1):

- vulkan: allow null descriptor set layouts in pipeline layouts

Riteo (1):

- vulkan/device_select_wayland: fix a memory leak with DRM device handling

Rob Clark (64):

- freedreno: Misc indent fixes
- freedreno/drm/virtio: Appease valgrind
- freedreno/drm: Drop FD_PP_PGTABLE
- freedreno/drm: Close bo handle after bo->destroy()
- freedreno/drm: Move bo common init
- freedreno/drm/virtio: Support ring_idx
- freedreno/drm/virtio: Use userspace IOVA allocation
- freedreno/drm/virtio: Drop blocking in host
- freedreno/drm/virtio: Fix SHAREABLE+MAPPABLE
- freedreno/drm/virtio: Protocol updates
- freedreno/drm/virtio: Pass guest handles to execbuf
- freedreno/drm/virtio: Async ccmd batching
- freedreno/drm/virtio: Split up large uploads
- freedreno/a6xx: A couple more FD_BO_NOMAP
- freedreno: Split out helper for staging blit
- freedreno: Use staging transfer if mmap fails
- freedreno/drm: Fix bos_on_stack calculation
- drm-shim: Add GET_UNIQUE support
- freedreno/drm-shim: Update to latest uapi version
- freedreno/drm-shim: Robustify error handling
- drm-shim: Better mmap offsets
- drm-shim: Add error handling for bo_init()
- freedreno/drm-shim: Better iova handling
- drm-shim: Cleanup on device file close
- freedreno/drm: Use DEBUG_GET_ONCE_OPTION()
- freedreno/drm/virtio: Don't try to mmap imported bo's
- freedreno/devices: Add another SKU
- Revert "ci: freedreno farm is down"
- freedreno: Add pkt4 assert
- freedreno/registers: Small cleanup
- freedreno/a6xx: Split VFD_FETCH[] if needed
- freedreno/a6xx: Initialize VFD_FETCH[n].SIZE to zero
- freedreno/a6xx: assert valid vertex_flags reg
- turnip: assert valid vertex_flag reg
- freedreno/ir3: Don't lower_gs multiple times
- freedreno/ir3: Add copy_vars() helper
- freedreno/ir3: Copy vars if needed on EndPrimitive()
- freedreno/ir3: Add more tess varying slots
- freedreno/a6xx: Fix indentation
- freedreno/a6xx: Fix VS const packet size
- freedreno/ir3: Fix GS clip-plane lowering
- freedreno: rename ir3_emit_driver_params()
- freedreno/a6xx: Handle driver-params in GS/DS
- freedreno/a6xx: Do clip-plane lowering in backend
- freedreno/gmem: Reverse order of alternative tile rows
- util: Fix c++ enum casting pickiness
- libsync: Fix compiler warning
- freedreno/a6xx: Remove fd6_format.[ch]
- freedreno/a6xx: Fix enum tag
- freedreno: Extract common helper macros
- freedreno/ir3: Remove unneeded forward declaration
- freedreno/autotune: Make 'offset' macro "private"
- freedreno/drm: Combine upper and lower 32b of OR val
- freedreno/a6xx: De-open-code CACHE_FLUSH enum
- freedreno: Use enum for primtypes table
- freedreno/ci: Update unit test reference decodes
- freedreno/registers: Whitespace fix for gen_header.py
- freedreno/registers: Allow varset to be specified on enum
- freedreno/registers: Move varset to <enum>
- freedreno/registers/a6xx: Some reg64 conversion
- freedreno/drm: Fix potential bo cache vs export crash
- freedreno/gmem: Fix col0 calc
- llvmpipe: Add some missing locking
- freedreno: We really don't need aligned vbo's

Roman Stratiienko (3):

- Android.mk: Fix c11-related build failures
- v3dv: Enable sync_fd importing/exporting on Android
- v3dv: Limit API version to v1.0 for Android

Ruijing Dong (6):

- radeonsi/vcn: Add support of array_mode for gfx11
- radeonsi/vcn: update av1 decoding to support vcn4
- radeonsi/vcn: prepare for unified queue in vcn4
- radeonsi/vcn: support unified queue in vcn4
- frontends/va: do texture_map when needed
- frontends/va: WA for ffmpeg 10bit encoding crash

Ryan Neph (5):

- venus: update venus-protocol with VK_EXT_image_view_min_lod
- venus: enable VK_EXT_image_view_min_lod
- venus: support VK_KHR_copy_commands2
- venus: refactor VkCommandBufferBeginInfo fixups to function
- venus: add support for VK_KHR_dynamic_rendering

Sagar Ghuge (1):

- anv: Disable storage image compression for possible atomic ops

Sami Kyöstilä (2):

- subprojects: Roll Perfetto to v27.1
- util: Shut down Perfetto before driver unload

Samuel Pitoiset (213):

- radv: exclude PRIMITIVE_{COUNT,INDICES} from the per-vertex output mask
- radv: use shader_info::writes_memory
- radv: use shader_info::gs::active_stream_mask
- radv: use shader_info::inputs_read/outputs_written for FS IO
- radv: use shader_info::outputs_written/per_primitive_outputs for VS outputs
- radv: use shader_info::system_values_read
- radv: fix enabling adjust_frag_coord_z and apply per-pipeline
- radv: add support for independent descriptor set layouts
- radv: add support for VkShaderModuleCreateInfo as pNext
- radv: fix initializing pipeline_key::topology for GFX9 and older
- radv: mark all active stages earlier in the pipeline creation path
- radv: use the hardware primitive topology everywhere
- radv: initialize the vertex input interface state in only one place
- radv: do not support UNIFORM_TEXEL_BUFFER with SRGB
- radv: only apply enable_mrt_output_nan_fixup for 32-bit float MRTs
- radv/ci: update the flakes list for GFX9 chips
- aco: fix load_barycentric_at_{sample,offset} on GFX6-7
- nir: fix marking XFB varyings as always active IO
- nir: mark XFB varyings as unmoveable to prevent them to be remapped
- radv: clarify why STAGE_2_CLEAR_BIT needs to wait for CP DMA to be idle
- radv: remove redundant VK_PIPELINE_STAGE_2_TRANSFER_BIT for CP DMA idle
- radv: add an SQTT workaround for chips with disabled RBs
- radv: fix handling divisor == 0 with dynamic vertex input state
- radv: allow to disable sinking of load inputs for FS via drirc
- radv: enable radv_disable_sinking_load_input_fs for Grid Autosport
- radv: use correct push constants range for internal operations
- radv/ci: update list of expected failures for Bonaire (GFX7)
- radv/ci: remove empty flakes lists for GFX9
- radv: fix the number of generated primitive queries with NGG GS vs legacy
- radv/ci: stop skipping dEQP-VK.synchronization.* on Bonaire
- radv: use 3D views for 3D internal operations on GFX6-8
- radv: re-emit dynamic line stipple state if the primitive topology changed
- radv/radix: handle intentional allocation failures properly
- ac,radeonsi: add has_sqtt_auto_flush_mode_bug
- radv/sqtt: fix configuring AUTO_FLUSH_MODE on GFX10.3
- radv: only init acceleration structure if RT is enabled
- radv: disable instance packing to fix pipeline query stats
- radv: remove useless check against nir_texop_samples_identical
- radv/winsys: allow to use RADV_FORCE_FAMILY=gfx1100
- radv: disable DCC for Senra Kagura Shinovi Versus
- aco: recognize GFX11 in few places
- aco: do not align VGPRS to 8 or 16 on GFX11
- radv,aco: add support for packed threadID VGPRs on GFX11
- radv,aco: use the new TCS WaveID SGPR to compute vs_rel_patch_id on GFX11
- aco: export MRT0 instead of NULL on GFX11
- aco: do not set DLC for loads on GFX11
- aco: do not set GLC stores on GFX11
- aco: update LDS allocation granularity for PS on GFX11
- aco: update waitcnt on GFX11
- aco: do not set RESOURCE_LEVEL for buffer descriptors on GFX11
- aco: do not set COMPR for exports but use 0x3 channel mask on GFX11
- radv,aco: export alpha-to-coverage via MRTZ on GFX11
- aco: only retrieve the scratch offset when it's declared
- radv,aco: do not implicitly export the primitive ID for mesh shaders
- aco: remove unreachable code about viewport index/layer and mesh shaders
- radv,aco: add a workaround for binding 2D views of a 3D image on GFX9
- radv: implement VK_EXT_image_2d_view_of_3d
- radv: advertise VK_EXT_image_2d_view_of_3d
- radv: do not declare the scratch offset on GFX11
- aco: do not emit the primitive ID twice for NGG VS or TES with GS
- radv: do not align VGPRS to 8 or 16 on GFX11
- radv: use the new format table on GFX11
- radv: update scratch buffer registers on GFX11
- radv: update sampler registers on GFX11
- radv: update buffer descriptor registers on GFX11
- radv: update image descriptor registers on GFX11
- radv: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE on GFX11
- radv: configure DB_RENDER_CONTROL on GFX11
- radv: do not emit non-existent CP_COHER_START_DELAY on GFX11
- radv: do not emit FLUSH_AND_INV_DB_META on GFX11
- radv: update TF_RING_size to a per-SE size on GFX11
- radv: update VRS registers on GFX11
- radv: apply a workaround for CB perf counters on GFX11
- radv: limit CP DMA to max 32KB sizes on GFX11
- radv: more register changes on GFX11
- radv: do not lower loading TESS/ESGS rings using the ABI for LLVM
- aco: use ac_is_llvm_processor_supported() for checking LLVM asm support
- radv: rename shader compile functions to spirv_to_nir/nir_to_asm
- radv: do not try to dump the NIR of the trap handler shader
- radv: add new pipeline helpers for NIR->ASM compilation
- radv: init states from pTessellationState at only one place
- radv: init states from pViewportState at only one place
- radv: init states from pRasterizationState at only one place
- aco: remove unnecessary intrinsics that are lowered at the ABI level
- radv: determine if shaders use NGG before linking
- radv: export implicit primitive ID in NIR for legacy VS or TES
- radv: constify radv_pipeline in more radv_pipeline_generate_XXX() helpers
- radv: rename radv_pipeline_generate_XXX() to radv_pipeline_emit_XXX()
- radv: remove couple of useless pCreateInfo pointers in the emit path
- radv: remove unused radv_get_conservative_raster_mode() helper
- radv: move gs_table_depth to the physical_device
- radv: cleanup using device/physical_device during pipeline creation
- radv: split radv_pipeline into radv_{graphics,compute,library}pipeline
- radv: introduce radv_graphics_pipeline_info and split existing info structs
- radv/ci: fix fails list for NAVI21
- radv: move HS info and task_num_entries to the physical device
- radv: rename radv_cmd_state::pipeline to graphics_pipeline
- radv: unify radv_pipeline_has_XXX() helpers
- radv: move streamout_shader to radv_graphics_pipeline
- radv: move active_stages to radv_graphics_pipeline
- radv: rename VERT_GRP_SIZE to VERTS_PER_SUBGRP on GFX11
- radv: do not emit more non-existent registers on GFX11
- radv: update framebuffer registers on GFX11
- radv: update VRS rates on GFX11
- radv: use the fragment resolve path by default on GFX11
- radv: report adjusted LDS size for fragment shaders on GFX11
- radv: fix configuring COLOR_INVALID on GFX11
- radv: fix VK_BLEND_FACTOR_CONSTANT_COLOR translation on GFX11
- radv: configure DB_Z_INFO.NUM_SAMPLES correctly on GFX11
- radv: disable VK_AMD_shader_fragment_mask on GFX11
- radv: disable attachementFragmentShadingRate on GFX11
- radv: use pipeline->slab_bo in more places
- radv: fix writing buffer markers with non-zero memory offset
- radv: copy viewport/scissor when initializing radv_viewport_info
- radv: init states from pMultisampleState at only one place
- radv: init states from pDepthStencilState at only one place
- radv: init states from VkPipelineRenderingCreateInfo at only one place
- radv: use AMD values for computing blend related state
- radv: init states from pColorBlendState at only one place
- radv: init states from VkAttachmentSampleCountInfo at only one place
- radv: init states from VkPipelineDiscardRectangleState at only one place
- radv: init states from VkPipelineFragmentShadingRateState at only one place
- radv: ignore DYNAMIC_STENCIL_OP if stencil test isn't enabled
- radv: reduce radv_tessellation_info::patch_control_points to 8-bit
- radv: reduce radv_input_assembly_info::primitive_topology to 8-bit
- radv: reduce radv_rasterization_info::polygon_mode to 8-bit
- radv: reduce radv_discard_rectangle_info::count to 8-bit
- radv: fix lowering GS intrinsics if NGG is disabled per pipeline
- radv/ci: update list of failures for Pitcairn
- radv: enable radv_zero_vram for Hammerting
- radv/winsys: add get_chip_name() to the null winsys
- radv: report the marketing name as part of the device name
- radv: rename ngg_gs_state to ngg_query_state
- radv: track if primitives generated query features are enabled
- radv: add primitives_generated_query to the graphics pipeline key
- radv: declare the NGG query argument for primitives generated query
- ac/nir/ngg: count the number of generated primitives for VS and TES
- radv: flush the NGG query state when the argument is declared
- radv: add few helpers related to streamout
- radv: implement VK_EXT_primitives_generated_query
- radv: advertise VK_EXT_primitives_generated_query
- radv/ci: rename deqp-XXX jobs to vkcts-XXX
- zink/ci: skip two KHR-GL46 tests that timeout most of the time with RADV
- zink/ci: update list of expected failures with RADV
- radv: remove redundant check when importing vertex input info
- radv: update radv_is_vrs_enabled() to use radv_graphics_pipeline_info
- radv: update the check to determine if dynamic discard rectangle is needed
- radv: update the check to determine if dynamic sample location is needed
- radv: update the check to determine if rasterization is enabled
- radv: remove remaining unused pCreateInfo pointers
- ci: bump the hang-detection tool in the test image for Vulkan
- ci: fix passing down MESA_LOADER_DRIVER_OVERRIDE for zink-anv-tgl
- zink/ci: change the surface type used for dEQP
- ci/valve: split .b2c-test into .b2c-test-vk and .b2c-test-gl
- radv/ci: add GLES/GLCTS testing with Zink on NAVI10
- zink/ci: update list of failures again
- radv/ci: refactor test machines and dEQP rules to reduce copy&paste config
- ci: update vkd3d-proton builder/runner
- ci: print a message when vk3d-proton returns successfully
- ci: add VKD3D_PROTON_RESULTS to the list of variables to pass down for testing
- radv/ci: add vkd3d-proton testing
- radv: advertise VK_EXT_border_color_swizzle on GFX10+
- ci: uprev vkd3d-proton to 39d07dea2cef34bfb3ed39741f026bc637e3eec4
- radv/amdgpu: make sure to reset the number of BO when there is no ranges
- radv/amdgpu: do not add "global" BO to the virtual BO list
- aco: fix validation of SOP1 instructions without definitions
- radv,aco: rename radv_prolog_binary/radv_shader_prolog to shader_part
- radv/llvm: always emit a null export even if the FS doesn't discard
- radv: disable small primitive culling for user sample locations
- radv: use RADEON_FLAG_VA_UNCACHED for the trace BO
- radv: fix command line for dumping waves with UMR
- radv: dump UMR waves before UMR rings
- radv: do not emit SQTT user data packets on the SDMA queue
- radv/ci: add CI lists for LLVM on NAVI21
- radv: use LOAD_CONTEXT_REG to load the opaque buffer size on GFX10+
- radv: do not abort if SPM isn't supported for the current GPU
- radv: remove old workaround for HTILE layers with F1 2021
- zink/ci: update list of failures for piglit with RADV
- radv: disable DCC for Melty Blood Actress Again Current Code
- radv: always enable VK_EXT_debug_utils
- zink/ci: update list of failures
- radv/ci: update list of failures against CTS 1.3.3.0
- aco: fix load_barycentric_at_sample without MSAA
- radv: fix dumping VS prologs assembly
- radv/ci: enable fossils testing for GFX1100
- radv: remove the radv_report_apu_as_dgpu workaround for Red Dead Redemption 2
- radv: fix wide points/lines by configuring the guardband correctly
- zink/ci: update list of CTS flakes for RADV
- ci: uprev vkd3d-proton to 5b73139f182d86cd58a757e4b5f0d4cfad96d319
- radv,aco: track if a fragment shader needs an epilog
- radv,aco: introduce {radv,aco}_ps_epilog_key
- radv: declare a new user SGPR arg in FS for the epilog PC
- radv: add a function that declares PS epilog shader arguments
- aco: add new pseudo instruction p_jump_to_epilog
- aco: emit p_jump_to_epilog if the main fragment shader has an epilog
- aco: do not abort if the FS doesn't export anything but has an epilog
- aco: prevent adding DONE/VM to the last export if the FS has an epilog
- aco: add aco_postprocess_shader() helper
- radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part
- aco: refactor export_fs_mrt_color() for PS epilogs preparation
- aco: add support for compiling PS epilogs
- radv: implement PS epilogs
- radv: disable viewport depth clamping only when necessary
- aco: requires Exact for p_jump_to_epilog
- radv: only force 1x sample for Bresenham lines when pipeline draws lines
- vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT
- radv: implement VK_EXT_attachment_feedback_loop_layout
- radv: ignore out-of-order rasterization if stencil write mask is dynamic
- radv: fix gathering XFB info if there is dead outputs
- radv: fix cleaning the meta query state if an error occured
- radv: re-emit viewports if negative one to one or depth clamp mode changed
- radv: fix bogus assertion with RADV_FORCE_VRS
- radv: fix pipelineStageCreationFeedbackCount when it's 0

Sarah Walker (2):

- pvr: Update for firmware 1.17\@6256262
- pvr: Move BRN 44079, 48492 and 66011 code into pvrsrvkm specific directory

Sathishkumar S (2):

- radeon/vcn: skip create and destroy message for jpeg
- radeon/vcn: engage all available jpeg engines

Sebastian Keller (1):

- egl/wayland: Don't try to access modifiers u_vector as dynarray

Sergi Blanch Torne (1):

- ci/crosvm: clean crosvm socket

Sidney Just (19):

- wgl: add a flag to determine if running on zink
- wgl: add zink to the list of auto-loaded drivers
- zink: support VK_KHR_win32_surface
- kopper: add win32 loader interface
- zink: support win32 wsi
- wgl: support kopper
- gallium: support for implementing EXT_external_objects_win32
- mapi: added EXT_external_objects_win32 definitions
- gallium/noop: implement win32 device LUID and node mask getters
- gallium/trace: implement win32 device LUID and node mask getters
- mesa: implement GL_DEVICE_LUID_EXT and GL_DEVICE_NODE_MASK_EXT getters
- mesa: implement import win32 memory and semaphore
- mesa: pipe cap for EXT_memory_object_win32 and EXT_semaphore_win32
- zink: codegen for Win32 external object extensions
- zink: implement win32 fence import
- zink: implement win32 memory handle import
- zink: implement device LUID and node mask
- zink: enable pipe cap for win32 external memory and fences
- docs: updated zink features and release notes

Sil Vilerino (39):

- gallium/vl: Add software winsys and offscreen winsys
- util/u_format: Drop assert that has valid/well-defined behavior
- util/vl_vlc: Support compiling in C++
- st_vdpau: Pass format when opening resource from handle in st_vdpau_resource_from_description
- d3d12: Add .clang_format file
- gallium/va/radeonsi: Using private as a parameter name conflicts with C++ keywords
- gallium va: VaDeriveImage support stride/offset being different for NV12 planes
- gallium: Add values to pipe_video_cap for multi-slice and multi-reference encode
- gallium va: Handle new VA attributes with new pipe video caps
- gallium: Add multiple slice support to pipe_h264_enc_picture_desc
- gallium va: Add support for multiple slices encoding
- gallium radeon/r600/omx/va: Adds support for multiple reference encoding
- gallium: Add MinLumaBiPredSize8x8 to pipe_h264_sps, pic_init_qs_minus26 to pipe_h264_pps
- gallium omx: Fill out MinLumaBiPredSize8x8 and pic_init_qs_minus26
- gallium vdpau: Fill out level_idc and MinLumaBiPredSize8x8
- gallium va: VaHandlePictureParameterBufferH264 fill out new pipe params MinLumaBiPredSize8x8, pic_init_qs_minus26, chroma_format_idc, bit_depth_chroma, bit_depth_chroma_minus8
- d3d12: Add d3d12_promote_to_permanent_residency
- ci: Update x86 debian build to pick up tag v1.602.0-r1 from DirectX-Headers
- d3d12: Add video decode implementation of pipe_video_codec
- d3d12: Add video encode implementation of pipe_video_codec
- d3d12: Add create_video_codec and create_video_buffer entrypoints
- d3d12: Add util video functions to d3d12_format
- d3d12: Improve planar resource support to handle video requirements
- d3d12: Add support for d3d12 video in d3d12_screen
- d3d12: Add GLSL singleton refcounting to screen for offscreen non-GL rendering
- meson: Support d3d12 as a video-supporting driver
- winsys: Do not use Display type when X11 is not present in build
- meson: Update DirectX-Headers dependency to 1.602.0 version with fallback to the wrap
- meson: Add build option for gallium-d3d12-video feature
- d3d12: Align output buffer offset access to D3D12_FEATURE_DATA_VIDEO_ENCODER_RESOURCE_REQUIREMENTS.CompressedBitstreamBufferAccessAlignment
- d3d12: Fix debug_printf format strings for cross platform builds
- d3d12: Video - Remove unused spCopyQueues from enc/dec objects
- pipe/video: Add PIPE_VIDEO_ENTRYPOINT_PROCESSING
- vl: Replace usage of entrypoint UNKNOWN with PROCESSING for VP
- va: Replace usage of entrypoint UNKNOWN with PROCESSING for VP
- gallium/video: Add video post processing interface
- va: Add support for VPP rotation, flip, alpha blend, crop, scaling
- d3d12: Add pipe_video_codec::process_frame implementation
- d3d12: Enable VPP rotation, flip, alpha blend, crop, scaling via pipe_video_codec::process_frame

SoroushIMG (4):

- zink: Fix BO size when it's not aligned to 16 bytes
- zink: Fix spirv stream 0 vertex emit for multistream shaders
- Zink: Fix clear being missed when using emulated draws in zink_blit
- zink: Fix incorrect emission of SPIR-V shift ops

SureshGuttula (4):

- radeonsi: Set display_remote for non-refernced frames
- radeonsi/vcn : update enc->dpb ref_use for index 0
- Revert "radeonsi: Set display_remote for non-refernced frames"
- Revert "radeon: hardcode uvd/vce encoder not_referenced value to false"

Sviatoslav Peleshko (7):

- mesa: flush bitmap caches when changing scissors or window rects state
- anv: workaround apps that assume full subgroups without specifying it
- intel/blorp/gen6: Set BLEND_STATEChange only if emitting the blend state
- anv: Dirty all dynamic state bits when creating command buffer state
- intel/blorp: Dirty depth bounds dynamic state bits after blorp
- intel/nullhw: Use correct macro to fix build regression
- iris: Always initialize shader compilation queue ready fence

TSnake41 (1):

- zink: print result code string on vulkan failure

Tapani Pälli (5):

- isl: disable mcs (and mcs+ccs) for color msaa on DG2
- anv: use anv_cmd_dirty_mask_t type for dynamic state
- intel/fs: setup SEND message descriptor from nir scope
- iris: implement Wa_14015264727 for DG2
- anv: implement Wa_14015264727 for DG2

Tatsuyuki Ishi (9):

- radv: Fix redundant subpass barriers due to erroneous comparison
- radv/ci: skip dEQP-VK.fragment_operations.transient_attachment_bit
- radv/ci: Move transient_attachment_bit from fail to skip list
- radv/ci: skip image.sample_texture.*_compressed_format*
- radv/ci: Move sample_texture.*_compressed_format_* to faillist for gfx<=9
- radv: Fix vkCmdCopyQueryResults -> vkCmdResetPool hazard.
- amd: Revert gfx10 addrlib changes
- radv: Only set pstate for the first hw_ctx.
- radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs.

Thomas Debesse (2):

- gallium/clover: LLVM setLangDefaults moved from clangFrontend to clangBasic
- gallium/clover: pass -no-opaque-pointers to Clang

Tiago Koji Castro Shibata (2):

- d3d12: fallback to integrated adapter instead of arbitrary one
- d3d12: add more formats to supported conversions

Timothy Arceri (71):

- glsl/st: vectorise interfaces of SSO shader programs
- nir: fix setting varying from uniform as flat
- nir: fix sorting before assigning varying driver locations
- nir: always set the exact_trip_count_unknown loop terminator property
- nir: remove unreachable loop terminators
- svga: remove duplicate nir compile options
- nir/glsl: add glsl_record_compare() wrapper
- nir/glsl: wrapper contains_{double,interger}()
- nir/glsl: wrapper field_index()
- nir/glsl: wrap component_slots_aligned()
- glsl/mesa: move parse_program_resource_name() to common linker_util code
- glsl: add new build program resource helpers
- nir: add variable data fields required for NIR glsl varying linking
- nir: abort io info gathering if location is not set or is a temp value
- nir: skip lowering io to scalar for must_be_shader_input
- glsl: implement lower_packed_varyings() as a NIR pass
- glsl: add a NIR based varying linker
- glsl: implement opt_dead_builtin_varyings() as a NIR pass
- glsl: implement lower_xfb_varying() as a NIR pass
- glsl: enable the use of the nir based varying linker
- glsl: remove now unused GLSL IR varying linker code
- glsl: simplify finding cursor in varying packing code
- nir: add support for forced sampler indirect loop unrolling
- gallium/drivers: set force_indirect_unrolling_sampler for all required drivers
- glsl: move validation of sampler indirects to the nir linker
- i915g: use a valid setting for force_indirect_unrolling
- softpipe: switch to NIR loop unrolling
- asahi: switch to NIR loop unrolling
- d3d12: switch to NIR loop unrolling
- etnaviv: switch to NIR loop unrolling
- i915g: switch to NIR loop unrolling
- r300: disable GLSL IR loop unrolling
- r600: switch to NIR loop unrolling
- vc4: disable GLSL IR loop unrolling
- nir/i915g/r300/nv30: skip marking varyings as flat in some drivers
- etnaviv: assert if etna_shader_io_file reg overrun
- etnaviv: vectorise io
- nir/gcm: fix pushing instructions into if blocks
- ci: uprev piglit 2022-05-31
- nir: add nir based version of the lower_const_arrays_to_uniforms pass
- glsl/nir: skip adding hidden uniforms to the remap tables
- glsl/nir: allow the nir linker to remove dead uniforms we created
- glsl: move common link time optimisation calls to linker code
- glsl: move gl_nir_link_opts() call out of the st code
- glsl: switch to NIR based implementation of lower_const_arrays_to_uniforms()
- glsl: remove now unused lower_const_arrays_to_uniforms()
- gallivm: disable GLSL IR loop unrolling in LLVMPIPE
- freedreno/ir3: tidy up duplication of common nir options
- freedreno: switch to NIR loop unrolling
- lima: lower all undefs to zero in vs
- lima: fixup nir indirect unroll options to match gallium CAP
- lima: switch to NIR loop unrolling
- nouveau/nv30: disable GLSL IR loop unrolling
- nouveau/nv50: disable GLSL IR loop unrolling
- nouveau/nvc0: disable GLSL IR loop unrolling
- svga: disable GLSL IR loop unrolling
- gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP
- glsl: remove the now unused GLSL IR loop unrolling code
- glsl: drop extra optimise swizzles call
- glsl: tidy up link_varyings_and_uniforms()
- glsl: remove never true do_dead_code() parameter
- glsl: inline do_common_optimization() call
- util: add dri config option to disable GL_MAP_UNSYNCHRONIZED_BIT
- util: use force_gl_map_buffer_synchronized workaround with RAGE
- glsl: merge lower_buffer_access with lower_shared_reference
- glsl: drop non-nir path for atan in builtin functions
- st/glsl: fix broken vertex attrib mapping
- nir/loop_unroll: clean up after complex_unroll_single_terminator()
- glsl: correctly track cross slot component packing
- Revert "nir: Preserve offsets in lower_io_to_scalar_early"
- glsl: dont lower precision for textureGatherOffsets

Timur Kristóf (119):

- radv: Minor formatting fix in radv_device.c file.
- radv: Move queue submit sparse bindings to a separate function.
- radv: Move empty queue submit code path to a separate function.
- radv: Move normal (non-empty) queue submit to a separate function.
- radv: Move up early exit for transfer queues in radv_get_preamble_cs.
- radv: Simplify some boolean code in radv_get_preamble_cs.
- radv: Initialize BO pointers when creating preambles.
- radv: Refactor cache flush code for the initial preambles.
- radv: Don't create continue preamble when it's not needed.
- radv: Rename fill_geom_tess_rings to radv_fill_shader_rings.
- radv: Don't use pointers to pointers when updating the preambles.
- radv: Only emit what is appropriate to the queue family for preambles.
- ac/nir: Add ac_nir_load_arg helper for shader arguments.
- radv: Move radv_nggc_settings enum out of radv_cmd_buffer.
- nir: Add upper bound for AMD shader arg intrinsics.
- radv: Lower ABI in NIR for tess/ESGS/NGG shader arguments.
- ac/nir: Remove now-superfluous ac_nir_lower_tess_to_const.
- aco: Remove now-superfluous intrinsics.
- ac/llvm: Remove now-superfluous intrinsics and ABI callbacks.
- radv/amdgpu: Group queue submit info into a structure.
- radv/amdgpu: Initialize semaphore info with a designated initializer.
- radv/amdgpu: Pass new queue submit info structure to internal function.
- radv/amdgpu: Pass correct struct type instead of repeating the cast.
- radv/amdgpu: Use scheduled dependency for submitting to multiple queues.
- ac/nir: Add I/O lowering for task and mesh shaders.
- ac: Add task ring entry shader argument.
- radv: Add radv_pipeline_has_task helper.
- radv: Set user data register for task shaders.
- radv: Set wave size for task shaders.
- radv: Fill task shader info.
- radv: Add task shader arguments.
- radv: Add task ring entry argument for mesh shaders.
- radv: Implement task shader intrinsics in the ABI.
- radv: Enable nir_opt_offsets for task shaders.
- radv: Use I/O lowering for task and mesh shaders.
- radv: Postprocess task shader configuration.
- radv: Allow linking task shaders.
- radv: Lower shared and task_payload variables in task/mesh shaders.
- radv: Print task shader stage name before disasm.
- radv: Fix task shader info.
- radv: Fix loading task shader ring buffer addresses.
- aco: Fix scratch with task shaders.
- radv: Introduce ring info structure for queues, refactor preamble generation.
- radv: Use a ws variable during preamble creation.
- radv: Pass radv_device pointer to some functions.
- radv: Refactor queue state to separate structure.
- radv: Clarify emitting graphics shader pointers.
- nir: Add explicit task payload atomic intrinsics.
- radv: Implement conditional rendering for async compute queue.
- ac: Remove trailing white space.
- aco: Remove trailing whitespace.
- radv: Remove trailing whitespace.
- radv: Disable predication for supass clear and image clears.
- nir: Add new launch_mesh_workgroups intrinsic.
- nir: Add common task shader lowering to make the backend's job easier.
- radv: Add mesh and task stage names to pipeline executable properties.
- nir: Keep track of cross-invocation mesh shader output access.
- radv, ac/nir: Fix multiview layer export for mesh shaders.
- ac/nir/taskmesh: Use task shader lowering from common NIR code.
- ac/nir/taskmesh: Use 3 dimensional workgroup ID.
- ac/nir: Add remappability to tess and ESGS I/O lowering passes.
- ac: Add RDNA2 task+mesh shader draw packet opcodes.
- ac: Add task shader ring information.
- radv: Implement task shader draw and payload rings.
- nir: Introduce new intrinsics for AMD specific mesh shader task ring.
- ac/nir/ngg: Clean up mesh shader output LDS layout.
- ac/nir/ngg: Use mesh shader scratch ring when outputs don't fit LDS.
- ac/nir/ngg: Use variables for outputs without cross-invocation access.
- radv: Implement mesh shader scratch ring.
- radv: Enable NGG wave ID for mesh shader scratch ring.
- ac/nir/ngg: Add mesh shader workgroup index.
- radv: Lower mesh shader 3D workgroup ID to 1D index.
- ac/nir/taskmesh: Preserve workgroup ID Y and Z when applying firstTask.
- radv: Always use 3D block ID and grid size in task shaders.
- gallium/u_blitter: Fix depth.
- zink: Enable the VK_EXT_depth_clip_enable extension.
- zink: Always enable depth clamping, make depth clipping independent.
- nir/lower_task_shader: don't use base index for shared memory intrinsics
- ac/nir/ngg: Refactor LDS instructions in NGG GS vertex emit and export.
- ac/nir/ngg: Ignore driver location for mesh shader outputs.
- radv: Don't assign driver locations to mesh shader outputs.
- radv: Add CULL_PRIMITIVE to special output mask.
- radv: Use NIR optimization to move discards to the top.
- radv: Only initialize DGC state when DGC is enabled.
- radv: Remove trailing whitespace introduced by DGC commits.
- radv: Refactor predication for compute queues.
- radv: Refactor some CP DMA functions to work with radeon_cmdbuf.
- radv: Refactor radv_emit_userdata_address to work with radeon_cmdbuf.
- radv: Refactor radv_emit_descriptor_pointers to work with radeon_cmdbuf.
- radv: Refactor radv_emit_inline_push_consts to work with radeon_cmdbuf.
- radv: Refactor view index emit to use a per-stage function.
- radv: Move inline push constants to a new function.
- radv: Minor cleanup of radv_queue_submit_normal.
- radv: Allow reusing pipeline compute state emit functions.
- radv: Add dispatch_initiator_task field to radv_device.
- radv: Create internal cmdbuf when a graphics pipeline needs compute.
- radv: Flush descriptors and push constants for task shaders.
- radv: Introduce radv_before_taskmesh_draw.
- radv: Implement mesh shading draw calls with task shaders.
- radv: Synchronization for task shaders.
- radv: Support task shaders in secondary cmd buffers.
- radv: Workaround MEC taskmesh dispatch hang when count buffer has zero.
- radv: Submit internal compute cmdbuf.
- radv: Copy BO list to ACE internal CS.
- radv: Enable task shader feature for NV_mesh_shader.
- ac/nir/ngg: Move primitive ID workgroup barrier to proper place.
- aco: Remove hack for primitive ID export.
- ac/nir/ngg: Create output variable for primitive ID export.
- ac/llvm: Implement load_num_subgroups for NGG shaders.
- ac/llvm: Add LLVM bug workaround to ac_build_mbcnt_add.
- ac/llvm: Use gs_prim_id for NGG VS.
- ac/nir/ngg: Copy comment about LDS layout for NGG GS.
- ac/nir/ngg: Remember proper bit sizes of GS output variables.
- ac/llvm: Implement GDS atomic add NIR intrinsic.
- radv/llvm: Remove incorrect hardcoded workgroup size from NGG GS.
- radv: Properly set LDS size for LLVM NGG shaders.
- radv: Use NIR-based NGG lowering with LLVM.
- aco: Fix p_init_scratch for task shaders.
- nir/gather_info: Clear cross-invocation output mask.

Tomeu Vizoso (4):

- lima/ci: Disable lima-mali450-piglit-gpu:arm64
- panvk/ci: Disable CI for a while
- ci: Disable jobs to the Collabora lab
- Revert "ci: Disable jobs to the Collabora lab"

Vadym Shovkoplias (4):

- anv: Fix geometry flickering issue when compute and 3D passes are combined
- driconf: Add a limit_trig_input_range option
- drirc: Set limit_trig_input_range option for the Wolfenstein Youngblood
- drirc: Set limit_trig_input_range option for glmark2

Vasily Khoruzhick (7):

- lima: Implement texture_barrier
- lima: enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
- lima: Lower nir_op_idiv with other scalar math operations.
- u_transfer_helper: flush temporary transfer first for MSAA
- lima: implement lima-specific blitter
- lima: wire up MSAA 4x support
- lima: fix buffer overallocation for index, vertex and constant buffers

Victor Hermann Chiletto (1):

- radv: always check entry count in descriptor pool when allocating

Viktoriia Palianytsia (1):

- anv: Add workaround for sample mask with multisampling

Vinson Lee (14):

- panfrost: Fix memory leaks on unit test failures.
- zink: Fix memory leak on error path.
- radeonsi: Move NULL check before dereference.
- dri2: Fix memory leak.
- d3d12: Initialize d3d12_video_encoder_bitstream member m_uiOffset.
- amd: Initialize Gfx11Lib members in constructor.
- d3d12: Remove unnecessary NULL check.
- clc: Fix build with llvm-15.
- microsoft/spirv_to_dxil: Fix missing-prototypes errors.
- radv: Remove unnecessary null check.
- microsoft/compiler: Fix assert.
- tu: Check dereferenced value of rop_reads_dst.
- aco: Initialize spill_ctx members in constructor.
- r600/sfn: Initialize TestShaderFromString member m_instr_factory.

Vlad Zahorodnii (2):

- winsys/amdgpu-radeon: Allow specifying context priority
- radeonsi: Add support for EGL_IMG_context_priority

Yevhenii Kolesnikov (1):

- nir: Remove single-source phis before opt_if_loop_last_continue

Yiwei Zhang (51):

- venus: override aspectMask for internal tiling modifier
- venus: use linear modifier for legacy common wsi path
- venus: fix view format for ahb image
- venus: refactor to add struct vn_env
- venus: add env perf options and introduce no_async_set_alloc
- venus: add VN_PERF option no_async_buffer_create
- venus: add VN_PERF option no_async_queue_submit
- venus: refactor vn_cmd_submit
- venus: flush when batched draw calls reach a threshold
- venus: disable VK_EXT_extended_dynamic_state2
- venus: refactor android gralloc pieces
- venus: cache front_rendering_usage bit at gralloc init
- venus: update vn_GetSwapchainGrallocUsage2ANDROID for shared present
- venus: enable ANB shared presentable image prop
- venus: add VN_RELAX_BASE_SLEEP_US env var for tuning cpu utils
- venus: log performance envs upon instance creation with debug init
- venus: extend buffer cache to cover layering usage
- vulkan: remove the VkPhysicalDeviceMemoryProperties workaround
- radv: remove VkPhysicalDeviceMemoryProperties2 workaround
- venus: use common entry point for VkPhysicalDeviceMemoryProperties
- venus: resolve AHARDWAREBUFFER_FORMAT_Y8Cb8Cr8_420 to external format
- venus: fix a bug on buffer cache init failure path
- venus: fix the queue init failure path
- venus: fix a missing mtx_destroy in vn_device_init
- venus: adjust device resources init order
- venus: let device track queue families used for device creation
- venus: add vn_feedback_pool backed by coherent buffer (part 1)
- venus: add vn_feedback_pool backed by coherent buffer (part 2)
- venus: add feedback check/set/reset helpers
- venus: add event feedback
- venus: add NO_FENCE_FEEDBACK perf option and disable sparse resource
- venus: create per queue family command pool for feedback cmd
- venus: renderer to store allow_vk_wait_syncs capset
- venus: add fence feedback
- venus: resolve YV12 AHB to VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM
- venus: use narrow range to match up with mesa EGL
- venus: swizzle the chroma channels for YVU420 to match the VkFormat
- venus: GetPhysicalDeviceSparseImageFormatProperties to match sparse support
- venus: suballocate more for layering
- venus: add more tracepoints for perf analysis
- venus: filter out VK_EXT_physical_device_drm on the driver side
- venus: check dynamic state for raster enablement
- venus: reenable VK_EXT_extended_dynamic_state2
- Revert "venus: suballocate more for layering"
- anv: enable VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM for modifier support
- venus: fix external memory ext filtering
- venus: avoid feedback for external fence
- venus: allow no external memory handle when renderers lacks support
- zink: fix zink_create_fence_fd to properly import
- zink: fix in-fence lifecycle
- venus: ignore pInheritanceInfo if not secondary command buffer

Yogesh Mohan Marimuthu (12):

- radeonsi/gfx11: make flat_scratch changes for graphics
- radeonsi/gfx11: make flat_scratch changes for compute
- radeonsi/gfx11: interp changes for 32bit
- radeonsi/gfx11: interp changes for 16bit
- radeonsi/gfx11: instruction cache line size is 128 bytes
- radeonsi/gfx11: program inst_pref_size for compute
- radeonsi/gfx11: program db render control register
- radeonsi/gfx11: export alpha through mrtz for alpha-to-coverage if mrtz is there
- ac,radeonsi/gfx11: swizzle MRT0/1 for dual source blending
- vulkan/wsi: fix extra free if buffer_blit_queue
- radeonsi: remove tabs from code
- radeonsi: no need to call si_pm4_clear_state() in si_pm4_free_state()

Yogesh Mohanmarimuthu (1):

- radeonsi/gfx11: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE

Yonggang Luo (119):

- zink: Remove redundant framebuffer_mtx from zink_screen.h
- win32: Fixes 32 bits visual studio module definition files by add script gen_vs_module_defs.py
- win32: Do not use BUILD_GL32, we use def file to export win32 dll symbols.
- misc: Replace \`#ifdef\\t__cplusplus` with \`#ifdef\\s\\s__cplusplus`
- misc: replace \`#ifdef\\s\\s__cplusplus` with \`#ifdef\\s__cplusplus`
- zink: Trim spaces in zink.rst
- zink: Improve zink.rst with links
- st: Improve _mesa_error about GL_OUT_OF_MEMORY in st_cb_texture.c
- llvmpipe: Do not use _Atomic keyword that doesn't support by MSVC
- radv: fixes msvc compiling error
- radv: Fixes compiling error with msvc
- radv: Add losing member hw_ip_version_major and hw_ip_version_minor for win32
- radv: Add macros for win32 that accessed
- nir: Fixes MSVC compiling error about unused variable \`_`
- egl/wgl: On win32, there is no support for EGL_EXT_device and EGL_EXT_platform_device
- util: Fixes test_util_get_process_exec_path on windows host with msys2/mingw
- util: Fixes typo in test_util_get_process_exec_path
- ci/vs2019: Clear CI_COMMIT_MESSAGE and CI_COMMIT_DESCRIPTION for please meson
- radv: Remove usage of \`cnd_monotonic.h`
- d3d12: Fixes d3d12 compiling errors in \`D3D12ResourceState.h` for mingw
- d3d12: Move shared code that include d3d12 headers into d3d12_common.h
- d3d12: Fixes compiling error by include \`<unknwn.h>`
- d3d12: \`#include <windows.h>` instead \`#include <Windows.h>` for building under linux with mingw
- d3d12/wgl/test: Fixes wgl_tests.cpp for d3d12 with mingw
- dzn: Fixes compiling error when build with msys2/mingw
- dzn: Fixes compiling error by include \`<unknwn.h>`
- microsoft/clc: Fixes compiling of microsoft clc with mingw
- docs: Fixes dispatch.rst with __THREAD_INITIAL_EXEC
- meson: Fixes timespec_get detect code
- c11/threads: Remove the detecting code for \`HAVE_TIMESPEC_GET` in threads_win32.h
- c11: Implement c11/time.h with c11/impl/time.c
- c11: Implement timespec_get on win32 properly when not available
- util: Use timespec_get directly, it's always present
- util: Remove usage of designated initializers in timespec_test.cpp
- util: always enable timespec_test.cpp
- util: Remove unused function timespec_passed
- util: Add unittest for timespec_get
- ci/vs2019: There is no meaning to set $env:ErrorActionPreference
- ci/vs2019: switch to powershell 7
- ci/vs2019: Update editorconfig for ps1 files
- ci/vs2019: Give each option its own line in mesa_build.ps1
- ci/vs2019: Rename to vsInstallPath
- ci/vs2019: Do not install subprojects
- ci/vs2019: Ignore cert install output, it's too long
- ci/vs2019: Ignore error when removing directory recursively
- ci/vs2019: Disable progress when install python
- ci/vs2019: Disable progess bar for pip3 install
- ci/vs2019: Use shared VULKAN_SDK_VERSION for install and build vulkan related packages
- ci/vs2019: Add a dedicated script for entering VS dev shell
- ci/vs2019: Split call to C:\\vs_buildtools.exe into multiple lines
- ci/vs2019: Show the total vs2019 install time
- ci/vs2019: Reduce the vs2019 build tools components to be installed
- ci/vs2019: ignore _build and _install directory by write * into .gitignore
- ci/vs2019: Install all build artifacts into C:\\mesa-deps
- ci/vs2019: Split choco install output
- ci/vs2019: Split install of vs2019 and choco into separate Docker image
- ci/vs2019: Remove comment for windows_test_vs2019 that not true
- ci/vs2019: Convert mesa_build.ps1 to using PowerShell 7 && operator
- microsoft/compiler: Include stddef.h before using size_t
- microsoft/compiler: Fix error from double extern
- dxil: Fixes compiling dxcapi.h with mingw.
- dxil: Sync dxil_spirv_shader_stage with gl_shader_stage
- dxil: Fixes warning about comparing different enum value and enum literal
- microsoft/spirv_to_dxil: Delete unused local variable
- microsoft/clc: Delete unused local variables
- microsoft/clc: Add missing void to no-parameter function signature
- microsoft/spirv_to_dxil: Fixes maybe-uninitialized compiling error in dxil_spirv_nir.c
- ci: Building spirv-to-dxil on both mingw and debian/x64
- intel: using C++11 keyword thread_local
- glx: using C++11 keyword thread_local
- util: Define WIN32_LEAN_AND_MEAN before include of \`windows.h` in u_thread.h
- c11: Move the implementation of threads.h into c source code
- c11: Fixes memory leak of pack in thrd_create on win32
- c11: Implement thread_local in c11/threads.h
- util/c11: Update function u_thread_create to be c11 conformance
- meson: Using get_argument_syntax as the \`--compiler_id` option for gen_vs_module_defs.py
- meson: Use cc.get_argument_syntax instead cc.get_id when possible.
- meson/rtti: Using get_argument_syntax to check msvc compatible compiler
- meson/clover: Getting clover can be built with no-rtti llvm in mingw
- ci/vs2019: Upgrade Windows 10 SDK version to 20348
- docs: Update windows build requirement with Visual Studio
- d3d12: Use static_cast instead of dynamic_cast in d3d12_video_enc_h264.cpp
- d3d12: Convert #include <Windows.h> to #include <windows.h> for mingw on linux
- d3d12: Fixes compiling error in d3d12/wgl/d3d12_wgl_framebuffer.cpp with gcc
- microsoft/clc: Fixes narrowing error in clc_compiler_test.cpp with mingw/gcc
- microsoft/clc: Disable clc_compiler_test on non-windows platform
- dzn: Fixes incompatible pointer type error
- ci/x86_build: Getting pushd popd be paired, avoid using cd
- ci: Prepare the container for building all mesa components with mingw under linux
- ci: Trigger the new mingw/linux dockers to be build
- d3d12: Turn d3d12_format.h to include d3d12_common.h
- ci: Building all mesa functional with mingw on debian
- meson: Enable wgl tests on mingw
- CODEOWNERS: Update c11 code owners
- CODEOWNERS: evelikov renamed to xexaxo
- gallium: Move -DHAVE_PIPE_LOADER_DRI and -DHAVE_PIPE_LOADER_KMS to be pre_args
- gallium: Do not call to pipe_loader_sw_probe_kms when not present
- gallium: Rename macros HAVE_PIPE_LOADER*
- gallium: Disable dri2 interface on OSX
- glx: __glX_tls_Context should use __THREAD_INITIAL_EXEC
- util: thread_local doesn't support to be exported on OSX
- glx: Convert tab to space in applegl_glx.c
- dri: Trim trailing space in dri/dri_util.*
- ci: Building osmesa on OSX
- util: Disable tests/sparse_array_test.cpp on MacOS as it's too slow
- mapi: Trim trailing spaces in stub.c and u_current.c
- meson: Remove usage of use_elf_tls
- egl: Remove usage of USE_ELF_TLS macro
- mapi: Remove usage of USE_ELF_TLS
- loader: Remove usage of USE_ELF_TLS in loader.c
- glx: Remove usage of USE_ELF_TLS
- util: Remove usage of USE_ELF_TLS in u_thread.h
- meson: Remove pre_args += '-DUSE_ELF_TLS'
- glx: Remove usage of pipe_tsd_set and pipe_tsd_get
- auxiliary: Remove pipe_tsd
- d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library
- microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp
- util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func
- c11: #include <threads.h> when the os/platform provide it

Yurii Kolesnykov (2):

- Remove Travis CI config
- Init macOS GitHub Action

Zack Rusin (4):

- svga: Don't try to build x86/x64 assembly on different arch's
- svga: finish readbacks before mapping resources
- svga: Use direct maps when GB objects are present
- svga: Add support for SVGAv3

Zhang, Jianxun (1):

- iris: Wa_14016820455 for GFX_VERx10 == 12.5

illiliti (1):

- Use proper types for meson objects

jheaff1 (1):

- build(glx): Fix build by adding missing deps

naveen (1):

- github/ci: Set permissions for GitHub actions

newbluemoon (1):

- nine: replace ulimit with sysconf call

prashanth (1):

- vulkan/device_select: force default device if environment variable set

sjfricke (2):

- isl: fix bug where sb.MOCS is not being set
- anv: fix assert to build with shader cache disabled

xperia64 (1):

- wgl: Add driver_zink as a dependency of the wgl frontend

İlhan Atahan (1):

- Add Adreno 616 and 620 to use turnip on these GPU's .