Mesa 23.1.0 Release Notes / 2023-05-10
======================================

Mesa 23.1.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 23.1.1.

Mesa 23.1.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.

Mesa 23.1.0 implements the Vulkan 1.3 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.

SHA256 checksum
---------------

::

    a9dde3c76571c4806245a05bda1cceee347c3267127e9e549e4f4e225d92e992  mesa-23.1.0.tar.xz


New features
------------

- VK_EXT_pipeline_library_group_handles on RADV
- VK_EXT_image_sliced_view_of_3d on RADV/GFX10+
- VK_KHR_map_memory2 on ANV and RADV
- fullyCoveredFragmentShaderInputVariable on RADV/GFX9+
- VK_EXT_discard_rectangles version 2 on RADV
- VK_EXT_graphics_pipeline_library on RADV
- extendedDynamicState3ColorBlendEquation on RADV
- primitiveUnderestimation on RADV/GFX9+
- VK_KHR_fragment_shading_rate on RADV/GFX11
- VK_EXT_mesh_shader on RADV/GFX11
- RGP support on RADV/GFX11
- GL_NV_alpha_to_coverage_dither_control on r600/evergreen+


Bug fixes
---------

- [radeonsi] flickering debug chunk border lines in Minecraft
- radv, radeonsi: Rogue Legacy 2 alpha-to-coverage rendering issues
- [r600, TURKS] R600: Unsupported instruction: vec1 32 ssa_1 = intrinsic image_samples (ssa_0) on spec@arb_shader_texture_image_samples@compiler@fs-image-samples.frag (23.1.0-rc4)
- vulkan/device_select: no way to select between GPUs of the same model due to bugs
- Intel/anv: Modifier problems running gamescope embedded
- radv: 7900 XTX hair flickering/rendering issues in VaM
- radv: cache crashing
- nouveau: Regression in arb_transform_feedback_overflow_query-basic from multithreading patches
- radeonsi: vaapi: \`width >2880 && width % 64 != 0` results in wrong width in h265 stream
- [regression] iris: unable to use driver as secondary GPU (primary AMD GPU)
- iris: steam doesn't render on dg2
- [llvm 16+] [microsoft-clc] opencl-c-base.h does not exist
- Vulkancts clipping / tesselation tests trigger gpu hang on DG2
- Swaped fields in picture in vlc and mythtv if hw accel is on
- WGL: Assert assigns dwThreadId variable
- nine regression with r600 (bisected)
- [ACO] [RADV] Flickering squares in some areas in The Last of Us Part 1 (with workaround)
- radv: Jedi Fallen Order flickering & blocky plants
- nouveau: NV50 (NVAC) broken in latest master
- rusticl failed to build with rust-bindgen 0.65.0
- Regression, Bisected: glsl: Delete the lower_tess_level pass breaks r600 tesselation
- vkcts-navi21-valve failing often with GCVM_L2_PROTECTION_FAULT_STATUS:0x00X00830
- Deep Rock Galactic GPU freeze (AMD, DX11 DXVK Proton)
- radv: Resident Evil 4 Chainsaw Demo GPU hang with Navi 24
- radv: Gotham Knights GPU hang with Navi 24
- aco: s_load_dword with negative soffset cause GPU hang
- piglit.spec.ext_image_dma_buf_import.ext_image_dma_buf_import crash shutting down
- overlay layer: unable to launch titles on steam
- radv/zink: spec@ext_texture_integer@multisample-fast-clear gl_ext_texture_integer
- VAAPI: Wrong H.264 playback on RX 6900 XT and RX 6700 XT (all Sienna?)
- radv: possibly not setting state dirty bits correctly
- RADV: VRS attachment not working in specific scenario
- rusticl: invalid SPIR-V kernel causes panic
- [RADV] The Last Of Us Part 1: artifacting in the menu (with workaround)
- AMD va-api outputs corrupt encoding
- !20673 regressed \`dEQP-VK.wsi.xlib.surface.query_formats`
- aco: missing dependency on generated header
- zink: spirv validation errors with spirv 1.6
- freedreno/a6xx: Assertion \`view->rsc_seqno == rsc->seqno' failed.
- iris regression in map stride after import with gen9 parts
- anv: zink ADL failures
- Vulkan loader \`vk_common_GetPhysicalDeviceFormatProperties` fails to sanitize properties bits.
- Loading a model in PrusaSlicer 2.6.0-alpha5 crashes GNOME on radeonsi
- [glx][bisected][regression]Intel HD 3000 failing to create context on applications like Unity
- v3d: dEQP-GLES3.functional.texture.specification.teximage2d_pbo.rg32f_cube and similar fails when run together with other tests
- standalone glsl compiler not getting built with mingw
- !22191 broke test-docs-mr
- mesa: index buffer leaking
- RadeonSI: null dereference in amdgpu_cs_add_buffer, potential refcount mismatch, running BeyondAllReason
- NIR can't unroll any loop from nine
- Steel Division 2 - radv/gpu hang - bisected
- Turnip: VKD3D can't run due to lack of memory property flag
- eglCreateImageKHR, error: EGL_BAD_ALLOC (0x3003), message: "createImageFromDmaBufs failed" on AMD multi-gpu with explicit format modifiers
- radv: In the game Quake II RTX appeared artifacts at fresh mesa builds
- radv: Vampire: The Masquerade - Bloodline (Unofficial Patch) regression
- radeonsi broken for gcn1 card
- libgrl.a installed but not used?
- radv: crash compiling UE5 lumen hardware RT shader
- spec@ext_transform_feedback@builtin-varyings gl_culldistance fail
- Panfrost T860 - broken system with latest mesa on gnome wayland jammy
- aco: unused vtmp_in_loop
- FTBFS: src/amd/llvm/ac_llvm_util.c:248:4: error: implicit declaration of function ‘LLVMAddIPSCCPPass’ (LLVM C interface removed upstream)
- vulkan: new generated physical_device_feature missing meson dependency
- Build broken on old-ish Python versions
- radv: Support fullyCoveredFragmentShaderInputVariable from VK_EXT_conservative_rasterization on RDNA2+
- radv,nir: dEQP-VK.ray_query.builtin.rayqueryterminate.* failures
- RFE: Use _mesa_is_foo(ctx) helpers more
- spec@ext_transform_feedback@builtin-varyings gl_culldistance fail
- ci: infinite XDG_RUNTIME_DIR spam
- ci: XDG_RUNTIME_DIR spam
- [KBL] iris failures with dEQP-GLES3.functional.texture.compressed.astc.void_extent*
- glsl compiled error when the RHS of operator \`>>` is int64_t by enabling GL_ARB_gpu_shader_int64 extension
- turnip: inline uniforms regression
- QPainter fails to render multiple shapes with a brush set since Mesa 23.0
- eglSwapBuffers blocks in wayland when it's wl_surface_frame event is stolen.
- plasmashell sometimes hangs with mesa_glthread
- pps_device.h:23:11: error: ‘uint32_t’ does not name a type
- Build fails with llvm 17: llvm/ADT/Triple.h: No such file or directory
- nir: i2f32(i2i32(x@8)) isn't being collapsed to i2f32(x)
- zink-lvp no longer running tests
- radv: Immortals Fenyx Rising: Grass Flicker on R9 380X and Steam Deck
- radv: A Plague Tale: Requiem black "flash" on 7900XTX
- 7900 XTX: Graphical corruption / artifacts in Cyberpunk
- radeonsi draws spurious values to depth buffer
- Commit ccaaf8fe04c956d9f16f98b7f7fa69a2526283bc causes GPU ring timeouts on BONAIRE
- radv: CmdCopyQueryPoolResults broken for VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT with queryCount > 1
- rusticl over llvmpipe + ffmpeg's Opencl filter = error -51
- rusticl over llvmpipe + ffmpeg's Opencl filter = error -51
- ci: Remove LAVA Gitlab section handler workaround
- System freeze when playing some h264 videos with VA-API on Rembrandt
- OpenGL crashes in X-Plane 11
- r600/TURKS: NIR Shader related errors on CLI with the game "A Hat in Time" and Gallium Nine
- agx: shifts aren't sound
- ci: build logs hidden
- fatal error: intel/dev/intel_wa.h: No such file or directory
- [Bisected] Regression: Project Zomboid renders black
- hasvk: Black pixels with 8xMSAA and fast clears on Intel(R) HD Graphics 4400 (HSW GT2)
- radv: GTA IV graphical artifacts on 7900XTX
- radv: Resident Evil Revelations 2 artifacts on 7900XTX with DCC
- radv: Prototype 2 black textures on RDNA 3 when DCC is enabled
- Mesa 23.0.0 crashes immediately with indirect rendering
- virpipe-on-gl: arb_enhanced_layouts\@matching_fp64_types crashes
- [RADV] Returnal - pistol muzzle flash fills whole screen (graphical artifact)
- ACO: dEQP-VK.binding_model.descriptor_buffer.multiple.graphics_geom_buffers1_sets3_imm_samplers hangs on NAVI10
- Build failures with recent lld
- r600,regression: Glitches on terrain with the NIR backend on Transport Fever 2
- [radeonsi] Regression with MSAA fix for Unreal / Unreal Tournament 99
- spirv: Switch Vulkan drivers to use \`deref_buffer_array_length`
- r600/TURKS: Crash of the game "A Hat in Time" with Gallium Nine and NIR path (third report)
- [gen9atom] Vulkan tests cause gpu hang: dEQP-VK.memory_model.*
- GL_SHADER_BINARY_FORMAT_SPIR_V is not added to the list of GL_SHADER_BINARY_FORMATS even if GL_ARB_gl_spirv is supported.
- mesa: "Fragmented" dynamic lights in IronWail with \`r_fsaamode 1` on
- [ANV/DG2] Vertex explosion in nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_gltf
- CUEtools FLACCL hit assert in rusticl
- Assertion Failed on Intel HD 5500 with Linux / Mesa 22.3.1 / OpenGL
- Rise of the Tomb Raider's Ambient Occlusion pass misrenders (swimming shadows)
- vk_enum_to_str: missing VkPipelineCreateFlags
- [glsl] [spirv] ssbo unsizied array not supported ?
- Creating a vulkan physical device on an AMD GPU causes following calls to drmModeAddFB to fail with ENOENT
- Minecraft: spec related compile errors
- mesa: _mesa_glthread_upload crash
- glthread: OpenGL submission blocks while swapping buffers
- glthread: Loading a shader cache in yuzu slows down with mesa_glthread=true
- Commit "radeonsi: enable glthread by default" (d6fabe49cd72fb) causes a regression in gstreamer gtkglsink element
- llvmpipe: linear rasterizer / depth bug
- radv: (Using mesh shader) NIR validation failed after nir_lower_io_to_scalar_early
- panfrost Mali-G31 glamor regression
- allwinner a64: DRM_IOCTL_MODE_CREATE_DUMB failed: Cannot allocate memory after some time of apps usage
- turnip: dEQP-VK.ubo.random.all_shared_buffer.48 slow
- wine + dxvk + Rise of the Tomb Raider crashes in Soviet Installation 20% with VK_ERROR_DEVICE_LOST
- Sometimes VLC player process gets stuck in memory after closure if video output used is Auto or OpenGL
- ci: Remove LAVA Gitlab section handler workaround
- kwin_wayland crashes involving dri2_create_drawable when Plasma starts and the llvmpipe driver from Mesa 23.0-rc3 and 23.0-rc4 is used
- turnip: no ubwc fast clear for depth on a618
- anv: VK_ACCESS_2_SHADER_READ_BIT doesn't seem to be handled correctly
- Vulkan WSI flags leak into NIR, breaking build on BSDs
- Iris corruptions in zoom calls
- Sampling with aux enabled with ISL_AUX_STATE_PASS_THROUGH seems broken on Tigerlake+
- anv: incorrect task shader payload
- radv: Hi-Fi Rush incorrectly rendering face shadows with DCC on 7900 XTX
- [iris] isl_calc_min_row_pitch seems incorrect on a750
- DG2: incorrect rendering in Sascha Willems raytracing callable demo
- turnip: conditional load/store hurts some workloads
- Some blackouts / rendering issues with RADV_PERFTEST=gpl in Battlefield 1 (DX11)
- radv/zink: ACO assert with DOOM2016
- Registered special XGE not unregistered
- draw_llvm.c:788:7: error: implicit declaration of function ‘LLVMContextSetOpaquePointers’
- asahi: Optimize lower_resinfo for cube maps
- Metro Exodus hits nir validation with a driver supporting raytracing.
- ANV Gen 9.5 swapchain corruption when using newer \`VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL` layout
- turnip: optimal bin layout
- piglit.spec.arb_shader_texture_image_samples.compiler.fs-image-samples_frag regression
- lavapipe assert fails on Windows
- zink: itoral-gl-terrain-demo rendering failure
- asahi: Implement a shader disk cache
- [ICL] Trine 4 trace causing GPU HANG
- radv: Segfault during createAccelerationStructure when the backing buffer is not bound to memory
- 7900 XTX stuck at 'compiling shaders' in Monster Hunter Rise
- radv: slow GPL (fast) link times
- libEGL warning: failed to get driver name for fd -1
- iris: Context incorrectly marked as guilty
- simple_mtx.h:34:12: fatal error: valgrind.h: No such file or directory
- [ANV] Commit 4ceaed78 causes misrendering on Cyberpunk 2077
- adding eglGetMscRateANGLE support for multiple monitors with different refresh rates
- Performance regression in Chromium WebGL when implement ANGLE_sync_control_rate with egl/x11
- ci: Ensure that the Intel/Freedreno trace pipelines only show up in relevant MR's
- anv: Performance issue with Vulkan on Wayland KWin
- Incorrect format conversion on big endian
- radv: State of Decay 2 character rendering regression
- aco_tests assembler.gfx11.vop12c_v128/gfx11 failure
- r600,regression: Loading of DOOM stuck at 0% with the NIR backend
- RADV: enabling TC-compat HTILE in GENERAL for compute queues is likely broken
- Confidential issue #8065
- VAAPI HEVC encode broken since 22.3
- GPU HANG: ecode 12:1:859ffffb (Resetting rcs0 for stopped heartbeat on rcs0) - reproducible
- zink: src/gallium/auxiliary/pipebuffer/pb_slab.c:138: Assertion failed: \`heap < slabs->num_heaps`
- [zink] Assertion \`heap < slabs->num_heaps' failed on Pascal (bisected)
- [RADV] Incorrect copies to/from compressed textures with mipmaps
- mesa_glthread=true and probably ANY id Tech 3 engine games, offroad...
- radeonsi: VRAM Leak/abnormally high usage in Minecraft mod pack
- nir/lower_blend: Bogus assert
- anv-tgl-vk: fails a multiple jobs after changing sharding
- radv CTS crashes since ebec42d799b22b7b3d06acd710f5687252446a06
- llvmpipe: dEQP-EGL programs.link failures.
- libmesa_util depends on gallium
- EGL report EGL_EXT_create_context_robustness with kms_dri drvier while can't create context with EGL_LOSE_CONTEXT_ON_RESET_EXT attribute.
- v3d: missing drm format modifier support on Raspberry Pi 4 required for mpv
- Return To Monkey Island black screen
- Return To Monkey Island black screen
- navi22 amdgpu: bo 000000002843d677 va 0x0800000400-0x08000005ff conflict with 0x0800000400-0x0800000600
- Ryzen 6800H laptop amdgpu: bo 00000000b1eb583a va 0x0800000200-0x08000003ff conflict with 0x0800000200-0x0800000400
- [RADV] [MISSED PERFORMANCE POTENTIAL] Vulkan not working when Color Depth is set to "16", but Vulkan works when Color Depth is set to "24"
- v3dv: f2f16_rtz lowering could be improved
- debug build compilation failed: inlining failed in call to ‘always_inline’ ‘src_is_ssa’: indirect function call with a yet undetermined callee
- radv: regression: broken UI rendering in Elden Ring
- radv: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR


Changes
-------

Adam Jackson (22):

- glx/dri3: Simplify protocol version tracking
- glx: Remove glx_context::screen
- glx: Remove a can't-happen NULL check
- glx: Remove support for glXGetDriverConfig for old drivers
- glx: Clean up some funny business from context bind/unbind
- glx: Reflow MakeContextCurrent a little
- glx: Check for initial "glX" first in glXGetProcAddress
- glx: Move 1.2 GLXPixmap code into glx_pbuffer.c
- glx: Inline a few single-use constant strings into their user
- glx: Fix drawable type inference in visual/fbconfig setup
- glx: Harmonize glXCreateGLXPixmap with glXCreatePixmap
- mesa: Fix extension table formatting
- mesa: Trivially advertise NV_generate_mipmap_sRGB
- wsi/x11: Make get_sorted_vk_formats handle varying channel widths
- wsi/x11: Infer the default surface format from the root window's visual
- wsi/x11: Support depth 16 visuals
- glx/dri: Use X/GLX error codes for our create_context_attribs
- dri: Validate more of the context version in validate_context_version
- glx/dri: Fix error generation for invalid GLX_RENDER_TYPE
- glx: Disable the indirect fallback in CreateContextAttribs
- glx: Fix error handling yet again in CreateContextAttribs
- mesa: Enable NV_texture_barrier in GLES2+

Adam Stylinski (2):

- glx: fix a macro being invoked with the wrong parameter name
- mesa: fix out of bounds stack access on big endian

Alan Coopersmith (1):

- util/disk_cache: Handle OS'es without d_type in struct dirent

Alejandro Piñeiro (17):

- vulkan/wsi: check if image info was already freed
- v3dv/format: remove unused v3dv_get_tex_return_size
- v3dv/pipeline: rename lower_tex_src_to_offset to lower_tex_src
- v3dv: pass alignment to v3dv_buffer_init
- v3dv/image: use 64-byte alingment for linear images if needed
- v3dv: skip two ycbcr tests
- broadcom/compiler: v3d_nir_lower_txf_ms doesn't need v3d_compile
- broadcom/compiler: treat PIPE_FORMAT_NONE as 32-bit formats for output type
- v3dv: enable shaderStorageImageReadWithoutFormat
- broadcom/compiler: fix indentation at v3d_nir_lower_image_load_store
- nir: track if var copies lowering was called
- radv: use shader_info->var_copies_lowered
- anv: use shader_info->var_copies_lowered
- v3d/v3dv: use shader_info->var_copies_lowered
- v3dv: handle ASPECT_MEMORY_PLANE aspect flags when getting plane number
- v3dv/debug: add debug option to disable TFU codepaths
- v3dv/pipeline: use pipeline depth bias enabled to fill up CFG packet

Alexandros Frantzis (2):

- egl/wayland: Fix destruction of event queue with proxies still attached.
- vulkan/wsi/wayland: Fix destruction of event queue with proxies still attached.

Alyssa Rosenzweig (351):

- nir/peephole_select: Allow load_preamble
- agx: Peephole select after opt_preamble
- asahi: Handle sampler->compare_mode
- panfrost: Don't use AFBC of sRGB luminance-alpha
- pan/bi: Fix incorrect compilation of fsat(reg.yx)
- pan/bi: Add a unit test for fsat(reg.yx)
- panfrost: Enable NV_primitive_restart on Valhall
- panfrost: Fix logic ops on Bifrost
- panfrost: Stop testing CAP_INT16
- panfrost: Remove PAN_MESA_DEBUG=deqp
- panfrost: Remove unused debug parameter
- panfrost: Fix clears with conditional rendering
- panfrost: Document render_condition_check contract
- nir: Add Midgard-specific fsin/fcos ops
- nir: Optimize vendored sin/cos the same way
- pan/mdg: Use special NIR ops for trig scaling
- pan/mdg: Scalarize LUT instructions in NIR
- pan/mdg: Remove MSGS debug
- mesa: Set info.separate_shader for ARB programs
- nir/lower_blend: Fix alpha=1 for RGBX format
- nir/lower_blend: Clamp blend factors
- nir/lower_blend: Fix SNORM logic ops
- nir/lower_blend: Avoid useless iand with logic ops
- nir/lower_blend: Don't do logic ops on pure float
- nir/lower_blend: Handle undefs in stores
- nir/lower_blend: No-op nir_color_mask if no mask
- asahi: Omit extra call to clock_gettime
- nir/opt_preamble: Treat \*size as an input
- nir/opt_preamble: Consider load_preamble as movable
- agx: Lower system values in NIR in the driver
- agx: Bump preamble_storage_size to 512
- agx: Centralize texture lowering
- asahi: Use non-UAPI specific BO create flags
- nir: Add a late texcoord replacement pass
- asahi: Run nir_lower_fragcolor during preprocessing
- asahi: Lower texcoords late
- panfrost: Implement GL_EXT_render_snorm on Bifrost+
- ail: Add layout->mipmapped_z input
- ail: Test mipmapped_z behaviour
- ail: Test 63x63 cube map
- asahi: Set layout->mipmapped_z for 3D textures
- asahi: Fix encoding of uniform size
- asahi: Strengthen agx_usc_uniform contract
- asahi/nir_lower_sysvals: Split large ranges
- asahi: Correct alignment for USC Uniform packets
- agx: Support uniform registers as LODs
- asahi: Use writeback when it looks beneficial
- asahi: Make STAGING resources linear
- asahi: Prefer blit-based texture transfer
- asahi: Implement nontrivial rasterizer discard
- asahi: DRY dirty tracking conditions
- asahi: Remove redundant tri merge disable bit
- asahi: Merge fragment control XML
- agx: Keep varyings forwarded to texture as fp32
- asahi: Don't use 16-bit inputs to 32-bit st_tile
- docs/asahi: Document clip distance varyings
- agx: Fix storing to varying arrays
- agx: Handle constant-offset in address matching
- asahi: Add XML for custom border colours
- agx/decode: Add a data parameter to stateful
- agx/decode: Handle extended samplers
- asahi: Implement custom border colours
- asahi: Fix delete_vs_state implementation
- asahi: Add compute kernel scaffolding
- asahi: Don't leak shader NIR
- asahi: Add hooks for SSBO and images
- asahi: Fake more caps for dEQP-GLES31
- asahi: Advertise seamless cube maps
- asahi: Stub out MSAA for dEQP
- asahi: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS
- asahi: Add compute batches
- asahi: Implement load_ssbo_address/get_ssbo_size
- asahi: Identify more compute-related XML
- agx: Implement compute ID intrinsics
- agx: Implement barriers
- nir/print: Extract get_location_str
- nir/print: Pretty-print I/O semantic locations
- nir/print: Pretty-print color0/1_interp
- agx: Allow uniform sources on phis
- agx: Run DCE twice
- agx: Lower uniform sources with a dedicated pass
- agx: Don't scalarize preambles in NIR
- nir/lower_clip: Only emit 1 discard
- tu,vulkan: Add common Get*OpaqueCaptureDescriptorDataEXT
- radv: Use common Get*OpaqueCaptureDescriptorDataEXT
- agx: Remove unused AGX_MAX_VARYINGS
- agx: Respect component in frag load_input
- agx: Fix AGX_MAX_CF_BINDINGS
- agx: Remove bogus gl_Position assertion
- agx: Implement load_helper_invocation
- agx: Write sample mask even with no colour output
- asahi: Submit batches that don't touch RTs
- asahi: Add XML for indirect dispatch
- asahi: Add XML for indirect draws
- asahi: Add XML for VDM memory barriers
- panvk: Take lock when tracing
- panvk: Fix varying linking
- panvk: Disable SNORM rendering
- asahi: Remove default=true on index list values
- asahi: Refactor index buffer upload for indirect
- asahi: Implement indirect draws
- panfrost: Fix some fields in v10.xml
- pan/decode: Add support for decoding CSF
- asahi: Vectorize background colour load
- panfrost: Disable CRC by default
- panfrost: Fix prim restart XML on Valhall
- nir: Augment raw_output_pan with IO_SEMANTICS+BASE
- pan/lower_framebuffer: Operate on lowered I/O
- nir/lower_blend: Don't touch store->dest
- nir/lower_blend: Don't handle gl_FragColor
- nir/lower_blend,agx,panfrost: Use lowered I/O
- asahi: Lower clip distances late
- asahi: Move agx_preprocess_nir to CSO create
- agx: Don't treat clip distances specially
- agx: Do more work in agx_preprocess_nir
- asahi: Fix rendering into mipmapped framebuffers
- agx: Lower offsets in NIR
- agx: Model and pack gathers
- agx: Implement gathers (nir_texop_tg4)
- docs/features: Sync Asahi with reality
- asahi: Advertise ARB_derivative_control
- asahi: Advertise ARB_texture_barrier
- agx: Model atomic instructions
- agx: Model local loads/stores
- agx: Disallow immediate bases to device_load
- agx: Pack global atomics
- agx: Pack local load/store instructions
- agx: Translate NIR atomics
- agx: Translate load/store_shared
- agx: Lower shared memory offsets to 16-bit
- agx: Pack local atomics
- agx: Implement b2b32
- agx: Handle group_memory_barrier
- agx: Add and use agx_nir_ssa_index helper
- agx: Handle ssa_undef as zero
- agx: Add agx_internal_format_supports_mask helper
- asahi: Implement color masks with masked stores
- asahi: Make shader-db work again
- panfrost: Use proper locations in blend shaders
- nir/lower_blend: Consume dual stores
- nir: Add nir_texop_lod_bias_agx
- asahi: Lower lod_bias_agx to uniform registers
- agx: Lower sampler LOD bias
- nir/lower_blend: Don't dereference null
- docs/feature: Mark ARB_sync as done on Asahi
- asahi/decode: Handle VDM barriers
- nir: Add nir_lower_helper_writes pass
- pan/mdg: Use nir_lower_helper_writes
- asahi: Advertise dual-source blending
- agx: Mask shifts in the backend
- agx: Fix 2D MSAA array texture register allocation
- asahi: Mark PIPE_FORMAT_NONE "supported"
- agx: Don't write sample mask from preambles
- agx: Add AGX_MESA_DEBUG=nopreamble option
- agx: Clean up after lowering address arithmetic
- agx: Factor out allows_16bit_immediate check
- agx: Inline 16-bit load/store offsets
- agx: Constify agx_print
- agx: Refactor vector creation
- agx: Use agx_emit_collect for st_tile
- agx: Don't print pre-optimization shader
- agx: Only lower int64 late
- asahi: Bump shader buffers
- asahi/meta: Use lowered I/O
- agx: Disable tri merging with side effects
- agx: Handle fragment shader side effects
- asahi: Rework system value lowering
- asahi: Wire up compute kernels
- nir/lower_tex: Add lower_index_to_offset
- pan/bi: Use lower_index_to_offset
- ir3: Use lower_index_to_offset
- nir/opt_barrier: Generalize to control barriers
- glsl/nir: Use scoped_barrier for control barrier
- pan/bi: Drop control_barrier handling
- pan/mdg: Drop control_barrier handling
- ir3: Drop non-scoped barrier handling
- gallivm: Drop non-scoped barrier handling
- agx/lower_address: Break on match
- agx/lower_address: Optimize "shift + constant"
- agx/lower_address: Handle large shifts
- agx/lower_address: Handle 8-bit load/store
- agx/lower_address: Fix handling of 64-bit immediates
- agx/lower_address: Handle 16-bit offsets
- agx: Assert that memory index is 32-bit reg
- agx: Fix clang-formatting
- agx: Pack indirect texture/sampler handles
- agx: Handle indirect texture/samplers
- asahi: Don't allow linear depth/stencil buffers
- asahi, agx: Implement dummy samplers
- asahi,agx: Implement buffer textures with gnarly NIR
- panfrost: Remove some unused definitions
- docs/panfrost: Move description of instancing
- panfrost: Don't use DECODE_FIXED16 for sample position
- panfrost: Handle fixed-point packing in GenXML
- panfrost: Add XML for framebuffer pointers
- panfrost: Use framebuffer pointer XML
- panfrost: Remove FBD tag enum from XML
- panfrost: Inline the last MALI_POSITIVE use
- panfrost: Remove MALI_POSITIVE macro
- pan/mdg: Remove reference to removed macro
- agx: Don't set lower_pack_split
- agx: Make partial DCE optional
- agx: Fix subdivision coalescing
- agx: Implement extract_[ui]16
- agx: Use nir_lower_mem_access_bit_sizes
- agx: Switch to scoped_barrier
- nir/lower_point_size: Use shader_instructions_pass
- ail: Restructure generated tests
- agx: Lower discard late
- util/prim_convert: Don't set index_bounds_valid
- pan/bi: Ignore signedness in vertex fetch
- panfrost: Identify "Base vertex offset" signedness
- panfrost: Assert that we don't see unsupported vertex formats
- panfrost: Defeature 24-bit textures
- panfrost: Handle null textures robustly
- panfrost/ci: Skip draw_buffers_indexed.random.* on Midgard
- panfrost/ci: Identify some Piglit flakes
- panfrost/ci: Add some Piglit skips
- panfrost/ci: Remove fbo-mrt-new-bind fail+flake
- panfrost: Note glDrawRangeElements underflow
- asahi: Fix occlusion query lifetime
- panfrost: Don't round up Midgard polygon list BOs
- panvk: Use vk_get_physical_device_features
- asahi: Use a dynarray for writers
- ci: Add clang-format to the amd64 container
- ci: Enforce clang-format for asahi
- gallium: Fix u_stream_outputs_for_vertices with QUADS
- nir/builder: Add nir_umod_imm helper
- blorp,anv,hasvk: Use umod_imm
- v3d,v3dv: Use udiv_imm/umod_imm
- radv: Use umod_imm
- ir3: Use umod_imm
- nir: Add Panfrost intrinsics to lower sample mask
- nir: Add Mali load_output taking converison
- panfrost: Use 0/~0 boolean for MSAA sysval
- pan/bi: Don't duplicate texture op cases
- pan/bi: Lower sample mask writes in NIR
- pan/bi: Lower load_output to make sysval explicit
- pan/bi: Allow specializing bifrost_nir_options by arch
- pan/bi: Lower gl_VertexID in NIR
- pan/bi: Remove bi_load_sysval
- pan/mdg: Use I/O semantics for MRT blend stores
- panfrost: Remove inputs->blend.rt
- panfrost: Remove unused inputs.nr_cbufs
- pan/bi: Only lower once
- pan/mdg: Only lower once
- pan/bi: Split out early preprocessing from late
- pan/mdg: Split out early preprocessing from late
- pan/lower_framebuffer: Only call for FS
- pan/lower_framebuffer: Use nir_shader_instructions_pass
- pan/blit: Lower load_sampler_lod_parameters_pan
- panfrost: Preprocess shaders in the driver
- pan/lower_framebuffer: Lower MSAA blend shaders
- panfrost: Lower clip_fs late
- panfrost: Lower texcoords late
- panfrost: Effectively lower gl_FragColor late
- panfrost: Preprocess shaders at CSO create time
- panfrost: Remove stale TODO
- panvk: Lower sysvals in NIR
- panvk: Don't use vec4 for vertex_instance_offsets
- panvk: Inline blend constants as syvals
- panfrost: Add NIR-based sysval lowering pass
- panfrost: Lower sysvals in GL
- panfrost: Move sysvals to GL driver struct
- panvk: Remove unused function
- panfrost: Move panfrost_sysvals to GL driver
- pan/bi: Export bifrost_nir_lower_load_output
- pan/bi: Call pan_nir_lower_zs_store late
- panvk: Lower blending late
- panfrost: Remove Midgard RSD fields from Bifrost
- asahi: Convert to SPDX headers
- mesa/st: Only set seamless for GLES3
- mesa/st: Normalize wrap modes for seamless cubes
- asahi: Don't lie about seamless cube maps
- panfrost: Print perf debug when flushing everything
- panfrost: Print perf debug on seqnum overflow
- panfrost: Don't redundantly call emit_const_buf
- panfrost: Mark packs as ALWAYS_INLINE
- panfrost: Don't update access with a single batch
- panfrost: Add a v9 fast path for no images
- panfrost: Clean up tiler calculations
- panfrost: Estimate vertex count for hier mask
- panfrost: Choose hierarchy masks by vertex count
- docs: Remove docs about macOS hardware drivers
- nv50,nvc0: Use u_pipe_screen_get_param_defaults
- panfrost: Always upload a workaround sampler
- pan/{mdg,bi}: Always use sampler 0 for txf
- panfrost: Unset TEXTURE_BUFFER_SAMPLERS
- gallium: Remove PIPE_CAP_TEXTURE_BUFFER_SAMPLER
- docs/gallium: Note samplers are not used for txf
- nir/print: Don't print sampler_index for txf
- asahi: Support more renderable formats
- agx: DCE even with noopt
- agx: Assert that we don't overflow registers
- agx: Constify agx_{read,write}_registers
- agx: Don't allow uniform source to local_atomic
- agx: Don't destroy usub_sat with constant
- asahi: Add perf debug for generate_mipmap
- asahi: Add perf debug for shader variants
- agx: Set loads_varying accurately
- agx: Add helper for calculating occupancy
- asahi/decode: Remove agxdecode_dump_bo
- asahi/decode: Print VDM barriers
- asahi: Set PIPE_CAP_LOAD_CONSTBUF
- agx: Coalesce more collects
- agx: Don't overallocate registers
- asahi: Honour sampler count
- asahi: Implement null textures
- asahi: Lower 1D to 2D
- asahi: Dirty track depth bias uploads
- asahi: Clamp texture buffer sizes
- agx: Tease apart some sample_mask packing magic
- agx: Rename writeout to wait_pix
- agx: Make signal_pix instructions explicit
- vulkan: Add common features2_to_features
- radv: Use vk_features2_to_features
- v3dv: Use vk_features2_to_features
- lavapipe: Use vk_features2_to_features
- pvr: Use vk_features2_to_features
- anv,hasvk: Use vk_features2_to_features
- tu: Use vk_features2_to_features
- nir: Combine if_uses with instruction uses
- nir/opt_ray_queries: Don't use list_length
- nir/opt_loop_unroll: Avoid list_length
- nir: Remove 2nd argument from nir_before_src
- nir/validate: Don't treat if-uses specially
- dxil: Avoid list_length
- nir: Reduce indirection
- nir: Factor out nir_src_rewrite_ssa helper
- nir: Use nir_src_rewrite_ssa
- dxil: Use nir_src_rewrite_ssa
- nir: Remove nir_if_rewrite_condition_ssa
- nir/repair_ssa: Refactor some use handling
- nir/validate: Only walk uses once
- mailmap: Update my e-mail
- panfrost: Symlink gallium .clang-format to common
- panfrost/winsys: Add .clang-format for winsys folder
- panfrost/winsys: Clang-format
- pan/decode: Move comment out of designated initializer
- panfrost: Re-run clang-format
- panvk: Clang-format
- ci: Run clang-format on panfrost
- mesa/st: Set uses_sample_shading when forcing per-sample
- nir/lower_blend: Set uses_fbfetch_output conservatively
- nir/lower_blend: Enable per-sample shading
- pan/bi: Lower swizzles for 8-bit CSEL
- pan/bi: Respect swizzles for more vector ops
- pan/bi: Use nir_lower_mem_access_bit_sizes
- panfrost: Allocate shared memory in OpenCL
- pan/decode: Print compute job payloads
- asahi: Fix disk cache disable with AGX_MESA_DEBUG

Amber (15):

- util/u_trace: pass utrace context to marker functions.
- freedreno: add support for markers.
- ir3, isaspec: add raw instruction to assembler/disassembler.
- ir3: support texture and sampler index with offsets
- nir: support lowering nir_intrinsic_image_samples to a constant load
- ir3: use lower_image_samples_to_one
- intel/compiler: use lower_image_samples_to_one
- freedreno: make sure depth/stencil layouts are always tiled
- freedreno: use A6XX_GRAS_SC_CNTL_SINGLE_PRIM_MODE with fb readback
- gallium: make BlendCoherent usable from gallium drivers
- freedreno: use blendcoherent to set FLUSH_PER_OVERLAP
- freedreno: check for conditional rendering in launch_grid
- nir: allow nir_lower_fb_read to support multiple render targets
- nir: Add memory coherency information to shaders.
- freedreno, nir, ir3: implement GL_EXT_shader_framebuffer_fetch

Andres Calderon Jaramillo (1):

- r600: Report multi-plane formats as unsupported

André Almeida (2):

- radv: Implement vk.check_status
- winsys/amdgpu: Fix amdgpu_cs_query_reset_state2 error log

Antonio Gomes (11):

- rusticl: Enabling reading/writing for images created from buffers
- rusticl: Enabling image fill for images created from buffers
- rusticl: Enable copy for images created from buffers
- rusticl: Enable mapImage for images created from buffers
- gallium, rusticl: Add tex2d_from_buf in image_view and sampler_view
- mesa/st, nine, nouveau: Fix uninitialized pipe_sampler_view structs
- lvmpipe/cs: Add support for 2d images created from buffers
- gallium: Add new caps PIPE_CAP_LINEAR_IMAGE_(PITCH_ALIGNMENT|BASE_ADDRESS_ALIGNMENT)
- rusticl: Implement spec for cl_khr_image2d_from_buffer
- llvmpipe: Add new caps PIPE_CAP_LINEAR_IMAGE_(PITCH_ALIGNMENT|BASE_ADDRESS_ALIGNMENT)
- iris: Add support for 2d images created from buffers

Anuj Phogat (3):

- anv: implement TES distribution
mode WA 22012785325 705- iris: implement TES distribution mode WA 22012785325 706- intel/genxml/125: Add preferred SLM allocation size field 707 708Asahi Lina (43): 709 710- asahi: Split off common BO code into its own file 711- asahi: Split off macOS support into its own file 712- asahi: Refuse to transfer out-of-bounds mip levels 713- meson: Fix Asahi build on macOS 714- asahi: Fix shader key cloning overreads 715- asahi: Do not use memctx for pools / meta cache 716- asahi: Drop agx_device.memctx 717- asahi: Only apply FS lowerings to fragment shaders 718- asahi: Add BO_SHAREABLE flag 719- asahi: Add readonly BO flag 720- asahi: Identify USC cache invalidate 721- asahi: Flush USC caches on the first draw 722- asahi: Drop macOS backend 723- asahi: Add nocluster,sync,stats debug flags 724- asahi: Align device submission API with upcoming UAPI 725- asahi: Implement Linux driver scaffolding, sans UAPI 726- asahi: Add APIs for DMA-BUF sync file import/export 727- asahi: Add agx_debug_fault() helper 728- asahi: Add result buffer to context/batches 729- asahi: Add agx_bo_mmap() calls to transfer path 730- asahi: Pull device name from device struct 731- asahi: Do not overread user index buffers 732- asahi: Fix scissor culling check when out of bounds for FB/viewport 733- asahi: Fix device fd leak in agx_close_device 734- asahi: Destroy the renderonly context on screen destroy 735- asahi: clang-format the world again 736- asahi: Assert on TIB strides > 64 737- asahi: Support importing sync objects on BO export 738- asahi: Make agx_flush_resource reallocate non-shareable resources 739- asahi: Extend batch tracking for explicit sync 740- Revert "asahi: Advertise dual-source blending" 741- asahi: Make agx_alloc_staging() take a screen instead of a context 742- asahi: Enable glthread 743- asahi: Locate low VA BOs correctly 744- asahi: Fix style nits 745- asahi: Implement valid buffer range tracking 746- asahi: Make BO import path failures more robust 747- asahi: Add a helper 
macro for debug/error messages 748- asahi: Add resource debugging 749- asahi: Print reasons why compression is disabled 750- asahi: Fix compressed ZS support 751- asahi: Flip kmsro around to allocate on the GPU 752- asahi: Allow explicit non-LINEAR modifiers for scanout 753 754Axel Davy (1): 755 756- frontend/nine: Fix num_textures count 757 758Bas Nieuwenhuizen (26): 759 760- aco: Pass correct number of coords to Vega 1D LOD instruction. 761- radv: Strictly limit alignment needed within a descriptor set. 762- radv: Reduce descriptor pool allocation for alignment. 763- radv: Set FDCC_CONTROL SAMPLE_MASK_TRACKER_WATERMARK 764- radv: Shift left the tile swizzle more on GFX11. 765- nir: Apply a maximum stack depth to avoid stack overflows. 766- radv: Add helper to hash stages. 767- radv: Hash group handles as part of RT pipeline key. 768- radv: Use provided handles for switch cases in RT shaders. 769- radv: Use group handles based on shader hashes. 770- radv: Implement & expose VK_EXT_pipeline_library_group_handles. 771- Update my mailmap aliases 772- ac/surface,radv: Avoid pitch weirdness if image not used for rendertarget. 773- ac/surface: Only allow stencil pitch adjustment for mipmaps. 774- ac/surface,radv: Opt out of stencil adjust. 775- util: Add aligned int64_t types for x86(non 64). 776- util/disk_cache: Align atomic size. 777- radv: Align atomic values. 778- radv: Reserve space in framebuffer emission. 779- radv: Reserve space in various streamout functions. 780- radv: Reserve space in conditional rendering functions. 781- radv: Reserve space in si_cs_emit_cache_flush. 782- radv: Reserve space for updating DCC metadata. 783- radv: Reserve space for fast clear related writes. 784- radv: Reserve space for indirect descriptor set address writes. 785- radv: Move all the dirty flags from TES binding to TCS binding. 
786 787Benjamin Cheng (1): 788 789- radv: initialize cmd_buffer upload list earlier 790 791Boyuan Zhang (6): 792 793- radeonsi/vcn: check fence before destroying dpb 794- radeonsi/vcn: check fence before destroying decoder 795- radeonsi/vcn: validate fence handle before using it 796- virgl/video: disable decoder fence 797- virgl: add more formats to conv table 798- frontends/va: check decoder in va surface call 799 800Brian Paul (9): 801 802- anv: add a third memory type for LLC configuration 803- llvmpipe: do additional checks in lp_state_fs_analysis.c for linear shaders 804- llvmpipe: remove debug printf spam in lp_setup_wait_empty_scene() 805- gallium/xlib: call fence_finish() in XMesaSwapBuffers() 806- llvmpipe: fix ps invocations query bug 807- llvmpipe: rename some vars related to occlusion query and ps invocations 808- llvmpipe: s/tabs/spaces/ 809- llvmpipe: s/unsigned/enum pipe_query_type/ 810- llvmpipe: clean-up llvmpipe_get_query_result() 811 812Błażej Szczygieł (1): 813 814- glx: Fix glXGetFBConfigFromVisualSGIX 815 816Caio Oliveira (26): 817 818- glsl: Account for unsized arrays in NIR linker 819- hasvk: Update driver name in debug information 820- intel: Add extra zeros at the end of debug identifiers 821- iris, crocus: Align workaround address to 32B 822- anv, hasvk: Align workaround address to 32B 823- nir: Add nir_intrinsic_rotate 824- nir/lower_subgroups: Add option lower_rotate_to_shuffle 825- spirv: Implement SPV_KHR_subgroup_rotate 826- nir: Support use_scoped_barrier in nir_lower_atomics_to_ssbo 827- microsoft/compiler: Handle scoped barrier in Tess splitting 828- gallivm: Fix handling of nir_intrinsic_scoped_barrier 829- glsl: Implement use_scoped_barrier option for lowering memory barriers 830- intel/compiler: Mark various memory barriers intrinsics unreachable 831- pan/compiler: Fix handling of nir_intrinsic_scoped_barrier 832- pan/midgard: Handle nir_intrinsic_scoped_barrier in Midgard compiler 833- panfrost: Use NIR scoped barriers instead 
of memory barriers 834- spirv: Don't specify nir_var_uniform or nir_var_mem_ubo in barriers 835- spirv/tests: Subclass spirv_test helper to namespace the tests 836- spirv/tests: Add script to generate C array from SPIR-V source 837- spirv/tests: Parametrize stage in get_nir() helper 838- spirv/tests: Add some basic control flow tests 839- spirv: Add skip_os_break_in_debug_build option to use in unit tests 840- intel/fs: Handle scoped barriers with execution scope 841- intel/vec4: Handle scoped barriers with execution scope 842- intel/compiler: Drop brw_nir_lower_scoped_barriers 843- intel/compiler: Drop non-scoped barrier handling 844 845Caleb Cornett (6): 846 847- d3d12: Lower minimum supported Shader Model to 6.0 848- futex: Change INT_MAX to INT32_MAX. 849- util: Add #ifdefs for Xbox GDK support. 850- dxil_validator: Add support for Xbox GDK. 851- wgl: Add support for Xbox GDK. 852- d3d12: Add support for Xbox GDK. 853 854Charlie Birks (1): 855 856- docs: add a few vulkan extensions supported by multiple drivers 857 858Charmaine Lee (5): 859 860- svga: fix resource_get_handle from resource created without SHARED bind flag 861- svga: fix compatible formats for shareable surfaces 862- svga: use upload buffer if texture has pending changes 863- translate: do not clamp element index in generic_run 864- svga: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY for VGPU10 device 865 866Chia-I Wu (38): 867 868- turnip: replace TU_DEBUG_DONT_CARE_AS_LOAD by a bool 869- turnip: make debug_flags a global variable 870- freedreno: add has_implicit_modifier helper 871- freedreno: support UBWC scanout 872- turnip: add a comment to tu_format_for_aspect 873- turnip: move a comment about FMT6_Z24_UNORM_S8_UINT_AS_R8G8B8A8 874- turnip: remove tu_native_format::tile_mode 875- turnip: make tu6_format_*_supported static 876- turnip: let tu6_format_vtx* take pipe format 877- turnip: add blit_format_texture 878- turnip: add blit_format_color and blit_base_format 879- turnip: handle ubwc 
in blit_base_format 880- turnip: reorder tu6_format_* 881- freedreno/registers: correct WFM bit in CP_REG_TEST 882- turnip: add a comment to tu_render_pass_cond_config 883- turnip: skip unnecessary CP_REG_TEST for cond load/store 884- freedreno/registers: document more bits of CP_REG_TEST 885- freedreno: avoid conditional ib in fd6_emit_tile 886- radv: fix a hang with binning on CHIP_RENOIR 887- turnip: fix a major leak with GPL LTO 888- turnip: fix a null descriptor set dereference 889- turnip: avoid FMT6_Z24_UNORM_S8_UINT_AS_R8G8B8A8 for event blits 890- radv: add a size check in radv_create_buffer for Android 891- util/log: refactor mesa_log 892- util/log: allow multiple loggers 893- util/log: improve logger_file newline handling 894- util/log: improve logger_android 895- util/log: add logger_syslog 896- util/log: add support for MESA_LOG_FILE 897- util/log: add logger_windbg 898- mesa: add missing newlines for _mesa_debug/_mesa_log callers 899- mesa: use mesa_log from output_if_debug 900- anv: process utrace payloads on queue submission 901- ci/radv: remove dEQP-VK.image.sample_texture.* fails/flakes 902- radv: set RADEON_FLAG_GTT_WC for external mem on vram 903- radv: rework radv_layout_fmask_compressed 904- radv: add RADV_FMASK_COMPRESSION_PARTIAL 905- radv: disable tc_compatible_cmask on GFX9 in some cases 906 907Christian Gmeiner (1): 908 909- etnaviv: nir: use lower_fround_even 910 911Collabora's Gfx CI Team (3): 912 913- Uprev Piglit to 60e7f0586bac0cfcfcb5871046e31ca2057a5117 914- Uprev Piglit to 2391a83d1639a7ab7bbea02853b922878687b0e5 915- Uprev Piglit to 355ad6bcb2cb3d9e030b7c6eef2b076b0dfb4d63 916 917Connor Abbott (12): 918 919- freedreno/a6xx: Rename CP_CSQ_IB*_STAT 920- freedreno/a6xx: Add CP_ROQ_*_STAT 921- freedreno/a6xx: Fix CP_ROQ_THRESHOLDS_1 922- freedreno/a6xx: Fill in ROQ status registers 923- freedreno/crashdec: Fix apparent off-by-one with ROQ size 924- freedreno/crashdec: Add prefetch test 925- tu: Fix tile_align_h on a650 926- 
freedreno: Fix or/and'ing two BitmaskEnums 927- tu: Use vk_pipeline_get_renderpass_flags() 928- vk/render_pass: Support VK_EXT_fragment_density_map 929- nir, spirv: Add support for VK_EXT_fragment_density_map 930- tu: Don't override depth for GMEM 931 932Constantine Shablya (12): 933 934- anv: handle ATTACHMENT_OPTIMAL layout 935- anv: use Vulkan runtime's robust buffer access 936- hasvk: use Vulkan runtime's robust buffer access 937- anv,hasvk: flush what UNIFORM_READ flushes on SHADER_READ 938- vulkan: relocate rmv to its correct home 939- vulkan: tidy up vk_physical_device_features 940- vulkan: delete trailing namespace 941- vulkan: add hepler for vkGetPhysicalDeviceFeatures2 942- vulkan: use vk_features for vk_device::enabled_features 943- anv: use vk_get_physical_device_features 944- vulkan: fix building with python3.8 945- vulkan: depend idep_vulkan_runtime_headers on vk_physical_device_features.h 946 947Corentin Noël (12): 948 949- ci/venus: Remove failure now passing 950- kopper: Do not free the given screen in initScreen implementation 951- ci: uprev virglrenderer 952- ci/venus: Skip tests risking out of memory issues 953- ci: uprev virglrenderer and crosvm 954- ci: Setup XDG_RUNTIME_DIR in crosvm-init 955- ci: Allow to use crosvm-runner before deqp-runner 956- ci: Uprev crosvm and virglrenderer 957- venus/ci: Only run one crosvm instance 958- mesa: OpenGL ES 3.0 requires EXT_instanced_arrays 959- glapi: Make EXT_draw_instanced functions available for GLES 2.0 960- mesa: Add EXT_instanced_arrays support 961 962Daniel Schürmann (82): 963 964- radv: CSE ray_launch_{size|id} 965- radv: rename shader_info->cs.uses_sbt -> shader_info->cs.is_rt_shader 966- radv: unconditionally enable scratch for RT shaders 967- radv/rt: introduce and set rt_pipeline->stack_size 968- radv/rt: use dynamic_callable_stack_base also for static stack_sizes 969- radv/rt: don't hash maxPipelineRayRecursionDepth 970- nir: add Continue Construct to nir_loop 971- nir: add assertions that 
loops don't have a Continue Construct 972- nir: create nir_push_continue() and related helpers 973- nir: add lowering for Loop Continue Constructs 974- spirv: use Loop Continue Construct to emit SPIR-V loops and lower after parsing 975- nir/lower_continue_constructs: special-case Continue Constructs with zero or one predecessors 976- nir/lower_continue_targets: only repair SSA when necessary 977- nir: simplify nir_block_cf_tree_{next|prev} 978- radv/rt: rename library_pipeline->groups to library_pipeline->group_infos 979- radv/rt: defer library_pipeline allocation 980- radv/rt: introduce struct radv_ray_tracing_module 981- radv/rt: move stack_sizes into radv_ray_tracing_module 982- radv/rt: only reserve stack_sizes after rt_case insertion 983- radv: expose radv_postprocess_nir() 984- radv: expose radv_pipeline_capture_shaders() 985- radv/rt: introduce and use radv_rt_pipeline_compile() 986- radv: remove unused parameters from radv_compute_pipeline_compile() 987- radv/rt: move radv_pipeline_key from rt_variables to traversal_data 988- nir/gather_info: allow terminate() in non-PS 989- aco: fix NIR infinite loops 990- radv/rt: use terminate() when returning from raygen shaders 991- aco/dominance: set immediate dominator for any BB without predecessors 992- aco/value_numbering: clear hashmap between disconnected CFGs 993- aco/dead_code_analysis: don't add artificial uses to p_startpgm 994- aco/insert_exec_mask: allow for disconnected CFG 995- aco/spill: allow for disconnected CFG 996- radv/rt: place any-hit scratch vars after intersection scratch vars 997- radv/rt: Fix any_hit scratch variables. 
998- mesa: add gl_shader_stage_is_rt() 999- radv: add RT shader args 1000- radv: handle RT stages in radv_nir_shader_info_pass() 1001- radv: add RT stages to radv_get_shader_name() 1002- radv: add RT shader handling to radv_postprocess_config 1003- aco: add RT stage enums 1004- aco: don't set private_segment_buffer/scratch_offset on GFX9+ 1005- aco: move rt_dynamic_callable_stack_base_amd to VGPR 1006- aco: implement load_ray_launch_{id|size} 1007- aco: create hw_init_scratch() function for p_init_scratch lowering 1008- aco: implement select_rt_prolog() 1009- radv: add radv_create_rt_prolog() 1010- radv: compile rt_prolog 1011- radv/rt: use prolog for raytracing shaders 1012- aco: remove aco::rt_stack variable 1013- radv: remove unused parameter from radv_open_rtld_binary() 1014- radv: separate radv_postprocess_binary_config() from radv_shader_create() 1015- radv: remove unnecessary copy of binary->config 1016- radv: inline radv_postprocess_config() 1017- radv: separate radv_capture_shader_executable_info() from radv_shader_create() 1018- radv: move gl_shader_stage from radv_binary to radv_shader_info 1019- radv: remove radv_create_gs_copy_shader() 1020- radv: refactor shader_compile() 1021- radv: skip pipeline caching with RADV_DEBUG=shaders 1022- radv: fix radv_shader_binary member fields to 32 bit. 
1023- radv/rt: Fix VK_KHR_pipeline_executable_properties 1024- aco: split ps_epilog args before exporting them 1025- aco/ra: adjust_max_used_regs() for fixed Operands 1026- aco: don't use shared VGPRs for shaders consisting of multiple binaries 1027- radv: update PS num_vgprs in case of epilogs rather than overallocating VGPRs 1028- vulkan/pipeline_cache: remove vk_device from vk_pipeline_cache_object 1029- vulkan/pipeline_cache: Don't re-insert disk-cache hits into disk-cache 1030- vulkan/pipeline_cache: implement vk_pipeline_cache_create_and_insert_object() 1031- vulkan/pipeline_cache: use vk_pipeline_cache_create_and_insert_object() during vk_pipeline_cache_load() 1032- vulkan/pipeline_cache: add cache parameter to deserialize() function 1033- vulkan/pipeline_cache: move vk_log on failed deserialization to vk_pipeline_cache_load() 1034- radv: derive struct radv_shader from vk_pipeline_cache_object 1035- radv: unconditionally store the binary code in radv_shader 1036- radv: add radv_shader_serialize() and radv_shader_deserialize() functions 1037- radv: add struct radv_pipeline_cache_object 1038- radv: implement radv_shader_create_cached() 1039- radv: use vk_pipeline_cache 1040- radv: clean up pipeline-cache interface 1041- radv/ci: add 2 more Flakes for Navi21 1042- radv/rt: fix total stack size computation 1043- radv/rt: properly destroy radv_ray_tracing_lib_pipeline on error 1044- vulkan/pipeline_cache: replace raw data objects on cache insertion of real objects 1045- radv: add padding to radv_shader_binary_legacy 1046 1047Daniel Stone (18): 1048 1049- ci/fdno: Only run full tests on a limited subset of machines 1050- ci/radv: Skip vkCreateInstance memory-fail test 1051- ci/anv: Temporarily halve TGL testing load 1052- intel/isl: Don't scream FINISHME into logs for 3D vs. 
CCS 1053- ci/radv: Drop raven quick_shader load 1054- ci/fdno: Add a618 Vulkan flakes 1055- ci/zink: Add flake seen in the wild 1056- ci/radv: Lower stoney CTS load 1057- ci/android: Use a more aggressive timeout for the job 1058- ci: Actually run Piglit on LAVA 1059- ci: Disable Collabora LAVA farm 1060- Revert "ci: Disable Collabora LAVA farm" 1061- CI: Disable Windows runners 1062- CI: Disable mingw job 1063- ci/panfrost: Add texturesize flake seen in the wild 1064- CI: Disable freedreno 1065- ci/radeonsi: sort and dedup stoney skips 1066- ci/radeonsi: Skip really slow tests on stoney 1067 1068Danylo Piliaiev (52): 1069 1070- tu/kgsl: do not use kgsl_command_object::offset 1071- tu: Prevent using stale value of RB_UNKNOWN_88D0 on BLIT 1072- tu: Prevent using stale value of GRAS_SC_CNTL in sysmem clear 1073- freedreno: Document A6XX_GRAS_SC_CNTL::rotation field 1074- turnip: Ensure that there is no renderpass rotation in binning 1075- turnip: Disable draw states after dyn renderpass in all cases 1076- ir3: Consider dst type in ubo_vec4 to ldc lowering 1077- tu: Don't expose KHR_present_id,KHR_present_wait without KHR_swapchain 1078- turnip: Add debug option to find usage of stale reg values 1079- docs/freedreno: Add info about stale reg stomper dbg option 1080- ci/tu: Add 1/200 pass to test for stale reg usage 1081- ir3: Add cat5/cat7 cache related instructions 1082- ir3: Add cat7 sleep instruction 1083- freedreno/register: Define chip enum values 1084- util/perf: C++-proof util/perf 1085- util/format: Make format_table compatible with C++ 1086- spirv: sort spirv_supported_capabilities 1087- vk/vk_extension_gen: Make table struct initializable in C++ on older gcc 1088- vk/wsi: C++-proof wsi_common_drm.h 1089- vk/util: remove (void \*) casts from vk_foreach_multi_draw macros 1090- vk/util: Generate defines to help casting structs with vk_find_struct 1091- freedreno/common: C++-proof freedreno_uuid.h 1092- ir3: C++-proofing 1093- tu: C++-proofing: fix offsetof with 
dynamic array index 1094- tu: C++-proofing: fix struct initializers 1095- tu: C++-proofing: various enum fixes 1096- tu: C++-proof: do not goto over variables initialization 1097- tu: C++-proofing: fix designator initializer order 1098- tu: C++-proofing: fix extension table initialization 1099- tu: C++-proofing: Initialize tu_reg_value in-order by pack funcs 1100- tu: C++-proofing: fix casting from void * fpermissive warnings 1101- tu: C++-proofing: ease access to global bo struct 1102- tu: C++-proofing: prevent taking address from rvalue 1103- tu: C++-proofing: cast result when extracting field from reg value 1104- tu: C++-proofing: misc fixes 1105- freedreno/msm: Rename drm_msm_gem_submit_reloc::or in C++ code 1106- tu: compile as C++ 1107- vk/entry_points: Add option to generate template entrypoints 1108- freedreno/regs: Include assert.h in generated headers 1109- tu: Generate entrypoints for each gen 1110- turnip: add cached and cached-coherent memory types 1111- tu/drm: Support cached non-coherent memory 1112- freedreno/registers: Document new CP_EVENT_WRITE::SEQNO 1113- freedreno/registers: More a7xx regs 1114- freedreno/computerator: C++ proofing 1115- freedreno: C++ fixes for computerator to compile 1116- freedreno/computerator: Convert to C++ 1117- freedreno: Move fd6_pack.h to common code accessible by computerator 1118- freedreno: Add dummy a730/a740 definition 1119- freedreno/computerator: Templatize a6xx backend 1120- freedreno/computerator: Add support for a7xx 1121- vulkan: Sanitize pSampleMask in CmdSetSampleMaskEXT 1122 1123Dave Airlie (37): 1124 1125- ci: bump vk cts to 1.3.3.1 + and a crash fix. 1126- vulkan/video: add common h264/h265 parameter set management code. 1127- vulkan/format: add a 10-bit video format 1128- radv: remove the status query mark it unsupported. 1129- radv: add new upload alloc aligned api 1130- ac: add name to codec info struct 1131- radv: adding video decode queue support 1132- radv: add video decoder register setup. 
1133- radv/video: add initial frameworking. 1134- radv/video: add initial h264 decoder for VCN 1135- radv/video: add h264 support for uvd 1136- radv: add vcn h265 decode. 1137- radv/video: add h265 decode UVD support 1138- radv/vcn: enable dynamic dpb tier 2 for h264/h265 on navi21+ 1139- anv: add video engine support in various places 1140- anv: set Y/4 tiling for video decode images 1141- anv: add video format features for the one supported video output format 1142- anv/format: handle video extensions structs by ignoring them 1143- intel/genxml: align some of the fields with the media driver 1144- intel/genxml: add missing power well control bits 1145- anv/image: allocate some memory for mv storage after video images. 1146- anv: add initial video decode support for h264. 1147- anv/query: add query status report 1148- anv: enable video decode extensions. 1149- anv/video: fix video memory bindings. 1150- crocus: disable Y tiling for render targets properly. 1151- crocus: switch gen4/5 tiling flags to follow suggestions. 1152- llvmpipe: fix compute address bits to return native pointer size. 1153- anv: always pick graphics queue to execute prime blits on. 1154- radv: add video format support to format probing. 1155- anv/video: fix chroma qp to be a integer value. 1156- anv/video: disable picture id reampping. 1157- anv: fix image height for field pictures. 1158- radv/video: fix h264 frame heights when field images are in use 1159- radv/video: fix used for reference flags. 1160- radv/video: fix h265 decoding sizes. 1161- radv/trace: don't attempt to emit trace on non-graphics/compute queues 1162 1163David (Ming Qiang) Wu (1): 1164 1165- radeonsi/vcn: add an exception of field case for h264 decoding 1166 1167David Heidelberg (73): 1168 1169- ci/zink: Penumbra is now fixed. 
1170- freedreno/ci: Switch also performance a630 job to manual 1171- ci/anv: add multiple fails uncovered by change of sharding 1172- ci/intel: fully utilize asus-cx9400-volteer 1173- ci/piglit: explicitely define we want GLX tests 1174- ci: migrate from wget to curl 1175- ci/piglit: 2023-01-19 uprev 1176- ci: bump ci-fairy with session support (robust downloads) 1177- ci: Sir trace has small invisible change in rendering 1178- ci: bump Mold to the 1.10.0 1179- ci: uprev piglit (etag md5 checksumming support) 1180- ci/lavapipe: use dxvk for the traces 1181- ci: revert download of git cache to the wget 1182- ci/llvmpipe: add flake timeout for rusticl program@execute@[email protected] 1183- util/process_test: make the error variable static 1184- intel: enable -mfpmath=sse on x86 1185- intel: use c_see2_arg instead of explicit -msse2 1186- ci/freedreno: add flaking KHR-GL45.buffer_storage.map_persistent_dispatch 1187- meson: print c_cpp_args 1188- intel/vulkan: add missing dependency on generated headers 1189- ci/freedreno: add flaking KHR-GL45.buffer_storage.map_persistent_flush 1190- ci/alpine: keep the curl inside the image 1191- ci: alpine: install bash and coreutils for date -d 1192- ci: implement unified sections 1193- ci: make meson build and test uncollapsed 1194- ci: deqp-runner: drop already unused function 1195- ci: Retry, retry, retry... No one likes to trigger Marge more than once. 
1196- ci/zink: add skip for the Single-GL46.enhanced_layouts.ssb_member_align_non_power_of_2 1197- ci/lavapipe: add recent occasional flake 1198- ci/freedreno: rare flake KHR-GL45.sample_variables.mask.rgba8i.samples_4.mask_3 1199- crocus/meson: add dependency on libintel_dev also for versioned static libraries 1200- ci/ci_run_n_monitor: while we usually disable many jobs, print them inline 1201- ci: do not exit when an error happens inside the section 1202- ci/lavapipe: fixes typo 1203- ci/zink: fixup the zink-lvp job 1204- ci: disable mesa-swrast runner jobs 1205- ci/lava: implement the priority 1206- ci/weston: before testing, verify that XWayland is really running 1207- ci/weston: add background PID 1208- ci: add and utilize dalboz devices 1209- ci/amd: move skqp and va jobs on raven from XOrg to the XWayland 1210- ci/panfrost: correct the job name, as it runs on gles2 1211- ci/lava: every LAVA job doesn't want to run gles2 deqp, drop it 1212- ci: build Wayland support for the amd64 1213- ci/iris: update apl and glk expectations, after enabling Wayland support 1214- ci/clover: disable the jobs 1215- ci/traces: disable nheko trace with zink since it flakes 1216- ci/freedreno: add recent occasional flakes 1217- ci/traces: add two skips due to flakes 1218- ci/intel: add dEQP-EGL.functional.wide_color.window_fp16_default_colorspace flake 1219- ci: distribute XDG_RUNTIME_DIR with setup-test-env script 1220- ci: disable weston session timeout for llvmpipe 1221- meson: implement quirk for the compilation under armv7 GCC with LTO 1222- aco: drop leftover variable 1223- ci: bump Alpine to 3.17 (again) 1224- ci/freedreno: do not build tools executables without explicitly enabling them 1225- freedreno/decode: fix possible overflow 1226- ci: rename .lava-test to .lava-test-deqp to describe it correctly 1227- ci: create lava-test without deqp HWCI_TEST_SCRIPT 1228- ci: remove deqp from lava piglit and traces runs 1229- ci/freedreno: split deqp from other jobs 1230- 
ci/freedreno: define Google farm specific includes 1231- ci/freedreno: Make traces work on LAVA caching proxy 1232- ci/broadcom: test occasionally fails, but typically passes 1233- ci: disable lima farm, currently out-of-space, needs to be fixed 1234- ci: implement sections for cuttlefish 1235- ci/v3d: add flaking spec@ext_framebuffer_blit@fbo-sys-blit 1236- Revert "mesa: Enable NV_texture_barrier in GLES2+" 1237- ci/amd: update device status 1238- ci/amd: raven is currently downgraded to 2 machines only, adapt 1239- ci/amd: add draw.dynamic_rendering flake 1240- ci/freedreno: fix the a530_piglit job and switch to Weston 1241- panvk: clear dangling pointers 1242 1243David Redondo (1): 1244 1245- egl/wayland: fix oob buffer access during buffer_fds clean up 1246 1247David Rosca (2): 1248 1249- frontents/va: Use PIPE_USAGE_STAGING for coded buffer 1250- frontends/va: Map VAEncCodedBufferType buffer as PIPE_MAP_READ 1251 1252Dmitry Baryshkov (2): 1253 1254- freedreno/a5xx: reorder GPMU registers 1255- freedreno/a5xx: add SP clock control register 1256 1257Dmitry Osipenko (6): 1258 1259- util/cache_test: Unset env vars left after Cache.List test 1260- util/mesa-db: Don't account header size 1261- util/mesa-db: Support removal of cache entries 1262- util/cache_test: Remove dummy cache entry added by cache_exists() 1263- util/mesa-db: Introduce multipart mesa-db cache 1264- util/disk_cache: Switch to multipart mesa-db cache 1265 1266Dylan Baker (38): 1267 1268- VERSION: bump to 23.1.0-devel for further development 1269- docs: reset new_features.txt 1270- meson: bump minimum required version to meson 0.59 1271- meson: replace has_exe_wrapper with can_run_host_binaries 1272- meson: replace uses of ExternalProgram.path with .full_path 1273- meson: drop meson < 0.54 workaround 1274- meson: use a feature option for dri3 1275- meson: use a feature option for gallium-vdpau 1276- meson: use a feature option for gallium-va 1277- meson: use a feature option for gallium-xa 1278- 
meson: use a feature option for shader_cache 1279- meson: use a feature option for shared-glapi 1280- meson: use a feature option for gles1 1281- meson: use a feature option for gles2 1282- meson: use a feature option for gbm 1283- meson: use a feature option for llvm 1284- meson: use a feature option for valgrind 1285- meson: use a feature option for libunwind 1286- meson: use a feature option for lmsensors 1287- meson: use a feature option for power8 1288- meson: use a feature option for xlib-lease 1289- meson: use a feature option for zstd 1290- meson: use a feature option for egl 1291- meson: use a feature option for shared-llvm 1292- meson: Use feature option methods for xmlconfig 1293- meson: remove version checks for < 0.59 1294- meson: use builtin support for reading version from a file 1295- meson: use [] instead of 'lib for !windows name_prefix 1296- meson: use the same workaround for setting 'lib' on windows 1297- meson: combine checks for linker --gc-sections support 1298- util: rzalloc and free hash_table_u64 1299- iris: consider bufmgr creation to have failed if \`dup`ing of the fd fails 1300- intel/mi: use 64bit constant for bitshift 1301- intel/dev: create a helper dependency for libintel_dev 1302- docs: Add calendar entries for 23.0 release. 1303- docs: add release notes for 23.0.0 1304- docs: Add sha256 sum for 23.0.0 1305- docs/relnotes: add 23.0.0 to relnotes.rst 1306 1307Ella Stanforth (1): 1308 1309- v3dv: add support for multi-planar formats, enable YCbCr 1310 1311Emma Anholt (211): 1312 1313- dri2: Fix exposing robustness with swkms. 1314- ci/llvmpipe: Drop dEQP-EGL.functional.sharing.*.link.7 flakes. 1315- ci/iris: Add known flakes for skqp. 1316- ci/iris: Generalize the 8888_pbuffer EGL known flakes and share with GLK. 1317- ci/zink: Add more blit conversion xfails for a618. 1318- freedreno: Skip CPU/GPU timestamp sync when not supported. 1319- ci/freedreno: Add glx-swap-event-async as a flake. 
1320- freedreno/pps: Fix a signed/unsigned complaint. 1321- ci: Enable building the testing drivers with perfetto. 1322- ci: Add some new folks to the restricted-traces access list. 1323- Revert "nouveau/ci: temporary disable gk20a-gles" 1324- ci/virgl: Disable iris traces for now while it's unstable. 1325- ci: Drop windowoverlap xfails, since it's always skipped. 1326- ci/zink: Drop xfail for copy-sub-buffer. 1327- ci/zink: Drop glx-swap-copy xfails. 1328- ci/zink: Clear issue #7781 flakes. 1329- ci/freedreno: Switch the piglit job to using a deqp-runner suite. 1330- ci: Move PIGLIT_PLATFORM settings out of the .tomls. 1331- ci/piglit: Add some common piglit skips for Mesa CI's testing of glx. 1332- ci/piglit: Exclude swapbuffers front-readback tests with PIGLIT_PLATFORM=gbm. 1333- zink: Fatal error if requesting validation and we fail to load the layer. 1334- zink: Add missing Flat decorations on some inputs. 1335- zink: Fix validation failure for maxLod < minLod. 1336- zink: Fix up mismatches of memory model vs addressing model. 1337- zink: Re-emit the SpvBuiltInSampleMask access chain each load. 1338- ci/zink: Add coverage using the vulkan validation layer on lvp. 1339- ci/zink: Update TGL full-run xfails. 1340- ci/zink: Update radv xfails for the recent shadow fixes. 1341- ci/freedreno: Mark max-texture-size as a flake. 1342- ci: Move the performance jobs' allow_failure:true to the gl rules. 1343- ci: Add manual rules variations to disable irrelevant driver jobs. 1344- freedreno: Don't sync timestamps while perfetto isn't running. 1345- ci/zink: Disable Amnesia trace until the linked issue gets fixed. 1346- ci/zink: Move the zink-anv-tgl manual full run to custom manual deps. 1347- ci: Run our manual jobs during the nightly scheduled run. 1348- ci: Fix perf jobs blocking Marge pipelines. 1349- ci: Fix perf job condition. 1350- ci: Drop the itoral-gl-terrain demo from traces. 1351- tu: Mark tiling impossible if we couldn't lay out gmem in the first place. 
- turnip: Optimize tile sizes to reduce the number of bins.
- tu: Only emit the conditional gmem subpass resolves when gmem is possible.
- turnip: Make the tiling-impossible case have an impossible tile layout.
- gallivm: Optimize emit_read_invocation's first-invocation loop.
- gallivm: Refactor out a shared "get the first active invocation" loop.
- gallivm: Return 0 first_active_invocation when we know that up front.
- gallivm: Use cttz instead of a loop for first_active_invocation().
- gallivm: Use first active invocation in some image/ssbo accesses.
- ci/lvp: Drop the subgroupbroadcast skips.
- llvmpipe: Enable LP_DEBUG on normal builds.
- gallivm: Enable GALLIVM_DEBUG (mostly) on non-DEBUG builds.
- gallivm: Fix the type of array nir_registers.
- gallivm: Fix codegen performance for constant-index register array stores.
- gallivm: Do the same codegen improvement for constant-index array loads.
- ci/swrast: Drop skips for tests whose perf had been fixed.
- ci/llvmpipe: Drop skip of InteractionFunctionCalls2.
- ci/freedreno: Don't forget to report flakes on a618, too.
- u_trace: Add an interface for checking trace enablement outside a context.
- zink: Add tracing of blit operations.
- ci: Disable systems in my farm that haven't recovered.
- ci/zink: Update TGL full-run xfails.
- ci/freedreno: Disable the a306_piglit_gl job.
- ci/freedreno: Update a530 manual-run xfails.
- ci/freedreno: Add an xfail for a618 VK full run.
- ci/freedreno: Update a3xx piglit_shader xfails.
- ci/nouveau: Disable the gm20b jobs entirely.
- ci/radv: Update navi21 llvm xfails.
- ci/crocus: Update HSW expectations.
- ci/freedreno: Update manual-run xfails for a530.
- Revert "freedreno/a5xx: Fix clip_mask"
- ci/radv: Add a skip for navi21-llvm for a test that consistently timeouts.
- ci/etnaviv: Drop stale xfails from gc7000.
- ci/etnaviv: Update deqp xfails for gc2000.
- egl/kopper: Add assert for no kopper in dri2_copy_region.
- egl: Add a note explaining the swapBuffers badness in dri2_x11_copy_buffers().
- egl/kopper: Use the kopper private interface for swapBuffers.
- egl/kopper: Pass ancillary invalidate flush flags down to gallium.
- ci: Add a manual full and 1/10th hasvk CTS runs.
- hasvk: Silence conformance warning in CI.
- hasvk: Fix SPIR-V warning about TF unsupported on gen7.
- anv: Fix gfx8/9 VB range > 32bits workaround detection.
- hasvk: Fix gfx8/9 VB range > 32bits workaround detection.
- glsl: Drop the (v.x + v.y + v.z + v.w) -> dot(v, 1.0) optimization.
- ci/etnaviv: Drop one more gc7000 xfail.
- ci/freedreno: Drop a530 piglit_gl coverage.
- ci/turnip: Drop the #8219 xfail.
- ci/zink+turnip: Disable flaky minetest trace.
- ci/hasvk: Add a synchronization flake.
- ci: Fix stage of etnaviv manual runs.
- ci/zink: Add a glx flake on anv
- ci/crocus: Add new tess xfails and a link to the regression bug report.
- ci/crocus: Mark unvanquished as flaky.
- anv: Skip the RT flush when doing depth-only rendering.
- anv: Skip BTI RT flush if we're doing an op that doesn't use render targets.
- glsl/opt_algebraic: Drop ~~x == x transformation.
- glsl/opt_algebraic: Drop log(exp(x)) -> x and exp(log(x)) -> x optimisations.
- glsl/opt_algebraic: Drop pow-recognizer.
- glsl/opt_algebraic: Drop abs(-x) -> abs(x) and abs(abs(x)) -> abs(x).
- glsl/opt_algebraic: Drop -(-x) -> x optimization.
- glsl/opt_algebraic: Drop f2i(trunc(x)) -> f2i(x) optimization.
- glsl/opt_algebraic: drop fsat(fadd(b2f(x), b2f(y))) -> b2f(ior(x, y)) opt.
- glsl/opt_algebraic: Drop shifts of 0 optimizations.
- glsl/opt_algebraic: Drop pow optimizations.
- glsl/opt_algebraic: Drop rcp optimizations.
- glsl/opt_algebraic: Drop and/or/xor optimizations.
- glsl/opt_algebraic: Drop fdiv(1,x) -> frcp(x) and fdiv(x,1) -> x optimizations.
- glsl/opt_algebraic: Drop add/sub with 0 optimizations.
- glsl/opt_algebraic: Drop x + -x -> 0 optimization.
- glsl/opt_algebraic: Drop csel(true/false, x, y) optimization.
- nir: Add optimization for fdot(x, 0) -> 0.
- glsl/opt_algebraic: Drop fdot 0-channel optimizations.
- glsl/opt_algebraic: Drop scalar all_eq/any_neq -> eq/neq opt.
- glsl/opt_algebraic: Drop the eq/neq add-removal optimization.
- glsl/opt_algebraic: Drop no-op pack/unpack optimization.
- glsl/opt_algebraic: Drop the flrp/ffma simplifiers.
- glsl/opt_algebraic: Drop some fmul simplifications.
- nir: Port a floor->truncate algebraic opt pattern from GLSL.
- glsl/opt_algebraic: Drop the ftrunc pattern recognizer.
- glsl/opt_algebraic: Drop the flrp recognizer.
- glsl: Remove unused as_rvalue_to_saturate().
- ci: Update traces expectations for gutting glsl opt_algebraic.
- panfrost/midgard: Fix handling of csel with a vector constant condition.
- panfrost/midgard: Drop redundant arg to emit_explicit_constant.
- glsl: Move lower_vector_insert to GLSL-to-NIR.
- nir/split_64bit_vec3_and_vec4: Handle 64-bit matrix types.
- gallivm: Return 0 for first active invocation when no invocations are active.
- gallivm: Use first_active_invocation for ubo/kernel memory loads.
- gallivm: Use first_active_invocation for scalar SSBO loads.
- gallivm: Add some notes about other invocation_0_must_be_active usages.
- ci: Add some xfail updates from VKCTS 1.3.5.0 for the manual jobs.
- ci/etnaviv: Drop the dEQP-GLES2.functional.uniform_api.random.94 xfail.
- anv+hasvk: Use driconf to disable 16-bit for zink.
- zink: Pass the cmdbuf to the end of the marker, too.
- Revert "ci: disable mesa-swrast runner jobs"
- ci: Re-enable some swrast testing using fd.o's shared runners for now.
- glsl/nir: Include early glsl-to-nir output in NIR_DEBUG=print.
- glsl_to_nir: Use a variable's constant_value if it wasn't const-propped out.
- glsl: Delete constant propagation pass.
- glsl: Delete constant folding pass.
- glsl: Delete constant-variables pass.
- ci: Update trace expectations for GLSL constant prop removal.
- ci/zink: Update TGL xfails/flakes based on the last nightly pipelines.
- ci/turnip: Extend a630 vk full timeout to 3 hours.
- ci/iris: Add skips for slow tests on APL.
- turnip: Don't push inline uniform buffer contents outside constlen.
- ci/turnip: Clear out stale xfails.
- ci/turnip: Disable dEQP-VK.image.queue_transfer.* for now.
- ci/turnip: Move some more of the 1.3.5 new xfails under links.
- glsl: Simplify vector constructors from scalars.
- glsl/lower_precision: Add a unit test that I thought we might fail at.
- glsl/lower_precision: Add a cut-down testcase for #8124
- glsl: Set the precisions of builtin function arguments and returns.
- glsl: Handle highp promotion of builtin function args in the builtins.
- glsl: Set the precision of function return value temporaries.
- glsl/lower_precision: Drop most special-casing of builtin arg precision.
- glsl: Fix the precision of atomic counter builtin function args.
- glsl/lower_precision: Add actual spec quotes for "check_parameters"
- nir/lower_mediump: Fix assertion about copy_deref lowering matching.
- ci/iris: Update more manual job xfails from the Wayland build change.
- ci/crocus: Update expectations from VK CTS 1.3.5.0.
- ci/hasvk: Update some xfails from the 8-sample fast clear disable.
- ci/etnaviv: Get the gc2000_piglit manual job mostly working.
- glsl/standalone: Pull program create/destroy out to a public function.
- glsl/standalone: Pull out a helper function for adding GLSL source shaders.
- glsl/standalone: Make all standalone contexts have NewProgram set.
- glsl: Write a new test for GLSL and NIR mediump lowering.
- ci/crocus: Fix 1.3.5.0 xfails.
- ci/etnaviv: Polish the gc2000 xfails a bit.
- ci/zink: Update the tgl manual run xfails.
- gallivm: Skip loads/stores that are definitely outside of compact vars.
- nir/lower_sysvals: Add support for un-lowered tess_level_inner/outer.
- nir_to_tgsi: Handle stores to compact outputs.
- glsl: Delete the lower_tess_level pass.
- glsl: Remove the TessLevel lowering special case from xfb.
- glsl: Drop dead prototype.
- ci/freedreno: Flake KHR-GL45.shader_image_load_store.basic-allTargets-store
- ci/broadcom: Skip another texelfetch case.
- perfetto: Add a .clang-format for the directory.
- intel/perfetto: Drop unused "pipelined" field.
- perfetto: Make a MesaRenderpassDataSource with common setup/start/stop.
- perfetto: Deduplicate clock sync packet emit from renderstage sources.
- perfetto: Move intel's cmdbuf/queue annotation code to the shared util.
- ci/zink: Drop validation exception for leaks at device destroy.
- ci/zink: Disable godot-tps-gles3 on a630.
- docs: Update Vulkan renderpass docs for !22191
- ci: Add missing dependency on doxygen sources for docs-generation jobs.
- docs: Claim less functionality for glsl_compiler.
- glsl: Move ForceGLSLAbsSqrt handling to glsl-to-nir.
- zink: Add mapping for nir_op_ldexp, but disable it for 64-bit's sake.
- glsl: Retire ldexp lowering in favor of the nir lowering flag.
- glsl/softfp64: GC the temp vars after we lower them to SSA.
- glsl/softfp64: Add fisfinite lowering.
- state_tracker: Lower frexp before lowering doubles.
- intel: Always call nir_lower_frexp.
- ir3: Move turnip's nir_lower_frexp to the shared compiler.
- nouveau: Add missing nir_opt_algebraic_late.
- nouveau: Enable frexp lowering in the backend.
- zink: Enable nir_lower_frexp.
- v3d: Lower frexp in the GL compiler like we do in Vulkan.
- agx: Enable nir_lower_frexp.
- panfrost/midgard: Enable nir_lower_frexp.
- nir_to_tgsi: Always lower frexp_exp/sig.
- glsl: Drop frontend lowering of 32-bit frexp.
- glsl: Drop PIPE_SHADER_CAP_DFRACEXP_DLDEXP_SUPPORTED.
- tgsi: Drop TGSI_OPCODE_DFRACEXP.
- ci/zink: Disable a630 portal-2-v2 due to kernel OOMs.
- etnaviv: Fix regression from if_uses change.
- blob: Don't valgrind assert for defined memory if we aren't writing.
- util/log: Fix log messages over 1024 characters.
- vulkan: Handle alignment failure in the pipeline cache.
- vulkan: Actually increment the count of objects in GetPipelineCacheData.
- ci/radeonsi: Mark glx-make-current as flaky.

EmperorPenguin18 (1):

- v3d: expose more drm formats with SAND128 modifier

Eric Engestrom (172):

- bin/ci: add gitlab_gql.py.cache to the .gitignore
- mesa/st: drop unused param
- ci/bare-metal: add more timestamps to help debugging issues
- ci: be explicit about the \`meson setup` subcommand
- docs: add release notes for 22.3.4
- docs/relnotes: add sha256sum for 22.3.4
- docs: update calendar for 22.3.4
- meson: turn android-libbacktrace into a feature option
- v3dv: mark dEQP-VK.api.command_buffers.record_many_draws_secondary_2 as flaky
- ci/android: move common config to common job
- ci/android: move virgl-specific gpu_mode to virgl-defined variables
- ci/android: move virgl-specific fails/flakes/skips lists to virgl-defined variables
- ci/android: move virgl-specific deqp suite to virgl-defined variables
- ci/android: move virgl-specific so lib name to virgl-defined variables
- ci/android: add missing line terminator at the end of the file
- docs: add release notes for 22.3.5
- docs: update calendar for 22.3.5
- panfrost: drop no-longer-needed libglsl
- gallium/u_screen.h: add missing stdint.h include
- util: avoid calling kcmp on Android
- etnaviv: use simple_mtx to avoid breaking windows in the next commit
- gallium: move etnaviv screen_lookup_or_create function to common code
- freedreno: replace custom code with u_pipe_screen_lookup_or_create()
- lima: replace custom code with u_pipe_screen_lookup_or_create()
- v3d: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
- vc4: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
- panfrost: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
- asahi: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
- u_pipe_screen_lookup_or_create: avoid re-querying the fd to have a consistent hash key
- broadcom/ci: mark test as flaky
- vk/util: keep track of extension requirements
- vk/runtime: keep track of supported instance extensions
- vk/runtime: turn vk.xml extension requirements into asserts
- meson: move float64_glsl_file one meson.build up
- meson: only build mapi when needed
- meson: only build the loader when needed
- meson: only build libglsl_util when needed
- meson: only build glsl when needed
- meson: drop \`TODO: opengl`, it's done
- ci: simplify adding & removing deqp patches
- ci: remove no-op sed
- ci: fix grouping of image tags
- ci: bump tags of deqp images
- docs: add 23.1 branchpoint & rc dates
- meson: make GLX require OpenGL
- meson/windows: only build libgl-gdi for desktop gl
- meson: allow building GLES without GL
- mesa: add _mesa_is_desktop_gl_compat() and _mesa_is_desktop_gl_core() helpers
- mesa: make use of the new _mesa_is_desktop_gl_compat() helper
- mesa: make use of the new _mesa_is_desktop_gl_core() helper
- mesa: make more use of the existing _mesa_is_gles* helpers
- mesa: add & use new _mesa_is_gles1() & _mesa_is_gles2() helpers
- mesa: make more use of the new _mesa_is_gles1() helper
- mesa: make more use of the new _mesa_is_gles2() helper
- mesa: optimize out _mesa_is_desktop_gl*() and _mesa_is_gles*() calls when not built
- ci: stop watching for changes in removed script
- meson: improve formatting of options file
- broadcom/ci: refactor a bit
- broadcom/ci: fold .vc4-rpi3-piglit:armhf into its only user
- broadcom/ci: use deqp-runner to run piglit tests
- docs/release-calendar: drop the last 22.2.x, it won't happen
- broadcom/ci: group x11 and wayland variant of the same test failing
- broadcom/ci: use weston's xwayland instead of starting X as well
- broadcom/ci: add x11- prefix to x11 EGL tests
- broadcom/ci: drop create_pixmap_surface from the fails; it passes now
- broadcom/ci: skip buffer_age.no_preserve and swap_buffers_with_damage on wayland
- broadcom/ci: add two known failures
- broadcom/ci: re-enable egl on wayland
- docs: include explicit \`setup` in instructions
- docs: add release notes for 22.3.6
- docs/relnotes: add sha256sum for 22.3.6
- docs: update calendar for 22.3.6
- v3d: update supertuxkart reference after 1c028a4d5b623e73bdf5
- docs: mention the meson summary
- docs: mention \`meson configure` and drop broken workaround script
- meson: reuse vulkan_wsi_list for defining vk_wsi_args
- meson: replace vk_wsi_args with dependencies to let meson take care of transitivity
- egl: include directly the useful vulkan header, instead of including everything
- glx: include directly the useful vulkan header, instead of including everything
- gbm: drop unnecessary vulkan dependency
- radv: split linker script for android since it requires different symbols
- glsl: align definition of _mesa_problem with the one in main/error.h
- glapi/meson: drop duplicate line in deps
- meson: allow checking for null pointers even if they're supposed to be non-null
- panfrost/ci: add EGL tests
- asahi/winsys: add .clang-format
- vk: move radv's linker symbols scripts for use in all drivers
- v3dv: add linker script to fix android symbols
- tu: add linker script to fix android symbols
- anv: add linker script to fix android symbols
- vn: add linker script to fix android symbols
- android/vk: drop unnecessary symbols
- vk: be stricter about symbols check between android and other platforms
- v3d/ci: add dEQP-GLES3.functional.texture.specification.teximage2d_pbo.*_cube flakes
- osmesa: add exported symbols check
- docs: add release notes for 22.3.7
- docs/relnotes: add sha256sum for 22.3.7
- docs: update calendar for 22.3.7
- v3dv/ci: add a test to the known failures
- meson: bump minimum version to 0.60
- meson: allow feature options to take true/false to mean enabled/disabled
- meson: inline gtest_test_protocol now that it's always 'gtest'
- v3dv: split out broadcom_shader_stage_to_gl() calls to improve readability
- ci: take valve farm offline
- ci: disable weston session timeout
- broadcom/ci: no need to skip the tests that swap buffers anymore
- ci/broadcom: move rare failure to the flakes
- ci: drop redundant .no_scheduled_pipelines-rules + .core-rules since the latter already includes it
- ci/rustfmt: simplify getting all the rust files
- ci/rustfmt: print which files are checked
- ci: group RESULT logic in a single place
- v3dv/ci: fix test name (\`,Fail` is not part of the test name)
- asahi: replace copies of .clang-format with symlinks
- asahi: fix a few typos
- v3d: fix \`dirty` bitset being too small to accept V3D_DIRTY_SSBO
- v3dv: use common GetPhysicalDeviceFeatures
- v3dv: reorder features as 1.0, 1.1, 1.2, 1.3
- v3dv: use vk_get_physical_device_features
- v3d/ci: add another depthstencil-default_fb-drawpixels-* to the flakes
- v3d/ci: group dEQP-GLES3.functional.texture.specification.teximage2d_pbo.* flakes and add another one
- ci: centralize detection of ccache in link-werror wrapper
- ci: add linker wrapper for clang
- ci: always use the -Werror wrapper
- ci: deduplicate compiler wrappers
- ci/docs: start documenting ci_run_n_monitor.py
- v3d: add link to issue investigating failure
- asahi: change create_renderonly signature to uniformize it
- etnaviv: change create_renderonly signature to uniformize it
- freedreno: change create_renderonly signature to uniformize it
- lima: change create_renderonly signature to uniformize it
- panfrost: change create_renderonly signature to uniformize it
- v3d: change create_renderonly signature to uniformize it
- vc4: change create_renderonly signature to uniformize it
- kmsro: uniformize renderonly creation
- kmsro: sort drivers alphabetically
- ci/broadcom: consolidate vc4-rpi3* jobs into a single vc4-rpi3-gl:armhf
- ci/broadcom: consolidate v3d-rpi4* jobs into a single v3d-rpi4-gl:armhf
- ci/broadcom: slightly increase coverage of vk tests
- vc4/ci: add arm64 failure to flakes as it works on armhf
- broadcom/ci: run gl jobs on arm64, just like vk
- vc4/ci: add another sync flake
- panfrost: assign the correct create_for_resource from the start
- Revert "broadcom/ci: run gl jobs on arm64, just like vk"
- v3dv/ci: mark known dEQP-VK.wsi.xlib.surface.query_formats failure
- ci/rustfmt: make sure to only check each file once
- v3d: disable GL_NV_conditional_render
- VERSION: bump for 23.1.0-rc1
- .pick_status.json: Update to 8ebc5cbe2b828f34b9bfb32c528d3514ead59798
- v3dv/ci: drop fixed failure from fails.txt
- .pick_status.json: Update to 0d7912d239dac5bf3c8b07f2a6ca467f760d6aa6
- .pick_status.json: Update to 543b6ca7c4b00c4bfff5668ba0a0643d565db201
- amd: fix buggy usage of unreachable()
- compiler: fix buggy usage of unreachable()
- pvr: fix buggy usage of unreachable()
- vk/util: fix buggy usage of unreachable()
- v3d: add flake spec@ext_framebuffer_blit@fbo-sys-sub-blit
- VERSION: bump for 23.1.0-rc2
- .pick_status.json: Update to 3017d01c9ded9c9fd097b600081b1bbe86e90fb8
- .pick_status.json: Update to a18a51a708a86f51e0a5ab031b379f65bc84fb49
- .pick_status.json: Update to c060b649c5a866f42e5df73f41c6e2809cf30e99
- ci: rework vulkan validation layer build script
- .pick_status.json: Update to 3f14fd8578549e34db2f564396f300819b2ff10f
- VERSION: bump for 23.1.0-rc3
- .pick_status.json: Update to 040aeb5a23e5cc8a71a352e55282d514dd2ab64f
- .pick_status.json: Update to 9f522ac0c65ceae11ad1a4e84ec9f32a9393a25c
- .pick_status.json: Update to efc94390f716b70ac1d5b09c6f949f938aeadcac
- VERSION: bump for 23.1.0-rc4
- .pick_status.json: Update to 6d84b34359dcbad477209adb9f9d0592c5a71bb9
- .pick_status.json: Update to cb4e4fc5de48886758a26ff19d322947b5abfcec
- dzn: fix pointer type mismatch
- .pick_status.json: Update to 57afa7c0b12d6d0c9013368853080dfea5b50d07
- .pick_status.json: Update to 31e6d15801a9904089aa2913c8eb5a31b79c7dfc

Erico Nunes (5):

- lima/ci: Add more piglit unsupported tests to skip
- Revert "CI: Lima farm is offline"
- lima: don't use resource_from_handle while creating scanout
- lima/ci: restore swap buffers egl tests
- Revert "ci: disable lima farm, currently out-of-space, needs to be fixed"

Erik Faye-Lund (54):

- zink: whitespace fixup
- zink: fix depth-clip disable cap
- zink: remove depth_clip_control_missing workaround
- radeonsi: respect smoothing_enabled
- meson: remove dupliace add_devenv call
- meson: remove deprecated osmesa-bits option
- meson: remove deprecated dri-drivers option
- meson: avoid using deprecated build_root() method
- meson: use files() instead of joining paths
- freedreno/meson: simplify script-path logic
- meson: do not reconstruct ICD paths
- anv, hasvk: remove stale TODO-files
- zink: correct companies in requirements
- zink: remove incorrect trailing comma
- meson: remove unused USE_FOO_ASM defines
- vulkan: prefer vulkan_core.h over vulkan.h
- meson: don't pass vk wsi args where they don't belong
- Revert "meson: Fix Asahi build on macOS"
- zink: prefer vulkan_core.h over vulkan.h
- zink: get rid of needless dependency
- ci: correct typo in name of linkcheck job
- docs: update link to intel optimization reference manual
- nir: add a print_internal debug-flag
- docs: implement new vk-feat role
- docs/zink: use vk-feat role for features
- docs/zink: remove some trailing spaces
- docs/zink: fixup wording of the GL 4.6 requirements
- meson: correct typo in comment
- ci: move docs-stuff out of root .gitlab-ci.yml
- docs: fixup broken envvar-role syntax
- docs: escape a few more strings
- docs: fixup broken indentation
- docs/zink: mention vk1.2 mirror-clamp feature option
- docs/zink: clean up requirements-language
- docs: move developers article to main website
- docs: remove old thanks-article
- docs: prefer http-links over ftp
- docs/freedreno: fix turnip-heading level
- docs: drop reference to modindex
- docs: move old relnotes to _extra directory
- docs: use version-number as toctree-title for relnotes
- zink: emit terminate for spir-v 1.6
- zink: use demote from spir-v 1.6 when possible
- zink: use spir-v 1.6 local-size when needed
- zink: enable spir-v 1.6 for vulkan 1.3
- docs: format code-block as ini
- docs: format code-block as toml
- docs: make code-block indents consistent
- ci: move virgl-rules after intel-rules
- virgl/ci: clean up manual rules for virgl
- ci: remove unused rules
- zink: do not use sampled-image for buffers
- nir: fix constant-folding of 64-bit fpow
- llvmpipe: fixup refactor copypasta

Faith Ekstrand (99):

- nir: Add more opcodes to nir_tex_instr_is_query()
- nir/builder: Add some texture helpers
- radv: Use the new NIR builder tex helpers for meta
- anv: Refactor Android externalFormat handling in CreateYcbcrConversion
- anv/android: Use VkFormat for externalFormat
- util/format: YUYV and UYVY have 4 8-bit channels
- vulkan/formats: Add YCbCr format information
- vulkan: Add a common vk_ycbcr_conversion struct
- anv: Use the common vk_ycbcr_conversion object
- anv: Use the YCbCr format info from common code
- nir: Add copyright and include guards to nir_vulkan.h
- anv,nir: Move the ANV YCbCr lowering pass to common code
- gallium,util: Pull u_indices and u_primconvert back into gallium
- mailmap: Remap e-mail addresses for Faith Ekstrand
- vtn: Set alignment on initial UBO/SSBO casts
- anv: Let spirv_to_nir() set UBO/SSBO base cast alignments
- hasvk: Let spirv_to_nir() set UBO/SSBO base cast alignments
- intel/compiler: Document wm_prog_key::persample_interp
- intel/nir: Lower barycentrics to per-sample in a dedicated pass
- nir: Remove nir_lower_io_force_sample_interpolation
- intel/compiler: Use SHADER_OPCODE_SEND for PI messages
- intel/fs: Return early in a couple builtin setup helpers
- intel/compiler: Convert brw_wm_aa_enable to brw_sometimes
- intel/fs: Make per-sample and coarse dispatch tri-state
- intel/compiler: Convert wm_prog_key::persample_interp to a tri-state
- intel/compiler: Convert wm_prog_key::multisample_fbo to a tri-state
- intel/fs/validate: Assert SEND [extended] descriptors are uniform
- intel/fs: Break out yet another FB write helper
- intel/fs: Rework dynamic coarse handling
- nir/deref: Preserve alignments in opt_remove_cast_cast()
- nir/from_ssa: Use more helpers in resolve_parallel_copies
- nir/from_ssa: Only re-locate values that are destinations
- nir/from_ssa: Move the loop bounds check in resolve_parallel_copy
- nir: Add a load/store bit size lowering pass
- intel/nir: Use nir_lower_mem_access_bit_sizes()
- Revert "vk/runtime: turn vk.xml extension requirements into asserts"
- Revert "vk/util: keep track of extension requirements"
- vulkan: Remove unused fields from Extension and ApiVersion
- vulkan: Improve extension parsing
- vulkan: Parse the platform in Extensions.from_xml()
- vulkan: Add a get_all_required() helper
- vulkan: Properly filter entrypoints
- vulkan: Properly filter by api in enum_to_str
- Vulkan: Properly filter structs in vk_cmd_queue_gen
- vulkan: Filter out provisional extensions
- vulkan: Move the features generator to vulkan/util
- vulkan: Properly filter structs in vk_physical_device_features
- vulkan/layers: Use PUBLIC instead of VK_LAYER_EXPORT
- vulkan/device-select-layer: Include vulkan.h
- vulkan: Update the XML and headers to 1.3.241
- nir/lower_io: Handle buffer_array_length for more address modes
- anv: Drop our manual SSBO size handling
- hasvk: Drop our manual SSBO size handling
- panvk: Drop our manual SSBO size handling
- turnip: Set spirv_options::use_deref_buffer_array_length
- lavapipe: Set spirv_options::use_deref_buffer_array_length
- v3dv: Set spirv_options::use_deref_buffer_array_length
- spirv: Always emit deref_buffer_array_length intrinsics
- nir: Check against combined alignment in nir_lower_mem_access_bit_sizes
- nir: Add mode filtering to lower_mem_access_bit_sizes
- nir: Add UBO support to nir_lower_mem_access_bit_sizes
- nir: Add a combined alignment helper
- nir: Rename align to whole_align in lower_mem_load
- nir: Rename nir_mem_access_size_align::align_mul to align
- nir: Make chunk_align_offset const in lower_mem_load()
- nir: Handle wider unaligned loads in lower_mem_access_bit_size
- intel/nir: Limit unaligned loads to vec4
- vulkan/runtime: Rename and document storage image Z range
- intel/blorp: Set array_len for 3D images properly
- isl: Set Depth to array len for 3D storage images
- intel: Use nir_lower_tex_options::lower_index_to_offset
- vulkan: Update XML and headers to 1.3.244
- vulkan: Provide wrappers for VK_EXT_map_memory2 functions
- anv: Limit memory maps to the client-allocated size
- anv: Implement VK_KHR_map_memory2
- intel/isl: Support Yf/Ys/Tile-64 in isl_surf_get_image_offset_sa
- intel/blorp: Drop the TODO file
- docs: Fix Faith's name in relnotes
- nir: Drop a bunch of Authors tags
- spirv: Drop a bunch of Authors tags
- intel: Drop some author comments and update Faith's name
- util,mesa,panfrost: Drop some author tags
- vulkan: vk_android.c should be copyright Intel
- util: Update some copyright tags
- CODEOWNERS: s/jekstrand/gfxstrand
- vulkan,anv,hasvk,radv: Add a common vk_image_usage_to_ahb_usage helper
- vulkan/android: Fix hardware buffer usage flags
- vulkan: Add an ahardware_buffer_format field to vk_image
- anv,hasvk: Set vk_image.ahardware_buffer_format
- radv: Set vk_image.ahardware_buffer_format
- vulkan,anv,hasvk,radv: Unify Android hardware buffer creation
- vulkan: Add a vk_device_memory base struct
- anv: Use the new vk_device_memory base struct
- vulkan: Record pipeline flags in the render pass
- vulkan: Plumb rendering flags through vk_graphics_pipeline_state
- anv/pipeline: Use feedback loop flags for self-dependencies
- hasvk/pipeline: Use feedback loop flags for self-dependencies
- vulkan: Drop vk_render_pass_state::\*self_dependenc*
- vulkan: Drop VkRenderingSelfDependencyInfoMESA

Felix DeGrood (10):

- intel/perf: Hide extended metrics by default
- anv: cs_stall during compute state flush on < gen12.5
- anv: only emit CFE_STATE when scratch space increases
- anv: set CFE_STATE.OverDispatchControl to default
- iris: report draw count for perfetto
- anv/blorp: support surf generation for addresses
- anv/blorp: implement anv_cmd_buffer_fill_area
- anv/blorp: add flush reasons to RT flushes
- anv: reset query pools using blorp
- anv: disable reset query pools using blorp opt on MTL

Filip Gawin (2):

- crocus: don't quantize the clear value
- nine: add fallback for D3DFMT_D16 in d3d9_to_pipe_format_checked

Francisco Jerez (11):

- intel/fs/gfx12: Ensure that prior reads have executed before barrier with acquire semantics.
- intel/disasm/gfx12+: Use helper instead of hardcoded bit access for 64-bit immediates.
- intel/disasm/gfx12+: Fix print out of non-existing condmod field with 64-bit immediate.
- intel/eu/gfx12+: Implement decoding of 64-bit immediates.
- intel/fs/gfx12+: Drop redundant handling of SHADER_OPCODE_BROADCAST in exec pipe inference.
- intel/fs: Fix src and dst types of LOAD_PAYLOAD ACP entries during copy propagation.
- intel/eu/gfx8-9: Fix execution with all channels disabled due to HW bug #220160235.
- intel/rt: Fix L3 bank performance bottlenecks due to SW stack stride alignment.
- intel/fs: Track force_writemask_all behavior of copy propagation ACP entries.
- intel/fs: Fix copy propagation dataflow analysis in presence of force_writemask_all ACP overwrites.
- intel/fs: Fix register coalesce in presence of force_writemask_all copy source writes.

Frank Binns (7):

- pvr: small cleanups
- pvr: remove start/stop transfer flags
- pvr: stop restricting the compiler to the Sascha Willems triangle demo
- pvr: remove duplicate define
- pvr: initialise size for placeholder "zeroed" shaders
- pvr: replace nop binary shader with run-time compiled shader
- pvr: fix clang-format issue

Friedrich Vock (26):

- radv/rt: Divide by the correct workgroup size
- radv/bvh: Prevent NANs when computing node cost
- radv/rmv: Also check the other pid field
- radv/rmv: Avoid more CPU unmap deadlocks
- radv/rmv: Log bo destruction before freeing it
- radv/rmv: Correct timestamp shifting
- vulkan/rmv: Use the timestamp divisor instead of a hardcoded value
- vulkan/rmv: Remove delta parameter from dump helpers
- mesa: Report GL_SHADER_BINARY_FORMAT_SPIR_V as supported
- docs: Fix formatting for RMV tracing docs
- radv: Extend hit attribute lowering for LDS
- radv: Use LDS for closest-hit hit attributes
- radv: Emit RT shader VA user SGPR
- radv/rt: Add shader config combination/postprocessing utils
- radv: Add RT shader stage names for executable properties
- aco: Swap operands for v_and_b32 in RT prolog
- radv/rt: Also adjust the SGPR count in postprocess_rt_config
- aco: Un-swap addressable VGPRs/SGPRs in RT prolog
- radv: Work around use-after-free compiler errors
- radv: Add RT stages to radv_mesa_to_rgp_shader_stages
- radv/rmv: Fix creating RT pipelines
- radv/rmv: Fix import memory
- radv/rt: Plug some memory leaks during shader creation
- radv: Don't leak the RT prolog binary
- radv: Always call si_emit_cache_flush before writing timestamps
- radv: Add driconf to always drain waves before writing timestamps

GH Cao (1):

- gallium: Add MCJIT target triplet for Windows ARM64

Ganesh Belgur Ramachandra (1):

- ac/nir: fix CDNA image lowering for array textures

Georg Lehmann (81):

- Revert "aco: Combine v_cvt_u32_f32 with insert to v_cvt_pk_u8_f32."
- aco: use s_bfm_64 for constant copies
- aco: use s_pack_ll_b32_b16 for constant copies
- aco: Improve wave64 cycle estimates.
- aco: fix imod/omod for gfx11 VOP3 opcodes
- aco: add mov/cndmask opcodes to does_fp_op_flush_denorms
- aco: don't allow output modifiers for v_cvt_pkrtz_f16_f32
- aco: allow output modifiers for ldexp_f16
- aco: don't list imod/omod support v_fmaak_f32/v_fmamk_f32
- aco: support omod/imod for v_fmac_f16
- aco: remove stale TODOs about v_interp opsel
- aco: new 16bit VOP3 opcodes can use opsel
- aco: Don't use vcmpx with DPP.
- aco: combine a ^ ~b and ~(a ^ b) to v_xnor_b32
- amd,nir: remove byte_permute_amd intrinsic
- nir: change 16bit image dest folding option to per type
- amd: don't use d16 for integer loads
- amd: d16 uses rtz conversion for 32bit float
- aco: use v_permlane(x)16_b32 for masked swizzle
- aco/gfx11: use dpp_row_xmask and dpp_row_share
- aco: use and swizzle mask in dpp quad perm
- aco/optimizer_postRA: assume all registers are untrackable in loop headers
- nir/opt_algebraic: add patterns for iand/ior of feq/fneu with 0
- aco: mark mad definition as precise if the mul/add were precise
- aco: use v_fma_mix_f32 for v_fma_f32 with 2 fp16 representable, different literals
- nir/lower_mediump: don't use fp16 for constants if the result is denormal
- aco: treat VINTERP_INREG as VALU
- aco/ir: rework IR to have one common valu instruction struct
- aco/ra: set opsel_hi to zero when converting to VOP2
- aco: validate VALU modifiers
- aco/print_ir: simplify using VALU instruction
- aco/optimizer: simplify using VALU instruction
- aco: remove VOP[123C]P? structs
- aco: add bitfield array helper classes
- aco: use bitfield array helpers for valu modifiers
- aco/assembler/gfx11: simplify 16bit VOP12C promotion to VOP3
- aco/optimizer: don't reallocate instruction when converting to VOP3
- aco: don't reallocate fma{mk,ak,_mix} instruction
- aco: copy abs/neg with assignment
- aco: use integer access for neg_lo/neg_hi
- aco: use array indexing for opsel/opsel_lo/opsel_hi
- aco: access neg/abs as int in usesModifiers
- aco: use bitfield_array for temporary neg/abs/opsel
- nir: optimize i2f(f2i(fsign))
- aco: remove duplicates from .clang-format
- amd: remove duplicate from .clang-format
- aco: don't check usesModifiers for pseudo instructions
- aco: fix p_interp_gfx11 comment
- aco: make .clang-format usable with tests
- aco/ir: fix copy paste bug in convert_to_SDWA
- aco/util: override default assignment operator for bitfield helpers
- aco: clean up to_mad_mix
- aco/ra: don't reallocate VOP3 instruction for non-vcc lane mask
- aco/vn: hash opsel for VOP12C
- aco/assembler: support VOP12C opsel
- aco: validate VOP12C opsel
- aco/to_hw_instr: use VOP1 opsel for v_mov_b16
- aco/ra: prepare for VOP12C opsel
- aco/optimizer: preserve opsel when fusing fma
- aco: handle opsel in combine_comparison_ordering
- aco: handle opsel in combine_ordering_test
- aco: handle opsel in combine_constant_comparison_ordering
- aco: update match_op3_for_vop3 for VOP12C opsel
- aco: support v_cvt_f32_f16 with opsel in combine_mad_mix
- aco: support neg(mul)/abs(mul) optimization in more cases
- aco: return true in usesModifiers for VOP12C with opsel
- aco: swap opsel when swapping VOP2/C operands
- aco/ir: copy opsel when converting to DPP
- aco: don't label mul with opsel as abs/neg
- aco/gfx11: allow opsel for VOP12C
- aco/optimizer: use opsel for VOP12C
- aco: keep
label_mul/usedef/minmax in apply_extract 2024- aco/optimizer: remove to_SDWA 2025- aco: add tests for fma with opsel 2026- aco: add tests for dpp with opsel 2027- aco: add tests for swap operand with opsel 2028- aco: add tests for cmp ordering with opsel 2029- aco: add test for min/max combining with opsel 2030- aco/tests: run optimize.mad_mix.input_conv.modifiers on gfx11 2031- aco: add tests for neg(mul) with opsel 2032- aco/tests: add missing dependency on generated header 2033 2034Gert Wollny (49): 2035 2036- glsl/nir: only set uses_sample_shading when the output is a fbfetch 2037- nir: Add possibility to store image var offset in range_base 2038- nir: Add range_base to atomic_counter and an option to use it 2039- ntt: handle the image intrinsic range_base when translating to TGSI 2040- ntt: Make use of the range_base offset when translating atomics in NTT 2041- virgl: lower image variable offsets into the intrinsic range_base value 2042- virgl: Request setting the atomic offset in the range_base 2043- virgl: drop the separable flag for cases that can't be handled 2044- r600/sfn: Fix readport check 2045- r600/sfn: Do a bit of cleanup with the secondary read port validation 2046- r600/sfn: Fix opcode and result dest slot mask for variable size dot 2047- r600/sfn: Fix splitting of multislot alu ops 2048- virgl: remove unused virgl_encoder_inline_write 2049- r600/sfn: Use range_base for atomics and images 2050- r600/sfn: Work around dependency issue when splitting op to group 2051- r600/sfn: drop useless instr use count 2052- r600/sfn: Fix a typo 2053- r600/sfn: Silence warnings about unused parameters 2054- r600/sfn: Don't copy propagate indirect loads to more than one dest 2055- r600/sfn: Stop try scheduling in t-slot with empty related v-slot 2056- r600/sfn: rename texture coordinate offset for clarity 2057- r600/sfn: address use in group only if instr can be added 2058- r600/sfn: Forward setting the block ID and index 2059- r600: Don't start new CF for every 
fetch through tex clause 2060- r600/sfn: Fix handling of fetch through texture clause 2061- r600/sfn: Fix alu trans op flag setup 2062- r600/sfn: Fix Cayman trans from string and add test for copy prop 2063- vulkan/wsi: Take Xwayland into account for x11_min_image_count 2064- zink/kopper: Add extra swapchain images for Venus 2065- r600/sfn: be more conservative with channel use in multi-slot ops 2066- r600/sfn: Fix readport cylce map 2067- r600/sfn: Fix minimum required registers 2068- r600/sfn: Add AluGroup method to update readport validation from scratch 2069- r600/sfn: Split AluInstr replace_source into test and actual replace 2070- r600/sfn: Add method to AluGroup to replace sources 2071- r600/sfn: Add print method to AluReadportValidation 2072- r600/sfn: redirect copy propagation to alu parent group 2073- r600/sfn/tests: Add a test for the copy prop into a group 2074- r600/sfn: Fix atomic lowering 2075- virgl: Enable AMD_vertex_shader_(layer|viewport_index) when host supports it 2076- virgl: Don't try to do re-alloc or readback by transfer for blob resources 2077- ntt: add option to lower SSBO bindings to buffer index 2078- virgl: Lower binding start into buffer indices 2079- r600/sfn: fix container allocators 2080- r600/sfn: Lower tess levels to vectors in TCS 2081- r600/sfn: make sure f2u32 is lowered late and correctly for 64 bit floats 2082- r600/sfn: assign window_space_position in shader state 2083- r600/sfn: Ass support for image_samples 2084- r600/sfn: fix cube to array lowering for LOD 2085 2086Giancarlo Devich (25): 2087 2088- d3d12: Use varying comparison function for TESS stage key compare 2089- d3d12: Add unions to encompass shader key stage vars, use in hashing 2090- nir: Check sampler_binding is valid when lowering tex shadow 2091- d3d12: Don't clear d3d12_shader_key 2092- d3d12: Move d3d12_context_state_table_entry to d3d12_resource_state.h 2093- d3d12: Assign up to 16 simultaneously active contexts unique IDs 2094- d3d12: Track up to 16 
active context resource states locally in d3d12_bo 2095- d3d12: Don't recompute has_flat_varyings or missing_dual_src_outputs 2096- d3d12: Track max varying slot, set and compare less bytes 2097- d3d12: Don't unnecessarily zero out gs/tcs keys 2098- d3d12: Don't memcmp gs/tcs keys 2099- d3d12: Create varying structures as necessary, reference them 2100- d3d12: Don't loop in update_draw_indirect_with_sysvals 2101- d3d12: Compare shader keys with a switch, instead of cascading if's 2102- d3d12: Compare shader keys with union-encompassing fields all at once 2103- d3d12: Compare shader key common parts with memcmp, instead of if's 2104- d3d12: Cache varying info to reduce compare/copy cost 2105- d3d12: Use memcmp for full tcs/gs variant keys 2106- d3d12: Track up to 16 contexts worth of pending barriers locally in bos 2107- d3d12: Don't unnecessarily recompute manual_depth_range 2108- d3d12: Use context-level sampler_state array for filling shader keys 2109- d3d12: Use short circuit in shader key compare; update key hash 2110- d3d12: Reduce gs variant key init cost; unnecessary validate gs calls 2111- d3d12: Unroll shader variant selection loop 2112- d3d12: Track up to 16 contexts worth of batch references locally in bos 2113 2114Guilherme Gallo (24): 2115 2116- radeonsi/ci: Update stoney test expectations 2117- radeonsi/ci: Skip slow traces on raven 2118- Revert "ci: disable Collabora's LAVA lab for maintance" 2119- ci/lava: Move LAVA dependencies to pip 2120- ci/lava: Add LavaFarm class to find LAVA farm from runner tag 2121- ci/lava: Fix LAVA logs issues for Collabora jobs 2122- ci: Upload debian-release artifact to S3 2123- ci: Create debian-arm64-release job 2124- ci: Use release builds in perf jobs 2125- ci: Use workflow to make CI aware of performance jobs 2126- ci: Reuse MESA_CI_PERFORMANCE_ENABLED in performance-rules 2127- ci: Handle carriage return characters in LAVA logs 2128- ci: Fix release build use for performance jobs 2129- ci/baremetal: Wrap artifact 
download curl with xtrace 2130- ci: Improve piglit-traces "no-perf" filter 2131- ci: Fix freedreno-rules-performance 2132- ci: Add piglit traces hidden jobs 2133- ci/freedreno: create a618-traces and perf jobs 2134- ci/zink: Add zink-a618 trace jobs 2135- ci/zink: Add zink-turnip-manual-rules 2136- ci/zink: Add zink-tu-a618-traces-performance job 2137- ci/zink: Fix zink-tu-a618-traces perf job rules 2138- ci/zink: Reduce zink-tu-a618-traces parallelism 2139- Revert "ci: disable Collabora's LAVA lab for maintance" 2140 2141Hampus Linander (4): 2142 2143- nir: Add extr_agx opcode 2144- agx: Add extr instruction to AGX backend 2145- agx: Use AGX extr for tex lowering 2146- agx: Optimize lower_resinfo for cube maps 2147 2148Hans-Kristian Arntzen (13): 2149 2150- radv: Fix invalid 64-bit shift. 2151- radv: Fix missing VK_ACCESS_2_SHADER_SAMPLED_READ_BIT. 2152- radv: Implement VK_ACCESS_2_DESCRIPTOR_BUFFER_READ_BIT_EXT. 2153- wsi/common: Add common implementation of vkReleaseSwapchainImagesEXT. 2154- wsi/x11: Implement EXT_swapchain_maintenance1. 2155- wsi/common: Implement swapchain present fence. 2156- wsi/common: Add comment about DEFERRED_ALLOCATION_BIT_EXT. 2157- wsi/common: Add function to modify present mode. 2158- wsi/wayland: Implement EXT_swapchain_maintenance1. 2159- wsi/display: Implement EXT_swapchain_maintenance1. 2160- wsi/win32: Implement VK_EXT_swapchain_maintenance1. 2161- radv: Expose VK_EXT_swapchain_maintenance1. 2162- wsi/x11: Fix present ID signal when IDLE comes before COMPLETE. 

Harri Nieminen (6):

- docs/specs: Fix typos
- docs/gallium: Fix typos
- docs/freedreno: Fix typos
- docs/panfrost: Fix typo
- docs/svga3d: Fix typo
- bin: Fix typos

Helen Koike (12):

- ci/debian-android: move pkgconfig paths to the cross file
- ci: move patches to patches directory
- android: allow system = 'android' on cross file
- ci/android: move sdk version and ndk to a job variable
- ci: compile deqp for android
- ci: compile deqp-runner for android
- ci: debian-android compile virgl
- ci: export artifacts from debian-android
- ci/android: add android to the ci
- android/ci: fix removal of inexistent file
- android/ci: Fix call to adb
- android/ci: raise error on script when not related to the tests

Hyunjun Ko (1):

- vulkan/runtime: match the spec when taking pipeline subsets.

Iago Toral Quiroga (20):

- broadcom/compiler: produce better code for f2f16 with RTZ rounding
- v3dv: add paths to handle partial copies of linear images
- v3dv: drop unused field from v3dv_cmd_buffer
- v3dv: increase BO allocation size when growing CLs
- v3dv: ensure we allocate at least the requested space for a CL
- v3dv: add a cl_advance_and_end helper
- v3dv: ensure at least V3D_CL_MAX_INSTR_SIZE bytes in last CL instruction
- v3dv: ensure we apply binning syncs to secondary command buffers
- v3dv: fix stencil view aspect selection of depth/stencil image
- v3d: support r{g,gba}16f formats for vertex buffers
- broadcom/compiler: track pending ldtmu count with each TMU lookup
- v3dv: pause occlusion queries during vkCmdClearAttachments
- v3dv: fix format swizzle for buffer views
- v3dv: drop unused parameter
- v3dv: always acquire display device before checking if we can present
- vulkan/wsi/display: set pDisplay to NULL on error
- v3d,v3dv: stop trying to force 16-bit TMU output for shadow comparisons
- broadcom/compiler: fix v3d_qpu_uses_sfu
- broadcom/compiler: add a v3d_qpu_instr_is_legacy_sfu helper
- broadcom/compiler: fix incorrect check for SFU op

Ian Romanick (60):

- ntt: Add support for fcsel_gt and fcsel_ge opcodes
- nir/lower_int_to_float: Add support for i32csel opcodes
- r300: Enable generation of fcsel_gt and fcsel_ge opcodes
- i915: Enable generation of fcsel_gt and fcsel_ge opcodes
- gallium/draw: Enable aapoint NIR helpers to generate bool1, bool32, or float32 Booleans
- gallium/draw: Enable polygon stipple NIR helpers to generate bool1 or bool32 Booleans
- nir/builder: Eliminate nir_f2b helper (and use of nir_f2b32 helper)
- nir/builder: Handle f2b conversions specially in nir_type_convert
- nir: Eliminate nir_op_f2b
- lavapipe: Fix bad array index scale factor in lvp_inline_uniforms pass
- lavapipe: Only check NULL pointers in one place in src_only_uses_uniforms
- nir/inline_uniforms: Change num_offsets type to uint8_t
- nir/inline_uniforms: Pass max_num_bo and max_offset around as parameters
- nir/inline_uniforms: Allow possibility of more than one UBO
- nir/inline_uniforms: Allow possibility of uni_offsets and num_offsets being NULL
- nir/inline_uniforms: Make src_only_uses_uniforms public, change name
- nir/inline_uniforms: Make add_inlinable_uniforms public
- nir/inline_uniforms: Add inot condition support
- nir/tests: Don't unconditionally log shaders from this one CF test
- nir/tests: Refactor creation of loops for loop_analyze test cases
- nir/tests: Add tests for "inverted" loops
- nir/tests: Add tests for nir_loop_info::induction_vars tracking
- nir/loop_analyze: Track induction variables with uniform increments
- nir/loop_analyze: Use nir_loop_variable::update_src instead of nir_basic_induction_var::alu
- nir/loop_analyze: Use nir_loop_variable::init_src instead of nir_basic_induction_var::def_outside_loop
- nir/loop_analyze: Eliminate nir_basic_induction_var
- nir/loop_analyze: Track induction variables with uniform initializer
- nir/loop_analyze: Simplify some logic in compute_induction_information
- nir: ifind_msb_rev can only have int32 sources
- intel/compiler: Lower find_lsb in NIR
- nir: intel/compiler: Move ifind_msb lowering to NIR
- intel/compiler: Tighter src and dest size bounds checking for some opcodes
- nir/algebraic: Only lower ufind_msb with 32-bit sources
- nir: intel/compiler: Move ufind_msb lowering to NIR
- nir/builder: Do not generate 8- or 16-bit find_msb
- nir/algebraic: Do not generate 8- or 16-bit find_msb
- nir: Restrict ufind_msb and ufind_msb_rev to 32- or 64-bit sources
- nir/algebraic: Optimize some ifind_msb to ufind_msb
- nir/lower_int64: Optionally lower ufind_msb using uadd_sat
- intel/fs: Don't copy propagate from saturate to sel
- nir/algebraic: Undistribute fsat from fmax
- intel/fs: Output opt_combine_constants debug to stderr
- intel/fs: Refactor part of opt_combine_constants to a separate function
- intel/fs: Rework the loop of opt_combine_constants that collects constants
- intel/compiler: Remove one overload of backend_instruction::insert_before
- intel/compiler: Use NIR_PASS instead of NIR_PASS_V
- intel/compiler: Micro optimize inst_is_in_block
- intel/fs: Use specialized version of regions_overlap in opt_copy_propagation
- intel/compiler: Micro optimize regions_overlap
- intel/fs: Linked list micro optimizations in brw_nir_move_interpolation_to_top
- intel/fs: Preserve meta data more often in brw_nir_move_interpolation_to_top
- intel/fs: White space fixes
- nir/tests: Add many loop analysis tests for induction vars updated by shifts
- nir/tests: Add more loop analysis tests for induction vars updated by shifts
- nir/tests: Add many loop analysis tests for induction variables modified by imul
- nir/loop_analyze: Add a function to evaluate an ALU as constant
- nir/loop_analyze: Track induction variable basis information
- nir/loop_analyze: Change invert_cond instead of changing the condition
- nir/loop_analyze: Use try_eval_const_alu and induction variable basis info
- nir/tests: Port almost all loop_analyze tests to new macro-based infastructure

Ikshwaku Chauhan (1):

- radeonsi: Fix distortion for yuv422 format for GFX10.

Illia Abernikhin (1):

- util: Extend vk_enum_to_str with bitmasks vk_enum_to_str only generates literals for enums with type: @type="enum", but many enums have type: @type="bitmask" and were not taken into account here.

Illia Polishchuk (4):

- ANV: Add extra memory types for ANV driver instead of a single one
- hasvk: Add extra memory types for hasvk driver instead of a single one
- nir: Add sha1 hash for nir shaders converted from spir-v
- glx: fix indirect initialization crash

Ilya K (1):

- intel/vk/grl: don't install libgrl.a

Isaac Bosompem (1):

- tool/pps: Fix 32-bit build issue with format string

Isabella Basso (5):

- nir/algebraic: insert patterns inside optimizations list
- nir/algebraic: extend mediump patterns
- nir/algebraic: extend lowering patterns for conversions on smaller bit sizes
- nir/algebraic: make patterns for float conversion lowerings imprecise
- nir/algebraic: remove duplicate bool conversion lowerings

Italo Nicola (10):

- panfrost: fix off-by-one when exporting format modifiers
- panfrost: fix tiny sample_positions BO memory leak
- hud: use defines for default scale/rotation/visibility values
- hud: add GALLIUM_HUD_OPACITY envvar
- panfrost: fix strict-aliasing violations when packing fb ptrs
- etnaviv: abort() instead of assert(0) on compiler error
- etnaviv: use stderr for compiler error logging
- etnaviv: add default clear_buffer and clear_texture APIS
- etnaviv: lower (un)pack_{2x16,2x32}_split and extract_{byte,word}
- etnaviv: implement nir_op_uclz and lower find_{msb,lsb} to uclz

Iván Briano (7):

- anv: uncompressed views of compressed 3d images are now valid
- vulkan: track the right value on CmdSetColorWriteMasks
- anv: fix testing for dynamic color blend bits
- anv: stop tracking color blend state in the pipeline
- anv: use the parameter passed to the macro
- intel/fs: handle interpolation modes for at_sample and at_offset too
- vulkan/wsi/display: do not dereference a NULL pointer

Jakub Kulík (1):

- mesa: Fix format transform on big endian platforms

Jan Beich (1):

- util/u_process: implement util_get_command_line for BSDs

Janne Grunau (1):

- asahi: Fix typo in debug/error message helper macro

Jarred Davies (16):

- pvr: Use common queue submit implementation
- pvr: Add support for VK_KHR_timeline_semaphore
- pvr: Enable threaded submit when supported
- pvr: Clear wait syncs after job submission
- pvr: Don't update fragment signal sync when fragment stage is disabled
- pvr: Fix segfaults when pDepthStencilAttachment is NULL
- pvr: Generate EOT program at runtime
- pvr: Generate dummy emit for renders without any emits
- pvr: Add support for multiple emits from EOT program
- pvr: Select a single aspect format for the texture state of DS image views
- pvr: Add initial support for VK_FORMAT_S8_UINT
- pvr: Don't allocate/upload 0 size coeff programs
- pvr: Always mark robustBufferAccess as supported
- pvr: Rename pvr_xgl_pds.c to pvr_pipeline_pds.c
- pvr: Add robustness buffer support
- pvr: Mark all normalized formats as supporting with_packed_usc_channel

Jesse Natalie (224):

- ci/windows: Download updated WARP 1.0.4 package
- dzn/ci: Remove flakes/fails that don't hit anymore
- dzn/ci: Add image test group, which is all passing now
- dzn: Fix clear bind flag logic
- microsoft/compiler: Lower pack_[u/s]norm_2x16
- microsoft/compiler: Implement texture sample count query
- microsoft/compiler: Remove arrays when testing for structs in I/O
- microsoft/compiler: Always emit float types in the I/O signature for structs
- microsoft/compiler: Re-work the logic for adding SV_SampleIndex to force sample-rate
- microsoft/compiler: Use nir info.fs.uses_sample_shading to force sample-rate
- microsoft/compiler: Set num_components to 4 when updating pos write instructions
- spirv2dxil: For removing unused vars, consider the whole I/O var size
- spirv2dxil: When removing unused inputs, make sure they're actually inputs
- spirv2dxil: Allow killing position as an undef varying
- spirv2dxil: Replace not-provided inputs with zero instead of undef
- dzn: Get options13
- dzn: Support alpha blend factor
- dzn: When changing root signature, dirty descriptors too
- dzn: Use R24G8_TYPELESS for 24/8 depth resources
- dzn: Support int border colors
- dzn: Storage buffer sizes need to be 4-byte-aligned
- dzn: Set MultisampleEnable to enable MSAA lines
- dzn: Use typeless format for creation of depth-only or stencil-only D24S8
- dzn: Define a symbol that was present in older D3D headers
- dzn: Support root signature 1.2
- dzn: Support unnormalized coordinate samplers
- dzn: Always align cached pipeline header size to input element align
- dzn: Add a zeroed zsa state when depth or raster is disabled
- dzn: Disable depth when the rasterizer is disabled due to no position output
- dzn: Fix format support checks for storage/uniform texel buffers
- dzn: Remove cmdbuf query 'wait' list
- microsoft/compiler: Delete incorrect implementation for load_layer_id
- microsoft/compiler: Subpass textures are supposed to be arrays
- microsoft/compiler: Delete stale TODO comment
- microsoft/compiler: Support view instancing
- spirv2dxil: Pass runtime conf struct to lower_shader_system_values
- spirv2dxil: Implement lowering for multiview
- spirv2dxil: Claim multiview support
- dzn: Put nir compilation options in a struct
- dzn: Handle multiview pipeline creation
- dzn: Handle draws and clears for multiview rendering
- dzn: Implement multiview queries
- dzn: Enable multiview
- dzn: Enable independent blending
- dzn: Delete an unnecessary assert
- dzn: Rework meta blit VS
- microsoft/compiler: Add an overload param to unary function helpers
- microsoft/compiler: Implement a few basic wave/subgroup intrinsics
- microsoft/compiler: Add lowering passes for basic subgroup vars
- spirv2dxil: Use 32-bit shared offsets
- spirv2dxil: Support basic subgroups
- dzn: Support basic subgroups
- microsoft/compiler: Fix atomic image umax
- microsoft/compiler: Lower device index to zero
- spirv2dxil: Support dispatches with base group indices
- dzn: Support vkCmdDispatchBase
- dzn: Use common physical device list/enumeration helpers
- dzn: Respect suspending/resuming flags to omit clears/resolves
- dzn: Set dynamic rendering caps
- dzn: When rendering to 3D, don't treat layers as subresources for barriers
- dzn: Move patched vertex buffer capability check up a level
- dzn: Use SHADER_LOAD to indicate SAMPLED_IMAGE support
- dzn: Usage image view usage instead of image usage
- dzn: Support EXTENDED_USAGE bit
- dzn: Usage MULTISAMPLE_LOAD support instead of RT/DS support for MSAA
- dzn: Descriptor limits are based on binding tier, not heap tier
- dzn: A single sampler descriptor set needs to support 1024 samplers
- dzn: Don't expose variable pointers
- dzn: Fix independent blend check
- dzn: Enable Vulkan 1.1
- microsoft/compiler: Don't emit threadgroup barriers for graphics shaders
- microsoft/compiler: Handle i2i1 and u2u1
- microsoft/compiler: Handle i1 overloads
- microsoft/compiler: Implement more wave/quad ops
- microsoft/compiler: Support emitting the SM6.6 wave size tag
- spirv2dxil: Lower some wave op properties
- spirv2dxil: Support subgroup SPIR-V caps
- dzn: Support more subgroup/quad ops
- dzn: Implement subgroup size control extension
- dzn: Use core feature matching logic instead of rolling our own
- microsoft/compiler: Support float controls
- dzn: Fix dynamic rendering clear load op for non-multiview
- dzn: Handle separate stencil usage
- dzn: Cache GPUVA for buffers
- dzn: Support float control
- dzn: Always do clears with copies on non-graphics queues
- dzn: Enhanced barriers fixes/workarounds
- dzn: Ensure we don't mix DSV+simultaneous-access
- dzn: Support Vulkan 1.2
- dzn: Fix Windows WSI
- dzn: Don't recursively lock the physical device enum mutex
- dzn: Report as a software device for non-Windows
- CI/windows: Don't limit deqp-runner to 4 jobs
- CI/windows: Apply CI_FDO_CONCURRENT to piglit too
- dzn: Consider linked shaders when computing DXIL hash
- wsi/win32: Always use non-SRGB formats for DXGI
- wsi/win32: Use app-provided timeout instead of arbitrary hardcoded value
- CI: Lima farm is offline
- dzn, driconf: Add a driconf entry for NMS to claim wide line support
- vulkan/wsi: Add a wsi_device param to get_present_modes
- vulkan/wsi/win32: Support tearing (immediate) and VSync (FIFO) present modes
- wsi/win32: Don't require buffer blits for software drivers
- wsi/win32: We don't need a window DC for DXGI
- clc: Include opencl-c-base.h with LLVM 15 (using builtins)
- microsoft/clc: Set features that are used by CL tests
- ci/windows: Update LLVM to 15
- nir: Add alignment to load_push_constant
- nir_lower_fp16_casts: Allow opting out of lowering certain rounding modes
- microsoft/compiler: Handle struct consts in DXIL module dumper
- microsoft/compiler: Handle frcp for float16/float64
- microsoft/compiler: Ensure native_low_precision is set for 16-bit bitcasts/stores
- microsoft/compiler: Handle undef-rounding f2f16 as rtz
- microsoft/compiler: Move unaligned load/store pass from CL
- microsoft/compiler: Pass deref modes to unaligned pass and handle push const
- microsoft/compiler: Simplify bitpacking for load/store lowering with nir_extract_bits
- microsoft/compiler: Pass an alignment to constant buffer load lowering
- microsoft/compiler: Handle 48-bit stores to SSBO/shared
- microsoft/compiler: Support raw buffer load/store intrinsics with 16bit alignment
- microsoft/compiler: Support lowering SSBO accesses to 16bit vectors
- spirv2dxil: Set min UBO/SSBO alignments
- spirv2dxil: Lower unaligned loads and stores
- spirv2dxil: Move shader model into runtime conf struct
- spirv2dxil: Support 16bit types
- dzn: Enable get_surface_capabilities2
- dzn: Delete unused extensions table
- dzn: Get options4
- dzn: Enable 16bit types when supported
- dzn: Enable KHR_storage_buffer_storage_class
- vulkan/wsi: Fix Windows build
- radv: Fix returning an expression from a void function
- Revert "CI: Disable Windows runners"
- nir: Propagate alignment when rematerializing cast derefs
- microsoft/compiler: Implement wave reduce/exclusive scan ops that are supported
- microsoft/compiler: Add a lowering pass for scan ops that aren't supported
- spirv2dxil: Handle arithmetic subgroup ops
- dzn: Claim the arithmetic subgroup bit
- ci/windows: Update warp to 1.0.5
- microsoft/compiler: Handle writable buffer UAV size queries
- d3d12: Report correct texel buffer max size
- d3d12: Fix buffer SRV/UAV creation
- d3d12: Remove now-unused UAV format from shader info
- microsoft/compiler: Fix setting bit 31 in feature flags
- microsoft/compiler: Only set typed UAV load feature bit for multi-comp loads
- microsoft/compiler: Refactor type -> resource kind helper
- microsoft/compiler: Add helpers for getting res_props structs
- microsoft/compiler: Split handle annotation into two parts
- microsoft/compiler: Handle "bindless" image/tex sources as heap indices
- microsoft/compiler: Support descriptor heap indexing for UBO/SSBO
- microsoft/compiler: Use store_dest instead of store_dest_value more
- microsoft/compiler: Update header docs for binding modes supported by compiler
- spirv2dxil: Add a pass to lower deref tex/image and vulkan ubo/ssbo to bindless
- spirv2dxil: Only lower readonly images to SRVs when the option is set
- spirv2dxil: Support descriptor indexing capabilities
- dzn: Remove device pointers from descriptor heaps
- dzn: Remove descriptor heap type from descriptor heap wrapper
- dzn: Fix a leak in descriptor set layout creation
- dzn: Add some docs around descriptor sets and remove redundant/unused data
- dzn: Put UAVs first for storage images/buffers in descriptor tables
- dzn: Consistently order depth formats before stencil
- dzn: Don't use plane slice 1 for depth+stencil SRVs
- dzn: Set up SRV descs for 3D textures correctly
- dzn: Skip setting up UAVs for depth resources
- dzn: Add initial bindless infrastructure
- dzn: When bindless, only allocate one descriptor per layout entry
- dzn: Remove defragmenting of descriptor pools
- dzn: Delete unused function
- dzn: Allocate descriptor sets in buffers for bindless mode
- dzn: Don't dirty bindings if root signature doesn't change
- dzn: Use separate dirty bits for descriptor sets/dynamic buffers
- dzn: Bind buffers for bindless descriptor sets
- dzn: Add a binding classification in the pipeline layout remapping
- dzn: When binding a bindless root signature, bind descriptor heaps first
- dzn: Ensure root signatures are re-bound after a meta op
- dzn: Only bind descriptor sets up to the used amount of the current layout
- dzn: Apply bindless lowering when compiling pipelines
- dzn: Add a debug option for enabling bindless mode
- dzn: Support descriptor indexing via bindless
- dzn: Enable variable size bindings
- dzn: Use mesa_loge for DXIL validation errors
- microsoft/clc: Add shader model / validator to compiler API
- d3d12: Move forward-front-face pass to common DXIL code
- spirv2dxil: Expose yz flip pass to external callers
- dzn: Add a helper to generate triangle->point GS
- dzn: Handle polygon point mode
- dzn: Claim fillModeNonSolid
- CI/windows: Update headers and Agility redist to 1.710.0-preview
- CI/windows: Increase timeout for build container job
- microsoft/compiler: Fix 8-bit loads and stores when supporting 16-bit DXIL
- microsoft/compiler: Fix barrier for wave ID computation
- microsoft/compiler: Assign 1D wave IDs based on local thread ID
- microsoft/compiler: Fix large shifts
- spirv2dxil: Add some more supported caps
- dzn: Add a driconf entry for enabling 8bit loads and stores
- dzn: Add a driconf option for enabling subgroup ops in VS/GS
- dzn: Fix SRV barrier state on compute command lists
- dzn: Raise max number of descriptor sets to 8
- dzn: Report some more caps correctly that are supported
- dzn: Align descriptor sets in the bindless buffer
- dzn: Ensure pipeline variants are used for dynamic stencil masks
- dzn: Don't use write-combine memory for cache-coherent UMA
- dzn: Ensure buffer offsets are aligned
- dzn: Attempt to force depth write states for depth access in LAYOUT_GENERIC
- dzn: Don't do initial-layout barriers for simultaneous-access resources
- dzn: Batch command lists together
- dzn: Fix bindless descriptor sets with multiple dynamic buffers that need custom descriptors
- dzn: Early-out on no-op barriers
- dzn: Clean up ABI helpers now that we require DirectX-Headers 606
- dzn: Use GetResourceAllocationInfo3 for castable formats
- dzn: Don't leave deleted physical devices in the instance pdev list
- dzn: Remove skips now that WARP is faster
- dzn: Support >2K samplers with bindless
- dzn: Remove xfail for test that passes (if run)
- microsoft/compiler: Don't split loads/stores that will be split by lower_explicit_io
- dzn: Changes to descriptor set dirty flag handling
- dzn: Use a linear allocator for upload data on command buffers
- dzn: Ignore unnormalized sampling flag if driver doesn't support it
- dzn: Never set STATE_RENDER_TARGET on a compute command list
- dzn: Don't enable bindless by default
- d3d12: Support creating PSOs with no attachments with MSAA without TIR
- d3d12: Fix buffer reference leak for SO count staging buffer
- dzn: Handle mismatches in bound descriptor set vs pipeline layout
- d3d12: Respect buffer offsets for sampler views
- dzn: Hook up subgroup size to compute shader compilation
- dzn: Delete queue-level event waits

Jonathan Gray (3):

- egl/dri2: avoid undefined unlocks
- intel/dev: remove invalid EHL pci id
- intel/dev: Add another EHL pci id

Jonathan Marek (1):

- turnip: fix use of align() instead of util_align_npot() with tile_align_w

Jordan Justen (8):

- intel/vk/grl: Don't include anv_private.h in genX_grl.h
- intel/vk/grl: Allow genX_grl.h to be included by C++ files
- intel/vk/grl: Allow grl/grl_cl_kernel.h to be included by C++ files
- intel/vk/grl: genX-ify genX_grl_uuid.cpp
- intel/vk/grl: genX-ify grl_cl_kernel_name()
- intel/dev: Enable MTL PCI ids
- intel/compiler: Support fmul_fsign opt for fp64 when int64 isn't supported
- intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops

Joshua Peisach (1):

- gallum/asahi: fix memory leak in agx_resource_from_handle

José Fonseca (4):

- llvmpipe: Ensure floating point SSE state is reset regardless of the write mask.
- llvmpipe: Honor zero sample_mask when multisample is disabled.
- trace: Don't use italic escape code.
- wgl: Fix unintentional assignment on assert.

José Roberto de Souza (107):

- anv: Start to move i915 specific code from anv_device to i915/anv_device
- anv: Export anv_exec_batch_debug() and chain_command_buffers()
- anv: Split i915 code from anv_batch_chain.c
- anv: Move anv_device_check_status() code to i915/anv_device.c
- intel/dev: Export functions that will be used by different kernel drivers
- intel/dev: Move i915 code to i915/intel_device_info.c
- intel/dev: Split hwconfig i915 specific code
- intel/dev: Detect what is the kernel mode driver loaded
- intel: Add intel_kmd_type parameter to intel_engine_get_info()
- intel: Add kmd_type parameter to necessary intel_gem.h functions
- anv: Nuke anv_queue:index_in_family
- hasvk: Nuke anv_queue:index_in_family
- intel/ds: Nuke intel_ds_queue::queue_id
- intel/ds: Fix crash when allocating more intel_ds_queues than u_vector was initialized
- intel/genxml/gen125: Add walker configuration fields to 3DSTATE_WM
- intel/genxml/gen125: Tune 3DSTATE_WM Walker direction
- intel: Add intel_memory_class_instance
- anv: Convert drm_i915_gem_memory_class_instance to intel_memory_class_instance
- anv: Use DRM_IOCTL_I915_GEM_CREATE_EXT in all supported kernels
- anv: Add basic KMD backend infrastructure
- anv: Start to move anv_gem_stubs.c to kmd backend
- anv: Remove remaining bits of anv_i915_query()
- hasvk: Remove remaining bits of anv_i915_query()
- anv: Add gem_close to kmd backend
- anv: Add gem_mmap to kmd backend
- anv: Move execute_simple_batch() and
queue_exec_locked() to kmd backend 2650- intel/common: Move i915 files to i915 folder 2651- iris: Export batch debug functions 2652- iris: Export update_batch_syncobjs() 2653- iris: Export num_fences() 2654- intel: Make gen12 URB space reservation dependent on compute engine presence 2655- intel/blorp: Allocate only necessary amount of VERTEX_BUFFER_STATE 2656- intel: Pull in xe_drm.h 2657- intel: Add Meson parameter to enable Xe KMD support 2658- intel/dev: Add INTEL_KMD_TYPE_XE 2659- intel/dev: Implement Xe functions to fill intel_device_info 2660- intel/dev: Implement Xe functions to handle hwconfig 2661- intel/dev: Query and compute hardware topology for Xe 2662- iris: Convert drm_i915_gem_memory_class_instance to intel_memory_class_instance 2663- iris/bufmgr: Add i915_gem_set_domain() 2664- iris: Use DRM_IOCTL_I915_GEM_CREATE_EXT in all supported kernels 2665- iris: Add initial skeleton of kmd backend 2666- iris: Move iris_bo_madvise() to i915/iris_bufmgr.c 2667- iris: Add iris_bo_set_caching() 2668- intel/common: Implement the Xe functions for intel_engine 2669- intel/common: Implement the Xe functions for intel_gem 2670- iris: Move bo_madvise to kmd backend 2671- iris: Move bo_set_caching to kmd backend 2672- iris: Move iris_bo_busy_gem() to i915/iris_bufmgr.c 2673- iris: Move iris_bo_wait_gem() to i915/iris_bufmgr.c 2674- iris: Don't mark protected bo as reusable 2675- intel/perf: Disable it for Xe KMD 2676- build: Block build of HASVK, Crocus and i915 in non-x86 architectures 2677- iris: Add gem_mmap() to kmd backend 2678- iris: Add batch_check_for_reset() to kmd backend 2679- iris: Move i915 submit_batch() to i915 backend 2680- anv: Implement gem_create for Xe backend 2681- anv: Implement Xe functions to create and destroy VM 2682- anv: Implement gem close and mmap for Xe backend 2683- anv: Add gem VM bind and unbind to backend 2684- anv: Integrate gem vm bind and unbind kmd backend functions 2685- iris: Drop I915_EXEC_FENCE types 2686- iris: Drop usage 
of i915 EXEC_OBJECT_WRITE 2687- iris: Move iris_bufmgr_init_global_vm() to i915/iris_bufmgr.c and prepare for Xe KMD 2688- anv: Implement Xe version of anv_physical_device_get_parameters() 2689- anv: Properly alloc buffers that will be promoted to framebuffer in Xe KMD 2690- anv: Handle external objects allocation in Xe 2691- iris: Only mark buffer as exported if drmPrimeHandleToFD() succeed 2692- iris: Implement the Xe version of iris_bufmgr_init_global_vm() 2693- iris: Implement the function to destroy VM in Xe 2694- iris: Implement gem_create() in Xe kmd backend 2695- iris: Implement gem_mmap() in Xe kmd backend 2696- iris: Store iris_context's priority 2697- iris: Move to i915/iris_batch.c code to create and replace i915 context 2698- iris: Move to iris_i915_batch.c code to destroy i915 context 2699- intel: Move memory aligment information to intel_device_info 2700- anv: Use intel_device_info memory alignment 2701- intel: Set mem_alignment in Xe kmd 2702- anv: Apply memory alignment requirements in Xe kmd 2703- intel: Add TODO about removal of 2Mb alignment in i915 2704- anv: Replace I915_ENGINE_CLASS_VIDEO by INTEL_ENGINE_CLASS_VIDEO 2705- anv: Create Xe engines 2706- anv: Implement Xe version of check_status() 2707- anv: Handle Xe queue/engine priority 2708- anv: Implement Xe version of execute_simple_batch() 2709- iris: Prepare iris_bufmgr functions for vm bind error paths 2710- iris: Add vm bind and unbind to kmd backend 2711- iris: Implement gem_vm_bind() and gem_vm_unbind() in Xe kmd backend 2712- iris: Ajust gem buffer allocation size in Xe kmd 2713- intel: Sync xe_drm.h 2714- anv: Partialy import drm-uapi/gpu_scheduler.h and use it 2715- anv: Fetch max_context_priority from drm_xe_query_config 2716- intel: Allocate mesh shader URB space before task shader 2717- anv: Move to a function code to clflush batch buffers 2718- anv: Implement Xe version of anv_queue_exec_locked() and queue_exec_trace() 2719- anv: Disable anv_bo_sync_type for Xe kmd 2720- anv: 
Add assert in functions not supported by Xe kmd 2721- iris: Add BO_ALLOC_SHARED 2722- iris: Handle allocation of exported buffers in Xe kmd 2723- iris: Handle allocation of scanout buffers in Xe 2724- iris: Implement Xe version of bo_madvise() and bo_set_caching() 2725- anv: Fix vm bind of imported buffers 2726- iris: Add function to close gem bos 2727- iris: Handle Xe syncronization with syncobjs 2728- loader: Add Xe KMD support 2729- iris: Fix close of exported bos 2730- iris: Allow shared scanout buffer to be placed in smem as well 2731 2732Juan A. Suarez Romero (8): 2733 2734- v3d/v3dv: define performance counters in common 2735- v3d: cache pipe query results 2736- v3d: include offset as part of streamout target 2737- v3d: implement NV_conditional_render extension 2738- v3d: fix condition for EZ disabling when stencil on 2739- v3d: set depth compare function correctly 2740- v3d: use primitive type to get stream output offset 2741- v3d: apply 1D texture miplevel alignment in arrays 2742 2743Julia Tatz (3): 2744 2745- zink: zink_heap isn't 1-to-1 with memoryTypeIndex 2746- zink: trival renames heap_idx -> memoryTypeIndex 2747- zink: correct sparse bo mem_type_idx placement 2748 2749Juston Li (29): 2750 2751- venus: refactor out vn_feedback_event_cmd_record2 2752- venus: refactor VK_KHR_synchronization2 ext sync fd requirements 2753- venus: require importable external semaphores for WSI 2754- venus: require exportable bit for ext fence sync fd 2755- venus: require exportable/importable bit for ext semaphores sync fd 2756- venus: remove filtering external semaphores for QueueSubmit 2757- venus: drop VkQueueBindSparse 2758- venus: append fence feedback batch 2759- venus: refactor QueueSubmit/QueueSubmit2 2760- venus: vn_queue: align vulkan object variable naming 2761- docs/envvars: add missing mesa disk cache envvars 2762- util/fossilize_db: don't destroy foz on RO load fail 2763- utils/fossilize_db: refactor out loading RO foz dbs 2764- util/fossilize_db: add 
runtime RO foz db loading via FOZ_DBS_DYNAMIC_LIST 2765- util/fossilize_db: fix macOS inotify build error 2766- util/fossilize_db: add ifdef for inotify header 2767- util/tests/cache_test: Skip Cache.List if not supported 2768- anv: check initial cmd_buffer is chainable 2769- venus: refactor semaphore helper functions 2770- venus: refactor batch submission fixup 2771- venus: add NO_TIMELINE_SEM_FEEDBACK perf option 2772- venus: add timeline semaphore feedback cmds 2773- venus: enable timeline semaphore feedback 2774- venus: add SHADER_DEVICE_ADDRESS_BIT to buffer cache 2775- venus: switch to lazy VkBuffer cache 2776- venus: add VN_DEBUG_CACHE flag 2777- venus: Add VkBuffer cache statistics for debug 2778- venus: shader cache fossilize replay fix 2779- util/disk_cache: use posix_fallocate() for index files 2780 2781Kai Wasserbäch (5): 2782 2783- fix: gallivm: limit usage of LLVMContextSetOpaquePointers() to LLVM 15 2784- fix(FTBFS): gallivm: fix LLVM #include of Triple.h, moved to TargetParser 2785- fix(FTBFS): clover: fix LLVM #include of Triple.h, moved to TargetParser 2786- fix: clover/llvm: replace llvm::None with std::nullopt for LLVM 17+ 2787- fix: gallivm: fix LLVM #include of Host.h, moved to TargetParser 2788 2789Kai-Heng Feng (2): 2790 2791- iris: Retry DRM_IOCTL_I915_GEM_EXECBUFFER2 on ENOMEM 2792- Revert "iris: Avoid abort() if kernel can't allocate memory" 2793 2794Karmjit Mahil (28): 2795 2796- pvr: Process set and reset event sub commands. 2797- pvr: Process wait event sub command. 2798- pvr: Add SPM scratch buffer infrastructure. 2799- pvr: Acquire scratch buffer on framebuffer creation. 2800- pvr: Update comment about ZS and MSAA buffers for pvrsrvkm submission. 2801- pvr: Set SPMSCRATCHBUFFER flag. 2802- pvr: Add SPM load usc empty programs 2803- pvr: Upload spm load programs to device. 2804- pvr: Add support for VK_ATTACHMENT_LOAD_OP_LOAD. 
- pvr: Move descriptor write into pvr_write_descriptor_set()
- pvr: Add support to copy descriptors on vkUpdateDescriptorSets()
- pvr: Handle VK_QUERY_RESULT_WAIT_BIT.
- pvr: Store enum pvr_stage_allocation instead of VkShaderStageFlags
- pvr: Put old descriptor set approach behind a hardcoding check
- pvr: Change last_DMA to last_dma
- pvr: Write descriptor set addrs table dev addr into shareds
- pvr: Add PVR_SELECT() helper macro
- pvr: Add push consts support to descriptor program.
- pvr: Add support for dynamic buffers descriptors
- pvr: Add support for blend constants.
- pvr: Move PBE START_POS into csb enum helpers header
- pvr: Setup SPM EOT state
- pvr: Remove unused msaa_mode field
- pvr: Remove component_alignment
- pvr: Setup SPM background object
- pvr: Don't advertise currently unsupported features
- pvr: Advertise STORAGE_IMAGE_BIT for B10G11R11_UFLOAT_PACK32
- pvr: Don't advertise S8_UINT support

Karol Herbst (44):

- rusticl: fix build error with valgrind being enabled
- rusticl/util: extract offset_of macro
- rusticl/icd: Make it work in case Rustc shuffles struct around
- rusticl/kernel: fix clGetKernelInfo CL_KERNEL_ATTRIBUTES for non source programs
- rusticl/program: enable spirv
- llvmpipe/ci: increase deqp-runner timeout
- rusticl/device: fix some device limits
- rusticl/device: limit CL_DEVICE_MAX_CONSTANT_ARGS
- rusticl: no compute only
- rusticl: allocate printf buffer as staging
- nir: Skip samplers and textures in lower_explicit_io
- nir/deref: don't replace casts with deref_struct if we'd lose the stride
- ci/zink: move threading tests to flakes
- rusticl/kernel: Images arg sizes also have to match the host pointer size
- gallivm: fix lp_vec_add_offset_ptr for 32 bit builds
- nvc0: enable fp helper invocation memory loads on Turing+
- nir: track existence of variable shared memory
- rusticl/kernel: set has_variable_shared_mem on the nir
- gallium: add get_compute_state_info
- lp: implement get_compute_state_info
- iris: implement get_compute_state_info
- nv50: implement get_compute_state_info
- nvc0: implement get_compute_state_info
- panfrost: move max_thread_count and take reg_count into account
- panfrost: implement get_compute_state_info
- rusticl/kernel: make use of cso info
- radeonsi: implement get_compute_state_info
- radeonsi: use default float mode for CL
- rusticl: enable radeonsi
- rusticl: split platform into core and api parts
- rusticl/platform: rename _cl_platform_id to Platform
- rusticl/platform: move getter into the type
- rusticl/platform: move device initialization to the platform
- rusticl/program: allow dumping compilation logs through RUSTICL_DEBUG
- rusticl/program: make IL programs look closer to CLC ones
- clc: add clc_validate_spirv
- rusticl/program: validate the SPIR-V when created from IL
- rusticl/program: extract common code of compile and build
- rusticl/program: rework source code tracking
- rusticl/event: drop work item before updating status
- radeonsi: lower mul_high
- ac/llvm: support shifts on 16 bit vec2
- rusticl: don't set size_t-is-usize for >=bindgen-0.65
- nvc0: do not randomly emit fences.

Kenneth Graunke (37):

- intel/blorp: Lower base_workgroup_id to zero
- intel/compiler: Move atomic op translation into emit_*_atomic()
- intel/compiler: Use LSC opcode enum rather than legacy BRW_AOPs
- intel/compiler: Add an lsc_op_num_data_values() helper
- intel/compiler: Eliminate SHADER_OPCODE_UNTYPED_ATOMIC_FLOAT
- intel/compiler: Drop redundant 32-bit expansion for shared float atomics
- intel/compiler: Delete fs_visitor::nir_emit_{ssbo,shared}_atomic_float()
- intel/compiler: Combine nir_emit_{ssbo,shared}_atomic into one helper
- intel/compiler: Delete all the A64 atomic variants for type sizes
- intel/compiler: Drop dest checking in atomic code
- intel/compiler: Use more symbolic source names in components_read()
- anv: Add missing untyped data port flush on PIPELINE_SELECT
- iris: Add missing untyped data port flush on PIPELINE_SELECT
- loader: Add infrastructure for tracking active CRTC resources
- egl: Rewrite eglGetMscRateANGLE to avoid probes and handle multi-monitor
- iris: Perform load_constant address math in 32-bit rather than 64-bit
- anv: Perform load_constant address math in 32-bit rather than 64-bit
- anv: Make a batch decoder for each queue family
- nir: Print divergence information for registers as well as SSA defs
- nir: Fix merge_set_dump() to compile again
- nir: Fix typos in the from-SSA pass comments
- intel: Use common helpers for TCS passthrough shaders
- intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes
- intel/fs: Add builder helpers for F32TO16/F16TO32 that work on Gfx7.x
- intel/fs: Delete a TODO about using brw_F32TO16.
- intel/fs: Use new F16TO32 helpers for unpack_half_split_* opcodes
- Revert "intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes"
- intel/fs: Use F32TO16/F16TO32 helpers in fquantize16 handling
- intel/fs: Move packHalf2x16 handling to lower_pack()
- intel/eu: Simplify brw_F32TO16 and brw_F16TO32
- intel/vec4: Retype texture/sampler indexes to UD
- intel/fs: Make bld.F16TO32 actually emit F16TO32 not F32TO16
- i965/vec4: Implement uclz in the vec4 backend
- st/mesa, iris: Add optional CPU-based ASTC void extent denorm flushing
- intel/compiler: Use nir_dest_bit_size() for ballot bit size check
- iris: Extend resource creation helpers to allow for explicit strides
- iris: Hack around gbm_gralloc stride restrictions

Konrad Dybcio (3):

- freedreno/registers: Add RBBM_GPR0_CNTL for non-GMU operation
- freedreno: Add A2xx perf counter reg values
- freedreno: Add A2xx REG_A2XX_RBBM_PM_OVERRIDE2 bitfields

Konstantin Seurer (78):

- vulkan: Track the nullDescriptor feature
- radv: Add a helper for finding memory indices
- radv: Create a null TLAS as meta state
- radv: Use the null accel struct instead of emitting 0
- radv/rt: Get rid of accel struct null checks
- radv: Advertise rt pipelines for Control (DX12)
- radv/bvh/meson: Add the option to set defines
- radv/bvh: Add a define for extended SAH
- radv: Add a shader variant for PLOC with extended SAH
- radv: Wrap internal build type inside a build_config struct
- radv: Enable extended SAH for shallow BVHs
- radv: Merge the leaf and internal converter
- radv: Improve the BVH size estimation
- radv: Fix creating accel structs with unbound buffers
- radv: Work around shader_call_data variables in raygen shaders
- radv/rq: Use 16 stack entries if there is only one ray query
- radv/llvm: Use the shader names as module name
- ac/llvm: Fix validation error with global io
- radv: Scalarize global IO with LLVM enabled
- radv: Make radv_compute_dispatch non-static
- radv: Implement ordered compute dispatches
- radv: Use an ordered dispatch for BVH encoding
- radv: Remove radv_indirect_unaligned_dispatch
- vulkan,nir: Refactor ycbcr conversion state into a struct
- radv: Use common ycbcr conversion lowering
- radv/rra: Find copy memory index when initializing the trace state
- radv/rra: Hide deferred accel struct data destruction behind an env var
- radv: Hash VK_PIPELINE_CREATE_RAY_TRACING_NO_NULL_* flags
- radv: Clean up dynamic RT stack allocation
- ac/llvm: Implement bvh64_intersect_ray_amd
- radv: Make accel struct meta state initialization thread safe
- radv: Force ACO for BVH build shaders
- radv: Pre-compile BVH build shaders if there is a cache
- radv: Advertise ray query support with LLVM
- radv/rt: Skip instances after loading the entire node
- vulkan: Add vk_acceleration_structure
- radv: Use vk_acceleration_structure
- anv: Use vk_acceleration_structure
- radv/bvh/encode: Use the node type for identifying internal nodes
- radv/bvh: Replace is_final_tree with bvh_offset
- radv/bvh/encode: Move bvh_offset NULL check to the top of the loop
- radv/bvh/encode: Introduce is_root_node
- radv/bvh/encoder: Move dst_node initialization into the loop
- radv: Add a build config for compact builds
- radv/bvh: Implement compact encoding
- radv: Use compact encoding
- radv: Move the geometry infos before the BVH
- radv/bvh: Move the size header field up
- radv/bvh: Add a shader for filling the header
- radv: Use indirect header filling for compact builds
- nir: Add cull_mask_and_flags_amd intrinsic
- radv/rt: Merge cull_mask and flags
- radv/rt: Pre shift cull_mask
- radv: Move header and geometry info init into separate functions
- radv: Only init geometry infos if RRA is enabled
- radv/rt: Use ushr for extracting the cull mask
- radv/rt: Fix updating stack_size if the shader uses scratch
- radv/rt: Use vk_pipeline_hash_shader_stage for RT stages
- vulkan: Add vk_shader_module_init
- radv/rt: Properly handle pNext of pipeline library stages
- radv/sqtt: Skip dumping pipeline libraries
- radv: Fix loading stack_size from the cache
- radv: Fix inserting stack_size into the cache
- radv/rt: Handle load_constant instructions when inlining shaders
- nir/lower_shader_calls: Remat derefs before lowering resumes
- radv/rt: Refactor rq_load lowering
- radv/rq: Rematerialize inv_dir before proceed
- radv: Set user SGPR locations when declaring args
- radv: Stop counting user SGPRS separately
- radv/ci: Update ray tracing pipeline fail/skip lists
- radv: Add radv_shader_type to fix gs_copy and trap handler handling
- radv: Remove some dead radv_shader_args setup
- aco: Remove is_gs_copy_shader
- radv: Remove has_previous_stage
- radv: Pack and encode geometry id and flags on the CPU
- radv/bvh: Remove calculate_node_bounds
- radv: Remove radv_bvh_aabb_node::aabb
- nir/lower_fp16_casts: Fix SSA dominance

Kurt Kartaltepe (1):

- drirc: Set limit_trig_input_range option for Nier games

Lang Yu (1):

- amd/common: fix a typo

Lepton Wu (1):

- egl/dri2: Use primary device in EGL device platform for kms_swrast

Lina Versace (8):

- util/glsl2spirv: Fix build with Python 3.6
- docs: Lower Python requirement to 3.6
- mailmap: Lina is Chad's new name
- mailmap: Add Lina's new google.com address
- venus: Update protocol for VK_EXT_memory_budget
- venus: Delete vn_renderer_info::has_cache_management
- venus: Refactor vn_physical_device_init_memory_properties
- venus: Enable VK_EXT_memory_budget

Lionel Landwerlin (151):

- pps: print out message when we get the first counters
- anv: record secondaries' traces into primaries
- intel/ds: track secondary cmdbuffers in perfetto
- intel/ds: move event_id access to perfetto lambda
- util/u_trace: add support for variable length trace points
- vulkan/debug_utils: copy debug util labels
- anv: add support for command buffer tagging in traces
- intel/ds: add INTEL_GPU_TRACEPOINT envvar to toggle tracepoints
- intel/ds: remove unused trace point
- intel/utrace: document tracepoints
- Revert "ci: build hasvk if we're building anv"
- intel: use a shared UUID with other drivers
- nir/divergence: add missing RT intrinsinc handling
- anv: fix generated indirect draw shader stats checks
- nir/lower_io: fix bounds checking for 64bit_bounded_global
- anv: fix preemption enable emission in gpu_memcpy
- intel/fs: avoid cmod optimization on instruction with different write_mask
- intel/decoder: print out compute push constants
- intel/common: add a INTEL_DECODE variable to parameter decoder at runtime
- vulkan/wsi/wayland: improve same gpu detection
- intel/fs: drop FS_OPCODE_UNIFORM_PULL_CONSTANT_LOAD_GFX7
- anv: fix null descriptors
- docs: list anv in EXT_extended_dynamic_state3 support
- intel: add missing PS restriction on BDW+
- anv: expose EXT_load_store_op_none
- intel/fs: make alpha_to_coverage a tristate
- anv: rename RT pipeline function helper
- anv: rename a few internal functions to highlight gfx use
- intel/ds: track end of pipe bits
- anv: use malloc for host only descriptor sets
- anv: put the video extension behind a variable
- intel/fs: fix mesh indirect movs
- intel/dev: add a default urb value for intel_stub_gpu on dg2
- anv/hasvk: handle a SAMPLED_READ/STORAGE_READ access flags
- anv: remove copied information from runtime graphics state
- anv: move 3DSTATE_VERTEX_ELEMENT emission to dynamic path
- anv: implement VK_EXT_vertex_input_dynamic_state
- intel/fs: bound subgroup invocation read to dispatch size
- hasvk: fix KHR_shader_float_controls reporting
- intel/perf: also add the oa timestamp shift on MTL
- anv: move debug submit to helper and call it on execbuf failure
- anv: track vram only BOs to print things out on ENOMEM execbuf
- anv: fix vma heap memory leak
- anv: fix invalid masking of 48bit address
- anv: remove assert typed write support when using NULL surface
- vulkan/runtime: store parameters of VK_EXT_sliced_view_of_3d
- anv: fixup Wa_16011107343 for Gfx12 only
- iris: fix Wa_16011107343 for Gfx12
- anv: remove more Gfx7 code
- genxml: Fix STATE_BASE_ADDRESS::BindlessSurfaceStateSize field size
- genxml: fix border color offset field on Gfx12+
- anv/hasvk: speed up null image/view descriptor writes
- anv: fix scratch buffer reloc in 3DSTATE_HS
- anv: fixup condition for Wa_14016118574
- anv: pull Wa_14016118574 out of some loop not changing state
- util/glsl2spirv: add support for include directive
- anv: fix incorrect parameter
- anv: correctly reset generation address on command buffer reset
- anv: fix generated forward jump with more than 67M draws
- anv: remove copied code from generation shader
- anv: remove BTI related flush in generation shaders
- anv: correctly program 3DSTATE_SF in generation shaders
- anv: limit push constant dirtyness with generation shaders
- anv: remove pre hasvk split assert
- anv: remove commented code
- anv: fix 3DSTATE_PS emission in generation shaders
- anv: fix indirect draws VF cache tracking of index buffer
- anv: make sure mi_memcpy lands before push constant loads
- anv: remove MI_NOOPs at the end of the generation batch
- anv: use a single generation shader for indirect draws
- anv: rename generated draws for Gfx11
- anv: use 64bit int support in generation shaders
- anv: pack more data into generated draws input
- anv: move common shader code into header
- anv: use a list of generated shaders
- anv: remove unused item_count parameter
- anv: add gfx9 generated draw support
- blorp: add dependency on idep_intel_dev
- vulkan/runtime: only consider slice info with 3D image views
- anv: VK_EXT_image_sliced_view_of_3d
- nir: fix nir_ishl_imm
- anv: enable VK_EXT_pipeline_library_group_handles
- anv/iris: report counter symbols with debug option
- intel/fs: report max register pressure in shader stats
- anv: report max register pressure in pipeline properties
- anv: force MEDIA_INTERFACE_DESCRIPTOR_LOAD reemit after 3D->GPGPU switch
- radv: use 1ull for alignment computations
- util: allow align64() to do alignments >= 4Gb
- docs: fix invalid link
- iris: trace frames with u_trace
- anv: export EXT_pipeline_library_group_handles only with RT
- docs: update Anv features support
- anv: more formats for acceleration structure vertices
- intel/fs: don't SEND messages as partial writes
- intel/fs: fix nir_opt_peephole_ffma max vec assumption
- intel/fs: fixup sources number from opt_algebraic
- intel/fs: add MOV source count validation
- intel/fs: prevent large vector ops generated by peephole_ffma
- intel/fs: fix subgroup invocation read bounds checking
- vulkan/wsi: add a headless swapchain implementation/option
- intel/compiler: report max dispatch width statistic
- anv: report shader max dispatch width in pipeline props
- intel/devinfo: add an option to pick platform to print
- intel/devinfo: printout URB entries
- intel/dev: use generated WA helpers for Wa_22012575642
- intel/devinfo: dedicated entries for XeHP
- intel/devinfo: initialize pci_device_id with from_pci_id()
- intel/dev: fold Gfx12 URB entries in Gfx12 HW info
- util/u_trace: move needs_cs_param option to tracepoints
- vulkan/runtime: also copy strings on queue debug utils
- intel/ds: rename frame timeline row to queue
- anv: fix incorrect utrace bo release
- anv: fixup locking for utrace submission increments
- anv: rename anv_utrace_flush_copy in anv_utrace_submit
- anv: add utrace support for queue debug utils
- anv: implement recommended flush/wait of AUX-TT invalidation
- iris: implement recommended flush/wait of AUX-TT invalidation
- anv: hash immutable sampler conversion data not pointers
- anv: compute the largest GRL kernel scratch size
- anv: move queue check helpers to anv_private
- anv: take care of maxStorageBufferRange being uint32_t
- isl: update max buffer size for SKL+
- intel/dev: set a default valid kmd_type
- intel/perf: fix OA format selection on MTL
- intel/fs: run VGRF compaction just before max live register accounting
- intel/fs: don't consider fixup_nomask_control_flow SENDs predicate
- intel/fs: UNDEF fixup_nomask_control_flow temp register
- intel/fs: copy instruction sources in logical send lowering
- intel/fs: factor out lsc surface descriptor settings
- nir: reuse nir_component_mask() where it makes sense
- nir: add 2 new intel intrinsics for uniform ssbo/shared loads
- intel/fs: optimize uniform SSBO & shared loads
- intel/fs: also allow vec8+ vectorization of load_global_const_block_intel
- anv: pass steam output as argument for anv_dump_pipe_bits
- anv: replace query flush before gpu copy by semaphore wait
- anv: fixup streamout write barriers
- intel/fs: use nomask for setting cr0 for float controls
- anv: exclude performance queries from blorp clears
- intel/ds: add a new timeline row for frames
- anv: add utrace tracking of frame boundaries
- vulkan/runtime: discard unused graphics stages in libraries
- intel/vec4: force exec_all on float control instruction
- vulkan/overlay: deal with unknown pNext structures
- isl: don't set inconsistent fields for depth when using stencil only
- isl: fix a number of errors on storage format support on Gfx9/12.5
- anv: rework Wa_14017076903 to only apply with occlusion queries
- nir/divergence: add missing load_global_constant_* intrinsics
- anv: fix anv_nir_lower_ubo_loads pass
- intel/fs: fix per vertex input clamping
- intel/compiler: make uses_pos_offset a tri-state
- intel/fs: fix scheduling of HALT instructions

Liviu Prodea (1):

- meson: Ignore unused variables when assertions are disabled

Lone_Wolf (3):

- compiler/clc: Fix embedded clang headers (microsoft-clc) for LLVM 16+
- clc: Add clangASTMatchers to fix static llvm build of microsoft-clc with LLVM 16+
- clc: Add clang frontendhlsl module to fix build of microsoft-clc with llvm 16+

Luc Ma (2):

- xlib: fix glXDestroyContext in Gallium frontends
- meson: keep Mako version checking in accord with build msg

Lucas Fryzek (11):

- crocus: Add support for \`get_screen_fd`
- tegra: Add support for \`get_screen_fd`
- nouveau: Add support for \`get_screen_fd`
- zink: Add support for \`get_screen_fd`
- iris: Add support for \`get_screen_fd`
- i915: Add support for \`get_screen_fd`
- svga: Add support for \`get_screen_fd`
- virgl: Add support for \`get_screen_fd`
- r300/r600/radeon_si: Add support for \`get_screen_fd`
- d3d12/llvmpipe/softpipe: Add support for \`get_screen_fd`
- gallium: Modify default path for DMABUF to use DRM

Lucas Stach (7):

- etnaviv: don't drop TS capability on GPUs with MMUv2
- etnaviv: drm: fix BO array leaks
- etnaviv: free pm queries dynarray on screen destroy
- etnaviv: drm: fix check if BO is on a deferred destroy list
- etnaviv: fix double scanout import of multiplanar resources
- etnaviv: flush VS texture cache when texture data is changed
- etnaviv: fix texture barrier implementation

Luigi Santivetti (4):

- pvr: fix uses_tile_buffers in clear color attachment
- pvr: add support for tile buffer output clear
- pvr: add padding bytes when allocating buffer memory
- pvr: fix segfault in dEQP-VK.ycbcr.query.*

Luna Nova (3):

- device_select_layer: fix inverted strcmp in device_select_find_dri_prime_tag_default (v1)
- device_select_layer: apply DRI_PRIME even if default device is > 1 to match opengl behavior
- device_select_layer: pick a default device before applying DRI_PRIME

Lynne (1):

- aco_validate: allow for wave32 in p_dual_src_export_gfx11

M Henning (1):

- nouveau/codegen: Check nir_dest_num_components

Maarten Lankhorst (1):

- iris: Place scanout buffers only into lmem for discrete GPUs

Marcin Ślusarz (23):

- intel/compiler: fix generation of vec8/vec16 alu instruction
- intel/compiler/mesh: handle const data in task & mesh programs
- intel/compiler: fine-grained control of dispatch widths
- nir: add nir_mod_analysis & its tests
- intel/compiler/mesh: optimize indirect writes
- intel/compiler/mesh: support longer write messages
- intel/compiler/mesh: remove dead code path supporting >4 dword writes
- intel/compiler/mesh: use U888X packed index format
- anv: bump ANV_MAX_QUEUE_FAMILIES
- intel/compiler: replace gl_Layer & gl_ViewportIndex by 0 in fs if ms doesn't write it
- anv: fix how unset gl_Viewport & gl_Layer are handled in mesh case
- intel/compiler/mesh: use slice id of task urb handles in mesh shaders
- anv: enable task redistribution
- intel/compiler/mesh: apply URB payload mask once per program
- intel/compiler/mesh: follow the type of offset variable
- intel/compiler: remove unused field from fs_thread_payload
- anv: halve the push constants space in mesh pipelines
- crocus/meson: add back dependency on libintel_dev
- anv,hasvk: remove stale comments
- anv: call nir_shader_gather_info early
- anv: work around for per-prim attributes corruption
- intel/compiler: compactify locations of mesh outputs
- anv: ignore structure types handled in vk_device_memory_create

Marek Olšák (212):

- glthread: fix an upload buffer leak
- util: fix util_is_vbo_upload_ratio_too_large
- mesa: allow GL_UNSIGNED_INT64_ARB as vertex format for ARB_bindless_texture
- glapi: autogenerate function parameters with no space between * and variable
- glthread: handle GL_*_ARRAY in glEnable/Disable
- glthread: set GL_OUT_OF_MEMORY if we fail to upload indices
- glthread: set GL_OUT_OF_MEMORY if we fail to upload vertices
- glthread: execute glMultiDrawArrays(draw_count < 0) asynchronously
- glthread: change multi_draw_elements_async() to never fail due to large size
- glthread: do vertex uploads if an index buffer is present for glDrawElements
- mesa: move gl_vertex_format_user definition into glthread.h
- glthread: pack and name the type of glthread_vao::Attrib
- glthread: make marshal functions for glBegin/End attribs non-static
- glthread: remove the vbo_upload_ratio_too_large fallback for glMultiDrawElements
- glthread: do vertex uploads if an index buffer is present for MultiDrawElements
- glthread: disallow glthread if buffer uploads are unsupported
- ac/llvm: run the LLVM sinking pass because LLVM will stop running it
- ac/llvm: run the IPSCCP pass
- ac/llvm: remove llvm:: now that we use "using namespace llvm"
- amd: update amdgpu_drm.h
- ac/gpu_info: add PCIe info
- radeonsi/ci: update gfx10.3 results
- radeonsi/ci: add gfx1100 results
- radeonsi: fix RB+ blending with sRGB formats
- radeonsi/gfx11: unset SAMPLE_MASK_TRACKER_WATERMARK to fix hangs
- amd: split GFX1103 into GFX1103_R1 and GFX1103_R2
- amd: fix tile_swizzle on gfx11 - should be shifted by 10 bits, not
8 3295- amd: update SX_BLEND_OPT_EPSILON.MRT0_EPSILON enum definitions 3296- amd: update shadowed register tables for gfx11 3297- amd: improve RB+ blending precision 3298- radeonsi: implement RB+ depth-only rendering for better perf 3299- radeonsi/gfx11: remove the INST_PREF_SIZE workaround 3300- radeonsi/gfx11: add a comment why we use PRIM_GRP_SIZE <= 252 3301- radeonsi/gfx11: adjust ACCUM_* fields for tessellation 3302- radeonsi/gfx11: fix blend->cb_target_mask dependency for shader keys 3303- radeonsi/gfx11: move the PIXEL_PIPE_STAT_CONTROL event into the GFX preambles 3304- radeonsi/gfx11: use new packet EVENT_WRITE_ZPASS 3305- radeonsi: deduplicate VS/TES/GS update code 3306- radeonsi/gfx11: always set MSAA_NUM_SAMPLES=0 for DCC_DECOMPRESS 3307- radeonsi: merge si_ps_key_update_framebuffer_blend & .._update_blend_rasterizer 3308- radeonsi: determine alpha_to_coverage robustly in si_update_framebuffer_blend_rasterizer 3309- radeonsi: never set INTERPOLATE_COMP_Z 3310- amd: unify and tune the attribute ring size for gfx11 3311- amd: change pbb_max_alloc_count for gfx11 3312- amd: update the cache size for gfx1103_r1 3313- amd: update late_alloc_wave64 for gfx11 3314- amd: sort and re-indent packet definitions 3315- amd: fix typo in shadowed uconfig registers on gfx11 3316- amd: document OOB behavior on gfx11 3317- amd/registers: remove confusing definitions from gfx10-rsrc.json 3318- radeonsi: set NEVER as the depth compare func if depth compare is disabled 3319- amd/llvm: fix LLVM 15 & 16 crashes in SelectionDAG.cpp 3320- radeonsi: call ac_init_llvm_once before any util_queue initialization 3321- radeonsi: set sampler COMPAT_MODE in the corresponding branch 3322- amd/ci: update sanctuary trace sha1 3323- radeonsi/gfx11: don't add mrt0 export for alpha-to-coverage if mrtz is present 3324- radeonsi/gfx11: don't add alpha to mrt0 format for A2C if exporting via mrtz 3325- amd: define new SET_*_REG_PAIRS packets 3326- radeonsi: clean up 
si_set_mutable_tex_desc_fields 3327- amd/surface: clean up is_dcc_supported_by_L2 3328- amd,util: fix how lod bias is converted to fixed-point 3329- amd: don't hardcode real VGPR allocation granularity on gfx10.3 and gfx11 3330- glthread: track the current element array buffer in the Core profile too 3331- mesa: ignore indices[i] if count[i] == 0 for MultiDrawElements 3332- glthread: initialize indices[i] for no-op MultiDrawElements 3333- glthread: upload non-BO indices in the core profile to fix GStreamer 3334- glthread: add a heuristic to stop locking global mutexes with multiple contexts 3335- glthread: ignore non-VBO vertex arrays with NULL data pointers 3336- Revert "ci/zink: Disable Amnesia trace until the linked issue gets fixed." 3337- glthread: rewrite glMultiDrawArrays to never fail to upload vertices 3338- glthread: change glMultiDrawElements to execute draw_count < 0 asynchronously 3339- glthread: don't execute glDraw code if we're inside glBegin/End 3340- glthread: don't pass index bounds to the driver for async calls 3341- glthread: move some draw call parameters closer to their use 3342- glthread: don't bind/unbind uploaded indexbuf, pass it to glDraw directly 3343- glthread: don't bind/unbind uploaded indexbuf, pass it to glMultiDraw directly 3344- glthread: track vertex formats for all attributes 3345- glthread: add a vertex upload path that unrolls indices for glDrawElements 3346- glthread: reorder draw code a little 3347- glthread: add ctx->GLThread.draw_always_async to simplify draw checking 3348- glthread: remove goto statements and add unlikely() into draw functions 3349- glthread: inline draw functions that have only one use 3350- glthread: don't execute Draw and BufferSubData calls if the context is lost 3351- glthread: handle non-VBO uploads for glMultiModeDraw{Arrays,Elements}IBM 3352- glthread: add API to allow passing DrawID from glthread to mesa 3353- glthread: convert (Multi)DrawIndirect into direct if user buffers are present 3354- 
glthread: remove unnecessary debug code 3355- glthread: don't free glthread for GL_DEBUG_OUTPUT_SYNCHRONOUS, only disable it 3356- glthread: don't restore non-VBO vertex arrays after all draws 3357- Revert "radeonsi/ci: Update stoney test expectations" 3358- radeonsi: fix COMPAT_MODE on gfx8-9 3359- amd: fix LOD_BIAS on gfx6-9 and adjust the lod bias CAP 3360- amd: add missing gfx11 register definitions 3361- amd: bump AMD_MAX_SE and change the CU mask type to 16 bits 3362- radeonsi/gfx11: fix the CU_EN clear mask for RSRC4_GS 3363- radeonsi/gfx11: don't set non-existent VGT_STRMOUT_BUFFER_CONFIG 3364- radeonsi/gfx11: set CB_COLORi_INFO.MAX_COMP_FRAG on GFX1103_R2 3365- radeonsi: move a few DB_SHADER_CONTROL states into si_shader_ps 3366- radeonsi: change si_shader::ctx_reg to a nameless union for better readability 3367- radeonsi: remove no-op setting of THDS_PER_SUBGRP 3368- radeonsi: use SPI_SHADER_USER_DATA_HS_0 definition instead of LS_0 3369- radeonsi: set PA_SU_VTX_CNTL consecutively with PA_CL_GB_VERT_CLIP_ADJ 3370- radeonsi/gfx11: ignore alpha_is_on_msb because the hw ignores it 3371- radeonsi: replace si_screen::has_out_of_order_rast with the radeon_info field 3372- radeonsi: disable Smart Access Memory because CPU access has large overhead 3373- amd,radeonsi: remove unused LLVM functions 3374- amd/registers: unify VRS combiner definition names between gfx103 and gfx11 3375- amd: replace SI_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN 3376- radeonsi: remove returns from si_emit_global_shader_pointers 3377- radeonsi: reformat emit_cb_render_state, create_blend_state, create_rs_state 3378- radeonsi: remove a gfx11 check in si_shader_gs (legacy GS) 3379- radeonsi: remove unused VS_STATE_LS_OUT_PATCH_SIZE 3380- radeonsi: always add 1 to lshs_vertex_stride now that LS_OUT_PATCH_SIZE is gone 3381- radeonsi: correct and clean up obsolete vs_state_bits comments 3382- radeonsi: rename esgs_itemsize -> esgs_vertex_stride 3383- amd: query the per-SIMD VGPR counts from the 
kernel, don't hardcode them 3384- radeonsi: don't clamp z_samples to fix Unreal Tournament 99 3385- amd/registers: only define SPI and COMPUTE registers in the 0xB000 range 3386- radeonsi: reorganize emit_db_render_state and simplify VRS code 3387- radeonsi: reorganize si_initialize_color_surface for better readability 3388- radeonsi: reorganize si_init_depth_surface for better readability 3389- radeonsi: don't set PACKET_TO_ONE_PA for line stippling 3390- radeonsi/gfx11: change the default of COMPUTE_DISPATCH_INTERLEAVE to 256 3391- amd: implement conformant TRUNC_COORD behavior for gfx11 3392- amd/gpu_info: add a workaround for SI_FORCE_FAMILY=gfx1100 3393- nir,amd: add and use nir_intrinsic_load_esgs_vertex_stride_amd 3394- nir: lower to fragment_mask_fetch/load_amd with EQAA correctly 3395- glthread: fix a perf regression due to draw_always_async flag, fix DrawIndirect 3396- mesa: fix glPopClientAttrib with fixed-func VP and zero-stride varyings 3397- mesa: remove a redundant call to _mesa_update_edgeflag_state_vao 3398- mesa: initialize VertexProgram._VaryingInputs before the first use 3399- amd: update amdgpu_drm.h 3400- amd,radeonsi: change enabled_rb_mask to 64 bits 3401- amd: query cache sizes from the kernel 3402- ac/nir: don't use load_esgs_vertex_stride_amd on gfx6-8 3403- amd: massively simplify how info->spi_cu_en is applied 3404- amd/rtld: allow 64K LDS for all shader stages except for gfx6 3405- radeonsi/ci: update flakes and gfx8-polaris11 results 3406- radeonsi: remove Smart Access Memory because CPU access has large overhead 3407- radeonsi: reorganize si_emit_framebuffer_state for better readability 3408- radeonsi: don't merge SET_* packets that have a different index in si_pm4_state 3409- radeonsi: reindent code in si_state_binning.c 3410- radeonsi: add si_pm4_set_reg_va to simplify setting reg_va_low_idx for RGP 3411- radeonsi: check the pm4.reg_va_low_idx assertion unconditionally 3412- radeonsi: simplify encoding VGPRS and SGPRS 3413- 
radeonsi: assume shader is never NULL in si_emit_shader_* 3414- nir: return progress from nir_lower_io_to_scalar 3415- nir: skip nir_op_unpack_32_4x8 in nir_lower_alu_width 3416- ac/nir: add ac_nir_lower_subdword_loads to lower 8/16-bit loads to 32 bits 3417- aco: implement nir_op_unpack_32_4x8 3418- ac/llvm: implement nir_op_unpack_32_4x8 3419- amd: lower subdword UBO loads in NIR 3420- amd: lower multi-component subdword SSBO loads in NIR 3421- lavapipe/ci: add a new flake 3422- amd: add nir_intrinsic_xfb_counter_sub_amd and fix overflowed streamout offsets 3423- amd/llvm,radeonsi/gfx11: switch to using GDS_STRMOUT registers 3424- radeonsi/gfx11: only allocate GDS OA for streamout, GDS memory is not needed 3425- radeonsi: emulate VGT_ESGS_RING_ITEMSIZE in the shader on gfx9-11 3426- radeonsi: merge si_emit_initial_compute_regs with si_init_cs_preamble_state 3427- radeonsi: separate nir_texop_descriptor_amd lowering 3428- radeonsi: lower nir_texop_sampler_descriptor_amd 3429- radeonsi: set pm4.atom.emit in si_get_shader_pm4_state 3430- radeonsi: reindent si_shader_ls, si_shader_es, si_shader_gs, si_shader_vs 3431- radeonsi: reorganize si_shader_hs 3432- radeonsi: reorganize si_shader_ngg 3433- radeonsi: reorganize si_shader_ps 3434- radeonsi: other cosmetic changes in si_state_shaders.cpp 3435- radeonsi: allow using 64K LDS for NGG to allow larger workgroups 3436- radeonsi: increase NGG workgroup size to 256 for VS/TES with streamout and GS 3437- glapi: move files specific to shared-glapi into the shared-glapi subdirectory 3438- glapi: inline the meson list files_mapi_util 3439- mesa: move ctx->Table -> ctx->Dispatch.Table except Client & MarshalExec 3440- mesa: rename CurrentClientDispatch to GLApi 3441- mesa: put dispatch table initialization into one place 3442- glthread: qualify the \*cmd unmarshal parameter with restrict 3443- vbo: fix current attribs not updating gallium vertex elements 3444- radeonsi: remove unused TCS/TES SGPR fields 3445- radeonsi: dump 
shader stats only if dumping asm shaders 3446- radeonsi: replace nonir,noir,noasm,preoptir options with new reworked options 3447- radeonsi: remove duplicated gfx11 check in si_msaa_resolve_blit_via_CB 3448- radeonsi: rework MSAA resolve averaging to exploit instruction-level parallelism 3449- radeonsi: add AMD_DEBUG=nowcstream to enable caching for stream_uploader 3450- radeonsi: don't print the base non-view texture format for AMD_TEST=computeblit 3451- radeonsi: fix AMD_TEST=computeblit being rejected on gfx < 11 3452- radeonsi: don't convert to fp16 in the compute blit if not testing 3453- radeonsi: don't use fp16_rtz for FP formats in the compute blit 3454- radeonsi: correct an assertion if we get a display list with no vertex buffers 3455- ac/nir: don't emit duplicated parameter exports 3456- ac/nir: use plural correctly in the ac_nir_export_parameters name 3457- radeonsi: remove unused vs_output_param_mask 3458- egl: reorder code in _eglQueryDevicesEXT, add \*swrast variable 3459- egl: don't expose swrast device if swrast is not built 3460- amd/llvm: fix handling of unsupported vec3 loads on gfx6 3461- amd/llvm: remove no-op code for vec3 loads in ac_build_tbuffer_load 3462- amd: update addrlib 3463- amd: rename GFX1036 -> RAPHAEL_MENDOCINO 3464- amd: set the correct LLVM processor name for gfx1036 3465- radeonsi/gfx11: reduce MSAA samples to 8 for no-attachment framebuffer 3466- radeonsi: simplify binning settings to work around GPU hangs 3467- amd: add gfx940 register definitions 3468- amd: add initial code for gfx940 3469- radeonsi: use COMPUTE_DISPATCH_SCRATCH_BASE on gfx940 3470- radeonsi: always use ffma32 on gfx940 3471- ac/surface: force linear image layout for chips not supporting image opcodes 3472- radeonsi: add an emulated image descriptor for gfx940 3473- ac/nir: implement image opcode emulation for CDNA, enable it in radeonsi 3474- radeonsi: don't set registers that don't exist on gfx940 3475- amd/registers: simplify integer division by 0x1000 
in the parser 3476- amd/registers: fix the parser to include CP_COHER registers for gfx940 3477- amd/registers: update gfx940.json 3478- amd/registers: use gfx9 packet definitions for gfx940 3479- nir: fix 2 bugs in nir_create_passthrough_tcs 3480 3481Mario Kleiner (1): 3482 3483- v3dv: Enable (leased) direct display extensions. 3484 3485Mark Collins (4): 3486 3487- meson: update flex/bison requirement to cover all usages 3488- meson: forcefully disable libdrm when host doesn't have it 3489- tu: KGSL backend rewrite 3490- tu: fix tu_GetInstanceProcAddr not handling null instance 3491 3492Mark Janes (13): 3493 3494- intel: Implement Wa_16011448509 3495- util: add macro to support gcc/clang poison 3496- intel/dev: generate helpers to identify platform workarounds 3497- intel/dev: Print required workarounds with intel_dev_info 3498- intel/fs: use generated workaround helpers for Wa_14010017096 3499- intel/fs: use generated helpers for Wa_1209978020 / Wa_18012201914 3500- intel/fs: use generated workaround helpers for Wa_14017989577 3501- intel: use generated workaround helpers for Wa_1409600907 3502- intel: use generated helpers for Wa_1409433168/Wa_16011107343 3503- intel/fs: use generated helpers for Wa_14013363432 / Wa_14012688258 3504- intel/dev: fix macro string concatenation for INTEL_WA_{id}_GFX_VER 3505- intel/dev: fix macro naming convention in gen_wa_helpers.py 3506- intel/dev: use GFX_VERx10 to detect genX compilation 3507 3508Martin Roukala (né Peres) (22): 3509 3510- ci/deqp-runner: compress results.csv before uploading it to GitLab 3511- ci/piglit: compress results.csv before uploading it to GitLab 3512- zink/ci/radv: remove a test from the fails list 3513- zink/ci: add a fail to the VG flake list 3514- zink/ci: relocate radv testing from radv's gitlab-ci.yml 3515- zink/ci: add spec\@!opengl 1.1\@line-smooth-stipple to the fails list 3516- ci/b2c: uprev to b2c v0.9.9 3517- ci/debian/x86_test-vk: drop an outdated dependency 3518- ci/core-manual-rules: 
enclose the whole condition in quotes 3519- zink/ci: allow running manual jobs again on RADV 3520- ci/init-stage2: allow sourcing the job env vars from the CWD 3521- ci/init-stage2: always set XDG_RUNTIME_DIR 3522- ci/b2c: move away from the hand-rolled initscript 3523- ci: bring back the valve farm online 3524- ci/valve-farm-rules: allow running jobs from outside the mesa namespace 3525- radv/ci: reduce the parallelism for vkcts-vangogh 3526- zink/ci: increase the parallelism of zink-radv-vangogh-valve 3527- zink/ci: update the radv expectations 3528- radv/ci: update VanGogh's expectations 3529- ci/b2c: increase the console timeout to 4 minutes 3530- radv/ci: update the navi10 expectations 3531- zink/ci: add a test to the fails list 3532 3533Matt Coster (6): 3534 3535- pvr: Extract setup of winsys job submit flags into separate functions 3536- pvr: Add support for geometry-only render jobs 3537- pvr: Add pvr_csb_bake() 3538- pvr: Rename global_queue_job_count to global_cmd_buffer_submit_count 3539- pvr: Split render job submission for multi-layer framebuffers 3540- pvr: Add firmware stream support for transfer submit 3541 3542Matthieu Bouron (1): 3543 3544- lavapipe: honor dst base array layer when resolving color attachments 3545 3546Mauro Rossi (1): 3547 3548- hasvk: include "vk_android.h" header in anv_android.c 3549 3550Maíra Canal (1): 3551 3552- v3dv: remove unused clamp_to_transparent_black_border property 3553 3554Michel Dänzer (29): 3555 3556- mesa/st: Fix GL_EXT_texture_type_2_10_10_10_REV name in comment 3557- mesa/st: Handle all 10 bpc types in st_choose_format 3558- glsl/standalone: Fix up _mesa_reference_shader_program_data signature 3559- glsl/standalone: Do not pass memory allocated with ralloc_size to free 3560- anv/grl: Use union for reinterpreting integer as float 3561- clover: Reserve vector memory in make_text_section 3562- ci: Update Fedora image to 36 3563- ci: Re-enable intel-clc in fedora-release job 3564- ci: Enable i915 Gallium driver in 
fedora-release job 3565- ci: Enable the hasvk Vulkan driver in the fedora-release job 3566- frontend/dri: Initialize callbacks in dri_swrast_kms_init_screen 3567- nouveau: Make getSize return unsigned int 3568- r600: Use container_of instead of direct pointer cast 3569- crocus: Use ralloc_free for memory allocated with rzalloc 3570- iris: Use ralloc_free for memory allocated with rzalloc 3571- ci: Remove some -Werror workarounds for debian-android job 3572- ci: Split up -Werror workarounds for debian-mingw32-x86_64 job 3573- intel/vk/grl: Do not use no_override_init_args for C++ 3574- ci: Pass -Werror to compiler linking stage for LTO 3575- ci: Allow passing c{,pp}_link_args to meson 3576- ci: Make ccache optional 3577- ci: Drop ccache from Fedora image 3578- ci: Install procps-ng in Fedora image 3579- ci: Enable LTO for fedora-release job 3580- vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions 3581- svga: Make vmw_svga_winsys_buffer_map definition match declaration 3582- svga: Make declaration of emit_input_declaration match definition 3583- clover/llvm: Use llvm::DataLayout::getABITypeAlign with LLVM >= 16 3584- clover/llvm: Use std::nullopt already with LLVM 16 3585 3586Michel Zou (4): 3587 3588- ci/mingw: drop useless -Wno-error flags 3589- vulkan/wsi: fix -Wnarrowing warning 3590- vk/entry_points:: fix mingw build 3591- mesa/draw: fix -Wformat warning 3592 3593Mike Blumenkrantz (536): 3594 3595- zink: simplify get_slot_components() for xfb emission 3596- zink: add renderdoc handling 3597- zink: prune old swapchains on present 3598- zink: break out implicit feedback loop detection into separate function 3599- zink: set textures_used in analyze_io 3600- zink: outdent code in add_implicit_color_feedback_loop() 3601- zink: make implicit feedback loop application stricter 3602- zink: skip implicit feedback loop layout changes if feedback loop not present 3603- zink: store drm format as internal_format for imported resources 3604- zink: handle 
modifier nplanes queries correctly for planar formats 3605- zink: NV_compute_shader_derivatives 3606- zink: preserve present resources during async presentation 3607- zink: add a util function for creating semaphores 3608- zink: add a binary semaphore cache 3609- zink: move semaphore caching to zink_reset_batch_state() 3610- zink: consolidate semaphore creation where possible 3611- zink: simplify some dynarray concat descriptor code 3612- zink: delete need_blend_constants 3613- zink: don't use ds3 blend states without color attachments 3614- radv: repack radv_graphics_pipeline struct 3615- radv: reorder dynamic state checks during bind 3616- radv: simplify depth aspect check in radv_handle_image_transition() 3617- radv: add some graphics pipeline hints to optimize pipeline bind 3618- radv: remove redundant type sizing 3619- radv: add an early out in radv_cmd_buffer_flush_dynamic_state() 3620- zink: use actual swapchain object for surface comparison 3621- radv: stop using radv_pipeline_has_stage() in BindPipeline 3622- zink: flag old-style shadow tex mask for fragment shaders 3623- zink: break out tex dest rewriting into separate function 3624- zink: add an extra_data param to zink_shader_compile 3625- zink: track depth swizzle on samplerviews 3626- zink: add a fs shader key member to indicate depth texturing mode 3627- zink: rework depth sampler splatting in shaders 3628- zink: block pipeline fast-pathing for any programs using depth texture modes 3629- zink: plug in the program/module parts of shadow texture mode emulation 3630- zink: create another samplerview for shadow textures 3631- zink: remove old depth swizzle workaround 3632- zink: pass depth swizzle data block to shader compile 3633- mesa: remove dead parameter doc for _mesa_new_texture_object() 3634- mesa: populate gl_program::ShadowSamplers mask from shader data 3635- mesa: (more) correctly handle incomplete depth textures 3636- zink: fix implicit feedback loop detection 3637- radv: Move constant 
flushing check out to callers. 3638- zink: fix VK_DYNAMIC_STATE_LINE_WIDTH usage 3639- zink: move barrier jit to zink_context.c 3640- zink: don't skip repeated handling feedback loops 3641- zink: return false for implicit feedback loop check with image binds 3642- zink: update sampler layout when detecting feedback loop for first time 3643- zink: force GENERAL layout for all fb attachments with image binds 3644- zink: validation ci updates 3645- zink: reorder commands more aggressively 3646- Revert "zink: allow direct memory mapping for any COHERENT+CACHED buffer" 3647- zink: fix heap/memory type selection 3648- zink: add VK_BUFFER_USAGE_SHADER_DEVICE_ADDRESS_BIT for buffers if ext is enabled 3649- zink: set vkusage/vkflags for buffer resource objects 3650- zink: assert that buffer descriptor usage is populated before bind 3651- zink: always set RESOURCE usage for descriptor buffers 3652- zink: disable bindless texture ext with descriptor buffer 3653- zink: set VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT when using DB 3654- zink: don't add dmabuf export type if dmabuf isn't supported 3655- zink: stop leaking push descriptor templates 3656- zink: always unref old images when adding new binds 3657- zink: hook up VK_EXT_multisampled_render_to_single_sampled 3658- zink: shrink zink_render_pass_state::msaa_expand_mask 3659- zink: use VK_EXT_multisampled_render_to_single_sampled for EXT_multisample_render_to_texture 3660- lavapipe: move noop fs creation to device 3661- lavapipe: add refcounting for shader nir 3662- lavapipe: refcount nir shaders instead of cloning 3663- lavapipe: break out (and slightly refactor) gallium shader cso creation 3664- lavapipe: create gfx gallium csos at pipeline bind 3665- lavapipe: delete unused pipelines immediately 3666- lavapipe: delete lvp_pipeline::mem_ctx 3667- lavapipe: try harder to reuse pipeline layouts during merge 3668- zink: only set VkPipelineColorBlendStateCreateInfo::attachmentCount without full ds3 3669- zink: fix 
zink_mem_type_idx_from_bits() 3670- zink: rework descriptor buffer templating to use offsets 3671- Revert "zink: fix zink_mem_type_idx_from_bits()" 3672- zink: enable PIPE_CAP_ALLOW_GLTHREAD_BUFFER_SUBDATA_OPT 3673- zink: make bindless buffer_infos a union 3674- zink: fix bindless struct member comments 3675- zink: skip updating descriptor buffer sets that aren't active 3676- zink: set VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT on compute pipelines 3677- zink: break out descriptor binding into separate function 3678- zink: add a flag to indicate whether a descriptor buffer is bound 3679- zink: implement descriptor buffer handling of bindless texture 3680- zink: enable bindless texture with ZINK_DESCRIPTORS=db 3681- zink: free descriptor buffer maps on batch state destroy 3682- zink: fix more cases of heap/memtype suballocator mismatch 3683- zink: cache and reuse dummy inputattachment for fbfetch 3684- zink: handle missing line rasterization modes with ds3 3685- zink: add back VK_DESCRIPTOR_BINDING_PARTIALLY_BOUND_BIT for bindless 3686- gallium: add PIPE_CAP_NULL_TEXTURES 3687- radeonsi: set PIPE_CAP_NULL_TEXTURES 3688- zink: conditionally enable PIPE_CAP_NULL_TEXTURES 3689- zink: fix max acquired image count 3690- lavapipe: disable VK_FORMAT_FEATURE_2_COLOR_ATTACHMENT_BLEND_BIT for int formats 3691- zink: set PIPE_CAP_VALIDATE_ALL_DIRTY_STATES 3692- zink: move bindless_layout to screen and init on creation 3693- zink: take screen param in init_db_template_entry() 3694- zink: const-ify a bunch of shader key inlines 3695- zink: move gpl usability checks to static inline for reuse 3696- zink: remove duplicated gpl output blend initializations 3697- zink: store last pipeline directly for zink_gfx_program::last_pipeline 3698- zink: pass screen to descriptor_util_pool_key_get() 3699- zink: delete zink_screen::framebuffer_cache 3700- zink: make last_vertex_stage the first bit in zink_vs_key_base 3701- zink: ralloc zink_shader structs 3702- zink: add a define for the 
"default" optimal key 3703- zink: add a define for testing that an optimal key is the default 3704- zink: add VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT for gpl libs 3705- zink: don't set blend_id with full_ds3 3706- zink: set gfx feedback loop bit in pipeline state for driver workaround 3707- zink: set zs feedback loop bit from driver workaround on ctx create 3708- zink: fix gpl lib hashing 3709- zink: use screen indexing for bindless descriptor set in db bind 3710- zink: use screen indexing for bindless descriptor set in template bind 3711- util/vbuf: fix multidraw unrolling 3712- zink: flag bindless_init before calling zink_batch_bind_db() in init 3713- zink: avoid the descriptor set multiplier for bindless buffers 3714- zink: split out VkShaderModule creation 3715- zink: add flags param to zink_pipeline_layout_create() 3716- zink: split out gfx pipeline library creation 3717- zink: add gpl flags for libraries based on shaders passed 3718- zink: allow multiple gpl libraries in zink_create_gfx_pipeline_combined() 3719- zink: move gpl input/output funcs to zink_pipeline.c 3720- zink: enable combining intermediate gpl libs from combine function 3721- zink: use GPL to handle (simple) separate shader objects 3722- zink: set PIPE_CAP_SURFACE_REINTERPRET_BLOCKS 3723- zink: store gfx_hash on zink_gfx_program 3724- zink: break out zink_gfx_program::libs into refcounted object 3725- zink: implement cross-program pipeline library sharing 3726- zink: add newlines to some debug printfs 3727- zink: rename some variables in zink_set_shader_images() 3728- zink: unset gfx shader read when unbinding shader images 3729- zink: remove stale comment 3730- zink: unref image buffer descriptors on unbind 3731- zink: rework set_shader_images() hook 3732- zink: pull out image descriptor updating in set_shader_images 3733- zink: add a local is_compute var for set_shader_images 3734- zink: allocate all batch command buffers in one call 3735- zink: sync LTO compiles for GPL pipelines on 
shader free 3736- zink: fix descriptor pool free iterating 3737- zink: don't fetch/update pipeline cache for separate shader programs 3738- zink: assert that the found program matches the expected one in shader_free 3739- zink: flag gfx programs as removed-from-cache by default 3740- zink: rework separate shader descriptor iterating 3741- zink: use a single descriptor buffer for all non-bindless types 3742- zink: add an io assignment pass for separate shaders 3743- zink: rename a struct member for clarity 3744- zink: move zink_batch_state::db_bound reset to zink_batch_descriptor_reset() 3745- zink: move db_bound to batch descriptor data 3746- zink: ensure db is bound before separate shader update 3747- zink: store base descriptor size on the screen 3748- zink: implement a scaling descriptor buffer size 3749- zink: calloc separable program zink_gfx_library_key struct 3750- zink: also replace hash_entry::key when replacing separable program 3751- zink: always use NEAREST for zs blits 3752- zink: fix indentation of rebind_image() 3753- zink: only try for a fb rebind if fb binds exist in rebind_image() 3754- zink: account for null surface when trying to retain clears on fb bind 3755- zink: break out pipe_surface init for new surface creation 3756- zink: const-ify a surface param 3757- zink: don't handle mutable init on surface creation with tc enabled 3758- zink: verify compressed format layer count when creating surfaces 3759- zink: set ZINK_DESCRIPTORS=db for radv jobs 3760- zink: enable renderpass optimizations by default for selected drivers 3761- Revert "zink: always use NEAREST for zs blits" 3762- zink: block LINEAR filtered blits for zs formats 3763- lavapipe: enable linear filtering for depth formats 3764- gallium: plumb resolve attachments through from frontends -> pipe_framebuffer_state 3765- dri3: avoid deadlocking when polling deleted windows for events 3766- vulkan/wsi: avoid deadlocking dri3 when polling deleted windows for events 3767- vl/dri3: avoid 
deadlocking when polling deleted windows for events
- zink: delete some now-broken ntv dref sampling code
- zink: more accurately handle i/o for separate shaders
- zink: handle semi-matching i/o for separate shaders
- zink: enable renderpass optimizing on lavapipe
- kopper: fix loop iterating for msaa texture creation
- zink: move db input attachment size check to screen init
- zink: make ZINK_DESCRIPTOR_MODE=db the default
- driconf: add zink glthread disable for a game
- aux/tc: add a 'has_resolve' member to tc_renderpass_info
- zink: actually hook up ZINK_DEBUG=norp
- zink: add ZINK_DEBUG=map
- zink: add debug marker tracing for qbo updates
- util/box: add intersection test functions for 1d/3d
- zink: add some tracking for copy box regions
- zink: add a util function for optimizing TRANSFER_DST image barriers
- zink: utilize copy box tracking to avoid barrier emission for buf2img copies
- zink: fix slab allocator sizing
- zink: delete dead uniform variables
- zink: fix shader read access removal for barrier generation
- zink: rework descriptor unbind params to use is_compute directly
- zink: fix bindless texture barrier generation
- zink: delete unused emit_image param in ntv
- zink: simplify/rework image typing in ntv
- zink: avoid adding ubo/ssbo bindings multiple times for different bitsizes
- zink: add locking for zink_screen::copy_context and defer creation
- zink: allow direct memory mapping for any COHERENT+CACHED buffer
- lavapipe: EXT_image_sliced_view_of_3d
- vulkan/wsi: switch to using an options struct for last param
- vulkan/wsi/x11: make 4 image minimum for xwayland driver-specific
- docs: add pipeline library support for tu
- aux/tc: track whether queries have been terminated in a renderpass
- aux/tc: only call tc_parse_draw() when parsing renderpass info
- zink: move zink_batch_no_rp call for query reset
- zink: remove suspended queries from list before resuming
- zink: reset queries on the promoted cmdbuf when possible
- zink: rewrite zink_query_start struct
- zink: fix zink_query_start initialization
- zink: fix possible query destroy leak
- zink: make zink_vk_query unref consistent
- zink: delete zink_query::last_start_idx
- zink: handle multiple query starts in qbo update
- zink: don't auto-sync qbos on query end
- zink: add zink_query::suspended to indicate suspended state of queries
- zink: un-suspend queries if they end while suspended
- zink: don't double suspend queries
- zink: refuse to start cs invocation queries in renderpass
- zink: only try doing qbo updates on query suspend if !in_rp
- zink: handle null query results
- zink: handle null query results for conditional render
- zink: only update qbo for TIME_ELAPSED on start if !in_rp
- zink: try updating qbos on query resume if !in_rp
- zink: reorder some query code
- zink: rework find_or_allocate_qp()
- zink: create/use query pools dynamically
- zink: rework query pool overflow
- zink: track whether queries were started in a renderpass
- zink: break out query suspend functionality for reuse
- zink: use more consistent check for deleting zink_query::stats_list links
- zink: pull 'was_line_loop' into ctx for query updating
- zink: always start/stop/resume queries inside renderpasses
- zink: use tc renderpass optimizing to more optimally start queries
- zink: skip buffer barriers for ACCESS_NONE -> ACCESS_READ / ACCESS_WRITE
- zink: disable queries for clear_texture()
- zink: resume queries after conditional render and clears are processed
- zink: only resume queries inside renderpasses from set_active_query_state
- zink: track whether a primgen query is suspended and needing color write hacks
- zink: rework xfb queries for drivers with poor primgen support
- zink: merge qbo update copies when possible
- zink: set predicate_dirty on query creation
- zink: eliminate internal qbo copy barrier
- util/box: fix off-by-one calc error in intersection funcs
- zink: avoid ballooning of copy box tracking
- zink: add perfetto tracing for barriers
- zink: avoid unnecessary read-only layout changes for zs attachments
- zink: skip attachment barrier for redundant layout-setting if !valid
- zink: add functions for faster batch-usage completion checks
- zink: fix zink_resource_access_is_write()
- zink: use split image barriers if the image can be easily proved idle
- zink: skip buffer barriers if the buffer can be easily proved idle
- zink: determine whether debug markers are used on screen create
- zink: be more descriptive with perfetto buffer barriers
- zink: reorder some blit debug markers
- zink: pass cmdbuf to debug marker begin
- zink: remove debug markers for u_blitter ops
- zink: when skipping a TRANSFER_DST image barrier, set access tracking
- zink: add frame trace markers on the queue
- zink: unbind fb on context destroy
- zink: only add deferred barrier on fb unbind when layout needs to change
- zink: fix descriptor update flagging on null ssbo set
- zink: propagate valid_buffer_range when replacing buffer storage
- zink: check for layout updates when unbinding samplerviews
- zink: eliminate pre barrier for adding resource binds
- zink: don't unset existing access when adding resource binds
- zink: explicitly flush src clears when u_blittering
- zink: always set color writes on the unordered cmdbuf
- zink: bind descriptor buffers to unordered cmdbuf
- zink: set dynamic pcp for unordered cmdbuf
- zink: always set batch usage for descriptors after barrier
- llvmpipe: fix LP_PERF=no_depth to ignore depth format
- zink: track the last write access for resources
- zink: add a mechanism for managing TRANSFER_DST buffer barriers
- zink: add a mechanism to trigger copy box resets from batch state reset
- zink: add a driver workaround to disable copy box optimizations
- zink: hook up buffer TRANSFER_DST barrier optimizing
- aux/tc: fix rp info resizing clobbering current info
- vulkan/wsi: fix crash in failed swapchain creation for wayland
- lavapipe: split out spirv compile of shaders
- lavapipe: split out shader struct members into their own struct
- lavapipe: pass shader struct and layout to scan_pipeline_info()
- lavapipe: more small shader struct usage tweaks
- lavapipe: move xfb init to shader struct
- lavapipe: rename inline uniform function params
- lavapipe: move uniform inline functions to shader struct
- lavapipe: break out main shader lowering into separate function
- Revert "Revert "ci: disable mesa-swrast runner jobs""
- zink: ignore renderdoc if ZINK_RENDERDOC isn't in use
- radv: delete radv_graphics_pipeline_compile() asserts
- radv: avoid a huge memset in radv_graphics_pipeline_compile()
- aux/tc: use renderpass tracking to optimize texture_subdata calls
- lavapipe: fix dynamic depth clamping
- lavapipe: set render_condition_enabled=false for vkCmdClearDepthStencilImage
- lavapipe: add command debugging
- lavapipe: beef up LVP_POISON_MEMORY
- ci: fix LVP_POISON_MEMORY usage
- zink: rework zink_resource::valid_buffer_range
- zink: return the unordered state from zink_resource_buffer_transfer_dst_barrier()
- zink: unify image TRANSFER_DST barrier checks
- zink: rename zink_check_transfer_dst_barrier()
- zink: super reorder buffer copies
- tu: don't set startup debug on debug builds
- zink: fix copy box iteration
- glthread: align small buffer uploads to 4 bytes
- zink: fix copy box iteration when adding
- zink: fix copy box merging adjacency
- aux/trace: delete GALLIUM_TRACE_NIR log message
- zink: fix layer check for compressed format surface creation
- zink: track current queue for resources
- zink: remove redundant dmabuf_acquire setting
- zink: use res->queue to auto-handle queue transitions back to gfx queue
- zink: track tc fences better
- zink: add an assert to catch renderpass optimizing bugs
- zink: don't use/update tc rp info while blitting
- zink: reset tc fb info upon splitting a renderpass
- zink: add and use a function for "safely" ending renderpasses
- zink: disable queries when flushing clears from set_fb
- zink: disable tc flush notify with rp optimizing
- zink: trigger oom flushes more aggressively from copy ops
- aux/tc: flag late zs clears as partial clears
- aux/tc: use a local 'deferred' variable in tc_flush()
- aux/tc: fix renderpass splitting on flush
- aux/tc: track the number of active queries
- aux/tc: don't sync for get_sample_position
- aux/tc: add a function to reset rp info
- aux/tc: fix initial rp info allocation
- aux/tc: make some of the rp tracking api private
- aux/tc: rework inter-batch renderpass info handling
- zink: only flag rp info for updating on flush, don't actually update
- zink: add tracing for copy ops
- zink: expand ZINK_DEBUG=sync to cover copy ops
- zink: add some asserts for zs layout in dynamic render
- zink: double check layouts for possible feedback loop images
- zink: end rp earlier in set_framebuffer_state
- zink: add a function for applying u_blitter barriers
- zink: add a dynamic render version of clear_texture hook
- zink: reset fb clears using the clears_enabled mask
- zink: manually apply barriers whenever zink_context::blitting is set
- zink: split out pipeline rp info update function
- zink: add zink_context::unordered_blitting to preserve unordered flags
- zink: implement unordered u_blitter calls
- zink: round geometry for u_blitter debug markers
- zink: catch zs u_blitter ops for draw markers
- zink: add debug markers for draws
- zink: track zsbuf info even when rp optimizing is disabled
- zink: fix dsa state parsing for tc info
- zink: track whether the fb zsbuf is readonly
- zink: add a fixup case for readonly zsbuf clears
- zink: improve no-oping of write -> readonly zsbuf layouts
- zink: don't flag rp layout check on next draw when rp optimizing
- zink: allow zink_is_zsbuf_used() without tc / rp optimizing
- zink: rename add_implicit_color_feedback_loop()
- zink: explicitly eliminate feedback loops for unused zsbufs
- zink: further eliminate zs implicit feedback loops for read-only access
- zink: split out luminance/alpha clear conversion code for reuse
- zink: convert luminance/alpha clear colors in dynamic texture clear
- zink: handle swapchain creation failure less lazily
- zink: simplify resource_check_defer_buffer_barrier()
- zink: delete unused barrier api
- zink: never split a renderpass for a loadop change
- zink: flag some rp ends as unsafe
- zink: add batch refs for framebuffer surfaces on bind and ref update
- zink: fix unordered access for image descriptors
- zink: force unordered_write=false when binding image descriptors
- zink: add an assert to ensure zsbuf invalidation doesn't break rendering
- zink: only run post-fb-unbind layout stuff if the resource isn't being destroyed
- zink: always set sampler layouts when unbinding fb images while rp optimizing
- zink: rework handling of unordered->ordered write buffer barriers
- zink: don't update fbfetch in db mode if inputAttachmentDescriptorSize==0
- zink: add ZINK_DEBUG=flushsync
- zink: track whether zsbuf is unused
- zink: flag rp layout change if zsbuf usedness changes on dsa/fs state bind
- driconf: make glthread=true default for source games
- aux/trace: dump blend states with enums
- aux/trace: fix GALLIUM_TRACE_NIR handling
- zink: fix some type mismatches for c++ compilation
- zink: break out a src region barrier check for reuse
- zink: move all barrier-related functions to c++
- zink: use c++ template to deduplicate all the buffer barrier code
- zink: minor tweaks for image barriers
- zink: use c++ template to deduplicate image barrier functions
- zink: stop leaking separate shader nir
- lavapipe: always copy streamout info when creating shaders
- lavapipe: don't memcpy tess_ccw when copying pipeline library shaders
- lavapipe: refactor shader compile functions to not take pipeline params
- lavapipe: track bound shader stages on rendering_state
- lavapipe: add a device member to rendering_state
- lavapipe: stop using rendering_state::pipeline
- lavapipe: refactor compute shader binding
- lavapipe: merge some loops in handle_graphics_pipeline()
- lavapipe: PIPE_SHADER\_ -> MESA_SHADER\_
- lavapipe: don't access pipeline shader structs as much during bind
- lavapipe: pull out dynamic tess origin check in gfx pipeline bind
- lavapipe: break out all the important parts of gfx pipeline setting for reuse
- lavapipe: delete unused struct member
- lavapipe: refactor pipeline destroy a bit
- lavapipe: add a ref for the tess_ccw nir on creation
- lavapipe: unify lvp_pipeline_nir creation
- lavapipe: dynamically bind noop fs at draw time when needed
- lavapipe: don't double unbind gfx stages on pipeline bind
- lavapipe: split out gfx stage unbinding
- lavapipe: only update shader access for bind/unbind stages
- lavapipe: only unset tess_states pointers on tes bind
- lavapipe: avoid uniformly unsetting gs_output_lines
- lavapipe: move default rasterizer state values to rendering_state init
- mapi: add InternalInvalidateFramebufferAncillaryMESA
- glthread: add _mesa_glthread_invalidate_zsbuf()
- kopper: apply ancillary invalidation through glthread on swapbuffers
- llvmpipe: fix linear fs analysis with nonzero fs outputs
- llvmpipe: fix handling of unused color attachments
- zink: add spirv builder function for terminate
- zink: set src access when rebinding buffers, unset unordered_*
- zink: fix quads emulation gs with array variables
- zink: block resolves where src extents > dst extents
- zink: omit VkPipelineVertexInputStateCreateInfo with dynamic vinput
- zink: flag vertex buffers for rebind after vstate draws
- zink: use search_or_add for masking vstate
- zink: bind vertex state directly from draw hook
- zink: add another vstate draw template for popcnt presence
- zink: explicitly pass null velems when creating pipelines with dynamic vinput
- zink: don't swizzle velems state for vstate draws
- zink: use fast popcnt for vstate draws
- zink: stop caching vertex states
- lavapipe: break out pipeline layout creation for reuse
- lavapipe: implement EXT_shader_object
- lavapipe: advertise EXT_shader_object
- zink: delete shader reordering in assign_io()
- zink: add and populate a shader_info struct to zink_shader
- zink: pass nir_shader to update_so_info()
- zink: generate flat_flags during shader creation
- zink: use zink_shader::info instead of zink_shader::nir::info
- zink: simplify fbfetch output detection from fs
- zink: pass nir directly to zink_shader_tcs_create()
- zink: swap nir pointers when compiling compute shaders
- zink: directly return nir from zink_shader_tcs_create
- zink: streamline nir cloning for assign_io
- zink: store nir as serialized on zink_shader structs
- zink: simplify assign_io() further
- zink: break out nir blob deserializing
- zink: move nir cloning out to callers of zink_shader_compile
- zink: store num_inlinable_uniforms separately for cs programs
- zink: always store nir serialized
- zink: be explicit about separate shader dsl indexing during creation
- zink: rework choose_pdev (again)
- glthread: use id 0 for internal buffer objects
- radv: fix leak of nir from retained shaders
- zink: don't try copying multiple results for conditional render copy
- zink: more explicitly track/check rp optimizing per-context
- zink: don't access non_fs part of zink_shader from fs
- zink: reuse d3d12 variable copying to make passthrough gs more robust
- zink: reuse copy_vars for generated tcs
- zink: don't trigger shader variants on pcp change if driver supports dynamic pcp
- Revert "zink: don't trigger shader variants on pcp change if driver supports dynamic pcp"
- zink: try to prune resources from barrier jit on fb unbind
- lavapipe: copy fragment shader when merging GPL pipelines
- lavapipe: refactor/consolidate GPL shader copying
- lavapipe: don't double-inline ubo0
- lavapipe: implement inline variant caching
- zink: block oom flushes during unordered blits
- zink: unroll array loop when copying vars for passthrough shaders
- zink: free GPL input/output libs on context destroy to avoid leaking
- zink: fix GPL lib leaking
- zink: remove redundant 'blitting' check in zink_prep_fb_attachment()
- zink: break out feedback loop pipeline state flagging for reuse
- zink: pre-convert attachment id to attachment idx
- zink: eliminate implicit feedback loops on rp begin
- zink: track per-image swapchain layouts
- zink: handle swapchain handoffs around makecurrent
- zink: remove a fixed validation error for ci
- mesa/st/program: don't init xfb info if there are no outputs
- zink: remove atomics from zink_query
- zink: pass ctx through query destroy paths
- zink: always defer query pool deletion
- zink: move memoryTypeIndex selection down in general bo allocation
- zink: slightly rework memoryTypeIndex selection to pre-determine heap
- zink: restore BAR allocation failure demotion
- zink: make general bo allocation more robust by iterating
- zink: avoid zero-sized memcmp for descriptor layouts
- iris: use util_framebuffer_get_num_samples when setting ps dispatch samples
- zink: manually re-set framebuffer after msrtss replicate blit
- zink: handle 'blitting' flag better in msrtss replication
- zink: skip msrtss replicate if the attachment will be full-cleared
- zink: avoid recursion during msrtss blits from flushing clears
- nir/lower_alpha_test: rzalloc state slots
- zink: fix non-db bindless texture buffers
- zink: emit demote cap when using demote
- zink: only print copy box warning once per resource
- util/debug: move null checks out of debug message macro
- zink: don't bitcast bool deref loads/stores
- drisw: don't leak the winsys
- zink: check for extendedDynamicState3DepthClipNegativeOneToOne for ds3 support
- draw: fix viewmask iterating
- zink: don't pin flush queue threads if no threads exist
- zink: add z32s8 as mandatory GL3.0 profile attachment format
- nir/gs: fix array type copying for passthrough gs
- zink: fix array copying in pv lowering
- gallivm: break out native vector width calc for reuse
- llvmpipe: do late init for llvm builder
- zink: break out VkImageViewUsageCreateInfo applying for reuse
- zink: reapply VkImageViewUsageCreateInfo when rebinding a surface
- draw: fix robust ubo size calc
- llvmpipe: fix native vector width init
- zink: add extendedDynamicState3DepthClipNegativeOneToOne to profile
- zink: only unset a generated tcs if the bound tcs is the generated one
- zink: set depth dynamic state values unconditionally
- zink: null some descriptor buffer pointers during destruction
- zink: sync queries at the end of cmdbufs
- cso: unbind fb state when unbinding the context
- i915: use util_copy_framebuffer_state to set fb state
- i915: use util_unreference_framebuffer_state to unref fb state
- iris: use util_unreference_framebuffer_state to unref fb state
- softpipe: use util_unreference_framebuffer_state to unref fb state
- v3d: use util_unreference_framebuffer_state to unref fb state
- vc4: use util_unreference_framebuffer_state to unref fb state
- llvmpipe: use util_unreference_framebuffer_state to unref fb state
- svga: use util_unreference_framebuffer_state to unref fb state
- zink: don't init mutable resource bit for swapchain images
- zink: don't init mutable for swapchain src during blit
- zink: allow vk 1.2 timelineSemaphore feature if extension isn't supported
- zink: stringify unsupported prim restart log error
- zink: delete persistent map tracking
- zink: add PERSISTENT for db buffer maps
- zink: delete unnecessary pipeline stage flags from inference
- zink: use an intermediate variable for binding ssbo slots
- zink: unbind the ssbo slot being iterated, not the index of the buffer
- zink: flush INDIRECT_BUFFER mem barrier for compute
- zink: disable batched unordered barries with ZINK_DEBUG=noreorder
- zink: block batching of unordered barriers if previous usage was write
- zink: fix uncached memory readback
- glsl/lower_samplers_as_deref: apply bindings for unused samplers
- zink: bind bindless db set when updating separate shader db sets
- zink: compare desc set to detect bindless vars in separate shaders
- zink: adjust bindless texel buffer handle before indexing
- zink: block more flushes during unordered blits
- zink: also cache swapchain semaphores

Mohamed Ahmed (3):

- vulkan/runtime: move common buffer related entrypoints to vk_buffer.c
- vulkan/runtime: implement vkGetBufferMemoryRequirements2()
- anv: remove GetBufferMemoryRequirements2()

Nanley Chery (16):

- docs: Document the implicit barriers around blits
- glsl: Add compute shaders to encode DXT5/BC3
- glsl: Modify the #includes in the DXT5 shaders
- mesa: Create _mesa_CreateShaderProgramv_impl
- mesa/st: Add get_compute_program
- mesa/st: Add and use create_bc1_endpoint_ssbo
- mesa/st: Add st_compute_transcode_astc_to_dxt5
- mesa/st: Add st_texture_image_resource_level
- mesa/st: Enable compute-based transcoding to DXT5
- mesa/st: Measure compressed fallback unmap paths
- iris: Update comment in iris_cache_flush_for_render
- iris: Flush caches for aux-mode changes more often
- iris: Drop iris_cache_flush_for_render
- iris: Allocate ZEROED BOs for shared resources
- iris/bufmgr: Add and use zero_bo
- iris/bufmgr: Handle flat_ccs for BO_ALLOC_ZEROED

Nataraj Deshpande (1):

- anv: Bump VkDeviceMemory objects limit to 4GB

Neha Bhende (1):

- docs: Add GL 4.3 support info in mesa docs

Nicolas Dufresne (1):

- util/format: Fix wrong colors when importing YUYV and UYVY

Nicolas F (1):

- driconf: remove the adaptive sync special case for mpv

Oleksii Bozhenko (5):

- glsl: fix gl_CullDistance lowering from float[8] to vec4[2]
- ci: Uprev Piglit
- Move combining clip and cull optimization before linking
- wsi: add rgb_component_bits_are_equal
- wsi: remove get_sorted_vk_formats duplication

Patrick Lerda (25):

- lima: fix memory leak related to u_transfer_helper_create()
- mesa/program: fix memory leak triggered by parser errors
- mesa/st: fix possible crash related to arb invalid memory access
- r600: fix shader blob memory leak
- vbo/save: fix possible crash related to fixup_vertex()
- mesa/shaderapi: fix path memory leak
- mesa/framebuffer: fix gl_framebuffer.resolve refcnt imbalance
- mesa/program: fix memory leak triggered by invalid extended swizzle selector
- mesa/program: fix memory leak triggered by multiple targets used on one texture image unit
- mesa/program: fix memory leak triggered by arb alias
- radeonsi: fix memory leak related to ureg_get_tokens()
- glx: fix memory leak related to __glXCloseDisplay()
- r600: fix refcnt imbalance related to shader
- intel: fix memory leak related to brw_nir_create_passthrough_tcs()
- r600: fix typo that could lead to a possible crash
- egl: fix memory leak related to _eglRefreshDeviceList()
- r600: fix refcnt imbalance related to r600_set_vertex_buffers()
- r600: fix refcnt imbalance related to evergreen_set_shader_images()
- lima: fix refcnt imbalance related to framebuffer
- r600/sfn: fix memory leak related to sh_info->arrays
- aux/draw: fix memory leak related to ureg_get_tokens()
- crocus: fix refcnt imbalance related to framebuffer
- crocus: fix refcnt imbalance related to crocus_create_surface()
- r600: fix refcnt imbalance related to atomic_buffer_state
- radeonsi: set proper drm_amdgpu_cs_chunk_fence alignment

Paul Gofman (1):

- driconf: add a workaround for Kaiju-A-Gogo

Paulo Zanoni (8):

- anv: don't leave undefined values in exec->syncobj_values
- anv: check the return value of anv_execbuf_add_bo_bitset()
- anv: run buf_finish() if add_bo() fails during execute_simple_batch()
- anv: rename anv_execbuf->array_length to bo_array_length
- anv: use vk_realloc for the anv_execbuf arrays
- hasvk: don't leave undefined values in exec->syncobj_values
- hasvk: check the return value of anv_execbuf_add_bo_bitset()
- anv: there's no need to set exec_obj offsets twice

Pavel Ondračka (16):

- nir/lower_bool: ntt: Generate a good opcode for bcsel
- r300: update rv515 ci failures list
- r300: skip sin/cos input range transformation for nine and ntt
- r300: remove backend input range transformation for sin and cos
- ntt: pass ubo_vec4_max nir_opt_offsets flag through ntt options
- r300: set ubo_vec4_max ntt option properly
- r300: remove backend negative addressing emulation
- nir: nir opt_shrink_vectors whitespace fix
- nir: mark progress when removing trailing unused alu channels
- nir: mark progress when removing trailing unused load_const channels
- r300: set register file to none if swizzles are constant only
- nir: shrink phi nodes in nir_opt_shrink_vectors
- r300: drop VDPAU support
- r300: simplify KILL transformation
- nine: use separate register for aL emulation
- r300: fix unconditional KIL on R300/R400

Pedro J. Estébanez (4):

- spirv_to_dxil: Unify spirv_to_nir_options
- spirv2dxil: Split read-only image as SRV logic into declared and inferred
- spirv: Assume input attachments are read-only
- Revert "microsoft/compiler: Use SRVs for read-only images"

Philip Langdale (1):

- radeonsi: correctly declare YUV420_10 RT Format support for AV1

Philipp Zabel (2):

- vulkan/wsi/wayland: fix acquire_next_image to report timeouts properly
- zink: fix build with -Dvulkan-beta=true

Pierre-Eric Pelloux-Prayer (37):

- radeonsi: simplify dpbb settings
- ac/info: move pci bus info in a struct
- ac: add ac_query_pci_bus_info helper
- ac: don't call ac_query_pci_bus_info from ac_query_gpu_info
- radeonsi/sqtt: don't read results for disabled SEs
- radeonsi/sqtt: disable SE1+ on GFX11
- radeonsi/sqtt: update registers for gfx11
- radeonsi/sqtt: implement offset workaround for gfx11
- vbo: remove bogus assert
- vbo: lower VBO_SAVE_BUFFER_SIZE to avoid large VRAM usage
- glthread: fix glArrayElement handling
- drm-uapi/dma-buf.h: use __u32/__u64 types
- winsys/amdgpu: use DMA_BUF_SET_NAME_B if available
- radeonsi/gfx11: clamp PRIM_GRP_SIZE
- radeonsi/gfx11: fix ge_cntl programming
- amd/surface: fix base_mip_width of subsampled formats
- winsys/amdgpu: use amdgpu_device_get_fd
- radeonsi/video: use specific PIPE_BIND\_ value for video buffers
- radeonsi: fix incorrect vgpr indices in the ps_prolog
- radeonsi/test: use gbm-skips.txt
- radeonsi/test: update test results
- radeonsi: don't use PKT3_SET_SH_REG_INDEX on gfx9 and older
- radeonsi: fix fast depth_clear_value/stencil_clear_value
- yegl/wayland: fix glthread deadlocks
- Revert "driconf: add a workaround for plasmashell freezing"
- ac/llvm: fix build with LLVM 17
- mesa: fix CopyImageSubDataOES with GL_TEXTURE_EXTERNAL_OES
- amd/surface: rename metadata functions
- ac/surface: introduce umd metadata v2
- radeonsi: add AMD_DEBUG=extra_md
- radeonsi: don't use si_decompress_dcc if the blitter is running
- radv: add RADV_DEBUG=extra_md
- radeonsi: don't use alignment_log2 of imported buffers
- mesa: fix invalid index_bo refcounting
- util/vbuf: clarify indirect draws handling
- util/vbuf: fix index_bo leak
- radeonsi: update test results

Pino Toscano (1):

- symbols-check: support OSes based on GNU toolchain

Qiang Yu (78):

- radeonsi: implement nir_load_ring_gsvs_amd
- radeonsi: implement nir_load_ring_gs2vs_offset_amd
- radeonsi: lower nir streamout intrinsics in abi
- radeonsi: use nir_print_xfb_info to replace si_dump_streamout
- radeonsi: use ac_nir_lower_legacy_vs to replace si_llvm_vs_build_end
- radeonsi: add nir implementation of gs copy shader generation
- radeonsi: build legacy gs output info when shader compile
- radeonsi: replace llvm gs copy shader generation with nir
- radeonsi: remove llvm gs copy shader generate
- radeonsi: replace llvm legacy gs code with nir lowering
- radeonsi: move gfx10_ngg_export_vertex to si_shader_llvm.c
- gallium/aux: remove nir_helpers
- nir/xfb_info: nir_gather_xfb_info_from_intrinsics update nir xfb_info
- radeonsi: update nir xfb info after medium io lowering
- nir: add nir_export_amd intrinsic
- ac/llvm: implement nir_export_amd
- aco: implement nir_export_amd
- ac/nir: gs and nogs use ac_nir_export_primitive
- ac/nir: add ac_nir_export_position
- ac/nir: add ac_nir_export_parameter
- ac/nir: add force_vrs to ac_nir_export_position
- amd,radeonsi: implement nir_load_force_vrs_rates_amd in driver abi
- radeonsi: clamp vertex color in legacy gs instead of gs copy shader
- radeonsi: update outputs written nir info
- radeonsi: remove the extra handling for VS/TES primitive id
- radeonsi: set nr_pos_exports outside of llvm translation
- ac/nir,radv,radeonsi: legacy vs use ac_nir_export_(position|parameter)
- ac/nir,radv,radeonsi: gs copy shader use ac_nir_export_(position|parameter)
- ac/nir/ngg: fix clip dist culling mask uninitialized
- ac/nir/ngg: change clipdist_neg_mask_var type to uint32
- ac/nir/ngg,radv,radeonsi: nogs use ac_nir_export_(position|parameter)
- ac/nir/ngg: prepare gather_vs_outputs to be used by gs
- ac/nir/ngg: gs use ac_nir_export_(position|parameter)
- ac/nir/ngg,radv: ms use ac_nir_export_(primitive|position|parameter)
- nir,ac/llvm,aco: remove nir_export_primitive_amd
- nir,ac/llvm,aco,radv,radeonsi: remove nir_export_vertex_amd
- aco: remove early_rast wait insert
- radv: move radv_consider_force_vrs above radv_fill_shader_info
- radv: use amd common force_vrs option
- ac/llvm,radeonsi: lower nir_load_barycentric_at_sample in abi
- radeonsi: add num_component param to load_internal_binding
- ac/llvm,radeonsi: lower fbfetch in abi
- radeonsi: only init llvm output when needed.
- ac/llvm: only init outputs when fragment shader for radv
- aco: only ls and ps use store output now
- aco, radv: Add load_grid_size_from_user_sgpr to aco options.
- aco, radv: Move is_trap_handler_shader to aco info.
- ac/nir: move store_var_components to common place
- ac/nir: tcs write tess factor support pass by reg
- ac/nir: init tess factor location with IO remap
- ac/nir: handle tess factor output missing case
- ac/llvm,radeonsi: lower nir_load_ring_tess_factors_amd
- radeonsi: lower nir_load_ring_tess_factors_offset_amd
- radeonsi: monolithic TCS emit tessfactor in nir directly
- ac/llvm: respect channel_type when ac_build_buffer_load
- ac/llvm: add missing type convert for nir_load_buffer_amd
- nir: pack_(s|u)norm_2x16 support float16 as input
- ac/llvm: implement float16 nir_op_pack_(s|u)norm_2x16
- aco: implement float16 nir_op_pack_(s|u)norm_2x16
- nir,radeonsi: add and implement nir_load_alpha_reference_amd
- nir: add nir_fisnan helper function
- ac/nir: add ac_nir_lower_ps
- radeonsi: monolithic PS emit epilog in nir directly
- radeonsi: expose si_nir_load_internal_binding
- ac/nir: add ac_nir_load_arg_at_offset
- radeonsi: add si_nir_lower_vs_inputs
- ac/llvm: vs_rel_patch_id can also be fixed up
- ac/llvm: move ac_fixup_ls_hs_input_vgprs to amd common
- radeonsi: monolithic VS emit prolog in nir directly
- ac/llvm,radeonsi: remove abi->load_inputs implementation
- ac/llvm: remove ac_build_opencoded_load_format
- radeonsi: fix max scrach lds size calculation when ngg
- ac/nir/ngg: fix gs culling vertex liveness check for odd vertices
- ac/nir/ngg: fix store shared alignment
- ac/llvm: remove some unused code replaced by nir
- ac,aco: move gfx10 ngg prim count zero workaround to nir
- aco: fix nir_f2u64 translation
- ac/nir/cull: fix line position w culling

Raun (2):

- dzn: Enable VK_KHR_bind_memory2
- dzn: Enable VK_KHR_get_memory_requirements2

Rhys Perry (48):

- radv: implement GS load_ring_gsvs_amd/load_ring_gs2vs_offset_amd
- radv,aco: use ac_nir_lower_legacy_gs
- aco: restore semantic_can_reorder for GS output stores
- ac/nir: use store_buffer_amd's base index
- ac/llvm: add support for fp32 addition atomics
- aco: add support for fp32 addition atomics
- radv: load ssbo_atomic_fadd descriptor
- radv/gfx11: expose shaderBufferFloat32AtomicAdd
- aco/tests: fix assembler.gfx11.vop12c_v128 with LLVM 15
- aco/tests: update assembler tests for latest LLVM 16
- radv: skip creation of null TLAS for null winsys
- aco: set has_color_exports with GPL
- aco: end reduce tmp after control flow, when used within control flow
- aco/tests: add setup_reduce_temp.divergent_if_phi
- aco/spill: always end spill vgpr after control flow
- aco: limit VALUPartialForwardingHazard search
- radv: set state.vbo_misaligned_mask_invalid in radv_bind_vs_input_state
- ac: move ring_offsets to ac_shader_args
- ac/llvm: let ring_offsets be accessed like a normal arg
- radv/llvm: use the ring_offsets shader arg
- aco: fix out-of-bounds access when moving s_mem(real)time across SMEM
- aco: don't modify exec in p_interp_gfx11
- aco: don't apply modifiers through DPP to unsupported instructions
- aco: fix pathological case in LdsDirectVALUHazard
- aco: always update orig_names in get_reg_phi()
- radv: remove is_internal pipeline creation parameter
- aco/tests: add tests for v_fma_f32 with 2 fp16 literals
- aco: make IDSet sparse
- nir/range_analysis: fix vectorized phis and intrinsics
- nir: use xyzw order for precise fdot
- nir: make fdph lowering match fdot
- nir: add nir_lower_alu_width_test.fdot_order
- aco/gfx11: fix RT prolog scratch initialization
- aco: set needs_flat_scr=true for RT
- util/dynarray: allow an initial stack allocation to be used
- nir/range_analysis: add missing masking of shift amounts
- nir/range_analysis: add helpers for limiting stack usage
- nir/range_analysis: use perform_analysis() in nir_unsigned_upper_bound()
- nir/range_analysis: use perform_analysis() in nir_analyze_range()
- radv: fix setting radv_shader_info::user_data_0 with rt
- aco: don't optimize s_or_b64(v_cmp_u_f32(a, b), cmp(a, a))
- aco: fix nir_var_shader_out barriers for task shaders
- radv/gfx11: improve RT scratch allocation
- nir: make nir_fisnan helper exact
- aco: remove SMEM_instruction::prevent_overflow
- ac/nir/ps: fix null export write mask miss set to 0xf
- aco: don't move exec reads around exec writes
- aco: don't move exec writes around exec writes

Rob Clark (180):

- freedreno/ci: Switch a630 jobs over to manual
- freedreno/ci: Cleanup a618 yaml
- freedreno/ci: Add a618 egl/skqp/piglit jobs
- Revert "freedreno/ci: Switch also performance a630 job to manual"
- Revert "freedreno/ci: Switch a630 jobs over to manual"
- freedreno/ci: Add an a618 flake
- freedreno/drm: Remove assert
- freedreno: Fix tracking of enabled SSBOs
- freedreno/a6xx: Workaround for no pos/psize
- freedreno: Don't re-install a flushed batch
- freedreno/a6xx: Rework barrier handling
- freedreno/ir3: Stop copying options
- freedreno/ir3: Let driver specify fb-read descriptor
- freedreno: Track image/SSBO usage for all stages
- freedreno/ir3: Add descriptor set lowering
- freedreno/a6xx: Pre-bake IBO descriptor sets
- freedreno/a6xx: Add bindless state
- freedreno/a6xx: Switch over to bindless IBO
- freedreno/a6xx: Remove bindfull IBO state
- freedreno/a6xx: Removing munging of tex state for IBO
- freedreno/a6xx: Remove tex fb_read state
- freedreno/a6xx: Move compute to tex state group
- freedreno/a6xx: Move tex state building
- freedreno/a6xx: Expose SSBO/image for all shader stages
- freedreno: Restore GL_VENDOR string
- gallium/util: Add util_writes_depth() helper
- freedreno/a6xx: Add LRZ perf warn for ztest direction changes
- freedreno/a6xx: Invalidate LRZ on blend+depthwrite
- turnip: Rename lrz force_disable_mask
- turnip: Invalidate LRZ on blend+depthwrite
- util/xmlconfig: Use os_get_option()
- freedreno: Add driconf to disable conservative LRZ
- freedreno/a6xx: Add a few kernel regs/etc
- freedreno/drm: Add some ref/unref debugging
- freedreno/drm: Detect zombie BOs
- freedreno/drm: Remove bo_del_or_recycle()
- freedreno/drm: Split out bo->finalize()
- freedreno/drm: Synchronize handle close and lookup
- freedreno/drm/virtio: Flush before CREATE_BLOB
- freedreno/drm: Restart import on zombie race
- freedreno/gmem: Fix for partial z/s fast-clear
- freedreno/decode: Increase size of offsets table
- freedreno/a6xx: LRZ for MSAA
- freedreno/ir3: Scalarize load_ssbo
- freedreno/a6xx: Add missing CS_BINDLESS mapping
- freedreno/a6xx: Add CS instrlen workaround
- freedreno: nondraw-batch
- freedreno: Skip flush_resource with explicit sync
- freedreno/a6xx: Don't double-write SP_CS_OBJ_START
- freedreno: Don't open-code setting dirty CS state
- freedreno/a6xx: Make shader state independent of grid info
- freedreno/a6xx: Also FLUSH_CACHE on image barrier
- freedreno/a6xx: Remove excess CS flushing
- freedreno+ir3: Move storage_16bit to compiler options
- freedreno/a6xx: Move CS state to PROG state group
- freedreno/drm: Move sa_cpu_prep() to core
- freedreno/drm/virtio: Limit guest handles passed to virtgpu
- ir3: Quiet unused variable warning
- freedreno: Quiet unused variable warnings
- freedreno/a2xx: Move pack_rgba()
- freedreno: Indent fixes
- freedreno/a6xx: Move num_driver_params to program state
- freedreno: Move num_vertices calc to backend
- freedreno: Remove impossible NULL check
- freedreno: Add FD_DIRTY_QUERY
- freedreno: Avoid screen lock when no rsc tracking needed
- freedreno: Account for multi-draw in num_draws
-
freedreno: Push num_draws down to backend 4510- freedreno/a6xx: Drop unused return 4511- freedreno/a6xx: Split out flush_streamout() helper 4512- freedreno/a6xx: Multi-draw support 4513- freedreno/a6xx: Do tex-state invalidates in same ctx 4514- freedreno/drm: Make rb refcnt non-atomic 4515- freedreno/a6xx: Remove tex-state refcnting 4516- freedreno: Move blend out of dirty-rsc tracking 4517- freedreno: Move FD_MESA_DEBUG cases out of draw_vbo 4518- freedreno/a6xx: Pre-compute PROG related LRZ state 4519- freedreno: Avoid taking screen lock 4520- freedreno/batch: Stop tracking cross-context deps 4521- freedreno: Drop batch lock 4522- freedreno: Add seqno helper 4523- freedreno/drm: Optimize stateobj re-emit 4524- freedreno/a6xx: Move rsc seqno out of tex cache key 4525- freedreno/a6xx: Fix set_sampler_views(start != 0) 4526- freedreno/a6xx: Drop unneed fd6_texture_state() arg 4527- freedreno/a6xx: Fix sampler view rsc_seqno for X32_S8X24 4528- freedreno/a6xx: Add a way to assert valid format 4529- freedreno/a6xx: Remove needs_invalidate flag 4530- freedreno/a6xx: Small cleanup 4531- freedreno/a6xx: Static-ify sampler_view_update() 4532- freedreno/a6xx: Fix view_seqno in tex cache key 4533- freedreno/cffdec: Fix unitialized count for pkt2 4534- freedreno/cffdec: Add helper to find next pkt 4535- freedreno/cffdec: Add helper to parse CP_INDIRECT_BUFFER 4536- freedreno/cffdec: Fix hang location detection 4537- freedreno/crashdec: Refactor crashdec tests 4538- freedreno/crashdec: Add another prefetch test 4539- freedreno/crashdec: Handle multi-IB prefetching 4540- freedreno/crashdec: Disable GALLIUM_DUMP_CPU 4541- vk/runtime: Allow enumerate and try_create_for_drm to coexist 4542- turnip: Move things to prep for multi-kernel support 4543- turnip: drm code-motion 4544- turnip: Split out vfuncs for kernel interface 4545- turnip: Allow knl backend specific entrypoints 4546- turnip: Move QueueWaitIdle entrypoint to kgsl 4547- turnip: Handle kgsl vs drm specifics at runtime 
4548- turnip: Refactor device loading 4549- tu+meson: Re-work KMD selection 4550- tu/kgsl: Propagate tu_physical_device_init() errors 4551- Revert "freedreno: Account for multi-draw in num_draws" 4552- freedreno/a6xx: Namespace reg/pkt packer vars 4553- freedreno/a6xx: Convert blitter to OUT_REG() 4554- freedreno/a6xx: Fix mirror x/y blits 4555- util: Add a simple no-op libdrm shim 4556- turnip: Use libdrm shim 4557- loader: Use libdrm shim 4558- vk/runtime: Use libdrm shim 4559- freedreno/common: Replace or_mask() with BitsetEnum<T> 4560- freedreno: Promote non-drawing batches to sysmem 4561- freedreno: Nerf strict-aliasing warning for all of gcc 4562- freedreno/registers: Schema validation for gen_header.py 4563- freedreno/registers: Add regs for a690 4564- freedreno: Quiet c++ warning about designated initializers 4565- freedreno/ir3: Un-inline enums 4566- freedreno/ir3: Don't use negative opc for meta instructions 4567- freedreno/ir3: c++-proof the headers 4568- freedreno/ir3+tu: Calculate subgroup size in ir3 4569- freedreno/ir3: Add missing driver params 4570- freedreno: Un-inline buffer-mask enum 4571- freedreno: c++-proofing 4572- freedreno/a6xx: Rework texture_clear fallback 4573- freedreno/a6xx: Add missing "inline" 4574- freedreno/a6xx: Fix designator initializer order 4575- freedreno/a6xx: Convert to c++ 4576- freedreno/registers: Fix designator order 4577- freedreno/registers: Add prefix="variant" 4578- freedreno/registers: Merge a6xx and a7xx regs 4579- freedreno/registers: Start adding a7xx pipe/control regs 4580- freedreno/decode: Start adding a7xx support 4581- freedreno/registers: Start adding stuff for a7xx 4582- freedreno/registers: Track varset 4583- freedreno/registers: Split out regpair builder helper 4584- freedreno/registers: Add c++ magic for register variants 4585- freedreno/registers: Fix nameless fields 4586- freedreno/registers: Define rest of CP_REG_WRITE 4587- freedreno/a6xx: Simplify iova emit 4588- mesa: Rework 
discard_framebuffer() 4589- driconf: Add ignore_discard_framebuffer option 4590- driconf: Work around incorrect GI discard/invalidate 4591- freedreno: Specify GMEM tile alignment per GPU 4592- freedreno+tu: Big GMEM support 4593- freedreno+tu: Add a690 support 4594- freedreno/a6xx: Restore mode 4595- freedreno/rnn: Fix reg names for regs with variants 4596- freedreno/afuc: Add raw mode for disasm 4597- freedreno/registers: Add control reg for zap fw base 4598- Revert "CI: Disable freedreno" 4599- dri2/android: Bypass throttling 4600- freedreno/drm: Fast path for idle check 4601- freedreno/drm: Stop cleanup at first active BO 4602- mesa: Add a few more function traces 4603- freedreno/drm: Make threaded-submit optional 4604- freedreno/drm: Disable threaded-submit for msm 4605- freedreno: Optimize repeated finishes 4606- freedreno: Stop being too clever by half 4607- freedreno: Hoist dirty vars 4608- freedreno: Extract out a helper 4609- freedreno: Inline single-use helpers 4610- freedreno: Re-work dirty-resource tracking 4611- freedreno: Avoid looping shader stages if nothing dirty 4612- freedreno: Move driconf settings into sub-struct 4613- freedreno: Support the disable_throttling=true driconf option 4614- util/disk_cache: Split out queue initialization 4615- util/disk_cache: Add NONE type 4616- util/disk_cache: Use queue state to skip put 4617- util/disk_cache: Move blob_put_cb to the async queue 4618- freedreno/a6xx: Allow z24s8 format casts 4619- freedreno/a6xx: Fix valid_format_cast logic for newer a6xx 4620- freedreno: Fix resource tracking vs rebind/invalidate 4621- dri/android: Fix MSAA resolve 4622 4623Rohan Garg (9): 4624 4625- iris: Don't flush the render cache for a compute batch 4626- anv: drop unused headers 4627- anv: reuse the VK_IMAGE_ASPECT_PLANES_BITS_ANV macro 4628- isl: fix some documentation 4629- anv/blorp: use existing function to convert the op to a string 4630- anv: break out of the loop when the first color attachment is found 4631- 
anv,hasvk: cleanup unused enum 4632- intel/genxml: Add the preferred slm size enum for gen125 4633- anv,blorp,iris: Set PreferredSLMAllocationSize on gfx125+ 4634 4635Roland Scheidegger (2): 4636 4637- llvmpipe: only use accurate_a0 hack if there are no textures bound 4638- lavapipe, nir: Fix wrong array index scaling in nir_collect_src_uniforms 4639 4640Rose Hudson (4): 4641 4642- radeonsi: report 0 block size for Polaris HEVC encoding 4643- asahi: wire up shader disk cache support 4644- agx: isolate compiler debug flags 4645- asahi: disable disk cache in debug runs 4646 4647Ruijing Dong (5): 4648 4649- frontends/va: revert commit 0b02db30 4650- raseonsi/vcn: fix a h264 decoding issue 4651- frontends/va: disable skip_frame_enable in vaapi interface. 4652- radeonsi/vcn: correct cropping for hevc case 4653- radeonsi/vcn: fix decoding bs buffer alignement issue. 4654 4655Ryan Neph (17): 4656 4657- ci: fix directory existence racing in parallel test execution 4658- util/u_process: add MESA_PROCESS_NAME override to util_get_process_name() 4659- util/u_process: remove util_get_process_name_may_override() 4660- util/xmlconfig: add MESA_DRICONF_EXECUTABLE_OVERRIDE 4661- venus: update venus-protocol headers to partially fix WA1 4662- venus: temporarily redirect VkDrmFormatModifierPropertiesListEXT to "2" variant 4663- ci: uprev virglrenderer 4664- venus: update venus-protocol headers to fix WA1 4665- Revert "venus: temporarily redirect VkDrmFormatModifierPropertiesListEXT to "2" variant" 4666- venus: add vn_relax_init/_fini() 4667- venus: set/check ring status bits independently 4668- venus: init exp features before ring init again 4669- venus: update to latest protocol for ringMonitoring 4670- venus: check and configure new ringMonitoring feature 4671- venus: re-use VN_DEBUG_NO_ABORT to disable ring monitoring abort() 4672- virgl: hook new get_fd proc for drm winsys 4673- i915: hook new get_fd proc for drm winsys 4674 4675Sagar Ghuge (10): 4676 4677- iris: Stop marking 
context unconditionally as guilty 4678- intel/fs: Always stall between the fences on Gen11+ 4679- nir: Handle other variants of image_samples properly while lowering 4680- intel/compiler: Add swsb_stall debug option 4681- anv: Implement Wa_14015297576 4682- iris: Implement Wa_14015297576 4683- intel/compiler: Add Wa_14014063774 for slm_fence 4684- intel/decoder: Bump the binding table guess value to 32 4685- anv: Drop unused param from add_surface_reloc 4686- anv: Drop dead code that sets the L3BypassDisable field 4687 4688Sai Teja Pottumuttu (2): 4689 4690- iris: Fix to release BO immediately if not busy 4691- anv: Fix stride mismatch in mesa and minigbm 4692 4693Sajeesh Sidharthan (3): 4694 4695- radeonsi/vcn: disable fence for JPEG decoding 4696- radeonsi/vcn: set bitstream buffer size to encoded bitstream size 4697- radeonsi/vcn: optimize bitstream buffer resize logic 4698 4699Sam Edwards (1): 4700 4701- nouveau: Fix null dereference in nouveau_pushbuf_destroy 4702 4703Samuel Iglesias Gonsálvez (1): 4704 4705- docs/developers: Add Igalia as Mesa consultancy 4706 4707Samuel Pitoiset (279): 4708 4709- radv: fix missing implementation of creating images from swapchains 4710- radv: fix hashing pipeline keys if RADV_PERFTEST=ngg_streamout is used 4711- radv: fix re-emitting RB+ when the non-compacted color format changes 4712- ac/nir: clear unused components before storing XFB outputs to LDS 4713- ac: add TC_OP_ATOMIC_SUB_32 4714- radv: fix setting MAX_MIP for BC views 4715- radv: fix buffer to image copies with BC views on the graphics queue 4716- radv: fix creating BC image views when the base layer is > 0 4717- radv: rename ac_surf_nbc_view::max_mip to num_levels 4718- radv: move some color blend helpers to radv_private.h 4719- radv: add a new helper for normalizing blend factors 4720- radv: add support for dynamic blend equation 4721- radv: enable compiling PS epilogs on-demand for dynamic color blend equations 4722- radv: fix detecting that blend is enabled 
when all CB states are dynamic 4723- radv: advertise extendedDynamicState3ColorBlendEquation 4724- radv: remove an old FIXME about a possible bug with TC-compat HTILE 4725- radv/winsys: fix incorrect PCIID for GFX11 in the null winsys 4726- radv: print depth image size with RADV_DEBUG=img 4727- radv: fix RADV_DEBUG=hang with multiple cmdbuffer per submission 4728- radv/winsys: prefix all error messages with RADV 4729- radv: fix creating libraries with PS epilog and all CB states as dynamic 4730- radv: fix ignoring graphics shader stages that don't need to be imported 4731- radv: add a layer for fixing rendering issues with RAGE2 4732- radv: simplify VK_PIPELINE_CREATE_FAIL_ON_PIPELINE_COMPILE_REQUIRED 4733- radv: pass the number of stages to radv_hash_shaders() 4734- radv: split radv_create_shaders() between graphics and compute shaders 4735- radv: rename radv_create_shaders() to radv_graphics_pipeline_compile() 4736- radv: ignore all CB dynamic states when there is no color attachments 4737- radv: regroup dynamic states initialization 4738- radv: only initialize non-zero values for the default dynamic state 4739- radv: stop setting INTERPOLATE_COMP_Z 4740- radv: fix RB+ for SRGB formats 4741- radv: adjust ACCUM tessellation fields on GFX11+ 4742- radv: fix GPL fast-linking with libs that have retained NIR shaders 4743- radv: skip shaders cache for fast-linked pipelines with GPL 4744- radv: remove useless check about CS in radv_lower_io() 4745- radv: simplify pipeline_has_ngg during graphics shaders compilation 4746- radv: add helpers for capturing shaders and statistics 4747- radv: pass radv_graphics_pipeline to radv_graphics_pipeline_compile() 4748- radv: move retained shaders info to radv_graphics_pipeline 4749- radv: pass radv_compute_pipeline to radv_compute_pipeline_compile() 4750- radv: pass pCreateInfo to radv_graphics_pipeline_compile() 4751- radv: optimize radv_pipeline_layout_add_set() slightly 4752- radv: remove redundant zero initialization of pipeline 
layout 4753- radv: remove radv_pipeline_stage::spirv::sha1 4754- radv: allow to create a noop FS in a library with GPL 4755- radv: remove one unused variable in radv_graphics_lib_pipeline_init() 4756- radv: pass the lib flags for generating the pipeline key 4757- radv: return a boolean value in radv_pipeline_needs_dynamic_ps_epilog() 4758- radv: stop using the graphics pipeline key after compilation 4759- radv: determine the last VGT API stage earlier 4760- radv: skip compilation when possible with GPL fast-linking 4761- radv: simplify an assertion after considering RADV_FORCE_VRS 4762- radv: do not insert fast-linked libraries to the shaders cache 4763- radv: fix skipping graphics pipeline compilation when the FS is NULL 4764- radv: cleanup graphics pipeline library flags uses 4765- radv: simplify determining when the fragment shader needs an epilog 4766- radv: regroup PS epilog info when generating the graphics pipeline key 4767- radv: fix disabling MRT compaction for on-demand PS epilogs 4768- radv: make sure to disable MRT compaction when compiling a PS epilog with GPL 4769- radv: simplify creating a FS epilog from a library 4770- radv: stop skipping the cache for compute/raytracing pipelines with GPL 4771- radv: stop skipping the cache for monolithic graphics pipelines with GPL 4772- docs: add missing RADV_PERFTEST=video_decode 4773- docs: stop reporting RADV_PERFTEST=gpl as experimental/suboptimal 4774- radv/ci: set RADV_PERFTEST=GPL for all VKCTS jobs 4775- radv/ci: bump the number of runners to 3 for vkcts-navi21-valve 4776- radv: restore uploading shaders individually instead of consecutively 4777- radv: implement graphics shaders relocation for a RGP workaround 4778- radv: fix importing retained NIR shaders when a lib uses the RETAIN bit 4779- radv: use last_vgt_api_stage for determining the last stage with XFB 4780- radv: only initialize shader arguments for the active stages 4781- radv simplify compiling graphics shaders with a mask of active NIR stages 
4782- radv: disable DCC for mipmaps on GFX11 4783- radv: ignore registering pipeline libaries with SQTT 4784- radv/ci: add missing expected failures with RADV_PERFTEST=gpl on GFX1100 4785- radv: reduce maximum line width to 8.0 4786- radv: add support for rectangularLines 4787- Revert "radv: acquire pstate on-demand when capturing with RGP" 4788- radv/amdgpu: only set a new pstate if the current one is different 4789- radv: only skip emitting the pipeline blend state if the FS uses an epilog 4790- radv: stop using a PS epilog when the FS doesn't write any color outputs 4791- ci: uprev vkd3d-proton 4792- zink/ci: skip KHR-GL46.texture_swizzle.functional with RADV 4793- zink/ci: set RADV_PERFTEST=gpl for RADV jobs 4794- radv/ci: disable vkcts-kabini-valve 4795- radv/ci: move CI lists for external GPUs in separate folder 4796- radv: configure SQ_THREAD_TRACE_CTRL.REG_AT_HWM on GFX11 4797- radv: only enable SQTT for SE0 on GFX11 4798- radv: make sure to wait for the trace buffer also on GFX11 4799- radv: implement a workaround for SQTT on GFX11 4800- radv: disable SPM counters with RGP on GFX11 4801- radv: enable SQTT tracing on GFX11 4802- radv: set VS_OUT_MISC_SIDE_BUS_ENA for clip distances on GFX10.3+ 4803- radv/ci: cleanup CI lists for dEQP-VK.memory.* tests that timeout 4804- ac/nir: add resinfo lowering for sliced storage 3D views 4805- radv: implement VK_EXT_image_sliced_view_of_3d on GFX10+ 4806- radv: advertise VK_EXT_image_sliced_view_of_3d on GFX10+ 4807- radv: cleanup radv_emit_{conservative,msaa}_state() functions 4808- radv: stop setting ENABLE_POSTZ_OVERRASTERIZATION to 1 4809- radv: set MSAA_NUM_SAMPLES to 0 for underestimate rasterization 4810- radv: enable primitiveUnderestimation on GFX9+ 4811- zink/ci: skip one more test that timeout with RADV 4812- radv: fix flushing non-coherent images inside secondaries on GFX9+ 4813- radv: fix flushing non-coherent images in EndCommandBuffer() 4814- radv: fix draw calls with 0-sized index buffers and robustness 
on NAVI10 4815- radv: only expose EXT_pipeline_library_group_handles if RT is enabled 4816- amd,ac/rgp: fix SQTT memory types 4817- radv: ignore alpha_is_on_msb on GFX11 because the hw ignores it 4818- radv: use new EVENT_WRITE_ZPASS packet3 on GFX11 4819- radv: fix DCC decompress on GFX11 4820- radv: stop allocationg the attr ring BO for compute queues on GFX11 4821- ci: uprev CTS to 1.3.5.0 4822- radv/ci: adjust timeouts for Vega10 and Renoir 4823- radv/ci: stop skipping some graphics pipeline library tests 4824- radv/ci: update CI lists for CTS 1.3.5.0 on GFX110/POLARIS10/PITCAIRN 4825- wsi: move an assertion in wsi_xxx_surface_get_capabilities2() 4826- radv: do not add descriptor BOs on update when the global BO list is used 4827- radv: fix incorrect stride for primitives generated query with GDS 4828- radv: fix border color swizzle for stencil-only format on GFX9+ 4829- radv: fix defining RADV_USE_WSI_PLATFORM 4830- radv: move disabling DCC for VRS rate images in radv_get_surface_flags() 4831- ac/surface: add RADEON_SURF_VRS_RATE for selecting swizzle mode on GFX11 4832- radv: add support for VRS attachment on GFX11 4833- radv: do not emit PA_SC_VRS_OVERRIDE_CNTL from the pipeline on GFX11 4834- radv: advertise attachmentFragmentShadingRate on GFX11 4835- radv: enable VK_KHR_fragment_shading_rate on GFX11 4836- radv: disable DCC with signedness reinterpretation on GFX11 4837- radv: move instance related code to radv_instance.c 4838- radv: move physical device related code to radv_physical_device.c 4839- radv: move queue related code to radv_queue.c 4840- radv: move sampler related code to radv_sampler.c 4841- radv: move event related code to radv_event.c 4842- radv: move buffer related code to radv_buffer.c 4843- radv: move device memory related code to radv_device_memory.c 4844- radv: zero-initialize radv_shader_args right before declaring them 4845- radv: zero-initialize radv_shader_info earlier for graphics pipeline 4846- radv: fix the error code when the 
driver fails to create a PS epilog 4847- radv: determine if a graphics pipeline needs a noop FS earlier 4848- radv: keep track of the retained NIR shaders sha1 for LTO pipelines 4849- radv: allow to cache optimized (LTO) pipelines with GPL 4850- radv: rename RADV_PIPELINE_LIBRARY to RADV_PIPELINE_RAY_TRACING_LIB 4851- radv: add helpers for destroying various pipeline types 4852- radv: fix NGG streamout with VS and GPL on GFX11 4853- spirv: add SpvCapabilityFragmentFullyCoveredEXT 4854- spirv,nir: add support for SpvBuiltInFullyCoveredEXT 4855- radv: lower nir_intrinsic_load_fully_covered 4856- radv: enable SAMPLE_COVERAGE_ENA if the fully covered built-in is used 4857- radv: implement fullyCoveredFragmentShaderInputVariable 4858- radv: enable fullyCoveredFragmentShaderInputVariable on GFX9+ 4859- radv: remove set but never used num_preserved_sgprs 4860- radv: stop storing the binary as part of radv_shader_part 4861- radv: store spi_shader_col_format to radv_shader_part_binary 4862- radv: store the total radv_shader_part_binary size 4863- radv: upload prologs/epilogs as part of radv_shader_part_create() 4864- radv: allow to return the PS epilog binary to the pipeline 4865- radv: make radv_shader_part_create() non-static 4866- radv: add support for caching PS epilogs 4867- radv: stop using radv_get_shader_shader() for task shaders 4868- radv: replace radv_lookup_user_sgpr() by radv_get_user_sgpr() 4869- radv: pass shader/base_reg to radv_emit_descriptor_pointers() 4870- radv: pass shader/base_reg to radv_emit_inline_push_consts() 4871- radv: pass shader/base_reg to radv_emit_userdata_address() 4872- radv: use a separate compute path in radv_flush_constants() 4873- radv: pass radv_shader to radv_dump_shader_stats() 4874- radv: rework dumping shaders when a GPU hang is reported 4875- radv: pass radv_ray_tracing_pipeline to radv_rt_pipeline_compile() 4876- radv: pass radv_shader to radv_shader_need_indirect_descriptor_sets() 4877- radv: implement VK_KHR_map_memory2 
4878- radv: use common GetBufferMemoryRequirements2() 4879- radv: move cs_regalloc_hang_bug to radv_shader_info 4880- radv: pass a radv_shader to radv_emit_dispatch_packets() 4881- radv: remove radv_pipeline::device completely 4882- radv: rework binding shaders to cmdbuf by introducing new helpers 4883- radv: move radv_meta_* to a new folder 4884- radv: copy the multisample state to radv_cmd_state 4885- radv: move uses_user_sample_locations to radv_multisample_state 4886- radv: separate the sample shading state between FS and graphics pipeline 4887- radv: add DI_PT_RECTLIST to si_conv_prim_to_gs_out() 4888- radv: stop checking dynamic states when emitting the guardband state 4889- radv: rename gfx9_gs_info to radv_legacy_gs_info 4890- radv: move {esgs,gsvs}_ring_size to radv_legacy_gs_info 4891- radv/rt: bind the pipeline stack when it's not dynamic 4892- radv/ci: update CI lists for Polaris10 and Pitcairn 4893- radv: stop using get_vs_output_info() when emitting VS/NGG shaders 4894- radv: emit the GS copy shader outside of radv_pipeline_emit_hw_gs() 4895- radv: add radv_get_last_vgt_shader() helper 4896- radv: stop using the pipeline for emitting PS inputs 4897- radv: use the shader info stage to simplify emitting NGG shaders 4898- radv: use the ES type to apply a workaround for NGG on GFX10 4899- radv: pass the ES shader to radv_pipeline_emit_hw_ngg() 4900- radv: stop using the pipeline for emitting shaders 4901- radv: pass shader/base_reg to radv_emit_view_index_per_stage 4902- radv: pass a shaders array to radv_get_shader() 4903- radv: add radv_bind_shader() helper 4904- radv: add an assertion about shader stage to radv_bind_pre_rast_shader() 4905- radv: keep track of active stages as part of the cmdbuf state 4906- radv: determine the last VGT shader at pipeline bind time 4907- radv: stop using last_vgt_api_stage_{locs} during cmdbuf recording 4908- radv: move dirtying flags for mesh shading to radv_bind_pre_rast_shader() 4909- radv: copy bound shaders to the 
cmdbuf state 4910- radv: determine and store the next graphics stage to radv_shader_info 4911- radv: move user_data_0 to the shader info pass 4912- radv: replace pipeline->is_ngg occurrences during cmdbuf recording 4913- radv: replace pipeline->force_vrs_per_vertex during cmdbuf recording 4914- radv: use serialized NIR for graphics libs with the RETAIN flag 4915- radv: remove radv_graphics_pipeline::use_per_attribute_vb_descs 4916- radv: remove radv_graphics_pipeline::last_vertex_attrib_bit 4917- radv: remove radv_graphics_pipeline::next_vertex_stage 4918- radv: remove radv_graphics_pipeline::can_use_simple_input 4919- aco: remove unused aco_shader_info::vb_desc_usage_mask 4920- radv: adjust vb_desc_usage_mask for dynamic VS inputs in the info pass 4921- radv: remove radv_graphics_pipeline::vb_desc_usage_mask 4922- radv: remove radv_graphics_pipeline::vb_desc_alloc_size 4923- radv: rework emitting inner coverage when a fragment shader is bound 4924- radv: copy custom blend mode to the cmdbuf state 4925- radv: add a helper that returns the current rasterized primitive 4926- radv: copy rast_prim to the cmdbuf state 4927- radv: copy uses_{drawid,baseinstance} to the cmdbuf state 4928- radv: copy ia_multi_vgt_param to the cmdbuf state 4929- radv: add a helper to convert a VkPipelineBindPoint 4930- radv: copy need_indirect_descriptor_sets to radv_cmd_state 4931- radv: add push constant state to the cmdbuf state 4932- radv: fix sample shading when a new fragment shader is bound 4933- vulkan: add dynamic support for rectangles enable/mode 4934- radv: add dynamic support for rectangles enable/mode 4935- vulkan: Update XML and headers to 1.3.246 4936- radv: copy db_render_control to the cmdbuf state 4937- radv: set PS_ITER_SAMPLE(1) for sample shading during cmdbuf recording 4938- radv: configure PA_SC_MODE_CNTL_1 during cmdbuf recording 4939- radv: add the raygen shader BO to the cmdbuf list 4940- radv: fix binding raytracing/compute pipelines 4941- zink/ci: remove 
primitive-id-no-gs-quads from the NAVI10 fail list 4942- radv/ci: add one more flake 4943- radv: only copy non-NULL shaders when loaded from the cache 4944- radv: rely on non-NULL binaries when inserting shaders to the cache 4945- radv: allow to create/insert PS epilogs from/to the cache for libs 4946- radv: remove dead code in radv_pipeline_get_nir() 4947- radv: add VkGraphicsPipelineLibraryFlag to the graphics pipeline key 4948- radv: ensure to retain NIR shaders for GPL libs found in the cache 4949- radv: enable shaders cache for libraries with GPL 4950- radv: fix VS prologs with GPL and static binding stride 4951- radv: emit the PS epilog after the graphics pipeline 4952- radv: add a helper for retaining NIR shaders 4953- radv: move the serialized NIR to radv_graphics_lib_pipeline 4954- radv: simplify a check when retaining NIR shaders 4955- radv: do not retain noop FS for libs when a cache hit happened 4956- radv: import retained NIR shaders later in the compilation process 4957- radv/rt: stop storing unused hashes/identifiers 4958- radv: create a helper for copying VkPipelineShaderStageCreateInfo 4959- radv: copy stages instead of serializing NIR for GPL with the RETAIN flag 4960- radv: enable VK_EXT_graphics_pipeline_library by default 4961- radv/ci: update expected failures for PITCAIRN 4962- radv/ci: remove no longer existing tests for PITCAIRN 4963- radv/ci: update expected failures with BONAIRE 4964- docs: add more release notes for RADV 4965- radv: fix re-emitting vertex user SGPRs when binding a graphics pipeline 4966- radv/ci: remove one RT test from the expected failures on RDNA3 4967- radv: split radv_pipeline.c into radv_pipeline_{compute,graphics}.c 4968- radv: fix pipeline creation feedback with imported graphics libs 4969- radv: cleanup after splitting radv_pipeline.c 4970- radv: fix detecting FMASK_DECOMPRESS/DCC_DECOMPRESS meta pipelines 4971- vulkan: ignore rasterizationSamples when the state is dynamic 4972- radv: try to keep HTILE 
compressed for READ_ONLY_OPTIMAL layout 4973- radv: re-emit the guardband state when related PSO are bound 4974- radv: disable fast-clears with CMASK for 128-bit formats 4975- radv: do not allow 1D block-compressed images with (extended) storage on GFX6 4976- radv: fix usage flag for 3D compressed 128 bpp images on GFX9 4977- radv: update binning settings to work around GPU hangs 4978- radv/amdgpu: fix adding continue preambles and postambles BOs to the list 4979- radv: wait for occlusion queries in the resolve query shader 4980- radv: delay enabling/disabling occlusion queries at draw time 4981- radv: track DB_COUNT_CONTROL changes to avoid context rolls 4982- radv: add the perf counters BO to the preambles BO list 4983- radv: only enable extendedDynamicState3ConservativeRasterizationMode on GFX9+ 4984- ac/nir: fix 8-bit/10-bit PS exports clamping 4985- radv: fix dynamic depth clamp enable support 4986- radv: fix fast-clearing images with VK_REMAINING_{ARRAY_LAYERS,MIP_LEVELS} 4987- radv: disable RB+ blend optimizations on GFX11 when a2c is enabled 4988 4989Sarah Walker (1): 4990 4991- pvr: Update FWIF transfer queue register structures 4992 4993Sathishkumar S (8): 4994 4995- radeonsi/vcn: add register definitions for JPEG 4.0.3 4996- radeonsi/vcn: use register versions for jpeg 4997- radeonsi/vcn: add support for picture crop on JPEG 4.0.3 4998- radeonsi/vcn: support ARGB/RGBA conversion on JPEG 4.0.3 4999- radeonsi/vcn: set jpeg reg version for gfx940 5000- radeonsi/vcn: reset to default value when ROI/FC is not used 5001- frontends/va: support crop region in jpeg decode 5002- radeonsi/vcn: enable RGBA/ARGB formats on gfx940 jpeg 5003 5004Sebastian Wick (1): 5005 5006- loader: do not check the mesa DRI_Mesa version if it was not found 5007 5008Sergi Blanch Torne (8): 5009 5010- ci: disable Collabora's LAVA lab for maintance 5011- Revert "ci: Collabora's LAVA lab for maintance" 5012- ci: Uprev kernel to 6.1.7 5013- ci: disable Collabora's LAVA lab for maintance 
5014- ci: disable Collabora's LAVA lab for maintance 5015- ci: include setup test environment script in the output artifacts 5016- Revert "ci: disable Collabora's LAVA lab for maintance" 5017- ci: disable Collabora's LAVA lab for maintance 5018 5019Sidney Just (4): 5020 5021- zink: Fix non debug builds failing to compile on 5022- loader: Add missing brace to fix compile 5023- zink: add check for samplerMirrorClampToEdge Vulkan 1.2 feature 5024- zink: Add missing features to the profile file 5025 5026Sil Vilerino (15): 5027 5028- d3d12: Honor suggested driver profile/level for H264/HEVC encode 5029- d3d12: Video processing - Fix out of bounds array access 5030- d3d12: Video Encode - Fix ID3D12CommandAllocator leak 5031- d3d12: Fix VP9 Decode - Checking 0xFF instead of 0x7F for invalid frame_ref[i].Index7Bits 5032- frontend/va: Add format support checks for VA_RT_FORMAT_* in VaCreateConfig/VaGetConfigAttributes 5033- frontend/va: Remove duplicate code in format support checking/reporting. 
- frontend/va: Keep track of some VP9 previous frame data for current frame use_prev_in_find_mvs_refs
- d3d12: VP9 Decode - Fix use_prev_in_find_mvs_refs calculation
- d3d12: Fix video decode for interlaced streams with reference only textures required
- d3d12: H264/HEVC Encode - Set both VBV InitialCapacity/Size in CBR Rate Control to same value when requested
- d3d12: Encode H264/HEVC - Do not write PPS unless different from active
- d3d12: Encode - Only upload headers when written headers size is > 0
- nir: Fix use of alloca() without #include c99_alloca.h
- Revert "d3d12: Honor suggested driver profile/level for H264/HEVC encode"
- d3d12: Video processor to only promote resources to permanent residency when there is work to be flushed

Simon Fels (2):

- venus: allow vtest socket being specified by env variable
- virgl/vtest: allow socket being specified by env variable

Simon Perretta (38):

- pvr: Add new Rogue compiler framework
- pvr: Add support for optional instruction params
- pvr: Support dual-destination ALU instructions
- pvr: Commonise some instruction member defs
- pvr: Drop the ENUM_PACKED macro
- pvr: Keep NIR SSA defs instead of registers
- pvr: Adjust instruction repeat offset
- pvr: Validate instruction repeat and src/dst sizes
- pvr: Add block printing support during validation
- pvr: Clarify unreachable text
- pvr: Add ADD64 support
- pvr: Add memory load support
- pvr: Add bitwise instruction support
- pvr: Additional register subarray support
- pvr: Support loading immediate values
- pvr: Load descriptors from memory
- pvr: Split pvr_private.h
- pvr: Use descriptor/set/table offsets from driver
- pvr: Add NIR pass to lower vars to SSA
- pvr: Amend subarray ownership code
- pvr: Add support for fitr.pixel
- pvr: Add support for sample instructions
- pvr: Add support for validating
  modifier combos
- pvr: Add support for emitpix
- pvr: Add support for WOP
- pvr: Register allocation improvements
- pvr: Fix descriptor set address calculation
- pvr: Add support for generating per-job EOT program
- pvr: Add support for generating NOP program
- pvr: Add support for IDF
- pvr: Add support for ST
- pvr: Add branch support
- pvr: Add support for TST
- pvr: Add basic support for manual instruction grouping
- pvr: Add support for MOVC
- pvr: Add late op lowering pass and conditional execution
- pvr: Amend definitions for ST and IDF
- pvr: Add encodings for index registers

Simon Ser (1):

- egl: fix fd_display_gpu on surfaceless and device platforms

Sonny Jiang (5):

- radeonsi: Add NV12 support for AV1
- gallium/pipe: change PIPE_DEFAULT_DECODER_FEEDBACK_TIMEOUT_NS to 1 second
- amd/common: Add gfx940 codec query support
- radeonsi/vcn: Add video capabilities support for gfx940
- radeonsi/vcn: Add decode support for gfx940

SoroushIMG (31):

- zink: add pass checking for lod overflow in txf
- zink: add zink_cs_key
- zink: add VK_EXT_image_robustness
- zink: add robust_access field to shader key
- zink: lower LOD-invalid txf when imageRobustAccess2 is missing
- zink: update gl43 profile to allow imageRobustAccess
- zink: fix sparse residency query and minLOD feature checks
- zink: fix cap check for arb sparse texture2
- zink: only save frag const buffers when used by blit
- zink: fix leak when rebinding same image surface
- zink: clear null image surfaces to 0
- zink: fix pointcoord y inversion
- zink: relax bresenhamLines requirement for non-strictLine drivers
- zink: fix compute shader leaks
- zink: allocate program shader caches from the program's mem ctx
- zink: stop creating pipeline library cache for non-optimal_key drivers
- zink: free resource objects' views
  array during destruction
- zink: fix stale point sprite mode state
- zink: fix shadow mask change logic when binding sampler views
- zink: track shadow swizzle for all shader stages
- zink: minor formatting change
- zink: add needs_zs_shader_swizzle shader key
- zink: extend shadow swizzle pass to all zs textures
- zink: add depth/stencil needs shader swizzle workaround field
- zink: workaround undefined swizzle 1 for z/s textures
- zink: rename shadow key to zs swizzle
- zink: Add driver name and API version to renderer name
- zink: do not emit line stipple dynamic state when emulating
- zink: take location_frac into account in lower_line_smooth_gs
- zink: fix incorrect line mode check for bresenham
- zink: refcount the correct query pool

Sui Jingfeng (1):

- meson: add basic support for loongarch

SureshGuttula (1):

- radeonsi: Add support for DPB resize

Sviatoslav Peleshko (9):

- anv: Handle VkAccelerationStructureBuildRangeInfoKHR::transformOffset
- driconf/anv: Apply limit_trig_input_range WA to Rise of the Tomb Raider
- iris: Avoid creating uncompressed view with unaligned tile offsets on BDW
- anv: Handle all fields in VkAccelerationStructureBuildRangeInfoKHR
- anv: Move WA MEDIA_VFE_STATE after stalling PIPE_CONTROL
- glsl: Fix codegen for constant ir_binop_{l,r}shift with mixed types
- isl: Check all channels in isl_formats_have_same_bits_per_channel
- anv: Handle UNDEFINED format in image format list
- anv: Improve image/view usage bits verification

Tapani Pälli (42):

- intel/compiler: add cpp_std=c++17 when building tests
- intel/hasvk: remove some stale comments, wa was removed
- anv: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
- hasvk: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
- iris: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
- mesa: move
  component bits queries as GL ES only
- intel/genxml: set unused 3DSTATE_PS_EXTRA field as mbz
- intel: enable existing workaround for ICL platform
- intel/blorp: disable REP16 for gfx12+ with R10G10B10_FLOAT_A2
- iris: disable preemption for 3DPRIMITIVE during streamout
- iris: handle error in iris_resource_from_handle
- spirv: add workaround for Metro Exodus in spirv_to_nir
- radv: revert Metro Exodus workaround which was moved to common code
- mesa/st: refactor st_destroy_texcompress_compute condition
- mesa/st: add astc decoder lookup tables
- mesa/st: initialize resources for ASTC decoding
- mesa: add astc decoder shader template (glsl es version)
- mesa/st: support compute shader decoding of ASTC
- anv: Wa_14016407139, add required pc when SBA programmed
- iris: implement emission of 3DSTATE_HS for Wa_1306463417
- anv: emit 3DSTATE_HS in cmd_buffer_flush_gfx_state
- anv: limit generated draws to pipelines without HS stage
- anv: implement emission of 3DSTATE_HS for Wa_1306463417
- iris: emit 3DSTATE_HS for each primitive on gfx12
- anv: emit 3DSTATE_HS for each primitive on gfx12
- intel/compiler: add comment about workaround on simd width
- anv: fix sends_count_expectation assert on simd32
- intel/isl: disable TILE64 for YCRCB formats
- anv: implement occlusion query related Wa_14017076903
- iris: implement occlusion query related Wa_14017076903
- intel/fs: restore message layout changes for cube array
- anv: use primitive ID override when shader does not supply it
- anv: take primitive ID override to account Wa_14015297576
- anv: check for MESA_SHADER_TESS_CTRL with get_tcs_prog_data
- intel/common: limit the amount of SLM with Wa_14017341140
- intel/fs: use intel_needs_workaround for Wa_22013689345
- intel/compiler: use intel_needs_workaround for Wa_14012437816
- isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125
-
  iris: implement state cache invalidate for Wa_16013063087
- anv: cleanup bitmask construction for PIPELINE_SELECT
- anv: implement state cache invalidate for Wa_16013063087
- isl: fix layout for comparing surf and view properties

Tatsuyuki Ishi (22):

- radv: Fix depth-only-with-discard when epilogs are used.
- radv: Fix emitting tess indirect descriptors twice.
- radv: Loop over shader stages in flush_indirect_descriptor_sets.
- radv: Fix noop FS not getting constructed for GPL pipelines.
- radv: Fix missing rbplus_allowed check for dynamic PS epilogs.
- radv: Assert the hardware support rbplus when emitting rbplus state.
- radv: Keep shader code ptr in a separately allocated buffer.
- radv/sqtt: Use code buffer from radv_shader directly instead of copying.
- radv: Replace radv_trap_handler_shader with radv_shader.
- radeonsi: SDMA v4 size field is size - 1
- radv: SDMA v4 size field is size - 1
- radv: Remove SDMA padding from copy helpers.
- radv: Use common helpers to translate format in SDMA copy.
- radv/rt: Don't upload the prolog twice.
- radv: Use radeon_cmdbuf for sdma_copy_image.
- radv: Introduce sdma_copy_buffer for GFX7+.
- radv: Upload shaders to invisible VRAM on small BAR systems.
- radv: Wait for shader uploads asynchronously.
- radv: Fix missing wait of GS copy shader upload for dmashaders.
- amd: Add radv_foreach_stage to ForEachMacros.
- radv: Pre-compute descriptor set layout hash.
- ci/android: Make armv8's arch aarch64 instead of arm.

Teng, Jin Chung (2):

- frontend/va: Add large_scale_tile from VADecPictureParameterBufferAV1
- d3d12: AV1 Dec - Set anchor_frame_idx only when large_scale_tile equals 1

Thomas H.P.
Andersen (4):

- docs/panvk: VK_KHR_descriptor_update_template
- meson: use summary()
- meson: use sections in summary()
- v3dv: use common code for descriptor update template

Thong Thai (6):

- gallium/auxiliary/vl: clean-up progressive shader
- radeonsi/vcn: use encoder/decoder caps reported by kernel
- gallium/auxiliary/vl: add crop to compute shader
- mesa/main: rework locale setup/teardown
- util: check and initialize locale before using it
- tgsi: use locale independent float and double parsing

Timothy Arceri (15):

- nir/nir_opt_copy_prop_vars: remove extra loop
- nir/nir_opt_copy_prop_vars: avoid comparison explosion
- nir/nir_opt_copy_prop_vars: reuse hash tables
- nir/nir_opt_copy_prop_vars: reuse dynamic arrays
- nir/nir_opt_copy_prop_vars: reorder clone calls
- nir/nir_opt_copy_prop_vars: don't call memset when cloning
- ci: enable dEQP-VK.ubo.random.all_shared_buffer.48
- glsl: copy prop vars before scalarizing alus
- glsl: add _token_list_prepend() helper to the parser
- glsl: isolate object macro replacments
- glsl: remove do_copy_propagation_elements() optimisation pass
- glsl: allow 64-bit integer on RHS of shift
- util/00-mesa-defaults: add Akka Arrh workaround
- mesa: add _mesa_is_api_gles2() helper
- glsl: move some GL ES checks to the NIR linker

Timur Kristóf (155):

- aco/optimizer: Add missing v_lshlrev condition to can_apply_extract.
- aco/optimizer: Optimize p_extract + v_mul_u32_u24 to v_mad_u32_u16.
- radv: Make NGG query emission a dirty flag.
- radv: Get rid of app_shaders_internal.
- radv, aco: Add uses_full_subgroups to compute shader info.
- aco: Enable constant exec mask based optimization on compute shaders.
- radv: Lower dynamic VS inputs in NIR.
- aco: Remove dynamic VS input loads.
- nir: Add pack_half_2x16_rtz_split opcode.
- radv, aco, ac: Implement pack_half_2x16_rtz_split.
- nir: Lower pack_half_2x16_split to RTZ if available.
- nir: Add algebraic optimization for VKD3D-Proton fp32->fp16 conversion.
- ac/gpu_info: Add has_pcie_bandwidth_info.
- radv: Don't place CS in VRAM when bandwidth is too low.
- nir/opt_algebraic: Add optimization for ieq/ine and right-shift.
- radv: Disable NGG culling when conservative overestimation is used.
- ac/nir/cull: Alway remove zero-area triangles in face culling.
- ac/nir/ngg: Include culled primitives in query.
- radv: Don't change LDS_SIZE for NGG culling shaders.
- radv: Move checking primitive topology to radv_get_ngg_culling_settings.
- radv: Use shader code to skip NGG culling in small workgroups.
- radv: Remove NGG culling skip from command buffer.
- radv: Refactor radv_emit_ngg_culling_state so it's based on dirty flags.
- nir: Clarify comment above load_buffer_amd.
- ac: Port ACO's get_fetch_format to ac_get_safe_fetch_size.
- ac/llvm: Remove "structurized" argument and instead check vindex.
- ac/llvm: Fix buffer_load_amd with larger than 32-bit channel sizes.
- ac/llvm: Fix ac_build_buffer_load to work with more than 4 channels.
- ac/llvm: Change ac_build_tbuffer_load to take format and channel type.
- radv: Move VS input lowering to new file: radv_nir_lower_vs_inputs.
- aco: Get rid of redundant load_vmem_mubuf function.
- aco: Don't set scalar offset on buffer load instructions when it's zero.
- aco: Remove MTBUF zero operand.
- radv: Call nir_lower_array_deref_of_vec in radv_lower_io_to_scalar_early.
- aco/optimizer: Change v_cmp with subgroup invocation to constant.
- radv: Emulate VGT_ESGS_ITEMSIZE in shaders on GFX9+.
- util: Add util_format_get_array.
- ac: Add pending_vmem field to args.
- radv: Set pending_vmem on dynamic VS input args.
- aco: Generalize vs_inputs to args_pending_vmem.
- aco, radv: Rename aco_*_key to aco_*_info.
- aco, radv: Move PS epilog and VS prolog args to their info structs.
- aco, radv: Don't use radv_shader_args in aco.
- aco: Don't include headers from radv.
- ac/nir: clear nir_var_shader_out from TCS barriers
- aco: Remove vtx_binding from MUBUF/MTBUF instructions.
- nir: Add load_typed_buffer_amd intrinsic.
- aco: Implement load_typed_buffer_amd.
- ac/llvm: Implement typed buffer load intrinsic.
- radv: Lower non-dynamic VS inputs in NIR.
- radv: Apply swizzle and alpha adjust in radv_nir_lower_vs_inputs.
- aco: Remove VS inputs from visit_load_input.
- aco: Rename visit_load_input to visit_load_fs_input.
- radv: Remove VS inputs code from LLVM backend.
- ac/llvm: Remove unused function ac_build_struct_tbuffer_load.
- aco, radv: Remove VS IO information from ACO.
- aco: Don't add soffset to swizzled MUBUF base.
- aco: Use zero for MUBUF/MTBUF when soffset is undefined.
- aco: Disable MUBUF/MTBUF offsets when they are zero.
- aco: Always enable idxen for swizzled buffer access on GFX11.
- ac/nir/ngg: Remove usused lds_es enum values.
- ac/nir/ngg: Rename saved_uniform to reusable_nondeferred_variable.
- ac/nir/ngg: Split some functions out of save_reusable_variables.
- ac/nir/ngg: Move divergence analysis call to analyze_shader_before_culling.
- ac/nir/ngg: Rename state variables to "s".
- ac/nir/ngg: Remove some superfluous variables.
- ac/nir/ngg: Create separate variable for repacked rel_patch_id.
- ac/nir/ngg: Rename repacked variables to clarify their name.
- ac: Add more defines for mesh shading packets.
- radv: Use new mesh shading packet defines.
- radv: Add per-prim attributes to ring_attr stride.
- radv: Use per-prim params in has_param_exports.
- radv: Add extra offset to per-prim params.
- radv: Use PRIM_ATTR for PS inputs on GFX11.
- radv: Include per-prim params in NUM_INTERP on GFX11.
- radv: Adjust mesh draw packets for GFX11.
- ac/nir/ngg: Clarify mesh shader scratch ring.
- ac/nir/ngg: Use attribute ring for mesh shader params.
- ac/nir/ngg: Split legacy workgroup index function.
- ac/nir/ngg: Fix mesh shader layer on GFX11.
- ac/nir/ngg: Store special MS outputs in attribute ring for PS to read.
- radv: Enable mesh shading on GFX11.
- radv: Fix swizzled VS input loads when some components are unused.
- radv: Don't expose NV_mesh_shader and don't use it in CI.
- radv: Remove NV_mesh_shader API entrypoints.
- radv: Remove first_task and ib_addr/ib_stride.
- radv: Clean up emitting zero mesh shader draw id.
- ac/nir/ngg: Remove NV_mesh_shader support.
- ac/nir: Remove ac_nir_apply_first_task_to_task_shader.
- nir: Remove IB address and stride intrinsics.
- radv: Move radv_nir_* to a new folder.
- radv: Move radv_nir_lower_primitive_shading_rate to new file.
- radv: Move radv_nir_lower_fs_intrinsics to new file.
- radv: Move radv_nir_lower_intrinsics_early to new file.
- radv: Move radv_nir_lower_view_index to new file.
- radv: Move radv_nir_lower_viewport_to_zero to new file.
- radv: Move radv_nir_export_multiview to new file.
- radv, ac/nir: Move sin/cos lowering to a common pass.
- radv: Move I/O lowering functions into a new file.
- radv: Use radv_get_shader to get vertex shader when binding pipeline.
- ac/nir/ngg: Slightly improve attribute ring offset calculation.
- ac/nir: Store only lowest 8 bits for task draw ring DWORD3.
- ac/nir: When task->mesh dispatch Y or Z are 0, also set X to 0.
- aco: Consider p_cbranch_nz as divergent branch too.
- aco: Don't remove exec writes that also write other registers.
- aco: Simplify get_phi_operand using Operand::c32_or_c64.
- aco: Don't verify branch exec read when eliminating exec writes.
- aco: Pop branch operands when targets are same in SSA elimination.
- aco: Call dominator_tree before lower_phis.
- aco: Better phi lowering for merge block when else-side is const.
- nir: Gather compile time constant task->mesh dispatch size.
- radv: Use linear_dispatch info in GFX11 task/mesh draw packet.
- radv/amdgpu: Extract CS chain and unchain functions.
- radv/amdgpu: Expose CS chain and unchain on the winsys.
- radv/amdgpu: Extract radv_amdgpu_add_cs_to_bo_list function.
- radv/amdgpu: Remember which CS the current one is chained to.
- radv/amdgpu: Walk chained CS objects for BO list.
- radv/amdgpu: Unchain CS array in queue code not in winsys.
- radv: Chain cmd buffers in queue code when possible, not in winsys.
- radv/amdgpu: Remove can_patch and chained submit code path.
- ac/llvm: Cover runtime 0 in GFX10 gs_alloc_req workaround.
- aco: Fix optimization of v_cmp with subgroup invocation.
- aco: Don't use nir_selection_control in aco_ir.
- aco: Only include nir.h in instruction selection.
- radv: Don't include nir.h in radv_shader.h
- radv: Create continue preamble on GFX6 even when no shader rings are used.
- ac: Add maximum number of submitted IBs.
- radv/amdgpu: Fix mismatching return type of radv_amdgpu_cs_submit.
- radv/amdgpu: Only allow IB BOs on graphics and compute queues.
- radv/amdgpu: Use correct alignment when creating CS BOs.
- radv/amdgpu: Extract radv_amdgpu_cs_add_old_ib_buffer.
- radv/amdgpu: Add a few assertions during submit.
- radv/amdgpu: Remove hw_can_chain in favour of use_ib.
- radv/amdgpu: Rewrite fallback code path so it can split submissions.
- radv/amdgpu: Allow multiple continue preambles.
- radv/amdgpu: Add continue preambles to fallback submit.
- radv/amdgpu: Add postambles to fallback submit.
- radv/amdgpu: Add ability to submit non-chained CS to fallback.
- radv/amdgpu: Split gang submissions correctly when not chained.
- radv: Fill continue preambles and postambles properly.
- radv: Split submission in winsys instead of radv_queue.
- radv/amdgpu: Use fallback submit for queues that can't use IBs.
- radv/amdgpu: Clean up submission functions.
- radv/amdgpu: Respect maximum number of submitted IBs per IP type.
- radv: Allow task/mesh shaders with RADV_DEBUG=noibs.
- radv/amdgpu: Add bool is_secondary argument to cs_create function.
- radv/amdgpu: Extract radv_amdgpu_cs_bo_create function.
- radv/amdgpu: Place secondary CS without IB2 in non-WC GTT.
- ac, aco, radv: Clarify LDS size on GFX6, and NGG shaders.
- radv: Don't hardcode LDS granularity in gfx9_get_gs_info.
- aco: Remove setup_*_variables and add setup_lds_size instead.
- aco, radv: Remove "key" from aco_compiler_options.
- aco, radv: Remove redundant enable_mrt_output_nan_fixup from PS epilog info.
- ac/nir/ngg: Don't store primitive IDs from culled primitives.
- aco: Disallow constant propagation on SOPP and fixed operands.

Tomeu Vizoso (7):

- android: Make libbacktrace optional again
- android: Cleanup unneeded headers from the sync stub
- ci: Build for Android with libbacktrace=false
- ci: Use NDK 25b to build for the Android ABI level 33
- etnaviv: handle missing alu conversion opcodes
- etnaviv: print writemask of store operations
- etnaviv: don't read too much from uniform arrays

Turo Lamminen (4):

- radv: Change radeon_cmdbuf counters to uint64_t to make alias analysis optimize radeon_emit better
- radv: Clean up variables in si_get_ia_multi_vgt_param
- radv: Avoid redundant fetch of radv_device
- radv: Optimize emitting prefetches

Val Packett (1):

- mailmap: Remap name and email for Val Packett

Vincent Davis Jr (1):

- gbm/backend: fix gbm compile without dri

Vinson Lee (2):

- radv: Fix memory leak.
- pps: Fix build errors.

Vitaliy Triang3l Kuzmin (2):

- radv: Set DB_Z_INFO.NUM_SAMPLES to MSAA_EXPOSED_SAMPLES without Z/S
- r600: Alpha to coverage dithering on Evergreen+

Väinö Mäkelä (12):

- intel/vec4: Set the rounding mode
- intel/vec4: Don't optimize multiply by 1.0 away
- hasvk: Don't claim shaderDenormPreserveFloat32 on gfx7
- hasvk: Tell spirv_to_nir float controls are always supported
- hasvk: Enable PixelShaderKillsPixel when omask is used
- hasvk: Mark VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL as stencil write optimal
- hasvk: Handle subpass self-dependencies for stencil shadow copies
- ci/intel: Update hasvk HSW xfails
- hasvk: Disable non-zero fast clears for 8xMSAA images
- intel/ci: Remove outdated hasvk copy_and_blit xfails
- intel/ci: Remove hasvk xfails missing from the CTS
- intel/ci: Remove skipped float_controls tests from hasvk xfails

X512 (3):

- hgl: remove
- haiku: fix build
- EGL: implement Haiku driver over Gallium

Yevhenii Kolesnikov (2):

- nir/loop_analyze: Track induction variables incremented by more operations
- nir/loop_analyze: Determine iteration counts for more kinds of loops

Yiwei Zhang (46):

- venus: log upon device creation
- venus: lazily query and cache gralloc front rendering usage
- venus: disable non AHB external memory bits on Android
- venus: fix formating
- venus: fix tracing init to include instance creation
- venus: render server enforces blob_id_0
- venus: move exp features init back to use ring submit
- venus: further disallow sparse resource
- venus: replace binary search with hardcode for max buffer size
- venus: start requiring all experimental features
- venus: clean up memoryResourceAllocationSize
- venus: clean up globalFencing
- venus: refactor sync fd fence and sempahore features
- venus: tighten up the sync fd requirements for Android wsi
- venus: distinguish external memory from mappable memory support
- venus: fix external buffer creation
- venus: remove redundant abstractions for wsi struct search
- venus: refactor image create info pnext tracking
- venus: simplify ahb image creation
- venus: simplify support for non-AHB external images
- venus: fix external image creation
- venus: fix device memory export alloc info
- venus: fix VK_EXT_image_view_min_lod feature query
- venus: ensure invariance of buffer memory requirement size
- venus: sync to latest protocol for ring status enum
- venus: abort ring submit when ring is in fatal status
- venus: propagate vn_ring to vn_relax
- venus: vn_relax to abort on ring fatal status upon warn order
- venus: revert back the warn order
- venus: sync to latest protocol for asyncRoundtrip
- venus: switch to use 64bit roundtrip seqno
- venus: make vn_instance_wait_roundtrip asynchronous
- venus: let vn_instance_submit_command track ring seqno
- venus: make common wsi bo submission async
- venus: refactor to add vn_sync_payload_external
- venus: make external fence and semaphore export async
- Revert "zink/kopper: Add extra swapchain images for Venus"
- venus: sync latest protocol for layering extensions
- venus: add VK_EXT_load_store_op_none support
- venus: add VK_EXT_rasterization_order_attachment_access support
- venus/docs: sync to latest venus supported extensions
- venus: requires asyncRoundtrip
- venus: requires ringMonitoring
- venus: move exp feature init back to ring and remove unused function
- venus: forward ARM driverVersion for ANGLE workarounds
- radv: respect VK_QUERY_RESULT_WAIT_BIT in GetQueryPoolResults

Yogesh Mohan Marimuthu (18):

- egl: add render_gpu tag to dri2_dpy->fd and dri2_dpy->dri_screen variable
- loader,glx: add render_gpu tag psc->driScreen and psc->fd
- loader,glx,egl,vl,d3d: loader_get_user_preferred_fd() function to return original_fd
- egl: remove is_different_gpu variable from struct dri2_egl_display
- glx: remove is_different_gpu variable from struct dri_screen
- loader,glx,egl: remove is_different_gpu variable from loader
- ac,radeonsi: move shadow regs create ib preamble function to amd common
- radv: add shadowregs variable to RADV_DEBUG environment variable
- radv: add support for register shadowing
- radv: set preemp flag and pre_ena bit for shadowregs
- radv: INDEX_TYPE and NUM_INSTANCES PKT3 are not shadowed
- radv: fence complete struct is 4 qw size
- radv: allow NULL initial_preamble_cs in radv_amdgpu_winsys_cs_submit_sysmem()
- radeonsi: remove some shadow reg optimization for bf1 game
- wsi/display: check alloc failure in wsi_display_alloc_connector()
- ac/surface: only adjust pitch if surf_pitch was modified
- amd/surface: add RADEON_SURF_NO_TEXTURE flag
- radv: set RADEON_SURF_NO_TEXTURE flag in
  radv_get_surface_flags()

Yogesh Mohanmarimuthu (7):

- egl: add fd_display_gpu to struct dri2_egl_display
- egl,egl/x11: keep display fd open for prime
- egl: create DRI screen for display GPU in case of prime
- loader,glx,egl/x11: init dri_screen_display_gpu in struct loader_dri3_drawable
- egl/wayland: keep display fd open for prime
- loader: make image_format_to_fourcc() non-static
- egl/wayland: for prime, allocate linear_copy from display GPU VRAM

Yonggang Luo (8):

- util: Implement util_iround with lrintf unconditionally
- util: Fixes error: no previous prototype for 'mesa_cache_db_entry_remove' Fixes: c92c99481fd ("util/mesa-db: Support removal of cache entries")
- vulkan: Use static_assert for check HWVULKAN_DISPATCH_MAGIC == ICD_LOADER_MAGIC
- meson: Split c_cpp_args from pre_args
- meson: Combine duplicated c_args and cpp_args
- meson: When sse2 enabled, both c and cpp using sse2 options
- meson: Split sse2_arg and sse2_args out of c_cpp_args
- meson: Use sse2_arg and sse2_args to replace usage of c and c_sse2_args

Yusuf Khan (2):

- nvc0/nv50: support and enable EXT_memory_object*
- gallium: create query_memory_info implementation for sw drivers

Yuxuan Shui (1):

- loader: unregister special event in loader_dri3_drawable_fini

antonino (54):

- zink: fix line smooth lowering
- zink: add \`zink_emulate_point_smooth` driconf
- zink: add \`lower_point_smooth` to \`zink_fs_key`
- zink/nir_to_spirv: add support for \`nir_intrinsic_load_point_coord`
- nir: handle output beeing written to deref in \`nir_lower_point_smooth`
- zink: handle point_smooth emulation
- drirc: set \`zink_emulate_point_smooth` for Quake II
- zink: fix stipple pattern in oblique lines
- zink: fix \`final_hash` update in \`zink_gfx_program_update`
- mesa: correctly allocate space for converted primtives
-
  gallium: decompose quad strips into quads if supported
- zink: handle switching between primitives
- nir: handle primitives with adjacency
- nir: avoid generating conflicting output variables
- nir: calculate number of vertices in nir_create_passthrough_gs
- nir: handle edge flags in nir_create_passthrough_gs
- zink: add \`has_edgeflags` flag to zink_shader and zink_gfx_program
- zink: handle edgeflags
- nir: allow to force line strip out in nir_create_passthrough_gs
- zink: force line strip out when emulating stipple
- zink: filled quad emulation gs generation function
- zink: add \`zink_rast_prim` enum
- zink: handle quads
- zink: fix flat shading on filled quads
- zink: add flags to \`zink_gfx_program` and \`zink_context`
- zink: add \`needs_inlining` to \`zink_shader`
- zink: implement flat shading using inlined uniforms
- nir/zink: handle provoking vertex mode in \`nir_create_passthrough_gs`
- zink: handle provoking vertex mode for filled quads
- nir: keep xfb properties in nir_create_passthrough_gs
- zink: keep xfb properties in quad emulation gs
- zink: advertise support for the quad primitive
- zink: prevent crash when freeing
- zink: unified \`zink_set_primitive_emulation_keys` and \`zink_create_primitive_emulation_gs`
- zink: zink: add \`parent` to \`zink_shader::non_fs`
- zink: improve generated gs unbinding
- zink: unbind generated gs in \`bind_last_vertex_stage`
- zink/ci: remove \`primitive-id-no-gs-quads` from radv-vangogh-fails
- nir: only handle flat interpolation when needed in \`nir_create_passthrough_gs`
- zink: simplify logic to call \`zink_set_primitive_emulation_keys`
- zink: add field to 'zink_gs_key' and enum
- zink: add provoking vertex mode lowering
- zink: always advertize provoking vertex mode support
- zink: update requirements now that pv mode can be emulated
- zink: add \`descriptor_bindless_id` to
  \`zink_shader_info`
- zink: fix sampler array collision in \`nir_to_spirv`
- zink: don't emulate edgeflags for patches
- zink: use correct primitives for passthrough gs with tess
- zink: fix pv mode lowring index calculation
- zink: use ring buffer to preserve last element
- zink: fix exit condition on pv emulation loop
- zink: fix line strip offsets in pv mode emulation
- zink: fix store subsitution in \`lower_pv_mode_gs_store`
- zink: take location_frac into account in pv emulation

driver1998 (1):

- gallium: Use DETECT_OS_WINDOWS instead of 'WIN32'

osy (1):

- virgl: enable timer queries only if host supports it

t0b3 (1):

- nir/nir_opt_move: fix ALWAYS_INLINE compiler error

volodymyr.o (1):

- mesa ctx->API --> _mesa_is_foo(ctx)

xurui (2):

- panfrost: Check the return value of drmGetVersion
- zink: bs->dd.push_pool[1].pool should be freed