1Revision history for SPIRV-Tools 2 3v2024.2 2024-04-22 4 - General 5 - Add SPIRV_TOOLS_EXPORT to public C++ API (#5591) 6 - Use bazel 7 and bzlmod (#5601) 7 - Optimizer 8 - opt: add GroupNonUniformPartitionedNV capability to trim pass (#5648) 9 - Fix rebuilding types with circular references. (#5637) 10 - Add AliasedPointer decoration (#5635) 11 - add support for vulkan-shader-profiler external passes (#5512) 12 - Validator 13 - A fix to support of SPV_QCOM_image_processing2 (#5646) 14 - spirv-val: Add Vulkan check for Rect Dim in OpTypeImage (#5644) 15 - Validate duplicate decorations and execution modes (#5641) 16 - Validator: Support SPV_NV_raw_access_chains (#5568) 17 18v2024.1 2024-03-06 19 - General 20 - Add tooling support for SPV_KHR_maximal_reconvergence (#5542) 21 - Add support for SPV_KHR_float_controls2 (#5543) 22 - SPV_KHR_quad_control (#5547) 23 - Fold 64-bit int operations (#5561) 24 - update image enum tests to remove Kernel capability (#5562) 25 - Support operand kind for SPV_INTEL_maximum_registers (#5580) 26 - SPV_NV_shader_atomic_fp16_vector (#5581) 27 - Support for SPV_QCOM_image_processing2 (#5582) 28 - Fix access chain struct checks (#5592) 29 - Optimizer 30 - opt: add Int16 and Float16 to capability trim pass (#5519) 31 - Add preserver-interface option to spirv-opt (#5524) 32 - spirv-opt: Fix OpCompositeExtract relaxation with struct operands (#5536) 33 - opt: Add VulkanMemoryModelDeviceScope to trim (#5544) 34 - opt: Add TrimCapabilities pass to spirv-opt tool (#5545) 35 - Add modify-maximal-reconvergence to spirv-opt help (#5546) 36 - opt: add SPV_EXT_mesh_shader to opt allowlist (#5551) 37 - opt: Add OpEntryPoint to DescriptorScalarReplacement pass (#5553) 38 - opt: prevent meld to merge block with MaximalReconvergence (#5557) 39 - [OPT] Use new instruction folder for for all opcodes in spec consti folding (#5569) 40 - [OPT] Identify arrays with unknown length in copy prop arrays (#5570) 41 - [OPT] Add removed unused interface var pass to legalization passes (#5579) 42 - Validator 43 - spirv-val: Re-enable OpControlBarrier VU (#5527) 44 - spirv-val: Add Mesh Primitive Built-In validaiton (#5529) 45 - spirv-val: Validate PhysicalStorageBuffer Stage Interface (#5539) 46 - spirv-val: Multiple interface var with same SC (#5528) 47 - spirv-val: Revert Validate PhysicalStorageBuffer Stage Interface (#5575) 48 - spirv-val: Make Constant evaluation consistent (#5587) 49 50v2023.6 2023-12-18 51 - General 52 - update_build_version.py produce deterministic header. (#5426) 53 - Support missing git in update_build_version.py (#5473) 54 - Optimizer 55 - Add ComputeDerivativeGroup*NV capabilities to trim capabilities pass. (#5430) 56 - Do not crash when tryingto fold unsupported spec constant (#5496) 57 - instrument: Fix handling of gl_InvocationID (#5493) 58 - Fix nullptr argument in MarkInsertChain (#5465) 59 - opt: support 64-bit OpAccessChain index in FixStorageClass (#5446) 60 - opt: add StorageImageReadWithoutFormat to cap trim (#5475) 61 - opt: add PhysicalStorageBufferAddresses to trim (#5476) 62 - Fix array size calculation (#5463 63 - Validator 64 - spirv-val: Loosen restriction on base type of DebugTypePointer and DebugTypeQualifier (#5479) 65 - spirv-val: Add WorkgroupMemoryExplicitLayoutKHR check for Block (#5461) 66 67v2023.5 2023-10-15 68 - General 69 - Support 2 Intel extensions (#5357) 70 - SPV_QCOM_image_processing support (#5223) 71 - Optimizer 72 - opt: fix StorageInputOutput16 trimming. (#5359) 73 - opt: add StoragePushConstant16 to trim pass (#5366) 74 - opt: enable StorageUniform16 (#5371) 75 - opt: add bitmask support for capability trimming (#5372) 76 - opt: Add SwitchDescriptorSetPass (#5375) 77 - opt: add FragmentShader*InterlockEXT to capability trim pass (#5390) 78 - opt: add Int64 capability to trim pass (#5398) 79 - opt: add Float64 capability to trim pass (#5428) 80 - opt: add raytracing/rayquery to trim pass (#5397) 81 - opt: add ImageMSArray capability to trim pass. (#5395) 82 - Add SPV_KHR_physical_storage_buffer to allowlists (#5402) 83 - Add SPV_EXT_fragment_shader_interlock to allow lists (#5393) 84 - Make sure that fragment shader interlock instructions are not removed by DCE (#5400) 85 - instrument: Use Import linkage for instrumentation functions (#5355) 86 - Add a new legalization pass to dedupe invocation interlock instructions (#5409) 87 - instrument: Ensure linking works even of nothing is changed (#5419) 88 - Validator 89 - Move token version/cap/ext checks from parsing to validation (#5370) 90 - val: re-add ImageMSArray validation (#5394) 91 - Linker 92 - linker: Add --use-highest-version option 93 94v2023.4 2023-07-17 95 - General 96 - Set cmake_policy CMP0128 (#5341) 97 - Add python3 requirement for the script (#5326) 98 - Add support for LiteralFloat type (#5323) 99 - SPV_KHR_cooperative_matrix (#5286) 100 - Allow OpTypeBool in UniformConstant (#5237) 101 - Allow physical storage buffer pointer in IO (#5251) 102 - Remove const zero image operands (#5232) 103 - Optimizer 104 - Enable vector constant folding (#4913) (#5272) 105 - Fold negation of integer vectors (#5269) 106 - Add folding rule for OpTranspose (#5241) 107 - Add SPV_NV_bindless_texture to spirv optimizations (#5231) 108 - Fix incorrect half float conversion (#5349) 109 - Add SPV_EXT_shader_atomic_float_add to allow lists (#5348) 110 - Instrument 111 - instrument: Cast gl_VertexIndex and InstanceIndex to uint (#5319) 112 - instrument: Fix buffer address length calculations (#5257) 113 - instrument: Reduce number of inst_bindless_stream_write_6 calls (#5327) 114 - Validator 115 - Validate GroupNonUniform instructions (#5296) 116 - spirv-val: Label SPV_KHR_cooperative_matrix VUID (#5301) 117 - Validate layouts for PhysicalStorageBuffer pointers (#5291) 118 - spirv-val: Remove VUID from 1.3.251 spec (#5244) 119 - Diff 120 - spirv-diff: Update test expectations (#5264) 121 - spirv-diff: Leave undefined ids unpaired. (#5262) 122 - spirv-diff: Properly match SPV_KHR_ray_query types. (#5259) 123 - diff: Don't give up entry point matching too early. (#5224) 124 125v2023.3 2023-05-15 126 - General 127 - Update spirv_headers to include SPV_KHR_ray_tracing_position_fetch (#5205) 128 - spirv-tools: Add support for QNX (#5211) 129 - build: set std=c++17 for BUILD.gn (#5162) 130 - Optimizer 131 - Run ADCE when the printf extension is used. (#5215) 132 - Don't convert struct members to half (#5201) 133 - Apply scalar replacement on vars with Pointer decorations (#5208) 134 - opt: Fix null deref in OpMatrixTimesVector and OpVectorTimesMatrix (#5199) 135 - instrument: Add set and binding to bindless error records (#5204) 136 - instrument: Change descriptor state storage format (#5178) 137 - Fix LICMPass (#5087) 138 - Add Vulkan memory model to allow lists (#5173) 139 - Do not remove control barrier after spv1.3 (#5174) 140 - Validator 141 - spirv-val: Label Interface Location/Component VUIDs (#5221) 142 - Add support for SPV_EXT_shader_tile_image (#5188) 143 - Fix vector OpConstantComposite type validation (#5191) 144 - spirv-val: Label new Vulkan VUID 07951 (#5154) 145 - Fuzz 146 - Do not define GOOGLE_PROTOBUF_INTERNAL_DONATE_STEAL_INLINE if it is already defined. (#5200) 147 148v2023.2 2023-03-10 149 - General 150 - build: move from c++11 to c++17 (#4983) 151 - tools: refactorize tools flags parsing. (#5111) 152 - Add C interface for Optimizer (#5030) 153 - libspirv.cpp: adds c++ api for spvBinaryParse (#5109) 154 - build: change the way we set cxx version for bazel. (#5114) 155 - Optimizer 156 - Fix null pointer in FoldInsertWithConstants. (#5093) 157 - Fix removal of dependent non-semantic instructions (#5122) 158 - Remove duplicate lists of constant and type opcodes (#5106) 159 - opt: fix spirv ABI on Linux again. (#5113) 160 - Validator 161 - Validate decoration of structs with RuntimeArray (#5094) 162 - Validate operand type before operating on it (#5092) 163 - spirv-val: Conditional Branch without an exit is invalid in loop header (#5069) 164 - spirv-val: Initial SPV_EXT_mesh_shader builtins (#5080) 165 166v2023.1 2023-01-17 167 - General 168 - Renamed "master" to "main" (issue#5051) 169 - Validate version 5 of clspv reflection (#5050) 170 - Remove testing support for VS2015 (#5027) 171 - Fix undef behaviour in hex float parsing (#5025) 172 - Require C++11 *or later* (#5020) 173 - Instrument 174 - Instrument: Fix bindless checking for BufferDeviceAddress (#5049) 175 - Optimizer 176 - Optimize allocation of spvtools::opt::Instruction::operands_ (#5024) 177 - spirv-opt: Fix OpCompositeInsert with Null Constant (#5008) 178 - spirv-opt: Handle null CompositeInsert (#4998) 179 - Add option to ADCE to remove output variables from interface. (#4994) 180 - Add support for tesc, tese and geom to EliminateDead*Components (#4990) 181 - Add pass to eliminate dead output components (#4982) 182 - spirv-opt: Add const folding for CompositeInsert (#4943) 183 - Add passes to eliminate dead output stores (#4970) 184 - Prevent eliminating case constructs in block merging (#4976) 185 - Validator 186 - Fix layout validation (#5015) 187 - Fix use of invalid analysis (#5013) 188 - Fix infinite loop in validator (#5006) 189 - Add validation support for SPV_NV_shader_invocation_reorder. (#4979) 190 - Only validate full layout in Vulkan environments (#4972) 191 - spirv-val: Label new Vulkan OpPtrAccessChain VUs (#4975) 192 - spirv-val: Add OpPtrAccessChain Base checks (#4965) 193 194 195v2022.4 2022-10-12 196 - General 197 - Support Narrow Types in BitCast Folding Rule (#4941) 198 - spirv-diff: Allow no SpecId (#4904) 199 - build: cmake: Add support for GNU/Hurd (#4895) 200 - Implement tool changes for SPV_EXT_mesh_shader. (#4915) 201 - Validator 202 - spirv-val: Add SPV_ARM_core_builtins validation (#4958) 203 - spirv-val: Add an option to use friendly names or not (#4951) 204 - spirv-val: Consistently quote ids in messages (#4950) 205 - spirv-val: Add initial SPV_EXT_mesh_shader validation (#4924) 206 - spirv-val: Make it legal to use arrays of ray queries (#4938) 207 - spirv-val: Better message for using OpTypeBool in input/output (#4901) 208 - spirv-val: Add SPV_KHR_ray_tracing storage class (#4868) 209 - Optimizer 210 - spirv-opt: Fix stacked CompositeExtract constant folds (#4932) 211 - Improve time to build dominators (#4916) 212 - Fix ADCE to mark scope and inlined_at of line instructions as live. (#4910) 213 - Improve algorithm to reorder blocks in a function (#4911) 214 - Add structs to eliminate dead input components (#4894) 215 - spirv-opt: fix copy-propagate-arrays index opti on structs. (#4891) 216 - Fix ADCE to not eliminate top level DebugInfo instructions (#4889) 217 - Fix array copy propagation (#4890) 218 219v2022.3 2022-08-08 220 - General 221 - Add SPV_KHR_fragment_shader_barycentric support (#4805) 222 - Add support for SPV_KHR_subgroup_rotate (#4786) 223 - use exec_tools instead of tools for better RBE compatibility (#4837) 224 - Write binary files to stdout in binary on windows. (#4834) 225 - Allow spirv-opt print-all to show pretty IDs (#4888) 226 - Validator 227 - spirv-val: Add PerVertexKHR (#4807) 228 - spirv-opt : Add FixFuncCallArgumentsPass (#4775) 229 - spirv-val: Add CullMaskKHR support (#4792) 230 - Require ColMajor or RowMajor for matrices (#4878) 231 - spirv-val: Add SPV_KHR_ray_query (#4848) 232 - spirv-val: Add SPV_KHR_ray_tracing instructions (#4871) 233 - Implement SPV_NV_bindless_texture related changes (#4847) 234 - spirv-val: Add OpConvertUToAccelerationStructureKHR (#4838) 235 - spirv-val: Add support for SPV_AMD_shader_early_and_late_fragment_tests (#4812) 236 - Optimizer 237 - Fold multiply and subtraction into FMA with negation (#4808) 238 - Add more folding for composite instructions (#4802) 239 - spirv-opt: add pass for interface variable scalar replacement (#4779) 240 - Don't try to unroll loop with step count 0. (#4769) 241 - spirv-opt: SPV_NV_bindless_texture related changes (#4870) 242 - Linker 243 - linker: Recalculate interface variables (#4784) 244 245v2022.2 2022-04-07 246 - General 247 - Add OpModuleProcessed to debug opcode (#4694) 248 - Optimizer 249 - Complete handling of RayQueryKHR type (#4690) 250 - Have scalar replacement use undef instead of null (#4691) 251 - Optimize Instruction::Instruction (#4705) 252 - Handle propagation of arrays with decorations (#4717) 253 - spirv-opt: Add OpExecutionModeId support (#4719) 254 - Optimize Type::HashValue (#4707) 255 - Optimize DefUseManager allocations (#4709) 256 - Add pass to remove DontInline function control (#4747) 257 - Better handling of 0xFFFFFFFF when folding vector shuffle (#4743) 258 - Reset the id bound on the module in compact ids (#4744) 259 - spirv-opt: (WIP) Eliminate Dead Input Component Pass (#4720) 260 - Support SPV_KHR_uniform_group_instructions (#4734) 261 - Handle shaders without execution model in spread-volatile-semantics (#4766) 262 - Validator 263 - Fix handling of Nontemporal image operand (#4692) 264 - [spirv-val] Allow 0 Component Count for DebugTypeArray for Shader (#4706) 265 - spirv-val: Validate DebugTypeMatrix (#4732) 266 - spirv-val: Label Vulkan VUID 04734 (#4739) 267 - spirv-val: Label VUID 06491 (#4745) 268 - spirv-val: Disallow array of push constants (#4742) 269 - spirv-val: Label Vulkan RuntimeArray VUID (#4749) 270 - spirv-val: Add Vulkan Image VUID 06214 (#4750) 271 - spirv-val: Add Vulkan Dref not allowed 3D dim VUID (#4751) 272 - spirv-val: Label and add test for PSB Aligned (#4756) 273 - spirv-val: Add Vulkan 32-bit bit op Base (#4758) 274 - spirv-val: Add more Vulkan VUID labels (#4764) 275 - Diff 276 - Introduce spirv-diff (#4611) 277 - Stabilize the output of spirv-diff (#4698) 278 - spirv-diff: Handle OpSpecConstant array sizes (#4700) 279 - spirv-diff: Match OpSpecConstantComposite correctly (#4704) 280 - spirv-diff: Use GetSingleWord*Operand (#4768) 281 - spirv-diff: Basic support for OpTypeForwardPointer (#4761) 282 - spirv-diff: Fix OpTypeFunction matching w.r.t operand count (#4771) 283 284v2022.1 2022-01-26 285 - General 286 - Add SPIR-V 1.6 support to wasm build (#4674) 287 - Improvements to disassembly within PassManager (#4677) 288 - Basic support for SPIR-V 1.6 (#4663) 289 - reflect debug (#4662) 290 - Fix endianness of string literals (#4622) 291 - Optimizer 292 - spirv-opt: add pass to Spread Volatile semantics (#4667) 293 - Fix constant propagation and folding of FClamp instructions (#4651) 294 - Manually fold floating point division by zero (#4637) 295 - Allow ADCE to remove dead inputs (#4629) 296 - Linker 297 - Linker improvements (#4679) 298 * test/linker: Code factorisation and small tweaks 299 * linker: Do not fail when going over limits 300 - Validator 301 - val: interface struct with builtins must be Block (#4665) 302 - Fuzzer 303 - Avoid id bound errors during opt fuzzing (#4658) 304 - Avoid uninitialised read when parsing hex float (#4646) 305 306v2021.4 2021-11-11 307 - General 308 - Add a WebAssembly build (#3752) 309 - Make cxx exceptions controllable (#4591) 310 - Validator 311 - Improve decoration validation (#4490) 312 - Optimizer 313 - Add spirv-opt pass to replace descriptor accesses based on variable indices (#4574) 314 - Do not fold snegate feeding sdiv (#4600) 315 - Handle overflowing id in merge return (#4606) 316 - Fuzzer 317 - Add libFuzzer target for spirv-fuzz (#4434) 318 - Linter 319 320v2021.3 2021-08-24 321 - General 322 - Initial support for SPV_KHR_integer_dot_product (#4327) 323 - Add non-semantic vulkan extended instruction set (#4362) 324 - Add common enum for debug info instructions from either opencl or vulkan (#4377) 325 - Validator 326 - Add validation for SPV_EXT_shader_atomic_float16_add (#4325) 327 - Disallow loading a runtime-sized array (#4473) 328 - spirv-val: Validate vulkan debug info similarly to opencl debug info (#4466) 329 - Optimizer 330 - spirv-opt: support SPV_EXT_shader_image_int64 (#4379) 331 - spirv-opt: Add dataflow analysis framework (#4402) 332 - Add control dependence analysis to opt (#4380) 333 - Add spirv-opt convert-to-sampled-image pass (#4340) 334 - spirv-opt: Add handling of vulkan debug info to DebugInfoManager (#4423) 335 - Fuzz 336 - spirv-fuzz: support AtomicLoad (#4330) 337 - spirv-fuzz: Support AtomicStore (#4440) 338 - spirv-fuzz: TransformationWrapVectorSynonym that rewrites scalar operations using vectors (#4376) 339 - spirv-fuzz: Add minimal SPIR-V example to test shaders (#4415) 340 - spirv-fuzz: support building using gn (#4365) 341 - Linter 342 - Add new target for spirv-lint (#4446) 343 - spirv-lint: add basic CLI argument handling (#4478) 344 - Add divergence analysis to linter (#4465) 345 346v2021.2 2021-06-18 347 - General 348 - Support SPV_KHR_subgroup_uniform_control_flow (#4318) 349 - Support Intel extensions for fixed point and hls-float (#4321) 350 - Fix crash when optimizing shaders with DebugPrintf (#4280) 351 352 - Validator 353 - Support Vulkan Storage Class for Execution Model (#4212) 354 355 - Optimizer 356 - Handle SPV_KHR_vulkan_memory_model in dead-code elimination (#4320) 357 - Support folding OpBitcast with numeric constants (#4247) 358 359 - Fuzz 360 - Add tests for MaybeGet* functions in fuzzerutil (#4284) 361 - Fix OutlineFunction in presence of unreachable blocks (#4308) 362 - Fix def-use update in PermutePhiOperands (#4309) 363 - Swap positions of two functions in a module (#4236) 364 365v2021.1 2021-04-19 366 - General 367 - Support SPV_KHR_linkonce_odr, SPV_KHR_expect_assume (#4161) 368 - Fixes for the vscode language server extension (#4150) 369 - Validator 370 - Add validation for SPV_EXT_shader_atomic_float_min_max (#4105) 371 - Add Vulkan Execution Scope checks (#4183) 372 - Vulkan 64-bit OpAtomicStore check (#4163) 373 - Optimizer 374 - Add interpolate legalization pass (#4220) 375 - Fuzz 376 - Various performance optimizations 377 - Do not add too many dead blocks (#4217) 378 - Add WGSL compatibility flag to context (#4193) 379 - Add persistent state to the fuzzer (#4137) 380 381v2020.7 2021-02-16 382 - General 383 - Support pending Intel extensions (#4116) 384 - Remove WebGPU support (#4108) 385 - Validator 386 - Vulkan image gather constant component (#4133) 387 - Add Vulkan PSB64 convert VUID (#4122) 388 - Validate SPV_KHR_workgroup_memory_explicit_layout (#4128) 389 - Validate VK_KHR_zero_initialize_workgroup_memory (#4124) 390 - Add Vulkan image gather offset VUID (#4118) 391 - Label Vulkan atomic semantics VUIDs (#4120) 392 - Label VUID 04662 (#4123) 393 - Label VUID 04683 (#4121) 394 - Add Vulkan EXT builtins (#4115) 395 - Validate Sampled=1 for Vulkan ImageQuerySizeLod, ImageQueryLevels, ImageQueryLod (#4103) 396 - Add Vulkan Memory Scope VUs (#4106) 397 - Add Vulkan Addressing Model check (#4107) 398 - Vulkan atomic storage class (#4079) 399 - Label standalone Vulkan VUID (#4091) 400 - Add Vulkan decroation VUID (#4090) 401 - Add Vulkan FP Mode VUID (#4088) 402 - Fix Vulkan image sampled check (#4085) 403 - Add Vulkan ForwardPointer VUID (#4089) 404 - Add Vulkan ImageTexelPointer format check (#4087) 405 - Add Vulkan Group Operation VUID (#4086) 406 - Add first StandAlone VUID 04633 (#4077) 407 - Add Subgroup VUIDs (#4074) 408 - validate return type of OpImageRead (#4072) 409 - tighter validation of multisampled images (#4059) 410 - validate OpTypeImage Sampled values for environemnts (#4064) 411 - validate StorageImageMultisampled capability (#4062) 412 - Add last TessLevelOuter and TessLevelInner VUID (#4055) 413 - Add last ClipDistance and CullDistance VUID (#4054) 414 - Add last ViewportIndex and Layer VUID (#4053) 415 - Add last Position VUID (#4052) 416 - Allow forward pointer to be used in types generally (#4044) 417 - Optimizer 418 - Mark module as modified if convert-to-half removes decorations (#4127) 419 - Fix binding number calculation in desc sroa (#4095) 420 - Run DCE when SPV_KHR_shader_clock is used (#4049) 421 - Debug Info 422 - Set correct scope and line info for DebugValue (#4125) 423 - Avoid integrity check failures caused by propagating line instructions (#4096) 424 - Linker 425 - Linker usability improvements (#4084) 426 - Instrumentation 427 - Generate differentiated error codes for buffer oob checking (#4097) 428 - Fuzz 429 - Fix OpPhi handling in DuplicateRegionWithSelection (#4065) 430 431v2020.6 2020-12-07 432 - General 433 CMake: Add SPIRV_TOOLS_BUILD_STATIC flag (#3910) 434 - Disassembler 435 Add some context comments to disassembly. (#3847) 436 - Optimizer 437 - Take new (raytracing) termination instructions into account. (#4050) 438 - Do run DCE if SPV_KHR_ray_query is used. (#4047) 439 - Handle 8-bit index in elim dead member (#4043) 440 - Add texel buffer out-of-bounds checking instrumentation (#4038) 441 - Update MeshShadingNV dependencies (and land Ray tracing updates) (#4028) 442 - Fix buffer oob instrumentation for matrix refs (#4025) 443 - Fix SSA re-writing in the presence of variable pointers. (#4010) 444 - Add support to prevent functions from being inlined if they have 445 DontInline flag (#3858) 446 - Add SPV_EXT_shader_image_int64 (#3852) 447 - Support SPV_KHR_fragment_shading_rate (#3943) 448 - Fix use-after-move in val/validate.cpp (#3848) 449 - Debug Info 450 - properly preserve DebugValue indexes operand (#4022) 451 - Add DebugValue for invisible store in single_store_elim (#4002) 452 - Propagate OpLine to all applied instructions in spirv-opt (#3951) 453 - Add DebugValue for DebugDecl invisible to value assignment (#3973) 454 - Add DebugValue for function param regardless of scope (#3923) 455 - Debug info preservation in convert-local-access-chains pass (#3835) 456 - Debug info preservation in redundancy-elimination pass (#3839) 457 - Debug info preservation in if-conversion pass (#3861) 458 - Validator 459 - Add validation support for the ray tracing built-in variables (#4041) 460 - Use less stack space when validating Vulkan builtins (#4019) 461 - Fix SPV_KHR_fragment_shading_rate VUID label (#4014) 462 - Label Layer and ViewportIndex VUIDs (#4013) 463 - Allow the ViewportIndex and Layer built-ins on SPIR-V 1.5 (#3986) 464 - Fix validation of OpPhi instructions (#3919) 465 - Fuzz 466 - Fix facts arising from CompositeConstruct (#4034) 467 - Do not flatten conditionals that create synonyms (#4030) 468 - Add support for reining in rogue fuzzer passes (#3987) 469 - Fix assertion failure in FuzzerPassAddCompositeExtract (#3995) 470 - Fix invalid equation facts (#4009) 471 - Fix bugs in TransformationFlattenConditionalBranch (#4006) 472 - Fix bug related to transformation applicability (#3990) 473 - Add expand vector reduction transformation (#3869) 474 - Add FuzzerPassAddCompositeExtract (#3904) 475 - Fix mismatch with shrinker step limit (#3985) 476 - Fix off-by-one error in replayer (#3982) 477 - Get order right for OpSelect arguments (#3974) 478 - Do not add synonym-creating loops in dead blocks (#3975) 479 - Skip OpTypeSampledImage when propagating up (#3976) 480 - Pass OpUndef in function call if needed (#3978) 481 - Fix off-by-one in TransformationCompositeConstruct (#3979) 482 - Tolerate absent ids in data synonym fact management (#3966) 483 - Fix to id availability (#3971) 484 - Fix operand types (#3962) 485 - Don't flatten conditional if condition is irrelevant (#3944) 486 - Do not produce OpPhis of type OpTypeSampledImage (#3964) 487 - Restrict fuzzer pass to reachable blocks (#3970) 488 - Handle more types when extending OpPhi instructions (#3969) 489 - Skip early terminator wrappers when merging returns (#3968) 490 - Avoid irrelevant constants in synonym-creating loops (#3967) 491 - Skip dead blocks in FuzzerPassAddOpPhiSynonyms (#3965) 492 - Avoid the type manager when looking for struct types (#3963) 493 - Fix to TransformationDuplicateRegionWithSelection (#3941) 494 - Skip OpFunction when replacing irrelevant ids (#3932) 495 - Use component-wise selectors when flattening conditional branches (#3921) 496 - Avoid void struct member when outlining functions (#3936) 497 - Do not allow Block-decorated structs when adding parameters (#3931) 498 - Fix to operand id type (#3937) 499 - Handle dead blocks in TransformationEquationInstruction (#3933) 500 - Do not allow sampled image load when flattening conditionals (#3930) 501 - Take care of OpPhi instructions when inlining (#3939) 502 - Fix to TransformationInlineFunction (#3913) 503 - Wrap early terminators before merging returns (#3925) 504 - Lower probability of adding bit instruction synonyms (#3917) 505 - Fix handling of OpPhi in FlattenConditionalBranch (#3916) 506 - Avoid creating blocks without parents (#3908) 507 - Do not allow creation of constants of block-decorated structs (#3903) 508 - Fixes related to irrelevant ids (#3901) 509 - Fix to transformation that adds a synonym via a loop (#3898) 510 - Fix to duplicate region with selection (#3896) 511 - Do not expose synonym facts for non-existent ids (#3891) 512 - Do not add synonyms involving irrelevant ids (#3890) 513 - Do not replace irrelevant ids that are not in blocks (#3892) 514 - Wrap OpKill and similar in function calls (#3884) 515 - Integrate spirv-reduce with shrinker (#3849) 516 - Report fresh ids in transformations (#3856) 517 - Support OpNot bit instruction case (#3841) 518 - Return IR and transformation context after replay (#3846) 519 520v2020.5 2020-09-22 521 - General 522 - Enable building with BUILD_SHARED_LIBS=1 (#3490) 523 - Avoid using /MP4 for clang on windows. (#3662) 524 - Fix compiler error on macOS with XCode12. (#3836) 525 - Optimizer 526 - Preserve OpenCL.DebugInfo.100 through private-to-local pass (#3571) 527 - Preserve debug info in scalar replacement pass (#3461) 528 - Debug info preservation in loop-unroll pass (#3548) 529 - Preserve debug info in dead-insert-elim pass (#3652) 530 - Improve non-semantic instruction handling in the optimizer (#3693) 531 - Let ADCE pass check DebugScope (#3703) 532 - Add undef for inlined void function (#3720) 533 - Fix SSA-rewrite to remove DebugDeclare for variables without loads (#3719) 534 - Handle DebugScope in compact-ids pass (#3724) 535 - Add buffer oob check to bindless instrumentation (#3800) 536 - Validator 537 - Update OpenCL capabilities validation (#3149) 538 - Validator support for non-semantic clspv reflection (#3618) 539 - OpenCL.DebugInfo.100 DebugTypeArray with variable size (#3549) 540 - Only validation locations for appropriate execution models (#3656) 541 - Validate more OpenCL.DebugInfo.100 instructions (#3684) 542 - Allow DebugTypeTemplate for Type operand (#3702) 543 - spirv-val: Add Vulkan VUID labels to BuiltIn (#3756) 544 - Allow SPV_KHR_8bit_storage extension. (#3780) 545 - Validate SPIRV Version number when parsing binary header (#3834) 546 - Reduce 547 - Support reducing a specific function (#3774) 548 - Fuzz 549 - adds TransformationReplaceCopyObjectWithStoreLoad (#3567) 550 - adds TransformationReplaceCopyMemoryWithLoadStore (#3575) 551 - adds TransformationReplaceLoadStoreWithCopyMemory (#3586) 552 - Implement the OpOuterProduct linear algebra case (#3617) 553 - Pass to replace int operands with ints of opposite signedness (#3612) 554 - TransformationMoveInstructionDown (#3477) 555 - Add TransformationMakeVectorOperationDynamic (#3597) 556 - TransformationReplaceAddSubMulWithCarryingExtended (#3598) 557 - FuzzerPassPropagateInstructionsUp (#3478) 558 - add FuzzerPassAddCompositeInserts (#3606) 559 - Add inline function transformation (#3517) 560 - Transformation to replace the use of an irrelevant id (#3697) 561 - Add SPIRV_FUZZ_PROTOC_COMMAND (#3789) 562 - Add TransformationDuplicateRegionWithSelection (#3773) 563 - Transformation to flatten conditional branch (#3667) 564 - Handle OpPhis in TransformationInlineFunction (#3833) 565 - Create synonym of int constant using a loop (#3790) 566 - Support dead blocks in TransformationAddSynonym (#3832) 567 - Linker 568 569v2020.4 2020-07-22 570 - General 571 - Changed variable names to be more descriptive (#3433) 572 - Add support to GPU-AV instrumentation for Task and Mesh shaders (#3512) 573 - Permit Simple and GLSL450 memory model in WEBGPU_0 (#3463) 574 - Support SPV_KHR_terminate_invocation (#3568) 575 - Optimizer 576 - Preserving debug information in optimizations 577 (#3389,#3420,#3425,#3356,#3459,#3444,#3492,#3451,#3497i,#3498,#3542) 578 - Eliminate branches with condition of OpConstantNull (#3438) 579 - Use structured order to unroll loops. (#3443) 580 - Updated desc_sroa to support flattening structures (#3448) 581 - Support OpCompositeExtract pattern in desc_sroa (#3456) 582 - Fix ADCE pass bug for mulitple entries (#3470) 583 - Sink pointer instructions in merge return (#3569) 584 - Validator 585 - Validate location assignments (#3308) 586 - Fix reachability in the validator (#3541) 587 - Reduce 588 - Fuzz 589 - Add support for OpSpecConstant* (#3373) 590 - Add replace linear algebra instruction transformation (#3402) 591 - Implement vector shuffle fuzzer pass (#3412) 592 - Swap operands in OpBranchConditional (#3423) 593 - Permute OpPhi instruction operands (#3421) 594 - Add FuzzerPassAddCopyMemoryInstructions (#3391) 595 - TransformationInvertComparisonOperator (#3475) 596 - Add variables with workgroup storage class (#3485) 597 - Add image sample unused components transformation (#3439) 598 - TransformationReplaceParameterWithGlobal (#3434) 599 - Support adding dead break from back-edge block (#3519) 600 - Fuzzer pass to interchange zero-like constants (#3524) 601 - Linker 602 603v2020.3 2020-05-27 604 - General 605 - Prevent Effcee from installing things when building spirv-tools with testing enabled (#3256) 606 - Update acorn version (#3294) 607 - If SPIRV-Headers is in our tree, include it as subproject (#3299) 608 - allow cross compiling for Windows Store, UWP, etc. (#3330) 609 - Optimizer 610 - Remove deprecated interfaces from instrument passes (#3361) 611 - Preserve debug info in inline pass (#3349) 612 - Handle more cases in dead member elim (#3289) 613 - Preserve debug info in eliminate-dead-functions (#3251) 614 - Fix Struct CFG analysis for single block loop (#3293) 615 - Add tests for recently added command line option (#3297) 616 - Consider sampled images as read-only storage (#3295) 617 - Allow various validation options to be passed to spirv-opt (#3314) 618 - Add debug information analysis (#3305) 619 - Preserve debug info for wrap-opkill (#3331) 620 - refactor inlining pass (#3328) 621 - Add unrolling to performance passes (#3082) 622 - Validator 623 - Add validation support for ImageGatherBiasLodAMD (#3363) 624 - Validate ShaderCallKHR memory scope (#3332) 625 - Validate Buffer and BufferBlock apply only to struct types (#3259) 626 - Reduce 627 - increase default step limit (#3327) 628 - Remove unused uniforms and similar (#3321) 629 - Fuzz 630 - Add support for StorageBuffer (#3348) 631 - Add validator options (#3254) 632 - Limit adding of new variables to 'basic' types (#3257) 633 - Transformation to add OpConstantNull (#3273) 634 - Handling of more fuzzing opportunities (#3277, #3280, #3281, #3290, #3292) 635 - Respect rules for OpSampledImage (#3287) 636 - Do not outline regions that produce pointer outputs (#3291) 637 - Linker 638 639v2020.2 2020-03-26 640 - General: 641 - Support extended instructions in the vscode language server 642 - Make spvOpcodeString part of the public API (#3174) 643 - Added guide to writing a spirv-fuzz fuzzer pass (#3190) 644 - Add support for KHR_ray_{query,tracing} extensions (#3235) 645 - Optimizer 646 - Debug Printf support (#3215) 647 - Add data structure for DebugScope, DebugDeclare in spirv-opt (#3183) 648 - Fix identification of Vulkan images and buffers (#3253) 649 - Validator 650 - Add support for SPV_AMD_shader_image_load_store_lod (#3186) 651 - Add validation rules for OpenCL.DebugInfo.100 extension (#3133) 652 - Adding WebGPU specific Workgroup scope rule (#3204) 653 - Disallow phis of images, samplers and sampled images (#3246) 654 - Reduce 655 - Fuzz 656 - Fuzzer passes to add local and global variables (#3175) 657 - Add fuzzer passes to add loads/stores (#3176) 658 - Fuzzer pass to add function calls (#3178) 659 - Fuzzer pass that adds access chains (#3182) 660 - Fuzzer pass to add equation instructions (#3202) 661 - Add swap commutable operands transformation (#3205) 662 - Add fuzzer pass to permute function parameters (#3212) 663 - Allow OpPhi operand to be replaced with a composite synonym (#3221) 664 - Linker 665 666v2020.1 2020-02-03 667 - General: 668 - Add support for SPV_KHR_non_semantic_info (#3110) 669 - Support OpenCL.DebugInfo.100 extended instruction set (#3080) 670 - Added support for Vulkan 1.2 671 - Add API function to better handle getting the necessary environment (#3142) 672 - Clarify mapping of target env to SPIR-V version (#3150) 673 - Implement constant folding for many transcendentals (#3166) 674 - Optimizer 675 - Change default version for CreatInstBindlessCheckPass to 2 (#3096, #3119) 676 - Better handling of OpLine on merge blocks (#3130) 677 - Use placeholder switch instead of placeholder loop in MergeReturn pass. (#3151) 678 - Handle TimeAMD in AmdExtensionToKhrPass. (#3168) 679 - Validator 680 - Fix structured exit validation (#3141) 681 - Reduce 682 - Fuzz 683 - Fuzzer pass to merge blocks (#3097) 684 - Transformation to add a new function to a module (#3114) 685 - Add fuzzer pass to perform module donation (#3117) 686 - Fuzzer passes to create and branch to new dead blocks (#3135) 687 - Fuzzer pass to add composite types (#3171) 688 - Linker: 689 - Remove names and decorations of imported symbols (#3081) 690 691v2019.5 2019-12-11 692 - General: 693 - Export SPIRV-Tools targets on installation 694 - SPIRV-Tools support for SPIR-V 1.5 (#2865) 695 - Add WebGPU SPIR-V Assembler in JavaScript. (#2876) 696 - Add Bazel build configuration. (#2891) 697 - Add support for building with emscripten (#2948) 698 - Update SPIR-V binary header test for SPIR-V 1.5 (#2967) 699 - Add fuzzer for spirv-as call path (#2976) 700 - Improved CMake install step. (#2963) 701 - Add fuzzer for spirv-dis call path (#2977) 702 - Ensure timestamp does not vary with timezone. (#2982) 703 - Add a vscode extension for SPIR-V disassembly files (#2987) 704 - Add iOS as a supported platform (#3001) 705 - utils/vscode: Add SPIR-V language server support 706 - Respect CMAKE_INSTALL_LIBDIR in installed CMake files (#3054) 707 - Permit the debug instructions in WebGPU SPIR-V (#3063) 708 - Add support for Fuchsia. (#3062) 709 - Optimizer 710 - Add descriptor array scalar replacement (#2742) 711 - Add pass to wrap OpKill in a function call (#2790) 712 - Fold FMix during constant folding. (#2818) 713 - Add pass to replace AMD shader ballot extension (#2811) 714 - Add pass to make Float32 operation relax precision (#2808) 715 - Add pass to make relax precision operation Float16 (#2808) 716 - Add pass to replace uses of 3 AMD extensions (#2814) 717 - Fold Min, Max, and Clamp instructions. (#2836) 718 - Better handling of OpKill in continues (#2842,#2922,#2933) 719 - Enable OpTypeCooperativeMatrix specialization (#2927) 720 - Support constant-folding UConvert and SConvert (#2960) 721 - Update Offset to ConstOffset bitmask if operand is constant. (#3024) 722 - Improve RegisterSizePasses (#3059) 723 - Folding: perform add and sub on mismatched integer types (#3084) 724 - Graphics robust access: use signed clamp (#3073) 725 Fixes: 726 - Instrument: Fix version 2 output record write for tess eval shaders. (#2782) 727 - Instrument: Add support for Buffer Device Address extension (#2792) 728 - Fix check for changed binary in API call. (#2798) 729 - For WebGPU<->Vulkan optimization, set correct execution environment (#2834) 730 - Handle OpConstantNull in copy-prop-arrays. (#2870) 731 - Use OpReturn* in wrap-opkill (#2886) 732 - Validator 733 - Add generic builtin validation of target (#2843) 734 - Extra resource interface validation (#2864) 735 - Adding valilidation checks for OpEntryPoint duplicate names and execution mode (#2862) 736 - Relaxed bitcast with pointers (#2878) 737 - Validate physical storage buffer restrictions (#2930) 738 - Add SPV_KHR_shader_clock validation (#2879, #3013) 739 - Validate that selections are structured (#2962) 740 - Disallow use of OpCompositeExtract/OpCompositeInsert with no indices (#2980) 741 - Check that derivatives operate on 32-bit values (#2983) 742 - Validate array stride does not cause overlap (#3028) 743 - Validate nested constructs (#3068) 744 Fixes: 745 - Fix validation of constant matrices (#2794) 746 - Update "remquor" validation 747 - Only allow previously declared forward refs in structs (#2920) 748 - Reduce 749 - Remove relaxed precision decorations (#2797) 750 - Reduce/fuzz: improve command line args (#2932) 751 - Improve remove unref instr pass (#2945) 752 Fixes: 753 - Fuzz 754 - Fix add-dead-break and add-dead-continue passes to respect dominance (#2838) 755 - Add fuzzer pass to copy objects (#2853) 756 - Add fuzzer pass to replace ids with synonyms (#2857) 757 - Allow validation during spirv-fuzz replay (#2873) 758 - Employ the "swarm testing" idea in spirv-fuzz (#2890) 759 - reduce/fuzz: improve command line args (#2932) 760 - option to convert shader into a form that renders red (#2934) 761 - Add fuzzer pass to change selection controls (#2944) 762 - add transformation and pass to construct composites (#2941) 763 - Add fuzzer pass to change loop controls (#2949) 764 - Add fuzzer pass to change function controls (#2951) 765 - Add fuzzer pass to add NoContraction decorations (#2950) 766 - Add missing functionality for matrix composites (#2974) 767 - Fuzzer pass to adjust memory access operands (#2968) 768 - Transformation to extract from a composite object (#2991) 769 - Vector shuffle transformation (#3015) 770 - Improve debugging facilities (#3074) 771 - Function outlining fuzzer pass (#3078) 772 773 774v2019.4 2019-08-08 775 - General: 776 - Memory model support for SPIR-V 1.4 777 - Add new spirv-fuzz tool 778 - Add option for base branch in check_code_format.sh 779 - Removed MarkV and Stats code. (#2576) 780 - Instrument: Add version 2 of record formats (#2630) 781 - Linker: Better type comparison for OpTypeArray and OpTypeForwardPointer (#2580) 782 - Optimizer 783 - Bindless Validation: Instrument descriptor-based loads and stores (#2583) 784 - Better folding for OpSpecConstantOp (#2585, #2614) 785 - Add in individual flags for Vulkan <-> WebGPU passes (#2615) 786 - Handle nested breaks from switches. (#2624) 787 - Optimizer: Handle array type with OpSpecConstantOp length (#2652) 788 - Perform merge return with single return in loop. (#2714) 789 - Add --preserve-bindings and --preserve-spec-constants (#2693) 790 - Remove Common Uniform Elimination Pass (#2731) 791 - Allow ray tracing shaders in inst bindle check pass. (#2733) 792 - Add pass to inject code for robust-buffer-access semantics (#2771) 793 - Treat access chain indexes as signed in SROA (#2776) 794 - Handle RelaxedPrecision in SROA (#2788) 795 - Add descriptor array scalar replacement (#2742) 796 Fixes: 797 - Handle decorations better in some optimizations (#2716) 798 - Change the order branches are simplified in dead branch elim (#2728) 799 - Fix bug in merge return (#2734) 800 - SSA rewriter: Don't use trivial phis (#2757) 801 - Record correct dominators in merge return (#2760) 802 - Process OpDecorateId in ADCE (#2761) 803 - Fix check for unreachable blocks in merge-return (#2762) 804 - Handle out-of-bounds scalar replacements. (#2767) 805 - Don't move debug or decorations when folding (#2772) 806 - Protect against out-of-bounds references when folding OpCompositeExtract (#2774) 807 - Validator 808 - Validate loop merge (#2579) 809 - Validate construct exits (#2459) 810 - Validate OpenCL memory and addressing model environment rules (#2589) 811 - Validate OpenCL environment rules for OpTypeImage (#2606) 812 - Allow breaks to switch merge from nested construct (#2604) 813 - Validate OpenCL environment rules for OpImageWrite (#2619) 814 - Allow arrays of out per-primitive builtins for mesh shaders (#2617) 815 - Validate OpenCL rules for ImageRead and OpImageSampleExplicitLod (#2643) 816 - Add validation for SPV_EXT_fragment_shader_interlock (#2650) 817 - Add builtin validation for SPV_NV_shader_sm_builtins (#2656) 818 - Add validation for Subgroup builtins (#2637) 819 - Validate variable initializer type (#2668) 820 - Disallow stores to UBOs (#2651)A 821 - Validate Volatile memory semantics bit (#2672) 822 - Basic validation for Component decorations (#2679) 823 - Validate that in OpenGL env block variables have Binding (#2685) 824 - Validate usage of 8- and 16-bit types with only storage capabilities (#2704) 825 - Add validation for SPV_EXT_demote_to_helper_invocation (#2707) 826 - Extra small storage validation (#2732) 827 - For Vulkan, disallow structures containing opaque types (#2546) 828 - Validate storage class OpenCL environment rules for atomics (#2750) 829 - Update OpControlBarriers rules for WebGPU (#2769) 830 - Update OpMemoryBarriers rules for WebGPU (#2775) 831 - Update WebGPU validation rules of OpAtomic*s (#2777) 832 Fixes: 833 - Disallow merge targeting block with OpLoopMerge (#2610) 834 - Update vloadn and vstoren validation to match the OpenCL Extended 835 Instruction Set Specification (#2599) 836 - Update memory scope rules for WebGPU (#2725) 837 - Allow LOD ops in compute shaders with derivative group execution modes (#2752) 838 - Reduce 839 Fixes: 840 841v2019.3 2019-05-14 842 - General: 843 - Require Python 3 since Python 2 will out of service soon. 844 - Add a continuous test that does memory checks using the address sanitizer. 845 - Fix the build files so the SPIRV_USE_SANITIZER=address build works. 846 - Packaging top of tree build artifacts again. 847 - Added support for SPIR-V 1.4. (#2550) 848 - Optimizer 849 - Remove duplicates from list of interface IDs in OpEntryPoint instruction (#2449) 850 - Bindless Validation: Descriptor Initialization Check (#2419) 851 - Add option to validate after each pass (#2462) 852 - Add legalization pass to fix mismatched pointer (#2430, #2535) 853 - Add error messages when the input contains unknown instructions. (#2487) 854 - Add pass to convert from WebGPU Spir-V to Vulkan Spir-V and back. (#2495) 855 Fixes: 856 - #2412: Dead memeber elimination should not change input and output variables. 857 - #2405: Fix OpDot folding of half float vectors. 858 - #2391: Dead branch elim should not fold away back edges. 859 - #2441: Removing decorations when doing constant propagation. 860 - #2455: Maintain inst to block mapping in merge return. 861 - #2453: Fix merge return in the face of breaks. 862 - #2456: Handle dead infinite loops in DCE. 863 - #2458: Handle variable pointer in some optimizations. 864 - #2452: Fix dead branch elimination to handle unreachable blocks better. 865 - #2528: Fix undefined bit shift in sroa. 866 - #2539: Change implementation of post order CFG traversal. 867 - Validator 868 - Add validation of storage classes for WebGPU (#2446) 869 - Add validation for ExecutionMode in WebGPU (#2443) 870 - Implement WebGPU specific CFG validation (#2386) 871 - Allow NonWritable to target struct members. (#2420) 872 - Allow storage type mismatch for parameter in relaxed addressing mode. 873 - Allow non memory objects as parameter in relaxed addressing mode. 874 - Disallow nested Blocks and buffer blocks (#2410). 875 - Add validation for SPV_NV_cooperative_matrix (#2404) 876 - Add --strip-atomic-counter-memory (#2413) 877 - Check OpSampledImage is only passed into valid instructions (#2467) 878 - Handle function decls in Structured CFG analysis (#2474) 879 - Validate that OpUnreacahble is not statically reachable (#2473) 880 - Add pass to generate needed initializers for WebGPU (#2481) 881 - Allow images without format for OpenCL. (#2470) 882 - Remove unreachable block validation (#2525) 883 - Reduce runtime of array layout checks (#2534) 884 - Add validation specific to OpExecutionModeId (#2536) 885 - Validate sign of int types. (#2549) 886 - VK_KHR_uniform_buffer_standard_layout validation (#2562) 887 Fixes: 888 - #2439: Add missing DepthGreater case to Fragment only check. 889 - #2168: Disallow BufferBlock on StorageBuffer variables for Vulkan. 890 - #2408: Restrict and Aliased decorations cannot be applied to the same id. 891 - #2447: Improve function call parameter check. 892 - Reduce 893 - Add Pass to remove unreferenced blocks. (#2398) 894 - Allows passing options to the validator. (#2401) 895 - Improve reducer algorithm and other changes (#2472) 896 - Add Pass to remove selections (#2485) 897 - Add passes to simplify branches (#2507) 898 Fixes: 899 - #2478: fix loop to selection pass for loops with combined header/continue block 900 901v2019.2 2019-02-20 902 - General: 903 - Support SPV_EXT_physical_storage_buffer 904 - A number of memory leak have been fixed. 905 - Removed use of deprecated Google test macro: 906 - Changed the BUILD.gn to only build tests in Chromium. 907 - Optimizer 908 - Upgrade memory model improvments for modf and frexp. 909 - Add a new pass to move loads closer to their uses: code sinking. 910 - Invalidating the type manager now invalidates the constnat manager. 911 - Expand instrumentation pass for bindless bounds checking to runtime-sized descriptor arrays. 912 - Add a new pass that removes members from structs that are not used: dead member elimination. 913 Fixes: 914 - #2292: Remove undefined behaviour when folding bit shifts. 915 - #2294: Fixes for instrumentation code. 916 - #2293: Fix overflow when folding -INT_MIN. 917 - #2374: Don't merge unreachable blocks when merging blocks. 918 - Validator 919 - Support SPV_KHR_no_integer_wrap and related decorations. 920 - Validate Vulkan rules for OpTypeRuntimeArray. 921 - Validate NonWritable decoration. 922 - Many WebGPU specific validation rules were added. 923 - Validate variable pointer related function call rules. 924 - Better error messages. 925 Fixes: 926 - #2307: Check forwards references in OpTypeArray. 927 - #2315, #2303: Fixed the layout check for relaxed layout. 928 - #1628: Emit an error when an OpSwitch target is not an OpLabel. 929 - Reduce 930 - Added more documentation for spirv-reduce. 931 - Add ability to remove OpPhi instructions. 932 - Add ability to merge two basic blocks. 933 - Add ability to remove unused functions and unused basic blocks. 934 Fixes: 935 936v2019.1 2019-01-07 937 - General: 938 - Created a new tool called spirv-reduce. 939 - Add cmake option to turn off SPIRV_TIMER_ENABLED (#2103) 940 - New optimization pass to update the memory model from GLSL450 to VulkanKHR. 941 - Recognize OpTypeAccelerationStructureNV as a type instruction and ray tracing storage classes. 942 - Fix GCC8 build. 943 - Add --target-env flag to spirv-opt. 944 - Add --webgpu-mode flag to run optimizations for webgpu. 945 - The output disassembled line number stead of byte offset in validation errors. (#2091) 946 - Optimizer 947 - Added the instrumentation passes for bindless validation. 948 - Added passes to help preserve OpLine information (#2027) 949 - Add basic support for EXT_fragment_invocation_density (#2100) 950 - Fix invalid OpPhi generated by merge-return. (#2172) 951 - Constant and type manager have been turned into analysies. (#2251) 952 Fixes: 953 - #2018: Don't inline functions with a return in a structured CFG contstruct. 954 - #2047: Fix bug in folding when volatile stores are present. 955 - #2053: Fix check for when folding floating pointer values is allowed. 956 - #2130: Don't inline recursive functions. 957 - #2202: Handle multiple edges between two basic blocks in SSA-rewriter. 958 - #2205: Don't unswitch a latch condition during loop unswitch. 959 - #2245: Don't fold branch in loop unswitch. Run dead branch elimination to fold them. 960 - #2204: Fix eliminate common uniform to place OpPhi instructions correctly. 961 - #2247: Fix type mismatches caused by scalar replacement. 962 - #2248: Fix missing OpPhi after merge return. 963 - #2211: After merge return, fix invalid continue target. 964 - #2210: Fix loop invariant code motion to not place code between merge instruction and branch. 965 - #2258: Handle CompositeInsert with no indices in VDCE. 966 - #2261: Have replace load size handle extact with no index. 967 - Validator 968 - Changed the naming convention of outputing ids with names in diagnostic messages. 969 - Added validation rules for UniformConstant variables in Vulkan. 970 - #1949: Validate uniform variable type in Vulkan 971 - Ensure for OpVariable that result type and storage class operand agree (#2052) 972 - Validator: Support VK_EXT_scalar_block_layout 973 - Added Vulkan memory model semantics validation 974 - Added validation checkes spefic to WebGPU environment. 975 - Add support for VK_EXT_Transform_feedback capabilities (#2088) 976 - Add validation for OpArrayLength. (#2117) 977 - Ensure that function parameter's type is not void (#2118) 978 - Validate pointer variables (#2111) 979 - Add check for QueueFamilyKHMR memory scope (#2144) 980 - Validate PushConstants annotation and type (#2140) 981 - Allow Float16/Int8 for Vulkan 1.0 (#2153) 982 - Check binding annotations in resource variables (#2151, #2167) 983 - Validate OpForwardPointer (#2156) 984 - Validate operation for OpSpecConstantOp (#2260) 985 Fixes: 986 - #2049: Allow InstanceId for NV ray tracing 987 - Reduce 988 - Initial commit wit a few passes to reduce test cases. 989 - Validation is run after each reduction step. 990 Fixes: 991 992 993v2018.6 2018-11-07 994 - General: 995 - Added support for the Nvidia Turing and ray tracing extensions. 996 - Make C++11 the CXX standard in CMakeLists.txt. 997 - Enabled a parallel build for MSVC. 998 - Enable pre-compiled headers for MSVC. 999 - Added a code of conduct. 1000 - EFFCEE and RE2 are now required when build the tests. 1001 - Optimizer 1002 - Unrolling loops marked for unrolling in the legalization passes. 1003 - Improved the compile time of loop unrolling. 1004 - Changee merge-return to create a placeholder loop around the function. 1005 - Small improvement to merge-blocks to allow it to merge more often. 1006 - Enforce an upper bound for the ids, and add option to set it. 1007 - #1966: Report error if there are unreachable block before running merge return 1008 Fixes: 1009 - #1917: Allow 0 (meaning unlimited) as a parameter to --scalar-replacement 1010 - #1915: Improve handling of group decorations. 1011 - #1942: Fix incorrect uses of the constant manager. Avoids type mismatches in generated code. 1012 - #1997: Fix dead branch elimination when there is a loop in folded selection. 1013 - #1991: Fixes legality check in if-conversion. 1014 - #1987: Add nullptr check to array copy propagation. 1015 - #1984: Better handling of OpUnreachable in ADCE. 1016 - #1983: Run merge return on reachable functions only. 1017 - #1956: Handled atomic operations in ADCE. 1018 - #1963: Fold integer divisions by 0 to 0. 1019 - #2019: Handle MemberDecorateStringGOOGLE in ADCE and strip reflect. 1020 - Validator 1021 - Added validation for OpGroupNonUniformBallotBitCount. 1022 - Added validation for the Vulkan memory model. 1023 - Added support for VK_KHR_shader_atddomic_int64. 1024 - Added validation for execution modes. 1025 - Added validation for runtime array layouts. 1026 - Added validation for 8-bit storage. 1027 - Added validation of OpPhi instructions with pointer result type. 1028 - Added checks for the Vulkan memory model. 1029 - Validate MakeTexelAvailableKHR and MakeTexelVisibleKHR 1030 - Allow atomic function pointer for OpenCL. 1031 - FPRounding mode checks were implemented. 1032 - Added validation for the id bound with an option to set the max id bound. 1033 Fixes: 1034 - #1882: Improve the validation of decorations to reduce memory usage. 1035 - #1891: Fix an potential infinite loop in dead-branch-elimination. 1036 - #1405: Validate the storage class of boolean objects. 1037 - #1880: Identify arrays of type void as invalid. 1038 - #487: Validate OpImageTexelPointer. 1039 - #1922: Validate OpPhi instructions are at the start of a block correctly. 1040 - #1923: Validate function scope variable are at the start of the entry block. 1041 1042v2018.5 2018-09-07 1043 - General: 1044 - Support SPV_KHR_vulkan_memory_model 1045 - Update Dim capabilities, to match SPIR-V 1.3 Rev 4 1046 - Automated build bots no run tests for the VS2013 case 1047 - Support Chromium GN build 1048 - Use Kokoro bots: 1049 - Disable Travis-CI bots 1050 - Disable AppVeyor VisualStudio Release builds. Keep VS 2017 Debug build 1051 - Don't check export symbols on OSX (Darwin): some installations don't have 'objdump' 1052 - Reorganize source files and namespaces 1053 - Fixes for ClangTidy, and whitespace (passes 'git cl presumit --all -uf') 1054 - Fix unused param compile warnings/errors when Effcee not present 1055 - Avoid including time headers when timer functionality is disabled 1056 - Avoid too-stringent warnings flags for Clang on Windows 1057 - Internal refactoring 1058 - Add hooks for automated fuzzing 1059 - Add testing of command line executables 1060 - #1688: Use binary mode on stdin; fixes "spirv-dis <foo.spv" on Windows 1061 - Optimizer 1062 - The optimizer validates the module before it begins 1063 - Add API to register passes by string name 1064 - Fold a vector shuffle feeding a vector shuffle 1065 - Add -combine-access-chains transform 1066 - Refactor how IRContext is handled by passes 1067 - Improve bookkeeping for instruction result type and result id 1068 - Fix over-duplication of decorations 1069 - Fix handling of exits from selections in dead-branch elimination, and dead code 1070 elimination. 1071 - Fix handling of certain kinds of flow control in merge-return 1072 Fixes: 1073 - #1721: Fix size bug when folding vector shuffles 1074 - #1722: Fix size infinite loop when folding vector shuffles 1075 - #1724: Fix finding a constant of a specific type 1076 - #1727: Dead branch elim: Reorder blocks if needed to satisfy dominance rule 1077 - #1729: Handle VariablePointers cases in various optimizations 1078 - #1731: Fix vector shuffle with literal id indicating undef value 1079 - #1736: Fix handling of decorations and phis in merge-return 1080 - #1787: Fix handling of decorations related to access chains 1081 - #1865: Avoid leaking memory for SPIR-V constant values 1082 - Validator 1083 - Improve error messages 1084 - Avoid platform-dependent traversal ordering, to ensure consistent messages 1085 - Use libspirv::Instruction where possible 1086 - Add option to skip all block layout checks 1087 - Validate all type IDs 1088 - Validate uses of OpFunction 1089 - Validate uses of OpTypeFunction 1090 - Disallow a struct containing its own type https://crbug.com/874372 1091 - #1685: Vulkan permits non-monotonic offsets for block members 1092 - #1697: Enforce block layout rules even when using relaxed block layout option 1093 - #1719: Fix line number for vector shuffle valiation error 1094 - #1789: Avoid assertion failure when validating some functions 1095 - #1800: Fix validation of OpCopyMemorySized 1096 - #1822: Stop enforcing struct member offset montonicity 1097 - #1831: Disallow void members in structs 1098 1099v2018.4 2018-07-08 1100 - General: 1101 - Support SPV_KHR_8bit_storage 1102 - Add gclient and presubmit configurations 1103 - Enable Kokoro build bots (#1625) 1104 - Group tests into fewer executables, reduces load on CI 1105 - Port test script to Python 3 1106 - Symbol export tests respect SPIRV_SKIP_TESTS 1107 - #1596: Operand lookup succeeds if enabled by a capability 1108 - #1624: Instruction lookup succeeds if enabled by a capability 1109 - Refactoring namespaces: 1110 - #1678: Change libspirv to spvtools 1111 - Code in source/utils moved into spvtools::utils 1112 - Code in source/comp moved into spvtools::comp 1113 - Optimizer: 1114 - Remove insert-extract-elim pass. Use simplification pass instead. 1115 - Preserve instruction-to-block mapping in most passes, to reduce runtime. 1116 - Small vector optimization for operands 1117 - Add pass to move Private variables to Function. Increase opportunity to optimize. 1118 - Fixes: 1119 #1120: Check static uses of entry point interfaces 1120 #1372: Avoid merging some structs, to preserve names for reflection 1121 #1577: Scalar replacement uses only undecorated types. 1122 #1578: Fix handling of forward-pointer types, and types embedding pointers 1123 to themselves. 1124 #1591: Inliner: Callee variable with initializer should have a store at the call site. 1125 #1634: Fix crash: Use type id in vector type lookup 1126 #1649: Fix assert in compact-ids pass 1127 Fix constant folder: ensure it uses the right type 1128 #1659: Folding rules added to IRContext. Avoids leak. 1129 - Validator 1130 - Add work-in-progress WebGPU environment. Disallows OpUndef 1131 - #670, #1581: Improve error messages; disassemble instruction 1132 - #491: Check structured switches 1133 - #937: Check layout rules for Block and BufferBlock in Uniform, StorageBuffer, PushConstant 1134 - #1281: Check invalid branches into structured constructs 1135 - #1522: Disallow array-of-arrays with DescriptorSets 1136 - #1577: Allow duplicate pointer types. 1137 - #1581: Better messages: output ID names along with numbers in more cases. 1138 - #1597: Check Vulkan 1.1 capabilities 1139 - #1618: Check invalid exit from structured case construct 1140 - #1622: Run IdPass before DataRulesPass 1141 - #1632: Reduce test time by artificially lowering limits in limit test 1142 - #1638: Block-decorated structs member order must respect offset order 1143 - #1657: Improve CFG validation diagnostics 1144 - Khronos SPIR-V #337: GLSL.std.450 Refract instruction Eta param can be any float scalar. 1145 - #1606: PushConstant Blocks follow storage-buffer layout rules 1146 - #1664: Check layout of StorageBuffer variables with Block decoration, using storage buffer 1147 rules 1148 - #1666: Layout validation should permit {vec3; float} packing 1149 - #1637, #1668: Layout validation uses RowMajor, ArrayStride, MatrixStride properly 1150 - Linker 1151 - Avoid buffer overrun when creating OpModuleProcessed 1152 1153v2018.3 2018-05-25 1154 - General: 1155 - Support SPV_EXT_descriptor_indexing 1156 - Support SPV_GOOGLE_decorate_string 1157 - Support SPV_GOOGLE_hlsl_functionality1 1158 - Support SPV_NV_shader_subgroup_partitioned 1159 - Use "unified1" grammar from SPIRV-Headers 1160 - Simplify support for new extensions. Assembler, disassembler, and simple validation 1161 support is automatic if new tokens are introduced with appropriate extension 1162 attributes in the "unified1" SPIR-V core grammar. 1163 - Disassembler: Emit more digits on floating point, to reliably reproduce all 1164 significand bits. (Use std::max_digits10 instead of std::digits10) 1165 - Fix compilation for old XCode versions: Explicit construction required for std::set. 1166 - Optimizer: 1167 - Add --strip-reflect 1168 - Add --time-report 1169 - Add --loop-fission 1170 - Add lop fusion. 1171 - Add loop peeling pass and internal utility. 1172 - Improve optimizer runtime. 1173 - Merge-return now works with structured control flow. 1174 - New (faster) SSA rewriter to convert local loads and stores to SSA IDs and phis. 1175 Can replace load/store elimination passes. 1176 - Fix instruction folding case: insertion that feeds and extract, when the extract 1177 remains. 1178 - Fold OpDot. 1179 - Fold OpFNegate. 1180 - Fold multply and divide of same value. 1181 - Fold FClamp feeding a compare. 1182 - Fold OpLoad feeding an extract, to reduce excessive copying. (#1547) 1183 - Fold Fmix feeding an extract. 1184 - Use simplification pass instead of insert-extract elimination. 1185 - Constant fold OpVectorTimesScalar. 1186 - Copy propagate arrays, in simple cases. 1187 - Aggressive dead code elimination: Can remove more instructions, e.g. derivatives. 1188 - Aggressive dead code elimination: Remove Workgroup variables that are written but not read. 1189 - Better handling of OpImageTexelPointer 1190 - Initial utilities for scalar evolution. 1191 - Add Vector dead code elimination. 1192 - Each pass can only run once. 1193 - Allow code hosting in if-conversion. 1194 - Add external interface for adding a PassToken, so external code can make their own 1195 passes. 1196 - Fixes: 1197 #1404: Don't optimize away the compute compute workgroup size constant. 1198 #1407: Remove a bad assertion 1199 #1456: Fix bug in SSA rewriter related to variables updated in loops. 1200 #1487: Fix long runtime in Dead insertion elimination: Don't revist select phi nodes. 1201 #1492: Aggressive dead code elimination can remove OpDecorateStringGOOGLE. 1202 #1527: Fix inlining of functions having OpKill and OpUnreachable. 1203 #1559: Fix assert failure in reduce-load-size pass. 1204 #1556: Aggressive dead code elimination: Fix handling of OpCopyMemory. 1205 - Validator: 1206 - Check Vulkan built-in variables 1207 - Check Vulkan-specific atomic result type rule. 1208 - Relax control barrier check for SPIR-V 1.3. Fixes #1427 1209 - Check OpPhi. 1210 - Check OpMemoryModel. 1211 - Stop checking sizes derived from spec-constants. 1212 - Re-enable checks for OpUConvert. 1213 - Vulkan: Fix check for PrimitiveId: Permit as Input in fragment shader. 1214 - Validate binary version for the given target environment. 1215 - Add tests for OpBranch checks. 1216 - Vulkan 1.1: Check scope for non-uniform subgroup operations. 1217 - Fix checks for SPV_AMD_gpu_shader_int16. 1218 - Fix logical layout check for OpDecorateId. 1219 - Fix checks for ViewportIndex & Layer for Vulkan and SPV_EXT_shader_viewport_index_layer. 1220 - Fixes: 1221 #1470: Vulkan: Don't restrict WorkgroupSize to Input storage class. 1222 #1469: Vulkan: Permit Subgroup memory scope for Vulkan 1.1. 1223 #1472: Per-vertex variable validation fixes. 1224 #1483: Valdiate barrier execution scopes for Vulkan 1.1. 1225 - Fixes: 1226 #898: Linker properly removes FuncParamAttr from imported symbols. 1227 #924, #1174: Fix handling of decoration groups in optimizer, linker. 1228 1229v2018.2 2018-03-07 1230 - General: 1231 - Support SPIR-V 1.3 and Vulkan 1.1. 1232 - Default target environment is now SPIR-V 1.3. For command-line tools, 1233 use the --target-env option to override the default. Examples: 1234 # Generate a SPIR-V 1.0 binary instead of SPIR-V 1.3 1235 spirv-as --target-env spv1.0 a.spvasm -o a.spv 1236 spirv-as --target-env vulkan1.0 a.spvasm -o a.spv 1237 # Validate as Vulkan 1.0 1238 spirv-val --target-env vulkan1.0 a.spv 1239 - Support SPV_GOOGLE_decorate_string and SPV_GOOGLE_hlsl_functionality1 1240 - Fixes: 1241 - Fix Android.mk build. Compilation was failing due to missing definitions of 1242 SpvCapabilityFloat16ImageAMD and other enumerated values. 1243 - Optimizer: Avoid generating duplicate names when merging types. 1244 - #1375: Validator: SPV_AMD_gpu_shaer_half_float implicitly allows declaration 1245 of the 16-bit floating point type. 1246 - #1376: Optimizer: Avoid folding half-precision float. 1247 1248v2018.1 2018-03-02 1249 - General: 1250 - Support Visual Studio 2013 again. (Continue support for VS 2015 and VS 2017.) 1251 - Support building SPIRV-Tools as a shared library. 1252 - Improve the HLSL legalization optimization recipe. #1311 1253 - Optimizer: 1254 - General speedups. 1255 - Remove generic dead code elimination functionality from transforms: 1256 --eliminate-local-single-block 1257 --eliminate-local-single-store 1258 --eliminate-local-multi-store 1259 To recover the previous behaviour, a recipe using those transforms should now 1260 also invoke the --eliminate-dead-code-aggressive transform. 1261 - Improve folding, including coverage for floating point, OpSelect, and arithmetic 1262 with non-trivial constant operands. 1263 - Add loop-invariant code motion pass. 1264 - Add loop-unrolling pass, for honouring unroll hits. 1265 - Add loop-unswitch pass. 1266 - Add instruction simplification pass. 1267 - Aggressive dead code elimination: Understands capability hierarchy when finding 1268 instructions it can eliminate (combinators). (PR #1268) 1269 - CCP can now fold floating point arithmetic. #1311 1270 - Validator: 1271 - Validate barrier instructions. 1272 - Check Vulkan-specific rules for atomics. 1273 - Check Vulkan prohibition of Location or Component decorations on BuiltIn variables. 1274 - Linker: 1275 - Add --verify-ids option 1276 - Add option to allow a resulting module to be partially linked. 1277 - Handle OpModuleProcessed (instructions in SPIR-V layout section 7c) 1278 - Fixes: 1279 - #1265: Optimizer: Fix use-after free bug in if-conversion. (Fix object lifecycle bug 1280 in type manager.) 1281 - #1282: Fix new warnings found by GCC 8.0.1. 1282 - #1285: Optimizer: Fix random failures during inlining. (Dangling references in DefUseManager) 1283 - #1295: Optimizer: Fix incorrect handling of Phi nodes in CCP. 1284 - #1300: Fix CCP: avoid bad CCP transitions and unsettled values. 1285 - #1304: Avoid static-duration variables of class type (with constructors). 1286 - #1323: Fix folding of an insert composite feeding a composite extract. 1287 - #1339: Fix CCP: Handle OpConstantNull boolean values as conditions. 1288 - #1341: DCEInst: Keep atomic instructions (and some others with side effects). 1289 - #1354: Don't fold integer division. 1290 - #1357: Support OpConstantNull in folding. 1291 - #1361: CCP: Fix handling of non-constant module-scope values 1292 1293v2018.0 2018-02-02 1294 - General 1295 - VisualStudio 2013 is no longer supported. VisualStudio 2015 is supported. 1296 - Use "include/unified1" directory from SPIRV-Headers. Requires recent SPIRV-Headers source. 1297 - Disassembler: spirv-dis adds --color option to force color disassembly. 1298 - Optimizer: 1299 - Add pass to eliminate dead insertions. 1300 - Aggressive dead code elimination now removes OpSwitch constructs. 1301 - Block merging occurs in more cases. 1302 - Add driver workaround transform: replace OpUnreachable with harmless branch to merge. 1303 - Improve instruction folding framework. 1304 - Add loop analysis. 1305 - Add scalar replacement of aggregates to size-optimization recipe. 1306 - Add pass to replace instructions invalid for a shader stage, with a harmless value. 1307 This changes the semantics of the program! Not for general use! 1308 - Rearragne and add passes to performance-optimization recipe, to produce better results. 1309 - Validator: 1310 - Validate OpenCL extended instructions. 1311 - Shaders can't perform atomics on floats. 1312 - Validate memory semantics values in atomics. 1313 - Validate instruction-adjacency constraints, e.g. OpPhi predecessors, merge instructions 1314 immediately precede branches. 1315 - Fixes: 1316 - PR 1198: Optimizer: Fix CCP in presence of matrix constants. 1317 - #1199: Optimizer: Fix CCP: don't propagate spec constants. 1318 - #1203: Optimizer: Fix common uniform elim bug introduced by refactoring. 1319 - #1210: Optimizer: Aggressive dead code elimination: Fix 'break' identification. 1320 - #1212: Optimizer: Aggressive dead code elimination: Was skipping too many instructions. 1321 - #1214: Optimizer: Aggressive dead code elimination: Fix infinite loop. 1322 - #1228: Optimizer: Fix CCP: Handling of varying Phi nodes; was resulting in infinite loop. 1323 - #1245: Optimizer: Dead branch elimination: Avoid a null pointer dereference. 1324 - #1250: Optimizer: Dead branch elimination: Avoid spuriously reporting a change. 1325 1326v2017.3 2018-01-12 1327 - General: 1328 - Support DebugInfo extended instruction set, targeted at OpenCL environments. 1329 See the SPIR-V Registry. 1330 - Generate a SPIRV-Tools.pc file for pkg-config. 1331 - Optimizer: 1332 - Progress for legalization of code generated from HLSL (issue #1118): 1333 - Add --legalize-hlsl option to run transforms used to transform intermediate 1334 code generated by HLSL to SPIR-V for Vulkan compilers. Those compilers 1335 normally run these transforms automatically. This option is used for developing 1336 those transforms. 1337 - Add Private-to-Function variable conversion for modules with logical 1338 addressing. 1339 - Add --ccp: SSA Conditional Constant Propagation (CCP) 1340 - Add --print-all to show disassembly for each optimization pass. 1341 - Internal: Add loop descriptors and post-order tree iterator. 1342 - Generalized dead branch elimination 1343 - Aggressive dead code elimination (ADCE) now removes dead functions and 1344 module-scope variables. 1345 - Vector extract/insert elimination now optimizes through some cases of 1346 VectorShuffle, and GLSL.std.450 Mix extended instruction. 1347 - Validator: 1348 - Add validation for GLSL.std.450 extended instruction set. 1349 - Check out of bounds composite accesses, where that's statically computable. 1350 Fixes #1112. 1351 - Check upper bits of literal numbers that aren't a multiple of 32-bits wide. 1352 - More validation of primitive instructions 1353 - Add optional "relaxed" checking logical addressing mode to permit some 1354 cases of pointer-to-pointer. Contributes to HLSL legalization (issue #1118). 1355 - Fixes: 1356 #1100: Validator: Image operand Sample can be used with OpImageSparseFetch, 1357 OpImageSparseRead. 1358 #1108: Remove duplicates transform was incorrectly removing non-duplicate 1359 decorations. 1360 #1111: Optimizer's type manager could reference deleted memory. 1361 #1112: Fix decoration equality check, e.g. it is now symmetric. 1362 #1129: Validator now disallows Dim=SupbassData for OpImageSparseRead. 1363 #1143: Fix CCP: Was generating incorrect code for loops. 1364 #1153: Fix CCP crash. 1365 #1154: Optimizer's internal instruction-to-block mappings were sometimes 1366 inconsistent. 1367 #1159: Fix CCP infinite loop. 1368 #1168: Fix dead branch elimination intermittently generating incorrect code. 1369 Fixes https://github.com/KhronosGroup/glslang/issues/1205 1370 #1186: Fix validation of PackDouble2x32 and UnpackDouble2x32 1371 1372v2017.2 2017-12-15 1373 - General: 1374 - Support OpenCL 1.2, 2.0 target environments, including embedded profiles 1375 - Add CONTRIBUTING.md 1376 - Fix exit status code for spirv-link 1377 - Disassember: Enable emitting ANSI colour codes to a string 1378 - Library avoids polluting global namespace. The libraries can export C and C++ 1379 symbols starting with "spv", or in a C++ namespace. Add a test for this. 1380 - Linux release builds include debug information, for easier profiling 1381 - Build bots no longer test VisualStudio 2013 1382 - Testing dependency RE2 requires VisualStudio 2015 or later 1383 - Build bots check code formatting 1384 - Optimizer: 1385 - Add --skip-validation to spirv-opt 1386 - Add dominance tree analysis 1387 - Add generic value propagation engine 1388 - Add global redundancy elimination within a function 1389 - Add scalar replacement of function-scope variables of composite type 1390 - Aggressive dead code elimination: Remove empty loops 1391 - Killing an instruction notifies the IRContext 1392 - IRContext::KillInst deletes the instruction 1393 - Move CFG analysis to IRContext 1394 - Add constant manager 1395 - Fix: Don't consider derivative instructions as combinators. 1396 - Fix: Don't delete an instruction twice in local dead-code-elimination 1397 - Fix: Don't consider derivative instructions as combinators. 1398 - Validator: 1399 - Finish checking of image instructions (Section 3.32.10) 1400 - Check sparse image instructions 1401 - Check OpTypeImage, OpTypeSampleImage 1402 - Check composite instructions (Section 3.32.12) 1403 - Check atomic instructions (Section 3.32.18) 1404 - Check OpEmitStreamVertex, OpEndStreamPrimitive instructions 1405 - Re-enable validation of OpCopyObject 1406 - OpKill, image ImplicitLod and QueryLod instructions can only be used in Fragment 1407 shaders. 1408 - Fixes for image instruction validation: 1409 - Lod image operand only usable with ExplicitLod and OpImageFetch 1410 - ExplicitLod Lod image operand must be float scalar 1411 - OpImageFectch Lod image operand must be int scalar 1412 - OpImageGather component operand must be 32-bits (integer scalar) 1413 - OpImageQuerySizeLod Lod must be integer scalar 1414 - Fixes: 1415 #622: Remove names and decorations when inlining 1416 #989: Aggressive dead code elim: Don't optimize away live breaks from a loop 1417 #991: Fix validation of SPV_AMD_shader_ballot 1418 #1004: Use after free of an instruction, in remove-duplicates transform 1419 #1007: OpImageRead not required to return 4-component vector 1420 #1009: OpImageRead can return scalar int/float types 1421 #1011: OpImageWrite should allow scalar int/float texel types 1422 #1012: Fix validat Dref type check 1423 #1017: Load-store elimination considers variable initializations 1424 #1034: Fix Windows debug build: operator< should be a weak ordering 1425 #1083: Inlining: Set parent (function) for each inlined basic block. 1426 #1075: Aggressive dead code elimination: Was leaving dangling references to 1427 removed blocks. 1428 1429v2017.1 2017-11-23 1430 - Update README with details on the [email protected] mailing list. 1431 - General: 1432 - Automatically deploy built artifacts to GitHub Releases 1433 - Add a Linker (module combiner). Under development. 1434 - Add Android.mk for Android NDK builds. 1435 - Add the 'effcee' library as an optional dependency for use in tests. 1436 Eventually it will be a required dependency, once downstream projects have 1437 a chance to adjust. Requires 're2' library. 1438 - Avoid static-duration variables of class type (with constructors). 1439 - Hack around bugs in gcc-4.8.1 template handling 1440 - Faster opcode lookup 1441 - Validator: 1442 - Recognize extensions listed on SPIR-V registry, 1443 through #25 SPV_AMD_shader_fragment_mask 1444 - Validator issues an info message when it sees an unrecognized extension. 1445 - Type check basic arithmetic operations 1446 - Type check carry/extended arithmetic operations 1447 - Type check vector arithmetic operations 1448 - Type check Relational and Logical instructions 1449 - Type check Bit instructions 1450 - Check type uniqueness rules 1451 - Check conversion instructions 1452 - Check image instructions 1453 - Check derivative instructions 1454 - Check OpVectorShuffle 1455 - Check OpBranchConditional 1456 - OpModuleProcessed is only allowed after debug names section and before annotations 1457 section. 1458 - Checks the right kind of return is called for each function (void or non-void). 1459 - Add option to relax type check when storing structs (--relax-store-struct) 1460 - Optimizer: 1461 - Refactoring internal representation of the module, including: 1462 - IRContext: owns a module and manages analyses 1463 - Instructions are owned by intrusive lists, and have unique IDs 1464 - BasicBlock owns its instruction list. 1465 - DefUseManager: change representation of uses, for faster processing 1466 on large modules. 1467 - Add high level recipes: -O, -Os, and -Oconfig 1468 Recipes for -O and -Os are under development. 1469 - Add eliminate-dead-function transform 1470 - Add strength reduction transform: For now, convert multiply by power of 2 1471 to a bit shift. 1472 - Add CFG cleanup transform 1473 - Add removal of dead module-scope variables 1474 - Add merge-return transform for modules without structured control flow 1475 - Add redundancy elimination within a basic block (local value numbering) 1476 - Extract-insert elimination: 1477 - Recognize the case where the first instruction in the sequence is an 1478 OpCompositeConstruct or OpConstantComposite 1479 - Handle some cases of nested structs 1480 - Dead branch elimination now can eliminate entire selection constructs 1481 when all arms are dead. 1482 - Compressing codec: 1483 - Updated algorithm to 1.01, 1.02, 1.03 1484 - Not built by default. Use -DSPIRV_BUILD_COMPRESSION=ON to build. 1485 - Codec can be parameterized by a customized model. 1486 - Fixes: 1487 #728: Fix decoration of inlined functions 1488 #798: spirv-as should fail when given unrecognized long option 1489 #800: Inliner: Fix inlining function into header of multi-block loop 1490 #824: Eliminate-local-multi-store: Fix a crash 1491 #826: Elimiante-local-multi-store: Fix a crash 1492 #827: Fix crash when compact-ids transform runs before another transform. 1493 #834: Add Cmake option to build the compressing codec. Off by default. 1494 #911: Fix classification of Line and NoLine instructions 1495 1496v2017.0 2017-09-01 1497 - Update README to describe that assembler, disassembler, and binary parser support 1498 are based on grammar files from the SPIRV-Headers repository. 1499 1500v2016.7 2017-09-01 1501 - Add SPIR-V 1.2 1502 - OpenCL 2.2 support is now based on SPIR-V 1.2 1503 - Support AMD extensions in assembler, disassembler: 1504 SPV_AMD_gcn_shader 1505 SPV_AMD_shader_ballot 1506 SPV_AMD_shader_explicit_vertex_parameter 1507 SPV_AMD_shader_trinary_minmax 1508 SPV_AMD_gpu_shader_half_float 1509 SPV_AMD_texture_gather_bias_lod 1510 SPV_AMD_gpu_shader_int16 1511 - Optimizer: Add support for: 1512 - Inline all function calls in entry points. 1513 - Flatten decoration groups. Fixes #602 1514 - Id compaction (minimize Id bound). Fixes #624 1515 - Eliminate redundant composite insert followed by extract 1516 - Simplify access chains to local variables 1517 - Eliminate local variables with a single store, if possible 1518 - Eliminate local variables with a several stores, if possible 1519 - Eliminate loads and stores in same block to local variables 1520 - Eliminate redundant insert/extract to composite values 1521 - Aggressive dead instruction elimination 1522 - Eliminate dead branches 1523 - Merge blocks when the second can only be preceded by the first 1524 - Eliminate ommon uniform loads 1525 - Assembler: Add option to preserve numeric ids. Fixes #625 1526 - Add build target spirv-tools-vimsyntax to generate spvasm.vim, a SPIR-V 1527 assembly syntax file for Vim. 1528 - Version string: Allow overriding of wall clock timestamp with contents 1529 of environment variable SOURCE_DATE_EPOCH. 1530 - Validator implements relaxed rules for SPV_KHR_16bit_storage. 1531 - CMake installation rules use GNUInstallDirs. For example, libraries 1532 will be installed into a lib64 directory if that's the norm for the 1533 current system. 1534 - Fixes: 1535 #500: Parameterize validator limit checks 1536 #508: Support compilation under CYGWIN 1537 #517: Fix validation when continue (or case) contstruct is also the head of a 1538 nested control construct. 1539 #551: If a merge block is reachable, it must be *strictly* dominated by its 1540 header. 1541 #548: Validator: Error when the reserved OpImageSparseSampleProj* opcodes 1542 are used. 1543 #611: spvtools::Optimizer was failing to save the module to the output 1544 binary vector when all passes succeded without changes. 1545 #629: The inline-entry-points-all optimization could generate invalidly 1546 structured code when the inlined function had early returns. 1547 #697: Optimizer's Instruction::ForEachInId method was skipping semantics-id 1548 and scope-id. 1549 #755: Inliner: Fix inlining of callee with single Return appearing before 1550 the end of the function. 1551 #776: Fix dead branch elimination in presence of complex but dead control 1552 flow. 1553 #781: SPV_KHR_variable_pointers allows duplicate pointer types 1554 #782: Inliner: Fix remapping of non-label forward references in callee 1555 #787: Inliner: Fix remapping of inlined entry block when called from 1556 single block loop. 1557 #790: Inliner: Fix remapping of inlined entry block when callee has 1558 multiple returns. 1559 1560v2016.6 2016-12-13 1561 - Published the C++ interface for assembling, disassembling, validation, and 1562 optimization. 1563 - Support SPV_KHR_shader_draw_parameters in assembler, disassembler, parser. 1564 - Validator: 1565 - Add validator API accepting raw binary words 1566 - Increased coverage: 1567 - Checks "Data rules" in Universal Validation Rules, section 2.16.1 1568 - WIP: Universal Limits. 1569 - The minimum mandated upper bounds are checked. 1570 - TODO: Parameterize the validator to allow larger limits accepted by 1571 a more than minimally capable implementation. 1572 - OpSampledImage checks 1573 - OpConstantComposite checks 1574 - Id bound check 1575 - Disasssembler: 1576 - Generates friendly GLSL-based names for more builtin variables 1577 - Generates friendly names for numeric OpConstant values 1578 - Vendor tool info extracted from SPIR-V XML registry file. 1579 - Fixes issues: 1580 #429: Validator: Allow OpTypeForwardPointer and OpTypeStruct to reference 1581 undefined IDs 1582 #482: Validator: OpVariable initializer can be an ID of a module-scope variable 1583 1584v2016.5 2016-09-16 1585 - Support SPV_KHR_shader_ballot in assembler, disassembler, parser. 1586 - Disassembler: Generate friendly names for built-in variables. 1587 - Partial fixes: 1588 #359: Add Emacs helper for automatically diassembling/assembling a SPIR-V 1589 binary on file load/save. 1590 - Fixes: 1591 #414: Validator: Allow OpUndef for composite constants 1592 #415: Validator: Phi can use its own value in some cases. 1593 1594v2016.4 2016-09-01 1595 - Relicensed under Apache 2.0 1596 - Add optimization passes (in API and spirv-opt command) 1597 - Fold spec constants defined with OpSpecConstantOp and 1598 OpSpecConstantComposite to normal constants with fixed value(s). 1599 - Fixes issues: 1600 #318: Relicensed under Apache 2.0 1601 1602v2016.3 2016-08-24 1603 - Add target environment enums for OpenCL 2.1, OpenCL 2.2, 1604 OpenGL 4.0, OpenGL 4.1, OpenGL 4.2, OpenGL 4.3, OpenGL 4.5. 1605 - Add spirv-cfg, an experimental tool to dump the control flow graph 1606 as a GraphiViz "dot" graph 1607 - Add optimization pass: Eliminate dead constants. 1608 - Add spirv-lesspipe.sh filter utility 1609 - Fixes issues: 1610 #288: Check def-use dominance rules for OpPhi (variable,parent) operands 1611 #339: Allow OpUndef in types-constants-global-vars section, as required 1612 by SPIR-V 1.0 Rev7, 1.1 Rev 3. 1613 #340: Avoid race on mkdir during build 1614 #365: Relax PointSize, ClipDistance, CullDistance capability check in all 1615 environments not just Vulkan 1.0. 1616 1617v2016.2 2016-08-05 1618 - Validator is incomplete 1619 - Checks ID use block is dominated by definition block 1620 - Add optimization passes (in API and spirv-opt command) 1621 - Strip debug info instructions 1622 - Freeze spec constant to their default values 1623 - Allow INotEqual as operation for OpSpecConstantOp 1624 - Fixes bugs: 1625 #270: validator: crash when continue construct is unreachable 1626 #279: validator: infinite loop when analyzing some degenerate control 1627 flow graphs 1628 #286: validator: don't incorrectly generate def-use error for 1629 (variable,parent) parameters to OpPhi 1630 #290: disassembler: never generate bare % for an identifier 1631 #295: validator: def-use dominance check should ignore unreachable uses 1632 #276: validator: allow unreachable continue constructs 1633 #297: validator: allow an unreachable block to branch to a reachable 1634 merge block 1635 1636v2016.1 2016-07-19 1637 - Fix https://github.com/KhronosGroup/SPIRV-Tools/issues/261 1638 Turn off ClipDistance and CullDistance capability checks for Vulkan. 1639 - The disassembler can emit friendly names based on debug info (OpName 1640 instructions), and will infer somewhat friendly names for most types. 1641 This is turned on by default for the spirv-dis command line tool. 1642 - Updated to support SPIR-V 1.1 rev 2 1643 - Input StorageClass, Sampled1D capability, and SampledBuffer capability 1644 do not require Shader capability anymore. 1645 1646v2016.0 2016-07-04 1647 1648 - Adds v<year>.<index> versioning, with "-dev" indicating 1649 work in progress. The intent is to more easly report 1650 and summarize functionality when SPIRV-Tools is incorporated 1651 in downstream projects. 1652 1653 - Summary of functionality (See the README.md for more): 1654 - Supports SPIR-V 1.1 Rev 1 1655 - Supports SPIR-V 1.0 Rev 5 1656 - Supports GLSL std450 extended instructions 1.0 Rev 3 1657 - Supports OpenCL extended instructions 1.0 Rev 2 1658 - Assembler, disassembler are complete 1659 - Supports floating point widths of 16, 32, 64 bits 1660 - Supports integer widths up to 64 bits 1661 - Validator is incomplete 1662 - Checks capability requirements in most cases 1663 - Checks module layout constraints 1664 - Checks ID use-definition ordering constraints, 1665 ignoring control flow 1666 - Checks some control flow graph rules 1667 - Optimizer is introduced, with few available transforms. 1668 - Supported on Linux, OSX, Android, Windows 1669 1670 - Fixes bugs: 1671 - #143: OpenCL pow and pown arguments 1672