Ryujinx

Author	SHA1	Message	Date
riperiperi	76b53e018a	GPU: Add fallback when textureGatherOffsets is not supported (#5792) * GPU: Add fallback when textureGatherOffsets is not supported. This PR adds a fallback for GPUs or APIs that don't support an equivalent to the method `textureGatherOffsets`, where each of the 4 gathered texels has an individual offset. This is done by reusing the existing code to handle non-const offsets for texture instructions, though it has also been corrected as there were a few implementation issues. MoltenVK reports support for this capability, and it didn't error when we initially released the MacOS build, but that has since changed. MVK still reports support, but spirv-cross has been fixed in a way that it _attempts_ to use this capability, but the metal compiler errors since it doesn't exist. Some other fixes: - textureGatherOffsets emulation has been changed significantly. It now uses 4 texture sample instructions (not gather), calculates a base texel (i=0 j=0) and adds the offsets onto it before converting into a tex coord. The final result is offset into a texel center, so it shouldn't be subject to interpolation, though this isn't perfect and could have some error with floating point formats with linear sampling. It is subject to texture wrap mode as it should be, which is why texelFetch was not used. - Maybe gather should be used here with component `w` (i=0, j=0), though this multiplies number of texels fetched by 4... The way it was doing this before _was_ wrong, but doing it right would avoid issues with texel center precision. - textureGatherOffset (singular) now performs textureGather with the offset applied to the coords, rather than the slower fallback where each texel is fetched individually. * Increment shader cache version, remove unused arg * Use base texture size for gather coord offset. Implicit LOD for gather is not supported. * Use 4 texture gathers for offsets emulation Avoids issues with interpolation at cost of performance (not sure how bad this is) * Address Feedback	2023-10-20 15:05:09 +02:00
gdkchan	5ff6ea6d82	Fix ShaderTools GpuAcessor default values (#5646)	2023-09-05 01:16:09 +02:00
gdkchan	6ed613a6e6	Fix vote and shuffle shader instructions on AMD GPUs (#5540) * Move shuffle handling out of the backend to a transform pass * Handle subgroup sizes higher than 32 * Stop using the subgroup size control extension * Make GenerateShuffleFunction static * Shader cache version bump	2023-08-16 21:31:07 -03:00
gdkchan	effd546331	Implement scaled vertex format emulation (#5564) * Implement scaled vertex format emulation * Auto-format (whitespace) * Delete ToVec4Type	2023-08-16 08:30:33 -03:00
gdkchan	b423197619	Delete ShaderConfig and organize shader resources/definitions better (#5509) * Move some properties out of ShaderConfig * Stop using ShaderConfig on backends * Replace ShaderConfig usages on Translator and passes * Move remaining properties out of ShaderConfig and delete ShaderConfig * Remove ResourceManager property from TranslatorContext * Move Rewriter passes to separate transform pass files * Fix TransformPasses.RunPass on cases where a node is removed * Move remaining ClipDistancePrimitivesWritten and UsedFeatures updates to decode stage * Reduce excessive parameter passing a bit by using structs more * Remove binding parameter from ShaderProperties methods since it is redundant * Replace decoder instruction checks with switch statement * Put GLSL on the same plan as SPIR-V for input/output declaration * Stop mutating TranslatorContext state when Translate is called * Pass most of the graphics state using a struct instead of individual query methods * Auto-format * Auto-format * Add backend logging interface * Auto-format * Remove unnecessary use of interpolated strings * Remove more modifications of AttributeUsage after decode * PR feedback * gl_Layer is not supported on compute	2023-08-13 22:26:42 -03:00
gdkchan	f95b7c5877	Fix incorrect fragment origin when YNegate is enabled (#4673) * Fix incorrect fragment origin when YNegate is enabled * Shader cache version bump * Do not update support buffer if shader does not read gl_FragCoord * Pass unscaled viewport size to the support buffer	2023-07-29 18:47:03 -03:00
gdkchan	eb0bb36bbf	Implement transform feedback emulation for hardware without native support (#5080) * Implement transform feedback emulation for hardware without native support * Stop doing some useless buffer updates and account for non-zero base instance * Reduce redundant updates even more * Update descriptor init logic to account for ResourceLayout * Fix transform feedback and storage buffers not being updated in some cases * Shader cache version bump * PR feedback * SetInstancedDrawVertexCount must be always called after UpdateState * Minor typo	2023-06-10 18:31:38 -03:00
gdkchan	2cdcfe46d8	Remove barrier on Intel if control flow is potentially divergent (#5044) * Remove barrier on Intel if control flow is potentially divergent * Shader cache version bump	2023-06-08 17:43:16 -03:00
gdkchan	fe30c03cac	Implement soft float64 conversion on shaders when host has no support (#5159) * Implement soft float64 conversion on shaders when host has no support * Shader cache version bump * Fix rebase error	2023-06-08 17:09:14 -03:00
gdkchan	21c9ac6240	Implement shader storage buffer operations using new Load/Store instructions (#4993) * Implement storage buffer operations using new Load/Store instruction * Extend GenerateMultiTargetStorageOp to also match access with constant offset, and log and comments * Remove now unused code * Catch more complex cases of global memory usage * Shader cache version bump * Extend global access elimination to work with more shared memory cases * Change alignment requirement from 16 bytes to 8 bytes, handle cases where we need more than 16 storage buffers * Tweak preferencing to catch more cases * Enable CB0 elimination even when host storage buffer alignment is > 16 (for Intel) * Fix storage buffer bindings * Simplify some code * Shader cache version bump * Fix typo * Extend global memory elimination to handle shared memory with multiple possible offsets and local memory	2023-06-03 20:12:18 -03:00
cstamford	dc0dbc50ab	Add support for VK_EXT_depth_clip_control. (#5027) * Add support for VK_EXT_depth_clip_control. * Code review feedback Minor formatting Co-authored-by: gdkchan <gab.dark.100@gmail.com> * Check .DepthClipControl to make sure the host actually supports the feature. * Review feedback: remove Vulkan platform switch, relying on QueryHostSupportsDepthClipControl to drive the behaviour - OpenGL returns true, and any future platforms that don't support the [-1, 1] depth mode can return false for the transformation. --------- Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2023-05-28 23:31:56 +02:00
TSR Berry	cee7121058	Move solution and projects to src	2023-04-27 23:51:14 +02:00

12 commits