N-archive/yuzu - SiliconForest Atelier

Author	SHA1	Message	Date
ameerj	6ac97405df	Vk Async pipeline compilation	2020-08-16 12:02:22 -04:00
Lioncash	c4ed791164	common/fileutil: Convert namespace to Common::FS Migrates a remaining common file over to the Common namespace, making it consistent with the rest of common files. This also allows for high-traffic FS related code to alias the filesystem function namespace as namespace FS = Common::FS; for more concise typing.	2020-08-16 06:52:40 -04:00
Lioncash	167d36ec3c	vulkan/wrapper: Avoid unnecessary copy in EnumerateInstanceExtensionProperties() Given this is implicitly creating a std::optional, we can move the vector into it.	2020-08-14 08:23:49 -04:00
Lioncash	b724a4d90c	General: Tidy up clang-format warnings part 2	2020-08-13 14:19:08 -04:00
Lioncash	06809ad7bc	vulkan: Silence more -Wmissing-field-initializer warnings	2020-08-03 12:28:57 -04:00
Lioncash	80eedff9e1	vulkan: Resolve -Wmissing-field-initializer warnings	2020-07-25 03:50:18 -04:00
bunnei	dc2d31b1b2	Merge pull request #4393 from lioncash/unused5 vk_rasterizer: Remove unused variable in Clear()	2020-07-24 20:33:58 -07:00
bunnei	1d7de0a8ee	Merge pull request #4394 from lioncash/unused6 video_core: Remove unused variables	2020-07-23 19:54:59 -07:00
Rodrigo Locatti	7278c59d70	Merge pull request #4359 from ReinUsesLisp/clamp-shared renderer_{opengl,vulkan}: Clamp shared memory to host's limit	2020-07-21 04:51:05 -03:00
Lioncash	e17fb5ee97	video_core: Remove unused variables Silences several compiler warnings about unused variables.	2020-07-21 00:57:25 -04:00
Lioncash	4b369126c4	vk_rasterizer: Remove unused variable in Clear() The relevant values are already assigned further down in the lambda, so this can be removed entirely.	2020-07-21 00:49:10 -04:00
bunnei	3d13d7f48f	Merge pull request #4324 from ReinUsesLisp/formats video_core: Fix, add and rename pixel formats	2020-07-21 00:13:04 -04:00
bunnei	821d295f24	Merge pull request #4364 from lioncash/desig5 vulkan: Make use of designated initializers where applicable	2020-07-18 00:12:43 -04:00
ReinUsesLisp	81c8f92f2e	vk_device: Fix build error on old MSVC versions Designated initializers on old MSVC versions fail to build when they take the address of a constant.	2020-07-17 20:27:53 -03:00
bunnei	19c6bf72db	Merge pull request #4322 from ReinUsesLisp/fix-dynstate vk_state_tracker: Fix dirty flags for stencil_enable on VK_EXT_extended_dynamic_state	2020-07-17 09:50:45 -04:00
Lioncash	7785123b1c	wrapper: Make use of designated initializers where applicable	2020-07-16 20:01:01 -04:00
Lioncash	01da386617	vk_texture_cache: Make use of designated initializers where applicable	2020-07-16 19:52:38 -04:00
Lioncash	169759e069	vk_texture_cache: Amend mismatched access masks and indices in UploadBuffer Discovered while converting relevant parts of the codebase over to designated initializers.	2020-07-16 19:45:46 -04:00
Lioncash	08d36afd40	vk_swapchain: Make use of designated initializers where applicable	2020-07-16 19:27:02 -04:00
Lioncash	3c060503bc	vk_stream_buffer: Make use of designated initializers where applicable	2020-07-16 19:22:11 -04:00
Lioncash	70147e913f	vk_staging_buffer_pool: Make use of designated initializers where applicable	2020-07-16 19:22:03 -04:00
Lioncash	2025f847bb	vk_shader_util: Make use of designated initializers where applicable	2020-07-16 19:17:41 -04:00
Lioncash	97e7663004	vk_scheduler: Make use of designated initializers where applicable	2020-07-16 19:11:43 -04:00
Lioncash	fd7af52ec3	vk_sampler_cache: Make use of designated initializers where applicable	2020-07-16 19:06:40 -04:00
Lioncash	772b6e4d28	vk_resource_manager: Make use of designated initializers where applicable	2020-07-16 19:02:35 -04:00
Lioncash	8ebd6a21c5	vk_renderpass_cache: Make use of designated initializers where applicable	2020-07-16 18:57:23 -04:00
Lioncash	01f297f2e0	vk_rasterizer: Make use of designated initializers where applicable	2020-07-16 18:49:42 -04:00
Lioncash	c07b0ffe47	vk_query_cache: Make use of designated initializers where applicable	2020-07-16 18:34:04 -04:00
Lioncash	d43e923990	vk_pipeline_cache: Make use of designated initializers where applicable	2020-07-16 18:32:29 -04:00
Lioncash	7d5f93832c	vk_memory_manager: Make use of designated initializers where applicable	2020-07-16 18:26:30 -04:00
Lioncash	75c00c3cb0	vk_image: Make use of designated initializers where applicable	2020-07-16 18:24:26 -04:00
Lioncash	6d165481ad	vk_descriptor_pool: Make use of designated initializers where applicable	2020-07-16 18:19:45 -04:00
Lioncash	fb563e75e9	vk_graphics_pipeline: Resolve narrowing warnings For whatever reason, VK_TRUE and VK_FALSE aren't defined as having a VkBool32 type, so we need to cast to it explicitly.	2020-07-16 18:13:49 -04:00
Lioncash	5330ca396d	vk_compute_pipeline: Make use of designated initializers where applicable	2020-07-16 17:32:12 -04:00
Lioncash	757ddd8158	vk_compute_pass: Make use of designated initializers where applicable Note: Some barriers can't be converted over yet, as they ICE MSVC.	2020-07-16 17:23:56 -04:00
Lioncash	a66a0a6a53	vk_buffer_cache: Make use of designated initializers where applicable Note: An array within CopyFrom() cannot be converted over yet, as it ICEs MSVC when converted over.	2020-07-16 16:59:39 -04:00
Rodrigo Locatti	be68ee88c2	Merge pull request #4333 from lioncash/desig3 vk_graphics_pipeline: Make use of designated initializers where applicable	2020-07-16 17:41:45 -03:00
Rodrigo Locatti	b6d73ec9c2	Merge pull request #4332 from lioncash/vkdev vk_device: Make use of designated initializers where applicable	2020-07-16 17:41:20 -03:00
ReinUsesLisp	a5a72cbd20	renderer_{opengl,vulkan}: Clamp shared memory to host's limit This stops shaders from failing to build when the exceed host's shared memory size limit. An error is logged.	2020-07-16 16:02:46 -03:00
Lioncash	0f8b977663	vk_device: Make use of designated initializers where applicable Avoids redundant repetitions of variable names, and allows assignment all in one statement.	2020-07-13 22:24:01 -04:00
Lioncash	0475a167f8	vk_graphics_pipeline: Make use of designated initializers where applicable Avoids redundant variable name repetitions.	2020-07-13 21:07:56 -04:00
ReinUsesLisp	fbc232426d	video_core: Rearrange pixel format names Normalizes pixel format names to match Vulkan names. Previous to this commit pixel formats had no convention, leading to confusion and potential bugs.	2020-07-13 01:44:23 -03:00
ReinUsesLisp	eda37ff26b	video_core: Fix DXT4 and RGB565	2020-07-13 01:01:09 -03:00
ReinUsesLisp	480850ffe7	video_core: Fix B5G6R5_UNORM render target format	2020-07-13 01:01:09 -03:00
ReinUsesLisp	990b14f181	video_core: Fix B5G6R5U	2020-07-13 01:01:09 -03:00
ReinUsesLisp	1d20aac795	video_core: Implement RGBA32_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	9338599d72	video_core: Implement RGBA32_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	95c0f5afe5	video_core: Implement RGBA16_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	977d6c46f3	video_core: Implement RGBA8_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	50c6030a8d	video_core: Implement RG32_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	e849d68048	video_core: Implement RG8_SINT render target and fix RG8_UINT	2020-07-13 01:01:09 -03:00
ReinUsesLisp	f29fede49c	video_core: Implement R8_SINT render target	2020-07-13 01:01:08 -03:00
ReinUsesLisp	fd33e996e0	video_core: Implement R8_SNORM render target	2020-07-13 01:01:08 -03:00
Lioncash	db6fbd5894	vk_blit_screen: Make use of designated initializers where applicable Now that we make use of C++20, we can use designated initializers to make things a little nicer to read.	2020-07-12 19:45:30 -04:00
ReinUsesLisp	0fe09df386	vk_state_tracker: Fix dirty flags for stencil_enable on VK_EXT_extended_dynamic_state Fixes a regression on any game using stencil on devices with VK_EXT_extended_dynamic_state.	2020-07-12 20:43:42 -03:00
ReinUsesLisp	fca26980a2	vk_rasterizer: Pass <pSizes> to CmdBindVertexBuffers2EXT This has been fixed in Nvidia's public beta driver 451.74. The previous beta driver will be broken, people using these will have to update.	2020-07-10 18:15:32 -03:00
Rodrigo Locatti	e73c53fad1	Merge pull request #4283 from lat9nq/fix-linux-nvidia-vulkan vk_stream_buffer: Prevent Vulkan crash in Linux on recent NVIDIA driver	2020-07-10 00:18:44 -03:00
lat9nq	63d23835ef	configuration: implement per-game configurations (#4098 ) * Switch game settings to use a pointer In order to add full per-game settings, we need to be able to tell yuzu to switch to using either the global or game configuration. Using a pointer makes it easier to switch. * configuration: add new UI without changing existing funcitonality The new UI also adds General, System, Graphics, Advanced Graphics, and Audio tabs, but as yet they do nothing. This commit keeps yuzu to the same functionality as originally branched. * configuration: Rename files These weren't included in the last commit. Now they are. * configuration: setup global configuration checkbox Global config checkbox now enables/disables the appropriate tabs in the game properties dialog. The use global configuration setting is now saved to the config, defaulting to true. This also addresses some changes requested in the PR. * configuration: swap to per-game config memory for properties dialog Does not set memory going in-game. Swaps to game values when opening the properties dialog, then swaps back when closing it. Uses a `memcpy` to swap. Also implements saving config files, limited to certain groups of configurations so as to not risk setting unsafe configurations. * configuration: change config interfaces to use config-specific pointers When a game is booted, we need to be able to open the configuration dialogs without changing the settings pointer in the game's emualtion. A new pointer specific to just the configuration dialogs can be used to separate changes to just those config dialogs without affecting the emulation. * configuration: boot a game using per-game settings Swaps values where needed to boot a game. * configuration: user correct config during emulation Creates a new pointer specifically for modifying the configuration while emulation is in progress. Both the regular configuration dialog and the game properties dialog now use the pointer Settings::config_values to focus edits to the correct struct. * settings: split Settings::values into two different structs By splitting the settings into two mutually exclusive structs, it becomes easier, as a developer, to determine how to use the Settings structs after per-game configurations is merged. Other benefits include only duplicating the required settings in memory. * settings: move use_docked_mode to Controls group `use_docked_mode` is set in the input settings and cannot be accessed from the system settings. Grouping it with system settings causes it to be saved with per-game settings, which may make transferring configs more difficult later on, especially since docked mode cannot be set from within the game properties dialog. * configuration: Fix the other yuzu executables and a regression In main.cpp, we have to get the title ID before the ROM is loaded, else the renderer will reflect only the global settings and now the user's game specific settings. * settings: use a template to duplicate memory for each setting Replaces the type of each variable in the Settings::Values struct with a new class that allows basic data reading and writing. The new struct Settings::Setting duplicates the data in memory and can manage global overrides per each setting. * configuration: correct add-ons config and swap settings when apropriate Any add-ons interaction happens directly through the global values struct. Swapping bewteen structs now also includes copying the necessary global configs that cannot be changed nor saved in per-game settings. General and System config menus now update based on whether it is viewing the global or per-game settings. * settings: restore old values struct No longer needed with the Settings::Setting class template. * configuration: implement hierarchical game properties dialog This sets the apropriate global or local data in each setting. * clang format * clang format take 2 can the docker container save this? * address comments and style issues * config: read and write settings with global awareness Adds new functions to read and write settings while keeping the global state in focus. Files now generated per-game are much smaller since often they only need address the global state. * settings: restore global state when necessary Upon closing a game or the game properties dialog, we need to restore all global settings to the original global state so that we can properly open the configuration dialog or boot a different game. * configuration: guard setting values incorrectly This disables setting values while a game is running if the setting is overwritten by a per game setting. * config: don't write local settings in the global config Simple guards to prevent writing the wrong settings in the wrong files. * configuration: add comments, assume less, and clang format No longer assumes that a disabled UI element means the global state is turned off, instead opting to directly answer that question. Still however assumes a game is running if it is in that state. * configuration: fix a logic error Should not be negated * restore settings' global state regardless of accept/cancel Fixes loading a properties dialog and causing the global config dialog to show local settings. * fix more logic errors Fixed the frame limit would set the global setting from the game properties dialog. Also strengthened the Settings::Setting member variables and simplified the logic in config reading (ReadSettingGlobal). * fix another logic error In my efforts to guard RestoreGlobalState, I accidentally negated the IsPowered condition. * configure_audio: set toggle_stretched_audio to tristate * fixed custom rtc and rng seed overwriting the global value * clang format * rebased * clang format take 4 * address my own review Basically revert unintended changes * settings: literal instead of casting "No need to cast, use 1U instead" Thanks, Morph! Co-authored-by: Morph <39850852+Morph1984@users.noreply.github.com> * Revert "settings: literal instead of casting " This reverts commit 95e992a87c898f3e882ffdb415bb0ef9f80f613f. * main: fix status buttons reporting wrong settings after stop emulation * settings: Log UseDockedMode in the Controls group This should have happened when use_docked_mode was moved over to the controls group internally. This just reflects this in the log. * main: load settings if the file has a title id In other words, don't exit if the loader has trouble getting a title id. * use a zero * settings: initalize resolution factor with constructor instead of casting * Revert "settings: initalize resolution factor with constructor instead of casting" This reverts commit 54c35ecb46a29953842614620f9b7de1aa9d5dc8. * configure_graphics: guard device selector when Vulkan is global Prevents the user from editing the device selector if Vulkan is the global renderer backend. Also resets the vulkan_device variable when the users switches back-and-forth between global and Vulkan. * address reviewer concerns Changes function variables to const wherever they don't need to be changed. Sets Settings::Setting to final as it should not be inherited from. Sets ConfigurationShared::use_global_text to static. Co-Authored-By: VolcaEM <volcaem@users.noreply.github.com> * main: load per-game settings after LoadROM This prevents `Restart Emulation` from restoring the global settings after the per-game settings were applied. Thanks to BSoDGamingYT for finding this bug. * Revert "main: load per-game settings after LoadROM" This reverts commit 9d0d48c52d2dcf3bfb1806cc8fa7d5a271a8a804. * main: only restore global settings when necessary Loading the per-game settings cannot happen after the ROM is loaded, so we have to specify when to restore the global state. Again thanks to BSoD for finding the bug. * configuration_shared: address reviewer concerns except operator overrides Dropping operator override usage in next commit. Co-Authored-By: LC <lioncash@users.noreply.github.com> * settings: Drop operator overrides from Setting template Requires using GetValue and SetValue explicitly. Also reverts a change that broke title ID formatting in the game properties dialog. * complete rebase * configuration_shared: translate "Use global configuration" Uses ConfigurePerGame to do so, since its usage, at least as of now, corresponds with ConfigurationShared. * configure_per_game: address reviewer concern As far as I understand, it prevents the program from unnecessarily copying strings. Co-Authored-By: LC <lioncash@users.noreply.github.com> Co-authored-by: Morph <39850852+Morph1984@users.noreply.github.com> Co-authored-by: VolcaEM <volcaem@users.noreply.github.com> Co-authored-by: LC <lioncash@users.noreply.github.com>	2020-07-09 22:42:09 -04:00
lat9nq	1c7d106aac	vk_stream_buffer: set allocable_size to 9 MiB This solves the crash on Linux systems running the current Linux Long Lived branch nVidia driver.	2020-07-09 21:28:32 -04:00
bunnei	35f7740b6c	Merge pull request #4150 from ReinUsesLisp/dynamic-state-impl vulkan: Use VK_EXT_extended_dynamic_state when available	2020-07-07 10:58:09 -04:00
bunnei	41a333321a	Merge pull request #4175 from ReinUsesLisp/read-buffer gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading	2020-07-02 23:30:08 -04:00
Rodrigo Locatti	d217017c9e	Merge pull request #4191 from Morph1984/vertex-formats maxwell_to_gl/vk: Reorder vertex formats	2020-06-30 03:30:00 -03:00
Rodrigo Locatti	f84cbf6429	Merge pull request #4140 from ReinUsesLisp/validation-layers renderer_vulkan: Update validation layer name and test before enabling	2020-06-29 02:12:38 -03:00
Morph	4a35df337b	maxwell_to_vk: Reorder vertex formats and add A2B10G10R10 for all types except float	2020-06-28 02:57:10 -04:00
Fernando Sahmkow	528b19a842	General: Tune the priority of main emulation threads so they have higher priority than less important helper threads.	2020-06-27 11:36:09 -04:00
ReinUsesLisp	9d55e5586f	vk_rasterizer: Use nullptr for <pSizes> in CmdBindVertexBuffers2EXT Disable this temporarily.	2020-06-26 20:57:22 -03:00
ReinUsesLisp	8584a77eb2	vk_pipeline_cache: Avoid hashing and comparing dynamic state when possible With extended dynamic states, some bytes don't have to be collected from the pipeline key, hence we can avoid hashing and comparing them on lookups.	2020-06-26 20:57:22 -03:00
ReinUsesLisp	1a84209418	vulkan/fixed_pipeline_state: Move state out of individual structures	2020-06-26 20:57:22 -03:00
ReinUsesLisp	c94b398f14	vk_rasterizer: Use VK_EXT_extended_dynamic_state	2020-06-26 20:57:22 -03:00
ReinUsesLisp	a6db8e5f4d	renderer_vulkan/wrapper: Add VK_EXT_extended_dynamic_state functions	2020-06-26 20:55:15 -03:00
ReinUsesLisp	c387a72c76	fixed_pipeline_state: Add requirements for VK_EXT_extended_dynamic_state This moves dynamic state present in VK_EXT_extended_dynamic_state to a separate structure in FixedPipelineState. This is structure is at the bottom allowing us to hash and memcmp only when the extension is not supported.	2020-06-26 20:55:15 -03:00
ReinUsesLisp	7527402a46	vk_device: Enable VK_EXT_extended_dynamic_state when available	2020-06-26 20:55:15 -03:00
bunnei	78d3b54ea7	Merge pull request #4111 from ReinUsesLisp/preserve-contents-vk vk_rasterizer: Don't preserve contents on full screen clears	2020-06-26 18:48:12 -04:00
ReinUsesLisp	6481d91e4a	gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading After marking buffers as resident, Nvidia's driver seems to take a slow path. To workaround this issue, copy to a STREAM_READ buffer and then call GetNamedBufferSubData on it. This is a temporary solution until we have asynchronous flushing.	2020-06-26 16:58:40 -03:00
ReinUsesLisp	32a2dcd415	buffer_cache: Use buffer methods instead of cache virtual methods	2020-06-24 02:36:14 -03:00
ReinUsesLisp	32485917ba	gl_buffer_cache: Mark buffers as resident Make stream buffer and cached buffers as resident and query their address. This allows us to use GPU addresses for several proprietary Nvidia extensions.	2020-06-24 02:36:14 -03:00
Rodrigo Locatti	406d298457	Merge pull request #4110 from ReinUsesLisp/direct-upload-sets vk_update_descriptor: Upload descriptor sets data directly	2020-06-22 05:02:13 -03:00
ReinUsesLisp	2f09c7ddd3	renderer_vulkan: Update validation layer name and test before enabling Update validation layer string to VK_LAYER_KHRONOS_validation. While we are at it, properly check for available validation layers before enabling them.	2020-06-22 04:10:45 -03:00
bunnei	c27c76ed43	Merge pull request #4126 from lioncash/noexcept vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR()	2020-06-21 22:36:14 -04:00
bunnei	7d1dca4c98	Merge pull request #4099 from MerryMage/macOS-build Fix compilation on macOS	2020-06-19 23:31:04 -04:00
Lioncash	a6e5b84d1f	vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR() Check() can throw an exception if the Vulkan result isn't successful. We remove the check so that std::terminate isn't outright called and allows for better debugging (should it ever actually fail).	2020-06-19 23:01:59 -04:00
ReinUsesLisp	cf137ea40b	vk_rasterizer: Don't preserve contents on full screen clears There's no need to load contents from the CPU when a clear resets all the contents of the underlying memory. This is already implemented on OpenGL and the texture cache.	2020-06-18 18:18:33 -03:00
ReinUsesLisp	7d763f060e	vk_update_descriptor: Upload descriptor sets data directly Instead of copying to a temporary payload before sending the update task to the worker thread, insert elements to the payload directly.	2020-06-18 17:47:19 -03:00
MerryMage	69f38355ed	vk_rasterizer: BindTransformFeedbackBuffersEXT accepts a size of type VkDeviceSize	2020-06-18 15:47:44 +01:00
MerryMage	b1eada6079	renderer_vulkan: Fix macOS GetBundleDirectory reference	2020-06-18 15:47:44 +01:00
Morph	2f420618ea	vk_sampler_cache: Emulate GL_LINEAR/NEAREST minification filters Emulate GL_LINEAR/NEAREST minification filters using minLod = 0 and maxLod = 0.25 during sampler creation	2020-06-18 04:56:31 -04:00
Morph	be660e7749	maxwell_to_vk: Reorder filter cases and correct mipmap_filter=None maxwell_to_vk: Reorder filtering modes to start with None, then Nearest, then Linear. maxwell_to_vk: Logs filter modes under UNREACHABLE_MSG instead of UNIMPLEMENTED_MSG, since any unknown filter modes are invalid and not unimplemented. maxwell_to_vk: Return VK_SAMPLER_MIPMAP_MODE_NEAREST instead of VK_SAMPLER_MIPMAP_MODE_LINEAR when mipmap_filter is None with the description from the VkSamplerCreateInfo(3) man page.	2020-06-18 04:56:31 -04:00
Rodrigo Locatti	0bd9bc7201	Merge pull request #4066 from ReinUsesLisp/shared-ptr-buf buffer_cache: Avoid passing references of shared pointers and misc style changes	2020-06-15 22:29:32 -03:00
bunnei	c2ea1e1bcb	Merge pull request #4049 from ReinUsesLisp/separate-samplers shader/texture: Join separate image and sampler pairs offline	2020-06-13 13:48:27 -04:00
bunnei	5633887569	Merge pull request #3986 from ReinUsesLisp/shader-cache shader_cache: Implement a generic runtime shader cache	2020-06-12 23:14:48 -04:00
ReinUsesLisp	6508cdd003	buffer_cache: Avoid passing references of shared pointers and misc style changes Instead of using as template argument a shared pointer, use the underlying type and manage shared pointers explicitly. This can make removing shared pointers from the cache more easy. While we are at it, make some misc style changes and general improvements (like insert_or_assign instead of operator[] + operator=).	2020-06-09 18:30:49 -03:00
ReinUsesLisp	c95c254f3e	texture_cache: Implement rendering to 3D textures This allows rendering to 3D textures with more than one slice. Applications are allowed to render to more than one slice of a texture using gl_Layer from a VTG shader. This also requires reworking how 3D texture collisions are handled, for now, this commit allows rendering to slices but not to miplevels. When a render target attempts to write to a mipmap, we fallback to the previous implementation (copying or flushing as needed). - Fixes color correction 3D textures on UE4 games (rainbow effects). - Allows Xenoblade games to render to 3D textures directly.	2020-06-08 05:01:00 -03:00
Rodrigo Locatti	2293e8a11a	Merge pull request #4034 from ReinUsesLisp/storage-texels vk_rasterizer: Implement storage texels and atomic image operations	2020-06-07 18:43:24 -03:00
ReinUsesLisp	abcea1bb18	rasterizer_cache: Remove files and includes The rasterizer cache is no longer used. Each cache has its own generic implementation optimized for the cached data.	2020-06-07 04:32:57 -03:00
ReinUsesLisp	678f95e4f8	vk_pipeline_cache: Use generic shader cache Trivial port the generic shader cache to Vulkan.	2020-06-07 04:32:57 -03:00
bunnei	98671b4cfe	Merge pull request #4013 from ReinUsesLisp/skip-no-xfb vk_rasterizer: Skip transform feedbacks when extension is unavailable	2020-06-05 11:14:36 -04:00
ReinUsesLisp	5b2b6d594c	shader/texture: Join separate image and sampler pairs offline Games using D3D idioms can join images and samplers when a shader executes, instead of baking them into a combined sampler image. This is also possible on Vulkan. One approach to this solution would be to use separate samplers on Vulkan and leave this unimplemented on OpenGL, but we can't do this because there's no consistent way of determining which constant buffer holds a sampler and which one an image. We could in theory find the first bit and if it's in the TIC area, it's an image; but this falls apart when an image or sampler handle use an index of zero. The used approach is to track for a LOP.OR operation (this is done at an IR level, not at an ISA level), track again the constant buffers used as source and store this pair. Then, outside of shader execution, join the sample and image pair with a bitwise or operation. This approach won't work on games that truly use separate samplers in a meaningful way. For example, pooling textures in a 2D array and determining at runtime what sampler to use. This invalidates OpenGL's disk shader cache :) - Used mostly by D3D ports to Switch	2020-06-05 00:24:51 -03:00
ReinUsesLisp	866c1165af	vk_shader_decompiler: Implement atomic image operations Implement atomic operations on images. On GLSL these are atomicImage* functions (e.g. atomicImageAdd).	2020-06-02 02:20:02 -03:00
ReinUsesLisp	4a6b9a1a71	vk_rasterizer: Implement storage texels This is the equivalent of an image buffer on OpenGL. - Used by Octopath Traveler	2020-06-02 02:16:33 -03:00
ReinUsesLisp	3a59e724c9	maxwell_to_vk: Add R16UI image format - Used by Octopath Traveler	2020-06-02 02:15:20 -03:00
bunnei	6c0b1a9ee2	Merge pull request #3996 from ReinUsesLisp/front-faces fixed_pipeline_state,gl_rasterizer: Swap negative viewport checks for front faces	2020-06-01 14:04:35 -04:00
bunnei	e68ee43a1a	Merge pull request #3930 from ReinUsesLisp/animal-borders vk_rasterizer: Implement constant attributes	2020-05-31 18:40:17 -04:00
bunnei	058ec22787	Merge pull request #3982 from ReinUsesLisp/membar-cts shader/other: Implement MEMBAR.CTS	2020-05-30 11:51:42 -04:00
ReinUsesLisp	5616be12be	vk_rasterizer: Skip transform feedbacks when extension is unavailable Avoids calling transform feedback procedures when VK_EXT_transform_feedback is not available.	2020-05-29 03:05:29 -03:00
bunnei	1bb3122c1f	Merge pull request #3991 from ReinUsesLisp/depth-sampling texture_cache: Implement depth stencil texture swizzles	2020-05-28 23:33:38 -04:00
bunnei	630fc12d4e	Merge pull request #3961 from Morph1984/bgra8_srgb maxwell_to_vk: Add format B8G8R8A8_SRGB and add Attachable capability for B8G8R8A8_UNORM	2020-05-27 16:44:22 -04:00
ReinUsesLisp	32e6727dae	shader/other: Implement MEMBAR.CTS This silences an assertion we were hitting and uses workgroup memory barriers when the game requests it.	2020-05-27 00:19:45 -03:00
ReinUsesLisp	8bba84a401	texture_cache: Implement depth stencil texture swizzles Stop ignoring image swizzles on depth and stencil images. This doesn't fix a known issue on Xenoblade Chronicles 2 where an OpenGL texture changes swizzles twice before being used. A proper fix would be having a small texture view cache for this like we do on Vulkan.	2020-05-26 17:44:50 -03:00
ReinUsesLisp	efe7b7483b	fixed_pipeline_state: Remove unnecessary check for front faces flip The check to flip faces when viewports are negative were a left over from the old OpenGL code. This is not required on Vulkan where we have negative viewports.	2020-05-26 16:32:27 -03:00
bunnei	508242c267	Merge pull request #3981 from ReinUsesLisp/bar shader/other: Implement BAR.SYNC 0x0	2020-05-26 14:40:13 -04:00
bunnei	86345c126a	Merge pull request #3978 from ReinUsesLisp/write-rz shader_decompiler: Visit source nodes even when they assign to RZ	2020-05-25 21:31:33 -04:00
bunnei	1adabdac7f	Merge pull request #3905 from FernandoS27/vulkan-fix Correct a series of crashes and intructions on Async GPU and Vulkan Pipeline	2020-05-24 15:23:38 -04:00
bunnei	487dd05170	Merge pull request #3979 from ReinUsesLisp/thread-group shader/other: Implement thread comparisons (NV_shader_thread_group)	2020-05-24 00:33:06 -04:00
ReinUsesLisp	5d0986a53b	shader/other: Implement BAR.SYNC 0x0 Trivially implement this particular case of BAR. Unless games use OpenCL or CUDA barriers, we shouldn't hit any other case here.	2020-05-21 23:20:43 -03:00
ReinUsesLisp	e2b67a868b	shader/other: Implement thread comparisons (NV_shader_thread_group) Hardware S2R special registers match gl_Thread*MaskNV. We can trivially implement these using Nvidia's extension on OpenGL or naively stubbing them with the ARB instructions to match. This might cause issues if the host device warp size doesn't match Nvidia's. That said, this is unlikely on proper shaders. Refer to the attached url for more documentation about these flags. https://www.khronos.org/registry/OpenGL/extensions/NV/NV_shader_thread_group.txt	2020-05-21 23:18:37 -03:00
ReinUsesLisp	ed4e324991	shader_decompiler: Visit source nodes even when they assign to RZ Some operations like atomicMin were ignored because they returned were being stored to RZ. This operations have a side effect and it was being ignored.	2020-05-21 23:16:03 -03:00
ReinUsesLisp	434856c636	vk_shader_decompiler: Don't assert for void returns Atomic instructions can be used without returning anything and this is valid code. Remove the assert.	2020-05-21 23:16:03 -03:00
ReinUsesLisp	891236124c	buffer_cache: Use boost::intrusive::set for caching Instead of using boost::icl::interval_map for caching, use boost::intrusive::set. interval_map is intended as a container where the keys can overlap with one another; we don't need this for caching buffers and a std::set-like data structure that allows us to search with lower_bound is enough.	2020-05-21 16:44:00 -03:00
Morph	d0fc12684a	maxwell_to_vk: Add format B8G8R8A8_SRGB Add format B8G8R8A8_SRGB and add Attachable capability for B8G8R8A8_UNORM Used by Bravely Default II	2020-05-18 13:02:09 -04:00
ReinUsesLisp	7a27b7f3a3	vk_rasterizer: Match OpenGL's FlushAndInvalidate behavior Match OpenGL's behavior. This can fix or simplify bisecting issues on Vulkan.	2020-05-15 20:40:08 -03:00
bunnei	b1a1bd12ca	Merge pull request #3899 from ReinUsesLisp/float-comparisons shader_ir: Add separate instructions for ordered and unordered comparisons and fix NE on GLSL	2020-05-13 09:51:14 -04:00
ReinUsesLisp	91dddca26e	vk_rasterizer: Implement constant attributes Constant attributes (in OpenGL known disabled attributes) are not supported on Vulkan, even with extensions. To emulate this behavior we return zero on reads from disabled vertex attributes in shader code. This has no caching cost because attribute formats are not dynamic state on Vulkan and we have to store it in the pipeline cache anyway. - Fixes Animal Crossing: New Horizons terrain borders	2020-05-13 04:36:47 -03:00
ReinUsesLisp	cf6a40fc12	vk_rasterizer: Remove buffer check in attribute selection This was a left over from OpenGL when disabled buffers where not properly emulated. We no longer have to assert this as it is checked in vertex buffer initialization.	2020-05-13 04:36:47 -03:00
bunnei	1beaebe666	Merge pull request #3816 from ReinUsesLisp/vk-rasterizer-enable vk_graphics_pipeline: Implement rasterizer_enable on Vulkan	2020-05-11 18:22:51 -04:00
Fernando Sahmkow	8d15f8b28e	VkPipelineCache: Use a null shader on invalid address.	2020-05-09 20:51:34 -04:00
Fernando Sahmkow	0a4be73b9b	VideoCore: Use SyncGuestMemory mechanism for Shader/Pipeline Cache invalidation.	2020-05-09 19:25:29 -04:00
Rodrigo Locatti	7e376af8fc	Merge pull request #3839 from Morph1984/r8g8ui texture: Implement R8G8UI	2020-05-09 05:28:55 -03:00
ReinUsesLisp	4e57f9d5cf	shader_ir: Separate float-point comparisons in ordered and unordered This allows us to use native SPIR-V instructions without having to manually check for NAN.	2020-05-09 04:55:15 -03:00
bunnei	a9ee6e346b	Merge pull request #3842 from makigumo/maxwell_to_vk_vertexattribute_signed_int maxwell_to_vk: implement missing signed int formats	2020-05-09 00:36:09 -04:00
bunnei	50c27d5ae1	Merge pull request #3885 from ReinUsesLisp/viewport-swizzles video_core: Implement viewport swizzles with NV_viewport_swizzle	2020-05-08 15:16:53 -04:00
ReinUsesLisp	227278098a	vk_sampler_cache: Use VK_EXT_custom_border_color when available This should fix grass interactions on Breath of the Wild on Vulkan. It is currently untested against validation layers. Nvidia's Windows 443.09 beta driver or Linux 440.66.12 is required for now.	2020-05-04 20:49:23 -03:00
ReinUsesLisp	2dbf5290f2	vk_graphics_pipeline: Implement viewport swizzles with NV_viewport_swizzle	2020-05-04 18:31:17 -03:00
bunnei	2aff0b4733	Merge pull request #3808 from ReinUsesLisp/wait-for-idle {maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers	2020-05-03 02:43:18 -04:00
bunnei	f4ca8e0d3e	Merge pull request #3732 from lioncash/header vulkan: Remove unnecessary includes	2020-05-02 01:36:57 -04:00
bunnei	0128901102	Merge pull request #3809 from ReinUsesLisp/empty-index vk_rasterizer: Skip index buffer setup when vertices are zero	2020-05-02 01:21:57 -04:00
ReinUsesLisp	3b668e1210	vk_graphics_pipeline: Implement rasterizer_enable on Vulkan We can simply enable rasterizer discard matching the current pipeline key.	2020-05-02 01:47:25 -03:00
bunnei	e6b4311178	Merge pull request #3693 from ReinUsesLisp/clean-samplers shader/texture: Support multiple unknown sampler properties	2020-05-02 00:45:41 -04:00
Dan	96ee1b42bc	maxwell_to_vk: implement missing signed int formats	2020-04-30 23:39:16 +02:00
Morph	7909860d16	texture: Implement R8G8UI - Used by The Walking Dead: The Final Season	2020-04-30 13:19:36 -04:00
bunnei	bf3f030a0d	Merge pull request #3807 from ReinUsesLisp/fix-depth-clamp maxwell_3d: Fix depth clamping register	2020-04-30 13:07:31 -04:00
bunnei	c7b5a87c90	Merge pull request #3799 from ReinUsesLisp/iadd-cc shader: Implement P2R CC, IADD Rd.CC and IADD.X	2020-04-30 12:56:36 -04:00
bunnei	da2b8295e1	Merge pull request #3805 from ReinUsesLisp/preserve-contents texture_cache: Reintroduce preserve_contents accurately	2020-04-30 12:56:19 -04:00
Lioncash	6c53edd4d3	vulkan: Remove unnecessary includes Reduces some header churn and reduces rebuilds when some header internals change. While we're at it we can also resolve a missing include in buffer_cache.	2020-04-28 21:54:46 -04:00
bunnei	72b73d22ab	Merge pull request #3784 from ReinUsesLisp/shader-memory-util shader/memory_util: Deduplicate code	2020-04-28 12:05:50 -04:00
ReinUsesLisp	d6a24b4a5b	vk_rasterizer: Skip index buffer setup when vertices are zero Xenoblade 2 invokes a draw call with zero vertices. This is likely due to indirect drawing (glDrawArraysIndirect). This causes a crash in the staging buffer pool when trying to create a buffer with a size of zero. To workaround this, skip index buffer setup entirely when the number of indices is zero.	2020-04-28 02:24:33 -03:00
ReinUsesLisp	fe931ac976	{maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers Drop MemoryBarrier from the buffer cache and use Maxwell3D's register WaitForIdle. To implement this on OpenGL we just call glMemoryBarrier with the necessary bits. Vulkan lacks this synchronization primitive, so we set an event and immediately wait for it. This is not a pretty solution, but it's what Vulkan can do without submitting the current command buffer to the queue (which ends up being more expensive on the CPU).	2020-04-28 02:18:12 -03:00
ReinUsesLisp	bb1ed66d99	maxwell_3d: Fix depth clamping register Using deko3d as reference: `4e47ba0013/source/maxwell/gpu_3d_state.cpp (L42)` We were using bits 3 and 4 to determine depth clamping, but these are the same both enabled and disabled: state->depthClampEnable ? 0x101A : 0x181D The same happens on Nvidia's OpenGL driver, where they do something like this (default capabilities, GL 4.5 compatibility): (state & DEPTH_CLAMP) != 0 ? 0x201a : 0x281c There's always a difference between the first bits in this register, but bit 11 is consistently disabled on both deko3d/NVN and OpenGL. This commit changes yuzu's behaviour to use bit 11 to determine depth clamping. - Fixes depth issues on Super Mario Odyssey's intro.	2020-04-27 20:50:14 -03:00
Fernando Sahmkow	1517cba8ca	Merge pull request #3766 from ReinUsesLisp/renderpass-cache-key vk_renderpass_cache: Pack renderpass cache key and unify keys	2020-04-27 16:05:14 -04:00
Fernando Sahmkow	a65e9ad552	Merge pull request #3756 from ReinUsesLisp/integrated-devices vk_memory_manager: Remove unified memory model flag	2020-04-27 16:04:22 -04:00
ReinUsesLisp	8da16cf9fb	texture_cache: Reintroduce preserve_contents accurately This reverts commit `94b0e2e5da`. preserve_contents proved to be a meaningful optimization. This commit reintroduces it but properly implemented on OpenGL. We have to make sure the clear removes all the previous contents of the image. It's not currently implemented on Vulkan because we can do smart things there that's preferred to be introduced in a separate commit.	2020-04-26 19:53:02 -03:00
Rodrigo Locatti	7e38dd580f	Merge pull request #3753 from ReinUsesLisp/ac-vulkan {gl,vk}_rasterizer: Add lazy default buffer maker and use it for empty buffers	2020-04-26 01:55:43 -03:00
ReinUsesLisp	ddd82ef42b	shader/memory_util: Deduplicate code Deduplicate code shared between vk_pipeline_cache and gl_shader_cache as well as shader decoder code. While we are at it, fix a bug in gl_shader_cache where compute shaders had an start offset of a stage shader.	2020-04-26 01:38:51 -03:00
ReinUsesLisp	255197e643	shader/arithmetic_integer: Implement CC for IADD	2020-04-25 22:55:26 -03:00
bunnei	c5bf693882	Merge pull request #3721 from ReinUsesLisp/sort-devices vulkan/wrapper: Sort physical devices	2020-04-25 03:27:40 -04:00
ReinUsesLisp	527a1574c3	vk_rasterizer: Pack texceptions and color formats on invalid formats Sometimes for unknown reasons NVN games can bind a render target format of 0. This may be a yuzu bug. With the commits before this the formats were specified without being "packed", assuming all formats and texceptions will be written like in the color_attachments vector. To address this issue, iterate all render targets and pack them as they are valid. This way they will match color_attachments. - Fixes validation errors and graphical issues on Breath of the Wild.	2020-04-24 22:21:29 -03:00
Markus Wick	c499c22cf7	Fix -Werror=conversion error.	2020-04-24 09:33:04 +02:00
ReinUsesLisp	72deb773fd	shader_ir: Turn classes into data structures	2020-04-23 18:00:06 -03:00
ReinUsesLisp	3e35101895	vk_rasterizer: Fix framebuffer creation validation errors Framebuffer creation was ignoring the number of color attachments.	2020-04-23 17:34:16 -03:00
ReinUsesLisp	8c37cd1af6	vk_pipeline_cache: Unify pipeline cache keys into a single operation This allows us to call Common::CityHash and std::memcmp only once for GraphicsPipelineCacheKey. While we are at it, do the same for compute.	2020-04-23 17:34:16 -03:00
ReinUsesLisp	f665c92114	vk_renderpass_cache: Pack renderpass cache key to 12 bytes	2020-04-23 17:34:16 -03:00
bunnei	bf2ddb8fd5	Merge pull request #3677 from FernandoS27/better-sync Introduce Predictive Flushing and Improve ASYNC GPU	2020-04-22 22:09:38 -04:00
ReinUsesLisp	d9463f4562	vk_pipeline_cache: Fix unintentional memcpy into optional The intention behind this was to assign a float to from an uint32_t, but it was unintentionally being copied directly into the std::optional. Copy to a temporary and assign that temporary to std::optional. This can be replaced with std::bit_cast<float> once we are in C++20.	2020-04-22 21:36:05 -03:00
Fernando Sahmkow	afae40a99e	Merge pull request #3653 from ReinUsesLisp/nsight-aftermath renderer_vulkan: Integrate Nvidia Nsight Aftermath on Windows	2020-04-22 11:39:01 -04:00
Fernando Sahmkow	39e5b72948	Async GPU: Correct flushing behavior to be similar to old async GPU behavior.	2020-04-22 11:36:26 -04:00
Fernando Sahmkow	644588fd88	ShaderCache/PipelineCache: Cache null shaders.	2020-04-22 11:36:25 -04:00
Fernando Sahmkow	f616dc0b59	Address Feedback.	2020-04-22 11:36:24 -04:00
ReinUsesLisp	b752faf2d3	vk_fence_manager: Initial implementation	2020-04-22 11:36:19 -04:00
Fernando Sahmkow	131b342130	OpenGL: Guarantee writes to Buffers.	2020-04-22 11:36:18 -04:00
Fernando Sahmkow	1fb516cd97	GPU: Implement Flush Requests for Async mode.	2020-04-22 11:36:17 -04:00
Fernando Sahmkow	b7bc3c2549	FenceManager: Manage syncpoints and rename fences to semaphores.	2020-04-22 11:36:16 -04:00
Fernando Sahmkow	4adfc9bb08	Rasterizer: Document SignalFence & ReleaseFences and setup skeletons on Vulkan.	2020-04-22 11:36:14 -04:00
Fernando Sahmkow	165ae823f5	ThreadManager: Sync async reads on accurate gpu.	2020-04-22 11:36:12 -04:00
Fernando Sahmkow	8b1eb44b3e	BufferCache: Implement OnCPUWrite and SyncGuestHost	2020-04-22 11:36:07 -04:00
Fernando Sahmkow	da8f17715d	GPU: Refactor synchronization on Async GPU	2020-04-22 11:36:06 -04:00
ReinUsesLisp	6f47bd9641	vk_memory_manager: Remove unified memory model flag All drivers (even Intel) seem to have a device local memory type that is not host visible. Remove this flag so all devices follow the same path. This fixes a crash when trying to map to host device local memory on integrated devices.	2020-04-21 22:06:38 -03:00
ReinUsesLisp	488ed8bd02	vk_rasterizer: Add lazy default buffer maker and use it for empty buffers Introduce a default buffer getter that lazily constructs an empty buffer. This is intended to match OpenGL's buffer 0. Use this for disabled vertex and uniform buffers. While we are at it, include vertex buffer usages for staging buffers to silence validation errors.	2020-04-21 19:55:52 -03:00
ReinUsesLisp	0bbae63300	gl_rasterizer: Fix buffers without size On NVN buffers can be enabled but have no size. According to deko3d and the behavior we see in Animal Crossing: New Horizons these buffers get the special address of 0x1000 and limit themselves to 0xfff. Implement buffers without a size by binding a null buffer to OpenGL without a side. `1d1930beea/source/maxwell/gpu_3d_vbo.cpp (L62-L63)`	2020-04-21 19:55:44 -03:00
Rodrigo Locatti	f293b15611	Merge pull request #3718 from ReinUsesLisp/better-pipeline-state fixed_pipeline_state: Pack structure, use memcmp and CityHash on it	2020-04-21 18:17:58 -03:00
Mat M	cb5b8ca886	Merge pull request #3733 from ambasta/patch-2 Initialize quad_indexed_pass before uint8_pass	2020-04-20 20:36:46 -04:00
Fernando Sahmkow	ec2f8f4272	Merge pull request #3700 from ReinUsesLisp/stream-buffer-sizes vk_stream_buffer: Fix out of memory on boot on recent Nvidia drivers	2020-04-20 09:37:42 -04:00
Amit Prakash Ambasta	5324b1d01e	Initialize quad_indexed_pass before uint8_pass Fixes Werror=reorder in gcc	2020-04-20 04:53:52 +05:30
bunnei	85c17a2c35	Merge pull request #3694 from ReinUsesLisp/indexed-quads vk_compute_pass: Implement indexed quads	2020-04-19 16:52:40 -04:00
Jan Beich	afcc84a172	renderer_vulkan: assume X11 if not Windows/macOS after `bf1d66b7c0` Render.Vulkan <Error> video_core/renderer_vulkan/renderer_vulkan.cpp:CreateInstance:131: Presentation not supported on this platform Render.Vulkan <Error> video_core/renderer_vulkan/renderer_vulkan.cpp:CreateSurface:378: Presentation not supported on this platform Core <Critical> core/core.cpp:Load:199: Failed to initialize system (Error 5)!	2020-04-19 00:32:23 +00:00
ReinUsesLisp	c81bf06d03	vulkan/wrapper: Sort physical devices Sort discrete GPUs over the rest, Nvidia over AMD, AMD over Intel, Intel over the rest. This gives us a somewhat consistent order when Optimus is removed (renderdoc does this when it's attached). This can break the configuration of users with an Intel GPU that manually remove Optimus on yuzu. That said, it's a very unlikely to happen.	2020-04-18 21:31:15 -03:00
ReinUsesLisp	d62f57cf5a	fixed_pipeline_state: Hash and compare the whole structure Pad FixedPipelineState's size to 384 bytes to be a multiple of 16. Compare the whole struct with std::memcmp and hash with CityHash. Using CityHash instead of a naive hash should reduce the number of collisions. Improve used type traits to ensure this operation is safe. With these changes the improvements to the hashable pipeline state are: Optimized structure Hash: 89 ns Comparison: 103 ns Construction: 164 ns Struct size: 384 bytes Original structure Hash: 148 ns Equal: 174 ns Construction: 281 ns Size: 1384 bytes * Attribute state initialization is not measured These measures are averages taken with std::chrono::high_accuracy_clock on MSVC shipped on Visual Studio 16.6.0 Preview 2.1.	2020-04-18 19:57:26 -03:00
ReinUsesLisp	b571c92dfd	fixed_pipeline_state: Pack blending state Reduce FixedPipelineState's size to 364 bytes.	2020-04-18 19:23:35 -03:00
ReinUsesLisp	548dd27f45	fixed_pipeline_state: Pack rasterizer state Reduce FixedPipelineState's size to 600 bytes.	2020-04-18 19:22:57 -03:00
ReinUsesLisp	7790144a55	fixed_pipeline_state: Pack depth stencil state Reduce FixedPipelineState's size to 632 bytes.	2020-04-18 19:22:11 -03:00
ReinUsesLisp	ab6704f20c	fixed_pipeline_state: Pack attribute state Reduce FixedPipelineState's size from 1384 to 664 bytes	2020-04-18 19:21:19 -03:00
ReinUsesLisp	a7b6bd56d7	vk_stream_buffer: Fix out of memory on boot on recent Nvidia drivers Nvidia recently introduced a new memory type for data streaming (awesome!), but yuzu was assuming that all heaps had enough memory for the assumed stream buffer size (256 MiB). This worked fine on AMD but Nvidia's new memory heap was smaller than 256 MiB. This commit changes this assumption and allocates a bit less than the size of the preferred heap, with a maximum of 256 MiB (to avoid allocating all system memory on integrated devices). - Fixes a crash on NVIDIA 450.82.0.0	2020-04-17 18:12:48 -03:00
ReinUsesLisp	c961770900	vk_compute_pass: Implement indexed quads Implement indexed quads (GL_QUADS used with glDrawElements*) with a compute pass conversion. The compute shader converts from uint8/uint16/uint32 indices to uint32. The format is passed through push constants to avoid having different variants of the same shader. - Used by Fast RMX - Used by Xenoblade Chronicles 2 (it still has graphical due to synchronization issues on Vulkan)	2020-04-16 21:12:32 -03:00
Fernando Sahmkow	c81f256111	Merge pull request #3600 from ReinUsesLisp/no-pointer-buf-cache buffer_cache: Return handles instead of pointer to handles	2020-04-16 19:58:13 -04:00
ReinUsesLisp	090fd3fefa	buffer_cache: Return handles instead of pointer to handles The original idea of returning pointers is that handles can be moved. The problem is that the implementation didn't take that in mind and made everything harder to work with. This commit drops pointer to handles and returns the handles themselves. While it is still true that handles can be invalidated, this way we get an old handle instead of a dangling pointer. This problem can be solved in the future with sparse buffers.	2020-04-16 02:33:34 -03:00
Lioncash	11837e8f13	video_core: Amend doxygen comment references Fixes broken documentation references.	2020-04-15 22:33:29 -04:00
Fernando Sahmkow	e33196d4e7	Merge pull request #3612 from ReinUsesLisp/red shader/memory: Implement RED.E.ADD and minor changes to ATOM	2020-04-15 15:03:49 -04:00
Mat M	9208d555b7	Merge pull request #3668 from ReinUsesLisp/vtx-format-16ui maxwell_to_vk: Add uint16 vertex formats	2020-04-15 11:43:52 -04:00
ReinUsesLisp	3036067047	maxwell_to_vk: Add uint16 vertex formats	2020-04-15 04:06:30 -03:00
ReinUsesLisp	b4e43c64c8	maxwell_to_vk: Add missing breaks Avoid invalid fallbacks.	2020-04-15 04:05:33 -03:00
ReinUsesLisp	0ca456830f	vk_blit_screen: Initialize all members in VkPipelineViewportStateCreateInfo When the dynamic state is specified, pViewports and pScissors are ignored, quoting the specification: pViewports is a pointer to an array of VkViewport structures, defining the viewport transforms. If the viewport state is dynamic, this member is ignored. That said, AMD's proprietary driver itself seem to read it regardless of what the specification says.	2020-04-15 03:30:08 -03:00
ReinUsesLisp	37e5c4fa7c	vk_rasterizer: Default to 1 viewports with a size of 0 Silence validation layer errors.	2020-04-14 04:44:34 -03:00
ReinUsesLisp	0e232cfdc1	renderer_vulkan: Integrate Nvidia Nsight Aftermath on Windows Adds optional support for Nsight Aftermath. It is enabled through ENABLE_NSIGHT_AFTERMATH in cmake. A path to the SDK has to be provided by the environment variable NSIGHT_AFTERMATH_SDK. Nsight Aftermath allows an application to generate "minidumps" of the GPU state when a device loss happens. By analysing these on Nsight we can know what a game was doing and why it triggered a device loss. The dump is generated inside %APPDATA%\yuzu\log\gpucrash and this directory is deleted every time a new instance is initialized with Nsight enabled. To enable it on yuzu there has a to be a driver and device capable of running Nsight Aftermath on Vulkan. That means only Turing based GPUs on the latest stable driver, beta drivers won't work for now. It is manually enabled in Configuration>Debug>Enable Graphics Debugging because when using all debugging capabilities there is a runtime cost.	2020-04-14 00:39:21 -03:00
ReinUsesLisp	6cfe2a7246	renderer_vulkan: Remove Nvidia checkpoints	2020-04-13 17:33:59 -03:00
ReinUsesLisp	16105c6a66	renderer_vulkan: Catch device losses in more places	2020-04-13 17:33:59 -03:00
Rodrigo Locatti	7e4a132a77	Merge pull request #3636 from ReinUsesLisp/drop-vk-hpp renderer_vulkan: Drop Vulkan-Hpp	2020-04-13 17:08:04 -03:00
ReinUsesLisp	94b0e2e5da	texture_cache: Remove preserve_contents preserve_contents was always true. We can't assume we don't have to preserve clears because scissored and color masked clears exist. This removes preserve_contents and assumes it as true at all times.	2020-04-11 01:51:02 -03:00
ReinUsesLisp	2905142f47	renderer_vulkan: Drop Vulkan-Hpp	2020-04-10 22:49:02 -03:00
bunnei	51c6688e21	Merge pull request #3594 from ReinUsesLisp/vk-instance yuzu: Drop SDL2 and Qt frontend Vulkan requirements	2020-04-10 20:06:55 -04:00
Fernando Sahmkow	7cd6daf115	VkRasterizer: Eliminate Legacy code.	2020-04-08 18:59:09 -04:00
Fernando Sahmkow	913f42a3a7	Memory: Address Feedback.	2020-04-08 13:40:46 -04:00
ReinUsesLisp	bf1d66b7c0	yuzu: Drop SDL2 and Qt frontend Vulkan requirements Create Vulkan instances and surfaces from the Vulkan backend.	2020-04-07 16:32:19 -03:00
ReinUsesLisp	bc1b4b85b0	renderer_vulkan: Query device names from the backend	2020-04-07 02:23:23 -03:00
Fernando Sahmkow	ea535d9470	Shader/Pipeline Cache: Use VAddr instead of physical memory for addressing.	2020-04-06 09:23:07 -04:00
Fernando Sahmkow	3dd5c07454	Query Cache: Use VAddr instead of physical memory for adressing.	2020-04-06 09:23:07 -04:00
Fernando Sahmkow	7fcd0fee6d	Buffer Cache: Use vAddr instead of physical memory.	2020-04-06 09:23:06 -04:00
Fernando Sahmkow	6ee316cb8f	Texture Cache: Use vAddr instead of physical memory for caching.	2020-04-06 09:23:05 -04:00
Fernando Sahmkow	9c0f40a1f5	GPU: Setup Flush/Invalidate to use VAddr instead of CacheAddr	2020-04-06 09:21:46 -04:00
Fernando Sahmkow	588a20be3f	Merge pull request #3513 from ReinUsesLisp/native-astc video_core: Use native ASTC when available	2020-04-06 09:21:11 -04:00
ReinUsesLisp	3185245845	shader/memory: Implement RED.E.ADD Implements a reduction operation. It's an atomic operation that doesn't return a value. This commit introduces another primitive because some shading languages might have a primitive for reduction operations.	2020-04-06 02:24:47 -03:00
Fernando Sahmkow	69277de29d	Merge pull request #3592 from ReinUsesLisp/ipa shader_decompiler: Remove FragCoord.w hack and change IPA implementation	2020-04-05 19:29:40 -04:00
Rodrigo Locatti	825a6e2615	Merge pull request #3552 from jroweboy/single-context Refactor Context management (Fixes renderdoc on opengl issues)	2020-04-02 01:38:25 -03:00
ReinUsesLisp	2339fe199f	shader_decompiler: Remove FragCoord.w hack and change IPA implementation Credits go to gdkchan and Ryujinx. The pull request used for this can be found here: https://github.com/Ryujinx/Ryujinx/pull/1082 yuzu was already using the header for interpolation, but it was missing the FragCoord.w multiplication described in the linked pull request. This commit finally removes the FragCoord.w == 1.0f hack from the shader decompiler. While we are at it, this commit renames some enumerations to match Nvidia's documentation (linked below) and fixes component declaration order in the shader program header (z and w were swapped). https://github.com/NVIDIA/open-gpu-doc/blob/master/Shader-Program-Header/Shader-Program-Header.html	2020-04-01 21:48:55 -03:00
ReinUsesLisp	2f0da10dc3	vk_device: Add missing ASTC queries	2020-04-01 01:14:04 -03:00
ReinUsesLisp	b6571ca9f0	video_core: Use native ASTC when available	2020-04-01 01:14:04 -03:00
Rodrigo Locatti	baf91c920c	Merge pull request #3591 from ReinUsesLisp/vk-wrapper-part2 renderer_vulkan/wrapper: Add a Vulkan wrapper (part 2 of 2)	2020-03-31 22:14:26 -03:00
ReinUsesLisp	f22f6b72c3	renderer_vulkan/wrapper: Add vkEnumerateInstanceExtensionProperties wrapper	2020-03-31 21:32:08 -03:00
ReinUsesLisp	27dd542c60	renderer_vulkan/wrapper: Add command buffer handle	2020-03-31 21:32:08 -03:00
ReinUsesLisp	5c90d060d8	renderer_vulkan/wrapper: Add physical device handle	2020-03-31 21:32:08 -03:00
ReinUsesLisp	0eb37de98f	renderer_vulkan/wrapper: Add device handle	2020-03-31 21:32:08 -03:00
ReinUsesLisp	11774308d3	renderer_vulkan/wrapper: Add swapchain handle	2020-03-31 21:32:07 -03:00
ReinUsesLisp	7fe52ef77f	renderer_vulkan/wrapper: Add fence handle	2020-03-31 21:32:07 -03:00
ReinUsesLisp	3a63ae0658	renderer_vulkan/wrapper: Add device memory handle	2020-03-31 21:32:07 -03:00
ReinUsesLisp	397f53dea1	renderer_vulkan/wrapper: Add pool handles	2020-03-31 21:32:07 -03:00
ReinUsesLisp	affee77b70	renderer_vulkan/wrapper: Add buffer and image handles	2020-03-31 21:32:07 -03:00
ReinUsesLisp	d85ca0ab33	renderer_vulkan/wrapper: Add queue handle	2020-03-31 21:32:07 -03:00
ReinUsesLisp	151ddcf419	renderer_vulkan/wrapper: Add instance handle	2020-03-31 21:32:07 -03:00
Rodrigo Locatti	c19425ed69	Merge pull request #3506 from namkazt/patch-9 shader_decode: Implement partial ATOM/ATOMS instr	2020-03-31 00:56:28 -03:00
Rodrigo Locatti	69728e8ad5	Merge pull request #3566 from ReinUsesLisp/vk-wrapper-part1 renderer_vulkan/wrapper: Add a Vulkan wrapper (part 1 of 2)	2020-03-30 21:57:36 -03:00
Nguyen Dac Nam	a2cc80b605	vk_decompiler: add atomic op and handler function.	2020-03-30 17:44:45 +07:00
ReinUsesLisp	b6c9fba81c	renderer_vulkan/wrapper: Address feedback	2020-03-28 04:09:02 -03:00
ReinUsesLisp	2694552b7f	renderer_vulkan/wrapper: Add owning handles	2020-03-27 03:21:04 -03:00
ReinUsesLisp	7413b30923	renderer_vulkan/wrapper: Add pool allocations owning templated class	2020-03-27 03:21:04 -03:00
ReinUsesLisp	d8d392b39a	renderer_vulkan/wrapper: Add owning handle templated class	2020-03-27 03:21:04 -03:00
ReinUsesLisp	60f351084a	renderer_vulkan/wrapper: Add destroy and free overload set	2020-03-27 03:21:04 -03:00
ReinUsesLisp	a9e4528d10	renderer_vulkan/wrapper: Add dispatch table and loaders	2020-03-27 03:21:04 -03:00
ReinUsesLisp	3f0b7673f0	renderer_vulkan/wrapper: Add exception class	2020-03-27 03:21:04 -03:00
ReinUsesLisp	f5cee0e885	renderer_vulkan/wrapper: Add ToString function for VkResult	2020-03-27 03:21:03 -03:00
ReinUsesLisp	92c8d783b3	renderer_vulkan/wrapper: Add Vulakn wrapper and a span helper The intention behind a Vulkan wrapper is to drop Vulkan-Hpp. The issues with Vulkan-Hpp are: - Regular breaks of the API. - Copy constructors that do the same as the aggregates (fixed recently) - External dynamic dispatch that is hard to remove - Alias KHR handles with non-KHR handles making it impossible to use smart handles on Vulkan 1.0 instances with extensions that were included on Vulkan 1.1. - Dynamic dispatchers silently change size depending on preprocessor definitions. Different files will have different dispatch definitions, generating all kinds of hard to debug memory issues. In other words, Vulkan-Hpp is not "production ready" for our needs and this wrapper aims to replace it without losing RAII and exception safety.	2020-03-27 03:13:18 -03:00
Dan	744b207d92	maxwell_to_vk: implement signedscaled vertex formats	2020-03-27 00:14:19 +01:00
James Rowe	282adfc70b	Frontend/GPU: Refactor context management Changes the GraphicsContext to be managed by the GPU core. This eliminates the need for the frontends to fool around with tricky MakeCurrent/DoneCurrent calls that are dependent on the settings (such as async gpu option). This also refactors out the need to use QWidget::fromWindowContainer as that caused issues with focus and input handling. Now we use a regular QWidget and just access the native windowHandle() directly. Another change is removing the debug tool setting in FrameMailbox. Instead of trying to block the frontend until a new frame is ready, the core will now take over presentation and draw directly to the window if the renderer detects that its hooked by NSight or RenderDoc Lastly, since it was in the way, I removed ScopeAcquireWindowContext and replaced it with a simple subclass in GraphicsContext that achieves the same result	2020-03-24 21:03:42 -06:00
ReinUsesLisp	38c1e77f01	vk_texture_cache: Silence misc warnings	2020-03-18 20:03:19 -03:00
ReinUsesLisp	b6b2e31e5e	vk_staging_buffer_pool: Silence unused constant warning	2020-03-18 20:03:19 -03:00
ReinUsesLisp	fc51ece7bf	vk_rasterizer: Remove unused variable	2020-03-18 20:03:19 -03:00
ReinUsesLisp	98d85cdc20	vk_pipeline_cache: Remove unused variable	2020-03-18 20:03:19 -03:00
ReinUsesLisp	dab450ec46	maxwell_to_vk: Sielence -Wswitch warning	2020-03-18 20:03:19 -03:00
Mat M	edb9cccb36	Merge pull request #3510 from FernandoS27/dirty-write DirtyFlags: relax need to set render_targets as dirty	2020-03-17 17:29:22 -04:00
Mat M	d787856621	Merge pull request #3518 from ReinUsesLisp/scissor-clears vk_rasterizer: Implement scissor clears and layered clears	2020-03-17 17:27:15 -04:00
Mat M	9fdfd58f9f	Merge pull request #3519 from ReinUsesLisp/int-formats maxwell_to_vk: Implement RG32 and RGB32 integer vertex formats	2020-03-17 17:26:16 -04:00
Rodrigo Locatti	b16c8e0e8d	Merge pull request #3515 from ReinUsesLisp/vertex-vk-assert vk_rasterizer: Fix vertex range assert	2020-03-15 21:26:54 -03:00
Rodrigo Locatti	7cc46a6faa	Merge pull request #3501 from ReinUsesLisp/rgba16-snorm video_core: Implement RGBA16_SNORM	2020-03-15 21:24:53 -03:00
Rodrigo Locatti	d64edf21bb	Merge pull request #3503 from makigumo/patch-2 maxwell_to_vk: add vertex format eA2B10G10R10UnormPack32	2020-03-15 21:21:38 -03:00
ReinUsesLisp	52acb7f9a0	maxwell_to_vk: Implement RG32 and RGB32 integer vertex formats	2020-03-15 18:51:49 -03:00
ReinUsesLisp	71cc772988	vk_rasterizer: Implement layered clears	2020-03-15 18:37:19 -03:00
makigumo	f91046bf8d	vk_shader_decompiler: fix linux build	2020-03-15 18:00:14 +01:00
ReinUsesLisp	a7131af7d6	vk_rasterizer: Fix vertex range assert End can be equal to start in CalculateVertexArraysSize. This is quite common when the vertex size is zero.	2020-03-15 04:04:17 -03:00
ReinUsesLisp	8baf98e439	vk_rasterizer: Reimplement clears with vkCmdClearAttachments	2020-03-15 03:40:41 -03:00
Fernando Sahmkow	380fc8d2e1	DirtyFlags: relax need to set render_targets as dirty The texture cache already takes care of setting a render target to dirty when invalidated.	2020-03-14 11:47:33 -04:00
ReinUsesLisp	69c7a01f88	vk/gl_shader_decompiler: Silence assertion on compute	2020-03-13 18:33:05 -03:00
ReinUsesLisp	62560f1e63	vk_shader_decompiler: Fix default varying regression	2020-03-13 18:33:05 -03:00
Rodrigo Locatti	47459f6a36	vk_shader_decompiler: Fix implicit type conversion Co-Authored-By: Mat M. <mathew1800@gmail.com>	2020-03-13 18:33:05 -03:00
ReinUsesLisp	2fae1e6205	vk_rasterizer: Implement transform feedback binding zero	2020-03-13 18:33:05 -03:00
ReinUsesLisp	b67360c0f8	vk_shader_decompiler: Add XFB decorations to generic varyings	2020-03-13 18:33:05 -03:00
ReinUsesLisp	8d5bdcb17b	vk_device: Enable VK_EXT_transform_feedback when available	2020-03-13 18:33:05 -03:00
ReinUsesLisp	c320702092	vk_device: Shrink formatless capability name size	2020-03-13 18:33:05 -03:00
ReinUsesLisp	7acebd7eb6	vk_shader_decompiler: Use registry for specialization	2020-03-13 18:33:05 -03:00
Rodrigo Locatti	244fe13219	Merge branch 'master' into shader-purge	2020-03-13 16:44:06 -03:00
makigumo	753bc2026f	fix formatting	2020-03-13 11:37:24 +01:00
makigumo	54681909be	maxwell_to_vk: add vertex format eA2B10G10R10UnormPack32	2020-03-13 11:26:13 +01:00
Fernando Sahmkow	00e9ba0603	Merge pull request #3483 from namkazt/patch-1 vk_rasterizer: fix mistype on SetupGraphicsImages	2020-03-12 22:10:48 -04:00
Fernando Sahmkow	f159a12820	Merge pull request #3480 from ReinUsesLisp/vk-disabled-ubo vk_rasterizer: Support disabled uniform buffers	2020-03-12 22:09:49 -04:00
ReinUsesLisp	4dcca90ef4	video_core: Implement RGBA16_SNORM Implement RGBA16_SNORM with the current API. Nothing special here.	2020-03-12 21:42:33 -03:00
ReinUsesLisp	e8efd5a901	video_core: Rename "const buffer locker" to "registry"	2020-03-09 18:40:06 -03:00
Rodrigo Locatti	22e825a3bc	Merge pull request #3301 from ReinUsesLisp/state-tracker video_core: Remove gl_state and use a state tracker based on dirty flags	2020-03-09 18:34:37 -03:00
Nguyen Dac Nam	16cfbb068c	vk_reasterizer: fix mistype on SetupGraphicsImages This should use Maxwell3D engine. Fixed some GPU error on Kirby and maybe other games.	2020-03-08 10:06:59 +07:00
bunnei	662feb8c1c	Merge pull request #3481 from ReinUsesLisp/abgr5-storage maxwell_to_vk: Remove Storage capability for A1B5G5R5U	2020-03-07 19:51:33 -05:00
ReinUsesLisp	e4f9ce0379	vk_rasterizer: Support disabled uniform buffers	2020-03-06 18:47:51 -03:00
ReinUsesLisp	aa6fe3f1aa	maxwell_to_vk: Remove Storage capability for A1B5G5R5U	2020-03-06 18:47:27 -03:00
bunnei	49eff536d0	Merge pull request #3463 from ReinUsesLisp/vk-toctou vk_swapchain: Silence TOCTOU race condition	2020-03-05 19:38:42 -05:00
bunnei	0361aa1915	Merge pull request #3451 from ReinUsesLisp/indexed-textures vk_shader_decompiler: Implement indexed textures	2020-03-05 11:42:46 -05:00
bunnei	67e7186d79	Merge pull request #3455 from ReinUsesLisp/attr-scaled video_core: Implement more scaled attribute formats	2020-03-03 22:46:20 -05:00
ReinUsesLisp	ac204754d4	dirty_flags: Deduplicate code between OpenGL and Vulkan	2020-02-28 17:56:43 -03:00
ReinUsesLisp	6669b359a3	vk_rasterizer: Pass Maxwell registers to dynamic updates	2020-02-28 17:56:43 -03:00
ReinUsesLisp	042256c6bb	state_tracker: Remove type traits with named structures	2020-02-28 17:56:43 -03:00
ReinUsesLisp	6ac3eb4d87	vk_state_tracker: Implement dirty flags for stencil properties	2020-02-28 17:56:43 -03:00
ReinUsesLisp	f9df2c6bcd	vk_state_tracker: Implement dirty flags for depth bounds	2020-02-28 17:56:43 -03:00
ReinUsesLisp	cd0e28c9ec	vk_state_tracker: Implement dirty flags for blend constants	2020-02-28 17:56:43 -03:00
ReinUsesLisp	a33870996b	vk_state_tracker: Implement dirty flags for depth bias	2020-02-28 17:56:43 -03:00
ReinUsesLisp	42f1874965	vk_state_tracker: Implement dirty flags for scissors	2020-02-28 17:56:43 -03:00
ReinUsesLisp	1bd95a314f	vk_state_tracker: Initial implementation Add support for render targets and viewports.	2020-02-28 17:56:43 -03:00
ReinUsesLisp	9e74e6988b	maxwell_3d: Flatten cull and front face registers	2020-02-28 17:56:41 -03:00
ReinUsesLisp	96ac3d518a	gl_rasterizer: Remove dirty flags	2020-02-28 16:39:27 -03:00
ReinUsesLisp	0aaa69e4d7	vk_swapchain: Silence TOCTOU race condition It's possible that the window is resized from the moment we ask for its size to the moment a swapchain is created, causing validation issues. To workaround this Vulkan issue request the capabilities again just before creating the swapchain, making the race condition less likely.	2020-02-26 17:07:18 -03:00
bunnei	e25297536f	frontend: qt: bootmanager: Vulkan: Restore support for VK backend.	2020-02-25 21:23:01 -05:00
bunnei	78ab2e0474	Merge pull request #3417 from ReinUsesLisp/r32i texture: Implement R32I	2020-02-25 14:08:45 -05:00
bunnei	e22ad52cdb	Merge pull request #3425 from ReinUsesLisp/layered-framebuffer texture_cache: Implement layered framebuffer attachments	2020-02-24 10:14:50 -05:00
ReinUsesLisp	1e9213632a	vk_shader_decompiler: Implement indexed textures Implement accessing textures through an index. It uses the same interface as OpenGL, the main difference is that Vulkan bindings are forced to be arrayed (the binding index doesn't change for stacked textures in SPIR-V).	2020-02-24 01:26:07 -03:00
ReinUsesLisp	e2dd59e341	video_core: Implement more scaler attribute formats While changing this, fix assert in vk_shader_decompiler. We now know scaled formats are expected to be float in shaders attributes.	2020-02-24 00:27:37 -03:00
bunnei	2b4cdb73b6	Merge pull request #3424 from ReinUsesLisp/spirv-layer vk_shader_decompiler: Implement Layer output attribute	2020-02-22 23:45:16 -05:00
Rodrigo Locatti	4a6a1aeab4	Merge pull request #3433 from namkazt/patch-1 renderer_vulkan: Add the rest of case for TryConvertBorderColor	2020-02-21 15:56:09 -03:00
Rodrigo Locatti	ef27b4b7b5	Merge pull request #3434 from namkazt/patch-2 vk_shader: Implement ImageLoad	2020-02-21 15:55:05 -03:00
Rodrigo Locatti	6b2719c0bb	Merge pull request #3435 from namkazt/patch-3 vulkan: add DXT23_SRGB	2020-02-21 15:48:19 -03:00
Nguyen Dac Nam	c0c4da27d9	vk_device: remove left over from other branch	2020-02-21 08:56:18 +07:00
Nguyen Dac Nam	ecf275887b	clang-format	2020-02-20 09:39:30 +07:00
Nguyen Dac Nam	fbbad95845	shader_decompiler: only add StorageImageReadWithoutFormat when available	2020-02-20 09:28:13 +07:00
bunnei	b2bc7682b4	Merge pull request #3414 from ReinUsesLisp/maxwell-3d-draw maxwell_3d: Unify draw methods	2020-02-19 16:13:50 -05:00
Nguyen Dac Nam	88cb05e6e7	shader_decompiler: add check in case of device not support ShaderStorageImageReadWithoutFormat	2020-02-19 12:57:22 +07:00
Nguyen Dac Nam	e61c7e9310	vk_device: setup shaderStorageImageReadWithoutFormat	2020-02-19 12:56:36 +07:00
Nguyen Dac Nam	47106ab152	vk_device: add check for shaderStorageImageReadWithoutFormat	2020-02-19 12:55:56 +07:00
bunnei	e545c2322c	Merge pull request #3410 from ReinUsesLisp/vk-draw-index vk_shader_decompiler: Fix vertex id and instance id	2020-02-18 22:37:33 -05:00
Nguyen Dac Nam	2ef8af93aa	vk_shader: add Capability StorageImageReadWithoutFormat	2020-02-19 10:16:51 +07:00
Nguyen Dac Nam	f6f0762e81	vk_shader: Implement function ImageLoad (Used by Kirby Start Allies) Please enter the commit message for your changes. Lines starting	2020-02-19 08:39:01 +07:00
Nguyen Dac Nam	ec206f7f95	fixups mistake auto commit.	2020-02-19 01:24:32 +07:00
Nguyen Dac Nam	eaf60ca5d8	Update code structure Co-Authored-By: Mat M. <mathew1800@gmail.com>	2020-02-19 01:23:08 +07:00
Nguyen Dac Nam	9295966d26	add vertex UnsignedInt size RGBA	2020-02-18 21:52:51 +07:00
Nguyen Dac Nam	9fc42fffd9	add eBc2SrgbBlock to formats	2020-02-18 21:44:09 +07:00
Nguyen Dac Nam	493f0ad904	vulkan: add DXT23_SRGB	2020-02-18 21:39:50 +07:00
Nguyen Dac Nam	ba84f0988f	renderer_vulkan: Add the rest of case for TryConvertBorderColor	2020-02-18 16:52:54 +07:00
ReinUsesLisp	6a0220b2e1	texture_cache: Implement layered framebuffer attachments Layered framebuffer attachments is a feature that allows applications to write attach layered textures to a single attachment. What layer the fragments are written to is decided from the shader using gl_Layer.	2020-02-16 04:19:32 -03:00
ReinUsesLisp	1caf3f11c8	vk_shader_decompiler: Implement Layer output attribute SPIR-V's Layer is GLSL's gl_Layer. It lets the application choose from a shader stage (vertex, tessellation or geometry) which framebuffer layer write the output fragments to.	2020-02-16 04:17:37 -03:00
ReinUsesLisp	14c2a4a2ec	texture: Implement R32I	2020-02-15 16:26:50 -03:00
ReinUsesLisp	91aa58e410	maxwell_3d: Unify draw methods Pass instanced state of a draw invocation as an argument instead of having two separate virtual methods.	2020-02-14 18:09:40 -03:00
ReinUsesLisp	bcd348f238	vk_query_cache: Implement generic query cache on Vulkan	2020-02-14 17:38:27 -03:00
ReinUsesLisp	cbea8c74de	vk_shader_decompiler: Fix vertex id and instance id Vulkan's VertexIndex and InstanceIndex don't match with hardware. This is because Nvidia implements gl_VertexID and gl_InstanceID. The math that relates these is: gl_VertexIndex = gl_BaseVertex + gl_VertexID gl_InstanceIndex = gl_InstanceIndex + gl_InstanceID To emulate it using what Vulkan's SPIR-V offers (the Index variants) this commit substracts gl_Base from gl_*Index to obtain the OpenGL and hardware's equivalent.	2020-02-13 20:25:28 -03:00
ReinUsesLisp	0eb36c90f4	vk_rasterizer: Use noexcept variants of std::bitset Removes bounds checking from "texceptions" instances.	2020-02-04 18:04:24 -03:00
bunnei	c31ec00d67	Merge pull request #3337 from ReinUsesLisp/vulkan-staged yuzu: Implement Vulkan frontend	2020-02-03 16:56:25 -05:00
bunnei	b5bbe7e752	Merge pull request #3282 from FernandoS27/indexed-samplers Partially implement Indexed samplers in general and specific code in GLSL	2020-02-01 20:41:40 -05:00
ReinUsesLisp	f92cbc5501	yuzu: Implement Vulkan frontend Adds a Qt and SDL2 frontend for Vulkan. It also finishes the missing bits on Vulkan initialization.	2020-01-29 17:53:11 -03:00
ReinUsesLisp	788d57d723	settings: Add settings for graphics backend	2020-01-29 17:53:11 -03:00
ReinUsesLisp	d95d4ac843	shader/memory: Implement ATOM.ADD ATOM operates atomically on global memory. For now only add ATOM.ADD since that's what was found in commercial games. This asserts for ATOM.ADD.S32 (handling the others as unimplemented), although ATOM.ADD.U32 shouldn't be any different. This change forces us to change the default type on SPIR-V storage buffers from float to uint. We could also alias the buffers, but it's simpler for now to just use uint. While we are at it, abstract the code to avoid repetition.	2020-01-26 01:54:24 -03:00
Fernando Sahmkow	bb8eb15d39	Shader_IR: Address feedback.	2020-01-25 09:04:59 -04:00
Fernando Sahmkow	37b8504faa	Shader_IR: Correct Custom Variable assignment.	2020-01-24 16:44:47 -04:00
Fernando Sahmkow	3c34678627	Shader_IR: Implement Injectable Custom Variables to the IR.	2020-01-24 16:43:31 -04:00
ReinUsesLisp	1690f1adba	vk_shader_decompiler: Disable default values on unwritten render targets Some games like The Legend of Zelda: Breath of the Wild assign render targets without writing them from the fragment shader. This generates Vulkan validation errors, so silence these I previously introduced a commit to set "vec4(0, 0, 0, 1)" for these attachments. The problem is that this is not what games expect. This commit reverts that change.	2020-01-24 01:16:21 -03:00
Fernando Sahmkow	79e0991d9b	Merge pull request #3330 from ReinUsesLisp/vk-blit-screen vk_blit_screen: Initial implementation	2020-01-20 22:32:16 -04:00
ReinUsesLisp	a665581684	vk_blit_screen: Address feedback	2020-01-20 18:43:11 -03:00
bunnei	69b44392a7	Merge pull request #3328 from ReinUsesLisp/vulkan-atoms vk_shader_decompiler: Implement UAtomicAdd (ATOMS) on SPIR-V	2020-01-20 00:01:52 -05:00
bunnei	5a077c95ce	Merge pull request #3322 from ReinUsesLisp/vk-front-face vk_graphics_pipeline: Set front facing properly	2020-01-19 23:22:34 -05:00
ReinUsesLisp	f5dfe68a94	vk_blit_screen: Initial implementation This abstraction takes care of presenting accelerated and non-accelerated or "framebuffer" images to the Vulkan swapchain.	2020-01-19 21:12:43 -03:00
bunnei	41373d212e	Merge pull request #3313 from ReinUsesLisp/vk-rasterizer vk_rasterizer: Implement Vulkan's rasterizer	2020-01-19 18:09:01 -05:00
ReinUsesLisp	b2c976ad0e	vk_shader_decompiler: Implement UAtomicAdd (ATOMS) on SPIR-V Also updates sirit to include atomic instructions.	2020-01-19 16:40:31 -03:00
ReinUsesLisp	94915d4ea1	vk_graphics_pipeline: Set front facing properly Front face was being forced to a certain value when cull face is disabled. Set a default value on initialization and drop the forcefully set front facing value with culling disabled.	2020-01-18 18:50:47 -03:00
bunnei	15163edaaa	Merge pull request #3312 from ReinUsesLisp/atoms-u32 shader/memory: Implement ATOMS.ADD.U32	2020-01-18 00:54:07 -05:00
ReinUsesLisp	09b1d762d7	vk_rasterizer: Address feedback	2020-01-17 21:40:01 -03:00
ReinUsesLisp	fe5356d223	vk_rasterizer: Implement Vulkan's rasterizer This abstraction is Vulkan's equivalent to OpenGL's rasterizer. It takes care of joining all parts of the backend and rendering accordingly on demand.	2020-01-16 23:05:15 -03:00
ReinUsesLisp	38e789c761	renderer_vulkan: Add header as placeholder	2020-01-16 22:54:15 -03:00
bunnei	e041f33569	Merge pull request #3300 from ReinUsesLisp/vk-texture-cache vk_texture_cache: Implement generic texture cache on Vulkan	2020-01-16 19:19:26 -05:00
ReinUsesLisp	f09cd52980	vk_texture_cache: Address feedback	2020-01-16 18:23:10 -03:00
ReinUsesLisp	63ba41a26d	shader/memory: Implement ATOMS.ADD.U32	2020-01-16 17:30:55 -03:00
Rodrigo Locatti	82e1285c1e	vk_texture_cache: Fix typo in commentary Co-Authored-By: MysticExile <30736337+MysticExile@users.noreply.github.com>	2020-01-16 16:59:46 -03:00
bunnei	6985eea519	Merge pull request #3290 from ReinUsesLisp/gl-clamp maxwell_to_vk: Implement GL_CLAMP hacking Nvidia's driver	2020-01-13 19:16:06 -05:00
ReinUsesLisp	09e17fbb0f	vk_texture_cache: Implement generic texture cache on Vulkan It currently ignores PBO linearizations since these should be dropped as soon as possible on OpenGL.	2020-01-13 20:37:50 -03:00
Rodrigo Locatti	b1138e5ea1	vk_compute_pass: Address feedback Comment hardcoded SPIR-V modules.	2020-01-10 22:46:34 -03:00
ReinUsesLisp	3d46709b7f	maxwell_to_vk: Implement GL_CLAMP hacking Nvidia's driver Nvidia's driver defaults invalid enumerations to GL_CLAMP. Vulkan doesn't expose GL_CLAMP through its API, but we can hack it on Nvidia's driver using the internal driver defaults.	2020-01-10 17:12:50 -03:00
ReinUsesLisp	908e085d02	vk_compute_pass: Add compute passes to emulate missing Vulkan features This currently only supports quad arrays and u8 indices. In the future we can remove quad arrays with a table written from the CPU, but this was used to bootstrap the other passes helpers and it was left in the code. The blob code is generated from the "shaders/" directory. Read the instructions there to know how to generate the SPIR-V.	2020-01-08 19:24:26 -03:00
ReinUsesLisp	82a64da077	vk_shader_util: Add helper to build SPIR-V shaders	2020-01-08 19:22:20 -03:00
ReinUsesLisp	6888d776ff	vk_pipeline_cache: Initial implementation Given a pipeline key, this cache returns a pipeline abstraction (for graphics or compute).	2020-01-06 22:02:26 -03:00
ReinUsesLisp	2effdeb924	vk_graphics_pipeline: Initial implementation This abstractio represents the state of the 3D engine at a given draw. Instead of changing individual bits of the pipeline how it's done in APIs like D3D11, OpenGL and NVN; on Vulkan we are forced to put everything together into a single, immutable object. It takes advantage of the few dynamic states Vulkan offers.	2020-01-06 22:02:26 -03:00
ReinUsesLisp	dc96a59fa0	vk_compute_pipeline: Initial implementation This abstraction represents a Vulkan compute pipeline.	2020-01-06 22:02:26 -03:00
ReinUsesLisp	b392a5986e	vk_pipeline_cache: Add file and define descriptor update template filler This function allows us to share code between compute and graphics pipelines compilation.	2020-01-06 22:02:26 -03:00
ReinUsesLisp	3142f1b597	fixed_pipeline_state: Add depth clamp	2020-01-06 22:02:26 -03:00
ReinUsesLisp	9c548146ca	vk_rasterizer: Add placeholder	2020-01-06 22:02:26 -03:00
bunnei	5be00cba15	Merge pull request #3276 from ReinUsesLisp/pipeline-reqs vk_update_descriptor/vk_renderpass_cache: Add pipeline cache dependencies	2020-01-06 17:03:34 -05:00
ReinUsesLisp	5aeff9aff5	vk_renderpass_cache: Initial implementation The renderpass cache is used to avoid creating renderpasses on each draw. The hashed structure is not currently optimized.	2020-01-06 18:28:32 -03:00
ReinUsesLisp	322d6a0311	vk_update_descriptor: Initial implementation The update descriptor is used to store in flat memory a large chunk of staging data used to update descriptor sets through templates. It provides a push interface to easily insert descriptors following the current pipeline. The order used in the descriptor update template has to be implicitly followed. We can catch bugs here using validation layers.	2020-01-06 18:28:32 -03:00
ReinUsesLisp	5b01f80a12	vk_stream_buffer/vk_buffer_cache: Avoid halting and use generic cache The stream buffer before this commit once it was full (no more bytes to write before looping) waiting for all previous operations to finish. This was a temporary solution and had a noticeable performance penalty in performance (from what a profiler showed). To avoid this mark with fences usages of the stream buffer and once it loops wait for them to be signaled. On average this will never wait. Each fence knows where its usage finishes, resulting in a non-paged stream buffer. On the other side, the buffer cache is reimplemented using the generic buffer cache. It makes use of the staging buffer pool and the new stream buffer.	2020-01-06 18:13:41 -03:00
ReinUsesLisp	ceb851b590	vk_memory_manager: Misc changes * Allocate memory in discrete exponentially increasing chunks until the 128 MiB threshold. Allocations larger thant that increase linearly by 256 MiB (depending on the required size). This allows to use small allocations for small resources. * Move memory maps to a RAII abstraction. To optimize for debugging tools (like RenderDoc) users will map/unmap on usage. If this ever becomes a noticeable overhead (from my profiling it doesn't) we can transparently move to persistent memory maps without harming the API, getting optimal performance for both gameplay and debugging. * Improve messages on exceptional situations. * Fix typos "requeriments" -> "requirements". * Small style changes.	2020-01-06 18:13:41 -03:00
ReinUsesLisp	85bb6a6f08	vk_buffer_cache: Temporarily remove buffer cache This is intended for a follow up commit to avoid circular dependencies.	2020-01-06 17:58:46 -03:00
Fernando Sahmkow	56e450a3f7	Merge pull request #3264 from ReinUsesLisp/vk-descriptor-pool vk_descriptor_pool: Initial implementation	2020-01-05 15:54:41 -04:00
bunnei	cd0a7dfdbc	Merge pull request #3258 from FernandoS27/shader-amend Shader_IR: add the ability to amend code in the shader ir.	2020-01-04 14:05:17 -05:00
Fernando Sahmkow	3dd6b55851	Shader_IR: Address Feedback	2020-01-04 14:40:57 -04:00
Rodrigo Locatti	6e347d8d1b	Update src/video_core/renderer_vulkan/vk_descriptor_pool.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com>	2020-01-03 17:34:30 -03:00
ReinUsesLisp	1fe7df4517	vk_descriptor_pool: Initial implementation Create a large descriptor pool where we allocate all our descriptors from. It has to be wide enough to support any pipeline, hence its large numbers. If the descritor pool is filled, we allocate more memory at that moment. This way we can take advantage of permissive drivers like Nvidia's that allocate more descriptors than what the spec requires.	2020-01-01 16:44:06 -03:00
Fernando Sahmkow	b3371ed09e	Shader_IR: add the ability to amend code in the shader ir. This commit introduces a mechanism by which shader IR code can be amended and extended. This useful for track algorithms where certain information can derived from before the track such as indexes to array samplers.	2019-12-30 15:31:48 -04:00
Fernando Sahmkow	7bd447355f	Merge pull request #3248 from ReinUsesLisp/vk-image vk_image: Add an image object abstraction	2019-12-30 14:25:14 -04:00
Rodrigo Locatti	4cbb363d3f	vk_image: Avoid unnecesary equals	2019-12-30 13:28:23 -03:00
Rodrigo Locatti	f2c61bbe13	vk_staging_buffer_pool: Initialize last epoch to zero	2019-12-29 19:19:43 -03:00
ReinUsesLisp	3813af2f3c	vk_staging_buffer_pool: Add a staging pool for temporary operations The job of this abstraction is to provide staging buffers for temporary operations. Think of image uploads or buffer uploads to device memory. It automatically deletes unused buffers.	2019-12-25 18:12:17 -03:00
ReinUsesLisp	c83bf7cd1e	vk_image: Add an image object abstraction This object's job is to contain an image and manage its transitions. Since Nvidia hardware doesn't know what a transition is but Vulkan requires them anyway, we have to state track image subresources individually. To avoid the overhead of tracking each subresource in images with many subresources (think of cubemap arrays with several mipmaps), this commit tracks when subresources have diverged. As long as this doesn't happen we can check the state of the first subresource (that will be shared with all subresources) and update accordingly. Image transitions are deferred to the scheduler command buffer.	2019-12-25 18:00:16 -03:00
ReinUsesLisp	b9e3f5eb36	fixed_pipeline_state: Define symetric operator!= and mark as noexcept Marks as noexcept Hash, operator== and operator!= for consistency.	2019-12-24 18:24:08 -03:00
ReinUsesLisp	4a3026b16b	fixed_pipeline_state: Define structure and loaders The intention behind this hasheable structure is to describe the state of fixed function pipeline state that gets compiled to a single graphics pipeline state object. This is all dynamic state in OpenGL but Vulkan wants it in an immutable state, even if hardware can edit it freely. In this commit the structure is defined in an optimized state (it uses booleans, has paddings and many data entries that can be packed to single integers). This is intentional as an initial implementation that is easier to debug, implement and review. It will be optimized in later stages, or it might change if Vulkan gets more dynamic states.	2019-12-22 22:59:11 -03:00
bunnei	1e76655f83	Merge pull request #3238 from ReinUsesLisp/vk-resource-manager vk_resource_manager: Catch device losses and other changes	2019-12-22 15:57:16 -05:00
Fernando Sahmkow	3dc585d011	Merge pull request #3237 from ReinUsesLisp/vk-shader-decompiler vk_shader_decompiler: Misc changes	2019-12-22 12:36:56 -04:00
Fernando Sahmkow	aea978e037	Merge pull request #3230 from ReinUsesLisp/vk-emu-shaders renderer_vulkan/shader: Add helper GLSL shaders	2019-12-22 11:23:09 -04:00
ReinUsesLisp	af93909c9c	vk_shader_decompiler: Use Visit instead of reimplementing it ExprCondCode visit implements the generic Visit. Use this instead of that one. As an intended side effect this fixes unwritten memory usages in cases when a negation of a condition code is used.	2019-12-20 21:36:25 -03:00
ReinUsesLisp	e41da22c8d	vk_resource_manager: Add entry to VKFence to test its usage	2019-12-19 16:31:34 -03:00
ReinUsesLisp	ec983a2451	vk_reosurce_manager: Add assert for releasing fences Notify the programmer when a request to release a fence is invalid because the fence is already free.	2019-12-19 16:31:34 -03:00
ReinUsesLisp	6ddffa010a	vk_resource_manager: Implement VKFenceWatch move constructor This allows us to put VKFenceWatch inside a std::vector without storing it in heap. On move we have to signal the fences where the new protected resource is, adding some overhead.	2019-12-19 16:31:34 -03:00
ReinUsesLisp	54747d60bc	vk_device: Add entry to catch device losses VK_NV_device_diagnostic_checkpoints allows us to push data to a Vulkan queue and then query it even after a device loss. This allows us to push the current pipeline object and see what was the call that killed the device.	2019-12-19 16:31:33 -03:00
ReinUsesLisp	2a63b3bdb9	vk_shader_decompiler: Fix full decompilation When full decompilation was enabled, labels were not being inserted and instructions were misused. Fix these bugs.	2019-12-19 16:24:45 -03:00
ReinUsesLisp	de918ebeb0	vk_shader_decompiler: Skip NDC correction when it is native Avoid changing gl_Position when the NDC used by the game is [0, 1] (Vulkan's native).	2019-12-19 16:24:45 -03:00
ReinUsesLisp	485c21eac3	vk_shader_decompiler: Normalize output fragment attachments Some games write from fragment shaders to an unexistant framebuffer attachment or they don't write to one when it exists in the framebuffer. Fix this by skipping writes or adding zeroes.	2019-12-19 16:24:45 -03:00
ReinUsesLisp	f4a25f854c	vk_device: Add query for RGBA8Uint	2019-12-19 02:08:29 -03:00
ReinUsesLisp	abb33d4aec	vk_shader_decompiler: Update sirit and implement Texture AOFFI	2019-12-19 01:42:13 -03:00
bunnei	d53cf05513	Merge pull request #3221 from ReinUsesLisp/vk-scheduler vk_scheduler: Delegate commands to a worker thread and state track	2019-12-18 22:04:08 -05:00
ReinUsesLisp	b52297767e	renderer_vulkan/shader: Add helper GLSL shaders These shaders are used to specify code that is not dynamically generated in the Vulkan backend. Instead of packing it inside the build system, it's manually built and copied to the C++ file to avoid adding unnecessary build time dependencies. quad_array should be dropped in the future since it can be emulated with a memory pool generated from the CPU.	2019-12-16 17:59:08 -03:00
ReinUsesLisp	e3ea583893	maxwell_to_vk: Improve image format table and add more formats A1B5G5R5 uses A1R5G5B5. This is flipped with image view swizzles; flushing is still not properly implemented on Vulkan for this particular format.	2019-12-13 03:12:29 -03:00
ReinUsesLisp	f27b21077d	maxwell_to_vk: Implement more vertex formats	2019-12-13 03:12:28 -03:00
ReinUsesLisp	8db8631d81	maxwell_to_vk: Implement more primitive topologies Add an extra argument to query device capabilities in the future. The intention behind this is to use native quads, quad strips, line loops and polygons if these are released for Vulkan.	2019-12-13 03:12:28 -03:00
ReinUsesLisp	15513f0801	maxwell_to_vk: Approach GL_CLAMP closer to the GL spec The OpenGL spec defines GL_CLAMP's formula similarly to CLAMP_TO_EDGE and CLAMP_TO_BORDER depending on the filter mode used. It doesn't exactly behave like this, but it's the closest we can get with what Vulkan offers without emulating it by injecting shader code.	2019-12-13 03:12:28 -03:00
ReinUsesLisp	f845df8651	maxwell_to_vk: Use VK_EXT_index_type_uint8 when available	2019-12-13 02:37:23 -03:00
ReinUsesLisp	2df9a2dcaf	vk_scheduler: Delegate commands to a worker thread and state track Introduce a worker thread approach for delegating Vulkan work derived from dxvk's approach. https://github.com/doitsujin/dxvk Now that the scheduler is what handles all Vulkan work related to command streaming, store state tracking in itself. This way we can know when to reupload Vulkan dynamic state to the queue (since this one is invalidated between command buffers unlike NVN). We can also store the renderpass state and graphics pipeline bound to avoid redundant binds and renderpass begins/ends.	2019-12-13 02:24:48 -03:00
ReinUsesLisp	425a254fa2	shader: Implement MEMBAR.GL Implement using memoryBarrier in GLSL and OpMemoryBarrier on SPIR-V.	2019-12-10 16:45:03 -03:00
ReinUsesLisp	233ed96a5c	vk_shader_decompiler: Fix build issues on old gcc versions	2019-12-10 01:55:38 -03:00
ReinUsesLisp	d30cf51d7d	vk_shader_decompiler: Reduce YNegate's severity	2019-12-09 23:52:28 -03:00
ReinUsesLisp	0b5b93053d	shader_ir/other: Implement S2R InvocationId	2019-12-09 23:52:28 -03:00
ReinUsesLisp	ecbfa416f0	vk_shader_decompiler: Misc changes Update Sirit and its usage in vk_shader_decompiler. Highlights: - Implement tessellation shaders - Implement geometry shaders - Implement some missing features - Use native half float instructions when available.	2019-12-09 23:51:57 -03:00
ReinUsesLisp	19ce0d4f1a	vk_device: Misc changes - Setup more features and requirements. - Improve logging for missing features. - Collect telemetry parameters. - Add queries for more image formats. - Query push constants limits. - Optionally enable some extensions.	2019-12-09 01:04:48 -03:00
ReinUsesLisp	7ea362e134	externals: Update Vulkan-Headers	2019-12-08 22:08:19 -03:00
ReinUsesLisp	f632d00eb1	vk_swapchain: Add support for swapping sRGB We don't know until the game is running if it's using an sRGB color space or not. Add support for hot-swapping swapchain surface formats.	2019-12-06 22:42:08 -03:00
bunnei	e36814d6d5	Merge pull request #3109 from FernandoS27/new-instr Implement FLO & TXD Instructions on GPU Shaders	2019-12-06 18:18:16 -05:00
Lioncash	3f08e8d8d4	core/memory: Migrate over GetPointer() With all of the interfaces ready for migration, it's trivial to migrate over GetPointer().	2019-11-26 21:55:38 -05:00
Lioncash	536fc7f0ea	core: Prepare various classes for memory read/write migration Amends a few interfaces to be able to handle the migration over to the new Memory class by passing the class by reference as a function parameter where necessary. Notably, within the filesystem services, this eliminates two ReadBlock() calls by using the helper functions of HLERequestContext to do that for us.	2019-11-26 21:55:37 -05:00
ReinUsesLisp	c8a48aacc0	video_core: Unify ProgramType and ShaderStage into ShaderType	2019-11-22 21:28:48 -03:00
ReinUsesLisp	48a1687f51	texture_cache: Drop abstracted ComponentType Abstracted ComponentType was not being used in a meaningful way. This commit drops its usage. There is one place where it was being used to test compatibility between two cached surfaces, but this one is implied in the pixel format. Removing the component type test doesn't change the behaviour.	2019-11-14 18:21:42 -03:00
Fernando Sahmkow	cd0f5dfc17	Shader_IR: Implement TXD instruction.	2019-11-14 11:15:27 -04:00
Fernando Sahmkow	f3d1b370aa	Shader_IR: Implement FLO instruction.	2019-11-14 11:15:27 -04:00
ReinUsesLisp	56e237d1f9	shader_ir/warp: Implement FSWZADD	2019-11-07 20:08:41 -03:00
ReinUsesLisp	08b2b1080a	gl_shader_decompiler: Reimplement shuffles with platform agnostic intrinsics	2019-11-07 20:08:41 -03:00
Fernando Sahmkow	8909f52166	Shader_IR: Implement Fast BRX and allow multi-branches in the CFG.	2019-10-25 09:01:30 -04:00
Fernando Sahmkow	7ecf9f7228	Merge pull request #2983 from lioncash/fallthrough gl_shader_decompiler/vk_shader_decompiler: Resolve implicit fallthrough cases	2019-10-22 13:16:46 -04:00
Lioncash	c6bec9aa10	vk_shader_decompiler: Mark operator() function parameters as const references These parameters aren't actually modified in any way, so they can be made const references.	2019-10-17 19:44:00 -04:00
Lioncash	6947bf8e44	vk_shader_decompiler: Resolve fallthrough within ExprDecompiler's ExprCondCode operator() This would previously result in NeverExecute and UnusedIndex being treated as regular predicates.	2019-10-15 19:40:58 -04:00
Fernando Sahmkow	3c09d9abe6	Shader_Ir: Address Feedback and clang format.	2019-10-04 18:52:57 -04:00
Fernando Sahmkow	507a9c6a40	vk_shader_decompiler: Correct Branches inside conditionals.	2019-10-04 18:52:56 -04:00
Fernando Sahmkow	000ad558dd	vk_shader_decompiler: Clean code and be const correct.	2019-10-04 18:52:55 -04:00
Fernando Sahmkow	100a4bd988	vk_shader_compiler: Don't enclose branches with if(true) to avoid crashing AMD	2019-10-04 18:52:54 -04:00
Fernando Sahmkow	466cd52ad4	vk_shader_compiler: Correct SPIR-V AST Decompiling	2019-10-04 18:52:52 -04:00
Fernando Sahmkow	2e9a810423	Shader_IR: allow else derivation to be optional.	2019-10-04 18:52:52 -04:00
Fernando Sahmkow	ca9901867e	vk_shader_compiler: Implement the decompiler in SPIR-V	2019-10-04 18:52:51 -04:00
bunnei	376f1a4432	Merge pull request #2869 from ReinUsesLisp/suld shader/image: Implement SULD and fix SUATOM	2019-09-23 21:47:03 -04:00
FearlessTobi	55d272efe6	video_core: Implement RGBX16F PixelFormat	2019-09-22 02:16:44 +02:00
ReinUsesLisp	44000971e2	gl_shader_decompiler: Use uint for images and fix SUATOM In the process remove implementation of SUATOM.MIN and SUATOM.MAX as these require a distinction between U32 and S32. These have to be implemented with imageCompSwap loop.	2019-09-21 17:33:52 -03:00
ReinUsesLisp	675f23aedc	shader/image: Implement SULD and remove irrelevant code * Implement SULD as float. * Remove conditional declaration of GL_ARB_shader_viewport_layer_array.	2019-09-21 17:32:48 -03:00
ReinUsesLisp	0526bf1895	shader_ir/warp: Implement SHFL	2019-09-17 17:44:07 -03:00
Fernando Sahmkow	18fac59050	Merge pull request #2858 from ReinUsesLisp/vk-device vk_device: Add miscellaneous features and minor style changes	2019-09-14 03:52:06 -04:00
ReinUsesLisp	01d96e1136	vk_device: Add miscellaneous features and minor style changes * Increase minimum Vulkan requirements * Require VK_EXT_vertex_attribute_divisor * Require depthClamp, samplerAnisotropy and largePoints features * Search and expose VK_KHR_uniform_buffer_standard_layout * Search and expose VK_EXT_index_type_uint8 * Search and expose native float16 arithmetics * Track current driver with VK_KHR_driver_properties * Query and expose SSBO alignment * Query more image formats * Improve logging overall * Minor style changes * Minor rephrasing of commentaries	2019-09-13 02:10:07 -03:00
ReinUsesLisp	36abf67e79	shader/image: Implement SUATOM and fix SUST	2019-09-10 20:22:31 -03:00
ReinUsesLisp	4e35177e23	shader_ir: Implement VOTE Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.	2019-08-21 14:50:38 -03:00
Fernando Sahmkow	11f4e739bd	Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done.	2019-07-20 17:38:25 -04:00
ReinUsesLisp	45c162444d	shader/half_set_predicate: Fix HSETP2 implementation	2019-07-19 22:21:22 -03:00
Fernando Sahmkow	1bdb59fc6e	Merge pull request #2695 from ReinUsesLisp/layer-viewport gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders	2019-07-15 16:28:07 -04:00
bunnei	bb67091c77	Merge pull request #2609 from FernandoS27/new-scan Implement a New Shader Scanner, Decompile Flow Stack and implement BRX BRA.CC	2019-07-11 17:36:23 -04:00
bunnei	7fb7054bc8	Merge pull request #2686 from ReinUsesLisp/vk-scheduler vk_scheduler: Drop execution context in favor of views	2019-07-10 16:35:48 -04:00
Fernando Sahmkow	8a6fc529a9	shader_ir: Implement BRX & BRA.CC	2019-07-09 08:14:37 -04:00
ReinUsesLisp	c9d886c84e	gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders This commit implements gl_ViewportIndex and gl_Layer in vertex and geometry shaders. In the case it's used in a vertex shader, it requires ARB_shader_viewport_layer_array. This extension is available on AMD and Nvidia devices (mesa and proprietary drivers), but not available on Intel on any platform. At the moment of writing this description I don't know if this is a hardware limitation or a driver limitation. In the case that ARB_shader_viewport_layer_array is not available, writes to these registers on a vertex shader are ignored, with the appropriate logging.	2019-07-07 20:42:55 -03:00
Lioncash	cbdd6cd1c0	vk_sampler_cache: Remove unused includes These are no longer used within this header, so they can be removed.	2019-07-07 13:40:36 -04:00
Lioncash	4b27680639	video_core: Add missing override specifiers	2019-07-07 13:38:39 -04:00
ReinUsesLisp	86a874a2fc	vk_scheduler: Drop execution context in favor of views Instead of passing by copy an execution context through out the whole Vulkan call hierarchy, use a command buffer view and fence view approach. This internally dereferences the command buffer or fence forcing the user to be unable to use an outdated version of it on normal usage. It is still possible to keep store an outdated if it is casted to VKFence& or vk::CommandBuffer. While changing this file, add an extra parameter for Flush and Finish to allow releasing the fence from this calls.	2019-07-07 03:30:22 -03:00
ReinUsesLisp	06c4ce8645	shader: Decode SUST and implement backing image functionality	2019-06-20 21:38:33 -03:00
Zach Hilman	c0e7b91145	Merge pull request #2538 from ReinUsesLisp/ssy-pbk shader: Split SSY and PBK stack	2019-06-15 20:30:13 -04:00
Zach Hilman	de33ad25f5	Merge pull request #2514 from ReinUsesLisp/opengl-compat video_core: Drop OpenGL core in favor of OpenGL compatibility	2019-06-07 17:23:25 -04:00
ReinUsesLisp	fe8e6618f2	shader: Split SSY and PBK stack Hardware testing revealed that SSY and PBK push to a different stack, allowing code like this: SSY label1; PBK label2; SYNC; label1: PBK; label2: EXIT;	2019-06-07 02:18:27 -03:00
ReinUsesLisp	bf4dfb3ad4	shader: Use shared_ptr to store nodes and move initialization to file Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class.	2019-06-05 20:41:52 -03:00
bunnei	a20ba09bfd	Merge pull request #2520 from ReinUsesLisp/vulkan-refresh vk_device,vk_shader_decompiler: Miscellaneous changes	2019-06-05 18:10:00 -04:00
ReinUsesLisp	a89cc0bafc	maxwell_to_gl: Use GL_CLAMP to emulate Clamp wrap mode	2019-05-30 13:21:01 -03:00
ReinUsesLisp	f424b46036	vk_device: Let formats array type be deduced	2019-05-26 03:09:06 -03:00
ReinUsesLisp	a4c5e3e339	vk_shader_decompiler: Misc fixes Fix missing OpSelectionMerge instruction. This caused devices loses on most hardware, Intel didn't care. Fix [-1;1] -> [0;1] depth conversions. Conditionally use VK_EXT_scalar_block_layout. This allows us to use non-std140 layouts on UBOs. Update external Vulkan headers.	2019-05-26 01:48:04 -03:00
ReinUsesLisp	dec3c981d0	vk_device: Enable features when available and misc changes Keeps track of native ASTC support, VK_EXT_scalar_block_layout availability and SSBO range. Check for independentBlend and vertexPipelineStorageAndAtomics as a required feature. Always enable it. Use vk::to_string format to log Vulkan enums. Style changes.	2019-05-26 01:41:34 -03:00
ReinUsesLisp	9c3461604c	shader: Implement S2R Tid{XYZ} and CtaId{XYZ}	2019-05-20 16:36:49 -03:00
bunnei	d49efbfb4a	Merge pull request #2441 from ReinUsesLisp/al2p shader: Implement AL2P and ALD.PHYS	2019-05-19 14:02:58 -04:00
Mat M	dadcf317dc	Merge pull request #2461 from lioncash/unused-var video_core: Remove a few unused variables and functions	2019-05-14 06:36:26 -04:00
Rodrigo Locatti	940a71089d	Merge pull request #2413 from FernandoS27/opt-gpu Rasterizer Cache: refactor flushing & optimize memory usage of surfaces	2019-05-13 23:01:59 -03:00
Lioncash	e3c45b4338	renderer_vulkan/vk_shader_decompiler: Remove unused variable from DeclareInternalFlags()	2019-05-09 18:47:48 -04:00
ReinUsesLisp	06b363c9b5	shader: Remove unused AbufNode Ipa mode	2019-05-02 21:46:25 -03:00
bunnei	c52233ec8b	Merge pull request #2322 from ReinUsesLisp/wswitch video_core: Silent -Wswitch warnings	2019-04-28 22:24:58 -04:00
Fernando Sahmkow	4c36b78567	Rasterizer Cache: Use a temporal storage for Surfaces loading/flushing. This PR should heavily reduce memory usage since temporal buffers are no longer stored per Surface but instead managed by the Rasterizer Cache.	2019-04-21 11:42:07 -04:00
bunnei	650d9b1044	Merge pull request #2409 from ReinUsesLisp/half-floats shader_ir/decode: Miscellaneous fixes to half-float decompilation	2019-04-19 21:31:52 -04:00
Fernando Sahmkow	a3eb91ed8c	RasterizerCache Redesign: Flush flushing is now responsability of children caches instead of the cache object. This change will allow the specific cache to pass extra parameters on flushing and will allow more flexibility.	2019-04-19 20:44:56 -04:00
ReinUsesLisp	fbe8d1ceaa	video_core: Silent -Wswitch warnings	2019-04-18 15:54:39 -03:00
bunnei	4294062516	Merge pull request #2318 from ReinUsesLisp/sampler-cache gl_sampler_cache: Port sampler cache to OpenGL	2019-04-17 21:45:56 -04:00
ReinUsesLisp	ef8245bed2	vk_shader_decompiler: Add missing operations	2019-04-15 21:32:57 -03:00
ReinUsesLisp	f43995ec53	shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmetic Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall.	2019-04-15 21:16:10 -03:00
ReinUsesLisp	64613db605	shader_ir/decode: Implement half float saturation	2019-04-15 21:16:10 -03:00
ReinUsesLisp	5c280e6ff0	shader_ir: Implement STG, keep track of global memory usage and flush	2019-04-14 00:25:32 -03:00
ReinUsesLisp	75d23a3679	vk_shader_decompiler: Implement flow primitives	2019-04-10 14:20:25 -03:00
ReinUsesLisp	58ad8dfac6	vk_shader_decompiler: Implement most common texture primitives	2019-04-10 14:20:25 -03:00
ReinUsesLisp	4667ed8e22	vk_shader_decompiler: Implement texture decompilation helper functions	2019-04-10 14:20:25 -03:00
ReinUsesLisp	676172e20d	vk_shader_decompiler: Implement Assign and LogicalAssign	2019-04-10 14:20:25 -03:00
ReinUsesLisp	d316d248ab	vk_shader_decompiler: Implement non-OperationCode visits	2019-04-10 14:20:25 -03:00
ReinUsesLisp	b758c861b0	vk_shader_decompiler: Implement OperationCode decompilation interface	2019-04-10 14:20:25 -03:00
ReinUsesLisp	fec4eb9776	vk_shader_decompiler: Implement Visit	2019-04-10 14:20:25 -03:00
ReinUsesLisp	ca51f99840	vk_shader_decompiler: Implement labels tree and flow	2019-04-10 14:20:25 -03:00
ReinUsesLisp	13aa664f3f	vk_shader_decompiler: Implement declarations	2019-04-10 14:20:25 -03:00
ReinUsesLisp	ad53b233c5	vk_shader_decompiler: Declare and stub interface for a SPIR-V decompiler	2019-04-10 14:20:25 -03:00
Lioncash	26223f8124	video_core/engines: Remove unnecessary inclusions where applicable Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h	2019-04-05 18:26:32 -04:00
bunnei	7931a68d4e	Merge pull request #2302 from ReinUsesLisp/vk-swapchain vk_swapchain: Implement a swapchain manager	2019-04-03 11:50:05 -04:00
ReinUsesLisp	c5047540c9	video_core: Abstract vk_sampler_cache into a templated class	2019-04-02 15:54:11 -03:00
bunnei	1960164055	Merge pull request #2297 from lioncash/reorder video_core: Amend constructor initializer list order where applicable	2019-03-30 20:00:26 -04:00
ReinUsesLisp	746dab407e	vk_swapchain: Implement a swapchain manager	2019-03-29 00:00:51 -03:00
Lioncash	a5fa4b311e	video_core: Amend constructor initializer list order where applicable Specifies the members in the same order that initialization would take place in. This also silences -Wreorder warnings.	2019-03-27 12:37:53 -04:00
Lioncash	bbe700359d	video_core: Add missing override specifiers Ensures that the signatures will always match with the base class. Also silences a few compilation warnings.	2019-03-27 12:24:52 -04:00
bunnei	241563d15c	gpu: Move GPUVAddr definition to common_types.	2019-03-20 22:36:02 -04:00
bunnei	2eaf6c41a4	gpu: Use host address for caching instead of guest address.	2019-03-14 22:34:42 -04:00
Mat M	a3734d7e31	vk_sampler_cache: Use operator== instead of memcmp Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc>	2019-03-12 21:05:36 -03:00
ReinUsesLisp	aa59d77c3b	vk_sampler_cache: Implement a sampler cache	2019-03-12 20:20:57 -03:00
bunnei	1143923cdd	Merge pull request #2191 from ReinUsesLisp/maxwell-to-vk maxwell_to_vk: Initial implementation	2019-03-08 11:51:08 -05:00
Lioncash	f9ee0dc7ee	video_core/engines: Remove unnecessary includes Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change. This also uncovered some indirect dependencies within other source files. This also fixes those.	2019-03-05 20:35:32 -05:00
ReinUsesLisp	1f6571b3de	maxwell_to_vk: Initial implementation	2019-03-04 04:06:05 -03:00
ReinUsesLisp	8e84e81e74	vk_buffer_cache: Fix clang-format	2019-03-02 02:16:45 -03:00
ReinUsesLisp	35c105a108	vk_buffer_cache: Implement a buffer cache This buffer cache is just like OpenGL's buffer cache with some minor style changes. It uses VKStreamBuffer.	2019-03-01 17:33:36 -03:00
bunnei	1b13859af8	Merge pull request #2152 from ReinUsesLisp/vk-stream-buffer vk_stream_buffer: Implement a stream buffer	2019-02-27 21:19:15 -05:00
Lioncash	16ea93c11e	vk_memory_manager: Reorder constructor initializer list in terms of member declaration order Reorders members in the order that they would actually be initialized in. Silences a -Wreorder warning.	2019-02-27 11:08:19 -05:00
ReinUsesLisp	730eb1dad7	vk_stream_buffer: Remove copy code path	2019-02-26 02:09:43 -03:00
ReinUsesLisp	33a0597603	vk_stream_buffer: Implement a stream buffer This manages two kinds of streaming buffers: one for unified memory models and one for dedicated GPUs. The first one skips the copy from the staging buffer to the real buffer, since it creates an unified buffer. This implementation waits for all fences to finish their operation before "invalidating". This is suboptimal since it should allocate another buffer or start searching from the beginning. There is room for improvement here. This could also handle AMD's "pinned" memory (a heap with 256 MiB) that seems to be designed for buffer streaming.	2019-02-24 04:27:51 -03:00
ReinUsesLisp	281a8bf259	vk_resource_manager: Minor VKFenceWatch changes	2019-02-24 04:19:04 -03:00
bunnei	f7090bacc5	Merge pull request #2146 from ReinUsesLisp/vulkan-scheduler vk_scheduler: Implement a scheduler	2019-02-23 23:32:43 -05:00
ReinUsesLisp	92050c4d86	vk_memory_manager: Fixup commit interval allocation VKMemoryCommitImpl was using as the end of its interval "begin + end". That ended up wasting memory.	2019-02-24 01:04:41 -03:00
ReinUsesLisp	f546fb35ed	vk_scheduler: Implement a scheduler The scheduler abstracts command buffer and fence management with an interface that's able to do OpenGL-like operations on Vulkan command buffers. It returns by value a command buffer and fence that have to be used for subsequent operations until Flush or Finish is executed, after that the current execution context (the pair of command buffers and fences) gets invalidated a new one must be fetched. Thankfully validation layers will quickly detect if this is skipped throwing an error due to modifications to a sent command buffer.	2019-02-22 01:33:32 -03:00
ReinUsesLisp	b675c97cdd	vk_memory_manager: Implement memory manager A memory manager object handles the memory allocations for a device. It allocates chunks of Vulkan memory objects and then suballocates.	2019-02-19 03:42:28 -03:00
ReinUsesLisp	ae6c052ed9	vk_resource_manager: Implement a command buffer pool with VKFencedPool	2019-02-14 18:44:26 -03:00
ReinUsesLisp	a2b6de7e9f	vk_resource_manager: Add VKFencedPool interface Handles a pool of resources protected by fences. Manages resource overflow allocating more resources. This class is intended to be used through inheritance.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	0ffdd0a683	vk_resource_manager: Implement VKResourceManager and fence allocator CommitFence iterates a pool of fences until one is found. If all fences are being used at the same time, allocate more.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	aa0b6babda	vk_resource_manager: Implement VKFenceWatch A fence watch is used to keep track of the usage of a fence and protect a resource or set of resources without having to inherit from their handlers.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	25c2fe1c6b	vk_resource_manager: Implement VKFence Fences take ownership of objects, protecting them from GPU-side or driver-side concurrent access. They must be commited from the resource manager. Their usage flow is: commit the fence from the resource manager, protect resources with it and use them, send the fence to an execution queue and Wait for it if needed and then call Release. Used resources will automatically be signaled when they are free to be reused.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	33a4cebc22	vk_resource_manager: Add VKResource interface VKResource is an interface that gets signaled by a fence when it is free to be reused.	2019-02-14 18:36:15 -03:00
ReinUsesLisp	8beca060d1	vk_device: Abstract device handling into a class VKDevice contains all the data required to manage and initialize a physical device. Its intention is to be passed across Vulkan objects to query device-specific data (for example the logical device and the dispatch loader).	2019-02-12 21:43:02 -03:00
ReinUsesLisp	18fe910957	renderer_vulkan: Add declarations file This file is intended to be included instead of vulkan/vulkan.hpp. It includes declarations of unique handlers using a dynamic dispatcher instead of a static one (which would require linking to a Vulkan library).	2019-02-12 18:33:02 -03:00

... 10 11 12 13 14 ...

1076 commits