N-archive/yuzu - SiliconForest Atelier

Author	SHA1	Message	Date
ameerj	a832aa699f	codec: Improve libav memory alloc and cleanup	2021-08-16 14:40:53 -04:00
ameerj	bc3efb79cc	codec: Fallback to CPU decoding if no compatible GPU format is found	2021-08-16 14:40:53 -04:00
lat9nq	92bc51b66a	cmake: Add VDPAU and NVDEC support to FFmpeg Adds {h264_,vp9_}{nvdec,vdpau} hwaccels.	2021-08-16 14:40:52 -04:00
ameerj	537c6ac8fe	vk_blit_screen: Fix non-accelerated texture size calculation Addresses the potential OOB access in UnswizzleTexture.	2021-08-16 14:28:10 -04:00
Merry	1770503185	xbyak: Update include path	2021-08-15 19:26:38 +01:00
bunnei	87d63b858a	Merge pull request #6861 from yzct12345/const-mempy-is-all-the-speed decoders: Optimize memcpy for the other functions	2021-08-15 02:38:12 -07:00
bunnei	0509fe3377	Merge pull request #6838 from ameerj/sws-align vic: Specify sws_scale height stride.	2021-08-12 11:28:33 -07:00
ameerj	356e10898f	codec: Replace deprecated av_init_packet usage	2021-08-12 01:28:01 -04:00
ameerj	659039ca6d	nvdec: Implement GPU accelerated decoding for all platforms Supplements the VAAPI intel gpu decoder by implementing the D3D11VA decoder for Windows, and CUVID/VDPAU for Nvidia and AMD on drivers linux respectively.	2021-08-12 01:28:01 -04:00
yzct12345	430255caf8	decoders: Templates allow memcpy optimizations	2021-08-12 04:45:25 +00:00
Fernando S	6a082df427	Merge pull request #6820 from yzct12345/split-cache texture_cache: Split out template definitions	2021-08-10 12:23:05 +02:00
ameerj	a779cede7c	vic: Specify sws_scale height stride. Silences a sws_scale runtime warning about unaligned strides.	2021-08-09 23:24:16 -04:00
Mai M	2da91ec75b	Merge pull request #6844 from ameerj/vp9-empty-frame vp9: Ensure the first frame is complete	2021-08-08 19:02:39 -04:00
ameerj	fa22695705	vp9: Ensure the first frame is complete Silences a runtime error due to the first frame missing the frame data, and being set to hidden despite being a key-frame.	2021-08-08 13:49:00 -04:00
yzct12345	c4eafcc861	texture_cache: Address ameerj's review	2021-08-08 11:02:51 +00:00
Fernando S	859deda3bb	Merge pull request #6834 from K0bin/buffer-image-granularity Respect Vulkan bufferImageGranularity	2021-08-08 11:57:40 +02:00
bunnei	bd0e1d3a25	Merge pull request #6830 from ameerj/nvdec-unimpld-codec nvdec: Better logging for unimplemented codecs	2021-08-07 12:37:39 -07:00
Robin Kertels	bb29dcb7f2	vulkan_memory_allocator: Respect bufferImageGranularity	2021-08-07 15:28:05 +02:00
ameerj	928b64d2ce	nvdec: Better logging for unimplemented codecs	2021-08-07 01:08:33 -04:00
bunnei	268b5764c7	Merge pull request #6791 from ameerj/astc-opt astc_decoder: Various performance and memory optimizations	2021-08-06 21:45:24 -07:00
yzct12345	e80323b8b0	texture_cache: Address ameerj's review	2021-08-07 01:27:47 +00:00
bunnei	f183668a87	Merge pull request #6799 from ameerj/vp9-fixes nvdec: Fix VP9 reference frame refreshes	2021-08-06 17:46:46 -07:00
ameerj	e3688f0627	vp9: Cleanup unused variables With reference frames refreshes fix, we no longer need to buffer two frames in advance. We can also remove other unused or otherwise unneeded variables.	2021-08-06 20:08:11 -04:00
ameerj	a3f80a97a3	vp9: Fix reference frame refreshes This resolves the artifacting when decoding VP9 streams.	2021-08-06 20:08:08 -04:00
yzct12345	02e98f6c93	texture_cache: Don't change copyright year	2021-08-05 20:52:12 +00:00
yzct12345	5566f3dbc0	texture_cache: Address ameerj's review	2021-08-05 20:46:24 +00:00
yzct12345	f9563c8f24	texture_cache: Split templates out	2021-08-05 13:52:30 +00:00
yzct12345	2868d4ba84	nvdec: Implement VA-API hardware video acceleration (#6713 ) * nvdec: VA-API * Verify formatting * Forgot a semicolon for Windows * Clarify comment about AV_PIX_FMT_NV12 * Fix assert log spam from missing negation * vic: Remove forgotten debug code * Address lioncash's review * Mention VA-API is Intel/AMD * Address v1993's review * Hopefully fix CMakeLists style this time * vic: Improve cache locality * vic: Fix off-by-one error * codec: Async * codec: Forgot the GetValue() * nvdec: Address ameerj's review * codec: Fallback to CPU without VA-API support * cmake: Address lat9nq's review * cmake: Make VA-API optional * vaapi: Multiple GPU * Apply suggestions from code review Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com> * nvdec: Address ameerj's review * codec: Use anonymous instead of static * nvdec: Remove enum and fix memory leak * nvdec: Address ameerj's review * codec: Remove preparation for threading Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com>	2021-08-03 23:43:11 -04:00
yzct12345	f56d0db5bd	decoders: Optimize swizzle copy performance (#6790 ) This makes UnswizzleTexture up to two times faster. It is the main bottleneck in NVDEC video decoding.	2021-08-02 11:18:58 -04:00
Fernando S	30f0b7cf31	Merge pull request #6720 from ameerj/vk-screenshot renderer_vulkan: Implement screenshots	2021-08-01 13:31:33 +02:00
Ameer J	db32c3762b	Merge pull request #6765 from ReinUsesLisp/y-negate-vk vk_rasterizer: Flip viewport on Y_NEGATE	2021-08-01 01:47:37 -04:00
ameerj	c439fc9be9	astc_decoder: Reduce workgroup size This reduces the amount of over dispatching when there are odd dimensions (i.e. ASTC 8x5), which rarely evenly divide into 32x32.	2021-08-01 01:22:27 -04:00
ameerj	5ab8053511	astc_decoder: Compute offset swizzles in-shader Alleviates the dependency on the swizzle table and a uniform which is constant for all ASTC texture sizes.	2021-08-01 01:22:26 -04:00
ameerj	b2862e4772	astc_decoder: Make use of uvec4 for payload data	2021-07-31 22:28:04 -04:00
ameerj	a75d70fa90	astc_decoder: Simplify Select2DPartition	2021-07-31 21:36:26 -04:00
ameerj	5665d05547	astc_decoder: Optimize the use EncodingData This buffer was a list of EncodingData structures sorted by their bit length, with some duplication from the cpu decoder implementation. We can take advantage of its sorted property to optimize its usage in the shader. Thanks to wwylele for the optimization idea.	2021-07-31 21:36:26 -04:00
ameerj	15c0c213b1	astc.h: Move data to cpp implementation Moves leftover values that are no longer used by the gpu decoder back to the cpp implementation.	2021-07-31 21:26:42 -04:00
bunnei	7530594602	Merge pull request #6759 from ReinUsesLisp/pipeline-statistics renderer_vulkan: Add setting to log pipeline statistics	2021-07-30 11:18:52 -07:00
ReinUsesLisp	b185567a03	vk_rasterizer: Flip viewport on Y_NEGATE Matches OpenGL's behavior. I don't believe this register flips geometry, but we have to try to match behavior on both backends.	2021-07-29 02:17:53 -03:00
ameerj	7ac99bb127	renderers: Add explicit invert_y bool to screenshot callback OpenGL and Vulkan images render in different coordinate systems. This allows us to specify the coordinate system of the screenshot within each renderer	2021-07-28 21:46:08 -04:00
ameerj	75e7f54fb0	renderer_vulkan: Implement screenshots	2021-07-28 21:45:55 -04:00
ameerj	548bac8989	vk_blit_screen: Add public CreateFramebuffer method	2021-07-28 21:43:02 -04:00
ameerj	1e6c5d323d	vk_blit_screen: Make Draw method more generic Allows specifying the framebuffer and render area dimensions, rather than being hard coded for the render window.	2021-07-28 21:37:30 -04:00
ReinUsesLisp	3b006f4fe2	renderer_vulkan: Add setting to log pipeline statistics Use VK_KHR_pipeline_executable_properties when enabled and available to log statistics about the pipeline cache in a game. For example, this is on Turing GPUs when generating a pipeline cache from Super Smash Bros. Ultimate: Average pipeline statistics ========================================== Code size: 6433.167 Register count: 32.939 More advanced results could be presented, at the moment it's just an average of all 3D and compute pipelines.	2021-07-27 21:29:24 -03:00
bunnei	92887a65f0	Merge pull request #6749 from lioncash/rtarget render_target: Add missing initializer for size extent	2021-07-27 17:28:53 -07:00
Rodrigo Locatti	ab206d6378	Merge pull request #6748 from lioncash/engine-init video_core/engine: Consistently initialize rasterizer pointers	2021-07-27 16:17:20 -03:00
bunnei	2717e79c74	Merge pull request #6745 from lioncash/copies video_core: Remove some unused variables	2021-07-27 11:38:32 -07:00
Lioncash	00e100de08	render_target: Add missing initializer for size extent Everything else has a default constructor that does the straightforward thing of initializing most members to a default value, except for the size. We explicitly initialize the size (and others, for consistency), to prevent potential uninitialized reads from occurring. Particularly given the largeish surface area that this struct is used in.	2021-07-27 07:41:21 -04:00
Lioncash	f8964dd89a	video_core/engine: Consistently initialize rasterizer pointers Ensures all of the engines have consistent and deterministic initialization of the rasterizer pointers.	2021-07-27 07:30:57 -04:00
Lioncash	8c82c594f0	vulkan_wrapper: Fix SetObjectName() always indicating objects as images We should be using the passed in object type instead.	2021-07-27 07:19:15 -04:00

... 11 12 13 14 15 ...

6104 commits