Also introduced in REV5 was a variable-size audio command buffer. This
also affects how the size of the work buffer should be determined, so we
can add handling for this as well.
Thankfully, no other alterations were made to how the work buffer size
is calculated in 7.0.0-8.0.0. There were indeed changes made to how
some of the actual audio commands are generated (particularly in REV7),
but they don't apply here.
Introduced in REV5. This is trivial to add support for, now that
everything isn't a mess of random magic constant values.
As far as this function is concerned, this is simply a change in data
type sizes.
"Unmagics" quite a few magic constants within this code, making it much
easier to understand. Particularly given this factors out specific
sections into their own self-contained lambda functions.
Instead of asserting on already-stored shader variants, silently skip them.
This shouldn't be happening, but when a shader is invalidated and is not
stored in the shader cache, this assert would be hit, and with asserts
disabled that shader would be saved anyway.
This option allows picking the compatibility profile, since a lot of bugs
are fixed in it. We devs will use this option to more easily debug current
problems in our Core implementation.
Flushing is now the responsibility of the child caches instead of the
cache object. This change allows the specific cache to pass extra
parameters on flushing and allows more flexibility.
This is a holdover from Citra, where the 3DS has both
WaitSynchronization1 and WaitSynchronizationN. The Switch only has one
form of wait synchronization (literally WaitSynchronization). This allows
us to throw out code that doesn't apply at all to the Switch kernel.
Because of this unnecessary dichotomy within the wait synchronization
utilities, we were also neglecting to properly handle waiting on
multiple objects.
While we're at it, we can also scrub out any lingering references to
WaitSynchronization1/WaitSynchronizationN in comments, and change them
to WaitSynchronization (or remove them if the mention no longer
applies).
The actual behavior of this function is slightly more complex than what
we're currently doing within the supervisor call. To avoid dumping most
of this behavior in the supervisor call itself, we can migrate this to
another function.
The default constructor will always run, even when not specified, so
this is redundant.
However, the context member can indeed be initialized in the constructor
initializer list.
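
To illustrate (a hedged, generic sketch; the class and member names are
hypothetical, not the ones touched by this change):

#include <string>

struct GraphicsContext {};

class EmuWindow {
public:
    // `name` is default-constructed automatically, so spelling that out in
    // the constructor is redundant; the reference member `context`, however,
    // belongs in the member initializer list.
    explicit EmuWindow(GraphicsContext& ctx) : context{ctx} {}

private:
    std::string name;         // default constructor runs regardless
    GraphicsContext& context; // initialized via the initializer list
};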
Previously we were building with MBCS, which is pretty undesirable. We
want the application to be Unicode-aware in general.
Currently, we make the command line variant of yuzu use ANSI variants of
the non-standard getopt functions that we link in for Windows, given we
only have an ANSI option-set.
We should really replace getopt with a library that we make all build
types of yuzu link in, but this will have to do for the time being.
Operations done before the main half float operation (like HAdd) were
managing a packed value instead of the unpacked one. Adding an unpacked
operation allows us to drop the per-operand MetaHalfArithmetic entry,
simplifying the code overall.
This is a compile definition introduced in Qt 4.8 that reduces the number
of temporary strings created when performing string concatenation, which
means less memory churn.
This can be read about here:
https://blog.qt.io/blog/2011/06/13/string-concatenation-with-qstringbuilder/
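
As a rough sketch of what this buys us (assuming QT_USE_QSTRINGBUILDER is
added to the target's compile definitions; the function below is
illustrative, not code from this change):

#include <QString>

// With QT_USE_QSTRINGBUILDER defined, operator+ on QStrings builds a lazy
// QStringBuilder expression, so a chain like this allocates once when the
// result is materialized instead of once per '+'.
QString BuildWindowTitle(const QString& app_name, const QString& version) {
    return app_name + QStringLiteral(" | ") + version;
}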
For a change that isn't source-compatible, we only had one occurrence
that actually needed to have its type clarified, which is pretty good as
far as transitioning goes.
This member variable is entirely unused. It was only set but never
actually utilized. Given that, we can remove it to get rid of noise in
the thread interface.
Essentially performs the inverse of svcMapProcessCodeMemory. This unmaps
the aliasing region first, then restores the general traits of the
aliased memory.
What this entails is:
- Restoring Read/Write permissions to the VMA.
- Restoring its memory state to reflect it as a general heap memory region.
- Clearing the memory attributes on the region.
Uses arithmetic that compilers can identify and optimize more readily.
For example, rather than shifting the halves of the value and then
swapping and combining them, we can swap them in place.
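
A hedged sketch of the general shape (the exact expression that landed may
differ, but the point is that a plain mask-and-shift expression like this
is something compilers can collapse into a single bswap):

#include <cstdint>

constexpr std::uint32_t swap32(std::uint32_t value) {
    return ((value & 0x000000FFU) << 24) | ((value & 0x0000FF00U) << 8) |
           ((value & 0x00FF0000U) >> 8) | ((value & 0xFF000000U) >> 24);
}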
e.g. for the original swap32 code on x86-64, clang 8.0 would generate:
mov ecx, edi
rol cx, 8
shl ecx, 16
shr edi, 16
rol di, 8
movzx eax, di
or eax, ecx
ret
while GCC 8.3 would generate the ideal:
mov eax, edi
bswap eax
ret
now both generate the same optimal output.
MSVC used to generate the following with the old code:
mov eax, ecx
rol cx, 8
shr eax, 16
rol ax, 8
movzx ecx, cx
movzx eax, ax
shl ecx, 16
or eax, ecx
ret 0
Now MSVC also generates a similar (and equally optimal) result to clang/GCC:
bswap ecx
mov eax, ecx
ret 0
====
In the swap64 case, for the original code, clang 8.0 would generate:
mov eax, edi
bswap eax
shl rax, 32
shr rdi, 32
bswap edi
or rax, rdi
ret
(almost there, but still missing the mark)
while, again, GCC 8.3 would generate the more ideal:
mov rax, rdi
bswap rax
ret
now clang also generates the optimal sequence for this fallback as well.
This is a case where MSVC unfortunately falls short; despite the new
code, it still generates a doozy of an output:
mov r8, rcx
mov r9, rcx
mov rax, 71776119061217280
mov rdx, r8
and r9, rax
and edx, 65280
mov rax, rcx
shr rax, 16
or r9, rax
mov rax, rcx
shr r9, 16
mov rcx, 280375465082880
and rax, rcx
mov rcx, 1095216660480
or r9, rax
mov rax, r8
and rax, rcx
shr r9, 16
or r9, rax
mov rcx, r8
mov rax, r8
shr r9, 8
shl rax, 16
and ecx, 16711680
or rdx, rax
mov eax, -16777216
and rax, r8
shl rdx, 16
or rdx, rcx
shl rdx, 16
or rax, rdx
shl rax, 8
or rax, r9
ret 0
which is pretty unfortunate.
This gives us significantly more control over where in the
initialization process we start execution of the main process.
Previously we were running the main process before the CPU or GPU
threads were initialized (not good). This amends execution to start
after all of our threads are properly set up.
Initially, this was required due to the split codepath in how the initial
main process instance was initialized. We used to initialize the process
like so:
Init() {
    main_process = Process::Create(...);
    kernel.MakeCurrentProcess(main_process.get());
}

Load() {
    const auto load_result = loader.Load(*kernel.GetCurrentProcess());
    if (load_result != Loader::ResultStatus::Success) {
        // Handle error here.
    }
    ...
}
which presented a problem.
Setting a created process as the main process would set the page table
for that process as the main page table. This is fine... until we get to
the part where the page table can have its size changed in the Load()
function via NPDM metadata, which can dictate either a 32-bit, 36-bit,
or 39-bit usable address space.
Now that we have full control over the process's creation in Load(), we
can simply set the initial process as the main process after all the
loading is done, reflecting the potential page table changes without any
special-casing behavior.
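
In the same rough pseudocode as above, the flow now looks more like this
(names illustrative):

Load() {
    main_process = Process::Create(...);
    const auto load_result = loader.Load(*main_process);
    if (load_result != Loader::ResultStatus::Success) {
        // Handle error here.
    }
    // The page table's size is final at this point, so this is now safe.
    kernel.MakeCurrentProcess(main_process.get());
    ...
}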
We can also remove the cache flushing within LoadModule(), as execution
wouldn't have even begun yet during all usages of this function, now
that we have the initialization order cleaned up.
Now that we have dependencies on the initialization order, we can move
the creation of the main process to a more sensible area: where we
actually load in the executable data.
This allows localizing the creation and loading of the process in one
location, making the initialization of the process much nicer to trace.
As with CPU emulation, we generally don't want to fire off the threads
immediately after the relevant classes are initialized; we want to do
this only after all necessary data has finished loading.
This splits the thread creation into its own interface member function
to allow controlling when these threads in particular get created.
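
A minimal sketch of the shape of that split (class and member names are
hypothetical, not the actual VideoCore interface):

#include <thread>

class GpuWorker {
public:
    // Construction only sets up state; no thread is spawned here.
    GpuWorker() = default;

    // Called separately, once all necessary data has finished loading.
    void StartThread() {
        thread = std::thread([this] { RunLoop(); });
    }

    ~GpuWorker() {
        if (thread.joinable()) {
            thread.join();
        }
    }

private:
    void RunLoop() {
        // Process submitted work until told to stop.
    }

    std::thread thread;
};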
Our initialization process is a little wonkier than one would expect when
it comes to code flow. We initialize the CPU last, as opposed to real
hardware, where the CPU obviously needs to come up first (otherwise
nothing else would work), and we have code that adds checks to get around
this.
For example, in the page table setting code, we check to see if the
system is turned on before we even notify the CPU instances of a page
table switch. This results in dead code (at the moment), because the
only time a page table switch will occur is when the system is *not*
running, preventing the emulated CPU instances from being notified of a
page table switch in a convenient manner (technically the code path
could be taken, but we don't emulate the process creation svc handlers
yet).
This moves the thread creation into its own member function of the core
manager and restores a little order (and predictability) to our
initialization process.
Previously, in the multi-threaded cases, we'd kick off several threads
before the main kernel process was even created and ready to execute (gross!).
Now the initialization process is like so:
Initialization:
1. Timers
2. CPU
3. Kernel
4. Filesystem stuff (kind of gross, but can be amended trivially)
5. Applet stuff (ditto in terms of being kind of gross)
6. Main process (will be moved into the loading step in a following
change)
7. Telemetry (this should be initialized last in the future).
8. Services (4 and 5 should ideally be alongside this).
9. GDB (gross. Uses namespace scope state. Needs to be refactored into a
class or booted altogether).
10. Renderer
11. GPU (will also have its threads created in a separate step in a
following change).
Which... isn't *ideal* per se; however, untangling the wonky intertwining
of CPU state initialization from this mix gets rid of most of the
footguns in our initialization process.
Allows the compiler to warn when the result of a swap function is being
ignored (which is 100% a bug in all usage scenarios). We also mark them
noexcept so that other functions using them can themselves be marked
noexcept, and so they play nicely with anything that inspects
"nothrowability".
Including every OS's own built-in byte swapping functions is kind of
undesirable, since it adds yet another build path that we have to ensure
compiles.
Given we only support clang, GCC, and MSVC for the time being, we can
utilize their built-in functions directly instead of going through the
OS's API functions.
This shrinks the overall code down to just:

if (msvc)
    use msvc's functions
else if (clang or gcc)
    use clang/gcc's builtins
else
    use the slow path
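
A hedged sketch of what that dispatch looks like in practice (the function
name is illustrative, not the real helper):

#include <cstdint>
#if defined(_MSC_VER)
#include <cstdlib>
#endif

inline std::uint32_t ByteSwap32(std::uint32_t value) {
#if defined(_MSC_VER)
    return _byteswap_ulong(value);
#elif defined(__clang__) || defined(__GNUC__)
    return __builtin_bswap32(value);
#else
    // Slow path: plain arithmetic fallback.
    return ((value & 0x000000FFU) << 24) | ((value & 0x0000FF00U) << 8) |
           ((value & 0x00FF0000U) >> 8) | ((value & 0xFF000000U) >> 24);
#endif
}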
The template type here is actually a forwarding reference, not an rvalue
reference in this case, so it's more appropriate to use std::forward to
preserve the value category of the type being moved.
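
A minimal, self-contained illustration of the distinction (names hypothetical):

#include <string>
#include <utility>
#include <vector>

// T&& here is a forwarding reference: std::forward keeps lvalue arguments
// as lvalues (copied) and rvalue arguments as rvalues (moved), whereas
// std::move would unconditionally turn them into rvalues.
template <typename T>
void AppendName(std::vector<std::string>& names, T&& name) {
    names.emplace_back(std::forward<T>(name));
}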
Some objects declare their handle type as const, while others declare it
as constexpr. This makes the const ones constexpr for consistency, and
prevents unexpected compilation errors if these happen to be used within
a constexpr context.
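
A hedged, self-contained illustration (the enum and member names stand in
for the actual kernel object declarations):

enum class HandleType { Process, Thread, Event };

class Event {
public:
    // constexpr makes the compile-time nature of the constant explicit and
    // keeps every object's declaration consistent.
    static constexpr HandleType HANDLE_TYPE = HandleType::Event;
};

static_assert(Event::HANDLE_TYPE == HandleType::Event, "usable in a constexpr context");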