H3cJP/yuzu - yuzu - Gitea: Git with a cup of tea

H3cJP/yuzu

Author	SHA1	Message	Date
Lioncash	0d8ef2d3b9	common/swap: Improve codegen of the default swap fallbacks Uses arithmetic that can be identified more trivially by compilers for optimizations. e.g. Rather than shifting the halves of the value and then swapping and combining them, we can swap them in place. e.g. for the original swap32 code on x86-64, clang 8.0 would generate: mov ecx, edi rol cx, 8 shl ecx, 16 shr edi, 16 rol di, 8 movzx eax, di or eax, ecx ret while GCC 8.3 would generate the ideal: mov eax, edi bswap eax ret now both generate the same optimal output. MSVC used to generate the following with the old code: mov eax, ecx rol cx, 8 shr eax, 16 rol ax, 8 movzx ecx, cx movzx eax, ax shl ecx, 16 or eax, ecx ret 0 Now MSVC also generates a similar, but equally optimal result as clang/GCC: bswap ecx mov eax, ecx ret 0 ==== In the swap64 case, for the original code, clang 8.0 would generate: mov eax, edi bswap eax shl rax, 32 shr rdi, 32 bswap edi or rax, rdi ret (almost there, but still missing the mark) while, again, GCC 8.3 would generate the more ideal: mov rax, rdi bswap rax ret now clang also generates the optimal sequence for this fallback as well. This is a case where MSVC unfortunately falls short, despite the new code, this one still generates a doozy of an output. mov r8, rcx mov r9, rcx mov rax, 71776119061217280 mov rdx, r8 and r9, rax and edx, 65280 mov rax, rcx shr rax, 16 or r9, rax mov rax, rcx shr r9, 16 mov rcx, 280375465082880 and rax, rcx mov rcx, 1095216660480 or r9, rax mov rax, r8 and rax, rcx shr r9, 16 or r9, rax mov rcx, r8 mov rax, r8 shr r9, 8 shl rax, 16 and ecx, 16711680 or rdx, rax mov eax, -16777216 and rax, r8 shl rdx, 16 or rdx, rcx shl rdx, 16 or rax, rdx shl rax, 8 or rax, r9 ret 0 which is pretty unfortunate.	2019-04-12 00:07:39 -04:00
Lioncash	612e1388df	core/core: Move process execution start to System's Load() This gives us significantly more control over where in the initialization process we start execution of the main process. Previously we were running the main process before the CPU or GPU threads were initialized (not good). This amends execution to start after all of our threads are properly set up.	2019-04-11 22:11:41 -04:00
Lioncash	32a6ceb4e5	core/process: Remove unideal page table setting from LoadFromMetadata() Initially required due to the split codepath with how the initial main process instance was initialized. We used to initialize the process like: Init() { main_process = Process::Create(...); kernel.MakeCurrentProcess(main_process.get()); } Load() { const auto load_result = loader.Load(*kernel.GetCurrentProcess()); if (load_result != Loader::ResultStatus::Success) { // Handle error here. } ... } which presented a problem. Setting a created process as the main process would set the page table for that process as the main page table. This is fine... until we get to the part that the page table can have its size changed in the Load() function via NPDM metadata, which can dictate either a 32-bit, 36-bit, or 39-bit usable address space. Now that we have full control over the process' creation in load, we can simply set the initial process as the main process after all the loading is done, reflecting the potential page table changes without any special-casing behavior. We can also remove the cache flushing within LoadModule(), as execution wouldn't have even begun yet during all usages of this function, now that we have the initialization order cleaned up.	2019-04-11 22:11:41 -04:00
Lioncash	a4b0a8559c	core/core: Move main process creation into Load() Now that we have dependencies on the initialization order, we can move the creation of the main process to a more sensible area: where we actually load in the executable data. This allows localizing the creation and loading of the process in one location, making the initialization of the process much nicer to trace.	2019-04-11 22:11:40 -04:00
Lioncash	6d0551196d	video_core/gpu: Create threads separately from initialization Like with CPU emulation, we generally don't want to fire off the threads immediately after the relevant classes are initialized, we want to do this after all necessary data is done loading first. This splits the thread creation into its own interface member function to allow controlling when these threads in particular get created.	2019-04-11 22:11:40 -04:00
Lioncash	f2331a804a	core/cpu_core_manager: Create threads separately from initialization. Our initialization process is a little wonky than one would expect when it comes to code flow. We initialize the CPU last, as opposed to hardware, where the CPU obviously needs to be first, otherwise nothing else would work, and we have code that adds checks to get around this. For example, in the page table setting code, we check to see if the system is turned on before we even notify the CPU instances of a page table switch. This results in dead code (at the moment), because the only time a page table switch will occur is when the system is not running, preventing the emulated CPU instances from being notified of a page table switch in a convenient manner (technically the code path could be taken, but we don't emulate the process creation svc handlers yet). This moves the threads creation into its own member function of the core manager and restores a little order (and predictability) to our initialization process. Previously, in the multi-threaded cases, we'd kick off several threads before even the main kernel process was created and ready to execute (gross!). Now the initialization process is like so: Initialization: 1. Timers 2. CPU 3. Kernel 4. Filesystem stuff (kind of gross, but can be amended trivially) 5. Applet stuff (ditto in terms of being kind of gross) 6. Main process (will be moved into the loading step in a following change) 7. Telemetry (this should be initialized last in the future). 8. Services (4 and 5 should ideally be alongside this). 9. GDB (gross. Uses namespace scope state. Needs to be refactored into a class or booted altogether). 10. Renderer 11. GPU (will also have its threads created in a separate step in a following change). Which... isn't ideal per-se, however getting rid of the wonky intertwining of CPU state initialization out of this mix gets rid of most of the footguns when it comes to our initialization process.	2019-04-11 22:11:40 -04:00
bunnei	ea80e2bc57	Merge pull request #2235 from ReinUsesLisp/spirv-decompiler vk_shader_decompiler: Implement a SPIR-V decompiler	2019-04-11 21:54:23 -04:00
bunnei	83a2fb3c3a	Merge pull request #2360 from lioncash/svc-global kernel/svc: Deglobalize the supervisor call handlers	2019-04-11 21:50:05 -04:00
bunnei	e2f2155dab	Merge pull request #2388 from lioncash/constexpr kernel: Make handle type declarations constexpr	2019-04-11 21:49:45 -04:00
Lioncash	66b73fd399	common/swap: Mark byte swapping free functions with [[nodiscard]] and noexcept Allows the compiler to inform when the result of a swap function is being ignored (which is 100% a bug in all usage scenarios). We also mark them noexcept to allow other functions using them to be able to be marked as noexcept and play nicely with things that potentially inspect "nothrowability".	2019-04-11 20:42:44 -04:00
Lioncash	9cb4b7be40	common/swap: Simplify swap function ifdefs Including every OS' own built-in byte swapping functions is kind of undesirable, since it adds yet another build path to ensure compilation succeeds on. Given we only support clang, GCC, and MSVC for the time being, we can utilize their built-in functions directly instead of going through the OS's API functions. This shrinks the overall code down to just if (msvc) use msvc's functions else if (clang or gcc) use clang/gcc's builtins else use the slow path	2019-04-11 20:36:19 -04:00
Lioncash	598954436f	common/swap: Remove 32-bit ARM path We don't plan to support host 32-bit ARM execution environments, so this is essentially dead code.	2019-04-11 20:15:47 -04:00
Lioncash	b569641098	common/scope_exit: Replace std::move with std::forward in ScopeExit() The template type here is actually a forwarding reference, not an rvalue reference in this case, so it's more appropriate to use std::forward to preserve the value category of the type being moved.	2019-04-11 20:01:33 -04:00
Lioncash	6300ccbc3c	kernel: Make handle type declarations constexpr Some objects declare their handle type as const, while others declare it as constexpr. This makes the const ones constexpr for consistency, and prevent unexpected compilation errors if these happen to be attempted to be used within a constexpr context.	2019-04-11 16:34:53 -04:00
FreddyFunk	dffa1a872a	ui_settings: Rename game directory variables	2019-04-11 19:55:56 +02:00
Fernando Sahmkow	c9305959d3	gl_rasterizer_cache: Relax restrictions on FastCopySurface and FastLayeredCopySurface	2019-04-11 13:14:28 -04:00
Lioncash	ca96dc4676	service: Update service function tables Updates function tables based off information from SwitchBrew.	2019-04-11 02:47:00 -04:00
bunnei	6951741a94	Merge pull request #2278 from ReinUsesLisp/vc-texture-cache video_core: Implement API agnostic view based texture cache	2019-04-10 21:17:35 -04:00
bunnei	0371650bd7	Merge pull request #2372 from FernandoS27/fermi-fix Correct Fermi Copy on Linear Textures.	2019-04-10 21:17:03 -04:00
ReinUsesLisp	93af663683	gl_shader_manager: Move code to source file and minor clean up	2019-04-10 19:29:15 -03:00
ReinUsesLisp	6df25e9c7b	gl_rasterizer: Apply just the needed state on Clear	2019-04-10 18:13:15 -03:00
Lioncash	dae2449880	ldr: Mark IsValidNROHash() as a const member function This doesn't modify instance state, so it can be made const.	2019-04-10 15:57:02 -04:00
Lioncash	0032cf3818	ldr: Amend parameters for LoadNro/UnloadNro LoadNrr/UnloadNrr The initial two words indicate a process ID. Also UnloadNro only specifies one address, not two.	2019-04-10 15:56:43 -04:00
ReinUsesLisp	75d23a3679	vk_shader_decompiler: Implement flow primitives	2019-04-10 14:20:25 -03:00
ReinUsesLisp	58ad8dfac6	vk_shader_decompiler: Implement most common texture primitives	2019-04-10 14:20:25 -03:00
ReinUsesLisp	4667ed8e22	vk_shader_decompiler: Implement texture decompilation helper functions	2019-04-10 14:20:25 -03:00
ReinUsesLisp	676172e20d	vk_shader_decompiler: Implement Assign and LogicalAssign	2019-04-10 14:20:25 -03:00
ReinUsesLisp	d316d248ab	vk_shader_decompiler: Implement non-OperationCode visits	2019-04-10 14:20:25 -03:00
ReinUsesLisp	b758c861b0	vk_shader_decompiler: Implement OperationCode decompilation interface	2019-04-10 14:20:25 -03:00
ReinUsesLisp	fec4eb9776	vk_shader_decompiler: Implement Visit	2019-04-10 14:20:25 -03:00
ReinUsesLisp	ca51f99840	vk_shader_decompiler: Implement labels tree and flow	2019-04-10 14:20:25 -03:00
ReinUsesLisp	13aa664f3f	vk_shader_decompiler: Implement declarations	2019-04-10 14:20:25 -03:00
ReinUsesLisp	ad53b233c5	vk_shader_decompiler: Declare and stub interface for a SPIR-V decompiler	2019-04-10 14:20:25 -03:00
ReinUsesLisp	970d9e57c8	video_core: Add sirit as optional dependency with Vulkan sirit is a runtime assembler for SPIR-V	2019-04-10 14:20:25 -03:00
Lioncash	8676832064	fsp_srv: Remove unnecessary parameter popping in IDirectory's Read() IDirectory's Read() function doesn't take any input parameters. It only uses the output parameters that we already provide.	2019-04-10 13:04:08 -04:00
Lioncash	fc436bb09b	fsp_srv: Log out option values in IFile's Read and Write functions These indicate options that alter how a read/write is performed. Currently we don't need to handle these, as the only one that seems to be used is for writes, but all the custom options ever seem to do is immediate flushing, which we already do by default.	2019-04-10 13:01:52 -04:00
bunnei	97648f4841	Merge pull request #2345 from ReinUsesLisp/multibind gl_rasterizer: Use ARB_multi_bind to update buffers with a single call per drawcall	2019-04-10 11:23:19 -04:00
bunnei	1312cf15d6	Merge pull request #2377 from lioncash/todo kernel/server_session: Remove obsolete TODOs	2019-04-10 10:29:24 -04:00
Lioncash	08d507a196	kernel/server_session: Remove obsolete TODOs These are holdovers from Citra.	2019-04-09 23:34:49 -04:00
bunnei	ed9dba89d3	Merge pull request #2375 from FernandoS27/fix-ldc Remove unnecessary bounding in LD_C	2019-04-09 21:23:24 -04:00
bunnei	f46c3164e7	Merge pull request #2353 from lioncash/surface yuzu/debugger: Remove graphics surface viewer	2019-04-09 21:20:02 -04:00
Lioncash	e1101d3e20	configure_hotkeys: Pass the dialog as a parent to SequenceDialog() Without passing in a parent, this can result in focus being stolen from the dialog in certain cases. Example: On Windows, if the logging window is left open, the logging Window will potentially get focus over the hotkey dialog itself, since it brings all open windows for the application into view. By specifying a parent, we only bring windows for the parent into view (of which there are none, aside from the hotkey dialog).	2019-04-09 20:06:49 -04:00
Lioncash	b47c0c8a80	configure_hotkeys: Avoid dialog memory leak within Configure() Without a parent, this dialog won't have its memory freed when it happens to get destroyed.	2019-04-09 20:05:57 -04:00
Fernando Sahmkow	c9f35d96be	Remove bounding in LD_C	2019-04-09 20:02:11 -04:00
Lioncash	dbf13f8169	configure_hotkeys: Mark member variables as const where applicable in Configure()	2019-04-09 19:50:14 -04:00
Lioncash	cf6cdd20f8	configure_hotkeys: Make comparison check a little more self-documenting This is checking if an index is valid or not and returning early if it isn't.	2019-04-09 19:47:20 -04:00
Lioncash	c4ba717491	configure_dialog: Amend constructor initializer list order Avoids a -Wreorder compiler warning.	2019-04-09 19:39:43 -04:00
Lioncash	8c05dfaa61	configure_hotkey: Remove unnecessary include Avoids dumping all of the core settings machinery into whatever files include this header. Nothing inside the header itself actually made use of anything in settings.h anyways.	2019-04-09 19:37:08 -04:00
Lioncash	e28a5b0d18	configure_hotkey: Make IsUsedKey() a const member function This doesn't actually modify instance state of the dialog, so this can be made const.	2019-04-09 19:35:54 -04:00
bunnei	2598433f9c	Merge pull request #2354 from lioncash/header video_core/texures/texture: Remove unnecessary includes	2019-04-09 19:19:41 -04:00
bunnei	61f63bb994	Merge pull request #1957 from DarkLordZach/title-provider file_sys: Provide generic interface for accessing game data	2019-04-09 19:16:37 -04:00
bunnei	353a099481	Merge pull request #2366 from FernandoS27/xmad-fix Correct XMAD mode, psl and high_b on different encodings.	2019-04-09 19:15:01 -04:00
bunnei	1a3098f11a	Merge pull request #2132 from FearlessTobi/port-4437 Port citra-emu/citra#4437: "citra-qt: Make hotkeys configurable via the GUI (Attempt 2)"	2019-04-09 18:08:30 -04:00
bunnei	71182643f7	Merge pull request #2370 from lioncash/qt-warn yuzu/loading_screen: Resolve runtime Qt string formatting warnings	2019-04-09 17:21:18 -04:00
bunnei	bc7e149835	Merge pull request #2369 from FernandoS27/mip-align gl_backend: Align Pixel Storage	2019-04-09 17:20:43 -04:00
bunnei	088c7c1bb5	Merge pull request #2368 from FernandoS27/fix-lop Correct LOP_IMM encoding	2019-04-09 17:19:56 -04:00
Fernando Sahmkow	cd91e98dab	Correct Fermi Copy on Linear Textures.	2019-04-09 14:13:58 -04:00
Lioncash	2abf979c35	kernel/process: Set page table when page table resizes occur. We need to ensure dynarmic gets a valid pointer if the page table is resized (the relevant pointers would be invalidated in this scenario). In this scenario, the page table can be resized depending on what kind of address space is specified within the NPDM metadata (if it's present).	2019-04-09 13:00:56 -04:00
Fernando Sahmkow	7c458311d3	Implement Texture Format ZF32_X24S8.	2019-04-09 12:33:46 -04:00
Fernando Sahmkow	b0aa8ad736	Correct depth compare with color formats for R32F	2019-04-09 12:06:59 -04:00
Lioncash	b73e433dff	yuzu/loading_screen: Resolve runtime Qt string formatting warnings In our error console, when loading a game, the strings: QString::arg: Argument missing: "Loading...", 0 QString::arg: Argument missing: "Launching...", 0 would occasionally pop up when the loading screen was running. This was due to the strings being assumed to have formatting indicators in them, however only two out of the four strings actually have them. This only applies the arguments to the strings that have formatting specifiers provided, which avoids these warnings from occurring.	2019-04-09 10:49:38 -04:00
zarroboogs	be6466d5c0	added a toggle to force 30fps mode	2019-04-09 02:14:03 +03:00
Fernando Sahmkow	9f16833097	gl_backend: Align Pixel Storage This commit makes sure GL reads on the correct pack size for the respective texture buffer.	2019-04-08 17:16:02 -04:00
Fernando Sahmkow	5c55ae4e18	Correct LOP_IMN encoding	2019-04-08 13:39:12 -04:00
Fernando Sahmkow	16adc735a5	Correct XMAD mode, psl and high_b on different encodings.	2019-04-08 13:01:17 -04:00
Fernando Sahmkow	ef8be408d3	Adapt Bindless to work with AOFFI	2019-04-08 12:07:56 -04:00
Fernando Sahmkow	492040bd9c	Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format.	2019-04-08 11:36:11 -04:00
Fernando Sahmkow	797e351bf8	Fix bad rebase	2019-04-08 11:35:22 -04:00
Fernando Sahmkow	c60b0b8432	Fix TMML	2019-04-08 11:35:22 -04:00
Fernando Sahmkow	a77e9a27b0	Simplify ConstBufferAccessor	2019-04-08 11:35:19 -04:00
Fernando Sahmkow	fd4e994de3	Refactor GetTextureCode and GetTexCode to use an optional instead of optional parameters	2019-04-08 11:35:18 -04:00
Fernando Sahmkow	4841440382	Implement TXQ_B	2019-04-08 11:29:52 -04:00
Fernando Sahmkow	189bd1980c	Implement TMML_B	2019-04-08 11:29:49 -04:00
Fernando Sahmkow	ac3ba9a33e	Corrections to TEX_B	2019-04-08 11:28:44 -04:00
Fernando Sahmkow	90d06acfed	Fixes to Const Buffer Accessor and Formatting	2019-04-08 11:23:47 -04:00
Fernando Sahmkow	7af82ca022	Implement Bindless Handling on SetupTexture	2019-04-08 11:23:46 -04:00
Fernando Sahmkow	fe392fff24	Unify both sampler types.	2019-04-08 11:23:45 -04:00
Fernando Sahmkow	e28fd3d0a5	Implement Bindless Samplers and TEX_B in the IR.	2019-04-08 11:23:42 -04:00
Fernando Sahmkow	c4ac05c82c	Implement Const Buffer Accessor	2019-04-08 11:19:34 -04:00
Lioncash	b117ca5fce	kernel/svc: Deglobalize the supervisor call handlers Adjusts the interface of the wrappers to take a system reference, which allows accessing a system instance without using the global accessors. This also allows getting rid of all global accessors within the supervisor call handling code. While this does make the wrappers themselves slightly more noisy, this will be further cleaned up in a follow-up. This eliminates the global system accessors in the current code while preserving the existing interface.	2019-04-07 20:30:05 -04:00
bunnei	f14328bf0a	Merge pull request #2300 from FernandoS27/null-shader shader_cache: Permit a Null Shader in case of a bad host_ptr.	2019-04-07 17:58:27 -04:00
bunnei	c2fee0e519	Merge pull request #2355 from ReinUsesLisp/sync-point maxwell_3d: Reduce severity of ProcessSyncPoint	2019-04-07 17:56:11 -04:00
bunnei	8aaf418bd6	Merge pull request #2306 from ReinUsesLisp/aoffi shader_ir: Implement AOFFI for TEX and TLD4	2019-04-07 17:52:30 -04:00
bunnei	3c1ce290d0	Merge pull request #2361 from lioncash/pagetable core/memory: Minor simplifications to page table management	2019-04-07 17:50:31 -04:00
bunnei	6b18a1592f	Merge pull request #2321 from ReinUsesLisp/gl-state-rework gl_state: Rework to enable individual applies	2019-04-07 17:50:07 -04:00
bunnei	21a4e7deea	Merge pull request #2098 from FreddyFunk/disk-cache-zstd gl_shader_disk_cache: Use Zstandard for compression	2019-04-07 17:48:33 -04:00
bunnei	52ad5fa0e8	Merge pull request #2356 from lioncash/pair kernel/{server_port, server_session}: Return pairs instead of tuples from pair creation functions	2019-04-07 17:48:00 -04:00
bunnei	d9b1c24f4f	Merge pull request #2362 from lioncash/enum core/memory: Remove unused enum constants	2019-04-07 17:46:09 -04:00
bunnei	80162888e6	Merge pull request #2352 from bunnei/mem-manager-fixes memory_manager: Improved implementation of read/write/copy block.	2019-04-07 17:44:59 -04:00
Fernando Sahmkow	021cd56bc9	Permit a Null Shader in case of a bad host_ptr.	2019-04-07 07:52:01 -04:00
Lioncash	36a1e6a982	core/memory: Remove unused enum constants These are holdovers from Citra and can be removed.	2019-04-07 03:04:55 -04:00
Lioncash	abae7577d2	core/memory: Remove GetCurrentPageTable() Now that nothing actually touches the internal page table aside from the memory subsystem itself, we can remove the accessor to it.	2019-04-07 02:47:37 -04:00
Lioncash	a6a82bb004	arm/arm_dynarmic: Remove unnecessary current_page_table member Given the page table will always be guaranteed to be that of whatever the current process is, we no longer need to keep this around.	2019-04-07 02:43:51 -04:00
Lioncash	e779686a76	kernel: Handle page table switching within MakeCurrentProcess() Centralizes the page table switching to one spot, rather than making calling code deal with it everywhere.	2019-04-07 01:12:54 -04:00
Lioncash	7a7ffa602d	kernel/server_session: Return a std::pair from CreateSessionPair() Keeps the return type consistent with the function name. While we're at it, we can also reduce the amount of boilerplate involved with handling these by using structured bindings.	2019-04-06 01:42:03 -04:00
Lioncash	04d265562f	kernel/server_port: Return a std::pair from CreatePortPair() Returns the same type that the function name describes.	2019-04-06 01:36:53 -04:00
ReinUsesLisp	ddcb711ee8	maxwell_3d: Reduce severity of ProcessSyncPoint	2019-04-06 02:18:20 -03:00
Lioncash	89c106e31b	video_core/textures/convert: Replace include with a forward declaration Avoids dragging in a direct dependency in a header.	2019-04-06 00:14:36 -04:00
Lioncash	fbf452ab0e	video_core/texures/texture: Remove unnecessary includes Nothing in this header relies on common_funcs or the memory manager. This gets rid of reliance on indirect inclusions in the OpenGL caches.	2019-04-06 00:03:35 -04:00
Lioncash	218ae888f3	yuzu/debugger: Remove graphics surface viewer This doesn't actually work anymore, and given how long it's been left in that state, it's unlikely anyone actually seriously used it. Generally it's preferable to use RenderDoc or Nsight to view surfaces.	2019-04-05 23:54:00 -04:00

1 2 3 4 5 ...

9896 Commits