H3cJP/yuzu - yuzu - Gitea: Git with a cup of tea

H3cJP/yuzu

Author	SHA1	Message	Date
Liam	bcc2d7e69b	Vulkan: convert S8D24 <-> ABGR8	2022-03-15 20:05:21 -04:00
ameerj	a5bff8e9b3	astc_decoder: Combine FastReplicate functions to work around new NV driver bug The new Nvidia drivers have a bug where the FastReplicateTo6 function produces a lookup into the REPLICATE_TO_8 table rather than the REPLICATE_TO_6 table. This seems to be an optimization gone wrong. Combining the logic of the FastReplicate functions seems to address the bug.	2022-01-16 16:13:20 -05:00
Fernando Sahmkow	1e474fb9d1	Texture Cache: Correct conversion shaders.	2021-11-22 00:21:42 +01:00
Fernando Sahmkow	8532849439	TextureCache: Simplify blitting of D24S8 formats and fix bugs.	2021-11-22 00:00:01 +01:00
Fernando Sahmkow	b96caf200d	HostShaders: Fix D24S8 convertion shaders.	2021-11-21 21:04:04 +01:00
Fernando Sahmkow	4ca6e9a9e2	TextureCache: Assure full conversions on depth/stencil write shaders.	2021-11-20 06:17:01 +01:00
Fernando Sahmkow	e02cff2f69	TextureCache: Add R16G16 to D24S8 converter.	2021-11-20 00:02:12 +01:00
Fernando Sahmkow	1d5e6a51d7	TextureCache: Add B10G11R11 to D24S8 converter.	2021-11-19 23:22:44 +01:00
Fernando Sahmkow	b805c7bf05	TextureCache: Implement additional D24S8 convertions.	2021-11-19 06:27:44 +01:00
Fernando Sahmkow	2ec7fcecb7	Vulkan: implement D24S8 <-> RGBA8 convertions.	2021-11-19 03:17:02 +01:00
FernandoS27	d46a71e786	HostShader: fix Gaussian filter.	2021-11-16 22:11:33 +01:00
ameerj	87abab71ff	host_shaders: Misc copyright/style changes	2021-11-16 22:11:33 +01:00
Marshall Mohror	dcc5b4f6b0	Presentation: Only use FP16 in scaling shaders on supported devices in Vulkan	2021-11-16 22:11:32 +01:00
Fernando Sahmkow	99547d2656	HostShader: Fix gaussian and add attribution.	2021-11-16 22:11:32 +01:00
FernandoS27	e6f1ed08fb	Vulkan: Implement FXAA	2021-11-16 22:11:32 +01:00
Marshall Mohror	48cf376462	OpenGL: Implement FXAA	2021-11-16 22:11:32 +01:00
FernandoS27	9e065b9c7d	VideoCore: Add gaussian filtering.	2021-11-16 22:11:32 +01:00
Marshall Mohror	916b882ea8	Update scaleforce to use FP16	2021-11-16 22:11:32 +01:00
Marshall Mohror	37cb0377ae	vulkan: Implement FidelityFX Super Resolution	2021-11-16 22:11:31 +01:00
ameerj	ae8d19d17e	Renderers: Unify post processing filter shaders	2021-11-16 22:11:29 +01:00
Fernando Sahmkow	a6b88e85bf	Renderer: Implement Bicubic and ScaleForce filters.	2021-11-16 22:11:29 +01:00
ameerj	22162f906b	host_shaders: Remove opengl_copy_bgra.comp	2021-09-16 19:49:13 -04:00
ameerj	c439fc9be9	astc_decoder: Reduce workgroup size This reduces the amount of over dispatching when there are odd dimensions (i.e. ASTC 8x5), which rarely evenly divide into 32x32.	2021-08-01 01:22:27 -04:00
ameerj	5ab8053511	astc_decoder: Compute offset swizzles in-shader Alleviates the dependency on the swizzle table and a uniform which is constant for all ASTC texture sizes.	2021-08-01 01:22:26 -04:00
ameerj	b2862e4772	astc_decoder: Make use of uvec4 for payload data	2021-07-31 22:28:04 -04:00
ameerj	a75d70fa90	astc_decoder: Simplify Select2DPartition	2021-07-31 21:36:26 -04:00
ameerj	5665d05547	astc_decoder: Optimize the use EncodingData This buffer was a list of EncodingData structures sorted by their bit length, with some duplication from the cpu decoder implementation. We can take advantage of its sorted property to optimize its usage in the shader. Thanks to wwylele for the optimization idea.	2021-07-31 21:36:26 -04:00
Ameer J	bab400daaf	Merge pull request #6459 from lat9nq/ubuntu-fixes cmake: Improve Linux dependency checking for externals	2021-06-30 21:47:57 -04:00
ameerj	ace20ba4a4	astc_decoder.comp: Remove unnecessary LUT SSBOs We can move them to instead be compile time constants within the shader.	2021-06-19 10:56:13 -04:00
ameerj	31b125ef57	astc: Various robustness enhancements for the gpu decoder These changes should help in reducing crashes/drivers panics that may occur due to synchronization issues between the shader completion and later access of the decoded texture.	2021-06-19 09:00:33 -04:00
ameerj	5fc8393125	astc_decoder: Fix LDR CEM1 endpoint calculation Per the spec, L1 is clamped to the value 0xff if it is greater than 0xff. An oversight caused us to take the maximum of L1 and 0xff, rather than the minimum. Huge thanks to wwylele for finding this. Co-Authored-By: Weiyi Wang <wwylele@gmail.com>	2021-06-15 20:19:01 -04:00
lat9nq	932c0184a7	cmake: Fix find_program usage for 3.15 yuzu requires CMake 3.15 yet find_program was using REQUIRED, which is only available on 3.18 and later. Instead, we check for "<VAR>-NOTFOUND". In addition, check for additional requirements before building libusb or FFmpeg with autotools. Otherwise, CMake configuration will pass yet compilation will fail.	2021-06-13 01:15:54 -04:00
ameerj	2f83d9a61b	astc_decoder: Refactor for style and more efficient memory use	2021-03-25 16:53:51 -04:00
Rodrigo Locatti	2f30c10584	astc_decoder: Reimplement Layers Reimplements the approach to decoding layers in the compute shader. Fixes multilayer astc decoding when using Vulkan.	2021-03-13 12:16:03 -05:00
ameerj	c7553abe89	astc_decoder: Fix out of bounds memory access resolves a crash with some anamolous textures found in Astral Chain.	2021-03-13 12:16:03 -05:00
ameerj	20eb368e14	renderer_vulkan: Accelerate ASTC decoding Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc>	2021-03-13 12:16:03 -05:00
ameerj	f6566338eb	host_shaders: Modify shader cmake integration to allow for larger shaders using a raw string to encapsulate the entire shader code limits us to shaders of size less than 2KB. This change overcomes this limitation.	2021-03-13 12:16:03 -05:00
ameerj	2985e5e94c	renderer_opengl: Accelerate ASTC texture decoding with a compute shader ASTC texture decoding is currently handled by a CPU decoder for GPU's without native ASTC decoding support (most desktop GPUs). This is the cause for noticeable performance degradation in titles which use the format extensively. This commit adds support to accelerate ASTC decoding using a compute shader on OpenGL for GPUs without native support.	2021-03-13 12:16:03 -05:00
ameerj	0639244d85	renderer_opengl: Swizzle BGR textures on copy OpenGL does not natively support BGR internal formats, which causes many BGR textures to render incorrectly, with Red and Blue channels swapped. This commit aims to address this by swizzling the blue and red channels on texture copies when a BGR format is encountered.	2021-03-04 14:14:19 -05:00
ReinUsesLisp	82c2601555	video_core: Reimplement the buffer cache Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.	2021-02-13 02:17:22 -03:00
lat9nq	fc43eac82a	video_core: host_shaders: Don't pass --quiet to glslangValidator if unavailable Prevents CMake from calling `glslangValidator` with `--quiet` when it is not available, i.e. on older downstream versions from Ubuntu.	2021-02-01 23:39:54 -05:00
ReinUsesLisp	f81c783b5b	host_shaders/cmake: Pass --quiet to glslang to keep it quiet Silences noisy builds on toolchains.	2021-01-24 04:55:23 -03:00
ReinUsesLisp	21b18057f7	host_shaders: Add Vulkan assembler compute shaders	2020-12-30 02:03:50 -03:00
ReinUsesLisp	87ff58b1d7	host_shaders: Add helper to blit depth stencil fragment shader	2020-12-30 02:02:07 -03:00
ReinUsesLisp	ae5725b709	host_shaders: Add texture color blit fragment shader	2020-12-30 02:00:48 -03:00
ReinUsesLisp	64fbf319f1	host_shaders: Add shaders to present to the swapchain	2020-12-30 01:59:12 -03:00
ReinUsesLisp	82b7daed9c	host_shaders: Add shaders to convert between depth and color images	2020-12-30 01:48:44 -03:00
ReinUsesLisp	dc81a90640	host_shaders: Add compute shader to copy BC4 as RG32UI to RGBA8	2020-12-30 01:47:08 -03:00
ReinUsesLisp	5169ce9fcd	host_shaders: Add shader to render a full screen triangle	2020-12-30 01:44:09 -03:00
ReinUsesLisp	59c46f9de9	host_shaders: Add pitch linear upload compute shader	2020-12-30 01:41:42 -03:00

1 2

55 Commits