I've got my own GPU FIR filter running successfully. The code looks terrible right now, but I'm going to clean it up and post it on GitHub. Unfortunately my vacation starts now and I'll be switching over to a Mac, so there's not much time until August, but it does mean I'll be able to tune it for Mac users and make it cleaner.
Currently running 12 channels @ 64k taps @ 192 kHz @ 2048 samples = 1.024 million taps,
and the load is... 4% on an NVIDIA RTX 3080, and 50% on the Max 8 side.
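For a rough sense of scale, here is a back-of-envelope sketch of what those figures imply. It takes the 12 channels, 192 kHz and 2048-sample block at face value, and it assumes two things that are not stated above: that "64k" means 65,536 taps per channel, and that the filter is evaluated as a plain time-domain convolution. The actual GPU code may well use FFT-based convolution, which would need far fewer operations.

```python
# Back-of-envelope workload for the figures quoted above.
# Assumptions (not from the post): "64k" means 65,536 taps per channel, and
# the filter is evaluated as a direct time-domain convolution.

CHANNELS = 12
TAPS_PER_CHANNEL = 64 * 1024  # assumed meaning of "64k"
SAMPLE_RATE_HZ = 192_000
BLOCK_SAMPLES = 2048


def block_duration_ms(block_samples: int, sample_rate_hz: int) -> float:
    """Time available to process one block before the next arrives, in ms."""
    return 1000.0 * block_samples / sample_rate_hz


def direct_form_macs_per_second(channels: int, taps: int, sample_rate_hz: int) -> int:
    """Multiply-accumulates per second if every output sample touches every tap."""
    return channels * taps * sample_rate_hz


if __name__ == "__main__":
    budget = block_duration_ms(BLOCK_SAMPLES, SAMPLE_RATE_HZ)
    macs = direct_form_macs_per_second(CHANNELS, TAPS_PER_CHANNEL, SAMPLE_RATE_HZ)
    print(f"block budget: {budget:.2f} ms")
    print(f"direct-form load: {macs / 1e9:.1f} GMAC/s")
```

Under those assumptions each block has to be finished in about 10.67 ms, and a direct-form implementation would need roughly 151 billion multiply-accumulates per second.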
I should be able to bring that 50% down to almost nothing once the CPU side of things gets multithreaded (one thread per filter).
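The "one thread per filter" idea could look something like the sketch below. This is only an illustration of the structure, not the plugin's code: `prepare_channel` is a hypothetical placeholder for whatever per-filter CPU work happens before a block is handed to the GPU, and in CPython the GIL means pure-Python threads like these will not actually run in parallel. A native implementation would use real OS threads for the same layout.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Sequence, Tuple


def prepare_channel(channel_index: int, block: Sequence[float]) -> Tuple[int, List[float]]:
    """Hypothetical stand-in for the per-filter CPU work on one block."""
    return channel_index, [float(sample) for sample in block]


def process_all_channels(blocks: Sequence[Sequence[float]]) -> List[Tuple[int, List[float]]]:
    """Run the per-filter work with one worker thread per channel."""
    # max_workers must be at least 1, even when there are no channels.
    with ThreadPoolExecutor(max_workers=max(1, len(blocks))) as pool:
        futures = [pool.submit(prepare_channel, i, b) for i, b in enumerate(blocks)]
        # Collect in submission order so results line up with channel order.
        return [future.result() for future in futures]


if __name__ == "__main__":
    print(process_all_channels([[1, 2], [3]]))
```

Results are gathered in channel order, so the caller does not need to care which thread finished first.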
Right now this is a Max 8 plugin; porting it to VST3, or anything else interesting, should be easy.
It is based on OpenCL instead of CUDA, so it should be compatible with Intel, AMD and Apple GPUs.
Max 8 has a FIR filter (buffir~), currently limited to 4096 taps max across the entire app; it is pure CPU and single-threaded.
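For anyone unsure what a FIR filter computes, here is a minimal direct-form reference in plain Python. It is not how buffir~ or the GPU version is implemented, and the function name `fir_direct` is made up for this example. It is just the textbook sum y[n] = sum over k of h[k] * x[n-k], which is handy as a slow but obviously correct baseline to compare faster implementations against.

```python
from typing import List, Sequence


def fir_direct(signal: Sequence[float], taps: Sequence[float]) -> List[float]:
    """Direct-form FIR: y[n] = sum_k taps[k] * signal[n - k].

    Samples before the start of the signal are treated as zero, and the
    output has the same length as the input.
    """
    output = []
    for n in range(len(signal)):
        acc = 0.0
        # Only taps that reach back to an existing sample contribute.
        for k in range(min(len(taps), n + 1)):
            acc += taps[k] * signal[n - k]
        output.append(acc)
    return output


if __name__ == "__main__":
    # An impulse in gives the taps back out (the impulse response).
    print(fir_direct([1.0, 0.0, 0.0, 0.0], [0.5, 0.25]))
```

The cost grows with signal length times tap count, which is why tap counts in the tens of thousands are impractical for a single-threaded time-domain loop.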
And if you don't know Max 8, you are missing out!