Foundry Local is an end-to-end local AI solution for building applications that run entirely on the user's device. It provides native SDKs (C#, JavaScript, Python, and Rust), a curated catalog of ...
[2024-11-12]: Support for sageattn_varlen is available now. For SageAttention V1 in Triton (slower than SageAttention V2/V2++/V3), refer to SageAttention-1 branch and install using pip: pip install ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results