Qrwkv 72B (free)

免费使用

featherless/qwerky-72b:free

上下文长度: 32,768 text->text Other 2025-03-20 更新

Qrwkv-72B is a linear-attention RWKV variant of the Qwen 2.5 72B model, optimized to significantly reduce computational cost at scale. Leveraging linear attention, it achieves substantial inference speedups (>1000x) while retaining competitive accuracy on common benchmarks like ARC, HellaSwag, Lambada, and MMLU. It inherits knowledge and language support from Qwen 2.5, supporting approximately 30 languages, making it suitable for efficient inference in large-context applications.

模型参数

架构信息

模态: text->text

Tokenizer: Other

限制信息

上下文长度: 32,768

最大回复长度: 4,096

Qrwkv 72B (free)

模型参数

架构信息

限制信息

相关模型

Venice: Uncensored (free)

TheDrummer: Valkyrie 49B V1

TheDrummer: Skyfall 36B V2

TheDrummer: Anubis Pro 105B V1

Tencent: Hunyuan A13B Instruct (free)

Tencent: Hunyuan A13B Instruct