海量在线大模型 兼容OpenAI API

Qrwkv 72B (free)

免费使用
开始对话
featherless/qwerky-72b:free
上下文长度: 32,768 text->text Other 2025-03-20 更新
Qrwkv-72B is a linear-attention RWKV variant of the Qwen 2.5 72B model, optimized to significantly reduce computational cost at scale. Leveraging linear attention, it achieves substantial inference speedups (>1000x) while retaining competitive accuracy on common benchmarks like ARC, HellaSwag, Lambada, and MMLU. It inherits knowledge and language support from Qwen 2.5, supporting approximately 30 languages, making it suitable for efficient inference in large-context applications.

模型参数

架构信息

模态: text->text
Tokenizer: Other

限制信息

上下文长度: 32,768
最大回复长度: 4,096