Skip to content

Conversation

@Marviel
Copy link

@Marviel Marviel commented Jan 22, 2026

Summary

Adds Infinity as an OpenAI-compatible provider with 4 models.

Provider Details

Models

Model Context Input Output Cached Features
QuantTrio/GLM-4.7-AWQ 64K $0.40/1M $1.50/1M $0.20/1M Reasoning, Tool Calls
QuantTrio/DeepSeek-V3.2-AWQ 32K $0.28/1M $0.42/1M Reasoning, Tool Calls
openai/gpt-oss-120b 32K $0.04/1M $0.19/1M Reasoning, Tool Calls
nvidia/Kimi-K2-Thinking-NVFP4 32K $0.40/1M $1.75/1M Reasoning, Tool Calls

All models are open-weights and support temperature control.

@Marviel Marviel changed the title Add Infinity as a provider feat: Add Infinity provider Jan 22, 2026
@Marviel Marviel force-pushed the dev branch 2 times, most recently from 710d31a to 8b93818 Compare January 22, 2026 18:36
- Provider: Infinity (https://api.infinity.inc/v1)
- Uses @ai-sdk/openai-compatible for OpenAI-compatible API
- Models:
  - QuantTrio/GLM-4.7-AWQ: $0.40/1M input, $0.20/1M cached (64K context, reasoning)
  - QuantTrio/DeepSeek-V3.2-AWQ: $0.28/1M input, $0.42/1M output (32K context)
  - openai/gpt-oss-120b: $0.04/1M input, $0.19/1M output (32K context)
  - nvidia/Kimi-K2-Thinking-NVFP4: $0.40/1M input, $1.75/1M output (32K context, reasoning)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant