Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
15 changes: 13 additions & 2 deletions packages/types/src/providers/moonshot.ts
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ import type { ModelInfo } from "../model.js"
// https://platform.moonshot.ai/
export type MoonshotModelId = keyof typeof moonshotModels

export const moonshotDefaultModelId: MoonshotModelId = "kimi-k2-0711-preview"
export const moonshotDefaultModelId: MoonshotModelId = "kimi-k2-0905-preview"
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the new kimi-k2-0905-preview model already available in production? Changing the default model could break functionality for users if this model isn't accessible yet. Have you verified its availability?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.


export const moonshotModels = {
"kimi-k2-0711-preview": {
Expand All @@ -17,9 +17,20 @@ export const moonshotModels = {
cacheReadsPrice: 0.15, // $0.15 per million tokens (cache hit)
description: `Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters.`,
},
"kimi-k2-0905-preview": {
maxTokens: 32_000,
contextWindow: 262_144,
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The context window has been doubled from 131,072 to 262,144 tokens. Has this been tested with large contexts? This significant increase could impact:

  • Memory usage and performance
  • API rate limits
  • Cost calculations for users

supportsImages: false,
supportsPromptCache: true,
inputPrice: 0.6, // $0.60 per million tokens (cache miss)
outputPrice: 2.5, // $2.50 per million tokens
cacheWritesPrice: 0, // $0 per million tokens (cache miss)
cacheReadsPrice: 0.15, // $0.15 per million tokens (cache hit)
description: `Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters.`,
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The description is identical to the older kimi-k2-0711-preview model. If this is a newer version with improvements, could we update the description to reflect what's new or different about the 0905 version?

},
"kimi-k2-turbo-preview": {
maxTokens: 32_000,
contextWindow: 131_072,
contextWindow: 262_144,
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar concern here - the context window for kimi-k2-turbo-preview has been doubled. Has this change been tested for performance impact, especially given this is the 'turbo' variant optimized for speed?

supportsImages: false,
supportsPromptCache: true,
inputPrice: 2.4, // $2.40 per million tokens (cache miss)
Expand Down
Loading