DeepSeek has significantly expanded the context window of its language model, bringing it to 1 million tokens.

Update Details

The update, first spotted on Reddit, indicates that the DeepSeek application now supports a significantly larger context window. At the same time, the model's knowledge cutoff has been extended to May 2025. It remains unclear whether the larger context window comes from a new model or from an improvement to the existing one: so far, no official announcements or updates have appeared on the project's Hugging Face page.

Implications

A larger context window allows the model to process and generate text based on a larger amount of prior information, potentially improving the quality and coherence of its outputs. For those evaluating on-premise deployments, models with extended context windows bring trade-offs in memory and compute requirements that should be weighed carefully. AI-RADAR offers analytical frameworks on /llm-onpremise to evaluate these trade-offs.
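To make the memory trade-off concrete, the sketch below estimates the KV-cache footprint of a long context for a standard transformer with multi-head (or grouped-query) attention. All parameter values here are hypothetical placeholders, not DeepSeek's actual architecture; in particular, DeepSeek's published models use techniques that compress the KV cache, so their real footprint would differ. The formula itself is the standard one: two tensors (keys and values) per layer, each of size context length times KV heads times head dimension, times bytes per element.

```python
def kv_cache_bytes(layers: int, kv_heads: int, head_dim: int,
                   context_len: int, dtype_bytes: int = 2) -> int:
    """Estimate KV-cache size in bytes for one sequence.

    Factor of 2 covers the separate key and value tensors;
    dtype_bytes=2 assumes fp16/bf16 storage.
    """
    return 2 * layers * kv_heads * head_dim * context_len * dtype_bytes

# Hypothetical model: 32 layers, 8 KV heads, head_dim 128, fp16.
size = kv_cache_bytes(layers=32, kv_heads=8, head_dim=128,
                      context_len=1_000_000)
print(f"{size / 1024**3:.1f} GiB")  # ~122 GiB for a single 1M-token sequence
```

Even under these modest assumptions, a single 1-million-token sequence consumes on the order of a hundred gigabytes of accelerator memory for the KV cache alone, which is why long-context on-premise deployments usually rely on cache compression, quantization, or offloading.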