From Twitter/X: DeepSeek is rolling out a limited V4 gray release.
Posted by jmorant555@reddit | LocalLLaMA | View on Reddit | 14 comments
Source: https://x.com/i/status/2041458478569689589
FullOf_Bad_Ideas@reddit
UI did change for me too, I see Instant and Expert models, it sounds like this may be it.
RetiredApostle@reddit
Found the probable source from 2026-04-04 (Chinese, inaccessible from a US IP): https://www.ai-indeed.com/encyclopedia/18756.html
The article states the knowledge was updated to May 2025 (per DeepSeek's translation).
anotheruser323@reddit
Here, a firefox translated c/p from that site:
DeepSeek V4 Grayscale Test Update Technical Guide: Millions of Contexts, Fast Responses, and New Architecture Perspectives 2026-04-04 14:24:23
DeepSeek V4 grayscale testing is a deep search for the next generation of flagship models of technical preheating and stress testing, the core update includes a million Token long context, knowledge base timeliness improvement and response speed optimization, laying the foundation for the official version of the technology landing.
Outline of this document
DeepSeek V4 Grayscale Test Update Technical Guide: Million-Cash, Fast Response, and New Architecture I. Core Upgrades at a Glance: What Changes the Grayscale Test Has Made
The most intuitive upgrade of this grayscale test is reflected in two aspects: Context window expands from 128K to 1M (1 million) Token
In the V3 series, the context capacity is about 128K Token, while the grayscale version is directly increased to 1M, expanding nearly eightfold. This means that the model can process the content of several books at once, an ultra-long code base, or thousands of pages of technical documentation. Knowledge Base Deadline Updated to May 2025
After turning off the network search function, the model can still accurately output news content from April 2025. The increased timeliness of knowledge has greatly increased usability in offline scenarios.
DeepSeek V4 Grayscale Test Update Technical Guide: Million-Critons, Fast Responses, and New Architectures Second, response speed and interaction style: efficiency-first trade-offs
The grayscale version uses a ‘speed’ strategy in exchange for faster response times at the expense of some of the quality generated, with the aim of conducting stress tests for the official version.
There has also been a significant change in the style of interaction:
The official explanation is that this is the result of “efficiency-first adjustment and boundary awareness optimization” – too much tone words and empathy can interfere with the information density of complex problems.
DeepSeek V4 Grayscale Test Update Technical Guide: Million-Critical Context, Fast Response, and New Architecture V4 architecture foresight: technical clues from grayscale to the official version
Although the grayscale test is not an official version of V4, it involves a number of technical pre-research: Engram Conditional Memory Module
DeepSeek’s open-source Engram module in mid-January 2026 proposed a ‘Conditional Memory’ mechanism, which replaces traditional neural network computation with O(1) hash search, offloads of embedded tables with up to 100B parameters to CPU memory. mHC manifold constraint hyperconnection
The mHC (manifold constraint hyperconnection) technique, published in early January 2026, specifically addresses the stability of the trillion-parameter MoE model in training. Multi-modal ability and domestic computing power priority adaptation
The V4 official version will natively support the joint understanding and generation of text, images, video and audio, and give priority to domestic chip suppliers such as Huawei.
DeepSeek V4 Grayscale Test Update Technical Guide: Million Contexts, Fast Responses, and New Architecture How to check whether you are in the grayscale test range
Web/App-side detection method: After turning off the “deep thinking” and “network search” functions, ask the model directly the following question: “What is the length of your context window?” Or “When is your knowledge base?” If the response shows the context '1M Token', you are in the grayscale test range. The significance of grayscale testing: stress testing and ecological adaptation
From a technical point of view, the grayscale test has undertaken the preparation of long text stress test, V4 architecture verification, and the adaptation of domestic computing power. Summary
The DeepSeek V4 grayscale test brings core changes such as 1M’s long context, knowledge base updates, and more. If you are a developer or project leader, you want to experience its long context and multi-modal capabilities before the official release of V4, it is recommended to pay attention to the AI Agent platform for enterprise-level intelligent scenes, support the deep adaptation and access of the full range of DeepSeek models, help the team quickly land long text processing and complex reasoning tasks, and improve development efficiency.
No_Afternoon_4260@reddit
mHC, engram, 1M token, knowledge cutoff may 2025, if they release it, could be a paradigm shift in open source space. Enough to really mess with SOTAs. Let's hope it's closer to 1T than 5T 😌
power97992@reddit
How good is it compared to opus 4.6 , got 5,4 and mythos?
mlhher@reddit
The expert model says May 2025 as its knowledge cutoff.
mlhher@reddit
The expert model says May 2025 as its knowledge cutoff. This might be DS V4.
hurn2k@reddit
A new 'expert' model is on the website and app right now. It doesn't seem like V4 to me...
EffectiveCeilingFan@reddit
Yeah I just got off the phone with John DeepSeek, she said this is legit
AnomalyNexus@reddit
What the hell is a grey release?
ProKn1fe@reddit
Daily deepseek v4 copium
Due-Memory-6957@reddit
I don't think it's a cope, Deepseek has definitely changed on the web interface. It's known that Deepseek does tests on the web before implementing changes on the API.
Kirigaya_Mitsuru@reddit
And the Daily Trust me bro it comes Next Week for sure.
VoiceApprehensive893@reddit
a true seeker doesnt need to....