陆首群评 DeepSeek 英文版在 Hugging Face 上发表了

2025-06-14 發表於开源资讯

“陆主席评DeepSeek”（英文版）在Hugging Face上发表了！

链接：https://huggingface.co/blog/COPU2004/lu-shouquns-view-of-deepseek

以下为英文版全文：

Honorary Chairman of China OSS (Open Source Software) Promotion Union (COPU)

Abstract

DeepSeek can be seen as a representative work of China's current AI, exploring a new path for the development of AI, which may change the development pattern of AI in the world and trigger a fierce competition in global AI. DeepSeek insists on open source innovation. At present, it is also a generative autoregressive language model, which inevitably has negative defects such as limitations and hallucinations of common language models. The author of this article proposes that DeepSeek should make corrections and transition towards the development of AI.

DeepSeek’s greatest success is that Liang Wenfeng’s team has developed a new path for developing AI with an innovative attitude: “low investment, low cost, limited resources, high efficiency, and high cost performance (output)”.
DeepSeek can be regarded as a representative work of China's current AI and is changing the development pattern of AI in the world. It lowered the bar for the public and enterprises globally to use AI. Open up a smooth road for emerging forces to develop AI. Negating the old path to develop AI with "huge investment, high cost, massive resources, low efficiency, and low cost performance (output)".
It is not an exaggeration to call Liang Wenfeng’s team a group of wizards or geniuses who have achieved “national destiny” innovation!
Liang Wenfeng's team insists on open source innovation. Open source helps the iterative innovation, stability and upgrade of AI, and the development of the ecosystem. DeepSeek integrates the full open source of the C-end of the large model and the implementation of an open source business model on the B-end. It not only implements open source innovation, but also supports the development of the open source industry. This is also a major creation of DeepSeek.
Some people use this to suppress DeepSeek by rating according to the current output product rankings. In fact, the output performance of DeepSeek and other large generative language models is on par with each other, and there is no exaggerated situation of one being higher than the other. If we compare them in a more scientific way based on cost-effectiveness, DeepSeek is definitely the best in the world.
Currently, there are not many secrets about DeepSeek’s key technology. Some large generative language models in China and abroad have basically learned DeepSeek’s key technology. When it comes to the next stage of AI competition, it can be said that everyone is on the same starting line.
The advent of DeepSeek has triggered a fierce competition in global AI.
The current DeepSeek model, like other large language models, are a generative autoregressive large language model. Limitations and negative defects exists in DeepSeek, affecting its performance. In DeepSeek’s development, it is important to overcome limitations, root out defects, greatly improve intelligence, save energy and increase efficiency, and expand applications.
For the generative autoregressive language model, since language cannot replace the real world, it lacks world knowledge, or cannot generate new knowledge to truly understand the physical world. In addition, language is not equal to thinking, and it also limits the depth of thinking during operation, which ultimately limits the level of intelligence produced. The autoregressive mechanism of the language model training architecture is based on Tokens and the signal processing and statistics it supports, which is the root cause of the hallucination.
DeepSeek, like other standard and inclusive base models, is difficult to directly transform into high-quality productivity for enterprises and industries. It still needs to improve its temporarily missing commercial value. They lack a deep understanding of enterprises and industries. While they are really applied in the business scenarios of enterprises and industries (such as finance, manufacturing, medical care, etc.), to generate value for enterprises and industries, they must capture the data of enterprises and industries and then apply them to fill the gaps.
It is suggested that an important task for DeepSeek’s development is to solve its problem of deviation and transition, and strive to win in the fierce global competition.
The goal of calibrating DeepSeek is to develop real and advanced AI - Artificial General Intelligence (AGI). When developing AGI, we must avoid being impatient for quick success. To achieve AGI, we must first develop the tasks of AI in the transition stage (such as multimodality, embodiment, agents and world models, etc.). AGI is an AI with an autonomous system. AGI is at a crossroads of whether AI intelligence can surpass humans. This is related to whether it affects human safety and even affects the extremely serious problem of whether humans can survive on the earth. When developing AGI to ensure preventive measures for human safety, it also requires countries around the world to take unified actions on the basis of mutual trust and implement the policy of combining technology and management (regulation). The task is extremely severe and arduous.

中文版：https://www.oschina.net/news/355187/lu-shouquns-view-of-deepseek

陆首群评 DeepSeek 英文版在 Hugging Face 上发表了

相關推薦

陆首群评 DeepSeek

DeepSeek 透露下一代国产芯片即将发布

Hugging Face 发布开放权重模型贡献榜：Qwen 与 DeepSeek 跻身 TOP15

Hugging Face 牵头推动 Transformers 库模型架构标准化

Hugging Face 发布 Open Computer Agent，文字指令远程操控虚拟电脑

Llama 4 不是真开源，比 DeepSeek 格局差多了

百川智能发布 70 亿参数开源中英文大模型 baichuan-7B

Hugging News #0626: 音频课程更新、在线体验 baichuan-7B 模型

DeepSeek 开源 DeepSeek-Prover-V2-671B

Qwen3 全球下载量破千万

小红书开源基于 VLM 的文档解析模型 dots.ocr

Hugging Face 被限制访问

Hugging Face 发布 AI Sheets，使用 AI 模型处理数据集的开源工具

Hugging Face 加入 PyTorch 基金会，成为首要成员