Lu Shouqun's View of DeepSeek (English Version) Published on Hugging Face


“Chairman Lu's View of DeepSeek” (English version) has been published on Hugging Face!

Link: https://huggingface.co/blog/COPU2004/lu-shouquns-view-of-deepseek

The full text of the English version follows:

Lu Shouqun, Honorary Chairman of China OSS (Open Source Software) Promotion Union (COPU)

Abstract

DeepSeek can be seen as a representative work of China's current AI. It explores a new path for AI development, one that may change the global pattern of AI development and trigger fierce worldwide competition. DeepSeek insists on open source innovation. At present, however, it is still a generative autoregressive language model, so it inevitably shares the limitations and defects common to such models, including hallucinations. The author proposes that DeepSeek correct its course and transition toward the development of real, advanced AI.

  1. DeepSeek’s greatest success is that Liang Wenfeng’s team has developed a new path for developing AI with an innovative attitude: “low investment, low cost, limited resources, high efficiency, and high cost performance (output)”.

  2. DeepSeek can be regarded as a representative work of China's current AI and is changing the global pattern of AI development. It has lowered the bar for the public and for enterprises around the world to use AI, and it has opened a smooth road for emerging forces to develop AI. In doing so, it negates the old path of developing AI with "huge investment, high cost, massive resources, low efficiency, and low cost performance (output)".

  3. It is not an exaggeration to call Liang Wenfeng’s team a group of wizards or geniuses who have achieved “national destiny” innovation!

  4. Liang Wenfeng's team insists on open source innovation. Open source supports the iterative innovation, stability, and upgrading of AI, as well as the development of its ecosystem. DeepSeek combines fully open-sourcing the large model for consumers (the C-end) with an open source business model for businesses (the B-end). It thus not only practices open source innovation but also supports the development of the open source industry. This, too, is a major creation of DeepSeek.

  5. Some people try to play down DeepSeek by rating it according to current rankings of output products. In fact, the output performance of DeepSeek and of other large generative language models is roughly on par, and none is dramatically ahead of the others. If we compare them more scientifically, on the basis of cost-effectiveness, DeepSeek is definitely the best in the world.

  6. Few secrets now remain about DeepSeek’s key technology. Some large generative language models in China and abroad have essentially mastered it. In the next stage of AI competition, it can be said that everyone is on the same starting line.

  7. The advent of DeepSeek has triggered fierce global competition in AI.

  8. The current DeepSeek model, like other large language models, is a generative autoregressive large language model. Limitations and defects exist in DeepSeek and affect its performance. In DeepSeek’s further development, it is important to overcome these limitations, root out the defects, greatly improve intelligence, save energy, increase efficiency, and expand applications.

  9. A generative autoregressive language model works only with language, and language cannot replace the real world. Such a model therefore lacks world knowledge and cannot generate the new knowledge needed to truly understand the physical world. In addition, language is not the same as thinking, so relying on it limits the depth of thinking during operation and, ultimately, the level of intelligence produced. The autoregressive mechanism of the language model training architecture is based on tokens and the associated signal processing and statistics, which is the root cause of hallucination.

  10. Like other standard, inclusive base models, DeepSeek is difficult to turn directly into high-quality productivity for enterprises and industries, and its commercial value, which is missing for now, still needs to be developed. Such models lack a deep understanding of enterprises and industries. To generate value when actually applied in business scenarios (such as finance, manufacturing, and medical care), they must capture enterprise and industry data and use it to fill the gaps.

  11. An important task for DeepSeek’s development, it is suggested, is to correct its deviations, manage its transition, and strive to win in the fierce global competition.

  12. The goal of calibrating DeepSeek is to develop real, advanced AI, namely Artificial General Intelligence (AGI). In developing AGI, we must not be impatient for quick success. To achieve AGI, we must first complete the tasks of the transition stage (such as multimodality, embodiment, agents, and world models). AGI is AI with an autonomous system. It stands at the crossroads of whether AI can surpass human intelligence, which bears on human safety and even on the extremely serious question of whether humans can survive on Earth. Ensuring preventive measures for human safety while developing AGI requires countries around the world to act in unison on the basis of mutual trust and to implement a policy that combines technology with management (regulation). The task is extremely severe and arduous.

Chinese version: https://www.oschina.net/news/355187/lu-shouquns-view-of-deepseek

