DeepSeek

The Future is Now!

Thursday 13 Dec 2040 17:37:57



Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence (AI) company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, DeepSeek is owned and funded by the Chinese hedge fund High-Flyer. It was founded in July 2023 by Liang Wenfeng, the co-founder of High-Flyer, who serves as CEO of both companies. The company launched an eponymous chatbot alongside its DeepSeek-R1 model in January 2025.
Released under the MIT License, DeepSeek-R1 provides responses comparable to those of other contemporary large language models, such as OpenAI's GPT-4 and o1. Its training cost was reported to be significantly lower than that of other LLMs. The company claims that it trained its V3 model for US$6 million, far less than the US$100 million cost of OpenAI's GPT-4 in 2023, and with approximately one-tenth of the computing power consumed by Meta's comparable model, Llama 3.1. DeepSeek's success against larger and more established rivals has been described as "upending AI".
DeepSeek's models are described as "open weight", meaning the exact parameters are openly shared, although certain usage conditions differ from those of typical open-source software. The company reportedly recruits AI researchers from top Chinese universities and also hires from outside traditional computer science fields to broaden its models' knowledge and capabilities. DeepSeek significantly reduced training expenses for its R1 model by incorporating techniques such as mixture of experts (MoE) layers. The company also trained its models during ongoing trade restrictions on AI chip exports to China, using weaker AI chips intended for export and fewer units overall.
Observers say this breakthrough sent "shock waves" through the industry and triggered what was described as a "Sputnik moment" for the US in the field of artificial intelligence, particularly because of the company's open-source, cost-effective, and high-performing AI models. It also threatened established AI hardware leaders such as Nvidia: Nvidia's share price dropped sharply, and the company lost US$600 billion in market value, the largest single-company decline in U.S. stock market history.



Article title : DeepSeek
"deepseek-ai/DeepSeek-Coder, DeepSeek, 27 January 2025, archived from the original on 27 January 2025, retrieved 27 January 2025 "deepseek-ai/deepseek-coder-5..."
Article title : DeepSeek (chatbot)
"chatbot services. Notably DeepSeek has said that these new models will be released and made open source. On 30 April 2025, Deepseek released its math-focused..."
Article title : Liang Wenfeng
"2025. "DeepSeek-R1 Release | DeepSeek API Docs". api-docs.deepseek.com. Retrieved 28 January 2025. "DeepSeek-R1/DeepSeek_R1.pdf at main · deepseek-ai/DeepSeek-R1"..."
Article title : Prompt injection
"Infosecurity Magazine reported that DeepSeek-R1, a large language model (LLM) developed by Chinese AI startup DeepSeek, exhibited vulnerabilities to direct..."
Article title : Reasoning model
""DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning". arXiv:2501.12948 [cs.CL]. DeepSeek 支持"深度思考+联网检索"能力 [DeepSeek adds..."
Article title : List of large language models
"September 2025. "Introducing DeepSeek-V3.2-Exp | DeepSeek API Docs". api-docs.deepseek.com. Retrieved 2025-10-01. "deepseek-ai/DeepSeek-V3.2-Exp · Hugging Face"..."
Article title : Mixture of experts
"January 2024). "DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models". arXiv:2401.06066 [cs.CL]. DeepSeek-AI; Liu, Aixin;..."
Article title : Qwen
"Alibaba, Qwen2.5-Max outperforms other foundation models such as GPT-4o, DeepSeek-V3, and Llama-3.1-405B in key benchmarks. In February 2025, Alibaba announced..."
Article title : 2025 in artificial intelligence
"Genesis launch Timeline of artificial intelligence "Release DeepSeek-R1 · deepseek-ai/DeepSeek-R1@23807ce". GitHub. Archived from the original on 21 January..."
Article title : High-Flyer
"location is in Hangzhou, Zhejiang. It is the founder and backer of AI firm DeepSeek. High-Flyer was founded in February 2016 by Liang Wenfeng and two of his..."

[source: wikipedia]

