DeepSeek
The Future is Now !
Thursday 17 May 2040 08:12:04
Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, it is owned and funded by the Chinese hedge fund High-Flyer. DeepSeek was founded in July 2023 by Liang Wenfeng, the co-founder of High-Flyer, who also serves as the CEO for both companies. The company launched an eponymous chatbot alongside its DeepSeek-R1 model in January 2025. Released under the MIT License, DeepSeek-R1 provides responses comparable to other contemporary large language models, such as OpenAI's GPT-4 and o1. Its training cost was reported to be significantly lower than other LLMs. The company claims that it trained its V3 model for US$6 million—far less than the US$100 million cost for OpenAI's GPT-4 in 2023—and using approximately one-tenth the computing power consumed by Meta's comparable model, Llama 3.1. DeepSeek's success against larger and more established rivals has been described as "upending AI". DeepSeek's models are described as "open weight," meaning the exact parameters are openly shared, although certain usage conditions differ from typical open-source software. The company reportedly recruits AI researchers from top Chinese universities and also hires from outside traditional computer science fields to broaden its models' knowledge and capabilities. DeepSeek significantly reduced training expenses for their R1 model by incorporating techniques such as mixture of experts (MoE) layers. The company also trained its models during ongoing trade restrictions on AI chip exports to China, using weaker AI chips intended for export and employing fewer units overall. Observers say this breakthrough sent "shock waves" through the industry, threatening established AI hardware leaders such as Nvidia; Nvidia's share price dropped sharply, losing US$600 billion in market value, the largest single-company decline in U.S. stock market history.
Article title : DeepSeek
"Retrieved 27 January 2025. deepseek-ai/DeepSeek-Coder, DeepSeek, 27 January 2025, retrieved 27 January 2025 "deepseek-ai/deepseek-coder-5.7bmqa-base · Hugging..."
Article title : DeepSeek (chatbot)
"chatbot services. Notably DeepSeek has said that these new models will be released and made open source. On 30 April 2025, Deepseek released its math-focused..."
Article title : Liang Wenfeng
"2025. "DeepSeek-R1 Release | DeepSeek API Docs". api-docs.deepseek.com. Retrieved 28 January 2025. "DeepSeek-R1/DeepSeek_R1.pdf at main · deepseek-ai/DeepSeek-R1"..."
Article title : Reasoning language model
"January 2025, DeepSeek released R1, a model competitive with o1 at lower cost, highlighting the effectiveness of GRPO. On January 25, 2025, DeepSeek launched..."
Article title : Prompt injection
"Infosecurity Magazine reported that DeepSeek-R1, a large language model (LLM) developed by Chinese AI startup DeepSeek, exhibited vulnerabilities to prompt..."
Article title : Retro (soundtrack)
"November 2024. Santhosh, through his X account, had claimed that he used DeepSeek for producing the background scores for the film, which helped him to reduce..."
Article title : List of large language models
"Retrieved 2024-07-23. deepseek-ai/DeepSeek-V3, DeepSeek, 2024-12-26, retrieved 2024-12-26 Feng, Coco (25 March 2025). "DeepSeek wows coders with more..."
Article title : Mixture of experts
"(11 January 2024), DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models, arXiv:2401.06066 DeepSeek-AI; Liu, Aixin; Feng..."
Article title : 2025 in artificial intelligence
"algorithms. Timeline of artificial intelligence "Release DeepSeek-R1 · deepseek-ai/DeepSeek-R1@23807ce". GitHub. Archived from the original on 21 January..."
Article title : High-Flyer
"location is in Hangzhou, Zhejiang. It is the founder and backer of AI firm DeepSeek. High-Flyer was founded in February 2016 by Liang Wenfeng and two of his..."
Retrieved 27 January 2025. deepseek-ai/DeepSeek-Coder, DeepSeek, 27 January 2025, retrieved 27 January 2025 "deepseek-ai/deepseek-coder-5.7bmqa-base · Hugging
[source: wikipedia]