ai:tools:deepseek

AI Tools

DeepSeek

DeepSeek (ai-tool)
Full NameDeepSeek
Short NameDeepSeek
DescriptionUnravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
CompanyTBD
Logo
Webhttps://www.deepseek.com/
Categorytbd
LicenseTBD
Snippet from Wikipedia: DeepSeek

Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, it is owned and funded by the Chinese hedge fund High-Flyer. DeepSeek was founded in July 2023 by Liang Wenfeng, the co-founder of High-Flyer, who also serves as the CEO for both companies. The company launched an eponymous chatbot alongside its DeepSeek-R1 model in January 2025.

Released under the MIT License, DeepSeek-R1 provides responses comparable to other contemporary large language models, such as OpenAI's GPT-4o and o1. Its training cost is reported to be significantly lower than other LLMs. The company claims that it trained its V3 model for US$6 million compared to $100 million for OpenAI's GPT-4 in 2023, and approximately one-tenth of the computing power used for Meta's comparable model, Llama 3.1. DeepSeek's success against larger and more established rivals has been described as "upending AI".

DeepSeek's models are "open weight", which provides less freedom for modification than true open-source software. The company reportedly recruits AI researchers from top Chinese universities and hires from outside the computer science field to diversify its models' knowledge and abilities.

The DeepSeek R1 model was trained at a significantly lower cost than other models by using techniques such as mixture of experts to reduce costs. The model was also trained during ongoing trade restrictions on AI chip exports to China, causing it to be trained on weaker AI chips made for export to China, and using fewer chips compared to other models. This breakthrough in reducing expenses, although increasing efficiency and maintaining the model's performance power and quality in the AI industry, sent "shockwaves" through the market. It threatened the dominance of AI leaders like Nvidia and contributed to the largest drop for a single company in US stock market history, as Nvidia lost $600 billion in market value.

  • ai/tools/deepseek.txt
  • Last modified: 2025/02/09 19:01
  • by Henrik Yllemo