Deep Seek

Organization

A Chinese AI model approaching western frontier capabilities in areas like cybersecurity.

First Mentioned

1/9/2026, 4:44:55 AM

Last Updated

5/10/2026, 5:09:27 AM

Research Retrieved

1/9/2026, 4:47:13 AM

Summary

DeepSeek (Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.) is a Chinese artificial intelligence company founded in July 2023 by Liang Wenfeng. Based in Hangzhou and funded by the hedge fund High-Flyer, the organization specializes in developing high-performance large language models (LLMs). DeepSeek gained international prominence in early 2025 for its ability to produce models like DeepSeek-R1 and V3 that rival proprietary systems from OpenAI and Meta at a fraction of the cost. By utilizing Mixture of Experts (MoE) architecture and reinforcement learning, the company trained its V3 model for approximately $6 million, significantly lower than the estimated $100 million cost for GPT-4. Despite US export controls on advanced semiconductors, DeepSeek's success with "open weight" models has been described as a "Sputnik moment" for the AI industry, notably contributing to a record $600 billion single-day market value loss for Nvidia.

Referenced in 3 Documents

Research Data

Extracted Attributes

CEO
Liang Wenfeng
Founded
2023-07-01
Founder
Liang Wenfeng
Full Name
Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.
Headquarters
Hangzhou, Zhejiang, China
Software License
MIT License
Core Architecture
Mixture of Experts (MoE)
Parent Organization
High-Flyer (Hedge Fund)
Model Parameters (R1)
670 billion
Training Cost (V3 Model)
$6,000,000 USD

Timeline

DeepSeek is founded in Hangzhou, China, by Liang Wenfeng. (Source: Wikipedia)
2023-07-01
DeepSeek releases the DeepSeek-R1 model and its eponymous chatbot under the MIT License. (Source: CSAIL Alliances - MIT)
2025-01-20
Nvidia's market value drops by $600 billion, the largest single-day decline in U.S. history, attributed to DeepSeek's industry disruption. (Source: Wikipedia)
2025-01-27
DeepSeek-V3.2 is released, featuring enhanced agent capabilities and reasoning integration. (Source: DeepSeek Official Website)
2025-02-06

Wikipedia

View on Wikipedia

DeepSeek

Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence (AI) company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, DeepSeek is owned and funded by the Chinese hedge fund High-Flyer. DeepSeek was founded in July 2023 by Liang Wenfeng, the co-founder of High-Flyer, who also serves as the CEO for both of the companies. The company launched an eponymous chatbot alongside its DeepSeek-R1 model in January 2025. Released under the MIT License, DeepSeek-R1 provides responses comparable to other contemporary large language models, such as OpenAI's GPT-4 and o1. Its training cost was reported to be significantly lower than other LLMs. The company claims that it trained its V3 model for US$6 million—far less than the US$100 million cost for OpenAI's GPT-4 in 2023—and using approximately one-tenth the computing power consumed by Meta's comparable model, Llama 3.1. DeepSeek's success against larger and more established rivals has been described as "upending AI". DeepSeek's models are described as "open weight," meaning the exact parameters are openly shared, although certain usage conditions differ from typical open-source software. The company reportedly recruits AI researchers from top Chinese universities and also hires from outside traditional computer science fields to broaden its models' knowledge and capabilities. DeepSeek significantly reduced training expenses for their R1 model by incorporating techniques such as mixture of experts (MoE) layers. The company also trained its models during ongoing trade restrictions on AI chip exports to China, using weaker AI chips intended for export and employing fewer units overall. Observers say this breakthrough sent "shock waves" through the industry which were described as triggering a "Sputnik moment" for the US in the field of artificial intelligence, particularly due to its open-source, cost-effective, and high-performing AI models. This threatened established AI hardware leaders such as Nvidia; Nvidia's share price dropped sharply, losing US$600 billion in market value, the largest single-company decline in U.S. stock market history.

Web Search Results

What is Deep Seek? | DeepSeek AI Blog
## FAQ ### What is Deep Seek? Deep Seek is a new search technology that goes beyond old search engines. It lets users find information in a deeper and more precise way. It uses artificial intelligence to improve how we find and get information. ### How does Deep Seek's technology differ from traditional search engines? Deep Seek is different because it uses semantic search algorithms. This means it understands the context and meaning of what you're searching for. This leads to more accurate and relevant results, making searching better. ### Can Deep Seek enhance academic and scientific research? [...] ## How Deep Seek Revolutionizes Online Research Traditional search engines often leave gaps in results, forcing users to sift through irrelevant data. Deep Seek bridges this gap with artificial intelligence search that prioritizes precision. Imagine a journalist hunting for obscure historical documents or a student struggling to find niche academic sources. Deep Seek's algorithms process queries differently, uncovering connections even seasoned researchers might miss. [...] ## Understanding Deep Seek Technology Deep Seek started by fixing problems in old search tools. Before it was launched, people noticed that regular search engines had trouble with hard questions. This led to the idea of making a system that could look deeper than just simple keywords. > "We wanted to create a search engine that thinks like a human, not a machine." — Deep Seek Development Team Deep Seek was launched in 2020, coming from MIT's AI Lab. The team behind it included Dr. Elena Torres and Dr. Raj Patel. They worked on understanding words and using neural networks. ### Core Technological Components
DeepSeek-R1 explained : Pioneering the Next Era of Reasoning ...
Enter DeepSeek, a revolutionary framework that reimagines reasoning in LLMs through pure reinforcement learning (RL). By enabling models to autonomously develop reasoning behaviors, DeepSeek’s first-generation models — DeepSeek-R1-Zero and DeepSeek-R1 — set new benchmarks, rivaling proprietary systems like OpenAI’s cutting-edge models. DeepSeek goes further by democratizing access to high-performance AI. Through innovative distillation techniques, it transfers advanced reasoning capabilities to smaller, more efficient models, making powerful AI accessible and cost-effective. This dual focus on scalability and efficiency positions DeepSeek as a transformative force in AI development. [...] This blog explores DeepSeek’s groundbreaking RL-based training, its multi-stage pipeline, and the distillation process that empowers smaller models. Join us as we uncover how DeepSeek is reshaping the future of reasoning in LLMs and democratizing advanced AI for a broader audience. ## Written by Sahin Ahmed, Data Scientist 1.3K followers ·183 following Lifelong learner passionate about AI, LLMs, Machine Learning, Deep Learning, NLP, and Statistical Modeling to make a meaningful impact. MSc in Data Science. ## Responses (3) Help Status About Text to speech
DeepSeek: What You Need to Know | CSAIL Alliances - MIT
DeepSeek is a small artificial intelligence lab and startup based in Hangzhou, China, founded in 2023 by Liang Wenfeng, a prominent investor and entrepreneur in AI technology. In addition to being the company’s CEO, Wenfeng also created the hedge fund solely responsible for funding DeepSeek, High-Flyer. Forbes says, “This unique funding model has allowed DeepSeek to pursue ambitious AI projects without the pressure of external investors, enabling it to prioritize long-term research and development.” [...] On January 20th, 2025 DeepSeek released DeepSeek R1, a new open-source Large Language Model (LLM) which is comparable to top AI models like ChatGPT but was built at a fraction of the cost, allegedly coming in at only $6 million. For comparison, ChatGPT4 is estimated to have cost OpenAI over $100 million. DeepSeek R1 has about 670 billion parameters, making it the largest open-source LLM yet, according to BBC. [...] While DeepSeek R1 is rapidly gaining popularity, many worry about the security threats. Forbes points out that DeepSeek’s privacy policy expressly tells users that extensive personal information (including IP addresses and keystrokes) is collected and stored in the People’s Republic of China. Chinese national security laws require Chinese firms to share data with government agencies (one reason for the scrutiny on TikTok), so businesses should exercise caution in using DeepSeek with sensitive data. With its explosion in popularity, DeepSeek has already faced a large-scale cyberattack which forced the platform to disable new user registrations.
What is DeepSeek? AI Model Basics Explained - YouTube
#deepseek #ai #machinelearning [...] unique to models from DeepSeek. There are models from the French AI company Mistral that also use this, and in fact the IBM Granite model that is also built on a mixture of experts architecture. So it's a commonly used architecture. So that is DeepSeek R1. It's an AI reasoning model that is matching other industry leading models on reasoning benchmarks, while being delivered at a fraction of the cost in both training and inference. All of which makes me think this is an exciting time for AI reasoning models. [...] # What is DeepSeek? AI Model Basics Explained ## IBM Technology 1510000 subscribers 4511 likes ### Description 231538 views Posted: 6 Feb 2025 Want to learn more about how to choose the right AI foundation model? Read the Ebook here → Learn more about DeepSeek here → Want to hear more about the facts and hype of DeepSeek from our Mixture of Experts? Watch here → Explore the unique capabilities of DeepSeek, an innovative AI reasoning model. 🌟 Martin Keen and Aaron Baughman discuss the evolution of DeepSeek models, focusing on DeepSeek-R1. Learn how it uses chain of thought reasoning, reinforcement learning, and expert architectures to achieve top-tier performance efficiently. 🚀 AI news moves fast. Sign up for a monthly newsletter for AI updates from IBM →
DeepSeek | 深度求索
DeepSeek | 深度求索 Image 1: DeepSeek Logo 🎉 DeepSeek-V3.2 正式版发布，强化 Agent 能力，融入思考推理，在网页端、APP 和 API 全面上线，点击查看详情。探索未至之境开始对话与 DeepSeek-V3.2 免费对话体验全新旗舰模型API 开放平台调用 DeepSeek 最新模型快速集成、流畅体验获取手机 AppEnglish Image 2: DeepSeek Logo © 2025 杭州深度求索人工智能基础技术研究有限公司版权所有浙ICP备2023025841号浙B2-20250178浙公网安备33010502011812号研究 DeepSeek R1DeepSeek V3DeepSeek Coder V2DeepSeek VLDeepSeek V2DeepSeek CoderDeepSeek MathDeepSeek LLM 产品 DeepSeek AppDeepSeek 网页版开放平台API 价格服务状态法务 & 安全隐私政策用户协议反馈安全漏洞加入我们岗位详情

Deep Seek

First Mentioned

Last Updated

Research Retrieved

Summary

Referenced in 3 Documents

Research Data

Extracted Attributes

CEO

Founded

Founder

Full Name

Headquarters

Software License

Core Architecture

Parent Organization

Model Parameters (R1)

Training Cost (V3 Model)

Timeline

Wikipedia

DeepSeek

Web Search Results