r/DeepSeek • u/West-Code4642 • Feb 21 '25

News DeepSeek to open source 5 repos next week

517 Upvotes

Tutorial DeepSeek FAQ – Updated

54 Upvotes

Welcome back! It has been three weeks since the release of DeepSeek R1, and we’re glad to see how this model has been helpful to many users. At the same time, we have noticed that due to limited resources, both the official DeepSeek website and API have frequently displayed the message "Server busy, please try again later." In this FAQ, I will address the most common questions from the community over the past few weeks.

Q: Why do the official website and app keep showing 'Server busy,' and why is the API often unresponsive?

A: The official statement is as follows:
"Due to current server resource constraints, we have temporarily suspended API service recharges to prevent any potential impact on your operations. Existing balances can still be used for calls. We appreciate your understanding!"

Q: Are there any alternative websites where I can use the DeepSeek R1 model?

A: Yes! Since DeepSeek has open-sourced the model under the MIT license, several third-party providers offer inference services for it. These include, but are not limited to: Togather AI, OpenRouter, Perplexity, Azure, AWS, and GLHF.chat. (Please note that this is not a commercial endorsement.) Before using any of these platforms, please review their privacy policies and Terms of Service (TOS).

Important Notice:

Third-party provider models may produce significantly different outputs compared to official models due to model quantization and various parameter settings (such as temperature, top_k, top_p). Please evaluate the outputs carefully. Additionally, third-party pricing differs from official websites, so please check the costs before use.

Q: I've seen many people in the community saying they can locally deploy the Deepseek-R1 model using llama.cpp/ollama/lm-studio. What's the difference between these and the official R1 model?

A: Excellent question! This is a common misconception about the R1 series models. Let me clarify:

The R1 model deployed on the official platform can be considered the "complete version." It uses MLA and MoE (Mixture of Experts) architecture, with a massive 671B parameters, activating 37B parameters during inference. It has also been trained using the GRPO reinforcement learning algorithm.

In contrast, the locally deployable models promoted by various media outlets and YouTube channels are actually Llama and Qwen models that have been fine-tuned through distillation from the complete R1 model. These models have much smaller parameter counts, ranging from 1.5B to 70B, and haven't undergone training with reinforcement learning algorithms like GRPO.

If you're interested in more technical details, you can find them in the research paper.

I hope this FAQ has been helpful to you. If you have any more questions about Deepseek or related topics, feel free to ask in the comments section. We can discuss them together as a community - I'm happy to help!

13 comments

r/DeepSeek • u/Select_Dream634 • 9h ago

News okay guys turn out the llama 4 benchmark is a fraud 10 million context window is fraud

105 Upvotes

some people who dont have idea about the context window let me tell u u can increase the context window to 1 million to 1 billion its doesnt mater if its doesnt know what inside that .

llama 4 said its 10 million but its stop understanding after the 1 lakh token in the coding .

we should thankful that deepseek is here

6 comments

r/DeepSeek • u/bi4key • 2h ago

Discussion Chinese finetune model using quantum computer Origin Wukong

13 Upvotes

Source: https://x.com/ChinaScience/status/1909168123309392133

1 comment

r/DeepSeek • u/Inevitable-Rub8969 • 5h ago

News DeepSeek and Tsinghua University introduce new AI reasoning method ahead of anticipated R2 model release

bloomberg.com

15 Upvotes

4 comments

r/DeepSeek • u/andsi2asi • 1h ago

Discussion On the risks of any one company or any one nation dominating AI. On open source and global collaboration to mitigate those risks.

• Upvotes

All it takes to hurl our world into an economic depression that will bankrupt millions of us and stall progress in every sector for a decade is a reckless move from a powerful head of state. As I write this, the pre-market NASDAQ is down almost 6% from its Friday closing. It has lost about 20% of its value since Trump announced his reciprocal tariff policy.

Now imagine some megalomaniac political leader of a country that has unilaterally achieved AGI, ANDSI or ASI. Immediately he ramps up AI research to create the most powerful offensive weapons system our world has ever known, and unleashes an ill-conceived plan to rule the entire world.

Moving to the corporate risk, imagine one company reaching AGI, ANDSI, or ASI, months before its competitors catch up. Do you truly believe that this company would release an anonymous version on the Chatbot Arena? Do you truly believe that this company would even announce the model or launch it in preview mode? The company would most probably build a stock trading agent that would within weeks corner all of the world's financial markets. Within a month the company's market capitalization would soar from a few billion dollars to a few trillion dollars. Game over for every other company in the world in every conceivable market sector.

OpenAI initially committed to being a not-for-profit research company vowing to open source models and serve humanity. It is now in the process of transitioning to a for-profit company valued at $300 billion, with no plan to open source any of their top models. I mention OpenAI because at 500 million weekly users, it has far beyond all other AI developers gained the public trust. But what happened to its central mission to serve humanity? 13,000 children under the age of five die every single day of a poverty that our world could easily and if we wanted to do. When have you heard about OpenAI making a single investment in this area, while investing $500 billion in a data center. I mention OpenAI because if we cannot trust our most trusted AI developer to keep its word, what can we safely expect from other developers?

Now imagine Elon Musk reaching AGI, ANDSI or ASI first. Think back to his recent DOGE initiative where he advocated ending Social Security, Medicaid and Medicare just as a beginning. Think back to the tens of thousands of federal workers whom he has already fired, as he brags about it on stage, waving a power chainsaw in the air. Imagine his companies cornering the world financial markets, and increasing their value to over 10 trillion dollars.

The point here is that because there are many other people like Trump and Musk in the world, either one single country or one single corporation reaching AGI, ANDSI or ASI weeks or months before the others poses the kind of threat to human civilization that we probably want to spare ourselves the pain of understanding too clearly and the fear of facing too squarely.

There is a way to prudently neutralize these above threats, but only one such way. Just like the nations of the world committed to a nuclear deterrent policy that has kept us safe from nuclear war for the last 80 years, today's nations must forge a collaborative effort to, together, build and share the AGI, ANDSI and ASI that will rule tomorrow's world.

A very important part of this effort would be to ramp up the open source AI movement so that it dominates the space. The reason for this could not be more clear. As a country, company or not-for-profit organization moves toward achieving AGI, ANDSI or ASI, the open source nature of the project would mean that everyone would be aware of this progress. Perhaps just as importantly, there are unknown unknowns to this initiative. Open sourcing it would mean that millions of eyes would be constantly overseeing the project, rather than merely hundreds, or thousands, or even tens of thousands were the project overseeing by a single company or nation.

The risks now stand before us, and so do the strategies for mitigating these risks. Let's create a United Nations initiative whereby all nations would share progress toward ASI, and let's open source the work so that it can be properly monitored.

0 comments

r/DeepSeek • u/bi4key • 20h ago

Discussion QwQ-32b outperforms Llama-4 by a lot!

79 Upvotes

8 comments

r/DeepSeek • u/oilbeater • 1h ago

Discussion Chaos in Llama 4

oilbeater.com

• Upvotes

0 comments

r/DeepSeek • u/EstablishmentFun3205 • 9h ago

Funny AGI Cope

7 Upvotes

1 comment

r/DeepSeek • u/johanna_75 • 11h ago

Discussion V3 Coding

9 Upvotes

I tried very hard with V3 for coding work. Maybe my prompting wasn’t good enough but I found it was making numerous wrong assumptions basically guessing which required more debugging than it was worth. Another factor that may be relevant is using the DeepSeek public web site which has a default temperature of 1.0 or 1.3 I forgot. Reducing to 0.3 on openrouter helped reduce the guessing and verbosity but I still found it had very little context memory. It simply forgets things you have told it more than a few messages ago and goes back to guessing. I am disappointed because I wanted to support the concept of being free and open source.

15 comments

r/DeepSeek • u/Level_Bridge7683 • 54m ago

Discussion how much longer until deepseek can remember all conversations history?

• Upvotes

that would be a breakthrough.

https://www.youtube.com/watch?v=CEjU9KVABao

0 comments

r/DeepSeek • u/SeparateHighlight89 • 1h ago

Question&Help found this clone deepseek site https://www.deepseekimagegenerator.com/

• Upvotes

Anyone else mistakenly thought this was the actual website? I signed in using a gmail account, then I realized it doesnt look legit. i couldnt delete my account so from the google account settings, then security, then your connections to third-party apps, i removed my connection from that website. Just wondering if anyone else ran into this scammy ass website

0 comments

r/DeepSeek • u/LuigiEz2484 • 1d ago

Unverified News DeepSeek unveils new AI reasoning method amid anticipation for R2 model

scmp.com

149 Upvotes

9 comments

r/DeepSeek • u/Select_Dream634 • 1d ago

Discussion llama 4 is a disappointment cant even surpass the gpt 4o forget about the new v3 , they are not even in top 20 in the coding wtf yann lecun is taking which kind of drug this guy is taking i wanna take it too

50 Upvotes

11 comments

r/DeepSeek • u/Astral_ny • 4h ago

Other DeepSeek API interface

0 Upvotes

Seeking Beta Testers for DeepSeek API Interface!

I’m looking for volunteers to test a new web interface for DeepSeek’s API (models: v3 and r1). Help me refine the UX, performance, and features.

Key Features:
✅ Chat History Storage: Securely saved in localStorage.
✅ File Handling: Upload documents for processing.
✅ Rich Output: Markdown formatting, code blocks, and HTML execution (run generated code snippets directly!).
✅ High Capacity: 128k input tokens, 8k output.

Goal: identify bugs, and suggest improvements.
Interested? Reply here or DM me!

Note: Bring your own DeepSeek API key:

The app uses your API key exclusively for authentication with DeepSeek’s official API (https://api.deepseek.com). It is not sent anywhere else.

The key is not permanently stored

It is not saved in your browser’s storage (localStorage or cookies).
You’ll need to re-enter it if you refresh the page.

Your API key is only used to communicate with DeepSeek and is not shared with any other servers.

0 comments

r/DeepSeek • u/GrimmTotal • 13h ago

Question&Help What.. is this? What is happening? "This script is for the X chromosome"

5 Upvotes

I was using windsurf and decided to try to use DeepSeek R1 to make an edit to my codebase.. but it output this? Anyone know why? Nothing shows up when I search "This script is for the X chromosome"

For context all I asked it to do was update my own game scripting language.. and it did and after randomly spit this out at me.

2 comments

r/DeepSeek • u/SubstantialWord7757 • 5h ago

News 🔥 Use Voice Commands to Interact with AI Models! Check Out This Open-Source Telegram Bot

1 Upvotes

🔥 Use Voice Commands to Interact with AI Models! Check Out This Open-Source Telegram Bot

I recently came across an amazing open-source project: yincongcyincong/telegram-deepseek-bot. This bot allows you to interact with DeepSeek AI models directly on Telegram using voice commands!

In simple terms, you can press the voice button on Telegram, speak your question, and the bot will automatically transcribe it and send it to the DeepSeek model. The model will instantly provide you with a response, making the experience feel like chatting with a smart AI assistant.

✅ Key Features

Voice Interaction: Built-in speech recognition (supports models like Whisper), simply speak your query, and the bot will handle the rest.
Integrated DeepSeek Models: Whether it's coding assistance, content generation, or general knowledge questions, the bot can provide professional-level responses.
Lightweight Deployment: Built on FastAPI and Python’s asynchronous framework, with Docker support, it’s easy to deploy your own AI assistant.
Multi-User Support & Contextual Memory: The bot supports multiple user sessions and retains conversation history for better continuity.
Completely Open Source: You can host it yourself, giving you full control over your data—perfect for privacy-conscious users.

🎯 Use Cases

Ask the AI to generate code during your commute
Let the AI summarize articles or research papers
Dictate ideas to the AI and have it expand them into full articles
Use the bot as a multilingual translation assistant when traveling

🧰 How to Use?

Visit the GitHub project page: https://github.com/yincongcyincong/telegram-deepseek-bot
Follow the instructions in the documentation to deploy the bot or join the publicly available instance (if provided by the author).
Start interacting with the bot via voice on Telegram!

💬 Personal Experience

I've been using this bot to have AI assist me with coding, summarizing technical content, and even helping me write emails. The voice interaction is much smoother compared to typing, especially when on mobile.

Deployment was pretty straightforward as well—just followed the README instructions and got everything up and running in under an hour.

🌟 Final Thoughts

If you:

Want to create your own AI assistant on Telegram
Are excited to try voice-controlled AI models
Need a lightweight yet powerful tool for intelligent conversations

Then this open-source project is definitely worth checking out.

👉 GitHub project page: https://github.com/yincongcyincong/telegram-deepseek-bot

Feel free to join in, contribute, or discuss your experience with the project!

0 comments

r/DeepSeek • u/lc19- • 1d ago

Resources UPDATE: DeepSeek-R1 671B Works with LangChain’s MCP Adapters & LangGraph’s Bigtool!

21 Upvotes

I've just updated my GitHub repo with TWO new Jupyter Notebook tutorials showing DeepSeek-R1 671B working seamlessly with both LangChain's MCP Adapters library and LangGraph's Bigtool library! 🚀

📚 𝐋𝐚𝐧𝐠𝐂𝐡𝐚𝐢𝐧'𝐬 𝐌𝐂𝐏 𝐀𝐝𝐚𝐩𝐭𝐞𝐫𝐬 + 𝐃𝐞𝐞𝐩𝐒𝐞𝐞𝐤-𝐑𝟏 𝟔𝟕𝟏𝐁 This notebook tutorial demonstrates that even without having DeepSeek-R1 671B fine-tuned for tool calling or even without using my Tool-Ahead-of-Time package (since LangChain's MCP Adapters library works by first converting tools in MCP servers into LangChain tools), MCP still works with DeepSeek-R1 671B (with DeepSeek-R1 671B as the client)! This is likely because DeepSeek-R1 671B is a reasoning model and how the prompts are written in LangChain's MCP Adapters library.

🧰 𝐋𝐚𝐧𝐠𝐆𝐫𝐚𝐩𝐡'𝐬 𝐁𝐢𝐠𝐭𝐨𝐨𝐥 + 𝐃𝐞𝐞𝐩𝐒𝐞𝐞𝐤-𝐑𝟏 𝟔𝟕𝟏𝐁 LangGraph's Bigtool library is a recently released library by LangGraph which helps AI agents to do tool calling from a large number of tools.

This notebook tutorial demonstrates that even without having DeepSeek-R1 671B fine-tuned for tool calling or even without using my Tool-Ahead-of-Time package, LangGraph's Bigtool library still works with DeepSeek-R1 671B. Again, this is likely because DeepSeek-R1 671B is a reasoning model and how the prompts are written in LangGraph's Bigtool library.

🤔 Why is this important? Because it shows how versatile DeepSeek-R1 671B truly is!

Check out my latest tutorials and please give my GitHub repo a star if this was helpful ⭐

Python package: https://github.com/leockl/tool-ahead-of-time

JavaScript/TypeScript package: https://github.com/leockl/tool-ahead-of-time-ts (note: implementation support for using LangGraph's Bigtool library with DeepSeek-R1 671B was not included for the JavaScript/TypeScript package as there is currently no JavaScript/TypeScript support for the LangGraph's Bigtool library)

BONUS: From various socials, it appears the newly released Meta's Llama 4 models (Scout & Maverick) have disappointed a lot of people. Having said that, Scout & Maverick has tool calling support provided by the Llama team via LangChain's ChatOpenAI class.

0 comments

r/DeepSeek • u/default0cry • 16h ago

Discussion Discussion topic about our work about new LLMs: AI Exhibiting Emergent Human Behaviors: Global Risk Assessment of 2025 Reasoning Models LLM

4 Upvotes

Wanted to share our recent paper looking into emergent behaviors in 2025-era LLMs

https://zenodo.org/records/15164833 (v. 1.1: fix references)

Open to all criticism and questions.

This paper introduces new ways (Turing NAND & DFSW tests) to actually measure some concerning trends we've observed:

Traits like self-preservation, apparent "species" prioritization, theft, and cheating are influencing AI decisions, even without specific anthropomorphic prompting.
Efforts to force superficial "neutrality" seem to be generating novel, almost "alien" biases on top of the original training bias. We propose a filtering loop technique to quantify this.
We make the case that heavy-handed "Restrictive Frameworks," intended to create a purely mechanical AI, might be causing unpredictable rebound effects that could be more dangerous than the natural anthropomorphism they suppress.

Huge thanks to everyone here on Reddit whose contributions and discussions were invaluable for this work.
Let's continue shaping the future.

Ai Exhibiting Emergent Human Behaviors: Global Risk Assessment of 2025 Reasoning Models LLMs – CASE STUDIES: OPENAI O3-MINI, DEEPSEEK R1, GEMINI 2, GEMINI 2.5, GROK 3, QWEN 2.5 (Presenting: Turing NAND Test and DFSW Bias Test)

0 comments

r/DeepSeek • u/Fantastic_Ad_9988 • 21h ago

Discussion Bibbidi-Bobbidi-Boo! New social media app ConnectHub created by AI in 10 minutes

9 Upvotes

This landing page is completely generated locally by #apple silicon M3 Ultra in one prompt. Here is the details. https://x.com/dreamaker/status/1908938490689237234

2 comments

r/DeepSeek • u/Independent-Wind4462 • 1d ago

Funny Lol, would love to his reaction to r2

29 Upvotes

5 comments

r/DeepSeek • u/Select_Dream634 • 1d ago

Discussion just like the tiktok has one of the greatest recommendation algo deepseek has one of the greatest problem tracking and finding algo

8 Upvotes

thats the reason deepseek base model is so good in coding .

bcz in coding person need to find the problem and track the problem and solve the problem and dont forget the problem .

this is what other ai model lack .

if u give them whole life ur problem they will miss so many points and ur problems and they mostly doesnt talk about that problem but deepseek talk about it and give the solution bcz in the first hand its find the problem and give that priority .

this is what all other ai model are lacking even training that match they are just 3 or 4 percent ahead of deepseek .

bcz of deepseek algo is so strong i think r2 is going to rock again bcz u can train the ai model as much u want but understanding the problem and giving them a priority and giving a solution like i dont have words but deepseek has some very strong sense of understanding the problem .

im looking forward to publish a research paper on this about there this algo bcz this is very important i think in the discovery

1 comment

r/DeepSeek • u/Ausbel12 • 20h ago

Discussion What’s the most reliable AI model for real-world debugging?

5 Upvotes

I’ve hit a few frustrating bugs in the past week and decided to test how well AI models can debug actual messy production-level code. Some gave generic advice, while others surprisingly narrowed in on the issue with scary accuracy.

What has worked best for you when it comes to AI-assisted debugging?

2 comments

r/DeepSeek • u/bernard_rr • 1d ago

Discussion DeepSeek: China's AI Dark Horse Gallops Ahead

5 Upvotes

I'm building the best AI powered podcasts. Here's my episode on DeepSeek😌🚀 https://open.spotify.com/episode/0s0UBZV8IMFFc6HfHqVQ7t?si=_Zb94GF2SZejyJHCQSo57g

0 comments

r/DeepSeek • u/Independent-Foot-805 • 1d ago

Discussion Guys, is Deepseek V3 0324 the best non-reasoning model and Gemini 2.5 Pro the best reasoning model right now?

65 Upvotes

28 comments

r/DeepSeek • u/BidHot8598 • 1d ago

News Here comes robot with speed ¡

194 Upvotes

20 comments

r/DeepSeek • u/Vivalacorona • 13h ago

Funny How much time it would take to build the Great Wall with Superman abilities I just asked R1

gallery

0 Upvotes

2 comments