DeepSeek API: Every Request Is A Timeout :(

Yeah, a DDoS probably has little to do with it. Since the Chinese can't be blamed for anything, especially if the CCP has a role in it, something else gets blamed instead, such as a DDoS.

Reply

[-]

Arcosim@reddit

Massive usage most likely. Eventually they'll adapt. I remember a year ago or so when everyone was panicking because OpenAI was cancelling Plus subscriptions due to the high demand.

Reply

[-]

ThenExtension9196@reddit

You need GPU to scale. That’s hard to get over there.

Reply

[-]

FloJak2004@reddit

Just saw a post on X today, showing how Nvidia's sales to Singapore grew to almost a quarter of their revenue over the last year. Seems like China still gets plenty.

Reply

[-]

ChashuKen@reddit

Singapore is not part of China, nor do we even like China lol

Reply

[-]

Where did I suggest that Singapore is a part of China? Singapore is the largest freight port outside of China but has only about 1% of the world's datacenters. How is 22% of Nvidia's revenue coming out of Singapore? Cards are going to China for sure.

Reply

[-]

ThenExtension9196@reddit

That is true, but not as much as they would have bought without the restrictions.

Reply

[-]

Thomas-Lore@reddit

And my country (Poland) was blocked from receiving Nvidia's top GPUs, you can guess why. Hint: we do a lot of trade with China.

Reply

[-]

lordpuddingcup@reddit

They can’t adapt. They don’t have GPUs, and the ones they do have are old. They basically have to wait for demand to drop off.

Reply

[-]

LetsGoBrandon4256@reddit

DDoS and hugged to death by the hype.

Reply

[-]

boringcynicism@reddit

That'd be weird with the chat interface still up?

Reply

[-]

quantum-aey-ai@reddit

Chat is timing out consistently. Too much traffic...

Reply

[-]

sammoga123@reddit

Nope. The infrastructure they have was not prepared for so many users overnight. V3 works, but R1 doesn't, because everyone wants to use it.

Reply

[-]

Zeikos@reddit

And on top of that R1 is more token intensive per-query. So that makes congestion inevitable. I hope this will push DeepSeek to look into making those CoTs more token-efficient. There's a lot to gain there performance/quality wise imo.
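A rough back-of-the-envelope sketch of why longer CoTs hurt capacity, assuming generation throughput is the bottleneck. Every number here is hypothetical and only there to show the ratio; none of it is DeepSeek's real traffic or hardware.

```python
def sustainable_queries_per_second(cluster_tokens_per_second: float,
                                   tokens_per_query: float) -> float:
    """Queries per second a cluster can sustain if output tokens are the bottleneck."""
    if tokens_per_query <= 0:
        raise ValueError("tokens_per_query must be positive")
    return cluster_tokens_per_second / tokens_per_query


# Hypothetical numbers, for illustration only.
CLUSTER_TPS = 1_000_000      # assumed aggregate output tokens per second
PLAIN_CHAT_TOKENS = 500      # assumed tokens for a direct answer
REASONING_TOKENS = 4_000     # assumed tokens once a long CoT is included

plain_qps = sustainable_queries_per_second(CLUSTER_TPS, PLAIN_CHAT_TOKENS)
reasoning_qps = sustainable_queries_per_second(CLUSTER_TPS, REASONING_TOKENS)
print(f"plain: {plain_qps:.0f} q/s, reasoning: {reasoning_qps:.0f} q/s")
```

With the same hardware, an 8x longer output means roughly 8x fewer queries served, so trimming the CoT is a direct capacity win under this simplified model.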

Reply

[-]

218-69@reddit

Neither works for me. Both R1 and the normal model have been giving the same "server is busy" message for the last 24 hours.

Reply

[-]

cantgetthistowork@reddit

So annoyed that I only managed to write half a project with R1

Reply

[-]

ab2377@reddit

Probably. Remember the peak hype times of ChatGPT? Even then I knew people at the office who hadn't heard of ChatGPT, but in the last 2 days everyone at home and at the office has been asking me about "deepseek", including people who don't read tech news at all.

Reply

[-]

polawiaczperel@reddit

Same here, the news spread at light speed. Even my non-technical mom was talking about it.

Reply

[-]

lordpuddingcup@reddit

I doubt it’s actually a DDoS. They just weren’t ready for that level of traffic the way Anthropic and OpenAI were. People thought that because they could train on H800s they could also run infinite inference for the entire world lol.

Reply

[-]

PermanentLiminality@reddit

OpenRouter has non-DeepSeek API endpoints for the R1 671B model. They cost more, but work great. I've been using it this way today.

Reply

[-]

boringcynicism@reddit

My experience is the opposite: you hit context limits before the advertised window, and you often get 0 sized responses even though they charge you for them. Largely made me consider OpenRouter to be a scam.
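One way to at least catch the zero-sized responses on the client side is a small guard like the sketch below. It assumes the provider returns an OpenAI-style chat completion dict (`choices[0].message.content` plus a `usage` block); the function names are made up for illustration, and a given provider may shape its response differently.

```python
def extract_text(response: dict) -> str:
    """Return the assistant text from an OpenAI-style chat completion dict.

    Raises ValueError if there is no usable content.
    """
    choices = response.get("choices") or []
    if not choices:
        raise ValueError("response contains no choices")
    message = choices[0].get("message") or {}
    text = (message.get("content") or "").strip()
    if not text:
        raise ValueError("response content is empty")
    return text


def billed_but_empty(response: dict) -> bool:
    """True if the response reports completion tokens but carries no text."""
    usage = response.get("usage") or {}
    try:
        extract_text(response)
        return False
    except ValueError:
        return usage.get("completion_tokens", 0) > 0
```

Logging every response where `billed_but_empty` is true gives you something concrete to send to the provider. Note that with reasoning models the reported completion tokens may include reasoning tokens, so an empty answer with a nonzero count is not necessarily a billing error.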

Reply

[-]

SoftwareComposer@reddit

I can vouch for this — other providers don't seem to be providing the full context window. Never had this issue with the original.

Reply

[-]

hannorx@reddit

What’s a provider you can recommend? Preferably one with API.

Reply

[-]

TheRealGentlefox@reddit

I don't think it's context dependent. I've had it happen at <1000, and OR is investigating it.

Reply

[-]

boringcynicism@reddit

I mean I've had ~40k-token requests rejected for too large a context by providers that supposedly offer 64k, while they work with the real DeepSeek.

Reply

[-]

HMikeeU@reddit

I've had a very bad experience with OpenRouter on DeepSeek models in recent days. When I specified that I only want DeepSeek as a provider, API requests took ages or failed entirely, but when using the DeepSeek API directly it worked like a charm.
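For requests that time out or fail outright, a retry wrapper with exponential backoff and jitter is the usual client-side mitigation. A minimal sketch follows: `send` is a placeholder for whatever function actually performs your request, and `retry_on` defaults to the built-in exceptions, so you would pass your HTTP library's own timeout and connection exceptions there.

```python
import random
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def call_with_backoff(
    send: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    retry_on: Tuple[Type[BaseException], ...] = (TimeoutError, ConnectionError),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call send(), retrying on the given exceptions with capped, jittered backoff."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return send()
        except retry_on:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the last error
            cap = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, cap))  # "full jitter": random wait up to the cap
    raise RuntimeError("unreachable")
```

The jitter matters when many clients are retrying at once: without it, everyone hammers the overloaded server again at the same moment.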

Reply

[-]

boringcynicism@reddit

Yeah, same. And if you allow the fallbacks, you get broken responses - but are charged 10x the price for it.

Reply

[-]

BrightDyfiant@reddit

It isn't artificial, but what is artificial has not won the battle. DeepSeek lives, and actual intelligence triumphs over artificial intelligence. DeepSeek isn't dead. But artificial intelligence needs to go to another planet... AI=Actual Intelligence.

Reply

[-]

ChenSharonChen@reddit

I am here because the API is still down. Too many conspiracy theorists claim the CIA is attacking DeepSeek. It's just FUD, but it's annoying.

Reply

[-]

JustinPooDough@reddit

I am willing to bet the US is DDoSing DeepSeek. Fucking pathetic man. Sam keeps spouting his ridiculous bullshit on Twitter about AGI and whatnot, while meanwhile attacking the competition. So much for a free market. What a load of shit.

Reply

[-]

sunr117@reddit

Sam is pathetic, DDoSing an open-source project is more pathetic.

Reply

[-]

drgitgud@reddit

Just run it locally mate, the model is minuscule and blazing fast. Tried it this morning, it can even count the r's in strawberry!

Reply

[-]

SoftwareComposer@reddit

A distill is not the same model.... local models aren't performant enough for my use case: agentic coding on large code bases (via aider)

Reply

[-]

drgitgud@reddit

oh boy, time to be schooled! What's a distill? No /s, no joke, I'm curious

Reply

[-]

SoftwareComposer@reddit

essentially teaching a smaller model (student) to behave like its larger variant (teacher). But the smaller model has a lower # of params, so it can't reach the performance of its teacher — at least not with current methods.
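To make the teacher/student idea concrete, here is a tiny sketch of the classic soft-label distillation loss: the student is penalized by the KL divergence between the teacher's and its own temperature-softened output distributions. This is the textbook formulation in plain Python, not necessarily how the R1 distills were produced (those are described elsewhere in this thread as Llama and Qwen fine-tunes).

```python
import math
from typing import List, Sequence


def log_softmax(logits: Sequence[float], temperature: float) -> List[float]:
    """Numerically stable log-softmax of logits / temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_total for x in scaled]


def distillation_loss(teacher_logits: Sequence[float],
                      student_logits: Sequence[float],
                      temperature: float = 2.0) -> float:
    """KL(teacher || student) over temperature-softened distributions."""
    log_p = log_softmax(teacher_logits, temperature)  # teacher
    log_q = log_softmax(student_logits, temperature)  # student
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
```

The loss is zero when the student reproduces the teacher's distribution and grows as they diverge. In real training this runs over every token position and is minimized by gradient descent on the student's weights.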

Reply

[-]

drgitgud@reddit

That explains the small size! Thank you mate, much appreciated!

Reply

[-]

TrifleAccomplished77@reddit

nah it's still working lol

Reply

[-]

Bamstian@reddit

You do not know what an API is, right?

Reply

[-]

TrifleAccomplished77@reddit

fuck. my dumbass didn't see "api"

Reply

[-]

awilhelm-pb@reddit

It is working in Germany.

Reply

[-]

Bamstian@reddit

No, the API is not working.

Reply

[-]

Minute_Attempt3063@reddit

It works fine for me. Yes they are under a lot of strain https://preview.redd.it/wz8fdh5mw3ge1.jpeg?width=1220&format=pjpg&auto=webp&s=4f80c69ca144a65da598dc600a7fc86fb4e1fff1

Reply

[-]

Bamstian@reddit

You are not using their API. That's why "it works fine" for you.

Reply

[-]

h666777@reddit

Hardly knew it and I was already in love. This world is cruel.

Reply

[-]

duckieWig@reddit

It is served on Fireworks, DeepInfra, Together, Hugging Face, through OpenRouter, and more.

Reply

[-]

h666777@reddit

At 4x the price and with garbage throughput. Seems that everyone in America is having deep skill issues right now.

Reply

[-]

Valuable-Run2129@reddit

The model is a big boi. The real inference cost aligns with those providers’ prices. DeepSeek was subsidizing for marketing purposes.

Reply

[-]

Fuzzy_Independent241@reddit

Groq cloud? Haven't tried it, I'm working on another project today. But could be a way out of DS servers. Other than that, as others said, people will test and do reports and publish 'stuff' and then things will get normalized.

Reply

[-]

h666777@reddit

Groq doesn't dare serve a model 1 bit bigger than 70B, they are only serving the distills.

Reply

[-]

nootropicMan@reddit

Groq is only hosting the 70B distilled version.

Reply

[-]

OGchickenwarrior@reddit

As all deepseek requests are timing out, ChatGPT has lifted all typical O1 limits for my basic pro plan and it is lightning fast right now. I guess this is what competition gets us.

Reply

[-]

Medium_Chemist_4032@reddit

Really? That limit was so annoying I stopped using it altogether

Reply

[-]

Sir-ScreamsALot@reddit

They really lifted limits?

Reply

[-]

OGchickenwarrior@reddit

I actually cannot confirm. I feel bad that this comment got so much attention. I was abusing O1 yesterday and definitely sent more than 50 requests, but maybe my account is just broken? Idk man

Reply

[-]

noobrunecraftpker@reddit

Why isn't this announced anywhere though?

Reply

[-]

Turkino@reddit

Ah, the classic "oh no, we have competition! Quick, make our product a bit less arbitrarily shitty to use." Which would be fine, if they were not crying to the feds to try to arbitrarily protect themselves at the same time.

Reply

[-]

The_GSingh@reddit

Basic pro? Do you mean the $20 Plus plan? Might be worth resubscribing.

Reply

[-]

OGchickenwarrior@reddit

Yes $20

Reply

[-]

Suheil-got-your-back@reddit

That's the Plus account, for clarity. Pro is $200.

Reply

[-]

Johnroberts95000@reddit

Looking forward to o3 & it's nice to have my OpenAI GPUs unclogged

Reply

[-]

Trojblue@reddit

What is the limit now? Up from 50/week previously?

Reply

[-]

Thomas-Lore@reddit

And Copilot now has thinking which seems to use o1. And Claude is back to offering Sonnet to free users.

Reply

[-]

According-Channel540@reddit

Is this the 50 o1 messages per week you are talking about?

Reply

[-]

OGchickenwarrior@reddit

Yes but I could swear I’ve gone over 50 today

Reply

[-]

notcooltbh@reddit

They're not timing out lol, they're collecting training data and not giving you the outputs. It's literally a Minecraft dropper farm type of setup.

Reply

[-]

JoshS-345@reddit

It's open sourced, there will be unlimited companies making it available. And if you want to run a smaller version and you have powerful enough hardware, you can run it yourself.

Reply

[-]

20ol@reddit

Those are not smaller versions. They are Llama and Qwen finetuned by R1. The only Deepseek model is the 671b

Reply

[-]

JoshS-345@reddit

Cool!

Reply

[-]

HMikeeU@reddit

Yes but the other companies are more expensive

Reply

[-]

Neomadra2@reddit

Well this was so obvious. Training a model is one thing. Serving 300 million users is another.

Reply

[-]

dhbloo@reddit

I am starting to wonder how DeepSeek can keep all of this running in the long term if they always provide free services.

Reply

[-]

redditscraperbot2@reddit

I'm sure the five bucks I tossed them for their API in December will cover the cost of their service upgrades.

Reply

[-]

shakespear94@reddit

I put in $5 too. $10. Keep it going.

Reply

[-]

Beneficial_Toe3744@reddit

They're not trying to make a profit. They're trying to find ways to destabilize your country. I wonder how many people already accepted the TOS before even trying to log in lol

Reply

[-]

Flaky-Diet5318@reddit

country's destabilizing itself without the help of deepseek

Reply

[-]

Beneficial_Toe3744@reddit

Yes. I wonder how that came about. Certainly not years of propaganda slowly spoon fed to Americans by the literal thousands-year-old Chinese government propaganda machine operating through every single piece of American consumer electronics. Certainly. It's not like China (and Russia) have openly admitted to these things. That would be absolutely crazy, right?

Reply

[-]

lordpuddingcup@reddit

They don’t, because they can’t get new infrastructure due to the embargo, and the old H800 cluster is only going to handle so many users, free or paid.

Reply

[-]

ThenExtension9196@reddit

Spoiler alert: they don’t

Reply

[-]

nootropicMan@reddit

API access is paid.

Reply

[-]

IxinDow@reddit

I bet they've made enough for 20 V3 on recent market volatility

Reply

[-]

AdTotal4035@reddit

I was using deepseek happily since it was released. I am so pissed the media figured out about it. Now everyone is just bombarding it. It's literally over.

Reply

[-]

redditscraperbot2@reddit

Same. I was using V3 for a while because it was cheap and fast enough to excuse its shortcomings. Now I've got nothing. It's still just as good when it comes back. Whatever they're doing to bring the service back comes with a price tag, surely.

Reply

[-]

Loops_Boops@reddit

You can access the full DeepSeek R1 model through a bunch of serverless API providers like Nebius, Novita, DeepInfra, Fireworks, Together... it's more expensive though.

Reply

[-]

Reasonable_Flower_72@reddit

https://preview.redd.it/g97traw4hzfe1.png?width=880&format=png&auto=webp&s=94125a4ba1668649fc0cda3b9d0d22e918310165 Right now: IPv6 tunnel from CZ to Hurricane Electric (USA), then China.

Reply

[-]

el_ramon@reddit

Remember it happened exactly the same with ChatGPT and their solution was to start charging a subscription fee to prioritize those who paid.

Reply

[-]

bArA83@reddit

https://preview.redd.it/xxmfdvpt0yfe1.jpeg?width=1080&format=pjpg&auto=webp&s=b115715ffca5d3ca4f138d4e1b25fa224058bfd6

Reply

[-]

HugeOrdinary7212@reddit

Give it time. Remember when ChatGPT was new? It used to break every now and then.

Reply

[-]

Fun_Yam_6721@reddit

yeah the hype sucks

Reply

[-]

Johnroberts95000@reddit

On a serious note: are they bypassing CUDA for inference, or should other providers be able to get their TPS up to what DeepSeek's was? Before this blew up, DeepSeek was way faster than what OpenRouter is now.

Reply

[-]

Divniy@reddit

r/LocalLLaMA

Reply

[-]

Competitive-Ad754@reddit

This is not good Mav

Reply

[-]

purplewater0o0@reddit

:(((

Reply

[-]

mrbluesneeze@reddit

Says some stupid idiot every time a new popular service or game launches.

Reply

[-]

Proper-Yellow8395@reddit

The iOS app seems to work for me.

Reply

[-]

evrenozkan@reddit

Website works fine for me. But I don't get why they can't rate-limit API access, and it should be much cheaper to reject non-authenticated requests.
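For reference, per-key rate limiting is often done with something like a token bucket. A minimal single-process sketch is below; the class and parameter names are illustrative, and nothing here reflects how DeepSeek's gateway actually works.

```python
import time
from typing import Callable


class TokenBucket:
    """Allow bursts up to `capacity`, refilling at `rate` tokens per second."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate
        self.capacity = capacity
        self.clock = clock
        self.tokens = capacity      # start with a full bucket
        self.updated = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Spend `cost` tokens if available; return False to reject the request."""
        now = self.clock()
        elapsed = now - self.updated
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

In practice you would keep one bucket per API key (for example in a dict keyed by the key) and reject requests without a valid key before doing any bucket lookup. The catch is that limiting only sheds load: if legitimate demand exceeds GPU capacity, authenticated users still get "server is busy".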

Reply

[-]

Puzzled-Pass-1318@reddit

Maybe it's because China is celebrating the Spring Festival and everyone is on holiday :)

Reply

[-]

phenotype001@reddit

I can't even access the platform page, 503 error: [platform.deepseek.com](http://platform.deepseek.com)

Reply

[-]

Previous-Piglet4353@reddit

I thought this was a local LLM subreddit. My local Deepseek-r1 70b is doing just fine. OpenAI did, however, dumb down the o1 API. That got nerfed hard.

Reply

[-]

diligentgrasshopper@reddit

A couple weeks ago I legit could get 40 requests per second from v3 :(( and here I was trying to churn as much distillation as possible before the API discount ends

Reply

[-]

Dark_Fire_12@reddit

Give it two weeks and come back later when everyone moves on to something else. Google is cooking so maybe that can take the heat off them.

Reply

[-]

CountPacula@reddit

This is no different than when GPT4 came out. Outages happen when you get popular.


106 Comments