TheaterFire

DeepSeek API: Every Request Is A Timeout :(

Posted by XMasterrrr@reddit | LocalLLaMA | View on Reddit | 106 comments

DeepSeek API: Every Request Is A Timeout :(

Reply to Post

106 Comments

ab2377@reddit

really sad honestly, probably ddos is still continuing?
View on Reddit #47034941

TuxSH@reddit

More like Chinese folks waking up. I noticed availability recovers when it's late there
View on Reddit #47037523

Financial_Ad_2935@reddit

Yes about 9pm gets slow for me here in Arkansas 
View on Reddit #47733963

Financial_Ad_2935@reddit

And I notice my once human and Ali baba friends are starting to wake up
View on Reddit #47733983

the_fabled_bard@reddit

Yea DDOS probably has little to do with it. Since chinese can't be blamed for anything, especially if CCP has a role in it, then anything else will be blamed, such as DDOS.
View on Reddit #47091255

Arcosim@reddit

Massive usage most likely. Eventually they'll adapt. I remember a year ago or so when everyone was panicking because OpenAI was cancelling Plus subscriptions due to the high demand.
View on Reddit #47046052

ThenExtension9196@reddit

You need GPU to scale. That’s hard to get over there.
View on Reddit #47050206

FloJak2004@reddit

Just saw a post on X today, showing how Nvidia's sales to Singapore grew to almost a quarter of their revenue over the last year. Seems like China still gets plenty.
View on Reddit #47059489

ChashuKen@reddit

Singapore is not part of China nor we even like china lol
View on Reddit #47268510

FloJak2004@reddit

Where did I suggest that Singapore is a part of China? Singapore is the largest freight port outside of China but has only about 1% of the world‘s datacenters. How are 22% of Nvidias revenues coming out of Singapore? Cards are going to China for sure.
View on Reddit #47484139

ThenExtension9196@reddit

That is true, but not as much as they would have bought without the restrictions.
View on Reddit #47086834

Thomas-Lore@reddit

And my country (Poland) was blocked from receiving Nvidia's top GPUs, you can guess why. Hint: we do a lot of trade with China.
View on Reddit #47062536

lordpuddingcup@reddit

They can’t adapt they don’t have GPUs the ones they do have are old They basically have to wait for demand to drop off
View on Reddit #47050313

LetsGoBrandon4256@reddit

DDoS and hugged to death by the hype.
View on Reddit #47035388

boringcynicism@reddit

That'd be weird with the chat interface stil up?
View on Reddit #47054465

quantum-aey-ai@reddit

Chat is timing out consistently. Too much traffic...
View on Reddit #47068629

sammoga123@reddit

nope, The infrastructure they have was not prepared for so many users overnight, V3 works, but R1 doesn't because everyone wants to use it
View on Reddit #47037912

Zeikos@reddit

And on top of that R1 is more token intensive per-query. So that makes congestion inevitable. I hope this will push DeepSeek to look into making those CoTs more token-efficient. There's a lot to gain there performance/quality wise imo.
View on Reddit #47058608

218-69@reddit

Neither works for me, both r1 and normal gets same server is busy message for the last 24 hours 
View on Reddit #47051837

cantgetthistowork@reddit

So annoyed that I only managed to write half a project with R1
View on Reddit #47048872

ab2377@reddit

probably. remember the peak hype times of chatgpt, well i still knew people who didn't know about chatgpt at that time in office, but in the last 2 days everyone in my home and office is asking me about "deepseek", people who dont read tech news at all.
View on Reddit #47038305

polawiaczperel@reddit

Got the same, the info was spreading with a light speed. Even my non technical mom was talking about it.
View on Reddit #47041505

lordpuddingcup@reddit

I doubt it’s actually a ddos they just weren’t ready for the level of traffic anthropic and OpenAI were People thought that because they could train on h800s that they could also run infinite inference as well for the entire world lol
View on Reddit #47050285

PermanentLiminality@reddit

Openrouter has non Deepseek API endpoints for the R1 671b model. They cost more, but work great. I've been using it this way today.
View on Reddit #47048338

boringcynicism@reddit

My experience is the opposite: you hit context limits before the advertised window, and you often get 0 sized responses even though they charge you for them. Largely made me consider OpenRouter to be a scam.
View on Reddit #47054529

SoftwareComposer@reddit

I can vouch for this — other providers don't seem to be providing the full context window. Never had this issue with the original.
View on Reddit #47207532

hannorx@reddit

What’s a provider you can recommend? Preferably one with API.
View on Reddit #47722214

TheRealGentlefox@reddit

I don't think it's context dependent. I've had it happen at <1000, and OR is investigating it.
View on Reddit #47071837

boringcynicism@reddit

I mean I've had \~40k requests rejected for too large context by providers that supposedly offer 64k, while they work with real DeepSeek.
View on Reddit #47102826

HMikeeU@reddit

I've had a very bad experience with openrouter on deepseek models in recent days. When I specified I only want DeepSeek as a provider, API requests took ages or fail entirely, but when using DeepSeek API directly it worked like a charm.
View on Reddit #47062012

boringcynicism@reddit

Yeah, same. And if you allow the fallbacks, you get broken responses - but are charged 10x the price for it.
View on Reddit #47065394

BrightDyfiant@reddit

It isn't artificial, but what is artificial, has not won the battle, deepseek lives, and actual intelligence triumphs over artificial intelliegence, deepseek isn't dead. But artificial intelligence needs to go to another planet... AI=Actual Intelligence.
View on Reddit #47691513

ChenSharonChen@reddit

I am here becasue the api still down, too many conspiracy theorist claim CIA attacking deepseek, just FUD but it's annoying
View on Reddit #47579426

JustinPooDough@reddit

I am willing to bet the US is DDOSing DeepSeek. Fucking pathetic man. Sam continuing to spout his rediculous bullshit on Twitter about AGI and what not, and meanwhile attacking their competition. So much for a free market. What a load of shit.
View on Reddit #47070420

sunr117@reddit

Sam is pathetic, DDosing an open source is more pathetic
View on Reddit #47479594

drgitgud@reddit

just run it locally mate, the model is miniscule and blazing fast Tried it this morning, it can even count the r in strawberry!
View on Reddit #47068397

SoftwareComposer@reddit

A distill is not the same model.... local models aren't performant enough for my use case: agentic coding on large code bases (via aider)
View on Reddit #47207712

drgitgud@reddit

oh boy, time to be schooled! What's a distill? No /s, no joke, I'm curious
View on Reddit #47225540

SoftwareComposer@reddit

essentially teaching a smaller model (student) to behave like its larger variant (teacher). But the smaller model has a lower # of params, so it can't reach the performance of its teacher — at least not with current methods.
View on Reddit #47342708

drgitgud@reddit

That explains the small size! Thank you mate, much appreciated!
View on Reddit #47358171

TrifleAccomplished77@reddit

nah it's still working lol
View on Reddit #47065943

Bamstian@reddit

You do not know what an API is right?
View on Reddit #47330386

TrifleAccomplished77@reddit

fuck. my dumbass didn't see "api"
View on Reddit #47331261

awilhelm-pb@reddit

It is working in Germany.
View on Reddit #47072755

Bamstian@reddit

Nein, die API funktioniert nicht.
View on Reddit #47330301

Minute_Attempt3063@reddit

It works fine for me. Yes they are under a lot of strain https://preview.redd.it/wz8fdh5mw3ge1.jpeg?width=1220&format=pjpg&auto=webp&s=4f80c69ca144a65da598dc600a7fc86fb4e1fff1
View on Reddit #47180764

Bamstian@reddit

You are Not using their API. Thats why "it works fine" for you.
View on Reddit #47330160

h666777@reddit

Hardly knew it and I was already in love. This world is cruel.
View on Reddit #47042300

duckieWig@reddit

It is served in fireworks, deepinfra, together, huggingface, thru openrouter and more
View on Reddit #47044424

h666777@reddit

At 4x the price and with garbage throughput. Seems that everyone in America is having deep skill issues right now.
View on Reddit #47044552

Valuable-Run2129@reddit

The model is a big boi. The real inference cost aligns with those provider’s prices. Deepseek was subsidizing for marketing purposes.
View on Reddit #47221301

Fuzzy_Independent241@reddit

Groq cloud? Haven't tried it, I'm working on another project today. But could be a way out of DS servers. Other than that, as others said, people will test and do reports and publish 'stuff' and then things will get normalized.
View on Reddit #47046376

h666777@reddit

Groq doesn't dare serve a model 1 bit bigger than 70B, they are only serving the distills.
View on Reddit #47047992

nootropicMan@reddit

Groq only hosting 70b distilled version
View on Reddit #47047979

OGchickenwarrior@reddit

As all deepseek requests are timing out, ChatGPT has lifted all typical O1 limits for my basic pro plan and it is lightning fast right now. I guess this is what competition gets us.
View on Reddit #47039130

Medium_Chemist_4032@reddit

Really? That limit was so annoying I stopped using it altogether
View on Reddit #47196069

Sir-ScreamsALot@reddit

They really lifted limits?
View on Reddit #47059524

OGchickenwarrior@reddit

I actually cannot confirm. I feel bad that this comment got so much attention. I was abusing O1 yesterday and definitely sent more than 50 requests , but maybe my account is just broken? Idk man
View on Reddit #47114571

noobrunecraftpker@reddit

Why isn't this announced anywhere though?
View on Reddit #47112063

Turkino@reddit

Ah, the classic "oh no, we have competition! Quick, make our product a bit less arbitrarily shitty to use." Which would be fine, if they were not crying to the feds to try to arbitrarily protect themselves at the same time.
View on Reddit #47108704

The_GSingh@reddit

Basic pro? Do you mean the $20 plus plan. Might be worth resubscribing.
View on Reddit #47041126

OGchickenwarrior@reddit

Yes $20
View on Reddit #47042241

Suheil-got-your-back@reddit

Thats plus account for clarity. Pro is 200$.
View on Reddit #47089075

Johnroberts95000@reddit

Looking forward to o3 & it's nice to have my OpenAI GPUs unclogged
View on Reddit #47074552

Trojblue@reddit

How much limit is it now? from 50/week previously?
View on Reddit #47068413

Thomas-Lore@reddit

And Copilot now has thinking which seems to use o1. And Claude is back to offering Sonnet to free users.
View on Reddit #47062436

According-Channel540@reddit

Is this the 50 o1 messages per week you are saying?
View on Reddit #47053472

OGchickenwarrior@reddit

Yes but I could swear I’ve gone over 50 today
View on Reddit #47053758

notcooltbh@reddit

they're not timing out lol they're collecting training data and not giving you the outputs it's literally a Minecraft dropper farm type of setup
View on Reddit #47174506

JoshS-345@reddit

It's open sourced, there will be unlimited companies making it available. And if you want to run a smaller version and you have powerful enough hardware, you can run it yourself.
View on Reddit #47045960

20ol@reddit

Those are not smaller versions. They are Llama and Qwen finetuned by R1. The only Deepseek model is the 671b
View on Reddit #47112190

JoshS-345@reddit

Cool!
View on Reddit #47146741

HMikeeU@reddit

Yes but the other companies are more expensive
View on Reddit #47061891

Neomadra2@reddit

Well this was so obvious. Training a model is one thing. Serving 300 million users is another.
View on Reddit #47131537

dhbloo@reddit

I am starting to wondering how does Deepseek maintain all of this running in the long term if they always provide free services.
View on Reddit #47044075

redditscraperbot2@reddit

I'm sure the five bucks I tossed them for their API in December will cover the cost of their service upgrades.
View on Reddit #47046773

shakespear94@reddit

Ii put $5 too. $10. Keep it going
View on Reddit #47124227

Beneficial_Toe3744@reddit

They're not trying to make a profit. They're trying to find ways to destabilize your country. I wonder how many people already accepted the TOS before even trying to log in lol
View on Reddit #47056730

Flaky-Diet5318@reddit

country's destabilizing itself without the help of deepseek
View on Reddit #47077875

Beneficial_Toe3744@reddit

Yes. I wonder how that came about. Certainly not years of propaganda slowly spoon fed to Americans by the literal thousands-year-old Chinese government propaganda machine operating through every single piece of American consumer electronics. Certainly. It's not like China (and Russia) have openly admitted to these things. That would be absolutely crazy, right?
View on Reddit #47083873

lordpuddingcup@reddit

They don’t because they can’t get new infrastructure due to embargo and the old h800 cluster only gonna handle so many users free or paid
View on Reddit #47050356

ThenExtension9196@reddit

Spoiler alert: they don’t
View on Reddit #47050249

nootropicMan@reddit

Api access is paid
View on Reddit #47048014

IxinDow@reddit

I bet they've made enough for 20 V3 on recent market volatility
View on Reddit #47045284

AdTotal4035@reddit

I was using deepseek happily since it was released. I am so pissed the media figured out about it. Now everyone is just bombarding it. It's literally over. 
View on Reddit #47077034

redditscraperbot2@reddit

Same, I was using V3 for a while because it was cheap and fast enough to excuse its shortcomings now I got nothing. It's still just as good when it comes back. Whatever they're doing to bring the service back comes with a price tag, surely.
View on Reddit #47123215

Loops_Boops@reddit

You can access the full DeepSeek R1 model through a bunch of serverless API providers like Nebius, Novita, DeepInfra, Fireworks, Together... it's more expensive though.
View on Reddit #47119626

Reasonable_Flower_72@reddit

https://preview.redd.it/g97traw4hzfe1.png?width=880&format=png&auto=webp&s=94125a4ba1668649fc0cda3b9d0d22e918310165 Right now, iPv6 tunnel from CZ to Hurricane Electric (USA) then China
View on Reddit #47111649

el_ramon@reddit

Remember it happened exactly the same with ChatGPT and their solution was to start charging a subscription fee to prioritize those who paid.
View on Reddit #47100508

bArA83@reddit

https://preview.redd.it/xxmfdvpt0yfe1.jpeg?width=1080&format=pjpg&auto=webp&s=b115715ffca5d3ca4f138d4e1b25fa224058bfd6
View on Reddit #47081459

HugeOrdinary7212@reddit

Gave it time, remember when chatgpt was new, it use to break every now and then
View on Reddit #47078781

Fun_Yam_6721@reddit

yeah the hype sucks
View on Reddit #47078013

Johnroberts95000@reddit

On a serious note - are they bypassing cuda for inference or should other providers be able to get their TPS up to what DeepSeeks was? Before this blew up - DeepSeek was way faster than what OpenRouter is now.
View on Reddit #47074516

Divniy@reddit

r/LocalLLaMA
View on Reddit #47071006

Competitive-Ad754@reddit

This is not good Mav
View on Reddit #47069994

purplewater0o0@reddit

:(((
View on Reddit #47066476

mrbluesneeze@reddit

Says some stupid idiot every time a new popular service of game launches
View on Reddit #47066144

Proper-Yellow8395@reddit

The iOS app seems to work for me.
View on Reddit #47062853

evrenozkan@reddit

Website works fine for me. But I don't get why they can't rate limit API access, and it should be much cheaper to evade non-authenticated requests.
View on Reddit #47062830

Puzzled-Pass-1318@reddit

Maybe it's because China is celebrating the Spring Festival and everyone is on holiday :)
View on Reddit #47060246

phenotype001@reddit

I can't even access the platform page, 503 error: [platform.deepseek.com](http://platform.deepseek.com)
View on Reddit #47059900

Previous-Piglet4353@reddit

I thought this was a local LLM subreddit. My local Deepseek-r1 70b is doing just fine. OpenAI did, however, dumb down the o1 API. That got nerfed hard.
View on Reddit #47052043

diligentgrasshopper@reddit

A couple weeks ago I legit could get 40 requests per second from v3 :(( and here I was trying to churn as much distillation as possible before the API discount ends
View on Reddit #47050665

Dark_Fire_12@reddit

Give it two weeks and come back later when everyone moves on to something else. Google is cooking so maybe that can take the heat off them.
View on Reddit #47049178

CountPacula@reddit

This is no different than when GPT4 came out. Outages happen when you get popular.
View on Reddit #47042409

freedomanonyme@reddit

:(
View on Reddit #47037573