sourcecodeplz · 2026-05-28T17:26:13 1779989173

From the release it seems we will also get Mythos pretty soon.

sourcecodeplz · 2026-05-27T19:49:37 1779911377

Yep, exactly this. And I have so much less anxiety that I have to use my 5-hour/weekly usage or I lose it... with deepseek api the credits never expire, I can use them when I want, how much I want and the prices are ridiculously low for the quality/intelligence/performance.

sourcecodeplz · 2026-05-27T18:01:46 1779904906

I think this is very true. They probably got scared of the almost 1b weekly active users of ChatGPT, and how people would rather ask ChatGPT than use Google. It will be a balance but this is a great opportunity for smaller search engines to make a real comeback.

sourcecodeplz · 2026-05-27T17:24:56 1779902696

With deepseek and xiaomi mimo models slashing their prices 99%, I don't see a great future for openai / anthropic with regards to their 1T valuations. Maybe 1T valuation will be the whole market, West + East.

jillesvangurp · 2026-05-27T21:06:15 1779915975

Most of the corporate world in the EU or North America will be hesitant to rely on Chinese AI providers. There are some very real blockers for that for things like data security, compliance, etc. And recent geopolitics don't help.

Legalities aside, you need to look not at the model quality but at the infrastructure needed to scale these models from tens (now) to hundreds (soon) of millions of users. Only a handful of companies actually have the resources and funding to do that. That's what these huge valuations are based on. These companies are gearing up to scale to these levels. That's why they are spending on data centers. Whoever has access to those data centers gets to tap into the revenue stream of people using models running on those.

The market for frontier models is roughly split between OpenAI, Anthropic, and Google. And then you have companies like X/SpaceX, Amazon, and Microsoft being more successful with their infrastructure than their AI products and companies like Apple, Meta that have the money and the aspiration but are so far not really managing to be very successful with their AI strategies.

Deepseek is just very poorly positioned to capture a lot of the enterprise revenue in the EU or North America. But they might become very dominant outside the US/EU. And of course China itself is going to be a huge market and equally unlikely to want to be dependent on US-owned AI companies.

sourcecodeplz · 2026-05-27T22:12:24 1779919944

Deepseek and all the other Chinese models have open-weights. You can host them yourself, no need to send data to China or rely on them.

tredre3 · 2026-05-27T23:10:11 1779923411

There is still a risk of supply-chain attack. People give LLMs direct access to their entire infrastructure via tools, and never check the code produced. It's not difficult to steer an LLM during training so that they'd output malware only when prompted a certain way, and that wouldn't come up during the initial evaluation.

Personally I see no difference between China and America in terms of risks of them embedding "backdoors" so to speak, but I disagree when people claim that open-weight models are obviously safe just because they can be run locally.

skeledrew · 2026-05-28T08:49:45 1779958185

> It's not difficult to steer an LLM during training so that they'd output malware only when prompted a certain way

Perhaps, but that's also a good way to lose users+reputation as there's no way to control when said malware is generated. Once the first instance is discovered cybersec researchers will have a field day reproducing it and showing the world.

dannyw · 2026-05-28T01:49:43 1779932983

It is not a trivial challenge setting up model serving infra for ~1T or larger models, especially in a high reliability environment (e.g. your team is using it for work, or you're using it to power production apps). Sure, there are third party providers, although the quality and reliability of their inference varies.

lbreakjai · 2026-05-28T00:10:30 1779927030

Run it on Bedrock. If you're already on AWS, procurement doesn't even need to be involved.

slopinthebag · 2026-05-27T23:35:24 1779924924

Run Deepseek on Deepinfra then? Or Fireworks if US-based is important. None of these are real issues outside maybe convincing your legal team to do a bit of homework.

jillesvangurp · 2026-05-28T06:35:39 1779950139

I don't think you are appreciating the physical constraints here. Deepseek doesn't really have the hardware in the US or EU to do anything at scale.

Sure, you can self host a non-frontier OSS model yourself; including Deepseek. And no doubt some people will pay one of the companies I mentioned to rent the infrastructure to do exactly that. Much of the rest of the world will be paying directly for direct access to the frontier models.

As for the legal/compliance stuff, I recommend you don't take any big decisions on that front without consulting lawyers. My understanding of that is that most serious companies in the EU have to take these topics pretty seriously. I'm sure in the US, hosting all your data and secrets in Chinese data centers isn't a whole lot less controversial.

The Chinese could of course choose to try to match the current levels of investment Google, OpenAI, Anthropic, etc. are putting into local infrastructure. But as far as I know they aren't and there are probably a few political blockers for that.

Without infrastructure, their role is being a niche player in these markets. It doesn't really matter how good they are if they can't scale to most of the market.

rbehrends · 2026-05-28T09:13:11 1779959591

I mean, Western providers such as Fireworks AI/Microsoft Foundry (US) or Tensorix (EU) already are offering many of these models on their own hardware with all the typical compliance boxes ticked through a standard API. Either as open weight models or through partnerships with Chinese firms, or both. DeepSeek etc. do not have to do anything here other than making their models available to Western partners (either as open weights or through a licensing agreement).

skeledrew · 2026-05-27T17:47:55 1779904075

They'll still have their dedicated enterprise customers. I think the Chinese providers will pull more of the single users who're paying their own way than those backed by company budget. And it's a pretty good split as the demand becomes better distributed, resulting in better service (I'll never forget just how bad access to Claude became until they got access to Colossus) and less potential for lock-in (we really don't want there to be a duopoly, etc. on good AI).

sourcecodeplz · 2026-05-27T13:39:57 1779889197

I tried both Claude Code and OpenCode with deepseek flash api. claude code eats more tokens for the same task (but only tested it for an hour).

sourcecodeplz · 2026-05-27T10:43:18 1779878598

Claude Code CLI is just a software package, if Anthropic API is down you could always connect Deepseek/other provider API to Claude Code CLI...
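A rough sketch of that fallback idea, not how Claude Code itself is wired: try one provider, and if the call fails, move on to the next. Everything named here (`complete_with_fallback`, `primary_down`, `backup`) is hypothetical and stands in for real API clients.

```python
from typing import Callable, Sequence

# Hypothetical type: a provider is any callable that takes a prompt and returns text.
Provider = Callable[[str], str]


class AllProvidersFailed(RuntimeError):
    """Raised when every configured provider raised an error."""


def complete_with_fallback(
    prompt: str, providers: Sequence[tuple[str, Provider]]
) -> tuple[str, str]:
    """Try each (name, provider) pair in order; return the first successful answer."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # a real client would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Stand-in providers for illustration only; they make no network calls.
def primary_down(prompt: str) -> str:
    raise ConnectionError("primary API unavailable")


def backup(prompt: str) -> str:
    return f"echo: {prompt}"


if __name__ == "__main__":
    used, text = complete_with_fallback(
        "hi", [("primary", primary_down), ("backup", backup)]
    )
    print(used, text)
```

The order of the list is the whole policy here: whichever provider comes first is preferred, and the rest are only used on failure.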

lionkor · 2026-05-27T11:09:18 1779880158

The point is that, with a sufficiently complex setup (with skills, MCPs, prompts, etc.) the difference in AI models will impact the quality of work. You might not care now, but you might care when you have 2 million lines of code and zero idea what's going on.

The point is vendor lock-in. The vibe coding community has reinvented vendor lock-in and is bound to repeat every mistake associated with it.

koonsolo · 2026-05-27T15:47:15 1779896835

Can you give an example of a skill or prompt that would work in Claude and not in the others?

sgc · 2026-05-27T17:47:11 1779904031

Pretty much every single detailed prompt made after trial, error, and refinement is tailored to a specific LLM. They will all perform worse used with other LLMs than a similar prompt tailored for the second LLM would perform, and at times quite poorly.

tvmalsv · 2026-05-27T23:39:23 1779925163

How well would it work to ask the working LLM to rewrite the prompt to get the best results? Do the models understand enough about themselves to do that?

sgc · 2026-05-28T01:39:00 1779932340

Claude has a /product-self-knowledge skill, and I am sure the others have something similar. So yes, it is possible if you work with care, as necessary with all things LLM related. There are hundreds if not thousands of skills on github that were created just this way.

CamperBob2 · 2026-05-28T01:12:09 1779930729

That's kind of pointless, then, because what happens when Anthropic releases their next-gen model?

sgc · 2026-05-28T01:40:45 1779932445

It's not like you aim to do it, you are just in a feedback loop improving results for the tool you are using. It is inherent in any prompt developed through iteration.

sourcecodeplz · 2026-05-27T01:00:40 1779843640

I've read on X that deepseek api can stay alive for hours vs 5 minutes tops for other providers. they do it with ram and ssd, not only vram.

sourcecodeplz · 2026-05-27T00:35:22 1779842122

You can use Codex as an orchestrator and claude code via mimo/deepseek api as executor. I've read this a lot before but when you really try it, it is really something in the way you can stretch your credits.

sourcecodeplz · 2026-05-26T16:57:29 1779814649

It runs right now on 512gb RAM Macs and PCs.

Our_Benefactors · 2026-05-26T20:16:03 1779826563

It runs like shit though in terms of tokens/second and still has a reduced context window. Vs a single claude prompt can easily get into 300k tokens without breaking a sweat.

I want local AI to be a thing but the hardware isn’t here yet, because the only options are a Mac Studio or DGX machines strapped together. RAM prices need to crash before local AI has a chance at actually competing.

zozbot234 · 2026-05-26T20:38:31 1779827911

The more recent Chinese models are no longer heavily limited by context size. The context can easily fit in RAM on a prosumer laptop. (You can also use swap space to extend that, since context is only written to once per inference, thus a relatively mild wear-and-tear concern.)

Our_Benefactors · 2026-05-27T00:59:00 1779843540

Claude has 1M context window for the enterprise. 128k feels like a toy in comparison.

sourcecodeplz · 2026-05-27T01:10:59 1779844259

Deepseek pro/flash both have 1m.

ATMLOTTOBEER · 2026-05-27T15:08:19 1779894499

You’re right, and it feels like these people saying otherwise either don’t use these tools professionally (and therefore can’t tell a difference between local/cloud models) or literally just haven’t tried running local models

As soon as I can buy hardware for less than 5k that runs an opus 4.6+/5.5 model locally I will do it instantly

sourcecodeplz · 2026-05-26T16:48:39 1779814119

Don't think so, from what I've heard deepseek isn't losing money on inference.