While I'm still personally skeptical of the ability of these tools to produce a GOOD software engineer, it's something I should probably consider testing in a limited capacity.

I've noticed DeepSeek has a few integrations, both official and hobbyist, with coding tools like Claude Code. Plus, I'd rather not pay £20/mo for any of this stuff, let alone to any AI company NOT linked to the CPC.

I might consider a locally hosted model, but the upfront cost of anything that can run one at decent speed with a high parameter count is quite prohibitive. My home server isn't really set up for good cooling!

  • sudoer777@lemmy.ml · 2 months ago

    A computer capable of running DeepSeek-V3.2 at full precision would cost well over $50k, IIRC.

    I saw a Hacker News article about someone running DeepSeek R1 for $6k, although that's still too expensive IMO.

    GLM 4.6

    I need to try this.

    Minimax-M2

    Kimi K2 Thinking also just came out

    • piccolo [any]@hexbear.net · 2 months ago

      Honestly I have not been super impressed with Kimi K2. Maybe the thinking model is better, but in my experience GLM has been much better. I’ll still give it a shot though.

      I saw a Hacker News article about someone running DeepSeek R1 for $6k, although that's still too expensive IMO.

      Do you remember what their setup was? My guess would be CPU inference with a metric fuckton of RAM if they were running it at the full quantization, which could work but would be pretty slow. But for $6k it’d be impossible to buy enough VRAM to run it at full quant on GPUs.
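      Since we're trading guesses about hardware, here's a quick back-of-envelope sketch of just the weight footprint. It assumes DeepSeek R1's published 671B total parameter count and the standard bytes-per-parameter for each precision; KV cache and activation overhead are ignored, so real requirements are higher:

```python
# Back-of-envelope weight-memory estimate for self-hosting a large model.
# 671B is DeepSeek R1's published total parameter count (it's an MoE, so
# all experts must sit in memory even though only some are active per token).
# Bytes-per-parameter are standard sizes; KV cache and overhead are ignored.

GIB = 1024 ** 3

def weight_footprint_gib(params_billions: float, bytes_per_param: float) -> float:
    """Approximate memory needed just to hold the model weights."""
    return params_billions * 1e9 * bytes_per_param / GIB

for label, bpp in [("FP16", 2.0), ("FP8", 1.0), ("4-bit", 0.5)]:
    print(f"R1 @ {label}: ~{weight_footprint_gib(671, bpp):.0f} GiB")
# → R1 @ FP16: ~1250 GiB
# → R1 @ FP8: ~625 GiB
# → R1 @ 4-bit: ~312 GiB
```

      Even at 4-bit, ~312 GiB of weights is far more than $6k buys in VRAM, which fits the guess that the cheap build used CPU inference against a big pool of system RAM.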