restingboredface@sh.itjust.works to Privacy@lemmy.mlEnglish · 1 年前DeepSeek collects keystroke data and more, storing it in Chinese serversmashable.comexternal-linkmessage-square347linkfedilinkarrow-up1598arrow-down1117file-text
arrow-up1481arrow-down1external-linkDeepSeek collects keystroke data and more, storing it in Chinese serversmashable.comrestingboredface@sh.itjust.works to Privacy@lemmy.mlEnglish · 1 年前message-square347linkfedilinkfile-text
minus-squareayaya@lemdro.idlinkfedilinkEnglisharrow-up16·1 年前This is mildly pedantic but you’re not actually running Deepseek R1, you’re running a 7B version of Qwen that’s been fine-tuned on Deepseek R1 outputs. All of the “distilled” models are existing models trained on R1.
minus-squareZeDoTelhado@lemmy.worldlinkfedilinkarrow-up6arrow-down9·1 年前Nice catch. I’ll be sure after do run the real thing
minus-squarestink@lemmygrad.mllinkfedilinkEnglisharrow-up13arrow-down4·1 年前If you don’t know what you are doing please stop trying to act like an expert in the subject.
This is mildly pedantic but you’re not actually running Deepseek R1, you’re running a 7B version of Qwen that’s been fine-tuned on Deepseek R1 outputs. All of the “distilled” models are existing models trained on R1.
Nice catch. I’ll be sure after do run the real thing
If you don’t know what you are doing please stop trying to act like an expert in the subject.
When did they claim to be an expert??