That's what they claim at least in the paper but that particular claim is not ve...

byefruit · on Jan 25, 2025

It's amazing how different the standards are here. Deepseek's released their weights under a real open source license and published a paper with their work which now has independent reproductions.

OpenAI literally haven't said a thing about how O1 even works.

huangruoyu · on Jan 27, 2025

DeepSeek the holding company is called high-flyer, they actually do open source their AI training platform as well, here is the repo: https://github.com/HFAiLab/hai-platform

Scipio_Afri · on Jan 28, 2025

Last update was 2 years ago before H100s or H800 existed. No way it has the optimized code that they used in there

Trioxin · on Jan 29, 2025

Who independently reproduced it? I haven't found such a thing.

huangruoyu · on Jan 27, 2025

it's open source, here is their platform called hai: https://github.com/HFAiLab/hai-platform

Scipio_Afri · on Jan 28, 2025

Last update was 2 years ago before H100s or H800 existed. No way it has the optimized code that they used in there

marbli2 · on Jan 25, 2025

They can be more open and yet still not open source enough that claims of theirs being unverifiable are still possible. Which is the case for their optimized HAI-LLM framework.

byefruit · on Jan 25, 2025

That's not what I'm saying, they may be hiding their true compute.

I'm pointing out that nearly every thread covering Deepseek R1 so far has been like this. Compare to the O1 system card thread: https://news.ycombinator.com/item?id=42330666

Very different standards.