Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That's what they claim at least in the paper but that particular claim is not verifiable. The HAI-LLM framework they reference in the paper is not open sourced and it seems they have no plans to.

Additionally there are claims, such as those by Scale AI CEO Alexandr Wang on CNBC 1/23/2025 time segment below, that DeepSeek has 50,000 H100s that "they can't talk about" due to economic sanctions (implying they likely got by avoiding them somehow when restrictions were looser). His assessment is that they will be more limited moving forward.

https://youtu.be/x9Ekl9Izd38?t=178



It's amazing how different the standards are here. Deepseek's released their weights under a real open source license and published a paper with their work which now has independent reproductions.

OpenAI literally haven't said a thing about how O1 even works.


DeepSeek the holding company is called high-flyer, they actually do open source their AI training platform as well, here is the repo: https://github.com/HFAiLab/hai-platform


Last update was 2 years ago before H100s or H800 existed. No way it has the optimized code that they used in there


Who independently reproduced it? I haven't found such a thing.


it's open source, here is their platform called hai: https://github.com/HFAiLab/hai-platform


Last update was 2 years ago before H100s or H800 existed. No way it has the optimized code that they used in there


They can be more open and yet still not open source enough that claims of theirs being unverifiable are still possible. Which is the case for their optimized HAI-LLM framework.


That's not what I'm saying, they may be hiding their true compute.

I'm pointing out that nearly every thread covering Deepseek R1 so far has been like this. Compare to the O1 system card thread: https://news.ycombinator.com/item?id=42330666

Very different standards.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: