Tuesday, January 21, 2025

AI from China: DeepSeek; Open Source

DeepSeek claims its 'reasoning' model beats OpenAI's o1 on certain benchmarks | TechCrunch

Chinese AI lab DeepSeek has released an open version of DeepSeek-R1, its so-called reasoning model, that it claims performs as well as OpenAI’s o1 on certain AI benchmarks.

R1 is available from the AI dev platform Hugging Face under an MIT license, meaning it can be used commercially without restrictions. According to DeepSeek, R1 beats o1 on the benchmarks AIME, MATH-500, and SWE-bench Verified.





at the World Artificial Intelligence Conference in Shanghai, Baidu’s CEO, Robin Li Yanhong, asked a surprising question: Does China have too many AI startups? As he put it: “In 2023, intense competition among over 100 LLMs has emerged in China, resulting in a significant waste of resources, particularly computing power. … How about real-world applications? Who has benefited from them?”





Yann LeCun • Following"To people who see the performance of DeepSeek and think:
"China is surpassing the US in AI."
You are reading this wrong.
The correct reading is:
"Open source models are surpassing proprietary ones."

DeepSeek has profited from open research and open source (e.g. PyTorch and Llama from Meta)
They came up with new ideas and built them on top of other people's work.
Because their work is published and open source, everyone can profit from it.
That is the power of open research and open source."













No comments: