How Chinese A.I. Start-Up DeepSeek Is Competing With OpenAI and Google

One day after Christmas, a small Chinese start-up named DeepSeek unveiled a new A.I. system that could match the capabilities of the latest chatbots from companies like OpenAI and Google.

That alone would have been a milestone. But the team behind the system, called DeepSeek-V3, described an even bigger step. In a research paper explaining how they built the technology, DeepSeek’s engineers said they used only a fraction of the highly specialized computer chips that leading A.I. companies rely on to train their systems.

These chips are at the center of a tense technological competition between the United States and China. As the U.S. government works to maintain the country’s lead in the global A.I. race, it is trying to limit the number of powerful chips, such as those made by the Silicon Valley firm Nvidia, that can be sold to China and other rivals.

But the performance of the DeepSeek model raises questions about the unintended consequences of the U.S. government’s trade restrictions. The controls have forced researchers in China to get creative with a wide range of tools that are freely available online.

The DeepSeek chatbot answered questions, solved logic problems and wrote its own computer programs as capably as anything already on the market, according to the standard benchmark tests that U.S. companies have been using.

And it was created cheaply, challenging the prevailing idea that only the tech industry’s largest companies, all of them based in the United States, could afford to make the most advanced A.I. systems. The Chinese engineers said they needed only about $6 million in raw computing power to build their new system. That is about 10 times less than the tech giant Meta spent building its latest A.I. technology.

“The number of companies that have $6 million to spend is much higher than the number of companies that have $100 million or $1 billion to spend,” said Chris V. Nicholson, an investor with the venture capital firm Page One Ventures, who focuses on A.I. technologies.

Since OpenAI ignited the A.I. boom in 2022 with the release of ChatGPT, many experts and investors had concluded that no company could compete with the market leaders without spending hundreds of millions of dollars on specialized chips.

The world’s leading A.I. companies train their chatbots on supercomputers that use as many as 16,000 chips, if not more. DeepSeek’s engineers, on the other hand, said they needed only about 2,000 specialized computer chips from Nvidia.

Restrictions on chips in China forced DeepSeek’s engineers to “train it more efficiently so they can still be competitive,” said Jeffrey Ding, a professor at George Washington University who specializes in emerging technology and international relations.

Earlier this month, the Biden administration issued new rules that aim to prevent China from obtaining advanced A.I. chips through other countries. The rules build on multiple rounds of earlier restrictions that prevent Chinese companies from buying or making advanced computer chips. President Trump has not yet indicated whether he will keep the rules or revoke them.

The U.S. government has tried to keep advanced chips out of the hands of Chinese companies because of concerns that they could be used for military purposes. In response, some firms in China have stockpiled thousands of chips, while others have obtained them from a thriving underground market of smugglers.

DeepSeek is run by a quantitative stock trading firm called High-Flyer. By 2021, it had channeled its profits into acquiring thousands of Nvidia chips, which it used to train its earlier models. The company, which did not respond to requests for comment, has become known in China for recruiting talent fresh from top universities with the promise of high salaries and the freedom to pursue the research questions that most pique their interest.

Zihan Wang, a computer engineer who worked on an earlier DeepSeek model, said the company also hires people without any computer science background to help the technology understand and generate poetry and handle questions on China’s notoriously difficult college admissions exam.

DeepSeek does not make any products for consumers, leaving its engineers to focus entirely on research. This means its technology is not bound by the strictest aspects of China’s A.I. regulations, which require consumer-facing technology to comply with the government’s controls on information.

The leading U.S. companies continue to advance the state of the art. In December, OpenAI unveiled a new “reasoning” system called o3 that exceeds the performance of existing technologies, although it is not yet widely available outside the company. But DeepSeek continues to show that it is not far behind. This month, it released an impressive reasoning model of its own.

(The New York Times has sued OpenAI and its partner, Microsoft, accusing them of copyright infringement of news content related to A.I. systems. OpenAI and Microsoft have denied those claims.)

An essential part of this rapidly changing global market is an old idea: open source software. Like many other companies, DeepSeek has open sourced its latest A.I. system, which means it has shared the underlying code with other businesses and researchers. This allows others to build and distribute their own products using the same technologies.

While employees at large Chinese technology companies are limited to collaborating with colleagues, “if you work on open source, you work with talent around the world,” said Yineng Zhang, a lead software engineer at Baseten in San Francisco who works on the open source SGLang project. He helps other people and companies build products using the DeepSeek system.

The open source ecosystem for A.I. gathered steam in 2023 when Meta freely shared an A.I. system called Llama. Many assumed that this community would flourish only if companies like Meta, tech giants with massive data centers filled with specialized chips, continued to open source their technologies. But DeepSeek and others have shown that they, too, can expand the powers of open source technologies.

Many executives and experts have argued that the big U.S. companies should not open source their technologies because they could be used to spread disinformation or cause other serious harm. Some U.S. lawmakers have explored the possibility of preventing or curbing the practice.

But others argue that if regulators hinder the progress of open source technology in the United States, China will gain a significant advantage. If the best open source technologies come from China, they argue, American developers will build their systems on top of those technologies. In the long run, that could put China at the center of research and development.

“The center of gravity of the open source community has moved to China,” said Ion Stoica, a computer science professor at the University of California, Berkeley. “This can be a major risk to the U.S. because it allows China to accelerate the development of new technologies.”

Hours after his inauguration, President Trump rescinded a Biden administration executive order that threatened to curb open source technologies.

Dr. Stoica and his students recently built an A.I. system called Sky-T1 that rivals the performance of the latest OpenAI system, called OpenAI o1, on some standard benchmark tests. They needed only $450 in computing power.

They did this by building on top of two open source technologies released by the Chinese tech giant Alibaba.

Their $450 system is not as powerful as OpenAI’s technology or DeepSeek’s new system. And the techniques they used are unlikely to produce systems that exceed the performance of the leading technologies. But the project showed that even operations with very limited resources can build competitive systems.

Reuven Cohen, a technology consultant in Toronto, has been using DeepSeek-V3 since late December. He says it is comparable to the latest systems from OpenAI, Google and the San Francisco start-up Anthropic, and much cheaper to use.

“DeepSeek is a way for me to save money,” he said. “This is the type of technology that someone like me wants to use.”
