Artificial Intelligence

   

The Circle of Life for LLMs[:] Was the Reaction to DeepSeek Justified?

Authors: Stephane H. Maes

Since the release of DeepSeek LLMs, the industry, the investors, and the media have reacted with alarm, surprised that a Chinese startup—despite operating on a low budget and with limited access to specialized AI hardware—could surpass the latest models with reasoning capabilities. This has led to geopolitical concerns about threats to U.S. technological dominance, and the effectiveness of AI chip sanctions imposed by the U.S. on China. Investor confidence in leading U.S. tech companies involved in AI, AI hardware, and AI/cloud hosting has been shaken, contributing to a significant stock market drop on January 27, 2025.In this paper, we argue that while the success of DeepSeek V3 and R1 is remarkable, it does not signal the decline of any major player. Instead, it is a natural progression of how LLMs and generative AI function. Most LLM providers, of a same LLM generation, rely on similar algorithms, big-data pools, and development techniques, meaning that models tend to converge in performance once their methodologies become public. Different starting points often lead to LLMs of comparable capabilities for a same generation. Techniques such as model distillation and reinforcement learning further enable the reduction of model size, data requirements, and hardware constraints. As a result, each time a model is developed, it can be replicated, closely matched, or even surpassed soon after—sometimes with significantly lower effort than the original, or with a significantly smaller set of parameters. This cycle of life will continue as long as LLMs remain a competitive field, vs. a commodity, and until new AI approaches beyond GenAI emerge, or the old AI reemerges.Such a pattern will continue, repeating the cycle. Open source models have the advantage of drawing from broader communities and collective innovation, making it increasingly difficult for proprietary models to maintain an edge. As development costs rise, it will be interesting to see whether proprietary models can sustain their dominance.Ultimately, there was no reason for panic. AI may be in a bubble, but if it bursts, it will not be because DeepSeek outperforms OpenAI’s latest model. Instead, the real challenges facing LLMs and GenAI lie elsewhere. The path to AGI is likely beyond current LLMs. While AI agents may extend the viability of GenAI, other factors pose more significant long-term threats. If LLMs are not the future of AI, there is little reason to be concerned about new players mastering them.

Comments: 33 Pages. All related details of the projects (and updates) can be found and followed at https://shmaes.wordpress.com/

Download: PDF

Submission history

[v1] 2025-03-02 21:53:17

Unique-IP document downloads: 364 times

Vixra.org is a pre-print repository rather than a journal. Articles hosted may not yet have been verified by peer-review and should be treated as preliminary. In particular, anything that appears to include financial or legal advice or proposed medical treatments should be treated with due caution. Vixra.org will not be responsible for any consequences of actions that result from any form of use of any documents on this website.

Add your own feedback and questions here:
You are equally welcome to be positive or negative about any paper but please be polite. If you are being critical you must mention at least one specific error, otherwise your comment will be deleted as unhelpful.

comments powered by Disqus