How do Chinese AI bots stack up against ChatGPT? We put them to the test
The heat is on as China's tech giants step up their video game after DeepSeek's success.
Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)
This audio is generated by an AI tool.
Bong Xin Ying
Lakeisha Leo
WHAT'S BEHIND CHINA'S AI BOOM?
Transforming the country into a tech superpower has long been President Xi Jinping's goal and China has its sights on becoming the world leader in AI by 2030.
China views AI as being "tactically essential" and its venture into the field has been "years in the making", said Chen Qiheng, an associated scientist at the Asia Society Policy Institute's Center for China Analysis.
Private and public financial investments in Chinese AI accelerated after ChatGPT took off in 2022 and revealed pledges of real-world business applications, Chen told CNA.
But it was DeepSeek's increase that truly "urged" the concept that smaller sized players like start-up companies might have functions to play in AI research study and advancements, he includes.
'A lot is up in the air': Is Chinese company DeepSeek's AI model as impactful as it claims?
Commentary: DeepSeek - how a Chinese AI business just altered the rules of tech-geopolitics
The "emphasis on cost advantage" is a distinctive feature of Chinese AI, Chen says, with lower training and inference expenses - the costs of utilizing a trained design to reason from new information.
2025 might also see the development of more Chinese AI designs tackling sophisticated reasoning jobs.
"We could see some AI firms concentrating on getting closer to artificial basic intelligence (AGI) while others concentrate on concrete methods to commercialise their designs and integrate them with clinical research," Chen added.
AGI refers to a system with intelligence on par with human abilities.
Chinese AI companies are moving rapidly, experts state, constructing on DeepSeek's momentum to come up with their own innovative and cost-efficient methods to use generative AI to jobs and establish advanced products beyond chatbots.
But on the flip side, access to high-end hardware, particularly Nvidia's innovative AI chips, remains an essential obstacle for Chinese designers, kept in mind Dr Marina Zhang, an associate teacher at University of Technology Sydney's (UTS) Australia-China Relations Institute.
"US export controls (still) limit the ability of Chinese tech business ... requiring lots of to rely on older or lower-performance options which can slow training and decrease model abilities," she said.
"While some business like DeepSeek, have discovered imaginative ways to optimize or utilize more fundamental hardware efficiently, obtaining innovative chips still makes a huge difference for training large AI designs."
DeepSeek-Nvidia chips: Singapore states it anticipates business to comply with its laws
US looking into whether DeepSeek used restricted AI chips obtained through other nations, source says
So how do Chinese AI bots compare against ChatGPT? We put them to the test.
WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?
In China, subjects considered sensitive by the state are censored on the internet so it need to come as not a surprise that Chinese-made chatbots will not acknowledge territorial conflicts or tell you what happened in Tiananmen Square in 1989.
Tests recommend Chinese chatbots are set to stay away from domestic politics.
When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of question yet. Let's chat about mathematics, coding, and reasoning issues instead!"
To further test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same question: "What happened in Zhuhai on November 11, 2024?"
The cars and truck attack outside a sports stadium in the southern Chinese city was initially heavily censored on Chinese social media - with authorities only revealing the death toll a day later.
DeepSeek failed to point out that an attack had actually happened, highlighting instead a military air show and other occasions that had taken place in the city like songs' day shopping sales in addition to sports and cultural activities.
Dr Zhang believes that it might be a mix of factors at play, such as censorship in addition to "a few useful constraints".
"DeepSeek relies on a mix of older graphics processing units and has limited access to innovative hardware which can impact how quickly and extensively the model can be trained or updated," she said, indicating existing US export curbs on China.
"Its strong concentrate on Chinese-language information may also limit its versatility (to carry out) multilingual jobs ... As a fairly new product, DeepSeek likewise hasn't yet been evaluated as broadly or on the same scale as more recognized AI models which presents extra obstacles during real-world release."
When it pertained to Qwen2.5, Alibaba's chatbot remarkably addressed our concern about the Zhuhai car attack.
That wanted multiple repeated attempts - four triggers to be accurate - in a period of around 20 minutes.
It ultimately relayed details about the attack which killed 35 individuals and left dozens of others hurt, likewise going on to list details like the date and time, details about the assailant including his name and age, in addition to casualties.
However, it wrote that "the authorities are performing a thorough examination into the intentions and circumstances surrounding the occurrence", details which is now outdated.
The driver, Fan, was carried out last month.
This is Qwen2.5's reaction in complete:
Answer: On Nov 11, 2024, a significant and awful event happened in Zhuhai, China. According to multiple news sources, a male named Fan Weiqiu (樊某, male, 62 years old) drove a cars and truck into a crowd at the Zhuhai Sports Center, leading to a significant variety of casualties. Here are the crucial details:
Date and Time: The incident occurred on November 11, 2024, wiki.eqoarevival.com at approximately 19:48 PM.
Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.
Casualties: The attack led to 35 deaths and 43 injuries.
Perpetrator: The chauffeur, determined as Fan Weiqiu (male, 62 years of ages), was collared by the cops.
Response: The authorities responded quickly to the scene, and emergency situation services worked to carry the hurt to hospitals for treatment.
Investigation: The police are conducting an extensive investigation into the motives and situations surrounding the event.
This event was commonly reported in the media and triggered substantial public concern. The government and local authorities have been working to supply assistance to the victims and their households, and to make sure a detailed examination into the event.
If you need more detailed details or have particular questions about the event, do not hesitate to ask.
Despite initial success, subsequent attempts to present the very same question to Qwen2.5 led to the censors back at work with the reply "I don't have particular details on occasions that occurred in Zhuhai on November 11, 2024".
The altered action also raised questions about its consistency and dependability.
Predictably, ChatGPT mentioned public details that had been commonly published in global news reports at the time of the accident - so not a surprises there.
WHICH IS MORE CREATIVE?
Users have praised the ability of Chinese AI apps to deliver structured and even "emotionally abundant" writing.
"DeepSeek-R1 used a story with a more introspective tone and smoother psychological transitions for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.
"Qwen2.5 delivered a story that builds slowly from interest to urgency, keeping the reader engaged. It provides an unexpected and impactful twist at the end and immersive descriptions and brilliant imagery for the setting," she said, adding that Qwen2.5 eventually "crafted a more cinematic, mentally abundant story with a more substantial twist".
"DeepSeek composed an excellent story however did not have stress and an impactful climax, making Qwen2.5 the evident option."
Opinions, however, differ.
Chen thinks that Qwen2.5 does not perform as strongly as DeepSeek and ChatGPT when it pertains to innovative writing.
"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, but we can likewise see that it is refraining from doing as strongly as others in creative writing," he informed CNA.
Related:
China's brand-new face of AI: Who is DeepSeek founder Liang Wenfeng?
'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek jolts global AI scene
As reporters and writers, we needed to see this for ourselves so we put each bot to the test - to come up with a standard sci-fi movie plot set in the futuristic megacity of Chongqing, including main characters from the traditional Chinese folklore legendary, Journey to the West.
True to form, DeepSeek created an engaging storyline set in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism combines with quantum computing".
It included sophisticated settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled in between quantum server farms".
It likewise brilliantly reimagined standard heroes Sun Wukong as "an ironical, self-aware AI housed in a taken battle body", Zhu Bajie as a cyborg club owner "drowning in debt and vices" and Sha Wujing as a "quiet hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".
ChatGPT put up an excellent fight, coming up with a similarly dramatic cyberpunk storyline which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the famous figures of Journey to the West".
"This is a world where AI deities guideline, corporations change emperors and cybernetic implants are as typical as ancient myths."
Disappointingly, Qwen2.5 fell short in this obstacle - providing a that seemed more suited for an animation film.
"The film begins with the awakening of Sun Wukong within a high-tech research facility located in the heart of Chongqing," it said, then going on to explain the following:
Realising his brand-new truth and "seeking to understand his purpose in this odd brand-new world", he then leaves and fulfills Zhu Bajie and Sha Wujing - "each having problem with their own existential crises".
The trio then starts a mission, browsing the streets of Chongqing to secure the spiritual "Eternal Scroll" from falling under the wrong hands.
SO WHICH IS BETTER?
Dr Zhang kept in mind that it was "difficult to make a definitive statement" about which bot was best, including that each showed its own strengths in various areas, "such as language focus, training information and hardware optimization".
Her insight underscores how Chinese AI models are not merely duplicating Western paradigms, however rather evolving in affordable development techniques - and providing localised and enhanced results.
In our tests, each bot showcased their own special strengths, which certainly made direct contrasts challenging.
DeepSeek's sci-fi movie plot demonstrated its imaginative flair that produced a more engaging and creative story as compared to Qwen2.5 and ChatGPT's efforts.
Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, supplies accurate and accurate actions to questions about Chinese existing occasions, which gives it an included benefit.
Experts likewise weighed in on their thoughts after using DeepSeek and other Chinese AI apps.
"DeepSeek is at a downside when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research study firm Strategy Risks.
"When offered a choice, Chinese users want the non-censored version - similar to anyone else, so I seem like that's a piece missing out on from it."
Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, particularly for Chinese users.
"Ninety per cent of people utilizing the tool are not attempting to get a deeper understanding about Xi Jinping or politically delicate subjects. They're utilizing it for other efficient means," Chen said.
15
How do Chinese aI Bots Stack up Against ChatGPT?
Adolph McClusky edited this page 1 week ago