# large language models

Latest news and articles about large language models

Total: 11 articles found

Close-up of wooden Scrabble tiles spelling 'China' and 'Deepseek' on a wooden surface.

Two Spring Festivals, One Industry: How China’s Tech Giants Turned AI into a Holiday Battle for National Reach

China’s AI competition has shifted from model development to a consumer battleground during two consecutive Lunar New Year campaigns. Alibaba, ByteDance, Tencent and Baidu used subsidies, embedded assistants and viral features to fight for national traffic, while smaller firms pursue agent‑style products that combine multiple models. The outcome will reshape who controls mass AI touchpoints in China, narrow the US–China model gap and raise barriers for smaller players unless they adopt alternative, interoperable strategies.

SoBiz2026年2月19日 05:14

#China AI#Alibaba#ByteDance

Colorful 3D render showcasing AI and programming with reflective abstract visuals.

Technology

China’s Kimi Rockets to a $10–12bn Valuation After Two Rapid Funding Rounds Exceeding $1.2bn

Kimi, a Chinese AI startup also known as Yue Zhi Anmian, has raised over $700 million in a new funding round led by existing investors, bringing two consecutive financings to more than $1.2 billion and valuing the company at $10–12 billion. The deals signal strong investor appetite for large AI models in China and a scramble by tech giants to secure model supply, but commercialisation and regulatory risks remain significant.

NeTe2026年2月17日 08:14

#Kimi#large language models#Alibaba

Colorful PHP code displayed on a dark screen, ideal for programming themes.

Technology

China’s Zhipu Pushes Prices Up as GLM-5 Goes Global — A Turning Point for Domestic AI Commercialisation

Zhipu Technology raised prices for its GLM Coding Plan and launched GLM-5 overseas on February 12, citing surging developer demand and the need for heavier investment in compute and model optimisation. The increase — 30% or higher domestically and substantially larger on overseas API pricing — marks a shift in China’s AI industry from low‑price competition to value-based monetisation.

NeTe2026年2月12日 17:14

#Zhipu#GLM-5#GLM Coding Plan

A man with a turban holds a sign saying 'India Stay Home' behind a glass wall, emphasizing pandemic awareness.

Technology

Microsoft Bets on Homegrown AI, Predicts Rapid Automation of White‑Collar Work

Microsoft is pivoting from heavy reliance on OpenAI to building its own leading large language models, mobilising vast compute and teams and planning roughly $140 billion in AI‑related capital spending. CEO Mustafa Suleyman warned many desk‑based white‑collar tasks could be automated within 12–18 months, a claim that underlines both the opportunity and disruption in Microsoft’s strategy to deploy professional‑grade general AI for enterprises.

NeTe2026年2月12日 17:04

#Microsoft#OpenAI#Mustafa Suleyman

Black and white close-up of Lexus F Sport steering wheel, emphasizing luxury car interior design.

Technology

A Night of Acceleration: Zhipu’s GLM‑5 Debuts as MiniMax and DeepSeek Race to Keep Up

Three leading Chinese AI firms unveiled near‑simultaneous upgrades that signal a shift from demo‑level coding assistants to production‑oriented, agentic systems. Zhipu launched GLM‑5 as an open‑source foundation for long‑horizon engineering tasks, while MiniMax and DeepSeek pushed product and context upgrades aimed at real‑world throughput and extended interactions.

NeTe2026年2月12日 11:04

#Zhipu#GLM-5#MiniMax

Image displaying DeepSeek AI interface for messaging and search functionality.

Technology

DeepSeek's Quiet Leap: 1‑Million‑Token Context and May‑2025 Knowledge Cut Hint at a Next‑Gen Chinese LLM

DeepSeek has begun limited testing of a model that supports a 1 million token context window and uses training data up to May 2025, a significant expansion from its previous 128k limit. The change suggests material architectural or pipeline upgrades and signals intensified competition among Chinese AI providers to ship more capable, enterprise‑ready models.

NeTe2026年2月11日 14:55

#DeepSeek#long context#1M tokens

A hand interacts with a smart speaker on a minimalist shelf, featuring a plant and stacked books.

Technology

Amazon Circles a Custom OpenAI Model to Supercharge Alexa as Talks of a Large Equity Deal Advance

Amazon is negotiating with OpenAI on a commercial agreement that could include a multibillion-dollar equity investment and the creation of bespoke OpenAI models for Amazon products such as Alexa. The arrangement would deepen technical collaboration but raise strategic and regulatory questions about control, vendor lock-in and market concentration in AI.

NeTe2026年2月4日 14:50

#Amazon#OpenAI#Alexa

Low-angle view of colorful residential buildings against a vivid blue sky in Hong Kong.

Business

Hang Seng Edges Higher as Tech Stocks Slip and AI Plays Rally

Hong Kong’s Hang Seng closed marginally higher while the Hang Seng Tech Index fell, as AI-related ‘large-model’ stocks and gold miners rallied and major internet platforms like Tencent slid. The session illustrated a market split: speculative AI plays attracted flows even as policy-sensitive mega-cap tech names remained susceptible to rumor-driven volatility.

NeMo2026年2月3日 13:00

#Hong Kong stocks#Hang Seng#Hang Seng Tech

A wall of traditional sake barrels in Shibuya City, Tokyo, showcasing Japanese culture.

Technology

Yang Zhilin Steps Forward: Moon’s Dark Side Ships Kimi K2.5 to Buy Time Against DeepSeek

Moon’s Dark Side released Kimi K2.5 with founder Yang Zhilin personally presenting the incremental upgrade, signalling a strategic shift from parameter-led competition to engineering improvements focused on coding and agent orchestration. The release is a defensive, deliverable move to shore up market position ahead of an expected DeepSeek model launch and to buy time for a more substantive K3 upgrade.

NeTe2026年1月29日 03:10

#Kimi#K2.5#Yang Zhilin

Stylish setup of iPhone 14 Pro showcasing dynamic island feature with accessories.

Technology

Alibaba Debuts Qwen3‑Max‑Thinking, a Tool‑Enabled Inference Model Aiming to Rival GPT‑5.2

Alibaba has launched Qwen3‑Max‑Thinking, an inference model that combines adaptive tool calling and test‑time scaling to improve reasoning, factual accuracy and alignment. Alibaba claims benchmark parity with leading models such as GPT‑5.2‑Thinking, and has deployed the capability in Qwen Chat, signalling rapid commercialisation within its cloud and consumer ecosystem.

NeTe2026年1月26日 16:10

#Alibaba#Qwen3‑Max‑Thinking#Qwen Chat

Technology

China’s Kimi Says Algorithmic Ingenuity, Not Massive Compute, Powered Its Leap — and the AI Race May Be Changing

At Davos, Kimi’s leadership said it achieved state‑of‑the‑art results with its K2 series while using a fraction of the compute typical of leading US labs, crediting deep algorithmic and engineering innovation. The company plans to use fresh capital to expand hardware for a next‑generation K3 model, underscoring a broader Chinese push to compete via efficiency rather than brute‑force compute.

NeTe2026年1月22日 21:30

#Kimi#Kimi K2 Thinking#Chinese AI