... | 🕐 --:--
-- -- --
عاجل
⚡ عاجل: كريستيانو رونالدو يُتوّج كأفضل لاعب كرة قدم في العالم ⚡ أخبار عاجلة تتابعونها لحظة بلحظة على خبر ⚡ تابعوا آخر المستجدات والأحداث من حول العالم
⌘K
AI مباشر
246953 مقال 299 مصدر نشط 38 قناة مباشرة 7015 خبر اليوم
آخر تحديث: منذ ثانيتين

Hy3 Preview: Tencent’s Base-Model Play Built For The Larger Ecosystem

تكنولوجيا
Forbes
2026/04/23 - 11:03 501 مشاهدة
InnovationConsumer TechHy3 Preview: Tencent’s Base-Model Play Built For The Larger EcosystemByVivian Toh,Contributor.Forbes contributors publish independent expert analyses and insights. Vivian Toh is the chief editor of Tech Tech China covering China Tech Follow AuthorApr 23, 2026, 07:03am EDT--:-- / --:--This voice experience is generated by AI. Learn more.This voice experience is generated by AI. Learn more.Logo of Hy3 preview on Hunyuan's websiteHy websiteA user on Little Red Book recently asked Yuanbao, Tencent’s AI chatbot: "I always feel lonely. What should I do?" The response was not a list of coping strategies. The model spent two seconds — visible in its reasoning trace — calibrating an empathetic tone and leaving the conversation open. The post went viral. What made it notable was not that an AI had learned to be nice, but that an AI product had learned to behave in a way its users actually wanted. That distinction — between model capability and product fit — is what Hy3 Preview is designed to solve, and it points to Tencent's broader base-model strategy.90 Days, From ScratchIn February 2026, Tencent tore down its pre-training and reinforcement-learning infrastructure and rebuilt both from scratch. Six weeks later it began training Hy3 preview. Ten weeks after that, it went live. The rebuild was guided by three principles: capability systematisation (refusing to let any model "specialise" its way out of product usefulness), evaluation authenticity (testing against real tasks, not leaderboards), and cost-performance (co-designing model and inference framework so capability gains do not price the model out of deployment). The 90-day timeline is impressive. These three principles explain how it was possible.The Deliberate Choice Not to Go BiggerHy3 Preview runs a mixture-of-experts architecture that Tencent describes as a fusion of fast and slow thinking: 294 billion parameters total, 21 billion activated per forward pass, routi...
مشاركة:

مقالات ذات صلة

AI
يا هلا! اسألني أي شي 🎤