Tether Brings AI Memory Compression To Consumer Devices


Forbes

2026/06/02 - 15:58


By Thomas Coughlin, Contributor. Forbes contributors publish independent expert analyses and insights. Covering Digital Storage Technology & Market. IEEE President in 2024.

Jun 02, 2026, 11:58am EDT

[Image: data compression. Credit: Getty]

I wrote in March about Google’s TurboQuant for compressing data in memory for AI applications, focusing on data center applications. In that article, I said that TurboQuant is a compression algorithm that addresses the challenge of memory overhead in key-value storage for AI models with zero accuracy loss. I also said that by enabling AI with lower memory and storage requirements, we make that memory and storage even more useful, and that this will likely increase AI workflows, particularly on-premises. This could increase the memory and storage demand for implementing local AI inference. With today’s costs for digital memory and storage, this technology could enable useful AI implementations at much lower costs.

Recently a company called Tether introduced a version of TurboQuant that can be used on consumer devices like laptops and phones to process documents and extend AI conversations locally, using local memory and storage rather than public cloud-based resources. Tether TurboQuant is an open-source AI memory compression algorithm that reduces the key-value (KV) cache of large language models (LLMs) by 3-6 times, depending upon the workload. The figure below, from Tether, shows a 5 times reduction in required memory using TurboQuant.

[Figure: Data resource requirements with and without TurboQuant. Source: Tether]

TurboQuant compresses the KV cache used during inference sessions but doesn’t change the trained LLM model weights. This is important when a model is being accessed by a user. The KV cache keeps past keys and values in memory, and this increases over time as a user interacts with the model. The KV cache contents grow with every token and every active session. This can become a...
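To make the growth of the KV cache concrete, here is a minimal back-of-the-envelope sketch in Python. It is not Tether's or Google's code and it does not implement TurboQuant; it only estimates how much memory an uncompressed KV cache would need and what a 3x or 6x reduction (the range quoted above) would leave. The model dimensions used as defaults (32 layers, 8 KV heads, head dimension 128, 16-bit values) are hypothetical placeholders, not figures from the article or from any specific model.

```python
GIB = 1024 ** 3  # bytes in one gibibyte


def kv_cache_bytes(tokens, sessions=1, layers=32, kv_heads=8,
                   head_dim=128, bytes_per_value=2):
    """Estimate the uncompressed KV cache size in bytes.

    For every token, each layer stores one key vector and one value
    vector per KV head, hence the factor of 2. All default dimensions
    are hypothetical and chosen only for illustration.
    """
    per_token = 2 * layers * kv_heads * head_dim * bytes_per_value
    return per_token * tokens * sessions


def compressed_bytes(raw_bytes, ratio):
    """Size after an assumed compression ratio (e.g. 3 or 6)."""
    return raw_bytes / ratio


if __name__ == "__main__":
    # The cache grows linearly with context length and with active sessions.
    for tokens in (1_000, 32_000, 128_000):
        raw = kv_cache_bytes(tokens)
        best = compressed_bytes(raw, 6)   # upper end of the quoted range
        worst = compressed_bytes(raw, 3)  # lower end of the quoted range
        print(f"{tokens:>7} tokens: {raw / GIB:7.2f} GiB uncompressed, "
              f"{best / GIB:.2f} to {worst / GIB:.2f} GiB at 6x to 3x")
```

Because the estimate is linear in both tokens and sessions, doubling either one doubles the cache, which is why long conversations and large documents are where compression of this kind would matter most on a memory-limited laptop or phone.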

Source: Forbes

