Learning Montezuma’s Revenge from a single demonstration

OpenAI Blog

2018/07/04 - 07:00

We’ve trained an agent to achieve a high score of 74,500 on Montezuma’s Revenge from a single human demonstration, better than any previously published result. Our algorithm is simple: the agent plays a sequence of games starting from carefully chosen states from the demonstration, and learns from them by optimizing the game score using PPO, the same reinforcement learning algorithm that underpins OpenAI Five.
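The idea of starting episodes from chosen demonstration states can be sketched as a small scheduler. This is a minimal illustration, not the published implementation: the class name `DemoStartCurriculum`, the parameters `step_back` and `success_threshold`, and the rule of moving the start point earlier once enough episodes succeed are all assumptions made for this sketch, since the text above says only that the states are "carefully chosen". The PPO update itself is left out.

```python
from typing import Sequence


class DemoStartCurriculum:
    """Picks the demonstration step each training episode starts from.

    Hypothetical sketch: begin near the end of the demonstration and move
    the start point earlier once the agent succeeds often enough.
    """

    def __init__(self, demo_length: int, step_back: int = 1,
                 success_threshold: float = 0.2) -> None:
        if demo_length < 1:
            raise ValueError("demo_length must be at least 1")
        self.step_back = step_back                  # assumed step size
        self.success_threshold = success_threshold  # assumed success rate needed
        # Start from the last saved state, where little play remains.
        self.start_index = demo_length - 1

    def update(self, successes: Sequence[bool]) -> int:
        """Record one batch of episode outcomes and return the next start index.

        Each entry says whether an episode reached the target score
        (how success is judged is an assumption of this sketch).
        """
        if successes:
            rate = sum(successes) / len(successes)
            if rate >= self.success_threshold:
                # Move toward the beginning of the demonstration, never past it.
                self.start_index = max(self.start_index - self.step_back, 0)
        return self.start_index


# Usage: after each batch of rollouts, ask where the next batch should start.
curriculum = DemoStartCurriculum(demo_length=10, step_back=3)
next_start = curriculum.update([True, False, False, True])
```

In a real training loop the returned index would select a saved emulator state to restore before each episode, and the collected episodes would feed the PPO update.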

Source: OpenAI Blog

This article was originally published by OpenAI Blog.
