Improving mathematical reasoning with process supervision

Technology

OpenAI Blog

2023/05/31 - 07:00

We've trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply rewarding the correct final answer (“outcome supervision”). In addition to boosting performance relative to outcome supervision, process supervision also has an important alignment benefit: it directly trains the model to produce a chain-of-thought that is endorsed by humans.
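
The contrast between the two reward schemes can be illustrated with a toy sketch. This is not OpenAI's implementation: the `Step` class, the `outcome_reward` and `process_rewards` functions, the 0/1 reward values, and the hand-written labels are all hypothetical, chosen only to show where the feedback signal attaches in each scheme.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Step:
    """One line of a chain-of-thought, with a hypothetical human label."""
    text: str
    is_correct: bool


def outcome_reward(final_answer: str, reference_answer: str) -> float:
    """Outcome supervision: a single signal based only on the final answer."""
    return 1.0 if final_answer.strip() == reference_answer.strip() else 0.0


def process_rewards(steps: List[Step]) -> List[float]:
    """Process supervision: one signal per reasoning step."""
    return [1.0 if step.is_correct else 0.0 for step in steps]


# Toy solution to "2x + 3 = 11" (true answer: x = 4).
# Two mistakes cancel out, so the final answer is right
# even though the reasoning is not.
steps = [
    Step("Start from 2x + 3 = 11.", True),
    Step("Subtract 3 from both sides: 2x = 14.", False),  # should be 2x = 8
    Step("Divide by 2: x = 4.", False),                   # 14 / 2 is 7, not 4
]
final_answer = "4"

outcome = outcome_reward(final_answer, "4")
per_step = process_rewards(steps)

print("outcome reward:", outcome)
print("process rewards:", per_step)
```

In this toy case the outcome signal fully rewards the solution, while the per-step signals flag the two faulty steps. That is the sense in which step-level feedback targets the reasoning itself rather than only its result.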

This article was originally published by OpenAI Blog. Khabr is a licensed Jordanian AI-powered news platform (Registration #82086). We add editorial value through: AI-powered news analysis, automated summaries, AI audio narration, multi-language translation (Arabic, English, French, Turkish), and AI fact-checking. Our mission is to make news more accessible and understandable for Arabic-speaking audiences worldwide.
