🕐 --:--
-- --
عاجل
⚡ عاجل: كريستيانو رونالدو يُتوّج كأفضل لاعب كرة قدم في العالم ⚡ أخبار عاجلة تتابعونها لحظة بلحظة على خبر ⚡ تابعوا آخر المستجدات والأحداث من حول العالم
⌘K
AI مباشر
420084 مقال 251 مصدر نشط 79 قناة مباشرة 2401 خبر اليوم
آخر تحديث: منذ 3 ثواني
الفلاتر: 🏷️ tech مسح الكل ✕

Nonlinear computation in deep linear networks

OpenAI Blog تكنولوجيا منذ 8 سنوات

Nonlinear computation in deep linear networks

OpenAI Blog تكنولوجيا منذ 8 سنوات

Measure Brand Lift Across TV and Facebook - meta.com

Measure Brand Lift Across TV and Facebook  meta.com

Meta Newsroom تكنولوجيا منذ 8 سنوات

Contact - Tesla

Contact  Tesla

Tesla News تكنولوجيا منذ 8 سنوات

Learning to model other minds

We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisoner’s dilemma. This algorithm, Learning with Opponent-Learning Awareness (LOLA), is a small step towards agents that model other minds.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Learning to model other minds

We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisoner’s dilemma. This algorithm, Learning with Opponent-Learning Awareness (LOLA), is a small step towards agents that model other minds.

OpenAI Blog تكنولوجيا منذ 8 سنوات

UAE martyr Sultan Al Naqbi laid to rest in Ras Al Khaimah - Emirates 24|7

UAE martyr Sultan Al Naqbi laid to rest in Ras Al Khaimah  Emirates 24|7

Emirates 24|7 تكنولوجيا منذ 8 سنوات

Learning with opponent-learning awareness

OpenAI Blog تكنولوجيا منذ 8 سنوات

Find Us - Tesla

Find Us  Tesla

Tesla News تكنولوجيا منذ 8 سنوات

Supercharger - Tesla

Supercharger  Tesla

Tesla News تكنولوجيا منذ 8 سنوات

Announcing New Ways to Enjoy Memories with Friends - meta.com

Announcing New Ways to Enjoy Memories with Friends  meta.com

Meta Newsroom تكنولوجيا منذ 8 سنوات

OpenAI Baselines: ACKTR & A2C

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives equal performance. ACKTR is a more sample-efficient reinforcement learning algorithm than TRPO and A2C, and requires only slightly more computation than A2C per update.

OpenAI Blog تكنولوجيا منذ 8 سنوات

OpenAI Baselines: ACKTR & A2C

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives equal performance. ACKTR is a more sample-efficient reinforcement learning algorithm than TRPO and A2C, and requires only slightly more computation than A2C per update.

OpenAI Blog تكنولوجيا منذ 8 سنوات

More on Dota 2

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In the span of a month, our system went from barely matching a high-ranked player to beating the top pros and has continued to improve since then. Supervised deep learning systems can only be as good as their training datasets, but in self-play systems, the available data improves automatically as the agent gets better.

OpenAI Blog تكنولوجيا منذ 8 سنوات

More on Dota 2

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In the span of a month, our system went from barely matching a high-ranked player to beating the top pros and has continued to improve since then. Supervised deep learning systems can only be as good as their training datasets, but in self-play systems, the available data improves automatically as the agent gets better.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Marketplace Expanding to Europe - meta.com

Marketplace Expanding to Europe  meta.com

Meta Newsroom تكنولوجيا منذ 8 سنوات

Dota 2

We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitation learning or tree search. This is a step towards building AI systems which accomplish well-defined goals in messy, complicated situations involving real humans.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Dota 2

We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitation learning or tree search. This is a step towards building AI systems which accomplish well-defined goals in messy, complicated situations involving real humans.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Gathering human feedback

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Gathering human feedback

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

OpenAI Blog تكنولوجيا منذ 8 سنوات
AI
يا هلا! اسألني أي شي 🎤
FREE Free 1GB Internet + Free International Calls

$1 trial — eSIM in 190+ countries — No roaming charges

Download Free