Faster physics in Python

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Hard Questions: Who Should Decide What Is Hate Speech in an Online Global Community? - meta.com

Hard Questions: Who Should Decide What Is Hate Speech in an Online Global Community? meta.com

Meta Newsroom تكنولوجيا منذ 8 سنوات

Introducing Hard Questions - meta.com

Introducing Hard Questions meta.com

Meta Newsroom تكنولوجيا منذ 8 سنوات

Learning from human preferences

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind’s safety team, we’ve developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Learning from human preferences

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind’s safety team, we’ve developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better.

OpenAI Blog تكنولوجيا منذ 8 سنوات

Learning to cooperate, compete, and communicate

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum—the difficulty of the environment is determined by the skill of your competitors (and if you’re competing against clones of yourself, the environment exactly matches your skill level). Second, a multiagent environment has no stable equilibrium: no matter how smart an agent is, there’s always pressure to get sma...

OpenAI Blog تكنولوجيا منذ 8 سنوات

Learning to cooperate, compete, and communicate

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum—the difficulty of the environment is determined by the skill of your competitors (and if you’re competing against clones of yourself, the environment exactly matches your skill level). Second, a multiagent environment has no stable equilibrium: no matter how smart an agent is, there’s always pressure to get sma...

OpenAI Blog تكنولوجيا منذ 8 سنوات

Making Facebook Live More Accessible With Closed Captions - meta.com

Making Facebook Live More Accessible With Closed Captions meta.com

Meta Newsroom تكنولوجيا منذ 8 سنوات

UCB exploration via Q-ensembles

OpenAI Blog تكنولوجيا منذ 8 سنوات

UCB exploration via Q-ensembles

OpenAI Blog تكنولوجيا منذ 8 سنوات

OpenAI Baselines: DQN

We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over upcoming months; today’s release includes DQN and three of its variants.

OpenAI Blog تكنولوجيا منذ 9 سنوات

OpenAI Baselines: DQN

We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over upcoming months; today’s release includes DQN and three of its variants.

OpenAI Blog تكنولوجيا منذ 9 سنوات

Excess Wear and Use Guide - Tesla

Excess Wear and Use Guide Tesla

Tesla News تكنولوجيا منذ 9 سنوات

Robots that learn

We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.

OpenAI Blog تكنولوجيا منذ 9 سنوات

Robots that learn

We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.

OpenAI Blog تكنولوجيا منذ 9 سنوات

Sign up for the Recap newsletter: our free sport highlights email

The best of our sports journalism from the past seven days and a heads-up on the weekend’s actionSubscribe to get our editors’ pick of the Guardian’s award-winning sport coverage. We’ll email you the stand-out features and interviews, insightful analysis and highlights from the archive, plus films,...

The Guardian Sport تكنولوجيا منذ 9 سنوات

Roboschool

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

OpenAI Blog تكنولوجيا منذ 9 سنوات

Roboschool

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

OpenAI Blog تكنولوجيا منذ 9 سنوات

Equivalence between policy gradients and soft Q-learning

OpenAI Blog تكنولوجيا منذ 9 سنوات

Equivalence between policy gradients and soft Q-learning

OpenAI Blog تكنولوجيا منذ 9 سنوات

نتائج البحث

Faster physics in Python

Hard Questions: Who Should Decide What Is Hate Speech in an Online Global Community? - meta.com

Introducing Hard Questions - meta.com

Learning from human preferences

Learning from human preferences

Learning to cooperate, compete, and communicate

Learning to cooperate, compete, and communicate

Making Facebook Live More Accessible With Closed Captions - meta.com

UCB exploration via Q-ensembles

UCB exploration via Q-ensembles

OpenAI Baselines: DQN

OpenAI Baselines: DQN

Excess Wear and Use Guide - Tesla

Robots that learn

Robots that learn

Sign up for the Recap newsletter: our free sport highlights email

Roboschool

Roboschool

Equivalence between policy gradients and soft Q-learning

Equivalence between policy gradients and soft Q-learning