تكنولوجيا
11581 مقال
Model S - Tesla
Model S Tesla
Electric Cars, Solar & Clean Energy | Tesla Italy - Tesla
Electric Cars, Solar & Clean Energy | Tesla Italy Tesla
Electric Cars, Solar & Clean Energy | Tesla New Zealand - Tesla
Electric Cars, Solar & Clean Energy | Tesla New Zealand Tesla
Electric Cars, Solar & Clean Energy | Tesla Ireland - Tesla
Electric Cars, Solar & Clean Energy | Tesla Ireland Tesla
Building a home with heart - About Amazon
Building a home with heart About Amazon
OVERSIGHT BOARD TRUST - meta.com
OVERSIGHT BOARD TRUST meta.com
Solving Rubik’s Cube with a robot hand
We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulation, using the same reinforcement learning code as OpenAI Five paired with a new technique called Automatic Domain Randomization (ADR). The system can handle situations it never saw during training, such as being prodded by a stuffed giraffe. This shows that reinforcement learning isn’t just a tool for virtual tasks, but can solve physical-world probl...
Solving Rubik’s Cube with a robot hand
We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulation, using the same reinforcement learning code as OpenAI Five paired with a new technique called Automatic Domain Randomization (ADR). The system can handle situations it never saw during training, such as being prodded by a stuffed giraffe. This shows that reinforcement learning isn’t just a tool for virtual tasks, but can solve physical-world probl...
Removing Coordinated Inauthentic Behavior in UAE, Nigeria, Indonesia and Egypt - meta.com
Removing Coordinated Inauthentic Behavior in UAE, Nigeria, Indonesia and Egypt meta.com
Range Tips - Tesla
Range Tips Tesla
Facebook, Elections and Political Speech - meta.com
Facebook, Elections and Political Speech meta.com
Charging - Tesla
Charging Tesla
An Update on Our App Developer Investigation - meta.com
An Update on Our App Developer Investigation meta.com
Home - Amazon Sustainability - Amazon Sustainability
Home - Amazon Sustainability Amazon Sustainability
Roadster – Electric Sports Car | Tesla Hong Kong - Tesla
Roadster – Electric Sports Car | Tesla Hong Kong Tesla
People Raise Over $2 Billion for Causes on Facebook - meta.com
People Raise Over $2 Billion for Causes on Facebook meta.com
Fine-tuning GPT-2 from human preferences
We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences did not always match our own. Specifically, for summarization tasks the labelers preferred sentences copied wholesale from the input (we’d only asked them to ensure accuracy), so our models learned to copy. Summarization required 60k human labels; simpler tasks which continue text in various styles required...
Fine-tuning GPT-2 from human preferences
We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences did not always match our own. Specifically, for summarization tasks the labelers preferred sentences copied wholesale from the input (we’d only asked them to ensure accuracy), so our models learned to copy. Summarization required 60k human labels; simpler tasks which continue text in various styles required...
Establishing Structure and Governance for an Independent Oversight Board - meta.com
Establishing Structure and Governance for an Independent Oversight Board meta.com
Emergent tool use from multi-agent interaction
We’ve observed agents discovering progressively more complex tool use while playing a simple game of hide-and-seek. Through training in our new simulated hide-and-seek environment, agents build a series of six distinct strategies and counterstrategies, some of which we did not know our environment supported. The self-supervised emergent complexity in this simple environment further suggests that multi-agent co-adaptation may one day produce extremely complex and intelligent behavior.