Category
Research
-
Improving GANs using optimal transport
-
On first-order meta-learning algorithms
-
Reptile: A scalable meta-learning algorithm
-
Some considerations on learning to explore via meta-reinforcement learning
-
Multi-Goal Reinforcement Learning: Challenging robotics environments and request for research
-
Ingredients for robotics research
-
Interpretable machine learning through teaching
-
Discovering types for entity disambiguation
-
Requests for Research 2.0
-
Scaling Kubernetes to 2,500 nodes
-
Block-sparse GPU kernels
-
Learning sparse neural networks through L₀ regularization
-
Interpretable and pedagogical examples
-
Learning a hierarchy
-
Generalizing from simulation
-
Sim-to-real transfer of robotic control with dynamics randomization
-
Asymmetric actor critic for image-based robot learning
-
Domain randomization and generative models for robotic grasping
-
Meta-learning for wrestling
-
Competitive self-play
-
Nonlinear computation in deep linear networks
-
Learning to model other minds
-
Learning with opponent-learning awareness
-
OpenAI Baselines: ACKTR & A2C
-
More on Dota 2
-
Dota 2
-
Gathering human feedback
-
Better exploration with parameter noise
-
Proximal Policy Optimization
-
Robust adversarial inputs
-
Hindsight Experience Replay
-
Teacher–student curriculum learning
-
Faster physics in Python
-
Learning to cooperate, compete, and communicate
-
UCB exploration via Q-ensembles
-
OpenAI Baselines: DQN
-
Robots that learn
-
Roboschool
-
Equivalence between policy gradients and soft Q-learning
-
Stochastic Neural Networks for hierarchical reinforcement learning
-
Unsupervised sentiment neuron
-
Spam detection in the physical world
-
Evolution strategies as a scalable alternative to reinforcement learning
-
One-shot imitation learning
-
Learning to communicate
-
Emergence of grounded compositional language in multi-agent populations
-
Prediction and control with temporal segment models
-
Third-person imitation learning
-
PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications
-
Universe
-
#Exploration: A study of count-based exploration for deep reinforcement learning
-
On the quantitative analysis of decoder-based generative models
-
A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models
-
RL²: Fast reinforcement learning via slow reinforcement learning
-
Variational lossy autoencoder
-
Extensions and limitations of the neural GPU
-
Transfer from simulation to real world through learning deep inverse dynamics model
-
Infrastructure for deep learning
-
Generative models
-
OpenAI Gym Beta
-
Weight normalization: A simple reparameterization to accelerate training of deep neural networks