Block-sparse GPU kernels

We’re releasing highly optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run orders of magnitude faster than cuBLAS or cuSPARSE. We’ve used them to attain state-of-the-art results in text sentiment analysis and generative modeling of text and images.
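
To make "block-sparse weights" concrete, here is a minimal CPU sketch in NumPy. It is not the released kernels or their API. The names `block_sparse_matmul`, `to_dense`, `layout`, and `blocks` are hypothetical and chosen only for illustration. The sketch assumes the weight matrix is divided into square blocks, that a 0/1 layout matrix marks which blocks are stored, and that multiplication simply skips every block the layout marks as zero.

```python
import numpy as np


def block_sparse_matmul(x, blocks, layout, block_size):
    """Compute x @ W for a block-sparse W (illustrative reference only).

    layout: 0/1 array of shape (in_blocks, out_blocks); 1 marks a stored block.
    blocks: dict mapping (i, j) to a dense (block_size, block_size) array.
    """
    out_blocks = layout.shape[1]
    y = np.zeros((x.shape[0], out_blocks * block_size), dtype=x.dtype)
    # Visit only the stored blocks; blocks marked 0 cost no work at all.
    for i, j in np.argwhere(layout).tolist():
        rows = slice(i * block_size, (i + 1) * block_size)
        cols = slice(j * block_size, (j + 1) * block_size)
        y[:, cols] += x[:, rows] @ blocks[(i, j)]
    return y


def to_dense(blocks, layout, block_size):
    """Expand the block-sparse representation into an ordinary dense matrix."""
    in_blocks, out_blocks = layout.shape
    w = np.zeros((in_blocks * block_size, out_blocks * block_size))
    for (i, j), block in blocks.items():
        w[i * block_size:(i + 1) * block_size,
          j * block_size:(j + 1) * block_size] = block
    return w


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    block_size = 4
    # Hypothetical layout: 3 x 3 grid of blocks, 4 of the 9 blocks stored.
    layout = np.array([[1, 0, 0],
                       [0, 1, 1],
                       [0, 0, 1]])
    blocks = {
        (i, j): rng.standard_normal((block_size, block_size))
        for i, j in np.argwhere(layout).tolist()
    }
    x = rng.standard_normal((2, layout.shape[0] * block_size))
    y = block_sparse_matmul(x, blocks, layout, block_size)
    # The sparse product should agree with the dense product.
    print(np.allclose(y, x @ to_dense(blocks, layout, block_size)))
```

In this sketch the work scales with the number of stored blocks rather than with the full matrix size, which is the general reason a sparser layout can be cheaper to evaluate; a GPU kernel would organize the same computation very differently.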
