Introduction

KernelFalcon is a new deep learning architecture designed to generate autonomous GPU kernels. The architecture combines code generation with correctness verification, using a combination of optimization algorithms and automated tests.

Technical Details

  • Uses a deep learning model to generate code in TensorFlow
  • Verifies kernel correctness using automated tests and optimization search algorithms
  • Implements a control structure to manage failure and recovery of tests