The work of writing a TensorRT plugin requires solid CUDA programming skills and considerable time to make the plugin work properly and optimized. In this section, we will demonstrate an end-to-end tool to generate a user specific op to a TRT plugin, enabled by Relay and AutoScheduler of TVM. This tool can help AI developers to get rid of the tedious hand-written TRT plugin, and convert their models to TRT automatically.