Designing novel hardware architecture for ML is hard, especially for achieving high energy-efficiency and performance. However, the importance of a robust system software stack is often neglected and poses a bigger challenge. In this talk we will present how at EdgeCortix we have adopted and effectively extended TVM to bring to life the MERA software stack that enables end to end neural network acceleration on our proprietary processor architecture, across ASICs and FPGAs.