News

Abstract: To achieve low latency for edge applications, a single-chip sparse accelerator is proposed that performs deep neural network (DNN) inference using only limited on-chip memory. Private ...