Design And Optimization Research Of Compiler Backend For Large-scale Coarse Grained Reconfigurable Architecture

Posted on:2021-05-23

Degree:Master

Type:Thesis

Country:China

Candidate:P F Ye

Full Text:PDF

GTID:2518306503974389

Subject:IC Engineering

Abstract/Summary:

PDF Full Text Request

With the developing of semiconductor process,the cost of chip designing is higher and higher.And the demand of “Application defines chip” is more and more urgent.Reconfigurable chips can take both computing efficiency and software programmability into account,satisfying the demand by supporting both “Application defines software” and “Software defines chip”.Being responsible for automatically mapping applications onto multiple reconfigurable processing units,the compiler backend plays an important role in reconfigurable chips.Increasing processing elements is one of the inevitable developing trends of reconfigurable chips,but it also brings new requirements and challenges to the compiler backend designing.This paper focuses on a large-scale coarse grained reconfigurable architecture with 1024 processing elements.Due to its new architecture features including heterogeneous memory access unit design,more limited PE interconnection,and multi-level pipeline design,the existing compiler backend design is no longer applicable.This paper designs and implements a new compiler backend.Based on the LLVM compiler framework,the new backend tool can automatically extract the loop of computing intensive applications and construct the data flow graph.After preprocessing,scheduling and mapping of the data flow graph,the configure package needed to execute on the target reconfigurable chips is generated.In this paper,this backend tool,together with other tools used in the compilation process,is integrated into a complete compiler of the target system.The compiler can simply and directly generate the executable files of the three-layer instruction set architecture,improving the usability of the compiler.In order to make full use of the abundant parallel resources of the target architecture,this paper explores and implements two backend optimization strategies based on the problem of high cost of instruction switching and synchronization.A new backend optimization strategy based on DFG splitting is proposed for small-scale data flow graphs.The strategy considers both memory aware optimization and data repartition,greatly improving the final application acceleration ratio.For large-scale data flow graphs,an instruction similarity optimization algorithm based on simulated annealing is proposed,which is combined with the structure of segmented instruction switching.It can greatly reduce instruction storing overhead.This paper implements the automatic compiling of typical computing intensive applications onto the target architecture.The compiled executable files can run correctly as an input for RTL simulation environment,and the application acceleration ratio is 23.2 times of general purpose processors.For the two optimization strategies,by comparing the simulation data of performance and instruction similarity,the performance improvement of 129% and the instruction similarity improvement of 56.97% and the instruction storing overhead decreasing of 72.32% are obtained respectively on average.

Keywords/Search Tags:

CGRA, compiler backend, parallelism

PDF Full Text Request

Related items

1	Fast Backend Porting And Optimization For Niosâ…¡ Processor Based On LLVM
2	Backend Porting For CSKY Based On LLVM Compiler Infrastructure
3	Analysis Of LLVM Compiler Infrastructure And Backend Porting For Arm
4	Research And Implementation Of Compiler Back-end Development Based On Knowledge Map
5	Design And Implementation Of A Porting Assistant System For Retargetable Compiler
6	Generating, Optimizing, and Scheduling a Compiler Level Representation of Stream Parallelism
7	Study On The Automatic Recognition Of Subword-Parallelism In Multimedia Programs
8	Design Andl Implementation Of Script Language Backend Tool Based On LLVM
9	Research And Development Of Communication Specific Vector Processor Compiler Based On GCC
10	Design And Implementation Of Compiler Directives For Tasks Parallelism In Message Passing Computing