[25] | 1 | The emerging complex and integrated heterogeneous embedded system platforms require |
---|
[32] | 2 | adequate design methods to efficiently model, explore, analyze and design the ever complex software |
---|
| 3 | and hardware architectures. Future Embedded Systems suppliers, in order to meet rapidly increasing |
---|
| 4 | performance requirements and a pressure to lower development cost and shorten time-to-market, |
---|
| 5 | will have to adopt new design methods and flows in order to keep pace with the increasing |
---|
[25] | 6 | complexity of design problems. Such methods, addressing these challenges starting from high levels of |
---|
[32] | 7 | abstraction, will have to perform large solution space explorations both for software and (possibly |
---|
[38] | 8 | reconfigurable) hardware, reducing the design effort and offering a high predictability of results |
---|
| 9 | with respect to cost and performance objectives. |
---|
[25] | 10 | Current design methodologies provide quite low-level abstraction capabilities. However in a few years |
---|
[32] | 11 | from now tens of programmable processors will be embedded in an IC with more than 100M |
---|
[38] | 12 | transistors, therefore adding to the complexity of the problem of designing such systems. |
---|
| 13 | Taking into account that the complexity of the software part is increasing at an even |
---|
| 14 | faster rate, current solutions for design space exploration, mainly manually based, by no |
---|
| 15 | means do supply an adequate efficiency. |
---|
[25] | 16 | Consequently, there is an urgent need to leverage system level |
---|
| 17 | exploration through the use of a high level specification of the application and an early design |
---|
[32] | 18 | space exploration step. The first system oriented approaches are appearing, among which those |
---|
| 19 | based on C/C++ and SystemC are the most popular. Such approaches can take place before and/or after |
---|
| 20 | the co-design or architecture refinement steps and target the design space pruning in order to fully |
---|
[25] | 21 | exploit potential solutions that meet design and application constraints (power, latency, |
---|
| 22 | throughput) within the design and market timeframe. |
---|
[17] | 23 | \\ |
---|
[25] | 24 | Thus, new system-level design flows need to be developed, enabling the exploration of an application |
---|
[38] | 25 | independently of the implementation, almost at the beginning of the design process. |
---|
| 26 | A fundamental element of this evolution is the definition of abstraction layers that should allow the |
---|
| 27 | performance driven re-use of software and hardware components at the system level. |
---|
| 28 | In this context, COACH will combine modeling and estimation methods and compilers and |
---|
| 29 | design space exploration techniques. This approach will be a radical innovation in |
---|
| 30 | embedded system design methodology. |
---|
[25] | 31 | \\ |
---|
[38] | 32 | The reason is that the COACH framework is applied before high-level design tools in the embedded |
---|
| 33 | systems design flow. In that way, it will make possible a real and efficiently combined |
---|
| 34 | exploitation of high-level synthesis tools, parallelizing approaches and compilers, already |
---|
[25] | 35 | available on the market. These tools and approaches are not yet massively adopted, precisely |
---|
[38] | 36 | because this preliminary design step is missing. COACH will indeed permit (i) to predict and |
---|
[25] | 37 | control implementation optimizations, (ii) to target multiple implementation technologies |
---|
| 38 | (and thus the associated tools) from a unique specification and (iii) to efficiently integrate high |
---|
| 39 | and low-level design tools in a unique seamless design flow. |
---|
| 40 | \\ |
---|
| 41 | The performance estimation methods combined with the design space exploration techniques will |
---|
| 42 | finally allow the design process to start from system level specification and automatically explore the |
---|
| 43 | potential architectures in order to find out the optimal implementation in a shorter design time and at |
---|
| 44 | a lower global cost. |
---|
[17] | 45 | \par |
---|
[38] | 46 | To get an efficient embedded system, the system designer has to take into account |
---|
| 47 | application characteristics when it chooses one of the available technologies. |
---|
| 48 | This choice is not easy and in most cases the designer has to try different |
---|
[17] | 49 | technologies to retain the most adapted one. |
---|
| 50 | \\ |
---|
| 51 | The first objective of COACH is to provide an open-source framework to |
---|
[38] | 52 | design embedded system on FPGA devices. |
---|
| 53 | The COACH framework allows the designer to explore various software/hardware |
---|
[17] | 54 | partitions of the target application, to run timing and functional |
---|
| 55 | simulations and to generate automatically both the software and the |
---|
| 56 | synthesizable description of the hardware. |
---|
| 57 | The main topics of the project are: |
---|
| 58 | \begin{itemize} |
---|
| 59 | \item |
---|
[46] | 60 | \textbf{Design space exploration}: It consists in analysing the application runnig |
---|
[17] | 61 | on FPGA, defining the target technology (SoC, MPSoC, ASIP, ...) and |
---|
| 62 | hardware/software partitioning of tasks depending on technology choice. |
---|
| 63 | This exploration is driven basically by throughput, latency and power |
---|
| 64 | consumption criteria. |
---|
| 65 | \item |
---|
[46] | 66 | \textbf{Micro-architectural exploration}: When hardware components are required, the |
---|
[17] | 67 | HLS tools of the framework generate them automatically. At this stage the |
---|
[38] | 68 | framework provides various HLS tools that allow the micro-architectural space |
---|
[30] | 69 | design exploration. The exploration criteria also are throughput, latency |
---|
[17] | 70 | and power consumption. |
---|
[30] | 71 | At this stage, preliminary source-level transformations will be |
---|
| 72 | required to improve the efficiency of the target component. |
---|
| 73 | For instance, one may transform a loop nest to expose parallelism, |
---|
| 74 | or shrink an array to promote it to a register or reduce a memory footprint. |
---|
| 75 | |
---|
[17] | 76 | \item |
---|
[46] | 77 | \textbf{Performance measurement}: For each point in the design space, |
---|
[38] | 78 | figures of merit are available such as throughput, latency, power |
---|
[17] | 79 | consumption, area, memory allocation and data locality. They are evaluated |
---|
[38] | 80 | using virtual prototyping, estimation or analyzing methodologies. |
---|
[17] | 81 | \item |
---|
[46] | 82 | \textbf{Targeted hardware technology}: The COACH description of a system is |
---|
[38] | 83 | independent of the FPGA family. Every point of the design |
---|
[17] | 84 | space can be implemented on any FPGA having the required resources. |
---|
| 85 | Basically, COACH handles both Altera and Xilinx FPGA families. |
---|
| 86 | \end{itemize} |
---|
| 87 | As an extension of embedded system design, COACH deals also with High |
---|
| 88 | Performance Computing (HPC). |
---|
[38] | 89 | In HPC, the kind of targeted application is an existing one running on a PC. |
---|
| 90 | The COACH framework helps designer to accelerate it by migrating critical parts into a |
---|
| 91 | SoC implemented on an FPGA plugged to the PC bus. |
---|
[17] | 92 | \par |
---|
[38] | 93 | COACH is the result of the will of several laboratories to unify their knowhow |
---|
| 94 | and skills in the following domains: Operating system and hardware |
---|
[46] | 95 | communication (\tima, \upmc), SoC and MPSoC (\upmc and \tima), ASIP (\irisa) and |
---|
[38] | 96 | HLS (\upmc, \ubs) and compilation (\irisa, \lip). |
---|
[17] | 97 | The project objective is to integrate these various domains into a unique |
---|
| 98 | free framework (licence ...) masking as much as possible these domains and |
---|
| 99 | its different tools to the user. |
---|
| 100 | |
---|