Changeset 30
- Timestamp: Jan 12, 2010, 3:27:48 PM
- Location: anr
- Files: 3 edited
anr/section-2.2.tex
r25 → r30:

   compilers\cite{FIXME:IRISA} since 2002).
   %%% EXPERTISE DANS DES DOMAINES: FIXME:LIP
 - \mustbecompleted{For polyedric transformations and memory optimization, SYNTOL, BEE, ... LIP (CA ou PF)}
 + %%%\mustbecompleted{For polyedric transformations and memory optimization, SYNTOL, BEE, ... LIP (CA ou PF)}
 +
 + Compsys was founded in 2002 by several senior researchers with experience in
 + high-performance computing and automatic parallelization. They have been
 + among the initiators of the polyhedral model, a theory that serves to
 + unify many parallelism detection and exploitation techniques for regular
 + programs. The techniques developed by Compsys for parallelism detection,
 + scheduling, process construction and memory management are expected to be
 + very useful as a first step for a high-level synthesis tool.
 +
   \par
   %%% DESCRIPTION DES PROJETS ANR UTILISES: SOCLIB OK
anr/section-2.tex
r25 → r30:

   HLS tools of the framework generate them automatically. At this stage the
   framework provides various HLS tools allowing the micro-architectural space
 - design exploration. The exploration criteria are alsothroughput, latency
 + design exploration. The exploration criteria are also throughput, latency
   and power consumption.
 - %FIXME:CA
 - %FIXME:CA At this stage, preliminary source-level transformations will be
 - %FIXME:CA required to improve the efficiency of the target component.
 - %FIXME:CA COACH will also provide such facilities, such as automatic parallelization
 - %FIXME:CA and memory optimisation.
 + At this stage, preliminary source-level transformations will be
 + required to improve the efficiency of the target component.
 + For instance, one may transform a loop nest to expose parallelism,
 + or shrink an array to promote it to a register or reduce the memory footprint.
 +
   \item
   Performance measurement: For each point of design space exploration,
anr/section-3.1.tex
r12 → r30:

   % Paul je ne suis pas sur que ce soit vraiment un etat de l'art
   % Christophe, ce que tu m'avais envoye se trouve dans obsolete/body.tex
 - \mustbecompleted{
 - Hardware is inherently parallel. On the other hand, high level languages,
 - like C or Fortran, are abstractions of the processors of the 1970s, and
 - hence are sequential. One of the aims of an HLS tool is therefore to
 - extract hidden parallelism from the source program, and to infer enough
 - hardaware operators for its efficient exploitation.
 - \\
 - Present day HLS tools search for parallelism in linear pieces of code
 - acting only on scalars -- the so-called basic blocs. On the other hand,
 - it is well known that most programs, especially in the fields of signal
 - processing and image processing, spend most of their time executing loops
 - acting on arrays. Efficient use of the large amount of hardware available
 - in the next generation of FPGA chips necessitates parallelism far beyond
 - what can be extracted from basic blocs only.
 - \\
 - The Compsys team of LIP has built an automatic parallelizer, Syntol, which
 - handle restricted C programs -- the well known polyhedral model --,
 - computes dependences and build a symbolic schedule. The schedule is
 - a specification for a parallel program. The parallelism itself can be
 - expressed in several ways: as a system of threads, or as data-parallel
 - operations, or as a pipeline. In the context of the COACH project, one
 - of the task will be to decide which form of parallelism is best suited
 - to hardware, and how to convey the results of Syntol to the actual
 - synthesis tools. One of the advantages of this approach is that the
 - resulting degree of parallelism can be easilly controlled, e.g. by
 - adjusting the number of threads, as a mean of exploring the
 - area / performance tradeoff of the resulting design.
 - \\
 - Another point is that potentially parallel programs necessarily involve
 - arrays: two operations which write to the same location must be executed
 - in sequence. In synthesis, arrays translate to memory. However, in FPGAs,
 - the amount of on-chip memory is limited, and access to an external memory
 - has a high time penalty. Hence the importance of reducing the size of
 - temporary arrays to the minimum necessary to support the requested degree
 - of parallelism. Compsys has developped a stand-alone tool, Bee, based
 - on research by A. Darte, F. Baray and C. Alias, which can be extended
 - into a memory optimizer for COACH.
 - }
 + %\mustbecompleted{
 + %Hardware is inherently parallel. On the other hand, high level languages,
 + %like C or Fortran, are abstractions of the processors of the 1970s, and
 + %hence are sequential. One of the aims of an HLS tool is therefore to
 + %extract hidden parallelism from the source program, and to infer enough
 + %hardware operators for its efficient exploitation.
 + %\\
 + %Present day HLS tools search for parallelism in linear pieces of code
 + %acting only on scalars -- the so-called basic blocs. On the other hand,
 + %it is well known that most programs, especially in the fields of signal
 + %processing and image processing, spend most of their time executing loops
 + %acting on arrays. Efficient use of the large amount of hardware available
 + %in the next generation of FPGA chips necessitates parallelism far beyond
 + %what can be extracted from basic blocs only.
 + \\
 + %The Compsys team of LIP has built an automatic parallelizer, Syntol, which
 + %handle restricted C programs -- the well known polyhedral model --,
 + %computes dependences and build a symbolic schedule. The schedule is
 + %a specification for a parallel program. The parallelism itself can be
 + %expressed in several ways: as a system of threads, or as data-parallel
 + %operations, or as a pipeline. In the context of the COACH project, one
 + %of the task will be to decide which form of parallelism is best suited
 + %to hardware, and how to convey the results of Syntol to the actual
 + %synthesis tools. One of the advantages of this approach is that the
 + %resulting degree of parallelism can be easilly controlled, e.g. by
 + %adjusting the number of threads, as a mean of exploring the
 + %area / performance tradeoff of the resulting design.
 + \\
 + %Another point is that potentially parallel programs necessarily involve
 + %arrays: two operations which write to the same location must be executed
 + %in sequence. In synthesis, arrays translate to memory. However, in FPGAs,
 + %the amount of on-chip memory is limited, and access to an external memory
 + %has a high time penalty. Hence the importance of reducing the size of
 + %temporary arrays to the minimum necessary to support the requested degree
 + %of parallelism. Compsys has developped a stand-alone tool, Bee, based
 + %on research by A. Darte, F. Baray and C. Alias, which can be extended
 + %into a memory optimizer for COACH.
 + %}
 +
 + The problem of compiling sequential programs for parallel computers
 + has been studied since the advent of the first parallel architectures
 + in the 1970s. The basic approach consists in applying program transformations
 + which exhibit or increase the potential parallelism, while guaranteeing
 + the preservation of the program semantics. Most of these transformations
 + just reorder the operations of the program; some of them modify its
 + data structures. Dependences (exact or conservative) are checked to
 + guarantee the legality of the transformation.
 +
 + This has led to the invention of many loop transformations (loop fusion,
 + loop splitting, loop skewing, loop interchange, loop unrolling, ...)
 + which interact in a complicated way. More recently, it has been noticed
 + that all of these are just changes of basis in the iteration domain of
 + the program. This has led to the invention of the polyhedral model, in
 + which the combination of two transformations is simply a matrix product.
 +
 + As a side effect, it has been observed that the polytope model is a useful
 + tool for many other optimizations, like memory reduction and locality
 + improvement. Another point is that the polyhedral model \emph{stricto sensu}
 + applies only to very regular programs. Its extension to more general
 + programs is an active research subject.

   \subsubsection{Interfaces}