Changeset 174


Ignore:
Timestamp:
Feb 15, 2010, 2:32:46 PM (14 years ago)
Author:
coach
Message:

Paul: Addition de deux references

Location:
anr
Files:
3 edited

Legend:

Unmodified
Added
Removed
  • anr/anr.bib

    r135 r174  
    620620 year = {2001},
    621621}
     622@INPROCEEDINGS{FP:96
     623        ,AUTHOR = "Paul Feautrier"
     624        ,TITLE = "Automatic Parallelization in the Polytope Model"
     625        ,BOOKTITLE = "The Data-Parallel Programming Model"
     626        ,YEAR = 1996   
     627        ,EDITOR = "Guy-Ren\'e Perrin and Alain Darte"
     628        ,PAGES = "79--103"
     629        ,VOLUME = "LNCS 1132"
     630        ,PUBLISHER = "Springer"
     631}
     632
     633@book{DRV:2000,
     634    author={Alain Darte and Yves Robert and Fr\'ed\'eric Vivien},
     635    title={Scheduling and automatic Parallelization},
     636    publisher={Birkh\"auser}, year=2000
     637}
  • anr/section-3.1.tex

    r134 r174  
    166166
    167167\subsubsection{Automatic Parallelization}
    168 % FIXME:LIP FIXME:PF FIXME:CA
    169 % Paul je ne suis pas sur que ce soit vraiment un etat de l'art
    170 % Christophe, ce que tu m'avais envoye se trouve dans obsolete/body.tex
    171 %\mustbecompleted{
    172 %Hardware is inherently parallel. On the other hand, high level languages,
    173 %like C or Fortran, are abstractions of the processors of the 1970s, and
    174 %hence are sequential. One of the aims of an HLS tool is therefore to
    175 %extract hidden parallelism from the source program, and to infer enough
    176 %hardware operators for its efficient exploitation.
    177 %\\
    178 %Present day HLS tools search for parallelism in linear pieces of code
    179 %acting only on scalars -- the so-called basic blocs. On the other hand,
    180 %it is well known that most programs, especially in the fields of signal
    181 %processing and image processing, spend most of their time executing loops
    182 %acting on arrays. Efficient use of the large amount of hardware available
    183 %in the next generation of FPGA chips necessitates parallelism far beyond
    184 %what can be extracted from basic blocs only.
    185 
    186 %The Compsys team of LIP has built an automatic parallelizer, Syntol, which
    187 %handle restricted C programs -- the well known polyhedral model --,
    188 %computes dependences and build a symbolic schedule. The schedule is
    189 %a specification for a parallel program. The parallelism itself can be
    190 %expressed in several ways: as a system of threads, or as data-parallel
    191 %operations, or as a pipeline. In the context of the COACH project, one
    192 %of the task will be to decide which form of parallelism is best suited
    193 %to hardware, and how to convey the results of Syntol to the actual
    194 %synthesis tools. One of the advantages of this approach is that the
    195 %resulting degree of parallelism can be easilly controlled, e.g. by
    196 %adjusting the number of threads, as a mean of exploring the
    197 %area / performance tradeoff of the resulting design.
    198 
    199 %Another point is that potentially parallel programs necessarily involve
    200 %arrays: two operations which write to the same location must be executed
    201 %in sequence. In synthesis, arrays translate to memory. However, in FPGAs,
    202 %the amount of on-chip memory is limited, and access to an external memory
    203 %has a high time penalty. Hence the importance of reducing the size of
    204 %temporary arrays to the minimum necessary to support the requested degree
    205 %of parallelism. Compsys has developped a stand-alone tool, Bee, based
    206 %on research by A. Darte, F. Baray and C. Alias, which can be extended
    207 %into a memory optimizer for COACH.
    208 %}
    209168
    210169The problem of compiling sequential programs for parallel computers
     
    214173the preservation of the program semantics. Most of these transformations
    215174just reorder the operations of the program; some of them modify its
    216 data structures. Dpendences (exact or conservative) are checked to guarantee
     175data structures. Dependences (exact or conservative) are checked to guarantee
    217176the legality of the transformation.
    218177
     
    221180which interact in a complicated way. More recently, it has been noticed
    222181that all of these are just changes of basis in the iteration domain of
    223 the program. This has lead to the invention of the polyhedral model, in
    224 which the combination of two transformation is simply a matrix product.
    225 
    226 As a side effect, it has been observed that the polytope model is a useful
     182the program. This has lead to the introduction of the polyhedral model,
     183\cite{FP:96,DRV:2000} in which the combination of two transformation is
     184simply a matrix product.
     185
     186Since hardware is inherently parallel, finding parallelism in sequential
     187programs in an important prerequisite for HLS. The large FPGA chips of
     188today can accomodate much more parallelism than is available in basic blocks.
     189The polyhedral model is the ideal tool for finding more parallelism in
     190loops.
     191
     192As a side effect, it has been observed that the polyhedral model is a useful
    227193tool for many other optimization, like memory reduction and locality
    228194improvement. Another point is
  • anr/section-7.tex

    r167 r174  
    4646  professor at ENS Lyon (Paul Feautrier) and a research associate
    4747  (CR2) at INRIA Rh\^one-Alpes (Christophe Alias).  The non-permanent
    48   personal required is a PhD student that will work on network process
     48  personel required is a PhD student that will work on network process
    4949  generation from polyhedral loops, then on extensions to
    5050  non-polyhedral loops.  We are looking for a student with both
     
    5353  a working implementation.
    5454  \parlf
    55   The table below summarizes the \hommemois in man-months by
     55  The table below summarizes the \hommemois by
    5656  deliverables and tasks for both permanent and non-permanent
    5757  personnels.  The non-permanent personnels costs represent 23\% of
Note: See TracChangeset for help on using the changeset viewer.