Context Navigation

Changes between Version 5 and Version 6 of user_applications

Timestamp:: Oct 13, 2015, 1:04:41 PM (10 years ago)
Author:: alain
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

user_applications

-                      v5
+                      v6
 [[PageOutline]]
+The following applications use the GIET_VM [wiki:library_stdio system calls] and [wiki:user_libraries user libraries]. The multi-threaded applications have been designed to analyse the TSAR manycore architecture scalability.
+The following applications use the GIET_VM [wiki:library_stdio system calls] and [wiki:user_libraries user libraries].
+The multi-threaded applications use the POSIX threads API, and have been specifically designed to analyse the TSAR manycore architecture scalability.
 == __ Display__ ==
+This mono-processor application read a stream of images (128 lines * 128 pixels / 1 byte per pixel), from a file on a FAT32 disk controller, and display it interactively on the frame buffer. The ''images.raw'' file available in the application directory contains 20 images. This application can be used to test peripherals such as block devices, frame buffer, and dynamic allocation of  TTY terminals.
+This single thread application illustrates the use of the CMA (chained Buffer DMA) peripheral to display a stream of images.
+The application read a stream of images from the ''/misc/images_128.ram'' file,
+stored on a FAT32 disk controller. It displays the stream of images on the FBF (graphical display) peripheral. The ''images_128.raw''  contains 20 images : 128 lines * 128 pixels / 1 byte per pixel.
 It requires one private TTY terminal.
+The source code can be found [source:soft/giet_vm/applications/display/main.c here].
+The source code can be found [source:soft/giet_vm/applications/display/display.c here], and the mapping directives are defined  [source:soft/giet_vm/applications/display/display.py here].
+== __Coproc__ ==
+This single thread application illustrates the use of multi-channels  hardware accelerators by an user application.
+The hardware coprocessor must be connected to the system by a ''vci_mwmr_dma'' component.
+In this application, the coprocessor makes the Greater Common Divider computation between two vectors
+of randomly generated 32 bits integers. The vector size is a parameter.
+It requires one private TTY terminal.
+The source code can be found [source:soft/giet_vm/applications/display/coproc.c here], and the mapping directives are defined  [source:soft/giet_vm/applications/display/coproc.py here].
 == __Transpose__ ==
 This multi-threaded application read a stream of images (128 lines * 128 pixels), transpose it (X <-> Y), and display it on the frame buffer.
 It can run on a multi-processors, multi-clusters architecture, with one thread per processor.
 The input and output buffers containing the image are distributed in all clusters.
+This multi-threaded application is typical of parallelism that can be exploited in low-level image processing.
+It can run on a multi-processors, multi-clusters architecture, with one thread per processor core.
+The total number of threads depends on the hardware architecture, and is computed as ( x_size * y_size * nprocs ) . The main() function is executed by the thread running on P[0,0,0]. All others threads are executing the execute() function. Each execute() function is handling (image_size / nthreads) lines.
+The number of clusters  must be a power of 2 no larger than 32.
+This application ask the user to enter the name of a file containing an image stored on the FAT32 disk, check that the selected image fit the frame buffer size, transpose the image (X <-> Y), display the result on the graphical display, and save the transposed image to the FAT32 disk.
+The main() function is executed by the thread running on P[0,0,0], all others threads are executing the execute() function. Each execute() function is handling (image_size / nthreads) lines.  The input and output buffers containing the source and transposed images are allocated from the user heap distributed in all clusters. Therefore, the data read are mostly local, but the data write are mostly remote.
+The number of clusters  must be a power of 2 no larger than 256.
 The number of processors per cluster must be a power of 2 no larger than 4.
 It requires one private TTY terminal.
+It requires one TTY terminal, shared by all threads.
+For each image the application makes a self test (checksum for each line). The actual display on the frame buffer depends on frame buffer availability.
+The source code can be found [source:soft/giet_vm/applications/transpose/main.c here].
+The source code can be found [source:soft/giet_vm/applications/transpose/transpose.c here], and the mapping is defined  [source:soft/giet_vm/applications/transpose/transpose.py here].
 == __Convol__ ==
+The source code can be found [source:soft/giet_vm/applications/convol/main.c here].
+This multi-threaded application is a typical medical image processing application.
+It implements a 2D convolution product, that can run on a multi-processors, multi-clusters architecture, with one thread
+per processor. The image  size is 1024 * 1024 pixels, 2 bytes per pixel. It has been provided by Philips, and is stored
+on the FAT32 disk in ''/misc/philips_image_1024.raw''.
+The convolution kernel is [201]*[35] pixels, but it can be factored in two independant line and column convolution products,
+requiring an intermediate image transposition.
+The five buffers containing the image are distributed in all clusters.
+The main() function can be executed on any processor P[x,y,p].
+It makes the initialisations, launch (N-1) threads to run the execute() function on the (N-1) other processors than P[x,y,p], call himself the execute() function, and finally call the instrument() function to display instrumentation results
+when the parallel execution is completed.
+The number of clusters containing processors must be power of 2 no larger than 256.
+The number of processors per cluster must be power of 2 no larger than 8.
+It requires one TTY terminal, shared by all threads.
+The source code can be found [source:soft/giet_vm/applications/convol/convol.c here], and the mapping is defined  [source:soft/giet_vm/applications/transpose/transpose.py here].
 == __Classif__ ==
 …
 Instrumentation results display is done by the "store" task in cluster[0][0] when all "store" tasks completed the number of clusters specified by the CONTAINERS_MAX parameter.
 The source code can be found [source:soft/giet_vm/applications/classif/main.c here].
+The source code can be found [source:soft/giet_vm/applications/classif/classif.c here], and the mapping is defined [source:soft/giet_vm/applications/classif/classif.py here].