Next: 7.2.4 Parallelization using split
Up: 7.2 Euler equations
Previous: 7.2.2 Sequential solving
  Contents
7.2.3 Parallelization using distributed boxes
The distribution of complete boxes on the processors is related to
the non-overlapping node distribution from section 4.3.2.
The remaining crucial point in the parallelization of (7.17)
consists in the efficient calculation of the sum
in (7.18).
One can think of the unknowns
stored in the center of
volume elements.
The local contributions of the flux
in algorithm 7.8
are stored inconsistently on the edges (faces),
i.e.,
.
For the parallel implementation, an overlapping distribution of
and
with one element overlap is necessary.
Denoting by
the extended subdomain of
the
extended solution vector can be written as
,
analogously
.
Figure 7.3:
Distributed boxes
![\begin{figure}\begin{center}
\unitlength0.016\textwidth
\begin{picture}(25,28)...
...mega}} \makebox[0pt]{}_{\boldsymbol{j}}$}}
\end{picture}\end{center}\end{figure}](img1029.gif) |
Remarks :
- Algorithm 7.10 can be used for all 3 formulas
(7.19)-(7.21).
- By using (7.19) the values
are not needed
so that step 1.) can be omitted.
- The data exchange in step 3.) by
using (7.20) or (7.21)
is not necessary if one calculates the value
in
redundantly to
) in
.
Disadvantages:
- More arithmetic costs (nearly doubling).
- Performing the redundant calculations only in the
boundary elements means additional alternatives
in the code.
|
Next: 7.2.4 Parallelization using split
Up: 7.2 Euler equations
Previous: 7.2.2 Sequential solving
  Contents
Gundolf Haase
2000-03-20