Introduction into recent processor development
Introduction into recent processor development and keywords therein.
That includes superscalarity, vector units, threads, caches and NUMA on
a single CPU.
Examples for recent HPC-processors
Contents:
- We will start with the classical von-Neumann
Computer
- Concepts in one core
- FLOPS
- Superscalarity (multiply functional units)
- Pipelining
- Out-of-order execution
- Speculative execution
- SSE-units
(AVX)
- Memory
- again latency and bandwidth
- DRAM
- SRAM
- Caches [Rauber §2.7]
- NUMA
- CPU speed vs. memory speed
- Moore's law
- clock rate vs. energy consumption
- multiple cores
- ccNUMA
- BBC report on the
transputer in 1986
Demonstration:
- Scalar product: vectorization, loop unrolling
- matrix-vektor product: rowise vs. columnwise access, tiling
- profiling