Performance Engineering
Linux Perf: Measuring Specific Code Sections with Pause/Resume APIs
Navigating the Complexity of Large Codebases Using Vtune + xdot (or perf + gprof2dot)
C-Reduce: Systematically Tackling (Not Only) Compiler Bugs
pahole: Analysing Memory Layout of Complex Data Structures With Ease
core-to-core-latency: A Nice Little Tool!
LinkTest : Measuring Communication Latency and Bandwidth At Scale
Understanding CPU Architecture And Performance Using LIKWID
I/O Performance Analysis with Darshan
Intel’s One API : What We Know and How to Get Ready
First Screencast : Summary of Computing Laws!
Summary of Computing Laws : Amdahl, Dennard, Gustafson, Little, Moore and More…!
Blade : Cube’s OTF2 Trace Visualizer
Summary Of Python Profiling Tools – Part I
Python Profiling : Deterministic vs Statistical Profilers
Summary of Debugging Tools for Parallel Applications
Summary of Profiling Tools for Parallel Applications