The Shape of Code
Distribution of method chains in Java and Python
Finding links between gcc source code and the C Standard
Modeling the distribution of method sizes
Early research on economies of scale for computer systems
Data+code for book: The New C Standard
Distribution of integer literals in text/speech and source code
ISO C++ committee has a new chief sheep herder
Percentage of methods containing no reported faults
Halstead/McCabe: a complicated formula for LOC
Half-life of Open source research software projects
Positive and negative descriptions of numeric data
Predicted impact of LLM use on developer ecosystems
Impact of developer uncertainty on estimating probabilities
A process to find and extract data-points from graphs in pdf files