Codebook
Definition: A codebook is a high-level summary that describes the contents, structure, nature and layout of a data set. A well-documented codebook contains information intended to be complete and self-explanatory for each variable in a data file, such as the wording and coding of the item, and the underlying construct. It provides transparency to researchers who may be unfamiliar with the data but wish to reproduce analyses or reuse the data.
Related terms: Data dictionary, Metadata
References:
- Arslan, R. C. (2019). How to Automatically Document Data With the codebook Package to Facilitate Data Reuse. Advances in Methods and Practices in Psychological Science, 2(2), 169–187. https://doi.org/10.1177/2515245919838783
- Inter-university Consortium for Political and Social Research. (2025). What is a Codebook? https://www.icpsr.umich.edu/web/ICPSR/cms/1983
Originally drafted by: Tina Lonsdorf
Reviewed by: Ali H. Al-Hoorie, Ashley Blake, Kai Krautter, Charlotte R. Pennington, Flávio Azevedo