Step 2: summarize strata information

If your data is stratified, you will need to prepare identifiers for each strata level.

Important

If your data is not stratified, you can skip this step and move on to Step 3.

Step 2 template: 📥 strata.csv

Deliverable: strata.csv

strata.csv is a table that contains every possible strata_id for each variable. It stores strata information in 'long' format, meaning that a stratified variable should have multiple rows. It should contain the following columns:

  • var_name
  • strata_1_name: name of the first stratum. Should have no spaces and no underscores ('_'); all text should be in Pascal case.
  • strata_1_raw: raw value of the first stratum. Should have no spaces and no underscores ('_'); all text should be in Pascal case.
  • strata_1_value: recoded value of the first stratum (for interpretability). Should have no spaces and no underscores ('_'); all text should be in Pascal case.
  • strata_2_name: name of the second stratum. Should have no spaces and no underscores ('_'); all text should be in Pascal case.
  • strata_2_raw: raw value of the second stratum. Should have no spaces and no underscores ('_'); all text should be in Pascal case.
  • strata_2_value: recoded value of the second stratum (for interpretability). Should have no spaces and no underscores ('_'); all text should be in Pascal case.
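As a rough illustration of the 'long' layout and the naming rule, the sketch below builds a small strata.csv in memory and checks the strata columns. It is not part of the official template. The variable `deaths`, the strata `Sex` and `AgeGroup`, and all of their values are hypothetical. The column order, the one-row-per-combination layout, the regular expression used for "Pascal case", and the choice to skip blank cells are all assumptions made for this example.

```python
import csv
import io
import re

# Column names are taken from the list above; their order here is an assumption.
STRATA_COLUMNS = [
    "strata_1_name", "strata_1_raw", "strata_1_value",
    "strata_2_name", "strata_2_raw", "strata_2_value",
]
COLUMNS = ["var_name"] + STRATA_COLUMNS

# Assumed reading of "Pascal case": starts with an uppercase letter and
# contains only letters and digits (so no spaces and no underscores).
PASCAL_CASE = re.compile(r"[A-Z][A-Za-z0-9]*")

# Hypothetical rows: one variable stratified by sex and age group.
# Assumption: one row per combination of strata levels.
rows = [
    ["deaths", "Sex", "M", "Male", "AgeGroup", "A1", "Under18"],
    ["deaths", "Sex", "M", "Male", "AgeGroup", "A2", "Over18"],
    ["deaths", "Sex", "F", "Female", "AgeGroup", "A1", "Under18"],
    ["deaths", "Sex", "F", "Female", "AgeGroup", "A2", "Over18"],
]


def to_csv_text(data_rows):
    """Serialize the header and rows as CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(COLUMNS)
    writer.writerows(data_rows)
    return buffer.getvalue()


def find_problems(records):
    """Return (row_number, column, value) for each cell that breaks the rule."""
    problems = []
    for row_number, record in enumerate(records, start=1):
        for column in STRATA_COLUMNS:
            value = record[column]
            # Blank cells are skipped; the guide does not say how to treat them.
            if value and not PASCAL_CASE.fullmatch(value):
                problems.append((row_number, column, value))
    return problems


csv_text = to_csv_text(rows)
records = list(csv.DictReader(io.StringIO(csv_text)))
problems = find_problems(records)

print(csv_text)
print("Problems found:", problems)
```

To check a real file, the same `find_problems` function could be given `csv.DictReader(open("strata.csv", newline=""))` in place of the in-memory records.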