SPSS-like frequency tables for categorical variables

Prints an SPSS-style frequency table (Freq, Total %, Valid %, Cum. %) for each variable supplied. Designed for use with unquoted variable names, and also accepts a plain vector.

Usage

jfreq(
  data,
  ...,
  subset = NULL,
  variable.id = NULL,
  value.id = NULL,
  case.processing.detail = NULL
)

Arguments

data: A data frame, or a vector.
...: Unquoted variable name(s) within data (ignored if data is a vector).
subset: An optional unquoted logical expression (e.g. Group == 1) to subset cases for this call only. Applied after jcomplete and jsubset. Does not affect other function calls.
variable.id: Character or NULL. Variable label display mode: one of "both", "names", "labels", "legend", or "legend.bottom". "names" shows variable names only; "both" shows "name: label"; "labels" uses each variable's label as its table caption (best for short labels); "legend" prints a label legend under each variable's own table; "legend.bottom" prints one consolidated legend after all tables. NULL (default) defers to joutput()'s variable.id setting. Not a logical. (Replaces the former inline Type/label block.)
value.id: Character or NULL. Value-label display mode for the frequency-table valid rows: "both" ("code: label"), "values" (bare code), or "labels" (the label, degrading to the bare code where a code has none). "legend" and "legend.bottom" keep the bare code in the table and print a value-label legend after it ("legend" per-table, "legend.bottom" consolidated where multiple tables are produced). A no-op for variables with no value labels. NULL (default) defers to joutput()'s value.id setting. Not a logical.
case.processing.detail: Accepted for API symmetry. jfreq's Case Processing Summary is top-table only (no missing-data breakdown), so this argument has no effect; per-variable code detail already appears in each variable's frequency table.

Value

Invisibly returns a list of class jst_freq containing: frequencies (named list of data frames, one per variable) and sample_info (pipeline and missing data counts).

Details

Output is structured consistently with jdesc(): a single red "Frequencies" title is printed first, followed by the default-data note (if a juse() default was used), any pipeline messages, and the Case Processing Summary (when at least one pipeline stage was active for this call). Each variable then gets its own block consisting of the variable name on its own line, indented Type and Variable label lines (suppressed when joutput()'s variable.id toggle is off), a blank line, and the frequency table. The frequency table ends with a Total row showing the post-pipeline N.

For haven-labelled variables, value labels and numeric codes are combined in the frequency table rows (e.g. 1: Strongly Oppose). The type line reports haven_labelled (Categorical) and suppresses the uninformative vctrs_vctr class. Variable labels are shown for all variable types, not only haven-labelled ones.

Examples