nudb_use.variables package¶
Subpackages¶
- nudb_use.variables.derive package
- nudb_use.variables.derive.all_data_helpers module
- nudb_use.variables.derive.derive_decorator module
- nudb_use.variables.derive.fullfoert module
- nudb_use.variables.derive.fullfoert_foerste module
- nudb_use.variables.derive.klass_correspondences_and_variants module
fa_erfagskole_nokut_nus()fa_studiepoeng_nus()uh_gradmerke_nus()uh_gruppering_nus()uh_studiepoeng_nus()utd_foreldet_kode_nus()utd_isced2011_attainment_nus()utd_isced2011_programmes_nus()utd_isced2013_fagfelt_nus()utd_klassetrinn_lav_hoey_nus()utd_samle_eller_enkeltutd_nus()utd_utdanningsprogram_nus()utd_varighet_antall_mnd_nus()vg_kompetanse_nus()vg_kurstrinn_nus()
- nudb_use.variables.derive.klass_labels module
fa_erfagskole_nokut_nus_label()fa_studiepoeng_nus_label()fuh_nett_eller_stedbasert_label()fuh_opptaksgrunnlag_label()fuh_utvekslingsland_label()gro_elevstatus_label()nus2000_label()pers_invkat_label()pers_kjoenn_label()uh_campus_kommune_label()uh_foerste_nus2000_label()uh_gradmerke_nus_label()uh_gruppering_nus_label()uh_institusjon_id_label()uh_statsborgerskap_inn_label()uh_studgrunnlagsland_label()uh_studiepoeng_nus_label()uh_univ_eller_hoegskole_label()utd_aktivitetsnivaa_heltid_deltid_label()utd_datakilde_label()utd_foreldet_kode_nus_label()utd_hoeyeste_nus2000_label()utd_isced2011_attainment_nus_label()utd_isced2011_programmes_nus_label()utd_isced2013_fagfelt_nus_label()utd_klassetrinn_lav_hoey_nus_label()utd_samle_eller_enkeltutd_nus_label()utd_skolekom_label()utd_studieland_label()utd_utdanningsprogram_nus_label()utd_utdanningstype_label()utd_utveksling_label()utd_varighet_antall_mnd_nus_label()utd_viderutd_nettbasert_label()vg_kompetanse_nus_label()vg_kontraktstype_label()vg_kurstrinn_nus_label()vg_rettstype_inntak_label()
- nudb_use.variables.derive.land module
- nudb_use.variables.derive.nus_variants module
- nudb_use.variables.derive.person module
- nudb_use.variables.derive.person_idents module
- nudb_use.variables.derive.registrert module
- nudb_use.variables.derive.registrert_foerste module
- nudb_use.variables.derive.uh_univ_eller_hoegskole module
- nudb_use.variables.derive.utd_foreldres_utdnivaa module
- nudb_use.variables.derive.utd_hoeyeste module
- nudb_use.variables.derive.utd_skoleaar module
- nudb_use.variables.specific_vars package
nudb_use.variables.checks module¶
Validation utilities for ensuring variable schemas match expectations.
- check_cols_against_klass_codelists(df, col_codelist=None)¶
Validate DataFrame values against KLASS codelists.
- Return type:
None- Parameters:
df (DataFrame)
col_codelist (dict[str, list[str] | dict[str, str]] | None)
- check_column_presence(df, dataset_name=None, check_for=None, raise_errors=True)¶
Validate columns against config or a supplied list.
- Return type:
list[Exception]- Parameters:
df (DataFrame)
dataset_name (str | None)
check_for (None | list[str])
raise_errors (bool)
- identify_cols_not_in_keep_drop_in_paths(paths, cols_keep, cols_drop, raise_error_found=False)¶
Identify columns present in data files that are missing from keep/drop lists.
- Return type:
set[str]- Parameters:
paths (list[Path])
cols_keep (list[str])
cols_drop (list[str])
raise_error_found (bool)
- pyarrow_columns_from_metadata(path)¶
Read column names from a Parquet file via metadata only.
- Return type:
list[str]- Parameters:
path (str | Path)
nudb_use.variables.cleanup module¶
Utilities for reorganizing and trimming NUDB datasets.
- move_col_after_col(df, col_anchor, col_move_after)¶
Move a specified column in a DataFrame to immediately follow another specified column.
- Parameters:
df (
DataFrame) – Input pandas DataFrame.col_anchor (
str) – Name of the column after which the specified column will be moved.col_move_after (
str) – Name of the column to move.
- Returns:
New DataFrame with the specified column moved to follow the anchor column.
- Return type:
pd.DataFrame
- move_content_from_col_to(df, from_col, to_col)¶
Fill empty values (NA) in one column with values from another column.
- Parameters:
df (
DataFrame) – DataFramefrom_col (
str) – Column where information is taken.to_col (
str) – Column where information is moved to.
- Returns:
DataFrame with values filled out.
- Return type:
pd.DataFrame