pyspssio.Reader
- class pyspssio.Reader(*args, row_offset=0, row_limit=None, usecols=None, chunksize=None, convert_datetimes=True, include_user_missing=True, string_nan='', **kwargs)[source]
Bases:
HeaderClass for reading metadata and data
- __init__(*args, row_offset=0, row_limit=None, usecols=None, chunksize=None, convert_datetimes=True, include_user_missing=True, string_nan='', **kwargs)[source]
Methods
__init__(*args[, row_offset, row_limit, ...])close()Close file
commit_header()Finalize metadata
open()Open file
read_data([row_limit, convert_datetimes, ...])Read data
set_locale(locale)Set I/O module to a specific locale
Attributes
case_countNumber of cases
case_sizeRecord case size (in bytes)
case_weight_varCase weight variable
compressionCompression level
file_attributesArbitrary user-defined file attributes
file_encodingFile encoding reported by I/O module
interface_encodingI/O interface mode (Unicode or code page)
is_compatible_encodingCheck encoding compatibility
Metadata object
mrsetsMulti response set definitions
mrsets_countNumber of multi response set definitions
release_infoBasic file information
var_alignmentsVariable alignments
var_attributesVariable attributes
var_column_widthsColumn display widths
var_compat_namesShort (8-byte) variable names
var_countNumber of variables
var_formatsVariable formats as strings
var_formats_tupleVariable formats as tuples in the form (type, width, decimals)
var_handlesVariable handles references
var_labelsVariable labels
var_measure_levelsVariable measure levels
var_missing_valuesMissing values
var_namesVariable names
var_rolesVariable roles
var_setsVariable sets
var_typesVariable types
var_value_labelsVariable value labels
- property metadata: dict
Metadata object
This object contains properties/attributes from the Header class mostly pertaining to variable information and data structure.
- read_data(row_limit=None, convert_datetimes=None, include_user_missing=None)[source]
Read data
- Parameters:
row_limit (
int(default:None)) – Maximum number of rows to returnconvert_datetimes (
bool(default:None)) – Convert SPSS datetimes to Python/Pandas datetime columns; False returns seconds from October 15, 1582 (SPSS start date)include_user_missing (
bool(default:None)) – Whether to keep user missing values or replace them with NaN (numeric) and “” (strings)
- Return type:
DataFrame