fatf.utils.data.datasets.load_health_records

fatf.utils.data.datasets.load_health_records() → Dict[str, numpy.ndarray]

Loads in a fake health records dataset.

The dataset contains a mixture of categorical and numerical columns generated with faker. The data array is a structured numpy array with the following columns and types:
'name' (string), 'email' (string), 'age' (integer), 'weight' (integer), 'gender' (string), 'zipcode' (string), 'diagnosis' (string) and 'dob' (string), the date of birth. The target variable is binary (numerical) and encodes whether a medical treatment has succeeded: 1 is 'success' and 0 is 'fail'.

Returns
data : Dict[string, numpy.ndarray]
    A dictionary with the dataset and its metadata. See fatf.utils.data.datasets.load_data for the data format.
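
The structured-array layout described above can be explored with plain NumPy. The snippet below is a minimal sketch, not the library's own code: it does not call load_health_records, and the three rows, the string widths in the dtype and the target values are all invented for illustration. Only the column names and their string/integer kinds follow the description above.

```python
import numpy as np

# Assumed dtype: column names and kinds follow the documentation; the string
# widths (U16, U32, U64) are illustrative guesses, not the dataset's real ones.
columns = [
    ('name', 'U32'), ('email', 'U64'), ('age', np.int64),
    ('weight', np.int64), ('gender', 'U16'), ('zipcode', 'U16'),
    ('diagnosis', 'U32'), ('dob', 'U16'),
]

# Hypothetical rows, made up for this example (NOT real dataset content).
data = np.array(
    [
        ('Ann Example', 'ann@example.com', 30, 62, 'female', '1234',
         'flu', '01/01/1990'),
        ('Bob Placeholder', 'bob@example.com', 60, 85, 'male', '5678',
         'cold', '02/02/1960'),
        ('Cy Sample', 'cy@example.com', 45, 74, 'male', '9012',
         'flu', '03/03/1975'),
    ],
    dtype=columns,
)

# Hypothetical binary target: 1 is 'success', 0 is 'fail'.
target = np.array([1, 0, 1])

# A single column of a structured array is selected by its field name.
mean_age = data['age'].mean()

# Rows are filtered with a boolean mask built from the target vector.
success_mask = target == 1
successful_names = data['name'][success_mask]

# Several columns can be selected at once with a list of field names.
numerical = data[['age', 'weight']]

print(mean_age)
print(successful_names)
print(numerical.dtype.names)
```

With FAT Forensics installed, the same field-name indexing should apply to the data array returned by load_health_records; the dictionary keys under which the array and target are stored are described in fatf.utils.data.datasets.load_data and are not assumed here.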