Synthetic dataset of patients with cancer to demonstrate package functions

us_second_cancer

Format

A data frame with the following variables:

fake_id

ID of patient

SEQ_NUM

Original tumor sequence

registry

SEER registry

sex

Biological sex of patient

race

Race

datebirth

Date of birth

t_datediag

Date of diagnosis of tumor

t_site_icd

Primary site of tumor in ICD-O coding

t_hist

Histology, i.e. ICD-O-3-Code on tumor morphology (4 digits)

t_dco

Tumor diagnosis is based on Death Certificate only

fc_age

Age at first primary cancer in years

datedeath

Date of death

p_alive

Patient alive at end of follow-up 2019

p_dodmin

Minimum Date of Death if datedeath is missing

fc_agegroup

Age group of first cancer diagnosis

t_yeardiag

Time period of diagnosis of tumor