Skip to contents

This dataset, RotterdamBreastCancer_df, is a data frame containing data on 2982 patients with primary breast cancer. The data was collected as part of the Rotterdam tumor bank and was used in Royston and Altman (2013) for survival analysis and prognostic model evaluation.

Usage

data(RotterdamBreastCancer_df)

Format

A data frame with 2982 observations and 15 variables:

pid

Patient ID (integer).

year

Year of diagnosis (integer).

age

Age at diagnosis in years (integer).

meno

Menopausal status: 1 = premenopausal, 2 = postmenopausal (integer).

size

Tumor size categorized into three levels (factor).

grade

Tumor grade: 1 = low, 2 = intermediate, 3 = high (integer).

nodes

Number of lymph nodes involved (integer).

pgr

Progesterone receptor status (integer).

er

Estrogen receptor status (integer).

hormon

Hormonal therapy: 1 = yes, 0 = no (integer).

chemo

Chemotherapy: 1 = yes, 0 = no (integer).

rtime

Time to recurrence in days (numeric).

recur

Recurrence status: 1 = recurrence, 0 = no recurrence (integer).

dtime

Time to death in days (numeric).

death

Death status: 1 = deceased, 0 = alive (integer).

Source

Data taken from the survival package. Based on records from the Rotterdam tumor bank and used in Royston and Altman (2013) for survival analysis.

Details

The dataset name has been kept as 'RotterdamBreastCancer_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the OncoDataSets package and assists users in identifying its specific characteristics. The suffix '_df' indicates that the dataset is a data frame. The original content has not been modified in any way.