This dataset, horse_colic_surgery_df, is a data frame containing clinical observations of horses with colic, where the primary task is to determine if the lesion requires surgery. The data consists of 300 cases with 31 clinical variables, modified from the original UCI repository version with adjusted factor levels.
Usage
data(horse_colic_surgery_df)
Format
A data frame with 300 observations and 31 variables:
- surgery
Factor with 2 levels indicating surgical requirement
- age
Factor with 1 level (age group)
- hospitalID
Integer hospital identifier
- temp_rectal
Numeric rectal temperature
- pulse
Numeric pulse rate
- respiratory_rate
Numeric respiratory rate
- temp_extreme
Factor with 4 levels (temperature extremes)
- pulse_peripheral
Factor with 4 levels (peripheral pulse)
- capillayr_refill_time
Factor with 3 levels (capillary refill time)
- pain
Numeric pain score
- peristalsis
Numeric peristalsis measure
- abdominal_distension
Numeric distension score
- nasogastric_tube
Numeric tube measure
- nasogastric_reflux
Numeric reflux quantity
- nasogastric_reflux_PH
Numeric reflux pH
- rectal_examination
Numeric exam result
- abdomen
Numeric abdomen assessment
- cell_volume
Numeric cell volume
- protein
Numeric protein level
- abdominocentesis_appearance
Numeric appearance score
- abdomcentesis_protein
Numeric protein measure
- outcome
Factor with 3 levels (outcome status)
- surgical_lesion
Factor with 2 levels (lesion type)
- lesion_type1
Factor with 60 levels (primary lesion type)
- lesion_type2
Integer secondary lesion code
- lesion_type3
Integer tertiary lesion code
- cp_data
Factor with 2 levels (CP data)
- temp_extreme_ordered
Ordered factor with 4 levels (temperature)
- temp_extreme_num
Numeric temperature measure
- mucous_membranes_col
Factor with 6 levels (membrane color)
- mucous_membranes_group
Factor with 5 levels (membrane group)
Details
The dataset name has been kept as 'horse_colic_surgery_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the DigestiveDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way beyond factor level adjustments.