Definition
The Cox Proportional Hazards (Cox-PH) Model is a semiparametric regression model for survival data that relates hazard to covariates (the s) without assuming a distribution for the baseline hazard.
Model:
In linear form:
where:
- : baseline hazard — the hazard when all covariates equal 0 (or reference level). This is a function of time only.
- : regression coefficients, each corresponds to covariate
- : vector of covariates (can be categorical or numeric)
Semiparametric Nature
The model splits into two components:
| Component | Nature | Description |
|---|---|---|
| Non-parametric | Distribution ignored — no parametric form assumed | |
| Parametric | Parameters estimated from data |
The model is semiparametric because it contains both a non-parametric part () and a parametric part (the covariate contribution). is treated as a nuisance parameter — inference focuses on .
Why “Proportional Hazards”?
For two subjects A and B with covariate vectors and :
The hazard ratio (HR) is constant over time — it depends only on the difference in covariates, not on . This is the proportional hazards assumption.
Covariate Types
- Numeric: enters directly (e.g., age, dose). Higher-order terms (e.g., ) can be added.
- Categorical: Dummy variables required. For categories, dummy variables with one category as reference.
Dummy Variable Construction
For a categorical variable stage with levels 1–4 (reference = stage 4):
Model:
| Stage | Model | Hazard relative to Stage 4 |
|---|---|---|
| 1 | ||
| 2 | ||
| 3 | ||
| 4 (ref) |
Interactions
If the effect of depends on , add an interaction term:
For interactions between categorical (dummy) and numeric variables, multiply each dummy by the numeric variable.
Dependency Graph
graph TD classDef def fill:#1e3a5f,stroke:#90caf9,stroke-width:2px,color:#bbdefb classDef proc fill:#1b3b28,stroke:#a5d6a7,stroke-width:2px,color:#c8e6c9 classDef sheet fill:#3d2e15,stroke:#ffcc80,stroke-width:2px,color:#ffe0b2 subgraph Prereqs["Prerequisites"] ST["Survival Time — def"]:::def CD["Censored Data — def"]:::def HF["Hazard Function — def"]:::def LCD["Likelihood (Censored) — def"]:::def KM["Kaplan-Meier — def"]:::def LR["Log-Rank Test — def"]:::def STR["Stratified Test — def"]:::def ST --> HF CD --> LCD HF --> KM LCD --> KM KM --> LR LR --> STR end subgraph CoxPH["Cox Proportional Hazards"] CPH["Cox PH Model — def"]:::def HR["Hazard Ratio — def"]:::def PHC["Checking PH Assumption — proc"]:::proc PL["Partial Likelihood — def"]:::def LWT["Likelihood with Ties — def"]:::def CS["Cox PH Cheatsheet — sheet"]:::sheet CPH --> HR CPH --> PL HR --> PHC PL --> LWT PHC --> CS LWT --> CS end HF --> CPH STR --> CPH LCD --> PL