Definition

The Cox Proportional Hazards (Cox-PH) Model is a semiparametric regression model for survival data that relates hazard to covariates (the s) without assuming a distribution for the baseline hazard.

Model:

In linear form:

where:

  • : baseline hazard — the hazard when all covariates equal 0 (or reference level). This is a function of time only.
  • : regression coefficients, each corresponds to covariate
  • : vector of covariates (can be categorical or numeric)

Semiparametric Nature

The model splits into two components:

ComponentNatureDescription
Non-parametricDistribution ignored — no parametric form assumed
ParametricParameters estimated from data

The model is semiparametric because it contains both a non-parametric part () and a parametric part (the covariate contribution). is treated as a nuisance parameter — inference focuses on .

Why “Proportional Hazards”?

For two subjects A and B with covariate vectors and :

The hazard ratio (HR) is constant over time — it depends only on the difference in covariates, not on . This is the proportional hazards assumption.

Covariate Types

  • Numeric: enters directly (e.g., age, dose). Higher-order terms (e.g., ) can be added.
  • Categorical: Dummy variables required. For categories, dummy variables with one category as reference.

Dummy Variable Construction

For a categorical variable stage with levels 1–4 (reference = stage 4):

Model:

StageModelHazard relative to Stage 4
1
2
3
4 (ref)

Interactions

If the effect of depends on , add an interaction term:

For interactions between categorical (dummy) and numeric variables, multiply each dummy by the numeric variable.

Dependency Graph

graph TD
    classDef def fill:#1e3a5f,stroke:#90caf9,stroke-width:2px,color:#bbdefb
    classDef proc fill:#1b3b28,stroke:#a5d6a7,stroke-width:2px,color:#c8e6c9
    classDef sheet fill:#3d2e15,stroke:#ffcc80,stroke-width:2px,color:#ffe0b2

    subgraph Prereqs["Prerequisites"]
        ST["Survival Time — def"]:::def
        CD["Censored Data — def"]:::def
        HF["Hazard Function — def"]:::def
        LCD["Likelihood (Censored) — def"]:::def
        KM["Kaplan-Meier — def"]:::def
        LR["Log-Rank Test — def"]:::def
        STR["Stratified Test — def"]:::def
        
        ST --> HF
        CD --> LCD
        HF --> KM
        LCD --> KM
        KM --> LR
        LR --> STR
    end

    subgraph CoxPH["Cox Proportional Hazards"]
        CPH["Cox PH Model — def"]:::def
        HR["Hazard Ratio — def"]:::def
        PHC["Checking PH Assumption — proc"]:::proc
        PL["Partial Likelihood — def"]:::def
        LWT["Likelihood with Ties — def"]:::def
        CS["Cox PH Cheatsheet — sheet"]:::sheet
        
        CPH --> HR
        CPH --> PL
        HR --> PHC
        PL --> LWT
        PHC --> CS
        LWT --> CS
    end

    HF --> CPH
    STR --> CPH
    LCD --> PL