Log-Rank Test

Tests whether hazard function of different groups are equal.

Definition

Let:

$t_{1} < t_{2} < \dots < t_{D}$ : ordered distinct event times (across all groups pooled)
$D$ : number of distinct event times
$K$ : number of groups (populations) being compared
$d_{ij}$ : number of events in group $j$ at time $t_{i}$
$Y_{ij}$ : number at risk in group $j$ just before $t_{i}$
$d_{i} = \sum_{j = 1}^{K} d_{ij}$ : total events at time $t_{i}$ (pooled)
$Y_{i} = \sum_{j = 1}^{K} Y_{ij}$ : total at risk at time $t_{i}$ (pooled)
$W (t_{i})$ : weight function applied at time $t_{i}$ (see Weight Functions)

Hypotheses

H_{0} H_{1} : h_{1} (t) = h_{2} (t) = \dots = h_{K} (t) \forall t \leq τ : not all h_{j} (t) are equal for some t \leq τ

where $τ$ is the end of study time (maximum follow-up).

Under $H_{0}$ , the expected hazard in each group is $E [h_{j} (t)] = \frac{d _{i}}{Y _{i}}$

Test Statistic

$Z_{j} (τ) = \sum_{i = 1}^{D} W (t_{i}) [d_{ij} - Y_{ij} \frac{d _{i}}{Y _{i}}], j = 1, \dots, K$

Variance and Covariance

\overset{σ}{^}_{jj} \overset{σ}{^}_{j g} = i = 1 \sum D W (t_{i})^{2} \frac{Y _{ij}}{Y _{i}} (1 - \frac{Y _{ij}}{Y _{i}}) (\frac{Y _{i} - d _{i}}{Y _{i} - 1}) d_{i} = - i = 1 \sum D W (t_{i})^{2} \frac{Y _{ij} Y _{i g}}{Y _{i}^{2}} (\frac{Y _{i} - d _{i}}{Y _{i} - 1}) d_{i}, g \neq = j

Overall Test

Let $\hat{Σ}$ be the $(K - 1) \times (K - 1)$ estimated covariance matrix with entries $\overset{σ}{^}_{jj}$ on the diagonal and $\overset{σ}{^}_{j g}$ off-diagonal. Then:

$χ^{2} = (Z_{1}, \dots, Z_{K - 1}) \hat{Σ}^{- 1} (Z_{1}, \dots, Z_{K - 1})^{T} \sim χ_{K - 1}^{2}$

For $K = 2$ , the test simplifies to: $Z = \frac{\sum _{i = 1}^{D} W ( t _{i} ) [ d _{i 1} - Y _{i 1} \frac{d _{i}}{Y _{i}} ]}{\sum _{i = 1}^{D} W ( t _{i} ) ^{2} \frac{Y _{i 1}}{Y _{i}} ( 1 - \frac{Y _{i 1}}{Y _{i}} ) ( \frac{Y _{i} - d _{i}}{Y _{i} - 1} ) d _{i}} \sim N (0, 1)$

Weight Functions

Test Name	$W (t_{i})$	Description
Log-Rank	$1$	Equal weight to all event times
Gehan / Breslow	$Y_{i}$	Generalization of Mann-Whitney-Wilcoxon / Kruskal-Wallis
Tarone-Ware	$Y_{i}$	Intermediate weighting between log-rank and Gehan

Interpretation

The log-rank test ( $W = 1$ ) is most powerful when hazard ratios are constant over time. Gehan’s test ( $W = Y_{i}$ ) gives more weight to early event times. Tarone-Ware ( $W = Y_{i}$ ) balances the two.

Example: Leukemia Remission Data

Case: Compare survival between two leukemia treatment groups — Group 1 (6-MP treatment) vs Group 2 (placebo).

# Group 1 (6-MP treatment)
time1  <- c(6, 6, 6, 7, 10, 13, 16, 22, 23, 6, 9, 10, 11, 17, 19, 20, 25, 32, 32, 34, 35)
status1 <- c(1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0)
 
# Group 2 (placebo)
time2  <- c(1, 1, 2, 2, 3, 4, 4, 5, 5, 8, 8, 8, 8, 11, 11, 12, 12, 15, 17, 22, 23)
status2 <- c(1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1)

Step 1 — KM curves:

library(survival)
 
fit1 <- survfit(Surv(time1, status1) ~ 1)
fit2 <- survfit(Surv(time2, status2) ~ 1)
 
plot(fit1, conf.int="none", col="blue",
     xlab="Time (weeks)", ylab="Survival Probability")
lines(fit2, conf.int="none", col="red")
legend(19, 1, c("Treatment", "Placebo"), col=c("blue","red"), lty=1)

Step 2 — Log-rank test:

time      <- c(time1, time2)
status    <- c(status1, status2)
treatment <- c(rep(1, length(time1)), rep(2, length(time2)))
 
fit <- survdiff(Surv(time, status) ~ treatment)
fit
# Output: N, Observed, Expected, (O-E)^2/E, (O-E)^2/V, Chisq, p-value

Interpretation: The log-rank test answers: “Do the two survival curves differ beyond random chance?” The KM plot provides visual comparison; the log-rank test provides statistical confirmation. A significant p-value means the treatment effect is statistically significant.

Interpretation

For survival analysis, the log-rank test ( $W = 1$ ) is most powerful under the proportional hazards assumption. If PH is violated, consider alternative weights (Gehan/Breslow) or methods.

Log-Rank Test

Table of Contents

Table of Contents

Log-Rank Test

Definition

Hypotheses

Test Statistic

Variance and Covariance

Overall Test

Weight Functions

Example: Leukemia Remission Data

Recent Notes

M/M/∞ Queueing System

M/M/s Queueing System

M/M/1 Queueing System

Queueing System with Balking

Birth and Death Queueing Models

Graph View

Related notes