#separator:tab
#html:true
#notetype column:1
#deck column:2
#tags column:5
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Censored Random Variable	\[X := \min{(T, C)}\] \[V := \begin{cases}    1 & T \leq C\\\\   0 & T &gt; C \end{cases} \] Where \(T\) is the time to event and \(C\) is the time to censoring. \(V\) is called the visibility.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Survivor Function<br>(Also its two properties)	\[F(t) = {\mathbb{P}}(T &gt; t) = \int_t^\infty f(t)\,dt\] <br>1. Decreasing in \(t\) <br> 2. When continuous, \(F(0) = 1\)<br> <br>NOTE: Our professor denotes the survivor function as \(F(t)\), but the usual convention is to write \(S(t)\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Kaplan-Meier Estimate, Form 2	\[ \hat F(t) = \prod_{j:a_j \leq t} \left(1 - \frac{d_j}{r_j}\right)\] <br>with \(d_j  = |\{ i: x_i = a_i \land v_i = 1 \}|\) (non-censured death times at event time \(a_i\))<br> and \(r_j = |\{ i : x_i \geq a_i \}|\) (all events at and after \(t = a_i\); number at risk)<br> <br> Note this is based on the erroneous assumptions that \(a_i...a_m\) are fixed constants and that \(T\) is discrete only with events at \(a_j\). Though it ends up being computationally equivalent to method 1!<br> \(\frac{d_j}{r_j}\) represents the probability of any event occurring at \(a_i\) (of the number of people at risk, how many were known to have perished in the time interval?)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> The hypothesis tested in the Log Rank Test	Null hypothesis: \(F_j^{(0)} = F_j^{(1)}\) for all \(j \in [m]\)<br> Alternate hypothesis: \(F_j^{(0)} \not= F_j^{(1)}\) for any \(j \in [m]\)<br> <br> where \(F_j^{(k)}\) represents the survivor function over \((a_{j}, a_{j+1}]\) for the \(k\)th group in \(k \in \{0, 1\}\). (assume the \(a_j\) are all the event times of the two groups put together)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> The Log-Rank Test	\[\frac{Z}{S} \underset{approx}{\sim} {\mathrm{N}}(0, 1)\] <br>with <br>\[Z = \sum_j Z_j = \sum_j d_j^{(0)} - d_j\frac{r_j^{(0)}}{r_j}\] Z is the number of observed (uncensored) events minus the number of expected events were \(H_0\) true.<br>  <br> NONEXAMINABLE: \[S^2 = \sum_j \frac{d_j (r_j - d_j)r_j^{(0)}r_j^{(1)}}{r_j^{2}(r_j-1)}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	What's a good rule of thumb for when the Log-Rank test will be powerful?	The curves are well-seperated and don't cross	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Relative Risk	\[\begin{align*} RR &= \frac{\text{sum of observed / sum of expected (in group 0)}}{\text{sum of observed / sum of expected (in group 1)}}\\\\ &= \frac{\sum_j d_j^{(0)} / \sum_j d_j \frac{r_j^{(0)}}{r_j} }{\sum_j d_j^{(1)} / \sum_j d_j \frac{r_j^{(1)}}{r_j}} \end{align*}\] <br>Events are \(RR\times\) more likely in group 0 than group 1.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> The Hypergeometric Distribution	The probability of \(k\) successes given \(n\) draws, where we draw without replacement from \(N\) objects with \(K\) that we want. <br>\[\mathrm{Hypergeometric}(N, K, n)\] <br>(probability distribution over \(k\))	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Expectation of the hypergeometric distribution	\[{\mathbb{E}}(\mathrm{Hypergeometric}(N, K, n)) = n \frac{K}{N}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Accelerated-Life Families	A family of distributions generated by \(F(\lambda t)\) for some scaling \(\lambda &gt; 0\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Proportional-Hazards Families	A family of distributions generated by \(F(t)^k\) for some shape parameter \(k &gt;0\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Survivor Function of the Weibull Distribution	\[F(t) = e^{-(\theta t)^k}\] where \(\theta &gt;0\) is the shape parameter and \(\theta &gt; 0\) is the scale parameter. <br>Note that this reduces to the exponential RV when \(k = 1\). <br>Weibull distributions are both an accelerated life and proportional hazards family!	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	State the contribution to the likelihood for an observed event versus a censored event.	When \(v_i = 1\) (uncensored), the contribution is the density \(f(x_i, \theta)\). <br> When \(v_i = 0\) (censored), the contribution is \({\mathbb{P}}(x_i &lt; T_i) = F(x_i, \theta)\) (where \(F\) is the survivor function). <br>So the total likelihood is \[L(\theta) = \prod_{v_i = 1}f(x_i, \theta)\prod_{v_i = 0}F(x_i, \theta)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	PDF of the exponential distribution	\[f(t; \theta) = \theta e^{-\theta t}\] given \(\theta &gt; 0\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Survivor Function of the exponential distribution	\[F(t; \theta) = e^{-\theta t}\] given \(\theta &gt;0\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Requirements for a nonparametric survival function \(\tilde F(t)\)	1. Bounded between zero and one: \(0 \leq \tilde F(t) \leq 1\)<br> 2. Decreasing: where \(a \leq b\), \(\tilde F(a) \geq \tilde F(b)\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Contributions to a non-parametric survivor function likelihood	Where uncensored, we have: \[{\mathbb{P}}(T = x_i) = {\mathbb{P}}(T \geq x_i) - {\mathbb{P}}(T &gt; x_i) = F(x_i^-) - F(x_i) = F_i^- - F_i\] where the minus subscript represents the limit from the left.<br> <br>Where censored, we have: \[{\mathbb{P}}(T &gt; x_i) = F(x_i) = F_i\] <br>So for each event the likelihood contribution is \[L(\theta) = \prod_{v_i = 1}(F_i^- - F_i) \prod_{v_i = 0} F_i\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	How can we simplify a non-parametric survivor function likelihood?	We can reduce the number of parameters by our assumptions: \[F_1^- = 1\] \[F_n = 0\] Also the function should only change between \(F_i^-\) and \(F_i\) terms! (we're trying to build a step function; piecewise linear)<br> We should end up with a polynomial we can maximimize easily.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Non-parametric likelihood contributions of left-censored observations and interval-censored observations	Left-censored: \(1 - F(x_i) = 1 - F_i\)<br> Interval-censored on an interval \((a, b)\): \(F(a) - F(b)\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Hazard Function (raw form)	\[h(t) = \lim_{\Delta \to 0} \frac{{\mathbb{P}}(t &lt; T &lt; t + \Delta {\,|\,} T &gt; t)}{\Delta}\] You can simplify this to the form generally used using Bayes' Theorem and recognizing the definition of derivative: \[h(t) = \frac{f(t)}{F(t)}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Hazard Function (practical form)	\[h(t) = \frac{f(t)}{F(t)}\] where \(F(t)\) is the survivor function.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Integrated Hazard	\[H(t) = \int_0^t h(t') dt'\] Models the accumulation of hazard over time.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Survivor function as a function of integrated hazard	\[F(t) = e^{-H(t)}\] or equivalently \[H(t) = -\log(F(t))\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Counting Process	A function \(N : {\mathbb{R}} \to {\mathbb{N}}\) that counts how many times an event has occurred. It always fulfills:<br> \[N(0) = 0\] Always increasing<br> Always right-continuous (increments at an event time, not right after it)<br>	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Survival Analysis as a Counting Process	In survival analysis, we only care about the first time an event happens, so \(N(t) \in \{0, 1\}\).<br> <br>Putting it in our usual notation, we have: \[N(t) = I\{ X \leq t, V = 1 \}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> History (of a stochastic process)	\({\mathcal{H}}_t\) denotes knowledge of everything that has happened in a stochastic process up to and including \(t\)<br> <br>\({\mathcal{H}}_{t-}\) denotes knowledge up to but not including \(t\).<br> <br>For our purposes, \({\mathcal{H}}_{t-}\) is completely captured by membership to the risk set at \(t\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	What do we mean by \(dN(t)\)?	It's called Stieltjes Notation: \[dN(t) = N(t + dt) - N(t)\] It can only take the values \(0\) or \(1\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Intensity \(\lambda(t)\)	\[\lambda_i(t) = Y_i(t)h_i(t)\] <br>where \(Y_i(t) = I\{X_i \geq t\}\) (1 when person \(i\) is still in the risk set). <br>Let \(\Lambda\) be the integrated intensity, so equivalently we have <br>\[d\Lambda_i(t) = \sum_i Y_i(t)dH_i(t)\] <br>Define \(\Lambda_+ = \sum_i \Lambda_i\) and \(Y_+ = \sum_i Y_i\), and assume \(H_i(t) = H(t)\) for all \(i\) (could also assume proportionality instead) <br>\[d\Lambda_+(t) = Y_+(t) dH(t)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Show that \(dN(t) - d\Lambda(t)\) is a martingale	Work from the definition of the derivative of the integrated intensity: \[\begin{align*} {\mathbb{P}}(dN(t) = 1 {\,|\,} {\mathcal{H}}_{t-}) &= d\Lambda(t)\\\\ {\mathbb{E}}[dN(t) {\,|\,} {\mathcal{H}}_{t-}] &= d\Lambda(t)\\\\ {\mathbb{E}}[dN(t) - d\Lambda(t) {\,|\,} {\mathcal{H}}_{t-}] &= 0 \end{align*}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Given the per-individual martingale identity \[{\mathbb{E}}[dN(t) - d\Lambda(t) {\,|\,} {\mathcal{H}}_{t-}]= 0,\] show that the population derivative of integrated intensity equals the derivative of the population counting process at all \(t\).	Define the population counting process and integrated hazard: \[N_+(t) = \sum_i N_i(t)\] \[\Lambda_+(t) = \sum_i \Lambda_i(t)\] By our assumption, we have: \[\begin{align*} {\mathbb{E}}[dN(t)] - {\mathbb{E}}[ d\Lambda(t) {\,|\,} {\mathcal{H}}_{t-}]&= 0\\\\ {\mathbb{E}}[dN(t)] &= {\mathbb{E}}[ d\Lambda(t) {\,|\,} {\mathcal{H}}_{t-}] \end{align*}\] So by the method of moments, we have: \[dN_+(t) = d\Lambda_+(t)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Use \(dN_+(t) = d\Lambda_+(t)\) and \(d\Lambda_+(t) = Y_+(t) dH(t)\) to build the Nelson-Aalen Estimator (find an estimator \(d\hat H(t)\))	Substituting in and isolating \(d\hat H(t)\), we get: \[d\hat H(t) = \frac{dN_+(s)}{Y_+(s)}\] Integrate both sides: \[\int_0^t d\hat H(s)ds = \hat H(t) = \int_0^t \frac{dN_+(s)}{Y_+(s)} ds\] But \(N_+(t)\) is just a step function that jumps at each failure time, so this integral is just a sum of 'snapshots' at each \(t_j \leq t\): \[\hat H(t) = \sum_{t_j \leq t} \frac{1}{Y_+(t_j)}\] Sum of snapshots of hazard increments.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Nelson-Aalen Estimator	Given a set of events \(\{a_1, ...,a_j,..., a_n\}\) with no ties, we can estimate the cumulative hazard \(H(t)\) as \[\hat H(t) = \sum_{a_j \leq t} \frac{1}{Y_+(a_j)}\] where \(Y_+(t)\) is the number of people at risk at \(t\) (this is exactly the \(r_j\) used in Kaplan-Meier).<br> <br>Note that we can estimate the survivor function using Nelson-Aalen: \[\hat F(t) = \exp(-\hat H(t))\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Semiparametric Modeling	Model the bits we care about explaining (\(\beta\)) with a parametric method, and model the bits we don't care so much about (\(\psi\)) with a nonparametric method.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Proportional Hazards Modeling	Semiparametric method for modeling hazard. Assume that everyone has hazards in the same proportional hazards family: \[h_2(t) = Kh_1(t),\, K&gt; 0\] Express \(K\) with the explainable parameters \(\beta\), and express \(h_0(t)\) with the nonparametric \(\psi\): \[h(t, z^i, \beta, \psi) = \phi(z^i, \beta)h_0(t, \psi)\] <br>Note that this is a strong assumption!	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Partial Likelihood	An expression of likelihood that neglects some of the data in order to solve for only some of the model parameters.<br> <br>In Cox regression, it was shown that this is mathematically valid!	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> (Partial) Likelihood in Proportional Hazards Modeling	\[L(\beta) = \prod_{i\,:\, v_i = 1} \frac{\phi(z^i, \beta)}{\sum_{i'\in R_i} \phi(z^{i'}, \beta)}\] where \(R_i\) is the risk set at \(t_i\). Note that \(h_0(x_i, \psi)\) gets cancelled out in the fraction.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Cox Regression	Proportional hazards modeling with the following form for \(\phi(z, \beta)\): \[\phi(z, \beta) = e^{\beta^T z}\]<br> <br>This yields \[L(\beta) = \prod_{i\,:\, v_i = 1} \frac{e^{\beta^T z^i}}{\sum_{i'\in R_i} e^{\beta^T z^{i'}}}\] It's easy to take the log-likelihood then: \[\log L(\beta) = \sum_i v_i \left( \beta^Tz^i - \log{\sum_{i' \in R_i} e^{\beta^T z^{i'}}} \right)\] And a derivative with respect to \(\beta\): \[ \frac{d}{d\beta} \log L(\beta) = \sum_i v_i \left( z^i - \sum_{i' \in R_i}\frac{z^{i'}e^{\beta^T z^{i'}}}{ e^{\beta^T z^{i'}}} \right) \] Use Newton-Raphson from here.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	The two kinds of ties in a Nelson-Aalen model, and how we can account of them	1. Tie by lack of precision<br> Suppose in the dataset ordered 1,2,3,4, 2 and 3 tied. Then we can account for this by just adding the two possibilities: \[...\left( \frac{\phi_2 }{\phi_2 + \phi_3 +\phi_4}\frac{\phi_3}{\phi_3 + \phi_4} + \frac{\phi_3 }{\phi_2 + \phi_3 +\phi_4}\frac{\phi_2}{\phi_2 + \phi_4} \right)...\] We can approximate this with \[...\left( \frac{\phi_2\phi_3 }{(\phi_2 + \phi_3 +\phi_4)(\frac{1}{2}\phi_2 + \frac{1}{2}\phi_3 + \phi_4)} \right)...\] 2. Genuine Tie<br> Same example, use the following: \[...\left( \frac{\phi_2\phi_3 }{\phi_2\phi_3 + \phi_2\phi_4 + \phi_3\phi_4} \right)...\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Estimating Baseline Hazard	\[ \hat H_0(t) = \sum_{i : x_i \leq t,\, v_i = 1} \frac{1}{\sum_{i' \in R_i}\hat \phi(z_{'i}, \beta)} \]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Many Baseline Hazards Modeling	Suppose we had a treatment trail where individuals in the US and the UK are separately given the same medication. We expect the treatment effect to be the same, but the baseline risk to be different. <br> <br>To model this: we say there are \(L\) strata, with the function \(q(i)\) mapping the \(i\)th individual to their strata group. Then just use the custom risk set: \[R_i := \{ i' :\, x_{i'} \geq x_i \land q(i') = q(i) \}\] For an event occurring to individual \(i\), we only consider other individuals in the same strata (with known events occurring after that individual's events). <br> <br>Estimate \(\beta\) by taking the product of each stratum's partial likelihood. Estimate the baseline hazard separately.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Matched Pair Analysis Setup	Smallest possible stratification. We have two treatments \(k \in \{ 0,1\}\) and \(L\) pairs where one is exposed to treatment, and the other is not. Using our partial hazard split per individual: <br>\[h_{k, l}(t) = \phi_k h_l(t)\] <br>Clearly we can't estimate each individual \(h_l(t)\), but we can estimate the \(\phi_k\) shared among each pair. <br> <br>Assume we have the baseline effect \(\phi_0 = 1\), and the treatment effect \(\phi_1 = e^\beta\) (or combined \(\phi_k=e^{k \beta}\). \(\beta\) is called the log hazard ratio between treatments.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Matched Pair Analysis Likelihood Contributions	Ignore pairs with ties: nothing useful we can learn from a tie. There's only two pieces of information we need from each pair \(l\): \[k(l) = I\{X_{1,l} &lt; X_{0,l} \}\] Just the index of whichever treatment fails first. \[v(l) = V_{k(l), l}\] 0 if it was actually a censoring, not a failure; 1 if not.<br> This yields the full likelihood contribution: \[ \left(\frac{e^{k(l)\beta}}{1 + e^\beta} \right)^{v(l)} \] This ends up just being equivalent to a binomial likelihood!<br> <br>Note that there's no likelihood contribution when:<br> - there's a tie<br> - one individual is censored before the other has an event<br> - both individuals are censored	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	When we graph log(survival) and log(log(survival)) over time, what are we graphing?	log(survival): Integrated hazard, slope is hazard<br> log(log(survival)): We'll see a constant vertical difference between two curves if their hazards are proportional.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Probability Integral Transform for Hazard	Take the random variable \(T\) with integrated hazard \(H(t)\). Then the random variable \(U := H(T)\) is a \(Exp(1)\) distribution. <br>Proof: \[\begin{align*} {\mathbb{P}}(u &gt; U) &= {\mathbb{P}}(u &gt; H(T))\\\\ &= {\mathbb{P}}(H^{-1}(u) &gt; T) \\\\ &= F(H^{-1}(T))\\\\  &= e^{-H(H^{-1}(u))} \\\\ &= e^{-u}  \end{align*}\] <br>We can use this to quantify how accurate our models are!	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Cox-Snell Residual	Fit the model to construct \(y_i = \hat H_i(x_i)\). By the PIT for Hazard, we expect \(y_i \sim Exp(1)\). <br> <br>Problem: we still may have censored individuals. So we can use a method like Kaplan-Meier to instead build a nonparametric representation of \(\hat F_{y_i}\).<br>  <br>Graphically, \(\hat F_y(y)\) should look like a straight line against \(y\), with a slope of -1.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Trick for accounting for smaller-than-expected \(y_i\) mean in computing the Cox-Snell Residual	Add one to the \(y_i\)s for each censored individual: \[y_{i}' := y_i + (1-v_i)\] <br>This comes from the memoryless property of the exponential: \[{\mathbb{E}}[Y | Y + c] = c + 1\] This strategy is called mean imputation.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Martingale Residual	\[y_{i}'' = 1-y_{i}' = v_i - \hat H_i(x_i)\] We know this must have mean zero: \({\mathbb{E}}[y_{i}''] = 0\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	How can we use Martingale plots as a means of designing an explanatory model?	1. Fit a model without using explanatory variables (\(z\)).<br> 2. Plot the Martingale residuals \((y_i'')\) against \(z\): where the graph is above 0, we see more events than expected. For instance if you see a log plot, that implies you should probably use Cox regression.<br> 3. Fit a fixed dataset and see if the means have been corrected towards zero.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Frailty	Each individual has some frailty \(u\) that describes how susceptible they are to risk. We denote the population risk \(\bar F(t)\): <br>\[\bar F = {\mathbb{E}}_u[F(t{\,|\,} U = u)]\] <br>Note however that this doesn't imply that the population hazard \(\bar h(t)\) is \({\mathbb{E}}_u[h(t{\,|\,} u)]\): \[\bar h(t) = \frac{\bar f(t {\,|\,} u)}{\bar F(t {\,|\,} u)}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Population Hazard under Proportional Frailty	\[ h(t{\,|\,} U=u) = uh_0(t)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Gamma Distribution PDF	\[Gamma(x; p, \lambda) = \frac{\lambda^p x^{p-1}e^{-\lambda x}}{\Gamma(p)}\] \(p\) is called the shape or index parameter, \(\lambda\) is the rate parameter.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Mean of the Gamma Distribution	\[\frac{p}{\lambda}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Variance of the Gamma Distribution	\[\frac{p}{\lambda^2}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Laplace Transform of the Gamma Distribution	With \(Gamma(x; p, \lambda)\): \[\left(\frac{p}{1 + p} \right)^{\lambda}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Population Hazard under Gamma-distributed Frailty	\[\bar h(t) = \frac{p h_0(t)}{p + H_0(t)}\] <br>A lower \(p = \lambda = \psi\) means a higher variance among individuals: they either die off quickly or stick around.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Cox Regression with Frailty	\[h_{z}(t {\,|\,} U = u) = u e^{\beta z}h_0(t)\] <br>You can use this to substitute into population hazard: <br>\[\bar h(t) = \frac{p e^{\beta t} h_0(t)}{p + e^{\beta z} H_0(t)}\] <br>Note then the population hazard ratio will start at \(e^\beta\) at \(t = 0\), but then converge to 1 as \(t \to \infty\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	What is the central problem with frailty in practice? How can we resolve this?	There's not enough information in \((X_i, V_i)\) pairs alone to assess \(g\) and \(H_0\) separately. Three resolutions:<br> <br>1. Be aware that we can't tell apart a homogeneous population with decreasing hazard from a population varying in frailty but with constant hazard over time.<br> <br>2. Measure as many explanatory variables as possible<br> <br>3. Use a model where frailty goes into the error term (accelerated frailty): \[F(t{\,|\,} u) = F_0(ue^{\beta z}t)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	The two approaches to competing risks modeling	1. Two different times to events, \(T_A\) and \(T_b\), we only observe one.<br> <br>2. One time-to-event variable \(T\), and at event time, choose between event type \(A\) or \(B\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	What do \(F(t)\), \(\tilde f_A(t)\), and \(G_A(t)\) represent in competing hazards modeling, and how do we compute them?	\(F(t)\) is just the probability of any event before \(t\), so  \[F(t) = \exp(-H_A(t) - H_B(t))\] <br>\(\tilde f_A(t)\) is the event density at \(t\) (not a proper distribution): \[\tilde f_A(t) = h_A(t)F(t)\] <br>\(G_A(t)\) is the probability of a the specific event \(A\) occurring before or at \(t\), so  \[G_A(t) = \int_0^t \tilde f_A(t') dt'\]<br> <br>Note we can also compute \(\tilde F_A(t) = \exp(-H_A(t))\), the survivor function for \(A\) if the event \(B\) entirely disappeared.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Aalen-Johansen Estimator	\[\hat G_A(t) = \sum_{j: a_j \leq t} \left[ \prod_{j'=1}^{j' = j-1} (1 - \Delta \hat H_{j'}) \right] \Delta \hat H_j^A\] <br>where  \[\Delta \hat H_j = \frac{1}{r_j}\] \[\Delta \hat H^A_j = \frac{1}{r_j}{\mathbb{1}}_{E_j = A}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Net Survival Setup	Want to consider excess deaths (E) versus the chance of a background event that has nothing to do with the disease we're studying (B). <br>\[h^i(t) = h^i_B(t) + h_E^i(t)\] <br>We want an unbiased estimator for  \[d\bar H_E(t) = \frac{\sum_i F_E^i(t) dH_E^i(t)}{\sum_i F_E^i(t)}\] Incremental excess integrated population hazard<br> <br>(Note that we assume \(h^i_B(t)\) is known from government data)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Pohar-Perme Estimator	\[d\tilde H_E(t) = \frac{\sum_i dN_i(t) / F_B^i(t)}{\sum_i Y_i(t) / F_B^i(t)} - \frac{\sum_i Y_i(t) dH^i_B(t) / F_B^i(t)}{\sum_i Y_i(t) / F_B^i(t)}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Causation versus association	Causation is the language of intervention. Association is the language of probability.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Confounding	When some event is a common cause of two variables. The two variables may appear correlated, but don't cause each other.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Collider Bias / Berkson's Paradox	Celebrity example: You may notice that ugly celebrities tend to be talented, and that untalented celebrities tend to be attractive. But this is only because they wouldn't be a celebrity if they weren't either.<br> <br>When two different events cause the same outcome, and that outcome makes observation easier, then those two events may seem erroneously negatively correlated.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Reverse Causation	If symptoms are consistently treated, treatment may appear to 'cause' the illness.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	How can we measure causality?	1. Perform controlled experiments<br> 2. Perform randomized experiments<br> 3. Account for confounding in analyses<br> Use instrumental variables<br> 4. Assume that some part of the data-generating process behaves like randomization (eg mendelian randomization)<br> This is called a natural experiment	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Negative Controls	To check your work, check causality of obviously uncorrelated phenomena	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Difference-in-Differences	Sudden divergence of trendlines in two populations: there must be some reason for the divergence!	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Regression Continuity	Check that measured data that should be smooth is actually smooth. Jump points are suspicious.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Immortal Time Bias	Must consider the age requirement of some property: nobel prize winners are older than non-nobel prize winners.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Intention-to-treat	Who will be included in the study must be set-in-stone before time zero (can't disclude people on the fly)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Case-control studies	Case studies paired with a control group. Not optimal, but sometimes necessary for rare diseases.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Cohort Studies	Look at a (snapshot) population and observe what effects it over time.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Rubin's Dictum	For objective causal inference, design trumps analysis.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Transition Rate, and when it is a Markov process and time homogeneous	\[q_{rs}(t; {\mathcal{F}}_t)=\lim_{\delta t \to 0} \frac{{\mathbb{P}}(X(t + \delta t) = s {\,|\,} X(t) = r,\, {\mathcal{F}}_t)}{\delta t}\] \(X(t)\) represents the state the individual is in at time \(t\), \({\mathcal{F}}_t\) is all information about the process prior to \(t\).<br> <br>It's a Markov process when \(q_{rs}\) has no dependence on \({\mathcal{F}}_t\), and it's time-homogeneous when \(q_{rs}\) has no dependence on time (time probabilities over an interval only depend on the duration of the interval).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Transition Intensity Matrix \(Q\)	\[Q_{r,s} := q_{r,s} \text{ for } r \neq s\] \[Q_{r,r} = -\sum_{s \neq r} q_{rs}\] So rows of \(Q\) sum to zero.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Sojourn Time	Distribution of time spent in state \(r\) before moving<br> Has an exponential distribution with rate \[\lambda = \sum_{s,\, s \neq r} q_{rs} = -q_{rr}\]<br> <br>So the average time in a state before switching is given by  <br>\[\frac{1}{\lambda} = \frac{1}{\sum_{s,\, s \neq r} q_{rs}} = \frac{1}{-q_{rr}}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Probability that next state is \(s\) given current state is \(r\) in a continuous Markov process	\[\frac{q_{rs}}{\sum_{j,\, j\neq r} q_{rj}} = \frac{-q_{rs}}{q_{rr}}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Transition Probability Matrix \(P(t)\)	\[P_{rs}(t) = {\mathbb{P}}(\text{State $s$ at time $t$} {\,|\,} \text{State $r$ at time $0$})\] Solution to the forward Kolmogorov equations: \[\frac{dP(t)}{dt} = P(t)Q(t)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	How to solve for the transition probability matrix given \(P(0) = I\), \(Q\) is time-homogeneous	\[P(t) = \exp (tQ) = \sum_{n=0}^\infty \frac{t_n}{n!}Q^n\] Use an eigendecomposition for \(Q = UDU\inv\), \[= \exp(tQ) = U\exp (Dt) U\inv\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Expected time matrix \(T(t)\) (in a continuous time Markov process)	Expected total length of time spent in state \(s\), starting from state \(r\): \[T_{rs}(t) = {\mathbb{E}}\left[ \int_0^t I_{X(u) = s} du \right] = \int_0^t P_{rs}(u)du\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Expected number of visits to a state (in a continuous-time Markov process)	\[\sum_{i\neq s}\int_0^t P_{ri}(u) q_{is}du = \sum_{i\neq s} T_{ri}(t)q_{is}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	How to find the probability of ever visiting a state (in a continuous-time Markov process)	Create a new problem by zeroing out the \(s\)th row of Q (a trap state), then just compute the new \(P^*(t) = \exp(tQ^*)\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Likelihood contribution of discrete data in a continuous-time Markov process	Let \(a\) be \(x_{i0}\) (state before), and let \(b\) be \(x_{i1}\) (state after), then \[{\mathbb{P}}(X_{i1} = b {\,|\,} a) = P_{ab}(t_{i1} - t_{i0} {\,|\,} \theta)\] Take the product of these all to get the full likelihood.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Likelihood contribution of a known time interruption (like death) in a continuous time Markov process	Problem: we don't know what state the individual was in prior to death. So we have to allow for every possibility: \[\sum_{s \neq d} p_{rs}(t_{i,n_i} - t_{i, n_{i-1}})q_{sd}\] where \(d\) is the death state, and \(r\) is the last known pre-death state \(x_{i,n_{i-1}}\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Likelihood contribution of a censored individual in a continuous time Markov process	Problem: We don't know what disease state the individual is in at the end of the time period, only that they never died (or else we would have known) \[\sum_{s \neq d} P_{r,s}(t_{i, n_i} - t_{i,n_{i-1}}) = 1 - P_{r,d}(t_{i, n_i} - t_{i,n_{i-1}})\] where \(d\) is the death state, and \(r\) is the last known pre-death state \(x_{i,n_i-1}\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Net Survival Integral	\[F_{net} = \exp \int_0^t  d\bar H_E(s) ds\] What fraction of patients would survive to time \(t\) if it were the only possible cause of death?	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	How can we make up for left-truncated data?	Divide by \(F(S_i)\) in the likelihood contributions (if the individual had had the event before \(S_i\), they would not have been included in the dataset)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	Accounting for piecewise hazard rates	Divide time into intervals; use left-truncation on all but the first region and right-censor any observation after the cutoff time. <br>\[\hat \theta_{interval} = \frac{\text{number of events in interval}}{\text{sum of all time at risk in interval}}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Best way to account for left-truncated data in practice	Kaplan-meier accounts for this already; just make sure that left-truncated individuals aren't considered as part of the risk set until their introduction.<br> <br>This gives the same estimator as the likelihood and piecewise approaches.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Period Survival	We want to characterize survival rates in some restricted period, say 2024. Denote the interval of interest \((a, b]\).<br> <br>Convert calendar time (time \(u\), \(y_i\) diagnosis date, \(z_i\) event observation date, \(w_i\) event or censored) into individual's time (\(s_i\) truncation time, \(x_i\) event/censoring time, \(v_i\) event or censored)<br> <br>Only include individuals that spend some amount of time in the interval pre-event. Left-truncate time before interval cutoff; censor events that occur after the time cutoff.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Assumptions made in the SIR model	\(X_i(t)\) maps the \(i\)th individual to the state they are in at time \(t\).<br> \(S(t), I(t), R(t)\) are the number of people susceptible, infected, and recovered, respectively. ie, \[S(t) = \sum_{i=1}^N I_{X_i(t) = S}\] \(N\) is the total population, assume it is fixed, so that for all \(t\), \[N = S(t) + I(t) + R(t)\] The rate of moving from \(S\) to \(I\) is denoted \(\lambda(t)\), is usually equal to \(\beta I(t)\).<br> The rate of moving from \(I\) to \(R\) is denoted \(\gamma(t)\), usually just a constant \(\gamma\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Gillespie Algorithm	Continuous time discrete individual algorithm:<br> Two events:<br> someone is infected (\((s, i) \mapsto (s-1, i+1)\))<br> or someone recovers (\((s, i) \mapsto (s, i-1)\)).<br> Draw next time: \(\sim \exp (\beta is + \gamma i)\),<br> At that next time, pick one of the two events (like a binomial): infected w.p. \(\frac{\beta is}{\beta is + \gamma i}\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Chain-Binomial Model	Discrete time discrete individual algorithm, sort of like a probabilistic ODE:<br> Take time steps of size \(\delta: [t, t + \delta)\)<br> \(S_{t + \delta} = S_t - B_t\)<br> \(I_{t + \delta} = I_t + B_t - C_t\)<br> \(R_{t + \delta} = R_t + C_t\)<br> where \(B_t, C_t\) are binomial RVs: \[B_t \sim \mathrm{Binom}(S_t, 1 - \exp (-\beta I_t \delta)) \] \[C_t \sim \mathrm{Binom}(I_t, 1 - \exp (-\gamma \delta)) \] (where the exponential comes from \(\geq 1\) event in a Poisson RV)<br> We can simplify \(B_t\) if we assume \(\beta \delta\) is small: \[B_t \sim \mathrm{Binom}(S_t, \beta I_t \delta)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Reed-Frost Model	Given state \((S_t, I_t)\), individuals have a latent (infected) time of 1, and at the end of their latent period, they try to infect every susceptible individual with probability \(p\). So \[I_{t+1} \sim \mathrm{Binomial}(S_t, 1 - (1-p)^{I_t})\] Useful for small populations like households.	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Greenwood Formulation	Chain binomial model where instead of the infection rate being proportional to the number of infected individuals, it's just \(p\) if any infected individual exists: \[I_{t + 1} \sim \mathrm{Binomial} (S_t, p)\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Continuous Deterministic SIR Model	Assume we start with \(N\) susceptible individuals and 1 infected individual. System of ODEs:<br> \(\frac{d}{dt}S(t) = -\lambda(t) S(t)\)<br> \(\frac{d}{dt} I(t) = \lambda(t) S(t) - \gamma I(t)\)<br> \(\frac{d}{dt}R(t) = \gamma I(t)\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Threshold result for the SIR ODE system	We need \(\frac{dI}{dt}(0) &gt; 0\), so we must have \(S(0) = N &gt; \gamma / \beta\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> The reproductive number \(R_0\)	We need \(\frac{dI}{dt}(0) &gt; 0\), so we must have \(S(0) = N &gt; \gamma / \beta\)<br> <br>Define \(R_0 = N \frac{\beta}{\gamma}\): number of secondary infections caused by one infected individual in a fully susceptible population (epidemic will only take off if at time zero, \(R_0 &gt; 1\).<br> <br>Related also to \(R_e(t)\), just the \(R_0\) with the current \(S(t)\) in place of \(N\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Prevalence of an infection	\[\pi(t) = I(t) / N\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Delayed Incubation Period Equation	\[\mu(t) = \int_0^t h(u) f(t - u) du\] where \(\mu(t)\) is the rate of newly observed cases at \(t\), \(h(t)\) is the rate of infections at \(t\), and \(f\) is a probability distribution for the incubation length of the disease. Note this is a convolution!<br> <br>Note this simplifies to the following in discrete time: \[\mu_k = \sum_{i=1}^k h_i f_{k-i}\]	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Population Survivor under Gamma-distributed Frailty	Assume the density of \(u\) is given by \(g(u)\), with \({\mathbb{E}}[g(u)] = 1\). Then we have: <br>\[{\mathbb{E}}_u[F(t)] = {\mathbb{E}}_u[e^{-H(t{\,|\,} u)}] = \int_0^\infty \exp (-u H_0(t))g(u) du\] Recognizing the Laplace transform and plugging in \(u \sim Gamma(p, \lambda)\): \[\bar F(t) = \left( \frac{p}{p+H_0(t)} \right)^\lambda\] <br>Usually we want the expected value of \(g(u)\) to be one, so we normally use \(p = \lambda\), denoted \(\psi\).	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Risk Set	\[R = \{ j : x_j \geq a_j \}\] Note \(r_j = |R|\)<br> Note that this includes everyone that hasn't been censored yet right before \(t_j\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	<b>Definition:</b> Event Set / Death Set	\[D = \{ i : x_i = a_i \land v_i = 1 \}\] Note \(d_j = |D|\) The number of individuals with (uncensored) events exactly at \(t_i\)	MedicalStats PartIIINotes
Basic	Part III Notes::MedicalStats	MLE for the log-likelihood for the exponential survivor function	\[\hat \theta = \frac{V_+}{X_+}\] where \(V_+\) is the total number of (non-censored) events, and \(X_+\) is the total time spent at risk.	MedicalStats PartIIINotes