2025-04-21
Construct and fit a multiple logistic regression model
Test for significance of individual coefficients or sets of coefficients in multiple logistic regression
Estimate the predicted probability of our outcome using multiple logistic regression
Present the odds ratios for multiple variables at once
Interpret odds ratios for coefficients while adjusting for other variables
For breast cancer diagnosis example, recall:
Primary covariate: Race/ethnicity
Non-Hispanic white individuals are more likely to be diagnosed with breast cancer
Additional covariate: Age
Test for significance of individual coefficients or sets of coefficients in multiple logistic regression
Estimate the predicted probability of our outcome using multiple logistic regression
Present the odds ratios for multiple variables at once
Interpret odds ratios for coefficients while adjusting for other variables
In multiple logistic regression model, we have > 1 independent variable
Sometimes referred to as the “multivariable regression”
The independent variable can be any type:
We will follow similar procedures as we did for simple logistic regression
Why the bold \(X\)?
\(\mathbf{X}\) represents the vector of all the \(X\)’s. This is how we represent our group of covariates in our model.
As with the simple logistic regression, we use maximum likelihood method for estimating coefficients
For a model with \(k\) independent variables, there is \(k+1\) coefficients to estimate
\[ \begin{aligned} \text{logit}\left(\pi(\mathbf{X})\right) = & \beta_0 +\beta_1 \cdot I \left( R/E = \text{``H/L"} \right) +\beta_2 \cdot I \left( R/E = \text{``NH AIAN"} \right) \\ & +\beta_3 \cdot I \left( R/E = \text{``NH API'} \right) +\beta_4 \cdot I \left( R/E = \text{``NH B"} \right) +\beta_5 \cdot \text{Age}_c \end{aligned}\]
summary()multi_bc = glm(Late_stage_diag ~ Race_Ethnicity + Age_c, data = bc, family = binomial)
summary(multi_bc)
Call:
glm(formula = Late_stage_diag ~ Race_Ethnicity + Age_c, family = binomial,
data = bc)
Coefficients:
Estimate Std. Error z value
(Intercept) -1.038389 0.027292 -38.048
Race_EthnicityHispanic-Latinx -0.015424 0.083653 -0.184
Race_EthnicityNH American Indian/Alaskan Native -0.085704 0.484110 -0.177
Race_EthnicityNH Asian/Pacific Islander 0.133965 0.083797 1.599
Race_EthnicityNH Black 0.357692 0.071789 4.983
Age_c 0.057151 0.003209 17.811
Pr(>|z|)
(Intercept) < 2e-16 ***
Race_EthnicityHispanic-Latinx 0.854
Race_EthnicityNH American Indian/Alaskan Native 0.859
Race_EthnicityNH Asian/Pacific Islander 0.110
Race_EthnicityNH Black 6.27e-07 ***
Age_c < 2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
(Dispersion parameter for binomial family taken to be 1)
Null deviance: 11861 on 9999 degrees of freedom
Residual deviance: 11484 on 9994 degrees of freedom
AIC: 11496
Number of Fisher Scoring iterations: 4
tidy()multi_bc = glm(Late_stage_diag ~ Race_Ethnicity + Age_c, data = bc, family = binomial)
tidy(multi_bc, conf.int=T) %>% gt() %>% tab_options(table.font.size = 38) %>%
fmt_number(decimals = 3)| term | estimate | std.error | statistic | p.value | conf.low | conf.high |
|---|---|---|---|---|---|---|
| (Intercept) | −1.038 | 0.027 | −38.048 | 0.000 | −1.092 | −0.985 |
| Race_EthnicityHispanic-Latinx | −0.015 | 0.084 | −0.184 | 0.854 | −0.181 | 0.147 |
| Race_EthnicityNH American Indian/Alaskan Native | −0.086 | 0.484 | −0.177 | 0.859 | −1.120 | 0.813 |
| Race_EthnicityNH Asian/Pacific Islander | 0.134 | 0.084 | 1.599 | 0.110 | −0.032 | 0.297 |
| Race_EthnicityNH Black | 0.358 | 0.072 | 4.983 | 0.000 | 0.216 | 0.498 |
| Age_c | 0.057 | 0.003 | 17.811 | 0.000 | 0.051 | 0.063 |
\[ \begin{aligned} \text{logit}\left(\widehat{\pi}(\mathbf{X})\right) = & \widehat{\beta}_0 + \widehat{\beta}_1 \cdot I \left( R/E = \text{``H/L"} \right) + \widehat{\beta}_2 \cdot I \left( R/E = \text{``NH AIAN"} \right) \\ & + \widehat{\beta}_3 \cdot I \left( R/E = \text{``NH API'} \right) + \widehat{\beta}_4 \cdot I \left( R/E = \text{``NH B"} \right) + \widehat{\beta}_5 \cdot \text{Age}_c \\ \\ \text{logit}\left(\widehat{\pi}(\mathbf{X})\right) = &-4.56 -0.02 \cdot I \left( R/E = \text{``H/L"} \right) -0.09 \cdot I \left( R/E = \text{``NH AIAN"} \right) \\ & +0.13 \cdot I \left( R/E = \text{``NH API'} \right) +0.36 \cdot I \left( R/E = \text{``NH B"} \right) +0.06 \cdot \text{Age}_c \end{aligned}\]
Estimate the predicted probability of our outcome using multiple logistic regression
Present the odds ratios for multiple variables at once
Interpret odds ratios for coefficients while adjusting for other variables
We use the same three tests that we discussed in Simple Logistic Regression to test individual coefficients
Wald test
Score test
Likelihood ratio test (LRT)
When I say “test a single coefficient” or “test multiple coefficients” I am referring to the \(\beta\)’s
When I say “test a variable” I mean “determine if the model with the variable is more likely than the model without that variable”
Assumes test statistic W follows a standard normal distribution under the null hypothesis
Test statistic: \[W=\frac{{\widehat{\beta}}_j}{SE_{\widehat{\beta}_j}}\sim N(0,1)\]
95% Wald confidence interval: \[{\widehat{\beta}}_1\pm1.96 \cdot SE_{{\widehat{\beta}}_j}\]
The Wald test is a routine output in R (summary() of glm() output)
tidy()Important note: Wald test is best for confidence intervals of our coefficient estimates or estimated odds ratios.
\[ \begin{aligned} \text{logit}\left(\pi(x_i)\right) = & \beta_0 +\beta_1 \cdot I \left( R/E = \text{``H/L"} \right) +\beta_2 \cdot I \left( R/E = \text{``NH AIAN"} \right) \\ & +\beta_3 \cdot I \left( R/E = \text{``NH API'} \right) +\beta_4 \cdot I \left( R/E = \text{``NH B"} \right) +\beta_5 \cdot \text{Age}_c \end{aligned}\]
If we want to use the Wald test to determine if a variable is significant in our model
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
Needed steps:
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
Set the level of significance \(\alpha\)
Specify the null ( \(H_0\) ) and alternative ( \(H_A\) ) hypotheses
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
| term | estimate | std.error | statistic | p.value | conf.low | conf.high |
|---|---|---|---|---|---|---|
| (Intercept) | −1.04 | 0.03 | −38.05 | 0.00 | −1.09 | −0.99 |
| Race_EthnicityHispanic-Latinx | −0.02 | 0.08 | −0.18 | 0.85 | −0.18 | 0.15 |
| Race_EthnicityNH American Indian/Alaskan Native | −0.09 | 0.48 | −0.18 | 0.86 | −1.12 | 0.81 |
| Race_EthnicityNH Asian/Pacific Islander | 0.13 | 0.08 | 1.60 | 0.11 | −0.03 | 0.30 |
| Race_EthnicityNH Black | 0.36 | 0.07 | 4.98 | 0.00 | 0.22 | 0.50 |
| Age_c | 0.06 | 0.00 | 17.81 | 0.00 | 0.05 | 0.06 |
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
| Characteristic | OR1 | 95% CI1 | p-value |
|---|---|---|---|
| Race/ethnicity | |||
| NH White | — | — | |
| Hispanic-Latinx | 0.98 | 0.83, 1.16 | 0.9 |
| NH American Indian/Alaskan Native | 0.92 | 0.33, 2.25 | 0.9 |
| NH Asian/Pacific Islander | 1.14 | 0.97, 1.35 | 0.11 |
| NH Black | 1.43 | 1.24, 1.65 | <0.001 |
| Age (1 year increase) | 1.06 | 1.05, 1.07 | <0.001 |
| NH stands for “Non-Hispanic” | |||
| 1 OR = Odds Ratio, CI = Confidence Interval | |||
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
Given race and ethnicity is already in the model, the regression model with age is more likely than the model without age (p-val < 0.001).
Likelihood ratio test answers the question:
Test a single coefficient by comparing different models
Important: LRT can be used conduct hypothesis tests for multiple coefficients
If testing single variable and it’s continuous or binary, still use this hypothesis test:
If testing single variable and it’s categorical with more than 2 groups, use this hypothesis test:
If testing a set of variables, use this hypothesis test:
\[ \begin{aligned} \text{logit}\left(\pi(\mathbf{X})\right) = & \beta_0 +\beta_1 \cdot I \left( R/E = \text{``H/L"} \right) +\beta_2 \cdot I \left( R/E = \text{``NH AIAN"} \right) \\ & +\beta_3 \cdot I \left( R/E = \text{``NH API'} \right) +\beta_4 \cdot I \left( R/E = \text{``NH B"} \right) +\beta_5 \cdot Age_c \end{aligned}\]
We can test a single coefficient or multiple coefficients
Likelihood ratio test is only suitable to test “nested” models
“Nested” models means the bigger model (full model) contains all the independent variables of the smaller model (reduced model)
We cannot compare the following two models using LRT:
Model 1: \[ \begin{aligned} \text{logit}\left(\pi(\mathbf{X})\right) = & \beta_0 +\beta_1 \cdot I \left( R/E = \text{``H/L"} \right) +\beta_2 \cdot I \left( R/E = \text{``NH AIAN"} \right) \\ & +\beta_3 \cdot I \left( R/E = \text{``NH API'} \right) +\beta_4 \cdot I \left( R/E = \text{``NH B"} \right) \end{aligned}\]
Model 2: \[\begin{aligned} \text{logit}\left(\pi(Age)\right) = & \beta_0+\beta_1 \cdot Age_c \end{aligned}\]
If the two models to be compared are not nested, likelihood ratio test should not be used
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
Needed steps:
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
Set the level of significance \(\alpha\)
Specify the null ( \(H_0\) ) and alternative ( \(H_A\) ) hypotheses
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
multi_bc = glm(Late_stage_diag ~ Race_Ethnicity + Age_c, data = bc, family = binomial)
re_bc = glm(Late_stage_diag ~ Race_Ethnicity, data = bc, family = binomial)
lmtest::lrtest(multi_bc, re_bc)Likelihood ratio test
Model 1: Late_stage_diag ~ Race_Ethnicity + Age_c
Model 2: Late_stage_diag ~ Race_Ethnicity
#Df LogLik Df Chisq Pr(>Chisq)
1 6 -5741.8
2 5 -5918.1 -1 352.63 < 2.2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Single, continuous variable: Age
Given race and ethnicity is already in the model, is the regression model with age more likely than the model without age?
Given race and ethnicity is already in the model, the regression model with age is more likely than the model without age (p-val < \(2.2\cdot10^{-16}\)).
Single, >2 categorical variable: Race and Ethnicity
Given age is already in the model, is the regression model with race and ethnicity more likely than the model without race and ethnicity?
Needed steps:
Single, >2 categorical variable: Race and Ethnicity
Given age is already in the model, is the regression model with race and ethnicity more likely than the model without race and ethnicity?
Set the level of significance \(\alpha\)
Specify the null ( \(H_0\) ) and alternative ( \(H_A\) ) hypotheses
Single, >2 categorical variable: Race and Ethnicity
Given age is already in the model, is the regression model with race and ethnicity more likely than the model without race and ethnicity?
multi_bc = glm(Late_stage_diag ~ Race_Ethnicity + Age_c, data = bc, family = binomial)
age_bc = glm(Late_stage_diag ~ Age_c, data = bc, family = binomial)
lmtest::lrtest(multi_bc, age_bc)Likelihood ratio test
Model 1: Late_stage_diag ~ Race_Ethnicity + Age_c
Model 2: Late_stage_diag ~ Age_c
#Df LogLik Df Chisq Pr(>Chisq)
1 6 -5741.8
2 2 -5754.8 -4 26.053 3.087e-05 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Single, >2 categorical variable: Race and Ethnicity
Given age is already in the model, is the regression model with race and ethnicity more likely than the model without race and ethnicity?
Given age is already in the model, the regression model with race and ethnicity is more likely than the model without race and ethnicity (p-val = \(3.1\cdot10^{-5}\) < 0.05).
Set of variables: Race and Ethnicity, and Age
Is the regression model with race and ethnicity and age more likely than the model without race and ethnicity nor age?
Needed steps:
Set of variables: Race and Ethnicity, and Age
Is the regression model with race and ethnicity and age more likely than the model without race and ethnicity nor age?
Set the level of significance \(\alpha\)
Specify the null ( \(H_0\) ) and alternative ( \(H_A\) ) hypotheses
Set of variables: Race and Ethnicity, and Age
Is the regression model with race and ethnicity and age more likely than the model without race and ethnicity nor age?
multi_bc = glm(Late_stage_diag ~ Race_Ethnicity + Age_c, data = bc, family = binomial)
intercept_bc = glm(Late_stage_diag ~ 1, data = bc, family = binomial)
lmtest::lrtest(multi_bc, intercept_bc)Likelihood ratio test
Model 1: Late_stage_diag ~ Race_Ethnicity + Age_c
Model 2: Late_stage_diag ~ 1
#Df LogLik Df Chisq Pr(>Chisq)
1 6 -5741.8
2 1 -5930.5 -5 377.32 < 2.2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Set of variables: Race and Ethnicity, and Age
Is the regression model with race and ethnicity and age more likely than the model without race and ethnicity nor age?
The regression model with race and ethnicity and age is more likely than the model omitting race and ethnicity and age (p-val < \(2.2\cdot10^{-16}\)).
Construct and fit a multiple logistic regression model
Test for significance of individual coefficients or sets of coefficients in multiple logistic regression
Present the odds ratios for multiple variables at once
Interpret odds ratios for coefficients while adjusting for other variables
Calculations will be slightly different
We may be interested in predicting probability of having a late stage breast cancer diagnosis for a specific age.
The predicted probability is the estimated probability of having the event for given values of covariate(s)
Recall our fitted model for late stage breast cancer diagnosis: \[ \begin{aligned} \text{logit}\left(\widehat{\pi}(\mathbf{X})\right) = &-4.56 -0.02 \cdot I \left( R/E = \text{``H/L"} \right) -0.09 \cdot I \left( R/E = \text{``NH AIAN"} \right) \\ & +0.13 \cdot I \left( R/E = \text{``NH API'} \right) +0.36 \cdot I \left( R/E = \text{``NH B"} \right) +0.06 \cdot \text{Age}_c \end{aligned}\]
We can convert it to the predicted probability: \[\widehat{\pi}(\mathbf{X})=\dfrac{\exp \left( \widehat{\beta}_0 + \widehat{\beta}_1 \cdot I \left( R/E = \text{``H/L"} \right) + ... + \widehat{\beta}_5 \cdot \text{Age}_c \right)} {1+\exp \left(\widehat{\beta}_0 + \widehat{\beta}_1 \cdot I \left( R/E = \text{``H/L"} \right) + ... + \widehat{\beta}_5 \cdot \text{Age}_c \right)}\]
We can calculate this using the the predict() function like in BSTA 512
augment() functionPredicting probability of late stage breast cancer diagnosis
For someone who is 60 years old and Non-Hispanic Asian/Pacific Islander, what is the predicted probability for late stage breast cancer diagnosis (with confidence intervals)?
Needed steps:
Calculate probability prediction
Check if we can use Normal approximation
Calculate confidence interval
Interpret results
Predicting probability of late stage breast cancer diagnosis
For someone who is 60 years old and Non-Hispanic Asian/Pacific Islander, what is the predicted probability for late stage breast cancer diagnosis (with confidence intervals)?
Predicting probability of late stage breast cancer diagnosis
For someone who is 60 years old and Non-Hispanic Asian/Pacific Islander, what is the predicted probability for late stage breast cancer diagnosis (with confidence intervals)?
We can use the Normal approximation if: \(\widehat{p}n = \widehat{\pi}(X)\cdot n > 10\) and \((1-\widehat{p})n = (1-\widehat{\pi}(X))\cdot n > 10\).
We can use the Normal approximation!
Predicting probability of late stage breast cancer diagnosis
For someone who is 60 years old and Non-Hispanic Asian/Pacific Islander, what is the predicted probability for late stage breast cancer diagnosis (with confidence intervals)?
3b. Calculate confidence interval (Option 2: with Normal approximation)
Predicting probability of late stage breast cancer diagnosis
For someone who is 60 years old and Non-Hispanic Asian/Pacific Islander, what is the predicted probability for late stage breast cancer diagnosis (with confidence intervals)?
For someone who is 60 years old and Non-Hispanic Asian/Pacific Islander, the predicted probability of late stage breast cancer diagnosis is 0.269 (95% CI: 0.238, 0.299).
Construct and fit a multiple logistic regression model
Test for significance of individual coefficients or sets of coefficients in multiple logistic regression
Estimate the predicted probability of our outcome using multiple logistic regression
library(gtsummary)
tbl_regression(multi_bc,
label = list(Race_Ethnicity ~ "Race/ethnicity",
Age_c ~ "Age (1 year increase)"),
exponentiate = TRUE) %>%
as_gt() %>%
tab_options(table.font.size = 38) %>%
tab_source_note(md("*NH stands for \"Non-Hispanic\"*"))| Characteristic | OR1 | 95% CI1 | p-value |
|---|---|---|---|
| Race/ethnicity | |||
| NH White | — | — | |
| Hispanic-Latinx | 0.98 | 0.83, 1.16 | 0.9 |
| NH American Indian/Alaskan Native | 0.92 | 0.33, 2.25 | 0.9 |
| NH Asian/Pacific Islander | 1.14 | 0.97, 1.35 | 0.11 |
| NH Black | 1.43 | 1.24, 1.65 | <0.001 |
| Age (1 year increase) | 1.06 | 1.05, 1.07 | <0.001 |
| NH stands for “Non-Hispanic” | |||
| 1 OR = Odds Ratio, CI = Confidence Interval | |||
library(broom.helpers)
MLR_tidy = tidy_and_attach(multi_bc, conf.int=T, exponentiate = T) %>%
tidy_remove_intercept() %>%
tidy_add_reference_rows() %>%
tidy_add_estimate_to_reference_rows() %>%
tidy_add_term_labels()
glimpse(MLR_tidy)Rows: 6
Columns: 16
$ term <chr> "Race_EthnicityNH White", "Race_EthnicityHispanic-Latin…
$ variable <chr> "Race_Ethnicity", "Race_Ethnicity", "Race_Ethnicity", "…
$ var_label <chr> "Race_Ethnicity", "Race_Ethnicity", "Race_Ethnicity", "…
$ var_class <chr> "factor", "factor", "factor", "factor", "factor", "nume…
$ var_type <chr> "categorical", "categorical", "categorical", "categoric…
$ var_nlevels <int> 5, 5, 5, 5, 5, NA
$ contrasts <chr> "contr.treatment", "contr.treatment", "contr.treatment"…
$ contrasts_type <chr> "treatment", "treatment", "treatment", "treatment", "tr…
$ reference_row <lgl> TRUE, FALSE, FALSE, FALSE, FALSE, NA
$ label <chr> "NH White", "Hispanic-Latinx", "NH American Indian/Alas…
$ estimate <dbl> 1.0000000, 0.9846940, 0.9178662, 1.1433526, 1.4300256, …
$ std.error <dbl> NA, 0.083653090, 0.484110085, 0.083796726, 0.071788616,…
$ statistic <dbl> NA, -0.1843845, -0.1770333, 1.5986877, 4.9825778, 17.81…
$ p.value <dbl> NA, 8.537118e-01, 8.594822e-01, 1.098900e-01, 6.274274e…
$ conf.low <dbl> NA, 0.8344282, 0.3262638, 0.9688184, 1.2414629, 1.05221…
$ conf.high <dbl> NA, 1.158411, 2.254643, 1.345732, 1.645053, 1.065538
MLR_tidy = MLR_tidy %>%
mutate(var_label = case_match(var_label,
"Race_Ethnicity" ~ "Race and ethnicity",
"Age_c" ~ ""),
label = case_match(label,
"NH White" ~ "Non-Hispanic White",
"Hispanic-Latinx" ~ "Hispanic-Latinx",
"NH American Indian/Alaskan Native" ~ "Non-Hispanic American \n Indian/Alaskan Native",
"NH Asian/Pacific Islander" ~ "Non-Hispanic \n Asian/Pacific Islander",
"NH Black" ~ "Non-Hispanic Black",
"Age_c" ~ "Age (yrs)"))MLR_tidy = MLR_tidy %>% mutate(label = fct_reorder(label, term))
plot_MLR = ggplot(data=MLR_tidy,
aes(y=label, x=estimate, xmin=conf.low, xmax=conf.high)) +
geom_point(size = 3) + geom_errorbarh(height=.2) +
geom_vline(xintercept=1, color='#C2352F', linetype='dashed', alpha=1) +
theme_classic() +
facet_grid(rows = vars(var_label), scales = "free",
space='free_y', switch = "y") +
labs(x = "OR (95% CI)",
title = "Odds ratios of Late Stage Breast Cancer Diagnosis") +
theme(axis.title = element_text(size = 25),
axis.text = element_text(size = 25),
title = element_text(size = 25),
axis.title.y=element_blank(),
strip.text = element_text(size = 25),
strip.placement = "outside",
strip.background = element_blank())
Construct and fit a multiple logistic regression model
Test for significance of individual coefficients or sets of coefficients in multiple logistic regression
Estimate the predicted probability of our outcome using multiple logistic regression
Present the odds ratios for multiple variables at once
Each estimated coefficient provides an estimate of the log odds adjusting for all other variables included in the model
For interpretation, we need to
| Characteristic | OR1 | 95% CI1 | p-value |
|---|---|---|---|
| Race/ethnicity | |||
| NH White | — | — | |
| Hispanic-Latinx | 0.98 | 0.83, 1.16 | 0.9 |
| NH American Indian/Alaskan Native | 0.92 | 0.33, 2.25 | 0.9 |
| NH Asian/Pacific Islander | 1.14 | 0.97, 1.35 | 0.11 |
| NH Black | 1.43 | 1.24, 1.65 | <0.001 |
| Age (1 year increase) | 1.06 | 1.05, 1.07 | <0.001 |
| 1 OR = Odds Ratio, CI = Confidence Interval | |||
The estimated odds of late stage breast cancer diagnosis for Hispanic-Latinx individuals is 0.98 times that of Non-Hispanic White individuals, controlling for age (95% CI: 0.83, 1.16).
The estimated odds of late stage breast cancer diagnosis for Non-Hispanic American Indian/Alaskan Natives is 0.92 times that of Non-Hispanic White individuals, controlling for age (95% CI: 0.33, 2.25).
The estimated odds of late stage breast cancer diagnosis for Non-Hispanic Asian/Pacific Islanders is 1.14 times that of Non-Hispanic White individuals, controlling for age (95% CI: 0.97, 1.35).
The estimated odds of late stage breast cancer diagnosis for Non-Hispanic Black individuals is 1.43 times that of Non-Hispanic White individuals, controlling for age (95% CI: 1.24, 1.65).
For every one year increase in age, there is an 6% increase in the estimated odds of late stage breast cancer diagnosis, adjusting for race and ethnicity (95% CI: 5%, 7%).
Lesson 9: Multiple Logistic Regression