Lab 2: Feedback and Discussion
2026-02-11
Notes from me and the TAs
- Make sure your .Rproj is open!
- Please use the template .qmd that I give you!
- Univariate exploration should include all the race variables (indicators for each race)
- You should not have any NA’s
- Please make sure you coded EACH variable correctly…
- Saw people missing a gender identity
- Please interpret all the coefficient estimates!!
- Everyone should have more than one coefficient
- Good reference for self-perception: “Neither ….”
- The whole point of plotting each variable is to start understanding trends in the data…
- Looking at some plots, I don’t think you can do that…
Notes on plots
- Make sure your categorical variable is presented in order
- Make sure your plots are clear and easy to read (e.g., axis labels, title, etc.)
- If the category names are too long, tilt the names or make new line
- Histograms are for CONTINUOUS variables, bar plots are for CATEGORICAL variables
- Do NOT use the continuous variables for your categorical predictor!!
- I’m seeing a lot of people use
important_001 instead of their categorical/factor version of this variable
- Univariate vs bivariate
- Colors in histograms are fun, but not necessary!!
- Do not use a different color for each plot - color is meant to convey information!
- The main thing it does it alert me that you might be using AI bc AI does this a lot!
Get into groups of 2-4
- No more than 4!
- Try to group with people with whom you do not already work on labs
- Introduce yourself if you do not know each other
- Share you html documents with each other (email, airdrop, etc.)
Introduce Research Question
- Share your research question with each other!
- Should be using the same outcome + your main variable
- Share your other variables
Share your bivariate plots
- Can you get an idea of the relationship between your main variable and the outcome from the plot?
- Is your predictor categorical?
- Do you see any trends?
Simple logistic regression
- Does everyone have the correct interpretations of the coefficient estimates?
- What’s your reference group? Does this make sense?
- What trends are you seeing?