1. Can I treat the ordinal predictor as continuous?
Short answer: Yes, you can - and it's often done in practice, especially if:
- The ordinal variable has many levels (generally ≥5 is a common heuristic),
- The distances between levels are believed to be approximately equal (i.e., the "interval" assumption isn't wildly implausible),
- You're primarily interested in testing linear trends (e.g., "does increasing disease severity relate to a change in Y?").
Arguments in favor:
- Simplicity and interpretability: You get one coefficient that reflects the linear trend across levels.1.
- Efficiency: Fewer parameters to estimate compared to modeling each level (e.g., as dummies).
- Power: Less loss of degrees of freedom = more statistical power.
Common in practice: Especially when the variable is derived from summing multiple subdomains (as your severity score is), many researchers treat such scores as continuous.
2. How can I test whether it’s “okay” to treat it as continuous?
Best practices in R:
a. Compare the linear model vs categorical model:
If the ANOVA p-value is not significant, the linear term is sufficient - treating X as continuous is defensible.
AIC: Lower = better. A small AIC difference (<2) suggests both models are comparable.
b. Test linearity of effect:
You can use a restricted cubic spline or generalized additive model (GAM) to check for non-linearity:
or
If spline or GAM suggests a nearly straight line, that supports using a linear term.
3. If not, what’s the most accepted method in clinical research?
If treating as continuous is not defensible:
Treat as categorical (factor): This is the most conservative and most commonly used method in clinical research.
Pros: No assumptions about linearity or equal spacing.
Cons: More parameters; less power.
Polynomial contrasts or splines:
Sometimes used to model non-linear but smooth effects.
Splines (e.g., natural splines) allow flexibility without estimating 10 dummy variables.
Ordinal regression: As you mentioned, not applicable since Y is continuous.