Since we only know the sample standard deviation, not the populations standard deviation, and the sample size is relatively small, our test statistic will be t, not z.
t = (xbar - x0)/(s/√n) = (98.2 - 98)/(.4/√18) ≈ 2.121
We can compare this t-statistic with a t-table (such as this one https://www.sjsu.edu/faculty/gerstman/StatPrimer/t-table.pdf), looking for the row where the degrees of freedom = n - 1 = 17.
If we're doing a two-tailed test, we look for where at the top it says "two tails, 0.05," then we look for the row where the degrees of freedom is 17, and we find a critical value of 2.11. It's extremely close, but our test statistic is bigger than the critical value, i.e. it is in the rejection region. Thus there is evidence to suggest that the true average time his students take on a test is significantly different than 98 minutes.