Testing for significant difference? help please!

Question

My topic was: Are there significant multitasking differences between males and females? I had a total of 90 subjects (45 males and 45 females) play an online game. The game consisted of levels, and the score they received once they lost was a number (for example, 49, 105, or 65). I also had each subject play the game four times (for four trials) and report the score each time. So a typical "Score Sheet" looked like this:

Practice Trial 1: 18
Trial 1: 34
Trial 2: 58
Trial 3: 42

I was thinking, for each subject, of first taking the average of their THREE HIGHEST SCORES and counting that as their "Final Score." For example, the "Final Score" of the above subject would be 44.6666... Then I would find the mean of all males' Final Scores and the mean of all females' Final Scores. This mean for each gender is what I will use in my statistical test. For instance, say Johnny has a mean of 44.666, David a mean of 47.888, and Rocky a mean of 43.2222, and the same for females. In this case, I was thinking of using an independent-samples t-test.

However, I wanted to know: is it allowed to take the average of each person's 3 highest scores, or is there some other statistical test I have to do before moving on to the t-test? I know that in the t-test you have to take the average of all male scores and all female scores (for instance, the average of Johnny's, David's, and Rocky's MEAN scores and the average of all the individual female MEAN scores).

By the way, I've never taken a statistics class in my life. This is for my Research class, and our teacher somehow expects us to know about statistical analysis. (I just found out what a t-test was a couple of minutes ago through Google and YouTube. I have no clue how to even begin or carry out a t-test, but I'll just have to figure it out.) Thank you! What do you think I should do?

David W. · Accepted Answer

Your initial premise determines which statistical tests and procedures you may use. That's why it is important to state it clearly, to select samples correctly, and to perform only appropriate tests and ...
 
You think that you have two distinct populations -- males and females.  If the samples show that it is not very likely that they differ, then you might not have two populations at all -- that is, there is no significant difference between males and females.  So, you treat males and females as two populations and take a sample from each.  If the groups really do differ in the characteristic being studied, the samples should show that.
 
The sample size determines how confidently an observed difference can be attributed to the population a sample came from rather than to chance.  Each population has a frequency distribution for the characteristic being observed, and the likelihood that a sample's frequency distribution looks like that of its source population increases as the sample size increases.  For example, if it is equally likely that you will roll 1-6 using a "fair" die, the initial frequencies might be uneven, but with more and more rolls, you expect (with some level of certainty) that the frequency distribution will approach uniform.
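The die example can be tried out directly. The following is a minimal sketch, not part of the original answer: the function name `roll_frequencies` and the fixed seed are assumptions chosen for illustration. It rolls a simulated fair die and reports how far the observed frequencies are from the expected 1/6.

```python
import random
from collections import Counter

def roll_frequencies(n_rolls, seed=0):
    """Roll a simulated fair six-sided die n_rolls times.

    Returns a dict mapping each face (1-6) to its relative frequency.
    The seed is fixed only so that repeated runs give the same numbers.
    """
    rng = random.Random(seed)
    counts = Counter(rng.randint(1, 6) for _ in range(n_rolls))
    return {face: counts[face] / n_rolls for face in range(1, 7)}

if __name__ == "__main__":
    for n in (12, 600, 60000):
        freqs = roll_frequencies(n)
        # Largest distance between any face's frequency and the ideal 1/6
        worst = max(abs(f - 1 / 6) for f in freqs.values())
        print(f"{n:>6} rolls: largest gap from 1/6 = {worst:.4f}")
```

With few rolls the gap is typically large; with many rolls it tends to shrink, which is the behavior the paragraph above describes.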
 
So, if two populations (males and females) indeed differ in the characteristic being studied (multitasking), larger and larger samples are more and more likely to show that difference.  This is closely tied to the idea of a "confidence interval," which narrows as the sample size grows.
 
Now, it is critically important that you have carefully defined "multitasking" if you are going to use that term.  And, your definition must say specifically that differences in multitasking produce different scores with this game, so let's just look for a difference in game scores (don't assume anything in definitions! -- you are only determining how likely it is that males and females get different scores on this game).
 
So, statisticians look at frequency distributions first.  That gives them a sense of the distribution of the data.  If there are "outliers" (obvious, unusual data points), there are complicated rules for deleting them.  Some are easy:  a typo, a missing value, a value that doesn't make sense (e.g., outside usual range), etc.  The low score you mentioned is not one of these and should not be eliminated just because you don't like it -- especially if you say that "most subjects get a low score" (then it is obviously not an error).  With multiple observations of one variable, you now have a frequency distribution for one person (one observation) and need to either reduce that to one value (e.g., take an average) or continue to consider that all observations have multiple values (like looking at multiple scores for a student in a class in order to determine whether they improved;  an average per student doesn't show improvement at all).
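As an illustration of reducing one person's several scores to a single value, here is a small sketch using the score sheet from the question. The function names `mean_of_all` and `mean_of_top` are made up for this example; which reduction is appropriate is a design decision for the study, not something the code settles.

```python
from statistics import fmean

def mean_of_all(scores):
    """Average of every reported score, including low ones."""
    return fmean(scores)

def mean_of_top(scores, k=3):
    """Average of the k highest scores (the 'Final Score' idea in the question)."""
    return fmean(sorted(scores, reverse=True)[:k])

# Score sheet from the question: practice trial, then trials 1-3
scores = [18, 34, 58, 42]

print(mean_of_all(scores))            # 38.0
print(round(mean_of_top(scores), 4))  # 44.6667
```

The two reductions give noticeably different values for the same subject, so whichever one is chosen should be applied identically to every subject and stated in the write-up.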
 
If the two frequency distributions look different, that's a good clue that there are differences.  Now the numerical statistical tests will determine the probability that this is not just a random difference between the two populations.  Here, your Research Teacher has an important role -- evaluating your research based on conducting the appropriate statistical tests.  If you are expected to know this already, it makes it tough.  If your Research Teacher can outline:  (1) test 1, (2) test 2, (3) test 3, (4) conclusion, it makes it much easier.  And, the subsequent tests usually depend on the outcomes of the previous tests (e.g., if the two frequency distributions look exactly the same, give up, there's likely no difference).
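Comparing the two frequency distributions by eye does not need special software. The sketch below prints a rough text histogram; the function name `text_histogram`, the bin width of 10, and the sample scores are all assumptions made up for illustration, not data from the study.

```python
from collections import Counter

def text_histogram(values, bin_width=10):
    """Return one text line per bin, e.g. ' 40-49  | ##'.

    Each '#' stands for one value that falls in that bin.
    """
    bins = Counter((int(v) // bin_width) * bin_width for v in values)
    lines = []
    for start in range(min(bins), max(bins) + bin_width, bin_width):
        label = f"{start:>3}-{start + bin_width - 1:<3}"
        lines.append(f"{label} | {'#' * bins[start]}")
    return lines

if __name__ == "__main__":
    # Hypothetical final scores, invented purely to show the output format
    males = [34, 41, 45, 47, 52, 58, 63]
    females = [38, 44, 46, 51, 55, 57, 72]
    for name, group in (("Males", males), ("Females", females)):
        print(name)
        print("\n".join(text_histogram(group)))
```

If the two printed shapes look alike, the numerical test is unlikely to find a difference; if they look different, the test indicates whether the difference is larger than chance would explain.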
 
The description of each statistical test should describe (1) the population, (2) the size of the sample required to assure a required confidence, (3) the fact that the noted difference would not occur any other way (i.e., based on some variable not observed, like age of participants or time-of-day).
 
Since this is for a Research class (graded) and you lack expertise, you should get the best help (not just other students or on-line volunteer tutors) that you can afford in order to make it an excellent learning experience.  Let your Research Teacher know that you are quite serious about the research, about learning the necessary statistics, and about succeeding in your coursework -- teachers usually like students who do more than the required minimum (and thus give better grades).  This means no longer saying, "I've never ..." and "I don't ..." but, rather, "I'm learning ..." and "I think ..."

Jim S. · Answer

Hi Sofia,
       Rather than complicating things by taking the top 3 scores, I would take all 180 male scores and all 180 female scores, compute the group averages, and test whether the average male score equals the average female score using the two-sided t-test; i.e., the null hypothesis is that there is no difference between the two means.
One thing I would do first is construct a histogram for each group, just to see what the overall characteristics of the two data sets look like, i.e., whether they are approximately normal (bell-shaped).
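For readers who want to see what the two-sided t-test computes, here is a minimal sketch of the classic pooled-variance (Student's) two-sample t statistic. The function name `pooled_t_test` is made up for this example, and the p-value uses a normal approximation, which is an assumption that is only reasonable for large samples (roughly 30 or more degrees of freedom); an exact p-value needs the t distribution.

```python
from math import sqrt
from statistics import NormalDist, fmean, variance

def pooled_t_test(a, b):
    """Two-sample t-test with pooled variance (assumes equal variances).

    Returns (t, degrees_of_freedom, approx_two_sided_p).
    approx_two_sided_p is a normal approximation, not the exact t-based value.
    """
    n1, n2 = len(a), len(b)
    df = n1 + n2 - 2
    # Weighted average of the two sample variances
    pooled_var = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / df
    std_err = sqrt(pooled_var * (1 / n1 + 1 / n2))
    t = (fmean(a) - fmean(b)) / std_err
    approx_p = 2 * (1 - NormalDist().cdf(abs(t)))
    return t, df, approx_p

if __name__ == "__main__":
    # Tiny made-up groups, only to show the call; far too small for the
    # normal approximation of the p-value to be trusted.
    t, df, p = pooled_t_test([1, 2, 3, 4, 5], [2, 3, 4, 5, 6])
    print(t, df)  # -1.0 8
```

The function accepts any two lists of numbers, so it works with either all raw scores per group or one averaged score per subject. The t-test assumes the observations are independent, and repeated trials from the same person are not, so one value per subject is the more cautious input. If SciPy is installed, `scipy.stats.ttest_ind(a, b)` computes the same statistic along with an exact p-value.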
 
Hope this helps, let me know if you have additional questions.
 
Jim
