Skip to the content.

Example workflow using this website

Preface

In this section I will outline the thought process I would like you to follow when using this website for guidance as well as the necessary items you need to have completed before you can effectively use this site.

What you should have completed already:

Example workflow

My research project is “Is there a notable difference in miles per gallon with between cars with an automatic transmission and cars with an automatic transmission?”

I have collected data that looks like this:

Automatic Manual
21.4 21.0
18.7 21.0
18.1 22.8
14.3 32.4
24.4 30.4
22.8 33.9
19.2 27.3
17.8 26.0
16.4 30.4
17.3 15.8
15.2 19.7
10.4 15.0
10.4 21.4
14.7 21.0
21.5 21.0
15.5 22.8
15.2 32.4
13.3 30.4
19.2 33.9

Identifying the broad category of the question

Out of the list of headings, my research question most closely aligns with the overarching question marked in bold:

I know I am comparing the mean miles per gallon between 2 groups (automatic vs manual transmission).

Deciding what statistical test to use

Now that I have identified the general comparison I would like to do, I will then look at the topic’s subheadings to narrow down my assumptions about the data to determine what sort of test I should use.

Out of the list of subheadings below, bold indicates which category the data for my project is most similar to:

Independent groups:

Are the groups independent or paired?

Independent groups: chi-squared test or Fisher’s exact test

Paired groups: McNemar’s test

The final decision

Now that I have made the assumptions that my data contains two independent groups and the data are most likely normally distributed, I have narrowed it down to two options: a T-test or an ANOVA. I then take a look at the pages for both and realize that the ANOVA would be useful for datasets containing > two categories. However, my dataset only contains two categories (automatic vs manual transmission) therefore the T-test would be a better fit.

I would then choose my analysis tool (JMP, Excel, Sheets, or R) and run the analysis.

Example analysis

I have included an example Excel analysis below.

  1. Input all the raw data into your Excel sheet:

  2. Filter any of the extra data so you are only working with the data of interest

  3. Input the formula for a T-test into google sheets using the formula below. Make sure that the rows of column A and B are representative of the dimensions of your data. In the generic formula: Range1 is the rows of your first set of observations, Range2 is the rows of your second set of observations, tails is either one-tailed or two-tailed (1, 2), and type is either paired test, two-sample equal variance or two-sample unequal variance (1, 2, 3). The generic formula looks like: =T.TEST(Range1, Range2, tails, type) while the specific formula looks like: =T.TEST(A2:A20, B2:B14, 2, 3).

  4. Once you have input the formula into the cell, the number in the cell will be the p-value of the T-test.