Skip to the content.

What is categorical vs numerical data?

TL;DR numerical data refers to numbers while categorical data is basically everything else.

Categorical data

Categorical data is any data that can be place in a particular category based on its name or label. For example, if I survey 100 people at a boba shop and ask them what sort of drink they ordered the data I am recording is categorical. From this data, each person provides exactly one response making their answer mutually exclusive.

Categorical data can be further broken down into ordinal and nominal data.

Numerical data

Numerical data is just numbers in their pure form. They are not being used to rank anything and they are not being used as descriptors. For example, if I measure the time it takes for a drop of paint to dry, that would be numerical data. If I measure the time it takes for a blade of grass to grow 1 mm; that would also be numerical data.

Much like categorial data, numerical data can also occur in two forms: continuous and discrete data.

Within continuous data there are two additional categories: interval and ratio data.