# Introduction to Statistics

Statistics is a branch of mathematics that deals with the collection, analysis and interpretation of data.

Data can be defined as groups of information that represent the qualitative or quantitative attributes of a variable or set of variables. In layman's terms, data in statistics can be any set of information that describes a given entity. An example of data can be the ages of the students in a given class. When you collect those ages, that becomes your data.

A set in statistics is referred to as a population. Though this term is commonly used to refer to the number of people in a given place, in statistics, a population refers to any entire set from which you collect data.

## Data Collection Methods

As we have seen in the definition of statistics, data collection is a fundamental aspect and as a consequence, there are different methods of collecting data which when used on one particular set will result in different kinds of data. Let's move on to look at these individual methods of collection in order to better understand the types of data that will result.

### Census Data Collection

Census data collection is a method of collecting data whereby all the data from each and every member of the population is collected.

For example, when you collect the ages of all the students in a given class, you are using the census data collection method since you are including all the members of the population (which is the class in this case).

This method of data collection is very expensive (tedious, time consuming and costly) if the number of elements (population size) is very large. To understand the scope of how expensive it is, think of trying to count all the ten year old boys in the country. That would take a lot of time and resources, which you may not have.

### Sample Data Collection

Sample data collection, which is commonly just referred to as sampling, is a method which collects data from only a chosen portion of the population.

Sampling assumes that the portion that is chosen to be sampled is a good estimate of the entire population. Thus one can save resources and time by only collecting data from a small part of the population. But this raises the question of whether sampling is accurate or not. The answer is that for the most part, sampling is approximately accurate. This is only true if you choose your sample carefully to be able to closely approximate what the true population consists of.

Sampling is used commonly in everyday life, for example, all the different research polls that are conducted before elections. Pollsters don't ask all the people in a given state who they'll vote for, but they choose a small sample and assume that these people represent how the entire population of the state is likely to vote. History has shown that these polls are almost always close to accuracy, and as such sampling is a very powerful tool in statistics.

### Experimental Data Collection

Experimental data collection involves one performing an experiment and then collecting the data to be further analyzed. Experiments involve tests and the results of these tests are your data.

An example of experimental data collection is rolling a die one hundred times while recording the outcomes. Your data would be the results you get in each roll. The experiment could involve rolling the die in different ways and recording the results for each of those different ways.

Experimental data collection is useful in testing theories and different products and is a very fundamental aspect of mathematics and all science as a whole.

### Observational Data Collection

Observational data collection method involves not carrying out an experiment but observing without influencing the population at all. Observational data collection is popular in studying trends and behaviors of society where, for example, the lives of a bunch of people are observed and data is collected for the different aspects of their lives.