# All definitions for Chapter 1 in the Edexcel 9-1 GCSE Statistics Textbook

This is a detailed compilation of every single textbook definition for terms within GCSE Statistics Chapter 1: Collection of Data. I've taken these terms right out of the textbook: http://bit.ly/Statisics-Book, so it is 100% correct! Hope it helps!

Created by: ABizz
Created on: 13-04-19 14:49
Data
Information
Raw data
Data as it is collected - before it is ordered, grouped or rounded
Quantitative data
Numerical observations and measurements, eg. shoe size, temperature
Continuous data
Data that can take any value on a continuous numerical scale, eg. length, mass
Discrete data
Data that can only take specific values on a numerical scale, eg. shoe size, number of pets
Qualitative data
Non-numerical observations, eg. colour, name
Categorical data
Data that can be sorted into non-overlapping categories, eg. colour
Ordinal data
Data that can be written in order/given a numerical rating scale, eg. league positions of football teams
Bivariate data
Involves pairs of related data, eg. age and price of cars
Multivariate data
Involves sets of three or more data values, eg. plant colour, height and leaf size
Class intervals
Groups within classes, which do not overlap (discrete data is sorted into these), eg. 1-10, 11-20
Primary data
Data which is collected by/for the person who is going to use it. This is done through direct observation, eg. through a survey or questionnaire.
Secondary data
Data which has been collected by someone else/from reference sources such as the National Statistics Office
Population
Everything or everybody that could possibly be involved in an investigation
Census
A survey or investigation with data taken from every member of a population
Sample
A smaller group of people/items taken from a population and used to represent it
Sampling units
People or items to be sampled
Sampling frame
A list of all the sampling units
Petersen capture-recapture method
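A method for estimating the size of a population: a first sample is captured, marked and released, then a second sample is captured and the marked members in it are counted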
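As a rough sketch of the arithmetic: if M members are marked in the first capture, and a second capture of n members contains m marked ones, the estimate is N = (M x n) / m. This assumes the marked proportion in the second sample matches the marked proportion in the whole population. The function name and the numbers below are made up for illustration.

```python
def petersen_estimate(marked_first: int, caught_second: int, marked_in_second: int) -> float:
    """Estimate population size N from a capture-recapture study.

    Assumes M / N = m / n, i.e. the proportion marked in the second
    sample equals the proportion marked in the whole population.
    """
    if marked_in_second <= 0:
        raise ValueError("need at least one marked member in the second sample")
    return marked_first * caught_second / marked_in_second


# Illustrative numbers only: 40 fish marked, 50 caught later, 8 of those marked.
print(petersen_estimate(40, 50, 8))  # 250.0
```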
Random sampling
Sampling units are chosen randomly; every member of the population has an equal chance of being included
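One possible way to picture this, assuming the sampling frame is just a list of names: Python's `random.sample` picks without replacement, so each unit has the same chance of being chosen. The helper name `simple_random_sample` and the student labels are hypothetical.

```python
import random


def simple_random_sample(sampling_frame, size, seed=None):
    """Pick `size` distinct units from the sampling frame, each equally likely."""
    rng = random.Random(seed)  # a seed only makes the example repeatable
    return rng.sample(list(sampling_frame), size)


# Hypothetical sampling frame of 30 students.
frame = [f"student_{i}" for i in range(1, 31)]
sample = simple_random_sample(frame, 5, seed=1)
print(sample)
```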
Judgement sampling
Personal judgement is used to select a sample that is representative of the population
Opportunity sampling
Only the people/objects available at the time are used
Cluster sampling
The data naturally splits into groups, eg. geographical areas. The list of clusters is the sampling frame, and clusters are randomly selected to make the sample.
Systematic sampling
Items are chosen at regular intervals using a sampling frame, eg. every 5th person is chosen
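A minimal sketch of the idea, assuming the sampling frame is an ordered list. Choosing the starting point at random within the first interval is a common convention but is an assumption here, not something the definition above states. The helper name is hypothetical.

```python
import random


def systematic_sample(sampling_frame, interval, seed=None):
    """Take every `interval`-th unit, starting from a random position
    within the first interval (the random start is an assumption)."""
    rng = random.Random(seed)
    start = rng.randrange(interval)  # index 0 .. interval - 1
    return sampling_frame[start::interval]


# Hypothetical frame of 20 numbered people; every 5th one is chosen.
frame = list(range(1, 21))
picked = systematic_sample(frame, 5, seed=0)
print(picked)
```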
Quota sampling
The population is grouped by characteristics, eg. age/gender. A set number from each group is selected for the sample, eg. 10 males aged 10-25
Stratified sampling
A sample which contains members from each stratum in proportion to the actual size of the stratum
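The "in proportion" part can be sketched as: number sampled from a stratum = (stratum size / population size) x total sample size. The function name and year-group figures below are invented for illustration, and rounding each stratum separately can occasionally leave the total one above or below the intended sample size.

```python
def stratum_sample_sizes(stratum_sizes: dict, sample_size: int) -> dict:
    """Share out a sample across strata in proportion to each stratum's size."""
    total = sum(stratum_sizes.values())
    return {
        name: round(size / total * sample_size)
        for name, size in stratum_sizes.items()
    }


# Made-up school with 300 students; a sample of 30 is wanted.
year_groups = {"Year 9": 120, "Year 10": 100, "Year 11": 80}
print(stratum_sample_sizes(year_groups, 30))
# {'Year 9': 12, 'Year 10': 10, 'Year 11': 8}
```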
Data collection sheet
A table or tally chart used to record results
Explanatory/independent variable
The variable that is changed to explain or affect changes in the response/dependent variable
Response/dependent variable
The focus of an experiment/the variable which is observed in an investigation
Laboratory experiments
Experiments conducted in a controlled environment
Field experiments
Experiments conducted in the test subjects' everyday environment - researcher has control over variables
Natural experiments
Experiments conducted in the test subjects' everyday environment - researcher has no control over variables
Simulation
Used to model real-life events and help with making predictions - easier and cheaper than actually conducting the experiment
Questionnaire
A set of questions designed to obtain data
Respondent
The person who is completing the questionnaire
Open question
A question which has no suggested answers
Closed question
A question which has a set list of suggested answers
Random response method
A method which uses a random event to decide how to answer a sensitive question, eg. 'Have you ever shoplifted?'
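Here is one way the estimate could work, under an assumed design that the definition above does not spell out: each respondent privately flips a fair coin, answers 'yes' on heads whatever the truth, and answers truthfully on tails. Then P(yes) = 0.5 + 0.5p, so the true proportion is p = 2 x P(yes) - 1. The function name and figures are hypothetical.

```python
def estimate_true_yes_proportion(yes_count: int, respondents: int) -> float:
    """Estimate the true 'yes' proportion from random response answers.

    Assumed design: fair coin; heads -> say 'yes', tails -> answer truthfully.
    So P(yes) = 0.5 + 0.5 * p, which rearranges to p = 2 * P(yes) - 1.
    """
    observed_yes = yes_count / respondents
    return 2 * observed_yes - 1


# Made-up survey: 75 of 100 respondents answered 'yes'.
print(estimate_true_yes_proportion(75, 100))  # 0.5
```

About half of the respondents are expected to say 'yes' because of the coin alone, which is why nobody's individual answer reveals the truth about them.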
Outlier/anomalous data value
A value that does not fit the general pattern of the data
Cleaning data
A process which identifies/corrects inaccurate data values/errors, removes units/symbols and decides what to do about any missing data
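A small sketch of what this might look like in practice, assuming a list of raw height readings typed with units attached. The helper `clean_value` is hypothetical; it strips units/symbols and flags missing or unreadable entries as `None`, leaving the decision about what to do with them to you.

```python
def clean_value(raw):
    """Return the reading as a float, or None if it is missing or unreadable."""
    if raw is None:
        return None
    # Keep only digits, the decimal point and a minus sign; drop units/symbols.
    kept = "".join(ch for ch in str(raw) if ch.isdigit() or ch in ".-")
    try:
        return float(kept)
    except ValueError:
        return None  # nothing numeric left, so treat as missing


# Made-up raw data, all meant to be heights in cm.
raw_heights = ["152cm", " 160 cm", "", "n/a", "149.5 cm"]
cleaned = [clean_value(value) for value in raw_heights]
print(cleaned)  # [152.0, 160.0, None, None, 149.5]
```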
Extraneous variables
Variables which you may not be interested in, but could still affect your results - these must be controlled
Control group
Used to test the effectiveness of a treatment - one group is given the treatment; the other (the control group) is not given anything
Matched pair tests
Two groups of people are used to test the effectiveness of a treatment; each individual in one group is paired with an individual in the second group (who has similar characteristics, eg. age, height)
Hypothesis
An idea that can be tested by collecting and analyzing data
