Associated variables

Variables that are associated with one another numerically. Also called dependent variables.

Case

An individual in a sample dataset on whom one or more variables have been measured. Also called an observational unit.

Categorical variable

A variable with values (levels) that represent categories.

Causal relationship

A relationship between variables in which changing the value of one variable has an affect on the value of another variable.

Cohort

A group of individuals (cases) who share some characteristic(s) (e.g., all born in the same year).

Continuous variable

A numerical variable that can take any numerical value within an interval (e.g., a decimal number between 0 and 1).

Control group

Cases in the sample who do not receive a specific treatment being investigated.

Data

Measurements of one or more variables on a sample of observations.

Data matrix

A common way to organize data in which the matrix rows represent cases (observational units) and the matrix columns represent variables.

Dependent variables

Variables that are associated with one another numerically. Also called associated variables[The term "dependent variable" is sometimes used for a "response variable," but that convention is not used in this course.]

Discrete variable

A numerical variable that can only take numerical values with gaps between them (e.g., 0, 1, 2, ...).

Experimental study

A study in which researchers conduct an experiment to investigate the possibility of a causal connection between variables by controlling the values of the explanatory variable(s) for selected cases.

Explanatory variable

When we suspect one variable might causally affect another, we label the first variable the
explanatory variable. Also called a predictor variable.

Independent variables

Two variables that are not associated and have no evident relationship. [Do not mix this up with the concept of independent events in probability. Also the term "independent variable" is sometimes used for an "explanatory variable," but that convention is not used in this course.]

Levels

The possible values of a categorical variable.

Negative association

An association between two numerical variable in which an increase in one variable tends to be associated with a decrease in the other variable.

Nominal variable

A categorical variable in which the levels have no meaningful, natural order.

Numerical variable

A variable that can take a range of numerical values, and it is meaningful to add, subtract, or take averages with those values. Also called a quantitative variable.

Observational study

A study in which researchers collect data in a way that does not directly interfere with how the data arise.

Observational unit

An individual in a sample dataset on whom one or more variables have been measured. Also called a case.

Ordinal variable

A categorical variable in which the levels have a meaningful, natural order.

Placebo

A "sham" treatment with no known impact on a response variable (e.g., a sugar pill).

Positive association

An association between two numerical variables in which an increase in one variable tends to be associated with an increase in the other variable.

Predictor variable

When we suspect one variable might causally affect another, we label the first variable the
explanatory variable. Also called an explanatory variable.

Quantitative variable

A variable that can take a range of numerical values, and it is meaningful to add, subtract, or take averages with those values. Also called a numerical variable.

Randomized experiment

An experiment in which individuals (cases) are randomly assigned to a group (e.g., "treatment" or "control").

Response variable

When we suspect one variable might causally affect another, we label the second variable the
response variable.

Scatterplot

A graph of two numerical variables in which each sample point is plotted at the intersection of the value of one variable on the horizontal axis and the other variable on the vertical axis.

Statistics

The study of how best to collect, analyze, and draw conclusions from data.

Summary statistic

A single number summarizing the values of a variable for a sample of observations.

Treatment group

Cases in the sample who receive a specific treatment being investigated.

Variable

A characteristic of an observational unit that has been measured.

License

Share This Book