The end result of any data collection activity, assuming we do it correctly, is a consolidation of data that we can study and analyse to find relationships between variables.
The layout of that data collection must match the way analysis packages such as SigmaXL and Minitab view the data. Otherwise those packages will simply not be able to recognise the different data types in the worksheet, and no analysis can be undertaken.
Matching the needs of those packages is quite simple: use the top row to name the variables in the data collection, i.e. the names of the Y and the Xs, and then place the data directly underneath those headings.
An effectively laid out worksheet looks like this.
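As a rough illustration of that layout, the sketch below builds a small worksheet in Python and writes it out as CSV text, with the variable names in the top row and one data point per row underneath. The variable names (cycle_time_min, shift, batch_size) and all values are invented for the example; they do not come from any real project.

```python
import csv
import io

# Hypothetical variables: one Y (the primary metric) followed by two Xs.
HEADINGS = ["cycle_time_min", "shift", "batch_size"]

# Each inner list is one data point: a value for every variable, in heading order.
ROWS = [
    [12.4, "Day", 40],
    [15.1, "Night", 55],
    [11.8, "Day", 38],
]


def worksheet_as_csv(headings, rows):
    """Return CSV text with variable names in the top row and the data beneath."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(headings)  # top row: names of the Y and the Xs
    writer.writerows(rows)     # every later row: one data point
    return buffer.getvalue()


if __name__ == "__main__":
    print(worksheet_as_csv(HEADINGS, ROWS))
```

The point of the sketch is only the shape: one column per variable, one row per data point, and nothing else in the sheet.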
THE DATA COLLECTION PLAN
The data collection plan that matches this assembly of data looks like this.
The sampling plan guides us in the number of rows of data (i.e. data points) we collect and assemble in the data file.
Numerical variables are listed as secondary metrics, which we study differently from categorical variables.
In most cases, correlation analysis is the primary strategy for examining their relationship with the primary metric.
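To make the idea of a correlation analysis concrete, here is a minimal sketch that computes the Pearson correlation coefficient between one numerical X and the primary metric. Pearson's r is assumed here as one common choice; the analysis package may offer others. The column names and values are hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric columns."""
    if len(xs) != len(ys):
        raise ValueError("columns must have the same number of rows")
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two data points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and of cross-products around the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        raise ValueError("a column with no variation has no defined correlation")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical columns taken from a worksheet: a numerical X and the Y.
    batch_size = [38, 40, 55, 60]
    cycle_time_min = [11.8, 12.4, 15.1, 16.0]
    print(round(pearson_r(batch_size, cycle_time_min), 3))
```

A value near +1 or -1 suggests a strong linear relationship between the X and the primary metric, while a value near 0 suggests little linear relationship.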
KEY POINTS ABOUT DATA COLLECTION PLANNING
The key points are these:
(A) Our data collection (DC) plan is there to help us design the elements of the data worksheet.
(B) The list of variables in the DC plan - the primary metric, the categorical Xs and the numerical Xs - determines the column headings in the data worksheet.
(C) Every time we collect a data point for the primary metric, we also collect one data point for every other variable.
(D) The sampling plan guides us in how we collect the data and how many rows we collect.
(E) Because there can be a lot of variation in how people collect the numerical variables, we need to operationally define what those variables are and how they must be collected.
(F) Categorical variables don't need the same level of definition as numerical variables, because they are observed data, which makes it easy for data collectors to be consistent in what they record.
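Points (B) and (C) can be sketched as a simple check of a worksheet against its DC plan: the column headings should be exactly the variables listed in the plan, and every row should hold a value for every variable. The plan structure, function names and variable names below are hypothetical, chosen only for illustration.

```python
# Hypothetical DC plan: the variables it lists, grouped by type.
DC_PLAN = {
    "primary_metric": ["cycle_time_min"],
    "categorical_xs": ["shift"],
    "numerical_xs": ["batch_size"],
}


def expected_headings(plan):
    """Column headings implied by the plan: the Y first, then the Xs."""
    return plan["primary_metric"] + plan["categorical_xs"] + plan["numerical_xs"]


def check_worksheet(plan, headings, rows):
    """Return a list of problems found; an empty list means the layout is consistent."""
    problems = []
    expected = expected_headings(plan)
    # Point (B): headings must match the plan's variables (order is ignored here).
    if sorted(headings) != sorted(expected):
        problems.append(f"headings {headings} do not match plan variables {expected}")
    # Point (C): each data point needs one value for every variable.
    for number, row in enumerate(rows, start=1):
        if len(row) != len(headings):
            problems.append(f"row {number}: expected {len(headings)} values, found {len(row)}")
        elif any(value is None or value == "" for value in row):
            problems.append(f"row {number}: missing value")
    return problems


if __name__ == "__main__":
    headings = ["cycle_time_min", "shift", "batch_size"]
    rows = [[12.4, "Day", 40], [15.1, "", 55]]
    for problem in check_worksheet(DC_PLAN, headings, rows):
        print(problem)
```

Running a check like this before opening the file in an analysis package is one way to catch incomplete rows early; it does not replace the operational definitions described in point (E).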