Difference between revisions of "Data Screening"

Revision as of 18:33, 16 November 2019

Data Screening

Once data from a research study is gathered and has been entered into SPSS, researchers must examine their data to be sure they can validly interpret their results. Valid interpretation of data is reliant on two data features:

1. The data must meet the assumptions of the analysis procedure.

2. The data in the data file are "an accurate representation or transcription of what was provided by research participants as their original responses or what was provided by archival sources as original data" (Meyers, Gamst, & Guarino, 2017, p. 31).

Contribution by: Britany Kuslis, WCSU Cohort 8

Reference:

Meyers, L., Gamst, G., & Guarino, A.J. (2017). Applied multivariate research: Design and interpretation. Thousand Oaks, CA: Sage Publications.

Data Cleaning

Value Cleaning

Value cleaning is ensuring the values are "within the limits of reasonable expectation" within the "to the extent that it is possible...within the bounds of feasibility"(Meyers, Gamst, & Guarino, 2017, p. 32). For example, you want to ensure the age of a presumed adult is not 9 years old or that a response to an item rated on a likert scale of 1-5 is not a 6 or an otherwise value that is not within the bounds of the study.

Contribution by: Britany Kuslis, WCSU Cohort 8

Reference: Meyers, S., Gamst, G, & Guarino, A.J. (2017). Applied multivariate research: Design and interpretation. Thousand Oaks, CA: Sage Publications.

Outliers

Causes of Outliers

Detection of Multivariate Outliers

Detection of Multivariate Outliers: Scatterplot Matrices

Detection of Multivariate Outliers: Mahalanobis Distance

The Mahalanobis Distance statistic measures "the multivariate 'distance' between each case and the group multivariate mean (known as centroid) taking into account the correlations between the variables" (Meyers, Gamst, & Guarino, 2017, p. 52). This method is used to determine if there are scores that vary from the mean of a set of DV's. The Mahalanobis distances details how far a case is from the group center mass of the predictor or IV's. The greater the distance the higher the possibility of a multivariate outlier. According to Lawrence S. Meyers, Glenn Gamst and A.J. Guarino, "Each case is evaluated using the chi square distribution with a stringent alpha level of .001. Cases that reach this significance threshold can be considered multivariate outliers and possible candidates for elimination. This approach is also not without its critics (e.g., Wilcox, 2012) for alternative approaches to multivariate outlier detection" (Meyers, Gamst, & Guarino, 2017, p.53).

Identifying Multivariate Outliers with Mahalanobis Distance-->[1]

Mahalanobis Distance -->[2]

Contribution by: Britany Kuslis, WCSU Cohort 8

References:

Clapham, Matthew E. “Mahalanobis Distance.” YouTube, YouTube.com, 2016, www.youtube.com/watch?v=spNpfmWZBmg.

Grande, Dr. Todd. “Identifying Multivariate Outliers with Mahalanobis Distance.” YouTube, YouTube.com, 2016, www.youtube.com/watch?v=AXLAX6r5JgE.

Meyers, L., Gamst, G, & Guarino, A.J. (2017). Applied multivariate research: Design and interpretation. Thousand Oaks, CA: Sage Publications.

@@ Line 19: / Line 19: @@
 == Value Cleaning ==
+Value cleaning is ensuring the values are "within the limits of reasonable expectation" within the "to the extent that it is possible...within the bounds of feasibility"(Meyers, Gamst, & Guarino, 2017, p. 32). For example, you want to ensure the age of a presumed adult is not 9 years old or that a response to an item rated on a likert scale of 1-5 is not a 6 or an otherwise value that is not within the bounds of the study.
+''Contribution by: Britany Kuslis, WCSU Cohort 8''
+Reference:
+Meyers, S., Gamst, G, & Guarino, A.J. (2017). ''Applied multivariate research: Design and interpretation.'' Thousand Oaks, CA: Sage Publications.
 == Outliers ==

Difference between revisions of "Data Screening"

Revision as of 18:33, 16 November 2019

Contents

Data Screening

Data Cleaning

Value Cleaning

Outliers

Causes of Outliers

Detection of Multivariate Outliers

Detection of Multivariate Outliers: Scatterplot Matrices

Detection of Multivariate Outliers: Mahalanobis Distance

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools