Science method

Exploratory Factor Analysis - Science method

Explore the latest questions and answers in Exploratory Factor Analysis, and find Exploratory Factor Analysis experts.
Questions related to Exploratory Factor Analysis
  • asked a question related to Exploratory Factor Analysis
Question
2 answers
In EFA, is it okay that each of the participants will answer all the items the questionnaires? For example, I have 50 items and I decided to have 500 participants. Is it okay if each of the participant will answer all the items?
Thanks.
Relevant answer
Answer
Yes, that would be ideal.
  • asked a question related to Exploratory Factor Analysis
Question
1 answer
After mapping all survey questions to KPI,Using Exploratory Factor Analysis, I got main loaded factors , which can be said as KPI, can than low loaded factors be considered as influencing factors to these KPI.
Relevant answer
Answer
Dear Sir,
Why you need to consider these factors has influence on main KPI's
I think these factors are different KPI's with minor effect.
you can consider some of these factors if you fill that they are still importance by increase the numbers of KPI's in your model.
if these factors are related to the main factors, PCA will eliminate them
Please check the attached paper
  • asked a question related to Exploratory Factor Analysis
Question
2 answers
Performing Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA) involves several systematic steps. Here's a detailed guide for each:
How to Perform Exploratory Factor Analysis (EFA)
  1. Prepare the Data:Collect Data: Ensure you have a sufficiently large dataset. Check for Suitability: Use the Kaiser-Meyer-Olkin (KMO) test and Bartlett’s Test of Sphericity to determine if your data is suitable for factor analysis.
  2. Select the Number of Factors:Eigenvalues: Use the Kaiser criterion (eigenvalues > 1). Scree Plot: Examine the scree plot to identify the point where the eigenvalues level off. Parallel Analysis: Compare the eigenvalues with those obtained from random data.
  3. Extract the Factors:Methods: Common methods include Principal Axis Factoring (PAF) or Maximum Likelihood (ML). Software: Use statistical software like SPSS, R (using the psych package), or SAS.
  4. Rotate the Factors:Orthogonal Rotation: Varimax rotation for uncorrelated factors. Oblique Rotation: Promax or Oblimin for correlated factors. Interpretation: Identify and label the factors based on the rotated factor loadings.
  5. Interpret the Results:Factor Loadings: Examine loadings to understand which variables correlate strongly with which factors. Communalities: Check communalities to understand the variance explained by the factors.
How to Perform Confirmatory Factor Analysis (CFA)
  1. Specify the Model:Theory-Driven: Define the number of factors and the loadings of observed variables on these factors based on theoretical expectations or previous research. Path Diagram: Draw a path diagram to visualize the model.
  2. Collect and Prepare Data:Sample Size: Ensure a large enough sample, typically a minimum of 200 observations or 5-10 times the number of parameters. Data Cleaning: Check for missing data and outliers.
  3. Estimate the Model:Software: Use software like AMOS, LISREL, Mplus, or R (using the lavaan package). Fit the Model: Use the software to estimate the parameters of the specified model.
  4. Assess Model Fit:Fit Indices: Evaluate fit indices such as Chi-Square (χ²), Root Mean Square Error of Approximation (RMSEA), Comparative Fit Index (CFI), and Tucker-Lewis Index (TLI). Modification Indices: Check modification indices to identify potential improvements to the model.
  5. Refine the Model:Modification: Make theoretical justifiable adjustments based on modification indices. Re-Estimation: Re-estimate the model after modifications.
  6. Interpret the Results:Factor Loadings: Examine standardized factor loadings to understand relationships between observed variables and latent factors. Residuals: Analyze residuals to ensure the model is capturing the data well.
Example using R for EFA and CFA
EFA Example (Using psych package)
# Install and load the psych package
install.packages("psych")
library(psych)
# Sample data
data <- read.csv("your_data.csv")
# Perform EFA
efa_result <- fa(data, nfactors = 3, rotate = "varimax")
# Print the result
print(efa_result)
CFA Example (Using lavaan package)
# Install and load the lavaan package
install.packages("lavaan")
library(lavaan)
# Define the CFA model
model <- '
Factor1 =~ X1 + X2 + X3
Factor2 =~ X4 + X5 + X6
Factor3 =~ X7 + X8 + X9
'
# Sample data
data <- read.csv("your_data.csv")
# Fit the CFA model
cfa_result <- cfa(model, data = data)
# Summary of the results
summary(cfa_result, fit.measures = TRUE)
By following these steps, you can perform EFA to explore the underlying factor structure of your data and CFA to confirm the hypothesized structure.
To give reference
Singha, R. (2024). How to perform Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA)? Retrieved from https://www.researchgate.net/post/How_to_perform_Exploratory_Factor_Analysis_EFA_and_Confirmatory_Factor_Analysis_CFA?_init=1
Relevant answer
Answer
You can add an item or respondent. Control respondent for focussing in asking quesion
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
I am running a PCA in JASP and SPSS with the same settings, however, the PCA in SPSS shows some factors with negative value, while in JASP all of them are positive.
In addition, when running a EFA in JASP, it allows me to provide results with Maximum Loading, whle SPSS does not. JASP goes so far with the EFA that I can choose to extract 3 factors and somehow get the results that one would have expected from previous researches. However, SPSS does not run under Maximum Loading setting, regardless of setting it to 3 factors or Eigenvalue.
Has anyone come across the same problem?
UPDATE: Screenshots were updated. EFA also shows results on SPSS, just without cumulative values, because value(s) are over 1. But why the difference in positive and negative factor loadings between JASP and SPSS.
Relevant answer
Answer
Hi dears.
Why in EFA results with JASP, I can't see the communality table of items?
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
I am running an Exploratory Factor Analysis (EFA) for a variable.
EFA showed that the first six initial eigenvalues were more than 1, so I needed to retain them according to Larsen & Warne (2010) and Shrestha (2021) stated that researchers retain all factors for which the eigenvalue is greater than 1.0 following the rule of Guttman (1954).
The seventh component was also more than one but less than 1.2. How can I justify that the seventh component can be removed? Because an initial eigenvalue less than 1.2 is typically considered insignificant and may justify the removal of a factor.
Therefore, I have been trying to find a reference to justify my action. Please provide me with references that mention that an initial eigenvalue less than 1.2 is typically considered insignificant and may justify removing a factor.
Thank you.
Relevant answer
Answer
Yes, you are correct!
Consider whether the inclusion of the seventh factor aligns with the theoretical framework of your research. If the first six factors adequately represent the constructs you're interested in, and the seventh factor does not contribute substantially to the understanding of the phenomenon under study (as you said), it may be justifiable to exclude it.
Eigenvalues represent the amount of variance explained by each factor. Generally, factors with eigenvalues greater than 1 are considered for retention. However, if the eigenvalue of the seventh factor is close to 1 and substantially lower than those of the first six factors, it suggests that the seventh factor accounts for relatively little variance and may not be essential for interpretation (it is visible in your results).
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
My research investigates the influence of training and development and performance appraisal on the competitive advantage of Nigerian private schools. It also explores the potential role of ethical climate as a mediating factor in this relationship.
In this study, a quantitative research approach was utilized to collect and analyze numerical data and to identify patterns and relationships. The analysis includes a variety of techniques, such as descriptive analysis, exploratory factor analysis, confirmatory factor analysis, and structural equation modeling. Widely recognized statistical software programs, SPSS and AMOS, have been used to perform these advanced analytical methods.
However, the reviewer comments that the variable construct is still based on theory, not empirically in Nigerian Private Schools. How can I do it or respond to it?
Relevant answer
Answer
I think the reviewer is expecting you to provide examples of a few studies of a similar nature to show that your study is not just based on theory but is based on gaps in the existing literature. Therefore, I'd suggest you to search and include a few studies that examine the impact of training and development and performance appraisal on the competitive advantage of schools.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
I am running an exploratory factor analysis for six constructs. Each of them has eight items. The first six factors were more than 1, so I needed to retain them, but the seventh component was more than one but less than 1.2. How can I justify that the seventh component can be removed because an initial eigenvalue less than 1.2 is typically considered insignificant and may justify the removal of a factor? I have been trying to find a reference to justify my action. Please provide me the references that mentioned that an initial eigenvalue less than 1.2 is typically considered insignificant and may justify the removal of a factor.
I appreciate any help you can provide.
Relevant answer
Answer
The eigenvalue of 1 (or even 1.2) criterion is not appropriate in most cases. It usually results in extracting too many factors.
Parallel analysis is considered the gold standard for deciding how many factors to extract which is the most consequential decision one makes in a factor analysis. It works by bootstrapping the extraction of eigenvalues for up to the number of items, based on a large number of randomized samplings (typically 1,000). The output includes percentile values for each number of factors. Choosing the number of factors corresponding to the 95th percentile becomes essentially applying a .05 test of significance to one's selection.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
I am running an exploratory factor analysis for six constructs. Each of them has eight items. The first six factors were more than 1, so I needed to retain them, but the seventh component was more than one but less than 1.2. How can I justify that the seventh component can be removed because an initial eigenvalue less than 1.2 is typically considered insignificant and may justify the removal of a factor? I have been trying to find a reference to justify my action. Please provide me the references that mentioned that an initial eigenvalue less than 1.2 is typically considered insignificant and may justify the removal of a factor.
I appreciate any help you can provide.
Relevant answer
Answer
Factor selection criteria that are based on arbitrary eigenvalue cutoffs such as 1 or 1.2 are not very useful. There is little theoretical justification for a particular cutoff, and, oftentimes, these arbitrary cutoffs do not lead to meaningful factor solutions. Have you tried parallel analysis? See
Preacher, K. J., & MacCallum, R. C. (2003). Repairing Tom Swift's electric factor analysis machine. Understanding statistics: Statistical issues in psychology, education, and the social sciences, 2(1), 13-43.
Also, it sounds like you already have a theoretically expected number of factors (six), so why not use CFA (rather than EFA) in the first place?
  • asked a question related to Exploratory Factor Analysis
Question
2 answers
I need to access and read academic studies that use the exploratory factor analysis (EFA) as a analysis method with no more than 10 initial factors. I want to explore if this is academically possible and if there are studies that show the feasibility. If you can help me out with that, i would highly appreciate your efforts. Thank you very much guys :)
Relevant answer
Answer
Exploring the Factors Influencing Consumer Purchase Intention in Online Shopping: A Case Study of Apparel Products"
ORAssessing Quality of Life in Patients with Chronic Illness: A Multidimensional Approach"
  • Researchers: Garcia, M., et al.
  • Published in: Health Psychology Research, 2018
  • Methodology: Used EFA with 9 initial factors to examine quality of life aspects among patients with chronic illnesses.
  • Findings: Identified factors related to physical health, emotional well-being, social support, and coping strategies.These examples demonstrate the academic feasibility of using EFA with no more than 10 initial factors
  • asked a question related to Exploratory Factor Analysis
Question
8 answers
with three factors, the total variance explained is 53%. With two factors, the total variance is 45%. Can I keep one item for a given factor if I think that item is important to be kept in the tool? That item will not be loaded into other factors when I select the two-factor model.
Relevant answer
Answer
Nilanthi Suraweera In addition to David L Morgan suggestions, I think you should try other methods, PFA with oblique rotations or PCA with orthogonal or oblique rotation. I come across Hair et al. book "Multivariate Data Analysis," if you have 30 or more items, both PFA and PCA produce same results. My suggestion is you should try various methods of extractions and rotations and the result is meaningful to interpret, I think it might work. If not, may be you may want to consider on your sample size and data cleaning before EFA (I think both of which you can find in the book I mentioned).
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
Hi!
This might be a bit of a stupid question, but I am currently writing my master thesis. One of the things I am doing is a factor analysis on a scale developed in Canada. This scale has only been validated on the Canadian workforce (the developers have one time done a exploratory factor analysis and two times done a confirmatory factor analysis). I am doing an exploratory and a confirmatory factor analysis in the Norwegian workforce to see what factor structure I would find here, and if it is the same as in Canada. As this this only one of three things I am doing in my masters I have hypothesis for all the other findings, so my supervisor would like me to have hypothesis for the factor structure as well. Whenever I try to come up with some arguments, I always feel like I am just arguing for the same attitudes in both countries, rather than the factor structure.
My question is: How do you make a hypothesis for this where you argue for the same/a different factor structure without arguing for the same/different attitudes?
Thank you in advance! :)
Relevant answer
Answer
Factor analysis can be used to identify the factors that contribute to the structure of the workforce.
It can as well be used to identify the key skills and competencies that are required for different roles in an organisation, gaps in the current workforce and also develop strategies to address these issues.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
I did EFA on a set of Likert Scale data.
I got 3 factors based on parallel analysis.
Factor 1 - 15 items
Factor 2 - 5 items
Factor 3 - 3 items
Didi CFA and got good fit.
Is it possible to reduce the number of factors to say 4 or 5?
I tried reducing it, did cronbach's alpha ..before was like 0.93, after is 0.92-ish
BTW, factor loadings of all 14 of the 15 items are also quite high - 0.69 to 0.92
I'm trying to reduce the number because in SEM, the 15 items is a lot of freeparameters resulting in identification issues.
If you have suggestions, let me know. Thanks!
Relevant answer
Answer
Thanks for your suggestion Christian! I will try this and hope to have some good results. Trying SEM for the first time is both difficult and fulfilling. I'm happy there are online communities willing to help out.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
Hi,
I did an exploratory factor analysis. Direct Oblimin and Principal Axis factoring. The 12 variables loaded on 2 factors. The factor loadings on factor 1 (column one in pattern matrix) and factor loadings on factor 2 (column two in pattern matrix) are highly negatively correlated (R2=0.991).
Is this normal or problematic? Thank you!
Relevant answer
Answer
I don't think it is common to report the correlation between loadings on different factors. I would report (among other things) the actual loadings (pattern matrix), the structure matrix (correlations between variables and factors), and the correlations between the factors.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Can I use exploratory factor analysis on the same data after an unfavorable result of confirmatory factor analysis? In the procedure where I first apply exploratory factor analysis on a data set, then it is no longer correct (and there are methodological and statistical reasons for this) to use confirmatory factor analysis on the same data set. But what about the first case (from CFA to EFA on the same data)?
Relevant answer
Answer
I think it is fine to use EFA after CFA to explore the reasons for the misfit of a CFA model. This can sometimes help you find out whether a solution is "almost correct," for example, when the number of factors and factor pattern (loadings) as such are as you expected but there are one or more non-zero cross-loadings. In other cases, EFA may reveal that you need more factors then expected, that is, that the factor structure as such is different from the original CFA model.
  • asked a question related to Exploratory Factor Analysis
Question
1 answer
I am compiling factors from previous literature review for exploratory factor analysis.
Q1) How to extract the data into csv file?
Q2) is it more reliable to use SMARTPLS 4 to do EFA?
Q3) is it possible to use jamovi to do EFA?
Q4) how do I prove the reliability and validity of using jamovi as the tool?
Thank you
Relevant answer
Answer
I might not be able to answer all your questions, but I'd like to share my literature review experience regarding factors. While conducting the review, I primarily utilized thematic analysis and closely examined the factors mentioned in previous literature. By doing so, I was able to identify a research gap in this area. To aid my analysis, I used the software Atlas.ti, which helped me extract relevant data into a table format.
Meanwhile for EFA, I do not have experience in using SmartPLS. Normally I use SPSS to examine the EFA. It is also possible to use jamovi, and you can get the same result. Recently, Jamovi has been widely used for statistical analysis. So, you can refer to other related articles to prove the validity and reliability of the software.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
I would like to find out if there is any impact of social media on the creation of entrepreneurial opportunity. I have two groups (Online business entrpreneurs and traditional business entrepreneurs) and 20 statements in total that I have measured using 5 point likert scale (strongly agree, agree, neutral, disagree, strongly disagree and Not applicable(suggested by my Supervisor)). Those who run traditional business chose "not applicable" for 8 items and I have ignored those 8 items and proceed with rest of the 12 items for further analysis e.g. Mann-Whitney U test, Kruskal-Walis test and correlation analysis.
Now I have 3 questions:
1. Is it okay to ignore those items having "not applicable"?
2. I have tried exploratory factor analysis for both groups and got 3 factors for each, however, one factor among them has only two items under it. Is two items under a factor acceptable?
3. What other tests or analysis I can do other than the above mentioned tests?
Relevant answer
Answer
@David Thank you for your response. I tried factor analysis according to your suggestion and found out 3 factors. One factor having 5 items but having both positive and negative statements. For example:
1. It is easier to spread information via socia media.
2. It is easier to promote products via social media.
3. It is less costly to run business via social media.
4. It is tough to protect customers personal information due to cyber-attacks.
I thought of naming the factor as "Perceived advantages of Social Media", however, the 4th item seems negative to me. Is there any suggestion from your side to solve this??
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
I conducted an Exploratory Factor Analysis using Principal Axis Factoring with Promax rotation, resulting in the identification of 8 factors. Now, I aim to examine potential associations between these extracted factor scores (treated as continuous variables) and other variables in my research. However, I am uncertain about the most appropriate statistical approach to compute these factor scores.
Should I compute the average sum of each factor, employ regression, or utilize the Bartlett method, Anderson-Rubin, any others….?
Any insights or alternative approaches are highly appreciated.
Thank you!
Relevant answer
Answer
Hello Christian-Joseph,
Here's a link to a very readable introduction to some of the considerations associated with different alternatives to the computation of factor scores:
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Hello,
I have a data set with N = 369 individuals measured at a single time point. The goal of the study is to create an assessment of psychological safety (PS). The assessment is a self-report measure asking participants to indicate how psychologically safe they feel using a unipolar 5-point Likert scale ranging from 1 (not at all) to 5 (extremely).
In addition to the assessment I am creating, I also measured a number of demographic variables (e.g., age, salary) and a few additional measures of team environment for validation (e.g., an existing measure of PS, level of team interdependence).
My primarily goal is to run exploratory factor analysis (EFA). This is the first time anyone has conceptualized PS as multidimensional, so one of the primary goals is to uncover the potential factor structure of PS. Also, to identify candidate items for deletion.
In order to prepare for the EFA analyses, I am cleaning the data by following recommendations in (the excellent) Tabachnik & Fidell (2013, 6th ed).
I am currently at the point where I am checking the data for multivariate outliers, starting with Mahalanobis distance. And I cannot find explicit guidelines regarding which variables I should be including as "IVs" in the analysis.
QUESTION: Which variables should I be including in my search for multivariate outliers? Do I include all variables, or only my target variables?
Specifically, do I include only the variables that represent the item pool for my forthcoming PS assessment? Or do I include all the PS items AND demographic variables, the existing PS assessment, interdependence measure, etc.??
I ran the Mahalanobis distance analyses 2 times using both approaches, and found substantial differences:
  • TIME 1 - With just the PS assessment variables --> I identified n = 28 multivariate outliers.
  • TIME 2 - With PS items + demographics, etc. --> I identified n = 10 multivariate outliers (all identified as outliers in the TIME 1 analysis).
Syntax I am using - the bolded variables are the ones I am questioning if I should include or not:
REGRESSION
/MISSING LISTWISE
/STATISTICS COEFF OUTS R ANOVA COLLIN TOL
/CRITERIA=PIN(.05) POUT(.10)
/NOORIGIN
/DEPENDENT Subjno
/METHOD=ENTER Age Salary Edu WorkStructure TeamSize Tenure_OnTeam JapaneseBizEnviron EdmondsonPS_TOT Interdep_TOT Valued_TOT PS_1 PS_48 PS_141 PS_163 PS_43 PS_53 PS_73 PS_133 PS_135 PS_19 PS_60_xl26 PS_93 PS_106_xl26 PS_143 PS_58 PS_86 PS_182 PS_56 PS_69 PS_103 PS_164 PS_22 PS_35 PS_91 PS_30 PS_59 PS_63 PS_90 PS_131 PS_140 (**Note, PS assessment var list is truncated b/c large number)
/RESIDUALS = OUTLIERS(MAHAL)
/SAVE MAHAL.
Relevant answer
Answer
Thank you so much, both of your answers were extremely helpful! I ultimately made the decision to run the analyses 3 times and then compare results - (1) full data set, (2) with n = 10 multivariate outliers removed, and (3) with n = 28 multivariate outliers removed. It was time-consuming, but luckily there were very few differences in results, so I didn't have to make any hard decisions.
I will have to look into the analyses David Morse suggested. In selecting my rating scale labels, I used research performed by Beckstead (2014). This study attempted to map the strength of adjectives that are typically attached to Likert scales - thus allowing researchers to select rating scale labels that are "spaced an equal distance apart," and thus to proceed with analyses under a crude assumption of interval scaling. But given this spacing assumption is based on single study, its hardly satisfactory.
Thank you again both of you!
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
i’m trying to run polychotic correlation with Stata v13, but I’m confused.
Relevant answer
Answer
Below is a do-file where you will find detailed information on how to do using Standard methods of performing factor analysis (polychoric, factormat):
  • asked a question related to Exploratory Factor Analysis
Question
6 answers
Hey guys,
I have found two multi-item scales in my previous research regarding my master thesis. I want to know if I can compute an EFA for the dependent and for the independent variable?
Relevant answer
Answer
Yes you can run exploratory factr analysis for regressor variable and criterion
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
I'm doing a validation study and for construct validity, I'll analyze my data through EFA and CFA. So, the same data should be used in both processes or should we divide the data? And if we divide the data what percent of the data should be included in EFA and what percent in CFA?
Relevant answer
Answer
As mentioned earlier, it is not recommended to use the same data for EFA and CFA. As far as dividing the dataset into two for EFA and CFA is concerned, there are no specific criteria (at least, I did not come across them in the last 13 years). While 50-50 split could be one approach, another could consider the statistical power - Large enough to produce reliable factors in EFA, for example.
The following references will be helpful:
Beavers, A. S., Lounsbury, J. W., Richards, J. K., Huck, S. W., Skolits, G. J., & Esquivel, S. L. (2013). Practical considerations for using exploratory factor analysis in educational research. Practical Assessment, Research, and Evaluation, 18(1), 6.
Kyriazos, T. A. (2018). Applied psychometrics: sample size and sample power considerations in factor analysis (EFA, CFA) and SEM in general. Psychology, 9(08), 2207.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Can you please help me on how to use Exploratory Factor analysis as a Method of my research, actually my research is all about the Dimension of illegal activities commission. Please -I'm new researcher
Relevant answer
Answer
It will probably help people to answer if you say which books and other sources that you have been reading to learn about this, which software you are most comfortable with, why you want to use EFA as opposed to other methods like CFA or PCA, provide more information about your research, and provide more specific questions.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
What is meant by collecting data separatly for newly developed questions to run EFA in PLS-SEM? Like do i need to make to separte questionnaire sets one consisting of questions adopted from prior literatures and the other for the self developed questions?
Is it possible to run EFA from the data collected by pilot study and for the empirical work only running the CFA?
Please suggest.
Thanks in advance.
Relevant answer
Answer
Newly developed questionnaires mean that the construct validity of the scale is not yet checked, so to ensure construct and discriminant validities researchers normally go for EFA followed by CFA.
The following sources may help:
(17) (PDF) An Easy Approach to Exploratory Factor Analysis: Marketing Perspective Noor Ul Hadi (researchgate.net)
(17) (PDF) Specifying the Problem of Measurement Models Misspecification in Management Sciences Literature (researchgate.net)
  • asked a question related to Exploratory Factor Analysis
Question
10 answers
Dear RG community,
I have been trying to help my PhD student with an EFA. We have ended up with a very nice stabilised solution (using principal axis factoring, varimax rotation and extracting a fixed number of factors) apart from the determinant of the correlation matrix being too low. We had initially checked for correlations > 0.8 in absolute value but did not find any so we went back and removed one item from each pair where the bivariate correlation was > 0.7 in absolute value. This improved the determinant but it was still considerably lower than the recommended threshold of 0.00001. What do you suggest we should do: continue to reduce the bivariate correlation threshold and remove more items, just live with the low determinant value as what we are really interested in is the scales themselves derived from the EFA, or is there another way to detect and remove multicollinearity?
Thanks very much for your expert advice.
Relevant answer
Answer
Hello Maksim,
Factoring, and replacing individual variates as IVs with factor scores to which they contribute, _is_ one way to deal with collinearity (most often in linear models, such as regression), so long as you do not rotate the factor structure obliquely.
However, the original query for this thread was asking about the presence of multicollinearity in EFA, so that was the question to which I was responding.
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
Hello, all!
I have translated and heavily edited a validated survey for use in a new context. This involved changing language, context, and cutting items from 56 to 30.
I'm now trying to find the underlying factor structure of this data, which I assume will differ from the 8 factors in the original 56-item scale.
For the purposes of my research, I tried to have respondents compare their expectations with their preferences for a given item. So, an item would present a statement such as (hypothetically) "During class, students make connections between content from different subjects." This would be followed by two Likert-type questions: "I would like this to be true" and "I expect this will be true". Repsondents answered each of these sub-questions for every item, on a 6-point scale (no neutral).
I'm a bit stuck, conceptually, on how to approach factor analysis with these paired questions. Conceptually, I'd assume that it shouldn't matter whether I incuded both "Q_a" and "Q_b", since those should load onto the same factor. But then, I think, the nature of the questions might confound such an analysis.
Does anyone have any wisdom or literature on how this type of paired response factor analysis has been done?
Thank you!
Relevant answer
Answer
Hello again Ad,
You could use discrepancy scores between judged importance and expectation as a variable, and characterize it as akin to a goodness of perceived fit indicator for an individual.
Whether these resultant values are better handled as a scale vs. grouped, common factors depends on a number of concerns (again, back to specific research question/s, as well as data collection scheme, empirical data characteristics). As you've indicated that these data would be collected at a common time (once; at beginning of term/year), that helps to avoid the volatility influence.
Ordinal IRT models don't require collapsing variable scores into dichotomous form.
LCA can use ordinal scores as well as dichotomous scores for categorical variables. However, if you are successful in scaling or factoring the fit scores, you can argue that the resulting scale scores or factor scores are interval strength, and use one of many clustering methods instead of LCA. I mention this because it sounds as if you are very interested in subtyping your cases into qualitatively distinct groups. Alternatively, cases may be the objects of interest in factoring instead of variables. In Cattell's explanation, this is referred to as "Q-type" factoring.
Good luck with your study.
  • asked a question related to Exploratory Factor Analysis
Question
6 answers
Dear researchers, can both EFA and CFA be applied to the database obtained at the end of the scale development process at the same time?
Relevant answer
Answer
Hello Ömer,
I'm uncertain as to what you mean by "...at the same time."
If you are uncertain as to the underlying structure for a set of variables, and have no prior evidence, claim, or theory as to what that structure should be, then EFA is certainly a viable option for exploring and making initial judgments as to what factor space might be appropriate.
Once you determine a defensible candidate structure, it is certainly a good idea to verify that the structure can adequately account for observed relationships among the variables by using CFA with data not used to derive the EFA.
The reason for not applying to the very same data as used for the EFA is the concern that the EFA process can be opportunistic, resulting in a solution which overfits the data set from which it was derived, and therefore may not generalize.
You can find a number of published studies in which author/s have taken part of data set to derive a candidate structure via EFA, then evaluated that structure, via CFA, with the remainder of the data set.
If this doesn't address your concern, perhaps you could elaborate.
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
1- I determined exploratory factor analysis (eight dimensions and 37 questions were finalized).
2- Then I checked the measurement model. In the measurement model, only 13 questions had factor loadings greater than 0.5.
3- In this case, the previous dimensions taken from the exploratory factor analysis cannot be used, because one has been removed and the three dimensions have only 2 questions.
4- In this new mode, it is not possible to define a model with hidden variables, because we only have the indicators. Because the number of indicators for each dimension in this new mode is not enough, so the dimensions cannot be considered. Because according to Hair (2010), at least three questions are needed for each variable.
Question:
What do you recommend?
Relevant answer
Answer
What are you measuring? Considering the number of dimensions and factors, I suspect that you are measuring a well-defined concept. Factor analysis is a tool that should be based on a strong theory. Also, providing information on the parameters used in EFA is helpful. How did you come up with the measurement model? The measurement model assumes a well-established factor structure. It seems that you didn't establish that in EFA. If EFA is not providing a stable solution, CFA will not provide any better solution.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
It is welcome any study from any field, as much as possible :)
Thank in you in advance
Relevant answer
Answer
One way to identify articles that use exploratory factor analysis (EFA) factor by factor is to observe whether they report the results of the Bartlett's test of sphericity for each factor. Barlett's test reports whether there is a possible factor solution for the entire set of variables being analysed; if researchers report a p-value and/or statistic for this test for each factor, it means that the EFA was necessarily performed on a factor-by-factor basis.
As complementary information, EFA is not useful as a preliminary step to scales that have already been validated in the literature, and clearly performing it on a factor-by-factor basis goes against the main aims of EFA.
However, I believe that the use of EFA on a factor-by-factor basis is a much more widespread practice than we might think.
Moyano-Fuentes, J., Maqueira-Marín, J.M., Martínez-Jurado, P.J. and Sacristán-Díaz, M. (2021), "Extending lean management along the supply chain: impact on efficiency", Journal of Manufacturing Technology Management, Vol. 32 No. 1, pp. 63-84. https://doi.org/10.1108/JMTM-10-2019-0388
I have even found some CFA factor-by-factor examples:
Tortorella, G. L., Giglio, R., & van Dun, D. H. (2019). Industry 4.0 adoption as a moderator of the impact of lean production practices on operational performance improvement. International Journal of Operations & Production Management, 39(6/7/8), 860–886. doi:10.1108/ijopm-01-2019-0005
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
Good morning,
I am not an expert on Factor Analysis, so I hope the explanation of my problem makes sense.
My current task is to perform analyses on an older data set from experiments my lab conducted a couple of years ago. We have 212 individual items, consisting of a dozen or so demographic questions and items from a total of 24 different scales that measure separate constructs. Given the large number of items and constructs, I would like to reduce the number of dimensions to achieve a more clear starting point for theory development. Obviously, an exploratory factor analysis is a good choice for this.
My question is whether I have to input the 200 or so individual non-demographic items in the EFA, or whether I can instead just use the 24 composite variables/constructs and reduce the number of dimensions from there. My hesitation using all the individual items is that an EFA would simply return something very similar to the composite variables, as the constructs generally have a quite high internal consistency and are fairly distinct from each other based on theory. The obvious caveat with using the composite variables is that it is not something I have seen done much, and as I am not an expert on EFA, I am unsure as to whether there is a major methodological road block to using composite variables that I am unaware of.
Thank you for your help!
Best wishes,
Pascal
Relevant answer
Answer
Good morning,
In your situation, you can perform an EFA on the 24 composite variables/constructs instead of the 200+ individual non-demographic items. This approach helps you reduce the number of dimensions and is suitable when the constructs have high internal consistency and are theoretically distinct. It's important, however, to ensure the composite variables are appropriately calculated (e.g., by averaging or summing the items) and to verify that the assumptions for EFA are met. Additionally, consider interpreting the results cautiously and further validating them, as using composite variables in EFA is less common compared to individual items.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
The result of the exploratory factor analysis for the measurement tool I used is not compatible with the original scale. For example, in the original study, item 3 is in factor 1, but in the study I conducted, it is in factor 3. In general, all items in the scale show factor loadings in different sub-dimensions than the original ones. How should I proceed?
Relevant answer
Answer
When your EFA results don't match the original scale, first re-examine your data and methodology. Consider differences in sample characteristics, cultural or contextual factors, and any potential methodological issues. You may choose to report the new factor structure if it is theoretically justifiable and supported by your data. Additionally, assess the reliability and validity of the new structure before proceeding with further analysis.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Hi
I'm going to do a cluster analysis, after conducting an exploratory factor analysis for a series of Likert scale questions.
The cluster analysis will of course include other variables such as demographics, together with the factor analysis products.
My question is, do I use Factor Score (regression based or Bartlett based), for those 'Factors' from EFA or do I use mean score (average of the Likert scale questions that manifest individual factor)?
I know factor score has many advantage than mean score, but factor scores have a standard deviation of 1, and mean of 0. It's hard to interpret the later on cluster analysis based on these. While mean score seems to make more sense to interpret.
So, which score shall I use for the later on cluster analysis? mean score or factor score, or something else?
Thank you.
Relevant answer
Answer
Hi, I would suggest you to consider a simultaneous method instead a sequential one. Tandem analysis results intuitive and straightforward, however it may not yield an optimal cluster allocation as the two methods (dimensionality reduction and clustering) optimize different criteria. Dimension reduction typically aims to retain as much variance as possible in as few dimensions as possible, whereas cluster analysis aims to find similar and dissimilar observations in the data set and allocate the observations accordingly to clusters. Many methods have been proposed throughout the years. In particular, for continuous (or, interval) data you can consider reduced K-means (De Soete and Carroll 1994), factorial K-means (Vichi
and Kiers 2001) as well as a compromise version of these two methods. For categorical data, you can consider cluster correspondence analysis (Van de Velden, Iodice D’Enza, and Palumbo 2017), which, for the analysis of categorical data, is equivalent to GROUPALS (Van Buuren and Heiser 1989), multiple correspondence analysis and K-means (MCA K-means, Hwang, Dillon, and Takane 2006), and iterative factorial clustering of binary variables (i-FCB, Iodice D’Enza and Palumbo 2013). A nice R pkg implementing all these method is clustrd (Markos, Iodice D'Enza and van de Velden, 2019). I hope this helps. Carlo
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Hi all,
Kindly help me to know when can we use Exploratory Factor Analysis(EFA) and Confirmatory Factor analysis(CFA)?
Regards,
Relevant answer
Answer
In addition to the resources provided by Holger Steinmetz , I would like to add that EFA is almost always unnecessary unless you are really exploring the number of factors. In most cases, we are not doing that. We usually have at least some ideas related to which variables should measure which and (how many) factor(s). Therefore, CFA is typically the more appropriate and more powerful (more flexible) method because it allows us to test our a priori hypotheses about factor structures and to compare competing factor models statistically.
  • asked a question related to Exploratory Factor Analysis
Question
7 answers
In the context of regression, methods for detecting collinearity are well described in the literature. In the context of exploratory factor analysis (EFA), however, I am facing a situation where I get highly unstable factor loadings and factor correlations from different boot-strapped samples (despite a sample size of more than 700), which I assume is due to multicollinearity. I am not sure how to explore its source. The common methods rely on the intercorrelation among explanatory variables (i.e., the latent factors in EFA), but the intercorrelation its-self is highly unstable.
My search for finding references on this topic was not much successful. Any explanation and/or reference on multicollinearity in EFA would be appreciated.
Thank you in advance,
Ali
Relevant answer
Answer
Hi,
there is (to my knowledge) no such thing as colinearity in factor models. To the contrary: Highly correlating indicators should lead to a much more stable situation was those indicators would be clearly load onto the same factor.
Best
Holger
  • asked a question related to Exploratory Factor Analysis
Question
2 answers
Hello,
I applied exploratory factor analysis with network analysis to data from healthy and diseased patients. The analysis shows different clusters of parameters; some are similar in both groups, and some groups cluster differently. For instance, the parameters IleValLeu are similarly clustered in both groups whereas the parameters Pro Ala are not (see figure).
How shall I interpret the data? For instance, in the Pro/Ala case, I expected to see some differences between the groups, but they look pretty much the same to me.
Is the differences about correlation? The scatterplot of the data shows slightly different regression models, but not something compelling.
Is it the value itself? But again there is no real difference in the value distribution between the two groups.
So, what is the actual outcome of the network analysis?
Thank you
Relevant answer
Answer
Dear Luigi,
if I understood your question correctly, I would say that to interpret results of the network analysis you need to understand the components of the network. What are nodes? What are links? What does it mean if a link connects two nodes? What does it mean if nodes are not connected?
For example, if I read your network correctly, you can easily see that Ala and Pro are correlated stronger in your decease network, compared to the control. Is it significant? Is it important? An answer is beyond network analysis, but lays in your experiment design and other information you have. You can apply some statistics to test if difference in a strength of correlation is statistically significant, but you still need to add a biology level for interpretation.
What is good about network analysis, that you can use graphs theory based properties to get more layers of differences between your conditions, if, again, it makes sense for your data.
  • asked a question related to Exploratory Factor Analysis
Question
6 answers
Hypothetically, if I would like to validate a scale and I need to explore its latent factors first with EFA followed by a CFA to validate its structure, do I need a new data set for the CFA? Some argued that randomly spliting one dataset into two is also acceptable. My question is that can I apply EFA on the dataset (with all data included) and then randomly select half of the dataset to conduct a CFA?
Relevant answer
Answer
Shashi Kant Singh
Thank you kindly for your answer Dr. Singh.
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
Dear Research Scholars, you are doing well!
I am a Ph.D. scholar in Education. Now I am working on my thesis. kindly guide me that when to perform the EFA whether it use on pilot studying data or actual research data.
Regards
Muhammad Nadeem
Ph.D. In Education , faculty of Education,
University of Sindh, Jamshoro
Relevant answer
When you need to hypothesise a connection between variables, exploratory factor analysis is the method to apply.
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
I am conducting an exploratory factor analysis and to determine the number of factors I used a paired analysis.
How can I generate the number of factors correctly in stata? Or other tool?
When using parallel analysis in stata, for example, if you proceed with Principal Axis Factoring all my Eigenvalues from the Parallel Analysis using a Principal Axis Factoring lower than 1. (Suggests in this case to retain all factors)
when out of curiosity, I use principal component factors, or even principal component analysis (I know this is not EFA), it suggests retaining 3 factors (which satisfies me)
Relevant answer
Answer
Hi there,
the most basic question is whether a common factor model (which underlies EFA) is reasonable. Are your indicators of a kind that makes the assumption reasonalbe that they were caused by one or several underlying common causes? How does the correlation matrix looks like?
If a PCA is better than this suggests that the former assumption is unreasonable. This is fine but keep in mind that a PC is a simple composite of the indicator without any surplus meaning.
HTH
Holger
Fabrigar, L. R., Wegener, D. T., MacCallum, R. C., Strahan, E. J., MacCllum, R., & Strahan, E. J. (1999). Evaluating the use of exploratory factor analysis in psychological research. Psychological Methods, 4(3), 272-299. https://doi.org/10.1037/1082-989X.4.3.272
Russel, D. W., & Russell, D. W. (2002). In search for underlying dimensions - the use (and abuse) of factor analysis in personality and social psychology bulletin. Personality and Social Psychology Bulletin, 28(2), 1629-1646. https://doi.org/10.1177/014616702237645
Preacher, K. J., & MacCallum, R. C. (2003). Repairing tom swift's electric factor analysis machine. Understanding Statistics, 2, 13-43.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
When designing the questionnaire for EFA, what do I need to keep in mind when it comes to the order of the questions?
More specifically, does the order of the questions need to be completely randomized or is it generally allowed to still ask questions in topic blocks according to potential factors/constructs I have in mind?
Thanks everyone!
Relevant answer
Answer
A randomized order of items can have the advantage that it may prevent response styles (such as saying "yes" to every item, skipping items, other careless response styles). When items are arranged according to topic/same attribute, participants may get bored and employ a careless response style, especially when the items are similarly worded. They may also find it easier to figure out what attributes you are trying to measure, which may or may not be desirable for your study.
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
For my research project I am adding new items to a previously validated scale. in the previous research they performed an exploratory factor analysis revealing a two-factor structure, but the internal consistency of one of the scales was quite poor so the aim of my study was to add new words to improve the internal consistency. so do i need to do another exploratory factor analysis as the scale will now have new words or can i do a confirmatory factor analysis because i'm still using the same scale?
Relevant answer
Answer
Megan Burkinshaw, I'm really sorry: I have only just now (~18 months later) noticed your question to me. It seems I overlooked a notification concerning it. Although you've probably moved on, I'll answer in case it helps others.
My answer is, yes, I do recommend doing EFA when new words (I assume you mean items, not words - but the same probably applies to words) have been added to a scale.
Feel free to get back - and, again, my apologies for a late reply.
  • asked a question related to Exploratory Factor Analysis
Question
7 answers
I did the confirmatory factor analysis of the questionnaire. If in each subscale I select the questions that have the highest factor load and I consider these questions as a short version and do other steps to verify the validity and reliability of the questionnaire, is this the correct way?
by doing exploratory factor analysis, I have less sub scale in comparison with sub scales in the long-form questionnaire.
Relevant answer
Answer
Can anyone recommend a research paper pr chapter detailing best practice for developing a short version of an exusting scale we have developed?
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
In exploratory factor analysis, to name the dimensions, some items are different and heterogeneous from other items of the same dimension, and they make it difficult to name each category. For example, in the first category "personality type of the manager", in the 3rd category, two items "level of analysis of the spouse" and "wife's perception of the existence of social justice", in the 5th category, the item "organizational level of the manager (senior, middle, operational manager) ", in category 6, the item "Education level of wife" and in category 7, the item "Job transition of wife".
Regardless of the heterogeneous items above with other items in the same category, the naming of each category has been done as follows. The question is: what to do with heterogeneous objects? Are they deleted?
Category 1. Family intimacy: spouse's relationship with family; years of married life; wife's compassionate analysis of the company events recounted by the manager; Manager's personality type
Category 3. Manager's attitude towards his wife: manager's mental health; The amount of analysis of the wife; Manager's leadership style; The manager's level of trust in his wife; Wife's perception of the existence of social justice
Category 5. Spouse's power of influence on the manager: manager's organizational level (senior, middle, operational manager); The wife's effort to show support to the manager; Spouse's ability to regulate manager's emotions; The extent of the spouse's influence on the manager's decision-making; wife being a housewife
Category 6. Environment: the level of stability of the environment; Accompanying men (in the role of manager) with their working wives; Manager's income; spouse's level of education; The degree of balance between work and family by the manager
Category 7. Spouse's physical health Question: Spouse's job transfer; Wife's physical health; Manager's physical health
Relevant answer
Answer
My perspective on this issue is that it doesn't make sense to keep heterogeneous/multidimensional items on the same factor. The very goal of factor analysis if to achieve unidimensionality within a given factor--you don't want items that measure very different things on the same factor because it makes the factor uninterpretable . That is, all items that are used as measures of the same factor should (ideally) be unidimensional (i.e., measure only the given factor with zero or at least minimal cross-loadings on other factors).
Things like, for example, "stability of the environment", "manager's income", and "spouse's level of education" are very highly distinct dimensions that would not be expected to load on a single factor to begin with. It does not make sense to put or keep them on the same factor. The interpretation of such a factor would be unclear or impossible.
  • asked a question related to Exploratory Factor Analysis
Question
2 answers
Here is the code I'm using:
[fa(vars, 4, rotate="oblimin", fm = "pa")]
Relevant answer
Answer
Hi Everyone, I just wanted to let you know that scaling the variables worked for me to fix this issue and it very minorly changed any results (one variable was no longer removed during the iteration process).
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Hi. I recently developed new survey items to measure extrinsic motivation. In the exploratory factor analysis, four items intended to measure extrinsic motivation loaded onto the same factor with negative scores (-.822; -.813; -.808; -.553). However, in the subsequent confirmatory factor analysis, the same items produced positive scores (.78; .80; .67; .56). I do not have a theory about why this could be and I would appreciate any help with explaining this finding.
Thank you.
Relevant answer
Answer
Hello Richard,
Rotational schemes in EFA are arithmetically driven, not theoretically driven. So, having one factor rotated 180 degrees "out of phase" with the theoretical orientation only means that: (a) loadings and factor pattern values; and (b) correlations with other factors would all be reversed in sign if the orientation were changed to the intended direction.
As for your follow-up question about the three "amotivation" items, do note that, even if you conceptualize a bipolar variable, there's no assurance that EFA will yield one. In my experience, it's not uncommon for items which have different valence to generate two factors instead of one. The "true" structure could be one bipolar, two, bi-factor, or second order. The good news is, you can evaluate these via CFA to see if the one you were aiming for works well with the data.
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
I used confirmatory factor analysis without using the exploratory factor analysis first. The assumption that supported this was the clear presence of the factors in the literature. Is it correct not to go for the exploratory factor analysis and jump directly to CFA when you have clearly established factors in the literature and theory?
Relevant answer
Answer
Yes, absolutely. There is no point in "exploring" factors when you have a clear theory or prior knowledge about the factors and how the observed variables (measurements) relate to them. In that case, you want to "confirm" (test) rather than explore/determine the number and nature of the factors. EFA is only needed when you have no prior theory and need to explore or when you have a CFA solution that completely fails to fit and you don't know why.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
Can i split a factor that has been identified through EFA.
N=102 4 factors have been identified.
However, one of the 4 actually has two different ideas that obviously are factoring together. I am working on trying to explain how they go together but it is very easy to explain them as two separate factors.
When I conduct a Conformatory analysis the model fit is better for them separate. . . but running a confirmatory analysis on the same population of subjects that I conducted the Exploratory analysis on appears to be a frowned upon behavior.
Relevant answer
Answer
Eric Orr Performing EFA and CFA on the same dataset serves no use. EFA is used to extract factors for the first time from a dataset, whereas CFA is used to verify factors extracted from a separate dataset.
However, exploratory factor analysis (EFA) is commonly employed to determine a measure's factor structure and to assess its internal reliability. When researchers have no theories regarding the nature of the underlying factor structure of their measure, EFA is frequently advised.
  • asked a question related to Exploratory Factor Analysis
Question
5 answers
Exploratory Factor Analysis and Confirmatory Factor Analysis are used in scale development studies. Rasch Analysis method can also be used in scale development. There are some researchers who consider the Rasch Analysis as up-to-date analysis. Frankly, I don't think so, but is there a feature that makes EFA, CFA or Rasch superior to each other in Likert type scale development?
Relevant answer
Answer
Rasch is better for a unidimensional scale
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
I am conducting an EFA for a big sample and nearly 100 variables, but no matter what I do, the determinant keeps its ~0 value.
What should I do now?
Relevant answer
Answer
Hello Assia,
When computed for a correlation matrix, the determinant may range from 0 (no independent variation) to 1 (no relationship among the variables at all). Lower values generally would suggest the data set _does_ share common variance, and therefore may be amenable to factoring.
A variable set with a determinant of exactly zero would indicate that all variables were isomorphic and interchangeable (one gives exactly the same information as any other). While you could contrive such a data set (for example, asking adults their age four times in a row, and treating the responses as separate variables), it should be apparent that scores on the variables should correlate perfectly (hence, a zero determinant for the matrix).
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
I plan to develop a semi-structured assessment tool and further validate it on a relatively small sample of below 50 (clinical sample). I have been asked by the research committee to consider factor analysis.
So in this context, I wanted to know if anyone has used regularized factor analysis for tool validation which is recommended for small sample sizes?
Relevant answer
Answer
The sample size is relatively quite small but if the size is above 100 then you can try. There had been studies who have opted for exploratory factor analysis on such smaller sample. But you got to check the KMO and Bartlett's Test of Sphericity to see the adequacy of your data. Try reading the following research papers who support smaller samples for EFA.
De Winter, J.C.F., Dodou, D., & Wieringa, P.A. (2009). Exploratory factor analysis with small sample sizes. Multivariate Behavioral Research, 44, 147–181.
Wirth, R. J., & Edwards, M. C. (2007). Item factor analysis: current approaches and future directions. Psychological Methods, 12, 58-79.
Barrett, P. (2007). Structural equation modelling: Adjudging model fit. Personality and Individual Differences, 42, 815-824.
  • asked a question related to Exploratory Factor Analysis
Question
6 answers
I am aware that a high degree of normality in the data is desirable when maximum likelihood (ML) is chosen as the extraction method in EFA and that the constraint of normality is less important if principal axis factoring (PAF) is used as the method of extraction.
However, we have a couple of items in which the data are highly skewed to the left (i.e., there are very few responses at the low end of the response continuum). Does that put the validity of our EFAs at risk even if we use PAF?
This is a salient issue in some current research I'm involved in because the two items are among a very small number of items that we would like, if possible, to load on one of our anticipated factors.
Relevant answer
Answer
Christian and Ali, thanks for your posts. Appreciated. I'll follow up on both of them.
Robert
  • asked a question related to Exploratory Factor Analysis
Question
10 answers
Dear all,
I am conducting research on the impact of blockchain traceability for charitable donations on donation intentions (experimental design with multiple conditions, i.e., no traceability vs. blockchain traceability).
One scale/factor measures “likelihood to donate” consisting of 3 items (dependent variable).
Another ”trust” factor, consisting of 4 items (potential mediator).
Furthermore, a “perception of quality” consisting of 2 items (control).
And a scale “prior blockchain knowledge” consisting of 4 items (control).
My question is: since all these scales are taken from prior research, is CFA sufficient? Or, since the factors are from different studies (and thus have never been used together in one survey/model) should I start out with an EFA?
For instance, I am concerned that one (or perhaps both) items of ”perception of charity quality” might also load on the “trust”-scale. e.g., the item “I am confident that this charity uses money wisely”
Curious to hear your opinions on this, thank you in advance!
Relevant answer
Answer
Kim Fleche, I'm glad my post was helpful. Please be aware that coefficient alpha (often less appropriately referred to as Cronbach's alpha) is a much more complex, uninformative, and deceptive metric than many researchers seem to appreciate.
A major feature of coefficient alpha is that it is highly dependent on the number of items involved. Because of that, a small number of nicely correlated items can have quite a low coefficient alpha. Conversely, by the time there are 20 or more items, the value of alpha can be quite high despite little association between many of those items.
The following publications might be helpful:
  • asked a question related to Exploratory Factor Analysis
Question
12 answers
Greetings,
I am a DBA student conducting a study about "Factors Impacting Employee Turnover in the Medical Device Industry in the UAE."
My research model consists of 7 variables, out of which:
  • 5 Variables measured using multi-item scales adapted from literature ex. Perceived External Prestige (6 items), Location (4 items), Flextime (4 items),.. etc.
  • 2 are nominal variables
I want to conduct a reliability analysis using SPSS & I thought I need to do the below?
  1. Conduct reliability test using SPSS Cronbach's alpha for each construct (except for nominal variables)
  2. Deal with low alpha coefficients (how to do so?)
  3. Conduct Exploratory Factor Analysis to test for discriminant validity
Am I thinking right? Attached are my results up to now..
Thank you
Relevant answer
Answer
The issue is not my specialty , with my best wishes
  • asked a question related to Exploratory Factor Analysis
Question
2 answers
Hi everyone,
I have longitudinal data for the same set of 300 subjects over seven years. Can I use '''year' as a control variable? Initially, I used one way ANOVA and found no significant different across seven years in each construct.
Which approach is more appropriate?. Pooling time series after ANOVA (if not significant) or using 'year' as a control variable?
Relevant answer
Answer
I thinking, when there is no significant difference found Through, its improper to use it as control variable. Better is allow PLS to create its own groups if any were present in the data.
  • asked a question related to Exploratory Factor Analysis
Question
2 answers
Hello everyone,
As the title suggests, I am trying to figure out how to compute a reduced correlation matrix in R. I am running an Exploratory Factor Analysis using Maximum Likelihood as my extraction method, and am first creating a scree plot as one method to help me determine how many factors to extract. I read in Fabrigar and Wegener's (2012) Exploratory Factor Analysis, from their Understanding Statistics collection, that using a reduced correlation matrix when creating a scree plot for EFA is preferable compared to the unreduced correlation matrix. Any help is appreciated!
Thanks,
Alex
Relevant answer
Answer
Take a look at the attached Google search for your answer. Best wishes, David Booth
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
I`m conducting the translation of a very short scale of 12 ítems to assess therapeutic alliance in children. I have 61 answers and I wonder if that number of subjects it`s acceptable to run Exploratory Factor Analysis. I know that there is a suggestion of 5 participants for item to do EFA and 10 participants for item to do CFA. However, the number of participants here seem to be very smal for these analysys. What it´s your opinion?
Relevant answer
Answer
Hello Leandro,
The vague answer is, more cases is generally better than fewer cases. There are two reasons:
1. The factor model you seek must, ideally, be capable of providing good estimates for the (12*11/2 =) 66 unique observed relationships among your 12 variables (here, item scores). That's a lot to ask of 61 cases.
2. In general, the smaller the N of cases, the more volatile are the observed relationships among the variables from sample to sample. Hence, the less likely that your sample data will accurately represent the correlations that may exist in the population. As these correlations are what the EFA is intended to account for, if they are incorrect then you likely will identify a factor model that does not generalize well to the population. The literature is full of studies in which one set of authors using a small or modest sample claims their EFA shows "different" results than those from another set of authors, even if ostensibly drawing from the same population.
Can you still proceed? Yes, of course--the numbers won't leap up from your data file and protest! However, do be mindful that: (a) guidelines such as "at least 100 cases" and "10-20 cases per variable" for EFA abound; and (b) you likely would want to characterize your results as tentative or exploratory, rather than as a definitive solution to the question of the true factor structure.
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Exploratory factor analysis was conducted to determine the underlying constructs of the questionnaire. The results show the % variance explained by each factor. What does % variance mean in EFA? How to interpret the results? How can I explain the % variance to a non-technical person in simple non-statistical language?
Relevant answer
Answer
The % variance explained by a factor in a given variable is often called communality (h^2). It gives the proportion of individual differences in an observed (measured) variable that is accounted for by the common factor. The communality is similar to R-squared in regression, except that the independent variable is a latent factor. Under certain conditions, the communality is equal to the reliability of a variable, namely when the variable only measures the common factor and measurement error.
In summary, the communality gives you an estimate of a variable's ability to reliably measure the common factor. The closer the communality to 1, the more reliable the variable as a measure of the factor.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
I have good result with low variance explanation (Less than 50%) in exploratory factor analysis, and read some discussions about the acceptable for total variance explanation < 50% in social sciences. Please recommend papers to support this issue or give me your suggestion.
Thanks in advance.
Relevant answer
Answer
Hello Sirwarit,
There are a lot of possible answers to your query as it was posted.
1. It could be that the set of variables you are attempting to factor simply do not share common latent variable underpinnings, and therefore no parsimonious factor structure (other than, perhaps, something approaching the "worst case" of k factors for k variables) exists.
2. It could be that the variables are poorly measured and/or suffer from high levels of specificity, such that the common variance is "low," according to your appraisal. Better versions or measurements of the variables might alleviate this problem.
3. You could have extracted too few factors to account for the variation in the variable set. Alternatively, a chunk of the variables in your set simply might not belong in the set, and this is causing your solution(s) to have low variance accounted for.
4. Your judgment that 50% of variance being explained by the factor structure you've selected from your EFA might be too pessimistic. I've seen a lot of published studies in which EFA solutions adopted by author/s account for 35-60 percent of variance. Often, this coincides with author/s' decision to jettison a number of the variables from the solution because of low variable-factor loadings. Are these ideal structures? Probably not, but one of the reasons for engaging in EFA is to refine a variable set into a group which does have common latent underpinnings...so, it is not unreasonable to expect that some variables might not end up being included.
I agree with Imran Anwar that it wasn't clear what you meant by having obtained good results.
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
6 answers
I am using the Environmental Motives Scale in a new population. My sample size is 263.
The results of my exploratory factor analysis showed 2 factors (Eigenvalue>1 and with loadings >0.3) - Biospheric Factor and Human Factor
Cronbach alpha was high for both factors (>0.8)
However, unexpectedly, confirmatory factor analysis showed that the model did not fit well:
RMSEA= 0.126, TLI =0.872 and SRMR = 0.063, AIC = 6786
After a long time on Youtube, I then checked the residual matrix and found that the standardized covariance residuals between two of the items in the Biospheric factor was 7.480. From what I understand if values are >3, it indicates that there may be additional factor/s that are accounting for correlation besides the named factor. I therefore tried covarying the error terms of those two items and rechecked the model fit using CFA.
Results of this model show much better model fit.
RMSEA = 0.083, TLI = 0.945, SRMR = 0.043, AIC = 6731 (not as much difference as I thought there would be)
The questions I am now left with (which google does not seem to have the answer to) are:
1. Is it acceptable to covary the error terms to improve model fit?
2. How does covarying error terms impact on the scoring of the individual scales? Can I still add up the items to measure biospheric vs human scales as I would have without the covarying terms?
I would be so grateful for any insight or assistance.
Thank you
Tabitha
Relevant answer
Answer
Tabitha Osler Many software programs for confirmatory factor analysis (CFA) will allow you to estimate and save individual factor scores as new variables to your data set. These scores can then be used in the same way as a conventional scale score (e.g., sum score of items). One advantage of factor scores is that they take the different item loadings into account and that they would allow you to also account for the error correlation parameter when estimating the factor scores.
I'm not sure which software program you're using. In any case, the program manual/user's guide should have information regarding options for estimating and saving factor scores.
  • asked a question related to Exploratory Factor Analysis
Question
7 answers
I used exploratory factor analysis for 4 latent variables, the result in the table called "total of variance explained" has "% of variance". is it similar to average variance extracted?
What steps to do discriminant validity in SPSS?  I run the factor analysis, then compute the latent variable to become observed variables, after that I run the correlation. is it the correct process?
Thanks for your attention
Relevant answer
Answer
VALUES SHOULD BE GREATER THAN .05
  • asked a question related to Exploratory Factor Analysis
Question
10 answers
Hi, I am working on a project about ethical dilemmas. This project requires development of a new questionnaire that should be valid and reliable. We started with collecting the items from the literature (n= 57), performed content validity where irrelevant items were removed (n=46), and piloted it to get the level of internal consistency. Results showed that the questionnaire has a high level of content validity and internal consistency. We were requested to perform exploratory factor analysis to confirm convergent and discriminant validity.
Extraction PCA
rotation varimax
Results: the items' communalities were higher than 0.6.
kMO 70%
Barttlett's test is significant.
Number of extracted factors 11with total explained variance 60%.
My issue is 6 factors contain only 2 items. Should I remove all these items?
With notice that the items are varied, each one describes a different situation, and only they share in that they are ethical dilemmas and deleting them will affect the overall questionnaire ability to assess participants' level of difficulty and frequency of such situations.
EFA is new concept for me; I am really confused by this data.
Relevant answer
Answer
Eman ALI Dyab, if you have been asked to conduct exploratory factor analysis (EFA), I suggest you follow that instruction rather than conducting confirmatory factor analysis.
There are other things you should consider, however. Given that your items' communalities are > .60, I think you don't need to abide by the rule of thumb that you have 10 x the number of participants as there are items, but I think you do need to ensure you have enough participants. Maybe 200 would be sufficient.
Also, given that you have 11 factors in the data, I wonder whether you are using the Kaiser criterion to determine the number of factors. That method has been criticised for at least 20 years. A better method is to use the scree test in conjunction with parallel analysis. The scree plot is often quite easy to interpret. I am attaching information about parallel analysis in case that helps.
Apart from that, I think it would be better to use EFA, not PCA (PCA is not really EFA, and you have been asked to conduct EFA, anyhow) and that you use an oblique (e.g., promax), rather than varimax, rotation. (Use of varimax has been criticised for quite a long time - though a lot of researchers keep using it, probably because they blindly follow what the crowd is doing.)
My inclination would be to remove items that load only on two-item factors. However, before doing that, I'd go back and use the scree plot in conjunction with parallel analysis, EFA with an oblique rotation, and a series of EFAs in which poorly performing items were successively removed.
All the best for your research.
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
Do you know any renowned article which has been published in Scopus journal describing that for conducting the Exploratory Factor Analysis (EFA), which method is the best, 'Principal component' or 'Principal Axis Factoring ' in SPSS?
Relevant answer
Answer
Hi, Mohammad Kamrul Ahsan. Here are some good papers on EFA that I hope will help you.
Preacher, K. J., & MacCallum, R. C. (2003). Repairing Tom Swift's electric factor analysis machine. Understanding statistics: Statistical issues in psychology, education, and the social sciences, 2(1), 13-43.
Yong, A. G., & Pearce, S. (2013). A beginner’s guide to factor analysis: Focusing on exploratory factor analysis. Tutorials in quantitative methods for psychology, 9(2), 79-94.
Costello, A. B., & Osborne, J. (2005). Best practices in exploratory factor analysis: Four recommendations for getting the most from your analysis. Practical assessment, research, and evaluation, 10(1), 7.
Fabrigar, L. R., Wegener, D. T., MacCallum, R. C., & Strahan, E. J. (1999). Evaluating the use of exploratory factor analysis in psychological research. Psychological methods, 4(3), 272.
Williams, B., Onsman, A., & Brown, T. (2010). Exploratory factor analysis: A five-step guide for novices. Australasian journal of paramedicine, 8(3).
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
I collected 109 responses for 60 indicators to measure the status of urban sustainability as a pilot study. So far I know, I cannot run EFA as 1 indicator required at least 5 responses, but I do not know whether I can run PCA with limited responses? Would you please suggest to me the applicability of PCA or any other possible analysis?
Relevant answer
Answer
I would recommend you read about the difference between EFA and PCA first. Whether or not you should run an EFA has nothing to do with the number of response options on the indicators, five or otherwise. In general, EFA is preferable to PCA as it is considered to be the 'real' factor analysis. The are many threads on RG on this issue.
Best
Marcel
  • asked a question related to Exploratory Factor Analysis
Question
4 answers
Query1)
Can mirt exploratory factor analysis method be used for factor structure for marketing/management research studies because most of the studies that I have gone through are related to education test studies? My objective is to extract factors to be used in subsequent analysis (Regression/SEM) My data is comprised of questions like: Data sample for Rasch Factors Thinking about your general shopping habits, do you ever: a. Buy something online b. Use your cell phone to buy something online c. Watch product review videos online RESPONSE CATEGORIES: Yes = 1 No = 0 Data sample for graded Factors Thinking about ride-hailing services such as Uber or Lyft, do you think the following statements describe them well? a. Are less expensive than taking a taxi c. Use drivers who you would feel safe riding with d. Save their users time and stress e. Are more reliable than taking a taxi or public transportation RESPONSE CATEGORIES: Yes =3 Not sure = 2
No = 1 Query2) If we use mirt exploratory factor analysis using rasch model for dichotomous and graded for polytomous, do these models by default contain tetrachoric correlation for rash model and polychoric correlation for graded models? My objective is to extract factors to be used in subsequent analysis (Regression/SEM) Note: I am using R for data analysis
Relevant answer
Answer
I would really appreciate that you spared some time to answer my question. May be I am unable to ask question properly but here my objective is to create factors from underlying battery of items with different scales. So my question is simple that can I use mirt to perform EFA to create factors to be used in subsequent analysis (Regression/SEM).
One more things I would like to know that what exactly "easy" and "difficult" items you mean in your given answer?
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Hi,
I used a self-efficacy tool for my sample. According to the original article, there is only one factor in the tool. However, in the Exploratory factor analysis for my sample, two factors were found. How can I interpret this result?
Thank you so much for your answer in advance!
Relevant answer
Answer
Basically, A statistically significant result is not due to chance and is determined by two important variables: sample size and effect size. The sample size refers to the size of the sample for your experiment.
Best Regards
Dr. Fatemeh Khozaei
  • asked a question related to Exploratory Factor Analysis
Question
11 answers
Hi, I have run an exploratory factor analysis (principal axis factoring, oblique rotation) on 16 items using a 0.4 threshold. This yielded two factors, which I had anticipated as the survey was constructed to measure two constructs. Two items had factor loadings <0.4 (from Factor 1) so I removed them, leaving 14. However, upon closer inspection, one of the items from Factor 2 loaded on to Factor 1 (|B| = 0.460).
The distinction between the two constructs is very clear so there should not be any misunderstanding on the part of the participants (n = 104). I'm unsure of what to do. I checked the Cronbach's alpha for each factor: Factor 1 (a = .835 with the problematic item, a = .839 without). Factor 2 is a = .791).
Do I remove the item? Any advice would be very much appreciated. Thank you!
Relevant answer
Answer
Joanne Lim, I'm glad my post was helpful. The Costello and Osborne article is informative and authoritative. If you want more information about sample sizes for EFA, feel free to ask me.
I think we might have a misunderstanding about communalities versus loadings. In SPSS output, the communalities are provided before the loadings, and it's the extraction communalities you need to pay attention to. I think they need to all be above ~.50 if you want to work with a smallish sample.
Your loadings look OK to me.
You could be spot on when referring to wording of items in your scale possibly not being satisfactory. With regard to the scale that I referred to in my post above, I'd forgotten to mention that a number of the items have wording that I regard as quite unpolished. Some are even silly.
One last note, please don't worry if the coefficient alphas for your two factors are quite low. With as few as 6 or 7 items, the alphas can be quite low (down around .70, maybe a touch less) without cause for concern. Check out the interitem correlations!
  • asked a question related to Exploratory Factor Analysis
Question
16 answers
Can anyone help me with the sample size calculation for the exploratory factor analysis? Do you know how to calculate it and with which statistical program? Thank you.
Relevant answer
Answer
Fernando Calvo, there are a number of recommendations concerning the desirable sample size for exploratory factor analysis - and they don't always agree with each other.
The simplest recommendation is to aim for 10 times the number of people as there are items to be submitted to EFA. So, if you have 20 items, you'd need 200 people. That's the recommendation in the following chapter:
Dixon, AE. Exploratory factor analysis. In Plitchta SB, Kelvin EA, editors. Munro’s statistical methods for health care research. 6th ed. Philadelphia, PA. Wolters Kluwer; 2013. pp. 371–398.
There are other recommendations, however. My colleagues and I dealt with them in an article we'd had published nearly 2 years ago:
Ma, K., Trevethan, R., & Lu, S. (2019). Measuring teacher sense of efficacy: Insights and recommendations concerning scale design and data analysis from research with preservice and inservice teachers in China. Frontiers of Education in China, 14(4), 612–686. https://doi.org/10.1007/s11516-019-0029-1
It's an open-access article. Check out the middle paragraph on page 628 for several recommendations, and choose whatever seems most appropriate to your situation.
All the best with your research.
  • asked a question related to Exploratory Factor Analysis
Question
1 answer
Can mirt exploratory factor analysis method be used for factor structure for marketing/management research studies because most of the studies that I have gone through are related to education test studies? My objective is to extract factors to be used in subsequent analysis (Regression/SEM) My data is comprised of questions like: Data sample for Rasch Factors Thinking about your general shopping habits, do you ever: a. Buy something online b. Watch product review videos online RESPONSE CATEGORIES: 1 Yes 2 No Data sample for graded Factors Thinking about ride-hailing services such as Uber or Lyft, do you think the following statements describe them well? a. Are less expensive than taking a taxi b. Save their users time and stress RESPONSE CATEGORIES: 1 Yes 2 No 3 Not sure
Relevant answer
Answer
The Mirt package generally concentrates on Item Response Theory. You can perform the Exploratory Factor Analysis you want more easily with the "fa" function in the "psych" package in R. You will also find many different and up-to-date estimation methods of factor analysis in this package.
  • asked a question related to Exploratory Factor Analysis
Question
6 answers
I have extracted factor scores after EFA and then used K-means to identify clusters.
But there are confusion in validating the number of cluster in SPSS, as it does not show any AIC or BIC value on the basis i can differentiate and finalise the number of clusters.
Relevant answer
Answer
SPSS macros calculating most important cluster validity indices are available in the library "Internal clustering criteria" found on "Kirill's SPSS macros page" (easily googled).
  • asked a question related to Exploratory Factor Analysis
Question
6 answers
I have adopted my questionnaire from previous literature. I want to know if I still need to carry out EFA before carrying out PLS-SEM for my thesis?
Relevant answer
Answer
@Zia Aslam The reference you shared here is an amazing landmark paper for the scholars doing survey-based primary data research. A must read manuscript indeed. Below is the attached manuscript in pdf.
  • asked a question related to Exploratory Factor Analysis
Question
3 answers
Hi,
I am working on exploratory factor analysis in SPSS using promax rotation.
Upon checking pattern matrix, there is one question which has factor loading greater than 1.0? should I need to ignore if loading greater than 1.0?
Also i realised there are few negative loadings in pattern matrix? Can negative loadings be still considered for the analysis?
Thanks much.
Relevant answer
Answer
Hello Ramya,
The pattern matrix gives the regression weights for how scores on a given variable are estimated to be formed, given some value/score on each of the respective factors. These weights can be > |1|. The structure matrix gives the simple correlation of each variable with scores on each factor, not adjusted for the relationships among factors. As such, the structure matrix should not include any values that exceed |1|.
As the pattern matrix furnishes "affiliation" values that do adjust for the relationships among factors, most folks would opt to review that first for understanding what's going on in the factor solution. However, full understanding of a factor solution requires thoughtful review of (a) the pattern and structure matrices; (b) the factor correlation matrix; (c) the variable correlation matrix; (d) variable communalities (after extraction/rotation); and of course, any relevant theory as well as understanding of measurement issues associated with the respective variables.
Negative values are also not uncommon in factor pattern matrices, just as negative regression weights may be observed in multiple linear regression. Again, recall that the purpose of the pattern matrix is to show how scores on the respective factors would be optimally weighted so as to explain differences in scores on each observed variable. Whenever you have multiple factors (and therefore a reason to rotate a factor solution in the first place), the "optimal" combination of factor scores to "explain" scores on an observed variable may have positive, negative, or both valences of estimated weights.
Good luck with your work.
  • asked a question related to Exploratory Factor Analysis
Question
7 answers
We can say that these factorial analysis approach are generally used for two main purposes:
1) a more purely psychometric approach in which the objectives tend to verify the plausibility of a specific measurement model; and,
2) with a more ambitious aim in speculative terms, applied to represent the functioning of a psychological construct or domain, which is supposed to be reflected in the measurement model.
What do you think of these generals' use?
The opinion given for Wes Bonifay and colleagues can be useful for the present discussion:
Relevant answer
Answer
Hello
Factor Analysis is a multivariate statistical technique applied to a single set of variables when the investigator is interested in determining which variables in the set form logical subsets that are relatively independent of one another. In other words, factor analysis is particularly useful to identify the factors underlying the variables by means of clubbing related variables in the same factor:
Thanks