ArticlePDF Available

The Bangla Big Five Inventory-2: a comprehensive psychometric validation

Springer Nature
Scientific Reports
Authors:

Abstract and Figures

The Big Five Inventory-2 (BFI-2) is a widely recognized tool for assessing personality traits across five domains and fifteen facets. However, its psychometric properties in non-Western cultures like Bangladesh remain unexplored. This study aimed to validate the Bangla BFI-2 (BFI-2-B) within a Bangladeshi community sample to provide a culturally adapted personality assessment tool. A cross-sectional survey was conducted on 1,095 participants, where 646 participants (59%; 358 female; Agemean = 24.25 years, SD = 4.47) passed all attention checks. Participants responded to a demographic questionnaire, Bangla Big Five-2 (BFI-2-B), and Bangla NEO Five-Factor Inventory (NEO FFI). The domain-level structural validity was analyzed using Exploratory Structural Equation Modeling (ESEM). A series of five different latent models were tested by Confirmatory Factor Analysis (CFA) for facet-level structural validity. Full Measurement invariance across gender, education level, and language were tested. The item quality was assessed using Item Response Theory analysis (IRT). Convergent and discriminant validity were assessed by correlating the BFI-2-B domains with the NEO-FFI. The BFI-2-B demonstrated high internal consistency across domains (> 0.70) and facets (> 0.60; except energy level, intellectual curiosity, and respectfulness). ESEM confirmed structural validity at the domain level (CFI & TLI = 0.96). CFA analysis revealed that at the facet level, a three-facet structure with an acquiescence factor yielded the most acceptable fit (CFI& TLI\ge0.95; RMSEA\le0.06; SRMR\le 0.08). Full measurement invariance was established across gender and educational levels, but only weak invariance was found across languages, indicating linguistic challenges. The similar domains of BFI-2-B and NEO-FFI had strong correlations (r\ge 0.59), and distinct domains exhibited low correlations, indicating strong convergent and discriminant validity. IRT analysis showed that most items had high to moderate discrimination. The BFI-2-B is a reliable and valid tool for assessing personality in the Bangladeshi context, with robust psychometric properties across domains and most facets. Addressing linguistic nuances and testing in more diverse samples can further enhance its cross-cultural applicability. The stage 1 protocol for this Registered Report was accepted in principle on 24/09/24. The protocol, as accepted by the journal, can be found at: 10.17605/OSF.IO/7DTQG.
Content may be subject to copyright.
The Bangla Big Five Inventory-2:
a comprehensive psychometric
validation
Mushqul Anwar Siraji2, Fahria Karim2, Christopher J. Soto3 & Shamsul Haque1
The Big Five Inventory-2 (BFI-2) is a widely recognized tool for assessing personality traits across
ve domains and fteen facets. However, its psychometric properties in non-Western cultures like
Bangladesh remain unexplored. This study aimed to validate the Bangla BFI-2 (BFI-2-B) within a
Bangladeshi community sample to provide a culturally adapted personality assessment tool. A cross-
sectional survey was conducted on 1,095 participants, where 646 participants (59%; 358 female;
Agemean = 24.25 years, SD = 4.47) passed all attention checks. Participants responded to a demographic
questionnaire, Bangla Big Five-2 (BFI-2-B), and Bangla NEO Five-Factor Inventory (NEO FFI). The
domain-level structural validity was analyzed using Exploratory Structural Equation Modeling (ESEM).
A series of ve dierent latent models were tested by Conrmatory Factor Analysis (CFA) for facet-
level structural validity. Full Measurement invariance across gender, education level, and language
were tested. The item quality was assessed using Item Response Theory analysis (IRT). Convergent and
discriminant validity were assessed by correlating the BFI-2-B domains with the NEO-FFI. The BFI-
2-B demonstrated high internal consistency across domains (> 0.70) and facets (> 0.60; except energy
level, intellectual curiosity, and respectfulness). ESEM conrmed structural validity at the domain
level (CFI & TLI = 0.96). CFA analysis revealed that at the facet level, a three-facet structure with an
acquiescence factor yielded the most acceptable t (CFI& TLI
0.95; RMSEA
0.06; SRMR
0.08).
Full measurement invariance was established across gender and educational levels, but only weak
invariance was found across languages, indicating linguistic challenges. The similar domains of BFI-
2-B and NEO-FFI had strong correlations (r
0.59), and distinct domains exhibited low correlations,
indicating strong convergent and discriminant validity. IRT analysis showed that most items had high
to moderate discrimination. The BFI-2-B is a reliable and valid tool for assessing personality in the
Bangladeshi context, with robust psychometric properties across domains and most facets. Addressing
linguistic nuances and testing in more diverse samples can further enhance its cross-cultural
applicability.
The stage 1 protocol for this Registered Report was accepted in principle on 24/09/24. The protocol,
as accepted by the journal, can be found at: 10.17605/OSF.IO/7DTQG.
Personality traits are relatively enduring patterns of thoughts, emotions, and behaviors guiding an individual’s
tendency to respond in a certain way under a given circumstance1. ere is a consensus in the literature that
personality traits robustly inuence our health and wellness2, job performance3, and academic achievement4.
Hence, it is critically important to measure personality traits precisely. In that vein, a major psychometric
theory in the personality eld- Big Five (BF) theory57 was brought to light, which categorized personality traits
into ve independent bipolar dimensions: Extraversion, Neuroticism or Negative Emotionality, Agreeableness,
Consciousness, and Open mindedness. In addition to this ve-domain model of the BF, literature has extended
its support to the hierarchical nature of the BF domains with several facets nested under each domain810. e
main dierence between domain and facet is the degree of conceptual breadth they cover. Each personality trait
domain of the BF (e.g., BF-Extraversion) enjoys a higher bandwidth of conceptual breadth and represents a large
amount of behavioral information7. In contrast, personality trait facets are narrowly dened (e.g., the “sociable
facet under the “BF-Extraversion” domain) and provide a specic description of behaviors.
Rooted in the BF theory, several personality assessment tools were developed, including the NEO Personality
Inventory-revised11, International Personality Item Pool-NEO8, and Big Five Aspect Scale12. ese long, in-
1Department of Psychology, Jerey Cheah School of Medicine and Health Sciences, Monash University Malaysia,
Subang Jaya, Malaysia. 2Department of History and Philosophy, School of Humanities and Social Sciences, North
South University, Dhaka, Bangladesh. 3Department of Psychology, Colby College, Waterville, USA. email:
shamsul@monash.edu
OPEN
REGISTERED
REPORT
Scientic Reports | (2025) 15:11008 1
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports
Content courtesy of Springer Nature, terms of use apply. Rights reserved
depth measures provide a clear picture of BF domains with incremental information regarding the facets within
each domain. However, over the years, researchers faced a major challenge of balancing the content adequacy
of the personality measures with the procedures length and cost-eectiveness. e Big Five Inventory-1 (BFI-
1) was introduced nearly 20years ago7,13 to address this. BFI-1 incorporates 44- brief, easy-to-understand trait
descriptive phrases representing the BF domains to facilitate coherence, clarity, and eciency with a substantial
reduction in completion time (5–10min depending on the reader’s ability) compared to other personality
measures14. However, the focus of BFI-1 was limited solely to the domain level of traits.
Recently, BFI-1 received a major revision14, in which the authors incorporated the hierarchical model of
personality traits by including three prominent facet traits under each BF domain. In the new BF Inventory
2 (BFI-2), the item number increased to 60 (15 facets with four items each, nested under ve domains) with
an equal number of positively and negatively keyed items (both at the domain and facet level) to address the
consenting responding style—participants’ tendency of consistently providing agree or disagree responses in
the survey items, regardless of the item content. With a more balanced item formulation, BFI-2 oers greater
predictive precision14. Soto and John14, in their validation study, reported satisfactory psychometric properties
of the BFI-2 among a U.S. student sample (n = 470). Despite the increase in length, BFI-2 remains economical
compared to other scales15 and has been widely used in research and applied elds. As of 1 November 2023, the
initial BFI-2 paper14 has been cited 2259 times (Google Scholar). BFI-2 has been translated into more than 30
languages and psychometrically validated in Chinese15, Turkish16, Danish17, Dutch18, Russian19, and German20.
It is important to acknowledge that the initial psychometric properties of the BFI-2 were obtained from
Western, Educated, Industrialized, Rich, and Democratic (WEIRD) populations. Given the observed dierences
in personality domain structures and other correlates of personality traits among the personality measures across
cultures, using them without proper psychometric validations on non-Western cultures would lead to erroneous
interpretation21,22. To what extent these ndings could be generalized to other countries, such as Bangladesh –
an Asian country – remains less explored. Zhang, et al.15, in their Chinese BFI-2 validation with four diverse
samples, successfully reproduced the BF domains however, two facets—trust (BF-Agreeableness domain) and
intellectual curiosity (BF-Open Mindedness)—exhibited relatively low reliability (n = 765). Ahya and Siaputra23,
in their validation work of Indonesian BFI-2, indicated that only 50 items exhibited satisfactory factor loadings
in an exploratory factor analysis. Additionally, the ve-domain structure did not yield adequate t through
conrmatory factor analysis. However, the facet-level model was successfully reproduced. Altogether, the ndings
of the adaptation work of BFI-2 in Asian countries indicate the necessity of psychometrically validating the
BFI-2 in the Bangladeshi context to increase the generalizability and precision of this measurement. Moreover,
there is a lack of published psychometrically valid scales in Bangla to measure personality. Only a Bangla ten-
item Personality Inventory24 is available in the published literature, developed based on BFI-1 and employing
only two items per domain. ough short measures are sometimes handy, a full-length measure of personality,
such as BFI-2, which accounts for both domains and facets, would facilitate an in-depth understanding of
the personality of the Bangladeshi community people. us, in this paper, we took the initiative to adapt and
psychometrically validate the BFI-2 in Bangla.
e BFI-1, which has been widely used globally for the last 20years in various cultural and linguistic settings,
is reported to be susceptible to gender dierences, where women tend to score higher than men in some trait
domains2527. Further, Dahmann and Anger28 found that low education level led to more extraversion and
less emotional stability among young German adults, indicating the susceptibility of personality domain to
education level. ese measurement invariance (MI) issues across gender and education levels in BF-1 have
been studied robustly25,28. However, for the BFI-2, measurement invariance (MI) has been investigated mostly
cross-culturally (English vs Native language): U.S. student and internet validation sample (n = 1470) vs. Chinese
sample (n = 1718)15, and U.S. internet validation sample (n = 1000) vs. German sample (n = 1338)20. Research on
measurement invariance (MI) focusing on gender, education level, and language (Bangla vs. English) invariance
for BFI-2 is currently lacking but highly recommended.
Validation studies of BFI-2 following factor analytic strategies (exploratory and conrmatory) conducted
over the years could now be considered suboptimal methods. Many studies used Principal Component
Analysis (PCA) to report the domain-level factor structure of BFI-214,19,29. ough commonly used, PCA does
not dierentiate between common variance (variance due to the hypothesized factors) and unique variance
(variance attributed to the respondents’ characteristics). Hence, PCA is less suited to examine the factor
structure of personality measures. Furthermore, conventional reporting of Exploratory Factor Analysis (EFA)
lacks specic goodness-of-t criteria and follows some rules of thumb to investigate the factor structure. Factor
structures obtained by these conventional EFA are oen susceptible to sample size and do not replicate easily30.
Hence, it became a common practice in the personality domain to employ Conrmatory Factor Analysis (CFA),
which focuses on validating a hypothesized factor structure30. Yet reproducing the factor structure of personality
inventories, including BF, using the CFA method remains controversial30,31. e invariance of factor structure
across dierent populations could be attributed to the tenuous simple structure assumption of the CFA analysis
strategy, which typically requires items of BF measures to be trait-specic. is issue could be avoided by
employing the Exploratory Structure Equation Modeling (ESEM)32.
ESEM accounts for non-trivial cross-loading consistently found in personality measurements33. It’s
encouraging to observe that recent adaptation studies utilize the ESEM technique to investigate the hypothesized
factor structure, especially the domain-level structure of BFI-215,20,34,35. However, individual item quality
assessment of BFI-2 has scarcely been conducted using Item Response eory (IRT). In contrast to Classical
Test eory (CTT) approaches (ESEM, factor analysis) that assume all items’ contributions are equal, IRT
acknowledges that some items could be more dicult than others and considers the interaction of item diculty
and personal characteristics (latent traits) of the respondents while measuring psychological constructs, i.e.,
personality. By assessing item diculty, item discrimination, item bias, and item information carried across the
Scientic Reports | (2025) 15:11008 2
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
latent trait, IRT-based analysis supplements the CTT-based analyses to increase the precision and accuracy of
a measure. Soto and John36 employed IRT-based analysis to develop short and extra-short forms of the BFI-2.
However, a full-scale examination of the BFI-2 using IRT principles is highly required.
is study aims to validate the Big Five Inventory-2 among Bangladeshi community samples. In that pursuit,
we seek answers to four questions: (Q1) What is the latent structure of the Bangla BFI-2 (BFI-2-B)? (Q2) What
is the extent of measurement invariance of the BFI-2-B across gender, education level, and language (English vs
Bangla language)? (Q3) What is the quality of items in the BFI-2-B when assessed using Item Response eory
(IRT)? (Q4) What is the convergent and discriminant validity of the BFI-2-B? e Table 1 summarizes the research
questions addressed in this study. To answer these four questions, we set four specic objectives for this study.
First, we will investigate the structural validity of the domain-level and facet-level structure of the Bangla BFI-2
(BFI-2-B; Q1). We hypothesize that the domain level and facet level structure of BFI-2 would be reproducible in
the Bangladeshi community sample (H1). Second, we will investigate the measurement invariance of the BFI-
2-B across gender, education level, and language (Q2). We hypothesize that full invariance of the BFI-2-B would
be established for gender, education, and language (H2). ird, we will employ item response theory (IRT) based
analysis to assess the item quality of the BFI-2-B (Q3). Lastly (Q4), we will gather convergent and discriminant
validity of the BFI-2-B by calculating correlations between the big ve domains of personality measured using
the BFI-2-B and the Bangla-translated NEO Five-Factor Inventory (NEO-FFI)37. Previous studies14,38 while
establishing the measurement properties of BFI-2, reported strong correlations (
.50)
between the similar
domains of BFI-2 and NEO-FFI (convergent validity, e.g., BFI-2 Extraversion with NEO-FFI Extraversion) and
very weak to moderate correlations between distinct domains (discriminant validity, e.g., BFI-2 Extraversion
with NEO-FFI Conscientiousness). Hence, we hypothesize strong (≥ 0.50) correlations between similar domains
(showing convergent validity) and discriminant validity (correlations between distinct domains) would be lower
than the convergent validity of BFI-2-B and Bangla-translated NEO-FFI (H3).
Methods
Ethics information
e project has obtained ethics clearance from the Monash University Human Research Ethics Committee
(Project ID: 41934). All methods were performed in accordance with the relevant guidelines and regulations.
Question Hypothesis (if
applicable) Sampling plan Analysis plan Interpretation given to dierent outcomes
Q.1. What is the
latent structure
of Bangla BFI-2?
H1: e domain
level and facet
level structure of
BFI-2 would be
reproducible in
the Bangladeshi
community
sample
For structural equation modeling framework-
based analysis it is to have 10 participants
per item. BFI-2-B has 60 items, at least
600 participants are required. Monte Carlo
Simulation studies suggested a sample size of
200–500 to evaluate the obtained result safely
when analyzing ordinal data with the “Weighted
Least Square with mean and variance”
(WLSMV) estimator. Hence, we target to attain
a sample size of 600. Data collection will be
stopped once we have 600 complete data
We would investigate the structural
validity of the BFI-2-B both at the
domain and facet levels. e domain-
level structural validity of BFI-2-B
would be investigated using Exploratory
Structural Analysis Modeling (ESEM).
Facet-level structural validity of each
domain, we would employ conrmatory
factor analysis (CFA) with the
“Weighted Least Square with mean and
variance” (WSMV) estimator
To assess the model t in ESEM and CFA we
would follow the popular suggestions of Hu
and Bentler51: Comparative t index (CFI)
and the Tucker Lewis index (TLI): good t
0.95, acceptable t
0.90); the root means
square error of approximation (RMSEA):
good t < .06, acceptable t < .08; and the
standardized root mean square (SRMR) good
t < 0.08, acceptable t < 0.10
Q.2. What is
the extent of
measurement
invariance of
the BFI-2-B
across gender,
education level
and language
(English Vs.
Bangla)?
H2: A full
measurement
invariance across
gender, education
level, and language
would be observed
We would follow the sampling plan of Q1 given
that the analyses strategies fall under same
family
ESEM-based ve-domain model
would be used investigate the full
measurement invariance using the
13-model taxonomy of Marsh, et al.52
To assess measurement invariance, we
would follow the suggestions of Cheung and
Rensvold55, where invariance between two
models would be indicated by ΔRMSEA
0.01 and ΔCFA
=-0.01
Q.3. What is the
quality of items
in BFI-2-B when
assessed using
Item Response
eory (IRT)?
Not applicable
Sampling adequacy for IRT analysis was
analyzed using a Monte Carlo Simulation using
the “SimDesign” package (Chalmers & Adkins,
2020) with a sample size varying from 100–600.
IRT models at the facet level with 4 items was
considered the average root mean squared error
(RMSE) was calculated to estimate the optimal
sample size for the graded response model. e
RMSE became stable with a range of 1.34–1.39
for N = 500 to 600. Our planned sample size
exceeds the requirement
We would subject each facet to item
response theory-based item analysis.
For each facet, we would t a graded
response-based IRT model57 with
a marginal maximum likelihood
estimation method with the MHRM
algorithm
We would report the item diculty and
item discrimination and categorize the
items based on discriminatory power
following the guidelines of Baker59: none = 0,
very low = 0.01–0.34, low = 0.35–0.64,
moderate = 0.65–1.34, high = 1.35–1.69, very
high > 1.70. e test information curve will
be inspected to identify the range of latent
continuum (
θ
) with the highest information
and least standard error of measurement.
We will estimate and report the marginal
reliability coecients for the tted IRT models
Q.4. What is the
convergent and
discriminant
validity of the
BFI-2-B
H3: We expect a
strong (≥ 0.50)
correlation for
convergent
validity.
Convergent
validity is expected
to be higher than
discriminant
validity (H3)
A power analysis with 80% power with
alpha = 0.05, at least 82 participants is required
for a person-product moment correlation
analysis. Our planned sample size exceeds the
requirement
Convergent validity:
Correlation of similar domains of BFI-2
and NEO-FFI, e.g., BFI-2 Extroversion
and Neo Extroversion
Discriminant validity:
Correlation of distinct domains of BFI-
2 and NEO-FFI, e.g., BFI-2 Extraversion
with NEO-FFI Conscientiousness
We would check the statistical signicance
(p < 0.05), directionality and strength of the
correlations. We expected that the correlation
of similar domains would be strong (
0.50
) and higher than the correlations of distinct
domains
Tab le 1. Design table.
Scientic Reports | (2025) 15:11008 3
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Digital informed consent from all participants were recorded at the beginning of the online survey. Participation
was voluntary, and participants could withdraw from participation at any time without being impacted negatively.
Participants received a gi worth 3 USD and feedback on their personality scores for their contribution.
Design
Materials
Demographic questionnaire. e demographic questionnaire asked about participants’ age, gender, socio-
economic status, religion, marital status, occupation, and education level.
Big Five Inventory- 2. e Big Five Inventory-2 (BFI-2)14 is a 60-item scale incorporating the hierarchical
structure with 15 facets nested under 5 major personality domains: BF-Extraversion, BF-Negative Emotionality,
BF-Agreeableness, BF-Conscientiousness, and BF-Open Mindedness. ese 60 items are short, easily understood
(e.g., “Is outgoing, sociable”, “is relaxed, handles stress well”) and use a 5-point Likert type response format
(1 = Disagree strongly; 2 = Disagree a little; 3 = Neutral; no opinion; 4 = Agree with a little; 5 = Agree strongly). Soto
and John14 reported satisfactory reliability (0.81–0.90 for domains and 0.59–0.86 for facets across the internet
validation sample-1000 adults and student validation sample-470 students).
Big Five Inventory-2-Bangla. e BFI-2-B was developed by translating and adapting BFI-2 items using the
International Test Commission (ITC) guidelines39. ree bilingual scholars, natives of Bangladesh, conducted
the forward translation of the BFI-2 to Bangla. e translated versions were judged and synthesized by the
authors. Subsequently, three bilingual scholars back-translated the Bangla scale into English. e authors
again synthesized the three back-translations, compared them with the original scale, and made necessary
amendments. Twelve mental health professionals assessed the content validity of the BFI-2-B by independently
evaluating the relevance of the 60 items using a 4-point Likert type scale (1: not at all relevant, 2: slightly relevant,
3: quite Relevant, 4: Highly Relevant). e item-level content validity (I-CVI) and scale-level content validity
index (S-CVI) were estimated as content validity indicators. Items with an I-CVI score > 0.83 and S-CVI > 0.90
indicate adequate content validity40,41. Eight items were below the cut-o value. ese eight items were readjusted
and re-evaluated by the same experts. Aer adjustment, I-CVI scores of all items were acceptable. e S-CVI
for the total scale was 0.94, estimated using the average method, and indicated satisfactory content validity40,41.
NEO Five-Factor Inventory. e NEO Five-Factor Inventory (NEO FFI)37 is a 60-item scale that measures
personality using the same ve domains as BFI-2: NEO-Extraversion, NEO-Negative Emotionality, NEO-
Agreeableness, NEO-Conscientiousness and NEO-Open Mindedness. NEO-FFI measures each of the ve domains
using 12 items with a 5-point Likert-type scale (1 = Disagree strongly; 2 = Disagree a little; 3 = Neutral; no opinion;
4 = Agree a little; 5 = Agree strongly). We translated the NEO-FFI into Bangla using the forward–backward
translation method and used it in the current study.
Procedure
We rst ran a pilot (N = 50) study to double-check the questionnaire translation and debug the R scripts for
analysis. Aer this, we launched a quantitative, cross-sectional, anonymous online survey. Participants were
recruited using social media platforms. On the online survey’s landing page, explanatory statements contained
information on inclusion and exclusion criteria and data condentiality statements. It was mentioned in the
explanatory statement that their participation would be voluntary and that they could withdraw from participation
at any time without being impacted negatively. If happy with the explanatory statements, participants agreed to
participate (oered consent online). Participants rst responded to a demographic questionnaire. Second, they
completed the 60-item BFI-2 and, nally, responded to the 60-item NEO-FFI-B. e survey required around
20–25min to complete. Participants received a gi worth 3 USD and feedback on their personality scores for
their contribution.
Sampling
As a rule of thumb, 10 participants per item were required for analysis based on the structural equation modeling
framework42,43. Since BFI-2-B has 60 items, at least 600 participants were required. Monte Carlo Simulation
studies suggested a sample size of 200–500 to evaluate the obtained result safely when analyzing ordinal data
with the Weighted Least Square with Mean and Variance (WLSMV) estimator. Further, we assessed the sampling
adequacy for IRT analysis using a Monte Carlo Simulation using the “SimDesign” package44 with a sample size
varying from 100–600. We simulated the IRT models at the facet level with 4 items. We calculated the average
root mean squared error (RMSE) to estimate the optimal sample size for the graded response model. e RMSE
became stable with a range of 1.38–1.40 for N = 500 to 600. Lastly, a power analysis using G*Power45 indicated
that 82 participants are required for a signicant correlation of medium eect (r = 0.3;α = 0.05) with 80% statistical
power. Our planned sample size well exceeded the requirement. We launched an online survey based on these
guidelines to collect data from 600 participants at least. Four items for attention checks were included in the
survey to ensure that participants are attentive while they respond to the survey (e.g., is is an attention check.
Please select “Strongly disagree”). Once we launched the survey, we checked weekly for completeness (100%
completeness with all attention checks correctly answered). We planned to stop data collection once we had at
least 600 participants and a 100% completion rate. In case of excess data, we incorporated them in our analysis.
Exclusion and inclusion criteria
Any Bangladeshi national living in Bangladesh, aged over 18, who could read and write the Bangla language
were eligible to participate in the survey. Bangladeshi nationals who did not permanently reside in Bangladesh
(settled overseas) were excluded due to their exposure to multiple social norms and cultures. Participants who
failed the attention check were excluded. Table 1 summarizes the design of the study.
Scientic Reports | (2025) 15:11008 4
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Analysis plan
We used R46 with RStudio for all the analysis. Several statistical packages, including psych47, esemComp48, and
lavaan49 were used. Figure1 summarizes the data analysis steps.
Descriptive statistics and reliability analysis
We reported the descriptive statistics and estimated the reliability of each domain and facet of the BFI-2-B in
our sample using the psych47 package. We reported Cronbach’s α and McDonald’s ωt coecient for reliability
estimates.
Domain and facet level structural validity
We investigated the structural validity of the BFI-2-B at both the domain and facet levels. e domain-level
structural validity of the BFI-2-B was investigated using Exploratory Structural Equation Modeling (ESEM)
using esemComp48 package. ESEM is a substantive-methodological synergy that integrates the computational
power of conrmatory factor analysis and exploratory factor analysis32,50. We examined whether the ve BFI-
2-B domains could be recovered in our sample at the item level (each item serves as an indicator of the Big
Fig. 1. Data analysis steps.
Scientic Reports | (2025) 15:11008 5
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Five domain). We allowed all items to load on all ve domains using a rotated target matrix (60 × 5) where the
primary loading’s positions were le unspecied, and the cross-loadings were set to zero. To assess the model
t, followed the popular suggestions of Hu and Bentler51: Comparative t index (CFI) and the Tucker Lewis
index (TLI): good t
0.95, acceptable t
0.90); the root means square error of approximation (RMSEA):
good t < 0.06, acceptable t < 0.08; and the standardized root mean square (SRMR) good t < 0.08, acceptable
t < 0.10.
To investigate the facet-level structural validity of each domain, we employed conrmatory factor analysis
(CFA) with the Weighted Least Square with Mean and Variance (WSMV) estimator. lavaan49 was used to run
the CFA. Here, we tested the ve models that Soto and John14 used in their study to investigate the facet level
structure of BFI-2: (M1) Single Domain Structure- all 12 items of a domain were allowed to load on a single
factor; (M2) Single Domain with acquiescence- in addition to the single domain factor an acquiescence method
factor were added. In line with their study, we constrained the factor loadings of all items on the acquiescence
factor equal to 1, making both true-keyed and false-keyed items load in the same direction. To ensure the proper
representation of individual dierences that were caused by response style but not due to personality content,
the acquiescence factor was not allowed to correlate with domain factors; (M3) positive and negative items—two
correlated factors where true keyed items were loaded on one factor, and false-keyed were loaded on another;
(M4) the three facets model- representing the three facets within each Big Five Domain; (M5) three facets plus
acquiescence model- An acquiescence factor were added to the three facets model. In this model, each item
was allowed to load on its corresponding facet and the acquiescence method factor. All factor loadings in the
acquiescence factor were constrained to 1, and no correlation will be allowed between the acquiescence factor
and the three facets. To assess the model t, we followed the aforementioned guidelines of Hu and Bentler51.
Measurement invariance
We extended the ESEM-based ve-domain model to investigate the full measurement invariance using the
13-model taxonomy of Marsh, et al.52 across gender, educational level, and language (English vs. Bangla).
lavaan49 was used to run the measurement invariance analyses. To assess the invariance across Bangla and
English versions of the test, we used the internet sample (N = 1000) collected by the third author and used to
nalize BFI-2 items, their domain, and facet structures14. We compared that data with the data accumulated
from Bangladeshi participants using the Bangla version of the test. For gender, language, and education level
full measurement invariance testing, we used the ESEM paradigm, which allowed us to compare the invariance
using raw data, thus accounting for individual item functioning. Typically, measurement invariance investigation
incorporates a sequence of model t, starting with the least restricted model—congural invariance with no
parameter invariance constraints53. Next, the weak measurement invariance model constrains factor loadings
to be equal across the groups. Strong measurement invariance assumes the indicator means and factor loadings
to be invariant over groups. Strict measurement invariance constrains the factor loadings, indicator means, and
item uniqueness to be equal over the groups. Marsh, et al.54 proposed a 13-model taxonomy that expands this
four-model-based invariance testing by integrating both factorial and measurement invariance techniques where
the sequence of testing propagates from congural invariance to a model of full invariance (Complete factorial
invariance) that posits strict invariance as well as the invariance of latent mean and factor variance–covariance
matrix. At least, strict measurement invariance would be required to compare the BFI-2-B scales scores across
dierent attributes such as gender, education levels, and language (For a more extended discussion of the 13
model taxonomy, please see Marsh, et al.52). To assess measurement invariance, we followed the suggestions of
Cheung and Rensvold55, where invariance between two models would be indicated by ΔRMSEA
0.01 and
ΔCFA
≥−0.01
.
Item response theory-based analysis
We subjected each facet to item response theory (IRT)-based item analysis. mirt56 package was used to run
the IRT-based item analysis. We t a graded response-based IRT model57 for each facet using the marginal
maximum likelihood estimation method with the MHRM algorithm. We gathered information on item
diculty, discrimination, and test information. We assessed the local t of the items using int, outt, S-
χ2,
and S-
χ2
associated RMSEA statistics. Person t was evaluated using
Zh
statistics58.
Zh
< -2 was considered a
mist for each facet. We reported the item diculty and item discrimination and categorized the items based
on discriminatory power following the guidelines of Baker59 none = 0, very low = 0.01–0.34, low = 0.35–0.64,
moderate = 0.65–1.34, high = 1.35–1.69, very high > 1.70. e overall test information curves were inspected to
identify the range of latent continuum (
θ
) with the highest information and least standard error of measurement.
We estimated marginal reliability coecients for the tted IRT models.
Convergent and discriminant validity
Lastly, we investigated the convergent and discriminant validity of the BFI-2-B by using Pearson’s partial
product-moment correlation between the scores of BFI-2-B and the Bangla-translated NEO-FFI, controlling
for the gender and age of the participants. We expected strong (≥ 0.50) correlations between similar domains
for convergent validity. Discriminant validity (correlations between distinct domains) was expected to be lower
than convergent validity.
Deviations from the preregistered plan
We acknowledge two deviations from the preregistered plan. First, due to technological limitations, we aggregated
item scores for each facet and treated the facet scores as continuous indicators while tting the ESEM model for
full invariance testing, instead of analyzing item-level data as planned. Second, the pilot data, collected before
Stage 1 Acceptance of the manuscript, to evaluate the feasibility of the plan and the accuracy of the analysis code
Scientic Reports | (2025) 15:11008 6
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
as requested by reviewers, were included in the full study sample rather than reported separately as initially
planned.
Results
Pilot
We conducted a pilot study with 50 participants to double-check the questionnaire translation and debug
the R scripts for analysis. e entire survey took approximately 20min to complete. To collect data from 50
participants with a 100% completion rate (attention check items included), we had to record data from 57
participants (87.72% success rate). Among the 50 participants, 28 were male (Agemean = 22.68; AgeSD = 1.83),
and 22 were female (Age: Mean = 23.59; SD = 4.24). e majority were undergraduate students (98%), Muslim
(98%) and unmarried (96%). In the pilot, the reliability of the ve domains of BF were extraversion: Cronbach’s
α = 0.76, McDonald’s ωt = 0.78; agreeableness: Cronbach’s α = 0.8, McDonald’s ωt = 0.80; conscientiousness:
Cronbachs α = 0.76, McDonald’s ωt = 0.78; negative emotionality: Cronbach’s α = 0.83, McDonald’s ωt = 0.84;
open mindedness: Cronbach’s α = 0.80, McDonald’s ωt = 0.81. All analysis codes were executable. e pilot data
indicated that the proposed study is realistic and feasible.
Characteristics of study sample
We have collected 1095 participants’ responses (including the pilot sample), resulting in complete responses (100%
completion) of 646 participants who passed all four attention checks. Table 2 summarizes the characteristics
of the current sample. Among the participants, 358 were females (Agemean = 24.59 years; AgeSD = 4.69) and 288
males (Agemean = 23.33 years; AgeSD = 4.14). Most participants were Muslim (91%), with similar proportions
across genders (92% for females and 91% for males); 82% were unmarried, and 84% were university students.
Participants with government jobs represented 13% of the sample, while those in business or private sector jobs
were less common (1.7% and 1.2%, respectively). e mean score of participants’ social stance (an individual’s
perceived social position in a social hierarchy measured by a single-item 10-point Likert scale: 1 for the lowest
and 10 for the highest social position) was 6.65 (SD = 1.63), with females showing a slightly higher mean (6.72,
SD = 1.65) than males (6.56, SD = 1.61).
Characteristics of internet sample collected earlier
e internet validation sample (N = 1000) gathered for the development of BFI-214 earlier (English version)
was compared with our sample (completed the Bengali version) to investigate the measurement invariance for
language. 50% of that sample was female. eir age ranged from 18 to 74 (Mean = 28.73years, SD = 11.68), with
most (65%) under 30. ey were mostly White/Caucasian (66%) and residing in the United States (82%). A
detailed description of the sample is published elsewhere14.
Descriptive statistics and reliability analysis
Tables 3 and 4 present the descriptive statistics of BFI-2-B. All items violated the normality assumption, due
to most participants agreeing with more desirable items and disagreeing with undesirable items. e corrected
item-total correlations ranged between 0.12–0.57 (Mean = 0.35; SD = 0.10). Table 4 provides the descriptives of
Var iable O verall, N = 6461Female, N = 3581Male, N = 2881
Age 24.25 (4.47) 24.59 (4.69) 23.83 (4.14)
Religion
Islam 590 (91%) 329 (92%) 261 (91%)
Hindu 47 (7.3%) 23 (6.4%) 24 (8.3%)
Christian 5 (0.8%) 5 (1.4%) 0 (0%)
Others 3 (0.5%) 0 (0%) 3 (1.0%)
Budda 1 (0.2%) 1 (0.3%) 0 (0%)
Marital status
Unmarried 530 (82%) 272 (76%) 258 (90%)
Married 113 (17%) 84 (23%) 29 (10%)
Divorced 3 (0.5%) 2 (0.6%) 1 (0.3%)
Profession
Student 544 (84%) 285 (80%) 259 (90%)
Government job 83 (13%) 63 (18%) 20 (6.9%)
Business 11 (1.7%) 6 (1.7%) 5 (1.7%)
Private job 8 (1.2%) 4 (1.1%) 4 (1.4%)
Education
Undergraduate 555 (86%) 299 (84%) 256 (89%)
Post-graduate 91 (14%) 59 (16%) 32 (11%)
Perceived social status 6.65 (1.63) 6.72 (1.65) 6.56 (1.61)
Tab le 2. Characteristics of study sample (N = 646). 1Mean (SD); n (%)
Scientic Reports | (2025) 15:11008 7
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
BFI-2-B at the domain and facet level with mean-level gender dierences within each sample. Women tend to
describe themselves as somewhat more agreeable, conscientious, and emotional (for all instances, p < 0.05). At
the facet level, women described themselves as more compassionate, trustworthy, organized, anxious, depressed,
and emotionally volatile than men (for all instances, p < 0.05). Men described themselves as more intellectual
and creative (for all instances, p < 0.05).
Table 5 presents the reliability coecients of BFI-2-B and NEO-FFI. e values of Cronbach’s α, and
McDonald’s Omega for all ve domains of BFI-2-B and NEO-FFI (except NEO- conscientiousness & NEO-
Open mindedness) were > 0.70. Supplementary Table S1 presents the reliability coecients of the 15 facets of
BFI-2-B. e reliability indices for the 15 facets ranged between 0.38 to 0.89 for Cronbachs α and between 0.40
to 0.89 for McDonald’s ωt. All facets except energy level, responsibility, and intellectual curiosity had reliability
values > 0.60.
Domain and facet level structural validity
Table 6 presents the factor loading of domain-level validity analysis. e tted ESEM model with ve domains
exhibited good model t [χ2 = 3774.55, df = 1480, p < 0.001 CFI = 0.96, TLI = 0.96, RMSEA = 0.06 (90% CI:0.05–
0.05), SRMR = 05]. As shown in Table 6, all items under BF-Negative Emotionality exhibited the highest loadings
under the corresponding domain. Among the 12 items of the BF-Extraversion domain, 7 items had the highest
factor loading on the targeted Extraversion domain, one item (RBFI11) had the highest loading on Negative
Emotionality, and four items had the highest loading on Open Mindedness (BFI06, BFI21, BFI41, BFI56).
Results showed that the items of BF-Agreeableness equally spread over two domains-Agreeableness (RBFI12,
RBFI17, RBFI22, RBFI37, RBFI42, RBFI47) and Open Mindedness (BFI02, BFI07, BFI27, BFI32, BFI52, BFI57).
Among the 12 items of BF-Conscientiousness, 9 were loaded highest on the corresponding factor, and three
were loaded highest on Open Mindedness (BFI13, BFI38, BFI43). Items of BF-Open Mindedness were spread
over Agreeableness ( RBFI5, RBFI20, BFI35, RBFI50, RBFI55), Open Mindedness (BFI10, BFI15, BFI40,
RBFI45, BFI60) and Extraversion (RBFI25, RBFI30) domains. ese results indicate that the BFI-2’s intended
multidimensional structure can be recovered from the BFI-2-B, but with cross-cultural dierences in the precise
location of some items.
Table 7 presents the model t statistics for the series of ve-facet models within each domain. For all tted
models, the CFIs were > 0.90 and SRMR < 0.10 (except single domain open-mindedness). For most cases, a better
t was observed for three facets plus acquiescence models (extraversion, agreeableness, conscientiousness, open-
mindedness). For negative emotionality, the RMSEA values were over the upper threshold (RMSEA > 0.08) for
all tted models, but all other t statistics were acceptable for the three facets plus acquiescence model. Taken
together, these results suggest that, within each Big Five domain, BFI-2-B item responses can be modeled using
a combination of three substantive facet factors and an acquiescence method factor.
Measurement invariance
Due to technological limitations, while tting the ESEM model for full invariance testing, we aggregated the item
scores for each facet and treated the facet scores as continuous indicators. Parallelling the domain structure, the
facets were rotated towards a 15 × 5 target matrix with unspecied primary loadings and secondary loadings
set to zero. Such a technique to reduce the computational power required to compare complex latent models is
common in the literature15,20 and does not hinder our ability to test the hypothesis that the full invariance of the
BFI-2-B would be established for gender, education, and language (H2).
Supplementary Table S2 presents the results of the full invariance analysis of BFI-2-B data. For gender-focused
measurement invariance (male vs. female), all ΔCFAs were
= 0.01 except M5 (strong factorial invariance),
M7 (strict factorial invariance), and all ΔRMSEAs were
0.01 except M10. For education-focused measurement
invariance analysis, all ΔCFAs were
= − 0.01 except M3, M5 (strong factorial invariance), M7 (strict factorial
invariance), and M9, and all ΔRMSEAs were
=0.01. In all cases, CFIs were
0.90, and RMSEAs were
0.0.08.
No tted models were rejected by both the cuto values (ΔRMSEA
0.01 and ΔCFA
≥−0.01
) simultaneously.
ese results suggest the highest level of measurement invariance—complete factorial invariance—for gender
and education levels. By contrast, for language-focused measurement invariance analysis, only M1, M2, and M5
models exhibited acceptable t indices (CFI > 0.90). us, only weak factorial invariance was observed across
language.
Item response theory-based analysis
Supplementary Table S3 presents the IRT parameters, including item discrimination (a), item diculty, and
item-t indices for each of the 15 facets. All items exhibited acceptable item t (int & outt < 1.4, S-χ2 associated
RMSEA statistics
0.06)
. Supplementary Fig. S1 indicates that for all 15 models, Zh statistics are larger than − 2
for most participants, suggesting a good person t. As shown in Supplementary Table S3, among the 60 items, 3
items were very low, 2 items were low, 21 items were moderate, 12 were high, and 22 had very high discrimination
scores. Test information curves (TIC) for the tted models (Supplementary Fig. S2 & Supplementary Table S4)
indicated that each facet had a good range of coverage across the underlying traits with some facets (Compassion:
− 4.5 to 1.8, Respectfulness: − 5.3 to 1.7, Intellectual Curiosity: − 4.5 to 2) skewed toward lower trait levels. Marginal
reliability ranged between 0.57 to 0.88. All facets had marginal reliability > 0.60 except the responsibility facet
(Supplementary Table 3S). Taken together, these results indicate that items of BFI-2-B are well-calibrated to
measure the intended traits across a wide range of the latent continuum, demonstrating adequate psychometric
properties for assessing personality facets within the Bangladeshi population.
Scientic Reports | (2025) 15:11008 8
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Items Mean SD Skew Kurtosis Normality Corrected item total correlation
BFI01 3.52 1.01 − 0.71 − 0.13 0.85* 0.3
BFI06 3.63 0.89 − 0.69 0.56 0.86* 0.45
RBFI11 3.17 1.1 − 0.11 − 1.01 0.89* 0.15
RBFI16 2.64 1.25 0.37 − 1.01 0.88* 0.17
BFI21 2.7 1.13 0.26 − 0.93 0.89* 0.17
RBFI26 3.27 1.12 − 0.19 − 1.05 0.88* 0.37
RBFI31 2.02 0.87 1.22 1.71 0.76* 0.22
RBFI36 2.85 1.07 0.14 − 0.93 0.89* 0.35
BFI41 3.28 1.01 − 0.18 − 0.77 0.90* 0.54
BFI46 3.47 1.12 − 0.51 − 0.59 0.88* 0.25
RBFI51 2.91 1.01 0.18 − 0.66 0.90* 0.31
BFI56 3.72 0.81 − 0.79 0.78 0.82* 0.48
BFI02 4.17 0.75 − 1.11 2.42 0.77* 0.22
BFI07 4.44 0.64 − 1.51 5.24 0.68* 0.31
RBFI12 3.48 1.13 − 0.3 − 1.01 0.88* 0.31
RBFI17 3.99 1.09 − 1.26 0.94 0.78* 0.31
RBFI22 4.1 0.95 − 0.98 0.3 0.80* 0.35
BFI27 4.02 0.9 − 1.05 1.17 0.81* 0.28
BFI32 4.02 0.76 − 0.77 1.03 0.81* 0.38
RBFI37 3.16 1.14 − 0.01 − 1.2 0.87* 0.38
RBFI42 2.64 1.08 0.41 − 0.74 0.88* 0.33
RBFI47 3.99 1.02 − 1.02 0.5 0.82* 0.4
BFI52 4.29 0.58 − 0.47 1.56 0.72* 0.38
BFI57 3.64 0.98 − 0.69 − 0.06 0.85* 0.34
RBFI03 3.34 1.21 − 0.33 − 1.02 0.88* 0.43
RBFI08 2.82 1.22 0.15 − 1.09 0.90* 0.57
BFI13 3.67 0.97 − 0.79 0.26 0.85* 0.19
BFI18 3.59 1.05 − 0.7 − 0.15 0.86* 0.36
RBFI23 2.98 1.16 − 0.06 − 1.1 0.89* 0.5
RBFI28 2.28 0.93 1.00 0.69 0.79* 0.29
BFI33 3.55 1.07 − 0.52 − 0.51 0.88* 0.46
BFI38 3.71 0.77 − 0.65 0.63 0.82* 0.49
BFI43 4.2 0.75 − 0.97 1.42 0.78* 0.34
RBFI48 3.84 1.1 − 0.86 − 0.05 0.84* 0.4
BFI53 3.66 0.99 − 0.71 0.03 0.86* 0.42
RBFI58 2.83 1.11 0.28 − 1.05 0.87* 0.39
RBFI04 2.75 1.11 0.27 − 0.73 0.91* 0.46
RBFI09 2.36 0.99 0.73 0.08 0.86* 0.46
BFI14 2.98 1.21 0.09 − 1.13 0.89* 0.35
BFI19 2.95 1.11 0.07 − 0.99 0.90* 0.44
RBFI24 2.41 1.09 0.67 − 0.24 0.87* 0.41
RBFI29 2.74 1.09 0.51 − 0.65 0.87* 0.49
BFI34 3.62 1.14 − 0.53 − 0.72 0.87* 0.45
BFI39 3.69 1.05 − 0.64 − 0.38 0.86* 0.34
RBFI44 2.45 1.06 0.62 − 0.35 0.87* 0.42
RBFI49 3.24 1.15 − 0.18 − 1.04 0.89* 0.26
BFI54 3.53 1.12 − 0.59 − 0.57 0.87* 0.41
BFI59 2.87 1.19 0.21 − 1.08 0.89* 0.52
RBFI05 3.26 1.23 − 0.19 − 1.15 0.89* 0.25
BFI10 4.03 0.8 − 0.79 0.7 0.82* 0.29
BFI15 3.71 0.82 − 0.72 0.56 0.83* 0.5
BFI20 3.93 0.98 − 1.1 1.02 0.81* 0.22
RBFI25 3.8 1.02 − 0.78 − 0.01 0.85* 0.35
RBFI30 3.42 1.06 − 0.49 − 0.63 0.87* 0.46
BFI35 4.13 0.75 − 0.98 1.88 0.78* 0.33
BFI40 3.73 0.91 − 0.65 0.07 0.85* 0.25
Continued
Scientic Reports | (2025) 15:11008 9
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Convergent and discriminant validity
Table 8 presents domain-level correlations of the BFI-2-B with NEO-FFI. Results indicated that BFI-2-B
converged strongly with NEO-FFI while controlling for gender and age. e correlations between the similar
domains were ≥ 0.59 for all instances. Correlations between distinct domains were ≤ 0.39 for all instances. ese
results indicate good convergent and discriminant validity for the BFI-2-B.
Discussion
e current study aimed to validate the Bangla Big Five Inventory-2 (BFI-2-B) by examining important
psychometric properties, including reliability and validity (structural, convergent, and discriminant),
measurement invariance (across gender, education level and language), and item quality (item discrimination
and test information). e study was designed to test three specic hypotheses: (H1) that the domain and facet-
level structures of the BFI-2 would be reproducible in the Bangladeshi culture, (H2) that full measurement
invariance would be established across gender (male vs female), education levels (Undergrads vs Postgrads)
Cronbach’s α McDonald’s ωt
BFI-2-B NEO-FFI BFI-2-B NEO-FFI
Extraversion 0.76 0.84 0.83 0.84
Agreeableness 0.78 0.72 0.77 0.78
Conscientiousness 0.81 0.61 0.79 0.62
Negative emotionality 0.86 0.87 0.86 0.87
Open mindedness 0.76 0.58 0.77 0.64
Tab le 5. Reliability of the BFI-2-B and Bangla NEO-FFI.
Domain Combined1, N = 646 Female1, N = 358 Male1, N = 288 p
BF-extraversion 3.52 (1.01) 3.45 (1.01) 3.61 (1.02) 0.047
Sociability 3.52 (1.01) 3.45 (1.01) 3.61 (1.02) 0.047
Assertiveness 3.63 (0.89) 3.64 (0.90) 3.62 (0.87) 0.7
Energy level 3.17 (1.10) 3.24 (1.10) 3.09 (1.10) 0.070
BF-agreeableness 4.17 (0.75) 4.22 (0.72) 4.10 (0.79) 0.037
Compassion 4.17 (0.75) 4.22 (0.72) 4.10 (0.79) 0.037
Respectfulness 4.44 (0.64) 4.45 (0.64) 4.42 (0.65) 0.6
Tr ust 3.48 (1.13) 3.56 (1.11) 3.39 (1.16) 0.062
BF-conscientiousness 3.34 (1.21) 3.45 (1.20) 3.21 (1.21) 0.009
Organization 3.34 (1.21) 3.45 (1.20) 3.21 (1.21) 0.009
Productiveness 2.82 (1.22) 2.84 (1.25) 2.80 (1.18) 0.6
Responsibility 3.67 (0.97) 3.64 (0.99) 3.72 (0.94) 0.4
BF-negative emotionality 2.75 (1.11) 2.98 (1.10) 2.45 (1.04) < 0.001
Anxiety 2.75 (1.11) 2.98 (1.10) 2.45 (1.04) < 0.001
Depression 2.36 (0.99) 2.47 (1.01) 2.23 (0.96) < 0.001
Emotional volatility 2.98 (1.21) 3.14 (1.22) 2.77 (1.18) < 0.001
BF-open-mindedness 3.26 (1.23) 3.33 (1.19) 3.16 (1.28) 0.11
Intellectual curiosity 4.03 (0.80) 3.92 (0.82) 4.16 (0.76) < 0.001
Aesthetic sensitivity 3.26 (1.23) 3.33 (1.19) 3.16 (1.28) 0.11
Creative imagination 3.71 (0.82) 3.60 (0.85) 3.85 (0.76) < 0.001
Tab le 4. Descriptive statistics for BFI-2-B domains and facets. 1Mean (SD); p value is reported for independent
sample t-tests.
Items Mean SD Skew Kurtosis Normality Corrected item total correlation
RBFI45 3.93 0.97 − 0.81 0.02 0.83* 0.21
RBFI50 3.69 1.19 − 0.79 − 0.27 0.85* 0.26
RBFI55 3.37 0.91 − 0.08 − 0.6 0.89* 0.12
BFI60 3.42 0.97 − 0.42 − 0.3 0.89* 0.42
Tab le 3. Descriptives of BFI-2-B items. *p < 0.001.
Scientic Reports | (2025) 15:11008 10
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Negative Emotionality Conscientiousness Extraversion Open mindedness Agreeableness
BF-extraversion
BFI01 0.022 0.048 0.412 0.227 − 0.045
BFI06 − 0.228 0.135 0.116 0.314 − 0.006
RBFI11 0.369 − 0.044 0.190 0.151 − 0.090
RBFI16 0.151 − 0.065 0.802 − 0.199 0.075
BFI21 − 0.121 0.059 0.304 0.312 − 0.482
RBFI26 − 0.122 0.188 0.363 − 0.004 0.060
RBFI31 − 0.112 − 0.049 0.635 − 0.253 0.033
RBFI36 − 0.211 − 0.042 0.395 0.201 − 0.156
BFI41 − 0.243 0.300 0.218 0.375 − 0.191
BFI46 0.163 0.027 0.568 0.093 0.022
RBFI51 − 0.081 0.137 0.413 0.079 − 0.137
BFI56 − 0.072 0.095 0.336 0.468 − 0.056
BF-agreeableness
BFI02 0.080 0.168 − 0.265 0.454 0.130
BFI07 − 0.154 0.176 − 0.298 0.506 0.055
RBFI12 − 0.183 0.058 − 0.034 − 0.121 0.460
RBFI17 0.099 0.026 0.015 0.172 0.526
RBFI22 − 0.230 0.087 − 0.262 0.096 0.417
BFI27 − 0.142 0.191 − 0.323 0.368 0.050
BFI32 0.011 0.275 − 0.199 0.467 0.128
RBFI37 − 0.156 0.047 − 0.027 − 0.032 0.517
RBFI42 − 0.195 0.097 0.118 − 0.302 0.420
RBFI47 0.023 0.135 0.017 0.070 0.559
BFI52 − 0.111 0.221 − 0.313 0.539 0.137
BFI57 − 0.205 0.174 − 0.199 0.261 0.087
BF-conscientiousness
RBFI03 0.028 0.841 0.058 − 0.221 0.049
RBFI08 − 0.156 0.551 0.283 − 0.234 0.221
BFI13 0.013 0.083 − 0.004 0.338 − 0.003
BFI18 0.105 0.875 − 0.057 − 0.034 − 0.076
RBFI23 − 0.259 0.300 0.252 − 0.114 0.149
RBFI28 − 0.191 0.408 0.076 − 0.162 − 0.027
BFI33 0.114 0.960 − 0.023 − 0.074 0.001
BFI38 − 0.200 0.266 0.107 0.426 − 0.126
BFI43 − 0.041 0.297 − 0.173 0.495 − 0.053
RBFI48 0.138 0.806 0.015 − 0.157 0.163
BFI53 − 0.046 0.489 0.024 0.263 − 0.098
RBFI58 − 0.186 0.237 0.146 − 0.146 0.232
BF-negative emotionality
RBFI04 0.646 − 0.067 0.186 − 0.340 0.174
RBFI09 0.595 − 0.116 0.127 − 0.277 0.143
BFI14 0.434 − 0.023 − 0.074 0.178 − 0.198
BFI19 0.534 − 0.030 0.042 0.232 − 0.383
RBFI24 0.416 − 0.221 − 0.045 − 0.177 0.188
RBFI29 0.812 0.004 0.182 − 0.235 0.117
BFI34 0.747 0.095 − 0.126 0.150 − 0.083
BFI39 0.659 0.061 − 0.232 0.261 − 0.005
RBFI44 0.608 − 0.153 0.342 − 0.296 0.131
RBFI49 0.520 − 0.039 0.104 − 0.088 0.172
BFI54 0.665 0.018 − 0.223 0.170 0.033
BFI59 0.777 0.051 0.045 − 0.066 − 0.069
BF-open mindedness
RBFI05 0.107 − 0.198 0.301 0.002 0.517
BFI10 0.050 − 0.206 0.254 0.473 0.131
BFI15 − 0.137 0.081 0.180 0.541 − 0.006
Continued
Scientic Reports | (2025) 15:11008 11
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
and language (Bangla vs English), (H3) that the BFI-2-B would show strong convergent validity (r
0.50)
for
similar domains measured using the Bangla NEO-FFI, as well as discriminant validity between distinct domains.
e BFI-2-B demonstrated satisfactory internal consistency for the ve major domains, with Cronbach’s α and
McDonald’s ωt values surpassing 0.76 for all domains. ese ndings align with the reliability indices reported for
the original English BFI-214= 0.83–0.90) and adaptations in other languages, such as Chinese15 and Dutch18.
Model χ2df CFI TLI RMSEA
RMSEA 90%
CI
SRMRLower Upper
Extraversion
M1: single domain 483.24 54 0.906 0.885 0.111 0.102 0.120 0.084
M2:single domain plus acquiescence 481.66 53 0.908 0.883 0.112 0.103 0.121 0.084
M3: positive and negative items 470.55 53 0.908 0.886 0.111 0.101 0.120 0.081
M4: three facets 226.96 51 0.961 0.950 0.073 0.064 0.083 0.062
M5:three facets plus acquiescence 204.16 50 0.966 0.955 0.069 0.059 0.079 0.059
Agreeableness
M1: single domain 338.72 54 0.944 0.932 0.091 0.082 0.100 0.077
M2: single domain plus acquiescence 208.56 53 0.970 0.962 0.068 0.058 0.077 0.058
M3: positive and negative items 219.95 53 0.967 0.959 0.070 0.061 0.080 0.062
M4: three facets 282.47 51 0.955 0.941 0.084 0.075 0.094 0.071
M5: three facets plus acquiescence 196.57 50 0.971 0.962 0.068 0.058 0.078 0.057
Conscientiousness
M1:single domain 474.28 54 0.972 0.966 0.110 0.101 0.119 0.086
M2: single domain plus acquiescence 273.01 53 0.985 0.982 0.080 0.071 0.090 0.067
M3: positive and negative items 460.28 53 0.973 0.966 0.109 0.100 0.118 0.085
M4: three facets 227.67 51 0.988 0.985 0.073 0.064 0.083 0.064
M5: three facets plus acquiescence 119.34 50 0.995 0.994 0.047 0.036 0.058 0.045
Negative emotionality
M1: single domain 501.60 54 0.962 0.954 0.113 0.104 0.123 0.079
M2: single domain plus acquiescence 501.33 53 0.962 0.953 0.115 0.105 0.124 0.079
M3: positive and negative items 360.64 53 0.974 0.968 0.095 0.086 0.104 0.064
M4:three facets 398.11 51 0.971 0.962 0.103 0.093 0.112 0.074
M5: three facets plus acquiescence 386.65 50 0.972 0.962 0.102 0.093 0.112 0.074
Open-mindedness
M1: single domain 672.83 54 0.898 0.875 0.133 0.124 0.142 0.101
M2: single domain plus acquiescence 475.31 53 0.930 0.913 0.111 0.102 0.120 0.087
M3: positive and negative items 648.49 53 0.901 0.877 0.132 0.123 0.141 0.099
M4: three facets 163.74 51 0.981 0.976 0.059 0.049 0.069 0.053
M5: three facets plus acquiescence 158.76 50 0.982 0.982 0.058 0.048 0.068 0.053
Tab le 7. Fit statistics for conrmatory factor analysis of the BFI-2-B items. df = degrees of freedom;
CFI = Comparative t index; TLI = Tucker-Lewis Index; RMSEA = Root means square error of approximation,
CFI and TLI values
0.900, RMSEA values
<0.080
, and SRMR values
<0.10
are bolded. All Chi-Square
values are signicant at p < 0.001.
Negative Emotionality Conscientiousness Extraversion Open mindedness Agreeableness
BFI20 0.482 − 0.267 0.517 0.047 0.790
RBFI25 − 0.066 − 0.171 0.267 0.259 0.249
RBFI30 − 0.061 − 0.064 0.318 0.288 0.271
BFI35 0.347 − 0.197 0.416 0.164 0.752
BFI40 − 0.174 0.205 − 0.256 0.416 − 0.169
RBFI45 0.061 − 0.202 0.171 0.354 0.158
RBFI50 0.300 − 0.206 0.392 − 0.117 0.780
RBFI55 0.068 − 0.276 0.242 0.065 0.306
BFI60 − 0.120 0.014 0.195 0.468 − 0.019
Tab le 6. ESEM factor loadings of the 60 items of BFI-2-B. e highest factor loading for each row is in bold.
Scientic Reports | (2025) 15:11008 12
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Most facets also demonstrated adequate reliability. However, intellectual curiosity under BF-Open Mindedness,
energy level under BF-Extraversion, and responsibility under BF-Conscientiousness exhibited lower reliability.
ese discrepancies may be due to cultural dierences in how personality traits are valued and expressed in Asian
versus Western contexts. Intellectual curiosity, which entails a desire for new ideas and intellectual engagement,
is a highly valued trait in many Western societies, where individualism and self-exploration are encouraged60.
Studies show that Western cultures emphasize intellectual pursuits for personal growth and self-expression61.
In contrast, intellectual curiosity may hold a dierent position within Asian cultures, where collectivist values
may prioritize conformity, respect for tradition, and group harmony over individual exploration62. is cultural
dierence could aect how respondents interpret items related to intellectual curiosity, leading to inconsistent
responses and lower reliability scores. e energy level facet within BF-Extraversion reects enthusiasm, vitality,
and social engagement, traits that are prominently valued in Western societies where extraversion is oen seen
as an asset in social and professional settings63. In Asian contexts, however, social restraint and modesty are
oen more valued, and high-energy behaviors may be interpreted dierently or even discouraged, especially
in social interactions64. Consequently, items measuring energy level might not resonate as strongly with
respondents in Asian cultures64. e lower reliability of the responsibility (BF-Conscientiousness) facet may
also reect cultural nuances in interpreting this trait. In Western cultures, responsibility is typically associated
with individual dependability and personal accountability65, traits that are seen as stable and integral to one’s
personality. However, in many Asian societies, responsibility may be viewed more contextually and may be tied
closely to roles within family and community structures61. Rather than being a stable, individual characteristic,
responsibility may be perceived as a situational obligation that one upholds in specic contexts (e.g., family
responsibilities) rather than as a broad, consistent trait . is variability in interpretation can lead to inconsistent
responses and lower reliability for responsibility-related items in Asian contexts.
e results supported our rst hypothesis that the domain and facet-level structures of the BFI-214 would
be reproducible in the Bangladeshi culture. It is reassuring that at the domain level, our results supported the
existence of the Big Five traits mirroring the results found in other international adaptations15,16,29. However,
some items were observed to have cross-loadings, especially for the BF-Open Mindedness and BF-Agreeableness
domains. is suggests that though the overall domain structure of BFI-2 is recognizable in its Bangla adaptation,
item-level variations were observed, which may be due to the cultural and linguistic contexts. For example,
certain items related to interpersonal behavior or emotional expressions might carry distinct connotations
in the Bangladeshi context, thus altering their interpretations. e translation process, while rigorous, may
not fully capture the subtleties embedded in the original items. At the facet level, consistent with the original
development work of the BFI-214, the best model t for the BFI-2-B was achieved when each of the ve major
personality domains was modeled with three distinct facets per domain and an additional acquiescence factor.
is approach aligns with the BFI-2’s design philosophy. Including an acquiescence factor is particularly
benecial as it accounts for response bias related to individuals’ tendency to agree with statements regardless of
content. is factor enhances the model’s robustness by adjusting for variance arising from acquiescent response
styles rather than true dierences in personality traits. us, structuring each domain with three specic facets
alongside an acquiescence factor mirrors the theoretical underpinnings and methodological rigor of the BFI-
2-B. is structure has proven eective across various languages and contexts, indicating that this facet-based,
multidimensional personality measurement approach is versatile and adaptable.
Supporting our second hypothesis, results showed that BFI-2-B demonstrated full measurement invariance
across gender (male vs. female) and education levels (undergraduates vs. postgraduates). Similar ndings were
reported in the Japanese adaptation29 of BFI-2, where the highest level (strict) of measurement invariance was
established across genders. e establishment of MI across gender and education level suggests that the BFI-B
items are interpreted similarly among male and female participants and individuals with dierent educational
levels within the Bangladeshi sample. is cross-group comparability highlights the scale’s suitability for diverse
applications, making it a valuable tool for research and practice in varied contexts. However, one important
NEO-agreeableness NEO-conscientiousness NEO-extraversion NEO-NE NEO-open mindedness
Correlation of BFI-2 and NEO-FFI (controlled for gender)
BF-agreeableness 0.59 0.35 0.25 − 0.29 0.09
BF-conscientiousness 0.19 0.79 0.32 − 0.35 − 0.01
BF-Extraversion − 0.05 0.38 0.73 − 0.35 0.10
BF-NE − 0.26 − 0.38 − 0.26 0.79 − 0.01
BF-Open mindedness 0.13 0.30 0.23 − 0.21 0.64
Correlation of BFI-2 and NEO-FFI (controlled for age)
BF-Agreeableness 0.59 0.32 − .23 − 0.24 0.10
BF-Conscientiousness 0.17 0.77 − .29 − 0.01 − 0.22
BF-Extraversion − 0.07 − .38 − 0.73 − 0.35 0.09
BF-NE − 0.18 − 0.33 − 0.24 0.78 0.01
BF-Open mindedness 0.13 0.30 0.23 − 0.21 0.64
Tab le 8. Correlations of the BFI-2-B domains with NEO-FFI. All correlations are signicant (p < 0.001). e
highest correlation in each row is in bold.
Scientic Reports | (2025) 15:11008 13
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
pattern to be highlighted is that for some domains and facets, females tended to score higher than males (at the
mean level). In the current study, females described themselves as more agreeable, conscientious, and emotional.
At the facet level, women described themselves as more compassionate, trustworthy, organized, anxious,
depressed, and emotionally volatile than men. By contrast, men described themselves as more intellectually
creative. Such ndings are aligned with the existing literature14,15. Soto and John14 reported that females tend
to describe themselves as more extroverted, agreeable, conscientious, and emotional than males. Yoshino, et
al.29 while adapting BFI-2 in the Japanese language reported that within the BF-Extraversion domain, males
had a higher assertiveness score than females, while females had a higher sociability score. Future research is
warranted to investigate these gender-based dierences in personality traits.
In contrast to our expectation that full measurement invariance across language (English vs Bangla) would
be observed for BFI-2-B, our results supported only weak measurement invariance for language. Similar
challenges in language-based measurement invariance were also observed in the German adaptation20, where
exact measurement invariance was estimated using CFA, congural invariance was reported for agreeableness
and open-mindedness, and weak measurement invariance was reported for extraversion and conscientiousness.
ese ndings collectively point to potential challenges related to cultural and linguistic interpretations of specic
constructs and items. In Western cultures, traits like openness and extraversion are oen positively associated
with self-expression and individual achievement, whereas in Bangladesh and other South Asian cultures, these
traits may be viewed through a lens that emphasizes social harmony, modesty, and family obligations66,67.
Furthermore, words associated with traits like responsibility or intellectual curiosity may hold slightly dierent
connotations in Bangla than in English, potentially aecting how respondents interpret these items. e lack of
MI between Bangla and English BFI-2 may also be partly attributed to the dierences in response styles between
the two cultures. Research indicates that people from collectivist cultures, such as Bangladesh, may respond to
personality assessments in ways that reect social desirability or normative behaviors, prioritizing community-
oriented values over individual traits. In contrast, respondents in individualistic Western cultures might
prioritize self-expression, leading to dierent response patterns66,68. ese variations can aect item functioning,
as some BFI-2 items may resonate dierently in a collectivist versus an individualist cultural context, even if the
underlying construct is similar.
e Item Response eory (IRT) analysis of the BFI-2-B provided insights into item functioning and
measurement precision across the een personality facets. Overall, most items displayed acceptable t indices,
indicating their adequacy in capturing the underlying traits within each facet. Items demonstrated a range
of discrimination values, with 22 items showing very high discrimination, 12 items high, 21 items moderate,
and a few items with lower discrimination scores. e test information curves (TICs) further indicated that
most facets covered a wide range of the latent trait continuum, particularly around the mean levels of traits.
However, certain facets, such as compassion and respectfulness within BF-Agreeableness and aesthetic sensitivity
within BF-Open Mindedness, showed coverage skewed towards lower trait levels, suggesting that the BFI-
2-B items within these facets provide more reliable information for individuals with lower trait expressions.
Marginal reliability estimates were satisfactory for most facets, with values above 0.60 across facets, except for
responsibility (BF-Conscientiousness), which fell slightly below the threshold. e lower reliability of this facet
may reect that cultural interpretation of responsibility is context-dependent rather than a stable, individual
trait61. is variability suggests that further renement of items within these facets could enhance reliability in
the Bangladeshi setting.
e results of the current study supported our third hypothesis that the BFI-2-B would show strong convergent
validity (> 0.50) for the similar domains measured using the Bangla NEO-FFI, and discriminant correlations
between distinct domains would be lower than the convergent correlations. is reinforces the robustness of
the BFI-2-B as a valid personality assessment tool. e BFI-2-B demonstrated strong convergent validity, with
correlations for similar domains exceeding r = 0.59. ese high correlations suggest that the BFI-2-B eectively
measures personality constructs comparable to those captured by the NEO-FFI, aligning with expected patterns
based on the theoretical overlap between these two well-established personality models. Discriminant validity
was conrmed since the correlations between distinct domains were substantially lower than the correlations
between similar domains. is dierentiation among domains suggests that the BFI-2-B eectively distinguishes
between personality traits. ese ndings are consistent with results from the development work of English
BFI-214 that reported high convergent and discriminant validity with established personality measures. In both
Western and non-Western adaptations, high correlations between the BFI-2 and NEO-FFI domains underscore
the reliability of the BFI-2 structure across dierent cultural and linguistic settings, supporting the broad
applicability of the Big Five model.
Limitations and future directions
While the BFI-2-B has demonstrated promising psychometric properties in this study, several limitations must
be acknowledged. First, although the sample size was adequate, the study relied on an internet-based sample,
which may introduce sampling bias. Internet samples may not fully represent the diversity of the Bangladeshi
population, particularly regarding socioeconomic, geographical, and educational backgrounds. Additionally,
this study retained data from 646 participants out of the initial 1,095. is exclusion of 41% of the data due
to participants failing attention checks, may impact the representativeness and generalizability of the results.
Second, measurement invariance across languages showed only weak invariance, indicating potential cultural
and linguistic dierences in item interpretation between Bangla and English versions. is limitation suggests
that the translation process may not capture subtle cultural nuances in certain personality traits, potentially
impacting the cross-linguistic validity of the BFI-2-B. Furthermore, certain facets within domains, such as energy
level (BF-Extraversion), responsibility (BF- Conscientiousness), and intellectual curiosity (BF-Open-mindedness),
Scientic Reports | (2025) 15:11008 14
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
exhibited lower reliability. is could indicate cultural dierences in trait interpretation, which may aect the
reliability of these facets in non-Western samples.
To address these limitations and enhance the robustness of the BFI-2-B, future studies should consider
employing mixed-method approaches, including cognitive interviews, to gain deeper insights into how
Bangladeshi respondents interpret specic items. is could help identify cultural nuances inuencing
responses and improve translation accuracy. Additionally, future research should seek to replicate these ndings
across more diverse samples, including rural populations and those from varied socioeconomic backgrounds,
to enhance generalizability. Finally, to improve cross-linguistic measurement invariance, researchers might
develop culture-specic items or adapt existing items to better align with local norms and values .
Conclusion
is study represents a comprehensive psychometric validation of the Bangla Big Five Inventory-2 (BFI-2-B),
providing a valuable tool for assessing personality traits in the Bangladeshi population. e BFI-2-B demonstrated
strong internal consistency (especially at the domain levels) and structural validity (at both the domain and facet
levels), reinforcing its alignment with the theoretical framework of the Big Five personality model. e study
successfully established full measurement invariance across gender and education levels, highlighting the BFI-2-
B’s suitability for diverse applications within these demographic groups in Bangladesh. Additionally, the strong
convergent and discriminant validity demonstrated by the BFI-2-B supports its capability to measure distinct
personality constructs eectively. However, challenges emerged in achieving full measurement invariance
across languages, suggesting potential cultural and linguistic nuances in item interpretation that warrant further
exploration. Lower reliability scores for specic facets, such as energy level and intellectual curiosity, underscore
the need for caution when interpreting these facets in Bangladeshi contexts. Overall, the BFI-2-B provides a
psychometrically robust, culturally adapted measure of personality that holds promise for research and applied
settings in Bangladesh. With continued research and adaptation, the BFI-2-B can serve as a foundational tool for
advancing personality assessment in Bangladesh, contributing to more nuanced understandings of personality
traits within this cultural context.
Data availability
All raw data and materials are publicly available on Open Science Framework (https://osf.io/7dtqg/) and GitHub
(https://github.com/mind-psychometry/BanglaBigFive2).
Code availability
Data analysis codes are publicly available on Open Science Framework (https://osf.io/7dtqg/) and GitHub
(https://github.com/mind-psychometry/BanglaBigFive2).
Received: 21 November 2023; Accepted: 11 February 2025
References
1. Roberts, B. W. Back to the future: Personality and Assessment and personality development. J. Res. Pers. 43, 137–145. h t t p s : / / d o i . o
r g / 1 0 . 1 0 1 6 / j . j r p . 2 0 0 8 . 1 2 . 0 1 5 (2009).
2. Strickhouser, J. E., Zell, E. & Krizan, Z. Does personality predict health and well-being? A metasynthesis. Health Psychol. 36,
797–810. https://doi.org/10.1037/hea0000475 (2017).
3. Judge, T. A., Rodell, J. B., Klinger, R. L., Simon, L. S. & Crawford, E. R. Hierarchical representations of the ve-factor model of
personality in predicting job performance: Integrating three organizing frameworks with two theoretical perspectives. J. Appl.
Psychol. 98, 875–925. https://doi.org/10.1037/a0033901 (2013).
4. De Feyter, T., Caers, R., Vigna, C. & Berings, D. Unraveling the impact of the Big Five personality traits on academic performance:
e moderating and mediating eects of self-ecacy and academic motivation. Learn. Individual Dier. 22, 439–448. h t t p s : / / d o i .
o r g / 1 0 . 1 0 1 6 / j . l i n d i f . 2 0 1 2 . 0 3 . 0 1 3 (2012).
5. Digman, J. M. Personality structure: Emergence of the ve-factor model. Annu. Rev. Psychol. 41, 417–440. h t t p s : / / d o i . o r g / 1 0 . 1 1 4 6
/ a n n u r e v . p s . 4 1 . 0 2 0 1 9 0 . 0 0 2 2 2 1 (1990).
6. Goldberg, L. R. Language and individual dierences: e search for universals in personality lexicons. Rev. Personal. Soc. Psychol.
2, 141–165 (1981).
7. John, O. P., Naumann, L. P., & Soto, C. J. Paradigm shi to the integrative Big Five trait taxonomy: History, measurement, and
conceptual issues Vol. 3rd (e Guilford Press, 2008).
8. Goldberg, L. R. A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several ve-factor
models. Personal. Psychol. Europe 7, 7–28 (1999).
9. Ashton, M. C., Jackson, D. N., Paunonen, S. V., Helmes, E. & Rothstein, M. G. e criterion validity of broad factor scales versus
specic facet scales. J. Res. Pers. 29, 432–442. https://doi.org/10.1006/jrpe.1995.1025 (1995).
10. Paunonen, S. V. & Ashton, M. C. Big Five factors and facets and the prediction of behavior. J. Pers. Soc. Psychol. 81, 524–539.
https://doi.org/10.1037/0022-3514.81.3.524 (2001).
11. Costa, P. T. NEO personality inventory-revised (NEO PI-R). (Odessa, Fla. (P.O. Box 998, Odessa 33556): Psychological Assessment
Resources, [1992] ©1992, 1992).
12. DeYoung, C. G., Quilty, L. C. & Peterson, J. B. Between facets and domains: 10 aspects of the Big Five. J. Pers. Soc. Psychol. 93,
880–896. https://doi.org/10.1037/0022-3514.93.5.880 (2007).
13. John, O. P., Donahue, E. M. & Kentle, R. L. e Big Five Inventory-Versions 4a and 54 (University of California, Berkeley, 1991).
14. Soto, C. J. & John, O. P. e Next Big Five Inventory (BFI-2): Developing and assessing a hierarchical model with 15 facets to
enhance bandwidth, delity, and predictive power. J. Pers. Soc. Psychol. 113, 117–143. https://doi.org/10.1037/pspp0000096 (2017).
15. Zhang, B. et al. e big ve inventory–2 in China: A comprehensive psychometric evaluation in four diverse samples. Assessment
29, 1262–1284. https://doi.org/10.1177/10731911211008245 (2022).
16. Cemalcilar, Z. et al. Testing the BFI-2 in a non-WEIRD community sample. Personal. Individual Dier. 182, 111087. h t t p s : / / d o i . o r
g / 1 0 . 1 0 1 6 / j . p a i d . 2 0 2 1 . 1 1 1 0 8 7 (2021).
Scientic Reports | (2025) 15:11008 15
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
17. Vedel, A. et al. Development and validation of the danish big ve inventory-2:? Domain- And facet-level structure, construct
validity, and reliability. Eur. J. Psychol. Assessment 37, 42–51. https://doi.org/10.1027/1015-5759/a000570 (2021).
18. Denissen, J. J. A., Geenen, R., Soto, C. J., John, O. P. & van Aken, M. A. G. e Big Five Inventory-2: Replication of psychometric
properties in a Dutch adaptation and rst evidence for the discriminant predictive validity of the facet scales. J Pers Assess 102,
309–324. https://doi.org/10.1080/00223891.2018.1539004 (2020).
19. Shchebetenko, S., Kalugin, A. Y., Mishkevich, A. M., Soto, C. J. & John, O. P. Measurement invariance and sex and age dierences of
the Big Five Inventory–2: Evidence from the Russian version. Assessment 27, 472–486. https://doi.org/10.1177/1073191119860901
(2020).
20. Rammstedt, B., Danner, D., Soto, C. J. & John, O. P. Validation of the short and extra-short forms of the Big Five Inventory-2 (BFI-
2) and their German adaptations. Eur. J. Psychol. Assessment 36, 149–161. https://doi.org/10.1027/1015-5759/a000481 (2020).
21. Roberts, B. W., Kuncel, N. R., Shiner, R., Caspi, A. & Goldberg, L. R. e power of personality: e comparative validity of
personality traits, socioeconomic status, and cognitive ability for predicting important life outcomes. Perspect. Psychol. Sci. 2,
313–345. https://doi.org/10.1111/j.1745-6916.2007.00047.x (2007).
22. Soto, C. J. How replicable are links between personality traits and consequential life outcomes? e life outcomes of personality
replication project Psychol. Sci. 30(711), 727. https://doi.org/10.1177/0956797619831612 (2019).
23. Ahya, A. & Siaputra, I. B. Validation of Big Five Inventory-2 BFI-2 for Indonesia: Not perfect but still valid and reliabel to measure
personality. Jurnal Psikologi Ulayat 9, 179–203 (2022).
24. Islam, M. N. e Big Five model of personality in Bangladesh: Examining the ten-item personality inventory. Psihologija 52,
395–412. https://doi.org/10.2298/PSI181221013I (2019).
25. Chiorri, C., Marsh, H. W., Ubbiali, A. & Donati, D. Testing the factor structure and measurement invariance across gender of the
big ve inventory through exploratory structural equation modeling. J. Pers. Assess. 98, 88–99. h t t p s : / / d o i . o r g / 1 0 . 1 0 8 0 / 0 0 2 2 3 8 9 1 .
2 0 1 5 . 1 0 3 5 3 8 1 (2016).
26. Srivastava, S., John, O. P., Gosling, S. D. & Potter, J. Development of personality in early and middle adulthood: Set like plaster or
persistent change?. J. Personal. Soc. Psychol. 84, 1041–1053. https://doi.org/10.1037/0022-3514.84.5.1041 (2003).
27. S chmitt, D. P., Realo, A., Voracek, M. & Allik, J. Why can’t a man be more like a woman? Sex dierences in big ve personality traits
across 55 cultures. J. Personal. Soc. Psychol. 94, 168–182. https://doi.org/10.1037/0022-3514.94.1.168 (2008).
28. Dahmann, S. C. & Anger, S. e impact of education on personality: Evidence from a German high school reform. (2014).
29. Yoshino, S. et al. A validation of the Japanese adaptation of the Big Five Inventory-2. Front. Psychol. 13, 924351 (2022).
30. Bowden, S., Saklofske, D., Van de Vijver, F., Sudarshan, N. & Eysenck, S. Cross-cultural measurement invariance of the Eysenck
Personality Questionnaire across 33 countries. Personal. Individual Dier. 103, 53–60 (2016).
31. Marsh, H. W. Application of conrmatory factor analysis and structural equation modeling in sport and exercise psychology.
Handbook of sport psychology, 774–798 (2007).
32. Asparouhov, T. & Muthén, B. Exploratory structural equation modeling. Struct. Equ. Model. 16, 397–438. h t t p s : / / d o i . o r g / 1 0 . 1 0 8 0 /
1 0 7 0 5 5 1 0 9 0 3 0 0 8 2 0 4 (2009).
33. Danner, D., Lechner, C. M., S oto, C. J. & John, O. P. Modelling the incremental value of personality facets: e domains-incremental
facets-acquiescence bifactor showmodel. Eur. J. Personal. 35, 67–84. https://doi.org/10.1002/per.2268 (2021).
34. Gomes, C. M. A. & Gjikuria, E. Comparing the ESEM and CFA approaches to analyze the Big Five factors. Avaliação Psicológica
16, 261–267 (2017).
35. Lignier, B. et al. Factor structure, psychometric properties, and validity of the Big Five Inventory-2 facets: evidence from the French
adaptation (BFI-2-Fr). Curr. Psychol., 1–16 (2022).
36. Soto, C. J. & John, O. P. Short and extra-short forms of the Big Five Inventory–2: e BFI-2-S and BFI-2-XS. J. Res. Personal. 68,
69–81 (2017).
37. Costa, P. & McCrae, R. NEO Inventories Professional Manual (Inc, 2010).
38. Halama, P., Kohút, M., Soto, C. J. & John, O. P. Slovak adaptation of the Big Five Inventory (BFI-2): Psychometric properties and
initial validation. Stud. Psychol. 62, 74–87 (2020).
39. Bartram, D. et al. ITC guidelines for translating and adapting tests. Int. J. Test. 18, 101–134. h t t p s : / / d o i . o r g / 1 0 . 1 0 8 0 / 1 5 3 0 5 0 5 8 . 2 0 1
7 . 1 3 9 8 1 6 6 (2018).
40. Polit, D. F., Beck, C. T. & Owen, S. V. Is the CVI an acceptable indicator of content validity? Appraisal and recommendations. Res.
Nurs. Health 30, 459–467. https://doi.org/10.1002/nur.20199 (2007).
41. Lynn, M. R. Determination and quantication of content validity. Nurs. Res. 35, 382–386. h t t p s : / / d o i . o r g / 1 0 . 1 0 9 7 / 0 0 0 0 6 1 9 9 - 1 9 8 6
1 1 0 0 0 - 0 0 0 1 7 (1986).
42. Nicolaou, A. I. & Masoner, M. M. Sample size requirements in structural equation models under standard conditions. Int. J.
Account. Inf. Syst. 14, 256–274. https://doi.org/10.1016/j.accinf.2013.11.001 (2013).
43. Nunnally, J. C. Psychometric eory. (McGraw-Hill, 1967).
44. Chalmers, R. P. & Adkins, M. C. Writing eective and reliable Monte Carlo simulations with the SimDesign package. Quant.
Methods Psychol. 16, 248–280 (2020).
45. Faul, F., Erdfelder, E., Buchner, A. & Lang, A.-G. Statistical power analyses using G* Power 3.1: Tests for correlation and regression
analyses. Behav. Res. Methods 41, 1149–1160 (2009).
46. R Core Team. R: A language and environment for statistical computing. (2023).
47. Revelle, W. psych: Procedures for Psychological, Psychometric, and Personality Research. (Northwestern University, 2023).
48. Mateus, S. & Leon, T. d. B. esemComp: ESEM-within-CFA syntax composer. (2022).
49. Rosseel, Y. lavaan: An R package for structural equation modeling. J. Stat. Soware 48, 1–36 (2012).
50. Tóth-Király, I., Bõthe, B., Rigó, A. & Orosz, G. An illustration of the Exploratory structural equation modeling (ESEM) framework
on the passion scale. Front. Psychol. 8, 1968–1968. https://doi.org/10.3389/fpsyg.2017.01968 (2017).
51. Conventional criteria versus new alternatives. Hu, L. t. & Bentler, P. M. Cuto criteria for t indexes in covariance structure
analysis. Struct. Equ. Model. Multidiscipl. J. 6, 1–55. https://doi.org/10.1080/10705519909540118 (1999).
52. Marsh, H. W. et al. Exploratory structural equation modeling, integrating CFA and EFA: Application to students’ evaluations of
university teaching. Struct. Equ. Model. 16, 439–476. https://doi.org/10.1080/10705510903008220 (2009).
53. Meredith, W. Measurement invariance, factor analysis and factorial invariance. Psychometrika 58, 525–543. h t t p s : / / d o i . o r g / 1 0 . 1 0 0
7 / B F 0 2 2 9 4 8 2 5 (1993).
54. Marsh, H. W. et al. A new look at the big ve factor structure through exploratory structural equation modeling. Psychol. Assess 22,
471–491. https://doi.org/10.1037/a0019227 (2010).
55. Cheung, G. W. & Rensvold, R. B. Evaluating goodness-of-t indexes for testing measurement invariance. Struct. Equ. Model. 9,
233–255 (2002).
56. Chalmers, R. Mirt: A multidimensional item response theory package for the R environment. J. Stat. Sow. 48, 1–29 (2012).
57. Samejima, F. Estimation of latent ability using a response pattern of graded scores. Psychometrika 35, 139. h t t p s : / / d o i . o r g / 1 0 . 1 0 0 7
/ B F 0 2 2 9 0 5 9 9 (1970).
58. Desjardins, C. & Bulut, O. Handbook of Educational Measurement and Psychometrics Using R. 1st edn, (2018).
59. Baker, F. B. e Basics of Item Response eory Using R. 1st edn, 27–34 (Springer, 2017).
60. Silvia, P. 95–98 (Oxford University Press, 2006).
61. Markus, H. R. & Kitayama, S. in College Student Development and Academic Life 264–293 (Routledge, 2014).
62. Chiu, C.-Y. & Hong, Y.-Y. in Social Psychology: Handbook of Basic Principles 785–804 (Guilford Press, 2007).
Scientic Reports | (2025) 15:11008 16
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
63. Wilmot, M. P., Wanberg, C. R., Kammeyer-Mueller, J. D. & Ones, D. S. Extraversion advantages at work: A quantitative review and
synthesis of the meta-analytic evidence. J. Appl. Psychol. 104, 1447 (2019).
64. Luo, R., Tamis-LeMonda, C. S. & Song, L. Chinese parents’ goals and practices in early childhood. Early Childhood Res. Quart. 28,
843–857. https://doi.org/10.1016/j.ecresq.2013.08.001 (2013).
65. Hofstede, G. Culture’s consequences: Comparing values, behaviors, institutions and organizations across nations. ousand Oaks
(2001).
66. Triandis, H. C. Individualism-collectivism and personality. J. Personal. 69, 907–924 (2001).
67. Church, A. T. Culture and personality: Toward an integrated cultural trait psychology. J. Personal. 68, 651–703 (2000).
68. Heine, S. J. & Buchtel, E. E. Personality: e universal and the culturally specic. Annu. Rev. Psychol. 60, 369–394 (2009).
Acknowledgements
is research project is supported by a Startup Grant for New Faculty (FY2024-2025) administered by the North
Soth University Oce of Research.
Author contributions
M.A.S., F.K., C.J.S, and S.H. designed the study. M.A.S. and F.K. curated the data. M.A.S. conducted the formal
analysis and prepared all gures and tables. M.A.S., F.K., C.J.S, and S.H. wrote and reviewed the main manuscript
text. All authors reviewed the manuscript.
Declarations
Competing interests
e authors declare no competing interests.
Additional information
Supplementary Information e online version contains supplementary material available at h t t p s : / / d o i . o r g / 1
0 . 1 0 3 8 / s 4 1 5 9 8 - 0 2 5 - 9 0 2 6 4 - 0 .
Correspondence and requests for materials should be addressed to S.H.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional aliations.
Open Access is article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives
4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in
any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide
a link to the Creative Commons licence, and indicate if you modied the licensed material. You do not have
permission under this licence to share adapted material derived from this article or parts of it. e images or
other third party material in this article are included in the article’s Creative Commons licence, unless indicated
otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence
and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to
obtain permission directly from the copyright holder. To view a copy of this licence, visit h t t p : / / c r e a t i v e c o m m o
n s . o r g / l i c e n s e s / b y - n c - n d / 4 . 0 / .
© e Author(s) 2025
Scientic Reports | (2025) 15:11008 17
| https://doi.org/10.1038/s41598-025-90264-0
www.nature.com/scientificreports/
Content courtesy of Springer Nature, terms of use apply. Rights reserved
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
The purpose of this study was to adapt a Japanese version of the Big Five Inventory-2 (BFI-2-J) to examine its factor structure, reliability, validity, and measurement invariance. The BFI-2-J assesses five domains and 15 facets of the Big Five personality traits. We analyzed two datasets: 487 Japanese undergraduates and 500 Japanese adults. The results of the principal component analysis and confirmatory factor analysis revealed that the domain-facet structure of the BFI-2-J was similar to that of other language versions. The reliability of the BFI-2-J is sufficient. The correlation coefficients between the BFI-2-J and the other Big Five and self-esteem measures supported convergent and discriminant validity. Moreover, we confirmed measurement invariance across age and sex groups in domain-level and facet-level models. The results suggest that the BFI-2-J is a good instrument for measuring the Big Five personality traits and their facets in Japan. The BFI-2-J is expected to be useful in Japanese personality research and international comparative research.
Article
Full-text available
The aim of this study was to realize a French adaptation of the Big Five Inventory-2 (BFI-2), and to further examine the BFI-2’s convergent and discriminant validity via a comparison with the NEO-PI-3 and with the syndromes assessed by the Symptom Checklist (SCL-90-R). Bifactor Exploratory Structural Equation Modeling almost fully supported the BFI-2’s factor structure and measurement model with five major factors, 15 facets, and an acquiescence method factor. All the scales measuring the major factors showed excellent reliability and almost all the scales measuring facets showed acceptable to excellent reliability and satisfactory metric and scalar invariance across gender. The BFI-2 domains and facets were found to be strongly correlated with the scales of the NEO-PI-3 measuring similar constructs. The BFI-2 Negative Emotionality domain and its facets were positively related to most of the SCL-90-R scales, and Extraversion and its facets related negatively with Interpersonal Sensitivity and Depression. In conclusion, data from the French adaptation confirmed the relevance of the BFI-2 hierarchical factor structure, as well as its scales’ reliability and convergent and discriminant validity, which supports and extends the body of knowledge from the original American BFI-2 and its Danish, Dutch, German, Russian, and Slovakian adaptations.
Article
Full-text available
The Big Five Inventory-2 (BFI-2) has received wide recognition since its publication because it strikes a good balance between content coverage and brevity. The current study translated the BFI-2 into Chinese, evaluated its psychometric properties in four diverse Chinese samples (college students, adult employees, adults treated for substance use, and adolescents), and compared its factor structure with those obtained from two U.S. samples. Across two studies, the Chinese BFI-2 demonstrated good reliability (Cronbach’s α and test–retest reliability), structural validity, convergent/discriminant validity, and criterion-related validity at the domain level. At lower levels of analyses, some facets and negatively worded items functioned better among participants with higher than those with lower education levels. Implications, limitations, and future directions are discussed.
Article
Full-text available
The purpose of this tutorial is to discuss and demonstrate how to write safe, effective, and intuitive computer code for Monte Carlo simulation experiments containing one or more simulation factors. Throughout this tutorial the SimDesign package (Chalmers, 2020), available within the R programming environment, will be adopted due to its ability to accommodate a number of desirable execution features. The article begins by discussing a selection of attractive coding strategies that should be present in Monte Carlo simulation experiments, showcases how the SimDesign package can satisfy many of these desirable strategies, and provides a worked mediation analysis simulation example to demonstrate the implementation of these features. To demonstrate how the package can be used for real-world experiments, the simulation explored by Flora and Curran (2004) pertaining to a confirmatory factor analysis robustness study with ordinal response data is also presented and discussed.
Article
Full-text available
Personality can be described at different levels of abstraction. Whereas the Big Five domains are the dominant level of analysis, several researchers have called for more fine-grained approaches, such as facet-level analysis. Personality facets allow more comprehensive descriptions of individual differences, more accurate predictions of outcomes, and a better understanding of the mechanisms underlying trait–outcome relationships. However, several methodological issues plague existing evidence on the added value of facet-level descriptions: Manifest facet scale scores differ with respect to their reliability, domain-level variance (variance that is due to the domain factor) and incremental facet-level variance (variance that is specific to a facet and not shared with the other facets of the same domain). Moreover, manifest scale scores overlap substantially, which affects associations with criterion variables. We suggest a novel structural equation modeling approach that allows domain-level variance to be separated from incremental facet-level variance. We analyzed data from a heterogeneous sample of adults in the United States (N = 1,193) who completed the 60-item Big Five Inventory-2. Results illustrate how the variance of manifest personality items and scale scores can be decomposed into domain-level and incremental facet-level variance. The association with criterion variables (educational attainment, income, health, life satisfaction) further demonstrates the incremental predictive power of personality facets.
Article
Full-text available
Following the publication of the Big Five Inventory- 2 (BFI-2) and its abbreviated forms (the 30-item BFI-2-S and 15-item BFI-2-XS), two studies were conducted to develop and validate a Danish translation of these measures. Study 1 first developed a preliminary Danish BFI-2 item pool consisting of translations of the 60 BFI-2 items, then tested and refined this item pool using two waves of data collection, and identified a set of 60-item formulations for the Danish BFI-2. Study 1 then examined the domain- and facet-level structure of the Danish BFI-2, and the construct validity and reliability of this measure. Study 2 tested the generalizability of the measurement properties of the Danish BFI-2 found in Study 1 as well as the preliminary measurement properties of its abbreviated forms (the Danish BFI-2-S and BFI-2-XS) in a new sample. The results of these studies indicate that the Danish BFI-2 is a reliable and valid personality measure with psychometric properties and construct validity corresponding to the English-language original. The preliminary results regarding measurement properties of the abbreviated forms are encouraging and should inspire further validation.
Article
Full-text available
The article describes the process of adaptation of the Big Five Inventory-2 into the Slovak language and cultural context. The translation process of the Slovak BFI-2 was based on three data samples using item analysis and basic psychometric properties. The present study estimates the psychometric properties of the Slovak BFI-2 and its hierarchical structure using exploratory and confirmatory factor analysis in an independent sample of 526 participants recruited through an online research panel. It also provides data on convergent-discriminant validity in relation to alternative Big Five measures (NEO-FFI, TIPI) and to standard well-being measures. The results showed good internal consistency on the domain level, and somewhat lower on the facet level. Both exploratory and confirmatory factor analyses successfully recovered the conceptual structure of the Slovak BFI-2. The BFI-2 domains and facets showed adequate convergent-discriminant validity, based on the meaningful pattern of correlations with the other Big Five measures and well-being scales. These findings suggest that the Slovak version of the BFI-2 is a reliable and valid measure of the Big Five personality traits, and is appropriate for use in Slovak and cross-cultural research.
Article
Full-text available
Researchers, over the world, often create very brief measures of Big Five personality dimensions, so that they can assess people’s personality in a reasonably short period of time. The most prominent and well-established measure among all brief personality measures is the ‘Ten Item Personality Inventory’ (TIPI). The present study aimed to translate, adapt, and validate the TIPI for use in the Bangladeshi culture. After completing the standardized translation procedure, the Bangla version of the Ten Item Personality Inventory (TIPI–B) was examined in a study including 662 Bangladeshi adults. Though an exploratory factor analysis with one half of the sample (n = 330) had explained 77.53% of the total variance, it did not show the scale’s five dimensions as independent with two items for each. Acceptable goodness of fit indices (χ2/df = 3.177, GFI =.960, CFI = .935, TLI = .937, SRMR = .061, and RMSEA = .76) were found for the scale through a confirmatory factor analysis performed on the second half of the sample (n = 332). Acceptable internal consistencies, significant test-retest reliabilities, and convergent and discriminant validities were established in the scale through different statistical analyses. Thus, the TIPI–B with its five dimensions can be used as a valid and reliable measure to assess the personality of Bangladeshi people.
Article
Full-text available
The Big Five Inventory–2 (BFI-2) is a recently published 60-item questionnaire that measures personality traits within the five-factor model framework. An important aspect of the BFI-2 is that it measures the traits at both the domain and facet levels and also controls acquiescence bias via the balanced number of true- and false-keyed items across the domains and facets. The current research evaluates factorial measurement invariance of a Russian version of the BFI-2 across sex and age within samples of 1,024 university students (Study 1) and 1,029 Internet users (Study 2). Across these samples, men scored lower on the domains of negative emotionality and agreeableness and slightly higher on extraversion. Sex differences were also obtained on various facets. In the Internet sample, age correlated modestly with several Big Five domains in accordance with the well-documented maturity principle. The newly developed Russian version of BFI-2 showed good reliability and validity across both samples. Moreover, random intercept exploratory factor analyses showed that the BFI-2 displayed a hierarchical five-domain-15-facet structure that demonstrated strict measurement invariance across sex and age.
Article
We present two studies testing the validity and nomological properties of the Turkish adaptation of the Big Five Inventory–2 (BFI-2) using a university student sample and a nationally representative community sample of young adults aged 18–35. Findings from the university student sample replicate the psychometric properties of the BFI-2. Findings from the community sample replicate the factor structure and majority of the trait-outcome associations obtained from non-community samples in WEIRD populations. However, there were notable differences in terms of the internal consistency reliabilities of the personality domains, and some trait-outcome associations, specifically with outcomes that are germane to the Turkish culture.