Skip to main content

Psychology: Research and Review

Validity evidence of two short scales measuring the Big Five personality factors


The objective of this study was to obtain evidence of the convergent and factor validity of the Reduced Scale of Big Five Personality Factors (ER5FP), with 20 items, and of the Reduced Inventory of Big Five Personality Factors (IGFP-5R), with 32 items. The two Brazilian scales were administered to 554 participants aged 16–69 years (M = 30.6, SD = 8.6). The measurement model of each instrument was tested using confirmatory factor analysis. Both scales showed an adequate adjustment of the measurement model to the data (root mean square error of approximation < .06; standardized root mean square residual < .06) after excluding a number of items. Correlations between the factors of both instruments were estimated. Moderate evidence of convergent validity was found for Extraversion, Neuroticism, and Openness to Experience (raw correlations ranging from .44 and .57, and disattenuated correlations from .60 to .80). For Agreeableness and Conscientiousness, weaker evidence was found (raw correlations of .33 and .29, and disattenuated correlations of .48 and .43, respectively).


The five-factor model (FFM), also known as Big Five, is currently one of the most influential and investigated models used in the personality research field (De Raad & Mlacic, 2015; McCrae, 2011). In Brazil, although the number of studies published using the FFM as reference is relatively small in comparison to other countries (Silva & Nakano, 2011; Passos & Laros, 2014), it can be stated that the development of instruments based on this model is increasing. Among the psychological instruments approved by the Federal Council of Psychology (Conselho Federal de Psicologia, 2017) and available on the Brazilian market, we can cite the NEO Personality Inventory - Revised (NEO-PI-R (Flores-Mendoza, 2007)), the Factorial Battery of Personality (Bateria Fatorial de Personalidade (BFP) in the original (Nunes, Hutz, & Nunes, 2010)), the Factorial Scale of Neuroticism (Escala Fatorial de Neuroticismo, in the original (Hutz & Nunes, 2001)), the Factorial Extraversion Scale (Escala Fatorial de Extroversão, in the original (Nunes & Hutz, 2007a)), and the Factorial Scale of Socialization (Escala Fatorial de Socialização (Nunes & Hutz, 2007b)). In addition to these, there are a number of instruments under development, either for research purposes or for future commercialization (e.g., Andrade, 2008; Carvalho, Nunes, Primi, & Nunes, 2012; Hauck Filho, Machado, Teixeira, & Bandeira, 2012a,b; Gomes & Golino, 2012; Hutz et al., 1998; Natividade & Hutz, 2015; Passos & Laros, 2015; Primi, Santos, John, & De Fruyt, 2016; Vasconcelos, 2005; Vasconcellos & Hutz, 2008).

Since the instruments for personality assessment tend to be extensive, usually consisting of more than 100 items, one of the research objectives in this area is the development of reduced scales (Carvalho et al., 2012; Hauck Filho et al., 2012a, b; Natividade & Hutz, 2015; Passos & Laros, 2015). The literature shows evidence that a large number of items in an instrument can be a source of measurement error. For example, very long instruments can induce discouragement, fatigue, and inattention in the participants. Furthermore, a large number of items may impede the joint administration of two or more instruments. Therefore, scales with relatively few items can potentially minimize such problems and be useful in different testing settings (e.g., surveys and screenings), provided that they have acceptable reliability indices and satisfactory evidence of validity.

One possible disadvantage of using reduced scales concerns the relatively high amount of measurement errors due to the fact that the number of items of a scale is negatively related to the amount of measurement errors. Another drawback of using reduced scales is related to the possible occurrence of increasing type I and II errors (Credé, Harms, Niehorster, & Gaye-Valentine, 2012). However, the literature already presented evidence that reduced instruments can be a reliable alternative for personality assessment (Donnellan, Oswald, Baird, & Lucas, 2006; Laverdière, Morin, & St-Hilaire, 2013; Rammstedt, 2007). With this motivation, some reduced scales for personality assessment based on the FFM have been proposed in recent years in Brazil.

Andrade (2008), for example, developed the Reduced Inventory of the Big Five Personality Factors (Inventário Reduzido dos Cinco Grandes Fatores de Personalidade (IGFP-5R) in the original), based on the translation and adaptation for Brazil of the Spanish version of the Big Five Inventory (Benet-Martinez & John, 1998). Originally composed of 44 items, the inventory was administered to a sample of 5247 subjects from all Brazilian geographic regions. Confirmatory factor analysis demonstrated the adequacy of a model with 32 items and five correlated factors with reliability coefficients ranging from .66 to .76. By analyzing the items using the item response theory (IRT), the author certified that the instrument had adequate psychometric parameters, even considering that the items did not cover the entire latent trait continuum. Among other conclusions, Andrade (2008) pointed out the need to carry out studies investigating the evidence of the convergent validity between the IGFP-5R and other FFM-based instruments, such as NEO-PI-R and NEO-FFI-R.

Hauck Filho et al. (2012a, b) proposed the reduced marker scale (Escala de Marcadores Reduzidos (EMR) in the original), based on adjective markers for personality assessment identified by Hutz et al. (1998). Analyzing data from a sample of 674 university students, a five-factor structure was found. Each factor was composed of five items, with reliability coefficients of factor scores ranging from .61 to .80. In another study with the EMR on a sample of 208 adolescents Hauck Filho et al. (2012a, b), factor analysis also indicated a five-factor structure with each factor containing four items. The reliability coefficients of the factor scores ranged from .55 to .80. Subsequently, Machado, Hauck Filho, Teixeira, and Bandeira (2014) analyzed the responses of 887 university students to the EMR. Factor analysis pointed to a structure with five oblique factors, and IRT analysis indicated a good adjustment of the measurement model, although with a concentration of the items in a restricted range of the latent trait continuum, as occurred in the study conducted by Andrade (2008). Finally, Pariz, Haddad, and Machado (2016) gathered evidence of convergent and criterion validity regarding the EMR. The results indicated statistically significant correlations between the EMR and the BFP (Nunes et al., 2010), with the exception of the Socialization (i.e., Agreeableness) and the Openness to Experience factors.

Another example is the study conducted by Carvalho et al. (2012) regarding the translation and adaptation to Brazil of the Ten-Item Personality Inventory (TIPI), developed by Gosling, Rentfrow, and Swann (2003). According to Carvalho et al., TIPI is one of the most cited reduced scales based on the FFM. The results of data analysis of a sample of 404 Brazilian adolescents indicated a structure of three factors, but with low reliability indices ranging from .41 to .63. Among other notes, the authors indicated that before discarding the use of the TIPI in the Brazilian cultural context, it is necessary to gather evidence of validity based on external variables, including other instruments constructed based on the FFM.

Natividade and Hutz (2015) elaborated the reduced scale of personality descriptors (Escala Reduzida de Descritores de Personalidade (Red5) in the original), composed of 20 items. In one study, analyses of a sample of 1889 adults resulted in an orthogonal structure with five principal components with reliability coefficients ranging from .59 to .84. In a second study, with a sample of 512 adults, confirmatory factor analysis corroborated the structure obtained in the first study. There was also evidence of convergent validity between the Red5 and the BFP (Nunes et al., 2010).

Finally, Passos and Laros (2015) proposed a semantic differential scale with 47 items, called Reduced Scale of Big Five Personality Factors (Escala Reduzida de Cinco Grandes Fatores de Personalidade (ER5FP) in the original). Exploratory and confirmatory factor analyses with data from a sample of 365 undergraduate students indicated a five-factor structure with 20 items. The reliability coefficients ranged from .71 to .85. The authors concluded that the scale was adequate as a measure of the five factors, considering the sample analyzed. As a research agenda, the authors highlighted the objective of increasing and diversifying the sample (e.g., the inclusion of participants with other levels of education and from all Brazilian geographic regions) and carrying out convergent and criterion validity studies.

In order to contribute to the development of reduced instruments for the assessment of personality based on the FFM and given continuity to the studies of Passos and Laros (2015) and Andrade (2008), the present study has as overall objective obtaining evidence of the convergent validity between the ER5FP and the IGFP-5R. Moreover, this study aimed to test the factorial validity of the two instruments, analyzing the theoretically expected measurement model for the two scales (i.e., five correlated factors). Finally, this study also had the objective of analyzing the role of covariates in the description of the factors that compose the scales.

The covariates gender, age, and level of education were used to carry out comparisons between groups. The covariant sex was chosen based on studies that indicate that there are differences between men and women regarding personality traits (Schmitt et al., 2017; Weisberg, DeYoung, & Hirsh, 2011). Schmitt, Realo, Voracek, and Allik (2008) concluded in their study with data from 55 nations that in more prosperous and egalitarian societies, the personality traits of men and women tend to be less similar. In general, previous findings indicate that women have higher scores on Extraversion, Agreeableness, and Neuroticism (Weisberg et al. 2011). The effect of age on personality traits has also been investigated in several studies (Donnellan & Lucas, 2008; Rammstedt, 2007; Srivastava, John, Gosling, & Potter, 2003). In a study with a reduced version of the Big Five Inventory, the BFI-10, Rammstedt (2007) concluded that younger participants tended to present higher scores on extraversion, while the older ones had the highest scores on Agreeableness and Conscientiousness. Regarding the effect of level of education, the findings in the studies of Rammstedt (2007) indicated that people with higher levels of education tend to present higher scores on Openness to Experience. In addition to the covariates cited above, we also analyzed variables informing marital status (see Boyce, Wood, & Ferguson, 2016) and whether the participant is a parent (see Prinzie, Stams, Deković, Reijntjes, & Belsky, 2009) in order to verify whether there is any association of these two variables with the personality traits assessed by both instruments.



The sample of this study is composed of 554 subjects (Table 1), mostly female (58.9%) and with a mean age of 30.6 years (SD = 8.61). Regarding the level of education, 1.5% reported having started or finished the primary education level, 29.7% the upper secondary, 51.4% the first stage of tertiary education, and 23.3% graduate school (i.e., lato sensu specialization, master or doctoral degrees). The data were collected in the Distrito Federal and in the Metropolitan Region of Salvador, Bahia. An item of the questionnaire asking about the region of origin of the respondents revealed that 47.5% of the participants were originally from the Center-West region, 39.2% from the Northeast, and 13.3% from other geographical regions of the country.

Table 1 Sociodemographic information of the participants (N = 554)


Reduced Scale of Big Five Personality Factors (ER5FP)

The ER5FP (Passos & Laros, 2015) is a semantic differential rating scale composed of 20 pairs of bipolar adjectives, with a 7-point Likert type response scale. In the study of Passos and Laros, satisfactory reliability coefficients of the factor scores were found (Extraversion = .85, Conscientiousness = .78, Agreeableness = .81, Neuroticism = .80, and Openness to Experience = .71).

Reduced Inventory of the Big Five Personality Factors (IGFP-5R)

The IGFP-5R is an instrument based on the Big Five Inventory (BFI) that was submitted to a validation process for the Brazilian context by Andrade (2008). The IGFP-5R is composed of 32 items, with a 5-point Likert type response scale. The reliability coefficients of the factor scores found in the Andrade study were considered acceptable (Extraversion = .76, Conscientiousness = .66, Agreeableness = 0.74, Neuroticism = 0.75, and Openness to Experience = .68).


Data collection

The researchers followed the guidelines of the Resolution 510/2016 of the Brazilian National Health Council (Conselho Nacional de Saúde) regarding scientific research involving human beings. One of the authors recruited the participants and collected the data in one public and two private educational institutions. Prior to the basic instructions, the participants were informed about the research objectives and the confidentiality concerning their identity and signed the Free and Informed Consent Term. The participants individually answered a form, with completion time ranging from 10 to 30 min. The form was composed by the socioeconomic questionnaire, the ER5FP, and the IGFP-5R in this order.

Data analysis

The data analysis occurred in four stages. The first stage involved cleaning the database and performing exploratory analyses. Participants who did not respond to one of the instruments or did not report sociodemographic information were withdrawn, as well as those considered outliers based on the Mahalanobis distance. As a result, 9.2% of the cases were excluded, leaving 554 valid cases remaining. Then, the missing data were imputed using the linear trend at point technique of SPSS 18.0. After the exploratory analyses, the assumptions of confirmatory factor analysis (CFA) for the data were checked. First, the assumption of univariate normality was evaluated through the inspection of the asymmetry and kurtosis of the items. An item was considered non-normally distributed if its asymmetry was greater than 1.0 and its kurtosis was superior to 2.0 (Miles & Shevlin, 2001; Osborne, 2002). None of the items of the two personality instruments showed univariate non-normality. Afterwards, the multivariate normality of the items was verified using the multivariate kurtosis coefficient of Mardia (Byrne, 2016).

In the second stage, the CFA of the two instruments was conducted through structural equation modeling with maximum likelihood estimation. The measurement model defined for the two scales aimed the identification of five correlated factors: Extraversion, Conscientiousness, Agreeableness, Neuroticism, and Openness to Experience. Based on the modification indices, changes were made in the initial models of the two scales. Modification indices indicating a substantial impact on the improvement of fit suggested including a correlation between the error terms of several items related to the same factor. Byrne (2016) states that the suggestion of including a correlation between the error terms of two items in the model reveals, in most cases, that the items are very similar in terms of content. In these cases, the author recommends to exclude one of the two items, since the second item does not add information. Thus, the item with the lowest factor loading was excluded of the item pairs for which the modification index suggested correlation between the errors. In the CFA analyses, the recommendations of Byrne (2016), Kline (2016), and Weston, Gore, Chan, and Catalano (2008) were followed.

In the third stage, Guttman’s coefficient lambda 2 (λ2) was calculated to evaluate the internal consistency of the scores of each factor. This coefficient was adopted because it is more appropriate in studies using instruments with a reduced number of items for each latent component (Sijtsma, 2012). In sequence, the raw convergent validity coefficients and the ones corrected for attenuation were calculated. Pearson’s correlation was used to obtain evidence of the convergent validity of the two personality instruments. To correct the raw coefficients for the effect of measurement errors, we used the dual correction formula for attenuation (Osborne, 2013), which is as follows: \( {r}_{x^{\prime }{y}^{\prime }}=\raisebox{1ex}{${r}_{xy}$}\!\left/ \!\raisebox{-1ex}{$\sqrt{r_{xx}{r}_{yy}}$}\right. \). In this formula, \( {r}_{x^{\prime }{y}^{\prime }} \) is the validity coefficient corrected for the measurement error in both the test and the criterion, rxy is the observed correlation, rxx is the reliability of the test, and ryy is the reliability of the criterion.

The last stage was the analysis of differences between groups on the five-factor scores of each instrument. The groups were selected based on the following covariates: gender (male vs. female), age (less than 30 years old vs. more than 30), being a parent, marital status, and level of education. The statistical package SPPS 18.0 was used in the exploratory and descriptive analyses and for testing the differences between means. The SPSS Amos 18.0 package was employed for the verification of the univariate and multivariate normality, for the design of the factor models and for the CFA.


Model fitting

As can be seen in Table 2, the initial ER5FP model has a root mean square error of approximation (RMSEA) value greater than .06 and a standardized root mean square residual (SRMR) value greater than .08, which is not considered a good fit. According to the recommendations of Weston et al. (2008), the minimum criteria for assuring the quality of a model fit are RMSEA ≤ .06 and SRMR ≤ .08. The Comparative Fix Index (CFI) and Goodness of Fit Index (GFI) values of the initial ER5FP model also indicated an unsatisfactory fit since they fell below the .95 criterion. To improve the quality of model fit, alterations in the model were made based on the modification indices suggested by the Amos package. Following the procedure described in the “Method” section, five of the 20 items were excluded from the initial ER5FP measurement model. The fit of the final model with 15 items was satisfactory, with RMSEA = .059, SRMR = .054, and CFI and GFI around .95. Thus, all adjustment indices indicated that the final ER5FP measurement model is appropriate for the data. However, it is important to highlight that the modifications made on the initial model are indicatives that the original structure with 20 items failed to replicate in the sample analyzed in this study.

Table 2 Fit indices of the initial and final measurement models of the scales

Similarly, the original structure with 32 items of the IGFP-5R failed to replicate in this study. The original measurement model of the IGFP-5R was also not satisfactory, with RMSEA and SRMR values above the criterion value and with CFI and GFI values below the criterion value (Table 2). Changes were made based on the modification indices to obtain a good fit. Sixteen of the 32 items of the original measurement model were excluded. The fit of the final measurement model was good, with RMSEA = .038, SRMR = .043, CFI = .95, and GFI = .96.

Reliability coefficients, factor loadings, and correlations between factors

Table 3 displays the factor loadings, the communalities, the correlations between the factors, the item-rest (rir) correlations, the mean correlation between the items, and the reliability coefficients of the scores on the factors regarding the ER5FP. It is noteworthy that, even with a smaller number of items, the reliability of the factor scores is reasonable, except for the last factor, Openness to Experience, which showed a reliability coefficient of .58. The reliability coefficients of the other factors ranged between .67 and .79, with the greater value referent to the Extraversion factor, which also presented the highest mean correlation among its items (r = .56).

Table 3 ER5FP: reliability coefficients (Guttman’s λ2), factor loadings (FL), communalities (h2), item-rest correlations (rir), and correlations between factors (N = 554)

Regarding the raw correlations between the factors, the highest positive value was observed between Openness to Experience and Conscientiousness (r = .33), while the highest negative values were observed between Neuroticism and Conscientiousness (r = −.14) and between Neuroticism and Agreeableness (r = −.13). The observed values of the correlations between the five factors have similarities with the results of the study conducted by Herzberg and Brähler (2006) on the evaluation of reduced scales. Analyzing the 16-Adjective Measure, the authors also found negative correlations between Neuroticism and Conscientiousness (r = −.09) and positive correlations between Openness and Agreeableness (r = .23).

Table 4 displays the same information for the IGFP-5R. The reliability coefficients of the factors are reasonable, ranging from .65 to .72. The correlations between the items of each factor ranged from .34 to .46, with the highest value related to the Neuroticism factor. Concerning the raw correlations between the factors, the highest positive coefficient was observed between Agreeableness and Openness (r = .15), while the highest negative coefficient was observed between Neuroticism and Conscientiousness (r = −.22). This information is consonant with the results regarding the ER5FP and the findings of Herzberg and Brähler (2006).

Table 4 IGFP-5R: reliability coefficients (Guttman’s λ2), factor loadings (FL), communalities (h2), item-rest correlations (rir), and correlations between factors (N = 554)

Table 5 shows the mean, standard deviation, and variance of the scales. For the ER5FP, a comparison was made between the means found in the construction study (Passos & Laros, 2015) and those found in this validity study. For the IGFP-5R, data is presented only for this study. Regarding the distribution of the factor scores, the Agreeableness factor showed the highest mean both for the IGFP-5R (M = 4.48; SD = .61) and the ER5FP (M = 5.71; SD = 1.05). In comparison with the ER5FP development study, the variance of factor scores in this study is lower, especially for Openness and Extraversion (28% and 18% less variance, respectively).

Table 5 Statistics of the factor scores of the scales

Testing differences between groups

Differences on the five-factor scores of the ER5FP and the IGFP-5R were analyzed between groups of participants based on the following covariates: sex (men vs. women), age (less than 30 years old vs. more than 30), being a parent or not, marital status (single vs. married), and level of education. Concerning the comparisons of means using the ER5FP data, with regard to sex, it was observed that women (M = 3.96) tended to present higher levels of Neuroticism than men (M = 3.59), p < .01. When evaluating the covariate age, older people (M1) present higher scores than the younger ones (M0) on Openness (M1 = 5.52, M0 = 5.27, p < .05) and Agreeableness (M1 = 5.79, M0 = 5.60, p < .05). As for being a parent (M1) or not (M0), the results indicated that the participants who had children tended to be more extraverted (M1 = 4.70, M0 = 4.42, p < .05), agreeable (M1 = 5.85, M0 = 5.59, p < .01), and open to new experiences (M1 = 5.52, M0 = 5.30, p < .01). Finally, although the difference was not large, married participants (M1) had higher scores than the single ones (M0) on Agreeableness (M1 = 5.82, M0 = 5.60, p < .01). Using the ER5FP data, no significant differences were found for the level of education, considering the various levels.

The comparisons between groups using the factor scores of the IGFP-5R produced the following results. There were higher rates of Neuroticism among women (M = 2.90) than among men (M = 2.51), p < .01. Although the differences were not large, women (M = 4.53) also presented higher scores on the Agreeableness factor than men (M = 4.41), p < .05. The results also revealed that males (M = 4.00) tended to be more open to new experiences than females (M = 3.82), p < .05. Concerning age differences, older people (M1) tended to show higher indices than the younger ones (M0) on Conscientiousness (M1 = 3.48, M0 = 3.25, p < .01) and Agreeableness (M1 = 4.54, M0 = 4.42, p < .05), and lower on Neuroticism (M1 = 2.63, M0 = 2.85, p < .05). When assessing differences between groups based on the covariate being a parent, it was observed that the participants who had children (M1) presented higher averages on the Agreeableness factor (M1 = 4.56, M0 = 4.42, p < .05). Regarding the other covariates, marital status, and level of education, the averages found for the groups did not reveal statistically significant differences.

Convergent validity evidences

The correlation coefficients corrected for attenuation between the factors of the ER5FP and the factors of the IGFP5-R are shown in the diagonal of Table 6, together with the raw coefficients, which are presented between parentheses. The corrected correlation coefficients can be interpreted as evidence of convergent validity between the instruments. The lower and upper triangles represent evidence of discriminant validity. The corrected coefficients ranged from .43 to .80, while the raw coefficients showed a much lower magnitude, ranging from .29 to .57.

Table 6 Raw and corrected for attenuation correlation coefficients between factors of the ER5FP and IGFP-5R scales (N = 554)

Considering the validity coefficients corrected for attenuation, Extraversion (r = .80), Openness (r = .74), and Neuroticism (r = .60) showed satisfactory evidences of convergent validity, while Agreeableness (r = .48) and Conscientiousness (r = .43) were lower than expected. The correlations between different factors also merit attention, such as between Openness to Experience (IGFP-5R) and Conscientiousness (ER5FP), r = .42, and between Openness to Experience (ER5FP) and Agreeableness (IGFP-5R), r = .39. The magnitude of both corrected and raw coefficients are lower than found in similar studies (cf. Hahn, Gottschling, & Spinath, 2012).

Discussion and conclusions

After the modifications of the original models (Andrade, 2008; Passos & Laros, 2015), both scales presented adequate psychometric quality in this study, as shown by a good model fit and adequate reliability coefficients (λ2). For the ER5FP, with exception of Openness to Experience (.58), the factors showed adequate reliability coefficients ranging from .67 (Conscientiousness) to .79 (Extraversion). For the IGFP-5R, the reliability coefficients varied from .65 (Extraversion) to .72 (Neuroticism). The factors Extraversion and Neuroticism in both instruments presented the highest reliability coefficients and inter-item correlations. This fact may be due to the quality and clarity of the definition of the factors in question, which may have allowed greater coverage of the content of each factor (Frazier, Naugle, & Harggety, 2006). According to cross-cultural studies, Neuroticism and Openness to Experience failed to replicate across different languages (De Raad et al., 2010), which might explain the low reliability of Openness to Experience in the ER5FP.

According to Byrne (2016), after making modifications of the original model, the researcher stops working in a confirmatory mode and starts working in an exploratory fashion. That is the case of this study, once it was necessary to change the models based on modification indices to improve their fit. These changes resulted in the deletion of five items of the original 20 items of the ER5FP and 16 of the original 32 items of the IGFP-5R. Although the five theoretical factors from the FFM model were recovered from the analysis, the loss of items is an indication that the original structure of both instruments failed to replicate in the sample analyzed in this study. Although this data-driven strategy was efficient to improve the fit of the models, it does have limitations. For instance, further studies should contemplate the investigation of the confirmatory validity of the models in independent samples to assess the generalizability of both the original models (Andrade, 2008; Passos & Laros, 2015) and the models identified in this study.

Another related concern is with the loss of items and the content validity of the scales. As other reduced scales, ER5FP and IGFP-5R have a limited coverage of the constructs when compared with long scales. The loss of items as a result of the analyses could potentially reduce this coverage. However, the content of the factors of both ER5FP and IGFP-5R preserved some of the core aspects of the five-factor model—as described in The 100 Revised Synonym Clusters list (Goldberg, 1992; Goldberg & Rosolack, 1994). Concerning the ER5FP, the Extraversion factor contains items related to Gregariousness, Shyness, and Reserve (items 1 and 6) and Expressiveness and Silence (item 5). The Conscientiousness items are associated with Persistence (e.g., item 2) and Decisiveness, Indecisiveness, and Inconsistency (items 10 and 12). The items of the Agreeableness factor are linked to Empathy and Overcriticalness (item 4) and Amiability, Callousness and Rudeness (items 7 and 14). The Neuroticism items are related to Placidity and Fear (items 3 and 18). The Openness items are associated to Creativity and Unimaginativeness (item 8) and Nonconformity (item 15).

In case of the IGFP-5R, the Extraversion items are linked to Reserve (item 12), Shyness and Inhibition (item 16), and Silence (item 42). The items of the Conscientiousness factor are associated to Negligence (item 17), Sloth (item 19), Forgetfulness (Item 22), and Disorganization (item 38). The Agreeableness items are related to Amiability (items 15 and 18) and Cooperation (item 8). The Neuroticism items are linked to Instability (item 10) and Fear (items 34 and 36). Finally, the items on the Openness factor are associated with Creativity (items 9 and 11) and Intellectuality (item 39).

Regarding the convergent validity between the two scales, the correlation coefficients corrected for attenuation showed satisfactory evidence of validity for Extraversion (r = .80), Openness (r = .74), and Neuroticism (r = .60), but not for Agreeableness (r = .48) and Conscientiousness (r = .43). The magnitude of the raw correlation was even lower. We raise two hypotheses to explain these results. First is the format of the items. While the ER5FP contains bipolar items (e.g., communicative-silent, extraverted-timid, sociable-reserved in Extraversion), the items of the IGFP-5R are unipolar and tend to be emphasize only negative or positive traits in each factor (e.g., reserved, timid, inhibited, quiet, silent in the Extraversion factor). Second is the content of the factors. As one can see in the two previous paragraphs, there are somewhat different emphases in the content of each of the factors. This is also a consequence of the deletion of items in the analyses.

The means of the ER5FP factor scores identified in the development study and in the current study resemble the findings of Donnellan et al. (2006). The data for the Conscientiousness and Openness factors approximate the results presented by the authors regarding the TIPI (Ten-Item Personality Inventory) data, which also have a response scale of 7 points.

Similarities were found comparing differences between groups for the data obtained with the two instruments. On both scales, women presented higher average scores on Neuroticism. Parents obtained higher scores on Agreeableness with both instruments. Concerning the covariate age, the results revealed significant differences for Agreeableness on the two scales, indicating that older people tend to be more agreeable. Regarding marital status, the only difference was on the factor Agreeableness of the ER5FP data, with married participants scoring higher. No differences were found for the level of education. This last result could be related to a homogeneity regarding the educational background in the sample analyzed in this study.

However, exploring the theoretical implications of the observed differences is beyond the scope of this study. Nevertheless, the interested reader can find a number of studies relating the Big Five to a number of covariates such as age (Donnellan & Lucas, 2008; Terraciano, McCrae, Brant, & Costa Jr., 2005), marital status (Boyce et al., 2016), parenthood (Prinzie et al., 2009), and gender (Schmitt et al., 2017; Weisberg et al., 2011).

The sample has other limitations. Because it was carried out basically in the Distrito Federal and in the Metropolitan Region of Salvador, Bahia, the study did not present a representative sample of the entire Brazilian population. In addition, since data collection was performed in educational institutions, there was little variability in the response pattern, which is not the most recommended in studies that seek to evaluate the psychometric quality of instruments. Future studies will benefit if carried out with representative samples, so as to cover the entire national territory.

Investigating convergent validity is an essential step in the development of psychometric instruments and theoretical models. Thus, this study contributes in advancing the research with the FFM, as well as in developing measurement instruments. In addition to investigating the psychometric properties of the instruments, this study may also encourage further research on the relationship between personality and issues related to social context. Finally, the results of this study and those from the works of Passos and Laros (2015) and Andrade (2008) provided preliminary evidence that ER5FP and IGFP-5R are moderately good alternatives for use in future research involving personality assessment within the theoretical framework of the five-factor model.





Big Five Inventory


Bateria Fatorial da Personalidade


Comparative Fix Index




Degrees of freedom


Escala de Marcadores Reduzidos


Reduced Scale of Big Five Personality Factors




Five-factor model


Goodness of Fit Index


Reduced Inventory of Big Five Personality Factors




NEO Personality Inventory - Revised


Openness to Experience


Escala Reduzida de Descritores de Personalidade


Root mean square error of approximation


Standardized Root Mean Square Residual


Ten-Item Personality Inventory


Guttman’s coefficient lambda 2

χ 2 :

Chi square


  • Andrade, J. M. (2008). Evidências de validade do inventário dos cinco grandes fatores de personalidade para o Brasil. Tese de Doutorado. Brasília: Universidade de Brasília.

    Google Scholar 

  • Benet-Martinez, V., & John, O. P. (1998). Los cinco grandes across cultures and ethnic groups: multitrait multimethod analyses of the big five in Spanish and English. Journal of Personality and Social Psychology, 75(3), 729–750.

    Article  Google Scholar 

  • Boyce, C. J., Wood, A. M., & Ferguson, E. (2016). For better or for worse: the moderating effects of personality on the marriage-life satisfaction link. Personality and Individual Differences, 97, 61–66.

    Article  Google Scholar 

  • Byrne, B. M. (2016). Structural equation modeling with AMOS: basic concepts, applications and programming, (3rd ed., ). New York: Routledge.

    Google Scholar 

  • Carvalho, L. d. F., Nunes, M. F. O., Primi, R., & Nunes, C. H. S. d. S. (2012). Evidências desfavoráveis para avaliação da personalidade com um instrumento de 10 itens. Paidéia, 22, 63–71.

    Article  Google Scholar 

  • Conselho Federal de Psicologia. (2017). SATEPSI - Sistema de Avaliação de Testes Psicológicos. ReIRTeved from

  • Credé, M., Harms, P., Niehorster, S., & Gaye-Valentine, A. (2012). An evaluation of the consequences of using short measures of the Big Five personality traits. Journal of Personality and Social Psychology, 102, 874–888

    Article  Google Scholar 

  • De Raad, B., Barelds, D. P., Levert, E., Ostendorf, F., Mlacić, B., Di Blas, L., … Katigbak, M. S. (2010). Only three factors of personality description are fully replicable across languages: A comparison of 14 trait taxonomies. Journal of Personality and Social Psychology, 98(1), 160–173.

    Article  Google Scholar 

  • De Raad, B., & Mlacic, B. (2015). The lexical foundation of the big five factor model. In T. Widiger (Ed.), The Oxford handbook of the five factor model.

    Chapter  Google Scholar 

  • Donnellan, M. B., & Lucas, R. E. (2008). Age differences in the Big Five across the life span: evidence from two national samples. Psychology and Aging, 23, 558–562.

    Article  Google Scholar 

  • Donnelan, M. B., Oswald, F. L., Baird, B. M., & Lucas, R. E. (2006). The mini-IPIP scales: tiny-yet-effective measures of the Big Five factors of personality. Psychological Assessment, 18(2), 192–203.

    Article  Google Scholar 

  • Frazier, T. W., Naugle, R. I., & Haggerty, K. A. (2006). Psychometric adequacy and comparability of the short and full forms of the Personality Assessment Inventory. Psychological Assessment, 18(3), 324–333.

    Article  Google Scholar 

  • Flores-Mendoza, C. (2007). Manual do NEO-PI-R e NEO-FFI. São Paulo: Vetor Editora.

    Google Scholar 

  • Goldberg, L. R. (1992). The development of markers for the Big Five factor structure. Psychological Assessment, 4, 26–42.

    Article  Google Scholar 

  • Goldberg, L. R., & Rosolack, T. K. (1994). The Big Five Factor Structure as an integrative framework: an empirical comparison with Eysenck’s P-E-N Model. In C. F. Halverson Jr., G. A. Kohnstamn, & R. P. Martin (Eds.), The developing structure of temperament and personality from infancy to adulthood, (pp. 7–35). New York: Erlbaum.

    Google Scholar 

  • Gomes, C. M. A., & Golino, H. F. (2012). Relações hierárquicas entre os traços amplos do Big Five. Psicologia: Reflexão e Crítica, 25(3), 445–456.

    Google Scholar 

  • Gosling, S. D., Rentfrow, P. J., & Swann, W. B. (2003). A very brief measure of the Big-Five personality domains. Journal of Research in Personality, 37, 504–528.

    Article  Google Scholar 

  • Hahn, E., Gottschling, J., & Spinath, F. M. (2012). Short measurements of personality - validity and reliability of the GSOEP Big Five Inventory (BFI-S). Journal of Research in Personality, 46(3), 355–359.

    Article  Google Scholar 

  • Hauck Filho, N., Machado, W. L., Teixeira, M. A. P., & Bandeira, D. R. (2012a). Evidências de validade de marcadores reduzidos para a avaliação da personalidade no modelo dos cinco grandes fatores. Psicologia: Teoria e Pesquisa, 28, 417–423.

    Google Scholar 

  • Hauck Filho, N., Teixeira, M. A. P., Machado, W. L., & Bandeira, D. R. (2012b). Marcadores reduzidos para a avaliação da personalidade em adolescentes. Psico-USF, 17(2), 256–261.

    Article  Google Scholar 

  • Herzberg, P. Y., & Brähler, E. (2006). Assessing the Big-Five personality domains via short forms: a cautionary note and a proposal. European Journal of Psychological Assessment, 22(3), 139–148.

    Article  Google Scholar 

  • Hutz, C. S., Nunes, C. H., Silveira, A. D., Serra, J., Anton, M., & Wieczorek, L. S. (1998). O desenvolvimento de marcadores para a avaliação da personalidade no modelo dos cinco grandes fatores. Psicologia: Reflexão e Crítica, 11(2), 395–411.

    Article  Google Scholar 

  • Hutz, C. S., & Nunes, C. H. S. (2001). Escala Fatorial de Neuroticismo. São Paulo: Casa do Psicólogo.

    Google Scholar 

  • Kline, R. B. (2016). Principles and practices of Structural Equation Modeling (4th ed.). London: The Guilford Press.

    Google Scholar 

  • Laverdière, O., Morin, A. J. S., & St-Hilaire, F. (2013). Factor structure and measurement invariance of a short measure of the Big Five personality traits. Personality and Individual Differences, 55(7), 739–743.

    Article  Google Scholar 

  • Machado, W. L., Hauck Filho, N., Teixeira, M. A. P., & Bandeira, D. R. (2014). Análise de Teoria de Resposta ao Item de marcadores reduzidos da personalidade. Psico, 45(4), 551–558.

    Article  Google Scholar 

  • McCrae, R. R. (2011). Personality theories for the 21st century. Teaching of Psychology, 38(3), 209–214.

    Article  Google Scholar 

  • Miles, J., & Shevlin, M. (2001). Applying regression & correlation. A guide for students and researchers. London: Sage Publications.

    Google Scholar 

  • Natividade, J. C., & Hutz, C. S. (2015). Escala reduzida de descritores dos cinco grandes fatores de personalidade: prós e contras. Psico, 46(1), 79–89.

    Article  Google Scholar 

  • Nunes, C. H. S., & Hutz, C. S. (2007a). Escala Fatorial de Extroversão. São Paulo: Casa do Psicólogo.

    Google Scholar 

  • Nunes, C. H. S., & Hutz, C. S. (2007b). Escala Fatorial de Socialização. São Paulo: Casa do Psicólogo.

    Google Scholar 

  • Nunes, C. H. S., Hutz, C. S., & Nunes, M. (2010). Bateria Fatorial de Personalidade (BFP): Manual Técnico. São Paulo: Casa do Psicólogo.

    Google Scholar 

  • Osborne, J. W. (2002). Notes on the use of data transformations. Practical Assessment, Research & Evaluation, 8(6), 1–8.

    Google Scholar 

  • Osborne, J. W. (2013). Best practices in data cleaning: a complete guide to everything you need to do before and after collecting your data. Thousand Oaks: Sage.

    Book  Google Scholar 

  • Pariz, J., Haddad, E., & Machado, W. L. (2016). Convergent and criterion-based validity for a brief scale of the five-factor model. Avaliação Psicológica, 15(3), 346–351.

    Article  Google Scholar 

  • Passos, M. F., & Laros, J. A. (2014). O modelo dos cinco grandes fatores: revisão de literatura. Peritia – Revista Portuguesa de Psicologia, 21, 13–21.

    Google Scholar 

  • Passos, M. F. D., & Laros, J. A. (2015). Construção de uma escala reduzida de Cinco Grandes Fatores de personalidade. Avaliação Psicológica, 14(1), 115–123.

    Article  Google Scholar 

  • Primi, R., Santos, D., John, O. P., & De Fruyt, F. (2016). The development of a nationwide inventory assessing social and emotional skills in Brazilian youth. European Journal of Psychological Assessment, 32, 5–16.

    Article  Google Scholar 

  • Prinzie, P., Stams, G. J., Deković, M., Reijntjes, A. H., & Belsky, J. (2009). The relations between parents’ Big Five personality factors and parenting: a meta-analytic review. Journal of Personality and Social Psychology, 97(2), 351–362.

    Article  PubMed  Google Scholar 

  • Rammstedt, B. (2007). The 10-Item Big Five Inventory. European Journal of Psychological Assessment, 23(3), 193–201.

    Article  Google Scholar 

  • Schmitt, D., Realo, A., Voracek, M., & Allik, J. (2008). Why can’t man be more like a woman? Sex differences in big five personality traits across 55 cultures. Journal of Personality and Social Psychology, 94, 168–172.

    Article  Google Scholar 

  • Schmitt, D. P., Long, A. E., McPhearson, A., O’Brien, K., Remmert, B., & Shah, S. H. (2017). Personality and gender differences in global perspective. International Journal of Psychology, 52(1), 45–56.

    Article  PubMed  Google Scholar 

  • Sijtsma, K. (2012). Future of psychometrics: ask what psychometrics can do for psychology. Psychometrika, 77, 4–20.

    Article  Google Scholar 

  • Silva, I. B., & Nakano, T. C. (2011). Modelo dos cinco grandes fatores da personalidade: análise de pesquisas. Avaliação Psicológica, 10(1), 51–62.

    Google Scholar 

  • Srivastava, S., John, O., Gosling, S., & Potter, J. (2003). Development of personality in early and middle adulthood: Set like plaster or persistent change? Journal of Personality and Social Psychology, 84(5), 1041–1053.

    Article  Google Scholar 

  • Terraciano, A., McCrae, R. R., Brant, L. J., & Costa Jr., P. T. (2005). Hierarchical linear modeling analyses of the NEO-PI-R scales in the Baltimore longitudinal study of aging. Psychology and Aging, 20, 493–506.

    Article  Google Scholar 

  • Vasconcelos, T. S. (2005). O inventário fatorial dos cinco fatores de personalidade no ambiente de trabalho. Tese de Doutorado. Brasília: Universidade de Brasília.

  • Vasconcellos, S. J. S., & Hutz, C. S. (2008). Construção e validação de uma escala de abertura à experiência. Avaliação Psicológica, 7(2), 135–141.

    Google Scholar 

  • Weisberg, Y. J., DeYoung, C. G., & Hirsh, J. B. (2011). Gender differences in personality across the ten aspects of the Big Five. Frontiers in Psychology, 2, 178.

    Article  PubMed  PubMed Central  Google Scholar 

  • Weston, R., Gore, P. A., Chan, F., & Catalano, D. (2008). An introduction to using structural equation models in rehabilitation psychology. Rehabilitation Psychology, 53, 340–356.

    Article  Google Scholar 

Download references


Not applicable.


No funding was received.

Availability of data and materials

Databanks can be requested to the first author by the e-mail address listed in the contact details.

Author information

Authors and Affiliations



JAL has made substantial contributions to the conception and design of the study, analysis and interpretation of the data, and drafting and revising the manuscript. AJdSP has made substantial contributions to the interpretation of the data and drafting and revising the manuscript. JMdA has made substantial contributions to the interpretation of the data and revising the manuscript. MFDP has made substantial contributions to the conception and design, acquisition of the data, and analysis and interpretation of the data. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Alexandre José de Souza Peres.

Ethics declarations

Authors’ information

JAL is a professor at the Institute of Psychology of the University of Brasília, Brazil.

AJSP is a professor at the Federal University of Mato Grosso do Sul, Brazil.

JMA is a professor at the Institute of Psychology of the University of Brasília, Brazil.

MFDP is analyst on health management at the Fundação Oswaldo Cruz (Fiocruz), Brazil.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Laros, J.A., Peres, A.J.S., Andrade, J.M. et al. Validity evidence of two short scales measuring the Big Five personality factors. Psicol. Refl. Crít. 31, 32 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: