MBTI® Step II Research
Gender differences MBTI users Differences by educational level Differences by language Internal consistency reliability Further research information
Gender differences
Thirteen of the twenty facets showed statistically significant gender differences. The largest differences were on four of the T-F facet scales, with females being, on average, more Empathetic, Compassionate, Accepting and Tender, and males being, on average, more Logical, Reasonable, Critical and Tough.
Back to top ^
MBTI users
The trial sample for Step II included a group of 453 MBTI Step I users. How do these MBTI users differ from the general UK population?
In Step I terms, MBTI users are much more likely to be Intuitive. 75% saw themselves as Intuitive, compared with only 23% for the general population. They were also more likely to have Thinking, Perceiving and to some extent Extraverted preferences than the general population:
| The percentage of each type within the MBTI users group: |
ISTJ N = 28 6% SSR = 0.45 |
ISFJ N = 16 3% SSR = 0.28 |
INFJ N = 30 7% SSR = 3.88 |
INTJ N = 49 11% SSR = 7.71 |
ISTP N = 3 1% SSR = 0.11 |
ISFP N = 3 1% SSR = 0.11 |
INFP N = 46 10% SSR = 3.19 |
INTP N = 36 8% SSR = 3.29 |
ESTP N = 13 3% SSR = 0.50 |
ESFP N = 9 2% SSR = 0.23 |
ENFP N = 55 12% SSR = 1.92 |
ENTP N = 57 13% SSR = 4.50 |
ESTJ N = 21 5% SSR = 0.44 |
ESFJ N = 18 4% SSR = 0.32 |
ENFJ N = 30 7% SSR = 2.36 |
ENTJ N = 39 9% SSR = 2.97 |
SSR (self selection ratio) shows the extent to which each type is over-represented (SSR greater than 1) or under-represented (SSR less than 1) compared to the UK general population. All Intuitive types are over-represented, and all Sensing types are under-represented.
These Step I results tell part of the story, but Step II can provide further information. In the Step II trialling, an additional UK sample of 205 people, broadly representative of the general population, took the questionnaire. The table below shows how MBTI users differ from this group in terms of their Step II results
| Step 2 Facet Scale |
MBTI Users (n=453) |
Rep Sample (n=205) |
Mean Difference |
E-I Facet Scales Initiating-Receiving Expressive-Contained Gregarious-Intimate Active-Reflective Enthusiastic-Quiet |
Mean .01 -.42 .06 -.23 -.43 |
SD 3.87 3.61 3.41 3.43 3.48 |
Mean .34 .85 .48 .48 .25 |
SD 3.67 3.20 3.10 3.51 3.15 |
.09 .36** .13 .20* .20* |
S-N Facet Scales Concrete-Abstract Realistic-Imaginative Practical-Conceptual Experiential-Theoretical Traditional-Original |
1.79 1.92 .96 1.61 1.55 |
3.17 3.20 3.05 3.24 3.10 |
.00 -.24 -.69 -.38 -.35 |
3.12 3.32 3.00 3.16 3.21 |
-.55** -.64** -.53** -.59** -.58** |
T-F Facet Scales Logical-Empathetic Reasonable-Compassionate Questioning-Accommodating Critical-Accepting Tough-Tender |
-.75 .28 -.96 1.51 .59 |
3.37 3.44 3.02 2.75 3.39 |
.26 .50 .05 .91 1.11 |
3.16 3.24 3.19 2.85 3.26 |
.30** .07 .33** -.21** .15 |
J-P Facet Scales Systematic-Casual Planful-Open-Ended Early Starting-Pressure-Prompted Scheduled-Spontaneous Methodical-Emergent |
.30 -.09 .30 .00 -.05 |
3.46 3.96 3.72 3.89 3.24 |
.38 .04 -.58 -.23 .03 |
3.33 3.67 3.23 3.32 3.08 |
.02 .03 -.25** -.06 .03 |
Mean difference: difference in means between the MBTI users group and the representative sample. Positive value indicates that MBTI users are nearer the left-hand pole of the facet; negative value indicates that MBTI users are nearer the right-hand pole. * difference statistically significant at the 5% level. ** significant at the 1% level.
On average, the MBTI users are significantly nearer the Intuitive pole on all five S-N facets.
For E-I, they are more Expressive, Active and Enthusiastic, with no significant difference on the other two facets. Even those MBTI users whose preference is for Introversion may be more likely to exhibit Extraverted behaviours than other Introverts.
On T-F, MBTI users tend to be more Logical and Questioning, but also more Accepting, than the general population. Such a mix of results is not uncommon amongst those who work in human resources or people-focused consultancy. Those with a preference for Feeling often need to present some T behaviours in a business context; those with a preference for Thinking do, by the nature of their work, need to attend to people factors.
For J-P, they are more likely to be Pressure-Prompted, with almost zero differences on the other four facets. Although MBTI users are overall evenly split between Judging and Perceiving types, they are as a group more likely to do their best work under time pressure.
Back to top ^
Differences by educational level
As education systems across Europe differ greatly, the age at which people left full-time education was used as an indicator of educational level. The sample group was split into three categories, which it was felt would be broadly representative of people who left full-time education either:
- During or at the end of secondary education (ages 15-18 years)
- During or at the end of further education (ages 19-22 years)
- During or at the end of postgraduate education (ages 23 years or older)
No significant differences were found amongst any of the facets in the E-I preference block. However, significant differences were found for almost all of the other facets (with the exception of Tough-Tender and Methodical-Emergent).
Age at which people left full-time education appears to show no relationship with any of the facets in the E-I preference block.
People who left full-time education at a later stage in life tended, on average, to achieve scores further towards the N poles of all the S-N facets than those who left education at an earlier age.
People who left full-time education at a later stage in life tended, on average, to achieve scores further towards the T poles of most of the T-F facets than those who left education at an earlier age.
Back to top ^
Differences by language
The Step II questionnaire was trialled in 9 different languages. The list below shows how those taking the questionnaire in each language differed from the group as a whole. A range of different sampling methods was used, and not all of the groups were representative of the general population of that country.
Danish Mean facet scores amongst the Danish-speaking sample tended more towards the E pole on three facets (Expressive-Contained, Active-Reflective and Enthusiastic-Quiet), more towards the N pole on three facets (Concrete-Abstract, Experiential-Theoretical and Traditional-Original), and more towards the P pole on two facets (Early starting-Pressure prompted and Scheduled-Spontaneous).
Dutch Amongst the Dutch-speaking sample there was a relatively consistent pattern across the facets within the S-N preference block, with mean scores tending towards the S pole on four of the facet scales (Concrete-Abstract, Realistic-Imaginative, Practical-Inferential and Experiential-Theoretical). In addition, mean scores tended toward the F pole on one facet (Questioning-Accommodating).
English Amongst the UK sample there were very few deviations from the overall sample means. Scores tended more towards the N pole on one facet (Experiential-Theoretical) and toward the F pole on one facet (Critical-Accepting).
French Amongst the French-speaking sample, mean facet scores showed fewer deviations from the overall sample means. Scores tended more towards the I pole on one facet (Expressive-Contained), more towards the T pole on two facets (Questioning-Accommodating and Critical-Accepting), and more towards the F pole on another facet (Logical-Empathetic).
German Amongst the German-speaking sample, there were relatively few deviations from the overall sample means. Scores tended more towards the S pole on one facet (Practical-Inferential), more towards the T pole on one facet (Critical-Accepting), and more towards the J pole on another facet (Systematic-Casual).
Italian Amongst the Italian-speaking sample, there were four noticeable deviations from the overall sample means. Scores tended more towards the I pole on one facet (Participative-Reflective), more towards the N pole on one facet (Experiential-Theoretical), and more towards the T pole on two facets (Reasonable-Compassionate and Critical-Accepting).
Norwegian Amongst the Norwegian-speaking sample there was a consistent pattern across the facets within the E-I preference block, with mean scores tending towards the E pole on all five of the facet scales (Initiating-Receiving, Expressive-Contained, Gregarious-Intimate, Active-Reflective and Enthusiastic-Quiet). In addition, mean scores tended toward the F pole on one facet (Tough-Tender), and towards the J pole on one facet (Systematic-Casual). This pattern is almost identical to the Swedish-speaking sample.
Spanish Amongst the Spanish-speaking sample, there were five noticeable deviations from the overall sample means. Scores tended more towards the S pole on two facets (Concrete-Abstract and Experiential-Theoretical), more towards the T pole on one facet (Reasonable-Compassionate) and more towards the J pole on two facets (Systematic-Casual and Early starting-Pressure prompted).
Swedish Amongst the Swedish-speaking sample there was a consistent pattern across the facets within the E-I preference block, with mean scores tending towards the E pole on all five of the facet scales (Initiating-Receiving, Expressive-Contained, Gregarious-Intimate, Active-Reflective and Enthusiastic-Quiet). In addition, mean scores tended toward the N pole on one facet (Experiential-Theoretical), toward the F pole on one facet (Tough-Tender), and towards the J pole on one facet (Systematic-Casual). This pattern is almost identical to the Norwegian-speaking sample.
Back to top ^
Internal consistency reliability
The new European edition has much improved internal consistency reliability (compared with Form K), as shown in the table below:
|
No. of items |
Alpha |
Mean |
SD |
SEm |
E-I Facet Scales Initiating-Receiving Expressive-Contained Gregarious-Intimate Active-Reflective Enthusiastic-Quiet |
8 7 7 8 9 |
.84 .79 .67 .74 .74 |
-.35 -.06 -.14 -.46 -.50 |
3.43 3.24 2.90 3.17 3.09 |
.07 .06 .06 .06 .06 |
S-N Facet Scales Concrete-Abstract Realistic-Imaginative Practical-Conceptual Experiential-Theoretical Traditional-Original |
9 7 8 8 8 |
.75 .75 .59 .78 .74 |
.17 .42 -.13 -.33 .19 |
2.94 3.26 2.95 3.12 3.06 |
.06 .06 .06 .06 .06 |
T-F Facet Scales Logical-Empathetic Reasonable-Compassionate Questioning-Accommodating Critical-Accepting Tough-Tender |
9 8 7 8 8 |
.84 .76 .50 .54 .77 |
.07 -.44 -.79 .22 .49 |
3.16 3.18 2.95 2.64 3.13 |
.06 .06 .06 .05 .06 |
J-P Facet Scales Systematic-Casual Planful-Open-Ended Early Starting-Pressure-Prompted Scheduled-Spontaneous Methodical-Emergent |
8 6 6 8 8 |
.77 .81 .77 .77 .68 |
.08 .07 -.24 -.33 -.40 |
3.19 3.56 3.55 3.27 3.02 |
.06 .07 .07 .06 .06 |
| Median values |
8 |
.76 |
|
Test-retest reliability is also good, with coefficients ranging from 0.66 to 0.92 with a median of 0.84.
Back to top ^
Further research information
The results above give a taste of the more extensive research findings in the MBTI® Step II Manual: European edition. The research section of the manual contains more details on:
- The development of the new questionnaire
- Properties of the Step II facet scales
- The relationship of Step I to Step II, and between different versions of Step II
- Reliability of the Step II facet scales
- Reliability of Step II interpretative categories
- Construct validity and factor analysis
- Correlation with other instruments
- Validity against behavioural descriptions
- Studies providing evidence for the application of Step II results
- Use of the Step II facet scales to test Type theory
- Group differences by gender, age, ethnic origin, education level, occupational level and occupational sector.
Back to top ^ |