Science self-concept – More than the sum of its parts?

March 21, 2020

The article “Science Self-Concept – More Than the Sum of its Parts?” has now been published in “The Journal of Experimental Education” (btw in existence since 1932). The first 50 copies are free, in case you are interested.

My first preprint. 😀Is a general science self-concept equivalent to an aggregated subject-specific science concept? It's about different modeling approaches, measurement invariance and concepts of equivalence. Check it out! Comment if you like: https://t.co/3STwiTV0Up pic.twitter.com/SfbYxuHfse
— Ulrich Schroeders (@Navajoc0d3) November 6, 2019

In comparison to the preprint version, some substantial changes have been made to the final version of the manuscript, especially in the research questions and in the presentation of the results. Due to word restriction, we also removed a section from the discussion, in which we summarized differences and commonalities of the bifactor vs. higher-order models. We also speculated about why the type of modeling may also depend on the study’s subject, that is, on conceptual differences in intelligence vs. self-concept research. The argumentation may be a bit wonky, but at least I find the idea so persuasive that I want to reproduce it in the following. If you have any comments, please feel free to drop me a line.

Hierarchical vs. Bifactor Modeling

Reviewing the psychometric literature on hierarchical and bifactor modeling, one gets the impression that there are large statistical or conceptual differences between these modeling approaches. For example, Chen et al. (2006) listed several advantages of the bifactor model over the second-order model, but the differences are presumably more subtle (Gustafsson & Balke, 1993). Remember that the higher-order models can be turned into a special version of the bifactor models by means of the Schmid-Leiman-transformation (Reise et al., 2010, Schulze, 2004), that is, an (unconstrained) bifactor model and a higher-order model will only produce different results to the extent that the proportionality constraints are violated.

In our reading, the long debate about the appropriate modeling approach is blurred by the fact that the indicators of such models are often either parcels or subtests scores rather than items. In case of aggregated scores of different scales or subtests (e.g., Swedish and mathematics achievement as marker tests for crystallized intelligence in Gustafsson & Balke, 1993), often the bifactor model is preferred because the higher uniqueness of the indicators makes it hard to build a common trait in the higher-order model (see also Cucina & Byle, 2017). In case of parcels, the influence on modeling is more opaque (Cole, Perkins, & Zelkowitz, 2016), but parceling is often misused to mask heterogeneity by leveling out content differences (Little, Cunnigham, Shahar, & Widaman, 2002), which leads to an artificial homogenization of the indicators and generally weakens the subject-specific factors.

Compared to studies discussed in the psychometric literature on hierarchical vs. bifactor modeling, there are some differences in the present case. First, all models were estimated at the item level (with rather homogeneous sets of items), making it obsolete to aggregate the responses. Second, in contrast to research on cognitive abilities that relies on high interrelations (i.e., positive manifold, van der Maas et al. 2006), self-concept research has to deal with two opposing self-concepts—the verbal and the mathematical self-concept that are almost unrelated (Möller, Pohlmann, Köller, & Marsh, 2009). Also, self-concepts are only moderately correlated in the sciences (Jansen, Schroeders, & Lüdtke, 2014). These differences might have led to our result of the absence of significant differences between the bifactor and the second-order model, despite the large sample size. Thus, both models concur that the aggregated science self-concept and the subject-unspecific science self-concept are very highly correlated (r = .94). Therefore, one might be inclined to say that both measurements are equivalent, but this is not necessarily true.

That statistical unity is not to be confused with causal unity and that issues of measurement invariance have to be taken into account are two points we still discuss in the published version, to which we refer the interested reader.

References

Chen, F. F., West, S. G., & Sousa, K. H. (2006). A comparison of bifactor and second-order models of quality of life. Multivariate Behavioral Research, 41(2), 189–225. https://doi.org/10.1207/s15327906mbr4102_5
Cole, D. A., Perkins, C. E., & Zelkowitz, R. L. (2016). Impact of homogeneous and heterogeneous parceling strategies when latent variables represent multidimensional constructs. Psychological Methods, 21(2), 164–174. https://doi.org/10.1037/met0000047
Cucina, J., & Byle, K. (2017). The bifactor model fits better than the higher-order model in more than 90% of comparisons for mental abilities test batteries. Journal of Intelligence, 5(3), 27. https://doi.org/10.3390/jintelligence5030027
Gustafsson, J.-E., & Balke, G. (1993). General and specific abilities as predictors of school achievement. Multivariate Behavioral Research, 28(4), 407–434. https://doi.org/10.1207/s15327906mbr2804_2
Jansen, M., Schroeders, U., & Lüdtke, O. (2014). Academic self-concept in science: Multidimensionality, relations to achievement measures, and gender differences. Learning and Individual Differences, 30, 11–21. https://doi.org/10.1016/j.lindif.2013.12.003
Little, T. D., Cunningham, W. A., Shahar, G., & Widaman, K. F. (2002). To parcel or not to parcel: Exploring the question, weighing the merits. Structural Equation Modeling, 9(2), 151–173. https://doi.org/10.1207/S15328007SEM0902_1
Möller, J., Pohlmann, B., Köller, O., & Marsh, H. W. (2009). A meta-analytic path analysis of the internal/external frame of reference model of academic achievement and academic self-concept. Review of Educational Research, 79(3), 1129–1167. https://doi.org/10.3102/0034654309337522
Reise, S. P., Moore, T. M., & Haviland, M. G. (2010). Bifactor models and rotations: Exploring the extent to which multidimensional data yield univocal scale scores. Journal of Personality Assessment, 92(6), 544–559. https://doi.org/10.1080/00223891.2010.496477
Schulze, R. (2004). Modeling structures of intelligence. In O. Wilhelm, & R. W. Engle (Eds.), Handbook of understanding and measuring intelligence (pp. 241–263). Thousand Oaks, CA: Sage Publications.
van der Maas, H. L. J., Dolan, C. V., Grasman, R. P. P. P., Wicherts, J. M., Huizenga, H. M., & Raijmakers, M. E. J. (2006). A dynamical model of general intelligence: The positive manifold of intelligence by mutualism. Psychological Review, 113(4), 842–861. https://doi.org/10.1037/0033-295X.113.4.842