Computational Thinking (CT) is widely recognised as a transversal competence essential for learning, problem solving, and knowledge transfer across disciplines. However, its effective integration into school education remains strongly dependent on the availability of assessment instruments that are pedagogically meaningful, psychometrically sound, and applicable across diverse educational contexts. This paper presents COMATH, a cross-national assessment instrument designed to evaluate CT in students aged 9–14. The instrument adopts a phase-based development and validation framework that integrates Bebras-inspired tasks, Item Response Theory, factor-analytic methods, learning analytics, and teacher and student feedback. The assessment was iteratively developed and piloted between 2023 and 2025 in six European countries, with data collected from 6,480 students and 155 teachers. The findings demonstrate that a phased assessment approach enables systematic calibration of task difficulty, robust evaluation of item functioning, and meaningful interpretation of student performance across age groups and national contexts. The results further highlight how well-designed CT assessment can support instructional decision-making rather than serve solely as a summative measure. The study argues for conceptualising CT assessment as a dynamic and iterative process that links measurement, psychometric validation, and pedagogical use in school education.
The new Croatian Informatics curriculum, which introduces computational thinking concepts into learning outcomes has been put into practice. A computational thinking assessment model reflecting the learning outcomes of the Croatian curriculum was created using an evidence-centered design approach. The possibility of assessing the computational thinking concepts, abstraction, decomposition, and algorithmic thinking, in an actual classroom situation and examples of such assessment is increasingly coming to the forefront of computer science educational research. Precisely for that purpose, the research was conducted. Research data are collected through the test and questionnaire of 407 pupils (10 middle schools, age 12), analysed by exploratory factor analysis and non-parametric tests. Results showed that the presented model was suitable to assess the understanding of the concepts of abstraction and algorithmic thinking, independently of the previous experience with programming languages and pupil's gender, while assessment of decomposition needs more work and improvement, some recommendations are provided. Also, it received positive feedback from pupils and teachers what implicated that such an assessment model could help teachers in building a real-time measurement instrument.