<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">INFEDU</journal-id>
      <journal-title-group>
        <journal-title>Informatics in Education</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2335-8971</issn>
      <issn pub-type="ppub">1648-5831</issn>
      <publisher>
        <publisher-name>VU</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">INFEDU_2024_1_10</article-id>
      <article-id pub-id-type="doi">10.15388/infedu.2024.10</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Reliability and Validity of an Automated Model for Assessing the Learning of Machine Learning in Middle and High School: Experiences from the “ML for All!” course</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Rauber</surname>
            <given-names>Marcelo Fernando</given-names>
          </name>
          <email xlink:href="mailto:marcelo.rauber@ifc.edu.br">marcelo.rauber@ifc.edu.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_000"/>
        </contrib>
        <aff id="j_INFEDU_aff_000">Graduate Program in Computer Science, Department of Informatics and Statistics, Federal University of Santa Catarina, Florianópolis/SC, Brazil.
Federal Institute Catarinense (IFC) - Camboriú/SC - Brazil.</aff>
        <contrib contrib-type="author">
          <name>
            <surname>von Wangenheim</surname>
            <given-names>Christiane Gresse</given-names>
          </name>
          <email xlink:href="mailto:c.wangenheim@ufsc.br">c.wangenheim@ufsc.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_001"/>
        </contrib>
        <aff id="j_INFEDU_aff_001">Graduate Program in Computer Science, Department of Informatics and Statistics, Federal University of Santa Catarina, Florianópolis/SC, Brazil.</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Barbetta</surname>
            <given-names>Pedro Alberto</given-names>
          </name>
          <email xlink:href="mailto:pedro.barbetta@ufsc.br">pedro.barbetta@ufsc.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_002"/>
        </contrib>
        <aff id="j_INFEDU_aff_002">Graduate Program in Methods and Management in Evaluation - Federal University of Santa Catarina, Florianópolis/SC, Brazil.</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Borgatto</surname>
            <given-names>Adriano Ferreti</given-names>
          </name>
          <email xlink:href="mailto:adriano.borgatto@ufsc.br">adriano.borgatto@ufsc.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_003"/>
        </contrib>
        <aff id="j_INFEDU_aff_003">Graduate Program in Methods and Management in Evaluation - Federal University of Santa Catarina, Florianópolis/SC, Brazil.</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Martins</surname>
            <given-names>Ramon Mayor</given-names>
          </name>
          <email xlink:href="mailto:ramon.mayor@posgrad.ufsc.br">ramon.mayor@posgrad.ufsc.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_004"/>
        </contrib>
        <aff id="j_INFEDU_aff_004">Graduate Program in Computer Science, Department of Informatics and Statistics, Federal University of Santa Catarina, Florianópolis/SC, Brazil.</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Hauck</surname>
            <given-names>Jean Carlo Rossa</given-names>
          </name>
          <email xlink:href="mailto:jean.hauck@ufsc.br">jean.hauck@ufsc.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_005"/>
        </contrib>
        <aff id="j_INFEDU_aff_005">Graduate Program in Computer Science, Department of Informatics and Statistics, Federal University of Santa Catarina, Florianópolis/SC, Brazil.</aff>
      </contrib-group>
      <volume>23</volume>
	  <issue>2</issue>
	  <fpage>409</fpage>
	  <lpage>437</lpage>
      <permissions>
        <copyright-year>2023</copyright-year>
        <copyright-holder>Vilnius University, ETH Zürich</copyright-holder>
        <license license-type="open-access">
          <license-p>Open access article under the CC BY license.</license-p>
        </license>
      </permissions>
      <abstract>
        <p>The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students’ learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of an automated assessment of the students’ learning of an image classification model created as a learning outcome of the “ML for All!” course. Results based on data collected from 240 students indicate that the assessment can be considered reliable (coefficient Omega = 0.834/Cronbach's alpha α=0.83). We also identified moderate to strong convergent and discriminant validity based on the polychoric correlation matrix. Factor analyses indicate two underlying factors “Data Management and Model Training” and “Performance Interpretation”, completing each other. These results can guide the improvement of assessments, as well as the decision on the application of this model in order to support ML education as part of a comprehensive assessment.</p>
      </abstract>
      <kwd-group>
        <label>Keywords</label>
        <kwd>K-12</kwd>
        <kwd>Middle and high school</kwd>
        <kwd>Machine Learning</kwd>
        <kwd>Artificial Intelligence</kwd>
        <kwd>Neural network</kwd>
        <kwd>Image Classification</kwd>
        <kwd>Assessment</kwd>
        <kwd>Evaluation</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
