Introduction

Computer programming is an essential skill in modern society. According to the Asian Development Bank (2022), programming skills are becoming increasingly essential in all job sectors worldwide. This trend is unsurprising, considering the ongoing digitalization and the growing complexity of technological solutions in various industries. Many non-coding job positions also now demand proficiency in computer programming. The International Labour Organization (2021) posited that coding skills today are required not only of programmers but also of scientists, engineers, designers, and artists. This demand underscores the necessity of integrating these skills into educational curricula. Governments across the globe have responded by adapting their educational systems. This adaptation involves the systematic incorporation of programming courses across different levels of education (Ou et al., 2023), in addition to offering them within information technology (IT), computer science (CS), and other computing programs. For instance, Macrides et al. (2022) noted a significant trend in early childhood education towards teaching coding. Their systematic review highlighted the adoption of screen-based visual programming and robotics for this purpose. This trend continues into middle school, where Lira et al. (2022) identified programming camps as a significant supplemental educational tool. It also extends to higher education, where Agbo et al. (2019) observed the increasing integration of computational thinking in teaching problem-solving skills and programming education. By incorporating these skills into the curricula, countries are preparing future generations for the challenges and opportunities of an increasingly digital world.

Despite this global shift in education, the world continues to face a shortage of skilled programmers. In a report covering four major countries (i.e., Canada, China, Germany, and Singapore), the International Labour Organization (2020) identified substantial deficits in the number of software developers and programmers. Similar widespread shortages have been observed by the European Labour Authority (2023) in the EU27, Norway, and Switzerland. This global issue is multifaceted, but a critical component centers on the challenges within education systems. Programming education often presents significant learning difficulties for students. Garcia (2021) asserted that these challenges are influenced by individual differences (e.g., inherent aptitudes and learning styles) as well as cognitive (e.g., problem-solving skills and logical reasoning abilities) and non-cognitive (e.g., motivation and attitude) factors. The effectiveness of programming education is also highly dependent on the quality of teaching and curriculum design. However, Ou et al. (2023) observed that the current quality of programming education is lacking and that there is a need for enhanced curriculum development in schools. This situation underscores the importance of rigorous assessment in programming education. Such assessments are vital in identifying areas where students face challenges. They also guide the refinement of teaching methods and curricula to ensure they are aligned with the evolving needs of learners. However, most assessments focus on measuring overall performance rather than diagnosing specific cognitive strengths and weaknesses (Garcia & Revano, 2021). This general approach tends to overlook critical insights into individual learning attributes that are crucial for addressing the unique challenges each student faces.

This gap in traditional assessment methods highlights the need for an approach that can identify the cognitive abilities of programming students. While traditional assessments measure overall performance, they often fail to diagnose specific cognitive strengths and weaknesses. This limitation makes it difficult to provide targeted support. To address this issue, Cognitive Diagnosis Modeling (CDM) emerges as a fitting solution. CDM is a sophisticated analytical approach that focuses on understanding and diagnosing the specific cognitive skills and knowledge structures individuals possess. By providing detailed and multidimensional diagnostic feedback, CDM can identify examinees' strengths and weaknesses across a spectrum of attributes. This technique can be particularly beneficial in programming education, where understanding the intricacies of a student's cognitive abilities can lead to more effective instructional strategies. Applying a CDM approach can inform curriculum development and optimize learning outcomes by aligning instructional methods with the diverse cognitive needs of students. Therefore, this study seeks to address the following research questions (RQ):

  1. RQ1: Which cognitive diagnosis model most adequately fits the empirical data?
  2. RQ2: What are the attributes mastered by students at the grade and individual levels?
  3. RQ3: How do these mastery profiles vary between CS and IT students?

Background of the Study

Cognitive Diagnosis Modeling

CDM is an advanced analytical approach designed to understand and diagnose the specific cognitive skills and knowledge structures that individuals possess. Its foundational roots date back to the development of the rule space method (Tatsuoka, 1983). Over the years, CDM has evolved to incorporate various models and techniques aimed at providing detailed and multidimensional diagnostic feedback (Liu et al., 2023). Unlike traditional psychometric frameworks that are more descriptive, such as item response theory (IRT) and classical test theory (CTT), CDM offers a diagnostic framework that classifies examinees' strengths and weaknesses across a spectrum of attributes (de la Torre & Minchen, 2014). In this context, an attribute refers to the essential knowledge and cognitive abilities required to solve specific problems or tasks. For example, a CDM could indicate whether students learning programming have mastered specific attributes essential to coding, such as understanding basic syntax, applying control structures such as loops and conditionals, or efficiently debugging code (Garcia et al., 2022). This level of granularity provides more detailed evidence than other psychometric models, making CDM particularly useful for guiding teaching and learning decisions in the classroom (Effatpanah et al., 2019; Paulsen & Valdivia, 2022). Given its capabilities as a psychometric model, CDM is frequently used as the analytical framework in cognitive diagnostic assessment because it offers a more comprehensive evaluation of students' learning processes (Li et al., 2021).

The application of CDM has shown significant success in various educational contexts, including reading (Jang et al., 2015), listening (Meng et al., 2023), writing (Effatpanah et al., 2019), mathematics (Chandía et al., 2023), and accounting (Helm et al., 2022). These studies have demonstrated the effectiveness of CDM in providing detailed insights into specific skill sets and cognitive abilities of students. However, despite the empirical evidence demonstrating the benefits of CDM in other educational domains, it has not yet been adopted in programming education. A review of the literature reveals a significant gap, as no studies have specifically employed CDM as an assessment approach in programming education. The closest prior work involves the assessment of computational thinking, which has a broader focus on problem-solving and algorithmic reasoning rather than programming-specific skills (Li & Traynor, 2022). Unfortunately, traditional assessments used in programming education (e.g., Qayyum et al., 2018; Schnieder & Williams, 2022), while useful in evaluating the general understanding and competence of learners, often fail to diagnose specific cognitive strengths and weaknesses. In contrast, CDM offers a unique opportunity to identify the nuances of students' cognitive abilities. By diagnosing specific areas where students may struggle or excel, CDM can provide educators with the insights needed to tailor instruction more effectively. Given the limitations of traditional assessment methods in programming, there is a compelling need to explore and integrate CDM into this field to enhance both learning outcomes and instructional practices.

Foundational Models in Cognitive Diagnosis

An important consideration in applying a CDM is selecting an appropriate model (Wu et al., 2024). CDM encompasses various types of models, each with unique features and applications. Saturated models, such as the G-DINA (generalized deterministic inputs, noisy "and" gate) model (de la Torre, 2011), are the most comprehensive, as they allow for the estimation of all possible interactions among attributes. These models are highly flexible and can capture complex relationships, but they require a large amount of data and can be computationally intensive. Conversely, constrained models simplify the structure by assuming that certain interactions are negligible, thus reducing the number of parameters to be estimated. Examples of constrained models include the DINA (deterministic inputs, noisy "and" gate) model (Junker & Sijtsma, 2001), the DINO (deterministic input, noisy "or" gate) model (Templin & Henson, 2006), the additive CDM (ACDM; de la Torre, 2011), the linear logistic model (LLM; Maris, 1999), and the reduced reparameterized unified model (RRUM; Hartz, 2002), among others. These models are easier to manage and interpret but may not capture all the nuances of the data. When the relationships among attributes are uncertain, Ma and de la Torre (2020) noted that higher-order G-DINA models with Rasch, 1-parameter logistic (1PL), and 2-parameter logistic (2PL) joint attribute distributions can be considered when selecting appropriate models for empirical studies.
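To make the contrast between constrained and saturated formulations concrete, the item response functions below sketch the DINA model and the G-DINA model in standard notation (following Junker & Sijtsma, 2001, and de la Torre, 2011); they are reproduced here for reference and are not derived from the present dataset.

```latex
% DINA (constrained): item j has only a guessing parameter g_j and a slip parameter s_j.
% \eta_{ij} = 1 only when examinee i has mastered every attribute required by item j.
\eta_{ij} = \prod_{k=1}^{K} \alpha_{ik}^{q_{jk}}, \qquad
P(X_{ij} = 1 \mid \boldsymbol{\alpha}_i) = (1 - s_j)^{\eta_{ij}} \, g_j^{\,1 - \eta_{ij}}

% G-DINA (saturated, identity link): intercept, main effects, and all interactions
% among the K_j^* attributes required by item j.
P(X_{ij} = 1 \mid \boldsymbol{\alpha}_{lj}^{*}) = \delta_{j0}
  + \sum_{k=1}^{K_j^{*}} \delta_{jk}\,\alpha_{lk}
  + \sum_{k'=k+1}^{K_j^{*}} \sum_{k=1}^{K_j^{*}-1} \delta_{jkk'}\,\alpha_{lk}\alpha_{lk'}
  + \cdots
  + \delta_{j12\ldots K_j^{*}} \prod_{k=1}^{K_j^{*}} \alpha_{lk}
```

The reduced models listed above can all be obtained by constraining the G-DINA parameters (e.g., the DINA model retains only the intercept and the highest-order interaction term).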

In some cases, a mixed model approach is used as a supplement to standard CDM analysis, where different models are applied to individual items within the same assessment. This technique allows for a tailored analysis that can better fit the varying complexities of different questions in an instrument. For instance, simpler items might be analyzed with reduced models, while more complex items might require saturated models to fully capture the cognitive processes involved. In a practical application, Ravand and Robitzsch (2018) applied this method in a reading comprehension context and found that a mixed model provided a better fit than the G-DINA model. Given the abundance of viable models, de la Torre and Lee (2013) argued that objectively choosing the most appropriate model is crucial rather than relying on personal preference or a predetermined model. As a guiding approach, the parsimony principle suggests selecting the simplest model when faced with multiple statistically equivalent models. However, model selection should also be based on how well the model assumptions correspond to the theoretical basis used to construct a given test (Li et al., 2015). de la Torre and Lee (2013) noted that the Wald test, a statistical test for parameter significance, can be used to compare models under the G-DINA framework. Using this test allows the selection of the model that best fits the specific context of the assessment. The choice of model impacts the accuracy and utility of the diagnostic information obtained, making it essential to consider the characteristics of the items and the attributes being measured (Effatpanah et al., 2019; Helm et al., 2022). This careful selection ensures that the CDM approach is effectively tailored to provide fine-grained diagnostic information and the most meaningful insights into students' cognitive abilities and learning needs.

Methods

Study Setting and Participants

The research was conducted at one of the leading institutes of technology in the Philippines. This university hosts a College of Computer Studies and Multimedia Arts (CCSMA), which offers IT and CS undergraduate programs. A fundamental component shared between these programs is a series of introductory and advanced computer programming courses. One of the programming courses that plays a significant role in the curriculum of both programs is Computer Programming 1, which comprises lecture (CCS0003) and laboratory (CCS0003L) components. The primary objective of this introductory programming course is to teach first-year computing students the foundational skills in computational logic and design. The course covers traditional problem-solving techniques (e.g., flowcharting and pseudo-coding) and basic programming concepts, including input/output operations, conditional and repetitive control structures, and arrays. Garcia (2021) utilized the same course in conducting experimental research on evaluating cooperative learning pedagogy in computer programming. The selection of this course for the study is strategic, as it represents a shared educational experience for IT and CS students. The course maintains uniformity in its syllabus, teaching materials, and online modules, ensuring consistency of instruction across faculty members and between the two programs. It also guarantees that all computing students are assessed under similar conditions, making the evaluation of their skills and knowledge fair and unbiased. By maintaining uniformity in course content and delivery, the research design effectively controls for extraneous variables that might otherwise influence the outcome of the study (Garcia, 2023).

Research Instrument and Data Collection

This study utilized a comprehensive 100-item multiple-choice final examination from the CCS0003 course. The departmental examination was administered during the first trimester of the academic year 2023-2024, and all IT and CS students enrolled in the course completed it within one hour. The instrument development was spearheaded by the faculty-in-charge, with subsequent validation by a team of co-faculty members who also teach the course. This collaborative approach to the instrument's development and validation ensured its academic rigor and alignment with the course's educational objectives. It is important to note that, although arguably better approaches to assessing students exist (e.g., practical coding assessments), the number of items and the multiple-choice format are departmental requirements. Despite the multiple-choice format, several questions presented students with scenarios involving machine problems, requiring them to interpret and analyze provided code snippets. Successfully responding to these questions necessitates a comprehensive understanding of the underlying algorithms. Additionally, this instrument was created simply as a final course assessment and not specifically for CDM analysis. Lee et al. (2012) argued that very few assessments are designed based on a cognitive diagnosis framework. More commonly, CDM is applied retrospectively to assessments initially developed within a unidimensional item response theory framework (i.e., retrofitting).

Nonetheless, the primary reason for selecting the final examination is the availability of detailed data (i.e., student responses on an item-by-item basis and the correctness of these responses). The dataset was readily accessible through ZipGrade – a mobile optical scanner application for grading multiple-choice assessments. The administrative office of the CCSMA was formally requested to provide a copy of the examination and the results. In response to the request, and with consideration for ethical research practices, they provided randomly selected data from various IT (n = 308) and CS (n = 269) classes. Upon receipt of the data, the first step was anonymizing it to ensure student confidentiality. This anonymization process involved removing all personally identifiable information, such as names, identification numbers, and any other markers traceable to individual students. This step was crucial for protecting student privacy and upholding the integrity of our research. More importantly, this approach strictly complied with data protection regulations and institutional ethical guidelines.
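As a rough sketch of this preprocessing step, the R code below assumes the ZipGrade export is a CSV file with one row per student, a program label, identifier columns, and one scored (0/1) column per item; the file name and column names are hypothetical placeholders rather than the actual export format.

```r
# Hypothetical preprocessing sketch: load the exported responses and strip identifiers
responses_raw <- read.csv("ccs0003_final_exam.csv", stringsAsFactors = FALSE)

# Keep only the program label and the 100 scored items (assumed names: Item1 ... Item100),
# dropping names, student numbers, and any other personally identifiable columns
item_cols <- paste0("Item", 1:100)
responses <- responses_raw[, c("Program", item_cols)]

# Separate the IT and CS groups for later group-level analyses
responses_it <- responses[responses$Program == "IT", item_cols]
responses_cs <- responses[responses$Program == "CS", item_cols]
```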

Table 1. Attributes measured by the examination, their definitions, and supporting references

Attribute | Definition | References
Theoretical Understanding | Deep understanding of the principles and theories that form the foundation of programming. | Garcia & Revano, 2021; Hota et al., 2023; Thuné & Eckerdal, 2019
Language Proficiency | Mastery in using programming languages effectively to solve problems and create applications. | Garcia et al., 2022; Guo, 2018; Xie et al., 2019; Zhang et al., 2023
Logical Reasoning | Ability to apply coherent and rational thinking to solve problems and make decisions in programming. | Barlow-Jones & van der Westhuizen, 2017; Djurdjevic-Pahl et al., 2017
Algorithmic Thinking | Skill in designing, understanding, and implementing instructions to solve specific problems efficiently. | Angeli, 2022; Kiss & Arki, 2017; Lamagna, 2015; Tsukamoto et al., 2017
Code Tracing | Competence in following a program's execution flow and comprehending the behavior of the code. | Kumar, 2015; Russell, 2022; Stankov et al., 2023; Zhang et al., 2023

Q-Matrix

Upon obtaining a copy of the examination, domain expertise, enriched by insights from relevant literature (e.g., Xie et al., 2019), was used to identify the essential attributes that computer programming students must possess. At this point, the primary goal was to compile a comprehensive list of these attributes. Then, the examination was reviewed item-by-item to check for missing attributes or to confirm that all identified attributes were covered. As a result, specific attributes (e.g., debugging and code documentation) were not included in the study due to the absence of corresponding questions in the examination. The initial list of attributes was then presented to the faculty-in-charge who developed the examination. Following a consultation, a consensus was reached that the attributes measured by this examination include theoretical understanding, language proficiency, logical reasoning, algorithmic thinking, and code tracing (see Table 1). Subsequently, a Q-matrix was developed to illustrate the relationship between examination items and the identified attributes. Mapping the test items onto an item-by-skill table is a critical first step in CDM (Tatsuoka, 1983). The Q-matrix underwent validation by the faculty team responsible for the creation and validation of the CCS0003 examination. In cases of disagreement, conflicting viewpoints were discussed and resolved through collaborative decision-making to ensure a unified and accurate representation in the matrix. Several revisions were made based on their feedback, and the revised version served as our initial Q-matrix.
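To illustrate the structure of this item-by-skill mapping, the sketch below encodes a few q-vectors in R; the 0/1 entries shown are illustrative placeholders rather than the study's validated values.

```r
# Attributes: TU = theoretical understanding, LP = language proficiency,
# LR = logical reasoning, AT = algorithmic thinking, CT = code tracing
attributes <- c("TU", "LP", "LR", "AT", "CT")

# Illustrative q-vectors for the first three items (1 = attribute required by the item)
Q <- matrix(c(1, 1, 0, 0, 0,   # Item 1: theory and language proficiency
              0, 1, 1, 0, 1,   # Item 2: language, reasoning, and code tracing
              1, 0, 1, 1, 0),  # Item 3: theory, reasoning, and algorithmic thinking
            ncol = 5, byrow = TRUE,
            dimnames = list(paste0("Item", 1:3), attributes))
```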

Data Analysis

Following the development of the initial Q-matrix, data analysis was conducted using the R programming language, employing the GDINA package (Ma & de la Torre, 2020) as well as the tidyr, ggplot2, and fmsb packages. The data analysis was initiated with an empirical validation of the item-by-skill table. Ma and de la Torre (2020) observed that Q-matrices developed by domain experts tend to be subjective, which is why it is critical to validate them empirically to avoid erroneous attribute estimation. The results provided by the G-DINA model were consulted using the Proportion of Variance Accounted For (PVAF) with a cutoff of 0.95 (de la Torre & Chiu, 2016). Additionally, the mesa plots (refer to Figure 1) of items flagged for revision were manually checked for further analysis. Revisions were made only when they were logically consistent with the item and the skills required for its correct response. This validation process led to the finalization of the Q-matrix, with the results indicating that 86 out of 100 q-vectors were retained (e.g., Item 48; Figure 1a). Regarding the 14 items with suggested q-vector modifications: six items had one suggested change each (e.g., Item 25: from 10110 to 11110; Figure 1b), six items had two suggested changes each (e.g., Item 57: from 01001 to 01111; Figure 1c), and one item had three suggested changes (e.g., Item 54: from 10000 to 10111; Figure 1d). The final and validated Q-matrix can be found in Appendix A.
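A minimal sketch of this validation step using the GDINA package might look as follows, assuming a 0/1 item-response matrix `dat` and the initial 100 x 5 Q-matrix `Q` (both object names and the flagged item index are placeholders):

```r
library(GDINA)

# Fit the saturated G-DINA model as the basis for empirical Q-matrix validation
fit_gdina <- GDINA(dat = dat, Q = Q, model = "GDINA")

# Validate the expert-specified q-vectors with the PVAF criterion (cutoff = 0.95)
qval <- Qval(fit_gdina, method = "PVAF", eps = 0.95)
qval                   # lists items with suggested q-vector modifications

# Mesa plot of a flagged item to judge whether a suggested change is defensible
plot(qval, item = 25)
```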

Afterward, the analysis progressed by fitting the G-DINA model with monotonicity constraints imposed on the dataset. This saturated model provided a baseline for our analysis. Subsequently, various models were explored, including the DINA model, the DINO model, the ACDM, the LLM, and the RRUM. Given the diversity of cognitive processes involved, forcing a single model onto the entire test may be inappropriate; therefore, an item-level model fit analysis was conducted. This approach allowed us to consider how each model applied to individual test items rather than to the test as a whole. To select the most appropriate model at the item level, this study followed the process outlined by Ma et al. (2016). First, the Wald statistic was calculated for every reduced model for each item. de la Torre and Lee (2013) recommended the use of the Wald test as an objective means of determining the most appropriate models. In this approach, the null hypothesis posits that the reduced model fits the item as well as the saturated model. If the null hypothesis is rejected (p < .05), the reduced model is dismissed. If more than one reduced model is retained and the DINA or DINO model is among them, the one with the largest p-value is selected. The outcome of this analysis was a mixed model (subsequently referred to as MIXED), which combined different models at the item level. In addition to these models, we incorporated higher-order G-DINA models with Rasch, 1PL, and 2PL joint attribute distributions into our analysis. Several studies have demonstrated the potential of using higher-order models in examining the skill profiles of students (e.g., Zhang et al., 2022).
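A condensed sketch of this step with the GDINA package is shown below, again assuming the `dat` and `Q` objects from the previous sketch; the item indices and model assignments in the mixed-model vector are illustrative, not the study's actual Wald-test results.

```r
library(GDINA)

# Saturated baseline with monotonicity constraints
fit_gdina <- GDINA(dat, Q, model = "GDINA", mono.constraint = TRUE)

# Item-level Wald tests comparing each reduced model (DINA, DINO, ACDM, LLM, RRUM)
# against the saturated G-DINA model
wald <- modelcomp(fit_gdina)
wald   # inspect Wald statistics and p-values per item

# Build an item-level model vector: keep G-DINA by default and substitute the
# reduced model retained by the Wald test for selected items (indices illustrative)
item_models <- rep("GDINA", ncol(dat))
item_models[c(3, 7)] <- c("DINA", "ACDM")
fit_mixed <- GDINA(dat, Q, model = item_models, mono.constraint = TRUE)

# Single reduced and higher-order variants for the relative fit comparison
fit_dina  <- GDINA(dat, Q, model = "DINA", mono.constraint = TRUE)
fit_ho2pl <- GDINA(dat, Q, model = "GDINA", att.dist = "higher.order",
                   higher.order = list(model = "2PL"), mono.constraint = TRUE)
```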

All these models were included in the relative fit analysis, where the performance of the saturated, reduced, mixed, and higher-order models was compared using the anova() function of the GDINA package. This comparative analysis was pivotal in selecting the most appropriate model. Models that were not rejected during this analysis were further examined, and the one with the lowest Akaike Information Criterion (AIC; Akaike, 1974) and Bayesian Information Criterion (BIC; Schwarz, 1978) was selected as the most suitable model. Both AIC and BIC are relative fit indices used for selecting between non-nested models, with lower values indicating a better fit. These indices balance model complexity and goodness of fit, helping avoid overfitting while ensuring accuracy.
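Under the same assumptions as the previous sketch, the relative fit comparison might be expressed as follows, with the fitted-model object names carried over from that sketch:

```r
# Compare the saturated, reduced, mixed, and higher-order fits; the output reports
# log-likelihood, number of parameters, AIC, and BIC for each fitted model
anova(fit_gdina, fit_dina, fit_mixed, fit_ho2pl)

# Among the models that are not rejected, prefer the one with the lowest AIC and BIC
AIC(fit_mixed)
BIC(fit_mixed)
```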