Personalized and Constructive Feedback for Computer Science Students Using the Large Language Model (LLM)
Khan, Yaqoob, Tasadduq et al.
The evolving pedagogy paradigms are leading toward educational transformations. One fundamental aspect of effective learning is relevant, immediate, and constructive feedback to students. Providing constructive feedback to large cohorts in academia is an ongoing challenge. Therefore, academics are moving towards automated assessment to provide immediate feedback. However, current approaches are often limited in scope, offering simplistic responses that do not provide students with personalized feedback to guide them toward improvements. This paper addresses this limitation by investigating the performance of Large Language Models (LLMs) in processing students assessments with predefined rubrics and marking criteria to generate personalized feedback for in-depth learning. We aim to leverage the power of existing LLMs for Marking Assessments, Tracking, and Evaluation (LLM-MATE) with personalized feedback to enhance students learning. To evaluate the performance of LLM-MATE, we consider the Software Architecture (SA) module as a case study. The LLM-MATE approach can help module leaders overcome assessment challenges with large cohorts. Also, it helps students improve their learning by obtaining personalized feedback in a timely manner. Additionally, the proposed approach will facilitate the establishment of ground truth for automating the generation of students assessment feedback using the ChatGPT API, thereby reducing the overhead associated with large cohort assessments.
academic
Personalized and Constructive Feedback for Computer Science Students Using the Large Language Model (LLM)
The evolution of educational paradigms is driving transformative change in education. A fundamental aspect of effective learning is providing students with relevant, timely, and constructive feedback. Delivering constructive feedback to large student populations remains an ongoing challenge for academia. Consequently, scholars are turning to automated assessment to provide immediate feedback. However, current approaches often have limited scope and provide simplistic responses that cannot offer personalized feedback to guide student improvement. This paper addresses this limitation by investigating the performance of Large Language Models (LLMs) in processing student assessments using predefined rubrics and generating personalized feedback. The authors aim to leverage the power of existing LLMs for assessment marking, tracking, and evaluation (LLM-MATE), enhancing student learning through personalized feedback.
Scalable Feedback Challenge: Difficulty in providing timely, personalized, and constructive feedback to large student populations
Limitations of Traditional Automated Assessment: Existing automated assessment methods have limited scope and provide only simplistic responses, lacking personalized guidance
Teacher Workload: Manual assessment of numerous student assignments is time-consuming and labor-intensive, making it difficult to ensure feedback quality and consistency
Leveraging the powerful text comprehension and generation capabilities of Large Language Models, combined with predefined rubrics, to provide personalized and constructive feedback for multimodal assessments (text, images, programming) of computer science students.
Proposed LLM-MATE Framework: A Large Language Model-based marking, tracking, and evaluation system capable of handling multimodal student assessments
Zero-Shot Prompt Engineering Method: Developed specialized ChatGPT prompting strategies for student assessment that generate high-quality feedback without requiring training data
Multimodal Assessment Capability: Validated the effectiveness of LLMs in processing software architecture assessments containing text and diagrams
Teacher Validation Study: Demonstrated the reliability of AI-generated feedback through comparative validation with human experts
Practical Application Value: Provided a feasible solution for automated assessment in large-scale courses
Technical Feasibility: ChatGPT can effectively process multimodal assessments of computer science students and generate high-quality personalized feedback
Educational Value: AI-generated feedback is more detailed and constructive than traditional human feedback, facilitating student learning improvement
Practicality: The LLM-MATE approach can help address assessment challenges in large-scale courses and improve teaching efficiency
Consistency: AI assessment provides more consistent evaluation standards compared to multiple human assessors
This paper cites 38 relevant references, primarily including:
Core References:
González-Calatayud et al. (2021) - Survey of AI student assessment systems
Maier & Klotz (2022) - Personalized feedback in digital learning environments
Biswas & Bhattacharya (2024) - ML-based intelligent real-time feedback system
Liu et al. (2023) - Systematic review of prompt engineering methods
Technical Support References:
White et al. (2024) - ChatGPT prompting patterns
Wei et al. (2022) - Chain-of-thought prompting method
Chen et al. (2023) - LLM applications in software engineering
Overall Assessment: This is a research paper with practical application value. Although it has certain limitations in technical innovation and experimental scale, it provides valuable exploration and practical experience for the educational technology field. The research methodology is sound, results are credible, and it has positive significance for promoting AI applications in educational assessment.