Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 19 |
Since 2016 (last 10 years) | 44 |
Since 2006 (last 20 years) | 119 |
Descriptor
Error Patterns | 162 |
Evaluation Methods | 162 |
Student Evaluation | 31 |
Simulation | 26 |
Computation | 25 |
Models | 24 |
Comparative Analysis | 22 |
Statistical Analysis | 22 |
Foreign Countries | 20 |
Correlation | 17 |
Teaching Methods | 15 |
More ▼ |
Source
Author
Guo, Jiin-Huarng | 2 |
Kim, Eun Sook | 2 |
Luh, Wei-Ming | 2 |
Savalei, Victoria | 2 |
Yoon, Myeongsun | 2 |
Abigail Miller | 1 |
Abraham, W. Todd | 1 |
Alhaisoni, Eid | 1 |
Allen, Gove | 1 |
Alzuoud, Khalid | 1 |
Amini, Mojtaba | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 26 |
Postsecondary Education | 16 |
Secondary Education | 6 |
Elementary Education | 5 |
High Schools | 4 |
Elementary Secondary Education | 2 |
Kindergarten | 2 |
Grade 1 | 1 |
Grade 3 | 1 |
Grade 6 | 1 |
Audience
Practitioners | 12 |
Teachers | 8 |
Researchers | 3 |
Location
China | 4 |
Taiwan | 2 |
Turkey | 2 |
Arizona | 1 |
California | 1 |
Canada | 1 |
Colombia | 1 |
Europe | 1 |
Finland | 1 |
Georgia | 1 |
Germany | 1 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Lauren A. Mason; Abigail Miller; Gregory Hughes; Holly A. Taylor – Cognitive Research: Principles and Implications, 2025
False alarming, or detecting an error when there is not one, is a pervasive problem across numerous industries. The present study investigated the role of elaboration, or additional information about non-error differences in complex visual displays, for mitigating false error responding. In Experiment 1, learners studied errors and non-error…
Descriptors: Error Correction, Error Patterns, Evaluation Methods, Visual Aids
Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022
Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…
Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction
Konstantinou, Ioannis Ch. – Open Journal for Educational Research, 2022
The purpose of this article is to review the literature on the issue of grading as a method and technique of expressing students' performance in terms of school reality. Initially, a growing concern about the role of assessment of student's performance in the learning and, generally, in the educational process, is highlighted. Subsequently, the…
Descriptors: Grading, Student Evaluation, Evaluation Methods, Performance Based Assessment
Suto, Irenka; Williamson, Joanna; Ireland, Jo; Macinska, Sylwia – Research Papers in Education, 2023
Errors that occasionally manifest in examination papers and other educational assessment instruments can threaten reliability and validity. For example, a multiple choice question could have two correct response options, or a geography question containing an inaccurate map could be unanswerable. In this paper we explore this oft-neglected element…
Descriptors: Error Patterns, International Assessment, Test Construction, Failure
Yue Zhang; Max Stephens; Xiaomei Liu – Asia-Pacific Journal of Teacher Education, 2024
The study aimed to establish an assessment model for mathematics teachers' knowledge of students' misconceptions in the "Space and Shape" domain, develop the testing tool, investigate and analyse the overall and differences in performance, and propose suggestions for improvement. The assessment model included content knowledge and…
Descriptors: Foreign Countries, Elementary School Teachers, Mathematics Teachers, Knowledge Level
Schneider, Johannes; Richner, Robin; Riser, Micha – International Journal of Artificial Intelligence in Education, 2023
Autograding short textual answers has become much more feasible due to the rise of NLP and the increased availability of question-answer pairs brought about by a shift to online education. Autograding performance is still inferior to human grading. The statistical and black-box nature of state-of-the-art machine learning models makes them…
Descriptors: Grading, Natural Language Processing, Computer Assisted Testing, Ethics
Li, Liang-Yi; Huang, Wen-Lung – Educational Technology & Society, 2023
With the increasing bandwidth, videos have been gradually used as submissions for online peer assessment activities. However, their transient nature imposes a high cognitive load on students, particularly lowability students. Therefore, reviewers' ability is a key factor that may affect the reviewing process and performance in an online video peer…
Descriptors: Peer Evaluation, Undergraduate Students, Video Technology, Evaluation Methods
Karakaya, Ferhat; Yilmaz, Mehmet – Journal of Pedagogical Research, 2022
There have been significant advances in science and technology in recent years. Therefore, all countries need qualified people who can take on the challenges of life today and compete in the international arena. This has led countries to adopt new approaches to education. STEM education is one of the latest examples of those approaches. This study…
Descriptors: Science Teachers, Teacher Attitudes, Evaluation Methods, STEM Education
Stouffer, Joe – Reading Teacher, 2021
Responding to recent challenges to Clay's Running Records (2019) and their analysis using a three-cueing system, the author examines this reading assessment from an additive perspective of both bottom-up and top-down orientations of reading instruction. Endorsing their inclusion among classroom reading assessments, the author navigates the tension…
Descriptors: Reading Instruction, Evaluation Methods, Student Evaluation, Reading Fluency
Piotr Jabkowski – International Journal of Social Research Methodology, 2023
Social research methodologists have postulated that the transparency of survey procedures and data processing is mandatory for assessing the Total Survey Error. Recent analyses of data from cross-national surveys have demonstrated an increase in the quality of documentation reports over time and significant differences in documentation quality…
Descriptors: Social Science Research, Cross Cultural Studies, Documentation, Error Patterns
Barone, Jennifer; Khairallah, Pamela; Gabriel, Rachael – Reading Teacher, 2020
Running records can be the assessments that teachers are looking for when searching for an efficient way to plan meaningful literacy instruction. Running records can give teachers immediate insights to guide on-the-fly prompting and teaching decisions to build reader independence. The authors use classroom examples to illustrate how taking and…
Descriptors: Literacy Education, Reading Instruction, Progress Monitoring, Error Patterns
Payadnya, I. Putu Ade Andre; Suwija, I. Ketut; Wibawa, Kadek Adi – Mathematics Teaching Research Journal, 2021
The research aimed to analyze the students' abilities in solving realistic mathematics problems using "What-If"-Ethnomathematics Instruments with content focused on plane and space materials. The "What-If"-Ethnomathematics instruments are instruments that enable educators to analyze various errors and obstacles experienced by…
Descriptors: Mathematics Skills, Problem Solving, Thinking Skills, Learning Strategies
Chan, Sathena; May, Lyn – Language Testing, 2023
Despite the increased use of integrated tasks in high-stakes academic writing assessment, research on rating criteria which reflect the unique construct of integrated summary writing skills is comparatively rare. Using a mixed-method approach of expert judgement, text analysis, and statistical analysis, this study examines writing features that…
Descriptors: Scoring, Writing Evaluation, Reading Tests, Listening Skills
Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019
Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…
Descriptors: Rating Scales, Models, Evaluators, Data Collection
Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019
Summary Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…
Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques