Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 25 |
Since 2016 (last 10 years) | 75 |
Since 2006 (last 20 years) | 175 |
Descriptor
Scores | 249 |
Student Evaluation | 249 |
Test Validity | 139 |
Test Reliability | 69 |
Validity | 69 |
Evaluation Methods | 67 |
Academic Achievement | 54 |
Foreign Countries | 46 |
Correlation | 45 |
Test Construction | 41 |
Predictive Validity | 38 |
More ▼ |
Source
Author
Tindal, Gerald | 3 |
Anderson, Paul D. | 2 |
Boyer, Michelle | 2 |
Deno, Stanley L. | 2 |
Erford, Bradley T. | 2 |
Goldhaber, Dan | 2 |
Goldschmidt, Pete | 2 |
Haladyna, Thomas M. | 2 |
Heritage, Margaret | 2 |
Herman, Joan L. | 2 |
Lembke, Erica S. | 2 |
More ▼ |
Publication Type
Education Level
Location
Florida | 10 |
United States | 6 |
New York | 5 |
Australia | 4 |
China | 4 |
Illinois | 4 |
Massachusetts | 4 |
United Kingdom (England) | 4 |
Canada | 3 |
Germany | 3 |
Turkey | 3 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 3 |
No Child Left Behind Act 2001 | 2 |
Americans with Disabilities… | 1 |
Education for All Handicapped… | 1 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Lucy Chambers; Emma Walland; Jo Ireland – Research Matters, 2024
Comparative Judgement (CJ) is traditionally and primarily used to compare written texts. In this study we explored whether we could extend its use to comparing audio files. We used GCSE Music portfolios which contained a mix of audio recordings, musical scores and text documents. Fifteen judges completed two exercises: one comparing musical…
Descriptors: Evaluative Thinking, Judges, Comparative Analysis, Reliability
Gill, Tim – Research Matters, 2023
Secondary Checkpoint assessments are taken by students at the end of the Cambridge Lower Secondary programme (aged 14) in countries around the world. Many students continue with Cambridge after this and take IGCSE exams two years later. Given that there is a high level of coherence between the curricula in the two stages, performance in Secondary…
Descriptors: Student Evaluation, Secondary School Students, Achievement Tests, Predictive Validity
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Carney, Michele; Paulding, Katie; Champion, Joe – Applied Measurement in Education, 2022
Teachers need ways to efficiently assess students' cognitive understanding. One promising approach involves easily adapted and administered item types that yield quantitative scores that can be interpreted in terms of whether or not students likely possess key understandings. This study illustrates an approach to analyzing response process…
Descriptors: Middle School Students, Logical Thinking, Mathematical Logic, Problem Solving
Roduta Roberts, Mary; Gotch, Chad M.; Cook, Megan; Werther, Karin; Chao, Iris C. I. – Measurement: Interdisciplinary Research and Perspectives, 2022
Performance-based assessment is a common approach to assess the development and acquisition of practice competencies among health professions students. Judgments related to the quality of performance are typically operationalized as ratings against success criteria specified within a rubric. The extent to which the rubric is understood,…
Descriptors: Protocol Analysis, Scoring Rubrics, Interviews, Performance Based Assessment
Dragoset, Lisa; Baxter, Cassandra; Dotter, Dallas; Walsh, Elias – Regional Educational Laboratory Mid-Atlantic, 2019
The purpose of this report is to investigate the feasibility of constructing a school-level measure of students' academic growth from kindergarten to grade 3, and to assess the validity and precision of that measure. The study measured schoolwide student growth for reading and math using student growth percentiles based on Maryland's 2014/15…
Descriptors: Elementary School Students, Academic Achievement, Primary Education, Student Evaluation
Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022
Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…
Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation
Bowen-Mendoza, Lorena; Pinargote-Ortega, Maricela; Meza, Jaime; Ventura, Sebastián – Journal of Computing in Higher Education, 2022
Peer evaluation consists of the evaluation of students by their peers following criteria or rubrics provided by the teacher, where the way to evaluate students is specified so that they achieve the desired competencies. The quality of the measurement instrument must meet two essential criteria: validity and reliability. In this research, we…
Descriptors: Peer Evaluation, Student Evaluation, Scoring Rubrics, Information Technology
Dawn R. Coleman – ProQuest LLC, 2022
Community colleges are under ongoing pressure to increase the success of students placed into developmental courses; while many reforms focus on the length of course sequences, there is also attention on how students are assessed and placed. Studies of these assessment and placement (A&P) methods focus primarily on predictive validity; one…
Descriptors: Teacher Attitudes, Student Evaluation, Student Placement, Placement Tests
Dadey, Nathan; Keng, Leslie; Boyer, Michelle; Marion, Scott – National Center for the Improvement of Educational Assessment, 2021
State summative educational assessment is about to begin in earnest. Rightfully, many are raising questions about the quality, meaning, and appropriate use of the assessment results. This document was written to support state educational agencies (SEAs) and their assessment providers in devising effective and efficient analysis plans. This…
Descriptors: Educational Assessment, Summative Evaluation, Student Evaluation, Test Use
Jay Schyler Raadt – ProQuest LLC, 2020
In response to concerns about using only standardized multiple-choice assessments, some school districts have moved to using alternative ratings of student achievement with authentic assessments. However, such assessments are often limited in terms of the psychometric validity data supporting their use. The present study mixed quantitative and…
Descriptors: Performance Based Assessment, Middle School Students, Scoring Rubrics, Content Validity
Jessica B. Koslouski; Kristabel Stark; Sandra M. Chafouleas; T. Chris Riley-Tillman – Grantee Submission, 2023
Social, emotional, and behavioral (SEB) instruments are currently used in schools to screen, refer, and progress monitor students. Although many of these instruments have demonstrated strong technical adequacy, there has been far less examination of their consequential validity--that is, positive or negative intended and unintended consequences of…
Descriptors: Behavior Rating Scales, Screening Tests, Test Validity, Scores
Kocakulah, Aysel – Participatory Educational Research, 2022
The aim of this study is to develop and apply a rubric to evaluate the solutions proposed for questions about electromagnetic induction belonging to university second year pre-service teachers. In this study which has pretest-posttest quasi-experimental design with control group, teaching of the topic of electromagnetic induction was applied to…
Descriptors: Scoring Rubrics, Student Evaluation, Undergraduate Students, Problem Solving
Yang, Yan; Cox, Cody; Cho, YoonJung – Journal of Psychoeducational Assessment, 2020
Despite the critical role of emotions in multicultural teacher education, no attempt has been made to develop an instrument including affect as a dimension in measuring cultural competence for preservice teachers. To bridge this gap, the present three-study research used three distinct samples of 456 preservice teachers to develop and estimate the…
Descriptors: Cultural Awareness, Measures (Individuals), Student Attitudes, Preservice Teachers
Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Journal of Educational Measurement, 2021
Eye-tracking technology can create a record of the location and duration of visual fixations as a test-taker reads test questions. Although the cognitive process the test-taker is using cannot be directly observed, eye-tracking data can support inferences about these unobserved cognitive processes. This type of information has the potential to…
Descriptors: Eye Movements, Test Validity, Multiple Choice Tests, Cognitive Processes