Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 28 |
Descriptor
Achievement Tests | 62 |
Evaluation Methods | 62 |
Test Reliability | 48 |
Test Validity | 31 |
Student Evaluation | 17 |
Academic Achievement | 14 |
Elementary Secondary Education | 14 |
Test Construction | 14 |
Foreign Countries | 13 |
Reliability | 12 |
Scores | 12 |
More ▼ |
Source
Author
Thomson, Peter | 2 |
Aiken, Lewis R. | 1 |
Alexander, Patricia A. | 1 |
Anderson, Lorin W. | 1 |
Antia, Shirin D. | 1 |
Barniol, Pablo | 1 |
Berebbi, Shir | 1 |
Berliner, David C. | 1 |
Bezruczko, Nikolaus | 1 |
Bommer, William | 1 |
Buser, Karen | 1 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 2 |
Researchers | 2 |
Teachers | 2 |
Location
Florida | 3 |
Australia | 2 |
California | 1 |
Chile | 1 |
Delaware | 1 |
Europe | 1 |
Illinois | 1 |
Indonesia | 1 |
Israel | 1 |
Mexico | 1 |
Michigan | 1 |
More ▼ |
Laws, Policies, & Programs
Every Student Succeeds Act… | 1 |
No Child Left Behind Act 2001 | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Juan M. Sanchez – Journal of Biological Education, 2024
Bias assessment (systematic errors) is fundamental in industry and service laboratories, where reliable results must be obtained to give correct answers to specific problems. Therefore, knowledge and practice in quality methodologies is of fundamental importance for students. Unfortunately, laboratory lessons often focus on connecting theory and…
Descriptors: Achievement Tests, Science Laboratories, Biology, Science Education
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Marilena Z. Leana-Tascilar – Cogent Education, 2024
This study aimed to develop a comprehensive tool to assess underachievement in gifted students, incorporating input from parents, teachers, and students themselves. A total of 285 participants, including 95 gifted students, their parents, and teachers, were involved in the study. The results have revealed a four-factor structure for the Gifted…
Descriptors: Psychometrics, Academic Achievement, Underachievement, Academically Gifted
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
Toker, Turker – International Journal of Curriculum and Instruction, 2023
Achievement tests are among the most widely used data collection tools to measure the knowledge and skill levels of individuals. For this reason, the existence of valid and reliable achievement tests that can perfectly reveal the competencies that a person should have in any discipline is of great importance. The purpose of this research is to…
Descriptors: Basic Skills, Evaluation Methods, Test Items, Test Validity
Gliksman, Yarden; Berebbi, Shir; Hershman, Ronen; Henik, Avishai – Applied Cognitive Psychology, 2022
Math fluency (MF) is the ability to quickly and accurately solve simple math exercises. Proficiency in MF is one of the buildings of arithmetic achievement during school. However, so far only paper and pencil tests have been used to assess MF. In the current study, we present the BGU-MF (Ben-Gurion University Math Fluency) test, a new computerized…
Descriptors: Foreign Countries, Mathematics Skills, Mathematics Tests, Computer Assisted Testing
Gulsah Gurkan – ProQuest LLC, 2021
Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…
Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries
Valeria Cavioni; Luisa Broli; Ilaria Grazzani – International Journal of Emotional Education, 2024
The importance of enhancing social and emotional skills in educational settings has gained prominence, with many countries and organizations embracing the Social and Emotional Learning (SEL) framework to equip individuals with the tools needed for shaping a self-identity, emotional regulation, goal achievement, empathy, nurturing relationships,…
Descriptors: Social Emotional Learning, Guidelines, Educational Policy, Cross Cultural Studies
Wakabayashi, Tomoko; Claxton, Jill; Smith, Everett V., Jr. – Journal of Psychoeducational Assessment, 2019
The Child Observation Record (COR), initially developed in 1993 by HighScope Educational Research Foundation, is an observation-based instrument that provides systematic assessment of young children's knowledge and abilities in all major areas of development. Teachers or caregivers spend a few minutes each day writing brief notes or…
Descriptors: Observation, Evaluation Methods, Early Childhood Education, Kindergarten
Berliner, David C. – Education Policy Analysis Archives, 2018
The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…
Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests
Warlop, Daniel M. – Curriculum and Teaching Dialogue, 2016
This chapter is a research summary of the author's doctoral dissertation completed in May, 2015, which investigates the way Standardized Assessment (SA) is used in state educational accountability structures. This quasi-experimental quantitative study found that SA scores trend towards consistency over time, and that there is additional variance,…
Descriptors: Accountability, Educational Assessment, Student Evaluation, Public Education
Sanders, Sara – National Technical Assistance Center for the Education of Neglected or Delinquent Children and Youth (NDTAC), 2019
This guide is designed to assist States, agencies, and/or facilities who work with youth who are neglected, delinquent, or at-risk (N or D). The information in the guide will benefit those who are (a) interested in implementing pre-posttests, (b) in the process of identifying an appropriate pre-posttest, or (c) ready to evaluate current testing…
Descriptors: At Risk Students, Delinquency, Pretests Posttests, Testing
Lee, Young-Sun; Lembke, Erica – ZDM: The International Journal on Mathematics Education, 2016
The present study examined the technical adequacy of curriculum-based measurement (CBM) measure of early numeracy for kindergarten through third grade students. Our CBM measures were developed to reflect broad and theoretically derived categories of mathematical thinking: quick retrieval, written computation, and number sense. The mastery of these…
Descriptors: Curriculum Based Assessment, Evaluation Methods, Numeracy, Kindergarten
Pokropek, Artur – Sociological Methods & Research, 2015
This article combines statistical and applied research perspective showing problems that might arise when measurement error in multilevel compositional effects analysis is ignored. This article focuses on data where independent variables are constructed measures. Simulation studies are conducted evaluating methods that could overcome the…
Descriptors: Error of Measurement, Hierarchical Linear Modeling, Simulation, Evaluation Methods
Alexander, Patricia A.; Dumas, Denis; Grossnickle, Emily M.; List, Alexandra; Firetto, Carla M. – Journal of Experimental Education, 2016
Relational reasoning is the foundational cognitive ability to discern meaningful patterns within an informational stream, but its reliable and valid measurement remains problematic. In this investigation, the measurement of relational reasoning unfolded in three stages. Stage 1 entailed the establishment of a research-based conceptualization of…
Descriptors: Cognitive Ability, Logical Thinking, Thinking Skills, Cognitive Processes