Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 217 |
Descriptor
Educational Testing | 606 |
Evaluation Methods | 606 |
Student Evaluation | 270 |
Educational Assessment | 229 |
Elementary Secondary Education | 153 |
Academic Achievement | 132 |
Program Evaluation | 131 |
Achievement Tests | 112 |
Accountability | 108 |
Educational Policy | 103 |
Disabilities | 98 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 148 |
Elementary Education | 73 |
Secondary Education | 65 |
High Schools | 61 |
Grade 4 | 56 |
Grade 8 | 55 |
Higher Education | 36 |
Grade 10 | 29 |
Postsecondary Education | 27 |
Grade 11 | 21 |
Adult Education | 8 |
More ▼ |
Audience
Practitioners | 40 |
Teachers | 20 |
Researchers | 8 |
Administrators | 7 |
Policymakers | 7 |
Students | 2 |
Counselors | 1 |
Media Staff | 1 |
Location
United Kingdom | 18 |
Canada | 13 |
Florida | 10 |
United Kingdom (England) | 9 |
California | 8 |
Kentucky | 8 |
Australia | 7 |
United States | 7 |
United Kingdom (Wales) | 6 |
New York | 5 |
Virginia | 5 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards with or without Reservations | 1 |
Sinharay, Sandip – Journal of Educational Measurement, 2023
Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…
Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation
Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
Atkinson, Cathy; Barrow, Joanna; Norris, Sarah – Educational Psychology in Practice, 2022
Assessment is one of the five functions of the educational psychologist's (EP's) role, yet there is a dearth of research exploring its distinctive contribution to school-based practice, and a lack of definition about what it is. In this study, the assessment practices of EPs were compared with those of other educational professionals who had…
Descriptors: Educational Psychology, School Psychologists, Evaluation Methods, Educational Testing
Sainan Xu; Jing Lu; Jiwei Zhang; Chun Wang; Gongjun Xu – Grantee Submission, 2024
With the growing attention on large-scale educational testing and assessment, the ability to process substantial volumes of response data becomes crucial. Current estimation methods within item response theory (IRT), despite their high precision, often pose considerable computational burdens with large-scale data, leading to reduced computational…
Descriptors: Educational Assessment, Bayesian Statistics, Statistical Inference, Item Response Theory
Cheng, Yi-Ling – ProQuest LLC, 2016
The present study explored the dimensionality of cognitive structure from two approaches. The first approach used a famous relation between Visual Spatial Working Memory (VSWM) and calculation to demonstrate the multidimensional item response analyses when true dimensions are unknown. The second approach explored the detectability of dimensions by…
Descriptors: Cognitive Structures, Scores, Correlation, Spatial Ability
Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015
The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…
Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
White, John – London Review of Education, 2013
It is time to replace the examination regime at 16 and 18 by something more appropriate. The coalition government has been solidifying its place by its Baccalaureate reforms at both ages, but this is a move in quite the wrong direction. Whatever the wider purposes that the examination system may serve, its core aim is to find out how well students…
Descriptors: Student Evaluation, Evaluation Methods, Educational Testing, Testing Programs
Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013
In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…
Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics
Measuring the Continuum of Literacy Skills among Adults: Educational Testing and the LAMP Experience
Guadalupe, Cesar; Cardoso, Manuel – International Review of Education, 2011
The field of educational testing has become increasingly important for providing different stakeholders and decision-makers with information. This paper discusses basic standards for methodological approaches used in measuring literacy skills among adults. The authors address the increasing interest in skills measurement, the discourses on how…
Descriptors: Adult Literacy, Educational Testing, Testing Programs, Standards
American Educational Research Association (AERA), 2014
Developed jointly by the American Educational Research Association, American Psychological Association, and the National Council on Measurement in Education, "Standards for Educational and Psychological Testing" (Revised 2014) addresses professional and technical issues of test development and use in education, psychology, and…
Descriptors: Standards, Educational Testing, Psychological Testing, Test Construction
Noorbehbahani, F.; Kardan, A. A. – Computers & Education, 2011
e-Learning plays an undoubtedly important role in today's education and assessment is one of the most essential parts of any instruction-based learning process. Assessment is a common way to evaluate a student's knowledge regarding the concepts related to learning objectives. In this paper, a new method for assessing the free text answers of…
Descriptors: Evaluation Methods, Educational Assessment, Student Evaluation, Scoring
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011
The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…
Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods