Publication Date
In 2025 | 2 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 30 |
Since 2016 (last 10 years) | 84 |
Since 2006 (last 20 years) | 521 |
Descriptor
Educational Testing | 1263 |
Student Evaluation | 302 |
Elementary Secondary Education | 285 |
Foreign Countries | 225 |
Educational Assessment | 205 |
Evaluation Methods | 203 |
Academic Achievement | 197 |
Test Construction | 155 |
Test Use | 151 |
Standardized Tests | 146 |
Scores | 140 |
More ▼ |
Source
Author
Popham, W. James | 13 |
Sinharay, Sandip | 10 |
Koretz, Daniel | 7 |
Newton, Paul E. | 7 |
Plake, Barbara S. | 7 |
Sireci, Stephen G. | 7 |
Haberman, Shelby J. | 6 |
Phelps, Richard P. | 6 |
Baker, Eva L. | 5 |
Camara, Wayne J. | 5 |
Mislevy, Robert J. | 5 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 78 |
Teachers | 38 |
Administrators | 31 |
Researchers | 21 |
Policymakers | 15 |
Parents | 3 |
Media Staff | 1 |
Students | 1 |
Location
United Kingdom | 42 |
United Kingdom (England) | 31 |
California | 30 |
Canada | 29 |
United States | 25 |
Australia | 18 |
United Kingdom (Wales) | 12 |
China | 11 |
Florida | 11 |
United Kingdom (Great Britain) | 11 |
Netherlands | 10 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Yang Zhen; Xiaoyan Zhu – Educational and Psychological Measurement, 2024
The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep…
Descriptors: Artificial Intelligence, Models, Cheating, Identification
Russell, Michael – Educational Measurement: Issues and Practice, 2022
Despite agreement about the central importance of validity for educational and psychological testing, consensus regarding the definition of validity remains elusive. Differences in the definition of validity are examined and reveals that a potential cause of disagreement stems from differences in word use and meanings given to key terms commonly…
Descriptors: Test Validity, Psychological Testing, Educational Testing, Vocabulary
Li, Dongmei – Journal of Educational Measurement, 2022
Equating error is usually small relative to the magnitude of measurement error, but it could be one of the major sources of error contributing to mean scores of large groups in educational measurement, such as the year-to-year state mean score fluctuations. Though testing programs may routinely calculate the standard error of equating (SEE), the…
Descriptors: Error Patterns, Educational Testing, Group Testing, Statistical Analysis
Hilarius Jago Duda; Didin Syafruddin; Lusila Parida – Anatolian Journal of Education, 2023
The problem of this research is how to use assessment for school learning and what are students' creative thinking skills? Research objectives: First, to uncover, analyze, identify, describe the learning assessment used by teachers and students of Nusantara Indah Sintang Senior High School. Second, to express, analyze and describe students'…
Descriptors: Educational Testing, Creative Thinking, High School Students, Student Attitudes
Hogan, Thomas; DeStefano, Marissa; Gilby, Caitlin; Kosman, Dana; Peri, Joshua – Applied Measurement in Education, 2021
Buros' "Mental Measurements Yearbook (MMY)" has provided professional reviews of commercially published psychological and educational tests for over 80 years. It serves as a kind of conscience for the testing industry. For a random sample of 50 entries in the "19th MMY" (a total of 100 separate reviews) this study determined…
Descriptors: Test Reviews, Interrater Reliability, Psychological Testing, Educational Testing
Suthathip Thirakunkovit – Language Testing in Asia, 2025
Establishing a cut score is a crucial aspect of the test development process since the selected cut score has the potential to impact students' performance outcomes and shape instructional strategies within the classroom. Therefore, it is vital for those involved in test development to set a cut score that is both fair and justifiable. This cut…
Descriptors: Cutting Scores, Culture Fair Tests, Language Tests, Test Construction
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2021
Technical difficulties occasionally lead to missing item scores and hence to incomplete data on computerized tests. It is not straightforward to report scores to the examinees whose data are incomplete due to technical difficulties. Such reporting essentially involves imputation of missing scores. In this paper, a simulation study based on data…
Descriptors: Data Analysis, Scores, Educational Assessment, Educational Testing
van Groen, Maaike M.; Eggen, Theo J. H. M. – Journal of Applied Testing Technology, 2020
When developing a digital test, one of the first decisions that need to be made is which type of Computer-Based Test (CBT) to develop. Six different CBT types are considered here: linear tests, automatically generated tests, computerized adaptive tests, adaptive learning environments, educational simulations, and educational games. The selection…
Descriptors: Computer Assisted Testing, Formative Evaluation, Summative Evaluation, Adaptive Testing
Sinharay, Sandip – Journal of Educational Measurement, 2023
Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…
Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation
Suto, Irenka; Ireland, Jo – International Journal of Assessment Tools in Education, 2021
Errors in examination papers and other assessment instruments can compromise fairness. For example, a history question containing an incorrect historical date could be impossible for students to answer. Incorrect instructions at the start of an examination could lead students to answer the wrong number of questions. As there is little research on…
Descriptors: Testing Problems, Educational Testing, Test Construction, Work Environment
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…
Descriptors: Construct Validity, Test Theory, Test Use, Affordances
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023
Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…
Descriptors: Scores, Test Items, Accuracy, Psychometrics
Tavares, Walter; Kuper, Ayelet; Kulasegaram, Kulamakan; Whitehead, Cynthia – Advances in Health Sciences Education, 2020
The array of different philosophical positions underlying contemporary views on competence, assessment strategies and justification have led to advances in assessment science. Challenges may arise when these philosophical positions are not considered in assessment design. These can include (a) a logical incompatibility leading to varied or…
Descriptors: Performance Based Assessment, Educational Testing, Test Interpretation, Test Results
Hong, Seong Eun; Monroe, Scott; Falk, Carl F. – Journal of Educational Measurement, 2020
In educational and psychological measurement, a person-fit statistic (PFS) is designed to identify aberrant response patterns. For parametric PFSs, valid inference depends on several assumptions, one of which is that the item response theory (IRT) model is correctly specified. Previous studies have used empirical data sets to explore the effects…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Error of Measurement