Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 13 |
Descriptor
Criterion Referenced Tests | 165 |
Educational Testing | 165 |
Norm Referenced Tests | 63 |
Test Construction | 55 |
Achievement Tests | 54 |
Elementary Secondary Education | 51 |
Testing Programs | 41 |
Educational Assessment | 40 |
Standardized Tests | 39 |
Evaluation Methods | 37 |
Testing Problems | 36 |
More ▼ |
Source
Author
Ebel, Robert L. | 4 |
Popham, W. James | 3 |
Berk, Ronald A. | 2 |
Bielinski, John | 2 |
Coffman, William E. | 2 |
Haberman, Shelby J. | 2 |
Hambleton, Ronald K. | 2 |
Minnema, Jane | 2 |
Oosterhof, Albert C. | 2 |
Rose, Janet S. | 2 |
Thurlow, Martha | 2 |
More ▼ |
Publication Type
Education Level
Elementary Education | 3 |
Elementary Secondary Education | 3 |
Secondary Education | 2 |
Grade 10 | 1 |
Grade 3 | 1 |
Grade 6 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Higher Education | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
More ▼ |
Audience
Practitioners | 11 |
Researchers | 4 |
Parents | 3 |
Teachers | 3 |
Students | 2 |
Administrators | 1 |
Community | 1 |
Counselors | 1 |
Policymakers | 1 |
Support Staff | 1 |
Location
Canada | 3 |
Wisconsin | 3 |
Australia | 2 |
California | 2 |
Georgia | 2 |
Idaho | 2 |
New Mexico | 2 |
United Kingdom (England) | 2 |
Arizona (Phoenix) | 1 |
Brunei | 1 |
Delaware | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…
Descriptors: Construct Validity, Test Theory, Test Use, Affordances
Brickell, Henry M. – Journal of MultiDisciplinary Evaluation, 2011
Evaluators use their eyes to see what is there, whether it is intended or not. But they use their test instruments to measure what is intended, whether it is there or not. Evaluators have been broadening their repertoire of instruments for years: curriculum-embedded tests, observer checklists, audiotape recorders, videotape recorders, unobtrusive…
Descriptors: Evaluators, Situational Tests, Criterion Referenced Tests, Videotape Recorders
Airola, Denise Tobin – ProQuest LLC, 2011
Changes to state tests impact the ability of State Education Agencies (SEAs) to monitor change in performance over time. The purpose of this study was to evaluate the Standardized Performance Growth Index (PGIz), a proposed statistical model for measuring change in student and school performance, across transitions in tests. The PGIz is a…
Descriptors: Evidence, Reference Groups, Norm Referenced Tests, Criterion Referenced Tests
Mundia, Lawrence – Journal of International Education and Leadership, 2012
The commentary and overview explored how curriculum and assessment reforms are being used by a small university and small country to improve the quality of education and gain international recognition. Although the reforms are potentially beneficial to the students, university, and country, there are dilemmatic factors that may either enhance or…
Descriptors: Foreign Countries, Universities, College Outcomes Assessment, Qualitative Research
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Levy, Roy – Measurement: Interdisciplinary Research and Perspectives, 2009
In "Unique Characteristics of Diagnostic Classification Models: A Comprehensive Review of the Current State-of-the-Art," Rupp and Templin (2008) undertake the ambitious task of providing a thorough portrait of the current state of diagnostic classification models (DCM). In this commentary, the author applauds Rupp and Templin for their…
Descriptors: Classification, Models, Evidence, Measurement
Stevens, Karmenlita L. – ProQuest LLC, 2009
The purpose of this study is to compare the teacher retention rates in public elementary and middle schools in Georgia that met or did not meet the academic performance component of Adequate Yearly Progress. The teacher retention rates were expected to be higher in schools that met the academic performance component of AYP and lower in the schools…
Descriptors: Report Cards, Middle Schools, Teacher Persistence, Educational Improvement
Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…
Descriptors: Test Items, Classification, Psychometrics, Item Response Theory
Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009
Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…
Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Xu, Zeyu; Nichols, Austin – National Center for Analysis of Longitudinal Data in Education Research, 2010
The gold standard in making causal inference on program effects is a randomized trial. Most randomization designs in education randomize classrooms or schools rather than individual students. Such "clustered randomization" designs have one principal drawback: They tend to have limited statistical power or precision. This study aims to…
Descriptors: Test Format, Reading Tests, Norm Referenced Tests, Research Design
Haberman, Shelby J. – ETS Research Report Series, 2008
In educational testing, subscores may be provided based on a portion of the items from a larger test. One consideration in evaluation of such subscores is their ability to predict a criterion score. Two limitations on prediction exist. The first, which is well known, is that the coefficient of determination for linear prediction of the criterion…
Descriptors: Scores, Validity, Educational Testing, Correlation
Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009
On a general level, the objective of diagnostic classifications models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…
Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics
Porter, John W. – Phi Delta Kappan, 1976
Michigan's superintendent of public instruction is convinced that a state testing program can provide the kind of information educators need to improve instructional planning. (Author)
Descriptors: Criterion Referenced Tests, Educational Assessment, Educational Testing, State Programs
Harsh, J. Richard – 1974
It is argued that, by design, norm-referenced tests (NRT) and criterion-referenced tests (CRT) are conceived with different frames of reference. They are not totally exclusive of each other, but they do direct attention to different uses and references for information and decision making. Their combined contributions allow a more detailed and…
Descriptors: Criterion Referenced Tests, Educational Assessment, Educational Testing, Norm Referenced Tests