Poe, Mya; Oliveri, Maria Elena; Elliot, Norbert – Applied Measurement in Education, 2023
Since 1952, the "Standards for Educational and Psychological Testing" has provided criteria for developing and evaluating educational and psychological tests and testing practice. Yet, we argue that the foundations, operations, and applications in the "Standards" are no longer sufficient to meet the current U.S. testing demands…
Descriptors: Racism, Social Justice, Standards, Psychological Testing

Leighton, Jacqueline P. – Applied Measurement in Education, 2013
The Standards for Educational and Psychological Testing indicate that multiple sources of validity evidence should be used to support the interpretation of test scores. In the past decade, examinee response processes, as a source of validity evidence, have received increased attention. However, there have been relatively few methodological studies…
Descriptors: Psychological Testing, Standards, Interviews, Protocol Analysis

Sireci, Stephen G.; Robin, Frederic; Patelis, Thanos – Applied Measurement in Education, 1999
Presents a procedure for standard setting that involves the cluster analysis of test takers to discover examinee groups that are useful for envisioning marginally competent performance or defining borderline or contrasting groups. Illustrates use of the procedure with a statewide mathematics test, and concludes that cluster analysis is useful in…
Descriptors: Cluster Analysis, Mathematics Tests, Standard Setting (Scoring), Standards

Engelhard, George, Jr.; Anderson, David W. – Applied Measurement in Education, 1998
A new approach for examining the quality of judgments from standard-setting judges using a Binomial Trials Model (BTM) is presented and illustrated with 26 judges from the Georgia High School Graduation Test. Results suggest that the BTM provides information not available from other methods. (SLD)
Descriptors: Graduation Requirements, High Schools, Judges, Standard Setting (Scoring)

Mehrens, William A. – Applied Measurement in Education, 2000
Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…
Descriptors: Curriculum, Psychometrics, Reliability, Standards

Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004
A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…
Descriptors: Test Results, Student Evaluation, Scores, Testing Programs

Chang, Lei; And Others – Applied Measurement in Education, 1996
The influence of judges' knowledge on standard setting for competency tests was studied with 17 judges who took an economics teacher certification test while setting competency standards using the Angoff procedure. Judges tended to set higher standards for items they answered correctly and lower standards for items they answered incorrectly. (SLD)
Descriptors: Competence, Difficulty Level, Economics, Judges

Jaeger, Richard M. – Applied Measurement in Education, 1988
The modified caution index's use in identifying judges whose patterns of item judgment appear aberrant when compared with the pattern produced by the entire group (N=158) of judges was studied. Effects on test standards and passing rates of removing test standards of these judges were also assessed. (TJH)
Descriptors: Criterion Referenced Tests, Evaluators, Item Analysis, Mathematics Tests