Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Johnson, Evelyn S.; Crawford, Angela; Moylan, Laura A.; Zheng, Yuzhu – Educational Measurement: Issues and Practice, 2018
The evidence-centered design framework was used to create a special education teacher observation system, Recognizing Effective Special Education Teachers. Extensive reviews of research informed the domain analysis and modeling stages, and led to the conceptual framework in which effective special education teaching is operationalized as the…
Descriptors: Evidence Based Practice, Special Education Teachers, Observation, Disabilities
Pommerich, Mary – Educational Measurement: Issues and Practice, 2012
Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…
Descriptors: Testing, Scores, Measurement, Test Construction

Diamond, Esther E.; Fremer, John – Educational Measurement: Issues and Practice, 1989
The Joint Committee on Testing Practices has completed the "Code of Fair Testing Practices in Education," which is meant for the public and focuses on the proper use of tests in education--admissions, educational assessment and diagnosis, and student placement. The Code separately addresses test developers' and users' roles. (SLD)
Descriptors: Educational Testing, Evaluation Utilization, Examiners, Scoring

Quellmalz, Edys S. – Educational Measurement: Issues and Practice, 1984
A summary of the writing assessment programs reviewed in this journal is presented. The problems inherent in the programs are outlined. A coordinated research program on major problems in writing assessment is proposed as being beneficial and cost-effective. (DWH)
Descriptors: Essay Tests, Program Evaluation, Scoring, State Programs

Cangelosi, James S. – Educational Measurement: Issues and Practice, 1984
Test development procedures and six methods for determining cut-off scores are briefly described. An alternate method, appropriate when the test developer also determines the cut-off score, is suggested. Unlike other methods, the standard is set during the test development stage. Its computations are intelligible to nonstatistically-oriented…
Descriptors: Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education, Error of Measurement

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1995
A newly developed performance standard-setting procedure, termed iterative judgmental policy capturing (JPC), is applicable to assessments composed of distinct multidimensional exercises. The procedure is described, and results are reported from the application of JPC in a study involving a panel of 20 teachers and 6 performance exercises. (SLD)
Descriptors: Decision Making, Educational Assessment, Licensing Examinations (Professions), Multidimensional Scaling

Solano-Flores, Guillermo; Shavelson, Richard J. – Educational Measurement: Issues and Practice, 1997
Conceptual, practical, and logistical issues in the development of science performance assessments (SPAs) are discussed. The conceptual framework identifies task, response format, and scoring system as components, and conceives of SPAs as tasks that attempt to recreate conditions in which scientists work. Developing SPAs is a sophisticated effort…
Descriptors: Elementary Secondary Education, Performance Based Assessment, Science Education, Science Tests

Linn, Robert L.; Burton, Elizabeth – Educational Measurement: Issues and Practice, 1994
Generalizability of performance-based assessment scores across raters and tasks is examined, focusing on implications of generalizability analyses for specific uses and interpretations of assessment results. Although it seems probable that assessment conditions, task characteristics, and interactions with instructional experiences affect the…
Descriptors: Educational Assessment, Educational Experience, Generalizability Theory, Interaction

Haladyna, Thomas M. – Educational Measurement: Issues and Practice, 1992
Context-dependent item sets, containing a subset of test items related to a passage or stimulus, are discussed. A brief review of methods for developing item sets reveals their potential for measuring high-level thinking. Theories and technologies for scoring item sets remain largely experimental. Research needs are discussed. (SLD)
Descriptors: Cognitive Tests, Educational Technology, Licensing Examinations (Professions), Problem Solving

Roeber, Edward D. – Educational Measurement: Issues and Practice, 1984
In every instance in the process of constructing and using a test, the microcomputer can aid the classroom teacher. However, the teacher will not apply the microcomputer to classroom testing without added training both in classroom testing and in using the microcomputer. (BW)
Descriptors: Computer Assisted Testing, Educational Testing, Elementary Secondary Education, Item Analysis

Fisher, Thomas M.; Smith, Julia – Educational Measurement: Issues and Practice, 1991
Incidents affecting the implementation of large-scale testing programs are described to illustrate associated problems. Issues addressed include creation of test materials, preparation of answer documents, transportation of test materials, scoring and analysis of tests, and dissemination and utilization of test results. (TJH)
Descriptors: Answer Keys, Computer Assisted Testing, Information Dissemination, Program Implementation

Fisher, Thomas H.; And Others – Educational Measurement: Issues and Practice, 1985
The new Florida Master Teacher Program in which the state provides bonuses directly to qualified teachers is described. Three measurement issues in implementing the program are discussed: (1) evaluating a teacher's classroom performance; (2) evaluating a teacher's subject area knowledge; and (3) combining scores to determine which teachers…
Descriptors: Elementary Secondary Education, Incentives, Job Performance, Merit Pay

Frisbie, David A. – Educational Measurement: Issues and Practice, 1992
Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)
Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests