ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	7

Descriptor

Educational Testing	14
Measurement	14
Testing Problems	14
Educational Assessment	8
Evaluation Methods	7
Evaluation Problems	6
Evaluation Research	5
Teacher Evaluation	5
Test Construction	5
Psychometrics	4
Achievement Tests	3
Correlation	3
Measurement Techniques	3
Test Validity	3
Academic Achievement	2
Content Validity	2
Data Analysis	2
Educational Improvement	2
Evaluation Criteria	2
Item Response Theory	2
Knowledge Base for Teaching	2
Mathematics Education	2
Mathematics Instruction	2
Pedagogical Content Knowledge	2
Reading Achievement	2
More ▼

Source

Journal of Educational…	3
Measurement:…	2
American Educational Research…	1
Educational Measurement:…	1
Online Submission	1

Publication Type

Journal Articles	7
Reports - Research	4
Opinion Papers	3
Reports - Evaluative	2
Collected Works - Proceedings	1
Guides - General	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	5
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Investigating Effect of Ignoring Hierarchical Data Structures on Accuracy of Vertical Scaling Using Mixed-Effects Rasch Model

Download full text

Wang, Shudong; Jiao, Hong; Jin, Ying; Thum, Yeow Meng – Online Submission, 2010

The vertical scales of large-scale achievement tests created by using item response theory (IRT) models are mostly based on cluster (or correlated) educational data in which students usually are clustered in certain groups or settings (classrooms or schools). While such application directly violated assumption of independent sample of person in…

Descriptors: Scaling, Achievement Tests, Data Analysis, Item Response Theory

Different Tests, Different Answers: The Stability of Teacher Value-Added Estimates across Outcome Measures

Peer reviewed

Direct link

Papay, John P. – American Educational Research Journal, 2011

Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…

Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Measurement Implications of "A Nation at Risk."

Peer reviewed

Hogan, Thomas P. – Educational Measurement: Issues and Practice, 1983

The implications of "A Nation at Risk" for the field of measurement are examined. These are a need for more frequent and varied tests; responsibility for eliminating measurement problems; and a resurgence of standardized testing at the high school level. The need for measures of teaching quality will increase. (DWH)

Descriptors: Educational Improvement, Educational Testing, Futures (of Society), Measurement

Test Anxiety: Situationally Specific or General?

Download full text

Tobias, Sigmund; Hedl, John J., Jr. – 1972

This paper reports two experiments whose purpose was to relate two bodies of research on anxiety: test and trait-state anxiety. It was reasoned that state anxiety measures obtained in an evaluation testing condition should be more similar to test anxiety than state anxiety measures obtained in non-evaluative situations, such as a game in Study I…

Descriptors: Anxiety, Behavioral Science Research, College Students, Educational Testing

Validating the MKT Measures: Some Responses to the Commentaries

Peer reviewed

Direct link

Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007

The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…

Descriptors: Educational Testing, Tests, Measurement, Faculty Development

Generalizability and Specificity of Interpretive Arguments: Observations Inspired by the Commentaries

Peer reviewed

Direct link

Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007

In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…

Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement

Evaluating Compensatory Education Program Test Results Using Each Compensatory Teachers' Pupils as Subgroups for Analysis.

Download full text

Connecticut State Dept. of Education, Hartford. Bureau of Evaluation and Educational Services. – 1975

The practice of analyzing all available project children in as large a group as possible is considered not to be justifiable when distinct subgroups of pupils are represented. Instead, the approach suggested here determines the test score gain a pupil achieves from the beginning to the end of the year, with all of the pupil gain scores of a single…

Descriptors: Academic Achievement, Classification, Compensatory Education, Educational Assessment

Measuring What Learners Learn (With a Special Look at Performance Contracting).

Download full text

Stake, Robert E. – 1971

A discussion of performance contracting, defined as an agreement between a group offering instruction and a school needing the services, is presented. Four major hazards to direct measurement of specific learning are considered: poor statement of objectives; selection of the wrong tests; misinterpretation of test scores; and depersonalization of…

Descriptors: Accountability, Criterion Referenced Tests, Educational Objectives, Educational Testing

Proceedings of the Invitational Conference on Testing Problems (New York, New York, October 31, 1953).

Download full text

Educational Testing Service, Princeton, NJ. – 1953

Seven major topics were included in the conference proceedings: (1) Improving Evaluation of Educational Outcomes at the College Level; (2) Individual versus Group Decision Making; (3) Problems and Procedures in Profile Analysis; (4) Making Test Results Meaningful; (5) The Teaching of Educational Measurement; (6) The Interview as an Evaluation…

Descriptors: Course Content, Decision Making, Educational Benefits, Educational Improvement

Generating Outcome Measurements: Achievement and Attitudes. A Guide to Educational Outcome Measurements and Their Uses. Seminar No. 3.

Mushkin, Selma J.; Billings, Bradley B. – 1975

This guide is essentially designed as a teaching aid for those who would inform planners, officials of educational ministries, school administrators, principals, and teachers about educational outcome measurements. In outline and graphic form, the guide presents topics for discussion in a seminar dealing with how to obtain information on…

Descriptors: Academic Achievement, Affective Measures, Comparative Education, Educational Assessment

THE MLA FOREIGN LANGUAGE PROFICIENCY TESTS FOR TEACHERS AND ADVANCED STUDENTS--A PROFESSIONAL EVALUATION AND RECOMMENDATIONS FOR TEST DEVELOPMENT.

PAQUETTE, F. ANDRE; TOLLINGER, SUZANNE – 1966

THE DIRECTOR OF TESTING OF THE MODERN LANGUAGE ASSOCIATION (MLA), WITH THE ASSISTANCE OF 28 SELECTED IMPARTIAL PROFESSIONALS, PRODUCED INDIVIDUALLY AND IN TEAMS THIS CRITICAL APPRAISAL OF THE MLA FOREIGN LANGUAGE PROFICIENCY TESTS IN ORDER TO POINT OUT EXISTING DEFICIENCIES AND TO SUGGEST IMPROVEMENTS IN FUTURE TEST DEVELOPMENT. THIS HANDBOOK…

Descriptors: Achievement Tests, Advanced Students, Applied Linguistics, Cultural Background

Baldwin, Su G.	1
Billings, Bradley B.	1
Clauser, Brian E.	1
Cui, Ying	1
Dillon, Gerard F.	1
Hedl, John J., Jr.	1
Hill, Heather C.	1
Hogan, Thomas P.	1
Jiao, Hong	1
Jin, Ying	1
Leighton, Jacqueline P.	1
Margolis, Melissa J.	1
Mee, Janet	1
Mushkin, Selma J.	1
Myford, Carol M.	1
PAQUETTE, F. ANDRE	1
Papay, John P.	1
Schilling, Stephen	1
Stake, Robert E.	1
TOLLINGER, SUZANNE	1
Thum, Yeow Meng	1
Tobias, Sigmund	1
Wang, Shudong	1
Wolfe, Edward W.	1
More ▼