Dorsey, David W.; Michaels, Hillary R. – Journal of Educational Measurement, 2022
Advances in technology have dramatically expanded our ability to create rich, complex, and effective assessments across a range of uses. Artificial Intelligence (AI) enabled assessments represent one such advance, one that has captured our collective interest and imagination. Scientists and practitioners within the domains…
Descriptors: Validity, Ethics, Artificial Intelligence, Evaluation Methods
Wilson, Mark; Gochyyev, Perman; Scalise, Kathleen – Journal of Educational Measurement, 2017
This article summarizes the assessment of cognitive skills through collaborative tasks, using field test results from the Assessment and Teaching of 21st Century Skills (ATC21S) project. The project, sponsored by Cisco, Intel, and Microsoft, aims to help educators around the world equip students with the skills to succeed in future career and…
Descriptors: Cognitive Ability, Thinking Skills, Evaluation Methods, Educational Assessment
Tendeiro, Jorge N.; Meijer, Rob R. – Journal of Educational Measurement, 2014
Recent guidelines for fair educational testing advise checking the validity of individual test scores with person-fit statistics. The existing literature, however, leaves practitioners unclear about which statistic to use. An overview of relatively simple existing nonparametric approaches to identifying atypical response…
Descriptors: Educational Assessment, Test Validity, Scores, Statistical Analysis
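Among the relatively simple nonparametric statistics in this family, a classic building block is the count of Guttman errors: with items ordered from easiest to hardest, every pair in which the harder item is answered correctly while the easier one is missed counts as an error. A minimal sketch, with illustrative names not taken from the article:

```python
def guttman_errors(responses, difficulty_order):
    """Count Guttman errors in one examinee's 0/1 response vector.

    responses: list of 0/1 item scores.
    difficulty_order: item indices sorted from easiest to hardest,
    e.g., by descending proportion correct in the norming sample.
    """
    r = [responses[i] for i in difficulty_order]
    # An error is any (easier, harder) pair with the easier item wrong
    # and the harder item right.
    return sum(
        1
        for i in range(len(r))
        for j in range(i + 1, len(r))
        if r[i] == 0 and r[j] == 1
    )

# Item 0 easiest ... item 4 hardest; two inverted pairs here.
print(guttman_errors([1, 0, 1, 1, 0], difficulty_order=[0, 1, 2, 3, 4]))  # 2
```

Larger counts flag more atypical response vectors; in practice the raw count is usually normalized before flagging.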
Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009
This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…
Descriptors: Probability, Simulation, Models, Psychometrics
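The accumulation Armstrong and Shi describe can be pictured as a standard one-sided CUSUM over per-item log-likelihood ratios. The sketch below assumes Bernoulli response probabilities under a null (fitting) and an alternative (aberrant) model; the function and parameter names are illustrative, not the article's:

```python
import math

def cusum_flag(responses, p_null, p_alt, threshold):
    """One-sided CUSUM of log-likelihood ratios for two Bernoulli models.

    responses: 0/1 item scores in administration order.
    p_null[t]: probability of a correct response to item t under normal behavior.
    p_alt[t]: the same probability under the aberrance being screened for.
    Returns the first item index where the statistic exceeds `threshold`,
    or None if it never does.
    """
    s = 0.0
    for t, (x, q0, q1) in enumerate(zip(responses, p_null, p_alt)):
        # Log-likelihood ratio of the alternative versus the null for item t.
        llr = math.log(q1 if x else 1 - q1) - math.log(q0 if x else 1 - q0)
        s = max(0.0, s + llr)  # reset at zero, the usual CUSUM recursion
        if s > threshold:
            return t
    return None
```

The threshold trades false alarms against detection speed and would be set by simulation in practice.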
Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009
In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…
Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
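For context, the Angoff procedure asks each judge to estimate, item by item, the probability that a minimally competent examinee answers correctly; a judge's cut score is the sum of those probabilities, and the panel's cut score is typically their average. A toy worked example with invented numbers:

```python
# Each row holds one judge's probability estimates for a 5-item test.
judges = [
    [0.60, 0.75, 0.50, 0.80, 0.65],
    [0.55, 0.70, 0.45, 0.85, 0.70],
]

# A judge's cut score is the sum of his or her item probabilities.
judge_cuts = [sum(row) for row in judges]      # ≈ [3.30, 3.25]
panel_cut = sum(judge_cuts) / len(judge_cuts)  # ≈ 3.275 out of 5 points
print(judge_cuts, panel_cut)
```

The examinee performance data discussed in the article would be shown to judges between rounds to recalibrate these estimates.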
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
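A toy illustration of a consistency index of this general shape, under assumed prerequisite relations among items (the published HCI's exact definition may differ in detail):

```python
def toy_consistency_index(responses, prereq):
    """Toy hierarchy-consistency-style index on [-1.0, 1.0].

    responses: list of 0/1 item scores.
    prereq[j]: items whose required attributes are a subset of item j's,
    so answering j correctly 'expects' those items to be correct too.
    """
    misfits = comparisons = 0
    for j, r_j in enumerate(responses):
        if r_j == 1:
            for k in prereq[j]:
                comparisons += 1
                if responses[k] == 0:
                    misfits += 1  # unexpected: prerequisite item missed
    if comparisons == 0:
        return 1.0
    return 1.0 - 2.0 * misfits / comparisons

# Item 2 presupposes the attributes measured by items 0 and 1.
print(toy_consistency_index([1, 0, 1], prereq={0: [], 1: [], 2: [0, 1]}))  # 0.0
```

A value near 1.0 means responses honor the assumed attribute hierarchy; values near -1.0 signal the unexpected patterns the abstract describes.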

Frisbie, David A.; Cantor, Nancy K. – Journal of Educational Measurement, 1995
Studied the validity of alternative methods for assessing the spelling achievement of students in grades 2 through 7. Results from 760 third graders, 721 fifth graders, and 639 seventh graders indicate that no single objective format stood out above the others, although some demonstrated superiority to the dictation format on several dimensions…
Descriptors: Dictation, Educational Assessment, Elementary Education, Elementary School Students

Anderson, Ronald E.; And Others – Journal of Educational Measurement, 1982
Findings on alternative procedures for evaluating measures of achievement in the individual data packages of the National Assessment of Educational Progress are presented, along with their methodological implications. The need for secondary analysts to be aware of how the data are organized is discussed, as are the positive and negative features of the packages. (Author/CM)
Descriptors: Achievement, Databases, Educational Assessment, Elementary Secondary Education

Baxter, Gail P.; And Others – Journal of Educational Measurement, 1992
A procedure-based observational scoring system and a notebook completed by students were evaluated as science assessments for 41 fifth grade students experienced in hands-on science and 55 fifth grade students inexperienced in hands-on science. Results suggest that notebooks may be a reasonable, although less reliable, surrogate for observed…
Descriptors: Classroom Observation Techniques, Comparative Analysis, Educational Assessment, Elementary School Students

Beaton, Albert E.; Johnson, Eugene G. – Journal of Educational Measurement, 1992
The National Assessment of Educational Progress (NAEP) uses scaling methods based on item response theory (IRT) to summarize information in complex data sets. The necessity of global scores versus more detailed subscores, the creation of developmental scales for different ages, and the use of scale anchoring for scale interpretation are discussed. (SLD)
Descriptors: Age Differences, Educational Assessment, Elementary Secondary Education, Evaluation Methods
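NAEP-style IRT scaling rests on item response functions such as the three-parameter logistic model, where the probability of a correct response depends on proficiency theta and item parameters for discrimination (a), difficulty (b), and guessing (c). A minimal sketch, not NAEP's actual implementation:

```python
import math

def p_correct_3pl(theta, a, b, c):
    """Three-parameter logistic item response function:
    P(theta) = c + (1 - c) / (1 + exp(-1.7 * a * (theta - b))).
    The 1.7 is the conventional scaling constant D."""
    return c + (1 - c) / (1 + math.exp(-1.7 * a * (theta - b)))

# An average examinee (theta = 0) on a moderately hard item.
print(round(p_correct_3pl(theta=0.0, a=1.0, b=0.5, c=0.2), 2))  # ~0.44
```

Scale anchoring, as discussed in the article, then attaches behavioral descriptions to selected points on the resulting proficiency scale.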

Engelhard, George, Jr. – Journal of Educational Measurement, 1994
Rater errors (rater severity, halo effect, central tendency, and restriction of range) are described, and criteria are presented for evaluating rating quality based on a many-faceted Rasch (FACETS) model. Ratings of 264 compositions from the Eighth Grade Writing Test in Georgia by 15 raters illustrate the discussion. (SLD)
Descriptors: Criteria, Educational Assessment, Elementary Education, Elementary School Students
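The many-faceted Rasch (FACETS) model referenced here is conventionally written as a rating-scale decomposition (the notation below is the common textbook form, not necessarily the article's):

\log \frac{P_{nijk}}{P_{nij(k-1)}} = \theta_n - \delta_i - \lambda_j - \tau_k

where \theta_n is the ability of examinee n, \delta_i the difficulty of task i, \lambda_j the severity of rater j, and \tau_k the threshold of rating category k. The rater-severity term \lambda_j is what indices of severity and restriction of range are probing.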

Mullis, Ina V. S. – Journal of Educational Measurement, 1992
An overview is given of the consensus process for development of the frameworks underlying the National Assessment of Educational Progress (NAEP) assessments, with emphasis on those for the 1990 and 1992 mathematics assessments, the 1992 reading assessment, and the 1994 science assessments. Innovative techniques for 1992 are described. (SLD)
Descriptors: Academic Standards, Content Validity, Educational Assessment, Elementary Secondary Education

Clauser, Brian E.; And Others – Journal of Educational Measurement, 1995
A scoring algorithm for performance assessments is described that is based on expert judgments but requires the rating of only a sample of performances. A regression-based policy capturing procedure was implemented for clinicians evaluating skills of 280 medical students. Results demonstrate the usefulness of the algorithm. (SLD)
Descriptors: Algorithms, Clinical Diagnosis, Computer Simulation, Educational Assessment
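Policy capturing of this kind fits a regression of expert ratings on scoreable performance features for the rated sample, then applies the fitted weights to score the performances no expert rated. A minimal sketch with invented features, not the article's actual variables:

```python
import numpy as np

# Rated sample: rows are performances, columns are scoreable features
# (invented here: correct actions, errors, minutes elapsed).
X_rated = np.array([[8, 1, 30.0],
                    [5, 3, 45.0],
                    [9, 0, 25.0],
                    [4, 4, 50.0]])
expert_ratings = np.array([8.5, 5.0, 9.0, 4.0])

# Capture the experts' implicit scoring policy as least-squares weights,
# with a leading column of ones for the intercept.
A = np.column_stack([np.ones(len(X_rated)), X_rated])
weights, *_ = np.linalg.lstsq(A, expert_ratings, rcond=None)

# Score a new, unrated performance with the captured policy.
new_perf = np.array([1.0, 7, 2, 35.0])  # leading 1.0 multiplies the intercept
print(float(new_perf @ weights))
```

Only the sampled performances need human ratings; the regression generalizes the experts' policy to the rest.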

Stiggins, Richard J.; And Others – Journal of Educational Measurement, 1989
Classroom assessment procedures of 36 teachers in grades 2 to 12 were studied to determine the extent to which they measure students' higher order thinking skills in mathematics, science, social studies, and language arts. A striking finding was the absence of evaluation of comparative and evaluative thinking. (SLD)
Descriptors: Classroom Techniques, Cognitive Processes, Educational Assessment, Elementary Secondary Education